PI0026236 (gene) Melon (PI 482460) v1

Overview
NamePI0026236
Typegene
OrganismCucumis metuliferus (Melon (PI 482460) v1)
DescriptionProcollagen-proline 4-dioxygenase
Locationchr11: 4559496 .. 4563540 (-)
RNA-Seq ExpressionPI0026236
SyntenyPI0026236
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCGCGCGCGCGCGCTCTTTCTAATTTGATCCGATCGAGACTATGTTCAAATTTCGTAATCTGTTATTCATCTTCTTGATTTTGATCTCATCGTTTGTTCGGGAATCAACTTGTTCTTATGCTGGTTCGGCTAGCGCAACCGTAGATCCAAGTAAAGTGAAGCAGATTTCATGGAAACCAAGGTTTTTGTCGCTATCATTTTCAATTGCACGGATCTTCGTTTTGTTTTCTTCTTCGTTCTATTGTGTTTGTTTGCTGAGAAAGTAGTGGAGCGGAAATTATTGAACTCTAACTTGCTTTAGCTTCCAACTTCCGGTCGTTTAGGTTTAGTTCAGTGTCTATTGATAGTAATTTATGCTTTTCAAATTCTTGATAAGGCGATTTCACTGAATGTTGCAGAGCTTTTGTATATGAAGGTTTTCTAACGGACCTAGAATGCGACCATCTGGTTTCTATAGTAAGTTTAGATTTAGTGATATTCATTATTCAGTTTGATATGGAAGTCATTTCGTTTCGTCATTGTAAATCTATTGTTGAAGGATTGTTTTTCTTCTGCGACAGGCGAGATCCGAGCTAAAGAGATCTGAGGTTGCTGATAATGATTCAGGAAAGAGCAAGCTCAGTACTGTACGAACGAGTTCAGGAATGTTCATTTCTAAAAACAAGGTTAGGTTGAATCTTCGCACTAACAATCTTCTAAGTTTTCTGGATGTTTTGTCTAATTTCTCTTTCGATTTTTGTTTTTGATCGGAATCAAGCTTCTGTTACTAGTCGTCACCGAAGCTATTTATTGAATTAAACTAAATGGAAGAATCTTCTCCGTTGCGAAACTGAAATATATTGCCTTCTCAAAGAGGTTAAACAATCAAATTACGGCAGGAATGTTCCTGATACATTGATGCAATAATCAAAATTAAAGAGTAAACAATCTTTCAGGAATAATAACTTAAAACCTCCTCCTCTTCTAACTTGTTGAATTAGGACTAGATTAACAAGTTTGAGCGGTTAGGTTTCAATATATTTAATATACAGTGTATTTTGTGCTCTTCCCCTGTATCTTAAAATTGATGTATATGGAAGGACAAATAATTATTTCCTTTGGTTCTGAGTATCAGTTAAGATCTTTACACCCATGTTTGTATCGAACATTTACATTTTTATTTGATATTTAGGATCCTATTGTTTCTGGCATAGAGGACAAAATTTCTGCATGGACTTTTCTTCCAAAAGGTACACGATTTGTTTCACTGTCAGATCACATATTTTGTCATTGATATGTTATGTCATATGCATTTGCATTACTGCAGCTGATTGCCATAAGTAACATTCATGGTTTCTTTATAATGAAATGGTTGTCAATGTCGACTATCACTGCTAAATGCCGTGTGTATTTCTCGTTTTTCAGAAAATGGGGAGGATATACAGGTATTGAGATATGAGCATGGGCAGAAATATGAGTCACATTATGATTACTTTGTTGACAAGGTGAATATTGCCTGGGGTGGACATCGTTTAGCTACAGTCCTTATGTATCTCTCTAATGTGACCAAAGGCGGTGAAACAGTTTTCCCCTTGGCAGAGGTTAGTTCAAGCGTAATTCACAAACTACTTTACTAGAACTACACTTTATATTCCTCATTGTAAGACTACCTCCTTTTAGATCTCAGTAGTCTCTAAATAATAATCATACAACTCTTGTCTTCTGATGGAGAAGATCCCTTTATAAGACTCTGATACGTGGAAATATTGAAATCAAACATGAGAAAGACTCGATGTTGGATAATACAGAGGATGTTGCCATTATTTTATAGGAATTACGACTGGAAATGACCTGATTTTTGTTCAGGTTAATGATACTGAACCCGTTTACCCGATTTGAACCATTTTCTTGTTTATTTTTCTTCTTTTTGGATTTTGTTTTTCAATCAAAATGTCTATCAATTGATCTATATTCATTATGGATGCAATACAATCCAGCTTAAAGGGATTTAGCACTTTTTGCTACCATCTTTTAAGTACTTCAGTTAGGACAAATGTTATTGCAGTCATTTCAAAGGACCGAAATTCACCAATTATCTAAGATTTGAACCAAACTATTGGTGAATTGGATTAGACATGAAACATCTAAGATTATTGAAGTTGTTCCTCCCTGAGAAGCCAACTTGTAACAAACTGAAATAATTAGTTATGAAGTTACATATAGATTGGAAATGAATTTGATTCAGAAATTACTATAATCAATAGCAAATCTGTTTAACCTCGTTGTATAACCTCTGAGCAGAAATCTTCCCACCGGAGAGCTTATGAAACAGACGAGGACCTCTCAGAGTGCGCAAAGAAAGGAGTTGCAGGTGAGTCACCAAACAAACTATAAATGCTTCTTTTTCAATACTTAAAAAAACTATAGAACTTTCATGTAATTGACGTGCTTTGTTATTTAACATGATTCCCAATTGGAATGATGTAGTGAAACCAAAGAAAGGCGATGCCCTTCTTTTCTTTAGCCTTGAACCAAATGCTATACCAGATACAAACAGTCTCCATGGAGGTTGCCCTGTCCTTGAAGGAGAAAAATGGTCAGCAACAAAGTGGATTCACGTAGATTCTTTCAGCAAAAACTTGGGAAACATTGGGAATTGTACTGATCAAAATGAAAGTTGTGAGAGATGGGCTGCCTTAGGGGAATGCACCAAAAACCCAGAATATATGGTTGGATCTCCAGAAATGCCCGGCTACTGTCGGAGGAGCTGCAGGATCTGTTGATTTCATCTCTAACATTACACCCAAAAATTTCGATACCTTTTGGCGGTAAACTTCTTACTCTTTGGTTGGATAGTAGTAATTAACTTGTTGTTAAGTTCCCATTTAGTAACCATTTGGTTTTTAGTTTTTGAAAATTAAGCCTATAAACACTTATTCTCCCTCTAGATTTCTTATTTTGTAATGACTATTTATTCATGTTTTAAAAAACCAAGTCAAATTTTAGAATCTGAAAGGAGTAGAGTTTTTAAAAAGGGGCATTTTCAAAAATAGCAAACTAGTGAAACTATTTACAAAATATAACAAAATTTATCAAGCAAGCTTTCTATACTTTCAATGATACATACTTTTGTTTTTTGTTTTTAATTTGGCTAAGATTTCAACTTTTGTACTTAATCATAAGAATATGAGTGGAAATAGGTTTAATTTTCAAAACTAAAAACAAAATGATTACTAGACGAGACCTAAATTATTGGCTTAAGTTTGAGAATTTATTTCCTTTTGAGTGACACTTTGATTTGTTTAGGCTTTCCTATACTTAAAAGTCGTGTTTCTTCCGGAAAGAAATAGGTTAAAATTGCAGATTTAGTCCTAATAGTTTGAAAAGAACATAAGTTAGAATTTAGTTTATGGTTTATAATTTAAATTTAGTTTTTATGGTAGGACTAATTATGAGAATTTTATCAACCTATAAAGGCTTAAGATTACAAGGATTAAATTCTAACTTTTAACTGTTCATTTTAACTTTCTTTGAATGTAGGGATCAAATTTGTAATTTAACCAAAAATATAATTACTTCGATAAAGCTTTGTCCCTCATTTCTTTATTTTTTTTTTTTTTTTTTTTTCCTTTTGTGTCATTTGGAAAGTTAGTCATTTGTGGAATGGAAAGCGCTACAACATTGATTGTAAAGCTACGGATGGATGAAGCAGTAGCCGTTGGGTAATTATGACCTTTCCTTTTTTAACTGATGATTAATGTATTACTTTATTGTCATTATTTTTCTTGTTTGATTTTTGATATGATTTTCTTATAAGAAAGAATATTGTTGTTTTAAAAGCTAAAGTTTATATACATTACAGAAAAAAAAAAGAGTAATTTTGCTGATGTGACAATGAAGTGATTAGAAAGAAGTGTTTTTAAATATAGCAAAATTTTACTTTCAATCTATGTGGTTAAATGTAGCAAAATTTTTCTTTCAATTTGTGATAAGCCAACCATGATAGGCATGATTAGTTTAACAGAAAAAAA

mRNA sequence

CTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCGCGCGCGCGCGCTCTTTCTAATTTGATCCGATCGAGACTATGTTCAAATTTCGTAATCTGTTATTCATCTTCTTGATTTTGATCTCATCGTTTGTTCGGGAATCAACTTGTTCTTATGCTGGTTCGGCTAGCGCAACCGTAGATCCAAGTAAAGTGAAGCAGATTTCATGGAAACCAAGAGCTTTTGTATATGAAGGTTTTCTAACGGACCTAGAATGCGACCATCTGGTTTCTATAGCGAGATCCGAGCTAAAGAGATCTGAGGTTGCTGATAATGATTCAGGAAAGAGCAAGCTCAGTACTGTACGAACGAGTTCAGGAATGTTCATTTCTAAAAACAAGGATCCTATTGTTTCTGGCATAGAGGACAAAATTTCTGCATGGACTTTTCTTCCAAAAGAAAATGGGGAGGATATACAGGTATTGAGATATGAGCATGGGCAGAAATATGAGTCACATTATGATTACTTTGTTGACAAGGTGAATATTGCCTGGGGTGGACATCGTTTAGCTACAGTCCTTATGTATCTCTCTAATGTGACCAAAGGCGGTGAAACAGTTTTCCCCTTGGCAGAGAAATCTTCCCACCGGAGAGCTTATGAAACAGACGAGGACCTCTCAGAGTGCGCAAAGAAAGGAGTTGCAGTGAAACCAAAGAAAGGCGATGCCCTTCTTTTCTTTAGCCTTGAACCAAATGCTATACCAGATACAAACAGTCTCCATGGAGGTTGCCCTGTCCTTGAAGGAGAAAAATGGTCAGCAACAAAGTGGATTCACGTAGATTCTTTCAGCAAAAACTTGGGAAACATTGGGAATTGTACTGATCAAAATGAAAGTTGTGAGAGATGGGCTGCCTTAGGGGAATGCACCAAAAACCCAGAATATATGGTTGGATCTCCAGAAATGCCCGGCTACTGTCGGAGGAGCTGCAGGATCTGTTGATTTCATCTCTAACATTACACCCAAAAATTTCGATACCTTTTGGCGTCATTTGTGGAATGGAAAGCGCTACAACATTGATTGTAAAGCTACGGATGGATGAAGCAGTAGCCGTTGGGTAATTATGACCTTTCCTTTTTTAACTGATGATTAATGTATTACTTTATTGTCATTATTTTTCTTGTTTGATTTTTGATATGATTTTCTTATAAGAAAGAATATTGTTGTTTTAAAAGCTAAAGTTTATATACATTACAGAAAAAAAAAAGAGTAATTTTGCTGATGTGACAATGAAGTGATTAGAAAGAAGTGTTTTTAAATATAGCAAAATTTTACTTTCAATCTATGTGGTTAAATGTAGCAAAATTTTTCTTTCAATTTGTGATAAGCCAACCATGATAGGCATGATTAGTTTAACAGAAAAAAA

Coding sequence (CDS)

ATGTTCAAATTTCGTAATCTGTTATTCATCTTCTTGATTTTGATCTCATCGTTTGTTCGGGAATCAACTTGTTCTTATGCTGGTTCGGCTAGCGCAACCGTAGATCCAAGTAAAGTGAAGCAGATTTCATGGAAACCAAGAGCTTTTGTATATGAAGGTTTTCTAACGGACCTAGAATGCGACCATCTGGTTTCTATAGCGAGATCCGAGCTAAAGAGATCTGAGGTTGCTGATAATGATTCAGGAAAGAGCAAGCTCAGTACTGTACGAACGAGTTCAGGAATGTTCATTTCTAAAAACAAGGATCCTATTGTTTCTGGCATAGAGGACAAAATTTCTGCATGGACTTTTCTTCCAAAAGAAAATGGGGAGGATATACAGGTATTGAGATATGAGCATGGGCAGAAATATGAGTCACATTATGATTACTTTGTTGACAAGGTGAATATTGCCTGGGGTGGACATCGTTTAGCTACAGTCCTTATGTATCTCTCTAATGTGACCAAAGGCGGTGAAACAGTTTTCCCCTTGGCAGAGAAATCTTCCCACCGGAGAGCTTATGAAACAGACGAGGACCTCTCAGAGTGCGCAAAGAAAGGAGTTGCAGTGAAACCAAAGAAAGGCGATGCCCTTCTTTTCTTTAGCCTTGAACCAAATGCTATACCAGATACAAACAGTCTCCATGGAGGTTGCCCTGTCCTTGAAGGAGAAAAATGGTCAGCAACAAAGTGGATTCACGTAGATTCTTTCAGCAAAAACTTGGGAAACATTGGGAATTGTACTGATCAAAATGAAAGTTGTGAGAGATGGGCTGCCTTAGGGGAATGCACCAAAAACCCAGAATATATGGTTGGATCTCCAGAAATGCCCGGCTACTGTCGGAGGAGCTGCAGGATCTGTTGA

Protein sequence

MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
Homology
BLAST of PI0026236 vs. ExPASy Swiss-Prot
Match: Q8LAN3 (Probable prolyl 4-hydroxylase 4 OS=Arabidopsis thaliana OX=3702 GN=P4H4 PE=2 SV=1)

HSP 1 Score: 458.4 bits (1178), Expect = 6.3e-128
Identity = 220/296 (74.32%), Postives = 249/296 (84.12%), Query Frame = 0

Query: 5   RNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLV 64
           R LL  F  + S  ++ ST S   S+S  V+PSKVKQ+S KPRAFVYEGFLT+LECDH+V
Sbjct: 4   RGLLISFFAIFSVLLQSST-SLISSSSVFVNPSKVKQVSSKPRAFVYEGFLTELECDHMV 63

Query: 65  SIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENGE 124
           S+A++ LKRS VADNDSG+SK S VRTSSG FISK KDPIVSGIEDKIS WTFLPKENGE
Sbjct: 64  SLAKASLKRSAVADNDSGESKFSEVRTSSGTFISKGKDPIVSGIEDKISTWTFLPKENGE 123

Query: 125 DIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHR 184
           DIQVLRYEHGQKY++H+DYF DKVNI  GGHR+AT+LMYLSNVTKGGETVFP AE  S R
Sbjct: 124 DIQVLRYEHGQKYDAHFDYFHDKVNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRR 183

Query: 185 RAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKW 244
              E  EDLS+CAK+G+AVKP+KGDALLFF+L P+AIPD  SLHGGCPV+EGEKWSATKW
Sbjct: 184 VLSENKEDLSDCAKRGIAVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKWSATKW 243

Query: 245 IHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 301
           IHVDSF + +   GNCTD NESCERWA LGECTKNPEYMVG+ E+PGYCRRSC+ C
Sbjct: 244 IHVDSFDRIVTPSGNCTDMNESCERWAVLGECTKNPEYMVGTTELPGYCRRSCKAC 298

BLAST of PI0026236 vs. ExPASy Swiss-Prot
Match: F4JAU3 (Prolyl 4-hydroxylase 2 OS=Arabidopsis thaliana OX=3702 GN=P4H2 PE=1 SV=1)

HSP 1 Score: 450.3 bits (1157), Expect = 1.7e-125
Identity = 214/294 (72.79%), Postives = 253/294 (86.05%), Query Frame = 0

Query: 7   LLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSI 66
           LLF+ ++L+   ++ STC    S S+ ++PSKVKQ+S KPRAFVYEGFLTDLECDHL+S+
Sbjct: 9   LLFVAILLV--LLQSSTC-LISSPSSIINPSKVKQVSSKPRAFVYEGFLTDLECDHLISL 68

Query: 67  ARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENGEDI 126
           A+  L+RS VADND+G+S++S VRTSSG FISK KDPIVSGIEDK+S WTFLPKENGED+
Sbjct: 69  AKENLQRSAVADNDNGESQVSDVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPKENGEDL 128

Query: 127 QVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRA 186
           QVLRYEHGQKY++H+DYF DKVNIA GGHR+ATVL+YLSNVTKGGETVFP A++ S R  
Sbjct: 129 QVLRYEHGQKYDAHFDYFHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSL 188

Query: 187 YETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIH 246
            E  +DLS+CAKKG+AVKPKKG+ALLFF+L+ +AIPD  SLHGGCPV+EGEKWSATKWIH
Sbjct: 189 SENKDDLSDCAKKGIAVKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIH 248

Query: 247 VDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 301
           VDSF K L + GNCTD NESCERWA LGEC KNPEYMVG+PE+PG CRRSC+ C
Sbjct: 249 VDSFDKILTHDGNCTDVNESCERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 299

BLAST of PI0026236 vs. ExPASy Swiss-Prot
Match: Q8L970 (Probable prolyl 4-hydroxylase 7 OS=Arabidopsis thaliana OX=3702 GN=P4H7 PE=2 SV=1)

HSP 1 Score: 364.0 bits (933), Expect = 1.6e-99
Identity = 178/308 (57.79%), Postives = 227/308 (73.70%), Query Frame = 0

Query: 6   NLLFIF-LILISS----FVRESTCSYAGS--------ASATVDPSKVKQISWKPRAFVYE 65
           +L F+F L LISS    F+  S+ +  GS        +S   DP++V Q+SW PR F+YE
Sbjct: 10  SLCFLFTLPLISSAPNRFLTRSSNTRDGSVIKMKTSASSFGFDPTRVTQLSWTPRVFLYE 69

Query: 66  GFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKI 125
           GFL+D ECDH + +A+ +L++S VADNDSG+S  S VRTSSGMF+SK +D IVS +E K+
Sbjct: 70  GFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGMFLSKRQDDIVSNVEAKL 129

Query: 126 SAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGE 185
           +AWTFLP+ENGE +Q+L YE+GQKYE H+DYF D+ N+  GGHR+ATVLMYLSNV KGGE
Sbjct: 130 AAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRIATVLMYLSNVEKGGE 189

Query: 186 TVFPLAEKSSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCP 245
           TVFP+ +  + +     D+  +ECAK+G AVKP+KGDALLFF+L PNA  D+NSLHG CP
Sbjct: 190 TVFPMWKGKATQL---KDDSWTECAKQGYAVKPRKGDALLFFNLHPNATTDSNSLHGSCP 249

Query: 246 VLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGY 301
           V+EGEKWSAT+WIHV SF +       C D+N SCE+WA  GEC KNP YMVGS +  GY
Sbjct: 250 VVEGEKWSATRWIHVKSFERAFNKQSGCMDENVSCEKWAKAGECQKNPTYMVGSDKDHGY 309

BLAST of PI0026236 vs. ExPASy Swiss-Prot
Match: F4J0A8 (Probable prolyl 4-hydroxylase 6 OS=Arabidopsis thaliana OX=3702 GN=P4H6 PE=2 SV=1)

HSP 1 Score: 341.7 bits (875), Expect = 8.7e-93
Identity = 163/277 (58.84%), Postives = 207/277 (74.73%), Query Frame = 0

Query: 25  SYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRS-EVADNDSGK 84
           S   S S +VDP+++ Q+SW PRAF+Y+GFL+D ECDHL+ +A+ +L++S  VAD DSG+
Sbjct: 18  SQISSFSFSVDPTRITQLSWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGE 77

Query: 85  SKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDY 144
           S+ S VRTSSGMF++K +D IV+ +E K++AWTFLP+ENGE +Q+L YE+GQKY+ H+DY
Sbjct: 78  SEDSEVRTSSGMFLTKRQDDIVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDY 137

Query: 145 FVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGVAV 204
           F DK  +  GGHR+ATVLMYLSNVTKGGETVFP       +     D+  S+CAK+G AV
Sbjct: 138 FYDKKALELGGHRIATVLMYLSNVTKGGETVFP---NWKGKTPQLKDDSWSKCAKQGYAV 197

Query: 205 KPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQ 264
           KP+KGDALLFF+L  N   D NSLHG CPV+EGEKWSAT+WIHV SF K       C D 
Sbjct: 198 KPRKGDALLFFNLHLNGTTDPNSLHGSCPVIEGEKWSATRWIHVRSFGKKK---LVCVDD 257

Query: 265 NESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 301
           +ESC+ WA  GEC KNP YMVGS    G+CR+SC+ C
Sbjct: 258 HESCQEWADAGECEKNPMYMVGSETSLGFCRKSCKAC 288

BLAST of PI0026236 vs. ExPASy Swiss-Prot
Match: Q9LN20 (Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana OX=3702 GN=P4H3 PE=2 SV=1)

HSP 1 Score: 247.3 bits (630), Expect = 2.2e-64
Identity = 114/209 (54.55%), Postives = 156/209 (74.64%), Query Frame = 0

Query: 42  ISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNK 101
           +SW+PRAFVY  FL+  EC++L+S+A+  + +S V D+++GKSK S VRTSSG F+ + +
Sbjct: 79  LSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFLRRGR 138

Query: 102 DPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVL 161
           D I+  IE +I+ +TF+P ++GE +QVL YE GQKYE HYDYFVD+ N   GG R+AT+L
Sbjct: 139 DKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQRMATML 198

Query: 162 MYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAI 221
           MYLS+V +GGETVFP A  +     +    +LSEC KKG++VKP+ GDALLF+S+ P+A 
Sbjct: 199 MYLSDVEEGGETVFPAANMNFSSVPWY--NELSECGKKGLSVKPRMGDALLFWSMRPDAT 258

Query: 222 PDTNSLHGGCPVLEGEKWSATKWIHVDSF 251
            D  SLHGGCPV+ G KWS+TKW+HV  +
Sbjct: 259 LDPTSLHGGCPVIRGNKWSSTKWMHVGEY 285

BLAST of PI0026236 vs. ExPASy TrEMBL
Match: A0A1S3C816 (Procollagen-proline 4-dioxygenase OS=Cucumis melo OX=3656 GN=LOC103497901 PE=3 SV=1)

HSP 1 Score: 612.1 bits (1577), Expect = 1.3e-171
Identity = 295/300 (98.33%), Postives = 298/300 (99.33%), Query Frame = 0

Query: 1   MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           MFKFRNLLF FLILISSFVRESTCSYAGSASATVDPS+VKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK 120
           DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK
Sbjct: 61  DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEK 180
           ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEK
Sbjct: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEK 180

Query: 181 SSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240
           SSHRRAYETDEDLSECAKKG+AVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS
Sbjct: 181 SSHRRAYETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300
           ATKWIHVDSFSKNLG+IGNCTD NESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
Sbjct: 241 ATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300

BLAST of PI0026236 vs. ExPASy TrEMBL
Match: A0A5A7SVW6 (Procollagen-proline 4-dioxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold111G001080 PE=3 SV=1)

HSP 1 Score: 612.1 bits (1577), Expect = 1.3e-171
Identity = 295/300 (98.33%), Postives = 298/300 (99.33%), Query Frame = 0

Query: 1   MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           MFKFRNLLF FLILISSFVRESTCSYAGSASATVDPS+VKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK 120
           DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK
Sbjct: 61  DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEK 180
           ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEK
Sbjct: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEK 180

Query: 181 SSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240
           SSHRRAYETDEDLSECAKKG+AVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS
Sbjct: 181 SSHRRAYETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300
           ATKWIHVDSFSKNLG+IGNCTD NESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
Sbjct: 241 ATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300

BLAST of PI0026236 vs. ExPASy TrEMBL
Match: A0A0A0KCQ5 (Procollagen-proline 4-dioxygenase OS=Cucumis sativus OX=3659 GN=Csa_6G067350 PE=3 SV=1)

HSP 1 Score: 606.7 bits (1563), Expect = 5.3e-170
Identity = 293/300 (97.67%), Postives = 296/300 (98.67%), Query Frame = 0

Query: 1   MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           MFKF NLLFIFLIL SSF+RESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MFKFDNLLFIFLILTSSFIRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK 120
           DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK
Sbjct: 61  DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEK 180
           ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVT+GGETVFPLAEK
Sbjct: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEK 180

Query: 181 SSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240
            SHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS
Sbjct: 181 PSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300
           ATKWIHVDSFSKNLG+IGNCTD NESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
Sbjct: 241 ATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300

BLAST of PI0026236 vs. ExPASy TrEMBL
Match: A0A6J1H545 (Procollagen-proline 4-dioxygenase OS=Cucurbita moschata OX=3662 GN=LOC111460227 PE=3 SV=1)

HSP 1 Score: 579.7 bits (1493), Expect = 7.0e-162
Identity = 280/300 (93.33%), Postives = 292/300 (97.33%), Query Frame = 0

Query: 1   MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           M KFRNLLF+FLILISS VRES+CSYAGSA++TVDPSKVKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK 120
           DHLVSIARSELKRSEVADNDSG SKLSTVRTSSGMFISK+KDPIVSGIEDKI+AWTFLPK
Sbjct: 61  DHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKSKDPIVSGIEDKIAAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEK 180
           ENGEDIQVLRYEHGQKYESHYDYFVDKVNIA GGHRLATVLMYLSNVTKGGETVFPLAEK
Sbjct: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEK 180

Query: 181 SSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240
           S  RRA ETDEDLSECA++G+AVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS
Sbjct: 181 SPRRRASETDEDLSECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300
           ATKWIHVDSFSKNLGNIG+C+D NESCERWAALGECTKNPEYMVGSPE+PGYCRRSCRIC
Sbjct: 241 ATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRIC 300

BLAST of PI0026236 vs. ExPASy TrEMBL
Match: A0A6J1L4G1 (Procollagen-proline 4-dioxygenase OS=Cucurbita maxima OX=3661 GN=LOC111499059 PE=3 SV=1)

HSP 1 Score: 575.1 bits (1481), Expect = 1.7e-160
Identity = 279/300 (93.00%), Postives = 290/300 (96.67%), Query Frame = 0

Query: 1   MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           M KFR LLF+FLILISS VRES+CSYAGSA++TVDPSKVKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MSKFRYLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK 120
           DHLVSIARSELKRSEVADNDSG SKLSTVRTSSGMFISK+KDPIVSGIEDKI+AWTFLPK
Sbjct: 61  DHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKSKDPIVSGIEDKIAAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEK 180
           ENGEDIQVLRYEHGQKYESHYDYFVDKVNIA GGHRLATVLMYLSNVTKGGETVFPLAEK
Sbjct: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEK 180

Query: 181 SSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240
           S  RRA ETDEDLSECA++G+AVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS
Sbjct: 181 SPRRRASETDEDLSECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300
           ATKWIHVDSFSKNLGNIG+CTD NESCERWAALGECTKNPEYMVGS E+PGYCRRSCRIC
Sbjct: 241 ATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSAELPGYCRRSCRIC 300

BLAST of PI0026236 vs. NCBI nr
Match: XP_008458517.1 (PREDICTED: probable prolyl 4-hydroxylase 4 [Cucumis melo] >XP_008458518.1 PREDICTED: probable prolyl 4-hydroxylase 4 [Cucumis melo] >KAA0033447.1 putative prolyl 4-hydroxylase 4 [Cucumis melo var. makuwa])

HSP 1 Score: 612.1 bits (1577), Expect = 2.6e-171
Identity = 295/300 (98.33%), Postives = 298/300 (99.33%), Query Frame = 0

Query: 1   MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           MFKFRNLLF FLILISSFVRESTCSYAGSASATVDPS+VKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK 120
           DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK
Sbjct: 61  DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEK 180
           ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEK
Sbjct: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEK 180

Query: 181 SSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240
           SSHRRAYETDEDLSECAKKG+AVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS
Sbjct: 181 SSHRRAYETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300
           ATKWIHVDSFSKNLG+IGNCTD NESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
Sbjct: 241 ATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300

BLAST of PI0026236 vs. NCBI nr
Match: XP_004144967.1 (probable prolyl 4-hydroxylase 4 [Cucumis sativus] >XP_011656650.1 probable prolyl 4-hydroxylase 4 [Cucumis sativus] >KGN46177.1 hypothetical protein Csa_004844 [Cucumis sativus])

HSP 1 Score: 606.7 bits (1563), Expect = 1.1e-169
Identity = 293/300 (97.67%), Postives = 296/300 (98.67%), Query Frame = 0

Query: 1   MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           MFKF NLLFIFLIL SSF+RESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MFKFDNLLFIFLILTSSFIRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK 120
           DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK
Sbjct: 61  DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEK 180
           ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVT+GGETVFPLAEK
Sbjct: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEK 180

Query: 181 SSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240
            SHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS
Sbjct: 181 PSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300
           ATKWIHVDSFSKNLG+IGNCTD NESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
Sbjct: 241 ATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300

BLAST of PI0026236 vs. NCBI nr
Match: XP_038874583.1 (probable prolyl 4-hydroxylase 4 [Benincasa hispida])

HSP 1 Score: 605.1 bits (1559), Expect = 3.2e-169
Identity = 291/300 (97.00%), Postives = 297/300 (99.00%), Query Frame = 0

Query: 1   MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           MFKFRNLLFIFLILIS  VRESTCSYAGSAS+TVDPSKVKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MFKFRNLLFIFLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK 120
           DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISK+KDPIVSGIEDKISAWTFLPK
Sbjct: 61  DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPIVSGIEDKISAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEK 180
           ENGEDIQVLRYEHG+KYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEK
Sbjct: 121 ENGEDIQVLRYEHGEKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEK 180

Query: 181 SSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240
           S+HRRAYETDEDLSECA+KG+AVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS
Sbjct: 181 SAHRRAYETDEDLSECARKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300
           ATKWIHVDSFSKNLGNIGNCTD NESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
Sbjct: 241 ATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300

BLAST of PI0026236 vs. NCBI nr
Match: KAG6575033.1 (putative prolyl 4-hydroxylase 4, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 580.1 bits (1494), Expect = 1.1e-161
Identity = 280/300 (93.33%), Postives = 292/300 (97.33%), Query Frame = 0

Query: 1   MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           M KFRNLLF+FLILISS VRES+CSYAGSA++TVDPSKVKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK 120
           DHLVSIARSELKRSEVADNDSG SKLSTVRTSSGMFISK+KDPIVSGIEDKI+AWTFLPK
Sbjct: 61  DHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKSKDPIVSGIEDKIAAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEK 180
           ENGEDIQVLRYEHGQKYESHYDYFVDKVNIA GGHRLATVLMYLSNVTKGGETVFPLAEK
Sbjct: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEK 180

Query: 181 SSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240
           S  RRA ETDEDL+ECA++G+AVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS
Sbjct: 181 SPRRRASETDEDLTECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300
           ATKWIHVDSFSKNLGNIG+CTD NESCERWAALGECTKNPEYMVGSPE+PGYCRRSCRIC
Sbjct: 241 ATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRIC 300

BLAST of PI0026236 vs. NCBI nr
Match: XP_022959148.1 (probable prolyl 4-hydroxylase 4 [Cucurbita moschata] >XP_022959149.1 probable prolyl 4-hydroxylase 4 [Cucurbita moschata])

HSP 1 Score: 579.7 bits (1493), Expect = 1.4e-161
Identity = 280/300 (93.33%), Postives = 292/300 (97.33%), Query Frame = 0

Query: 1   MFKFRNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           M KFRNLLF+FLILISS VRES+CSYAGSA++TVDPSKVKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK 120
           DHLVSIARSELKRSEVADNDSG SKLSTVRTSSGMFISK+KDPIVSGIEDKI+AWTFLPK
Sbjct: 61  DHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKSKDPIVSGIEDKIAAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEK 180
           ENGEDIQVLRYEHGQKYESHYDYFVDKVNIA GGHRLATVLMYLSNVTKGGETVFPLAEK
Sbjct: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEK 180

Query: 181 SSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240
           S  RRA ETDEDLSECA++G+AVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS
Sbjct: 181 SPRRRASETDEDLSECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300
           ATKWIHVDSFSKNLGNIG+C+D NESCERWAALGECTKNPEYMVGSPE+PGYCRRSCRIC
Sbjct: 241 ATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRIC 300

BLAST of PI0026236 vs. TAIR 10
Match: AT5G18900.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 458.4 bits (1178), Expect = 4.5e-129
Identity = 220/296 (74.32%), Postives = 249/296 (84.12%), Query Frame = 0

Query: 5   RNLLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLV 64
           R LL  F  + S  ++ ST S   S+S  V+PSKVKQ+S KPRAFVYEGFLT+LECDH+V
Sbjct: 4   RGLLISFFAIFSVLLQSST-SLISSSSVFVNPSKVKQVSSKPRAFVYEGFLTELECDHMV 63

Query: 65  SIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENGE 124
           S+A++ LKRS VADNDSG+SK S VRTSSG FISK KDPIVSGIEDKIS WTFLPKENGE
Sbjct: 64  SLAKASLKRSAVADNDSGESKFSEVRTSSGTFISKGKDPIVSGIEDKISTWTFLPKENGE 123

Query: 125 DIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHR 184
           DIQVLRYEHGQKY++H+DYF DKVNI  GGHR+AT+LMYLSNVTKGGETVFP AE  S R
Sbjct: 124 DIQVLRYEHGQKYDAHFDYFHDKVNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRR 183

Query: 185 RAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKW 244
              E  EDLS+CAK+G+AVKP+KGDALLFF+L P+AIPD  SLHGGCPV+EGEKWSATKW
Sbjct: 184 VLSENKEDLSDCAKRGIAVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKWSATKW 243

Query: 245 IHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 301
           IHVDSF + +   GNCTD NESCERWA LGECTKNPEYMVG+ E+PGYCRRSC+ C
Sbjct: 244 IHVDSFDRIVTPSGNCTDMNESCERWAVLGECTKNPEYMVGTTELPGYCRRSCKAC 298

BLAST of PI0026236 vs. TAIR 10
Match: AT3G06300.1 (P4H isoform 2 )

HSP 1 Score: 450.3 bits (1157), Expect = 1.2e-126
Identity = 214/294 (72.79%), Postives = 253/294 (86.05%), Query Frame = 0

Query: 7   LLFIFLILISSFVRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSI 66
           LLF+ ++L+   ++ STC    S S+ ++PSKVKQ+S KPRAFVYEGFLTDLECDHL+S+
Sbjct: 9   LLFVAILLV--LLQSSTC-LISSPSSIINPSKVKQVSSKPRAFVYEGFLTDLECDHLISL 68

Query: 67  ARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENGEDI 126
           A+  L+RS VADND+G+S++S VRTSSG FISK KDPIVSGIEDK+S WTFLPKENGED+
Sbjct: 69  AKENLQRSAVADNDNGESQVSDVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPKENGEDL 128

Query: 127 QVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRA 186
           QVLRYEHGQKY++H+DYF DKVNIA GGHR+ATVL+YLSNVTKGGETVFP A++ S R  
Sbjct: 129 QVLRYEHGQKYDAHFDYFHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSL 188

Query: 187 YETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIH 246
            E  +DLS+CAKKG+AVKPKKG+ALLFF+L+ +AIPD  SLHGGCPV+EGEKWSATKWIH
Sbjct: 189 SENKDDLSDCAKKGIAVKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIH 248

Query: 247 VDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 301
           VDSF K L + GNCTD NESCERWA LGEC KNPEYMVG+PE+PG CRRSC+ C
Sbjct: 249 VDSFDKILTHDGNCTDVNESCERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 299

BLAST of PI0026236 vs. TAIR 10
Match: AT3G28480.1 (Oxoglutarate/iron-dependent oxygenase )

HSP 1 Score: 364.0 bits (933), Expect = 1.2e-100
Identity = 178/308 (57.79%), Postives = 227/308 (73.70%), Query Frame = 0

Query: 6   NLLFIF-LILISS----FVRESTCSYAGS--------ASATVDPSKVKQISWKPRAFVYE 65
           +L F+F L LISS    F+  S+ +  GS        +S   DP++V Q+SW PR F+YE
Sbjct: 10  SLCFLFTLPLISSAPNRFLTRSSNTRDGSVIKMKTSASSFGFDPTRVTQLSWTPRVFLYE 69

Query: 66  GFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKI 125
           GFL+D ECDH + +A+ +L++S VADNDSG+S  S VRTSSGMF+SK +D IVS +E K+
Sbjct: 70  GFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGMFLSKRQDDIVSNVEAKL 129

Query: 126 SAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGE 185
           +AWTFLP+ENGE +Q+L YE+GQKYE H+DYF D+ N+  GGHR+ATVLMYLSNV KGGE
Sbjct: 130 AAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRIATVLMYLSNVEKGGE 189

Query: 186 TVFPLAEKSSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCP 245
           TVFP+ +  + +     D+  +ECAK+G AVKP+KGDALLFF+L PNA  D+NSLHG CP
Sbjct: 190 TVFPMWKGKATQL---KDDSWTECAKQGYAVKPRKGDALLFFNLHPNATTDSNSLHGSCP 249

Query: 246 VLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMVGSPEMPGY 301
           V+EGEKWSAT+WIHV SF +       C D+N SCE+WA  GEC KNP YMVGS +  GY
Sbjct: 250 VVEGEKWSATRWIHVKSFERAFNKQSGCMDENVSCEKWAKAGECQKNPTYMVGSDKDHGY 309

BLAST of PI0026236 vs. TAIR 10
Match: AT3G28490.1 (Oxoglutarate/iron-dependent oxygenase )

HSP 1 Score: 341.7 bits (875), Expect = 6.1e-94
Identity = 163/277 (58.84%), Postives = 207/277 (74.73%), Query Frame = 0

Query: 25  SYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRS-EVADNDSGK 84
           S   S S +VDP+++ Q+SW PRAF+Y+GFL+D ECDHL+ +A+ +L++S  VAD DSG+
Sbjct: 18  SQISSFSFSVDPTRITQLSWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGE 77

Query: 85  SKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDY 144
           S+ S VRTSSGMF++K +D IV+ +E K++AWTFLP+ENGE +Q+L YE+GQKY+ H+DY
Sbjct: 78  SEDSEVRTSSGMFLTKRQDDIVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDY 137

Query: 145 FVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGVAV 204
           F DK  +  GGHR+ATVLMYLSNVTKGGETVFP       +     D+  S+CAK+G AV
Sbjct: 138 FYDKKALELGGHRIATVLMYLSNVTKGGETVFP---NWKGKTPQLKDDSWSKCAKQGYAV 197

Query: 205 KPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQ 264
           KP+KGDALLFF+L  N   D NSLHG CPV+EGEKWSAT+WIHV SF K       C D 
Sbjct: 198 KPRKGDALLFFNLHLNGTTDPNSLHGSCPVIEGEKWSATRWIHVRSFGKKK---LVCVDD 257

Query: 265 NESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 301
           +ESC+ WA  GEC KNP YMVGS    G+CR+SC+ C
Sbjct: 258 HESCQEWADAGECEKNPMYMVGSETSLGFCRKSCKAC 288

BLAST of PI0026236 vs. TAIR 10
Match: AT3G28480.2 (Oxoglutarate/iron-dependent oxygenase )

HSP 1 Score: 341.7 bits (875), Expect = 6.1e-94
Identity = 172/316 (54.43%), Postives = 222/316 (70.25%), Query Frame = 0

Query: 6   NLLFIF-LILISS----FVRESTCSYAGS--------ASATVDPSKVKQISWKPRAFVYE 65
           +L F+F L LISS    F+  S+ +  GS        +S   DP++V Q+SW PR F+YE
Sbjct: 10  SLCFLFTLPLISSAPNRFLTRSSNTRDGSVIKMKTSASSFGFDPTRVTQLSWTPRVFLYE 69

Query: 66  GFLTDLECDHLVSIARSELKRSEVADNDSGKS-----KLSTVRTSSGMFISKNK---DPI 125
           GFL+D ECDH + +A+ +L++S VADNDSG+S      +S VR SS    + +    D I
Sbjct: 70  GFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEDSVSVVRQSSSFIANMDSLEIDDI 129

Query: 126 VSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYL 185
           VS +E K++AWTFLP+ENGE +Q+L YE+GQKYE H+DYF D+ N+  GGHR+ATVLMYL
Sbjct: 130 VSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRIATVLMYL 189

Query: 186 SNVTKGGETVFPLAEKSSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDT 245
           SNV KGGETVFP+ +  + +     D+  +ECAK+G AVKP+KGDALLFF+L PNA  D+
Sbjct: 190 SNVEKGGETVFPMWKGKATQL---KDDSWTECAKQGYAVKPRKGDALLFFNLHPNATTDS 249

Query: 246 NSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDQNESCERWAALGECTKNPEYMV 301
           NSLHG CPV+EGEKWSAT+WIHV SF +       C D+N SCE+WA  GEC KNP YMV
Sbjct: 250 NSLHGSCPVVEGEKWSATRWIHVKSFERAFNKQSGCMDENVSCEKWAKAGECQKNPTYMV 309

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q8LAN36.3e-12874.32Probable prolyl 4-hydroxylase 4 OS=Arabidopsis thaliana OX=3702 GN=P4H4 PE=2 SV=... [more]
F4JAU31.7e-12572.79Prolyl 4-hydroxylase 2 OS=Arabidopsis thaliana OX=3702 GN=P4H2 PE=1 SV=1[more]
Q8L9701.6e-9957.79Probable prolyl 4-hydroxylase 7 OS=Arabidopsis thaliana OX=3702 GN=P4H7 PE=2 SV=... [more]
F4J0A88.7e-9358.84Probable prolyl 4-hydroxylase 6 OS=Arabidopsis thaliana OX=3702 GN=P4H6 PE=2 SV=... [more]
Q9LN202.2e-6454.55Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana OX=3702 GN=P4H3 PE=2 SV=... [more]
Match NameE-valueIdentityDescription
A0A1S3C8161.3e-17198.33Procollagen-proline 4-dioxygenase OS=Cucumis melo OX=3656 GN=LOC103497901 PE=3 S... [more]
A0A5A7SVW61.3e-17198.33Procollagen-proline 4-dioxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C2... [more]
A0A0A0KCQ55.3e-17097.67Procollagen-proline 4-dioxygenase OS=Cucumis sativus OX=3659 GN=Csa_6G067350 PE=... [more]
A0A6J1H5457.0e-16293.33Procollagen-proline 4-dioxygenase OS=Cucurbita moschata OX=3662 GN=LOC111460227 ... [more]
A0A6J1L4G11.7e-16093.00Procollagen-proline 4-dioxygenase OS=Cucurbita maxima OX=3661 GN=LOC111499059 PE... [more]
Match NameE-valueIdentityDescription
XP_008458517.12.6e-17198.33PREDICTED: probable prolyl 4-hydroxylase 4 [Cucumis melo] >XP_008458518.1 PREDIC... [more]
XP_004144967.11.1e-16997.67probable prolyl 4-hydroxylase 4 [Cucumis sativus] >XP_011656650.1 probable proly... [more]
XP_038874583.13.2e-16997.00probable prolyl 4-hydroxylase 4 [Benincasa hispida][more]
KAG6575033.11.1e-16193.33putative prolyl 4-hydroxylase 4, partial [Cucurbita argyrosperma subsp. sororia][more]
XP_022959148.11.4e-16193.33probable prolyl 4-hydroxylase 4 [Cucurbita moschata] >XP_022959149.1 probable pr... [more]
Match NameE-valueIdentityDescription
AT5G18900.14.5e-12974.322-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT3G06300.11.2e-12672.79P4H isoform 2 [more]
AT3G28480.11.2e-10057.79Oxoglutarate/iron-dependent oxygenase [more]
AT3G28490.16.1e-9458.84Oxoglutarate/iron-dependent oxygenase [more]
AT3G28480.26.1e-9454.43Oxoglutarate/iron-dependent oxygenase [more]
InterPro
Analysis Name: InterPro Annotations of Melon (PI 482460) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR006620Prolyl 4-hydroxylase, alpha subunitSMARTSM00702p4hccoord: 46..246
e-value: 3.5E-62
score: 222.6
IPR003582ShKT domainPFAMPF01549ShKcoord: 259..300
e-value: 2.1E-5
score: 24.9
IPR003582ShKT domainPROSITEPS51670SHKTcoord: 260..300
score: 8.944857
IPR044862Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domainPFAMPF136402OG-FeII_Oxy_3coord: 126..246
e-value: 5.2E-20
score: 72.2
NoneNo IPR availableGENE3D1.10.10.1940coord: 253..300
e-value: 4.3E-5
score: 25.4
NoneNo IPR availableGENE3D2.60.120.620q2cbj1_9rhob like domaincoord: 38..247
e-value: 5.3E-75
score: 253.7
NoneNo IPR availablePANTHERPTHR10869:SF175PROLYL 4-HYDROXYLASE SUBUNIT ALPHA-LIKE PROTEINcoord: 8..300
IPR045054Prolyl 4-hydroxylasePANTHERPTHR10869PROLYL 4-HYDROXYLASE ALPHA SUBUNITcoord: 8..300
IPR005123Oxoglutarate/iron-dependent dioxygenasePROSITEPS51471FE2OG_OXYcoord: 122..247
score: 12.313966

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
PI0026236.1PI0026236.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0018401 peptidyl-proline hydroxylation to 4-hydroxy-L-proline
cellular_component GO:0005789 endoplasmic reticulum membrane
molecular_function GO:0005506 iron ion binding
molecular_function GO:0031418 L-ascorbic acid binding
molecular_function GO:0004656 procollagen-proline 4-dioxygenase activity
molecular_function GO:0016705 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen