CmoCh08G011630 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh08G011630
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: Procollagen-proline 4-dioxygenase
Location: Cmo_Chr08: 7420604 .. 7424756 (+)
RNA-Seq Expression: CmoCh08G011630
Synteny: CmoCh08G011630
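
The Location field packs the chromosome, start, end, and strand into one string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the genomic span can be derived as shown here. `parse_location` is a hypothetical helper written for illustration, not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'chrom: start .. end (strand)' string into its parts."""
    m = re.match(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so span = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("Cmo_Chr08: 7420604 .. 7424756 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under a 0-based, half-open convention the length would instead be `end - start`, so the convention should be confirmed before relying on the figure.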
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ACTGGTGTGAGAGGAAGCTCTGAGAAATCCTGGATTTGGAAACCTCGAACAAATTTTCTTCCATAGTTCTCTCTCTCACTCTCTTTCTATTTTGATCCAAGCGAAATTATGTCCAGATTTCGCTCTATGTTATTCATCTTCTTGATTTCGATTGCATCGGTTGTTCGAGAATCCATTTGTTCGCCTGCTCGTTCGGCGAGCACCACCGTAGATCCTAGTAAAGTGAAGCAGATTTCATGGAAACCAAGGTTTTTAGCAGAATCGATTTCAAATGCATGTCTCTTTTTTATTTTTATTTTTTTGTTCTATTGTGTTTGTTTGCTGAGAAAGTTGTGGAACGGGAAATTATTGAACTCTTGATTGTTTTAGCTTCGAAGTTTTGGTCATCTAGGTTGAGTTCAGTGACCTTTAACTTAGTGCTGATAGTAAATTAGTAATTAAGTTATCTTCTATTTTCCAAATTTTGATTAGATGATTTCACTGATGTTGCAGAGCTTTTGTATATGAAGGTTTTCTCACGGACCTAGAATGCGACCACCTGATTTCTATTGTAAGTTAAGATTAGTGATATTCATCATTGAGTTTGATATGGAATACATACCGTCAATGTGAATCTCTGTTTTTCTTCTGCGAAAGGCGAGATCCGAGCTAAAGAGATCTGAGGTTGCTGATAATGAGTCAGGAAAGAGCAAGCTCAGTACTGTTCGAACGAGCTCAGGAATGTTCATTCCTAAGAGCAAGGTTGGTTTTGATCTCCCTACAAAATAATCTTCCAGTTTTTCTGGATATTTTCTCGACTCTCTCTTTTGATTTTTGTTTTTGATCGGAAACAAACATCCGTTACTACTCGTCATTGAAGCTATTTATTGAATTAAACTAAATGGAAGAAACTTCGCTGTTACGATACTGAAATAGATTGCCTTCTCAAAGAGGTTAAACAATCACATTACGATAGGAACGGTCCTGATAGCATGGATGCACTAATAAACATTGCCTCTTGTGTTTAAGCTTTAGATGTTAGTTTTTTATCCAGGAGGTCCTAAAACAAAATTAGGGAGAAAACAAGTTTGCAGAATCAATTACTTAAAACTTCCTTCACTCTTATCTTGTTAAATCAGGACTAGTCTCAAAAGCTTGAGCCGTTATGTTACAATTGATCTAATATCTTGTCTAGACCAGTAGGCTTTGTGCTCTTCCCACTTGTGTCTTAAAATTAATTTATGGTATGGCAATGAATACTTCCTTTGGTTCTATTAAGTAGCAGTTGAGATCTCTTATTACCCCATGTTTGTATTGCGCATTTACATCTTTATTTGATATTCAGGATGCTATTGTTTCTGGCATAGAGGATAAAATTGCTGCGTGGACTTTTCTTCCAAAAGGTATACAATTTGTTTACTATCAAATCGCATAATTCACCATTGATATGTTATGTTATATGTGAGATCTCACATCGATTGGGGAGGAGAACGAAACATTTTTTTATAAGTGTGTGGAAACCTTTCCCTAGAAGACGCGTTTTAAAACTGTGAAGCTGACAACAATACGTCAGGGCTAAAACAGACAATATTTGCTAGTGGTAGCTTAGGCTGTTTTAAATGGTATCAGAGCCAGACGTCATGTGATGTGCCAGCGAGGGCGCCGGCCCCCAAGGGAGTTGAATTGTGAGATCCCAAATTGGTTAGAGAGGAGAGCGAAACATTCCTTATAAGGGTGTGAAAACCTCTTCCTAGTAGATGTGTTTTAAAACCGTGAAGCTGACAGCGATACATAACGGGCCAAAGTGGACAATATCTGCTAGTACATTATATATATTTGAATCACTACAACTGATTTTCTATAAGTAACATTCATGATGTCTTTATACCATGGGGCAGCCAAGGAGAGTATATGTATTGGATGGATGTCATGCCCAATATTGCTGCTAAATGTGTGGTGTATTTTTCACTTTCAGAAAATGGGGAAGACATTCAGGTATTGAGATATGAGCATGGGCAGAG
ATATGAATCACACTATGATTACTTTGTTGACAAGGTGAATATTGCCTGGGGAGGACATCGTTTGGCTACTGTCCTCATGTATCTCTCTGATGTGACCAAAGGCGGTGAAACAGTTTTCCCCATGGCAGAGGTTAGTTCGAGTGTTGATTCAGAAACTACTTTACTAGCTAGAACTACACTATTATACTCCTTTATGTTCACTTTATTTTAGGGAAAGATTATTGAGTCTCTTAAGTTAAATAAGCTGGGCAGAAAATGATGATATGTGTGAGATCCCATGTCGGTTGGGGAGGAAAACGAAGCATTTTTTATAAGGGTGTAGAAACCTCCCCCTAGCAGACGCGTTTTAAAAACCTTGCGGGAAAGCTCAAAAACAAAAAAATCTGCTAGCAGTGGGCTTCTGTTACAAATGGTATTTGAGCTAGACACATGCAATGTACCAGCGAAGAGGCTGAGTCCCGAAGGGGGTGGACACGAGGCAGTGTGCCAGCAAGGACGTTGGGCTCCAAAGGAGAGTGGATTTAGGGGGTTCCACATCGATTGGAGAAGGAAACGAGTGCTAGCGAGGACACTAGACCTTGAAAGAGGGTGGGGAGGAGAATGAAACGTTCTTTATAACCTCCCCCTCCTTTACTAGCTAGAACTACTCTACGATAGTCCTCATGTTCAATTGGAAGATCACCTCTTCTTATATCTCAGTACTCTCTATATATAATAAACCTTACCACTCAGCTTTCTGATGGAAAGGCCCACTCTCTAACTAGGGTTTTTCTGGGAAGACTCAGTGTTGGATAATCCAGAGAACTTTACCATAATTTTCTAGACCTATATATTGTTCAGGAAGACGATATCAAATCCCGTTACCCCATTCAAAACTCTTTCCATGATTCTGTTTCTCAATCAAAATGTCTATGAATTGAACCATATTCATTGTGGATGCAATAGAATCCACCTTAGAGGGGCATTAATTGTTGTAGAAAATTTCAATTGCTAAGAACTGAAATTTTCGATTTTCGATTTTCGATTTTCGATTTTTGAAGTCTCCAAGATTCAAACAAAACTATTGGTGAATTGGACTACACATGAAACATCTTGGATTACTGAAACTTTTCATCAACTTTCAATAAAACTCAATAATTTGTATTAAAGTCGCATATGAATTTGTTTAACCTTGTTATATATAACCTCTGTGAGCAGAAATCTCCCCACAGGAGGGCTTCTGAAACAGACGAGGATCTCTCAGACTGTGCAAGGAAAGGAATCGCAGGTGAATTGTTACTTCTTCAGTACTTATCAAACTATAAAACTTGCATGAAATCGATTTGCTTTATCTCATTTTATTCCCATTTGGAACGGTGTAGTGAAACCAAAGAAAGGCGATGCCCTTCTCTTCTTTAGCCTTGAACCAAATGCCATCCCGGACACCAAAAGTCTACATGGAGGTTGCCCTGTTCTTGAAGGAGAAAAATGGTCGGCAACAAAGTGGATTCACGTAGATTCTTTCAGCAAAAACTTAGCAAACGTTGGGAACTGCACTGATCTAAATGAAAGCTGTGAGAGATGGGCCGCCTTAGGGGAATGCACCAAAAACCCAGAGTACATGGTCGGATCTCCAGAGCTTCCTGGCTACTGTAGGAGGAGTTGCAGGACCTGTTGATCTCAACAGTATGTTCTTTCCACTCATATATTTCCATACCTTTTATTTGGTGAATATTAATTGTAGTATTTTTAAATTATTGGGTTCGTTTGAAATTTCCTTTCCTTTTGAGTGGCACTTTGCTTTGTTTCCCATAGTGAGGTATTGTTTCTTCCAAAAGGAAATACATAATGACTTAGATAAAGTTGTCACTCAGTCATGAATTCTTAACCTTTTATTTTTTGGTGTCGTTTGGCACGTCAGTCGTTGTGGAATGAAAAGCACTTATTGCAGTCTGAAGCACGGAAGCATAAAGCAGTAGCCTTGGTAATTATAACCCTTTCTTTTTTCCTTTCGTTTTTCA
ATTGATAAGACTACCTTTATTATGTTCATTTTTCTTATAGGAGTATTTTAGCTTTAAAAGCTATAGTTTATCTTTATCTACATTATAGGAAACAGTAATTTGGCTTATGTAGCAACGAAATAGTTAGGGAGGGACTTTTTGAAGTGATTACGG

mRNA sequence

ACTGGTGTGAGAGGAAGCTCTGAGAAATCCTGGATTTGGAAACCTCGAACAAATTTTCTTCCATAGTTCTCTCTCTCACTCTCTTTCTATTTTGATCCAAGCGAAATTATGTCCAGATTTCGCTCTATGTTATTCATCTTCTTGATTTCGATTGCATCGGTTGTTCGAGAATCCATTTGTTCGCCTGCTCGTTCGGCGAGCACCACCGTAGATCCTAGTAAAGTGAAGCAGATTTCATGGAAACCAAGAGCTTTTGTATATGAAGGTTTTCTCACGGACCTAGAATGCGACCACCTGATTTCTATTGCGAGATCCGAGCTAAAGAGATCTGAGGTTGCTGATAATGAGTCAGGAAAGAGCAAGCTCAGTACTGTTCGAACGAGCTCAGGAATGTTCATTCCTAAGAGCAAGGATGCTATTGTTTCTGGCATAGAGGATAAAATTGCTGCGTGGACTTTTCTTCCAAAAGAAAATGGGGAAGACATTCAGGTATTGAGATATGAGCATGGGCAGAGATATGAATCACACTATGATTACTTTGTTGACAAGGTGAATATTGCCTGGGGAGGACATCGTTTGGCTACTGTCCTCATGTATCTCTCTGATGTGACCAAAGGCGGTGAAACAGTTTTCCCCATGGCAGAGAAATCTCCCCACAGGAGGGCTTCTGAAACAGACGAGGATCTCTCAGACTGTGCAAGGAAAGGAATCGCAGTGAAACCAAAGAAAGGCGATGCCCTTCTCTTCTTTAGCCTTGAACCAAATGCCATCCCGGACACCAAAAGTCTACATGGAGGTTGCCCTGTTCTTGAAGGAGAAAAATGGTCGGCAACAAAGTGGATTCACGTAGATTCTTTCAGCAAAAACTTAGCAAACGTTGGGAACTGCACTGATCTAAATGAAAGCTGTGAGAGATGGGCCGCCTTAGGGGAATGCACCAAAAACCCAGAGTACATGGTCGGATCTCCAGAGCTTCCTGGCTACTGTAGGAGGAGTTGCAGGACCTGTTGATCTCAACATCGTTGTGGAATGAAAAGCACTTATTGCAGTCTGAAGCACGGAAGCATAAAGCAGTAGCCTTGGTAATTATAACCCTTTCTTTTTTCCTTTCGTTTTTCAATTGATAAGACTACCTTTATTATGTTCATTTTTCTTATAGGAGTATTTTAGCTTTAAAAGCTATAGTTTATCTTTATCTACATTATAGGAAACAGTAATTTGGCTTATGTAGCAACGAAATAGTTAGGGAGGGACTTTTTGAAGTGATTACGG

Coding sequence (CDS)

ATGTCCAGATTTCGCTCTATGTTATTCATCTTCTTGATTTCGATTGCATCGGTTGTTCGAGAATCCATTTGTTCGCCTGCTCGTTCGGCGAGCACCACCGTAGATCCTAGTAAAGTGAAGCAGATTTCATGGAAACCAAGAGCTTTTGTATATGAAGGTTTTCTCACGGACCTAGAATGCGACCACCTGATTTCTATTGCGAGATCCGAGCTAAAGAGATCTGAGGTTGCTGATAATGAGTCAGGAAAGAGCAAGCTCAGTACTGTTCGAACGAGCTCAGGAATGTTCATTCCTAAGAGCAAGGATGCTATTGTTTCTGGCATAGAGGATAAAATTGCTGCGTGGACTTTTCTTCCAAAAGAAAATGGGGAAGACATTCAGGTATTGAGATATGAGCATGGGCAGAGATATGAATCACACTATGATTACTTTGTTGACAAGGTGAATATTGCCTGGGGAGGACATCGTTTGGCTACTGTCCTCATGTATCTCTCTGATGTGACCAAAGGCGGTGAAACAGTTTTCCCCATGGCAGAGAAATCTCCCCACAGGAGGGCTTCTGAAACAGACGAGGATCTCTCAGACTGTGCAAGGAAAGGAATCGCAGTGAAACCAAAGAAAGGCGATGCCCTTCTCTTCTTTAGCCTTGAACCAAATGCCATCCCGGACACCAAAAGTCTACATGGAGGTTGCCCTGTTCTTGAAGGAGAAAAATGGTCGGCAACAAAGTGGATTCACGTAGATTCTTTCAGCAAAAACTTAGCAAACGTTGGGAACTGCACTGATCTAAATGAAAGCTGTGAGAGATGGGCCGCCTTAGGGGAATGCACCAAAAACCCAGAGTACATGGTCGGATCTCCAGAGCTTCCTGGCTACTGTAGGAGGAGTTGCAGGACCTGTTGA

Protein sequence

MSRFRSMLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWSATKWIHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC
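
The protein sequence above corresponds to a conceptual translation of the CDS. The sketch below shows one way to check that relationship, assuming the standard genetic code applies; `translate` and `CODON_TABLE` are illustrative names, not the pipeline that produced this annotation.

```python
# Standard genetic code, built from the conventional TCAG ordering of bases.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First eight codons of the CDS listed above.
print(translate("ATGTCCAGATTTCGCTCTATGTTA"))  # MSRFRSML
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence gives a quick consistency check between the two records.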
Homology
BLAST of CmoCh08G011630 vs. ExPASy Swiss-Prot
Match: Q8LAN3 (Probable prolyl 4-hydroxylase 4 OS=Arabidopsis thaliana OX=3702 GN=P4H4 PE=2 SV=1)

HSP 1 Score: 451.4 bits (1160), Expect = 7.8e-126
Identity = 215/296 (72.64%), Positives = 248/296 (83.78%), Query Frame = 0

Query: 5   RSMLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLI 64
           R  L I   +I SV+ +S  S   S+S  V+PSKVKQ+S KPRAFVYEGFLT+LECDH++
Sbjct: 3   RRGLLISFFAIFSVLLQSSTSLISSSSVFVNPSKVKQVSSKPRAFVYEGFLTELECDHMV 62

Query: 65  SIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPKENGE 124
           S+A++ LKRS VADN+SG+SK S VRTSSG FI K KD IVSGIEDKI+ WTFLPKENGE
Sbjct: 63  SLAKASLKRSAVADNDSGESKFSEVRTSSGTFISKGKDPIVSGIEDKISTWTFLPKENGE 122

Query: 125 DIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEKSPHR 184
           DIQVLRYEHGQ+Y++H+DYF DKVNI  GGHR+AT+LMYLS+VTKGGETVFP AE    R
Sbjct: 123 DIQVLRYEHGQKYDAHFDYFHDKVNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRR 182

Query: 185 RASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWSATKW 244
             SE  EDLSDCA++GIAVKP+KGDALLFF+L P+AIPD  SLHGGCPV+EGEKWSATKW
Sbjct: 183 VLSENKEDLSDCAKRGIAVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKWSATKW 242

Query: 245 IHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC 301
           IHVDSF + +   GNCTD+NESCERWA LGECTKNPEYMVG+ ELPGYCRRSC+ C
Sbjct: 243 IHVDSFDRIVTPSGNCTDMNESCERWAVLGECTKNPEYMVGTTELPGYCRRSCKAC 298
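
The percentages in each HSP summary line are the matched-residue counts divided by the alignment length. As a minimal sketch, the helper below recomputes them from the fractions; `hsp_percentages` is a hypothetical name, and the regular expression only assumes the `label = n/m` layout seen in the summary lines on this page.

```python
import re

def hsp_percentages(line):
    """Recompute each 'label = n/m' fraction in an HSP summary line as a percentage."""
    result = {}
    for label, matched, length in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", line):
        result[label] = round(100 * int(matched) / int(length), 2)
    return result

stats = hsp_percentages(
    "Identity = 215/296 (72.64%), Positives = 248/296 (83.78%), Query Frame = 0"
)
print(stats)
```

Fields without a fraction, such as the query frame, are ignored by the pattern.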

BLAST of CmoCh08G011630 vs. ExPASy Swiss-Prot
Match: F4JAU3 (Prolyl 4-hydroxylase 2 OS=Arabidopsis thaliana OX=3702 GN=P4H2 PE=1 SV=1)

HSP 1 Score: 442.6 bits (1137), Expect = 3.6e-123
Identity = 210/300 (70.00%), Positives = 252/300 (84.00%), Query Frame = 0

Query: 1   MSRFRSMLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           MS  R  L +F+  +  +++ S C    S S+ ++PSKVKQ+S KPRAFVYEGFLTDLEC
Sbjct: 1   MSMSRLGLLLFVAILLVLLQSSTCL-ISSPSSIINPSKVKQVSSKPRAFVYEGFLTDLEC 60

Query: 61  DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK 120
           DHLIS+A+  L+RS VADN++G+S++S VRTSSG FI K KD IVSGIEDK++ WTFLPK
Sbjct: 61  DHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK 180
           ENGED+QVLRYEHGQ+Y++H+DYF DKVNIA GGHR+ATVL+YLS+VTKGGETVFP A++
Sbjct: 121 ENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQE 180

Query: 181 SPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWS 240
              R  SE  +DLSDCA+KGIAVKPKKG+ALLFF+L+ +AIPD  SLHGGCPV+EGEKWS
Sbjct: 181 FSRRSLSENKDDLSDCAKKGIAVKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWS 240

Query: 241 ATKWIHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC 300
           ATKWIHVDSF K L + GNCTD+NESCERWA LGEC KNPEYMVG+PE+PG CRRSC+ C
Sbjct: 241 ATKWIHVDSFDKILTHDGNCTDVNESCERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 299

BLAST of CmoCh08G011630 vs. ExPASy Swiss-Prot
Match: Q8L970 (Probable prolyl 4-hydroxylase 7 OS=Arabidopsis thaliana OX=3702 GN=P4H7 PE=2 SV=1)

HSP 1 Score: 354.4 bits (908), Expect = 1.3e-96
Identity = 162/272 (59.56%), Positives = 207/272 (76.10%), Query Frame = 0

Query: 29  SASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLST 88
           ++S   DP++V Q+SW PR F+YEGFL+D ECDH I +A+ +L++S VADN+SG+S  S 
Sbjct: 46  ASSFGFDPTRVTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE 105

Query: 89  VRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKV 148
           VRTSSGMF+ K +D IVS +E K+AAWTFLP+ENGE +Q+L YE+GQ+YE H+DYF D+ 
Sbjct: 106 VRTSSGMFLSKRQDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQA 165

Query: 149 NIAWGGHRLATVLMYLSDVTKGGETVFPMAEKSPHRRASETDEDLSDCARKGIAVKPKKG 208
           N+  GGHR+ATVLMYLS+V KGGETVFPM +    +     D+  ++CA++G AVKP+KG
Sbjct: 166 NLELGGHRIATVLMYLSNVEKGGETVFPMWK---GKATQLKDDSWTECAKQGYAVKPRKG 225

Query: 209 DALLFFSLEPNAIPDTKSLHGGCPVLEGEKWSATKWIHVDSFSKNLANVGNCTDLNESCE 268
           DALLFF+L PNA  D+ SLHG CPV+EGEKWSAT+WIHV SF +       C D N SCE
Sbjct: 226 DALLFFNLHPNATTDSNSLHGSCPVVEGEKWSATRWIHVKSFERAFNKQSGCMDENVSCE 285

Query: 269 RWAALGECTKNPEYMVGSPELPGYCRRSCRTC 301
           +WA  GEC KNP YMVGS +  GYCR+SC+ C
Sbjct: 286 KWAKAGECQKNPTYMVGSDKDHGYCRKSCKAC 314

BLAST of CmoCh08G011630 vs. ExPASy Swiss-Prot
Match: F4J0A8 (Probable prolyl 4-hydroxylase 6 OS=Arabidopsis thaliana OX=3702 GN=P4H6 PE=2 SV=1)

HSP 1 Score: 335.1 bits (858), Expect = 8.1e-91
Identity = 163/280 (58.21%), Positives = 208/280 (74.29%), Query Frame = 0

Query: 23  ICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRS-EVADNES 82
           I S   S S +VDP+++ Q+SW PRAF+Y+GFL+D ECDHLI +A+ +L++S  VAD +S
Sbjct: 16  IFSQISSFSFSVDPTRITQLSWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDS 75

Query: 83  GKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHY 142
           G+S+ S VRTSSGMF+ K +D IV+ +E K+AAWTFLP+ENGE +Q+L YE+GQ+Y+ H+
Sbjct: 76  GESEDSEVRTSSGMFLTKRQDDIVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHF 135

Query: 143 DYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFP-MAEKSPHRRASETDEDLSDCARKG 202
           DYF DK  +  GGHR+ATVLMYLS+VTKGGETVFP    K+P  +    D+  S CA++G
Sbjct: 136 DYFYDKKALELGGHRIATVLMYLSNVTKGGETVFPNWKGKTPQLK----DDSWSKCAKQG 195

Query: 203 IAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWSATKWIHVDSFSKNLANVGNC 262
            AVKP+KGDALLFF+L  N   D  SLHG CPV+EGEKWSAT+WIHV SF K       C
Sbjct: 196 YAVKPRKGDALLFFNLHLNGTTDPNSLHGSCPVIEGEKWSATRWIHVRSFGKKKL---VC 255

Query: 263 TDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC 301
            D +ESC+ WA  GEC KNP YMVGS    G+CR+SC+ C
Sbjct: 256 VDDHESCQEWADAGECEKNPMYMVGSETSLGFCRKSCKAC 288

BLAST of CmoCh08G011630 vs. ExPASy Swiss-Prot
Match: Q9LN20 (Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana OX=3702 GN=P4H3 PE=2 SV=1)

HSP 1 Score: 247.3 bits (630), Expect = 2.2e-64
Identity = 115/209 (55.02%), Positives = 157/209 (75.12%), Query Frame = 0

Query: 42  ISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSK 101
           +SW+PRAFVY  FL+  EC++LIS+A+  + +S V D+E+GKSK S VRTSSG F+ + +
Sbjct: 79  LSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFLRRGR 138

Query: 102 DAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVL 161
           D I+  IE +IA +TF+P ++GE +QVL YE GQ+YE HYDYFVD+ N   GG R+AT+L
Sbjct: 139 DKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQRMATML 198

Query: 162 MYLSDVTKGGETVFPMAEKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAI 221
           MYLSDV +GGETVFP A  + +  +     +LS+C +KG++VKP+ GDALLF+S+ P+A 
Sbjct: 199 MYLSDVEEGGETVFPAA--NMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDAT 258

Query: 222 PDTKSLHGGCPVLEGEKWSATKWIHVDSF 251
            D  SLHGGCPV+ G KWS+TKW+HV  +
Sbjct: 259 LDPTSLHGGCPVIRGNKWSSTKWMHVGEY 285

BLAST of CmoCh08G011630 vs. ExPASy TrEMBL
Match: A0A6J1EQM4 (Procollagen-proline 4-dioxygenase OS=Cucurbita moschata OX=3662 GN=LOC111436810 PE=3 SV=1)

HSP 1 Score: 616.3 bits (1588), Expect = 6.7e-173
Identity = 300/300 (100.00%), Positives = 300/300 (100.00%), Query Frame = 0

Query: 1   MSRFRSMLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           MSRFRSMLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MSRFRSMLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK 120
           DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK
Sbjct: 61  DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK 180
           ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK
Sbjct: 121 ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK 180

Query: 181 SPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWS 240
           SPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWS
Sbjct: 181 SPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC 300
           ATKWIHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC
Sbjct: 241 ATKWIHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC 300

BLAST of CmoCh08G011630 vs. ExPASy TrEMBL
Match: A0A6J1KCK1 (Procollagen-proline 4-dioxygenase OS=Cucurbita maxima OX=3661 GN=LOC111494385 PE=3 SV=1)

HSP 1 Score: 609.4 bits (1570), Expect = 8.2e-171
Identity = 296/300 (98.67%), Positives = 299/300 (99.67%), Query Frame = 0

Query: 1   MSRFRSMLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           MS+FRS+LFIFLISIASVVRESICS ARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MSKFRSLLFIFLISIASVVRESICSSARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK 120
           DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK
Sbjct: 61  DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK 180
           ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK
Sbjct: 121 ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK 180

Query: 181 SPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWS 240
           SPHRRASETDEDLS+CARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWS
Sbjct: 181 SPHRRASETDEDLSECARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC 300
           ATKWIHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC
Sbjct: 241 ATKWIHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC 300

BLAST of CmoCh08G011630 vs. ExPASy TrEMBL
Match: A0A6J1EWM7 (Procollagen-proline 4-dioxygenase OS=Cucurbita moschata OX=3662 GN=LOC111436810 PE=3 SV=1)

HSP 1 Score: 577.0 bits (1486), Expect = 4.5e-161
Identity = 283/283 (100.00%), Positives = 283/283 (100.00%), Query Frame = 0

Query: 1   MSRFRSMLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           MSRFRSMLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MSRFRSMLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK 120
           DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK
Sbjct: 61  DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK 180
           ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK
Sbjct: 121 ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK 180

Query: 181 SPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWS 240
           SPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWS
Sbjct: 181 SPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYM 284
           ATKWIHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYM
Sbjct: 241 ATKWIHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYM 283

BLAST of CmoCh08G011630 vs. ExPASy TrEMBL
Match: A0A6J1ER58 (Procollagen-proline 4-dioxygenase OS=Cucurbita moschata OX=3662 GN=LOC111436810 PE=3 SV=1)

HSP 1 Score: 577.0 bits (1486), Expect = 4.5e-161
Identity = 283/283 (100.00%), Positives = 283/283 (100.00%), Query Frame = 0

Query: 1   MSRFRSMLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           MSRFRSMLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MSRFRSMLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK 120
           DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK
Sbjct: 61  DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK 180
           ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK
Sbjct: 121 ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK 180

Query: 181 SPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWS 240
           SPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWS
Sbjct: 181 SPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYM 284
           ATKWIHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYM
Sbjct: 241 ATKWIHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYM 283

BLAST of CmoCh08G011630 vs. ExPASy TrEMBL
Match: A0A6J1EUU4 (Procollagen-proline 4-dioxygenase OS=Cucurbita moschata OX=3662 GN=LOC111436810 PE=3 SV=1)

HSP 1 Score: 577.0 bits (1486), Expect = 4.5e-161
Identity = 283/283 (100.00%), Positives = 283/283 (100.00%), Query Frame = 0

Query: 1   MSRFRSMLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           MSRFRSMLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MSRFRSMLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK 120
           DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK
Sbjct: 61  DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK 180
           ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK
Sbjct: 121 ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK 180

Query: 181 SPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWS 240
           SPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWS
Sbjct: 181 SPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYM 284
           ATKWIHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYM
Sbjct: 241 ATKWIHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYM 283

BLAST of CmoCh08G011630 vs. NCBI nr
Match: XP_022930331.1 (probable prolyl 4-hydroxylase 4 isoform X4 [Cucurbita moschata] >XP_022930332.1 probable prolyl 4-hydroxylase 4 isoform X4 [Cucurbita moschata])

HSP 1 Score: 616.3 bits (1588), Expect = 1.4e-172
Identity = 300/300 (100.00%), Positives = 300/300 (100.00%), Query Frame = 0

Query: 1   MSRFRSMLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           MSRFRSMLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MSRFRSMLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK 120
           DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK
Sbjct: 61  DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK 180
           ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK
Sbjct: 121 ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK 180

Query: 181 SPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWS 240
           SPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWS
Sbjct: 181 SPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC 300
           ATKWIHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC
Sbjct: 241 ATKWIHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC 300

BLAST of CmoCh08G011630 vs. NCBI nr
Match: XP_023514355.1 (probable prolyl 4-hydroxylase 4 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023514356.1 probable prolyl 4-hydroxylase 4 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 615.1 bits (1585), Expect = 3.1e-172
Identity = 299/300 (99.67%), Positives = 300/300 (100.00%), Query Frame = 0

Query: 1   MSRFRSMLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           MSRFRS+LFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK 120
           DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK
Sbjct: 61  DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK 180
           ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK
Sbjct: 121 ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK 180

Query: 181 SPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWS 240
           SPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWS
Sbjct: 181 SPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC 300
           ATKWIHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC
Sbjct: 241 ATKWIHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC 300

BLAST of CmoCh08G011630 vs. NCBI nr
Match: XP_023000081.1 (probable prolyl 4-hydroxylase 4 [Cucurbita maxima] >XP_023000082.1 probable prolyl 4-hydroxylase 4 [Cucurbita maxima])

HSP 1 Score: 609.4 bits (1570), Expect = 1.7e-170
Identity = 296/300 (98.67%), Positives = 299/300 (99.67%), Query Frame = 0

Query: 1   MSRFRSMLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           MS+FRS+LFIFLISIASVVRESICS ARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MSKFRSLLFIFLISIASVVRESICSSARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK 120
           DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK
Sbjct: 61  DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK 180
           ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK
Sbjct: 121 ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK 180

Query: 181 SPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWS 240
           SPHRRASETDEDLS+CARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWS
Sbjct: 181 SPHRRASETDEDLSECARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC 300
           ATKWIHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC
Sbjct: 241 ATKWIHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC 300

BLAST of CmoCh08G011630 vs. NCBI nr
Match: KAG6593935.1 (putative prolyl 4-hydroxylase 4, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 594.0 bits (1530), Expect = 7.4e-166
Identity = 290/296 (97.97%), Positives = 294/296 (99.32%), Query Frame = 0

Query: 1   MSRFRSMLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           MSRFRS+LFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK 120
           DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK
Sbjct: 61  DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK 180
           ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK
Sbjct: 121 ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK 180

Query: 181 SPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWS 240
           SPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDT+SLHGGCPVLEGEKWS
Sbjct: 181 SPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRS 297
           ATKWIHVDSFSKNLA+VGNCTDLNESCERWAALGECTKNPEYMVGSPELPGY + S
Sbjct: 241 ATKWIHVDSFSKNLADVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYLQSS 296

BLAST of CmoCh08G011630 vs. NCBI nr
Match: KAG7026278.1 (putative prolyl 4-hydroxylase 4 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 585.5 bits (1508), Expect = 2.6e-163
Identity = 297/358 (82.96%), Positives = 300/358 (83.80%), Query Frame = 0

Query: 1   MSRFRSMLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           MSRFRS+LFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK 120
           DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK
Sbjct: 61  DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAE- 180
           ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAE 
Sbjct: 121 ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEL 180

Query: 181 ---------------------------------------------------------KSP 240
                                                                    KSP
Sbjct: 181 DTCNVPAKRLSPEGGGHEAVCQQGRWASKESGFRGFHIDWRRKQVLARTLDLERGWIKSP 240

Query: 241 HRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWSAT 300
           HRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDT+SLHGGCPVLEGEKWSAT
Sbjct: 241 HRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSAT 300

BLAST of CmoCh08G011630 vs. TAIR 10
Match: AT5G18900.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 451.4 bits (1160), Expect = 5.5e-127
Identity = 215/296 (72.64%), Positives = 248/296 (83.78%), Query Frame = 0

Query: 5   RSMLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLI 64
           R  L I   +I SV+ +S  S   S+S  V+PSKVKQ+S KPRAFVYEGFLT+LECDH++
Sbjct: 3   RRGLLISFFAIFSVLLQSSTSLISSSSVFVNPSKVKQVSSKPRAFVYEGFLTELECDHMV 62

Query: 65  SIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPKENGE 124
           S+A++ LKRS VADN+SG+SK S VRTSSG FI K KD IVSGIEDKI+ WTFLPKENGE
Sbjct: 63  SLAKASLKRSAVADNDSGESKFSEVRTSSGTFISKGKDPIVSGIEDKISTWTFLPKENGE 122

Query: 125 DIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEKSPHR 184
           DIQVLRYEHGQ+Y++H+DYF DKVNI  GGHR+AT+LMYLS+VTKGGETVFP AE    R
Sbjct: 123 DIQVLRYEHGQKYDAHFDYFHDKVNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRR 182

Query: 185 RASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWSATKW 244
             SE  EDLSDCA++GIAVKP+KGDALLFF+L P+AIPD  SLHGGCPV+EGEKWSATKW
Sbjct: 183 VLSENKEDLSDCAKRGIAVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKWSATKW 242

Query: 245 IHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC 301
           IHVDSF + +   GNCTD+NESCERWA LGECTKNPEYMVG+ ELPGYCRRSC+ C
Sbjct: 243 IHVDSFDRIVTPSGNCTDMNESCERWAVLGECTKNPEYMVGTTELPGYCRRSCKAC 298

BLAST of CmoCh08G011630 vs. TAIR 10
Match: AT3G06300.1 (P4H isoform 2)

HSP 1 Score: 442.6 bits (1137), Expect = 2.6e-124
Identity = 210/300 (70.00%), Positives = 252/300 (84.00%), Query Frame = 0

Query: 1   MSRFRSMLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           MS  R  L +F+  +  +++ S C    S S+ ++PSKVKQ+S KPRAFVYEGFLTDLEC
Sbjct: 1   MSMSRLGLLLFVAILLVLLQSSTCL-ISSPSSIINPSKVKQVSSKPRAFVYEGFLTDLEC 60

Query: 61  DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK 120
           DHLIS+A+  L+RS VADN++G+S++S VRTSSG FI K KD IVSGIEDK++ WTFLPK
Sbjct: 61  DHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK 180
           ENGED+QVLRYEHGQ+Y++H+DYF DKVNIA GGHR+ATVL+YLS+VTKGGETVFP A++
Sbjct: 121 ENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQE 180

Query: 181 SPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWS 240
              R  SE  +DLSDCA+KGIAVKPKKG+ALLFF+L+ +AIPD  SLHGGCPV+EGEKWS
Sbjct: 181 FSRRSLSENKDDLSDCAKKGIAVKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWS 240

Query: 241 ATKWIHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC 300
           ATKWIHVDSF K L + GNCTD+NESCERWA LGEC KNPEYMVG+PE+PG CRRSC+ C
Sbjct: 241 ATKWIHVDSFDKILTHDGNCTDVNESCERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 299

BLAST of CmoCh08G011630 vs. TAIR 10
Match: AT3G28480.1 (Oxoglutarate/iron-dependent oxygenase)

HSP 1 Score: 354.4 bits (908), Expect = 9.2e-98
Identity = 162/272 (59.56%), Positives = 207/272 (76.10%), Query Frame = 0

Query: 29  SASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLST 88
           ++S   DP++V Q+SW PR F+YEGFL+D ECDH I +A+ +L++S VADN+SG+S  S 
Sbjct: 46  ASSFGFDPTRVTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE 105

Query: 89  VRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKV 148
           VRTSSGMF+ K +D IVS +E K+AAWTFLP+ENGE +Q+L YE+GQ+YE H+DYF D+ 
Sbjct: 106 VRTSSGMFLSKRQDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQA 165

Query: 149 NIAWGGHRLATVLMYLSDVTKGGETVFPMAEKSPHRRASETDEDLSDCARKGIAVKPKKG 208
           N+  GGHR+ATVLMYLS+V KGGETVFPM +    +     D+  ++CA++G AVKP+KG
Sbjct: 166 NLELGGHRIATVLMYLSNVEKGGETVFPMWK---GKATQLKDDSWTECAKQGYAVKPRKG 225

Query: 209 DALLFFSLEPNAIPDTKSLHGGCPVLEGEKWSATKWIHVDSFSKNLANVGNCTDLNESCE 268
           DALLFF+L PNA  D+ SLHG CPV+EGEKWSAT+WIHV SF +       C D N SCE
Sbjct: 226 DALLFFNLHPNATTDSNSLHGSCPVVEGEKWSATRWIHVKSFERAFNKQSGCMDENVSCE 285

Query: 269 RWAALGECTKNPEYMVGSPELPGYCRRSCRTC 301
           +WA  GEC KNP YMVGS +  GYCR+SC+ C
Sbjct: 286 KWAKAGECQKNPTYMVGSDKDHGYCRKSCKAC 314

BLAST of CmoCh08G011630 vs. TAIR 10
Match: AT3G28490.1 (Oxoglutarate/iron-dependent oxygenase)

HSP 1 Score: 335.1 bits (858), Expect = 5.8e-92
Identity = 163/280 (58.21%), Positives = 208/280 (74.29%), Query Frame = 0

Query: 23  ICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRS-EVADNES 82
           I S   S S +VDP+++ Q+SW PRAF+Y+GFL+D ECDHLI +A+ +L++S  VAD +S
Sbjct: 16  IFSQISSFSFSVDPTRITQLSWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDS 75

Query: 83  GKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHY 142
           G+S+ S VRTSSGMF+ K +D IV+ +E K+AAWTFLP+ENGE +Q+L YE+GQ+Y+ H+
Sbjct: 76  GESEDSEVRTSSGMFLTKRQDDIVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHF 135

Query: 143 DYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFP-MAEKSPHRRASETDEDLSDCARKG 202
           DYF DK  +  GGHR+ATVLMYLS+VTKGGETVFP    K+P  +    D+  S CA++G
Sbjct: 136 DYFYDKKALELGGHRIATVLMYLSNVTKGGETVFPNWKGKTPQLK----DDSWSKCAKQG 195

Query: 203 IAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWSATKWIHVDSFSKNLANVGNC 262
            AVKP+KGDALLFF+L  N   D  SLHG CPV+EGEKWSAT+WIHV SF K       C
Sbjct: 196 YAVKPRKGDALLFFNLHLNGTTDPNSLHGSCPVIEGEKWSATRWIHVRSFGKKKL---VC 255

Query: 263 TDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC 301
            D +ESC+ WA  GEC KNP YMVGS    G+CR+SC+ C
Sbjct: 256 VDDHESCQEWADAGECEKNPMYMVGSETSLGFCRKSCKAC 288

BLAST of CmoCh08G011630 vs. TAIR 10
Match: AT3G28480.2 (Oxoglutarate/iron-dependent oxygenase )

HSP 1 Score: 332.8 bits (852), Expect = 2.9e-91
Identity = 157/280 (56.07%), Positives = 201/280 (71.79%), Query Frame = 0

Query: 29  SASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKS---- 88
           ++S   DP++V Q+SW PR F+YEGFL+D ECDH I +A+ +L++S VADN+SG+S    
Sbjct: 46  ASSFGFDPTRVTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE 105

Query: 89  -KLSTVRTSSGMFIPKSK---DAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESH 148
             +S VR SS           D IVS +E K+AAWTFLP+ENGE +Q+L YE+GQ+YE H
Sbjct: 106 DSVSVVRQSSSFIANMDSLEIDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPH 165

Query: 149 YDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEKSPHRRASETDEDLSDCARKG 208
           +DYF D+ N+  GGHR+ATVLMYLS+V KGGETVFPM +    +     D+  ++CA++G
Sbjct: 166 FDYFHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWK---GKATQLKDDSWTECAKQG 225

Query: 209 IAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWSATKWIHVDSFSKNLANVGNC 268
            AVKP+KGDALLFF+L PNA  D+ SLHG CPV+EGEKWSAT+WIHV SF +       C
Sbjct: 226 YAVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKWSATRWIHVKSFERAFNKQSGC 285

Query: 269 TDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC 301
            D N SCE+WA  GEC KNP YMVGS +  GYCR+SC+ C
Sbjct: 286 MDENVSCEKWAKAGECQKNPTYMVGSDKDHGYCRKSCKAC 322

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
Q8LAN3	7.8e-126	72.64	Probable prolyl 4-hydroxylase 4 OS=Arabidopsis thaliana OX=3702 GN=P4H4 PE=2 SV=...
F4JAU3	3.6e-123	70.00	Prolyl 4-hydroxylase 2 OS=Arabidopsis thaliana OX=3702 GN=P4H2 PE=1 SV=1
Q8L970	1.3e-96	59.56	Probable prolyl 4-hydroxylase 7 OS=Arabidopsis thaliana OX=3702 GN=P4H7 PE=2 SV=...
F4J0A8	8.1e-91	58.21	Probable prolyl 4-hydroxylase 6 OS=Arabidopsis thaliana OX=3702 GN=P4H6 PE=2 SV=...
Q9LN20	2.2e-64	55.02	Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana OX=3702 GN=P4H3 PE=2 SV=...

Match Name	E-value	Identity (%)	Description
A0A6J1EQM4	6.7e-173	100.00	Procollagen-proline 4-dioxygenase OS=Cucurbita moschata OX=3662 GN=LOC111436810 ...
A0A6J1KCK1	8.2e-171	98.67	Procollagen-proline 4-dioxygenase OS=Cucurbita maxima OX=3661 GN=LOC111494385 PE...
A0A6J1EWM7	4.5e-161	100.00	Procollagen-proline 4-dioxygenase OS=Cucurbita moschata OX=3662 GN=LOC111436810 ...
A0A6J1ER58	4.5e-161	100.00	Procollagen-proline 4-dioxygenase OS=Cucurbita moschata OX=3662 GN=LOC111436810 ...
A0A6J1EUU4	4.5e-161	100.00	Procollagen-proline 4-dioxygenase OS=Cucurbita moschata OX=3662 GN=LOC111436810 ...

Match Name	E-value	Identity (%)	Description
XP_022930331.1	1.4e-172	100.00	probable prolyl 4-hydroxylase 4 isoform X4 [Cucurbita moschata] >XP_022930332.1 ...
XP_023514355.1	3.1e-172	99.67	probable prolyl 4-hydroxylase 4 isoform X1 [Cucurbita pepo subsp. pepo] >XP_0235...
XP_023000081.1	1.7e-170	98.67	probable prolyl 4-hydroxylase 4 [Cucurbita maxima] >XP_023000082.1 probable prol...
KAG6593935.1	7.4e-166	97.97	putative prolyl 4-hydroxylase 4, partial [Cucurbita argyrosperma subsp. sororia]
KAG7026278.1	2.6e-163	82.96	putative prolyl 4-hydroxylase 4 [Cucurbita argyrosperma subsp. argyrosperma]

Match Name	E-value	Identity (%)	Description
AT5G18900.1	5.5e-127	72.64	2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
AT3G06300.1	2.6e-124	70.00	P4H isoform 2
AT3G28480.1	9.2e-98	59.56	Oxoglutarate/iron-dependent oxygenase
AT3G28490.1	5.8e-92	58.21	Oxoglutarate/iron-dependent oxygenase
AT3G28480.2	2.9e-91	56.07	Oxoglutarate/iron-dependent oxygenase

InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR003582	ShKT domain	SMART	SM00254	ShkT_1	coord: 259..300; e-value: 0.0072; score: 25.5
IPR003582	ShKT domain	PFAM	PF01549	ShK	coord: 259..300; e-value: 6.3E-4; score: 20.2
IPR003582	ShKT domain	PROSITE	PS51670	SHKT	coord: 260..300; score: 8.652472
IPR006620	Prolyl 4-hydroxylase, alpha subunit	SMART	SM00702	p4hc	coord: 46..246; e-value: 6.3E-63; score: 225.0
None	No IPR available	GENE3D	2.60.120.620	q2cbj1_9rhob like domain	coord: 38..247; e-value: 3.6E-75; score: 254.2
None	No IPR available	PANTHER	PTHR10869:SF175	PROLYL 4-HYDROXYLASE SUBUNIT ALPHA-LIKE PROTEIN	coord: 6..300
IPR044862	Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain	PFAM	PF13640	2OG-FeII_Oxy_3	coord: 126..246; e-value: 8.2E-20; score: 71.5
IPR045054	Prolyl 4-hydroxylase	PANTHER	PTHR10869	PROLYL 4-HYDROXYLASE ALPHA SUBUNIT	coord: 6..300
IPR005123	Oxoglutarate/iron-dependent dioxygenase	PROSITE	PS51471	FE2OG_OXY	coord: 122..247; score: 12.522829
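
Several of the InterPro hits above describe the same region from different source databases. The following sketch merges overlapping `coord` ranges into non-redundant regions. It assumes the coordinates are 1-based and inclusive on the protein sequence, which the record does not state explicitly. The `DomainHit` type and `merge_intervals` helper are hypothetical names introduced for illustration, and the whole-protein PANTHER and GENE3D hits are left out to keep the example focused on the two domain families.

```python
from typing import NamedTuple

class DomainHit(NamedTuple):
    source: str
    accession: str
    start: int  # assumed 1-based, inclusive
    end: int    # assumed inclusive

    @property
    def length(self) -> int:
        return self.end - self.start + 1

# Coordinates copied from the InterPro annotations of this gene.
HITS = [
    DomainHit("SMART", "SM00702", 46, 246),
    DomainHit("PFAM", "PF13640", 126, 246),
    DomainHit("PROSITE", "PS51471", 122, 247),
    DomainHit("SMART", "SM00254", 259, 300),
    DomainHit("PFAM", "PF01549", 259, 300),
    DomainHit("PROSITE", "PS51670", 260, 300),
]

def merge_intervals(hits):
    """Merge overlapping or directly adjacent hits into distinct regions."""
    merged = []
    for start, end in sorted((h.start, h.end) for h in hits):
        if merged and start <= merged[-1][1] + 1:
            # Extend the current region if this hit overlaps or touches it.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(region) for region in merged]

regions = merge_intervals(HITS)
print(regions)  # [(46, 247), (259, 300)]
```

Under these assumptions the hits collapse into two regions: the prolyl 4-hydroxylase catalytic region and the C-terminal ShKT domain.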

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CmoCh08G011630.1	CmoCh08G011630.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0018401	peptidyl-proline hydroxylation to 4-hydroxy-L-proline
cellular_component	GO:0005789	endoplasmic reticulum membrane
molecular_function	GO:0005506	iron ion binding
molecular_function	GO:0031418	L-ascorbic acid binding
molecular_function	GO:0004656	procollagen-proline 4-dioxygenase activity
molecular_function	GO:0016705	oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen