Moc06g38520 (gene) Bitter gourd (OHB3-1) v2

Overview
Name: Moc06g38520
Type: gene
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Procollagen-proline 4-dioxygenase
Location: chr6: 29885502 .. 29888265 (-)
RNA-Seq Expression: Moc06g38520
Synteny: Moc06g38520
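
The Location field above combines a chromosome name, a start and end coordinate, and a strand. The following is a minimal sketch of how that string could be parsed and the genomic span computed. It assumes the coordinates are 1-based and inclusive, which is common for genome browser displays but is not stated on this page; the function name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Split 'chrN: start .. end (strand)' into its parts.

    Assumption: coordinates are 1-based and inclusive, so the
    span length is end - start + 1.
    """
    pattern = r"\s*([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("chr6: 29885502 .. 29888265 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the gene spans 2764 bp on the minus strand of chr6.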
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATTCTCGTCTTCCCGTCTTACTTCTTTTAGCGACTGCAATTTCGTTCTTAAGCTGCCTTGCACAAAGGTGATAACCGAACGCCATTTTACATCTCAGCTGTGTCTTTTTCATTTCGAAGTCGTGGGTTCTGTAGATCCGCAATTGGAATTGTATTCGTAAAACTGATTTTCGTTTCACTTCCTAATACTGAAATTGTTGAATGATTTGTCGATGCATGATTGAGTTTGATATCGTGGTTTTTTTCCCTCCTTTCTGCCAGCAATTTGATTAGTGGGCGCAAGGGTTTAAGGGACCAATTGATCGAAAGTGTACCTTTGAGCTACTCTAATCATTCTGGAAGAATCGACCCATCAAGAGTTGTCCAAGTCTCTTGGCGACCAAGGTCAGGTACTTGCAGAATCTCTCATACGATACTGTTGACGTGAAACTGTGAAATCATTTAGAATTACTCCTAGGTTGTGGCTTATGAATTAGTTCTCTTTTCTCCTCTTTTTGTGACAGAATGAACTTCAAAACATTTTGAAGTTCTTTTATACTGTACATATTTGTTGATTAATTTAATAGGAAGTCTGATTTAACTTTTTTTGCTATAGGGTTTTCTTGTATAAAGGATTTCTCTCAGATGAGGAGTGTGATCACCTTATTTCTTTGGTATGTTTCTGCCATGTTCTTCATAGTATCTTGAACATGGCTTCAGCTGTTTCTGACGTGTATATAACTTTTGGGCGTAGGCTACAAGTTCAGAAGATAAACCTTCTGGGAACAGTACTGACTCTGGGAACACTGTCCCAACCAAAATTCTAAAGAGTTCAGGAGCCATTTTAAACACAACAGTACGTTCCTGTGCTTTTCGAAGTTTATGGGTTGGTTTGATACCACACCTAGTGGTTCATGGTGGAGTTCTAACCATAGAAAATTCTTTTGTAATACCTTTTGATGCTATTCTTTTAATGGTTATTGACAATTGAAAGGTATTTTTTTTGTAGTTCCCCCTAGGCTGTTCTTTTCTCCCCTTTTAAATATAGTTCTCACTTCTCTGTTTCTTAAAAAAAAAAAGTGATGCTGCAGAATCTGTCACATCGAGATTTTCTTATCATGTTTTCTTAAAATTGACAGAAGCAGCATACTATTAATCTTATATCGCTTCGCATTAGTTTCTACATGCTAGTAATTATGCAATCCATTAGTTTAGACCTTTTTCTGACATTGTTGTTTTATTTATTTCTTGTGGTTTTGGTCAGTATTCAACTAATACATGGATGAACAGTTTGAGTTTATCTCATTGTTCAATGTATTTTCTATTGGACTATGTTTGTTTAAAAGCATTCTGTTATTAGTTTTTATGGTGGTAGTGGTACTACCAAAGTATCAAATATCTGTGTTGTGTATATTAGGATTTGGACTTGGTTACACCTACTATTGTTGATCTTAATCTATTACCATTGATGTGGGGTTTTGTTTTCAAAGATTCCATCTCTCATTCCCTAAATCCTTGATGGGTCACTTTTAGCATTGATGATAGACGATATGCTCTCAAATTGCTCTCATTTTTTTTTATTTTCAAACATATAAAACATTCTTTGTTGCTGAGTTTTCTGGGGAACCTGCAGTCGCTTGAAGGGAATCTCAAAAAATTCGGGCTCTAAATGTCTTGTGAAGAGTGAAGTCAGAAATTTCTCTCTCTGGTTTTATAGATATAAAGCACTAAAAAGTGTGGGGGATGGTGGTAAAACCTAAGATAATATATACTAGGATCTCATAATTGTTACTGAAATTCTTTGTTCTTGGTGACTCCACTATGACTTGTCAGTCAGCATCAATTGTCTCCCAAAATTTGTAACAAGCCTGACTTAAGGGAGTATTTGTTGGGATCATAGAACTTATAATTGTGCTTTTTTTTGTGGTACCAACTATTCCTGTTGGTGAAGCTGCTACTGTTAGTCTATTTCTAGGTTCTATGAGATTTACGATTCGTTTATTTTGTTGGGGCTTGTG
TCTTAGGATGATATCATTGCAAGGATCGAGAATCGAATTGCTGTGTGGACTTTTCTTCCAAAAGGTATTTTCCACCAGTACTGTGTACTGCTTTTCCTTCCACATTTTCTGGATAATGCCCCTTGTATTTAATCTTCTTCTAAACTTCTCAGTTCTATTGTCTTCAGATTATAGCATGCCTTTGCAGATTTTGCAATATGGGGGTGAAGAAGCAGAGCATAAGTACGTTTTTGGTAACAGATCTGCAATGTTGTCCAGTGAGCCTTTGATGGCCACAGTAGTTCTGTATCTCTCAGATTCTGCTAGCGGTGGCGAGATGCGCTTTCCTGAATCAAAGGTGAGAGAAAATACTCAAACATCCGTGGCCAACTGGCCATAACACCGTACTAATGACTGCCATTTAAATGCCTCAGGTAAAGAGCAGATTTTGGTCAGACCGGAGAAAGAAAAACAACATTCTGAGACCAGTGAAAGGCAATGCAGTTCTTATTTTCTCTGTGCATCTTAATGCTTCTCCAGACAAGAGTAGCTCCCATACCCGATCTCCGATACTCGATGGGGAATTGTGGATTGCAACAAAATTCTTCTACTTAAGACCAATCACTGGGAATAAACACACAGACGAACCTGATGGAGACTGTAATGATGAAGATAAAAGCTGCCCCCAATGGGCTGCCATTGGCGAATGCGAACGAAACGCTGTTTTCATGATTGGTTCTCCAGATTACTATGGAACATGTAGAAAAAGCTGCAACGCATGTTGA

mRNA sequence

ATGGATTCTCGTCTTCCCGTCTTACTTCTTTTAGCGACTGCAATTTCGTTCTTAAGCTGCCTTGCACAAAGCAATTTGATTAGTGGGCGCAAGGGTTTAAGGGACCAATTGATCGAAAGTGTACCTTTGAGCTACTCTAATCATTCTGGAAGAATCGACCCATCAAGAGTTGTCCAAGTCTCTTGGCGACCAAGGGTTTTCTTGTATAAAGGATTTCTCTCAGATGAGGAGTGTGATCACCTTATTTCTTTGGCTACAAGTTCAGAAGATAAACCTTCTGGGAACAGTACTGACTCTGGGAACACTGTCCCAACCAAAATTCTAAAGAGTTCAGGAGCCATTTTAAACACAACAGATGATATCATTGCAAGGATCGAGAATCGAATTGCTGTGTGGACTTTTCTTCCAAAAGATTATAGCATGCCTTTGCAGATTTTGCAATATGGGGGTGAAGAAGCAGAGCATAAGTACGTTTTTGGTAACAGATCTGCAATGTTGTCCAGTGAGCCTTTGATGGCCACAGTAGTTCTGTATCTCTCAGATTCTGCTAGCGGTGGCGAGATGCGCTTTCCTGAATCAAAGGTAAAGAGCAGATTTTGGTCAGACCGGAGAAAGAAAAACAACATTCTGAGACCAGTGAAAGGCAATGCAGTTCTTATTTTCTCTGTGCATCTTAATGCTTCTCCAGACAAGAGTAGCTCCCATACCCGATCTCCGATACTCGATGGGGAATTGTGGATTGCAACAAAATTCTTCTACTTAAGACCAATCACTGGGAATAAACACACAGACGAACCTGATGGAGACTGTAATGATGAAGATAAAAGCTGCCCCCAATGGGCTGCCATTGGCGAATGCGAACGAAACGCTGTTTTCATGATTGGTTCTCCAGATTACTATGGAACATGTAGAAAAAGCTGCAACGCATGTTGA

Coding sequence (CDS)

ATGGATTCTCGTCTTCCCGTCTTACTTCTTTTAGCGACTGCAATTTCGTTCTTAAGCTGCCTTGCACAAAGCAATTTGATTAGTGGGCGCAAGGGTTTAAGGGACCAATTGATCGAAAGTGTACCTTTGAGCTACTCTAATCATTCTGGAAGAATCGACCCATCAAGAGTTGTCCAAGTCTCTTGGCGACCAAGGGTTTTCTTGTATAAAGGATTTCTCTCAGATGAGGAGTGTGATCACCTTATTTCTTTGGCTACAAGTTCAGAAGATAAACCTTCTGGGAACAGTACTGACTCTGGGAACACTGTCCCAACCAAAATTCTAAAGAGTTCAGGAGCCATTTTAAACACAACAGATGATATCATTGCAAGGATCGAGAATCGAATTGCTGTGTGGACTTTTCTTCCAAAAGATTATAGCATGCCTTTGCAGATTTTGCAATATGGGGGTGAAGAAGCAGAGCATAAGTACGTTTTTGGTAACAGATCTGCAATGTTGTCCAGTGAGCCTTTGATGGCCACAGTAGTTCTGTATCTCTCAGATTCTGCTAGCGGTGGCGAGATGCGCTTTCCTGAATCAAAGGTAAAGAGCAGATTTTGGTCAGACCGGAGAAAGAAAAACAACATTCTGAGACCAGTGAAAGGCAATGCAGTTCTTATTTTCTCTGTGCATCTTAATGCTTCTCCAGACAAGAGTAGCTCCCATACCCGATCTCCGATACTCGATGGGGAATTGTGGATTGCAACAAAATTCTTCTACTTAAGACCAATCACTGGGAATAAACACACAGACGAACCTGATGGAGACTGTAATGATGAAGATAAAAGCTGCCCCCAATGGGCTGCCATTGGCGAATGCGAACGAAACGCTGTTTTCATGATTGGTTCTCCAGATTACTATGGAACATGTAGAAAAAGCTGCAACGCATGTTGA

Protein sequence

MDSRLPVLLLLATAISFLSCLAQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKYVFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
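
The protein sequence above corresponds to the coding sequence read codon by codon. As a rough sketch, assuming the standard nuclear genetic code (NCBI translation table 1), a CDS can be translated as shown below. The helper name `translate` is hypothetical, and only the first and last few codons of the CDS are used here to keep the example short.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon ends the polypeptide
            break
        protein.append(residue)
    return "".join(protein)

# First eight codons and last five codons (including the stop) of the CDS.
cds_start = "ATGGATTCTCGTCTTCCCGTCTTA"
cds_end = "TGCAACGCATGTTGA"
print(translate(cds_start))  # MDSRLPVL
print(translate(cds_end))    # CNAC
```

The two fragments reproduce the start (`MDSRLPVL`) and the end (`CNAC`) of the listed protein sequence.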
Homology
BLAST of Moc06g38520 vs. NCBI nr
Match: XP_022159842.1 (probable prolyl 4-hydroxylase 12 [Momordica charantia])

HSP 1 Score: 632.1 bits (1629), Expect = 2.5e-177
Identity = 310/310 (100.00%), Positives = 310/310 (100.00%), Query Frame = 0

Query: 1   MDSRLPVLLLLATAISFLSCLAQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQV 60
           MDSRLPVLLLLATAISFLSCLAQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQV
Sbjct: 1   MDSRLPVLLLLATAISFLSCLAQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQV 60

Query: 61  SWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDD 120
           SWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDD
Sbjct: 61  SWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDD 120

Query: 121 IIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKYVFGNRSAMLSSEPLMATVVLYLS 180
           IIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKYVFGNRSAMLSSEPLMATVVLYLS
Sbjct: 121 IIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKYVFGNRSAMLSSEPLMATVVLYLS 180

Query: 181 DSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPI 240
           DSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPI
Sbjct: 181 DSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPI 240

Query: 241 LDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYY 300
           LDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYY
Sbjct: 241 LDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYY 300

Query: 301 GTCRKSCNAC 311
           GTCRKSCNAC
Sbjct: 301 GTCRKSCNAC 310
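
Each HSP header reports identity and positives as a count over the alignment length, followed by a percentage. A minimal sketch of that arithmetic is given below; the function name `percent` is hypothetical, and the counts are taken from hits listed on this page.

```python
def percent(matches, alignment_length):
    """Return matches / alignment_length as a percentage, 2 decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / alignment_length, 2)

# Counts from the Benincasa hispida hit (XP_038906497.1).
identity = percent(264, 311)
positives = percent(280, 311)
print(identity, positives)  # 84.89 90.03
```

The alignment length in the denominator includes gap columns, which is why it can exceed the 310-residue query length.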

BLAST of Moc06g38520 vs. NCBI nr
Match: XP_038906497.1 (probable prolyl 4-hydroxylase 12 [Benincasa hispida])

HSP 1 Score: 535.4 bits (1378), Expect = 3.2e-148
Identity = 264/311 (84.89%), Positives = 280/311 (90.03%), Query Frame = 0

Query: 1   MDSRLPVLLLLATAISFLSCLAQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQV 60
           MDSRL  LLLLATA SF +CLAQSNLISGRKGLRDQL++  PLSYSNHSGRIDPSRVVQV
Sbjct: 1   MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDQLVDR-PLSYSNHSGRIDPSRVVQV 60

Query: 61  SWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDD 120
           SW+PRVFLYKGFLSDEECDHLISLA++SED PSGNS  SGNTV TK+L SSG ILNT+DD
Sbjct: 61  SWQPRVFLYKGFLSDEECDHLISLASNSEDNPSGNSAGSGNTVSTKLLNSSGVILNTSDD 120

Query: 121 IIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKYVFGNRSAMLSSEPLMATVVLYLS 180
           IIARIEN+IAVWTFLPKD+ MP QI+QY GEEAEHKY +GN SAM SSEPLMATVVLYLS
Sbjct: 121 IIARIENQIAVWTFLPKDHGMPFQIMQYRGEEAEHKYFYGNGSAMSSSEPLMATVVLYLS 180

Query: 181 DSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPI 240
           DSA GGEM FPESKVKS+FWSDRRKKNN LRPVKGNA+L FSVHLNASPDKSS HTRSPI
Sbjct: 181 DSARGGEMLFPESKVKSKFWSDRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHTRSPI 240

Query: 241 LDGELWIATKFFYLRPITGNKHTDEPDGD-CNDEDKSCPQWAAIGECERNAVFMIGSPDY 300
           L+GELW+ATKFFYLRP TGNK T E D D C DEDKSCPQWAAIGECERN VFMIGSPDY
Sbjct: 241 LNGELWVATKFFYLRPTTGNKRTVESDVDGCIDEDKSCPQWAAIGECERNTVFMIGSPDY 300

Query: 301 YGTCRKSCNAC 311
           YGTCRKSCNAC
Sbjct: 301 YGTCRKSCNAC 310

BLAST of Moc06g38520 vs. NCBI nr
Match: XP_004152378.1 (probable prolyl 4-hydroxylase 12 [Cucumis sativus] >KGN49777.2 hypothetical protein Csa_000298 [Cucumis sativus])

HSP 1 Score: 515.0 bits (1325), Expect = 4.5e-142
Identity = 255/312 (81.73%), Positives = 277/312 (88.78%), Query Frame = 0

Query: 1   MDSRLPVLLLLATAISFLSCLAQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQV 60
           MDSRL  LLLLATA SF +CLAQSNLISGRKGLRD+L++  PLSYSN+SGRIDPSRVVQV
Sbjct: 1   MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDR-PLSYSNYSGRIDPSRVVQV 60

Query: 61  SWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDD 120
           SWRPRVFLYKGFLSDEECDHLISLA++SED PS NS  SG TV T++L SSG ILNTTDD
Sbjct: 61  SWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSSGVILNTTDD 120

Query: 121 IIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKYVFGNRSAML-SSEPLMATVVLYL 180
           I+ARIENR+A+WT LPKD+SMP QI+QY GEEA+HKY +GNRSAML SSEPLMATVVLYL
Sbjct: 121 IVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYL 180

Query: 181 SDSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSP 240
           SDSASGGE+ FPESKVKS+FWS RRKKNN LRPVKGNA+L FSVHLNASPDKSS H RSP
Sbjct: 181 SDSASGGEILFPESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSP 240

Query: 241 ILDGELWIATKFFYLRPITGNKHTDEPDGD-CNDEDKSCPQWAAIGECERNAVFMIGSPD 300
           I DGELW+ATKF YL P  GNKHT + D D C DEDKSCPQWAAIGECERNAVFM+GSPD
Sbjct: 241 IRDGELWVATKFLYLGPPAGNKHTIQSDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPD 300

Query: 301 YYGTCRKSCNAC 311
           YYGTCRKSCNAC
Sbjct: 301 YYGTCRKSCNAC 311

BLAST of Moc06g38520 vs. NCBI nr
Match: XP_008436994.1 (PREDICTED: probable prolyl 4-hydroxylase 12 [Cucumis melo])

HSP 1 Score: 507.7 bits (1306), Expect = 7.2e-140
Identity = 254/312 (81.41%), Positives = 272/312 (87.18%), Query Frame = 0

Query: 1   MDSRLPVLLLLATAISFLSCLAQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQV 60
           MDSRL  LLL ATA SF +CLAQSNLISGRKGLRDQL++  PLSYSN S RIDPSRVVQV
Sbjct: 1   MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDR-PLSYSNQSVRIDPSRVVQV 60

Query: 61  SWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDD 120
           SWRPRVFLYKGFLSDEECDHLISLA++SED PS NS  SGNTV T++L  SG ILNTTDD
Sbjct: 61  SWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGNTVSTELLNGSGVILNTTDD 120

Query: 121 IIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKYVFGNRSAM-LSSEPLMATVVLYL 180
           IIARIENRIAVWT LPKD+ MP QI+QY GEEA+HKY +GNRSAM  SSEPLMATVVLYL
Sbjct: 121 IIARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYL 180

Query: 181 SDSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSP 240
           SDSASGGEM FPESKVKS+FWS RRKK N LRPVKGNA+L FSVHLNASPDKSS H R P
Sbjct: 181 SDSASGGEMLFPESKVKSKFWSGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYP 240

Query: 241 ILDGELWIATKFFYLRPITGNKHTDEPDGD-CNDEDKSCPQWAAIGECERNAVFMIGSPD 300
           I +GELW+ATKF YLRP TGNKHT + + D C DEDKSCPQWAAIGECERNAVFM+GSPD
Sbjct: 241 IRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPD 300

Query: 301 YYGTCRKSCNAC 311
           YYGTCRKSCNAC
Sbjct: 301 YYGTCRKSCNAC 311

BLAST of Moc06g38520 vs. NCBI nr
Match: KAG6579383.1 (putative prolyl 4-hydroxylase 12, partial [Cucurbita argyrosperma subsp. sororia] >KAG7016863.1 putative prolyl 4-hydroxylase 12 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 507.3 bits (1305), Expect = 9.4e-140
Identity = 253/314 (80.57%), Positives = 275/314 (87.58%), Query Frame = 0

Query: 1   MDSRLPVLLLLATAISFLSCLAQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQV 60
           MDSRL  LLLLA A SF SCLAQSN ISGRKGLRDQ++ S  LSYSNHS RIDPSRVVQ+
Sbjct: 1   MDSRLNFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQI 60

Query: 61  SWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDD 120
           SW+PR FLYKGFLSDEECDHLI+LA++SEDKPS N+  S NTV TK L +SGAILNTTDD
Sbjct: 61  SWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSRNTVSTKFLGNSGAILNTTDD 120

Query: 121 IIARIENRIAVWTFLPKDYSMPLQILQYGGEEAE-HKYVFGNRSAMLSSEPLMATVVLYL 180
           IIARIENRIAVWTFLPKD+SMP QI+QYGGEEA  HKY FGNRSAM SSEPLMATVVLYL
Sbjct: 121 IIARIENRIAVWTFLPKDHSMPFQIMQYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYL 180

Query: 181 SDSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSP 240
           SDSASGGE+ FP SKVK RFWSD+RKKNN LRPVKGNAVL FSVHLNASPDKS  H+R+P
Sbjct: 181 SDSASGGEILFPVSKVKRRFWSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTP 240

Query: 241 ILDGELWIATKFFYLRP-ITGNKHTDEP--DGDCNDEDKSCPQWAAIGECERNAVFMIGS 300
           ILDG+LW+ATKFFY+RP  TGN+H  E   D DC DED+SCP+WAAIGEC+RNAVFMIGS
Sbjct: 241 ILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGS 300

Query: 301 PDYYGTCRKSCNAC 311
           PDYYGTCRKSCNAC
Sbjct: 301 PDYYGTCRKSCNAC 314

BLAST of Moc06g38520 vs. ExPASy Swiss-Prot
Match: Q8GXT7 (Probable prolyl 4-hydroxylase 12 OS=Arabidopsis thaliana OX=3702 GN=P4H12 PE=2 SV=1)

HSP 1 Score: 241.9 bits (616), Expect = 9.6e-63
Identity = 130/286 (45.45%), Positives = 180/286 (62.94%), Query Frame = 0

Query: 30  RKGLRDQLIES----VPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLA 89
           RK LRD+ I S       SY   S  +DP+RV+Q+SW PRVFLY+GFLS+EECDHLISL 
Sbjct: 28  RKELRDKEITSKSDDTQASYVLGSKFVDPTRVLQLSWLPRVFLYRGFLSEEECDHLISLR 87

Query: 90  TSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQI 149
             + +  S ++   G T                D ++A IE +++ WTFLP +    +++
Sbjct: 88  KETTEVYSVDA--DGKT--------------QLDPVVAGIEEKVSAWTFLPGENGGSIKV 147

Query: 150 LQYGGEEAEHKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSDRR 209
             Y  E++  K   FG   + +  E L+ATVVLYLS++  GGE+ FP S++K +  +   
Sbjct: 148 RSYTSEKSGKKLDYFGEEPSSVLHESLLATVVLYLSNTTQGGELLFPNSEMKPK--NSCL 207

Query: 210 KKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTD 269
           +  NILRPVKGNA+L F+  LNAS D  S+H R P++ GEL +ATK  Y +     +   
Sbjct: 208 EGGNILRPVKGNAILFFTRLLNASLDGKSTHLRCPVVKGELLVATKLIYAK----KQARI 267

Query: 270 EPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC 311
           E  G+C+DED++C +WA +GEC++N V+MIGSPDYYGTCRKSCNAC
Sbjct: 268 EESGECSDEDENCGRWAKLGECKKNPVYMIGSPDYYGTCRKSCNAC 291

BLAST of Moc06g38520 vs. ExPASy Swiss-Prot
Match: Q8L970 (Probable prolyl 4-hydroxylase 7 OS=Arabidopsis thaliana OX=3702 GN=P4H7 PE=2 SV=1)

HSP 1 Score: 215.7 bits (548), Expect = 7.4e-55
Identity = 120/323 (37.15%), Positives = 185/323 (57.28%), Query Frame = 0

Query: 1   MDSRLPVLLLLATAISFLSCL-----AQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPS 60
           MDSR    + LA ++ FL  L     A +  ++     RD  +  + +  S  S   DP+
Sbjct: 1   MDSR----IFLAFSLCFLFTLPLISSAPNRFLTRSSNTRDGSV--IKMKTSASSFGFDPT 60

Query: 61  RVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAIL 120
           RV Q+SW PRVFLY+GFLSDEECDH I LA    +K      DSG +V +++  SSG  L
Sbjct: 61  RVTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGMFL 120

Query: 121 N-TTDDIIARIENRIAVWTFLPKDYSMPLQILQY--GGEEAEHKYVFGNRSAMLSSEPLM 180
           +   DDI++ +E ++A WTFLP++    +QIL Y  G +   H   F +++ +      +
Sbjct: 121 SKRQDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 180

Query: 181 ATVVLYLSDSASGGEMRFP-----ESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNA 240
           ATV++YLS+   GGE  FP      +++K   W++  K+   ++P KG+A+L F++H NA
Sbjct: 181 ATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNA 240

Query: 241 SPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECE 300
           + D +S H   P+++GE W AT++ +++     +        C DE+ SC +WA  GEC+
Sbjct: 241 TTDSNSLHGSCPVVEGEKWSATRWIHVKSF---ERAFNKQSGCMDENVSCEKWAKAGECQ 300

Query: 301 RNAVFMIGSPDYYGTCRKSCNAC 311
           +N  +M+GS   +G CRKSC AC
Sbjct: 301 KNPTYMVGSDKDHGYCRKSCKAC 314

BLAST of Moc06g38520 vs. ExPASy Swiss-Prot
Match: F4J0A8 (Probable prolyl 4-hydroxylase 6 OS=Arabidopsis thaliana OX=3702 GN=P4H6 PE=2 SV=1)

HSP 1 Score: 215.3 bits (547), Expect = 9.7e-55
Identity = 114/274 (41.61%), Positives = 162/274 (59.12%), Query Frame = 0

Query: 46  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPS-GNSTDSGNTVP 105
           S+ S  +DP+R+ Q+SW PR FLYKGFLSDEECDHLI LA    +K       DSG +  
Sbjct: 21  SSFSFSVDPTRITQLSWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESED 80

Query: 106 TKILKSSGAIL-NTTDDIIARIENRIAVWTFLPKDYSMPLQILQY--GGEEAEHKYVFGN 165
           +++  SSG  L    DDI+A +E ++A WTFLP++    LQIL Y  G +   H   F +
Sbjct: 81  SEVRTSSGMFLTKRQDDIVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYD 140

Query: 166 RSAMLSSEPLMATVVLYLSDSASGGEMRFPESK-----VKSRFWSDRRKKNNILRPVKGN 225
           + A+      +ATV++YLS+   GGE  FP  K     +K   WS   K+   ++P KG+
Sbjct: 141 KKALELGGHRIATVLMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCAKQGYAVKPRKGD 200

Query: 226 AVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKS 285
           A+L F++HLN + D +S H   P+++GE W AT++ ++R     K        C D+ +S
Sbjct: 201 ALLFFNLHLNGTTDPNSLHGSCPVIEGEKWSATRWIHVRSFGKKKLV------CVDDHES 260

Query: 286 CPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC 311
           C +WA  GECE+N ++M+GS    G CRKSC AC
Sbjct: 261 CQEWADAGECEKNPMYMVGSETSLGFCRKSCKAC 288

BLAST of Moc06g38520 vs. ExPASy Swiss-Prot
Match: F4JAU3 (Prolyl 4-hydroxylase 2 OS=Arabidopsis thaliana OX=3702 GN=P4H2 PE=1 SV=1)

HSP 1 Score: 201.1 bits (510), Expect = 1.9e-50
Identity = 109/285 (38.25%), Positives = 169/285 (59.30%), Query Frame = 0

Query: 37  LIESVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNS 96
           L++S     S+ S  I+PS+V QVS +PR F+Y+GFL+D ECDHLISLA  +  + +   
Sbjct: 18  LLQSSTCLISSPSSIINPSKVKQVSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVAD 77

Query: 97  TDSGNTVPTKILKSSGAILNT-TDDIIARIENRIAVWTFLPKDYSMPLQILQY--GGEEA 156
            D+G +  + +  SSG  ++   D I++ IE++++ WTFLPK+    LQ+L+Y  G +  
Sbjct: 78  NDNGESQVSDVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYD 137

Query: 157 EHKYVFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSR--------FWSDRRK 216
            H   F ++  +      +ATV+LYLS+   GGE  FP+++  SR          SD  K
Sbjct: 138 AHFDYFHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAK 197

Query: 217 KNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDE 276
           K   ++P KGNA+L F++  +A PD  S H   P+++GE W ATK+ +   +        
Sbjct: 198 KGIAVKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIH---VDSFDKILT 257

Query: 277 PDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC 311
            DG+C D ++SC +WA +GEC +N  +M+G+P+  G CR+SC AC
Sbjct: 258 HDGNCTDVNESCERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 299

BLAST of Moc06g38520 vs. ExPASy Swiss-Prot
Match: Q8LAN3 (Probable prolyl 4-hydroxylase 4 OS=Arabidopsis thaliana OX=3702 GN=P4H4 PE=2 SV=1)

HSP 1 Score: 201.1 bits (510), Expect = 1.9e-50
Identity = 106/301 (35.22%), Positives = 179/301 (59.47%), Query Frame = 0

Query: 21  LAQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDH 80
           +A+  L+     +   L++S     S+ S  ++PS+V QVS +PR F+Y+GFL++ ECDH
Sbjct: 1   MARRGLLISFFAIFSVLLQSSTSLISSSSVFVNPSKVKQVSSKPRAFVYEGFLTELECDH 60

Query: 81  LISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNT-TDDIIARIENRIAVWTFLPKDY 140
           ++SLA +S  + +    DSG +  +++  SSG  ++   D I++ IE++I+ WTFLPK+ 
Sbjct: 61  MVSLAKASLKRSAVADNDSGESKFSEVRTSSGTFISKGKDPIVSGIEDKISTWTFLPKEN 120

Query: 141 SMPLQILQY--GGEEAEHKYVFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKS 200
              +Q+L+Y  G +   H   F ++  ++     MAT+++YLS+   GGE  FP++++ S
Sbjct: 121 GEDIQVLRYEHGQKYDAHFDYFHDKVNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPS 180

Query: 201 R--------FWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIAT 260
           R          SD  K+   ++P KG+A+L F++H +A PD  S H   P+++GE W AT
Sbjct: 181 RRVLSENKEDLSDCAKRGIAVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKWSAT 240

Query: 261 KFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNA 311
           K+ +   +        P G+C D ++SC +WA +GEC +N  +M+G+ +  G CR+SC A
Sbjct: 241 KWIH---VDSFDRIVTPSGNCTDMNESCERWAVLGECTKNPEYMVGTTELPGYCRRSCKA 298

BLAST of Moc06g38520 vs. ExPASy TrEMBL
Match: A0A6J1E0X9 (Procollagen-proline 4-dioxygenase OS=Momordica charantia OX=3673 GN=LOC111026141 PE=3 SV=1)

HSP 1 Score: 632.1 bits (1629), Expect = 1.2e-177
Identity = 310/310 (100.00%), Positives = 310/310 (100.00%), Query Frame = 0

Query: 1   MDSRLPVLLLLATAISFLSCLAQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQV 60
           MDSRLPVLLLLATAISFLSCLAQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQV
Sbjct: 1   MDSRLPVLLLLATAISFLSCLAQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQV 60

Query: 61  SWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDD 120
           SWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDD
Sbjct: 61  SWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDD 120

Query: 121 IIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKYVFGNRSAMLSSEPLMATVVLYLS 180
           IIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKYVFGNRSAMLSSEPLMATVVLYLS
Sbjct: 121 IIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKYVFGNRSAMLSSEPLMATVVLYLS 180

Query: 181 DSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPI 240
           DSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPI
Sbjct: 181 DSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPI 240

Query: 241 LDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYY 300
           LDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYY
Sbjct: 241 LDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYY 300

Query: 301 GTCRKSCNAC 311
           GTCRKSCNAC
Sbjct: 301 GTCRKSCNAC 310

BLAST of Moc06g38520 vs. ExPASy TrEMBL
Match: A0A1S3AT39 (Procollagen-proline 4-dioxygenase OS=Cucumis melo OX=3656 GN=LOC103482556 PE=3 SV=1)

HSP 1 Score: 507.7 bits (1306), Expect = 3.5e-140
Identity = 254/312 (81.41%), Positives = 272/312 (87.18%), Query Frame = 0

Query: 1   MDSRLPVLLLLATAISFLSCLAQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQV 60
           MDSRL  LLL ATA SF +CLAQSNLISGRKGLRDQL++  PLSYSN S RIDPSRVVQV
Sbjct: 1   MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDR-PLSYSNQSVRIDPSRVVQV 60

Query: 61  SWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDD 120
           SWRPRVFLYKGFLSDEECDHLISLA++SED PS NS  SGNTV T++L  SG ILNTTDD
Sbjct: 61  SWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGNTVSTELLNGSGVILNTTDD 120

Query: 121 IIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKYVFGNRSAM-LSSEPLMATVVLYL 180
           IIARIENRIAVWT LPKD+ MP QI+QY GEEA+HKY +GNRSAM  SSEPLMATVVLYL
Sbjct: 121 IIARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYL 180

Query: 181 SDSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSP 240
           SDSASGGEM FPESKVKS+FWS RRKK N LRPVKGNA+L FSVHLNASPDKSS H R P
Sbjct: 181 SDSASGGEMLFPESKVKSKFWSGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYP 240

Query: 241 ILDGELWIATKFFYLRPITGNKHTDEPDGD-CNDEDKSCPQWAAIGECERNAVFMIGSPD 300
           I +GELW+ATKF YLRP TGNKHT + + D C DEDKSCPQWAAIGECERNAVFM+GSPD
Sbjct: 241 IRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPD 300

Query: 301 YYGTCRKSCNAC 311
           YYGTCRKSCNAC
Sbjct: 301 YYGTCRKSCNAC 311

BLAST of Moc06g38520 vs. ExPASy TrEMBL
Match: A0A6J1E2P0 (Procollagen-proline 4-dioxygenase OS=Cucurbita moschata OX=3662 GN=LOC111430280 PE=3 SV=1)

HSP 1 Score: 506.1 bits (1302), Expect = 1.0e-139
Identity = 252/314 (80.25%), Positives = 274/314 (87.26%), Query Frame = 0

Query: 1   MDSRLPVLLLLATAISFLSCLAQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQV 60
           MDSRL  LLLLA A SF SCLAQSN ISGRKGLRDQ++ S  LSYSNHS RIDPSRVVQ+
Sbjct: 1   MDSRLTFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQI 60

Query: 61  SWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDD 120
           SW+PR FLYKGFLSDEECDHLI+LA++SEDKPS N+  S NTV TK L +SGAILNTTDD
Sbjct: 61  SWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSRNTVSTKFLGNSGAILNTTDD 120

Query: 121 IIARIENRIAVWTFLPKDYSMPLQILQYGGEEAE-HKYVFGNRSAMLSSEPLMATVVLYL 180
           II RIENRIAVWTFLPKD+SMP QI++YGGEEA  HKY FGNRSAM SSEPLMATVVLYL
Sbjct: 121 IIGRIENRIAVWTFLPKDHSMPFQIMKYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYL 180

Query: 181 SDSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSP 240
           SDSASGGE+ FP SKVK RFWSDRRKKNN LRPVKGNAVL FSVHLNASPDKS  H+R+P
Sbjct: 181 SDSASGGEILFPVSKVKRRFWSDRRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTP 240

Query: 241 ILDGELWIATKFFYLRP-ITGNKHTDEP--DGDCNDEDKSCPQWAAIGECERNAVFMIGS 300
           ILDG+LW+ATKFFY+RP  TGN+H  E   D DC DED+SCP+WAAIGEC+RNAVFMIGS
Sbjct: 241 ILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGS 300

Query: 301 PDYYGTCRKSCNAC 311
           PDYYGTCRKSCNAC
Sbjct: 301 PDYYGTCRKSCNAC 314

BLAST of Moc06g38520 vs. ExPASy TrEMBL
Match: A0A5A7TKX1 (Procollagen-proline 4-dioxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold1167G00060 PE=3 SV=1)

HSP 1 Score: 505.0 bits (1299), Expect = 2.3e-139
Identity = 252/312 (80.77%), Positives = 272/312 (87.18%), Query Frame = 0

Query: 1   MDSRLPVLLLLATAISFLSCLAQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQV 60
           MDSRL  LLL ATA SF +CLAQSNLISGRKGLRDQL++  PLSYSN S RIDPSRVVQV
Sbjct: 1   MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDR-PLSYSNQSVRIDPSRVVQV 60

Query: 61  SWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDD 120
           SWRPRVFLYKGFLSD+ECDHLISLA++S+D PS NS  SGNTV T++L  SG ILNTTDD
Sbjct: 61  SWRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSGNTVSTELLNGSGVILNTTDD 120

Query: 121 IIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKYVFGNRSAM-LSSEPLMATVVLYL 180
           IIARIENRIAVWT LPKD+ MP QI+QY GEEA+HKY +GNRSAM  SSEPLMATVVLYL
Sbjct: 121 IIARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYL 180

Query: 181 SDSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSP 240
           SDSASGGEM FPESKVKS+FWS RRKK N LRPVKGNA+L FSVHLNASPDKSS H R P
Sbjct: 181 SDSASGGEMLFPESKVKSKFWSGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYP 240

Query: 241 ILDGELWIATKFFYLRPITGNKHTDEPDGD-CNDEDKSCPQWAAIGECERNAVFMIGSPD 300
           I +GELW+ATKF YLRP TGNKHT + + D C DEDKSCPQWAAIGECERNAVFM+GSPD
Sbjct: 241 IRNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPD 300

Query: 301 YYGTCRKSCNAC 311
           YYGTCRKSCNAC
Sbjct: 301 YYGTCRKSCNAC 311

BLAST of Moc06g38520 vs. ExPASy TrEMBL
Match: A0A6J1IBS3 (Procollagen-proline 4-dioxygenase OS=Cucurbita maxima OX=3661 GN=LOC111471522 PE=3 SV=1)

HSP 1 Score: 501.9 bits (1291), Expect = 1.9e-138
Identity = 251/314 (79.94%), Positives = 273/314 (86.94%), Query Frame = 0

Query: 1   MDSRLPVLLLLATAISFLSCLAQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQV 60
           MDSRL  LLLLA A SF SCLAQSN ISGRKGLRDQ++ S  LSYSNHS RIDPSRVVQ+
Sbjct: 1   MDSRLNFLLLLAAAFSFPSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQI 60

Query: 61  SWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDD 120
           SW+PR FLYKGFLSDEECDHLI+LA++SEDKPS N+  S NTV TK L +SGAILNTTDD
Sbjct: 61  SWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSRNTVSTKFLGNSGAILNTTDD 120

Query: 121 IIARIENRIAVWTFLPKDYSMPLQILQYGGEEAE-HKYVFGNRSAMLSSEPLMATVVLYL 180
           IIARIENRIAVW FLPKD+SMP QI+QYGGEEA   KY FGNRSAM SSEPLMATVVLYL
Sbjct: 121 IIARIENRIAVWLFLPKDHSMPFQIMQYGGEEAAGRKYFFGNRSAMPSSEPLMATVVLYL 180

Query: 181 SDSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSP 240
           SDSA+GGE+ FP SKVK RFWSDRRKKNN LRPVKGNAVL FSVHLNASPDKS  H+R+P
Sbjct: 181 SDSANGGEILFPVSKVKRRFWSDRRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTP 240

Query: 241 ILDGELWIATKFFYLRP-ITGNKHTDEP--DGDCNDEDKSCPQWAAIGECERNAVFMIGS 300
           ILDG+LW+ATKFFY+RP  TGN+H  E   D DC DED+SCP+WAAIGEC+RNAVFMIGS
Sbjct: 241 ILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGS 300

Query: 301 PDYYGTCRKSCNAC 311
           PDYYGTCRKSCNAC
Sbjct: 301 PDYYGTCRKSCNAC 314

BLAST of Moc06g38520 vs. TAIR 10
Match: AT4G25600.1 (Oxoglutarate/iron-dependent oxygenase)

HSP 1 Score: 241.9 bits (616), Expect = 6.9e-64
Identity = 130/286 (45.45%), Positives = 180/286 (62.94%), Query Frame = 0

Query: 30  RKGLRDQLIES----VPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLA 89
           RK LRD+ I S       SY   S  +DP+RV+Q+SW PRVFLY+GFLS+EECDHLISL 
Sbjct: 28  RKELRDKEITSKSDDTQASYVLGSKFVDPTRVLQLSWLPRVFLYRGFLSEEECDHLISLR 87

Query: 90  TSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDDIIARIENRIAVWTFLPKDYSMPLQI 149
             + +  S ++   G T                D ++A IE +++ WTFLP +    +++
Sbjct: 88  KETTEVYSVDA--DGKT--------------QLDPVVAGIEEKVSAWTFLPGENGGSIKV 147

Query: 150 LQYGGEEAEHKY-VFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSRFWSDRR 209
             Y  E++  K   FG   + +  E L+ATVVLYLS++  GGE+ FP S++K +  +   
Sbjct: 148 RSYTSEKSGKKLDYFGEEPSSVLHESLLATVVLYLSNTTQGGELLFPNSEMKPK--NSCL 207

Query: 210 KKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTD 269
           +  NILRPVKGNA+L F+  LNAS D  S+H R P++ GEL +ATK  Y +     +   
Sbjct: 208 EGGNILRPVKGNAILFFTRLLNASLDGKSTHLRCPVVKGELLVATKLIYAK----KQARI 267

Query: 270 EPDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC 311
           E  G+C+DED++C +WA +GEC++N V+MIGSPDYYGTCRKSCNAC
Sbjct: 268 EESGECSDEDENCGRWAKLGECKKNPVYMIGSPDYYGTCRKSCNAC 291

BLAST of Moc06g38520 vs. TAIR 10
Match: AT3G28480.1 (Oxoglutarate/iron-dependent oxygenase)

HSP 1 Score: 215.7 bits (548), Expect = 5.3e-56
Identity = 120/323 (37.15%), Positives = 185/323 (57.28%), Query Frame = 0

Query: 1   MDSRLPVLLLLATAISFLSCL-----AQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPS 60
           MDSR    + LA ++ FL  L     A +  ++     RD  +  + +  S  S   DP+
Sbjct: 1   MDSR----IFLAFSLCFLFTLPLISSAPNRFLTRSSNTRDGSV--IKMKTSASSFGFDPT 60

Query: 61  RVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAIL 120
           RV Q+SW PRVFLY+GFLSDEECDH I LA    +K      DSG +V +++  SSG  L
Sbjct: 61  RVTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGMFL 120

Query: 121 N-TTDDIIARIENRIAVWTFLPKDYSMPLQILQY--GGEEAEHKYVFGNRSAMLSSEPLM 180
           +   DDI++ +E ++A WTFLP++    +QIL Y  G +   H   F +++ +      +
Sbjct: 121 SKRQDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 180

Query: 181 ATVVLYLSDSASGGEMRFP-----ESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNA 240
           ATV++YLS+   GGE  FP      +++K   W++  K+   ++P KG+A+L F++H NA
Sbjct: 181 ATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNA 240

Query: 241 SPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQWAAIGECE 300
           + D +S H   P+++GE W AT++ +++     +        C DE+ SC +WA  GEC+
Sbjct: 241 TTDSNSLHGSCPVVEGEKWSATRWIHVKSF---ERAFNKQSGCMDENVSCEKWAKAGECQ 300

Query: 301 RNAVFMIGSPDYYGTCRKSCNAC 311
           +N  +M+GS   +G CRKSC AC
Sbjct: 301 KNPTYMVGSDKDHGYCRKSCKAC 314

BLAST of Moc06g38520 vs. TAIR 10
Match: AT3G28490.1 (Oxoglutarate/iron-dependent oxygenase)

HSP 1 Score: 215.3 bits (547), Expect = 6.9e-56
Identity = 114/274 (41.61%), Positives = 162/274 (59.12%), Query Frame = 0

Query: 46  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPS-GNSTDSGNTVP 105
           S+ S  +DP+R+ Q+SW PR FLYKGFLSDEECDHLI LA    +K       DSG +  
Sbjct: 21  SSFSFSVDPTRITQLSWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESED 80

Query: 106 TKILKSSGAIL-NTTDDIIARIENRIAVWTFLPKDYSMPLQILQY--GGEEAEHKYVFGN 165
           +++  SSG  L    DDI+A +E ++A WTFLP++    LQIL Y  G +   H   F +
Sbjct: 81  SEVRTSSGMFLTKRQDDIVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYD 140

Query: 166 RSAMLSSEPLMATVVLYLSDSASGGEMRFPESK-----VKSRFWSDRRKKNNILRPVKGN 225
           + A+      +ATV++YLS+   GGE  FP  K     +K   WS   K+   ++P KG+
Sbjct: 141 KKALELGGHRIATVLMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCAKQGYAVKPRKGD 200

Query: 226 AVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKS 285
           A+L F++HLN + D +S H   P+++GE W AT++ ++R     K        C D+ +S
Sbjct: 201 ALLFFNLHLNGTTDPNSLHGSCPVIEGEKWSATRWIHVRSFGKKKLV------CVDDHES 260

Query: 286 CPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC 311
           C +WA  GECE+N ++M+GS    G CRKSC AC
Sbjct: 261 CQEWADAGECEKNPMYMVGSETSLGFCRKSCKAC 288

BLAST of Moc06g38520 vs. TAIR 10
Match: AT3G28480.2 (Oxoglutarate/iron-dependent oxygenase)

HSP 1 Score: 208.0 bits (528), Expect = 1.1e-53
Identity = 120/331 (36.25%), Positives = 185/331 (55.89%), Query Frame = 0

Query: 1   MDSRLPVLLLLATAISFLSCL-----AQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPS 60
           MDSR    + LA ++ FL  L     A +  ++     RD  +  + +  S  S   DP+
Sbjct: 1   MDSR----IFLAFSLCFLFTLPLISSAPNRFLTRSSNTRDGSV--IKMKTSASSFGFDPT 60

Query: 61  RVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTK-----ILKS 120
           RV Q+SW PRVFLY+GFLSDEECDH I LA    +K      DSG +V ++     + +S
Sbjct: 61  RVTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEDSVSVVRQS 120

Query: 121 SGAILN----TTDDIIARIENRIAVWTFLPKDYSMPLQILQY--GGEEAEHKYVFGNRSA 180
           S  I N      DDI++ +E ++A WTFLP++    +QIL Y  G +   H   F +++ 
Sbjct: 121 SSFIANMDSLEIDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQAN 180

Query: 181 MLSSEPLMATVVLYLSDSASGGEMRFP-----ESKVKSRFWSDRRKKNNILRPVKGNAVL 240
           +      +ATV++YLS+   GGE  FP      +++K   W++  K+   ++P KG+A+L
Sbjct: 181 LELGGHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALL 240

Query: 241 IFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDEPDGDCNDEDKSCPQ 300
            F++H NA+ D +S H   P+++GE W AT++ +++     +        C DE+ SC +
Sbjct: 241 FFNLHPNATTDSNSLHGSCPVVEGEKWSATRWIHVKSF---ERAFNKQSGCMDENVSCEK 300

Query: 301 WAAIGECERNAVFMIGSPDYYGTCRKSCNAC 311
           WA  GEC++N  +M+GS   +G CRKSC AC
Sbjct: 301 WAKAGECQKNPTYMVGSDKDHGYCRKSCKAC 322

BLAST of Moc06g38520 vs. TAIR 10
Match: AT3G06300.1 (P4H isoform 2)

HSP 1 Score: 201.1 bits (510), Expect = 1.3e-51
Identity = 109/285 (38.25%), Positives = 169/285 (59.30%), Query Frame = 0

Query: 37  LIESVPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNS 96
           L++S     S+ S  I+PS+V QVS +PR F+Y+GFL+D ECDHLISLA  +  + +   
Sbjct: 18  LLQSSTCLISSPSSIINPSKVKQVSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVAD 77

Query: 97  TDSGNTVPTKILKSSGAILNT-TDDIIARIENRIAVWTFLPKDYSMPLQILQY--GGEEA 156
            D+G +  + +  SSG  ++   D I++ IE++++ WTFLPK+    LQ+L+Y  G +  
Sbjct: 78  NDNGESQVSDVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYD 137

Query: 157 EHKYVFGNRSAMLSSEPLMATVVLYLSDSASGGEMRFPESKVKSR--------FWSDRRK 216
            H   F ++  +      +ATV+LYLS+   GGE  FP+++  SR          SD  K
Sbjct: 138 AHFDYFHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAK 197

Query: 217 KNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPILDGELWIATKFFYLRPITGNKHTDE 276
           K   ++P KGNA+L F++  +A PD  S H   P+++GE W ATK+ +   +        
Sbjct: 198 KGIAVKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIH---VDSFDKILT 257

Query: 277 PDGDCNDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC 311
            DG+C D ++SC +WA +GEC +N  +M+G+P+  G CR+SC AC
Sbjct: 258 HDGNCTDVNESCERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 299
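
The Identity and Positives percentages in the HSP headers above are the match counts divided by the alignment length (gaps included). The short sketch below recomputes them from the counts shown for the three TAIR 10 hits; the function and variable names are illustrative and not part of any BLAST tool.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 decimals."""
    return round(100.0 * matches / alignment_length, 2)


# (identical, positive, alignment length) copied from the HSP headers above
hsps = {
    "AT3G28490.1": (114, 162, 274),
    "AT3G28480.2": (120, 185, 331),
    "AT3G06300.1": (109, 169, 285),
}

# Map each hit to its (identity %, positives %) pair
summary = {
    name: (percent(ident, length), percent(pos, length))
    for name, (ident, pos, length) in hsps.items()
}

for name, (ident_pct, pos_pct) in summary.items():
    print(f"{name}: identity {ident_pct:.2f}%, positives {pos_pct:.2f}%")
```

The recomputed values agree with the percentages reported in each HSP header.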

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_022159842.1 | 2.5e-177 | 100.00 | probable prolyl 4-hydroxylase 12 [Momordica charantia]
XP_038906497.1 | 3.2e-148 | 84.89 | probable prolyl 4-hydroxylase 12 [Benincasa hispida]
XP_004152378.1 | 4.5e-142 | 81.73 | probable prolyl 4-hydroxylase 12 [Cucumis sativus] >KGN49777.2 hypothetical prot...
XP_008436994.1 | 7.2e-140 | 81.41 | PREDICTED: probable prolyl 4-hydroxylase 12 [Cucumis melo]
KAG6579383.1 | 9.4e-140 | 80.57 | putative prolyl 4-hydroxylase 12, partial [Cucurbita argyrosperma subsp. sororia...
Match Name | E-value | Identity | Description
Q8GXT7 | 9.6e-63 | 45.45 | Probable prolyl 4-hydroxylase 12 OS=Arabidopsis thaliana OX=3702 GN=P4H12 PE=2 S...
Q8L970 | 7.4e-55 | 37.15 | Probable prolyl 4-hydroxylase 7 OS=Arabidopsis thaliana OX=3702 GN=P4H7 PE=2 SV=...
F4J0A8 | 9.7e-55 | 41.61 | Probable prolyl 4-hydroxylase 6 OS=Arabidopsis thaliana OX=3702 GN=P4H6 PE=2 SV=...
F4JAU3 | 1.9e-50 | 38.25 | Prolyl 4-hydroxylase 2 OS=Arabidopsis thaliana OX=3702 GN=P4H2 PE=1 SV=1
Q8LAN3 | 1.9e-50 | 35.22 | Probable prolyl 4-hydroxylase 4 OS=Arabidopsis thaliana OX=3702 GN=P4H4 PE=2 SV=...
Match Name | E-value | Identity | Description
A0A6J1E0X9 | 1.2e-177 | 100.00 | Procollagen-proline 4-dioxygenase OS=Momordica charantia OX=3673 GN=LOC111026141...
A0A1S3AT39 | 3.5e-140 | 81.41 | Procollagen-proline 4-dioxygenase OS=Cucumis melo OX=3656 GN=LOC103482556 PE=3 S...
A0A6J1E2P0 | 1.0e-139 | 80.25 | Procollagen-proline 4-dioxygenase OS=Cucurbita moschata OX=3662 GN=LOC111430280 ...
A0A5A7TKX1 | 2.3e-139 | 80.77 | Procollagen-proline 4-dioxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C2...
A0A6J1IBS3 | 1.9e-138 | 79.94 | Procollagen-proline 4-dioxygenase OS=Cucurbita maxima OX=3661 GN=LOC111471522 PE...
Match Name | E-value | Identity | Description
AT4G25600.1 | 6.9e-64 | 45.45 | Oxoglutarate/iron-dependent oxygenase
AT3G28480.1 | 5.3e-56 | 37.15 | Oxoglutarate/iron-dependent oxygenase
AT3G28490.1 | 6.9e-56 | 41.61 | Oxoglutarate/iron-dependent oxygenase
AT3G28480.2 | 1.1e-53 | 36.25 | Oxoglutarate/iron-dependent oxygenase
AT3G06300.1 | 1.3e-51 | 38.25 | P4H isoform 2
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR006620 | Prolyl 4-hydroxylase, alpha subunit | SMART | SM00702 | p4hc | coord: 64..253; e-value: 2.3E-17; score: 73.7
IPR003582 | ShKT domain | SMART | SM00254 | ShkT_1 | coord: 269..310; e-value: 0.0018; score: 27.6
IPR003582 | ShKT domain | PROSITE | PS51670 | SHKT | coord: 270..310; score: 8.436107
None | No IPR available | GENE3D | 2.60.120.620 | q2cbj1_9rhob like domain | coord: 56..253; e-value: 8.0E-39; score: 135.5
None | No IPR available | PANTHER | PTHR10869:SF102 | PROLYL 4-HYDROXYLASE 12-RELATED | coord: 1..310
None | No IPR available | PROSITE | PS51257 | PROKAR_LIPOPROTEIN | coord: 1..20; score: 5.0
IPR045054 | Prolyl 4-hydroxylase | PANTHER | PTHR10869 | PROLYL 4-HYDROXYLASE ALPHA SUBUNIT | coord: 1..310
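
Each InterPro hit above reports its position on the protein as a `start..end` coordinate pair. As a minimal sketch, assuming these are 1-based inclusive residue positions (an assumption, not stated on this page), the span of each hit can be computed as follows; the helper name `span_length` is hypothetical.

```python
def span_length(coord: str) -> int:
    """Length of a 'start..end' range, treating both ends as inclusive (assumed)."""
    start, end = (int(part) for part in coord.split(".."))
    return end - start + 1


# Coordinates copied from the InterPro annotations above
hits = {
    "SM00702 (p4hc)": "64..253",
    "SM00254 (ShkT_1)": "269..310",
    "PS51670 (SHKT)": "270..310",
    "2.60.120.620 (GENE3D)": "56..253",
}

lengths = {name: span_length(coord) for name, coord in hits.items()}

for name, length in lengths.items():
    print(f"{name}: {length} residues")
```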

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Moc06g38520.1 | Moc06g38520.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0019511 | peptidyl-proline hydroxylation
cellular_component | GO:0005789 | endoplasmic reticulum membrane
molecular_function | GO:0005506 | iron ion binding
molecular_function | GO:0031418 | L-ascorbic acid binding
molecular_function | GO:0004656 | procollagen-proline 4-dioxygenase activity
molecular_function | GO:0016705 | oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen