CsGy5G004710 (gene) Cucumber (Gy14) v2

Name: CsGy5G004710
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v2)
Description: probable prolyl 4-hydroxylase 12
Location: Chr5 : 3065403 .. 3068231 (+)
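
The location field can be turned into a feature length with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (as in GFF-style annotation); the page itself does not state this, and the helper name `span_length` is hypothetical.

```python
import re

def span_length(location: str) -> int:
    """Return the length of a 'ChrN : start .. end (strand)' location.

    Assumption: start and end are 1-based and both inclusive.
    """
    # Match the 'start .. end' pair; the chromosome digit is not followed by '..'
    match = re.search(r"(\d+)\s*\.\.\s*(\d+)", location)
    if match is None:
        raise ValueError(f"no 'start .. end' pair found in {location!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1

print(span_length("Chr5 : 3065403 .. 3068231 (+)"))  # 2829 under the inclusive assumption
```

If the assumption holds, the genomic sequence listed for this feature would be expected to have that many bases.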
The following sequences are available for this feature:

Gene sequence (with intron)

GGAATATATTCAATTTGTCGCAACGGTGAGAAAGTGGAAGTTCAAGCTTCAAGGAAGTGGCGGTGGGATCCAGATTCGTAGAACTACATAAGACGCCATGGATTTTTACACCACCAGCTTTCTCAAAATCTAATTTTGTCTTCAACTTCGTCTTGTTTCGATCTTCGTTCCACCCATCCATGGATTCTCGTCTTAACTTTTTGCTTCTGTTAGCGACTGCATTTTCATTCTCAACCTGCCTTGCTCAAAGGTGATAAACCAAACCCACCCCATTTTACATCTCATTTCGAAATCTTAGGTTTTACGTATTCCATTATTCGCATCATATGTTTACTGCATGAATTGGGTTCTCAATTCTCATTAATTTCGTGTATGGATGGATGTTAAACCGAAATCGTGTTAGTAAAACTGATTTTCATTTCACTTTGGAATGATTTGATGATACATGATTGCGTTTCATAACATGGGATTTTATTCTTTCCCTTGAATCTGCTTTGTTTACTCTGCCAGCAATTTGATTAGTGGCCGTAAGGGTTTAAGGGACCGATTGGTTGACAGACCTTTGAGCTACTCAAACTATTCTGGTAGAATCGACCCATCAAGAGTTGTACAAGTCTCTTGGCGACCAAGGTCAGGTACTTGCAGAATCTCTCTTATGATTCGGAATTATCATATAAACTTTAAAAAGTATATTGATTAACTTAGTAGAAAGTCTGATTTAACTTCTTTTGCTGTAGGGTTTTCTTGTATAAAGGTTTTCTCTCAGATGAGGAGTGTGATCATCTTATTTCTTTGGTATTTTTCTGCCCTGTTCTTGTTAGTATCATAAACATGCTTCATCTGTTTCTGACGTGCATACAACTTTTAGGCTTCAAATTCGGAAGACAATCCTTCTAGGAATAGTGCTGGTTCGGGGATCACTGTCTCAACCGAATTGCTAAACAGTTCAGGAGTCATTTTAAACACAACAGTATGTTCCTGTGCATTTTGAAGTTTATTCAATGGTTTGAACAGAAGACTTGGTGGTCAAATTTGTTCTCATAGCACGAGATTTGCTTATCTTGTTTTCTTGAAGTTTATGTTAAAGTATTATATTTTTTCAATCATATATTGTTTTCTGTTAGTTTCTAGACGTGAGCAATTAGGCAGCCCATCAGTTATAAAACTTTTTTTTTGTTATTATTGTTCTCTTTGTTTCCGGTATTGTTATTCAAAATACAACTACAAATGTTCGTGGATGATGCTTGGCAATTGTTTGAGTTAAATTAGTTGTTCAATGTATTTTCGATTGAAATATGTTCAGTATAACGTTTTCTGTTGATAGTTTTCACAGTGGTACTACCAAACATCCGTGTTGCATGTTAGAATTAGGCACTTAGTATATCTACTGTTGTTGCTTTTAATCTATAATTATTGAAGGTGTGGGGTTTTATTTCCAAAGGTTCCTTCTCTGTTTTCTCTTTCCTTAAAATTTCTTGATGGGTCACTTTTAGGATTGATGCATTTCATATGCTGTCAACCCGTCCTAAAACTCATAGAATCATGCTTTGTTGTTGGCATCCACTGTCTTTCAAATTTATAGCAAACCATGGCTCAAAGGGAGTTCATGACCCAATAATCAGTTTTGTGCTTCCTTTTGTAAGGCATTTATTTCTATTCATTGTATTTATGAGATAAAACTCCAAGTCACTGCAGTTGATTGATTTCAAGTTTCTATGAGATTTGTTCTTTGATTTCTTGGTTGGGCTTGTTTCTTAGGATGATATCGTTGCAAGAATTGAAAATCGACTTGCAATATGGACTTTGCTTCCAAAAGGTATTTCTCATCAGTGCTGTATTGCACTGGCTTTGCCTTTTTTCATTTTCCTGATAATGCTCCTTCTGTTTGATTTTCTTTTCTTCTAAACATATTAGTTTTATTGTCTTCAGATCATAGCATGCCTTTTCAGATCATGCAATACAGGGGTGAAGAAGCAAAGCACAAGTACTTTTATGGCAAC
AGATCAGCAATGTTGCCGTCCAGTGAGCCTTTGATGGCCACAGTAGTTTTGTATCTCTCAGATTCTGCTAGTGGCGGTGAAATACTGTTTCCAGAATCAAAGGTGAGGGGAAGTACTCAAAGATCTGTGGCCATAACAATGTACTCATGATTACCTTTTTCTCAATGCCTCAGGTAAAGAGCAAATTTTGGTCAGGCCGAAGAAAGAAAAACAACTTTCTGAGACCAGTGAAAGGCAATGCAATTCTTTTTTTTTCTGTGCATCTTAATGCCTCTCCAGACAAGAGTAGCTACCACATTCGATCCCCAATACGCGATGGGGAGTTGTGGGTTGCTACAAAATTCTTATACTTAGGACCACCCGCTGGGAATAAACACACTATTCAATCCGATGTAGATGGGTGCTTTGATGAAGATAAAAGCTGCCCTCAATGGGCTGCCATTGGCGAATGCGAACGAAATGCTGTGTTCATGGTTGGTTCTCCAGATTACTATGGTACATGTAGAAAAAGCTGCAATGCATGTTGATTGATGCACGACCAAATTCAAGTAAAAATTTCTCTCGTTCTGATTTGAGCAACTATTTCTTTATTTATTTATTCCTAATTTCATGCATGTACTACACCAAATGATTCTTAGATATGTAGCTGTAATATTGCTCATCTTCTTGGAACATGAGACAATCACGTTCTAGTCCTTAGGTTTGAGATGTATTTAACTCTTGCTTAACTTTTTTTAATTTAATTGCTATATTTGTAGATCTTTTGGATCTTTGCAACTATCTTAATAATATTAATAATATGATTACATGATGATTGAGTATATTAAAA

mRNA sequence

GGAATATATTCAATTTGTCGCAACGGTGAGAAAGTGGAAGTTCAAGCTTCAAGGAAGTGGCGGTGGGATCCAGATTCGTAGAACTACATAAGACGCCATGGATTTTTACACCACCAGCTTTCTCAAAATCTAATTTTGTCTTCAACTTCGTCTTGTTTCGATCTTCGTTCCACCCATCCATGGATTCTCGTCTTAACTTTTTGCTTCTGTTAGCGACTGCATTTTCATTCTCAACCTGCCTTGCTCAAAGCAATTTGATTAGTGGCCGTAAGGGTTTAAGGGACCGATTGGTTGACAGACCTTTGAGCTACTCAAACTATTCTGGTAGAATCGACCCATCAAGAGTTGTACAAGTCTCTTGGCGACCAAGGGTTTTCTTGTATAAAGGTTTTCTCTCAGATGAGGAGTGTGATCATCTTATTTCTTTGGCTTCAAATTCGGAAGACAATCCTTCTAGGAATAGTGCTGGTTCGGGGATCACTGTCTCAACCGAATTGCTAAACAGTTCAGGAGTCATTTTAAACACAACAGATGATATCGTTGCAAGAATTGAAAATCGACTTGCAATATGGACTTTGCTTCCAAAAGATCATAGCATGCCTTTTCAGATCATGCAATACAGGGGTGAAGAAGCAAAGCACAAGTACTTTTATGGCAACAGATCAGCAATGTTGCCGTCCAGTGAGCCTTTGATGGCCACAGTAGTTTTGTATCTCTCAGATTCTGCTAGTGGCGGTGAAATACTGTTTCCAGAATCAAAGGTAAAGAGCAAATTTTGGTCAGGCCGAAGAAAGAAAAACAACTTTCTGAGACCAGTGAAAGGCAATGCAATTCTTTTTTTTTCTGTGCATCTTAATGCCTCTCCAGACAAGAGTAGCTACCACATTCGATCCCCAATACGCGATGGGGAGTTGTGGGTTGCTACAAAATTCTTATACTTAGGACCACCCGCTGGGAATAAACACACTATTCAATCCGATGTAGATGGGTGCTTTGATGAAGATAAAAGCTGCCCTCAATGGGCTGCCATTGGCGAATGCGAACGAAATGCTGTGTTCATGGTTGGTTCTCCAGATTACTATGGTACATGTAGAAAAAGCTGCAATGCATGTTGATTGATGCACGACCAAATTCAAGTAAAAATTTCTCTCGTTCTGATTTGAGCAACTATTTCTTTATTTATTTATTCCTAATTTCATGCATGTACTACACCAAATGATTCTTAGATATGTAGCTGTAATATTGCTCATCTTCTTGGAACATGAGACAATCACGTTCTAGTCCTTAGGTTTGAGATGTATTTAACTCTTGCTTAACTTTTTTTAATTTAATTGCTATATTTGTAGATCTTTTGGATCTTTGCAACTATCTTAATAATATTAATAATATGATTACATGATGATTGAGTATATTAAAA

Coding sequence (CDS)

ATGGATTCTCGTCTTAACTTTTTGCTTCTGTTAGCGACTGCATTTTCATTCTCAACCTGCCTTGCTCAAAGCAATTTGATTAGTGGCCGTAAGGGTTTAAGGGACCGATTGGTTGACAGACCTTTGAGCTACTCAAACTATTCTGGTAGAATCGACCCATCAAGAGTTGTACAAGTCTCTTGGCGACCAAGGGTTTTCTTGTATAAAGGTTTTCTCTCAGATGAGGAGTGTGATCATCTTATTTCTTTGGCTTCAAATTCGGAAGACAATCCTTCTAGGAATAGTGCTGGTTCGGGGATCACTGTCTCAACCGAATTGCTAAACAGTTCAGGAGTCATTTTAAACACAACAGATGATATCGTTGCAAGAATTGAAAATCGACTTGCAATATGGACTTTGCTTCCAAAAGATCATAGCATGCCTTTTCAGATCATGCAATACAGGGGTGAAGAAGCAAAGCACAAGTACTTTTATGGCAACAGATCAGCAATGTTGCCGTCCAGTGAGCCTTTGATGGCCACAGTAGTTTTGTATCTCTCAGATTCTGCTAGTGGCGGTGAAATACTGTTTCCAGAATCAAAGGTAAAGAGCAAATTTTGGTCAGGCCGAAGAAAGAAAAACAACTTTCTGAGACCAGTGAAAGGCAATGCAATTCTTTTTTTTTCTGTGCATCTTAATGCCTCTCCAGACAAGAGTAGCTACCACATTCGATCCCCAATACGCGATGGGGAGTTGTGGGTTGCTACAAAATTCTTATACTTAGGACCACCCGCTGGGAATAAACACACTATTCAATCCGATGTAGATGGGTGCTTTGATGAAGATAAAAGCTGCCCTCAATGGGCTGCCATTGGCGAATGCGAACGAAATGCTGTGTTCATGGTTGGTTCTCCAGATTACTATGGTACATGTAGAAAAAGCTGCAATGCATGTTGA

Protein sequence

MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIQSDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC
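
The protein sequence above is the conceptual translation of the CDS. As a minimal sketch of that relationship, the snippet below translates the first eight codons of the CDS with the standard genetic code (assumed here; the page does not name the translation table). The function name `translate` is hypothetical, and only a short prefix of the CDS is used to keep the example compact.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence codon by codon; '*' marks a stop."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 24 bases of the CDS listed above.
print(translate("ATGGATTCTCGTCTTAACTTTTTG"))  # MDSRLNFL
```

Applied to the full CDS, the same function should yield the protein sequence followed by a terminal `*` for the TGA stop codon.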
BLAST of CsGy5G004710 vs. NCBI nr
Match: XP_004152378.1 (PREDICTED: probable prolyl 4-hydroxylase 12 [Cucumis sativus])

HSP 1 Score: 638.3 bits (1645), Expect = 1.4e-179
Identity = 311/311 (100.00%), Positives = 311/311 (100.00%), Query Frame = 0

Query: 1   MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVS 60
           MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVS
Sbjct: 1   MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVS 60

Query: 61  WRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSSGVILNTTDDI 120
           WRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSSGVILNTTDDI
Sbjct: 61  WRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSSGVILNTTDDI 120

Query: 121 VARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLS 180
           VARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLS
Sbjct: 121 VARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLS 180

Query: 181 DSASGGEILFPESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPI 240
           DSASGGEILFPESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPI
Sbjct: 181 DSASGGEILFPESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPI 240

Query: 241 RDGELWVATKFLYLGPPAGNKHTIQSDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDY 300
           RDGELWVATKFLYLGPPAGNKHTIQSDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDY
Sbjct: 241 RDGELWVATKFLYLGPPAGNKHTIQSDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDY 300

Query: 301 YGTCRKSCNAC 312
           YGTCRKSCNAC
Sbjct: 301 YGTCRKSCNAC 311
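
Each HSP statistics line reports match counts as `matches/alignment length` followed by a percentage. The sketch below recomputes those percentages from the counts, which can help when checking or re-parsing reports like the ones on this page. The function name `recompute_percentages` is hypothetical, and the regular expression assumes the exact `Label = n/m` layout used here.

```python
import re

def recompute_percentages(stats_line: str) -> dict:
    """Recompute the percentage for every 'Label = n/m' pair on a stats line."""
    result = {}
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", stats_line):
        result[label] = round(100 * int(num) / int(den), 2)
    return result

# Counts taken from the Cucumis melo hit (XP_008436994.1) reported on this page.
line = "Identity = 289/311 (92.93%), Positives = 297/311 (95.50%), Query Frame = 0"
print(recompute_percentages(line))
```

The denominator is the alignment length including gap columns, so it can exceed the 311-residue query length for gapped hits.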

BLAST of CsGy5G004710 vs. NCBI nr
Match: XP_008436994.1 (PREDICTED: probable prolyl 4-hydroxylase 12 [Cucumis melo])

HSP 1 Score: 597.0 bits (1538), Expect = 3.6e-167
Identity = 289/311 (92.93%), Positives = 297/311 (95.50%), Query Frame = 0

Query: 1   MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVS 60
           MDSRLNFLLL ATAFSFSTCLAQSNLISGRKGLRD+LVDRPLSYSN S RIDPSRVVQVS
Sbjct: 1   MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVS 60

Query: 61  WRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSSGVILNTTDDI 120
           WRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSG TVSTELLN SGVILNTTDDI
Sbjct: 61  WRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGNTVSTELLNGSGVILNTTDDI 120

Query: 121 VARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLS 180
           +ARIENR+A+WTLLPKDH MPFQIMQYRGEEAKHKYFYGNRSAM  SSEPLMATVVLYLS
Sbjct: 121 IARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLS 180

Query: 181 DSASGGEILFPESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPI 240
           DSASGGE+LFPESKVKSKFWSGRRKK NFLRPVKGNAILFFSVHLNASPDKSSYHIR PI
Sbjct: 181 DSASGGEMLFPESKVKSKFWSGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPI 240

Query: 241 RDGELWVATKFLYLGPPAGNKHTIQSDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDY 300
           R+GELWVATKFLYL PP GNKHTI S++DGC DEDKSCPQWAAIGECERNAVFMVGSPDY
Sbjct: 241 RNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDY 300

Query: 301 YGTCRKSCNAC 312
           YGTCRKSCNAC
Sbjct: 301 YGTCRKSCNAC 311

BLAST of CsGy5G004710 vs. NCBI nr
Match: KGN50302.1 (hypothetical protein Csa_5G166460 [Cucumis sativus])

HSP 1 Score: 597.0 bits (1538), Expect = 3.6e-167
Identity = 288/289 (99.65%), Positives = 289/289 (100.00%), Query Frame = 0

Query: 23  QSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIS 82
           +SNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIS
Sbjct: 6   KSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIS 65

Query: 83  LASNSEDNPSRNSAGSGITVSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPF 142
           LASNSEDNPSRNSAGSGITVSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPF
Sbjct: 66  LASNSEDNPSRNSAGSGITVSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPF 125

Query: 143 QIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKFWSG 202
           QIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKFWSG
Sbjct: 126 QIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKFWSG 185

Query: 203 RRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKH 262
           RRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKH
Sbjct: 186 RRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKH 245

Query: 263 TIQSDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC 312
           TIQSDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC
Sbjct: 246 TIQSDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC 294

BLAST of CsGy5G004710 vs. NCBI nr
Match: XP_022159842.1 (probable prolyl 4-hydroxylase 12 [Momordica charantia])

HSP 1 Score: 516.2 bits (1328), Expect = 8.0e-143
Identity = 255/312 (81.73%), Positives = 277/312 (88.78%), Query Frame = 0

Query: 1   MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDR-PLSYSNYSGRIDPSRVVQV 60
           MDSRL  LLLLATA SF +CLAQSNLISGRKGLRD+L++  PLSYSN+SGRIDPSRVVQV
Sbjct: 1   MDSRLPVLLLLATAISFLSCLAQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQV 60

Query: 61  SWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSSGVILNTTDD 120
           SWRPRVFLYKGFLSDEECDHLISLA++SED PS NS  SG TV T++L SSG ILNTTDD
Sbjct: 61  SWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDD 120

Query: 121 IVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYL 180
           I+ARIENR+A+WT LPKD+SMP QI+QY GEEA+HKY +GNRSAML SSEPLMATVVLYL
Sbjct: 121 IIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKYVFGNRSAML-SSEPLMATVVLYL 180

Query: 181 SDSASGGEILFPESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSP 240
           SDSASGGE+ FPESKVKS+FWS RRKKNN LRPVKGNA+L FSVHLNASPDKSS H RSP
Sbjct: 181 SDSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSP 240

Query: 241 IRDGELWVATKFLYLGPPAGNKHTIQSDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPD 300
           I DGELW+ATKF YL P  GNKHT + D D C DEDKSCPQWAAIGECERNAVFM+GSPD
Sbjct: 241 ILDGELWIATKFFYLRPITGNKHTDEPDGD-CNDEDKSCPQWAAIGECERNAVFMIGSPD 300

Query: 301 YYGTCRKSCNAC 312
           YYGTCRKSCNAC
Sbjct: 301 YYGTCRKSCNAC 310

BLAST of CsGy5G004710 vs. NCBI nr
Match: XP_023549812.1 (probable prolyl 4-hydroxylase 12 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 511.5 bits (1316), Expect = 2.0e-141
Identity = 251/315 (79.68%), Positives = 281/315 (89.21%), Query Frame = 0

Query: 1   MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDR-PLSYSNYSGRIDPSRVVQV 60
           MDSRLNFLLL A AFSFS+CLAQSN +SGRKGLRD++V+   LSYSN+  RIDPSRVVQ+
Sbjct: 1   MDSRLNFLLLFAAAFSFSSCLAQSNSVSGRKGLRDQMVNSGHLSYSNHFERIDPSRVVQI 60

Query: 61  SWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSSGVILNTTDD 120
           SW+PRVFLYKGFLSDEECDHLI+LASNSED PSR++AGS  TVST+ L +SG +LNTTDD
Sbjct: 61  SWQPRVFLYKGFLSDEECDHLIALASNSEDKPSRSNAGSRNTVSTKFLGNSGAVLNTTDD 120

Query: 121 IVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAK-HKYFYGNRSAMLPSSEPLMATVVLY 180
           I+ARIENR+A+WT LPKDHSMPFQIMQY GEEA  HKYF+GNRSAM PSSEPLMATVVLY
Sbjct: 121 IIARIENRIAVWTFLPKDHSMPFQIMQYGGEEAAGHKYFFGNRSAM-PSSEPLMATVVLY 180

Query: 181 LSDSASGGEILFPESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRS 240
           LSDSASGGEILFP SKVK +FWS RRKKNNFLRPVKGNA+LFFSVHLNASPDKS YH R+
Sbjct: 181 LSDSASGGEILFPVSKVKRRFWSDRRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRT 240

Query: 241 PIRDGELWVATKFLYLGPPA-GNKHTIQSDV-DGCFDEDKSCPQWAAIGECERNAVFMVG 300
           PI DG+LWVATKF Y+ P A GN+H ++S V D C DED+SCP+WAAIGEC+RNAVFM+G
Sbjct: 241 PILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIG 300

Query: 301 SPDYYGTCRKSCNAC 312
           SPDYYGTCRKSCNAC
Sbjct: 301 SPDYYGTCRKSCNAC 314

BLAST of CsGy5G004710 vs. TAIR10
Match: AT4G25600.1 (Oxoglutarate/iron-dependent oxygenase)

HSP 1 Score: 240.7 bits (613), Expect = 1.2e-63
Identity = 137/310 (44.19%), Positives = 183/310 (59.03%), Query Frame = 0

Query: 7   FLLLLATAFSFSTCLAQSNLISGRKGLRDRLV-----DRPLSYSNYSGRIDPSRVVQVSW 66
           FL+L+ T  S S           RK LRD+ +     D   SY   S  +DP+RV+Q+SW
Sbjct: 8   FLILMITMSSSSPPFCSG---GSRKELRDKEITSKSDDTQASYVLGSKFVDPTRVLQLSW 67

Query: 67  RPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSSGVILNTTDDIV 126
            PRVFLY+GFLS+EECDHLISL   + +  S ++ G      T+L           D +V
Sbjct: 68  LPRVFLYRGFLSEEECDHLISLRKETTEVYSVDADG-----KTQL-----------DPVV 127

Query: 127 ARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSD 186
           A IE +++ WT LP ++    ++  Y  E++  K  Y          E L+ATVVLYLS+
Sbjct: 128 AGIEEKVSAWTFLPGENGGSIKVRSYTSEKSGKKLDYFGEEPSSVLHESLLATVVLYLSN 187

Query: 187 SASGGEILFPESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIR 246
           +  GGE+LFP S++K K  +   +  N LRPVKGNAILFF+  LNAS D  S H+R P+ 
Sbjct: 188 TTQGGELLFPNSEMKPK--NSCLEGGNILRPVKGNAILFFTRLLNASLDGKSTHLRCPVV 247

Query: 247 DGELWVATKFLYLGPPAGNKHTIQSDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDYY 306
            GEL VATK +Y       K     +   C DED++C +WA +GEC++N V+M+GSPDYY
Sbjct: 248 KGELLVATKLIY-----AKKQARIEESGECSDEDENCGRWAKLGECKKNPVYMIGSPDYY 291

Query: 307 GTCRKSCNAC 312
           GTCRKSCNAC
Sbjct: 308 GTCRKSCNAC 291

BLAST of CsGy5G004710 vs. TAIR10
Match: AT3G28490.1 (Oxoglutarate/iron-dependent oxygenase)

HSP 1 Score: 212.6 bits (540), Expect = 3.4e-55
Identity = 115/277 (41.52%), Positives = 164/277 (59.21%), Query Frame = 0

Query: 45  SNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNS-EDNPSRNSAGSGITVS 104
           S++S  +DP+R+ Q+SW PR FLYKGFLSDEECDHLI LA    E +       SG +  
Sbjct: 21  SSFSFSVDPTRITQLSWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESED 80

Query: 105 TELLNSSGVIL-NTTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRG---EEAKHKYFYG 164
           +E+  SSG+ L    DDIVA +E +LA WT LP+++    QI+ Y      +    YFY 
Sbjct: 81  SEVRTSSGMFLTKRQDDIVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYD 140

Query: 165 NRSAMLPSSEPLMATVVLYLSDSASGGEILFPESK-----VKSKFWSGRRKKNNFLRPVK 224
            ++  L      +ATV++YLS+   GGE +FP  K     +K   WS   K+   ++P K
Sbjct: 141 KKALELGGHR--IATVLMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCAKQGYAVKPRK 200

Query: 225 GNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIQSDVDGCFDE 284
           G+A+LFF++HLN + D +S H   P+ +GE W AT+++++    G K  +      C D+
Sbjct: 201 GDALLFFNLHLNGTTDPNSLHGSCPVIEGEKWSATRWIHV-RSFGKKKLV------CVDD 260

Query: 285 DKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC 312
            +SC +WA  GECE+N ++MVGS    G CRKSC AC
Sbjct: 261 HESCQEWADAGECEKNPMYMVGSETSLGFCRKSCKAC 288

BLAST of CsGy5G004710 vs. TAIR10
Match: AT3G28480.2 (Oxoglutarate/iron-dependent oxygenase)

HSP 1 Score: 206.8 bits (525), Expect = 1.9e-53
Identity = 130/332 (39.16%), Positives = 187/332 (56.33%), Query Frame = 0

Query: 1   MDSRLNFLLLLATAFSFSTCL---AQSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVV 60
           MDSR+   L  +  F F+  L   A +  ++     RD  V + +  S  S   DP+RV 
Sbjct: 1   MDSRI--FLAFSLCFLFTLPLISSAPNRFLTRSSNTRDGSVIK-MKTSASSFGFDPTRVT 60

Query: 61  QVSWRPRVFLYKGFLSDEECDHLISLA------SNSEDNPSRNSAGSGITVSTELLNSSG 120
           Q+SW PRVFLY+GFLSDEECDH I LA      S   DN S  S  S  +VS  +  SS 
Sbjct: 61  QLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEDSVSV-VRQSSS 120

Query: 121 VILN----TTDDIVARIENRLAIWTLLPKDHSMPFQIMQY-RGEEAKHKYFYGNRSAMLP 180
            I N      DDIV+ +E +LA WT LP+++    QI+ Y  G++ +  + Y +  A L 
Sbjct: 121 FIANMDSLEIDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLE 180

Query: 181 SSEPLMATVVLYLSDSASGGEILFP-----ESKVKSKFWSGRRKKNNFLRPVKGNAILFF 240
                +ATV++YLS+   GGE +FP      +++K   W+   K+   ++P KG+A+LFF
Sbjct: 181 LGGHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFF 240

Query: 241 SVHLNASPDKSSYHIRSPIRDGELWVATKFLYLG--PPAGNKHTIQSDVDGCFDEDKSCP 300
           ++H NA+ D +S H   P+ +GE W AT+++++     A NK +      GC DE+ SC 
Sbjct: 241 NLHPNATTDSNSLHGSCPVVEGEKWSATRWIHVKSFERAFNKQS------GCMDENVSCE 300

Query: 301 QWAAIGECERNAVFMVGSPDYYGTCRKSCNAC 312
           +WA  GEC++N  +MVGS   +G CRKSC AC
Sbjct: 301 KWAKAGECQKNPTYMVGSDKDHGYCRKSCKAC 322

BLAST of CsGy5G004710 vs. TAIR10
Match: AT3G06300.1 (P4H isoform 2)

HSP 1 Score: 187.2 bits (474), Expect = 1.5e-47
Identity = 103/277 (37.18%), Positives = 164/277 (59.21%), Query Frame = 0

Query: 45  SNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVST 104
           S+ S  I+PS+V QVS +PR F+Y+GFL+D ECDHLISLA  +    +     +G +  +
Sbjct: 27  SSPSSIINPSKVKQVSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVS 86

Query: 105 ELLNSSGVILNT-TDDIVARIENRLAIWTLLPKDHSMPFQIMQY-RGEEAKHKYFYGNRS 164
           ++  SSG  ++   D IV+ IE++L+ WT LPK++    Q+++Y  G++    + Y +  
Sbjct: 87  DVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDK 146

Query: 165 AMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSK--------FWSGRRKKNNFLRPVK 224
             +      +ATV+LYLS+   GGE +FP+++  S+          S   KK   ++P K
Sbjct: 147 VNIARGGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKK 206

Query: 225 GNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIQSDVDGCFDE 284
           GNA+LFF++  +A PD  S H   P+ +GE W ATK++++     +   I +    C D 
Sbjct: 207 GNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIHV----DSFDKILTHDGNCTDV 266

Query: 285 DKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC 312
           ++SC +WA +GEC +N  +MVG+P+  G CR+SC AC
Sbjct: 267 NESCERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 299

BLAST of CsGy5G004710 vs. TAIR10
Match: AT5G18900.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 183.7 bits (465), Expect = 1.7e-46
Identity = 100/283 (35.34%), Positives = 166/283 (58.66%), Query Frame = 0

Query: 45  SNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVST 104
           S+ S  ++PS+V QVS +PR F+Y+GFL++ ECDH++SLA  S    +     SG +  +
Sbjct: 26  SSSSVFVNPSKVKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFS 85

Query: 105 ELLNSSGVILNT-TDDIVARIENRLAIWTLLPKDHSMPFQIMQY-RGEEAKHKYFYGNRS 164
           E+  SSG  ++   D IV+ IE++++ WT LPK++    Q+++Y  G++    + Y +  
Sbjct: 86  EVRTSSGTFISKGKDPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDK 145

Query: 165 AMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSK--------FWSGRRKKNNFLRPVK 224
             +      MAT+++YLS+   GGE +FP++++ S+          S   K+   ++P K
Sbjct: 146 VNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENKEDLSDCAKRGIAVKPRK 205

Query: 225 GNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLG------PPAGNKHTIQSDV 284
           G+A+LFF++H +A PD  S H   P+ +GE W ATK++++        P+GN        
Sbjct: 206 GDALLFFNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIHVDSFDRIVTPSGN-------- 265

Query: 285 DGCFDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC 312
             C D ++SC +WA +GEC +N  +MVG+ +  G CR+SC AC
Sbjct: 266 --CTDMNESCERWAVLGECTKNPEYMVGTTELPGYCRRSCKAC 298

BLAST of CsGy5G004710 vs. Swiss-Prot
Match: sp|Q8GXT7|P4H12_ARATH (Probable prolyl 4-hydroxylase 12 OS=Arabidopsis thaliana OX=3702 GN=P4H12 PE=2 SV=1)

HSP 1 Score: 240.7 bits (613), Expect = 2.1e-62
Identity = 137/310 (44.19%), Positives = 183/310 (59.03%), Query Frame = 0

Query: 7   FLLLLATAFSFSTCLAQSNLISGRKGLRDRLV-----DRPLSYSNYSGRIDPSRVVQVSW 66
           FL+L+ T  S S           RK LRD+ +     D   SY   S  +DP+RV+Q+SW
Sbjct: 8   FLILMITMSSSSPPFCSG---GSRKELRDKEITSKSDDTQASYVLGSKFVDPTRVLQLSW 67

Query: 67  RPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSSGVILNTTDDIV 126
            PRVFLY+GFLS+EECDHLISL   + +  S ++ G      T+L           D +V
Sbjct: 68  LPRVFLYRGFLSEEECDHLISLRKETTEVYSVDADG-----KTQL-----------DPVV 127

Query: 127 ARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSD 186
           A IE +++ WT LP ++    ++  Y  E++  K  Y          E L+ATVVLYLS+
Sbjct: 128 AGIEEKVSAWTFLPGENGGSIKVRSYTSEKSGKKLDYFGEEPSSVLHESLLATVVLYLSN 187

Query: 187 SASGGEILFPESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIR 246
           +  GGE+LFP S++K K  +   +  N LRPVKGNAILFF+  LNAS D  S H+R P+ 
Sbjct: 188 TTQGGELLFPNSEMKPK--NSCLEGGNILRPVKGNAILFFTRLLNASLDGKSTHLRCPVV 247

Query: 247 DGELWVATKFLYLGPPAGNKHTIQSDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDYY 306
            GEL VATK +Y       K     +   C DED++C +WA +GEC++N V+M+GSPDYY
Sbjct: 248 KGELLVATKLIY-----AKKQARIEESGECSDEDENCGRWAKLGECKKNPVYMIGSPDYY 291

Query: 307 GTCRKSCNAC 312
           GTCRKSCNAC
Sbjct: 308 GTCRKSCNAC 291

BLAST of CsGy5G004710 vs. Swiss-Prot
Match: sp|Q8L970|P4H7_ARATH (Probable prolyl 4-hydroxylase 7 OS=Arabidopsis thaliana OX=3702 GN=P4H7 PE=2 SV=1)

HSP 1 Score: 214.9 bits (546), Expect = 1.2e-54
Identity = 126/323 (39.01%), Positives = 187/323 (57.89%), Query Frame = 0

Query: 1   MDSRLNFLLLLATAFSFSTCL---AQSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVV 60
           MDSR+   L  +  F F+  L   A +  ++     RD  V + +  S  S   DP+RV 
Sbjct: 1   MDSRI--FLAFSLCFLFTLPLISSAPNRFLTRSSNTRDGSVIK-MKTSASSFGFDPTRVT 60

Query: 61  QVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSSGVILN-T 120
           Q+SW PRVFLY+GFLSDEECDH I LA    +        SG +V +E+  SSG+ L+  
Sbjct: 61  QLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGMFLSKR 120

Query: 121 TDDIVARIENRLAIWTLLPKDHSMPFQIMQY-RGEEAKHKYFYGNRSAMLPSSEPLMATV 180
            DDIV+ +E +LA WT LP+++    QI+ Y  G++ +  + Y +  A L      +ATV
Sbjct: 121 QDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRIATV 180

Query: 181 VLYLSDSASGGEILFP-----ESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPD 240
           ++YLS+   GGE +FP      +++K   W+   K+   ++P KG+A+LFF++H NA+ D
Sbjct: 181 LMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNATTD 240

Query: 241 KSSYHIRSPIRDGELWVATKFLYLG--PPAGNKHTIQSDVDGCFDEDKSCPQWAAIGECE 300
            +S H   P+ +GE W AT+++++     A NK +      GC DE+ SC +WA  GEC+
Sbjct: 241 SNSLHGSCPVVEGEKWSATRWIHVKSFERAFNKQS------GCMDENVSCEKWAKAGECQ 300

Query: 301 RNAVFMVGSPDYYGTCRKSCNAC 312
           +N  +MVGS   +G CRKSC AC
Sbjct: 301 KNPTYMVGSDKDHGYCRKSCKAC 314

BLAST of CsGy5G004710 vs. Swiss-Prot
Match: sp|F4J0A8|P4H6_ARATH (Probable prolyl 4-hydroxylase 6 OS=Arabidopsis thaliana OX=3702 GN=P4H6 PE=2 SV=1)

HSP 1 Score: 212.6 bits (540), Expect = 6.2e-54
Identity = 115/277 (41.52%), Positives = 164/277 (59.21%), Query Frame = 0

Query: 45  SNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNS-EDNPSRNSAGSGITVS 104
           S++S  +DP+R+ Q+SW PR FLYKGFLSDEECDHLI LA    E +       SG +  
Sbjct: 21  SSFSFSVDPTRITQLSWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESED 80

Query: 105 TELLNSSGVIL-NTTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRG---EEAKHKYFYG 164
           +E+  SSG+ L    DDIVA +E +LA WT LP+++    QI+ Y      +    YFY 
Sbjct: 81  SEVRTSSGMFLTKRQDDIVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYD 140

Query: 165 NRSAMLPSSEPLMATVVLYLSDSASGGEILFPESK-----VKSKFWSGRRKKNNFLRPVK 224
            ++  L      +ATV++YLS+   GGE +FP  K     +K   WS   K+   ++P K
Sbjct: 141 KKALELGGHR--IATVLMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCAKQGYAVKPRK 200

Query: 225 GNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIQSDVDGCFDE 284
           G+A+LFF++HLN + D +S H   P+ +GE W AT+++++    G K  +      C D+
Sbjct: 201 GDALLFFNLHLNGTTDPNSLHGSCPVIEGEKWSATRWIHV-RSFGKKKLV------CVDD 260

Query: 285 DKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC 312
            +SC +WA  GECE+N ++MVGS    G CRKSC AC
Sbjct: 261 HESCQEWADAGECEKNPMYMVGSETSLGFCRKSCKAC 288

BLAST of CsGy5G004710 vs. Swiss-Prot
Match: sp|F4JAU3|P4H2_ARATH (Prolyl 4-hydroxylase 2 OS=Arabidopsis thaliana OX=3702 GN=P4H2 PE=1 SV=1)

HSP 1 Score: 187.2 bits (474), Expect = 2.8e-46
Identity = 103/277 (37.18%), Positives = 164/277 (59.21%), Query Frame = 0

Query: 45  SNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVST 104
           S+ S  I+PS+V QVS +PR F+Y+GFL+D ECDHLISLA  +    +     +G +  +
Sbjct: 27  SSPSSIINPSKVKQVSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVS 86

Query: 105 ELLNSSGVILNT-TDDIVARIENRLAIWTLLPKDHSMPFQIMQY-RGEEAKHKYFYGNRS 164
           ++  SSG  ++   D IV+ IE++L+ WT LPK++    Q+++Y  G++    + Y +  
Sbjct: 87  DVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDK 146

Query: 165 AMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSK--------FWSGRRKKNNFLRPVK 224
             +      +ATV+LYLS+   GGE +FP+++  S+          S   KK   ++P K
Sbjct: 147 VNIARGGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKK 206

Query: 225 GNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIQSDVDGCFDE 284
           GNA+LFF++  +A PD  S H   P+ +GE W ATK++++     +   I +    C D 
Sbjct: 207 GNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIHV----DSFDKILTHDGNCTDV 266

Query: 285 DKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC 312
           ++SC +WA +GEC +N  +MVG+P+  G CR+SC AC
Sbjct: 267 NESCERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 299

BLAST of CsGy5G004710 vs. Swiss-Prot
Match: sp|Q8LAN3|P4H4_ARATH (Probable prolyl 4-hydroxylase 4 OS=Arabidopsis thaliana OX=3702 GN=P4H4 PE=2 SV=1)

HSP 1 Score: 183.7 bits (465), Expect = 3.1e-45
Identity = 100/283 (35.34%), Positives = 166/283 (58.66%), Query Frame = 0

Query: 45  SNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVST 104
           S+ S  ++PS+V QVS +PR F+Y+GFL++ ECDH++SLA  S    +     SG +  +
Sbjct: 26  SSSSVFVNPSKVKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFS 85

Query: 105 ELLNSSGVILNT-TDDIVARIENRLAIWTLLPKDHSMPFQIMQY-RGEEAKHKYFYGNRS 164
           E+  SSG  ++   D IV+ IE++++ WT LPK++    Q+++Y  G++    + Y +  
Sbjct: 86  EVRTSSGTFISKGKDPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDK 145

Query: 165 AMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSK--------FWSGRRKKNNFLRPVK 224
             +      MAT+++YLS+   GGE +FP++++ S+          S   K+   ++P K
Sbjct: 146 VNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENKEDLSDCAKRGIAVKPRK 205

Query: 225 GNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLG------PPAGNKHTIQSDV 284
           G+A+LFF++H +A PD  S H   P+ +GE W ATK++++        P+GN        
Sbjct: 206 GDALLFFNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIHVDSFDRIVTPSGN-------- 265

Query: 285 DGCFDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC 312
             C D ++SC +WA +GEC +N  +MVG+ +  G CR+SC AC
Sbjct: 266 --CTDMNESCERWAVLGECTKNPEYMVGTTELPGYCRRSCKAC 298

BLAST of CsGy5G004710 vs. TrEMBL
Match: tr|A0A1S3AT39|A0A1S3AT39_CUCME (probable prolyl 4-hydroxylase 12 OS=Cucumis melo OX=3656 GN=LOC103482556 PE=4 SV=1)

HSP 1 Score: 597.0 bits (1538), Expect = 2.4e-167
Identity = 289/311 (92.93%), Positives = 297/311 (95.50%), Query Frame = 0

Query: 1   MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVS 60
           MDSRLNFLLL ATAFSFSTCLAQSNLISGRKGLRD+LVDRPLSYSN S RIDPSRVVQVS
Sbjct: 1   MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVS 60

Query: 61  WRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSSGVILNTTDDI 120
           WRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSG TVSTELLN SGVILNTTDDI
Sbjct: 61  WRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGNTVSTELLNGSGVILNTTDDI 120

Query: 121 VARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLS 180
           +ARIENR+A+WTLLPKDH MPFQIMQYRGEEAKHKYFYGNRSAM  SSEPLMATVVLYLS
Sbjct: 121 IARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLS 180

Query: 181 DSASGGEILFPESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPI 240
           DSASGGE+LFPESKVKSKFWSGRRKK NFLRPVKGNAILFFSVHLNASPDKSSYHIR PI
Sbjct: 181 DSASGGEMLFPESKVKSKFWSGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPI 240

Query: 241 RDGELWVATKFLYLGPPAGNKHTIQSDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDY 300
           R+GELWVATKFLYL PP GNKHTI S++DGC DEDKSCPQWAAIGECERNAVFMVGSPDY
Sbjct: 241 RNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDY 300

Query: 301 YGTCRKSCNAC 312
           YGTCRKSCNAC
Sbjct: 301 YGTCRKSCNAC 311

BLAST of CsGy5G004710 vs. TrEMBL
Match: tr|A0A0A0KPE4|A0A0A0KPE4_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G166460 PE=4 SV=1)

HSP 1 Score: 597.0 bits (1538), Expect = 2.4e-167
Identity = 288/289 (99.65%), Positives = 289/289 (100.00%), Query Frame = 0

Query: 23  QSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIS 82
           +SNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIS
Sbjct: 6   KSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIS 65

Query: 83  LASNSEDNPSRNSAGSGITVSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPF 142
           LASNSEDNPSRNSAGSGITVSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPF
Sbjct: 66  LASNSEDNPSRNSAGSGITVSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPF 125

Query: 143 QIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKFWSG 202
           QIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKFWSG
Sbjct: 126 QIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKFWSG 185

Query: 203 RRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKH 262
           RRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKH
Sbjct: 186 RRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKH 245

Query: 263 TIQSDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC 312
           TIQSDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC
Sbjct: 246 TIQSDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC 294

BLAST of CsGy5G004710 vs. TrEMBL
Match: tr|A0A2P4NAB4|A0A2P4NAB4_QUESU (Putative prolyl 4-hydroxylase 12 OS=Quercus suber OX=58331 GN=CFP56_60861 PE=4 SV=1)

HSP 1 Score: 354.0 bits (907), Expect = 3.5e-94
Identity = 179/315 (56.83%), Positives = 224/315 (71.11%), Query Frame = 0

Query: 1   MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDR----PLSYSNYSGRIDPSRV 60
           M S L+ LLLLA   SF + LA+S     RK LRD+   +     L +S +S +IDPSRV
Sbjct: 1   MASLLSILLLLAFTSSFQSLLAES-----RKELRDKEASQETFIQLGHSVHSNKIDPSRV 60

Query: 61  VQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSSGVILNT 120
           VQ+SW+PRVFLYKGFLSDEECDHLI+LA   ++N   N   SG   +  LL SS + L+ 
Sbjct: 61  VQLSWQPRVFLYKGFLSDEECDHLITLAHGMKENGLGNDDNSGHVGTDRLLRSSEIPLDI 120

Query: 121 TDDIVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVV 180
            DD+V+RIE R++ WT LPK++S P QIM Y  EE   KY Y    + L  ++PLMA VV
Sbjct: 121 EDDVVSRIEERISAWTFLPKENSRPLQIMHYGLEEVDKKYNYLGNKSTLELTKPLMAIVV 180

Query: 181 LYLSDSASGGEILFPESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHI 240
           LYLS+   GGEI FP+S+VKSK WSG  K +N LRP+KGNAILFF+VH NASPD SS H 
Sbjct: 181 LYLSNITQGGEIHFPDSEVKSKIWSGCTKSSNILRPIKGNAILFFTVHPNASPDNSSSHA 240

Query: 241 RSPIRDGELWVATKFLYLGPPAGNKHTIQSDVDGCFDEDKSCPQWAAIGECERNAVFMVG 300
           R PI +GE+W ATKF ++G  +G K  +++D   C DE++SCP+WAAIGEC+RN V+M+G
Sbjct: 241 RCPIVEGEMWHATKFFHVGSISGEKLPLKTDGTDCTDEEESCPKWAAIGECQRNPVYMIG 300

Query: 301 SPDYYGTCRKSCNAC 312
           SPDYYGTCRKSCNAC
Sbjct: 301 SPDYYGTCRKSCNAC 310

BLAST of CsGy5G004710 vs. TrEMBL
Match: tr|A0A2R6PMB8|A0A2R6PMB8_ACTCH (Prolyl 4-hydroxylase 12 OS=Actinidia chinensis var. chinensis OX=1590841 GN=CEY00_Acc28070 PE=4 SV=1)

HSP 1 Score: 328.9 bits (842), Expect = 1.2e-86
Identity = 167/313 (53.35%), Positives = 215/313 (68.69%), Query Frame = 0

Query: 8   LLLLATAFSFSTCLAQSNLISGRKGLRDRLVDR----PLSYSNYSGRIDPSRVVQVSWRP 67
           ++LLA AF FS   A+    SGRK LR + V++     L  S +S RIDPSRV+++SWRP
Sbjct: 7   IILLAFAFCFSNLYAE----SGRKELRSKGVNQGDVIQLGRSIHSNRIDPSRVIELSWRP 66

Query: 68  RVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSSGVILNTTDDIVAR 127
           RVFLY+GFLS+EECD LIS      +N   N   SG   S +L  S+   L+  DDIVA+
Sbjct: 67  RVFLYRGFLSEEECDQLISWTKRKNENSMENGGVSGNVNSRKLSGSTEASLSMDDDIVAK 126

Query: 128 IENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSA 187
           +E R++ WT LPK +  PFQ++    E+AK K+ Y    +ML  +EPLMATVVLYLS+ +
Sbjct: 127 VEERISAWTFLPKGNGNPFQVLHSGLEDAKEKFDYFGNKSMLELNEPLMATVVLYLSNVS 186

Query: 188 SGGEILFPESKV-----KSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRS 247
            GG+ILFPES++     K+K WS  RK +N LRP KGNAILFF+ HLNA+PD+SS H R 
Sbjct: 187 QGGQILFPESELENSHTKNKIWSDCRKSSNVLRPTKGNAILFFNSHLNATPDRSSPHARC 246

Query: 248 PIRDGELWVATKFLYLGPPAGNKHTIQSDVDGCFDEDKSCPQWAAIGECERNAVFMVGSP 307
           P+ +GE+W ATKF  +      K + Q + D C DED++C QWAAIGEC RN+VFM+GSP
Sbjct: 247 PVLEGEMWFATKFFSIRTITKEKVSSQLEDDDCTDEDENCSQWAAIGECRRNSVFMIGSP 306

Query: 308 DYYGTCRKSCNAC 312
           DYYGTCRKSC+AC
Sbjct: 307 DYYGTCRKSCSAC 315

BLAST of CsGy5G004710 vs. TrEMBL
Match: tr|A0A2I4HT83|A0A2I4HT83_9ROSI (probable prolyl 4-hydroxylase 12 isoform X2 OS=Juglans regia OX=51240 GN=LOC109021218 PE=4 SV=1)

HSP 1 Score: 323.2 bits (827), Expect = 6.6e-85
Identity = 168/312 (53.85%), Positives = 218/312 (69.87%), Query Frame = 0

Query: 6   NFLLLLATAFSFSTCLAQSN--LISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRP 65
           + LLLL  A SF  C  +S+    SG++  ++ ++   L +S  S RIDPSRVVQ+SW+P
Sbjct: 6   SILLLLVFASSFLICFTESSRKKFSGKQSNQETVI--KLGHSVDSNRIDPSRVVQLSWQP 65

Query: 66  RVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSSGVILNTTDDIVAR 125
           RVFLYKGFLS EECDHLISL    +     N+  S   V+  LL SS + LN  DD+V+R
Sbjct: 66  RVFLYKGFLSVEECDHLISLVHGRKKEDLGNNGNSEHVVTNRLLMSSKMHLNIEDDVVSR 125

Query: 126 IENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKY-FYGNRSAMLPSSEPLMATVVLYLSDS 185
           IE+R++ WT LPK++S P Q+M Y  E+    Y F+GNR  +L  +EPLMA +VLYLS+ 
Sbjct: 126 IEDRISAWTFLPKENSRPLQVMHYGLEKVDRNYNFFGNRD-LLGLTEPLMAIIVLYLSNV 185

Query: 186 ASGGEILFPESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRD 245
             GGEILFPESK+K+  WS     ++  RP+KGNAILFF++H NASPDKSS H R P+ +
Sbjct: 186 TQGGEILFPESKLKNTIWSD-CTGSSIPRPIKGNAILFFTLHPNASPDKSSSHARCPVLE 245

Query: 246 GELWVATKFLYLGPPAGNKHTIQSDVD---GCFDEDKSCPQWAAIGECERNAVFMVGSPD 305
           GE+W ATKF ++   +G K + +SD     GC DE ++CP+WAAIGEC+RN VFM+GSPD
Sbjct: 246 GEMWHATKFFHIRSISGEKVSPESDGSDDTGCIDEAENCPRWAAIGECQRNPVFMIGSPD 305

Query: 306 YYGTCRKSCNAC 312
           YYGTCRKSCNAC
Sbjct: 306 YYGTCRKSCNAC 313
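
The Identity and Positives percentages in the HSP summaries above are the reported match counts divided by the alignment length. The following is a minimal sketch that rechecks that arithmetic for the two TrEMBL hits; the function name `percent` and the dictionary `hsps` are illustrative assumptions, not part of any BLAST tool, and the counts are copied from the summaries rather than recomputed from the alignments.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)


# (identical positions, positive positions, alignment length) as reported
# in the HSP summary lines; the keys are the TrEMBL entry names.
hsps = {
    "A0A2R6PMB8_ACTCH": (167, 215, 313),
    "A0A2I4HT83_9ROSI": (168, 218, 312),
}

for name, (identical, positive, length) in hsps.items():
    print(name, percent(identical, length), percent(positive, length))
```

The same check applies to any HSP summary line that reports counts as a fraction of the alignment length.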

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_004152378.1  1.4e-179  100.00    PREDICTED: probable prolyl 4-hydroxylase 12 [Cucumis sativus]
XP_008436994.1  3.6e-167  92.93     PREDICTED: probable prolyl 4-hydroxylase 12 [Cucumis melo]
KGN50302.1      3.6e-167  99.65     hypothetical protein Csa_5G166460 [Cucumis sativus]
XP_022159842.1  8.0e-143  81.73     probable prolyl 4-hydroxylase 12 [Momordica charantia]
XP_023549812.1  2.0e-141  79.68     probable prolyl 4-hydroxylase 12 [Cucurbita pepo subsp. pepo]
Match Name   E-value  Identity  Description
AT4G25600.1  1.2e-63  44.19     Oxoglutarate/iron-dependent oxygenase
AT3G28490.1  3.4e-55  41.52     Oxoglutarate/iron-dependent oxygenase
AT3G28480.2  1.9e-53  39.16     Oxoglutarate/iron-dependent oxygenase
AT3G06300.1  1.5e-47  37.18     P4H isoform 2
AT5G18900.1  1.7e-46  35.34     2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
Match Name             E-value  Identity  Description
sp|Q8GXT7|P4H12_ARATH  2.1e-62  44.19     Probable prolyl 4-hydroxylase 12 OS=Arabidopsis thaliana OX=3702 GN=P4H12 PE=2 S...
sp|Q8L970|P4H7_ARATH   1.2e-54  39.01     Probable prolyl 4-hydroxylase 7 OS=Arabidopsis thaliana OX=3702 GN=P4H7 PE=2 SV=...
sp|F4J0A8|P4H6_ARATH   6.2e-54  41.52     Probable prolyl 4-hydroxylase 6 OS=Arabidopsis thaliana OX=3702 GN=P4H6 PE=2 SV=...
sp|F4JAU3|P4H2_ARATH   2.8e-46  37.18     Prolyl 4-hydroxylase 2 OS=Arabidopsis thaliana OX=3702 GN=P4H2 PE=1 SV=1
sp|Q8LAN3|P4H4_ARATH   3.1e-45  35.34     Probable prolyl 4-hydroxylase 4 OS=Arabidopsis thaliana OX=3702 GN=P4H4 PE=2 SV=...
Match Name                      E-value   Identity  Description
tr|A0A1S3AT39|A0A1S3AT39_CUCME  2.4e-167  92.93     probable prolyl 4-hydroxylase 12 OS=Cucumis melo OX=3656 GN=LOC103482556 PE=4 SV...
tr|A0A0A0KPE4|A0A0A0KPE4_CUCSA  2.4e-167  99.65     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G166460 PE=4 SV=1
tr|A0A2P4NAB4|A0A2P4NAB4_QUESU  3.5e-94   56.83     Putative prolyl 4-hydroxylase 12 OS=Quercus suber OX=58331 GN=CFP56_60861 PE=4 S...
tr|A0A2R6PMB8|A0A2R6PMB8_ACTCH  1.2e-86   53.35     Prolyl 4-hydroxylase 12 OS=Actinidia chinensis var. chinensis OX=1590841 GN=CEY0...
tr|A0A2I4HT83|A0A2I4HT83_9ROSI  6.6e-85   53.85     probable prolyl 4-hydroxylase 12 isoform X2 OS=Juglans regia OX=51240 GN=LOC1090...
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0031418  L-ascorbic acid binding
GO:0016705  oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:0005506  iron ion binding
Vocabulary: Biological Process
Term        Definition
GO:0055114  oxidation-reduction process
Vocabulary: INTERPRO
Term       Definition
IPR006620  Pro_4_hyd_alph
IPR003582  ShKT_dom
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0055114      oxidation-reduction process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0005506      iron ion binding
molecular_function  GO:0031418      L-ascorbic acid binding
molecular_function  GO:0016705      oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CsGy5G004710.1  CsGy5G004710.1  mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR Term   IPR Description                      Source   Source Term         Source Description                  Alignment
IPR003582  ShKT domain                          SMART    SM00254             ShkT_1                              coord: 270..311; e-value: 1.3E-4; score: 31.3
IPR003582  ShKT domain                          PROSITE  PS51670             SHKT                                coord: 271..311; score: 8.471
IPR006620  Prolyl 4-hydroxylase, alpha subunit  SMART    SM00702             p4hc                                coord: 63..253; e-value: 9.5E-14; score: 61.7
None       No IPR available                     GENE3D   G3DSA:2.60.120.620                                      coord: 55..253; e-value: 3.5E-38; score: 133.5
None       No IPR available                     PANTHER  PTHR10869           PROLYL 4-HYDROXYLASE ALPHA SUBUNIT  coord: 8..311
None       No IPR available                     PANTHER  PTHR10869:SF102     PROLYL 4-HYDROXYLASE 12-RELATED     coord: 8..311