Lsi09G004730 (gene) Bottle gourd (USVL1VR-Ls) v1

Overview
Name: Lsi09G004730
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls) v1)
Description: Procollagen-proline 4-dioxygenase
Location: chr09: 4985735 .. 4988667 (+)
RNA-Seq Expression: Lsi09G004730
Synteny: Lsi09G004730
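The length of the gene span implied by the Location field can be checked with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention; the record itself does not state it). `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a string like 'chr09: 4985735 .. 4988667 (+)'.

    Returns (seqid, start, end, strand). Hypothetical helper for illustration.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both endpoints count.
    return end - start + 1

seqid, start, end, strand = parse_location("chr09: 4985735 .. 4988667 (+)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the span is 2,933 bp on the forward strand of chr09.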
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TTAATTTGTCGCAACCGCGAGGGACTGGAAGTTGAAGAAAGTGGCGGTGGGATCCAGATTCATAGAATTACAGAAGAGACCATGGATTTCTAAACTAGCTCCTTTCTGAAAATCTATATCCACATTCTCTAATTTTTCAACTTCATCTTGTTTCGATCTTCGTCCCAGCCATCCATGGATTCTCGTCTCCACTTTTTGCTTCTTTTAGCGACTGCATTTTCATTCTCAACCTGCCTTGCACAAAGGTGATGACCGAACCCCATTTTACATCTCATTTCGAAATCTTAGGTTTTACGTATTCCCTTCTTCGTATCTTATGTTTGCTGCCTAATTTGGATTCTCATCAATTTCGTCTACTGATGTTCAATCGGAATTGTGTTCGTAAAACTGATTTTCGTTTCACTTTGAAATGATTTTGTTGATGCTTGATTGAGTTTCATGACATGGGTTTTTCTTCTTTCCCTTGAATCTGTTTTCTTCATTCTTCCAGCAATTTGATTAGTGGCCGGAAGGGTTTAAGGGACCAATTGCTTGATAGACCTTTAAGCTACTCAAATCATTCAGGAAGAATCGACCCATCAAGAGTTGTCCAAGTCTCTTGGAGACCAAGGTCAGGTACTTGCAGAATCTCTCTTATGATTTGGAATTATCTCATCAACTTCAAAGCATATTGATTAACTTAGTAGAAAGTCTGATTTAACTTTTCTTGCTGTAGGGTTTTCTTGTATAAAGGCTTTCTCTCAGATGAGGAGTGTGATCATCTTATTTTTTTGGTATTTTTCTGCCCTGTTCTTGTTAGTATAATCAACTTGCTTCAGCTTTTTTGACATGCATATAACTTTTGGGGTTAGGCTTCAAGTTCAGAAGACAATCCATCTGGGAACAGTGCTGGTTCCAGGAACACTGTCTCAACCAACTTGCTAAGCAGTTCAGGAGTCATTTTAAACACAACAGTATGTTCGTGTGCTTTCTGAAGTTTATGCAATAGTTTGAATAGTACACTTTGTGGTCAAATTTGTATTTCATAGCACAAGATTTGTTTATATTGTTTTCTTGAAGTTTATGTTAAAGTAGTATATTTTTCAATCATATATTGTTTTCTGTTAGTTTCTACACGAGCAATTATACAGTCCATCAGTTGTAGAATTTTTTCTGTCATTATTGTTTTCTTTATTTCCAGTTTTGGTTTGTATTCAAAATACAACTACAAATGTACATGGATAAACAGTTTCATCCATGCTTGGCATTTGTTTGAGTTAAATTAGTCGTTCAATGTGTTTCCCATTGAAATATGTTCAGTCTAAAGTTTTCTGTTGATAGTTTTCACAGTGGTACTACCAAATATCCGTGTTGCATATTAGGATTTGGTACTTGGTTTACCTACTGTTATTGCTTTTAATCTATAATTATTAAAGGTGTGGGGTTGTAGTTCCAAAGATTCCTTTGGGTCACTTTTAGGATTGATGACACATTTCATATGCAGTCAACCTGTCCTAATACTCATAGAAGCATGCTTTGATGCTGTATTTTCTGGGGACTCGGTAGTAGTTTTAAAGGAATCTTAAACAACTCTAAACTCTAACTCTAAATGTCCTGTGAAGTCAGACGTTTGATTTCTGCATTTGTGGATAAAAACACAAAAAGTGTGGAGGATGGTAGTATAACATAATCTATGATGATTTATGCAAGATTCATGTTACTGAAAAAATTTGTTTTTTATGATTCGATTTTGAGTTGTTGGCATCAATTGTCTTTCAAAATTTATAAAGAACCATGGCTCAAGGGAGTGATTGTCCGACCCAAGAATTTAGTATTGTGCTTCCTTTTGGTAATACCATCTATTTCTGTTCATGGTATTTATGTATTGTGGAGCTTGTCAATTACTTCTTAAGTAACTAGGTCTTCAATTTCTTGTTTGTGCTTGTTTCTTAGGATGATATAATTGCAAGAATTGAAACTCGAATTGCACTGTGGACTCTTCTCCCAAAAGGTATTTCTCA
TCGGTGCTGTACTTGCAACTTTGCCTTTTTCATTTTCCTGATAATGCTCCTTTTATTTGATTTTCTTTTCTTCTAAACATATTAGTTTTATTGTCTTCAGATCATAGCATGCCTTTTCAGATCATGCAATACAGGGGTGAAGAAGCAGAGCATAAGTACTTTTATGGCAACAGATCTGCAATGTCGTCCAGTGAGCCTTTGATGGCCACAGTAGTTTTGTATCTCTCGGATTCTGCTCGCGGTGGCGAGATGCTCTTTCCAGAATCAAAGGTGAGCGGAAGCACTCAAAGATCTGTGGCTATAACAATGTTCTGATGACTGCCTTTTTCTCAATGCCTCAGGTAAAGAGCAAATTTTGGTCAAGCCGGAGAAAGAAAAACAACTTTCTGACACCAGTGAAAGGCAATGCAATTCTTTTTTTCTCTGTGCATCTTAATGCTTCTCCAGACAAGAGTAGCTACCACACCCGATCCCCAATACTCAATGGGGAATTGTGGGTTGCTACAAAATTCTTCTACTTAAGACCAACCACGGGGAATAAACACACAGTTGAATCCGATGAAGACGGGTGCATTGATGAAGATAAAAGCTGTCCTCAATGGGCTGCCATTGGTGAATGCGAACGAAATGCTGTGTTCATGATCGGTTCTCCAGATTATTATGGTACATGTAGAAAAAGTTGTAATGCATGTTGAAGCATAACTAAATTCATGTAAAAATTATTCTCGTCGTCCTGATTTGAGTAAGTATTTGTTTATTTTTTCTGATTTCAAACAAGTTTGGATATTGAATTCTATTGGCATATCTCTGGGTTAGAAGTTACTTCTTGCACCTTTAGAACCATATAGGATAGGAATTCTGCTAACACTGTAAGCCATTGTATTAATGGGATAATTACATTTTAGTATTTAGGTTTGAGATATGTTTCTATTT

mRNA sequence

TTAATTTGTCGCAACCGCGAGGGACTGGAAGTTGAAGAAAGTGGCGGTGGGATCCAGATTCATAGAATTACAGAAGAGACCATGGATTTCTAAACTAGCTCCTTTCTGAAAATCTATATCCACATTCTCTAATTTTTCAACTTCATCTTGTTTCGATCTTCGTCCCAGCCATCCATGGATTCTCGTCTCCACTTTTTGCTTCTTTTAGCGACTGCATTTTCATTCTCAACCTGCCTTGCACAAAGCAATTTGATTAGTGGCCGGAAGGGTTTAAGGGACCAATTGCTTGATAGACCTTTAAGCTACTCAAATCATTCAGGAAGAATCGACCCATCAAGAGTTGTCCAAGTCTCTTGGAGACCAAGGGTTTTCTTGTATAAAGGCTTTCTCTCAGATGAGGAGTGTGATCATCTTATTTTTTTGGCTTCAAGTTCAGAAGACAATCCATCTGGGAACAGTGCTGGTTCCAGGAACACTGTCTCAACCAACTTGCTAAGCAGTTCAGGAGTCATTTTAAACACAACAGATGATATAATTGCAAGAATTGAAACTCGAATTGCACTGTGGACTCTTCTCCCAAAAGATCATAGCATGCCTTTTCAGATCATGCAATACAGGGGTGAAGAAGCAGAGCATAAGTACTTTTATGGCAACAGATCTGCAATGTCGTCCAGTGAGCCTTTGATGGCCACAGTAGTTTTGTATCTCTCGGATTCTGCTCGCGGTGGCGAGATGCTCTTTCCAGAATCAAAGGTAAAGAGCAAATTTTGGTCAAGCCGGAGAAAGAAAAACAACTTTCTGACACCAGTGAAAGGCAATGCAATTCTTTTTTTCTCTGTGCATCTTAATGCTTCTCCAGACAAGAGTAGCTACCACACCCGATCCCCAATACTCAATGGGGAATTGTGGGTTGCTACAAAATTCTTCTACTTAAGACCAACCACGGGGAATAAACACACAGTTGAATCCGATGAAGACGGGTGCATTGATGAAGATAAAAGCTGTCCTCAATGGGCTGCCATTGGTGAATGCGAACGAAATGCTGTGTTCATGATCGGTTCTCCAGATTATTATGGTACATGTAGAAAAAGTTGTAATGCATGTTGAAGCATAACTAAATTCATGTAAAAATTATTCTCGTCGTCCTGATTTGAGTAAGTATTTGTTTATTTTTTCTGATTTCAAACAAGTTTGGATATTGAATTCTATTGGCATATCTCTGGGTTAGAAGTTACTTCTTGCACCTTTAGAACCATATAGGATAGGAATTCTGCTAACACTGTAAGCCATTGTATTAATGGGATAATTACATTTTAGTATTTAGGTTTGAGATATGTTTCTATTT

Coding sequence (CDS)

ATGGATTCTCGTCTCCACTTTTTGCTTCTTTTAGCGACTGCATTTTCATTCTCAACCTGCCTTGCACAAAGCAATTTGATTAGTGGCCGGAAGGGTTTAAGGGACCAATTGCTTGATAGACCTTTAAGCTACTCAAATCATTCAGGAAGAATCGACCCATCAAGAGTTGTCCAAGTCTCTTGGAGACCAAGGGTTTTCTTGTATAAAGGCTTTCTCTCAGATGAGGAGTGTGATCATCTTATTTTTTTGGCTTCAAGTTCAGAAGACAATCCATCTGGGAACAGTGCTGGTTCCAGGAACACTGTCTCAACCAACTTGCTAAGCAGTTCAGGAGTCATTTTAAACACAACAGATGATATAATTGCAAGAATTGAAACTCGAATTGCACTGTGGACTCTTCTCCCAAAAGATCATAGCATGCCTTTTCAGATCATGCAATACAGGGGTGAAGAAGCAGAGCATAAGTACTTTTATGGCAACAGATCTGCAATGTCGTCCAGTGAGCCTTTGATGGCCACAGTAGTTTTGTATCTCTCGGATTCTGCTCGCGGTGGCGAGATGCTCTTTCCAGAATCAAAGGTAAAGAGCAAATTTTGGTCAAGCCGGAGAAAGAAAAACAACTTTCTGACACCAGTGAAAGGCAATGCAATTCTTTTTTTCTCTGTGCATCTTAATGCTTCTCCAGACAAGAGTAGCTACCACACCCGATCCCCAATACTCAATGGGGAATTGTGGGTTGCTACAAAATTCTTCTACTTAAGACCAACCACGGGGAATAAACACACAGTTGAATCCGATGAAGACGGGTGCATTGATGAAGATAAAAGCTGTCCTCAATGGGCTGCCATTGGTGAATGCGAACGAAATGCTGTGTTCATGATCGGTTCTCCAGATTATTATGGTACATGTAGAAAAAGTTGTAATGCATGTTGA

Protein sequence

MDSRLHFLLLLATAFSFSTCLAQSNLISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRNTVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKYFYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWSSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC
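The relationship between the coding sequence and the protein sequence above can be reproduced by translating the CDS codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1) and uses only short excerpts of the CDS to stay compact; `translate` is a hypothetical helper written for this illustration, not the tool used to build the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (NCBI table 1).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and last 12 nt of the CDS listed above.
print(translate("ATGGATTCTCGTCTCCACTTTTTGCTTCTT"))  # start of the protein
print(translate("AATGCATGTTGA"))                    # end of the protein
```

The two excerpts translate to `MDSRLHFLLL` and `NAC`, matching the first ten and last three residues of the protein sequence. Passing the full CDS string to `translate` would give the complete polypeptide for comparison.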
Homology
BLAST of Lsi09G004730 vs. ExPASy Swiss-Prot
Match: Q8GXT7 (Probable prolyl 4-hydroxylase 12 OS=Arabidopsis thaliana OX=3702 GN=P4H12 PE=2 SV=1)

HSP 1 Score: 235.7 bits (600), Expect = 6.9e-61
Identity = 136/311 (43.73%), Positives = 184/311 (59.16%), Query Frame = 0

Query: 7   FLLLLATAFSFSTCLAQSNLISGRKGLRDQLL-----DRPLSYSNHSGRIDPSRVVQVSW 66
           FL+L+ T  S S           RK LRD+ +     D   SY   S  +DP+RV+Q+SW
Sbjct: 8   FLILMITMSSSSPPFCSG---GSRKELRDKEITSKSDDTQASYVLGSKFVDPTRVLQLSW 67

Query: 67  RPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRNTVSTNLLSSSGVILNTTDDII 126
            PRVFLY+GFLS+EECDHLI L   + +  S ++ G                    D ++
Sbjct: 68  LPRVFLYRGFLSEEECDHLISLRKETTEVYSVDADGK----------------TQLDPVV 127

Query: 127 ARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKY-FYGNRSAMSSSEPLMATVVLYLSD 186
           A IE +++ WT LP ++    ++  Y  E++  K  ++G   +    E L+ATVVLYLS+
Sbjct: 128 AGIEEKVSAWTFLPGENGGSIKVRSYTSEKSGKKLDYFGEEPSSVLHESLLATVVLYLSN 187

Query: 187 SARGGEMLFPESKVKSKFWSSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPIL 246
           + +GGE+LFP S++K K  +S  +  N L PVKGNAILFF+  LNAS D  S H R P++
Sbjct: 188 TTQGGELLFPNSEMKPK--NSCLEGGNILRPVKGNAILFFTRLLNASLDGKSTHLRCPVV 247

Query: 247 NGELWVATKFFYLRPTTGNKHTVESDEDG-CIDEDKSCPQWAAIGECERNAVFMIGSPDY 306
            GEL VATK  Y       K     +E G C DED++C +WA +GEC++N V+MIGSPDY
Sbjct: 248 KGELLVATKLIYA------KKQARIEESGECSDEDENCGRWAKLGECKKNPVYMIGSPDY 291

Query: 307 YGTCRKSCNAC 311
           YGTCRKSCNAC
Sbjct: 308 YGTCRKSCNAC 291
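The percentages in each HSP header are the identity and positive counts divided by the alignment length. The following sketch parses the two header lines of a hit and recomputes those percentages. The regular expressions are assumptions fitted to the header layout shown in this listing (they also tolerate the variant spelling `Postives`), and `parse_hsp_header` is a hypothetical helper, not part of BLAST or any parsing library.

```python
import re

# Patterns fitted to the header lines in this listing (an assumption,
# not a general BLAST report parser).
SCORE_RE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
FRACTION_RE = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+)")

def parse_hsp_header(score_line, stats_line):
    """Return bit score, raw score, E-value and recomputed percentages."""
    m = SCORE_RE.search(score_line)
    if m is None:
        raise ValueError(f"unrecognized score line: {score_line!r}")
    result = {
        "bits": float(m.group(1)),
        "raw_score": int(m.group(2)),
        "evalue": float(m.group(3)),
    }
    for label, num, den in FRACTION_RE.findall(stats_line):
        key = "identity_pct" if label == "Identity" else "positives_pct"
        # Percentage = matching positions / alignment length.
        result[key] = round(100 * int(num) / int(den), 2)
    return result

hsp = parse_hsp_header(
    "HSP 1 Score: 235.7 bits (600), Expect = 6.9e-61",
    "Identity = 136/311 (43.73%), Positives = 184/311 (59.16%), Query Frame = 0",
)
print(hsp)
```

For the Q8GXT7 hit this recomputes 43.73% identity and 59.16% positives, in agreement with the values printed in the header.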

BLAST of Lsi09G004730 vs. ExPASy Swiss-Prot
Match: F4J0A8 (Probable prolyl 4-hydroxylase 6 OS=Arabidopsis thaliana OX=3702 GN=P4H6 PE=2 SV=1)

HSP 1 Score: 210.3 bits (534), Expect = 3.1e-53
Identity = 112/276 (40.58%), Positives = 165/276 (59.78%), Query Frame = 0

Query: 45  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSS-EDNPSGNSAGSRNTVS 104
           S+ S  +DP+R+ Q+SW PR FLYKGFLSDEECDHLI LA    E +       S  +  
Sbjct: 21  SSFSFSVDPTRITQLSWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESED 80

Query: 105 TNLLSSSGVIL-NTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRG---EEAEHKYFYG 164
           + + +SSG+ L    DDI+A +E ++A WT LP+++    QI+ Y      +    YFY 
Sbjct: 81  SEVRTSSGMFLTKRQDDIVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFY- 140

Query: 165 NRSAMSSSEPLMATVVLYLSDSARGGEMLFPESK-----VKSKFWSSRRKKNNFLTPVKG 224
           ++ A+      +ATV++YLS+  +GGE +FP  K     +K   WS   K+   + P KG
Sbjct: 141 DKKALELGGHRIATVLMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCAKQGYAVKPRKG 200

Query: 225 NAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDED 284
           +A+LFF++HLN + D +S H   P++ GE W AT++ ++R + G K  V      C+D+ 
Sbjct: 201 DALLFFNLHLNGTTDPNSLHGSCPVIEGEKWSATRWIHVR-SFGKKKLV------CVDDH 260

Query: 285 KSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC 311
           +SC +WA  GECE+N ++M+GS    G CRKSC AC
Sbjct: 261 ESCQEWADAGECEKNPMYMVGSETSLGFCRKSCKAC 288

BLAST of Lsi09G004730 vs. ExPASy Swiss-Prot
Match: Q8L970 (Probable prolyl 4-hydroxylase 7 OS=Arabidopsis thaliana OX=3702 GN=P4H7 PE=2 SV=1)

HSP 1 Score: 207.6 bits (527), Expect = 2.0e-52
Identity = 114/319 (35.74%), Positives = 185/319 (57.99%), Query Frame = 0

Query: 1   MDSRLHFLLLLATAFSFSTCLAQSN-LISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQV 60
           MDSR+     L   F+     +  N  ++     RD  + + +  S  S   DP+RV Q+
Sbjct: 1   MDSRIFLAFSLCFLFTLPLISSAPNRFLTRSSNTRDGSVIK-MKTSASSFGFDPTRVTQL 60

Query: 61  SWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRNTVSTNLLSSSGVILN-TTD 120
           SW PRVFLY+GFLSDEECDH I LA    +        S  +V + + +SSG+ L+   D
Sbjct: 61  SWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGMFLSKRQD 120

Query: 121 DIIARIETRIALWTLLPKDHSMPFQIMQY-RGEEAE-HKYFYGNRSAMSSSEPLMATVVL 180
           DI++ +E ++A WT LP+++    QI+ Y  G++ E H  ++ +++ +      +ATV++
Sbjct: 121 DIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRIATVLM 180

Query: 181 YLSDSARGGEMLFP-----ESKVKSKFWSSRRKKNNFLTPVKGNAILFFSVHLNASPDKS 240
           YLS+  +GGE +FP      +++K   W+   K+   + P KG+A+LFF++H NA+ D +
Sbjct: 181 YLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNATTDSN 240

Query: 241 SYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAV 300
           S H   P++ GE W AT++ +++    +     + + GC+DE+ SC +WA  GEC++N  
Sbjct: 241 SLHGSCPVVEGEKWSATRWIHVK----SFERAFNKQSGCMDENVSCEKWAKAGECQKNPT 300

Query: 301 FMIGSPDYYGTCRKSCNAC 311
           +M+GS   +G CRKSC AC
Sbjct: 301 YMVGSDKDHGYCRKSCKAC 314

BLAST of Lsi09G004730 vs. ExPASy Swiss-Prot
Match: F4JAU3 (Prolyl 4-hydroxylase 2 OS=Arabidopsis thaliana OX=3702 GN=P4H2 PE=1 SV=1)

HSP 1 Score: 182.6 bits (462), Expect = 6.9e-45
Identity = 103/282 (36.52%), Positives = 165/282 (58.51%), Query Frame = 0

Query: 45  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLA------SSSEDNPSGNSAGS 104
           S+ S  I+PS+V QVS +PR F+Y+GFL+D ECDHLI LA      S+  DN +G S  S
Sbjct: 27  SSPSSIINPSKVKQVSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVS 86

Query: 105 RNTVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQY-RGEEAE-HKY 164
               S+    S G      D I++ IE +++ WT LPK++    Q+++Y  G++ + H  
Sbjct: 87  DVRTSSGTFISKG-----KDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFD 146

Query: 165 FYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWSSRR--------KKNNF 224
           ++ ++  ++     +ATV+LYLS+  +GGE +FP+++  S+   S          KK   
Sbjct: 147 YFHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIA 206

Query: 225 LTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDED 284
           + P KGNA+LFF++  +A PD  S H   P++ GE W ATK+ ++     +   + + + 
Sbjct: 207 VKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIHV----DSFDKILTHDG 266

Query: 285 GCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC 311
            C D ++SC +WA +GEC +N  +M+G+P+  G CR+SC AC
Sbjct: 267 NCTDVNESCERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 299

BLAST of Lsi09G004730 vs. ExPASy Swiss-Prot
Match: Q8LAN3 (Probable prolyl 4-hydroxylase 4 OS=Arabidopsis thaliana OX=3702 GN=P4H4 PE=2 SV=1)

HSP 1 Score: 179.9 bits (455), Expect = 4.5e-44
Identity = 99/283 (34.98%), Positives = 162/283 (57.24%), Query Frame = 0

Query: 45  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSS------EDNPSGNSAGS 104
           S+ S  ++PS+V QVS +PR F+Y+GFL++ ECDH++ LA +S       DN SG S  S
Sbjct: 26  SSSSVFVNPSKVKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFS 85

Query: 105 RNTVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQY---RGEEAEHK 164
               S+    S G      D I++ IE +I+ WT LPK++    Q+++Y   +  +A   
Sbjct: 86  EVRTSSGTFISKG-----KDPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFD 145

Query: 165 YFYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWSSRR--------KKNN 224
           YF+   + +      MAT+++YLS+  +GGE +FP++++ S+   S          K+  
Sbjct: 146 YFHDKVNIVRGGH-RMATILMYLSNVTKGGETVFPDAEIPSRRVLSENKEDLSDCAKRGI 205

Query: 225 FLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDE 284
            + P KG+A+LFF++H +A PD  S H   P++ GE W ATK+ ++     +   + +  
Sbjct: 206 AVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIHV----DSFDRIVTPS 265

Query: 285 DGCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC 311
             C D ++SC +WA +GEC +N  +M+G+ +  G CR+SC AC
Sbjct: 266 GNCTDMNESCERWAVLGECTKNPEYMVGTTELPGYCRRSCKAC 298

BLAST of Lsi09G004730 vs. ExPASy TrEMBL
Match: A0A1S3AT39 (Procollagen-proline 4-dioxygenase OS=Cucumis melo OX=3656 GN=LOC103482556 PE=3 SV=1)

HSP 1 Score: 568.2 bits (1463), Expect = 2.2e-158
Identity = 280/311 (90.03%), Positives = 290/311 (93.25%), Query Frame = 0

Query: 1   MDSRLHFLLLLATAFSFSTCLAQSNLISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQVS 60
           MDSRL+FLLL ATAFSFSTCLAQSNLISGRKGLRDQL+DRPLSYSN S RIDPSRVVQVS
Sbjct: 1   MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVS 60

Query: 61  WRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRNTVSTNLLSSSGVILNTTDDI 120
           WRPRVFLYKGFLSDEECDHLI LAS+SEDNPS NSAGS NTVST LL+ SGVILNTTDDI
Sbjct: 61  WRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGNTVSTELLNGSGVILNTTDDI 120

Query: 121 IARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKYFYGNRSAM-SSSEPLMATVVLYLS 180
           IARIE RIA+WTLLPKDH MPFQIMQYRGEEA+HKYFYGNRSAM SSSEPLMATVVLYLS
Sbjct: 121 IARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLS 180

Query: 181 DSARGGEMLFPESKVKSKFWSSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPI 240
           DSA GGEMLFPESKVKSKFWS RRKK NFL PVKGNAILFFSVHLNASPDKSSYH R PI
Sbjct: 181 DSASGGEMLFPESKVKSKFWSGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPI 240

Query: 241 LNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDY 300
            NGELWVATKF YLRP TGNKHT++S+ DGCIDEDKSCPQWAAIGECERNAVFM+GSPDY
Sbjct: 241 RNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDY 300

Query: 301 YGTCRKSCNAC 311
           YGTCRKSCNAC
Sbjct: 301 YGTCRKSCNAC 311

BLAST of Lsi09G004730 vs. ExPASy TrEMBL
Match: A0A5A7TKX1 (Procollagen-proline 4-dioxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold1167G00060 PE=3 SV=1)

HSP 1 Score: 565.5 bits (1456), Expect = 1.4e-157
Identity = 278/311 (89.39%), Positives = 290/311 (93.25%), Query Frame = 0

Query: 1   MDSRLHFLLLLATAFSFSTCLAQSNLISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQVS 60
           MDSRL+FLLL ATAFSFSTCLAQSNLISGRKGLRDQL+DRPLSYSN S RIDPSRVVQVS
Sbjct: 1   MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVS 60

Query: 61  WRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRNTVSTNLLSSSGVILNTTDDI 120
           WRPRVFLYKGFLSD+ECDHLI LAS+S+DNPS NSAGS NTVST LL+ SGVILNTTDDI
Sbjct: 61  WRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSGNTVSTELLNGSGVILNTTDDI 120

Query: 121 IARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKYFYGNRSAM-SSSEPLMATVVLYLS 180
           IARIE RIA+WTLLPKDH MPFQIMQYRGEEA+HKYFYGNRSAM SSSEPLMATVVLYLS
Sbjct: 121 IARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLS 180

Query: 181 DSARGGEMLFPESKVKSKFWSSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPI 240
           DSA GGEMLFPESKVKSKFWS RRKK NFL PVKGNAILFFSVHLNASPDKSSYH R PI
Sbjct: 181 DSASGGEMLFPESKVKSKFWSGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPI 240

Query: 241 LNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDY 300
            NGELWVATKF YLRP TGNKHT++S+ DGCIDEDKSCPQWAAIGECERNAVFM+GSPDY
Sbjct: 241 RNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDY 300

Query: 301 YGTCRKSCNAC 311
           YGTCRKSCNAC
Sbjct: 301 YGTCRKSCNAC 311

BLAST of Lsi09G004730 vs. ExPASy TrEMBL
Match: A0A6J1E0X9 (Procollagen-proline 4-dioxygenase OS=Momordica charantia OX=3673 GN=LOC111026141 PE=3 SV=1)

HSP 1 Score: 530.0 bits (1364), Expect = 6.5e-147
Identity = 264/311 (84.89%), Positives = 277/311 (89.07%), Query Frame = 0

Query: 1   MDSRLHFLLLLATAFSFSTCLAQSNLISGRKGLRDQLLDR-PLSYSNHSGRIDPSRVVQV 60
           MDSRL  LLLLATA SF +CLAQSNLISGRKGLRDQL++  PLSYSNHSGRIDPSRVVQV
Sbjct: 1   MDSRLPVLLLLATAISFLSCLAQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQV 60

Query: 61  SWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRNTVSTNLLSSSGVILNTTDD 120
           SWRPRVFLYKGFLSDEECDHLI LA+SSED PSGNS  S NTV T +L SSG ILNTTDD
Sbjct: 61  SWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDD 120

Query: 121 IIARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKYFYGNRSAMSSSEPLMATVVLYLS 180
           IIARIE RIA+WT LPKD+SMP QI+QY GEEAEHKY +GNRSAM SSEPLMATVVLYLS
Sbjct: 121 IIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKYVFGNRSAMLSSEPLMATVVLYLS 180

Query: 181 DSARGGEMLFPESKVKSKFWSSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPI 240
           DSA GGEM FPESKVKS+FWS RRKKNN L PVKGNA+L FSVHLNASPDKSS HTRSPI
Sbjct: 181 DSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPI 240

Query: 241 LNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDY 300
           L+GELW+ATKFFYLRP TGNKHT E D D C DEDKSCPQWAAIGECERNAVFMIGSPDY
Sbjct: 241 LDGELWIATKFFYLRPITGNKHTDEPDGD-CNDEDKSCPQWAAIGECERNAVFMIGSPDY 300

Query: 301 YGTCRKSCNAC 311
           YGTCRKSCNAC
Sbjct: 301 YGTCRKSCNAC 310

BLAST of Lsi09G004730 vs. ExPASy TrEMBL
Match: A0A0A0KPE4 (Procollagen-proline 4-dioxygenase OS=Cucumis sativus OX=3659 GN=Csa_5G166460 PE=3 SV=1)

HSP 1 Score: 526.9 bits (1356), Expect = 5.5e-146
Identity = 255/289 (88.24%), Positives = 270/289 (93.43%), Query Frame = 0

Query: 23  QSNLISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIF 82
           +SNLISGRKGLRD+L+DRPLSYSN+SGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLI 
Sbjct: 6   KSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIS 65

Query: 83  LASSSEDNPSGNSAGSRNTVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPF 142
           LAS+SEDNPS NSAGS  TVST LL+SSGVILNTTDDI+ARIE R+A+WTLLPKDHSMPF
Sbjct: 66  LASNSEDNPSRNSAGSGITVSTELLNSSGVILNTTDDIVARIENRLAIWTLLPKDHSMPF 125

Query: 143 QIMQYRGEEAEHKYFYGNRSAM-SSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWSS 202
           QIMQYRGEEA+HKYFYGNRSAM  SSEPLMATVVLYLSDSA GGE+LFPESKVKSKFWS 
Sbjct: 126 QIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLSDSASGGEILFPESKVKSKFWSG 185

Query: 203 RRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKH 262
           RRKKNNFL PVKGNAILFFSVHLNASPDKSSYH RSPI +GELWVATKF YL P  GNKH
Sbjct: 186 RRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKH 245

Query: 263 TVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC 311
           T++SD DGC DEDKSCPQWAAIGECERNAVFM+GSPDYYGTCRKSCNAC
Sbjct: 246 TIQSDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC 294

BLAST of Lsi09G004730 vs. ExPASy TrEMBL
Match: A0A6J1E2P0 (Procollagen-proline 4-dioxygenase OS=Cucurbita moschata OX=3662 GN=LOC111430280 PE=3 SV=1)

HSP 1 Score: 514.2 bits (1323), Expect = 3.7e-142
Identity = 255/314 (81.21%), Positives = 280/314 (89.17%), Query Frame = 0

Query: 1   MDSRLHFLLLLATAFSFSTCLAQSNLISGRKGLRDQLLDR-PLSYSNHSGRIDPSRVVQV 60
           MDSRL FLLLLA AFSFS+CLAQSN ISGRKGLRDQ+++   LSYSNHS RIDPSRVVQ+
Sbjct: 1   MDSRLTFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQI 60

Query: 61  SWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRNTVSTNLLSSSGVILNTTDD 120
           SW+PR FLYKGFLSDEECDHLI LAS+SED PS N+AGSRNTVST  L +SG ILNTTDD
Sbjct: 61  SWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSRNTVSTKFLGNSGAILNTTDD 120

Query: 121 IIARIETRIALWTLLPKDHSMPFQIMQYRGEEAE-HKYFYGNRSAMSSSEPLMATVVLYL 180
           II RIE RIA+WT LPKDHSMPFQIM+Y GEEA  HKYF+GNRSAM SSEPLMATVVLYL
Sbjct: 121 IIGRIENRIAVWTFLPKDHSMPFQIMKYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYL 180

Query: 181 SDSARGGEMLFPESKVKSKFWSSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSP 240
           SDSA GGE+LFP SKVK +FWS RRKKNNFL PVKGNA+LFFSVHLNASPDKS YH+R+P
Sbjct: 181 SDSASGGEILFPVSKVKRRFWSDRRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTP 240

Query: 241 ILNGELWVATKFFYLRP-TTGNKHTVESD-EDGCIDEDKSCPQWAAIGECERNAVFMIGS 300
           IL+G+LWVATKFFY+RP  TGN+H VES  +D CIDED+SCP+WAAIGEC+RNAVFMIGS
Sbjct: 241 ILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGS 300

Query: 301 PDYYGTCRKSCNAC 311
           PDYYGTCRKSCNAC
Sbjct: 301 PDYYGTCRKSCNAC 314

BLAST of Lsi09G004730 vs. NCBI nr
Match: XP_038906497.1 (probable prolyl 4-hydroxylase 12 [Benincasa hispida])

HSP 1 Score: 594.0 bits (1530), Expect = 7.6e-166
Identity = 290/310 (93.55%), Positives = 298/310 (96.13%), Query Frame = 0

Query: 1   MDSRLHFLLLLATAFSFSTCLAQSNLISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQVS 60
           MDSRL+FLLLLATAFSFSTCLAQSNLISGRKGLRDQL+DRPLSYSNHSGRIDPSRVVQVS
Sbjct: 1   MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDQLVDRPLSYSNHSGRIDPSRVVQVS 60

Query: 61  WRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRNTVSTNLLSSSGVILNTTDDI 120
           W+PRVFLYKGFLSDEECDHLI LAS+SEDNPSGNSAGS NTVST LL+SSGVILNT+DDI
Sbjct: 61  WQPRVFLYKGFLSDEECDHLISLASNSEDNPSGNSAGSGNTVSTKLLNSSGVILNTSDDI 120

Query: 121 IARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKYFYGNRSAMSSSEPLMATVVLYLSD 180
           IARIE +IA+WT LPKDH MPFQIMQYRGEEAEHKYFYGN SAMSSSEPLMATVVLYLSD
Sbjct: 121 IARIENQIAVWTFLPKDHGMPFQIMQYRGEEAEHKYFYGNGSAMSSSEPLMATVVLYLSD 180

Query: 181 SARGGEMLFPESKVKSKFWSSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPIL 240
           SARGGEMLFPESKVKSKFWS RRKKNNFL PVKGNAILFFSVHLNASPDKSSYHTRSPIL
Sbjct: 181 SARGGEMLFPESKVKSKFWSDRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHTRSPIL 240

Query: 241 NGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYY 300
           NGELWVATKFFYLRPTTGNK TVESD DGCIDEDKSCPQWAAIGECERN VFMIGSPDYY
Sbjct: 241 NGELWVATKFFYLRPTTGNKRTVESDVDGCIDEDKSCPQWAAIGECERNTVFMIGSPDYY 300

Query: 301 GTCRKSCNAC 311
           GTCRKSCNAC
Sbjct: 301 GTCRKSCNAC 310

BLAST of Lsi09G004730 vs. NCBI nr
Match: XP_008436994.1 (PREDICTED: probable prolyl 4-hydroxylase 12 [Cucumis melo])

HSP 1 Score: 568.2 bits (1463), Expect = 4.5e-158
Identity = 280/311 (90.03%), Positives = 290/311 (93.25%), Query Frame = 0

Query: 1   MDSRLHFLLLLATAFSFSTCLAQSNLISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQVS 60
           MDSRL+FLLL ATAFSFSTCLAQSNLISGRKGLRDQL+DRPLSYSN S RIDPSRVVQVS
Sbjct: 1   MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVS 60

Query: 61  WRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRNTVSTNLLSSSGVILNTTDDI 120
           WRPRVFLYKGFLSDEECDHLI LAS+SEDNPS NSAGS NTVST LL+ SGVILNTTDDI
Sbjct: 61  WRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGNTVSTELLNGSGVILNTTDDI 120

Query: 121 IARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKYFYGNRSAM-SSSEPLMATVVLYLS 180
           IARIE RIA+WTLLPKDH MPFQIMQYRGEEA+HKYFYGNRSAM SSSEPLMATVVLYLS
Sbjct: 121 IARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLS 180

Query: 181 DSARGGEMLFPESKVKSKFWSSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPI 240
           DSA GGEMLFPESKVKSKFWS RRKK NFL PVKGNAILFFSVHLNASPDKSSYH R PI
Sbjct: 181 DSASGGEMLFPESKVKSKFWSGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPI 240

Query: 241 LNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDY 300
            NGELWVATKF YLRP TGNKHT++S+ DGCIDEDKSCPQWAAIGECERNAVFM+GSPDY
Sbjct: 241 RNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDY 300

Query: 301 YGTCRKSCNAC 311
           YGTCRKSCNAC
Sbjct: 301 YGTCRKSCNAC 311

BLAST of Lsi09G004730 vs. NCBI nr
Match: XP_004152378.1 (probable prolyl 4-hydroxylase 12 [Cucumis sativus] >KGN49777.2 hypothetical protein Csa_000298 [Cucumis sativus])

HSP 1 Score: 566.6 bits (1459), Expect = 1.3e-157
Identity = 277/311 (89.07%), Positives = 292/311 (93.89%), Query Frame = 0

Query: 1   MDSRLHFLLLLATAFSFSTCLAQSNLISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQVS 60
           MDSRL+FLLLLATAFSFSTCLAQSNLISGRKGLRD+L+DRPLSYSN+SGRIDPSRVVQVS
Sbjct: 1   MDSRLNFLLLLATAFSFSTCLAQSNLISGRKGLRDRLVDRPLSYSNYSGRIDPSRVVQVS 60

Query: 61  WRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRNTVSTNLLSSSGVILNTTDDI 120
           WRPRVFLYKGFLSDEECDHLI LAS+SEDNPS NSAGS  TVST LL+SSGVILNTTDDI
Sbjct: 61  WRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSSGVILNTTDDI 120

Query: 121 IARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKYFYGNRSAM-SSSEPLMATVVLYLS 180
           +ARIE R+A+WTLLPKDHSMPFQIMQYRGEEA+HKYFYGNRSAM  SSEPLMATVVLYLS
Sbjct: 121 VARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKYFYGNRSAMLPSSEPLMATVVLYLS 180

Query: 181 DSARGGEMLFPESKVKSKFWSSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPI 240
           DSA GGE+LFPESKVKSKFWS RRKKNNFL PVKGNAILFFSVHLNASPDKSSYH RSPI
Sbjct: 181 DSASGGEILFPESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHLNASPDKSSYHIRSPI 240

Query: 241 LNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDY 300
            +GELWVATKF YL P  GNKHT++SD DGC DEDKSCPQWAAIGECERNAVFM+GSPDY
Sbjct: 241 RDGELWVATKFLYLGPPAGNKHTIQSDVDGCFDEDKSCPQWAAIGECERNAVFMVGSPDY 300

Query: 301 YGTCRKSCNAC 311
           YGTCRKSCNAC
Sbjct: 301 YGTCRKSCNAC 311

BLAST of Lsi09G004730 vs. NCBI nr
Match: KAA0043468.1 (putative prolyl 4-hydroxylase 12 [Cucumis melo var. makuwa])

HSP 1 Score: 565.5 bits (1456), Expect = 2.9e-157
Identity = 278/311 (89.39%), Positives = 290/311 (93.25%), Query Frame = 0

Query: 1   MDSRLHFLLLLATAFSFSTCLAQSNLISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQVS 60
           MDSRL+FLLL ATAFSFSTCLAQSNLISGRKGLRDQL+DRPLSYSN S RIDPSRVVQVS
Sbjct: 1   MDSRLNFLLLFATAFSFSTCLAQSNLISGRKGLRDQLVDRPLSYSNQSVRIDPSRVVQVS 60

Query: 61  WRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRNTVSTNLLSSSGVILNTTDDI 120
           WRPRVFLYKGFLSD+ECDHLI LAS+S+DNPS NSAGS NTVST LL+ SGVILNTTDDI
Sbjct: 61  WRPRVFLYKGFLSDDECDHLISLASNSKDNPSRNSAGSGNTVSTELLNGSGVILNTTDDI 120

Query: 121 IARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKYFYGNRSAM-SSSEPLMATVVLYLS 180
           IARIE RIA+WTLLPKDH MPFQIMQYRGEEA+HKYFYGNRSAM SSSEPLMATVVLYLS
Sbjct: 121 IARIENRIAVWTLLPKDHGMPFQIMQYRGEEAKHKYFYGNRSAMSSSSEPLMATVVLYLS 180

Query: 181 DSARGGEMLFPESKVKSKFWSSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPI 240
           DSA GGEMLFPESKVKSKFWS RRKK NFL PVKGNAILFFSVHLNASPDKSSYH R PI
Sbjct: 181 DSASGGEMLFPESKVKSKFWSGRRKKKNFLRPVKGNAILFFSVHLNASPDKSSYHIRYPI 240

Query: 241 LNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDY 300
            NGELWVATKF YLRP TGNKHT++S+ DGCIDEDKSCPQWAAIGECERNAVFM+GSPDY
Sbjct: 241 RNGELWVATKFLYLRPPTGNKHTIDSNIDGCIDEDKSCPQWAAIGECERNAVFMVGSPDY 300

Query: 301 YGTCRKSCNAC 311
           YGTCRKSCNAC
Sbjct: 301 YGTCRKSCNAC 311

BLAST of Lsi09G004730 vs. NCBI nr
Match: XP_022159842.1 (probable prolyl 4-hydroxylase 12 [Momordica charantia])

HSP 1 Score: 530.0 bits (1364), Expect = 1.4e-146
Identity = 264/311 (84.89%), Positives = 277/311 (89.07%), Query Frame = 0

Query: 1   MDSRLHFLLLLATAFSFSTCLAQSNLISGRKGLRDQLLDR-PLSYSNHSGRIDPSRVVQV 60
           MDSRL  LLLLATA SF +CLAQSNLISGRKGLRDQL++  PLSYSNHSGRIDPSRVVQV
Sbjct: 1   MDSRLPVLLLLATAISFLSCLAQSNLISGRKGLRDQLIESVPLSYSNHSGRIDPSRVVQV 60

Query: 61  SWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRNTVSTNLLSSSGVILNTTDD 120
           SWRPRVFLYKGFLSDEECDHLI LA+SSED PSGNS  S NTV T +L SSG ILNTTDD
Sbjct: 61  SWRPRVFLYKGFLSDEECDHLISLATSSEDKPSGNSTDSGNTVPTKILKSSGAILNTTDD 120

Query: 121 IIARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKYFYGNRSAMSSSEPLMATVVLYLS 180
           IIARIE RIA+WT LPKD+SMP QI+QY GEEAEHKY +GNRSAM SSEPLMATVVLYLS
Sbjct: 121 IIARIENRIAVWTFLPKDYSMPLQILQYGGEEAEHKYVFGNRSAMLSSEPLMATVVLYLS 180

Query: 181 DSARGGEMLFPESKVKSKFWSSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPI 240
           DSA GGEM FPESKVKS+FWS RRKKNN L PVKGNA+L FSVHLNASPDKSS HTRSPI
Sbjct: 181 DSASGGEMRFPESKVKSRFWSDRRKKNNILRPVKGNAVLIFSVHLNASPDKSSSHTRSPI 240

Query: 241 LNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAVFMIGSPDY 300
           L+GELW+ATKFFYLRP TGNKHT E D D C DEDKSCPQWAAIGECERNAVFMIGSPDY
Sbjct: 241 LDGELWIATKFFYLRPITGNKHTDEPDGD-CNDEDKSCPQWAAIGECERNAVFMIGSPDY 300

Query: 301 YGTCRKSCNAC 311
           YGTCRKSCNAC
Sbjct: 301 YGTCRKSCNAC 310

BLAST of Lsi09G004730 vs. TAIR 10
Match: AT4G25600.1 (Oxoglutarate/iron-dependent oxygenase)

HSP 1 Score: 235.7 bits (600), Expect = 4.9e-62
Identity = 136/311 (43.73%), Positives = 184/311 (59.16%), Query Frame = 0

Query: 7   FLLLLATAFSFSTCLAQSNLISGRKGLRDQLL-----DRPLSYSNHSGRIDPSRVVQVSW 66
           FL+L+ T  S S           RK LRD+ +     D   SY   S  +DP+RV+Q+SW
Sbjct: 8   FLILMITMSSSSPPFCSG---GSRKELRDKEITSKSDDTQASYVLGSKFVDPTRVLQLSW 67

Query: 67  RPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRNTVSTNLLSSSGVILNTTDDII 126
            PRVFLY+GFLS+EECDHLI L   + +  S ++ G                    D ++
Sbjct: 68  LPRVFLYRGFLSEEECDHLISLRKETTEVYSVDADGK----------------TQLDPVV 127

Query: 127 ARIETRIALWTLLPKDHSMPFQIMQYRGEEAEHKY-FYGNRSAMSSSEPLMATVVLYLSD 186
           A IE +++ WT LP ++    ++  Y  E++  K  ++G   +    E L+ATVVLYLS+
Sbjct: 128 AGIEEKVSAWTFLPGENGGSIKVRSYTSEKSGKKLDYFGEEPSSVLHESLLATVVLYLSN 187

Query: 187 SARGGEMLFPESKVKSKFWSSRRKKNNFLTPVKGNAILFFSVHLNASPDKSSYHTRSPIL 246
           + +GGE+LFP S++K K  +S  +  N L PVKGNAILFF+  LNAS D  S H R P++
Sbjct: 188 TTQGGELLFPNSEMKPK--NSCLEGGNILRPVKGNAILFFTRLLNASLDGKSTHLRCPVV 247

Query: 247 NGELWVATKFFYLRPTTGNKHTVESDEDG-CIDEDKSCPQWAAIGECERNAVFMIGSPDY 306
            GEL VATK  Y       K     +E G C DED++C +WA +GEC++N V+MIGSPDY
Sbjct: 248 KGELLVATKLIYA------KKQARIEESGECSDEDENCGRWAKLGECKKNPVYMIGSPDY 291

Query: 307 YGTCRKSCNAC 311
           YGTCRKSCNAC
Sbjct: 308 YGTCRKSCNAC 291

BLAST of Lsi09G004730 vs. TAIR 10
Match: AT3G28490.1 (Oxoglutarate/iron-dependent oxygenase)

HSP 1 Score: 210.3 bits (534), Expect = 2.2e-54
Identity = 112/276 (40.58%), Positives = 165/276 (59.78%), Query Frame = 0

Query: 45  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLASSS-EDNPSGNSAGSRNTVS 104
           S+ S  +DP+R+ Q+SW PR FLYKGFLSDEECDHLI LA    E +       S  +  
Sbjct: 21  SSFSFSVDPTRITQLSWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESED 80

Query: 105 TNLLSSSGVIL-NTTDDIIARIETRIALWTLLPKDHSMPFQIMQYRG---EEAEHKYFYG 164
           + + +SSG+ L    DDI+A +E ++A WT LP+++    QI+ Y      +    YFY 
Sbjct: 81  SEVRTSSGMFLTKRQDDIVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFY- 140

Query: 165 NRSAMSSSEPLMATVVLYLSDSARGGEMLFPESK-----VKSKFWSSRRKKNNFLTPVKG 224
           ++ A+      +ATV++YLS+  +GGE +FP  K     +K   WS   K+   + P KG
Sbjct: 141 DKKALELGGHRIATVLMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCAKQGYAVKPRKG 200

Query: 225 NAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDED 284
           +A+LFF++HLN + D +S H   P++ GE W AT++ ++R + G K  V      C+D+ 
Sbjct: 201 DALLFFNLHLNGTTDPNSLHGSCPVIEGEKWSATRWIHVR-SFGKKKLV------CVDDH 260

Query: 285 KSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC 311
           +SC +WA  GECE+N ++M+GS    G CRKSC AC
Sbjct: 261 ESCQEWADAGECEKNPMYMVGSETSLGFCRKSCKAC 288

BLAST of Lsi09G004730 vs. TAIR 10
Match: AT3G28480.2 (Oxoglutarate/iron-dependent oxygenase)

HSP 1 Score: 209.1 bits (531), Expect = 4.9e-54
Identity = 121/328 (36.89%), Positives = 188/328 (57.32%), Query Frame = 0

Query: 1   MDSRLHFLLLLATAFSFSTCLAQSN-LISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQV 60
           MDSR+     L   F+     +  N  ++     RD  + + +  S  S   DP+RV Q+
Sbjct: 1   MDSRIFLAFSLCFLFTLPLISSAPNRFLTRSSNTRDGSVIK-MKTSASSFGFDPTRVTQL 60

Query: 61  SWRPRVFLYKGFLSDEECDHLIFLA------SSSEDNPSGNSAGSRNTVSTNLLSSSGVI 120
           SW PRVFLY+GFLSDEECDH I LA      S   DN SG S  S ++VS  +  SS  I
Sbjct: 61  SWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEDSVSV-VRQSSSFI 120

Query: 121 LN----TTDDIIARIETRIALWTLLPKDHSMPFQIMQY-RGEEAE-HKYFYGNRSAMSSS 180
            N      DDI++ +E ++A WT LP+++    QI+ Y  G++ E H  ++ +++ +   
Sbjct: 121 ANMDSLEIDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELG 180

Query: 181 EPLMATVVLYLSDSARGGEMLFP-----ESKVKSKFWSSRRKKNNFLTPVKGNAILFFSV 240
              +ATV++YLS+  +GGE +FP      +++K   W+   K+   + P KG+A+LFF++
Sbjct: 181 GHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNL 240

Query: 241 HLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAA 300
           H NA+ D +S H   P++ GE W AT++ +++    +     + + GC+DE+ SC +WA 
Sbjct: 241 HPNATTDSNSLHGSCPVVEGEKWSATRWIHVK----SFERAFNKQSGCMDENVSCEKWAK 300

Query: 301 IGECERNAVFMIGSPDYYGTCRKSCNAC 311
            GEC++N  +M+GS   +G CRKSC AC
Sbjct: 301 AGECQKNPTYMVGSDKDHGYCRKSCKAC 322

BLAST of Lsi09G004730 vs. TAIR 10
Match: AT3G28480.1 (Oxoglutarate/iron-dependent oxygenase)

HSP 1 Score: 207.6 bits (527), Expect = 1.4e-53
Identity = 114/319 (35.74%), Positives = 185/319 (57.99%), Query Frame = 0

Query: 1   MDSRLHFLLLLATAFSFSTCLAQSN-LISGRKGLRDQLLDRPLSYSNHSGRIDPSRVVQV 60
           MDSR+     L   F+     +  N  ++     RD  + + +  S  S   DP+RV Q+
Sbjct: 1   MDSRIFLAFSLCFLFTLPLISSAPNRFLTRSSNTRDGSVIK-MKTSASSFGFDPTRVTQL 60

Query: 61  SWRPRVFLYKGFLSDEECDHLIFLASSSEDNPSGNSAGSRNTVSTNLLSSSGVILN-TTD 120
           SW PRVFLY+GFLSDEECDH I LA    +        S  +V + + +SSG+ L+   D
Sbjct: 61  SWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGMFLSKRQD 120

Query: 121 DIIARIETRIALWTLLPKDHSMPFQIMQY-RGEEAE-HKYFYGNRSAMSSSEPLMATVVL 180
           DI++ +E ++A WT LP+++    QI+ Y  G++ E H  ++ +++ +      +ATV++
Sbjct: 121 DIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRIATVLM 180

Query: 181 YLSDSARGGEMLFP-----ESKVKSKFWSSRRKKNNFLTPVKGNAILFFSVHLNASPDKS 240
           YLS+  +GGE +FP      +++K   W+   K+   + P KG+A+LFF++H NA+ D +
Sbjct: 181 YLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNATTDSN 240

Query: 241 SYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDEDGCIDEDKSCPQWAAIGECERNAV 300
           S H   P++ GE W AT++ +++    +     + + GC+DE+ SC +WA  GEC++N  
Sbjct: 241 SLHGSCPVVEGEKWSATRWIHVK----SFERAFNKQSGCMDENVSCEKWAKAGECQKNPT 300

Query: 301 FMIGSPDYYGTCRKSCNAC 311
           +M+GS   +G CRKSC AC
Sbjct: 301 YMVGSDKDHGYCRKSCKAC 314

BLAST of Lsi09G004730 vs. TAIR 10
Match: AT3G06300.1 (P4H isoform 2 )

HSP 1 Score: 182.6 bits (462), Expect = 4.9e-46
Identity = 103/282 (36.52%), Positives = 165/282 (58.51%), Query Frame = 0

Query: 45  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLIFLA------SSSEDNPSGNSAGS 104
           S+ S  I+PS+V QVS +PR F+Y+GFL+D ECDHLI LA      S+  DN +G S  S
Sbjct: 27  SSPSSIINPSKVKQVSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVS 86

Query: 105 RNTVSTNLLSSSGVILNTTDDIIARIETRIALWTLLPKDHSMPFQIMQY-RGEEAE-HKY 164
               S+    S G      D I++ IE +++ WT LPK++    Q+++Y  G++ + H  
Sbjct: 87  DVRTSSGTFISKG-----KDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFD 146

Query: 165 FYGNRSAMSSSEPLMATVVLYLSDSARGGEMLFPESKVKSKFWSSRR--------KKNNF 224
           ++ ++  ++     +ATV+LYLS+  +GGE +FP+++  S+   S          KK   
Sbjct: 147 YFHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIA 206

Query: 225 LTPVKGNAILFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKHTVESDED 284
           + P KGNA+LFF++  +A PD  S H   P++ GE W ATK+ ++     +   + + + 
Sbjct: 207 VKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIHV----DSFDKILTHDG 266

Query: 285 GCIDEDKSCPQWAAIGECERNAVFMIGSPDYYGTCRKSCNAC 311
            C D ++SC +WA +GEC +N  +M+G+P+  G CR+SC AC
Sbjct: 267 NCTDVNESCERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 299
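
The Identity and Positives percentages in each HSP summary line above are the matched counts divided by the alignment length (for example, 121/328 = 36.89%). A minimal Python sketch of parsing such a line and rechecking that arithmetic is given here. The function names are hypothetical, and the regular expression assumes the summary-line layout shown in this report, not BLAST output in general.

```python
import re

# Matches the HSP summary line as laid out in this report.
# "Posi?tives" accepts both "Positives" and the "Postives" spelling
# that some report pages emit (an assumption about input variation).
HSP_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(numerator, denominator):
    """Return numerator/denominator as a percentage rounded to 2 decimals."""
    return round(100.0 * numerator / denominator, 2)


def parse_hsp_summary(line):
    """Extract counts and reported percentages from one HSP summary line."""
    match = HSP_LINE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
        # Recomputed values, for cross-checking the reported ones.
        "computed_identity_pct": percent(int(ident), int(ident_len)),
        "computed_positive_pct": percent(int(pos), int(pos_len)),
    }


if __name__ == "__main__":
    summary = parse_hsp_summary(
        "Identity = 103/282 (36.52%), Positives = 165/282 (58.51%), Query Frame = 0"
    )
    print(summary)
```

Comparing the reported and recomputed fields is a quick sanity check when these lines are copied by hand or scraped from a page.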

The following BLAST results are available for this feature:
Match Name  E-value  Identity (%)  Description
Q8GXT7      6.9e-61  43.73         Probable prolyl 4-hydroxylase 12 OS=Arabidopsis thaliana OX=3702 GN=P4H12 PE=2 S...
F4J0A8      3.1e-53  40.58         Probable prolyl 4-hydroxylase 6 OS=Arabidopsis thaliana OX=3702 GN=P4H6 PE=2 SV=...
Q8L970      2.0e-52  35.74         Probable prolyl 4-hydroxylase 7 OS=Arabidopsis thaliana OX=3702 GN=P4H7 PE=2 SV=...
F4JAU3      6.9e-45  36.52         Prolyl 4-hydroxylase 2 OS=Arabidopsis thaliana OX=3702 GN=P4H2 PE=1 SV=1
Q8LAN3      4.5e-44  34.98         Probable prolyl 4-hydroxylase 4 OS=Arabidopsis thaliana OX=3702 GN=P4H4 PE=2 SV=...

Match Name  E-value   Identity (%)  Description
A0A1S3AT39  2.2e-158  90.03         Procollagen-proline 4-dioxygenase OS=Cucumis melo OX=3656 GN=LOC103482556 PE=3 S...
A0A5A7TKX1  1.4e-157  89.39         Procollagen-proline 4-dioxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C2...
A0A6J1E0X9  6.5e-147  84.89         Procollagen-proline 4-dioxygenase OS=Momordica charantia OX=3673 GN=LOC111026141...
A0A0A0KPE4  5.5e-146  88.24         Procollagen-proline 4-dioxygenase OS=Cucumis sativus OX=3659 GN=Csa_5G166460 PE=...
A0A6J1E2P0  3.7e-142  81.21         Procollagen-proline 4-dioxygenase OS=Cucurbita moschata OX=3662 GN=LOC111430280 ...

Match Name      E-value   Identity (%)  Description
XP_038906497.1  7.6e-166  93.55         probable prolyl 4-hydroxylase 12 [Benincasa hispida]
XP_008436994.1  4.5e-158  90.03         PREDICTED: probable prolyl 4-hydroxylase 12 [Cucumis melo]
XP_004152378.1  1.3e-157  89.07         probable prolyl 4-hydroxylase 12 [Cucumis sativus] >KGN49777.2 hypothetical prot...
KAA0043468.1    2.9e-157  89.39         putative prolyl 4-hydroxylase 12 [Cucumis melo var. makuwa]
XP_022159842.1  1.4e-146  84.89         probable prolyl 4-hydroxylase 12 [Momordica charantia]

Match Name   E-value  Identity (%)  Description
AT4G25600.1  4.9e-62  43.73         Oxoglutarate/iron-dependent oxygenase
AT3G28490.1  2.2e-54  40.58         Oxoglutarate/iron-dependent oxygenase
AT3G28480.2  4.9e-54  36.89         Oxoglutarate/iron-dependent oxygenase
AT3G28480.1  1.4e-53  35.74         Oxoglutarate/iron-dependent oxygenase
AT3G06300.1  4.9e-46  36.52         P4H isoform 2
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (USVL1VR-Ls) v1
Date Performed: 2021-10-18
IPR Term   IPR Description                      Source   Source Term      Source Description                  Alignment
IPR006620  Prolyl 4-hydroxylase, alpha subunit  SMART    SM00702          p4hc                                coord: 63..252; e-value: 3.2E-15; score: 66.6
IPR003582  ShKT domain                          SMART    SM00254          ShkT_1                              coord: 269..310; e-value: 1.7E-4; score: 30.9
IPR003582  ShKT domain                          PROSITE  PS51670          SHKT                                coord: 270..310; score: 8.52967
None       No IPR available                     GENE3D   2.60.120.620     q2cbj1_9rhob like domain            coord: 50..252; e-value: 3.8E-38; score: 133.2
None       No IPR available                     PANTHER  PTHR10869:SF102  PROLYL 4-HYDROXYLASE 12-RELATED     coord: 1..310
IPR045054  Prolyl 4-hydroxylase                 PANTHER  PTHR10869        PROLYL 4-HYDROXYLASE ALPHA SUBUNIT  coord: 1..310

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Lsi09G004730.1  Lsi09G004730.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0019511      peptidyl-proline hydroxylation
cellular_component  GO:0005789      endoplasmic reticulum membrane
molecular_function  GO:0005506      iron ion binding
molecular_function  GO:0031418      L-ascorbic acid binding
molecular_function  GO:0004656      procollagen-proline 4-dioxygenase activity
molecular_function  GO:0016705      oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen