Lag0016732 (gene) Sponge gourd (AG‐4) v1

Overview
Name: Lag0016732
Type: gene
Organism: Luffa acutangula (Sponge gourd (AG‐4) v1)
Description: Procollagen-proline 4-dioxygenase
Location: chr12: 40735722 .. 40738853 (+)
RNA-Seq Expression: Lag0016732
Synteny: Lag0016732
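
The location field can be turned into a genomic span with a few lines of Python. This is a minimal sketch: the helper name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which the page does not state.

```python
import re

def parse_location(text):
    """Split 'chr12: 40735722 .. 40738853 (+)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("chr12: 40735722 .. 40738853 (+)")
span = end - start + 1  # assumption: 1-based, inclusive coordinates
print(seqid, strand, span)  # chr12 + 3132
```

Under that assumption the gene, introns included, covers 3132 bp on the plus strand of chr12.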
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCCAAATTTCTCAATCTGTTACTTATCTTCTTGATTTCGATCGCATCGGTTGCTCGAGAATCAACTTGTTCGTATGCTGGTTCAGCTAGCTCCACCGTAGATCCCAGTAAAGTGAAGCAGATTTCATGGAAACCGAGGTTTTTAGTGCTATCATTTTCAAACGCACGTATCTTTTTTTTTTTTTTGTTCTACTGTGTTTGTTTGCTGAGAAATTTGTGGAGCCGAAAGTATTGAACTCTTAAATTGTTTTAGCTTCTAACTTTCGGTCGTCTAGGTTGAGTTCTGTGACTGTTCGCTACTGAAATGGCCTTAATTTTGTGCTGATACTATTAAGGCATCTACTGTTTATGCTTTGGAAAATTTTTTATTAAACGATTTCACTGATGTTGCAGAGCTTTTGTATATGAAGGTTTTCTCACGGACCTTGAATGCGACCACCTGGTTTCTATTGTAAGTTGAGCTTTAGTTATATTCATAATTCTGTTCGATGTGGAAGTCATGCTGTCGATGTGAATCTCTGGTTGATGGCTCGTTTTTCTTCTGCGATAGGCGAGATCCGAGCTGAAGAGATCTGAGGTTGCTGATAATGAGTCAGGAAAGAGCAAGCTCAGTACTGTTCGAACGAGCTCTGGAATGTTCATTTCTAAGAGCAAGGTTAGGTTTGAGTTCCATACAAATAATCTTCCAATTTTTCCGGATATTTTCTTTAATTTCTGTTTCGATTTTTGTTTTTGATCGGAAACAAACTTCTGTTACTAGTCTTCACTGGAAATTGGATTAAACTAAATGGAAGAAACTTCGCTGTTACGAAAGTGAAATATATTTCCTTCTCAAAGAGGTTAAACAATCACTTCACGATAGGAATGTTCCTGGTAGCATGGATGCACTAATTAACATTGCATCTTGTTTGTAAGTTTTGGATCTTTGTTTGATTCACGAGGTCCATTACGTAGACTGTTAATACAAAATTAGGGAGTAAACAATTTTACAGGAATAATAACTTAAAACCTCCTTCAATCCTATCTTGTTAAATTAGGACTAGATTAAAAAATTTGAGCGGTTGGGTTATAATGAATCTAATATTTTGTCTTAAGTATCAGTTAAGATCTCTTCTTTTTACCCCATGTTTGTATCGTATATTGACATTGTAACTTGATGTTTAGGATCCTATTGTTAATGGCATAGAGGAAAAAATTGCTGCGTGGACTTTTCTTCCAAAAGGTATACGATTTTTTTCAGTCAAACACATATTTGGGTTCTGATTTTTTTCACAGTCAAATCACACAAACACATGCATATCATAAGGGCTTTTTTTTGGATTCTGTGGAAGGAACAAAACCAAAGGGTTTCTTTTTTTGGAAAAAAAGTATACCTTTGAAGGGTTTTTCCATCTCGTTATTTACTGTGCTATATCTTGGTTTAATATGTCCAATTTCTTTGCTTCCTATAGTTATGCTTCCCTCTTAACCTACTGGAAAAGCTTTTTGTAATCACCATGGATTTTCTATCCTCGTTGTAAATTTCATACATTAATGAAATTGTTTCTTATCAATATATATACATACACATGCTTTTCAATTACTACACCTGATTGCCACAAGTGACATTCATGGTGTCTCTATACCTGGGAGAGGCCAAGGAGTGCATATATATATTATTGAATGGTTGTCATGCCAAGTATTTCTGCTAAATCTAAATTTGGTGTATTTCTTGATTTTCAGAAAATGGGGAAGACATTCAGGTATTGAGATACGAGCATGGGCAGAAATATGAATCACATTATGATTACTTTGTTGACAAGGTGAATATTGCATGGGGAGGACATCGTATGGCTACTGTCCTCATGTATCTCTCTGACGTGACCAAAGGCGGTGAAACAGTTTTCCCCATGGCAGAGGTTAGTCCAAGTGTTGATTCACAAATTACTTTACTAGAACTACACTGCTATATTCCTCATCTTCAATTGTAAGATCACCTCTTCTCAGATCTCACTACTC
TCTAAATAAAAAACCTTACTATTCAGTCTTCTGATGAAAAGCATCCTTTTCACGACTCAGGGTGTATAAGATTAATTCCAATACATTTAGAAAACACTTATATTTGAAAAATTGAACCCTCCTATTTGTGTTCATAGTCTTTATATATATATGTCCTGGAACCACTCAATGTTGGATAATCCAGAGGATTTCACCATAATTGTATAGAAATTCCTACTGCAAACAACCTAATTATTGTTCAGGTTGATGGTATCGAACCCTGTTACCGCATCTGAACCTCTTTCCATGATTCTTTTTATCAAACAAAATGCATATCAATTGATCCATATTCATTGTGGATGCAATAGTATCTAGCTTAGAGGCATTTAGCAGTTGCTGCTCCACCGCCTCTTTATTGTAGAAAACTTCATTTGGGAAAAATGTTATTGTAATTATTGTAAATGATTGAAATTGACCAATTCCCTAAGATTTTGACCAAACTATTGGTGAATTGGATTAGACATGAAACATCTTGGATTACTGAAAATTTCCCCACGAAGAAGTCAACTTTTAGCAAACTTCTACAATTGGTATTGAAGTTACAGATAGATTGGACATGAATTTGGTTAAGAATAGCAGGATCTGTTTAACCTTGTTATATAACCTCTGTGAGCAGAAATCTCCCCACCGGAGGGCTTCCGAAACAGACGAGGATCTCTCAGAGTGTGCAAAGAAAGGAATTGCTGGTGAGTTGTTAACCAAATTATGAATGCTTCTTCAGTAATCTTCAAACTATAAAACTTGCAAGAAATTTATGAACTTTTTATCTGACTTCATTCCCACTTGGAACGGTGTAGTGAAACCTAAGAAAGGCGATGCCCTTCTTTTCTTTAGTCTTGAACCAAATGCCATTCCAGACACCAACAGTCTGCATGGAGGCTGCCCTGTTCTTGAAGGAGAAAAATGGTCAGCTACAAAATGGATTCACGTAGATTCTTTCAGCAAAAACTTAGGAAACATTGGGAACTGCACTGATCTAAATGAAAGTTGTGAGAGATGGGCTGCCTTAGGAGAATGCACGAAAAACCCAGAGTACATGGTCGGATCTTCAGAACTTCCTGGCTACTGTAGGAGGAGTTGCAGGATCTGTTGA

mRNA sequence

ATGTCCAAATTTCTCAATCTGTTACTTATCTTCTTGATTTCGATCGCATCGGTTGCTCGAGAATCAACTTGTTCGTATGCTGGTTCAGCTAGCTCCACCGTAGATCCCAGTAAAGTGAAGCAGATTTCATGGAAACCGAGAGCTTTTGTATATGAAGGTTTTCTCACGGACCTTGAATGCGACCACCTGGTTTCTATTGCGAGATCCGAGCTGAAGAGATCTGAGGTTGCTGATAATGAGTCAGGAAAGAGCAAGCTCAGTACTGTTCGAACGAGCTCTGGAATGTTCATTTCTAAGAGCAAGGATCCTATTGTTAATGGCATAGAGGAAAAAATTGCTGCGTGGACTTTTCTTCCAAAAGAAAATGGGGAAGACATTCAGGTATTGAGATACGAGCATGGGCAGAAATATGAATCACATTATGATTACTTTGTTGACAAGGTGAATATTGCATGGGGAGGACATCGTATGGCTACTGTCCTCATGTATCTCTCTGACGTGACCAAAGGCGGTGAAACAGTTTTCCCCATGGCAGAGAAATCTCCCCACCGGAGGGCTTCCGAAACAGACGAGGATCTCTCAGAGTGTGCAAAGAAAGGAATTGCTGTGAAACCTAAGAAAGGCGATGCCCTTCTTTTCTTTAGTCTTGAACCAAATGCCATTCCAGACACCAACAGTCTGCATGGAGGCTGCCCTGTTCTTGAAGGAGAAAAATGGTCAGCTACAAAATGGATTCACGTAGATTCTTTCAGCAAAAACTTAGGAAACATTGGGAACTGCACTGATCTAAATGAAAGTTGTGAGAGATGGGCTGCCTTAGGAGAATGCACGAAAAACCCAGAGTACATGGTCGGATCTTCAGAACTTCCTGGCTACTGTAGGAGGAGTTGCAGGATCTGTTGA

Coding sequence (CDS)

ATGTCCAAATTTCTCAATCTGTTACTTATCTTCTTGATTTCGATCGCATCGGTTGCTCGAGAATCAACTTGTTCGTATGCTGGTTCAGCTAGCTCCACCGTAGATCCCAGTAAAGTGAAGCAGATTTCATGGAAACCGAGAGCTTTTGTATATGAAGGTTTTCTCACGGACCTTGAATGCGACCACCTGGTTTCTATTGCGAGATCCGAGCTGAAGAGATCTGAGGTTGCTGATAATGAGTCAGGAAAGAGCAAGCTCAGTACTGTTCGAACGAGCTCTGGAATGTTCATTTCTAAGAGCAAGGATCCTATTGTTAATGGCATAGAGGAAAAAATTGCTGCGTGGACTTTTCTTCCAAAAGAAAATGGGGAAGACATTCAGGTATTGAGATACGAGCATGGGCAGAAATATGAATCACATTATGATTACTTTGTTGACAAGGTGAATATTGCATGGGGAGGACATCGTATGGCTACTGTCCTCATGTATCTCTCTGACGTGACCAAAGGCGGTGAAACAGTTTTCCCCATGGCAGAGAAATCTCCCCACCGGAGGGCTTCCGAAACAGACGAGGATCTCTCAGAGTGTGCAAAGAAAGGAATTGCTGTGAAACCTAAGAAAGGCGATGCCCTTCTTTTCTTTAGTCTTGAACCAAATGCCATTCCAGACACCAACAGTCTGCATGGAGGCTGCCCTGTTCTTGAAGGAGAAAAATGGTCAGCTACAAAATGGATTCACGTAGATTCTTTCAGCAAAAACTTAGGAAACATTGGGAACTGCACTGATCTAAATGAAAGTTGTGAGAGATGGGCTGCCTTAGGAGAATGCACGAAAAACCCAGAGTACATGGTCGGATCTTCAGAACTTCCTGGCTACTGTAGGAGGAGTTGCAGGATCTGTTGA

Protein sequence

MSKFLNLLLIFLISIASVARESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNESGKSKLSTVRTSSGMFISKSKDPIVNGIEEKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRMATVLMYLSDVTKGGETVFPMAEKSPHRRASETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSSELPGYCRRSCRIC
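
The relationship between the CDS and the protein sequence can be checked by translating codons with the standard genetic code. The sketch below builds the codon table from the conventional TCAG ordering and translates only the first 21 and last 12 nucleotides of the CDS shown above. The function name `translate` is illustrative, and using the standard nuclear code for this gene is an assumption.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGTCCAAATTTCTCAATCTG"  # first 21 nt of the CDS above
cds_end = "AGGATCTGTTGA"             # last 12 nt of the CDS above, ending in the TGA stop

print(translate(cds_start))  # MSKFLNL
print(translate(cds_end))    # RIC
```

Both fragments agree with the start (MSKFLNL...) and end (...RIC) of the protein sequence listed above.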
Homology
BLAST of Lag0016732 vs. NCBI nr
Match: XP_038874583.1 (probable prolyl 4-hydroxylase 4 [Benincasa hispida])

HSP 1 Score: 581.3 bits (1497), Expect = 4.9e-162
Identity = 280/300 (93.33%), Positives = 291/300 (97.00%), Query Frame = 0

Query: 1   MSKFLNLLLIFLISIASVARESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           M KF NLL IFLI I+ V RESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MFKFRNLLFIFLILISLVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLVSIARSELKRSEVADNESGKSKLSTVRTSSGMFISKSKDPIVNGIEEKIAAWTFLPK 120
           DHLVSIARSELKRSEVADN+SGKSKLSTVRTSSGMFISKSKDPIV+GIE+KI+AWTFLPK
Sbjct: 61  DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPIVSGIEDKISAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRMATVLMYLSDVTKGGETVFPMAEK 180
           ENGEDIQVLRYEHG+KYESHYDYFVDKVNIAWGGHR+ATVLMYLS+VTKGGETVFP+AEK
Sbjct: 121 ENGEDIQVLRYEHGEKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEK 180

Query: 181 SPHRRASETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240
           S HRRA ETDEDLSECA+KGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS
Sbjct: 181 SAHRRAYETDEDLSECARKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSSELPGYCRRSCRIC 300
           ATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGS E+PGYCRRSCRIC
Sbjct: 241 ATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300
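
The percentages in an HSP summary line are simply the match counts divided by the alignment length. As a hedged illustration, the sketch below pulls each `label = n/d (p%)` fraction out of such a line and recomputes the percentage. The helper `hsp_fractions` is a hypothetical name, not part of any BLAST tool.

```python
import re

def hsp_fractions(line):
    """Return (label, matches, aligned_length, reported_percent) for each fraction in an HSP line."""
    pattern = r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)"
    return [(label, int(n), int(d), float(p))
            for label, n, d, p in re.findall(pattern, line)]

line = "Identity = 280/300 (93.33%), Positives = 291/300 (97.00%), Query Frame = 0"
for label, n, d, reported in hsp_fractions(line):
    recomputed = round(100 * n / d, 2)
    print(label, recomputed, reported)
```

For the record above, the recomputed values (93.33 and 97.0) agree with the reported ones.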

BLAST of Lag0016732 vs. NCBI nr
Match: XP_008458517.1 (PREDICTED: probable prolyl 4-hydroxylase 4 [Cucumis melo] >XP_008458518.1 PREDICTED: probable prolyl 4-hydroxylase 4 [Cucumis melo] >KAA0033447.1 putative prolyl 4-hydroxylase 4 [Cucumis melo var. makuwa])

HSP 1 Score: 577.0 bits (1486), Expect = 9.3e-161
Identity = 277/300 (92.33%), Positives = 290/300 (96.67%), Query Frame = 0

Query: 1   MSKFLNLLLIFLISIASVARESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           M KF NLL  FLI I+S  RESTCSYAGSAS+TVDPS+VKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLVSIARSELKRSEVADNESGKSKLSTVRTSSGMFISKSKDPIVNGIEEKIAAWTFLPK 120
           DHLVSIARSELKRSEVADN+SGKSKLSTVRTSSGMFISK+KDPIV+GIE+KI+AWTFLPK
Sbjct: 61  DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRMATVLMYLSDVTKGGETVFPMAEK 180
           ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHR+ATVLMYLS+VTKGGETVFP+AEK
Sbjct: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEK 180

Query: 181 SPHRRASETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240
           S HRRA ETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS
Sbjct: 181 SSHRRAYETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSSELPGYCRRSCRIC 300
           ATKWIHVDSFSKNLG+IGNCTDLNESCERWAALGECTKNPEYMVGS E+PGYCRRSCRIC
Sbjct: 241 ATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300
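
The identity count in an HSP is the number of alignment columns where query and subject carry the same residue. A minimal sketch, using only the first 60-column block of the alignment above (the function name `count_identities` is illustrative):

```python
def count_identities(query_row, sbjct_row):
    """Count alignment columns where both rows hold the same residue (gap columns never count)."""
    if len(query_row) != len(sbjct_row):
        raise ValueError("aligned rows must have equal length")
    return sum(q == s and q != "-" for q, s in zip(query_row, sbjct_row))

# First block (query positions 1-60) of the Cucumis melo alignment above.
query = "MSKFLNLLLIFLISIASVARESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLEC"
sbjct = "MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLEC"

print(count_identities(query, sbjct), "of", len(query))  # 50 of 60
```

Summing this count over all blocks of an HSP gives the numerator of the Identity fraction. The Positives count additionally includes the conservative substitutions marked `+` in the midline.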

BLAST of Lag0016732 vs. NCBI nr
Match: XP_023000081.1 (probable prolyl 4-hydroxylase 4 [Cucurbita maxima] >XP_023000082.1 probable prolyl 4-hydroxylase 4 [Cucurbita maxima])

HSP 1 Score: 576.6 bits (1485), Expect = 1.2e-160
Identity = 279/300 (93.00%), Positives = 288/300 (96.00%), Query Frame = 0

Query: 1   MSKFLNLLLIFLISIASVARESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           MSKF +LL IFLISIASV RES CS A SAS+TVDPSKVKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MSKFRSLLFIFLISIASVVRESICSSARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLVSIARSELKRSEVADNESGKSKLSTVRTSSGMFISKSKDPIVNGIEEKIAAWTFLPK 120
           DHL+SIARSELKRSEVADNESGKSKLSTVRTSSGMFI KSKD IV+GIE+KIAAWTFLPK
Sbjct: 61  DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRMATVLMYLSDVTKGGETVFPMAEK 180
           ENGEDIQVLRYEHGQ+YESHYDYFVDKVNIAWGGHR+ATVLMYLSDVTKGGETVFPMAEK
Sbjct: 121 ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK 180

Query: 181 SPHRRASETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240
           SPHRRASETDEDLSECA+KGIAVKPKKGDALLFFSLEPNAIPDT SLHGGCPVLEGEKWS
Sbjct: 181 SPHRRASETDEDLSECARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSSELPGYCRRSCRIC 300
           ATKWIHVDSFSKNL N+GNCTDLNESCERWAALGECTKNPEYMVGS ELPGYCRRSCR C
Sbjct: 241 ATKWIHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC 300
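
Each HSP line reports a bit score followed by the raw score in parentheses. The two are related by S' = (λS − ln K) / ln 2, where λ and K depend on the scoring system. The sketch below assumes gapped BLOSUM62 with gap costs 11/1 (λ ≈ 0.267, K ≈ 0.041). The page does not state which parameters were used, so these values are an assumption.

```python
import math

LAMBDA = 0.267  # assumed: gapped BLOSUM62, gap open 11 / extend 1
K = 0.041       # assumed: same scoring system

def bit_score(raw_score, lam=LAMBDA, k=K):
    """Convert a raw alignment score to a normalised bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)

# Raw score 1485 is the value in parentheses for the Cucurbita maxima hit above.
print(round(bit_score(1485), 1))
```

With these assumed parameters the raw score 1485 converts to roughly 576.6 bits, in line with the reported value.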

BLAST of Lag0016732 vs. NCBI nr
Match: KAG6575033.1 (putative prolyl 4-hydroxylase 4, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 575.5 bits (1482), Expect = 2.7e-160
Identity = 278/300 (92.67%), Positives = 292/300 (97.33%), Query Frame = 0

Query: 1   MSKFLNLLLIFLISIASVARESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           MSKF NLL +FLI I+SV RES+CSYAGSA+STVDPSKVKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLVSIARSELKRSEVADNESGKSKLSTVRTSSGMFISKSKDPIVNGIEEKIAAWTFLPK 120
           DHLVSIARSELKRSEVADN+SG SKLSTVRTSSGMFISKSKDPIV+GIE+KIAAWTFLPK
Sbjct: 61  DHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKSKDPIVSGIEDKIAAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRMATVLMYLSDVTKGGETVFPMAEK 180
           ENGEDIQVLRYEHGQKYESHYDYFVDKVNIA GGHR+ATVLMYLS+VTKGGETVFP+AEK
Sbjct: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEK 180

Query: 181 SPHRRASETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240
           SP RRASETDEDL+ECA++GIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS
Sbjct: 181 SPRRRASETDEDLTECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSSELPGYCRRSCRIC 300
           ATKWIHVDSFSKNLGNIG+CTDLNESCERWAALGECTKNPEYMVGS ELPGYCRRSCRIC
Sbjct: 241 ATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRIC 300

BLAST of Lag0016732 vs. NCBI nr
Match: XP_022959148.1 (probable prolyl 4-hydroxylase 4 [Cucurbita moschata] >XP_022959149.1 probable prolyl 4-hydroxylase 4 [Cucurbita moschata])

HSP 1 Score: 575.1 bits (1481), Expect = 3.5e-160
Identity = 278/300 (92.67%), Positives = 292/300 (97.33%), Query Frame = 0

Query: 1   MSKFLNLLLIFLISIASVARESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           MSKF NLL +FLI I+SV RES+CSYAGSA+STVDPSKVKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLVSIARSELKRSEVADNESGKSKLSTVRTSSGMFISKSKDPIVNGIEEKIAAWTFLPK 120
           DHLVSIARSELKRSEVADN+SG SKLSTVRTSSGMFISKSKDPIV+GIE+KIAAWTFLPK
Sbjct: 61  DHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKSKDPIVSGIEDKIAAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRMATVLMYLSDVTKGGETVFPMAEK 180
           ENGEDIQVLRYEHGQKYESHYDYFVDKVNIA GGHR+ATVLMYLS+VTKGGETVFP+AEK
Sbjct: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEK 180

Query: 181 SPHRRASETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240
           SP RRASETDEDLSECA++GIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS
Sbjct: 181 SPRRRASETDEDLSECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSSELPGYCRRSCRIC 300
           ATKWIHVDSFSKNLGNIG+C+DLNESCERWAALGECTKNPEYMVGS ELPGYCRRSCRIC
Sbjct: 241 ATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRIC 300

BLAST of Lag0016732 vs. ExPASy Swiss-Prot
Match: Q8LAN3 (Probable prolyl 4-hydroxylase 4 OS=Arabidopsis thaliana OX=3702 GN=P4H4 PE=2 SV=1)

HSP 1 Score: 456.4 bits (1173), Expect = 2.4e-127
Identity = 218/293 (74.40%), Positives = 251/293 (85.67%), Query Frame = 0

Query: 8   LLIFLISIASVARESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIA 67
           LLI   +I SV  +S+ S   S+S  V+PSKVKQ+S KPRAFVYEGFLT+LECDH+VS+A
Sbjct: 6   LLISFFAIFSVLLQSSTSLISSSSVFVNPSKVKQVSSKPRAFVYEGFLTELECDHMVSLA 65

Query: 68  RSELKRSEVADNESGKSKLSTVRTSSGMFISKSKDPIVNGIEEKIAAWTFLPKENGEDIQ 127
           ++ LKRS VADN+SG+SK S VRTSSG FISK KDPIV+GIE+KI+ WTFLPKENGEDIQ
Sbjct: 66  KASLKRSAVADNDSGESKFSEVRTSSGTFISKGKDPIVSGIEDKISTWTFLPKENGEDIQ 125

Query: 128 VLRYEHGQKYESHYDYFVDKVNIAWGGHRMATVLMYLSDVTKGGETVFPMAEKSPHRRAS 187
           VLRYEHGQKY++H+DYF DKVNI  GGHRMAT+LMYLS+VTKGGETVFP AE    R  S
Sbjct: 126 VLRYEHGQKYDAHFDYFHDKVNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLS 185

Query: 188 ETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHV 247
           E  EDLS+CAK+GIAVKP+KGDALLFF+L P+AIPD  SLHGGCPV+EGEKWSATKWIHV
Sbjct: 186 ENKEDLSDCAKRGIAVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIHV 245

Query: 248 DSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSSELPGYCRRSCRIC 301
           DSF + +   GNCTD+NESCERWA LGECTKNPEYMVG++ELPGYCRRSC+ C
Sbjct: 246 DSFDRIVTPSGNCTDMNESCERWAVLGECTKNPEYMVGTTELPGYCRRSCKAC 298

BLAST of Lag0016732 vs. ExPASy Swiss-Prot
Match: F4JAU3 (Prolyl 4-hydroxylase 2 OS=Arabidopsis thaliana OX=3702 GN=P4H2 PE=1 SV=1)

HSP 1 Score: 443.0 bits (1138), Expect = 2.8e-123
Identity = 209/293 (71.33%), Positives = 251/293 (85.67%), Query Frame = 0

Query: 8   LLIFLISIASVARESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIA 67
           LL+F+  +  + + STC    S SS ++PSKVKQ+S KPRAFVYEGFLTDLECDHL+S+A
Sbjct: 8   LLLFVAILLVLLQSSTC-LISSPSSIINPSKVKQVSSKPRAFVYEGFLTDLECDHLISLA 67

Query: 68  RSELKRSEVADNESGKSKLSTVRTSSGMFISKSKDPIVNGIEEKIAAWTFLPKENGEDIQ 127
           +  L+RS VADN++G+S++S VRTSSG FISK KDPIV+GIE+K++ WTFLPKENGED+Q
Sbjct: 68  KENLQRSAVADNDNGESQVSDVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPKENGEDLQ 127

Query: 128 VLRYEHGQKYESHYDYFVDKVNIAWGGHRMATVLMYLSDVTKGGETVFPMAEKSPHRRAS 187
           VLRYEHGQKY++H+DYF DKVNIA GGHR+ATVL+YLS+VTKGGETVFP A++   R  S
Sbjct: 128 VLRYEHGQKYDAHFDYFHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLS 187

Query: 188 ETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHV 247
           E  +DLS+CAKKGIAVKPKKG+ALLFF+L+ +AIPD  SLHGGCPV+EGEKWSATKWIHV
Sbjct: 188 ENKDDLSDCAKKGIAVKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIHV 247

Query: 248 DSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSSELPGYCRRSCRIC 301
           DSF K L + GNCTD+NESCERWA LGEC KNPEYMVG+ E+PG CRRSC+ C
Sbjct: 248 DSFDKILTHDGNCTDVNESCERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 299

BLAST of Lag0016732 vs. ExPASy Swiss-Prot
Match: Q8L970 (Probable prolyl 4-hydroxylase 7 OS=Arabidopsis thaliana OX=3702 GN=P4H7 PE=2 SV=1)

HSP 1 Score: 359.4 bits (921), Expect = 4.0e-98
Identity = 171/291 (58.76%), Positives = 215/291 (73.88%), Query Frame = 0

Query: 11  FLISIASVARESTCSYAGSASS-TVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARS 70
           FL   ++    S      SASS   DP++V Q+SW PR F+YEGFL+D ECDH + +A+ 
Sbjct: 27  FLTRSSNTRDGSVIKMKTSASSFGFDPTRVTQLSWTPRVFLYEGFLSDEECDHFIKLAKG 86

Query: 71  ELKRSEVADNESGKSKLSTVRTSSGMFISKSKDPIVNGIEEKIAAWTFLPKENGEDIQVL 130
           +L++S VADN+SG+S  S VRTSSGMF+SK +D IV+ +E K+AAWTFLP+ENGE +Q+L
Sbjct: 87  KLEKSMVADNDSGESVESEVRTSSGMFLSKRQDDIVSNVEAKLAAWTFLPEENGESMQIL 146

Query: 131 RYEHGQKYESHYDYFVDKVNIAWGGHRMATVLMYLSDVTKGGETVFPMAEKSPHRRASET 190
            YE+GQKYE H+DYF D+ N+  GGHR+ATVLMYLS+V KGGETVFPM +    +     
Sbjct: 147 HYENGQKYEPHFDYFHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWK---GKATQLK 206

Query: 191 DEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDS 250
           D+  +ECAK+G AVKP+KGDALLFF+L PNA  D+NSLHG CPV+EGEKWSAT+WIHV S
Sbjct: 207 DDSWTECAKQGYAVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKWSATRWIHVKS 266

Query: 251 FSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSSELPGYCRRSCRIC 301
           F +       C D N SCE+WA  GEC KNP YMVGS +  GYCR+SC+ C
Sbjct: 267 FERAFNKQSGCMDENVSCEKWAKAGECQKNPTYMVGSDKDHGYCRKSCKAC 314

BLAST of Lag0016732 vs. ExPASy Swiss-Prot
Match: F4J0A8 (Probable prolyl 4-hydroxylase 6 OS=Arabidopsis thaliana OX=3702 GN=P4H6 PE=2 SV=1)

HSP 1 Score: 337.4 bits (864), Expect = 1.6e-91
Identity = 170/298 (57.05%), Positives = 216/298 (72.48%), Query Frame = 0

Query: 5   LNLLLIFLISIASVARESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLV 64
           L+LLLIF             S   S S +VDP+++ Q+SW PRAF+Y+GFL+D ECDHL+
Sbjct: 11  LSLLLIF-------------SQISSFSFSVDPTRITQLSWTPRAFLYKGFLSDEECDHLI 70

Query: 65  SIARSELKRS-EVADNESGKSKLSTVRTSSGMFISKSKDPIVNGIEEKIAAWTFLPKENG 124
            +A+ +L++S  VAD +SG+S+ S VRTSSGMF++K +D IV  +E K+AAWTFLP+ENG
Sbjct: 71  KLAKGKLEKSMVVADVDSGESEDSEVRTSSGMFLTKRQDDIVANVEAKLAAWTFLPEENG 130

Query: 125 EDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRMATVLMYLSDVTKGGETVFP-MAEKSP 184
           E +Q+L YE+GQKY+ H+DYF DK  +  GGHR+ATVLMYLS+VTKGGETVFP    K+P
Sbjct: 131 EALQILHYENGQKYDPHFDYFYDKKALELGGHRIATVLMYLSNVTKGGETVFPNWKGKTP 190

Query: 185 HRRASETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSAT 244
             +    D+  S+CAK+G AVKP+KGDALLFF+L  N   D NSLHG CPV+EGEKWSAT
Sbjct: 191 QLK----DDSWSKCAKQGYAVKPRKGDALLFFNLHLNGTTDPNSLHGSCPVIEGEKWSAT 250

Query: 245 KWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSSELPGYCRRSCRIC 301
           +WIHV SF K       C D +ESC+ WA  GEC KNP YMVGS    G+CR+SC+ C
Sbjct: 251 RWIHVRSFGKKK---LVCVDDHESCQEWADAGECEKNPMYMVGSETSLGFCRKSCKAC 288

BLAST of Lag0016732 vs. ExPASy Swiss-Prot
Match: Q9LN20 (Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana OX=3702 GN=P4H3 PE=2 SV=1)

HSP 1 Score: 252.3 bits (643), Expect = 6.9e-66
Identity = 118/209 (56.46%), Positives = 158/209 (75.60%), Query Frame = 0

Query: 42  ISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNESGKSKLSTVRTSSGMFISKSK 101
           +SW+PRAFVY  FL+  EC++L+S+A+  + +S V D+E+GKSK S VRTSSG F+ + +
Sbjct: 79  LSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFLRRGR 138

Query: 102 DPIVNGIEEKIAAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRMATVL 161
           D I+  IE++IA +TF+P ++GE +QVL YE GQKYE HYDYFVD+ N   GG RMAT+L
Sbjct: 139 DKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQRMATML 198

Query: 162 MYLSDVTKGGETVFPMAEKSPHRRASETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAI 221
           MYLSDV +GGETVFP A  + +  +     +LSEC KKG++VKP+ GDALLF+S+ P+A 
Sbjct: 199 MYLSDVEEGGETVFPAA--NMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDAT 258

Query: 222 PDTNSLHGGCPVLEGEKWSATKWIHVDSF 251
            D  SLHGGCPV+ G KWS+TKW+HV  +
Sbjct: 259 LDPTSLHGGCPVIRGNKWSSTKWMHVGEY 285

BLAST of Lag0016732 vs. ExPASy TrEMBL
Match: A0A1S3C816 (Procollagen-proline 4-dioxygenase OS=Cucumis melo OX=3656 GN=LOC103497901 PE=3 SV=1)

HSP 1 Score: 577.0 bits (1486), Expect = 4.5e-161
Identity = 277/300 (92.33%), Positives = 290/300 (96.67%), Query Frame = 0

Query: 1   MSKFLNLLLIFLISIASVARESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           M KF NLL  FLI I+S  RESTCSYAGSAS+TVDPS+VKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLVSIARSELKRSEVADNESGKSKLSTVRTSSGMFISKSKDPIVNGIEEKIAAWTFLPK 120
           DHLVSIARSELKRSEVADN+SGKSKLSTVRTSSGMFISK+KDPIV+GIE+KI+AWTFLPK
Sbjct: 61  DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRMATVLMYLSDVTKGGETVFPMAEK 180
           ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHR+ATVLMYLS+VTKGGETVFP+AEK
Sbjct: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEK 180

Query: 181 SPHRRASETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240
           S HRRA ETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS
Sbjct: 181 SSHRRAYETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSSELPGYCRRSCRIC 300
           ATKWIHVDSFSKNLG+IGNCTDLNESCERWAALGECTKNPEYMVGS E+PGYCRRSCRIC
Sbjct: 241 ATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300

BLAST of Lag0016732 vs. ExPASy TrEMBL
Match: A0A5A7SVW6 (Procollagen-proline 4-dioxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold111G001080 PE=3 SV=1)

HSP 1 Score: 577.0 bits (1486), Expect = 4.5e-161
Identity = 277/300 (92.33%), Positives = 290/300 (96.67%), Query Frame = 0

Query: 1   MSKFLNLLLIFLISIASVARESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           M KF NLL  FLI I+S  RESTCSYAGSAS+TVDPS+VKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLVSIARSELKRSEVADNESGKSKLSTVRTSSGMFISKSKDPIVNGIEEKIAAWTFLPK 120
           DHLVSIARSELKRSEVADN+SGKSKLSTVRTSSGMFISK+KDPIV+GIE+KI+AWTFLPK
Sbjct: 61  DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRMATVLMYLSDVTKGGETVFPMAEK 180
           ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHR+ATVLMYLS+VTKGGETVFP+AEK
Sbjct: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEK 180

Query: 181 SPHRRASETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240
           S HRRA ETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS
Sbjct: 181 SSHRRAYETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSSELPGYCRRSCRIC 300
           ATKWIHVDSFSKNLG+IGNCTDLNESCERWAALGECTKNPEYMVGS E+PGYCRRSCRIC
Sbjct: 241 ATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300

BLAST of Lag0016732 vs. ExPASy TrEMBL
Match: A0A6J1KCK1 (Procollagen-proline 4-dioxygenase OS=Cucurbita maxima OX=3661 GN=LOC111494385 PE=3 SV=1)

HSP 1 Score: 576.6 bits (1485), Expect = 5.9e-161
Identity = 279/300 (93.00%), Positives = 288/300 (96.00%), Query Frame = 0

Query: 1   MSKFLNLLLIFLISIASVARESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           MSKF +LL IFLISIASV RES CS A SAS+TVDPSKVKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MSKFRSLLFIFLISIASVVRESICSSARSASTTVDPSKVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLVSIARSELKRSEVADNESGKSKLSTVRTSSGMFISKSKDPIVNGIEEKIAAWTFLPK 120
           DHL+SIARSELKRSEVADNESGKSKLSTVRTSSGMFI KSKD IV+GIE+KIAAWTFLPK
Sbjct: 61  DHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRMATVLMYLSDVTKGGETVFPMAEK 180
           ENGEDIQVLRYEHGQ+YESHYDYFVDKVNIAWGGHR+ATVLMYLSDVTKGGETVFPMAEK
Sbjct: 121 ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAEK 180

Query: 181 SPHRRASETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240
           SPHRRASETDEDLSECA+KGIAVKPKKGDALLFFSLEPNAIPDT SLHGGCPVLEGEKWS
Sbjct: 181 SPHRRASETDEDLSECARKGIAVKPKKGDALLFFSLEPNAIPDTKSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSSELPGYCRRSCRIC 300
           ATKWIHVDSFSKNL N+GNCTDLNESCERWAALGECTKNPEYMVGS ELPGYCRRSCR C
Sbjct: 241 ATKWIHVDSFSKNLANVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC 300

BLAST of Lag0016732 vs. ExPASy TrEMBL
Match: A0A6J1H545 (Procollagen-proline 4-dioxygenase OS=Cucurbita moschata OX=3662 GN=LOC111460227 PE=3 SV=1)

HSP 1 Score: 575.1 bits (1481), Expect = 1.7e-160
Identity = 278/300 (92.67%), Positives = 292/300 (97.33%), Query Frame = 0

Query: 1   MSKFLNLLLIFLISIASVARESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           MSKF NLL +FLI I+SV RES+CSYAGSA+STVDPSKVKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MSKFRNLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLVSIARSELKRSEVADNESGKSKLSTVRTSSGMFISKSKDPIVNGIEEKIAAWTFLPK 120
           DHLVSIARSELKRSEVADN+SG SKLSTVRTSSGMFISKSKDPIV+GIE+KIAAWTFLPK
Sbjct: 61  DHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKSKDPIVSGIEDKIAAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRMATVLMYLSDVTKGGETVFPMAEK 180
           ENGEDIQVLRYEHGQKYESHYDYFVDKVNIA GGHR+ATVLMYLS+VTKGGETVFP+AEK
Sbjct: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEK 180

Query: 181 SPHRRASETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240
           SP RRASETDEDLSECA++GIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS
Sbjct: 181 SPRRRASETDEDLSECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSSELPGYCRRSCRIC 300
           ATKWIHVDSFSKNLGNIG+C+DLNESCERWAALGECTKNPEYMVGS ELPGYCRRSCRIC
Sbjct: 241 ATKWIHVDSFSKNLGNIGDCSDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRIC 300

BLAST of Lag0016732 vs. ExPASy TrEMBL
Match: A0A6J1L4G1 (Procollagen-proline 4-dioxygenase OS=Cucurbita maxima OX=3661 GN=LOC111499059 PE=3 SV=1)

HSP 1 Score: 574.3 bits (1479), Expect = 2.9e-160
Identity = 278/300 (92.67%), Positives = 292/300 (97.33%), Query Frame = 0

Query: 1   MSKFLNLLLIFLISIASVARESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           MSKF  LL +FLI I+SV RES+CSYAGSA+STVDPSKVKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MSKFRYLLFLFLILISSVVRESSCSYAGSATSTVDPSKVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLVSIARSELKRSEVADNESGKSKLSTVRTSSGMFISKSKDPIVNGIEEKIAAWTFLPK 120
           DHLVSIARSELKRSEVADN+SG SKLSTVRTSSGMFISKSKDPIV+GIE+KIAAWTFLPK
Sbjct: 61  DHLVSIARSELKRSEVADNDSGDSKLSTVRTSSGMFISKSKDPIVSGIEDKIAAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRMATVLMYLSDVTKGGETVFPMAEK 180
           ENGEDIQVLRYEHGQKYESHYDYFVDKVNIA GGHR+ATVLMYLS+VTKGGETVFP+AEK
Sbjct: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIARGGHRLATVLMYLSNVTKGGETVFPLAEK 180

Query: 181 SPHRRASETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240
           SP RRASETDEDLSECA++GIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS
Sbjct: 181 SPRRRASETDEDLSECARQGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSSELPGYCRRSCRIC 300
           ATKWIHVDSFSKNLGNIG+CTDLNESCERWAALGECTKNPEYMVGS+ELPGYCRRSCRIC
Sbjct: 241 ATKWIHVDSFSKNLGNIGDCTDLNESCERWAALGECTKNPEYMVGSAELPGYCRRSCRIC 300

BLAST of Lag0016732 vs. TAIR 10
Match: AT5G18900.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 456.4 bits (1173), Expect = 1.7e-128
Identity = 218/293 (74.40%), Positives = 251/293 (85.67%), Query Frame = 0

Query: 8   LLIFLISIASVARESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIA 67
           LLI   +I SV  +S+ S   S+S  V+PSKVKQ+S KPRAFVYEGFLT+LECDH+VS+A
Sbjct: 6   LLISFFAIFSVLLQSSTSLISSSSVFVNPSKVKQVSSKPRAFVYEGFLTELECDHMVSLA 65

Query: 68  RSELKRSEVADNESGKSKLSTVRTSSGMFISKSKDPIVNGIEEKIAAWTFLPKENGEDIQ 127
           ++ LKRS VADN+SG+SK S VRTSSG FISK KDPIV+GIE+KI+ WTFLPKENGEDIQ
Sbjct: 66  KASLKRSAVADNDSGESKFSEVRTSSGTFISKGKDPIVSGIEDKISTWTFLPKENGEDIQ 125

Query: 128 VLRYEHGQKYESHYDYFVDKVNIAWGGHRMATVLMYLSDVTKGGETVFPMAEKSPHRRAS 187
           VLRYEHGQKY++H+DYF DKVNI  GGHRMAT+LMYLS+VTKGGETVFP AE    R  S
Sbjct: 126 VLRYEHGQKYDAHFDYFHDKVNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLS 185

Query: 188 ETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHV 247
           E  EDLS+CAK+GIAVKP+KGDALLFF+L P+AIPD  SLHGGCPV+EGEKWSATKWIHV
Sbjct: 186 ENKEDLSDCAKRGIAVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIHV 245

Query: 248 DSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSSELPGYCRRSCRIC 301
           DSF + +   GNCTD+NESCERWA LGECTKNPEYMVG++ELPGYCRRSC+ C
Sbjct: 246 DSFDRIVTPSGNCTDMNESCERWAVLGECTKNPEYMVGTTELPGYCRRSCKAC 298

BLAST of Lag0016732 vs. TAIR 10
Match: AT3G06300.1 (P4H isoform 2 )

HSP 1 Score: 443.0 bits (1138), Expect = 2.0e-124
Identity = 209/293 (71.33%), Positives = 251/293 (85.67%), Query Frame = 0

Query: 8   LLIFLISIASVARESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIA 67
           LL+F+  +  + + STC    S SS ++PSKVKQ+S KPRAFVYEGFLTDLECDHL+S+A
Sbjct: 8   LLLFVAILLVLLQSSTC-LISSPSSIINPSKVKQVSSKPRAFVYEGFLTDLECDHLISLA 67

Query: 68  RSELKRSEVADNESGKSKLSTVRTSSGMFISKSKDPIVNGIEEKIAAWTFLPKENGEDIQ 127
           +  L+RS VADN++G+S++S VRTSSG FISK KDPIV+GIE+K++ WTFLPKENGED+Q
Sbjct: 68  KENLQRSAVADNDNGESQVSDVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPKENGEDLQ 127

Query: 128 VLRYEHGQKYESHYDYFVDKVNIAWGGHRMATVLMYLSDVTKGGETVFPMAEKSPHRRAS 187
           VLRYEHGQKY++H+DYF DKVNIA GGHR+ATVL+YLS+VTKGGETVFP A++   R  S
Sbjct: 128 VLRYEHGQKYDAHFDYFHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLS 187

Query: 188 ETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHV 247
           E  +DLS+CAKKGIAVKPKKG+ALLFF+L+ +AIPD  SLHGGCPV+EGEKWSATKWIHV
Sbjct: 188 ENKDDLSDCAKKGIAVKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIHV 247

Query: 248 DSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSSELPGYCRRSCRIC 301
           DSF K L + GNCTD+NESCERWA LGEC KNPEYMVG+ E+PG CRRSC+ C
Sbjct: 248 DSFDKILTHDGNCTDVNESCERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 299

BLAST of Lag0016732 vs. TAIR 10
Match: AT3G28480.1 (Oxoglutarate/iron-dependent oxygenase )

HSP 1 Score: 359.4 bits (921), Expect = 2.8e-99
Identity = 171/291 (58.76%), Positives = 215/291 (73.88%), Query Frame = 0

Query: 11  FLISIASVARESTCSYAGSASS-TVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARS 70
           FL   ++    S      SASS   DP++V Q+SW PR F+YEGFL+D ECDH + +A+ 
Sbjct: 27  FLTRSSNTRDGSVIKMKTSASSFGFDPTRVTQLSWTPRVFLYEGFLSDEECDHFIKLAKG 86

Query: 71  ELKRSEVADNESGKSKLSTVRTSSGMFISKSKDPIVNGIEEKIAAWTFLPKENGEDIQVL 130
           +L++S VADN+SG+S  S VRTSSGMF+SK +D IV+ +E K+AAWTFLP+ENGE +Q+L
Sbjct: 87  KLEKSMVADNDSGESVESEVRTSSGMFLSKRQDDIVSNVEAKLAAWTFLPEENGESMQIL 146

Query: 131 RYEHGQKYESHYDYFVDKVNIAWGGHRMATVLMYLSDVTKGGETVFPMAEKSPHRRASET 190
            YE+GQKYE H+DYF D+ N+  GGHR+ATVLMYLS+V KGGETVFPM +    +     
Sbjct: 147 HYENGQKYEPHFDYFHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWK---GKATQLK 206

Query: 191 DEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDS 250
           D+  +ECAK+G AVKP+KGDALLFF+L PNA  D+NSLHG CPV+EGEKWSAT+WIHV S
Sbjct: 207 DDSWTECAKQGYAVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKWSATRWIHVKS 266

Query: 251 FSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSSELPGYCRRSCRIC 301
           F +       C D N SCE+WA  GEC KNP YMVGS +  GYCR+SC+ C
Sbjct: 267 FERAFNKQSGCMDENVSCEKWAKAGECQKNPTYMVGSDKDHGYCRKSCKAC 314

BLAST of Lag0016732 vs. TAIR 10
Match: AT3G28490.1 (Oxoglutarate/iron-dependent oxygenase )

HSP 1 Score: 337.4 bits (864), Expect = 1.2e-92
Identity = 170/298 (57.05%), Positives = 216/298 (72.48%), Query Frame = 0

Query: 5   LNLLLIFLISIASVARESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLV 64
           L+LLLIF             S   S S +VDP+++ Q+SW PRAF+Y+GFL+D ECDHL+
Sbjct: 11  LSLLLIF-------------SQISSFSFSVDPTRITQLSWTPRAFLYKGFLSDEECDHLI 70

Query: 65  SIARSELKRS-EVADNESGKSKLSTVRTSSGMFISKSKDPIVNGIEEKIAAWTFLPKENG 124
            +A+ +L++S  VAD +SG+S+ S VRTSSGMF++K +D IV  +E K+AAWTFLP+ENG
Sbjct: 71  KLAKGKLEKSMVVADVDSGESEDSEVRTSSGMFLTKRQDDIVANVEAKLAAWTFLPEENG 130

Query: 125 EDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRMATVLMYLSDVTKGGETVFP-MAEKSP 184
           E +Q+L YE+GQKY+ H+DYF DK  +  GGHR+ATVLMYLS+VTKGGETVFP    K+P
Sbjct: 131 EALQILHYENGQKYDPHFDYFYDKKALELGGHRIATVLMYLSNVTKGGETVFPNWKGKTP 190

Query: 185 HRRASETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSAT 244
             +    D+  S+CAK+G AVKP+KGDALLFF+L  N   D NSLHG CPV+EGEKWSAT
Sbjct: 191 QLK----DDSWSKCAKQGYAVKPRKGDALLFFNLHLNGTTDPNSLHGSCPVIEGEKWSAT 250

Query: 245 KWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSSELPGYCRRSCRIC 301
           +WIHV SF K       C D +ESC+ WA  GEC KNP YMVGS    G+CR+SC+ C
Sbjct: 251 RWIHVRSFGKKK---LVCVDDHESCQEWADAGECEKNPMYMVGSETSLGFCRKSCKAC 288

BLAST of Lag0016732 vs. TAIR 10
Match: AT3G28480.2 (Oxoglutarate/iron-dependent oxygenase )

HSP 1 Score: 337.0 bits (863), Expect = 1.5e-92
Identity = 165/299 (55.18%), Positives = 209/299 (69.90%), Query Frame = 0

Query: 11  FLISIASVARESTCSYAGSASS-TVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARS 70
           FL   ++    S      SASS   DP++V Q+SW PR F+YEGFL+D ECDH + +A+ 
Sbjct: 27  FLTRSSNTRDGSVIKMKTSASSFGFDPTRVTQLSWTPRVFLYEGFLSDEECDHFIKLAKG 86

Query: 71  ELKRSEVADNESGKS-----KLSTVRTSSGMFISKSK---DPIVNGIEEKIAAWTFLPKE 130
           +L++S VADN+SG+S      +S VR SS    +      D IV+ +E K+AAWTFLP+E
Sbjct: 87  KLEKSMVADNDSGESVESEDSVSVVRQSSSFIANMDSLEIDDIVSNVEAKLAAWTFLPEE 146

Query: 131 NGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRMATVLMYLSDVTKGGETVFPMAEKS 190
           NGE +Q+L YE+GQKYE H+DYF D+ N+  GGHR+ATVLMYLS+V KGGETVFPM +  
Sbjct: 147 NGESMQILHYENGQKYEPHFDYFHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWK-- 206

Query: 191 PHRRASETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSA 250
             +     D+  +ECAK+G AVKP+KGDALLFF+L PNA  D+NSLHG CPV+EGEKWSA
Sbjct: 207 -GKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKWSA 266

Query: 251 TKWIHVDSFSKNLGNIGNCTDLNESCERWAALGECTKNPEYMVGSSELPGYCRRSCRIC 301
           T+WIHV SF +       C D N SCE+WA  GEC KNP YMVGS +  GYCR+SC+ C
Sbjct: 267 TRWIHVKSFERAFNKQSGCMDENVSCEKWAKAGECQKNPTYMVGSDKDHGYCRKSCKAC 322

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
XP_038874583.1	4.9e-162	93.33	probable prolyl 4-hydroxylase 4 [Benincasa hispida]
XP_008458517.1	9.3e-161	92.33	PREDICTED: probable prolyl 4-hydroxylase 4 [Cucumis melo] >XP_008458518.1 PREDIC...
XP_023000081.1	1.2e-160	93.00	probable prolyl 4-hydroxylase 4 [Cucurbita maxima] >XP_023000082.1 probable prol...
KAG6575033.1	2.7e-160	92.67	putative prolyl 4-hydroxylase 4, partial [Cucurbita argyrosperma subsp. sororia]
XP_022959148.1	3.5e-160	92.67	probable prolyl 4-hydroxylase 4 [Cucurbita moschata] >XP_022959149.1 probable pr...

Match Name	E-value	Identity (%)	Description
Q8LAN3	2.4e-127	74.40	Probable prolyl 4-hydroxylase 4 OS=Arabidopsis thaliana OX=3702 GN=P4H4 PE=2 SV=...
F4JAU3	2.8e-123	71.33	Prolyl 4-hydroxylase 2 OS=Arabidopsis thaliana OX=3702 GN=P4H2 PE=1 SV=1
Q8L970	4.0e-98	58.76	Probable prolyl 4-hydroxylase 7 OS=Arabidopsis thaliana OX=3702 GN=P4H7 PE=2 SV=...
F4J0A8	1.6e-91	57.05	Probable prolyl 4-hydroxylase 6 OS=Arabidopsis thaliana OX=3702 GN=P4H6 PE=2 SV=...
Q9LN20	6.9e-66	56.46	Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana OX=3702 GN=P4H3 PE=2 SV=...

Match Name	E-value	Identity (%)	Description
A0A1S3C816	4.5e-161	92.33	Procollagen-proline 4-dioxygenase OS=Cucumis melo OX=3656 GN=LOC103497901 PE=3 S...
A0A5A7SVW6	4.5e-161	92.33	Procollagen-proline 4-dioxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C2...
A0A6J1KCK1	5.9e-161	93.00	Procollagen-proline 4-dioxygenase OS=Cucurbita maxima OX=3661 GN=LOC111494385 PE...
A0A6J1H545	1.7e-160	92.67	Procollagen-proline 4-dioxygenase OS=Cucurbita moschata OX=3662 GN=LOC111460227 ...
A0A6J1L4G1	2.9e-160	92.67	Procollagen-proline 4-dioxygenase OS=Cucurbita maxima OX=3661 GN=LOC111499059 PE...

Match Name	E-value	Identity (%)	Description
AT5G18900.1	1.7e-128	74.40	2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
AT3G06300.1	2.0e-124	71.33	P4H isoform 2
AT3G28480.1	2.8e-99	58.76	Oxoglutarate/iron-dependent oxygenase
AT3G28490.1	1.2e-92	57.05	Oxoglutarate/iron-dependent oxygenase
AT3G28480.2	1.5e-92	55.18	Oxoglutarate/iron-dependent oxygenase

InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR003582	ShKT domain	SMART	SM00254	ShkT_1	coord: 259..300; e-value: 0.0081; score: 25.4
IPR003582	ShKT domain	PFAM	PF01549	ShK	coord: 259..300; e-value: 9.4E-5; score: 22.9
IPR003582	ShKT domain	PROSITE	PS51670	SHKT	coord: 260..300; score: 8.27237
IPR006620	Prolyl 4-hydroxylase, alpha subunit	SMART	SM00702	p4hc	coord: 46..246; e-value: 5.8E-63; score: 225.2
IPR044862	Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain	PFAM	PF13640	2OG-FeII_Oxy_3	coord: 126..246; e-value: 2.7E-20; score: 73.0
None	No IPR available	GENE3D	2.60.120.620	q2cbj1_9rhob like domain	coord: 38..247; e-value: 4.6E-76; score: 257.1
None	No IPR available	PANTHER	PTHR10869:SF175	PROLYL 4-HYDROXYLASE SUBUNIT ALPHA-LIKE PROTEIN	coord: 7..300
IPR045054	Prolyl 4-hydroxylase	PANTHER	PTHR10869	PROLYL 4-HYDROXYLASE ALPHA SUBUNIT	coord: 7..300
IPR005123	Oxoglutarate/iron-dependent dioxygenase	PROSITE	PS51471	FE2OG_OXY	coord: 122..247; score: 12.434464

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Lag0016732.1	Lag0016732.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0018401	peptidyl-proline hydroxylation to 4-hydroxy-L-proline
cellular_component	GO:0005789	endoplasmic reticulum membrane
molecular_function	GO:0005506	iron ion binding
molecular_function	GO:0031418	L-ascorbic acid binding
molecular_function	GO:0004656	procollagen-proline 4-dioxygenase activity
molecular_function	GO:0016705	oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen