Bhi06G000377 (gene) Wax gourd (B227) v1

Overview
NameBhi06G000377
Typegene
OrganismBenincasa hispida (Wax gourd (B227) v1)
DescriptionProcollagen-proline 4-dioxygenase
Locationchr6: 10177272 .. 10184093 (+)
RNA-Seq ExpressionBhi06G000377
SyntenyBhi06G000377
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAAAAATTTATATCAAATGAATCTTTCTCTCTCCGTCGTAGTTTCGTCGTTTTTTTATGTAAAAATTCCAAAAAATAGAGAGATTTTATTTGAAGTTTCTGAATGTGCAATTTCCGAAAAACAGGCCACCGATTCTCAATTTTCATTTCATTGTCTTGATAGTTGAATTTTGTAATAAATTCAAAGCCATTTACTTTTCTTCTTCCCTTCTCGCACATCGAAGAACTCTTGAGCAGTTGAATTTTTTCTTGTTCATTTCTCCGATTTGATATCGGAGAAAACAATCATGGATTCCCGACGATTCCTCGCTTTTTGTCTCTGTTTTCTCTCCGTCTTTACAGGCTTCGCTCGCTTGCCGGAATTGCGTTCGCAGAAGAAATCGTACGATTCTTTTCTTTCGTCTTCTTCTTGCCCTTCATTTTTTTTGGTAATTTCACGAATCCCACTGACGCATGGATTTGGATTCAACTGTTTTTGGAATCAGAAGTGGATCTGTGATTCGATTGAAGACGGATTCATCTCCGCTCGTTTTCGATCCAACACGAGTCACTCAACTCTCCTGGGAACCCAGGTAATCTGTCTTTCTCTCTTAGGCGATCGAATAAGATGCTCCTTTACTCTCTGGCTTGATTATGTAGAAAGATAATCCTGACCACATTTCCCATCAATCAGTTTTTAATGTATCCTTCATTGGCGATGTGTTTGTTAATATTCCTCTGAAAACTTCAAATCTATAGGGCATTTTTGTATAAGGGATTTTTATCTGATAAGGAATGTGATCATCTAATCGATCTGGTAAGTAATTTTGGAACGGTTTGTTTTAATTTAGATTTCAATATTGTGGTAATTGCATATGTTTGTTTGATATCTATTATTTTCGGGGGGAAATGGAATAGGCCAAGGATAAATTGGAGAAGTCAATGGTAGCAGATAATGAGTCTGGTAAGAGTGTAAGTAGTGAAGTCCGAACGAGTTCTGGCATGTTCCTTCGGAAAGCCCAGGTGCGTCAGTTTATTTGAATCATCTCATCCATGATTATAATGTTCGACTTTTTGAACTTATTTTTATGTTATTGGATGTAATAGAACGTGGTATCATGGTATGTTCTTGTAGAATTAGGTGAGAAGGATAATGTAATTGTAGATGTTAATGTATGGAGTTACAAACGTGTTATCCAATATCTTGGTGTTGGTAAACATTCTAAGGTGAAAACTTCTAGGGAAAACTGTGTGAACTTGTATAAATTATGACTGCATGAATTTTCCATAATGTGTTTGTGCGGATGTATGTAAGATTACTTGTTTTAGACAGAAAGTGATGTCTGAAAATAAATGAAAAAACTTGTTCTCGATGTGTAGTTTGACCTTGTATATGTGTTTGGGGCATTGTTTTGGAGGAGAATTATCAACATCCCTGTTCAATAACCTGTTCCAACAATATCCCTTGATATCATATCATGAGGAAGAATCCATCTACCCTCTCTCTTCTTATCATCATCCATACTGTTGGCATTGGATGTTCAACTAAATGACATTCTTTGCAACATCCTTGACTTCCTCTTCCTTTGAAATGGATTGTTAGAACCTACAGATCTCCATTTAGAAGAGTAGTAGGGTAAATATTCATTGTTATTACCTCAGGTGTTTGAGAACTTGGTTTCTGGGATTCTGCTTGTCTTTAGAGAAAATGATTTCCTTCGTTAATAAAACTTATGTAGAAGATATATTGACTTCAAGTCTAAACACCTGGTGAAATGACCAGAAGAGCAATAATAAATTGCTATTTTCGTTAAACTTGAAGCTTCAATATATCTTCTACACAAACCCAGCTGGAAAACTATGAGTTGTATCAATGTTTCATAATGAGATTACTCAAGTCCCCTCCCGAGCTTATATCTTAACCCCAGAAGGAGGCGTGAAGGCAGCAGACACTTCCCTTTTGTAATATCTGCTGTCTAAAAACTTATTTTCCAGCTAAAACTCATAGTTGAGTAATCTCATTATGAAACATTGATACAAGTCATTACCATTTAACAAGTAATAAATACAAATGTACAATGTTTTTGTACATTCCTATTGCAAATGTCGAAACGAGATGATTTCTACAATCATTCCGCCGACTGGTTCTAGGCACAACCCATCGTCCTTTTCTTGATACCTGGGGGATAAGAAAACATTTAGAAAACGTGAGCTGGAAGACTCAGTGAGTGGTAATTTGAATTAAAATAGTATTTGCTTTCGAAAACATTTTTGGAAAGATAATATCACAACATATGCAACCATAGAATATATCATCTCAGTCTGGTAATTGAGACAAAGTTGGCATGATCACATATTCCTCCAAATATGCAATAAATCATCAATAACCTGGTTTGGGGAACCAATTTCTAGTACCCCAAACAACTCGTCAAATATGCACATGTCGACAATTCCCGCTAGATATACCTGGGCACGCTGGCAATCCCAGCCAAATGTCCACGAAATATGCTAACAATATCCTAGGTACATTTTCCAATTTCCAATCAAACCAATGAAAAACAAGCATCCAACGAACCATAGTCAAAACAATCATCCAATTAAAGAATGGTAAACTAGTAGTTAAAATCCAAGAAAACATATTTACAGATCATGCCAAGTTTTCAAAACTGTACATATATATTATCACATATGTATAAATACATTGAAACAATCAAAATAAACCACTCACCGTTGCGGGGTCTGCTTCCTCCAAAGTTCTTAATTCCCTTATTTCCCTTGTATTAACCAAGTAATTGCTTAGATTCCCGTGCTTATGGTTAATCATAGTCTTGAAACAATTCAAATTTGGTTTACTAACTCTAAAAATCCCAAAACAAACCTTTGAACTGGAAAAAGACAATTGATGGCTGCAAGGTTGACTTTTCTGCACAGTTGGGCACAAGGGTTGACCTTCCCTACACAGTTGGGTGCATGGTTTGACAGACAACTTGTTGTTGATGCAAGACAACTCCATCTACCGCAAGGTTGGGCGCATGGATGTTGCCAATGATGCCAAGATTGTGCCAACTTCTTCCAATCACACTTTTATCACTCTCTTTTTGAAGGACTTTTGAGTTGTGAGGTGAAAGAGAAGTCCCTGAGGGTATTTATAGGCTCTTACCCTTCATCCTTGTCATGCTTCCTTGCCACCAACACATGACACCTCCACCTTCCAAGCTTTTTGAGTGTCAACACCACTGTAGGCTGGTATCTTACTTCCTTGCCACCACCAACTTTTATGTAAAGCATGTTGTGCCTATCACCCACAAAGCCAACACATGCTACCCTTTGAGTTGGCCAACTCTTGTTTTGCATGCCCAAAATTGCATCTTCATCTTCTAGCTCCTACTTAAACCAACCAAACTTCACCCGATTGAGCCATACTCCTCTCTTTCTTGCAATTAATTTAACTTTCCTACCATGGGCGCATGGCCCTCTTGTTGGGCGCATGGTCCCTCCATCCATGCGTCCAACTCCCTCATATTTTCCATTCTTGGATCTAACTTCTTTTTTGTTGTACTATGACTTCTCTTGATTATGTCCGAAATGTTCAACAAGGCTTGCTATGCGTCCATCCTCTCCCCTATGCGCCCATCTTTAGTTTTTCCTAGGGTTTCTCTAGTTGTCATGTCCTATGTTGTCAATCACTTAGCCATACTTCTTCTTTCAACATCCCCCCTAGCCCCTAATCTTGTAAAATCTTGCACCAATCTCATGTTCACTTAAGGTGTGTATGATTCATCTCATGCTCTCTTAACTTGCTAACAACTTGACTCTTAATATTCAAGAACTTCCCATGAATGCATTATTTCTCATGATTGACGCATGGTTGCCCTTGTTGGGCGCATGGACCCTTCTTTCATGCGTCCAGCTCTCATTTCCTCCCTCTTGGTTTTTGTTTCCCTTGCTACAATAGAATTTCTTCCCTCCAAAATGTCCAAAATTGCCCCCTATGCGCCAACCGAGTAAGTGTTCTCCTCCGATTAAAGCATGGCTTGATTATAGCATAGCCTCTTCCGACTAAATCATAGCGTGATTGTAGCATAGCCTCTTCTGATTAAAGCATAGCTTGAACCCTAGTATGCGCCCATGAAGCCCTAGTATGCGCCCATGAACCCTAGTATGCGTCAATGAATCCTAGTATGTGTCCATGAACCCCAGTATGCGCCCATGAACCCTAGTATGGGCCCAACTTGTCTTTGACCATCCTAGGGTTGCGCCCATCTTTGTCCTAGATAGTGTAGCTCCTTGTGTTTAGCGTGTATTCTCCTCCAATTATAGCATAGCTCTCAAGATTTCATCAAGGTGACTTCTTTTTTTCCTTCGGAGTCCAATGAGGGCATCTTGGAATCCTCTCCATGATTAGGTCATGCGCCCATCCCTTGATGTATGCACCAATCATGCCTCATGTGCCCATATGATGATTACTAATGCATGGTTGGCTACATGGTGCCCCAACTAGGCTTCCAACTTGTCTCATTGTTCCCTACTTGATTTATAATAAAGGTACGCACTTGTTCCTTTGAGGTATGCGCCCATCCTAGGCTTACTTCCCTAGGTATGCACCCGACTTGATCCATAGACTTCTTTGTTGTAATTTTTTTCCAAGGGGTTGTCTTATACTTCCACCTTATTCCCCATGGATCCAAAACGGATTCCATAAGATTCCCCTACGCGCCCCAAAACCTACATTATGCGCCCATCTCCCCCCGTTTACTCACTCTTCCATAACTTCTACCTTTCAAACATGGTTCTTTTCTATCAACCCATGGTATAGCCATATCTCCTACTTGTCCCTCATTTTCCCACAAGGCTCTTATGCTTTAGTCAAACTTTTAAATTCAATTTTGGAAAACTCGGATTTCAAATCTTGCTAAGGTCAGGATGTCACAATAATATAGGTCTGCCTATACACTGCAAATTGGTTAATTGAATTACAATCCATTACTAATTTTACCTCCAAGTCTAAACTAAAGAAACATGGATGACAACAATTAAAGATTAAATTTATTTTGTTAGTGATAATATTGATGACAGTATTTAGAATGATCTCTAGACTATTTATTTAAGAAAAAGCGACTCTTGATAGATTTGTTTTACATATCAATTTTTTAATATTTCAATCATGGCTTATGCCTCAAATCGCTTTGAAGCTGAATTATAATGTATATGATTTGTTTTGTGCTTTGAAGGATGAGATTGTTGCTGCCATTGAGGCCAGGATATCTGCATGGACATTACTTCCCGCAGGTATAATTGTTGGTTGCTCATGGTCATGGATGCAATTTTTTCTTCCCTTTAAATTTCAGATTATAAAGCTATTCCAATCACTTACCTTTCTTTCTTGGAACTGTTGTTAAAATCAGAAAATGGAGAATCCATTCAAATTCTTCACTATGAGAATGGTCAAAAGTATGAACCACATTTTGATTTTTTTCACGACAAGGTGAATCAGGAGTTGGGTGGCCACCGAATAGCCACAGTCTTGATGTATTTATCCAATGTTGAAAAGGGTGGAGAAACCATCTTTCCAAATTCAGAGGTATGGCAATGGTTCTGCTACTTCAGTGTCTTTGTTTTCTGAAAGACGGAAGTATATTTTTTTTTTTGTTGCCAACTGTGATGACTTTGTCTTCTGCAGTTTAAAGAATCTCAAGAAAAGGATGAAAGCTGGTCTGATTGTGCTCGAAAGGGTTATGCAGGTAGGTATTTTTATGAGTACAGATCACAAGTCATCAATGTATGATGCTACATTTTCAATTTAGAAAAGGCCACGGATGGCATATCCATCATTGATAGACCTTCCCTGAAGTATCTTGAACTTGATCTTTCTTTCTGGTCTTTGAGAAAGGGTACTCATCGCTGGCAAATATTCTTTATAGTGAGCCATCTTTCCAAGAGATGCTCGTCTGCGTCATTGATTTGTCACTATAGTCTACTCAATCTATAGTCTGGTTTAGAAAAACTTTGGTTGTCCAATTAGTTTGAAACGTTAGCTGAGACTATTGCTTGTGATTTTGCATGTTCTAACTTGTATTGTTATAAAAGGCACATCTTGACGCCATTACAAAGGCAATGCACACTCAGTAAAACCTATTTGTAGTTCCTTTTTCTTTAGAAGTGAAATATATTGCTTTGATGTTTATGTCTCTAAACATTATTAATTTTGTAGGCTCTATTGTAAAGTGAAATATATTTCTTGACTTTTTCATTATGAGGATCTATTCTATTTTTTTTCCTTGTTTACTATAGCTTCTACTACTTGGCTGTTCATTGCAGTTAAAGCACGGAAGGGTGATGCACTGTTGTTCTTCAGCCTCCGTCCCGATGCAACGACGGATGTCAAAAGCTTGCACGGTAGTTGCCCTGTAATTGAGGGCGAGAAATGGTCTGCAACCAAGTGGATTCACGTGAGATCCTTTGAGAAGGCAACTCGTGTAAGTAGGCAGGATTGTGTGGACGAGAACGAAAATTGCCAGATATGGGCAAAAAGGGGAGAGTGCAAAAAGAACCCGACCTACATGGTGGGTTCTGAAGATGCTTTAGGATACTGTAGGAAGAGTTGCAGAGCATGTTAAAAACCTAGGAAGAAGAAGTACACATCTCTATCTCACTTTTTTTTTTGTTGATTCTGTGATGGCTATGTATATAAAAACATTGGGCAGTAACTGGGTATACAATACGAGTGGATATTACATTTCTTTCATTTAACCCCTTGTAGTAGTAATTAATTAGCCACAAGTGTTTCATTTGGTAATAAAAGCAATGAGAAGTTTTCTCATGTATGATGCTTATTGATTGTAG

mRNA sequence

AAAAAAATTTATATCAAATGAATCTTTCTCTCTCCGTCGTAGTTTCGTCGTTTTTTTATGTAAAAATTCCAAAAAATAGAGAGATTTTATTTGAAGTTTCTGAATGTGCAATTTCCGAAAAACAGGCCACCGATTCTCAATTTTCATTTCATTGTCTTGATAGTTGAATTTTGTAATAAATTCAAAGCCATTTACTTTTCTTCTTCCCTTCTCGCACATCGAAGAACTCTTGAGCAGTTGAATTTTTTCTTGTTCATTTCTCCGATTTGATATCGGAGAAAACAATCATGGATTCCCGACGATTCCTCGCTTTTTGTCTCTGTTTTCTCTCCGTCTTTACAGGCTTCGCTCGCTTGCCGGAATTGCGTTCGCAGAAGAAATCAAGTGGATCTGTGATTCGATTGAAGACGGATTCATCTCCGCTCGTTTTCGATCCAACACGAGTCACTCAACTCTCCTGGGAACCCAGGGCATTTTTGTATAAGGGATTTTTATCTGATAAGGAATGTGATCATCTAATCGATCTGGCCAAGGATAAATTGGAGAAGTCAATGGTAGCAGATAATGAGTCTGGTAAGAGTGTAAGTAGTGAAGTCCGAACGAGTTCTGGCATGTTCCTTCGGAAAGCCCAGGATGAGATTGTTGCTGCCATTGAGGCCAGGATATCTGCATGGACATTACTTCCCGCAGAAAATGGAGAATCCATTCAAATTCTTCACTATGAGAATGGTCAAAAGTATGAACCACATTTTGATTTTTTTCACGACAAGGTGAATCAGGAGTTGGGTGGCCACCGAATAGCCACAGTCTTGATGTATTTATCCAATGTTGAAAAGGGTGGAGAAACCATCTTTCCAAATTCAGAGTTTAAAGAATCTCAAGAAAAGGATGAAAGCTGGTCTGATTGTGCTCGAAAGGGTTATGCAGTTAAAGCACGGAAGGGTGATGCACTGTTGTTCTTCAGCCTCCGTCCCGATGCAACGACGGATGTCAAAAGCTTGCACGGTAGTTGCCCTGTAATTGAGGGCGAGAAATGGTCTGCAACCAAGTGGATTCACGTGAGATCCTTTGAGAAGGCAACTCGTGTAAGTAGGCAGGATTGTGTGGACGAGAACGAAAATTGCCAGATATGGGCAAAAAGGGGAGAGTGCAAAAAGAACCCGACCTACATGGTGGGTTCTGAAGATGCTTTAGGATACTGTAGGAAGAGTTGCAGAGCATGTTAAAAACCTAGGAAGAAGAAGTACACATCTCTATCTCACTTTTTTTTTTGTTGATTCTGTGATGGCTATGTATATAAAAACATTGGGCAGTAACTGGGTATACAATACGAGTGGATATTACATTTCTTTCATTTAACCCCTTGTAGTAGTAATTAATTAGCCACAAGTGTTTCATTTGGTAATAAAAGCAATGAGAAGTTTTCTCATGTATGATGCTTATTGATTGTAG

Coding sequence (CDS)

ATGGATTCCCGACGATTCCTCGCTTTTTGTCTCTGTTTTCTCTCCGTCTTTACAGGCTTCGCTCGCTTGCCGGAATTGCGTTCGCAGAAGAAATCAAGTGGATCTGTGATTCGATTGAAGACGGATTCATCTCCGCTCGTTTTCGATCCAACACGAGTCACTCAACTCTCCTGGGAACCCAGGGCATTTTTGTATAAGGGATTTTTATCTGATAAGGAATGTGATCATCTAATCGATCTGGCCAAGGATAAATTGGAGAAGTCAATGGTAGCAGATAATGAGTCTGGTAAGAGTGTAAGTAGTGAAGTCCGAACGAGTTCTGGCATGTTCCTTCGGAAAGCCCAGGATGAGATTGTTGCTGCCATTGAGGCCAGGATATCTGCATGGACATTACTTCCCGCAGAAAATGGAGAATCCATTCAAATTCTTCACTATGAGAATGGTCAAAAGTATGAACCACATTTTGATTTTTTTCACGACAAGGTGAATCAGGAGTTGGGTGGCCACCGAATAGCCACAGTCTTGATGTATTTATCCAATGTTGAAAAGGGTGGAGAAACCATCTTTCCAAATTCAGAGTTTAAAGAATCTCAAGAAAAGGATGAAAGCTGGTCTGATTGTGCTCGAAAGGGTTATGCAGTTAAAGCACGGAAGGGTGATGCACTGTTGTTCTTCAGCCTCCGTCCCGATGCAACGACGGATGTCAAAAGCTTGCACGGTAGTTGCCCTGTAATTGAGGGCGAGAAATGGTCTGCAACCAAGTGGATTCACGTGAGATCCTTTGAGAAGGCAACTCGTGTAAGTAGGCAGGATTGTGTGGACGAGAACGAAAATTGCCAGATATGGGCAAAAAGGGGAGAGTGCAAAAAGAACCCGACCTACATGGTGGGTTCTGAAGATGCTTTAGGATACTGTAGGAAGAGTTGCAGAGCATGTTAA

Protein sequence

MDSRRFLAFCLCFLSVFTGFARLPELRSQKKSSGSVIRLKTDSSPLVFDPTRVTQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVAAIEARISAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDESWSDCARKGYAVKARKGDALLFFSLRPDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKATRVSRQDCVDENENCQIWAKRGECKKNPTYMVGSEDALGYCRKSCRAC
Homology
BLAST of Bhi06G000377 vs. TAIR 10
Match: AT3G28480.1 (Oxoglutarate/iron-dependent oxygenase )

HSP 1 Score: 457.6 bits (1176), Expect = 8.0e-129
Identity = 219/315 (69.52%), Postives = 257/315 (81.59%), Query Frame = 0

Query: 1   MDSRRFLAFCLCFLSVFTGFARLPE---LRSQKKSSGSVIRLKTDSSPLVFDPTRVTQLS 60
           MDSR FLAF LCFL      +  P     RS     GSVI++KT +S   FDPTRVTQLS
Sbjct: 1   MDSRIFLAFSLCFLFTLPLISSAPNRFLTRSSNTRDGSVIKMKTSASSFGFDPTRVTQLS 60

Query: 61  WEPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDE 120
           W PR FLY+GFLSD+ECDH I LAK KLEKSMVADN+SG+SV SEVRTSSGMFL K QD+
Sbjct: 61  WTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGMFLSKRQDD 120

Query: 121 IVAAIEARISAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY 180
           IV+ +EA+++AWT LP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHRIATVLMY
Sbjct: 121 IVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRIATVLMY 180

Query: 181 LSNVEKGGETIFPNSEFKESQEKDESWSDCARKGYAVKARKGDALLFFSLRPDATTDVKS 240
           LSNVEKGGET+FP  + K +Q KD+SW++CA++GYAVK RKGDALLFF+L P+ATTD  S
Sbjct: 181 LSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNATTDSNS 240

Query: 241 LHGSCPVIEGEKWSATKWIHVRSFEKATRVSRQDCVDENENCQIWAKRGECKKNPTYMVG 300
           LHGSCPV+EGEKWSAT+WIHV+SFE+A    +  C+DEN +C+ WAK GEC+KNPTYMVG
Sbjct: 241 LHGSCPVVEGEKWSATRWIHVKSFERAFN-KQSGCMDENVSCEKWAKAGECQKNPTYMVG 300

Query: 301 SEDALGYCRKSCRAC 313
           S+   GYCRKSC+AC
Sbjct: 301 SDKDHGYCRKSCKAC 314

BLAST of Bhi06G000377 vs. TAIR 10
Match: AT3G28480.2 (Oxoglutarate/iron-dependent oxygenase )

HSP 1 Score: 431.0 bits (1107), Expect = 8.0e-121
Identity = 212/323 (65.63%), Postives = 250/323 (77.40%), Query Frame = 0

Query: 1   MDSRRFLAFCLCFLSVFTGFARLPE---LRSQKKSSGSVIRLKTDSSPLVFDPTRVTQLS 60
           MDSR FLAF LCFL      +  P     RS     GSVI++KT +S   FDPTRVTQLS
Sbjct: 1   MDSRIFLAFSLCFLFTLPLISSAPNRFLTRSSNTRDGSVIKMKTSASSFGFDPTRVTQLS 60

Query: 61  WEPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSE-----VRTSSGMFLR 120
           W PR FLY+GFLSD+ECDH I LAK KLEKSMVADN+SG+SV SE     VR SS     
Sbjct: 61  WTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEDSVSVVRQSSSFIAN 120

Query: 121 KAQ---DEIVAAIEARISAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGH 180
                 D+IV+ +EA+++AWT LP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGH
Sbjct: 121 MDSLEIDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGH 180

Query: 181 RIATVLMYLSNVEKGGETIFPNSEFKESQEKDESWSDCARKGYAVKARKGDALLFFSLRP 240
           RIATVLMYLSNVEKGGET+FP  + K +Q KD+SW++CA++GYAVK RKGDALLFF+L P
Sbjct: 181 RIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHP 240

Query: 241 DATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKATRVSRQDCVDENENCQIWAKRGECK 300
           +ATTD  SLHGSCPV+EGEKWSAT+WIHV+SFE+A    +  C+DEN +C+ WAK GEC+
Sbjct: 241 NATTDSNSLHGSCPVVEGEKWSATRWIHVKSFERAFN-KQSGCMDENVSCEKWAKAGECQ 300

Query: 301 KNPTYMVGSEDALGYCRKSCRAC 313
           KNPTYMVGS+   GYCRKSC+AC
Sbjct: 301 KNPTYMVGSDKDHGYCRKSCKAC 322

BLAST of Bhi06G000377 vs. TAIR 10
Match: AT3G28490.1 (Oxoglutarate/iron-dependent oxygenase )

HSP 1 Score: 410.6 bits (1054), Expect = 1.1e-114
Identity = 206/313 (65.81%), Postives = 239/313 (76.36%), Query Frame = 0

Query: 1   MDSRRFLAFCLCFLSVFTGFARLPELRSQKKSSGSVIRLKTDSSPLVFDPTRVTQLSWEP 60
           MDS+ FLAF L  L +F+                     +  S     DPTR+TQLSW P
Sbjct: 1   MDSQYFLAFSLSLLLIFS---------------------QISSFSFSVDPTRITQLSWTP 60

Query: 61  RAFLYKGFLSDKECDHLIDLAKDKLEKSM-VADNESGKSVSSEVRTSSGMFLRKAQDEIV 120
           RAFLYKGFLSD+ECDHLI LAK KLEKSM VAD +SG+S  SEVRTSSGMFL K QD+IV
Sbjct: 61  RAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSEVRTSSGMFLTKRQDDIV 120

Query: 121 AAIEARISAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLS 180
           A +EA+++AWT LP ENGE++QILHYENGQKY+PHFD+F+DK   ELGGHRIATVLMYLS
Sbjct: 121 ANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYDKKALELGGHRIATVLMYLS 180

Query: 181 NVEKGGETIFPNSEFKESQEKDESWSDCARKGYAVKARKGDALLFFSLRPDATTDVKSLH 240
           NV KGGET+FPN + K  Q KD+SWS CA++GYAVK RKGDALLFF+L  + TTD  SLH
Sbjct: 181 NVTKGGETVFPNWKGKTPQLKDDSWSKCAKQGYAVKPRKGDALLFFNLHLNGTTDPNSLH 240

Query: 241 GSCPVIEGEKWSATKWIHVRSFEKATRVSRQDCVDENENCQIWAKRGECKKNPTYMVGSE 300
           GSCPVIEGEKWSAT+WIHVRSF K   V    CVD++E+CQ WA  GEC+KNP YMVGSE
Sbjct: 241 GSCPVIEGEKWSATRWIHVRSFGKKKLV----CVDDHESCQEWADAGECEKNPMYMVGSE 288

Query: 301 DALGYCRKSCRAC 313
            +LG+CRKSC+AC
Sbjct: 301 TSLGFCRKSCKAC 288

BLAST of Bhi06G000377 vs. TAIR 10
Match: AT5G18900.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 351.3 bits (900), Expect = 8.1e-97
Identity = 171/285 (60.00%), Postives = 215/285 (75.44%), Query Frame = 0

Query: 31  KSSGSVIRLKTDSSPLVFDPTRVTQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLEKSMV 90
           +SS S+I     SS +  +P++V Q+S +PRAF+Y+GFL++ ECDH++ LAK  L++S V
Sbjct: 19  QSSTSLI----SSSSVFVNPSKVKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAV 78

Query: 91  ADNESGKSVSSEVRTSSGMFLRKAQDEIVAAIEARISAWTLLPAENGESIQILHYENGQK 150
           ADN+SG+S  SEVRTSSG F+ K +D IV+ IE +IS WT LP ENGE IQ+L YE+GQK
Sbjct: 79  ADNDSGESKFSEVRTSSGTFISKGKDPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQK 138

Query: 151 YEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ---EKDESWSDC 210
           Y+ HFD+FHDKVN   GGHR+AT+LMYLSNV KGGET+FP++E    +   E  E  SDC
Sbjct: 139 YDAHFDYFHDKVNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENKEDLSDC 198

Query: 211 ARKGYAVKARKGDALLFFSLRPDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKATRV 270
           A++G AVK RKGDALLFF+L PDA  D  SLHG CPVIEGEKWSATKWIHV SF++    
Sbjct: 199 AKRGIAVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIHVDSFDRIVTP 258

Query: 271 SRQDCVDENENCQIWAKRGECKKNPTYMVGSEDALGYCRKSCRAC 313
           S  +C D NE+C+ WA  GEC KNP YMVG+ +  GYCR+SC+AC
Sbjct: 259 S-GNCTDMNESCERWAVLGECTKNPEYMVGTTELPGYCRRSCKAC 298

BLAST of Bhi06G000377 vs. TAIR 10
Match: AT3G06300.1 (P4H isoform 2 )

HSP 1 Score: 339.3 bits (869), Expect = 3.2e-93
Identity = 165/274 (60.22%), Postives = 210/274 (76.64%), Query Frame = 0

Query: 43  SSP-LVFDPTRVTQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSS 102
           SSP  + +P++V Q+S +PRAF+Y+GFL+D ECDHLI LAK+ L++S VADN++G+S  S
Sbjct: 27  SSPSSIINPSKVKQVSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVS 86

Query: 103 EVRTSSGMFLRKAQDEIVAAIEARISAWTLLPAENGESIQILHYENGQKYEPHFDFFHDK 162
           +VRTSSG F+ K +D IV+ IE ++S WT LP ENGE +Q+L YE+GQKY+ HFD+FHDK
Sbjct: 87  DVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDK 146

Query: 163 VNQELGGHRIATVLMYLSNVEKGGETIFPNS-EF--KESQEKDESWSDCARKGYAVKARK 222
           VN   GGHRIATVL+YLSNV KGGET+FP++ EF  +   E  +  SDCA+KG AVK +K
Sbjct: 147 VNIARGGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKK 206

Query: 223 GDALLFFSLRPDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKATRVSRQDCVDENEN 282
           G+ALLFF+L+ DA  D  SLHG CPVIEGEKWSATKWIHV SF+K       +C D NE+
Sbjct: 207 GNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIHVDSFDKIL-THDGNCTDVNES 266

Query: 283 CQIWAKRGECKKNPTYMVGSEDALGYCRKSCRAC 313
           C+ WA  GEC KNP YMVG+ +  G CR+SC+AC
Sbjct: 267 CERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 299

BLAST of Bhi06G000377 vs. ExPASy Swiss-Prot
Match: Q8L970 (Probable prolyl 4-hydroxylase 7 OS=Arabidopsis thaliana OX=3702 GN=P4H7 PE=2 SV=1)

HSP 1 Score: 457.6 bits (1176), Expect = 1.1e-127
Identity = 219/315 (69.52%), Postives = 257/315 (81.59%), Query Frame = 0

Query: 1   MDSRRFLAFCLCFLSVFTGFARLPE---LRSQKKSSGSVIRLKTDSSPLVFDPTRVTQLS 60
           MDSR FLAF LCFL      +  P     RS     GSVI++KT +S   FDPTRVTQLS
Sbjct: 1   MDSRIFLAFSLCFLFTLPLISSAPNRFLTRSSNTRDGSVIKMKTSASSFGFDPTRVTQLS 60

Query: 61  WEPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDE 120
           W PR FLY+GFLSD+ECDH I LAK KLEKSMVADN+SG+SV SEVRTSSGMFL K QD+
Sbjct: 61  WTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGMFLSKRQDD 120

Query: 121 IVAAIEARISAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMY 180
           IV+ +EA+++AWT LP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHRIATVLMY
Sbjct: 121 IVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRIATVLMY 180

Query: 181 LSNVEKGGETIFPNSEFKESQEKDESWSDCARKGYAVKARKGDALLFFSLRPDATTDVKS 240
           LSNVEKGGET+FP  + K +Q KD+SW++CA++GYAVK RKGDALLFF+L P+ATTD  S
Sbjct: 181 LSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNATTDSNS 240

Query: 241 LHGSCPVIEGEKWSATKWIHVRSFEKATRVSRQDCVDENENCQIWAKRGECKKNPTYMVG 300
           LHGSCPV+EGEKWSAT+WIHV+SFE+A    +  C+DEN +C+ WAK GEC+KNPTYMVG
Sbjct: 241 LHGSCPVVEGEKWSATRWIHVKSFERAFN-KQSGCMDENVSCEKWAKAGECQKNPTYMVG 300

Query: 301 SEDALGYCRKSCRAC 313
           S+   GYCRKSC+AC
Sbjct: 301 SDKDHGYCRKSCKAC 314

BLAST of Bhi06G000377 vs. ExPASy Swiss-Prot
Match: F4J0A8 (Probable prolyl 4-hydroxylase 6 OS=Arabidopsis thaliana OX=3702 GN=P4H6 PE=2 SV=1)

HSP 1 Score: 410.6 bits (1054), Expect = 1.6e-113
Identity = 206/313 (65.81%), Postives = 239/313 (76.36%), Query Frame = 0

Query: 1   MDSRRFLAFCLCFLSVFTGFARLPELRSQKKSSGSVIRLKTDSSPLVFDPTRVTQLSWEP 60
           MDS+ FLAF L  L +F+                     +  S     DPTR+TQLSW P
Sbjct: 1   MDSQYFLAFSLSLLLIFS---------------------QISSFSFSVDPTRITQLSWTP 60

Query: 61  RAFLYKGFLSDKECDHLIDLAKDKLEKSM-VADNESGKSVSSEVRTSSGMFLRKAQDEIV 120
           RAFLYKGFLSD+ECDHLI LAK KLEKSM VAD +SG+S  SEVRTSSGMFL K QD+IV
Sbjct: 61  RAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSEVRTSSGMFLTKRQDDIV 120

Query: 121 AAIEARISAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLS 180
           A +EA+++AWT LP ENGE++QILHYENGQKY+PHFD+F+DK   ELGGHRIATVLMYLS
Sbjct: 121 ANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYDKKALELGGHRIATVLMYLS 180

Query: 181 NVEKGGETIFPNSEFKESQEKDESWSDCARKGYAVKARKGDALLFFSLRPDATTDVKSLH 240
           NV KGGET+FPN + K  Q KD+SWS CA++GYAVK RKGDALLFF+L  + TTD  SLH
Sbjct: 181 NVTKGGETVFPNWKGKTPQLKDDSWSKCAKQGYAVKPRKGDALLFFNLHLNGTTDPNSLH 240

Query: 241 GSCPVIEGEKWSATKWIHVRSFEKATRVSRQDCVDENENCQIWAKRGECKKNPTYMVGSE 300
           GSCPVIEGEKWSAT+WIHVRSF K   V    CVD++E+CQ WA  GEC+KNP YMVGSE
Sbjct: 241 GSCPVIEGEKWSATRWIHVRSFGKKKLV----CVDDHESCQEWADAGECEKNPMYMVGSE 288

Query: 301 DALGYCRKSCRAC 313
            +LG+CRKSC+AC
Sbjct: 301 TSLGFCRKSCKAC 288

BLAST of Bhi06G000377 vs. ExPASy Swiss-Prot
Match: Q8LAN3 (Probable prolyl 4-hydroxylase 4 OS=Arabidopsis thaliana OX=3702 GN=P4H4 PE=2 SV=1)

HSP 1 Score: 351.3 bits (900), Expect = 1.1e-95
Identity = 171/285 (60.00%), Postives = 215/285 (75.44%), Query Frame = 0

Query: 31  KSSGSVIRLKTDSSPLVFDPTRVTQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLEKSMV 90
           +SS S+I     SS +  +P++V Q+S +PRAF+Y+GFL++ ECDH++ LAK  L++S V
Sbjct: 19  QSSTSLI----SSSSVFVNPSKVKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAV 78

Query: 91  ADNESGKSVSSEVRTSSGMFLRKAQDEIVAAIEARISAWTLLPAENGESIQILHYENGQK 150
           ADN+SG+S  SEVRTSSG F+ K +D IV+ IE +IS WT LP ENGE IQ+L YE+GQK
Sbjct: 79  ADNDSGESKFSEVRTSSGTFISKGKDPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQK 138

Query: 151 YEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ---EKDESWSDC 210
           Y+ HFD+FHDKVN   GGHR+AT+LMYLSNV KGGET+FP++E    +   E  E  SDC
Sbjct: 139 YDAHFDYFHDKVNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENKEDLSDC 198

Query: 211 ARKGYAVKARKGDALLFFSLRPDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKATRV 270
           A++G AVK RKGDALLFF+L PDA  D  SLHG CPVIEGEKWSATKWIHV SF++    
Sbjct: 199 AKRGIAVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIHVDSFDRIVTP 258

Query: 271 SRQDCVDENENCQIWAKRGECKKNPTYMVGSEDALGYCRKSCRAC 313
           S  +C D NE+C+ WA  GEC KNP YMVG+ +  GYCR+SC+AC
Sbjct: 259 S-GNCTDMNESCERWAVLGECTKNPEYMVGTTELPGYCRRSCKAC 298

BLAST of Bhi06G000377 vs. ExPASy Swiss-Prot
Match: F4JAU3 (Prolyl 4-hydroxylase 2 OS=Arabidopsis thaliana OX=3702 GN=P4H2 PE=1 SV=1)

HSP 1 Score: 339.3 bits (869), Expect = 4.5e-92
Identity = 165/274 (60.22%), Postives = 210/274 (76.64%), Query Frame = 0

Query: 43  SSP-LVFDPTRVTQLSWEPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSS 102
           SSP  + +P++V Q+S +PRAF+Y+GFL+D ECDHLI LAK+ L++S VADN++G+S  S
Sbjct: 27  SSPSSIINPSKVKQVSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVS 86

Query: 103 EVRTSSGMFLRKAQDEIVAAIEARISAWTLLPAENGESIQILHYENGQKYEPHFDFFHDK 162
           +VRTSSG F+ K +D IV+ IE ++S WT LP ENGE +Q+L YE+GQKY+ HFD+FHDK
Sbjct: 87  DVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDK 146

Query: 163 VNQELGGHRIATVLMYLSNVEKGGETIFPNS-EF--KESQEKDESWSDCARKGYAVKARK 222
           VN   GGHRIATVL+YLSNV KGGET+FP++ EF  +   E  +  SDCA+KG AVK +K
Sbjct: 147 VNIARGGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKK 206

Query: 223 GDALLFFSLRPDATTDVKSLHGSCPVIEGEKWSATKWIHVRSFEKATRVSRQDCVDENEN 282
           G+ALLFF+L+ DA  D  SLHG CPVIEGEKWSATKWIHV SF+K       +C D NE+
Sbjct: 207 GNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIHVDSFDKIL-THDGNCTDVNES 266

Query: 283 CQIWAKRGECKKNPTYMVGSEDALGYCRKSCRAC 313
           C+ WA  GEC KNP YMVG+ +  G CR+SC+AC
Sbjct: 267 CERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 299

BLAST of Bhi06G000377 vs. ExPASy Swiss-Prot
Match: Q9LN20 (Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana OX=3702 GN=P4H3 PE=2 SV=1)

HSP 1 Score: 254.6 bits (649), Expect = 1.4e-66
Identity = 122/209 (58.37%), Postives = 159/209 (76.08%), Query Frame = 0

Query: 56  LSWEPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQ 115
           LSWEPRAF+Y  FLS +EC++LI LAK  + KS V D+E+GKS  S VRTSSG FLR+ +
Sbjct: 79  LSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFLRRGR 138

Query: 116 DEIVAAIEARISAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVL 175
           D+I+  IE RI+ +T +PA++GE +Q+LHYE GQKYEPH+D+F D+ N + GG R+AT+L
Sbjct: 139 DKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQRMATML 198

Query: 176 MYLSNVEKGGETIFP--NSEFKESQEKDESWSDCARKGYAVKARKGDALLFFSLRPDATT 235
           MYLS+VE+GGET+FP  N  F      +E  S+C +KG +VK R GDALLF+S+RPDAT 
Sbjct: 199 MYLSDVEEGGETVFPAANMNFSSVPWYNE-LSECGKKGLSVKPRMGDALLFWSMRPDATL 258

Query: 236 DVKSLHGSCPVIEGEKWSATKWIHVRSFE 263
           D  SLHG CPVI G KWS+TKW+HV  ++
Sbjct: 259 DPTSLHGGCPVIRGNKWSSTKWMHVGEYK 286

BLAST of Bhi06G000377 vs. ExPASy TrEMBL
Match: A0A1S3C8G4 (Procollagen-proline 4-dioxygenase OS=Cucumis melo OX=3656 GN=LOC103498028 PE=3 SV=1)

HSP 1 Score: 584.3 bits (1505), Expect = 2.9e-163
Identity = 284/316 (89.87%), Postives = 298/316 (94.30%), Query Frame = 0

Query: 1   MDSRRFLAFCLCFLSVFTGFARLPELR----SQKKSSGSVIRLKTDSSPLVFDPTRVTQL 60
           MDSR FLAF LCFLSVFT FARLPE R    S K+S+GSV+RLKTDSSPL+FDPTRVTQL
Sbjct: 1   MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQL 60

Query: 61  SWEPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQD 120
           SW+PRAFLYKGFLSD+ECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQD
Sbjct: 61  SWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQD 120

Query: 121 EIVAAIEARISAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLM 180
           +IVA +EARI+AWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLM
Sbjct: 121 KIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLM 180

Query: 181 YLSNVEKGGETIFPNSEFKESQEKDESWSDCARKGYAVKARKGDALLFFSLRPDATTDVK 240
           YLSNVEKGGETIFPNSEFKESQEKD+SWSDC+RKGYAVKA+KGDALLFFSL  DATTD +
Sbjct: 181 YLSNVEKGGETIFPNSEFKESQEKDDSWSDCSRKGYAVKAQKGDALLFFSLHLDATTDER 240

Query: 241 SLHGSCPVIEGEKWSATKWIHVRSFEKATRVSRQDCVDENENCQIWAKRGECKKNPTYMV 300
           SLHGSCPVIEGEKWSATKWIHVRSFEK  RVSRQDCVDENENC  WAKRGECKKNPTYMV
Sbjct: 241 SLHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYMV 300

Query: 301 GSEDALGYCRKSCRAC 313
           GSE ALGYCRKSC+AC
Sbjct: 301 GSEGALGYCRKSCKAC 316

BLAST of Bhi06G000377 vs. ExPASy TrEMBL
Match: A0A0A0KS38 (Procollagen-proline 4-dioxygenase OS=Cucumis sativus OX=3659 GN=Csa_5G633280 PE=3 SV=1)

HSP 1 Score: 577.0 bits (1486), Expect = 4.7e-161
Identity = 280/313 (89.46%), Postives = 294/313 (93.93%), Query Frame = 0

Query: 1   MDSRRFLAFCLCFLSVFTGFARLPELRSQKKSSGSVIRLKTDSSPLVFDPTRVTQLSWEP 60
           MDSR FLAF LCFLSVFT FARLPE R+ K+SSGSV+RLKTDSSPL+FDPTRVTQLSW+P
Sbjct: 1   MDSRPFLAFSLCFLSVFTAFARLPETRTHKQSSGSVLRLKTDSSPLIFDPTRVTQLSWQP 60

Query: 61  RAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVA 120
           RAFLYKGFLSD ECDHLIDLAKDKLEKSMVADN+SGKSVSSEVRTSSGMFLRKAQDE+VA
Sbjct: 61  RAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGMFLRKAQDEVVA 120

Query: 121 AIEARISAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSN 180
            +EARI+AWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSN
Sbjct: 121 GVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSN 180

Query: 181 VEKGGETIFPNSEFKESQEKDESWSDCARKGYAVKARKGDALLFFSLRPDATTDVKSLHG 240
           VEKGGETIFPNSEFKESQ KDESWSDC+RKGYAVKA+KGDALLFFSL  DATTD +SLHG
Sbjct: 181 VEKGGETIFPNSEFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHG 240

Query: 241 SCPVIEGEKWSATKWIHVRSFEKAT-RVSRQDCVDENENCQIWAKRGECKKNPTYMVGSE 300
           SCPVI GEKWSATKWIHVRSFEK T RVSRQ CVDENENC  WAK+GECKKNPTYMVGS 
Sbjct: 241 SCPVIAGEKWSATKWIHVRSFEKITSRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSG 300

Query: 301 DALGYCRKSCRAC 313
            ALGYCRKSC+AC
Sbjct: 301 GALGYCRKSCKAC 313

BLAST of Bhi06G000377 vs. ExPASy TrEMBL
Match: A0A6J1BXN9 (Procollagen-proline 4-dioxygenase OS=Momordica charantia OX=3673 GN=LOC111006412 PE=3 SV=1)

HSP 1 Score: 573.9 bits (1478), Expect = 4.0e-160
Identity = 277/313 (88.50%), Postives = 293/313 (93.61%), Query Frame = 0

Query: 1   MDSRRFLAFCLCFLSVFTGFARLPELRSQKKSSGSVIRLKTDSSPLVFDPTRVTQLSWEP 60
           MDS RFL+F LCFL VFT  ARLP++R+ KK SGSV+RLK + SPL+FDPTRVTQLSW+P
Sbjct: 1   MDSPRFLSFSLCFLFVFTALARLPDMRAHKKISGSVLRLKGEPSPLIFDPTRVTQLSWQP 60

Query: 61  RAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVA 120
           RAFLYKGFLSDKECDHLIDLAKDKLEKSMVADN SGKSVSSEVRTSSGMFL KAQDEIVA
Sbjct: 61  RAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNNSGKSVSSEVRTSSGMFLHKAQDEIVA 120

Query: 121 AIEARISAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSN 180
           A+EARI+AWT LPAENGESIQILHYENGQKYEPHFD+FHDKVNQELGGHR+ATVLMYLSN
Sbjct: 121 AVEARIAAWTFLPAENGESIQILHYENGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSN 180

Query: 181 VEKGGETIFPNSEFKESQEKDESWSDCARKGYAVKARKGDALLFFSLRPDATTDVKSLHG 240
           VEKGGETIFPNSEFKESQEKD+SWSDCARKGYAVKA+KGDALLFFSL  DATTDVKSLHG
Sbjct: 181 VEKGGETIFPNSEFKESQEKDDSWSDCARKGYAVKAKKGDALLFFSLHLDATTDVKSLHG 240

Query: 241 SCPVIEGEKWSATKWIHVRSFEKATRVSRQ-DCVDENENCQIWAKRGECKKNPTYMVGSE 300
           SCPVIEGEKWSATKWIHVRSFEK TR SR+ DCVDENENC  WAKRGECKKNPTYMVGSE
Sbjct: 241 SCPVIEGEKWSATKWIHVRSFEKPTRPSRRLDCVDENENCASWAKRGECKKNPTYMVGSE 300

Query: 301 DALGYCRKSCRAC 313
            ALGYCRKSC+AC
Sbjct: 301 SALGYCRKSCQAC 313

BLAST of Bhi06G000377 vs. ExPASy TrEMBL
Match: A0A6J1FJ93 (Procollagen-proline 4-dioxygenase OS=Cucurbita moschata OX=3662 GN=LOC111444767 PE=3 SV=1)

HSP 1 Score: 558.1 bits (1437), Expect = 2.3e-155
Identity = 274/312 (87.82%), Postives = 289/312 (92.63%), Query Frame = 0

Query: 1   MDSRRFLAFCLCFLSVFTGFARLPELRSQKKSSGSVIRLKTDSSPLVFDPTRVTQLSWEP 60
           MDSRRFLAF L FLSV TGFARLPE  + KK SGSV+ LK DS  L+FDPTRVTQLSW+P
Sbjct: 1   MDSRRFLAFSLFFLSVSTGFARLPE--THKKLSGSVLELKRDSPRLIFDPTRVTQLSWQP 60

Query: 61  RAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVA 120
           RAFLYKGFL+D+ECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVA
Sbjct: 61  RAFLYKGFLTDQECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQDEIVA 120

Query: 121 AIEARISAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSN 180
            IEARISAWT LP ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSN
Sbjct: 121 GIEARISAWTFLPVENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSN 180

Query: 181 VEKGGETIFPNSEFKESQEKDESWSDCARKGYAVKARKGDALLFFSLRPDATTDVKSLHG 240
           VEKGGETIFPNS F ESQEKD+SWSDCARKGYAVKA+KGDALLFFSL  DATTD +SLHG
Sbjct: 181 VEKGGETIFPNSAF-ESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDKRSLHG 240

Query: 241 SCPVIEGEKWSATKWIHVRSFEKATRVSRQDCVDENENCQIWAKRGECKKNPTYMVGSED 300
           SCPVIEGEKWSATKWIHVRSF+KATR+S QDCVDEN+NC  WAKRGEC+KNPTYMVGSE 
Sbjct: 241 SCPVIEGEKWSATKWIHVRSFDKATRISSQDCVDENKNCPSWAKRGECQKNPTYMVGSEG 300

Query: 301 ALGYCRKSCRAC 313
           A+GYCRKSC+AC
Sbjct: 301 AVGYCRKSCKAC 309

BLAST of Bhi06G000377 vs. ExPASy TrEMBL
Match: A0A5D3CTS4 (Procollagen-proline 4-dioxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold892G00740 PE=3 SV=1)

HSP 1 Score: 557.4 bits (1435), Expect = 3.9e-155
Identity = 284/375 (75.73%), Postives = 298/375 (79.47%), Query Frame = 0

Query: 1   MDSRRFLAFCLCFLSVFTGFARLPELR----SQKKSSGSVIRLKTDSSPLVFDPTRVTQL 60
           MDSR FLAF LCFLSVFT FARLPE R    S K+S+GSV+RLKTDSSPL+FDPTRVTQL
Sbjct: 1   MDSRPFLAFSLCFLSVFTAFARLPETRMLKHSYKQSTGSVLRLKTDSSPLIFDPTRVTQL 60

Query: 61  SWEPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQD 120
           SW+PRAFLYKGFLSD+ECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQD
Sbjct: 61  SWQPRAFLYKGFLSDEECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLRKAQD 120

Query: 121 EIVAAIEARISAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLM 180
           +IVA +EARI+AWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLM
Sbjct: 121 KIVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLM 180

Query: 181 YLSNVEKGGETIFPNSEFKESQEKDESWSDCARKGY------------------------ 240
           YLSNVEKGGETIFPNSEFKESQEKD+SWSDC+RKGY                        
Sbjct: 181 YLSNVEKGGETIFPNSEFKESQEKDDSWSDCSRKGYAGSTHRLANILYGEPTWFEPLLSD 240

Query: 241 -----------------------------------AVKARKGDALLFFSLRPDATTDVKS 300
                                              AVKA+KGDALLFFSL  DATTD +S
Sbjct: 241 FPRVARLYYRFAIMLLGFGIVPSYTPYGSTTWLFIAVKAQKGDALLFFSLHLDATTDERS 300

Query: 301 LHGSCPVIEGEKWSATKWIHVRSFEKATRVSRQDCVDENENCQIWAKRGECKKNPTYMVG 313
           LHGSCPVIEGEKWSATKWIHVRSFEK  RVSRQDCVDENENC  WAKRGECKKNPTYMVG
Sbjct: 301 LHGSCPVIEGEKWSATKWIHVRSFEKLPRVSRQDCVDENENCPAWAKRGECKKNPTYMVG 360

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
AT3G28480.18.0e-12969.52Oxoglutarate/iron-dependent oxygenase [more]
AT3G28480.28.0e-12165.63Oxoglutarate/iron-dependent oxygenase [more]
AT3G28490.11.1e-11465.81Oxoglutarate/iron-dependent oxygenase [more]
AT5G18900.18.1e-9760.002-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT3G06300.13.2e-9360.22P4H isoform 2 [more]
Match NameE-valueIdentityDescription
Q8L9701.1e-12769.52Probable prolyl 4-hydroxylase 7 OS=Arabidopsis thaliana OX=3702 GN=P4H7 PE=2 SV=... [more]
F4J0A81.6e-11365.81Probable prolyl 4-hydroxylase 6 OS=Arabidopsis thaliana OX=3702 GN=P4H6 PE=2 SV=... [more]
Q8LAN31.1e-9560.00Probable prolyl 4-hydroxylase 4 OS=Arabidopsis thaliana OX=3702 GN=P4H4 PE=2 SV=... [more]
F4JAU34.5e-9260.22Prolyl 4-hydroxylase 2 OS=Arabidopsis thaliana OX=3702 GN=P4H2 PE=1 SV=1[more]
Q9LN201.4e-6658.37Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana OX=3702 GN=P4H3 PE=2 SV=... [more]
Match NameE-valueIdentityDescription
A0A1S3C8G42.9e-16389.87Procollagen-proline 4-dioxygenase OS=Cucumis melo OX=3656 GN=LOC103498028 PE=3 S... [more]
A0A0A0KS384.7e-16189.46Procollagen-proline 4-dioxygenase OS=Cucumis sativus OX=3659 GN=Csa_5G633280 PE=... [more]
A0A6J1BXN94.0e-16088.50Procollagen-proline 4-dioxygenase OS=Momordica charantia OX=3673 GN=LOC111006412... [more]
A0A6J1FJ932.3e-15587.82Procollagen-proline 4-dioxygenase OS=Cucurbita moschata OX=3662 GN=LOC111444767 ... [more]
A0A5D3CTS43.9e-15575.73Procollagen-proline 4-dioxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E567... [more]
InterPro
Analysis Name: InterPro Annotations of Wax gourd (B227) v1
Date Performed: 2021-10-22
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR006620Prolyl 4-hydroxylase, alpha subunitSMARTSM00702p4hccoord: 60..257
e-value: 7.5E-56
score: 201.5
IPR003582ShKT domainSMARTSM00254ShkT_1coord: 271..312
e-value: 3.8E-4
score: 29.8
IPR003582ShKT domainPROSITEPS51670SHKTcoord: 272..312
score: 8.950704
NoneNo IPR availableGENE3D2.60.120.620q2cbj1_9rhob like domaincoord: 52..258
e-value: 3.1E-77
score: 260.9
NoneNo IPR availablePANTHERPTHR10869:SF140OS03G0803500 PROTEINcoord: 41..312
IPR044862Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domainPFAMPF136402OG-FeII_Oxy_3coord: 141..257
e-value: 3.4E-20
score: 72.7
IPR045054Prolyl 4-hydroxylasePANTHERPTHR10869PROLYL 4-HYDROXYLASE ALPHA SUBUNITcoord: 41..312
IPR005123Oxoglutarate/iron-dependent dioxygenasePROSITEPS51471FE2OG_OXYcoord: 136..258
score: 12.185434

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Bhi06M000377Bhi06M000377mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0018401 peptidyl-proline hydroxylation to 4-hydroxy-L-proline
cellular_component GO:0005789 endoplasmic reticulum membrane
molecular_function GO:0005506 iron ion binding
molecular_function GO:0031418 L-ascorbic acid binding
molecular_function GO:0004656 procollagen-proline 4-dioxygenase activity
molecular_function GO:0016705 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen