Bhi07G000118 (gene) Wax gourd (B227) v1

Overview
Name: Bhi07G000118
Type: gene
Organism: Benincasa hispida (Wax gourd (B227) v1)
Description: Procollagen-proline 3-dioxygenase
Location: chr7: 7150969 .. 7160914 (-)
RNA-Seq Expression: Bhi07G000118
Synteny: Bhi07G000118
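The Location field can be parsed to recover the chromosome, strand, and span of the gene. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common genome-browser convention that this page does not state); the function name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Parse a location such as 'chr7: 7150969 .. 7160914 (-)'.

    Assumption: start and end are 1-based, inclusive coordinates.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Inclusive span, hence the +1.
        "length": end - start + 1,
    }

loc = parse_location("chr7: 7150969 .. 7160914 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 9946 bp on the minus strand of chr7.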
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TTTAATTTGATTTATTCGAATCGTGGAGTTATTCCAAACATTATCGAACTGACAGCTTGCAATTATTTCGGACAATTTCCCAATAGGTTCATCAATGGCGCTGGGTACTGGAGCTCCTACGCGGTACATCGATATTGGATATTTCGTGTGCCTGTAGCCGGGTATCGCTTTGGGTATTCTTGTATCCGAATTCATTTGATCTCCATTTGTCACTTCTTAGTTCTTGGGATCCTTAGAATTGGTGCTCAAACATCATAGTTCAACAATATTTTCCATATTTCACACAATTTTGTTTTGTATCTGACTTTTTTTTGTATATATAGACGCTATAATTTCGATATTCAAGGGCATTTTGTTCGATTACTGCCATTGAAGCTAGAAGAGACGGAGAATTGGACCAAAATGGGAGACGAAGTGGAGAGCCGGCGGCGGCGGCGGCGGCGTCTTATTCTGGAAAATTTCTTAACCCGTGAAGAATGCAGGGAACTGGAGTTCATTCATAAGAGCTGCTGTACGGTGGGTTATAGACCAAACGTCTTCTCCACCACTCTTTTGCATCTTGTTGCTACTAACTCTGCTCATTTGATCATGCCTTTTGTTCCAATTAGAGGTAAAACCGCCCCTTCCTCGATTTGTATCATTAGTCTCTTTTGGTAACGTCTATGTGTAATTTTGATTTGACTTCAATTTTTTATGACAGAGAGGTTGAAGGAGAAAGCGGAGGAATTCTTTGGTTGTCATTATGAACTCTTTGTCGAGTTCACTGGCTTGATCAGGTTCTTCTTCTTCCTCCTTATGTGTAAATGTTGAAATGGGTTGTTGTCTATTTTTGCATACAGTTCTAACATTTGAAGAATGAACATAGGGAGTAAAAGATGAAGACCTGTGAGGAAATAGATTCAAAGGATTGTAATAAGTCGTTTTCTTGTGGTTGTTTTTGTCTCGCAGAGTCTGATGACTATGGTACCGTCGCCATTTAGAAATTGAAGCGCATTACTTATTAGGGTCCATAACTTGTATGTTCAACATATCAAGTAATACAAGTAGTTTAGACGCTGTATCAATTTGAGTTGAGCCTTATAAAACAAGTGTGCCATGGACATTTTTTCTCCACAAAATGGGAAAAAGATCTAACAATTTCTCACTTGGAGACAAAACAATAATCTTGTCTCCCACTATGCCGAAAACTGAACATCTTGGCAAATTCCAAGGATGATCTTCTTGGTAAAGTCCGAGGCTTAAAAATTCTATAAACTCAGCAAACAAACAAAACAGGCACCATAAGAAATATAACTGTAAAGCATCCATGCTTGGCCCAAAGGATGAGCAAGCTATGTGAACTTTTCCCCAGACGTAAGAGACAATCAAGCACTTCCACTTCAACTAAGAGGTAACACTCATCATTTCTTCCAACTCAAAGAATTGTTCCTAAGGAAGATAGATTCTATAAATTTTGTTAAGGCTCCCAAGAAGCTTATTCAAGCTCTCCCTCGGTATTATACTCGAAAAGGTGCTCCTCCCTTGCATCCAAGGTATCTGACGTTAATAATGATCTTACTGAAGTTGCCTGTGTACAAGCCCAAGTCCAAGATTTTCCTTTGTCTAAGTCCAATCACAATTCTTCAACACTTATTCCCCTGTAACAAGACTTCAAGCTCTCCTCTCTCTCTATTTTGAATTCAAAGCACCGATTAATACAGAGTGTTCCTCAACAAGTTAGCTCTTGTTGTTACTCAGTTCAAATTGATAGAGAAGATGATTCAGATGTTAGTGTTAGCAGTGAAGAGCTAGAGCTAAATTTTTTGAATCCAAACTTAGAGGGTTCCCTTCTTGAGGAGAACTTTGGAGAAGATCTTGTTGTCTTGTTTGATTGTTCAGCTCAAAAGGAGGTTAGTCAGGTCAATTTAGTTCGCTTTTCTCCTCTTTCTCCATCTCAAATACCCTCCAAGTTTTCCTCCCTAGTCGAAATTTGCAGCCTTTAATTTTTCAAAGCATCCCAC
CATTCATCTCAGTTGGTAATTTGATTGTTCTTACTCAGTATGAAGACCATTTCATGAATTATTCGGGTCGTCGGAGCAATTTCTTTTAATCTATGAGGAGTTGCATGGTGTTTGATTGTTCTTGCAACAGATTACTTTTCACTGCATTTAAAGATTTCAGAAGTTTTGGGACGTTTGTTTGCTTATTTGGGATTTATTGGCAGCTTGGGTGGTTTTATAGCTTGTTGCGTTTTGAGAAGGATTTTGGTAATCTTGAAGATTTTGATCTTGATGTCGAAAGAGGGTCAAAGAATGGTTGATAGTTAAGTGTCAAGAAGCTGGCTTTTGGTTTCTTCAGTATTCAAGCTGTCCTAGAAGAAGTCTCATTTCATGTCAAAGTTTTTATCCCCCCTCTCTCTCTCTCTCTCTCTCTCTTTTTAAATTTATTATCTTGGTCTAGTTTTGATTTTCTCTTCTTGTATTGGCAAGCTTTTGATACCTTTATGTTTAGTGTTCTCTTTTGTAATTTGAATATTAGTCCCATTTCATTATTTCAATGAAAAGTTATGTTTCCATTTCAAAAATGGATGAATTTAAGTGAAATTCAAGCTTTCAGTCAAGCTGAAGACTTGAACTTTGACCTTGGCGATTACCTTGGTCTTTTTCACTTTGATGCCTCTATTCACCAAGGCCACTTCATCAATTTATTGGAATGTCTGTCAATTGGGCCTGGGTCACGCCAACCTTTGATATCAACTATCTTTCTTAGACTTCCCAATTGTTTTTGGGTTGATAGATCTGATTTACATTCCAAAATCACACTTGCATCCCTTGTTAGCCAACTTGGGAGTATAGATGCGATTCATCTTTTACTGGGTACGAATGTGACATGTTGCATAGGGAGCATGGACACGTGTTTGACATGCGAGAAAATTCATATCTTCTTCATATCTTTTTTTTTCCTGTCTTTCTCTCGCTCTCTGATTTTGGACATGTGAAGACATGCACTAGACACACCCATTTGCACATAAAAAACCAAACCTTTTCTTTTTTTAATTTTTCATGGAAACAACAACGTTAATAAAGAAAAAATGAAGAATACAAGGGCATACAAAAAAACCAAGTCCACAGAAAAGGGAACTCCCTCTACAAAAATGAATCTAACTATGCAAAATAATGCCTATAGAATAATTACAAAAGGTCTTTGAAATTGAAGCCTACAGAGAAGCATGGAAGCGAACAGAAGACCAAACCTCACTAGGTTTCCTTTCCACCTCTCTAAACACCCTACCCTACTATTTCATTCACCCCACAAAACTCATTAGGTTACACACGCCCCAACAGGCCATAAATAGCGACCATTCTCTCGGCATGTTTGAAAAAGACGGTCGACAAAGATGCAAGTCAAAGTCGGTCAGTGAAGTCCATCTGAGAGGAAGGAAGATTGAAAGAAGGTGGGGAGGAGAAGGAAAAACAGGAAAATGGGGGAGGGGGAAAAGGAAAGGTCAAAGGAGCGAGGGTTAGGATTGCTTTAAAGAAATATAAAGTTTTTACTAATTGTTATTTGGTGGGTTTTAATGGGTTTGTTTTTCTGTTTTTCTGTCTTTTTTCCTTTTCTTTCTTTCTTTCTTTCTTTTTTTGGTTTAAATGGGCTTTTCTCTCTATTTAGGTCTTTTCCAATACTTCATTTGATTTACTCTTTTCAGGAAAATTTAGCATGGTAGAACTCGTATTTTTCTTCTAACATTTTTTCATAGTTCAGCTGGACCAGGGGAGCAAGCATTGGATGGCATAGTGATGATAACAGACCCTATCTAAAACAACGTGACTTTTCAGTATGTGGACCAATCTTTCCTGGGAAAGTTGACGTCATTCTTGCTATTCCTTGATGGGCTATTCTTTCATTTCAATTTTTGTTTCATATTTTTAATTTATTTCCTCTACAGTATATTCTCTCTCGAGTTTTTCTTTTCCATTTTTTCCTTCTGGCAAGGGGGACGGTGGTTTTCTATTTTTCCAAT
TTCTTTACCTTCTCCCTGAAATTCATTTGGTATTTTCGGCTCTATCATTACAGCACTCTAGCTGATAGTTATCTTCATGTGTTTGGCAAAAGGCAGTGTGTTACTTGAATAGTTATGGAGTTGAATTCGGAGGTGGGCTGTTTCACTTTCAGGACGGGGAACCAGAAACTATCTCACCTTTTTGTGGAGTAAGTTAGATGAAAGCTGCTTACCTCTTTAATTAATTAATTTATTTATTTATTTATTTTGATAAAGAAATAATTTCATTGATGTATGGAATTACAAAAGGGAGAAGGAAAGAAGCCCCAGACGAAAGGAGTTACAAAATACATTTTCAATTCGTCAATAGAGAATCAAAACTATATGAATGGAATAATTTATACAATTTACACCAAGAGATAGCAATGAAGATAACATGCTCTAAACAAAGATTGAATTGTTGAGCTTTGTTGTTAAGTTTAGTATTGACGGTACATGTCACTTATAAAAAATTGGTACATATGCTTTTCTGCCAATCGGAGTGTAGCTGAGGTGGTTAAGCCCTAGACCAAAAAGTCAAAGGTTCTAATCCTCTCCACATATTGTTGAACTCGAAAATGGTACATATGCTACTTTTGAACTCTGAAGAATGTGCTTGTGCTCTTTTGAGCCCTGTAGTCTGCAGTGTTTTATGGTAAATAGAAAAGATAAGTGGAGCTAAATCCTTCACAATTTGAATCAAGACCCATGTTTCATAAATTTTATCTATTCTAGTTGTGGGGATGATGAACGGTTGAGTTTCTTGTTTGATGTCAAAAAGATATGAATAATGTGATAGGGCACATTCTCTGAATTGTTGGATTTTGGTGGTTTTTCCTCTTACAAGAAATGCTTGCGAATAAATTAGTCTGAATAATTATTGATGGTGTTGGATTATAGTTCCTAGTAGTTTGAAAGTTGAAACAATGACCAATCAAGTTTAGTTAATACTCGTTTCACTTTATCAGGATTGTGTGATGTACACGGCTGACAGCGACAATGTTCATTCTGTTGATGAGGTATGTCCAAGCACTCTGAGACCCTCCGAATCTGCTTGTGAAAACATTGTATAAATGAATAACATTAAGTCTTCGCAATGTTAAATATTGGCTAAGCGAAAAGATGTTGCATCTGTGCATTGAGTTTGTATAATATAATGGAGTAATTTCATATACATTCATGACTCGCGATTCCAGTTACCACTGTCATTTTAAGATATTAATTGATAACATTCTCTAGTGGTTTCTAGAACTTCTACAATTGTATGTAAAAAAGATCACAATTCTTAATTTCTTAACCCGGAGACAGCAAGAACACATGCTTCTTGATCCTTTCTGGATGCGGCTGTAATGACGCTAGCCTTGATGCCTTGTGTCTGATGGACTGATCACAATCTTTCTAAAATCAATTGGCATCAGTACGACGAGGATCAGATACAAAAAATTGCTCTTTGAAGTTAACAGACATTTTCTAATACAGGCAATTGAGGGGTGGGAGATTTCAACTTTGATCGAGTCTTTAACCCAAACCAAGTCTTAATCAAGTTTTCTGATGAGGCCTTTTTTGGTTATTCTTTGGAGACAAGTAGGAGCACATTAATTCTTTCTTGCTGTTTTACTTTTAACAGGTAACTGAGAGATCATGCATGTTGATTAATCCTCAAGACAGTTGCTAGTAGACCTCTCTTAGGACAGCTCTACTTCATCTTGCTTACTACATGGTTTCCTTCGATAGATAACCAATGGAGAAAGGCTTACACTGACATTATGGCTCACCCGTGATAGTTCCCATGATGAAGATTCAAAACTCCTTTCCCTTCTTTCACAAAGCCATTTGCACGATCGTCTTCCTGACTCACGCCTACCGCAGCCTCCATCCTGTAATATGTATTGGTTTTCACTGGAAGACGATCCAAATTTCAAGTCCGGTTTTGATATATGCTGGGCAAGACTGCATGCGCTTGGATACGATATCTATTTTCG
TGGGGACCATAGTTTTTCAGAGTATCCAGATTTATTCTCACGGGACGTACAATTAGTACAGGGAAATAAGTTATTCTTTCAGGAATTTGAGAACATCCTGCATTTGCTTCAGGTATTCCATCCAAATAACCCTTGGGAGAATCTGGCCGATTTTAGTGTGACTATAAAAATGTTGTAATATTGTGAATCATCAATTACTAGTTACTACTTTGAATAACTTGATTTGTTCTATACGTTTTTCCCGTTGAAAATTTCTGCTGAAACTAAAAGTAACTTCCATATGCTTAAACAGGTAGTGCAGTTCCTGTGTTGGAAAGGCAAAGAACTTGATTCTACCAACATCAAGGAGGATTCAAGCTATGCAGAATATTTATCCCCAAAGAGGAATGTGGGAGTCAGTTACTTTAAATCCGAGTTTTCGAAGGATGATGTGTTGGCCGAATCGGTCTTCTCATCTGCTACTTCTGATGGCAAGGAGAACCAACACTGGCTGGGGTGGGACAAGCTTGCTGCTGCTGCAGCAGCTTGGGAAGATTATGCTTCCATTTTAAGGAGAGAACTCCTTGGGAGCTTGAGCTATTGGAGAAACAGTCAATCCATATACAGTGTTTCACTTAGTAGCTGAATCTCCCATTTGTGATAAAGTAACCATCCCAAATGCTAGAAGTAGCTGAGCTTCAAGGTTAGTTGTGGGACTTTGTATTGCTTAACTAGAGTTTATTGGATCCTTAATGTTAAAAGGTTAAAAATCATTTTTAGTCCCTAAACTTTCATAAAAGTAATAATTTAGTTTATGAACTTTGGTTTGTAACGATTTGGTCCTTGTATTTTCAATTTTGTAACGATTTAGTTCATGAACTTTACTACGTAATAATTTAGTCCCTGTATTTTAATATTTATAACGATTTAGTCCTTATTGCAAAAATTAATGTTAAAATTTAATGAGATTTCTTGGATAAATAAACCGATAAACTAATAAGAGAATCAATATTTTTATAAAATATAAAGTCTACATCATAAAACAATAAAAGTTGACTTTTAATTTTGGTGAACTTTTTTACACAAGGGACTAGATTGTTACAAATTTAAAAGTACAAGGACGAAATTGTTACATGTCAAAGTTGAGGGACTTAATTGTTATAAAATTGAAAGTACAAAGACTAAATCGTTAAAAACTAAAGCTCAGGACTAAATTGGTACTTTTATGAAAACCTAGGGACCAAAAGTGATTTTTAACCTTAATGTTAAAGAATAATGGAAACAACCACTTTTAAAACAGAAAAGAGGTTATCTCATTCATTTTTCTATTAGAGACTGAATCTCTCAGCAACTTGTGTTGATTTTCTTTGGTTCTCAGCACATGGGTTGTTTATGTAAATCAATAAATAGTAGTAGAAGCAGGTTAGAAAAGATTACAGAAAAAACATGACTTGGAAAAGGCTGAGAGAGTAGATGAGTTTGAATGAGAAAGGTTTCCTGTTACTTCTTAGGAGTGGATGTAAGCATTAAAACGTTGAACCTTCATTTTCTTCATTGCTTGGGAGTTTCTTCTTCTGTAGCTTCAAACGTAATGAGAGTTTTTTTACTTCCAA

mRNA sequence

TTTAATTTGATTTATTCGAATCGTGGAGTTATTCCAAACATTATCGAACTGACAGCTTGCAATTATTTCGGACAATTTCCCAATAGGTTCATCAATGGCGCTGGGTACTGGAGCTCCTACGCGGTACATCGATATTGGATATTTCGTGTGCCTGTAGCCGGACGCTATAATTTCGATATTCAAGGGCATTTTGTTCGATTACTGCCATTGAAGCTAGAAGAGACGGAGAATTGGACCAAAATGGGAGACGAAGTGGAGAGCCGGCGGCGGCGGCGGCGGCGTCTTATTCTGGAAAATTTCTTAACCCGTGAAGAATGCAGGGAACTGGAGTTCATTCATAAGAGCTGCTGTACGGTGGGTTATAGACCAAACGTCTTCTCCACCACTCTTTTGCATCTTGTTGCTACTAACTCTGCTCATTTGATCATGCCTTTTGTTCCAATTAGAGAGAGGTTGAAGGAGAAAGCGGAGGAATTCTTTGGTTGTCATTATGAACTCTTTGTCGAGTTCACTGGCTTGATCAGCTGGACCAGGGGAGCAAGCATTGGATGGCATAGTGATGATAACAGACCCTATCTAAAACAACGTGACTTTTCAGCAGTGTGTTACTTGAATAGTTATGGAGTTGAATTCGGAGGTGGGCTGTTTCACTTTCAGGACGGGGAACCAGAAACTATCTCACCTTTTTGTGGAGATTGTGTGATGTACACGGCTGACAGCGACAATGTTCATTCTGTTGATGAGATAACCAATGGAGAAAGGCTTACACTGACATTATGGCTCACCCGTGATAGTTCCCATGATGAAGATTCAAAACTCCTTTCCCTTCTTTCACAAAGCCATTTGCACGATCGTCTTCCTGACTCACGCCTACCGCAGCCTCCATCCTGTAATATGTATTGGTTTTCACTGGAAGACGATCCAAATTTCAAGTCCGGTTTTGATATATGCTGGGCAAGACTGCATGCGCTTGGATACGATATCTATTTTCGTGGGGACCATAGTTTTTCAGAGTATCCAGATTTATTCTCACGGGACGTACAATTAGTACAGGGAAATAAGTTATTCTTTCAGGAATTTGAGAACATCCTGCATTTGCTTCAGGTAGTGCAGTTCCTGTGTTGGAAAGGCAAAGAACTTGATTCTACCAACATCAAGGAGGATTCAAGCTATGCAGAATATTTATCCCCAAAGAGGAATGTGGGAGTCAGTTACTTTAAATCCGAGTTTTCGAAGGATGATGTGTTGGCCGAATCGGTCTTCTCATCTGCTACTTCTGATGGCAAGGAGAACCAACACTGGCTGGGGTGGGACAAGCTTGCTGCTGCTGCAGCAGCTTGGGAAGATTATGCTTCCATTTTAAGGAGAGAACTCCTTGGGAGCTTGAGCTATTGGAGAAACAGTCAATCCATATACAGTGTTTCACTTAGTAGCTGAATCTCCCATTTGTGATAAAGTAACCATCCCAAATGCTAGAAGTAGCTGAGCTTCAAGGTTAGTTGTGGGACTTTGTATTGCTTAACTAGAGTTTATTGGATCCTTAATGTTAAAAGGTTAAAAATCATTTTTAGTCCCTAAACTTTCATAAAAGTAATAATTTAGTTTATGAACTTTGGTTTGTAACGATTTGGTCCTTGTATTTTCAATTTTGTAACGATTTAGTTCATGAACTTTACTACGTAATAATTTAGTCCCTGTATTTTAATATTTATAACGATTTAGTCCTTATTGCAAAAATTAATGTTAAAATTTAATGAGATTTCTTGGATAAATAAACCGATAAACTAATAAGAGAATCAATATTTTTATAAAATATAAAGTCTACATCATAAAACAATAAAAGTTGACTTTTAATTTTGGTGAACTTTTTTACACAAGGGACTAGATTGTTACAAATTTAAAAGTACAAGGACGAAATTGTTACATGTCAAAGTTGAGGGACTTAATTGTTATAAAATTGAAAGTACAAAGACTAAATCGTTAAAAACTAAAGCTCAGGA
CTAAATTGGTACTTTTATGAAAACCTAGGGACCAAAAGTGATTTTTAACCTTAATGTTAAAGAATAATGGAAACAACCACTTTTAAAACAGAAAAGAGGTTATCTCATTCATTTTTCTATTAGAGACTGAATCTCTCAGCAACTTGTGTTGATTTTCTTTGGTTCTCAGCACATGGGTTGTTTATGTAAATCAATAAATAGTAGTAGAAGCAGGTTAGAAAAGATTACAGAAAAAACATGACTTGGAAAAGGCTGAGAGAGTAGATGAGTTTGAATGAGAAAGGTTTCCTGTTACTTCTTAGGAGTGGATGTAAGCATTAAAACGTTGAACCTTCATTTTCTTCATTGCTTGGGAGTTTCTTCTTCTGTAGCTTCAAACGTAATGAGAGTTTTTTTACTTCCAA

Coding sequence (CDS)

TTTAATTTGATTTATTCGAATCGTGGAGTTATTCCAAACATTATCGAACTGACAGCTTGCAATTATTTCGGACAATTTCCCAATAGGTTCATCAATGGCGCTGGGTACTGGAGCTCCTACGCGGTACATCGATATTGGATATTTCGTGTGCCTGTAGCCGGACGCTATAATTTCGATATTCAAGGGCATTTTGTTCGATTACTGCCATTGAAGCTAGAAGAGACGGAGAATTGGACCAAAATGGGAGACGAAGTGGAGAGCCGGCGGCGGCGGCGGCGGCGTCTTATTCTGGAAAATTTCTTAACCCGTGAAGAATGCAGGGAACTGGAGTTCATTCATAAGAGCTGCTGTACGGTGGGTTATAGACCAAACGTCTTCTCCACCACTCTTTTGCATCTTGTTGCTACTAACTCTGCTCATTTGATCATGCCTTTTGTTCCAATTAGAGAGAGGTTGAAGGAGAAAGCGGAGGAATTCTTTGGTTGTCATTATGAACTCTTTGTCGAGTTCACTGGCTTGATCAGCTGGACCAGGGGAGCAAGCATTGGATGGCATAGTGATGATAACAGACCCTATCTAAAACAACGTGACTTTTCAGCAGTGTGTTACTTGAATAGTTATGGAGTTGAATTCGGAGGTGGGCTGTTTCACTTTCAGGACGGGGAACCAGAAACTATCTCACCTTTTTGTGGAGATTGTGTGATGTACACGGCTGACAGCGACAATGTTCATTCTGTTGATGAGATAACCAATGGAGAAAGGCTTACACTGACATTATGGCTCACCCGTGATAGTTCCCATGATGAAGATTCAAAACTCCTTTCCCTTCTTTCACAAAGCCATTTGCACGATCGTCTTCCTGACTCACGCCTACCGCAGCCTCCATCCTGTAATATGTATTGGTTTTCACTGGAAGACGATCCAAATTTCAAGTCCGGTTTTGATATATGCTGGGCAAGACTGCATGCGCTTGGATACGATATCTATTTTCGTGGGGACCATAGTTTTTCAGAGTATCCAGATTTATTCTCACGGGACGTACAATTAGTACAGGGAAATAAGTTATTCTTTCAGGAATTTGAGAACATCCTGCATTTGCTTCAGGTAGTGCAGTTCCTGTGTTGGAAAGGCAAAGAACTTGATTCTACCAACATCAAGGAGGATTCAAGCTATGCAGAATATTTATCCCCAAAGAGGAATGTGGGAGTCAGTTACTTTAAATCCGAGTTTTCGAAGGATGATGTGTTGGCCGAATCGGTCTTCTCATCTGCTACTTCTGATGGCAAGGAGAACCAACACTGGCTGGGGTGGGACAAGCTTGCTGCTGCTGCAGCAGCTTGGGAAGATTATGCTTCCATTTTAAGGAGAGAACTCCTTGGGAGCTTGAGCTATTGGAGAAACAGTCAATCCATATACAGTGTTTCACTTAGTAGCTGA

Protein sequence

FNLIYSNRGVIPNIIELTACNYFGQFPNRFINGAGYWSSYAVHRYWIFRVPVAGRYNFDIQGHFVRLLPLKLEETENWTKMGDEVESRRRRRRRLILENFLTREECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQRDFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFCGDCVMYTADSDNVHSVDEITNGERLTLTLWLTRDSSHDEDSKLLSLLSQSHLHDRLPDSRLPQPPSCNMYWFSLEDDPNFKSGFDICWARLHALGYDIYFRGDHSFSEYPDLFSRDVQLVQGNKLFFQEFENILHLLQVVQFLCWKGKELDSTNIKEDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSATSDGKENQHWLGWDKLAAAAAAWEDYASILRRELLGSLSYWRNSQSIYSVSLSS
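The relationship between the listed coding sequence and the protein sequence can be checked with a short translation routine. The sketch below assumes the standard genetic code and frame 0 (consistent with "Query Frame = 0" in the BLAST reports); `translate` is a hypothetical helper, not part of the database. Note that the listed CDS begins with TTT rather than ATG, which matches the protein starting with F.

```python
# Standard genetic code in the compact TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna):
    """Translate DNA in frame 0; '*' marks a stop codon, 'X' an unknown codon.

    A trailing partial codon is ignored.
    """
    dna = dna.upper()
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X")
        for i in range(0, len(dna) - 2, 3)
    )

# First five codons and last three codons of the CDS listed above.
print(translate("TTTAATTTGATTTAT"))
print(translate("AGTAGCTGA"))
```

The first call reproduces the opening residues of the protein (FNLIY), and the second shows the final two serines followed by the stop codon.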
Homology
BLAST of Bhi07G000118 vs. TAIR 10
Match: AT1G68080.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 414.5 bits (1064), Expect = 1.2e-115
Identity = 213/384 (55.47%), Positives = 268/384 (69.79%), Query Frame = 0

Query: 94  RLILENFLTREECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLK 153
           RLIL NFL+  EC+ELE IHKS  T+GYRPNVFSTTL HL+ATNS HLI+PFV IRERLK
Sbjct: 8   RLILHNFLSPAECKELELIHKSSSTIGYRPNVFSTTLSHLIATNSPHLIIPFVSIRERLK 67

Query: 154 EKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQRDFSAVCYLNSYGVEFGG 213
           EK EE FGC YELF+EFTGLISW +GASIGWHSDDNR YLKQRDF+AVCYLNSY  +F G
Sbjct: 68  EKIEETFGCEYELFIEFTGLISWCKGASIGWHSDDNRSYLKQRDFAAVCYLNSYEKDFIG 127

Query: 214 GLFHFQDGEPETISPFCGDCVMYTADSDNVHSVDEITNGERLTLTLWLTRDSSHDEDSKL 273
           GLF FQ GEP T++P  GD +MYTAD  N+HSVDE+T+GERLTL LW +RDSSHDEDSKL
Sbjct: 128 GLFRFQSGEPVTVAPSAGDVIMYTADDRNIHSVDEVTDGERLTLALWFSRDSSHDEDSKL 187

Query: 274 LSLLSQSHLHDRLPDSRLPQPPSCNMYWF-SLEDDPNFKSGFDICWARLHALGYDIY-FR 333
           LS LSQ   H+      LP P S NMYWF   +D  N   GFD+C ARLH LG+D++  +
Sbjct: 188 LSRLSQCTSHEVC----LPLPASTNMYWFCPHQDGSNQNIGFDVCVARLHLLGFDVHSLQ 247

Query: 334 GDHSFSEYPDLFSRDVQLVQGNKLFFQEFENILHLLQVVQFLCWKGKELDSTNIKEDS-S 393
           G+   ++  +     +QL +G KL  ++F NILH LQVVQF  WK  EL ++N++ D+  
Sbjct: 248 GEDHSTDASEQLMGPLQLAKGGKLLTRKFANILHALQVVQFYHWKASELVTSNVENDTLE 307

Query: 394 YAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSATSDGKENQHWLGWDKLAAAAAAWEDY 453
             + +S  +   ++  KS F  D+ L  + F  + S G++ +  L    +A A  +WE+Y
Sbjct: 308 EVKAMSHSQLETINALKSVFLLDENLVATTFGYSCS-GEDRKDSLDLTGIALAVTSWEEY 367

Query: 454 ASILRRELLGSLSYWRNSQSIYSV 475
           +  L +ELL SL  W+  Q+I+ V
Sbjct: 368 SCKLLKELLSSLPQWKTYQTIHKV 386
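
The Identity and Positives percentages in an HSP header are the respective counts divided by the alignment length. A minimal sketch of that arithmetic, using the values reported for AT1G68080.1 above (the helper name `percent` is hypothetical):

```python
def percent(count, alignment_length):
    """Return count / alignment_length as a percentage, rounded to 2 decimals."""
    return round(100.0 * count / alignment_length, 2)

# Values from the HSP header: 213 identical and 268 positive
# positions over an alignment of 384 columns.
identity = percent(213, 384)
positives = percent(268, 384)
print(f"Identity = {identity}%, Positives = {positives}%")
```

The same calculation applies to the other hits; only the counts and alignment lengths differ.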

BLAST of Bhi07G000118 vs. TAIR 10
Match: AT1G68080.3 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 372.5 bits (955), Expect = 5.2e-103
Identity = 198/384 (51.56%), Positives = 253/384 (65.89%), Query Frame = 0

Query: 94  RLILENFLTREECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLK 153
           RLIL NFL+  EC+ELE IHKS  T+GYRPNVFSTTL HL+ATNS HLI+PFV IRERLK
Sbjct: 8   RLILHNFLSPAECKELELIHKSSSTIGYRPNVFSTTLSHLIATNSPHLIIPFVSIRERLK 67

Query: 154 EKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQRDFSAVCYLNSYGVEFGG 213
           EK EE FGC YELF+EFTGLISW +GASIGWHSDDNR YLKQRDF++             
Sbjct: 68  EKIEETFGCEYELFIEFTGLISWCKGASIGWHSDDNRSYLKQRDFAS------------- 127

Query: 214 GLFHFQDGEPETISPFCGDCVMYTADSDNVHSVDEITNGERLTLTLWLTRDSSHDEDSKL 273
                  GEP T++P  GD +MYTAD  N+HSVDE+T+GERLTL LW +RDSSHDEDSKL
Sbjct: 128 -------GEPVTVAPSAGDVIMYTADDRNIHSVDEVTDGERLTLALWFSRDSSHDEDSKL 187

Query: 274 LSLLSQSHLHDRLPDSRLPQPPSCNMYWF-SLEDDPNFKSGFDICWARLHALGYDIY-FR 333
           LS LSQ   H+      LP P S NMYWF   +D  N   GFD+C ARLH LG+D++  +
Sbjct: 188 LSRLSQCTSHEVC----LPLPASTNMYWFCPHQDGSNQNIGFDVCVARLHLLGFDVHSLQ 247

Query: 334 GDHSFSEYPDLFSRDVQLVQGNKLFFQEFENILHLLQVVQFLCWKGKELDSTNIKEDS-S 393
           G+   ++  +     +QL +G KL  ++F NILH LQVVQF  WK  EL ++N++ D+  
Sbjct: 248 GEDHSTDASEQLMGPLQLAKGGKLLTRKFANILHALQVVQFYHWKASELVTSNVENDTLE 307

Query: 394 YAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSATSDGKENQHWLGWDKLAAAAAAWEDY 453
             + +S  +   ++  KS F  D+ L  + F  + S G++ +  L    +A A  +WE+Y
Sbjct: 308 EVKAMSHSQLETINALKSVFLLDENLVATTFGYSCS-GEDRKDSLDLTGIALAVTSWEEY 366

Query: 454 ASILRRELLGSLSYWRNSQSIYSV 475
           +  L +ELL SL  W+  Q+I+ V
Sbjct: 368 SCKLLKELLSSLPQWKTYQTIHKV 366

BLAST of Bhi07G000118 vs. TAIR 10
Match: AT1G68080.2 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 340.9 bits (873), Expect = 1.7e-93
Identity = 185/383 (48.30%), Positives = 238/383 (62.14%), Query Frame = 0

Query: 94  RLILENFLTREECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAHLIMPFVPIRERLK 153
           RLIL NFL+  EC+ELE IHKS  T+GYRPNVFSTTL HL+ATNS HLI+PFV IRERLK
Sbjct: 8   RLILHNFLSPAECKELELIHKSSSTIGYRPNVFSTTLSHLIATNSPHLIIPFVSIRERLK 67

Query: 154 EKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQRDFSAVCYLNSYGVEFGG 213
           EK EE FGC YELF+EFTGLISW +GASIGWHSDDNR YLKQRDF++             
Sbjct: 68  EKIEETFGCEYELFIEFTGLISWCKGASIGWHSDDNRSYLKQRDFAS------------- 127

Query: 214 GLFHFQDGEPETISPFCGDCVMYTADSDNVHSVDEITNGERLTLTLWLTRDSSHDEDSKL 273
                  GEP T++P  GD +MYTAD  N+HSVDE+T+GERLTL LW +RDSSHDEDSKL
Sbjct: 128 -------GEPVTVAPSAGDVIMYTADDRNIHSVDEVTDGERLTLALWFSRDSSHDEDSKL 187

Query: 274 LSLLSQSHLHDRLPDSRLPQPPSCNMYWFSLEDDPNFKSGFDICWARLHALGYDIY-FRG 333
           LS LSQ                                  FD+C ARLH LG+D++  +G
Sbjct: 188 LSRLSQC---------------------------------FDVCVARLHLLGFDVHSLQG 247

Query: 334 DHSFSEYPDLFSRDVQLVQGNKLFFQEFENILHLLQVVQFLCWKGKELDSTNIKEDS-SY 393
           +   ++  +     +QL +G KL  ++F NILH LQVVQF  WK  EL ++N++ D+   
Sbjct: 248 EDHSTDASEQLMGPLQLAKGGKLLTRKFANILHALQVVQFYHWKASELVTSNVENDTLEE 307

Query: 394 AEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSATSDGKENQHWLGWDKLAAAAAAWEDYA 453
            + +S  +   ++  KS F  D+ L  + F  + S G++ +  L    +A A  +WE+Y+
Sbjct: 308 VKAMSHSQLETINALKSVFLLDENLVATTFGYSCS-GEDRKDSLDLTGIALAVTSWEEYS 336

Query: 454 SILRRELLGSLSYWRNSQSIYSV 475
             L +ELL SL  W+  Q+I+ V
Sbjct: 368 CKLLKELLSSLPQWKTYQTIHKV 336

BLAST of Bhi07G000118 vs. TAIR 10
Match: AT1G68080.4 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 188.3 bits (477), Expect = 1.4e-47
Identity = 107/243 (44.03%), Positives = 150/243 (61.73%), Query Frame = 0

Query: 235 MYTADSDNVHSVDEITNGERLTLTLWLTRDSSHDEDSKLLSLLSQSHLHDRLPDSRLPQP 294
           MYTAD  N+HSVDE+T+GERLTL LW +RDSSHDEDSKLLS LSQ   H+      LP P
Sbjct: 1   MYTADDRNIHSVDEVTDGERLTLALWFSRDSSHDEDSKLLSRLSQCTSHEVC----LPLP 60

Query: 295 PSCNMYWF-SLEDDPNFKSGFDICWARLHALGYDIY-FRGDHSFSEYPDLFSRDVQLVQG 354
            S NMYWF   +D  N   GFD+C ARLH LG+D++  +G+   ++  +     +QL +G
Sbjct: 61  ASTNMYWFCPHQDGSNQNIGFDVCVARLHLLGFDVHSLQGEDHSTDASEQLMGPLQLAKG 120

Query: 355 NKLFFQEFENILHLLQVVQFLCWKGKELDSTNIKEDS-SYAEYLSPKRNVGVSYFKSEFS 414
            KL  ++F NILH LQVVQF  WK  EL ++N++ D+    + +S  +   ++  KS F 
Sbjct: 121 GKLLTRKFANILHALQVVQFYHWKASELVTSNVENDTLEEVKAMSHSQLETINALKSVFL 180

Query: 415 KDDVLAESVFSSATSDGKENQHWLGWDKLAAAAAAWEDYASILRRELLGSLSYWRNSQSI 474
            D+ L  + F  + S G++ +  L    +A A  +WE+Y+  L +ELL SL  W+  Q+I
Sbjct: 181 LDENLVATTFGYSCS-GEDRKDSLDLTGIALAVTSWEEYSCKLLKELLSSLPQWKTYQTI 238

BLAST of Bhi07G000118 vs. ExPASy Swiss-Prot
Match: Q5XGE0 (2-oxoglutarate and iron-dependent oxygenase domain-containing protein 3 OS=Xenopus tropicalis OX=8364 GN=ogfod3 PE=2 SV=1)

HSP 1 Score: 58.2 bits (139), Expect = 3.0e-07
Identity = 27/85 (31.76%), Positives = 47/85 (55.29%), Query Frame = 0

Query: 184 WHSDDNRPYLKQRDFSAVCYLNSYGVEFGGGLFHFQD-GEPETISPFCGDCVMYTADSDN 243
           WH   ++      D++++ YL+ Y  +FGGG F F D G   T+ P  G    +T+ S+N
Sbjct: 225 WHPHIDKVTYGSFDYTSLLYLSDYSQDFGGGRFVFIDEGANRTVEPRTGRLSFFTSGSEN 284

Query: 244 VHSVDEITNGERLTLTLWLTRDSSH 268
           +H V++++ G R  +T+  T +  H
Sbjct: 285 LHRVEKVSWGTRYAITISFTCNPEH 309

BLAST of Bhi07G000118 vs. ExPASy Swiss-Prot
Match: A5WFM3 (PKHD-type hydroxylase PsycPRwf_1523 OS=Psychrobacter sp. (strain PRwf-1) OX=349106 GN=PsycPRwf_1523 PE=3 SV=1)

HSP 1 Score: 46.6 bits (109), Expect = 9.2e-04
Identity = 31/110 (28.18%), Positives = 50/110 (45.45%), Query Frame = 0

Query: 179 GASIGWHSDD------NRPYLKQRDFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFCGD 238
           G   G H D+      +   L + D S   +LN+     GG L    +    +I    GD
Sbjct: 92  GQGYGMHVDNALQTHPDSKQLMRTDLSLTLFLNNPADYEGGELVISDEYGEHSIKLSAGD 151

Query: 239 CVMYTADSDNVHSVDEITNGERLTLTLWLTRDSSHDEDSKLLSLLSQSHL 283
            V+Y   S ++H V+ +T+G+RL +  W+      DE  ++L  L  SH+
Sbjct: 152 AVLY--PSTSLHRVNTVTSGQRLAMVTWVQSLVRSDEQRQILHDLDVSHI 199

BLAST of Bhi07G000118 vs. ExPASy TrEMBL
Match: A0A5D3CRE9 (Procollagen-proline 3-dioxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold186G001030 PE=3 SV=1)

HSP 1 Score: 738.0 bits (1904), Expect = 2.4e-209
Identity = 358/399 (89.72%), Positives = 373/399 (93.48%), Query Frame = 0

Query: 81  MGDEVESRRRRRRRLILENFLTREECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAH 140
           M D  ES  R+RRRLILENFL+REECRELEFIHKSCCTVGYRPNV STTLLHLVATNSAH
Sbjct: 1   MVDGAES--RQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAH 60

Query: 141 LIMPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQRDFSA 200
           LI+PFVPIRE+LKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQR+FSA
Sbjct: 61  LIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFSA 120

Query: 201 VCYLNSYGVEFGGGLFHFQDGEPETISPFCGDCVMYTADSDNVHSVDEITNGERLTLTLW 260
           VCYLNSYGVEFGGGLFHFQDGEPETISPF GDCVMY ADSDNVHSVDEITNGERLTLTLW
Sbjct: 121 VCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYRADSDNVHSVDEITNGERLTLTLW 180

Query: 261 LTRDSSHDEDSKLLSLLSQSHLHDRLPDSRLPQPPSCNMYWFSLEDDPNFKSGFDICWAR 320
            TRDSSHDED+KLLSLLSQS LHDR P+S LPQPPSCNMYWFS EDDPNFK GFDICWAR
Sbjct: 181 FTRDSSHDEDAKLLSLLSQSPLHDRFPNSCLPQPPSCNMYWFSPEDDPNFKFGFDICWAR 240

Query: 321 LHALGYDIYFRGDHSFSEYPDLFSRDVQLVQGNKLFFQEFENILHLLQVVQFLCWKGKEL 380
           LHALGYDIYF GDH FSEYPDLFS+DVQLV G+K+FFQ+FENILHLLQVVQFLCWKGKEL
Sbjct: 241 LHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKEL 300

Query: 381 DSTNIKEDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSATSDGKENQHWLGWDKL 440
           D+TN+ EDS YAEYLSPKRNVGVSYFKSEFSK+D LAESVFSSATS GKENQHWLGWDKL
Sbjct: 301 DTTNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKL 360

Query: 441 -AAAAAAWEDYASILRRELLGSLSYWRNSQSIYSVSLSS 479
             AAAAAWEDYASILRRELLGS S+WRN QSIYSVSL S
Sbjct: 361 VVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS 397

BLAST of Bhi07G000118 vs. ExPASy TrEMBL
Match: A0A5A7SSL8 (Procollagen-proline 3-dioxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold452G001280 PE=3 SV=1)

HSP 1 Score: 735.7 bits (1898), Expect = 1.2e-208
Identity = 357/399 (89.47%), Positives = 373/399 (93.48%), Query Frame = 0

Query: 81  MGDEVESRRRRRRRLILENFLTREECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAH 140
           M D  ES  R+RRRLILENFL+REECRELEFIHKSCCTVGYRPNV STTLLHLVATNSAH
Sbjct: 1   MVDGAES--RQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAH 60

Query: 141 LIMPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQRDFSA 200
           LI+PFVPIRE+LKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQR+FSA
Sbjct: 61  LIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFSA 120

Query: 201 VCYLNSYGVEFGGGLFHFQDGEPETISPFCGDCVMYTADSDNVHSVDEITNGERLTLTLW 260
           VCYLNSYGVEFGGGLFHFQDGEPETISPF GDCVMYTADSDNVHSVDEITNGERLTLTLW
Sbjct: 121 VCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLW 180

Query: 261 LTRDSSHDEDSKLLSLLSQSHLHDRLPDSRLPQPPSCNMYWFSLEDDPNFKSGFDICWAR 320
            TRDSSHDED+KLLSLLSQS LHDR  +S LPQPPSCNMYWFS E+DPNFK GFDICWAR
Sbjct: 181 FTRDSSHDEDAKLLSLLSQSPLHDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWAR 240

Query: 321 LHALGYDIYFRGDHSFSEYPDLFSRDVQLVQGNKLFFQEFENILHLLQVVQFLCWKGKEL 380
           LHALGYDIYF GDH FSEYPDLFS+DVQLV G+K+FFQ+FENILHLLQVVQFLCWKGKEL
Sbjct: 241 LHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKEL 300

Query: 381 DSTNIKEDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSATSDGKENQHWLGWDKL 440
           D+TN+ EDS YAEYLSPKRNVGVSYFKSEFSK+D LAESVFSSATS GKENQHWLGWDKL
Sbjct: 301 DTTNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKL 360

Query: 441 -AAAAAAWEDYASILRRELLGSLSYWRNSQSIYSVSLSS 479
             AAAAAWEDYASILRRELLGS S+WRN QSIYSVSL S
Sbjct: 361 VVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS 397

BLAST of Bhi07G000118 vs. ExPASy TrEMBL
Match: A0A1S3C486 (Procollagen-proline 3-dioxygenase OS=Cucumis melo OX=3656 GN=LOC103496668 PE=3 SV=1)

HSP 1 Score: 735.7 bits (1898), Expect = 1.2e-208
Identity = 357/399 (89.47%), Positives = 373/399 (93.48%), Query Frame = 0

Query: 81  MGDEVESRRRRRRRLILENFLTREECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAH 140
           M D  ES  R+RRRLILENFL+REECRELEFIHKSCCTVGYRPNV STTLLHLVATNSAH
Sbjct: 1   MVDGAES--RQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAH 60

Query: 141 LIMPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQRDFSA 200
           LI+PFVPIRE+LKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQR+FSA
Sbjct: 61  LIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFSA 120

Query: 201 VCYLNSYGVEFGGGLFHFQDGEPETISPFCGDCVMYTADSDNVHSVDEITNGERLTLTLW 260
           VCYLNSYGVEFGGGLFHFQDGEPETISPF GDCVMYTADSDNVHSVDEITNGERLTLTLW
Sbjct: 121 VCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLW 180

Query: 261 LTRDSSHDEDSKLLSLLSQSHLHDRLPDSRLPQPPSCNMYWFSLEDDPNFKSGFDICWAR 320
            TRDSSHDED+KLLSLLSQS LHDR  +S LPQPPSCNMYWFS E+DPNFK GFDICWAR
Sbjct: 181 FTRDSSHDEDAKLLSLLSQSPLHDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWAR 240

Query: 321 LHALGYDIYFRGDHSFSEYPDLFSRDVQLVQGNKLFFQEFENILHLLQVVQFLCWKGKEL 380
           LHALGYDIYF GDH FSEYPDLFS+DVQLV G+K+FFQ+FENILHLLQVVQFLCWKGKEL
Sbjct: 241 LHALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKEL 300

Query: 381 DSTNIKEDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSATSDGKENQHWLGWDKL 440
           D+TN+ EDS YAEYLSPKRNVGVSYFKSEFSK+D LAESVFSSATS GKENQHWLGWDKL
Sbjct: 301 DTTNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKL 360

Query: 441 -AAAAAAWEDYASILRRELLGSLSYWRNSQSIYSVSLSS 479
             AAAAAWEDYASILRRELLGS S+WRN QSIYSVSL S
Sbjct: 361 VVAAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS 397

BLAST of Bhi07G000118 vs. ExPASy TrEMBL
Match: A0A0A0KMN7 (Procollagen-proline 3-dioxygenase OS=Cucumis sativus OX=3659 GN=Csa_5G289640 PE=3 SV=1)

HSP 1 Score: 726.1 bits (1873), Expect = 9.6e-206
Identity = 356/413 (86.20%), Positives = 371/413 (89.83%), Query Frame = 0

Query: 81  MGDEVESRRRRRRRLILENFLTREECRELEFIHKSCCTVGYRPNVFSTTLLHLVATNSAH 140
           M D  ES  R+RRRLILENFL+REECRELEFIHKSC TVGYRPNVFSTTLLHLVATNSAH
Sbjct: 1   MVDGAES--RQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAH 60

Query: 141 LIMPFVPIRERLKEKAEEFFGCHYELFVEFTGLIS---------------WTRGASIGWH 200
           LI+PFVPIRE+LKEKAEEFFGCHYELFVEFTGLIS               WTRGASIGWH
Sbjct: 61  LIIPFVPIREKLKEKAEEFFGCHYELFVEFTGLISLHSKAHLQPSSSNLGWTRGASIGWH 120

Query: 201 SDDNRPYLKQRDFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFCGDCVMYTADSDNVHS 260
           SDDNRPYLKQR+FSAVCYLNSYGVEFGGGLFHFQDGEPETISPF GDCVMYTAD+DNVHS
Sbjct: 121 SDDNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHS 180

Query: 261 VDEITNGERLTLTLWLTRDSSHDEDSKLLSLLSQSHLHDRLPDSRLPQPPSCNMYWFSLE 320
           VDEITNGERLTLTLW TRDSSHDED+KLLSLLSQS LHDR PDS LPQPPSCNMYWFS E
Sbjct: 181 VDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSPE 240

Query: 321 DDPNFKSGFDICWARLHALGYDIYFRGDHSFSEYPDLFSRDVQLVQGNKLFFQEFENILH 380
           DDPNFK GFDICWARL ALGYD+YF GDH FSEYPDLF +DVQLV G+K+FFQ+FENILH
Sbjct: 241 DDPNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILH 300

Query: 381 LLQVVQFLCWKGKELDSTNIKEDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSAT 440
           LLQVVQFLCWKGKELDSTN+ EDSSYAEYLSPKRNVGVSYFKSEFSK+D LAESVFSSA 
Sbjct: 301 LLQVVQFLCWKGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAA 360

Query: 441 SDGKENQHWLGWDKLAAAAAAWEDYASILRRELLGSLSYWRNSQSIYSVSLSS 479
           SDGKENQ WLGWDKL AAAAAWE YASILRRELLGS S+WRN QSIYSVSL S
Sbjct: 361 SDGKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS 411

BLAST of Bhi07G000118 vs. ExPASy TrEMBL
Match: A0A6J1JQV6 (Procollagen-proline 3-dioxygenase OS=Cucurbita maxima OX=3661 GN=LOC111487019 PE=3 SV=1)

HSP 1 Score: 692.2 bits (1785), Expect = 1.5e-195
Identity = 340/411 (82.73%), Positives = 362/411 (88.08%), Query Frame = 0

Query: 68  LPLKLEETENWTKMGDEVESRRRRRRRLILENFLTREECRELEFIHKSCCTVGYRPNVFS 127
           +PLK  ETENW KMGDE E    +R RLILENFLT EECRELEFIHKSCCTVGYRP VFS
Sbjct: 1   MPLK-RETENWMKMGDEAEI--NQRWRLILENFLTLEECRELEFIHKSCCTVGYRPYVFS 60

Query: 128 TTLLHLVATNSAHLIMPFVPIRERLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSD 187
           TTLLHLV +NSA LIMPFV IRERLKEKAEEFFGC YELFVEFTGLISWTRGA IGWHSD
Sbjct: 61  TTLLHLVVSNSAQLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGARIGWHSD 120

Query: 188 DNRPYLKQRDFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFCGDCVMYTADSDNVHSVD 247
           DNRPYLKQR+F+AVCYLNSYGV+F GGLFHFQDGEP+TISP CGDCVMYTADS NVHSVD
Sbjct: 121 DNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVD 180

Query: 248 EITNGERLTLTLWLTRDSSHDEDSKLLSLLSQSHLHDRLPDSRLPQPPSCNMYWFSLEDD 307
           E+T+GERLTLTLW TRDSSHDED+KLLSLLSQSHLHDRLPDS LPQPPSCNMYWFS +DD
Sbjct: 181 EVTSGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDD 240

Query: 308 PNFKSGFDICWARLHALGYDIYFRGDHSFSEYPDLFSRDVQLVQGNKLFFQEFENILHLL 367
           PNFK GFDICWARLHALGY IYF  DHS SEYPDLFS+DVQLV+GNK+F Q+F++ILH L
Sbjct: 241 PNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHAL 300

Query: 368 QVVQFLCWKGKELDSTNIKEDSSYAEYLSPKRNVGVSYFKSEFSKDDVLAESVFSSATSD 427
           QVVQFL WKGKELDST+ KEDSSYAE LSPKRNVGV +FKSEFSKDD LAESVF  A+SD
Sbjct: 301 QVVQFLYWKGKELDSTDSKEDSSYAEGLSPKRNVGVDFFKSEFSKDDALAESVFLYASSD 360

Query: 428 GKENQHWLGWDKLAAAAAAWEDYASILRRELLGSLSYWRNSQSIYSVSLSS 479
            KE QH LGW KLAA A AWEDYAS LRRELL S ++WR SQSIYSV   S
Sbjct: 361 VKEKQHRLGWAKLAAVAEAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS 408

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
AT1G68080.1 | 1.2e-115 | 55.47 | 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
AT1G68080.3 | 5.2e-103 | 51.56 | 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
AT1G68080.2 | 1.7e-93 | 48.30 | 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
AT1G68080.4 | 1.4e-47 | 44.03 | 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein

Match Name | E-value | Identity | Description
Q5XGE0 | 3.0e-07 | 31.76 | 2-oxoglutarate and iron-dependent oxygenase domain-containing protein 3 OS=Xenop...
A5WFM3 | 9.2e-04 | 28.18 | PKHD-type hydroxylase PsycPRwf_1523 OS=Psychrobacter sp. (strain PRwf-1) OX=3491...

Match Name | E-value | Identity | Description
A0A5D3CRE9 | 2.4e-209 | 89.72 | Procollagen-proline 3-dioxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
A0A5A7SSL8 | 1.2e-208 | 89.47 | Procollagen-proline 3-dioxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C2...
A0A1S3C486 | 1.2e-208 | 89.47 | Procollagen-proline 3-dioxygenase OS=Cucumis melo OX=3656 GN=LOC103496668 PE=3 S...
A0A0A0KMN7 | 9.6e-206 | 86.20 | Procollagen-proline 3-dioxygenase OS=Cucumis sativus OX=3659 GN=Csa_5G289640 PE=...
A0A6J1JQV6 | 1.5e-195 | 82.73 | Procollagen-proline 3-dioxygenase OS=Cucurbita maxima OX=3661 GN=LOC111487019 PE...
InterPro
Analysis Name: InterPro Annotations of Wax gourd (B227) v1
Date Performed: 2021-10-22
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR006620 | Prolyl 4-hydroxylase, alpha subunit | SMART | SM00702 | p4hc | coord: 92..262; e-value: 0.0046; score: 3.0
None | No IPR available | GENE3D | 2.60.120.620 | q2cbj1_9rhob like domain | coord: 74..281; e-value: 1.4E-20; score: 75.8
None | No IPR available | PANTHER | PTHR14049:SF9 | 2-OXOGLUTARATE (2OG) AND FE(II)-DEPENDENT OXYGENASE SUPERFAMILY PROTEIN | coord: 87..474
IPR044862 | Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain | PFAM | PF13640 | 2OG-FeII_Oxy_3 | coord: 177..261; e-value: 3.9E-10; score: 40.5
IPR039575 | Prolyl 3-hydroxylase | PANTHER | PTHR14049 | LEPRECAN 1 | coord: 87..474
IPR005123 | Oxoglutarate/iron-dependent dioxygenase | PROSITE | PS51471 | FE2OG_OXY | coord: 155..263; score: 9.891955

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Bhi07M000118 | Bhi07M000118 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0032963 | collagen metabolic process
biological_process | GO:0019511 | peptidyl-proline hydroxylation
molecular_function | GO:0005506 | iron ion binding
molecular_function | GO:0031418 | L-ascorbic acid binding
molecular_function | GO:0019797 | procollagen-proline 3-dioxygenase activity
molecular_function | GO:0016705 | oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen