CmaCh04G000200 (gene) Cucurbita maxima (Rimu)

Name: CmaCh04G000200
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Prolyl 3-hydroxylase 1
Location: Cma_Chr04 : 95422 .. 99544 (-)
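
The location field can be split into its parts with a short script. The following is a minimal sketch in Python; the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re

# Assumed layout of the location field, as it appears in this record:
#   "<sequence id> : <start> .. <end> (<strand>)"
LOCATION_RE = re.compile(r"^\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text):
    """Split a location string into (seqid, start, end, strand)."""
    match = LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cma_Chr04 : 95422 .. 99544 (-)")
print(seqid, start, end, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the feature spans 4123 bases on the minus strand of Cma_Chr04.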
The following sequences are available for this feature:

Gene sequence (with intron)

TTTTTTTTTAAATATATAAACATATCATGTTTAATATCTATTTTGTTCTCTATTTAAATTTTGTACCGGCTTCTCATCGATGGCGCGTGTTTACGAGAGAGGGATGCATGGAGAAGAAGCAGTCGCCTCGCCATTAAGGATTTCATTCCATTGTTGTTGCAAATTGGGATATCGCTTTCGGTAATTCTCTCTCCAAAACCTGTAATCCCCAATTGACTCTCCTTAATATTATAGTAATCCTTAGAATCATTCTCAAACAGCACAGTTGAACATTGTTTTATATATGTCACACAAATCGTTGTATCTTTGTATATGTTGAAGGTATATTTTTATGGTCAAGGGCATTCCATCCGATTAATGCCATTGAAACGAGAGACGGAGAATTGGATGAAAATGGGAGACGAAGCGGAGATCAATCAGCGGTGGCGTCTCATTCTGGAAAATTTCTTAACCCTCGAAGAATGCAGGGAACTGGAGTTCATCCATAAGAGCTGCTGTACGGTGGGTTATAGACCATACGTCTTCTCCACCACTCTTTTGCATCTTGTTGTCTCTAATTCTGCTCAATTGATCATGCCTTTTGTTTCGATTAGAGGTATGACCACCCATTCCTCCATTTTCTCCATTTTTTCCTTTTTGGTAATGTTTATGCCTAATTTTGATTAGAGTTCAATATTTGTGACAGAGCGGTTGAAGGAGAAAGCGGAGGAGTTCTTTGGCTGTGAATATGAGCTCTTCGTCGAGTTCACTGGCTTGATAAGGTTATTCTTTTTCCTCCTTATGTGTTAAAGGCTTAAATGGGTGTTAGTCTCTTCTTGCGTAAACCCCAAATTTTGAAGAATGAACATAGAGAGTCAGACGGCCATAAATTAGAAGGCTGAGTCTTTCTTTGTCATTGAAAATCGTCTATATGGTTAACTTTAGTTCAGGGAGTCTGACGGCCATGTAACAGGAGTACGTTATAATCAGCCCAGTCTGGTATGGGCCTTCGCAGGTTGGTTCTTGTGATCCACGGTTTAATTTCTGTGTGTTTAGTATGACCCGTGGAAGGACTAAATCTTGGAATTTCAACGACTTACCTAGACCCGAATACATAAAAAGTTCAAAATCTTGCTTCATAGTTTGTAACAGTGATTGGTTCTTCCTCAAAAATACTTGCAATAATACTGGGTTGGTTGAATGCCATTACAACCGTTTCTTTTGAATATTTGGATAGAATGGAATAAAAGAAATTCTCAAGATTGATACAAAAATCATTGTGAAATTTGGGATTCATTCTATCTTGTTTTTTTTTATTCACTTGGGAGTGTTCTTTTCTTTCTTGTTCTGGCAATCTCATTTGTGATTAATGTCTATATATTTTTCAATGAAATATCTTTTTTTTCGCTCTCTTTTGTGAACTATATAGAATACTTCATTTGATTTAATTTTCATTTTCAAAAAAAAGTGTAGTAGAATCTGTATTTTTCTTCTAACATTCTGTCGTAGTTCAGCTGGACCAGGGGAGCAAGGATTGGATGGCATAGTGACGACAACCGGCCCTATCTAAAACAACGTGAATTTACAGTATGTGGACCAATCTTTCCTGGGAAAGTCAGATGTCATTTTTTGCTATCCCTTCGACAACCTATTCTTTCATTTCAATTGTAGTTTTATATCCTTTTTATTCTCTCTTGGACTTCTTTTCTGTTTTTCTTTCTGACAAGGGGGAGGGTGGTTCAATTCCTATAGTACCTTCTCCCTGAAATTCATTTGGTATTTCTTCTCCATCATTACAGTACTCTAGCTGATGGTTGTCTTCGTGTATTTGGCAAAAGGCAGTGTGTTACTTGAATAGTTATGGAGTAGATTTCGAAGGTGGGCTGTTTCACTTTCAGGACGGGGAACCAAAAACTATCTCGCCTCTTTGTGGAGTAAGTTGATTGATAAAAGCTTTTTACCTCTTCCATTTATTTTACTCGGTACATACAGCAATTTTTCTAATAAAAAAAATTGTACAT
TTCTATGTTTGTCAACATGAATGTAGTTCAATTGGTTAAGACATTAAAACCTCAATTGAAAGGTGGAGGTTCAAATCTCATCCTCTCATATTGTAGAACTCGAAAATGGTACATATGCTTCTTTGAAGGATGTGCTTGTGCTCTTTTGAGCGAAGCAATCTTCAGTATTTTTTGGTCAATAGAAATAATAAGTGGAGCTTAATCCTCGGCAGTATGAATTATAGCCCATGTTTCATTGCTGTTCTTATAGTTCCCTTTCTAATTATTTTAGTAATTATGCCTTCTTTGTAAATTATAATTAATTCTAATTGTGGGGATATTCTATAAGAACGTTTGACTTTGATGTAAATGGAGATGAGTCATGCAGAAAAGGGCATTTTCTCTGAATTGTTGGATTTCGGTGATAGGTTTTTAGTTCATTTATCTATATCAATGAAAAGTTGTTTTTCCTCTTACAAGAAATGCCATATTTGTCTGAATAATTGAAGGTGGTGGATTATATTTGCTAGTAGTTTGAAACAATGTACCAATCAAGTATAGTTAATACATGTTTTGCTTTATCAGGATTGTGTGATGTACACGGCTGACAGCCTCAATGTTCATTCTGTTGATGAGGTATGTCCAAACACTCTGTCCCTCTGAATCTGCATGTGAAATCATTGTTTAAATGAATAACATCAAATCTCTCCCATGGTTCTTTGAGGAATAATCTAACTTTTATTGAGAGGAAATGAAACAATACAAGGAATATAAAATTAAAGCCGACAAATGGAAGTCAAACTATATAAAAAAAGGCTTCAATCCAGTAAAATAAGATCAAAGGGTAATTCAAAAAAGCCTCGTCATCGAAGCCGAGAGAGAGCACATTAATTCTTTCTTGCTGTTTTTCTTTTACTGTAATTGAGGGATCATGCATGTTGATTAATCCTCAAGAGAGTCGCTAGTAGCCCTCTCGTAGACAGCTGTACTTCATCTTGCTTACCACGTGGTTTCCTTGGATAGGTAACCAGTGGAGAAAGACTTACGCTGACATTATGGTTCACCCGTGATAGTTCCCATGATGAAGATGCAAAACTTCTTTCCCTTCTTTCACAAAGCCATTTACACGATCGTCTTCCTGACTCATGCCTACCTCAGCCTCCATCCTGTAATATGTATTGGTTTTCACCAAAAGACGATCCAAATTTCAAGTTCGGTTTTGATATATGTTGGGCAAGACTGCATGCGCTAGGATACGGCATTTATTTTCCTCAGGACCATAGTTTGTCAGAGTATCCAGATTTATTCTCACAGGATGTACAATTAGTACGGGGTAATAAGATCTTCTCTCAGAAGTTTGATAGCATTTTGCATGCACTTCAGGTATTCCGTCGAAATCACCCTCAACATTATTCCCCCTTGTGATTATATAGCAGAATCTGGTCGTTTTTCACGTAACGAGAAAAAAAAAGTTGTCATATTCAGAATCGTCAAATACTATAATTAATTCACTTCGTTTTATGTAGTTTCCTCATTGAAAAAGTCTGCTAAAACGAGTTACCTACACCTGCTTAAACAGGTAGTGCAATTTCTATATTGGAAAGGCAAAGAATTGGATTCTACTGACTCCAAGGAAGATTCAAGCTATGCAGAAGGTTTATCTCCAAAGAGAAATGTGGGAGTCGATTTCTTTAAATCCGAGTTCTCAAAGGACGATGCACTGGCGGAGTCAGTCTTCTTGTATGCTAGTTCTGATGTCAAGGAGAAGCAACACCGGTTGGGGTGGGCTAAGCTTGCTGCAGTAGCAGAAGCTTGGGAAGATTATGCTTCCAATTTAAGGAGAGAACTCCTTCGGAGCTTCGCCCATTGGAGAACCAGTCAATCCATATACAGTGTTCCATATGGTAGTTGACTCTTCCACTTCTGGGATAGTGGCAGTCTCAAATGCTAGAAGTAGCTGAGCTTCAAGGTCAGTTATGGGCCTTTGTACAGCTTAACTAGATTTTACTTGTCCCAAGTGT
GCAATCCTTTGTGTTATATTCTTACCCTTTTCAACGATTTAGCCATTTTTCACCGTTTTTAAAGGGATATCATTTCTGTTTCTAAAAAACTTGGTTCTCGTTTTAGTGTCTTAGTATCTGTCT

mRNA sequence

TTTTTTTTTAAATATATAAACATATCATGTTTAATATCTATTTTGTTCTCTATTTAAATTTTGTACCGGCTTCTCATCGATGGCGCGTGTTTACGAGAGAGGGATGCATGGAGAAGAAGCAGTCGCCTCGCCATTAAGGATTTCATTCCATTGTTGTTGCAAATTGGGATATCGCTTTCGGTATATTTTTATGGTCAAGGGCATTCCATCCGATTAATGCCATTGAAACGAGAGACGGAGAATTGGATGAAAATGGGAGACGAAGCGGAGATCAATCAGCGGTGGCGTCTCATTCTGGAAAATTTCTTAACCCTCGAAGAATGCAGGGAACTGGAGTTCATCCATAAGAGCTGCTGTACGGTGGGTTATAGACCATACGTCTTCTCCACCACTCTTTTGCATCTTGTTGTCTCTAATTCTGCTCAATTGATCATGCCTTTTGTTTCGATTAGAGAGCGGTTGAAGGAGAAAGCGGAGGAGTTCTTTGGCTGTGAATATGAGCTCTTCGTCGAGTTCACTGGCTTGATAAGCTGGACCAGGGGAGCAAGGATTGGATGGCATAGTGACGACAACCGGCCCTATCTAAAACAACGTGAATTTACAGCAGTGTGTTACTTGAATAGTTATGGAGTAGATTTCGAAGGTGGGCTGTTTCACTTTCAGGACGGGGAACCAAAAACTATCTCGCCTCTTTGTGGAGATTGTGTGATGTACACGGCTGACAGCCTCAATGTTCATTCTGTTGATGAGGTAACCAGTGGAGAAAGACTTACGCTGACATTATGGTTCACCCGTGATAGTTCCCATGATGAAGATGCAAAACTTCTTTCCCTTCTTTCACAAAGCCATTTACACGATCGTCTTCCTGACTCATGCCTACCTCAGCCTCCATCCTGTAATATGTATTGGTTTTCACCAAAAGACGATCCAAATTTCAAGTTCGGTTTTGATATATGTTGGGCAAGACTGCATGCGCTAGGATACGGCATTTATTTTCCTCAGGACCATAGTTTGTCAGAGTATCCAGATTTATTCTCACAGGATGTACAATTAGTACGGGGTAATAAGATCTTCTCTCAGAAGTTTGATAGCATTTTGCATGCACTTCAGGTAGTGCAATTTCTATATTGGAAAGGCAAAGAATTGGATTCTACTGACTCCAAGGAAGATTCAAGCTATGCAGAAGGTTTATCTCCAAAGAGAAATGTGGGAGTCGATTTCTTTAAATCCGAGTTCTCAAAGGACGATGCACTGGCGGAGTCAGTCTTCTTGTATGCTAGTTCTGATGTCAAGGAGAAGCAACACCGGTTGGGGTGGGCTAAGCTTGCTGCAGTAGCAGAAGCTTGGGAAGATTATGCTTCCAATTTAAGGAGAGAACTCCTTCGGAGCTTCGCCCATTGGAGAACCAGTCAATCCATATACAGTGTTCCATATGGTAGTTGACTCTTCCACTTCTGGGATAGTGGCAGTCTCAAATGCTAGAAGTAGCTGAGCTTCAAGGTCAGTTATGGGCCTTTGTACAGCTTAACTAGATTTTACTTGTCCCAAGTGTGCAATCCTTTGTGTTATATTCTTACCCTTTTCAACGATTTAGCCATTTTTCACCGTTTTTAAAGGGATATCATTTCTGTTTCTAAAAAACTTGGTTCTCGTTTTAGTGTCTTAGTATCTGTCT

Coding sequence (CDS)

ATGCCATTGAAACGAGAGACGGAGAATTGGATGAAAATGGGAGACGAAGCGGAGATCAATCAGCGGTGGCGTCTCATTCTGGAAAATTTCTTAACCCTCGAAGAATGCAGGGAACTGGAGTTCATCCATAAGAGCTGCTGTACGGTGGGTTATAGACCATACGTCTTCTCCACCACTCTTTTGCATCTTGTTGTCTCTAATTCTGCTCAATTGATCATGCCTTTTGTTTCGATTAGAGAGCGGTTGAAGGAGAAAGCGGAGGAGTTCTTTGGCTGTGAATATGAGCTCTTCGTCGAGTTCACTGGCTTGATAAGCTGGACCAGGGGAGCAAGGATTGGATGGCATAGTGACGACAACCGGCCCTATCTAAAACAACGTGAATTTACAGCAGTGTGTTACTTGAATAGTTATGGAGTAGATTTCGAAGGTGGGCTGTTTCACTTTCAGGACGGGGAACCAAAAACTATCTCGCCTCTTTGTGGAGATTGTGTGATGTACACGGCTGACAGCCTCAATGTTCATTCTGTTGATGAGGTAACCAGTGGAGAAAGACTTACGCTGACATTATGGTTCACCCGTGATAGTTCCCATGATGAAGATGCAAAACTTCTTTCCCTTCTTTCACAAAGCCATTTACACGATCGTCTTCCTGACTCATGCCTACCTCAGCCTCCATCCTGTAATATGTATTGGTTTTCACCAAAAGACGATCCAAATTTCAAGTTCGGTTTTGATATATGTTGGGCAAGACTGCATGCGCTAGGATACGGCATTTATTTTCCTCAGGACCATAGTTTGTCAGAGTATCCAGATTTATTCTCACAGGATGTACAATTAGTACGGGGTAATAAGATCTTCTCTCAGAAGTTTGATAGCATTTTGCATGCACTTCAGGTAGTGCAATTTCTATATTGGAAAGGCAAAGAATTGGATTCTACTGACTCCAAGGAAGATTCAAGCTATGCAGAAGGTTTATCTCCAAAGAGAAATGTGGGAGTCGATTTCTTTAAATCCGAGTTCTCAAAGGACGATGCACTGGCGGAGTCAGTCTTCTTGTATGCTAGTTCTGATGTCAAGGAGAAGCAACACCGGTTGGGGTGGGCTAAGCTTGCTGCAGTAGCAGAAGCTTGGGAAGATTATGCTTCCAATTTAAGGAGAGAACTCCTTCGGAGCTTCGCCCATTGGAGAACCAGTCAATCCATATACAGTGTTCCATATGGTAGTTGA

Protein sequence

MPLKRETENWMKMGDEAEINQRWRLILENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAQLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHALQVVQFLYWKGKELDSTDSKEDSSYAEGLSPKRNVGVDFFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAEAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS
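
The relationship between the coding sequence and the protein sequence can be checked by translating codons with the standard genetic code. The sketch below is illustrative only: `translate` is a hypothetical helper, it assumes the standard code (NCBI translation table 1) applies to this nuclear gene, and it uses just the first 30 and last 9 nucleotides of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 9 nucleotides of the CDS in this record.
print(translate("ATGCCATTGAAACGAGAGACGGAGAATTGG"))
print(translate("GGTAGTTGA"))
```

The two calls print `MPLKRETENW` and `GS`, which agree with the first ten and last two residues of the protein sequence; the final TGA codon is the stop.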
BLAST of CmaCh04G000200 vs. Swiss-Prot
Match: P3H1_HUMAN (Prolyl 3-hydroxylase 1 OS=Homo sapiens GN=P3H1 PE=1 SV=2)

HSP 1 Score: 73.6 bits (179), Expect = 5.8e-12
Identity = 54/222 (24.32%), Positives = 96/222 (43.24%), Query Frame = 1

Query: 19  INQRWRLILENFLTLEECRELEFIHKSCCTVG----------------YRPYVFSTTLLH 78
           +N   R++++  ++  EC+EL+ +     T G                Y   VF    L 
Sbjct: 466 LNGSQRVVMDGVISDHECQELQRLTNVAATSGDGYRGQTSPHTPNEKFYGVTVFKALKLG 525

Query: 79  LVVSNSAQLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGARI-GWHSDDNRP 138
                  Q    + ++ E+++   E +F  +  L+  ++ L+  T    +     DD+ P
Sbjct: 526 QEGKVPLQSAHLYYNVTEKVRRIMESYFRLDTPLYFSYSHLVCRTAIEEVQAERKDDSHP 585

Query: 139 Y--------------LKQ------REFTAVCYLNSYGVDFEGGLFHFQDGEPKTIS---- 198
                          +K+      R+++A+ YLN    DF+GG F+F + + KT++    
Sbjct: 586 VHVDNCILNAETLVCVKEPPAYTFRDYSAILYLNG---DFDGGNFYFTELDAKTVTAEVQ 645

Query: 199 PLCGDCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDE 200
           P CG  V +++ + N H V  VT G+R  + LWFT D  H E
Sbjct: 646 PQCGRAVGFSSGTENPHGVKAVTRGQRCAIALWFTLDPRHSE 684
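
The percentages on each HSP summary line are the identical and positive-scoring column counts divided by the alignment length. As a rough sketch, the line can be parsed and the percentages recomputed as follows; `parse_hsp_stats` and `percent` are hypothetical names, and the regular expression assumes the exact line layout used in this record.

```python
import re

# Matches the per-HSP summary line. "Posi?tives" tolerates a "Postives"
# misspelling as well as "Positives".
HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract counts and reported percentages from an HSP summary line."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, length, ident_pct, pos, _pos_length, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(length),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


def percent(count, length):
    """Percentage of alignment columns, rounded to two decimals."""
    return round(100.0 * count / length, 2)


stats = parse_hsp_stats(
    "Identity = 54/222 (24.32%), Positives = 96/222 (43.24%), Query Frame = 1"
)
print(percent(stats["identical"], stats["aligned_length"]))
print(percent(stats["positives"], stats["aligned_length"]))
```

For the P3H1_HUMAN hit, 54 of 222 and 96 of 222 aligned columns reproduce the reported 24.32% identity and 43.24% positives.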

BLAST of CmaCh04G000200 vs. Swiss-Prot
Match: P3H1_MOUSE (Prolyl 3-hydroxylase 1 OS=Mus musculus GN=P3h1 PE=1 SV=2)

HSP 1 Score: 73.6 bits (179), Expect = 5.8e-12
Identity = 53/222 (23.87%), Positives = 101/222 (45.50%), Query Frame = 1

Query: 19  INQRWRLILENFLTLEECRELEFIHKSCCTVG--YR----PYVFSTTLLHLVVSNSAQL- 78
           +N   R++++  ++ +EC+EL+ +  +  T G  YR    P+  +     + V  + +L 
Sbjct: 469 LNGSQRVVMDGVISDDECQELQRLTNAAATSGDGYRGQTSPHTPNEKFYGVTVLKALKLG 528

Query: 79  ---------IMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWT---------RGARIG 138
                       + ++ E+++   E +F  +  L+  ++ L+  T         + +   
Sbjct: 529 QEGKVPLQSARMYYNVTEKVRRVMESYFRLDTPLYFSYSHLVCRTAIEESQAERKDSSHP 588

Query: 139 WHSDD------------NRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTIS---- 198
            H D+              P    R+++A+ YLN    DF+GG F+F + + KT++    
Sbjct: 589 VHVDNCILNAEALMCIKEPPAYTFRDYSAILYLNG---DFDGGNFYFTELDAKTVTAEVQ 648

Query: 199 PLCGDCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDE 200
           P CG  V +++ + N H V  VT G+R  + LWFT D  H E
Sbjct: 649 PQCGRAVGFSSGTENPHGVKAVTRGQRCAIALWFTLDPRHSE 687

BLAST of CmaCh04G000200 vs. Swiss-Prot
Match: P3H1_RAT (Prolyl 3-hydroxylase 1 OS=Rattus norvegicus GN=P3h1 PE=1 SV=1)

HSP 1 Score: 73.2 bits (178), Expect = 7.5e-12
Identity = 53/222 (23.87%), Positives = 101/222 (45.50%), Query Frame = 1

Query: 19  INQRWRLILENFLTLEECRELEFIHKSCCTVG--YR----PYVFSTTLLHLVVSNSAQLI 78
           +N   R++++  ++ +EC+EL+ +  +  T G  YR    P+  +     + V  + +L 
Sbjct: 458 LNGSQRVVMDGVISDDECQELQRLTNAAATSGDGYRGQTSPHTPNEKFYGVTVLKALKLG 517

Query: 79  MP----------FVSIRERLKEKAEEFFGCEYELFVEFTGLISWT---------RGARIG 138
                       + ++ E+++   E +F  +  L+  ++ L+  T         + +   
Sbjct: 518 QEGKVPLQSAHMYYNVTEKVRRVMESYFRLDTPLYFSYSHLVCRTAIEESQAERKDSSHP 577

Query: 139 WHSDD------------NRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTIS---- 198
            H D+              P    R+++A+ YLN    DF+GG F+F + + KT++    
Sbjct: 578 VHVDNCILNAESLVCIKEPPAYTFRDYSAILYLNG---DFDGGNFYFTELDAKTVTAEVQ 637

Query: 199 PLCGDCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDE 200
           P CG  V +++ + N H V  VT G+R  + LWFT D  H E
Sbjct: 638 PQCGRAVGFSSGTENPHGVKAVTRGQRCAIALWFTLDPRHSE 676

BLAST of CmaCh04G000200 vs. Swiss-Prot
Match: P3H2_RAT (Prolyl 3-hydroxylase 2 OS=Rattus norvegicus GN=P3h2 PE=1 SV=1)

HSP 1 Score: 72.4 bits (176), Expect = 1.3e-11
Identity = 66/238 (27.73%), Positives = 103/238 (43.28%), Query Frame = 1

Query: 8   ENWMKMGDEAEINQRWRLILENFLTLEECRELEFIHKSCCTV--GYR----------PYV 67
           EN   + +  ++N   R++L+N L+ E+CREL  +      V  GYR           + 
Sbjct: 443 ENITFVYNSEQLNGTQRVLLDNVLSEEQCRELHSVASGIMLVGDGYRGKTSPHTPNEKFE 502

Query: 68  FSTTLLHL-------VVSNSAQLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTR 127
            +T L  L       V   SA+L   F  I E+ ++  E +F     L+  +T ++   R
Sbjct: 503 GATVLKALKFGYEGRVPLKSARL---FYDISEKARKIVESYFMLNSTLYFSYTHMV--CR 562

Query: 128 GARIGW-----------HSDD------------NRPYLKQREFTAVCYLNSYGVDFEGGL 187
            A  G            H+D+              P    R+++A+ Y+N    DFEGG 
Sbjct: 563 TALSGQQDRRNDLSHPIHADNCLLDPEANECWKEPPAYTFRDYSALLYMND---DFEGGE 622

Query: 188 FHFQDGEPKT----ISPLCGDCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDE 200
           F F + + KT    I P CG  + +++   N H V  VT G+R  + LWFT D  + E
Sbjct: 623 FIFTEMDAKTVTASIKPKCGRMISFSSGGENPHGVKAVTRGQRCAVALWFTLDPLYRE 672

BLAST of CmaCh04G000200 vs. Swiss-Prot
Match: P3H2_CHICK (Prolyl 3-hydroxylase 2 OS=Gallus gallus GN=P3H2 PE=2 SV=1)

HSP 1 Score: 72.0 bits (175), Expect = 1.7e-11
Identity = 64/244 (26.23%), Positives = 106/244 (43.44%), Query Frame = 1

Query: 18  EINQRWRLILENFLTLEECRELEFIHKSCCTVG--YRP----------YVFSTTLLHL-- 77
           ++N   R++L+N ++ E+CREL  +       G  YR           +  +T L  L  
Sbjct: 444 QLNGTQRVLLDNVISEEQCRELHRVASGIMLAGDGYRGKTSPHTPNERFEGATVLKALKY 503

Query: 78  -----VVSNSAQLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWT---------RG 137
                V   SA+L   F  I E+ +   E +F     L+  +T L+  T           
Sbjct: 504 GYEGRVPLKSARL---FYDISEKARRIVESYFMLNSTLYFSYTHLVCRTALSGQQERRND 563

Query: 138 ARIGWHSDD------------NRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTIS 197
                H+D+              P    R+++A+ Y+N+   DFEGG F F + + KT++
Sbjct: 564 LSHPIHADNCLLDPEANECWKEPPAYTFRDYSALLYMNA---DFEGGEFIFTEMDAKTVT 623

Query: 198 ----PLCGDCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDEDAKL-----LSLLS 213
               P CG  V +++   N H V  VT G+R  + LWFT D  + E  ++     +++L 
Sbjct: 624 ASIKPKCGRMVSFSSGGENPHGVKAVTKGQRCAVALWFTLDPLYRELERIQADEVIAMLD 681

BLAST of CmaCh04G000200 vs. TrEMBL
Match: A0A0A0KMN7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G289640 PE=4 SV=1)

HSP 1 Score: 657.5 bits (1695), Expect = 1.0e-185
Identity = 324/411 (78.83%), Positives = 344/411 (83.70%), Query Frame = 1

Query: 13  MGDEAEINQRWRLILENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAQLI 72
           M D AE  QR RLILENFL+ EECRELEFIHKSC TVGYRP VFSTTLLHLV +NSA LI
Sbjct: 1   MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLI 60

Query: 73  MPFVSIRERLKEKAEEFFGCEYELFVEFTGLIS---------------WTRGARIGWHSD 132
           +PFV IRE+LKEKAEEFFGC YELFVEFTGLIS               WTRGA IGWHSD
Sbjct: 61  IPFVPIREKLKEKAEEFFGCHYELFVEFTGLISLHSKAHLQPSSSNLGWTRGASIGWHSD 120

Query: 133 DNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVD 192
           DNRPYLKQREF+AVCYLNSYGV+F GGLFHFQDGEP+TISP  GDCVMYTAD+ NVHSVD
Sbjct: 121 DNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVD 180

Query: 193 EVTSGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDD 252
           E+T+GERLTLTLWFTRDSSHDEDAKLLSLLSQS LHDR PDSCLPQPPSCNMYWFSP+DD
Sbjct: 181 EITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDD 240

Query: 253 PNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHAL 312
           PNFKFGFDICWARL ALGY +YFP DH  SEYPDLF QDVQLV G+KIF QKF++ILH L
Sbjct: 241 PNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLL 300

Query: 313 QVVQFLYWKGKELDSTDSKEDSSYAEGLSPKRNVGVDFFKSEFSKDDALAESVFLYASSD 372
           QVVQFL WKGKELDST+  EDSSYAE LSPKRNVGV +FKSEFSK+D LAESVF  A+SD
Sbjct: 301 QVVQFLCWKGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASD 360

Query: 373 VKEKQHRLGWAKLAAVAEAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS 409
            KE Q  LGW KL A A AWE YAS LRRELL SF+HWR  QSIYSV   S
Sbjct: 361 GKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS 411

BLAST of CmaCh04G000200 vs. TrEMBL
Match: V4UGA2_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10015473mg PE=4 SV=1)

HSP 1 Score: 475.3 bits (1222), Expect = 7.3e-131
Identity = 232/389 (59.64%), Positives = 281/389 (72.24%), Query Frame = 1

Query: 20  NQRWRLILENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAQLIMPFVSIR 79
           ++R R+I+ N L+ EEC ELE IHKSC TVGYRP VFSTTL HL+ +NS+  I+PFV IR
Sbjct: 8   SERRRVIVRNMLSKEECEELELIHKSCSTVGYRPNVFSTTLSHLIATNSSHFIVPFVPIR 67

Query: 80  ERLKEKAEEFFGCEYELFVEFTGLISWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGV 139
           ERLKEK EEFFGCE+EL +EFTGLISW RGA IGWH DDNRPYLKQR FTAVCYLNSYG 
Sbjct: 68  ERLKEKVEEFFGCEFELVIEFTGLISWARGASIGWHCDDNRPYLKQRHFTAVCYLNSYGK 127

Query: 140 DFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDE 199
           DF+GGLF FQDGEPKT +P  GD  MYTADS NVHSVDEVT GERLTLTLWF+RDSSHDE
Sbjct: 128 DFQGGLFRFQDGEPKTFAPSAGDVAMYTADSRNVHSVDEVTHGERLTLTLWFSRDSSHDE 187

Query: 200 DAKLLSLLSQSHLH--DRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYG 259
           DAKL+S+LSQ  LH  D++P  CLP P S NMYWFSP      + G +ICWAR++ LGY 
Sbjct: 188 DAKLISILSQKLLHRSDKVPQLCLPLPASSNMYWFSPNQASPDELGCNICWARMNVLGYD 247

Query: 260 IYFPQDHSLS-EYPDLFSQDVQLVRGNKIFSQKFDSILHALQVVQFLYWKGKELDSTDSK 319
           IY+ Q+ S + +  +L  + +QL RG+ +F Q F +ILHALQVVQF +WK  E  ++  +
Sbjct: 248 IYYSQNTSSALDCSELLLEPLQLARGDNLFHQPFANILHALQVVQFFHWKASEFPTSKFE 307

Query: 320 EDSSYAEGLSPKRNVGVDFFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAEA 379
            ++S    LS  +   +   KS F K++ LAE+VF     + KE+Q    WA  +A   A
Sbjct: 308 TEASKVLHLSQSQKENISNLKSVFVKNNQLAETVFRPVIINEKEQQ-SFSWANFSAAVTA 367

Query: 380 WEDYASNLRRELLRSFAHWRTSQSIYSVP 406
           WEDY   L ++LL S  HWRT QSI+S P
Sbjct: 368 WEDYIRKLHKQLLNSLPHWRTHQSIFSCP 395

BLAST of CmaCh04G000200 vs. TrEMBL
Match: F6HFA8_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g06280 PE=4 SV=1)

HSP 1 Score: 474.2 bits (1219), Expect = 1.6e-130
Identity = 229/393 (58.27%), Positives = 283/393 (72.01%), Query Frame = 1

Query: 24  RLILENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAQLIMPFVSIRERLK 83
           R+IL+NF+++EEC+ELEFIHKSCCTVGYRP VFSTTL HL+ + S  LI+PFV IRERLK
Sbjct: 8   RVILKNFVSVEECKELEFIHKSCCTVGYRPNVFSTTLSHLIATRSPHLILPFVPIRERLK 67

Query: 84  EKAEEFFGCEYELFVEFTGLISWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEG 143
           EK EE FGCEYELF+EFTGLISWTRGA IGWHSDDNRPYLKQR+F AVCYLNSYG DF+G
Sbjct: 68  EKLEECFGCEYELFIEFTGLISWTRGASIGWHSDDNRPYLKQRDFAAVCYLNSYGNDFKG 127

Query: 144 GLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDEDAKL 203
           GLFHFQDG+P TI PL GD VMYTAD  N+HSVDE+T GERLTLTLWF+RD SHDEDAKL
Sbjct: 128 GLFHFQDGDPTTIEPLAGDVVMYTADCRNIHSVDEITDGERLTLTLWFSRDCSHDEDAKL 187

Query: 204 LSLLSQSHLH--DRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFP 263
           + LLSQS LH  +  PD  LP P S +MYWFSP      + GFDICWAR+H LGY ++ P
Sbjct: 188 VCLLSQSQLHSSNNEPDPYLPLPASSSMYWFSPDHISQHQSGFDICWARMHILGYDLFSP 247

Query: 264 QDHSL-------SEYPDLFSQDVQLVRGNKIFSQKFDSILHALQVVQFLYWKGKELDSTD 323
           QD S         ++ +   + +QL RG+++F  +F +ILH LQVVQF  WK  +L ++ 
Sbjct: 248 QDKSCFSALDSSCDFSERLMEQLQLARGDELFDLEFVNILHVLQVVQFYSWKASKLQTSK 307

Query: 324 -SKEDSSYAEGLSPKRNVGVDFFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAV 383
             +E  +    LS  +   ++  ++ F  D  LAE+V    +S  + +QH   W   +A 
Sbjct: 308 VERETENLVVKLSESQREKINNLRTTFLNDQQLAETVL--GTSCGESRQHSFQWVSFSAA 367

Query: 384 AEAWEDYASNLRRELLRSFAHWRTSQSIYSVPY 407
             AWEDY   LR+EL+ S  +WRT Q+I+SVP+
Sbjct: 368 VGAWEDYTRELRKELVLSLPYWRTHQAIFSVPF 398

BLAST of CmaCh04G000200 vs. TrEMBL
Match: M5Y3D7_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa017932mg PE=4 SV=1)

HSP 1 Score: 472.2 bits (1214), Expect = 6.2e-130
Identity = 234/394 (59.39%), Positives = 282/394 (71.57%), Query Frame = 1

Query: 13  MGDEAEINQRWR-LILENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAQL 72
           MGD  E  +  R LIL NFL+ +EC+ELEFIHKS CTVGYRP+VFSTTL HL+ +NSA L
Sbjct: 1   MGDPEEAAEHGRRLILHNFLSFQECKELEFIHKSNCTVGYRPHVFSTTLSHLIATNSAHL 60

Query: 73  IMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGARIGWHSDDNRPYLKQREFTAV 132
           IMPFV IRERLKEK EEFFGC+YELFVEFTGLISW+RG+ IGWHSDDNRPYLKQR+F AV
Sbjct: 61  IMPFVPIRERLKEKVEEFFGCQYELFVEFTGLISWSRGSSIGWHSDDNRPYLKQRDFAAV 120

Query: 133 CYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERLTLTLWF 192
           CYLNSYG DF GGLFHFQDG+P TI P  GD V+YTADS N+HSVDE+T GERLTL LWF
Sbjct: 121 CYLNSYGNDFRGGLFHFQDGDPATIVPSGGDVVIYTADSRNIHSVDEITDGERLTLALWF 180

Query: 193 TRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFSP-KDDPNFKFGFDICWAR 252
           +RD+++DEDAKL++LLS++ LHD  P+ CLP P S NMYWFSP +   + + GFDICWAR
Sbjct: 181 SRDATYDEDAKLITLLSKNFLHDNAPELCLPFPASSNMYWFSPDQASSDQQLGFDICWAR 240

Query: 253 LHALGYGIYFPQDHS-LSEYPDLFSQDVQLVRGNKIFSQKFDSILHALQVVQFLYWKGKE 312
           LH LGY + F QD S  S    L  + ++L RG+++F  +F +ILHALQVVQF  WK  +
Sbjct: 241 LHVLGYDLLFHQDKSYCSNISKLLMEPLRLTRGDELFEHEFINILHALQVVQFYCWKAPD 300

Query: 313 LDSTDSKEDSSYAEGLSPKRNVGVDFFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAK 372
             S   +E ++    LS  +   +   KS F+KD  L +SVF   +  V   QH   W  
Sbjct: 301 FKSAKVEETTTVV--LSQSQRERLVCLKSLFAKDVCLVDSVFSNVTF-VGSAQHSFNWVD 360

Query: 373 LAAVAEAWEDYASNLRRELLRSFAHWRTSQSIYS 404
                  WEDY   L REL+ S  HWRT QSI++
Sbjct: 361 FRIAIAKWEDYVRKLHRELVMSLPHWRTQQSIFN 391

BLAST of CmaCh04G000200 vs. TrEMBL
Match: A0A067F0I4_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g015648mg PE=4 SV=1)

HSP 1 Score: 469.5 bits (1207), Expect = 4.0e-129
Identity = 230/389 (59.13%), Positives = 278/389 (71.47%), Query Frame = 1

Query: 20  NQRWRLILENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAQLIMPFVSIR 79
           ++R R+I+ N L+ EEC ELE IHKSC TVGYRP VFSTTL HL+ +NS+  I+PFV IR
Sbjct: 8   SERRRVIVRNILSKEECEELELIHKSCSTVGYRPNVFSTTLSHLIATNSSHFIVPFVPIR 67

Query: 80  ERLKEKAEEFFGCEYELFVEFTGLISWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGV 139
           ERLKEK EEFFGCE+EL +EFTGLISW RGA IGWH DDNRPYLKQR FTAVCYLNSYG 
Sbjct: 68  ERLKEKVEEFFGCEFELVIEFTGLISWARGASIGWHCDDNRPYLKQRHFTAVCYLNSYGK 127

Query: 140 DFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDE 199
           DF+GGLF FQDGE K  +P  GD  MYTADS NVHSVDEVT GERLTLTLWF+RDSSHDE
Sbjct: 128 DFQGGLFRFQDGETKNFAPSAGDVAMYTADSRNVHSVDEVTHGERLTLTLWFSRDSSHDE 187

Query: 200 DAKLLSLLSQSHLH--DRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYG 259
           DAKL+S+LSQ  LH  D++P  CLP P S NMYWFSP      + G +ICWAR+  LGY 
Sbjct: 188 DAKLISILSQKLLHRSDKVPQLCLPLPASSNMYWFSPNQASPDELGCNICWARMDVLGYD 247

Query: 260 IYFPQDHSLS-EYPDLFSQDVQLVRGNKIFSQKFDSILHALQVVQFLYWKGKELDSTDSK 319
           IY+ Q+ S + +  +L  + +QL RG+ +F Q F +ILHALQVVQF +WK  E  ++  +
Sbjct: 248 IYYSQNTSSALDCSELLLEPLQLARGDNLFHQPFANILHALQVVQFFHWKASEFPTSKFE 307

Query: 320 EDSSYAEGLSPKRNVGVDFFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAEA 379
            ++S    LS  +   +   KS F K++ LAE+VF     + KE+Q    WA  +A   A
Sbjct: 308 TEASKVLHLSQSQKENISNLKSVFVKNNQLAETVFRPVIINEKEQQ-SFSWANFSAAVTA 367

Query: 380 WEDYASNLRRELLRSFAHWRTSQSIYSVP 406
           WEDY   L ++LL S  HWRT QSI+S P
Sbjct: 368 WEDYIRKLHKQLLNSLPHWRTHQSIFSCP 395

BLAST of CmaCh04G000200 vs. TAIR10
Match: AT1G68080.1 (AT1G68080.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 419.5 bits (1077), Expect = 2.4e-117
Identity = 214/385 (55.58%), Positives = 267/385 (69.35%), Query Frame = 1

Query: 24  RLILENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAQLIMPFVSIRERLK 83
           RLIL NFL+  EC+ELE IHKS  T+GYRP VFSTTL HL+ +NS  LI+PFVSIRERLK
Sbjct: 8   RLILHNFLSPAECKELELIHKSSSTIGYRPNVFSTTLSHLIATNSPHLIIPFVSIRERLK 67

Query: 84  EKAEEFFGCEYELFVEFTGLISWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEG 143
           EK EE FGCEYELF+EFTGLISW +GA IGWHSDDNR YLKQR+F AVCYLNSY  DF G
Sbjct: 68  EKIEETFGCEYELFIEFTGLISWCKGASIGWHSDDNRSYLKQRDFAAVCYLNSYEKDFIG 127

Query: 144 GLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDEDAKL 203
           GLF FQ GEP T++P  GD +MYTAD  N+HSVDEVT GERLTL LWF+RDSSHDED+KL
Sbjct: 128 GLFRFQSGEPVTVAPSAGDVIMYTADDRNIHSVDEVTDGERLTLALWFSRDSSHDEDSKL 187

Query: 204 LSLLSQSHLHDRLPDSCLPQPPSCNMYWFSP-KDDPNFKFGFDICWARLHALGYGIYFPQ 263
           LS LSQ   H    + CLP P S NMYWF P +D  N   GFD+C ARLH LG+ ++  Q
Sbjct: 188 LSRLSQCTSH----EVCLPLPASTNMYWFCPHQDGSNQNIGFDVCVARLHLLGFDVHSLQ 247

Query: 264 --DHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHALQVVQFLYWKGKELDSTDSKEDS- 323
             DHS      L    +QL +G K+ ++KF +ILHALQVVQF +WK  EL +++ + D+ 
Sbjct: 248 GEDHSTDASEQLMG-PLQLAKGGKLLTRKFANILHALQVVQFYHWKASELVTSNVENDTL 307

Query: 324 SYAEGLSPKRNVGVDFFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAEAWED 383
              + +S  +   ++  KS F  D+ L  + F Y+ S  ++++  L    +A    +WE+
Sbjct: 308 EEVKAMSHSQLETINALKSVFLLDENLVATTFGYSCSG-EDRKDSLDLTGIALAVTSWEE 367

Query: 384 YASNLRRELLRSFAHWRTSQSIYSV 405
           Y+  L +ELL S   W+T Q+I+ V
Sbjct: 368 YSCKLLKELLSSLPQWKTYQTIHKV 386

BLAST of CmaCh04G000200 vs. NCBI nr
Match: gi|449445405|ref|XP_004140463.1| (PREDICTED: prolyl 3-hydroxylase 1 [Cucumis sativus])

HSP 1 Score: 667.5 bits (1721), Expect = 1.4e-188
Identity = 324/396 (81.82%), Positives = 344/396 (86.87%), Query Frame = 1

Query: 13  MGDEAEINQRWRLILENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAQLI 72
           M D AE  QR RLILENFL+ EECRELEFIHKSC TVGYRP VFSTTLLHLV +NSA LI
Sbjct: 1   MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLI 60

Query: 73  MPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGARIGWHSDDNRPYLKQREFTAVC 132
           +PFV IRE+LKEKAEEFFGC YELFVEFTGLISWTRGA IGWHSDDNRPYLKQREF+AVC
Sbjct: 61  IPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFSAVC 120

Query: 133 YLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERLTLTLWFT 192
           YLNSYGV+F GGLFHFQDGEP+TISP  GDCVMYTAD+ NVHSVDE+T+GERLTLTLWFT
Sbjct: 121 YLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVDEITNGERLTLTLWFT 180

Query: 193 RDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLH 252
           RDSSHDEDAKLLSLLSQS LHDR PDSCLPQPPSCNMYWFSP+DDPNFKFGFDICWARL 
Sbjct: 181 RDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDICWARLR 240

Query: 253 ALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHALQVVQFLYWKGKELDS 312
           ALGY +YFP DH  SEYPDLF QDVQLV G+KIF QKF++ILH LQVVQFL WKGKELDS
Sbjct: 241 ALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDS 300

Query: 313 TDSKEDSSYAEGLSPKRNVGVDFFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAA 372
           T+  EDSSYAE LSPKRNVGV +FKSEFSK+D LAESVF  A+SD KE Q  LGW KL A
Sbjct: 301 TNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASDGKENQQWLGWDKLVA 360

Query: 373 VAEAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS 409
            A AWE YAS LRRELL SF+HWR  QSIYSV   S
Sbjct: 361 AAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS 396

BLAST of CmaCh04G000200 vs. NCBI nr
Match: gi|659113939|ref|XP_008456831.1| (PREDICTED: uncharacterized protein LOC103496668 isoform X1 [Cucumis melo])

HSP 1 Score: 665.2 bits (1715), Expect = 7.2e-188
Identity = 324/397 (81.61%), Positives = 345/397 (86.90%), Query Frame = 1

Query: 13  MGDEAEINQRWRLILENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAQLI 72
           M D AE  QR RLILENFL+ EECRELEFIHKSCCTVGYRP V STTLLHLV +NSA LI
Sbjct: 1   MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLI 60

Query: 73  MPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGARIGWHSDDNRPYLKQREFTAVC 132
           +PFV IRE+LKEKAEEFFGC YELFVEFTGLISWTRGA IGWHSDDNRPYLKQREF+AVC
Sbjct: 61  IPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFSAVC 120

Query: 133 YLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERLTLTLWFT 192
           YLNSYGV+F GGLFHFQDGEP+TISP  GDCVMYTADS NVHSVDE+T+GERLTLTLWFT
Sbjct: 121 YLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADSDNVHSVDEITNGERLTLTLWFT 180

Query: 193 RDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLH 252
           RDSSHDEDAKLLSLLSQS LHDR  +SCLPQPPSCNMYWFSP++DPNFKFGFDICWARLH
Sbjct: 181 RDSSHDEDAKLLSLLSQSPLHDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLH 240

Query: 253 ALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHALQVVQFLYWKGKELDS 312
           ALGY IYFP DH  SEYPDLFSQDVQLV G+KIF QKF++ILH LQVVQFL WKGKELD+
Sbjct: 241 ALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT 300

Query: 313 TDSKEDSSYAEGLSPKRNVGVDFFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKL-A 372
           T+  EDS YAE LSPKRNVGV +FKSEFSK+D LAESVF  A+S  KE QH LGW KL  
Sbjct: 301 TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVV 360

Query: 373 AVAEAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS 409
           A A AWEDYAS LRRELL SF+HWR  QSIYSV   S
Sbjct: 361 AAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS 397

BLAST of CmaCh04G000200 vs. NCBI nr
Match: gi|700195676|gb|KGN50853.1| (hypothetical protein Csa_5G289640 [Cucumis sativus])

HSP 1 Score: 657.5 bits (1695), Expect = 1.5e-185
Identity = 324/411 (78.83%), Positives = 344/411 (83.70%), Query Frame = 1

Query: 13  MGDEAEINQRWRLILENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAQLI 72
           M D AE  QR RLILENFL+ EECRELEFIHKSC TVGYRP VFSTTLLHLV +NSA LI
Sbjct: 1   MVDGAESRQRRRLILENFLSREECRELEFIHKSCSTVGYRPNVFSTTLLHLVATNSAHLI 60

Query: 73  MPFVSIRERLKEKAEEFFGCEYELFVEFTGLIS---------------WTRGARIGWHSD 132
           +PFV IRE+LKEKAEEFFGC YELFVEFTGLIS               WTRGA IGWHSD
Sbjct: 61  IPFVPIREKLKEKAEEFFGCHYELFVEFTGLISLHSKAHLQPSSSNLGWTRGASIGWHSD 120

Query: 133 DNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVD 192
           DNRPYLKQREF+AVCYLNSYGV+F GGLFHFQDGEP+TISP  GDCVMYTAD+ NVHSVD
Sbjct: 121 DNRPYLKQREFSAVCYLNSYGVEFGGGLFHFQDGEPETISPFYGDCVMYTADNDNVHSVD 180

Query: 193 EVTSGERLTLTLWFTRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDD 252
           E+T+GERLTLTLWFTRDSSHDEDAKLLSLLSQS LHDR PDSCLPQPPSCNMYWFSP+DD
Sbjct: 181 EITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSPLHDRFPDSCLPQPPSCNMYWFSPEDD 240

Query: 253 PNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHAL 312
           PNFKFGFDICWARL ALGY +YFP DH  SEYPDLF QDVQLV G+KIF QKF++ILH L
Sbjct: 241 PNFKFGFDICWARLRALGYDLYFPGDHDFSEYPDLFFQDVQLVWGDKIFFQKFENILHLL 300

Query: 313 QVVQFLYWKGKELDSTDSKEDSSYAEGLSPKRNVGVDFFKSEFSKDDALAESVFLYASSD 372
           QVVQFL WKGKELDST+  EDSSYAE LSPKRNVGV +FKSEFSK+D LAESVF  A+SD
Sbjct: 301 QVVQFLCWKGKELDSTNLSEDSSYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSAASD 360

Query: 373 VKEKQHRLGWAKLAAVAEAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS 409
            KE Q  LGW KL A A AWE YAS LRRELL SF+HWR  QSIYSV   S
Sbjct: 361 GKENQQWLGWDKLVAAAAAWEHYASILRRELLGSFSHWRNCQSIYSVSLDS 411

BLAST of CmaCh04G000200 vs. NCBI nr
Match: gi|659113941|ref|XP_008456833.1| (PREDICTED: uncharacterized protein LOC103496668 isoform X2 [Cucumis melo])

HSP 1 Score: 588.2 bits (1515), Expect = 1.1e-164
Identity = 297/397 (74.81%), Positives = 316/397 (79.60%), Query Frame = 1

Query: 13  MGDEAEINQRWRLILENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAQLI 72
           M D AE  QR RLILENFL+ EECRELEFIHKSCCTVGYRP V STTLLHLV +NSA LI
Sbjct: 1   MVDGAESRQRRRLILENFLSREECRELEFIHKSCCTVGYRPNVLSTTLLHLVATNSAHLI 60

Query: 73  MPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGARIGWHSDDNRPYLKQREFTAVC 132
           +PFV IRE+LKEKAEEFFGC YELFVEFTGLISWTRGA IGWHSDDNRPYLKQREF+   
Sbjct: 61  IPFVPIREKLKEKAEEFFGCHYELFVEFTGLISWTRGASIGWHSDDNRPYLKQREFS--- 120

Query: 133 YLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERLTLTLWFT 192
                                        DCVMYTADS NVHSVDE+T+GERLTLTLWFT
Sbjct: 121 -----------------------------DCVMYTADSDNVHSVDEITNGERLTLTLWFT 180

Query: 193 RDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLH 252
           RDSSHDEDAKLLSLLSQS LHDR  +SCLPQPPSCNMYWFSP++DPNFKFGFDICWARLH
Sbjct: 181 RDSSHDEDAKLLSLLSQSPLHDRFSNSCLPQPPSCNMYWFSPEEDPNFKFGFDICWARLH 240

Query: 253 ALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHALQVVQFLYWKGKELDS 312
           ALGY IYFP DH  SEYPDLFSQDVQLV G+KIF QKF++ILH LQVVQFL WKGKELD+
Sbjct: 241 ALGYDIYFPGDHDFSEYPDLFSQDVQLVWGDKIFFQKFENILHLLQVVQFLCWKGKELDT 300

Query: 313 TDSKEDSSYAEGLSPKRNVGVDFFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKL-A 372
           T+  EDS YAE LSPKRNVGV +FKSEFSK+D LAESVF  A+S  KE QH LGW KL  
Sbjct: 301 TNLNEDSCYAEYLSPKRNVGVSYFKSEFSKNDGLAESVFSSATSGGKENQHWLGWDKLVV 360

Query: 373 AVAEAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS 409
           A A AWEDYAS LRRELL SF+HWR  QSIYSV   S
Sbjct: 361 AAAAAWEDYASILRRELLGSFSHWRNCQSIYSVSLDS 365

BLAST of CmaCh04G000200 vs. NCBI nr
Match: gi|645228334|ref|XP_008220946.1| (PREDICTED: uncharacterized protein LOC103320983 [Prunus mume])

HSP 1 Score: 481.5 bits (1238), Expect = 1.5e-132
Identity = 238/397 (59.95%), Positives = 286/397 (72.04%), Query Frame = 1

Query: 11  MKMGDEAEINQRWR-LILENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSA 70
           MKMGD  E  +  R LIL NFL+ +EC+ELEFIHKS CTVGYRP+VFSTTL HL+ +NSA
Sbjct: 1   MKMGDPEEAEEHGRRLILHNFLSFQECKELEFIHKSNCTVGYRPHVFSTTLSHLIATNSA 60

Query: 71  QLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISWTRGARIGWHSDDNRPYLKQREFT 130
            LIMPFV IRERLKEK EEFFGC+YELFVEFTGLISW+RG+ IGWHSDDNRPYLKQR+F 
Sbjct: 61  HLIMPFVPIRERLKEKVEEFFGCQYELFVEFTGLISWSRGSSIGWHSDDNRPYLKQRDFA 120

Query: 131 AVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERLTLTL 190
           AVCYLNSYG DF+GGLFHFQDG+P TI P  GD V+YTADS N+HSVDE+T GERLTL L
Sbjct: 121 AVCYLNSYGNDFKGGLFHFQDGDPATIVPSGGDVVIYTADSRNIHSVDEITDGERLTLAL 180

Query: 191 WFTRDSSHDEDAKLLSLLSQSHLHDRLPDSCLPQPPSCNMYWFSP-KDDPNFKFGFDICW 250
           WF+RD+++DEDAKL++LLSQ+ LHD  P+ CLP P S NMYWFSP +   + + GFDICW
Sbjct: 181 WFSRDATYDEDAKLITLLSQNFLHDNAPELCLPLPASSNMYWFSPDQASSDQQLGFDICW 240

Query: 251 ARLHALGYGIYFPQDHS-LSEYPDLFSQDVQLVRGNKIFSQKFDSILHALQVVQFLYWKG 310
           ARLH LGY + F QD S  S   +L  + +QL RG+++F  +F +ILHALQVVQF  WK 
Sbjct: 241 ARLHVLGYDLLFHQDKSYCSNISELLMEPLQLTRGDELFEHEFINILHALQVVQFYCWKS 300

Query: 311 KELDSTDSKEDSSYAEGLSPKRNVGVDFFKSEFSKDDALAESVFLYASSDVKEKQHRLGW 370
            +  S   +E ++    LS  +       KS F+KD  L +SVF   +  V+  QH   W
Sbjct: 301 PDFKSAKVEETTTVVV-LSQSQRERFVCLKSLFAKDVCLVDSVFSNVTF-VESAQHSFNW 360

Query: 371 AKLAAVAEAWEDYASNLRRELLRSFAHWRTSQSIYSV 405
                    WEDY   L REL+ S  HWR  QSI++V
Sbjct: 361 VDFTIAIATWEDYVRKLHRELVMSLPHWRIQQSIFNV 395
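
The identity and positives percentages in the HSP summary above follow directly from the match counts over the aligned length. A minimal sketch of that arithmetic, assuming the counts 238, 286 and 397 are read from the summary line (the helper name `percent` is hypothetical and not part of any BLAST tool):

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the aligned length, rounded to 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Counts copied from the Prunus mume HSP summary line (238/397 and 286/397).
identity = percent(238, 397)
positives = percent(286, 397)

print(f"Identity = {identity}%, Positives = {positives}%")
```

Both values agree with the percentages printed in the summary line, which is a quick consistency check when copying hits into another table.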

The following BLAST results are available for this feature:
Match Name        E-value    Identity (%)  Description
P3H1_HUMAN        5.8e-12    24.32         Prolyl 3-hydroxylase 1 OS=Homo sapiens GN=P3H1 PE=1 SV=2
P3H1_MOUSE        5.8e-12    23.87         Prolyl 3-hydroxylase 1 OS=Mus musculus GN=P3h1 PE=1 SV=2
P3H1_RAT          7.5e-12    23.87         Prolyl 3-hydroxylase 1 OS=Rattus norvegicus GN=P3h1 PE=1 SV=1
P3H2_RAT          1.3e-11    27.73         Prolyl 3-hydroxylase 2 OS=Rattus norvegicus GN=P3h2 PE=1 SV=1
P3H2_CHICK        1.7e-11    26.23         Prolyl 3-hydroxylase 2 OS=Gallus gallus GN=P3H2 PE=2 SV=1

Match Name        E-value    Identity (%)  Description
A0A0A0KMN7_CUCSA  1.0e-185   78.83         Uncharacterized protein OS=Cucumis sativus GN=Csa_5G289640 PE=4 SV=1
V4UGA2_9ROSI      7.3e-131   59.64         Uncharacterized protein OS=Citrus clementina GN=CICLE_v10015473mg PE=4 SV=1
F6HFA8_VITVI      1.6e-130   58.27         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g06280 PE=4 SV=...
M5Y3D7_PRUPE      6.2e-130   59.39         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa017932mg PE=4 SV=1
A0A067F0I4_CITSI  4.0e-129   59.13         Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g015648mg PE=4 SV=1

Match Name        E-value    Identity (%)  Description
AT1G68080.1       2.4e-117   55.58         2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot...

Match Name                        E-value    Identity (%)  Description
gi|449445405|ref|XP_004140463.1|  1.4e-188   81.82         PREDICTED: prolyl 3-hydroxylase 1 [Cucumis sativus]
gi|659113939|ref|XP_008456831.1|  7.2e-188   81.61         PREDICTED: uncharacterized protein LOC103496668 isoform X1 [Cucumis melo]
gi|700195676|gb|KGN50853.1|       1.5e-185   78.83         hypothetical protein Csa_5G289640 [Cucumis sativus]
gi|659113941|ref|XP_008456833.1|  1.1e-164   74.81         PREDICTED: uncharacterized protein LOC103496668 isoform X2 [Cucumis melo]
gi|645228334|ref|XP_008220946.1|  1.5e-132   59.95         PREDICTED: uncharacterized protein LOC103320983 [Prunus mume]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR005123   Oxoglu/Fe-dep_dioxygenase
IPR006620   Pro_4_hyd_alph
Vocabulary: Molecular Function
Term        Definition
GO:0016491  oxidoreductase activity
GO:0016705  oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:0005506  iron ion binding
GO:0031418  L-ascorbic acid binding
Vocabulary: Biological Process
Term        Definition
GO:0055114  oxidation-reduction process
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0055114      oxidation-reduction process
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0005506      iron ion binding
molecular_function  GO:0031418      L-ascorbic acid binding
molecular_function  GO:0016706      oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors
molecular_function  GO:0016491      oxidoreductase activity
molecular_function  GO:0016705      oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh04G000200.1  CmaCh04G000200.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description                          Source   Source Term  Source Description  Alignment
IPR005123  Oxoglutarate/iron-dependent dioxygenase  PFAM     PF13640      2OG-FeII_Oxy_3      coord: 107..191, score: 5.9
IPR005123  Oxoglutarate/iron-dependent dioxygenase  PROFILE  PS51471      FE2OG_OXY           coord: 85..193, score: 9
IPR006620  Prolyl 4-hydroxylase, alpha subunit      SMART    SM00702      p4hc                coord: 22..192, score: 0.
None       No IPR available                         PANTHER  PTHR14049    LEPRECAN 1          coord: 15..285, score: 5.8E
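
The `coord` ranges in the InterPro annotations give the protein region covered by each signature. A minimal sketch for turning those ranges into span lengths, assuming the coordinates are 1-based and inclusive (an assumption about the coordinate convention, not stated on this page); the helper name `span_length` is hypothetical:

```python
import re

# Coordinate strings copied from the InterPro annotations above.
# Assumption: positions are 1-based and inclusive on the protein sequence.
hits = {
    "PF13640": "coord: 107..191",
    "PS51471": "coord: 85..193",
    "SM00702": "coord: 22..192",
    "PTHR14049": "coord: 15..285",
}


def span_length(coord: str) -> int:
    """Return the number of residues covered by a 'start..end' coordinate string."""
    match = re.search(r"(\d+)\.\.(\d+)", coord)
    if match is None:
        raise ValueError(f"no start..end range found in {coord!r}")
    start, end = int(match.group(1)), int(match.group(2))
    if end < start:
        raise ValueError(f"end precedes start in {coord!r}")
    return end - start + 1


lengths = {source_term: span_length(coord) for source_term, coord in hits.items()}

for source_term, length in lengths.items():
    print(f"{source_term}: {length} residues")
```

Under that assumption the PANTHER family hit spans the largest region, and the Pfam, PROSITE profile and SMART signatures all end within a few residues of each other (191 to 193).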

The following gene(s) are orthologous to this gene:
Gene            Orthologue       Organism                   Block
CmaCh04G000200  Cla016884        Watermelon (97103) v1      cmawmB734
CmaCh04G000200  CSPI05G12520     Wild cucumber (PI 183967)  cmacpiB751
CmaCh04G000200  CmoCh04G000210   Cucurbita moschata (Rifu)  cmacmoB728
CmaCh04G000200  Cp4.1LG01g06460  Cucurbita pepo (Zucchini)  cmacpeB720
CmaCh04G000200  Bhi07G000118     Wax gourd                  cmawgoB0899
CmaCh04G000200  Carg21976        Silver-seed gourd          carcmaB0242
The following gene(s) are paralogous to this gene:

None