CmaCh07G004010 (gene) Cucurbita maxima (Rimu)

Name: CmaCh07G004010
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Iron ion binding / oxidoreductase / oxidoreductase protein
Location: Cma_Chr07 : 1733076 .. 1736119 (+)
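
The location string can be read programmatically. The following is a minimal sketch, not part of the database record: the `Location` type and `parse_location` function are hypothetical names, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed feature location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates.
        return self.end - self.start + 1


# Matches strings such as "Cma_Chr07 : 1733076 .. 1736119 (+)".
_LOCATION_RE = re.compile(
    r"^\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$"
)


def parse_location(text: str) -> Location:
    """Split a 'seqid : start .. end (strand)' string into its parts."""
    match = _LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("Cma_Chr07 : 1733076 .. 1736119 (+)")
print(loc.seqid, loc.start, loc.end, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 3044 bp on the plus strand of Cma_Chr07.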
The following sequences are available for this feature:

Gene sequence (with intron)

AAATTAATTAAGGAAATAATAAAGAAAACTGCTCAGAAGGATCGAAAGCGGAGAGGAAGAAGCGAAATCAATCTCGGCGTCGGAGTTCATCAAGATTTCCCATTACCGACTCGGTGATTTGGACTTTCGAGACGAAGACATTAACCGATACTCTCCTCTGAAATTCCAGATCTTGCTCTCCTCCTTCCATGGCGAAGCACAGGCAACCTCGATTTTCTTCTCGGAAGTCGTCTTCTTCTTCTACTCTCATATTTACCTTGCTCATTATGTTCACCTTCGTCATTCTGATTCTTCTTGCCCTTGGAATTCTCTCGATCCCTGGAAATTCTGGCGCGCCCAAGGCTCATGATCTGAGCTCGATCGTGCGGAAAACTTCCGATGAGTATGTTATCCTCTGTGTTCGATTTCAGTTTTTTTGTTCTGTATGGGGTTGATTTTAAATTGATTGTTTGATTTTGTTTTAGAGTTGACGAGGAGAAAGGAGAGCAGTGGGCTGAAGTGATCTCATGGGAACCTAGAGCCTTCGTTTATCACAATTTTCTGGTATTCATTTTCTCCCCTGATCTATGACGAGGGATTTTTGTTTTATGTTCGTTCGTCGCTCTCTATGCAATTTGTGTATGGAATTATGTGAACTTGGTTGTTGTATGCTGATGATACGTGTTGGATGCGTCTTTGGGGCTGTATATGCCAATGAAAACCAAGGGATTATGTTTACGTTCCTCTTAGAAATCCTTCTTCTCTTATGTTATGCTGAGAGGGTTTGAGGATGATTCGCATCTGTATTTGTCTTTAGATTGTTTTAAGGAATTTCTGAATTGGTTTACTTTTTAATTGCTCAGACAAAGGAGGAATGTGAGTACCTAATCAGCCTTGCCAAGCCTCACATGCAAAAATCTTCCGTTGTTGACAGTGAAACTGGAAAGAGCAAAGATAGCAGGTCTGAAATTCTCTTCCCCAGAATTCCCTTTTGGCAGTAATGCTTGATCTCTAATGTAGAAGCCTCCACGACCTCTCTTTTACAGAGTTCGCACTAGCTCTGGAACATTTTTGCCCAGAGGACGTGATAAGATCATTAGAACCATCGAGAAAAGGATTGCTGATTTCAGCTTCGTACCCGTAGGTATATTTTTTTTAGCCTATTTTCCTCTTCCTGGTATCTTCCCACTACATCCAGCATTGTTCTTGTTTTATTTTATTATGCATGGCTAAACTAAACTTTACAATCTTTGATTTATTTCCTACTGCACCTATGGTTGTGATCGAACTTTTTGCCTATAATCCAGCATTCCCAATCACACAATTGTGCACAAGACTGTTCAAATAGACACATCTACCGTGCTAAGATTCACATGGATGAGGAAGATTTTTAACCATGGTTGAAAAGTTGAATAATGAATATTCACTCGAACGTAGGAACACCAACATGAACCAGATATTTGACCTATGAGCTTAACCACCCGTTGCCTCTTTTTAAGAATGAAAGAGAATTGGGTTCACCTGATAGCAGATGTGCCAATTTTTGTTTATTATATGTAGATATTGTCTTGTTATCTCCTCTTTTCATATTTGTTACTGAAGTTCACAATATATGTAGAGCATGGAGAAGGACTTCAGGTTCTTCATTATGAAGTGGGACAGAAATATGAACCTCATTTTGATTACTTCCTTGATGAATACAATACAAAGAATGGAGGTCAACGTATAGCAACAATGCTTATGTATCTGTAAGTAGTAATTTATATTTATATCCACTATGCTTGGGACTCTAACCAAGTGGTGTTATACCATGTGTGGTTCTCCCATTATTTTAACAACTTCTATCATAATTCAGCTCGGATGTTGAAGAAGGAGGCGAGACAGTGTTCCCTGCTGCCAAAGGAAACTTTAGTTCTGTACCTTGGTGGGATGAGCTTTCAGATTGTGGGAAGAAAGGACTTTCTGTTAAACCAAAGAGGGGCGATGCGTTACTTTTCTGGAGCATGAAGCCTGATGCCTCT
CTAGATCCATCAAGTCTGCATGGTTAGATTTGGATTCTCTTTAACGTATAATCTACCATTCATTTTACAAAGCATTATAGTTTTCAGTGATGGGCATTTATTTTTGTTATTGGGATTTTTCACATCTCTCGTGTCGAAAGTAATGGGTTTTCTTGATAAAATGATGACGTTTCTTCTGATGATCGTTTATAGGTGGTTGCCCTGTTATCAAGGGGAACAAATGGTCTGCTACTAAATGGATGCGAGTAGAAGAATACAAAGCTTGAGTGGATGCGACTAAGGTACTTTCCATGCCCTAAACCATGCACTAGAATCGTAATCAAACGAACACCTAAAACAACACTACAGCTGATTTTTCATGATTTTCAAAAAACCATAAACCAAATGGGGAATTTCTTTTGGCCTCTCAACATAGAATATACAGCACATTTTTAACTGAGATCCGCTTGAATTGCTGAATCACCTTCTGAATCACCTTCACATTAAGCACGCTATAATTTGATTGAGTCTGGTTTTGAACTCCCCCACCCAAATCCCGCATATTGAGCGCAGCTGATATGCGGAGCTAGGCGATCATATCATGCCATAAAACGAGAAAAAAGAACAATGGAATTCTTTCCTTCTGCGTTGTTTTTTCTTGTTATTGCTGAAAGAACTATTCCTAAAGATGTGATGCTTCATTGTTTGGCAGGTTTGGTTGAATTTTCTTGTTATTGCTGAAAATCAGAGATGCATTTGCTGTAGGGAAAACCACCATTTCTATAATTTAACCTATGAATTTTGATGTGCTTAATGTTTTAATTAGGAGCTTACATTGTGATTGATTTCTTGTTTATGGCCACGGAGGGAATAAGAGTTCATATTTTTGCTAGAAATGCCACTTCCATATGTAAAAAGAACAAAGATTTTAAGCTTGATGTACAGCTCCCTTGCTGCAACATTCTCGTTTAATTTCTATTTGATATTACTTTACTTTATTTTCTGTGTTCATTTCTTTATTTGGAAGATATCGACCATTAAACCTCATTATTATCATTATTATTT

mRNA sequence

AAATTAATTAAGGAAATAATAAAGAAAACTGCTCAGAAGGATCGAAAGCGGAGAGGAAGAAGCGAAATCAATCTCGGCGTCGGAGTTCATCAAGATTTCCCATTACCGACTCGGTGATTTGGACTTTCGAGACGAAGACATTAACCGATACTCTCCTCTGAAATTCCAGATCTTGCTCTCCTCCTTCCATGGCGAAGCACAGGCAACCTCGATTTTCTTCTCGGAAGTCGTCTTCTTCTTCTACTCTCATATTTACCTTGCTCATTATGTTCACCTTCGTCATTCTGATTCTTCTTGCCCTTGGAATTCTCTCGATCCCTGGAAATTCTGGCGCGCCCAAGGCTCATGATCTGAGCTCGATCGTGCGGAAAACTTCCGATGAAGTTGACGAGGAGAAAGGAGAGCAGTGGGCTGAAGTGATCTCATGGGAACCTAGAGCCTTCGTTTATCACAATTTTCTGACAAAGGAGGAATGTGAGTACCTAATCAGCCTTGCCAAGCCTCACATGCAAAAATCTTCCGTTGTTGACAGTGAAACTGGAAAGAGCAAAGATAGCAGAGTTCGCACTAGCTCTGGAACATTTTTGCCCAGAGGACGTGATAAGATCATTAGAACCATCGAGAAAAGGATTGCTGATTTCAGCTTCGTACCCGTAGAGCATGGAGAAGGACTTCAGGTTCTTCATTATGAAGTGGGACAGAAATATGAACCTCATTTTGATTACTTCCTTGATGAATACAATACAAAGAATGGAGGTCAACGTATAGCAACAATGCTTATGTATCTCTCGGATGTTGAAGAAGGAGGCGAGACAGTGTTCCCTGCTGCCAAAGGAAACTTTAGTTCTGTACCTTGGTGGGATGAGCTTTCAGATTGTGGGAAGAAAGGACTTTCTGTTAAACCAAAGAGGGGCGATGCGTTACTTTTCTGGAGCATGAAGCCTGATGCCTCTCTAGATCCATCAAGTCTGCATGGTGGTTGCCCTGTTATCAAGGGGAACAAATGGTCTGCTACTAAATGGATGCGAGTAGAAGAATACAAAGCTTGAGTGGATGCGACTAAGGTTTGGTTGAATTTTCTTGTTATTGCTGAAAATCAGAGATGCATTTGCTGTAGGGAAAACCACCATTTCTATAATTTAACCTATGAATTTTGATGTGCTTAATGTTTTAATTAGGAGCTTACATTGTGATTGATTTCTTGTTTATGGCCACGGAGGGAATAAGAGTTCATATTTTTGCTAGAAATGCCACTTCCATATGTAAAAAGAACAAAGATTTTAAGCTTGATGTACAGCTCCCTTGCTGCAACATTCTCGTTTAATTTCTATTTGATATTACTTTACTTTATTTTCTGTGTTCATTTCTTTATTTGGAAGATATCGACCATTAAACCTCATTATTATCATTATTATTT

Coding sequence (CDS)

ATGGCGAAGCACAGGCAACCTCGATTTTCTTCTCGGAAGTCGTCTTCTTCTTCTACTCTCATATTTACCTTGCTCATTATGTTCACCTTCGTCATTCTGATTCTTCTTGCCCTTGGAATTCTCTCGATCCCTGGAAATTCTGGCGCGCCCAAGGCTCATGATCTGAGCTCGATCGTGCGGAAAACTTCCGATGAAGTTGACGAGGAGAAAGGAGAGCAGTGGGCTGAAGTGATCTCATGGGAACCTAGAGCCTTCGTTTATCACAATTTTCTGACAAAGGAGGAATGTGAGTACCTAATCAGCCTTGCCAAGCCTCACATGCAAAAATCTTCCGTTGTTGACAGTGAAACTGGAAAGAGCAAAGATAGCAGAGTTCGCACTAGCTCTGGAACATTTTTGCCCAGAGGACGTGATAAGATCATTAGAACCATCGAGAAAAGGATTGCTGATTTCAGCTTCGTACCCGTAGAGCATGGAGAAGGACTTCAGGTTCTTCATTATGAAGTGGGACAGAAATATGAACCTCATTTTGATTACTTCCTTGATGAATACAATACAAAGAATGGAGGTCAACGTATAGCAACAATGCTTATGTATCTCTCGGATGTTGAAGAAGGAGGCGAGACAGTGTTCCCTGCTGCCAAAGGAAACTTTAGTTCTGTACCTTGGTGGGATGAGCTTTCAGATTGTGGGAAGAAAGGACTTTCTGTTAAACCAAAGAGGGGCGATGCGTTACTTTTCTGGAGCATGAAGCCTGATGCCTCTCTAGATCCATCAAGTCTGCATGGTGGTTGCCCTGTTATCAAGGGGAACAAATGGTCTGCTACTAAATGGATGCGAGTAGAAGAATACAAAGCTTGA

Protein sequence

MAKHRQPRFSSRKSSSSSTLIFTLLIMFTFVILILLALGILSIPGNSGAPKAHDLSSIVRKTSDEVDEEKGEQWAEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSSVVDSETGKSKDSRVRTSSGTFLPRGRDKIIRTIEKRIADFSFVPVEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIATMLMYLSDVEEGGETVFPAAKGNFSSVPWWDELSDCGKKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA
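
The relationship between the coding sequence and the protein sequence can be checked by translating the CDS codon by codon. The sketch below assumes the standard nuclear genetic code; the `translate` function and the table construction are illustrative and are not taken from the database's own pipeline.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 nucleotides of the CDS listed above.
print(translate("ATGGCGAAGCACAGGCAACCTCGATTT"))  # MAKHRQPRF
```

Passing the full CDS string to `translate` should reproduce the protein sequence shown above if the record is internally consistent.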
BLAST of CmaCh07G004010 vs. Swiss-Prot
Match: P4H10_ARATH (Probable prolyl 4-hydroxylase 10 OS=Arabidopsis thaliana GN=P4H10 PE=2 SV=1)

HSP 1 Score: 458.0 bits (1177), Expect = 7.6e-128
Identity = 224/288 (77.78%), Positives = 255/288 (88.54%), Query Frame = 1

Query: 1   MAKHRQPRFSSRKSSSSSTLIFTLLIMFTFVILILLALGILSIPGNS-GAPKAHDLSSIV 60
           MA+ R  R S+RKSS S TL+F +LIM TFVILILLA GILS+P N+ G+ KA+DL+SIV
Sbjct: 2   MARPRNHRPSARKSSHS-TLVFAVLIMSTFVILILLAFGILSVPSNNAGSSKANDLTSIV 61

Query: 61  RKTSDEV--DEEKGEQWAEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSSVVDSET 120
           RKT      D+ K E+W E+ISWEPRA VYHNFLTKEEC+YLI LAKPHM+KS+VVD +T
Sbjct: 62  RKTLQRSGEDDSKNERWVEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKT 121

Query: 121 GKSKDSRVRTSSGTFLPRGRDKIIRTIEKRIADFSFVPVEHGEGLQVLHYEVGQKYEPHF 180
           GKS DSRVRTSSGTFL RGRDK IR IEKRI+DF+F+PVEHGEGLQVLHYE+GQKYEPH+
Sbjct: 122 GKSTDSRVRTSSGTFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHY 181

Query: 181 DYFLDEYNTKNGGQRIATMLMYLSDVEEGGETVFPAAKGNFSSVPWWDELSDCGKKGLSV 240
           DYF+DEYNT+NGGQRIAT+LMYLSDVEEGGETVFPAAKGN+S+VPWW+ELS+CGK GLSV
Sbjct: 182 DYFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSV 241

Query: 241 KPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYK 286
           KPK GDALLFWSM PDA+LDPSSLHGGC VIKGNKWS+TKW+RV EYK
Sbjct: 242 KPKMGDALLFWSMTPDATLDPSSLHGGCAVIKGNKWSSTKWLRVHEYK 288
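
Each HSP header in these reports carries the bit score, raw score, E-value, and the identity and positive counts over the alignment length. A minimal sketch for extracting them follows; `parse_hsp_header` is a hypothetical helper, and the regular expressions are written only against the header layout seen on this page (including the "Postives" spelling that some exports use).

```python
import re

# "HSP 1 Score: 458.0 bits (1177), Expect = 7.6e-128"
_SCORE_RE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)
# "Identity = 224/288 (77.78%), Positives = 255/288 (88.54%), ..."
_IDENTITY_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def parse_hsp_header(score_line: str, identity_line: str) -> dict:
    """Extract scores and recompute percentages from the two header lines."""
    score = _SCORE_RE.search(score_line)
    ident = _IDENTITY_RE.search(identity_line)
    if score is None or ident is None:
        raise ValueError("unrecognized HSP header")
    identities, length, positives, _ = (int(g) for g in ident.groups())
    return {
        "bit_score": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(score.group(3)),
        "identities": identities,
        "positives": positives,
        "alignment_length": length,
        # Percentages are recomputed from the counts, not copied.
        "pct_identity": 100.0 * identities / length,
        "pct_positive": 100.0 * positives / length,
    }


hsp = parse_hsp_header(
    "HSP 1 Score: 458.0 bits (1177), Expect = 7.6e-128",
    "Identity = 224/288 (77.78%), Positives = 255/288 (88.54%), Query Frame = 1",
)
print(round(hsp["pct_identity"], 2), round(hsp["pct_positive"], 2))  # 77.78 88.54
```

Recomputing the percentages from the raw counts is a cheap consistency check on the reported values.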

BLAST of CmaCh07G004010 vs. Swiss-Prot
Match: P4H3_ARATH (Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana GN=P4H3 PE=2 SV=1)

HSP 1 Score: 433.3 bits (1113), Expect = 2.0e-120
Identity = 209/287 (72.82%), Positives = 246/287 (85.71%), Query Frame = 1

Query: 1   MAKHRQPRFSSRKSSSSSTLIFTLLIMFTFVILILLALGILSIPGNSGAPKAHDLSSIVR 60
           MAK R  RF +RK S+   ++F +L M T V+L+LLA G+ S+P N+      DLS   R
Sbjct: 1   MAKLRHSRFQARKWSTLMLVLF-MLFMLTIVLLMLLAFGVFSLPINNDESSPIDLSYFRR 60

Query: 61  KTSDEVDE--EKGEQWAEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSSVVDSETG 120
             ++  +   ++G+QW EV+SWEPRAFVYHNFL+KEECEYLISLAKPHM KS+VVDSETG
Sbjct: 61  AATERSEGLGKRGDQWTEVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETG 120

Query: 121 KSKDSRVRTSSGTFLPRGRDKIIRTIEKRIADFSFVPVEHGEGLQVLHYEVGQKYEPHFD 180
           KSKDSRVRTSSGTFL RGRDKII+TIEKRIAD++F+P +HGEGLQVLHYE GQKYEPH+D
Sbjct: 121 KSKDSRVRTSSGTFLRRGRDKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYD 180

Query: 181 YFLDEYNTKNGGQRIATMLMYLSDVEEGGETVFPAAKGNFSSVPWWDELSDCGKKGLSVK 240
           YF+DE+NTKNGGQR+ATMLMYLSDVEEGGETVFPAA  NFSSVPW++ELS+CGKKGLSVK
Sbjct: 181 YFVDEFNTKNGGQRMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVK 240

Query: 241 PKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYK 286
           P+ GDALLFWSM+PDA+LDP+SLHGGCPVI+GNKWS+TKWM V EYK
Sbjct: 241 PRMGDALLFWSMRPDATLDPTSLHGGCPVIRGNKWSSTKWMHVGEYK 286

BLAST of CmaCh07G004010 vs. Swiss-Prot
Match: P4H5_ARATH (Prolyl 4-hydroxylase 5 OS=Arabidopsis thaliana GN=P4H5 PE=2 SV=1)

HSP 1 Score: 422.2 bits (1084), Expect = 4.6e-117
Identity = 204/281 (72.60%), Positives = 240/281 (85.41%), Query Frame = 1

Query: 8   RFSSRKSSSSSTLIFTLLIMFTFVILILLALGILSIPG-NSGAPKAHDLSSIVRK--TSD 67
           R+  RKS S ST  FT+LI+   VILILL LGILS+P  N  + K +DL++IVRK  TS 
Sbjct: 10  RYQPRKSVSRSTQAFTVLILLLVVILILLGLGILSLPNANRNSSKTNDLTNIVRKSETSS 69

Query: 68  EVDEEKGEQWAEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSSVVDSETGKSKDSR 127
             +E  GE+W EVISWEPRA VYHNFLT EECE+LISLAKP M KS+VVD +TG SKDSR
Sbjct: 70  GDEEGNGERWVEVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSR 129

Query: 128 VRTSSGTFLPRGRDKIIRTIEKRIADFSFVPVEHGEGLQVLHYEVGQKYEPHFDYFLDEY 187
           VRTSSGTFL RG D+++  IEKRI+DF+F+PVE+GEGLQVLHY+VGQKYEPH+DYFLDE+
Sbjct: 130 VRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEF 189

Query: 188 NTKNGGQRIATMLMYLSDVEEGGETVFPAAKGNFSSVPWWDELSDCGKKGLSVKPKRGDA 247
           NTKNGGQRIAT+LMYLSDV++GGETVFPAA+GN S+VPWW+ELS CGK+GLSV PK+ DA
Sbjct: 190 NTKNGGQRIATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRDA 249

Query: 248 LLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYK 286
           LLFW+M+PDASLDPSSLHGGCPV+KGNKWS+TKW  V E+K
Sbjct: 250 LLFWNMRPDASLDPSSLHGGCPVVKGNKWSSTKWFHVHEFK 290

BLAST of CmaCh07G004010 vs. Swiss-Prot
Match: P4H8_ARATH (Probable prolyl 4-hydroxylase 8 OS=Arabidopsis thaliana GN=P4H8 PE=3 SV=1)

HSP 1 Score: 400.2 bits (1027), Expect = 1.9e-110
Identity = 198/288 (68.75%), Positives = 232/288 (80.56%), Query Frame = 1

Query: 3   KHRQPRFSSRKSSSSSTLIFTLLIMFTFVILILLALGILSIPG-NSGAPKAHDLSSIV-- 62
           K +Q R   RKS S+ T  FT++++  FVILIL+ LGI S+P  N  +    DL++IV  
Sbjct: 4   KPKQLRNKPRKSFSTQT--FTVVVLVLFVILILVGLGIFSLPSTNKTSSMPMDLTTIVQT 63

Query: 63  ---RKTSDEVDEEKGEQWAEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSSVVDSE 122
              R++  + ++  G++W EVISWEPRAFVYHNFLT EECE+LISLAKP M KS VVD +
Sbjct: 64  IQERESFGDEEDGNGDRWLEVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVK 123

Query: 123 TGKSKDSRVRTSSGTFLPRGRDKIIRTIEKRIADFSFVPVEHGEGLQVLHYEVGQKYEPH 182
           TGKS DSRVRTSSGTFL RG D+I+  IE RI+DF+F+P E+GEGLQVLHYEVGQ+YEPH
Sbjct: 124 TGKSIDSRVRTSSGTFLNRGHDEIVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPH 183

Query: 183 FDYFLDEYNTKNGGQRIATMLMYLSDVEEGGETVFPAAKGNFSSVPWWDELSDCGKKGLS 242
            DYF DE+N + GGQRIAT+LMYLSDV+EGGETVFPAAKGN S VPWWDELS CGK+GLS
Sbjct: 184 HDYFFDEFNVRKGGQRIATVLMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCGKEGLS 243

Query: 243 VKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEY 285
           V PK+ DALLFWSMKPDASLDPSSLHGGCPVIKGNKWS+TKW  V EY
Sbjct: 244 VLPKKRDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWFHVHEY 289

BLAST of CmaCh07G004010 vs. Swiss-Prot
Match: P4H4_ARATH (Probable prolyl 4-hydroxylase 4 OS=Arabidopsis thaliana GN=P4H4 PE=2 SV=1)

HSP 1 Score: 250.0 bits (637), Expect = 3.2e-65
Identity = 121/231 (52.38%), Positives = 174/231 (75.32%), Query Frame = 1

Query: 56  SSIVRKTSDEVDEEKGEQWAEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSSVVDS 115
           +S++  +S  V+  K +Q    +S +PRAFVY  FLT+ EC++++SLAK  +++S+V D+
Sbjct: 22  TSLISSSSVFVNPSKVKQ----VSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADN 81

Query: 116 ETGKSKDSRVRTSSGTFLPRGRDKIIRTIEKRIADFSFVPVEHGEGLQVLHYEVGQKYEP 175
           ++G+SK S VRTSSGTF+ +G+D I+  IE +I+ ++F+P E+GE +QVL YE GQKY+ 
Sbjct: 82  DSGESKFSEVRTSSGTFISKGKDPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDA 141

Query: 176 HFDYFLDEYNTKNGGQRIATMLMYLSDVEEGGETVFPAAKGNFSSV--PWWDELSDCGKK 235
           HFDYF D+ N   GG R+AT+LMYLS+V +GGETVFP A+     V     ++LSDC K+
Sbjct: 142 HFDYFHDKVNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENKEDLSDCAKR 201

Query: 236 GLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEY 285
           G++VKP++GDALLF+++ PDA  DP SLHGGCPVI+G KWSATKW+ V+ +
Sbjct: 202 GIAVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIHVDSF 248

BLAST of CmaCh07G004010 vs. TrEMBL
Match: A0A0A0KJH9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G507320 PE=4 SV=1)

HSP 1 Score: 550.4 bits (1417), Expect = 1.3e-153
Identity = 268/287 (93.38%), Positives = 280/287 (97.56%), Query Frame = 1

Query: 1   MAKHRQPRFSSRKSSSSSTLIFTLLIMFTFVILILLALGILSIPGNSG-APKAHDLSSIV 60
           MAKHRQ RF +RKSSSSSTL+FTLLIMFTFVILILLALGILSIPGNSG + K HDLSSIV
Sbjct: 1   MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV 60

Query: 61  RKTSDEVDEEKGEQWAEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSSVVDSETGK 120
           RKTSD+VDEEKGEQW EVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKS+VVDSETG+
Sbjct: 61  RKTSDDVDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQ 120

Query: 121 SKDSRVRTSSGTFLPRGRDKIIRTIEKRIADFSFVPVEHGEGLQVLHYEVGQKYEPHFDY 180
           SKDSRVRTSSGTFLPRGRDK +RTIEKR++DFSF+PVEHGEGLQVLHYEVGQKYEPHFDY
Sbjct: 121 SKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDY 180

Query: 181 FLDEYNTKNGGQRIATMLMYLSDVEEGGETVFPAAKGNFSSVPWWDELSDCGKKGLSVKP 240
           FLDEYNTKNGGQRIAT+LMYLSDVEEGGETVFPAAKGNFSSVPWW+ELSDCGKKGLSVKP
Sbjct: 181 FLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKP 240

Query: 241 KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA 287
           KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA
Sbjct: 241 KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA 287

BLAST of CmaCh07G004010 vs. TrEMBL
Match: W9SCS8_9ROSA (Prolyl 4-hydroxylase subunit alpha-1 OS=Morus notabilis GN=L484_026616 PE=4 SV=1)

HSP 1 Score: 494.2 bits (1271), Expect = 1.1e-136
Identity = 239/291 (82.13%), Positives = 265/291 (91.07%), Query Frame = 1

Query: 1   MAKHRQPRFSSRKSSSSSTLIFTLLIMFTFVILILLALGILSIPGNSGAPKAH---DLSS 60
           MAK R  R   RKSSS STL FT+L+MF+FV+LIL+ALGILS+P +SG    H   DLSS
Sbjct: 1   MAKIRHSRLQGRKSSSFSTLTFTMLVMFSFVVLILVALGILSVPSSSGGDSTHKPNDLSS 60

Query: 61  IVRKTSD--EVDEEKGEQWAEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSSVVDS 120
           IVRK +D  E DE KGE+W EVISWEPRAF+YHNFLTKEECEYLI+LA P+M+KS+VVDS
Sbjct: 61  IVRKNADRSEGDEGKGERWVEVISWEPRAFIYHNFLTKEECEYLINLAMPNMKKSTVVDS 120

Query: 121 ETGKSKDSRVRTSSGTFLPRGRDKIIRTIEKRIADFSFVPVEHGEGLQVLHYEVGQKYEP 180
           ETGKSKDSRVRTSSGTFL RGRDK+IRTIEKRIADF+F+PVEHGEGLQ+LHYEVGQKYEP
Sbjct: 121 ETGKSKDSRVRTSSGTFLARGRDKVIRTIEKRIADFTFIPVEHGEGLQILHYEVGQKYEP 180

Query: 181 HFDYFLDEYNTKNGGQRIATMLMYLSDVEEGGETVFPAAKGNFSSVPWWDELSDCGKKGL 240
           HFDYFLD++NT+NGGQR+AT+LMYLSDVEEGGETVFPAAKGNFSSVPWW+ELSDCGKKGL
Sbjct: 181 HFDYFLDDFNTQNGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGL 240

Query: 241 SVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA 287
           SVKPKRGDALLFWSMKPDA+LDPSSLHGGCPVIKGNKWS+TKWMR  EYKA
Sbjct: 241 SVKPKRGDALLFWSMKPDATLDPSSLHGGCPVIKGNKWSSTKWMRANEYKA 291

BLAST of CmaCh07G004010 vs. TrEMBL
Match: A0A151THU9_CAJCA (Prolyl 4-hydroxylase subunit alpha-1 OS=Cajanus cajan GN=KK1_012931 PE=4 SV=1)

HSP 1 Score: 493.4 bits (1269), Expect = 1.8e-136
Identity = 237/290 (81.72%), Positives = 267/290 (92.07%), Query Frame = 1

Query: 1   MAKHRQPRFSSRKSSSSSTLIFTLLIMFTFVILILLALGILSIPGNSGA--PKAHDLSSI 60
           MAK R  R   RKSSSSSTLI TL ++FTF++LILLALGILS+P +S +  PK +DL+SI
Sbjct: 1   MAKPRHSRLPPRKSSSSSTLILTLFLVFTFLVLILLALGILSVPSSSRSNLPKPNDLTSI 60

Query: 61  VRKTSDEVDE--EKGEQWAEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSSVVDSE 120
            R T  + D+  E+GEQW EV+SWEPRAFVYHNFLTKEECEYLI +AKP+MQKS+VVDSE
Sbjct: 61  ARNTLHKTDDDDERGEQWVEVVSWEPRAFVYHNFLTKEECEYLIDIAKPNMQKSTVVDSE 120

Query: 121 TGKSKDSRVRTSSGTFLPRGRDKIIRTIEKRIADFSFVPVEHGEGLQVLHYEVGQKYEPH 180
           TGKSKDSRVRTSSGTFLPRGRDKIIR IEKRIADF+F+PVEHGEGLQ+LHYEVGQKYEPH
Sbjct: 121 TGKSKDSRVRTSSGTFLPRGRDKIIRDIEKRIADFTFIPVEHGEGLQILHYEVGQKYEPH 180

Query: 181 FDYFLDEYNTKNGGQRIATMLMYLSDVEEGGETVFPAAKGNFSSVPWWDELSDCGKKGLS 240
           +DYFLD++NTKNGGQRIAT+LMYL+DVEEGGETVFPAAKGNFSSVPWW+ELS+CGKKGLS
Sbjct: 181 YDYFLDDFNTKNGGQRIATVLMYLTDVEEGGETVFPAAKGNFSSVPWWNELSECGKKGLS 240

Query: 241 VKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA 287
           +KPKRGDALLFWSMKPDA+LDPSSLHGGCPVIKGNKWS+TKWMRV+EYKA
Sbjct: 241 IKPKRGDALLFWSMKPDATLDPSSLHGGCPVIKGNKWSSTKWMRVDEYKA 290

BLAST of CmaCh07G004010 vs. TrEMBL
Match: V7CKH4_PHAVU (Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_002G104900g PE=4 SV=1)

HSP 1 Score: 491.1 bits (1263), Expect = 9.1e-136
Identity = 238/290 (82.07%), Positives = 265/290 (91.38%), Query Frame = 1

Query: 1   MAKHRQPRFSSRKSSSSSTLIFTLLIMFTFVILILLALGILSIPGNS--GAPKAHDLSSI 60
           MAK R  R   RKSSSSSTLI TLL++FTF+ILILLALGILSIP +S    PK +DL+SI
Sbjct: 1   MAKPRYSRLQPRKSSSSSTLILTLLLVFTFLILILLALGILSIPSSSRNDLPKPNDLTSI 60

Query: 61  VRKT--SDEVDEEKGEQWAEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSSVVDSE 120
            R T  + + DEE+GEQW EV+SWEPRAFVYHNFLTKEEC+YLI +AKP MQKS+VVDSE
Sbjct: 61  ARNTIQTSDDDEERGEQWVEVVSWEPRAFVYHNFLTKEECDYLIDIAKPSMQKSTVVDSE 120

Query: 121 TGKSKDSRVRTSSGTFLPRGRDKIIRTIEKRIADFSFVPVEHGEGLQVLHYEVGQKYEPH 180
           TGKSKDSRVRTSSGTFLPRGRDKI+R IEK+IADFSF+PVEHGEGLQVLHYEVGQKYEPH
Sbjct: 121 TGKSKDSRVRTSSGTFLPRGRDKIVRNIEKKIADFSFIPVEHGEGLQVLHYEVGQKYEPH 180

Query: 181 FDYFLDEYNTKNGGQRIATMLMYLSDVEEGGETVFPAAKGNFSSVPWWDELSDCGKKGLS 240
           +DYFLD++NTKNGGQRIAT+LMYL+DVEEGGETVFPAAKGNFSSVPWW+ELSDCGKKGLS
Sbjct: 181 YDYFLDDFNTKNGGQRIATVLMYLTDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLS 240

Query: 241 VKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA 287
           +KP+RGDALLFWSMKPDA+LD SSLHGGCPVIKGNKWS+TKW+RV EYKA
Sbjct: 241 IKPRRGDALLFWSMKPDATLDSSSLHGGCPVIKGNKWSSTKWLRVNEYKA 290

BLAST of CmaCh07G004010 vs. TrEMBL
Match: A0A0S3T4Z4_PHAAN (Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.10G148300 PE=4 SV=1)

HSP 1 Score: 485.7 bits (1249), Expect = 3.8e-134
Identity = 234/290 (80.69%), Positives = 264/290 (91.03%), Query Frame = 1

Query: 1   MAKHRQPRFSSRKSSSSSTLIFTLLIMFTFVILILLALGILSIPGNSGA--PKAHDLSSI 60
           MAK R  R   RKSSSSSTLI TL ++FTF++LILLALGILSIP +S    PK +DL+SI
Sbjct: 1   MAKPRYSRLQPRKSSSSSTLILTLFLVFTFLVLILLALGILSIPSSSRTDLPKPNDLTSI 60

Query: 61  VRKTSD--EVDEEKGEQWAEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSSVVDSE 120
              T+   +VD+++GEQW EV+SWEPRAFVYHNFLTKEEC+YLI +AKP+MQKS+VVDS+
Sbjct: 61  AHSTTQASDVDDDRGEQWVEVVSWEPRAFVYHNFLTKEECDYLIDVAKPNMQKSTVVDSD 120

Query: 121 TGKSKDSRVRTSSGTFLPRGRDKIIRTIEKRIADFSFVPVEHGEGLQVLHYEVGQKYEPH 180
           TGKSKDSRVRTSSGTFL RGRDKIIR IEK++A FSF+PVEHGEGLQVLHYEVGQKYEPH
Sbjct: 121 TGKSKDSRVRTSSGTFLARGRDKIIRKIEKKLAHFSFIPVEHGEGLQVLHYEVGQKYEPH 180

Query: 181 FDYFLDEYNTKNGGQRIATMLMYLSDVEEGGETVFPAAKGNFSSVPWWDELSDCGKKGLS 240
           +DYFLD++NTKNGGQRIAT+LMYL+DVEEGGETVFPAAKGNFSSVPWW+ELSDCGKKGLS
Sbjct: 181 YDYFLDDFNTKNGGQRIATVLMYLTDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLS 240

Query: 241 VKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA 287
           +KPKRGDALLFWSMKPDA+LDPSSLHGGCPVIKGNKWS+TKWMRV EYKA
Sbjct: 241 IKPKRGDALLFWSMKPDATLDPSSLHGGCPVIKGNKWSSTKWMRVNEYKA 290

BLAST of CmaCh07G004010 vs. TAIR10
Match: AT5G66060.1 (AT5G66060.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 458.0 bits (1177), Expect = 4.3e-129
Identity = 224/288 (77.78%), Positives = 255/288 (88.54%), Query Frame = 1

Query: 1   MAKHRQPRFSSRKSSSSSTLIFTLLIMFTFVILILLALGILSIPGNS-GAPKAHDLSSIV 60
           MA+ R  R S+RKSS S TL+F +LIM TFVILILLA GILS+P N+ G+ KA+DL+SIV
Sbjct: 2   MARPRNHRPSARKSSHS-TLVFAVLIMSTFVILILLAFGILSVPSNNAGSSKANDLTSIV 61

Query: 61  RKTSDEV--DEEKGEQWAEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSSVVDSET 120
           RKT      D+ K E+W E+ISWEPRA VYHNFLTKEEC+YLI LAKPHM+KS+VVD +T
Sbjct: 62  RKTLQRSGEDDSKNERWVEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKT 121

Query: 121 GKSKDSRVRTSSGTFLPRGRDKIIRTIEKRIADFSFVPVEHGEGLQVLHYEVGQKYEPHF 180
           GKS DSRVRTSSGTFL RGRDK IR IEKRI+DF+F+PVEHGEGLQVLHYE+GQKYEPH+
Sbjct: 122 GKSTDSRVRTSSGTFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHY 181

Query: 181 DYFLDEYNTKNGGQRIATMLMYLSDVEEGGETVFPAAKGNFSSVPWWDELSDCGKKGLSV 240
           DYF+DEYNT+NGGQRIAT+LMYLSDVEEGGETVFPAAKGN+S+VPWW+ELS+CGK GLSV
Sbjct: 182 DYFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSV 241

Query: 241 KPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYK 286
           KPK GDALLFWSM PDA+LDPSSLHGGC VIKGNKWS+TKW+RV EYK
Sbjct: 242 KPKMGDALLFWSMTPDATLDPSSLHGGCAVIKGNKWSSTKWLRVHEYK 288

BLAST of CmaCh07G004010 vs. TAIR10
Match: AT1G20270.1 (AT1G20270.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 433.3 bits (1113), Expect = 1.1e-121
Identity = 209/287 (72.82%), Positives = 246/287 (85.71%), Query Frame = 1

Query: 1   MAKHRQPRFSSRKSSSSSTLIFTLLIMFTFVILILLALGILSIPGNSGAPKAHDLSSIVR 60
           MAK R  RF +RK S+   ++F +L M T V+L+LLA G+ S+P N+      DLS   R
Sbjct: 1   MAKLRHSRFQARKWSTLMLVLF-MLFMLTIVLLMLLAFGVFSLPINNDESSPIDLSYFRR 60

Query: 61  KTSDEVDE--EKGEQWAEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSSVVDSETG 120
             ++  +   ++G+QW EV+SWEPRAFVYHNFL+KEECEYLISLAKPHM KS+VVDSETG
Sbjct: 61  AATERSEGLGKRGDQWTEVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETG 120

Query: 121 KSKDSRVRTSSGTFLPRGRDKIIRTIEKRIADFSFVPVEHGEGLQVLHYEVGQKYEPHFD 180
           KSKDSRVRTSSGTFL RGRDKII+TIEKRIAD++F+P +HGEGLQVLHYE GQKYEPH+D
Sbjct: 121 KSKDSRVRTSSGTFLRRGRDKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYD 180

Query: 181 YFLDEYNTKNGGQRIATMLMYLSDVEEGGETVFPAAKGNFSSVPWWDELSDCGKKGLSVK 240
           YF+DE+NTKNGGQR+ATMLMYLSDVEEGGETVFPAA  NFSSVPW++ELS+CGKKGLSVK
Sbjct: 181 YFVDEFNTKNGGQRMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVK 240

Query: 241 PKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYK 286
           P+ GDALLFWSM+PDA+LDP+SLHGGCPVI+GNKWS+TKWM V EYK
Sbjct: 241 PRMGDALLFWSMRPDATLDPTSLHGGCPVIRGNKWSSTKWMHVGEYK 286

BLAST of CmaCh07G004010 vs. TAIR10
Match: AT2G17720.1 (AT2G17720.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 422.2 bits (1084), Expect = 2.6e-118
Identity = 204/281 (72.60%), Positives = 240/281 (85.41%), Query Frame = 1

Query: 8   RFSSRKSSSSSTLIFTLLIMFTFVILILLALGILSIPG-NSGAPKAHDLSSIVRK--TSD 67
           R+  RKS S ST  FT+LI+   VILILL LGILS+P  N  + K +DL++IVRK  TS 
Sbjct: 10  RYQPRKSVSRSTQAFTVLILLLVVILILLGLGILSLPNANRNSSKTNDLTNIVRKSETSS 69

Query: 68  EVDEEKGEQWAEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSSVVDSETGKSKDSR 127
             +E  GE+W EVISWEPRA VYHNFLT EECE+LISLAKP M KS+VVD +TG SKDSR
Sbjct: 70  GDEEGNGERWVEVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSR 129

Query: 128 VRTSSGTFLPRGRDKIIRTIEKRIADFSFVPVEHGEGLQVLHYEVGQKYEPHFDYFLDEY 187
           VRTSSGTFL RG D+++  IEKRI+DF+F+PVE+GEGLQVLHY+VGQKYEPH+DYFLDE+
Sbjct: 130 VRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEF 189

Query: 188 NTKNGGQRIATMLMYLSDVEEGGETVFPAAKGNFSSVPWWDELSDCGKKGLSVKPKRGDA 247
           NTKNGGQRIAT+LMYLSDV++GGETVFPAA+GN S+VPWW+ELS CGK+GLSV PK+ DA
Sbjct: 190 NTKNGGQRIATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRDA 249

Query: 248 LLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYK 286
           LLFW+M+PDASLDPSSLHGGCPV+KGNKWS+TKW  V E+K
Sbjct: 250 LLFWNMRPDASLDPSSLHGGCPVVKGNKWSSTKWFHVHEFK 290

BLAST of CmaCh07G004010 vs. TAIR10
Match: AT4G35810.1 (AT4G35810.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 400.2 bits (1027), Expect = 1.1e-111
Identity = 198/288 (68.75%), Positives = 232/288 (80.56%), Query Frame = 1

Query: 3   KHRQPRFSSRKSSSSSTLIFTLLIMFTFVILILLALGILSIPG-NSGAPKAHDLSSIV-- 62
           K +Q R   RKS S+ T  FT++++  FVILIL+ LGI S+P  N  +    DL++IV  
Sbjct: 4   KPKQLRNKPRKSFSTQT--FTVVVLVLFVILILVGLGIFSLPSTNKTSSMPMDLTTIVQT 63

Query: 63  ---RKTSDEVDEEKGEQWAEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSSVVDSE 122
              R++  + ++  G++W EVISWEPRAFVYHNFLT EECE+LISLAKP M KS VVD +
Sbjct: 64  IQERESFGDEEDGNGDRWLEVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVK 123

Query: 123 TGKSKDSRVRTSSGTFLPRGRDKIIRTIEKRIADFSFVPVEHGEGLQVLHYEVGQKYEPH 182
           TGKS DSRVRTSSGTFL RG D+I+  IE RI+DF+F+P E+GEGLQVLHYEVGQ+YEPH
Sbjct: 124 TGKSIDSRVRTSSGTFLNRGHDEIVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPH 183

Query: 183 FDYFLDEYNTKNGGQRIATMLMYLSDVEEGGETVFPAAKGNFSSVPWWDELSDCGKKGLS 242
            DYF DE+N + GGQRIAT+LMYLSDV+EGGETVFPAAKGN S VPWWDELS CGK+GLS
Sbjct: 184 HDYFFDEFNVRKGGQRIATVLMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCGKEGLS 243

Query: 243 VKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEY 285
           V PK+ DALLFWSMKPDASLDPSSLHGGCPVIKGNKWS+TKW  V EY
Sbjct: 244 VLPKKRDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWFHVHEY 289

BLAST of CmaCh07G004010 vs. TAIR10
Match: AT5G18900.1 (AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 250.0 bits (637), Expect = 1.8e-66
Identity = 121/231 (52.38%), Positives = 174/231 (75.32%), Query Frame = 1

Query: 56  SSIVRKTSDEVDEEKGEQWAEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSSVVDS 115
           +S++  +S  V+  K +Q    +S +PRAFVY  FLT+ EC++++SLAK  +++S+V D+
Sbjct: 22  TSLISSSSVFVNPSKVKQ----VSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADN 81

Query: 116 ETGKSKDSRVRTSSGTFLPRGRDKIIRTIEKRIADFSFVPVEHGEGLQVLHYEVGQKYEP 175
           ++G+SK S VRTSSGTF+ +G+D I+  IE +I+ ++F+P E+GE +QVL YE GQKY+ 
Sbjct: 82  DSGESKFSEVRTSSGTFISKGKDPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDA 141

Query: 176 HFDYFLDEYNTKNGGQRIATMLMYLSDVEEGGETVFPAAKGNFSSV--PWWDELSDCGKK 235
           HFDYF D+ N   GG R+AT+LMYLS+V +GGETVFP A+     V     ++LSDC K+
Sbjct: 142 HFDYFHDKVNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENKEDLSDCAKR 201

Query: 236 GLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEY 285
           G++VKP++GDALLF+++ PDA  DP SLHGGCPVI+G KWSATKW+ V+ +
Sbjct: 202 GIAVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIHVDSF 248

BLAST of CmaCh07G004010 vs. NCBI nr
Match: gi|659080598|ref|XP_008440878.1| (PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis melo])

HSP 1 Score: 551.2 bits (1419), Expect = 1.1e-153
Identity = 269/287 (93.73%), Positives = 280/287 (97.56%), Query Frame = 1

Query: 1   MAKHRQPRFSSRKSSSSSTLIFTLLIMFTFVILILLALGILSIPGNSG-APKAHDLSSIV 60
           MAKHRQ RF +RKSSSSSTL+FTLLIMFTFVILILLALGILSIPGNSG + K HDLSSIV
Sbjct: 1   MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV 60

Query: 61  RKTSDEVDEEKGEQWAEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSSVVDSETGK 120
           RKTSD+VDEEKGEQW EVISWEPRAF+YHNFLTKEECEYLISLAKPHMQKS+VVDSETG+
Sbjct: 61  RKTSDDVDEEKGEQWVEVISWEPRAFIYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQ 120

Query: 121 SKDSRVRTSSGTFLPRGRDKIIRTIEKRIADFSFVPVEHGEGLQVLHYEVGQKYEPHFDY 180
           SKDSRVRTSSGTFLPRGRDK IRTIEKRI+DFSF+PVEHGEGLQVLHYEVGQKYEPHFDY
Sbjct: 121 SKDSRVRTSSGTFLPRGRDKTIRTIEKRISDFSFIPVEHGEGLQVLHYEVGQKYEPHFDY 180

Query: 181 FLDEYNTKNGGQRIATMLMYLSDVEEGGETVFPAAKGNFSSVPWWDELSDCGKKGLSVKP 240
           FLDEYNTKNGGQRIAT+LMYLSDVEEGGETVFPAAKGNFSSVPWW+ELSDCGKKGLSVKP
Sbjct: 181 FLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKP 240

Query: 241 KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA 287
           KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA
Sbjct: 241 KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA 287

BLAST of CmaCh07G004010 vs. NCBI nr
Match: gi|449434114|ref|XP_004134841.1| (PREDICTED: probable prolyl 4-hydroxylase 10 [Cucumis sativus])

HSP 1 Score: 550.4 bits (1417), Expect = 1.8e-153
Identity = 268/287 (93.38%), Positives = 280/287 (97.56%), Query Frame = 1

Query: 1   MAKHRQPRFSSRKSSSSSTLIFTLLIMFTFVILILLALGILSIPGNSG-APKAHDLSSIV 60
           MAKHRQ RF +RKSSSSSTL+FTLLIMFTFVILILLALGILSIPGNSG + K HDLSSIV
Sbjct: 1   MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV 60

Query: 61  RKTSDEVDEEKGEQWAEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSSVVDSETGK 120
           RKTSD+VDEEKGEQW EVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKS+VVDSETG+
Sbjct: 61  RKTSDDVDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQ 120

Query: 121 SKDSRVRTSSGTFLPRGRDKIIRTIEKRIADFSFVPVEHGEGLQVLHYEVGQKYEPHFDY 180
           SKDSRVRTSSGTFLPRGRDK +RTIEKR++DFSF+PVEHGEGLQVLHYEVGQKYEPHFDY
Sbjct: 121 SKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDY 180

Query: 181 FLDEYNTKNGGQRIATMLMYLSDVEEGGETVFPAAKGNFSSVPWWDELSDCGKKGLSVKP 240
           FLDEYNTKNGGQRIAT+LMYLSDVEEGGETVFPAAKGNFSSVPWW+ELSDCGKKGLSVKP
Sbjct: 181 FLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKP 240

Query: 241 KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA 287
           KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA
Sbjct: 241 KRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA 287

BLAST of CmaCh07G004010 vs. NCBI nr
Match: gi|703163281|ref|XP_010113287.1| (Prolyl 4-hydroxylase subunit alpha-1 [Morus notabilis])

HSP 1 Score: 494.2 bits (1271), Expect = 1.5e-136
Identity = 239/291 (82.13%), Positives = 265/291 (91.07%), Query Frame = 1

Query: 1   MAKHRQPRFSSRKSSSSSTLIFTLLIMFTFVILILLALGILSIPGNSGAPKAH---DLSS 60
           MAK R  R   RKSSS STL FT+L+MF+FV+LIL+ALGILS+P +SG    H   DLSS
Sbjct: 1   MAKIRHSRLQGRKSSSFSTLTFTMLVMFSFVVLILVALGILSVPSSSGGDSTHKPNDLSS 60

Query: 61  IVRKTSD--EVDEEKGEQWAEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSSVVDS 120
           IVRK +D  E DE KGE+W EVISWEPRAF+YHNFLTKEECEYLI+LA P+M+KS+VVDS
Sbjct: 61  IVRKNADRSEGDEGKGERWVEVISWEPRAFIYHNFLTKEECEYLINLAMPNMKKSTVVDS 120

Query: 121 ETGKSKDSRVRTSSGTFLPRGRDKIIRTIEKRIADFSFVPVEHGEGLQVLHYEVGQKYEP 180
           ETGKSKDSRVRTSSGTFL RGRDK+IRTIEKRIADF+F+PVEHGEGLQ+LHYEVGQKYEP
Sbjct: 121 ETGKSKDSRVRTSSGTFLARGRDKVIRTIEKRIADFTFIPVEHGEGLQILHYEVGQKYEP 180

Query: 181 HFDYFLDEYNTKNGGQRIATMLMYLSDVEEGGETVFPAAKGNFSSVPWWDELSDCGKKGL 240
           HFDYFLD++NT+NGGQR+AT+LMYLSDVEEGGETVFPAAKGNFSSVPWW+ELSDCGKKGL
Sbjct: 181 HFDYFLDDFNTQNGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGL 240

Query: 241 SVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA 287
           SVKPKRGDALLFWSMKPDA+LDPSSLHGGCPVIKGNKWS+TKWMR  EYKA
Sbjct: 241 SVKPKRGDALLFWSMKPDATLDPSSLHGGCPVIKGNKWSSTKWMRANEYKA 291

BLAST of CmaCh07G004010 vs. NCBI nr
Match: gi|1012355443|gb|KYP66630.1| (Prolyl 4-hydroxylase subunit alpha-1 [Cajanus cajan])

HSP 1 Score: 493.4 bits (1269), Expect = 2.6e-136
Identity = 237/290 (81.72%), Positives = 267/290 (92.07%), Query Frame = 1

Query: 1   MAKHRQPRFSSRKSSSSSTLIFTLLIMFTFVILILLALGILSIPGNSGA--PKAHDLSSI 60
           MAK R  R   RKSSSSSTLI TL ++FTF++LILLALGILS+P +S +  PK +DL+SI
Sbjct: 1   MAKPRHSRLPPRKSSSSSTLILTLFLVFTFLVLILLALGILSVPSSSRSNLPKPNDLTSI 60

Query: 61  VRKTSDEVDE--EKGEQWAEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSSVVDSE 120
            R T  + D+  E+GEQW EV+SWEPRAFVYHNFLTKEECEYLI +AKP+MQKS+VVDSE
Sbjct: 61  ARNTLHKTDDDDERGEQWVEVVSWEPRAFVYHNFLTKEECEYLIDIAKPNMQKSTVVDSE 120

Query: 121 TGKSKDSRVRTSSGTFLPRGRDKIIRTIEKRIADFSFVPVEHGEGLQVLHYEVGQKYEPH 180
           TGKSKDSRVRTSSGTFLPRGRDKIIR IEKRIADF+F+PVEHGEGLQ+LHYEVGQKYEPH
Sbjct: 121 TGKSKDSRVRTSSGTFLPRGRDKIIRDIEKRIADFTFIPVEHGEGLQILHYEVGQKYEPH 180

Query: 181 FDYFLDEYNTKNGGQRIATMLMYLSDVEEGGETVFPAAKGNFSSVPWWDELSDCGKKGLS 240
           +DYFLD++NTKNGGQRIAT+LMYL+DVEEGGETVFPAAKGNFSSVPWW+ELS+CGKKGLS
Sbjct: 181 YDYFLDDFNTKNGGQRIATVLMYLTDVEEGGETVFPAAKGNFSSVPWWNELSECGKKGLS 240

Query: 241 VKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA 287
           +KPKRGDALLFWSMKPDA+LDPSSLHGGCPVIKGNKWS+TKWMRV+EYKA
Sbjct: 241 IKPKRGDALLFWSMKPDATLDPSSLHGGCPVIKGNKWSSTKWMRVDEYKA 290

BLAST of CmaCh07G004010 vs. NCBI nr
Match: gi|593789660|ref|XP_007157869.1| (hypothetical protein PHAVU_002G104900g [Phaseolus vulgaris])

HSP 1 Score: 491.1 bits (1263), Expect = 1.3e-135
Identity = 238/290 (82.07%), Positives = 265/290 (91.38%), Query Frame = 1

Query: 1   MAKHRQPRFSSRKSSSSSTLIFTLLIMFTFVILILLALGILSIPGNS--GAPKAHDLSSI 60
           MAK R  R   RKSSSSSTLI TLL++FTF+ILILLALGILSIP +S    PK +DL+SI
Sbjct: 1   MAKPRYSRLQPRKSSSSSTLILTLLLVFTFLILILLALGILSIPSSSRNDLPKPNDLTSI 60

Query: 61  VRKT--SDEVDEEKGEQWAEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSSVVDSE 120
            R T  + + DEE+GEQW EV+SWEPRAFVYHNFLTKEEC+YLI +AKP MQKS+VVDSE
Sbjct: 61  ARNTIQTSDDDEERGEQWVEVVSWEPRAFVYHNFLTKEECDYLIDIAKPSMQKSTVVDSE 120

Query: 121 TGKSKDSRVRTSSGTFLPRGRDKIIRTIEKRIADFSFVPVEHGEGLQVLHYEVGQKYEPH 180
           TGKSKDSRVRTSSGTFLPRGRDKI+R IEK+IADFSF+PVEHGEGLQVLHYEVGQKYEPH
Sbjct: 121 TGKSKDSRVRTSSGTFLPRGRDKIVRNIEKKIADFSFIPVEHGEGLQVLHYEVGQKYEPH 180

Query: 181 FDYFLDEYNTKNGGQRIATMLMYLSDVEEGGETVFPAAKGNFSSVPWWDELSDCGKKGLS 240
           +DYFLD++NTKNGGQRIAT+LMYL+DVEEGGETVFPAAKGNFSSVPWW+ELSDCGKKGLS
Sbjct: 181 YDYFLDDFNTKNGGQRIATVLMYLTDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLS 240

Query: 241 VKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA 287
           +KP+RGDALLFWSMKPDA+LD SSLHGGCPVIKGNKWS+TKW+RV EYKA
Sbjct: 241 IKPRRGDALLFWSMKPDATLDSSSLHGGCPVIKGNKWSSTKWLRVNEYKA 290
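
The percentages in each HSP summary line are the match counts divided by the aligned length reported by BLAST. A minimal sketch of that arithmetic follows, using the Cajanus cajan summary line as input; the helper names `parse_fractions` and `percent` are hypothetical and not part of any BLAST tool.

```python
import re

# Summary line copied from the Cajanus cajan HSP (KYP66630.1).
HSP_SUMMARY = "Identity = 237/290 (81.72%), Positives = 267/290 (92.07%), Query Frame = 1"


def parse_fractions(line):
    """Return (matches, aligned_length, reported_percent) for each 'a/b (p%)' group."""
    pattern = re.compile(r"(\d+)/(\d+) \((\d+(?:\.\d+)?)%\)")
    return [(int(a), int(b), float(p)) for a, b, p in pattern.findall(line)]


def percent(matches, length):
    """Recompute the percentage, rounded to two decimals as in the report."""
    return round(100.0 * matches / length, 2)


# The first group is the identity count, the second the positives count.
identity, positives = parse_fractions(HSP_SUMMARY)

for label, (matches, length, reported) in (("Identity", identity), ("Positives", positives)):
    print(label, matches, length, percent(matches, length), reported)
```

The same check applied to the Phaseolus vulgaris HSP gives 238/290 = 82.07% and 265/290 = 91.38%, which agrees with its summary line.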

The following BLAST results are available for this feature:
Match Name    E-value   Identity (%)  Description
P4H10_ARATH   7.6e-128  77.78         Probable prolyl 4-hydroxylase 10 OS=Arabidopsis thaliana GN=P4H10 PE=2 SV=1
P4H3_ARATH    2.0e-120  72.82         Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana GN=P4H3 PE=2 SV=1
P4H5_ARATH    4.6e-117  72.60         Prolyl 4-hydroxylase 5 OS=Arabidopsis thaliana GN=P4H5 PE=2 SV=1
P4H8_ARATH    1.9e-110  68.75         Probable prolyl 4-hydroxylase 8 OS=Arabidopsis thaliana GN=P4H8 PE=3 SV=1
P4H4_ARATH    3.2e-65   52.38         Probable prolyl 4-hydroxylase 4 OS=Arabidopsis thaliana GN=P4H4 PE=2 SV=1

Match Name        E-value   Identity (%)  Description
A0A0A0KJH9_CUCSA  1.3e-153  93.38         Uncharacterized protein OS=Cucumis sativus GN=Csa_6G507320 PE=4 SV=1
W9SCS8_9ROSA      1.1e-136  82.13         Prolyl 4-hydroxylase subunit alpha-1 OS=Morus notabilis GN=L484_026616 PE=4 SV=1
A0A151THU9_CAJCA  1.8e-136  81.72         Prolyl 4-hydroxylase subunit alpha-1 OS=Cajanus cajan GN=KK1_012931 PE=4 SV=1
V7CKH4_PHAVU      9.1e-136  82.07         Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_002G104900g PE=4 SV=1
A0A0S3T4Z4_PHAAN  3.8e-134  80.69         Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.10G148300 PE=...

Match Name   E-value   Identity (%)  Description
AT5G66060.1  4.3e-129  77.78         2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot...
AT1G20270.1  1.1e-121  72.82         2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot...
AT2G17720.1  2.6e-118  72.60         2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot...
AT4G35810.1  1.1e-111  68.75         2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot...
AT5G18900.1  1.8e-66   52.38         2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot...

Match Name                        E-value   Identity (%)  Description
gi|659080598|ref|XP_008440878.1|  1.1e-153  93.73         PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis melo]
gi|449434114|ref|XP_004134841.1|  1.8e-153  93.38         PREDICTED: probable prolyl 4-hydroxylase 10 [Cucumis sativus]
gi|703163281|ref|XP_010113287.1|  1.5e-136  82.13         Prolyl 4-hydroxylase subunit alpha-1 [Morus notabilis]
gi|1012355443|gb|KYP66630.1|      2.6e-136  81.72         Prolyl 4-hydroxylase subunit alpha-1 [Cajanus cajan]
gi|593789660|ref|XP_007157869.1|  1.3e-135  82.07         hypothetical protein PHAVU_002G104900g [Phaseolus vulgaris]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR005123   Oxoglu/Fe-dep_dioxygenase
IPR006620   Pro_4_hyd_alph

Vocabulary: Molecular Function
Term        Definition
GO:0016491  oxidoreductase activity
GO:0016705  oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:0005506  iron ion binding
GO:0031418  L-ascorbic acid binding

Vocabulary: Biological Process
Term        Definition
GO:0055114  oxidation-reduction process

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006525 arginine metabolic process
biological_process GO:0055114 oxidation-reduction process
biological_process GO:0018401 peptidyl-proline hydroxylation to 4-hydroxy-L-proline
biological_process GO:0006560 proline metabolic process
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0005506 iron ion binding
molecular_function GO:0031418 L-ascorbic acid binding
molecular_function GO:0004656 procollagen-proline 4-dioxygenase activity
molecular_function GO:0016491 oxidoreductase activity
molecular_function GO:0016705 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh07G004010.1  CmaCh07G004010.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description                          Source   Source Term     Source Description                  Alignment
IPR005123  Oxoglutarate/iron-dependent dioxygenase  PFAM     PF13640         2OG-FeII_Oxy_3                      coord: 162..280; score: 4.1
IPR005123  Oxoglutarate/iron-dependent dioxygenase  PROFILE  PS51471         FE2OG_OXY                           coord: 158..281; score: 12
IPR006620  Prolyl 4-hydroxylase, alpha subunit      SMART    SM00702         p4hc                                coord: 82..280; score: 7.8
None       No IPR available                         PANTHER  PTHR10869       PROLYL 4-HYDROXYLASE ALPHA SUBUNIT  coord: 20..285; score: 1.2E
None       No IPR available                         PANTHER  PTHR10869:SF83  2-OXOGLUTARATE (2OG) AND FE(II)-DEPENDENT OXYGENASE SUPERFAMILY PROTEIN-RELATED  coord: 20..285; score: 1.2E

The following gene(s) are paralogous to this gene:
Gene            Paralogue       Organism                 Block
CmaCh07G004010  CmaCh03G006520  Cucurbita maxima (Rimu)  cmacmaB523