CSPI02G02650 (gene) Wild cucumber (PI 183967)

Name: CSPI02G02650
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Prolyl 4-hydroxylase subunit alpha-2
Location: Chr2 : 1824846 .. 1826968 (+)
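
The location string above can be turned into a span length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention); `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a string like 'Chr2 : 1824846 .. 1826968 (+)'.

    Returns (chromosome, start, end, strand).
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1

chrom, start, end, strand = parse_location("Chr2 : 1824846 .. 1826968 (+)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the feature spans 2123 positions on the plus strand of Chr2.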
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCAATATCTAAAGGGAAGTACATCAAGTTACAGGGTAGGAAATGGTCCACATTCCAGCTTTCGAAGATGATCATGGCCCTCGTTTTGGCACTTGGGTTTTTCATGCTTATTGCTCTTCGCTTCTTCTCTCCTCCTGAAACTTCTCACCACCGTTTCCCTTCCGTCCGGCATACAGCATTTCTAAGGTAATCTGGTAATGCAATGGGTGTGATTCTTTAAGCTTGTTTGTTACAGGAATCAACTGATTTTGATTTCCATTTTGGTAATATTGTTGTTGTTGTAGTGATGGGTTGGGGAAGAGAGGGGATCAGTGGGTTGAGTTCATTTCTTGGGAGCCTAGAGCTTTTGTATATCATAATTTCTTGGTAAGTTTTTAGAATTGTGGTTTTTTTTTGGAAAAAAAAAGGAAATTTTGTTTTGTTAAAATGGTTGGCTGATATTGGAATGACGTTGGACAGTCCAAAGAAGAATGCTTGTACTTGATTAGTCTTGCAAAACCCCACATGGAGAAATCAACTGTGGTTGATAGCAAAACTGGCGAGAGTGTGGATAGCAGGTATGTCTTTATCATTTTCTTTGTTGTCCTTTTACTAACAGAAACAGAACTTGTCAATTATTGACTATGAACTAACTCTGGTGGGGCTTACACTATGACACATTTAAAAGTGTAAGAAAAAGTAATCTCAAAGAGAGTAAATTTGAACTAAACTCATATTTTAAAGATGATTTATCTATATTTAGTCCTTTCTCATAAGTTATCTCTTTAGGAATTGATTGGTCGGTTCGGTTCTTGGAAAATCAGTTCGGTAAAAGTCCAACCTGAACTGACTACAATAGAACGGTTATGTATAATAGAAGGGTATAAAGTAGAACAATACAATTACAATGAACAAGGCGAACTTATAACTTTTTTCTGTACAATGCCATGTCCACAGGGTGCGAACCAGTTCTGGGATGTTTCTGAATAGAGGGCAGGACAAAATCATCAGAAACATAGAGAAAAGAATAGCAGATTTTACATTCATTCCCATAGGTAAATAATTCAAAACTTTCATATCACCACGGCTTATTTAGAAATACTTTTCAAAACAGTTTTTGAAAACATGTTCGAACCACTGTTTTTTGCTAGTTGTTTTATTTTGTCTGTGTTAGTGATTTCATTCAATATATCAATCTTTACAATGCATTCAACTGTAGTTATATTGAATAATTCTTCAAAATAGAGTTGCAACCAGAACATAATTCTCAAACAAATCTTTGTTGGACTAAACAAACATTCGCGAAAAATTAGTTAAAATTCATGTGGGATTTTGTTCATTATTCTTGTTTTTGGCCCAAACCAAAGAGGGCTTTTGTGAATGAGCTATTCATTTGCTGCTTGTTGTTTTGTTGATATCACTGGCATTTTGGTCTTAATACATTTTGGTCTTAATACATTTTGGTTAGTGTTTGATTATTAGATTATTAGACTTGTCTCGTTTATTATCATTAACAAAGAGGATTGTTTCTGTTAAAAACAACATTTTGGTAATGATATTCACAATTCCACACAGAGCATGGAGAAGGACTTCAAATTCTCCATTATGAAGTTGGGCAGAAATATGATGCGCATTATGATTACTTTGTTGATGAGTACAACATCAAAAAAGGAGGCCAAAGAATGGCCACCCTTCTCATGTATTTGTAAGATTCTACTTTGACGATCTCTCAAACTCATCTACCCTCTTCTCTAGTTTCTCTCTGAATCGATACCAATGGCAATGCAGGTCCGATGTTGAAGAGGGGGGCGAGACGGTGTTCCCAGCTGCAAAAGGAAACTTCAGCTCTGTGCCATGGTGGAATGAACTATCTGAATGTGGCAAAGGTGGACTCTCTGTAAAACCAAAAATGGGAGATGCTTTATTGTTCTGGAGCATGAAGCCCGATGCTACCTTAGACCCTACAAGTTTACATGGTTAGTCCTTTTTCCTTTTCCTCCCTATTTAACTTTAGCATTA
GAAGATTGGTTACATCTAAATTATCCTCCCTAATCTTGTTTTTGATATGTAGGTGCCTGTCCTGTCATTAGAGGGAACAAATGGTCATGTACAAAGTGGATGCATGTTGATAAATATATTTAA

mRNA sequence

ATGGCAATATCTAAAGGGAAGTACATCAAGTTACAGGGTAGGAAATGGTCCACATTCCAGCTTTCGAAGATGATCATGGCCCTCGTTTTGGCACTTGGGTTTTTCATGCTTATTGCTCTTCGCTTCTTCTCTCCTCCTGAAACTTCTCACCACCGTTTCCCTTCCGTCCGGCATACAGCATTTCTAAGTGATGGGTTGGGGAAGAGAGGGGATCAGTGGGTTGAGTTCATTTCTTGGGAGCCTAGAGCTTTTGTATATCATAATTTCTTGTCCAAAGAAGAATGCTTGTACTTGATTAGTCTTGCAAAACCCCACATGGAGAAATCAACTGTGGTTGATAGCAAAACTGGCGAGAGTGTGGATAGCAGGGTGCGAACCAGTTCTGGGATGTTTCTGAATAGAGGGCAGGACAAAATCATCAGAAACATAGAGAAAAGAATAGCAGATTTTACATTCATTCCCATAGAGCATGGAGAAGGACTTCAAATTCTCCATTATGAAGTTGGGCAGAAATATGATGCGCATTATGATTACTTTGTTGATGAGTACAACATCAAAAAAGGAGGCCAAAGAATGGCCACCCTTCTCATGTATTTGTCCGATGTTGAAGAGGGGGGCGAGACGGTGTTCCCAGCTGCAAAAGGAAACTTCAGCTCTGTGCCATGGTGGAATGAACTATCTGAATGTGGCAAAGGTGGACTCTCTGTAAAACCAAAAATGGGAGATGCTTTATTGTTCTGGAGCATGAAGCCCGATGCTACCTTAGACCCTACAAGTTTACATGGTGCCTGTCCTGTCATTAGAGGGAACAAATGGTCATGTACAAAGTGGATGCATGTTGATAAATATATTTAA

Coding sequence (CDS)

ATGGCAATATCTAAAGGGAAGTACATCAAGTTACAGGGTAGGAAATGGTCCACATTCCAGCTTTCGAAGATGATCATGGCCCTCGTTTTGGCACTTGGGTTTTTCATGCTTATTGCTCTTCGCTTCTTCTCTCCTCCTGAAACTTCTCACCACCGTTTCCCTTCCGTCCGGCATACAGCATTTCTAAGTGATGGGTTGGGGAAGAGAGGGGATCAGTGGGTTGAGTTCATTTCTTGGGAGCCTAGAGCTTTTGTATATCATAATTTCTTGTCCAAAGAAGAATGCTTGTACTTGATTAGTCTTGCAAAACCCCACATGGAGAAATCAACTGTGGTTGATAGCAAAACTGGCGAGAGTGTGGATAGCAGGGTGCGAACCAGTTCTGGGATGTTTCTGAATAGAGGGCAGGACAAAATCATCAGAAACATAGAGAAAAGAATAGCAGATTTTACATTCATTCCCATAGAGCATGGAGAAGGACTTCAAATTCTCCATTATGAAGTTGGGCAGAAATATGATGCGCATTATGATTACTTTGTTGATGAGTACAACATCAAAAAAGGAGGCCAAAGAATGGCCACCCTTCTCATGTATTTGTCCGATGTTGAAGAGGGGGGCGAGACGGTGTTCCCAGCTGCAAAAGGAAACTTCAGCTCTGTGCCATGGTGGAATGAACTATCTGAATGTGGCAAAGGTGGACTCTCTGTAAAACCAAAAATGGGAGATGCTTTATTGTTCTGGAGCATGAAGCCCGATGCTACCTTAGACCCTACAAGTTTACATGGTGCCTGTCCTGTCATTAGAGGGAACAAATGGTCATGTACAAAGTGGATGCATGTTGATAAATATATTTAA
BLAST of CSPI02G02650 vs. Swiss-Prot
Match: P4H3_ARATH (Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana GN=P4H3 PE=2 SV=1)

HSP 1 Score: 423.3 bits (1087), Expect = 2.1e-117
Identity = 204/286 (71.33%), Positives = 230/286 (80.42%), Query Frame = 1

Query: 3   ISKGKYIKLQGRKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHRFPSV-----R 62
           ++K ++ + Q RKWST  L  + M  +L +   ML+A   FS P  +    P       R
Sbjct: 1   MAKLRHSRFQARKWSTLMLV-LFMLFMLTIVLLMLLAFGVFSLPINNDESSPIDLSYFRR 60

Query: 63  HTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTG 122
                S+GLGKRGDQW E +SWEPRAFVYHNFLSKEEC YLISLAKPHM KSTVVDS+TG
Sbjct: 61  AATERSEGLGKRGDQWTEVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETG 120

Query: 123 ESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYD 182
           +S DSRVRTSSG FL RG+DKII+ IEKRIAD+TFIP +HGEGLQ+LHYE GQKY+ HYD
Sbjct: 121 KSKDSRVRTSSGTFLRRGRDKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYD 180

Query: 183 YFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVK 242
           YFVDE+N K GGQRMAT+LMYLSDVEEGGETVFPAA  NFSSVPW+NELSECGK GLSVK
Sbjct: 181 YFVDEFNTKNGGQRMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVK 240

Query: 243 PKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHVDKY 284
           P+MGDALLFWSM+PDATLDPTSLHG CPVIRGNKWS TKWMHV +Y
Sbjct: 241 PRMGDALLFWSMRPDATLDPTSLHGGCPVIRGNKWSSTKWMHVGEY 285
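
The percentages on each HSP summary line are the match counts divided by the alignment length. The sketch below recomputes them from such a line; `hsp_fractions` is a hypothetical helper and the regular expression is an assumption about this page's text layout, not a general BLAST parser.

```python
import re

def hsp_fractions(line):
    """Return {label: (matches, aligned_length, percent)} for each 'Label = a/b' field."""
    result = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        matches, length = int(matches), int(length)
        result[label] = (matches, length, round(100 * matches / length, 2))
    return result

line = "Identity = 204/286 (71.33%), Positives = 230/286 (80.42%), Query Frame = 1"
for label, (matches, length, pct) in hsp_fractions(line).items():
    print(f"{label}: {matches}/{length} = {pct}%")
```

For the P4H3_ARATH hit this reproduces 71.33% identity and 80.42% positives over 286 aligned columns.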

BLAST of CSPI02G02650 vs. Swiss-Prot
Match: P4H10_ARATH (Probable prolyl 4-hydroxylase 10 OS=Arabidopsis thaliana GN=P4H10 PE=2 SV=1)

HSP 1 Score: 392.9 bits (1008), Expect = 3.0e-108
Identity = 187/270 (69.26%), Positives = 216/270 (80.00%), Query Frame = 1

Query: 22  SKMIMALVLALGFFMLIALRFFSPPETSHHRFPS--------VRHTAFLSDGLGKRGDQW 81
           S ++ A+++   F +LI L F      S++   S        VR T   S     + ++W
Sbjct: 18  STLVFAVLIMSTFVILILLAFGILSVPSNNAGSSKANDLTSIVRKTLQRSGEDDSKNERW 77

Query: 82  VEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLN 141
           VE ISWEPRA VYHNFL+KEEC YLI LAKPHMEKSTVVD KTG+S DSRVRTSSG FL 
Sbjct: 78  VEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSRVRTSSGTFLA 137

Query: 142 RGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMA 201
           RG+DK IR IEKRI+DFTFIP+EHGEGLQ+LHYE+GQKY+ HYDYF+DEYN + GGQR+A
Sbjct: 138 RGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGGQRIA 197

Query: 202 TLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDA 261
           T+LMYLSDVEEGGETVFPAAKGN+S+VPWWNELSECGKGGLSVKPKMGDALLFWSM PDA
Sbjct: 198 TVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSVKPKMGDALLFWSMTPDA 257

Query: 262 TLDPTSLHGACPVIRGNKWSCTKWMHVDKY 284
           TLDP+SLHG C VI+GNKWS TKW+ V +Y
Sbjct: 258 TLDPSSLHGGCAVIKGNKWSSTKWLRVHEY 287

BLAST of CSPI02G02650 vs. Swiss-Prot
Match: P4H8_ARATH (Probable prolyl 4-hydroxylase 8 OS=Arabidopsis thaliana GN=P4H8 PE=3 SV=1)

HSP 1 Score: 368.2 bits (944), Expect = 7.9e-101
Identity = 178/289 (61.59%), Positives = 219/289 (75.78%), Query Frame = 1

Query: 5   KGKYIKLQGRK-WSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHR---------FP 64
           K K ++ + RK +ST   + +++ L + L   +L+ L  FS P T+              
Sbjct: 4   KPKQLRNKPRKSFSTQTFTVVVLVLFVIL---ILVGLGIFSLPSTNKTSSMPMDLTTIVQ 63

Query: 65  SVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDS 124
           +++      D     GD+W+E ISWEPRAFVYHNFL+ EEC +LISLAKP M KS VVD 
Sbjct: 64  TIQERESFGDEEDGNGDRWLEVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDV 123

Query: 125 KTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDA 184
           KTG+S+DSRVRTSSG FLNRG D+I+  IE RI+DFTFIP E+GEGLQ+LHYEVGQ+Y+ 
Sbjct: 124 KTGKSIDSRVRTSSGTFLNRGHDEIVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEP 183

Query: 185 HYDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGL 244
           H+DYF DE+N++KGGQR+AT+LMYLSDV+EGGETVFPAAKGN S VPWW+ELS+CGK GL
Sbjct: 184 HHDYFFDEFNVRKGGQRIATVLMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCGKEGL 243

Query: 245 SVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHVDKY 284
           SV PK  DALLFWSMKPDA+LDP+SLHG CPVI+GNKWS TKW HV +Y
Sbjct: 244 SVLPKKRDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWFHVHEY 289

BLAST of CSPI02G02650 vs. Swiss-Prot
Match: P4H5_ARATH (Prolyl 4-hydroxylase 5 OS=Arabidopsis thaliana GN=P4H5 PE=2 SV=1)

HSP 1 Score: 363.2 bits (931), Expect = 2.6e-99
Identity = 171/289 (59.17%), Positives = 216/289 (74.74%), Query Frame = 1

Query: 1   MAISKGKYIKLQGRKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHRFPS----- 60
           MA    ++++ Q RK  +       + ++L +   +L+ L   S P  + +   +     
Sbjct: 1   MASKSKQHLRYQPRKSVSRSTQAFTVLILLLVVILILLGLGILSLPNANRNSSKTNDLTN 60

Query: 61  -VRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDS 120
            VR +   S      G++WVE ISWEPRA VYHNFL+ EEC +LISLAKP M KSTVVD 
Sbjct: 61  IVRKSETSSGDEEGNGERWVEVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDE 120

Query: 121 KTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDA 180
           KTG S DSRVRTSSG FL RG D+++  IEKRI+DFTFIP+E+GEGLQ+LHY+VGQKY+ 
Sbjct: 121 KTGGSKDSRVRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEP 180

Query: 181 HYDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGL 240
           HYDYF+DE+N K GGQR+AT+LMYLSDV++GGETVFPAA+GN S+VPWWNELS+CGK GL
Sbjct: 181 HYDYFLDEFNTKNGGQRIATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGL 240

Query: 241 SVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHVDKY 284
           SV PK  DALLFW+M+PDA+LDP+SLHG CPV++GNKWS TKW HV ++
Sbjct: 241 SVLPKKRDALLFWNMRPDASLDPSSLHGGCPVVKGNKWSSTKWFHVHEF 289

BLAST of CSPI02G02650 vs. Swiss-Prot
Match: P4H7_ARATH (Probable prolyl 4-hydroxylase 7 OS=Arabidopsis thaliana GN=P4H7 PE=2 SV=1)

HSP 1 Score: 256.5 bits (654), Expect = 3.4e-67
Identity = 116/207 (56.04%), Positives = 157/207 (75.85%), Query Frame = 1

Query: 77  ISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQ 136
           +SW PR F+Y  FLS EEC + I LAK  +EKS V D+ +GESV+S VRTSSGMFL++ Q
Sbjct: 59  LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGMFLSKRQ 118

Query: 137 DKIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATLL 196
           D I+ N+E ++A +TF+P E+GE +QILHYE GQKY+ H+DYF D+ N++ GG R+AT+L
Sbjct: 119 DDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRIATVL 178

Query: 197 MYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLD 256
           MYLS+VE+GGETVFP  KG  + +   +  +EC K G +VKP+ GDALLF+++ P+AT D
Sbjct: 179 MYLSNVEKGGETVFPMWKGKATQLK-DDSWTECAKQGYAVKPRKGDALLFFNLHPNATTD 238

Query: 257 PTSLHGACPVIRGNKWSCTKWMHVDKY 284
             SLHG+CPV+ G KWS T+W+HV  +
Sbjct: 239 SNSLHGSCPVVEGEKWSATRWIHVKSF 264

BLAST of CSPI02G02650 vs. TrEMBL
Match: A0A0A0LFF5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G009620 PE=4 SV=1)

HSP 1 Score: 542.7 bits (1397), Expect = 2.6e-151
Identity = 257/259 (99.23%), Positives = 257/259 (99.23%), Query Frame = 1

Query: 26  MALVLALGFFMLIALRFFSPPETSHHRFPSVRHTAFLSDGLGKRGDQWVEFISWEPRAFV 85
           MALVLALGFFMLIALRF SPPETSHHRF SVRHTAFLSDGLGKRGDQWVEFISWEPRAFV
Sbjct: 1   MALVLALGFFMLIALRFLSPPETSHHRFSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFV 60

Query: 86  YHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEK 145
           YHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEK
Sbjct: 61  YHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEK 120

Query: 146 RIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDVEEG 205
           RIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDVEEG
Sbjct: 121 RIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDVEEG 180

Query: 206 GETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACP 265
           GETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACP
Sbjct: 181 GETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACP 240

Query: 266 VIRGNKWSCTKWMHVDKYI 285
           VIRGNKWSCTKWMHVDKYI
Sbjct: 241 VIRGNKWSCTKWMHVDKYI 259

BLAST of CSPI02G02650 vs. TrEMBL
Match: A0A061FEJ2_THECC (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_034293 PE=4 SV=1)

HSP 1 Score: 433.7 bits (1114), Expect = 1.7e-118
Identity = 207/290 (71.38%), Positives = 238/290 (82.07%), Query Frame = 1

Query: 3   ISKGKYIKLQGRKWSTFQLS-------KMIMALVLALGFFMLIALRFFSPPE--TSHHRF 62
           ++K ++ +LQ +KWST  L         +++ ++L LG F L      SPP   TS+ R 
Sbjct: 1   MAKVRHSRLQAKKWSTVMLVLSMLFMLTVVLLMLLGLGIFSLPMSTDDSPPNDLTSYRRM 60

Query: 63  PSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVD 122
            S R        LGKRG+QW E +SWEPRAF+YHNFLSKEEC YLI+LAKPHM KSTVVD
Sbjct: 61  ASER-----GKELGKRGEQWTEVLSWEPRAFIYHNFLSKEECEYLINLAKPHMAKSTVVD 120

Query: 123 SKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYD 182
           SKTG S DSRVRTSSGMFL RGQDKIIR+IEKRIAD+TFIP+EHGEGLQ+LHYEVGQKYD
Sbjct: 121 SKTGRSKDSRVRTSSGMFLRRGQDKIIRDIEKRIADYTFIPVEHGEGLQVLHYEVGQKYD 180

Query: 183 AHYDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGG 242
           AH+DYF+DE+N K GGQRMAT+LMYLSDVEEGGET+FPAAKGNFS+VPWWNELSECGK G
Sbjct: 181 AHFDYFLDEFNTKNGGQRMATMLMYLSDVEEGGETIFPAAKGNFSAVPWWNELSECGKQG 240

Query: 243 LSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHVDKY 284
           LSVKPKMGDALLFWSM+PDATLDP+SLHG CPVI GNKWS TKW+HV++Y
Sbjct: 241 LSVKPKMGDALLFWSMRPDATLDPSSLHGGCPVIMGNKWSSTKWIHVEEY 285

BLAST of CSPI02G02650 vs. TrEMBL
Match: E0CQW5_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0001g00950 PE=4 SV=1)

HSP 1 Score: 433.3 bits (1113), Expect = 2.2e-118
Identity = 206/287 (71.78%), Positives = 239/287 (83.28%), Query Frame = 1

Query: 3   ISKGKYIKLQGRKWSTFQLSKMIMALVLALGFFMLIALRFFSPP-----ETSHHRFPSVR 62
           ++KG+Y +  G++WST  L   ++ L+L +   ML+AL   S P       + +   S R
Sbjct: 1   MAKGRYSRGHGKRWSTLALVLSLL-LMLTVVLLMLLALGIVSLPIGTVDSDAANDLSSFR 60

Query: 63  HTAFLS-DGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKT 122
              F   +GLGKRG+QW E +SWEPRAF+YHNFLSKEEC Y+ISLAKP+M+KSTVVDS+T
Sbjct: 61  RKTFDGGEGLGKRGEQWTEIVSWEPRAFIYHNFLSKEECEYMISLAKPYMKKSTVVDSET 120

Query: 123 GESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHY 182
           G S DSRVRTSSGMFL RG+DKIIR+IEKRIADFTFIP+EHGEGLQ+LHYEVGQKYDAHY
Sbjct: 121 GRSKDSRVRTSSGMFLRRGRDKIIRDIEKRIADFTFIPVEHGEGLQVLHYEVGQKYDAHY 180

Query: 183 DYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSV 242
           DYF+DE+N K GGQR+ATLLMYLSDVEEGGETVFPA K NFSSVPWWNELSECGK GLSV
Sbjct: 181 DYFLDEFNTKNGGQRIATLLMYLSDVEEGGETVFPATKANFSSVPWWNELSECGKKGLSV 240

Query: 243 KPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHVDKY 284
           KPKMGDALLFWSM+PDATLDP+SLHG CPVI+GNKWS TKWMHV++Y
Sbjct: 241 KPKMGDALLFWSMRPDATLDPSSLHGGCPVIKGNKWSSTKWMHVEEY 286

BLAST of CSPI02G02650 vs. TrEMBL
Match: A0A0D2S8Z3_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_009G269000 PE=4 SV=1)

HSP 1 Score: 429.5 bits (1103), Expect = 3.2e-117
Identity = 207/291 (71.13%), Positives = 240/291 (82.47%), Query Frame = 1

Query: 3   ISKGKYIKLQGRKWST--------FQLSKMIMALVLALGFFMLIALRFFSPPE--TSHHR 62
           ++K ++ +LQ RKWST        F LS +++ ++L LG F L      S P   TS+ R
Sbjct: 1   MAKVRHSRLQARKWSTVTLVLSMLFMLS-VVLLMLLGLGVFFLPINDDDSAPNDLTSYRR 60

Query: 63  FPSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVV 122
             S R       GLGKRG+QW E +SWEPRAF+YHNFLSKEEC YLI+LAKPHM KSTVV
Sbjct: 61  MASER-----GKGLGKRGEQWTEVLSWEPRAFIYHNFLSKEECEYLINLAKPHMVKSTVV 120

Query: 123 DSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKY 182
           DSKTG+S DSRVRTSSGMFL RGQDKII++IEKRIAD++FIP+EHGEGLQ+LHYEVGQKY
Sbjct: 121 DSKTGKSKDSRVRTSSGMFLRRGQDKIIKDIEKRIADYSFIPVEHGEGLQVLHYEVGQKY 180

Query: 183 DAHYDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKG 242
           DAH+DYF+DE+N K GGQRMAT+LMYLSDVEEGGET+FPAAKGN SSVPWWNELSECGK 
Sbjct: 181 DAHFDYFLDEFNTKNGGQRMATMLMYLSDVEEGGETIFPAAKGNISSVPWWNELSECGKQ 240

Query: 243 GLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHVDKY 284
           GL+VKPKMGDALLFWSM+PDATLDP+SLHG CPVI GNKWS TKWMH+++Y
Sbjct: 241 GLAVKPKMGDALLFWSMRPDATLDPSSLHGGCPVIMGNKWSSTKWMHLEEY 285

BLAST of CSPI02G02650 vs. TrEMBL
Match: A0A0B0NIF8_GOSAR (Prolyl 4-hydroxylase subunit alpha-2 OS=Gossypium arboreum GN=F383_02226 PE=4 SV=1)

HSP 1 Score: 428.7 bits (1101), Expect = 5.5e-117
Identity = 205/291 (70.45%), Positives = 238/291 (81.79%), Query Frame = 1

Query: 3   ISKGKYIKLQGRKWSTFQLSKMIMALVLALGFFMLIALRFFSPPE----------TSHHR 62
           ++K ++ +LQ RKWST  L  + M  +L++   ML+ L  FS P           TS+ R
Sbjct: 1   MAKVRHSRLQARKWSTVTLV-LSMLFMLSVVLLMLLGLGIFSLPINNDDSAPNDLTSYRR 60

Query: 63  FPSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVV 122
             S R       GLGKRG+QW E +SWEPRAF+YHNFLSKEEC YLI+LAKPHM KSTVV
Sbjct: 61  MASER-----GKGLGKRGEQWTEVLSWEPRAFIYHNFLSKEECEYLINLAKPHMVKSTVV 120

Query: 123 DSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKY 182
           DSKTG+S DSRVRTSSGMFL RGQDKIIR+IE RIAD++FIP+EHGEGLQ+LHYEVGQKY
Sbjct: 121 DSKTGKSKDSRVRTSSGMFLRRGQDKIIRDIENRIADYSFIPVEHGEGLQVLHYEVGQKY 180

Query: 183 DAHYDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKG 242
           DAH+DYF+DE+N K GGQRMAT+LMYLSDVEEGGET+FPAAKGN S+VPWWNELSECGK 
Sbjct: 181 DAHFDYFLDEFNTKNGGQRMATMLMYLSDVEEGGETIFPAAKGNISAVPWWNELSECGKQ 240

Query: 243 GLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHVDKY 284
           GL+VKPKMGDALLFWSM+PDATLDP+SLHG CPVI GNKWS TKWMH+++Y
Sbjct: 241 GLAVKPKMGDALLFWSMRPDATLDPSSLHGGCPVITGNKWSSTKWMHLEEY 285

BLAST of CSPI02G02650 vs. TAIR10
Match: AT1G20270.1 (AT1G20270.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 423.3 bits (1087), Expect = 1.2e-118
Identity = 204/286 (71.33%), Positives = 230/286 (80.42%), Query Frame = 1

Query: 3   ISKGKYIKLQGRKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHRFPSV-----R 62
           ++K ++ + Q RKWST  L  + M  +L +   ML+A   FS P  +    P       R
Sbjct: 1   MAKLRHSRFQARKWSTLMLV-LFMLFMLTIVLLMLLAFGVFSLPINNDESSPIDLSYFRR 60

Query: 63  HTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTG 122
                S+GLGKRGDQW E +SWEPRAFVYHNFLSKEEC YLISLAKPHM KSTVVDS+TG
Sbjct: 61  AATERSEGLGKRGDQWTEVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETG 120

Query: 123 ESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYD 182
           +S DSRVRTSSG FL RG+DKII+ IEKRIAD+TFIP +HGEGLQ+LHYE GQKY+ HYD
Sbjct: 121 KSKDSRVRTSSGTFLRRGRDKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYD 180

Query: 183 YFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVK 242
           YFVDE+N K GGQRMAT+LMYLSDVEEGGETVFPAA  NFSSVPW+NELSECGK GLSVK
Sbjct: 181 YFVDEFNTKNGGQRMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVK 240

Query: 243 PKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHVDKY 284
           P+MGDALLFWSM+PDATLDPTSLHG CPVIRGNKWS TKWMHV +Y
Sbjct: 241 PRMGDALLFWSMRPDATLDPTSLHGGCPVIRGNKWSSTKWMHVGEY 285

BLAST of CSPI02G02650 vs. TAIR10
Match: AT5G66060.1 (AT5G66060.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 392.9 bits (1008), Expect = 1.7e-109
Identity = 187/270 (69.26%), Positives = 216/270 (80.00%), Query Frame = 1

Query: 22  SKMIMALVLALGFFMLIALRFFSPPETSHHRFPS--------VRHTAFLSDGLGKRGDQW 81
           S ++ A+++   F +LI L F      S++   S        VR T   S     + ++W
Sbjct: 18  STLVFAVLIMSTFVILILLAFGILSVPSNNAGSSKANDLTSIVRKTLQRSGEDDSKNERW 77

Query: 82  VEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLN 141
           VE ISWEPRA VYHNFL+KEEC YLI LAKPHMEKSTVVD KTG+S DSRVRTSSG FL 
Sbjct: 78  VEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSRVRTSSGTFLA 137

Query: 142 RGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMA 201
           RG+DK IR IEKRI+DFTFIP+EHGEGLQ+LHYE+GQKY+ HYDYF+DEYN + GGQR+A
Sbjct: 138 RGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGGQRIA 197

Query: 202 TLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDA 261
           T+LMYLSDVEEGGETVFPAAKGN+S+VPWWNELSECGKGGLSVKPKMGDALLFWSM PDA
Sbjct: 198 TVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSVKPKMGDALLFWSMTPDA 257

Query: 262 TLDPTSLHGACPVIRGNKWSCTKWMHVDKY 284
           TLDP+SLHG C VI+GNKWS TKW+ V +Y
Sbjct: 258 TLDPSSLHGGCAVIKGNKWSSTKWLRVHEY 287

BLAST of CSPI02G02650 vs. TAIR10
Match: AT4G35810.1 (AT4G35810.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 368.2 bits (944), Expect = 4.5e-102
Identity = 178/289 (61.59%), Positives = 219/289 (75.78%), Query Frame = 1

Query: 5   KGKYIKLQGRK-WSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHR---------FP 64
           K K ++ + RK +ST   + +++ L + L   +L+ L  FS P T+              
Sbjct: 4   KPKQLRNKPRKSFSTQTFTVVVLVLFVIL---ILVGLGIFSLPSTNKTSSMPMDLTTIVQ 63

Query: 65  SVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDS 124
           +++      D     GD+W+E ISWEPRAFVYHNFL+ EEC +LISLAKP M KS VVD 
Sbjct: 64  TIQERESFGDEEDGNGDRWLEVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDV 123

Query: 125 KTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDA 184
           KTG+S+DSRVRTSSG FLNRG D+I+  IE RI+DFTFIP E+GEGLQ+LHYEVGQ+Y+ 
Sbjct: 124 KTGKSIDSRVRTSSGTFLNRGHDEIVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEP 183

Query: 185 HYDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGL 244
           H+DYF DE+N++KGGQR+AT+LMYLSDV+EGGETVFPAAKGN S VPWW+ELS+CGK GL
Sbjct: 184 HHDYFFDEFNVRKGGQRIATVLMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCGKEGL 243

Query: 245 SVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHVDKY 284
           SV PK  DALLFWSMKPDA+LDP+SLHG CPVI+GNKWS TKW HV +Y
Sbjct: 244 SVLPKKRDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWFHVHEY 289

BLAST of CSPI02G02650 vs. TAIR10
Match: AT2G17720.1 (AT2G17720.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 363.2 bits (931), Expect = 1.4e-100
Identity = 171/289 (59.17%), Positives = 216/289 (74.74%), Query Frame = 1

Query: 1   MAISKGKYIKLQGRKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHRFPS----- 60
           MA    ++++ Q RK  +       + ++L +   +L+ L   S P  + +   +     
Sbjct: 1   MASKSKQHLRYQPRKSVSRSTQAFTVLILLLVVILILLGLGILSLPNANRNSSKTNDLTN 60

Query: 61  -VRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDS 120
            VR +   S      G++WVE ISWEPRA VYHNFL+ EEC +LISLAKP M KSTVVD 
Sbjct: 61  IVRKSETSSGDEEGNGERWVEVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDE 120

Query: 121 KTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDA 180
           KTG S DSRVRTSSG FL RG D+++  IEKRI+DFTFIP+E+GEGLQ+LHY+VGQKY+ 
Sbjct: 121 KTGGSKDSRVRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEP 180

Query: 181 HYDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGL 240
           HYDYF+DE+N K GGQR+AT+LMYLSDV++GGETVFPAA+GN S+VPWWNELS+CGK GL
Sbjct: 181 HYDYFLDEFNTKNGGQRIATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGL 240

Query: 241 SVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHVDKY 284
           SV PK  DALLFW+M+PDA+LDP+SLHG CPV++GNKWS TKW HV ++
Sbjct: 241 SVLPKKRDALLFWNMRPDASLDPSSLHGGCPVVKGNKWSSTKWFHVHEF 289

BLAST of CSPI02G02650 vs. TAIR10
Match: AT5G18900.1 (AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 250.8 bits (639), Expect = 1.0e-66
Identity = 118/212 (55.66%), Positives = 160/212 (75.47%), Query Frame = 1

Query: 74  VEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLN 133
           V+ +S +PRAFVY  FL++ EC +++SLAK  +++S V D+ +GES  S VRTSSG F++
Sbjct: 37  VKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSEVRTSSGTFIS 96

Query: 134 RGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMA 193
           +G+D I+  IE +I+ +TF+P E+GE +Q+L YE GQKYDAH+DYF D+ NI +GG RMA
Sbjct: 97  KGKDPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIVRGGHRMA 156

Query: 194 TLLMYLSDVEEGGETVFPAAKGNFSSVPWWN--ELSECGKGGLSVKPKMGDALLFWSMKP 253
           T+LMYLS+V +GGETVFP A+     V   N  +LS+C K G++VKP+ GDALLF+++ P
Sbjct: 157 TILMYLSNVTKGGETVFPDAEIPSRRVLSENKEDLSDCAKRGIAVKPRKGDALLFFNLHP 216

Query: 254 DATLDPTSLHGACPVIRGNKWSCTKWMHVDKY 284
           DA  DP SLHG CPVI G KWS TKW+HVD +
Sbjct: 217 DAIPDPLSLHGGCPVIEGEKWSATKWIHVDSF 248

BLAST of CSPI02G02650 vs. NCBI nr
Match: gi|778666404|ref|XP_011648735.1| (PREDICTED: probable prolyl 4-hydroxylase 3 [Cucumis sativus])

HSP 1 Score: 590.9 bits (1522), Expect = 1.2e-165
Identity = 282/284 (99.30%), Positives = 282/284 (99.30%), Query Frame = 1

Query: 1   MAISKGKYIKLQGRKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHRFPSVRHTA 60
           MAISKGKYIKLQGRKWSTFQLSKMIMALVLALGFFMLIALRF SPPETSHHRF SVRHTA
Sbjct: 1   MAISKGKYIKLQGRKWSTFQLSKMIMALVLALGFFMLIALRFLSPPETSHHRFSSVRHTA 60

Query: 61  FLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESV 120
           FLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESV
Sbjct: 61  FLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESV 120

Query: 121 DSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFV 180
           DSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFV
Sbjct: 121 DSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFV 180

Query: 181 DEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKM 240
           DEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKM
Sbjct: 181 DEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKM 240

Query: 241 GDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHVDKYI 285
           GDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHVDKYI
Sbjct: 241 GDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHVDKYI 284

BLAST of CSPI02G02650 vs. NCBI nr
Match: gi|659070731|ref|XP_008456388.1| (PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform X2 [Cucumis melo])

HSP 1 Score: 581.3 bits (1497), Expect = 9.5e-163
Identity = 276/284 (97.18%), Positives = 280/284 (98.59%), Query Frame = 1

Query: 1   MAISKGKYIKLQGRKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHRFPSVRHTA 60
           MA+SKGKYIKLQG+KWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHR PSVR TA
Sbjct: 1   MAVSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHRLPSVRRTA 60

Query: 61  FLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESV 120
           F SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTG+SV
Sbjct: 61  FQSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGKSV 120

Query: 121 DSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFV 180
           DSRVRTSSGMFLNRGQDKII NIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFV
Sbjct: 121 DSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFV 180

Query: 181 DEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKM 240
           DEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKM
Sbjct: 181 DEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKM 240

Query: 241 GDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHVDKYI 285
           GDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHV+KYI
Sbjct: 241 GDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHVNKYI 284

BLAST of CSPI02G02650 vs. NCBI nr
Match: gi|659070723|ref|XP_008456352.1| (PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis melo])

HSP 1 Score: 552.0 bits (1421), Expect = 6.2e-154
Identity = 261/284 (91.90%), Positives = 271/284 (95.42%), Query Frame = 1

Query: 1   MAISKGKYIKLQGRKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHRFPSVRHTA 60
           MA+S GKYIKLQG+KWSTFQLSKMIMALVLALGFFML AL FFSPPETSHHR  SVRHTA
Sbjct: 1   MAVSIGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLSALWFFSPPETSHHRLSSVRHTA 60

Query: 61  FLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESV 120
           FLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDS+TG+SV
Sbjct: 61  FLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSETGKSV 120

Query: 121 DSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFV 180
           DS VRTSSGMFLNRGQDKII NIEKRIADFTFIPIEHGE +QILHY VGQKYDAHYD+FV
Sbjct: 121 DSSVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEDIQILHYAVGQKYDAHYDFFV 180

Query: 181 DEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKM 240
           DEYN+K  GQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGK GLS+KPKM
Sbjct: 181 DEYNLKSVGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKSGLSIKPKM 240

Query: 241 GDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHVDKYI 285
           GDALLFWSMKPD TLDPTSLHGACPVIRGNKWSCTKW+HV+KYI
Sbjct: 241 GDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWIHVNKYI 284

BLAST of CSPI02G02650 vs. NCBI nr
Match: gi|700205656|gb|KGN60775.1| (hypothetical protein Csa_2G009620 [Cucumis sativus])

HSP 1 Score: 542.7 bits (1397), Expect = 3.8e-151
Identity = 257/259 (99.23%), Positives = 257/259 (99.23%), Query Frame = 1

Query: 26  MALVLALGFFMLIALRFFSPPETSHHRFPSVRHTAFLSDGLGKRGDQWVEFISWEPRAFV 85
           MALVLALGFFMLIALRF SPPETSHHRF SVRHTAFLSDGLGKRGDQWVEFISWEPRAFV
Sbjct: 1   MALVLALGFFMLIALRFLSPPETSHHRFSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFV 60

Query: 86  YHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEK 145
           YHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEK
Sbjct: 61  YHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEK 120

Query: 146 RIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDVEEG 205
           RIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDVEEG
Sbjct: 121 RIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDVEEG 180

Query: 206 GETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACP 265
           GETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACP
Sbjct: 181 GETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACP 240

Query: 266 VIRGNKWSCTKWMHVDKYI 285
           VIRGNKWSCTKWMHVDKYI
Sbjct: 241 VIRGNKWSCTKWMHVDKYI 259

BLAST of CSPI02G02650 vs. NCBI nr
Match: gi|659070729|ref|XP_008456383.1| (PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform X1 [Cucumis melo])

HSP 1 Score: 531.9 bits (1369), Expect = 6.6e-148
Identity = 258/284 (90.85%), Positives = 264/284 (92.96%), Query Frame = 1

Query: 1   MAISKGKYIKLQGRKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHRFPSVRHTA 60
           MA+SKGKYIKLQG+KWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHR PSVR TA
Sbjct: 1   MAVSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSHHRLPSVRRTA 60

Query: 61  FLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESV 120
           F SDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTG+SV
Sbjct: 61  FQSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGKSV 120

Query: 121 DSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFV 180
           DSRVRTSSGMFLNRGQDKII NIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFV
Sbjct: 121 DSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFV 180

Query: 181 DEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKM 240
           DEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKM
Sbjct: 181 DEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKM 240

Query: 241 GDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHVDKYI 285
           GDALLFWSMKPDATLDPTSLHG         W  + W   D ++
Sbjct: 241 GDALLFWSMKPDATLDPTSLHGCNE--NTLCWGRSNWKTTDDFL 282

The following BLAST results are available for this feature:

Swiss-Prot
Match Name    E-value   Identity (%)  Description
P4H3_ARATH    2.1e-117  71.33         Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana GN=P4H3 PE=2 SV=1
P4H10_ARATH   3.0e-108  69.26         Probable prolyl 4-hydroxylase 10 OS=Arabidopsis thaliana GN=P4H10 PE=2 SV=1
P4H8_ARATH    7.9e-101  61.59         Probable prolyl 4-hydroxylase 8 OS=Arabidopsis thaliana GN=P4H8 PE=3 SV=1
P4H5_ARATH    2.6e-99   59.17         Prolyl 4-hydroxylase 5 OS=Arabidopsis thaliana GN=P4H5 PE=2 SV=1
P4H7_ARATH    3.4e-67   56.04         Probable prolyl 4-hydroxylase 7 OS=Arabidopsis thaliana GN=P4H7 PE=2 SV=1

TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0LFF5_CUCSA  2.6e-151  99.23         Uncharacterized protein OS=Cucumis sativus GN=Csa_2G009620 PE=4 SV=1
A0A061FEJ2_THECC  1.7e-118  71.38         2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein isoform ...
E0CQW5_VITVI      2.2e-118  71.78         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0001g00950 PE=4 SV=...
A0A0D2S8Z3_GOSRA  3.2e-117  71.13         Uncharacterized protein OS=Gossypium raimondii GN=B456_009G269000 PE=4 SV=1
A0A0B0NIF8_GOSAR  5.5e-117  70.45         Prolyl 4-hydroxylase subunit alpha-2 OS=Gossypium arboreum GN=F383_02226 PE=4 SV...

TAIR10
Match Name   E-value   Identity (%)  Description
AT1G20270.1  1.2e-118  71.33         2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot...
AT5G66060.1  1.7e-109  69.26         2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot...
AT4G35810.1  4.5e-102  61.59         2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot...
AT2G17720.1  1.4e-100  59.17         2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot...
AT5G18900.1  1.0e-66   55.66         2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot...

NCBI nr
Match Name                        E-value   Identity (%)  Description
gi|778666404|ref|XP_011648735.1|  1.2e-165  99.30         PREDICTED: probable prolyl 4-hydroxylase 3 [Cucumis sativus]
gi|659070731|ref|XP_008456388.1|  9.5e-163  97.18         PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform X2 [Cucumis melo]
gi|659070723|ref|XP_008456352.1|  6.2e-154  91.90         PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis melo]
gi|700205656|gb|KGN60775.1|       3.8e-151  99.23         hypothetical protein Csa_2G009620 [Cucumis sativus]
gi|659070729|ref|XP_008456383.1|  6.6e-148  90.85         PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform X1 [Cucumis melo]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR005123  Oxoglu/Fe-dep_dioxygenase
IPR006620  Pro_4_hyd_alph
Vocabulary: Molecular Function
Term        Definition
GO:0016491  oxidoreductase activity
GO:0016705  oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:0005506  iron ion binding
GO:0031418  L-ascorbic acid binding
Vocabulary: Biological Process
Term        Definition
GO:0055114  oxidation-reduction process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0055114 oxidation-reduction process
cellular_component GO:0005768 endosome
cellular_component GO:0005802 trans-Golgi network
cellular_component GO:0005794 Golgi apparatus
molecular_function GO:0005506 iron ion binding
molecular_function GO:0031418 L-ascorbic acid binding
molecular_function GO:0016706 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors
molecular_function GO:0016491 oxidoreductase activity
molecular_function GO:0016705 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI02G02650.1  CSPI02G02650.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
IPR005123  Oxoglutarate/iron-dependent dioxygenase  PFAM  PF13640  2OG-FeII_Oxy_3  coord: 161..279; score: 1.2
IPR005123  Oxoglutarate/iron-dependent dioxygenase  PROFILE  PS51471  FE2OG_OXY  coord: 157..280; score: 12
IPR006620  Prolyl 4-hydroxylase, alpha subunit  SMART  SM00702  p4hc  coord: 81..279; score: 5.2
None  No IPR available  PANTHER  PTHR10869  PROLYL 4-HYDROXYLASE ALPHA SUBUNIT  coord: 12..283; score: 6.9E
None  No IPR available  PANTHER  PTHR10869:SF78  2-OXOGLUTARATE (2OG) AND FE(II)-DEPENDENT OXYGENASE SUPERFAMILY PROTEIN  coord: 12..283; score: 6.9E

The following gene(s) are orthologous to this gene:
Gene          Orthologue      Organism                 Block
CSPI02G02650  CmaCh12G011450  Cucurbita maxima (Rimu)  cmacpiB174
The following gene(s) are paralogous to this gene:

None