Csa6G494930 (gene) Cucumber (Chinese Long) v2

NameCsa6G494930
Typegene
OrganismCucumis. sativus (Cucumber (Chinese Long) v2)
DescriptionProlyl 4-hydroxylase alpha subunit, putative; contains IPR005123 (Oxoglutarate/iron-dependent dioxygenase)
LocationChr6 : 23849257 .. 23853855 (+)
   



The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TCCAAGTTAACAAAACGCTTCGGAACTCGGTAAATAGGAGGAGCAAATAAGGAAGAGATCGGTGATTGCATCAAAGGAAAGAAGCATGAAAGGCAAAAGCGGAAGAAGTAATTGGAGCTTGAGATCGAAGCTAGGTTTGCCCGCACTAATCTTCGTTTTATGCCTTTTTTGTTTCCTCGCCGGATTCTTCGGTTCTACTCTTCTCTCTCAGGTTGATCCGTGTGCTTCAAGTTTCCAATCACCGTTAGATTTCGTTGCTTCAATGCTTATACTGTAATTTTCGGATTATCTCATTGTTATCGGACGATTGAACAGGATGTAGATGACGATAGGCCGAGGGCGAGGTTGCTTCAATCGGCAAGCGATGTTACGGAGTTTGATTTGATGTCTTCGGGAGAAAACGGCGACGATTCCATTTCTTCGATTCCTTTTCAGGTGCTCTCCGTTTCGTTTTAATTATTAATTACATCTGGTATGTTAGTTTCACCGCATTGTCAAACAGACTAAGTTGTTTTTTAAGGATTGCATGCATTTGTGCGAACATCATTTATCGACGGATGAACCTTGCAACTCATCATTCGATCTTCAGTGATCATGATTTGTTGTTTAGATTATTGCACGTCTAATTTAAAGTTTCCTTTAATGGAGGTTTTGAGCTGGCGACCTCGCGCACTTTATTTCCCCAAGTTTGCAACTGCGGAGCAATGTCAGAGTATAGTTAATTTGGCGAAACCGAAGCTTAGGCCGTCTACATTGGCTCTACGTAAGGGAGAAACCGCAGAGAGCACGAAAGGAGTCCGAACAAGGTATCTTCTGTCTGTCTGTCTGTATTGGCACTTATGATTTTATCAATTATCATCCATTATTCTTTGCACGGCCTCAACATAAGTTTCTTCCTGTACTTCCATTTATAACCAAACTCTGTCCATATATGCGATCTGTTATGGTATGAGAAATTATCCATCGAGATTTCTGAAGTTCAGCTTAGTTATAGATAGTTTCTTTTTAATTCGACTCTTTGGTCACTCTCACCTATGATGTTTATTGGAATTCCGTTTGAACTCCTAGGCATTGGTTTTGGGAACTTCCATCCGCTTTTGTAATTTTACATTATCCATGCAGTTTATTCATATCTAGAATATTTATGTTTCCATTGTAGTTTCTTTATTCTGTTTCGTAATTCGACCAAAATTCCAGGTTAGCCATGGCTATTGGCAAAACTGGAAAATTTTTATTACTTCGTTTTTGGTTTTCGGCTTCATGTATAGAAGGCTCTAGTGGTTATAATGCACGATCTTCTAAATTCTTGGTTATACTCGGGTTATAGGGATATAGAATTTTGTAGAGATGCTTACGTTTGCAATTTGATTCTTTTCTCTTCTCCGCCCCAAGCAGTTCTGGAGTGTTCTTTAGTGCTTCAGAAGATGAAAGTGGGACTCTGGGTGTAATTGAGGAAAAAATTGCAAGGGCAACTATGATTCCAAGGACGCATGGAGAGGTATTTTTACCACTTTGATCATCAATATTTCCACCCTGTTTTCTTAAGGTAGAAATCTGTTGGTACATCATTCATAAGAGTCTAAAGTTTCAAAAGAACTTGGTTCTAGAATCCTGAGTATGTTTGATGTAAATTTTGATAGGGAGTGTCATGGATTTGAACTCTTTCACTACCACGGAGAAGATGAACCCATTAGCAAGTCATTTCTAACTTGCTTATTAATTTATTGCATCCATCTCTATAGTAGCTGGTCATTGGCCTTCTTCAAATTTAATGCTTAACATGGGATCTTGACCATTAATTTCTTATAGTTGATGGGATTTATATAAATCCACTATGTAGTCAGATTCTGTTTGTTATTTTAAGCTGAGATTAAAGTTTTCTCAAACTTCTCATTTCTTGTTCAAACTTCTGACAGTTTGACAACTTAAAAGAGGATCAGTGTTACCTCTTCACATTTCTTGACTAAATACCAAGGTACACAATAGTAAGTAGTAACAGGTGACATAAAACCTGAGATAATTAGTCATTGTGTTCTATCCATTTGATGGACCTGATACTTTATATATTTCTTTTAATTGTTTGTAGAAATCTGTTATAGAATAGTATGCTTAATTTCATAGTTAATTTATTTACAGGTTACATTTCCACTTGATGTTTTATGTTTTAAGGATGTTTTCTTGCATCTGTAAAAAGTATAGAACTTCTTGTTTCCAAATGTTTTAAAATTGATTGAGGATTGTGTGTTTCAGGCATATAATATTTTGCGTTATGAGATTGGGCAGAAGTATAATTCTCATTATGATGCGTTCAAGCCTTCTGAATATGGGCCACAGAAGAGCCAGAGGGTACAACGTAGATATATTAGCTTTTCCTTTGCTTATTGCTCGTAAACATTCAACATTCAATTCTTTTCATTCATTTATAAGAGCTCTGTTACAGAAATATTTTGATCCCCTCCCAAAAAAGGGGGGAGATAGAATTTTGCACTACTTTCCCTGACATTTTAGCATTAAAAAATTCAGGTGGCTTCTTTCTTGTTGTACTTGACTGATGTTGAAGAAGGTGGAGAAACCATGTTTCCATTTGAGGTAAGAGTAGACCACCATAGGACCGTATGAACGAGGTGGCCTTTCTCTTGACATTTTTTACAACCAAGAACATTAGTCAAATGGAGTATTTACTGTGTTTGATTGGCAGAATGGCTTGAACATGGATGGAACCTATAATTTCCAAACATGTATTGGTTTGAAAGTGAAGCCACGTCAAGGTGATGGACTTCTGTTTTATTCGGTTTTCCCAAATGGTACAATTGATCCGGTTAGTGTTCCTATTCCTATCCTCTTTTGCTTGATATAGTAGTTTCAATTTGAATGCTGTCTGGATTAATGATGTGTAATGTGTAATGTGTATTGTCGAACCATGAGCTAACAAGGAGCATCTATGCTTTGTTTGAGCATGAAGCTTCTCGATTTATGATAGACTCTCTATTAAACATGTTTATAAGAGGAGGCAGTGAGGCACTGAAAGAGGTAGTGAGGAATAACAGGCTATGAGAAGCTAGTTAGAACTAGAAAGGTGAGCCCAAGTGAAGGTAATAAGAGGTGGGGTGTGAATAGAAGGAAATATATTGTCAGAGAACGAGTTGGAGATTGGTTACTAAGTCCATAAGTTTCGTACTTGAGAAAGTGGAAGCTTTCTAAATCCATCGACCCATCACCTTTCTTGATTAATAAATCTCACTCATCAATTTACGGGTAGGAGCCTAAATCTGCCATGAAAACCAGAATTAGCACATGTTTGTAACAAAGAACAGTAGACTATATCAAAACTACGAACTAATATCATATGCTAGTACAGTAGAAGTTGAAATCAAGGATTACCAAGATCCCAGTCATTAGAAGGAAAAAAACCAAAGTTGTATGACACTAAAGAGCCTTTGGAGAGGACCATCTTCTGTCCCAATTTTCACGATGGATCTGTTCCCAAATAAAGATGTTATAAGGTGATGGTCGTTAGGAAGGATAGGCGTTAGGCACATATTGTGATCAGCATGTAGAGGTGTTGCCGTGTTGGACACAGACATGCTTCAGACACGTATTTTAAGGGTCAAATTTACTTTTAATTTGTTGAGTTTTTATGTTTTGGTGCTGCTGCGGCGTGTCACTCATTCAATGTTGAGACTCATCTTGTTTAGAAAATACAGAAATTAAAATAAAAAGATATATATTTTTTAACTACAACATTTGAGTTCCTAGTAATTAATTAATATATGAAAACTTTCTGAAACTAAACATTTTTTTTAAAGAAAACTAATGAACTTACATCTAACCAAACCCAGCTACATGGTGTTGGAAAATGAGAGTTTATTTGGTTGGGGACTTGTTCTATCTGATAATCTGACCTCGCAACGTTGGGCTCAGGAGACTATTTATTAATATTGTAAGCGGTAGTGTTGAGTTCGAACAAATAAAGAGACTCCATTAAGATTAAGAGATGCATAGGTTATGTTAGTATCTTATGCCGAATCTCATGTTTATTATGCAAAGTCGGCCTTACAGCATGCATGTATTTAAGGTATAGCAATTATTTTGAAAAACATATTTTGTGGACCATTCATTGTTTATGTTATATGCATGTTTGTGCTTTCGTGTTTGCTTGTGTCATTTAGAAGAATTATGATGTAAAGTTTTATATATCAAGAATGGTTTACGAATTAAATGGGTACTGTTGTGTACAGCATTTTGGTTGCTGAGAATCTCATTTATTGCATAATTTTGTAGACATCACTTCATGGAAGCTGCCCTGTGATCAAAGGGCAGAAATGGGTGGCGACCAAGTGGATCAGAGATCAAATGCAGGAGGACTTTTTATACTAAATGCAATTCATAAGCCGTAAATTTCTTCCTCATAATTTTCTCTGTATGATAGAATGAGTTCAAATATTATACAAATTTTGTTTCCCACGCTCTCCTCCTTACCTTACCGATTACATATAGGGTAGTTATTAAGGTTAGAATGTTTTTCCAATTCAGTTTATTGTGTAAAATATTCAGATTATTTTCAAATTAATCAATCAAAATCAAAATATTAGTCTATAA

mRNA sequence

ATGAAAGGCAAAAGCGGAAGAAGTAATTGGAGCTTGAGATCGAAGCTAGGTTTGCCCGCACTAATCTTCGTTTTATGCCTTTTTTGTTTCCTCGCCGGATTCTTCGGTTCTACTCTTCTCTCTCAGGATGTAGATGACGATAGGCCGAGGGCGAGGTTGCTTCAATCGGCAAGCGATGTTACGGAGTTTGATTTGATGTCTTCGGGAGAAAACGGCGACGATTCCATTTCTTCGATTCCTTTTCAGGTTTTGAGCTGGCGACCTCGCGCACTTTATTTCCCCAAGTTTGCAACTGCGGAGCAATGTCAGAGTATAGTTAATTTGGCGAAACCGAAGCTTAGGCCGTCTACATTGGCTCTACGTAAGGGAGAAACCGCAGAGAGCACGAAAGGAGTCCGAACAAGTTCTGGAGTGTTCTTTAGTGCTTCAGAAGATGAAAGTGGGACTCTGGGTGTAATTGAGGAAAAAATTGCAAGGGCAACTATGATTCCAAGGACGCATGGAGAGGCATATAATATTTTGCGTTATGAGATTGGGCAGAAGTATAATTCTCATTATGATGCGTTCAAGCCTTCTGAATATGGGCCACAGAAGAGCCAGAGGGTGGCTTCTTTCTTGTTGTACTTGACTGATGTTGAAGAAGGTGGAGAAACCATGTTTCCATTTGAGAATGGCTTGAACATGGATGGAACCTATAATTTCCAAACATGTATTGGTTTGAAAGTGAAGCCACGTCAAGGTGATGGACTTCTGTTTTATTCGGTTTTCCCAAATGGTACAATTGATCCGACATCACTTCATGGAAGCTGCCCTGTGATCAAAGGGCAGAAATGGGTGGCGACCAAGTGGATCAGAGATCAAATGCAGGAGGACTTTTTATACTAA

Coding sequence (CDS)

ATGAAAGGCAAAAGCGGAAGAAGTAATTGGAGCTTGAGATCGAAGCTAGGTTTGCCCGCACTAATCTTCGTTTTATGCCTTTTTTGTTTCCTCGCCGGATTCTTCGGTTCTACTCTTCTCTCTCAGGATGTAGATGACGATAGGCCGAGGGCGAGGTTGCTTCAATCGGCAAGCGATGTTACGGAGTTTGATTTGATGTCTTCGGGAGAAAACGGCGACGATTCCATTTCTTCGATTCCTTTTCAGGTTTTGAGCTGGCGACCTCGCGCACTTTATTTCCCCAAGTTTGCAACTGCGGAGCAATGTCAGAGTATAGTTAATTTGGCGAAACCGAAGCTTAGGCCGTCTACATTGGCTCTACGTAAGGGAGAAACCGCAGAGAGCACGAAAGGAGTCCGAACAAGTTCTGGAGTGTTCTTTAGTGCTTCAGAAGATGAAAGTGGGACTCTGGGTGTAATTGAGGAAAAAATTGCAAGGGCAACTATGATTCCAAGGACGCATGGAGAGGCATATAATATTTTGCGTTATGAGATTGGGCAGAAGTATAATTCTCATTATGATGCGTTCAAGCCTTCTGAATATGGGCCACAGAAGAGCCAGAGGGTGGCTTCTTTCTTGTTGTACTTGACTGATGTTGAAGAAGGTGGAGAAACCATGTTTCCATTTGAGAATGGCTTGAACATGGATGGAACCTATAATTTCCAAACATGTATTGGTTTGAAAGTGAAGCCACGTCAAGGTGATGGACTTCTGTTTTATTCGGTTTTCCCAAATGGTACAATTGATCCGACATCACTTCATGGAAGCTGCCCTGTGATCAAAGGGCAGAAATGGGTGGCGACCAAGTGGATCAGAGATCAAATGCAGGAGGACTTTTTATACTAA

Protein sequence

MKGKSGRSNWSLRSKLGLPALIFVLCLFCFLAGFFGSTLLSQDVDDDRPRARLLQSASDVTEFDLMSSGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLALRKGETAESTKGVRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQKYNSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYNFQTCIGLKVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIRDQMQEDFLY*
BLAST of Csa6G494930 vs. Swiss-Prot
Match: P4H9_ARATH (Probable prolyl 4-hydroxylase 9 OS=Arabidopsis thaliana GN=P4H9 PE=2 SV=1)

HSP 1 Score: 405.2 bits (1040), Expect = 6.1e-112
Identity = 198/280 (70.71%), Postives = 226/280 (80.71%), Query Frame = 1

Query: 13  RSKLGLPALIFVLCLFCFLAGFFGSTLLSQDVDDDRPRARLLQSASD-VTEFDLMSSGEN 72
           R KLGL A + V C  CFL GF+GSTLLSQ+V   +PR R+L    +   E   M  G  
Sbjct: 10  RKKLGL-ATVIVFCSLCFLFGFYGSTLLSQNVPRVKPRLRMLDMVENGEEEASSMPHGVT 69

Query: 73  GDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLALRKGETAESTKG 132
           G++SI SIPFQVLSWRPRA+YFP FATAEQCQ+I+  AK  L+PS LALRKGETAE+TKG
Sbjct: 70  GEESIGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKG 129

Query: 133 VRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQKYNSHYDAFKP 192
            RTSSG F SASE+ +G L  +E KIARATMIPR+HGE++NILRYE+GQKY+SHYD F P
Sbjct: 130 TRTSSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNP 189

Query: 193 SEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYNFQTCIGLKVKPRQGDGLL 252
           +EYGPQ SQR+ASFLLYL+DVEEGGETMFPFENG NM   Y+++ CIGLKVKPR+GDGLL
Sbjct: 190 TEYGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGIGYDYKQCIGLKVKPRKGDGLL 249

Query: 253 FYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIRDQMQED 292
           FYSVFPNGTID TSLHGSCPV KG+KWVATKWIRDQ QE+
Sbjct: 250 FYSVFPNGTIDQTSLHGSCPVTKGEKWVATKWIRDQDQEE 288

BLAST of Csa6G494930 vs. Swiss-Prot
Match: P4H13_ARATH (Prolyl 4-hydroxylase 13 OS=Arabidopsis thaliana GN=P4H13 PE=2 SV=1)

HSP 1 Score: 316.6 bits (810), Expect = 2.8e-85
Identity = 158/280 (56.43%), Postives = 201/280 (71.79%), Query Frame = 1

Query: 9   NWSLRSKLGLPALIFVLCLFCFLAGFFGSTLLSQDVD-DDRPRARLLQSASDVTEFDLMS 68
           ++    KL  P +    C F  + GF    L SQ +   + P  R  +S +D T+     
Sbjct: 3   SYGKEKKLVFPYVFIACCFFLAIFGFCFFNLFSQGISFSEIPTTR--RSVNDETD----- 62

Query: 69  SGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLALRKGETAE 128
           S ++G  S+S+IPF  LSW PR  Y P FAT +QC++++++AKPKL+PSTLALRKGETAE
Sbjct: 63  SLDHGS-SVSNIPFHGLSWNPRVFYLPNFATKQQCEAVIDMAKPKLKPSTLALRKGETAE 122

Query: 129 STKGVRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQKYNSHYD 188
           +T+  R+   +     EDESG L  IEEKIA AT  P+ + E++NILRY++GQKY+SHYD
Sbjct: 123 TTQNYRS---LHQHTDEDESGVLAAIEEKIALATRFPKDYYESFNILRYQLGQKYDSHYD 182

Query: 189 AFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYNFQTCIGLKVKPRQG 248
           AF  +EYGP  SQRV +FLL+L+ VEEGGETMFPFENG NM+G Y+++ C+GLKVKPRQG
Sbjct: 183 AFHSAEYGPLISQRVVTFLLFLSSVEEGGETMFPFENGRNMNGRYDYEKCVGLKVKPRQG 242

Query: 249 DGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIRDQ 288
           D + FY++FPNGTID TSLHGSCPVIKG+KWVATKWIRDQ
Sbjct: 243 DAIFFYNLFPNGTIDQTSLHGSCPVIKGEKWVATKWIRDQ 271

BLAST of Csa6G494930 vs. Swiss-Prot
Match: P4H10_ARATH (Probable prolyl 4-hydroxylase 10 OS=Arabidopsis thaliana GN=P4H10 PE=2 SV=1)

HSP 1 Score: 186.4 bits (472), Expect = 4.4e-46
Identity = 107/237 (45.15%), Postives = 140/237 (59.07%), Query Frame = 1

Query: 57  ASDVTEF---DLMSSGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKL 116
           A+D+T      L  SGE  DDS +    +++SW PRA  +  F T E+C+ ++ LAKP +
Sbjct: 53  ANDLTSIVRKTLQRSGE--DDSKNERWVEIISWEPRASVYHNFLTKEECKYLIELAKPHM 112

Query: 117 RPSTLALRKGETAESTKG-VRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYN 176
             ST+   K  T +ST   VRTSSG F +   D+  T+  IE++I+  T IP  HGE   
Sbjct: 113 EKSTVVDEK--TGKSTDSRVRTSSGTFLARGRDK--TIREIEKRISDFTFIPVEHGEGLQ 172

Query: 177 ILRYEIGQKYNSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENG--LNMDG 236
           +L YEIGQKY  HYD F          QR+A+ L+YL+DVEEGGET+FP   G    +  
Sbjct: 173 VLHYEIGQKYEPHYDYFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPW 232

Query: 237 TYNFQTC--IGLKVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIR 286
                 C   GL VKP+ GD LLF+S+ P+ T+DP+SLHG C VIKG KW +TKW+R
Sbjct: 233 WNELSECGKGGLSVKPKMGDALLFWSMTPDATLDPSSLHGGCAVIKGNKWSSTKWLR 283

BLAST of Csa6G494930 vs. Swiss-Prot
Match: P4H3_ARATH (Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana GN=P4H3 PE=2 SV=1)

HSP 1 Score: 183.3 bits (464), Expect = 3.7e-45
Identity = 112/272 (41.18%), Postives = 153/272 (56.25%), Query Frame = 1

Query: 21  LIFVLCLFCFLAGFFGSTLLSQDVDDDRP--RARLLQSASDVTEFDLMSSGENGDDSISS 80
           ++F+L +   +   FG   L  + D+  P   +   ++A++ +E      G+ GD     
Sbjct: 23  MLFMLTIVLLMLLAFGVFSLPINNDESSPIDLSYFRRAATERSE----GLGKRGDQWT-- 82

Query: 81  IPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLALRKGETAESTKG-VRTSSG 140
              +VLSW PRA  +  F + E+C+ +++LAKP +  ST+     ET +S    VRTSSG
Sbjct: 83  ---EVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVV--DSETGKSKDSRVRTSSG 142

Query: 141 VFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQKYNSHYDAFKPSEYGPQ 200
            F     D+   +  IE++IA  T IP  HGE   +L YE GQKY  HYD F        
Sbjct: 143 TFLRRGRDK--IIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKN 202

Query: 201 KSQRVASFLLYLTDVEEGGETMFPFENGLNMDGT--YN-FQTC--IGLKVKPRQGDGLLF 260
             QR+A+ L+YL+DVEEGGET+FP  N +N      YN    C   GL VKPR GD LLF
Sbjct: 203 GGQRMATMLMYLSDVEEGGETVFPAAN-MNFSSVPWYNELSECGKKGLSVKPRMGDALLF 262

Query: 261 YSVFPNGTIDPTSLHGSCPVIKGQKWVATKWI 285
           +S+ P+ T+DPTSLHG CPVI+G KW +TKW+
Sbjct: 263 WSMRPDATLDPTSLHGGCPVIRGNKWSSTKWM 280

BLAST of Csa6G494930 vs. Swiss-Prot
Match: P4H5_ARATH (Prolyl 4-hydroxylase 5 OS=Arabidopsis thaliana GN=P4H5 PE=2 SV=1)

HSP 1 Score: 171.0 bits (432), Expect = 1.9e-41
Identity = 100/277 (36.10%), Postives = 153/277 (55.23%), Query Frame = 1

Query: 13  RSKLGLPALIFVLCLFCFLAGFFGSTLLSQDVDDDRPRARLLQSASDVTEFDLMSSGENG 72
           RS      LI +L +   L G    +L + + +  +         +D+T     S   +G
Sbjct: 19  RSTQAFTVLILLLVVILILLGLGILSLPNANRNSSK--------TNDLTNIVRKSETSSG 78

Query: 73  DDSISSIPF-QVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLALRKGETAESTKG 132
           D+  +   + +V+SW PRA+ +  F T E+C+ +++LAKP +  ST+   K   ++ ++ 
Sbjct: 79  DEEGNGERWVEVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSR- 138

Query: 133 VRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQKYNSHYDAFKP 192
           VRTSSG F     DE   + VIE++I+  T IP  +GE   +L Y++GQKY  HYD F  
Sbjct: 139 VRTSSGTFLRRGHDE--VVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLD 198

Query: 193 SEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGT---YNFQTC--IGLKVKPRQ 252
                   QR+A+ L+YL+DV++GGET+FP   G N+           C   GL V P++
Sbjct: 199 EFNTKNGGQRIATVLMYLSDVDDGGETVFPAARG-NISAVPWWNELSKCGKEGLSVLPKK 258

Query: 253 GDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKW 284
            D LLF+++ P+ ++DP+SLHG CPV+KG KW +TKW
Sbjct: 259 RDALLFWNMRPDASLDPSSLHGGCPVVKGNKWSSTKW 283

BLAST of Csa6G494930 vs. TrEMBL
Match: A0A0A0KGG4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G494930 PE=4 SV=1)

HSP 1 Score: 597.8 bits (1540), Expect = 7.1e-168
Identity = 294/294 (100.00%), Postives = 294/294 (100.00%), Query Frame = 1

Query: 1   MKGKSGRSNWSLRSKLGLPALIFVLCLFCFLAGFFGSTLLSQDVDDDRPRARLLQSASDV 60
           MKGKSGRSNWSLRSKLGLPALIFVLCLFCFLAGFFGSTLLSQDVDDDRPRARLLQSASDV
Sbjct: 1   MKGKSGRSNWSLRSKLGLPALIFVLCLFCFLAGFFGSTLLSQDVDDDRPRARLLQSASDV 60

Query: 61  TEFDLMSSGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLAL 120
           TEFDLMSSGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLAL
Sbjct: 61  TEFDLMSSGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLAL 120

Query: 121 RKGETAESTKGVRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQ 180
           RKGETAESTKGVRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQ
Sbjct: 121 RKGETAESTKGVRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQ 180

Query: 181 KYNSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYNFQTCIGL 240
           KYNSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYNFQTCIGL
Sbjct: 181 KYNSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYNFQTCIGL 240

Query: 241 KVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIRDQMQEDFLY 295
           KVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIRDQMQEDFLY
Sbjct: 241 KVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIRDQMQEDFLY 294

BLAST of Csa6G494930 vs. TrEMBL
Match: A0A059AL93_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_I00722 PE=4 SV=1)

HSP 1 Score: 449.9 bits (1156), Expect = 2.4e-123
Identity = 214/291 (73.54%), Postives = 250/291 (85.91%), Query Frame = 1

Query: 1   MKGKSGRSNWSLRSKLGLPALIFVLCLFCFLAGFFGSTLLSQDVDDDRPRARLLQSASDV 60
           ++GK  R +WS +SK+ LP ++   C F FLAGF+GS+LL+QDV     RAR L+S  D 
Sbjct: 13  VRGKPIRQSWSSKSKIELPVVVLA-CSFFFLAGFYGSSLLAQDVSGAGARARALESVGDE 72

Query: 61  TEFDLMSSGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLAL 120
            ++  +  GE GDDS  SIPFQVLSW PRALYFP FATAEQCQSI+ +AK  L+PSTLAL
Sbjct: 73  RDYVPLPRGETGDDSFVSIPFQVLSWGPRALYFPNFATAEQCQSIIKVAKTGLKPSTLAL 132

Query: 121 RKGETAESTKGVRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQ 180
           RKGETAE+TKG+RTSSG+F SASED++GTL +IEEKIAR TM+PR HGEA+N+LRYEIGQ
Sbjct: 133 RKGETAENTKGIRTSSGMFVSASEDKTGTLDIIEEKIARVTMLPREHGEAFNVLRYEIGQ 192

Query: 181 KYNSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYNFQTCIGL 240
           +YNSHYDAF P EYGPQKSQRVASFLLYL+DVEEGGET+FPFENG+NMDGTY+FQ C+GL
Sbjct: 193 RYNSHYDAFSPVEYGPQKSQRVASFLLYLSDVEEGGETVFPFENGINMDGTYDFQQCVGL 252

Query: 241 KVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIRDQMQED 292
           KVKPRQGDGLLFYS+ PNGTIDPTSLHGSCPVIKG+KWVATKWIRDQ+Q+D
Sbjct: 253 KVKPRQGDGLLFYSLLPNGTIDPTSLHGSCPVIKGEKWVATKWIRDQVQDD 302

BLAST of Csa6G494930 vs. TrEMBL
Match: V4SQB7_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10027070mg PE=4 SV=1)

HSP 1 Score: 447.2 bits (1149), Expect = 1.5e-122
Identity = 212/289 (73.36%), Postives = 252/289 (87.20%), Query Frame = 1

Query: 3   GKSGRSNWSLRSKLGLPALIFVLCLFCFLAGFFGSTLLSQDVDDDRPRARLLQSASDVTE 62
           GKS ++NWSL+SK+ LP  +F+ CLF FLAG  GS+LLSQDV   RP AR+++S  D  E
Sbjct: 4   GKSNKANWSLKSKIELP-FVFLACLFFFLAGLLGSSLLSQDVTVARPSARVVESVKD--E 63

Query: 63  FDLMSSGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLALRK 122
           ++ M  G+ GDDS+++IPFQVLSW PRALYFP FAT EQC+SI+N+AK  LRPSTLALRK
Sbjct: 64  YEWMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK 123

Query: 123 GETAESTKGVRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQKY 182
           GET ++T+G+RTSSGVF SA+EDESGTL +IEEKIA+ TM+PR +GEA+NILRY+IGQKY
Sbjct: 124 GETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 183

Query: 183 NSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYNFQTCIGLKV 242
           NSHYDAF P EYGPQKSQRVASFL+YLTD+EEGGETMFPFENG+N DG+Y++Q CIGLKV
Sbjct: 184 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKV 243

Query: 243 KPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIRDQMQED 292
           KPRQGDGLLFYS+ PNGTIDPTS+HGSCPV+KG+KWVATKWIRDQ Q D
Sbjct: 244 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289

BLAST of Csa6G494930 vs. TrEMBL
Match: B9T4J5_RICCO (Prolyl 4-hydroxylase alpha subunit, putative OS=Ricinus communis GN=RCOM_0396560 PE=4 SV=1)

HSP 1 Score: 446.4 bits (1147), Expect = 2.6e-122
Identity = 209/291 (71.82%), Postives = 252/291 (86.60%), Query Frame = 1

Query: 1   MKGKSGRSNWSLRSKLGLPALIFVLCLFCFLAGFFGSTLLSQDVDDDRPRARLLQSASDV 60
           MK K  +  WS++SKLGLP ++F+ CLF FLAG F S L+SQ+V+ D+ R +L     ++
Sbjct: 1   MKAKGSKGKWSIKSKLGLP-VVFLSCLFFFLAGLFASNLISQNVNGDKNRRQLQWVKEEI 60

Query: 61  TEFDLMSSGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLAL 120
            E+DL+ SG+ GDD ++ IPFQVLSW+PRALYFP FATAEQCQS++N+AKP L PSTLAL
Sbjct: 61  IEYDLLPSGDTGDDYLTVIPFQVLSWKPRALYFPNFATAEQCQSVINMAKPNLTPSTLAL 120

Query: 121 RKGETAESTKGVRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQ 180
           RKGET E+TKG+RTSSG+F SASED++G L  IEEKIARATM+PR +GEA+NILRYEIGQ
Sbjct: 121 RKGETEENTKGIRTSSGMFLSASEDKTGVLDAIEEKIARATMLPRANGEAFNILRYEIGQ 180

Query: 181 KYNSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYNFQTCIGL 240
           KYNSHYDAF P+EYGPQKSQRVASFLLYL+DVEEGGETMFPFEN L++D +Y+F+ CIGL
Sbjct: 181 KYNSHYDAFNPAEYGPQKSQRVASFLLYLSDVEEGGETMFPFENDLDVDESYDFEKCIGL 240

Query: 241 KVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIRDQMQED 292
           +V+PR+GDGLLFYS+FPN TIDPTSLHGSCPVIKG+KWVATKWIRDQ Q+D
Sbjct: 241 QVRPRRGDGLLFYSLFPNNTIDPTSLHGSCPVIKGEKWVATKWIRDQEQDD 290

BLAST of Csa6G494930 vs. TrEMBL
Match: A0A067GJZ0_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g022995mg PE=4 SV=1)

HSP 1 Score: 446.4 bits (1147), Expect = 2.6e-122
Identity = 212/289 (73.36%), Postives = 251/289 (86.85%), Query Frame = 1

Query: 3   GKSGRSNWSLRSKLGLPALIFVLCLFCFLAGFFGSTLLSQDVDDDRPRARLLQSASDVTE 62
           GKS ++NWSL+SK+ LP  +F+ CLF FLAG  GS+LLSQDV   RP AR+++S  D  E
Sbjct: 4   GKSNKANWSLKSKIELP-FVFLACLFFFLAGLLGSSLLSQDVTAARPSARVVESVKD--E 63

Query: 63  FDLMSSGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLALRK 122
           +  M  G+ GDDS+++IPFQVLSW PRALYFP FAT EQC+SI+N+AK  LRPSTLALRK
Sbjct: 64  YKWMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK 123

Query: 123 GETAESTKGVRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQKY 182
           GET ++T+G+RTSSGVF SA+EDESGTL +IEEKIA+ TM+PR +GEA+NILRY+IGQKY
Sbjct: 124 GETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 183

Query: 183 NSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYNFQTCIGLKV 242
           NSHYDAF P EYGPQKSQRVASFL+YLTD+EEGGETMFPFENG+N DG+Y++Q CIGLKV
Sbjct: 184 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKV 243

Query: 243 KPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIRDQMQED 292
           KPRQGDGLLFYS+ PNGTIDPTS+HGSCPV+KG+KWVATKWIRDQ Q D
Sbjct: 244 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289

BLAST of Csa6G494930 vs. TAIR10
Match: AT4G33910.1 (AT4G33910.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 405.2 bits (1040), Expect = 3.4e-113
Identity = 198/280 (70.71%), Postives = 226/280 (80.71%), Query Frame = 1

Query: 13  RSKLGLPALIFVLCLFCFLAGFFGSTLLSQDVDDDRPRARLLQSASD-VTEFDLMSSGEN 72
           R KLGL A + V C  CFL GF+GSTLLSQ+V   +PR R+L    +   E   M  G  
Sbjct: 10  RKKLGL-ATVIVFCSLCFLFGFYGSTLLSQNVPRVKPRLRMLDMVENGEEEASSMPHGVT 69

Query: 73  GDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLALRKGETAESTKG 132
           G++SI SIPFQVLSWRPRA+YFP FATAEQCQ+I+  AK  L+PS LALRKGETAE+TKG
Sbjct: 70  GEESIGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKG 129

Query: 133 VRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQKYNSHYDAFKP 192
            RTSSG F SASE+ +G L  +E KIARATMIPR+HGE++NILRYE+GQKY+SHYD F P
Sbjct: 130 TRTSSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNP 189

Query: 193 SEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYNFQTCIGLKVKPRQGDGLL 252
           +EYGPQ SQR+ASFLLYL+DVEEGGETMFPFENG NM   Y+++ CIGLKVKPR+GDGLL
Sbjct: 190 TEYGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGIGYDYKQCIGLKVKPRKGDGLL 249

Query: 253 FYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIRDQMQED 292
           FYSVFPNGTID TSLHGSCPV KG+KWVATKWIRDQ QE+
Sbjct: 250 FYSVFPNGTIDQTSLHGSCPVTKGEKWVATKWIRDQDQEE 288

BLAST of Csa6G494930 vs. TAIR10
Match: AT2G23096.1 (AT2G23096.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 316.6 bits (810), Expect = 1.6e-86
Identity = 158/280 (56.43%), Postives = 201/280 (71.79%), Query Frame = 1

Query: 9   NWSLRSKLGLPALIFVLCLFCFLAGFFGSTLLSQDVD-DDRPRARLLQSASDVTEFDLMS 68
           ++    KL  P +    C F  + GF    L SQ +   + P  R  +S +D T+     
Sbjct: 3   SYGKEKKLVFPYVFIACCFFLAIFGFCFFNLFSQGISFSEIPTTR--RSVNDETD----- 62

Query: 69  SGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLALRKGETAE 128
           S ++G  S+S+IPF  LSW PR  Y P FAT +QC++++++AKPKL+PSTLALRKGETAE
Sbjct: 63  SLDHGS-SVSNIPFHGLSWNPRVFYLPNFATKQQCEAVIDMAKPKLKPSTLALRKGETAE 122

Query: 129 STKGVRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQKYNSHYD 188
           +T+  R+   +     EDESG L  IEEKIA AT  P+ + E++NILRY++GQKY+SHYD
Sbjct: 123 TTQNYRS---LHQHTDEDESGVLAAIEEKIALATRFPKDYYESFNILRYQLGQKYDSHYD 182

Query: 189 AFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYNFQTCIGLKVKPRQG 248
           AF  +EYGP  SQRV +FLL+L+ VEEGGETMFPFENG NM+G Y+++ C+GLKVKPRQG
Sbjct: 183 AFHSAEYGPLISQRVVTFLLFLSSVEEGGETMFPFENGRNMNGRYDYEKCVGLKVKPRQG 242

Query: 249 DGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIRDQ 288
           D + FY++FPNGTID TSLHGSCPVIKG+KWVATKWIRDQ
Sbjct: 243 DAIFFYNLFPNGTIDQTSLHGSCPVIKGEKWVATKWIRDQ 271

BLAST of Csa6G494930 vs. TAIR10
Match: AT5G66060.1 (AT5G66060.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 186.4 bits (472), Expect = 2.5e-47
Identity = 107/237 (45.15%), Postives = 140/237 (59.07%), Query Frame = 1

Query: 57  ASDVTEF---DLMSSGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKL 116
           A+D+T      L  SGE  DDS +    +++SW PRA  +  F T E+C+ ++ LAKP +
Sbjct: 53  ANDLTSIVRKTLQRSGE--DDSKNERWVEIISWEPRASVYHNFLTKEECKYLIELAKPHM 112

Query: 117 RPSTLALRKGETAESTKG-VRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYN 176
             ST+   K  T +ST   VRTSSG F +   D+  T+  IE++I+  T IP  HGE   
Sbjct: 113 EKSTVVDEK--TGKSTDSRVRTSSGTFLARGRDK--TIREIEKRISDFTFIPVEHGEGLQ 172

Query: 177 ILRYEIGQKYNSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENG--LNMDG 236
           +L YEIGQKY  HYD F          QR+A+ L+YL+DVEEGGET+FP   G    +  
Sbjct: 173 VLHYEIGQKYEPHYDYFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPW 232

Query: 237 TYNFQTC--IGLKVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIR 286
                 C   GL VKP+ GD LLF+S+ P+ T+DP+SLHG C VIKG KW +TKW+R
Sbjct: 233 WNELSECGKGGLSVKPKMGDALLFWSMTPDATLDPSSLHGGCAVIKGNKWSSTKWLR 283

BLAST of Csa6G494930 vs. TAIR10
Match: AT1G20270.1 (AT1G20270.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 183.3 bits (464), Expect = 2.1e-46
Identity = 112/272 (41.18%), Postives = 153/272 (56.25%), Query Frame = 1

Query: 21  LIFVLCLFCFLAGFFGSTLLSQDVDDDRP--RARLLQSASDVTEFDLMSSGENGDDSISS 80
           ++F+L +   +   FG   L  + D+  P   +   ++A++ +E      G+ GD     
Sbjct: 23  MLFMLTIVLLMLLAFGVFSLPINNDESSPIDLSYFRRAATERSE----GLGKRGDQWT-- 82

Query: 81  IPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLALRKGETAESTKG-VRTSSG 140
              +VLSW PRA  +  F + E+C+ +++LAKP +  ST+     ET +S    VRTSSG
Sbjct: 83  ---EVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVV--DSETGKSKDSRVRTSSG 142

Query: 141 VFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQKYNSHYDAFKPSEYGPQ 200
            F     D+   +  IE++IA  T IP  HGE   +L YE GQKY  HYD F        
Sbjct: 143 TFLRRGRDK--IIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKN 202

Query: 201 KSQRVASFLLYLTDVEEGGETMFPFENGLNMDGT--YN-FQTC--IGLKVKPRQGDGLLF 260
             QR+A+ L+YL+DVEEGGET+FP  N +N      YN    C   GL VKPR GD LLF
Sbjct: 203 GGQRMATMLMYLSDVEEGGETVFPAAN-MNFSSVPWYNELSECGKKGLSVKPRMGDALLF 262

Query: 261 YSVFPNGTIDPTSLHGSCPVIKGQKWVATKWI 285
           +S+ P+ T+DPTSLHG CPVI+G KW +TKW+
Sbjct: 263 WSMRPDATLDPTSLHGGCPVIRGNKWSSTKWM 280

BLAST of Csa6G494930 vs. TAIR10
Match: AT2G17720.1 (AT2G17720.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 171.0 bits (432), Expect = 1.1e-42
Identity = 100/277 (36.10%), Postives = 153/277 (55.23%), Query Frame = 1

Query: 13  RSKLGLPALIFVLCLFCFLAGFFGSTLLSQDVDDDRPRARLLQSASDVTEFDLMSSGENG 72
           RS      LI +L +   L G    +L + + +  +         +D+T     S   +G
Sbjct: 19  RSTQAFTVLILLLVVILILLGLGILSLPNANRNSSK--------TNDLTNIVRKSETSSG 78

Query: 73  DDSISSIPF-QVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLALRKGETAESTKG 132
           D+  +   + +V+SW PRA+ +  F T E+C+ +++LAKP +  ST+   K   ++ ++ 
Sbjct: 79  DEEGNGERWVEVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSR- 138

Query: 133 VRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQKYNSHYDAFKP 192
           VRTSSG F     DE   + VIE++I+  T IP  +GE   +L Y++GQKY  HYD F  
Sbjct: 139 VRTSSGTFLRRGHDE--VVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLD 198

Query: 193 SEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGT---YNFQTC--IGLKVKPRQ 252
                   QR+A+ L+YL+DV++GGET+FP   G N+           C   GL V P++
Sbjct: 199 EFNTKNGGQRIATVLMYLSDVDDGGETVFPAARG-NISAVPWWNELSKCGKEGLSVLPKK 258

Query: 253 GDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKW 284
            D LLF+++ P+ ++DP+SLHG CPV+KG KW +TKW
Sbjct: 259 RDALLFWNMRPDASLDPSSLHGGCPVVKGNKWSSTKW 283

BLAST of Csa6G494930 vs. NCBI nr
Match: gi|449448264|ref|XP_004141886.1| (PREDICTED: probable prolyl 4-hydroxylase 9 [Cucumis sativus])

HSP 1 Score: 597.8 bits (1540), Expect = 1.0e-167
Identity = 294/294 (100.00%), Postives = 294/294 (100.00%), Query Frame = 1

Query: 1   MKGKSGRSNWSLRSKLGLPALIFVLCLFCFLAGFFGSTLLSQDVDDDRPRARLLQSASDV 60
           MKGKSGRSNWSLRSKLGLPALIFVLCLFCFLAGFFGSTLLSQDVDDDRPRARLLQSASDV
Sbjct: 1   MKGKSGRSNWSLRSKLGLPALIFVLCLFCFLAGFFGSTLLSQDVDDDRPRARLLQSASDV 60

Query: 61  TEFDLMSSGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLAL 120
           TEFDLMSSGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLAL
Sbjct: 61  TEFDLMSSGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLAL 120

Query: 121 RKGETAESTKGVRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQ 180
           RKGETAESTKGVRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQ
Sbjct: 121 RKGETAESTKGVRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQ 180

Query: 181 KYNSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYNFQTCIGL 240
           KYNSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYNFQTCIGL
Sbjct: 181 KYNSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYNFQTCIGL 240

Query: 241 KVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIRDQMQEDFLY 295
           KVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIRDQMQEDFLY
Sbjct: 241 KVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIRDQMQEDFLY 294

BLAST of Csa6G494930 vs. NCBI nr
Match: gi|659079617|ref|XP_008440351.1| (PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis melo])

HSP 1 Score: 566.6 bits (1459), Expect = 2.5e-158
Identity = 275/292 (94.18%), Postives = 285/292 (97.60%), Query Frame = 1

Query: 1   MKGKSGRSNWSLRSKLGLPALIFVLCLFCFLAGFFGSTLLSQDVDDDRPRARLLQSASDV 60
           MK KSG+SNWSLRSKLGLPALIFVLCLFCFLAGFFGS+LLSQDVDDDRPR+RLLQSASD 
Sbjct: 1   MKAKSGKSNWSLRSKLGLPALIFVLCLFCFLAGFFGSSLLSQDVDDDRPRSRLLQSASDG 60

Query: 61  TEFDLMSSGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLAL 120
           TEFDLMSSGENGD SISSIPFQVLSWRPRALYFPKFATAEQCQSIVN+AKPKLRPSTLAL
Sbjct: 61  TEFDLMSSGENGDASISSIPFQVLSWRPRALYFPKFATAEQCQSIVNMAKPKLRPSTLAL 120

Query: 121 RKGETAESTKGVRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQ 180
           RKGETAE+TKG+RTSSGVFFSASEDESG LGVIEEKIARATMIPRTHGEAYNILRYEIGQ
Sbjct: 121 RKGETAENTKGIRTSSGVFFSASEDESGILGVIEEKIARATMIPRTHGEAYNILRYEIGQ 180

Query: 181 KYNSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYNFQTCIGL 240
           KYNSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENG NMDGTYN+Q C+GL
Sbjct: 181 KYNSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGSNMDGTYNYQACVGL 240

Query: 241 KVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIRDQMQEDF 293
           KVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIRDQ Q+D+
Sbjct: 241 KVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIRDQTQDDY 292

BLAST of Csa6G494930 vs. NCBI nr
Match: gi|1011995519|ref|XP_015956578.1| (PREDICTED: probable prolyl 4-hydroxylase 9 [Arachis duranensis])

HSP 1 Score: 461.8 bits (1187), Expect = 8.7e-127
Identity = 224/295 (75.93%), Postives = 261/295 (88.47%), Query Frame = 1

Query: 1   MKGKSGRSNWSLRS-KLGLPALIFVLCLFCFLAGFFGSTLLSQDVDDD---RPRARLLQS 60
           MK K+ + NWSLR+ KLG P  +F++C+F FLAGFFGS L     DD    RP  RLL+S
Sbjct: 1   MKVKTVKGNWSLRTNKLGFP-YVFLICIFFFLAGFFGSALFFHSQDDGHGMRPGPRLLES 60

Query: 61  ASDVTEFDLMSSGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPS 120
             + TE++L+ +GE+GDDSI+SIPFQVLSW+PRALYFP FATAEQC+SIV++AK  L+PS
Sbjct: 61  TKEETEYNLLLAGESGDDSITSIPFQVLSWQPRALYFPNFATAEQCESIVDIAKAGLKPS 120

Query: 121 TLALRKGETAESTKGVRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRY 180
           TLALRKGET E+TKG+RTSSGVF SASED++GTL VIEEKIARATMIPR+HGEA+NILRY
Sbjct: 121 TLALRKGETEENTKGIRTSSGVFISASEDKTGTLEVIEEKIARATMIPRSHGEAFNILRY 180

Query: 181 EIGQKYNSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYNFQT 240
           EIGQ+YNSHYDAF PSEYGPQKSQRVASFLLYLTDV+EGGETMFPFENG NMDG+Y++++
Sbjct: 181 EIGQRYNSHYDAFNPSEYGPQKSQRVASFLLYLTDVQEGGETMFPFENGSNMDGSYSYES 240

Query: 241 CIGLKVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIRDQMQED 292
           CIGLKV+PRQGDGLLFYS+FPNGTIDPTSLHGSCPVIKG+KWVATKW+RDQ Q D
Sbjct: 241 CIGLKVRPRQGDGLLFYSLFPNGTIDPTSLHGSCPVIKGEKWVATKWVRDQEQYD 294

BLAST of Csa6G494930 vs. NCBI nr
Match: gi|1009145471|ref|XP_015890355.1| (PREDICTED: probable prolyl 4-hydroxylase 9 [Ziziphus jujuba])

HSP 1 Score: 450.3 bits (1157), Expect = 2.6e-123
Identity = 213/292 (72.95%), Postives = 256/292 (87.67%), Query Frame = 1

Query: 1   MKGKSGRSNWSLRSKLGLPALIFVLCLFCFLAGFFGSTLLSQDV-DDDRPRARLLQSASD 60
           M+GKS + +W+L++KLGLP+ +F+LC F FLAG F ST + QDV    R RARLL+S + 
Sbjct: 12  MRGKSVKPHWTLKTKLGLPS-VFLLCAFFFLAGLFASTFVVQDVYGGGRGRARLLESTNY 71

Query: 61  VTEFDLMSSGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLA 120
             E++L+ +GE GDDSI++IPFQVLSW PRALYFP FATAEQC SI+++A+P+LRPSTLA
Sbjct: 72  ENEYELLPNGETGDDSITTIPFQVLSWNPRALYFPNFATAEQCDSIISIAEPRLRPSTLA 131

Query: 121 LRKGETAESTKGVRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIG 180
           LRKGET E+TKG+RTSSGVF SASED++G L VIEEKIAR TM+PR HGEA+N+LRYE+G
Sbjct: 132 LRKGETVENTKGIRTSSGVFISASEDKTGILDVIEEKIARVTMLPRMHGEAFNVLRYEVG 191

Query: 181 QKYNSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYNFQTCIG 240
           Q+YNSHYDAF P+EYGPQKSQRVASFLLYL+DV+EGGETMFPFENG NMDG+Y+F+ C G
Sbjct: 192 QRYNSHYDAFNPAEYGPQKSQRVASFLLYLSDVQEGGETMFPFENGFNMDGSYDFKECTG 251

Query: 241 LKVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIRDQMQED 292
           LKVKPR+GDGLLFYS+ PNGTID TS+HGSCPVIKG+KWVATKWIRDQ Q+D
Sbjct: 252 LKVKPRKGDGLLFYSLLPNGTIDVTSIHGSCPVIKGKKWVATKWIRDQEQDD 302

BLAST of Csa6G494930 vs. NCBI nr
Match: gi|702461068|ref|XP_010028119.1| (PREDICTED: probable prolyl 4-hydroxylase 9 [Eucalyptus grandis])

HSP 1 Score: 449.9 bits (1156), Expect = 3.4e-123
Identity = 214/291 (73.54%), Postives = 250/291 (85.91%), Query Frame = 1

Query: 1   MKGKSGRSNWSLRSKLGLPALIFVLCLFCFLAGFFGSTLLSQDVDDDRPRARLLQSASDV 60
           ++GK  R +WS +SK+ LP ++   C F FLAGF+GS+LL+QDV     RAR L+S  D 
Sbjct: 13  VRGKPIRQSWSSKSKIELPVVVLA-CSFFFLAGFYGSSLLAQDVSGAGARARALESVGDE 72

Query: 61  TEFDLMSSGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLAL 120
            ++  +  GE GDDS  SIPFQVLSW PRALYFP FATAEQCQSI+ +AK  L+PSTLAL
Sbjct: 73  RDYVPLPRGETGDDSFVSIPFQVLSWGPRALYFPNFATAEQCQSIIKVAKTGLKPSTLAL 132

Query: 121 RKGETAESTKGVRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQ 180
           RKGETAE+TKG+RTSSG+F SASED++GTL +IEEKIAR TM+PR HGEA+N+LRYEIGQ
Sbjct: 133 RKGETAENTKGIRTSSGMFVSASEDKTGTLDIIEEKIARVTMLPREHGEAFNVLRYEIGQ 192

Query: 181 KYNSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYNFQTCIGL 240
           +YNSHYDAF P EYGPQKSQRVASFLLYL+DVEEGGET+FPFENG+NMDGTY+FQ C+GL
Sbjct: 193 RYNSHYDAFSPVEYGPQKSQRVASFLLYLSDVEEGGETVFPFENGINMDGTYDFQQCVGL 252

Query: 241 KVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIRDQMQED 292
           KVKPRQGDGLLFYS+ PNGTIDPTSLHGSCPVIKG+KWVATKWIRDQ+Q+D
Sbjct: 253 KVKPRQGDGLLFYSLLPNGTIDPTSLHGSCPVIKGEKWVATKWIRDQVQDD 302

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
P4H9_ARATH6.1e-11270.71Probable prolyl 4-hydroxylase 9 OS=Arabidopsis thaliana GN=P4H9 PE=2 SV=1[more]
P4H13_ARATH2.8e-8556.43Prolyl 4-hydroxylase 13 OS=Arabidopsis thaliana GN=P4H13 PE=2 SV=1[more]
P4H10_ARATH4.4e-4645.15Probable prolyl 4-hydroxylase 10 OS=Arabidopsis thaliana GN=P4H10 PE=2 SV=1[more]
P4H3_ARATH3.7e-4541.18Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana GN=P4H3 PE=2 SV=1[more]
P4H5_ARATH1.9e-4136.10Prolyl 4-hydroxylase 5 OS=Arabidopsis thaliana GN=P4H5 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0KGG4_CUCSA7.1e-168100.00Uncharacterized protein OS=Cucumis sativus GN=Csa_6G494930 PE=4 SV=1[more]
A0A059AL93_EUCGR2.4e-12373.54Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_I00722 PE=4 SV=1[more]
V4SQB7_9ROSI1.5e-12273.36Uncharacterized protein OS=Citrus clementina GN=CICLE_v10027070mg PE=4 SV=1[more]
B9T4J5_RICCO2.6e-12271.82Prolyl 4-hydroxylase alpha subunit, putative OS=Ricinus communis GN=RCOM_0396560... [more]
A0A067GJZ0_CITSI2.6e-12273.36Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g022995mg PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G33910.13.4e-11370.71 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot... [more]
AT2G23096.11.6e-8656.43 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot... [more]
AT5G66060.12.5e-4745.15 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot... [more]
AT1G20270.12.1e-4641.18 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot... [more]
AT2G17720.11.1e-4236.10 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot... [more]
Match NameE-valueIdentityDescription
gi|449448264|ref|XP_004141886.1|1.0e-167100.00PREDICTED: probable prolyl 4-hydroxylase 9 [Cucumis sativus][more]
gi|659079617|ref|XP_008440351.1|2.5e-15894.18PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis melo][more]
gi|1011995519|ref|XP_015956578.1|8.7e-12775.93PREDICTED: probable prolyl 4-hydroxylase 9 [Arachis duranensis][more]
gi|1009145471|ref|XP_015890355.1|2.6e-12372.95PREDICTED: probable prolyl 4-hydroxylase 9 [Ziziphus jujuba][more]
gi|702461068|ref|XP_010028119.1|3.4e-12373.54PREDICTED: probable prolyl 4-hydroxylase 9 [Eucalyptus grandis][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR005123Oxoglu/Fe-dep_dioxygenase
IPR006620Pro_4_hyd_alph
Vocabulary: Molecular Function
TermDefinition
GO:0016491oxidoreductase activity
GO:0016705oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:0005506iron ion binding
GO:0031418L-ascorbic acid binding
Vocabulary: Biological Process
TermDefinition
GO:0055114oxidation-reduction process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006525 arginine metabolic process
biological_process GO:0055114 oxidation-reduction process
biological_process GO:0018401 peptidyl-proline hydroxylation to 4-hydroxy-L-proline
biological_process GO:0006560 proline metabolic process
biological_process GO:0080147 root hair cell development
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0005506 iron ion binding
molecular_function GO:0031418 L-ascorbic acid binding
molecular_function GO:0004656 procollagen-proline 4-dioxygenase activity
molecular_function GO:0016491 oxidoreductase activity
molecular_function GO:0016705 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
This gene is associated with the following unigenes:
Unigene NameAnalysis NameSequence type in Unigene
CU096612cucumber EST collection version 3.0transcribed_cluster
CU106348cucumber EST collection version 3.0transcribed_cluster
CU120602cucumber EST collection version 3.0transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Csa6G494930.1Csa6G494930.1mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature NameUnique NameType
CU106348CU106348transcribed_cluster
CU096612CU096612transcribed_cluster
CU120602CU120602transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR005123Oxoglutarate/iron-dependent dioxygenasePFAMPF136402OG-FeII_Oxy_3coord: 172..285
score: 1.4
IPR005123Oxoglutarate/iron-dependent dioxygenasePROFILEPS51471FE2OG_OXYcoord: 166..286
score: 10
IPR006620Prolyl 4-hydroxylase, alpha subunitSMARTSM00702p4hccoord: 88..285
score: 6.6
NoneNo IPR availablePANTHERPTHR10869PROLYL 4-HYDROXYLASE ALPHA SUBUNITcoord: 8..290
score: 3.5E
NoneNo IPR availablePANTHERPTHR10869:SF66OXIDOREDUCTASE, 2OG-FE(II) OXYGENASE FAMILY PROTEIN-RELATEDcoord: 8..290
score: 3.5E

The following gene(s) are paralogous to this gene:

None