Cucsa.363400 (gene) Cucumber (Gy14) v1

Name: Cucsa.363400
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v1)
Description: Prolyl 4-hydroxylase alpha subunit, putative
Location: scaffold03611 : 966853 .. 970593 (-)
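
The Location field packs a scaffold name, start and end coordinates, and a strand into one string. The following is a minimal Python sketch of how it could be split apart; `parse_location` is a hypothetical helper, and the length calculation assumes 1-based inclusive coordinates, which is common for genome annotations but is not stated on this page.

```python
import re

def parse_location(text):
    """Split 'scaffold : start .. end (strand)' into named parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "scaffold": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: both coordinates are inclusive.
        "length": end - start + 1,
    }

loc = parse_location("scaffold03611 : 966853 .. 970593 (-)")
print(loc)
```

Under the inclusive-coordinate assumption the feature spans 3741 positions on the minus strand of scaffold03611.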
The following sequences are available for this feature:

Gene sequence (with intron)

TCTCTCTCTCTCTCCCTCTCGCGCGCGCGCTCTTTCTAATTTGATCCGATCGAGACTATGTTCAAATTTGATAATCTGTTATTCATCTTCTTGATTTTGACCTCATCGTTTATTCGGGAATCAACTTGTTCTTATGCTGGTTCGGCTAGCGCAACCGTAGATCCTAGTAAAGTGAAGCAGATTTCATGGAAACCGAGGTGTTTATTGCTAGCATTTTATAACGAACGGATCTTCTTTTTGTTTTCTTCGTTCTATTGTGTTTGTTTTCTGAGAAAGTTGTGGAACGGAAATTATTCAACTCTAACTTGTTTTAGCTTCTAGCTTCCGGTCGTTTAGGTTTAGTTCAGTGGCTATTGATAGTAATTTATGCTTTTCAAATTCTTGATAAGGCGATTTCACTGAATGTTGCAGAGCTTTTGTATATGAAGGTTTTCTAACGGACCTAGAATGCGACCATCTGGTTTCTATAGTAAGTTTAGATTTAGTGATATTCATTGTTCAGTTTGATATGGAAGTCATTTCGTTTCGTCAATACAAATCTATTGTTGAAGGATTGTTTTTCTTCTGTGACAGGCGAGATCCGAGCTAAAGAGATCTGAGGTTGCTGATAATGATTCGGGAAAGAGTAAGCTCAGTACTGTACGAACGAGTTCAGGAATGTTCATTTCTAAAAACAAGGTTAGGTTGAATCTCCACACTAACAATCTTCTAAGTTTTCTGGATGTTTTCTCTAATTTCTCTTTCGATTTTTGTTTTTGATCGGAATCGAGGTTCTGTTACTAGTCGTCACCGAAGCTATTTATTGAATTAAACTTAATGGAAGAATCTTATCCGTTGCGAAACTGAAATATATTGCCATTTCAAAGAGATTAAACGGCAGGAATGTTCCTGATAGTATGGATGCAATAATCAACATTACGTTCATTCAAAATTAGAGAGTAAACAATCTTTCAGGAATCATAACTTAAAACCTCCTCCTCTTCTATCTTGTTGAATTAGGACTAGATTAACAAGTGTGTATTTTGTGCTCTTCACCTGTATCTTAAAATTGATGTATATAGCAGGACAATAATTATTTCCTTTGGTTCTGTTGAGTATCAGTTGAGATCTTTACACCCATGTTTGTATCGAATATTTACATTTTGATTTGATATTTAGGATCCTATTGTTTCTGGCATAGAGGACAAAATTTCTGCATGGACTTTTCTTCCAAAAGGTACACGTTCACTGTCAGATCACATATTTTGTAATTGATATGTTATGTCATATGCTTTTGCTTTACTAGAGCTGATTGCCATAAGTAACATTCATGGTTTCTTTATAATGAAATGGTTGTCAATGTCGAGTATCACTGCTAAATGCCGTGTGTATTTCTCGTTTTTCAGAAAATGGGGAGGACATTCAGGTATTGAGATACGAGCATGGGCAGAAATATGAGTCACATTATGATTACTTTGTTGACAAGGTGAATATTGCCTGGGGTGGACATCGTTTAGCTACGGTCCTTATGTATCTCTCTAATGTGACCCAAGGTGGTGAAACAGTTTTCCCCTTGGCAGAGGTTAGTTCAAGCGTTAATTCGTACTTTACTAGAATTGCACTTTATATTCCTCATCGTAAGACTACTTCTTCTTAGATCTCAGTACTCTCTAAATAATAATCATACAACTCTGTCTTCTGATGGAGAAGATCCTTTTATAAGACTGATACGTGGAAATATTGAAATGAAACATGAGAAAGACTCGATGTGTTGGATAATCCAGAGGGTGTTGCCATTATTTTATAGGAATCACAACTGGAAATGACCTGATTTTTGTTCAGGTTGATGACACCGAACCCTTTTACCCAATCTGAACCATTTTTCTGTTTCTTTTGTTTAATTTTCTTATTTTTTGGATTTTGTTTTTCTATCTAAATGTTTATCAATCGATCTATATTCATTATGGATGCAATACAATCCTGCTTAGAGGGATTTAGCACTTTTTGCTACCATCATTTTA
AGTACTTCAGTTAGGAAAAATGTTATTGCAGTCATTTTAAAGCACCGAAATTCACCAATTATCTAAAGATTTGAACCCAACTATTGTTGAATTGGATTAGACATGAAACATCTGAGATTATTGAAGTTGTTCCTTGCTGAGAAGCCAACTTGAAACAAACTTCTATAATTAGTTATGAAGTTACATATAGATTGGAAATGGATTTGATTCAAAAATTACTTTAATCAATAGCAAACCTGTTTAACCTCGTTGTATAACCTCGGAGCAGAAACCTTCCCACCGGAGAGCTTATGAAACAGACGAGGACCTCTCAGAGTGCGCTAAGAAAGGAGTTGCAGGTGAATCACCAAACAAACTATAAATGCTTCTTTTTCGCTACTTATAAAAAACTATAAAACTTTCATGTAATTAATTGATGTACTTTGTTATTTAACATGATTCGCTTGGAATGATGTAGTGAAACCAAAGAAAGGCGATGCCCTTCTTTTCTTTAGCCTTGAACCAAATGCTATACCAGACACAAACAGTCTCCATGGAGGTTGCCCTGTCCTTGAAGGAGAAAAATGGTCAGCAACAAAGTGGATTCACGTAGATTCTTTCAGCAAAAACTTAGGAGACATTGGGAATTGTACTGATCTAAATGAAAGTTGTGAGAGATGGGCTGCCTTAGGGGAATGCACCAAAAACCCAGAATATATGGTTGGATCTCCAGAAATGCCCGGCTACTGTCGGAGGAGTTGCAGGATCTGTTGATTTCATTTCAACCTTACACTCGAAATTCCGATACCTTTTCGTGGTAAACTTCTTATTCCTTGGTTGGATAGTAGTAATCAATTTGTTGCTAAGCTTTCGTTCAAACCATTTGGTTTTTAGTTTTTGAAAATTAAGCCTTTAAATACTTATTTCTCGACTAAATTTCTTGTTTTGTAATGATTATTTATCTATATTTTCAAAAACCAAGTCAAATTTTAGAATTTAAAAACAATAGAGTTTTTTAAAAATAACAAAATATGAAACTATTTACGAAATATAACAAAATTGAGAATGATACATACTATTGATTTTTGTTTTCGGAATTTGACAAAGATTTCAATTACCATCGTAAAAAATGAACATGAGTTTTTGAAAATAAAAAAAAACAAATTGGTTACTGGATGAGACCGAATTATTGGCTTTTAAGTTTCGGAATTTCTTTTCGTTTGCATGGCACTTCAATTTGTTTAGGCTTTCCCTGTTACCTACACTCAATTGTCATATGTTTCCTTCCAAAAATGAAATAGCTTAAATTGCAAGTTTAGTACCCATAGTAGGAAAGAAGTTAGAATTCAGCTCATGGCTTATAATGTAAATTTAGTTTTTATGGTAGCACTGTTTATGAAAATTTTATGGACCTATAAAGACTAAAGTCTTAACCTTTAAAATAAGGATTAAATTCCAACTTACTCTAGAATCAATTTTGTAATTTAATAGAAAGATATAACTCTTAGATAAAGCTTTGTCCCTCATGAATTTCTTTACCTTTTTTTTTTCTTTTTTCTTTTTTCTTTTTTGGTTTTGGTGTCATTTGGAAAGTCAGTCATTTGTGGAATGGAAAGCGATACAACATTGATTGTGAAGCTATCGTGGAAGCACAGATGGGATGAAGCAGTAGCCGTTGGGTAATTATGACCTTTCCTTTTTCTTTAAAAAAAATTCATGATTATGTATTACTTTATATTGTCGTTATTTTTCTTACTTGAAT

mRNA sequence

tctctctctctctccctctcGCGCGCGCGCTCTTTCTAATTTGATCCGATCGAGACTATGTTCAAATTTGATAATCTGTTATTCATCTTCTTGATTTTGACCTCATCGTTTATTCGGGAATCAACTTGTTCTTATGCTGGTTCGGCTAGCGCAACCGTAGATCCTAGTAAAGTGAAGCAGATTTCATGGAAACCGAGAGCTTTTGTATATGAAGGTTTTCTAACGGACCTAGAATGCGACCATCTGGTTTCTATAGCGAGATCCGAGCTAAAGAGATCTGAGGTTGCTGATAATGATTCGGGAAAGAGTAAGCTCAGTACTGTACGAACGAGTTCAGGAATGTTCATTTCTAAAAACAAGGATCCTATTGTTTCTGGCATAGAGGACAAAATTTCTGCATGGACTTTTCTTCCAAAAGAAAATGGGGAGGACATTCAGGTATTGAGATACGAGCATGGGCAGAAATATGAGTCACATTATGATTACTTTGTTGACAAGGTGAATATTGCCTGGGGTGGACATCGTTTAGCTACGGTCCTTATGTATCTCTCTAATGTGACCCAAGGTGGTGAAACAGTTTTCCCCTTGGCAGAGAAACCTTCCCACCGGAGAGCTTATGAAACAGACGAGGACCTCTCAGAGTGCGCTAAGAAAGGAGTTGCAGTGAAACCAAAGAAAGGCGATGCCCTTCTTTTCTTTAGCCTTGAACCAAATGCTATACCAGACACAAACAGTCTCCATGGAGGTTGCCCTGTCCTTGAAGGAGAAAAATGGTCAGCAACAAAGTGGATTCACGTAGATTCTTTCAGCAAAAACTTAGGAGACATTGGGAATTGTACTGATCTAAATGAAAGTTGTGAGAGATGGGCTGCCTTAGGGGAATGCACCAAAAACCCAGAATATATGGTTGGATCTCCAGAAATGCCCGGCTACTGTCGGAGGAGTTGCAGGATCTGTTGATTTCATTTCAACCTTACACTCGAAATTCCGATACCTTTTCGTGTCATTTGTGGAATGGAAAGCGATACAACATTGATTGTGAAGCTATCGTGGAAGCACAGATGGGATGAAGCAGTAGCCGTTGGGTAATTATGACCTTTCCTTTTTCTTTAAAAAAAATTCATGATTATGTATTACTTTATATTGTCGTTATTTTTCTTACTTGAAT

Coding sequence (CDS)

ATGTTCAAATTTGATAATCTGTTATTCATCTTCTTGATTTTGACCTCATCGTTTATTCGGGAATCAACTTGTTCTTATGCTGGTTCGGCTAGCGCAACCGTAGATCCTAGTAAAGTGAAGCAGATTTCATGGAAACCGAGAGCTTTTGTATATGAAGGTTTTCTAACGGACCTAGAATGCGACCATCTGGTTTCTATAGCGAGATCCGAGCTAAAGAGATCTGAGGTTGCTGATAATGATTCGGGAAAGAGTAAGCTCAGTACTGTACGAACGAGTTCAGGAATGTTCATTTCTAAAAACAAGGATCCTATTGTTTCTGGCATAGAGGACAAAATTTCTGCATGGACTTTTCTTCCAAAAGAAAATGGGGAGGACATTCAGGTATTGAGATACGAGCATGGGCAGAAATATGAGTCACATTATGATTACTTTGTTGACAAGGTGAATATTGCCTGGGGTGGACATCGTTTAGCTACGGTCCTTATGTATCTCTCTAATGTGACCCAAGGTGGTGAAACAGTTTTCCCCTTGGCAGAGAAACCTTCCCACCGGAGAGCTTATGAAACAGACGAGGACCTCTCAGAGTGCGCTAAGAAAGGAGTTGCAGTGAAACCAAAGAAAGGCGATGCCCTTCTTTTCTTTAGCCTTGAACCAAATGCTATACCAGACACAAACAGTCTCCATGGAGGTTGCCCTGTCCTTGAAGGAGAAAAATGGTCAGCAACAAAGTGGATTCACGTAGATTCTTTCAGCAAAAACTTAGGAGACATTGGGAATTGTACTGATCTAAATGAAAGTTGTGAGAGATGGGCTGCCTTAGGGGAATGCACCAAAAACCCAGAATATATGGTTGGATCTCCAGAAATGCCCGGCTACTGTCGGAGGAGTTGCAGGATCTGTTGA

Protein sequence

MFKFDNLLFIFLILTSSFIRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEKPSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC*
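
The protein sequence above corresponds to a codon-by-codon translation of the CDS. As an illustrative sketch only (the page does not say which tool produced the translation), the snippet below builds the standard genetic code table in plain Python and translates the first and last few codons of the CDS; `translate` is a hypothetical helper name.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

print(translate("ATGTTCAAATTTGATAATCTG"))  # first seven codons -> MFKFDNL
print(translate("AGGATCTGTTGA"))           # last four codons  -> RIC*
```

Passing the full CDS string to `translate` should give a sequence that can be compared with the protein listed above.
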
BLAST of Cucsa.363400 vs. Swiss-Prot
Match: P4H4_ARATH (Probable prolyl 4-hydroxylase 4 OS=Arabidopsis thaliana GN=P4H4 PE=2 SV=1)

HSP 1 Score: 458.4 bits (1178), Expect = 6.2e-128
Identity = 219/294 (74.49%), Positives = 250/294 (85.03%), Query Frame = 1

Query: 7   LLFIFLILTSSFIRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSI 66
           LL  F  + S  ++ ST S   S+S  V+PSKVKQ+S KPRAFVYEGFLT+LECDH+VS+
Sbjct: 6   LLISFFAIFSVLLQSST-SLISSSSVFVNPSKVKQVSSKPRAFVYEGFLTELECDHMVSL 65

Query: 67  ARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENGEDI 126
           A++ LKRS VADNDSG+SK S VRTSSG FISK KDPIVSGIEDKIS WTFLPKENGEDI
Sbjct: 66  AKASLKRSAVADNDSGESKFSEVRTSSGTFISKGKDPIVSGIEDKISTWTFLPKENGEDI 125

Query: 127 QVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEKPSHRRA 186
           QVLRYEHGQKY++H+DYF DKVNI  GGHR+AT+LMYLSNVT+GGETVFP AE PS R  
Sbjct: 126 QVLRYEHGQKYDAHFDYFHDKVNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVL 185

Query: 187 YETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIH 246
            E  EDLS+CAK+G+AVKP+KGDALLFF+L P+AIPD  SLHGGCPV+EGEKWSATKWIH
Sbjct: 186 SENKEDLSDCAKRGIAVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIH 245

Query: 247 VDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 301
           VDSF + +   GNCTD+NESCERWA LGECTKNPEYMVG+ E+PGYCRRSC+ C
Sbjct: 246 VDSFDRIVTPSGNCTDMNESCERWAVLGECTKNPEYMVGTTELPGYCRRSCKAC 298
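
The percentages in an HSP header are the reported counts divided by the alignment length: identities count positions where both residues are the same, and positives additionally count substitutions that score above zero in the scoring matrix. A small Python sketch reproducing the figures of the HSP above (`percent` is a hypothetical helper):

```python
def percent(count, aligned_length):
    """Return count / aligned_length as a percentage with two decimals."""
    return round(100.0 * count / aligned_length, 2)

# Counts taken from the P4H4_ARATH HSP header.
print(percent(219, 294))  # identity  -> 74.49
print(percent(250, 294))  # positives -> 85.03
```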

BLAST of Cucsa.363400 vs. Swiss-Prot
Match: P4H2_ARATH (Prolyl 4-hydroxylase 2 OS=Arabidopsis thaliana GN=P4H2 PE=1 SV=1)

HSP 1 Score: 448.0 bits (1151), Expect = 8.3e-125
Identity = 214/300 (71.33%), Positives = 255/300 (85.00%), Query Frame = 1

Query: 1   MFKFDNLLFIFLILTSSFIRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           M +   LLF+ ++L    ++ STC  + S S+ ++PSKVKQ+S KPRAFVYEGFLTDLEC
Sbjct: 3   MSRLGLLLFVAILLV--LLQSSTCLIS-SPSSIINPSKVKQVSSKPRAFVYEGFLTDLEC 62

Query: 61  DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK 120
           DHL+S+A+  L+RS VADND+G+S++S VRTSSG FISK KDPIVSGIEDK+S WTFLPK
Sbjct: 63  DHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPK 122

Query: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEK 180
           ENGED+QVLRYEHGQKY++H+DYF DKVNIA GGHR+ATVL+YLSNVT+GGETVFP A++
Sbjct: 123 ENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQE 182

Query: 181 PSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240
            S R   E  +DLS+CAKKG+AVKPKKG+ALLFF+L+ +AIPD  SLHGGCPV+EGEKWS
Sbjct: 183 FSRRSLSENKDDLSDCAKKGIAVKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWS 242

Query: 241 ATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300
           ATKWIHVDSF K L   GNCTD+NESCERWA LGEC KNPEYMVG+PE+PG CRRSC+ C
Sbjct: 243 ATKWIHVDSFDKILTHDGNCTDVNESCERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 299

BLAST of Cucsa.363400 vs. Swiss-Prot
Match: P4H7_ARATH (Probable prolyl 4-hydroxylase 7 OS=Arabidopsis thaliana GN=P4H7 PE=2 SV=1)

HSP 1 Score: 358.2 bits (918), Expect = 8.7e-98
Identity = 165/272 (60.66%), Positives = 209/272 (76.84%), Query Frame = 1

Query: 29  SASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLST 88
           ++S   DP++V Q+SW PR F+YEGFL+D ECDH + +A+ +L++S VADNDSG+S  S 
Sbjct: 46  ASSFGFDPTRVTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE 105

Query: 89  VRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKV 148
           VRTSSGMF+SK +D IVS +E K++AWTFLP+ENGE +Q+L YE+GQKYE H+DYF D+ 
Sbjct: 106 VRTSSGMFLSKRQDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQA 165

Query: 149 NIAWGGHRLATVLMYLSNVTQGGETVFPLAEKPSHRRAYETDEDLSECAKKGVAVKPKKG 208
           N+  GGHR+ATVLMYLSNV +GGETVFP+ +    +     D+  +ECAK+G AVKP+KG
Sbjct: 166 NLELGGHRIATVLMYLSNVEKGGETVFPMWK---GKATQLKDDSWTECAKQGYAVKPRKG 225

Query: 209 DALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDLNESCE 268
           DALLFF+L PNA  D+NSLHG CPV+EGEKWSAT+WIHV SF +       C D N SCE
Sbjct: 226 DALLFFNLHPNATTDSNSLHGSCPVVEGEKWSATRWIHVKSFERAFNKQSGCMDENVSCE 285

Query: 269 RWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 301
           +WA  GEC KNP YMVGS +  GYCR+SC+ C
Sbjct: 286 KWAKAGECQKNPTYMVGSDKDHGYCRKSCKAC 314

BLAST of Cucsa.363400 vs. Swiss-Prot
Match: P4H6_ARATH (Probable prolyl 4-hydroxylase 6 OS=Arabidopsis thaliana GN=P4H6 PE=2 SV=1)

HSP 1 Score: 335.1 bits (858), Expect = 7.9e-91
Identity = 162/277 (58.48%), Positives = 207/277 (74.73%), Query Frame = 1

Query: 25  SYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEV-ADNDSGK 84
           S   S S +VDP+++ Q+SW PRAF+Y+GFL+D ECDHL+ +A+ +L++S V AD DSG+
Sbjct: 18  SQISSFSFSVDPTRITQLSWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGE 77

Query: 85  SKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDY 144
           S+ S VRTSSGMF++K +D IV+ +E K++AWTFLP+ENGE +Q+L YE+GQKY+ H+DY
Sbjct: 78  SEDSEVRTSSGMFLTKRQDDIVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDY 137

Query: 145 FVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEKPSHRRAYETDEDLSECAKKGVAV 204
           F DK  +  GGHR+ATVLMYLSNVT+GGETVFP       +     D+  S+CAK+G AV
Sbjct: 138 FYDKKALELGGHRIATVLMYLSNVTKGGETVFP---NWKGKTPQLKDDSWSKCAKQGYAV 197

Query: 205 KPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDL 264
           KP+KGDALLFF+L  N   D NSLHG CPV+EGEKWSAT+WIHV SF K       C D 
Sbjct: 198 KPRKGDALLFFNLHLNGTTDPNSLHGSCPVIEGEKWSATRWIHVRSFGKKK---LVCVDD 257

Query: 265 NESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 301
           +ESC+ WA  GEC KNP YMVGS    G+CR+SC+ C
Sbjct: 258 HESCQEWADAGECEKNPMYMVGSETSLGFCRKSCKAC 288

BLAST of Cucsa.363400 vs. Swiss-Prot
Match: P4H3_ARATH (Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana GN=P4H3 PE=2 SV=1)

HSP 1 Score: 244.6 bits (623), Expect = 1.4e-63
Identity = 114/209 (54.55%), Positives = 156/209 (74.64%), Query Frame = 1

Query: 42  ISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNK 101
           +SW+PRAFVY  FL+  EC++L+S+A+  + +S V D+++GKSK S VRTSSG F+ + +
Sbjct: 79  LSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFLRRGR 138

Query: 102 DPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVL 161
           D I+  IE +I+ +TF+P ++GE +QVL YE GQKYE HYDYFVD+ N   GG R+AT+L
Sbjct: 139 DKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQRMATML 198

Query: 162 MYLSNVTQGGETVFPLAEKPSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAI 221
           MYLS+V +GGETVFP A    +  +     +LSEC KKG++VKP+ GDALLF+S+ P+A 
Sbjct: 199 MYLSDVEEGGETVFPAAN--MNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDAT 258

Query: 222 PDTNSLHGGCPVLEGEKWSATKWIHVDSF 251
            D  SLHGGCPV+ G KWS+TKW+HV  +
Sbjct: 259 LDPTSLHGGCPVIRGNKWSSTKWMHVGEY 285

BLAST of Cucsa.363400 vs. TrEMBL
Match: A0A0A0KCQ5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G067350 PE=4 SV=1)

HSP 1 Score: 621.7 bits (1602), Expect = 4.7e-175
Identity = 300/300 (100.00%), Positives = 300/300 (100.00%), Query Frame = 1

Query: 1   MFKFDNLLFIFLILTSSFIRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           MFKFDNLLFIFLILTSSFIRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MFKFDNLLFIFLILTSSFIRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK 120
           DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK
Sbjct: 61  DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEK 180
           ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEK
Sbjct: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEK 180

Query: 181 PSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240
           PSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS
Sbjct: 181 PSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300
           ATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
Sbjct: 241 ATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300

BLAST of Cucsa.363400 vs. TrEMBL
Match: B9RSW4_RICCO (Prolyl 4-hydroxylase alpha subunit, putative OS=Ricinus communis GN=RCOM_0679070 PE=4 SV=1)

HSP 1 Score: 489.6 bits (1259), Expect = 2.8e-135
Identity = 226/292 (77.40%), Positives = 260/292 (89.04%), Query Frame = 1

Query: 9   FIFLILTSSFIRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIAR 68
           F+FL+L S    +S+ SY GS ++ +DPSKVKQ+SWKPRAFVYEGFLTDLECDHL+S+A+
Sbjct: 7   FVFLLLISLIFHKSS-SYPGSPTSIIDPSKVKQVSWKPRAFVYEGFLTDLECDHLISLAK 66

Query: 69  SELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENGEDIQV 128
           SELKRS VADN+SGKSKLS VRTSSGMFI+K KDPI++GIE+KIS WTFLPKENGED+QV
Sbjct: 67  SELKRSAVADNESGKSKLSEVRTSSGMFIAKGKDPIIAGIEEKISTWTFLPKENGEDLQV 126

Query: 129 LRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEKPSHRRAYE 188
           LRYEHGQKY+ HYDYF DK+NIA GGHR+ATVLMYLS+V +GGETVFP AE+P  R+A E
Sbjct: 127 LRYEHGQKYDPHYDYFADKINIARGGHRMATVLMYLSDVVKGGETVFPNAEEPPRRKATE 186

Query: 189 TDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVD 248
           + EDLSECAKKG++VKP++GDALLFFSL P AIPD NSLH GCPV+EGEKWSATKWIHVD
Sbjct: 187 SHEDLSECAKKGISVKPRRGDALLFFSLHPTAIPDPNSLHAGCPVIEGEKWSATKWIHVD 246

Query: 249 SFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 301
           SF KN+   GNCTD NESCERWAALGECT NPEYMVGSPE+PGYCRRSC++C
Sbjct: 247 SFDKNIEAGGNCTDKNESCERWAALGECTNNPEYMVGSPELPGYCRRSCKVC 297

BLAST of Cucsa.363400 vs. TrEMBL
Match: W9SGN4_9ROSA (Prolyl 4-hydroxylase subunit alpha-1 OS=Morus notabilis GN=L484_011286 PE=4 SV=1)

HSP 1 Score: 486.5 bits (1251), Expect = 2.3e-134
Identity = 228/293 (77.82%), Positives = 258/293 (88.05%), Query Frame = 1

Query: 8   LFIFLILTSSFIRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIA 67
           LF+FL+  SS   ES+ SYAGSA++ ++PSKVKQ+SWKPRAFVYEGFLTDLECDHL+S+A
Sbjct: 8   LFLFLLSISSSFHESSSSYAGSAASIINPSKVKQVSWKPRAFVYEGFLTDLECDHLISLA 67

Query: 68  RSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENGEDIQ 127
           +SELKRS VADN SGKSKLS VRTSSGMFI K KDPIV+GIEDKIS WTFLPKENGED+Q
Sbjct: 68  KSELKRSAVADNVSGKSKLSEVRTSSGMFIPKAKDPIVAGIEDKISTWTFLPKENGEDMQ 127

Query: 128 VLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEKPSHRRAY 187
           VLRYEHGQKY+ HYDYF DKVNIA GGHR+ATVLMYL++V +GGETVFP AE+  H +A 
Sbjct: 128 VLRYEHGQKYDPHYDYFADKVNIARGGHRIATVLMYLTDVVKGGETVFPSAEESHHHKAS 187

Query: 188 ETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHV 247
            TD+DLSECAKKG+AVKP++GDALLFFSL P A+PDT SLH GCPV+EGEKWSATKWIHV
Sbjct: 188 TTDDDLSECAKKGIAVKPRRGDALLFFSLLPTAVPDTISLHAGCPVIEGEKWSATKWIHV 247

Query: 248 DSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 301
           DSF K+L   G CTD NESCERWAALGEC KN EYMVGSPE+PGYCRRSC++C
Sbjct: 248 DSFDKDLSAGGKCTDQNESCERWAALGECNKNREYMVGSPELPGYCRRSCKVC 300

BLAST of Cucsa.363400 vs. TrEMBL
Match: A0A067L7I1_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_02052 PE=4 SV=1)

HSP 1 Score: 484.6 bits (1246), Expect = 8.9e-134
Identity = 230/294 (78.23%), Positives = 260/294 (88.44%), Query Frame = 1

Query: 7   LLFIFLILTSSFIRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSI 66
           L F+FL L+ S I   + SY G++S+ +DP+KVKQ+SWKPRAFVY GFLTDLECDHL+S+
Sbjct: 8   LQFLFL-LSISLILHKSGSYPGTSSSIIDPAKVKQVSWKPRAFVYHGFLTDLECDHLISL 67

Query: 67  ARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENGEDI 126
           A+SELKRS VADN SGKSK++ VRTSSGMFI K KDPIV+GIEDKI+ WTFLPKENGEDI
Sbjct: 68  AKSELKRSAVADNVSGKSKVAEVRTSSGMFIPKGKDPIVAGIEDKIATWTFLPKENGEDI 127

Query: 127 QVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEKPSHRRA 186
           QVLRYE+GQKY+ HYDYFVD+VNIA GGHRLATVLMYLSNV +GGETVFP AE    R+A
Sbjct: 128 QVLRYEYGQKYDPHYDYFVDRVNIARGGHRLATVLMYLSNVEKGGETVFPSAEDAPRRKA 187

Query: 187 YETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIH 246
            E DEDLSECAKKG+AVKP++GDALLFFSL PNA+PD +SLH GCPV+EGEKWSATKWIH
Sbjct: 188 NEGDEDLSECAKKGIAVKPRRGDALLFFSLLPNAVPDQSSLHAGCPVIEGEKWSATKWIH 247

Query: 247 VDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 301
           VDSFSKNL   GNCTDLNESCERWAALGECTKNPEYMVGS E+PGYCRRSC++C
Sbjct: 248 VDSFSKNLEADGNCTDLNESCERWAALGECTKNPEYMVGSAELPGYCRRSCKVC 300

BLAST of Cucsa.363400 vs. TrEMBL
Match: A0A0B2P6V3_GLYSO (Prolyl 4-hydroxylase subunit alpha-2 OS=Glycine soja GN=glysoja_004434 PE=4 SV=1)

HSP 1 Score: 475.3 bits (1222), Expect = 5.4e-131
Identity = 224/292 (76.71%), Positives = 255/292 (87.33%), Query Frame = 1

Query: 9   FIFLILTSSFIRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIAR 68
           F+  +L  S   +   SYAGSAS+ V+PSKVKQISWKPRAFVYEGFLTDLECDHL+S+A+
Sbjct: 7   FLLFLLLISKCHQVWGSYAGSASSIVNPSKVKQISWKPRAFVYEGFLTDLECDHLISLAK 66

Query: 69  SELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENGEDIQV 128
           SELKRS VADN SG+S+LS VRTSSGMFISKNKDPI+SGIEDKIS+WTFLPKENGEDIQV
Sbjct: 67  SELKRSAVADNLSGESQLSDVRTSSGMFISKNKDPIISGIEDKISSWTFLPKENGEDIQV 126

Query: 129 LRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEKPSHRRAYE 188
           LRYEHGQKY+ HYDYF DKVNIA GGHR+ATVLMYL+NVT+GGETVFP AE+P  RR  E
Sbjct: 127 LRYEHGQKYDPHYDYFTDKVNIARGGHRIATVLMYLTNVTKGGETVFPSAEEPPRRRGTE 186

Query: 189 TDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVD 248
           T  DLSECAKKG+AVKP +GDALLFFSL  NA PDT+SLH GCPV+EGEKWSATKWIHVD
Sbjct: 187 TSSDLSECAKKGIAVKPHRGDALLFFSLHTNATPDTSSLHAGCPVIEGEKWSATKWIHVD 246

Query: 249 SFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 301
           SF K +G  G+C+D + SCERWA+LGECTKNPEYM+GS ++PGYCR+SC+ C
Sbjct: 247 SFDKTVGAGGDCSDHHVSCERWASLGECTKNPEYMIGSSDVPGYCRKSCKSC 298

BLAST of Cucsa.363400 vs. TAIR10
Match: AT5G18900.1 (AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 458.4 bits (1178), Expect = 3.5e-129
Identity = 219/294 (74.49%), Positives = 250/294 (85.03%), Query Frame = 1

Query: 7   LLFIFLILTSSFIRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSI 66
           LL  F  + S  ++ ST S   S+S  V+PSKVKQ+S KPRAFVYEGFLT+LECDH+VS+
Sbjct: 6   LLISFFAIFSVLLQSST-SLISSSSVFVNPSKVKQVSSKPRAFVYEGFLTELECDHMVSL 65

Query: 67  ARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENGEDI 126
           A++ LKRS VADNDSG+SK S VRTSSG FISK KDPIVSGIEDKIS WTFLPKENGEDI
Sbjct: 66  AKASLKRSAVADNDSGESKFSEVRTSSGTFISKGKDPIVSGIEDKISTWTFLPKENGEDI 125

Query: 127 QVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEKPSHRRA 186
           QVLRYEHGQKY++H+DYF DKVNI  GGHR+AT+LMYLSNVT+GGETVFP AE PS R  
Sbjct: 126 QVLRYEHGQKYDAHFDYFHDKVNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVL 185

Query: 187 YETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIH 246
            E  EDLS+CAK+G+AVKP+KGDALLFF+L P+AIPD  SLHGGCPV+EGEKWSATKWIH
Sbjct: 186 SENKEDLSDCAKRGIAVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIH 245

Query: 247 VDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 301
           VDSF + +   GNCTD+NESCERWA LGECTKNPEYMVG+ E+PGYCRRSC+ C
Sbjct: 246 VDSFDRIVTPSGNCTDMNESCERWAVLGECTKNPEYMVGTTELPGYCRRSCKAC 298

BLAST of Cucsa.363400 vs. TAIR10
Match: AT3G06300.1 (AT3G06300.1 P4H isoform 2)

HSP 1 Score: 448.0 bits (1151), Expect = 4.7e-126
Identity = 214/300 (71.33%), Positives = 255/300 (85.00%), Query Frame = 1

Query: 1   MFKFDNLLFIFLILTSSFIRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           M +   LLF+ ++L    ++ STC  + S S+ ++PSKVKQ+S KPRAFVYEGFLTDLEC
Sbjct: 3   MSRLGLLLFVAILLV--LLQSSTCLIS-SPSSIINPSKVKQVSSKPRAFVYEGFLTDLEC 62

Query: 61  DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK 120
           DHL+S+A+  L+RS VADND+G+S++S VRTSSG FISK KDPIVSGIEDK+S WTFLPK
Sbjct: 63  DHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPK 122

Query: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEK 180
           ENGED+QVLRYEHGQKY++H+DYF DKVNIA GGHR+ATVL+YLSNVT+GGETVFP A++
Sbjct: 123 ENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQE 182

Query: 181 PSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240
            S R   E  +DLS+CAKKG+AVKPKKG+ALLFF+L+ +AIPD  SLHGGCPV+EGEKWS
Sbjct: 183 FSRRSLSENKDDLSDCAKKGIAVKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWS 242

Query: 241 ATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300
           ATKWIHVDSF K L   GNCTD+NESCERWA LGEC KNPEYMVG+PE+PG CRRSC+ C
Sbjct: 243 ATKWIHVDSFDKILTHDGNCTDVNESCERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 299

BLAST of Cucsa.363400 vs. TAIR10
Match: AT3G28480.2 (AT3G28480.2 Oxoglutarate/iron-dependent oxygenase)

HSP 1 Score: 335.9 bits (860), Expect = 2.6e-92
Identity = 159/280 (56.79%), Positives = 205/280 (73.21%), Query Frame = 1

Query: 29  SASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKS---- 88
           ++S   DP++V Q+SW PR F+YEGFL+D ECDH + +A+ +L++S VADNDSG+S    
Sbjct: 46  ASSFGFDPTRVTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE 105

Query: 89  -KLSTVRTSSGMFISKNK---DPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESH 148
             +S VR SS    + +    D IVS +E K++AWTFLP+ENGE +Q+L YE+GQKYE H
Sbjct: 106 DSVSVVRQSSSFIANMDSLEIDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPH 165

Query: 149 YDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEKPSHRRAYETDEDLSECAKKG 208
           +DYF D+ N+  GGHR+ATVLMYLSNV +GGETVFP+ +  + +     D+  +ECAK+G
Sbjct: 166 FDYFHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWKGKATQLK---DDSWTECAKQG 225

Query: 209 VAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNC 268
            AVKP+KGDALLFF+L PNA  D+NSLHG CPV+EGEKWSAT+WIHV SF +       C
Sbjct: 226 YAVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKWSATRWIHVKSFERAFNKQSGC 285

Query: 269 TDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 301
            D N SCE+WA  GEC KNP YMVGS +  GYCR+SC+ C
Sbjct: 286 MDENVSCEKWAKAGECQKNPTYMVGSDKDHGYCRKSCKAC 322

BLAST of Cucsa.363400 vs. TAIR10
Match: AT3G28490.1 (AT3G28490.1 Oxoglutarate/iron-dependent oxygenase)

HSP 1 Score: 335.1 bits (858), Expect = 4.4e-92
Identity = 162/277 (58.48%), Positives = 207/277 (74.73%), Query Frame = 1

Query: 25  SYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEV-ADNDSGK 84
           S   S S +VDP+++ Q+SW PRAF+Y+GFL+D ECDHL+ +A+ +L++S V AD DSG+
Sbjct: 18  SQISSFSFSVDPTRITQLSWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGE 77

Query: 85  SKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDY 144
           S+ S VRTSSGMF++K +D IV+ +E K++AWTFLP+ENGE +Q+L YE+GQKY+ H+DY
Sbjct: 78  SEDSEVRTSSGMFLTKRQDDIVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDY 137

Query: 145 FVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEKPSHRRAYETDEDLSECAKKGVAV 204
           F DK  +  GGHR+ATVLMYLSNVT+GGETVFP       +     D+  S+CAK+G AV
Sbjct: 138 FYDKKALELGGHRIATVLMYLSNVTKGGETVFP---NWKGKTPQLKDDSWSKCAKQGYAV 197

Query: 205 KPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGDIGNCTDL 264
           KP+KGDALLFF+L  N   D NSLHG CPV+EGEKWSAT+WIHV SF K       C D 
Sbjct: 198 KPRKGDALLFFNLHLNGTTDPNSLHGSCPVIEGEKWSATRWIHVRSFGKKK---LVCVDD 257

Query: 265 NESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 301
           +ESC+ WA  GEC KNP YMVGS    G+CR+SC+ C
Sbjct: 258 HESCQEWADAGECEKNPMYMVGSETSLGFCRKSCKAC 288

BLAST of Cucsa.363400 vs. TAIR10
Match: AT1G20270.1 (AT1G20270.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 244.6 bits (623), Expect = 7.9e-65
Identity = 114/209 (54.55%), Positives = 156/209 (74.64%), Query Frame = 1

Query: 42  ISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNK 101
           +SW+PRAFVY  FL+  EC++L+S+A+  + +S V D+++GKSK S VRTSSG F+ + +
Sbjct: 79  LSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFLRRGR 138

Query: 102 DPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVL 161
           D I+  IE +I+ +TF+P ++GE +QVL YE GQKYE HYDYFVD+ N   GG R+AT+L
Sbjct: 139 DKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQRMATML 198

Query: 162 MYLSNVTQGGETVFPLAEKPSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAI 221
           MYLS+V +GGETVFP A    +  +     +LSEC KKG++VKP+ GDALLF+S+ P+A 
Sbjct: 199 MYLSDVEEGGETVFPAAN--MNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDAT 258

Query: 222 PDTNSLHGGCPVLEGEKWSATKWIHVDSF 251
            D  SLHGGCPV+ G KWS+TKW+HV  +
Sbjct: 259 LDPTSLHGGCPVIRGNKWSSTKWMHVGEY 285

BLAST of Cucsa.363400 vs. NCBI nr
Match: gi|449454448|ref|XP_004144967.1| (PREDICTED: probable prolyl 4-hydroxylase 4 [Cucumis sativus])

HSP 1 Score: 621.7 bits (1602), Expect = 6.7e-175
Identity = 300/300 (100.00%), Positives = 300/300 (100.00%), Query Frame = 1

Query: 1   MFKFDNLLFIFLILTSSFIRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           MFKFDNLLFIFLILTSSFIRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MFKFDNLLFIFLILTSSFIRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK 120
           DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK
Sbjct: 61  DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEK 180
           ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEK
Sbjct: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEK 180

Query: 181 PSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240
           PSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS
Sbjct: 181 PSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300
           ATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
Sbjct: 241 ATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300

BLAST of Cucsa.363400 vs. NCBI nr
Match: gi|659117281|ref|XP_008458517.1| (PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis melo])

HSP 1 Score: 608.2 bits (1567), Expect = 7.7e-171
Identity = 292/300 (97.33%), Positives = 296/300 (98.67%), Query Frame = 1

Query: 1   MFKFDNLLFIFLILTSSFIRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLEC 60
           MFKF NLLF FLIL SSF+RESTCSYAGSASATVDPS+VKQISWKPRAFVYEGFLTDLEC
Sbjct: 1   MFKFRNLLFFFLILISSFVRESTCSYAGSASATVDPSRVKQISWKPRAFVYEGFLTDLEC 60

Query: 61  DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK 120
           DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK
Sbjct: 61  DHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPK 120

Query: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEK 180
           ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVT+GGETVFPLAEK
Sbjct: 121 ENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTKGGETVFPLAEK 180

Query: 181 PSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240
            SHRRAYETDEDLSECAKKG+AVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS
Sbjct: 181 SSHRRAYETDEDLSECAKKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWS 240

Query: 241 ATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300
           ATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC
Sbjct: 241 ATKWIHVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300

BLAST of Cucsa.363400 vs. NCBI nr
Match: gi|255551575|ref|XP_002516833.1| (PREDICTED: probable prolyl 4-hydroxylase 4 [Ricinus communis])

HSP 1 Score: 489.6 bits (1259), Expect = 4.0e-135
Identity = 226/292 (77.40%), Positives = 260/292 (89.04%), Query Frame = 1

Query: 9   FIFLILTSSFIRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIAR 68
           F+FL+L S    +S+ SY GS ++ +DPSKVKQ+SWKPRAFVYEGFLTDLECDHL+S+A+
Sbjct: 7   FVFLLLISLIFHKSS-SYPGSPTSIIDPSKVKQVSWKPRAFVYEGFLTDLECDHLISLAK 66

Query: 69  SELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENGEDIQV 128
           SELKRS VADN+SGKSKLS VRTSSGMFI+K KDPI++GIE+KIS WTFLPKENGED+QV
Sbjct: 67  SELKRSAVADNESGKSKLSEVRTSSGMFIAKGKDPIIAGIEEKISTWTFLPKENGEDLQV 126

Query: 129 LRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEKPSHRRAYE 188
           LRYEHGQKY+ HYDYF DK+NIA GGHR+ATVLMYLS+V +GGETVFP AE+P  R+A E
Sbjct: 127 LRYEHGQKYDPHYDYFADKINIARGGHRMATVLMYLSDVVKGGETVFPNAEEPPRRKATE 186

Query: 189 TDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVD 248
           + EDLSECAKKG++VKP++GDALLFFSL P AIPD NSLH GCPV+EGEKWSATKWIHVD
Sbjct: 187 SHEDLSECAKKGISVKPRRGDALLFFSLHPTAIPDPNSLHAGCPVIEGEKWSATKWIHVD 246

Query: 249 SFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 301
           SF KN+   GNCTD NESCERWAALGECT NPEYMVGSPE+PGYCRRSC++C
Sbjct: 247 SFDKNIEAGGNCTDKNESCERWAALGECTNNPEYMVGSPELPGYCRRSCKVC 297

BLAST of Cucsa.363400 vs. NCBI nr
Match: gi|703134588|ref|XP_010105675.1| (Prolyl 4-hydroxylase subunit alpha-1 [Morus notabilis])

HSP 1 Score: 486.5 bits (1251), Expect = 3.4e-134
Identity = 228/293 (77.82%), Positives = 258/293 (88.05%), Query Frame = 1

Query: 8   LFIFLILTSSFIRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIA 67
           LF+FL+  SS   ES+ SYAGSA++ ++PSKVKQ+SWKPRAFVYEGFLTDLECDHL+S+A
Sbjct: 8   LFLFLLSISSSFHESSSSYAGSAASIINPSKVKQVSWKPRAFVYEGFLTDLECDHLISLA 67

Query: 68  RSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENGEDIQ 127
           +SELKRS VADN SGKSKLS VRTSSGMFI K KDPIV+GIEDKIS WTFLPKENGED+Q
Sbjct: 68  KSELKRSAVADNVSGKSKLSEVRTSSGMFIPKAKDPIVAGIEDKISTWTFLPKENGEDMQ 127

Query: 128 VLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEKPSHRRAY 187
           VLRYEHGQKY+ HYDYF DKVNIA GGHR+ATVLMYL++V +GGETVFP AE+  H +A 
Sbjct: 128 VLRYEHGQKYDPHYDYFADKVNIARGGHRIATVLMYLTDVVKGGETVFPSAEESHHHKAS 187

Query: 188 ETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHV 247
            TD+DLSECAKKG+AVKP++GDALLFFSL P A+PDT SLH GCPV+EGEKWSATKWIHV
Sbjct: 188 TTDDDLSECAKKGIAVKPRRGDALLFFSLLPTAVPDTISLHAGCPVIEGEKWSATKWIHV 247

Query: 248 DSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 301
           DSF K+L   G CTD NESCERWAALGEC KN EYMVGSPE+PGYCRRSC++C
Sbjct: 248 DSFDKDLSAGGKCTDQNESCERWAALGECNKNREYMVGSPELPGYCRRSCKVC 300

BLAST of Cucsa.363400 vs. NCBI nr
Match: gi|802578382|ref|XP_012069451.1| (PREDICTED: probable prolyl 4-hydroxylase 4 [Jatropha curcas])

HSP 1 Score: 484.6 bits (1246), Expect = 1.3e-133
Identity = 230/294 (78.23%), Positives = 260/294 (88.44%), Query Frame = 1

Query: 7   LLFIFLILTSSFIRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSI 66
           L F+FL L+ S I   + SY G++S+ +DP+KVKQ+SWKPRAFVY GFLTDLECDHL+S+
Sbjct: 8   LQFLFL-LSISLILHKSGSYPGTSSSIIDPAKVKQVSWKPRAFVYHGFLTDLECDHLISL 67

Query: 67  ARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENGEDI 126
           A+SELKRS VADN SGKSK++ VRTSSGMFI K KDPIV+GIEDKI+ WTFLPKENGEDI
Sbjct: 68  AKSELKRSAVADNVSGKSKVAEVRTSSGMFIPKGKDPIVAGIEDKIATWTFLPKENGEDI 127

Query: 127 QVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEKPSHRRA 186
           QVLRYE+GQKY+ HYDYFVD+VNIA GGHRLATVLMYLSNV +GGETVFP AE    R+A
Sbjct: 128 QVLRYEYGQKYDPHYDYFVDRVNIARGGHRLATVLMYLSNVEKGGETVFPSAEDAPRRKA 187

Query: 187 YETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIH 246
            E DEDLSECAKKG+AVKP++GDALLFFSL PNA+PD +SLH GCPV+EGEKWSATKWIH
Sbjct: 188 NEGDEDLSECAKKGIAVKPRRGDALLFFSLLPNAVPDQSSLHAGCPVIEGEKWSATKWIH 247

Query: 247 VDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 301
           VDSFSKNL   GNCTDLNESCERWAALGECTKNPEYMVGS E+PGYCRRSC++C
Sbjct: 248 VDSFSKNLEADGNCTDLNESCERWAALGECTKNPEYMVGSAELPGYCRRSCKVC 300

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
P4H4_ARATH	6.2e-128	74.49	Probable prolyl 4-hydroxylase 4 OS=Arabidopsis thaliana GN=P4H4 PE=2 SV=1
P4H2_ARATH	8.3e-125	71.33	Prolyl 4-hydroxylase 2 OS=Arabidopsis thaliana GN=P4H2 PE=1 SV=1
P4H7_ARATH	8.7e-98	60.66	Probable prolyl 4-hydroxylase 7 OS=Arabidopsis thaliana GN=P4H7 PE=2 SV=1
P4H6_ARATH	7.9e-91	58.48	Probable prolyl 4-hydroxylase 6 OS=Arabidopsis thaliana GN=P4H6 PE=2 SV=1
P4H3_ARATH	1.4e-63	54.55	Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana GN=P4H3 PE=2 SV=1
Match Name	E-value	Identity	Description
A0A0A0KCQ5_CUCSA	4.7e-175	100.00	Uncharacterized protein OS=Cucumis sativus GN=Csa_6G067350 PE=4 SV=1
B9RSW4_RICCO	2.8e-135	77.40	Prolyl 4-hydroxylase alpha subunit, putative OS=Ricinus communis GN=RCOM_0679070...
W9SGN4_9ROSA	2.3e-134	77.82	Prolyl 4-hydroxylase subunit alpha-1 OS=Morus notabilis GN=L484_011286 PE=4 SV=1
A0A067L7I1_JATCU	8.9e-134	78.23	Uncharacterized protein OS=Jatropha curcas GN=JCGZ_02052 PE=4 SV=1
A0A0B2P6V3_GLYSO	5.4e-131	76.71	Prolyl 4-hydroxylase subunit alpha-2 OS=Glycine soja GN=glysoja_004434 PE=4 SV=1
Match Name	E-value	Identity	Description
AT5G18900.1	3.5e-129	74.49	2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot...
AT3G06300.1	4.7e-126	71.33	P4H isoform 2
AT3G28480.2	2.6e-92	56.79	Oxoglutarate/iron-dependent oxygenase
AT3G28490.1	4.4e-92	58.48	Oxoglutarate/iron-dependent oxygenase
AT1G20270.1	7.9e-65	54.55	2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot...
Match Name	E-value	Identity	Description
gi|449454448|ref|XP_004144967.1|	6.7e-175	100.00	PREDICTED: probable prolyl 4-hydroxylase 4 [Cucumis sativus]
gi|659117281|ref|XP_008458517.1|	7.7e-171	97.33	PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis melo]
gi|255551575|ref|XP_002516833.1|	4.0e-135	77.40	PREDICTED: probable prolyl 4-hydroxylase 4 [Ricinus communis]
gi|703134588|ref|XP_010105675.1|	3.4e-134	77.82	Prolyl 4-hydroxylase subunit alpha-1 [Morus notabilis]
gi|802578382|ref|XP_012069451.1|	1.3e-133	78.23	PREDICTED: probable prolyl 4-hydroxylase 4 [Jatropha curcas]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR003582	ShKT_dom
IPR005123	Oxoglu/Fe-dep_dioxygenase
IPR006620	Pro_4_hyd_alph
Vocabulary: Molecular Function
Term	Definition
GO:0016491	oxidoreductase activity
GO:0016705	oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:0005506	iron ion binding
GO:0031418	L-ascorbic acid binding
Vocabulary: Biological Process
Term	Definition
GO:0055114	oxidation-reduction process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006525 arginine metabolic process
biological_process GO:0055114 oxidation-reduction process
biological_process GO:0018401 peptidyl-proline hydroxylation to 4-hydroxy-L-proline
biological_process GO:0006560 proline metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005506 iron ion binding
molecular_function GO:0031418 L-ascorbic acid binding
molecular_function GO:0004656 procollagen-proline 4-dioxygenase activity
molecular_function GO:0016491 oxidoreductase activity
molecular_function GO:0016705 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cucsa.363400.1	Cucsa.363400.1	mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR003582	ShKT domain	PFAM	PF01549	ShK	coord: 259..300; score: 2.
IPR003582	ShKT domain	SMART	SM00254	ShkT_1	coord: 259..300; score: 0.
IPR003582	ShKT domain	PROFILE	PS51670	SHKT	coord: 260..300; score: 8
IPR005123	Oxoglutarate/iron-dependent dioxygenase	PFAM	PF13640	2OG-FeII_Oxy_3	coord: 126..246; score: 4.4
IPR005123	Oxoglutarate/iron-dependent dioxygenase	PROFILE	PS51471	FE2OG_OXY	coord: 122..247; score: 12
IPR006620	Prolyl 4-hydroxylase, alpha subunit	SMART	SM00702	p4hc	coord: 46..246; score: 1.9
None	No IPR available	PANTHER	PTHR10869	PROLYL 4-HYDROXYLASE ALPHA SUBUNIT	coord: 25..282; score: 2.7E
None	No IPR available	PANTHER	PTHR10869:SF64	OXIDOREDUCTASE, 2OG-FE(II) OXYGENASE FAMILY PROTEIN-RELATED	coord: 25..282; score: 2.7E
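
The `coord: start..end` fields give each domain's position on the protein. As a rough sketch, assuming the coordinates are 1-based and inclusive (an assumption, since the record does not state the convention) and using hypothetical helper names, the domain spans and their overlaps can be checked like this:

```python
import re

# Hypothetical helpers for the "coord: start..end" alignment fields.
# Assumption: coordinates are 1-based and inclusive on the protein.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def domain_span(alignment_field):
    """Return (start, end, length) parsed from a 'coord: a..b' field."""
    match = COORD.search(alignment_field)
    if match is None:
        raise ValueError("no coord field in: %r" % alignment_field)
    start, end = int(match.group(1)), int(match.group(2))
    return start, end, end - start + 1

def overlaps(a, b):
    """True if two (start, end, length) spans share at least one residue."""
    return a[0] <= b[1] and b[0] <= a[1]

shkt = domain_span("coord: 259..300")       # ShKT domain (PFAM PF01549)
fe2og = domain_span("coord: 122..247")      # FE2OG_OXY profile (PS51471)
p4hc = domain_span("coord: 46..246")        # p4hc (SMART SM00702)

print(shkt, fe2og, p4hc)
print(overlaps(shkt, fe2og), overlaps(fe2og, p4hc))
```

Under that assumption, the C-terminal ShKT domain does not overlap the dioxygenase region, whereas the FE2OG_OXY profile lies almost entirely within the p4hc span.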

The following gene(s) are orthologous to this gene:
Gene	Orthologue	Organism	Block
Cucsa.363400	Cla021690	Watermelon (97103) v1	cgywmB664
Cucsa.363400	Cla009353	Watermelon (97103) v1	cgywmB665
Cucsa.363400	Csa3G150820	Cucumber (Chinese Long) v2	cgycuB510
Cucsa.363400	Csa6G067350	Cucumber (Chinese Long) v2	cgycuB511
Cucsa.363400	MELO3C021380	Melon (DHL92) v3.5.1	cgymeB587
Cucsa.363400	MELO3C006592	Melon (DHL92) v3.5.1	cgymeB595
Cucsa.363400	ClCG06G005580	Watermelon (Charleston Gray)	cgywcgB619
Cucsa.363400	ClCG05G005650	Watermelon (Charleston Gray)	cgywcgB618
Cucsa.363400	CSPI03G13780	Wild cucumber (PI 183967)	cgycpiB538
Cucsa.363400	CSPI06G05250	Wild cucumber (PI 183967)	cgycpiB540
Cucsa.363400	CmaCh14G016540	Cucurbita maxima (Rimu)	cgycmaB0991
Cucsa.363400	CmaCh17G002910	Cucurbita maxima (Rimu)	cgycmaB0999
Cucsa.363400	CmaCh08G011910	Cucurbita maxima (Rimu)	cgycmaB1014
Cucsa.363400	CmoCh08G011630	Cucurbita moschata (Rifu)	cgycmoB1014
Cucsa.363400	CmoCh14G017000	Cucurbita moschata (Rifu)	cgycmoB0993
Cucsa.363400	CmoCh17G002790	Cucurbita moschata (Rifu)	cgycmoB1000
Cucsa.363400	Lsi05G014860	Bottle gourd (USVL1VR-Ls)	cgylsiB586
Cucsa.363400	Cp4.1LG12g02390	Cucurbita pepo (Zucchini)	cgycpeB0948
Cucsa.363400	Cp4.1LG17g01080	Cucurbita pepo (Zucchini)	cgycpeB0955
Cucsa.363400	Cp4.1LG03g11130	Cucurbita pepo (Zucchini)	cgycpeB0966
Cucsa.363400	MELO3C006592.2	Melon (DHL92) v3.6.1	cgymedB593
Cucsa.363400	MELO3C021380.2	Melon (DHL92) v3.6.1	cgymedB587
Cucsa.363400	CsaV3_6G005580	Cucumber (Chinese Long) v3	cgycucB556
Cucsa.363400	CsaV3_3G014000	Cucumber (Chinese Long) v3	cgycucB554
Cucsa.363400	Cla97C06G115090	Watermelon (97103) v2	cgywmbB624
Cucsa.363400	Cla97C05G087090	Watermelon (97103) v2	cgywmbB623
Cucsa.363400	Bhi12G001572	Wax gourd	cgywgoB725
Cucsa.363400	Carg17382	Silver-seed gourd	carcgyB0682
Cucsa.363400	Carg15947	Silver-seed gourd	carcgyB0384
Cucsa.363400	Carg05473	Silver-seed gourd	carcgyB0085
The following gene(s) are paralogous to this gene:
Gene	Paralogue	Organism	Block
Cucsa.363400	Cucsa.253720	Cucumber (Gy14) v1	cgycgyB131
The following block(s) are covering this gene:
Gene	Organism	Block
Cucsa.363400	Bottle gourd (USVL1VR-Ls)	cgylsiB584
Cucsa.363400	Wax gourd	cgywgoB730