CSPI04G11020 (gene) Cucumber (PI 183967) v1

Overview
NameCSPI04G11020
Typegene
OrganismCucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionCysteine dioxygenase
LocationChr4: 9326824 .. 9330527 (-)
RNA-Seq ExpressionCSPI04G11020
SyntenyCSPI04G11020
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTGGGAGGGGGGTTTTTATTGAGTTCTCCTGAGAATGGAAACGAGAGCGGTGAATCGAGGGAGTAATAGAATTGGGCATGTGAATAAGGTTCAGTATGTGAGAAGGGATTTTAAGAAGAGGAAATGTAGAAAGATCAGACGGTCTATTCCTGTAGTTCCTATGGCGCTTCAGGAACTGTTTGTTTCTTGTAGGGAAGTCTTCAAAGGCCCTGGAACTGTTCCACTGCCTTGTGATGTTGAAAAACTCTGCCGCATTCTTGGTAATATATGTTTCACAGTTTGAATCCTATTTTGTTTCATCCTTTTCATTCTTGGGGTGTTTAATTTCTTAATGTTTTAACAATCTCTAGGAAATTAGCTTAGTTCGATCCTTTGATTTATCTACTTTTGTCTTTTCATTTTTGGCTAAGTGACATGGATTGGAAAGTTCTGAGGTTCTACGGCATTACTATTTCTGGCTACATGCATAATTTATACAACAGTGGAATTTCTTTTCTTTACTCTGTTTCTTAAGCAATTAATTCTTGATCTTCCCTTTATTGATTTGTCATATCATATCATGCTTATATGTTAATTTCACCCCTATGCTCTTGAATTGTTATAGGATTCTTTTAATTTTAGACTGAAACCCATATATATATATAGAGAGAGAGAGAGAGAGAGGTTTTCTTCTCTCGTGGAAAATTGGATGAAAAACCCTCCGATTATTTTATAAATTTTATCAGATGAACTTATTTAAATTATCTTTCTGATGCATTTTTCCCTTTTTTAATCATAATATCACACACTTCGATGCAGTAACATTTACTTGAGTGAACATGAATTATTTGATTTTCTTTTTTTATTAGCCTCGCGGGACATATTTGAGCCACATAGTTTCTGTTTAACTAGCAGATTTGAAATATTAAAAAACAAAGGGAATTTGGGGGACTATTTCAAATCCATGGTCAACTCATGATTGTCAAATGAAGATTAGTGAAATTATGGAATAATAATAAAAAGGCAAAATTATAATACCTTGAACTTTCGCATTTGTTTCAAACGGATCTGTATATTTTTCGAACTAATATTGTTTTAGGCCATTGCCTCACTATTTTGGTCCTAAATTTTTTTCTAAATTTTTAAAGATATAGGCTCTAAATCTAATGATAGTTCTGTAACCTTTATTGAAAAGTGAAAGGTAAGATTAATAAAGGTCAATGGTAAGATTGCCGCTTTTGAAAGTATAGTGGTGATTTTGCAAAAAAAGAGGAAATTCAAGGTTTTTTTTTTTTTTTTTTTTTTTTTTAATTAATTTAACATAATAAAAAATGTAGCAGATTCGTAGGAATAGATTCATCTGTCTGAACAGTATAGTATTTTTTGTATTTGTTTACATCGTACAATCATAATACATCTCTCTCTTTCTCTCTAATTAACAAATATTTTTTTTTATGTTGTGCTTTGTTGTGCCATGGGGGCACATGATCAGTGATGACTTATGAGCATTCTTTGATAAGCCCATGGTTTTTGGGAGAAATTGTAGGGTCCAGGATCATTGGATTTTAAACTTCTGTACTCTGTATTCCTTAGCATATACACATAGTTAAATATTGACCAAATTCTTTTTACTTTTTCAATGCTGCAGATAATATGAAGGCAGAAGATGTTGGACTTAGTAGTAGTTTGCAGTTTTTCAAGCCCAATGTTCCAGTCAAAGGATCCCCTAGAGTCACATACACAACCATATACAAGTGCGACAATTTCTCGGTAAGGCTAATGTAGTGCCTTTCACTAGTTCTGGGCTGCTTTTCGTCTAAAGCGACCTATGCTATGTTTAAAAGGGAAAGCATATATAAACCACGAGATTAATCAAATTGATATAAATCATGAAATTACTTATGAGCATCCAAGTTGAATATAGGGCAAGCTTAAATATCATAAGTTAAATATATTAGTCAAAGTGCACAAAATATATGGGTTGGCCCAAGAGCTAGTGGACAAAGGAGCAAGTTAATAAACTGTGACATTATTGTTAAACCTAAGTTGGCTAACATATGTCCTTAGCCAATAGGTCAGAAGTTAATACTCCTATTTATGTAAGATTGAAAAAAAACAATAGTAAGATTATAGGAGAATAACTAGAGTGATCAAACTTATGAACTATAAGTGAAAGTATGTACAGCACACTTGTAAGTCGCTGAAAACTACAAATCTAGTTGAGCTGTGTGTATATTCTTCTATAGGTTTTGTAACATCTTTAAATTTTATTTCCAAAATGTGTAAACAACATTTGATTGCTTGGTTCCAAATTCCTGACGGCATATCTTTTTTCTTCTTACTACAGTTGTGCATCTTCTTCCTCCCTGCAACTGGTGTAATCCCTCTACACAACCATCCTGGAATGACTGTTTTCAGTAAGCTTCTTTTGGGGAAAATGCACATCAAATCTTATGATTGGGTTGATCCAACTAACAGTGATGATACTGCCCAACCTTGTGAAAGTGAGCGCTCTGCAAGTTTCTTATTATAAAATATGAAATTGTTGGGAGAAATTCTGGTTGTTTATCCTGAAGTGATGATGGATTGTTTTCTTTTTTCCCTTTTTGCAGAGAGATTGGCAAAGCTGAAAGCCGATGCTGTCTTCACTTCACCCTGCAGTACCTCTGTTTTGTACCCAACATCAGGAGGCAACATCCACTCATTCACTGCTATAACGCCATGTGCGGTGCTTGATGTGCTTGGACCTCCTTATTCCATGGAGGATGGTCGAGATTGTTCGTACTATAAGGAACATCCCTATGCCTCTTTTCCAAGTAAGTTGCAAATGATGTTTTCTTATTTAATCAGTTAAGTTGACTTTAAAACACTTCAAATGATGCAAAAAGGGACAGCTGTTTTAATTGGCTATGTGTAAACTTTGGATATCAGGTTGTTTGTTCTTGTGTTTTCGTTCTGTTTTTGGGAGCGATTTTGTAAGGGTAAAATCTGCAGTAATCCTTTTAAAATTGAAAAAGGAAAAGAAACTGTTTTTTGAAAGTGACTCTTTTTGTCATGTGGACTTTTTCTGGTAAAATAACATTCAGATAGTGAAATTTTGACAAGTTCATGAAAGAGTACTTTGGGTGGGACATTGTTGAATGTTGCTGTTATTTTTATTATTTTTAATTGAACTGTAATTCTTACTTAATTTTTTCCCTCCTAAAATAATAACAAGAACTGTTTCTGAAAACCCCCATTTTCTCATACTTTATTATGCTATTTTTGTTGAGGTTATGGGATCGTTTCTGTATTTGGAGTATAATAGGGGAATTCGAACCACATACCTATGGTTTTGGTTATTCACTCTTTCTCATCCTGACTTATATAAGTACCATATAGAGTTCATAATTTTATGATTTTGCACTATTCTTATTCTTATAAATTAAATGTTTTGTAATGCAGATGGTGACATGGGATTGGGAGAAGAAGATCAGGGTGAGGGTTATGGATGGTTAGAAGAGATTGAGGTGCCAGAAAACTCTGAAATGGATGGAATAGAATACTTAGGCCCTCAAATCTGTGACATTTAAGCCTAGTTTTGTTTTGTTTACTTTGCCTCACATTTATTTACCTCCACTTCACAATGCCCTTCCCTCTTTTTCTTTTATTTATTTGTTTATATCTTTCATCTTCATCTTCATCTTGTTTTCTGGGGTTTTAAACTTTTCTTTCTTTGACTGTAG

mRNA sequence

GTGGGAGGGGGGTTTTTATTGAGTTCTCCTGAGAATGGAAACGAGAGCGGTGAATCGAGGGAGTAATAGAATTGGGCATGTGAATAAGGTTCAGTATGTGAGAAGGGATTTTAAGAAGAGGAAATGTAGAAAGATCAGACGGTCTATTCCTGTAGTTCCTATGGCGCTTCAGGAACTGTTTGTTTCTTGTAGGGAAGTCTTCAAAGGCCCTGGAACTGTTCCACTGCCTTGTGATGTTGAAAAACTCTGCCGCATTCTTGATAATATGAAGGCAGAAGATGTTGGACTTAGTAGTAGTTTGCAGTTTTTCAAGCCCAATGTTCCAGTCAAAGGATCCCCTAGAGTCACATACACAACCATATACAAGTGCGACAATTTCTCGTTGTGCATCTTCTTCCTCCCTGCAACTGGTGTAATCCCTCTACACAACCATCCTGGAATGACTGTTTTCAGTAAGCTTCTTTTGGGGAAAATGCACATCAAATCTTATGATTGGGTTGATCCAACTAACAGTGATGATACTGCCCAACCTTGTGAAAAGAGATTGGCAAAGCTGAAAGCCGATGCTGTCTTCACTTCACCCTGCAGTACCTCTGTTTTGTACCCAACATCAGGAGGCAACATCCACTCATTCACTGCTATAACGCCATGTGCGGTGCTTGATGTGCTTGGACCTCCTTATTCCATGGAGGATGGTCGAGATTGTTCGTACTATAAGGAACATCCCTATGCCTCTTTTCCAAATGGTGACATGGGATTGGGAGAAGAAGATCAGGGTGAGGGTTATGGATGGTTAGAAGAGATTGAGGTGCCAGAAAACTCTGAAATGGATGGAATAGAATACTTAGGCCCTCAAATCTGTGACATTTAAGCCTAGTTTTGTTTTGTTTACTTTGCCTCACATTTATTTACCTCCACTTCACAATGCCCTTCCCTCTTTTTCTTTTATTTATTTGTTTATATCTTTCATCTTCATCTTCATCTTGTTTTCTGGGGTTTTAAACTTTTCTTTCTTTGACTGTAG

Coding sequence (CDS)

ATGGAAACGAGAGCGGTGAATCGAGGGAGTAATAGAATTGGGCATGTGAATAAGGTTCAGTATGTGAGAAGGGATTTTAAGAAGAGGAAATGTAGAAAGATCAGACGGTCTATTCCTGTAGTTCCTATGGCGCTTCAGGAACTGTTTGTTTCTTGTAGGGAAGTCTTCAAAGGCCCTGGAACTGTTCCACTGCCTTGTGATGTTGAAAAACTCTGCCGCATTCTTGATAATATGAAGGCAGAAGATGTTGGACTTAGTAGTAGTTTGCAGTTTTTCAAGCCCAATGTTCCAGTCAAAGGATCCCCTAGAGTCACATACACAACCATATACAAGTGCGACAATTTCTCGTTGTGCATCTTCTTCCTCCCTGCAACTGGTGTAATCCCTCTACACAACCATCCTGGAATGACTGTTTTCAGTAAGCTTCTTTTGGGGAAAATGCACATCAAATCTTATGATTGGGTTGATCCAACTAACAGTGATGATACTGCCCAACCTTGTGAAAAGAGATTGGCAAAGCTGAAAGCCGATGCTGTCTTCACTTCACCCTGCAGTACCTCTGTTTTGTACCCAACATCAGGAGGCAACATCCACTCATTCACTGCTATAACGCCATGTGCGGTGCTTGATGTGCTTGGACCTCCTTATTCCATGGAGGATGGTCGAGATTGTTCGTACTATAAGGAACATCCCTATGCCTCTTTTCCAAATGGTGACATGGGATTGGGAGAAGAAGATCAGGGTGAGGGTTATGGATGGTTAGAAGAGATTGAGGTGCCAGAAAACTCTGAAATGGATGGAATAGAATACTTAGGCCCTCAAATCTGTGACATTTAA

Protein sequence

METRAVNRGSNRIGHVNKVQYVRRDFKKRKCRKIRRSIPVVPMALQELFVSCREVFKGPGTVPLPCDVEKLCRILDNMKAEDVGLSSSLQFFKPNVPVKGSPRVTYTTIYKCDNFSLCIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCEKRLAKLKADAVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYASFPNGDMGLGEEDQGEGYGWLEEIEVPENSEMDGIEYLGPQICDI*
Homology
BLAST of CSPI04G11020 vs. ExPASy Swiss-Prot
Match: Q9LXG9 (Plant cysteine oxidase 1 OS=Arabidopsis thaliana OX=3702 GN=PCO1 PE=1 SV=1)

HSP 1 Score: 246.1 bits (627), Expect = 4.6e-64
Identity = 122/241 (50.62%), Postives = 164/241 (68.05%), Query Frame = 0

Query: 44  ALQELFVSCREVFK--GPGTVPLPCDVEKLCRILDNMKAEDVGLSSSLQFFKPN--VPVK 103
           A++ LF +C+EVF   GPG +P    +++L  ILD+MK EDVGL+ ++ +F+PN  V  +
Sbjct: 57  AVRRLFNTCKEVFSNGGPGVIPSEDKIQQLREILDDMKPEDVGLTPTMPYFRPNSGVEAR 116

Query: 104 GSPRVTYTTIYKCDNFSLCIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTN 163
            SP +TY  +++CD FS+ IF LP +GVIPLHNHPGMTVFSKLL G MHIKSYDWV    
Sbjct: 117 SSPPITYLHLHQCDQFSIGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKSYDWV---- 176

Query: 164 SDDTAQPCEKRLAKLKADAVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSME 223
            D   +  + RLAKLK D+ FT+PC+ S+LYP  GGN+H FTAIT CAVLDVLGPPY   
Sbjct: 177 VDAPMRDSKTRLAKLKVDSTFTAPCNASILYPEDGGNMHRFTAITACAVLDVLGPPYCNP 236

Query: 224 DGRDCSYYKEHPYASFPNGDMG-LGEEDQGEGYGWLEEIE--VPENSEMDGIEYLGPQIC 278
           +GR C+Y+ E P     + D   L  E++ EGY WL+E +    +++ + G  Y GP++ 
Sbjct: 237 EGRHCTYFLEFPLDKLSSEDDDVLSSEEEKEGYAWLQERDDNPEDHTNVVGALYRGPKVE 293

BLAST of CSPI04G11020 vs. ExPASy Swiss-Prot
Match: Q8LGJ5 (Plant cysteine oxidase 2 OS=Arabidopsis thaliana OX=3702 GN=PCO2 PE=1 SV=1)

HSP 1 Score: 241.9 bits (616), Expect = 8.7e-63
Identity = 126/245 (51.43%), Postives = 165/245 (67.35%), Query Frame = 0

Query: 35  RRSIPVVPMALQELFVSCREVFKG--PGTVPLPCDVEKLCRILDNMKAEDVGLSSSLQFF 94
           RRS   +   +Q+LF +C++VF     GTVP   ++E L  +LD +K EDVG++  + +F
Sbjct: 37  RRSKKTLICPVQKLFDTCKKVFADGKSGTVPSQENIEMLRAVLDEIKPEDVGVNPKMSYF 96

Query: 95  KPNVPVKGSPRVTYTTIYKCDNFSLCIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSY 154
           +  V  + SP VTY  IY C  FS+CIF LP +GVIPLHNHP MTVFSKLL G MHIKSY
Sbjct: 97  RSTVTGR-SPLVTYLHIYACHRFSICIFCLPPSGVIPLHNHPEMTVFSKLLFGTMHIKSY 156

Query: 155 DWVDPTNSDDTAQP-CEKRLAKLKADAVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDV 214
           DWV      D+ QP  + RLAK+K D+ FT+PC TS+LYP  GGN+H FTA T CAVLDV
Sbjct: 157 DWV-----PDSPQPSSDTRLAKVKVDSDFTAPCDTSILYPADGGNMHCFTAKTACAVLDV 216

Query: 215 LGPPYSMEDGRDCSYYKEHPYASFPNGDMGLGEEDQGEGYGWLEE-IEVPENSEMDGIEY 274
           +GPPYS   GR C+YY ++P++SF    + + EE++ EGY WL+E  E PE+  +  + Y
Sbjct: 217 IGPPYSDPAGRHCTYYFDYPFSSFSVDGVVVAEEEK-EGYAWLKEREEKPEDLTVTALMY 274

Query: 275 LGPQI 276
            GP I
Sbjct: 277 SGPTI 274

BLAST of CSPI04G11020 vs. ExPASy Swiss-Prot
Match: Q1G3U6 (Plant cysteine oxidase 3 OS=Arabidopsis thaliana OX=3702 GN=PCO3 PE=1 SV=1)

HSP 1 Score: 206.1 bits (523), Expect = 5.3e-52
Identity = 102/247 (41.30%), Postives = 142/247 (57.49%), Query Frame = 0

Query: 45  LQELFVSCREVFKGPGTVPLPCDVEKLCRILDNMKAEDVGLSSSLQFFKPNVPVKGSPR- 104
           +QEL+  C+E F G    P    ++KLC +LD++   DVGL    Q       V G  R 
Sbjct: 35  VQELYDLCKETFTGKAPSPASMAIQKLCSVLDSVSPADVGLEEVSQDDDRGYGVSGVSRF 94

Query: 105 ---------VTYTTIYKCDNFSLCIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDW 164
                    +T+  I++CD F++CIF  P + VIPLH+HP M VFSK+L G +H+K+YDW
Sbjct: 95  NRVGRWAQPITFLDIHECDTFTMCIFCFPTSSVIPLHDHPEMAVFSKILYGSLHVKAYDW 154

Query: 165 VDP----TNSDDTAQPCEKRLAKLKADAVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLD 224
           V+P    T           RLAKL +D V T       LYP +GGN+H FTA+TPCAVLD
Sbjct: 155 VEPPCIITQDKGVPGSLPARLAKLVSDKVITPQSEIPALYPKTGGNLHCFTALTPCAVLD 214

Query: 225 VLGPPYSMEDGRDCSYYKEHPYASFPNGDMGLGEEDQG--EGYGWLEEIEVPENSEMDGI 276
           +L PPY    GR CSYY ++P+++F   + G+ + D+G  + Y WL +I+ P++  M   
Sbjct: 215 ILSPPYKESVGRSCSYYMDYPFSTFAL-ENGMKKVDEGKEDEYAWLVQIDTPDDLHMRPG 274

BLAST of CSPI04G11020 vs. ExPASy Swiss-Prot
Match: Q9LXT4 (Plant cysteine oxidase 5 OS=Arabidopsis thaliana OX=3702 GN=PCO5 PE=1 SV=1)

HSP 1 Score: 191.0 bits (484), Expect = 1.8e-47
Identity = 98/244 (40.16%), Postives = 139/244 (56.97%), Query Frame = 0

Query: 41  VPMALQELFVSCREVFKGPGTVPLPCDVEKLCRILDNMKAEDVGLSSSLQFFKPNVPVKG 100
           +P  +Q LF +C+      G V     ++K+  +L+ +K  DVGL    Q  + N P  G
Sbjct: 1   MPYFIQRLFNTCKSSLSPNGPVSEEA-LDKVRNVLEKIKPSDVGLEQEAQLVR-NWPGPG 60

Query: 101 S---------PRVTYTTIYKCDNFSLCIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKS 160
           +         P + Y  +++CD+FS+ IF +P   +IPLHNHPGMTV SKL+ G MH+KS
Sbjct: 61  NERNGNHHSLPAIKYLQLHECDSFSIGIFCMPPGSIIPLHNHPGMTVLSKLVYGSMHVKS 120

Query: 161 YDWVDPTNSDDTAQPCEKRLAKLKADAVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDV 220
           YDW +P  S +   P + R AKL  D   TSP   + LYPT+GGNIH F AIT CA+ D+
Sbjct: 121 YDWAEPDQS-ELDDPLQARPAKLVKDIDMTSPSPATTLYPTTGGNIHCFKAITHCAIFDI 180

Query: 221 LGPPYSMEDGRDCSYYKEHPYASFPNGDMGLGEEDQGEGYGWLEEIEVPENSEMDGIEYL 276
           L PPYS   GR C+Y+++ P    P G++ +   +      WLEE + P+N  +  + Y 
Sbjct: 181 LSPPYSSTHGRHCNYFRKSPMLDLP-GEIEVMNGEVISNVTWLEEYQPPDNFVIWRVPYR 240

BLAST of CSPI04G11020 vs. ExPASy Swiss-Prot
Match: Q9SJI9 (Plant cysteine oxidase 4 OS=Arabidopsis thaliana OX=3702 GN=PCO4 PE=1 SV=2)

HSP 1 Score: 179.5 bits (454), Expect = 5.3e-44
Identity = 93/243 (38.27%), Postives = 137/243 (56.38%), Query Frame = 0

Query: 41  VPMALQELFVSCREVFKGPGTVPLPCDVEKLCRILDNMKAEDVGLSSSLQFFKP------ 100
           +P   Q L+ +C+  F   G +     +EK+  +L+ +K  DVG+    Q  +       
Sbjct: 1   MPYFAQRLYNTCKASFSSDGPITEDA-LEKVRNVLEKIKPSDVGIEQDAQLARSRSGPLN 60

Query: 101 --NVPVKGSPRVTYTTIYKCDNFSLCIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSY 160
             N   +  P + Y  +++CD+FS+ IF +P + +IPLHNHPGMTV SKL+ G MH+KSY
Sbjct: 61  ERNGSNQSPPAIKYLHLHECDSFSIGIFCMPPSSMIPLHNHPGMTVLSKLVYGSMHVKSY 120

Query: 161 DWVDPTNSDDTAQPCEKRLAKLKADAVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVL 220
           DW++P    +   P + R AKL  D   T+    + LYP SGGNIH F AIT CA+LD+L
Sbjct: 121 DWLEP-QLTEPEDPSQARPAKLVKDTEMTAQSPVTTLYPKSGGNIHCFKAITHCAILDIL 180

Query: 221 GPPYSMEDGRDCSYYKEHPYASFPNGDMGLGEEDQGEGYGWLEEIEVPENSEMDGIEYLG 276
            PPYS E  R C+Y+++      P G++ +  E   +   WLEE + P++  +  I Y G
Sbjct: 181 APPYSSEHDRHCTYFRKSRREDLP-GELEVDGEVVTD-VTWLEEFQPPDDFVIRRIPYRG 239

BLAST of CSPI04G11020 vs. ExPASy TrEMBL
Match: A0A0A0KWK6 (Cysteine dioxygenase OS=Cucumis sativus OX=3659 GN=Csa_4G188410 PE=3 SV=1)

HSP 1 Score: 587.0 bits (1512), Expect = 4.1e-164
Identity = 278/278 (100.00%), Postives = 278/278 (100.00%), Query Frame = 0

Query: 1   METRAVNRGSNRIGHVNKVQYVRRDFKKRKCRKIRRSIPVVPMALQELFVSCREVFKGPG 60
           METRAVNRGSNRIGHVNKVQYVRRDFKKRKCRKIRRSIPVVPMALQELFVSCREVFKGPG
Sbjct: 1   METRAVNRGSNRIGHVNKVQYVRRDFKKRKCRKIRRSIPVVPMALQELFVSCREVFKGPG 60

Query: 61  TVPLPCDVEKLCRILDNMKAEDVGLSSSLQFFKPNVPVKGSPRVTYTTIYKCDNFSLCIF 120
           TVPLPCDVEKLCRILDNMKAEDVGLSSSLQFFKPNVPVKGSPRVTYTTIYKCDNFSLCIF
Sbjct: 61  TVPLPCDVEKLCRILDNMKAEDVGLSSSLQFFKPNVPVKGSPRVTYTTIYKCDNFSLCIF 120

Query: 121 FLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCEKRLAKLKADAVF 180
           FLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCEKRLAKLKADAVF
Sbjct: 121 FLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCEKRLAKLKADAVF 180

Query: 181 TSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYASFPNGDM 240
           TSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYASFPNGDM
Sbjct: 181 TSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYASFPNGDM 240

Query: 241 GLGEEDQGEGYGWLEEIEVPENSEMDGIEYLGPQICDI 279
           GLGEEDQGEGYGWLEEIEVPENSEMDGIEYLGPQICDI
Sbjct: 241 GLGEEDQGEGYGWLEEIEVPENSEMDGIEYLGPQICDI 278

BLAST of CSPI04G11020 vs. ExPASy TrEMBL
Match: A0A1S3B5D6 (Cysteine dioxygenase OS=Cucumis melo OX=3656 GN=LOC103486005 PE=3 SV=1)

HSP 1 Score: 556.6 bits (1433), Expect = 5.9e-155
Identity = 267/279 (95.70%), Postives = 274/279 (98.21%), Query Frame = 0

Query: 1   METRAVNRGSNRIGHVNKVQYVRRDFKKRKCRKIRR-SIPVVPMALQELFVSCREVFKGP 60
           METRAV+RGSNRIGHVNKVQYVRRDFKKRKCRKI+R SIPVVPMALQELFVSCREVFKGP
Sbjct: 1   METRAVDRGSNRIGHVNKVQYVRRDFKKRKCRKIKRPSIPVVPMALQELFVSCREVFKGP 60

Query: 61  GTVPLPCDVEKLCRILDNMKAEDVGLSSSLQFFKPNVPVKGSPRVTYTTIYKCDNFSLCI 120
           GTVPLPCDVEKLC ILDNMKAEDVGLSS+LQFFKPNVPVKGSPRVTYTTIY+CDNFSLCI
Sbjct: 61  GTVPLPCDVEKLCCILDNMKAEDVGLSSNLQFFKPNVPVKGSPRVTYTTIYRCDNFSLCI 120

Query: 121 FFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCEKRLAKLKADAV 180
           FFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCE+RLAKLKADAV
Sbjct: 121 FFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCERRLAKLKADAV 180

Query: 181 FTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYASFPNGD 240
           FTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYASFPN D
Sbjct: 181 FTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYASFPNCD 240

Query: 241 MGLGEEDQGEGYGWLEEIEVPENSEMDGIEYLGPQICDI 279
           MGLGEE +GEGYGWLEEIEVPENS+MDGIEYLGPQI DI
Sbjct: 241 MGLGEE-EGEGYGWLEEIEVPENSQMDGIEYLGPQISDI 278

BLAST of CSPI04G11020 vs. ExPASy TrEMBL
Match: A0A5D3DUS9 (Cysteine dioxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold95G00520 PE=3 SV=1)

HSP 1 Score: 555.1 bits (1429), Expect = 1.7e-154
Identity = 266/279 (95.34%), Postives = 273/279 (97.85%), Query Frame = 0

Query: 1   METRAVNRGSNRIGHVNKVQYVRRDFKKRKCRKIRR-SIPVVPMALQELFVSCREVFKGP 60
           METRAV+RGSNRIGHVNKVQYVRRDFKKRKCRKI+R  IPVVPMALQELFVSCREVFKGP
Sbjct: 1   METRAVDRGSNRIGHVNKVQYVRRDFKKRKCRKIKRPPIPVVPMALQELFVSCREVFKGP 60

Query: 61  GTVPLPCDVEKLCRILDNMKAEDVGLSSSLQFFKPNVPVKGSPRVTYTTIYKCDNFSLCI 120
           GTVPLPCDVEKLC ILDNMKAEDVGLSS+LQFFKPNVPVKGSPRVTYTTIY+CDNFSLCI
Sbjct: 61  GTVPLPCDVEKLCCILDNMKAEDVGLSSNLQFFKPNVPVKGSPRVTYTTIYRCDNFSLCI 120

Query: 121 FFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCEKRLAKLKADAV 180
           FFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCE+RLAKLKADAV
Sbjct: 121 FFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCERRLAKLKADAV 180

Query: 181 FTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYASFPNGD 240
           FTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYASFPNGD
Sbjct: 181 FTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYASFPNGD 240

Query: 241 MGLGEEDQGEGYGWLEEIEVPENSEMDGIEYLGPQICDI 279
           MGLGEE +GEGY WLEEIEVPENS+MDGIEYLGPQI DI
Sbjct: 241 MGLGEE-EGEGYRWLEEIEVPENSQMDGIEYLGPQISDI 278

BLAST of CSPI04G11020 vs. ExPASy TrEMBL
Match: A0A5A7TK65 (Cysteine dioxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold67G001220 PE=3 SV=1)

HSP 1 Score: 548.9 bits (1413), Expect = 1.2e-152
Identity = 266/284 (93.66%), Postives = 273/284 (96.13%), Query Frame = 0

Query: 1   METRAVNRGSNRIGHVNKVQYVRRDFKKRKCRKIRR-SIPVVPMALQELFVSCREVFKGP 60
           METRAV+RGSNRIGHVNKVQYVRRDFKKRKCRKI+R  IPVVPMALQELFVSCREVFKGP
Sbjct: 1   METRAVDRGSNRIGHVNKVQYVRRDFKKRKCRKIKRPPIPVVPMALQELFVSCREVFKGP 60

Query: 61  GTVPLPCDVEKLCRILDNMKAEDVGLSSSLQFFKPNVPVKGSPRVTYTTIYKCDNFSLCI 120
           GTVPLPCDVEKLC ILDNMKAEDVGLSS+LQFFKPNVPVKGSPRVTYTTIY+CDNFSLCI
Sbjct: 61  GTVPLPCDVEKLCCILDNMKAEDVGLSSNLQFFKPNVPVKGSPRVTYTTIYRCDNFSLCI 120

Query: 121 FFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCE-----KRLAKL 180
           FFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCE     +RLAKL
Sbjct: 121 FFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCESERFARRLAKL 180

Query: 181 KADAVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYAS 240
           KADAVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYAS
Sbjct: 181 KADAVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYAS 240

Query: 241 FPNGDMGLGEEDQGEGYGWLEEIEVPENSEMDGIEYLGPQICDI 279
           FPNGDMGLGEE +GEGY WLEEIEVPENS+MDGIEYLGPQI DI
Sbjct: 241 FPNGDMGLGEE-EGEGYRWLEEIEVPENSQMDGIEYLGPQISDI 283

BLAST of CSPI04G11020 vs. ExPASy TrEMBL
Match: A0A6J1HS36 (Cysteine dioxygenase OS=Cucurbita maxima OX=3661 GN=LOC111466078 PE=3 SV=1)

HSP 1 Score: 515.8 bits (1327), Expect = 1.1e-142
Identity = 245/277 (88.45%), Postives = 262/277 (94.58%), Query Frame = 0

Query: 2   ETRAVNRGSNRIGHVNKVQYVRRDFKKRKCRKIRRSIPVVPMALQELFVSCREVFKGPGT 61
           ETRAV+RGS RIGHVNKVQYV+RD KKRKCR+++RS+PVVPMALQELFVSCR+VFKGPGT
Sbjct: 4   ETRAVDRGSIRIGHVNKVQYVKRDIKKRKCRRMKRSVPVVPMALQELFVSCRDVFKGPGT 63

Query: 62  VPLPCDVEKLCRILDNMKAEDVGLSSSLQFFKPNVPVKG-SPRVTYTTIYKCDNFSLCIF 121
           VPLPCDV+KLCRILDNMKAEDVGLSS+LQFF PNV VKG S RVT TTIYKCDNFSLCIF
Sbjct: 64  VPLPCDVDKLCRILDNMKAEDVGLSSNLQFFNPNVQVKGSSSRVTCTTIYKCDNFSLCIF 123

Query: 122 FLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCEKRLAKLKADAVF 181
           FLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTN+DD AQP +KRLAKLKAD VF
Sbjct: 124 FLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNTDDPAQPRQKRLAKLKADTVF 183

Query: 182 TSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYASFPNGDM 241
           TSPCSTSVLYPT+GGNIHSFTA+TPCAVLDV+GPPYSMEDGRDCSYYKEHPYASFPNG++
Sbjct: 184 TSPCSTSVLYPTTGGNIHSFTAVTPCAVLDVIGPPYSMEDGRDCSYYKEHPYASFPNGEV 243

Query: 242 GLGEEDQGEGYGWLEEIEVPENSEMDGIEYLGPQICD 278
            L EE +GEGYGWLEEIEVPENS MDGIEYLGPQI D
Sbjct: 244 ALTEE-EGEGYGWLEEIEVPENSHMDGIEYLGPQIID 279

BLAST of CSPI04G11020 vs. NCBI nr
Match: XP_004149110.1 (plant cysteine oxidase 1 [Cucumis sativus] >KGN53913.1 hypothetical protein Csa_019158 [Cucumis sativus])

HSP 1 Score: 587.0 bits (1512), Expect = 8.4e-164
Identity = 278/278 (100.00%), Postives = 278/278 (100.00%), Query Frame = 0

Query: 1   METRAVNRGSNRIGHVNKVQYVRRDFKKRKCRKIRRSIPVVPMALQELFVSCREVFKGPG 60
           METRAVNRGSNRIGHVNKVQYVRRDFKKRKCRKIRRSIPVVPMALQELFVSCREVFKGPG
Sbjct: 1   METRAVNRGSNRIGHVNKVQYVRRDFKKRKCRKIRRSIPVVPMALQELFVSCREVFKGPG 60

Query: 61  TVPLPCDVEKLCRILDNMKAEDVGLSSSLQFFKPNVPVKGSPRVTYTTIYKCDNFSLCIF 120
           TVPLPCDVEKLCRILDNMKAEDVGLSSSLQFFKPNVPVKGSPRVTYTTIYKCDNFSLCIF
Sbjct: 61  TVPLPCDVEKLCRILDNMKAEDVGLSSSLQFFKPNVPVKGSPRVTYTTIYKCDNFSLCIF 120

Query: 121 FLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCEKRLAKLKADAVF 180
           FLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCEKRLAKLKADAVF
Sbjct: 121 FLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCEKRLAKLKADAVF 180

Query: 181 TSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYASFPNGDM 240
           TSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYASFPNGDM
Sbjct: 181 TSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYASFPNGDM 240

Query: 241 GLGEEDQGEGYGWLEEIEVPENSEMDGIEYLGPQICDI 279
           GLGEEDQGEGYGWLEEIEVPENSEMDGIEYLGPQICDI
Sbjct: 241 GLGEEDQGEGYGWLEEIEVPENSEMDGIEYLGPQICDI 278

BLAST of CSPI04G11020 vs. NCBI nr
Match: XP_008442017.1 (PREDICTED: plant cysteine oxidase 2-like [Cucumis melo])

HSP 1 Score: 556.6 bits (1433), Expect = 1.2e-154
Identity = 267/279 (95.70%), Postives = 274/279 (98.21%), Query Frame = 0

Query: 1   METRAVNRGSNRIGHVNKVQYVRRDFKKRKCRKIRR-SIPVVPMALQELFVSCREVFKGP 60
           METRAV+RGSNRIGHVNKVQYVRRDFKKRKCRKI+R SIPVVPMALQELFVSCREVFKGP
Sbjct: 1   METRAVDRGSNRIGHVNKVQYVRRDFKKRKCRKIKRPSIPVVPMALQELFVSCREVFKGP 60

Query: 61  GTVPLPCDVEKLCRILDNMKAEDVGLSSSLQFFKPNVPVKGSPRVTYTTIYKCDNFSLCI 120
           GTVPLPCDVEKLC ILDNMKAEDVGLSS+LQFFKPNVPVKGSPRVTYTTIY+CDNFSLCI
Sbjct: 61  GTVPLPCDVEKLCCILDNMKAEDVGLSSNLQFFKPNVPVKGSPRVTYTTIYRCDNFSLCI 120

Query: 121 FFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCEKRLAKLKADAV 180
           FFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCE+RLAKLKADAV
Sbjct: 121 FFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCERRLAKLKADAV 180

Query: 181 FTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYASFPNGD 240
           FTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYASFPN D
Sbjct: 181 FTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYASFPNCD 240

Query: 241 MGLGEEDQGEGYGWLEEIEVPENSEMDGIEYLGPQICDI 279
           MGLGEE +GEGYGWLEEIEVPENS+MDGIEYLGPQI DI
Sbjct: 241 MGLGEE-EGEGYGWLEEIEVPENSQMDGIEYLGPQISDI 278

BLAST of CSPI04G11020 vs. NCBI nr
Match: TYK27060.1 (plant cysteine oxidase 2-like [Cucumis melo var. makuwa])

HSP 1 Score: 555.1 bits (1429), Expect = 3.5e-154
Identity = 266/279 (95.34%), Postives = 273/279 (97.85%), Query Frame = 0

Query: 1   METRAVNRGSNRIGHVNKVQYVRRDFKKRKCRKIRR-SIPVVPMALQELFVSCREVFKGP 60
           METRAV+RGSNRIGHVNKVQYVRRDFKKRKCRKI+R  IPVVPMALQELFVSCREVFKGP
Sbjct: 1   METRAVDRGSNRIGHVNKVQYVRRDFKKRKCRKIKRPPIPVVPMALQELFVSCREVFKGP 60

Query: 61  GTVPLPCDVEKLCRILDNMKAEDVGLSSSLQFFKPNVPVKGSPRVTYTTIYKCDNFSLCI 120
           GTVPLPCDVEKLC ILDNMKAEDVGLSS+LQFFKPNVPVKGSPRVTYTTIY+CDNFSLCI
Sbjct: 61  GTVPLPCDVEKLCCILDNMKAEDVGLSSNLQFFKPNVPVKGSPRVTYTTIYRCDNFSLCI 120

Query: 121 FFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCEKRLAKLKADAV 180
           FFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCE+RLAKLKADAV
Sbjct: 121 FFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCERRLAKLKADAV 180

Query: 181 FTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYASFPNGD 240
           FTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYASFPNGD
Sbjct: 181 FTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYASFPNGD 240

Query: 241 MGLGEEDQGEGYGWLEEIEVPENSEMDGIEYLGPQICDI 279
           MGLGEE +GEGY WLEEIEVPENS+MDGIEYLGPQI DI
Sbjct: 241 MGLGEE-EGEGYRWLEEIEVPENSQMDGIEYLGPQISDI 278

BLAST of CSPI04G11020 vs. NCBI nr
Match: KAA0041769.1 (plant cysteine oxidase 2-like [Cucumis melo var. makuwa])

HSP 1 Score: 548.9 bits (1413), Expect = 2.5e-152
Identity = 266/284 (93.66%), Postives = 273/284 (96.13%), Query Frame = 0

Query: 1   METRAVNRGSNRIGHVNKVQYVRRDFKKRKCRKIRR-SIPVVPMALQELFVSCREVFKGP 60
           METRAV+RGSNRIGHVNKVQYVRRDFKKRKCRKI+R  IPVVPMALQELFVSCREVFKGP
Sbjct: 1   METRAVDRGSNRIGHVNKVQYVRRDFKKRKCRKIKRPPIPVVPMALQELFVSCREVFKGP 60

Query: 61  GTVPLPCDVEKLCRILDNMKAEDVGLSSSLQFFKPNVPVKGSPRVTYTTIYKCDNFSLCI 120
           GTVPLPCDVEKLC ILDNMKAEDVGLSS+LQFFKPNVPVKGSPRVTYTTIY+CDNFSLCI
Sbjct: 61  GTVPLPCDVEKLCCILDNMKAEDVGLSSNLQFFKPNVPVKGSPRVTYTTIYRCDNFSLCI 120

Query: 121 FFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCE-----KRLAKL 180
           FFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCE     +RLAKL
Sbjct: 121 FFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCESERFARRLAKL 180

Query: 181 KADAVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYAS 240
           KADAVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYAS
Sbjct: 181 KADAVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYAS 240

Query: 241 FPNGDMGLGEEDQGEGYGWLEEIEVPENSEMDGIEYLGPQICDI 279
           FPNGDMGLGEE +GEGY WLEEIEVPENS+MDGIEYLGPQI DI
Sbjct: 241 FPNGDMGLGEE-EGEGYRWLEEIEVPENSQMDGIEYLGPQISDI 283

BLAST of CSPI04G11020 vs. NCBI nr
Match: XP_038883864.1 (plant cysteine oxidase 2-like [Benincasa hispida])

HSP 1 Score: 539.3 bits (1388), Expect = 2.0e-149
Identity = 256/278 (92.09%), Postives = 267/278 (96.04%), Query Frame = 0

Query: 1   METRAVNRGSNRIGHVNKVQYVRRDFKKRKCRKIRRSIPVVPMALQELFVSCREVFKGPG 60
           ME RAVNRGS RIGHV KV+YV+RD K+RKCRKI+RS+PVVPMALQELFVSCREVFKGPG
Sbjct: 3   MERRAVNRGSKRIGHVKKVRYVKRDIKRRKCRKIKRSVPVVPMALQELFVSCREVFKGPG 62

Query: 61  TVPLPCDVEKLCRILDNMKAEDVGLSSSLQFFKPNVPVKGSPRVTYTTIYKCDNFSLCIF 120
           TVPLPCDVEKLC ILDNMKAEDVGLSS+LQFFKPNVPVKGSPRVTYTTIYKC+NFSLCIF
Sbjct: 63  TVPLPCDVEKLCCILDNMKAEDVGLSSNLQFFKPNVPVKGSPRVTYTTIYKCNNFSLCIF 122

Query: 121 FLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCEKRLAKLKADAVF 180
           FLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVD TNSDD AQPCEKRLAKLKADAVF
Sbjct: 123 FLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDLTNSDDPAQPCEKRLAKLKADAVF 182

Query: 181 TSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYASFPNGDM 240
           TSPCSTSVLYPTSGGNIHSFTAITPCAVLDV+GPPYSMEDGRDCSYYKEHPYASF NG+M
Sbjct: 183 TSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYASFTNGEM 242

Query: 241 GLGEEDQGEGYGWLEEIEVPENSEMDGIEYLGPQICDI 279
            LGEE +GEGYGWLEEIEVPENS+MDGIEYLGPQI DI
Sbjct: 243 ELGEE-EGEGYGWLEEIEVPENSQMDGIEYLGPQIADI 279

BLAST of CSPI04G11020 vs. TAIR 10
Match: AT5G15120.1 (Protein of unknown function (DUF1637) )

HSP 1 Score: 246.1 bits (627), Expect = 3.3e-65
Identity = 122/241 (50.62%), Postives = 164/241 (68.05%), Query Frame = 0

Query: 44  ALQELFVSCREVFK--GPGTVPLPCDVEKLCRILDNMKAEDVGLSSSLQFFKPN--VPVK 103
           A++ LF +C+EVF   GPG +P    +++L  ILD+MK EDVGL+ ++ +F+PN  V  +
Sbjct: 57  AVRRLFNTCKEVFSNGGPGVIPSEDKIQQLREILDDMKPEDVGLTPTMPYFRPNSGVEAR 116

Query: 104 GSPRVTYTTIYKCDNFSLCIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTN 163
            SP +TY  +++CD FS+ IF LP +GVIPLHNHPGMTVFSKLL G MHIKSYDWV    
Sbjct: 117 SSPPITYLHLHQCDQFSIGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKSYDWV---- 176

Query: 164 SDDTAQPCEKRLAKLKADAVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSME 223
            D   +  + RLAKLK D+ FT+PC+ S+LYP  GGN+H FTAIT CAVLDVLGPPY   
Sbjct: 177 VDAPMRDSKTRLAKLKVDSTFTAPCNASILYPEDGGNMHRFTAITACAVLDVLGPPYCNP 236

Query: 224 DGRDCSYYKEHPYASFPNGDMG-LGEEDQGEGYGWLEEIE--VPENSEMDGIEYLGPQIC 278
           +GR C+Y+ E P     + D   L  E++ EGY WL+E +    +++ + G  Y GP++ 
Sbjct: 237 EGRHCTYFLEFPLDKLSSEDDDVLSSEEEKEGYAWLQERDDNPEDHTNVVGALYRGPKVE 293

BLAST of CSPI04G11020 vs. TAIR 10
Match: AT5G39890.1 (Protein of unknown function (DUF1637) )

HSP 1 Score: 241.9 bits (616), Expect = 6.2e-64
Identity = 126/245 (51.43%), Postives = 165/245 (67.35%), Query Frame = 0

Query: 35  RRSIPVVPMALQELFVSCREVFKG--PGTVPLPCDVEKLCRILDNMKAEDVGLSSSLQFF 94
           RRS   +   +Q+LF +C++VF     GTVP   ++E L  +LD +K EDVG++  + +F
Sbjct: 37  RRSKKTLICPVQKLFDTCKKVFADGKSGTVPSQENIEMLRAVLDEIKPEDVGVNPKMSYF 96

Query: 95  KPNVPVKGSPRVTYTTIYKCDNFSLCIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSY 154
           +  V  + SP VTY  IY C  FS+CIF LP +GVIPLHNHP MTVFSKLL G MHIKSY
Sbjct: 97  RSTVTGR-SPLVTYLHIYACHRFSICIFCLPPSGVIPLHNHPEMTVFSKLLFGTMHIKSY 156

Query: 155 DWVDPTNSDDTAQP-CEKRLAKLKADAVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDV 214
           DWV      D+ QP  + RLAK+K D+ FT+PC TS+LYP  GGN+H FTA T CAVLDV
Sbjct: 157 DWV-----PDSPQPSSDTRLAKVKVDSDFTAPCDTSILYPADGGNMHCFTAKTACAVLDV 216

Query: 215 LGPPYSMEDGRDCSYYKEHPYASFPNGDMGLGEEDQGEGYGWLEE-IEVPENSEMDGIEY 274
           +GPPYS   GR C+YY ++P++SF    + + EE++ EGY WL+E  E PE+  +  + Y
Sbjct: 217 IGPPYSDPAGRHCTYYFDYPFSSFSVDGVVVAEEEK-EGYAWLKEREEKPEDLTVTALMY 274

Query: 275 LGPQI 276
            GP I
Sbjct: 277 SGPTI 274

BLAST of CSPI04G11020 vs. TAIR 10
Match: AT1G18490.1 (Protein of unknown function (DUF1637) )

HSP 1 Score: 206.1 bits (523), Expect = 3.7e-53
Identity = 102/247 (41.30%), Postives = 142/247 (57.49%), Query Frame = 0

Query: 45  LQELFVSCREVFKGPGTVPLPCDVEKLCRILDNMKAEDVGLSSSLQFFKPNVPVKGSPR- 104
           +QEL+  C+E F G    P    ++KLC +LD++   DVGL    Q       V G  R 
Sbjct: 35  VQELYDLCKETFTGKAPSPASMAIQKLCSVLDSVSPADVGLEEVSQDDDRGYGVSGVSRF 94

Query: 105 ---------VTYTTIYKCDNFSLCIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDW 164
                    +T+  I++CD F++CIF  P + VIPLH+HP M VFSK+L G +H+K+YDW
Sbjct: 95  NRVGRWAQPITFLDIHECDTFTMCIFCFPTSSVIPLHDHPEMAVFSKILYGSLHVKAYDW 154

Query: 165 VDP----TNSDDTAQPCEKRLAKLKADAVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLD 224
           V+P    T           RLAKL +D V T       LYP +GGN+H FTA+TPCAVLD
Sbjct: 155 VEPPCIITQDKGVPGSLPARLAKLVSDKVITPQSEIPALYPKTGGNLHCFTALTPCAVLD 214

Query: 225 VLGPPYSMEDGRDCSYYKEHPYASFPNGDMGLGEEDQG--EGYGWLEEIEVPENSEMDGI 276
           +L PPY    GR CSYY ++P+++F   + G+ + D+G  + Y WL +I+ P++  M   
Sbjct: 215 ILSPPYKESVGRSCSYYMDYPFSTFAL-ENGMKKVDEGKEDEYAWLVQIDTPDDLHMRPG 274

BLAST of CSPI04G11020 vs. TAIR 10
Match: AT3G58670.1 (Protein of unknown function (DUF1637) )

HSP 1 Score: 191.0 bits (484), Expect = 1.2e-48
Identity = 98/244 (40.16%), Postives = 139/244 (56.97%), Query Frame = 0

Query: 41  VPMALQELFVSCREVFKGPGTVPLPCDVEKLCRILDNMKAEDVGLSSSLQFFKPNVPVKG 100
           +P  +Q LF +C+      G V     ++K+  +L+ +K  DVGL    Q  + N P  G
Sbjct: 1   MPYFIQRLFNTCKSSLSPNGPVSEEA-LDKVRNVLEKIKPSDVGLEQEAQLVR-NWPGPG 60

Query: 101 S---------PRVTYTTIYKCDNFSLCIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKS 160
           +         P + Y  +++CD+FS+ IF +P   +IPLHNHPGMTV SKL+ G MH+KS
Sbjct: 61  NERNGNHHSLPAIKYLQLHECDSFSIGIFCMPPGSIIPLHNHPGMTVLSKLVYGSMHVKS 120

Query: 161 YDWVDPTNSDDTAQPCEKRLAKLKADAVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDV 220
           YDW +P  S +   P + R AKL  D   TSP   + LYPT+GGNIH F AIT CA+ D+
Sbjct: 121 YDWAEPDQS-ELDDPLQARPAKLVKDIDMTSPSPATTLYPTTGGNIHCFKAITHCAIFDI 180

Query: 221 LGPPYSMEDGRDCSYYKEHPYASFPNGDMGLGEEDQGEGYGWLEEIEVPENSEMDGIEYL 276
           L PPYS   GR C+Y+++ P    P G++ +   +      WLEE + P+N  +  + Y 
Sbjct: 181 LSPPYSSTHGRHCNYFRKSPMLDLP-GEIEVMNGEVISNVTWLEEYQPPDNFVIWRVPYR 240

BLAST of CSPI04G11020 vs. TAIR 10
Match: AT3G58670.2 (Protein of unknown function (DUF1637) )

HSP 1 Score: 191.0 bits (484), Expect = 1.2e-48
Identity = 98/244 (40.16%), Postives = 139/244 (56.97%), Query Frame = 0

Query: 41  VPMALQELFVSCREVFKGPGTVPLPCDVEKLCRILDNMKAEDVGLSSSLQFFKPNVPVKG 100
           +P  +Q LF +C+      G V     ++K+  +L+ +K  DVGL    Q  + N P  G
Sbjct: 1   MPYFIQRLFNTCKSSLSPNGPVSEEA-LDKVRNVLEKIKPSDVGLEQEAQLVR-NWPGPG 60

Query: 101 S---------PRVTYTTIYKCDNFSLCIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKS 160
           +         P + Y  +++CD+FS+ IF +P   +IPLHNHPGMTV SKL+ G MH+KS
Sbjct: 61  NERNGNHHSLPAIKYLQLHECDSFSIGIFCMPPGSIIPLHNHPGMTVLSKLVYGSMHVKS 120

Query: 161 YDWVDPTNSDDTAQPCEKRLAKLKADAVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDV 220
           YDW +P  S +   P + R AKL  D   TSP   + LYPT+GGNIH F AIT CA+ D+
Sbjct: 121 YDWAEPDQS-ELDDPLQARPAKLVKDIDMTSPSPATTLYPTTGGNIHCFKAITHCAIFDI 180

Query: 221 LGPPYSMEDGRDCSYYKEHPYASFPNGDMGLGEEDQGEGYGWLEEIEVPENSEMDGIEYL 276
           L PPYS   GR C+Y+++ P    P G++ +   +      WLEE + P+N  +  + Y 
Sbjct: 181 LSPPYSSTHGRHCNYFRKSPMLDLP-GEIEVMNGEVISNVTWLEEYQPPDNFVIWRVPYR 240

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9LXG94.6e-6450.62Plant cysteine oxidase 1 OS=Arabidopsis thaliana OX=3702 GN=PCO1 PE=1 SV=1[more]
Q8LGJ58.7e-6351.43Plant cysteine oxidase 2 OS=Arabidopsis thaliana OX=3702 GN=PCO2 PE=1 SV=1[more]
Q1G3U65.3e-5241.30Plant cysteine oxidase 3 OS=Arabidopsis thaliana OX=3702 GN=PCO3 PE=1 SV=1[more]
Q9LXT41.8e-4740.16Plant cysteine oxidase 5 OS=Arabidopsis thaliana OX=3702 GN=PCO5 PE=1 SV=1[more]
Q9SJI95.3e-4438.27Plant cysteine oxidase 4 OS=Arabidopsis thaliana OX=3702 GN=PCO4 PE=1 SV=2[more]
Match NameE-valueIdentityDescription
A0A0A0KWK64.1e-164100.00Cysteine dioxygenase OS=Cucumis sativus OX=3659 GN=Csa_4G188410 PE=3 SV=1[more]
A0A1S3B5D65.9e-15595.70Cysteine dioxygenase OS=Cucumis melo OX=3656 GN=LOC103486005 PE=3 SV=1[more]
A0A5D3DUS91.7e-15495.34Cysteine dioxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold95G... [more]
A0A5A7TK651.2e-15293.66Cysteine dioxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold67G... [more]
A0A6J1HS361.1e-14288.45Cysteine dioxygenase OS=Cucurbita maxima OX=3661 GN=LOC111466078 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
XP_004149110.18.4e-164100.00plant cysteine oxidase 1 [Cucumis sativus] >KGN53913.1 hypothetical protein Csa_... [more]
XP_008442017.11.2e-15495.70PREDICTED: plant cysteine oxidase 2-like [Cucumis melo][more]
TYK27060.13.5e-15495.34plant cysteine oxidase 2-like [Cucumis melo var. makuwa][more]
KAA0041769.12.5e-15293.66plant cysteine oxidase 2-like [Cucumis melo var. makuwa][more]
XP_038883864.12.0e-14992.09plant cysteine oxidase 2-like [Benincasa hispida][more]
Match NameE-valueIdentityDescription
AT5G15120.13.3e-6550.62Protein of unknown function (DUF1637) [more]
AT5G39890.16.2e-6451.43Protein of unknown function (DUF1637) [more]
AT1G18490.13.7e-5341.30Protein of unknown function (DUF1637) [more]
AT3G58670.11.2e-4840.16Protein of unknown function (DUF1637) [more]
AT3G58670.21.2e-4840.16Protein of unknown function (DUF1637) [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR012864Cysteine oxygenase/2-aminoethanethiol dioxygenasePFAMPF07847PCO_ADOcoord: 74..275
e-value: 4.2E-71
score: 238.5
IPR014710RmlC-like jelly roll foldGENE3D2.60.120.10Jelly Rollscoord: 40..235
e-value: 3.3E-11
score: 44.7
NoneNo IPR availablePANTHERPTHR22966:SF54CYSTEINE OXYGENASE/2-AMINOETHANETHIOL DIOXYGENASE, RMLC-LIKE JELLY ROLL FOLD PROTEIN-RELATEDcoord: 22..276
NoneNo IPR availablePANTHERPTHR22966UNCHARACTERIZEDcoord: 22..276
NoneNo IPR availableCDDcd20289cupin_ADOcoord: 113..217
e-value: 2.13929E-45
score: 146.541
IPR011051RmlC-like cupin domain superfamilySUPERFAMILY51182RmlC-like cupinscoord: 41..230

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI04G11020.1CSPI04G11020.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0017172 cysteine dioxygenase activity
molecular_function GO:0046872 metal ion binding
molecular_function GO:0016702 oxidoreductase activity, acting on single donors with incorporation of molecular oxygen, incorporation of two atoms of oxygen