Tan0012798 (gene) Snake gourd v1

Overview
NameTan0012798
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCysteine dioxygenase
LocationLG03: 60899767 .. 60905306 (-)
RNA-Seq ExpressionTan0012798
SyntenyTan0012798
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAAAAAAAAAAGTCAAGAAGTTGTTTCAATTGCTTACATTAAGCTGATTGTAGCTCATTTCTTGATTCTCATTCATTCATCGTCAATGGCGTTTACTTCTTCTTTACCCTCAAAAGAGAAAAAAATGGGAATCTTTGCTTACTGTTCAAGGAAATTCCTTTGATCTTTATGTTCACACCCATGCAAAAATCCGCCATCGTTAGGGCTAGAAGAGTCAAATCTCACCACTTGTTTCCAAATCCCAAAACCGGGTTCTGCATACCGCCAGTTTCCTCTCAATCTTAGGGGCGGCCACCCAACCCCTCGAAGCAGATTTCTCCCTTTCTTTGGCTTCCTCTTTATTTTATCCACTTTTTTTTTTTTTATTTCCCCACCTTTCTCCCCCAGAATGAATAGTCATTCTAGCGAAGCAATTCCTCCTGTTCCAAATGGCCCCTGGTTTTCCCTTCTTTCTCCTCTGTTCTTGTTTTTATATATTAATATCTTATTCCACTTCCTCTTTCTGTTCCTACATCCTCATCTTCTTTCTGACTTTTCCTTTTTTTTCTCTTTTTTTGCACTTGTTCGTCTCTGGTTTCTTGTTTTGGACGAACCCCATTTTCGTTTCTCTTCAATTTTTGATTCTTCTGCTTTATAAACGACACCCCAGATCACGTTTCTGCATCGTTCGGTTTTTCGTATCTGGGTTTGTGGTTTCGGATTCAATCGTTCTTTTTTGAGGCCAAGTTTTTGTTGGGGTTTTGCGGTTTTTGAGGGGAAAAGAGGAGAGGGTTTTTGTTGAGTTCTGATGACGACGGAAACGAGAGCGATGGATCGAGGGAGTAACAGAATTGGACATGTGAATAAGGTTCAGTATGTGAAAAGGGATATTAAGAAGAGGAAATGTAGAAGGATCAAGCGGTCTGTTCCAGTGGTTCCTATGGCGCTTCAAGAACTGTTTGTTTCTTGCAGGGACGTCTTCAAAGGCCCTGGAACCGTTCCATTGCCTTGTGATGTCGAAAAGCTCTGCCGCATTCTTGGTAAGTTTCAGATTGAATCTATTTCCATTAGTTTTTGTTGTTGTTGTTGTTTGTTTGTTTCATTTTCCCTTCAGATTCTTGTTTTGTGAAGAAGAGATTGTTTGTTTTGTCATGGTTTATGAATCTCTATGAAATTAGCTTAGTTTGGCCCAGTTGAACCCTTGATTTCCTTTTGTCTTTACATTTTTGGCTAACTGACATGGTGTGGAAAGTTCTGAAGTTCTAGCAGCATTACCACTTCTGGCTACATGCATATTATTATAGAGCAGCAGGGGAATCTTTCATCATTCTCTTTCTTGAGCAATTTTTTGGGCTTCCCTTTAATGATTTGTCTTATCACATTATGCTTGTATGTTAATGTCAGTCCTATGCTCTTGGATTCTTTGTAGGATTCTTTATTTTTAGACTTAGACCTACCAAGAGAGAAAAAAAAAGAAAGGAAGGATTTCTTCTTTGTGGAATAGTGGATGAAAAACCTTCCCTTCTATTATTTTATGAATTTTATCACATTAAACTTATTAAAAATTTAGTCCCTAAACTTTCATGCTTGTATCTAGTTTGTATTAGATAGATCCCTAAACTTTAAAAATTTTCATATCTAGTTTGTATTAGATAGGTCGCTAAACTTAAAAAATTTTCATGTTTGTATCTAGTTTGTATTAGATAGGTCCCTAAACCTTAAAAATTTTCATGTTTGTATCTAGTTTGTATTAATAGGTTCGAACTTTCGATTTTGTGTCTAATAAGTCCTTAAACTTGGACTTTAAAGAGTGTCTAATAATAAGTCCCTAAATTTTTGATTTTATATCTTTTTAAGTCCTTTAACTTAATTTTGTGTCGACATTTTTTCAAAAATTGAAAGTTTAATATACTATCCTCCACAAAATTCAATTTTATGTCTAATAGTTTCGTTTTAATTTAAAAAAAATTAAAAAAAAAGTTTAGGGACATATTAGATACAAACACCAAGTTTGGGGCTAAGCTTGTAATTTAACCTTAAAATTGTCTTTCTGATGCATTCTGCTTTTTTCTATCATAATGTCACACACTTCAATGCAGCAGCATTTGCTTTAGTAAAATTGAATCATTTGATTTTTTTATTTTTATTTTTTTGGCCTCGTGGGACGTTTTTGAGTTAAACTAGCAGGTTTGAGGTATTAAAAAATAAAGAGAATTTGGGGGGCTTTTGTTCAAATCTAAGGCCACCTAGGAGTCTCAAATGAGATCAGTGAAATTGTTGAATAATAATAATAATAAAAAAGAAACTAGCAGATTTCTAGGAATAGATTCCTCTGCCTGAATAGTATAGTATTTATTTGCTTACATCGTACTATCAACCTACAACTGTCTCTTTCTACAAATATTTGTTTATGCTGTGCTTTGTTGTGCCATGGGGGCACATGAGCCTCCTTTGATAATTATAACTTCTTTGCTGAAAAGACAACATAACACAATAGAATATGAGGGATTTCTGCATCTAAGCACGTGGTTTTGTTTGGAGAAATTTTCGGGTCCAGGAACATAGGACATAAAACTTCTGTACTTTTGAATTACTTAGCTTCTACATGGTTGAATATTGACCAAATTCCTTTTATTTTTTCAATGCTGCAGATAATATGAGGGCAGAAGATGTTGGACTTAGTAGTAATTTGCAGTTTTTCAAGCCCAATGTTCCAGTCAAAGGATCCCCTAGAGTAACATACACAACCATATACAAGTGTGACAATTTCTCGGTAAGAACATTGTCTTGGCTGTTTTTTATCTAAAATAGCTGTTAGTTAATTATATTTATCGAAAAGGGCAAGCTTATAAACAGTGGGATTAGTCAAATTGTTATAGATCATGAAATTAGTTGAGCTGCTTGCGAGTTGAATATAAGGCAAGGTTATATATCATGAGATCAGTCAAAGGGCACGGAACATATAAGGTTGGCCTAGGGGATAATAGTGGTAAGTTACAAATCATGAAATTATTCAATCAATGTGCACACAAGCTAAATATGTAGGGTCAAGAAGGAAAGCTTATAAGTCAAGATCATACTGTATGTTTTAGGTTAGCCTAGTGGTCAAACAAGCAAACGATAAGTCTCGGACTTTCGAGATTAGTCGAGCTGTGTGCATATCATCTATTATTTTCCATTTTGTTGCTTGGATCCAAATTCCTGATGCCATATGTTTTCTTCTTTATACAGTTGTGCATCTTCTTCCTCCCAGCAACTGGTGTAATCCCTCTACACAACCATCCAGGAATGACTGTTTTCAGTAAGCTTCTGCTAGGGAAAATGCATATCAAATCATATGATTGGGTTGATCCAACCAAAACTGATGATTCTGCCCAACCTTGTCAAAGTAAGCACATCTAAACTTTCTTATTATAAAATGAAATTGTTGGGAAAATTTCTAGTTGTTGTTGACCCTGAAGTGATGATGGGTTGTTTTGCTTTGCAGAGAGATTGGCAAAGCTGAAAGCTGATGCTGTCTTCACTTCACCCTGCAGTACCTCTGTTTTGTACCCAACAACAGGAGGCAATATTCACTCATTCACAGCTATAACGCCATGTGCGGTGCTCGACGTGCTTGGACCTCCTTATTCCATGGAGGATGGTCGGGATTGTTCATATTATAAGGAGCATCCCTATGCCGCTTTTCCAAGTAAGTTCCAAATCATGTTCCTTTTTTAATCACTTAAAGTTGACTTTAGAATACTTGCAATCGATGCAAAAAGAGACTGTTTTGTTTGGTTGTGTTCTGGAAAACTTTGGATCTCTGTTTTTGTCAAGATTGTTTTGTTCTCGTGTTTTCATAGTTAATTTTCAGTTAAATTACAAGTTTAATCCTCAAACTTTGAGGTTTGTATCTAGAACTTCAAAATTCGTGTTTAATAGGTATCTTGATAGATCCTTGAATTTTAAGTTCAGCGACTAAATTTATAATTTTGAAAATTTAGGAACCAAATAGACACAAAAAAGTTTTTTTTTTAAAAAACTGTTTTTGAAAGTGACATTTTTTTTTATATGGACTCCAAGTGATTTCTAGTCATATAAAAAGTGCAAAATAGGATAAAAACAAAAAATGAAAAAAAAACAGAGTACTTTGGTGTTTTTGGACTTTCTTATGACAGCTAACATGTTTATGAAGAATAATTCTTTCATTCTTCTTAATTCTTCCTAATGAAAGCTTGCTTTTTTTTCATAAGAAAAAAAAACATGTCTATGAACAAATAATAACTATAAAAGTCTTTCAAAATATTATAGTTTGATGACCCATGATCAGAAAACAAAATGCATTGACAGTAGAGTTCCTCTTTTATTGAACAAATTACATGTCAAAGTACTGGCCGAGGAGAGAACCAATATAATATGTAACAGAACCTATATTATATAACATTACGTGAACTTGAGCATAGCTCAATTGATTAAGAGTATGAGATTACATGTAAGTTAACGTTACAAAATGGATGCAACAGTTTTGCATCCGCTTCAATGGTCATGAGCTGTTTTTTCTTGCTAGAAACTTGTAGAAAATGGGTCGTTCTGAAATTCACAAAACATTTTTTTCAATGGTCATGAGCTGTTTATTTCTTGTTAGAAACTCGTAGAAAATGGGTTGTTTTGTAATTCACAAAACATTTTTGTTTGTTTGTTTGTTTAACTGATGTTAGGGAATGTTCTTATGGTAATGTTGTACTGTAATGCAGATGGTGAAATGGCACTGACAGAAGAAGAGGGTGGTGAGGGTTATGGATGGTTAGAAGAGATTGAGATGCCAGAAAACTCTCACATGGATGGAATAGAATACTTAGGCCCTCAGATTATTGACATTTAAATTTGATTAAACCCTACTTTGCCTGCCTCACATTCATTGTCCTCTCTGCCCTTACATTCACTTTTTAAAAAAAAAAAATTTTTTTTTTTGTGTTTAAACTTAAATCTTTTTTGTTTTTTTTTTTTTACTGTAGAGTGTCTCCAAAAGTTGAATTCTTTTGGGGGGTTGTATATAAAAAAAAAGGGTACAAAGTTGGAAGTTTGTTGAATTGGCTGATTTTCAAGAGTGAAAGAGAGATGACTTCCTGTTTTTGATGTAGTTATTTGAGTAATTATTTAAATGAAATGTCCATATTTTTTTATTTATATTCTTGATAGATTGAGGTGTTCGCCATCGAAATGTCTGAAAAACATCAAGTTTGGACATAAATGAATATGAAGGGAATAAGCAAAAGCGGATGGATTTTTTAAACTATTTAGGAAAAGAGGCTGTCTATTGTCTTTTGGGAATAGCCTACTTATGGGATATAATTTAAAGAAATTGTGATAAAGGAGTGATTTGGAAGATTCACTTACAATTTTGTCTGATTTTTTGATTTTATAAAAATGGAAAAGGGAAAAAATGGACTAAATTGCCTTTTAAAAAACTTTTAAAATATTATTTTTGTGGATATATACTTTCCTTATATAGGTTTTTGACCTTTAATTGTGCAATTAAACATTATATGGTATAATACGTTTCTCTATTCGAATAAAT

mRNA sequence

AAAAAAAAAAAAGTCAAGAAGTTGTTTCAATTGCTTACATTAAGCTGATTGTAGCTCATTTCTTGATTCTCATTCATTCATCGTCAATGGCGTTTACTTCTTCTTTACCCTCAAAAGAGAAAAAAATGGGAATCTTTGCTTACTGTTCAAGGAAATTCCTTTGATCTTTATGTTCACACCCATGCAAAAATCCGCCATCGTTAGGGCTAGAAGAGTCAAATCTCACCACTTGTTTCCAAATCCCAAAACCGGGTTCTGCATACCGCCAGTTTCCTCTCAATCTTAGGGGCGGCCACCCAACCCCTCGAAGCAGATTTCTCCCTTTCTTTGGCTTCCTCTTTATTTTATCCACTTTTTTTTTTTTTATTTCCCCACCTTTCTCCCCCAGAATGAATAGTCATTCTAGCGAAGCAATTCCTCCTGTTCCAAATGGCCCCTGGTTTTCCCTTCTTTCTCCTCTGTTCTTGTTTTTATATATTAATATCTTATTCCACTTCCTCTTTCTGTTCCTACATCCTCATCTTCTTTCTGACTTTTCCTTTTTTTTCTCTTTTTTTGCACTTGTTCGTCTCTGGTTTCTTGTTTTGGACGAACCCCATTTTCGTTTCTCTTCAATTTTTGATTCTTCTGCTTTATAAACGACACCCCAGATCACGTTTCTGCATCGTTCGGTTTTTCGTATCTGGGTTTGTGGTTTCGGATTCAATCGTTCTTTTTTGAGGCCAAGTTTTTGTTGGGGTTTTGCGGTTTTTGAGGGGAAAAGAGGAGAGGGTTTTTGTTGAGTTCTGATGACGACGGAAACGAGAGCGATGGATCGAGGGAGTAACAGAATTGGACATGTGAATAAGGTTCAGTATGTGAAAAGGGATATTAAGAAGAGGAAATGTAGAAGGATCAAGCGGTCTGTTCCAGTGGTTCCTATGGCGCTTCAAGAACTGTTTGTTTCTTGCAGGGACGTCTTCAAAGGCCCTGGAACCGTTCCATTGCCTTGTGATGTCGAAAAGCTCTGCCGCATTCTTGATAATATGAGGGCAGAAGATGTTGGACTTAGTAGTAATTTGCAGTTTTTCAAGCCCAATGTTCCAGTCAAAGGATCCCCTAGAGTAACATACACAACCATATACAAGTGTGACAATTTCTCGTTGTGCATCTTCTTCCTCCCAGCAACTGGTGTAATCCCTCTACACAACCATCCAGGAATGACTGTTTTCAGTAAGCTTCTGCTAGGGAAAATGCATATCAAATCATATGATTGGGTTGATCCAACCAAAACTGATGATTCTGCCCAACCTTGTCAAAAGAGATTGGCAAAGCTGAAAGCTGATGCTGTCTTCACTTCACCCTGCAGTACCTCTGTTTTGTACCCAACAACAGGAGGCAATATTCACTCATTCACAGCTATAACGCCATGTGCGGTGCTCGACGTGCTTGGACCTCCTTATTCCATGGAGGATGGTCGGGATTGTTCATATTATAAGGAGCATCCCTATGCCGCTTTTCCAAATGGTGAAATGGCACTGACAGAAGAAGAGGGTGGTGAGGGTTATGGATGGTTAGAAGAGATTGAGATGCCAGAAAACTCTCACATGGATGGAATAGAATACTTAGGCCCTCAGATTATTGACATTTAAATTTGATTAAACCCTACTTTGCCTGCCTCACATTCATTGTCCTCTCTGCCCTTACATTCACTTTTTAAAAAAAAAAAATTTTTTTTTTTGTGTTTAAACTTAAATCTTTTTTGTTTTTTTTTTTTTACTGTAGAGTGTCTCCAAAAGTTGAATTCTTTTGGGGGGTTGTATATAAAAAAAAAGGGTACAAAGTTGGAAGTTTGTTGAATTGGCTGATTTTCAAGAGTGAAAGAGAGATGACTTCCTGTTTTTGATGTAGTTATTTGAGTAATTATTTAAATGAAATGTCCATATTTTTTTATTTATATTCTTGATAGATTGAGGTGTTCGCCATCGAAATGTCTGAAAAACATCAAGTTTGGACATAAATGAATATGAAGGGAATAAGCAAAAGCGGATGGATTTTTTAAACTATTTAGGAAAAGAGGCTGTCTATTGTCTTTTGGGAATAGCCTACTTATGGGATATAATTTAAAGAAATTGTGATAAAGGAGTGATTTGGAAGATTCACTTACAATTTTGTCTGATTTTTTGATTTTATAAAAATGGAAAAGGGAAAAAATGGACTAAATTGCCTTTTAAAAAACTTTTAAAATATTATTTTTGTGGATATATACTTTCCTTATATAGGTTTTTGACCTTTAATTGTGCAATTAAACATTATATGGTATAATACGTTTCTCTATTCGAATAAAT

Coding sequence (CDS)

ATGACGACGGAAACGAGAGCGATGGATCGAGGGAGTAACAGAATTGGACATGTGAATAAGGTTCAGTATGTGAAAAGGGATATTAAGAAGAGGAAATGTAGAAGGATCAAGCGGTCTGTTCCAGTGGTTCCTATGGCGCTTCAAGAACTGTTTGTTTCTTGCAGGGACGTCTTCAAAGGCCCTGGAACCGTTCCATTGCCTTGTGATGTCGAAAAGCTCTGCCGCATTCTTGATAATATGAGGGCAGAAGATGTTGGACTTAGTAGTAATTTGCAGTTTTTCAAGCCCAATGTTCCAGTCAAAGGATCCCCTAGAGTAACATACACAACCATATACAAGTGTGACAATTTCTCGTTGTGCATCTTCTTCCTCCCAGCAACTGGTGTAATCCCTCTACACAACCATCCAGGAATGACTGTTTTCAGTAAGCTTCTGCTAGGGAAAATGCATATCAAATCATATGATTGGGTTGATCCAACCAAAACTGATGATTCTGCCCAACCTTGTCAAAAGAGATTGGCAAAGCTGAAAGCTGATGCTGTCTTCACTTCACCCTGCAGTACCTCTGTTTTGTACCCAACAACAGGAGGCAATATTCACTCATTCACAGCTATAACGCCATGTGCGGTGCTCGACGTGCTTGGACCTCCTTATTCCATGGAGGATGGTCGGGATTGTTCATATTATAAGGAGCATCCCTATGCCGCTTTTCCAAATGGTGAAATGGCACTGACAGAAGAAGAGGGTGGTGAGGGTTATGGATGGTTAGAAGAGATTGAGATGCCAGAAAACTCTCACATGGATGGAATAGAATACTTAGGCCCTCAGATTATTGACATTTAA

Protein sequence

MTTETRAMDRGSNRIGHVNKVQYVKRDIKKRKCRRIKRSVPVVPMALQELFVSCRDVFKGPGTVPLPCDVEKLCRILDNMRAEDVGLSSNLQFFKPNVPVKGSPRVTYTTIYKCDNFSLCIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTKTDDSAQPCQKRLAKLKADAVFTSPCSTSVLYPTTGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYAAFPNGEMALTEEEGGEGYGWLEEIEMPENSHMDGIEYLGPQIIDI
Homology
BLAST of Tan0012798 vs. ExPASy Swiss-Prot
Match: Q9LXG9 (Plant cysteine oxidase 1 OS=Arabidopsis thaliana OX=3702 GN=PCO1 PE=1 SV=1)

HSP 1 Score: 245.0 bits (624), Expect = 1.0e-63
Identity = 122/241 (50.62%), Postives = 161/241 (66.80%), Query Frame = 0

Query: 46  ALQELFVSCRDVFK--GPGTVPLPCDVEKLCRILDNMRAEDVGLSSNLQFFKPN--VPVK 105
           A++ LF +C++VF   GPG +P    +++L  ILD+M+ EDVGL+  + +F+PN  V  +
Sbjct: 57  AVRRLFNTCKEVFSNGGPGVIPSEDKIQQLREILDDMKPEDVGLTPTMPYFRPNSGVEAR 116

Query: 106 GSPRVTYTTIYKCDNFSLCIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTK 165
            SP +TY  +++CD FS+ IF LP +GVIPLHNHPGMTVFSKLL G MHIKSYDWV    
Sbjct: 117 SSPPITYLHLHQCDQFSIGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKSYDWVVDAP 176

Query: 166 TDDSAQPCQKRLAKLKADAVFTSPCSTSVLYPTTGGNIHSFTAITPCAVLDVLGPPYSME 225
             DS    + RLAKLK D+ FT+PC+ S+LYP  GGN+H FTAIT CAVLDVLGPPY   
Sbjct: 177 MRDS----KTRLAKLKVDSTFTAPCNASILYPEDGGNMHRFTAITACAVLDVLGPPYCNP 236

Query: 226 DGRDCSYYKEHPYAAFPN-GEMALTEEEGGEGYGWLEEIEMPENSHMD--GIEYLGPQII 280
           +GR C+Y+ E P     +  +  L+ EE  EGY WL+E +     H +  G  Y GP++ 
Sbjct: 237 EGRHCTYFLEFPLDKLSSEDDDVLSSEEEKEGYAWLQERDDNPEDHTNVVGALYRGPKVE 293

BLAST of Tan0012798 vs. ExPASy Swiss-Prot
Match: Q8LGJ5 (Plant cysteine oxidase 2 OS=Arabidopsis thaliana OX=3702 GN=PCO2 PE=1 SV=1)

HSP 1 Score: 238.8 bits (608), Expect = 7.4e-62
Identity = 127/256 (49.61%), Postives = 167/256 (65.23%), Query Frame = 0

Query: 26  RDIKKRKCRRIKRSVPVVPMALQELFVSCRDVFKG--PGTVPLPCDVEKLCRILDNMRAE 85
           R   ++K +R  +   + P  +Q+LF +C+ VF     GTVP   ++E L  +LD ++ E
Sbjct: 28  RSNSRKKIQRRSKKTLICP--VQKLFDTCKKVFADGKSGTVPSQENIEMLRAVLDEIKPE 87

Query: 86  DVGLSSNLQFFKPNVPVKGSPRVTYTTIYKCDNFSLCIFFLPATGVIPLHNHPGMTVFSK 145
           DVG++  + +F+  V  + SP VTY  IY C  FS+CIF LP +GVIPLHNHP MTVFSK
Sbjct: 88  DVGVNPKMSYFRSTVTGR-SPLVTYLHIYACHRFSICIFCLPPSGVIPLHNHPEMTVFSK 147

Query: 146 LLLGKMHIKSYDWVDPTKTDDSAQP-CQKRLAKLKADAVFTSPCSTSVLYPTTGGNIHSF 205
           LL G MHIKSYDWV      DS QP    RLAK+K D+ FT+PC TS+LYP  GGN+H F
Sbjct: 148 LLFGTMHIKSYDWV-----PDSPQPSSDTRLAKVKVDSDFTAPCDTSILYPADGGNMHCF 207

Query: 206 TAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYAAFPNGEMALTEEEGGEGYGWLEE-IEM 265
           TA T CAVLDV+GPPYS   GR C+YY ++P+++F    + + EEE  EGY WL+E  E 
Sbjct: 208 TAKTACAVLDVIGPPYSDPAGRHCTYYFDYPFSSFSVDGVVVAEEE-KEGYAWLKEREEK 267

Query: 266 PENSHMDGIEYLGPQI 278
           PE+  +  + Y GP I
Sbjct: 268 PEDLTVTALMYSGPTI 274

BLAST of Tan0012798 vs. ExPASy Swiss-Prot
Match: Q1G3U6 (Plant cysteine oxidase 3 OS=Arabidopsis thaliana OX=3702 GN=PCO3 PE=1 SV=1)

HSP 1 Score: 208.4 bits (529), Expect = 1.1e-52
Identity = 106/247 (42.91%), Postives = 142/247 (57.49%), Query Frame = 0

Query: 47  LQELFVSCRDVFKGPGTVPLPCDVEKLCRILDNMRAEDVGLSSNLQFFKPNVPVKGSPR- 106
           +QEL+  C++ F G    P    ++KLC +LD++   DVGL    Q       V G  R 
Sbjct: 35  VQELYDLCKETFTGKAPSPASMAIQKLCSVLDSVSPADVGLEEVSQDDDRGYGVSGVSRF 94

Query: 107 ---------VTYTTIYKCDNFSLCIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDW 166
                    +T+  I++CD F++CIF  P + VIPLH+HP M VFSK+L G +H+K+YDW
Sbjct: 95  NRVGRWAQPITFLDIHECDTFTMCIFCFPTSSVIPLHDHPEMAVFSKILYGSLHVKAYDW 154

Query: 167 VDP--TKTDDSAQP--CQKRLAKLKADAVFTSPCSTSVLYPTTGGNIHSFTAITPCAVLD 226
           V+P    T D   P     RLAKL +D V T       LYP TGGN+H FTA+TPCAVLD
Sbjct: 155 VEPPCIITQDKGVPGSLPARLAKLVSDKVITPQSEIPALYPKTGGNLHCFTALTPCAVLD 214

Query: 227 VLGPPYSMEDGRDCSYYKEHPYAAF--PNGEMALTEEEGGEGYGWLEEIEMPENSHMDGI 278
           +L PPY    GR CSYY ++P++ F   NG M   +E   + Y WL +I+ P++ HM   
Sbjct: 215 ILSPPYKESVGRSCSYYMDYPFSTFALENG-MKKVDEGKEDEYAWLVQIDTPDDLHMRPG 274

BLAST of Tan0012798 vs. ExPASy Swiss-Prot
Match: Q9LXT4 (Plant cysteine oxidase 5 OS=Arabidopsis thaliana OX=3702 GN=PCO5 PE=1 SV=1)

HSP 1 Score: 193.0 bits (489), Expect = 4.6e-48
Identity = 100/244 (40.98%), Postives = 140/244 (57.38%), Query Frame = 0

Query: 43  VPMALQELFVSCRDVFKGPGTVPLPCDVEKLCRILDNMRAEDVGLSSNLQFFKPNVPVKG 102
           +P  +Q LF +C+      G V     ++K+  +L+ ++  DVGL    Q  + N P  G
Sbjct: 1   MPYFIQRLFNTCKSSLSPNGPVSEEA-LDKVRNVLEKIKPSDVGLEQEAQLVR-NWPGPG 60

Query: 103 S---------PRVTYTTIYKCDNFSLCIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKS 162
           +         P + Y  +++CD+FS+ IF +P   +IPLHNHPGMTV SKL+ G MH+KS
Sbjct: 61  NERNGNHHSLPAIKYLQLHECDSFSIGIFCMPPGSIIPLHNHPGMTVLSKLVYGSMHVKS 120

Query: 163 YDWVDPTKTDDSAQPCQKRLAKLKADAVFTSPCSTSVLYPTTGGNIHSFTAITPCAVLDV 222
           YDW +P ++ +   P Q R AKL  D   TSP   + LYPTTGGNIH F AIT CA+ D+
Sbjct: 121 YDWAEPDQS-ELDDPLQARPAKLVKDIDMTSPSPATTLYPTTGGNIHCFKAITHCAIFDI 180

Query: 223 LGPPYSMEDGRDCSYYKEHPYAAFPNGEMALTEEEGGEGYGWLEEIEMPENSHMDGIEYL 278
           L PPYS   GR C+Y+++ P    P GE+ +   E      WLEE + P+N  +  + Y 
Sbjct: 181 LSPPYSSTHGRHCNYFRKSPMLDLP-GEIEVMNGEVISNVTWLEEYQPPDNFVIWRVPYR 240

BLAST of Tan0012798 vs. ExPASy Swiss-Prot
Match: Q9SJI9 (Plant cysteine oxidase 4 OS=Arabidopsis thaliana OX=3702 GN=PCO4 PE=1 SV=2)

HSP 1 Score: 180.6 bits (457), Expect = 2.4e-44
Identity = 94/243 (38.68%), Postives = 139/243 (57.20%), Query Frame = 0

Query: 43  VPMALQELFVSCRDVFKGPGTVPLPCDVEKLCRILDNMRAEDVGLSSNLQFFKP------ 102
           +P   Q L+ +C+  F   G +     +EK+  +L+ ++  DVG+  + Q  +       
Sbjct: 1   MPYFAQRLYNTCKASFSSDGPITEDA-LEKVRNVLEKIKPSDVGIEQDAQLARSRSGPLN 60

Query: 103 --NVPVKGSPRVTYTTIYKCDNFSLCIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSY 162
             N   +  P + Y  +++CD+FS+ IF +P + +IPLHNHPGMTV SKL+ G MH+KSY
Sbjct: 61  ERNGSNQSPPAIKYLHLHECDSFSIGIFCMPPSSMIPLHNHPGMTVLSKLVYGSMHVKSY 120

Query: 163 DWVDPTKTDDSAQPCQKRLAKLKADAVFTSPCSTSVLYPTTGGNIHSFTAITPCAVLDVL 222
           DW++P  T+    P Q R AKL  D   T+    + LYP +GGNIH F AIT CA+LD+L
Sbjct: 121 DWLEPQLTEPE-DPSQARPAKLVKDTEMTAQSPVTTLYPKSGGNIHCFKAITHCAILDIL 180

Query: 223 GPPYSMEDGRDCSYYKEHPYAAFPNGEMALTEEEGGEGYGWLEEIEMPENSHMDGIEYLG 278
            PPYS E  R C+Y+++      P GE+ + + E      WLEE + P++  +  I Y G
Sbjct: 181 APPYSSEHDRHCTYFRKSRREDLP-GELEV-DGEVVTDVTWLEEFQPPDDFVIRRIPYRG 239

BLAST of Tan0012798 vs. NCBI nr
Match: XP_004149110.1 (plant cysteine oxidase 1 [Cucumis sativus] >KGN53913.1 hypothetical protein Csa_019158 [Cucumis sativus])

HSP 1 Score: 545.0 bits (1403), Expect = 3.7e-151
Identity = 253/277 (91.34%), Postives = 270/277 (97.47%), Query Frame = 0

Query: 4   ETRAMDRGSNRIGHVNKVQYVKRDIKKRKCRRIKRSVPVVPMALQELFVSCRDVFKGPGT 63
           ETRA++RGSNRIGHVNKVQYV+RD KKRKCR+I+RS+PVVPMALQELFVSCR+VFKGPGT
Sbjct: 2   ETRAVNRGSNRIGHVNKVQYVRRDFKKRKCRKIRRSIPVVPMALQELFVSCREVFKGPGT 61

Query: 64  VPLPCDVEKLCRILDNMRAEDVGLSSNLQFFKPNVPVKGSPRVTYTTIYKCDNFSLCIFF 123
           VPLPCDVEKLCRILDNM+AEDVGLSS+LQFFKPNVPVKGSPRVTYTTIYKCDNFSLCIFF
Sbjct: 62  VPLPCDVEKLCRILDNMKAEDVGLSSSLQFFKPNVPVKGSPRVTYTTIYKCDNFSLCIFF 121

Query: 124 LPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTKTDDSAQPCQKRLAKLKADAVFT 183
           LPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPT +DD+AQPC+KRLAKLKADAVFT
Sbjct: 122 LPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCEKRLAKLKADAVFT 181

Query: 184 SPCSTSVLYPTTGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYAAFPNGEMA 243
           SPCSTSVLYPT+GGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYA+FPNG+M 
Sbjct: 182 SPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYASFPNGDMG 241

Query: 244 LTEEEGGEGYGWLEEIEMPENSHMDGIEYLGPQIIDI 281
           L EE+ GEGYGWLEEIE+PENS MDGIEYLGPQI DI
Sbjct: 242 LGEEDQGEGYGWLEEIEVPENSEMDGIEYLGPQICDI 278

BLAST of Tan0012798 vs. NCBI nr
Match: XP_022966423.1 (plant cysteine oxidase 2-like [Cucurbita maxima])

HSP 1 Score: 540.4 bits (1391), Expect = 9.0e-150
Identity = 260/280 (92.86%), Postives = 269/280 (96.07%), Query Frame = 0

Query: 1   MTTETRAMDRGSNRIGHVNKVQYVKRDIKKRKCRRIKRSVPVVPMALQELFVSCRDVFKG 60
           MTTETRA+DRGS RIGHVNKVQYVKRDIKKRKCRR+KRSVPVVPMALQELFVSCRDVFKG
Sbjct: 1   MTTETRAVDRGSIRIGHVNKVQYVKRDIKKRKCRRMKRSVPVVPMALQELFVSCRDVFKG 60

Query: 61  PGTVPLPCDVEKLCRILDNMRAEDVGLSSNLQFFKPNVPVKG-SPRVTYTTIYKCDNFSL 120
           PGTVPLPCDV+KLCRILDNM+AEDVGLSSNLQFF PNV VKG S RVT TTIYKCDNFSL
Sbjct: 61  PGTVPLPCDVDKLCRILDNMKAEDVGLSSNLQFFNPNVQVKGSSSRVTCTTIYKCDNFSL 120

Query: 121 CIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTKTDDSAQPCQKRLAKLKAD 180
           CIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPT TDD AQP QKRLAKLKAD
Sbjct: 121 CIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNTDDPAQPRQKRLAKLKAD 180

Query: 181 AVFTSPCSTSVLYPTTGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYAAFPN 240
            VFTSPCSTSVLYPTTGGNIHSFTA+TPCAVLDV+GPPYSMEDGRDCSYYKEHPYA+FPN
Sbjct: 181 TVFTSPCSTSVLYPTTGGNIHSFTAVTPCAVLDVIGPPYSMEDGRDCSYYKEHPYASFPN 240

Query: 241 GEMALTEEEGGEGYGWLEEIEMPENSHMDGIEYLGPQIID 280
           GE+ALTEEE GEGYGWLEEIE+PENSHMDGIEYLGPQIID
Sbjct: 241 GEVALTEEE-GEGYGWLEEIEVPENSHMDGIEYLGPQIID 279

BLAST of Tan0012798 vs. NCBI nr
Match: XP_022924929.1 (plant cysteine oxidase 2-like [Cucurbita moschata])

HSP 1 Score: 539.7 bits (1389), Expect = 1.5e-149
Identity = 259/280 (92.50%), Postives = 270/280 (96.43%), Query Frame = 0

Query: 1   MTTETRAMDRGSNRIGHVNKVQYVKRDIKKRKCRRIKRSVPVVPMALQELFVSCRDVFKG 60
           MTTETRA+DRGS RIGHVNKVQYVKRDIKKRKCRR+KRSVPVVPMALQ+LFVSCRDVFKG
Sbjct: 1   MTTETRAVDRGSIRIGHVNKVQYVKRDIKKRKCRRMKRSVPVVPMALQQLFVSCRDVFKG 60

Query: 61  PGTVPLPCDVEKLCRILDNMRAEDVGLSSNLQFFKPNVPVKG-SPRVTYTTIYKCDNFSL 120
           PGTVPLPCDV+KLCRILD+M+AEDVGLSSNLQFF PNV VKG SPRVT TTIYKCDNFSL
Sbjct: 61  PGTVPLPCDVDKLCRILDDMKAEDVGLSSNLQFFNPNVAVKGSSPRVTCTTIYKCDNFSL 120

Query: 121 CIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTKTDDSAQPCQKRLAKLKAD 180
           CIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPT TDD AQP QKRLAKLKAD
Sbjct: 121 CIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNTDDPAQPRQKRLAKLKAD 180

Query: 181 AVFTSPCSTSVLYPTTGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYAAFPN 240
           AVFTSPCSTSVLYPTTGGNIHSFTA+TPCAVLDV+GPPYSMEDGRDCSYYKEHPYA+FPN
Sbjct: 181 AVFTSPCSTSVLYPTTGGNIHSFTAVTPCAVLDVIGPPYSMEDGRDCSYYKEHPYASFPN 240

Query: 241 GEMALTEEEGGEGYGWLEEIEMPENSHMDGIEYLGPQIID 280
            E+ALTEEE GEGYGWLEEIE+PENSHMDGIEYLGPQIID
Sbjct: 241 SEVALTEEE-GEGYGWLEEIEVPENSHMDGIEYLGPQIID 279

BLAST of Tan0012798 vs. NCBI nr
Match: XP_023518747.1 (plant cysteine oxidase 2-like isoform X1 [Cucurbita pepo subsp. pepo] >XP_023518748.1 plant cysteine oxidase 2-like isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 537.0 bits (1382), Expect = 1.0e-148
Identity = 257/280 (91.79%), Postives = 269/280 (96.07%), Query Frame = 0

Query: 1   MTTETRAMDRGSNRIGHVNKVQYVKRDIKKRKCRRIKRSVPVVPMALQELFVSCRDVFKG 60
           MTTETRA+DRGS RIGHVNKVQYVKRDIKKRKCRR+KRSVPVVPMALQELFVSCRDVFKG
Sbjct: 1   MTTETRAVDRGSIRIGHVNKVQYVKRDIKKRKCRRMKRSVPVVPMALQELFVSCRDVFKG 60

Query: 61  PGTVPLPCDVEKLCRILDNMRAEDVGLSSNLQFFKPNVPVKG-SPRVTYTTIYKCDNFSL 120
           PGTVPLPCDV+KLCRILDNM+AEDVGLSSNLQFF PNV VKG SPRVT TTIYKCDNFSL
Sbjct: 61  PGTVPLPCDVDKLCRILDNMKAEDVGLSSNLQFFNPNVQVKGSSPRVTCTTIYKCDNFSL 120

Query: 121 CIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTKTDDSAQPCQKRLAKLKAD 180
           CIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPT TDD AQP QKRLAKLK D
Sbjct: 121 CIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNTDDPAQPRQKRLAKLKTD 180

Query: 181 AVFTSPCSTSVLYPTTGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYAAFPN 240
           +VFTSPCSTSVLYPTTGGNIHSFTA+TPCAVLDV+GPPYSMEDGRDCSYYKEHPYA+FPN
Sbjct: 181 SVFTSPCSTSVLYPTTGGNIHSFTAVTPCAVLDVIGPPYSMEDGRDCSYYKEHPYASFPN 240

Query: 241 GEMALTEEEGGEGYGWLEEIEMPENSHMDGIEYLGPQIID 280
            ++ALTEEE GEG+GWLEEIE+PENSHMDGIEYLGPQIID
Sbjct: 241 SDVALTEEE-GEGHGWLEEIEVPENSHMDGIEYLGPQIID 279

BLAST of Tan0012798 vs. NCBI nr
Match: XP_008442017.1 (PREDICTED: plant cysteine oxidase 2-like [Cucumis melo])

HSP 1 Score: 533.9 bits (1374), Expect = 8.5e-148
Identity = 253/278 (91.01%), Postives = 268/278 (96.40%), Query Frame = 0

Query: 4   ETRAMDRGSNRIGHVNKVQYVKRDIKKRKCRRIKR-SVPVVPMALQELFVSCRDVFKGPG 63
           ETRA+DRGSNRIGHVNKVQYV+RD KKRKCR+IKR S+PVVPMALQELFVSCR+VFKGPG
Sbjct: 2   ETRAVDRGSNRIGHVNKVQYVRRDFKKRKCRKIKRPSIPVVPMALQELFVSCREVFKGPG 61

Query: 64  TVPLPCDVEKLCRILDNMRAEDVGLSSNLQFFKPNVPVKGSPRVTYTTIYKCDNFSLCIF 123
           TVPLPCDVEKLC ILDNM+AEDVGLSSNLQFFKPNVPVKGSPRVTYTTIY+CDNFSLCIF
Sbjct: 62  TVPLPCDVEKLCCILDNMKAEDVGLSSNLQFFKPNVPVKGSPRVTYTTIYRCDNFSLCIF 121

Query: 124 FLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTKTDDSAQPCQKRLAKLKADAVF 183
           FLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPT +DD+AQPC++RLAKLKADAVF
Sbjct: 122 FLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCERRLAKLKADAVF 181

Query: 184 TSPCSTSVLYPTTGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYAAFPNGEM 243
           TSPCSTSVLYPT+GGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYA+FPN +M
Sbjct: 182 TSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYASFPNCDM 241

Query: 244 ALTEEEGGEGYGWLEEIEMPENSHMDGIEYLGPQIIDI 281
            L EEE GEGYGWLEEIE+PENS MDGIEYLGPQI DI
Sbjct: 242 GLGEEE-GEGYGWLEEIEVPENSQMDGIEYLGPQISDI 278

BLAST of Tan0012798 vs. ExPASy TrEMBL
Match: A0A0A0KWK6 (Cysteine dioxygenase OS=Cucumis sativus OX=3659 GN=Csa_4G188410 PE=3 SV=1)

HSP 1 Score: 545.0 bits (1403), Expect = 1.8e-151
Identity = 253/277 (91.34%), Postives = 270/277 (97.47%), Query Frame = 0

Query: 4   ETRAMDRGSNRIGHVNKVQYVKRDIKKRKCRRIKRSVPVVPMALQELFVSCRDVFKGPGT 63
           ETRA++RGSNRIGHVNKVQYV+RD KKRKCR+I+RS+PVVPMALQELFVSCR+VFKGPGT
Sbjct: 2   ETRAVNRGSNRIGHVNKVQYVRRDFKKRKCRKIRRSIPVVPMALQELFVSCREVFKGPGT 61

Query: 64  VPLPCDVEKLCRILDNMRAEDVGLSSNLQFFKPNVPVKGSPRVTYTTIYKCDNFSLCIFF 123
           VPLPCDVEKLCRILDNM+AEDVGLSS+LQFFKPNVPVKGSPRVTYTTIYKCDNFSLCIFF
Sbjct: 62  VPLPCDVEKLCRILDNMKAEDVGLSSSLQFFKPNVPVKGSPRVTYTTIYKCDNFSLCIFF 121

Query: 124 LPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTKTDDSAQPCQKRLAKLKADAVFT 183
           LPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPT +DD+AQPC+KRLAKLKADAVFT
Sbjct: 122 LPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCEKRLAKLKADAVFT 181

Query: 184 SPCSTSVLYPTTGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYAAFPNGEMA 243
           SPCSTSVLYPT+GGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYA+FPNG+M 
Sbjct: 182 SPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYASFPNGDMG 241

Query: 244 LTEEEGGEGYGWLEEIEMPENSHMDGIEYLGPQIIDI 281
           L EE+ GEGYGWLEEIE+PENS MDGIEYLGPQI DI
Sbjct: 242 LGEEDQGEGYGWLEEIEVPENSEMDGIEYLGPQICDI 278

BLAST of Tan0012798 vs. ExPASy TrEMBL
Match: A0A6J1HS36 (Cysteine dioxygenase OS=Cucurbita maxima OX=3661 GN=LOC111466078 PE=3 SV=1)

HSP 1 Score: 540.4 bits (1391), Expect = 4.4e-150
Identity = 260/280 (92.86%), Postives = 269/280 (96.07%), Query Frame = 0

Query: 1   MTTETRAMDRGSNRIGHVNKVQYVKRDIKKRKCRRIKRSVPVVPMALQELFVSCRDVFKG 60
           MTTETRA+DRGS RIGHVNKVQYVKRDIKKRKCRR+KRSVPVVPMALQELFVSCRDVFKG
Sbjct: 1   MTTETRAVDRGSIRIGHVNKVQYVKRDIKKRKCRRMKRSVPVVPMALQELFVSCRDVFKG 60

Query: 61  PGTVPLPCDVEKLCRILDNMRAEDVGLSSNLQFFKPNVPVKG-SPRVTYTTIYKCDNFSL 120
           PGTVPLPCDV+KLCRILDNM+AEDVGLSSNLQFF PNV VKG S RVT TTIYKCDNFSL
Sbjct: 61  PGTVPLPCDVDKLCRILDNMKAEDVGLSSNLQFFNPNVQVKGSSSRVTCTTIYKCDNFSL 120

Query: 121 CIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTKTDDSAQPCQKRLAKLKAD 180
           CIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPT TDD AQP QKRLAKLKAD
Sbjct: 121 CIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNTDDPAQPRQKRLAKLKAD 180

Query: 181 AVFTSPCSTSVLYPTTGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYAAFPN 240
            VFTSPCSTSVLYPTTGGNIHSFTA+TPCAVLDV+GPPYSMEDGRDCSYYKEHPYA+FPN
Sbjct: 181 TVFTSPCSTSVLYPTTGGNIHSFTAVTPCAVLDVIGPPYSMEDGRDCSYYKEHPYASFPN 240

Query: 241 GEMALTEEEGGEGYGWLEEIEMPENSHMDGIEYLGPQIID 280
           GE+ALTEEE GEGYGWLEEIE+PENSHMDGIEYLGPQIID
Sbjct: 241 GEVALTEEE-GEGYGWLEEIEVPENSHMDGIEYLGPQIID 279

BLAST of Tan0012798 vs. ExPASy TrEMBL
Match: A0A6J1EAL9 (Cysteine dioxygenase OS=Cucurbita moschata OX=3662 GN=LOC111432330 PE=3 SV=1)

HSP 1 Score: 539.7 bits (1389), Expect = 7.5e-150
Identity = 259/280 (92.50%), Postives = 270/280 (96.43%), Query Frame = 0

Query: 1   MTTETRAMDRGSNRIGHVNKVQYVKRDIKKRKCRRIKRSVPVVPMALQELFVSCRDVFKG 60
           MTTETRA+DRGS RIGHVNKVQYVKRDIKKRKCRR+KRSVPVVPMALQ+LFVSCRDVFKG
Sbjct: 1   MTTETRAVDRGSIRIGHVNKVQYVKRDIKKRKCRRMKRSVPVVPMALQQLFVSCRDVFKG 60

Query: 61  PGTVPLPCDVEKLCRILDNMRAEDVGLSSNLQFFKPNVPVKG-SPRVTYTTIYKCDNFSL 120
           PGTVPLPCDV+KLCRILD+M+AEDVGLSSNLQFF PNV VKG SPRVT TTIYKCDNFSL
Sbjct: 61  PGTVPLPCDVDKLCRILDDMKAEDVGLSSNLQFFNPNVAVKGSSPRVTCTTIYKCDNFSL 120

Query: 121 CIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTKTDDSAQPCQKRLAKLKAD 180
           CIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPT TDD AQP QKRLAKLKAD
Sbjct: 121 CIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNTDDPAQPRQKRLAKLKAD 180

Query: 181 AVFTSPCSTSVLYPTTGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYAAFPN 240
           AVFTSPCSTSVLYPTTGGNIHSFTA+TPCAVLDV+GPPYSMEDGRDCSYYKEHPYA+FPN
Sbjct: 181 AVFTSPCSTSVLYPTTGGNIHSFTAVTPCAVLDVIGPPYSMEDGRDCSYYKEHPYASFPN 240

Query: 241 GEMALTEEEGGEGYGWLEEIEMPENSHMDGIEYLGPQIID 280
            E+ALTEEE GEGYGWLEEIE+PENSHMDGIEYLGPQIID
Sbjct: 241 SEVALTEEE-GEGYGWLEEIEVPENSHMDGIEYLGPQIID 279

BLAST of Tan0012798 vs. ExPASy TrEMBL
Match: A0A1S3B5D6 (Cysteine dioxygenase OS=Cucumis melo OX=3656 GN=LOC103486005 PE=3 SV=1)

HSP 1 Score: 533.9 bits (1374), Expect = 4.1e-148
Identity = 253/278 (91.01%), Postives = 268/278 (96.40%), Query Frame = 0

Query: 4   ETRAMDRGSNRIGHVNKVQYVKRDIKKRKCRRIKR-SVPVVPMALQELFVSCRDVFKGPG 63
           ETRA+DRGSNRIGHVNKVQYV+RD KKRKCR+IKR S+PVVPMALQELFVSCR+VFKGPG
Sbjct: 2   ETRAVDRGSNRIGHVNKVQYVRRDFKKRKCRKIKRPSIPVVPMALQELFVSCREVFKGPG 61

Query: 64  TVPLPCDVEKLCRILDNMRAEDVGLSSNLQFFKPNVPVKGSPRVTYTTIYKCDNFSLCIF 123
           TVPLPCDVEKLC ILDNM+AEDVGLSSNLQFFKPNVPVKGSPRVTYTTIY+CDNFSLCIF
Sbjct: 62  TVPLPCDVEKLCCILDNMKAEDVGLSSNLQFFKPNVPVKGSPRVTYTTIYRCDNFSLCIF 121

Query: 124 FLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTKTDDSAQPCQKRLAKLKADAVF 183
           FLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPT +DD+AQPC++RLAKLKADAVF
Sbjct: 122 FLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCERRLAKLKADAVF 181

Query: 184 TSPCSTSVLYPTTGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYAAFPNGEM 243
           TSPCSTSVLYPT+GGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYA+FPN +M
Sbjct: 182 TSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYASFPNCDM 241

Query: 244 ALTEEEGGEGYGWLEEIEMPENSHMDGIEYLGPQIIDI 281
            L EEE GEGYGWLEEIE+PENS MDGIEYLGPQI DI
Sbjct: 242 GLGEEE-GEGYGWLEEIEVPENSQMDGIEYLGPQISDI 278

BLAST of Tan0012798 vs. ExPASy TrEMBL
Match: A0A5D3DUS9 (Cysteine dioxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold95G00520 PE=3 SV=1)

HSP 1 Score: 532.3 bits (1370), Expect = 1.2e-147
Identity = 252/278 (90.65%), Postives = 267/278 (96.04%), Query Frame = 0

Query: 4   ETRAMDRGSNRIGHVNKVQYVKRDIKKRKCRRIKR-SVPVVPMALQELFVSCRDVFKGPG 63
           ETRA+DRGSNRIGHVNKVQYV+RD KKRKCR+IKR  +PVVPMALQELFVSCR+VFKGPG
Sbjct: 2   ETRAVDRGSNRIGHVNKVQYVRRDFKKRKCRKIKRPPIPVVPMALQELFVSCREVFKGPG 61

Query: 64  TVPLPCDVEKLCRILDNMRAEDVGLSSNLQFFKPNVPVKGSPRVTYTTIYKCDNFSLCIF 123
           TVPLPCDVEKLC ILDNM+AEDVGLSSNLQFFKPNVPVKGSPRVTYTTIY+CDNFSLCIF
Sbjct: 62  TVPLPCDVEKLCCILDNMKAEDVGLSSNLQFFKPNVPVKGSPRVTYTTIYRCDNFSLCIF 121

Query: 124 FLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTKTDDSAQPCQKRLAKLKADAVF 183
           FLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPT +DD+AQPC++RLAKLKADAVF
Sbjct: 122 FLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCERRLAKLKADAVF 181

Query: 184 TSPCSTSVLYPTTGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYAAFPNGEM 243
           TSPCSTSVLYPT+GGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYA+FPNG+M
Sbjct: 182 TSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYASFPNGDM 241

Query: 244 ALTEEEGGEGYGWLEEIEMPENSHMDGIEYLGPQIIDI 281
            L EEE GEGY WLEEIE+PENS MDGIEYLGPQI DI
Sbjct: 242 GLGEEE-GEGYRWLEEIEVPENSQMDGIEYLGPQISDI 278

BLAST of Tan0012798 vs. TAIR 10
Match: AT5G15120.1 (Protein of unknown function (DUF1637) )

HSP 1 Score: 245.0 bits (624), Expect = 7.3e-65
Identity = 122/241 (50.62%), Postives = 161/241 (66.80%), Query Frame = 0

Query: 46  ALQELFVSCRDVFK--GPGTVPLPCDVEKLCRILDNMRAEDVGLSSNLQFFKPN--VPVK 105
           A++ LF +C++VF   GPG +P    +++L  ILD+M+ EDVGL+  + +F+PN  V  +
Sbjct: 57  AVRRLFNTCKEVFSNGGPGVIPSEDKIQQLREILDDMKPEDVGLTPTMPYFRPNSGVEAR 116

Query: 106 GSPRVTYTTIYKCDNFSLCIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTK 165
            SP +TY  +++CD FS+ IF LP +GVIPLHNHPGMTVFSKLL G MHIKSYDWV    
Sbjct: 117 SSPPITYLHLHQCDQFSIGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKSYDWVVDAP 176

Query: 166 TDDSAQPCQKRLAKLKADAVFTSPCSTSVLYPTTGGNIHSFTAITPCAVLDVLGPPYSME 225
             DS    + RLAKLK D+ FT+PC+ S+LYP  GGN+H FTAIT CAVLDVLGPPY   
Sbjct: 177 MRDS----KTRLAKLKVDSTFTAPCNASILYPEDGGNMHRFTAITACAVLDVLGPPYCNP 236

Query: 226 DGRDCSYYKEHPYAAFPN-GEMALTEEEGGEGYGWLEEIEMPENSHMD--GIEYLGPQII 280
           +GR C+Y+ E P     +  +  L+ EE  EGY WL+E +     H +  G  Y GP++ 
Sbjct: 237 EGRHCTYFLEFPLDKLSSEDDDVLSSEEEKEGYAWLQERDDNPEDHTNVVGALYRGPKVE 293

BLAST of Tan0012798 vs. TAIR 10
Match: AT5G39890.1 (Protein of unknown function (DUF1637) )

HSP 1 Score: 238.8 bits (608), Expect = 5.2e-63
Identity = 127/256 (49.61%), Postives = 167/256 (65.23%), Query Frame = 0

Query: 26  RDIKKRKCRRIKRSVPVVPMALQELFVSCRDVFKG--PGTVPLPCDVEKLCRILDNMRAE 85
           R   ++K +R  +   + P  +Q+LF +C+ VF     GTVP   ++E L  +LD ++ E
Sbjct: 28  RSNSRKKIQRRSKKTLICP--VQKLFDTCKKVFADGKSGTVPSQENIEMLRAVLDEIKPE 87

Query: 86  DVGLSSNLQFFKPNVPVKGSPRVTYTTIYKCDNFSLCIFFLPATGVIPLHNHPGMTVFSK 145
           DVG++  + +F+  V  + SP VTY  IY C  FS+CIF LP +GVIPLHNHP MTVFSK
Sbjct: 88  DVGVNPKMSYFRSTVTGR-SPLVTYLHIYACHRFSICIFCLPPSGVIPLHNHPEMTVFSK 147

Query: 146 LLLGKMHIKSYDWVDPTKTDDSAQP-CQKRLAKLKADAVFTSPCSTSVLYPTTGGNIHSF 205
           LL G MHIKSYDWV      DS QP    RLAK+K D+ FT+PC TS+LYP  GGN+H F
Sbjct: 148 LLFGTMHIKSYDWV-----PDSPQPSSDTRLAKVKVDSDFTAPCDTSILYPADGGNMHCF 207

Query: 206 TAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYAAFPNGEMALTEEEGGEGYGWLEE-IEM 265
           TA T CAVLDV+GPPYS   GR C+YY ++P+++F    + + EEE  EGY WL+E  E 
Sbjct: 208 TAKTACAVLDVIGPPYSDPAGRHCTYYFDYPFSSFSVDGVVVAEEE-KEGYAWLKEREEK 267

Query: 266 PENSHMDGIEYLGPQI 278
           PE+  +  + Y GP I
Sbjct: 268 PEDLTVTALMYSGPTI 274

BLAST of Tan0012798 vs. TAIR 10
Match: AT1G18490.1 (Protein of unknown function (DUF1637) )

HSP 1 Score: 208.4 bits (529), Expect = 7.6e-54
Identity = 106/247 (42.91%), Postives = 142/247 (57.49%), Query Frame = 0

Query: 47  LQELFVSCRDVFKGPGTVPLPCDVEKLCRILDNMRAEDVGLSSNLQFFKPNVPVKGSPR- 106
           +QEL+  C++ F G    P    ++KLC +LD++   DVGL    Q       V G  R 
Sbjct: 35  VQELYDLCKETFTGKAPSPASMAIQKLCSVLDSVSPADVGLEEVSQDDDRGYGVSGVSRF 94

Query: 107 ---------VTYTTIYKCDNFSLCIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDW 166
                    +T+  I++CD F++CIF  P + VIPLH+HP M VFSK+L G +H+K+YDW
Sbjct: 95  NRVGRWAQPITFLDIHECDTFTMCIFCFPTSSVIPLHDHPEMAVFSKILYGSLHVKAYDW 154

Query: 167 VDP--TKTDDSAQP--CQKRLAKLKADAVFTSPCSTSVLYPTTGGNIHSFTAITPCAVLD 226
           V+P    T D   P     RLAKL +D V T       LYP TGGN+H FTA+TPCAVLD
Sbjct: 155 VEPPCIITQDKGVPGSLPARLAKLVSDKVITPQSEIPALYPKTGGNLHCFTALTPCAVLD 214

Query: 227 VLGPPYSMEDGRDCSYYKEHPYAAF--PNGEMALTEEEGGEGYGWLEEIEMPENSHMDGI 278
           +L PPY    GR CSYY ++P++ F   NG M   +E   + Y WL +I+ P++ HM   
Sbjct: 215 ILSPPYKESVGRSCSYYMDYPFSTFALENG-MKKVDEGKEDEYAWLVQIDTPDDLHMRPG 274

BLAST of Tan0012798 vs. TAIR 10
Match: AT3G58670.1 (Protein of unknown function (DUF1637) )

HSP 1 Score: 193.0 bits (489), Expect = 3.3e-49
Identity = 100/244 (40.98%), Postives = 140/244 (57.38%), Query Frame = 0

Query: 43  VPMALQELFVSCRDVFKGPGTVPLPCDVEKLCRILDNMRAEDVGLSSNLQFFKPNVPVKG 102
           +P  +Q LF +C+      G V     ++K+  +L+ ++  DVGL    Q  + N P  G
Sbjct: 1   MPYFIQRLFNTCKSSLSPNGPVSEEA-LDKVRNVLEKIKPSDVGLEQEAQLVR-NWPGPG 60

Query: 103 S---------PRVTYTTIYKCDNFSLCIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKS 162
           +         P + Y  +++CD+FS+ IF +P   +IPLHNHPGMTV SKL+ G MH+KS
Sbjct: 61  NERNGNHHSLPAIKYLQLHECDSFSIGIFCMPPGSIIPLHNHPGMTVLSKLVYGSMHVKS 120

Query: 163 YDWVDPTKTDDSAQPCQKRLAKLKADAVFTSPCSTSVLYPTTGGNIHSFTAITPCAVLDV 222
           YDW +P ++ +   P Q R AKL  D   TSP   + LYPTTGGNIH F AIT CA+ D+
Sbjct: 121 YDWAEPDQS-ELDDPLQARPAKLVKDIDMTSPSPATTLYPTTGGNIHCFKAITHCAIFDI 180

Query: 223 LGPPYSMEDGRDCSYYKEHPYAAFPNGEMALTEEEGGEGYGWLEEIEMPENSHMDGIEYL 278
           L PPYS   GR C+Y+++ P    P GE+ +   E      WLEE + P+N  +  + Y 
Sbjct: 181 LSPPYSSTHGRHCNYFRKSPMLDLP-GEIEVMNGEVISNVTWLEEYQPPDNFVIWRVPYR 240

BLAST of Tan0012798 vs. TAIR 10
Match: AT3G58670.2 (Protein of unknown function (DUF1637) )

HSP 1 Score: 193.0 bits (489), Expect = 3.3e-49
Identity = 100/244 (40.98%), Postives = 140/244 (57.38%), Query Frame = 0

Query: 43  VPMALQELFVSCRDVFKGPGTVPLPCDVEKLCRILDNMRAEDVGLSSNLQFFKPNVPVKG 102
           +P  +Q LF +C+      G V     ++K+  +L+ ++  DVGL    Q  + N P  G
Sbjct: 1   MPYFIQRLFNTCKSSLSPNGPVSEEA-LDKVRNVLEKIKPSDVGLEQEAQLVR-NWPGPG 60

Query: 103 S---------PRVTYTTIYKCDNFSLCIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKS 162
           +         P + Y  +++CD+FS+ IF +P   +IPLHNHPGMTV SKL+ G MH+KS
Sbjct: 61  NERNGNHHSLPAIKYLQLHECDSFSIGIFCMPPGSIIPLHNHPGMTVLSKLVYGSMHVKS 120

Query: 163 YDWVDPTKTDDSAQPCQKRLAKLKADAVFTSPCSTSVLYPTTGGNIHSFTAITPCAVLDV 222
           YDW +P ++ +   P Q R AKL  D   TSP   + LYPTTGGNIH F AIT CA+ D+
Sbjct: 121 YDWAEPDQS-ELDDPLQARPAKLVKDIDMTSPSPATTLYPTTGGNIHCFKAITHCAIFDI 180

Query: 223 LGPPYSMEDGRDCSYYKEHPYAAFPNGEMALTEEEGGEGYGWLEEIEMPENSHMDGIEYL 278
           L PPYS   GR C+Y+++ P    P GE+ +   E      WLEE + P+N  +  + Y 
Sbjct: 181 LSPPYSSTHGRHCNYFRKSPMLDLP-GEIEVMNGEVISNVTWLEEYQPPDNFVIWRVPYR 240

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9LXG91.0e-6350.62Plant cysteine oxidase 1 OS=Arabidopsis thaliana OX=3702 GN=PCO1 PE=1 SV=1[more]
Q8LGJ57.4e-6249.61Plant cysteine oxidase 2 OS=Arabidopsis thaliana OX=3702 GN=PCO2 PE=1 SV=1[more]
Q1G3U61.1e-5242.91Plant cysteine oxidase 3 OS=Arabidopsis thaliana OX=3702 GN=PCO3 PE=1 SV=1[more]
Q9LXT44.6e-4840.98Plant cysteine oxidase 5 OS=Arabidopsis thaliana OX=3702 GN=PCO5 PE=1 SV=1[more]
Q9SJI92.4e-4438.68Plant cysteine oxidase 4 OS=Arabidopsis thaliana OX=3702 GN=PCO4 PE=1 SV=2[more]
Match NameE-valueIdentityDescription
XP_004149110.13.7e-15191.34plant cysteine oxidase 1 [Cucumis sativus] >KGN53913.1 hypothetical protein Csa_... [more]
XP_022966423.19.0e-15092.86plant cysteine oxidase 2-like [Cucurbita maxima][more]
XP_022924929.11.5e-14992.50plant cysteine oxidase 2-like [Cucurbita moschata][more]
XP_023518747.11.0e-14891.79plant cysteine oxidase 2-like isoform X1 [Cucurbita pepo subsp. pepo] >XP_023518... [more]
XP_008442017.18.5e-14891.01PREDICTED: plant cysteine oxidase 2-like [Cucumis melo][more]
Match NameE-valueIdentityDescription
A0A0A0KWK61.8e-15191.34Cysteine dioxygenase OS=Cucumis sativus OX=3659 GN=Csa_4G188410 PE=3 SV=1[more]
A0A6J1HS364.4e-15092.86Cysteine dioxygenase OS=Cucurbita maxima OX=3661 GN=LOC111466078 PE=3 SV=1[more]
A0A6J1EAL97.5e-15092.50Cysteine dioxygenase OS=Cucurbita moschata OX=3662 GN=LOC111432330 PE=3 SV=1[more]
A0A1S3B5D64.1e-14891.01Cysteine dioxygenase OS=Cucumis melo OX=3656 GN=LOC103486005 PE=3 SV=1[more]
A0A5D3DUS91.2e-14790.65Cysteine dioxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold95G... [more]
Match NameE-valueIdentityDescription
AT5G15120.17.3e-6550.62Protein of unknown function (DUF1637) [more]
AT5G39890.15.2e-6349.61Protein of unknown function (DUF1637) [more]
AT1G18490.17.6e-5442.91Protein of unknown function (DUF1637) [more]
AT3G58670.13.3e-4940.98Protein of unknown function (DUF1637) [more]
AT3G58670.23.3e-4940.98Protein of unknown function (DUF1637) [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR014710RmlC-like jelly roll foldGENE3D2.60.120.10Jelly Rollscoord: 42..241
e-value: 6.1E-11
score: 43.8
IPR012864Cysteine oxygenase/2-aminoethanethiol dioxygenasePFAMPF07847PCO_ADOcoord: 76..277
e-value: 1.2E-71
score: 240.3
NoneNo IPR availablePANTHERPTHR22966:SF54CYSTEINE OXYGENASE/2-AMINOETHANETHIOL DIOXYGENASE, RMLC-LIKE JELLY ROLL FOLD PROTEIN-RELATEDcoord: 24..279
NoneNo IPR availablePANTHERPTHR22966UNCHARACTERIZEDcoord: 24..279
NoneNo IPR availableCDDcd20289cupin_ADOcoord: 115..219
e-value: 3.53467E-44
score: 143.46
IPR011051RmlC-like cupin domain superfamilySUPERFAMILY51182RmlC-like cupinscoord: 43..233

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0012798.1Tan0012798.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0017172 cysteine dioxygenase activity
molecular_function GO:0046872 metal ion binding
molecular_function GO:0016702 oxidoreductase activity, acting on single donors with incorporation of molecular oxygen, incorporation of two atoms of oxygen