Tan0002843 (gene) Snake gourd v1

Overview
NameTan0002843
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCysteine dioxygenase
LocationLG07: 55219855 .. 55223182 (-)
RNA-Seq ExpressionTan0002843
SyntenyTan0002843
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CCGCTATTTGGAACAGCCCACTGGAATACTGGGAATGGCCCCTGGTTTTCTCTTCTTCAAACCCATCTCTCTCTCTCTCTCTCTGCTAAAACCTCTCCGTTTTTCTCCCTTTTATATCGCTTCTCTCTCCCTTCTTCCCTCTTCTTCTCCATAACGGACATCCCCATTTTTCATCTCCCTTCATTCTCTACCCTCTTTCGCTTTTCGATTCTATCCAGATCCCTGTTTTCCCCCAACCCAATTGCTCCACTGGCCACTTAATTCCACTCCTCCTCCACTTTCCGCCTCCAATTTCAACCAAATTTTGGAGGATTTAATGGGGATTGAGAGGTCTCTAGCCGATCGAAAGGGGAAACAGTTTTGTGAATTGCCTAAAGAAATTACTACGAATAATAAGACGAGGAAGAACCGGCGGCGGCCTAAGAAGCCGTCCTCGCCGGTTCAGAAACTTTACGAAACTTGTAAGCAAGTTTTCGCCTCTAGTGAGACTGGAATTGTTCCCCCTTCTGAGGATATCCAACGCCTACGAACTGTTCTGGGTATGCGTTTGTTTCTGTTTTTCATCTTGCTTCTGATTCAATGAACCCCACATTGTGTGTTGAATTCCCAATTCTCATTCATAATGTGGTTTCATGTTGAGATGGCTCGCTTTGTTTGTGTTTTTTTAAATTGAGGGTTCTTTTTAATCCGTACTATCTGTACGAACCTCGTTTGGGGAAAGGTTTTGTCAATTTTTCTTGGGTTTTCATTCTGATTTGTCATTTCTGTACTTTTCAATCTTCTGTTGGATTGTGGGTTTAAAGGTGTATATATATGTTTTAGCTCTGTCTTTTGAGGCTGTTGGTGGCAACTTGCACGGCTCCTGCACGTGCTGAGTTTTATTTATTTATTTATTTTTCTGTTCTAGATATTGGTGCTTTGAGTTATTTTAGTTGGTTAATGATTGGCTTCACAATGGATTTGGTTTTTCTATTCAATGATTGCATTTGAGCATTTTTATCTGGGAATTGCAAATTGGTTCAAGTTCCACAGAGTTTAGTTTTGGTAATAGGATAAGTTTGTAGGGTAGCCCATTTCAATTTTTTATGTTCTGGTAATGCTAATTAGAAGTTTTTGGGAAGTTTTTGAGAGAGAATCTATTAAACCCAATTGGTGTTTTCTTAAACCTTAATTTATTAAAGTTTAGGAAACTGGCTGAGTTGCATTTTGTAAACGGAGACAGAGACAGAGACAGACACAATGCAACCCTTGAAACCGTGTTGTCATTTTTCACTTGTGTCTGGCTTTTGGTTTGTTTTTTTGTGTGTTACTAATGAATCTTCTTGTATATAGATAAAATGAAGCCAGTAGATGTTGGGTTGTCACCGGAGATGCCGTATTTTCGGACGACAGCCGGTGAACGAACTCCACCTCCTATAACATATTTGCACCTCTATGAGAGCAACAAATTCTCTGTATGCATTTTCAGTTGCTTTATTAGAAAGCAGTCTAGTTGCGGGCAACTACTTTAACTGATGTAGTTTTTGACCGTTTGAATTGTCTTTGTGTTTGCAGATGGGAATATTTTGCTTGCCTCCTTCGGGTGTCATTCCACTCCACAACCATCCTGGAATGACAGTCTTCAGCAAGCTTCTCTTTGGGACTATGCACATCAAGGCGTACGACTGGGCAGAGGTCGGTGCCGAGAATGGCACATCAGTGAGTGTAGACACGTCGGATGGCACAGCTCCCTCAAGAAGTGAGTCTCTGTACAATATTGTTCTGTGCTCTCTCTGCAGTTGGTAGAAATAACAGTCTTATCTTACTCATTAATGAAGAAAGAATGTGCAATAGCTTAGGTTTAAAAGGATTGAAATAAGGATTTAAACATCAATATTGATGGAAATATCTATGGCAGATTTGATGTTGATGGGTATTTGTCTGTTACAAGAACGTTAACTGGATCAAGACATGTTGTCTGTTACAAAACCTGAACTTATAGGAAACCTGAACTTATAGGCCAACTTAGGGATAGCTGAACTGGATAGGACATTGTACCTTCCACCTAGGGTAAGATGTTTGAATCCCAAACCCGCATATTGTTGAACTCAACAAAAAAAAAAAAAAAAGAAGAAACTAATGTGGAGAGTGCGACCAAAGTCCCACATCGGCTAGATAAGGGGAAGATCATGAGTTTATAAGTGAGAGACACTATCTCTATTGATATGAGGCCTTTGAGTGAAACCAAAAGCAAAGCCATGAAGGCATATGTTCAAAGTGGACAATATCATACCAATGTAGAATAGGGTCTTTCTGATAGTAAGATTTTGAAATGGGTTTGCCAATGGTCATATTATAGGCAAGCAATAGTTTTTTTTTTCCAAATTTTTTTAATGAGATTAGTTGACATGCGTAGTTATTAGAAAGAATCTCAAATCTTGTGTCCTTTATCTTTAAGGTACTTAACTTAATGGATCATATAATGTAACAATGTAGAGAAGAACACCTAATTTTCTGTTATATTACACAGAGTAATCAATTTTAAGCAATTAGGACTCCAATCTATAAGCATTCGTCTTGTTCAACATCTCCAGGTATTCAGTTGGCCAAAGTTAAGGTAGACTCCAACTTCACAGCACCGTGCGACTCGTCCATTCTTTACCCTGCAGACGGAGGAAACATGCATTGCTTCACAGCTGTGACCGCGTGTGCAGTGTTAGATGTGCTCGGCCCACCTTACTCCGATTGCGATGGTCGGCATTGCTCATACTACCTCGACTTTCCCTTCACCGAGTTTTCAGGTACCCTTACGACTATGTAGTAGTGTTTTTTTACCATCAAATAGATGAAGCAGAAATTAAGATCGTGATTTTGTTTGAAAAATCACAGTCGATGGGGTTTCAATCCCGGAAGAGGAGAGGGAAAACTATGCATGGCTGGAAGAAAGAGAGCAACCTGAAGACTTAGCTGCAGTTGGAGCAGTGTACAGAGGGCCAAAGATAGTTGAGAATTGATGAACTCAGCAAACAAATCTTCTTCATTTCATACAATGCTGTCATGGACTTGCCACCATTTCTGTGCAGCATAGAATCTTTGTTCTTACTTTCTACTTCCAAAAGAGAGAGAGAGAGAGAGAGAGAGAAACAAAACAAAAACAAAAGAAGCTTTTTTATTGATCTTTCTTCTTTGTGGAACATGAATTAGGTACTTTTTTTTTACCCCCCTCCCTGTTTTTAATCATCTTTTGTATATAGTCAAGTTAGAAAGTGACAAGTAAAAAAAAAGTATGAAATTGTTATGTTCTTCCAATAACACTCTTCCAGTTTGTTGTTTTTAACAGATTCAACTC

mRNA sequence

CCGCTATTTGGAACAGCCCACTGGAATACTGGGAATGGCCCCTGGTTTTCTCTTCTTCAAACCCATCTCTCTCTCTCTCTCTCTGCTAAAACCTCTCCGTTTTTCTCCCTTTTATATCGCTTCTCTCTCCCTTCTTCCCTCTTCTTCTCCATAACGGACATCCCCATTTTTCATCTCCCTTCATTCTCTACCCTCTTTCGCTTTTCGATTCTATCCAGATCCCTGTTTTCCCCCAACCCAATTGCTCCACTGGCCACTTAATTCCACTCCTCCTCCACTTTCCGCCTCCAATTTCAACCAAATTTTGGAGGATTTAATGGGGATTGAGAGGTCTCTAGCCGATCGAAAGGGGAAACAGTTTTGTGAATTGCCTAAAGAAATTACTACGAATAATAAGACGAGGAAGAACCGGCGGCGGCCTAAGAAGCCGTCCTCGCCGGTTCAGAAACTTTACGAAACTTGTAAGCAAGTTTTCGCCTCTAGTGAGACTGGAATTGTTCCCCCTTCTGAGGATATCCAACGCCTACGAACTGTTCTGGATAAAATGAAGCCAGTAGATGTTGGGTTGTCACCGGAGATGCCGTATTTTCGGACGACAGCCGGTGAACGAACTCCACCTCCTATAACATATTTGCACCTCTATGAGAGCAACAAATTCTCTATGGGAATATTTTGCTTGCCTCCTTCGGGTGTCATTCCACTCCACAACCATCCTGGAATGACAGTCTTCAGCAAGCTTCTCTTTGGGACTATGCACATCAAGGCGTACGACTGGGCAGAGGTCGGTGCCGAGAATGGCACATCAGTGAGTGTAGACACGTCGGATGGCACAGCTCCCTCAAGAAGTATTCAGTTGGCCAAAGTTAAGGTAGACTCCAACTTCACAGCACCGTGCGACTCGTCCATTCTTTACCCTGCAGACGGAGGAAACATGCATTGCTTCACAGCTGTGACCGCGTGTGCAGTGTTAGATGTGCTCGGCCCACCTTACTCCGATTGCGATGGTCGGCATTGCTCATACTACCTCGACTTTCCCTTCACCGAGTTTTCAGTCGATGGGGTTTCAATCCCGGAAGAGGAGAGGGAAAACTATGCATGGCTGGAAGAAAGAGAGCAACCTGAAGACTTAGCTGCAGTTGGAGCAGTGTACAGAGGGCCAAAGATAGTTGAGAATTGATGAACTCAGCAAACAAATCTTCTTCATTTCATACAATGCTGTCATGGACTTGCCACCATTTCTGTGCAGCATAGAATCTTTGTTCTTACTTTCTACTTCCAAAAGAGAGAGAGAGAGAGAGAGAGAGAAACAAAACAAAAACAAAAGAAGCTTTTTTATTGATCTTTCTTCTTTGTGGAACATGAATTAGGTACTTTTTTTTTACCCCCCTCCCTGTTTTTAATCATCTTTTGTATATAGTCAAGTTAGAAAGTGACAAGTAAAAAAAAAGTATGAAATTGTTATGTTCTTCCAATAACACTCTTCCAGTTTGTTGTTTTTAACAGATTCAACTC

Coding sequence (CDS)

ATGGGGATTGAGAGGTCTCTAGCCGATCGAAAGGGGAAACAGTTTTGTGAATTGCCTAAAGAAATTACTACGAATAATAAGACGAGGAAGAACCGGCGGCGGCCTAAGAAGCCGTCCTCGCCGGTTCAGAAACTTTACGAAACTTGTAAGCAAGTTTTCGCCTCTAGTGAGACTGGAATTGTTCCCCCTTCTGAGGATATCCAACGCCTACGAACTGTTCTGGATAAAATGAAGCCAGTAGATGTTGGGTTGTCACCGGAGATGCCGTATTTTCGGACGACAGCCGGTGAACGAACTCCACCTCCTATAACATATTTGCACCTCTATGAGAGCAACAAATTCTCTATGGGAATATTTTGCTTGCCTCCTTCGGGTGTCATTCCACTCCACAACCATCCTGGAATGACAGTCTTCAGCAAGCTTCTCTTTGGGACTATGCACATCAAGGCGTACGACTGGGCAGAGGTCGGTGCCGAGAATGGCACATCAGTGAGTGTAGACACGTCGGATGGCACAGCTCCCTCAAGAAGTATTCAGTTGGCCAAAGTTAAGGTAGACTCCAACTTCACAGCACCGTGCGACTCGTCCATTCTTTACCCTGCAGACGGAGGAAACATGCATTGCTTCACAGCTGTGACCGCGTGTGCAGTGTTAGATGTGCTCGGCCCACCTTACTCCGATTGCGATGGTCGGCATTGCTCATACTACCTCGACTTTCCCTTCACCGAGTTTTCAGTCGATGGGGTTTCAATCCCGGAAGAGGAGAGGGAAAACTATGCATGGCTGGAAGAAAGAGAGCAACCTGAAGACTTAGCTGCAGTTGGAGCAGTGTACAGAGGGCCAAAGATAGTTGAGAATTGA

Protein sequence

MGIERSLADRKGKQFCELPKEITTNNKTRKNRRRPKKPSSPVQKLYETCKQVFASSETGIVPPSEDIQRLRTVLDKMKPVDVGLSPEMPYFRTTAGERTPPPITYLHLYESNKFSMGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENGTSVSVDTSDGTAPSRSIQLAKVKVDSNFTAPCDSSILYPADGGNMHCFTAVTACAVLDVLGPPYSDCDGRHCSYYLDFPFTEFSVDGVSIPEEERENYAWLEEREQPEDLAAVGAVYRGPKIVEN
Homology
BLAST of Tan0002843 vs. ExPASy Swiss-Prot
Match: Q8LGJ5 (Plant cysteine oxidase 2 OS=Arabidopsis thaliana OX=3702 GN=PCO2 PE=1 SV=1)

HSP 1 Score: 332.8 bits (852), Expect = 3.8e-90
Identity = 162/266 (60.90%), Postives = 198/266 (74.44%), Query Frame = 0

Query: 21  EITTNNKTRKNRRRPKKPSSPVQKLYETCKQVFASSETGIVPPSEDIQRLRTVLDKMKPV 80
           E  +N++ +  RR  K    PVQKL++TCK+VFA  ++G VP  E+I+ LR VLD++KP 
Sbjct: 26  ENRSNSRKKIQRRSKKTLICPVQKLFDTCKKVFADGKSGTVPSQENIEMLRAVLDEIKPE 85

Query: 81  DVGLSPEMPYFRTTAGERTPPPITYLHLYESNKFSMGIFCLPPSGVIPLHNHPGMTVFSK 140
           DVG++P+M YFR+T   R+ P +TYLH+Y  ++FS+ IFCLPPSGVIPLHNHP MTVFSK
Sbjct: 86  DVGVNPKMSYFRSTVTGRS-PLVTYLHIYACHRFSICIFCLPPSGVIPLHNHPEMTVFSK 145

Query: 141 LLFGTMHIKAYDWAEVGAENGTSVSVDTSDGTAPSRSIQLAKVKVDSNFTAPCDSSILYP 200
           LLFGTMHIK+YDW                D   PS   +LAKVKVDS+FTAPCD+SILYP
Sbjct: 146 LLFGTMHIKSYDW--------------VPDSPQPSSDTRLAKVKVDSDFTAPCDTSILYP 205

Query: 201 ADGGNMHCFTAVTACAVLDVLGPPYSDCDGRHCSYYLDFPFTEFSVDGVSIPEEERENYA 260
           ADGGNMHCFTA TACAVLDV+GPPYSD  GRHC+YY D+PF+ FSVDGV + EEE+E YA
Sbjct: 206 ADGGNMHCFTAKTACAVLDVIGPPYSDPAGRHCTYYFDYPFSSFSVDGVVVAEEEKEGYA 265

Query: 261 WLEEREQ-PEDLAAVGAVYRGPKIVE 286
           WL+ERE+ PEDL     +Y GP I E
Sbjct: 266 WLKEREEKPEDLTVTALMYSGPTIKE 276

BLAST of Tan0002843 vs. ExPASy Swiss-Prot
Match: Q9LXG9 (Plant cysteine oxidase 1 OS=Arabidopsis thaliana OX=3702 GN=PCO1 PE=1 SV=1)

HSP 1 Score: 305.1 bits (780), Expect = 8.6e-82
Identity = 156/281 (55.52%), Postives = 195/281 (69.40%), Query Frame = 0

Query: 19  PKEITTNNKTRKNRR----RPKKPSSP------VQKLYETCKQVFASSETGIVPPSEDIQ 78
           P  +   NK +  +     R KK  SP      V++L+ TCK+VF++   G++P  + IQ
Sbjct: 25  PNSVKKKNKNKNKKMMMTWRRKKIDSPADGITAVRRLFNTCKEVFSNGGPGVIPSEDKIQ 84

Query: 79  RLRTVLDKMKPVDVGLSPEMPYFRTTAG--ERTPPPITYLHLYESNKFSMGIFCLPPSGV 138
           +LR +LD MKP DVGL+P MPYFR  +G   R+ PPITYLHL++ ++FS+GIFCLPPSGV
Sbjct: 85  QLREILDDMKPEDVGLTPTMPYFRPNSGVEARSSPPITYLHLHQCDQFSIGIFCLPPSGV 144

Query: 139 IPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENGTSVSVDTSDGTAPSRSIQLAKVKVD 198
           IPLHNHPGMTVFSKLLFGTMHIK+YDW                D        +LAK+KVD
Sbjct: 145 IPLHNHPGMTVFSKLLFGTMHIKSYDW--------------VVDAPMRDSKTRLAKLKVD 204

Query: 199 SNFTAPCDSSILYPADGGNMHCFTAVTACAVLDVLGPPYSDCDGRHCSYYLDFPFTEFSV 258
           S FTAPC++SILYP DGGNMH FTA+TACAVLDVLGPPY + +GRHC+Y+L+FP  + S 
Sbjct: 205 STFTAPCNASILYPEDGGNMHRFTAITACAVLDVLGPPYCNPEGRHCTYFLEFPLDKLSS 264

Query: 259 --DGVSIPEEERENYAWLEER-EQPED-LAAVGAVYRGPKI 284
             D V   EEE+E YAWL+ER + PED    VGA+YRGPK+
Sbjct: 265 EDDDVLSSEEEKEGYAWLQERDDNPEDHTNVVGALYRGPKV 291

BLAST of Tan0002843 vs. ExPASy Swiss-Prot
Match: Q9SJI9 (Plant cysteine oxidase 4 OS=Arabidopsis thaliana OX=3702 GN=PCO4 PE=1 SV=2)

HSP 1 Score: 201.4 bits (511), Expect = 1.3e-50
Identity = 110/255 (43.14%), Postives = 152/255 (59.61%), Query Frame = 0

Query: 43  QKLYETCKQVFASSETGIVPPSED-IQRLRTVLDKMKPVDVGLSPEMPYFRTTAG----- 102
           Q+LY TCK  F+S      P +ED ++++R VL+K+KP DVG+  +    R+ +G     
Sbjct: 6   QRLYNTCKASFSSDG----PITEDALEKVRNVLEKIKPSDVGIEQDAQLARSRSGPLNER 65

Query: 103 ---ERTPPPITYLHLYESNKFSMGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDW 162
               ++PP I YLHL+E + FS+GIFC+PPS +IPLHNHPGMTV SKL++G+MH+K+YDW
Sbjct: 66  NGSNQSPPAIKYLHLHECDSFSIGIFCMPPSSMIPLHNHPGMTVLSKLVYGSMHVKSYDW 125

Query: 163 AEVGAENGTSVSVDTSDGTAPSRSIQLAKVKVDSNFTAPCDSSILYPADGGNMHCFTAVT 222
            E             ++   PS++ + AK+  D+  TA    + LYP  GGN+HCF A+T
Sbjct: 126 LE----------PQLTEPEDPSQA-RPAKLVKDTEMTAQSPVTTLYPKSGGNIHCFKAIT 185

Query: 223 ACAVLDVLGPPYSDCDGRHCSYYL-----DFPFTEFSVDGVSIPEEERENYAWLEEREQP 282
            CA+LD+L PPYS    RHC+Y+      D P  E  VDG     E   +  WLEE + P
Sbjct: 186 HCAILDILAPPYSSEHDRHCTYFRKSRREDLP-GELEVDG-----EVVTDVTWLEEFQPP 239

Query: 283 EDLAAVGAVYRGPKI 284
           +D       YRGP I
Sbjct: 246 DDFVIRRIPYRGPVI 239

BLAST of Tan0002843 vs. ExPASy Swiss-Prot
Match: Q1G3U6 (Plant cysteine oxidase 3 OS=Arabidopsis thaliana OX=3702 GN=PCO3 PE=1 SV=1)

HSP 1 Score: 200.7 bits (509), Expect = 2.3e-50
Identity = 107/262 (40.84%), Postives = 146/262 (55.73%), Query Frame = 0

Query: 34  RPKKPSSPVQKLYETCKQVFASSETGIVPPSEDIQRLRTVLDKMKPVDVGLSPEMP---- 93
           R ++ S  VQ+LY+ CK+ F        P S  IQ+L +VLD + P DVGL         
Sbjct: 27  RNQEKSPKVQELYDLCKETFTGKAPS--PASMAIQKLCSVLDSVSPADVGLEEVSQDDDR 86

Query: 94  ------YFRTTAGERTPPPITYLHLYESNKFSMGIFCLPPSGVIPLHNHPGMTVFSKLLF 153
                   R     R   PIT+L ++E + F+M IFC P S VIPLH+HP M VFSK+L+
Sbjct: 87  GYGVSGVSRFNRVGRWAQPITFLDIHECDTFTMCIFCFPTSSVIPLHDHPEMAVFSKILY 146

Query: 154 GTMHIKAYDWAEVGAENGTSVSVDTSDGTAPSRSIQLAKVKVDSNFTAPCDSSILYPADG 213
           G++H+KAYDW E          +    G   S   +LAK+  D   T   +   LYP  G
Sbjct: 147 GSLHVKAYDWVE------PPCIITQDKGVPGSLPARLAKLVSDKVITPQSEIPALYPKTG 206

Query: 214 GNMHCFTAVTACAVLDVLGPPYSDCDGRHCSYYLDFPFTEFSVDG--VSIPEEERENYAW 273
           GN+HCFTA+T CAVLD+L PPY +  GR CSYY+D+PF+ F+++     + E + + YAW
Sbjct: 207 GNLHCFTALTPCAVLDILSPPYKESVGRSCSYYMDYPFSTFALENGMKKVDEGKEDEYAW 266

Query: 274 LEEREQPEDLAAVGAVYRGPKI 284
           L + + P+DL      Y GP I
Sbjct: 267 LVQIDTPDDLHMRPGSYTGPTI 280

BLAST of Tan0002843 vs. ExPASy Swiss-Prot
Match: Q9LXT4 (Plant cysteine oxidase 5 OS=Arabidopsis thaliana OX=3702 GN=PCO5 PE=1 SV=1)

HSP 1 Score: 199.9 bits (507), Expect = 3.9e-50
Identity = 105/251 (41.83%), Postives = 147/251 (58.57%), Query Frame = 0

Query: 42  VQKLYETCKQVFASSETGIVPPSED-IQRLRTVLDKMKPVDVGLSPEMPYFRTTAG---- 101
           +Q+L+ TCK    SS +   P SE+ + ++R VL+K+KP DVGL  E    R   G    
Sbjct: 5   IQRLFNTCK----SSLSPNGPVSEEALDKVRNVLEKIKPSDVGLEQEAQLVRNWPGPGNE 64

Query: 102 ----ERTPPPITYLHLYESNKFSMGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYD 161
                 + P I YL L+E + FS+GIFC+PP  +IPLHNHPGMTV SKL++G+MH+K+YD
Sbjct: 65  RNGNHHSLPAIKYLQLHECDSFSIGIFCMPPGSIIPLHNHPGMTVLSKLVYGSMHVKSYD 124

Query: 162 WAEVGAENGTSVSVDTSDGTAPSRSIQLAKVKVDSNFTAPCDSSILYPADGGNMHCFTAV 221
           WAE           D S+   P ++ + AK+  D + T+P  ++ LYP  GGN+HCF A+
Sbjct: 125 WAE----------PDQSELDDPLQA-RPAKLVKDIDMTSPSPATTLYPTTGGNIHCFKAI 184

Query: 222 TACAVLDVLGPPYSDCDGRHCSYYLDFPFTEFSVDGVSIPEEERENYAWLEEREQPEDLA 281
           T CA+ D+L PPYS   GRHC+Y+   P  +   +   +  E   N  WLEE + P++  
Sbjct: 185 THCAIFDILSPPYSSTHGRHCNYFRKSPMLDLPGEIEVMNGEVISNVTWLEEYQPPDNFV 240

Query: 282 AVGAVYRGPKI 284
                YRGP I
Sbjct: 245 IWRVPYRGPVI 240

BLAST of Tan0002843 vs. NCBI nr
Match: KAG6588381.1 (Plant cysteine oxidase 2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 513.5 bits (1321), Expect = 1.2e-141
Identity = 252/286 (88.11%), Postives = 266/286 (93.01%), Query Frame = 0

Query: 1   MGIERSLADRKGKQFCELPKEITTNNKTRKNRRRPKKPSSPVQKLYETCKQVFASSETGI 60
           MG ERSLADRKGKQF ELPKE T NNK+RKNRRR KKPSSP+QKLYETCKQVFAS++TGI
Sbjct: 62  MGFERSLADRKGKQFFELPKETTKNNKSRKNRRRSKKPSSPIQKLYETCKQVFASTDTGI 121

Query: 61  VPPSEDIQRLRTVLDKMKPVDVGLSPEMPYFRTTAGERTPPPITYLHLYESNKFSMGIFC 120
           VP  EDIQRL++VLDKMK VDVGLSPEMPYFRTTA E T PPITYLHLYE+NKFSMGIFC
Sbjct: 122 VPSLEDIQRLQSVLDKMKAVDVGLSPEMPYFRTTADEGT-PPITYLHLYENNKFSMGIFC 181

Query: 121 LPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENGTSVSVDTSDGTAPSRSIQL 180
           LPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENG S  VD S+GTAPS SI+L
Sbjct: 182 LPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENGISAGVDASNGTAPS-SIRL 241

Query: 181 AKVKVDSNFTAPCDSSILYPADGGNMHCFTAVTACAVLDVLGPPYSDCDGRHCSYYLDFP 240
           AKVKVD+NFTAPCDS+ILYPADGGNMHCFTAVTACAVLDVLGPPYSD DGRHCSYYLDFP
Sbjct: 242 AKVKVDANFTAPCDSTILYPADGGNMHCFTAVTACAVLDVLGPPYSDHDGRHCSYYLDFP 301

Query: 241 FTEFSVDGVSIPEEERENYAWLEEREQPEDLAAVGAVYRGPKIVEN 287
           FT+FSVDG SIPE ERE+YAWLEEREQPEDLAAVGA YRGPKIVE+
Sbjct: 302 FTKFSVDGKSIPEAERESYAWLEEREQPEDLAAVGAEYRGPKIVES 345

BLAST of Tan0002843 vs. NCBI nr
Match: XP_022928708.1 (plant cysteine oxidase 2-like isoform X2 [Cucurbita moschata])

HSP 1 Score: 513.1 bits (1320), Expect = 1.6e-141
Identity = 251/286 (87.76%), Postives = 266/286 (93.01%), Query Frame = 0

Query: 1   MGIERSLADRKGKQFCELPKEITTNNKTRKNRRRPKKPSSPVQKLYETCKQVFASSETGI 60
           MG ERSLADRKGKQF ELPKE T NNK+RKNRRR KKPSSP+QKLYETCKQVFAS++TGI
Sbjct: 1   MGFERSLADRKGKQFFELPKETTKNNKSRKNRRRSKKPSSPIQKLYETCKQVFASTDTGI 60

Query: 61  VPPSEDIQRLRTVLDKMKPVDVGLSPEMPYFRTTAGERTPPPITYLHLYESNKFSMGIFC 120
           VP  EDIQRL++VLDKMK VDVGLSPEMPYFRTTA E T PPITYLHLYE+NKFSMGIFC
Sbjct: 61  VPSLEDIQRLQSVLDKMKAVDVGLSPEMPYFRTTADEGT-PPITYLHLYENNKFSMGIFC 120

Query: 121 LPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENGTSVSVDTSDGTAPSRSIQL 180
           LPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENG S  VD S+GTAPS SI+L
Sbjct: 121 LPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENGISAGVDASNGTAPS-SIRL 180

Query: 181 AKVKVDSNFTAPCDSSILYPADGGNMHCFTAVTACAVLDVLGPPYSDCDGRHCSYYLDFP 240
           AKVKVD+NFTAPCDS+ILYPADGGNMHCFTAVTACAVLDVLGPPYSD DGRHCSYYLDFP
Sbjct: 181 AKVKVDANFTAPCDSTILYPADGGNMHCFTAVTACAVLDVLGPPYSDHDGRHCSYYLDFP 240

Query: 241 FTEFSVDGVSIPEEERENYAWLEEREQPEDLAAVGAVYRGPKIVEN 287
           FT+FSVDG S+PE ERE+YAWLEEREQPEDLAAVGA YRGPKIVE+
Sbjct: 241 FTKFSVDGKSVPEAERESYAWLEEREQPEDLAAVGAEYRGPKIVES 284

BLAST of Tan0002843 vs. NCBI nr
Match: XP_023531691.1 (plant cysteine oxidase 2-like isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 512.7 bits (1319), Expect = 2.1e-141
Identity = 252/286 (88.11%), Postives = 266/286 (93.01%), Query Frame = 0

Query: 1   MGIERSLADRKGKQFCELPKEITTNNKTRKNRRRPKKPSSPVQKLYETCKQVFASSETGI 60
           MG ERSLADRKGKQF ELPKE T NNK+RKNRRR KKPSSP+QKLYETCKQVFAS++TGI
Sbjct: 1   MGFERSLADRKGKQFFELPKETTKNNKSRKNRRRSKKPSSPIQKLYETCKQVFASTDTGI 60

Query: 61  VPPSEDIQRLRTVLDKMKPVDVGLSPEMPYFRTTAGERTPPPITYLHLYESNKFSMGIFC 120
           VP  EDIQRL++VLDKMK VDVGLSPEMPYFRTTA E T PPITYLHLYE+NKFSMGIFC
Sbjct: 61  VPSLEDIQRLQSVLDKMKAVDVGLSPEMPYFRTTADEGT-PPITYLHLYENNKFSMGIFC 120

Query: 121 LPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENGTSVSVDTSDGTAPSRSIQL 180
           LPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENG S SVD S+GTAPS SI+L
Sbjct: 121 LPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENGISASVDASNGTAPS-SIRL 180

Query: 181 AKVKVDSNFTAPCDSSILYPADGGNMHCFTAVTACAVLDVLGPPYSDCDGRHCSYYLDFP 240
           AKVKVD+NFTAPCDS+ILYPADGGNMHCFTAVTACAVLDVLGPPYSD DGRHCSYYLDFP
Sbjct: 181 AKVKVDANFTAPCDSTILYPADGGNMHCFTAVTACAVLDVLGPPYSDHDGRHCSYYLDFP 240

Query: 241 FTEFSVDGVSIPEEERENYAWLEEREQPEDLAAVGAVYRGPKIVEN 287
           FT+FSVDG  IPE ERE+YAWLEEREQPEDLAAVGA YRGPKIVE+
Sbjct: 241 FTKFSVDGKLIPEAERESYAWLEEREQPEDLAAVGAEYRGPKIVES 284

BLAST of Tan0002843 vs. NCBI nr
Match: XP_038903013.1 (plant cysteine oxidase 2 [Benincasa hispida])

HSP 1 Score: 511.1 bits (1315), Expect = 6.0e-141
Identity = 247/288 (85.76%), Postives = 268/288 (93.06%), Query Frame = 0

Query: 1   MGIERSLADRKGKQFCELPKEITTNNKTRKNRRRPKKPSS--PVQKLYETCKQVFASSET 60
           MGIERSLADRKGKQFCELPKE TTNN++R+NRRR +K SS  PVQKLYETCK+VFASS T
Sbjct: 1   MGIERSLADRKGKQFCELPKETTTNNRSRRNRRRLRKSSSPLPVQKLYETCKEVFASSGT 60

Query: 61  GIVPPSEDIQRLRTVLDKMKPVDVGLSPEMPYFRTTAGERTPPPITYLHLYESNKFSMGI 120
           GI+P SEDI+RLR VLDKM+P+DVGLS EMPYFRTTA +RT PPITYLHLYE++KFSMGI
Sbjct: 61  GIIPSSEDIERLRAVLDKMEPLDVGLSAEMPYFRTTADQRT-PPITYLHLYENSKFSMGI 120

Query: 121 FCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENGTSVSVDTSDGTAPSRSI 180
           FCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDW EV AENG S  VDTS GTAPSRS+
Sbjct: 121 FCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWVEVAAENGASALVDTSSGTAPSRSV 180

Query: 181 QLAKVKVDSNFTAPCDSSILYPADGGNMHCFTAVTACAVLDVLGPPYSDCDGRHCSYYLD 240
           +LAKVKVD++FTAPCDSSILYPADGGNMHCFTAVTACAVLDVLGPPYSD DGRHCSYYL+
Sbjct: 181 RLAKVKVDADFTAPCDSSILYPADGGNMHCFTAVTACAVLDVLGPPYSDPDGRHCSYYLN 240

Query: 241 FPFTEFSVDGVSIPEEERENYAWLEEREQPEDLAAVGAVYRGPKIVEN 287
           FPFTEFSVDGVS+PE ERE+YAWLEERE+PEDLAAVGA+Y GPKIVEN
Sbjct: 241 FPFTEFSVDGVSVPEAERESYAWLEEREKPEDLAAVGAMYEGPKIVEN 287

BLAST of Tan0002843 vs. NCBI nr
Match: XP_008443425.1 (PREDICTED: plant cysteine oxidase 2 [Cucumis melo] >TYK25656.1 plant cysteine oxidase 2 [Cucumis melo var. makuwa])

HSP 1 Score: 511.1 bits (1315), Expect = 6.0e-141
Identity = 248/287 (86.41%), Postives = 266/287 (92.68%), Query Frame = 0

Query: 1   MGIERSLADRKGKQFCELPKEITTNNKTRKNRRRPKKPSS--PVQKLYETCKQVFASSET 60
           MGIERSLADRKGKQFCELPKE TTNNK RKNRRR +K SS  PVQKLYETCK+VFASS T
Sbjct: 1   MGIERSLADRKGKQFCELPKETTTNNKPRKNRRRMRKSSSPLPVQKLYETCKEVFASSGT 60

Query: 61  GIVPPSEDIQRLRTVLDKMKPVDVGLSPEMPYFRTTAGERTPPPITYLHLYESNKFSMGI 120
           GIVP SEDI+RLR VLDKM+PVDVGLSP+MPYF TTA ++T PPITYLHLYE+NKFSMGI
Sbjct: 61  GIVPSSEDIERLRAVLDKMEPVDVGLSPDMPYFWTTASQQT-PPITYLHLYENNKFSMGI 120

Query: 121 FCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENGTSVSVDTSDGTAPSRSI 180
           FCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENG S  VDTS GTAPSRS+
Sbjct: 121 FCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENGASACVDTSSGTAPSRSV 180

Query: 181 QLAKVKVDSNFTAPCDSSILYPADGGNMHCFTAVTACAVLDVLGPPYSDCDGRHCSYYLD 240
           +LAKVKVD++FTAPCDSSILYPADGGNMHCFTAVTACAVLDVLGPPYSD DGRHCSYYLD
Sbjct: 181 RLAKVKVDADFTAPCDSSILYPADGGNMHCFTAVTACAVLDVLGPPYSDPDGRHCSYYLD 240

Query: 241 FPFTEFSVDGVSIPEEERENYAWLEEREQPEDLAAVGAVYRGPKIVE 286
           FP ++FSVD +S+PE ERE+YAWLEEREQPEDLAAVGA+Y GPKIVE
Sbjct: 241 FPLSDFSVDNISVPEVERESYAWLEEREQPEDLAAVGALYEGPKIVE 286

BLAST of Tan0002843 vs. ExPASy TrEMBL
Match: A0A6J1EPV0 (Cysteine dioxygenase OS=Cucurbita moschata OX=3662 GN=LOC111435532 PE=3 SV=1)

HSP 1 Score: 513.1 bits (1320), Expect = 7.6e-142
Identity = 251/286 (87.76%), Postives = 266/286 (93.01%), Query Frame = 0

Query: 1   MGIERSLADRKGKQFCELPKEITTNNKTRKNRRRPKKPSSPVQKLYETCKQVFASSETGI 60
           MG ERSLADRKGKQF ELPKE T NNK+RKNRRR KKPSSP+QKLYETCKQVFAS++TGI
Sbjct: 1   MGFERSLADRKGKQFFELPKETTKNNKSRKNRRRSKKPSSPIQKLYETCKQVFASTDTGI 60

Query: 61  VPPSEDIQRLRTVLDKMKPVDVGLSPEMPYFRTTAGERTPPPITYLHLYESNKFSMGIFC 120
           VP  EDIQRL++VLDKMK VDVGLSPEMPYFRTTA E T PPITYLHLYE+NKFSMGIFC
Sbjct: 61  VPSLEDIQRLQSVLDKMKAVDVGLSPEMPYFRTTADEGT-PPITYLHLYENNKFSMGIFC 120

Query: 121 LPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENGTSVSVDTSDGTAPSRSIQL 180
           LPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENG S  VD S+GTAPS SI+L
Sbjct: 121 LPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENGISAGVDASNGTAPS-SIRL 180

Query: 181 AKVKVDSNFTAPCDSSILYPADGGNMHCFTAVTACAVLDVLGPPYSDCDGRHCSYYLDFP 240
           AKVKVD+NFTAPCDS+ILYPADGGNMHCFTAVTACAVLDVLGPPYSD DGRHCSYYLDFP
Sbjct: 181 AKVKVDANFTAPCDSTILYPADGGNMHCFTAVTACAVLDVLGPPYSDHDGRHCSYYLDFP 240

Query: 241 FTEFSVDGVSIPEEERENYAWLEEREQPEDLAAVGAVYRGPKIVEN 287
           FT+FSVDG S+PE ERE+YAWLEEREQPEDLAAVGA YRGPKIVE+
Sbjct: 241 FTKFSVDGKSVPEAERESYAWLEEREQPEDLAAVGAEYRGPKIVES 284

BLAST of Tan0002843 vs. ExPASy TrEMBL
Match: A0A5D3DPQ3 (Cysteine dioxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G008190 PE=3 SV=1)

HSP 1 Score: 511.1 bits (1315), Expect = 2.9e-141
Identity = 248/287 (86.41%), Postives = 266/287 (92.68%), Query Frame = 0

Query: 1   MGIERSLADRKGKQFCELPKEITTNNKTRKNRRRPKKPSS--PVQKLYETCKQVFASSET 60
           MGIERSLADRKGKQFCELPKE TTNNK RKNRRR +K SS  PVQKLYETCK+VFASS T
Sbjct: 1   MGIERSLADRKGKQFCELPKETTTNNKPRKNRRRMRKSSSPLPVQKLYETCKEVFASSGT 60

Query: 61  GIVPPSEDIQRLRTVLDKMKPVDVGLSPEMPYFRTTAGERTPPPITYLHLYESNKFSMGI 120
           GIVP SEDI+RLR VLDKM+PVDVGLSP+MPYF TTA ++T PPITYLHLYE+NKFSMGI
Sbjct: 61  GIVPSSEDIERLRAVLDKMEPVDVGLSPDMPYFWTTASQQT-PPITYLHLYENNKFSMGI 120

Query: 121 FCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENGTSVSVDTSDGTAPSRSI 180
           FCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENG S  VDTS GTAPSRS+
Sbjct: 121 FCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENGASACVDTSSGTAPSRSV 180

Query: 181 QLAKVKVDSNFTAPCDSSILYPADGGNMHCFTAVTACAVLDVLGPPYSDCDGRHCSYYLD 240
           +LAKVKVD++FTAPCDSSILYPADGGNMHCFTAVTACAVLDVLGPPYSD DGRHCSYYLD
Sbjct: 181 RLAKVKVDADFTAPCDSSILYPADGGNMHCFTAVTACAVLDVLGPPYSDPDGRHCSYYLD 240

Query: 241 FPFTEFSVDGVSIPEEERENYAWLEEREQPEDLAAVGAVYRGPKIVE 286
           FP ++FSVD +S+PE ERE+YAWLEEREQPEDLAAVGA+Y GPKIVE
Sbjct: 241 FPLSDFSVDNISVPEVERESYAWLEEREQPEDLAAVGALYEGPKIVE 286

BLAST of Tan0002843 vs. ExPASy TrEMBL
Match: A0A1S3B7Z3 (Cysteine dioxygenase OS=Cucumis melo OX=3656 GN=LOC103487016 PE=3 SV=1)

HSP 1 Score: 511.1 bits (1315), Expect = 2.9e-141
Identity = 248/287 (86.41%), Postives = 266/287 (92.68%), Query Frame = 0

Query: 1   MGIERSLADRKGKQFCELPKEITTNNKTRKNRRRPKKPSS--PVQKLYETCKQVFASSET 60
           MGIERSLADRKGKQFCELPKE TTNNK RKNRRR +K SS  PVQKLYETCK+VFASS T
Sbjct: 1   MGIERSLADRKGKQFCELPKETTTNNKPRKNRRRMRKSSSPLPVQKLYETCKEVFASSGT 60

Query: 61  GIVPPSEDIQRLRTVLDKMKPVDVGLSPEMPYFRTTAGERTPPPITYLHLYESNKFSMGI 120
           GIVP SEDI+RLR VLDKM+PVDVGLSP+MPYF TTA ++T PPITYLHLYE+NKFSMGI
Sbjct: 61  GIVPSSEDIERLRAVLDKMEPVDVGLSPDMPYFWTTASQQT-PPITYLHLYENNKFSMGI 120

Query: 121 FCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENGTSVSVDTSDGTAPSRSI 180
           FCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENG S  VDTS GTAPSRS+
Sbjct: 121 FCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENGASACVDTSSGTAPSRSV 180

Query: 181 QLAKVKVDSNFTAPCDSSILYPADGGNMHCFTAVTACAVLDVLGPPYSDCDGRHCSYYLD 240
           +LAKVKVD++FTAPCDSSILYPADGGNMHCFTAVTACAVLDVLGPPYSD DGRHCSYYLD
Sbjct: 181 RLAKVKVDADFTAPCDSSILYPADGGNMHCFTAVTACAVLDVLGPPYSDPDGRHCSYYLD 240

Query: 241 FPFTEFSVDGVSIPEEERENYAWLEEREQPEDLAAVGAVYRGPKIVE 286
           FP ++FSVD +S+PE ERE+YAWLEEREQPEDLAAVGA+Y GPKIVE
Sbjct: 241 FPLSDFSVDNISVPEVERESYAWLEEREQPEDLAAVGALYEGPKIVE 286

BLAST of Tan0002843 vs. ExPASy TrEMBL
Match: A0A6J1I853 (Cysteine dioxygenase OS=Cucurbita maxima OX=3661 GN=LOC111470048 PE=3 SV=1)

HSP 1 Score: 510.8 bits (1314), Expect = 3.8e-141
Identity = 252/286 (88.11%), Postives = 265/286 (92.66%), Query Frame = 0

Query: 1   MGIERSLADRKGKQFCELPKEITTNNKTRKNRRRPKKPSSPVQKLYETCKQVFASSETGI 60
           MG ERSLADRKGKQF ELPKE T NNK+RKNRRR KKPSSP+QKLYETCKQVFAS+ETGI
Sbjct: 1   MGFERSLADRKGKQFFELPKETTKNNKSRKNRRRSKKPSSPIQKLYETCKQVFASTETGI 60

Query: 61  VPPSEDIQRLRTVLDKMKPVDVGLSPEMPYFRTTAGERTPPPITYLHLYESNKFSMGIFC 120
           VP  EDIQRL++VLDKMK VDVGLSPEMPYFRTTA E T PPITYLHLYE+NKFSMGIFC
Sbjct: 61  VPSLEDIQRLQSVLDKMKAVDVGLSPEMPYFRTTADEGT-PPITYLHLYENNKFSMGIFC 120

Query: 121 LPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENGTSVSVDTSDGTAPSRSIQL 180
           LPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENG   SVD S+GTAPS SI+L
Sbjct: 121 LPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENGILASVDASNGTAPS-SIRL 180

Query: 181 AKVKVDSNFTAPCDSSILYPADGGNMHCFTAVTACAVLDVLGPPYSDCDGRHCSYYLDFP 240
           AKVKVD+NFTAPCDS+ILYPADGGNMHCFTAVTACAVLDVLGPPYSD DGRHCSYYLDFP
Sbjct: 181 AKVKVDANFTAPCDSTILYPADGGNMHCFTAVTACAVLDVLGPPYSDHDGRHCSYYLDFP 240

Query: 241 FTEFSVDGVSIPEEERENYAWLEEREQPEDLAAVGAVYRGPKIVEN 287
           FT+FSVDG SIPE ERE+YAWLEEREQPEDLAAVGA Y GPKIVE+
Sbjct: 241 FTKFSVDGKSIPEAERESYAWLEEREQPEDLAAVGAEYIGPKIVES 284

BLAST of Tan0002843 vs. ExPASy TrEMBL
Match: A0A0A0LC53 (Cysteine dioxygenase OS=Cucumis sativus OX=3659 GN=Csa_3G827270 PE=3 SV=1)

HSP 1 Score: 509.2 bits (1310), Expect = 1.1e-140
Identity = 247/287 (86.06%), Postives = 266/287 (92.68%), Query Frame = 0

Query: 1   MGIERSLADRKGKQFCELPKEITTNNKTRKNRRRPKKPSS--PVQKLYETCKQVFASSET 60
           MGIERSLADRKGKQFCELPKE TTNNK+RK+RRR ++ SS  PVQKLYETCK+VFASS T
Sbjct: 1   MGIERSLADRKGKQFCELPKETTTNNKSRKSRRRMRRSSSPLPVQKLYETCKKVFASSGT 60

Query: 61  GIVPPSEDIQRLRTVLDKMKPVDVGLSPEMPYFRTTAGERTPPPITYLHLYESNKFSMGI 120
           GIVP SEDI+RL+ VLDKMKPVDVGLSP+MPYF TT+ +RT PPITYLHLYE+NKFSMGI
Sbjct: 61  GIVPSSEDIERLQAVLDKMKPVDVGLSPDMPYFWTTSSQRT-PPITYLHLYENNKFSMGI 120

Query: 121 FCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENGTSVSVDTSDGTAPSRSI 180
           FCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAE GA NG S  VDTS GTAPSRS+
Sbjct: 121 FCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEAGAVNGASACVDTSSGTAPSRSV 180

Query: 181 QLAKVKVDSNFTAPCDSSILYPADGGNMHCFTAVTACAVLDVLGPPYSDCDGRHCSYYLD 240
           +LAKVKVD++FTAPCDSSILYPADGGNMHCFTAVTACAVLDVLGPPYSD DGRHCSYYLD
Sbjct: 181 RLAKVKVDADFTAPCDSSILYPADGGNMHCFTAVTACAVLDVLGPPYSDPDGRHCSYYLD 240

Query: 241 FPFTEFSVDGVSIPEEERENYAWLEEREQPEDLAAVGAVYRGPKIVE 286
           FPFTEFSVD +S+PE ERE+YAWLEEREQPEDLAAVGA+Y GPKIVE
Sbjct: 241 FPFTEFSVDRISVPEAERESYAWLEEREQPEDLAAVGALYEGPKIVE 286

BLAST of Tan0002843 vs. TAIR 10
Match: AT5G39890.1 (Protein of unknown function (DUF1637) )

HSP 1 Score: 332.8 bits (852), Expect = 2.7e-91
Identity = 162/266 (60.90%), Postives = 198/266 (74.44%), Query Frame = 0

Query: 21  EITTNNKTRKNRRRPKKPSSPVQKLYETCKQVFASSETGIVPPSEDIQRLRTVLDKMKPV 80
           E  +N++ +  RR  K    PVQKL++TCK+VFA  ++G VP  E+I+ LR VLD++KP 
Sbjct: 26  ENRSNSRKKIQRRSKKTLICPVQKLFDTCKKVFADGKSGTVPSQENIEMLRAVLDEIKPE 85

Query: 81  DVGLSPEMPYFRTTAGERTPPPITYLHLYESNKFSMGIFCLPPSGVIPLHNHPGMTVFSK 140
           DVG++P+M YFR+T   R+ P +TYLH+Y  ++FS+ IFCLPPSGVIPLHNHP MTVFSK
Sbjct: 86  DVGVNPKMSYFRSTVTGRS-PLVTYLHIYACHRFSICIFCLPPSGVIPLHNHPEMTVFSK 145

Query: 141 LLFGTMHIKAYDWAEVGAENGTSVSVDTSDGTAPSRSIQLAKVKVDSNFTAPCDSSILYP 200
           LLFGTMHIK+YDW                D   PS   +LAKVKVDS+FTAPCD+SILYP
Sbjct: 146 LLFGTMHIKSYDW--------------VPDSPQPSSDTRLAKVKVDSDFTAPCDTSILYP 205

Query: 201 ADGGNMHCFTAVTACAVLDVLGPPYSDCDGRHCSYYLDFPFTEFSVDGVSIPEEERENYA 260
           ADGGNMHCFTA TACAVLDV+GPPYSD  GRHC+YY D+PF+ FSVDGV + EEE+E YA
Sbjct: 206 ADGGNMHCFTAKTACAVLDVIGPPYSDPAGRHCTYYFDYPFSSFSVDGVVVAEEEKEGYA 265

Query: 261 WLEEREQ-PEDLAAVGAVYRGPKIVE 286
           WL+ERE+ PEDL     +Y GP I E
Sbjct: 266 WLKEREEKPEDLTVTALMYSGPTIKE 276

BLAST of Tan0002843 vs. TAIR 10
Match: AT5G15120.1 (Protein of unknown function (DUF1637) )

HSP 1 Score: 305.1 bits (780), Expect = 6.1e-83
Identity = 156/281 (55.52%), Postives = 195/281 (69.40%), Query Frame = 0

Query: 19  PKEITTNNKTRKNRR----RPKKPSSP------VQKLYETCKQVFASSETGIVPPSEDIQ 78
           P  +   NK +  +     R KK  SP      V++L+ TCK+VF++   G++P  + IQ
Sbjct: 25  PNSVKKKNKNKNKKMMMTWRRKKIDSPADGITAVRRLFNTCKEVFSNGGPGVIPSEDKIQ 84

Query: 79  RLRTVLDKMKPVDVGLSPEMPYFRTTAG--ERTPPPITYLHLYESNKFSMGIFCLPPSGV 138
           +LR +LD MKP DVGL+P MPYFR  +G   R+ PPITYLHL++ ++FS+GIFCLPPSGV
Sbjct: 85  QLREILDDMKPEDVGLTPTMPYFRPNSGVEARSSPPITYLHLHQCDQFSIGIFCLPPSGV 144

Query: 139 IPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENGTSVSVDTSDGTAPSRSIQLAKVKVD 198
           IPLHNHPGMTVFSKLLFGTMHIK+YDW                D        +LAK+KVD
Sbjct: 145 IPLHNHPGMTVFSKLLFGTMHIKSYDW--------------VVDAPMRDSKTRLAKLKVD 204

Query: 199 SNFTAPCDSSILYPADGGNMHCFTAVTACAVLDVLGPPYSDCDGRHCSYYLDFPFTEFSV 258
           S FTAPC++SILYP DGGNMH FTA+TACAVLDVLGPPY + +GRHC+Y+L+FP  + S 
Sbjct: 205 STFTAPCNASILYPEDGGNMHRFTAITACAVLDVLGPPYCNPEGRHCTYFLEFPLDKLSS 264

Query: 259 --DGVSIPEEERENYAWLEER-EQPED-LAAVGAVYRGPKI 284
             D V   EEE+E YAWL+ER + PED    VGA+YRGPK+
Sbjct: 265 EDDDVLSSEEEKEGYAWLQERDDNPEDHTNVVGALYRGPKV 291

BLAST of Tan0002843 vs. TAIR 10
Match: AT2G42670.2 (Protein of unknown function (DUF1637) )

HSP 1 Score: 205.3 bits (521), Expect = 6.6e-53
Identity = 110/255 (43.14%), Postives = 151/255 (59.22%), Query Frame = 0

Query: 43  QKLYETCKQVFASSETGIVPPSED-IQRLRTVLDKMKPVDVGLSPEMPYFRTTAG----- 102
           Q+LY TCK  F+S      P +ED ++++R VL+K+KP DVG+  +    R+ +G     
Sbjct: 6   QRLYNTCKASFSSDG----PITEDALEKVRNVLEKIKPSDVGIEQDAQLARSRSGPLNER 65

Query: 103 ---ERTPPPITYLHLYESNKFSMGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDW 162
               ++PP I YLHL+E + FS+GIFC+PPS +IPLHNHPGMTV SKL++G+MH+K+YDW
Sbjct: 66  NGSNQSPPAIKYLHLHECDSFSIGIFCMPPSSMIPLHNHPGMTVLSKLVYGSMHVKSYDW 125

Query: 163 AEVGAENGTSVSVDTSDGTAPSRSIQLAKVKVDSNFTAPCDSSILYPADGGNMHCFTAVT 222
            E             ++   PS+  + AK+  D+  TA    + LYP  GGN+HCF A+T
Sbjct: 126 LE----------PQLTEPEDPSQEARPAKLVKDTEMTAQSPVTTLYPKSGGNIHCFKAIT 185

Query: 223 ACAVLDVLGPPYSDCDGRHCSYYL-----DFPFTEFSVDGVSIPEEERENYAWLEEREQP 282
            CA+LD+L PPYS    RHC+Y+      D P  E  VDG     E   +  WLEE + P
Sbjct: 186 HCAILDILAPPYSSEHDRHCTYFRKSRREDLP-GELEVDG-----EVVTDVTWLEEFQPP 240

Query: 283 EDLAAVGAVYRGPKI 284
           +D       YRGP I
Sbjct: 246 DDFVIRRIPYRGPVI 240

BLAST of Tan0002843 vs. TAIR 10
Match: AT2G42670.1 (Protein of unknown function (DUF1637) )

HSP 1 Score: 201.4 bits (511), Expect = 9.5e-52
Identity = 110/255 (43.14%), Postives = 152/255 (59.61%), Query Frame = 0

Query: 43  QKLYETCKQVFASSETGIVPPSED-IQRLRTVLDKMKPVDVGLSPEMPYFRTTAG----- 102
           Q+LY TCK  F+S      P +ED ++++R VL+K+KP DVG+  +    R+ +G     
Sbjct: 6   QRLYNTCKASFSSDG----PITEDALEKVRNVLEKIKPSDVGIEQDAQLARSRSGPLNER 65

Query: 103 ---ERTPPPITYLHLYESNKFSMGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDW 162
               ++PP I YLHL+E + FS+GIFC+PPS +IPLHNHPGMTV SKL++G+MH+K+YDW
Sbjct: 66  NGSNQSPPAIKYLHLHECDSFSIGIFCMPPSSMIPLHNHPGMTVLSKLVYGSMHVKSYDW 125

Query: 163 AEVGAENGTSVSVDTSDGTAPSRSIQLAKVKVDSNFTAPCDSSILYPADGGNMHCFTAVT 222
            E             ++   PS++ + AK+  D+  TA    + LYP  GGN+HCF A+T
Sbjct: 126 LE----------PQLTEPEDPSQA-RPAKLVKDTEMTAQSPVTTLYPKSGGNIHCFKAIT 185

Query: 223 ACAVLDVLGPPYSDCDGRHCSYYL-----DFPFTEFSVDGVSIPEEERENYAWLEEREQP 282
            CA+LD+L PPYS    RHC+Y+      D P  E  VDG     E   +  WLEE + P
Sbjct: 186 HCAILDILAPPYSSEHDRHCTYFRKSRREDLP-GELEVDG-----EVVTDVTWLEEFQPP 239

Query: 283 EDLAAVGAVYRGPKI 284
           +D       YRGP I
Sbjct: 246 DDFVIRRIPYRGPVI 239

BLAST of Tan0002843 vs. TAIR 10
Match: AT1G18490.1 (Protein of unknown function (DUF1637) )

HSP 1 Score: 200.7 bits (509), Expect = 1.6e-51
Identity = 107/262 (40.84%), Postives = 146/262 (55.73%), Query Frame = 0

Query: 34  RPKKPSSPVQKLYETCKQVFASSETGIVPPSEDIQRLRTVLDKMKPVDVGLSPEMP---- 93
           R ++ S  VQ+LY+ CK+ F        P S  IQ+L +VLD + P DVGL         
Sbjct: 27  RNQEKSPKVQELYDLCKETFTGKAPS--PASMAIQKLCSVLDSVSPADVGLEEVSQDDDR 86

Query: 94  ------YFRTTAGERTPPPITYLHLYESNKFSMGIFCLPPSGVIPLHNHPGMTVFSKLLF 153
                   R     R   PIT+L ++E + F+M IFC P S VIPLH+HP M VFSK+L+
Sbjct: 87  GYGVSGVSRFNRVGRWAQPITFLDIHECDTFTMCIFCFPTSSVIPLHDHPEMAVFSKILY 146

Query: 154 GTMHIKAYDWAEVGAENGTSVSVDTSDGTAPSRSIQLAKVKVDSNFTAPCDSSILYPADG 213
           G++H+KAYDW E          +    G   S   +LAK+  D   T   +   LYP  G
Sbjct: 147 GSLHVKAYDWVE------PPCIITQDKGVPGSLPARLAKLVSDKVITPQSEIPALYPKTG 206

Query: 214 GNMHCFTAVTACAVLDVLGPPYSDCDGRHCSYYLDFPFTEFSVDG--VSIPEEERENYAW 273
           GN+HCFTA+T CAVLD+L PPY +  GR CSYY+D+PF+ F+++     + E + + YAW
Sbjct: 207 GNLHCFTALTPCAVLDILSPPYKESVGRSCSYYMDYPFSTFALENGMKKVDEGKEDEYAW 266

Query: 274 LEEREQPEDLAAVGAVYRGPKI 284
           L + + P+DL      Y GP I
Sbjct: 267 LVQIDTPDDLHMRPGSYTGPTI 280

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q8LGJ53.8e-9060.90Plant cysteine oxidase 2 OS=Arabidopsis thaliana OX=3702 GN=PCO2 PE=1 SV=1[more]
Q9LXG98.6e-8255.52Plant cysteine oxidase 1 OS=Arabidopsis thaliana OX=3702 GN=PCO1 PE=1 SV=1[more]
Q9SJI91.3e-5043.14Plant cysteine oxidase 4 OS=Arabidopsis thaliana OX=3702 GN=PCO4 PE=1 SV=2[more]
Q1G3U62.3e-5040.84Plant cysteine oxidase 3 OS=Arabidopsis thaliana OX=3702 GN=PCO3 PE=1 SV=1[more]
Q9LXT43.9e-5041.83Plant cysteine oxidase 5 OS=Arabidopsis thaliana OX=3702 GN=PCO5 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
KAG6588381.11.2e-14188.11Plant cysteine oxidase 2, partial [Cucurbita argyrosperma subsp. sororia][more]
XP_022928708.11.6e-14187.76plant cysteine oxidase 2-like isoform X2 [Cucurbita moschata][more]
XP_023531691.12.1e-14188.11plant cysteine oxidase 2-like isoform X2 [Cucurbita pepo subsp. pepo][more]
XP_038903013.16.0e-14185.76plant cysteine oxidase 2 [Benincasa hispida][more]
XP_008443425.16.0e-14186.41PREDICTED: plant cysteine oxidase 2 [Cucumis melo] >TYK25656.1 plant cysteine ox... [more]
Match NameE-valueIdentityDescription
A0A6J1EPV07.6e-14287.76Cysteine dioxygenase OS=Cucurbita moschata OX=3662 GN=LOC111435532 PE=3 SV=1[more]
A0A5D3DPQ32.9e-14186.41Cysteine dioxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352... [more]
A0A1S3B7Z32.9e-14186.41Cysteine dioxygenase OS=Cucumis melo OX=3656 GN=LOC103487016 PE=3 SV=1[more]
A0A6J1I8533.8e-14188.11Cysteine dioxygenase OS=Cucurbita maxima OX=3661 GN=LOC111470048 PE=3 SV=1[more]
A0A0A0LC531.1e-14086.06Cysteine dioxygenase OS=Cucumis sativus OX=3659 GN=Csa_3G827270 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT5G39890.12.7e-9160.90Protein of unknown function (DUF1637) [more]
AT5G15120.16.1e-8355.52Protein of unknown function (DUF1637) [more]
AT2G42670.26.6e-5343.14Protein of unknown function (DUF1637) [more]
AT2G42670.19.5e-5243.14Protein of unknown function (DUF1637) [more]
AT1G18490.11.6e-5140.84Protein of unknown function (DUF1637) [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR014710RmlC-like jelly roll foldGENE3D2.60.120.10Jelly Rollscoord: 36..256
e-value: 6.4E-11
score: 43.7
IPR012864Cysteine oxygenase/2-aminoethanethiol dioxygenasePFAMPF07847PCO_ADOcoord: 73..283
e-value: 2.1E-72
score: 242.8
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..28
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..42
NoneNo IPR availablePANTHERPTHR22966:SF1PLANT CYSTEINE OXIDASE 2coord: 22..286
NoneNo IPR availablePANTHERPTHR22966UNCHARACTERIZEDcoord: 22..286
NoneNo IPR availableCDDcd20289cupin_ADOcoord: 112..226
e-value: 5.5724E-42
score: 138.067
IPR011051RmlC-like cupin domain superfamilySUPERFAMILY51182RmlC-like cupinscoord: 38..236

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0002843.1Tan0002843.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0070483 detection of hypoxia
biological_process GO:0018171 peptidyl-cysteine oxidation
cellular_component GO:0005829 cytosol
cellular_component GO:0005634 nucleus
molecular_function GO:0017172 cysteine dioxygenase activity
molecular_function GO:0005506 iron ion binding
molecular_function GO:0016702 oxidoreductase activity, acting on single donors with incorporation of molecular oxygen, incorporation of two atoms of oxygen