Cp4.1LG05g15090 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG05g15090
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
Description2-aminoethanethiol dioxygenase
LocationCp4.1LG05 : 10404229 .. 10412229 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACGTAATATAAAGCCATTCCGTCGTCTTCCTTCTTCTGATTCTAAGCTTGGGTGCGATCTCTCAGCTCGGTGCAGCTCCTCAGGTACCAAATCCTCCGACGACTTCCTTCTTTTTTCACTTTCTAAATCTCTTCCTCACTATTCTCCTTCGCTTTCCTCGCTCCCTTGGCTGGCCTTTTCAGATTATGGTTCTGTAAGTTTATGATGCTCGAAATTATACTACTACTGTGCTCTTTAATCCGTCTTTCTCTTCGATTCTTGTTCTTATACCTAGCGACTGAATTTCACATTTTCTAGTTCGTTTAATTGCTCTCTACTTCATCAGAATTTGGTTTTCCTCCTTGATGAGGTTTTCTAATTGTTCATATGGCAAGATGTTTACTGTTTTCCACTGTTCTTTCTTTATATCGCACTTGTATTGGTTCTTGAAATATGATGGAGTACTCGAGACATCTGGTTCGTGCTTAGGATATGATTTATTGATTTCCGGAAAACGAAGAACCACGGTGTTTAGTTTCCTAATTCTTTAGTTTTTCTTGCACTATCCTAGCTGCGATAGTGATAATTGATGCAGTGAATAACTCAATTTGATTATCCTGTTTCTAAAGTTGATGATCTACTTGACGAACTTTATTGTTCCTTTACGCGTGGAAGAAGTACTGTGGATGGTTGTGGTTTTAATAACTCTTGTTATGCTCTTGAGCTGGTATTTCTATTTCGAACTTAGCCGATTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNCCATGCAGATCTATTGCTGTTGGCTACGTATGCAGAATAAGTTCGAGTTACCATCCTGTTATAGTTTTTATCATCATTATAGTTGATAAACGTCTTGTAAAGACGTTATCAGCTTACTGGCTTCGGGCTTCAGTCCTCTCAGTTAAGTTGGTATCGTCTTTGCGTCATGCCATATTACATACAGAGACTCTATAATACTTGTAAATCTGCGTTGTCTCCTAATGGACCCGTGTCGGAAGAAGCCCTCGAAAAAGTTCGTGCCATGTTAGGTATGATATGATTTCAGTTTACATCCAATTTTGAACTAATCCTTCTGTCTCATCTTATTTGACCAATCAAGATTTTAATTAGGATGTTTCATGGAATGGATTTTACTTTTTATCACTGTCTTTGGAACCCTTCAGTGATATAACTATTATAGGATAATCAGAATTCAGGCTGTAGAGTTTTTAATGAACAGGTTCTACAATTACTTCATGAATTGGTTAATTTTAGATAGTGTTAGCATGACCACAAGTCCAATTGGCTTACATTTACTTCGAGTAGTGGATCGAGTCCCATTGAGGCAAGTGAAATTCTGCATTTCCTGGTTTTAGGAGTTTTAACTTGTTAGTATTATGGTGATTCTGAAAGCCATGCATCGTACATGGGGCATGCAACACACCTACTGTTTGGTTAAGTACTTTGGTTTAGGCTTAAAATGAAATTACTTACAAGCATAGCTTGAAATACATTTGTTCGAGTAATCTCATTAATGGCTTGAAGTGATATCGGAAATACAATGCTGCTATTTATGGTTCATGTGATTTACTTTATGGGTCGTAGTGAGGAGGGTTAATGTAACACATTAGATATGTTAGTGGAGTAAACCCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAANAGAGAGAGAAGTTCAGTGATACTATCTTTGTTTTCATCTTCACAACATGGTAAATATATTTCTAAAATTCATTTTCTTTGAAAGTTGCATCATGCGTTAGGTGCCCATCAATTGCCTATTTTGCTGTTAGTTTTGCACATTTAATCATAGACCATTTGATTGCAGAAAAAGTCAAGCCCTCTGATGTAGGTCTTGAACAGGAGGCGCAAGTCGTTCGTAATTGGCAGGAACCTGTGCAGGAACGTAATGGGAGACGGCAACCATTGCCACCAATTAAATACTTGCATTTGCATGAGTGTGATAGCTTCTCCGTAAGCATCTATTTTCCAATCATTTTATCATTGGGAAGTAAAATCCTTGATAATTACTTATTTATTAAAGACTTTTTCTATTGGAAAACAACTAGACTTATTCTTGGATGGTAAGGACTAAGTGAAATTCCTATTATGAACTCTTTTGCAGATAGGTATTTTCTGTATGCCTCCAACTTCTATCATACCTTTTCATAATCATCCTGGAATGACGGTGTTGAGCAAACTAATTTATGGTTCCTTACACGTCAAGTCATATGACTGGTTTGATTTTCCAGGACTTGATGATATTTCCGAAGGTTTGTTACTATTACTTATTGTCATCCCATCCTGGCTTTGATTTTGTCAATTTCCTTATTCTATATACACATGCCTAGAGAATAAGCGTAAATGGAAGTTACCATTAATTCTCCCTTGTATTAACCAATTCATGAAGTATACTCTTTTAGCATAGTCGTACCTTTCTTTTTTTTGATAAGGGAAAACATGCCTTCACTTTTTTACTTGGATAAGAAACAATTTCATTTCATTGATGTATAAAATTACAAAAGGGATGAAAAATCCAGGAGAACTACGAAAAACATTTCTAGTTGATTAAAAAGGAAGCGTAACTATATGAAGCAAAGATGTTAGACAATTTACACCAAGAGATAGGATAATAACAAACAAAATCAAAGAAAACATCCAGATGAACTTCAGATAGGATAATAACAAACAAAATCAAAGAAAACATTCGGATGAACTTCCTTATCGAAGAATATACTTTGATTTATTTCCCCAAAAATCCCTAAAAACAAGCCCAAGACACGTTCATCCAAAGAGTAGCTTTGGCTTCTTTGAAGGGATGGACCACATGGACGTGATTGAGGAGATCTCTCAATCCATCGGAAGAACCGTTATCCATCCAAAGGAACTAAGAATTATGCTCCAATATTTAGTGAACAAGTGATTTTGAGTTTCTTCATTAGCTTTACACAAAGAGCACAGCTGGGGGAGATGGCTCATAAGGCATACATTTTTGGAGGAGTACCAATACCAACAATACCAACAATGAAATGAAAGTTACTACCAATACCAACAAGGTGCATCTTCCTTTTCAGTGGCTCAATCATTGGAACTTCATAGTTAAGCGTACTTTACTTTGAGCAATTCTATTAGCTATGTTGGGTGACCTCATCGGAATTTTCCTAGAATGCATGTGAGTGAGGACAAAACATACTAACAGGACTCGTGTTTGTCTGTGGAGATAGTCTTCACTCTTAAGAAGGGAGAAGTAGGTAATGTAACCATGTCGTAGGGGATGTAGGGGATGCAGGGGAATGTTGGGGCCATCAAGTGCCAAATTTGGATTTCGAATGTTGATCTAAATCCTAGGCATGGGTCGTTACAAATGTCATCAGAGTGATACTTTCCTAGTAAGATGTGGTTCAAGGATGAACCAAGGCAAAAACTCGTGGGCATGTGACACGCGAATAAGGGGTTACATGAATGAACTGAGACCACATCCAAATGAAAGTGGTCCTGAGAATAGGATGACTAAGAAAGGTTTAAAGGAAATGAAAGTTACTACCTATACCAATAAGGTGCATCTTCCTTTTTGATGGCTCAATCATAGGAACTTCATGGTTAATCGTGTTTTGCTTTGAGCAATTCTATGTTAGGTGACCTCTTGGAAATTTTCCTACGAGACATGTGAGTGAGGACAAAGCATGTTGAAAGGACCTATGTTGGTCTGTGAGGATAGTGTTCACTCTTAGAAGCGAGAAGTAGGTAATTGACCATATCGCAAGGGATGCAGGGGAATGTCAGGGCCATCAGGTGCCAAAGTGAGATGCATGTGAGTGAGGACGAAGCATGTTAAAAGGACCTATGTTGGCTATGGGGATAGTCTTCACTCTTAGAAGCGAGAAGTAGGTAACTGACCATGTTGTTGGGGATGTAGGGGAATGTTGGGGCCATCAGGTGCTGAATCTGGATTTCGAATCCTAATCCAAATCCTAGGAGTGGGTTGTTACAAGGTCTTTGGTATTGATGGTGGAATGAGAAATTCCCATAAGAGAAATTTAATATTTTTTTGGGTACCAGTGCCTCCAAATAGCTCTAGCCAAAGATGATTCCAGTATTCACTGGAATTGTCAGACGTGAGAGATTTGCTGATGAAAATACTGTTTGCGCTAGACAACCAAATCCATGAATCTTCTCGCTACGCCCATTCAGAGGCTTCTTCATCCTTCAAATTCTGATCAAGTTTAAGATCCCAGTGAAATAATAGTGAGTTCCATACCTCCTTGACAGTGACCTAGTCAGCCTATATAAGAGGGGGAATTGTGAAGCCAAGGGAGTATTGGTTAACCAGGGGGCTGTCCATAAAGGTGTTGATGCACCACCTCCCCCAGTCTACTAATAACAAAGGCTTGGCGTTTGATAATAATTTTCCAAGGAACCTTTGAAGTGTTTAATGAAGACCATTCCAAGTGGATTGTAGCACCATATTTACCATTATTAAGTTTCTTTCAAAGAGTACCTTTTTTCATTATTGTATTGTCAAGTTCATTTTGGAAGGAGCGCTCTATTTTTATCTTTCAATGAATGAAGGCTAAGAGCACCTTTATTCAAGGGGAGAAGAATTTTATCCTATCTCACGAGATGGATTCTGGATCCAGAGGCCGACCCCTACCATAGAAAATTCCGAAAGGTCTTTCCACCTCATTAACTATCTTTCGAGGCATGTTAAAAGATTGAGAGGTAATAAATTGGGAGATCGGTGAGTGTGGCTTGGAGAAGAGTAAGTCTAACTCCCTAGGAGATAAAGTAGGATCCCCATGCTGAAAGCTTTCTCTCAATTTTCTCGATAAAATGGGGGCCTAACTTAATTGTTCGTGGATTTCGATTAAGTGGAAGGCTCGGGTATGATCTCGGCCAGCTTCCACATACCCCTTAACTTCAAGGTTGTATCAATAAAAAAACTCTAAAATAGTAATTGTGTCAATATAGACTTTAACTTTAATAATATATCAATTTATACCCCTGTCCGATTCTATTCATAAAATGTTGATTTACATGTGTGCAAACTTATAATTCTATGAATTAAACTTGTAAATTGTCATAAATGAATCAATTGAGACCTGTCTGTGTAGACTTAATCTAGACTCCCATTTGATAATTGTCAATTATTTGAAATAGAGTTTTAAATAGATCCACTTATAAAAGTTTAAAAGTTTAATGGATGTAGTTCTAATTTTGCACATGTATAAACTAACATTTGAATAAGTGATACACTTATCGATTATATTGATACAATTTTTACTTTAAGATTTTTATTGATGCAGCCACTAAAATTTAGGGATATTAATTCATTTTTCCCTAATTTTTACATTTCAAACCTGCGCCATTTATTCAGCCTGATTTATGTATGCTAATGCAGGACAGATGTACAAAGCCAAATTTTAATAGATAAAAGTGAGTGGAATTCTATTAAGTACTGAAAAGAGAGTTCAAAACCCATTGAAACATCTAAAATAAATTGAAAACTAGTCAAATTAAGAGTTAAAACAATTATATAAATCAAATTAACACATTCCCAGCAACGATGCCATCAATAGTGGGTGCTAAGAAAAGCTTTTTGGTTCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNTGACCAAGACTTACTGGAATGCCTTAAAAGTCATGTAAAACTTATTGGTATTGATATATACTCAATGGTTTTACTTGGATCCAAACTAGGGGCCCATATTTCTGAAAACTAAATTTAGTGGATGAACTTTCACGCTTGATACTAGATGATTATATAAAGATATAAGATTGTTTGCTGGAGCAAAGATGATAAGACTAGAAACTTTGCTTCTATTAACTTTATTGTTTAGGGTCAATTTTTGCAACTCCCTGTTCCATTACTCTCGAGGTAGATGCTCTTAATTGTTTGATTTCCATTTTTAGATTTATATGCTAACTTTAATTGGTGTCCTCACAGCTCGACCTGCAACGCTTGTAAAAGATACAGAAATGACGGCACCTACTCCAACCACTATTCTATATCCTACAAGTGGAGGCAATATCCATAGCTTCAAGGCCATCACACCTTGTGCTATTTTTGACATCCTTTCTCCCCCTTATTCATCAGAACATGGACGGCACTGTACTTATTTCCGGAAGTCTCATAGGAAGGATCTTCCTGGTAATTTCATTTGTGATTTATAATCCACAACTTGTAATACCATCTCTCTTAATTCTTGGTGTTGCTTACACACTTCTTACTGGATTTAGGTGATCTTGAATTGGATGGAGATGGAGTTTCAGTTTCTGAACTGACATGGTTAGAGGAATATCAACCTCCCGACAACTTAGTGATCCGGAGAGGGCTGTACAAGGGACCTGTGATTAGAACTTGATCAGTATTCCTGTTTTAGCTGTAGCATGCATTTGAAAACTGCATCTTCTAACTATGAATGATTGAGCGAAATTCCGGTATATGGACTGTTCATAGTCATTAAAAATAGCGATATTGACTCGACTTAGCTGGATATATTTGGCTTGCACATATGTTGCATTGCACCTAATTGACCATTTTCATTTATCTACCTCACTTTGTTTGATGATATATTTATGCAGGAATGGATTATGTTTTTGGGTGGAGGAGGAGAATTTCTAGGTAACTCATTTATTATCTGTCACGCTATCATATTTTTAAGAACGATGTGAGATCCATGTGTATGAAAAAAATATATAAATATTTACACTGGCAAAATCATTGTGTAGACCAATAGAATATGACACTCTCGTAGGTATTAGAAATTTTCTGGTGGAAAAGTTGGCTTCAGGGGTGTCATGTGGGTTTATTCACACCGTTGCTTTCAGCAGGTAAAAAACTAAGATGCAACTAGGCTATAGCCTTTCAACTAGATTGCTAATCCTATTGTTGAGATTTATTGGCCTTCTCATTTGTGGTCCTTTCCTTCTTAGAATGCACATGGAACTCATTACAATGCTAGTAAAGCTAGAAATCATTTCTGGATGATCAGTTTGGCCTGGAAAGATTGCTTCCTTCATGTGTTTGGTGTCTTCTATAGTCTATAACGAGACAAGTTCATGTGAACTTTTAAGTTGGCCACAAAAAGAGAACAGTTTTTTCTAGTAATTACTTTCTACAAAGCTGTTGGGAAAAAATAGAGTCTGATTTCACTCTGCACGCACACCCAATACTAATTTGTATAAGCTAGATCTCGGGAGGTTTTGTTAGCATAGTTGTAGTAGTTCCTCCATTAGTGCTCTTTGCTATTGGTTTTTCAGGCCTCATACCAACGTAGATGTAATCCTTACTTATAAATCTATGTTTAGCCTCTTAATTAGTCGGTGTGGAATTCCTCTCCCAATCATTACCAACTAGAAAGAGAATCGAGAATTAGATCTTAAACCGAGAACTAGAAAAAGAAAAAAGGGGTGAAATTGGAGAGGTTAGGGGTTTAGTTTTGGTGAGATATAATTGGTAAAAAAGGGGGTGAGGAGGAAGCTAATTGATCCTTCTCTTCCATAAAACATGGACATGTGATCTCCAACACTGACATTCTAAAATACAAAGCCACTGACACACTCTCTGTGTGTTGAGATCTCACAAAATGGCATTGTTTGCGGCGGCTCTTTTGCCAATGCTCACAAATTAGATTCCTTTTTTATACATCTTACACACACTCTCTCTTCAAAATTATGCCCTATCAATAATCATCCTTAAATCAGATATTATTCCACACAAAATTATTACCCCATTTAATTAGTGTCTTCAATAATAATTCATTTCCATTTAATATCCCAATTATTAGTTTAAATTACAAATTATTTGTATAAGTACGTCTCACTCTCGTTCTTTTATTATCGTTATTATTTTGTTCTATAATTTTAAGTTCTCTCCTTCTTTTGTTTAGTTCAACAATATTGAGGTGGGGATTCGAATTTCGACTCTTGATCAAAGG

mRNA sequence

ATGACATCTATTGCTGTTGGCTACCTTACTGGCTTCGGGCTTCAGTCCTCTCAGTTAAGTTGGTATCGTCTTTGCGTCATGCCATATTACATACAGAGACTCTATAATACTTGTAAATCTGCGTTGTCTCCTAATGGACCCGTGTCGGAAGAAGCCCTCGAAAAAGTTCGTGCCATGTTAGAAAAAGTCAAGCCCTCTGATGTAGGTCTTGAACAGGAGGCGCAAGTCGTTCGTAATTGGCAGGAACCTGTGCAGGAACGTAATGGGAGACGGCAACCATTGCCACCAATTAAATACTTGCATTTGCATGAGTGTGATAGCTTCTCCATAGGTATTTTCTGTATGCCTCCAACTTCTATCATACCTTTTCATAATCATCCTGGAATGACGGTGTTGAGCAAACTAATTTATGGTTCCTTACACGTCAAGTCATATGACTGGTTTGATTTTCCAGGACTTGATGATATTTCCGAAGCTCGACCTGCAACGCTTGTAAAAGATACAGAAATGACGGCACCTACTCCAACCACTATTCTATATCCTACAAGTGGAGGCAATATCCATAGCTTCAAGGCCATCACACCTTGTGCTATTTTTGACATCCTTTCTCCCCCTTATTCATCAGAACATGGACGGCACTGTACTTATTTCCGGAAGTCTCATAGGAAGGATCTTCCTGGTGATCTTGAATTGGATGGAGATGGAGTTTCAGTTTCTGAACTGACATGGTTAGAGGAATATCAACCTCCCGACAACTTAGTGATCCGGAGAGGGCTGTACAAGGGACCTTTCAACAATATTGAGGTGGGGATTCGAATTTCGACTCTTGATCAAAGG

Coding sequence (CDS)

ATGACATCTATTGCTGTTGGCTACCTTACTGGCTTCGGGCTTCAGTCCTCTCAGTTAAGTTGGTATCGTCTTTGCGTCATGCCATATTACATACAGAGACTCTATAATACTTGTAAATCTGCGTTGTCTCCTAATGGACCCGTGTCGGAAGAAGCCCTCGAAAAAGTTCGTGCCATGTTAGAAAAAGTCAAGCCCTCTGATGTAGGTCTTGAACAGGAGGCGCAAGTCGTTCGTAATTGGCAGGAACCTGTGCAGGAACGTAATGGGAGACGGCAACCATTGCCACCAATTAAATACTTGCATTTGCATGAGTGTGATAGCTTCTCCATAGGTATTTTCTGTATGCCTCCAACTTCTATCATACCTTTTCATAATCATCCTGGAATGACGGTGTTGAGCAAACTAATTTATGGTTCCTTACACGTCAAGTCATATGACTGGTTTGATTTTCCAGGACTTGATGATATTTCCGAAGCTCGACCTGCAACGCTTGTAAAAGATACAGAAATGACGGCACCTACTCCAACCACTATTCTATATCCTACAAGTGGAGGCAATATCCATAGCTTCAAGGCCATCACACCTTGTGCTATTTTTGACATCCTTTCTCCCCCTTATTCATCAGAACATGGACGGCACTGTACTTATTTCCGGAAGTCTCATAGGAAGGATCTTCCTGGTGATCTTGAATTGGATGGAGATGGAGTTTCAGTTTCTGAACTGACATGGTTAGAGGAATATCAACCTCCCGACAACTTAGTGATCCGGAGAGGGCTGTACAAGGGACCTTTCAACAATATTGAGGTGGGGATTCGAATTTCGACTCTTGATCAAAGG

Protein sequence

MTSIAVGYLTGFGLQSSQLSWYRLCVMPYYIQRLYNTCKSALSPNGPVSEEALEKVRAMLEKVKPSDVGLEQEAQVVRNWQEPVQERNGRRQPLPPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYDWFDFPGLDDISEARPATLVKDTEMTAPTPTTILYPTSGGNIHSFKAITPCAIFDILSPPYSSEHGRHCTYFRKSHRKDLPGDLELDGDGVSVSELTWLEEYQPPDNLVIRRGLYKGPFNNIEVGIRISTLDQR
BLAST of Cp4.1LG05g15090 vs. Swiss-Prot
Match: PCO5_ARATH (Plant cysteine oxidase 5 OS=Arabidopsis thaliana GN=PCO5 PE=1 SV=1)

HSP 1 Score: 371.3 bits (952), Expect = 9.2e-102
Identity = 177/239 (74.06%), Postives = 198/239 (82.85%), Query Frame = 1

Query: 27  MPYYIQRLYNTCKSALSPNGPVSEEALEKVRAMLEKVKPSDVGLEQEAQVVRNWQEPVQE 86
           MPY+IQRL+NTCKS+LSPNGPVSEEAL+KVR +LEK+KPSDVGLEQEAQ+VRNW  P  E
Sbjct: 1   MPYFIQRLFNTCKSSLSPNGPVSEEALDKVRNVLEKIKPSDVGLEQEAQLVRNWPGPGNE 60

Query: 87  RNGRRQPLPPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYD 146
           RNG    LP IKYL LHECDSFSIGIFCMPP SIIP HNHPGMTVLSKL+YGS+HVKSYD
Sbjct: 61  RNGNHHSLPAIKYLQLHECDSFSIGIFCMPPGSIIPLHNHPGMTVLSKLVYGSMHVKSYD 120

Query: 147 WF--DFPGLDDISEARPATLVKDTEMTAPTPTTILYPTSGGNIHSFKAITPCAIFDILSP 206
           W   D   LDD  +ARPA LVKD +MT+P+P T LYPT+GGNIH FKAIT CAIFDILSP
Sbjct: 121 WAEPDQSELDDPLQARPAKLVKDIDMTSPSPATTLYPTTGGNIHCFKAITHCAIFDILSP 180

Query: 207 PYSSEHGRHCTYFRKSHRKDLPGDLELDGDGVSVSELTWLEEYQPPDNLVIRRGLYKGP 264
           PYSS HGRHC YFRKS   DLPG++E+  +G  +S +TWLEEYQPPDN VI R  Y+GP
Sbjct: 181 PYSSTHGRHCNYFRKSPMLDLPGEIEV-MNGEVISNVTWLEEYQPPDNFVIWRVPYRGP 238

BLAST of Cp4.1LG05g15090 vs. Swiss-Prot
Match: PCO4_ARATH (Plant cysteine oxidase 4 OS=Arabidopsis thaliana GN=PCO4 PE=1 SV=2)

HSP 1 Score: 360.5 bits (924), Expect = 1.6e-98
Identity = 169/240 (70.42%), Postives = 202/240 (84.17%), Query Frame = 1

Query: 27  MPYYIQRLYNTCKSALSPNGPVSEEALEKVRAMLEKVKPSDVGLEQEAQVVRNWQEPVQE 86
           MPY+ QRLYNTCK++ S +GP++E+ALEKVR +LEK+KPSDVG+EQ+AQ+ R+   P+ E
Sbjct: 1   MPYFAQRLYNTCKASFSSDGPITEDALEKVRNVLEKIKPSDVGIEQDAQLARSRSGPLNE 60

Query: 87  RNGRRQPLPPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYD 146
           RNG  Q  P IKYLHLHECDSFSIGIFCMPP+S+IP HNHPGMTVLSKL+YGS+HVKSYD
Sbjct: 61  RNGSNQSPPAIKYLHLHECDSFSIGIFCMPPSSMIPLHNHPGMTVLSKLVYGSMHVKSYD 120

Query: 147 WFDFPGL---DDISEARPATLVKDTEMTAPTPTTILYPTSGGNIHSFKAITPCAIFDILS 206
           W + P L   +D S+ARPA LVKDTEMTA +P T LYP SGGNIH FKAIT CAI DIL+
Sbjct: 121 WLE-PQLTEPEDPSQARPAKLVKDTEMTAQSPVTTLYPKSGGNIHCFKAITHCAILDILA 180

Query: 207 PPYSSEHGRHCTYFRKSHRKDLPGDLELDGDGVSVSELTWLEEYQPPDNLVIRRGLYKGP 264
           PPYSSEH RHCTYFRKS R+DLPG+LE+DG+   V+++TWLEE+QPPD+ VIRR  Y+GP
Sbjct: 181 PPYSSEHDRHCTYFRKSRREDLPGELEVDGE--VVTDVTWLEEFQPPDDFVIRRIPYRGP 237

BLAST of Cp4.1LG05g15090 vs. Swiss-Prot
Match: PCO1_ARATH (Plant cysteine oxidase 1 OS=Arabidopsis thaliana GN=PCO1 PE=1 SV=1)

HSP 1 Score: 212.2 bits (539), Expect = 7.1e-54
Identity = 112/243 (46.09%), Postives = 148/243 (60.91%), Query Frame = 1

Query: 31  IQRLYNTCKSALSPNGP---VSEEALEKVRAMLEKVKPSDVGLEQEAQVVRNWQEPVQER 90
           ++RL+NTCK   S  GP    SE+ ++++R +L+ +KP DVGL       R     V+ R
Sbjct: 58  VRRLFNTCKEVFSNGGPGVIPSEDKIQQLREILDDMKPEDVGLTPTMPYFRP-NSGVEAR 117

Query: 91  NGRRQPLPPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYDW 150
           +      PPI YLHLH+CD FSIGIFC+PP+ +IP HNHPGMTV SKL++G++H+KSYDW
Sbjct: 118 SS-----PPITYLHLHQCDQFSIGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKSYDW 177

Query: 151 -FDFPGLDDISEARPATLVKDTEMTAPTPTTILYPTSGGNIHSFKAITPCAIFDILSPPY 210
             D P  D  S+ R A L  D+  TAP   +ILYP  GGN+H F AIT CA+ D+L PPY
Sbjct: 178 VVDAPMRD--SKTRLAKLKVDSTFTAPCNASILYPEDGGNMHRFTAITACAVLDVLGPPY 237

Query: 211 SSEHGRHCTYFRKSHRKDLPGDLELDGDGVSVSE----LTWLEEY--QPPDNLVIRRGLY 264
            +  GRHCTYF +     L  +   D D +S  E      WL+E    P D+  +   LY
Sbjct: 238 CNPEGRHCTYFLEFPLDKLSSE---DDDVLSSEEEKEGYAWLQERDDNPEDHTNVVGALY 289

BLAST of Cp4.1LG05g15090 vs. Swiss-Prot
Match: PCO2_ARATH (Plant cysteine oxidase 2 OS=Arabidopsis thaliana GN=PCO2 PE=1 SV=1)

HSP 1 Score: 188.7 bits (478), Expect = 8.4e-47
Identity = 102/243 (41.98%), Postives = 146/243 (60.08%), Query Frame = 1

Query: 31  IQRLYNTCKSALSP--NGPV-SEEALEKVRAMLEKVKPSDVGLEQEAQVVRNWQEPVQER 90
           +Q+L++TCK   +   +G V S+E +E +RA+L+++KP DVG+  +    R+        
Sbjct: 47  VQKLFDTCKKVFADGKSGTVPSQENIEMLRAVLDEIKPEDVGVNPKMSYFRSTV------ 106

Query: 91  NGRRQPLPPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYDW 150
            GR    P + YLH++ C  FSI IFC+PP+ +IP HNHP MTV SKL++G++H+KSYDW
Sbjct: 107 TGRS---PLVTYLHIYACHRFSICIFCLPPSGVIPLHNHPEMTVFSKLLFGTMHIKSYDW 166

Query: 151 F-DFPGLDDISEARPATLVKDTEMTAPTPTTILYPTSGGNIHSFKAITPCAIFDILSPPY 210
             D P     S+ R A +  D++ TAP  T+ILYP  GGN+H F A T CA+ D++ PPY
Sbjct: 167 VPDSP--QPSSDTRLAKVKVDSDFTAPCDTSILYPADGGNMHCFTAKTACAVLDVIGPPY 226

Query: 211 SSEHGRHCTYFRKSHRKDLPGDLELDGDGVSVSE-----LTWLEE-YQPPDNLVIRRGLY 264
           S   GRHCTY+      D P       DGV V+E       WL+E  + P++L +   +Y
Sbjct: 227 SDPAGRHCTYY-----FDYPFS-SFSVDGVVVAEEEKEGYAWLKEREEKPEDLTVTALMY 272

BLAST of Cp4.1LG05g15090 vs. Swiss-Prot
Match: PCO3_ARATH (Plant cysteine oxidase 3 OS=Arabidopsis thaliana GN=PCO3 PE=1 SV=1)

HSP 1 Score: 180.3 bits (456), Expect = 3.0e-44
Identity = 99/249 (39.76%), Postives = 138/249 (55.42%), Query Frame = 1

Query: 31  IQRLYNTCKSALSPNGPV-SEEALEKVRAMLEKVKPSDVGLEQEAQVVRNWQ--EPVQER 90
           +Q LY+ CK   +   P  +  A++K+ ++L+ V P+DVGLE+ +Q          V   
Sbjct: 35  VQELYDLCKETFTGKAPSPASMAIQKLCSVLDSVSPADVGLEEVSQDDDRGYGVSGVSRF 94

Query: 91  NGRRQPLPPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYDW 150
           N   +   PI +L +HECD+F++ IFC P +S+IP H+HP M V SK++YGSLHVK+YDW
Sbjct: 95  NRVGRWAQPITFLDIHECDTFTMCIFCFPTSSVIPLHDHPEMAVFSKILYGSLHVKAYDW 154

Query: 151 FDFP-------GLDDISEARPATLVKDTEMTAPTPTTILYPTSGGNIHSFKAITPCAIFD 210
            + P       G+     AR A LV D  +T  +    LYP +GGN+H F A+TPCA+ D
Sbjct: 155 VEPPCIITQDKGVPGSLPARLAKLVSDKVITPQSEIPALYPKTGGNLHCFTALTPCAVLD 214

Query: 211 ILSPPYSSEHGRHCTYFRKSHRKDLP-GDLELDG-----DGVSVSELTWLEEYQPPDNLV 264
           ILSPPY    GR C+Y+      D P     L+      D     E  WL +   PD+L 
Sbjct: 215 ILSPPYKESVGRSCSYY-----MDYPFSTFALENGMKKVDEGKEDEYAWLVQIDTPDDLH 274

BLAST of Cp4.1LG05g15090 vs. TrEMBL
Match: A0A0A0KDT5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G157630 PE=4 SV=1)

HSP 1 Score: 464.2 bits (1193), Expect = 1.2e-127
Identity = 218/239 (91.21%), Postives = 225/239 (94.14%), Query Frame = 1

Query: 27  MPYYIQRLYNTCKSALSPNGPVSEEALEKVRAMLEKVKPSDVGLEQEAQVVRNWQEPVQE 86
           MPYYIQRLYNTCK+ALSPNGPVSEEALEKVRAMLEK+KPSDVGLEQEAQVVRNW  PVQE
Sbjct: 1   MPYYIQRLYNTCKAALSPNGPVSEEALEKVRAMLEKIKPSDVGLEQEAQVVRNWSGPVQE 60

Query: 87  RNGRRQPLPPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYD 146
           RNGRRQ  PPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYD
Sbjct: 61  RNGRRQSFPPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYD 120

Query: 147 WFDFPGLDDISEARPATLVKDTEMTAPTPTTILYPTSGGNIHSFKAITPCAIFDILSPPY 206
           W D PGLDDISEARPA LVKDTEMTAPTPTT+LYPTSGGNIHSF+AITPCAIFDILSPPY
Sbjct: 121 WVDLPGLDDISEARPAMLVKDTEMTAPTPTTVLYPTSGGNIHSFRAITPCAIFDILSPPY 180

Query: 207 SSEHGRHCTYFRKSHRKDLPGDLEL--DGDGVSVSELTWLEEYQPPDNLVIRRGLYKGP 264
           SSEHGRHCTYFRKS RKDLPGD +L  DGDG SVSE+TWLEE+QPPDN VIRRG YKGP
Sbjct: 181 SSEHGRHCTYFRKSPRKDLPGDFQLDGDGDGDSVSEVTWLEEFQPPDNFVIRRGQYKGP 239

BLAST of Cp4.1LG05g15090 vs. TrEMBL
Match: C6TMW7_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_06G276400 PE=2 SV=1)

HSP 1 Score: 406.8 bits (1044), Expect = 2.2e-110
Identity = 186/237 (78.48%), Postives = 210/237 (88.61%), Query Frame = 1

Query: 27  MPYYIQRLYNTCKSALSPNGPVSEEALEKVRAMLEKVKPSDVGLEQEAQVVRNWQEPVQE 86
           MPYYIQRLY  C ++ SPNGP SEEA+EKVR  LE++KPSDVGLEQEAQVVRNW   + E
Sbjct: 1   MPYYIQRLYRLCNASFSPNGPASEEAIEKVREKLERIKPSDVGLEQEAQVVRNWSGSMLE 60

Query: 87  RNGRRQPLPPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYD 146
            NG  Q LPPIKYLHLHECDSFSIGIFCMPP+SIIP HNHPGMTVLSKL+YGS++VKSYD
Sbjct: 61  HNGSHQSLPPIKYLHLHECDSFSIGIFCMPPSSIIPLHNHPGMTVLSKLLYGSMYVKSYD 120

Query: 147 WFDFPGLDDISEARPATLVKDTEMTAPTPTTILYPTSGGNIHSFKAITPCAIFDILSPPY 206
           W D PG +D SEARPA LVKDTEMTAP+PTT+LYPTSGGNIH F+AITPCAIFDILSPPY
Sbjct: 121 WIDAPGSNDPSEARPAKLVKDTEMTAPSPTTVLYPTSGGNIHCFRAITPCAIFDILSPPY 180

Query: 207 SSEHGRHCTYFRKSHRKDLPGDLELDGDGVSVSELTWLEEYQPPDNLVIRRGLYKGP 264
           SS+HGRHCTYFR+S RKDLP +++L  +GV+VSE+TWLEE+QPPDN VIRRGLY+GP
Sbjct: 181 SSDHGRHCTYFRRSQRKDLPVNVQL--NGVTVSEVTWLEEFQPPDNFVIRRGLYRGP 235

BLAST of Cp4.1LG05g15090 vs. TrEMBL
Match: A0A0B2RH57_GLYSO (2-aminoethanethiol dioxygenase OS=Glycine soja GN=glysoja_036753 PE=4 SV=1)

HSP 1 Score: 406.8 bits (1044), Expect = 2.2e-110
Identity = 186/237 (78.48%), Postives = 210/237 (88.61%), Query Frame = 1

Query: 27  MPYYIQRLYNTCKSALSPNGPVSEEALEKVRAMLEKVKPSDVGLEQEAQVVRNWQEPVQE 86
           MPYYIQRLY  C ++ SPNGP SEEA+EKVR  LE++KPSDVGLEQEAQVVRNW   + E
Sbjct: 1   MPYYIQRLYRLCNASFSPNGPASEEAIEKVREKLERIKPSDVGLEQEAQVVRNWSGSMLE 60

Query: 87  RNGRRQPLPPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYD 146
            NG  Q LPPIKYLHLHECDSFSIGIFCMPP+SIIP HNHPGMTVLSKL+YGS++VKSYD
Sbjct: 61  HNGSHQSLPPIKYLHLHECDSFSIGIFCMPPSSIIPLHNHPGMTVLSKLLYGSMYVKSYD 120

Query: 147 WFDFPGLDDISEARPATLVKDTEMTAPTPTTILYPTSGGNIHSFKAITPCAIFDILSPPY 206
           W D PG +D SEARPA LVKDTEMTAP+PTT+LYPTSGGNIH F+AITPCAIFDILSPPY
Sbjct: 121 WIDAPGSNDPSEARPAKLVKDTEMTAPSPTTVLYPTSGGNIHCFRAITPCAIFDILSPPY 180

Query: 207 SSEHGRHCTYFRKSHRKDLPGDLELDGDGVSVSELTWLEEYQPPDNLVIRRGLYKGP 264
           SS+HGRHCTYFR+S RKDLP +++L  +GV+VSE+TWLEE+QPPDN VIRRGLY+GP
Sbjct: 181 SSDHGRHCTYFRRSQRKDLPVNVQL--NGVTVSEVTWLEEFQPPDNFVIRRGLYRGP 235

BLAST of Cp4.1LG05g15090 vs. TrEMBL
Match: A0A061DLA3_THECC (Uncharacterized protein isoform 2 OS=Theobroma cacao GN=TCM_001727 PE=4 SV=1)

HSP 1 Score: 405.2 bits (1040), Expect = 6.4e-110
Identity = 186/237 (78.48%), Postives = 209/237 (88.19%), Query Frame = 1

Query: 27  MPYYIQRLYNTCKSALSPNGPVSEEALEKVRAMLEKVKPSDVGLEQEAQVVRNWQEPVQE 86
           MPYYIQRLY TC+ + SPNGPVSEEALE+VRAMLEK+KPSDVGLEQEAQVVRNW  PV E
Sbjct: 1   MPYYIQRLYKTCRESFSPNGPVSEEALERVRAMLEKMKPSDVGLEQEAQVVRNWSGPVHE 60

Query: 87  RNGRRQPLPPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYD 146
           RNG  Q LPPIKYLHLHECDSFSIGIFCMPP+S+IP HNHPGMTVLS+LIYGSLHVKSYD
Sbjct: 61  RNGSHQSLPPIKYLHLHECDSFSIGIFCMPPSSMIPLHNHPGMTVLSRLIYGSLHVKSYD 120

Query: 147 WFDFPGLDDISEARPATLVKDTEMTAPTPTTILYPTSGGNIHSFKAITPCAIFDILSPPY 206
           W D    +D  +ARPA LV+D EMTAP  TT+LYPTSGGNIH F+A TPCA+FDILSPPY
Sbjct: 121 WLDSTEPEDPLQARPAKLVRDDEMTAPCATTVLYPTSGGNIHCFRARTPCALFDILSPPY 180

Query: 207 SSEHGRHCTYFRKSHRKDLPGDLELDGDGVSVSELTWLEEYQPPDNLVIRRGLYKGP 264
           SSEHGRHCTYFR+S R+DLPG++E+  +GV+ SE+TWLEE+QPPDN VIRRGLY+GP
Sbjct: 181 SSEHGRHCTYFRRSPRRDLPGEIEV--NGVTYSEMTWLEEFQPPDNFVIRRGLYRGP 235

BLAST of Cp4.1LG05g15090 vs. TrEMBL
Match: A0A0D2V1M6_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_012G046000 PE=4 SV=1)

HSP 1 Score: 405.2 bits (1040), Expect = 6.4e-110
Identity = 187/237 (78.90%), Postives = 210/237 (88.61%), Query Frame = 1

Query: 27  MPYYIQRLYNTCKSALSPNGPVSEEALEKVRAMLEKVKPSDVGLEQEAQVVRNWQEPVQE 86
           MPYYIQRLYNTC++A SPNGPVS+EALE+VR MLEK+KPSDVGLEQEAQVVRNW  PV E
Sbjct: 1   MPYYIQRLYNTCRAAFSPNGPVSDEALERVRTMLEKMKPSDVGLEQEAQVVRNWSGPVSE 60

Query: 87  RNGRRQPLPPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYD 146
           RNG  Q LPPIKYLHLHECDSFSIGIFCMPP+S+IP HNHPGMTVLS+LIYGSLHVKSYD
Sbjct: 61  RNGTHQSLPPIKYLHLHECDSFSIGIFCMPPSSLIPLHNHPGMTVLSRLIYGSLHVKSYD 120

Query: 147 WFDFPGLDDISEARPATLVKDTEMTAPTPTTILYPTSGGNIHSFKAITPCAIFDILSPPY 206
           W D    +D  +AR A LVKDTEMTAP  TT+LYPTSGGNIH F+A TPCAIFDILSPPY
Sbjct: 121 WLDPTEPEDPLQARAAKLVKDTEMTAPCATTVLYPTSGGNIHCFRARTPCAIFDILSPPY 180

Query: 207 SSEHGRHCTYFRKSHRKDLPGDLELDGDGVSVSELTWLEEYQPPDNLVIRRGLYKGP 264
           SSEHGRHCTYFR+S R+DLPG++E++G+  + SE+TWLEE+QPPD+ VIRRGLYKGP
Sbjct: 181 SSEHGRHCTYFRRSPRRDLPGEIEMNGE--TFSEVTWLEEFQPPDDFVIRRGLYKGP 235

BLAST of Cp4.1LG05g15090 vs. TAIR10
Match: AT3G58670.1 (AT3G58670.1 Protein of unknown function (DUF1637))

HSP 1 Score: 371.3 bits (952), Expect = 5.2e-103
Identity = 177/239 (74.06%), Postives = 198/239 (82.85%), Query Frame = 1

Query: 27  MPYYIQRLYNTCKSALSPNGPVSEEALEKVRAMLEKVKPSDVGLEQEAQVVRNWQEPVQE 86
           MPY+IQRL+NTCKS+LSPNGPVSEEAL+KVR +LEK+KPSDVGLEQEAQ+VRNW  P  E
Sbjct: 1   MPYFIQRLFNTCKSSLSPNGPVSEEALDKVRNVLEKIKPSDVGLEQEAQLVRNWPGPGNE 60

Query: 87  RNGRRQPLPPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYD 146
           RNG    LP IKYL LHECDSFSIGIFCMPP SIIP HNHPGMTVLSKL+YGS+HVKSYD
Sbjct: 61  RNGNHHSLPAIKYLQLHECDSFSIGIFCMPPGSIIPLHNHPGMTVLSKLVYGSMHVKSYD 120

Query: 147 WF--DFPGLDDISEARPATLVKDTEMTAPTPTTILYPTSGGNIHSFKAITPCAIFDILSP 206
           W   D   LDD  +ARPA LVKD +MT+P+P T LYPT+GGNIH FKAIT CAIFDILSP
Sbjct: 121 WAEPDQSELDDPLQARPAKLVKDIDMTSPSPATTLYPTTGGNIHCFKAITHCAIFDILSP 180

Query: 207 PYSSEHGRHCTYFRKSHRKDLPGDLELDGDGVSVSELTWLEEYQPPDNLVIRRGLYKGP 264
           PYSS HGRHC YFRKS   DLPG++E+  +G  +S +TWLEEYQPPDN VI R  Y+GP
Sbjct: 181 PYSSTHGRHCNYFRKSPMLDLPGEIEV-MNGEVISNVTWLEEYQPPDNFVIWRVPYRGP 238

BLAST of Cp4.1LG05g15090 vs. TAIR10
Match: AT2G42670.2 (AT2G42670.2 Protein of unknown function (DUF1637))

HSP 1 Score: 359.0 bits (920), Expect = 2.7e-99
Identity = 169/241 (70.12%), Postives = 200/241 (82.99%), Query Frame = 1

Query: 27  MPYYIQRLYNTCKSALSPNGPVSEEALEKVRAMLEKVKPSDVGLEQEAQVVRNWQEPVQE 86
           MPY+ QRLYNTCK++ S +GP++E+ALEKVR +LEK+KPSDVG+EQ+AQ+ R+   P+ E
Sbjct: 1   MPYFAQRLYNTCKASFSSDGPITEDALEKVRNVLEKIKPSDVGIEQDAQLARSRSGPLNE 60

Query: 87  RNGRRQPLPPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYD 146
           RNG  Q  P IKYLHLHECDSFSIGIFCMPP+S+IP HNHPGMTVLSKL+YGS+HVKSYD
Sbjct: 61  RNGSNQSPPAIKYLHLHECDSFSIGIFCMPPSSMIPLHNHPGMTVLSKLVYGSMHVKSYD 120

Query: 147 WFDFPGL----DDISEARPATLVKDTEMTAPTPTTILYPTSGGNIHSFKAITPCAIFDIL 206
           W + P L    D   EARPA LVKDTEMTA +P T LYP SGGNIH FKAIT CAI DIL
Sbjct: 121 WLE-PQLTEPEDPSQEARPAKLVKDTEMTAQSPVTTLYPKSGGNIHCFKAITHCAILDIL 180

Query: 207 SPPYSSEHGRHCTYFRKSHRKDLPGDLELDGDGVSVSELTWLEEYQPPDNLVIRRGLYKG 264
           +PPYSSEH RHCTYFRKS R+DLPG+LE+DG+   V+++TWLEE+QPPD+ VIRR  Y+G
Sbjct: 181 APPYSSEHDRHCTYFRKSRREDLPGELEVDGE--VVTDVTWLEEFQPPDDFVIRRIPYRG 238

BLAST of Cp4.1LG05g15090 vs. TAIR10
Match: AT5G15120.1 (AT5G15120.1 Protein of unknown function (DUF1637))

HSP 1 Score: 212.2 bits (539), Expect = 4.0e-55
Identity = 112/243 (46.09%), Postives = 148/243 (60.91%), Query Frame = 1

Query: 31  IQRLYNTCKSALSPNGP---VSEEALEKVRAMLEKVKPSDVGLEQEAQVVRNWQEPVQER 90
           ++RL+NTCK   S  GP    SE+ ++++R +L+ +KP DVGL       R     V+ R
Sbjct: 58  VRRLFNTCKEVFSNGGPGVIPSEDKIQQLREILDDMKPEDVGLTPTMPYFRP-NSGVEAR 117

Query: 91  NGRRQPLPPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYDW 150
           +      PPI YLHLH+CD FSIGIFC+PP+ +IP HNHPGMTV SKL++G++H+KSYDW
Sbjct: 118 SS-----PPITYLHLHQCDQFSIGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKSYDW 177

Query: 151 -FDFPGLDDISEARPATLVKDTEMTAPTPTTILYPTSGGNIHSFKAITPCAIFDILSPPY 210
             D P  D  S+ R A L  D+  TAP   +ILYP  GGN+H F AIT CA+ D+L PPY
Sbjct: 178 VVDAPMRD--SKTRLAKLKVDSTFTAPCNASILYPEDGGNMHRFTAITACAVLDVLGPPY 237

Query: 211 SSEHGRHCTYFRKSHRKDLPGDLELDGDGVSVSE----LTWLEEY--QPPDNLVIRRGLY 264
            +  GRHCTYF +     L  +   D D +S  E      WL+E    P D+  +   LY
Sbjct: 238 CNPEGRHCTYFLEFPLDKLSSE---DDDVLSSEEEKEGYAWLQERDDNPEDHTNVVGALY 289

BLAST of Cp4.1LG05g15090 vs. TAIR10
Match: AT5G39890.1 (AT5G39890.1 Protein of unknown function (DUF1637))

HSP 1 Score: 188.7 bits (478), Expect = 4.8e-48
Identity = 102/243 (41.98%), Postives = 146/243 (60.08%), Query Frame = 1

Query: 31  IQRLYNTCKSALSP--NGPV-SEEALEKVRAMLEKVKPSDVGLEQEAQVVRNWQEPVQER 90
           +Q+L++TCK   +   +G V S+E +E +RA+L+++KP DVG+  +    R+        
Sbjct: 47  VQKLFDTCKKVFADGKSGTVPSQENIEMLRAVLDEIKPEDVGVNPKMSYFRSTV------ 106

Query: 91  NGRRQPLPPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYDW 150
            GR    P + YLH++ C  FSI IFC+PP+ +IP HNHP MTV SKL++G++H+KSYDW
Sbjct: 107 TGRS---PLVTYLHIYACHRFSICIFCLPPSGVIPLHNHPEMTVFSKLLFGTMHIKSYDW 166

Query: 151 F-DFPGLDDISEARPATLVKDTEMTAPTPTTILYPTSGGNIHSFKAITPCAIFDILSPPY 210
             D P     S+ R A +  D++ TAP  T+ILYP  GGN+H F A T CA+ D++ PPY
Sbjct: 167 VPDSP--QPSSDTRLAKVKVDSDFTAPCDTSILYPADGGNMHCFTAKTACAVLDVIGPPY 226

Query: 211 SSEHGRHCTYFRKSHRKDLPGDLELDGDGVSVSE-----LTWLEE-YQPPDNLVIRRGLY 264
           S   GRHCTY+      D P       DGV V+E       WL+E  + P++L +   +Y
Sbjct: 227 SDPAGRHCTYY-----FDYPFS-SFSVDGVVVAEEEKEGYAWLKEREEKPEDLTVTALMY 272

BLAST of Cp4.1LG05g15090 vs. TAIR10
Match: AT1G18490.1 (AT1G18490.1 Protein of unknown function (DUF1637))

HSP 1 Score: 180.3 bits (456), Expect = 1.7e-45
Identity = 99/249 (39.76%), Postives = 138/249 (55.42%), Query Frame = 1

Query: 31  IQRLYNTCKSALSPNGPV-SEEALEKVRAMLEKVKPSDVGLEQEAQVVRNWQ--EPVQER 90
           +Q LY+ CK   +   P  +  A++K+ ++L+ V P+DVGLE+ +Q          V   
Sbjct: 35  VQELYDLCKETFTGKAPSPASMAIQKLCSVLDSVSPADVGLEEVSQDDDRGYGVSGVSRF 94

Query: 91  NGRRQPLPPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYDW 150
           N   +   PI +L +HECD+F++ IFC P +S+IP H+HP M V SK++YGSLHVK+YDW
Sbjct: 95  NRVGRWAQPITFLDIHECDTFTMCIFCFPTSSVIPLHDHPEMAVFSKILYGSLHVKAYDW 154

Query: 151 FDFP-------GLDDISEARPATLVKDTEMTAPTPTTILYPTSGGNIHSFKAITPCAIFD 210
            + P       G+     AR A LV D  +T  +    LYP +GGN+H F A+TPCA+ D
Sbjct: 155 VEPPCIITQDKGVPGSLPARLAKLVSDKVITPQSEIPALYPKTGGNLHCFTALTPCAVLD 214

Query: 211 ILSPPYSSEHGRHCTYFRKSHRKDLP-GDLELDG-----DGVSVSELTWLEEYQPPDNLV 264
           ILSPPY    GR C+Y+      D P     L+      D     E  WL +   PD+L 
Sbjct: 215 ILSPPYKESVGRSCSYY-----MDYPFSTFALENGMKKVDEGKEDEYAWLVQIDTPDDLH 274

BLAST of Cp4.1LG05g15090 vs. NCBI nr
Match: gi|659094799|ref|XP_008448247.1| (PREDICTED: probable 2-aminoethanethiol dioxygenase isoform X1 [Cucumis melo])

HSP 1 Score: 468.8 bits (1205), Expect = 6.7e-129
Identity = 218/237 (91.98%), Postives = 224/237 (94.51%), Query Frame = 1

Query: 27  MPYYIQRLYNTCKSALSPNGPVSEEALEKVRAMLEKVKPSDVGLEQEAQVVRNWQEPVQE 86
           MPYYIQRLYNTCK+ALSPNGPVSEEALEKVRAMLEK+KPSDVGLEQEAQVVRNW  PVQE
Sbjct: 1   MPYYIQRLYNTCKAALSPNGPVSEEALEKVRAMLEKIKPSDVGLEQEAQVVRNWSGPVQE 60

Query: 87  RNGRRQPLPPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYD 146
           RNGRRQ  PPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYD
Sbjct: 61  RNGRRQSFPPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYD 120

Query: 147 WFDFPGLDDISEARPATLVKDTEMTAPTPTTILYPTSGGNIHSFKAITPCAIFDILSPPY 206
           W D PGLDDISEARPA LVKDTEMTAPTPTT+LYPTSGGNIHSF+AITPCAIFDILSPPY
Sbjct: 121 WVDLPGLDDISEARPAVLVKDTEMTAPTPTTVLYPTSGGNIHSFRAITPCAIFDILSPPY 180

Query: 207 SSEHGRHCTYFRKSHRKDLPGDLELDGDGVSVSELTWLEEYQPPDNLVIRRGLYKGP 264
           SSEHGRHCTYFRKS RKDLPGD + DGDG SVSE+TWLEEYQPPDN VIRRG YKGP
Sbjct: 181 SSEHGRHCTYFRKSPRKDLPGDFQSDGDGDSVSEVTWLEEYQPPDNFVIRRGQYKGP 237

BLAST of Cp4.1LG05g15090 vs. NCBI nr
Match: gi|449463110|ref|XP_004149277.1| (PREDICTED: probable 2-aminoethanethiol dioxygenase isoform X1 [Cucumis sativus])

HSP 1 Score: 464.2 bits (1193), Expect = 1.7e-127
Identity = 218/239 (91.21%), Postives = 225/239 (94.14%), Query Frame = 1

Query: 27  MPYYIQRLYNTCKSALSPNGPVSEEALEKVRAMLEKVKPSDVGLEQEAQVVRNWQEPVQE 86
           MPYYIQRLYNTCK+ALSPNGPVSEEALEKVRAMLEK+KPSDVGLEQEAQVVRNW  PVQE
Sbjct: 1   MPYYIQRLYNTCKAALSPNGPVSEEALEKVRAMLEKIKPSDVGLEQEAQVVRNWSGPVQE 60

Query: 87  RNGRRQPLPPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYD 146
           RNGRRQ  PPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYD
Sbjct: 61  RNGRRQSFPPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYD 120

Query: 147 WFDFPGLDDISEARPATLVKDTEMTAPTPTTILYPTSGGNIHSFKAITPCAIFDILSPPY 206
           W D PGLDDISEARPA LVKDTEMTAPTPTT+LYPTSGGNIHSF+AITPCAIFDILSPPY
Sbjct: 121 WVDLPGLDDISEARPAMLVKDTEMTAPTPTTVLYPTSGGNIHSFRAITPCAIFDILSPPY 180

Query: 207 SSEHGRHCTYFRKSHRKDLPGDLEL--DGDGVSVSELTWLEEYQPPDNLVIRRGLYKGP 264
           SSEHGRHCTYFRKS RKDLPGD +L  DGDG SVSE+TWLEE+QPPDN VIRRG YKGP
Sbjct: 181 SSEHGRHCTYFRKSPRKDLPGDFQLDGDGDGDSVSEVTWLEEFQPPDNFVIRRGQYKGP 239

BLAST of Cp4.1LG05g15090 vs. NCBI nr
Match: gi|659094805|ref|XP_008448250.1| (PREDICTED: probable 2-aminoethanethiol dioxygenase isoform X2 [Cucumis melo])

HSP 1 Score: 454.9 bits (1169), Expect = 1.0e-124
Identity = 214/237 (90.30%), Postives = 220/237 (92.83%), Query Frame = 1

Query: 27  MPYYIQRLYNTCKSALSPNGPVSEEALEKVRAMLEKVKPSDVGLEQEAQVVRNWQEPVQE 86
           MPYYIQRLYNTCK+ALSPNGPVSEEALEKVRAMLEK+KPSDVGLEQEAQVVRNW  PVQE
Sbjct: 1   MPYYIQRLYNTCKAALSPNGPVSEEALEKVRAMLEKIKPSDVGLEQEAQVVRNWSGPVQE 60

Query: 87  RNGRRQPLPPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYD 146
           RNGRRQ  PPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYD
Sbjct: 61  RNGRRQSFPPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYD 120

Query: 147 WFDFPGLDDISEARPATLVKDTEMTAPTPTTILYPTSGGNIHSFKAITPCAIFDILSPPY 206
           W D PGLDDISE     LVKDTEMTAPTPTT+LYPTSGGNIHSF+AITPCAIFDILSPPY
Sbjct: 121 WVDLPGLDDISE----VLVKDTEMTAPTPTTVLYPTSGGNIHSFRAITPCAIFDILSPPY 180

Query: 207 SSEHGRHCTYFRKSHRKDLPGDLELDGDGVSVSELTWLEEYQPPDNLVIRRGLYKGP 264
           SSEHGRHCTYFRKS RKDLPGD + DGDG SVSE+TWLEEYQPPDN VIRRG YKGP
Sbjct: 181 SSEHGRHCTYFRKSPRKDLPGDFQSDGDGDSVSEVTWLEEYQPPDNFVIRRGQYKGP 233

BLAST of Cp4.1LG05g15090 vs. NCBI nr
Match: gi|778713674|ref|XP_011657089.1| (PREDICTED: probable 2-aminoethanethiol dioxygenase isoform X2 [Cucumis sativus])

HSP 1 Score: 451.1 bits (1159), Expect = 1.5e-123
Identity = 214/239 (89.54%), Postives = 221/239 (92.47%), Query Frame = 1

Query: 27  MPYYIQRLYNTCKSALSPNGPVSEEALEKVRAMLEKVKPSDVGLEQEAQVVRNWQEPVQE 86
           MPYYIQRLYNTCK+ALSPNGPVSEEALEKVRAMLEK+KPSDVGLEQEAQVVRNW  PVQE
Sbjct: 1   MPYYIQRLYNTCKAALSPNGPVSEEALEKVRAMLEKIKPSDVGLEQEAQVVRNWSGPVQE 60

Query: 87  RNGRRQPLPPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYD 146
           RNGRRQ  PPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYD
Sbjct: 61  RNGRRQSFPPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYD 120

Query: 147 WFDFPGLDDISEARPATLVKDTEMTAPTPTTILYPTSGGNIHSFKAITPCAIFDILSPPY 206
           W D PGLDDISE     LVKDTEMTAPTPTT+LYPTSGGNIHSF+AITPCAIFDILSPPY
Sbjct: 121 WVDLPGLDDISEV----LVKDTEMTAPTPTTVLYPTSGGNIHSFRAITPCAIFDILSPPY 180

Query: 207 SSEHGRHCTYFRKSHRKDLPGDLEL--DGDGVSVSELTWLEEYQPPDNLVIRRGLYKGP 264
           SSEHGRHCTYFRKS RKDLPGD +L  DGDG SVSE+TWLEE+QPPDN VIRRG YKGP
Sbjct: 181 SSEHGRHCTYFRKSPRKDLPGDFQLDGDGDGDSVSEVTWLEEFQPPDNFVIRRGQYKGP 235

BLAST of Cp4.1LG05g15090 vs. NCBI nr
Match: gi|1009167310|ref|XP_015902048.1| (PREDICTED: plant cysteine oxidase 4-like [Ziziphus jujuba])

HSP 1 Score: 417.9 bits (1073), Expect = 1.4e-113
Identity = 198/237 (83.54%), Postives = 210/237 (88.61%), Query Frame = 1

Query: 27  MPYYIQRLYNTCKSALSPNGPVSEEALEKVRAMLEKVKPSDVGLEQEAQVVRNWQEPVQE 86
           MPYYIQRLYNTC++A SPNGPVSEEA+EKVRAML+K+KPSDVGLEQEAQ VRNW   V E
Sbjct: 1   MPYYIQRLYNTCRAAFSPNGPVSEEAIEKVRAMLDKIKPSDVGLEQEAQWVRNWSGTVHE 60

Query: 87  RNGRRQPLPPIKYLHLHECDSFSIGIFCMPPTSIIPFHNHPGMTVLSKLIYGSLHVKSYD 146
           RNG  Q LPPIKYLHLHECDSFSIGIFCMPP+SIIP HNHPGMTVLSKLIYGSL V SYD
Sbjct: 61  RNGGPQSLPPIKYLHLHECDSFSIGIFCMPPSSIIPLHNHPGMTVLSKLIYGSLFVNSYD 120

Query: 147 WFDFPGLDDISEARPATLVKDTEMTAPTPTTILYPTSGGNIHSFKAITPCAIFDILSPPY 206
           W D P   D  EARPA LVKDTEMTAP PTTILYPTSGGNIHSF+A TPCAIFDIL+PPY
Sbjct: 121 WLDLPEPADHLEARPAKLVKDTEMTAPCPTTILYPTSGGNIHSFRAQTPCAIFDILAPPY 180

Query: 207 SSEHGRHCTYFRKSHRKDLPGDLELDGDGVSVSELTWLEEYQPPDNLVIRRGLYKGP 264
           SSEHGRHCTYFR+S R+DLPGDLEL   GV+VSE+TWLEEYQPPDN VIRRGLYKGP
Sbjct: 181 SSEHGRHCTYFRRSPRRDLPGDLEL--GGVTVSEVTWLEEYQPPDNFVIRRGLYKGP 235

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PCO5_ARATH9.2e-10274.06Plant cysteine oxidase 5 OS=Arabidopsis thaliana GN=PCO5 PE=1 SV=1[more]
PCO4_ARATH1.6e-9870.42Plant cysteine oxidase 4 OS=Arabidopsis thaliana GN=PCO4 PE=1 SV=2[more]
PCO1_ARATH7.1e-5446.09Plant cysteine oxidase 1 OS=Arabidopsis thaliana GN=PCO1 PE=1 SV=1[more]
PCO2_ARATH8.4e-4741.98Plant cysteine oxidase 2 OS=Arabidopsis thaliana GN=PCO2 PE=1 SV=1[more]
PCO3_ARATH3.0e-4439.76Plant cysteine oxidase 3 OS=Arabidopsis thaliana GN=PCO3 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0KDT5_CUCSA1.2e-12791.21Uncharacterized protein OS=Cucumis sativus GN=Csa_6G157630 PE=4 SV=1[more]
C6TMW7_SOYBN2.2e-11078.48Uncharacterized protein OS=Glycine max GN=GLYMA_06G276400 PE=2 SV=1[more]
A0A0B2RH57_GLYSO2.2e-11078.482-aminoethanethiol dioxygenase OS=Glycine soja GN=glysoja_036753 PE=4 SV=1[more]
A0A061DLA3_THECC6.4e-11078.48Uncharacterized protein isoform 2 OS=Theobroma cacao GN=TCM_001727 PE=4 SV=1[more]
A0A0D2V1M6_GOSRA6.4e-11078.90Uncharacterized protein OS=Gossypium raimondii GN=B456_012G046000 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT3G58670.15.2e-10374.06 Protein of unknown function (DUF1637)[more]
AT2G42670.22.7e-9970.12 Protein of unknown function (DUF1637)[more]
AT5G15120.14.0e-5546.09 Protein of unknown function (DUF1637)[more]
AT5G39890.14.8e-4841.98 Protein of unknown function (DUF1637)[more]
AT1G18490.11.7e-4539.76 Protein of unknown function (DUF1637)[more]
Match NameE-valueIdentityDescription
gi|659094799|ref|XP_008448247.1|6.7e-12991.98PREDICTED: probable 2-aminoethanethiol dioxygenase isoform X1 [Cucumis melo][more]
gi|449463110|ref|XP_004149277.1|1.7e-12791.21PREDICTED: probable 2-aminoethanethiol dioxygenase isoform X1 [Cucumis sativus][more]
gi|659094805|ref|XP_008448250.1|1.0e-12490.30PREDICTED: probable 2-aminoethanethiol dioxygenase isoform X2 [Cucumis melo][more]
gi|778713674|ref|XP_011657089.1|1.5e-12389.54PREDICTED: probable 2-aminoethanethiol dioxygenase isoform X2 [Cucumis sativus][more]
gi|1009167310|ref|XP_015902048.1|1.4e-11383.54PREDICTED: plant cysteine oxidase 4-like [Ziziphus jujuba][more]
The following terms have been associated with this gene:
Vocabulary: Biological Process
TermDefinition
GO:0055114oxidation-reduction process
Vocabulary: Molecular Function
TermDefinition
GO:0016702oxidoreductase activity, acting on single donors with incorporation of molecular oxygen, incorporation of two atoms of oxygen
Vocabulary: INTERPRO
TermDefinition
IPR014710RmlC-like_jellyroll
IPR012864PCO/ADO
IPR011051RmlC_Cupin_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0055114 oxidation-reduction process
biological_process GO:0019530 taurine metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0047800 cysteamine dioxygenase activity
molecular_function GO:0016702 oxidoreductase activity, acting on single donors with incorporation of molecular oxygen, incorporation of two atoms of oxygen

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG05g15090.1Cp4.1LG05g15090.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011051RmlC-like cupin domainunknownSSF51182RmlC-like cupinscoord: 27..219
score: 4.27
IPR012864Cysteine oxygenase/2-aminoethanethiol dioxygenasePFAMPF07847DUF1637coord: 59..263
score: 2.8
IPR014710RmlC-like jelly roll foldGENE3DG3DSA:2.60.120.10coord: 102..206
score: 2.2
NoneNo IPR availablePANTHERPTHR22966UNCHARACTERIZEDcoord: 67..273
score: 3.8E
NoneNo IPR availablePANTHERPTHR22966:SF9SUBFAMILY NOT NAMEDcoord: 67..273
score: 3.8E

The following gene(s) are paralogous to this gene:
GeneParalogueOrganismBlock
Cp4.1LG05g15090Cp4.1LG16g07730Cucurbita pepo (Zucchini)cpecpeB317