CmaCh03G013240.1 (mRNA) Cucurbita maxima (Rimu)

Name: CmaCh03G013240.1
Type: mRNA
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: 2-aminoethanethiol dioxygenase
Location: Cma_Chr03 : 8533990 .. 8537740 (-)
Sequence length: 1033
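The location field can be turned into a genomic span with a few lines of Python. This is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re

def parse_location(text):
    """Parse a 'chrom : start .. end (strand)' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1.
    """
    m = re.search(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)
    return {"chrom": chrom, "start": start, "end": end,
            "strand": strand, "span": end - start + 1}

loc = parse_location("Cma_Chr03 : 8533990 .. 8537740 (-)")
print(loc["chrom"], loc["strand"], loc["span"])
```

Under that assumption the feature spans 3751 bp of the minus strand, which is much longer than the 1033 nt mRNA because the gene sequence includes introns.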
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, exon, three_prime_UTR
ATGCCCATAGTTCAAAAATTGTACGATGCTTGCAAGGCATCATTTTCTACCAGTGTTCCGGTGTCAGAAGAGGCTCTGCAGAAGGTCATTACTCTCTTAGGTGTGTGCAAATAAGTAATTCGCTTATTTTCTTGATAAGTCTTTTTCGTACAAAATTTCACTCAATAAGCTGTTTTTTTAGGATGCATCGCGTATCTTTATGAGTTTGTACAATTCTGTTTCAGATGAACTGAAGCCATCTAATGTGGGTCTCGAACGAGAGTCGCAGTTAGCTCGTGGTTGGAAAGGTTCACTGAATGTTACTAATGGGAAGAAAGGTCGAAAGGGCGCACACCAATATCCACCAACTATACAATATCTACACTTGCATGAATGTGATAGATTCTCGGTAAATGATCGCATTGATATGTTAACTTCTGGTTATTCCCCACCCCCTGCTCCATTAGTGCCATCTCTCGAAGGAACTATTTCTCGTTGTGTGCAGATCGGAATCTTTTGCATGCCTCCAGGTTCTATCATTCCACTTCATAATCATCCTGGAATGACTGTATTGAGCAAGCTTCTATATGGTGCATTACATGTACGATCATATGATTGGCTTGATCTGCCTGAGTTCATAGATCTGTCTCAAGGTATTTCTTTCTCATCCATGGAACCCACCTCAAGCTTGATAGAAGCCCAGTTCTAGTTTTTTGTGAGATCAGAAAATTAAGAATTTATTAGTGCTTATGTAATGAATTGTGAAGCAGTGCTATTATTATTGATTTTTTGATTAGAGACAATTTCATTGATAATTGAAATTTACGATGTTTATAAAGGACCTTCCCAATTTACCATGAGTGAATGTATAACTATAGAAGTAGAAATATTAGACGGTTTACACCTAGATATAGCTTGGTAAATAACATTGTCGAAGAATTTTGTATAAGTCTCGTGTCTTTTTTGCAAGTATTCTTTGATTTCTTTCTTTCCAAATATTCCAAAAGAAAGTCATGATGAGGTTTGTCCATAGTAGGGCTTTTGCATTCTTGAAAGTGTGGTACGTTAAGGCCATGTCCAATAGATCTTTTACCTCCCTTGGAAATGTGAGATGTCATTCGAATATATTGGGAATTGTTGTCCAGAAATTCTGAGTGTATATGCATTGCATAAATAAATAGTTTTGTGATTTGTTTTCTTTTTTTTTTTTCTTTTTTTTTTTTGTTTTTGCATAATGGACACCAATTTGGAAATAGAGTTATGTAGGCTATTGTTTTTTTTTAAAGATTTTCACTTGTGCTAATGGCTTTATGCATATTTCTCAAATGAAGAATTTCACCTTTTTTAGGTGATGAACTTTCCATACTGTCTTTCCTAGCGCTATTTCCCAAGTAGTAGCTTGTCTTAGCATTTATAATGCAATTGTACTTTTCAATTCAATACGGTCCTATCTGGCATTATTAGTTCTCTGTTTTGACTGTTTACTTCATTTGTGAAAAGTTGATACTAACGTTTTCCTTTAATCCCCTGGATGACAATGATGCTTTCCTTTCTGAAAACAAACGTTTTTGGAAATGGCAGACGGCAATATCCACCAACTATAAAATATCTACACTTGCATGAATGTCACAGATTTTCAGCTGGCGAGGTTCCTCTAGTCCACGGATCATTGTGGATCTGTTTCTAGAGGCGTTACTTGCTCCTGAAAAGCTACTCTCATGATCATGCAACTATTTTAAAACGAGTGCCCCCATCAATATTAGACTTTGTGTATAGTGAATACTCTCAGGTTCAAAAGAATGGGATCGAGTAGCTTGTAGGTGAAATGTCGGCTGCTGGGCTCATCCTGCTTAGTTGGGGTCTGTTTCCAACAGTCATAATAGTTCAAATGGAGGGTCTGTTTCCAACAGAGTCTTAATAGTTAAAAAGAAGCGTCTATTTCTAACAGTCTTATTAGTTAAAAGGAGCTTATGGCTATTTTGATGGGGGTACAGAAGTGGAGATCTTACCTTTTTTTGGCT
AGAACATTCATAATTCATTCCTACGAGAAAAGCTCACATGACTGGTTGTTGCAGTAATTAGTCAGAATGGAACATCAAAGGTGGCTTAGTAATTTTGGTGAGTAAGGATCTATGTGTTGAATGATCTCAAGTTTCAAGAAAGAGCGTAAGAAATCATGAATAGCGATCTGATAATAACATAAGGGGTGTGAAATTGAAGTTGGGAATTTTGGTCTACTTGAAGAACTTTGGTCTACTGAACTTCAATAGAAATCACCGGTAGTTATATGAAAGGAAAAGTTTACTCCTGTTATTACAACCCGAACTGATTACTCAAAAACAGTCGGTAAGGTGTCTTATCACTTGGTTTTTCTCTGCCTTCCACCATGAATATCCACCATGCTTTTCACGTGTCCTAGCAGAAATTGTAAGAAGGATCCACTTTCTGTCTTGTGACCTCAACTCAGGGCTAGCTACCTTCGAGCTTGACACCCTTTAGCAATTCTGGGAGAGTGCATTTCGGCTAAAAGACCCTGATGGAGCTGCAAGTTTTTGTTTAGCGCCAAGGCTAGTTAGTAGAGGAAGTTTAGTGGGAACCTTATATATCATCATAGCACAACGCTTTCCATCTTTTTACCCTGAGGACAAGGTAAAAGCTTGGGCAGGAAGTAATTTTAGGAGACATATCCTCCCATACGTTTTATGTATGGTAGGAGGGAGGCATGAGGTAAAGTGCATAAAGCGAGTTGTTAAAGGGAATATTTGGTGATTTGGGAGGGTGAAGGGTAGCTACTCAGACATCCCGAATATATATCCTTCCATACATTTTATGTATGGTAGAATTTCATGCAATTGTCTGCATTGTTTATCGTAATTGTTTTTGCTCTAAACATCTGCCCTGGGAAGAAAGAAAGAAATTGCTTGAGTTGAATCTTTTCCTCATTCCGTGAATAGCATTTGATGTAGTGATATATTTGAAATGATCCATTTTTGGGACGAATGTGAAACCGTTATTTTTTCCGGGCTTGGTTATTTTGTTCACTTTCCTTTCTTTGAGTAAAAGTTTCAGTATAAAATTTTATTTGAGTTGTATCCTCAGCCAGACCTGCAAAACTAGTTAGGGACTGTGAGATGATTGCACCTTGTGGAACTACAATTCTTTATCCAGATCGAGGTGGCAACATTCATCATTTCAAAGCCATAACTCCCTGCGCAATCTTCGACATTCTTTCACCGCCTTACTCGTCTGCAGATGGGCGACACTGCTCTTATTTCCGGAGGTCACCTAGGCGAGAGATTTCAGGTATCGTTTGTTTCCTCTGGATTTCTTTCTTGCTTCTACTTCTACACTGCTCTTATCTCCGCTCTCTTCCAATTCAAAGGTCTTGACCAACTGTGTGGAACTGAAACCGGTCCCTCAGAAGTTACTTGGTTGGACGAGATTCAGCCACCTGAAAACTTCGTGGTCCGGCGGGGCCTGTACAAAGGCCCCATTATTAGACAATGGAGTTAATACTGTAGAGCATCATTTAGTGGGATTGGCTCGAAAAATCAACGAACCATCCAGTTAAACTACACCAGCGATTCTTAACTGGTTCAATAATTGTATGTTAAGAATACTGAGGAGGGAGTCTTCTTCGATGTAATTGAATTGTATAATTGAAAGTTAAGAAAAAGACTGAATAATAAATGAAAAATGTCTGTGGTAGAAGATTGTAACGCTGGATTTGACTTTGATATCAATCAATTAATGTGATTAAATTTGAATTGCA

mRNA sequence

ATGCCCATAGTTCAAAAATTGTACGATGCTTGCAAGGCATCATTTTCTACCAGTGTTCCGGTGTCAGAAGAGGCTCTGCAGAAGGTCATTACTCTCTTAGATGAACTGAAGCCATCTAATGTGGGTCTCGAACGAGAGTCGCAGTTAGCTCGTGGTTGGAAAGGTTCACTGAATGTTACTAATGGGAAGAAAGGTCGAAAGGGCGCACACCAATATCCACCAACTATACAATATCTACACTTGCATGAATGTGATAGATTCTCGGTAAATGATCGCATTGATATGTTAACTTCTGGTTATTCCCCACCCCCTGCTCCATTAGTGCCATCTCTCGAAGGAACTATTTCTCGTTGTGTGCAGATCGGAATCTTTTGCATGCCTCCAGGTTCTATCATTCCACTTCATAATCATCCTGGAATGACTGTATTGAGCAAGCTTCTATATGGTGCATTACATGTACGATCATATGATTGGCTTGATCTGCCTGAGTTCATAGATCTGTCTCAAGATCGAGGTGGCAACATTCATCATTTCAAAGCCATAACTCCCTGCGCAATCTTCGACATTCTTTCACCGCCTTACTCGTCTGCAGATGGGCGACACTGCTCTTATTTCCGGAGGTCACCTAGGCGAGAGATTTCAGGTCTTGACCAACTGTGTGGAACTGAAACCGGTCCCTCAGAAGTTACTTGGTTGGACGAGATTCAGCCACCTGAAAACTTCGTGGTCCGGCGGGGCCTGTACAAAGGCCCCATTATTAGACAATGGAGTTAATACTGTAGAGCATCATTTAGTGGGATTGGCTCGAAAAATCAACGAACCATCCAGTTAAACTACACCAGCGATTCTTAACTGGTTCAATAATTGTATGTTAAGAATACTGAGGAGGGAGTCTTCTTCGATGTAATTGAATTGTATAATTGAAAGTTAAGAAAAAGACTGAATAATAAATGAAAAATGTCTGTGGTAGAAGATTGTAACGCTGGATTTGACTTTGATATCAATCAATTAATGTGATTAAATTTGAATTGCA

Coding sequence (CDS)

ATGCCCATAGTTCAAAAATTGTACGATGCTTGCAAGGCATCATTTTCTACCAGTGTTCCGGTGTCAGAAGAGGCTCTGCAGAAGGTCATTACTCTCTTAGATGAACTGAAGCCATCTAATGTGGGTCTCGAACGAGAGTCGCAGTTAGCTCGTGGTTGGAAAGGTTCACTGAATGTTACTAATGGGAAGAAAGGTCGAAAGGGCGCACACCAATATCCACCAACTATACAATATCTACACTTGCATGAATGTGATAGATTCTCGGTAAATGATCGCATTGATATGTTAACTTCTGGTTATTCCCCACCCCCTGCTCCATTAGTGCCATCTCTCGAAGGAACTATTTCTCGTTGTGTGCAGATCGGAATCTTTTGCATGCCTCCAGGTTCTATCATTCCACTTCATAATCATCCTGGAATGACTGTATTGAGCAAGCTTCTATATGGTGCATTACATGTACGATCATATGATTGGCTTGATCTGCCTGAGTTCATAGATCTGTCTCAAGATCGAGGTGGCAACATTCATCATTTCAAAGCCATAACTCCCTGCGCAATCTTCGACATTCTTTCACCGCCTTACTCGTCTGCAGATGGGCGACACTGCTCTTATTTCCGGAGGTCACCTAGGCGAGAGATTTCAGGTCTTGACCAACTGTGTGGAACTGAAACCGGTCCCTCAGAAGTTACTTGGTTGGACGAGATTCAGCCACCTGAAAACTTCGTGGTCCGGCGGGGCCTGTACAAAGGCCCCATTATTAGACAATGGAGTTAA

Protein sequence

MPIVQKLYDACKASFSTSVPVSEEALQKVITLLDELKPSNVGLERESQLARGWKGSLNVTNGKKGRKGAHQYPPTIQYLHLHECDRFSVNDRIDMLTSGYSPPPAPLVPSLEGTISRCVQIGIFCMPPGSIIPLHNHPGMTVLSKLLYGALHVRSYDWLDLPEFIDLSQDRGGNIHHFKAITPCAIFDILSPPYSSADGRHCSYFRRSPRREISGLDQLCGTETGPSEVTWLDEIQPPENFVVRRGLYKGPIIRQWS
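The protein sequence above is the translation of the coding sequence. The relationship can be checked with a short Python sketch that assumes the standard genetic code; the names `CODON_TABLE` and `translate` are illustrative and do not come from the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (first base slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 36 nt and last 12 nt of the CDS listed above.
print(translate("ATGCCCATAGTTCAAAAATTGTACGATGCTTGCAAG"))  # MPIVQKLYDACK
print(translate("CAATGGAGTTAA"))                          # QWS
```

Passing the full CDS string to `translate` should reproduce the protein sequence shown above, ending at the terminal TAA stop codon.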
BLAST of CmaCh03G013240.1 vs. Swiss-Prot
Match: PCO5_ARATH (Plant cysteine oxidase 5 OS=Arabidopsis thaliana GN=PCO5 PE=1 SV=1)

HSP 1 Score: 250.4 bits (638), Expect = 2.2e-65
Identity = 135/277 (48.74%), Positives = 165/277 (59.57%), Query Frame = 1

Query: 4   VQKLYDACKASFSTSVPVSEEALQKVITLLDELKPSNVGLERESQLARGWKGSLNVTNGK 63
           +Q+L++ CK+S S + PVSEEAL KV  +L+++KPS+VGLE+E+QL R W G  N  NG 
Sbjct: 5   IQRLFNTCKSSLSPNGPVSEEALDKVRNVLEKIKPSDVGLEQEAQLVRNWPGPGNERNGN 64

Query: 64  KGRKGAHQYPPTIQYLHLHECDRFSVNDRIDMLTSGYSPPPAPLVPSLEGTISRCVQIGI 123
                 H   P I+YL LHECD FS                                IGI
Sbjct: 65  ------HHSLPAIKYLQLHECDSFS--------------------------------IGI 124

Query: 124 FCMPPGSIIPLHNHPGMTVLSKLLYGALHVRSYDWL--DLPEFIDLSQDR---------- 183
           FCMPPGSIIPLHNHPGMTVLSKL+YG++HV+SYDW   D  E  D  Q R          
Sbjct: 125 FCMPPGSIIPLHNHPGMTVLSKLVYGSMHVKSYDWAEPDQSELDDPLQARPAKLVKDIDM 184

Query: 184 -------------GGNIHHFKAITPCAIFDILSPPYSSADGRHCSYFRRSPRREISGLDQ 243
                        GGNIH FKAIT CAIFDILSPPYSS  GRHC+YFR+SP  ++ G  +
Sbjct: 185 TSPSPATTLYPTTGGNIHCFKAITHCAIFDILSPPYSSTHGRHCNYFRKSPMLDLPGEIE 242

Query: 244 LCGTETGPSEVTWLDEIQPPENFVVRRGLYKGPIIRQ 256
           +   E   S VTWL+E QPP+NFV+ R  Y+GP+IR+
Sbjct: 245 VMNGEV-ISNVTWLEEYQPPDNFVIWRVPYRGPVIRK 242
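The percentages in each HSP header are the matched-column counts divided by the alignment length, gaps included. A minimal Python sketch that recomputes them from a header line follows; the function name `hsp_percentages` is hypothetical, and the regular expression picks up any `label = matched/length` pair in the line.

```python
import re

def hsp_percentages(header):
    """Recompute percentages from the 'label = matched/length' pairs in an HSP header."""
    result = {}
    for label, matched, length in re.findall(r"(\w+) = (\d+)/(\d+)", header):
        result[label] = round(100 * int(matched) / int(length), 2)
    return result

header = "Identity = 135/277 (48.74%), Positives = 165/277 (59.57%), Query Frame = 1"
print(hsp_percentages(header))
```

For the PCO5_ARATH hit this gives 48.74 for identity and 59.57 for positives, matching the values printed in the header.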

BLAST of CmaCh03G013240.1 vs. Swiss-Prot
Match: PCO4_ARATH (Plant cysteine oxidase 4 OS=Arabidopsis thaliana GN=PCO4 PE=1 SV=2)

HSP 1 Score: 250.4 bits (638), Expect = 2.2e-65
Identity = 132/275 (48.00%), Positives = 172/275 (62.55%), Query Frame = 1

Query: 5   QKLYDACKASFSTSVPVSEEALQKVITLLDELKPSNVGLERESQLARGWKGSLNVTNGKK 64
           Q+LY+ CKASFS+  P++E+AL+KV  +L+++KPS+VG+E+++QLAR   G LN  NG  
Sbjct: 6   QRLYNTCKASFSSDGPITEDALEKVRNVLEKIKPSDVGIEQDAQLARSRSGPLNERNG-- 65

Query: 65  GRKGAHQYPPTIQYLHLHECDRFSVNDRIDMLTSGYSPPPAPLVPSLEGTISRCVQIGIF 124
               ++Q PP I+YLHLHECD FS                                IGIF
Sbjct: 66  ----SNQSPPAIKYLHLHECDSFS--------------------------------IGIF 125

Query: 125 CMPPGSIIPLHNHPGMTVLSKLLYGALHVRSYDWLD--LPEFIDLSQDR----------- 184
           CMPP S+IPLHNHPGMTVLSKL+YG++HV+SYDWL+  L E  D SQ R           
Sbjct: 126 CMPPSSMIPLHNHPGMTVLSKLVYGSMHVKSYDWLEPQLTEPEDPSQARPAKLVKDTEMT 185

Query: 185 ------------GGNIHHFKAITPCAIFDILSPPYSSADGRHCSYFRRSPRREISGLDQL 244
                       GGNIH FKAIT CAI DIL+PPYSS   RHC+YFR+S R ++ G  ++
Sbjct: 186 AQSPVTTLYPKSGGNIHCFKAITHCAILDILAPPYSSEHDRHCTYFRKSRREDLPGELEV 240

Query: 245 CGTETGPSEVTWLDEIQPPENFVVRRGLYKGPIIR 255
            G     ++VTWL+E QPP++FV+RR  Y+GP+IR
Sbjct: 246 DGEVV--TDVTWLEEFQPPDDFVIRRIPYRGPVIR 240

BLAST of CmaCh03G013240.1 vs. Swiss-Prot
Match: PCO1_ARATH (Plant cysteine oxidase 1 OS=Arabidopsis thaliana GN=PCO1 PE=1 SV=1)

HSP 1 Score: 161.0 bits (406), Expect = 1.7e-38
Identity = 96/276 (34.78%), Positives = 133/276 (48.19%), Query Frame = 1

Query: 4   VQKLYDACKASFSTSVP---VSEEALQKVITLLDELKPSNVGLERESQLARGWKGSLNVT 63
           V++L++ CK  FS   P    SE+ +Q++  +LD++KP +VGL       R         
Sbjct: 58  VRRLFNTCKEVFSNGGPGVIPSEDKIQQLREILDDMKPEDVGLTPTMPYFRP-------N 117

Query: 64  NGKKGRKGAHQYPPTIQYLHLHECDRFSVNDRIDMLTSGYSPPPAPLVPSLEGTISRCVQ 123
           +G + R       P I YLHLH+CD+FS                                
Sbjct: 118 SGVEARSS-----PPITYLHLHQCDQFS-------------------------------- 177

Query: 124 IGIFCMPPGSIIPLHNHPGMTVLSKLLYGALHVRSYDW-LDLP----------------- 183
           IGIFC+PP  +IPLHNHPGMTV SKLL+G +H++SYDW +D P                 
Sbjct: 178 IGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKSYDWVVDAPMRDSKTRLAKLKVDSTF 237

Query: 184 ----EFIDLSQDRGGNIHHFKAITPCAIFDILSPPYSSADGRHCSYFRRSPRREISGL-D 243
                   L  + GGN+H F AIT CA+ D+L PPY + +GRHC+YF   P  ++S   D
Sbjct: 238 TAPCNASILYPEDGGNMHRFTAITACAVLDVLGPPYCNPEGRHCTYFLEFPLDKLSSEDD 289

Query: 244 QLCGTETGPSEVTWLDE--IQPPENFVVRRGLYKGP 252
            +  +E       WL E    P ++  V   LY+GP
Sbjct: 298 DVLSSEEEKEGYAWLQERDDNPEDHTNVVGALYRGP 289

BLAST of CmaCh03G013240.1 vs. Swiss-Prot
Match: PCO3_ARATH (Plant cysteine oxidase 3 OS=Arabidopsis thaliana GN=PCO3 PE=1 SV=1)

HSP 1 Score: 156.4 bits (394), Expect = 4.3e-37
Identity = 99/287 (34.49%), Positives = 139/287 (48.43%), Query Frame = 1

Query: 2   PIVQKLYDACKASFSTSVPV-SEEALQKVITLLDELKPSNVGLERESQLARGWKGSLNVT 61
           P VQ+LYD CK +F+   P  +  A+QK+ ++LD + P++VGLE  SQ      G   V+
Sbjct: 33  PKVQELYDLCKETFTGKAPSPASMAIQKLCSVLDSVSPADVGLEEVSQDDDRGYGVSGVS 92

Query: 62  NGKKGRKGAHQYPPTIQYLHLHECDRFSVNDRIDMLTSGYSPPPAPLVPSLEGTISRCVQ 121
             +  R G    P  I +L +HECD F++                            C  
Sbjct: 93  --RFNRVGRWAQP--ITFLDIHECDTFTM----------------------------C-- 152

Query: 122 IGIFCMPPGSIIPLHNHPGMTVLSKLLYGALHVRSYDWLDLPEFIDLSQDR--------- 181
             IFC P  S+IPLH+HP M V SK+LYG+LHV++YDW++ P  I  +QD+         
Sbjct: 153 --IFCFPTSSVIPLHDHPEMAVFSKILYGSLHVKAYDWVEPPCII--TQDKGVPGSLPAR 212

Query: 182 -----------------------GGNIHHFKAITPCAIFDILSPPYSSADGRHCSYFRRS 241
                                  GGN+H F A+TPCA+ DILSPPY  + GR CSY+   
Sbjct: 213 LAKLVSDKVITPQSEIPALYPKTGGNLHCFTALTPCAVLDILSPPYKESVGRSCSYYMDY 272

Query: 242 PRREISGLDQLCGTETG-PSEVTWLDEIQPPENFVVRRGLYKGPIIR 255
           P    +  + +   + G   E  WL +I  P++  +R G Y GP IR
Sbjct: 273 PFSTFALENGMKKVDEGKEDEYAWLVQIDTPDDLHMRPGSYTGPTIR 281

BLAST of CmaCh03G013240.1 vs. Swiss-Prot
Match: PCO2_ARATH (Plant cysteine oxidase 2 OS=Arabidopsis thaliana GN=PCO2 PE=1 SV=1)

HSP 1 Score: 146.4 bits (368), Expect = 4.4e-34
Identity = 95/279 (34.05%), Positives = 132/279 (47.31%), Query Frame = 1

Query: 4   VQKLYDACKASF----STSVPVSEEALQKVITLLDELKPSNVGLERESQLARGWKGSLNV 63
           VQKL+D CK  F    S +VP S+E ++ +  +LDE+KP +VG+  +    R       V
Sbjct: 47  VQKLFDTCKKVFADGKSGTVP-SQENIEMLRAVLDEIKPEDVGVNPKMSYFRS-----TV 106

Query: 64  TNGKKGRKGAHQYPPTIQYLHLHECDRFSVNDRIDMLTSGYSPPPAPLVPSLEGTISRCV 123
           T    GR       P + YLH++ C RFS                               
Sbjct: 107 T----GRS------PLVTYLHIYACHRFS------------------------------- 166

Query: 124 QIGIFCMPPGSIIPLHNHPGMTVLSKLLYGALHVRSYDWL-DLPE--------FIDLSQD 183
            I IFC+PP  +IPLHNHP MTV SKLL+G +H++SYDW+ D P+         + +  D
Sbjct: 167 -ICIFCLPPSGVIPLHNHPEMTVFSKLLFGTMHIKSYDWVPDSPQPSSDTRLAKVKVDSD 226

Query: 184 -------------RGGNIHHFKAITPCAIFDILSPPYSSADGRHCSYFRRSPRREISGLD 243
                         GGN+H F A T CA+ D++ PPYS   GRHC+Y+   P    S +D
Sbjct: 227 FTAPCDTSILYPADGGNMHCFTAKTACAVLDVIGPPYSDPAGRHCTYYFDYPFSSFS-VD 276

Query: 244 QLCGTETGPSEVTWLDE-IQPPENFVVRRGLYKGPIIRQ 256
            +   E       WL E  + PE+  V   +Y GP I++
Sbjct: 287 GVVVAEEEKEGYAWLKEREEKPEDLTVTALMYSGPTIKE 276

BLAST of CmaCh03G013240.1 vs. TrEMBL
Match: D7TJM3_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_10s0003g01390 PE=4 SV=1)

HSP 1 Score: 345.1 bits (884), Expect = 7.2e-92
Identity = 171/277 (61.73%), Positives = 195/277 (70.40%), Query Frame = 1

Query: 1   MPIVQKLYDACKASFSTSVPVSEEALQKVITLLDELKPSNVGLERESQLARGWKGSLNVT 60
           MP+VQKLY+ACK SFS   P+SEEAL KV ++LD++KPSNVGLE+E+QLARGWKGS++  
Sbjct: 1   MPVVQKLYNACKESFSVDGPLSEEALGKVRSILDDMKPSNVGLEQEAQLARGWKGSMHGA 60

Query: 61  NGKKGRKGAHQYPPTIQYLHLHECDRFSVNDRIDMLTSGYSPPPAPLVPSLEGTISRCVQ 120
           NGKK R G+HQYPP I+YLHLHECDRFS                                
Sbjct: 61  NGKKVRNGSHQYPPPIKYLHLHECDRFS-------------------------------- 120

Query: 121 IGIFCMPPGSIIPLHNHPGMTVLSKLLYGALHVRSYDWLDLPEFIDLSQDR--------- 180
           IGIFCMPP SIIPLHNHPGMTVLSKLLYG LHV+SYDWLDLP   DLSQ R         
Sbjct: 121 IGIFCMPPSSIIPLHNHPGMTVLSKLLYGTLHVKSYDWLDLPGTADLSQARPAKLVRDCE 180

Query: 181 --------------GGNIHHFKAITPCAIFDILSPPYSSADGRHCSYFRRSPRREISGLD 240
                         GGNIH FKAITPCA+FD+LSPPYSS DGRHCSYFR+SPR+++ G+D
Sbjct: 181 MSAPCGTTILYPTNGGNIHCFKAITPCALFDVLSPPYSSEDGRHCSYFRKSPRKDLPGID 240

Query: 241 QLCGTETGPSEVTWLDEIQPPENFVVRRGLYKGPIIR 255
           QLCG +  PSEV WL+EIQPPEN VV RG Y+GPIIR
Sbjct: 241 QLCGIK--PSEVVWLEEIQPPENVVVLRGQYEGPIIR 243

BLAST of CmaCh03G013240.1 vs. TrEMBL
Match: V7BPY1_PHAVU (Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_006G175200g PE=4 SV=1)

HSP 1 Score: 338.2 bits (866), Expect = 8.8e-90
Identity = 172/280 (61.43%), Positives = 193/280 (68.93%), Query Frame = 1

Query: 1   MPIVQKLYDACKASFSTSVPVSEEALQKVITLLDELKPSNVGLERESQLARGWKGSLNVT 60
           MPIVQKLYD CKAS S   P+SEEAL+KV  LLDELKPSNVGLE+E+QL RGWKGSLN T
Sbjct: 1   MPIVQKLYDTCKASLSPEGPISEEALKKVQALLDELKPSNVGLEQEAQLVRGWKGSLNGT 60

Query: 61  NGKKGRKGAHQYPPTIQYLHLHECDRFSVNDRIDMLTSGYSPPPAPLVPSLEGTISRCVQ 120
           NGKKG  G++QYPP I+Y+HLHECD+FS                                
Sbjct: 61  NGKKGPNGSYQYPPPIKYIHLHECDKFS-------------------------------- 120

Query: 121 IGIFCMPPGSIIPLHNHPGMTVLSKLLYGALHVRSYDWLDLPEFIDLSQDR--------- 180
           +GIFCM PGSIIPLHNHPGMTVLSKLLYG+ HVR+YDWLDLP   D SQ R         
Sbjct: 121 MGIFCMAPGSIIPLHNHPGMTVLSKLLYGSFHVRAYDWLDLPGSDDSSQARPAKLVKDCQ 180

Query: 181 --------------GGNIHHFKAITPCAIFDILSPPYSSADGRHCSYFRRSPRREISG-- 240
                         GGNIH FKA+TPCA+FDILSPPYSS DGRHCSYFR+SPR+E+ G  
Sbjct: 181 MSAPCNTTILYPSKGGNIHCFKALTPCALFDILSPPYSSEDGRHCSYFRKSPRKELPGVD 240

Query: 241 LDQLCGTETGPSEVTWLDEIQPPENFVVRRGLYKGPIIRQ 256
           LDQ CG +   SEVTWL+EIQ PEN VVRRGLY+GP IR+
Sbjct: 241 LDQSCGVKA--SEVTWLEEIQAPENLVVRRGLYRGPTIRR 246

BLAST of CmaCh03G013240.1 vs. TrEMBL
Match: C6TH33_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_15G083700 PE=2 SV=1)

HSP 1 Score: 337.0 bits (863), Expect = 2.0e-89
Identity = 171/280 (61.07%), Positives = 196/280 (70.00%), Query Frame = 1

Query: 1   MPIVQKLYDACKASFSTSVPVSEEALQKVITLLDELKPSNVGLERESQLARGWKGSLNVT 60
           MPIVQKLYD CKAS S   P+SEEAL+KV TLLDELKPSNVGLE+E+QL RGWKGSLN T
Sbjct: 1   MPIVQKLYDTCKASLSPEGPISEEALEKVRTLLDELKPSNVGLEQEAQLVRGWKGSLNGT 60

Query: 61  NGKKGRKGAHQYPPTIQYLHLHECDRFSVNDRIDMLTSGYSPPPAPLVPSLEGTISRCVQ 120
           NGKKGR G++QYPP+I+Y+HLHECD+FS                                
Sbjct: 61  NGKKGRNGSYQYPPSIKYIHLHECDKFS-------------------------------- 120

Query: 121 IGIFCMPPGSIIPLHNHPGMTVLSKLLYGALHVRSYDWLDLPEFIDLSQDR--------- 180
           +GIFCM PGS+IPLHNHPGMTVLSKLLYG+L VRSYDWLDLP   D SQ R         
Sbjct: 121 MGIFCMSPGSVIPLHNHPGMTVLSKLLYGSLLVRSYDWLDLPGPDDPSQARPAKLVKDCQ 180

Query: 181 --------------GGNIHHFKAITPCAIFDILSPPYSSADGRHCSYFRRSPRREISG-- 240
                         GGNIH FKA+TPCA+FD+LSPPYSS DGRHCSYFR+S R+++ G  
Sbjct: 181 MSAPCNTTVLYPSKGGNIHCFKALTPCALFDVLSPPYSSEDGRHCSYFRKSTRKDLPGVE 240

Query: 241 LDQLCGTETGPSEVTWLDEIQPPENFVVRRGLYKGPIIRQ 256
           LDQL G +  PSE+TWL+EIQ PEN VVRRG+YKGP IR+
Sbjct: 241 LDQLSGVK--PSEITWLEEIQAPENLVVRRGVYKGPTIRR 246

BLAST of CmaCh03G013240.1 vs. TrEMBL
Match: A0A0B2R3A7_GLYSO (2-aminoethanethiol dioxygenase OS=Glycine soja GN=glysoja_040484 PE=4 SV=1)

HSP 1 Score: 337.0 bits (863), Expect = 2.0e-89
Identity = 171/280 (61.07%), Positives = 196/280 (70.00%), Query Frame = 1

Query: 1   MPIVQKLYDACKASFSTSVPVSEEALQKVITLLDELKPSNVGLERESQLARGWKGSLNVT 60
           MPIVQKLYD CKAS S   P+SEEAL+KV TLLDELKPSNVGLE+E+QL RGWKGSLN T
Sbjct: 1   MPIVQKLYDTCKASLSPEGPISEEALEKVRTLLDELKPSNVGLEQEAQLVRGWKGSLNGT 60

Query: 61  NGKKGRKGAHQYPPTIQYLHLHECDRFSVNDRIDMLTSGYSPPPAPLVPSLEGTISRCVQ 120
           NGKKGR G++QYPP+I+Y+HLHECD+FS                                
Sbjct: 61  NGKKGRNGSYQYPPSIKYIHLHECDKFS-------------------------------- 120

Query: 121 IGIFCMPPGSIIPLHNHPGMTVLSKLLYGALHVRSYDWLDLPEFIDLSQDR--------- 180
           +GIFCM PGS+IPLHNHPGMTVLSKLLYG+L VRSYDWLDLP   D SQ R         
Sbjct: 121 MGIFCMSPGSVIPLHNHPGMTVLSKLLYGSLLVRSYDWLDLPGPDDPSQARPAKLVKDCQ 180

Query: 181 --------------GGNIHHFKAITPCAIFDILSPPYSSADGRHCSYFRRSPRREISG-- 240
                         GGNIH FKA+TPCA+FD+LSPPYSS DGRHCSYFR+S R+++ G  
Sbjct: 181 MSAPCNTTVLYPSKGGNIHCFKALTPCALFDVLSPPYSSEDGRHCSYFRKSTRKDLPGVE 240

Query: 241 LDQLCGTETGPSEVTWLDEIQPPENFVVRRGLYKGPIIRQ 256
           LDQL G +  PSE+TWL+EIQ PEN VVRRG+YKGP IR+
Sbjct: 241 LDQLSGVK--PSEITWLEEIQAPENLVVRRGVYKGPTIRR 246

BLAST of CmaCh03G013240.1 vs. TrEMBL
Match: I1MET1_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_15G083700 PE=4 SV=1)

HSP 1 Score: 336.7 bits (862), Expect = 2.6e-89
Identity = 171/286 (59.79%), Positives = 197/286 (68.88%), Query Frame = 1

Query: 1   MPIVQKLYDACKASFSTSVPVSEEALQKVITLLDELKPSNVGLERESQLARGWKGSLNVT 60
           MPIVQKLYD CKAS S   P+SEEAL+KV TLLDELKPSNVGLE+E+QL RGWKGSLN T
Sbjct: 1   MPIVQKLYDTCKASLSPEGPISEEALEKVRTLLDELKPSNVGLEQEAQLVRGWKGSLNGT 60

Query: 61  NGKKGRKGAHQYPPTIQYLHLHECDRFSVNDRIDMLTSGYSPPPAPLVPSLEGTISRCVQ 120
           NGKKGR G++QYPP+I+Y+HLHECD+FS                                
Sbjct: 61  NGKKGRNGSYQYPPSIKYIHLHECDKFS-------------------------------- 120

Query: 121 IGIFCMPPGSIIPLHNHPGMTVLSKLLYGALHVRSYDWLDLPEFIDLSQD---------- 180
           +GIFCM PGS+IPLHNHPGMTVLSKLLYG+L VRSYDWLDLP   D SQD          
Sbjct: 121 MGIFCMSPGSVIPLHNHPGMTVLSKLLYGSLLVRSYDWLDLPGPDDPSQDFSTLAARPAK 180

Query: 181 -------------------RGGNIHHFKAITPCAIFDILSPPYSSADGRHCSYFRRSPRR 240
                              +GGNIH FKA+TPCA+FD+LSPPYSS DGRHCSYFR+S R+
Sbjct: 181 LVKDCQMSAPCNTTVLYPSKGGNIHCFKALTPCALFDVLSPPYSSEDGRHCSYFRKSTRK 240

Query: 241 EISG--LDQLCGTETGPSEVTWLDEIQPPENFVVRRGLYKGPIIRQ 256
           ++ G  LDQL G +  PSE+TWL+EIQ PEN VVRRG+YKGP IR+
Sbjct: 241 DLPGVELDQLSGVK--PSEITWLEEIQAPENLVVRRGVYKGPTIRR 252

BLAST of CmaCh03G013240.1 vs. TAIR10
Match: AT3G58670.1 (AT3G58670.1 Protein of unknown function (DUF1637))

HSP 1 Score: 250.4 bits (638), Expect = 1.2e-66
Identity = 135/277 (48.74%), Positives = 165/277 (59.57%), Query Frame = 1

Query: 4   VQKLYDACKASFSTSVPVSEEALQKVITLLDELKPSNVGLERESQLARGWKGSLNVTNGK 63
           +Q+L++ CK+S S + PVSEEAL KV  +L+++KPS+VGLE+E+QL R W G  N  NG 
Sbjct: 5   IQRLFNTCKSSLSPNGPVSEEALDKVRNVLEKIKPSDVGLEQEAQLVRNWPGPGNERNGN 64

Query: 64  KGRKGAHQYPPTIQYLHLHECDRFSVNDRIDMLTSGYSPPPAPLVPSLEGTISRCVQIGI 123
                 H   P I+YL LHECD FS                                IGI
Sbjct: 65  ------HHSLPAIKYLQLHECDSFS--------------------------------IGI 124

Query: 124 FCMPPGSIIPLHNHPGMTVLSKLLYGALHVRSYDWL--DLPEFIDLSQDR---------- 183
           FCMPPGSIIPLHNHPGMTVLSKL+YG++HV+SYDW   D  E  D  Q R          
Sbjct: 125 FCMPPGSIIPLHNHPGMTVLSKLVYGSMHVKSYDWAEPDQSELDDPLQARPAKLVKDIDM 184

Query: 184 -------------GGNIHHFKAITPCAIFDILSPPYSSADGRHCSYFRRSPRREISGLDQ 243
                        GGNIH FKAIT CAIFDILSPPYSS  GRHC+YFR+SP  ++ G  +
Sbjct: 185 TSPSPATTLYPTTGGNIHCFKAITHCAIFDILSPPYSSTHGRHCNYFRKSPMLDLPGEIE 242

Query: 244 LCGTETGPSEVTWLDEIQPPENFVVRRGLYKGPIIRQ 256
           +   E   S VTWL+E QPP+NFV+ R  Y+GP+IR+
Sbjct: 245 VMNGEV-ISNVTWLEEYQPPDNFVIWRVPYRGPVIRK 242

BLAST of CmaCh03G013240.1 vs. TAIR10
Match: AT2G42670.2 (AT2G42670.2 Protein of unknown function (DUF1637))

HSP 1 Score: 249.2 bits (635), Expect = 2.7e-66
Identity = 131/276 (47.46%), Positives = 172/276 (62.32%), Query Frame = 1

Query: 5   QKLYDACKASFSTSVPVSEEALQKVITLLDELKPSNVGLERESQLARGWKGSLNVTNGKK 64
           Q+LY+ CKASFS+  P++E+AL+KV  +L+++KPS+VG+E+++QLAR   G LN  NG  
Sbjct: 6   QRLYNTCKASFSSDGPITEDALEKVRNVLEKIKPSDVGIEQDAQLARSRSGPLNERNG-- 65

Query: 65  GRKGAHQYPPTIQYLHLHECDRFSVNDRIDMLTSGYSPPPAPLVPSLEGTISRCVQIGIF 124
               ++Q PP I+YLHLHECD FS                                IGIF
Sbjct: 66  ----SNQSPPAIKYLHLHECDSFS--------------------------------IGIF 125

Query: 125 CMPPGSIIPLHNHPGMTVLSKLLYGALHVRSYDWLD--LPEFIDLSQD------------ 184
           CMPP S+IPLHNHPGMTVLSKL+YG++HV+SYDWL+  L E  D SQ+            
Sbjct: 126 CMPPSSMIPLHNHPGMTVLSKLVYGSMHVKSYDWLEPQLTEPEDPSQEARPAKLVKDTEM 185

Query: 185 ------------RGGNIHHFKAITPCAIFDILSPPYSSADGRHCSYFRRSPRREISGLDQ 244
                        GGNIH FKAIT CAI DIL+PPYSS   RHC+YFR+S R ++ G  +
Sbjct: 186 TAQSPVTTLYPKSGGNIHCFKAITHCAILDILAPPYSSEHDRHCTYFRKSRREDLPGELE 241

Query: 245 LCGTETGPSEVTWLDEIQPPENFVVRRGLYKGPIIR 255
           + G     ++VTWL+E QPP++FV+RR  Y+GP+IR
Sbjct: 246 VDGEVV--TDVTWLEEFQPPDDFVIRRIPYRGPVIR 241

BLAST of CmaCh03G013240.1 vs. TAIR10
Match: AT5G15120.1 (AT5G15120.1 Protein of unknown function (DUF1637))

HSP 1 Score: 161.0 bits (406), Expect = 9.8e-40
Identity = 96/276 (34.78%), Positives = 133/276 (48.19%), Query Frame = 1

Query: 4   VQKLYDACKASFSTSVP---VSEEALQKVITLLDELKPSNVGLERESQLARGWKGSLNVT 63
           V++L++ CK  FS   P    SE+ +Q++  +LD++KP +VGL       R         
Sbjct: 58  VRRLFNTCKEVFSNGGPGVIPSEDKIQQLREILDDMKPEDVGLTPTMPYFRP-------N 117

Query: 64  NGKKGRKGAHQYPPTIQYLHLHECDRFSVNDRIDMLTSGYSPPPAPLVPSLEGTISRCVQ 123
           +G + R       P I YLHLH+CD+FS                                
Sbjct: 118 SGVEARSS-----PPITYLHLHQCDQFS-------------------------------- 177

Query: 124 IGIFCMPPGSIIPLHNHPGMTVLSKLLYGALHVRSYDW-LDLP----------------- 183
           IGIFC+PP  +IPLHNHPGMTV SKLL+G +H++SYDW +D P                 
Sbjct: 178 IGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKSYDWVVDAPMRDSKTRLAKLKVDSTF 237

Query: 184 ----EFIDLSQDRGGNIHHFKAITPCAIFDILSPPYSSADGRHCSYFRRSPRREISGL-D 243
                   L  + GGN+H F AIT CA+ D+L PPY + +GRHC+YF   P  ++S   D
Sbjct: 238 TAPCNASILYPEDGGNMHRFTAITACAVLDVLGPPYCNPEGRHCTYFLEFPLDKLSSEDD 289

Query: 244 QLCGTETGPSEVTWLDE--IQPPENFVVRRGLYKGP 252
            +  +E       WL E    P ++  V   LY+GP
Sbjct: 298 DVLSSEEEKEGYAWLQERDDNPEDHTNVVGALYRGP 289

BLAST of CmaCh03G013240.1 vs. TAIR10
Match: AT1G18490.1 (AT1G18490.1 Protein of unknown function (DUF1637))

HSP 1 Score: 156.4 bits (394), Expect = 2.4e-38
Identity = 99/287 (34.49%), Positives = 139/287 (48.43%), Query Frame = 1

Query: 2   PIVQKLYDACKASFSTSVPV-SEEALQKVITLLDELKPSNVGLERESQLARGWKGSLNVT 61
           P VQ+LYD CK +F+   P  +  A+QK+ ++LD + P++VGLE  SQ      G   V+
Sbjct: 33  PKVQELYDLCKETFTGKAPSPASMAIQKLCSVLDSVSPADVGLEEVSQDDDRGYGVSGVS 92

Query: 62  NGKKGRKGAHQYPPTIQYLHLHECDRFSVNDRIDMLTSGYSPPPAPLVPSLEGTISRCVQ 121
             +  R G    P  I +L +HECD F++                            C  
Sbjct: 93  --RFNRVGRWAQP--ITFLDIHECDTFTM----------------------------C-- 152

Query: 122 IGIFCMPPGSIIPLHNHPGMTVLSKLLYGALHVRSYDWLDLPEFIDLSQDR--------- 181
             IFC P  S+IPLH+HP M V SK+LYG+LHV++YDW++ P  I  +QD+         
Sbjct: 153 --IFCFPTSSVIPLHDHPEMAVFSKILYGSLHVKAYDWVEPPCII--TQDKGVPGSLPAR 212

Query: 182 -----------------------GGNIHHFKAITPCAIFDILSPPYSSADGRHCSYFRRS 241
                                  GGN+H F A+TPCA+ DILSPPY  + GR CSY+   
Sbjct: 213 LAKLVSDKVITPQSEIPALYPKTGGNLHCFTALTPCAVLDILSPPYKESVGRSCSYYMDY 272

Query: 242 PRREISGLDQLCGTETG-PSEVTWLDEIQPPENFVVRRGLYKGPIIR 255
           P    +  + +   + G   E  WL +I  P++  +R G Y GP IR
Sbjct: 273 PFSTFALENGMKKVDEGKEDEYAWLVQIDTPDDLHMRPGSYTGPTIR 281

BLAST of CmaCh03G013240.1 vs. TAIR10
Match: AT5G39890.1 (AT5G39890.1 Protein of unknown function (DUF1637))

HSP 1 Score: 146.4 bits (368), Expect = 2.5e-35
Identity = 95/279 (34.05%), Positives = 132/279 (47.31%), Query Frame = 1

Query: 4   VQKLYDACKASF----STSVPVSEEALQKVITLLDELKPSNVGLERESQLARGWKGSLNV 63
           VQKL+D CK  F    S +VP S+E ++ +  +LDE+KP +VG+  +    R       V
Sbjct: 47  VQKLFDTCKKVFADGKSGTVP-SQENIEMLRAVLDEIKPEDVGVNPKMSYFRS-----TV 106

Query: 64  TNGKKGRKGAHQYPPTIQYLHLHECDRFSVNDRIDMLTSGYSPPPAPLVPSLEGTISRCV 123
           T    GR       P + YLH++ C RFS                               
Sbjct: 107 T----GRS------PLVTYLHIYACHRFS------------------------------- 166

Query: 124 QIGIFCMPPGSIIPLHNHPGMTVLSKLLYGALHVRSYDWL-DLPE--------FIDLSQD 183
            I IFC+PP  +IPLHNHP MTV SKLL+G +H++SYDW+ D P+         + +  D
Sbjct: 167 -ICIFCLPPSGVIPLHNHPEMTVFSKLLFGTMHIKSYDWVPDSPQPSSDTRLAKVKVDSD 226

Query: 184 -------------RGGNIHHFKAITPCAIFDILSPPYSSADGRHCSYFRRSPRREISGLD 243
                         GGN+H F A T CA+ D++ PPYS   GRHC+Y+   P    S +D
Sbjct: 227 FTAPCDTSILYPADGGNMHCFTAKTACAVLDVIGPPYSDPAGRHCTYYFDYPFSSFS-VD 276

Query: 244 QLCGTETGPSEVTWLDE-IQPPENFVVRRGLYKGPIIRQ 256
            +   E       WL E  + PE+  V   +Y GP I++
Sbjct: 287 GVVVAEEEKEGYAWLKEREEKPEDLTVTALMYSGPTIKE 276

BLAST of CmaCh03G013240.1 vs. NCBI nr
Match: gi|645237640|ref|XP_008225299.1| (PREDICTED: probable 2-aminoethanethiol dioxygenase [Prunus mume])

HSP 1 Score: 345.9 bits (886), Expect = 6.1e-92
Identity = 171/278 (61.51%), Positives = 197/278 (70.86%), Query Frame = 1

Query: 1   MPIVQKLYDACKASFSTSVPVSEEALQKVITLLDELKPSNVGLERESQLARGWKGSLNVT 60
           MP+VQKLY+ACK SFS++ P+SEE L+KV  +LDELK SNVGLE+E+QLARGWK S++  
Sbjct: 1   MPVVQKLYNACKGSFSSTGPISEEDLEKVRAILDELKASNVGLEQEAQLARGWKHSIHGG 60

Query: 61  NGKKGRKGAHQYPPTIQYLHLHECDRFSVNDRIDMLTSGYSPPPAPLVPSLEGTISRCVQ 120
           NG+KGR G HQYPP I+YLHLHECDRFS                                
Sbjct: 61  NGRKGRNGTHQYPPAIKYLHLHECDRFS-------------------------------- 120

Query: 121 IGIFCMPPGSIIPLHNHPGMTVLSKLLYGALHVRSYDWLDLPEFIDLSQDR--------- 180
           IGIFCMPPGSIIPLHNHPGMTVLSKLLYG+LHVRSYDWLDLPE  D+S+ R         
Sbjct: 121 IGIFCMPPGSIIPLHNHPGMTVLSKLLYGSLHVRSYDWLDLPECSDVSKARPAKLVRDCE 180

Query: 181 --------------GGNIHHFKAITPCAIFDILSPPYSSADGRHCSYFRRSPRREISGLD 240
                         GGNIH FKA+T CA+FDILSPPYSS DGRHCSYFR+SPR ++  L+
Sbjct: 181 MSAPCGTTVLYPNSGGNIHCFKALTSCALFDILSPPYSSEDGRHCSYFRKSPRVDLPSLE 240

Query: 241 QLCGTETGPSEVTWLDEIQPPENFVVRRGLYKGPIIRQ 256
           +LCG E  PSEV WL+EIQPPENFVVRRG+YKGP IR+
Sbjct: 241 ELCGAE--PSEVAWLEEIQPPENFVVRRGVYKGPTIRK 244

BLAST of CmaCh03G013240.1 vs. NCBI nr
Match: gi|225443784|ref|XP_002272019.1| (PREDICTED: 2-aminoethanethiol dioxygenase [Vitis vinifera])

HSP 1 Score: 345.1 bits (884), Expect = 1.0e-91
Identity = 171/277 (61.73%), Positives = 195/277 (70.40%), Query Frame = 1

Query: 1   MPIVQKLYDACKASFSTSVPVSEEALQKVITLLDELKPSNVGLERESQLARGWKGSLNVT 60
           MP+VQKLY+ACK SFS   P+SEEAL KV ++LD++KPSNVGLE+E+QLARGWKGS++  
Sbjct: 1   MPVVQKLYNACKESFSVDGPLSEEALGKVRSILDDMKPSNVGLEQEAQLARGWKGSMHGA 60

Query: 61  NGKKGRKGAHQYPPTIQYLHLHECDRFSVNDRIDMLTSGYSPPPAPLVPSLEGTISRCVQ 120
           NGKK R G+HQYPP I+YLHLHECDRFS                                
Sbjct: 61  NGKKVRNGSHQYPPPIKYLHLHECDRFS-------------------------------- 120

Query: 121 IGIFCMPPGSIIPLHNHPGMTVLSKLLYGALHVRSYDWLDLPEFIDLSQDR--------- 180
           IGIFCMPP SIIPLHNHPGMTVLSKLLYG LHV+SYDWLDLP   DLSQ R         
Sbjct: 121 IGIFCMPPSSIIPLHNHPGMTVLSKLLYGTLHVKSYDWLDLPGTADLSQARPAKLVRDCE 180

Query: 181 --------------GGNIHHFKAITPCAIFDILSPPYSSADGRHCSYFRRSPRREISGLD 240
                         GGNIH FKAITPCA+FD+LSPPYSS DGRHCSYFR+SPR+++ G+D
Sbjct: 181 MSAPCGTTILYPTNGGNIHCFKAITPCALFDVLSPPYSSEDGRHCSYFRKSPRKDLPGID 240

Query: 241 QLCGTETGPSEVTWLDEIQPPENFVVRRGLYKGPIIR 255
           QLCG +  PSEV WL+EIQPPEN VV RG Y+GPIIR
Sbjct: 241 QLCGIK--PSEVVWLEEIQPPENVVVLRGQYEGPIIR 243

BLAST of CmaCh03G013240.1 vs. NCBI nr
Match: gi|951038227|ref|XP_014517109.1| (PREDICTED: plant cysteine oxidase 5-like [Vigna radiata var. radiata])

HSP 1 Score: 339.0 bits (868), Expect = 7.4e-90
Identity = 171/280 (61.07%), Positives = 195/280 (69.64%), Query Frame = 1

Query: 1   MPIVQKLYDACKASFSTSVPVSEEALQKVITLLDELKPSNVGLERESQLARGWKGSLNVT 60
           MPIVQKLYD CKAS S   P+SEEAL+KV  LLD+LKPSNVGLE+E+QL RGWKGSLN T
Sbjct: 1   MPIVQKLYDTCKASLSPEGPISEEALEKVRALLDDLKPSNVGLEQEAQLVRGWKGSLNGT 60

Query: 61  NGKKGRKGAHQYPPTIQYLHLHECDRFSVNDRIDMLTSGYSPPPAPLVPSLEGTISRCVQ 120
           NGKKGR G++ YPP I+Y+HLHECD+FS                                
Sbjct: 61  NGKKGRNGSYLYPPPIKYIHLHECDKFS-------------------------------- 120

Query: 121 IGIFCMPPGSIIPLHNHPGMTVLSKLLYGALHVRSYDWLDLPEFIDLSQ----------- 180
           +GIFCM PGSIIPLHNHPGMTVLSKLLYG+LHVRSYDWLDLP   D SQ           
Sbjct: 121 MGIFCMAPGSIIPLHNHPGMTVLSKLLYGSLHVRSYDWLDLPGSDDPSQARPAKLVKDCQ 180

Query: 181 ------------DRGGNIHHFKAITPCAIFDILSPPYSSADGRHCSYFRRSPRREISG-- 240
                       ++GGNIH FKA+TPCA+FDILSPPYSS DGRHCSYFR+SPR+E+ G  
Sbjct: 181 MSAPCNTTILHPNKGGNIHCFKALTPCALFDILSPPYSSEDGRHCSYFRKSPRKELPGVD 240

Query: 241 LDQLCGTETGPSEVTWLDEIQPPENFVVRRGLYKGPIIRQ 256
           LD+ CG +   SEVTWL+EIQ PEN VVRRGLY+GP IR+
Sbjct: 241 LDECCGVKA--SEVTWLEEIQAPENLVVRRGLYRGPTIRR 246

BLAST of CmaCh03G013240.1 vs. NCBI nr
Match: gi|593695079|ref|XP_007148038.1| (hypothetical protein PHAVU_006G175200g [Phaseolus vulgaris])

HSP 1 Score: 338.2 bits (866), Expect = 1.3e-89
Identity = 172/280 (61.43%), Positives = 193/280 (68.93%), Query Frame = 1

Query: 1   MPIVQKLYDACKASFSTSVPVSEEALQKVITLLDELKPSNVGLERESQLARGWKGSLNVT 60
           MPIVQKLYD CKAS S   P+SEEAL+KV  LLDELKPSNVGLE+E+QL RGWKGSLN T
Sbjct: 1   MPIVQKLYDTCKASLSPEGPISEEALKKVQALLDELKPSNVGLEQEAQLVRGWKGSLNGT 60

Query: 61  NGKKGRKGAHQYPPTIQYLHLHECDRFSVNDRIDMLTSGYSPPPAPLVPSLEGTISRCVQ 120
           NGKKG  G++QYPP I+Y+HLHECD+FS                                
Sbjct: 61  NGKKGPNGSYQYPPPIKYIHLHECDKFS-------------------------------- 120

Query: 121 IGIFCMPPGSIIPLHNHPGMTVLSKLLYGALHVRSYDWLDLPEFIDLSQDR--------- 180
           +GIFCM PGSIIPLHNHPGMTVLSKLLYG+ HVR+YDWLDLP   D SQ R         
Sbjct: 121 MGIFCMAPGSIIPLHNHPGMTVLSKLLYGSFHVRAYDWLDLPGSDDSSQARPAKLVKDCQ 180

Query: 181 --------------GGNIHHFKAITPCAIFDILSPPYSSADGRHCSYFRRSPRREISG-- 240
                         GGNIH FKA+TPCA+FDILSPPYSS DGRHCSYFR+SPR+E+ G  
Sbjct: 181 MSAPCNTTILYPSKGGNIHCFKALTPCALFDILSPPYSSEDGRHCSYFRKSPRKELPGVD 240

Query: 241 LDQLCGTETGPSEVTWLDEIQPPENFVVRRGLYKGPIIRQ 256
           LDQ CG +   SEVTWL+EIQ PEN VVRRGLY+GP IR+
Sbjct: 241 LDQSCGVKA--SEVTWLEEIQAPENLVVRRGLYRGPTIRR 246

BLAST of CmaCh03G013240.1 vs. NCBI nr
Match: gi|359807407|ref|NP_001240875.1| (uncharacterized protein LOC100777850 [Glycine max])

HSP 1 Score: 337.0 bits (863), Expect = 2.8e-89
Identity = 171/280 (61.07%), Positives = 196/280 (70.00%), Query Frame = 1

Query: 1   MPIVQKLYDACKASFSTSVPVSEEALQKVITLLDELKPSNVGLERESQLARGWKGSLNVT 60
           MPIVQKLYD CKAS S   P+SEEAL+KV TLLDELKPSNVGLE+E+QL RGWKGSLN T
Sbjct: 1   MPIVQKLYDTCKASLSPEGPISEEALEKVRTLLDELKPSNVGLEQEAQLVRGWKGSLNGT 60

Query: 61  NGKKGRKGAHQYPPTIQYLHLHECDRFSVNDRIDMLTSGYSPPPAPLVPSLEGTISRCVQ 120
           NGKKGR G++QYPP+I+Y+HLHECD+FS                                
Sbjct: 61  NGKKGRNGSYQYPPSIKYIHLHECDKFS-------------------------------- 120

Query: 121 IGIFCMPPGSIIPLHNHPGMTVLSKLLYGALHVRSYDWLDLPEFIDLSQDR--------- 180
           +GIFCM PGS+IPLHNHPGMTVLSKLLYG+L VRSYDWLDLP   D SQ R         
Sbjct: 121 MGIFCMSPGSVIPLHNHPGMTVLSKLLYGSLLVRSYDWLDLPGPDDPSQARPAKLVKDCQ 180

Query: 181 --------------GGNIHHFKAITPCAIFDILSPPYSSADGRHCSYFRRSPRREISG-- 240
                         GGNIH FKA+TPCA+FD+LSPPYSS DGRHCSYFR+S R+++ G  
Sbjct: 181 MSAPCNTTVLYPSKGGNIHCFKALTPCALFDVLSPPYSSEDGRHCSYFRKSTRKDLPGVE 240

Query: 241 LDQLCGTETGPSEVTWLDEIQPPENFVVRRGLYKGPIIRQ 256
           LDQL G +  PSE+TWL+EIQ PEN VVRRG+YKGP IR+
Sbjct: 241 LDQLSGVK--PSEITWLEEIQAPENLVVRRGVYKGPTIRR 246
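The percentages on each HSP summary line follow directly from the matched-residue counts over the alignment length. As a minimal sketch (the function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, not part of any BLAST tool), the counts can be parsed and the percentages recomputed like this:

```python
import re

# Matches the per-HSP summary line of the report.
# "Posi?tives" tolerates both the "Postives" and "Positives" spellings.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return recomputed and reported identity/positives percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    id_n, id_len, id_rep, pos_n, pos_len, pos_rep = m.groups()
    return {
        "identity_pct": 100 * int(id_n) / int(id_len),
        "identity_reported": float(id_rep),
        "positives_pct": 100 * int(pos_n) / int(pos_len),
        "positives_reported": float(pos_rep),
    }


if __name__ == "__main__":
    line = "Identity = 171/280 (61.07%), Positives = 196/280 (70.00%), Query Frame = 1"
    stats = parse_hsp_stats(line)
    # Compare the recomputed value (rounded to 2 decimals) with the reported one.
    print(round(stats["identity_pct"], 2), stats["identity_reported"])
    print(round(stats["positives_pct"], 2), stats["positives_reported"])
```

Recomputing the values this way is a quick consistency check when the report text has been copied or reflowed.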

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
PCO5_ARATH	2.2e-65	48.74	Plant cysteine oxidase 5 OS=Arabidopsis thaliana GN=PCO5 PE=1 SV=1
PCO4_ARATH	2.2e-65	48.00	Plant cysteine oxidase 4 OS=Arabidopsis thaliana GN=PCO4 PE=1 SV=2
PCO1_ARATH	1.7e-38	34.78	Plant cysteine oxidase 1 OS=Arabidopsis thaliana GN=PCO1 PE=1 SV=1
PCO3_ARATH	4.3e-37	34.49	Plant cysteine oxidase 3 OS=Arabidopsis thaliana GN=PCO3 PE=1 SV=1
PCO2_ARATH	4.4e-34	34.05	Plant cysteine oxidase 2 OS=Arabidopsis thaliana GN=PCO2 PE=1 SV=1

Match Name	E-value	Identity	Description
D7TJM3_VITVI	7.2e-92	61.73	Putative uncharacterized protein OS=Vitis vinifera GN=VIT_10s0003g01390 PE=4 SV=...
V7BPY1_PHAVU	8.8e-90	61.43	Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_006G175200g PE=4 SV=1
C6TH33_SOYBN	2.0e-89	61.07	Uncharacterized protein OS=Glycine max GN=GLYMA_15G083700 PE=2 SV=1
A0A0B2R3A7_GLYSO	2.0e-89	61.07	2-aminoethanethiol dioxygenase OS=Glycine soja GN=glysoja_040484 PE=4 SV=1
I1MET1_SOYBN	2.6e-89	59.79	Uncharacterized protein OS=Glycine max GN=GLYMA_15G083700 PE=4 SV=1

Match Name	E-value	Identity	Description
AT3G58670.1	1.2e-66	48.74	Protein of unknown function (DUF1637)
AT2G42670.2	2.7e-66	47.46	Protein of unknown function (DUF1637)
AT5G15120.1	9.8e-40	34.78	Protein of unknown function (DUF1637)
AT1G18490.1	2.4e-38	34.49	Protein of unknown function (DUF1637)
AT5G39890.1	2.5e-35	34.05	Protein of unknown function (DUF1637)

Match Name	E-value	Identity	Description
gi|645237640|ref|XP_008225299.1|	6.1e-92	61.51	PREDICTED: probable 2-aminoethanethiol dioxygenase [Prunus mume]
gi|225443784|ref|XP_002272019.1|	1.0e-91	61.73	PREDICTED: 2-aminoethanethiol dioxygenase [Vitis vinifera]
gi|951038227|ref|XP_014517109.1|	7.4e-90	61.07	PREDICTED: plant cysteine oxidase 5-like [Vigna radiata var. radiata]
gi|593695079|ref|XP_007148038.1|	1.3e-89	61.43	hypothetical protein PHAVU_006G175200g [Phaseolus vulgaris]
gi|359807407|ref|NP_001240875.1|	2.8e-89	61.07	uncharacterized protein LOC100777850 [Glycine max]

The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term	Definition
IPR011051	RmlC_Cupin_sf
IPR012864	PCO/ADO
IPR014710	RmlC-like_jellyroll
Vocabulary: Molecular Function
Term	Definition
GO:0016702	oxidoreductase activity, acting on single donors with incorporation of molecular oxygen, incorporation of two atoms of oxygen
Vocabulary: Biological Process
Term	Definition
GO:0055114	oxidation-reduction process
GO Assignments
This mRNA is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0055114	oxidation-reduction process
cellular_component	GO:0005575	cellular_component
molecular_function	GO:0016702	oxidoreductase activity, acting on single donors with incorporation of molecular oxygen, incorporation of two atoms of oxygen

This mRNA is a part of the following gene feature(s):

Feature Name	Unique Name	Type
CmaCh03G013240	CmaCh03G013240	gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name	Unique Name	Type
CmaCh03G013240.1	CmaCh03G013240.1-protein	polypeptide


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name	Unique Name	Type
CmaCh03G013240.1.three_prime_UTR.1	CmaCh03G013240.1.three_prime_UTR.1	three_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name	Unique Name	Type
CmaCh03G013240.1.CDS.4	CmaCh03G013240.1.CDS.4	CDS
CmaCh03G013240.1.CDS.3	CmaCh03G013240.1.CDS.3	CDS
CmaCh03G013240.1.CDS.2	CmaCh03G013240.1.CDS.2	CDS
CmaCh03G013240.1.CDS.1	CmaCh03G013240.1.CDS.1	CDS


The following exon feature(s) are a part of this mRNA:

Feature Name	Unique Name	Type
CmaCh03G013240.1.exon.4	CmaCh03G013240.1.exon.4	exon
CmaCh03G013240.1.exon.3	CmaCh03G013240.1.exon.3	exon
CmaCh03G013240.1.exon.2	CmaCh03G013240.1.exon.2	exon
CmaCh03G013240.1.exon.1	CmaCh03G013240.1.exon.1	exon


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR011051	RmlC-like cupin domain	unknown	SSF51182	RmlC-like cupins	coord: 99..208, score: 1.53E-22; coord: 4..45, score: 1.53
IPR012864	Cysteine oxygenase/2-aminoethanethiol dioxygenase	PFAM	PF07847	DUF1637	coord: 170..252, score: 2.2E-23; coord: 118..162, score: 1.2E-20; coord: 32..89, score: 1.
IPR014710	RmlC-like jelly roll fold	GENE3D	G3DSA:2.60.120.10		coord: 119..194, score: 3.1
None	No IPR available	PANTHER	PTHR22966	UNCHARACTERIZED	coord: 121..236, score: 2.0E-88; coord: 29..88, score: 2.0
None	No IPR available	PANTHER	PTHR22966:SF3	SUBFAMILY NOT NAMED	coord: 29..88, score: 2.0E-88; coord: 121..236, score: 2.0
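
Each alignment entry in the InterPro table gives a match as `coord: start..end` in residue positions on the protein. A minimal sketch, assuming only that text layout (the helper names `parse_coords` and `span_length` are hypothetical), extracts those intervals and their lengths:

```python
import re

# One "coord: start..end" interval as printed in the Alignment column.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Return every (start, end) pair found in an alignment string."""
    return [(int(a), int(b)) for a, b in COORD.findall(text)]


def span_length(start, end):
    """Length of an inclusive, 1-based residue interval."""
    return end - start + 1


if __name__ == "__main__":
    # Alignment text for the SSF51182 row, copied from the table.
    alignment = "coord: 99..208 score: 1.53E-22 coord: 4..45"
    for start, end in parse_coords(alignment):
        print(start, end, span_length(start, end))
```

The score fields are deliberately left unparsed here, since several of them appear truncated in the table.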