Cp4.1LG10g06940 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG10g06940
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: 2-aminoethanethiol dioxygenase
Location: Cp4.1LG10 : 6229023 .. 6230927 (+)
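
The location string above can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (usual for records of this kind, but not stated on this page); the function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("Cp4.1LG10 : 6229023 .. 6230927 (+)")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(seqid, strand, span)
```

Under that assumption the gene spans 1905 positions on the forward strand of Cp4.1LG10.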
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGACGGAAACGAGAATGGTGGAACAGAGGAGTAACAGAGTAGGGCATATGAGTAAGGTTCAGTATGTGAAAAGGGATATCAAAAAGAGGAAATGTAGAGATATCAAGCGGCCTGTTCCAGTGGTTCCTATGACGCTTCAGGAACTGTTCGTTGCTTGCAAGGAAGTCTTCAAAGGCCCTGGAACTGTTCCATTGCCTTGTGAAGTTGAAAACCTCTGTCGCATTCTTGGTATGTTTCAGATCGAATCCATTTTACATTACTTTTTTCTTGCCTTTCGTTAGGAATCACGAATCTCTGTAATGTCATGATGTTGTCCCGTTTGAATTCCCTCGAACAATGTACACTATAGAACCTCTCCGGAGGTCTATGGAGCCCTCGAACAGCCTCCCCTTTGTTCGACACTTGAGTCACTTTTGACTACCCATTCGAGGCTCACAACTTCTTCATTCGACATTTGAGGATTCTATTGACCTGACCAAGTTAAGGGTATGACTCTAATATCATGTTAGGAATCACGACTCTCTACAATGACATGATATTGTCCACTTTGAGCGTAAGCTCTCGTAGCTTTGCTTTGGACTTCTCCAAAAGGCCTCGTACCGATCGACATTAGCTTAGTTTGGCCATGTTAGATCCTTTGATTGCCTTGTCTTTTAAGCAAATTTATTGTTATTGTCACTTCTATGTTCTTGAACTGTTCATAGCCTATATAAATTTGAGTATTGACCAAATTCTTTTTGTTTTGTGAATGCTGTAGATAATATGAAGTTAGAAGATGTTGGACTTAGTAATAATTTACCGTTTTTCAAGCACAATGTTCCAGTCAAAGGATCCCCTAGAGTAATATACACAACCATATATAAGTGTGACGAATTCTCGGTAAGCCTCGTGTAGCGCCTTTCGTTAGTTCCCGAATACTTTTCTTCTAGTGATTGTTGTTATATTCAAAAGGGTCAAGCTTATATACGCCATAAGCCTAGTCAAATTGCTATAAATCACGAGATTACTCGAGTGCGAGATATATTAGTCAAAGTGCTCGAGGTATATAGGGTTGGCTGATATCATCGATCAAAATGTTAGAGGCTTGAATCTCTTAGGGGCAAGTCATGAATTGTGAAAATATTCAAGGTGCATGTAAATCGAGATCATACTGTATATTTCGGGTTTTCGGGCTTGCTTCTTGATAGCGTGTTTTTCCCGTCGTTTATGCAGTTGTGCATCTTCTTCCTCCCCGAGACCGGCGTAATCCCTCTACACAACCATCCCGGAATGACGGTTTTTAGTAAGCTTCTAATGGGGAAGATGCATATCAAATCATATGATTGGGTTGATCCAACCAACAGTGATGATCCTACCAAACCTTGTGAAAGTGAGCATTCTCAAGTTTCTTATGAAATTATTGAGCTTGAAGTGATGATGGGTTGTTTTTGCAGAGAGATTAGCTAAGTTGAAAACTGATTCGGTCTTCACTTCGCCCTCGAGTACCTCTGTCTTGTACCCGACATCAGGAGGCAACATCCACTCGTTCACTGCTATAACCCCGTGTGCGGTGCTTGATGTGCTTGGACCTCCTTACTCCATGGACGACGATCGAGATTGTTCATATTATAAGGAGCATCCCTATGGCTCTTTTTCAAGTACGTTCCAAATCATTTTCATGAACCATCGTAAAAAAGTCTTTCCAAATATTGTAACGAAATACGAATATCGATATGCTACGACATTTACAATATGACTGTGACTCTCGTAACGGTTTCGTATCGGCTTCGATGGCAGATGGTGGCATGGCAATGACAGAAGAAGAGGCTAAGGGATATGGATGGTTAGAAGAGATTGAGATGCCAGAAAACTCTCGCATGGATGGCATACAATACTTAGGCCCACAGATTATGGACATTTAA

mRNA sequence

ATGGCGACGGAAACGAGAATGGTGGAACAGAGGAGTAACAGAGTAGGGCATATGAGTAAGGTTCAGTATGTGAAAAGGGATATCAAAAAGAGGAAATGTAGAGATATCAAGCGGCCTGTTCCAGTGGTTCCTATGACGCTTCAGGAACTGTTCGTTGCTTGCAAGGAAGTCTTCAAAGGCCCTGGAACTGTTCCATTGCCTTGTGAAGTTGAAAACCTCTATAATATGAAGTTAGAAGATGTTGGACTTAGTAATAATTTACCGTTTTTCAAGCACAATGTTCCAGTCAAAGGATCCCCTAGAGTAATATACACAACCATATATAAGTGTGACGAATTCTCGTTGTGCATCTTCTTCCTCCCCGAGACCGGCGTAATCCCTCTACACAACCATCCCGGAATGACGGTTTTTAGTAAGCTTCTAATGGGGAAGATGCATATCAAATCATATGATTGGGTTGATCCAACCAACAGTGATGATCCTACCAAACCTTGTGAAAAGAGATTAGCTAAGTTGAAAACTGATTCGGTCTTCACTTCGCCCTCGAGTACCTCTGTCTTGTACCCGACATCAGGAGGCAACATCCACTCGTTCACTGCTATAACCCCGTGTGCGGTGCTTGATGTGCTTGGACCTCCTTACTCCATGGACGACGATCGAGATTGTTCATATTATAAGGAGCATCCCTATGGCTCTTTTTCAAATGGTGGCATGGCAATGACAGAAGAAGAGGCTAAGGGATATGGATGGTTAGAAGAGATTGAGATGCCAGAAAACTCTCGCATGGATGGCATACAATACTTAGGCCCACAGATTATGGACATTTAA

Coding sequence (CDS)

ATGGCGACGGAAACGAGAATGGTGGAACAGAGGAGTAACAGAGTAGGGCATATGAGTAAGGTTCAGTATGTGAAAAGGGATATCAAAAAGAGGAAATGTAGAGATATCAAGCGGCCTGTTCCAGTGGTTCCTATGACGCTTCAGGAACTGTTCGTTGCTTGCAAGGAAGTCTTCAAAGGCCCTGGAACTGTTCCATTGCCTTGTGAAGTTGAAAACCTCTATAATATGAAGTTAGAAGATGTTGGACTTAGTAATAATTTACCGTTTTTCAAGCACAATGTTCCAGTCAAAGGATCCCCTAGAGTAATATACACAACCATATATAAGTGTGACGAATTCTCGTTGTGCATCTTCTTCCTCCCCGAGACCGGCGTAATCCCTCTACACAACCATCCCGGAATGACGGTTTTTAGTAAGCTTCTAATGGGGAAGATGCATATCAAATCATATGATTGGGTTGATCCAACCAACAGTGATGATCCTACCAAACCTTGTGAAAAGAGATTAGCTAAGTTGAAAACTGATTCGGTCTTCACTTCGCCCTCGAGTACCTCTGTCTTGTACCCGACATCAGGAGGCAACATCCACTCGTTCACTGCTATAACCCCGTGTGCGGTGCTTGATGTGCTTGGACCTCCTTACTCCATGGACGACGATCGAGATTGTTCATATTATAAGGAGCATCCCTATGGCTCTTTTTCAAATGGTGGCATGGCAATGACAGAAGAAGAGGCTAAGGGATATGGATGGTTAGAAGAGATTGAGATGCCAGAAAACTCTCGCATGGATGGCATACAATACTTAGGCCCACAGATTATGGACATTTAA

Protein sequence

MATETRMVEQRSNRVGHMSKVQYVKRDIKKRKCRDIKRPVPVVPMTLQELFVACKEVFKGPGTVPLPCEVENLYNMKLEDVGLSNNLPFFKHNVPVKGSPRVIYTTIYKCDEFSLCIFFLPETGVIPLHNHPGMTVFSKLLMGKMHIKSYDWVDPTNSDDPTKPCEKRLAKLKTDSVFTSPSSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMDDDRDCSYYKEHPYGSFSNGGMAMTEEEAKGYGWLEEIEMPENSRMDGIQYLGPQIMDI
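
As a consistency check, the coding sequence can be translated with the standard genetic code and compared with the protein sequence above. The sketch below is illustrative only: `translate` is a hypothetical helper, and it is applied to the first 21 and last 15 nucleotides of the CDS rather than to the whole sequence.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[pos:pos + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGCGACGGAAACGAGAATG"  # first 21 nt of the CDS listed above
cds_end = "ATTATGGACATTTAA"          # last 15 nt of the CDS, ending in the TAA stop

print(translate(cds_start))  # N-terminal residues
print(translate(cds_end))    # C-terminal residues
```

The two fragments translate to MATETRM and IMDI, which agree with the start and end of the protein sequence shown above.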
BLAST of Cp4.1LG10g06940 vs. Swiss-Prot
Match: PCO1_ARATH (Plant cysteine oxidase 1 OS=Arabidopsis thaliana GN=PCO1 PE=1 SV=1)

HSP 1 Score: 229.6 bits (584), Expect = 4.2e-59
Identity = 118/240 (49.17%), Positives = 158/240 (65.83%), Query Frame = 1

Query: 47  LQELFVACKEVFK--GPGTVPLPCEVENLY----NMKLEDVGLSNNLPFFKHN--VPVKG 106
           ++ LF  CKEVF   GPG +P   +++ L     +MK EDVGL+  +P+F+ N  V  + 
Sbjct: 58  VRRLFNTCKEVFSNGGPGVIPSEDKIQQLREILDDMKPEDVGLTPTMPYFRPNSGVEARS 117

Query: 107 SPRVIYTTIYKCDEFSLCIFFLPETGVIPLHNHPGMTVFSKLLMGKMHIKSYDWVDPTNS 166
           SP + Y  +++CD+FS+ IF LP +GVIPLHNHPGMTVFSKLL G MHIKSYDWV     
Sbjct: 118 SPPITYLHLHQCDQFSIGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKSYDWV----V 177

Query: 167 DDPTKPCEKRLAKLKTDSVFTSPSSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMDD 226
           D P +  + RLAKLK DS FT+P + S+LYP  GGN+H FTAIT CAVLDVLGPPY   +
Sbjct: 178 DAPMRDSKTRLAKLKVDSTFTAPCNASILYPEDGGNMHRFTAITACAVLDVLGPPYCNPE 237

Query: 227 DRDCSYYKEHPYGSFS--NGGMAMTEEEAKGYGWLEEIE--MPENSRMDGIQYLGPQIMD 275
            R C+Y+ E P    S  +  +  +EEE +GY WL+E +    +++ + G  Y GP++ D
Sbjct: 238 GRHCTYFLEFPLDKLSSEDDDVLSSEEEKEGYAWLQERDDNPEDHTNVVGALYRGPKVED 293
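
The percentages in each HSP header are simply the listed fraction expressed as a percentage. The following sketch recomputes them from the first Swiss-Prot hit; `hsp_stats` is a hypothetical helper, and the regular expression captures whatever label precedes each fraction, so it does not depend on how the label is spelled.

```python
import re

# Matches fragments such as "Identity = 118/240 (49.17%)".
HSP_STAT = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")

def hsp_stats(line):
    """Return {label: (matches, alignment_length, reported_percent)}."""
    return {
        name: (int(num), int(den), float(pct))
        for name, num, den, pct in HSP_STAT.findall(line)
    }

line = "Identity = 118/240 (49.17%), Positives = 158/240 (65.83%), Query Frame = 1"
stats = hsp_stats(line)

# Recompute each percentage from its fraction, rounded to two decimals.
recomputed = {name: round(100 * n / d, 2) for name, (n, d, _) in stats.items()}
print(recomputed)
```

Here 118 identical positions over an alignment length of 240 give 49.17%, and 158 conservative-or-identical positions give 65.83%, matching the reported values.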

BLAST of Cp4.1LG10g06940 vs. Swiss-Prot
Match: PCO2_ARATH (Plant cysteine oxidase 2 OS=Arabidopsis thaliana GN=PCO2 PE=1 SV=1)

HSP 1 Score: 229.2 bits (583), Expect = 5.5e-59
Identity = 125/257 (48.64%), Positives = 161/257 (62.65%), Query Frame = 1

Query: 25  KRDIKKRKCRDIKRPVPVVPMTLQELFVACKEVFKG--PGTVPLPCEVENLY----NMKL 84
           ++ I++R  + +  PV       Q+LF  CK+VF     GTVP    +E L      +K 
Sbjct: 32  RKKIQRRSKKTLICPV-------QKLFDTCKKVFADGKSGTVPSQENIEMLRAVLDEIKP 91

Query: 85  EDVGLSNNLPFFKHNVPVKGSPRVIYTTIYKCDEFSLCIFFLPETGVIPLHNHPGMTVFS 144
           EDVG++  + +F+  V  + SP V Y  IY C  FS+CIF LP +GVIPLHNHP MTVFS
Sbjct: 92  EDVGVNPKMSYFRSTVTGR-SPLVTYLHIYACHRFSICIFCLPPSGVIPLHNHPEMTVFS 151

Query: 145 KLLMGKMHIKSYDWVDPTNSDDPTKPCEKRLAKLKTDSVFTSPSSTSVLYPTSGGNIHSF 204
           KLL G MHIKSYDWV     D P    + RLAK+K DS FT+P  TS+LYP  GGN+H F
Sbjct: 152 KLLFGTMHIKSYDWVP----DSPQPSSDTRLAKVKVDSDFTAPCDTSILYPADGGNMHCF 211

Query: 205 TAITPCAVLDVLGPPYSMDDDRDCSYYKEHPYGSFSNGGMAMTEEEAKGYGWLEE-IEMP 264
           TA T CAVLDV+GPPYS    R C+YY ++P+ SFS  G+ + EEE +GY WL+E  E P
Sbjct: 212 TAKTACAVLDVIGPPYSDPAGRHCTYYFDYPFSSFSVDGVVVAEEEKEGYAWLKEREEKP 271

Query: 265 ENSRMDGIQYLGPQIMD 275
           E+  +  + Y GP I +
Sbjct: 272 EDLTVTALMYSGPTIKE 276

BLAST of Cp4.1LG10g06940 vs. Swiss-Prot
Match: PCO3_ARATH (Plant cysteine oxidase 3 OS=Arabidopsis thaliana GN=PCO3 PE=1 SV=1)

HSP 1 Score: 183.7 bits (465), Expect = 2.7e-45
Identity = 97/246 (39.43%), Positives = 132/246 (53.66%), Query Frame = 1

Query: 47  LQELFVACKEVFKGPGTVPLPCEVENLYNM----KLEDVGLSNNLPFFKHNVPVKGSPR- 106
           +QEL+  CKE F G    P    ++ L ++       DVGL            V G  R 
Sbjct: 35  VQELYDLCKETFTGKAPSPASMAIQKLCSVLDSVSPADVGLEEVSQDDDRGYGVSGVSRF 94

Query: 107 ---------VIYTTIYKCDEFSLCIFFLPETGVIPLHNHPGMTVFSKLLMGKMHIKSYDW 166
                    + +  I++CD F++CIF  P + VIPLH+HP M VFSK+L G +H+K+YDW
Sbjct: 95  NRVGRWAQPITFLDIHECDTFTMCIFCFPTSSVIPLHDHPEMAVFSKILYGSLHVKAYDW 154

Query: 167 VDP----TNSDDPTKPCEKRLAKLKTDSVFTSPSSTSVLYPTSGGNIHSFTAITPCAVLD 226
           V+P    T           RLAKL +D V T  S    LYP +GGN+H FTA+TPCAVLD
Sbjct: 155 VEPPCIITQDKGVPGSLPARLAKLVSDKVITPQSEIPALYPKTGGNLHCFTALTPCAVLD 214

Query: 227 VLGPPYSMDDDRDCSYYKEHPYGSFS--NGGMAMTEEEAKGYGWLEEIEMPENSRMDGIQ 273
           +L PPY     R CSYY ++P+ +F+  NG   + E +   Y WL +I+ P++  M    
Sbjct: 215 ILSPPYKESVGRSCSYYMDYPFSTFALENGMKKVDEGKEDEYAWLVQIDTPDDLHMRPGS 274

BLAST of Cp4.1LG10g06940 vs. Swiss-Prot
Match: PCO5_ARATH (Plant cysteine oxidase 5 OS=Arabidopsis thaliana GN=PCO5 PE=1 SV=1)

HSP 1 Score: 177.6 bits (449), Expect = 1.9e-43
Identity = 99/242 (40.91%), Positives = 130/242 (53.72%), Query Frame = 1

Query: 43  VPMTLQELFVACKEVFK--GPGTVPLPCEVEN-LYNMKLEDVGLSNNLPFFKHNVPVKGS 102
           +P  +Q LF  CK      GP +     +V N L  +K  DVGL       + N P  G+
Sbjct: 1   MPYFIQRLFNTCKSSLSPNGPVSEEALDKVRNVLEKIKPSDVGLEQEAQLVR-NWPGPGN 60

Query: 103 ---------PRVIYTTIYKCDEFSLCIFFLPETGVIPLHNHPGMTVFSKLLMGKMHIKSY 162
                    P + Y  +++CD FS+ IF +P   +IPLHNHPGMTV SKL+ G MH+KSY
Sbjct: 61  ERNGNHHSLPAIKYLQLHECDSFSIGIFCMPPGSIIPLHNHPGMTVLSKLVYGSMHVKSY 120

Query: 163 DWVDPTNSDDPTKPCEKRLAKLKTDSVFTSPSSTSVLYPTSGGNIHSFTAITPCAVLDVL 222
           DW +P  S+    P + R AKL  D   TSPS  + LYPT+GGNIH F AIT CA+ D+L
Sbjct: 121 DWAEPDQSE-LDDPLQARPAKLVKDIDMTSPSPATTLYPTTGGNIHCFKAITHCAIFDIL 180

Query: 223 GPPYSMDDDRDCSYYKEHPYGSFSNGGMAMTEEEAKGYGWLEEIEMPENSRMDGIQYLGP 273
            PPYS    R C+Y+++ P          M  E      WLEE + P+N  +  + Y GP
Sbjct: 181 SPPYSSTHGRHCNYFRKSPMLDLPGEIEVMNGEVISNVTWLEEYQPPDNFVIWRVPYRGP 240

BLAST of Cp4.1LG10g06940 vs. Swiss-Prot
Match: PCO4_ARATH (Plant cysteine oxidase 4 OS=Arabidopsis thaliana GN=PCO4 PE=1 SV=2)

HSP 1 Score: 177.6 bits (449), Expect = 1.9e-43
Identity = 98/241 (40.66%), Positives = 136/241 (56.43%), Query Frame = 1

Query: 43  VPMTLQELFVACKEVFK--GPGTVPLPCEVEN-LYNMKLEDVGL--------SNNLPFFK 102
           +P   Q L+  CK  F   GP T     +V N L  +K  DVG+        S + P  +
Sbjct: 1   MPYFAQRLYNTCKASFSSDGPITEDALEKVRNVLEKIKPSDVGIEQDAQLARSRSGPLNE 60

Query: 103 HNVPVKGSPRVIYTTIYKCDEFSLCIFFLPETGVIPLHNHPGMTVFSKLLMGKMHIKSYD 162
            N   +  P + Y  +++CD FS+ IF +P + +IPLHNHPGMTV SKL+ G MH+KSYD
Sbjct: 61  RNGSNQSPPAIKYLHLHECDSFSIGIFCMPPSSMIPLHNHPGMTVLSKLVYGSMHVKSYD 120

Query: 163 WVDPTNSDDPTKPCEKRLAKLKTDSVFTSPSSTSVLYPTSGGNIHSFTAITPCAVLDVLG 222
           W++P    +P  P + R AKL  D+  T+ S  + LYP SGGNIH F AIT CA+LD+L 
Sbjct: 121 WLEP-QLTEPEDPSQARPAKLVKDTEMTAQSPVTTLYPKSGGNIHCFKAITHCAILDILA 180

Query: 223 PPYSMDDDRDCSYYKEHPYGSFSNGGMAMTEEEAKGYGWLEEIEMPENSRMDGIQYLGPQ 273
           PPYS + DR C+Y+++        G + +  E      WLEE + P++  +  I Y GP 
Sbjct: 181 PPYSSEHDRHCTYFRKSRREDLP-GELEVDGEVVTDVTWLEEFQPPDDFVIRRIPYRGPV 239

BLAST of Cp4.1LG10g06940 vs. TrEMBL
Match: A0A0A0KWK6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G188410 PE=4 SV=1)

HSP 1 Score: 468.8 bits (1205), Expect = 4.6e-129
Identity = 223/277 (80.51%), Positives = 244/277 (88.09%), Query Frame = 1

Query: 4   ETRMVEQRSNRVGHMSKVQYVKRDIKKRKCRDIKRPVPVVPMTLQELFVACKEVFKGPGT 63
           ETR V + SNR+GH++KVQYV+RD KKRKCR I+R +PVVPM LQELFV+C+EVFKGPGT
Sbjct: 2   ETRAVNRGSNRIGHVNKVQYVRRDFKKRKCRKIRRSIPVVPMALQELFVSCREVFKGPGT 61

Query: 64  VPLPCEVENLY----NMKLEDVGLSNNLPFFKHNVPVKGSPRVIYTTIYKCDEFSLCIFF 123
           VPLPC+VE L     NMK EDVGLS++L FFK NVPVKGSPRV YTTIYKCD FSLCIFF
Sbjct: 62  VPLPCDVEKLCRILDNMKAEDVGLSSSLQFFKPNVPVKGSPRVTYTTIYKCDNFSLCIFF 121

Query: 124 LPETGVIPLHNHPGMTVFSKLLMGKMHIKSYDWVDPTNSDDPTKPCEKRLAKLKTDSVFT 183
           LP TGVIPLHNHPGMTVFSKLL+GKMHIKSYDWVDPTNSDD  +PCEKRLAKLK D+VFT
Sbjct: 122 LPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCEKRLAKLKADAVFT 181

Query: 184 SPSSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMDDDRDCSYYKEHPYGSFSNGGMA 243
           SP STSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSM+D RDCSYYKEHPY SF NG M 
Sbjct: 182 SPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYASFPNGDMG 241

Query: 244 MTEE-EAKGYGWLEEIEMPENSRMDGIQYLGPQIMDI 276
           + EE + +GYGWLEEIE+PENS MDGI+YLGPQI DI
Sbjct: 242 LGEEDQGEGYGWLEEIEVPENSEMDGIEYLGPQICDI 278

BLAST of Cp4.1LG10g06940 vs. TrEMBL
Match: M5VWU7_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa009537mg PE=4 SV=1)

HSP 1 Score: 368.2 bits (944), Expect = 8.5e-99
Identity = 189/286 (66.08%), Positives = 217/286 (75.87%), Query Frame = 1

Query: 1   MATET-RMVEQRSN-----RVGHMSKVQY-VKRDIKKRKC-RDIKRPVPVVPMTLQELFV 60
           M  ET R+VEQ  +     RVGH SKV Y V + I KRKC + IK  VP  P  LQ+LFV
Sbjct: 1   MTMETARLVEQGRDHKVVSRVGHASKVGYHVSKAITKRKCGKKIKHSVP--PTVLQQLFV 60

Query: 61  ACKEVFKGPGTVPLPCEVENLYN----MKLEDVGLSNNLPFFKHNVPVKGSPRVIYTTIY 120
           +C++VFKGPGTVP P +V NL +    M+ EDVGLS +L FFK    V+G+PRV YTTIY
Sbjct: 61  SCRQVFKGPGTVPSPHDVHNLCSILDKMRPEDVGLSRDLQFFKPKTVVQGTPRVTYTTIY 120

Query: 121 KCDEFSLCIFFLPETGVIPLHNHPGMTVFSKLLMGKMHIKSYDWVDPTNSDDPTKPCEKR 180
           +C  FSLC  F+P TGVIPLHNHP MTVFSKLL+GKMHIKSYDWVDP NSD  T   + R
Sbjct: 121 ECSNFSLCCLFIPATGVIPLHNHPEMTVFSKLLLGKMHIKSYDWVDPVNSDGSTPAPQLR 180

Query: 181 LAKLKTDSVFTSPSSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMDDDRDCSYYKEH 240
           LAKLK DSVFTSP +TSVLYPT GGNIH+FTAITPCAVLDVLGPPYS +DDRDCSYYK+H
Sbjct: 181 LAKLKADSVFTSPCNTSVLYPTEGGNIHAFTAITPCAVLDVLGPPYSKEDDRDCSYYKDH 240

Query: 241 PYGSFSNGGMAMTEEEAKGYGWLEEIEMPENSRMDGIQYLGPQIMD 275
           PY ++SNG  ++TE     YGWLEEIEMPENS MD I YLGPQ+ +
Sbjct: 241 PYAAYSNGEASVTEGNGDCYGWLEEIEMPENSEMDKIPYLGPQVTE 284

BLAST of Cp4.1LG10g06940 vs. TrEMBL
Match: A0A061F0D3_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_025853 PE=4 SV=1)

HSP 1 Score: 358.6 bits (919), Expect = 6.7e-96
Identity = 181/277 (65.34%), Positives = 215/277 (77.62%), Query Frame = 1

Query: 7   MVEQRSNRVG--HM-SKVQYVKRDIKKRKCRDIKRPVPV-VPMTLQELFVACKEVFKGPG 66
           +VEQR + +   H+ SKV+YV + IKK + +   +PV   VP  L ELFVAC+EVFKGPG
Sbjct: 8   LVEQRKDTMTMRHVNSKVRYVNKPIKKLRRKKRSKPVASRVPRLLPELFVACREVFKGPG 67

Query: 67  TVPLPCEVENLYN----MKLEDVGLSNNLPFFKHNVPVKGSPRVIYTTIYKCDEFSLCIF 126
            VP P +V+ L +    MK EDVGLS NL FFK    V G+PRV YTTIY+CDEFSLCIF
Sbjct: 68  NVPPPSDVDKLCSILDRMKPEDVGLSKNLQFFKARGAVTGTPRVTYTTIYQCDEFSLCIF 127

Query: 127 FLPETGVIPLHNHPGMTVFSKLLMGKMHIKSYDWVDPTNSDDPTKPCEKRLAKLKTDSVF 186
           FLPE  VIPLHNHPGMTVFSKLL+GKMHIKSYDWVDP +S+DP  P + RLA+LK DSVF
Sbjct: 128 FLPEKAVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPVHSEDPVPPSQPRLARLKADSVF 187

Query: 187 TSPSSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMDDDRDCSYYKEHPYGSFSNGGM 246
           T+P  TSVLYPT+GGNIH FTAITPCAVLDVLGPPYS +DDRDCSYY++ P  +F NG  
Sbjct: 188 TAPCDTSVLYPTAGGNIHQFTAITPCAVLDVLGPPYSKEDDRDCSYYRDVPCSAFPNGET 247

Query: 247 AMTEE-EAKGYGWLEEIEMPENSRMDGIQYLGPQIMD 275
            ++EE E   +GWLEEI++PENS+MD I+YLGPQI +
Sbjct: 248 TVSEEVEGDLFGWLEEIQVPENSKMDRIEYLGPQIAE 284

BLAST of Cp4.1LG10g06940 vs. TrEMBL
Match: A0A067KTW4_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_02681 PE=4 SV=1)

HSP 1 Score: 348.6 bits (893), Expect = 7.0e-93
Identity = 168/276 (60.87%), Positives = 212/276 (76.81%), Query Frame = 1

Query: 7   MVEQRSNRVGHMSKVQYVKRDIKKRKCRDI--KRPVPVVPMTLQELFVACKEVFKGPGTV 66
           +V+ R +  GH+ KV Y  R IK+RKC+    KRP   VPM LQEL+++CK+VFKGPGTV
Sbjct: 6   VVDPRRDPAGHVHKVGYANRVIKRRKCKRKINKRPEVKVPMALQELYMSCKQVFKGPGTV 65

Query: 67  PLPCEVENLYN----MKLEDVGLSNNLPFFKHNVPVKGSPRVIYTTIYKCDEFSLCIFFL 126
           P P +VE L +    M+ EDVGLS+ L FFK    VK +PRV  TTIYKCD+FS+CIFFL
Sbjct: 66  PFPHDVERLCHILDKMRPEDVGLSSELQFFKPKPSVKVTPRVTTTTIYKCDKFSICIFFL 125

Query: 127 PETGVIPLHNHPGMTVFSKLLMGKMHIKSYDWVDPTNSDDPTKPCEKRLAKLKTDSVFTS 186
           P + VIPLHNHP MTVF+KLL+GKMHIKSYDW+ P  +++P +P   RLAK+  +SVF +
Sbjct: 126 PASAVIPLHNHPEMTVFNKLLLGKMHIKSYDWISPPVAEEPVQPSNIRLAKMVANSVFEA 185

Query: 187 PSSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMDDDRDCSYYKEHPYGSFSNG-GMA 246
           P +TSVLYPT+GGNIH FTAITPCA LDV GPPYS +D RDCSYYK++PY +F +G    
Sbjct: 186 PCNTSVLYPTTGGNIHQFTAITPCAFLDVFGPPYSKEDGRDCSYYKDYPYDAFPDGEERE 245

Query: 247 MTEEEAKGYGWLEEIEMPENSRMDGIQYLGPQIMDI 276
           + +E+   YGWL+EI+MPENSRMDGI+YLGPQ+++I
Sbjct: 246 VKKEDGDIYGWLQEIDMPENSRMDGIEYLGPQVVEI 281

BLAST of Cp4.1LG10g06940 vs. TrEMBL
Match: A9PDV9_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0013s06040g PE=2 SV=1)

HSP 1 Score: 346.7 bits (888), Expect = 2.7e-92
Identity = 166/271 (61.25%), Positives = 202/271 (74.54%), Query Frame = 1

Query: 8   VEQRSNRVGHMSKVQYVKRDIKKRKCRDIKRPVPVVPMTLQELFVACKEVFKGPGTVPLP 67
           VE R     H++++ + KR  K+++ +  K+  P   M LQ+L+V+CKEVFKGPGTVPL 
Sbjct: 7   VEPRREPTAHVNRLGFAKRPTKRKRSKKTKKCAPT--MALQDLYVSCKEVFKGPGTVPLH 66

Query: 68  CEVENLY----NMKLEDVGLSNNLPFFKHNVPVKGSPRVIYTTIYKCDEFSLCIFFLPET 127
            +V+ L     NMKLED GLS  L FF     V+G+PRV YT +Y+CD+FS+C+FFLP T
Sbjct: 67  QDVKRLCHMLDNMKLEDFGLSCKLEFFNPKAAVRGTPRVTYTIVYECDKFSMCVFFLPAT 126

Query: 128 GVIPLHNHPGMTVFSKLLMGKMHIKSYDWVDPTNSDDPTKPCEKRLAKLKTDSVFTSPSS 187
            VIPLHNHPGMTVFSKLLMG MH+KSYDWVDP  +D+P  P + RLAKL+ DSVFT+P  
Sbjct: 127 AVIPLHNHPGMTVFSKLLMGTMHVKSYDWVDPPATDEPDSPAQVRLAKLEADSVFTAPCH 186

Query: 188 TSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMDDDRDCSYYKEHPYGSFSNGGMAMTEE 247
           TSVLYPT+GGNIH FTAITPCAVLDVLGPPYS +D RDCSYYK+ PY +F NG M   EE
Sbjct: 187 TSVLYPTTGGNIHQFTAITPCAVLDVLGPPYSNEDGRDCSYYKDFPYTAFPNGEMGSEEE 246

Query: 248 EAKGYGWLEEIEMPENSRMDGIQYLGPQIMD 275
           E   Y WLEEI +PEN +M  I+YLGPQ+ D
Sbjct: 247 EGDCYAWLEEITVPENLQMFVIKYLGPQVDD 275

BLAST of Cp4.1LG10g06940 vs. TAIR10
Match: AT5G15120.1 (AT5G15120.1 Protein of unknown function (DUF1637))

HSP 1 Score: 229.6 bits (584), Expect = 2.4e-60
Identity = 118/240 (49.17%), Positives = 158/240 (65.83%), Query Frame = 1

Query: 47  LQELFVACKEVFK--GPGTVPLPCEVENLY----NMKLEDVGLSNNLPFFKHN--VPVKG 106
           ++ LF  CKEVF   GPG +P   +++ L     +MK EDVGL+  +P+F+ N  V  + 
Sbjct: 58  VRRLFNTCKEVFSNGGPGVIPSEDKIQQLREILDDMKPEDVGLTPTMPYFRPNSGVEARS 117

Query: 107 SPRVIYTTIYKCDEFSLCIFFLPETGVIPLHNHPGMTVFSKLLMGKMHIKSYDWVDPTNS 166
           SP + Y  +++CD+FS+ IF LP +GVIPLHNHPGMTVFSKLL G MHIKSYDWV     
Sbjct: 118 SPPITYLHLHQCDQFSIGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKSYDWV----V 177

Query: 167 DDPTKPCEKRLAKLKTDSVFTSPSSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMDD 226
           D P +  + RLAKLK DS FT+P + S+LYP  GGN+H FTAIT CAVLDVLGPPY   +
Sbjct: 178 DAPMRDSKTRLAKLKVDSTFTAPCNASILYPEDGGNMHRFTAITACAVLDVLGPPYCNPE 237

Query: 227 DRDCSYYKEHPYGSFS--NGGMAMTEEEAKGYGWLEEIE--MPENSRMDGIQYLGPQIMD 275
            R C+Y+ E P    S  +  +  +EEE +GY WL+E +    +++ + G  Y GP++ D
Sbjct: 238 GRHCTYFLEFPLDKLSSEDDDVLSSEEEKEGYAWLQERDDNPEDHTNVVGALYRGPKVED 293

BLAST of Cp4.1LG10g06940 vs. TAIR10
Match: AT5G39890.1 (AT5G39890.1 Protein of unknown function (DUF1637))

HSP 1 Score: 229.2 bits (583), Expect = 3.1e-60
Identity = 125/257 (48.64%), Positives = 161/257 (62.65%), Query Frame = 1

Query: 25  KRDIKKRKCRDIKRPVPVVPMTLQELFVACKEVFKG--PGTVPLPCEVENLY----NMKL 84
           ++ I++R  + +  PV       Q+LF  CK+VF     GTVP    +E L      +K 
Sbjct: 32  RKKIQRRSKKTLICPV-------QKLFDTCKKVFADGKSGTVPSQENIEMLRAVLDEIKP 91

Query: 85  EDVGLSNNLPFFKHNVPVKGSPRVIYTTIYKCDEFSLCIFFLPETGVIPLHNHPGMTVFS 144
           EDVG++  + +F+  V  + SP V Y  IY C  FS+CIF LP +GVIPLHNHP MTVFS
Sbjct: 92  EDVGVNPKMSYFRSTVTGR-SPLVTYLHIYACHRFSICIFCLPPSGVIPLHNHPEMTVFS 151

Query: 145 KLLMGKMHIKSYDWVDPTNSDDPTKPCEKRLAKLKTDSVFTSPSSTSVLYPTSGGNIHSF 204
           KLL G MHIKSYDWV     D P    + RLAK+K DS FT+P  TS+LYP  GGN+H F
Sbjct: 152 KLLFGTMHIKSYDWVP----DSPQPSSDTRLAKVKVDSDFTAPCDTSILYPADGGNMHCF 211

Query: 205 TAITPCAVLDVLGPPYSMDDDRDCSYYKEHPYGSFSNGGMAMTEEEAKGYGWLEE-IEMP 264
           TA T CAVLDV+GPPYS    R C+YY ++P+ SFS  G+ + EEE +GY WL+E  E P
Sbjct: 212 TAKTACAVLDVIGPPYSDPAGRHCTYYFDYPFSSFSVDGVVVAEEEKEGYAWLKEREEKP 271

Query: 265 ENSRMDGIQYLGPQIMD 275
           E+  +  + Y GP I +
Sbjct: 272 EDLTVTALMYSGPTIKE 276

BLAST of Cp4.1LG10g06940 vs. TAIR10
Match: AT1G18490.1 (AT1G18490.1 Protein of unknown function (DUF1637))

HSP 1 Score: 183.7 bits (465), Expect = 1.5e-46
Identity = 97/246 (39.43%), Positives = 132/246 (53.66%), Query Frame = 1

Query: 47  LQELFVACKEVFKGPGTVPLPCEVENLYNM----KLEDVGLSNNLPFFKHNVPVKGSPR- 106
           +QEL+  CKE F G    P    ++ L ++       DVGL            V G  R 
Sbjct: 35  VQELYDLCKETFTGKAPSPASMAIQKLCSVLDSVSPADVGLEEVSQDDDRGYGVSGVSRF 94

Query: 107 ---------VIYTTIYKCDEFSLCIFFLPETGVIPLHNHPGMTVFSKLLMGKMHIKSYDW 166
                    + +  I++CD F++CIF  P + VIPLH+HP M VFSK+L G +H+K+YDW
Sbjct: 95  NRVGRWAQPITFLDIHECDTFTMCIFCFPTSSVIPLHDHPEMAVFSKILYGSLHVKAYDW 154

Query: 167 VDP----TNSDDPTKPCEKRLAKLKTDSVFTSPSSTSVLYPTSGGNIHSFTAITPCAVLD 226
           V+P    T           RLAKL +D V T  S    LYP +GGN+H FTA+TPCAVLD
Sbjct: 155 VEPPCIITQDKGVPGSLPARLAKLVSDKVITPQSEIPALYPKTGGNLHCFTALTPCAVLD 214

Query: 227 VLGPPYSMDDDRDCSYYKEHPYGSFS--NGGMAMTEEEAKGYGWLEEIEMPENSRMDGIQ 273
           +L PPY     R CSYY ++P+ +F+  NG   + E +   Y WL +I+ P++  M    
Sbjct: 215 ILSPPYKESVGRSCSYYMDYPFSTFALENGMKKVDEGKEDEYAWLVQIDTPDDLHMRPGS 274

BLAST of Cp4.1LG10g06940 vs. TAIR10
Match: AT3G58670.1 (AT3G58670.1 Protein of unknown function (DUF1637))

HSP 1 Score: 177.6 bits (449), Expect = 1.1e-44
Identity = 99/242 (40.91%), Positives = 130/242 (53.72%), Query Frame = 1

Query: 43  VPMTLQELFVACKEVFK--GPGTVPLPCEVEN-LYNMKLEDVGLSNNLPFFKHNVPVKGS 102
           +P  +Q LF  CK      GP +     +V N L  +K  DVGL       + N P  G+
Sbjct: 1   MPYFIQRLFNTCKSSLSPNGPVSEEALDKVRNVLEKIKPSDVGLEQEAQLVR-NWPGPGN 60

Query: 103 ---------PRVIYTTIYKCDEFSLCIFFLPETGVIPLHNHPGMTVFSKLLMGKMHIKSY 162
                    P + Y  +++CD FS+ IF +P   +IPLHNHPGMTV SKL+ G MH+KSY
Sbjct: 61  ERNGNHHSLPAIKYLQLHECDSFSIGIFCMPPGSIIPLHNHPGMTVLSKLVYGSMHVKSY 120

Query: 163 DWVDPTNSDDPTKPCEKRLAKLKTDSVFTSPSSTSVLYPTSGGNIHSFTAITPCAVLDVL 222
           DW +P  S+    P + R AKL  D   TSPS  + LYPT+GGNIH F AIT CA+ D+L
Sbjct: 121 DWAEPDQSE-LDDPLQARPAKLVKDIDMTSPSPATTLYPTTGGNIHCFKAITHCAIFDIL 180

Query: 223 GPPYSMDDDRDCSYYKEHPYGSFSNGGMAMTEEEAKGYGWLEEIEMPENSRMDGIQYLGP 273
            PPYS    R C+Y+++ P          M  E      WLEE + P+N  +  + Y GP
Sbjct: 181 SPPYSSTHGRHCNYFRKSPMLDLPGEIEVMNGEVISNVTWLEEYQPPDNFVIWRVPYRGP 240

BLAST of Cp4.1LG10g06940 vs. TAIR10
Match: AT2G42670.2 (AT2G42670.2 Protein of unknown function (DUF1637))

HSP 1 Score: 177.6 bits (449), Expect = 1.1e-44
Identity = 100/243 (41.15%), Positives = 139/243 (57.20%), Query Frame = 1

Query: 43  VPMTLQELFVACKEVFK--GPGTVPLPCEVEN-LYNMKLEDVGL--------SNNLPFFK 102
           +P   Q L+  CK  F   GP T     +V N L  +K  DVG+        S + P  +
Sbjct: 1   MPYFAQRLYNTCKASFSSDGPITEDALEKVRNVLEKIKPSDVGIEQDAQLARSRSGPLNE 60

Query: 103 HNVPVKGSPRVIYTTIYKCDEFSLCIFFLPETGVIPLHNHPGMTVFSKLLMGKMHIKSYD 162
            N   +  P + Y  +++CD FS+ IF +P + +IPLHNHPGMTV SKL+ G MH+KSYD
Sbjct: 61  RNGSNQSPPAIKYLHLHECDSFSIGIFCMPPSSMIPLHNHPGMTVLSKLVYGSMHVKSYD 120

Query: 163 WVDP--TNSDDPTKPCEKRLAKLKTDSVFTSPSSTSVLYPTSGGNIHSFTAITPCAVLDV 222
           W++P  T  +DP++  E R AKL  D+  T+ S  + LYP SGGNIH F AIT CA+LD+
Sbjct: 121 WLEPQLTEPEDPSQ--EARPAKLVKDTEMTAQSPVTTLYPKSGGNIHCFKAITHCAILDI 180

Query: 223 LGPPYSMDDDRDCSYYKEHPYGSFSNGGMAMTEEEAKGYGWLEEIEMPENSRMDGIQYLG 273
           L PPYS + DR C+Y+++        G + +  E      WLEE + P++  +  I Y G
Sbjct: 181 LAPPYSSEHDRHCTYFRKSRREDLP-GELEVDGEVVTDVTWLEEFQPPDDFVIRRIPYRG 240

BLAST of Cp4.1LG10g06940 vs. NCBI nr
Match: gi|659082758|ref|XP_008442017.1| (PREDICTED: probable 2-aminoethanethiol dioxygenase [Cucumis melo])

HSP 1 Score: 471.5 bits (1212), Expect = 1.0e-129
Identity = 224/277 (80.87%), Positives = 246/277 (88.81%), Query Frame = 1

Query: 4   ETRMVEQRSNRVGHMSKVQYVKRDIKKRKCRDIKRP-VPVVPMTLQELFVACKEVFKGPG 63
           ETR V++ SNR+GH++KVQYV+RD KKRKCR IKRP +PVVPM LQELFV+C+EVFKGPG
Sbjct: 2   ETRAVDRGSNRIGHVNKVQYVRRDFKKRKCRKIKRPSIPVVPMALQELFVSCREVFKGPG 61

Query: 64  TVPLPCEVENLY----NMKLEDVGLSNNLPFFKHNVPVKGSPRVIYTTIYKCDEFSLCIF 123
           TVPLPC+VE L     NMK EDVGLS+NL FFK NVPVKGSPRV YTTIY+CD FSLCIF
Sbjct: 62  TVPLPCDVEKLCCILDNMKAEDVGLSSNLQFFKPNVPVKGSPRVTYTTIYRCDNFSLCIF 121

Query: 124 FLPETGVIPLHNHPGMTVFSKLLMGKMHIKSYDWVDPTNSDDPTKPCEKRLAKLKTDSVF 183
           FLP TGVIPLHNHPGMTVFSKLL+GKMHIKSYDWVDPTNSDD  +PCE+RLAKLK D+VF
Sbjct: 122 FLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCERRLAKLKADAVF 181

Query: 184 TSPSSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMDDDRDCSYYKEHPYGSFSNGGM 243
           TSP STSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSM+D RDCSYYKEHPY SF N  M
Sbjct: 182 TSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYASFPNCDM 241

Query: 244 AMTEEEAKGYGWLEEIEMPENSRMDGIQYLGPQIMDI 276
            + EEE +GYGWLEEIE+PENS+MDGI+YLGPQI DI
Sbjct: 242 GLGEEEGEGYGWLEEIEVPENSQMDGIEYLGPQISDI 278

BLAST of Cp4.1LG10g06940 vs. NCBI nr
Match: gi|449462764|ref|XP_004149110.1| (PREDICTED: 2-aminoethanethiol dioxygenase-like [Cucumis sativus])

HSP 1 Score: 468.8 bits (1205), Expect = 6.6e-129
Identity = 223/277 (80.51%), Positives = 244/277 (88.09%), Query Frame = 1

Query: 4   ETRMVEQRSNRVGHMSKVQYVKRDIKKRKCRDIKRPVPVVPMTLQELFVACKEVFKGPGT 63
           ETR V + SNR+GH++KVQYV+RD KKRKCR I+R +PVVPM LQELFV+C+EVFKGPGT
Sbjct: 2   ETRAVNRGSNRIGHVNKVQYVRRDFKKRKCRKIRRSIPVVPMALQELFVSCREVFKGPGT 61

Query: 64  VPLPCEVENLY----NMKLEDVGLSNNLPFFKHNVPVKGSPRVIYTTIYKCDEFSLCIFF 123
           VPLPC+VE L     NMK EDVGLS++L FFK NVPVKGSPRV YTTIYKCD FSLCIFF
Sbjct: 62  VPLPCDVEKLCRILDNMKAEDVGLSSSLQFFKPNVPVKGSPRVTYTTIYKCDNFSLCIFF 121

Query: 124 LPETGVIPLHNHPGMTVFSKLLMGKMHIKSYDWVDPTNSDDPTKPCEKRLAKLKTDSVFT 183
           LP TGVIPLHNHPGMTVFSKLL+GKMHIKSYDWVDPTNSDD  +PCEKRLAKLK D+VFT
Sbjct: 122 LPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDDTAQPCEKRLAKLKADAVFT 181

Query: 184 SPSSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMDDDRDCSYYKEHPYGSFSNGGMA 243
           SP STSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSM+D RDCSYYKEHPY SF NG M 
Sbjct: 182 SPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGRDCSYYKEHPYASFPNGDMG 241

Query: 244 MTEE-EAKGYGWLEEIEMPENSRMDGIQYLGPQIMDI 276
           + EE + +GYGWLEEIE+PENS MDGI+YLGPQI DI
Sbjct: 242 LGEEDQGEGYGWLEEIEVPENSEMDGIEYLGPQICDI 278

BLAST of Cp4.1LG10g06940 vs. NCBI nr
Match: gi|743915500|ref|XP_011001703.1| (PREDICTED: 2-aminoethanethiol dioxygenase-like [Populus euphratica])

HSP 1 Score: 369.0 bits (946), Expect = 7.2e-99
Identity = 175/272 (64.34%), Positives = 211/272 (77.57%), Query Frame = 1

Query: 7   MVEQRSNRVGHMSKVQYVKRDIKKRKCRDIKRPVPVVPMTLQELFVACKEVFKGPGTVPL 66
           MVEQR   V H+ +V Y  R I+K++ +  ++    VPM LQ+LFV+CK++FKG  TVPL
Sbjct: 6   MVEQRREPVAHVHRVGYANRSIRKKRRKRAEKCASTVPMALQDLFVSCKQMFKGSDTVPL 65

Query: 67  PCEVENLYN----MKLEDVGLSNNLPFFKHNVPVKGSPRVIYTTIYKCDEFSLCIFFLPE 126
           P ++E L N    MK EDVGLS+ L FFK    VKG+PRV YTTIYKC++FSLCIFFLP 
Sbjct: 66  PEDIERLCNILDNMKPEDVGLSSELQFFKTKAAVKGTPRVTYTTIYKCNDFSLCIFFLPA 125

Query: 127 TGVIPLHNHPGMTVFSKLLMGKMHIKSYDWVDPTNSDDPTKPCEKRLAKLKTDSVFTSPS 186
             VIPLHNHPGMTVFSKLL+GKMHIK+YDWVDP  +D P  P + RLAKL+ D+V T+P 
Sbjct: 126 NAVIPLHNHPGMTVFSKLLLGKMHIKAYDWVDPPRADGPDTPNQLRLAKLEADNVITAPC 185

Query: 187 STSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMDDDRDCSYYKEHPYGSFSNGGMAMTE 246
           +TSVLYPT+GGNIH FTAITPCAVLDVLGPPYS + DRDCSYYK+ PY +FSNG M + +
Sbjct: 186 NTSVLYPTTGGNIHQFTAITPCAVLDVLGPPYSKEGDRDCSYYKDFPYTAFSNGEMELKK 245

Query: 247 EEAKGYGWLEEIEMPENSRMDGIQYLGPQIMD 275
           EE   Y WLEE E+PENS+MDGI+YLGPQ+ D
Sbjct: 246 EEGSCYAWLEETEVPENSKMDGIEYLGPQVDD 277

BLAST of Cp4.1LG10g06940 vs. NCBI nr
Match: gi|595820692|ref|XP_007204698.1| (hypothetical protein PRUPE_ppa009537mg [Prunus persica])

HSP 1 Score: 368.2 bits (944), Expect = 1.2e-98
Identity = 189/286 (66.08%), Positives = 217/286 (75.87%), Query Frame = 1

Query: 1   MATET-RMVEQRSN-----RVGHMSKVQY-VKRDIKKRKC-RDIKRPVPVVPMTLQELFV 60
           M  ET R+VEQ  +     RVGH SKV Y V + I KRKC + IK  VP  P  LQ+LFV
Sbjct: 1   MTMETARLVEQGRDHKVVSRVGHASKVGYHVSKAITKRKCGKKIKHSVP--PTVLQQLFV 60

Query: 61  ACKEVFKGPGTVPLPCEVENLYN----MKLEDVGLSNNLPFFKHNVPVKGSPRVIYTTIY 120
           +C++VFKGPGTVP P +V NL +    M+ EDVGLS +L FFK    V+G+PRV YTTIY
Sbjct: 61  SCRQVFKGPGTVPSPHDVHNLCSILDKMRPEDVGLSRDLQFFKPKTVVQGTPRVTYTTIY 120

Query: 121 KCDEFSLCIFFLPETGVIPLHNHPGMTVFSKLLMGKMHIKSYDWVDPTNSDDPTKPCEKR 180
           +C  FSLC  F+P TGVIPLHNHP MTVFSKLL+GKMHIKSYDWVDP NSD  T   + R
Sbjct: 121 ECSNFSLCCLFIPATGVIPLHNHPEMTVFSKLLLGKMHIKSYDWVDPVNSDGSTPAPQLR 180

Query: 181 LAKLKTDSVFTSPSSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMDDDRDCSYYKEH 240
           LAKLK DSVFTSP +TSVLYPT GGNIH+FTAITPCAVLDVLGPPYS +DDRDCSYYK+H
Sbjct: 181 LAKLKADSVFTSPCNTSVLYPTEGGNIHAFTAITPCAVLDVLGPPYSKEDDRDCSYYKDH 240

Query: 241 PYGSFSNGGMAMTEEEAKGYGWLEEIEMPENSRMDGIQYLGPQIMD 275
           PY ++SNG  ++TE     YGWLEEIEMPENS MD I YLGPQ+ +
Sbjct: 241 PYAAYSNGEASVTEGNGDCYGWLEEIEMPENSEMDKIPYLGPQVTE 284

BLAST of Cp4.1LG10g06940 vs. NCBI nr
Match: gi|645278211|ref|XP_008244130.1| (PREDICTED: probable 2-aminoethanethiol dioxygenase [Prunus mume])

HSP 1 Score: 367.5 bits (942), Expect = 2.1e-98
Identity = 189/287 (65.85%), Positives = 217/287 (75.61%), Query Frame = 1

Query: 1   MATET-RMVEQRSN-----RVGHMSKVQY-VKRDIKKRKC-RDIKRPVP-VVPMTLQELF 60
           M  ET R+VEQ  +     RVGH SKV Y V + IKKRKC + +K  VP  VP+ LQ+LF
Sbjct: 1   MTMETARLVEQGRDHKVVSRVGHASKVGYHVNKAIKKRKCGKKMKHSVPSTVPIALQQLF 60

Query: 61  VACKEVFKGPGTVPLPCEVENLY----NMKLEDVGLSNNLPFFKHNVPVKGSPRVIYTTI 120
           V+C++VFKGPGTVP P +V  L     NM+ EDVGLS +L FFK    V+G+PRV YTTI
Sbjct: 61  VSCRQVFKGPGTVPSPHDVHKLCSILDNMRPEDVGLSRDLQFFKPKTVVQGTPRVTYTTI 120

Query: 121 YKCDEFSLCIFFLPETGVIPLHNHPGMTVFSKLLMGKMHIKSYDWVDPTNSDDPTKPCEK 180
           Y+C  FSLC  F+P TGVIPLHNHP MTVFSKLL+GKMHIKSYDWVDP NSD  T   + 
Sbjct: 121 YECRNFSLCCLFIPATGVIPLHNHPEMTVFSKLLLGKMHIKSYDWVDPVNSDGSTPAPQF 180

Query: 181 RLAKLKTDSVFTSPSSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMDDDRDCSYYKE 240
           RLAKLK DSVFTSP +TSVLYPT GGNIH+FTAITPCAVLDVLGPPYS +DDRDCSYYK+
Sbjct: 181 RLAKLKADSVFTSPCNTSVLYPTEGGNIHAFTAITPCAVLDVLGPPYSKEDDRDCSYYKD 240

Query: 241 HPYGSFSNGGMAMTEEEAKGYGWLEEIEMPENSRMDGIQYLGPQIMD 275
           HPY ++ NG  A+TE     YGWLEEIEMP NS MD I YLGPQ+ +
Sbjct: 241 HPYAAYPNGEAAVTEGNGDCYGWLEEIEMPANSEMDKIPYLGPQVTE 287

The following BLAST results are available for this feature:

vs. Swiss-Prot
Match Name    E-value    Identity (%)    Description
PCO1_ARATH    4.2e-59    49.17           Plant cysteine oxidase 1 OS=Arabidopsis thaliana GN=PCO1 PE=1 SV=1
PCO2_ARATH    5.5e-59    48.64           Plant cysteine oxidase 2 OS=Arabidopsis thaliana GN=PCO2 PE=1 SV=1
PCO3_ARATH    2.7e-45    39.43           Plant cysteine oxidase 3 OS=Arabidopsis thaliana GN=PCO3 PE=1 SV=1
PCO5_ARATH    1.9e-43    40.91           Plant cysteine oxidase 5 OS=Arabidopsis thaliana GN=PCO5 PE=1 SV=1
PCO4_ARATH    1.9e-43    40.66           Plant cysteine oxidase 4 OS=Arabidopsis thaliana GN=PCO4 PE=1 SV=2

vs. TrEMBL
Match Name          E-value     Identity (%)    Description
A0A0A0KWK6_CUCSA    4.6e-129    80.51           Uncharacterized protein OS=Cucumis sativus GN=Csa_4G188410 PE=4 SV=1
M5VWU7_PRUPE        8.5e-99     66.08           Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa009537mg PE=4 SV=1
A0A061F0D3_THECC    6.7e-96     65.34           Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_025853 PE=4 SV=1
A0A067KTW4_JATCU    7.0e-93     60.87           Uncharacterized protein OS=Jatropha curcas GN=JCGZ_02681 PE=4 SV=1
A9PDV9_POPTR        2.7e-92     61.25           Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0013s06040g PE=2 SV=1

vs. TAIR10
Match Name     E-value    Identity (%)    Description
AT5G15120.1    2.4e-60    49.17           Protein of unknown function (DUF1637)
AT5G39890.1    3.1e-60    48.64           Protein of unknown function (DUF1637)
AT1G18490.1    1.5e-46    39.43           Protein of unknown function (DUF1637)
AT3G58670.1    1.1e-44    40.91           Protein of unknown function (DUF1637)
AT2G42670.2    1.1e-44    41.15           Protein of unknown function (DUF1637)

vs. NCBI nr
Match Name                          E-value     Identity (%)    Description
gi|659082758|ref|XP_008442017.1|    1.0e-129    80.87           PREDICTED: probable 2-aminoethanethiol dioxygenase [Cucumis melo]
gi|449462764|ref|XP_004149110.1|    6.6e-129    80.51           PREDICTED: 2-aminoethanethiol dioxygenase-like [Cucumis sativus]
gi|743915500|ref|XP_011001703.1|    7.2e-99     64.34           PREDICTED: 2-aminoethanethiol dioxygenase-like [Populus euphratica]
gi|595820692|ref|XP_007204698.1|    1.2e-98     66.08           hypothetical protein PRUPE_ppa009537mg [Prunus persica]
gi|645278211|ref|XP_008244130.1|    2.1e-98     65.85           PREDICTED: probable 2-aminoethanethiol dioxygenase [Prunus mume]

The following terms have been associated with this gene:
Vocabulary: Biological Process
Term          Definition
GO:0055114    oxidation-reduction process

Vocabulary: Molecular Function
Term          Definition
GO:0016702    oxidoreductase activity, acting on single donors with incorporation of molecular oxygen, incorporation of two atoms of oxygen

Vocabulary: INTERPRO
Term         Definition
IPR014710    RmlC-like_jellyroll
IPR012864    PCO/ADO
IPR011051    RmlC_Cupin_sf

GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0055114        oxidation-reduction process
cellular_component    GO:0005575        cellular_component
molecular_function    GO:0016702        oxidoreductase activity, acting on single donors with incorporation of molecular oxygen, incorporation of two atoms of oxygen

The following mRNA feature(s) are a part of this gene:

Feature Name         Unique Name          Type
Cp4.1LG10g06940.1    Cp4.1LG10g06940.1    mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term     IPR Description                                      Source     Source Term          Source Description     Alignment
IPR011051    RmlC-like cupin domain                               unknown    SSF51182             RmlC-like cupins       coord: 43..228, score: 3.87
IPR012864    Cysteine oxygenase/2-aminoethanethiol dioxygenase    PFAM       PF07847              DUF1637                coord: 75..272, score: 5.4
IPR014710    RmlC-like jelly roll fold                            GENE3D     G3DSA:2.60.120.10                           coord: 104..227, score: 6.6
None         No IPR available                                     PANTHER    PTHR22966            UNCHARACTERIZED        coord: 69..275, score: 5.8
None         No IPR available                                     PANTHER    PTHR22966:SF13       SUBFAMILY NOT NAMED    coord: 69..275, score: 5.8