CmaCh03G000050 (gene) Cucurbita maxima (Rimu)

Name: CmaCh03G000050
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: 4,5-DOPA dioxygenase extradiol-like protein
Location: Cma_Chr03 : 116004 .. 119961 (-)
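
The location value packs the chromosome, start, end, and strand into one string. A minimal Python sketch of splitting it apart follows; the function name `parse_location` and the returned dictionary layout are illustrative assumptions, and the coordinates are assumed to be 1-based and inclusive.

```python
import re

def parse_location(text):
    """Parse a 'Chr : start .. end (strand)' string into its parts."""
    m = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: coordinates are 1-based and inclusive,
    # so the span length is end - start + 1.
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Cma_Chr03 : 116004 .. 119961 (-)")
print(loc["chrom"], loc["strand"], loc["length"])  # Cma_Chr03 - 3958
```

Under that assumption the gene spans 3958 bp on the minus strand of Cma_Chr03.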
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, exon, three_prime_UTR
ATGAATTCAGAAGCCTTGCAGGTAGCTAAGGTTTACCGCCAGCTTCTCAAAGCTGTAAAGAACCATATTGGAAAGGAAGAGAACAAGAAGCATTTTGTGGACTATGTGGCTCAGAAGTTCAGGGAAAAAAGCACACTTTCGAAACCCCATTCTGTTCAACAGAAAATCAAGCTTGCTCGCGATTACACATTTCTACTTAACAGCGTGCACCATCAAAAGGCAATACTTCACTTCCACTATTTAAAACTTTTTTCTTGTTCGAATAAAGAATTAGGCCTTGATAACCATTGTTGTATTTTTCATAGTTTCTTTAGAATGAATTTCGTCTTTTATAACTTATAAGGGAACCTTCCGATTCTTAGTCGAATTTCTTTTTTTTTTTTTGATAAGAGATTCTTAGTCAAATTTCAAAAACAATTTTTTTTTCGAACTTGGCTTGAATTTCAAAAACATTCAACTCTTTGGCTATAGGTGGAAGTAGTGTTGACTAGTGGGGCCTCAGTAATCTTTTGTTTGGTTATGTGATTCATTAACTATAATTTTCTTTTACAATGTAGATTAATTTGTAGTCCATTGAGGATTTGACAAATGAGGACTTGTTAATGGCCTTAAGTCATAATTTTCATTAATCTAACTACGACGTATAATATAGATACTGTTTCATTAGGTGGTTTATCAATCAATTTGTAGATCTTCAATGCAAGATTTGATGGGGAAATTAGGTGCTTCTTACGTTCTACTATCTGCAATTTCAGAGTAATGAGATATTCAAATTCTAGCAGAAGACACATTATAGCTCAGAATCAAGAATAAATGAACTTCTGCTTCGTTTCCTCATTTGATGTTTGTTAATAACTGGAATTTTTATTGGTTATGTATGTTCAGGACTTGCTGTTCTCGTACAACATTGCTATTGATAGATCAGATGAAATGAAGCGGATATTGGGCAAGTCTGCCGCAAGTGTGGGGCTTAAACTTCCGGAGGTTTATCAACCCTGATGTCCTGTGGTTGTTTTATCAAGCAGAATTACTTCGTTCCTCCTAAGAAAAGAAGAATAATGACTTCGATGAGCTGCCCCTGTACCAGCAGACTTTAATATTGTTGATCTTCTACCATCTGATACTTCAGTTTTATGTGCATTGGAAGAAGCAAAGGGCAGCTACGAAGATTTTATCTTTCCTCTTATTAGAACTTCTAAGGGTGAGTTTGACAGCAATCTGGTGAGTTATTTGTTTCATTTCCATATAATATGTACTTCAATAACGTGAAAAAGAGCCGTTGAATTTTGATCAAGAAGCACTTTGACTTAGTTGGATCTTAGTTGTAGGTGTTTTCATCGCAATATAATCAACCAACCGTTCGAAGTTATGTGTTTAGTCATTTTCAGTCTTATTTGACCAACTTCTTGATCAATTATTTCCCCATATAGAATTTCTGAGTAATTGGAAGTCGCTTTAGATGATGAGGCCATAGGAATGGGTGCATCATGACATGTCTTAATGAATTAGTTGGTACTAGAAATGCTGGAACAATCGATGACTGCTTAGTCTCATTCGAGAGTCTAATGTCTTTGTATCATTTTTTCAAAGGAAAAAAAAAGAAAAAAAAAAGGTGGATTTGGGCTCCTTCCCTTCCATTGTATAATTGCGCGGGAAGGAATCACATGCCTTTTGTCTCATGCATCTAACATGCTTGCGGGTTCACCTGTTTTAGTCATTTATGAAGTATTACAATTGAAAAATAGGTATTGCCTATGGTTTCATACTCTATCTACCAATACGAGAATAACCCATGTATAAGTAACATATAATTCAAGGAGATGGAAGGTTCATATCAACCTCCAATGTCACCTGTCAAATGTTTTCATTTCTATTAATTGATTTAGACATTTGTGTATAGATAACAAATCTATTTATAACTATTCTCTTACAACAGAAATTTAGTGGTGGTTCAATAATTACGGGATAGAATTGAATTACTTCCCCACCTATGCCAGAAG
TAAAAAGCGATCTCACGTTTGTGGAAGGATTACTATTTGAGGACTAATTTCCCACGCATTCAACGGCATGTGCCTCACATTCAGCTCTATAAACCCCTTCCCCACTGCACTGCATGCCTTTACCTTACTAGCTTATTTCTGATGGCTTTGAACGACACTTTCTATGTGTCCCATGGATCCCCAACGCTAACCATCGACGATGGCATCAAAGCAAGGCACTTCTTCAAATCATGGAAGGACAAGGTTTTTTCCCAAACACCCAAAGCCATTCTTTGTGTCTCTGCCCATTTCGACACTGCCTATCCCACCGTCAACGTTGTTTCTGGCCCCAATGATACCATCTATGACTTCTATGGTTTCCCCTCCTCCATGTATGAGGTAACATACTAAACCTAGCAAGTGTATGTTAGGAACCACGACTCTTCACAATTGTATGATATTCTCCACTTTGAGTCTACGCTTTTATGACTTTGGTTTTAGTTTCCCCAAAAGACCTTGTACTACTGGGGAGATATATTTCTTACTTATAAACTTATGATCAACCTCTTAATTAGCCGATGTGGGACATCTTTCCCAACAATTCTCGATAGTTGCATTTATAAATTTTCAAGCTTTGAACCCTGGAGGGAAACTTTCCAATATTGATAACAATAATGCATAAATGACATGAAAAACAGTTAAAATATCCAGCACCAGGAGCTCCGGCATTGGCGAAGAGGGTGAAGGAGGCTCTGATGGGGGCCGGGTTTGAGCGGGTGGACGAGGAGAAAGGGCGAGGGCTGGATCATGGAGCATGGGTTCCTCTGATGTTCATGTATCCAGAGGCAGACATCCCAGTTTGCCAACTTTCAGTACAATCACGCCTAAATGGGACACACCATTACAACATGGGCAAGGCATTGGCCCCTCTCAAGGACGAAGGGGTTCTCGTTGTTGGGTCAGGAAGTGCCACGCACAACCTTAGGACACTCGACCGCCACGCCAGGTCTTCTGTCAACGCCTCATGGGCCATTGAATTTGACGACTGGCTTAAACATGCTCTCCTTCAAAGAAGGTAATTTCATTTCCATTATTTATAATCAAATCAATTAAAAAGATATAAAAGACTCATGAAAAGATACCTATGAAAGATATTTGCCTATAATTGTGATTATCCTACCTTAGTTGGGGAGGAGAACGAAACACAATTTATAAGGTGTGTCAACTTCTGCTAGCCGACGCGTTTTAAAGCCTTGAGAGGAAGCTCAAAGAAGACAATATTTGTTAGCGGTGGATAAGAATCTACAACGCACTAGAATGCCCTTGTTGACGATCCTTTACTCCTTTACTAAATATCTTCAACACCTTCAACCGTGGTTGAAGGAAAAATCTCACTTATTCTTTGCTCATTTAACTTAAATAAATATTTTTTGTTTGAGAATTAGAAGAATATTTCGTGTGGGTAGATACGAAGATGTGAACGAGTACAAAAACAAGGCTCCAAATGCAGAAAAGGCACATCCAAGTCCCGATCACCTATTTCCACTCCATGTCGCAATCGGGGCAGCTGGAGGCCACCCCAAAGCGAAGCTAGTCCACCATAGCTGGGACCTTGGCACCATGTCCTACGCCTCCTACCAGTTCACAGCCTCTTAAGCCCTAAACCCACTAATCGCATCAACATAATTACAAACCTCCTAACTCTTCTAGTTGCCTTGTGTCTTTCATTTCAATGTTGTTCAAATGTACCATAATGTAGTATGTTTGTAAGGTCATCCTATTTATTTATGTGACAAACCATATATGGTTTTGGTTTATTAGAACGTTGTATGAACGTTTCTTGTCAATGTGACATATCACACAACACAATCAAAAGTAATAAGATCGAGAGAAGTAAAAACAAATCTGTATACGTGTATTGTTCAGTGATTTGAAATAGCACTACCATATTACGAGATCCTATGTTGGTTGAATAAGAG

mRNA sequence

ATGAATTCAGAAGCCTTGCAGGTAGCTAAGGTTTACCGCCAGCTTCTCAAAGCTGTAAAGAACCATATTGGAAAGGAAGAGAACAAGAAGCATTTTGTGGACTATGTGGCTCAGAAGTTCAGGGAAAAAAGCACACTTTCGAAACCCCATTCTGTTCAACAGAAAATCAAGCTTGCTCGCGATTACACATTTCTACTTAACAGCGACTTGCTGTTCTCGTACAACATTGCTATTGATAGATCAGATGAAATGAAGCGGATATTGGGCAAGTCTGCCGCAAGTGTGGGGCTTAAACTTCCGGAGTTTTATGTGCATTGGAAGAAGCAAAGGGCAGCTACGAAGATTTTATCTTTCCTCTTATTAGAACTTCTAAGGGATTACTATTTGAGGACTAATTTCCCACGCATTCAACGGCATGTGCCTCACATTCAGCTCTATAAACCCCTTCCCCACTGCACTGCATGCCTTTACCTTACTAGCTTATTTCTGATGGCTTTGAACGACACTTTCTATGTGTCCCATGGATCCCCAACGCTAACCATCGACGATGGCATCAAAGCAAGGCACTTCTTCAAATCATGGAAGGACAAGGTTTTTTCCCAAACACCCAAAGCCATTCTTTGTGTCTCTGCCCATTTCGACACTGCCTATCCCACCGTCAACTTAAAATATCCAGCACCAGGAGCTCCGGCATTGGCGAAGAGGGTGAAGGAGGCTCTGATGGGGGCCGGGTTTGAGCGGGTGGACGAGGAGAAAGGGCGAGGGCTGGATCATGGAGCATGGGTTCCTCTGATGTTCATGTATCCAGAGGCAGACATCCCAGTTTGCCAACTTTCAGTACAATCACGCCTAAATGGGACACACCATTACAACATGGGCAAGGCATTGGCCCCTCTCAAGGACGAAGGGGTTCTCGTTGTTGGGTCAGGAAGTGCCACGCACAACCTTAGGACACTCGACCGCCACGCCAGGTCTTCTGTCAACGCCTCATGGGCCATTGAATTTGACGACTGGCTTAAACATGCTCTCCTTCAAAGAAGATACGAAGATGTGAACGAGTACAAAAACAAGGCTCCAAATGCAGAAAAGGCACATCCAAGTCCCGATCACCTATTTCCACTCCATGTCGCAATCGGGGCAGCTGGAGGCCACCCCAAAGCGAAGCTAGTCCACCATAGCTGGGACCTTGGCACCATGTCCTACGCCTCCTACCAGTTCACAGCCTCTTAAGCCCTAAACCCACTAATCGCATCAACATAATTACAAACCTCCTAACTCTTCTAGTTGCCTTGTGTCTTTCATTTCAATGTTGTTCAAATGTACCATAATGTAGTATGTTTGTAAGGTCATCCTATTTATTTATGTGACAAACCATATATGGTTTTGGTTTATTAGAACGTTGTATGAACGTTTCTTGTCAATGTGACATATCACACAACACAATCAAAAGTAATAAGATCGAGAGAAGTAAAAACAAATCTGTATACGTGTATTGTTCAGTGATTTGAAATAGCACTACCATATTACGAGATCCTATGTTGGTTGAATAAGAG

Coding sequence (CDS)

ATGAATTCAGAAGCCTTGCAGGTAGCTAAGGTTTACCGCCAGCTTCTCAAAGCTGTAAAGAACCATATTGGAAAGGAAGAGAACAAGAAGCATTTTGTGGACTATGTGGCTCAGAAGTTCAGGGAAAAAAGCACACTTTCGAAACCCCATTCTGTTCAACAGAAAATCAAGCTTGCTCGCGATTACACATTTCTACTTAACAGCGACTTGCTGTTCTCGTACAACATTGCTATTGATAGATCAGATGAAATGAAGCGGATATTGGGCAAGTCTGCCGCAAGTGTGGGGCTTAAACTTCCGGAGTTTTATGTGCATTGGAAGAAGCAAAGGGCAGCTACGAAGATTTTATCTTTCCTCTTATTAGAACTTCTAAGGGATTACTATTTGAGGACTAATTTCCCACGCATTCAACGGCATGTGCCTCACATTCAGCTCTATAAACCCCTTCCCCACTGCACTGCATGCCTTTACCTTACTAGCTTATTTCTGATGGCTTTGAACGACACTTTCTATGTGTCCCATGGATCCCCAACGCTAACCATCGACGATGGCATCAAAGCAAGGCACTTCTTCAAATCATGGAAGGACAAGGTTTTTTCCCAAACACCCAAAGCCATTCTTTGTGTCTCTGCCCATTTCGACACTGCCTATCCCACCGTCAACTTAAAATATCCAGCACCAGGAGCTCCGGCATTGGCGAAGAGGGTGAAGGAGGCTCTGATGGGGGCCGGGTTTGAGCGGGTGGACGAGGAGAAAGGGCGAGGGCTGGATCATGGAGCATGGGTTCCTCTGATGTTCATGTATCCAGAGGCAGACATCCCAGTTTGCCAACTTTCAGTACAATCACGCCTAAATGGGACACACCATTACAACATGGGCAAGGCATTGGCCCCTCTCAAGGACGAAGGGGTTCTCGTTGTTGGGTCAGGAAGTGCCACGCACAACCTTAGGACACTCGACCGCCACGCCAGGTCTTCTGTCAACGCCTCATGGGCCATTGAATTTGACGACTGGCTTAAACATGCTCTCCTTCAAAGAAGATACGAAGATGTGAACGAGTACAAAAACAAGGCTCCAAATGCAGAAAAGGCACATCCAAGTCCCGATCACCTATTTCCACTCCATGTCGCAATCGGGGCAGCTGGAGGCCACCCCAAAGCGAAGCTAGTCCACCATAGCTGGGACCTTGGCACCATGTCCTACGCCTCCTACCAGTTCACAGCCTCTTAA
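
In this record the mRNA begins with the same start codon as the CDS and continues past the CDS stop codon into the 3' UTR. The sketch below shows one way to recover the untranslated regions by locating a CDS inside its mRNA. The helper name `split_mrna` is hypothetical, and the toy strings are short excerpts (the first and last bases of the CDS, then the first bases after the stop codon), not the full sequences.

```python
def split_mrna(mrna, cds):
    """Return (five_prime_utr, cds, three_prime_utr) for a CDS found in an mRNA."""
    start = mrna.find(cds)
    if start < 0:
        raise ValueError("CDS not found in mRNA")
    end = start + len(cds)
    return mrna[:start], mrna[start:end], mrna[end:]

# Toy excerpts only: start of the CDS joined to its last nine bases,
# followed by the first ten bases that trail the stop codon in the mRNA.
toy_cds = "ATGAATTCAGAA" + "GCCTCTTAA"
toy_mrna = toy_cds + "GCCCTAAACC"

five, cds, three = split_mrna(toy_mrna, toy_cds)
print(repr(five), three)  # '' GCCCTAAACC
```

An empty first element means the transcript model has no annotated 5' UTR, which matches the legend for this feature listing only CDS, exon, and three_prime_UTR.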

Protein sequence

MNSEALQVAKVYRQLLKAVKNHIGKEENKKHFVDYVAQKFREKSTLSKPHSVQQKIKLARDYTFLLNSDLLFSYNIAIDRSDEMKRILGKSAASVGLKLPEFYVHWKKQRAATKILSFLLLELLRDYYLRTNFPRIQRHVPHIQLYKPLPHCTACLYLTSLFLMALNDTFYVSHGSPTLTIDDGIKARHFFKSWKDKVFSQTPKAILCVSAHFDTAYPTVNLKYPAPGAPALAKRVKEALMGAGFERVDEEKGRGLDHGAWVPLMFMYPEADIPVCQLSVQSRLNGTHHYNMGKALAPLKDEGVLVVGSGSATHNLRTLDRHARSSVNASWAIEFDDWLKHALLQRRYEDVNEYKNKAPNAEKAHPSPDHLFPLHVAIGAAGGHPKAKLVHHSWDLGTMSYASYQFTAS
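
The protein sequence follows from the CDS by reading it three bases at a time. As a hedged illustration, the sketch below translates a CDS with the standard nuclear genetic code (an assumption; the record does not state which table was used). The names `CODON_TABLE` and `translate` are illustrative, and the two inputs are the first seven codons and the last five codons of the CDS above.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)

print(translate("ATGAATTCAGAAGCCTTGCAG"))  # MNSEALQ
print(translate("TTCACAGCCTCTTAA"))        # FTAS
```

Both results agree with the ends of the protein sequence shown above, which starts with MNSEALQ and ends with FTAS.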
BLAST of CmaCh03G000050 vs. Swiss-Prot
Match: DIOXL_ARATH (Extradiol ring-cleavage dioxygenase OS=Arabidopsis thaliana GN=LIGB PE=2 SV=1)

HSP 1 Score: 337.4 bits (864), Expect = 2.2e-91
Identity = 163/265 (61.51%), Positives = 196/265 (73.96%), Query Frame = 1

Query: 166 LNDTFYVSHGSPTLTIDDGIKARHFFKSWKDKVFSQTPKAILCVSAHFDTAYPTVN---- 225
           +N TF++SHGSPTL+IDD ++AR FFKSW  KV  Q PK+IL +SAH+DT +P+VN    
Sbjct: 4   VNQTFFLSHGSPTLSIDDSLEARQFFKSWTQKVLPQKPKSILVISAHWDTKFPSVNTVLR 63

Query: 226 ----------------LKYPAPGAPALAKRVKEALMG-AGFERVDEEKGRGLDHGAWVPL 285
                           LKY APGA  L KRVKE LM   G +RVDE+  RGLDHGAWVPL
Sbjct: 64  NNTIHDFSGFPDPMYKLKYEAPGAIELGKRVKELLMKEGGMKRVDEDTKRGLDHGAWVPL 123

Query: 286 MFMYPEADIPVCQLSVQSRLNGTHHYNMGKALAPLKDEGVLVVGSGSATHNLRTLDRHAR 345
           M MYPEADIP+CQLSVQS  NG++HYNMGKALA LKDEGVL++GSGSATHNLR LD +  
Sbjct: 124 MLMYPEADIPICQLSVQSNQNGSYHYNMGKALASLKDEGVLIIGSGSATHNLRKLDFNIT 183

Query: 346 SSVNASWAIEFDDWLKHALLQRRYEDVNEYKNKAPNAEKAHPSPDHLFPLHVAIGAAGGH 405
                 WA+EFD WL+ +LLQ RY DVNE++ KAPNA+ AHP P+HL+PLHV +GAAGG 
Sbjct: 184 DGSPVPWALEFDHWLRDSLLQGRYGDVNEWEEKAPNAKMAHPWPEHLYPLHVVMGAAGGD 243

Query: 406 PKAKLVHHSWDLGTMSYASYQFTAS 410
            KA+ +H SW LGT+SY+SY FT+S
Sbjct: 244 AKAEQIHTSWQLGTLSYSSYSFTSS 268
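
The identity and positives percentages in each HSP header are the match counts divided by the alignment length. A small Python sketch of parsing such a line and recomputing the percentages follows; the pattern and the function name `parse_hsp_stats` are assumptions made for illustration, and the pattern accepts more than one spelling of the positives label because exported reports vary.

```python
import re

HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Extract identity/positives counts and recompute their percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return {
        "alignment_length": int(ident_len),
        "identity_pct": 100 * int(ident) / int(ident_len),
        "positives_pct": 100 * int(pos) / int(pos_len),
    }

stats = parse_hsp_stats(
    "Identity = 163/265 (61.51%), Positives = 196/265 (73.96%), Query Frame = 1"
)
print(round(stats["identity_pct"], 2), round(stats["positives_pct"], 2))
# 61.51 73.96
```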

BLAST of CmaCh03G000050 vs. Swiss-Prot
Match: DODA_BETVU (4,5-DOPA dioxygenase extradiol OS=Beta vulgaris GN=DODA PE=1 SV=1)

HSP 1 Score: 302.4 bits (773), Expect = 7.7e-81
Identity = 143/264 (54.17%), Positives = 190/264 (71.97%), Query Frame = 1

Query: 166 LNDTFYVSHGSPTLTIDDGIKARHFFKSWKDKVFSQTPKAILCVSAHFDTAYPTVNL--- 225
           + +TF++SHG+P + IDD   ++ F +SW++K+FS+ PKAIL +SAH++T  P+VN+   
Sbjct: 7   IKETFFISHGTPMMAIDDSKPSKKFLESWREKIFSKKPKAILVISAHWETDQPSVNVVDI 66

Query: 226 -----------------KYPAPGAPALAKRVKEALMGAGFERVDEEKGRGLDHGAWVPLM 285
                            KY APG+P LA R+++ L G+GF+ V+ +K RGLDHGAWVPLM
Sbjct: 67  NDTIYDFRGFPARLYQFKYSAPGSPELANRIQDLLAGSGFKSVNTDKKRGLDHGAWVPLM 126

Query: 286 FMYPEADIPVCQLSVQSRLNGTHHYNMGKALAPLKDEGVLVVGSGSATHNLRTLDRHARS 345
            MYPEADIPVCQLSVQS L+GTHHY +G+ALAPLKDEGVL++GSGSATH   +      S
Sbjct: 127 LMYPEADIPVCQLSVQSHLDGTHHYKLGQALAPLKDEGVLIIGSGSATH--PSNGTPPCS 186

Query: 346 SVNASWAIEFDDWLKHALLQRRYEDVNEYKNKAPNAEKAHPSPDHLFPLHVAIGAAGGHP 405
              A WA  FD WL+ AL    YE+VN+Y+ KAPN + AHP P+H +PLHVA+GAAG + 
Sbjct: 187 DGVAPWAAAFDSWLETALTNGSYEEVNKYETKAPNWKLAHPWPEHFYPLHVAMGAAGENS 246

Query: 406 KAKLVHHSWDLGTMSYASYQFTAS 410
           KA+L+H+SWD G MSY SY+FT++
Sbjct: 247 KAELIHNSWDGGIMSYGSYKFTST 268

BLAST of CmaCh03G000050 vs. Swiss-Prot
Match: DOD1U_BETVU (4,5-DOPA dioxygenase extradiol 1 OS=Beta vulgaris GN=DODA1 PE=2 SV=1)

HSP 1 Score: 295.4 bits (755), Expect = 9.4e-79
Identity = 139/264 (52.65%), Positives = 188/264 (71.21%), Query Frame = 1

Query: 166 LNDTFYVSHGSPTLTIDDGIKARHFFKSWKDKVFSQTPKAILCVSAHFDTAYPTVN---- 225
           + ++F+++HG+P LT++D    R FF++W++K+FS+ PKAIL +S H++T  PTVN    
Sbjct: 14  IKESFFITHGNPILTVEDTHPLRPFFETWREKIFSKKPKAILIISGHWETVKPTVNAVHI 73

Query: 226 ----------------LKYPAPGAPALAKRVKEALMGAGFERVDEEKGRGLDHGAWVPLM 285
                            KYPAPGAP LA++V+E L  +GFE  + ++ RGLDHGAWVPLM
Sbjct: 74  NDTIHDFDDYPAAMYLFKYPAPGAPELARKVEEILKKSGFETAETDEKRGLDHGAWVPLM 133

Query: 286 FMYPEADIPVCQLSVQSRLNGTHHYNMGKALAPLKDEGVLVVGSGSATHNLRTLDRHARS 345
            MYPEADIPVCQLSVQ  L+GT+HYN+G+ALAPLK++GVL++GSGSATH L     H   
Sbjct: 134 LMYPEADIPVCQLSVQPHLDGTYHYNLGRALAPLKNDGVLIIGSGSATHPLDETP-HYFD 193

Query: 346 SVNASWAIEFDDWLKHALLQRRYEDVNEYKNKAPNAEKAHPSPDHLFPLHVAIGAAGGHP 405
            V A WA  FD WL+ AL+  R+E+VN Y+ KAPN + AHP P+H +PLHV +GAAG   
Sbjct: 194 GV-APWAAAFDSWLRKALINGRFEEVNIYETKAPNWKLAHPFPEHFYPLHVVLGAAGEKW 253

Query: 406 KAKLVHHSWDLGTMSYASYQFTAS 410
           KA+L+H SWD GT+ + SY+FT++
Sbjct: 254 KAELIHSSWDHGTLCHGSYKFTSA 275

BLAST of CmaCh03G000050 vs. Swiss-Prot
Match: DOD1W_BETVU (4,5-DOPA dioxygenase extradiol 1 OS=Beta vulgaris GN=DODA1 PE=2 SV=1)

HSP 1 Score: 293.9 bits (751), Expect = 2.7e-78
Identity = 138/264 (52.27%), Positives = 188/264 (71.21%), Query Frame = 1

Query: 166 LNDTFYVSHGSPTLTIDDGIKARHFFKSWKDKVFSQTPKAILCVSAHFDTAYPTVN---- 225
           + ++F+++HG+P LT++D    R FF++W++K+FS+ PKAIL +S H++T  PTVN    
Sbjct: 14  IKESFFITHGNPILTVEDTHPLRPFFETWREKIFSKKPKAILIISGHWETVKPTVNAVHI 73

Query: 226 ----------------LKYPAPGAPALAKRVKEALMGAGFERVDEEKGRGLDHGAWVPLM 285
                            KYPAPG P LA++V+E L  +GFE  + ++ RGLDHGAWVPLM
Sbjct: 74  NDTIHDFDDYPAAMYQFKYPAPGEPELARKVEEILKKSGFETAETDQKRGLDHGAWVPLM 133

Query: 286 FMYPEADIPVCQLSVQSRLNGTHHYNMGKALAPLKDEGVLVVGSGSATHNLRTLDRHARS 345
            MYPEADIPVCQLSVQ  L+GT+HYN+G+ALAPLK++GVL++GSGSATH L     H   
Sbjct: 134 LMYPEADIPVCQLSVQPHLDGTYHYNLGRALAPLKNDGVLIIGSGSATHPLDETP-HYFD 193

Query: 346 SVNASWAIEFDDWLKHALLQRRYEDVNEYKNKAPNAEKAHPSPDHLFPLHVAIGAAGGHP 405
            V A WA  FD WL+ AL+  R+E+VN Y++KAPN + AHP P+H +PLHV +GAAG   
Sbjct: 194 GV-APWAAAFDSWLRKALINGRFEEVNIYESKAPNWKLAHPFPEHFYPLHVVLGAAGEKW 253

Query: 406 KAKLVHHSWDLGTMSYASYQFTAS 410
           KA+L+H SWD GT+ + SY+FT++
Sbjct: 254 KAELIHSSWDHGTLCHGSYKFTSA 275

BLAST of CmaCh03G000050 vs. Swiss-Prot
Match: DODA_PORGR (4,5-DOPA dioxygenase extradiol OS=Portulaca grandiflora GN=DODA PE=1 SV=1)

HSP 1 Score: 265.4 bits (677), Expect = 1.0e-69
Identity = 133/266 (50.00%), Positives = 171/266 (64.29%), Query Frame = 1

Query: 164 MALNDTFYVSHGSPTLTIDDGIKARHFFKSWKDKVFSQTPKAILCVSAHFDTAYPTVN-- 223
           ++  ++F++SHG+P +  D+   AR+F   WK  VF   PK+IL VSAH++T  P V+  
Sbjct: 7   VSFKESFFLSHGNPAMLADESFIARNFLLGWKKNVFPVKPKSILVVSAHWETDVPCVSAG 66

Query: 224 ------------------LKYPAPGAPALAKRVKEALMGAGFERVDEEKGRGLDHGAWVP 283
                             +KYPAPG P LAKRV+E L+  GF+    ++ RG DH +WVP
Sbjct: 67  QYPNVIYDFTEVPASMFQMKYPAPGCPKLAKRVQELLIAGGFKSAKLDEERGFDHSSWVP 126

Query: 284 LMFMYPEADIPVCQLSVQSRLNGTHHYNMGKALAPLKDEGVLVVGSGSATHNLRTLDRHA 343
           L  M PEADIPVCQLSVQ  L+ THH+N+G+ALAPLK EGVL +GSG A H       H 
Sbjct: 127 LSMMCPEADIPVCQLSVQPGLDATHHFNVGRALAPLKGEGVLFIGSGGAVHPSDDTP-HW 186

Query: 344 RSSVNASWAIEFDDWLKHALLQRRYEDVNEYKNKAPNAEK-AHPSPDHLFPLHVAIGAAG 403
              V A WA EFD WL+ ALL+ RYEDVN Y+ KAP   K AHP P+H  PLHVA+GA G
Sbjct: 187 FDGV-APWAAEFDQWLEDALLEGRYEDVNNYQTKAPEGWKLAHPIPEHFLPLHVAMGAGG 246

Query: 404 GHPKAKLVHHSWDLGTMSYASYQFTA 409
              KA+L++ +WD GT+ YASY+FT+
Sbjct: 247 EKSKAELIYRTWDHGTLGYASYKFTS 270

BLAST of CmaCh03G000050 vs. TrEMBL
Match: A0A0A0KYZ9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G443120 PE=4 SV=1)

HSP 1 Score: 415.2 bits (1066), Expect = 9.0e-113
Identity = 198/267 (74.16%), Positives = 219/267 (82.02%), Query Frame = 1

Query: 164 MALNDTFYVSHGSPTLTIDDGIKARHFFKSWKDKVFSQTPKAILCVSAHFDTAYPTVN-- 223
           M L +TFY+SHGSP ++IDD I+AR FFKSWKD  +   PKAILCVSAH+DT +PTVN  
Sbjct: 1   MGLKETFYLSHGSPMMSIDDSIQARQFFKSWKDSFYVIKPKAILCVSAHYDTTFPTVNVV 60

Query: 224 -------------------LKYPAPGAPALAKRVKEALMGAGFERVDEEKGRGLDHGAWV 283
                              LKYPAPGAPALAK VKEAL+ AGFERV+EE+GRGLDHGAWV
Sbjct: 61  SGPNDTIYDFYGFPSSMYKLKYPAPGAPALAKSVKEALVRAGFERVEEERGRGLDHGAWV 120

Query: 284 PLMFMYPEADIPVCQLSVQSRLNGTHHYNMGKALAPLKDEGVLVVGSGSATHNLRTLDRH 343
           PLMFMYPEADIPVCQLSVQS LNGTHHYN+GKALAPLKDEGVL++GSGSATHNLRTL+  
Sbjct: 121 PLMFMYPEADIPVCQLSVQSHLNGTHHYNLGKALAPLKDEGVLIIGSGSATHNLRTLNHS 180

Query: 344 ARSSVNASWAIEFDDWLKHALLQRRYEDVNEYKNKAPNAEKAHPSPDHLFPLHVAIGAAG 403
             SS  A WA+EFD+WLK ALLQ RY DVNEY+ KAP+A  AHPSPDHLFPLHVAIGAAG
Sbjct: 181 GNSSAIAPWALEFDNWLKDALLQGRYNDVNEYEKKAPHARMAHPSPDHLFPLHVAIGAAG 240

Query: 404 GHPKAKLVHHSWDLGTMSYASYQFTAS 410
           G+PKAKL+HHSWDLGTMSYASYQFTAS
Sbjct: 241 GNPKAKLIHHSWDLGTMSYASYQFTAS 267

BLAST of CmaCh03G000050 vs. TrEMBL
Match: I3S0X4_LOTJA (Uncharacterized protein OS=Lotus japonicus PE=2 SV=1)

HSP 1 Score: 368.6 bits (945), Expect = 9.7e-99
Identity = 179/266 (67.29%), Positives = 208/266 (78.20%), Query Frame = 1

Query: 164 MALNDTFYVSHGSPTLTIDDGIKARHFFKSWKDKVFSQTPKAILCVSAHFDTAYPTVN-- 223
           MAL DTFY+SHGSPTL+ID+ + AR F +SWK +VF   P +IL +S H+DTA PTVN  
Sbjct: 1   MALKDTFYISHGSPTLSIDESLVARKFLQSWKKEVFPPRPTSILVISGHWDTAVPTVNVV 60

Query: 224 -------------------LKYPAPGAPALAKRVKEALMGAGFERVDEEKGRGLDHGAWV 283
                              LKYPAPGAP LAKRVKE L   GF RVDE+K RGLDHGAWV
Sbjct: 61  DSTNDTIYDFYGFPKPTYQLKYPAPGAPHLAKRVKELLKEGGFSRVDEDKKRGLDHGAWV 120

Query: 284 PLMFMYPEADIPVCQLSVQSRLNGTHHYNMGKALAPLKDEGVLVVGSGSATHNLRTLDRH 343
           PL+ MYPEADIPVCQLSVQS L+GTHHYN+GKALAPLKDEGVL+VGSGSA HNLR L+RH
Sbjct: 121 PLLLMYPEADIPVCQLSVQSNLDGTHHYNIGKALAPLKDEGVLIVGSGSAVHNLRALERH 180

Query: 344 ARSSVNASWAIEFDDWLKHALLQRRYEDVNEYKNKAPNAEKAHPSPDHLFPLHVAIGAAG 403
           A  +V A WA+EFD+WLK ALL+ RYEDVN Y+ KAP+A+KAHP PDHL+PLHVA+GAAG
Sbjct: 181 A--TVAAPWAVEFDNWLKEALLEGRYEDVNHYEQKAPHAKKAHPWPDHLYPLHVAVGAAG 240

Query: 404 GHPKAKLVHHSWDLGTMSYASYQFTA 409
            + KAKL+H S DLG++SYASYQFT+
Sbjct: 241 ENSKAKLIHSSIDLGSLSYASYQFTS 264

BLAST of CmaCh03G000050 vs. TrEMBL
Match: I3SFC8_LOTJA (Uncharacterized protein OS=Lotus japonicus PE=2 SV=1)

HSP 1 Score: 367.5 bits (942), Expect = 2.2e-98
Identity = 179/266 (67.29%), Positives = 207/266 (77.82%), Query Frame = 1

Query: 164 MALNDTFYVSHGSPTLTIDDGIKARHFFKSWKDKVFSQTPKAILCVSAHFDTAYPTVN-- 223
           MAL DTFY+SHGSPTL+ID+ + AR F +SWK +VF   P +IL +S H+DTA PTVN  
Sbjct: 1   MALKDTFYISHGSPTLSIDESLVARKFLQSWKKEVFPPRPTSILVISGHWDTAVPTVNVV 60

Query: 224 -------------------LKYPAPGAPALAKRVKEALMGAGFERVDEEKGRGLDHGAWV 283
                              LKYPAPGAP LAKRVKE L   GF RVDE+K RGLDHGAWV
Sbjct: 61  DSTNDTIYDFYGFPKPMYQLKYPAPGAPHLAKRVKELLKEGGFSRVDEDKKRGLDHGAWV 120

Query: 284 PLMFMYPEADIPVCQLSVQSRLNGTHHYNMGKALAPLKDEGVLVVGSGSATHNLRTLDRH 343
           PL+ MYPEADIPVCQLSVQS L+GTHHYN+GKALAPLKDEGVL+VGSGSA HNLR L+RH
Sbjct: 121 PLLLMYPEADIPVCQLSVQSNLDGTHHYNIGKALAPLKDEGVLIVGSGSAVHNLRALERH 180

Query: 344 ARSSVNASWAIEFDDWLKHALLQRRYEDVNEYKNKAPNAEKAHPSPDHLFPLHVAIGAAG 403
           A  +V A WA+EFD+WLK ALL+ RYEDVN Y+ KAP+A+KAHP PDH +PLHVAIGAAG
Sbjct: 181 A--TVAAPWAVEFDNWLKEALLEGRYEDVNHYEQKAPHAKKAHPWPDHFYPLHVAIGAAG 240

Query: 404 GHPKAKLVHHSWDLGTMSYASYQFTA 409
            + KAKL+H S DLG++SYASYQFT+
Sbjct: 241 ENSKAKLIHSSIDLGSLSYASYQFTS 264

BLAST of CmaCh03G000050 vs. TrEMBL
Match: A0A061DK69_THECC (Catalytic LigB subunit of aromatic ring-opening dioxygenase family isoform 1 OS=Theobroma cacao GN=TCM_001659 PE=4 SV=1)

HSP 1 Score: 364.4 bits (934), Expect = 1.8e-97
Identity = 174/266 (65.41%), Positives = 204/266 (76.69%), Query Frame = 1

Query: 164 MALNDTFYVSHGSPTLTIDDGIKARHFFKSWKDKVFSQTPKAILCVSAHFDTAYPTVN-- 223
           + + DTFY+SHGSPTL+IDD + ARHF +SWKD VF QTPK+IL VS H+DT+YP VN  
Sbjct: 6   LTMKDTFYISHGSPTLSIDDSLPARHFLQSWKDTVFGQTPKSILVVSGHWDTSYPAVNMV 65

Query: 224 ------------------LKYPAPGAPALAKRVKEALMGAGFERVDEEKGRGLDHGAWVP 283
                             LKYPAPGAP LAKRVKE LM +G +RVDE+K RGLDHGAWVP
Sbjct: 66  QRNDTIYDFYGFPDKMYKLKYPAPGAPELAKRVKELLMASGLKRVDEDKKRGLDHGAWVP 125

Query: 284 LMFMYPEADIPVCQLSVQSRLNGTHHYNMGKALAPLKDEGVLVVGSGSATHNLRTLDRHA 343
           LM MYPEADIPVCQLSVQSR +GT+HYN+GKALAPLKDEGVL++GSG+ATHNLR L    
Sbjct: 126 LMLMYPEADIPVCQLSVQSRRDGTYHYNLGKALAPLKDEGVLIIGSGAATHNLRALGN-- 185

Query: 344 RSSVNASWAIEFDDWLKHALLQRRYEDVNEYKNKAPNAEKAHPSPDHLFPLHVAIGAAGG 403
            +     WA EFD WLK ALL+ RYEDVN ++ KAP A+ AHP PDH +PLHVA+GAAG 
Sbjct: 186 LNGAVVPWASEFDTWLKDALLEGRYEDVNHFQEKAPYAKMAHPWPDHFYPLHVAMGAAGE 245

Query: 404 HPKAKLVHHSWDLGTMSYASYQFTAS 410
             KAKL+H SW+LG++SYASYQFTA+
Sbjct: 246 SSKAKLIHQSWELGSLSYASYQFTAA 269

BLAST of CmaCh03G000050 vs. TrEMBL
Match: G7KNT9_MEDTR (4,5-DOPA dioxygenase extradiol-like protein OS=Medicago truncatula GN=MTR_6g064960 PE=4 SV=1)

HSP 1 Score: 359.8 bits (922), Expect = 4.5e-96
Identity = 178/266 (66.92%), Positives = 207/266 (77.82%), Query Frame = 1

Query: 164 MALNDTFYVSHGSPTLTIDDGIKARHFFKSWKDKVFSQTPKAILCVSAHFDTAYPTVN-- 223
           MAL DTFY+SHGSPTL+IDD I+AR F +SWK  VF + PK+IL +S H+DT  PTVN  
Sbjct: 1   MALKDTFYISHGSPTLSIDDSIEARKFLQSWKKDVFEERPKSILVISGHWDTTVPTVNVI 60

Query: 224 -------------------LKYPAPGAPALAKRVKEALMGAGFERVDEEKGRGLDHGAWV 283
                              LKYPAPGAP LAKRVKE L  +GF+RVDE+K RGLDHGAWV
Sbjct: 61  QTTNDTIHDFYGFPKPMYQLKYPAPGAPELAKRVKELLNKSGFDRVDEDKKRGLDHGAWV 120

Query: 284 PLMFMYPEADIPVCQLSVQSRLNGTHHYNMGKALAPLKDEGVLVVGSGSATHNLRTLDRH 343
           PLM MYPEADIPVCQLSVQS L+GT+HYN+GKALAPLKDEGVL++GSGSA HNL TL  +
Sbjct: 121 PLMLMYPEADIPVCQLSVQSDLDGTYHYNLGKALAPLKDEGVLIMGSGSAVHNLGTL--N 180

Query: 344 ARSSVNASWAIEFDDWLKHALLQRRYEDVNEYKNKAPNAEKAHPSPDHLFPLHVAIGAAG 403
            R+ V A WA+EFD+WLK ALL  RYEDVN Y+ KAP+A+KAHP PDH +PLHVAIGAAG
Sbjct: 181 PRAGV-APWALEFDNWLKDALLDGRYEDVNHYEQKAPHAKKAHPHPDHFYPLHVAIGAAG 240

Query: 404 GHPKAKLVHHSWDLGTMSYASYQFTA 409
            + KAKL+H S +LGT+SYASYQFT+
Sbjct: 241 ENSKAKLIHSSINLGTLSYASYQFTS 263

BLAST of CmaCh03G000050 vs. TAIR10
Match: AT4G15093.1 (AT4G15093.1 catalytic LigB subunit of aromatic ring-opening dioxygenase family)

HSP 1 Score: 337.4 bits (864), Expect = 1.2e-92
Identity = 163/265 (61.51%), Positives = 196/265 (73.96%), Query Frame = 1

Query: 166 LNDTFYVSHGSPTLTIDDGIKARHFFKSWKDKVFSQTPKAILCVSAHFDTAYPTVN---- 225
           +N TF++SHGSPTL+IDD ++AR FFKSW  KV  Q PK+IL +SAH+DT +P+VN    
Sbjct: 4   VNQTFFLSHGSPTLSIDDSLEARQFFKSWTQKVLPQKPKSILVISAHWDTKFPSVNTVLR 63

Query: 226 ----------------LKYPAPGAPALAKRVKEALMG-AGFERVDEEKGRGLDHGAWVPL 285
                           LKY APGA  L KRVKE LM   G +RVDE+  RGLDHGAWVPL
Sbjct: 64  NNTIHDFSGFPDPMYKLKYEAPGAIELGKRVKELLMKEGGMKRVDEDTKRGLDHGAWVPL 123

Query: 286 MFMYPEADIPVCQLSVQSRLNGTHHYNMGKALAPLKDEGVLVVGSGSATHNLRTLDRHAR 345
           M MYPEADIP+CQLSVQS  NG++HYNMGKALA LKDEGVL++GSGSATHNLR LD +  
Sbjct: 124 MLMYPEADIPICQLSVQSNQNGSYHYNMGKALASLKDEGVLIIGSGSATHNLRKLDFNIT 183

Query: 346 SSVNASWAIEFDDWLKHALLQRRYEDVNEYKNKAPNAEKAHPSPDHLFPLHVAIGAAGGH 405
                 WA+EFD WL+ +LLQ RY DVNE++ KAPNA+ AHP P+HL+PLHV +GAAGG 
Sbjct: 184 DGSPVPWALEFDHWLRDSLLQGRYGDVNEWEEKAPNAKMAHPWPEHLYPLHVVMGAAGGD 243

Query: 406 PKAKLVHHSWDLGTMSYASYQFTAS 410
            KA+ +H SW LGT+SY+SY FT+S
Sbjct: 244 AKAEQIHTSWQLGTLSYSSYSFTSS 268

BLAST of CmaCh03G000050 vs. TAIR10
Match: AT5G51960.1 (AT5G51960.1 Complex 1 LYR protein (InterPro:IPR008011))

HSP 1 Score: 119.0 bits (297), Expect = 6.8e-27
Identity = 62/108 (57.41%), Positives = 81/108 (75.00%), Query Frame = 1

Query: 1   MNSEALQVAKVYRQLLKAVKNHIGKEENKKHFVDYVAQKFREKSTLSKPHSVQQKIKLAR 60
           M+ E +  A+VYR LLKAV  H+GKE++K HF+D+V Q+FR+ +         +KI LAR
Sbjct: 1   MSGEVVLAARVYRDLLKAVVKHVGKEDHKSHFLDFVKQEFRKNAN-------SEKINLAR 60

Query: 61  DYTFLLNS-----DLLFSYNIAIDRSDEMKRILGKSAASVGLKLPEFY 104
           +YT+LLNS     DLLFSYNIA+DR++EMKR+L KSAASVGL+LPE Y
Sbjct: 61  NYTYLLNSIHSHKDLLFSYNIAVDRTEEMKRVLNKSAASVGLRLPEVY 101

BLAST of CmaCh03G000050 vs. NCBI nr
Match: gi|659069420|ref|XP_008449742.1| (PREDICTED: extradiol ring-cleavage dioxygenase [Cucumis melo])

HSP 1 Score: 418.7 bits (1075), Expect = 1.2e-113
Identity = 199/267 (74.53%), Positives = 221/267 (82.77%), Query Frame = 1

Query: 164 MALNDTFYVSHGSPTLTIDDGIKARHFFKSWKDKVFSQTPKAILCVSAHFDTAYPTVN-- 223
           M L +TFY+SHGSP ++IDD I+AR FFKSWKD VF   PKAILCVSAH+DT +PTVN  
Sbjct: 1   MGLKETFYLSHGSPMMSIDDSIQARQFFKSWKDNVFVTKPKAILCVSAHYDTTFPTVNVV 60

Query: 224 -------------------LKYPAPGAPALAKRVKEALMGAGFERVDEEKGRGLDHGAWV 283
                              LKYPAPGAPALAK VKEAL+ AGFERV+EE+GRGLDHGAWV
Sbjct: 61  SGPNGTIYDFYGFPSSMYKLKYPAPGAPALAKSVKEALVRAGFERVEEERGRGLDHGAWV 120

Query: 284 PLMFMYPEADIPVCQLSVQSRLNGTHHYNMGKALAPLKDEGVLVVGSGSATHNLRTLDRH 343
           PLMFMYPEADIPVCQLSVQS LNGTHHYN+GKALAPLKDEGVL++GSGSATHNLRTL+R+
Sbjct: 121 PLMFMYPEADIPVCQLSVQSHLNGTHHYNLGKALAPLKDEGVLIIGSGSATHNLRTLNRN 180

Query: 344 ARSSVNASWAIEFDDWLKHALLQRRYEDVNEYKNKAPNAEKAHPSPDHLFPLHVAIGAAG 403
             SS  A WA+EFD+WLK ALLQ RY+DVNEY+ KAP+A  AHPSPDH FPLHVAIGAAG
Sbjct: 181 GNSSAIAPWALEFDNWLKDALLQGRYDDVNEYEKKAPHARMAHPSPDHFFPLHVAIGAAG 240

Query: 404 GHPKAKLVHHSWDLGTMSYASYQFTAS 410
           G+PKAKL+HHSWDLGTMSYASYQFT S
Sbjct: 241 GNPKAKLIHHSWDLGTMSYASYQFTDS 267

BLAST of CmaCh03G000050 vs. NCBI nr
Match: gi|449453205|ref|XP_004144349.1| (PREDICTED: extradiol ring-cleavage dioxygenase [Cucumis sativus])

HSP 1 Score: 415.2 bits (1066), Expect = 1.3e-112
Identity = 198/267 (74.16%), Positives = 219/267 (82.02%), Query Frame = 1

Query: 164 MALNDTFYVSHGSPTLTIDDGIKARHFFKSWKDKVFSQTPKAILCVSAHFDTAYPTVN-- 223
           M L +TFY+SHGSP ++IDD I+AR FFKSWKD  +   PKAILCVSAH+DT +PTVN  
Sbjct: 1   MGLKETFYLSHGSPMMSIDDSIQARQFFKSWKDSFYVIKPKAILCVSAHYDTTFPTVNVV 60

Query: 224 -------------------LKYPAPGAPALAKRVKEALMGAGFERVDEEKGRGLDHGAWV 283
                              LKYPAPGAPALAK VKEAL+ AGFERV+EE+GRGLDHGAWV
Sbjct: 61  SGPNDTIYDFYGFPSSMYKLKYPAPGAPALAKSVKEALVRAGFERVEEERGRGLDHGAWV 120

Query: 284 PLMFMYPEADIPVCQLSVQSRLNGTHHYNMGKALAPLKDEGVLVVGSGSATHNLRTLDRH 343
           PLMFMYPEADIPVCQLSVQS LNGTHHYN+GKALAPLKDEGVL++GSGSATHNLRTL+  
Sbjct: 121 PLMFMYPEADIPVCQLSVQSHLNGTHHYNLGKALAPLKDEGVLIIGSGSATHNLRTLNHS 180

Query: 344 ARSSVNASWAIEFDDWLKHALLQRRYEDVNEYKNKAPNAEKAHPSPDHLFPLHVAIGAAG 403
             SS  A WA+EFD+WLK ALLQ RY DVNEY+ KAP+A  AHPSPDHLFPLHVAIGAAG
Sbjct: 181 GNSSAIAPWALEFDNWLKDALLQGRYNDVNEYEKKAPHARMAHPSPDHLFPLHVAIGAAG 240

Query: 404 GHPKAKLVHHSWDLGTMSYASYQFTAS 410
           G+PKAKL+HHSWDLGTMSYASYQFTAS
Sbjct: 241 GNPKAKLIHHSWDLGTMSYASYQFTAS 267

BLAST of CmaCh03G000050 vs. NCBI nr
Match: gi|388491700|gb|AFK33916.1| (unknown [Lotus japonicus])

HSP 1 Score: 368.6 bits (945), Expect = 1.4e-98
Identity = 179/266 (67.29%), Positives = 208/266 (78.20%), Query Frame = 1

Query: 164 MALNDTFYVSHGSPTLTIDDGIKARHFFKSWKDKVFSQTPKAILCVSAHFDTAYPTVN-- 223
           MAL DTFY+SHGSPTL+ID+ + AR F +SWK +VF   P +IL +S H+DTA PTVN  
Sbjct: 1   MALKDTFYISHGSPTLSIDESLVARKFLQSWKKEVFPPRPTSILVISGHWDTAVPTVNVV 60

Query: 224 -------------------LKYPAPGAPALAKRVKEALMGAGFERVDEEKGRGLDHGAWV 283
                              LKYPAPGAP LAKRVKE L   GF RVDE+K RGLDHGAWV
Sbjct: 61  DSTNDTIYDFYGFPKPTYQLKYPAPGAPHLAKRVKELLKEGGFSRVDEDKKRGLDHGAWV 120

Query: 284 PLMFMYPEADIPVCQLSVQSRLNGTHHYNMGKALAPLKDEGVLVVGSGSATHNLRTLDRH 343
           PL+ MYPEADIPVCQLSVQS L+GTHHYN+GKALAPLKDEGVL+VGSGSA HNLR L+RH
Sbjct: 121 PLLLMYPEADIPVCQLSVQSNLDGTHHYNIGKALAPLKDEGVLIVGSGSAVHNLRALERH 180

Query: 344 ARSSVNASWAIEFDDWLKHALLQRRYEDVNEYKNKAPNAEKAHPSPDHLFPLHVAIGAAG 403
           A  +V A WA+EFD+WLK ALL+ RYEDVN Y+ KAP+A+KAHP PDHL+PLHVA+GAAG
Sbjct: 181 A--TVAAPWAVEFDNWLKEALLEGRYEDVNHYEQKAPHAKKAHPWPDHLYPLHVAVGAAG 240

Query: 404 GHPKAKLVHHSWDLGTMSYASYQFTA 409
            + KAKL+H S DLG++SYASYQFT+
Sbjct: 241 ENSKAKLIHSSIDLGSLSYASYQFTS 264

BLAST of CmaCh03G000050 vs. NCBI nr
Match: gi|388501808|gb|AFK38970.1| (unknown [Lotus japonicus])

HSP 1 Score: 367.5 bits (942), Expect = 3.1e-98
Identity = 179/266 (67.29%), Positives = 207/266 (77.82%), Query Frame = 1

Query: 164 MALNDTFYVSHGSPTLTIDDGIKARHFFKSWKDKVFSQTPKAILCVSAHFDTAYPTVN-- 223
           MAL DTFY+SHGSPTL+ID+ + AR F +SWK +VF   P +IL +S H+DTA PTVN  
Sbjct: 1   MALKDTFYISHGSPTLSIDESLVARKFLQSWKKEVFPPRPTSILVISGHWDTAVPTVNVV 60

Query: 224 -------------------LKYPAPGAPALAKRVKEALMGAGFERVDEEKGRGLDHGAWV 283
                              LKYPAPGAP LAKRVKE L   GF RVDE+K RGLDHGAWV
Sbjct: 61  DSTNDTIYDFYGFPKPMYQLKYPAPGAPHLAKRVKELLKEGGFSRVDEDKKRGLDHGAWV 120

Query: 284 PLMFMYPEADIPVCQLSVQSRLNGTHHYNMGKALAPLKDEGVLVVGSGSATHNLRTLDRH 343
           PL+ MYPEADIPVCQLSVQS L+GTHHYN+GKALAPLKDEGVL+VGSGSA HNLR L+RH
Sbjct: 121 PLLLMYPEADIPVCQLSVQSNLDGTHHYNIGKALAPLKDEGVLIVGSGSAVHNLRALERH 180

Query: 344 ARSSVNASWAIEFDDWLKHALLQRRYEDVNEYKNKAPNAEKAHPSPDHLFPLHVAIGAAG 403
           A  +V A WA+EFD+WLK ALL+ RYEDVN Y+ KAP+A+KAHP PDH +PLHVAIGAAG
Sbjct: 181 A--TVAAPWAVEFDNWLKEALLEGRYEDVNHYEQKAPHAKKAHPWPDHFYPLHVAIGAAG 240

Query: 404 GHPKAKLVHHSWDLGTMSYASYQFTA 409
            + KAKL+H S DLG++SYASYQFT+
Sbjct: 241 ENSKAKLIHSSIDLGSLSYASYQFTS 264

BLAST of CmaCh03G000050 vs. NCBI nr
Match: gi|590709699|ref|XP_007048625.1| (Catalytic LigB subunit of aromatic ring-opening dioxygenase family isoform 1 [Theobroma cacao])

HSP 1 Score: 364.4 bits (934), Expect = 2.6e-97
Identity = 174/266 (65.41%), Positives = 204/266 (76.69%), Query Frame = 1

Query: 164 MALNDTFYVSHGSPTLTIDDGIKARHFFKSWKDKVFSQTPKAILCVSAHFDTAYPTVN-- 223
           + + DTFY+SHGSPTL+IDD + ARHF +SWKD VF QTPK+IL VS H+DT+YP VN  
Sbjct: 6   LTMKDTFYISHGSPTLSIDDSLPARHFLQSWKDTVFGQTPKSILVVSGHWDTSYPAVNMV 65

Query: 224 ------------------LKYPAPGAPALAKRVKEALMGAGFERVDEEKGRGLDHGAWVP 283
                             LKYPAPGAP LAKRVKE LM +G +RVDE+K RGLDHGAWVP
Sbjct: 66  QRNDTIYDFYGFPDKMYKLKYPAPGAPELAKRVKELLMASGLKRVDEDKKRGLDHGAWVP 125

Query: 284 LMFMYPEADIPVCQLSVQSRLNGTHHYNMGKALAPLKDEGVLVVGSGSATHNLRTLDRHA 343
           LM MYPEADIPVCQLSVQSR +GT+HYN+GKALAPLKDEGVL++GSG+ATHNLR L    
Sbjct: 126 LMLMYPEADIPVCQLSVQSRRDGTYHYNLGKALAPLKDEGVLIIGSGAATHNLRALGN-- 185

Query: 344 RSSVNASWAIEFDDWLKHALLQRRYEDVNEYKNKAPNAEKAHPSPDHLFPLHVAIGAAGG 403
            +     WA EFD WLK ALL+ RYEDVN ++ KAP A+ AHP PDH +PLHVA+GAAG 
Sbjct: 186 LNGAVVPWASEFDTWLKDALLEGRYEDVNHFQEKAPYAKMAHPWPDHFYPLHVAMGAAGE 245

Query: 404 HPKAKLVHHSWDLGTMSYASYQFTAS 410
             KAKL+H SW+LG++SYASYQFTA+
Sbjct: 246 SSKAKLIHQSWELGSLSYASYQFTAA 269

The following BLAST results are available for this feature:

Swiss-Prot
Match Name    E-value   Identity (%)  Description
DIOXL_ARATH   2.2e-91   61.51         Extradiol ring-cleavage dioxygenase OS=Arabidopsis thaliana GN=LIGB PE=2 SV=1
DODA_BETVU    7.7e-81   54.17         4,5-DOPA dioxygenase extradiol OS=Beta vulgaris GN=DODA PE=1 SV=1
DOD1U_BETVU   9.4e-79   52.65         4,5-DOPA dioxygenase extradiol 1 OS=Beta vulgaris GN=DODA1 PE=2 SV=1
DOD1W_BETVU   2.7e-78   52.27         4,5-DOPA dioxygenase extradiol 1 OS=Beta vulgaris GN=DODA1 PE=2 SV=1
DODA_PORGR    1.0e-69   50.00         4,5-DOPA dioxygenase extradiol OS=Portulaca grandiflora GN=DODA PE=1 SV=1

TrEMBL
Match Name         E-value    Identity (%)  Description
A0A0A0KYZ9_CUCSA   9.0e-113   74.16         Uncharacterized protein OS=Cucumis sativus GN=Csa_4G443120 PE=4 SV=1
I3S0X4_LOTJA       9.7e-99    67.29         Uncharacterized protein OS=Lotus japonicus PE=2 SV=1
I3SFC8_LOTJA       2.2e-98    67.29         Uncharacterized protein OS=Lotus japonicus PE=2 SV=1
A0A061DK69_THECC   1.8e-97    65.41         Catalytic LigB subunit of aromatic ring-opening dioxygenase family isoform 1 OS=...
G7KNT9_MEDTR       4.5e-96    66.92         4,5-DOPA dioxygenase extradiol-like protein OS=Medicago truncatula GN=MTR_6g0649...

TAIR10
Match Name    E-value   Identity (%)  Description
AT4G15093.1   1.2e-92   61.51         catalytic LigB subunit of aromatic ring-opening dioxygenase family
AT5G51960.1   6.8e-27   57.41         Complex 1 LYR protein (InterPro:IPR008011)

NCBI nr
Match Name                         E-value    Identity (%)  Description
gi|659069420|ref|XP_008449742.1|   1.2e-113   74.53         PREDICTED: extradiol ring-cleavage dioxygenase [Cucumis melo]
gi|449453205|ref|XP_004144349.1|   1.3e-112   74.16         PREDICTED: extradiol ring-cleavage dioxygenase [Cucumis sativus]
gi|388491700|gb|AFK33916.1|        1.4e-98    67.29         unknown [Lotus japonicus]
gi|388501808|gb|AFK38970.1|        3.1e-98    67.29         unknown [Lotus japonicus]
gi|590709699|ref|XP_007048625.1|   2.6e-97    65.41         Catalytic LigB subunit of aromatic ring-opening dioxygenase family isoform 1 [Th...
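
Hits in these summaries are compared by E-value (smaller is better) and by percent identity. The sketch below ranks a few of the NCBI nr hits listed above and applies a cutoff. The tuple layout, the function names, and the threshold values are illustrative assumptions, not settings used by the database.

```python
# (match name, E-value as printed, percent identity), copied from the NCBI nr hits.
hits = [
    ("gi|388491700|gb|AFK33916.1|", "1.4e-98", 67.29),
    ("gi|659069420|ref|XP_008449742.1|", "1.2e-113", 74.53),
    ("gi|449453205|ref|XP_004144349.1|", "1.3e-112", 74.16),
]

def best_hit(rows):
    """Return the row with the smallest E-value."""
    return min(rows, key=lambda row: float(row[1]))

def filter_hits(rows, max_evalue=1e-100, min_identity=70.0):
    """Keep rows at or below the E-value cutoff and at or above the identity cutoff."""
    return [
        row for row in rows
        if float(row[1]) <= max_evalue and row[2] >= min_identity
    ]

print(best_hit(hits)[0])        # gi|659069420|ref|XP_008449742.1|
print(len(filter_hits(hits)))   # 2
```

E-values this small still fit in a Python float, so converting the printed strings with `float` is sufficient for sorting.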
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR004183   Xdiol_dOase_suB
Vocabulary: Biological Process
Term        Definition
GO:0006725  cellular aromatic compound metabolic process
Vocabulary: Molecular Function
Term        Definition
GO:0008198  ferrous iron binding
GO:0016491  oxidoreductase activity
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006725 cellular aromatic compound metabolic process
biological_process GO:0055114 oxidation-reduction process
cellular_component GO:0005575 cellular_component
molecular_function GO:0008198 ferrous iron binding
molecular_function GO:0016701 oxidoreductase activity, acting on single donors with incorporation of molecular oxygen
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0051213 dioxygenase activity
molecular_function GO:0016491 oxidoreductase activity
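
Each GO assignment row consists of a category, an accession, and a term name. A minimal Python sketch of grouping such rows by category follows; the pattern and the function name `group_go_terms` are assumptions for illustration, and the sample rows are copied from the table above.

```python
import re
from collections import defaultdict

GO_LINE = re.compile(
    r"^(biological_process|cellular_component|molecular_function)"
    r"\s+(GO:\d{7})\s+(.+)$"
)

def group_go_terms(lines):
    """Group 'category accession name' rows into {category: [(accession, name)]}."""
    grouped = defaultdict(list)
    for line in lines:
        m = GO_LINE.match(line.strip())
        if m:  # skip header rows and anything that is not a GO row
            grouped[m.group(1)].append((m.group(2), m.group(3)))
    return dict(grouped)

rows = [
    "Category Term Accession Term Name",
    "biological_process GO:0006725 cellular aromatic compound metabolic process",
    "molecular_function GO:0008198 ferrous iron binding",
    "molecular_function GO:0051213 dioxygenase activity",
]
terms = group_go_terms(rows)
print(len(terms["molecular_function"]))  # 2
```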

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
CmaCh03G000050.1   CmaCh03G000050.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description                                                   Source   Source Term        Source Description   Alignment
IPR004183  Extradiol ring-cleavage dioxygenase, class III enzyme, subunit B  GENE3D   G3DSA:3.40.830.10  -                    coord: 164..406, score: 1.5
IPR004183  Extradiol ring-cleavage dioxygenase, class III enzyme, subunit B  PFAM     PF02900            LigB                 coord: 169..406, score: 7.4
IPR004183  Extradiol ring-cleavage dioxygenase, class III enzyme, subunit B  unknown  SSF53213           LigB-like            coord: 169..406, score: 1.83
None       No IPR available                                                  PANTHER  PTHR30096          UNCHARACTERIZED      coord: 164..409, score: 1.1E
None       No IPR available                                                  PANTHER  PTHR30096:SF0      SUBFAMILY NOT NAMED  coord: 164..409, score: 1.1E
None       No IPR available                                                  PFAM     PF13233            Complex1_LYR_2       coord: 9..97, score: 6.
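
The `coord` values give the residue range each signature covers on the protein. The sketch below computes the length of two of the Pfam ranges listed above and checks whether they overlap. It assumes the ranges are 1-based and inclusive, and the helper names `domain_span` and `overlaps` are hypothetical.

```python
import re

def domain_span(coord):
    """Return (start, end, length) from a 'coord: start..end' string."""
    m = re.search(r"(\d+)\.\.(\d+)", coord)
    if m is None:
        raise ValueError(f"no coordinate range in {coord!r}")
    start, end = int(m.group(1)), int(m.group(2))
    # Assumption: 1-based inclusive residue positions.
    return start, end, end - start + 1

def overlaps(a, b):
    """True if two (start, end, length) spans share at least one residue."""
    return a[0] <= b[1] and b[0] <= a[1]

ligb = domain_span("coord: 169..406")  # PF02900 LigB
lyr = domain_span("coord: 9..97")      # PF13233 Complex1_LYR_2
print(ligb[2], lyr[2], overlaps(ligb, lyr))  # 238 89 False
```

The two Pfam ranges do not overlap: the Complex1_LYR_2 match lies near the N-terminus and the LigB match covers the C-terminal part of the protein.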

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene             Organism                       Block
CmaCh03G000050   Watermelon (Charleston Gray)   cmawcgB594
CmaCh03G000050   Watermelon (97103) v1          cmawmB643
CmaCh03G000050   Watermelon (97103) v2          cmawmbB693