CmaCh04G005620 (gene) Cucurbita maxima (Rimu)

NameCmaCh04G005620
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu))
DescriptionHomogentisate 1,2-dioxygenase
LocationCma_Chr04 : 2862063 .. 2865925 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTTAACATATAAATGCAATTTCGAGTTCATCTCTATCTTGTAACATGGTGAGCTTCTTGAACACGTCATTACGTGATTTCGATCGTGAATCTAGTTGCGTGGGGGAATCACCGATCATTGGAACCGCCGCGTGTTCTCCGGTGCCGTTTTCCGGTTTCAATAGTCTCCACAATCGATCATCATCTTCATCTGCTTTTTATTCTTCTTCAGTTTCAATGGCTGCTCAATCGGTCGGCGAAATGGACGGCGAAAATTTCCCTTCCGACCTCGCTTACCAGTCCGGCTTCAACAATCATTTCTCGTCGGAGGCTATTCCCGGTGCTCTTCCTCAATTGCAAAACAGTCCTCTGATATGCCCGTACGGCCTCTATGCCGAGCAGATCTCTGGCACTTCCTTCACATCGCCTCGCAAAGTCAACATGTGCAGGTATTCTTTTGTCTTCTCGTGTTGTCAGTGTGGTTTTTCTGGTTTTGGTGTCTTTGTTGTTTACGCATTTCGGATTTTGGTGCGCAGTTGGCTGTATCGAATTAAGCCGTCGGTCACGCATGAACCGTTTAGGCCACGTTTGCCTAAAAACGAGAAGTTGATCAGTGAATTTAATGCGTCAAATTGTTCGTCGACTCCGACTCAGCTAAGGTGGAGACCGGCGGATGTTCCTGATTCACCGCTGGATTTCGTTGATGGTCTGTACACTGTTTGTGGAGCCGGCAGTTCGTTTCTTCGGCATGGTTTTGCTATTCACATGTAAGTATGTAATCAGTATTCCTTCTACGGTTTCTTAGCTTTGAATCTTTTTCTAACAATCAGTAAATTTTTTAGGGAAGGTATTATTTGTCGGATTTATGGGTGGGGTATTTTGTCAGAGCAGAAAATATGAGCCCTTAAGTTCGACTCTCTTTTTCCCACCATGGGGGTCGTTCATTACCAACACGAAACAGTACAAATTTTATACATCGTGTTACGAGTATTGATCCCCCCAATAAATAATATTCTCGATACCTTTTTCTGGGATTTGGTGGGGAAGAAAGCAATAAGATACCCTTTTAGACTGGTCATTGGTTGAATGCTATGACTGAAACAGTATTGATCATTTGTATAGGTACACAGCCAATAAGTCAATGGAGAACTGTGCGTTCTGTAATGCTGACGGCGACTTCTTGATAGTCCCTCAGAGTGGAAGTGAGTTATTGTTGGTAGTTACTTTCATTGAAAATTATGCACGTAGTGGGCTTCAATCTGAGTATTTTTATTCGGAAAGTCCAGTTAGTTTTAGATTCTTTGCACGAACATCTTGTGCGATCTGTAAAGCTGATTCCAATTCTATGTTTTCAACTTTTGGCTACATACATAATCCTCAACTGTCTTTGCATTATCTTGTCATTGAGTTATGGAAGTCTCTGCTGTATATTGAATCTGAAGGAAACTATATTCCTGTCTTAAAAAGTTTTAAGGAAACAGGATCAAACTCTTATAGAATATTCTTATTCTACAGGGCTGTGGATTATTACCGAGTGTGGAAGACTGGAAGTTTCTCCTGGTGAAATAGTGGTTTTACCTCAAGGTTTTCGCTTTGTTGTTTATCTGCCTGATGGTCCATCACGTGGCTACGTAGCTGAGATTTTTGGCAGTCATTTTCAGCTTCCTGATCTCGGACCAATAGGTATCTAACCTCTTATTATCTTCTCGACAGCCATTTATTTAGGTTTTGACCACATTCTTTTCCTTTTTTTCCCGAACATTGGGAAATAGTTTTCTGAGTTTTGCATTTTCAACTAATGTAATATTGAGGGCTAAGTTCATTGACATGTAGGTGCTAATGGTCTTGCTGCACCAAGGGATTTTCTTGCCCCTGTAGCCTGGTTTGAAAATATTTCTCGTCCCGGTTACACAATTGTGCAAAAATATGGTGGGGAATTGTTTACTGCCATACAGGACTTTTCTCCCTTTAATGTAGTTGCCTGGCATGGTAATTATGTTCCCTATAAGGTGGGTGTTGCGTATGCCATGCCAATATAGCTTAACTTTGGTTTCTTTTTGTGGAATTAAAAATAAGTTTGTCTATACTTTCTTTCAGGCTTTTTGCTGTTTATGTCTTGTTATAGGCTAATGTTTCCTATCTTTGCACGTGTGTGCCAATTCCACCTCAATATAACTGGAAGTTGATTCTTCTCCTTGCAGTACGATCTTAGTAAGTTCTGCCCTTACAATACTGTGTTGTTTGATCACAGTGATCCATCAATAAATACAGGTACATACATATAATGCATTAATTTATTTCGCTTTAAGTATACTGCTTTAAAATCCTTGACATTTCATTGAATTTCAAGTGTTTTATTCACCTATCTGAATTATCATGTTGCTGCCCTGTTGTAGTATTAACGGCACCAACTGATAAACCCGGAGTAGCGTTACTCGACTTTGTCATTTTCCCTCCAAGATGGCTTGTCTCTGAACACACATTCCGCCCCCCATATTACCATCGCAACTGCATGAGTGAATTCATGGGTCTCATATATGGAGGCTACGAGGTAATTTTCATCGTCTCGTTAAATAAAGAGACTGAATGAACCAAAATATTTAGGGAAGGAAGCGAAGTTGCTTCTTTTATTATTACTCTACTTGCCTTATAAAGTCGTAAAAGCCTCTCCTTGCCTGTTACCTTAATGATAATGTTGTTTCTAGGTAGAAAATTGGATCTGACTTCTCCCCCTTTACTTCGACAGGGTTCTAGTTTACATTTCAAATGGATTGATTTTATTCCAGTAATGTATTTTTTTCCCTAGATCTTTAGAATTTCTGATCCATCTTTCAGGCAAAAGCTGACGGGTTCCTTCCTGGGGGTGCCAGCCTTCATAATTGCATGACTCCCCATGGCCCCGATACTAAAACGTACGAGGTATGCAGTGCTTCTTATAAGTTCCATTAGTCATTTAATTTCCACCACATTCAAACTTTCTACCTCACAAGTTTCTCTGATGGAAATTAATTATAAATCCAAGCTTCCATCTTGAAGGTACAGCATGATAATATAAAGCATCGTCTTTCTTCTATAATTGATATGATTAGTAAAAGATAAATATATTTTATTGGTTTATTTTTTTTCGATTCTATAAATATCAACAAACTTTCTATACTTGGAAAGCCACAATTTTTTCTTTTAACATCCACGAGGGACTCGTAGTCACCGTGGCTTGACCCGATTCACTCTAGAACCTCTAAACCCTTTATAGATTACATGATCCTTATTGACCACTATGTCAACTCATGGAAGTTTACTTTACTTGGAAAGCTACAAATGGATCTTCTCATTCTTCCAACAGTATATACTTTTAGTTTAATTGATCTTAGACCATTCCTTCTATGATTCGACGAAAAGTCAGTAAGCATTCTCTTGATTTAGGCCACTATTGCTCGAGGAAACGAGGCTGGACCTTATCGAATCACCGACACAATGGCATTTATGTTTGAATCAAGTCTGATCCCTCGCGTATGCTCTTGGGCTCTGGAGTCTCCATGCATGGATCATGACTACTACCAATGCTGGATAGGATTGAAATCTCATTTCACAAGTGAAGGAACAACTAGAGACATGGATCCACGGAAGGTAAAAACTGATGTTGAAAATGGAAGACAGACTGTGTAGATATAATGTTGGATGAACATCATGGATTTCATGAAAAGGTTTCCATCCAAAGTGTACATATAAGAATAAAATTATGAATCTTATAAAAAATAATAATAAATTATGACAAAATGATGAATCTTCTTGCTGTAACTTCTTTACTTTGGTAATGCCAACATTTGTTAATCTTTCAATAGTAAATATTTGATCTGTCCTTAGCTCAAGGGAGATCCCCCTTGC

mRNA sequence

TTTAACATATAAATGCAATTTCGAGTTCATCTCTATCTTGTAACATGGTGAGCTTCTTGAACACGTCATTACGTGATTTCGATCGTGAATCTAGTTGCGTGGGGGAATCACCGATCATTGGAACCGCCGCGTGTTCTCCGGTGCCGTTTTCCGGTTTCAATAGTCTCCACAATCGATCATCATCTTCATCTGCTTTTTATTCTTCTTCAGTTTCAATGGCTGCTCAATCGGTCGGCGAAATGGACGGCGAAAATTTCCCTTCCGACCTCGCTTACCAGTCCGGCTTCAACAATCATTTCTCGTCGGAGGCTATTCCCGGTGCTCTTCCTCAATTGCAAAACAGTCCTCTGATATGCCCGTACGGCCTCTATGCCGAGCAGATCTCTGGCACTTCCTTCACATCGCCTCGCAAAGTCAACATGTGCAGTTGGCTGTATCGAATTAAGCCGTCGGTCACGCATGAACCGTTTAGGCCACGTTTGCCTAAAAACGAGAAGTTGATCAGTGAATTTAATGCGTCAAATTGTTCGTCGACTCCGACTCAGCTAAGGTGGAGACCGGCGGATGTTCCTGATTCACCGCTGGATTTCGTTGATGGTCTGTACACTGTTTGTGGAGCCGGCAGTTCGTTTCTTCGGCATGGTTTTGCTATTCACATGTACACAGCCAATAAGTCAATGGAGAACTGTGCGTTCTGTAATGCTGACGGCGACTTCTTGATAGTCCCTCAGAGTGGAAGGCTGTGGATTATTACCGAGTGTGGAAGACTGGAAGTTTCTCCTGGTGAAATAGTGGTTTTACCTCAAGGTTTTCGCTTTGTTGTTTATCTGCCTGATGGTCCATCACGTGGCTACGTAGCTGAGATTTTTGGCAGTCATTTTCAGCTTCCTGATCTCGGACCAATAGGTGCTAATGGTCTTGCTGCACCAAGGGATTTTCTTGCCCCTGTAGCCTGGTTTGAAAATATTTCTCGTCCCGGTTACACAATTGTGCAAAAATATGGTGGGGAATTGTTTACTGCCATACAGGACTTTTCTCCCTTTAATGTAGTTGCCTGGCATGGTAATTATGTTCCCTATAAGTACGATCTTAGTAAGTTCTGCCCTTACAATACTGTGTTGTTTGATCACAGTGATCCATCAATAAATACAGTATTAACGGCACCAACTGATAAACCCGGAGTAGCGTTACTCGACTTTGTCATTTTCCCTCCAAGATGGCTTGTCTCTGAACACACATTCCGCCCCCCATATTACCATCGCAACTGCATGAGTGAATTCATGGGTCTCATATATGGAGGCTACGAGGCAAAAGCTGACGGGTTCCTTCCTGGGGGTGCCAGCCTTCATAATTGCATGACTCCCCATGGCCCCGATACTAAAACGTACGAGGCCACTATTGCTCGAGGAAACGAGGCTGGACCTTATCGAATCACCGACACAATGGCATTTATGTTTGAATCAAGTCTGATCCCTCGCGTATGCTCTTGGGCTCTGGAGTCTCCATGCATGGATCATGACTACTACCAATGCTGGATAGGATTGAAATCTCATTTCACAAGTGAAGGAACAACTAGAGACATGGATCCACGGAAGGTAAAAACTGATGTTGAAAATGGAAGACAGACTGTGTAGATATAATGTTGGATGAACATCATGGATTTCATGAAAAGGTTTCCATCCAAAGTGTACATATAAGAATAAAATTATGAATCTTATAAAAAATAATAATAAATTATGACAAAATGATGAATCTTCTTGCTGTAACTTCTTTACTTTGGTAATGCCAACATTTGTTAATCTTTCAATAGTAAATATTTGATCTGTCCTTAGCTCAAGGGAGATCCCCCTTGC

Coding sequence (CDS)

ATGGTGAGCTTCTTGAACACGTCATTACGTGATTTCGATCGTGAATCTAGTTGCGTGGGGGAATCACCGATCATTGGAACCGCCGCGTGTTCTCCGGTGCCGTTTTCCGGTTTCAATAGTCTCCACAATCGATCATCATCTTCATCTGCTTTTTATTCTTCTTCAGTTTCAATGGCTGCTCAATCGGTCGGCGAAATGGACGGCGAAAATTTCCCTTCCGACCTCGCTTACCAGTCCGGCTTCAACAATCATTTCTCGTCGGAGGCTATTCCCGGTGCTCTTCCTCAATTGCAAAACAGTCCTCTGATATGCCCGTACGGCCTCTATGCCGAGCAGATCTCTGGCACTTCCTTCACATCGCCTCGCAAAGTCAACATGTGCAGTTGGCTGTATCGAATTAAGCCGTCGGTCACGCATGAACCGTTTAGGCCACGTTTGCCTAAAAACGAGAAGTTGATCAGTGAATTTAATGCGTCAAATTGTTCGTCGACTCCGACTCAGCTAAGGTGGAGACCGGCGGATGTTCCTGATTCACCGCTGGATTTCGTTGATGGTCTGTACACTGTTTGTGGAGCCGGCAGTTCGTTTCTTCGGCATGGTTTTGCTATTCACATGTACACAGCCAATAAGTCAATGGAGAACTGTGCGTTCTGTAATGCTGACGGCGACTTCTTGATAGTCCCTCAGAGTGGAAGGCTGTGGATTATTACCGAGTGTGGAAGACTGGAAGTTTCTCCTGGTGAAATAGTGGTTTTACCTCAAGGTTTTCGCTTTGTTGTTTATCTGCCTGATGGTCCATCACGTGGCTACGTAGCTGAGATTTTTGGCAGTCATTTTCAGCTTCCTGATCTCGGACCAATAGGTGCTAATGGTCTTGCTGCACCAAGGGATTTTCTTGCCCCTGTAGCCTGGTTTGAAAATATTTCTCGTCCCGGTTACACAATTGTGCAAAAATATGGTGGGGAATTGTTTACTGCCATACAGGACTTTTCTCCCTTTAATGTAGTTGCCTGGCATGGTAATTATGTTCCCTATAAGTACGATCTTAGTAAGTTCTGCCCTTACAATACTGTGTTGTTTGATCACAGTGATCCATCAATAAATACAGTATTAACGGCACCAACTGATAAACCCGGAGTAGCGTTACTCGACTTTGTCATTTTCCCTCCAAGATGGCTTGTCTCTGAACACACATTCCGCCCCCCATATTACCATCGCAACTGCATGAGTGAATTCATGGGTCTCATATATGGAGGCTACGAGGCAAAAGCTGACGGGTTCCTTCCTGGGGGTGCCAGCCTTCATAATTGCATGACTCCCCATGGCCCCGATACTAAAACGTACGAGGCCACTATTGCTCGAGGAAACGAGGCTGGACCTTATCGAATCACCGACACAATGGCATTTATGTTTGAATCAAGTCTGATCCCTCGCGTATGCTCTTGGGCTCTGGAGTCTCCATGCATGGATCATGACTACTACCAATGCTGGATAGGATTGAAATCTCATTTCACAAGTGAAGGAACAACTAGAGACATGGATCCACGGAAGGTAAAAACTGATGTTGAAAATGGAAGACAGACTGTGTAG

Protein sequence

MVSFLNTSLRDFDRESSCVGESPIIGTAACSPVPFSGFNSLHNRSSSSSAFYSSSVSMAAQSVGEMDGENFPSDLAYQSGFNNHFSSEAIPGALPQLQNSPLICPYGLYAEQISGTSFTSPRKVNMCSWLYRIKPSVTHEPFRPRLPKNEKLISEFNASNCSSTPTQLRWRPADVPDSPLDFVDGLYTVCGAGSSFLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQSGRLWIITECGRLEVSPGEIVVLPQGFRFVVYLPDGPSRGYVAEIFGSHFQLPDLGPIGANGLAAPRDFLAPVAWFENISRPGYTIVQKYGGELFTAIQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLFDHSDPSINTVLTAPTDKPGVALLDFVIFPPRWLVSEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGASLHNCMTPHGPDTKTYEATIARGNEAGPYRITDTMAFMFESSLIPRVCSWALESPCMDHDYYQCWIGLKSHFTSEGTTRDMDPRKVKTDVENGRQTV
BLAST of CmaCh04G005620 vs. Swiss-Prot
Match: HGD_ARATH (Homogentisate 1,2-dioxygenase OS=Arabidopsis thaliana GN=HGO PE=2 SV=2)

HSP 1 Score: 790.4 bits (2040), Expect = 1.2e-227
Identity = 357/432 (82.64%), Postives = 390/432 (90.28%), Query Frame = 1

Query: 74  DLAYQSGFNNHFSSEAIPGALPQLQNSPLICPYGLYAEQISGTSFTSPRKVNMCSWLYRI 133
           +L YQSGF NHFSSEAI GALP  QNSPL+CPYGLYAEQISGTSFTSPRK+N  SWLYR+
Sbjct: 10  ELKYQSGFGNHFSSEAIAGALPLDQNSPLLCPYGLYAEQISGTSFTSPRKLNQRSWLYRV 69

Query: 134 KPSVTHEPFRPRLPKNEKLISEFNASNCSSTPTQLRWRPADVPDSPLDFVDGLYTVCGAG 193
           KPSVTHEPF+PR+P ++KL+SEF+ASN  + PTQLRWRP D+PDS +DFVDGL+T+CGAG
Sbjct: 70  KPSVTHEPFKPRVPAHKKLVSEFDASNSRTNPTQLRWRPEDIPDSEIDFVDGLFTICGAG 129

Query: 194 SSFLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQSGRLWIITECGRLEVSPGEIVVLP 253
           SSFLRHGFAIHMY AN  M++ AFCNADGDFL+VPQ+GRLWI TECGRL V+PGEI V+P
Sbjct: 130 SSFLRHGFAIHMYVANTGMKDSAFCNADGDFLLVPQTGRLWIETECGRLLVTPGEIAVIP 189

Query: 254 QGFRFVVYLPDGPSRGYVAEIFGSHFQLPDLGPIGANGLAAPRDFLAPVAWFENISRPGY 313
           QGFRF + LPDG SRGYVAEI+G+HFQLPDLGPIGANGLAA RDFLAP AWFE+  RP Y
Sbjct: 190 QGFRFSIDLPDGKSRGYVAEIYGAHFQLPDLGPIGANGLAASRDFLAPTAWFEDGLRPEY 249

Query: 314 TIVQKYGGELFTAIQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLFDHSDPSINTVLTA 373
           TIVQK+GGELFTA QDFSPFNVVAWHGNYVPYKYDL KFCPYNTVL DH DPSINTVLTA
Sbjct: 250 TIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLKKFCPYNTVLLDHGDPSINTVLTA 309

Query: 374 PTDKPGVALLDFVIFPPRWLVSEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGAS 433
           PTDKPGVALLDFVIFPPRWLV+EHTFRPPYYHRNCMSEFMGLIYG YEAKADGFLPGGAS
Sbjct: 310 PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGAYEAKADGFLPGGAS 369

Query: 434 LHNCMTPHGPDTKTYEATIARGNEAGPYRITDTMAFMFESSLIPRVCSWALESPCMDHDY 493
           LH+CMTPHGPDT TYEATIAR N   P ++T TMAFMFES+LIPRVC WALESP +DHDY
Sbjct: 370 LHSCMTPHGPDTTTYEATIARVNAMAPSKLTGTMAFMFESALIPRVCHWALESPFLDHDY 429

Query: 494 YQCWIGLKSHFT 506
           YQCWIGLKSHF+
Sbjct: 430 YQCWIGLKSHFS 441

BLAST of CmaCh04G005620 vs. Swiss-Prot
Match: HGD_ORYSJ (Homogentisate 1,2-dioxygenase OS=Oryza sativa subsp. japonica GN=HGO PE=2 SV=1)

HSP 1 Score: 752.3 bits (1941), Expect = 3.6e-216
Identity = 346/450 (76.89%), Postives = 387/450 (86.00%), Query Frame = 1

Query: 74  DLAYQSGFNNHFSSEAIPGALPQLQNSPLICPYGLYAEQISGTSFTSPRKVNMCSWLYRI 133
           +  Y SG  N  SSEA+ G LP+ QNSPL+CP GLYAEQ+SGT FT+PR  N+ +WLYRI
Sbjct: 21  EYVYLSGLGNSLSSEAVAGTLPRGQNSPLVCPLGLYAEQLSGTPFTAPRARNLRTWLYRI 80

Query: 134 KPSVTHEPFRPRLPKNEKLISEFNASNCSS--TPTQLRWRPADVPDS--PLDFVDGLYTV 193
           KPSVTHEPF PR P + +LI +F+ +   +  TPTQLRWRPADVP    PLDF+DGLYTV
Sbjct: 81  KPSVTHEPFHPRRPAHPRLIGDFDRTTTDTVATPTQLRWRPADVPPHHPPLDFIDGLYTV 140

Query: 194 CGAGSSFLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQSGRLWIITECGRLEVSPGEI 253
           CGAGSSFLRHG+AIHMY ANKSM+ CAFCNADGDFLIVPQ G+L I TECG+L V PGEI
Sbjct: 141 CGAGSSFLRHGYAIHMYAANKSMDGCAFCNADGDFLIVPQQGKLLITTECGKLLVPPGEI 200

Query: 254 VVLPQGFRFVVYLPDGPSRGYVAEIFGSHFQLPDLGPIGANGLAAPRDFLAPVAWFENIS 313
           VV+PQGFRF V LPDGPSRGYV+EIFG+HFQLPDLGPIGANGLA+ RDFL+P AWFE + 
Sbjct: 201 VVIPQGFRFAVDLPDGPSRGYVSEIFGTHFQLPDLGPIGANGLASARDFLSPTAWFEQVH 260

Query: 314 RPGYTIVQKYGGELFTAIQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLFDHSDPSINT 373
           RPGYTIVQKYGGELFTA QDFSPFNVVAWHGNYVPYKYDLSKFCP+NTVLFDH+DPS+NT
Sbjct: 261 RPGYTIVQKYGGELFTATQDFSPFNVVAWHGNYVPYKYDLSKFCPFNTVLFDHADPSVNT 320

Query: 374 VLTAPTDKPGVALLDFVIFPPRWLVSEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLP 433
           VLTAPTDKPGVALLDFVIFPPRWLV+E+TFRPPYYHRNCMSEFMGLIYG YEAKADGFLP
Sbjct: 321 VLTAPTDKPGVALLDFVIFPPRWLVAENTFRPPYYHRNCMSEFMGLIYGIYEAKADGFLP 380

Query: 434 GGASLHNCMTPHGPDTKTYEATIARGNEAGPYRITDTMAFMFESSLIPRVCSWALESPCM 493
           GGASLH+CMTPHGPDTKTYEATI+R +   P R++ T+AFMFES+LIPRVC WAL+SP  
Sbjct: 381 GGASLHSCMTPHGPDTKTYEATISRPDANEPSRLSGTLAFMFESALIPRVCQWALDSPSR 440

Query: 494 DHDYYQCWIGLKSHFTSE--GTTRDMDPRK 518
           D DYYQCWIGLKSHF+ +  G T +   RK
Sbjct: 441 DLDYYQCWIGLKSHFSHDNGGATSEEPCRK 470

BLAST of CmaCh04G005620 vs. Swiss-Prot
Match: HGD_DICDI (Homogentisate 1,2-dioxygenase OS=Dictyostelium discoideum GN=hgd PE=2 SV=1)

HSP 1 Score: 524.2 bits (1349), Expect = 1.6e-147
Identity = 258/432 (59.72%), Postives = 315/432 (72.92%), Query Frame = 1

Query: 74  DLAYQSGFNNHFSSEAIPGALPQLQNSPLICPYGLYAEQISGTSFTSPRKVNMCSWLYRI 133
           D  YQSGF N F SEAI G LP+ +N+P  CP  LYAEQ+SG +FT+PR     SWLYRI
Sbjct: 7   DYEYQSGFGNSFESEAIKGTLPKGRNAPQNCPLDLYAEQLSGNAFTAPRHTQQRSWLYRI 66

Query: 134 KPSVTHEPFRPRLPKNEKLISEFNASNCSSTPTQLRWRPADVP-DSPLDFVDGLYTVCGA 193
           +PSV H P +P    +  L+ + N  N    P QLRW+P  +  D P DFV+GL T+ GA
Sbjct: 67  RPSVCHTPLKPI---DSGLVCDLN--NLHVDPNQLRWKPFPITEDKPHDFVEGLITIAGA 126

Query: 194 GSSFLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQSGRLWIITECGRLEVSPGEIVVL 253
           G + +RHG AIH+YTA KSMEN +F N+DGDFLIVPQ G L I TE G ++V  GEI V+
Sbjct: 127 GHASVRHGLAIHIYTATKSMENKSFYNSDGDFLIVPQQGTLDIQTEFGFMKVKSGEICVI 186

Query: 254 PQGFRFVVYLPDGPSRGYVAEIFGSHFQLPDLGPIGANGLAAPRDFLAPVAWFENISRPG 313
            +G  F V + +GP+RGY+ E+FGSHF+LPDLGPIGANGLA PRDFL+PVA +E      
Sbjct: 187 QRGITFSVNV-EGPTRGYICEVFGSHFKLPDLGPIGANGLANPRDFLSPVAAYEKKEGIE 246

Query: 314 YTIVQKYGGELFTAIQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLFDHSDPSINTVLT 373
           +T + K+ G+LF+A Q +SPFNVVAWHGNY PYKYDLS FC  N+V FDH DPSI TVLT
Sbjct: 247 HTKINKFLGKLFSATQTYSPFNVVAWHGNYCPYKYDLSLFCVVNSVSFDHLDPSIFTVLT 306

Query: 374 APTDKPGVALLDFVIFPPRWLVSEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGA 433
           APT++ GVA  DFVIFPPRWLV E+TFRPPY+HRNCMSEFMGLI G YEAK +GFLPGG 
Sbjct: 307 APTNEVGVAAADFVIFPPRWLVQENTFRPPYFHRNCMSEFMGLIRGVYEAKKEGFLPGGG 366

Query: 434 SLHNCMTPHGPDTKTYEATIARGNEAGPYRITD-TMAFMFESSLIPRVCSWALESPCMDH 493
           SLH+CMTPHGPD+ T+ A I    E  P +I D  +AFMFESSLI  +  +A ++  +D 
Sbjct: 367 SLHSCMTPHGPDSDTFYAAIKA--ELKPTKIPDVALAFMFESSLILGISDYAKKN-FIDD 426

Query: 494 DYYQCWIGLKSH 504
           DY++CW GLK +
Sbjct: 427 DYWKCWQGLKDN 429

BLAST of CmaCh04G005620 vs. Swiss-Prot
Match: HGD_HUMAN (Homogentisate 1,2-dioxygenase OS=Homo sapiens GN=HGD PE=1 SV=2)

HSP 1 Score: 516.5 bits (1329), Expect = 3.3e-145
Identity = 254/441 (57.60%), Postives = 313/441 (70.98%), Query Frame = 1

Query: 73  SDLAYQSGFNNHFSSE--AIPGALPQLQNSPLICPYGLYAEQISGTSFTSPRKVNMCSWL 132
           ++L Y SGF N  SSE    PG+LP+ QN+P +CPY LYAEQ+SG++FT PR  N  SWL
Sbjct: 2   AELKYISGFGNECSSEDPRCPGSLPEGQNNPQVCPYNLYAEQLSGSAFTCPRSTNKRSWL 61

Query: 133 YRIKPSVTHEPFRPRLPKNEKLISEFNASNCSSTPTQLRWRPADVPDSP---LDFVDGLY 192
           YRI PSV+H+PF      +E  ++  N       P QLRW+P ++P +    +DFV GL+
Sbjct: 62  YRILPSVSHKPFESI---DEGQVTH-NWDEVDPDPNQLRWKPFEIPKASQKKVDFVSGLH 121

Query: 193 TVCGAGSSFLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQSGRLWIITECGRLEVSPG 252
           T+CGAG     +G AIH++  N SMEN  F N+DGDFLIVPQ G L I TE G++ V P 
Sbjct: 122 TLCGAGDIKSNNGLAIHIFLCNTSMENRCFYNSDGDFLIVPQKGNLLIYTEFGKMLVQPN 181

Query: 253 EIVVLPQGFRFVVYLPDGPSRGYVAEIFGSHFQLPDLGPIGANGLAAPRDFLAPVAWFEN 312
           EI V+ +G RF + + +  +RGY+ E++G HF+LPDLGPIGANGLA PRDFL P+AW+E+
Sbjct: 182 EICVIQRGMRFSIDVFE-ETRGYILEVYGVHFELPDLGPIGANGLANPRDFLIPIAWYED 241

Query: 313 ISRPG-YTIVQKYGGELFTAIQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLFDHSDPS 372
              PG YT++ KY G+LF A QD SPFNVVAWHGNY PYKY+L  F   N+V FDH+DPS
Sbjct: 242 RQVPGGYTVINKYQGKLFAAKQDVSPFNVVAWHGNYTPYKYNLKNFMVINSVAFDHADPS 301

Query: 373 INTVLTAPTDKPGVALLDFVIFPPRWLVSEHTFRPPYYHRNCMSEFMGLIYGGYEAKADG 432
           I TVLTA + +PGVA+ DFVIFPPRW V++ TFRPPYYHRNCMSEFMGLI G YEAK  G
Sbjct: 302 IFTVLTAKSVRPGVAIADFVIFPPRWGVADKTFRPPYYHRNCMSEFMGLIRGHYEAKQGG 361

Query: 433 FLPGGASLHNCMTPHGPDTKTYEATIARGNEAGPYRITD-TMAFMFESSLIPRVCSWALE 492
           FLPGG SLH+ MTPHGPD   +E   A   +  P RI D TMAFMFESSL   V  W L+
Sbjct: 362 FLPGGGSLHSTMTPHGPDADCFEK--ASKVKLAPERIADGTMAFMFESSLSLAVTKWGLK 421

Query: 493 -SPCMDHDYYQCWIGLKSHFT 506
            S C+D +Y++CW  LKSHFT
Sbjct: 422 ASRCLDENYHKCWEPLKSHFT 435

BLAST of CmaCh04G005620 vs. Swiss-Prot
Match: HGD_MOUSE (Homogentisate 1,2-dioxygenase OS=Mus musculus GN=Hgd PE=1 SV=2)

HSP 1 Score: 514.2 bits (1323), Expect = 1.7e-144
Identity = 250/441 (56.69%), Postives = 310/441 (70.29%), Query Frame = 1

Query: 73  SDLAYQSGFNNHFSSE--AIPGALPQLQNSPLICPYGLYAEQISGTSFTSPRKVNMCSWL 132
           ++L Y SGF N  +SE    PG+LP+ QN+P +CPY LYAEQ+SG++FT PR  N  SWL
Sbjct: 2   AELKYISGFGNECASEDPRCPGSLPKGQNNPQVCPYNLYAEQLSGSAFTCPRNTNKRSWL 61

Query: 133 YRIKPSVTHEPFRPRLPKNEKLISEFNASNCSSTPTQLRWRPADVP---DSPLDFVDGLY 192
           YRI PSV+H+PF       ++     N       P QLRW+P ++P   +  +DFV GLY
Sbjct: 62  YRILPSVSHKPFE----SIDQGHVTHNWDEVGPDPNQLRWKPFEIPKASEKKVDFVSGLY 121

Query: 193 TVCGAGSSFLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQSGRLWIITECGRLEVSPG 252
           T+CGAG     +G A+H++  N SMEN  F N+DGDFLIVPQ G+L I TE G++ + P 
Sbjct: 122 TLCGAGDIKSNNGLAVHIFLCNSSMENRCFYNSDGDFLIVPQKGKLLIYTEFGKMSLQPN 181

Query: 253 EIVVLPQGFRFVVYLPDGPSRGYVAEIFGSHFQLPDLGPIGANGLAAPRDFLAPVAWFEN 312
           EI V+ +G RF V + +  +RGY+ E++G HF+LPDLGPIGANGLA PRDFL PVAW+E+
Sbjct: 182 EICVIQRGMRFSVDVFE-ETRGYILEVYGVHFELPDLGPIGANGLANPRDFLIPVAWYED 241

Query: 313 ISRPG-YTIVQKYGGELFTAIQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLFDHSDPS 372
              PG YT++ K+ G+LF   QD SPFNVVAWHGNY PYKY+L  F   N V FDH+DPS
Sbjct: 242 RRVPGGYTVINKFQGKLFACKQDVSPFNVVAWHGNYTPYKYNLENFMVINAVAFDHADPS 301

Query: 373 INTVLTAPTDKPGVALLDFVIFPPRWLVSEHTFRPPYYHRNCMSEFMGLIYGGYEAKADG 432
           I TVLTA + +PGVA+ DFVIFPPRW V++ TFRPPYYHRNCMSEFMGLI G YEAK  G
Sbjct: 302 IFTVLTAKSLRPGVAIADFVIFPPRWGVADKTFRPPYYHRNCMSEFMGLIKGHYEAKQGG 361

Query: 433 FLPGGASLHNCMTPHGPDTKTYEATIARGNEAGPYRITD-TMAFMFESSLIPRVCSWALE 492
           FLPGG SLH+ MTPHGPD   +E   A   +  P RI D TMAFMFESSL   V  W L+
Sbjct: 362 FLPGGGSLHSAMTPHGPDADCFEK--ASKAKLEPERIADGTMAFMFESSLSLAVTKWGLK 421

Query: 493 S-PCMDHDYYQCWIGLKSHFT 506
           +  C+D +YY+CW  L+SHFT
Sbjct: 422 TCSCLDENYYKCWEPLRSHFT 435

BLAST of CmaCh04G005620 vs. TrEMBL
Match: A0A0A0KYZ6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G088750 PE=4 SV=1)

HSP 1 Score: 929.9 bits (2402), Expect = 1.4e-267
Identity = 428/470 (91.06%), Postives = 449/470 (95.53%), Query Frame = 1

Query: 58  MAAQSVGEMDGENFPSDLAYQSGFNNHFSSEAIPGALPQLQNSPLICPYGLYAEQISGTS 117
           MAAQSVGE DG +FPSDL Y SGFNNHFSSEAIPGALPQ QNSPLICP+GLYAEQISGTS
Sbjct: 1   MAAQSVGETDGTDFPSDLPYLSGFNNHFSSEAIPGALPQSQNSPLICPFGLYAEQISGTS 60

Query: 118 FTSPRKVNMCSWLYRIKPSVTHEPFRPRLPKNEKLISEFNASNCSSTPTQLRWRPADVPD 177
           FTSPRK N+CSWLYRIKPSVTHEPFR RLPKNEKLISEFNASNCSSTPTQLRW+PAD PD
Sbjct: 61  FTSPRKANLCSWLYRIKPSVTHEPFRQRLPKNEKLISEFNASNCSSTPTQLRWKPADFPD 120

Query: 178 SPLDFVDGLYTVCGAGSSFLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQSGRLWIIT 237
           SP+DFVDGLYTVCGAGSSFLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQSG+LWIIT
Sbjct: 121 SPVDFVDGLYTVCGAGSSFLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQSGKLWIIT 180

Query: 238 ECGRLEVSPGEIVVLPQGFRFVVYLPDGPSRGYVAEIFGSHFQLPDLGPIGANGLAAPRD 297
           ECGRLEVSPGE+VVLPQGFRFVVYLPDGPSRGYVAEIFGSHFQLPDLGPIGANGLAAPRD
Sbjct: 181 ECGRLEVSPGEVVVLPQGFRFVVYLPDGPSRGYVAEIFGSHFQLPDLGPIGANGLAAPRD 240

Query: 298 FLAPVAWFENISRPGYTIVQKYGGELFTAIQDFSPFNVVAWHGNYVPYKYDLSKFCPYNT 357
           FLAPVAWFEN  RPGYTI+QK+GGELFTAIQDFSPFNVVAWHGNYVPYKYDL KFCPYNT
Sbjct: 241 FLAPVAWFENSPRPGYTIIQKFGGELFTAIQDFSPFNVVAWHGNYVPYKYDLCKFCPYNT 300

Query: 358 VLFDHSDPSINTVLTAPTDKPGVALLDFVIFPPRWLVSEHTFRPPYYHRNCMSEFMGLIY 417
           VLFDHSDPSINTVLTAPTDKPGVALLDFVIFPPRWLV+EHTFRPPYYHRNCMSEFMGLIY
Sbjct: 301 VLFDHSDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIY 360

Query: 418 GGYEAKADGFLPGGASLHNCMTPHGPDTKTYEATIARGNEAGPYRITDTMAFMFESSLIP 477
           GGYEAKADGF+PGGASLH+CMTPHGPDTKTYEATIARGN+AGP++I+ TMAFMFESSLIP
Sbjct: 361 GGYEAKADGFVPGGASLHSCMTPHGPDTKTYEATIARGNDAGPHKISGTMAFMFESSLIP 420

Query: 478 RVCSWALESPCMDHDYYQCWIGLKSHFTSEGTTRDMDPRKVKTDVENGRQ 528
           RVCSWALESP +DHDYYQCWIGLKSHF +E    D DP+KV+ + ENGRQ
Sbjct: 421 RVCSWALESPFIDHDYYQCWIGLKSHFKNE-AIGDTDPQKVRIESENGRQ 469

BLAST of CmaCh04G005620 vs. TrEMBL
Match: Q9M6U1_SOLLC (Homogentisate 1,2-dioxygenase (Fragment) OS=Solanum lycopersicum GN=HGO PE=2 SV=1)

HSP 1 Score: 843.2 bits (2177), Expect = 1.7e-241
Identity = 384/452 (84.96%), Postives = 414/452 (91.59%), Query Frame = 1

Query: 70  NFPSDLAYQSGFNNHFSSEAIPGALPQLQNSPLICPYGLYAEQISGTSFTSPRKVNMCSW 129
           NFPSDL YQ+GF NHFSSEAI GALPQ QNSPLICP+GLYAEQISGTSFTSPRK+N  SW
Sbjct: 6   NFPSDLEYQTGFGNHFSSEAIVGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQRSW 65

Query: 130 LYRIKPSVTHEPFRPRLPKNEKLISEFNASNCSSTPTQLRWRPADVPDSPLDFVDGLYTV 189
           LYRIKPSVTHEPFRPR+P++EKL+SEFN SN S+TPTQLRW+P ++P++P DF+DGLYT+
Sbjct: 66  LYRIKPSVTHEPFRPRMPRHEKLVSEFNQSNSSATPTQLRWKPVEIPETPTDFIDGLYTI 125

Query: 190 CGAGSSFLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQSGRLWIITECGRLEVSPGEI 249
           CGAGSS+LRHGFAIHMYTANKSMEN AFCNADGDFLIVPQ GRLWI TECGRL+V PGEI
Sbjct: 126 CGAGSSYLRHGFAIHMYTANKSMENSAFCNADGDFLIVPQKGRLWITTECGRLQVCPGEI 185

Query: 250 VVLPQGFRFVVYLPDGPSRGYVAEIFGSHFQLPDLGPIGANGLAAPRDFLAPVAWFENIS 309
           V+LPQG+RF V LPDGPSRGYVAE FG+H QLPDLGPIGANGLAAPRDFL PVAW+ + S
Sbjct: 186 VILPQGYRFAVDLPDGPSRGYVAETFGTHLQLPDLGPIGANGLAAPRDFLVPVAWYGDGS 245

Query: 310 RPGYTIVQKYGGELFTAIQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLFDHSDPSINT 369
           RPGYTIVQKYGGELFTA QDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVL DHSDPSINT
Sbjct: 246 RPGYTIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPSINT 305

Query: 370 VLTAPTDKPGVALLDFVIFPPRWLVSEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLP 429
           VLTAPTDKPGVALLDFVIFPPRWLV+EHTFRPPYYHRNCMSEFMGLIYGGYEAKADGF P
Sbjct: 306 VLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFHP 365

Query: 430 GGASLHNCMTPHGPDTKTYEATIARGNEAGPYRITDTMAFMFESSLIPRVCSWALESPCM 489
           GGASLH+CMTPHGPDTKT+EATIA GNEAGP+RI DTMAFMFES L+PRVC WALESP M
Sbjct: 366 GGASLHSCMTPHGPDTKTFEATIALGNEAGPHRIADTMAFMFESCLVPRVCPWALESPFM 425

Query: 490 DHDYYQCWIGLKSHFTSEGTTRD-MDPRKVKT 521
           DHDYYQCWIGLKSHF+      D +D +K KT
Sbjct: 426 DHDYYQCWIGLKSHFSGLSMNEDNVDLQKGKT 457

BLAST of CmaCh04G005620 vs. TrEMBL
Match: A0A0V0IDM4_SOLCH (Putative homogentisate 1,2-dioxygenase-like OS=Solanum chacoense PE=4 SV=1)

HSP 1 Score: 842.8 bits (2176), Expect = 2.3e-241
Identity = 382/443 (86.23%), Postives = 407/443 (91.87%), Query Frame = 1

Query: 70  NFPSDLAYQSGFNNHFSSEAIPGALPQLQNSPLICPYGLYAEQISGTSFTSPRKVNMCSW 129
           NFPSDL YQ+GF NHFSSEAI GALPQ QNSPLICP+GLYAEQISGTSFTSPRK+N  SW
Sbjct: 9   NFPSDLEYQTGFGNHFSSEAIVGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQRSW 68

Query: 130 LYRIKPSVTHEPFRPRLPKNEKLISEFNASNCSSTPTQLRWRPADVPDSPLDFVDGLYTV 189
           LYRIKPSVTHEPFRPR+P++EKL+SEFN SN S+TPTQLRW+P ++P++P DF+DGLYT+
Sbjct: 69  LYRIKPSVTHEPFRPRMPRHEKLVSEFNQSNSSATPTQLRWKPVEIPETPTDFIDGLYTI 128

Query: 190 CGAGSSFLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQSGRLWIITECGRLEVSPGEI 249
           CGAGSS+LRHGFAIHMYTANKSMEN AFCNADGDFLIVPQ GRLWI TECGRL+V PGEI
Sbjct: 129 CGAGSSYLRHGFAIHMYTANKSMENSAFCNADGDFLIVPQKGRLWITTECGRLQVCPGEI 188

Query: 250 VVLPQGFRFVVYLPDGPSRGYVAEIFGSHFQLPDLGPIGANGLAAPRDFLAPVAWFENIS 309
           V LPQG+RF V LPDGPSRGYVAE FG+H QLPDLGPIGANGLAAPRDFL PVAW+E+ S
Sbjct: 189 VTLPQGYRFAVDLPDGPSRGYVAETFGTHLQLPDLGPIGANGLAAPRDFLVPVAWYEDGS 248

Query: 310 RPGYTIVQKYGGELFTAIQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLFDHSDPSINT 369
           RPGY IVQKYGGELFTA QDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVL DHSDPSINT
Sbjct: 249 RPGYAIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPSINT 308

Query: 370 VLTAPTDKPGVALLDFVIFPPRWLVSEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLP 429
           VLTAPTDKPGVALLDFVIFPPRWLV+EHTFRPPYYHRNCMSEFMGLIYGGYEAKADGF P
Sbjct: 309 VLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFHP 368

Query: 430 GGASLHNCMTPHGPDTKTYEATIARGNEAGPYRITDTMAFMFESSLIPRVCSWALESPCM 489
           GGASLH+CMTPHGPDTKTYEATIA GNEAGP+RI DTMAFMFES LIPRVC WALESP M
Sbjct: 369 GGASLHSCMTPHGPDTKTYEATIALGNEAGPHRIADTMAFMFESCLIPRVCPWALESPFM 428

Query: 490 DHDYYQCWIGLKSHFTSEGTTRD 513
           DHDYYQCWIGLKSHF+      D
Sbjct: 429 DHDYYQCWIGLKSHFSGLSMNED 451

BLAST of CmaCh04G005620 vs. TrEMBL
Match: M1B8K1_SOLTU (Uncharacterized protein OS=Solanum tuberosum GN=PGSC0003DMG400015330 PE=4 SV=1)

HSP 1 Score: 842.4 bits (2175), Expect = 3.0e-241
Identity = 382/443 (86.23%), Postives = 408/443 (92.10%), Query Frame = 1

Query: 70  NFPSDLAYQSGFNNHFSSEAIPGALPQLQNSPLICPYGLYAEQISGTSFTSPRKVNMCSW 129
           NFPSDL YQ+GF NHFSSEAI GALPQ QNSPLICP+GLYAEQISGTSFTSPRK+N  SW
Sbjct: 9   NFPSDLEYQTGFGNHFSSEAIVGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQRSW 68

Query: 130 LYRIKPSVTHEPFRPRLPKNEKLISEFNASNCSSTPTQLRWRPADVPDSPLDFVDGLYTV 189
           LYRIKPSVTHEPFRPR+P++EKL+SEFN SN S+TPTQLRW+P ++P++P DF+DGLYT+
Sbjct: 69  LYRIKPSVTHEPFRPRMPRHEKLVSEFNQSNSSATPTQLRWKPVEIPETPTDFIDGLYTI 128

Query: 190 CGAGSSFLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQSGRLWIITECGRLEVSPGEI 249
           CGAGSS+LRHGFAIHMYTANKSMEN AFCNADGDFLIVPQ GRLWI TECGRL+V PGEI
Sbjct: 129 CGAGSSYLRHGFAIHMYTANKSMENSAFCNADGDFLIVPQKGRLWITTECGRLQVCPGEI 188

Query: 250 VVLPQGFRFVVYLPDGPSRGYVAEIFGSHFQLPDLGPIGANGLAAPRDFLAPVAWFENIS 309
           V+LPQG+RF V LPDGPSRGYVAE FG+H QLPDLGPIGANGLAAPRDFL PVAW+E+ S
Sbjct: 189 VILPQGYRFAVDLPDGPSRGYVAETFGTHLQLPDLGPIGANGLAAPRDFLVPVAWYEDGS 248

Query: 310 RPGYTIVQKYGGELFTAIQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLFDHSDPSINT 369
           RPGYTIVQKYGGELFTA QDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVL DHSDPSINT
Sbjct: 249 RPGYTIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPSINT 308

Query: 370 VLTAPTDKPGVALLDFVIFPPRWLVSEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLP 429
           VLTAPTDKPGVALLDFVIFPPRWLV+EHTFRPPYYHRNCMSEFMGLI GGYEAKADGF P
Sbjct: 309 VLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLINGGYEAKADGFHP 368

Query: 430 GGASLHNCMTPHGPDTKTYEATIARGNEAGPYRITDTMAFMFESSLIPRVCSWALESPCM 489
           GGASLH+CMTPHGPDTKTYEATIA GNEAGP+RI DTMAFMFES LIPRVC WALESP M
Sbjct: 369 GGASLHSCMTPHGPDTKTYEATIALGNEAGPHRIADTMAFMFESCLIPRVCPWALESPFM 428

Query: 490 DHDYYQCWIGLKSHFTSEGTTRD 513
           DHDYYQCWIGLKSHF+      D
Sbjct: 429 DHDYYQCWIGLKSHFSGLSMNED 451

BLAST of CmaCh04G005620 vs. TrEMBL
Match: K4DCU5_SOLLC (Uncharacterized protein OS=Solanum lycopersicum PE=4 SV=1)

HSP 1 Score: 841.3 bits (2172), Expect = 6.6e-241
Identity = 380/443 (85.78%), Postives = 408/443 (92.10%), Query Frame = 1

Query: 70  NFPSDLAYQSGFNNHFSSEAIPGALPQLQNSPLICPYGLYAEQISGTSFTSPRKVNMCSW 129
           NFPSDL YQ+GF NHFSSEAI GALPQ QNSPLICP+GLYAEQISGTSFTSPRK+N  SW
Sbjct: 9   NFPSDLEYQTGFGNHFSSEAIVGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQRSW 68

Query: 130 LYRIKPSVTHEPFRPRLPKNEKLISEFNASNCSSTPTQLRWRPADVPDSPLDFVDGLYTV 189
           LYRIKPSVTHEPFRPR+P++EKL+SEFN SN S+TPTQLRW+P ++P++P DF+DGLYT+
Sbjct: 69  LYRIKPSVTHEPFRPRMPRHEKLVSEFNQSNSSATPTQLRWKPVEIPETPTDFIDGLYTI 128

Query: 190 CGAGSSFLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQSGRLWIITECGRLEVSPGEI 249
           CGAGSS+LRHGFAIHMYTANKSMEN AFCNADGDFLIVPQ GRLWI TECGRL+V PGEI
Sbjct: 129 CGAGSSYLRHGFAIHMYTANKSMENSAFCNADGDFLIVPQKGRLWITTECGRLQVCPGEI 188

Query: 250 VVLPQGFRFVVYLPDGPSRGYVAEIFGSHFQLPDLGPIGANGLAAPRDFLAPVAWFENIS 309
           V+LPQG+RF V LPDGPSRGYVAE FG+H QLPDLGPIGANGLAAPRDFL PVAW+ + S
Sbjct: 189 VILPQGYRFAVDLPDGPSRGYVAETFGTHLQLPDLGPIGANGLAAPRDFLVPVAWYGDGS 248

Query: 310 RPGYTIVQKYGGELFTAIQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLFDHSDPSINT 369
           RPGYTIVQKYGGELFTA QDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVL DHSDPSINT
Sbjct: 249 RPGYTIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPSINT 308

Query: 370 VLTAPTDKPGVALLDFVIFPPRWLVSEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLP 429
           VLTAPTDKPGVALLDFVIFPPRWLV+EHTFRPPYYHRNCMSEFMGLIYGGYEAKADGF P
Sbjct: 309 VLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFHP 368

Query: 430 GGASLHNCMTPHGPDTKTYEATIARGNEAGPYRITDTMAFMFESSLIPRVCSWALESPCM 489
           GGASLH+CMTPHGPDTKT+EATIA GNEAGP+RI DTMAFMFES L+PRVC WALESP M
Sbjct: 369 GGASLHSCMTPHGPDTKTFEATIALGNEAGPHRIADTMAFMFESCLVPRVCPWALESPFM 428

Query: 490 DHDYYQCWIGLKSHFTSEGTTRD 513
           DHDYYQCWIGLKSHF+      D
Sbjct: 429 DHDYYQCWIGLKSHFSGLSMNED 451

BLAST of CmaCh04G005620 vs. TAIR10
Match: AT5G54080.1 (AT5G54080.1 homogentisate 1,2-dioxygenase)

HSP 1 Score: 790.4 bits (2040), Expect = 6.8e-229
Identity = 357/432 (82.64%), Postives = 390/432 (90.28%), Query Frame = 1

Query: 74  DLAYQSGFNNHFSSEAIPGALPQLQNSPLICPYGLYAEQISGTSFTSPRKVNMCSWLYRI 133
           +L YQSGF NHFSSEAI GALP  QNSPL+CPYGLYAEQISGTSFTSPRK+N  SWLYR+
Sbjct: 10  ELKYQSGFGNHFSSEAIAGALPLDQNSPLLCPYGLYAEQISGTSFTSPRKLNQRSWLYRV 69

Query: 134 KPSVTHEPFRPRLPKNEKLISEFNASNCSSTPTQLRWRPADVPDSPLDFVDGLYTVCGAG 193
           KPSVTHEPF+PR+P ++KL+SEF+ASN  + PTQLRWRP D+PDS +DFVDGL+T+CGAG
Sbjct: 70  KPSVTHEPFKPRVPAHKKLVSEFDASNSRTNPTQLRWRPEDIPDSEIDFVDGLFTICGAG 129

Query: 194 SSFLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQSGRLWIITECGRLEVSPGEIVVLP 253
           SSFLRHGFAIHMY AN  M++ AFCNADGDFL+VPQ+GRLWI TECGRL V+PGEI V+P
Sbjct: 130 SSFLRHGFAIHMYVANTGMKDSAFCNADGDFLLVPQTGRLWIETECGRLLVTPGEIAVIP 189

Query: 254 QGFRFVVYLPDGPSRGYVAEIFGSHFQLPDLGPIGANGLAAPRDFLAPVAWFENISRPGY 313
           QGFRF + LPDG SRGYVAEI+G+HFQLPDLGPIGANGLAA RDFLAP AWFE+  RP Y
Sbjct: 190 QGFRFSIDLPDGKSRGYVAEIYGAHFQLPDLGPIGANGLAASRDFLAPTAWFEDGLRPEY 249

Query: 314 TIVQKYGGELFTAIQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLFDHSDPSINTVLTA 373
           TIVQK+GGELFTA QDFSPFNVVAWHGNYVPYKYDL KFCPYNTVL DH DPSINTVLTA
Sbjct: 250 TIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLKKFCPYNTVLLDHGDPSINTVLTA 309

Query: 374 PTDKPGVALLDFVIFPPRWLVSEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGAS 433
           PTDKPGVALLDFVIFPPRWLV+EHTFRPPYYHRNCMSEFMGLIYG YEAKADGFLPGGAS
Sbjct: 310 PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGAYEAKADGFLPGGAS 369

Query: 434 LHNCMTPHGPDTKTYEATIARGNEAGPYRITDTMAFMFESSLIPRVCSWALESPCMDHDY 493
           LH+CMTPHGPDT TYEATIAR N   P ++T TMAFMFES+LIPRVC WALESP +DHDY
Sbjct: 370 LHSCMTPHGPDTTTYEATIARVNAMAPSKLTGTMAFMFESALIPRVCHWALESPFLDHDY 429

Query: 494 YQCWIGLKSHFT 506
           YQCWIGLKSHF+
Sbjct: 430 YQCWIGLKSHFS 441

BLAST of CmaCh04G005620 vs. NCBI nr
Match: gi|449438877|ref|XP_004137214.1| (PREDICTED: homogentisate 1,2-dioxygenase [Cucumis sativus])

HSP 1 Score: 929.9 bits (2402), Expect = 2.0e-267
Identity = 428/470 (91.06%), Postives = 449/470 (95.53%), Query Frame = 1

Query: 58  MAAQSVGEMDGENFPSDLAYQSGFNNHFSSEAIPGALPQLQNSPLICPYGLYAEQISGTS 117
           MAAQSVGE DG +FPSDL Y SGFNNHFSSEAIPGALPQ QNSPLICP+GLYAEQISGTS
Sbjct: 1   MAAQSVGETDGTDFPSDLPYLSGFNNHFSSEAIPGALPQSQNSPLICPFGLYAEQISGTS 60

Query: 118 FTSPRKVNMCSWLYRIKPSVTHEPFRPRLPKNEKLISEFNASNCSSTPTQLRWRPADVPD 177
           FTSPRK N+CSWLYRIKPSVTHEPFR RLPKNEKLISEFNASNCSSTPTQLRW+PAD PD
Sbjct: 61  FTSPRKANLCSWLYRIKPSVTHEPFRQRLPKNEKLISEFNASNCSSTPTQLRWKPADFPD 120

Query: 178 SPLDFVDGLYTVCGAGSSFLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQSGRLWIIT 237
           SP+DFVDGLYTVCGAGSSFLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQSG+LWIIT
Sbjct: 121 SPVDFVDGLYTVCGAGSSFLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQSGKLWIIT 180

Query: 238 ECGRLEVSPGEIVVLPQGFRFVVYLPDGPSRGYVAEIFGSHFQLPDLGPIGANGLAAPRD 297
           ECGRLEVSPGE+VVLPQGFRFVVYLPDGPSRGYVAEIFGSHFQLPDLGPIGANGLAAPRD
Sbjct: 181 ECGRLEVSPGEVVVLPQGFRFVVYLPDGPSRGYVAEIFGSHFQLPDLGPIGANGLAAPRD 240

Query: 298 FLAPVAWFENISRPGYTIVQKYGGELFTAIQDFSPFNVVAWHGNYVPYKYDLSKFCPYNT 357
           FLAPVAWFEN  RPGYTI+QK+GGELFTAIQDFSPFNVVAWHGNYVPYKYDL KFCPYNT
Sbjct: 241 FLAPVAWFENSPRPGYTIIQKFGGELFTAIQDFSPFNVVAWHGNYVPYKYDLCKFCPYNT 300

Query: 358 VLFDHSDPSINTVLTAPTDKPGVALLDFVIFPPRWLVSEHTFRPPYYHRNCMSEFMGLIY 417
           VLFDHSDPSINTVLTAPTDKPGVALLDFVIFPPRWLV+EHTFRPPYYHRNCMSEFMGLIY
Sbjct: 301 VLFDHSDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIY 360

Query: 418 GGYEAKADGFLPGGASLHNCMTPHGPDTKTYEATIARGNEAGPYRITDTMAFMFESSLIP 477
           GGYEAKADGF+PGGASLH+CMTPHGPDTKTYEATIARGN+AGP++I+ TMAFMFESSLIP
Sbjct: 361 GGYEAKADGFVPGGASLHSCMTPHGPDTKTYEATIARGNDAGPHKISGTMAFMFESSLIP 420

Query: 478 RVCSWALESPCMDHDYYQCWIGLKSHFTSEGTTRDMDPRKVKTDVENGRQ 528
           RVCSWALESP +DHDYYQCWIGLKSHF +E    D DP+KV+ + ENGRQ
Sbjct: 421 RVCSWALESPFIDHDYYQCWIGLKSHFKNE-AIGDTDPQKVRIESENGRQ 469

BLAST of CmaCh04G005620 vs. NCBI nr
Match: gi|659101616|ref|XP_008451701.1| (PREDICTED: homogentisate 1,2-dioxygenase [Cucumis melo])

HSP 1 Score: 926.8 bits (2394), Expect = 1.7e-266
Identity = 435/496 (87.70%), Postives = 458/496 (92.34%), Query Frame = 1

Query: 32  PVPFSGFNSLHNRSSSSSAFYSSSVSMAAQSVGEMDGENFPSDLAYQSGFNNHFSSEAIP 91
           PV  SG ++L NRSS SS        MAAQSVGE +G +FPSDL Y SGFNNHFSSEAIP
Sbjct: 4   PVISSGSDNLCNRSSYSS--------MAAQSVGETEGRDFPSDLPYLSGFNNHFSSEAIP 63

Query: 92  GALPQLQNSPLICPYGLYAEQISGTSFTSPRKVNMCSWLYRIKPSVTHEPFRPRLPKNEK 151
           GALPQ QNSPL CP+GLYAEQISGTSFTSPRK N+CSWLYRIKPSVTHEPFR RLPKNEK
Sbjct: 64  GALPQSQNSPLNCPFGLYAEQISGTSFTSPRKANLCSWLYRIKPSVTHEPFRQRLPKNEK 123

Query: 152 LISEFNASNCSSTPTQLRWRPADVPDSPLDFVDGLYTVCGAGSSFLRHGFAIHMYTANKS 211
           LISEFNASNCSSTPTQLRW+PAD PDSP+DFVDGL+TVCGAGSSFLRHGFAIHMYTANKS
Sbjct: 124 LISEFNASNCSSTPTQLRWKPADFPDSPVDFVDGLHTVCGAGSSFLRHGFAIHMYTANKS 183

Query: 212 MENCAFCNADGDFLIVPQSGRLWIITECGRLEVSPGEIVVLPQGFRFVVYLPDGPSRGYV 271
           MENCAFCNADGDFLIVPQ+GRLWI TECGRLEVSPGEIVVLPQGFRFVVYLPDGPSRGYV
Sbjct: 184 MENCAFCNADGDFLIVPQTGRLWITTECGRLEVSPGEIVVLPQGFRFVVYLPDGPSRGYV 243

Query: 272 AEIFGSHFQLPDLGPIGANGLAAPRDFLAPVAWFENISRPGYTIVQKYGGELFTAIQDFS 331
           AEIFG HFQLPDLGPIGANGLAAPRDFLAPVAWFEN  RPGYT++QK+GGELFTAIQDFS
Sbjct: 244 AEIFGCHFQLPDLGPIGANGLAAPRDFLAPVAWFENSPRPGYTVIQKFGGELFTAIQDFS 303

Query: 332 PFNVVAWHGNYVPYKYDLSKFCPYNTVLFDHSDPSINTVLTAPTDKPGVALLDFVIFPPR 391
           PFNVVAWHGNYVPYKYDL KFCPYNTVLFDHSDPSINTVLTAPTDKPGVALLDFVIFPPR
Sbjct: 304 PFNVVAWHGNYVPYKYDLCKFCPYNTVLFDHSDPSINTVLTAPTDKPGVALLDFVIFPPR 363

Query: 392 WLVSEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGASLHNCMTPHGPDTKTYEAT 451
           WLV+EHTFRPPYYHRNCMSEFMGLIYGGYEAKADGF+PGGASLH+CMTPHGPDTKTYEAT
Sbjct: 364 WLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFVPGGASLHSCMTPHGPDTKTYEAT 423

Query: 452 IARGNEAGPYRITDTMAFMFESSLIPRVCSWALESPCMDHDYYQCWIGLKSHFTSEGTTR 511
           IARGN+AGP++I+ TMAFMFESSLIPRVCSWALESP MDHDYYQCWIGLKSHF +E    
Sbjct: 424 IARGNDAGPHKISGTMAFMFESSLIPRVCSWALESPFMDHDYYQCWIGLKSHFKNE-AIG 483

Query: 512 DMDPRKVKTDVENGRQ 528
           D DP+KV+   ENGRQ
Sbjct: 484 DTDPQKVRIKSENGRQ 490

BLAST of CmaCh04G005620 vs. NCBI nr
Match: gi|970064661|ref|XP_015059302.1| (PREDICTED: LOW QUALITY PROTEIN: homogentisate 1,2-dioxygenase [Solanum pennellii])

HSP 1 Score: 844.7 bits (2181), Expect = 8.6e-242
Identity = 382/443 (86.23%), Postives = 410/443 (92.55%), Query Frame = 1

Query: 70  NFPSDLAYQSGFNNHFSSEAIPGALPQLQNSPLICPYGLYAEQISGTSFTSPRKVNMCSW 129
           NFPSDL YQ+GF NHFSSEAI GALPQ QNSPLICP+GLYAEQISGTSFTSPRK+N  SW
Sbjct: 9   NFPSDLEYQTGFGNHFSSEAIVGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQRSW 68

Query: 130 LYRIKPSVTHEPFRPRLPKNEKLISEFNASNCSSTPTQLRWRPADVPDSPLDFVDGLYTV 189
           LYRIKPSVTHEPFRPR+P++EKL+SEFN SN S+TPTQLRW+P ++P++P DF+DGLYT+
Sbjct: 69  LYRIKPSVTHEPFRPRMPRHEKLVSEFNQSNSSATPTQLRWKPVEIPETPTDFIDGLYTI 128

Query: 190 CGAGSSFLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQSGRLWIITECGRLEVSPGEI 249
           CGAGSS+LRHGFAIHMYTANKSMEN AFCNADGDFLIVPQ GRLWI TECGRL+V PGEI
Sbjct: 129 CGAGSSYLRHGFAIHMYTANKSMENSAFCNADGDFLIVPQKGRLWITTECGRLQVCPGEI 188

Query: 250 VVLPQGFRFVVYLPDGPSRGYVAEIFGSHFQLPDLGPIGANGLAAPRDFLAPVAWFENIS 309
           V+LPQG+RF V LPDGPSRGYVAE FG+H QLPDLGPIGANGLAAPRDFL+PVAW+E+ S
Sbjct: 189 VILPQGYRFAVDLPDGPSRGYVAETFGTHLQLPDLGPIGANGLAAPRDFLSPVAWYEDGS 248

Query: 310 RPGYTIVQKYGGELFTAIQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLFDHSDPSINT 369
           RPGYTIVQKYGGELFTA QDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVL DHSDPSINT
Sbjct: 249 RPGYTIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPSINT 308

Query: 370 VLTAPTDKPGVALLDFVIFPPRWLVSEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLP 429
           VLTAPTDKPGVALLDFVIFPPRWLV+EHTFRPPYYHRNCMSEFMGLIYGGYEAKADGF P
Sbjct: 309 VLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFHP 368

Query: 430 GGASLHNCMTPHGPDTKTYEATIARGNEAGPYRITDTMAFMFESSLIPRVCSWALESPCM 489
           GGASLH+CMTPHGPDTKT+EATIA GNEAGP+RI DTMAFMFES LIPRVC WALESP M
Sbjct: 369 GGASLHSCMTPHGPDTKTFEATIALGNEAGPHRIADTMAFMFESCLIPRVCPWALESPFM 428

Query: 490 DHDYYQCWIGLKSHFTSEGTTRD 513
           DHDYYQCWIGLKSHF+      D
Sbjct: 429 DHDYYQCWIGLKSHFSGLSMNED 451

BLAST of CmaCh04G005620 vs. NCBI nr
Match: gi|697176514|ref|XP_009597213.1| (PREDICTED: homogentisate 1,2-dioxygenase [Nicotiana tomentosiformis])

HSP 1 Score: 843.6 bits (2178), Expect = 1.9e-241
Identity = 382/435 (87.82%), Postives = 409/435 (94.02%), Query Frame = 1

Query: 71  FPSDLAYQSGFNNHFSSEAIPGALPQLQNSPLICPYGLYAEQISGTSFTSPRKVNMCSWL 130
           FPSDL YQSGF NHFSSEAI GALPQ QNSPLICP+GLYAEQISGTSFTSPRK+N  SWL
Sbjct: 16  FPSDLEYQSGFGNHFSSEAIVGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQRSWL 75

Query: 131 YRIKPSVTHEPFRPRLPKNEKLISEFNASNCSSTPTQLRWRPADVPDSPLDFVDGLYTVC 190
           YRIKPSVTHEPFR RLP++EKL+SEF+ SN ++TPTQLRW+P ++P++P DF+DGLYT+C
Sbjct: 76  YRIKPSVTHEPFRRRLPRHEKLVSEFDQSNSAATPTQLRWKPVEIPETPKDFIDGLYTIC 135

Query: 191 GAGSSFLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQSGRLWIITECGRLEVSPGEIV 250
           GAGSS+LRHGFAIHMYTANKSMENCAFCNADGDFLIVPQ GRLWI TECGRL+VSPGEIV
Sbjct: 136 GAGSSYLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQQGRLWITTECGRLQVSPGEIV 195

Query: 251 VLPQGFRFVVYLPDGPSRGYVAEIFGSHFQLPDLGPIGANGLAAPRDFLAPVAWFENISR 310
           +LPQGFRF V LPDGPSRGYVAEIFG+H QLPDLGPIGANGLAAPRDFL PVAW+E+ S+
Sbjct: 196 ILPQGFRFSVDLPDGPSRGYVAEIFGTHLQLPDLGPIGANGLAAPRDFLVPVAWYEDGSQ 255

Query: 311 PGYTIVQKYGGELFTAIQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLFDHSDPSINTV 370
           PGYTIVQKYGGELFTA QDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVL DHSDPSINTV
Sbjct: 256 PGYTIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLIDHSDPSINTV 315

Query: 371 LTAPTDKPGVALLDFVIFPPRWLVSEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPG 430
           LTAPTDKPGVALLDFVIFPPRWLV+EHTFRPPYYHRNCMSEFMGLIYGGYEAKADGF PG
Sbjct: 316 LTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFHPG 375

Query: 431 GASLHNCMTPHGPDTKTYEATIARGNEAGPYRITDTMAFMFESSLIPRVCSWALESPCMD 490
           GASLH+CMTPHGPDTKTYEATIA GNEAGP++I DTMAFMFES LIPRVC WALESP MD
Sbjct: 376 GASLHSCMTPHGPDTKTYEATIALGNEAGPHKIADTMAFMFESCLIPRVCPWALESPFMD 435

Query: 491 HDYYQCWIGLKSHFT 506
           HDYYQCWIGLKSHF+
Sbjct: 436 HDYYQCWIGLKSHFS 450

BLAST of CmaCh04G005620 vs. NCBI nr
Match: gi|8131905|gb|AAF73132.1|AF149017_1 (homogentisate 1,2-dioxygenase [Solanum lycopersicum])

HSP 1 Score: 843.2 bits (2177), Expect = 2.5e-241
Identity = 384/452 (84.96%), Postives = 414/452 (91.59%), Query Frame = 1

Query: 70  NFPSDLAYQSGFNNHFSSEAIPGALPQLQNSPLICPYGLYAEQISGTSFTSPRKVNMCSW 129
           NFPSDL YQ+GF NHFSSEAI GALPQ QNSPLICP+GLYAEQISGTSFTSPRK+N  SW
Sbjct: 6   NFPSDLEYQTGFGNHFSSEAIVGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQRSW 65

Query: 130 LYRIKPSVTHEPFRPRLPKNEKLISEFNASNCSSTPTQLRWRPADVPDSPLDFVDGLYTV 189
           LYRIKPSVTHEPFRPR+P++EKL+SEFN SN S+TPTQLRW+P ++P++P DF+DGLYT+
Sbjct: 66  LYRIKPSVTHEPFRPRMPRHEKLVSEFNQSNSSATPTQLRWKPVEIPETPTDFIDGLYTI 125

Query: 190 CGAGSSFLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQSGRLWIITECGRLEVSPGEI 249
           CGAGSS+LRHGFAIHMYTANKSMEN AFCNADGDFLIVPQ GRLWI TECGRL+V PGEI
Sbjct: 126 CGAGSSYLRHGFAIHMYTANKSMENSAFCNADGDFLIVPQKGRLWITTECGRLQVCPGEI 185

Query: 250 VVLPQGFRFVVYLPDGPSRGYVAEIFGSHFQLPDLGPIGANGLAAPRDFLAPVAWFENIS 309
           V+LPQG+RF V LPDGPSRGYVAE FG+H QLPDLGPIGANGLAAPRDFL PVAW+ + S
Sbjct: 186 VILPQGYRFAVDLPDGPSRGYVAETFGTHLQLPDLGPIGANGLAAPRDFLVPVAWYGDGS 245

Query: 310 RPGYTIVQKYGGELFTAIQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLFDHSDPSINT 369
           RPGYTIVQKYGGELFTA QDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVL DHSDPSINT
Sbjct: 246 RPGYTIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPSINT 305

Query: 370 VLTAPTDKPGVALLDFVIFPPRWLVSEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLP 429
           VLTAPTDKPGVALLDFVIFPPRWLV+EHTFRPPYYHRNCMSEFMGLIYGGYEAKADGF P
Sbjct: 306 VLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFHP 365

Query: 430 GGASLHNCMTPHGPDTKTYEATIARGNEAGPYRITDTMAFMFESSLIPRVCSWALESPCM 489
           GGASLH+CMTPHGPDTKT+EATIA GNEAGP+RI DTMAFMFES L+PRVC WALESP M
Sbjct: 366 GGASLHSCMTPHGPDTKTFEATIALGNEAGPHRIADTMAFMFESCLVPRVCPWALESPFM 425

Query: 490 DHDYYQCWIGLKSHFTSEGTTRD-MDPRKVKT 521
           DHDYYQCWIGLKSHF+      D +D +K KT
Sbjct: 426 DHDYYQCWIGLKSHFSGLSMNEDNVDLQKGKT 457

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
HGD_ARATH1.2e-22782.64Homogentisate 1,2-dioxygenase OS=Arabidopsis thaliana GN=HGO PE=2 SV=2[more]
HGD_ORYSJ3.6e-21676.89Homogentisate 1,2-dioxygenase OS=Oryza sativa subsp. japonica GN=HGO PE=2 SV=1[more]
HGD_DICDI1.6e-14759.72Homogentisate 1,2-dioxygenase OS=Dictyostelium discoideum GN=hgd PE=2 SV=1[more]
HGD_HUMAN3.3e-14557.60Homogentisate 1,2-dioxygenase OS=Homo sapiens GN=HGD PE=1 SV=2[more]
HGD_MOUSE1.7e-14456.69Homogentisate 1,2-dioxygenase OS=Mus musculus GN=Hgd PE=1 SV=2[more]
Match NameE-valueIdentityDescription
A0A0A0KYZ6_CUCSA1.4e-26791.06Uncharacterized protein OS=Cucumis sativus GN=Csa_4G088750 PE=4 SV=1[more]
Q9M6U1_SOLLC1.7e-24184.96Homogentisate 1,2-dioxygenase (Fragment) OS=Solanum lycopersicum GN=HGO PE=2 SV=... [more]
A0A0V0IDM4_SOLCH2.3e-24186.23Putative homogentisate 1,2-dioxygenase-like OS=Solanum chacoense PE=4 SV=1[more]
M1B8K1_SOLTU3.0e-24186.23Uncharacterized protein OS=Solanum tuberosum GN=PGSC0003DMG400015330 PE=4 SV=1[more]
K4DCU5_SOLLC6.6e-24185.78Uncharacterized protein OS=Solanum lycopersicum PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G54080.16.8e-22982.64 homogentisate 1,2-dioxygenase[more]
Match NameE-valueIdentityDescription
gi|449438877|ref|XP_004137214.1|2.0e-26791.06PREDICTED: homogentisate 1,2-dioxygenase [Cucumis sativus][more]
gi|659101616|ref|XP_008451701.1|1.7e-26687.70PREDICTED: homogentisate 1,2-dioxygenase [Cucumis melo][more]
gi|970064661|ref|XP_015059302.1|8.6e-24286.23PREDICTED: LOW QUALITY PROTEIN: homogentisate 1,2-dioxygenase [Solanum pennellii... [more]
gi|697176514|ref|XP_009597213.1|1.9e-24187.82PREDICTED: homogentisate 1,2-dioxygenase [Nicotiana tomentosiformis][more]
gi|8131905|gb|AAF73132.1|AF149017_12.5e-24184.96homogentisate 1,2-dioxygenase [Solanum lycopersicum][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR005708Homogentis_dOase
IPR011051RmlC_Cupin_sf
IPR014710RmlC-like_jellyroll
Vocabulary: Molecular Function
TermDefinition
GO:0004411homogentisate 1,2-dioxygenase activity
Vocabulary: Biological Process
TermDefinition
GO:0006559L-phenylalanine catabolic process
GO:0006570tyrosine metabolic process
GO:0055114oxidation-reduction process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015996 chlorophyll catabolic process
biological_process GO:0006635 fatty acid beta-oxidation
biological_process GO:1902000 homogentisate catabolic process
biological_process GO:0006559 L-phenylalanine catabolic process
biological_process GO:0016558 protein import into peroxisome matrix
biological_process GO:0009750 response to fructose
biological_process GO:0009744 response to sucrose
biological_process GO:0042207 styrene catabolic process
biological_process GO:0006572 tyrosine catabolic process
biological_process GO:0055114 oxidation-reduction process
biological_process GO:0006570 tyrosine metabolic process
cellular_component GO:0005829 cytosol
cellular_component GO:0005575 cellular_component
molecular_function GO:0004411 homogentisate 1,2-dioxygenase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh04G005620.1CmaCh04G005620.1mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR005708Homogentisate 1,2-dioxygenasePANTHERPTHR11056HOMOGENTISATE 1,2-DIOXYGENASEcoord: 63..510
score: 3.7E
IPR005708Homogentisate 1,2-dioxygenasePFAMPF04209HgmAcoord: 76..504
score: 2.9E
IPR005708Homogentisate 1,2-dioxygenaseTIGRFAMsTIGR01015TIGR01015coord: 75..504
score: 1.9E
IPR011051RmlC-like cupin domainunknownSSF51182RmlC-like cupinscoord: 74..506
score: 3.27E
IPR014710RmlC-like jelly roll foldGENE3DG3DSA:2.60.120.10coord: 268..293
score: 3.6E-7coord: 318..474
score: 1.7