HG10019716 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10019716
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptioncytochrome P450 CYP72A219-like
LocationChr04: 24802081 .. 24807015 (+)
RNA-Seq ExpressionHG10019716
SyntenyHG10019716
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTGGAATTGGATATGGGTGGTGGGTTTATTGTGGTTATTGGGATTATGGGGATGGAGAATTGCGAATTGGGTTTGGTTGAGGCCGAAAAGGCTCGAAAAGTACTTGAGACACCAAGGCCTCGCCGGCAACTCTTATCGCTTTCTGTTTGGGGATACAAAGGATATTGCGGCGGCCGTCCGCCAAGCAAGATCGCACTCCATGACCTTCTCCCATGATATTGCTCCACGCGCCACCCCTACCTCTCATCTCACCATCCACAAATACGGTACTTCCTCTCATTTAATTCTCCTTTTTCGCTCTTCGTTTTAGTCGAAAGTCTACAAAAATTGATGGTTTGAACTTAATTTGTTTTCGTCTTGGTATTAATGGAGATGAACTGAGATAAGCTCTCAAAATAATTGGCCAAAAAAAGATGCTATTTGTGGAAAGGAGGAAAATTTTGAGTGTTATTTCTAATGATTATTCTATCTTTTTTTAGTGTTTTCTCTTTTAGTCAGATTATATTGTTTTGTTTTTAATTCAACCATGAAAATTTTGAATATTTGATTTTCTGATAAAAAAAAAAATTATATTTTATGTTAGTTGAACTATGTATATATTTGTTATTAATTGTATAAGAGTTGGAGTTTGTAAAAAAAAAAAAAAAAGTGTATTAGTACACTCTAATTTTTTTTTCTTTCAGTTGACTATGATAACCTTGGAATTGAGAATAAAATTAAGAGAAGATGAATGGGTTTTTTGAGTGAGTGAAGAATGGTTAGTGATTTGGTTGTGAGATTCAACTAATTTATTGGGATCAAATTTAGAACTTTTTTCTTTATAATTATCAAATTTGGAACTTTTTTCTTTATAATAATCAAATTGGAAATAAGTCTATGTTTAGATCAAATAGATTTTTTTTTTTTTTTTTTTTTTTTTAGATTTAACAATAGCAAAGGTGGGAGGACATCACAATTGTAGACATTTGAAATGATAATTGATTTTTTATTCATTAATATGTTTCCTCAAATTGACTAGAACATTGAAACCAAATTGCTCACATTTGAAATCCTATAGACTCAATATTAATTAGTAAAATTAGACTTTTTTTCTTTCAATTAAAATGCAAAAGATATGTTATATAGCTTTTTCCTTTTTTTTTTTTTATTTTTTTATTTTTATCATGAAGCTCGAACCTTGAGATTTTCATTCATTATTTGTTTAAGATGAAACTTGGTAGGTAAGGATTCTTTCACATGGGTTGGCACAACTCCAAGAGTATATATAACAGAGCCTGAACAAGTGAAAATTGTTTTCTCTCAAATAAATGATATTCGTAAGACATCTTCTTTTCCCCTTAGACGAAGAATTGGAGGTGGGCTTGTCTCTCTTGAGGGTTCTAAATGGGCTAAACATAGAAAAATCATCAACCCTGCATTTCACATGGAGAAGTTGAAGGTATTCTTCTTCTTTCTATTTTTTTTTCCTAAAAAAATAAAAATAAAAATAAAAAATAAATCCGAAAACAAAAATATGAATTTTACCTGAGAATGGTAAACTTAAAGGAGGCATCATCTTAATGGAGTATGTTAAAGATGTTGCAAAGATCGATGGACCTCACTAATCCTTATAAGATTTATAAATCTTTACTCTTCTCATTGTCAATTGGTTGTGATAAAGATAAAGTTGATGATGTTTAAATTGTGAAATTTTTGCAGGAAATGTTCCCAGCATTCTCCAAGAGTTGTGGAGAGATGGTTAACAAATGGGAAAAGATGATTCCTGAAGAAGGATGTTGTGAAATAGATGTGTGGCCTGATTTGCAAAATATGGCTGCAGATGTGATTTCTAGAACTGCATTTGGAAGTAGTTTTGAGGAAGGAAAAAAGATTTTCCAGCTCTTGAAAGAGTGGGCTAAGCTATTGATGACATACCTAACAAAAAGAGCTTATTACATTCCAGGGTTTAGGTAAGATATAATTTTCAATGGATTATGAATCTGTTGCTATAATTTAAACTGATGTCAATTAAATAAATGCCCATTTATTTCCAAAAAAAAAAAAAAGTTAAATCAATGCTGATTTGATCTTTATGGAAAAATATCATAATTTTGGTTCATATGTTTTTAATATAGTTTTCATTTGGTGTTTATGTTTCAAAATGTCGTTTCGTTATTACTCAGTAACTAATTGACAATAAAAGAAGTAGTCTAAGACTTGTTTGGATTAATTTAAAAAATAAATATTTTTCAAAAATTCATTTCATTTAAACACTTTTGACAAGAGCTGTTTAAAATAAAAATACTAATGGCTTTTAAAAATGACTTGTATTTCTTTCCAAATGACTTATTTTTTAAATTAAACATTTAAAAAGGTATTCTAAACACACTCTTAAGTATTTTTTTTAAAGATTGTGAGGTCTCCTTAATTTTCTAATGTAGGATTCTGATCAATGCGTCCTTTCAAGATGATACCTCTTATGCGTTCACCATTTTTTAGTCGGATCCCAATTTTTTGGACGATCGAATGCCCATTTGAGTTTCATGGGCTATGATAGCATATTATATTTAGTTCCTCTTTGATGATATCTTTATGTTTCAAAATTTACCTTTTTAGTCTTAGATTTTCACTTATGCTCACCTATTGGCTCTTGGTGCTAACTTTTTAGTCAATTAGTTTTAAATAATTATGATAATTTATTTATTATTTTTAATAGAGATGAAAAATTAGTGAAATCTAATTTAATTATAATTAATTGAATTATTTTTAAGTAATTAATAAAAGAATGAATATAAATGACAAAAATGCAACTTTTGAAAACCTAAAATCAAATGGAAATTGATGGTAACCAAAATAAAAACGAAACTCAAAACATAGAGATAAAATATACTCAAAACTTGTGAATTAATATGATATATTTTTTTCTTCGACTAGAAATAAACTAATTCTTTCGAGAAAGTAATCTTTGTGCAATCTTTCGATTGAAAAATTGTGTAAAATATAAACTAAGAAAGTATTTTCATTGAATATATTAATTCTTGATGAATTGTATACTTTTTTTTAGGTATATACCAACAAAATTAAACAAGAGGATGGAAGAGATTGATAGGAAAATACGAGATATGGTTTGGGGTATTATAAGCAAAAGACAAAATGGTATGAAGAAAGGTGAAGCTTCCAATAATGAAGATCTATTACGCATCCTTTTAGAATCAAATGCAAGTCAAATTGAAGAACAAAAAATCAAGAATAAAGAAGTTGGAATGAGCATTGAAGAAGTAATAAGTGAATGCAGACTTTTCTATTTTGCTGGCCAAGAAACCACAGCTGCATTGCTTGCTTGGACAATGGTTCTACTTGGTCGATATTCAGAGTGGCAAGATCGAGCAAGAGCAGAAGTCTTGGAGGTTTTTGGTGATAACAACAAATTGGATTTTGATGGTCTAAGTCGTCTAAAAATAGTAAGTAATAATCATTTAATTTATATTACCTTCATTTTTTTTTAGAAAAAAATAATTACAAGTTTTACTCATCTACTTTTATATTTATATTTATGTCTATTTAGTAAGTATTTAATCTTTAAAAGTATTTAATCAATCTCGGAATTTAATTTTGTTAACAGTTGATCTGTAAACATTAAAAAGCATCTAATAGGTACTTAAAAAACATTCAATTTTTTGTCCGATATGTTTCTAAATATTCAAGTTGTATCTAATAGGTCATCGACCGAATTGACATTTCTAAAAATTGATGAATCTACTAGACTCAAAATTGAAAGTTTAGGATACTATTAGACATCAAATTCAACTTTTACATTTAATAGATCACTTTATTTTTAAAATCCTAAATTAAGTAAGTAGGCATGTTTTATTAGACAGAAAATTCATAAAGTTCAATAACCTATTAGACATTTTAAAGTTCAGAAACTTATAGGTATAAAAATTTAAAGTCTAATGGTTTTTGGATCTTAAAAATTAAGCCTACAAATACTACTTTTGTTTGTTTTGTTGTTTACATTTTAGGAGTGTTTTCAAAATCCAAGTCAATATTTGAAAATTAATAAAAAATAACTTTTAAAAAAATATTTTTGTTTTAAATCATAATAGGTTGACCTAGCAGTAAATAGGGGGCATGATCTTGATAAACCGCTAATAGGTCATAGATTCAATCTAGGGTGGCCACCTACCTAAGATTTAATATCTTACAGAGTCAGGCAAGTTGTCCTTGAGATTAGTCGAAGTGTGCGTAAACTGACCTGAACGCTCATAAATTAAAAAAAAAGTATTTAGACAATTTATAAGTACAAAAACCTTTTAGATACAAACTTGGAAATTCATGAACTATATATTACTAATGCTCTTTTAAGTTCAATCTTTAAAGCTGTTCTTTTCTTTTTTTCTTATTCACTTAATGTTTAAAGGGAAATGTAAAAATTTTGCCTCAAATTTTGGTGATCAACTTAGATAGTTGGCCTGAATTAGTATATTGTTATATATTGATTAATCCTTTATTGATGTGGATAACACAGGTAACTATGATACTAAATGAAGTTCTTAGGCTATATCCTCCTGTTGGAATGCTAGCTCGAGAGCTTCACAATGAAACAAAATTGGAGAATTTGACATTACCAGTTGGAGTATCAATAGGAATACCAATTTTATGCATACACCGAGATTCCAAAATATGGGGTGAAGATGCAAGTGAGTTCAAACCAGAAAGATTTTCGGAAGGAATTTCTAAAGCAACAAAGAATCAAGTATGCTTTATCCCTTTTGGATGGGGTCCTCGAATTTGTGTTGGTCAAAATTTTGCAATGATTGAAGCCAAAATAGCATTATCCATGATTTTGCAACAATTCTCATTTACAATTTCTCCAACTTATACTCATGCTCCTATCTCAAACATCACTATTCAGCCACAACATGGGGCTCATCTCATCCTTCGTAAGCTGTAA

mRNA sequence

ATGTGGAATTGGATATGGGTGGTGGGTTTATTGTGGTTATTGGGATTATGGGGATGGAGAATTGCGAATTGGGTTTGGTTGAGGCCGAAAAGGCTCGAAAAGTACTTGAGACACCAAGGCCTCGCCGGCAACTCTTATCGCTTTCTGTTTGGGGATACAAAGGATATTGCGGCGGCCGTCCGCCAAGCAAGATCGCACTCCATGACCTTCTCCCATGATATTGCTCCACGCGCCACCCCTACCTCTCATCTCACCATCCACAAATACGGTAAGGATTCTTTCACATGGGTTGGCACAACTCCAAGAGTATATATAACAGAGCCTGAACAAGTGAAAATTGTTTTCTCTCAAATAAATGATATTCGTAAGACATCTTCTTTTCCCCTTAGACGAAGAATTGGAGGTGGGCTTGTCTCTCTTGAGGGTTCTAAATGGGCTAAACATAGAAAAATCATCAACCCTGCATTTCACATGGAGAAGTTGAAGGAAATGTTCCCAGCATTCTCCAAGAGTTGTGGAGAGATGGTTAACAAATGGGAAAAGATGATTCCTGAAGAAGGATGTTGTGAAATAGATGTGTGGCCTGATTTGCAAAATATGGCTGCAGATGTGATTTCTAGAACTGCATTTGGAAGTAGTTTTGAGGAAGGAAAAAAGATTTTCCAGCTCTTGAAAGAGTGGGCTAAGCTATTGATGACATACCTAACAAAAAGAGCTTATTACATTCCAGGGTTTAGGTATATACCAACAAAATTAAACAAGAGGATGGAAGAGATTGATAGGAAAATACGAGATATGGTTTGGGGTATTATAAGCAAAAGACAAAATGGTATGAAGAAAGGTGAAGCTTCCAATAATGAAGATCTATTACGCATCCTTTTAGAATCAAATGCAAGTCAAATTGAAGAACAAAAAATCAAGAATAAAGAAGTTGGAATGAGCATTGAAGAAGTAATAAGTGAATGCAGACTTTTCTATTTTGCTGGCCAAGAAACCACAGCTGCATTGCTTGCTTGGACAATGGTTCTACTTGGTCGATATTCAGAGTGGCAAGATCGAGCAAGAGCAGAAGTCTTGGAGGTTTTTGGTGATAACAACAAATTGGATTTTGATGGTCTAAGTCGTCTAAAAATAGTAACTATGATACTAAATGAAGTTCTTAGGCTATATCCTCCTGTTGGAATGCTAGCTCGAGAGCTTCACAATGAAACAAAATTGGAGAATTTGACATTACCAGTTGGAGTATCAATAGGAATACCAATTTTATGCATACACCGAGATTCCAAAATATGGGGTGAAGATGCAAGTGAGTTCAAACCAGAAAGATTTTCGGAAGGAATTTCTAAAGCAACAAAGAATCAAGTATGCTTTATCCCTTTTGGATGGGGTCCTCGAATTTGTGTTGGTCAAAATTTTGCAATGATTGAAGCCAAAATAGCATTATCCATGATTTTGCAACAATTCTCATTTACAATTTCTCCAACTTATACTCATGCTCCTATCTCAAACATCACTATTCAGCCACAACATGGGGCTCATCTCATCCTTCGTAAGCTGTAA

Coding sequence (CDS)

ATGTGGAATTGGATATGGGTGGTGGGTTTATTGTGGTTATTGGGATTATGGGGATGGAGAATTGCGAATTGGGTTTGGTTGAGGCCGAAAAGGCTCGAAAAGTACTTGAGACACCAAGGCCTCGCCGGCAACTCTTATCGCTTTCTGTTTGGGGATACAAAGGATATTGCGGCGGCCGTCCGCCAAGCAAGATCGCACTCCATGACCTTCTCCCATGATATTGCTCCACGCGCCACCCCTACCTCTCATCTCACCATCCACAAATACGGTAAGGATTCTTTCACATGGGTTGGCACAACTCCAAGAGTATATATAACAGAGCCTGAACAAGTGAAAATTGTTTTCTCTCAAATAAATGATATTCGTAAGACATCTTCTTTTCCCCTTAGACGAAGAATTGGAGGTGGGCTTGTCTCTCTTGAGGGTTCTAAATGGGCTAAACATAGAAAAATCATCAACCCTGCATTTCACATGGAGAAGTTGAAGGAAATGTTCCCAGCATTCTCCAAGAGTTGTGGAGAGATGGTTAACAAATGGGAAAAGATGATTCCTGAAGAAGGATGTTGTGAAATAGATGTGTGGCCTGATTTGCAAAATATGGCTGCAGATGTGATTTCTAGAACTGCATTTGGAAGTAGTTTTGAGGAAGGAAAAAAGATTTTCCAGCTCTTGAAAGAGTGGGCTAAGCTATTGATGACATACCTAACAAAAAGAGCTTATTACATTCCAGGGTTTAGGTATATACCAACAAAATTAAACAAGAGGATGGAAGAGATTGATAGGAAAATACGAGATATGGTTTGGGGTATTATAAGCAAAAGACAAAATGGTATGAAGAAAGGTGAAGCTTCCAATAATGAAGATCTATTACGCATCCTTTTAGAATCAAATGCAAGTCAAATTGAAGAACAAAAAATCAAGAATAAAGAAGTTGGAATGAGCATTGAAGAAGTAATAAGTGAATGCAGACTTTTCTATTTTGCTGGCCAAGAAACCACAGCTGCATTGCTTGCTTGGACAATGGTTCTACTTGGTCGATATTCAGAGTGGCAAGATCGAGCAAGAGCAGAAGTCTTGGAGGTTTTTGGTGATAACAACAAATTGGATTTTGATGGTCTAAGTCGTCTAAAAATAGTAACTATGATACTAAATGAAGTTCTTAGGCTATATCCTCCTGTTGGAATGCTAGCTCGAGAGCTTCACAATGAAACAAAATTGGAGAATTTGACATTACCAGTTGGAGTATCAATAGGAATACCAATTTTATGCATACACCGAGATTCCAAAATATGGGGTGAAGATGCAAGTGAGTTCAAACCAGAAAGATTTTCGGAAGGAATTTCTAAAGCAACAAAGAATCAAGTATGCTTTATCCCTTTTGGATGGGGTCCTCGAATTTGTGTTGGTCAAAATTTTGCAATGATTGAAGCCAAAATAGCATTATCCATGATTTTGCAACAATTCTCATTTACAATTTCTCCAACTTATACTCATGCTCCTATCTCAAACATCACTATTCAGCCACAACATGGGGCTCATCTCATCCTTCGTAAGCTGTAA

Protein sequence

MWNWIWVVGLLWLLGLWGWRIANWVWLRPKRLEKYLRHQGLAGNSYRFLFGDTKDIAAAVRQARSHSMTFSHDIAPRATPTSHLTIHKYGKDSFTWVGTTPRVYITEPEQVKIVFSQINDIRKTSSFPLRRRIGGGLVSLEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSCGEMVNKWEKMIPEEGCCEIDVWPDLQNMAADVISRTAFGSSFEEGKKIFQLLKEWAKLLMTYLTKRAYYIPGFRYIPTKLNKRMEEIDRKIRDMVWGIISKRQNGMKKGEASNNEDLLRILLESNASQIEEQKIKNKEVGMSIEEVISECRLFYFAGQETTAALLAWTMVLLGRYSEWQDRARAEVLEVFGDNNKLDFDGLSRLKIVTMILNEVLRLYPPVGMLARELHNETKLENLTLPVGVSIGIPILCIHRDSKIWGEDASEFKPERFSEGISKATKNQVCFIPFGWGPRICVGQNFAMIEAKIALSMILQQFSFTISPTYTHAPISNITIQPQHGAHLILRKL
Homology
BLAST of HG10019716 vs. NCBI nr
Match: XP_038904026.1 (cytochrome P450 CYP72A219-like [Benincasa hispida])

HSP 1 Score: 958.7 bits (2477), Expect = 2.0e-275
Identity = 471/520 (90.58%), Postives = 491/520 (94.42%), Query Frame = 0

Query: 2   WNWIWVVGLLWLLGLWGWRIANWVWLRPKRLEKYLRHQGLAGNSYRFLFGDTKDIAAAVR 61
           WNWIW VGLLWLLGLWGWR+ NWVW RPKRLEK LRHQGLAGNSYR LFGDTK+IAAAV 
Sbjct: 3   WNWIWAVGLLWLLGLWGWRVVNWVWFRPKRLEKCLRHQGLAGNSYRLLFGDTKEIAAAVH 62

Query: 62  QARSHSMTFSHDIAPRATPTSHLTIHKYGKDSFTWVGTTPRVYITEPEQVKIVFSQINDI 121
           QARSHSM+FSHDIAPR TPTSH TIHKYGKDSFTW+GTTPRVYITEPEQVKIVFSQINDI
Sbjct: 63  QARSHSMSFSHDIAPRTTPTSHTTIHKYGKDSFTWIGTTPRVYITEPEQVKIVFSQINDI 122

Query: 122 RKTSSFPLRRRIG-GGLVSLEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSCGEMVNKWE 181
           RKTSSFPLRRR G GG+VSLEGSKWAKHRKIINPAFHMEKLK+MFPAFSKSC EMVNKWE
Sbjct: 123 RKTSSFPLRRRKGSGGVVSLEGSKWAKHRKIINPAFHMEKLKDMFPAFSKSCREMVNKWE 182

Query: 182 KMIP-EEGCCEIDVWPDLQNMAADVISRTAFGSSFEEGKKIFQLLKEWAKLLMTYLTKRA 241
           KMIP EEG CEIDVWPDLQNMAADVISRTAFGSSFEEGKKIFQLLK+WA LLMTYLTK  
Sbjct: 183 KMIPAEEGSCEIDVWPDLQNMAADVISRTAFGSSFEEGKKIFQLLKQWAMLLMTYLTKGV 242

Query: 242 YYIPGFRYIPTKLNKRMEEIDRKIRDMVWGIISKRQNGMKKGEASNNEDLLRILLESNAS 301
           Y+IPGFRY+PTKLNKRMEEIDRKI+DMVWGII KRQN MKKGE SNNEDLL ILLESNAS
Sbjct: 243 YFIPGFRYMPTKLNKRMEEIDRKIQDMVWGIIRKRQNAMKKGEGSNNEDLLGILLESNAS 302

Query: 302 QIEEQKIKNKEVGMSIEEVISECRLFYFAGQETTAALLAWTMVLLGRYSEWQDRARAEVL 361
           QIEE K K K+VGMSIEEVISECRLFYFAGQETTAALLAWTMVLL RYSEWQDRARAEVL
Sbjct: 303 QIEEHKNK-KDVGMSIEEVISECRLFYFAGQETTAALLAWTMVLLARYSEWQDRARAEVL 362

Query: 362 EVFGDNNKLDFDGLSRLKIVTMILNEVLRLYPPVGMLARELHNETKLENLTLPVGVSIGI 421
           EVFGD+NKLDFDGLSRL++V MILNEVLRLYPPVGMLARE+HNETKL NLTLPVGVSIGI
Sbjct: 363 EVFGDSNKLDFDGLSRLRVVNMILNEVLRLYPPVGMLAREIHNETKLGNLTLPVGVSIGI 422

Query: 422 PILCIHRDSKIWGEDASEFKPERFSEGISKATKNQVCFIPFGWGPRICVGQNFAMIEAKI 481
           PIL IH+++KIWGEDA EFKPERFSEGISKATKNQVCFIPFGWGPRIC+GQNFAMIEAKI
Sbjct: 423 PILSIHQNTKIWGEDAIEFKPERFSEGISKATKNQVCFIPFGWGPRICLGQNFAMIEAKI 482

Query: 482 ALSMILQQFSFTISPTYTHAPISNITIQPQHGAHLILRKL 520
           ALSMILQQFSFT+SP YTHAPISNITIQPQHGAHLILRKL
Sbjct: 483 ALSMILQQFSFTLSPAYTHAPISNITIQPQHGAHLILRKL 521

BLAST of HG10019716 vs. NCBI nr
Match: XP_008443405.1 (PREDICTED: cytochrome P450 CYP72A219-like [Cucumis melo])

HSP 1 Score: 924.1 bits (2387), Expect = 5.4e-265
Identity = 457/521 (87.72%), Postives = 485/521 (93.09%), Query Frame = 0

Query: 2   WNWIWVVGLLW---LLGLWGWRIANWVWLRPKRLEKYLRHQGLAGNSYRFLFGDTKDIAA 61
           +NWIW VGLLW   LLGLWGWRI NWVWLRPKRLEK LR QGLAGNSYRFLFGDTK+I  
Sbjct: 6   YNWIWGVGLLWFLGLLGLWGWRIVNWVWLRPKRLEKLLRQQGLAGNSYRFLFGDTKEIGV 65

Query: 62  AVRQARSHSMTFSHDIAPRATPTSHLTIHKYGKDSFTWVGTTPRVYITEPEQVKIVFSQI 121
           AVRQARS SMTFSHDIA RATP+S+ TIHKYGKDSFTW+GTTPRVYITEPEQVKIVFSQI
Sbjct: 66  AVRQARSQSMTFSHDIASRATPSSYPTIHKYGKDSFTWIGTTPRVYITEPEQVKIVFSQI 125

Query: 122 NDIRKTSSFPLRRRIGGGLVSLEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSCGEMVNK 181
           NDIRKTSSFPLRRR+G GLV+LEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSC EMVNK
Sbjct: 126 NDIRKTSSFPLRRRMGSGLVTLEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSCREMVNK 185

Query: 182 WEKMIPEEGCCEIDVWPDLQNMAADVISRTAFGSSFEEGKKIFQLLKEWAKLLMTYLTKR 241
           WEKMI E+G  EIDVWPDLQN+AADVISRTAFGSS++EGKKIFQLLKEWA LLM+YLTKR
Sbjct: 186 WEKMISEKGSGEIDVWPDLQNLAADVISRTAFGSSYDEGKKIFQLLKEWAVLLMSYLTKR 245

Query: 242 AYYIPGFRYIPTKLNKRMEEIDRKIRDMVWGIISKRQNGMKKGEASNNEDLLRILLESNA 301
           AYYIPG RYIPTKLNKRM+EID KIRD+V GII+KRQN MKKGEAS NEDLL ILLESN 
Sbjct: 246 AYYIPGARYIPTKLNKRMQEIDMKIRDLVRGIINKRQNAMKKGEASKNEDLLGILLESNE 305

Query: 302 SQIEEQKIKNKEVGMSIEEVISECRLFYFAGQETTAALLAWTMVLLGRYSEWQDRARAEV 361
           +QIEEQK K K+VGMSIEEVISECRLFYFAGQETTA LLAWTMVLLGRYSEWQDRARAEV
Sbjct: 306 TQIEEQKNK-KDVGMSIEEVISECRLFYFAGQETTAVLLAWTMVLLGRYSEWQDRARAEV 365

Query: 362 LEVFGDNNKLDFDGLSRLKIVTMILNEVLRLYPPVGMLARELHNETKLENLTLPVGVSIG 421
           LEVFG+N KLDFDGLSRL++V+MILNEVLRLYPPVGMLARE+HNETKL NLTLP GVSIG
Sbjct: 366 LEVFGNNKKLDFDGLSRLRVVSMILNEVLRLYPPVGMLAREVHNETKLGNLTLPSGVSIG 425

Query: 422 IPILCIHRDSKIWGEDASEFKPERFSEGISKATKNQVCFIPFGWGPRICVGQNFAMIEAK 481
           IPIL +H++ KIWGED  EF PERFSEGISKATKNQVCFIPFGWGPRIC+GQNFAMIEAK
Sbjct: 426 IPILSMHQNPKIWGEDVLEFNPERFSEGISKATKNQVCFIPFGWGPRICIGQNFAMIEAK 485

Query: 482 IALSMILQQFSFTISPTYTHAPISNITIQPQHGAHLILRKL 520
           IALSMILQQFSFT+SPTYTHAPI+NITIQPQHGAHLILRKL
Sbjct: 486 IALSMILQQFSFTLSPTYTHAPITNITIQPQHGAHLILRKL 525

BLAST of HG10019716 vs. NCBI nr
Match: XP_004151308.2 (cytochrome P450 CYP72A219 [Cucumis sativus] >KGN59713.2 hypothetical protein Csa_000733 [Cucumis sativus])

HSP 1 Score: 923.3 bits (2385), Expect = 9.2e-265
Identity = 454/520 (87.31%), Postives = 480/520 (92.31%), Query Frame = 0

Query: 3   NWIWVVGLLW---LLGLWGWRIANWVWLRPKRLEKYLRHQGLAGNSYRFLFGDTKDIAAA 62
           NWIW VGLLW   LLGLWGWRI NWVWLRPKRLEK LR QGLAGNSYRFLFGDTK+I  A
Sbjct: 6   NWIWGVGLLWFLGLLGLWGWRIVNWVWLRPKRLEKLLRQQGLAGNSYRFLFGDTKEIGVA 65

Query: 63  VRQARSHSMTFSHDIAPRATPTSHLTIHKYGKDSFTWVGTTPRVYITEPEQVKIVFSQIN 122
           VRQAR  SMTFSHDIA RATP+S+ TIHKYGK+SFTW+GTTPRVYITEPEQVKI FSQIN
Sbjct: 66  VRQARLQSMTFSHDIASRATPSSYPTIHKYGKNSFTWIGTTPRVYITEPEQVKIAFSQIN 125

Query: 123 DIRKTSSFPLRRRIGGGLVSLEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSCGEMVNKW 182
           DIRKTSSFPLRRR+G GLV+LEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSC EMVNKW
Sbjct: 126 DIRKTSSFPLRRRMGSGLVTLEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSCREMVNKW 185

Query: 183 EKMIPEEGCCEIDVWPDLQNMAADVISRTAFGSSFEEGKKIFQLLKEWAKLLMTYLTKRA 242
           EKMI EE  CEIDVWPDLQNM ADVISRTAFGSS++EGKKIFQLLKEWA LLM+YLTKR 
Sbjct: 186 EKMINEEESCEIDVWPDLQNMTADVISRTAFGSSYDEGKKIFQLLKEWAMLLMSYLTKRG 245

Query: 243 YYIPGFRYIPTKLNKRMEEIDRKIRDMVWGIISKRQNGMKKGEASNNEDLLRILLESNAS 302
           YYIPG RY+PTKLN RM+EID KIRDMV GII+KRQNGMKKGEASNNEDLL ILLESNAS
Sbjct: 246 YYIPGARYVPTKLNNRMQEIDTKIRDMVRGIINKRQNGMKKGEASNNEDLLGILLESNAS 305

Query: 303 QIEEQKIKNKEVGMSIEEVISECRLFYFAGQETTAALLAWTMVLLGRYSEWQDRARAEVL 362
           QIEE K K K+VGMSIEEVISECRLFYFAGQETTA LLAWTMVLLGRY EWQDRARAEVL
Sbjct: 306 QIEEHKNK-KDVGMSIEEVISECRLFYFAGQETTAVLLAWTMVLLGRYPEWQDRARAEVL 365

Query: 363 EVFGDNNKLDFDGLSRLKIVTMILNEVLRLYPPVGMLARELHNETKLENLTLPVGVSIGI 422
           EVFGDN KLDFDGLSRL++V MILNEVLRLYPPVGMLARE+HNETKL NLTLP GVSIG+
Sbjct: 366 EVFGDNKKLDFDGLSRLRVVNMILNEVLRLYPPVGMLAREIHNETKLGNLTLPCGVSIGV 425

Query: 423 PILCIHRDSKIWGEDASEFKPERFSEGISKATKNQVCFIPFGWGPRICVGQNFAMIEAKI 482
           PIL +H++ KIWGEDA EF PERF+EGISKATKNQVCFIPFGWGPRIC+GQNFAMIEAKI
Sbjct: 426 PILSMHQNPKIWGEDALEFNPERFAEGISKATKNQVCFIPFGWGPRICIGQNFAMIEAKI 485

Query: 483 ALSMILQQFSFTISPTYTHAPISNITIQPQHGAHLILRKL 520
           ALSMILQQFSFT+SPTYTHAPI++ITIQPQHGAHLILRKL
Sbjct: 486 ALSMILQQFSFTLSPTYTHAPITHITIQPQHGAHLILRKL 524

BLAST of HG10019716 vs. NCBI nr
Match: TYK25638.1 (cytochrome P450 CYP72A219-like protein [Cucumis melo var. makuwa])

HSP 1 Score: 898.3 bits (2320), Expect = 3.2e-257
Identity = 445/519 (85.74%), Postives = 475/519 (91.52%), Query Frame = 0

Query: 2   WNWIWVVGLLWLLGLWGWRIANWVWLRPKRLEKYLRHQGLAGNSYRFLFGDTKDIAAAVR 61
           WNWIWVVGLLW+LGLWGWRI NWVW RPKR+EK LR QGLAGNSYRFLFGDTK+I AAVR
Sbjct: 3   WNWIWVVGLLWVLGLWGWRIVNWVWFRPKRVEKLLRQQGLAGNSYRFLFGDTKEITAAVR 62

Query: 62  QAR-SHSMTFSHDIAPRATPTSHLTIHKYGKDSFTWVGTTPRVYITEPEQVKIVFSQIND 121
           +AR S  M+FSH I PR TP S+ TIHKYGKDSFTW+GTTPRVYITEPEQVKIVFSQIND
Sbjct: 63  RARTSQPMSFSHHIDPRTTPYSYPTIHKYGKDSFTWIGTTPRVYITEPEQVKIVFSQIND 122

Query: 122 IRKTSSFPLRRRIGGGLVSLEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSCGEMVNKWE 181
           IRKTSSFPLRRR+G GLV+LEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSC EMVNKWE
Sbjct: 123 IRKTSSFPLRRRMGSGLVTLEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSCREMVNKWE 182

Query: 182 KMIPEEGCCEIDVWPDLQNMAADVISRTAFGSSFEEGKKIFQLLKEWAKLLMTYLTKRAY 241
           KMI E+G  EIDVWPDLQN+AADVISRTAFGSS++EGKKIFQLLKEWA LLM+YLTKRAY
Sbjct: 183 KMISEKGSGEIDVWPDLQNLAADVISRTAFGSSYDEGKKIFQLLKEWAVLLMSYLTKRAY 242

Query: 242 YIPGFRYIPTKLNKRMEEIDRKIRDMVWGIISKRQNGMKKGEASNNEDLLRILLESNASQ 301
           YIPG RY     N  M+EID KIRD+V GII+KRQN MKKGEAS NEDLL ILLESN +Q
Sbjct: 243 YIPGARYD----NSMMQEIDMKIRDLVRGIINKRQNAMKKGEASKNEDLLGILLESNETQ 302

Query: 302 IEEQKIKNKEVGMSIEEVISECRLFYFAGQETTAALLAWTMVLLGRYSEWQDRARAEVLE 361
           IEEQK K K+VGMSIEEVISECRLFYFAGQETTA LLAWTMVLLGRYSEWQDRARAEVLE
Sbjct: 303 IEEQKNK-KDVGMSIEEVISECRLFYFAGQETTAVLLAWTMVLLGRYSEWQDRARAEVLE 362

Query: 362 VFGDNNKLDFDGLSRLKIVTMILNEVLRLYPPVGMLARELHNETKLENLTLPVGVSIGIP 421
           VFG+N KLDFDGLSRL++V+MILNEVLRLYPPVGMLARE+HNETKL NLTLP GVSIGIP
Sbjct: 363 VFGNNKKLDFDGLSRLRVVSMILNEVLRLYPPVGMLAREVHNETKLGNLTLPSGVSIGIP 422

Query: 422 ILCIHRDSKIWGEDASEFKPERFSEGISKATKNQVCFIPFGWGPRICVGQNFAMIEAKIA 481
           IL +H++ KIWGED  EF PERFSEGISKATKNQVCFIPFGWGPRIC+GQNFAMIEAKIA
Sbjct: 423 ILSMHQNPKIWGEDVLEFNPERFSEGISKATKNQVCFIPFGWGPRICIGQNFAMIEAKIA 482

Query: 482 LSMILQQFSFTISPTYTHAPISNITIQPQHGAHLILRKL 520
           LSMILQQFSFT+SPTYTHAPI+NITIQPQHGAHLILRKL
Sbjct: 483 LSMILQQFSFTLSPTYTHAPITNITIQPQHGAHLILRKL 516

BLAST of HG10019716 vs. NCBI nr
Match: XP_023529493.1 (cytochrome P450 CYP72A219-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 894.4 bits (2310), Expect = 4.6e-256
Identity = 430/519 (82.85%), Postives = 478/519 (92.10%), Query Frame = 0

Query: 1   MWNWIWVVGLLWLLGLWGWRIANWVWLRPKRLEKYLRHQGLAGNSYRFLFGDTKDIAAAV 60
           MWNWIW VG L LLG+WGW+I NW+WLRPKRLEK LRHQGLAGNSYR LFGD+KD AAAV
Sbjct: 1   MWNWIWAVG-LGLLGVWGWKILNWMWLRPKRLEKCLRHQGLAGNSYRLLFGDSKDTAAAV 60

Query: 61  RQARSHSMTFSHDIAPRATPTSHLTIHKYGKDSFTWVGTTPRVYITEPEQVKIVFSQIND 120
           RQA++  ++FSH IA RATP+SH+TI +YGKDSFTW+G TPRVYIT PEQVK+VFSQIND
Sbjct: 61  RQAKTQPISFSHRIAARATPSSHITIDQYGKDSFTWIGPTPRVYITHPEQVKMVFSQIND 120

Query: 121 IRKTSSFPLRRRIGGGLVSLEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSCGEMVNKWE 180
           IRKT+SFP RRR+GGG+VSLEG+KWAKHRKIINPAFHMEKLK+MFPAFS+SC EM+NKWE
Sbjct: 121 IRKTTSFPFRRRMGGGVVSLEGAKWAKHRKIINPAFHMEKLKDMFPAFSQSCSEMINKWE 180

Query: 181 KMIPEEGCCEIDVWPDLQNMAADVISRTAFGSSFEEGKKIFQLLKEWAKLLMTYLTKRAY 240
            MI +EGCCE+DVWPDLQNMAADVISRTAFGSS+EEGKKIFQLLK+WA LLM Y+ K  Y
Sbjct: 181 MMITKEGCCELDVWPDLQNMAADVISRTAFGSSYEEGKKIFQLLKQWASLLMAYVMKGVY 240

Query: 241 YIPGFRYIPTKLNKRMEEIDRKIRDMVWGIISKRQNGMKKGEASNNEDLLRILLESNASQ 300
           +IPG R++PTKLNKRM+EI+  IRDM+WGII+KRQN MKKGEAS N+DLL ILLESN+ +
Sbjct: 241 FIPGLRFLPTKLNKRMDEINGNIRDMIWGIINKRQNAMKKGEAS-NDDLLGILLESNSRE 300

Query: 301 IEEQKIKNKEVGMSIEEVISECRLFYFAGQETTAALLAWTMVLLGRYSEWQDRARAEVLE 360
           I+E K K K+VGMSIEEVI+ECRLFYFAGQETTAALLAWTMVLLGRYSEWQDRARAEVLE
Sbjct: 301 IQEHKNK-KDVGMSIEEVIAECRLFYFAGQETTAALLAWTMVLLGRYSEWQDRARAEVLE 360

Query: 361 VFGDNNKLDFDGLSRLKIVTMILNEVLRLYPPVGMLARELHNETKLENLTLPVGVSIGIP 420
           VFGDN K DFDGLSRLKIVTMILNEVLRLYPPVGMLARE+H ETKL NLTLPVGVSIGIP
Sbjct: 361 VFGDNKKFDFDGLSRLKIVTMILNEVLRLYPPVGMLAREIHKETKLGNLTLPVGVSIGIP 420

Query: 421 ILCIHRDSKIWGEDASEFKPERFSEGISKATKNQVCFIPFGWGPRICVGQNFAMIEAKIA 480
           I+CIH+D  IWGEDASEFKPERF+EGISKATKNQVCFIPFGWGPRIC+GQNFAMIEAKIA
Sbjct: 421 IVCIHQDPTIWGEDASEFKPERFAEGISKATKNQVCFIPFGWGPRICLGQNFAMIEAKIA 480

Query: 481 LSMILQQFSFTISPTYTHAPISNITIQPQHGAHLILRKL 520
           LSMILQ+FSF +SP+YTHAPISN+TIQPQHGAHLIL +L
Sbjct: 481 LSMILQRFSFELSPSYTHAPISNVTIQPQHGAHLILHRL 516

BLAST of HG10019716 vs. ExPASy Swiss-Prot
Match: H1A988 (11-oxo-beta-amyrin 30-oxidase OS=Glycyrrhiza uralensis OX=74613 GN=CYP72A154 PE=1 SV=1)

HSP 1 Score: 571.6 bits (1472), Expect = 8.9e-162
Identity = 293/520 (56.35%), Postives = 373/520 (71.73%), Query Frame = 0

Query: 4   WIWVVGLLWLLGLWGWRIANWVWLRPKRLEKYLRHQGLAGNSYRFLFGDTKD--IAAAVR 63
           W+ +  +L  + +W   + N +WLRPKRLE++LR QGL G+ Y+    ++K   +    +
Sbjct: 11  WVVLTVILAAIPIWVCHMVNTLWLRPKRLERHLRAQGLHGDPYKLSLDNSKQTYMLKLQQ 70

Query: 64  QARSHSMTFS-HDIAPRATPTSHLTIHKYGKDSFTWVGTTPRVYITEPEQVKIVFSQIND 123
           +A+S S+  S  D APR    +H T+HKYGK+SF W GT P+V IT+PEQ+K VF++I D
Sbjct: 71  EAQSKSIGLSKDDAAPRIFSLAHQTVHKYGKNSFAWEGTAPKVIITDPEQIKEVFNKIQD 130

Query: 124 IRKTSSFPLRRRIGGGLVSLEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSCGEMVNKWE 183
             K    P+ + I  GLV  EG KWAKHRKIINPAFH+EKLK M PAFS SC EM++KW+
Sbjct: 131 FPKPKLNPIAKYISIGLVQYEGDKWAKHRKIINPAFHLEKLKGMLPAFSHSCHEMISKWK 190

Query: 184 KMIPEEGCCEIDVWPDLQNMAADVISRTAFGSSFEEGKKIFQLLKEWAKLLMTYLTKRAY 243
            ++  +G CE+DVWP LQN+  DVISRTAFGSS+ EG KIF+LLK     LM   T R  
Sbjct: 191 GLLSSDGTCEVDVWPFLQNLTCDVISRTAFGSSYAEGAKIFELLKRQGYALM---TARYA 250

Query: 244 YIPGFRYIPTKLNKRMEEIDRKIRDMVWGIISKRQNGMKKGEASNNEDLLRILLESNASQ 303
            IP +  +P+   +RM+EI+R IRD + GII KR+  +K G+ S ++DLL ILL+SN   
Sbjct: 251 RIPLWWLLPSTTKRRMKEIERGIRDSLEGIIRKREKALKSGK-STDDDLLGILLQSN--H 310

Query: 304 IEEQKIKN-KEVGMSIEEVISECRLFYFAGQETTAALLAWTMVLLGRYSEWQDRARAEVL 363
           IE +  +N K  GM+ +EV+ EC+LFY AGQETTAALLAWTMVLLG++ EWQ RAR EVL
Sbjct: 311 IENKGDENSKSAGMTTQEVMEECKLFYLAGQETTAALLAWTMVLLGKHPEWQARARQEVL 370

Query: 364 EVFGDNNKLDFDGLSRLKIVTMILNEVLRLYPPVGMLARELHNETKLENLTLPVGVSIGI 423
           +VFG+ N  +F+GL RLKIVTMIL EVLRLYPP   L R L  + KL NL LP GV + +
Sbjct: 371 QVFGNQNP-NFEGLGRLKIVTMILYEVLRLYPPGIYLTRALRKDLKLGNLLLPAGVQVSV 430

Query: 424 PILCIHRDSKIWGEDASEFKPERFSEGISKATKNQVCFIPFGWGPRICVGQNFAMIEAKI 483
           PIL IH D  IWG DA EF PERF+EGI+KATK QVC+ PFGWGPRICVGQNFA++EAKI
Sbjct: 431 PILLIHHDEGIWGNDAKEFNPERFAEGIAKATKGQVCYFPFGWGPRICVGQNFALLEAKI 490

Query: 484 ALSMILQQFSFTISPTYTHAPISNITIQPQHGAHLILRKL 520
            LS++LQ FSF +SPTY H P + +T+QP+HGA +IL KL
Sbjct: 491 VLSLLLQNFSFELSPTYAHVPTTVLTLQPKHGAPIILHKL 523

BLAST of HG10019716 vs. ExPASy Swiss-Prot
Match: A0A0S2IHL2 (Cytochrome P450 72A397 OS=Kalopanax septemlobus OX=228393 GN=CYP72A397 PE=1 SV=1)

HSP 1 Score: 571.6 bits (1472), Expect = 8.9e-162
Identity = 281/503 (55.86%), Postives = 360/503 (71.57%), Query Frame = 0

Query: 17  WGWRIANWVWLRPKRLEKYLRHQGLAGNSYRFLFGDTKDIAAAVRQARSHSMTFSHDIAP 76
           W W++ NWVW+ P++LE+ LR QG  GNSYR  +GD K+ +   R+A+   +  S D   
Sbjct: 24  WAWKVLNWVWVSPRKLEESLRKQGFRGNSYRLFYGDLKESSEMTRKAKLKPINLSDDPVL 83

Query: 77  RATPTSHLTIHKYGKDSFTWVGTTPRVYITEPEQVKIVFSQINDIRKTSSFPLRRRIGGG 136
           R  P  H T+ KYGK SF W+G TPRV I +PE +K +  +     K    PL +    G
Sbjct: 84  RVRPFIHQTVKKYGKSSFIWIGPTPRVQIMDPEIIKEIMVKSYKFNKPKRNPLVKLFADG 143

Query: 137 LVSLEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSCGEMVNKWEKMIPEEGCCEIDVWPD 196
           L + EG  WAKHRK++NPAFH+E+LK M PA   SC EMV+KW+KMI ++G  E+DVWP 
Sbjct: 144 LANHEGELWAKHRKLLNPAFHLERLKCMLPAMYFSCIEMVSKWDKMISKDGSRELDVWPF 203

Query: 197 LQNMAADVISRTAFGSSFEEGKKIFQLLKEWAKLLMTYLTKRAYYIPGFRYIPTKLNKRM 256
           LQ + +DVIS TAFGSS+EEG  +F+L  E A+L+M   T ++ YIPG+ Y+PTK N++M
Sbjct: 204 LQRLTSDVISHTAFGSSYEEGNIVFELQTEQAELVMK--TLQSVYIPGWSYLPTKRNRKM 263

Query: 257 EEIDRKIRDMVWGIISKRQNGMKKGEASNNEDLLRILLESNASQIEEQKIKNKEVGMSIE 316
           +EIDRK +  +  II+K+   M+ GE S  +D+L ILLESN  +   Q  KN  VGMSI+
Sbjct: 264 KEIDRKTQSCLMNIINKKTKAMQAGEGS-TDDILGILLESNLKEQLGQGKKN--VGMSIQ 323

Query: 317 EVISECRLFYFAGQETTAALLAWTMVLLGRYSEWQDRARAEVLEVFGDNNKLDFDGLSRL 376
           EV+ EC+ FYFAGQETT+ LL WTMVLL  +  WQ RAR EVL+ FG N K DFD L+ L
Sbjct: 324 EVMGECKQFYFAGQETTSGLLVWTMVLLSIHPNWQARAREEVLQQFG-NAKPDFDNLNHL 383

Query: 377 KIVTMILNEVLRLYPPVGMLARELHNETKLENLTLPVGVSIGIPILCIHRDSKIWGEDAS 436
           KIVTMIL EVLRLYPPV  L R +  ET L ++TLP GV I +PI+ +H D  IWG+DA 
Sbjct: 384 KIVTMILYEVLRLYPPVDTLFRRVDQETTLGDITLPAGVQISLPIMILHHDQNIWGDDAK 443

Query: 437 EFKPERFSEGISKATKNQVCFIPFGWGPRICVGQNFAMIEAKIALSMILQQFSFTISPTY 496
           EF PERFSEG+SKATKNQV F PFGWGPRIC+GQNFA++EAK+AL++ILQ+FSF +SP+Y
Sbjct: 444 EFNPERFSEGVSKATKNQVVFFPFGWGPRICIGQNFALLEAKLALAIILQRFSFELSPSY 503

Query: 497 THAPISNITIQPQHGAHLILRKL 520
           THAP + +T+QPQHGA+LIL KL
Sbjct: 504 THAPTTVLTVQPQHGANLILHKL 520

BLAST of HG10019716 vs. ExPASy Swiss-Prot
Match: H2DH21 (Cytochrome P450 CYP72A219 OS=Panax ginseng OX=4054 PE=2 SV=1)

HSP 1 Score: 569.7 bits (1467), Expect = 3.4e-161
Identity = 271/504 (53.77%), Postives = 367/504 (72.82%), Query Frame = 0

Query: 16  LWGWRIANWVWLRPKRLEKYLRHQGLAGNSYRFLFGDTKDIAAAVRQARSHSMTFSHDIA 75
           L GWRI NWVWLRP++LEKYLR+QG  GNSYR  FGD K++   +++A+S  +    DI 
Sbjct: 19  LLGWRIFNWVWLRPRKLEKYLRNQGFNGNSYRLFFGDVKEMIVMLKEAKSKPINLYDDII 78

Query: 76  PRATPTSHLTIHKYGKDSFTWVGTTPRVYITEPEQVKIVFSQINDIRKTSSFPLRRRIGG 135
           PR  P +   I  YGK+SF W+G  P V+I  P+ +K V S+    +K    PL + +  
Sbjct: 79  PRIIPLNQKIITNYGKNSFLWLGPKPMVHIMNPDHIKDVLSKFYQFQKPRHNPLTKLLAT 138

Query: 136 GLVSLEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSCGEMVNKWEKMIPEEGCCEIDVWP 195
           G+   EG +WAKHRK+INPAFH+EKLK M PA   S  E+V KWE+M+  +G  E+DV P
Sbjct: 139 GVADAEGDRWAKHRKLINPAFHLEKLKNMLPAIYLSSSEIVTKWEEMVSTKGQFELDVLP 198

Query: 196 DLQNMAADVISRTAFGSSFEEGKKIFQLLKEWAKLLMTYLTKRAYYIPGFRYIPTKLNKR 255
            L+ + +DVISRTAFGSS+EEG+KIFQL +E A+L++     +  Y+PG R++PTK NKR
Sbjct: 199 YLETLTSDVISRTAFGSSYEEGRKIFQLQREQAELIIQ--ASQTIYLPGMRFLPTKRNKR 258

Query: 256 MEEIDRKIRDMVWGIISKRQNGMKKGEASNNEDLLRILLESNASQIEEQKIKNKEVGMSI 315
           M+EI ++++  +  II+KR   M+ GE S+++DLL ILLESN+ +I++    N   G+++
Sbjct: 259 MKEIAKEVKIALKSIINKRLKAMEAGERSSHDDLLGILLESNSKEIKQH--GNTNFGLTV 318

Query: 316 EEVISECRLFYFAGQETTAALLAWTMVLLGRYSEWQDRARAEVLEVFGDNNKLDFDGLSR 375
           +EVI EC+LF+FAGQETT+ LL WTM+LL ++ +WQ RA+ EVL  FG NNK DFDGL+ 
Sbjct: 319 DEVIEECKLFFFAGQETTSNLLVWTMILLSQHQDWQKRAKEEVLRTFG-NNKPDFDGLNH 378

Query: 376 LKIVTMILNEVLRLYPPVGMLARELHNETKLENLTLPVGVSIGIPILCIHRDSKIWGEDA 435
           LK+V MIL EVLRLYPP+  L R ++ E KL  ++LP GV + +PI+ +H D +IWG+DA
Sbjct: 379 LKVVNMILLEVLRLYPPILSLDRTIYEEIKLGEISLPAGVILLLPIILLHYDQEIWGDDA 438

Query: 436 SEFKPERFSEGISKATKNQVCFIPFGWGPRICVGQNFAMIEAKIALSMILQQFSFTISPT 495
            EF PERFSEG+ KATK +V + PF WGPRIC+GQNFAM+EAK+A++MILQ+FSF +SP+
Sbjct: 439 KEFNPERFSEGVLKATKGRVTYFPFSWGPRICIGQNFAMLEAKMAMAMILQRFSFVLSPS 498

Query: 496 YTHAPISNITIQPQHGAHLILRKL 520
           Y HAP + IT+QPQ+GAHLIL  L
Sbjct: 499 YAHAPHAIITLQPQYGAHLILHSL 517

BLAST of HG10019716 vs. ExPASy Swiss-Prot
Match: Q9LUC6 (Cytochrome P450 72A14 OS=Arabidopsis thaliana OX=3702 GN=CYP72A14 PE=2 SV=1)

HSP 1 Score: 556.6 bits (1433), Expect = 3.0e-157
Identity = 270/504 (53.57%), Postives = 361/504 (71.63%), Query Frame = 0

Query: 17  WGWRIANWVWLRPKRLEKYLRHQGLAGNSYRFLFGDTKDIAAAVRQARSHSMTFSHDIAP 76
           W WR   WVW  PK LE+ LR QGL+G SY  L GD K + +   +A S  +  + DI P
Sbjct: 20  WVWRTLKWVWFTPKMLERSLRRQGLSGTSYTPLIGDFKKMISMFIEATSKPIKPTDDITP 79

Query: 77  RATPTSHLTIHKYGKDSFTWVGTTPRVYITEPEQVKIVFSQINDIRKTSSFPLRRRIGGG 136
           R  P     +  +G+ + TW G  P + I +PEQ+K VF+++ D +K  +FPL + +G G
Sbjct: 80  RVMPHPLQMLKTHGRTNLTWFGPIPTITIMDPEQIKEVFNKVYDFQKAHTFPLSKILGTG 139

Query: 137 LVSLEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSCGEMVNKWEKMIPEEG-CCEIDVWP 196
           LVS +G KWA+HR+IINPAFH+EK+K M   F +SC E+V +W+K++ ++G  CE+DVWP
Sbjct: 140 LVSYDGDKWAQHRRIINPAFHLEKIKNMVHVFHESCSELVGEWDKLVSDKGSSCEVDVWP 199

Query: 197 DLQNMAADVISRTAFGSSFEEGKKIFQLLKEWAKLLMTYLTKRAYYIPGFRYIPTKLNKR 256
            L +M ADVISRTAFGSS+ EG +IF+L  E A+L+M    K  ++IPG+ Y+PTK N+R
Sbjct: 200 GLTSMTADVISRTAFGSSYREGHRIFELQAELAQLVMQAFQK--FFIPGYIYLPTKGNRR 259

Query: 257 MEEIDRKIRDMVWGIISKRQNGMKKGEASNNEDLLRILLESNASQIEEQKIKNKEVGMSI 316
           M+   R+I+D++ GII+KR+   + GEA  +EDLL ILLESN  Q E         GMS 
Sbjct: 260 MKTAAREIQDILRGIINKRERARESGEAP-SEDLLGILLESNLGQTEGN-------GMST 319

Query: 317 EEVISECRLFYFAGQETTAALLAWTMVLLGRYSEWQDRARAEVLEVFGDNNKLDFDGLSR 376
           E+++ EC+LFY AGQETT+ LL WTMVLL ++ +WQ RAR EV +VFGD    D +GL++
Sbjct: 320 EDMMEECKLFYLAGQETTSVLLVWTMVLLSQHQDWQARAREEVKQVFGDKQP-DTEGLNQ 379

Query: 377 LKIVTMILNEVLRLYPPVGMLARELHNETKLENLTLPVGVSIGIPILCIHRDSKIWGEDA 436
           LK++TMIL EVLRLYPPV  L R +H E KL +LTLP GV I +P+L +HRD+++WG DA
Sbjct: 380 LKVMTMILYEVLRLYPPVVQLTRAIHKEMKLGDLTLPGGVQISLPVLLVHRDTELWGNDA 439

Query: 437 SEFKPERFSEGISKATKNQVCFIPFGWGPRICVGQNFAMIEAKIALSMILQQFSFTISPT 496
            EFKPERF +G+SKATKNQV F PF WGPRIC+GQNF ++EAK+A+S+ILQ+FSF +SP+
Sbjct: 440 GEFKPERFKDGLSKATKNQVSFFPFAWGPRICIGQNFTLLEAKMAMSLILQRFSFELSPS 499

Query: 497 YTHAPISNITIQPQHGAHLILRKL 520
           Y HAP + IT+ PQ GAHL+L KL
Sbjct: 500 YVHAPYTIITLYPQFGAHLMLHKL 512

BLAST of HG10019716 vs. ExPASy Swiss-Prot
Match: Q2MJ19 (Cytochrome P450 72A68 OS=Medicago truncatula OX=3880 GN=CYP72A68 PE=1 SV=1)

HSP 1 Score: 552.0 bits (1421), Expect = 7.3e-156
Identity = 267/505 (52.87%), Postives = 368/505 (72.87%), Query Frame = 0

Query: 16  LWGWRIANWVWLRPKRLEKYLRHQGLAGNSYRFLFGDTKDIAAAVRQARSHSMTFSHDIA 75
           ++ WR+ NW+WL+PK++EK LR QGL GN YR L GD KD     ++ +S  M  S DIA
Sbjct: 21  VYAWRVLNWMWLKPKKIEKLLREQGLQGNPYRLLLGDAKDYFVMQKKVQSKPMNLSDDIA 80

Query: 76  PRATPTSHLTIHKYGKDSFTWVGTTPRVYITEPEQVKIVFSQINDIRKTSSFPLRRRIGG 135
           PR  P  H  +  +GK SF W G  P V + EPEQ++ VF+++++  K   +   + I  
Sbjct: 81  PRVAPYIHHAVQTHGKKSFIWFGMKPWVILNEPEQIREVFNKMSEFPKV-QYKFMKLITR 140

Query: 136 GLVSLEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSCGEMVNKWEKMIPEEGCCEIDVWP 195
           GLV LEG KW+KHR+IINPAFHMEKLK M P F KSC ++++ WEKM+   G CE+DVWP
Sbjct: 141 GLVKLEGEKWSKHRRIINPAFHMEKLKIMTPTFLKSCNDLISNWEKMLSSNGSCEMDVWP 200

Query: 196 DLQNMAADVISRTAFGSSFEEGKKIFQLLKEWAKLLMTYLTKRAYYIPGFRYIPTKLNKR 255
            LQ++ +DVI+R++FGSS+EEG+K+FQL  E  +L+M  L K    IP +R++PT  +++
Sbjct: 201 SLQSLTSDVIARSSFGSSYEEGRKVFQLQIEQGELIMKNLMKS--LIPLWRFLPTADHRK 260

Query: 256 MEEIDRKIRDMVWGIISKRQNGMKKGEASNNEDLLRILLESNASQIEEQ-KIKNKEVGMS 315
           + E +++I   +  II+KR+  +K GEA+ N DLL +LLESN  +I+E   +KN  +G+S
Sbjct: 261 INENEKQIETTLKNIINKREKAIKAGEATEN-DLLGLLLESNHREIKEHGNVKN--MGLS 320

Query: 316 IEEVISECRLFYFAGQETTAALLAWTMVLLGRYSEWQDRARAEVLEVFGDNNKLDFDGLS 375
           +EEV+ ECRLF+ AGQETT+ LL WTMVLL RY +WQ+RAR EVLE+FG N K DFDGL+
Sbjct: 321 LEEVVGECRLFHVAGQETTSDLLVWTMVLLSRYPDWQERARKEVLEIFG-NEKPDFDGLN 380

Query: 376 RLKIVTMILNEVLRLYPPVGMLARELHNETKLENLTLPVGVSIGIPILCIHRDSKIWGED 435
           +LKI+ MIL EVLRLYPPV  +AR++ N+ KL +LTL  G+ + +PI+ IH D ++WG+D
Sbjct: 381 KLKIMAMILYEVLRLYPPVTGVARKVENDIKLGDLTLYAGMEVYMPIVLIHHDCELWGDD 440

Query: 436 ASEFKPERFSEGISKATKNQVCFIPFGWGPRICVGQNFAMIEAKIALSMILQQFSFTISP 495
           A  F PERFS GISKAT  +  + PFG GPRIC+GQNF+++EAK+A+++IL+ FSF +S 
Sbjct: 441 AKIFNPERFSGGISKATNGRFSYFPFGAGPRICIGQNFSLLEAKMAMALILKNFSFELSQ 500

Query: 496 TYTHAPISNITIQPQHGAHLILRKL 520
           TY HAP   +++QPQHGAH+ILRK+
Sbjct: 501 TYAHAPSVVLSVQPQHGAHVILRKI 518

BLAST of HG10019716 vs. ExPASy TrEMBL
Match: A0A1S3B801 (cytochrome P450 CYP72A219-like OS=Cucumis melo OX=3656 GN=LOC103486999 PE=3 SV=1)

HSP 1 Score: 924.1 bits (2387), Expect = 2.6e-265
Identity = 457/521 (87.72%), Postives = 485/521 (93.09%), Query Frame = 0

Query: 2   WNWIWVVGLLW---LLGLWGWRIANWVWLRPKRLEKYLRHQGLAGNSYRFLFGDTKDIAA 61
           +NWIW VGLLW   LLGLWGWRI NWVWLRPKRLEK LR QGLAGNSYRFLFGDTK+I  
Sbjct: 6   YNWIWGVGLLWFLGLLGLWGWRIVNWVWLRPKRLEKLLRQQGLAGNSYRFLFGDTKEIGV 65

Query: 62  AVRQARSHSMTFSHDIAPRATPTSHLTIHKYGKDSFTWVGTTPRVYITEPEQVKIVFSQI 121
           AVRQARS SMTFSHDIA RATP+S+ TIHKYGKDSFTW+GTTPRVYITEPEQVKIVFSQI
Sbjct: 66  AVRQARSQSMTFSHDIASRATPSSYPTIHKYGKDSFTWIGTTPRVYITEPEQVKIVFSQI 125

Query: 122 NDIRKTSSFPLRRRIGGGLVSLEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSCGEMVNK 181
           NDIRKTSSFPLRRR+G GLV+LEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSC EMVNK
Sbjct: 126 NDIRKTSSFPLRRRMGSGLVTLEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSCREMVNK 185

Query: 182 WEKMIPEEGCCEIDVWPDLQNMAADVISRTAFGSSFEEGKKIFQLLKEWAKLLMTYLTKR 241
           WEKMI E+G  EIDVWPDLQN+AADVISRTAFGSS++EGKKIFQLLKEWA LLM+YLTKR
Sbjct: 186 WEKMISEKGSGEIDVWPDLQNLAADVISRTAFGSSYDEGKKIFQLLKEWAVLLMSYLTKR 245

Query: 242 AYYIPGFRYIPTKLNKRMEEIDRKIRDMVWGIISKRQNGMKKGEASNNEDLLRILLESNA 301
           AYYIPG RYIPTKLNKRM+EID KIRD+V GII+KRQN MKKGEAS NEDLL ILLESN 
Sbjct: 246 AYYIPGARYIPTKLNKRMQEIDMKIRDLVRGIINKRQNAMKKGEASKNEDLLGILLESNE 305

Query: 302 SQIEEQKIKNKEVGMSIEEVISECRLFYFAGQETTAALLAWTMVLLGRYSEWQDRARAEV 361
           +QIEEQK K K+VGMSIEEVISECRLFYFAGQETTA LLAWTMVLLGRYSEWQDRARAEV
Sbjct: 306 TQIEEQKNK-KDVGMSIEEVISECRLFYFAGQETTAVLLAWTMVLLGRYSEWQDRARAEV 365

Query: 362 LEVFGDNNKLDFDGLSRLKIVTMILNEVLRLYPPVGMLARELHNETKLENLTLPVGVSIG 421
           LEVFG+N KLDFDGLSRL++V+MILNEVLRLYPPVGMLARE+HNETKL NLTLP GVSIG
Sbjct: 366 LEVFGNNKKLDFDGLSRLRVVSMILNEVLRLYPPVGMLAREVHNETKLGNLTLPSGVSIG 425

Query: 422 IPILCIHRDSKIWGEDASEFKPERFSEGISKATKNQVCFIPFGWGPRICVGQNFAMIEAK 481
           IPIL +H++ KIWGED  EF PERFSEGISKATKNQVCFIPFGWGPRIC+GQNFAMIEAK
Sbjct: 426 IPILSMHQNPKIWGEDVLEFNPERFSEGISKATKNQVCFIPFGWGPRICIGQNFAMIEAK 485

Query: 482 IALSMILQQFSFTISPTYTHAPISNITIQPQHGAHLILRKL 520
           IALSMILQQFSFT+SPTYTHAPI+NITIQPQHGAHLILRKL
Sbjct: 486 IALSMILQQFSFTLSPTYTHAPITNITIQPQHGAHLILRKL 525

BLAST of HG10019716 vs. ExPASy TrEMBL
Match: A0A5D3DQ19 (Cytochrome P450 CYP72A219-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G007980 PE=3 SV=1)

HSP 1 Score: 898.3 bits (2320), Expect = 1.5e-257
Identity = 445/519 (85.74%), Postives = 475/519 (91.52%), Query Frame = 0

Query: 2   WNWIWVVGLLWLLGLWGWRIANWVWLRPKRLEKYLRHQGLAGNSYRFLFGDTKDIAAAVR 61
           WNWIWVVGLLW+LGLWGWRI NWVW RPKR+EK LR QGLAGNSYRFLFGDTK+I AAVR
Sbjct: 3   WNWIWVVGLLWVLGLWGWRIVNWVWFRPKRVEKLLRQQGLAGNSYRFLFGDTKEITAAVR 62

Query: 62  QAR-SHSMTFSHDIAPRATPTSHLTIHKYGKDSFTWVGTTPRVYITEPEQVKIVFSQIND 121
           +AR S  M+FSH I PR TP S+ TIHKYGKDSFTW+GTTPRVYITEPEQVKIVFSQIND
Sbjct: 63  RARTSQPMSFSHHIDPRTTPYSYPTIHKYGKDSFTWIGTTPRVYITEPEQVKIVFSQIND 122

Query: 122 IRKTSSFPLRRRIGGGLVSLEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSCGEMVNKWE 181
           IRKTSSFPLRRR+G GLV+LEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSC EMVNKWE
Sbjct: 123 IRKTSSFPLRRRMGSGLVTLEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSCREMVNKWE 182

Query: 182 KMIPEEGCCEIDVWPDLQNMAADVISRTAFGSSFEEGKKIFQLLKEWAKLLMTYLTKRAY 241
           KMI E+G  EIDVWPDLQN+AADVISRTAFGSS++EGKKIFQLLKEWA LLM+YLTKRAY
Sbjct: 183 KMISEKGSGEIDVWPDLQNLAADVISRTAFGSSYDEGKKIFQLLKEWAVLLMSYLTKRAY 242

Query: 242 YIPGFRYIPTKLNKRMEEIDRKIRDMVWGIISKRQNGMKKGEASNNEDLLRILLESNASQ 301
           YIPG RY     N  M+EID KIRD+V GII+KRQN MKKGEAS NEDLL ILLESN +Q
Sbjct: 243 YIPGARYD----NSMMQEIDMKIRDLVRGIINKRQNAMKKGEASKNEDLLGILLESNETQ 302

Query: 302 IEEQKIKNKEVGMSIEEVISECRLFYFAGQETTAALLAWTMVLLGRYSEWQDRARAEVLE 361
           IEEQK K K+VGMSIEEVISECRLFYFAGQETTA LLAWTMVLLGRYSEWQDRARAEVLE
Sbjct: 303 IEEQKNK-KDVGMSIEEVISECRLFYFAGQETTAVLLAWTMVLLGRYSEWQDRARAEVLE 362

Query: 362 VFGDNNKLDFDGLSRLKIVTMILNEVLRLYPPVGMLARELHNETKLENLTLPVGVSIGIP 421
           VFG+N KLDFDGLSRL++V+MILNEVLRLYPPVGMLARE+HNETKL NLTLP GVSIGIP
Sbjct: 363 VFGNNKKLDFDGLSRLRVVSMILNEVLRLYPPVGMLAREVHNETKLGNLTLPSGVSIGIP 422

Query: 422 ILCIHRDSKIWGEDASEFKPERFSEGISKATKNQVCFIPFGWGPRICVGQNFAMIEAKIA 481
           IL +H++ KIWGED  EF PERFSEGISKATKNQVCFIPFGWGPRIC+GQNFAMIEAKIA
Sbjct: 423 ILSMHQNPKIWGEDVLEFNPERFSEGISKATKNQVCFIPFGWGPRICIGQNFAMIEAKIA 482

Query: 482 LSMILQQFSFTISPTYTHAPISNITIQPQHGAHLILRKL 520
           LSMILQQFSFT+SPTYTHAPI+NITIQPQHGAHLILRKL
Sbjct: 483 LSMILQQFSFTLSPTYTHAPITNITIQPQHGAHLILRKL 516

BLAST of HG10019716 vs. ExPASy TrEMBL
Match: A0A6J1KZI7 (cytochrome P450 CYP72A219-like OS=Cucurbita maxima OX=3661 GN=LOC111499007 PE=3 SV=1)

HSP 1 Score: 891.3 bits (2302), Expect = 1.9e-255
Identity = 429/519 (82.66%), Postives = 477/519 (91.91%), Query Frame = 0

Query: 1   MWNWIWVVGLLWLLGLWGWRIANWVWLRPKRLEKYLRHQGLAGNSYRFLFGDTKDIAAAV 60
           MWNWIW VG L LLG+WGW+I NW+W+ PKRLEK LR QGLAGNSYR LFGD+KD AAAV
Sbjct: 1   MWNWIWAVG-LGLLGVWGWKILNWMWVTPKRLEKCLRQQGLAGNSYRLLFGDSKDTAAAV 60

Query: 61  RQARSHSMTFSHDIAPRATPTSHLTIHKYGKDSFTWVGTTPRVYITEPEQVKIVFSQIND 120
           RQAR+  ++FSH IA RATP+SH+TI +YGKDSFTW+G +PRVYIT+PEQVK+VFSQIND
Sbjct: 61  RQARTQPISFSHRIAARATPSSHITIDQYGKDSFTWIGPSPRVYITQPEQVKMVFSQIND 120

Query: 121 IRKTSSFPLRRRIGGGLVSLEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSCGEMVNKWE 180
           IRK +SFP RRRIGGG+VSLEG+KWAKHRKIINPAFHMEKLK+MFPAFSKSC EM+NKWE
Sbjct: 121 IRKATSFPFRRRIGGGVVSLEGAKWAKHRKIINPAFHMEKLKDMFPAFSKSCSEMINKWE 180

Query: 181 KMIPEEGCCEIDVWPDLQNMAADVISRTAFGSSFEEGKKIFQLLKEWAKLLMTYLTKRAY 240
            MI +EGCCE+DVWPDLQNMAADVISRTAFGSS+EEGKKIFQLLK+WA LLM Y+ K  Y
Sbjct: 181 MMITKEGCCELDVWPDLQNMAADVISRTAFGSSYEEGKKIFQLLKQWASLLMAYVMKGVY 240

Query: 241 YIPGFRYIPTKLNKRMEEIDRKIRDMVWGIISKRQNGMKKGEASNNEDLLRILLESNASQ 300
           ++PG R++PTKLNKRM+EI+ KIRDM+WGIISKRQN MKKGEAS N+DLL ILLESN+ +
Sbjct: 241 FLPGLRFLPTKLNKRMDEINGKIRDMIWGIISKRQNAMKKGEAS-NDDLLGILLESNSRE 300

Query: 301 IEEQKIKNKEVGMSIEEVISECRLFYFAGQETTAALLAWTMVLLGRYSEWQDRARAEVLE 360
           I+E K K K+VGMSIEEVI+ECRLFYFAGQETTAALLAWTMVLLGRYSEWQDRARAEVLE
Sbjct: 301 IQEHKNK-KDVGMSIEEVIAECRLFYFAGQETTAALLAWTMVLLGRYSEWQDRARAEVLE 360

Query: 361 VFGDNNKLDFDGLSRLKIVTMILNEVLRLYPPVGMLARELHNETKLENLTLPVGVSIGIP 420
           VFGDN KLDFDGLSRLK+VTMILNEVLRLYPPVGMLARE+H ETKL NLTLPVGVSIGIP
Sbjct: 361 VFGDNKKLDFDGLSRLKVVTMILNEVLRLYPPVGMLAREIHKETKLGNLTLPVGVSIGIP 420

Query: 421 ILCIHRDSKIWGEDASEFKPERFSEGISKATKNQVCFIPFGWGPRICVGQNFAMIEAKIA 480
           I+CIH+D  IWGEDASEFKPERF+EGISKATKNQVCFIPFGWGPRIC+GQNFAMIEAKIA
Sbjct: 421 IVCIHQDPTIWGEDASEFKPERFAEGISKATKNQVCFIPFGWGPRICLGQNFAMIEAKIA 480

Query: 481 LSMILQQFSFTISPTYTHAPISNITIQPQHGAHLILRKL 520
           LSMILQ+FSF +SP+YTHAPISN+TIQPQHGAHLIL  L
Sbjct: 481 LSMILQRFSFELSPSYTHAPISNVTIQPQHGAHLILHML 516

BLAST of HG10019716 vs. ExPASy TrEMBL
Match: A0A6J1EFS6 (cytochrome P450 CYP72A219-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111432119 PE=3 SV=1)

HSP 1 Score: 886.3 bits (2289), Expect = 6.0e-254
Identity = 423/519 (81.50%), Postives = 475/519 (91.52%), Query Frame = 0

Query: 1   MWNWIWVVGLLWLLGLWGWRIANWVWLRPKRLEKYLRHQGLAGNSYRFLFGDTKDIAAAV 60
           MWNWIW VG L LLG+WGW+I NW+WL PKRLEK LRHQGLAGNSYR LFGD+KD AAAV
Sbjct: 23  MWNWIWAVG-LGLLGVWGWKILNWMWLTPKRLEKCLRHQGLAGNSYRLLFGDSKDTAAAV 82

Query: 61  RQARSHSMTFSHDIAPRATPTSHLTIHKYGKDSFTWVGTTPRVYITEPEQVKIVFSQIND 120
           RQA++  ++FSH IA RATP+SH+TIH+YGKDSFTW+G TPRVYIT PEQVK+VFSQIND
Sbjct: 83  RQAKTQPISFSHRIAARATPSSHITIHRYGKDSFTWIGPTPRVYITHPEQVKVVFSQIND 142

Query: 121 IRKTSSFPLRRRIGGGLVSLEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSCGEMVNKWE 180
           IRK + FP RRR+GGG+VSLEG+KWAKHRKIINPAFHMEKLK+MFPAFS+SC EM+NKWE
Sbjct: 143 IRKATLFPFRRRMGGGVVSLEGAKWAKHRKIINPAFHMEKLKDMFPAFSQSCSEMINKWE 202

Query: 181 KMIPEEGCCEIDVWPDLQNMAADVISRTAFGSSFEEGKKIFQLLKEWAKLLMTYLTKRAY 240
            MI +EGCCE+DVWPDLQNMAADVISRTAFGSS+EEGKKIFQLLK+WA LLM Y+ K  Y
Sbjct: 203 MMITKEGCCELDVWPDLQNMAADVISRTAFGSSYEEGKKIFQLLKQWASLLMAYVMKGVY 262

Query: 241 YIPGFRYIPTKLNKRMEEIDRKIRDMVWGIISKRQNGMKKGEASNNEDLLRILLESNASQ 300
           +IPG R++PTKLNKRM+EI+  IRDM+WGII+KRQN MKKGEAS N+DLL ILLESN+ +
Sbjct: 263 FIPGLRFLPTKLNKRMDEINGNIRDMIWGIINKRQNAMKKGEAS-NDDLLGILLESNSRE 322

Query: 301 IEEQKIKNKEVGMSIEEVISECRLFYFAGQETTAALLAWTMVLLGRYSEWQDRARAEVLE 360
           I+E K K K+VGMS+EEVI+ECRLFYFAGQETTAALLAWTMVLLGRYSEWQDRARAEVLE
Sbjct: 323 IQEHKNK-KDVGMSMEEVIAECRLFYFAGQETTAALLAWTMVLLGRYSEWQDRARAEVLE 382

Query: 361 VFGDNNKLDFDGLSRLKIVTMILNEVLRLYPPVGMLARELHNETKLENLTLPVGVSIGIP 420
           +FGDN K DFDGLSRLKIVTMILNEVLRLYPPVG+L+RE+H ETKL NLTLPVGVSIGIP
Sbjct: 383 IFGDNKKFDFDGLSRLKIVTMILNEVLRLYPPVGLLSREIHKETKLGNLTLPVGVSIGIP 442

Query: 421 ILCIHRDSKIWGEDASEFKPERFSEGISKATKNQVCFIPFGWGPRICVGQNFAMIEAKIA 480
           I+CIH+D  +WGEDASEFKPERF EGISKATKNQVCFIPFGWGPRIC+GQNFAMIEAKIA
Sbjct: 443 IVCIHQDPTLWGEDASEFKPERFVEGISKATKNQVCFIPFGWGPRICLGQNFAMIEAKIA 502

Query: 481 LSMILQQFSFTISPTYTHAPISNITIQPQHGAHLILRKL 520
           LSMILQ+FSF +SP+YTHAPI+N+TIQPQHGAHLILR L
Sbjct: 503 LSMILQRFSFELSPSYTHAPITNVTIQPQHGAHLILRML 538

BLAST of HG10019716 vs. ExPASy TrEMBL
Match: A0A6J1E9Z3 (cytochrome P450 CYP72A219-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111432119 PE=3 SV=1)

HSP 1 Score: 883.6 bits (2282), Expect = 3.9e-253
Identity = 422/519 (81.31%), Postives = 474/519 (91.33%), Query Frame = 0

Query: 1   MWNWIWVVGLLWLLGLWGWRIANWVWLRPKRLEKYLRHQGLAGNSYRFLFGDTKDIAAAV 60
           MWNWIW VG L LLG+WGW+I NW+WL PKRLEK LRHQGLAGNSYR LFGD+KD AAAV
Sbjct: 23  MWNWIWAVG-LGLLGVWGWKILNWMWLTPKRLEKCLRHQGLAGNSYRLLFGDSKDTAAAV 82

Query: 61  RQARSHSMTFSHDIAPRATPTSHLTIHKYGKDSFTWVGTTPRVYITEPEQVKIVFSQIND 120
           RQA++  ++FSH IA RATP+SH+TIH+YGKDSFTW+G TPRVYIT PEQVK+VFSQIND
Sbjct: 83  RQAKTQPISFSHRIAARATPSSHITIHRYGKDSFTWIGPTPRVYITHPEQVKVVFSQIND 142

Query: 121 IRKTSSFPLRRRIGGGLVSLEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSCGEMVNKWE 180
           IRK + FP RRR+GGG+VSLEG+KWAKHRKIINPAFHMEKLK+MFPAFS+SC EM+NKWE
Sbjct: 143 IRKATLFPFRRRMGGGVVSLEGAKWAKHRKIINPAFHMEKLKDMFPAFSQSCSEMINKWE 202

Query: 181 KMIPEEGCCEIDVWPDLQNMAADVISRTAFGSSFEEGKKIFQLLKEWAKLLMTYLTKRAY 240
            MI +EGCCE+DVWPDLQNMAADVISRTAFGSS+EEGKKIFQLLK+WA LLM Y+ K  Y
Sbjct: 203 MMITKEGCCELDVWPDLQNMAADVISRTAFGSSYEEGKKIFQLLKQWASLLMAYVMKGVY 262

Query: 241 YIPGFRYIPTKLNKRMEEIDRKIRDMVWGIISKRQNGMKKGEASNNEDLLRILLESNASQ 300
           +IPG R++PTKLNKRM+EI+  IRDM+WGII+KRQN MKKGEAS N+DLL ILLESN+ +
Sbjct: 263 FIPGLRFLPTKLNKRMDEINGNIRDMIWGIINKRQNAMKKGEAS-NDDLLGILLESNSRE 322

Query: 301 IEEQKIKNKEVGMSIEEVISECRLFYFAGQETTAALLAWTMVLLGRYSEWQDRARAEVLE 360
           I+E K K K+VGMS+EEVI+ECRLFYFAGQETTAALLAWTMVLLGRYSEWQDRARAEVLE
Sbjct: 323 IQEHKNK-KDVGMSMEEVIAECRLFYFAGQETTAALLAWTMVLLGRYSEWQDRARAEVLE 382

Query: 361 VFGDNNKLDFDGLSRLKIVTMILNEVLRLYPPVGMLARELHNETKLENLTLPVGVSIGIP 420
           +FGDN K DFDGLSRLKIVTMILNEVLRLYPPVG+L+RE+H ETKL NLTLPVGVSIGIP
Sbjct: 383 IFGDNKKFDFDGLSRLKIVTMILNEVLRLYPPVGLLSREIHKETKLGNLTLPVGVSIGIP 442

Query: 421 ILCIHRDSKIWGEDASEFKPERFSEGISKATKNQVCFIPFGWGPRICVGQNFAMIEAKIA 480
           I+CIH+D  +WGEDASEFKPERF EGISKATKNQVCFIPFGWGPRIC+GQNFAMIEAKIA
Sbjct: 443 IVCIHQDPTLWGEDASEFKPERFVEGISKATKNQVCFIPFGWGPRICLGQNFAMIEAKIA 502

Query: 481 LSMILQQFSFTISPTYTHAPISNITIQPQHGAHLILRKL 520
           LSMILQ+FSF +SP+YTHAPI+N+TIQPQHGAHLIL  L
Sbjct: 503 LSMILQRFSFELSPSYTHAPITNVTIQPQHGAHLILPML 538

BLAST of HG10019716 vs. TAIR 10
Match: AT3G14610.1 (cytochrome P450, family 72, subfamily A, polypeptide 7 )

HSP 1 Score: 566.2 bits (1458), Expect = 2.7e-161
Identity = 274/511 (53.62%), Postives = 366/511 (71.62%), Query Frame = 0

Query: 10  LLWLLGLWGWRIANWVWLRPKRLEKYLRHQGLAGNSYRFLFGDTKDIAAAVRQARSHSMT 69
           L+ ++ LW WRI  WVW++PK LE  L+ QGL G  Y  L GD K     + +ARS  + 
Sbjct: 12  LVAVVVLWTWRIVKWVWIKPKMLESSLKRQGLTGTPYTPLVGDIKRNVDMMMEARSKPIN 71

Query: 70  FSHDIAPRATPTSHLTIHKYGKDSFTWVGTTPRVYITEPEQVKIVFSQINDIRKTSSFPL 129
            + DI PR  P +   ++ +GK  F W+G  P + IT PEQ+K VF+++ND  K S+FPL
Sbjct: 72  VTDDITPRLLPLALKMLNSHGKTFFIWIGPLPTIVITNPEQIKEVFNKVNDFEKASTFPL 131

Query: 130 RRRIGGGLVSLEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSCGEMVNKWEKMIPE-EGC 189
            R + GGL S +G KWA HR+IINPAFH+EK+K M PAF   C E+V +WEK+  + E  
Sbjct: 132 IRLLAGGLASYKGDKWASHRRIINPAFHLEKIKNMIPAFYHCCSEVVCQWEKLFTDKESP 191

Query: 190 CEIDVWPDLQNMAADVISRTAFGSSFEEGKKIFQLLKEWAKLLMTYLTKRAYYIPGFRYI 249
            E+DVWP L NM ADVIS TAFGSS++EG++IFQL  E A+L+     K   YIPG R+ 
Sbjct: 192 LEVDVWPWLVNMTADVISHTAFGSSYKEGQRIFQLQGELAELIAQAFKKS--YIPGSRFY 251

Query: 250 PTKLNKRMEEIDRKIRDMVWGIISKRQNGMKKGEASNNEDLLRILLESNASQIEEQKIKN 309
           PTK N+RM+ IDR++  ++ GI+SKR+   + GE + N+DLL ILLESN+ + +      
Sbjct: 252 PTKSNRRMKAIDREVDVILRGIVSKREKAREAGEPA-NDDLLGILLESNSEESQGN---- 311

Query: 310 KEVGMSIEEVISECRLFYFAGQETTAALLAWTMVLLGRYSEWQDRARAEVLEVFGDNNKL 369
              GMS+E+V+ EC+LFYFAGQETT+ LL WTMVLL  + +WQ RAR EV++V G+NNK 
Sbjct: 312 ---GMSVEDVMKECKLFYFAGQETTSVLLVWTMVLLSHHQDWQARAREEVMQVLGENNKP 371

Query: 370 DFDGLSRLKIVTMILNEVLRLYPPVGMLARELHNETKLENLTLPVGVSIGIPILCIHRDS 429
           D + L+ LK++TMI NEVLRLYPPV  L R ++ E KL  LTLP G+ I +P + + RD+
Sbjct: 372 DMESLNNLKVMTMIFNEVLRLYPPVAQLKRVVNKEMKLGELTLPAGIQIYLPTILVQRDT 431

Query: 430 KIWGEDASEFKPERFSEGISKATKNQVCFIPFGWGPRICVGQNFAMIEAKIALSMILQQF 489
           ++WG+DA++FKPERF +G+SKATKNQV F PFGWGPRIC+GQNFAM+EAK+A+++ILQ+F
Sbjct: 432 ELWGDDAADFKPERFRDGLSKATKNQVSFFPFGWGPRICIGQNFAMLEAKMAMALILQKF 491

Query: 490 SFTISPTYTHAPISNITIQPQHGAHLILRKL 520
           SF +SP+Y HAP + +T +PQ GAHLIL KL
Sbjct: 492 SFELSPSYVHAPQTVMTTRPQFGAHLILHKL 512

BLAST of HG10019716 vs. TAIR 10
Match: AT3G14680.1 (cytochrome P450, family 72, subfamily A, polypeptide 14 )

HSP 1 Score: 556.6 bits (1433), Expect = 2.1e-158
Identity = 270/504 (53.57%), Postives = 361/504 (71.63%), Query Frame = 0

Query: 17  WGWRIANWVWLRPKRLEKYLRHQGLAGNSYRFLFGDTKDIAAAVRQARSHSMTFSHDIAP 76
           W WR   WVW  PK LE+ LR QGL+G SY  L GD K + +   +A S  +  + DI P
Sbjct: 20  WVWRTLKWVWFTPKMLERSLRRQGLSGTSYTPLIGDFKKMISMFIEATSKPIKPTDDITP 79

Query: 77  RATPTSHLTIHKYGKDSFTWVGTTPRVYITEPEQVKIVFSQINDIRKTSSFPLRRRIGGG 136
           R  P     +  +G+ + TW G  P + I +PEQ+K VF+++ D +K  +FPL + +G G
Sbjct: 80  RVMPHPLQMLKTHGRTNLTWFGPIPTITIMDPEQIKEVFNKVYDFQKAHTFPLSKILGTG 139

Query: 137 LVSLEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSCGEMVNKWEKMIPEEG-CCEIDVWP 196
           LVS +G KWA+HR+IINPAFH+EK+K M   F +SC E+V +W+K++ ++G  CE+DVWP
Sbjct: 140 LVSYDGDKWAQHRRIINPAFHLEKIKNMVHVFHESCSELVGEWDKLVSDKGSSCEVDVWP 199

Query: 197 DLQNMAADVISRTAFGSSFEEGKKIFQLLKEWAKLLMTYLTKRAYYIPGFRYIPTKLNKR 256
            L +M ADVISRTAFGSS+ EG +IF+L  E A+L+M    K  ++IPG+ Y+PTK N+R
Sbjct: 200 GLTSMTADVISRTAFGSSYREGHRIFELQAELAQLVMQAFQK--FFIPGYIYLPTKGNRR 259

Query: 257 MEEIDRKIRDMVWGIISKRQNGMKKGEASNNEDLLRILLESNASQIEEQKIKNKEVGMSI 316
           M+   R+I+D++ GII+KR+   + GEA  +EDLL ILLESN  Q E         GMS 
Sbjct: 260 MKTAAREIQDILRGIINKRERARESGEAP-SEDLLGILLESNLGQTEGN-------GMST 319

Query: 317 EEVISECRLFYFAGQETTAALLAWTMVLLGRYSEWQDRARAEVLEVFGDNNKLDFDGLSR 376
           E+++ EC+LFY AGQETT+ LL WTMVLL ++ +WQ RAR EV +VFGD    D +GL++
Sbjct: 320 EDMMEECKLFYLAGQETTSVLLVWTMVLLSQHQDWQARAREEVKQVFGDKQP-DTEGLNQ 379

Query: 377 LKIVTMILNEVLRLYPPVGMLARELHNETKLENLTLPVGVSIGIPILCIHRDSKIWGEDA 436
           LK++TMIL EVLRLYPPV  L R +H E KL +LTLP GV I +P+L +HRD+++WG DA
Sbjct: 380 LKVMTMILYEVLRLYPPVVQLTRAIHKEMKLGDLTLPGGVQISLPVLLVHRDTELWGNDA 439

Query: 437 SEFKPERFSEGISKATKNQVCFIPFGWGPRICVGQNFAMIEAKIALSMILQQFSFTISPT 496
            EFKPERF +G+SKATKNQV F PF WGPRIC+GQNF ++EAK+A+S+ILQ+FSF +SP+
Sbjct: 440 GEFKPERFKDGLSKATKNQVSFFPFAWGPRICIGQNFTLLEAKMAMSLILQRFSFELSPS 499

Query: 497 YTHAPISNITIQPQHGAHLILRKL 520
           Y HAP + IT+ PQ GAHL+L KL
Sbjct: 500 YVHAPYTIITLYPQFGAHLMLHKL 512

BLAST of HG10019716 vs. TAIR 10
Match: AT3G14630.1 (cytochrome P450, family 72, subfamily A, polypeptide 9 )

HSP 1 Score: 548.1 bits (1411), Expect = 7.5e-156
Identity = 272/517 (52.61%), Postives = 369/517 (71.37%), Query Frame = 0

Query: 5   IWVVGLLWLLGLWG-WRIANWVWLRPKRLEKYLRHQGLAGNSYRFLFGDTKDIAAAVRQA 64
           I +  L  ++ LW  WRI  WVWL+PK LE YLR QGL G  Y  L GD +   + +++A
Sbjct: 3   IVIASLALVVVLWCIWRILEWVWLKPKMLESYLRRQGLVGTRYTPLVGDVRRSFSMLKEA 62

Query: 65  RSHSMTFSHDIAPRATPTSHLTIHKYGKDSFTWVGTTPRVYITEPEQVKIVFSQINDIRK 124
           RS  M  + D+     P S   ++ YGK  FTW G  P + I  P+ +K V+++  D  K
Sbjct: 63  RSKPMKPTDDLISLVMPYSFHMLNTYGKTFFTWSGPIPAITIMNPQLIKEVYNKFYDFEK 122

Query: 125 TSSFPLRRRIGGGLVSLEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSCGEMVNKWEKMI 184
           T +FPL   +  GL + +G KW KHRKIINPAFH EK+K M P F KSC E++ +WEK++
Sbjct: 123 THTFPLTSLLTDGLANADGDKWVKHRKIINPAFHFEKIKNMVPTFYKSCIEVMCEWEKLV 182

Query: 185 PEEG-CCEIDVWPDLQNMAADVISRTAFGSSFEEGKKIFQLLKEWAKLLMTYLTKRAYYI 244
            ++G  CE+DVWP + NM  DVISRTAFGSS++EG++IF L  E A L++  L K   YI
Sbjct: 183 SDKGSSCELDVWPWIVNMTGDVISRTAFGSSYKEGQRIFILQAELAHLIILALGKN--YI 242

Query: 245 PGFRYIPTKLNKRMEEIDRKIRDMVWGIISKRQNGMKKGEASNNEDLLRILLESNASQIE 304
           P +R+ PTK N+RM+ I ++I+ ++ GIIS R+     GEA  ++DLL ILL+SN+ Q  
Sbjct: 243 PAYRHFPTKNNRRMKTIVKEIQVILRGIISHREKARDAGEAP-SDDLLGILLKSNSEQ-- 302

Query: 305 EQKIKNKEVGMSIEEVISECRLFYFAGQETTAALLAWTMVLLGRYSEWQDRARAEVLEVF 364
                +K  G+++EE++ EC+LFYFAGQETT+ LLAWTMVLL ++ +WQ RAR EV++VF
Sbjct: 303 -----SKGNGLNMEEIMEECKLFYFAGQETTSVLLAWTMVLLSQHQDWQARAREEVMQVF 362

Query: 365 GDNNKLDFDGLSRLKIVTMILNEVLRLYPPVGMLARELHNETKLENLTLPVGVSIGIPIL 424
           G +NK D  G+++LK++TMI+ EVLRLYPPV  + R  H E KL ++TLP G+ + +P+L
Sbjct: 363 G-HNKPDLQGINQLKVMTMIIYEVLRLYPPVIQMNRATHKEIKLGDMTLPGGIQVHMPVL 422

Query: 425 CIHRDSKIWGEDASEFKPERFSEGISKATKNQVCFIPFGWGPRICVGQNFAMIEAKIALS 484
            IHRD+K+WG+DA+EFKPERF +GI+KATKNQVCF+PFGWGPRIC+GQNFA++EAK+AL+
Sbjct: 423 LIHRDTKLWGDDAAEFKPERFKDGIAKATKNQVCFLPFGWGPRICIGQNFALLEAKMALA 482

Query: 485 MILQQFSFTISPTYTHAPISNITIQPQHGAHLILRKL 520
           +ILQ+FSF +SP+Y H+P    TI PQ GAHLIL KL
Sbjct: 483 LILQRFSFELSPSYVHSPYRVFTIHPQCGAHLILHKL 508

BLAST of HG10019716 vs. TAIR 10
Match: AT3G14660.1 (cytochrome P450, family 72, subfamily A, polypeptide 13 )

HSP 1 Score: 546.2 bits (1406), Expect = 2.8e-155
Identity = 270/504 (53.57%), Postives = 356/504 (70.63%), Query Frame = 0

Query: 17  WGWRIANWVWLRPKRLEKYLRHQGLAGNSYRFLFGDTKDIAAAVRQARSHSMTFSHDIAP 76
           W WR    VWL+PK LE YLR QGLAG  Y  L GD K   + + +ARS  +  + DI P
Sbjct: 20  WVWRTLQRVWLKPKMLESYLRRQGLAGTPYTPLVGDLKRNFSMLAEARSKPINLTDDITP 79

Query: 77  RATPTSHLTIHKYGKDSFTWVGTTPRVYITEPEQVKIVFSQINDIRKTSSFPLRRRIGGG 136
           R  P     +  +G+  FTW G  P + I +PEQ+K VF+++ D +K  +FPL R I  G
Sbjct: 80  RIVPYPLQMLKTHGRTFFTWFGPIPTITIMDPEQIKEVFNKVYDFQKAHTFPLGRLIAAG 139

Query: 137 LVSLEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSCGEMVNKWEKMIPE-EGCCEIDVWP 196
           LVS +G KW KHR+IINPAFH+EK+K M PAF +SC E+V +W+K++ + +  CE+D+WP
Sbjct: 140 LVSYDGDKWTKHRRIINPAFHLEKIKNMVPAFHQSCSEIVGEWDKLVTDKQSSCEVDIWP 199

Query: 197 DLQNMAADVISRTAFGSSFEEGKKIFQLLKEWAKLLMTYLTKRAYYIPGFRYIPTKLNKR 256
            L +M ADVISRTAFGSS++EG++IF+L  E A+L++    K    IPG+RY PTK N+R
Sbjct: 200 WLVSMTADVISRTAFGSSYKEGQRIFELQAELAQLIIQAFRKA--IIPGYRYFPTKGNRR 259

Query: 257 MEEIDRKIRDMVWGIISKRQNGMKKGEASNNEDLLRILLESNASQIEEQKIKNKEVGMSI 316
           M+   R+I+ ++ GI++KR    + GEA  ++DLL ILLESN  Q        K  GMS 
Sbjct: 260 MKAAAREIKFILRGIVNKRLRAREAGEAP-SDDLLGILLESNLGQ-------TKGNGMST 319

Query: 317 EEVISECRLFYFAGQETTAALLAWTMVLLGRYSEWQDRARAEVLEVFGDNNKLDFDGLSR 376
           EE++ EC+LFYFAGQETT  LL WTMVLL ++ +WQ RAR EV +VFGD    D +GL++
Sbjct: 320 EELMEECKLFYFAGQETTTVLLVWTMVLLSQHQDWQARAREEVKQVFGDKEP-DAEGLNQ 379

Query: 377 LKIVTMILNEVLRLYPPVGMLARELHNETKLENLTLPVGVSIGIPILCIHRDSKIWGEDA 436
           LK++TMIL EVLRLYPPV  L R +H E +L +LTLP GV I +PIL I RD ++WG DA
Sbjct: 380 LKVMTMILYEVLRLYPPVVQLTRAIHKEMQLGDLTLPGGVQISLPILLIQRDRELWGNDA 439

Query: 437 SEFKPERFSEGISKATKNQVCFIPFGWGPRICVGQNFAMIEAKIALSMILQQFSFTISPT 496
            EFKP+RF +G+SKATKNQV F PF WGPRIC+GQNFA++EAK+A+++IL++FSF +SP+
Sbjct: 440 GEFKPDRFKDGLSKATKNQVSFFPFAWGPRICIGQNFALLEAKMAMTLILRKFSFELSPS 499

Query: 497 YTHAPISNITIQPQHGAHLILRKL 520
           Y HAP + +T  PQ GA LIL KL
Sbjct: 500 YVHAPYTVLTTHPQFGAPLILHKL 512

BLAST of HG10019716 vs. TAIR 10
Match: AT3G14690.1 (cytochrome P450, family 72, subfamily A, polypeptide 15 )

HSP 1 Score: 545.0 bits (1403), Expect = 6.3e-155
Identity = 268/511 (52.45%), Postives = 361/511 (70.65%), Query Frame = 0

Query: 10  LLWLLGLWGWRIANWVWLRPKRLEKYLRHQGLAGNSYRFLFGDTKDIAAAVRQARSHSMT 69
           +L ++  W WR   WVW +PK LE YLR QGLAG  Y  L GD K     + +ARS  + 
Sbjct: 13  VLAVVSWWIWRTLQWVWFKPKMLEHYLRRQGLAGTPYTPLVGDLKKNFTMLSEARSKPLK 72

Query: 70  FSHDIAPRATPTSHLTIHKYGKDSFTWVGTTPRVYITEPEQVKIVFSQINDIRKTSSFPL 129
            + DI+PR  P        YG+  FTW G  P + I +PEQ+K VF+++ D +K  +FPL
Sbjct: 73  LTDDISPRVVPYPLQMFKTYGRTYFTWFGPIPTITIMDPEQIKEVFNKVYDFQKPHTFPL 132

Query: 130 RRRIGGGLVSLEGSKWAKHRKIINPAFHMEKLKEMFPAFSKSCGEMVNKWEKMIPEEG-C 189
              I  GL + +G KWAKHR+IINPAFH+EK+K M PAF +SC E+V +W++++ ++G  
Sbjct: 133 ATIIAKGLANYDGDKWAKHRRIINPAFHIEKIKNMVPAFHQSCREVVGEWDQLVSDKGSS 192

Query: 190 CEIDVWPDLQNMAADVISRTAFGSSFEEGKKIFQLLKEWAKLLMTYLTKRAYYIPGFRYI 249
           CE+DVWP L +M ADVISRTAFGSS++EG++IF+L  E A+L++    K   +IPG+ Y+
Sbjct: 193 CEVDVWPGLVSMTADVISRTAFGSSYKEGQRIFELQAELAQLIIQAFRKA--FIPGYSYL 252

Query: 250 PTKLNKRMEEIDRKIRDMVWGIISKRQNGMKKGEASNNEDLLRILLESNASQIEEQKIKN 309
           PTK N+RM+   R+I+ ++ GI++KR    + GEA  ++DLL ILLESN  Q E      
Sbjct: 253 PTKSNRRMKAAAREIQVILRGIVNKRLRAREAGEAP-SDDLLGILLESNLRQTEGN---- 312

Query: 310 KEVGMSIEEVISECRLFYFAGQETTAALLAWTMVLLGRYSEWQDRARAEVLEVFGDNNKL 369
              GMS E+++ EC+LFYFAGQETT+ LL WTMVLL ++ +WQ RAR EV +VFGD    
Sbjct: 313 ---GMSTEDLMEECKLFYFAGQETTSVLLVWTMVLLSQHQDWQARAREEVKQVFGDKEP- 372

Query: 370 DFDGLSRLKIVTMILNEVLRLYPPVGMLARELHNETKLENLTLPVGVSIGIPILCIHRDS 429
           D +GL++LK++TMIL EVLRLYPPV  L R +H E KL +LTLP GV I +PIL +  D 
Sbjct: 373 DAEGLNQLKVMTMILYEVLRLYPPVTQLTRAIHKELKLGDLTLPGGVQISLPILLVQHDI 432

Query: 430 KIWGEDASEFKPERFSEGISKATKNQVCFIPFGWGPRICVGQNFAMIEAKIALSMILQQF 489
           ++WG DA+EF P+RF +G+SKATK+QV F PF WGPRIC+GQNFA++EAK+A+++IL++F
Sbjct: 433 ELWGNDAAEFNPDRFKDGLSKATKSQVSFFPFAWGPRICIGQNFALLEAKMAMALILRRF 492

Query: 490 SFTISPTYTHAPISNITIQPQHGAHLILRKL 520
           SF ISP+Y HAP + ITI PQ GA LI+ KL
Sbjct: 493 SFEISPSYVHAPYTVITIHPQFGAQLIMHKL 512

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038904026.12.0e-27590.58cytochrome P450 CYP72A219-like [Benincasa hispida][more]
XP_008443405.15.4e-26587.72PREDICTED: cytochrome P450 CYP72A219-like [Cucumis melo][more]
XP_004151308.29.2e-26587.31cytochrome P450 CYP72A219 [Cucumis sativus] >KGN59713.2 hypothetical protein Csa... [more]
TYK25638.13.2e-25785.74cytochrome P450 CYP72A219-like protein [Cucumis melo var. makuwa][more]
XP_023529493.14.6e-25682.85cytochrome P450 CYP72A219-like [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
H1A9888.9e-16256.3511-oxo-beta-amyrin 30-oxidase OS=Glycyrrhiza uralensis OX=74613 GN=CYP72A154 PE=... [more]
A0A0S2IHL28.9e-16255.86Cytochrome P450 72A397 OS=Kalopanax septemlobus OX=228393 GN=CYP72A397 PE=1 SV=1[more]
H2DH213.4e-16153.77Cytochrome P450 CYP72A219 OS=Panax ginseng OX=4054 PE=2 SV=1[more]
Q9LUC63.0e-15753.57Cytochrome P450 72A14 OS=Arabidopsis thaliana OX=3702 GN=CYP72A14 PE=2 SV=1[more]
Q2MJ197.3e-15652.87Cytochrome P450 72A68 OS=Medicago truncatula OX=3880 GN=CYP72A68 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A1S3B8012.6e-26587.72cytochrome P450 CYP72A219-like OS=Cucumis melo OX=3656 GN=LOC103486999 PE=3 SV=1[more]
A0A5D3DQ191.5e-25785.74Cytochrome P450 CYP72A219-like protein OS=Cucumis melo var. makuwa OX=1194695 GN... [more]
A0A6J1KZI71.9e-25582.66cytochrome P450 CYP72A219-like OS=Cucurbita maxima OX=3661 GN=LOC111499007 PE=3 ... [more]
A0A6J1EFS66.0e-25481.50cytochrome P450 CYP72A219-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC11... [more]
A0A6J1E9Z33.9e-25381.31cytochrome P450 CYP72A219-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC11... [more]
Match NameE-valueIdentityDescription
AT3G14610.12.7e-16153.62cytochrome P450, family 72, subfamily A, polypeptide 7 [more]
AT3G14680.12.1e-15853.57cytochrome P450, family 72, subfamily A, polypeptide 14 [more]
AT3G14630.17.5e-15652.61cytochrome P450, family 72, subfamily A, polypeptide 9 [more]
AT3G14660.12.8e-15553.57cytochrome P450, family 72, subfamily A, polypeptide 13 [more]
AT3G14690.16.3e-15552.45cytochrome P450, family 72, subfamily A, polypeptide 15 [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002401Cytochrome P450, E-class, group IPRINTSPR00463EP450Icoord: 457..467
score: 54.96
coord: 200..218
score: 25.1
coord: 467..490
score: 32.71
coord: 337..363
score: 28.04
coord: 317..334
score: 26.24
coord: 86..105
score: 24.9
IPR001128Cytochrome P450PRINTSPR00385P450coord: 328..345
score: 38.39
coord: 458..467
score: 59.8
coord: 467..478
score: 39.18
coord: 381..392
score: 40.16
IPR001128Cytochrome P450PFAMPF00067p450coord: 88..503
e-value: 2.2E-86
score: 290.4
IPR036396Cytochrome P450 superfamilyGENE3D1.10.630.10Cytochrome P450coord: 25..519
e-value: 1.5E-109
score: 368.7
IPR036396Cytochrome P450 superfamilySUPERFAMILY48264Cytochrome P450coord: 85..517
NoneNo IPR availablePANTHERPTHR24282CYTOCHROME P450 FAMILY MEMBERcoord: 16..516
NoneNo IPR availablePANTHERPTHR24282:SF20911-OXO-BETA-AMYRIN 30-OXIDASE-RELATEDcoord: 16..516
IPR017972Cytochrome P450, conserved sitePROSITEPS00086CYTOCHROME_P450coord: 460..469

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10019716.1HG10019716.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0020037 heme binding
molecular_function GO:0005506 iron ion binding
molecular_function GO:0004497 monooxygenase activity
molecular_function GO:0016705 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen