Cp4.1LG19g03680 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG19g03680
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: carotene epsilon-monooxygenase, chloroplastic
Location: Cp4.1LG19: 3917919 .. 3924539 (-)
RNA-Seq Expression: Cp4.1LG19g03680
Synteny: Cp4.1LG19g03680
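
The Location field in the overview can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page). The names `Location` and `parse_location` are hypothetical helpers introduced for illustration only.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed feature location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, see lead-in).
        return self.end - self.start + 1


# Matches strings such as "Cp4.1LG19: 3917919 .. 3924539 (-)".
_LOCATION_RE = re.compile(r"^\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text: str) -> Location:
    """Split a 'seqid: start .. end (strand)' string into its fields."""
    match = _LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("Cp4.1LG19: 3917919 .. 3924539 (-)")
print(loc.seqid, loc.start, loc.end, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 6621 bases on the minus strand of Cp4.1LG19.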
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptide, exon, CDS, three_prime_UTR
ATGACAGTTAAAATTAATATATAAATTAGTGGAAGTTGAAGTAATAATAAAAATAATGTTGAAATTCTAAATTAATATTAAATTTAAATTTTTCCTAAATAAATAAATAAATAACTGCGCTAAGCTATCCATATGTAGTAGCAGCTTGAAGTTGAACATCCGAAACTCCTCAACCGCCATCCATGTCTTCCATTCTCTGTTATCCCTCACTCATTTCTCCGTCATTTCCCCTCCATAAACGAATCCCTCTCCGGCGAAGAACCCAATTTCGATTTCTCTCCATTAGATCCTCCATCGACGAGAGGGATCCCCCAACGCCGGCGAAGCTCAACAACTCGACCAACACTTCAAAATCCGGTTCCTGGGTTAGCCCAGATTGGCTCACTTCTCTGACTCGCTACATTACCCTAGGTCAGGGTGACGACTCCGGCATTCCCATAGCCAGTGCGAAGCTCGATGACGTTTCCGATCTGCTCGGCGGTGCTCTTTTTCTTCCGCTTTTCAAGTGGATGAATGACTATGGACCTATTTACAGGCTCGCCGCTGGGCCTAGGAATTTCGTTGTGGTTAGTGACCCTGCCATTGCTAAGCATGTTCTTAGGAATTATGGGACTTACGCTAAAGGGCTTGTTTCTGAGGTCTCCGAGTTCTTGTTTGGGTCCGGTTTTGCGATTGCTGAAGGCCCGCTTTGGACGGTACTCTTTTCGCTTACTGTTCTGTTCATGAACGAAGATAAATTTGGAATTCATCATCGTATGCTGCGTTTCTACTTTCAAATTCTATCTGTTAATTTACGGTTGAATTATGACTCTGGTATGCGTTTCAACTTCAAGCAACAAAGAGTATGAACTGTAATTGTACTTTAAATTTAACTATTTATTATATTATTTTCAACTGTGTTTTCATTATGTTAATACACCGTATTGATTAATGAGTAATGAAACAAAACAATTGAGGATTTCTGGTTGTAACGTTACTGGTTTTTGTATTTTTGGAAGCTATTTAGTCGTTTTTATCATTGATCTTCTGCATTTTTGTATAAGTTTGCTTCTTTGGTATACATTCTTGCTGTGAAAGATGTTTTACTGGTTCGATGAGAAGCATTTTATGGACAATAATCGAGTAAACAAAATCTCAGTGGGTGAGTTTTGGAGCATGAATGAAGAAGAGGGAAAATATTTTGAAAATGTTTGAAATTGTATACTTTTATGCATTCAGATGTGATTTTACTTCCCATACTGATTATTGGTTTATATAGTTGAATTATGGAAGGAAGAAAAAGCTTTTAAGCCTGTGGAATATGCACGTTTAACCGAATGTTCTCCAATCGTTCAAATTCCTTTTTGAGTCTCACTTGAACTTGGTTGTTATATATCATATTAGGTTCGCCGTAGGGCTGTGGTTCCATCTCTTCACAAGAAGTACTTATCCGTTATTGTTGATCGAGTATTCTGTAAATGTGCCATGAGATTGGTGGAGAAGCTGCAAGAGGATGCATTAAATAATAATTCAGTTAACATGGAGGAAAAGTTTTCTCAACTAACTCTTGATGTTATTGGTCTATCTGTATTCAACTACAGTTTTGATTCTCTCACTACTGACAGCCCTGTGATTGATGCAGTTTACACTGCTTTGAAAGAGGCAGAGGCTCGTTCTACTGATATTTTACCATATTGGAAGGCAGGTTTTCTTATGTAACTAGTTCTGCATATGATCGTCGTTTTCTCACCTCGTTTTTCACCCGCACCATTTTTTGTTGCAGATTAAGGCTCTGTGTAAAATAATCCCAAGACAGATAAAAGCCGAAGAAGCAGTTACAGTGATTCAAAGAACTGTTGAAGATCTAATTGCCAAGTGCAAAGCAATTGTGGAAATGGAGGGTGAGCGCATCGATGAGGAGGAATATGTGAATGATGCTGATCCAAGTATCCTTCGTTTCCTGCTGGCCAGTAGACAAGAGGTCTGTTCTCGATAACTTTTACGTTTTCCATTCATCACTG
ACATTTTGCAATGTAGTTTAAGCAATTTTGCAAAGTATCTTTCGTTTCCTGAGGGTGAGCGCTCTGATTCTTTTTAATTACAGCCATTGTCCCCTTTTGTGTATGCTAAATCACCGATCAAGGAATTTATTTCATGCTTATGAAACAGGTTCTGTCGCATGGTGAGATGAGATTATTATAGGCTTTGATTACATTGTGAATTCAATATATTAATGCCAGAATTAAGCCTTCTTGTCCCTCCTATAATGTTTGTTTCTCCAAAAACGCAAAAAGTAAATAAAAAAGTGCACTAACGACGAGAGGTTGTAAGGAAGGCTAGAAGGGATATATTAGGAATGTGAACTGTGATATATAAATTACTGGGGAAATTTGTTTAAGTCGAGTGGTTATATAGAGGAGTTAGAGGATTTGGTAAGTAATAGAATTTGGAGGGTCGTTTTGACTTTTGTAAGCCTCTCTCAAAATATTTGGGAAGTGGTAATTTTTCTTTTCTTTGATCAATACTATTTTCTGTTAGTTTTTTGTTTTAAATTTGGGTTCTACCAGAGGTCCAGAGGCCAGAACTTTGGAATTAGGAAATTTTGTGTCTATTAGCATAGTTCAAATGAATGCTCGTCAATTTGCTAAGGTCTATTGATGGTGTGCTCATAAAATTGAAAAAACTACTTATACTTTGAGATCTGCCATGTTCTAACTTCTGTCACTGATACTTGCCTCCTGTGTTGTTTACTGTTTAGGTTTCAAGTGTACAATTACGAGATGATCTATTGTCGATGTTGGTTGCTGGACATGAAACTACCGGTTCTGTTCTGACTTGGACATTGTATCTTTTAAGTAAGGTAATCTGGTCATTTGGGTATATGTAGAATTTTACATAGTAACTTTACTTATAGATGTTTGTCCCTTGGTACATTTTATTTCTTTGAAGAGAAAATTTCATTTGGTCCAATCTTGATGGGTTAATCTTAGCCTTTAAACTTCCAAATTTGTTACCTAAACCTGAATAAGTAATATAGTTAATTCATTCACGTGTTATAAAAATAGTGAAGAAAGATTTTGGCTAGTCACCATGTGGTTGTAATTGATATTATACATAAGAACAATCTTATCAGGCACAGGATCTAAATGAAGTTTTGACTGGATATATTTTTATTATGAAATTGATATACTTCTGTTCTTTCTTTTCAAAGAGAAACCAAATTTTGATTGAGAAAAATAAAAGATTACAAAGGGGGGGATCTTGTGTTAAGAATGGATACTTCTTGATGTTTGCTGGAATTCCGACTTGAGGATATCTTTGACTGATTGCCTTCTTCATTATGATTAGTAAGATGGGGAAAAGCTCAAATTTTTTTTTTCGTAGTTGTTTTTCAATATGGTTTATTATTCTGCAACTCAGGATTCCTCAGCATTGAACAAGGCGCAAACTGAAGTTGATAGAGTCTTACAAGGAAGGCCTCCTTCCTACGAAGATACAAAGGAACTTAAATTTTTGACGCGTTGTATCCTTGAGTCAATGCGCCTTTATCCACATCCACCTGTACGTTTATTTATTGATCAAGTGAATTATGACATTAGAAGAAAAGAAGGAAAAGTATGGAAATTATTCATCATTCTGCTAATCTTGTTTTTATCGTAATGTTCCTTGAACCTTTCCAGGTTTTAATAAGAAGAGCTCAAGTTGCTGACGTGCTTCCTGGAAATTACAAGGTTAATGCTGGTCAAGATATCATGATTTCAGTATATAACATCCATCGCTCGTCCCAGGTACATATTTCAAGGAGTTTGTTTCTATCTTTGCATTAGATATAGACTCTTTTGTTCATTCTTGTTCAAGATTTTCACTATAAAATTTCAGAAGATGCTTTCTGCTGTTAAGTTGGTATATTGGTAGTAGTTATTAGAACTCAGTTGATGATTTTCAAATATTGTGCTTGCAGAAATTCTTACACTTGCTTCATTGAATAGACATCTCTTCTGATATTCGATTAGTTAACTCG
TCTATGAACAAGAATCTTTTACCAAATGATATAACCTAACAATAATAATCTATTACCATGAATAGGAATTGGATGGTAAGTTTTGGTATAAATTTGAGCAATGGAGGGAGAATTTTTAGGATGTTTGAAATTATAGGTGAAATTCCTTGTTTGTATATGGGTATATTCGACCGCCTCTCATCCTTAGAACTCACAAACACAAATTATACAAAACATCAGGAAAGGAATCATTTTCTTAAAAGAGTTTCGAAGAGAAGTCGTTGATTATTACAAAAGTCGATCTACAATGCAAAATTAGCAAAACAAAACTAAATACAGAAAGAACACTTACATTCTTACCAAAAATTAAAGTAAAAATCATTTACATTATCTCAAGCATTCTCAATTGTTTTTTTGTTACTAGTTTGAATATATATTTTGAACATAGGGATGTTTGAACATGCTCTTGCTTTATGTGGTGTTTTGCAAAATATTGTAACTCTCACTTGCGTTTCAAGGTTTGGGAACAAGCAGAAGAGTTCATACCAGAAAGATTTGACTTGGAGGGTCCTGTGCCCAATGAAAGCAATACAGATTTCAGGTACACATTCAATTTGTGTGTGGAAACTACACTGTTTTAGTAGTACAATAGGCACTTACCAGGCAGTCATAGGCTTGTTTGGTGTGAGTTGCTTGACCCCAAGTTTGTTATCCTCTTTAAGGACATGCTAATAGGCTAGAATGATGGCATTTTGAATGTCCAGGTGCCATAGACACTATTTGTGAGATCCCACATCGGTTGGGAAGGAGAACGAAATATTCTTTATAAGGGTGTGAAAACCTCTCCCTAGTAGACGTGTTTTAAAAACCTTGAGGGAAAACTCGAAAGGGAAAGCCCCTTGGGCCGTTACAAATGGTATCAGAGCTAGACATCGGGCGATGTGTCAGCGAGGAGGCTGAGCCCCGAAGGGAGTTGAACATGAGGCGGTGTGCCAGCAAGGACGCTGGGCCCCGAAGGGGTGTGGACTGTGAGATCCCACATCGGTCGGGGAGGAGAACAAAACATTCTTTATAGGGTGTGGAAACCTCTCCTTAGTAGACGCATTTTAAACCTTGGGAGAACCCCGAAAGGGAAATCCCAAAGAGGACAATATCTGCTAGCGGTGGGCTTGAGCCGTTACACTATTACTTTGGCTGGCCGAAAACAAAAATCACCATCTTTTACACTTTACAGAAGTAGTTTGTCCAATGTCCTCTCCCCACTCTTCAGATGGTGCCCGTCCATCTATCGCTGGCTCAAAAGTAACCAATGACCACCATTTTTTCTTCTTCCTGCATGTTAGTTTCCTAGAAATCCTTTGCCCCTCCGCCTTTTGACAGTTTCTGAAAGCAGCCATTCAATTTCTAAGGTTTCTGTTCCCCCTCCCTTTCACTTTTTTTTTTCTCCCTTGCCTTGATTTGACGACTATAAATTTAATTCTTATAATTAGGTTGTGGACTCGATTCATAAGCTTGGGCTTTATTTAATTTAAATGCTTACTAATTGGACCTCATTTGACAATCCTTCGTCCGTGCCATAATATTTTACGTATTTAGTAACCTATACTGTCTCATTGGACTTCTTTCTCACTTTTTGCATTGCACTGAACCTGTAAAGAGCTTTCTCCCTGGAAAAAAAAACAGTGGGTGGTACTTTGGATACTAGTGGCACTATAAGAACAGCATAGTAGTTAGATCACTTTCATTTTTTACTTGCATTTTAAATGATGCCTGGTTTTCATAGACCTGAATGACCAAACCATACAATACAGATTTATACCGTTCAGTGGAGGACCCCGAAAGTGTGTTGGCGATCAATTTGCCTTGCTTGAAGCTATAGTTGCACTTGCCATTTTTCTGCAGCATTTGAACTTCGAGCTGGTTCCAGATCAGACCATTGGGATGACTACTGGAGCAACTATACATACAACAAATGTAATGTTTACTATGCTCATCCCTGCATTGCTACACTTTTTCCTACA
TTTTGAGCTTGGATCCCGAGTCAACAATTCATTTTAGAATTGTGTTCTCAGTATCGTTGGCTGTGCTTGATAATTACTAATGAAACTATAACTAGATAATAAATAATTTTTTTGAGAACTGCCTTTTTTAGAACATTAATGAATTTATTTTATTTTCAAAACTCTGTAGACATCAATTGTGAATTATTGTATTGTCTTTTCTTGTTGTTAATGTTTGAAAGATTATTTGTAATATTGATGAGTTTTCCTTAGAAGAATAGCGGATGAAATTGTAGAAAAATATGGTTTATATGAAAGATTGATACTTATAAGTGCTAATTGCAAATGCAGGGTTTGTACATGAAACTCAGCCAAAAGCAGATGACCCCAGGATTAGCTTCCCCTGCTTCAAGGTAAATGTTGAAAATGTACATAATATCACGTTAGGGAGTTGAGAGAAAAAAGATAGAATTAGTTCTTTTAAATTCTGTGCTGATTCTTTTGTCTATAGACAATCACTTAATCATATATACATACTGAAGAATCATTTGATCTAGATTCCTGTGGTAATAAGATGCTTGAGAATAGTATTATGATATAGCCCAACTCACTTGATAGGATTTCTTGTTTGAATAAATCAAC

mRNA sequence

ATGACACTTGAAGTTGAACATCCGAAACTCCTCAACCGCCATCCATGTCTTCCATTCTCTGTTATCCCTCACTCATTTCTCCGTCATTTCCCCTCCATAAACGAATCCCTCTCCGGCGAAGAACCCAATTTCGATTTCTCTCCATTAGATCCTCCATCGACGAGAGGGATCCCCCAACGCCGGCGAAGCTCAACAACTCGACCAACACTTCAAAATCCGGTTCCTGGCCCAGATTGGCTCACTTCTCTGACTCGCTACATTACCCTAGGTCAGGGTGACGACTCCGGCATTCCCATAGCCAGTGCGAAGCTCGATGACGTTTCCGATCTGCTCGGCGGTGCTCTTTTTCTTCCGCTTTTCAAGTGGATGAATGACTATGGACCTATTTACAGGCTCGCCGCTGGGCCTAGGAATTTCGTTGTGGTTAGTGACCCTGCCATTGCTAAGCATGTTCTTAGGAATTATGGGACTTACGCTAAAGGGCTTGTTTCTGAGGTCTCCGAGTTCTTGTTTGGGTCCGGTTTTGCGATTGCTGAAGGCCCGCTTTGGACGGTTCGCCGTAGGGCTGTGGTTCCATCTCTTCACAAGAAGTACTTATCCGTTATTGTTGATCGAGTATTCTGTAAATGTGCCATGAGATTGGTGGAGAAGCTGCAAGAGGATGCATTAAATAATAATTCAGTTAACATGGAGGAAAAGTTTTCTCAACTAACTCTTGATGTTATTGGTCTATCTGTATTCAACTACAGTTTTGATTCTCTCACTACTGACAGCCCTGTGATTGATGCAATTAAGGCTCTGTGTAAAATAATCCCAAGACAGATAAAAGCCGAAGAAGCAGTTACAGTGATTCAAAGAACTGTTGAAGATCTAATTGCCAAGTGCAAAGCAATTGTGGAAATGGAGGGTGAGCGCATCGATGAGGAGGAATATGTGAATGATGCTGATCCAAGTATCCTTCGTTTCCTGCTGGCCAGTAGACAAGAGGTTTCAAGTGTACAATTACGAGATGATCTATTGTCGATGTTGGTTGCTGGACATGAAACTACCGGTTCTGTTCTGACTTGGACATTGTATCTTTTAAGTAAGGCGCAAACTGAAGTTGATAGAGTCTTACAAGGAAGGCCTCCTTCCTACGAAGATACAAAGGAACTTAAATTTTTGACGCGTTGTATCCTTGAGTCAATGCGCCTTTATCCACATCCACCTGTTTTAATAAGAAGAGCTCAAGTTGCTGACGTGCTTCCTGGAAATTACAAGGTTAATGCTGGTCAAGATATCATGATTTCAGTATATAACATCCATCGCTCGTCCCAGGTTTGGGAACAAGCAGAAGAGTTCATACCAGAAAGATTTGACTTGGAGGGTCCTGTGCCCAATGAAAGCAATACAGATTTCAGTGGAGGACCCCGAAAGTGTGTTGGCGATCAATTTGCCTTGCTTGAAGCTATAGTTGCACTTGCCATTTTTCTGCAGCATTTGAACTTCGAGCTGGTTCCAGATCAGACCATTGGGATGACTACTGGAGCAACTATACATACAACAAATGGTTTGTACATGAAACTCAGCCAAAAGCAGATGACCCCAGGATTAGCTTCCCCTGCTTCAAGGTAAATGTTGAAAATGTACATAATATCACGTTAGGGAGTTGAGAGAAAAAAGATAGAATTAGTTCTTTTAAATTCTGTGCTGATTCTTTTGTCTATAGACAATCACTTAATCATATATACATACTGAAGAATCATTTGATCTAGATTCCTGTGGTAATAAGATGCTTGAGAATAGTATTATGATATAGCCCAACTCACTTGATAGGATTTCTTGTTTGAATAAATCAAC

Coding sequence (CDS)

ATGACACTTGAAGTTGAACATCCGAAACTCCTCAACCGCCATCCATGTCTTCCATTCTCTGTTATCCCTCACTCATTTCTCCGTCATTTCCCCTCCATAAACGAATCCCTCTCCGGCGAAGAACCCAATTTCGATTTCTCTCCATTAGATCCTCCATCGACGAGAGGGATCCCCCAACGCCGGCGAAGCTCAACAACTCGACCAACACTTCAAAATCCGGTTCCTGGCCCAGATTGGCTCACTTCTCTGACTCGCTACATTACCCTAGGTCAGGGTGACGACTCCGGCATTCCCATAGCCAGTGCGAAGCTCGATGACGTTTCCGATCTGCTCGGCGGTGCTCTTTTTCTTCCGCTTTTCAAGTGGATGAATGACTATGGACCTATTTACAGGCTCGCCGCTGGGCCTAGGAATTTCGTTGTGGTTAGTGACCCTGCCATTGCTAAGCATGTTCTTAGGAATTATGGGACTTACGCTAAAGGGCTTGTTTCTGAGGTCTCCGAGTTCTTGTTTGGGTCCGGTTTTGCGATTGCTGAAGGCCCGCTTTGGACGGTTCGCCGTAGGGCTGTGGTTCCATCTCTTCACAAGAAGTACTTATCCGTTATTGTTGATCGAGTATTCTGTAAATGTGCCATGAGATTGGTGGAGAAGCTGCAAGAGGATGCATTAAATAATAATTCAGTTAACATGGAGGAAAAGTTTTCTCAACTAACTCTTGATGTTATTGGTCTATCTGTATTCAACTACAGTTTTGATTCTCTCACTACTGACAGCCCTGTGATTGATGCAATTAAGGCTCTGTGTAAAATAATCCCAAGACAGATAAAAGCCGAAGAAGCAGTTACAGTGATTCAAAGAACTGTTGAAGATCTAATTGCCAAGTGCAAAGCAATTGTGGAAATGGAGGGTGAGCGCATCGATGAGGAGGAATATGTGAATGATGCTGATCCAAGTATCCTTCGTTTCCTGCTGGCCAGTAGACAAGAGGTTTCAAGTGTACAATTACGAGATGATCTATTGTCGATGTTGGTTGCTGGACATGAAACTACCGGTTCTGTTCTGACTTGGACATTGTATCTTTTAAGTAAGGCGCAAACTGAAGTTGATAGAGTCTTACAAGGAAGGCCTCCTTCCTACGAAGATACAAAGGAACTTAAATTTTTGACGCGTTGTATCCTTGAGTCAATGCGCCTTTATCCACATCCACCTGTTTTAATAAGAAGAGCTCAAGTTGCTGACGTGCTTCCTGGAAATTACAAGGTTAATGCTGGTCAAGATATCATGATTTCAGTATATAACATCCATCGCTCGTCCCAGGTTTGGGAACAAGCAGAAGAGTTCATACCAGAAAGATTTGACTTGGAGGGTCCTGTGCCCAATGAAAGCAATACAGATTTCAGTGGAGGACCCCGAAAGTGTGTTGGCGATCAATTTGCCTTGCTTGAAGCTATAGTTGCACTTGCCATTTTTCTGCAGCATTTGAACTTCGAGCTGGTTCCAGATCAGACCATTGGGATGACTACTGGAGCAACTATACATACAACAAATGGTTTGTACATGAAACTCAGCCAAAAGCAGATGACCCCAGGATTAGCTTCCCCTGCTTCAAGGTAA

Protein sequence

MTLEVEHPKLLNRHPCLPFSVIPHSFLRHFPSINESLSGEEPNFDFSPLDPPSTRGIPQRRRSSTTRPTLQNPVPGPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLQEDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLTTDSPVIDAIKALCKIIPRQIKAEEAVTVIQRTVEDLIAKCKAIVEMEGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKAQTEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFSGGPRKCVGDQFALLEAIVALAIFLQHLNFELVPDQTIGMTTGATIHTTNGLYMKLSQKQMTPGLASPASR
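
The protein sequence above corresponds to a translation of the coding sequence (CDS). The sketch below shows one way to check that relationship, assuming the standard nuclear genetic code. The function name `translate` and its parameter are hypothetical, and only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_at_terminator: bool = True) -> str:
    """Translate a DNA coding sequence in frame 0 (hypothetical helper).

    Unknown codons (for example, those containing N) become 'X'.
    Trailing bases that do not fill a complete codon are ignored.
    """
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and stop_at_terminator:
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS shown on this page.
print(translate("ATGACACTTGAAGTTGAACATCCGAAACTC"))
# Last four codons of the CDS, ending in the TAA stop codon.
print(translate("GCTTCAAGGTAA"))
```

Passing the full CDS string to `translate` should, under the same assumption, reproduce the listed protein sequence from its first residue to the one before the stop codon.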
Homology
BLAST of Cp4.1LG19g03680 vs. ExPASy Swiss-Prot
Match: Q6TBX7 (Carotene epsilon-monooxygenase, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CYP97C1 PE=1 SV=1)

HSP 1 Score: 746.9 bits (1927), Expect = 1.6e-214
Identity = 388/539 (71.99%), Positives = 440/539 (81.63%), Query Frame = 0

Query: 19  FSVIPHSFLRHFPSINESLSGEEPNFDFSPLDPPSTRGIPQRRRSSTTRPTLQNPVPGPD 78
           FS    S+   F +    L   +P F FS         I + +    T  +       PD
Sbjct: 6   FSPSSSSYSSLFTAKPTRLLSPKPKFTFS-----IRSSIEKPKPKLETNSSKSQSWVSPD 65

Query: 79  WLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGPRN 138
           WLT+LTR ++ G+ D+SGIPIA+AKLDDV+DLLGGALFLPL+KWMN+YGPIYRLAAGPRN
Sbjct: 66  WLTTLTRTLSSGKNDESGIPIANAKLDDVADLLGGALFLPLYKWMNEYGPIYRLAAGPRN 125

Query: 139 FVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVPSLHKKY 198
           FV+VSDPAIAKHVLRNY  YAKGLV+EVSEFLFGSGFAIAEGPLWT RRRAVVPSLH++Y
Sbjct: 126 FVIVSDPAIAKHVLRNYPKYAKGLVAEVSEFLFGSGFAIAEGPLWTARRRAVVPSLHRRY 185

Query: 199 LSVIVDRVFCKCAMRLVEKLQEDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLTTDS 258
           LSVIV+RVFCKCA RLVEKLQ  A + ++VNME KFSQ+TLDVIGLS+FNY+FDSLTTDS
Sbjct: 186 LSVIVERVFCKCAERLVEKLQPYAEDGSAVNMEAKFSQMTLDVIGLSLFNYNFDSLTTDS 245

Query: 259 PVIDA--------------------IKALCKIIPRQIKAEEAVTVIQRTVEDLIAKCKAI 318
           PVI+A                    I ALCKI+PRQ+KAE+AVT+I+ TVEDLIAKCK I
Sbjct: 246 PVIEAVYTALKEAELRSTDLLPYWKIDALCKIVPRQVKAEKAVTLIRETVEDLIAKCKEI 305

Query: 319 VEMEGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSVLTWTL 378
           VE EGERI++EEYVNDADPSILRFLLASR+EVSSVQLRDDLLSMLVAGHETTGSVLTWTL
Sbjct: 306 VEREGERINDEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTL 365

Query: 379 YLLS-------KAQTEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLIRRAQV 438
           YLLS       KAQ EVDRVL+GR P++ED KELK++TRCI ESMRLYPHPPVLIRRAQV
Sbjct: 366 YLLSKNSSALRKAQEEVDRVLEGRNPAFEDIKELKYITRCINESMRLYPHPPVLIRRAQV 425

Query: 439 ADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTD-----F 498
            D+LPGNYKVN GQDIMISVYNIHRSS+VWE+AEEF+PERFD++G +PNE+NTD     F
Sbjct: 426 PDILPGNYKVNTGQDIMISVYNIHRSSEVWEKAEEFLPERFDIDGAIPNETNTDFKFIPF 485

Query: 499 SGGPRKCVGDQFALLEAIVALAIFLQHLNFELVPDQTIGMTTGATIHTTNGLYMKLSQK 526
           SGGPRKCVGDQFAL+EAIVALA+FLQ LN ELVPDQTI MTTGATIHTTNGLYMK+SQ+
Sbjct: 486 SGGPRKCVGDQFALMEAIVALAVFLQRLNVELVPDQTISMTTGATIHTTNGLYMKVSQR 539
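
The percentages in each HSP summary line are the ratio of matching (or positively scoring) columns to the alignment length. The sketch below parses such a line and recomputes the percentages. It is a minimal illustration, not part of any BLAST toolkit: `parse_hsp_stats` is a hypothetical helper, and the pattern accepts both the "Positives" and "Postives" spellings of the label.

```python
import re

# Captures "Identity = 388/539 (71.99%)" style fragments.
_FRACTION_RE = re.compile(
    r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract identity/positives counts from one HSP summary line."""
    stats = {}
    for label, matches, aligned, reported in _FRACTION_RE.findall(line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = {
            "matches": int(matches),
            "aligned": int(aligned),
            "reported_pct": float(reported),
            # Recomputed from the counts, rounded to two decimals.
            "computed_pct": round(100 * int(matches) / int(aligned), 2),
        }
    return stats


hsp = parse_hsp_stats(
    "Identity = 388/539 (71.99%), Postives = 440/539 (81.63%), Query Frame = 0"
)
for name, values in hsp.items():
    print(name, values["matches"], values["aligned"], values["computed_pct"])
```

For the Q6TBX7 hit, the recomputed values agree with the reported ones: 388 of 539 aligned columns are identical and 440 of 539 score positively.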

BLAST of Cp4.1LG19g03680 vs. ExPASy Swiss-Prot
Match: Q93VK5 (Protein LUTEIN DEFICIENT 5, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CYP97A3 PE=1 SV=1)

HSP 1 Score: 401.4 bits (1030), Expect = 1.6e-110
Identity = 217/468 (46.37%), Positives = 306/468 (65.38%), Query Frame = 0

Query: 92  GDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHV 151
           G D   P        +  +   A F+PL++    YG I+RL  GP++F++VSDP+IAKH+
Sbjct: 105 GSDQDYPKVPEAKGSIQAVRNEAFFIPLYELFLTYGGIFRLTFGPKSFLIVSDPSIAKHI 164

Query: 152 LR-NYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVFCKC 211
           L+ N   Y+KG+++E+ +F+ G G   A+G +W  RRRA+VP+LH+KY++ ++  +F + 
Sbjct: 165 LKDNAKAYSKGILAEILDFVMGKGLIPADGEIWRRRRRAIVPALHQKYVAAMIS-LFGEA 224

Query: 212 AMRLVEKLQEDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLTTDSPVIDAIKALCK- 271
           + RL +KL   AL    V ME  FS+LTLD+IG +VFNY FDSLT D+ VI+A+  + + 
Sbjct: 225 SDRLCQKLDAAALKGEEVEMESLFSRLTLDIIGKAVFNYDFDSLTNDTGVIEAVYTVLRE 284

Query: 272 -------------------IIPRQIKAEEAVTVIQRTVEDLIAKCKAIVEMEGERIDEEE 331
                              I PRQ K   ++ +I  T++DLIA CK +VE E E    EE
Sbjct: 285 AEDRSVSPIPVWDIPIWKDISPRQRKVATSLKLINDTLDDLIATCKRMVE-EEELQFHEE 344

Query: 332 YVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLL-------SK 391
           Y+N+ DPSIL FLLAS  +VSS QLRDDL++ML+AGHET+ +VLTWT YLL       +K
Sbjct: 345 YMNERDPSILHFLLASGDDVSSKQLRDDLMTMLIAGHETSAAVLTWTFYLLTTEPSVVAK 404

Query: 392 AQTEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNA 451
            Q EVD V+  R P+ +D K+LK+ TR + ES+RLYP PPVLIRR+   D+L G Y +  
Sbjct: 405 LQEEVDSVIGDRFPTIQDMKKLKYTTRVMNESLRLYPQPPVLIRRSIDNDIL-GEYPIKR 464

Query: 452 GQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFS-----GGPRKCVGDQF 511
           G+DI ISV+N+HRS   W+ AE+F PER+ L+GP PNE+N +FS     GGPRKC+GD F
Sbjct: 465 GEDIFISVWNLHRSPLHWDDAEKFNPERWPLDGPNPNETNQNFSYLPFGGGPRKCIGDMF 524

Query: 512 ALLEAIVALAIFLQHLNFELVPD-QTIGMTTGATIHTTNGLYMKLSQK 526
           A  E +VA+A+ ++  NF++ P    + MTTGATIHTT GL + ++++
Sbjct: 525 ASFENVVAIAMLIRRFNFQIAPGAPPVKMTTGATIHTTEGLKLTVTKR 569

BLAST of Cp4.1LG19g03680 vs. ExPASy Swiss-Prot
Match: O48921 (Cytochrome P450 97B2, chloroplastic OS=Glycine max OX=3847 GN=CYP97B2 PE=2 SV=1)

HSP 1 Score: 340.5 bits (872), Expect = 3.5e-92
Identity = 202/506 (39.92%), Positives = 299/506 (59.09%), Query Frame = 0

Query: 89  LGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIA 148
           L  G    +PIA      VSDLLG  LF  L+ W  ++G +Y+LA GP+ FVVVSDP +A
Sbjct: 71  LSGGSIGSMPIAEGA---VSDLLGRPLFFSLYDWFLEHGAVYKLAFGPKAFVVVSDPIVA 130

Query: 149 KHVLR-NYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVF 208
           +H+LR N  +Y KG+++++ E + G G   A+   W  RRR + P+ H  YL  +V ++F
Sbjct: 131 RHILRENAFSYDKGVLADILEPIMGKGLIPADLDTWKQRRRVIAPAFHNSYLEAMV-KIF 190

Query: 209 CKCAMRLVEKLQE-------DALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLTTDSPV 268
             C+ R + K  +       D  ++  +++E +FS L LD+IGL VFNY F S+T +SPV
Sbjct: 191 TTCSERTILKFNKLLEGEGYDGPDSIELDLEAEFSSLALDIIGLGVFNYDFGSVTKESPV 250

Query: 269 IDAIKALC--------------------KIIPRQIKAEEAVTVIQRTVEDLIAKCK-AIV 328
           I A+                         I+PRQ K ++ + VI   ++ LI   K +  
Sbjct: 251 IKAVYGTLFEAEHRSTFYIPYWKIPLARWIVPRQRKFQDDLKVINTCLDGLIRNAKESRQ 310

Query: 329 EMEGERIDEEEYVNDADPSILRFLLASR-QEVSSVQLRDDLLSMLVAGHETTGSVLTWTL 388
           E + E++ + +Y+N  D S+LRFL+  R  +V   QLRDDL++ML+AGHETT +VLTW +
Sbjct: 311 ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAV 370

Query: 389 YLLS-------KAQTEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLIRRAQV 448
           +LL+       KAQ EVD VL    P++E  KEL+++   ++E++RLYP PP+LIRR+  
Sbjct: 371 FLLAQNPSKMKKAQAEVDLVLGTGRPTFESLKELQYIRLIVVEALRLYPQPPLLIRRSLK 430

Query: 449 ADVLPGNYK-------VNAGQDIMISVYNIHRSSQVWEQAEEFIPERF-------DLEG- 508
           +DVLPG +K       + AG D+ ISVYN+HRS   W++ ++F PERF       ++EG 
Sbjct: 431 SDVLPGGHKGEKDGYAIPAGTDVFISVYNLHRSPYFWDRPDDFEPERFLVQNKNEEIEGW 490

Query: 509 -----------PVPNESNTDFS-----GGPRKCVGDQFALLEAIVALAIFLQHLNFELV- 526
                        PNE  +DF+     GGPRKCVGDQFAL+E+ VAL + LQ+ + EL  
Sbjct: 491 AGLDPSRSPGALYPNEVISDFAFLPFGGGPRKCVGDQFALMESTVALTMLLQNFDVELKG 550

BLAST of Cp4.1LG19g03680 vs. ExPASy Swiss-Prot
Match: O23365 (Cytochrome P450 97B3, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CYP97B3 PE=1 SV=2)

HSP 1 Score: 333.6 bits (854), Expect = 4.2e-90
Identity = 210/547 (38.39%), Positives = 303/547 (55.39%), Query Frame = 0

Query: 51  PPSTRGIPQRRRSSTTRPTLQNPVPGPDWLTSLTRYIT--LGQGDDSGIPIASAKLDDVS 110
           P +   +  RR S + +     P    + L + +  +T  L  G    +P A      VS
Sbjct: 36  PQTISSVNSRRASVSIKCQSTEPKTNGNILDNASNLLTNFLSGGSLGSMPTAEG---SVS 95

Query: 111 DLLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLR-NYGTYAKGLVSEVS 170
           DL G  LFL L+ W  ++G IY+LA GP+ FVV+SDP IA+HVLR N  +Y KG+++E+ 
Sbjct: 96  DLFGKPLFLSLYDWFLEHGGIYKLAFGPKAFVVISDPIIARHVLRENAFSYDKGVLAEIL 155

Query: 171 EFLFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLV--------EKLQ 230
           E + G G   A+   W +RRRA+ P+ HK YL  +V +VF  C+ +++        EK  
Sbjct: 156 EPIMGKGLIPADLDTWKLRRRAITPAFHKLYLEAMV-KVFSDCSEKMILKSEKLIREKET 215

Query: 231 EDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLTTDSPVIDAIKALC----------- 290
               +   +++E +FS L LD+IGLSVFNY F S+T +SPVI A+               
Sbjct: 216 SSGEDTIELDLEAEFSSLALDIIGLSVFNYDFGSVTKESPVIKAVYGTLFEAEHRSTFYF 275

Query: 291 ---------KIIPRQIKAEEAVTVIQRTVEDLIAKCKAI-VEMEGERIDEEEYVNDADPS 350
                     I+PRQ K +  + +I   ++ LI   K    E + E++ E +Y N  D S
Sbjct: 276 PYWNFPPARWIVPRQRKFQSDLKIINDCLDGLIQNAKETRQETDVEKLQERDYTNLKDAS 335

Query: 351 ILRFLLASR-QEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLS-------KAQTEVDR 410
           +LRFL+  R  ++   QLRDDL++ML+AGHETT +VLTW ++LLS       KAQ E+D 
Sbjct: 336 LLRFLVDMRGVDIDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLSQNPEKIRKAQAEIDA 395

Query: 411 VLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLIRRAQVADVLPG-------NYKVNA 470
           VL   PP+YE  K+L+++   ++E +RL+P PP+LIRR    + LPG        +KV  
Sbjct: 396 VLGQGPPTYESMKKLEYIRLIVVEVLRLFPQPPLLIRRTLKPETLPGGHKGEKEGHKVPK 455

Query: 471 GQDIMISVYNIHRSSQVWEQAEEFIPERF-------DLEG------------PVPNESNT 526
           G DI ISVYN+HRS   W+   +F PERF        +EG              PNE   
Sbjct: 456 GTDIFISVYNLHRSPYFWDNPHDFEPERFLRTKESNGIEGWAGFDPSRSPGALYPNEIIA 515

BLAST of Cp4.1LG19g03680 vs. ExPASy Swiss-Prot
Match: Q43078 (Cytochrome P450 97B1, chloroplastic OS=Pisum sativum OX=3888 GN=CYP97B1 PE=2 SV=1)

HSP 1 Score: 300.8 bits (769), Expect = 3.0e-80
Identity = 196/510 (38.43%), Positives = 283/510 (55.49%), Query Frame = 0

Query: 46  FSPLDPPSTRGIPQRRRSSTTRPTLQNPVPGPDWLTSLTRYITLGQGDDSGIPIASAKLD 105
           FS +   S  G    +R  ++R    N     + LTSL     LG      +PIA     
Sbjct: 48  FSSIRCQSVNG---EKRKQSSRNVFDN---ASNLLTSLLSGANLG-----SMPIAEGA-- 107

Query: 106 DVSDLLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLR-NYGTYAKGLVS 165
            V+DL    LF  L+ W  ++G +Y+LA GP+ FVVVSDP +A+H+LR N  +Y KG+++
Sbjct: 108 -VTDLFDRPLFFSLYDWFLEHGSVYKLAFGPKAFVVVSDPIVARHILRENAFSYDKGVLA 167

Query: 166 EVSEFLFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLQE---- 225
           ++ E + G G   A+   W  RRR + P  H  YL  +V ++F  C+ R V K+ E    
Sbjct: 168 DILEPIMGKGLIPADLETWKQRRRVIAPGFHTSYLEAMV-QLFTSCSERTVLKVNELLEG 227

Query: 226 ---DALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLTTDSPVIDAIKALC--------- 285
              D   +  +++E +FS L L++IGL VFNY F S+T +SPVI A+             
Sbjct: 228 EGRDGQKSVELDLEAEFSNLALEIIGLGVFNYDFGSVTNESPVIKAVYGTLFEAEHRSTF 287

Query: 286 -----------KIIPRQIKAEEAVTVIQRTVEDLIAKCK-AIVEMEGERIDEEEYVNDAD 345
                       I+PRQ K ++ + VI   ++ LI   K +  E + E++ + +Y N  D
Sbjct: 288 YIPYWKFPLARWIVPRQRKFQDDLKVINTCLDGLIRNAKESRQETDVEKLQQRDYSNLKD 347

Query: 346 PSILRFLLASR-QEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLS-------KAQTEV 405
            S+LRFL+  R  +V   QLRDDL++ML+AGHETT +VLTW ++LL+       KAQ EV
Sbjct: 348 ASLLRFLVDMRGVDVDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPDKMKKAQAEV 407

Query: 406 DRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLIRRAQVADVLPGNYK-------V 465
           D VL    P++E  K+L+++   ++E++RLYP PP+LIRR+   DVLPG +K       +
Sbjct: 408 DLVLGMGKPTFELLKKLEYIRLIVVETLRLYPQPPLLIRRSLKPDVLPGGHKGDKDGYTI 467

Query: 466 NAGQDIMISVYNIHRSSQVWEQAEEFIPERF-------DLEG------------PVPNES 488
            AG D+ ISVYN+HRS   W++  +F PERF       ++EG              PNE 
Sbjct: 468 PAGTDVFISVYNLHRSPYFWDRPNDFEPERFLVQNNNEEVEGWAGFDPSRSPGALYPNEI 527

BLAST of Cp4.1LG19g03680 vs. NCBI nr
Match: XP_023518813.1 (carotene epsilon-monooxygenase, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 888 bits (2294), Expect = 0.0
Identity = 465/520 (89.42%), Positives = 471/520 (90.58%), Query Frame = 0

Query: 50  DPPSTRGIPQRRRSSTTRPTLQNPVPGPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSD 109
           DPP+   +     +++T  +       PDWLTSLTRYITLGQGDDSGIPIASAKLDDVSD
Sbjct: 42  DPPTPAKL-----NNSTNTSKSGSWVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSD 101

Query: 110 LLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEF 169
           LLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEF
Sbjct: 102 LLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEF 161

Query: 170 LFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLQEDALNNNSVN 229
           LFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLQEDALNNNSVN
Sbjct: 162 LFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLQEDALNNNSVN 221

Query: 230 MEEKFSQLTLDVIGLSVFNYSFDSLTTDSPVIDA--------------------IKALCK 289
           MEEKFSQLTLDVIGLSVFNYSFDSLTTDSPVIDA                    IKALCK
Sbjct: 222 MEEKFSQLTLDVIGLSVFNYSFDSLTTDSPVIDAVYTALKEAEARSTDILPYWKIKALCK 281

Query: 290 IIPRQIKAEEAVTVIQRTVEDLIAKCKAIVEMEGERIDEEEYVNDADPSILRFLLASRQE 349
           IIPRQIKAEEAVTVIQRTVEDLIAKCKAIVEMEGERIDEEEYVNDADPSILRFLLASRQE
Sbjct: 282 IIPRQIKAEEAVTVIQRTVEDLIAKCKAIVEMEGERIDEEEYVNDADPSILRFLLASRQE 341

Query: 350 VSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSK-------AQTEVDRVLQGRPPSYEDT 409
           VSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSK       AQTEVDRVLQGRPPSYEDT
Sbjct: 342 VSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKDSSALNKAQTEVDRVLQGRPPSYEDT 401

Query: 410 KELKFLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWE 469
           KELKFLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWE
Sbjct: 402 KELKFLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWE 461

Query: 470 QAEEFIPERFDLEGPVPNESNTDF-----SGGPRKCVGDQFALLEAIVALAIFLQHLNFE 529
           QAEEFIPERFDLEGPVPNESNTDF     SGGPRKCVGDQFALLEAIVALAIFLQHLNFE
Sbjct: 462 QAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFALLEAIVALAIFLQHLNFE 521

Query: 530 LVPDQTIGMTTGATIHTTNGLYMKLSQKQMTPGLASPASR 537
           LVPDQTIGMTTGATIHTTNGLYMKLSQKQMTPGLASPASR
Sbjct: 522 LVPDQTIGMTTGATIHTTNGLYMKLSQKQMTPGLASPASR 556

BLAST of Cp4.1LG19g03680 vs. NCBI nr
Match: XP_022962840.1 (carotene epsilon-monooxygenase, chloroplastic [Cucurbita moschata])

HSP 1 Score: 884 bits (2284), Expect = 0.0
Identity = 462/520 (88.85%), Positives = 471/520 (90.58%), Query Frame = 0

Query: 50  DPPSTRGIPQRRRSSTTRPTLQNPVPGPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSD 109
           DPP+   +     +++T  +       PDWLTSLTRYITLGQGDDSGIPIASAKLDDVSD
Sbjct: 42  DPPTPAKL-----NNSTNTSKSGSWVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSD 101

Query: 110 LLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEF 169
           LLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEF
Sbjct: 102 LLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEF 161

Query: 170 LFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLQEDALNNNSVN 229
           LFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLQ+DALNNNSVN
Sbjct: 162 LFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLQKDALNNNSVN 221

Query: 230 MEEKFSQLTLDVIGLSVFNYSFDSLTTDSPVIDA--------------------IKALCK 289
           MEEKFSQLTLDVIGLSVFNYSFDSLTTDSPVIDA                    IKALCK
Sbjct: 222 MEEKFSQLTLDVIGLSVFNYSFDSLTTDSPVIDAVYTALKEAEARSTDILPYWKIKALCK 281

Query: 290 IIPRQIKAEEAVTVIQRTVEDLIAKCKAIVEMEGERIDEEEYVNDADPSILRFLLASRQE 349
           IIPRQIKAEEAVTVIQRTVEDLIAKCKAIVEMEGERIDEEEYVNDADPSILRFLLASRQE
Sbjct: 282 IIPRQIKAEEAVTVIQRTVEDLIAKCKAIVEMEGERIDEEEYVNDADPSILRFLLASRQE 341

Query: 350 VSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSK-------AQTEVDRVLQGRPPSYEDT 409
           VSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSK       AQTEVDRVLQGRPPSYEDT
Sbjct: 342 VSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKDSSALNKAQTEVDRVLQGRPPSYEDT 401

Query: 410 KELKFLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWE 469
           KELKFLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWE
Sbjct: 402 KELKFLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWE 461

Query: 470 QAEEFIPERFDLEGPVPNESNTDF-----SGGPRKCVGDQFALLEAIVALAIFLQHLNFE 529
           +AEEFIPERFDLEGPVPNESNTDF     SGGPRKCVGDQFALLEAIVALAIFLQH+NFE
Sbjct: 462 RAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFALLEAIVALAIFLQHINFE 521

Query: 530 LVPDQTIGMTTGATIHTTNGLYMKLSQKQMTPGLASPASR 537
           LVPDQTIGMTTGATIHTTNGLYMKLSQKQMTPGLASPASR
Sbjct: 522 LVPDQTIGMTTGATIHTTNGLYMKLSQKQMTPGLASPASR 556

BLAST of Cp4.1LG19g03680 vs. NCBI nr
Match: KAG6595192.1 (Carotene epsilon-monooxygenase, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 882 bits (2279), Expect = 0.0
Identity = 457/493 (92.70%), Positives = 461/493 (93.51%), Query Frame = 0

Query: 77  PDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGP 136
           PDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGP
Sbjct: 64  PDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGP 123

Query: 137 RNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVPSLHK 196
           RNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVPSLHK
Sbjct: 124 RNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVPSLHK 183

Query: 197 KYLSVIVDRVFCKCAMRLVEKLQEDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLTT 256
           KYLSVIVDRVFCKCAMRLVEKLQ+DALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLTT
Sbjct: 184 KYLSVIVDRVFCKCAMRLVEKLQKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLTT 243

Query: 257 DSPVIDA--------------------IKALCKIIPRQIKAEEAVTVIQRTVEDLIAKCK 316
           DSPVIDA                    IKALCKIIPRQIKAEEAVTVIQRTVEDLIAKCK
Sbjct: 244 DSPVIDAVYTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIQRTVEDLIAKCK 303

Query: 317 AIVEMEGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSVLTW 376
           AIVEMEGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSVLTW
Sbjct: 304 AIVEMEGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSVLTW 363

Query: 377 TLYLLSK-------AQTEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLIRRA 436
           TLYLLSK       AQ+EVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLIRRA
Sbjct: 364 TLYLLSKDSSALNKAQSEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLIRRA 423

Query: 437 QVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDF--- 496
           QVADVLPGNYKVNAGQDIMISVYNIHRSSQVWE+AEEFIPERFDLEGPVPNESNTDF   
Sbjct: 424 QVADVLPGNYKVNAGQDIMISVYNIHRSSQVWERAEEFIPERFDLEGPVPNESNTDFRFI 483

Query: 497 --SGGPRKCVGDQFALLEAIVALAIFLQHLNFELVPDQTIGMTTGATIHTTNGLYMKLSQ 537
             SGGPRKCVGDQFALLEAIVALAIFLQH+NFELVPDQTIGMTTGATIHTTNGLYMKLSQ
Sbjct: 484 PFSGGPRKCVGDQFALLEAIVALAIFLQHINFELVPDQTIGMTTGATIHTTNGLYMKLSQ 543

BLAST of Cp4.1LG19g03680 vs. NCBI nr
Match: XP_022972774.1 (carotene epsilon-monooxygenase, chloroplastic [Cucurbita maxima])

HSP 1 Score: 875 bits (2260), Expect = 0.0
Identity = 457/520 (87.88%), Positives = 469/520 (90.19%), Query Frame = 0

Query: 50  DPPSTRGIPQRRRSSTTRPTLQNPVPGPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSD 109
           DPP+   +     +++T  +       PDWLTSLTRYITLGQGDDSGIPIASAKLDDVSD
Sbjct: 42  DPPTPAKL-----NNSTNTSKSGSWVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSD 101

Query: 110 LLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEF 169
           LLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEF
Sbjct: 102 LLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEF 161

Query: 170 LFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLQEDALNNNSVN 229
           LFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLQEDALNNNSVN
Sbjct: 162 LFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLQEDALNNNSVN 221

Query: 230 MEEKFSQLTLDVIGLSVFNYSFDSLTTDSPVIDA--------------------IKALCK 289
           MEEKFSQLTLDVIGLSVFNYSFDSLTTDSPVIDA                    IKALCK
Sbjct: 222 MEEKFSQLTLDVIGLSVFNYSFDSLTTDSPVIDAVYTALKEAEARSTDILPYWKIKALCK 281

Query: 290 IIPRQIKAEEAVTVIQRTVEDLIAKCKAIVEMEGERIDEEEYVNDADPSILRFLLASRQE 349
           IIPRQIKAEEAVTVI+RTVEDLIAKCKAIVE EGERIDEEEYVNDADPSILRFLLASRQE
Sbjct: 282 IIPRQIKAEEAVTVIRRTVEDLIAKCKAIVETEGERIDEEEYVNDADPSILRFLLASRQE 341

Query: 350 VSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSK-------AQTEVDRVLQGRPPSYEDT 409
           VSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSK       AQTEVDRVLQGRPPSY+DT
Sbjct: 342 VSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKDSSALMKAQTEVDRVLQGRPPSYKDT 401

Query: 410 KELKFLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWE 469
           KELKFLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWE
Sbjct: 402 KELKFLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWE 461

Query: 470 QAEEFIPERFDLEGPVPNESNTDF-----SGGPRKCVGDQFALLEAIVALAIFLQHLNFE 529
           +AEEFIPERFDL+GPVPNESNTDF     SGGPRKCVGDQFALLEAIVALAIFLQH+NFE
Sbjct: 462 RAEEFIPERFDLDGPVPNESNTDFRFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFE 521

Query: 530 LVPDQTIGMTTGATIHTTNGLYMKLSQKQMTPGLASPASR 537
           LVPDQTIGMTTGATIHTTNGLYMKLSQKQMTP LA+PASR
Sbjct: 522 LVPDQTIGMTTGATIHTTNGLYMKLSQKQMTPRLATPASR 556

BLAST of Cp4.1LG19g03680 vs. NCBI nr
Match: XP_004143287.1 (carotene epsilon-monooxygenase, chloroplastic [Cucumis sativus] >KGN48212.1 hypothetical protein Csa_003812 [Cucumis sativus])

HSP 1 Score: 855 bits (2210), Expect = 4.38e-309
Identity = 445/536 (83.02%), Positives = 471/536 (87.87%), Query Frame = 0

Query: 48  PLDPPST-------RGIPQRRRSSTTRPTLQNPVPGP--------DWLTSLTRYITLGQG 107
           PL PP+T       +     RR+S+T P ++NP   P        DWLTSLTRYITLGQG
Sbjct: 22  PLTPPTTPFPSLSIKSSIDERRNSSTPPKIKNPTNAPKSRSWVSPDWLTSLTRYITLGQG 81

Query: 108 DDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVL 167
           DDSGIP+A+AKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGPRNFV+VSDP IAKHVL
Sbjct: 82  DDSGIPVATAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGPRNFVIVSDPTIAKHVL 141

Query: 168 RNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVFCKCAM 227
           RNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVFCKCAM
Sbjct: 142 RNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVFCKCAM 201

Query: 228 RLVEKLQEDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLTTDSPVIDA--------- 287
           RLVEKL++DALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSL+TDSPVIDA         
Sbjct: 202 RLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSTDSPVIDAVYTALKEAE 261

Query: 288 -----------IKALCKIIPRQIKAEEAVTVIQRTVEDLIAKCKAIVEMEGERIDEEEYV 347
                      IKALCKIIPRQIKAEEAVTVI++TVE+LIAKCK IVE EGERI+EEEYV
Sbjct: 262 ARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRKTVEELIAKCKEIVEAEGERINEEEYV 321

Query: 348 NDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSK-------AQ 407
           NDADPSILRFLLASR+EVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSK       AQ
Sbjct: 322 NDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQ 381

Query: 408 TEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQ 467
            EVDRVLQGRPPSYEDTKELK+LTRCILESMRLYPHPPVLIRRAQVAD+LPG+YKVNAGQ
Sbjct: 382 NEVDRVLQGRPPSYEDTKELKYLTRCILESMRLYPHPPVLIRRAQVADILPGDYKVNAGQ 441

Query: 468 DIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDF-----SGGPRKCVGDQFAL 527
           DIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDF     SGGPRKCVGDQFAL
Sbjct: 442 DIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFAL 501

Query: 528 LEAIVALAIFLQHLNFELVPDQTIGMTTGATIHTTNGLYMKLSQKQMTPGLASPAS 536
           LEAIVALAIFLQH+NFELVP+QTIGMTTGATIHTTNGLYMKLSQ+++TP L S A+
Sbjct: 502 LEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTNGLYMKLSQRKLTPELVSSAT 557

BLAST of Cp4.1LG19g03680 vs. ExPASy TrEMBL
Match: A0A6J1HG80 (carotene epsilon-monooxygenase, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111463211 PE=3 SV=1)

HSP 1 Score: 884 bits (2284), Expect = 0.0
Identity = 462/520 (88.85%), Positives = 471/520 (90.58%), Query Frame = 0

Query: 50  DPPSTRGIPQRRRSSTTRPTLQNPVPGPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSD 109
           DPP+   +     +++T  +       PDWLTSLTRYITLGQGDDSGIPIASAKLDDVSD
Sbjct: 42  DPPTPAKL-----NNSTNTSKSGSWVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSD 101

Query: 110 LLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEF 169
           LLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEF
Sbjct: 102 LLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEF 161

Query: 170 LFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLQEDALNNNSVN 229
           LFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLQ+DALNNNSVN
Sbjct: 162 LFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLQKDALNNNSVN 221

Query: 230 MEEKFSQLTLDVIGLSVFNYSFDSLTTDSPVIDA--------------------IKALCK 289
           MEEKFSQLTLDVIGLSVFNYSFDSLTTDSPVIDA                    IKALCK
Sbjct: 222 MEEKFSQLTLDVIGLSVFNYSFDSLTTDSPVIDAVYTALKEAEARSTDILPYWKIKALCK 281

Query: 290 IIPRQIKAEEAVTVIQRTVEDLIAKCKAIVEMEGERIDEEEYVNDADPSILRFLLASRQE 349
           IIPRQIKAEEAVTVIQRTVEDLIAKCKAIVEMEGERIDEEEYVNDADPSILRFLLASRQE
Sbjct: 282 IIPRQIKAEEAVTVIQRTVEDLIAKCKAIVEMEGERIDEEEYVNDADPSILRFLLASRQE 341

Query: 350 VSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSK-------AQTEVDRVLQGRPPSYEDT 409
           VSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSK       AQTEVDRVLQGRPPSYEDT
Sbjct: 342 VSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKDSSALNKAQTEVDRVLQGRPPSYEDT 401

Query: 410 KELKFLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWE 469
           KELKFLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWE
Sbjct: 402 KELKFLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWE 461

Query: 470 QAEEFIPERFDLEGPVPNESNTDF-----SGGPRKCVGDQFALLEAIVALAIFLQHLNFE 529
           +AEEFIPERFDLEGPVPNESNTDF     SGGPRKCVGDQFALLEAIVALAIFLQH+NFE
Sbjct: 462 RAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFALLEAIVALAIFLQHINFE 521

Query: 530 LVPDQTIGMTTGATIHTTNGLYMKLSQKQMTPGLASPASR 537
           LVPDQTIGMTTGATIHTTNGLYMKLSQKQMTPGLASPASR
Sbjct: 522 LVPDQTIGMTTGATIHTTNGLYMKLSQKQMTPGLASPASR 556

BLAST of Cp4.1LG19g03680 vs. ExPASy TrEMBL
Match: A0A6J1ICJ1 (carotene epsilon-monooxygenase, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111471281 PE=3 SV=1)

HSP 1 Score: 875 bits (2260), Expect = 0.0
Identity = 457/520 (87.88%), Positives = 469/520 (90.19%), Query Frame = 0

Query: 50  DPPSTRGIPQRRRSSTTRPTLQNPVPGPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSD 109
           DPP+   +     +++T  +       PDWLTSLTRYITLGQGDDSGIPIASAKLDDVSD
Sbjct: 42  DPPTPAKL-----NNSTNTSKSGSWVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSD 101

Query: 110 LLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEF 169
           LLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEF
Sbjct: 102 LLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEF 161

Query: 170 LFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLQEDALNNNSVN 229
           LFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLQEDALNNNSVN
Sbjct: 162 LFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLQEDALNNNSVN 221

Query: 230 MEEKFSQLTLDVIGLSVFNYSFDSLTTDSPVIDA--------------------IKALCK 289
           MEEKFSQLTLDVIGLSVFNYSFDSLTTDSPVIDA                    IKALCK
Sbjct: 222 MEEKFSQLTLDVIGLSVFNYSFDSLTTDSPVIDAVYTALKEAEARSTDILPYWKIKALCK 281

Query: 290 IIPRQIKAEEAVTVIQRTVEDLIAKCKAIVEMEGERIDEEEYVNDADPSILRFLLASRQE 349
           IIPRQIKAEEAVTVI+RTVEDLIAKCKAIVE EGERIDEEEYVNDADPSILRFLLASRQE
Sbjct: 282 IIPRQIKAEEAVTVIRRTVEDLIAKCKAIVETEGERIDEEEYVNDADPSILRFLLASRQE 341

Query: 350 VSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSK-------AQTEVDRVLQGRPPSYEDT 409
           VSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSK       AQTEVDRVLQGRPPSY+DT
Sbjct: 342 VSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKDSSALMKAQTEVDRVLQGRPPSYKDT 401

Query: 410 KELKFLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWE 469
           KELKFLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWE
Sbjct: 402 KELKFLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWE 461

Query: 470 QAEEFIPERFDLEGPVPNESNTDF-----SGGPRKCVGDQFALLEAIVALAIFLQHLNFE 529
           +AEEFIPERFDL+GPVPNESNTDF     SGGPRKCVGDQFALLEAIVALAIFLQH+NFE
Sbjct: 462 RAEEFIPERFDLDGPVPNESNTDFRFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFE 521

Query: 530 LVPDQTIGMTTGATIHTTNGLYMKLSQKQMTPGLASPASR 537
           LVPDQTIGMTTGATIHTTNGLYMKLSQKQMTP LA+PASR
Sbjct: 522 LVPDQTIGMTTGATIHTTNGLYMKLSQKQMTPRLATPASR 556

BLAST of Cp4.1LG19g03680 vs. ExPASy TrEMBL
Match: A0A0A0KIH3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G448700 PE=3 SV=1)

HSP 1 Score: 855 bits (2210), Expect = 2.12e-309
Identity = 445/536 (83.02%), Positives = 471/536 (87.87%), Query Frame = 0

Query: 48  PLDPPST-------RGIPQRRRSSTTRPTLQNPVPGP--------DWLTSLTRYITLGQG 107
           PL PP+T       +     RR+S+T P ++NP   P        DWLTSLTRYITLGQG
Sbjct: 22  PLTPPTTPFPSLSIKSSIDERRNSSTPPKIKNPTNAPKSRSWVSPDWLTSLTRYITLGQG 81

Query: 108 DDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVL 167
           DDSGIP+A+AKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGPRNFV+VSDP IAKHVL
Sbjct: 82  DDSGIPVATAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGPRNFVIVSDPTIAKHVL 141

Query: 168 RNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVFCKCAM 227
           RNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVFCKCAM
Sbjct: 142 RNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVFCKCAM 201

Query: 228 RLVEKLQEDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLTTDSPVIDA--------- 287
           RLVEKL++DALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSL+TDSPVIDA         
Sbjct: 202 RLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSTDSPVIDAVYTALKEAE 261

Query: 288 -----------IKALCKIIPRQIKAEEAVTVIQRTVEDLIAKCKAIVEMEGERIDEEEYV 347
                      IKALCKIIPRQIKAEEAVTVI++TVE+LIAKCK IVE EGERI+EEEYV
Sbjct: 262 ARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRKTVEELIAKCKEIVEAEGERINEEEYV 321

Query: 348 NDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSK-------AQ 407
           NDADPSILRFLLASR+EVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSK       AQ
Sbjct: 322 NDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQ 381

Query: 408 TEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQ 467
            EVDRVLQGRPPSYEDTKELK+LTRCILESMRLYPHPPVLIRRAQVAD+LPG+YKVNAGQ
Sbjct: 382 NEVDRVLQGRPPSYEDTKELKYLTRCILESMRLYPHPPVLIRRAQVADILPGDYKVNAGQ 441

Query: 468 DIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDF-----SGGPRKCVGDQFAL 527
           DIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDF     SGGPRKCVGDQFAL
Sbjct: 442 DIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFAL 501

Query: 528 LEAIVALAIFLQHLNFELVPDQTIGMTTGATIHTTNGLYMKLSQKQMTPGLASPAS 536
           LEAIVALAIFLQH+NFELVP+QTIGMTTGATIHTTNGLYMKLSQ+++TP L S A+
Sbjct: 502 LEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTNGLYMKLSQRKLTPELVSSAT 557

BLAST of Cp4.1LG19g03680 vs. ExPASy TrEMBL
Match: A0A1S3CIN0 (carotene epsilon-monooxygenase, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103500849 PE=3 SV=1)

HSP 1 Score: 854 bits (2206), Expect = 8.31e-309
Identity = 451/547 (82.45%), Positives = 471/547 (86.11%), Query Frame = 0

Query: 30  FPSINESLSGEEPNFDFSPLDPPSTRGIPQRRRSSTTRPTLQNPVPGP--------DWLT 89
           FPS   SL    P    +P   PS +     R + +T P L+NP   P        DWLT
Sbjct: 12  FPS--SSLHKRIPLTPTTPFPYPSIKSSLDERGNPSTPPKLKNPTNAPKSRSWVSPDWLT 71

Query: 90  SLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGPRNFVV 149
           SLTR ITLGQGDDSGIPIA+AKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGPRNFV+
Sbjct: 72  SLTRSITLGQGDDSGIPIATAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGPRNFVI 131

Query: 150 VSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSV 209
           VSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSV
Sbjct: 132 VSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSV 191

Query: 210 IVDRVFCKCAMRLVEKLQEDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLTTDSPVI 269
           IVDRVFCKCAMRLVEKL++DALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSL+ DSPVI
Sbjct: 192 IVDRVFCKCAMRLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSPVI 251

Query: 270 DA--------------------IKALCKIIPRQIKAEEAVTVIQRTVEDLIAKCKAIVEM 329
           DA                    IKALCKIIPRQIKAEEAVTVI+RTVE+LIAKCK IVE 
Sbjct: 252 DAVYTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEELIAKCKEIVEA 311

Query: 330 EGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLL 389
           EGERI+EEEYVNDADPSILRFLLASR+EVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLL
Sbjct: 312 EGERINEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLL 371

Query: 390 SK-------AQTEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLIRRAQVADV 449
           SK       AQ EVDRVLQGRPPSYEDTKELK+LTRCILESMRLYPHPPVLIRRAQVAD 
Sbjct: 372 SKHSSSLVKAQNEVDRVLQGRPPSYEDTKELKYLTRCILESMRLYPHPPVLIRRAQVADT 431

Query: 450 LPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDF-----SGG 509
           LPGNYKVNAGQDIMISVYNIHRS QVWEQAEEFIPERFDLEGPVPNESNTDF     SGG
Sbjct: 432 LPGNYKVNAGQDIMISVYNIHRSPQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGG 491

Query: 510 PRKCVGDQFALLEAIVALAIFLQHLNFELVPDQTIGMTTGATIHTTNGLYMKLSQKQMTP 536
           PRKCVGDQFALLEAIVALAIFLQHLNFELVP+QTIGMTTGATIHTTNGLYMKLSQ+++TP
Sbjct: 492 PRKCVGDQFALLEAIVALAIFLQHLNFELVPNQTIGMTTGATIHTTNGLYMKLSQRKLTP 551

BLAST of Cp4.1LG19g03680 vs. ExPASy TrEMBL
Match: A0A6J1CRM7 (carotene epsilon-monooxygenase, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111013687 PE=3 SV=1)

HSP 1 Score: 853 bits (2205), Expect = 1.14e-308
Identity = 440/492 (89.43%), Positives = 452/492 (91.87%), Query Frame = 0

Query: 77  PDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGP 136
           PDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGP
Sbjct: 64  PDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGP 123

Query: 137 RNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVPSLHK 196
           RNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVPSLHK
Sbjct: 124 RNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVPSLHK 183

Query: 197 KYLSVIVDRVFCKCAMRLVEKLQEDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLTT 256
           KYLSVIVDRVFCKCAMRLVEKLQ+DALNNNSVNMEEKFSQLTLD+IGLSVFNYSFDSL+ 
Sbjct: 184 KYLSVIVDRVFCKCAMRLVEKLQKDALNNNSVNMEEKFSQLTLDIIGLSVFNYSFDSLSA 243

Query: 257 DSPVIDA--------------------IKALCKIIPRQIKAEEAVTVIQRTVEDLIAKCK 316
           DSPVIDA                    IKALCKIIPRQIKAEEAVTVI+RTVE+LIAKCK
Sbjct: 244 DSPVIDAVYTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEELIAKCK 303

Query: 317 AIVEMEGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSVLTW 376
            IVE EGERIDEEEYVND DPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSVLTW
Sbjct: 304 EIVETEGERIDEEEYVNDTDPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSVLTW 363

Query: 377 TLYLLSK-------AQTEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLIRRA 436
           TLYLLSK       AQ EVDRVLQGRPPSYEDTKELKFL RCILESMRLYPHPPVLIRRA
Sbjct: 364 TLYLLSKDSSSLIKAQNEVDRVLQGRPPSYEDTKELKFLMRCILESMRLYPHPPVLIRRA 423

Query: 437 QVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDF--- 496
           +VAD+LPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDF   
Sbjct: 424 RVADILPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFI 483

Query: 497 --SGGPRKCVGDQFALLEAIVALAIFLQHLNFELVPDQTIGMTTGATIHTTNGLYMKLSQ 536
             SGGPRKCVGDQFALLEA+VALAIFLQH+NFELVP+QTIGMTTGATIHTTNGLYMKLSQ
Sbjct: 484 PFSGGPRKCVGDQFALLEAVVALAIFLQHMNFELVPNQTIGMTTGATIHTTNGLYMKLSQ 543

BLAST of Cp4.1LG19g03680 vs. TAIR 10
Match: AT3G53130.1 (Cytochrome P450 superfamily protein )

HSP 1 Score: 746.9 bits (1927), Expect = 1.1e-215
Identity = 388/539 (71.99%), Positives = 440/539 (81.63%), Query Frame = 0

Query: 19  FSVIPHSFLRHFPSINESLSGEEPNFDFSPLDPPSTRGIPQRRRSSTTRPTLQNPVPGPD 78
           FS    S+   F +    L   +P F FS         I + +    T  +       PD
Sbjct: 6   FSPSSSSYSSLFTAKPTRLLSPKPKFTFS-----IRSSIEKPKPKLETNSSKSQSWVSPD 65

Query: 79  WLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGPRN 138
           WLT+LTR ++ G+ D+SGIPIA+AKLDDV+DLLGGALFLPL+KWMN+YGPIYRLAAGPRN
Sbjct: 66  WLTTLTRTLSSGKNDESGIPIANAKLDDVADLLGGALFLPLYKWMNEYGPIYRLAAGPRN 125

Query: 139 FVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVPSLHKKY 198
           FV+VSDPAIAKHVLRNY  YAKGLV+EVSEFLFGSGFAIAEGPLWT RRRAVVPSLH++Y
Sbjct: 126 FVIVSDPAIAKHVLRNYPKYAKGLVAEVSEFLFGSGFAIAEGPLWTARRRAVVPSLHRRY 185

Query: 199 LSVIVDRVFCKCAMRLVEKLQEDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLTTDS 258
           LSVIV+RVFCKCA RLVEKLQ  A + ++VNME KFSQ+TLDVIGLS+FNY+FDSLTTDS
Sbjct: 186 LSVIVERVFCKCAERLVEKLQPYAEDGSAVNMEAKFSQMTLDVIGLSLFNYNFDSLTTDS 245

Query: 259 PVIDA--------------------IKALCKIIPRQIKAEEAVTVIQRTVEDLIAKCKAI 318
           PVI+A                    I ALCKI+PRQ+KAE+AVT+I+ TVEDLIAKCK I
Sbjct: 246 PVIEAVYTALKEAELRSTDLLPYWKIDALCKIVPRQVKAEKAVTLIRETVEDLIAKCKEI 305

Query: 319 VEMEGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSVLTWTL 378
           VE EGERI++EEYVNDADPSILRFLLASR+EVSSVQLRDDLLSMLVAGHETTGSVLTWTL
Sbjct: 306 VEREGERINDEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTL 365

Query: 379 YLLS-------KAQTEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLIRRAQV 438
           YLLS       KAQ EVDRVL+GR P++ED KELK++TRCI ESMRLYPHPPVLIRRAQV
Sbjct: 366 YLLSKNSSALRKAQEEVDRVLEGRNPAFEDIKELKYITRCINESMRLYPHPPVLIRRAQV 425

Query: 439 ADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTD-----F 498
            D+LPGNYKVN GQDIMISVYNIHRSS+VWE+AEEF+PERFD++G +PNE+NTD     F
Sbjct: 426 PDILPGNYKVNTGQDIMISVYNIHRSSEVWEKAEEFLPERFDIDGAIPNETNTDFKFIPF 485

Query: 499 SGGPRKCVGDQFALLEAIVALAIFLQHLNFELVPDQTIGMTTGATIHTTNGLYMKLSQK 526
           SGGPRKCVGDQFAL+EAIVALA+FLQ LN ELVPDQTI MTTGATIHTTNGLYMK+SQ+
Sbjct: 486 SGGPRKCVGDQFALMEAIVALAVFLQRLNVELVPDQTISMTTGATIHTTNGLYMKVSQR 539

BLAST of Cp4.1LG19g03680 vs. TAIR 10
Match: AT1G31800.1 (cytochrome P450, family 97, subfamily A, polypeptide 3 )

HSP 1 Score: 401.4 bits (1030), Expect = 1.2e-111
Identity = 217/468 (46.37%), Positives = 306/468 (65.38%), Query Frame = 0

Query: 92  GDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHV 151
           G D   P        +  +   A F+PL++    YG I+RL  GP++F++VSDP+IAKH+
Sbjct: 105 GSDQDYPKVPEAKGSIQAVRNEAFFIPLYELFLTYGGIFRLTFGPKSFLIVSDPSIAKHI 164

Query: 152 LR-NYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVFCKC 211
           L+ N   Y+KG+++E+ +F+ G G   A+G +W  RRRA+VP+LH+KY++ ++  +F + 
Sbjct: 165 LKDNAKAYSKGILAEILDFVMGKGLIPADGEIWRRRRRAIVPALHQKYVAAMIS-LFGEA 224

Query: 212 AMRLVEKLQEDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLTTDSPVIDAIKALCK- 271
           + RL +KL   AL    V ME  FS+LTLD+IG +VFNY FDSLT D+ VI+A+  + + 
Sbjct: 225 SDRLCQKLDAAALKGEEVEMESLFSRLTLDIIGKAVFNYDFDSLTNDTGVIEAVYTVLRE 284

Query: 272 -------------------IIPRQIKAEEAVTVIQRTVEDLIAKCKAIVEMEGERIDEEE 331
                              I PRQ K   ++ +I  T++DLIA CK +VE E E    EE
Sbjct: 285 AEDRSVSPIPVWDIPIWKDISPRQRKVATSLKLINDTLDDLIATCKRMVE-EEELQFHEE 344

Query: 332 YVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLL-------SK 391
           Y+N+ DPSIL FLLAS  +VSS QLRDDL++ML+AGHET+ +VLTWT YLL       +K
Sbjct: 345 YMNERDPSILHFLLASGDDVSSKQLRDDLMTMLIAGHETSAAVLTWTFYLLTTEPSVVAK 404

Query: 392 AQTEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNA 451
            Q EVD V+  R P+ +D K+LK+ TR + ES+RLYP PPVLIRR+   D+L G Y +  
Sbjct: 405 LQEEVDSVIGDRFPTIQDMKKLKYTTRVMNESLRLYPQPPVLIRRSIDNDIL-GEYPIKR 464

Query: 452 GQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFS-----GGPRKCVGDQF 511
           G+DI ISV+N+HRS   W+ AE+F PER+ L+GP PNE+N +FS     GGPRKC+GD F
Sbjct: 465 GEDIFISVWNLHRSPLHWDDAEKFNPERWPLDGPNPNETNQNFSYLPFGGGPRKCIGDMF 524

Query: 512 ALLEAIVALAIFLQHLNFELVPD-QTIGMTTGATIHTTNGLYMKLSQK 526
           A  E +VA+A+ ++  NF++ P    + MTTGATIHTT GL + ++++
Sbjct: 525 ASFENVVAIAMLIRRFNFQIAPGAPPVKMTTGATIHTTEGLKLTVTKR 569

BLAST of Cp4.1LG19g03680 vs. TAIR 10
Match: AT4G15110.1 (cytochrome P450, family 97, subfamily B, polypeptide 3 )

HSP 1 Score: 333.6 bits (854), Expect = 3.0e-91
Identity = 210/547 (38.39%), Positives = 303/547 (55.39%), Query Frame = 0

Query: 51  PPSTRGIPQRRRSSTTRPTLQNPVPGPDWLTSLTRYIT--LGQGDDSGIPIASAKLDDVS 110
           P +   +  RR S + +     P    + L + +  +T  L  G    +P A      VS
Sbjct: 36  PQTISSVNSRRASVSIKCQSTEPKTNGNILDNASNLLTNFLSGGSLGSMPTAEG---SVS 95

Query: 111 DLLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLR-NYGTYAKGLVSEVS 170
           DL G  LFL L+ W  ++G IY+LA GP+ FVV+SDP IA+HVLR N  +Y KG+++E+ 
Sbjct: 96  DLFGKPLFLSLYDWFLEHGGIYKLAFGPKAFVVISDPIIARHVLRENAFSYDKGVLAEIL 155

Query: 171 EFLFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLV--------EKLQ 230
           E + G G   A+   W +RRRA+ P+ HK YL  +V +VF  C+ +++        EK  
Sbjct: 156 EPIMGKGLIPADLDTWKLRRRAITPAFHKLYLEAMV-KVFSDCSEKMILKSEKLIREKET 215

Query: 231 EDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLTTDSPVIDAIKALC----------- 290
               +   +++E +FS L LD+IGLSVFNY F S+T +SPVI A+               
Sbjct: 216 SSGEDTIELDLEAEFSSLALDIIGLSVFNYDFGSVTKESPVIKAVYGTLFEAEHRSTFYF 275

Query: 291 ---------KIIPRQIKAEEAVTVIQRTVEDLIAKCKAI-VEMEGERIDEEEYVNDADPS 350
                     I+PRQ K +  + +I   ++ LI   K    E + E++ E +Y N  D S
Sbjct: 276 PYWNFPPARWIVPRQRKFQSDLKIINDCLDGLIQNAKETRQETDVEKLQERDYTNLKDAS 335

Query: 351 ILRFLLASR-QEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLS-------KAQTEVDR 410
           +LRFL+  R  ++   QLRDDL++ML+AGHETT +VLTW ++LLS       KAQ E+D 
Sbjct: 336 LLRFLVDMRGVDIDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLSQNPEKIRKAQAEIDA 395

Query: 411 VLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLIRRAQVADVLPG-------NYKVNA 470
           VL   PP+YE  K+L+++   ++E +RL+P PP+LIRR    + LPG        +KV  
Sbjct: 396 VLGQGPPTYESMKKLEYIRLIVVEVLRLFPQPPLLIRRTLKPETLPGGHKGEKEGHKVPK 455

Query: 471 GQDIMISVYNIHRSSQVWEQAEEFIPERF-------DLEG------------PVPNESNT 526
           G DI ISVYN+HRS   W+   +F PERF        +EG              PNE   
Sbjct: 456 GTDIFISVYNLHRSPYFWDNPHDFEPERFLRTKESNGIEGWAGFDPSRSPGALYPNEIIA 515

BLAST of Cp4.1LG19g03680 vs. TAIR 10
Match: AT2G44890.1 (cytochrome P450, family 704, subfamily A, polypeptide 1 )

HSP 1 Score: 132.5 bits (332), Expect = 1.0e-30
Identity = 117/442 (26.47%), Positives = 206/442 (46.61%), Query Frame = 0

Query: 128 PIYRLAAGPRNFVVVSDPAIAKHVLR-NYGTYAKGLVSEVS-EFLFGSGFAIAEGPLWTV 187
           P +R  +  ++ +  +DP   +H+L+  +  Y+KG V  V+   L G G    +G  W  
Sbjct: 63  PTFRFLSPGQSEIFTADPRNVEHILKTRFHNYSKGPVGTVNLADLLGHGIFAVDGEKWKQ 122

Query: 188 RRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLQEDALNNNSVNMEEKFSQLTLDVI--- 247
           +R+ V      + L      VF   A +LV  + E AL+  S + ++   + TLD I   
Sbjct: 123 QRKLVSFEFSTRVLRNFSYSVFRTSASKLVGFIAEFALSGKSFDFQDMLMKCTLDSIFKV 182

Query: 248 GLSV--------------FNYSFD--SLTTDSPVIDAI-KALCKI-IPRQIKAEEAVTVI 307
           G  V              F  +FD  +  T S V D   K  C + I  + + ++++ +I
Sbjct: 183 GFGVELGCLDGFSKEGEEFMKAFDEGNGATSSRVTDPFWKLKCFLNIGSESRLKKSIAII 242

Query: 308 QRTVEDLIAKCKAIVEMEGERIDEEEYVNDADPSILRFLLASRQEVSSVQ---LRDDLLS 367
            + V  LI         + + + +E+  +  +  + +FLL S ++  ++    LRD +L+
Sbjct: 243 DKFVYSLIT-------TKRKELSKEQNTSVREDILSKFLLESEKDPENMNDKYLRDIILN 302

Query: 368 MLVAGHETTGSVLTWTLYLLSK---AQTEVDRVLQGRPPSYEDT---------------K 427
           ++VAG +TT + L+W LY+L K    Q ++ + ++    S+E T                
Sbjct: 303 VMVAGKDTTAASLSWFLYMLCKNPLVQEKIVQEIRDVTSSHEKTTDVNGFIESVTEEALA 362

Query: 428 ELKFLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQ 487
           ++++L   + E+MRLYP  P  +R A+  DVLP  ++V+ G +I    Y + R + +W Q
Sbjct: 363 QMQYLHAALSETMRLYPPVPEHMRCAENDDVLPDGHRVSKGDNIYYISYAMGRMTYIWGQ 422

Query: 488 -AEEFIPERFDLEGPVPNESN---TDFSGGPRKCVGDQFALLEAIVALAIFLQHLNFELV 521
            AEEF PER+  +G    ES      F  GPR C+G  FA  +  +     L    F++ 
Sbjct: 423 DAEEFKPERWLKDGVFQPESQFKFISFHAGPRICIGKDFAYRQMKIVSMALLHFFRFKMA 482

BLAST of Cp4.1LG19g03680 vs. TAIR 10
Match: AT1G67110.1 (cytochrome P450, family 735, subfamily A, polypeptide 2 )

HSP 1 Score: 131.7 bits (330), Expect = 1.7e-30
Identity = 104/402 (25.87%), Positives = 177/402 (44.03%), Query Frame = 0

Query: 122 WMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYG--TYAKGLVSEVSEFLFGSGFAIAE 181
           W   YG  + +  G    + +++  + K +L  +   T    L  + ++   G G  +A 
Sbjct: 90  WSKQYGKRFIMWNGTEPRLCLTETEMIKELLTKHNPVTGKSWLQQQGTKGFIGRGLLMAN 149

Query: 182 GPLWTVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLQEDALNNNSVNMEEKFSQLTL 241
           G  W  +R    P+  +  L      +  +C   + E+L+++      V + E+  +LT 
Sbjct: 150 GEAWHHQRHMAAPAFTRDRLKGYAKHM-VECTKMMAERLRKEV--GEEVEIGEEMRRLTA 209

Query: 242 DVIGLSVFNYSFDSLTTDSPVIDAIKALC------------KIIPRQIKAE--EAVTVIQ 301
           D+I  + F  S D       ++  ++ LC            + +P +   E     T ++
Sbjct: 210 DIISRTEFGSSCDKGKELFSLLTVLQRLCAQATRHLCFPGSRFLPSKYNREIKSLKTEVE 269

Query: 302 RTVEDLIAKCKAIVEMEGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVA 361
           R + ++I   K  VE+         Y +D    +L  + +++  ++   + D+  +    
Sbjct: 270 RLLMEIIDSRKDSVEIG----RSSSYGDDLLGLLLNQMDSNKNNLNVQMIMDECKTFFFT 329

Query: 362 GHETTGSVLTWTLYLLSKAQTEVDRVL--------QGRPPSYEDTKELKFLTRCILESMR 421
           GHETT  +LTWTL LL+   T  D V         Q   PS E    L  L + I ES+R
Sbjct: 330 GHETTSLLLTWTLMLLAHNPTWQDNVRDEVRQVCGQDGVPSVEQLSSLTSLNKVINESLR 389

Query: 422 LYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVW-EQAEEFIPERFDLEG 481
           LYP P  L+ R    D+  G+  +  G  I I V  IH S+++W E A EF PERF    
Sbjct: 390 LYP-PATLLPRMAFEDIKLGDLIIPKGLSIWIPVLAIHHSNELWGEDANEFNPERFTTRS 449

Query: 482 PVPNESNTDFSGGPRKCVGDQFALLEAIVALAIFLQHLNFEL 499
              +     F+ GPR C+G  FA++EA + LA+ +   +F +
Sbjct: 450 FASSRHFMPFAAGPRNCIGQTFAMMEAKIILAMLVSKFSFAI 483
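
The Identity and Positives figures in each HSP header above are ratios of identical (or conservatively substituted) positions to the alignment length, expressed as percentages. As a minimal sketch, assuming the header lines keep the layout shown in this record, the Python below re-derives those percentages from the raw counts. The function name `parse_hsp_summary` and the returned keys are hypothetical and not part of any BLAST tool. The pattern also accepts the `Postives` spelling that some exports use.

```python
import re

# Matches e.g. "Identity = 445/536 (83.02%), Positives = 471/536 (87.87%)".
# "Posi?tives" also tolerates the "Postives" spelling seen in some exports.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_summary(line):
    """Recompute identity and positives percentages from an HSP header line."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = (int(g) for g in match.groups())
    return {
        "identity_pct": round(100 * ident / ident_len, 2),
        "positives_pct": round(100 * pos / pos_len, 2),
        "alignment_length": ident_len,
    }


if __name__ == "__main__":
    # Header of the Cucumis sativus hit, copied from this record.
    header = "Identity = 445/536 (83.02%), Positives = 471/536 (87.87%), Query Frame = 0"
    print(parse_hsp_summary(header))
```

Recomputing from the counts, rather than trusting the printed percentage, is a cheap consistency check when hits from several databases are merged into one table.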

The following BLAST results are available for this feature:
Match Name      E-value    Identity (%)  Description
Q6TBX7          1.6e-214   71.99         Carotene epsilon-monooxygenase, chloroplastic OS=Arabidopsis thaliana OX=3702 GN...
Q93VK5          1.6e-110   46.37         Protein LUTEIN DEFICIENT 5, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CYP...
O48921          3.5e-92    39.92         Cytochrome P450 97B2, chloroplastic OS=Glycine max OX=3847 GN=CYP97B2 PE=2 SV=1
O23365          4.2e-90    38.39         Cytochrome P450 97B3, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CYP97B3 P...
Q43078          3.0e-80    38.43         Cytochrome P450 97B1, chloroplastic OS=Pisum sativum OX=3888 GN=CYP97B1 PE=2 SV=...

Match Name      E-value    Identity (%)  Description
XP_023518813.1  0.0        89.42         carotene epsilon-monooxygenase, chloroplastic [Cucurbita pepo subsp. pepo]
XP_022962840.1  0.0        88.85         carotene epsilon-monooxygenase, chloroplastic [Cucurbita moschata]
KAG6595192.1    0.0        92.70         Carotene epsilon-monooxygenase, chloroplastic, partial [Cucurbita argyrosperma s...
XP_022972774.1  0.0        87.88         carotene epsilon-monooxygenase, chloroplastic [Cucurbita maxima]
XP_004143287.1  4.38e-309  83.02         carotene epsilon-monooxygenase, chloroplastic [Cucumis sativus] >KGN48212.1 hypo...

Match Name      E-value    Identity (%)  Description
A0A6J1HG80      0.0        88.85         carotene epsilon-monooxygenase, chloroplastic OS=Cucurbita moschata OX=3662 GN=L...
A0A6J1ICJ1      0.0        87.88         carotene epsilon-monooxygenase, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC...
A0A0A0KIH3      2.12e-309  83.02         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G448700 PE=3 SV=1
A0A1S3CIN0      8.31e-309  82.45         carotene epsilon-monooxygenase, chloroplastic OS=Cucumis melo OX=3656 GN=LOC1035...
A0A6J1CRM7      1.14e-308  89.43         carotene epsilon-monooxygenase, chloroplastic OS=Momordica charantia OX=3673 GN=...

Match Name      E-value    Identity (%)  Description
AT3G53130.1     1.1e-215   71.99         Cytochrome P450 superfamily protein
AT1G31800.1     1.2e-111   46.37         cytochrome P450, family 97, subfamily A, polypeptide 3
AT4G15110.1     3.0e-91    38.39         cytochrome P450, family 97, subfamily B, polypeptide 3
AT2G44890.1     1.0e-30    26.47         cytochrome P450, family 704, subfamily A, polypeptide 1
AT1G67110.1     1.7e-30    25.87         cytochrome P450, family 735, subfamily A, polypeptide 2
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                    Source       Source Term      Source Description                             Alignment
IPR002401  Cytochrome P450, E-class, group I  PRINTS       PR00463          EP450I                                         coord: 334..351; score: 29.84
IPR002401  Cytochrome P450, E-class, group I  PRINTS       PR00463          EP450I                                         coord: 123..142; score: 32.62
IPR002401  Cytochrome P450, E-class, group I  PRINTS       PR00463          EP450I                                         coord: 463..473; score: 41.47
IPR002401  Cytochrome P450, E-class, group I  PRINTS       PR00463          EP450I                                         coord: 473..496; score: 30.35
IPR002401  Cytochrome P450, E-class, group I  PRINTS       PR00463          EP450I                                         coord: 430..454; score: 27.16
IPR001128  Cytochrome P450                    PRINTS       PR00385          P450                                           coord: 345..362; score: 41.91
IPR001128  Cytochrome P450                    PRINTS       PR00385          P450                                           coord: 473..484; score: 38.58
IPR001128  Cytochrome P450                    PRINTS       PR00385          P450                                           coord: 390..401; score: 35.27
IPR001128  Cytochrome P450                    PRINTS       PR00385          P450                                           coord: 464..473; score: 48.24
IPR001128  Cytochrome P450                    PFAM         PF00067          p450                                           coord: 119..509; e-value: 2.2E-69; score: 234.4
IPR036396  Cytochrome P450 superfamily        GENE3D       1.10.630.10      Cytochrome P450                                coord: 98..532; e-value: 5.6E-94; score: 317.2
IPR036396  Cytochrome P450 superfamily        SUPERFAMILY  48264            Cytochrome P450                                coord: 111..523
None       No IPR available                   MOBIDB_LITE  mobidb-lite      disorder_prediction                            coord: 57..74
None       No IPR available                   MOBIDB_LITE  mobidb-lite      disorder_prediction                            coord: 41..74
None       No IPR available                   PANTHER      PTHR24291:SF134  CAROTENE EPSILON-MONOOXYGENASE, CHLOROPLASTIC  coord: 135..524
None       No IPR available                   PANTHER      PTHR24291        CYTOCHROME P450 FAMILY 4                       coord: 135..524
IPR017972  Cytochrome P450, conserved site    PROSITE      PS00086          CYTOCHROME_P450                                coord: 466..475
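
The alignment coordinates in the InterPro annotations above are residue ranges on the predicted protein. Assuming they are 1-based and inclusive, as is usual for InterProScan output, the length of a match is end minus start plus one. The Python below is a small sketch of that arithmetic; the helper name `span_lengths` is hypothetical and is not part of InterProScan.

```python
import re

# Finds every "coord: START..END" pair in a line of annotation text.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_lengths(text):
    """Return (start, end, length) for each coordinate pair, ends inclusive."""
    spans = []
    for match in COORD_RE.finditer(text):
        start, end = int(match.group(1)), int(match.group(2))
        spans.append((start, end, end - start + 1))
    return spans


if __name__ == "__main__":
    # Pfam PF00067 (p450) match and the PROSITE PS00086 site from this record.
    print(span_lengths("coord: 119..509"))  # [(119, 509, 391)]
    print(span_lengths("coord: 466..475"))  # [(466, 475, 10)]
```

Under that assumption the Pfam p450 match spans 391 residues and the PROSITE conserved site spans 10.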

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG19g03680.1  Cp4.1LG19g03680.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0016117      carotenoid biosynthetic process
molecular_function  GO:0020037      heme binding
molecular_function  GO:0005506      iron ion binding
molecular_function  GO:0009974      zeinoxanthin epsilon hydroxylase activity
molecular_function  GO:0004497      monooxygenase activity
molecular_function  GO:0016705      oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen