ClCG07G010550 (gene) Watermelon (Charleston Gray)

Name: ClCG07G010550
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: 9-cis-epoxycarotenoid dioxygenase
Location: CG_Chr07 : 26364685 .. 26366472 (+)
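
As a quick sanity check on the coordinates above, the span length can be computed from the location string. This is a minimal sketch: the helper name `parse_location` and its regular expression are illustrative assumptions, not part of the database, and the coordinates are assumed to be 1-based and inclusive.

```python
import re

def parse_location(text):
    """Parse 'CHROM : START .. END (STRAND)' into its parts.

    Hypothetical helper; assumes 1-based, inclusive coordinates.
    """
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("CG_Chr07 : 26364685 .. 26366472 (+)")
length = end - start + 1  # inclusive span in base pairs
print(chrom, strand, length, length % 3)
```

Under that assumption the feature spans 1788 bp on the forward strand, which is a multiple of three.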
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTTCTCTTTCTTCTTCTTCATGGATCAAATCTGGTTTTTCCTCTTCAATGCCGGAGAATCTCTCTGTAAATACAAATTATGGTAGAAACAGAGTTGCTACCTCCTGTTCTCTTCATACGCCTTCTATTATTCAAATTCCTAAGCATTCTCACACCGCATTTCCATCTCCTATTAAGACTGCAGTGCCTAAACCCCTTCCAGTTTCGCCTCCCTCCCTGTCCTCCGCATCCGATGACTGGAATTTCTTGCAGCGGGCTGCGGCTATGGCTCTTGATGCGGTTGAAAATGCTCTGGTTTCGGCAGAGCGGAAGCATTCCTTGCCCAAGACCGCCGACCCTGCGGTTCAAATTGCTGGCAATTTTGCCCCGGTGCCGGAGCAGCCGGTTCGGAAGGGCTTACCGGTGATTGGGAAAGTTCCTGACTGTATTCGAGGGGTTTATGTGCGAAACGGGGCCAACCCACTTCACGAACCCGTGTCCGGTCATCATTTGTTCGACGGTGATGGGATGGTTCACGCTGTGGAATTCTCTGAGGGCGGCGGTGTTAGTTACGCTTGCCGGTTCACGGAGACCCAACGGCTGGTTCAGGAGCGAGCTTACGGCCGGCCGGTATTCCCAAAGGCGATCGGCGAACTCCATGGTCACTCTGGGATAGCTCGGCTGATGCTTTTCTATGCCAGAGGATTATTTGGCCTTGTCGATCACAACCATGGAATTGGAGTCGCTAACGCCGGTCTGGTCTTTTTCAACGGCCGGCTTCTTGCCATGTCGGAGGACGATTTGCCTTACCAAATCAGAGTCACTCCGGCGGGTGATTTGAAAACCGTCGGCCGGTTCGATTTCGATGGCCAACTGACATCCACAATGATAGCCCACCCGAAACTCGACCCTGTTTCCGGCGAGATGTTCGCTTTAAGCTACGACGTAATCCAAAAGCCATATTTGAAATACTTCAAATTCTCGCCGGAGGGAGAGAAATCACCGGACGTTGAAATTCCCCTGCCTCAGCCTACGATGATGCACGATTTCGCAATTACAGAGAAATTCGTCGTAATTCCCGACCAGCAGGTGGTTTTCAAGTTGCCGGAGATGATCCGCGGCGGGTCTCCGGTGGTATACGACAAAGAAAAAACCTCCCGATTCGGAATTTTGGACAAAAATGCCACAGACGCCAATTCTATTAAATGGATTGAAGCTCCTGATTGCTTCTGTTTCCATCTCTGGAACGCTTGGGAAGAACCAGAAACTAACGAAATCGTCGTAATTGGGTCGTGCATGACGCCGCCAGACTCCATTTTCAACGAATGCGAAGAGAATTTGAAATCCGTTTTGTCGGAAATTCGTCTGAATCTCTCGACTGGAAAATCGACCCGTCGACCAATCATCACTGAAACAGAGCAAGTGAACTTAGAGGCCGGAATGGTGAACCGAAATCTACTGGGGAGAAAAACCCAGTTTGCTTATCTTGCTCTAGCAGAGCCATGGCCAAAAGTTTCTGGTTTTGCGAAGGTCGATCTCTTCACCGGAGAAATCAAAAAGTATCTCTACGGCGAACAGAGGTACGGCGGTGAGCCTCTGTTTCTGCCTAGAGAAGGGGCTGAGGCAGAGGACGATGGCCACATCTTGGCCTTCGTTCACGACGAGAAGGAATGGAAATCGGAGCTTCAGATCGTTAACGCCATGACTTTGGAGCTTGAAGCTACTGTAAAGCTGCCTTCTCGAGTTCCTTATGGCTTCCATGGAACTTTCATAAGTTCCGAGGATTTGCAGAAGCAGATACGGTAA

mRNA sequence

ATGGCTTCTCTTTCTTCTTCTTCATGGATCAAATCTGGTTTTTCCTCTTCAATGCCGGAGAATCTCTCTGTAAATACAAATTATGGTAGAAACAGAGTTGCTACCTCCTGTTCTCTTCATACGCCTTCTATTATTCAAATTCCTAAGCATTCTCACACCGCATTTCCATCTCCTATTAAGACTGCAGTGCCTAAACCCCTTCCAGTTTCGCCTCCCTCCCTGTCCTCCGCATCCGATGACTGGAATTTCTTGCAGCGGGCTGCGGCTATGGCTCTTGATGCGGTTGAAAATGCTCTGGTTTCGGCAGAGCGGAAGCATTCCTTGCCCAAGACCGCCGACCCTGCGGTTCAAATTGCTGGCAATTTTGCCCCGGTGCCGGAGCAGCCGGTTCGGAAGGGCTTACCGGTGATTGGGAAAGTTCCTGACTGTATTCGAGGGGTTTATGTGCGAAACGGGGCCAACCCACTTCACGAACCCGTGTCCGGTCATCATTTGTTCGACGGTGATGGGATGGTTCACGCTGTGGAATTCTCTGAGGGCGGCGGTGTTAGTTACGCTTGCCGGTTCACGGAGACCCAACGGCTGGTTCAGGAGCGAGCTTACGGCCGGCCGGTATTCCCAAAGGCGATCGGCGAACTCCATGGTCACTCTGGGATAGCTCGGCTGATGCTTTTCTATGCCAGAGGATTATTTGGCCTTGTCGATCACAACCATGGAATTGGAGTCGCTAACGCCGGTCTGGTCTTTTTCAACGGCCGGCTTCTTGCCATGTCGGAGGACGATTTGCCTTACCAAATCAGAGTCACTCCGGCGGGTGATTTGAAAACCGTCGGCCGGTTCGATTTCGATGGCCAACTGACATCCACAATGATAGCCCACCCGAAACTCGACCCTGTTTCCGGCGAGATGTTCGCTTTAAGCTACGACGTAATCCAAAAGCCATATTTGAAATACTTCAAATTCTCGCCGGAGGGAGAGAAATCACCGGACGTTGAAATTCCCCTGCCTCAGCCTACGATGATGCACGATTTCGCAATTACAGAGAAATTCGTCGTAATTCCCGACCAGCAGGTGGTTTTCAAGTTGCCGGAGATGATCCGCGGCGGGTCTCCGGTGGTATACGACAAAGAAAAAACCTCCCGATTCGGAATTTTGGACAAAAATGCCACAGACGCCAATTCTATTAAATGGATTGAAGCTCCTGATTGCTTCTGTTTCCATCTCTGGAACGCTTGGGAAGAACCAGAAACTAACGAAATCGTCGTAATTGGGTCGTGCATGACGCCGCCAGACTCCATTTTCAACGAATGCGAAGAGAATTTGAAATCCGTTTTGTCGGAAATTCGTCTGAATCTCTCGACTGGAAAATCGACCCGTCGACCAATCATCACTGAAACAGAGCAAGTGAACTTAGAGGCCGGAATGGTGAACCGAAATCTACTGGGGAGAAAAACCCAGTTTGCTTATCTTGCTCTAGCAGAGCCATGGCCAAAAGTTTCTGGTTTTGCGAAGGTCGATCTCTTCACCGGAGAAATCAAAAAGTATCTCTACGGCGAACAGAGGTACGGCGGTGAGCCTCTGTTTCTGCCTAGAGAAGGGGCTGAGGCAGAGGACGATGGCCACATCTTGGCCTTCGTTCACGACGAGAAGGAATGGAAATCGGAGCTTCAGATCGTTAACGCCATGACTTTGGAGCTTGAAGCTACTGTAAAGCTGCCTTCTCGAGTTCCTTATGGCTTCCATGGAACTTTCATAAGTTCCGAGGATTTGCAGAAGCAGATACGGTAA

Coding sequence (CDS)

ATGGCTTCTCTTTCTTCTTCTTCATGGATCAAATCTGGTTTTTCCTCTTCAATGCCGGAGAATCTCTCTGTAAATACAAATTATGGTAGAAACAGAGTTGCTACCTCCTGTTCTCTTCATACGCCTTCTATTATTCAAATTCCTAAGCATTCTCACACCGCATTTCCATCTCCTATTAAGACTGCAGTGCCTAAACCCCTTCCAGTTTCGCCTCCCTCCCTGTCCTCCGCATCCGATGACTGGAATTTCTTGCAGCGGGCTGCGGCTATGGCTCTTGATGCGGTTGAAAATGCTCTGGTTTCGGCAGAGCGGAAGCATTCCTTGCCCAAGACCGCCGACCCTGCGGTTCAAATTGCTGGCAATTTTGCCCCGGTGCCGGAGCAGCCGGTTCGGAAGGGCTTACCGGTGATTGGGAAAGTTCCTGACTGTATTCGAGGGGTTTATGTGCGAAACGGGGCCAACCCACTTCACGAACCCGTGTCCGGTCATCATTTGTTCGACGGTGATGGGATGGTTCACGCTGTGGAATTCTCTGAGGGCGGCGGTGTTAGTTACGCTTGCCGGTTCACGGAGACCCAACGGCTGGTTCAGGAGCGAGCTTACGGCCGGCCGGTATTCCCAAAGGCGATCGGCGAACTCCATGGTCACTCTGGGATAGCTCGGCTGATGCTTTTCTATGCCAGAGGATTATTTGGCCTTGTCGATCACAACCATGGAATTGGAGTCGCTAACGCCGGTCTGGTCTTTTTCAACGGCCGGCTTCTTGCCATGTCGGAGGACGATTTGCCTTACCAAATCAGAGTCACTCCGGCGGGTGATTTGAAAACCGTCGGCCGGTTCGATTTCGATGGCCAACTGACATCCACAATGATAGCCCACCCGAAACTCGACCCTGTTTCCGGCGAGATGTTCGCTTTAAGCTACGACGTAATCCAAAAGCCATATTTGAAATACTTCAAATTCTCGCCGGAGGGAGAGAAATCACCGGACGTTGAAATTCCCCTGCCTCAGCCTACGATGATGCACGATTTCGCAATTACAGAGAAATTCGTCGTAATTCCCGACCAGCAGGTGGTTTTCAAGTTGCCGGAGATGATCCGCGGCGGGTCTCCGGTGGTATACGACAAAGAAAAAACCTCCCGATTCGGAATTTTGGACAAAAATGCCACAGACGCCAATTCTATTAAATGGATTGAAGCTCCTGATTGCTTCTGTTTCCATCTCTGGAACGCTTGGGAAGAACCAGAAACTAACGAAATCGTCGTAATTGGGTCGTGCATGACGCCGCCAGACTCCATTTTCAACGAATGCGAAGAGAATTTGAAATCCGTTTTGTCGGAAATTCGTCTGAATCTCTCGACTGGAAAATCGACCCGTCGACCAATCATCACTGAAACAGAGCAAGTGAACTTAGAGGCCGGAATGGTGAACCGAAATCTACTGGGGAGAAAAACCCAGTTTGCTTATCTTGCTCTAGCAGAGCCATGGCCAAAAGTTTCTGGTTTTGCGAAGGTCGATCTCTTCACCGGAGAAATCAAAAAGTATCTCTACGGCGAACAGAGGTACGGCGGTGAGCCTCTGTTTCTGCCTAGAGAAGGGGCTGAGGCAGAGGACGATGGCCACATCTTGGCCTTCGTTCACGACGAGAAGGAATGGAAATCGGAGCTTCAGATCGTTAACGCCATGACTTTGGAGCTTGAAGCTACTGTAAAGCTGCCTTCTCGAGTTCCTTATGGCTTCCATGGAACTTTCATAAGTTCCGAGGATTTGCAGAAGCAGATACGGTAA

Protein sequence

MASLSSSSWIKSGFSSSMPENLSVNTNYGRNRVATSCSLHTPSIIQIPKHSHTAFPSPIKTAVPKPLPVSPPSLSSASDDWNFLQRAAAMALDAVENALVSAERKHSLPKTADPAVQIAGNFAPVPEQPVRKGLPVIGKVPDCIRGVYVRNGANPLHEPVSGHHLFDGDGMVHAVEFSEGGGVSYACRFTETQRLVQERAYGRPVFPKAIGELHGHSGIARLMLFYARGLFGLVDHNHGIGVANAGLVFFNGRLLAMSEDDLPYQIRVTPAGDLKTVGRFDFDGQLTSTMIAHPKLDPVSGEMFALSYDVIQKPYLKYFKFSPEGEKSPDVEIPLPQPTMMHDFAITEKFVVIPDQQVVFKLPEMIRGGSPVVYDKEKTSRFGILDKNATDANSIKWIEAPDCFCFHLWNAWEEPETNEIVVIGSCMTPPDSIFNECEENLKSVLSEIRLNLSTGKSTRRPIITETEQVNLEAGMVNRNLLGRKTQFAYLALAEPWPKVSGFAKVDLFTGEIKKYLYGEQRYGGEPLFLPREGAEAEDDGHILAFVHDEKEWKSELQIVNAMTLELEATVKLPSRVPYGFHGTFISSEDLQKQIR
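
The relationship between the coding sequence and the protein sequence can be checked by translating the CDS with the standard genetic code. The sketch below is illustrative only: `translate` is a hypothetical helper, the standard code is assumed to apply, and only the first and last few codons of the CDS shown above are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS codon by codon; '*' marks a stop, 'X' an unknown codon."""
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3))

first = translate("ATGGCTTCTCTTTCT")  # first five codons of the CDS above
last = translate("CAGATACGGTAA")      # last four codons, including the stop
print(first, last)
```

The two fragments translate to `MASLS` and `QIR*`, which agree with the start and end of the protein sequence listed above.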
BLAST of ClCG07G010550 vs. Swiss-Prot
Match: NCED3_ARATH (9-cis-epoxycarotenoid dioxygenase NCED3, chloroplastic OS=Arabidopsis thaliana GN=NCED3 PE=2 SV=1)

HSP 1 Score: 876.7 bits (2264), Expect = 1.4e-253
Identity = 436/592 (73.65%), Positives = 489/592 (82.60%), Query Frame = 1

Query: 4   LSSSSWIKSGFSSSMPENLSVNTNYGRNRVATSCSLHTPSIIQIPKHSHTAFPSPIKTAV 63
           LSSS      + SS+P    V       ++  S +LHTP  +  PK S  +         
Sbjct: 24  LSSSQSSDLSYCSSLPMASRVT-----RKLNVSSALHTPPALHFPKQSSNS--------- 83

Query: 64  PKPLPVSPPSLSSASDDWNFLQRAAAMALDAVENALVSAERKHSLPKTADPAVQIAGNFA 123
              + V P +  S +   N  QRAAA ALDA E  LVS E+ H LPKTADP+VQIAGNFA
Sbjct: 84  -PAIVVKPKAKESNTKQMNLFQRAAAAALDAAEGFLVSHEKLHPLPKTADPSVQIAGNFA 143

Query: 124 PVPEQPVRKGLPVIGKVPDCIRGVYVRNGANPLHEPVSGHHLFDGDGMVHAVEFSEGGGV 183
           PV EQPVR+ LPV+GK+PD I+GVYVRNGANPLHEPV+GHH FDGDGMVHAV+F E G  
Sbjct: 144 PVNEQPVRRNLPVVGKLPDSIKGVYVRNGANPLHEPVTGHHFFDGDGMVHAVKF-EHGSA 203

Query: 184 SYACRFTETQRLVQERAYGRPVFPKAIGELHGHSGIARLMLFYARGLFGLVDHNHGIGVA 243
           SYACRFT+T R VQER  GRPVFPKAIGELHGH+GIARLMLFYAR   G+VD  HG GVA
Sbjct: 204 SYACRFTQTNRFVQERQLGRPVFPKAIGELHGHTGIARLMLFYARAAAGIVDPAHGTGVA 263

Query: 244 NAGLVFFNGRLLAMSEDDLPYQIRVTPAGDLKTVGRFDFDGQLTSTMIAHPKLDPVSGEM 303
           NAGLV+FNGRLLAMSEDDLPYQ+++TP GDLKTVGRFDFDGQL STMIAHPK+DP SGE+
Sbjct: 264 NAGLVYFNGRLLAMSEDDLPYQVQITPNGDLKTVGRFDFDGQLESTMIAHPKVDPESGEL 323

Query: 304 FALSYDVIQKPYLKYFKFSPEGEKSPDVEIPLPQPTMMHDFAITEKFVVIPDQQVVFKLP 363
           FALSYDV+ KPYLKYF+FSP+G KSPDVEI L QPTMMHDFAITE FVV+PDQQVVFKLP
Sbjct: 324 FALSYDVVSKPYLKYFRFSPDGTKSPDVEIQLDQPTMMHDFAITENFVVVPDQQVVFKLP 383

Query: 364 EMIRGGSPVVYDKEKTSRFGILDKNATDANSIKWIEAPDCFCFHLWNAWEEPETNEIVVI 423
           EMIRGGSPVVYDK K +RFGILDK A D+++IKWI+APDCFCFHLWNAWEEPET+E+VVI
Sbjct: 384 EMIRGGSPVVYDKNKVARFGILDKYAEDSSNIKWIDAPDCFCFHLWNAWEEPETDEVVVI 443

Query: 424 GSCMTPPDSIFNECEENLKSVLSEIRLNLSTGKSTRRPIIT-ETEQVNLEAGMVNRNLLG 483
           GSCMTPPDSIFNE +ENLKSVLSEIRLNL TG+STRRPII+ E +QVNLEAGMVNRN+LG
Sbjct: 444 GSCMTPPDSIFNESDENLKSVLSEIRLNLKTGESTRRPIISNEDQQVNLEAGMVNRNMLG 503

Query: 484 RKTQFAYLALAEPWPKVSGFAKVDLFTGEIKKYLYGEQRYGGEPLFLPREGAEAEDDGHI 543
           RKT+FAYLALAEPWPKVSGFAKVDL TGE+KK+LYG+ RYGGEPLFLP EG E ED+G+I
Sbjct: 504 RKTKFAYLALAEPWPKVSGFAKVDLTTGEVKKHLYGDNRYGGEPLFLPGEGGE-EDEGYI 563

Query: 544 LAFVHDEKEWKSELQIVNAMTLELEATVKLPSRVPYGFHGTFISSEDLQKQI 595
           L FVHDEK WKSELQIVNA++LE+EATVKLPSRVPYGFHGTFI ++DL KQ+
Sbjct: 564 LCFVHDEKTWKSELQIVNAVSLEVEATVKLPSRVPYGFHGTFIGADDLAKQV 598

BLAST of ClCG07G010550 vs. Swiss-Prot
Match: NCED1_PHAVU (9-cis-epoxycarotenoid dioxygenase NCED1, chloroplastic OS=Phaseolus vulgaris GN=NCED1 PE=2 SV=1)

HSP 1 Score: 864.0 bits (2231), Expect = 9.6e-250
Identity = 434/617 (70.34%), Positives = 487/617 (78.93%), Query Frame = 1

Query: 1   MASLSSSSWIKSGF--------------SSSMPENLSVNTNYGRNRVATSCSL---HTPS 60
           M S +S++WI +                SSS    L        N    +CSL   H P 
Sbjct: 1   MPSPASNTWINTTLPSSCSSPFKDLASTSSSPTTLLPFKKRSSSNTNTITCSLQTLHYPK 60

Query: 61  IIQIPKHSHTAFPSPIKTAVPKPLPVSPPSLSSASDD-------WNFLQRAAAMALDAVE 120
             Q    S T  P+PIK                 SD        WNFLQ+AAA  LD VE
Sbjct: 61  QYQPTSTSTTTTPTPIKPTTTTTTTTPHRETKPLSDTKQPFPQKWNFLQKAAATGLDMVE 120

Query: 121 NALVSAERKHSLPKTADPAVQIAGNFAPVPEQPVRKGLPVIGKVPDCIRGVYVRNGANPL 180
            ALVS E KH LPKTADP VQIAGNFAPVPE    + LPV+GK+P CI GVYVRNGANPL
Sbjct: 121 TALVSHESKHPLPKTADPKVQIAGNFAPVPEHAADQALPVVGKIPKCIDGVYVRNGANPL 180

Query: 181 HEPVSGHHLFDGDGMVHAVEFSEGGGVSYACRFTETQRLVQERAYGRPVFPKAIGELHGH 240
           +EPV+GHH FDGDGMVHAV+F+ G   SYACRFTETQRL QE++ GRPVFPKAIGELHGH
Sbjct: 181 YEPVAGHHFFDGDGMVHAVKFTNGAA-SYACRFTETQRLAQEKSLGRPVFPKAIGELHGH 240

Query: 241 SGIARLMLFYARGLFGLVDHNHGIGVANAGLVFFNGRLLAMSEDDLPYQIRVTPAGDLKT 300
           SGIARL+LFYAR LF LVD +HG+GVANAGLV+FN  LLAMSEDDLPY +R+T  GDL T
Sbjct: 241 SGIARLLLFYARSLFQLVDGSHGMGVANAGLVYFNNHLLAMSEDDLPYHVRITSNGDLTT 300

Query: 301 VGRFDFDGQLTSTMIAHPKLDPVSGEMFALSYDVIQKPYLKYFKFSPEGEKSPDVEIPLP 360
           VGR+DF+GQL STMIAHPKLDPV+G++ ALSYDV+QKPYLKYF+FS +G KSPDVEIPL 
Sbjct: 301 VGRYDFNGQLNSTMIAHPKLDPVNGDLHALSYDVVQKPYLKYFRFSADGVKSPDVEIPLK 360

Query: 361 QPTMMHDFAITEKFVVIPDQQVVFKLPEMIRGGSPVVYDKEKTSRFGILDKNATDANSIK 420
           +PTMMHDFAITE FVV+PDQQVVFKL EMI GGSPVVYDK KTSRFGILDKNA DAN+++
Sbjct: 361 EPTMMHDFAITENFVVVPDQQVVFKLTEMITGGSPVVYDKNKTSRFGILDKNAKDANAMR 420

Query: 421 WIEAPDCFCFHLWNAWEEPETNEIVVIGSCMTPPDSIFNECEENLKSVLSEIRLNLSTGK 480
           WI+AP+CFCFHLWNAWEEPET+EIVVIGSCMTP DSIFNEC+E+LKSVLSEIRLNL TGK
Sbjct: 421 WIDAPECFCFHLWNAWEEPETDEIVVIGSCMTPADSIFNECDESLKSVLSEIRLNLRTGK 480

Query: 481 STRRPIITETEQVNLEAGMVNRNLLGRKTQFAYLALAEPWPKVSGFAKVDLFTGEIKKYL 540
           STRRPII++ EQVNLEAGMVNRN LGRKTQFAYLALAEPWPKVSGFAKVDLF+GE++KY+
Sbjct: 481 STRRPIISDAEQVNLEAGMVNRNKLGRKTQFAYLALAEPWPKVSGFAKVDLFSGEVQKYM 540

Query: 541 YGEQRYGGEPLFLPREGAEAEDDGHILAFVHDEKEWKSELQIVNAMTLELEATVKLPSRV 594
           YGE+++GGEPLFLP    E E DG+ILAFVHDEKEWKSELQIVNA  L+LEA++KLPSRV
Sbjct: 541 YGEEKFGGEPLFLP--NGEEEGDGYILAFVHDEKEWKSELQIVNAQNLKLEASIKLPSRV 600

BLAST of ClCG07G010550 vs. Swiss-Prot
Match: NCED9_ARATH (9-cis-epoxycarotenoid dioxygenase NCED9, chloroplastic OS=Arabidopsis thaliana GN=NCED9 PE=2 SV=1)

HSP 1 Score: 820.8 bits (2119), Expect = 9.3e-237
Identity = 414/586 (70.65%), Positives = 478/586 (81.57%), Query Frame = 1

Query: 15  SSSMP--ENLSVNTNYGRNRVATSC----SLHTPSIIQIPKHSHTAFPSPIKTAVPKPLP 74
           SSS P  ++LS ++     ++   C    S++  S +     S T F  P    +   + 
Sbjct: 74  SSSRPKLQSLSFSSTLRNKKLVVPCYVSSSVNKKSSVSSSLQSPT-FKPPSWKKLCNDVT 133

Query: 75  VSPPSLSSASDDWNFLQRAAAMALDAVENALVSAERK-HSLPKTADPAVQIAGNFAPVPE 134
              P  ++ +   N +QR AAM LDAVENA++S ER+ H  PKTADPAVQIAGNF PVPE
Sbjct: 134 NLIPKTTNQNPKLNPVQRTAAMVLDAVENAMISHERRRHPHPKTADPAVQIAGNFFPVPE 193

Query: 135 QPVRKGLPVIGKVPDCIRGVYVRNGANPLHEPVSGHHLFDGDGMVHAVEFSEGGGVSYAC 194
           +PV   LPV G VP+CI+GVYVRNGANPLH+PVSGHHLFDGDGMVHAV F + G VSYAC
Sbjct: 194 KPVVHNLPVTGTVPECIQGVYVRNGANPLHKPVSGHHLFDGDGMVHAVRF-DNGSVSYAC 253

Query: 195 RFTETQRLVQERAYGRPVFPKAIGELHGHSGIARLMLFYARGLFGLVDHNHGIGVANAGL 254
           RFTET RLVQER  GRPVFPKAIGELHGH GIA+LMLF  RGLFGLVD   G+GVANAGL
Sbjct: 254 RFTETNRLVQERECGRPVFPKAIGELHGHLGIAKLMLFNTRGLFGLVDPTGGLGVANAGL 313

Query: 255 VFFNGRLLAMSEDDLPYQIRVTPAGDLKTVGRFDFDGQLTSTMIAHPKLDPVSGEMFALS 314
           V+FNG LLAMSEDDLPY ++VT  GDL+T GR+DFDGQL STMIAHPK+DP + E+FALS
Sbjct: 314 VYFNGHLLAMSEDDLPYHVKVTQTGDLETSGRYDFDGQLKSTMIAHPKIDPETRELFALS 373

Query: 315 YDVIQKPYLKYFKFSPEGEKSPDVEIPLPQPTMMHDFAITEKFVVIPDQQVVFKLPEMIR 374
           YDV+ KPYLKYF+F+ +GEKSPDVEIPL QPTM+HDFAITE FVVIPDQQVVF+LPEMIR
Sbjct: 374 YDVVSKPYLKYFRFTSDGEKSPDVEIPLDQPTMIHDFAITENFVVIPDQQVVFRLPEMIR 433

Query: 375 GGSPVVYDKEKTSRFGILDKNATDANSIKWIEAPDCFCFHLWNAWEEPETNEIVVIGSCM 434
           GGSPVVYD++K SRFGIL+KNA DA+SI+WIE PDCFCFHLWN+WEEPET+E+VVIGSCM
Sbjct: 434 GGSPVVYDEKKKSRFGILNKNAKDASSIQWIEVPDCFCFHLWNSWEEPETDEVVVIGSCM 493

Query: 435 TPPDSIFNECEENLKSVLSEIRLNLSTGKSTRRPIITETEQVNLEAGMVNRNLLGRKTQF 494
           TPPDSIFNE +E L+SVLSEIRLNL TG+STRRP+I  +EQVNLEAGMVNRNLLGRKT++
Sbjct: 494 TPPDSIFNEHDETLQSVLSEIRLNLKTGESTRRPVI--SEQVNLEAGMVNRNLLGRKTRY 553

Query: 495 AYLALAEPWPKVSGFAKVDLFTGEIKKYLYGEQRYGGEPLFLPREGAEAEDDGHILAFVH 554
           AYLAL EPWPKVSGFAKVDL TGEI+KY+YGE +YGGEPLFLP  G   ED G+I+ FVH
Sbjct: 554 AYLALTEPWPKVSGFAKVDLSTGEIRKYIYGEGKYGGEPLFLP-SGDGEEDGGYIMVFVH 613

Query: 555 DEKEWKSELQIVNAMTLELEATVKLPSRVPYGFHGTFISSEDLQKQ 594
           DE++ KSELQ++NA+ ++LEATV LPSRVPYGFHGTFIS EDL KQ
Sbjct: 614 DEEKVKSELQLINAVNMKLEATVTLPSRVPYGFHGTFISKEDLSKQ 654

BLAST of ClCG07G010550 vs. Swiss-Prot
Match: NCED5_ARATH (Probable 9-cis-epoxycarotenoid dioxygenase NCED5, chloroplastic OS=Arabidopsis thaliana GN=NCED5 PE=1 SV=1)

HSP 1 Score: 806.6 bits (2082), Expect = 1.8e-232
Identity = 395/570 (69.30%), Positives = 460/570 (80.70%), Query Frame = 1

Query: 25  NTNYGRNRVATSCSLHTPSIIQIPKHSHTAFPSPIKTAVPKPLPVSPPSLSSASDDWNFL 84
           NT   R +++ +    TP+++  P +             P P P+ P   +S    WN L
Sbjct: 36  NTKPRRRKLSANSVSDTPNLLNFPNY-------------PSPNPIIPEKDTSR---WNPL 95

Query: 85  QRAAAMALDAVENALVSAERKHSLPKTADPAVQIAGNFAPVPEQPVRKGLPVIGKVPDCI 144
           QRAA+ ALD  E AL+  ER   LPKT DP  QI+GN+APVPEQ V+  L V GK+PDCI
Sbjct: 96  QRAASAALDFAETALLRRERSKPLPKTVDPRHQISGNYAPVPEQSVKSSLSVDGKIPDCI 155

Query: 145 RGVYVRNGANPLHEPVSGHHLFDGDGMVHAVEFSEGGGVSYACRFTETQRLVQERAYGRP 204
            GVY+RNGANPL EPVSGHHLFDGDGMVHAV+ + G   SY+CRFTET+RLVQE+  G P
Sbjct: 156 DGVYLRNGANPLFEPVSGHHLFDGDGMVHAVKITNGDA-SYSCRFTETERLVQEKQLGSP 215

Query: 205 VFPKAIGELHGHSGIARLMLFYARGLFGLVDHNHGIGVANAGLVFFNGRLLAMSEDDLPY 264
           +FPKAIGELHGHSGIARLMLFYARGLFGL++H +G GVANAGLV+F+ RLLAMSEDDLPY
Sbjct: 216 IFPKAIGELHGHSGIARLMLFYARGLFGLLNHKNGTGVANAGLVYFHDRLLAMSEDDLPY 275

Query: 265 QIRVTPAGDLKTVGRFDFDGQLTSTMIAHPKLDPVSGEMFALSYDVIQKPYLKYFKFSPE 324
           Q+RVT  GDL+T+GRFDFDGQL+S MIAHPK+DPV+ E+FALSYDV++KPYLKYFKFSPE
Sbjct: 276 QVRVTDNGDLETIGRFDFDGQLSSAMIAHPKIDPVTKELFALSYDVVKKPYLKYFKFSPE 335

Query: 325 GEKSPDVEIPLPQPTMMHDFAITEKFVVIPDQQVVFKLPEMIRGGSPVVYDKEKTSRFGI 384
           GEKSPDVEIPL  PTMMHDFAITE FVVIPDQQVVFKL +M  G SPV YD EK SRFGI
Sbjct: 336 GEKSPDVEIPLASPTMMHDFAITENFVVIPDQQVVFKLSDMFLGKSPVKYDGEKISRFGI 395

Query: 385 LDKNATDANSIKWIEAPDCFCFHLWNAWEEPETNEIVVIGSCMTPPDSIFNECEENLKSV 444
           L +NA DA+ + W+E+P+ FCFHLWNAWE PET+E+VVIGSCMTP DSIFNEC+E L SV
Sbjct: 396 LPRNAKDASEMVWVESPETFCFHLWNAWESPETDEVVVIGSCMTPADSIFNECDEQLNSV 455

Query: 445 LSEIRLNLSTGKSTRRPIITETEQVNLEAGMVNRNLLGRKTQFAYLALAEPWPKVSGFAK 504
           LSEIRLNL TGKSTRR II  + Q+NLEAGMVNRNLLGRKT++AYLA+AEPWPKVSGFAK
Sbjct: 456 LSEIRLNLKTGKSTRRTIIPGSVQMNLEAGMVNRNLLGRKTRYAYLAIAEPWPKVSGFAK 515

Query: 505 VDLFTGEIKKYLYGEQRYGGEPLFLPRE-GAEAEDDGHILAFVHDEKEWKSELQIVNAMT 564
           VDL TGE+K + YG ++YGGEP FLPR   ++ EDDG+I++FVHDE+ W+SEL IVNA+T
Sbjct: 516 VDLSTGEVKNHFYGGKKYGGEPFFLPRGLESDGEDDGYIMSFVHDEESWESELHIVNAVT 575

Query: 565 LELEATVKLPSRVPYGFHGTFISSEDLQKQ 594
           LELEATVKLPSRVPYGFHGTF++S D+  Q
Sbjct: 576 LELEATVKLPSRVPYGFHGTFVNSADMLNQ 588

BLAST of ClCG07G010550 vs. Swiss-Prot
Match: NCED_ONCHC (9-cis-epoxycarotenoid dioxygenase, chloroplastic OS=Oncidium hybrid cultivar GN=NCED PE=2 SV=1)

HSP 1 Score: 786.9 bits (2031), Expect = 1.5e-226
Identity = 386/546 (70.70%), Positives = 442/546 (80.95%), Query Frame = 1

Query: 53  TAFPSPIKTAVPK-PLPVSPPSLSSAS-DDWNFLQRAAAMALDAVENALVS--AERKHSL 112
           T  PSP     P  P  + P   +  S   WN  QRAAA  L+AVE+ L+    E  H L
Sbjct: 65  TTSPSPFYPLPPTCPKEIHPEQSAKPSRPSWNLFQRAAAAVLEAVEDNLIQNLLESGHPL 124

Query: 113 PKTADPAVQIAGNFAPVPEQPVRKGLPVIGKVPDCIRGVYVRNGANPLHEPVSGHHLFDG 172
           PKTADPAVQIAGNFAPV EQ     LPV G++P  I GVY+RNGANPL EPV+GHH FDG
Sbjct: 125 PKTADPAVQIAGNFAPVGEQKPHHDLPVDGRIPPLINGVYLRNGANPLFEPVAGHHFFDG 184

Query: 173 DGMVHAVEFSEGGGVSYACRFTETQRLVQERAYGRPVFPKAIGELHGHSGIARLMLFYAR 232
           DGMVHAV     G  SYACRFTET+RL QERA GR +FPKAIGELHGHSGIARL+LFYAR
Sbjct: 185 DGMVHAVHL-RNGRASYACRFTETERLKQERAVGRAIFPKAIGELHGHSGIARLLLFYAR 244

Query: 233 GLFGLVDHNHGIGVANAGLVFFNGRLLAMSEDDLPYQIRVTPAGDLKTVGRFDFDGQLTS 292
           GL GL+DH+ G GVANAGL++FN RLLAMSEDDLPY +R+ P GDL+T GR+DFDGQLT+
Sbjct: 245 GLLGLIDHSRGTGVANAGLIYFNNRLLAMSEDDLPYHVRIKPNGDLETAGRYDFDGQLTT 304

Query: 293 TMIAHPKLDPVSGEMFALSYDVIQKPYLKYFKFSPEGEKSPDVEIPLPQPTMMHDFAITE 352
           TMIAHPKLDP + E FALSYDVI+KPYLKYF+FSP GEKSPDVEIPLPQPTMMHDFAIT+
Sbjct: 305 TMIAHPKLDPETREFFALSYDVIKKPYLKYFRFSPCGEKSPDVEIPLPQPTMMHDFAITK 364

Query: 353 KFVVIPDQQVVFKLPEMIRGGSPVVYDKEKTSRFGILDKNATDANSIKWIEAPDCFCFHL 412
            FV+IPDQQVVFKL EMI GGSPVVYDKEK +RFG+L K A DA+ ++WI+ PDCFCFHL
Sbjct: 365 NFVIIPDQQVVFKLQEMICGGSPVVYDKEKIARFGVLPKYAIDASEMQWIDVPDCFCFHL 424

Query: 413 WNAWEEPETNEIVVIGSCMTPPDSIFNECEENLKSVLSEIRLNLSTGKSTRRPIITE-TE 472
           WN+WEEPET E+VVIGSCMTPPDSIFNE EENL+SVL+EIRLNL TGKSTRRPI+     
Sbjct: 425 WNSWEEPETEEVVVIGSCMTPPDSIFNESEENLQSVLTEIRLNLRTGKSTRRPILRPGNS 484

Query: 473 QVNLEAGMVNRNLLGRKTQFAYLALAEPWPKVSGFAKVDLFTGEIKKYLYGEQRYGGEPL 532
           Q+NLEAGMVNRN LGR+T+FAYLA+AEPWPKVSGFAKVDL +GEI+++ YG+  YGGEP 
Sbjct: 485 QINLEAGMVNRNRLGRRTRFAYLAIAEPWPKVSGFAKVDLASGEIQRFEYGDGGYGGEPY 544

Query: 533 FLPREGAEAEDDGHILAFVHDEKEWKSELQIVNAMTLELEATVKLPSRVPYGFHGTFISS 592
           F+PREG + ED G++LAFVHDE+E  SEL I+NA  + LEA V+LPSRVPYGF+GTF+S+
Sbjct: 545 FVPREGCDREDGGYVLAFVHDEREGSSELLIMNAADMRLEAAVRLPSRVPYGFYGTFVSA 604

Query: 593 EDLQKQ 594
            +L  Q
Sbjct: 605 TELHSQ 609

BLAST of ClCG07G010550 vs. TrEMBL
Match: A0A0D2R0C3_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_002G038100 PE=4 SV=1)

HSP 1 Score: 921.0 bits (2379), Expect = 7.4e-265
Identity = 455/597 (76.21%), Positives = 507/597 (84.92%), Query Frame = 1

Query: 5   SSSSWIKSGFSSSMPENLSVNTN--YGRNRVATSCSLHTPSIIQIPKHSHTAFPSPIKTA 64
           S S  +K     S P++ S +++   G  +   SCSL TPSI+ +PK    AFP      
Sbjct: 11  SGSCCVKVKLPVSTPQSASSSSSSCIGFKKRYISCSLQTPSILHLPKQQSPAFP------ 70

Query: 65  VPKPLPVSPPSLSSA-----SDDWNFLQRAAAMALDAVENALVSAERKHSLPKTADPAVQ 124
              P P S PS  +      S  WN LQRAAAMALDAVENALVS ER+H LPKTADP VQ
Sbjct: 71  ---PSPSSTPSTKNTEKTTQSQQWNPLQRAAAMALDAVENALVSHERQHPLPKTADPGVQ 130

Query: 125 IAGNFAPVPEQPVRKGLPVIGKVPDCIRGVYVRNGANPLHEPVSGHHLFDGDGMVHAVEF 184
           I+GNFAPVPEQPV++ LPVIG +PDCI+GVY RNGANPLHEPV+GHH FDGDGMVHAV+F
Sbjct: 131 ISGNFAPVPEQPVKQRLPVIGTIPDCIQGVYARNGANPLHEPVAGHHFFDGDGMVHAVQF 190

Query: 185 SEGGGVSYACRFTETQRLVQERAYGRPVFPKAIGELHGHSGIARLMLFYARGLFGLVDHN 244
            + G  SYACRFTET RLVQERA+GRPVFPKAIGELHGHSGIARL+LFYARGL GLVD +
Sbjct: 191 -KNGSASYACRFTETNRLVQERAFGRPVFPKAIGELHGHSGIARLLLFYARGLCGLVDPS 250

Query: 245 HGIGVANAGLVFFNGRLLAMSEDDLPYQIRVTPAGDLKTVGRFDFDGQLTSTMIAHPKLD 304
           HG GVANAGLV+FNG LLAMSEDDLPY +R+TP+GDLKT+GR+DFDGQL STMIAHPK+D
Sbjct: 251 HGTGVANAGLVYFNGHLLAMSEDDLPYHVRITPSGDLKTIGRYDFDGQLKSTMIAHPKVD 310

Query: 305 PVSGEMFALSYDVIQKPYLKYFKFSPEGEKSPDVEIPLPQPTMMHDFAITEKFVVIPDQQ 364
           P +GE FALSYDVIQKP+LKYFK SP+G+KSPDVEIP+  PTMMHDFAITE FVVIPDQQ
Sbjct: 311 PQTGEFFALSYDVIQKPHLKYFKISPDGKKSPDVEIPVDGPTMMHDFAITENFVVIPDQQ 370

Query: 365 VVFKLPEMIRGGSPVVYDKEKTSRFGILDKNATDANSIKWIEAPDCFCFHLWNAWEEPET 424
           VVFKL EM+ GGSPVVYDK K SRFG+L+KNA DA+ IKWIEAPDCFCFHLWNAWEEPET
Sbjct: 371 VVFKLGEMVHGGSPVVYDKNKVSRFGVLNKNAIDASGIKWIEAPDCFCFHLWNAWEEPET 430

Query: 425 NEIVVIGSCMTPPDSIFNECEENLKSVLSEIRLNLSTGKSTRRPIITETEQVNLEAGMVN 484
           NE+VVIGSCMTPPDSIFNECEENLKSVLSEIRLNL TGKSTRR II+E+EQVNLEAGMVN
Sbjct: 431 NEVVVIGSCMTPPDSIFNECEENLKSVLSEIRLNLKTGKSTRRAIISESEQVNLEAGMVN 490

Query: 485 RNLLGRKTQFAYLALAEPWPKVSGFAKVDLFTGEIKKYLYGEQRYGGEPLFLPRE-GAEA 544
           +NLLGRKT+FAYLALAEPWPKVSGFAKVDL TGE+ KY+YG+QRYGGEPLF PR   +E 
Sbjct: 491 KNLLGRKTRFAYLALAEPWPKVSGFAKVDLSTGEVNKYIYGDQRYGGEPLFFPRNPNSEN 550

Query: 545 EDDGHILAFVHDEKEWKSELQIVNAMTLELEATVKLPSRVPYGFHGTFISSEDLQKQ 594
           EDDG+ILAFVHDEK WKSELQIVNAM L+LEA V+LPSRVPYGFHGTFISS+DL+KQ
Sbjct: 551 EDDGYILAFVHDEKTWKSELQIVNAMDLKLEAAVQLPSRVPYGFHGTFISSKDLEKQ 597

BLAST of ClCG07G010550 vs. TrEMBL
Match: A0A0H3YDS4_GOSHI (9-cis-epoxycarotenoid dioxygenase 2 OS=Gossypium hirsutum GN=NCED2 PE=2 SV=1)

HSP 1 Score: 914.4 bits (2362), Expect = 6.9e-263
Identity = 453/592 (76.52%), Positives = 506/592 (85.47%), Query Frame = 1

Query: 5   SSSSWIKSGFSSSMPENLSVNTN-YGRNRVATSCSLHTPSIIQIPKHSHTAFP-SPIKTA 64
           S S  +K     S P++ S +++  G  +   SCSL T SI+ +PK    AFP SP  T 
Sbjct: 11  SGSCCVKVKLPVSTPQSASSSSSCIGFKKRYISCSLQTHSILHLPKQQLPAFPPSPSSTP 70

Query: 65  VPKPLPVSPPSLSSASDDWNFLQRAAAMALDAVENALVSAERKHSLPKTADPAVQIAGNF 124
             K         ++ S  WN LQRAAAMALDAVENALVS ER+H LPKTADP VQI+GNF
Sbjct: 71  TTKNT-----EKTTQSQQWNPLQRAAAMALDAVENALVSHERQHPLPKTADPGVQISGNF 130

Query: 125 APVPEQPVRKGLPVIGKVPDCIRGVYVRNGANPLHEPVSGHHLFDGDGMVHAVEFSEGGG 184
           APVPEQ V++ LPVIG +PDCI+GVY RNGANPLHEPV+GHH FDGDGMVHAV+F + G 
Sbjct: 131 APVPEQTVKQRLPVIGTIPDCIQGVYARNGANPLHEPVAGHHFFDGDGMVHAVQF-KNGS 190

Query: 185 VSYACRFTETQRLVQERAYGRPVFPKAIGELHGHSGIARLMLFYARGLFGLVDHNHGIGV 244
            SYACRFTET RLVQERA+GRPVFPKAIGELHGHSGIARL+LFYARGL GLVD +HG GV
Sbjct: 191 ASYACRFTETNRLVQERAFGRPVFPKAIGELHGHSGIARLLLFYARGLCGLVDPSHGTGV 250

Query: 245 ANAGLVFFNGRLLAMSEDDLPYQIRVTPAGDLKTVGRFDFDGQLTSTMIAHPKLDPVSGE 304
           ANAGLV+FNG LLAMSEDDLPY +R+TP+GDLKT+GR+DFDGQL STMIAHPK+DP +GE
Sbjct: 251 ANAGLVYFNGHLLAMSEDDLPYYVRITPSGDLKTIGRYDFDGQLKSTMIAHPKVDPQTGE 310

Query: 305 MFALSYDVIQKPYLKYFKFSPEGEKSPDVEIPLPQPTMMHDFAITEKFVVIPDQQVVFKL 364
            FALSYDVIQKP+LKYFK SP+G+KSPDVEIP+  PTMMHDFAITE FVVIPDQQVVFKL
Sbjct: 311 FFALSYDVIQKPHLKYFKISPDGKKSPDVEIPVDGPTMMHDFAITENFVVIPDQQVVFKL 370

Query: 365 PEMIRGGSPVVYDKEKTSRFGILDKNATDANSIKWIEAPDCFCFHLWNAWEEPETNEIVV 424
            EM+ GGSPVVYDK K SRFG+L+KNA DA+ IKWIEAPDCFCFHLWNAWEEPETNE+VV
Sbjct: 371 GEMVHGGSPVVYDKNKVSRFGVLNKNAIDASGIKWIEAPDCFCFHLWNAWEEPETNEVVV 430

Query: 425 IGSCMTPPDSIFNECEENLKSVLSEIRLNLSTGKSTRRPIITETEQVNLEAGMVNRNLLG 484
           IGSCMTPPDSIFNECEENLKSVLSEIRLNL TGKSTRR II+E+EQVNLEAGMVN+NLLG
Sbjct: 431 IGSCMTPPDSIFNECEENLKSVLSEIRLNLKTGKSTRRAIISESEQVNLEAGMVNKNLLG 490

Query: 485 RKTQFAYLALAEPWPKVSGFAKVDLFTGEIKKYLYGEQRYGGEPLFLPRE-GAEAEDDGH 544
           RKT+FAYLALAEPWPKVSGFAKVDL TGE+ KY+YG+QRYGGEPLF PR   +E EDDG+
Sbjct: 491 RKTRFAYLALAEPWPKVSGFAKVDLSTGEVNKYIYGDQRYGGEPLFFPRNPNSENEDDGY 550

Query: 545 ILAFVHDEKEWKSELQIVNAMTLELEATVKLPSRVPYGFHGTFISSEDLQKQ 594
           ILAFVHDEK WKSELQIVNAM L+LEATV+LPSRVPYGFHGTFISS+DL+KQ
Sbjct: 551 ILAFVHDEKTWKSELQIVNAMDLKLEATVQLPSRVPYGFHGTFISSKDLEKQ 596

BLAST of ClCG07G010550 vs. TrEMBL
Match: B9SU59_RICCO (9-cis-epoxycarotenoid dioxygenase, putative OS=Ricinus communis GN=RCOM_1716910 PE=4 SV=1)

HSP 1 Score: 912.1 bits (2356), Expect = 3.4e-262
Identity = 447/596 (75.00%), Positives = 499/596 (83.72%), Query Frame = 1

Query: 12  SGFSSSMPENLSVNTNYGRNRVATSCSLHTPSIIQIPKHS----------HTAFPSPIK- 71
           S  SSS P ++ +N ++       SCSLHTPSI+  PK S           T+ P+P K 
Sbjct: 7   SSSSSSSPSSIFLNWHHSGKANIISCSLHTPSILHFPKQSTKTTNFPPSPSTSTPAPKKW 66

Query: 72  --TAVPKPLPVS-PPSLSSASDDWNFLQRAAAMALDAVENALVSAERKHSLPKTADPAVQ 131
             T V KPL         +   + N LQRAA+MALDAVE+ALVS ERKH LPKTADP VQ
Sbjct: 67  LTTTVDKPLATQHQQQRQTQKQELNLLQRAASMALDAVESALVSHERKHPLPKTADPVVQ 126

Query: 132 IAGNFAPVPEQPVRKGLPVIGKVPDCIRGVYVRNGANPLHEPVSGHHLFDGDGMVHAVEF 191
           I GNFAPVPEQ V   LPV GK+PD I GVY+RNGANPLHEPV+GHH FDGDGMVHAV+F
Sbjct: 127 ICGNFAPVPEQSVVHSLPVAGKIPDSIHGVYIRNGANPLHEPVAGHHFFDGDGMVHAVQF 186

Query: 192 SEGGGVSYACRFTETQRLVQERAYGRPVFPKAIGELHGHSGIARLMLFYARGLFGLVDHN 251
            +GG VSYACRFTET R VQER  GRPVFPKAIGELHGHSGIARL+LFYARGL GLVD +
Sbjct: 187 EKGGSVSYACRFTETNRFVQERELGRPVFPKAIGELHGHSGIARLLLFYARGLVGLVDPS 246

Query: 252 HGIGVANAGLVFFNGRLLAMSEDDLPYQIRVTPAGDLKTVGRFDFDGQLTSTMIAHPKLD 311
           HG GVANAGLV+FNGRLLAMSEDDLPY +RV P+GDLKTV R+DFD QL STMIAHPK+D
Sbjct: 247 HGTGVANAGLVYFNGRLLAMSEDDLPYHVRVLPSGDLKTVARYDFDAQLKSTMIAHPKVD 306

Query: 312 PVSGEMFALSYDVIQKPYLKYFKFSPEGEKSPDVEIPLPQPTMMHDFAITEKFVVIPDQQ 371
           PVSGE+FALSYDV+QKP+LKYF+FSP+G+KSPDVEI L QPTMMHDFAITE FVV+PDQQ
Sbjct: 307 PVSGELFALSYDVVQKPFLKYFRFSPDGKKSPDVEISLDQPTMMHDFAITENFVVVPDQQ 366

Query: 372 VVFKLPEMIRGGSPVVYDKEKTSRFGILDKNATDANSIKWIEAPDCFCFHLWNAWEEPET 431
           VVFKLPEMIRGGSPV+YDK K SRFGILDKN+ D + IKWIEAPDCFCFHLWNAWEEPET
Sbjct: 367 VVFKLPEMIRGGSPVIYDKNKMSRFGILDKNSNDGSKIKWIEAPDCFCFHLWNAWEEPET 426

Query: 432 NEIVVIGSCMTPPDSIFNECEENLKSVLSEIRLNLSTGKSTRRPIITETEQVNLEAGMVN 491
           +E+VV+GSCMTPPDSIFNECEE+LKSVLSEIRLNL TGKSTRR II++ EQVNLEAGMV+
Sbjct: 427 DEVVVVGSCMTPPDSIFNECEESLKSVLSEIRLNLKTGKSTRRAIISQHEQVNLEAGMVS 486

Query: 492 RNLLGRKTQFAYLALAEPWPKVSGFAKVDLFTGEIKKYLYGEQRYGGEPLFLPREGAEAE 551
           RNLLGRKT+FAYLALAEPWPKVSGFAKVDL TGE+ KY+YG+ +YGGEP+FLP   +  E
Sbjct: 487 RNLLGRKTRFAYLALAEPWPKVSGFAKVDLNTGEVHKYVYGDSKYGGEPMFLPSSDSAQE 546

Query: 552 DDGHILAFVHDEKEWKSELQIVNAMTLELEATVKLPSRVPYGFHGTFISSEDLQKQ 594
           D G+IL FVHDEKEWKSELQIVNAM L+LEATVKLPSRVPYGFHGTFI++ DLQKQ
Sbjct: 547 DSGYILCFVHDEKEWKSELQIVNAMNLKLEATVKLPSRVPYGFHGTFINASDLQKQ 602

BLAST of ClCG07G010550 vs. TrEMBL
Match: A0A0D2U362_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_013G177100 PE=4 SV=1)

HSP 1 Score: 909.8 bits (2350), Expect = 1.7e-261
Identity = 445/559 (79.61%), Positives = 492/559 (88.01%), Query Frame = 1

Query: 36  SCSLHTPSIIQIPKHSHTAFPSPIKTAVPKPLPVSPPSLSSASDDWNFLQRAAAMALDAV 95
           SCSL TPSI+ +P  S T + +  K AV KP P            WN  QRAAAMALD V
Sbjct: 44  SCSLQTPSILLLPNQSSTDYTT--KNAV-KPQP----------QHWNPFQRAAAMALDVV 103

Query: 96  ENALVSAERKHSLPKTADPAVQIAGNFAPVPEQPVRKGLPVIGKVPDCIRGVYVRNGANP 155
           ENALVS ER H LPKTADP VQI+GNFAPVP+QPV+  LPVIG +PDC++GVYVRNGANP
Sbjct: 104 ENALVSHERHHPLPKTADPTVQISGNFAPVPDQPVKHNLPVIGTIPDCLQGVYVRNGANP 163

Query: 156 LHEPVSGHHLFDGDGMVHAVEFSEGGGVSYACRFTETQRLVQERAYGRPVFPKAIGELHG 215
           LHEPV+GHH FDGDGMVHAV+F + G  SYACRFTET RLVQERA GRPVFPKAIGELHG
Sbjct: 164 LHEPVAGHHFFDGDGMVHAVQF-QNGSASYACRFTETNRLVQERALGRPVFPKAIGELHG 223

Query: 216 HSGIARLMLFYARGLFGLVDHNHGIGVANAGLVFFNGRLLAMSEDDLPYQIRVTPAGDLK 275
           HSGIARL+LFYARGLFGLVD +HG GVANAGLV+FNG LLAMSEDDLPY +RVTP+GDL+
Sbjct: 224 HSGIARLLLFYARGLFGLVDPSHGTGVANAGLVYFNGHLLAMSEDDLPYHVRVTPSGDLE 283

Query: 276 TVGRFDFDGQLTSTMIAHPKLDPVSGEMFALSYDVIQKPYLKYFKFSPEGEKSPDVEIPL 335
           TVGR+DFDGQL STMIAHPK+DP +GE FALSYDVIQKPYLKYF+FSP+G+KSPDVEIP+
Sbjct: 284 TVGRYDFDGQLKSTMIAHPKVDPETGEFFALSYDVIQKPYLKYFRFSPDGKKSPDVEIPV 343

Query: 336 PQPTMMHDFAITEKFVVIPDQQVVFKLPEMIRGGSPVVYDKEKTSRFGILDKNATDANSI 395
             P MMHDFAITE  VVIPDQQVVFKLPEMI GGSPVVYDK K SRFGILDKNATDA+ I
Sbjct: 344 DGPIMMHDFAITENLVVIPDQQVVFKLPEMIHGGSPVVYDKNKMSRFGILDKNATDASGI 403

Query: 396 KWIEAPDCFCFHLWNAWEEPETNEIVVIGSCMTPPDSIFNECEENLKSVLSEIRLNLSTG 455
            W+EAPDCFCFHLWNAWEEPET+E+VVIGSCMTPPDSIFNEC+E+LKSVLSEIRLNL TG
Sbjct: 404 TWVEAPDCFCFHLWNAWEEPETDEVVVIGSCMTPPDSIFNECDESLKSVLSEIRLNLRTG 463

Query: 456 KSTRRPIITETEQVNLEAGMVNRNLLGRKTQFAYLALAEPWPKVSGFAKVDLFTGEIKKY 515
           KSTRRPII+++EQVNLEAGMVNRNLLGRKT++AYLALAEPWPKVSGFAKVDL TGEIKKY
Sbjct: 464 KSTRRPIISDSEQVNLEAGMVNRNLLGRKTRYAYLALAEPWPKVSGFAKVDLSTGEIKKY 523

Query: 516 LYGEQRYGGEPLFLPRE-GAEAEDDGHILAFVHDEKEWKSELQIVNAMTLELEATVKLPS 575
           +YG+QRYGGEPLF PR   +E EDDG+ILAFVHDE+ WKSE+QIVNAM LE+EATVKLPS
Sbjct: 524 IYGDQRYGGEPLFFPRNPNSENEDDGYILAFVHDERTWKSEVQIVNAMNLEVEATVKLPS 583

Query: 576 RVPYGFHGTFISSEDLQKQ 594
           RVPYGFHGTFI+S+DL+KQ
Sbjct: 584 RVPYGFHGTFINSKDLEKQ 588

BLAST of ClCG07G010550 vs. TrEMBL
Match: K4IAN1_FRAAN (9-cis-epoxycarotenoid dioxygenase 2 OS=Fragaria ananassa GN=NCED2 PE=2 SV=1)

HSP 1 Score: 905.6 bits (2339), Expect = 3.2e-260
Identity = 451/603 (74.79%), Positives = 511/603 (84.74%), Query Frame = 1

Query: 4   LSSSSWIKSGFSSSMPENLSVNTNYGRNRVATSCSLHTPSIIQIPKHSHTAFPSPIKTAV 63
           LSSS    SG  SS   +LSV++   +N    +CSL TPS IQ PK + T +  P  ++ 
Sbjct: 22  LSSSRRRDSG--SSTATSLSVSST-PKNTNTITCSLQTPSFIQFPKQAPT-YSQPSSSSS 81

Query: 64  PKPLPVSP----PSLSSASDD--------WNFLQRAAAMALDAVENALVSAERKHSLPKT 123
              L  +P    P  SS+S+         WN  QR AAMA+DA+E+ALVS E +H LPKT
Sbjct: 82  TTILTKTPKDQKPLTSSSSNSKPVVQPQQWNLFQRVAAMAIDAMESALVSKELEHPLPKT 141

Query: 124 ADPAVQIAGNFAPVPEQPVRKGLPVIGKVPDCIRGVYVRNGANPLHEPVSGHHLFDGDGM 183
           ADP VQIAGNFAPVPEQPV+  LPV GK+P+CIRGVYVRNGANPLHEPV+GHHLFDGDGM
Sbjct: 142 ADPKVQIAGNFAPVPEQPVKHSLPVTGKIPECIRGVYVRNGANPLHEPVAGHHLFDGDGM 201

Query: 184 VHAVEFSEGGGVSYACRFTETQRLVQERAYGRPVFPKAIGELHGHSGIARLMLFYARGLF 243
           VHA+ F+  G  SYACRFTET R+VQERA GRP+FPKAIGELHGHSGIARL LFY RG  
Sbjct: 202 VHALSFNSDGSASYACRFTETHRMVQERALGRPMFPKAIGELHGHSGIARLALFYLRGAC 261

Query: 244 GLVDHNHGIGVANAGLVFFNGRLLAMSEDDLPYQIRVTPAGDLKTVGRFDFDGQLTSTMI 303
           GLVD +HG+GVANAGLV+FNGRLLAMSEDDLPY +RVT  GDLKT GR+DF+ QL STMI
Sbjct: 262 GLVDPSHGLGVANAGLVYFNGRLLAMSEDDLPYHVRVTKTGDLKTEGRYDFNDQLKSTMI 321

Query: 304 AHPKLDPVSGEMFALSYDVIQKPYLKYFKFSPEGEKSPDVEIPLPQPTMMHDFAITEKFV 363
           AHPK+DP +GE+FALSYDV+QKPYLKYFKFSP G KSPDVEIPL QPTMMHDFAITE+FV
Sbjct: 322 AHPKVDPATGELFALSYDVVQKPYLKYFKFSPNGTKSPDVEIPLAQPTMMHDFAITERFV 381

Query: 364 VIPDQQVVFKLPEMIRGGSPVVYDKEKTSRFGILDKNATDANSIKWIEAPDCFCFHLWNA 423
           VIPDQQVVF LPEMIRGGSPV+YDK K +RFGILDKNA+DA+ I+W+EAPDCFCFHLWNA
Sbjct: 382 VIPDQQVVFNLPEMIRGGSPVIYDKNKVARFGILDKNASDASGIRWVEAPDCFCFHLWNA 441

Query: 424 WEEPETNEIVVIGSCMTPPDSIFNECEENLKSVLSEIRLNLSTGKSTRRPIITETEQVNL 483
           WEEP+T+E+VVIGSCMTPPDSIFNEC+E L+SVLSEIRLNL TGKSTRRPI   +EQ+NL
Sbjct: 442 WEEPDTDEVVVIGSCMTPPDSIFNECDECLESVLSEIRLNLKTGKSTRRPIC--SEQMNL 501

Query: 484 EAGMVNRNLLGRKTQFAYLALAEPWPKVSGFAKVDLFTGEIKKYLYGEQRYGGEPLFLPR 543
           EAGMVNRN LGRKT+FAYLALAEPWPKVSGFAKVDLFTGE+KK++YGEQR+GGEPLFLPR
Sbjct: 502 EAGMVNRNKLGRKTRFAYLALAEPWPKVSGFAKVDLFTGEVKKHIYGEQRFGGEPLFLPR 561

Query: 544 E-GAEAEDDGHILAFVHDEKEWKSELQIVNAMTLELEATVKLPSRVPYGFHGTFISSEDL 594
           +  +E EDDG+ILAFVHDEKEWKSELQIVNAMTL+LEA++KLPSRVPYGFHGTFISS+DL
Sbjct: 562 DPNSENEDDGYILAFVHDEKEWKSELQIVNAMTLKLEASIKLPSRVPYGFHGTFISSKDL 618

BLAST of ClCG07G010550 vs. TAIR10
Match: AT3G14440.1 (AT3G14440.1 nine-cis-epoxycarotenoid dioxygenase 3)

HSP 1 Score: 876.7 bits (2264), Expect = 8.1e-255
Identity = 436/592 (73.65%), Positives = 489/592 (82.60%), Query Frame = 1

Query: 4   LSSSSWIKSGFSSSMPENLSVNTNYGRNRVATSCSLHTPSIIQIPKHSHTAFPSPIKTAV 63
           LSSS      + SS+P    V       ++  S +LHTP  +  PK S  +         
Sbjct: 24  LSSSQSSDLSYCSSLPMASRVT-----RKLNVSSALHTPPALHFPKQSSNS--------- 83

Query: 64  PKPLPVSPPSLSSASDDWNFLQRAAAMALDAVENALVSAERKHSLPKTADPAVQIAGNFA 123
              + V P +  S +   N  QRAAA ALDA E  LVS E+ H LPKTADP+VQIAGNFA
Sbjct: 84  -PAIVVKPKAKESNTKQMNLFQRAAAAALDAAEGFLVSHEKLHPLPKTADPSVQIAGNFA 143

Query: 124 PVPEQPVRKGLPVIGKVPDCIRGVYVRNGANPLHEPVSGHHLFDGDGMVHAVEFSEGGGV 183
           PV EQPVR+ LPV+GK+PD I+GVYVRNGANPLHEPV+GHH FDGDGMVHAV+F E G  
Sbjct: 144 PVNEQPVRRNLPVVGKLPDSIKGVYVRNGANPLHEPVTGHHFFDGDGMVHAVKF-EHGSA 203

Query: 184 SYACRFTETQRLVQERAYGRPVFPKAIGELHGHSGIARLMLFYARGLFGLVDHNHGIGVA 243
           SYACRFT+T R VQER  GRPVFPKAIGELHGH+GIARLMLFYAR   G+VD  HG GVA
Sbjct: 204 SYACRFTQTNRFVQERQLGRPVFPKAIGELHGHTGIARLMLFYARAAAGIVDPAHGTGVA 263

Query: 244 NAGLVFFNGRLLAMSEDDLPYQIRVTPAGDLKTVGRFDFDGQLTSTMIAHPKLDPVSGEM 303
           NAGLV+FNGRLLAMSEDDLPYQ+++TP GDLKTVGRFDFDGQL STMIAHPK+DP SGE+
Sbjct: 264 NAGLVYFNGRLLAMSEDDLPYQVQITPNGDLKTVGRFDFDGQLESTMIAHPKVDPESGEL 323

Query: 304 FALSYDVIQKPYLKYFKFSPEGEKSPDVEIPLPQPTMMHDFAITEKFVVIPDQQVVFKLP 363
           FALSYDV+ KPYLKYF+FSP+G KSPDVEI L QPTMMHDFAITE FVV+PDQQVVFKLP
Sbjct: 324 FALSYDVVSKPYLKYFRFSPDGTKSPDVEIQLDQPTMMHDFAITENFVVVPDQQVVFKLP 383

Query: 364 EMIRGGSPVVYDKEKTSRFGILDKNATDANSIKWIEAPDCFCFHLWNAWEEPETNEIVVI 423
           EMIRGGSPVVYDK K +RFGILDK A D+++IKWI+APDCFCFHLWNAWEEPET+E+VVI
Sbjct: 384 EMIRGGSPVVYDKNKVARFGILDKYAEDSSNIKWIDAPDCFCFHLWNAWEEPETDEVVVI 443

Query: 424 GSCMTPPDSIFNECEENLKSVLSEIRLNLSTGKSTRRPIIT-ETEQVNLEAGMVNRNLLG 483
           GSCMTPPDSIFNE +ENLKSVLSEIRLNL TG+STRRPII+ E +QVNLEAGMVNRN+LG
Sbjct: 444 GSCMTPPDSIFNESDENLKSVLSEIRLNLKTGESTRRPIISNEDQQVNLEAGMVNRNMLG 503

Query: 484 RKTQFAYLALAEPWPKVSGFAKVDLFTGEIKKYLYGEQRYGGEPLFLPREGAEAEDDGHI 543
           RKT+FAYLALAEPWPKVSGFAKVDL TGE+KK+LYG+ RYGGEPLFLP EG E ED+G+I
Sbjct: 504 RKTKFAYLALAEPWPKVSGFAKVDLTTGEVKKHLYGDNRYGGEPLFLPGEGGE-EDEGYI 563

Query: 544 LAFVHDEKEWKSELQIVNAMTLELEATVKLPSRVPYGFHGTFISSEDLQKQI 595
           L FVHDEK WKSELQIVNA++LE+EATVKLPSRVPYGFHGTFI ++DL KQ+
Sbjct: 564 LCFVHDEKTWKSELQIVNAVSLEVEATVKLPSRVPYGFHGTFIGADDLAKQV 598

BLAST of ClCG07G010550 vs. TAIR10
Match: AT1G78390.1 (AT1G78390.1 nine-cis-epoxycarotenoid dioxygenase 9)

HSP 1 Score: 820.8 bits (2119), Expect = 5.3e-238
Identity = 414/586 (70.65%), Positives = 478/586 (81.57%), Query Frame = 1

Query: 15  SSSMP--ENLSVNTNYGRNRVATSC----SLHTPSIIQIPKHSHTAFPSPIKTAVPKPLP 74
           SSS P  ++LS ++     ++   C    S++  S +     S T F  P    +   + 
Sbjct: 74  SSSRPKLQSLSFSSTLRNKKLVVPCYVSSSVNKKSSVSSSLQSPT-FKPPSWKKLCNDVT 133

Query: 75  VSPPSLSSASDDWNFLQRAAAMALDAVENALVSAERK-HSLPKTADPAVQIAGNFAPVPE 134
              P  ++ +   N +QR AAM LDAVENA++S ER+ H  PKTADPAVQIAGNF PVPE
Sbjct: 134 NLIPKTTNQNPKLNPVQRTAAMVLDAVENAMISHERRRHPHPKTADPAVQIAGNFFPVPE 193

Query: 135 QPVRKGLPVIGKVPDCIRGVYVRNGANPLHEPVSGHHLFDGDGMVHAVEFSEGGGVSYAC 194
           +PV   LPV G VP+CI+GVYVRNGANPLH+PVSGHHLFDGDGMVHAV F + G VSYAC
Sbjct: 194 KPVVHNLPVTGTVPECIQGVYVRNGANPLHKPVSGHHLFDGDGMVHAVRF-DNGSVSYAC 253

Query: 195 RFTETQRLVQERAYGRPVFPKAIGELHGHSGIARLMLFYARGLFGLVDHNHGIGVANAGL 254
           RFTET RLVQER  GRPVFPKAIGELHGH GIA+LMLF  RGLFGLVD   G+GVANAGL
Sbjct: 254 RFTETNRLVQERECGRPVFPKAIGELHGHLGIAKLMLFNTRGLFGLVDPTGGLGVANAGL 313

Query: 255 VFFNGRLLAMSEDDLPYQIRVTPAGDLKTVGRFDFDGQLTSTMIAHPKLDPVSGEMFALS 314
           V+FNG LLAMSEDDLPY ++VT  GDL+T GR+DFDGQL STMIAHPK+DP + E+FALS
Sbjct: 314 VYFNGHLLAMSEDDLPYHVKVTQTGDLETSGRYDFDGQLKSTMIAHPKIDPETRELFALS 373

Query: 315 YDVIQKPYLKYFKFSPEGEKSPDVEIPLPQPTMMHDFAITEKFVVIPDQQVVFKLPEMIR 374
           YDV+ KPYLKYF+F+ +GEKSPDVEIPL QPTM+HDFAITE FVVIPDQQVVF+LPEMIR
Sbjct: 374 YDVVSKPYLKYFRFTSDGEKSPDVEIPLDQPTMIHDFAITENFVVIPDQQVVFRLPEMIR 433

Query: 375 GGSPVVYDKEKTSRFGILDKNATDANSIKWIEAPDCFCFHLWNAWEEPETNEIVVIGSCM 434
           GGSPVVYD++K SRFGIL+KNA DA+SI+WIE PDCFCFHLWN+WEEPET+E+VVIGSCM
Sbjct: 434 GGSPVVYDEKKKSRFGILNKNAKDASSIQWIEVPDCFCFHLWNSWEEPETDEVVVIGSCM 493

Query: 435 TPPDSIFNECEENLKSVLSEIRLNLSTGKSTRRPIITETEQVNLEAGMVNRNLLGRKTQF 494
           TPPDSIFNE +E L+SVLSEIRLNL TG+STRRP+I  +EQVNLEAGMVNRNLLGRKT++
Sbjct: 494 TPPDSIFNEHDETLQSVLSEIRLNLKTGESTRRPVI--SEQVNLEAGMVNRNLLGRKTRY 553

Query: 495 AYLALAEPWPKVSGFAKVDLFTGEIKKYLYGEQRYGGEPLFLPREGAEAEDDGHILAFVH 554
           AYLAL EPWPKVSGFAKVDL TGEI+KY+YGE +YGGEPLFLP  G   ED G+I+ FVH
Sbjct: 554 AYLALTEPWPKVSGFAKVDLSTGEIRKYIYGEGKYGGEPLFLP-SGDGEEDGGYIMVFVH 613

Query: 555 DEKEWKSELQIVNAMTLELEATVKLPSRVPYGFHGTFISSEDLQKQ 594
           DE++ KSELQ++NA+ ++LEATV LPSRVPYGFHGTFIS EDL KQ
Sbjct: 614 DEEKVKSELQLINAVNMKLEATVTLPSRVPYGFHGTFISKEDLSKQ 654

BLAST of ClCG07G010550 vs. TAIR10
Match: AT1G30100.1 (AT1G30100.1 nine-cis-epoxycarotenoid dioxygenase 5)

HSP 1 Score: 806.6 bits (2082), Expect = 1.0e-233
Identity = 395/570 (69.30%), Positives = 460/570 (80.70%), Query Frame = 1

Query: 25  NTNYGRNRVATSCSLHTPSIIQIPKHSHTAFPSPIKTAVPKPLPVSPPSLSSASDDWNFL 84
           NT   R +++ +    TP+++  P +             P P P+ P   +S    WN L
Sbjct: 36  NTKPRRRKLSANSVSDTPNLLNFPNY-------------PSPNPIIPEKDTSR---WNPL 95

Query: 85  QRAAAMALDAVENALVSAERKHSLPKTADPAVQIAGNFAPVPEQPVRKGLPVIGKVPDCI 144
           QRAA+ ALD  E AL+  ER   LPKT DP  QI+GN+APVPEQ V+  L V GK+PDCI
Sbjct: 96  QRAASAALDFAETALLRRERSKPLPKTVDPRHQISGNYAPVPEQSVKSSLSVDGKIPDCI 155

Query: 145 RGVYVRNGANPLHEPVSGHHLFDGDGMVHAVEFSEGGGVSYACRFTETQRLVQERAYGRP 204
            GVY+RNGANPL EPVSGHHLFDGDGMVHAV+ + G   SY+CRFTET+RLVQE+  G P
Sbjct: 156 DGVYLRNGANPLFEPVSGHHLFDGDGMVHAVKITNGDA-SYSCRFTETERLVQEKQLGSP 215

Query: 205 VFPKAIGELHGHSGIARLMLFYARGLFGLVDHNHGIGVANAGLVFFNGRLLAMSEDDLPY 264
           +FPKAIGELHGHSGIARLMLFYARGLFGL++H +G GVANAGLV+F+ RLLAMSEDDLPY
Sbjct: 216 IFPKAIGELHGHSGIARLMLFYARGLFGLLNHKNGTGVANAGLVYFHDRLLAMSEDDLPY 275

Query: 265 QIRVTPAGDLKTVGRFDFDGQLTSTMIAHPKLDPVSGEMFALSYDVIQKPYLKYFKFSPE 324
           Q+RVT  GDL+T+GRFDFDGQL+S MIAHPK+DPV+ E+FALSYDV++KPYLKYFKFSPE
Sbjct: 276 QVRVTDNGDLETIGRFDFDGQLSSAMIAHPKIDPVTKELFALSYDVVKKPYLKYFKFSPE 335

Query: 325 GEKSPDVEIPLPQPTMMHDFAITEKFVVIPDQQVVFKLPEMIRGGSPVVYDKEKTSRFGI 384
           GEKSPDVEIPL  PTMMHDFAITE FVVIPDQQVVFKL +M  G SPV YD EK SRFGI
Sbjct: 336 GEKSPDVEIPLASPTMMHDFAITENFVVIPDQQVVFKLSDMFLGKSPVKYDGEKISRFGI 395

Query: 385 LDKNATDANSIKWIEAPDCFCFHLWNAWEEPETNEIVVIGSCMTPPDSIFNECEENLKSV 444
           L +NA DA+ + W+E+P+ FCFHLWNAWE PET+E+VVIGSCMTP DSIFNEC+E L SV
Sbjct: 396 LPRNAKDASEMVWVESPETFCFHLWNAWESPETDEVVVIGSCMTPADSIFNECDEQLNSV 455

Query: 445 LSEIRLNLSTGKSTRRPIITETEQVNLEAGMVNRNLLGRKTQFAYLALAEPWPKVSGFAK 504
           LSEIRLNL TGKSTRR II  + Q+NLEAGMVNRNLLGRKT++AYLA+AEPWPKVSGFAK
Sbjct: 456 LSEIRLNLKTGKSTRRTIIPGSVQMNLEAGMVNRNLLGRKTRYAYLAIAEPWPKVSGFAK 515

Query: 505 VDLFTGEIKKYLYGEQRYGGEPLFLPRE-GAEAEDDGHILAFVHDEKEWKSELQIVNAMT 564
           VDL TGE+K + YG ++YGGEP FLPR   ++ EDDG+I++FVHDE+ W+SEL IVNA+T
Sbjct: 516 VDLSTGEVKNHFYGGKKYGGEPFFLPRGLESDGEDDGYIMSFVHDEESWESELHIVNAVT 575

Query: 565 LELEATVKLPSRVPYGFHGTFISSEDLQKQ 594
           LELEATVKLPSRVPYGFHGTF++S D+  Q
Sbjct: 576 LELEATVKLPSRVPYGFHGTFVNSADMLNQ 588

BLAST of ClCG07G010550 vs. TAIR10
Match: AT4G18350.1 (AT4G18350.1 nine-cis-epoxycarotenoid dioxygenase 2)

HSP 1 Score: 752.3 bits (1941), Expect = 2.3e-217
Identity = 363/516 (70.35%), Positives = 430/516 (83.33%), Query Frame = 1

Query: 82  NFLQRAAAMALDAVENALVSAERKHSLPKTADPAVQIAGNFAPVPEQPVRKGLPVIGKVP 141
           N  Q+AAA+A+DA E AL+S E+   LPKTADP VQIAGN++PVPE  VR+ L V G +P
Sbjct: 70  NIFQKAAAIAIDAAERALISHEQDSPLPKTADPRVQIAGNYSPVPESSVRRNLTVEGTIP 129

Query: 142 DCIRGVYVRNGANPLHEPVSGHHLFDGDGMVHAVEFSEGGGVSYACRFTETQRLVQERAY 201
           DCI GVY+RNGANP+ EP +GHHLFDGDGMVHAV+ + G   SYACRFT+T+RLVQE+  
Sbjct: 130 DCIDGVYIRNGANPMFEPTAGHHLFDGDGMVHAVKITNGSA-SYACRFTKTERLVQEKRL 189

Query: 202 GRPVFPKAIGELHGHSGIARLMLFYARGLFGLVDHNHGIGVANAGLVFFNGRLLAMSEDD 261
           GRPVFPKAIGELHGHSGIARLMLFYARGL GL+++ +G+GVANAGLV+FN RLLAMSEDD
Sbjct: 190 GRPVFPKAIGELHGHSGIARLMLFYARGLCGLINNQNGVGVANAGLVYFNNRLLAMSEDD 249

Query: 262 LPYQIRVTPAGDLKTVGRFDFDGQLTSTMIAHPKLDPVSGEMFALSYDVIQKPYLKYFKF 321
           LPYQ+++T  GDL+TVGR+DFDGQL S MIAHPKLDPV+ E+ ALSYDV++KPYLKYF+F
Sbjct: 250 LPYQLKITQTGDLQTVGRYDFDGQLKSAMIAHPKLDPVTKELHALSYDVVKKPYLKYFRF 309

Query: 322 SPEGEKSPDVEIPLPQPTMMHDFAITEKFVVIPDQQVVFKLPEMIRGGSPVVYDKEKTSR 381
           SP+G KSP++EIPL  PTM+HDFAITE FVVIPDQQVVFKL EMI G SPVV+D EK SR
Sbjct: 310 SPDGVKSPELEIPLETPTMIHDFAITENFVVIPDQQVVFKLGEMISGKSPVVFDGEKVSR 369

Query: 382 FGILDKNATDANSIKWIEAPDCFCFHLWNAWEEPETNEIVVIGSCMTPPDSIFNECEENL 441
            GI+ K+AT+A+ I W+ +P+ FCFHLWNAWE PET EIVVIGSCM+P DSIFNE +E+L
Sbjct: 370 LGIMPKDATEASQIIWVNSPETFCFHLWNAWESPETEEIVVIGSCMSPADSIFNERDESL 429

Query: 442 KSVLSEIRLNLSTGKSTRRPIITETEQVNLEAGMVNRNLLGRKTQFAYLALAEPWPKVSG 501
           +SVLSEIR+NL T K+TRR ++   E VNLE GMVNRN LGRKT+FA+LA+A PWPKVSG
Sbjct: 430 RSVLSEIRINLRTRKTTRRSLLV-NEDVNLEIGMVNRNRLGRKTRFAFLAIAYPWPKVSG 489

Query: 502 FAKVDLFTGEIKKYLYGEQRYGGEPLFLP---REGAEAEDDGHILAFVHDEKEWKSELQI 561
           FAKVDL TGE+KKY+YG ++YGGEP FLP     G E EDDG+I   VHDE+   SELQI
Sbjct: 490 FAKVDLCTGEMKKYIYGGEKYGGEPFFLPGNSGNGEENEDDGYIFCHVHDEETKTSELQI 549

Query: 562 VNAMTLELEATVKLPSRVPYGFHGTFISSEDLQKQI 595
           +NA+ L+LEAT+KLPSRVPYGFHGTF+ S +L  Q+
Sbjct: 550 INAVNLKLEATIKLPSRVPYGFHGTFVDSNELVDQL 583

BLAST of ClCG07G010550 vs. TAIR10
Match: AT3G24220.1 (AT3G24220.1 nine-cis-epoxycarotenoid dioxygenase 6)

HSP 1 Score: 644.4 bits (1661), Expect = 6.7e-185
Identity = 314/545 (57.61%), Positives = 405/545 (74.31%), Query Frame = 1

Query: 55  FPSPIKTAVPKPLPVSPPSLSSASDDWNFLQRAAAMALDAVENALV-SAERKHSLPKTAD 114
           F  P    +  P+P SP  L     + N LQ+ AA  LD +E+++V   E+   LPK  D
Sbjct: 38  FKIPTLPDLTSPVP-SPVKLKPTYPNLNLLQKLAATMLDKIESSIVIPMEQNRPLPKPTD 97

Query: 115 PAVQIAGNFAPVPEQPVRKGLPVIGKVPDCIRGVYVRNGANPLHEPVSGHHLFDGDGMVH 174
           PAVQ++GNFAPV E PV+ GL V+G++P C++GVY+RNGANP+  P++GHHLFDGDGM+H
Sbjct: 98  PAVQLSGNFAPVNECPVQNGLEVVGQIPSCLKGVYIRNGANPMFPPLAGHHLFDGDGMIH 157

Query: 175 AVEFSEGGGVSYACRFTETQRLVQERAYGRPVFPKAIGELHGHSGIARLMLFYARGLFGL 234
           AV       VSY+CR+T+T RLVQE A GR VFPK IGELHGHSG+ARL LF AR   GL
Sbjct: 158 AVSIGFDNQVSYSCRYTKTNRLVQETALGRSVFPKPIGELHGHSGLARLALFTARAGIGL 217

Query: 235 VDHNHGIGVANAGLVFFNGRLLAMSEDDLPYQIRVTPAGDLKTVGRFDFDGQLTSTMIAH 294
           VD   G+GVANAG+VFFNGRLLAMSEDDLPYQ+++   GDL+T+GRF FD Q+ S++IAH
Sbjct: 218 VDGTRGMGVANAGVVFFNGRLLAMSEDDLPYQVKIDGQGDLETIGRFGFDDQIDSSVIAH 277

Query: 295 PKLDPVSGEMFALSYDVIQKPYLKYFKFSPEGEKSPDVEIPLPQPTMMHDFAITEKFVVI 354
           PK+D  +G++  LSY+V++KP+L+Y KF+  G+K+ DVEI LP+PTM+HDFAITE FVVI
Sbjct: 278 PKVDATTGDLHTLSYNVLKKPHLRYLKFNTCGKKTRDVEITLPEPTMIHDFAITENFVVI 337

Query: 355 PDQQVVFKLPEMIRGGSPVVYDKEKTSRFGILDKNATDANSIKWIEAPDCFCFHLWNAWE 414
           PDQQ+VFKL EMIRGGSPV+Y KEK +RFG+L K     + I W++ PDCFCFHLWNAWE
Sbjct: 338 PDQQMVFKLSEMIRGGSPVIYVKEKMARFGVLSKQDLTGSDINWVDVPDCFCFHLWNAWE 397

Query: 415 EPETNE----IVVIGSCMTPPDSIFNECEENLKSVLSEIRLNLSTGKSTRRPIITETEQV 474
           E  T E    IVVIGSCM+PPD+IF+E  E  +  LSEIRLN+ T +S R+ I+T    V
Sbjct: 398 E-RTEEGDPVIVVIGSCMSPPDTIFSESGEPTRVELSEIRLNMRTKESNRKVIVT---GV 457

Query: 475 NLEAGMVNRNLLGRKTQFAYLALAEPWPKVSGFAKVDLFTGEIKKYLYGEQRYGGEPLFL 534
           NLEAG +NR+ +GRK+QF Y+A+A+PWPK SG AKVD+  G + ++ YG  R+GGEP F+
Sbjct: 458 NLEAGHINRSYVGRKSQFVYIAIADPWPKCSGIAKVDIQNGTVSEFNYGPSRFGGEPCFV 517

Query: 535 PREGAEAEDDGHILAFVHDEKEWKSELQIVNAMTLELEATVKLPSRVPYGFHGTFISSED 594
           P EG   ED G+++ FV DE++ +SE  +V+A  ++  A V+LP RVPYGFHGTF+S   
Sbjct: 518 P-EGEGEEDKGYVMGFVRDEEKDESEFVVVDATDMKQVAAVRLPERVPYGFHGTFVSENQ 576

BLAST of ClCG07G010550 vs. NCBI nr
Match: gi|659131081|ref|XP_008465499.1| (PREDICTED: 9-cis-epoxycarotenoid dioxygenase NCED3, chloroplastic-like [Cucumis melo])

HSP 1 Score: 1115.9 bits (2885), Expect = 0.0e+00
Identity = 548/592 (92.57%), Positives = 567/592 (95.78%), Query Frame = 1

Query: 4   LSSSSWIKSGFSSSMPENLSVNTNYGRNRVATSCSLHTPSIIQIPKHSHTAFPSPIKTAV 63
           ++SSS++KS FSSSMP+    N    R  ++ S SLHTPSIIQIPKHSHT FPSP+KT++
Sbjct: 1   MASSSFLKSPFSSSMPDKTLSNYT-SRRLLSVSSSLHTPSIIQIPKHSHTTFPSPLKTSI 60

Query: 64  PKPLPVSPPSLSSASDDWNFLQRAAAMALDAVENALVSAERKHSLPKTADPAVQIAGNFA 123
           PKP P+S     S S+DWNFLQRAAAMALDAVENAL+SAERKHSLPKTADPAVQIAGNFA
Sbjct: 61  PKPPPLSTTPPLSGSEDWNFLQRAAAMALDAVENALISAERKHSLPKTADPAVQIAGNFA 120

Query: 124 PVPEQPVRKGLPVIGKVPDCIRGVYVRNGANPLHEPVSGHHLFDGDGMVHAVEFSEGGGV 183
           PVPEQPVRKGLPVIGKVP+CIRGVYVRNGANPLHEPVSGHHLFDGDGMVHAVEFSE GGV
Sbjct: 121 PVPEQPVRKGLPVIGKVPECIRGVYVRNGANPLHEPVSGHHLFDGDGMVHAVEFSEAGGV 180

Query: 184 SYACRFTETQRLVQERAYGRPVFPKAIGELHGHSGIARLMLFYARGLFGLVDHNHGIGVA 243
           SYACRFTETQRLVQERAYGRPVFPKAIGELHGHSGIARLMLFYARGLFGLVDHN GIGVA
Sbjct: 181 SYACRFTETQRLVQERAYGRPVFPKAIGELHGHSGIARLMLFYARGLFGLVDHNQGIGVA 240

Query: 244 NAGLVFFNGRLLAMSEDDLPYQIRVTPAGDLKTVGRFDFDGQLTSTMIAHPKLDPVSGEM 303
           NAGLV+FNGRLLAMSEDDLPYQIRVTPAGDLKTVGRFDFDGQL STMIAHPKLDPVSGEM
Sbjct: 241 NAGLVYFNGRLLAMSEDDLPYQIRVTPAGDLKTVGRFDFDGQLKSTMIAHPKLDPVSGEM 300

Query: 304 FALSYDVIQKPYLKYFKFSPEGEKSPDVEIPLPQPTMMHDFAITEKFVVIPDQQVVFKLP 363
           FALSYDVIQKPYLKYFKFSPEGEKSPDVEIPLPQPTMMHDFAITEK+VVIPDQQVVFKLP
Sbjct: 301 FALSYDVIQKPYLKYFKFSPEGEKSPDVEIPLPQPTMMHDFAITEKYVVIPDQQVVFKLP 360

Query: 364 EMIRGGSPVVYDKEKTSRFGILDKNATDANSIKWIEAPDCFCFHLWNAWEEPETNEIVVI 423
           EMIRGGSPVVYDKEKTSRFGILDKNA DAN+IKWIEAPDCFCFHLWNAWEEPETNEIVVI
Sbjct: 361 EMIRGGSPVVYDKEKTSRFGILDKNAADANAIKWIEAPDCFCFHLWNAWEEPETNEIVVI 420

Query: 424 GSCMTPPDSIFNECEENLKSVLSEIRLNLSTGKSTRRPIITETEQVNLEAGMVNRNLLGR 483
           GSCMTPPDSIFNECEENLKSVLSEIRLNLSTGKSTRRPII+ETEQVNLEAGMVNRNLLGR
Sbjct: 421 GSCMTPPDSIFNECEENLKSVLSEIRLNLSTGKSTRRPIISETEQVNLEAGMVNRNLLGR 480

Query: 484 KTQFAYLALAEPWPKVSGFAKVDLFTGEIKKYLYGEQRYGGEPLFLPREGAEAEDDGHIL 543
           KTQF+YLALAEPWPKVSGFAKVDLFTGEIKKYLYGEQRYGGEPLFLPREGAEAEDDGHIL
Sbjct: 481 KTQFSYLALAEPWPKVSGFAKVDLFTGEIKKYLYGEQRYGGEPLFLPREGAEAEDDGHIL 540

Query: 544 AFVHDEKEWKSELQIVNAMTLELEATVKLPSRVPYGFHGTFISSEDLQKQIR 596
           AFVHDEKEWKSELQIVNAMTLELEATVKLPSRVPYGFHGTFISSEDLQKQIR
Sbjct: 541 AFVHDEKEWKSELQIVNAMTLELEATVKLPSRVPYGFHGTFISSEDLQKQIR 591

BLAST of ClCG07G010550 vs. NCBI nr
Match: gi|449460068|ref|XP_004147768.1| (PREDICTED: 9-cis-epoxycarotenoid dioxygenase NCED3, chloroplastic-like [Cucumis sativus])

HSP 1 Score: 1107.8 bits (2864), Expect = 0.0e+00
Identity = 545/596 (91.44%), Positives = 570/596 (95.64%), Query Frame = 1

Query: 1   MASLSSSSWIKSGFSSSMPENLSVNTNYGRNRV-ATSCSLHTPSIIQIPKHSHTAFPSPI 60
           MAS SSSS+++S FSSSMP+     +NY  NR+ + S SLHTPSIIQIPKHSHT FPSP+
Sbjct: 1   MASSSSSSFLQSPFSSSMPDKKL--SNYTTNRLLSVSSSLHTPSIIQIPKHSHTTFPSPL 60

Query: 61  KTAVPKPLPVSPPSLSSASDDWNFLQRAAAMALDAVENALVSAERKHSLPKTADPAVQIA 120
           +T++PKP P++     S S DWNFLQRAAAMALDAVENAL+SAERKHSLPKTADPAVQIA
Sbjct: 61  QTSIPKPPPLTTTPPVSGSQDWNFLQRAAAMALDAVENALISAERKHSLPKTADPAVQIA 120

Query: 121 GNFAPVPEQPVRKGLPVIGKVPDCIRGVYVRNGANPLHEPVSGHHLFDGDGMVHAVEFSE 180
           GNFAPVPEQPVRKGLPVIGKVPD IRGVYVRNGANPLHEP+SGHHLFDGDGMVHAVEFSE
Sbjct: 121 GNFAPVPEQPVRKGLPVIGKVPDHIRGVYVRNGANPLHEPLSGHHLFDGDGMVHAVEFSE 180

Query: 181 GGGVSYACRFTETQRLVQERAYGRPVFPKAIGELHGHSGIARLMLFYARGLFGLVDHNHG 240
            GGVSYACRFTETQRLVQERAYGRPVFPKAIGELHGHSGIARLMLFYARGLFGLVDHNHG
Sbjct: 181 AGGVSYACRFTETQRLVQERAYGRPVFPKAIGELHGHSGIARLMLFYARGLFGLVDHNHG 240

Query: 241 IGVANAGLVFFNGRLLAMSEDDLPYQIRVTPAGDLKTVGRFDFDGQLTSTMIAHPKLDPV 300
           IGVANAGLV+FNGRLLAMSEDDLPYQIRVTPAGDLKTVGRF+FDGQL STMIAHPKLDPV
Sbjct: 241 IGVANAGLVYFNGRLLAMSEDDLPYQIRVTPAGDLKTVGRFNFDGQLESTMIAHPKLDPV 300

Query: 301 SGEMFALSYDVIQKPYLKYFKFSPEGEKSPDVEIPLPQPTMMHDFAITEKFVVIPDQQVV 360
           SGEMFALSYDVIQKPYLKYFKFSPEGEKSPDVEIPLPQPTMMHDFAITEK+VVIPDQQVV
Sbjct: 301 SGEMFALSYDVIQKPYLKYFKFSPEGEKSPDVEIPLPQPTMMHDFAITEKYVVIPDQQVV 360

Query: 361 FKLPEMIRGGSPVVYDKEKTSRFGILDKNATDANSIKWIEAPDCFCFHLWNAWEEPETNE 420
           FKLPEMIRGGSPVVYDKEKTSRFGILDKNATDAN+IKWIEAPDCFCFHLWNAWEEPETNE
Sbjct: 361 FKLPEMIRGGSPVVYDKEKTSRFGILDKNATDANAIKWIEAPDCFCFHLWNAWEEPETNE 420

Query: 421 IVVIGSCMTPPDSIFNECEENLKSVLSEIRLNLSTGKSTRRPIITETEQVNLEAGMVNRN 480
           +VVIGSCMTPPDSIFNECEENLKSVLSEIRLNLSTGKSTRRPIITETEQVNLEAGMVNRN
Sbjct: 421 VVVIGSCMTPPDSIFNECEENLKSVLSEIRLNLSTGKSTRRPIITETEQVNLEAGMVNRN 480

Query: 481 LLGRKTQFAYLALAEPWPKVSGFAKVDLFTGEIKKYLYGEQRYGGEPLFLPREGAEAEDD 540
           LLGRKTQF+YLALAEPWPKVSGFAKVD+ +GE+KKYLYGEQRYGGEPLFLPREGAEAEDD
Sbjct: 481 LLGRKTQFSYLALAEPWPKVSGFAKVDVLSGEVKKYLYGEQRYGGEPLFLPREGAEAEDD 540

Query: 541 GHILAFVHDEKEWKSELQIVNAMTLELEATVKLPSRVPYGFHGTFISSEDLQKQIR 596
           GHILAFVHDEKEWKSELQIVNAMTLELEATVKLPSRVPYGFHGTFIS +DLQKQIR
Sbjct: 541 GHILAFVHDEKEWKSELQIVNAMTLELEATVKLPSRVPYGFHGTFISCKDLQKQIR 594

BLAST of ClCG07G010550 vs. NCBI nr
Match: gi|408794955|gb|AFU91491.1| (9-cis-epoxycarotenoid dioxygenase [Momordica charantia])

HSP 1 Score: 1070.8 bits (2768), Expect = 8.3e-310
Identity = 523/593 (88.20%), Positives = 556/593 (93.76%), Query Frame = 1

Query: 1   MASLSSSSWIKSGFSSSMPENLSVNTNYGRNRVATSCSLHTPSIIQIPKHSHTAFPSPIK 60
           MAS  SS W+KSGFS SMPE LSVN+NY +NRV  SCSLHTPSI+QIPKHS T  P P+K
Sbjct: 1   MASAPSSPWLKSGFSPSMPELLSVNSNYVKNRV--SCSLHTPSIVQIPKHSQTFRPPPLK 60

Query: 61  TAVPKPLPVSPPSLSSASDDWNFLQRAAAMALDAVENALVSAERKHSLPKTADPAVQIAG 120
           T V KP  VSP +  SA +DWNFLQRAA+MA+DAVENAL+SAERK  LPKTADP VQIAG
Sbjct: 61  TPVEKP--VSPST--SAPEDWNFLQRAASMAIDAVENALLSAERKRPLPKTADPEVQIAG 120

Query: 121 NFAPVPEQPVRKGLPVIGKVPDCIRGVYVRNGANPLHEPVSGHHLFDGDGMVHAVEFSEG 180
           NFAPVPEQPVRK LPV G++P+CIRGVYVRNGANPLHEP +GHHLFDGDGMVHAVEFS+G
Sbjct: 121 NFAPVPEQPVRKALPVTGRIPECIRGVYVRNGANPLHEPAAGHHLFDGDGMVHAVEFSDG 180

Query: 181 GGVSYACRFTETQRLVQERAYGRPVFPKAIGELHGHSGIARLMLFYARGLFGLVDHNHGI 240
           GGVSYACRFTETQRLVQERA+GRPVFPKAIGELHGHSGIARL+LFYARGLFGLVDH+HGI
Sbjct: 181 GGVSYACRFTETQRLVQERAHGRPVFPKAIGELHGHSGIARLLLFYARGLFGLVDHSHGI 240

Query: 241 GVANAGLVFFNGRLLAMSEDDLPYQIRVTPAGDLKTVGRFDFDGQLTSTMIAHPKLDPVS 300
           GVANAGLV+FNGRLLAMSEDDLPYQIRVTPAGDLKT+GRFDFDGQL STMIAHPKLDPVS
Sbjct: 241 GVANAGLVYFNGRLLAMSEDDLPYQIRVTPAGDLKTIGRFDFDGQLKSTMIAHPKLDPVS 300

Query: 301 GEMFALSYDVIQKPYLKYFKFSPEGEKSPDVEIPLPQPTMMHDFAITEKFVVIPDQQVVF 360
           GE+FA+SYDVIQKP+LKYFKFSPEGEKSPDVE+PLPQPTMMHDFAITEKFVVIPDQQVVF
Sbjct: 301 GELFAISYDVIQKPHLKYFKFSPEGEKSPDVEVPLPQPTMMHDFAITEKFVVIPDQQVVF 360

Query: 361 KLPEMIRGGSPVVYDKEKTSRFGILDKNATDANSIKWIEAPDCFCFHLWNAWEEPETNEI 420
           KLPEMIRGGSPV+YDK K SRFGIL KNATDA  IKWIEAPDCFCFHLWNAWEEPETNE+
Sbjct: 361 KLPEMIRGGSPVIYDKNKISRFGILPKNATDAGDIKWIEAPDCFCFHLWNAWEEPETNEV 420

Query: 421 VVIGSCMTPPDSIFNECEENLKSVLSEIRLNLSTGKSTRRPIITETEQVNLEAGMVNRNL 480
           VVIGSCMTPPDSIFNECEENLKSVLSEIRLNL+TGKSTRRPII ETEQVNLEAGMVNRN 
Sbjct: 421 VVIGSCMTPPDSIFNECEENLKSVLSEIRLNLTTGKSTRRPIIPETEQVNLEAGMVNRNR 480

Query: 481 LGRKTQFAYLALAEPWPKVSGFAKVDLFTGEIKKYLYGEQRYGGEPLFLPREGAEAEDDG 540
           LGRKTQFAYLALAEPWPKVSGFAKVDL TGEI+KY+YGEQRYGGEPLFLPREGAEAEDDG
Sbjct: 481 LGRKTQFAYLALAEPWPKVSGFAKVDLSTGEIRKYIYGEQRYGGEPLFLPREGAEAEDDG 540

Query: 541 HILAFVHDEKEWKSELQIVNAMTLELEATVKLPSRVPYGFHGTFISSEDLQKQ 594
           HILAFVHDEKEWKSELQIVNA+TL+LEA+VKLPSRVPYGFHGTFI S+ LQKQ
Sbjct: 541 HILAFVHDEKEWKSELQIVNAITLKLEASVKLPSRVPYGFHGTFIGSKGLQKQ 587

BLAST of ClCG07G010550 vs. NCBI nr
Match: gi|1009109982|ref|XP_015893126.1| (PREDICTED: 9-cis-epoxycarotenoid dioxygenase NCED1, chloroplastic-like [Ziziphus jujuba])

HSP 1 Score: 929.9 bits (2402), Expect = 2.3e-267
Identity = 451/569 (79.26%), Positives = 503/569 (88.40%), Query Frame = 1

Query: 27  NYGRNRVATSCSLHTPSIIQIPKHSHTAF-PSPIKTAVPKPLPVSPPSLSSASDDWNFLQ 86
           N  + R   +CSL TPS+I+IPK S + + PS   T   K  P S          WNFLQ
Sbjct: 20  NGKKRRPNINCSLQTPSVIRIPKQSPSTYQPSNTTTIKEKSTPNSVSDKPVPFQQWNFLQ 79

Query: 87  RAAAMALDAVENALVSAERKHSLPKTADPAVQIAGNFAPVPEQPVRKGLPVIGKVPDCIR 146
           +AAAMAL AVE+ALV  ERKH LPKTADP+VQIAGNFAPV EQPV+  LPV GK+P CI 
Sbjct: 80  KAAAMALQAVESALVEHERKHPLPKTADPSVQIAGNFAPVQEQPVQHSLPVTGKIPKCIE 139

Query: 147 GVYVRNGANPLHEPVSGHHLFDGDGMVHAVEFSEGGGVSYACRFTETQRLVQERAYGRPV 206
           GVYVRNGANP +EPV+GHH FDGDGMVHAV+F + G  SY+CRFTET RLVQERA+GRPV
Sbjct: 140 GVYVRNGANPHYEPVAGHHFFDGDGMVHAVQF-KNGTASYSCRFTETHRLVQERAFGRPV 199

Query: 207 FPKAIGELHGHSGIARLMLFYARGLFGLVDHNHGIGVANAGLVFFNGRLLAMSEDDLPYQ 266
           FPKAIGELHGHSGIARL+LFYARG+FGLVD NHG+GVANAGLV+FNGRLLAMSEDDLPY 
Sbjct: 200 FPKAIGELHGHSGIARLLLFYARGIFGLVDSNHGMGVANAGLVYFNGRLLAMSEDDLPYH 259

Query: 267 IRVTPAGDLKTVGRFDFDGQLTSTMIAHPKLDPVSGEMFALSYDVIQKPYLKYFKFSPEG 326
           +R+TP+GDLKTVGR+DF+GQL STMIAHPKLDP +GE+FALSYDV+Q+PYLKYF FSP G
Sbjct: 260 VRITPSGDLKTVGRYDFNGQLKSTMIAHPKLDPETGELFALSYDVVQRPYLKYFHFSPSG 319

Query: 327 EKSPDVEIPLPQPTMMHDFAITEKFVVIPDQQVVFKLPEMIRGGSPVVYDKEKTSRFGIL 386
           +KS DVEIPL QPTMMHDFAITE FVVIPDQQVVFKLPEMIRGGSPV+YD+EK +RFGIL
Sbjct: 320 KKSTDVEIPLSQPTMMHDFAITENFVVIPDQQVVFKLPEMIRGGSPVIYDREKMARFGIL 379

Query: 387 DKNATDANSIKWIEAPDCFCFHLWNAWEEPETNEIVVIGSCMTPPDSIFNECEENLKSVL 446
           DKNATDA+ IKWIEAPDCFCFHLWNAWEEPE +E+VVIGSCMTPPDSIFNECEENLKSVL
Sbjct: 380 DKNATDASGIKWIEAPDCFCFHLWNAWEEPENDEVVVIGSCMTPPDSIFNECEENLKSVL 439

Query: 447 SEIRLNLSTGKSTRRPIITETEQVNLEAGMVNRNLLGRKTQFAYLALAEPWPKVSGFAKV 506
           SEIRLNL TGKSTRRPII+E EQVNLEAGMVNRN LGRKT+FAYLALAEPWPKVSGFAKV
Sbjct: 440 SEIRLNLKTGKSTRRPIISEQEQVNLEAGMVNRNRLGRKTRFAYLALAEPWPKVSGFAKV 499

Query: 507 DLFTGEIKKYLYGEQRYGGEPLFLPRE-GAEAEDDGHILAFVHDEKEWKSELQIVNAMTL 566
           DLFTGE+ K++YGEQR+GGEPLFLP++  +E EDDG+ILAFVHDEKEWKSELQIVNAMT+
Sbjct: 500 DLFTGEVNKFMYGEQRFGGEPLFLPKDPNSENEDDGYILAFVHDEKEWKSELQIVNAMTM 559

Query: 567 ELEATVKLPSRVPYGFHGTFISSEDLQKQ 594
           +LEAT+KLPSRVPYGFHGTFISS+DL+KQ
Sbjct: 560 KLEATIKLPSRVPYGFHGTFISSKDLEKQ 587

BLAST of ClCG07G010550 vs. NCBI nr
Match: gi|590611709|ref|XP_007022180.1| (Nine-cis-epoxycarotenoid dioxygenase 3 [Theobroma cacao])

HSP 1 Score: 926.0 bits (2392), Expect = 3.3e-266
Identity = 457/588 (77.72%), Positives = 505/588 (85.88%), Query Frame = 1

Query: 15  SSSMPENLSVNTNYGRNRVAT----SCSLHTPSIIQIPKHSHTAFPSPIKTAVPKPLPVS 74
           SSS P+  S++++    R       SCSL TPSI+  PK S T  PSP  T    P    
Sbjct: 28  SSSAPDLGSISSSIPFKRPTIKPNISCSLQTPSILHFPKQSPTYPPSP--TPASSPTTKH 87

Query: 75  PPSLSSA----SDDWNFLQRAAAMALDAVENALVSAERKHSLPKTADPAVQIAGNFAPVP 134
           P + SSA    S  WN  QRAAAMALD VENALVS ER H LPKTADP VQI+GNFAPVP
Sbjct: 88  PKNTSSAEKPQSQQWNPFQRAAAMALDVVENALVSHERNHPLPKTADPRVQISGNFAPVP 147

Query: 135 EQPVRKGLPVIGKVPDCIRGVYVRNGANPLHEPVSGHHLFDGDGMVHAVEFSEGGGVSYA 194
           EQP++  LPV G +P+CI+GVYVRNGANPLHEPV+GHH FDGDGMVHAV+F + G  SY 
Sbjct: 148 EQPIKHRLPVTGTIPECIQGVYVRNGANPLHEPVAGHHFFDGDGMVHAVQF-KNGSASYG 207

Query: 195 CRFTETQRLVQERAYGRPVFPKAIGELHGHSGIARLMLFYARGLFGLVDHNHGIGVANAG 254
           CRFTET RLVQERA+GRPVFPKAIGELHGHSGIARL+LFYARGLFGLVD +HG GVANAG
Sbjct: 208 CRFTETSRLVQERAFGRPVFPKAIGELHGHSGIARLLLFYARGLFGLVDPSHGTGVANAG 267

Query: 255 LVFFNGRLLAMSEDDLPYQIRVTPAGDLKTVGRFDFDGQLTSTMIAHPKLDPVSGEMFAL 314
           LV+FNG LLAMSEDDLPY +R+TP+GDLKTVGR+DFDGQL STMIAHPK+DPV+GE FAL
Sbjct: 268 LVYFNGHLLAMSEDDLPYHVRITPSGDLKTVGRYDFDGQLKSTMIAHPKVDPVTGEFFAL 327

Query: 315 SYDVIQKPYLKYFKFSPEGEKSPDVEIPLPQPTMMHDFAITEKFVVIPDQQVVFKLPEMI 374
           SYDVIQKPYLKYF FS +G+KSPDVEIP+  PTMMHDFAITE FVVIPDQQVVFKLPEMI
Sbjct: 328 SYDVIQKPYLKYFHFSADGKKSPDVEIPVESPTMMHDFAITENFVVIPDQQVVFKLPEMI 387

Query: 375 RGGSPVVYDKEKTSRFGILDKNATDANSIKWIEAPDCFCFHLWNAWEEPETNEIVVIGSC 434
            GGSPVVYDK K SRFGILDKNATD + I W+EAPDCFCFHLWNAWEEP+T+E+VVIGSC
Sbjct: 388 HGGSPVVYDKNKMSRFGILDKNATDDSGISWVEAPDCFCFHLWNAWEEPQTDEVVVIGSC 447

Query: 435 MTPPDSIFNECEENLKSVLSEIRLNLSTGKSTRRPIITETEQVNLEAGMVNRNLLGRKTQ 494
           MTPPDSIFNEC+E+LKSVLSEIRLNL TGKSTRRPII+E+EQVNLEAGMVNRNLLGRKT+
Sbjct: 448 MTPPDSIFNECDESLKSVLSEIRLNLKTGKSTRRPIISESEQVNLEAGMVNRNLLGRKTR 507

Query: 495 FAYLALAEPWPKVSGFAKVDLFTGEIKKYLYGEQRYGGEPLFLPRE-GAEAEDDGHILAF 554
           FAYLALAEPWPKVSGFAKVDL TGE+KKY+YG+QRYGGEPLF PR   +E EDDG+ILAF
Sbjct: 508 FAYLALAEPWPKVSGFAKVDLSTGEVKKYIYGDQRYGGEPLFFPRNPNSENEDDGYILAF 567

Query: 555 VHDEKEWKSELQIVNAMTLELEATVKLPSRVPYGFHGTFISSEDLQKQ 594
           VHDEK W+SELQIVNAM L+LEATVKLPSRVPYGFHGTFISS+DL+KQ
Sbjct: 568 VHDEKTWQSELQIVNAMNLQLEATVKLPSRVPYGFHGTFISSKDLEKQ 612
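
Each HSP summary line in the alignments above reports identical and similar residues as a count over the alignment length, followed by the rounded percentage. A minimal Python sketch for extracting those counts and rechecking the percentages is given here. The names `HSP_STATS`, `parse_hsp_stats` and `percent` are hypothetical helpers introduced for illustration and are not part of any BLAST tool; the pattern also tolerates the variant spelling "Postives" in case a report uses it.

```python
import re

# Matches "Identity = a/b (p%), Positives = c/d (q%)".
# "Posi?tives" also accepts the variant spelling "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return (count, length, reported_percent) tuples, or None if no match."""
    match = HSP_STATS.search(line)
    if match is None:
        return None
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }


def percent(count, length):
    """Recompute a percentage from a count and an alignment length."""
    return round(100.0 * count / length, 2)


if __name__ == "__main__":
    header = "Identity = 436/592 (73.65%), Positives = 489/592 (82.60%), Query Frame = 1"
    stats = parse_hsp_stats(header)
    for key, (count, length, reported) in stats.items():
        print(key, count, length, reported, percent(count, length))
```

Comparing the recomputed value with the reported one is a quick consistency check when these header lines are copied by hand into a spreadsheet or a script.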

The following BLAST results are available for this feature:
Match Name     E-value     Identity   Description
NCED3_ARATH    1.4e-253    73.65      9-cis-epoxycarotenoid dioxygenase NCED3, chloroplastic OS=Arabidopsis thaliana G...
NCED1_PHAVU    9.6e-250    70.34      9-cis-epoxycarotenoid dioxygenase NCED1, chloroplastic OS=Phaseolus vulgaris GN=...
NCED9_ARATH    9.3e-237    70.65      9-cis-epoxycarotenoid dioxygenase NCED9, chloroplastic OS=Arabidopsis thaliana G...
NCED5_ARATH    1.8e-232    69.30      Probable 9-cis-epoxycarotenoid dioxygenase NCED5, chloroplastic OS=Arabidopsis t...
NCED_ONCHC     1.5e-226    70.70      9-cis-epoxycarotenoid dioxygenase, chloroplastic OS=Oncidium hybrid cultivar GN=...
Match Name          E-value     Identity   Description
A0A0D2R0C3_GOSRA    7.4e-265    76.21      Uncharacterized protein OS=Gossypium raimondii GN=B456_002G038100 PE=4 SV=1
A0A0H3YDS4_GOSHI    6.9e-263    76.52      9-cis-epoxycarotenoid dioxygenase 2 OS=Gossypium hirsutum GN=NCED2 PE=2 SV=1
B9SU59_RICCO        3.4e-262    75.00      9-cis-epoxycarotenoid dioxygenase, putative OS=Ricinus communis GN=RCOM_1716910 ...
A0A0D2U362_GOSRA    1.7e-261    79.61      Uncharacterized protein OS=Gossypium raimondii GN=B456_013G177100 PE=4 SV=1
K4IAN1_FRAAN        3.2e-260    74.79      9-cis-epoxycarotenoid dioxygenase 2 OS=Fragaria ananassa GN=NCED2 PE=2 SV=1
Match Name     E-value     Identity   Description
AT3G14440.1    8.1e-255    73.65      nine-cis-epoxycarotenoid dioxygenase 3
AT1G78390.1    5.3e-238    70.65      nine-cis-epoxycarotenoid dioxygenase 9
AT1G30100.1    1.0e-233    69.30      nine-cis-epoxycarotenoid dioxygenase 5
AT4G18350.1    2.3e-217    70.35      nine-cis-epoxycarotenoid dioxygenase 2
AT3G24220.1    6.7e-185    57.61      nine-cis-epoxycarotenoid dioxygenase 6
Match Name                           E-value     Identity   Description
gi|659131081|ref|XP_008465499.1|     0.0e+00     92.57      PREDICTED: 9-cis-epoxycarotenoid dioxygenase NCED3, chloroplastic-like [Cucumis ...
gi|449460068|ref|XP_004147768.1|     0.0e+00     91.44      PREDICTED: 9-cis-epoxycarotenoid dioxygenase NCED3, chloroplastic-like [Cucumis ...
gi|408794955|gb|AFU91491.1|          8.3e-310    88.20      9-cis-epoxycarotenoid dioxygenase [Momordica charantia]
gi|1009109982|ref|XP_015893126.1|    2.3e-267    79.26      PREDICTED: 9-cis-epoxycarotenoid dioxygenase NCED1, chloroplastic-like [Ziziphus...
gi|590611709|ref|XP_007022180.1|     3.3e-266    77.72      Nine-cis-epoxycarotenoid dioxygenase 3 [Theobroma cacao]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term          Definition
IPR004294     Carotenoid_Oase
Vocabulary: Molecular Function
Term          Definition
GO:0016702    oxidoreductase activity, acting on single donors with incorporation of molecular oxygen, incorporation of two atoms of oxygen
Vocabulary: Biological Process
Term          Definition
GO:0055114    oxidation-reduction process
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0055114        oxidation-reduction process
biological_process    GO:0042572        retinol metabolic process
cellular_component    GO:0005575        cellular_component
molecular_function    GO:0003834        beta-carotene 15,15'-monooxygenase activity
molecular_function    GO:0016702        oxidoreductase activity, acting on single donors with incorporation of molecular oxygen, incorporation of two atoms of oxygen
molecular_function    GO:0046872        metal ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
ClCG07G010550.1    ClCG07G010550.1    mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term     IPR Description        Source     Source Term       Source Description                                                Alignment
IPR004294    Carotenoid oxygenase   PANTHER    PTHR10543         BETA-CAROTENE DIOXYGENASE                                         coord: 71..595; score:
IPR004294    Carotenoid oxygenase   PFAM       PF03055           RPE65                                                             coord: 123..585; score: 1.6E
None         No IPR available       PANTHER    PTHR10543:SF42    9-CIS-EPOXYCAROTENOID DIOXYGENASE NCED2, CHLOROPLASTIC-RELATED    coord: 71..595; score: