Cp4.1LG04g08360 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG04g08360
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
Descriptioncytochrome c biogenesis protein family
LocationCp4.1LG04 : 3131280 .. 3136969 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AGAATTAGGCATGTCTAACAAGTTTAAAATAAAAAACTATGGAGCAGAAAAAATATCTTGGCGAGGATAAAGCATGAGGTATGGTCGTTATTAACGCTCGGGGGGCAAGAAAGTGGAAACCCACTTCATAATTTCTGCAACAATGGAGGGATTGATCCTTCACAATCCAAACCCATATTTATCCAAAACCCCTTTTCTTCATTCTTCCTTCAGGCCTCAAATCACTCCAAATCTCTGCTGCAGAACCCTCTCTTTCAGCGTCACTTGTAAGGTGAAGGCATCGCAGGACAAGAAGCCCAAGAATGTCAGCAACAAGATTGTGCTCTCTGAAGCAGCGCCGCCGCTTGCGGAGGAGAGCGATGACAATAATGGAAACAACACTGAAGCCGAAGTCAAGCCTGGAAATGGGAGTGGTTTGTTGATGAAGCTTGTGAAGAGATTGCCGAAGAGGATTTTGGGGGCTTTGTCTAATTTGCCTTTGGCTATTGGAGAAATGTTCACCATCGCTGCTCTCATGGCTCTTGGTATGACCTATTTTTGTTGGCTATATCTCTCTGTATTCCGTTTACTATCTGGATTGAATAAACCGAAGATATAGAAAAAGAAAAAGTACAGTAATCTACTTCCTTCTAGTTTCAAATGATCTTATATTTGTTAGGAATCACGGACTCTCCACAATGGTATGATATTGTCCACTTTAGGCATAAGCTCTAAGTGGTTTGCTTTTGGTTTCCCCAAAAGACCTTATACCAATGGAGATGTATTCTTTACTTATAAACCCAGGATCATTCTCTAAATTAGCCAAGGTAGGACCCCTCCCAACAATCCTCTCCTCAAACAAAGTACACCATAGAGCCTCCCTTGAAGTCTATGTAGCCTTCGAACAACCTCCCCTTAATCGACGCTCGACTCCTTTCTTTGGAACCCTCAAACAAAGTACACCCTTTGTTCGACACATGAGTCACTTTTGACTACACCTTCGAGGCTAGCGATTTCTTTGTTCGACACTTGAGGATTCCATTAACATGGCTAAATGAAGGGCATGACTCTAATACCATGTTAGGAATCACGAGTCTACACAATGGTATGATATTGTCCACTTTGAGCATAAGCTCTCATGGCTTTGCTTTTGGTTTTTCCAAAAGGCCTCATACCAATGAAAATATGATAAATCCATGATCATTCCTTAAATTAGTGAACGTGGGATTTCCTCCCAACAATTCTCAATAATATCGATGAACTTGTTGCGTTTAGAAAGCCCTTGGCTGCAACAGCTGTTGGACATGTATAAATGATGGCTTATTACTGAAAAATCAATCTGAGTTCATCGTGTTTAGCATTCATTTGCAGGTACTTTCATTGATCAGGGAGAGGCCCCTGATTTTTATTTCCAGAAGTACCCTGAAGATAATCCTATGTGGGGATTCTTCACTTGGAGATGGATTCTCACACTTGGGTTTGATCATATGTACACATCTACCATTTTTCTAGCCATGTTATCTCTTCTTGGGCTCTCACTCATGGCATGCACTTACACAACTCAGATTCCTCTTGTTAAGGTTGCAAGAAGGTTACAACACTCTCCCTCTTTCTCTTTCTTGACAGACACATTCATCTTTCTACCTTTTTTGTTATGAACATTCAAATTTTGTATGCAGATGGAACTTTTTGCAGTCCGGTGATGCTATTAGGAAGCTGGAATGTTCTGAAATTTTACCCAGAGCATCAGTCCAAGATTTGGGGGTTGTTTTAATGGGAGCAGGATATGAGGTTGTAACTCAATGTGGATTCTTGCTTCTTATATGATTATACTATTTTATGTACAGTACATAGTACATAGAACATCTAATGTTCTAACGGCCCAGGTCCACTGCTAGTAGATATTGTTCTATTTGGGCTTTTCCTTTCAAGCTTCCCCTCAAGGTTTTTGAAACGTTTCTGCTAGGGAAAAGTTCCACACCCTTAGTGTTTCAATCTCCTCCCCAACTAATGTGGGATCTTACAATCCACCCCCTTCGAAGCCCCGCGTCCTCGCTGGCACTCGTTCCTTTCTCCAATCGATGTGGGACCCCCATCCAATCCACCCCCTTTGGGGCCCAACGTTTTTGCTAGCACATAGCCTCATGTCCACCCCTTCGGGGCTCAGCCTACCCGCTGGCACATCGCCCGGTGTCTGGCTTTGATACCATTTGTAATGGCTCGGTCCACCGCTAGCAAATATTGTCCTCTTTGGAGTTTCCCTTTCGGGCTTTCCCTCAAGGTTATTAAAACACGTCTGCTAGGCAAAGGTTTCCACCCTTATAAAAGGTCGTTCTCCTCCCCAATCGATGTGGGATATATTACATAACATGCATGTGCTTTTTTTTTTTACTTGAAATCATGGAAAGCAACTTAATCAGCATCAAACTCAGATCTTGGTCATGCTAATTTCAGTGTCTAAATTAAATCAGAAATAGATTATACCATACAAGCTAAATATGGTCTCTAGACACTACTTCTTATAGCAAGCTTCTTGTCTTCCTGTTGGTAGCTCCCCAAGTGTGTAAGCTGAAAGTGTAGTTCACTAAATGATATCAATCATCTATTCTGCAGGTATTTATTAAAGGACCAACTTTATATGCTTTCAAGGGATTGGCTGGACGATTTGCGCCTATTGGTGTACATTTGGCCATGCTTTTGATTATGGGTGGTGCAACTCTCAGTGCAGCTGGGAGCTTCAGAGGTTCTGTTACAGTTCCTCAGGGACTGAATTTTGTAGTTGGAGATGTTCTGAACCCGAGCGGGTTTCTGTCCAAGCCAACTGAAGCTTTCAATACTGAAGTTCATGTCAACAACTTCTACATGGATTACTATGACAGTGGAGAGGTATGTCTGAATGAAGAAACCAATAATAACTATAAAACCAGTGGTAGAGTTAATACTGTGTTAGGAATCACGACTCTCCACGATGGTATGATATTGTCCACTTTGAGCTTAAGCTCTTATGCTTGCTTTGGGTTTCCCCAAAAGGCCTCGTACCAATGGAGATGTATTCCTTGCTTATAAAATCCATGATCATTCCCTAAATTAGTTGATGTAGGACTCCCTCCCAACAATCTTCAACATCGTGGTATAAACTAGCATGGGTTGGTCTTGTGATCAATAAGAATTAATGTAAGTAATAAAGGGTTAGCGGGAATGAGTTCAAGTTACGGGGTCCAGCTACTGTAGGATCAAATATTGTAGGGTCAAGTAGTTGTCCTATGAGAATAGTCGAGTTACGTGAAAATTTACCTAGACAACTGATTTTGCAGGTGAAACAGTTTCATAGTGATCTTTCTTTATTTGATCTTAATGGGAAAGAGGTGATGAGGAAGACAATCAGTGTAAATAATCCTTTACGGTATGGAGGTTTTACAATTTACCAAACTGATTGGGGATTTTCAGCTCTGCAAATACTCAAGAATGATGAAGGACCTTTTAACTTGGCTGTGGCACCTCTTAAGATCAATGGAGACAAGAAGCTTTATGGGACTTTCTTACCCGTTGGAGATGTCGATTCACCTGATGTGAAGGGAATGTATGTCGTTTCTTTTTGTACGTTTATCGTTATCTAATACTTCTTATTCGGGTATGTTTGGAAGTAATTTTGGAACGGTTTAAACCACTTTTGTCGAGTTAAAAATCACTCCTAAACATGCATTTAGTCATATATAAGTCGCGTTTTAAAGTATAAAGTCAAACATTGAAATCATTTTGAATGATTAAATGCGTGTTTGGAGTAATTTTCGGCGTGGCAAAAATGATTTTAAACATTTTAAAATCTCTCCCAAACATGCCCTATATCTACAAGGGAAAGACCAGTTATATTGTGAGATCCCAAATCGGTTGGAGAAGGGAACAAAGCATTCCTTATAAGGGTGTGGAAACCTCCCTCTAGGAGATGTGTTTTAAAACTATGAGGGTGATGGTGATACATAACGGATCAAAGCAGACAATATCTACTAGCAATGGGGTTGAGCTGTCAAAAATGGTATTAGAACCAGACACCGAGCGGTGTGCCAGCAAGGACGTTGGGCCCTCAAGGGGGGTGGATTGTGAGATCCCACATCGGTTGGAGAGGGGAACGAAGCATTCCTTATAAGGGTGTGGAAACCTCTTCCTATTAGGTGCATTTTAAAACCATGAAGCTGACGGCGATACGTGATGGGCCAAAACGAAAACTATCTTCTAGCGGTGGGCTTGAGCGGTTATATATATATACATGTCAAATTGAGTATCCTTTATGCATTTGGAGGCCATGGGGTCGGCATCTTGGTTCATACTACATCTATATACATAAGGATGTAGTTTTTTGTGGCTATCCTTGTTACTTGTGGATGGTATTCTCATATTTTATATTAATGATGTAATTTCAGATCAATGCTTGCTCGTGATCTGCAATCCATTGTCCTGTATGATCAGGAAGGGAAGTTTGTCGGGGTTCGACGTCCAAGCTCTAAGCTTCCAATCGATATCGACGGTATGAAAATTGAAATACTTGATGCAATTGGCAGCACTGGGCTGGAGTTGAAGGTAACGACTCGGAAACGTTCACTAGTTTTTAGCTGGTGTGTCAAAACTGGATTCTAAACTGTCATTGATATTTGCCTATTTATCTCCAGACTGACCCAGGAGTGCCTGTTGTTTATGCTGGATTTGGTGCTCTAATGCTCACAACATGTATCAGTTTTCTCTCTCATTCACAGGTAAATATGCTTTTCTTCAACCTATTATTAATGCTCAATGCTAGAAAAAGATAACATGGTGTACTTACCATTTCTTTGATTGAAAGATATGGGCCATACAAGACGGAACAGTGGTGATCGTCGGAGGAAAGACGAACCGTGCAAAGGTCGAATTTCCAGAGGAGATTGACAGTTTACTCGATCGTGTTCCAGAAATCATCGAACCATCTCACAATCAGTTCAATAACAATGATGCCTAGTTACTAAGACAAAAATGGAACAAGAGAGAAACACAGTGTCCAACTACAAAATTTTGGCCTGAGATCACAGTTGGCTGTCTTCTTTCTTACATATACATATAGAGGACGGTTTTACCAATTTATAATTCTTTCATCACCGCCATGTTCCAAGTCAAATAAATTTTGTATATAATGTAACATAGGTAGCCATGTACGCCATTTATTATAAGAAAATATCGCTAATTTGAGGGTGAATAATCGACTCGAAAGTCCAAACAATACAACTGCAGGTACGATCCAACGGTTTTCAAGGTCGACATACTTGATGTACTCATTACTATGTAATTCATTTCTGTGAGGACTAAGATCTTTACTGCACAACATTACTATTTTGATACAAGAGGGTATATGATTGATGAAGGGAAGGTATCACCTTTAGTTACACTTACGATCATGCAACTCTTGTCTCTAACAAGTCAATTTGAACTACTAACCTATAAATTTATGCAACCTGCAATGCTTCTTCACGCTTAATGTCGAAGACTTTGACAAGAGCATCTTCGACAACCTAGTCAAACGCGTATGACATGTTATGGTTATAAATCTATTGCAGATCAAATATGTACCAGCAAGCATATAAGCCGAAGTGTATGGCTAACCTTATCGGGGGTCGAGGCACCGGATGTAACACCGATTGTGATAGGACCCTTGGGTAGCCAATTCTCCTTCTCAACTAACTCCCCAT

mRNA sequence

AGAATTAGGCATGTCTAACAAGTTTAAAATAAAAAACTATGGAGCAGAAAAAATATCTTGGCGAGGATAAAGCATGAGGTATGGTCGTTATTAACGCTCGGGGGGCAAGAAAGTGGAAACCCACTTCATAATTTCTGCAACAATGGAGGGATTGATCCTTCACAATCCAAACCCATATTTATCCAAAACCCCTTTTCTTCATTCTTCCTTCAGGCCTCAAATCACTCCAAATCTCTGCTGCAGAACCCTCTCTTTCAGCGTCACTTGTAAGGTGAAGGCATCGCAGGACAAGAAGCCCAAGAATGTCAGCAACAAGATTGTGCTCTCTGAAGCAGCGCCGCCGCTTGCGGAGGAGAGCGATGACAATAATGGAAACAACACTGAAGCCGAAGTCAAGCCTGGAAATGGGAGTGGTTTGTTGATGAAGCTTGTGAAGAGATTGCCGAAGAGGATTTTGGGGGCTTTGTCTAATTTGCCTTTGGCTATTGGAGAAATGTTCACCATCGCTGCTCTCATGGCTCTTGGTACTTTCATTGATCAGGGAGAGGCCCCTGATTTTTATTTCCAGAAGTACCCTGAAGATAATCCTATGTGGGGATTCTTCACTTGGAGATGGATTCTCACACTTGGGTTTGATCATATGTACACATCTACCATTTTTCTAGCCATGTTATCTCTTCTTGGGCTCTCACTCATGGCATGCACTTACACAACTCAGATTCCTCTTGTTAAGGTTGCAAGAAGATGGAACTTTTTGCAGTCCGGTGATGCTATTAGGAAGCTGGAATGTTCTGAAATTTTACCCAGAGCATCAGTCCAAGATTTGGGGGTTGTTTTAATGGGAGCAGGATATGAGGTATTTATTAAAGGACCAACTTTATATGCTTTCAAGGGATTGGCTGGACGATTTGCGCCTATTGGTGTACATTTGGCCATGCTTTTGATTATGGGTGGTGCAACTCTCAGTGCAGCTGGGAGCTTCAGAGGTTCTGTTACAGTTCCTCAGGGACTGAATTTTGTAGTTGGAGATGTTCTGAACCCGAGCGGGTTTCTGTCCAAGCCAACTGAAGCTTTCAATACTGAAGTTCATGTCAACAACTTCTACATGGATTACTATGACACTCTGCAAATACTCAAGAATGATGAAGGACCTTTTAACTTGGCTGTGGCACCTCTTAAGATCAATGGAGACAAGAAGCTTTATGGGACTTTCTTACCCGTTGGAGATGTCGATTCACCTGATGAAGGGAAGTTTGTCGGGGTTCGACGTCCAAGCTCTAAGCTTCCAATCGATATCGACGGTATGAAAATTGAAATACTTGATGCAATTGGCAGCACTGGGCTGGAGTTGAAGACTGACCCAGGAGTGCCTGTTGTTTATGCTGGATTTGGTGCTCTAATGCTCACAACATGTATCAGTTTTCTCTCTCATTCACAGATATGGGCCATACAAGACGGAACAGTGGTGATCGTCGGAGGAAAGACGAACCGTGCAAAGGTCGAATTTCCAGAGGAGATTGACAGTTTACTCGATCGTGTTCCAGAAATCATCGAACCATCTCACAATCAGTTCAATAACAATGATGCCTAGTTACTAAGACAAAAATGGAACAAGAGAGAAACACAGTGTCCAACTACAAAATTTTGGCCTGAGATCACAGTTGGCTGTCTTCTTTCTTACATATACATATAGAGGACGGTTTTACCAATTTATAATTCTTTCATCACCGCCATGTTCCAAGTCAAATAAATTTTGTATATAATGTAACATAGGTAGCCATGTACGCCATTTATTATAAGAAAATATCGCTAATTTGAGGGTGAATAATCGACTCGAAAGTCCAAACAATACAACTGCAGGTACGATCCAACGGTTTTCAAGGTCGACATACTTGATGTACTCATTACTATGTAATTCATTTCTGTGAGGACTAAGATCTTTACTGCACAACATTACTATTTTGATACAAGAGGGTATATGATTGATGAAGGGAAGGTATCACCTTTAGTTACACTTACGATCATGCAACTCTTGTCTCTAACAAGTCAATTTGAACTACTAACCTATAAATTTATGCAACCTGCAATGCTTCTTCACGCTTAATGTCGAAGACTTTGACAAGAGCATCTTCGACAACCTAGTCAAACGCGTATGACATGTTATGGTTATAAATCTATTGCAGATCAAATATGTACCAGCAAGCATATAAGCCGAAGTGTATGGCTAACCTTATCGGGGGTCGAGGCACCGGATGTAACACCGATTGTGATAGGACCCTTGGGTAGCCAATTCTCCTTCTCAACTAACTCCCCAT

Coding sequence (CDS)

ATGGAGGGATTGATCCTTCACAATCCAAACCCATATTTATCCAAAACCCCTTTTCTTCATTCTTCCTTCAGGCCTCAAATCACTCCAAATCTCTGCTGCAGAACCCTCTCTTTCAGCGTCACTTGTAAGGTGAAGGCATCGCAGGACAAGAAGCCCAAGAATGTCAGCAACAAGATTGTGCTCTCTGAAGCAGCGCCGCCGCTTGCGGAGGAGAGCGATGACAATAATGGAAACAACACTGAAGCCGAAGTCAAGCCTGGAAATGGGAGTGGTTTGTTGATGAAGCTTGTGAAGAGATTGCCGAAGAGGATTTTGGGGGCTTTGTCTAATTTGCCTTTGGCTATTGGAGAAATGTTCACCATCGCTGCTCTCATGGCTCTTGGTACTTTCATTGATCAGGGAGAGGCCCCTGATTTTTATTTCCAGAAGTACCCTGAAGATAATCCTATGTGGGGATTCTTCACTTGGAGATGGATTCTCACACTTGGGTTTGATCATATGTACACATCTACCATTTTTCTAGCCATGTTATCTCTTCTTGGGCTCTCACTCATGGCATGCACTTACACAACTCAGATTCCTCTTGTTAAGGTTGCAAGAAGATGGAACTTTTTGCAGTCCGGTGATGCTATTAGGAAGCTGGAATGTTCTGAAATTTTACCCAGAGCATCAGTCCAAGATTTGGGGGTTGTTTTAATGGGAGCAGGATATGAGGTATTTATTAAAGGACCAACTTTATATGCTTTCAAGGGATTGGCTGGACGATTTGCGCCTATTGGTGTACATTTGGCCATGCTTTTGATTATGGGTGGTGCAACTCTCAGTGCAGCTGGGAGCTTCAGAGGTTCTGTTACAGTTCCTCAGGGACTGAATTTTGTAGTTGGAGATGTTCTGAACCCGAGCGGGTTTCTGTCCAAGCCAACTGAAGCTTTCAATACTGAAGTTCATGTCAACAACTTCTACATGGATTACTATGACACTCTGCAAATACTCAAGAATGATGAAGGACCTTTTAACTTGGCTGTGGCACCTCTTAAGATCAATGGAGACAAGAAGCTTTATGGGACTTTCTTACCCGTTGGAGATGTCGATTCACCTGATGAAGGGAAGTTTGTCGGGGTTCGACGTCCAAGCTCTAAGCTTCCAATCGATATCGACGGTATGAAAATTGAAATACTTGATGCAATTGGCAGCACTGGGCTGGAGTTGAAGACTGACCCAGGAGTGCCTGTTGTTTATGCTGGATTTGGTGCTCTAATGCTCACAACATGTATCAGTTTTCTCTCTCATTCACAGATATGGGCCATACAAGACGGAACAGTGGTGATCGTCGGAGGAAAGACGAACCGTGCAAAGGTCGAATTTCCAGAGGAGATTGACAGTTTACTCGATCGTGTTCCAGAAATCATCGAACCATCTCACAATCAGTTCAATAACAATGATGCCTAG

Protein sequence

MEGLILHNPNPYLSKTPFLHSSFRPQITPNLCCRTLSFSVTCKVKASQDKKPKNVSNKIVLSEAAPPLAEESDDNNGNNTEAEVKPGNGSGLLMKLVKRLPKRILGALSNLPLAIGEMFTIAALMALGTFIDQGEAPDFYFQKYPEDNPMWGFFTWRWILTLGFDHMYTSTIFLAMLSLLGLSLMACTYTTQIPLVKVARRWNFLQSGDAIRKLECSEILPRASVQDLGVVLMGAGYEVFIKGPTLYAFKGLAGRFAPIGVHLAMLLIMGGATLSAAGSFRGSVTVPQGLNFVVGDVLNPSGFLSKPTEAFNTEVHVNNFYMDYYDTLQILKNDEGPFNLAVAPLKINGDKKLYGTFLPVGDVDSPDEGKFVGVRRPSSKLPIDIDGMKIEILDAIGSTGLELKTDPGVPVVYAGFGALMLTTCISFLSHSQIWAIQDGTVVIVGGKTNRAKVEFPEEIDSLLDRVPEIIEPSHNQFNNNDA
BLAST of Cp4.1LG04g08360 vs. Swiss-Prot
Match: CCS1_ORYSJ (Cytochrome c biogenesis protein CCS1, chloroplastic OS=Oryza sativa subsp. japonica GN=CCS1 PE=2 SV=1)

HSP 1 Score: 540.8 bits (1392), Expect = 1.5e-152
Identity = 287/499 (57.52%), Postives = 348/499 (69.74%), Query Frame = 1

Query: 58  KIVLSEAAPPLAEESDDNNGNNTEAEVKPGNGSGLLMK--------LVKRLPKRILGALS 117
           ++V  +AAPP+++      G   E E   G G G + +        LV+RL KR L  LS
Sbjct: 68  QVVFFDAAPPVSQRGGGGGGEG-EGE---GEGEGKVARRKENAALGLVRRLTKRTLSLLS 127

Query: 118 NLPLAIGEMFTIAALMALGTFIDQGEAPDFYFQKYPEDNPMWGFFTWRWILTLGFDHMYT 177
           NLPLAI EMF IAALMALGT IDQGEAP +YF+K+PEDNP++GF TWRWILT GFDHM++
Sbjct: 128 NLPLAISEMFAIAALMALGTVIDQGEAPSYYFEKFPEDNPVFGFITWRWILTPGFDHMFS 187

Query: 178 STIFLAMLSLLGLSLMACTYTTQIPLVKVARRWNFLQSGDAIRKLECSEILPRASVQDLG 237
           S +FL +L+LL  SLMACTYTTQIP+VKVARRW+F+ S  +IRK E +E LPRAS+QDLG
Sbjct: 188 SPVFLGLLALLAASLMACTYTTQIPIVKVARRWSFMHSAGSIRKQEFAESLPRASIQDLG 247

Query: 238 VVLMGAGYEVFIKGPTLYAFKGLAGRFAPIGVHLAMLLIMGGATLSAAGSFRGSVTVPQG 297
           V+LMG GYEVF KGP+LYAFKGLAGRFAPIGVH+AM+ IM GATLSA GSF+GSV VPQG
Sbjct: 248 VILMGYGYEVFTKGPSLYAFKGLAGRFAPIGVHIAMIFIMAGATLSATGSFKGSVDVPQG 307

Query: 298 LNFVVGDVLNPSGFLSKPTEAFNTEVHVNNF-------------YMDY------------ 357
           LNFV+GDV+ P G LS   + FNTEVHVN F             Y D             
Sbjct: 308 LNFVIGDVMKPKGVLSFVPDVFNTEVHVNRFYMEYYDSGEVSQFYSDLSLFDLDGKEVMR 367

Query: 358 ----------------------YDTLQILKNDEGPFNLAVAPLKINGDKKLYGTFLPVGD 417
                                 +  LQ+ KN EGPFNLA+APLK+NGDKKL+GT LP+ +
Sbjct: 368 KTIKVNDPLRYGGVTIYQTDWGFSALQVKKNGEGPFNLAMAPLKLNGDKKLFGTLLPLEN 427

Query: 418 VDSPD-------------------EGKFVGVRRPSSKLPIDIDGMKIEILDAIGSTGLEL 477
             S +                   EGKFVGVRRPSSKLPI+IDG +I I DAIGSTGL+L
Sbjct: 428 SGSSNVKGISMLARDLQSIVLYDQEGKFVGVRRPSSKLPIEIDGNEIVIEDAIGSTGLDL 487

Query: 478 KTDPGVPVVYAGFGALMLTTCISFLSHSQIWAIQDGTVVIVGGKTNRAKVEFPEEIDSLL 483
           KTDPG+P+VYAGFGALMLTTCIS+LSHSQIWA+QDG+ V++GGKTNRAK+EF EE++ LL
Sbjct: 488 KTDPGIPIVYAGFGALMLTTCISYLSHSQIWALQDGSTVVIGGKTNRAKLEFSEEMNRLL 547

BLAST of Cp4.1LG04g08360 vs. Swiss-Prot
Match: CCS1_ARATH (Cytochrome c biogenesis protein CCS1, chloroplastic OS=Arabidopsis thaliana GN=CCS1 PE=1 SV=1)

HSP 1 Score: 474.6 bits (1220), Expect = 1.3e-132
Identity = 276/540 (51.11%), Postives = 350/540 (64.81%), Query Frame = 1

Query: 10  NPYLSKTPFLHSSFRPQITPNLCCRTLSFSV--TCKVKASQDKKPKNVSNK-----IVLS 69
           NP +     +H   RP    +  CRT + S+   CK++  QD   ++ SN+     I LS
Sbjct: 6   NPKILHFSKIHPFSRPS---SYLCRTRNVSLITNCKLQKPQDGNQRSSSNRNLTKTISLS 65

Query: 70  EAAPPLAEESDDN---NGNNTEAEVKPGNGSGLLMKLVKRLPKRILGALSNLPLAIGEMF 129
           ++APP+ EE+ D     G N       G G    +  +K LP+++L  LSNLPLAI EMF
Sbjct: 66  DSAPPVTEETGDGIVKGGGNGGGGGGDGRGG---LGFLKILPRKVLSVLSNLPLAITEMF 125

Query: 130 TIAALMALGTFIDQGEAPDFYFQKYPEDNPMWGFFTWRWILTLGFDHMYTSTIFLAMLSL 189
           TIAALMALGT I+QGE PDFYFQKYPEDNP+ GFFTWRWI TLG DHMY++ IFL ML L
Sbjct: 126 TIAALMALGTVIEQGETPDFYFQKYPEDNPVLGFFTWRWISTLGLDHMYSAPIFLGMLVL 185

Query: 190 LGLSLMACTYTTQIPLVKVARRWNFLQSGDAIRKLECSEILPRASVQDLGVVLMGAGYEV 249
           L  SLMACTYTTQIPLVKVARRW+F++S +AI+K E ++ LPRAS+QDLG++LMG G+EV
Sbjct: 186 LAASLMACTYTTQIPLVKVARRWSFMKSDEAIKKQEFADTLPRASIQDLGMILMGDGFEV 245

Query: 250 FIKGPTLYAFKGLAGRFAP-------------------------IGVHLAMLLIMGGATL 309
           F+KGP+LYAFKGLAGRFAP                         + V   +  +MG   L
Sbjct: 246 FMKGPSLYAFKGLAGRFAPIGVHIAMLLIMVGGTLSATGSFRGSVTVPQGLNFVMGDV-L 305

Query: 310 SAAGSFR---GSVTVPQGLNFVVGDVLNPSGFLSK-----------PTEAFNTEVHVNN- 369
           +  G F     +      +N    D  + SG +S+             E     + VN+ 
Sbjct: 306 APIGFFSIPTDAFNTEVHVNRFTMDYYD-SGEVSQFHSDLSLRDLNGKEVLRKTISVNDP 365

Query: 370 --------FYMDY-YDTLQILKNDEGPFNLAVAPLKINGDKKLYGTFLPVGDVDSPD--- 429
                   +  D+ +  LQ+ K+ EGPFNLA+AP+KINGDKKLYGTFLPVGD ++P+   
Sbjct: 366 LRYGGVTVYQTDWSFSALQVTKDGEGPFNLAMAPIKINGDKKLYGTFLPVGDTNAPNVKG 425

Query: 430 ----------------EGKFVGVRRPSSKLPIDIDGMKIEILDAIGSTGLELKTDPGVPV 472
                           +GKF G+RRPSSKLPI+I+GMKI I DAIGSTGLELKTDPGVPV
Sbjct: 426 ISMLARDLQSIVVYDLDGKFAGIRRPSSKLPIEINGMKIVIEDAIGSTGLELKTDPGVPV 485

BLAST of Cp4.1LG04g08360 vs. Swiss-Prot
Match: CCS1_NOSP7 (Cytochrome c biogenesis protein CcsB OS=Nostoc punctiforme (strain ATCC 29133 / PCC 73102) GN=ccsB PE=3 SV=1)

HSP 1 Score: 229.6 bits (584), Expect = 7.4e-59
Identity = 146/430 (33.95%), Postives = 223/430 (51.86%), Query Frame = 1

Query: 100 LPKRILGALSNLPLAIGEMFTIAALMALGTFIDQGEAPDFYFQKYPEDNPMWGFFTWRWI 159
           L + +L  L+NL LAI  +  IA   + GT I+QG++P FY   YPE   ++GF TW+ I
Sbjct: 21  LRQELLPVLTNLRLAIALLLLIAIFSSTGTVIEQGQSPAFYQANYPEHPALFGFLTWKVI 80

Query: 160 LTLGFDHMYTSTIFLAMLSLLGLSLMACTYTTQIPLVKVARRWNFLQSGDAIRKLECSEI 219
             +G DH+Y +  FLA+L L G SL AC++T Q+P +K A+RW + +     +KL  S  
Sbjct: 81  QVVGLDHVYRTWWFLALLILFGTSLTACSFTRQLPALKAAQRWKYYEEPRQFQKLALSAE 140

Query: 220 LPRASVQDLGVVLMGAGYEVFI---KGPTLYAFKGLAGRFAPIGVHLAMLLIMGGATLSA 279
           L   S+  L  +L    Y++F    K   LYA KG+ GR  PI VH+ ++ I+ G    A
Sbjct: 141 LDNGSLNSLSQLLQKRRYKIFPDREKENILYARKGIVGRIGPIIVHIGIVAILLGGIWGA 200

Query: 280 AGSFRGSVTVPQGLNFVVGDVLN------------------------PSGFLSK------ 339
              F     V  G  F V ++++                        PSG + +      
Sbjct: 201 MTGFMAQEMVASGDTFQVTNIVDAGPLAAQVSKDWSVRVNRFWIDYTPSGGIDQFYSDMS 260

Query: 340 -----PTEAFNTEVHVNNFY----MDYYDT------LQILKNDEGPFNLAVAPLKINGDK 399
                  E  + ++ VN       + +Y T      +++  N+   F L +A L   G  
Sbjct: 261 VLNKQGEEVDHKKIFVNEPLRYRGITFYQTDWGIAGVRVQFNNSPIFQLPMALLNTKGQG 320

Query: 400 KLYGTFLPVG-------DVDSPDEGKFVGVRRPSSKL--------PIDIDGMKIEILDAI 459
           +++GT++P          + + D    V +  P+ KL           ++G+K++ILD I
Sbjct: 321 RIWGTWVPTKPDLSEGVSLLAKDLQGMVLIYDPNGKLVDTVRAGMSTQVNGVKLKILDVI 380

Query: 460 GSTGLELKTDPGVPVVYAGFGALMLTTCISFLSHSQIWAIQDGTVVIVGGKTNRAKVEFP 467
           GSTGL++K DPG+P+VY+GFG LML   +S+ SHSQIWA+Q G ++ VGGKTNRA+V F 
Sbjct: 381 GSTGLQIKADPGIPIVYSGFGLLMLGVVMSYFSHSQIWALQKGDLLYVGGKTNRAQVAFE 440

BLAST of Cp4.1LG04g08360 vs. Swiss-Prot
Match: CCS1_TRIEI (Cytochrome c biogenesis protein CcsB OS=Trichodesmium erythraeum (strain IMS101) GN=ccsB PE=3 SV=1)

HSP 1 Score: 227.6 bits (579), Expect = 2.8e-58
Identity = 149/437 (34.10%), Postives = 217/437 (49.66%), Query Frame = 1

Query: 93  LMKLVKRLPKRILGALSNLPLAIGEMFTIAALMALGTFIDQGEAPDFYFQKYPEDNPMWG 152
           L+  ++   + IL  L++L LAI  +  IA     GT I+QGE+  FY + YPE   ++G
Sbjct: 11  LINPLRIFKREILPLLADLRLAIALLLIIAICSISGTVIEQGESIAFYQENYPEKPALFG 70

Query: 153 FFTWRWILTLGFDHMYTSTIFLAMLSLLGLSLMACTYTTQIPLVKVARRWNFLQSGDAIR 212
           F TW+ IL L  DH+Y +  FL++L L G SL ACT+T Q+P +K A RW F       +
Sbjct: 71  FLTWKVILLLELDHVYRTWWFLSILILFGASLTACTFTRQLPALKSANRWKFYNKKQQFK 130

Query: 213 KLECSEILPRASVQDLGVVLMGAGYEVFIKGPTLYAFKGLAGRFAPIGVHLAMLLIMGGA 272
            L  S  +  AS+  L  +L   GY+   +G  LY  KG+ G+  PI VH +ML+I+ G+
Sbjct: 131 NLALSAEIETASLDSLEEILQKRGYKTSKEGDKLYGRKGIIGKVGPIIVHASMLIILAGS 190

Query: 273 TLSAAGSFRGSVTVPQGLNFVVGDVL-------------------------NPSG----- 332
            + +   F G   VP G  F V +++                          PSG     
Sbjct: 191 IIGSMTGFLGQEIVPSGETFQVKNIIDAGIFAKSQIPKNWSVRVNRFWIDYTPSGGIDQF 250

Query: 333 -----FLSKPTEAFNTEVHVNNFYMDYYDT-----------LQILKNDEGPFNLAVAPLK 392
                 L K  E  + +    N  M Y+             +++  N    F L +A L 
Sbjct: 251 YSDLSILDKSGEEVDRKTIFVNQPMRYHGVTMYQADWAIAAVKVRVNKSPVFRLPMAQLN 310

Query: 393 INGDKKLYGTFLPV----------------GDVDSPD-EGKFVGVRRPSSKLPIDIDGMK 452
             G  K++GT++P+                G+V   D  GK V   R    + I++ G+ 
Sbjct: 311 TEGGGKIWGTWVPIKPDLSEGVSLLAKDLQGNVLVYDTSGKLVASVREG--MFIEVSGVT 370

Query: 453 IEILDAIGSTGLELKTDPGVPVVYAGFGALMLTTCISFLSHSQIWAIQDGTVVIVGGKTN 467
           + I   IGSTGL++K DPG+P+VY GFG LML+  +S++SHSQIWA Q+   + +GGKTN
Sbjct: 371 LFIDKIIGSTGLQIKADPGIPIVYLGFGLLMLSVLMSYVSHSQIWAFQESERLYIGGKTN 430

BLAST of Cp4.1LG04g08360 vs. Swiss-Prot
Match: CCS1_MICAN (Cytochrome c biogenesis protein CcsB OS=Microcystis aeruginosa (strain NIES-843) GN=ccsB PE=3 SV=1)

HSP 1 Score: 220.3 bits (560), Expect = 4.5e-56
Identity = 139/428 (32.48%), Postives = 216/428 (50.47%), Query Frame = 1

Query: 102 KRILGALSNLPLAIGEMFTIAALMALGTFIDQGEAPDFYFQKYPEDNPMWGFFTWRWILT 161
           ++ +  +++L LAI  +  IA     GT I+QG++  FY   YPE   ++GF TW+ +L 
Sbjct: 19  RKFIQTIADLRLAIILLLLIAIFSISGTVIEQGQSLSFYQANYPEKPALFGFLTWKVLLL 78

Query: 162 LGFDHMYTSTIFLAMLSLLGLSLMACTYTTQIPLVKVARRWNFLQSGDAIRKLECSEILP 221
           LG +H+Y++  +L++L L G SL ACT+  Q+P +K AR W F Q     +KL  S  L 
Sbjct: 79  LGLNHVYSTWWYLSLLILFGSSLTACTFRRQLPALKAARNWQFYQQSRQFQKLALSAELE 138

Query: 222 RASVQDLGVVLMGAGYEVFIKGPTLYAFKGLAGRFAPIGVHLAMLLIMGGATLSAAGSFR 281
             S++ L  +L   GY+VF++  +LYA KGL G+  PI VH AML+I+ GA   A   F 
Sbjct: 139 TGSLESLTPLLEKKGYKVFLENNSLYARKGLIGKIGPIIVHAAMLIILAGAIWGALTGFF 198

Query: 282 GSVTVPQGLNFVVGDVLN---------PSGFLSKPTEA---FNTEVHVNNFYMD------ 341
               V  G +F V +++          P  +  K       ++ +  +  FY D      
Sbjct: 199 AQEMVASGDSFQVKNIIEAGPLSKNSLPKDWGIKVNRFWIDYSPKGDIEQFYSDLSVIDN 258

Query: 342 ----------------------YYDT------LQILKNDEGPFNLAVAPLKINGDKKLYG 401
                                 +Y T      +++  N+     L +A L   G+ +++G
Sbjct: 259 QGQEIDRKTIQVNQPLHHKGVTFYQTSWGIAGVKVQVNNSPILQLPMASLDTKGNGQIWG 318

Query: 402 TFLPV----------------GDVDSPD-EGKFVGVRRPSSKLPIDIDGMKIEILDAIGS 461
           T++P                 G V   D +G      R    +PI+  G+ ++I++ +GS
Sbjct: 319 TWIPTKTDLSEGVSLLTRDLQGTVIVYDAQGDLTSAVREGMTIPIN--GVNLKIVELVGS 378

Query: 462 TGLELKTDPGVPVVYAGFGALMLTTCISFLSHSQIWAIQDGTVVIVGGKTNRAKVEFPEE 467
           TGL++K DPGVP+VY GF  LM+   +S+ SHSQIWA+Q G    +GGKTNRA+V F  E
Sbjct: 379 TGLQIKADPGVPIVYLGFALLMMGVVMSYFSHSQIWALQSGDRFYIGGKTNRAQVSFERE 438

BLAST of Cp4.1LG04g08360 vs. TrEMBL
Match: A0A0A0K6S4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G075670 PE=3 SV=1)

HSP 1 Score: 745.7 bits (1924), Expect = 3.4e-212
Identity = 407/553 (73.60%), Postives = 433/553 (78.30%), Query Frame = 1

Query: 1   MEGLILHNPNPYLSK-----TPFLHSSFRPQITPNLCCRTLSFSVTCKVKASQDKKPKNV 60
           ME LIL+N NP LSK     T FLHS F  QIT     +  SFSVTCK KASQDKK KN 
Sbjct: 11  MERLILNNLNPSLSKPFLLKTSFLHSVFSHQITLRTA-QIRSFSVTCKNKASQDKKLKNA 70

Query: 61  SNKIVLSEAAPPLAEESDDNNGNNTEAEVKPGNGSGLLMKLVKRLPKRILGALSNLPLAI 120
           SNKIVLSEAAPPLAEE  D +GN  EAEVKPGNGS   MKLVKRLPKRILGALSNLPLAI
Sbjct: 71  SNKIVLSEAAPPLAEEESDKSGN-AEAEVKPGNGSRS-MKLVKRLPKRILGALSNLPLAI 130

Query: 121 GEMFTIAALMALGTFIDQGEAPDFYFQKYPEDNPMWGFFTWRWILTLGFDHMYTSTIFLA 180
           GEMFTIAALMALGT IDQGEAPDFYFQKYPEDNP+WGFF WRWILTLGFDHMY+STIFL 
Sbjct: 131 GEMFTIAALMALGTVIDQGEAPDFYFQKYPEDNPLWGFFNWRWILTLGFDHMYSSTIFLG 190

Query: 181 MLSLLGLSLMACTYTTQIPLVKVARRWNFLQSGDAIRKLECSEILPRASVQDLGVVLMGA 240
           ML+LLG+SLMACTYTTQIPLVKVARRWNFLQSG+ IRKLECS+ILPRASVQDLGVVLMGA
Sbjct: 191 MLALLGISLMACTYTTQIPLVKVARRWNFLQSGETIRKLECSDILPRASVQDLGVVLMGA 250

Query: 241 GYEVFIKGPTLYAFKGLAGRFAPIGVHLAMLLIMGGATLSAAGSFRGSVTVPQGLNFVVG 300
           GYEVFIKGPTLYAFKGLAGRFAPIGVHLAMLLIM GATLSA GSFRGSVTVPQGLNFVVG
Sbjct: 251 GYEVFIKGPTLYAFKGLAGRFAPIGVHLAMLLIMAGATLSATGSFRGSVTVPQGLNFVVG 310

Query: 301 DVLNPSG-------------------------------------FLSKPTEAFNTEVHVN 360
           DVLNPSG                                     F     E     + VN
Sbjct: 311 DVLNPSGFLAKPTEAFNTEVHVNKFYMNYYDSGEIKQFYSDLSLFDLNGKEVMRKTISVN 370

Query: 361 N---------FYMDY-YDTLQILKNDEGPFNLAVAPLKINGDKKLYGTFLPVGDVDSPD- 420
           N         +  D+ +  LQILKNDEGPFNLAVAPLKINGDKKLYGTFLPVGDV+SPD 
Sbjct: 371 NPLRYGGFTIYQTDWGFSALQILKNDEGPFNLAVAPLKINGDKKLYGTFLPVGDVNSPDV 430

Query: 421 ------------------EGKFVGVRRPSSKLPIDIDGMKIEILDAIGSTGLELKTDPGV 480
                             EGKFVGVRRPSS+LPIDI+G+KIEI+DAIGSTGLELKTDPGV
Sbjct: 431 KGISMLARDLQSIVLYDQEGKFVGVRRPSSRLPIDINGIKIEIVDAIGSTGLELKTDPGV 490

Query: 481 PVVYAGFGALMLTTCISFLSHSQIWAIQDGTVVIVGGKTNRAKVEFPEEIDSLLDRVPEI 483
           P+VYAGFGALMLTTC+S+LSHSQ+WAIQDGTVVIVGGKTNRAKVEFPEE+D LLD+VPEI
Sbjct: 491 PIVYAGFGALMLTTCVSYLSHSQVWAIQDGTVVIVGGKTNRAKVEFPEEMDRLLDKVPEI 550

BLAST of Cp4.1LG04g08360 vs. TrEMBL
Match: A0A067LN40_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_10055 PE=3 SV=1)

HSP 1 Score: 641.7 bits (1654), Expect = 7.0e-181
Identity = 349/540 (64.63%), Postives = 399/540 (73.89%), Query Frame = 1

Query: 11  PYLSKTPFLHS---------SFRPQITPNLCCR-TLSFSVTCKVKASQD--KKPKNVSNK 70
           P+L KT FL S           +PQI  +LC R TLS ++TCK+K S++  KK  NVS K
Sbjct: 12  PFLLKTHFLKSPPVINSTFIKLKPQIH-SLCHRKTLSLTITCKLKTSKEIKKKDNNVSGK 71

Query: 71  IVLSEAAPPLAEESDDNNGNNTEAEVKPGNGSGL-LMKLVKRLPKRILGALSNLPLAIGE 130
           I+LS +APP++EES D +G   E     G+G G   +  +KRLP+R+L  LSNLPLAIGE
Sbjct: 72  ILLSNSAPPVSEESGDLDGKVPEKAGSGGSGGGGGPLGFMKRLPRRVLAVLSNLPLAIGE 131

Query: 131 MFTIAALMALGTFIDQGEAPDFYFQKYPEDNPMWGFFTWRWILTLGFDHMYTSTIFLAML 190
           MF IAALMALGTFIDQGE P+FYFQK+PE+NP+ GFFTWRWILTLGFDHM++S +FL ML
Sbjct: 132 MFVIAALMALGTFIDQGETPEFYFQKFPEENPVLGFFTWRWILTLGFDHMFSSPVFLGML 191

Query: 191 SLLGLSLMACTYTTQIPLVKVARRWNFLQSGDAIRKLECSEILPRASVQDLGVVLMGAGY 250
           +LLG SLMACTYTTQIPLVKVARRWNFL S +AIRK E S+ LP AS+QDLGV+LMGAGY
Sbjct: 192 ALLGTSLMACTYTTQIPLVKVARRWNFLLSSEAIRKQEYSDTLPSASIQDLGVILMGAGY 251

Query: 251 EVFIKGPTLYAFKGLAGRFAPIGVHLAMLLIMGGATLSAAGSFRGSVTVPQGLNFVVGDV 310
           EVF+KGP+LYAFKGLAGRFAPIGVHLAMLLIM G TL+AAGSFRGSVTVPQGLNFVVGDV
Sbjct: 252 EVFLKGPSLYAFKGLAGRFAPIGVHLAMLLIMAGGTLTAAGSFRGSVTVPQGLNFVVGDV 311

Query: 311 LNPSGFLSKPTEAFNTEVHVNNF-------------YMDY-------------------- 370
           L PSGFLS PTEAFNTEVHVN F             Y D                     
Sbjct: 312 LGPSGFLSTPTEAFNTEVHVNKFYMEYYDSGEVSQFYSDLSLFDLDGKEVLRKTISVNNP 371

Query: 371 --------------YDTLQILKNDEGPFNLAVAPLKINGDKKLYGTFLPVGDVDSPD--- 430
                         +  LQILKNDEGPFNLA+APLK+NGDKKL+GTFLPVGDV+SP+   
Sbjct: 372 LRYGGITIYQTDWSFSALQILKNDEGPFNLAMAPLKVNGDKKLFGTFLPVGDVNSPNVKG 431

Query: 431 ----------------EGKFVGVRRPSSKLPIDIDGMKIEILDAIGSTGLELKTDPGVPV 472
                           EGKFVGVRRP+SKLPI+IDG +I I DAIGSTGLELKTDPGVPV
Sbjct: 432 ISMLARDLQSIVLYDQEGKFVGVRRPNSKLPIEIDGTRIIIEDAIGSTGLELKTDPGVPV 491

BLAST of Cp4.1LG04g08360 vs. TrEMBL
Match: D7TPX3_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_03s0063g02040 PE=3 SV=1)

HSP 1 Score: 639.8 bits (1649), Expect = 2.7e-180
Identity = 340/533 (63.79%), Postives = 395/533 (74.11%), Query Frame = 1

Query: 15  KTPFLHSSFRPQITP------NLCCRT-LSFSVTCKVKASQD-KKPKNVSNKIVLSEAAP 74
           K+P L SSF+PQ  P      + C  T LSFS+TCK+K S+D K  K+++ KIVLSE AP
Sbjct: 15  KSPLLRSSFKPQFFPYTTQISSPCSSTPLSFSITCKLKTSEDGKSSKSLAKKIVLSEGAP 74

Query: 75  PLAEESDDNNGNNTEAEVKPGNGSGLLMKLVKRLPKRILGALSNLPLAIGEMFTIAALMA 134
            ++E+   N     +   K G G G    LVKR P+++L  LSNLPLAIGEMFT+AALMA
Sbjct: 75  AVSEDGAGNGEAQPKPASKGGGGGGGFGGLVKRFPRKVLSRLSNLPLAIGEMFTVAALMA 134

Query: 135 LGTFIDQGEAPDFYFQKYPEDNPMWGFFTWRWILTLGFDHMYTSTIFLAMLSLLGLSLMA 194
           LGT IDQGEAPD+YFQK+PEDNP+ GFFTWRW+LTLGFDHM++S IFL ML+LL  SLMA
Sbjct: 135 LGTAIDQGEAPDYYFQKFPEDNPVLGFFTWRWVLTLGFDHMFSSPIFLGMLALLATSLMA 194

Query: 195 CTYTTQIPLVKVARRWNFLQSGDAIRKLECSEILPRASVQDLGVVLMGAGYEVFIKGPTL 254
           CTYTTQIPLVKVARRWNFL S +AIRK E SE LP+ASV+DLGVVLMGAGYEVF+KGP+L
Sbjct: 195 CTYTTQIPLVKVARRWNFLHSAEAIRKQEFSESLPKASVRDLGVVLMGAGYEVFLKGPSL 254

Query: 255 YAFKGLAGRFAPIGVHLAMLLIMGGATLSAAGSFRGSVTVPQGLNFVVGDVLNPSGFLSK 314
           YAFKGLAGRFAPIGVHLAMLLIM G TLSA GSFRGSVTVPQGLNFV+GDVL+PSGFLS 
Sbjct: 255 YAFKGLAGRFAPIGVHLAMLLIMVGGTLSATGSFRGSVTVPQGLNFVMGDVLSPSGFLST 314

Query: 315 PTEAFNTEVHVNNF---YMD---------------------------------------- 374
           PT+AF+TEVHVN F   Y D                                        
Sbjct: 315 PTKAFDTEVHVNRFYMDYYDSGEVLQFHTDLSLFDLNGKEVMRKTISVNDPLRFDGITIY 374

Query: 375 ----YYDTLQILKNDEGPFNLAVAPLKINGDKKLYGTFLPVGDVDSPD------------ 434
                +  LQI K+DEGPFNLA+APLK+NGDKKL+GTFLPVGD DSP+            
Sbjct: 375 QTDWSFSALQIRKDDEGPFNLAMAPLKLNGDKKLFGTFLPVGDSDSPNVKGISMLARDLQ 434

Query: 435 -------EGKFVGVRRPSSKLPIDIDGMKIEILDAIGSTGLELKTDPGVPVVYAGFGALM 474
                  EGKF GVRRP+S LPIDIDG +I I DAIGS+GL+LKTDPGVP+VYAGFGALM
Sbjct: 435 SIVLYDKEGKFAGVRRPNSNLPIDIDGTRIVIEDAIGSSGLDLKTDPGVPIVYAGFGALM 494

BLAST of Cp4.1LG04g08360 vs. TrEMBL
Match: B9RZC9_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0938170 PE=3 SV=1)

HSP 1 Score: 636.7 bits (1641), Expect = 2.3e-179
Identity = 339/532 (63.72%), Postives = 395/532 (74.25%), Query Frame = 1

Query: 12  YLSKTPFLHSSFRPQITPNLCC--RTLSFSVTCKVKASQD--KKPKNVSNKIVLSEAAPP 71
           +++  PF++S+ +     ++ C  R LS SV+CK+K S++   K KNVS KI+LS +APP
Sbjct: 16  FINFHPFINSTIKLNPQIHILCNRRALSLSVSCKLKTSKEVENKDKNVSRKILLSNSAPP 75

Query: 72  LAEESDDNNGNNTEAEVKPGNGSGLLMKLVKRLPKRILGALSNLPLAIGEMFTIAALMAL 131
           ++EE     GNN E   K   G G  ++  KRLP+++L  LSNLPLAIGEMF IA LMAL
Sbjct: 76  VSEEG--GAGNNGEIPDKAAKGGGGPLRFFKRLPRKVLSVLSNLPLAIGEMFAIAGLMAL 135

Query: 132 GTFIDQGEAPDFYFQKYPEDNPMWGFFTWRWILTLGFDHMYTSTIFLAMLSLLGLSLMAC 191
           GT IDQG+AP+ YFQ YPE+NP+ GFFTWRWILTLGFDHM++S +FL ML+LLGLSLMAC
Sbjct: 136 GTVIDQGQAPEIYFQNYPEENPVLGFFTWRWILTLGFDHMFSSPVFLGMLALLGLSLMAC 195

Query: 192 TYTTQIPLVKVARRWNFLQSGDAIRKLECSEILPRASVQDLGVVLMGAGYEVFIKGPTLY 251
           TYTTQIPLVKVARRWNFL S +AIRK E ++ LP+AS+QD+GV+LMGAGYEVF+KGP+LY
Sbjct: 196 TYTTQIPLVKVARRWNFLHSAEAIRKQEFADTLPQASIQDVGVILMGAGYEVFLKGPSLY 255

Query: 252 AFKGLAGRFAPIGVHLAMLLIMGGATLSAAGSFRGSVTVPQGLNFVVGDVLNPSGFLSKP 311
           AFKGLAGRFAPIGVHLAMLLIM GATL+A GSFRGSVTVPQGLNFVVGDVL PSGFLS P
Sbjct: 256 AFKGLAGRFAPIGVHLAMLLIMAGATLTATGSFRGSVTVPQGLNFVVGDVLGPSGFLSTP 315

Query: 312 TEAFNTEVHVNNF-------------YMDY------------------------------ 371
           TEAFNTEVHVN F             Y D                               
Sbjct: 316 TEAFNTEVHVNKFYMDYYDSGEVSQFYSDLSLYDIDGKEVLRKTISVNNPLRYGGFTIYQ 375

Query: 372 ----YDTLQILKNDEGPFNLAVAPLKINGDKKLYGTFLPVGDVDSPD------------- 431
               +  LQI KNDEGPFNLA+APLKINGDKKL+GTFLPVGDV+SP+             
Sbjct: 376 TDWSFSALQIRKNDEGPFNLAMAPLKINGDKKLFGTFLPVGDVNSPNVKGISMLARDLQS 435

Query: 432 ------EGKFVGVRRPSSKLPIDIDGMKIEILDAIGSTGLELKTDPGVPVVYAGFGALML 474
                 EGKFVGVRRP+SKLPIDIDG +I I DAIGSTGLELKTDPGVPVVYAGFGALML
Sbjct: 436 IVLYDQEGKFVGVRRPNSKLPIDIDGTRIVIEDAIGSTGLELKTDPGVPVVYAGFGALML 495

BLAST of Cp4.1LG04g08360 vs. TrEMBL
Match: A0A0G2T403_9ROSI (Cytochrome c biogenesis protein OS=Francoa sonchifolia GN=ccs1 PE=2 SV=1)

HSP 1 Score: 605.5 bits (1560), Expect = 5.6e-170
Identity = 327/537 (60.89%), Postives = 383/537 (71.32%), Query Frame = 1

Query: 9   PNPYLSKT-----PFLHSSFRPQITPNLCCRTLSFSVTCKVKASQDKKPKNVSNKIVLSE 68
           PNPY+ KT     P  HS F+P +  +   RTL  S+TCK++    KK KNVS KI+LSE
Sbjct: 10  PNPYVPKTQLLKSPLFHSKFKPLLHKSNPKRTLLISITCKLQLP--KKDKNVSKKIILSE 69

Query: 69  AAPP-LAEESDDNNGNNTEAEVKPGNGSGLLMKLVKRLPKRILGALSNLPLAIGEMFTIA 128
             PP LAE+     G   E   + G   G+L  LVK+LP+++L  LSNL LAIGEMF IA
Sbjct: 70  VPPPPLAED-----GGEKEEPPRDGGKKGVL-SLVKKLPRKVLTVLSNLQLAIGEMFAIA 129

Query: 129 ALMALGTFIDQGEAPDFYFQKYPEDNPMWGFFTWRWILTLGFDHMYTSTIFLAMLSLLGL 188
           ALMALGT IDQGEAPD+YF+KYPEDNP+ GFFTWRWILTLGFDHM++S +FLA+L+LLG 
Sbjct: 130 ALMALGTAIDQGEAPDYYFRKYPEDNPVLGFFTWRWILTLGFDHMFSSPVFLALLALLGT 189

Query: 189 SLMACTYTTQIPLVKVARRWNFLQSGDAIRKLECSEILPRASVQDLGVVLMGAGYEVFIK 248
           SLMACTYTTQIPLVKVARRW FL S  AIRK E S+ L +AS++DLG++LMG+GYEV++K
Sbjct: 190 SLMACTYTTQIPLVKVARRWKFLYSVAAIRKQEFSDTLLKASIRDLGLILMGSGYEVYVK 249

Query: 249 GPTLYAFKGLAGRFAPIGVHLAMLLIMGGATLSAAGSFRGSVTVPQGLNFVVGDVLNPSG 308
           GP LYAFKGLAGRFAPIGVH+AMLLIM G TLSA GSFRGSVTVPQGLNFV+GDV+  SG
Sbjct: 250 GPALYAFKGLAGRFAPIGVHIAMLLIMAGGTLSATGSFRGSVTVPQGLNFVIGDVMGASG 309

Query: 309 FLSKPTE-------------AFNTEVHVNNFYMDY------------------------- 368
           FLS PTE              F     V+ F+ D                          
Sbjct: 310 FLSTPTEAFNTEVHVNRFYMDFYDSGEVSQFHTDLSLFDIDGKEVMRKTISVNDPLRYGG 369

Query: 369 ---------YDTLQILKNDEGPFNLAVAPLKINGDKKLYGTFLPVGDVDSPD-------- 428
                    +  LQ+LK+DEGPFNLA+APL+INGDKKL+GTFLPV D DSP+        
Sbjct: 370 ITMYQTDWSFSALQVLKDDEGPFNLAMAPLQINGDKKLFGTFLPVADADSPNVKGISMLA 429

Query: 429 -----------EGKFVGVRRPSSKLPIDIDGMKIEILDAIGSTGLELKTDPGVPVVYAGF 474
                      EGKF GVRRP+SKLPIDIDG ++ I+DAIGSTGL LKTDPGVPVVYAGF
Sbjct: 430 RDLQSIVLYDKEGKFAGVRRPNSKLPIDIDGTRVVIVDAIGSTGLNLKTDPGVPVVYAGF 489

BLAST of Cp4.1LG04g08360 vs. TAIR10
Match: AT1G49380.1 (AT1G49380.1 cytochrome c biogenesis protein family)

HSP 1 Score: 474.6 bits (1220), Expect = 7.5e-134
Identity = 276/540 (51.11%), Postives = 350/540 (64.81%), Query Frame = 1

Query: 10  NPYLSKTPFLHSSFRPQITPNLCCRTLSFSV--TCKVKASQDKKPKNVSNK-----IVLS 69
           NP +     +H   RP    +  CRT + S+   CK++  QD   ++ SN+     I LS
Sbjct: 6   NPKILHFSKIHPFSRPS---SYLCRTRNVSLITNCKLQKPQDGNQRSSSNRNLTKTISLS 65

Query: 70  EAAPPLAEESDDN---NGNNTEAEVKPGNGSGLLMKLVKRLPKRILGALSNLPLAIGEMF 129
           ++APP+ EE+ D     G N       G G    +  +K LP+++L  LSNLPLAI EMF
Sbjct: 66  DSAPPVTEETGDGIVKGGGNGGGGGGDGRGG---LGFLKILPRKVLSVLSNLPLAITEMF 125

Query: 130 TIAALMALGTFIDQGEAPDFYFQKYPEDNPMWGFFTWRWILTLGFDHMYTSTIFLAMLSL 189
           TIAALMALGT I+QGE PDFYFQKYPEDNP+ GFFTWRWI TLG DHMY++ IFL ML L
Sbjct: 126 TIAALMALGTVIEQGETPDFYFQKYPEDNPVLGFFTWRWISTLGLDHMYSAPIFLGMLVL 185

Query: 190 LGLSLMACTYTTQIPLVKVARRWNFLQSGDAIRKLECSEILPRASVQDLGVVLMGAGYEV 249
           L  SLMACTYTTQIPLVKVARRW+F++S +AI+K E ++ LPRAS+QDLG++LMG G+EV
Sbjct: 186 LAASLMACTYTTQIPLVKVARRWSFMKSDEAIKKQEFADTLPRASIQDLGMILMGDGFEV 245

Query: 250 FIKGPTLYAFKGLAGRFAP-------------------------IGVHLAMLLIMGGATL 309
           F+KGP+LYAFKGLAGRFAP                         + V   +  +MG   L
Sbjct: 246 FMKGPSLYAFKGLAGRFAPIGVHIAMLLIMVGGTLSATGSFRGSVTVPQGLNFVMGDV-L 305

Query: 310 SAAGSFR---GSVTVPQGLNFVVGDVLNPSGFLSK-----------PTEAFNTEVHVNN- 369
           +  G F     +      +N    D  + SG +S+             E     + VN+ 
Sbjct: 306 APIGFFSIPTDAFNTEVHVNRFTMDYYD-SGEVSQFHSDLSLRDLNGKEVLRKTISVNDP 365

Query: 370 --------FYMDY-YDTLQILKNDEGPFNLAVAPLKINGDKKLYGTFLPVGDVDSPD--- 429
                   +  D+ +  LQ+ K+ EGPFNLA+AP+KINGDKKLYGTFLPVGD ++P+   
Sbjct: 366 LRYGGVTVYQTDWSFSALQVTKDGEGPFNLAMAPIKINGDKKLYGTFLPVGDTNAPNVKG 425

Query: 430 ----------------EGKFVGVRRPSSKLPIDIDGMKIEILDAIGSTGLELKTDPGVPV 472
                           +GKF G+RRPSSKLPI+I+GMKI I DAIGSTGLELKTDPGVPV
Sbjct: 426 ISMLARDLQSIVVYDLDGKFAGIRRPSSKLPIEINGMKIVIEDAIGSTGLELKTDPGVPV 485

BLAST of Cp4.1LG04g08360 vs. NCBI nr
Match: gi|659109781|ref|XP_008454876.1| (PREDICTED: cytochrome c biogenesis protein CCS1, chloroplastic [Cucumis melo])

HSP 1 Score: 776.9 bits (2005), Expect = 2.0e-221
Identity = 419/553 (75.77%), Postives = 441/553 (79.75%), Query Frame = 1

Query: 1   MEGLILHNPNPYLSK-----TPFLHSSFRPQITPNLCCRTLSFSVTCKVKASQDKKPKNV 60
           ME LIL+NPNP LSK     TPFLHS F  QITP    +  SFSVTCK KASQDKK KNV
Sbjct: 11  MERLILNNPNPSLSKPFLLKTPFLHSVFGHQITPRTA-QIRSFSVTCKNKASQDKKLKNV 70

Query: 61  SNKIVLSEAAPPLAEESDDNNGNNTEAEVKPGNGSGLLMKLVKRLPKRILGALSNLPLAI 120
           SNKIVLSEAAPPLAEE  D +GN  EAEVKPGNGSG   KLVKRLPKRILGALSNLPLAI
Sbjct: 71  SNKIVLSEAAPPLAEEESDKSGN-AEAEVKPGNGSGST-KLVKRLPKRILGALSNLPLAI 130

Query: 121 GEMFTIAALMALGTFIDQGEAPDFYFQKYPEDNPMWGFFTWRWILTLGFDHMYTSTIFLA 180
           GEMFTIAALMALGT IDQGEAPDFYFQKYPEDNP+WGFFTWRWILTLGFDHMY+STIFL 
Sbjct: 131 GEMFTIAALMALGTVIDQGEAPDFYFQKYPEDNPLWGFFTWRWILTLGFDHMYSSTIFLG 190

Query: 181 MLSLLGLSLMACTYTTQIPLVKVARRWNFLQSGDAIRKLECSEILPRASVQDLGVVLMGA 240
           ML+LLG+SLMACTYTTQIPLVKVARRWNFLQSG+ IRKLECS+ILPRASVQDLGVVLMGA
Sbjct: 191 MLALLGISLMACTYTTQIPLVKVARRWNFLQSGETIRKLECSDILPRASVQDLGVVLMGA 250

Query: 241 GYEVFIKGPTLYAFKGLAGRFAPIGVHLAMLLIMGGATLSAAGSFRGSVTVPQGLNFVVG 300
           GYEVFIKGPTLYAFKGLAGRFAPIGVHLAMLLIM GATLSA GSFRGSVTVPQGLNFVVG
Sbjct: 251 GYEVFIKGPTLYAFKGLAGRFAPIGVHLAMLLIMAGATLSATGSFRGSVTVPQGLNFVVG 310

Query: 301 DVLNPSGFLSKPTE-------------AFNTEVHVNNFYMDY------------------ 360
           DVLNPSGFLSKPTE              +     V  FY D                   
Sbjct: 311 DVLNPSGFLSKPTEAFNTEVHVNKFYMNYYDSGEVKQFYSDLSLFDLNGKEVMRKTISVN 370

Query: 361 ----------------YDTLQILKNDEGPFNLAVAPLKINGDKKLYGTFLPVGDVDSPD- 420
                           +  LQILKNDEGPFNLAVAPLKINGDKKLYGTFLPVGDVDSP+ 
Sbjct: 371 NPLRYGGFTIYQTDWGFSALQILKNDEGPFNLAVAPLKINGDKKLYGTFLPVGDVDSPNV 430

Query: 421 ------------------EGKFVGVRRPSSKLPIDIDGMKIEILDAIGSTGLELKTDPGV 480
                             EGKFVGVRRPSS+LPIDI+G+KIEI+DAIGSTGLELKTDPGV
Sbjct: 431 KGISMLARDLQSIVLYDQEGKFVGVRRPSSRLPIDINGIKIEIVDAIGSTGLELKTDPGV 490

Query: 481 PVVYAGFGALMLTTCISFLSHSQIWAIQDGTVVIVGGKTNRAKVEFPEEIDSLLDRVPEI 483
           P+VYAGFGALMLTTC+S+LSHSQIWAIQDGTVVIVGGKTNRAKVEFPEE+D LLD+VPEI
Sbjct: 491 PIVYAGFGALMLTTCVSYLSHSQIWAIQDGTVVIVGGKTNRAKVEFPEEMDRLLDKVPEI 550

BLAST of Cp4.1LG04g08360 vs. NCBI nr
Match: gi|778725296|ref|XP_011658930.1| (PREDICTED: cytochrome c biogenesis protein CCS1, chloroplastic [Cucumis sativus])

HSP 1 Score: 745.7 bits (1924), Expect = 4.9e-212
Identity = 407/553 (73.60%), Postives = 433/553 (78.30%), Query Frame = 1

Query: 1   MEGLILHNPNPYLSK-----TPFLHSSFRPQITPNLCCRTLSFSVTCKVKASQDKKPKNV 60
           ME LIL+N NP LSK     T FLHS F  QIT     +  SFSVTCK KASQDKK KN 
Sbjct: 11  MERLILNNLNPSLSKPFLLKTSFLHSVFSHQITLRTA-QIRSFSVTCKNKASQDKKLKNA 70

Query: 61  SNKIVLSEAAPPLAEESDDNNGNNTEAEVKPGNGSGLLMKLVKRLPKRILGALSNLPLAI 120
           SNKIVLSEAAPPLAEE  D +GN  EAEVKPGNGS   MKLVKRLPKRILGALSNLPLAI
Sbjct: 71  SNKIVLSEAAPPLAEEESDKSGN-AEAEVKPGNGSRS-MKLVKRLPKRILGALSNLPLAI 130

Query: 121 GEMFTIAALMALGTFIDQGEAPDFYFQKYPEDNPMWGFFTWRWILTLGFDHMYTSTIFLA 180
           GEMFTIAALMALGT IDQGEAPDFYFQKYPEDNP+WGFF WRWILTLGFDHMY+STIFL 
Sbjct: 131 GEMFTIAALMALGTVIDQGEAPDFYFQKYPEDNPLWGFFNWRWILTLGFDHMYSSTIFLG 190

Query: 181 MLSLLGLSLMACTYTTQIPLVKVARRWNFLQSGDAIRKLECSEILPRASVQDLGVVLMGA 240
           ML+LLG+SLMACTYTTQIPLVKVARRWNFLQSG+ IRKLECS+ILPRASVQDLGVVLMGA
Sbjct: 191 MLALLGISLMACTYTTQIPLVKVARRWNFLQSGETIRKLECSDILPRASVQDLGVVLMGA 250

Query: 241 GYEVFIKGPTLYAFKGLAGRFAPIGVHLAMLLIMGGATLSAAGSFRGSVTVPQGLNFVVG 300
           GYEVFIKGPTLYAFKGLAGRFAPIGVHLAMLLIM GATLSA GSFRGSVTVPQGLNFVVG
Sbjct: 251 GYEVFIKGPTLYAFKGLAGRFAPIGVHLAMLLIMAGATLSATGSFRGSVTVPQGLNFVVG 310

Query: 301 DVLNPSG-------------------------------------FLSKPTEAFNTEVHVN 360
           DVLNPSG                                     F     E     + VN
Sbjct: 311 DVLNPSGFLAKPTEAFNTEVHVNKFYMNYYDSGEIKQFYSDLSLFDLNGKEVMRKTISVN 370

Query: 361 N---------FYMDY-YDTLQILKNDEGPFNLAVAPLKINGDKKLYGTFLPVGDVDSPD- 420
           N         +  D+ +  LQILKNDEGPFNLAVAPLKINGDKKLYGTFLPVGDV+SPD 
Sbjct: 371 NPLRYGGFTIYQTDWGFSALQILKNDEGPFNLAVAPLKINGDKKLYGTFLPVGDVNSPDV 430

Query: 421 ------------------EGKFVGVRRPSSKLPIDIDGMKIEILDAIGSTGLELKTDPGV 480
                             EGKFVGVRRPSS+LPIDI+G+KIEI+DAIGSTGLELKTDPGV
Sbjct: 431 KGISMLARDLQSIVLYDQEGKFVGVRRPSSRLPIDINGIKIEIVDAIGSTGLELKTDPGV 490

Query: 481 PVVYAGFGALMLTTCISFLSHSQIWAIQDGTVVIVGGKTNRAKVEFPEEIDSLLDRVPEI 483
           P+VYAGFGALMLTTC+S+LSHSQ+WAIQDGTVVIVGGKTNRAKVEFPEE+D LLD+VPEI
Sbjct: 491 PIVYAGFGALMLTTCVSYLSHSQVWAIQDGTVVIVGGKTNRAKVEFPEEMDRLLDKVPEI 550

BLAST of Cp4.1LG04g08360 vs. NCBI nr
Match: gi|802539068|ref|XP_012069488.1| (PREDICTED: cytochrome c biogenesis protein CCS1, chloroplastic isoform X1 [Jatropha curcas])

HSP 1 Score: 641.7 bits (1654), Expect = 1.0e-180
Identity = 349/540 (64.63%), Postives = 399/540 (73.89%), Query Frame = 1

Query: 11  PYLSKTPFLHS---------SFRPQITPNLCCR-TLSFSVTCKVKASQD--KKPKNVSNK 70
           P+L KT FL S           +PQI  +LC R TLS ++TCK+K S++  KK  NVS K
Sbjct: 12  PFLLKTHFLKSPPVINSTFIKLKPQIH-SLCHRKTLSLTITCKLKTSKEIKKKDNNVSGK 71

Query: 71  IVLSEAAPPLAEESDDNNGNNTEAEVKPGNGSGL-LMKLVKRLPKRILGALSNLPLAIGE 130
           I+LS +APP++EES D +G   E     G+G G   +  +KRLP+R+L  LSNLPLAIGE
Sbjct: 72  ILLSNSAPPVSEESGDLDGKVPEKAGSGGSGGGGGPLGFMKRLPRRVLAVLSNLPLAIGE 131

Query: 131 MFTIAALMALGTFIDQGEAPDFYFQKYPEDNPMWGFFTWRWILTLGFDHMYTSTIFLAML 190
           MF IAALMALGTFIDQGE P+FYFQK+PE+NP+ GFFTWRWILTLGFDHM++S +FL ML
Sbjct: 132 MFVIAALMALGTFIDQGETPEFYFQKFPEENPVLGFFTWRWILTLGFDHMFSSPVFLGML 191

Query: 191 SLLGLSLMACTYTTQIPLVKVARRWNFLQSGDAIRKLECSEILPRASVQDLGVVLMGAGY 250
           +LLG SLMACTYTTQIPLVKVARRWNFL S +AIRK E S+ LP AS+QDLGV+LMGAGY
Sbjct: 192 ALLGTSLMACTYTTQIPLVKVARRWNFLLSSEAIRKQEYSDTLPSASIQDLGVILMGAGY 251

Query: 251 EVFIKGPTLYAFKGLAGRFAPIGVHLAMLLIMGGATLSAAGSFRGSVTVPQGLNFVVGDV 310
           EVF+KGP+LYAFKGLAGRFAPIGVHLAMLLIM G TL+AAGSFRGSVTVPQGLNFVVGDV
Sbjct: 252 EVFLKGPSLYAFKGLAGRFAPIGVHLAMLLIMAGGTLTAAGSFRGSVTVPQGLNFVVGDV 311

Query: 311 LNPSGFLSKPTEAFNTEVHVNNF-------------YMDY-------------------- 370
           L PSGFLS PTEAFNTEVHVN F             Y D                     
Sbjct: 312 LGPSGFLSTPTEAFNTEVHVNKFYMEYYDSGEVSQFYSDLSLFDLDGKEVLRKTISVNNP 371

Query: 371 --------------YDTLQILKNDEGPFNLAVAPLKINGDKKLYGTFLPVGDVDSPD--- 430
                         +  LQILKNDEGPFNLA+APLK+NGDKKL+GTFLPVGDV+SP+   
Sbjct: 372 LRYGGITIYQTDWSFSALQILKNDEGPFNLAMAPLKVNGDKKLFGTFLPVGDVNSPNVKG 431

Query: 431 ----------------EGKFVGVRRPSSKLPIDIDGMKIEILDAIGSTGLELKTDPGVPV 472
                           EGKFVGVRRP+SKLPI+IDG +I I DAIGSTGLELKTDPGVPV
Sbjct: 432 ISMLARDLQSIVLYDQEGKFVGVRRPNSKLPIEIDGTRIIIEDAIGSTGLELKTDPGVPV 491

BLAST of Cp4.1LG04g08360 vs. NCBI nr
Match: gi|225428564|ref|XP_002281077.1| (PREDICTED: cytochrome c biogenesis protein CCS1, chloroplastic [Vitis vinifera])

HSP 1 Score: 639.8 bits (1649), Expect = 3.8e-180
Identity = 340/533 (63.79%), Postives = 395/533 (74.11%), Query Frame = 1

Query: 15  KTPFLHSSFRPQITP------NLCCRT-LSFSVTCKVKASQD-KKPKNVSNKIVLSEAAP 74
           K+P L SSF+PQ  P      + C  T LSFS+TCK+K S+D K  K+++ KIVLSE AP
Sbjct: 15  KSPLLRSSFKPQFFPYTTQISSPCSSTPLSFSITCKLKTSEDGKSSKSLAKKIVLSEGAP 74

Query: 75  PLAEESDDNNGNNTEAEVKPGNGSGLLMKLVKRLPKRILGALSNLPLAIGEMFTIAALMA 134
            ++E+   N     +   K G G G    LVKR P+++L  LSNLPLAIGEMFT+AALMA
Sbjct: 75  AVSEDGAGNGEAQPKPASKGGGGGGGFGGLVKRFPRKVLSRLSNLPLAIGEMFTVAALMA 134

Query: 135 LGTFIDQGEAPDFYFQKYPEDNPMWGFFTWRWILTLGFDHMYTSTIFLAMLSLLGLSLMA 194
           LGT IDQGEAPD+YFQK+PEDNP+ GFFTWRW+LTLGFDHM++S IFL ML+LL  SLMA
Sbjct: 135 LGTAIDQGEAPDYYFQKFPEDNPVLGFFTWRWVLTLGFDHMFSSPIFLGMLALLATSLMA 194

Query: 195 CTYTTQIPLVKVARRWNFLQSGDAIRKLECSEILPRASVQDLGVVLMGAGYEVFIKGPTL 254
           CTYTTQIPLVKVARRWNFL S +AIRK E SE LP+ASV+DLGVVLMGAGYEVF+KGP+L
Sbjct: 195 CTYTTQIPLVKVARRWNFLHSAEAIRKQEFSESLPKASVRDLGVVLMGAGYEVFLKGPSL 254

Query: 255 YAFKGLAGRFAPIGVHLAMLLIMGGATLSAAGSFRGSVTVPQGLNFVVGDVLNPSGFLSK 314
           YAFKGLAGRFAPIGVHLAMLLIM G TLSA GSFRGSVTVPQGLNFV+GDVL+PSGFLS 
Sbjct: 255 YAFKGLAGRFAPIGVHLAMLLIMVGGTLSATGSFRGSVTVPQGLNFVMGDVLSPSGFLST 314

Query: 315 PTEAFNTEVHVNNF---YMD---------------------------------------- 374
           PT+AF+TEVHVN F   Y D                                        
Sbjct: 315 PTKAFDTEVHVNRFYMDYYDSGEVLQFHTDLSLFDLNGKEVMRKTISVNDPLRFDGITIY 374

Query: 375 ----YYDTLQILKNDEGPFNLAVAPLKINGDKKLYGTFLPVGDVDSPD------------ 434
                +  LQI K+DEGPFNLA+APLK+NGDKKL+GTFLPVGD DSP+            
Sbjct: 375 QTDWSFSALQIRKDDEGPFNLAMAPLKLNGDKKLFGTFLPVGDSDSPNVKGISMLARDLQ 434

Query: 435 -------EGKFVGVRRPSSKLPIDIDGMKIEILDAIGSTGLELKTDPGVPVVYAGFGALM 474
                  EGKF GVRRP+S LPIDIDG +I I DAIGS+GL+LKTDPGVP+VYAGFGALM
Sbjct: 435 SIVLYDKEGKFAGVRRPNSNLPIDIDGTRIVIEDAIGSSGLDLKTDPGVPIVYAGFGALM 494

BLAST of Cp4.1LG04g08360 vs. NCBI nr
Match: gi|255556127|ref|XP_002519098.1| (PREDICTED: cytochrome c biogenesis protein CCS1, chloroplastic isoform X1 [Ricinus communis])

HSP 1 Score: 636.7 bits (1641), Expect = 3.2e-179
Identity = 339/532 (63.72%), Postives = 395/532 (74.25%), Query Frame = 1

Query: 12  YLSKTPFLHSSFRPQITPNLCC--RTLSFSVTCKVKASQD--KKPKNVSNKIVLSEAAPP 71
           +++  PF++S+ +     ++ C  R LS SV+CK+K S++   K KNVS KI+LS +APP
Sbjct: 16  FINFHPFINSTIKLNPQIHILCNRRALSLSVSCKLKTSKEVENKDKNVSRKILLSNSAPP 75

Query: 72  LAEESDDNNGNNTEAEVKPGNGSGLLMKLVKRLPKRILGALSNLPLAIGEMFTIAALMAL 131
           ++EE     GNN E   K   G G  ++  KRLP+++L  LSNLPLAIGEMF IA LMAL
Sbjct: 76  VSEEG--GAGNNGEIPDKAAKGGGGPLRFFKRLPRKVLSVLSNLPLAIGEMFAIAGLMAL 135

Query: 132 GTFIDQGEAPDFYFQKYPEDNPMWGFFTWRWILTLGFDHMYTSTIFLAMLSLLGLSLMAC 191
           GT IDQG+AP+ YFQ YPE+NP+ GFFTWRWILTLGFDHM++S +FL ML+LLGLSLMAC
Sbjct: 136 GTVIDQGQAPEIYFQNYPEENPVLGFFTWRWILTLGFDHMFSSPVFLGMLALLGLSLMAC 195

Query: 192 TYTTQIPLVKVARRWNFLQSGDAIRKLECSEILPRASVQDLGVVLMGAGYEVFIKGPTLY 251
           TYTTQIPLVKVARRWNFL S +AIRK E ++ LP+AS+QD+GV+LMGAGYEVF+KGP+LY
Sbjct: 196 TYTTQIPLVKVARRWNFLHSAEAIRKQEFADTLPQASIQDVGVILMGAGYEVFLKGPSLY 255

Query: 252 AFKGLAGRFAPIGVHLAMLLIMGGATLSAAGSFRGSVTVPQGLNFVVGDVLNPSGFLSKP 311
           AFKGLAGRFAPIGVHLAMLLIM GATL+A GSFRGSVTVPQGLNFVVGDVL PSGFLS P
Sbjct: 256 AFKGLAGRFAPIGVHLAMLLIMAGATLTATGSFRGSVTVPQGLNFVVGDVLGPSGFLSTP 315

Query: 312 TEAFNTEVHVNNF-------------YMDY------------------------------ 371
           TEAFNTEVHVN F             Y D                               
Sbjct: 316 TEAFNTEVHVNKFYMDYYDSGEVSQFYSDLSLYDIDGKEVLRKTISVNNPLRYGGFTIYQ 375

Query: 372 ----YDTLQILKNDEGPFNLAVAPLKINGDKKLYGTFLPVGDVDSPD------------- 431
               +  LQI KNDEGPFNLA+APLKINGDKKL+GTFLPVGDV+SP+             
Sbjct: 376 TDWSFSALQIRKNDEGPFNLAMAPLKINGDKKLFGTFLPVGDVNSPNVKGISMLARDLQS 435

Query: 432 ------EGKFVGVRRPSSKLPIDIDGMKIEILDAIGSTGLELKTDPGVPVVYAGFGALML 474
                 EGKFVGVRRP+SKLPIDIDG +I I DAIGSTGLELKTDPGVPVVYAGFGALML
Sbjct: 436 IVLYDQEGKFVGVRRPNSKLPIDIDGTRIVIEDAIGSTGLELKTDPGVPVVYAGFGALML 495

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
CCS1_ORYSJ1.5e-15257.52Cytochrome c biogenesis protein CCS1, chloroplastic OS=Oryza sativa subsp. japon... [more]
CCS1_ARATH1.3e-13251.11Cytochrome c biogenesis protein CCS1, chloroplastic OS=Arabidopsis thaliana GN=C... [more]
CCS1_NOSP77.4e-5933.95Cytochrome c biogenesis protein CcsB OS=Nostoc punctiforme (strain ATCC 29133 / ... [more]
CCS1_TRIEI2.8e-5834.10Cytochrome c biogenesis protein CcsB OS=Trichodesmium erythraeum (strain IMS101)... [more]
CCS1_MICAN4.5e-5632.48Cytochrome c biogenesis protein CcsB OS=Microcystis aeruginosa (strain NIES-843)... [more]
Match NameE-valueIdentityDescription
A0A0A0K6S4_CUCSA3.4e-21273.60Uncharacterized protein OS=Cucumis sativus GN=Csa_7G075670 PE=3 SV=1[more]
A0A067LN40_JATCU7.0e-18164.63Uncharacterized protein OS=Jatropha curcas GN=JCGZ_10055 PE=3 SV=1[more]
D7TPX3_VITVI2.7e-18063.79Putative uncharacterized protein OS=Vitis vinifera GN=VIT_03s0063g02040 PE=3 SV=... [more]
B9RZC9_RICCO2.3e-17963.72Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0938170 PE=3 SV=1[more]
A0A0G2T403_9ROSI5.6e-17060.89Cytochrome c biogenesis protein OS=Francoa sonchifolia GN=ccs1 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
AT1G49380.17.5e-13451.11 cytochrome c biogenesis protein family[more]
Match NameE-valueIdentityDescription
gi|659109781|ref|XP_008454876.1|2.0e-22175.77PREDICTED: cytochrome c biogenesis protein CCS1, chloroplastic [Cucumis melo][more]
gi|778725296|ref|XP_011658930.1|4.9e-21273.60PREDICTED: cytochrome c biogenesis protein CCS1, chloroplastic [Cucumis sativus][more]
gi|802539068|ref|XP_012069488.1|1.0e-18064.63PREDICTED: cytochrome c biogenesis protein CCS1, chloroplastic isoform X1 [Jatro... [more]
gi|225428564|ref|XP_002281077.1|3.8e-18063.79PREDICTED: cytochrome c biogenesis protein CCS1, chloroplastic [Vitis vinifera][more]
gi|255556127|ref|XP_002519098.1|3.2e-17963.72PREDICTED: cytochrome c biogenesis protein CCS1, chloroplastic isoform X1 [Ricin... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR007816ResB-like_domain
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0044262 cellular carbohydrate metabolic process
biological_process GO:0019288 isopentenyl diphosphate biosynthetic process, methylerythritol 4-phosphate pathway
biological_process GO:0010027 thylakoid membrane organization
biological_process GO:0019252 starch biosynthetic process
biological_process GO:0043085 positive regulation of catalytic activity
biological_process GO:0006655 phosphatidylglycerol biosynthetic process
biological_process GO:0016556 mRNA modification
biological_process GO:0000023 maltose metabolic process
biological_process GO:0009965 leaf morphogenesis
biological_process GO:0030154 cell differentiation
biological_process GO:0044260 cellular macromolecule metabolic process
biological_process GO:0016117 carotenoid biosynthetic process
biological_process GO:0044767 single-organism developmental process
biological_process GO:0044723 single-organism carbohydrate metabolic process
biological_process GO:0008654 phospholipid biosynthetic process
biological_process GO:0008299 isoprenoid biosynthetic process
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG04g08360.1Cp4.1LG04g08360.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR007816ResB-like domainPFAMPF05140ResBcoord: 113..329
score: 3.3E-38coord: 378..459
score: 1.8
NoneNo IPR availablePANTHERPTHR31566FAMILY NOT NAMEDcoord: 61..471
score: 3.4E
NoneNo IPR availablePANTHERPTHR31566:SF0CYTOCHROME C BIOGENESIS PROTEIN CCS1, CHLOROPLASTICcoord: 61..471
score: 3.4E

The following gene(s) are paralogous to this gene:

None