Cp4.1LG08g02470 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG08g02470
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Chloroplast, plasma membrane, plastid, chloroplast envelope, putative
Location: Cp4.1LG08 : 2764805 .. 2768885 (-)
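The location string above can be read programmatically. The following Python sketch parses a string of this form and computes the feature span. It assumes the coordinates are 1-based and inclusive, which this page does not state. The names `parse_location` and `feature_length` are illustrative and not part of any database API.

```python
import re

# Pattern for strings such as "Cp4.1LG08 : 2764805 .. 2768885 (-)".
LOCATION_RE = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text):
    """Return (seqid, start, end, strand) parsed from a location string."""
    match = LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def feature_length(start, end):
    """Span in bases, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cp4.1LG08 : 2764805 .. 2768885 (-)")
print(seqid, strand, feature_length(start, end))
```

Under the stated assumption, the span of this feature would be 4081 bases on the minus strand of Cp4.1LG08.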
The following sequences are available for this feature:

Gene sequence (with intron)

TGAAAACCCACTAGTCACTGGCAAGGCAACTTCCAAAAATGCAAGAGGCAGGCAGCAACAGATAAGGGAAGAAGGTAAATGCAAAGCCAGCCTACACCCTAAACCCATTTCATTCTGAGCTTCCAAGAAGAACAAGAAACGCATCAATGGCGCACCTCAGAACCGCCATGGATTCCGCCTTCTGGGATTTCGACGTTTCCTCCTCTCAAACCCTCGTCGGAACCGCCAAGGCTGTTCCCGGCGAACCATTCCCTCTCGACGGAGCTCGAGCCAGCCGCACCTTGCGGATTCAGCAAGTCTCCTTCCTCGGCAATGGATTTCCCCTCGGAATTCTTCCTTCCTTCTCCCCCACTGCACACAAGGAGTTAGGTTCCTTTTCTCTTCAGTCGCTCTTGCTCAAGTTTCCCGCCGCCGACTGGTAAGTAGCAATGGCACATTCTTCTTCTCTTCGGTTTATGGAGCTTTGTGATCTTAAATTCCTAGGTTATGGTTGCTTCGTTCTTTATGCTTCCTGCATTTCATTTTTGGACTGTTCAGTGAATTTCTTGACGGTAGAGATGAATCAAATGTTTGATTGACTGGCTTAATTATTTCTACAGTCATTACTCTACTGAGAATCTCAGAATACCAATTCATGCTGAACATACTGACAGAGTTCACTGTTATTTCTTAAGAAAGGGGTCAAGGATGATTAGAAAGTGCTCCCCTGCATCATCCCAAACTACCAGATTTAGCTTACCAAAACCATAATCCCCTATGATACTACCTAGTTTATATACTCACCTTAGTTTTATTCTCCTAACTCGAATGTTGGTAGCTCAGTTGGTTAAGGTATCTACAAATTTAACCCCCACTCAAAGCATTACGGTTGCTAACGTGGGTATAGCTCAACTGGTTAAGGTATCTACAAACTTAACCCCCCACTCAAAGCGTTACAGTTGCTAACGTGGGTATAGCCCAACTGATTAAGGTATCTACAAATTTAACTCCTCACTCAAAGCGTTATGGTTGCGAACGTAGGCATATCTCAACTGGTTGAGGTATCTACAAACTTAACTCTCCACTCAAAGCGTTACGGTGGCTAACGTGAGTATAGCTCAATTGGTTAAGGTATCTACAAATTTAACCCCCCACTCAAAGCGTTACGGTTTCTAACGTGGGTATAGCTCAACTGGTTAAGGTATCTACAAACTTAACCCCCCACTCAAAGCGTTACAGTTGCTAATGTGGGTATAGCCCAACTAATTAAGGTATCTACAAATTTAACCCCTCACTCAAAGCGTTATGGTTGCCAACATAGGCATAGCTCAACTGGTTGAGGTATCTACAAATTTAACTCTCCACTCAAAGCGTTACGGTTGCTAACGTGAGTATAGCTCAATTGGTTAAGGTATCTACAAATTTAACCCCCCACGCAAAGTGTTACGCTTGCTAACGTGAGTATAGCTCAACTGGTTAAGGTATCTACAAATTTAACCCCCCACTCAAAAGCGTTACGGTTGCCAATGTGAGTATAGCTCAATTGGTTAAGGTATCTACAAATTTAACCCCCCACTCAAAGCGTTATAGTTGCCAACGTGAGTATAGCTCAATTGGTTAAGGTATCTACAAATGTAACCCCCCACTCAAAGCTTACGGTTGCCAACGTAGGTATAGCTCAACTAGTTAAGGTATCTATAAATTTAACCCCCACTCAAAGCGTTACGGTTACGAACATGGGTATAGCTCAACTTAACTTTACCCCCCACTCAAAGTGTTTCTTCTAACTTAAATTTACGGTTGCCAACATGGGTATATCTCAAATGTTTGAATCCCTCACTCTACCATTGACGAAGAAGTCAAATGTTTGAATCTCTCGCCCCCGACATTATACATTTACATGTGGCCTATCACATACTCTTTTGGTTTCTAGTATTGAAGTTTTGCTTTCTCATTTGTTTAAGCAAGTTTTGTAAAGGAAAAAGGTCCAAGATTACCATGTTTACCTCTGACATTGTTTA
GATTGCTACTGATTTCACAAATCTCAGATGTACCATCTAATGGAATCTTTACATTTTAGGTGGGTTGGATTGGTTGGCCAATTCCGTCCGAAGAAAGTGATATCTTCTATAAAAGAGGACCTTGTTTCTGATCTAGACAACCTTGAGCTCCTCCCTGCCTTGAAAGATGTTGCTACCATGTTTCTGGACAAGACACTCTATTCATATGGATTATGCTCTCAGTTTTCTCCTACTCCCTTTTCATCTGTATTTGCCAGCACGGAAGAGCACGGTGACAGGAAAGGACGTCGCCACAAAGCGATGTTTTATCACAGGGTGAGCGTTGTTGTCTCTTATGTTTCATTAAATGATAACTAAAGGAAAAGAGATGAATCATAAACGCGTTTCCTCGAGTTCTTTAAATAATGAGATCAGCCCTTTGATGCATATTTCTGTATAATGCTGATTGGATTCTATCTTTCTTCATGGAGTCTGCTTACTTATCTGTGGTGCTGTATTTATCTTATGCCTAAATAGGATCTTGCATGATTTGAACACCACTTATAACCTATTGGCAATATATTTTGCATCTGTAGGCCAAGTTATAACATGTATGAAGTTTTTAAGCTTCAATCGTGTTTAATGCCCCGTTAATATTAAGCATCGGAGAACATCGAAAACGTTTTAGTTTTCTTTATTTTACTTCTTCAATGGTGATCACTACCTTTTTATAAGCCATCTTTAATCATGCTTCTGCAGCTTCCTCATCATGATATAAATTTGGAAGCAGCTTGGCCAGAGCTCTTCATTGATCATAAAGGTCAATATTGGGAAGTGCCCGAGTCTCTGTCTTTGGATCTGTCGTCTCTTAAGTCTGAATCTGGTTTGCGTTACCGGGTCGGGTTGCATAAGAATGGTGGCGTTCCCCGGGCTCTTTATAATACCGATGGTGGCGACCCACCTCTTACTCTTATGCCTGGATTATGTGCAAAGGCTGCATTCTCTTTAGAAAAGAATAGGTACCTTTGGGGGGGAAAAGAACAGAAACAAGGCGTAACTGAGACGATAGACGATGCCGAACCATCATACGATGTGCGCCTTAAAGATCCTCATGCAGCCATATCCGGAATTGTTGGTCAGTCTCCTGCCTTTCAAAAACGTTATTTTAAGCATCCAGTTTTGCTGCGTTTTTCATATTAAGTATTTGCATATCAGGTGGCACCTTTAGCGCTTGGTTCGGAGGCAGTGACACGGTTGGGACGAACGGAGATGGAAACTTAGCTATCCATAACAAAAGAAGTCCACTGAATGCTGACCTTTTTGGCTCACTTTGCTGTACTTACCAACATGGGTCATTTAGAAAGGATTTTCGTGACCTCACGAGGTTAGATGCTCGACTAGATATTTCGTCGGGTTCAGCCTTTTCCAAAAGAGTTTTCAATGGGTTCAAGAAATCTATTGATGATCTGGAGAGATCAAAATCTACCCCTAGGCTCAATTTGATCTTCCAACAGCAGGTAATAACTTCTAAGTTTGTGATTCTAGTGCAGCTGCCCCCATTCATATATCTCATCTGTTAGGCATGATATAAGAAACTCATGATGTTAAGAAGAAATCCCATTTGCAGATTGCAGGCCCGATCGTCTTCCGAGTAGATTCCCGGCTTATGCTCGACTCTACCTCCGTCAAGCGCGGACCCCATGTCGAGGACACAATACTAAGCCTGAACTATTCATTCAAGCTTCTTGAATCAGGAAAAGCTGTTTTCTGGTTTTCTCCCAAGAGAAAAGAAGGGATGGTCGAGTTGCGCCTGTTCGAGTTTTGACTTCGATATCGTTTAATTCTGTTTTAGTTCAGTTGATGCGTTCAGTTTCTTAGATTTTTGACAACGAAATCGGCTCTACAGACTTAGTATAGCACTTGAGGCTGTTGCAGATGTAATATATAGGAGTGGCGTCCTTGTTTATGGCATAGAGCTGAGGCTTTGAACAGTTGTTGCTCAGAAAATAAGGGCATTTGTG
ATGAATTTGTAGTTATTGTCTAGAAATCATTGAAACTTGCAGCTAAACATGGGCTGTTCATTGCTCCAAATCTCATCTGTT

mRNA sequence

TGAAAACCCACTAGTCACTGGCAAGGCAACTTCCAAAAATGCAAGAGGCAGGCAGCAACAGATAAGGGAAGAAGGTAAATGCAAAGCCAGCCTACACCCTAAACCCATTTCATTCTGAGCTTCCAAGAAGAACAAGAAACGCATCAATGGCGCACCTCAGAACCGCCATGGATTCCGCCTTCTGGGATTTCGACGTTTCCTCCTCTCAAACCCTCGTCGGAACCGCCAAGGCTGTTCCCGGCGAACCATTCCCTCTCGACGGAGCTCGAGCCAGCCGCACCTTGCGGATTCAGCAAGTCTCCTTCCTCGGCAATGGATTTCCCCTCGGAATTCTTCCTTCCTTCTCCCCCACTGCACACAAGGAGTTAGGTTCCTTTTCTCTTCAGTCGCTCTTGCTCAAGTTTCCCGCCGCCGACTGGTGGGTTGGATTGGTTGGCCAATTCCGTCCGAAGAAAGTGATATCTTCTATAAAAGAGGACCTTGTTTCTGATCTAGACAACCTTGAGCTCCTCCCTGCCTTGAAAGATGTTGCTACCATGTTTCTGGACAAGACACTCTATTCATATGGATTATGCTCTCAGTTTTCTCCTACTCCCTTTTCATCTGTATTTGCCAGCACGGAAGAGCACGGTGACAGGAAAGGACGTCGCCACAAAGCGATGTTTTATCACAGGCTTCCTCATCATGATATAAATTTGGAAGCAGCTTGGCCAGAGCTCTTCATTGATCATAAAGGTCAATATTGGGAAGTGCCCGAGTCTCTGTCTTTGGATCTGTCGTCTCTTAAGTCTGAATCTGGTTTGCGTTACCGGGTCGGGTTGCATAAGAATGGTGGCGTTCCCCGGGCTCTTTATAATACCGATGGTGGCGACCCACCTCTTACTCTTATGCCTGGATTATGTGCAAAGGCTGCATTCTCTTTAGAAAAGAATAGGTACCTTTGGGGGGGAAAAGAACAGAAACAAGGCGTAACTGAGACGATAGACGATGCCGAACCATCATACGATGTGCGCCTTAAAGATCCTCATGCAGCCATATCCGGAATTGTTGGTGGCACCTTTAGCGCTTGGTTCGGAGGCAGTGACACGGTTGGGACGAACGGAGATGGAAACTTAGCTATCCATAACAAAAGAAGTCCACTGAATGCTGACCTTTTTGGCTCACTTTGCTGTACTTACCAACATGGGTCATTTAGAAAGGATTTTCGTGACCTCACGAGGTTAGATGCTCGACTAGATATTTCGTCGGGTTCAGCCTTTTCCAAAAGAGTTTTCAATGGGTTCAAGAAATCTATTGATGATCTGGAGAGATCAAAATCTACCCCTAGGCTCAATTTGATCTTCCAACAGCAGATTGCAGGCCCGATCGTCTTCCGAGTAGATTCCCGGCTTATGCTCGACTCTACCTCCGTCAAGCGCGGACCCCATGTCGAGGACACAATACTAAGCCTGAACTATTCATTCAAGCTTCTTGAATCAGGAAAAGCTGTTTTCTGGTTTTCTCCCAAGAGAAAAGAAGGGATGGTCGAGTTGCGCCTGTTCGAGTTTTGACTTCGATATCGTTTAATTCTGTTTTAGTTCAGTTGATGCGTTCAGTTTCTTAGATTTTTGACAACGAAATCGGCTCTACAGACTTAGTATAGCACTTGAGGCTGTTGCAGATGTAATATATAGGAGTGGCGTCCTTGTTTATGGCATAGAGCTGAGGCTTTGAACAGTTGTTGCTCAGAAAATAAGGGCATTTGTGATGAATTTGTAGTTATTGTCTAGAAATCATTGAAACTTGCAGCTAAACATGGGCTGTTCATTGCTCCAAATCTCATCTGTT

Coding sequence (CDS)

ATGGCGCACCTCAGAACCGCCATGGATTCCGCCTTCTGGGATTTCGACGTTTCCTCCTCTCAAACCCTCGTCGGAACCGCCAAGGCTGTTCCCGGCGAACCATTCCCTCTCGACGGAGCTCGAGCCAGCCGCACCTTGCGGATTCAGCAAGTCTCCTTCCTCGGCAATGGATTTCCCCTCGGAATTCTTCCTTCCTTCTCCCCCACTGCACACAAGGAGTTAGGTTCCTTTTCTCTTCAGTCGCTCTTGCTCAAGTTTCCCGCCGCCGACTGGTGGGTTGGATTGGTTGGCCAATTCCGTCCGAAGAAAGTGATATCTTCTATAAAAGAGGACCTTGTTTCTGATCTAGACAACCTTGAGCTCCTCCCTGCCTTGAAAGATGTTGCTACCATGTTTCTGGACAAGACACTCTATTCATATGGATTATGCTCTCAGTTTTCTCCTACTCCCTTTTCATCTGTATTTGCCAGCACGGAAGAGCACGGTGACAGGAAAGGACGTCGCCACAAAGCGATGTTTTATCACAGGCTTCCTCATCATGATATAAATTTGGAAGCAGCTTGGCCAGAGCTCTTCATTGATCATAAAGGTCAATATTGGGAAGTGCCCGAGTCTCTGTCTTTGGATCTGTCGTCTCTTAAGTCTGAATCTGGTTTGCGTTACCGGGTCGGGTTGCATAAGAATGGTGGCGTTCCCCGGGCTCTTTATAATACCGATGGTGGCGACCCACCTCTTACTCTTATGCCTGGATTATGTGCAAAGGCTGCATTCTCTTTAGAAAAGAATAGGTACCTTTGGGGGGGAAAAGAACAGAAACAAGGCGTAACTGAGACGATAGACGATGCCGAACCATCATACGATGTGCGCCTTAAAGATCCTCATGCAGCCATATCCGGAATTGTTGGTGGCACCTTTAGCGCTTGGTTCGGAGGCAGTGACACGGTTGGGACGAACGGAGATGGAAACTTAGCTATCCATAACAAAAGAAGTCCACTGAATGCTGACCTTTTTGGCTCACTTTGCTGTACTTACCAACATGGGTCATTTAGAAAGGATTTTCGTGACCTCACGAGGTTAGATGCTCGACTAGATATTTCGTCGGGTTCAGCCTTTTCCAAAAGAGTTTTCAATGGGTTCAAGAAATCTATTGATGATCTGGAGAGATCAAAATCTACCCCTAGGCTCAATTTGATCTTCCAACAGCAGATTGCAGGCCCGATCGTCTTCCGAGTAGATTCCCGGCTTATGCTCGACTCTACCTCCGTCAAGCGCGGACCCCATGTCGAGGACACAATACTAAGCCTGAACTATTCATTCAAGCTTCTTGAATCAGGAAAAGCTGTTTTCTGGTTTTCTCCCAAGAGAAAAGAAGGGATGGTCGAGTTGCGCCTGTTCGAGTTTTGA

Protein sequence

MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGEPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFRPKKVISSIKEDLVSDLDNLELLPALKDVATMFLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHHDINLEAAWPELFIDHKGQYWEVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYNTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETIDDAEPSYDVRLKDPHAAISGIVGGTFSAWFGGSDTVGTNGDGNLAIHNKRSPLNADLFGSLCCTYQHGSFRKDFRDLTRLDARLDISSGSAFSKRVFNGFKKSIDDLERSKSTPRLNLIFQQQIAGPIVFRVDSRLMLDSTSVKRGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF
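The relationship between the CDS and the protein sequence can be checked with a short translation routine. The Python sketch below builds the standard genetic code table inline and translates a nucleotide string codon by codon, stopping at the first stop codon. It assumes the standard nuclear code applies to this gene, and the function name `translate` is illustrative.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS string to protein, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS listed above.
print(translate("ATGGCGCACCTCAGAACCGCCATGGATTCC"))
```

Applied to the first ten codons of the CDS, this yields the same leading residues as the protein sequence above (MAHLRTAMDS), and the final codons CTG TTC GAG TTT TGA give the C-terminal LFEF followed by the stop.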
BLAST of Cp4.1LG08g02470 vs. Swiss-Prot
Match: TGD4_ARATH (Protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic OS=Arabidopsis thaliana GN=TGD4 PE=1 SV=1)

HSP 1 Score: 285.8 bits (730), Expect = 8.5e-76
Identity = 176/487 (36.14%), Positives = 262/487 (53.80%), Query Frame = 1

Query: 1   MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGEPFPLDGARASRTLRIQQVSFLGNGFPL 60
           M  +R   +   WD D+S+  TL GTA+AVP +P PL  +R +R  R +QV F       
Sbjct: 1   MNRMRWVGEGDIWDLDMSTPVTLEGTARAVPDDPLPLGLSRGTRLSRPKQVEFFHRFMAS 60

Query: 61  GILPSFSP----TAHKELGSFSLQSLLLKFPAADWWVGLVGQFRPKKVISSI-KEDLVSD 120
            ++PSFSP    T     G FSLQ +L    + +W V L+GQF  ++ ++ I K      
Sbjct: 61  PLIPSFSPIRPNTGDGGGGGFSLQRVLTLPFSNNWLVSLLGQFDVQRFVTEIDKTKAFGR 120

Query: 121 LDNLELLPALKDVATMFLDKTLYSYGLCSQFSPTPFSSVFASTEEH-GDR-KGRRHKAMF 180
             +  +   L  +     DK+LY+ G CS+F  +P  ++  S + + GD  K  R KA+F
Sbjct: 121 GSSSTVASRLNTIGKHLKDKSLYALGFCSEFLLSPDDTLLLSYDAYKGDLDKNPRAKAIF 180

Query: 181 YHRLPHHDINLEAAWPELFIDHKGQYWEVPESLSLDLSSLKSESGLRYRVGLHKNGGVPR 240
            H  P H++  EA WP LF+D  G+YW+VP S+++DL+SL +ESG  Y + LH N G P+
Sbjct: 181 NHEFPLHNLTAEAVWPGLFVDKHGEYWDVPLSMAIDLASLPAESGPSYHLCLHHNSGSPK 240

Query: 241 ALYNTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETIDDAEPSYDVRLKDP 300
            L++     PP +L+PGL  K+A S   N  LW      +G T  ++  +P YDV L  P
Sbjct: 241 KLHSDTMEVPPPSLLPGLSLKSAVSYRTNMDLW------RGTTPKLETCKP-YDVFLSSP 300

Query: 301 HAAISGIVGGTFSAWFGGSDTVG-----TNGDGNLAIH--NKRSPLNADLFGSLCCTYQH 360
           H A+SGI+G   +A FG +         + G G  ++H  +  S   AD  G    T Q+
Sbjct: 301 HVAVSGIIGSVMTAAFGENSIRSKFENDSEGVGGFSLHFPSVNSGFMADALGRASLTAQY 360

Query: 361 GSFRKDFRDLTRLDARLDISSGSAF-------SKRVFNGFKKSIDDLERSKSTPRLNLIF 420
           G+F+K F DLTR  ARLD   G  F       ++ + N  + S++  +  K  P + +  
Sbjct: 361 GNFQKFFFDLTRFHARLDFPHGLRFLTGATSVAQDLLNSRQPSLEAFQ--KICPEVLVSL 420

Query: 421 QQQIAGPIVFRVDSRLMLDSTSVKRGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGM 467
           QQQI GP  F+V+S + +D  +      V+ T+ ++ Y+ ++L S KAV  +SPK+ E M
Sbjct: 421 QQQIVGPFSFKVESGIEIDLRNGANPVTVDKTVFAIEYALQVLLSAKAVVSYSPKQNEFM 478
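
The percentages in each HSP summary line are the ratio of matching columns to alignment length. The Python sketch below recomputes them from a summary line. The function name `hsp_percentages` is illustrative, and the pattern accepts any label starting with "Pos" so that spelling variants of "Positives" in the report are tolerated.

```python
import re

# Matches "Identity = 176/487" and "Positives = 262/487" style fields.
STAT_RE = re.compile(r"(Identity|Pos\w*)\s*=\s*(\d+)/(\d+)")


def hsp_percentages(line):
    """Return identity and positives percentages recomputed from the counts."""
    result = {}
    for label, matched, aligned in STAT_RE.findall(line):
        key = "identity" if label == "Identity" else "positives"
        result[key] = round(100 * int(matched) / int(aligned), 2)
    return result


summary = "Identity = 176/487 (36.14%), Positives = 262/487 (53.80%), Query Frame = 1"
print(hsp_percentages(summary))
```

For the Swiss-Prot hit above, 176 of 487 aligned columns are identical (36.14%) and 262 of 487 are scored as positive substitutions (53.80%).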

BLAST of Cp4.1LG08g02470 vs. TrEMBL
Match: A0A0A0L4I9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G128880 PE=4 SV=1)

HSP 1 Score: 729.2 bits (1881), Expect = 3.2e-207
Identity = 357/472 (75.64%), Positives = 407/472 (86.23%), Query Frame = 1

Query: 1   MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGEPFPLDGARASRTLRIQQVSFLGNGFPL 60
           MA+LRTAMDSAFWD ++SS QTL GTAKAVPGEPFPLDGARASRTLRIQQ+SFLGNGFPL
Sbjct: 1   MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRTLRIQQLSFLGNGFPL 60

Query: 61  GILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFRPKKVISSIKEDLVSDLDNLE 120
           GI+PS+ PTAHKELGSFSLQSLL   P+  WW GLVGQFRPKK+ISSIK   +S ++ LE
Sbjct: 61  GIIPSYCPTAHKELGSFSLQSLLFMMPSVKWWAGLVGQFRPKKLISSIKAQ-ISAVEQLE 120

Query: 121 LLPALKDVATMFLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHH 180
            L  LKD+A++FLDK+LY+YG+CSQFS  PFSSV+ STE+ G+RKG RHKAMFYHRLP H
Sbjct: 121 -LSDLKDIASLFLDKSLYTYGICSQFSTGPFSSVYVSTEKLGERKGHRHKAMFYHRLPEH 180

Query: 181 DINLEAAWPELFIDHKGQYWEVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYNTDG 240
           DIN++AAWPELFIDHKGQYW+VPES+SLDLSSLKSESGLRYRVGLHKNGGVPRAL +T+ 
Sbjct: 181 DINVDAAWPELFIDHKGQYWDVPESISLDLSSLKSESGLRYRVGLHKNGGVPRALNSTNS 240

Query: 241 GDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVT----ETIDDAEPSYDVRLKDPHAA 300
            DPPLTL+PGLCAKAAFS+EKNR LW     ++ +T     T    EP+YDVRL +PHAA
Sbjct: 241 DDPPLTLLPGLCAKAAFSIEKNRDLWRDNLSEEEMTINYIRTGLKKEPAYDVRLDEPHAA 300

Query: 301 ISGIVGGTFSAWFGGSDTVGTNGDGNLAI-HNKRSPLNADLFGSLCCTYQHGSFRKDFRD 360
           ISGI+GGT S+WFGGSDTVG+NGDGNL + H KRSPLNADLFGS+C TYQHG F  DF D
Sbjct: 301 ISGIIGGTVSSWFGGSDTVGSNGDGNLTMGHKKRSPLNADLFGSICYTYQHGKFLNDFND 360

Query: 361 LTRLDARLDISSGSAFSKRVFNGFKKSIDDLERSKSTPRLNLIFQQQIAGPIVFRVDSRL 420
           LTR+DARL ISS S F+KRVF+ FKKS+DDLERSKS+PRLNLIFQQQ+AGPIVFR++S+L
Sbjct: 361 LTRIDARLSISSASGFAKRVFHVFKKSVDDLERSKSSPRLNLIFQQQVAGPIVFRLESKL 420

Query: 421 MLDSTSVKRGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF 468
           +LDS S K GPHVEDTI SL YSF  LES KAVFW+SPKRKEGMVELRL+EF
Sbjct: 421 LLDSASGKIGPHVEDTICSLTYSFLDLESAKAVFWYSPKRKEGMVELRLYEF 470

BLAST of Cp4.1LG08g02470 vs. TrEMBL
Match: D7SM50_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_15s0021g00530 PE=4 SV=1)

HSP 1 Score: 609.8 bits (1571), Expect = 2.9e-171
Identity = 304/470 (64.68%), Positives = 371/470 (78.94%), Query Frame = 1

Query: 1   MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGEPFPLDGARASRTLRIQQVSFLGNGFPL 60
           MA+LRTAMD+AFWD D+SS QTL G A+AVPG+PFPL+GARASR LR+QQ+SFLGNGFPL
Sbjct: 1   MANLRTAMDAAFWDLDISSPQTLHGAARAVPGDPFPLEGARASRALRVQQLSFLGNGFPL 60

Query: 61  GILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFRPKKVISSIKEDLVSDLDNLE 120
           GI+PSFSPT+ K+LGSFSLQSL L+   ++WW+GL GQFRPKK+ISSIK DL S +D  E
Sbjct: 61  GIIPSFSPTSQKDLGSFSLQSLFLRPSTSNWWLGLTGQFRPKKLISSIKADL-SAVDEWE 120

Query: 121 LLPALKDVATMFLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHH 180
           L    K+VA  F+DK+L+S+GLCSQ S T  SS+  STE+HG++KGRR++ M +H+LP H
Sbjct: 121 L-STFKEVAKHFIDKSLFSFGLCSQLSLTSASSLMVSTEQHGEKKGRRNRVMLFHQLPFH 180

Query: 181 DINLEAAWPELFIDHKGQYWEVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYNTDG 240
           DI LEAAWPELFIDHKG+YWE+PES+SL LSSL SESGLRYR G+HKNGG P+++ N   
Sbjct: 181 DITLEAAWPELFIDHKGRYWELPESISLGLSSLVSESGLRYRFGIHKNGGHPQSV-NAIN 240

Query: 241 GDPPLTLMPGLCAKAAFSLEKNRYLWGGKE-QKQGVTETIDDA--EPSYDVRLKDPHAAI 300
            + P  LMPGLCAKAAFS EK+R LW  +E Q+ G+ +T       PSYD+RL++PHAAI
Sbjct: 241 DEAPSALMPGLCAKAAFSYEKSRDLWRQREKQEDGIVKTERGLVWRPSYDIRLREPHAAI 300

Query: 301 SGIVGGTFSAWFGGSDTVGTNGDGNLAIHNKRSPLNADLFGSLCCTYQHGSFRKDFRDLT 360
           SGI+GGT  AWFGGS     +GDG+ A   KRSP  ADLF S CCT+QHG FRK + DLT
Sbjct: 301 SGIIGGTCEAWFGGSRE---HGDGSSADAKKRSPFGADLFASGCCTFQHGQFRKRYGDLT 360

Query: 361 RLDARLDISSGSAFSKRVFNGFKKSIDDLERSKSTPRLNLIFQQQIAGPIVFRVDSRLML 420
           R+DARL+I S SA +KRV N F  S++  +   S+PRLNLIFQQQ+AGPIVFRVDS+L+L
Sbjct: 361 RVDARLNICSASALAKRVSNLFSSSVNGAKDPLSSPRLNLIFQQQVAGPIVFRVDSKLLL 420

Query: 421 DSTSVKRGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF 468
           DS+  + GP +ED   SLNYS +LL SGK V W+SPKRKEGM+ELRLFEF
Sbjct: 421 DSSGGRAGPQLEDFTYSLNYSLRLLRSGKVVAWYSPKRKEGMIELRLFEF 464

BLAST of Cp4.1LG08g02470 vs. TrEMBL
Match: M5X101_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005217mg PE=4 SV=1)

HSP 1 Score: 608.2 bits (1567), Expect = 8.3e-171
Identity = 306/475 (64.42%), Positives = 373/475 (78.53%), Query Frame = 1

Query: 1   MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGEPFPLDGARASRTLRIQQVSFLGNGFPL 60
           MA+LRTAMDSAFWD +VSS  TL G+AKA+PG+PFP+DGARASR LRIQQ+S LGNGFPL
Sbjct: 1   MANLRTAMDSAFWDLNVSSPHTLEGSAKAIPGDPFPIDGARASRVLRIQQLSLLGNGFPL 60

Query: 61  GILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFRPKKVISSIKEDLVSDLDNLE 120
           GI+PS+SPT+HK+LGSFSLQSLLL+   ++WW+GL+GQFRPKK+ISSIK +  S  D++E
Sbjct: 61  GIIPSYSPTSHKDLGSFSLQSLLLRPATSNWWLGLIGQFRPKKLISSIKAEF-STNDDME 120

Query: 121 LLPALKDVATMFLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHH 180
            +P  KDVA   LDK+LYS+GLC+Q    P SS+  STE HG++KGRR+K M +H+LP+H
Sbjct: 121 -VPTFKDVAKHVLDKSLYSFGLCTQLLVAPSSSIKLSTEGHGEKKGRRNKFMLFHKLPYH 180

Query: 181 DINLEAAWPELFIDHKGQYWEVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYNTDG 240
           DI LEAAWPELFIDHKGQYW+VPES+SLDLSSL SESGLRYR+G+HKN G P+A+ + D 
Sbjct: 181 DITLEAAWPELFIDHKGQYWDVPESISLDLSSLVSESGLRYRIGIHKNSGHPQAVNSID- 240

Query: 241 GDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETIDDA---EPSYDVRLKDPHAAI 300
           G+ P +LMPGLCAKAAFS EK++ LW  KE K+ V    D+     PSYDVRLK+PHAA+
Sbjct: 241 GEVPTSLMPGLCAKAAFSYEKSQDLWRQKETKKDVMVKKDNGWFWRPSYDVRLKEPHAAV 300

Query: 301 SGIVGGTFSAWFGGSDT---VGTNGD-GNLAIHNKRSPLNADLFGSLCCTYQHGSFRKDF 360
           SGI GG+ +AWF    +   V   GD  N     KRSP +AD FGS+C ++QHG FR+ +
Sbjct: 301 SGIFGGSCTAWFQDGHSPVAVELRGDEDNSTSTKKRSPFSADFFGSVCYSFQHGKFRELY 360

Query: 361 RDLTRLDARLDISSGSAFSKRVFNGFKKSIDDLERS-KSTPRLNLIFQQQIAGPIVFRVD 420
            DLTR+DARLDI S SA +KRV NG K S  +  R   S+PR+NLIFQQQ+AGPIVFRVD
Sbjct: 361 GDLTRIDARLDICSASALAKRVINGLKSSSANSARDPMSSPRINLIFQQQVAGPIVFRVD 420

Query: 421 SRLMLDSTSVKRGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF 468
           SR+ LDS   KRGPH+ED I SLNYS +LL SGK V W+SPKRKEGM+ELR+FEF
Sbjct: 421 SRVSLDSLPGKRGPHIEDFIYSLNYSLRLLRSGKVVAWYSPKRKEGMIELRVFEF 472

BLAST of Cp4.1LG08g02470 vs. TrEMBL
Match: W9RTX5_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_023905 PE=4 SV=1)

HSP 1 Score: 601.7 bits (1550), Expect = 7.8e-169
Identity = 305/485 (62.89%), Positives = 363/485 (74.85%), Query Frame = 1

Query: 1   MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGEPFPLDGARASRTLRIQQVSFLGNGFPL 60
           MA LRTAMDSAFWD D+++ + L G AKA+PGEPFP+DGARASR LRIQQVS LGNGFPL
Sbjct: 1   MARLRTAMDSAFWDLDLATPRVLDGNAKAIPGEPFPMDGARASRALRIQQVSLLGNGFPL 60

Query: 61  GILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFRPKKVISSIKEDLVSDLDNLE 120
           GI+PS SPT+ K+LGSFSLQSLLLK   ++WW+GL+GQFRPKK+ISSIK +  SD     
Sbjct: 61  GIIPSLSPTSSKDLGSFSLQSLLLKPSTSNWWLGLIGQFRPKKLISSIKAEFKSDEAE-- 120

Query: 121 LLPALKDVATMFLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHR---- 180
             P+ KDVA   LDK+LYS+GL +Q SPTP +S+  STE HG++KGRRHK M +H+    
Sbjct: 121 -FPSFKDVAKHILDKSLYSFGLTTQLSPTPSTSIKWSTEGHGEKKGRRHKMMLFHKASIH 180

Query: 181 ----------LPHHDINLEAAWPELFIDHKGQYWEVPESLSLDLSSLKSESGLRYRVGLH 240
                     LP+HDI  EAAWP+LF+DHKGQYW+VPES+SLDL SL SESGLRYR+GLH
Sbjct: 181 IEFLYYINTNLPYHDITFEAAWPQLFVDHKGQYWDVPESISLDLLSLVSESGLRYRLGLH 240

Query: 241 KNGGVPRALYNTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETIDDA---E 300
           K+   P A+ N    DPP  L+PGLCAKAAFS EK+   W  +E+++ + E  D      
Sbjct: 241 KSSDHPLAV-NATSHDPPAALLPGLCAKAAFSYEKSMDFWRQREKREDIIERTDRGLFWR 300

Query: 301 PSYDVRLKDPHAAISGIVGGTFSAWFGGSDTVGTNGDGNLAIHNKRSPLNADLFGSLCCT 360
           PSYDVRL +PH+AISGI+GGT +AWFG                 KRSPL+ADLFGS+C T
Sbjct: 301 PSYDVRLNEPHSAISGIIGGTCAAWFGD--------------RQKRSPLSADLFGSVCYT 360

Query: 361 YQHGSFRKDFRDLTRLDARLDISSGSAFSKRVFNGFK-KSIDDLERSKSTPRLNLIFQQQ 420
           +QHG FRK + DLTR+DARLDI S SA +KRV N FK  S D+ E   S PRLNLIFQQQ
Sbjct: 361 FQHGCFRKFYGDLTRVDARLDICSASAIAKRVLNSFKSSSSDNTEDPASHPRLNLIFQQQ 420

Query: 421 IAGPIVFRVDSRLMLDSTSVKRGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVEL 468
           +AGPI  R+DSR++LDS+S KRGPHVED I SL YSF+LLESGKAVFW+SPKRKEGMVEL
Sbjct: 421 VAGPIAVRLDSRILLDSSSDKRGPHVEDFICSLTYSFRLLESGKAVFWYSPKRKEGMVEL 467

BLAST of Cp4.1LG08g02470 vs. TrEMBL
Match: A0A061DV28_THECC (Chloroplast, plasma membrane, plastid, chloroplast envelope, putative OS=Theobroma cacao GN=TCM_005797 PE=4 SV=1)

HSP 1 Score: 573.2 bits (1476), Expect = 3.0e-160
Identity = 293/474 (61.81%), Positives = 360/474 (75.95%), Query Frame = 1

Query: 1   MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGEPFPLDGARASRTLRIQQVSFLGNGFPL 60
           MA+L++AMDSAFWD ++S+ QTL GTAK+VPGE FP+DGARASR LRIQQ+S L NGFPL
Sbjct: 1   MANLKSAMDSAFWDQNISTPQTLEGTAKSVPGESFPVDGARASRALRIQQLSLLRNGFPL 60

Query: 61  GILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFRPKKVISSIKEDLVSDLDNLE 120
           GI+PS SP   KELGSFSLQSLLL+   ++WW+G++GQFRPKK+IS+IK +L S  D LE
Sbjct: 61  GIIPSLSPPLQKELGSFSLQSLLLRPSTSNWWLGIIGQFRPKKLISAIKTELQS-ADELE 120

Query: 121 LLPALKDVATMFLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHH 180
            L   +D A  FLDK+LYS  L +Q S +P SS+  STE  G+RK  R+K   YH+LP H
Sbjct: 121 -LSVFRDAAKHFLDKSLYSIALATQLSLSPSSSLLWSTERQGERKVYRNKFKLYHQLPDH 180

Query: 181 DINLEAAWPELFIDHKGQYWEVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYNTDG 240
           DI L+AAWPELF+DHKG+YWEVPES+SLD+SSL S+SGL Y  GLH+N G P+A +N  G
Sbjct: 181 DITLDAAWPELFMDHKGKYWEVPESISLDVSSLPSDSGLLYHFGLHRNSGHPQA-FNALG 240

Query: 241 GDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGV---TETIDDAEPSYDVRLKDPHAAI 300
           G+ P  LMPG CAKAAFS EK++  W  KE K+ V   T       PSYDV LK+PHAAI
Sbjct: 241 GEAPSALMPGFCAKAAFSYEKSKDFWRRKETKEDVFVKTNKGSFFRPSYDVCLKEPHAAI 300

Query: 301 SGIVGGTFSAWFGG---SDTVGTNGDGNL-AIHNKRSPLNADLFGSLCCTYQHGSFRKDF 360
           SGI+GGT +AWFGG   S +  + G+G++    NKRSPLN DLFGS+C T+QHG FRK +
Sbjct: 301 SGIIGGTCAAWFGGRKNSTSAKSQGEGDIPTTINKRSPLNVDLFGSVCYTFQHGQFRKLY 360

Query: 361 RDLTRLDARLDISSGSAFSKRVFNGFKKSIDDLERSKSTPRLNLIFQQQIAGPIVFRVDS 420
            DLTR+DARLDI S  +F+KR+F     S+   + S S+PRLNLIFQQQ+AGPIV RVDS
Sbjct: 361 GDLTRVDARLDICSLPSFAKRIFK--SSSVSSADNSLSSPRLNLIFQQQVAGPIVVRVDS 420

Query: 421 RLMLDSTSVKRGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF 468
           + +LDS S +RGPH+ED I SL+YS +LL SGK V W+SPKRKEGM+ELRLFEF
Sbjct: 421 KFLLDSKSGERGPHIEDLIYSLSYSLRLLHSGKVVAWYSPKRKEGMIELRLFEF 469

BLAST of Cp4.1LG08g02470 vs. TAIR10
Match: AT2G44640.1 (AT2G44640.1 FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 483.0 bits (1242), Expect = 2.0e-136
Identity = 252/470 (53.62%), Positives = 329/470 (70.00%), Query Frame = 1

Query: 1   MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGEPFPLDGARASRTLRIQQVSFLGNGFPL 60
           MA+L +A+DS FWD +VSS QTL GTA++VPGEPFPLDGARASR+ RIQQ+S L  GFPL
Sbjct: 1   MANLNSAIDSVFWDQNVSSPQTLEGTARSVPGEPFPLDGARASRSHRIQQLSLLREGFPL 60

Query: 61  GILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFRPKKVISSIKEDLVSDLDNLE 120
           GI+PS +P + K LGSFSL SLLL   + +WW+GLVGQF+PKK+ + IK D +S+ +  +
Sbjct: 61  GIIPSLAPASDKRLGSFSLNSLLLSPSSNNWWLGLVGQFKPKKLFADIKAD-ISNAEEWD 120

Query: 121 LLPALKDVATMFLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHH 180
            L  +KD A   +DK+LYS GL +Q +    SS+  STE  GD+ G R+K M  H L  H
Sbjct: 121 -LQVVKDTAKHIVDKSLYSIGLWTQIALGTSSSLLLSTERLGDKNGLRNKLMLVHPLEKH 180

Query: 181 DINLEAAWPELFIDHKGQYWEVPESLSLDLSSLKSESGLRYRVGLHKNGGVPR---ALYN 240
           D+ +EAAWP+LF+D+KG++W+VPESL++D+SSL  ESG+RYR GLHK+ G P+   A   
Sbjct: 181 DLTVEAAWPDLFLDNKGRFWDVPESLNVDVSSLVPESGVRYRFGLHKSRGNPQPVNAAGV 240

Query: 241 TDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETIDDAEPSYDVRLKDPHAAI 300
             G D P +LMPGLCAKAA S + NR LW  +E++    E        YD+RLK+PHAAI
Sbjct: 241 ESGSDAPTSLMPGLCAKAAVSYKVNRDLWRPQEKEGNTEEEDKPVFLPYDLRLKEPHAAI 300

Query: 301 SGIVGGTFSAWFGGSDTVGTNGDGNLAIHNKRSPLNADLFGSLCCTYQHGSFRKDFRDLT 360
           SGIVG + +AW          G G L    KRSP++AD+FGS C T+Q G F K + DLT
Sbjct: 301 SGIVGSSLAAWI--------TGRGMLVNGKKRSPISADVFGSACYTFQKGRFSKLYGDLT 360

Query: 361 RLDARLDISSGSAFSKRVFNGFKKSIDDLERSKSTPRLNLIFQQQIAGPIVFRVDSRLML 420
           R+DAR+D+ S  A +K++F+    + DD   +  +PRLNLIFQQQ+AGPIVF+VDS+  +
Sbjct: 361 RVDARVDLPSAFALAKKLFHASSNNSDD---TLWSPRLNLIFQQQVAGPIVFKVDSQFQV 420

Query: 421 DSTSVKRGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF 468
            +        +ED I SLNYS +LLESGK V W+SPKRKEGM+ELR+FEF
Sbjct: 421 GAA------RMEDVIYSLNYSLRLLESGKIVAWYSPKRKEGMIELRVFEF 451

BLAST of Cp4.1LG08g02470 vs. TAIR10
Match: AT3G06960.1 (AT3G06960.1 pigment defective 320)

HSP 1 Score: 285.8 bits (730), Expect = 4.8e-77
Identity = 176/487 (36.14%), Positives = 262/487 (53.80%), Query Frame = 1

Query: 1   MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGEPFPLDGARASRTLRIQQVSFLGNGFPL 60
           M  +R   +   WD D+S+  TL GTA+AVP +P PL  +R +R  R +QV F       
Sbjct: 1   MNRMRWVGEGDIWDLDMSTPVTLEGTARAVPDDPLPLGLSRGTRLSRPKQVEFFHRFMAS 60

Query: 61  GILPSFSP----TAHKELGSFSLQSLLLKFPAADWWVGLVGQFRPKKVISSI-KEDLVSD 120
            ++PSFSP    T     G FSLQ +L    + +W V L+GQF  ++ ++ I K      
Sbjct: 61  PLIPSFSPIRPNTGDGGGGGFSLQRVLTLPFSNNWLVSLLGQFDVQRFVTEIDKTKAFGR 120

Query: 121 LDNLELLPALKDVATMFLDKTLYSYGLCSQFSPTPFSSVFASTEEH-GDR-KGRRHKAMF 180
             +  +   L  +     DK+LY+ G CS+F  +P  ++  S + + GD  K  R KA+F
Sbjct: 121 GSSSTVASRLNTIGKHLKDKSLYALGFCSEFLLSPDDTLLLSYDAYKGDLDKNPRAKAIF 180

Query: 181 YHRLPHHDINLEAAWPELFIDHKGQYWEVPESLSLDLSSLKSESGLRYRVGLHKNGGVPR 240
            H  P H++  EA WP LF+D  G+YW+VP S+++DL+SL +ESG  Y + LH N G P+
Sbjct: 181 NHEFPLHNLTAEAVWPGLFVDKHGEYWDVPLSMAIDLASLPAESGPSYHLCLHHNSGSPK 240

Query: 241 ALYNTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETIDDAEPSYDVRLKDP 300
            L++     PP +L+PGL  K+A S   N  LW      +G T  ++  +P YDV L  P
Sbjct: 241 KLHSDTMEVPPPSLLPGLSLKSAVSYRTNMDLW------RGTTPKLETCKP-YDVFLSSP 300

Query: 301 HAAISGIVGGTFSAWFGGSDTVG-----TNGDGNLAIH--NKRSPLNADLFGSLCCTYQH 360
           H A+SGI+G   +A FG +         + G G  ++H  +  S   AD  G    T Q+
Sbjct: 301 HVAVSGIIGSVMTAAFGENSIRSKFENDSEGVGGFSLHFPSVNSGFMADALGRASLTAQY 360

Query: 361 GSFRKDFRDLTRLDARLDISSGSAF-------SKRVFNGFKKSIDDLERSKSTPRLNLIF 420
           G+F+K F DLTR  ARLD   G  F       ++ + N  + S++  +  K  P + +  
Sbjct: 361 GNFQKFFFDLTRFHARLDFPHGLRFLTGATSVAQDLLNSRQPSLEAFQ--KICPEVLVSL 420

Query: 421 QQQIAGPIVFRVDSRLMLDSTSVKRGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGM 467
           QQQI GP  F+V+S + +D  +      V+ T+ ++ Y+ ++L S KAV  +SPK+ E M
Sbjct: 421 QQQIVGPFSFKVESGIEIDLRNGANPVTVDKTVFAIEYALQVLLSAKAVVSYSPKQNEFM 478

BLAST of Cp4.1LG08g02470 vs. NCBI nr
Match: gi|659075684|ref|XP_008438274.1| (PREDICTED: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucumis melo])

HSP 1 Score: 749.2 bits (1933), Expect = 4.3e-213
Identity = 367/472 (77.75%), Positives = 418/472 (88.56%), Query Frame = 1

Query: 1   MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGEPFPLDGARASRTLRIQQVSFLGNGFPL 60
           MA+LRTAMDSAFWDF++SS QTL GTAK+VPGEPFPL+GARASR LRIQQ+S LG+GFPL
Sbjct: 1   MANLRTAMDSAFWDFNLSSPQTLAGTAKSVPGEPFPLNGARASRALRIQQLSLLGDGFPL 60

Query: 61  GILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFRPKKVISSIKEDLVSDLDNLE 120
           GI+PS+SPTAHKELGSFSLQSLLL+   A WWVGLVGQFRPKK+IS +K  L SD D  E
Sbjct: 61  GIIPSYSPTAHKELGSFSLQSLLLRLSGARWWVGLVGQFRPKKLISLLKAKL-SDEDGFE 120

Query: 121 LLPALKDVATMFLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHH 180
           L   LKDVA + LDK+ Y+YG+CSQFSP+PFSSV+ STE+HG+RKGRRHKAMFYHRLP H
Sbjct: 121 LSD-LKDVARLLLDKSHYTYGICSQFSPSPFSSVYVSTEKHGERKGRRHKAMFYHRLPQH 180

Query: 181 DINLEAAWPELFIDHKGQYWEVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYNTDG 240
           DIN++AAWPELFIDHKGQYW+VPES+SLDLSS+KS+SGLRYRVGLHKNGGVPRAL +T+ 
Sbjct: 181 DINVDAAWPELFIDHKGQYWDVPESISLDLSSVKSKSGLRYRVGLHKNGGVPRALNSTNN 240

Query: 241 GDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETIDDAE----PSYDVRLKDPHAA 300
            DPPLTLMPGLCAKAAFS+EK RYLW  +E+KQ  TE   + E     SYD+RLK+PHAA
Sbjct: 241 DDPPLTLMPGLCAKAAFSIEKKRYLWRVEEKKQDKTEKTGEGELDEMTSYDMRLKEPHAA 300

Query: 301 ISGIVGGTFSAWFGGSDTVGTNGDGNLAI-HNKRSPLNADLFGSLCCTYQHGSFRKDFRD 360
           ISGIVGGTFS+WFGGS+TVG+NGDGNL + H KRSPLNADLFGS+C T+Q GSF KDF D
Sbjct: 301 ISGIVGGTFSSWFGGSNTVGSNGDGNLTMGHKKRSPLNADLFGSVCYTFQRGSFIKDFGD 360

Query: 361 LTRLDARLDISSGSAFSKRVFNGFKKSIDDLERSKSTPRLNLIFQQQIAGPIVFRVDSRL 420
           LTR+DA+LDISS S F+KRVF+GFKKS+DDLERSKS+PRLNLIFQQQ+AGPIVFR+DS+L
Sbjct: 361 LTRIDAQLDISSASGFAKRVFHGFKKSVDDLERSKSSPRLNLIFQQQVAGPIVFRLDSKL 420

Query: 421 MLDSTSVKRGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF 468
           MLDS S K GPHVEDTI SL YSFKLL+SGKAVFW+SPKRKEGMVELRLFEF
Sbjct: 421 MLDSASGKIGPHVEDTIYSLTYSFKLLDSGKAVFWYSPKRKEGMVELRLFEF 470

BLAST of Cp4.1LG08g02470 vs. NCBI nr
Match: gi|449432352|ref|XP_004133963.1| (PREDICTED: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucumis sativus])

HSP 1 Score: 729.2 bits (1881), Expect = 4.6e-207
Identity = 357/472 (75.64%), Positives = 407/472 (86.23%), Query Frame = 1

Query: 1   MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGEPFPLDGARASRTLRIQQVSFLGNGFPL 60
           MA+LRTAMDSAFWD ++SS QTL GTAKAVPGEPFPLDGARASRTLRIQQ+SFLGNGFPL
Sbjct: 1   MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRTLRIQQLSFLGNGFPL 60

Query: 61  GILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFRPKKVISSIKEDLVSDLDNLE 120
           GI+PS+ PTAHKELGSFSLQSLL   P+  WW GLVGQFRPKK+ISSIK   +S ++ LE
Sbjct: 61  GIIPSYCPTAHKELGSFSLQSLLFMMPSVKWWAGLVGQFRPKKLISSIKAQ-ISAVEQLE 120

Query: 121 LLPALKDVATMFLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHH 180
            L  LKD+A++FLDK+LY+YG+CSQFS  PFSSV+ STE+ G+RKG RHKAMFYHRLP H
Sbjct: 121 -LSDLKDIASLFLDKSLYTYGICSQFSTGPFSSVYVSTEKLGERKGHRHKAMFYHRLPEH 180

Query: 181 DINLEAAWPELFIDHKGQYWEVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYNTDG 240
           DIN++AAWPELFIDHKGQYW+VPES+SLDLSSLKSESGLRYRVGLHKNGGVPRAL +T+ 
Sbjct: 181 DINVDAAWPELFIDHKGQYWDVPESISLDLSSLKSESGLRYRVGLHKNGGVPRALNSTNS 240

Query: 241 GDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVT----ETIDDAEPSYDVRLKDPHAA 300
            DPPLTL+PGLCAKAAFS+EKNR LW     ++ +T     T    EP+YDVRL +PHAA
Sbjct: 241 DDPPLTLLPGLCAKAAFSIEKNRDLWRDNLSEEEMTINYIRTGLKKEPAYDVRLDEPHAA 300

Query: 301 ISGIVGGTFSAWFGGSDTVGTNGDGNLAI-HNKRSPLNADLFGSLCCTYQHGSFRKDFRD 360
           ISGI+GGT S+WFGGSDTVG+NGDGNL + H KRSPLNADLFGS+C TYQHG F  DF D
Sbjct: 301 ISGIIGGTVSSWFGGSDTVGSNGDGNLTMGHKKRSPLNADLFGSICYTYQHGKFLNDFND 360

Query: 361 LTRLDARLDISSGSAFSKRVFNGFKKSIDDLERSKSTPRLNLIFQQQIAGPIVFRVDSRL 420
           LTR+DARL ISS S F+KRVF+ FKKS+DDLERSKS+PRLNLIFQQQ+AGPIVFR++S+L
Sbjct: 361 LTRIDARLSISSASGFAKRVFHVFKKSVDDLERSKSSPRLNLIFQQQVAGPIVFRLESKL 420

Query: 421 MLDSTSVKRGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF 468
           +LDS S K GPHVEDTI SL YSF  LES KAVFW+SPKRKEGMVELRL+EF
Sbjct: 421 LLDSASGKIGPHVEDTICSLTYSFLDLESAKAVFWYSPKRKEGMVELRLYEF 470

BLAST of Cp4.1LG08g02470 vs. NCBI nr
Match: gi|225453254|ref|XP_002265990.1| (PREDICTED: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Vitis vinifera])

HSP 1 Score: 609.8 bits (1571), Expect = 4.1e-171
Identity = 304/470 (64.68%), Positives = 371/470 (78.94%), Query Frame = 1

Query: 1   MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGEPFPLDGARASRTLRIQQVSFLGNGFPL 60
           MA+LRTAMD+AFWD D+SS QTL G A+AVPG+PFPL+GARASR LR+QQ+SFLGNGFPL
Sbjct: 1   MANLRTAMDAAFWDLDISSPQTLHGAARAVPGDPFPLEGARASRALRVQQLSFLGNGFPL 60

Query: 61  GILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFRPKKVISSIKEDLVSDLDNLE 120
           GI+PSFSPT+ K+LGSFSLQSL L+   ++WW+GL GQFRPKK+ISSIK DL S +D  E
Sbjct: 61  GIIPSFSPTSQKDLGSFSLQSLFLRPSTSNWWLGLTGQFRPKKLISSIKADL-SAVDEWE 120

Query: 121 LLPALKDVATMFLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHH 180
           L    K+VA  F+DK+L+S+GLCSQ S T  SS+  STE+HG++KGRR++ M +H+LP H
Sbjct: 121 L-STFKEVAKHFIDKSLFSFGLCSQLSLTSASSLMVSTEQHGEKKGRRNRVMLFHQLPFH 180

Query: 181 DINLEAAWPELFIDHKGQYWEVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYNTDG 240
           DI LEAAWPELFIDHKG+YWE+PES+SL LSSL SESGLRYR G+HKNGG P+++ N   
Sbjct: 181 DITLEAAWPELFIDHKGRYWELPESISLGLSSLVSESGLRYRFGIHKNGGHPQSV-NAIN 240

Query: 241 GDPPLTLMPGLCAKAAFSLEKNRYLWGGKE-QKQGVTETIDDA--EPSYDVRLKDPHAAI 300
            + P  LMPGLCAKAAFS EK+R LW  +E Q+ G+ +T       PSYD+RL++PHAAI
Sbjct: 241 DEAPSALMPGLCAKAAFSYEKSRDLWRQREKQEDGIVKTERGLVWRPSYDIRLREPHAAI 300

Query: 301 SGIVGGTFSAWFGGSDTVGTNGDGNLAIHNKRSPLNADLFGSLCCTYQHGSFRKDFRDLT 360
           SGI+GGT  AWFGGS     +GDG+ A   KRSP  ADLF S CCT+QHG FRK + DLT
Sbjct: 301 SGIIGGTCEAWFGGSRE---HGDGSSADAKKRSPFGADLFASGCCTFQHGQFRKRYGDLT 360

Query: 361 RLDARLDISSGSAFSKRVFNGFKKSIDDLERSKSTPRLNLIFQQQIAGPIVFRVDSRLML 420
           R+DARL+I S SA +KRV N F  S++  +   S+PRLNLIFQQQ+AGPIVFRVDS+L+L
Sbjct: 361 RVDARLNICSASALAKRVSNLFSSSVNGAKDPLSSPRLNLIFQQQVAGPIVFRVDSKLLL 420

Query: 421 DSTSVKRGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF 468
           DS+  + GP +ED   SLNYS +LL SGK V W+SPKRKEGM+ELRLFEF
Sbjct: 421 DSSGGRAGPQLEDFTYSLNYSLRLLRSGKVVAWYSPKRKEGMIELRLFEF 464

BLAST of Cp4.1LG08g02470 vs. NCBI nr
Match: gi|595999061|ref|XP_007217957.1| (hypothetical protein PRUPE_ppa005217mg [Prunus persica])

HSP 1 Score: 608.2 bits (1567), Expect = 1.2e-170
Identity = 306/475 (64.42%), Positives = 373/475 (78.53%), Query Frame = 1

Query: 1   MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGEPFPLDGARASRTLRIQQVSFLGNGFPL 60
           MA+LRTAMDSAFWD +VSS  TL G+AKA+PG+PFP+DGARASR LRIQQ+S LGNGFPL
Sbjct: 1   MANLRTAMDSAFWDLNVSSPHTLEGSAKAIPGDPFPIDGARASRVLRIQQLSLLGNGFPL 60

Query: 61  GILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFRPKKVISSIKEDLVSDLDNLE 120
           GI+PS+SPT+HK+LGSFSLQSLLL+   ++WW+GL+GQFRPKK+ISSIK +  S  D++E
Sbjct: 61  GIIPSYSPTSHKDLGSFSLQSLLLRPATSNWWLGLIGQFRPKKLISSIKAEF-STNDDME 120

Query: 121 LLPALKDVATMFLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHH 180
            +P  KDVA   LDK+LYS+GLC+Q    P SS+  STE HG++KGRR+K M +H+LP+H
Sbjct: 121 -VPTFKDVAKHVLDKSLYSFGLCTQLLVAPSSSIKLSTEGHGEKKGRRNKFMLFHKLPYH 180

Query: 181 DINLEAAWPELFIDHKGQYWEVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYNTDG 240
           DI LEAAWPELFIDHKGQYW+VPES+SLDLSSL SESGLRYR+G+HKN G P+A+ + D 
Sbjct: 181 DITLEAAWPELFIDHKGQYWDVPESISLDLSSLVSESGLRYRIGIHKNSGHPQAVNSID- 240

Query: 241 GDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETIDDA---EPSYDVRLKDPHAAI 300
           G+ P +LMPGLCAKAAFS EK++ LW  KE K+ V    D+     PSYDVRLK+PHAA+
Sbjct: 241 GEVPTSLMPGLCAKAAFSYEKSQDLWRQKETKKDVMVKKDNGWFWRPSYDVRLKEPHAAV 300

Query: 301 SGIVGGTFSAWFGGSDT---VGTNGD-GNLAIHNKRSPLNADLFGSLCCTYQHGSFRKDF 360
           SGI GG+ +AWF    +   V   GD  N     KRSP +AD FGS+C ++QHG FR+ +
Sbjct: 301 SGIFGGSCTAWFQDGHSPVAVELRGDEDNSTSTKKRSPFSADFFGSVCYSFQHGKFRELY 360

Query: 361 RDLTRLDARLDISSGSAFSKRVFNGFKKSIDDLERS-KSTPRLNLIFQQQIAGPIVFRVD 420
            DLTR+DARLDI S SA +KRV NG K S  +  R   S+PR+NLIFQQQ+AGPIVFRVD
Sbjct: 361 GDLTRIDARLDICSASALAKRVINGLKSSSANSARDPMSSPRINLIFQQQVAGPIVFRVD 420

Query: 421 SRLMLDSTSVKRGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF 468
           SR+ LDS   KRGPH+ED I SLNYS +LL SGK V W+SPKRKEGM+ELR+FEF
Sbjct: 421 SRVSLDSLPGKRGPHIEDFIYSLNYSLRLLRSGKVVAWYSPKRKEGMIELRVFEF 472

BLAST of Cp4.1LG08g02470 vs. NCBI nr
Match: gi|645252835|ref|XP_008232303.1| (PREDICTED: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Prunus mume])

HSP 1 Score: 607.8 bits (1566), Expect = 1.6e-170
Identity = 306/475 (64.42%), Positives = 372/475 (78.32%), Query Frame = 1

Query: 1   MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGEPFPLDGARASRTLRIQQVSFLGNGFPL 60
           MA+LRTAMDSAFWD +VSS  TL G+AKA+PG+PFP+DGARASR LRIQQ+S LGNGFPL
Sbjct: 1   MANLRTAMDSAFWDLNVSSPHTLEGSAKAIPGDPFPIDGARASRVLRIQQLSLLGNGFPL 60

Query: 61  GILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFRPKKVISSIKEDLVSDLDNLE 120
           GI+PS+SPT+HK+LGSFSLQSLLL+   ++WW+GL+GQFRPKK+ISSIK +  S  D++E
Sbjct: 61  GIIPSYSPTSHKDLGSFSLQSLLLRPATSNWWLGLIGQFRPKKLISSIKAEF-STNDDME 120

Query: 121 LLPALKDVATMFLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHH 180
            +P  KDVA   LDK+LYS+GLC+Q    P SS+  STE HG++KGRR+K M +H+LP+H
Sbjct: 121 -VPTFKDVAKHVLDKSLYSFGLCTQLLVAPSSSIKLSTEGHGEKKGRRNKFMLFHKLPYH 180

Query: 181 DINLEAAWPELFIDHKGQYWEVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYNTDG 240
           DI LEAAWPELFIDHKGQYW+VPES+SLDLSSL SESGLRYR+G+HKN G P+A+ + D 
Sbjct: 181 DITLEAAWPELFIDHKGQYWDVPESISLDLSSLVSESGLRYRIGIHKNSGHPQAVNSID- 240

Query: 241 GDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETIDDA---EPSYDVRLKDPHAAI 300
           G+ P +LMPGLCAKAAFS EK++ LW  KE K+ V    D      PSYDVRLK+PHAA+
Sbjct: 241 GEVPTSLMPGLCAKAAFSYEKSQDLWRQKETKKDVMVKKDKGWFWRPSYDVRLKEPHAAV 300

Query: 301 SGIVGGTFSAWFGGSDT---VGTNGD-GNLAIHNKRSPLNADLFGSLCCTYQHGSFRKDF 360
           SGI GG+ +AWF    +   V   GD  N     KRSP +AD FGS+C ++QHG FR+ +
Sbjct: 301 SGIFGGSCTAWFQDGHSPVAVELRGDEDNSTSSKKRSPFSADFFGSVCYSFQHGKFRELY 360

Query: 361 RDLTRLDARLDISSGSAFSKRVFNGFKKSIDDLERS-KSTPRLNLIFQQQIAGPIVFRVD 420
            DLTR+DARLDI S SA +KRV NG K S  +  R   S+PR+NLIFQQQ+AGPIVFRVD
Sbjct: 361 GDLTRIDARLDICSASALAKRVINGLKSSSANSARDPMSSPRINLIFQQQVAGPIVFRVD 420

Query: 421 SRLMLDSTSVKRGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF 468
           SR+ LDS   KRGPH+ED I SLNYS +LL SGK V W+SPKRKEGM+ELR+FEF
Sbjct: 421 SRVSLDSLPGKRGPHIEDFIYSLNYSLRLLRSGKVVAWYSPKRKEGMIELRVFEF 472

The following BLAST results are available for this feature:
Match Name  E-value  Identity (%)  Description
TGD4_ARATH  8.5e-76  36.14         Protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic OS=Arabidopsis thaliana GN=...

Match Name        E-value   Identity (%)  Description
A0A0A0L4I9_CUCSA  3.2e-207  75.64         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G128880 PE=4 SV=1
D7SM50_VITVI      2.9e-171  64.68         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_15s0021g00530 PE=4 SV=...
M5X101_PRUPE      8.3e-171  64.42         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005217mg PE=4 SV=1
W9RTX5_9ROSA      7.8e-169  62.89         Uncharacterized protein OS=Morus notabilis GN=L484_023905 PE=4 SV=1
A0A061DV28_THECC  3.0e-160  61.81         Chloroplast, plasma membrane, plastid, chloroplast envelope, putative OS=Theobro...

Match Name   E-value   Identity (%)  Description
AT2G44640.1  2.0e-136  53.62         FUNCTIONS IN: molecular_function unknown
AT3G06960.1  4.8e-77   36.14         pigment defective 320

Match Name                        E-value   Identity (%)  Description
gi|659075684|ref|XP_008438274.1|  4.3e-213  77.75         PREDICTED: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucumis melo]
gi|449432352|ref|XP_004133963.1|  4.6e-207  75.64         PREDICTED: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucumis sativus...
gi|225453254|ref|XP_002265990.1|  4.1e-171  64.68         PREDICTED: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Vitis vinifera]
gi|595999061|ref|XP_007217957.1|  1.2e-170  64.42         hypothetical protein PRUPE_ppa005217mg [Prunus persica]
gi|645252835|ref|XP_008232303.1|  1.6e-170  64.42         PREDICTED: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Prunus mume]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR022244  DUF3769
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0009507      chloroplast
cellular_component  GO:0044446      intracellular organelle part
cellular_component  GO:0009941      chloroplast envelope
cellular_component  GO:0005739      mitochondrion
cellular_component  GO:0005774      vacuolar membrane
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG08g02470.1  Cp4.1LG08g02470.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                      Source   Source Term    Source Description  Alignment
IPR022244  Protein of unknown function DUF3769  PFAM     PF12600        DUF3769             coord: 337..467; score: 3.
None       No IPR available                     PANTHER  PTHR34954      FAMILY NOT NAMED    coord: 2..467; score: 6.4E
None       No IPR available                     PANTHER  PTHR34954:SF2  EXPRESSED PROTEIN   coord: 2..467; score: 6.4E

The following gene(s) are paralogous to this gene:

None