CmoCh06G008370 (gene) Cucurbita moschata (Rifu) v1

Overview
NameCmoCh06G008370
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu) v1)
Descriptionlight-harvesting complex-like protein 3 isotype 1, chloroplastic
LocationCmo_Chr06: 4609847 .. 4616617 (+)
RNA-Seq ExpressionCmoCh06G008370
SyntenyCmoCh06G008370
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
AACAACTCCAAGGTTGAGGAGAGGCGCACTGTACTCTGTTCTGCGTCACAATGTCTTCTTCCATGGCTTTGTTTTCTCCTCCTTTCCATCTTTCCACTCTCTCTTCTTCATTATGCCACAGAAATTCCTTCTCCTCCATACCTTTTTCTTCTCTCAGAACCAGAAGAAGCCCTTCTTCTTCCTCTCTTTTCACCATCAGAGCCGCTGCGGAACCGGAGCCGGCTAAGTCCGTAACGGAAGAGCAGGAGAGCCTCCCCGGAACAAATGGCGCGATGGCGGCTGCTGAAGAAGTGGAGGTCGTTACCAAATTTGAAGACGCTAAGTGGATTAATGGAACTTGGGATCTGAATCAGTTCCGGAAACACGGAAGTATCGACTGGGATTCTGTTATTGATTCCGGTAATCTCAATCTTCTTTGGATCTGAGAAAATGAGAAATTAAGCAACTCTTCCTTAAGAACCTAGATTCTCTGTTGATTTTGATTTCAATTGAATAACTGAAATTAGGTTCAAATTGTGGTGTAATCTCTTGATTTTGGAATTGAGTTTTTCGATCAGAGGCTAGGAGGAGAAAATGGGTGGAAAAAAATCGAGAATCATCAAGTAATGAGGATCCCGTGGTGTTTGGGACATCCATAATTCCATGGTGGGCTTGGATTAAGCGCTACCATCTCCCTGAAGCTGAGACACTCAATGGTATAAACACACGTTTAATACATACGTTGTTCTTTGTTGATAGCGATTTGAACATGAATATGAATATGGTATCATGATGTGATGGGTTTTGGGCGCAGGGCGTGCAGCGATGGTGGGGTTCTTCATGGCTTACTTTGTTGATAGCTTGAGTGGGGTAGGGGTAGTGGGTCAAATGGGCAACTTCTTCTGCAAAACTCTGCTGTTTGTGGCTGTGGTGGGAGTTGTTTTGATCAGAGAGAATGAAGATATTGAGACTTTGAAGAAGTTGATTGATGAGACAAGCTTGTATGACAAGCAATGGCGAGCAACTTGGGAGGATTAAGCTGAAGAAGTTGATTATCTATATCTATAATTTCGCTTTAGATTCCTTTGTTTGTAGCTCGAAATCGAACACTTTTCTTATAGCCCAAGAAAACCCAAGTTTTTAGTCCAATTTCCATTTCCAAACAACAAAACAAACCAAATGGGGACACACAAAAGGGTTTTGCTTAGCTTTTTGGTAATGATTCACACTTGTTATTTCTAAGAAGTTGTTCACAAAACTTTCTATTTTGTATTTTTTTTTTTTTTTGTCGAAGTTCTTTATTATGGTAAATAAATAACTAGGATATGACGTGTATAATTCACTTAACAAGCATCCGGTACATGTCCATAAATAACTAGGAAGGTGTATTAAAGGATTTGATTTGCATTTGATAAATTGATGCGAGTTGAGGGTCTCATCGTCTTGTCCCGGTGGATTGAAACTCAGAAAGCCGAAAGTTTGCCTACAATCTTCTTTGACAAACACTAGCTACTGTTTCATATGCAAAAAGGTCATTCAAGCTTATTTATTTATTTGTTGTTTTTTATTTGTGCTTCGAAAGACATGATCCGGATGAAATTTGAGAATTTGAATTTGGATAATGTTTGAAACTTAGAGATGAAAATTATCTTCTAGATTCTCAACTTCAAAGGCTAAAATTGATGATGTAGTACATCAATTTTCAACTCATAATGTCTTGAAACGATTTATAAGTGCTTTTAAAGAAAAAGAAAATAAAATACCGAAAGAAAAGTATAATCAATGTTGTTTTTTCGACTAGAATCACTCATTTTGAAATTACATGTAATGATGTTAGAAAAGATGTATAATGTGTTTTTGAAATTATTATTATTATTATTATAAATCAGTAAATAAGAAGAAAAAAATAAAATAAAAACAAAAAAGAAGAAGACGACAAGCTACCATGGAGAAGGGATGGAGAACGGAGGAATGGACAGGAGGATTGAAGGGGGAAAAGAAAGGATGGAGGTGTGTGTGGGAGAAATTGTGTTGAAATATCCTATTACGCGTACACCCAAGGGAAAAGGAGTTCGAAACTAGGTAAGTAGCATATCGAGAATCTTAAACTGAGTTCTTAACTTGTTTTGCAACATATAATCATATTTTTTTACCTTTTAAAAGCTTAAAAATAACAATATGTGCTTCCATGTTATCAAATGCCTATTTGTATATTGACTGCTTGTCCGAGGTAAGTATTGGATCAGTATGACAATAATAATTTTAAAATTAAATAGTAAGAGTTAGAAAATATGTCATTGTAGTTTTATAATGGTTCGGTCAAACTCGACCTACTCCACTCCACAAGCGCCTCTTGAGAATTTAAATAAAAAACTTTCTCGACTCTTTTCACGGATTAGAGCCCAACCACTCCACCACTCTTTTTACGAGTCCAAGAGCAAACTCGATATTTTCCACGACGCAGGATCAAATTGATACAAGTATTTGAATTTAGAATTTAGAATACTCACAATTATTTTCTTACAATAAAATAAAAAAATCTCATAAATAAAAATAAAATTGGAAGTTTGAAGAAAGCAACGGGGAGGCTTTAGAAAATTGAAAGACGGAAGAATTATGGAAGTTGTGTTATTGAAAATAGGGAGAAGTTGATGAGTTTAAATAGAGAACGAACAGTGGAAAAAAAAAATAAAAATAAAAAATAATATTGACGGTTAAAAGAGGAATGGAAGGTGGAAATTAAAAAGTAAATAATAACAAAATTTTCTCAATTTATTTTTATTATATATTTTTAAATATAAATAATAATAATAATATAATATAAAATAATAAAAATAGATGTGAATAACCGTCAAATGACAAATTCAAAATTTATTATAAATATAATAATATAATAATAATAAAAATGACGTGAGTAACCACTCAAATTTTAAAATATAATAATATAATAATATAATATAGTAATAATAAAAATCGATGTAAATAACGGTTAGATTTAAATTTATATTATATATAAATAATTTTATTTAGAAATATAATAATAAGAAAAACTCCACACTTTGTCACTATTCCAAAATGATAAGTGTCATCGTATTATTTGATTTGTCCTCATTTTGATTTAGCTGCCACATCATCATGTCGAATATGTTATATCGATTGTAATTTCTTCGTTTTGGCTATGATTTGAGTGATTGAAAAGTCGTTGAAGTCCTTATTCCTAACATACTCTGGACCGTTTAAAATACTGAAAATAGTTAATAATTAAAAGTAGGTGTGTTATCATCCGAACATCAATCAATTAATTATTTAAAATTAATTAATATGAGATTAAGGGCCAAAAAACTAATAGTAAGCATGTTATAAATGATTACTAGAATGTTCTATATGTTGAACATTATAAATGTATGAAAGCATGCATTATAGATGTGAGTATGTTGTGCATGTTAATGGAAAGGTTATAACTCTAGGATATGCTTATAAATTGCTGAGTCATGAGTTATGTATATGAGTATTGTAGTCTACTAATTCCCACAGAGTTCTTGACCAACGCGCTGTGTGCTAAGTCGGGAAAAATGAATACTCCTGACTATTAGTATATGCTAAATCAGGAAGAATGAGTACTCCTAACTATTGGTATACAAGTCACCAAAAAAATTAGTAGACGGGCCGTATTTGACTAAGAGTGAATCTTACAAACGAGTCCGAAAACTTTAGATAAGCGTATAGAGAGCCAATAACCTAAGCTGGAGGTAGCTGAAATACCTCATTAATAAATAGACGACCCTATTAAGAGAAACTAGAAGTTAATGGCGAATAAAACTTAAAAACAGCCGGAAAGATAAGTGATAAAGACTAGGAAAGATAAGTGATGCATGCCAATTGAGGACGAGACCAGCAGGCTAACGGAATCCATGTCTGGGTTTGTTTGTGTATTTATGCTTTCGCTGTTTAAAATTTACATGGGTTGTTGTTTTAAGTTTCAATAGTACTGGTTTTAAATATTTTACTTAATGTATATGGAAATATTTTGAAATTTATGATTGATTGTCGGCTCCTTCCTGATTTGATTGTATTGGTTTAGTTGCAGCTTTTCCTATTAATTTGTCAAATCAGGGAATACAGGGTCAGGGCAGAGAGAGGCAGGTTCAGTAGTTAAGGTAAGGAGAAATCATCGGAAGCACAAAGGGTCTAGTAGGGGATAAGCAGCTAGATAAGTAAGCAATATCAGTTTGTTATTGATATCAGTGGTAACTCCTCTTAATTGCATTATCAATAAGCAGGTGTTATAGTACGCTCCAACACTAGCTTTGAATTTCGTCACTTCTTCCGTCCTTTACATCTTTTGTGGAGAGATATCTTGTCACAATCGTAATTTTCTTGGCAATTTATCGTGTGGCTGTACCTACGAACAGGCAAGACTCAAGTTTTTGACCCTATTTTGAAAATAAGGTTTTAAAATACAGTGGCAGAGAAATTTTCTAAAACGAGTTTGAAAGTACGAACGAAAGATGATTTGAAAGTAACGTTATATAACTATAAGCAAAATACAACAACACGACTTAAATAACATACAACTTTTTGAGTAGGAAGAATTTGACAACTAGTCACATCCCCCTAACTTTTTCATTCAATTCAGTCATTCAATCTTTCGGGAATGGGAAACAATATTTCTTCCTCTATCAAGATCAATTGGTTGGCTTCTTGATTTAGGTGTTGATTTTTTCGTCAACCAGTCTCAATGCATGAAGAAACTAACGAACGATTAAGAAAAGCAAATTGAAGTTGAAGAATGCAAGATAATATTACAATTTACCATCTTATGTACATCACCTTGTATGTATCACCCCAAAATATTGTATATTATCACTTTGTACACACTAAGTGATACTTACAAAGTGATAGGTGATACATACAAGGTGATATATATACAATTTTGTGACGAAGAGATCATTGATTATAAGAAAGCATCTACTAAATACACGATCATACGATCACAGTTTGTAGTCCTATTAAACACAAAATCCAATTTTACGACTAATAAAATAAGTAATGTTTAAGATTTTTTAGTACTTCAAACACCTATCGGACACTTTCAAAGCTCTAGAATACTTCATAATCCCATCTCCGTCAAGCTCAAACTCTCTACCTGTCAACTTGTCAACTGAAATGGTGAATTCTCTCTTATCTCTATTACTCTGCAACTCAAGGTTTGTGTCCTGACATGTGGTGTGGTGATGTTTGTGAATTTTAGGCAAAGGTCTCAAGAGAGAACAAAGTGTTGGTGAGTGTGTATACAGAGAGGCAGGAACCAAGAGTAGCAGTAAATCGATTCCAACGTGATGATGGTGTGAACAAAACAGTGAGGAATCAGTGGAAGAAAGAAATAGCTAATAAAGGATACAATAGGAGAACTGAACTTCTTAAGTACTCTCAACGCCTGCGAAAATCTGCACAATCACCTGCATCTCCATATCTTCCAACTCCGGAGCCGATTTCTGTGACGAACAAGCAACCGATCCAAAGAAACGTGGCGATTAATCCTGTATGTATGCTCTTCCTTATTTTTAGAACTCAAGAATTCCCGTTCAAGCAACCTAGTGGCATATTGTAGGAACAACGATCCTCCACAGTGACATGATACTGCCCACTTTGAGTATAAGCTCTCGGGGGTTTTACTTTTGGTTTTTTCAAAAGATCTCAAACCAATGAAGATAATATCCCTTACTTATATACCATGATATTTCCCTTTATTAGCTAATGCGAGACTCAAACAATCTTAACTAATCCCCCCACGAACAAAGTACACCATTGAGCCTCACCTAACGACTCCTTTTCTTTGGAGCCCTTGAACAAAGTACAACATTGAGTCTCCCCTTAAACAGTTATCTATCTATCCAACACACTATTCTAGACGACTTCTCTTCACTAGAGCACATTGTTAGTCCAATAAAACTCTACCATGAATCTTCACCATGACTGCACTTTCGAGACTCGCAACTTCTTTGTTTGACACCTGAGTCACTGTGACTACACCTACAAGATTCAGAGCTTGTTTGTTCAACATTTGAGGATTCTAGTGACATAACTAAGTTAGGGTATAGCTCTAATACCACTTATAGGAACATCTACCCTTCACAATGACATAATAGAGAATCTATACAAATAGAGAGAGTGTCCCTTAATTATATACCCATGATCTTCCCCTTAATTAGCCAATGCGAGACTCCAACAATCCCAACAATCCTCAACACACACAACAATAATCTAACCCACCTAGTGAGAAAGTATGATAATCAATTTAGTTTTCTTCAACAATAATGTTACCCAACTGTACACCAAATTCATCTGAACAACTCAAATTTGAATAGACATATTGCAGTAATATTTGTTCAAGTGGTCAAGTTTTGGATGTGTAAGTATCTCCATGGTCATTCGACCATAATTTTAAGTCAGAAACATGACAACTTGAAATAGTTACCAGTGTTGTAATACTGTAGATGAGGAGATGATATCAAACCCAATCATTTCTCCTGTATATAACTTGTTTTGGAAGGATATAGCCTGGTATCTATACCATGTTTGTGACGAAGAATTAAAGTTTTCCTTTTTTGATAGCTCGTCAATCTTCGTGCAAAGCCCAAGGGTGCCAGATTTACCTCTTGCTTCGGCAACCTGATTCAAAGGTCTCACAAGGCTTTGACGAGTTTTCAGGCAAAAAACAAGAGACAAAAGCAGAACCAAAGCAGCGGATCTACCAAGAATGCTAAGATAATAAAAGAATTAGAAGGATAA

mRNA sequence

AACAACTCCAAGGTTGAGGAGAGGCGCACTGTACTCTGTTCTGCGTCACAATGTCTTCTTCCATGGCTTTGTTTTCTCCTCCTTTCCATCTTTCCACTCTCTCTTCTTCATTATGCCACAGAAATTCCTTCTCCTCCATACCTTTTTCTTCTCTCAGAACCAGAAGAAGCCCTTCTTCTTCCTCTCTTTTCACCATCAGAGCCGCTGCGGAACCGGAGCCGGCTAAGTCCGTAACGGAAGAGCAGGAGAGCCTCCCCGGAACAAATGGCGCGATGGCGGCTGCTGAAGAAGTGGAGGTCGTTACCAAATTTGAAGACGCTAAGTGGATTAATGGAACTTGGGATCTGAATCAGTTCCGGAAACACGGAAGTATCGACTGGGATTCTGTTATTGATTCCGAGGCTAGGAGGAGAAAATGGGTGGAAAAAAATCGAGAATCATCAAGTAATGAGGATCCCGTGGTGTTTGGGACATCCATAATTCCATGGTGGGCTTGGATTAAGCGCTACCATCTCCCTGAAGCTGAGACACTCAATGGGCGTGCAGCGATGGTGGGGTTCTTCATGGCTTACTTTGTTGATAGCTTGAGTGGGGTAGGGGTAGTGGGTCAAATGGGCAACTTCTTCTGCAAAACTCTGCTGTTTGTGGCTGTGGTGGGAGTTGTTTTGATCAGAGAGAATGAAGATATTGAGACTTTGAAGAAGTTGATTGATGAGACAAGCTTCTCGAAATCGAACACTTTTCTTATAGCCCAAGAAAACCCAAAATCACTCATTTTGAAATTACATAATGTTCTATATGTTGAACATTATAAATGTATGAAAGCATGCATTATAGATGCAAAGGTCTCAAGAGAGAACAAAGTGTTGGTGAGTGTGTATACAGAGAGGCAGGAACCAAGAGTAGCAGTAAATCGATTCCAACGTGATGATGGTGTGAACAAAACAGTGAGGAATCAGTGGAAGAAAGAAATAGCTAATAAAGGATACAATAGGAGAACTGAACTTCTTAAGTACTCTCAACGCCTGCGAAAATCTGCACAATCACCTGCATCTCCATATCTTCCAACTCCGGAGCCGATTTCTGTGACGAACAAGCAACCGATCCAAAGAAACGTGGCGATTAATCCTCTCGTCAATCTTCGTGCAAAGCCCAAGGGTGCCAGATTTACCTCTTGCTTCGGCAACCTGATTCAAAGGTCTCACAAGGCTTTGACGAGTTTTCAGGCAAAAAACAAGAGACAAAAGCAGAACCAAAGCAGCGGATCTACCAAGAATGCTAAGATAATAAAAGAATTAGAAGGATAA

Coding sequence (CDS)

ATGTCTTCTTCCATGGCTTTGTTTTCTCCTCCTTTCCATCTTTCCACTCTCTCTTCTTCATTATGCCACAGAAATTCCTTCTCCTCCATACCTTTTTCTTCTCTCAGAACCAGAAGAAGCCCTTCTTCTTCCTCTCTTTTCACCATCAGAGCCGCTGCGGAACCGGAGCCGGCTAAGTCCGTAACGGAAGAGCAGGAGAGCCTCCCCGGAACAAATGGCGCGATGGCGGCTGCTGAAGAAGTGGAGGTCGTTACCAAATTTGAAGACGCTAAGTGGATTAATGGAACTTGGGATCTGAATCAGTTCCGGAAACACGGAAGTATCGACTGGGATTCTGTTATTGATTCCGAGGCTAGGAGGAGAAAATGGGTGGAAAAAAATCGAGAATCATCAAGTAATGAGGATCCCGTGGTGTTTGGGACATCCATAATTCCATGGTGGGCTTGGATTAAGCGCTACCATCTCCCTGAAGCTGAGACACTCAATGGGCGTGCAGCGATGGTGGGGTTCTTCATGGCTTACTTTGTTGATAGCTTGAGTGGGGTAGGGGTAGTGGGTCAAATGGGCAACTTCTTCTGCAAAACTCTGCTGTTTGTGGCTGTGGTGGGAGTTGTTTTGATCAGAGAGAATGAAGATATTGAGACTTTGAAGAAGTTGATTGATGAGACAAGCTTCTCGAAATCGAACACTTTTCTTATAGCCCAAGAAAACCCAAAATCACTCATTTTGAAATTACATAATGTTCTATATGTTGAACATTATAAATGTATGAAAGCATGCATTATAGATGCAAAGGTCTCAAGAGAGAACAAAGTGTTGGTGAGTGTGTATACAGAGAGGCAGGAACCAAGAGTAGCAGTAAATCGATTCCAACGTGATGATGGTGTGAACAAAACAGTGAGGAATCAGTGGAAGAAAGAAATAGCTAATAAAGGATACAATAGGAGAACTGAACTTCTTAAGTACTCTCAACGCCTGCGAAAATCTGCACAATCACCTGCATCTCCATATCTTCCAACTCCGGAGCCGATTTCTGTGACGAACAAGCAACCGATCCAAAGAAACGTGGCGATTAATCCTCTCGTCAATCTTCGTGCAAAGCCCAAGGGTGCCAGATTTACCTCTTGCTTCGGCAACCTGATTCAAAGGTCTCACAAGGCTTTGACGAGTTTTCAGGCAAAAAACAAGAGACAAAAGCAGAACCAAAGCAGCGGATCTACCAAGAATGCTAAGATAATAAAAGAATTAGAAGGATAA

Protein sequence

MSSSMALFSPPFHLSTLSSSLCHRNSFSSIPFSSLRTRRSPSSSSLFTIRAAAEPEPAKSVTEEQESLPGTNGAMAAAEEVEVVTKFEDAKWINGTWDLNQFRKHGSIDWDSVIDSEARRRKWVEKNRESSSNEDPVVFGTSIIPWWAWIKRYHLPEAETLNGRAAMVGFFMAYFVDSLSGVGVVGQMGNFFCKTLLFVAVVGVVLIRENEDIETLKKLIDETSFSKSNTFLIAQENPKSLILKLHNVLYVEHYKCMKACIIDAKVSRENKVLVSVYTERQEPRVAVNRFQRDDGVNKTVRNQWKKEIANKGYNRRTELLKYSQRLRKSAQSPASPYLPTPEPISVTNKQPIQRNVAINPLVNLRAKPKGARFTSCFGNLIQRSHKALTSFQAKNKRQKQNQSSGSTKNAKIIKELEG
Homology
BLAST of CmoCh06G008370 vs. ExPASy Swiss-Prot
Match: Q9SYX1 (Light-harvesting complex-like protein 3 isotype 1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=LIL3.1 PE=1 SV=1)

HSP 1 Score: 245.7 bits (626), Expect = 9.0e-64
Identity = 136/239 (56.90%), Postives = 167/239 (69.87%), Query Frame = 0

Query: 5   MALFSPPFHLSTLSS-SLCHRNSF---SSIPFSSLRTRRSPSSSSLFTIRAAAE---PEP 64
           MALFSPP   S+L + +   + SF   SS  FS L   R+ S S   +  AA     PEP
Sbjct: 1   MALFSPPISSSSLQNPNFIPKFSFSLLSSNRFSLLSVTRASSDSGSTSPTAAVSVEAPEP 60

Query: 65  AKSVTEE----------QESLPGTNGAMAAAE--EVEVVTKFEDAKWINGTWDLNQFRKH 124
            + + +E          +E+    N A+   E    E V KF+DA+WINGTWDL QF K 
Sbjct: 61  VEVIVKEPPQSTPAVKKEETATAKNVAVEGEEMKTTESVVKFQDARWINGTWDLKQFEKD 120

Query: 125 GSIDWDSVIDSEARRRKWVEKNRESSSNEDPVVFGTSIIPWWAWIKRYHLPEAETLNGRA 184
           G  DWDSVI +EA+RRKW+E+N E++SN++PV+F TSIIPWWAWIKRYHLPEAE LNGRA
Sbjct: 121 GKTDWDSVIVAEAKRRKWLEENPETTSNDEPVLFDTSIIPWWAWIKRYHLPEAELLNGRA 180

Query: 185 AMVGFFMAYFVDSLSGVGVVGQMGNFFCKTLLFVAVVGVVLIRENEDIETLKKLIDETS 225
           AM+GFFMAYFVDSL+GVG+V QMGNFFCKTLLFVAV GV+ IR+NED++ LK L DET+
Sbjct: 181 AMIGFFMAYFVDSLTGVGLVDQMGNFFCKTLLFVAVAGVLFIRKNEDVDKLKNLFDETT 239

BLAST of CmoCh06G008370 vs. ExPASy Swiss-Prot
Match: Q6NKS4 (Light-harvesting complex-like protein 3 isotype 2, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=LIL3.2 PE=1 SV=1)

HSP 1 Score: 245.7 bits (626), Expect = 9.0e-64
Identity = 138/241 (57.26%), Postives = 164/241 (68.05%), Query Frame = 0

Query: 1   MSSSMALFSPPFHLSTLSSSLCHRNSFSSIPFSSLRTRR-------SPSSSSLFTIRAAA 60
           MS SMALFSPP     +SSSL + N    I  S L T+R         SS +  T     
Sbjct: 1   MSISMALFSPP-----ISSSLQNPNLIPKISTSLLSTKRFSLISVPRASSDNGTTSPVVE 60

Query: 61  EPEPAKSVTEE-------QESLPGTNGAM---AAAEEVEVVTKFEDAKWINGTWDLNQFR 120
            P+PA    EE       + S    NGA+   A     E V K+++AKW+NGTWDL QF 
Sbjct: 61  IPKPASVAVEEVPVKSPAESSSASENGAVGGEATDSSTETVIKYQNAKWVNGTWDLKQFE 120

Query: 121 KHGSIDWDSVIDSEARRRKWVEKNRESSSNEDPVVFGTSIIPWWAWIKRYHLPEAETLNG 180
           K G  DWDSVI SEA+RRKW+E N E++SN++ VVF TSIIPWWAW+KRYHLPEAE LNG
Sbjct: 121 KDGKTDWDSVIVSEAKRRKWLEDNPETTSNDELVVFDTSIIPWWAWMKRYHLPEAELLNG 180

Query: 181 RAAMVGFFMAYFVDSLSGVGVVGQMGNFFCKTLLFVAVVGVVLIRENEDIETLKKLIDET 225
           RAAM+GFFMAYFVDSL+GVG+V QMGNFFCKTLLFVAV GV+ IR+NED++ LK L DET
Sbjct: 181 RAAMIGFFMAYFVDSLTGVGLVDQMGNFFCKTLLFVAVAGVLFIRKNEDLDKLKDLFDET 236

BLAST of CmoCh06G008370 vs. ExPASy TrEMBL
Match: A0A5A7TG76 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1017G00700 PE=4 SV=1)

HSP 1 Score: 458.8 bits (1179), Expect = 2.5e-125
Identity = 255/376 (67.82%), Postives = 289/376 (76.86%), Query Frame = 0

Query: 3   SSMALFSPPFHLSTLSSSLCHRNSFSSIPFSSLRTRR--SPSSSSLFTIRAAAE------ 62
           SSMALFSP  HLSTLS S  H   FS  PFSSLRTR   S SSSSLFTIRA A+      
Sbjct: 2   SSMALFSPSSHLSTLSPSSHHTTHFSFRPFSSLRTRNPSSSSSSSLFTIRATADNGAGIS 61

Query: 63  ----------PEPAKSVTEEQESLPGTNGAMAAAEE-VEVVTKFEDAKWINGTWDLNQFR 122
                     P   K     +ESL GTNG++AAAEE VEVV+KFED KW+NGTWDLNQF+
Sbjct: 62  GGSATVAVETPVEQKDPEPAEESLAGTNGSVAAAEEVVEVVSKFEDPKWVNGTWDLNQFQ 121

Query: 123 KHGSIDWDSVIDSEARRRKWVEKNRESSSNEDPVVFGTSIIPWWAWIKRYHLPEAETLNG 182
           K+GS DWD+VID+EARRRKW+E N ESSSNEDPVVF TSI+PWWAWIKRYHLPEAE LNG
Sbjct: 122 KNGSTDWDAVIDAEARRRKWLENNPESSSNEDPVVFDTSIVPWWAWIKRYHLPEAELLNG 181

Query: 183 RAAMVGFFMAYFVDSLSGVGVVGQMGNFFCKTLLFVAVVGVVLIRENEDIETLKKLIDET 242
           RAAMVGFFMAYFVDSL+GVG+VGQMGNFFCKTLLFVAVVGV+LIR+NEDIETLKKLIDET
Sbjct: 182 RAAMVGFFMAYFVDSLTGVGLVGQMGNFFCKTLLFVAVVGVLLIRKNEDIETLKKLIDET 241

Query: 243 SFSKSNTFLIAQENPKSLILKLHNVLYVEHYKCMKACIIDAKVSRENKVLVSVYTERQEP 302
           +F         Q+   S   K+          C     I+A VS+ENKVLV+VY+E+QEP
Sbjct: 242 TFYDKQWQATWQDETSSGSGKI----------CQGNSKIEATVSKENKVLVTVYSEKQEP 301

Query: 303 RVAVNRFQRDDGVNKTVRNQWKKEIANKGYNRRTELLKYSQRLRKSAQSPASPYLPTPEP 360
           R+ VNR+QR + VNK VRNQWK++ ANKGYNRRTELLKYSQRLRKSA+SPASPY+ TPEP
Sbjct: 302 RLPVNRYQRAN-VNKVVRNQWKQDTANKGYNRRTELLKYSQRLRKSARSPASPYIRTPEP 361

BLAST of CmoCh06G008370 vs. ExPASy TrEMBL
Match: A0A6J1F3X0 (light-harvesting complex-like protein 3 isotype 2, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111441881 PE=4 SV=1)

HSP 1 Score: 439.9 bits (1130), Expect = 1.2e-119
Identity = 224/224 (100.00%), Postives = 224/224 (100.00%), Query Frame = 0

Query: 1   MSSSMALFSPPFHLSTLSSSLCHRNSFSSIPFSSLRTRRSPSSSSLFTIRAAAEPEPAKS 60
           MSSSMALFSPPFHLSTLSSSLCHRNSFSSIPFSSLRTRRSPSSSSLFTIRAAAEPEPAKS
Sbjct: 1   MSSSMALFSPPFHLSTLSSSLCHRNSFSSIPFSSLRTRRSPSSSSLFTIRAAAEPEPAKS 60

Query: 61  VTEEQESLPGTNGAMAAAEEVEVVTKFEDAKWINGTWDLNQFRKHGSIDWDSVIDSEARR 120
           VTEEQESLPGTNGAMAAAEEVEVVTKFEDAKWINGTWDLNQFRKHGSIDWDSVIDSEARR
Sbjct: 61  VTEEQESLPGTNGAMAAAEEVEVVTKFEDAKWINGTWDLNQFRKHGSIDWDSVIDSEARR 120

Query: 121 RKWVEKNRESSSNEDPVVFGTSIIPWWAWIKRYHLPEAETLNGRAAMVGFFMAYFVDSLS 180
           RKWVEKNRESSSNEDPVVFGTSIIPWWAWIKRYHLPEAETLNGRAAMVGFFMAYFVDSLS
Sbjct: 121 RKWVEKNRESSSNEDPVVFGTSIIPWWAWIKRYHLPEAETLNGRAAMVGFFMAYFVDSLS 180

Query: 181 GVGVVGQMGNFFCKTLLFVAVVGVVLIRENEDIETLKKLIDETS 225
           GVGVVGQMGNFFCKTLLFVAVVGVVLIRENEDIETLKKLIDETS
Sbjct: 181 GVGVVGQMGNFFCKTLLFVAVVGVVLIRENEDIETLKKLIDETS 224

BLAST of CmoCh06G008370 vs. ExPASy TrEMBL
Match: A0A6J1KTD1 (light-harvesting complex-like protein 3 isotype 2, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111498485 PE=4 SV=1)

HSP 1 Score: 417.9 bits (1073), Expect = 4.9e-113
Identity = 214/224 (95.54%), Postives = 218/224 (97.32%), Query Frame = 0

Query: 1   MSSSMALFSPPFHLSTLSSSLCHRNSFSSIPFSSLRTRRSPSSSSLFTIRAAAEPEPAKS 60
           MSSSMALFSPP HLSTLSSSL HRNSFSSIPFSSLRTRR+PSSSSLFTIRAAAEPEPAKS
Sbjct: 1   MSSSMALFSPPSHLSTLSSSLRHRNSFSSIPFSSLRTRRNPSSSSLFTIRAAAEPEPAKS 60

Query: 61  VTEEQESLPGTNGAMAAAEEVEVVTKFEDAKWINGTWDLNQFRKHGSIDWDSVIDSEARR 120
           +TE+QESLPG NGAMAAAEEVEVVTKFEDAKWINGTWDLNQFRKHGSIDWDSVIDSEARR
Sbjct: 61  LTEDQESLPGANGAMAAAEEVEVVTKFEDAKWINGTWDLNQFRKHGSIDWDSVIDSEARR 120

Query: 121 RKWVEKNRESSSNEDPVVFGTSIIPWWAWIKRYHLPEAETLNGRAAMVGFFMAYFVDSLS 180
           RKWVEKN ESSSNEDPVVFGTSIIPWWAWIKRYHLPEAETLNGRAAMVGFFMAYFVDSLS
Sbjct: 121 RKWVEKNPESSSNEDPVVFGTSIIPWWAWIKRYHLPEAETLNGRAAMVGFFMAYFVDSLS 180

Query: 181 GVGVVGQMGNFFCKTLLFVAVVGVVLIRENEDIETLKKLIDETS 225
           GVGVVGQ GNF CKTLLFVAVVGVVLIR+NEDIETLKKLIDETS
Sbjct: 181 GVGVVGQTGNFLCKTLLFVAVVGVVLIRKNEDIETLKKLIDETS 224

BLAST of CmoCh06G008370 vs. ExPASy TrEMBL
Match: A0A0A0L5D8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G017160 PE=4 SV=1)

HSP 1 Score: 336.3 bits (861), Expect = 1.9e-88
Identity = 183/247 (74.09%), Postives = 199/247 (80.57%), Query Frame = 0

Query: 3   SSMALFSPPFHLSTLSSSLCHRNSFSSIPFSSLRTRR-SPSSSSLFTIRAAA-------- 62
           SSMALFSP  HLST S S  H   FS  PFSSLRTR  S SSSSLFTIRA A        
Sbjct: 2   SSMALFSPSSHLSTFSPS-HHTTHFSFRPFSSLRTRNPSSSSSSLFTIRATADNGAGISG 61

Query: 63  --------------EPEPAKSVTEEQESLPGTNGAMAAAEE-VEVVTKFEDAKWINGTWD 122
                         +PEPAK   EEQESL GTNG++AAAEE VEVV+KFED KW+NGTWD
Sbjct: 62  GSATVSVETPVEQKDPEPAKLAPEEQESLAGTNGSVAAAEEVVEVVSKFEDPKWVNGTWD 121

Query: 123 LNQFRKHGSIDWDSVIDSEARRRKWVEKNRESSSNEDPVVFGTSIIPWWAWIKRYHLPEA 182
           LNQF+K+GS DWD+VID+EARRRKW+E N ESSSNEDPVVF TSI+PWWAWIKRYHLPEA
Sbjct: 122 LNQFQKNGSTDWDAVIDAEARRRKWLENNPESSSNEDPVVFDTSIVPWWAWIKRYHLPEA 181

Query: 183 ETLNGRAAMVGFFMAYFVDSLSGVGVVGQMGNFFCKTLLFVAVVGVVLIRENEDIETLKK 226
           E LNGRAAMVGFFMAYFVDSL+GVG+VGQMGNFFCKTLLFVAVVGV+LIR+NEDIETLKK
Sbjct: 182 ELLNGRAAMVGFFMAYFVDSLTGVGLVGQMGNFFCKTLLFVAVVGVLLIRKNEDIETLKK 241

BLAST of CmoCh06G008370 vs. ExPASy TrEMBL
Match: A0A1S3BLD8 (uncharacterized protein LOC103490842 OS=Cucumis melo OX=3656 GN=LOC103490842 PE=4 SV=1)

HSP 1 Score: 330.1 bits (845), Expect = 1.3e-86
Identity = 178/242 (73.55%), Postives = 195/242 (80.58%), Query Frame = 0

Query: 3   SSMALFSPPFHLSTLSSSLCHRNSFSSIPFSSLRTRR--SPSSSSLFTIRAAAE------ 62
           SSMALFSP  HLSTLS S  H   FS  PFSSLRTR   S SSSSLFTIRA A+      
Sbjct: 2   SSMALFSPSSHLSTLSPSSHHTTHFSFRPFSSLRTRNPSSSSSSSLFTIRATADNGAGIS 61

Query: 63  ----------PEPAKSVTEEQESLPGTNGAMAAAEE-VEVVTKFEDAKWINGTWDLNQFR 122
                     P   K     +ESL GTNG++AAAEE VEVV+KFED KW+NGTWDLNQF+
Sbjct: 62  GGSATVAVETPVEQKDPEPAEESLAGTNGSVAAAEEVVEVVSKFEDPKWVNGTWDLNQFQ 121

Query: 123 KHGSIDWDSVIDSEARRRKWVEKNRESSSNEDPVVFGTSIIPWWAWIKRYHLPEAETLNG 182
           K+GS DWD+VID+EARRRKW+E N ESSSNEDPVVF TSI+PWWAWIKRYHLPEAE LNG
Sbjct: 122 KNGSTDWDAVIDAEARRRKWLENNPESSSNEDPVVFDTSIVPWWAWIKRYHLPEAELLNG 181

Query: 183 RAAMVGFFMAYFVDSLSGVGVVGQMGNFFCKTLLFVAVVGVVLIRENEDIETLKKLIDET 226
           RAAMVGFFMAYFVDSL+GVG+VGQMGNFFCKTLLFVAVVGV+LIR+NEDIETLKKLIDET
Sbjct: 182 RAAMVGFFMAYFVDSLTGVGLVGQMGNFFCKTLLFVAVVGVLLIRKNEDIETLKKLIDET 241

BLAST of CmoCh06G008370 vs. TAIR 10
Match: AT4G17600.1 (Chlorophyll A-B binding family protein )

HSP 1 Score: 245.7 bits (626), Expect = 6.4e-65
Identity = 136/239 (56.90%), Postives = 167/239 (69.87%), Query Frame = 0

Query: 5   MALFSPPFHLSTLSS-SLCHRNSF---SSIPFSSLRTRRSPSSSSLFTIRAAAE---PEP 64
           MALFSPP   S+L + +   + SF   SS  FS L   R+ S S   +  AA     PEP
Sbjct: 1   MALFSPPISSSSLQNPNFIPKFSFSLLSSNRFSLLSVTRASSDSGSTSPTAAVSVEAPEP 60

Query: 65  AKSVTEE----------QESLPGTNGAMAAAE--EVEVVTKFEDAKWINGTWDLNQFRKH 124
            + + +E          +E+    N A+   E    E V KF+DA+WINGTWDL QF K 
Sbjct: 61  VEVIVKEPPQSTPAVKKEETATAKNVAVEGEEMKTTESVVKFQDARWINGTWDLKQFEKD 120

Query: 125 GSIDWDSVIDSEARRRKWVEKNRESSSNEDPVVFGTSIIPWWAWIKRYHLPEAETLNGRA 184
           G  DWDSVI +EA+RRKW+E+N E++SN++PV+F TSIIPWWAWIKRYHLPEAE LNGRA
Sbjct: 121 GKTDWDSVIVAEAKRRKWLEENPETTSNDEPVLFDTSIIPWWAWIKRYHLPEAELLNGRA 180

Query: 185 AMVGFFMAYFVDSLSGVGVVGQMGNFFCKTLLFVAVVGVVLIRENEDIETLKKLIDETS 225
           AM+GFFMAYFVDSL+GVG+V QMGNFFCKTLLFVAV GV+ IR+NED++ LK L DET+
Sbjct: 181 AMIGFFMAYFVDSLTGVGLVDQMGNFFCKTLLFVAVAGVLFIRKNEDVDKLKNLFDETT 239

BLAST of CmoCh06G008370 vs. TAIR 10
Match: AT5G47110.1 (Chlorophyll A-B binding family protein )

HSP 1 Score: 245.7 bits (626), Expect = 6.4e-65
Identity = 138/241 (57.26%), Postives = 164/241 (68.05%), Query Frame = 0

Query: 1   MSSSMALFSPPFHLSTLSSSLCHRNSFSSIPFSSLRTRR-------SPSSSSLFTIRAAA 60
           MS SMALFSPP     +SSSL + N    I  S L T+R         SS +  T     
Sbjct: 1   MSISMALFSPP-----ISSSLQNPNLIPKISTSLLSTKRFSLISVPRASSDNGTTSPVVE 60

Query: 61  EPEPAKSVTEE-------QESLPGTNGAM---AAAEEVEVVTKFEDAKWINGTWDLNQFR 120
            P+PA    EE       + S    NGA+   A     E V K+++AKW+NGTWDL QF 
Sbjct: 61  IPKPASVAVEEVPVKSPAESSSASENGAVGGEATDSSTETVIKYQNAKWVNGTWDLKQFE 120

Query: 121 KHGSIDWDSVIDSEARRRKWVEKNRESSSNEDPVVFGTSIIPWWAWIKRYHLPEAETLNG 180
           K G  DWDSVI SEA+RRKW+E N E++SN++ VVF TSIIPWWAW+KRYHLPEAE LNG
Sbjct: 121 KDGKTDWDSVIVSEAKRRKWLEDNPETTSNDELVVFDTSIIPWWAWMKRYHLPEAELLNG 180

Query: 181 RAAMVGFFMAYFVDSLSGVGVVGQMGNFFCKTLLFVAVVGVVLIRENEDIETLKKLIDET 225
           RAAM+GFFMAYFVDSL+GVG+V QMGNFFCKTLLFVAV GV+ IR+NED++ LK L DET
Sbjct: 181 RAAMIGFFMAYFVDSLTGVGLVDQMGNFFCKTLLFVAVAGVLFIRKNEDLDKLKDLFDET 236

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9SYX19.0e-6456.90Light-harvesting complex-like protein 3 isotype 1, chloroplastic OS=Arabidopsis ... [more]
Q6NKS49.0e-6457.26Light-harvesting complex-like protein 3 isotype 2, chloroplastic OS=Arabidopsis ... [more]
Match NameE-valueIdentityDescription
A0A5A7TG762.5e-12567.82Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A6J1F3X01.2e-119100.00light-harvesting complex-like protein 3 isotype 2, chloroplastic OS=Cucurbita mo... [more]
A0A6J1KTD14.9e-11395.54light-harvesting complex-like protein 3 isotype 2, chloroplastic OS=Cucurbita ma... [more]
A0A0A0L5D81.9e-8874.09Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G017160 PE=4 SV=1[more]
A0A1S3BLD81.3e-8673.55uncharacterized protein LOC103490842 OS=Cucumis melo OX=3656 GN=LOC103490842 PE=... [more]
Match NameE-valueIdentityDescription
AT4G17600.16.4e-6556.90Chlorophyll A-B binding family protein [more]
AT5G47110.16.4e-6557.26Chlorophyll A-B binding family protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 391..410
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 391..418
NoneNo IPR availablePANTHERPTHR14154UPF0041 BRAIN PROTEIN 44-RELATEDcoord: 77..224
NoneNo IPR availablePANTHERPTHR14154:SF91BNAA06G35590D PROTEINcoord: 77..224
NoneNo IPR availableSUPERFAMILY103511Chlorophyll a-b binding proteincoord: 128..193

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh06G008370.1CmoCh06G008370.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0009535 chloroplast thylakoid membrane
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0046872 metal ion binding