Cla97C07G144190 (gene) Watermelon (97103) v2

NameCla97C07G144190
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionChlorophyll A-B binding protein
LocationCla97Chr07 : 31653037 .. 31654977 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTTCAATTGCTATCTCTGCCTCATTCCCAAGAGCATCCACTTCAAATCATCATGTTTCCATGAAGAAACAACATGCTCATCAAGCTAGACCTGCCTATTCCTCAACGACGAAAAACCCGACACCAAAGGTAATTAGTACTCTCGACGTCGGGGACTGTGATGGTTTCAATGCTACAGAACATCATGGAAATGTTGCATCAGCTAGGGACAAGCTAGAGGAAGATACTGATGTTTGGAACCATGGTAAAAGGTTCAAAGATAAGAGATGGAGAAATGGAACTTGGGATCTCAATATGTTTGTAAAGAATGGTAAGATGGATTGGGAAGGTGTCATCGTTGCAGGTGAGTGCATCATTTTCTCAAAGGGGATAATTTTAAGTTGAAAATTAGTTTTCGTCCTAAACTTTCACAATTTAGGGGCTATTTAGAATACGTGATTGGTAATTATAGTTGGGGTTATAGTATGTAGGTTATAATAGTTTGTATATTGGGAGCGGACTATTTTAGTCCGAGTTACAGATTATTACAAAATAGTAAACACAGTAAACAAATGAGAGTTTTAACATAATTTTACAGTAGCTAAGGGTGGGTTATAATCATTGGGAACTTCAACTATTATAACGAGAGACTCAAATATTGGGTAGACTATAATAACCAATTCCACTAAAGGTAAGTTTGGGGACCAAAATGTGCAAGTCTTTAGTAATTTAAAATTTATAATGACTTAGTTCGTATCGTGAAAATAATGGCTAAAATTTAAAAAGATATCTTATCGGTAAAGTGATTAGGTACTAAATGTGTTTAATTTAAAATATGTATAATGCCCTTTGACTCCAATATCCAAACATACAAATTTCCCCAAGTGTAAAGATTTAGGATAGTAAAATCTGTTCCTCTGCTGATTGAATGAAGTGCAATAATGGTATATTAATAACAAATTAAGTTATTCAATTTCATTATTGAAATTAGAAGTTTGGATTAACAATGTTAATTGTACAATAATGTAGAAGCAAAGAGGAGGAAGTTTCTTGAATTATATCCAGAACCAGCTACAGATCAAGAACCAGTGCTCTTTAGAAGCTCCATTATACCTTGGTGGACATGGCTCACCACCTCCTACCTCCCACAAGCCGAACTACTTAACGGTAGTCTATCGTGGTTTATCGCATATAGATAATGAAATTTTGCTATATATGTAAATATTTTGGTTCATTTTTCTATATTTGAAAATCGCTCTTATATATAAATGGTAAATAATTCTAAGGGAAATTGTTTTAAATGACAAAATTGTTGGAAATATTTTCAAATATAGCAAAATGTCACTGTTGATAGACATGATCTAGATAAGGATGGATGATAGTGTAAATAAGTTGACTCATTTTGCTATATTCGAAGACAATCCTAATTCTATTTTTTTTAATTTTATTTATTACAATTCAAATATTAAGAGGGGTTAAAGTTTTTTTTTTTCCAATAATCACGGGATATGGGAGTGAACCACGTTAAAAATTTTAAGGGAGTAATGAGTGTCTTATCCACTTAGCTATATATACTCGAATTGGGTATGGGCCATAAGAGGTAACGGTTTTGAACTCAAATGAAAAATATTAATATATGTTAAAGTAGGTAGTTGAGTTAGGTTAAATTGTGTTTGTTAATGATGGTTATATAATTAGTAGTAGTGTTTGTTTGTTGATGCAGGTAGGGCAGCCATGATAGGTTTTTTCATGGCCTATTTAGTGGATGCATTAACAGGAATTGGAATAGTTGGGCAAAGTGGGAATTTCATATGCAAAGCAGCTCTTTTTCTAACAGTCATTGGTGTGTTGCTGTTTAGGCAAACTCAAGACATTGAGGGCTTAAGAAACATAGCTGAAGAAGCTACCTTTTATGACAAGCAATGGCAAGCTTCATGGCAAAACCAAAATCATAAATAG

mRNA sequence

ATGGCTTCAATTGCTATCTCTGCCTCATTCCCAAGAGCATCCACTTCAAATCATCATGTTTCCATGAAGAAACAACATGCTCATCAAGCTAGACCTGCCTATTCCTCAACGACGAAAAACCCGACACCAAAGGTAATTAGTACTCTCGACGTCGGGGACTGTGATGGTTTCAATGCTACAGAACATCATGGAAATGTTGCATCAGCTAGGGACAAGCTAGAGGAAGATACTGATGTTTGGAACCATGGTAAAAGGTTCAAAGATAAGAGATGGAGAAATGGAACTTGGGATCTCAATATGTTTGTAAAGAATGGTAAGATGGATTGGGAAGGTGTCATCGTTGCAGAAGCAAAGAGGAGGAAGTTTCTTGAATTATATCCAGAACCAGCTACAGATCAAGAACCAGTGCTCTTTAGAAGCTCCATTATACCTTGGTGGACATGGCTCACCACCTCCTACCTCCCACAAGCCGAACTACTTAACGGTAGGGCAGCCATGATAGGTTTTTTCATGGCCTATTTAGTGGATGCATTAACAGGAATTGGAATAGTTGGGCAAAGTGGGAATTTCATATGCAAAGCAGCTCTTTTTCTAACAGTCATTGGTGTGTTGCTGTTTAGGCAAACTCAAGACATTGAGGGCTTAAGAAACATAGCTGAAGAAGCTACCTTTTATGACAAGCAATGGCAAGCTTCATGGCAAAACCAAAATCATAAATAG

Coding sequence (CDS)

ATGGCTTCAATTGCTATCTCTGCCTCATTCCCAAGAGCATCCACTTCAAATCATCATGTTTCCATGAAGAAACAACATGCTCATCAAGCTAGACCTGCCTATTCCTCAACGACGAAAAACCCGACACCAAAGGTAATTAGTACTCTCGACGTCGGGGACTGTGATGGTTTCAATGCTACAGAACATCATGGAAATGTTGCATCAGCTAGGGACAAGCTAGAGGAAGATACTGATGTTTGGAACCATGGTAAAAGGTTCAAAGATAAGAGATGGAGAAATGGAACTTGGGATCTCAATATGTTTGTAAAGAATGGTAAGATGGATTGGGAAGGTGTCATCGTTGCAGAAGCAAAGAGGAGGAAGTTTCTTGAATTATATCCAGAACCAGCTACAGATCAAGAACCAGTGCTCTTTAGAAGCTCCATTATACCTTGGTGGACATGGCTCACCACCTCCTACCTCCCACAAGCCGAACTACTTAACGGTAGGGCAGCCATGATAGGTTTTTTCATGGCCTATTTAGTGGATGCATTAACAGGAATTGGAATAGTTGGGCAAAGTGGGAATTTCATATGCAAAGCAGCTCTTTTTCTAACAGTCATTGGTGTGTTGCTGTTTAGGCAAACTCAAGACATTGAGGGCTTAAGAAACATAGCTGAAGAAGCTACCTTTTATGACAAGCAATGGCAAGCTTCATGGCAAAACCAAAATCATAAATAG

Protein sequence

MASIAISASFPRASTSNHHVSMKKQHAHQARPAYSSTTKNPTPKVISTLDVGDCDGFNATEHHGNVASARDKLEEDTDVWNHGKRFKDKRWRNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEPATDQEPVLFRSSIIPWWTWLTTSYLPQAELLNGRAAMIGFFMAYLVDALTGIGIVGQSGNFICKAALFLTVIGVLLFRQTQDIEGLRNIAEEATFYDKQWQASWQNQNHK
BLAST of Cla97C07G144190 vs. NCBI nr
Match: XP_023551912.1 (light-harvesting complex-like protein 3 isotype 1, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 399.8 bits (1026), Expect = 6.4e-108
Identity = 200/237 (84.39%), Postives = 213/237 (89.87%), Query Frame = 0

Query: 1   MASIAISASFPRASTSNHHVSMKKQHAHQARPAYSSTTKNPTPKVISTLDVGDCDGFNAT 60
           MASIAISASFPR+ TSN HVS  +QHA QARP +SS TKNPTP++I T DVGD D F+A 
Sbjct: 1   MASIAISASFPRSCTSN-HVSKNQQHARQARPTHSSMTKNPTPELIRTPDVGDRDAFDAP 60

Query: 61  EHHGNVASARDKLEEDTDVWNHGKRFKDKRWRNGTWDLNMFVKNGKMDWEGVIVAEAKRR 120
           + HGNVASA DKLEEDTD  NHGK F D+RW+NGTWDLNMFVKNGKMDWEGVIVAEAKRR
Sbjct: 61  KRHGNVASATDKLEEDTDDLNHGKTFTDERWKNGTWDLNMFVKNGKMDWEGVIVAEAKRR 120

Query: 121 KFLELYPEPATDQEPVLFRSSIIPWWTWLTTSYLPQAELLNGRAAMIGFFMAYLVDALTG 180
           KFLELYPE AT+ EPVLFRSSIIPWW WLT SYLPQAELLNGRAAMIGFFMAYLVDALTG
Sbjct: 121 KFLELYPETATNHEPVLFRSSIIPWWAWLTRSYLPQAELLNGRAAMIGFFMAYLVDALTG 180

Query: 181 IGIVGQSGNFICKAALFLTVIGVLLFRQTQDIEGLRNIAEEATFYDKQWQASWQNQN 238
           IGIVGQSGNFI K+ALF+TVIGVLLFRQTQDIEGLR +AEEATFYDKQWQASWQNQN
Sbjct: 181 IGIVGQSGNFISKSALFVTVIGVLLFRQTQDIEGLRKLAEEATFYDKQWQASWQNQN 236

BLAST of Cla97C07G144190 vs. NCBI nr
Match: XP_022984403.1 (light-harvesting complex-like protein 3 isotype 1, chloroplastic [Cucurbita maxima])

HSP 1 Score: 395.6 bits (1015), Expect = 1.2e-106
Identity = 201/238 (84.45%), Postives = 214/238 (89.92%), Query Frame = 0

Query: 1   MASIAISASFPRASTSNHHVSMKKQHAHQARPAYSSTTKNPTPKVISTLDVGDCD-GFNA 60
           MASIAISASF R+ TSN HVS K+QHA QARP YSS TKNPTP++I T DVG+ D  F+A
Sbjct: 47  MASIAISASFRRSCTSN-HVSKKQQHARQARPTYSSMTKNPTPELIRTPDVGNRDAAFDA 106

Query: 61  TEHHGNVASARDKLEEDTDVWNHGKRFKDKRWRNGTWDLNMFVKNGKMDWEGVIVAEAKR 120
            + HGNVASA DKLEEDTD  NHG+ F D+RW+NGTWDLNMFVKNGKMDWEGVIVAEAKR
Sbjct: 107 PKRHGNVASATDKLEEDTDDMNHGQTFTDERWKNGTWDLNMFVKNGKMDWEGVIVAEAKR 166

Query: 121 RKFLELYPEPATDQEPVLFRSSIIPWWTWLTTSYLPQAELLNGRAAMIGFFMAYLVDALT 180
           RKFLELYPE AT+QEPVLFRSSIIPWW WLT SYLPQAELLNGRAAMIGFFMAYLVDALT
Sbjct: 167 RKFLELYPETATNQEPVLFRSSIIPWWAWLTKSYLPQAELLNGRAAMIGFFMAYLVDALT 226

Query: 181 GIGIVGQSGNFICKAALFLTVIGVLLFRQTQDIEGLRNIAEEATFYDKQWQASWQNQN 238
           GIGIVGQSGNFI KAALF+TVIGVLLFRQTQDIEGLR +AEEATFYDKQWQASWQNQN
Sbjct: 227 GIGIVGQSGNFISKAALFVTVIGVLLFRQTQDIEGLRKLAEEATFYDKQWQASWQNQN 283

BLAST of Cla97C07G144190 vs. NCBI nr
Match: XP_022922577.1 (light-harvesting complex-like protein 3 isotype 1, chloroplastic [Cucurbita moschata])

HSP 1 Score: 386.7 bits (992), Expect = 5.6e-104
Identity = 195/237 (82.28%), Postives = 207/237 (87.34%), Query Frame = 0

Query: 1   MASIAISASFPRASTSNHHVSMKKQHAHQARPAYSSTTKNPTPKVISTLDVGDCDGFNAT 60
           MASIAISASFPR+ TSNH     KQ   QARP YSS TKNPTP++I T  VGD D  +A 
Sbjct: 1   MASIAISASFPRSCTSNH--VSNKQQRRQARPTYSSMTKNPTPELIKTPYVGDRDALDAP 60

Query: 61  EHHGNVASARDKLEEDTDVWNHGKRFKDKRWRNGTWDLNMFVKNGKMDWEGVIVAEAKRR 120
           + HGNVASA DKLEEDTD  NHG+ F D+RW+NGTWDLNMFVKNGKMDWEGVIVAEAKRR
Sbjct: 61  KRHGNVASATDKLEEDTDDMNHGQTFTDERWKNGTWDLNMFVKNGKMDWEGVIVAEAKRR 120

Query: 121 KFLELYPEPATDQEPVLFRSSIIPWWTWLTTSYLPQAELLNGRAAMIGFFMAYLVDALTG 180
           KFLELYPE AT+ EPVLFRSSIIPWW WLT SYLPQAELLNGRAAMIGFFMAYLVDALTG
Sbjct: 121 KFLELYPETATNHEPVLFRSSIIPWWAWLTKSYLPQAELLNGRAAMIGFFMAYLVDALTG 180

Query: 181 IGIVGQSGNFICKAALFLTVIGVLLFRQTQDIEGLRNIAEEATFYDKQWQASWQNQN 238
           IGIVGQSGNFI KAALF+TVIGVLLFRQT+DIEGLR +AEEATFYDKQWQASWQNQN
Sbjct: 181 IGIVGQSGNFISKAALFVTVIGVLLFRQTKDIEGLRKLAEEATFYDKQWQASWQNQN 235

BLAST of Cla97C07G144190 vs. NCBI nr
Match: XP_022141114.1 (light-harvesting complex-like protein 3 isotype 1, chloroplastic [Momordica charantia])

HSP 1 Score: 347.8 bits (891), Expect = 2.9e-92
Identity = 178/237 (75.11%), Postives = 197/237 (83.12%), Query Frame = 0

Query: 1   MASI--AISASFPRASTSNHHVSMKKQHAH-QARPAYSSTTKNPTPKVISTLDV--GDCD 60
           MASI  AISAS P   TS HHVS KKQHAH QARPAYS     P P+V+ST++V  G  D
Sbjct: 1   MASISAAISASLPTVCTSQHHVSKKKQHAHYQARPAYSF----PAPEVVSTINVDGGGRD 60

Query: 61  GFNATEHHGNVASARDKLEEDTDVWNHGKRFKDKRWRNGTWDLNMFVKNGKMDWEGVIVA 120
            F+  E HG     RDKL EDTD WNHG+RF D+RW+NGTWDLNMFVKNGKMDWEGVIVA
Sbjct: 61  SFDTAERHG-----RDKLNEDTDAWNHGERFTDERWKNGTWDLNMFVKNGKMDWEGVIVA 120

Query: 121 EAKRRKFLELYPEPATDQEPVLFRSSIIPWWTWLTTSYLPQAELLNGRAAMIGFFMAYLV 180
           EAKRRK LE+YPE AT+Q PVLFRSSIIPWW WL+ SYLPQAELLNGRAAM+GFF+AY+V
Sbjct: 121 EAKRRKILEIYPEAATNQHPVLFRSSIIPWWAWLSNSYLPQAELLNGRAAMLGFFVAYVV 180

Query: 181 DALTGIGIVGQSGNFICKAALFLTVIGVLLFRQTQDIEGLRNIAEEATFYDKQWQAS 233
           DALTGIGIV QSGNF+CK+ALF+TVI VLLFRQT+D EGLR +AEEATFYDKQWQAS
Sbjct: 181 DALTGIGIVWQSGNFLCKSALFVTVISVLLFRQTEDAEGLRKLAEEATFYDKQWQAS 228

BLAST of Cla97C07G144190 vs. NCBI nr
Match: XP_008453842.1 (PREDICTED: uncharacterized protein LOC103494445 [Cucumis melo])

HSP 1 Score: 339.0 bits (868), Expect = 1.3e-89
Identity = 179/243 (73.66%), Postives = 191/243 (78.60%), Query Frame = 0

Query: 1   MASIAISASFPRASTSNHHVSMKK--QHAHQARPAYSSTTKNPTPKVISTLDVGDCD--- 60
           MASIAISAS PRA+  N HVSMKK  QHAH A+PAYSS TKNPTPKVISTLDVG+ D   
Sbjct: 1   MASIAISASLPRAANPN-HVSMKKKQQHAHLAKPAYSSMTKNPTPKVISTLDVGNRDXXX 60

Query: 61  --GFNATEHHGNVASARDKLEEDTDVWNHGKRFKDKRWRNGTWDLNMFVKNGKMDWEGVI 120
                                         +RF DKRW+NGTWDLNMFVKNGKMDWEGVI
Sbjct: 61  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXERFTDKRWKNGTWDLNMFVKNGKMDWEGVI 120

Query: 121 VAEAKRRKFLELYPEPATDQEPVLFRSSIIPWWTWLTTSYLPQAELLNGRAAMIGFFMAY 180
           V EAKRRKFLEL+PE AT++EPVLFRSSIIPWW WLT SYLPQAELLNGRAAMIGFFM Y
Sbjct: 121 VEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGY 180

Query: 181 LVDALTGIGIVGQSGNFICKAALFLTVIGVLLFRQTQDIEGLRNIAEEATFYDKQWQASW 237
            VDALTGIGIVGQSGNFICK ALFLTVIGVLLFRQ++D+E LRNIAEEATFYDKQWQ+SW
Sbjct: 181 GVDALTGIGIVGQSGNFICKTALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSW 240

BLAST of Cla97C07G144190 vs. TrEMBL
Match: tr|A0A1S3BXB8|A0A1S3BXB8_CUCME (uncharacterized protein LOC103494445 OS=Cucumis melo OX=3656 GN=LOC103494445 PE=4 SV=1)

HSP 1 Score: 339.0 bits (868), Expect = 8.9e-90
Identity = 179/243 (73.66%), Postives = 191/243 (78.60%), Query Frame = 0

Query: 1   MASIAISASFPRASTSNHHVSMKK--QHAHQARPAYSSTTKNPTPKVISTLDVGDCD--- 60
           MASIAISAS PRA+  N HVSMKK  QHAH A+PAYSS TKNPTPKVISTLDVG+ D   
Sbjct: 1   MASIAISASLPRAANPN-HVSMKKKQQHAHLAKPAYSSMTKNPTPKVISTLDVGNRDXXX 60

Query: 61  --GFNATEHHGNVASARDKLEEDTDVWNHGKRFKDKRWRNGTWDLNMFVKNGKMDWEGVI 120
                                         +RF DKRW+NGTWDLNMFVKNGKMDWEGVI
Sbjct: 61  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXERFTDKRWKNGTWDLNMFVKNGKMDWEGVI 120

Query: 121 VAEAKRRKFLELYPEPATDQEPVLFRSSIIPWWTWLTTSYLPQAELLNGRAAMIGFFMAY 180
           V EAKRRKFLEL+PE AT++EPVLFRSSIIPWW WLT SYLPQAELLNGRAAMIGFFM Y
Sbjct: 121 VEEAKRRKFLELHPEAATNEEPVLFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGY 180

Query: 181 LVDALTGIGIVGQSGNFICKAALFLTVIGVLLFRQTQDIEGLRNIAEEATFYDKQWQASW 237
            VDALTGIGIVGQSGNFICK ALFLTVIGVLLFRQ++D+E LRNIAEEATFYDKQWQ+SW
Sbjct: 181 GVDALTGIGIVGQSGNFICKTALFLTVIGVLLFRQSEDVENLRNIAEEATFYDKQWQSSW 240

BLAST of Cla97C07G144190 vs. TrEMBL
Match: tr|A0A0A0KUF5|A0A0A0KUF5_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G025130 PE=4 SV=1)

HSP 1 Score: 332.8 bits (852), Expect = 6.4e-88
Identity = 171/237 (72.15%), Postives = 183/237 (77.22%), Query Frame = 0

Query: 1   MASIAISASFPRASTSNHHVSMKKQHAHQARPAYSSTTKNPTPKVISTLDV---GDCDGF 60
           MASIAISAS PRAS SNH    KKQ  H A+P YSSTTKNPTPKVISTLDV         
Sbjct: 1   MASIAISASLPRASNSNHVSMKKKQRVHLAKPGYSSTTKNPTPKVISTLDVVNRXXXXXX 60

Query: 61  NATEHHGNVASARDKLEEDTDVWNHGKRFKDKRWRNGTWDLNMFVKNGKMDWEGVIVAEA 120
                                     +RF DKRW+NGTWDLNMFV+NGKMDWEGVIV EA
Sbjct: 61  XXXXXXXXXXXXXXXXXXXXXXXXXXERFTDKRWKNGTWDLNMFVQNGKMDWEGVIVEEA 120

Query: 121 KRRKFLELYPEPATDQEPVLFRSSIIPWWTWLTTSYLPQAELLNGRAAMIGFFMAYLVDA 180
           KRRKFLE++PE AT+QEPV+FRSSIIPWW WLT SYLPQAELLNGRAAMIGFFM Y VDA
Sbjct: 121 KRRKFLEIHPEAATNQEPVVFRSSIIPWWVWLTKSYLPQAELLNGRAAMIGFFMGYAVDA 180

Query: 181 LTGIGIVGQSGNFICKAALFLTVIGVLLFRQTQDIEGLRNIAEEATFYDKQWQASWQ 235
           LTG+GIVGQSGNFICK ALFLTVIGVLLFRQ++DIE LRNIAEEATFYDKQWQ+SWQ
Sbjct: 181 LTGVGIVGQSGNFICKTALFLTVIGVLLFRQSEDIENLRNIAEEATFYDKQWQSSWQ 237

BLAST of Cla97C07G144190 vs. TrEMBL
Match: tr|A0A2N9FMI3|A0A2N9FMI3_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS16270 PE=4 SV=1)

HSP 1 Score: 293.9 bits (751), Expect = 3.3e-76
Identity = 150/244 (61.48%), Postives = 186/244 (76.23%), Query Frame = 0

Query: 1   MASIAISASFPRASTSNHHVSMKKQHAHQARPAYSSTTKNPTPKVISTLDVGDCDGFNAT 60
           MASIAI++S   AS+ +H     K+   Q+  A+S  +K  T  V  TLDV    GFN  
Sbjct: 1   MASIAITSSLHMASSKHH----TKKQTPQSGTAHSLGSKQVTRHV--TLDVEGQKGFNMA 60

Query: 61  EHHGNVAS-------ARDKLEEDTDVWNHGKRFKDKRWRNGTWDLNMFVKNGKMDWEGVI 120
           EHHG  AS       A++++E  +D    G +F D+RW+NGTWDLNMFV++GKMDW+GVI
Sbjct: 61  EHHGKSASYNEKSKNAKEEMEHGSDTGPCGPKFVDERWKNGTWDLNMFVQDGKMDWDGVI 120

Query: 121 VAEAKRRKFLELYPEPATDQEPVLFRSSIIPWWTWLTTSYLPQAELLNGRAAMIGFFMAY 180
           VAEA+RRKFLELYPE AT+Q+ VLFR+SIIPWW W   SYLP+AEL+NGRAAM+GFFMAY
Sbjct: 121 VAEARRRKFLELYPEAATEQDTVLFRTSIIPWWAWFMRSYLPEAELINGRAAMVGFFMAY 180

Query: 181 LVDALTGIGIVGQSGNFICKAALFLTVIGVLLFRQTQDIEGLRNIAEEATFYDKQWQASW 238
           LVDALTG+ +VGQ+GNFICKA LF+TVI V+L R+TQD E L+ +A+EATFYDKQWQASW
Sbjct: 181 LVDALTGLDVVGQTGNFICKAGLFVTVISVILLRRTQDFETLKKLADEATFYDKQWQASW 238

BLAST of Cla97C07G144190 vs. TrEMBL
Match: tr|A0A0B0N1D6|A0A0B0N1D6_GOSAR (Uncharacterized protein OS=Gossypium arboreum OX=29729 GN=F383_31994 PE=4 SV=1)

HSP 1 Score: 277.7 bits (709), Expect = 2.4e-71
Identity = 141/242 (58.26%), Postives = 179/242 (73.97%), Query Frame = 0

Query: 1   MASIAISASFPRASTSNHHVSMKKQHAHQARPAYSSTTKNPTPKVISTLDVGDCDGFNAT 60
           MASIAISAS  +A  S+HHV+ KKQHA Q +PAYS  TK     V  T+DV    GF   
Sbjct: 231 MASIAISASLQKA-CSSHHVA-KKQHA-QTKPAYSLGTKQAIDAV--TIDVEGQKGFKID 290

Query: 61  EHH------GNVASARDKLEEDTDVWNHGKRFKDKRWRNGTWDLNMFVKNGKMDWEGVIV 120
           E         N     DK     ++ +  ++F D+RW+NGTWDLNMFV+NG+MDW+ VIV
Sbjct: 291 EKDQPSPQINNSEGLEDKSANKLEIESSARKFSDERWKNGTWDLNMFVRNGRMDWDSVIV 350

Query: 121 AEAKRRKFLELYPEPATDQEPVLFRSSIIPWWTWLTTSYLPQAELLNGRAAMIGFFMAYL 180
           AEAKRRK+LE+YPE  ++QEPV FRSSIIPWW W   +YLP+AELLNGRAAMIGFF AY+
Sbjct: 351 AEAKRRKYLEMYPETCSNQEPVQFRSSIIPWWAWFMRTYLPEAELLNGRAAMIGFFSAYV 410

Query: 181 VDALTGIGIVGQSGNFICKAALFLTVIGVLLFRQTQDIEGLRNIAEEATFYDKQWQASWQ 237
           VD LTG+ ++GQ+GNFICK ALF+TVIG++L R+T+D + LR +A+E T+YDKQWQASW+
Sbjct: 411 VDGLTGMDLIGQTGNFICKTALFMTVIGIVLLRKTRDFDNLRKLADEVTYYDKQWQASWK 467

BLAST of Cla97C07G144190 vs. TrEMBL
Match: tr|A0A1R3JIV9|A0A1R3JIV9_COCAP (Chlorophyll A-B binding protein OS=Corchorus capsularis OX=210143 GN=CCACVL1_05810 PE=4 SV=1)

HSP 1 Score: 277.3 bits (708), Expect = 3.2e-71
Identity = 138/244 (56.56%), Postives = 183/244 (75.00%), Query Frame = 0

Query: 1   MASIAISASFPRASTSNHHVSMKKQHAHQARPAYSSTTKNPTPKVISTLDVGDCDGFN-- 60
           MASI+ISAS  RA +S H   ++++     +P YS  TK     V  TLD+    G+   
Sbjct: 1   MASISISASLQRACSSQH---VRRKQKPIPKPTYSLGTKQVMKAV--TLDMEAQKGYKTD 60

Query: 61  -----ATEHHGNVASARDKLEEDTDVWNHGKRFKDKRWRNGTWDLNMFVKNGKMDWEGVI 120
                A+E +       ++L+ED +  +   +F+D+RW+NGTWDLNMFV+NG+MDW+ VI
Sbjct: 61  TKEEPASEMNYKSKDPEERLKEDFETESSAPKFRDERWKNGTWDLNMFVRNGRMDWDSVI 120

Query: 121 VAEAKRRKFLELYPEPATDQEPVLFRSSIIPWWTWLTTSYLPQAELLNGRAAMIGFFMAY 180
           +AEA+RRKFLE++PE  T++EPV FRSSIIPWW WL  +YLP+AELLNGRAAMIGFFMAY
Sbjct: 121 IAEARRRKFLEMHPEATTNEEPVKFRSSIIPWWAWLMRTYLPEAELLNGRAAMIGFFMAY 180

Query: 181 LVDALTGIGIVGQSGNFICKAALFLTVIGVLLFRQTQDIEGLRNIAEEATFYDKQWQASW 238
           +VDALTG+ +VGQ+GNFICKA LF+TVIGVL+ R+TQD + L+ +A+EAT+YDKQWQASW
Sbjct: 181 IVDALTGLDVVGQTGNFICKAGLFVTVIGVLVLRKTQDFDNLKKLADEATYYDKQWQASW 239

BLAST of Cla97C07G144190 vs. Swiss-Prot
Match: sp|Q9SYX1|LIL31_ARATH (Light-harvesting complex-like protein 3 isotype 1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=LIL3.1 PE=1 SV=1)

HSP 1 Score: 214.2 bits (544), Expect = 1.6e-54
Identity = 95/153 (62.09%), Postives = 120/153 (78.43%), Query Frame = 0

Query: 85  RFKDKRWRNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEPATDQEPVLFRSSIIP 144
           +F+D RW NGTWDL  F K+GK DW+ VIVAEAKRRK+LE  PE  ++ EPVLF +SIIP
Sbjct: 101 KFQDARWINGTWDLKQFEKDGKTDWDSVIVAEAKRRKWLEENPETTSNDEPVLFDTSIIP 160

Query: 145 WWTWLTTSYLPQAELLNGRAAMIGFFMAYLVDALTGIGIVGQSGNFICKAALFLTVIGVL 204
           WW W+   +LP+AELLNGRAAMIGFFMAY VD+LTG+G+V Q GNF CK  LF+ V GVL
Sbjct: 161 WWAWIKRYHLPEAELLNGRAAMIGFFMAYFVDSLTGVGLVDQMGNFFCKTLLFVAVAGVL 220

Query: 205 LFRQTQDIEGLRNIAEEATFYDKQWQASWQNQN 238
             R+ +D++ L+N+ +E T YDKQWQA+W+N +
Sbjct: 221 FIRKNEDVDKLKNLFDETTLYDKQWQAAWKNDD 253

BLAST of Cla97C07G144190 vs. Swiss-Prot
Match: sp|Q6NKS4|LIL32_ARATH (Light-harvesting complex-like protein 3 isotype 2, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=LIL3.2 PE=1 SV=1)

HSP 1 Score: 199.1 bits (505), Expect = 5.4e-50
Identity = 87/150 (58.00%), Postives = 117/150 (78.00%), Query Frame = 0

Query: 85  RFKDKRWRNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEPATDQEPVLFRSSIIP 144
           ++++ +W NGTWDL  F K+GK DW+ VIV+EAKRRK+LE  PE  ++ E V+F +SIIP
Sbjct: 98  KYQNAKWVNGTWDLKQFEKDGKTDWDSVIVSEAKRRKWLEDNPETTSNDELVVFDTSIIP 157

Query: 145 WWTWLTTSYLPQAELLNGRAAMIGFFMAYLVDALTGIGIVGQSGNFICKAALFLTVIGVL 204
           WW W+   +LP+AELLNGRAAMIGFFMAY VD+LTG+G+V Q GNF CK  LF+ V GVL
Sbjct: 158 WWAWMKRYHLPEAELLNGRAAMIGFFMAYFVDSLTGVGLVDQMGNFFCKTLLFVAVAGVL 217

Query: 205 LFRQTQDIEGLRNIAEEATFYDKQWQASWQ 235
             R+ +D++ L+++ +E T YDKQWQA+W+
Sbjct: 218 FIRKNEDLDKLKDLFDETTLYDKQWQAAWK 247

BLAST of Cla97C07G144190 vs. TAIR10
Match: AT4G17600.1 (Chlorophyll A-B binding family protein)

HSP 1 Score: 214.2 bits (544), Expect = 9.0e-56
Identity = 95/153 (62.09%), Postives = 120/153 (78.43%), Query Frame = 0

Query: 85  RFKDKRWRNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEPATDQEPVLFRSSIIP 144
           +F+D RW NGTWDL  F K+GK DW+ VIVAEAKRRK+LE  PE  ++ EPVLF +SIIP
Sbjct: 101 KFQDARWINGTWDLKQFEKDGKTDWDSVIVAEAKRRKWLEENPETTSNDEPVLFDTSIIP 160

Query: 145 WWTWLTTSYLPQAELLNGRAAMIGFFMAYLVDALTGIGIVGQSGNFICKAALFLTVIGVL 204
           WW W+   +LP+AELLNGRAAMIGFFMAY VD+LTG+G+V Q GNF CK  LF+ V GVL
Sbjct: 161 WWAWIKRYHLPEAELLNGRAAMIGFFMAYFVDSLTGVGLVDQMGNFFCKTLLFVAVAGVL 220

Query: 205 LFRQTQDIEGLRNIAEEATFYDKQWQASWQNQN 238
             R+ +D++ L+N+ +E T YDKQWQA+W+N +
Sbjct: 221 FIRKNEDVDKLKNLFDETTLYDKQWQAAWKNDD 253

BLAST of Cla97C07G144190 vs. TAIR10
Match: AT5G47110.1 (Chlorophyll A-B binding family protein)

HSP 1 Score: 199.1 bits (505), Expect = 3.0e-51
Identity = 87/150 (58.00%), Postives = 117/150 (78.00%), Query Frame = 0

Query: 85  RFKDKRWRNGTWDLNMFVKNGKMDWEGVIVAEAKRRKFLELYPEPATDQEPVLFRSSIIP 144
           ++++ +W NGTWDL  F K+GK DW+ VIV+EAKRRK+LE  PE  ++ E V+F +SIIP
Sbjct: 98  KYQNAKWVNGTWDLKQFEKDGKTDWDSVIVSEAKRRKWLEDNPETTSNDELVVFDTSIIP 157

Query: 145 WWTWLTTSYLPQAELLNGRAAMIGFFMAYLVDALTGIGIVGQSGNFICKAALFLTVIGVL 204
           WW W+   +LP+AELLNGRAAMIGFFMAY VD+LTG+G+V Q GNF CK  LF+ V GVL
Sbjct: 158 WWAWMKRYHLPEAELLNGRAAMIGFFMAYFVDSLTGVGLVDQMGNFFCKTLLFVAVAGVL 217

Query: 205 LFRQTQDIEGLRNIAEEATFYDKQWQASWQ 235
             R+ +D++ L+++ +E T YDKQWQA+W+
Sbjct: 218 FIRKNEDLDKLKDLFDETTLYDKQWQAAWK 247

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_023551912.16.4e-10884.39light-harvesting complex-like protein 3 isotype 1, chloroplastic [Cucurbita pepo... [more]
XP_022984403.11.2e-10684.45light-harvesting complex-like protein 3 isotype 1, chloroplastic [Cucurbita maxi... [more]
XP_022922577.15.6e-10482.28light-harvesting complex-like protein 3 isotype 1, chloroplastic [Cucurbita mosc... [more]
XP_022141114.12.9e-9275.11light-harvesting complex-like protein 3 isotype 1, chloroplastic [Momordica char... [more]
XP_008453842.11.3e-8973.66PREDICTED: uncharacterized protein LOC103494445 [Cucumis melo][more]
Match NameE-valueIdentityDescription
tr|A0A1S3BXB8|A0A1S3BXB8_CUCME8.9e-9073.66uncharacterized protein LOC103494445 OS=Cucumis melo OX=3656 GN=LOC103494445 PE=... [more]
tr|A0A0A0KUF5|A0A0A0KUF5_CUCSA6.4e-8872.15Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G025130 PE=4 SV=1[more]
tr|A0A2N9FMI3|A0A2N9FMI3_FAGSY3.3e-7661.48Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS16270 PE=4 SV=1[more]
tr|A0A0B0N1D6|A0A0B0N1D6_GOSAR2.4e-7158.26Uncharacterized protein OS=Gossypium arboreum OX=29729 GN=F383_31994 PE=4 SV=1[more]
tr|A0A1R3JIV9|A0A1R3JIV9_COCAP3.2e-7156.56Chlorophyll A-B binding protein OS=Corchorus capsularis OX=210143 GN=CCACVL1_058... [more]
Match NameE-valueIdentityDescription
sp|Q9SYX1|LIL31_ARATH1.6e-5462.09Light-harvesting complex-like protein 3 isotype 1, chloroplastic OS=Arabidopsis ... [more]
sp|Q6NKS4|LIL32_ARATH5.4e-5058.00Light-harvesting complex-like protein 3 isotype 2, chloroplastic OS=Arabidopsis ... [more]
Match NameE-valueIdentityDescription
AT4G17600.19.0e-5662.09Chlorophyll A-B binding family protein[more]
AT5G47110.13.0e-5158.00Chlorophyll A-B binding family protein[more]
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006355 regulation of transcription, DNA-templated
biological_process GO:0008150 biological_process
biological_process GO:1902326 positive regulation of chlorophyll biosynthetic process
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0009535 chloroplast thylakoid membrane
cellular_component GO:0005575 cellular_component
cellular_component GO:0016592 mediator complex
cellular_component GO:0042651 thylakoid membrane
cellular_component GO:0016020 membrane
cellular_component GO:0009507 chloroplast
molecular_function GO:0042802 identical protein binding
molecular_function GO:0019899 enzyme binding
molecular_function GO:0001104 RNA polymerase II transcription cofactor activity
molecular_function GO:0000166 nucleotide binding
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0003674 molecular_function
molecular_function GO:0043495 protein anchor

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C07G144190.1Cla97C07G144190.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..42
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..18
NoneNo IPR availablePANTHERPTHR14154:SF13SUBFAMILY NOT NAMEDcoord: 56..237
NoneNo IPR availablePANTHERPTHR14154UPF0041 BRAIN PROTEIN 44-RELATEDcoord: 56..237
NoneNo IPR availableSUPERFAMILYSSF103511Chlorophyll a-b binding proteincoord: 137..191

The following gene(s) are paralogous to this gene:

None