CmaCh16G011040 (gene) Cucurbita maxima (Rimu)

Name: CmaCh16G011040
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Lil3 family protein
Location: Cma_Chr16 : 8495173 .. 8498340 (-)
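
The location string can be split into its parts programmatically. The sketch below is only an illustration: the `Location` type and the `parse_location` and `span_length` helpers are hypothetical names, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for one feature location."""
    seqid: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a string such as 'Cma_Chr16 : 8495173 .. 8498340 (-)'."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("Cma_Chr16 : 8495173 .. 8498340 (-)")
print(loc.seqid, loc.strand, span_length(loc))
```

Under the inclusive-coordinate assumption the gene spans 3168 bp on the minus strand of Cma_Chr16.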
   



The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCGGAAAAGTCCCCGAAGACGCTGCCACCGGCGAAAGGGATGAGAAAATTAGTGGAAAGCTGCGCCGTTCTCAGAGAGGAGGAGGACAATGACAACGTCCGGTTCGATTATCAATGGCGATATTCTGTTGGAAAGCGCGAAGCTGGAGGCAGTGGTGGCGACCTCCATGGCTATAAAACCTTGAAGAAGCAGAACGATAACCAAGCTGGAGGAGCGAAAATGGGAGATTTGTGGGCTTTTGAAGGTAATGGCTTAGGCCCAGCCCAAATCATTTTTTTTTCAAATATAGATTTTGTACCTATAAAAAAATTTAGACTTAAAAATAAATAAATTAATATCTAAATGAACCTAGCTTACATTTGAATGGAACCGATCTTAAGGAAAAGATTAAAATAACGGAAAGTTTCTATTTATACCAACAAGGTACACCGGTTCAATCACAGAAACTTCATGATTAAACGTACTCAGACCAATTCTATATTAAGTGATCTCTTATGAGTTTTTTAAAGATGTATGTGAGTGATGACAAAACATACTAGAAGAATTCGTGTTAATTGTGGGAATAGTTTTCCCTTCAGGGTATTATTAAAAAACTTTAAAATTCTAGAAAAAAGTTTATAAGGGTATTTTATTATTTTACACAAAATTAAACAATTTAGCTTCATCTTAATAATCATTACATATGAATTAAAAAAACATATTAAACAAGCTTGTGTTTAACCTTTTAATATAAAATTTAAAGAATTTAATAATATATTCCTTTCCACTTATTAATTTTAAAAAATATTTATATTATATTGTTTAAAATAAAAATTATTATTAGATTAAAATATCATTTTAATTTTTATAGATATCTAATTTTAGTTAAATAAATCTAAATTCAGTTTTATTTGATTTTTTTAGTAAAAAAAGTTCTTTATTTAACATTTTTTTTTAAATATATTCACCTAAGTATTTTAAGACTATTATTATTATTTAATAAATTTTGAAATACTAGATTTATTAGAATTAAATTTAATAATATTATTATTTAATCAACTTTATATATATATATTTATTTATTTATTTTTTAAGCAAGCACATTTATTATGGGCGAAAGTGCAAAGCTTCCCTATCCATCATAATTACAAGGCTGAGGATAGAGCTTTGAAGAGGAGGAGCTCTCTCTTCTTCTTCCTTCCACACCAAAACAATGTCTTCCATGGCTTTGTTTTCTCCTCCCTCCCGTCTTTCCTCTCTCTCTCCTTCTTCACATCACCGAACCCACTTCCCCTCCAGACCCTTTTCCTCTCTCAGGTCCAGAAACCCTTCTTCTTGGGTCGCTGCTGACAATGGCGCCGGAATTTCAGGTGGTTCCGTCGCTGTAGAACCCGCCGTCGTACAGAAGGATCCGGAGCCGGCGGTGGAAAAGAAAGAGAGCCTTGCTGAAACCAATGGGTCGGTGGCGGCTGTGGAAGAAGTGGTGGTGGTTAGCAAATTTGAAGACCCTAAATGGGTTAATGGGACTTGGGATTTGAATCAGTTCCGGAAAGATGGAAGTACTGATTGGGATGCTGTTATTGATGCTGGTAATCTTCGTAAACCCTACATTTTCTGTTGATTTTGCTTCTAATTGCATTACGAAATCTGGATGTACATAGATACATGCAATGTCTGAGAAGCTTTTTTGGGGAAATCTTGATCTGCAAGTTTTGATGATGTTTAGATTGTAATGGATTTCTTGGCGATGTTTGAGCAATTACTTGATGTTTGAATCAGAGGCTAGGAGGAGGAAATGGCTGGAAAACAATCCAGAATCATCAAGTAATGAGGACCCTGTGCTGTTTGACACATCCATAGTTCCTTGGTGGGCTTGGATTAAGCGCTACCATCTGCCAGAGGCTGAGATTCTCAATGGTATAAAACATGTTTCTATGATTGCTTATCTTGTTTTCTTAAAGCTTGTCTGTGTGAAATAATGGCTAAATGTGAATGTTTGAAAATCGTGTTTAGAAGTG
TAAAATTGAATATTAATTTGATTTTGAATGTTTAAACACATGTTCTTGAGTGATTATGAACGTGAATGACTGATTTTAACCATTTGAAAATCACTCAAACGTATTTCTTGTTCTTTCGTTCATGTCTGTGAATCATGATTTCTACATGTTCATTGTATTGGCTTCTCTGTATGAAAATCATGTTTAAAAGTGTAAAATCATACATGGAAGTTGATTTTGAATGTTTAAACACATGTTCTTGAGTGATTTTGAGCATGTTCTTGAGTGATTTTGAACGTGAAGGACCGATTTTAACCATTTGAAAATCACTCGATCGTATTTTTTGTTCTTTCTTTCACGTCTGTGGATCATGATTTCTACTTGTTCATAACATTGGCTTCTCCATATGAAAATCACGTTTAAAAGCGTAAAATCATACATTAAGATGATTTTGAATGTTCAAACGCATGTTCTTGACGTGAATGACTGATTTTAACCATTTGAAAATCACTCAAACGTATTTTTTGATGATTTTGAATGTTCAAACGCATGTTCTTGAGTGATTTTGAACGTGAATGACTGATTTTAACCATTTGAAAATCACTCGATCATACTTTGTTCTTGAGTATTTCTACGTATAACGTTGGCTTCTCTGTAACTTACAAGTAATATTGTGTCAAGTAAAGTATTGAATCTACTATAACATTGAAGTAAATTTGAGCGGTGCAATGGTTTCGATGCAGGCCGTGCAGCCATGGTGGGGTTTTTCATGGCTTACTTCGTCGATAGCTTGACGGGGGTAGGACTAGTAGGTCAAATGGGCAACTTCTTCTGCAAAACTTTGTTATTTGTTGCAGTGGTGGGAGTTCTTTTGATCAGAAAGAATGAAGATATAGAGACTCTAAAGAAGTTGATTGATGAGACGACATTTTATGATAAACAATGGCAAGCAACTTGGCAGGATGAAACCAAAGGCTCAGGCAAAGTGTAATGTGGCAAAGTTGCCTGTGTTCTTGTTTTGCTTTAATTATGTTTGCATCTCATAAGATTTGGGCTTAGTAGAGAAGGATAATGTGAGCTCTGAAAGATATTGAACGTTTAGGCCTCAGTTTGTTAATGTTTCTTGAAGTTGTTGAGGGTTTCAATGAAAAATCTCTCTCTAAAAGGTTGGGTTTTCTGAAAAAAAAAA

mRNA sequence

ATGCCGGAAAAGTCCCCGAAGACGCTGCCACCGGCGAAAGGGATGAGAAAATTAGTGGAAAGCTGCGCCGTTCTCAGAGAGGAGGAGGACAATGACAACGTCCGGTTCGATTATCAATGGCGATATTCTGTTGGAAAGCGCGAAGCTGGAGGCAGTGGTGGCGACCTCCATGGCTATAAAACCTTGAAGAAGCAGAACGATAACCAAGCTGGAGGAGCGAAAATGGGAGATTTGTGGGCTTTTGAAGGAGGAGCTCTCTCTTCTTCTTCCTTCCACACCAAAACAATGTCTTCCATGGCTTTGTTTTCTCCTCCCTCCCGTCTTTCCTCTCTCTCTCCTTCTTCACATCACCGAACCCACTTCCCCTCCAGACCCTTTTCCTCTCTCAGGTCCAGAAACCCTTCTTCTTGGGTCGCTGCTGACAATGGCGCCGGAATTTCAGGTGGTTCCGTCGCTGTAGAACCCGCCGTCGTACAGAAGGATCCGGAGCCGGCGGTGGAAAAGAAAGAGAGCCTTGCTGAAACCAATGGGTCGGTGGCGGCTGTGGAAGAAGTGGTGGTGGTTAGCAAATTTGAAGACCCTAAATGGGTTAATGGGACTTGGGATTTGAATCAGTTCCGGAAAGATGGAAGTACTGATTGGGATGCTGTTATTGATGCTGAGGCTAGGAGGAGGAAATGGCTGGAAAACAATCCAGAATCATCAAGTAATGAGGACCCTGTGCTGTTTGACACATCCATAGTTCCTTGGTGGGCTTGGATTAAGCGCTACCATCTGCCAGAGGCTGAGATTCTCAATGGCCGTGCAGCCATGGTGGGGTTTTTCATGGCTTACTTCGTCGATAGCTTGACGGGGGTAGGACTAGTAGGTCAAATGGGCAACTTCTTCTGCAAAACTTTGTTATTTGTTGCAGTGGTGGGAGTTCTTTTGATCAGAAAGAATGAAGATATAGAGACTCTAAAGAAGTTGATTGATGAGACGACATTTTATGATAAACAATGGCAAGCAACTTGGCAGGATGAAACCAAAGGCTCAGGCAAAGTGTAATGTGGCAAAGTTGCCTGTGTTCTTGTTTTGCTTTAATTATGTTTGCATCTCATAAGATTTGGGCTTAGTAGAGAAGGATAATGTGAGCTCTGAAAGATATTGAACGTTTAGGCCTCAGTTTGTTAATGTTTCTTGAAGTTGTTGAGGGTTTCAATGAAAAATCTCTCTCTAAAAGGTTGGGTTTTCTGAAAAAAAAAA

Coding sequence (CDS)

ATGCCGGAAAAGTCCCCGAAGACGCTGCCACCGGCGAAAGGGATGAGAAAATTAGTGGAAAGCTGCGCCGTTCTCAGAGAGGAGGAGGACAATGACAACGTCCGGTTCGATTATCAATGGCGATATTCTGTTGGAAAGCGCGAAGCTGGAGGCAGTGGTGGCGACCTCCATGGCTATAAAACCTTGAAGAAGCAGAACGATAACCAAGCTGGAGGAGCGAAAATGGGAGATTTGTGGGCTTTTGAAGGAGGAGCTCTCTCTTCTTCTTCCTTCCACACCAAAACAATGTCTTCCATGGCTTTGTTTTCTCCTCCCTCCCGTCTTTCCTCTCTCTCTCCTTCTTCACATCACCGAACCCACTTCCCCTCCAGACCCTTTTCCTCTCTCAGGTCCAGAAACCCTTCTTCTTGGGTCGCTGCTGACAATGGCGCCGGAATTTCAGGTGGTTCCGTCGCTGTAGAACCCGCCGTCGTACAGAAGGATCCGGAGCCGGCGGTGGAAAAGAAAGAGAGCCTTGCTGAAACCAATGGGTCGGTGGCGGCTGTGGAAGAAGTGGTGGTGGTTAGCAAATTTGAAGACCCTAAATGGGTTAATGGGACTTGGGATTTGAATCAGTTCCGGAAAGATGGAAGTACTGATTGGGATGCTGTTATTGATGCTGAGGCTAGGAGGAGGAAATGGCTGGAAAACAATCCAGAATCATCAAGTAATGAGGACCCTGTGCTGTTTGACACATCCATAGTTCCTTGGTGGGCTTGGATTAAGCGCTACCATCTGCCAGAGGCTGAGATTCTCAATGGCCGTGCAGCCATGGTGGGGTTTTTCATGGCTTACTTCGTCGATAGCTTGACGGGGGTAGGACTAGTAGGTCAAATGGGCAACTTCTTCTGCAAAACTTTGTTATTTGTTGCAGTGGTGGGAGTTCTTTTGATCAGAAAGAATGAAGATATAGAGACTCTAAAGAAGTTGATTGATGAGACGACATTTTATGATAAACAATGGCAAGCAACTTGGCAGGATGAAACCAAAGGCTCAGGCAAAGTGTAA

Protein sequence

MPEKSPKTLPPAKGMRKLVESCAVLREEEDNDNVRFDYQWRYSVGKREAGGSGGDLHGYKTLKKQNDNQAGGAKMGDLWAFEGGALSSSSFHTKTMSSMALFSPPSRLSSLSPSSHHRTHFPSRPFSSLRSRNPSSWVAADNGAGISGGSVAVEPAVVQKDPEPAVEKKESLAETNGSVAAVEEVVVVSKFEDPKWVNGTWDLNQFRKDGSTDWDAVIDAEARRRKWLENNPESSSNEDPVLFDTSIVPWWAWIKRYHLPEAEILNGRAAMVGFFMAYFVDSLTGVGLVGQMGNFFCKTLLFVAVVGVLLIRKNEDIETLKKLIDETTFYDKQWQATWQDETKGSGKV
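
The protein sequence follows from the CDS by codon-wise translation. As a minimal sketch, assuming the standard genetic code applies to this nuclear gene, the snippet below translates only the first and last few codons of the CDS shown above; `translate` is an illustrative helper, not part of any database tooling.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons and last six codons of the CDS listed above.
cds_start = "ATGCCGGAAAAGTCCCCGAAGACG"
cds_end = "GGCTCAGGCAAAGTGTAA"
print(translate(cds_start))  # MPEKSPKT
print(translate(cds_end))    # GSGKV
```

Both fragments agree with the start (MPEKSPKT) and end (GSGKV) of the protein sequence listed above. Passing the full CDS string to `translate` would be the way to check the whole protein.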
BLAST of CmaCh16G011040 vs. TrEMBL
Match: A0A0A0L5D8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G017160 PE=4 SV=1)

HSP 1 Score: 427.9 bits (1099), Expect = 1.1e-116
Identity = 226/267 (84.64%), Positives = 236/267 (88.39%), Query Frame = 1

Query: 96  MSSMALFSPPSRLSSLSPSSHHRTHFPSRPFSSLRSRNPSSW--------VAADNGAGIS 155
           MSSMALFSP S LS+ SPS HH THF  RPFSSLR+RNPSS           ADNGAGIS
Sbjct: 1   MSSMALFSPSSHLSTFSPS-HHTTHFSFRPFSSLRTRNPSSSSSSLFTIRATADNGAGIS 60

Query: 156 GGS--VAVEPAVVQKDPEPAV---EKKESLAETNGSVAAVEEVV-VVSKFEDPKWVNGTW 215
           GGS  V+VE  V QKDPEPA    E++ESLA TNGSVAA EEVV VVSKFEDPKWVNGTW
Sbjct: 61  GGSATVSVETPVEQKDPEPAKLAPEEQESLAGTNGSVAAAEEVVEVVSKFEDPKWVNGTW 120

Query: 216 DLNQFRKDGSTDWDAVIDAEARRRKWLENNPESSSNEDPVLFDTSIVPWWAWIKRYHLPE 275
           DLNQF+K+GSTDWDAVIDAEARRRKWLENNPESSSNEDPV+FDTSIVPWWAWIKRYHLPE
Sbjct: 121 DLNQFQKNGSTDWDAVIDAEARRRKWLENNPESSSNEDPVVFDTSIVPWWAWIKRYHLPE 180

Query: 276 AEILNGRAAMVGFFMAYFVDSLTGVGLVGQMGNFFCKTLLFVAVVGVLLIRKNEDIETLK 335
           AE+LNGRAAMVGFFMAYFVDSLTGVGLVGQMGNFFCKTLLFVAVVGVLLIRKNEDIETLK
Sbjct: 181 AELLNGRAAMVGFFMAYFVDSLTGVGLVGQMGNFFCKTLLFVAVVGVLLIRKNEDIETLK 240

Query: 336 KLIDETTFYDKQWQATWQDETKGSGKV 349
           KLIDETTFYDKQWQATWQDET GSGK+
Sbjct: 241 KLIDETTFYDKQWQATWQDETSGSGKM 266
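
The two summary lines at the top of each HSP can be parsed to recompute the reported percentages. This is a rough sketch, not a general BLAST parser: `parse_hsp_summary` is a hypothetical helper, the regular expressions are fitted only to the line layout used on this page, and both the "Positives" and "Postives" spellings are accepted.

```python
import re


def parse_hsp_summary(score_line: str, identity_line: str) -> dict:
    """Extract score, E-value, and identity/positive fractions from one HSP header."""
    score = re.search(r"Score: ([\d.]+) bits \((\d+)\), Expect = (\S+)", score_line)
    ident = re.search(
        r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)", identity_line
    )
    if score is None or ident is None:
        raise ValueError("unrecognised HSP summary lines")
    matches, length, positives, _ = (int(x) for x in ident.groups())
    return {
        "bit_score": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(score.group(3)),
        "aligned_length": length,
        # Percentages are recomputed from the counts, not copied from the text.
        "identity_pct": round(100 * matches / length, 2),
        "positives_pct": round(100 * positives / length, 2),
    }


hsp = parse_hsp_summary(
    "HSP 1 Score: 427.9 bits (1099), Expect = 1.1e-116",
    "Identity = 226/267 (84.64%), Positives = 236/267 (88.39%), Query Frame = 1",
)
print(hsp["identity_pct"], hsp["positives_pct"])
```

For the Cucumis sativus hit, the recomputed values (226/267 and 236/267) match the 84.64% and 88.39% printed in the report.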

BLAST of CmaCh16G011040 vs. TrEMBL
Match: W9SG48_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_023403 PE=4 SV=1)

HSP 1 Score: 348.2 bits (892), Expect = 1.2e-92
Identity = 179/255 (70.20%), Positives = 201/255 (78.82%), Query Frame = 1

Query: 96  MSSMALFSPPSRLSSLSPSSH-HRTHFPSRPFSSLRSRNPSSW---VAADNGAGISGGSV 155
           MSSMALFSPP+   +LSPSS   +T F  +P    R + P  +    +ADNGAG  G + 
Sbjct: 1   MSSMALFSPPTHFPTLSPSSSLSKTQFSHKPHLLFRPKYPLLFRLKASADNGAGAPGSAA 60

Query: 156 AV---EPAVVQKDPEPAVEKKESLAETNGSVAAVEEVVVVSKFEDPKWVNGTWDLNQFRK 215
           A    +P    K PEP+    ES    NG V    EV V+SKFEDPKWVNGTWDL QF+K
Sbjct: 61  ATAVEDPKAPPKVPEPS----ESSDGANGLVPPAPEVGVLSKFEDPKWVNGTWDLTQFQK 120

Query: 216 DGSTDWDAVIDAEARRRKWLENNPESSSNEDPVLFDTSIVPWWAWIKRYHLPEAEILNGR 275
           DG TDWDAVIDAEARRRKW E+NPESSSNE+P++FDTSI+PWWAWIKRYHLPEAE+LNGR
Sbjct: 121 DGKTDWDAVIDAEARRRKWFEDNPESSSNENPIVFDTSIIPWWAWIKRYHLPEAELLNGR 180

Query: 276 AAMVGFFMAYFVDSLTGVGLVGQMGNFFCKTLLFVAVVGVLLIRKNEDIETLKKLIDETT 335
           AAMVGFFMAYFVDSLTGVGLV QMGNFFCKTLLF+AV GVLLIRKNEDIET+KKL++ETT
Sbjct: 181 AAMVGFFMAYFVDSLTGVGLVDQMGNFFCKTLLFIAVAGVLLIRKNEDIETIKKLLEETT 240

Query: 336 FYDKQWQATWQDETK 344
           FYDKQWQATWQDE K
Sbjct: 241 FYDKQWQATWQDENK 251

BLAST of CmaCh16G011040 vs. TrEMBL
Match: M5WA97_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa010089mg PE=4 SV=1)

HSP 1 Score: 339.3 bits (869), Expect = 5.4e-90
Identity = 179/256 (69.92%), Positives = 203/256 (79.30%), Query Frame = 1

Query: 98  SMALFSPPSRLSSLSPSSHH-RTHFPSRPFSSLRSRNP--SSWVAADNGAG-ISGGSVAV 157
           SMALFSP + + +LSPSS + + +   +P+ SLR +NP  ++  +A+NGAG ++  + AV
Sbjct: 4   SMALFSPTTHIPTLSPSSSYSKPNLIHKPYLSLRPKNPLFTARASAENGAGALASAATAV 63

Query: 158 EPAVVQKDPEPAVE---KKESLAETNGSVAA-VEEVVVVSKFEDPKWVNGTWDLNQFRKD 217
           +     K PEP      K ES A  NGS AA  EEV VV  FED +WVNGTWDL QF K 
Sbjct: 64  KSEAEPKVPEPVEPVPVKTESSAGANGSAAAPAEEVKVVGLFEDSRWVNGTWDLKQFEKS 123

Query: 218 GSTDWDAVIDAEARRRKWLENNPESSSNEDPVLFDTSIVPWWAWIKRYHLPEAEILNGRA 277
           G TDWDAVIDAEARRRKWL++NPESSSNE+PV FDTSIVPWWAWIKRYHLPEAE+LNGRA
Sbjct: 124 GKTDWDAVIDAEARRRKWLQDNPESSSNENPVFFDTSIVPWWAWIKRYHLPEAELLNGRA 183

Query: 278 AMVGFFMAYFVDSLTGVGLVGQMGNFFCKTLLFVAVVGVLLIRKNEDIETLKKLIDETTF 337
           AMVGFFMAY VDSLTGVGLV QMGNFFCKTLLFVAV GVLLIRKNED+E LKKL+DETTF
Sbjct: 184 AMVGFFMAYLVDSLTGVGLVDQMGNFFCKTLLFVAVSGVLLIRKNEDVENLKKLLDETTF 243

Query: 338 YDKQWQATWQDETKGS 346
           YDKQWQATWQDET  S
Sbjct: 244 YDKQWQATWQDETPAS 259

BLAST of CmaCh16G011040 vs. TrEMBL
Match: A0A067KUX7_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_03488 PE=4 SV=1)

HSP 1 Score: 337.0 bits (863), Expect = 2.7e-89
Identity = 178/261 (68.20%), Positives = 202/261 (77.39%), Query Frame = 1

Query: 98  SMALFSPP---SRLSSLSPSSHHRTHFPSRPFSSLRSRNPSSWVAA-----DNGAGISGG 157
           SMALFSPP     L SLSP    + HF  +P   LR  NP  +++      DNGAG+S  
Sbjct: 4   SMALFSPPLPTHLLRSLSP----KPHFTHKPSLLLRPTNPLFFLSTPKASTDNGAGLS-- 63

Query: 158 SVAVEPAVVQKDPEPA---VEKKESLAETNGSVAAVEEVVVVSKFEDPKWVNGTWDLNQF 217
           +   EP   QK  EP+    E KES  E+NG+VA   EV + SKF DP+W+ GTWDL QF
Sbjct: 64  AAVEEPKAEQKAAEPSGSVPEAKESSLESNGAVAD-GEVKLESKFVDPRWIGGTWDLKQF 123

Query: 218 RKDGSTDWDAVIDAEARRRKWLENNPESSSNEDPVLFDTSIVPWWAWIKRYHLPEAEILN 277
           ++DG TDWDAVIDAE RRRKWLE NPESSSN+DPV+FDTSI+PWWAW+KR+HLPEAE+LN
Sbjct: 124 QRDGKTDWDAVIDAEVRRRKWLEGNPESSSNDDPVVFDTSIIPWWAWMKRFHLPEAELLN 183

Query: 278 GRAAMVGFFMAYFVDSLTGVGLVGQMGNFFCKTLLFVAVVGVLLIRKNEDIETLKKLIDE 337
           GRAAM+GFFMAYFVDSLTGVGLV QM NFFCKTLLFVAVVGVLLIRKNED+ETLKKL+DE
Sbjct: 184 GRAAMIGFFMAYFVDSLTGVGLVDQMSNFFCKTLLFVAVVGVLLIRKNEDLETLKKLLDE 243

Query: 338 TTFYDKQWQATWQDETKGSGK 348
           TTFYDKQWQATWQDET G  K
Sbjct: 244 TTFYDKQWQATWQDETPGGSK 257

BLAST of CmaCh16G011040 vs. TrEMBL
Match: B9SEE4_RICCO (Transcription factor, putative OS=Ricinus communis GN=RCOM_0703150 PE=4 SV=1)

HSP 1 Score: 328.9 bits (842), Expect = 7.2e-87
Identity = 174/257 (67.70%), Positives = 197/257 (76.65%), Query Frame = 1

Query: 98  SMALFSPPSRLSSLSPSSHHRTHFPSRPFSSLRSRNPSSWVAA-----DNGAGISGGSVA 157
           SMALFSPP      S SS  + H  ++P S L   NP    +A     DNGAGIS  + A
Sbjct: 4   SMALFSPPPTQFLRSLSS--KPHLLTKPTSFLTPINPPFLFSAPKASTDNGAGISAAAAA 63

Query: 158 VEPA--VVQKDPEPAVEKKESLAETNGSVAAVEEVVVVSKFEDPKWVNGTWDLNQFRKDG 217
           VE       K  EP+    ES   +NG+V   E V + SKF DP+W+ GTWDL QF+KDG
Sbjct: 64  VEEPKEAEPKAAEPSPAAVESSLGSNGAVKDAE-VKLESKFVDPRWIGGTWDLKQFQKDG 123

Query: 218 STDWDAVIDAEARRRKWLENNPESSSNEDPVLFDTSIVPWWAWIKRYHLPEAEILNGRAA 277
           STDWD+VIDAE RRRKWLE+NPESS+N+DPV+FDTSI+PWWAW+KR+HLPEAE+LNGRAA
Sbjct: 124 STDWDSVIDAEVRRRKWLESNPESSTNDDPVVFDTSIIPWWAWMKRFHLPEAELLNGRAA 183

Query: 278 MVGFFMAYFVDSLTGVGLVGQMGNFFCKTLLFVAVVGVLLIRKNEDIETLKKLIDETTFY 337
           MVGFFMAYFVDSLTGVGLV QMGNFFCKTLLF+AV GVLLIRKNEDIETLKKL+DETTFY
Sbjct: 184 MVGFFMAYFVDSLTGVGLVDQMGNFFCKTLLFIAVAGVLLIRKNEDIETLKKLVDETTFY 243

Query: 338 DKQWQATWQDETKGSGK 348
           DKQWQATWQDET  S K
Sbjct: 244 DKQWQATWQDETPSSSK 257

BLAST of CmaCh16G011040 vs. TAIR10
Match: AT4G17600.1 (AT4G17600.1 Chlorophyll A-B binding family protein)

HSP 1 Score: 292.4 bits (747), Expect = 3.8e-79
Identity = 159/265 (60.00%), Positives = 189/265 (71.32%), Query Frame = 1

Query: 99  MALFSPPSRLSSLSPSSHHRTHFPSRPFSSLRSRNPS----SWVAADNGAGISGGSVAVE 158
           MALFSPP   SSL     +    P   FS L S   S    +  ++D+G+     +V+VE
Sbjct: 1   MALFSPPISSSSLQ----NPNFIPKFSFSLLSSNRFSLLSVTRASSDSGSTSPTAAVSVE 60

Query: 159 PA----VVQKDPE---PAVEKKESLAETNGSVAAVEEVVVVS--KFEDPKWVNGTWDLNQ 218
                 V+ K+P    PAV+K+E+    N +V   E     S  KF+D +W+NGTWDL Q
Sbjct: 61  APEPVEVIVKEPPQSTPAVKKEETATAKNVAVEGEEMKTTESVVKFQDARWINGTWDLKQ 120

Query: 219 FRKDGSTDWDAVIDAEARRRKWLENNPESSSNEDPVLFDTSIVPWWAWIKRYHLPEAEIL 278
           F KDG TDWD+VI AEA+RRKWLE NPE++SN++PVLFDTSI+PWWAWIKRYHLPEAE+L
Sbjct: 121 FEKDGKTDWDSVIVAEAKRRKWLEENPETTSNDEPVLFDTSIIPWWAWIKRYHLPEAELL 180

Query: 279 NGRAAMVGFFMAYFVDSLTGVGLVGQMGNFFCKTLLFVAVVGVLLIRKNEDIETLKKLID 338
           NGRAAM+GFFMAYFVDSLTGVGLV QMGNFFCKTLLFVAV GVL IRKNED++ LK L D
Sbjct: 181 NGRAAMIGFFMAYFVDSLTGVGLVDQMGNFFCKTLLFVAVAGVLFIRKNEDVDKLKNLFD 240

Query: 339 ETTFYDKQWQATWQ---DETKGSGK 348
           ETT YDKQWQA W+   DE+ GS K
Sbjct: 241 ETTLYDKQWQAAWKNDDDESLGSKK 261

BLAST of CmaCh16G011040 vs. TAIR10
Match: AT5G47110.1 (AT5G47110.1 Chlorophyll A-B binding family protein)

HSP 1 Score: 271.6 bits (693), Expect = 7.0e-73
Identity = 145/251 (57.77%), Positives = 177/251 (70.52%), Query Frame = 1

Query: 98  SMALFSPPSRLSSLSPSSHHRTHFPSRPFSSLRSRNPSSWVAADNGAGISGGSVAV---- 157
           SMALFSPP   S  +P+   +        +SL S    S ++    +  +G +  V    
Sbjct: 4   SMALFSPPISSSLQNPNLIPKIS------TSLLSTKRFSLISVPRASSDNGTTSPVVEIP 63

Query: 158 EPAVVQKDPEPAVEKKESL-AETNGSV---AAVEEVVVVSKFEDPKWVNGTWDLNQFRKD 217
           +PA V  +  P     ES  A  NG+V   A       V K+++ KWVNGTWDL QF KD
Sbjct: 64  KPASVAVEEVPVKSPAESSSASENGAVGGEATDSSTETVIKYQNAKWVNGTWDLKQFEKD 123

Query: 218 GSTDWDAVIDAEARRRKWLENNPESSSNEDPVLFDTSIVPWWAWIKRYHLPEAEILNGRA 277
           G TDWD+VI +EA+RRKWLE+NPE++SN++ V+FDTSI+PWWAW+KRYHLPEAE+LNGRA
Sbjct: 124 GKTDWDSVIVSEAKRRKWLEDNPETTSNDELVVFDTSIIPWWAWMKRYHLPEAELLNGRA 183

Query: 278 AMVGFFMAYFVDSLTGVGLVGQMGNFFCKTLLFVAVVGVLLIRKNEDIETLKKLIDETTF 337
           AM+GFFMAYFVDSLTGVGLV QMGNFFCKTLLFVAV GVL IRKNED++ LK L DETT 
Sbjct: 184 AMIGFFMAYFVDSLTGVGLVDQMGNFFCKTLLFVAVAGVLFIRKNEDLDKLKDLFDETTL 243

Query: 338 YDKQWQATWQD 341
           YDKQWQA W++
Sbjct: 244 YDKQWQAAWKE 248

BLAST of CmaCh16G011040 vs. NCBI nr
Match: gi|659095806|ref|XP_008448777.1| (PREDICTED: uncharacterized protein LOC103490842 [Cucumis melo])

HSP 1 Score: 430.3 bits (1105), Expect = 3.3e-117
Identity = 228/266 (85.71%), Positives = 236/266 (88.72%), Query Frame = 1

Query: 96  MSSMALFSPPSRLSSLSPSSHHRTHFPSRPFSSLRSRNPSSW---------VAADNGAGI 155
           MSSMALFSP S LS+LSPSSHH THF  RPFSSLR+RNPSS            ADNGAGI
Sbjct: 1   MSSMALFSPSSHLSTLSPSSHHTTHFSFRPFSSLRTRNPSSSSSSSLFTIRATADNGAGI 60

Query: 156 SGGS--VAVEPAVVQKDPEPAVEKKESLAETNGSVAAVEEVV-VVSKFEDPKWVNGTWDL 215
           SGGS  VAVE  V QKDPEPA   +ESLA TNGSVAA EEVV VVSKFEDPKWVNGTWDL
Sbjct: 61  SGGSATVAVETPVEQKDPEPA---EESLAGTNGSVAAAEEVVEVVSKFEDPKWVNGTWDL 120

Query: 216 NQFRKDGSTDWDAVIDAEARRRKWLENNPESSSNEDPVLFDTSIVPWWAWIKRYHLPEAE 275
           NQF+K+GSTDWDAVIDAEARRRKWLENNPESSSNEDPV+FDTSIVPWWAWIKRYHLPEAE
Sbjct: 121 NQFQKNGSTDWDAVIDAEARRRKWLENNPESSSNEDPVVFDTSIVPWWAWIKRYHLPEAE 180

Query: 276 ILNGRAAMVGFFMAYFVDSLTGVGLVGQMGNFFCKTLLFVAVVGVLLIRKNEDIETLKKL 335
           +LNGRAAMVGFFMAYFVDSLTGVGLVGQMGNFFCKTLLFVAVVGVLLIRKNEDIETLKKL
Sbjct: 181 LLNGRAAMVGFFMAYFVDSLTGVGLVGQMGNFFCKTLLFVAVVGVLLIRKNEDIETLKKL 240

Query: 336 IDETTFYDKQWQATWQDET-KGSGKV 349
           IDETTFYDKQWQATWQDET  GSGK+
Sbjct: 241 IDETTFYDKQWQATWQDETSSGSGKM 263

BLAST of CmaCh16G011040 vs. NCBI nr
Match: gi|449459006|ref|XP_004147237.1| (PREDICTED: uncharacterized protein LOC101206464 [Cucumis sativus])

HSP 1 Score: 427.9 bits (1099), Expect = 1.6e-116
Identity = 226/267 (84.64%), Positives = 236/267 (88.39%), Query Frame = 1

Query: 96  MSSMALFSPPSRLSSLSPSSHHRTHFPSRPFSSLRSRNPSSW--------VAADNGAGIS 155
           MSSMALFSP S LS+ SPS HH THF  RPFSSLR+RNPSS           ADNGAGIS
Sbjct: 1   MSSMALFSPSSHLSTFSPS-HHTTHFSFRPFSSLRTRNPSSSSSSLFTIRATADNGAGIS 60

Query: 156 GGS--VAVEPAVVQKDPEPAV---EKKESLAETNGSVAAVEEVV-VVSKFEDPKWVNGTW 215
           GGS  V+VE  V QKDPEPA    E++ESLA TNGSVAA EEVV VVSKFEDPKWVNGTW
Sbjct: 61  GGSATVSVETPVEQKDPEPAKLAPEEQESLAGTNGSVAAAEEVVEVVSKFEDPKWVNGTW 120

Query: 216 DLNQFRKDGSTDWDAVIDAEARRRKWLENNPESSSNEDPVLFDTSIVPWWAWIKRYHLPE 275
           DLNQF+K+GSTDWDAVIDAEARRRKWLENNPESSSNEDPV+FDTSIVPWWAWIKRYHLPE
Sbjct: 121 DLNQFQKNGSTDWDAVIDAEARRRKWLENNPESSSNEDPVVFDTSIVPWWAWIKRYHLPE 180

Query: 276 AEILNGRAAMVGFFMAYFVDSLTGVGLVGQMGNFFCKTLLFVAVVGVLLIRKNEDIETLK 335
           AE+LNGRAAMVGFFMAYFVDSLTGVGLVGQMGNFFCKTLLFVAVVGVLLIRKNEDIETLK
Sbjct: 181 AELLNGRAAMVGFFMAYFVDSLTGVGLVGQMGNFFCKTLLFVAVVGVLLIRKNEDIETLK 240

Query: 336 KLIDETTFYDKQWQATWQDETKGSGKV 349
           KLIDETTFYDKQWQATWQDET GSGK+
Sbjct: 241 KLIDETTFYDKQWQATWQDETSGSGKM 266

BLAST of CmaCh16G011040 vs. NCBI nr
Match: gi|1009157292|ref|XP_015896688.1| (PREDICTED: uncharacterized protein LOC107430367 [Ziziphus jujuba])

HSP 1 Score: 364.4 bits (934), Expect = 2.2e-97
Identity = 186/258 (72.09%), Positives = 211/258 (81.78%), Query Frame = 1

Query: 97  SSMALFSPPS-RLSSLSPSSHHRTHFPSRPFSSLRSRNPS--SWVAADNGAG-ISGGSVA 156
           SSMALFSPP+  L +LSPSS  + H   +P+  LR +NP   S   +DNGAG +S  + A
Sbjct: 3   SSMALFSPPNTHLPTLSPSSFPKPHLTHKPYLFLRPKNPVFLSKATSDNGAGSLSSAATA 62

Query: 157 VEPAV---VQKDPEPAVEKKESLAETNGSVAAVEEVVVVSKFEDPKWVNGTWDLNQFRKD 216
           VEP     V +  EPA EK E+ + +NGS    +EV VVSKFEDPKWVNGTW+L QF+KD
Sbjct: 63  VEPIAEPKVSQASEPAFEKNETSSSSNGSAGTAKEVEVVSKFEDPKWVNGTWNLKQFQKD 122

Query: 217 GSTDWDAVIDAEARRRKWLENNPESSSNEDPVLFDTSIVPWWAWIKRYHLPEAEILNGRA 276
             TDWDAVIDAEARRRKWLE+NPESSSN++P++FDTSI+PWWAWIKRYHLPEAE+LNGRA
Sbjct: 123 SKTDWDAVIDAEARRRKWLESNPESSSNDEPIVFDTSIIPWWAWIKRYHLPEAELLNGRA 182

Query: 277 AMVGFFMAYFVDSLTGVGLVGQMGNFFCKTLLFVAVVGVLLIRKNEDIETLKKLIDETTF 336
           AM+GFFMAY VDSLTGVGLV QMGNFFCKTLLFVAVVGVLLIRKNEDI TLKKL+DETTF
Sbjct: 183 AMIGFFMAYLVDSLTGVGLVDQMGNFFCKTLLFVAVVGVLLIRKNEDIGTLKKLLDETTF 242

Query: 337 YDKQWQATWQDETKGSGK 348
           YDKQWQATWQDET GS K
Sbjct: 243 YDKQWQATWQDETPGSSK 260

BLAST of CmaCh16G011040 vs. NCBI nr
Match: gi|703153007|ref|XP_010110568.1| (hypothetical protein L484_023403 [Morus notabilis])

HSP 1 Score: 348.2 bits (892), Expect = 1.7e-92
Identity = 179/255 (70.20%), Positives = 201/255 (78.82%), Query Frame = 1

Query: 96  MSSMALFSPPSRLSSLSPSSH-HRTHFPSRPFSSLRSRNPSSW---VAADNGAGISGGSV 155
           MSSMALFSPP+   +LSPSS   +T F  +P    R + P  +    +ADNGAG  G + 
Sbjct: 1   MSSMALFSPPTHFPTLSPSSSLSKTQFSHKPHLLFRPKYPLLFRLKASADNGAGAPGSAA 60

Query: 156 AV---EPAVVQKDPEPAVEKKESLAETNGSVAAVEEVVVVSKFEDPKWVNGTWDLNQFRK 215
           A    +P    K PEP+    ES    NG V    EV V+SKFEDPKWVNGTWDL QF+K
Sbjct: 61  ATAVEDPKAPPKVPEPS----ESSDGANGLVPPAPEVGVLSKFEDPKWVNGTWDLTQFQK 120

Query: 216 DGSTDWDAVIDAEARRRKWLENNPESSSNEDPVLFDTSIVPWWAWIKRYHLPEAEILNGR 275
           DG TDWDAVIDAEARRRKW E+NPESSSNE+P++FDTSI+PWWAWIKRYHLPEAE+LNGR
Sbjct: 121 DGKTDWDAVIDAEARRRKWFEDNPESSSNENPIVFDTSIIPWWAWIKRYHLPEAELLNGR 180

Query: 276 AAMVGFFMAYFVDSLTGVGLVGQMGNFFCKTLLFVAVVGVLLIRKNEDIETLKKLIDETT 335
           AAMVGFFMAYFVDSLTGVGLV QMGNFFCKTLLF+AV GVLLIRKNEDIET+KKL++ETT
Sbjct: 181 AAMVGFFMAYFVDSLTGVGLVDQMGNFFCKTLLFIAVAGVLLIRKNEDIETIKKLLEETT 240

Query: 336 FYDKQWQATWQDETK 344
           FYDKQWQATWQDE K
Sbjct: 241 FYDKQWQATWQDENK 251

BLAST of CmaCh16G011040 vs. NCBI nr
Match: gi|645266178|ref|XP_008238501.1| (PREDICTED: uncharacterized protein LOC103337126 [Prunus mume])

HSP 1 Score: 341.7 bits (875), Expect = 1.6e-90
Identity = 180/256 (70.31%), Positives = 201/256 (78.52%), Query Frame = 1

Query: 98  SMALFSPPSRLSSLSPSSHHRT-HFPSRPFSSLRSRNP--SSWVAADNGAGISGGSV-AV 157
           SMALFSP + + +LSPSS +   H   +P+ SLR +NP  ++  + +NGAG  G +  AV
Sbjct: 4   SMALFSPTTHIPTLSPSSSYSNPHLTHKPYLSLRPKNPLFTTKASGENGAGALGSAATAV 63

Query: 158 EPAVVQKDPEPAVE---KKESLAETNGSVAA-VEEVVVVSKFEDPKWVNGTWDLNQFRKD 217
           +P    K PEP      K ES A  NGS AA  EEV VV  FED +WVNGTW+L QF K 
Sbjct: 64  KPEAEPKVPEPVEPVPVKTESSAGANGSAAAPAEEVKVVGLFEDSRWVNGTWNLKQFEKS 123

Query: 218 GSTDWDAVIDAEARRRKWLENNPESSSNEDPVLFDTSIVPWWAWIKRYHLPEAEILNGRA 277
           G TDWDAVIDAEARRRKWL++NPESSSNE+PV FDTSIVPWWAWIKRYHLPEAE+LNGRA
Sbjct: 124 GKTDWDAVIDAEARRRKWLQDNPESSSNENPVFFDTSIVPWWAWIKRYHLPEAELLNGRA 183

Query: 278 AMVGFFMAYFVDSLTGVGLVGQMGNFFCKTLLFVAVVGVLLIRKNEDIETLKKLIDETTF 337
           AMVGFFMAY VDSLTGVGLV QMGNFFCKTLLFVAV GVLLIRKNED+E LKKL+DETTF
Sbjct: 184 AMVGFFMAYLVDSLTGVGLVDQMGNFFCKTLLFVAVSGVLLIRKNEDVENLKKLLDETTF 243

Query: 338 YDKQWQATWQDETKGS 346
           YDKQWQATWQDET  S
Sbjct: 244 YDKQWQATWQDETPAS 259

The following BLAST results are available for this feature:

Match Name          E-value     Identity (%)   Description
A0A0A0L5D8_CUCSA    1.1e-116    84.64          Uncharacterized protein OS=Cucumis sativus GN=Csa_3G017160 PE=4 SV=1
W9SG48_9ROSA        1.2e-92     70.20          Uncharacterized protein OS=Morus notabilis GN=L484_023403 PE=4 SV=1
M5WA97_PRUPE        5.4e-90     69.92          Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa010089mg PE=4 SV=1
A0A067KUX7_JATCU    2.7e-89     68.20          Uncharacterized protein OS=Jatropha curcas GN=JCGZ_03488 PE=4 SV=1
B9SEE4_RICCO        7.2e-87     67.70          Transcription factor, putative OS=Ricinus communis GN=RCOM_0703150 PE=4 SV=1

Match Name     E-value    Identity (%)   Description
AT4G17600.1    3.8e-79    60.00          Chlorophyll A-B binding family protein
AT5G47110.1    7.0e-73    57.77          Chlorophyll A-B binding family protein

Match Name                           E-value     Identity (%)   Description
gi|659095806|ref|XP_008448777.1|     3.3e-117    85.71          PREDICTED: uncharacterized protein LOC103490842 [Cucumis melo]
gi|449459006|ref|XP_004147237.1|     1.6e-116    84.64          PREDICTED: uncharacterized protein LOC101206464 [Cucumis sativus]
gi|1009157292|ref|XP_015896688.1|    2.2e-97     72.09          PREDICTED: uncharacterized protein LOC107430367 [Ziziphus jujuba]
gi|703153007|ref|XP_010110568.1|     1.7e-92     70.20          hypothetical protein L484_023403 [Morus notabilis]
gi|645266178|ref|XP_008238501.1|     1.6e-90     70.31          PREDICTED: uncharacterized protein LOC103337126 [Prunus mume]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR023329    Chlorophyll_a/b-bd_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0008150        biological_process
cellular_component    GO:0005575        cellular_component
cellular_component    GO:0016021        integral component of membrane
molecular_function    GO:0003674        molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
CmaCh16G011040.1    CmaCh16G011040.1    mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term     IPR Description                           Source     Source Term           Source Description                  Alignment
IPR023329    Chlorophyll a/b binding protein domain    unknown    SSF103511             Chlorophyll a-b binding protein     coord: 201..297, score: 4.45
None         No IPR available                          GENE3D     G3DSA:1.20.1620.10    -                                   coord: 246..292, score: 1.
None         No IPR available                          PANTHER    PTHR14154             UPF0041 BRAIN PROTEIN 44-RELATED    coord: 187..340, score: 1.9
None         No IPR available                          PANTHER    PTHR14154:SF14        SUBFAMILY NOT NAMED                 coord: 187..340, score: 1.9

The following gene(s) are paralogous to this gene:
Gene              Paralogue         Organism                   Block
CmaCh16G011040    CmaCh06G008140    Cucurbita maxima (Rimu)    cmacmaB356