Tan0022571 (gene) Snake gourd v1

Overview
Name: Tan0022571
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Chlorophyll a-b binding protein, chloroplastic
Location: LG10: 56891961 .. 56895897 (+)
RNA-Seq Expression: Tan0022571
Synteny: Tan0022571
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CACCATTTTTGCATTCTTTCCTTCTTCACTCTCAGCTTCTCCGGCCATGGCTTCCTTGGCAGCCTCCACCGCCGCCGCCTCCCTCGGCATCTCCGAAATGCTCAGAAATCCCCTCAGCTTCTCCGGTGCCTCTGCTCGGTCCGCTCCGTCCCCTTCTAGCCCGGTGGCCTCCAAGACCGTTGCGCTTTTCGGGAAGAAGCCAGCCGCCCCCAAGCCGAAGGCCTCCGTCGTCTCTCCGATCGACGACGAGCTCGCCAAGTGGTATGGTAAGTCCAATCATCCCCTCATTCAGATCTCTTTCACTTCAATGAAAATGGTTCACTCAGATTTTGTTCTATATAATCTTTAAGACAACTTGAGACCTTACACAAATATACCATTATATATGTATTAAAAAATCAGTTAGTTAGTAAGTTAGTTACAGTTTTATCTCTCTCTGTTTTAAGATACATTATCAAACTTTAATAAATTTTATTCATAAAATTTCTCCATATTTCTAGTTATGGACCTTAGAAATAGTAATTAAGTAATTTTATATGGATTGACTAGTTAAAATTTACGCTAACAATGAACTAAATAGGTGTTTAGAGGTCATCTCCCTTGTTTATACATATTTTCTTGAGTTGATTTTGAATCGTACTTGTTGGCACTAATTAGGTCCTGACAGAAGGATTTTCTTGCCCGAGGGGCTGTTGGACCGATCCGAGATCCCCGAGTACTTGAACGGAGAAGTCCCCGGAGAGTACGACTTAATTTTCTTGTCCACTTCTTTATATTCAATTTCTTAGACCCCATTTGATAACTATTTAATTTTTGCAAAATTGAACTTATATAAACACTACTTCCACTCATGGATTTCTTTGTTTAATTTGTTATCAAATGTTTTCCGTCTAATTTTCTTAAACAAAATGAAATGGTTATCAAATGAACCTTAATTTTTCGGGTGAAAGTGTACGTGGACTCATCGACATTCTTTTTTATTTGGTTAACAACAGCTACGGTTACGACCCGTTTGGATTGAGCAAGAAACCAGAAGACTTCAGCAAGTAAATAAATCTCAATTAATTTAAAAAAAAAACAAAGACGACAAATTTGTATATGTTAAATAATATGATTTTTTTTAAATTTTTGTTTGGGCAGATACCAAGCATTTGAGTTGATTCATGCACGATGGGCGATGCTTGGAGCTGCTGGTTTCATTATCCCTGAGGCGTTCAACAAATTCGGTGCCAACTGCGGGCCTGAGGCTGTGTGGTTCAAGGTCAATGTTTGAATAATTTCTATTTGGAAAGTGAAAGTGCTTTTAAAATAAGCAGTTGAAAATCAGATCCAAGATTGTATGTATGTGAAATGTGAATTTTGTTGAATATAATTATGTAGACTGGAGCTCTGCTTCTTGATGGAAACACATTGAATTACTTTGGAAACAACATTCCCATCAACCTCATTGTCGCCGTCATTGCTGAGGTTGTTCTTGTTGGTGGTGCAGAATATTACAGAATCATCAACGGCTTGGTACGTTTTTAATCCAAATTTAATTTGTGTATGTTTGGGAATGATTTTAGAAATATTTATTTTAATATAATTGATTTATTAGAAGTGATTTTTATGTTTGGCTCTACACTTTTAATAGTGATTTTCATACAATTAAATGATTTTCAGAAAAGGCATGTTTGGAGTGATTTTAAAATAAGTGTTTTAAGCTAAGTGATTTAATTATAATTAATTTTTAAAAAAGTAGTTAGTTGTCATTTTAAAATTTGATTCAACCATTGTAAGAGTGTTTTAAATTTAATTAGAAGCATTTTTATAAAAAAATGATTAATGTTTATTTTCAAAAGAGATTTAGAATCATCAAGTTATTAAAAAAAATGTATTGCATAGAGATTTAAAAAATAGAAGGTGATAGACAAAAGGTTATACAAAGAGAAAATATAAAGGAAGAGAATTTTTTCAAAGAGAAAAAAGTTTAGAGATATGGAGAGATAAAAATTTTAGAG
GAAGGAACAAAAAAATCGGAGAGAGAATTTTAATAAAGAGATTGAAATTCAACGAAGAAATAAAGATAGAGATATTAAAAAAAATGGTAAGATGATAAAAGTTTTAGGAAGAAAAACTATAATGAAGAGAAAAAAGTTGAGAGTCAGAAAAAAAAAAGATGACGATTTTTACTCGTCACCAAAAGTAGAATTTTTTCAAAAATGAAATAATAGTAATTTTTAATTACAATTACTTGACAAAAATCATTTGGGAATCACCTTCCAAGTGATTGACCAAGAATCACTTTAAGTGATTTTGGATTCTTCAAAAATCACTTATCAATATTGTAAAACATCACACTTTTGGTAGAAAAGCGATTTTAGACATATCAAAAGTGATTTTGGTCATTCCAAAAAATCACTCCCAAACATGTCCTCAATTTCATGAATAAATTTATTATACTCACGTGATCCCATTTGATAATCATTTGGTTTTTGAAAATTGTGCTTGTTTTCTTACCATTTCTTTATAAGTTTTTATCTTTCTTAGAAAAAACATTTGTTTTTCTAGCCAAATTCCAAAAACAAAAAGAAGTTTTTGAAAAACTATTTTTTTAGGCCTCGTTTGATGACCATTTGATTTTTTGTTTTTTATTTTTGAAAATTAAGTAAATACTACTCCCCTCGACGAGTTTACTTGTTATGTTATCTACTTTTAATCTATGTTTTAAAAAAATCGAATATTTTGTAAACTAAAAAAAGTAGTTTTCAAAACTTACTTTTGTTTTTAGAATTTGGCTAAGAATTCAAATGCTTCCTTAAAGAGGGATGAAAACCATTGTAGAGAAATTAGAAGAAAACAAGATTAATTTTCAAAAACCAAAATAAAAAACAAATGGTTATCAAATGGGACCTTAGCTTTCAAATTTTTACTTGATTTTTGTGAAAACATGTATAACAAATAAATAAGAACACATGAAAACTATTGGTGGAAAAGTAGTATTTATAGGCTTAATTTTTAAAAACTAAAAACAAAAAATCAAATAGTTATCAAATGAGTGTTAAGTTTTGAGTTCAATAATAATTGAATTGAGTAGAAGATCATGGGTTGGGTTGGAAGTCTTGTAATGCTATTTTCTCTTCAATTAATATCGATAACCATTTATTGGACTTTACCCCTATTTACTTTTACTTTTAACTAGACGTCAAAATTTTTGTGGTGGATGCTAGAAAATTCGTATGGTTTCTTATAAAGTTAAGCTTTTGAGTTTAATGGTGATTAACATATATTTATAATGTTTTGTAGAACTTTGAGGACAAGCTTCACCCAGGAGGTCCATTCGACCCGTTGGGGCTCGCGGACGACCCGGACCAGGCAGCAATCTTGAAAGTGAAGGAGATCAAGAATGGAAGATTGGCCATGTTTGCAATGCTAGGGTTTTACTTCCAGGCTTACGTGACAGGAGAAGGGCCAGTGGAGAACTTGGCCAAGCATTTGAGTGATCCTTTTGGAAACAACTTGCTCACAGTCATTTCTGGAAATGCTGAAAGAGTTCCAACTCTTTAAAATGAAGAATTGCATTTTATAATTAAAGCCGCAAGTTTGTGTATTGAATGAAAATCTTGAATGCGTTTGTAAATTTTTATGGAGACTATTTGTAGCCCCTCACCACTCACCACTCTCGTGCATTTGAAAGAGATATGAACAAGAGTTAAGACCAAATTTTCTATCAACTTGTGTGTGTAAAATGTAGCTCACAAATCACAGCTTGAATTGGGATGGTAGGACCAATATCTAATATAACTCGAGTTGAATTTTCATTCAAACCTAATCTAACCCATATATTTATTTGAGTTGGGTCATCTATTGGATTGAGTTGAAGGGTACTTGTACTCAATTTAAGCTTTTGGATCGGTGGCCGTATGTCCGATCTAATATAACCCAACTCGAATTTTCG

mRNA sequence

CACCATTTTTGCATTCTTTCCTTCTTCACTCTCAGCTTCTCCGGCCATGGCTTCCTTGGCAGCCTCCACCGCCGCCGCCTCCCTCGGCATCTCCGAAATGCTCAGAAATCCCCTCAGCTTCTCCGGTGCCTCTGCTCGGTCCGCTCCGTCCCCTTCTAGCCCGGTGGCCTCCAAGACCGTTGCGCTTTTCGGGAAGAAGCCAGCCGCCCCCAAGCCGAAGGCCTCCGTCGTCTCTCCGATCGACGACGAGCTCGCCAAGTGGTATGGTCCTGACAGAAGGATTTTCTTGCCCGAGGGGCTGTTGGACCGATCCGAGATCCCCGAGTACTTGAACGGAGAAGTCCCCGGAGACTACGGTTACGACCCGTTTGGATTGAGCAAGAAACCAGAAGACTTCAGCAAATACCAAGCATTTGAGTTGATTCATGCACGATGGGCGATGCTTGGAGCTGCTGGTTTCATTATCCCTGAGGCGTTCAACAAATTCGGTGCCAACTGCGGGCCTGAGGCTGTGTGGTTCAAGACTGGAGCTCTGCTTCTTGATGGAAACACATTGAATTACTTTGGAAACAACATTCCCATCAACCTCATTGTCGCCGTCATTGCTGAGGTTGTTCTTGTTGGTGGTGCAGAATATTACAGAATCATCAACGGCTTGAACTTTGAGGACAAGCTTCACCCAGGAGGTCCATTCGACCCGTTGGGGCTCGCGGACGACCCGGACCAGGCAGCAATCTTGAAAGTGAAGGAGATCAAGAATGGAAGATTGGCCATGTTTGCAATGCTAGGGTTTTACTTCCAGGCTTACGTGACAGGAGAAGGGCCAGTGGAGAACTTGGCCAAGCATTTGAGTGATCCTTTTGGAAACAACTTGCTCACAGTCATTTCTGGAAATGCTGAAAGAGTTCCAACTCTTTAAAATGAAGAATTGCATTTTATAATTAAAGCCGCAAGTTTGTGTATTGAATGAAAATCTTGAATGCGTTTGTAAATTTTTATGGAGACTATTTGTAGCCCCTCACCACTCACCACTCTCGTGCATTTGAAAGAGATATGAACAAGAGTTAAGACCAAATTTTCTATCAACTTGTGTGTGTAAAATGTAGCTCACAAATCACAGCTTGAATTGGGATGGTAGGACCAATATCTAATATAACTCGAGTTGAATTTTCATTCAAACCTAATCTAACCCATATATTTATTTGAGTTGGGTCATCTATTGGATTGAGTTGAAGGGTACTTGTACTCAATTTAAGCTTTTGGATCGGTGGCCGTATGTCCGATCTAATATAACCCAACTCGAATTTTCG

Coding sequence (CDS)

ATGGCTTCCTTGGCAGCCTCCACCGCCGCCGCCTCCCTCGGCATCTCCGAAATGCTCAGAAATCCCCTCAGCTTCTCCGGTGCCTCTGCTCGGTCCGCTCCGTCCCCTTCTAGCCCGGTGGCCTCCAAGACCGTTGCGCTTTTCGGGAAGAAGCCAGCCGCCCCCAAGCCGAAGGCCTCCGTCGTCTCTCCGATCGACGACGAGCTCGCCAAGTGGTATGGTCCTGACAGAAGGATTTTCTTGCCCGAGGGGCTGTTGGACCGATCCGAGATCCCCGAGTACTTGAACGGAGAAGTCCCCGGAGACTACGGTTACGACCCGTTTGGATTGAGCAAGAAACCAGAAGACTTCAGCAAATACCAAGCATTTGAGTTGATTCATGCACGATGGGCGATGCTTGGAGCTGCTGGTTTCATTATCCCTGAGGCGTTCAACAAATTCGGTGCCAACTGCGGGCCTGAGGCTGTGTGGTTCAAGACTGGAGCTCTGCTTCTTGATGGAAACACATTGAATTACTTTGGAAACAACATTCCCATCAACCTCATTGTCGCCGTCATTGCTGAGGTTGTTCTTGTTGGTGGTGCAGAATATTACAGAATCATCAACGGCTTGAACTTTGAGGACAAGCTTCACCCAGGAGGTCCATTCGACCCGTTGGGGCTCGCGGACGACCCGGACCAGGCAGCAATCTTGAAAGTGAAGGAGATCAAGAATGGAAGATTGGCCATGTTTGCAATGCTAGGGTTTTACTTCCAGGCTTACGTGACAGGAGAAGGGCCAGTGGAGAACTTGGCCAAGCATTTGAGTGATCCTTTTGGAAACAACTTGCTCACAGTCATTTCTGGAAATGCTGAAAGAGTTCCAACTCTTTAA

Protein sequence

MASLAASTAAASLGISEMLRNPLSFSGASARSAPSPSSPVASKTVALFGKKPAAPKPKASVVSPIDDELAKWYGPDRRIFLPEGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPINLIVAVIAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNAERVPTL
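The protein sequence above corresponds to the translation of the CDS. A minimal Python sketch of that relationship, assuming the standard genetic code (NCBI translation table 1); the function name `translate` and the choice of test fragments are illustrative and not part of the database record:

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Walk the sequence codon by codon, ignoring any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an ambiguous codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 and last 15 nucleotides of the CDS listed above.
print(translate("ATGGCTTCCTTGGCAGCCTCC"))  # MASLAAS
print(translate("GTTCCAACTCTTTAA"))        # VPTL
```

Passing the full CDS string to `translate` should reproduce the listed protein sequence if the record is internally consistent.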
Homology
BLAST of Tan0022571 vs. ExPASy Swiss-Prot
Match: Q9XF89 (Chlorophyll a-b binding protein CP26, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=LHCB5 PE=1 SV=1)

HSP 1 Score: 475.3 bits (1222), Expect = 4.8e-133
Identity = 232/280 (82.86%), Positives = 248/280 (88.57%), Query Frame = 0

Query: 11  ASLGISEMLRNPLSFSGASARSAPSPSSPVASKTVALFGKKPAAPKPKASVVSPIDDELA 70
           ASLG+SEML  PL+F   S  SAP  SSP   KTVALF KK  AP  K+  VS   DELA
Sbjct: 2   ASLGVSEMLGTPLNFRAVSRSSAPLASSPSTFKTVALFSKKKPAP-AKSKAVSETSDELA 61

Query: 71  KWYGPDRRIFLPEGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSKYQAFELIHARW 130
           KWYGPDRRIFLP+GLLDRSEIPEYLNGEV GDYGYDPFGL KKPE+F+KYQAFELIHARW
Sbjct: 62  KWYGPDRRIFLPDGLLDRSEIPEYLNGEVAGDYGYDPFGLGKKPENFAKYQAFELIHARW 121

Query: 131 AMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPINLIVAVIAEVV 190
           AMLGAAGFIIPEA NK+GANCGPEAVWFKTGALLLDGNTLNYFG NIPINL++AV+AEVV
Sbjct: 122 AMLGAAGFIIPEALNKYGANCGPEAVWFKTGALLLDGNTLNYFGKNIPINLVLAVVAEVV 181

Query: 191 LVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFY 250
           L+GGAEYYRI NGL+FEDKLHPGGPFDPLGLA DP+Q A+LKVKEIKNGRLAMFAMLGF+
Sbjct: 182 LLGGAEYYRITNGLDFEDKLHPGGPFDPLGLAKDPEQGALLKVKEIKNGRLAMFAMLGFF 241

Query: 251 FQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNAERVPTL 291
            QAYVTGEGPVENLAKHLSDPFGNNLLTVI+G AER PTL
Sbjct: 242 IQAYVTGEGPVENLAKHLSDPFGNNLLTVIAGTAERAPTL 280
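The percentages in each HSP header are the counts of identical and positive-scoring (similar) positions divided by the alignment length. A minimal Python sketch that parses such a header line and recomputes both percentages; the function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, chosen only for illustration:

```python
import re

# Pattern for the HSP summary line. "Posi?tives" also accepts the
# variant spelling "Postives" that appears in some rendered reports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/\d+ \([\d.]+%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract counts from an HSP header and recompute the percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, length, positives = (int(g) for g in match.groups())
    return {
        "alignment_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 232/280 (82.86%), Positives = 248/280 (88.57%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])  # 82.86 88.57
```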

BLAST of Tan0022571 vs. ExPASy Swiss-Prot
Match: P12330 (Chlorophyll a-b binding protein 1, chloroplastic OS=Oryza sativa subsp. japonica OX=39947 GN=CAB1R PE=2 SV=2)

HSP 1 Score: 224.9 bits (572), Expect = 1.1e-57
Identity = 138/272 (50.74%), Positives = 167/272 (61.40%), Query Frame = 0

Query: 9   AAASLGISEMLRNPLSFSGASARSAPSPSSPVASKTVALFGKKPAAPKPKASVVSPIDDE 68
           AAA++ +S         S   AR+APS SS +  +      K  A PKP AS  SP    
Sbjct: 2   AAATMALS---------SPVMARAAPSTSSALFGEARITMRKTAAKPKPAASSGSP---- 61

Query: 69  LAKWYGPDRRIFLPEGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSKYQAFELIHA 128
              WYG DR ++L  G L   E P YL GE PGDYG+D  GLS  PE F+K +  E+IH+
Sbjct: 62  ---WYGADRVLYL--GPLS-GEPPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHS 121

Query: 129 RWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNN--IPINLIVAVI 188
           RWAMLGA G + PE   + G   G EAVWFK G+ +     L+Y GN   I    I+A+ 
Sbjct: 122 RWAMLGALGCVFPELLARNGVKFG-EAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIW 181

Query: 189 A-EVVLVGGAEYYRIINGLNFE--DKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAM 248
           A +VVL+G  E YRI  G   E  D L+PGG FDPLGLADDP+  A LKVKEIKNGRLAM
Sbjct: 182 AVQVVLMGAVEGYRIAGGPLGEVVDPLYPGGAFDPLGLADDPEAFAELKVKEIKNGRLAM 241

Query: 249 FAMLGFYFQAYVTGEGPVENLAKHLSDPFGNN 276
           F+M GF+ QA VTG+GP+ENLA HL+DP  NN
Sbjct: 242 FSMFGFFVQAIVTGKGPLENLADHLADPVNNN 253

BLAST of Tan0022571 vs. ExPASy Swiss-Prot
Match: P27517 (Chlorophyll a-b binding protein of LHCII type I, chloroplastic OS=Dunaliella tertiolecta OX=3047 PE=2 SV=1)

HSP 1 Score: 222.2 bits (565), Expect = 7.4e-57
Identity = 114/211 (54.03%), Positives = 141/211 (66.82%), Query Frame = 0

Query: 71  KWYGPDRRIFLPEGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSKYQAFELIHARW 130
           ++YGPDR  FL  G    ++ PEYL GE PGDYG+D  GLS  P+ F++Y+  ELIHARW
Sbjct: 32  EFYGPDRAKFL--GPFSENDTPEYLTGEFPGDYGWDTAGLSADPQTFARYREIELIHARW 91

Query: 131 AMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPI---NLIVAVIA 190
           A+LGA G + PE  +++      E VWFK GA +     LNY GN   I   ++I  +  
Sbjct: 92  ALLGALGILTPELLSQYAGVQFGEPVWFKAGAQIFADGGLNYLGNESLIHAQSIIATLAV 151

Query: 191 EVVLVGGAEYYRIING----LNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAM 250
           +VVL+G AE YR   G    L+  D L+PGGPFDPLGLADDPD  A LKVKEIKNGRLAM
Sbjct: 152 QVVLMGLAEAYRANGGSEGFLDDLDTLYPGGPFDPLGLADDPDTFAELKVKEIKNGRLAM 211

Query: 251 FAMLGFYFQAYVTGEGPVENLAKHLSDPFGN 275
           F+ LGF+ QA VTG+GPV+NL  HL+DP  N
Sbjct: 212 FSCLGFFVQAIVTGKGPVQNLTDHLADPTVN 240

BLAST of Tan0022571 vs. ExPASy Swiss-Prot
Match: P15193 (Chlorophyll a-b binding protein type 2 member 1A, chloroplastic OS=Pinus sylvestris OX=3349 PE=2 SV=1)

HSP 1 Score: 219.2 bits (557), Expect = 6.3e-56
Identity = 119/223 (53.36%), Positives = 146/223 (65.47%), Query Frame = 0

Query: 58  KASVVSPIDDELAKWYGPDRRIFLPEGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDF 117
           K SV + ID   + WYGPDR ++L        E P YL GE PGDYG+D  GLS  PE F
Sbjct: 51  KKSVAASID---SPWYGPDRVLYLGP---FSGEPPSYLTGEFPGDYGWDTAGLSADPETF 110

Query: 118 SKYQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNN- 177
           +K +  E+IH+RWAMLGA G + PE   + G   G EAVWFK GA +     L+Y G+  
Sbjct: 111 AKNRELEVIHSRWAMLGALGCVFPELLARNGVKFG-EAVWFKAGAQIFSEGGLDYLGSPQ 170

Query: 178 -IPINLIVAVIA-EVVLVGGAEYYRIINGLNFE--DKLHPGGPFDPLGLADDPDQAAILK 237
            I    I+A+ A +V+L+G  E YR+  G   E  D ++PGG FDPLGLADDPD  A LK
Sbjct: 171 LIHAQSILAIWACQVILMGAIEGYRVAGGPLGEVTDPIYPGGNFDPLGLADDPDAFAELK 230

Query: 238 VKEIKNGRLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNN 276
           VKEIKNGRLAMF+M GF+ QA VTG+GP+ENLA HL+DP  NN
Sbjct: 231 VKEIKNGRLAMFSMFGFFVQAIVTGKGPIENLADHLADPVNNN 266

BLAST of Tan0022571 vs. ExPASy Swiss-Prot
Match: P27497 (Chlorophyll a-b binding protein M9, chloroplastic OS=Zea mays OX=4577 GN=CAB-M9 PE=2 SV=1)

HSP 1 Score: 216.9 bits (551), Expect = 3.1e-55
Identity = 123/255 (48.24%), Positives = 158/255 (61.96%), Query Frame = 0

Query: 26  SGASARSAPSPSSPVASKTVALFGKKPAAPKPKASVVSPIDDELAKWYGPDRRIFLPEGL 85
           S   A S+ + +    +   +LFG+     +  A+   P     + WYGPDR ++L  G 
Sbjct: 3   SSTMALSSTAFAGKAVNVPSSLFGEARVTMRKTAAKAKPAASSGSPWYGPDRVLYL--GP 62

Query: 86  LDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSKYQAFELIHARWAMLGAAGFIIPEAFN 145
           L   E P YL GE PGDYG+D  GLS  PE F+K +  E+IH+RWAMLGA G + PE   
Sbjct: 63  LS-GEPPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHSRWAMLGALGCVFPELLA 122

Query: 146 KFGANCGPEAVWFKTGALLLDGNTLNYFGNN--IPINLIVAVIA-EVVLVGGAEYYRIIN 205
           + G   G EAVWFK G+ +     L+Y GN   I    I+A+ A +VVL+G  E YR+  
Sbjct: 123 RNGVKFG-EAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWACQVVLMGAIEGYRVAG 182

Query: 206 GLNFE--DKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFYFQAYVTGEGP 265
           G   E  D L+PGG FDPLGLADDP+  A +KVKE+KNGRLAMF+M GF+ QA VTG+GP
Sbjct: 183 GPLGEVVDPLYPGGTFDPLGLADDPEAFADVKVKELKNGRLAMFSMFGFFVQAIVTGKGP 242

Query: 266 VENLAKHLSDPFGNN 276
           +ENLA HL+DP  NN
Sbjct: 243 LENLADHLADPVNNN 253

BLAST of Tan0022571 vs. NCBI nr
Match: XP_038889943.1 (chlorophyll a-b binding protein CP26, chloroplastic [Benincasa hispida])

HSP 1 Score: 562.0 bits (1447), Expect = 3.0e-156
Identity = 280/291 (96.22%), Positives = 284/291 (97.59%), Query Frame = 0

Query: 1   MASLAASTAAASLGISEMLRNPLSFSGASARSAPSPSSPVASKTVALFGKKPAAP-KPKA 60
           MASLAASTAAASLG+SEMLRNPLSFSGASARSAPS SSP   KTVALFGKKPAAP KPK 
Sbjct: 1   MASLAASTAAASLGVSEMLRNPLSFSGASARSAPSASSPATFKTVALFGKKPAAPSKPKP 60

Query: 61  SVVSPIDDELAKWYGPDRRIFLPEGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSK 120
           S VSP++DELAKWYGPDRRIFLP+GLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSK
Sbjct: 61  SAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSK 120

Query: 121 YQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPI 180
           YQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPI
Sbjct: 121 YQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPI 180

Query: 181 NLIVAVIAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNG 240
           NLIVAVIAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNG
Sbjct: 181 NLIVAVIAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNG 240

Query: 241 RLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNAERVPTL 291
           RLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNAERVPTL
Sbjct: 241 RLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNAERVPTL 291

BLAST of Tan0022571 vs. NCBI nr
Match: XP_022932496.1 (chlorophyll a-b binding protein CP26, chloroplastic-like [Cucurbita moschata])

HSP 1 Score: 554.7 bits (1428), Expect = 4.8e-154
Identity = 278/291 (95.53%), Positives = 281/291 (96.56%), Query Frame = 0

Query: 1   MASLAASTAAASLGISEMLRNPLSFSGASARSAPSPSSPVASKTVALFGKKPA-APKPKA 60
           MASLAASTAAASLG+SEMLRNPLSFSGASARSA S SSPV SK VALFGKKPA APKPK 
Sbjct: 1   MASLAASTAAASLGVSEMLRNPLSFSGASARSASSASSPVTSKIVALFGKKPAPAPKPKP 60

Query: 61  SVVSPIDDELAKWYGPDRRIFLPEGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSK 120
           S VSP +DELAKWYGPDRRIFLP+GLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSK
Sbjct: 61  SAVSPANDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSK 120

Query: 121 YQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPI 180
           YQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPI
Sbjct: 121 YQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPI 180

Query: 181 NLIVAVIAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNG 240
           NLIVAV AEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNG
Sbjct: 181 NLIVAVAAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNG 240

Query: 241 RLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNAERVPTL 291
           RLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPF NNLLTVISGNAERVPTL
Sbjct: 241 RLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFANNLLTVISGNAERVPTL 291

BLAST of Tan0022571 vs. NCBI nr
Match: XP_022973438.1 (chlorophyll a-b binding protein CP26, chloroplastic [Cucurbita maxima])

HSP 1 Score: 554.7 bits (1428), Expect = 4.8e-154
Identity = 276/291 (94.85%), Positives = 282/291 (96.91%), Query Frame = 0

Query: 1   MASLAASTAAASLGISEMLRNPLSFSGASARSAPSPSSPVASKTVALFGKKP-AAPKPKA 60
           MASLAASTAAASLG+SEMLRNPLS SGASARSA S SSPV SK VA+FGKKP AAPKPK 
Sbjct: 1   MASLAASTAAASLGVSEMLRNPLSLSGASARSASSASSPVTSKIVAIFGKKPAAAPKPKP 60

Query: 61  SVVSPIDDELAKWYGPDRRIFLPEGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSK 120
           S VSP++DELAKWYGPDRRIFLP+GLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSK
Sbjct: 61  SAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSK 120

Query: 121 YQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPI 180
           YQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPI
Sbjct: 121 YQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPI 180

Query: 181 NLIVAVIAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNG 240
           NLIVAV+AEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNG
Sbjct: 181 NLIVAVVAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNG 240

Query: 241 RLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNAERVPTL 291
           RLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPF NNLLTVISGNAERVPTL
Sbjct: 241 RLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFANNLLTVISGNAERVPTL 291

BLAST of Tan0022571 vs. NCBI nr
Match: KAG7037190.1 (Chlorophyll a-b binding protein CP26, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 553.9 bits (1426), Expect = 8.2e-154
Identity = 277/291 (95.19%), Positives = 281/291 (96.56%), Query Frame = 0

Query: 1   MASLAASTAAASLGISEMLRNPLSFSGASARSAPSPSSPVASKTVALFGKKP-AAPKPKA 60
           MASLAASTAAASLG+SEMLRNPLSFSGASARSA S SSPV SK VA+FGKKP AAPKPK 
Sbjct: 1   MASLAASTAAASLGVSEMLRNPLSFSGASARSASSASSPVTSKIVAIFGKKPAAAPKPKP 60

Query: 61  SVVSPIDDELAKWYGPDRRIFLPEGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSK 120
           S VSP +DELAKWYGPDRRIFLP+GLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSK
Sbjct: 61  SAVSPANDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSK 120

Query: 121 YQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPI 180
           YQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPI
Sbjct: 121 YQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPI 180

Query: 181 NLIVAVIAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNG 240
           NLIVAV AEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNG
Sbjct: 181 NLIVAVAAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNG 240

Query: 241 RLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNAERVPTL 291
           RLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPF NNLLTVISGNAERVPTL
Sbjct: 241 RLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFANNLLTVISGNAERVPTL 291

BLAST of Tan0022571 vs. NCBI nr
Match: KAG6607551.1 (Chlorophyll a-b binding protein CP26, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 553.1 bits (1424), Expect = 1.4e-153
Identity = 277/291 (95.19%), Positives = 280/291 (96.22%), Query Frame = 0

Query: 1   MASLAASTAAASLGISEMLRNPLSFSGASARSAPSPSSPVASKTVALFGKKPA-APKPKA 60
           MASLAASTAAASLG+SEMLRNPLSFSGASARSA S SSPV SK VALFGKKPA  PKPK 
Sbjct: 1   MASLAASTAAASLGVSEMLRNPLSFSGASARSASSASSPVTSKIVALFGKKPAPTPKPKP 60

Query: 61  SVVSPIDDELAKWYGPDRRIFLPEGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSK 120
           S VSP +DELAKWYGPDRRIFLP+GLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSK
Sbjct: 61  SAVSPANDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSK 120

Query: 121 YQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPI 180
           YQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPI
Sbjct: 121 YQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPI 180

Query: 181 NLIVAVIAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNG 240
           NLIVAV AEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNG
Sbjct: 181 NLIVAVAAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNG 240

Query: 241 RLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNAERVPTL 291
           RLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPF NNLLTVISGNAERVPTL
Sbjct: 241 RLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFANNLLTVISGNAERVPTL 291

BLAST of Tan0022571 vs. ExPASy TrEMBL
Match: A0A6J1ID20 (Chlorophyll a-b binding protein, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111471983 PE=3 SV=1)

HSP 1 Score: 554.7 bits (1428), Expect = 2.3e-154
Identity = 276/291 (94.85%), Positives = 282/291 (96.91%), Query Frame = 0

Query: 1   MASLAASTAAASLGISEMLRNPLSFSGASARSAPSPSSPVASKTVALFGKKP-AAPKPKA 60
           MASLAASTAAASLG+SEMLRNPLS SGASARSA S SSPV SK VA+FGKKP AAPKPK 
Sbjct: 1   MASLAASTAAASLGVSEMLRNPLSLSGASARSASSASSPVTSKIVAIFGKKPAAAPKPKP 60

Query: 61  SVVSPIDDELAKWYGPDRRIFLPEGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSK 120
           S VSP++DELAKWYGPDRRIFLP+GLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSK
Sbjct: 61  SAVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSK 120

Query: 121 YQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPI 180
           YQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPI
Sbjct: 121 YQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPI 180

Query: 181 NLIVAVIAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNG 240
           NLIVAV+AEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNG
Sbjct: 181 NLIVAVVAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNG 240

Query: 241 RLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNAERVPTL 291
           RLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPF NNLLTVISGNAERVPTL
Sbjct: 241 RLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFANNLLTVISGNAERVPTL 291

BLAST of Tan0022571 vs. ExPASy TrEMBL
Match: A0A6J1EWI5 (Chlorophyll a-b binding protein, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111438899 PE=3 SV=1)

HSP 1 Score: 554.7 bits (1428), Expect = 2.3e-154
Identity = 278/291 (95.53%), Positives = 281/291 (96.56%), Query Frame = 0

Query: 1   MASLAASTAAASLGISEMLRNPLSFSGASARSAPSPSSPVASKTVALFGKKPA-APKPKA 60
           MASLAASTAAASLG+SEMLRNPLSFSGASARSA S SSPV SK VALFGKKPA APKPK 
Sbjct: 1   MASLAASTAAASLGVSEMLRNPLSFSGASARSASSASSPVTSKIVALFGKKPAPAPKPKP 60

Query: 61  SVVSPIDDELAKWYGPDRRIFLPEGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSK 120
           S VSP +DELAKWYGPDRRIFLP+GLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSK
Sbjct: 61  SAVSPANDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSK 120

Query: 121 YQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPI 180
           YQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPI
Sbjct: 121 YQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPI 180

Query: 181 NLIVAVIAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNG 240
           NLIVAV AEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNG
Sbjct: 181 NLIVAVAAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNG 240

Query: 241 RLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNAERVPTL 291
           RLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPF NNLLTVISGNAERVPTL
Sbjct: 241 RLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFANNLLTVISGNAERVPTL 291

BLAST of Tan0022571 vs. ExPASy TrEMBL
Match: A0A0A0KXR5 (Chlorophyll a-b binding protein, chloroplastic OS=Cucumis sativus OX=3659 GN=Csa_5G646700 PE=3 SV=1)

HSP 1 Score: 550.4 bits (1417), Expect = 4.4e-153
Identity = 274/291 (94.16%), Positives = 281/291 (96.56%), Query Frame = 0

Query: 1   MASLAASTAAASLGISEMLRNPLSFSGASARSAPSPSSPVASKTVALFGKKPAAP-KPKA 60
           MASLAASTAAASLG+SEMLRNPLSF   S+RSAPSPS+P   KTVALFGKKPAAP KPK 
Sbjct: 1   MASLAASTAAASLGVSEMLRNPLSF---SSRSAPSPSTPATFKTVALFGKKPAAPAKPKP 60

Query: 61  SVVSPIDDELAKWYGPDRRIFLPEGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSK 120
           S  SP++DELAKWYGPDRRIFLP+GLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDF+K
Sbjct: 61  SAASPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFTK 120

Query: 121 YQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPI 180
           YQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPI
Sbjct: 121 YQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPI 180

Query: 181 NLIVAVIAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNG 240
           NLIVAVIAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNG
Sbjct: 181 NLIVAVIAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNG 240

Query: 241 RLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNAERVPTL 291
           RLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNAERVPTL
Sbjct: 241 RLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNAERVPTL 288

BLAST of Tan0022571 vs. ExPASy TrEMBL
Match: A0A6J1DTI7 (Chlorophyll a-b binding protein, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111022982 PE=3 SV=1)

HSP 1 Score: 547.4 bits (1409), Expect = 3.7e-152
Identity = 271/290 (93.45%), Positives = 277/290 (95.52%), Query Frame = 0

Query: 1   MASLAASTAAASLGISEMLRNPLSFSGASARSAPSPSSPVASKTVALFGKKPAAPKPKAS 60
           MASLAASTAAASLG+SEMLRNPLSF G S RSAPS SS V  KTVALFGKKPAA    ++
Sbjct: 1   MASLAASTAAASLGVSEMLRNPLSFGGGSPRSAPSASSSVTVKTVALFGKKPAAKAKPSA 60

Query: 61  VVSPIDDELAKWYGPDRRIFLPEGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSKY 120
            VSP++DELAKWYGPDRRIFLP+GLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSKY
Sbjct: 61  AVSPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSKY 120

Query: 121 QAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPIN 180
           QAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPIN
Sbjct: 121 QAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPIN 180

Query: 181 LIVAVIAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGR 240
           LIVAVIAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGR
Sbjct: 181 LIVAVIAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGR 240

Query: 241 LAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNAERVPTL 291
           LAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGN ERVPTL
Sbjct: 241 LAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNVERVPTL 290

BLAST of Tan0022571 vs. ExPASy TrEMBL
Match: A0A1S3CBI9 (Chlorophyll a-b binding protein, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103498611 PE=3 SV=1)

HSP 1 Score: 544.3 bits (1401), Expect = 3.1e-151
Identity = 273/291 (93.81%), Positives = 278/291 (95.53%), Query Frame = 0

Query: 1   MASLAASTAAASLGISEMLRNPLSFSGASARSAPSPSSPVASKTVALFGKKPAAP-KPKA 60
           MASLAASTAAASLG+SEMLRNPLSFS  SA   PS SSP   KTVALFGKKPAAP KPK 
Sbjct: 1   MASLAASTAAASLGVSEMLRNPLSFSSRSA--PPSASSPATFKTVALFGKKPAAPSKPKP 60

Query: 61  SVVSPIDDELAKWYGPDRRIFLPEGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSK 120
           +  SP++DELAKWYGPDRRIFLP+GLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSK
Sbjct: 61  APASPVNDELAKWYGPDRRIFLPDGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSK 120

Query: 121 YQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPI 180
           YQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPI
Sbjct: 121 YQAFELIHARWAMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPI 180

Query: 181 NLIVAVIAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNG 240
           NLIVAVIAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNG
Sbjct: 181 NLIVAVIAEVVLVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNG 240

Query: 241 RLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNAERVPTL 291
           RLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNAERVPTL
Sbjct: 241 RLAMFAMLGFYFQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNAERVPTL 289

BLAST of Tan0022571 vs. TAIR 10
Match: AT4G10340.1 (light harvesting complex of photosystem II 5)

HSP 1 Score: 475.3 bits (1222), Expect = 3.4e-134
Identity = 232/280 (82.86%), Positives = 248/280 (88.57%), Query Frame = 0

Query: 11  ASLGISEMLRNPLSFSGASARSAPSPSSPVASKTVALFGKKPAAPKPKASVVSPIDDELA 70
           ASLG+SEML  PL+F   S  SAP  SSP   KTVALF KK  AP  K+  VS   DELA
Sbjct: 2   ASLGVSEMLGTPLNFRAVSRSSAPLASSPSTFKTVALFSKKKPAP-AKSKAVSETSDELA 61

Query: 71  KWYGPDRRIFLPEGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSKYQAFELIHARW 130
           KWYGPDRRIFLP+GLLDRSEIPEYLNGEV GDYGYDPFGL KKPE+F+KYQAFELIHARW
Sbjct: 62  KWYGPDRRIFLPDGLLDRSEIPEYLNGEVAGDYGYDPFGLGKKPENFAKYQAFELIHARW 121

Query: 131 AMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNNIPINLIVAVIAEVV 190
           AMLGAAGFIIPEA NK+GANCGPEAVWFKTGALLLDGNTLNYFG NIPINL++AV+AEVV
Sbjct: 122 AMLGAAGFIIPEALNKYGANCGPEAVWFKTGALLLDGNTLNYFGKNIPINLVLAVVAEVV 181

Query: 191 LVGGAEYYRIINGLNFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAMLGFY 250
           L+GGAEYYRI NGL+FEDKLHPGGPFDPLGLA DP+Q A+LKVKEIKNGRLAMFAMLGF+
Sbjct: 182 LLGGAEYYRITNGLDFEDKLHPGGPFDPLGLAKDPEQGALLKVKEIKNGRLAMFAMLGFF 241

Query: 251 FQAYVTGEGPVENLAKHLSDPFGNNLLTVISGNAERVPTL 291
            QAYVTGEGPVENLAKHLSDPFGNNLLTVI+G AER PTL
Sbjct: 242 IQAYVTGEGPVENLAKHLSDPFGNNLLTVIAGTAERAPTL 280

BLAST of Tan0022571 vs. TAIR 10
Match: AT1G29930.1 (chlorophyll A/B binding protein 1)

HSP 1 Score: 209.5 bits (532), Expect = 3.5e-54
Identity = 123/269 (45.72%), Positives = 161/269 (59.85%), Query Frame = 0

Query: 13  LGISEMLRNPLSFSGASARSAPSPSSPVASKTVALFGKKPAAPKPKASVVSPIDDELAKW 72
           +  S M  +  +F+G + + +P+ S  + S  V +   +    KPK    SP       W
Sbjct: 1   MAASTMALSSPAFAGKAVKLSPAASEVLGSGRVTM---RKTVAKPKGPSGSP-------W 60

Query: 73  YGPDRRIFLPEGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSKYQAFELIHARWAM 132
           YG DR  +L        E P YL GE PGDYG+D  GLS  PE F++ +  E+IH+RWAM
Sbjct: 61  YGSDRVKYLGP---FSGESPSYLTGEFPGDYGWDTAGLSADPETFARNRELEVIHSRWAM 120

Query: 133 LGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNN--IPINLIVAVIA-EV 192
           LGA G + PE   + G   G EAVWFK G+ +     L+Y GN   +    I+A+ A +V
Sbjct: 121 LGALGCVFPELLARNGVKFG-EAVWFKAGSQIFSDGGLDYLGNPSLVHAQSILAIWATQV 180

Query: 193 VLVGGAEYYRII-NGL--NFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAM 252
           +L+G  E YR+  NG     ED L+PGG FDPLGLA DP+  A LKVKE+KNGRLAMF+M
Sbjct: 181 ILMGAVEGYRVAGNGPLGEAEDLLYPGGSFDPLGLATDPEAFAELKVKELKNGRLAMFSM 240

Query: 253 LGFYFQAYVTGEGPVENLAKHLSDPFGNN 276
            GF+ QA VTG+GP+ENLA HL+DP  NN
Sbjct: 241 FGFFVQAIVTGKGPIENLADHLADPVNNN 255

BLAST of Tan0022571 vs. TAIR 10
Match: AT1G29910.1 (chlorophyll A/B binding protein 3)

HSP 1 Score: 208.8 bits (530), Expect = 6.0e-54
Identity = 123/269 (45.72%), Positives = 160/269 (59.48%), Query Frame = 0

Query: 13  LGISEMLRNPLSFSGASARSAPSPSSPVASKTVALFGKKPAAPKPKASVVSPIDDELAKW 72
           +  S M  +  +F+G +   +P+ S  + S  V +   +    KPK    SP       W
Sbjct: 1   MAASTMALSSPAFAGKAVNLSPAASEVLGSGRVTM---RKTVAKPKGPSGSP-------W 60

Query: 73  YGPDRRIFLPEGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSKYQAFELIHARWAM 132
           YG DR  +L        E P YL GE PGDYG+D  GLS  PE F++ +  E+IH+RWAM
Sbjct: 61  YGSDRVKYLGP---FSGESPSYLTGEFPGDYGWDTAGLSADPETFARNRELEVIHSRWAM 120

Query: 133 LGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNN--IPINLIVAVIA-EV 192
           LGA G + PE   + G   G EAVWFK G+ +     L+Y GN   +    I+A+ A +V
Sbjct: 121 LGALGCVFPELLARNGVKFG-EAVWFKAGSQIFSDGGLDYLGNPSLVHAQSILAIWATQV 180

Query: 193 VLVGGAEYYRII-NGL--NFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAM 252
           +L+G  E YR+  NG     ED L+PGG FDPLGLA DP+  A LKVKE+KNGRLAMF+M
Sbjct: 181 ILMGAVEGYRVAGNGPLGEAEDLLYPGGSFDPLGLATDPEAFAELKVKELKNGRLAMFSM 240

Query: 253 LGFYFQAYVTGEGPVENLAKHLSDPFGNN 276
            GF+ QA VTG+GP+ENLA HL+DP  NN
Sbjct: 241 FGFFVQAIVTGKGPIENLADHLADPVNNN 255

BLAST of Tan0022571 vs. TAIR 10
Match: AT1G29920.1 (chlorophyll A/B-binding protein 2)

HSP 1 Score: 208.8 bits (530), Expect = 6.0e-54
Identity = 123/269 (45.72%), Positives = 160/269 (59.48%), Query Frame = 0

Query: 13  LGISEMLRNPLSFSGASARSAPSPSSPVASKTVALFGKKPAAPKPKASVVSPIDDELAKW 72
           +  S M  +  +F+G +   +P+ S  + S  V +   +    KPK    SP       W
Sbjct: 1   MAASTMALSSPAFAGKAVNLSPAASEVLGSGRVTM---RKTVAKPKGPSGSP-------W 60

Query: 73  YGPDRRIFLPEGLLDRSEIPEYLNGEVPGDYGYDPFGLSKKPEDFSKYQAFELIHARWAM 132
           YG DR  +L        E P YL GE PGDYG+D  GLS  PE F++ +  E+IH+RWAM
Sbjct: 61  YGSDRVKYLGP---FSGESPSYLTGEFPGDYGWDTAGLSADPETFARNRELEVIHSRWAM 120

Query: 133 LGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNN--IPINLIVAVIA-EV 192
           LGA G + PE   + G   G EAVWFK G+ +     L+Y GN   +    I+A+ A +V
Sbjct: 121 LGALGCVFPELLARNGVKFG-EAVWFKAGSQIFSDGGLDYLGNPSLVHAQSILAIWATQV 180

Query: 193 VLVGGAEYYRII-NGL--NFEDKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFAM 252
           +L+G  E YR+  NG     ED L+PGG FDPLGLA DP+  A LKVKE+KNGRLAMF+M
Sbjct: 181 ILMGAVEGYRVAGNGPLGEAEDLLYPGGSFDPLGLATDPEAFAELKVKELKNGRLAMFSM 240

Query: 253 LGFYFQAYVTGEGPVENLAKHLSDPFGNN 276
            GF+ QA VTG+GP+ENLA HL+DP  NN
Sbjct: 241 FGFFVQAIVTGKGPIENLADHLADPVNNN 255

BLAST of Tan0022571 vs. TAIR 10
Match: AT3G27690.1 (photosystem II light harvesting complex gene 2.3)

HSP 1 Score: 204.9 bits (520), Expect = 8.7e-53
Identity = 113/210 (53.81%), Positives = 138/210 (65.71%), Query Frame = 0

Query: 72  WYGPDRRIFLPEGLLDRSE-IPEYLNGEVPGDYGYDPFGLSKKPEDFSKYQAFELIHARW 131
           WYGPDR    P+ L   SE  P YL GE PGDYG+D  GLS  PE F+K +  E+IH+RW
Sbjct: 50  WYGPDR----PKYLGPFSENTPSYLTGEYPGDYGWDTAGLSADPETFAKNRELEVIHSRW 109

Query: 132 AMLGAAGFIIPEAFNKFGANCGPEAVWFKTGALLLDGNTLNYFGNN--IPINLIVAVIA- 191
           AMLGA G   PE  +K G   G EAVWFK G+ +     L+Y GN   I    I+A+ A 
Sbjct: 110 AMLGALGCTFPEILSKNGVKFG-EAVWFKAGSQIFSEGGLDYLGNPNLIHAQSILAIWAC 169

Query: 192 EVVLVGGAEYYRIINGLNFE--DKLHPGGPFDPLGLADDPDQAAILKVKEIKNGRLAMFA 251
           +VVL+G  E YRI  G   E  D L+PGG FDPL LA+DP+  + LKVKE+KNGRLAMF+
Sbjct: 170 QVVLMGFIEGYRIGGGPLGEGLDPLYPGGAFDPLNLAEDPEAFSELKVKELKNGRLAMFS 229

Query: 252 MLGFYFQAYVTGEGPVENLAKHLSDPFGNN 276
           M GF+ QA VTG+GP+ENL  H++DP  NN
Sbjct: 230 MFGFFVQAIVTGKGPIENLFDHIADPVANN 254
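
The identity and positives percentages in each HSP summary line are the match counts divided by the alignment length. A minimal Python sketch, assuming the summary-line layout shown in this record; the helper name `hsp_percentages` is hypothetical and not part of any BLAST tool:

```python
import re

# Matches the counts in an HSP summary line. "Posi?tives" also tolerates
# the "Postives" spelling that appears in some exports of this page.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, pos, pos_len = (int(g) for g in match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


# Summary line of the AT3G27690.1 hit from this record.
line = "Identity = 113/210 (53.81%), Positives = 138/210 (65.71%), Query Frame = 0"
print(hsp_percentages(line))
```

Recomputing the values this way is a quick consistency check between the alignment blocks and the summary tables.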

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q9XF89 | 4.8e-133 | 82.86 | Chlorophyll a-b binding protein CP26, chloroplastic OS=Arabidopsis thaliana OX=3...
P12330 | 1.1e-57 | 50.74 | Chlorophyll a-b binding protein 1, chloroplastic OS=Oryza sativa subsp. japonica...
P27517 | 7.4e-57 | 54.03 | Chlorophyll a-b binding protein of LHCII type I, chloroplastic OS=Dunaliella ter...
P15193 | 6.3e-56 | 53.36 | Chlorophyll a-b binding protein type 2 member 1A, chloroplastic OS=Pinus sylvest...
P27497 | 3.1e-55 | 48.24 | Chlorophyll a-b binding protein M9, chloroplastic OS=Zea mays OX=4577 GN=CAB-M9 ...

Match Name | E-value | Identity | Description
XP_038889943.1 | 3.0e-156 | 96.22 | chlorophyll a-b binding protein CP26, chloroplastic [Benincasa hispida]
XP_022932496.1 | 4.8e-154 | 95.53 | chlorophyll a-b binding protein CP26, chloroplastic-like [Cucurbita moschata]
XP_022973438.1 | 4.8e-154 | 94.85 | chlorophyll a-b binding protein CP26, chloroplastic [Cucurbita maxima]
KAG7037190.1 | 8.2e-154 | 95.19 | Chlorophyll a-b binding protein CP26, chloroplastic, partial [Cucurbita argyrosp...
KAG6607551.1 | 1.4e-153 | 95.19 | Chlorophyll a-b binding protein CP26, chloroplastic, partial [Cucurbita argyrosp...

Match Name | E-value | Identity | Description
A0A6J1ID20 | 2.3e-154 | 94.85 | Chlorophyll a-b binding protein, chloroplastic OS=Cucurbita maxima OX=3661 GN=LO...
A0A6J1EWI5 | 2.3e-154 | 95.53 | Chlorophyll a-b binding protein, chloroplastic OS=Cucurbita moschata OX=3662 GN=...
A0A0A0KXR5 | 4.4e-153 | 94.16 | Chlorophyll a-b binding protein, chloroplastic OS=Cucumis sativus OX=3659 GN=Csa...
A0A6J1DTI7 | 3.7e-152 | 93.45 | Chlorophyll a-b binding protein, chloroplastic OS=Momordica charantia OX=3673 GN...
A0A1S3CBI9 | 3.1e-151 | 93.81 | Chlorophyll a-b binding protein, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103...

Match Name | E-value | Identity | Description
AT4G10340.1 | 3.4e-134 | 82.86 | light harvesting complex of photosystem II 5
AT1G29930.1 | 3.5e-54 | 45.72 | chlorophyll A/B binding protein 1
AT1G29910.1 | 6.0e-54 | 45.72 | chlorophyll A/B binding protein 3
AT1G29920.1 | 6.0e-54 | 45.72 | chlorophyll A/B-binding protein 2
AT3G27690.1 | 8.7e-53 | 53.81 | photosystem II light harvesting complex gene 2.3

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR022796 | Chlorophyll A-B binding protein | PFAM | PF00504 | Chloroa_b-bind | coord: 92..253; e-value: 6.2E-46; score: 156.9
IPR023329 | Chlorophyll a/b binding domain superfamily | GENE3D | 1.10.3460.10 | Chlorophyll a/b binding protein domain | coord: 79..290; e-value: 1.4E-89; score: 301.3
None | No IPR available | PANTHER | PTHR21649:SF104 | CHLOROPHYLL A-B BINDING PROTEIN, CHLOROPLASTIC | coord: 1..290
None | No IPR available | SUPERFAMILY | 103511 | Chlorophyll a-b binding protein | coord: 71..280
IPR001344 | Chlorophyll A-B binding protein, plant and chromista | PANTHER | PTHR21649 | CHLOROPHYLL A/B BINDING PROTEIN | coord: 1..290

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Tan0022571.1 | Tan0022571.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0009768 | photosynthesis, light harvesting in photosystem I
biological_process | GO:0018298 | protein-chromophore linkage
biological_process | GO:0009416 | response to light stimulus
biological_process | GO:0009765 | photosynthesis, light harvesting
cellular_component | GO:0009535 | chloroplast thylakoid membrane
cellular_component | GO:0009522 | photosystem I
cellular_component | GO:0009523 | photosystem II
cellular_component | GO:0016020 | membrane
molecular_function | GO:0016168 | chlorophyll binding
molecular_function | GO:0046872 | metal ion binding
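
For downstream use it is often handy to group the GO assignments by category. A minimal Python sketch, assuming the rows above have already been split into (category, accession, term name) tuples; the names `GO_ROWS` and `group_by_category` are hypothetical:

```python
from collections import defaultdict

# GO assignments copied from this record as (category, accession, term name).
GO_ROWS = [
    ("biological_process", "GO:0009768", "photosynthesis, light harvesting in photosystem I"),
    ("biological_process", "GO:0018298", "protein-chromophore linkage"),
    ("biological_process", "GO:0009416", "response to light stimulus"),
    ("biological_process", "GO:0009765", "photosynthesis, light harvesting"),
    ("cellular_component", "GO:0009535", "chloroplast thylakoid membrane"),
    ("cellular_component", "GO:0009522", "photosystem I"),
    ("cellular_component", "GO:0009523", "photosystem II"),
    ("cellular_component", "GO:0016020", "membrane"),
    ("molecular_function", "GO:0016168", "chlorophyll binding"),
    ("molecular_function", "GO:0046872", "metal ion binding"),
]


def group_by_category(rows):
    """Map each GO category to the list of accessions assigned under it."""
    grouped = defaultdict(list)
    for category, accession, _name in rows:
        grouped[category].append(accession)
    return dict(grouped)


for category, accessions in group_by_category(GO_ROWS).items():
    print(category, len(accessions), ", ".join(accessions))
```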