Cp4.1LG01g15310 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g15310
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Carbonic anhydrase
Location: Cp4.1LG01 : 9162799 .. 9167006 (-)
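The location value can be split into a sequence name, start, end, and strand. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome feature pages, but not stated in this record); the function name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Parse a location such as 'Cp4.1LG01 : 9162799 .. 9167006 (-)'.

    Assumption: start and end are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG01 : 9162799 .. 9167006 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 4208 bases on the minus strand of Cp4.1LG01.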
The following sequences are available for this feature:

Gene sequence (with intron)

ATTGATCTTTCTTCAACAGCTTCTATAGCTTTTGCTTCTTCTTTAATCTCCCTTCCCAACAACATTAACGATCCCCAGATTTTCCCACACAATTTCAAAGGTTCTTTTTAATTTTTGGATTTTTGGATTTTTTTTTTTAAATTTTAATTTTATGAAATGATCATCTGGATCCACCGACGTCGTTTTGATTAATTTTTATGAACAGGAAGACATGGCGGAAGAATCGTACGAGGAAGCGATTGCCGGACTGACAAAGCTTCTCAGGTAACGGATCCATCTCCAATTCCGTTTGTTTTCTGTTTTTTGTTTTTTCCGCCGCCGCTAATTCAGGACCGACCCGAATTCATTTCCGTTTTATATGTGCGTGGTGTAGTGAGAAAGCCAATCTTCAGGAGGCCGCCGCCGTAAGGATCCGGCAAATAACAGCCGAGTTGGCCGGGTCCAACGCCGAGTCTGATGGGTTCGACGCTGTGGATAGGATCAAAACCGGGTTCACCCATTTCAAGAAATCCAAATTCGAGTACGTTTTTTTTTTTTTAATGGAATTTTACATTAAATAATAATTAATAAATAATAATAACGGGGGTTGTTGTTGTTGTTGTTTTGGTTTCCCAGGACGAATCCTGATCTGTTTGGTCAACTAGCGAAAGGGCAGAGCCCGAAGGTAAGGATTTGTAATTTCAGAAAGAATCTGGGGGATAAATGTAAATAATAATTTCTTGAGGGGCTATGATAAGGGCACCGCCGCAATTATTGTGCGGGGTTAGATAAGGTGTTGAATGAAAAGCCAAAAAAGAAAAGCAGCCACTAAGAGATATCCACGTGTAAATTGCAGTTTTTGGTATTTGCGTGTTCGGATTCCCGAGTTTGCCCTTCGCACATACTGAATTTCCAACCTGGGGAGGCCTTTGTGGTCCGAAACATCGCCAGCATGGTCCCACCATTTGACACGGTTTGCCCTCTCACTCTCCATTTATTTCCTTTAAACATTTTAAATATATTATTATTTTTTTTCGCCCTCGTTCTTATCCAATTATTATTATTATTAGTAGTATTATATATAATTAATTTACAACTTATCATTTTTTTTTTTTTAATTTGGTTGTAGACTAAATATTCTGGGACGGGGGCTGCAATTGAATATGCTATCTTGCATCTCAAGGTGGGTAATATATACTTATTATATAATATATGTTAATAAATATTTTAATGATCTCCCATTTTCTTGTATTCTCAATCATGATTTTCATTAAAAGTTACGAATTTAATTAATTTTAGGTGGAGAATATAGTGGTGATTGGACATAGCTGCTGCGGTGGCATTAAGGGCCTCATGTCCATCCCAACCGATGGAACTCTTTCCAGGCAATTTCTCTTTCTTTCTCTCATGTTCTATTCAATAATTTTTTTATATTTATTGATTTAGATCTATGTGATAGTTTTAAAAATTTGGTATAAAAATTATTATTTCCTACTAGTTTTAGATGATTTAAAAAATACTTTAATAATTTTTAAAAAATGAAATTTAAAAAGTGTCATCAATACCCTTCCATTAAAAAAAAAATAATGAAAAATACCTTTTATTATTAGTTTTTTTTTTTTTTTTTTGAAATGCCCATAAAATTTCAACATCAGCATTCACTTTTATAAAAAAAAAAAAAAAAATTCTCTGCGGACACTTCTAGAAATGAAATTTCAAAGATATTTTTTAACCATTTGAAAATAAAAATTATTAGAACTTTTAAAAGTTTAAAAGTATTTAAAAAATAAAGTACTTTTTTTATAATTAGTTTAAAGTTTTGATTAAAAGTTAGTTTTATTTTTGACCCACTTGTAAAAGAACAAATTTTTTTTGACCCACTTGTAAAAGAACATAAATATAAAAAGGGTAGATGGGATTAGGATTAAGGTGAGTTCATCCCATGTGATGGTGGGTAAAGTAAGGGGGTAGTTTAGGGATGGCATTTAATGGGAGTGGTGGGTGAATCACAGAGGAACTCT
CTTTACCAATAAAAGCCCTTTAAGCTTTCCACTCTGGTGGTACCATATTCTTGGAGGAAGCTCTGTCTTTATTTATATATCTCTAAAGTTGTCAGAAAAGTTGTGATGAAGGGACATTGTTTACAAATTGCTACTCAAAATTCTGCATTTTACTGTTCACTGCTTTATGATAAGTACATCCTTTTGTCCTTACTTTACAGATCACCCTTTTCCCCCTTTCAACTTTGTGGGAATCCCTCCATTTGGATCCCACTTTTCCCTATCTCTCTCTGCCTTGATAAGAGAAGTGAAGTTACCGCTCTTTTCCAAGATCCTACATTCTAAAACCCATTTTTTTTTTTTACCTCTCTCTGCTCTCTGTTCCTGCAAAAACTTGCCTCTTTCTAAAGTTTACCATGAAAGAACAAAGAATCTAAGGTTGATTGATTGATTCCAAGAGGAGGATGTGGCTTGGTGGGTGTTTGTTTAAGCAAAAAAGTAACACAGTTCCTTTTTAACTTACAGTGAGTTCATCGAGAACTGGGTGAAAATATGCACTCCTGCCAAAGCTAAGGTCCAATTAGGTTTCACCGACTTGAGCTTTGAAGACAAATGCACAAACTGCGAGAAGGTAAAATAACAAGTCAATTCAAGAAACAAACACAAACCATTCGTTGTATGTAATAACAATCATTGATACCTTGCAGGAAGCTGTGAATGTTTCCCTTGGAAACTTGCTGACATACCCTTTTGTAAGAGAAGCTGTCATCAACAAGCAACTGTTCATAAGGGGTGCTCATTACAACTTTGTCTCTGGAGATTTTGAGCTGTGGAATCTTGACTTTAATATTTCTCCTTCTCTAGTTGTCTGAAAATACCATACAATACCTCCTGGTCGACGACGATAATCAAACTCGGTCGACCAATTATTTTTTAGTTTCTTCCCTTTCAGCAATGTGTCTGTATTCTGCTGCTGTCATAGTATATGAGGCTAAGGAAGCACTTATTTATACTTGCAAGGATGATCTGCGCAACTATCTTCCCCTCCAAAATAAGAGTTTTTTCTTCATTTTCCTTTTGTTTCTCTTTGTCTTGGAAGTGTTTCCTGTAATGGGGATATCCCCATCACTTTTGTAATATTACAGACAATAACCATAATGAAATGGCTTGCTTCATTCTCTTTTAACGGTGCAATAAGATCCTTTTAATGTCAATCCATGCTGAGATACTTCCAATAGTTACAGAGCCCATTTCAGAGTAAAGAAGGGAAAGGTTTCTCAAAGATCCCACAGATTAGTACCCTCATGGGATTGGATGTATGACTACTTTTCCAGTGGCTCGGCTGCTTTCAACATATGAGAAAGCCTCTGCGACCTTAGAGAATGGAAAAGGTCCTTTTGGGTCCACGATCGGCTTCACCTTCCCGCTTTCCAGGAATGGGTTCAGTTTCTTCAGAACAGCTCCATCTGAAGTGACCACAAATCTGAAACCAGGCTCTGTCACTGCGCCTGTCAGCGCCACGACGCTGCCACCTTCTTTCACTACTTTCACCGCCTTATCACACTGCCCTGTCATTTGAGAAACAGTCTAAGAACAACCTAATATCAATGGTTAAATGACTCAAATCCTACAAATCCTCACCAACTGCATCGAAGACTACATCAAATTTTTCGGCCAAATCTTCGATGTTCTCCTTGGTATAATCAATGGCTAAATCCACACCCAAACTTTTCAAGAACTCTAACTTTCCAGTGCTCGAAGTGGCTGCAACTTTTGAAGCTCCAAATACATGTTTTGCTATCTGTAGCAGAAACAAGTCCCAGATTTCCGTTCCATTAAGCAAAACAGCAGAGATAACTTGCGGAGATAGTTAATTATTCATGCGAAAGTGAGAGAAATATAACCTGAATGACTAGGCTTCCAACTCCACCAGCACCGTTGAGAACGAGAATTGATTTGCCAGCAGAAAAGTTGGTTCTTACCAGACCTTCGTAGGCTGTTTCAATAGCCAGAGGCAACCC
AGCGGCCTGAACAAAATCAATATTCTTGGGTTTTAAGGCTAATAATTTTTCCTCTACAGCAGTGTACTCTGCAAGAGAACCGAACTGTCTGGGGCCATCGAGGGCTTTCTCGTTGATGTTGCCATATACTTCGTCCCCTTCTTTTAGCTCCTTTACCTGACTTCCCACCTTCACCACCACACCTGCTACATCATATCCTGGAACTGTC

mRNA sequence

ATTGATCTTTCTTCAACAGCTTCTATAGCTTTTGCTTCTTCTTTAATCTCCCTTCCCAACAACATTAACGATCCCCAGATTTTCCCACACAATTTCAAAGGAAGACATGGCGGAAGAATCGTACGAGGAAGCGATTGCCGGACTGACAAAGCTTCTCAGTGAGAAAGCCAATCTTCAGGAGGCCGCCGCCGTAAGGATCCGGCAAATAACAGCCGAGTTGGCCGGGTCCAACGCCGAGTCTGATGGGTTCGACGCTGTGGATAGGATCAAAACCGGGTTCACCCATTTCAAGAAATCCAAATTCGAGACGAATCCTGATCTGTTTGGTCAACTAGCGAAAGGGCAGAGCCCGAAGTTTTTGGTATTTGCGTGTTCGGATTCCCGAGTTTGCCCTTCGCACATACTGAATTTCCAACCTGGGGAGGCCTTTGTGGTCCGAAACATCGCCAGCATGGTCCCACCATTTGACACGACTAAATATTCTGGGACGGGGGCTGCAATTGAATATGCTATCTTGCATCTCAAGGTGGAGAATATAGTGGTGATTGGACATAGCTGCTGCGGTGGCATTAAGGGCCTCATGTCCATCCCAACCGATGGAACTCTTTCCAGGCAATTTCTCTTTCTTTCTCTCATTGAGTTCATCGAGAACTGGGTGAAAATATGCACTCCTGCCAAAGCTAAGGTCCAATTAGGTTTCACCGACTTGAGCTTTGAAGACAAATGCACAAACTGCGAGAAGGAAGCTGTGAATGTTTCCCTTGGAAACTTGCTGACATACCCTTTTGTAAGAGAAGCTGTCATCAACAAGCAACTGTTCATAAGGGGTGCTCATTACAACTTTGTCTCTGGAGATTTTGAGCTGTGGAATCTTGACTTTAATATTTCTCCTTCTCTAGTTGTCTGAAAATACCATACAATACCTCCTGGTCGACGACGATAATCAAACTCGGTCGACCAATTATTTTTTAGTTTCTTCCCTTTCAGCAATGTGTCTGTATTCTGCTGCTGTCATAGTATATGAGGCTAAGGAAGCACTTATTTATACTTGCAAGGATGATCTGCGCAACTATCTTCCCCTCCAAAATAAGAGTTTTTTCTTCATTTTCCTTTTGTTTCTCTTTGTCTTGGAAGTGTTTCCTGTAATGGGGATATCCCCATCACTTTTGTAATATTACAGACAATAACCATAATGAAATGGCTTGCTTCATTCTCTTTTAACGGTGCAATAAGATCCTTTTAATGTCAATCCATGCTGAGATACTTCCAATAGTTACAGAGCCCATTTCAGAGTAAAGAAGGGAAAGGTTTCTCAAAGATCCCACAGATTAGTACCCTCATGGGATTGGATGTATGACTACTTTTCCAGTGGCTCGGCTGCTTTCAACATATGAGAAAGCCTCTGCGACCTTAGAGAATGGAAAAGGTCCTTTTGGGTCCACGATCGGCTTCACCTTCCCGCTTTCCAGGAATGGGTTCAGTTTCTTCAGAACAGCTCCATCTGAAGTGACCACAAATCTGAAACCAGGCTCTGTCACTGCGCCTGTCAGCGCCACGACGCTGCCACCTTCTTTCACTACTTTCACCGCCTTATCACACTGCCCTGTCATTTGAGAAACAGTCTAAGAACAACCTAATATCAATGGTTAAATGACTCAAATCCTACAAATCCTCACCAACTGCATCGAAGACTACATCAAATTTTTCGGCCAAATCTTCGATGTTCTCCTTGGTATAATCAATGGCTAAATCCACACCCAAACTTTTCAAGAACTCTAACTTTCCAGTGCTCGAAGTGGCTGCAACTTTTGAAGCTCCAAATACATGTTTTGCTATCTGTAGCAGAAACAAGTCCCAGATTTCCGTTCCATTAAGCAAAACAGCAGAGATAACTTGCGGAGATAGTTAATTATTCATGCGAAAGTGAGAGAAATATAACCTGAATGACTAGGCTTCCAACTCCACCAGCACCGTTGAGAACGAGAATTGATTTGCCAGCA
GAAAAGTTGGTTCTTACCAGACCTTCGTAGGCTGTTTCAATAGCCAGAGGCAACCCAGCGGCCTGAACAAAATCAATATTCTTGGGTTTTAAGGCTAATAATTTTTCCTCTACAGCAGTGTACTCTGCAAGAGAACCGAACTGTCTGGGGCCATCGAGGGCTTTCTCGTTGATGTTGCCATATACTTCGTCCCCTTCTTTTAGCTCCTTTACCTGACTTCCCACCTTCACCACCACACCTGCTACATCATATCCTGGAACTGTC

Coding sequence (CDS)

ATGGCGGAAGAATCGTACGAGGAAGCGATTGCCGGACTGACAAAGCTTCTCAGTGAGAAAGCCAATCTTCAGGAGGCCGCCGCCGTAAGGATCCGGCAAATAACAGCCGAGTTGGCCGGGTCCAACGCCGAGTCTGATGGGTTCGACGCTGTGGATAGGATCAAAACCGGGTTCACCCATTTCAAGAAATCCAAATTCGAGACGAATCCTGATCTGTTTGGTCAACTAGCGAAAGGGCAGAGCCCGAAGTTTTTGGTATTTGCGTGTTCGGATTCCCGAGTTTGCCCTTCGCACATACTGAATTTCCAACCTGGGGAGGCCTTTGTGGTCCGAAACATCGCCAGCATGGTCCCACCATTTGACACGACTAAATATTCTGGGACGGGGGCTGCAATTGAATATGCTATCTTGCATCTCAAGGTGGAGAATATAGTGGTGATTGGACATAGCTGCTGCGGTGGCATTAAGGGCCTCATGTCCATCCCAACCGATGGAACTCTTTCCAGGCAATTTCTCTTTCTTTCTCTCATTGAGTTCATCGAGAACTGGGTGAAAATATGCACTCCTGCCAAAGCTAAGGTCCAATTAGGTTTCACCGACTTGAGCTTTGAAGACAAATGCACAAACTGCGAGAAGGAAGCTGTGAATGTTTCCCTTGGAAACTTGCTGACATACCCTTTTGTAAGAGAAGCTGTCATCAACAAGCAACTGTTCATAAGGGGTGCTCATTACAACTTTGTCTCTGGAGATTTTGAGCTGTGGAATCTTGACTTTAATATTTCTCCTTCTCTAGTTGTCTGA

Protein sequence

MAEESYEEAIAGLTKLLSEKANLQEAAAVRIRQITAELAGSNAESDGFDAVDRIKTGFTHFKKSKFETNPDLFGQLAKGQSPKFLVFACSDSRVCPSHILNFQPGEAFVVRNIASMVPPFDTTKYSGTGAAIEYAILHLKVENIVVIGHSCCGGIKGLMSIPTDGTLSRQFLFLSLIEFIENWVKICTPAKAKVQLGFTDLSFEDKCTNCEKEAVNVSLGNLLTYPFVREAVINKQLFIRGAHYNFVSGDFELWNLDFNISPSLVV
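
The protein sequence corresponds to a codon-by-codon translation of the CDS. The sketch below assumes the standard genetic code and stops at the first stop codon; the `translate` helper and the compact codon-table construction are illustrative, not taken from this record. The example input is the first 27 nucleotides of the CDS above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon.

    Raises KeyError on codons with characters other than A, C, G, T.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCGGAAGAATCGTACGAGGAAGCG"))  # prints MAEESYEEA
```

The output matches the first nine residues of the protein sequence shown above.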
BLAST of Cp4.1LG01g15310 vs. Swiss-Prot
Match: CAHC_TOBAC (Carbonic anhydrase, chloroplastic OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 362.1 bits (928), Expect = 5.3e-99
Identity = 180/266 (67.67%), Positives = 208/266 (78.20%), Query Frame = 1

Query: 1   MAEESYEEAIAGLTKLLSEKANLQEAAAVRIRQITAELAGSNAESDGFDAVDRIKTGFTH 60
           MA+ESYE+AIA L KLLSEK  L   AA R+ QITAEL  S+  S  FD V+ +K GF H
Sbjct: 65  MAKESYEQAIAALEKLLSEKGELGPIAAARVDQITAELQSSDG-SKPFDPVEHMKAGFIH 124

Query: 61  FKKSKFETNPDLFGQLAKGQSPKFLVFACSDSRVCPSHILNFQPGEAFVVRNIASMVPPF 120
           FK  K+E NP L+G+L+KGQSPKF+VFACSDSRVCPSH+LNFQPGEAFVVRNIA+MVP +
Sbjct: 125 FKTEKYEKNPALYGELSKGQSPKFMVFACSDSRVCPSHVLNFQPGEAFVVRNIANMVPAY 184

Query: 121 DTTKYSGTGAAIEYAILHLKVENIVVIGHSCCGGIKGLMSIPTDGTLSRQFLFLSLIEFI 180
           D T+YSG GAAIEYA+LHLKVENIVVIGHS CGGIKGLMS+P DG+ S          FI
Sbjct: 185 DKTRYSGVGAAIEYAVLHLKVENIVVIGHSACGGIKGLMSLPADGSES--------TAFI 244

Query: 181 ENWVKICTPAKAKVQLGFTDLSFEDKCTNCEKEAVNVSLGNLLTYPFVREAVINKQLFIR 240
           E+WVKI  PAKAKVQ    D  F D+CT CEKEAVNVSLGNLLTYPFVRE ++ K L ++
Sbjct: 245 EDWVKIGLPAKAKVQGEHVDKCFADQCTACEKEAVNVSLGNLLTYPFVREGLVKKTLALK 304

Query: 241 GAHYNFVSGDFELWNLDFNISPSLVV 267
           G HY+FV+G FELW L+F +SPSL V
Sbjct: 305 GGHYDFVNGGFELWGLEFGLSPSLSV 321
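
The percentages in each HSP header follow directly from the reported fractions, for example 180/266 for identity. A minimal sketch of recomputing them from a header line is given below; the function name `hsp_fractions` is hypothetical, and each label is read as it appears in the line.

```python
import re

def hsp_fractions(line):
    """Return {label: percent} for every 'label = n/m' pair in an HSP line."""
    out = {}
    for label, num, den in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", line):
        out[label] = round(100 * int(num) / int(den), 2)
    return out

line = "Identity = 180/266 (67.67%), Positives = 208/266 (78.20%), Query Frame = 1"
print(hsp_fractions(line))
```

The "Query Frame = 1" field is skipped because it is not a fraction.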

BLAST of Cp4.1LG01g15310 vs. Swiss-Prot
Match: BCA2_ARATH (Beta carbonic anhydrase 2, chloroplastic OS=Arabidopsis thaliana GN=BCA2 PE=1 SV=3)

HSP 1 Score: 350.5 bits (898), Expect = 1.6e-95
Identity = 170/263 (64.64%), Positives = 205/263 (77.95%), Query Frame = 1

Query: 1   MAEESYEEAIAGLTKLLSEKANLQEAAAVRIRQITAEL-AGSNAESDGFDAVDRIKTGFT 60
           M  ESYE+AI  L KLL EK +L++ AA ++++ITAEL A S+++S  FD V+RIK GF 
Sbjct: 73  MGNESYEDAIEALKKLLIEKDDLKDVAAAKVKKITAELQAASSSDSKSFDPVERIKEGFV 132

Query: 61  HFKKSKFETNPDLFGQLAKGQSPKFLVFACSDSRVCPSHILNFQPGEAFVVRNIASMVPP 120
            FKK K+ETNP L+G+LAKGQSPK++VFACSDSRVCPSH+L+F PG+AFVVRNIA+MVPP
Sbjct: 133 TFKKEKYETNPALYGELAKGQSPKYMVFACSDSRVCPSHVLDFHPGDAFVVRNIANMVPP 192

Query: 121 FDTTKYSGTGAAIEYAILHLKVENIVVIGHSCCGGIKGLMSIPTDGTLSRQFLFLSLIEF 180
           FD  KY+G GAAIEYA+LHLKVENIVVIGHS CGGIKGLMS P DG  S         +F
Sbjct: 193 FDKVKYAGVGAAIEYAVLHLKVENIVVIGHSACGGIKGLMSFPLDGNNS--------TDF 252

Query: 181 IENWVKICTPAKAKVQLGFTDLSFEDKCTNCEKEAVNVSLGNLLTYPFVREAVINKQLFI 240
           IE+WVKIC PAK+KV       +FED+C  CE+EAVNVSL NLLTYPFVRE V+   L +
Sbjct: 253 IEDWVKICLPAKSKVLAESESSAFEDQCGRCEREAVNVSLANLLTYPFVREGVVKGTLAL 312

Query: 241 RGAHYNFVSGDFELWNLDFNISP 263
           +G +Y+FV+G FELW L F ISP
Sbjct: 313 KGGYYDFVNGSFELWELQFGISP 327

BLAST of Cp4.1LG01g15310 vs. Swiss-Prot
Match: CAHC_PEA (Carbonic anhydrase, chloroplastic OS=Pisum sativum PE=1 SV=1)

HSP 1 Score: 341.7 bits (875), Expect = 7.4e-93
Identity = 166/266 (62.41%), Positives = 203/266 (76.32%), Query Frame = 1

Query: 4   ESYEEAIAGLTKLLSEKANLQEAAAVRIRQITAELAGSNAESDGF---DAVDRIKTGFTH 63
           + Y+EAI  L KLL EK  L+  AA ++ QITA+L G+ + SDG    +A +RIKTGF H
Sbjct: 72  KGYDEAIEELQKLLREKTELKATAAEKVEQITAQL-GTTSSSDGIPKSEASERIKTGFLH 131

Query: 64  FKKSKFETNPDLFGQLAKGQSPKFLVFACSDSRVCPSHILNFQPGEAFVVRNIASMVPPF 123
           FKK K++ NP L+G+LAKGQSP F+VFACSDSRVCPSH+L+FQPGEAFVVRN+A++VPP+
Sbjct: 132 FKKEKYDKNPALYGELAKGQSPPFMVFACSDSRVCPSHVLDFQPGEAFVVRNVANLVPPY 191

Query: 124 DTTKYSGTGAAIEYAILHLKVENIVVIGHSCCGGIKGLMSIPTDGTLSRQFLFLSLIEFI 183
           D  KY+GTGAAIEYA+LHLKV NIVVIGHS CGGIKGL+S P DGT S         +FI
Sbjct: 192 DQAKYAGTGAAIEYAVLHLKVSNIVVIGHSACGGIKGLLSFPFDGTYST--------DFI 251

Query: 184 ENWVKICTPAKAKVQLGFTDLSFEDKCTNCEKEAVNVSLGNLLTYPFVREAVINKQLFIR 243
           E WVKI  PAKAKV+    D  F + CT+CEKEAVN SLGNLLTYPFVRE ++NK L ++
Sbjct: 252 EEWVKIGLPAKAKVKAQHGDAPFAELCTHCEKEAVNASLGNLLTYPFVREGLVNKTLALK 311

Query: 244 GAHYNFVSGDFELWNLDFNISPSLVV 267
           G +Y+FV G FELW L+F +S +  V
Sbjct: 312 GGYYDFVKGSFELWGLEFGLSSTFSV 328

BLAST of Cp4.1LG01g15310 vs. Swiss-Prot
Match: BCA1_ARATH (Beta carbonic anhydrase 1, chloroplastic OS=Arabidopsis thaliana GN=BCA1 PE=1 SV=2)

HSP 1 Score: 340.5 bits (872), Expect = 1.7e-92
Identity = 165/262 (62.98%), Positives = 199/262 (75.95%), Query Frame = 1

Query: 1   MAEESYEEAIAGLTKLLSEKANLQEAAAVRIRQITAEL-AGSNAESDGFDAVDRIKTGFT 60
           M  E+Y+EAI  L KLL EK  L+  AA ++ QITA L  G++++   FD V+ IK GF 
Sbjct: 78  MGTEAYDEAIEALKKLLIEKEELKTVAAAKVEQITAALQTGTSSDKKAFDPVETIKQGFI 137

Query: 61  HFKKSKFETNPDLFGQLAKGQSPKFLVFACSDSRVCPSHILNFQPGEAFVVRNIASMVPP 120
            FKK K+ETNP L+G+LAKGQSPK++VFACSDSRVCPSH+L+FQPG+AFVVRNIA+MVPP
Sbjct: 138 KFKKEKYETNPALYGELAKGQSPKYMVFACSDSRVCPSHVLDFQPGDAFVVRNIANMVPP 197

Query: 121 FDTTKYSGTGAAIEYAILHLKVENIVVIGHSCCGGIKGLMSIPTDGTLSRQFLFLSLIEF 180
           FD  KY G GAAIEYA+LHLKVENIVVIGHS CGGIKGLMS P DG  S         +F
Sbjct: 198 FDKVKYGGVGAAIEYAVLHLKVENIVVIGHSACGGIKGLMSFPLDGNNS--------TDF 257

Query: 181 IENWVKICTPAKAKVQLGFTDLSFEDKCTNCEKEAVNVSLGNLLTYPFVREAVINKQLFI 240
           IE+WVKIC PAK+KV     D +FED+C  CE+EAVNVSL NLLTYPFVRE ++   L +
Sbjct: 258 IEDWVKICLPAKSKVISELGDSAFEDQCGRCEREAVNVSLANLLTYPFVREGLVKGTLAL 317

Query: 241 RGAHYNFVSGDFELWNLDFNIS 262
           +G +Y+FV G FELW L+F +S
Sbjct: 318 KGGYYDFVKGAFELWGLEFGLS 331

BLAST of Cp4.1LG01g15310 vs. Swiss-Prot
Match: BCA4_ARATH (Beta carbonic anhydrase 4 OS=Arabidopsis thaliana GN=BCA4 PE=1 SV=1)

HSP 1 Score: 336.7 bits (862), Expect = 2.4e-91
Identity = 166/263 (63.12%), Positives = 196/263 (74.52%), Query Frame = 1

Query: 1   MAEESYEEAIAGLTKLLSEKANLQEAAAVRIRQITAELAGSNAESDGFDAVDRIKTGFTH 60
           MA ESYE AI GL  LLS KA+L   AA +I+ +TAEL     +S   DA++RIKTGFT 
Sbjct: 23  MATESYEAAIKGLNDLLSTKADLGNVAAAKIKALTAEL--KELDSSNSDAIERIKTGFTQ 82

Query: 61  FKKSKFETNPDLFGQLAKGQSPKFLVFACSDSRVCPSHILNFQPGEAFVVRNIASMVPPF 120
           FK  K+  N  LF  LAK Q+PKFLVFACSDSRVCPSHILNFQPGEAFVVRNIA+MVPPF
Sbjct: 83  FKTEKYLKNSTLFNHLAKTQTPKFLVFACSDSRVCPSHILNFQPGEAFVVRNIANMVPPF 142

Query: 121 DTTKYSGTGAAIEYAILHLKVENIVVIGHSCCGGIKGLMSIPTDGTLSRQFLFLSLIEFI 180
           D  ++SG GAA+EYA++HLKVENI+VIGHSCCGGIKGLMSI  D   ++        +FI
Sbjct: 143 DQKRHSGVGAAVEYAVVHLKVENILVIGHSCCGGIKGLMSIEDDAAPTQS-------DFI 202

Query: 181 ENWVKICTPAKAKVQLGFTDLSFEDKCTNCEKEAVNVSLGNLLTYPFVREAVINKQLFIR 240
           ENWVKI   A+ K++    DLS++D+C  CEKEAVNVSLGNLL+YPFVR  V+   L IR
Sbjct: 203 ENWVKIGASARNKIKEEHKDLSYDDQCNKCEKEAVNVSLGNLLSYPFVRAEVVKNTLAIR 262

Query: 241 GAHYNFVSGDFELWNLDFNISPS 264
           G HYNFV G F+LW LDF  +P+
Sbjct: 263 GGHYNFVKGTFDLWELDFKTTPA 276

BLAST of Cp4.1LG01g15310 vs. TrEMBL
Match: A0A0A0KVM4_CUCSA (Carbonic anhydrase OS=Cucumis sativus GN=Csa_5G601560 PE=3 SV=1)

HSP 1 Score: 458.4 bits (1178), Expect = 6.1e-126
Identity = 225/266 (84.59%), Positives = 242/266 (90.98%), Query Frame = 1

Query: 1   MAEESYEEAIAGLTKLLSEKANLQEAAAVRIRQITAELAGSNAESDGFDAVDRIKTGFTH 60
           MA+ESYEEAIAGL+KLLSEKA+LQ+AAA +IRQITAELAGS+A S+GFD VDRIKTGFTH
Sbjct: 1   MAQESYEEAIAGLSKLLSEKADLQDAAAAKIRQITAELAGSSACSNGFDPVDRIKTGFTH 60

Query: 61  FKKSKFETNPDLFGQLAKGQSPKFLVFACSDSRVCPSHILNFQPGEAFVVRNIASMVPPF 120
           FKKSKFETNP+++G LAKGQSPKFLVFACSDSRVCPSHILNFQPGEAF+VRNIA+MVPPF
Sbjct: 61  FKKSKFETNPEVYGALAKGQSPKFLVFACSDSRVCPSHILNFQPGEAFMVRNIANMVPPF 120

Query: 121 DTTKYSGTGAAIEYAILHLKVENIVVIGHSCCGGIKGLMSIPTDGTLSRQFLFLSLIEFI 180
           D TKYSG GAAIEYAILHLKVENIVVIGHSCCGGIKGLMSIP DG +S         +FI
Sbjct: 121 DKTKYSGAGAAIEYAILHLKVENIVVIGHSCCGGIKGLMSIPDDGAISS--------DFI 180

Query: 181 ENWVKICTPAKAKVQLGFTDLSFEDKCTNCEKEAVNVSLGNLLTYPFVREAVINKQLFIR 240
           ENWVKICTPAK K Q   TDLSFEDKCTNCEKEAVNVSLGNLL+YPFVREAV+NK+LFIR
Sbjct: 181 ENWVKICTPAKNKTQSDCTDLSFEDKCTNCEKEAVNVSLGNLLSYPFVREAVVNKRLFIR 240

Query: 241 GAHYNFVSGDFELWNLDFNISPSLVV 267
           GAHYNFVSG FELWNLDFNISPSL V
Sbjct: 241 GAHYNFVSGAFELWNLDFNISPSLAV 258
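
The identity count in an HSP header is the number of alignment columns where the query and subject residues are the same. The sketch below counts identical and gapped columns for one pair of aligned rows; `alignment_counts` is a hypothetical helper, and it does not compute positives, which would require a substitution matrix that this record does not name. The example uses the first 20 columns of the alignment above.

```python
def alignment_counts(query, sbjct):
    """Count columns, identities, and gap columns for two aligned rows."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have the same length")
    pairs = list(zip(query, sbjct))
    # A column is an identity when both rows carry the same residue (not a gap).
    identities = sum(q == s and q != "-" for q, s in pairs)
    # A column is gapped when either row carries '-'.
    gaps = sum(q == "-" or s == "-" for q, s in pairs)
    return {"columns": len(pairs), "identities": identities, "gaps": gaps}

print(alignment_counts("MAEESYEEAIAGLTKLLSEK", "MAQESYEEAIAGLSKLLSEK"))
```

Summing these counts over every row of an HSP should give the numerator and denominator reported in its "Identity" field.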

BLAST of Cp4.1LG01g15310 vs. TrEMBL
Match: E5GBJ5_CUCME (Carbonic anhydrase OS=Cucumis melo subsp. melo PE=3 SV=1)

HSP 1 Score: 453.8 bits (1166), Expect = 1.5e-124
Identity = 222/266 (83.46%), Positives = 238/266 (89.47%), Query Frame = 1

Query: 1   MAEESYEEAIAGLTKLLSEKANLQEAAAVRIRQITAELAGSNAESDGFDAVDRIKTGFTH 60
           MA+ESYEEAIAGL+KLLSEKA+LQ+AAA +IRQITAEL G+ A S+GFD VDRIKTGFTH
Sbjct: 1   MAQESYEEAIAGLSKLLSEKADLQDAAAAKIRQITAELGGTTACSNGFDPVDRIKTGFTH 60

Query: 61  FKKSKFETNPDLFGQLAKGQSPKFLVFACSDSRVCPSHILNFQPGEAFVVRNIASMVPPF 120
           FKKSKFETNPDL+GQLAKGQSPKFLVFACSDSRVCPSHILNFQPGEAFVVRNIA+MVPPF
Sbjct: 61  FKKSKFETNPDLYGQLAKGQSPKFLVFACSDSRVCPSHILNFQPGEAFVVRNIANMVPPF 120

Query: 121 DTTKYSGTGAAIEYAILHLKVENIVVIGHSCCGGIKGLMSIPTDGTLSRQFLFLSLIEFI 180
           D TKYSG GAAIEYA+LHLKVENIVVIGHSCCGGIKGLMSIP DG  S         +FI
Sbjct: 121 DKTKYSGAGAAIEYAVLHLKVENIVVIGHSCCGGIKGLMSIPDDGAFSS--------DFI 180

Query: 181 ENWVKICTPAKAKVQLGFTDLSFEDKCTNCEKEAVNVSLGNLLTYPFVREAVINKQLFIR 240
           ENWV+ICTPAK K Q    DLSFEDKCT CEKEAVNVSLGNLL+YPFVREAV+NK++FIR
Sbjct: 181 ENWVQICTPAKNKTQSNCNDLSFEDKCTECEKEAVNVSLGNLLSYPFVREAVVNKKVFIR 240

Query: 241 GAHYNFVSGDFELWNLDFNISPSLVV 267
           GAHYNFVSG FELWNLDFNISPSL V
Sbjct: 241 GAHYNFVSGAFELWNLDFNISPSLAV 258

BLAST of Cp4.1LG01g15310 vs. TrEMBL
Match: A0A061EDW0_THECC (Carbonic anhydrase OS=Theobroma cacao GN=TCM_010364 PE=3 SV=1)

HSP 1 Score: 395.6 bits (1015), Expect = 4.8e-107
Identity = 192/266 (72.18%), Positives = 220/266 (82.71%), Query Frame = 1

Query: 1   MAEESYEEAIAGLTKLLSEKANLQEAAAVRIRQITAELAGSNAESDGFDAVDRIKTGFTH 60
           M   SYEEAIA L+KLLS+KA+LQ  AA +I QITAEL  + A+ + FD V RI+TGF H
Sbjct: 23  MGSGSYEEAIAALSKLLSDKADLQSVAAAKIMQITAELEAA-ADPNQFDPVKRIETGFLH 82

Query: 61  FKKSKFETNPDLFGQLAKGQSPKFLVFACSDSRVCPSHILNFQPGEAFVVRNIASMVPPF 120
           FKK K+E NPDL+G+LAKGQSPKFLVFACSDSRVCPSHIL+FQPGEAF+VRNIA+MVPP+
Sbjct: 83  FKKEKYEKNPDLYGELAKGQSPKFLVFACSDSRVCPSHILDFQPGEAFMVRNIANMVPPY 142

Query: 121 DTTKYSGTGAAIEYAILHLKVENIVVIGHSCCGGIKGLMSIPTDGTLSRQFLFLSLIEFI 180
           D TKYSG GAAIEYA+LHLKVENIVVIGHSCCGGIKGLMSIP DGT +         +FI
Sbjct: 143 DKTKYSGVGAAIEYAVLHLKVENIVVIGHSCCGGIKGLMSIPDDGTTAS--------DFI 202

Query: 181 ENWVKICTPAKAKVQLGFTDLSFEDKCTNCEKEAVNVSLGNLLTYPFVREAVINKQLFIR 240
           E WV IC PAK KV+    DLSF ++CTNCEKEAVNVSLGNLLTYPFVREAV+ K L ++
Sbjct: 203 EQWVSICAPAKTKVKSECNDLSFSEQCTNCEKEAVNVSLGNLLTYPFVREAVVKKSLVLK 262

Query: 241 GAHYNFVSGDFELWNLDFNISPSLVV 267
           GAHY+FV G F+LWNLDFNI+P+L V
Sbjct: 263 GAHYDFVDGKFDLWNLDFNITPTLAV 279

BLAST of Cp4.1LG01g15310 vs. TrEMBL
Match: A0A061E630_THECC (Carbonic anhydrase OS=Theobroma cacao GN=TCM_010364 PE=3 SV=1)

HSP 1 Score: 395.6 bits (1015), Expect = 4.8e-107
Identity = 192/266 (72.18%), Positives = 220/266 (82.71%), Query Frame = 1

Query: 1   MAEESYEEAIAGLTKLLSEKANLQEAAAVRIRQITAELAGSNAESDGFDAVDRIKTGFTH 60
           M   SYEEAIA L+KLLS+KA+LQ  AA +I QITAEL  + A+ + FD V RI+TGF H
Sbjct: 1   MGSGSYEEAIAALSKLLSDKADLQSVAAAKIMQITAELEAA-ADPNQFDPVKRIETGFLH 60

Query: 61  FKKSKFETNPDLFGQLAKGQSPKFLVFACSDSRVCPSHILNFQPGEAFVVRNIASMVPPF 120
           FKK K+E NPDL+G+LAKGQSPKFLVFACSDSRVCPSHIL+FQPGEAF+VRNIA+MVPP+
Sbjct: 61  FKKEKYEKNPDLYGELAKGQSPKFLVFACSDSRVCPSHILDFQPGEAFMVRNIANMVPPY 120

Query: 121 DTTKYSGTGAAIEYAILHLKVENIVVIGHSCCGGIKGLMSIPTDGTLSRQFLFLSLIEFI 180
           D TKYSG GAAIEYA+LHLKVENIVVIGHSCCGGIKGLMSIP DGT +         +FI
Sbjct: 121 DKTKYSGVGAAIEYAVLHLKVENIVVIGHSCCGGIKGLMSIPDDGTTAS--------DFI 180

Query: 181 ENWVKICTPAKAKVQLGFTDLSFEDKCTNCEKEAVNVSLGNLLTYPFVREAVINKQLFIR 240
           E WV IC PAK KV+    DLSF ++CTNCEKEAVNVSLGNLLTYPFVREAV+ K L ++
Sbjct: 181 EQWVSICAPAKTKVKSECNDLSFSEQCTNCEKEAVNVSLGNLLTYPFVREAVVKKSLVLK 240

Query: 241 GAHYNFVSGDFELWNLDFNISPSLVV 267
           GAHY+FV G F+LWNLDFNI+P+L V
Sbjct: 241 GAHYDFVDGKFDLWNLDFNITPTLAV 257

BLAST of Cp4.1LG01g15310 vs. TrEMBL
Match: A0A0D2RHC0_GOSRA (Carbonic anhydrase OS=Gossypium raimondii GN=B456_005G182300 PE=3 SV=1)

HSP 1 Score: 392.1 bits (1006), Expect = 5.3e-106
Identity = 190/266 (71.43%), Positives = 219/266 (82.33%), Query Frame = 1

Query: 1   MAEESYEEAIAGLTKLLSEKANLQEAAAVRIRQITAELAGSNAESDGFDAVDRIKTGFTH 60
           M  ESYEEAIA L+KLLS+KA+L   AA +I+QITAEL  + A+S  FD V R++TGF H
Sbjct: 1   MGSESYEEAIAALSKLLSDKADLGSVAAAKIKQITAELEAA-ADSTQFDPVKRLETGFLH 60

Query: 61  FKKSKFETNPDLFGQLAKGQSPKFLVFACSDSRVCPSHILNFQPGEAFVVRNIASMVPPF 120
           FKK KF+ NPDL+G LAKGQSPKFLVFACSDSRVCPSHILNFQPGEAF+VRNIASMVPP+
Sbjct: 61  FKKEKFDKNPDLYGALAKGQSPKFLVFACSDSRVCPSHILNFQPGEAFIVRNIASMVPPY 120

Query: 121 DTTKYSGTGAAIEYAILHLKVENIVVIGHSCCGGIKGLMSIPTDGTLSRQFLFLSLIEFI 180
           D  KYSG GAAIEYA+LHLKVENIVVIGHSCCGGIKGLMSIP DGT +         +FI
Sbjct: 121 DKKKYSGAGAAIEYAVLHLKVENIVVIGHSCCGGIKGLMSIPDDGTTAS--------DFI 180

Query: 181 ENWVKICTPAKAKVQLGFTDLSFEDKCTNCEKEAVNVSLGNLLTYPFVREAVINKQLFIR 240
           E WV ICTPAK KV+    +LSF ++CTNCEKEAVNVSLGNLLTYPFVREAV+ K + ++
Sbjct: 181 EQWVSICTPAKTKVKSEQNELSFSEQCTNCEKEAVNVSLGNLLTYPFVREAVVKKTVALK 240

Query: 241 GAHYNFVSGDFELWNLDFNISPSLVV 267
           GAHY+FV+G  +LWNLDF ISP+L +
Sbjct: 241 GAHYDFVNGKLDLWNLDFKISPTLAI 257

BLAST of Cp4.1LG01g15310 vs. TAIR10
Match: AT5G14740.1 (AT5G14740.1 carbonic anhydrase 2)

HSP 1 Score: 350.5 bits (898), Expect = 9.0e-97
Identity = 170/263 (64.64%), Positives = 205/263 (77.95%), Query Frame = 1

Query: 1   MAEESYEEAIAGLTKLLSEKANLQEAAAVRIRQITAEL-AGSNAESDGFDAVDRIKTGFT 60
           M  ESYE+AI  L KLL EK +L++ AA ++++ITAEL A S+++S  FD V+RIK GF 
Sbjct: 73  MGNESYEDAIEALKKLLIEKDDLKDVAAAKVKKITAELQAASSSDSKSFDPVERIKEGFV 132

Query: 61  HFKKSKFETNPDLFGQLAKGQSPKFLVFACSDSRVCPSHILNFQPGEAFVVRNIASMVPP 120
            FKK K+ETNP L+G+LAKGQSPK++VFACSDSRVCPSH+L+F PG+AFVVRNIA+MVPP
Sbjct: 133 TFKKEKYETNPALYGELAKGQSPKYMVFACSDSRVCPSHVLDFHPGDAFVVRNIANMVPP 192

Query: 121 FDTTKYSGTGAAIEYAILHLKVENIVVIGHSCCGGIKGLMSIPTDGTLSRQFLFLSLIEF 180
           FD  KY+G GAAIEYA+LHLKVENIVVIGHS CGGIKGLMS P DG  S         +F
Sbjct: 193 FDKVKYAGVGAAIEYAVLHLKVENIVVIGHSACGGIKGLMSFPLDGNNS--------TDF 252

Query: 181 IENWVKICTPAKAKVQLGFTDLSFEDKCTNCEKEAVNVSLGNLLTYPFVREAVINKQLFI 240
           IE+WVKIC PAK+KV       +FED+C  CE+EAVNVSL NLLTYPFVRE V+   L +
Sbjct: 253 IEDWVKICLPAKSKVLAESESSAFEDQCGRCEREAVNVSLANLLTYPFVREGVVKGTLAL 312

Query: 241 RGAHYNFVSGDFELWNLDFNISP 263
           +G +Y+FV+G FELW L F ISP
Sbjct: 313 KGGYYDFVNGSFELWELQFGISP 327

BLAST of Cp4.1LG01g15310 vs. TAIR10
Match: AT3G01500.2 (AT3G01500.2 carbonic anhydrase 1)

HSP 1 Score: 340.5 bits (872), Expect = 9.3e-94
Identity = 165/262 (62.98%), Positives = 199/262 (75.95%), Query Frame = 1

Query: 1   MAEESYEEAIAGLTKLLSEKANLQEAAAVRIRQITAEL-AGSNAESDGFDAVDRIKTGFT 60
           M  E+Y+EAI  L KLL EK  L+  AA ++ QITA L  G++++   FD V+ IK GF 
Sbjct: 78  MGTEAYDEAIEALKKLLIEKEELKTVAAAKVEQITAALQTGTSSDKKAFDPVETIKQGFI 137

Query: 61  HFKKSKFETNPDLFGQLAKGQSPKFLVFACSDSRVCPSHILNFQPGEAFVVRNIASMVPP 120
            FKK K+ETNP L+G+LAKGQSPK++VFACSDSRVCPSH+L+FQPG+AFVVRNIA+MVPP
Sbjct: 138 KFKKEKYETNPALYGELAKGQSPKYMVFACSDSRVCPSHVLDFQPGDAFVVRNIANMVPP 197

Query: 121 FDTTKYSGTGAAIEYAILHLKVENIVVIGHSCCGGIKGLMSIPTDGTLSRQFLFLSLIEF 180
           FD  KY G GAAIEYA+LHLKVENIVVIGHS CGGIKGLMS P DG  S         +F
Sbjct: 198 FDKVKYGGVGAAIEYAVLHLKVENIVVIGHSACGGIKGLMSFPLDGNNS--------TDF 257

Query: 181 IENWVKICTPAKAKVQLGFTDLSFEDKCTNCEKEAVNVSLGNLLTYPFVREAVINKQLFI 240
           IE+WVKIC PAK+KV     D +FED+C  CE+EAVNVSL NLLTYPFVRE ++   L +
Sbjct: 258 IEDWVKICLPAKSKVISELGDSAFEDQCGRCEREAVNVSLANLLTYPFVREGLVKGTLAL 317

Query: 241 RGAHYNFVSGDFELWNLDFNIS 262
           +G +Y+FV G FELW L+F +S
Sbjct: 318 KGGYYDFVKGAFELWGLEFGLS 331

BLAST of Cp4.1LG01g15310 vs. TAIR10
Match: AT1G70410.2 (AT1G70410.2 beta carbonic anhydrase 4)

HSP 1 Score: 336.7 bits (862), Expect = 1.3e-92
Identity = 166/263 (63.12%), Positives = 196/263 (74.52%), Query Frame = 1

Query: 1   MAEESYEEAIAGLTKLLSEKANLQEAAAVRIRQITAELAGSNAESDGFDAVDRIKTGFTH 60
           MA ESYE AI GL  LLS KA+L   AA +I+ +TAEL     +S   DA++RIKTGFT 
Sbjct: 23  MATESYEAAIKGLNDLLSTKADLGNVAAAKIKALTAEL--KELDSSNSDAIERIKTGFTQ 82

Query: 61  FKKSKFETNPDLFGQLAKGQSPKFLVFACSDSRVCPSHILNFQPGEAFVVRNIASMVPPF 120
           FK  K+  N  LF  LAK Q+PKFLVFACSDSRVCPSHILNFQPGEAFVVRNIA+MVPPF
Sbjct: 83  FKTEKYLKNSTLFNHLAKTQTPKFLVFACSDSRVCPSHILNFQPGEAFVVRNIANMVPPF 142

Query: 121 DTTKYSGTGAAIEYAILHLKVENIVVIGHSCCGGIKGLMSIPTDGTLSRQFLFLSLIEFI 180
           D  ++SG GAA+EYA++HLKVENI+VIGHSCCGGIKGLMSI  D   ++        +FI
Sbjct: 143 DQKRHSGVGAAVEYAVVHLKVENILVIGHSCCGGIKGLMSIEDDAAPTQS-------DFI 202

Query: 181 ENWVKICTPAKAKVQLGFTDLSFEDKCTNCEKEAVNVSLGNLLTYPFVREAVINKQLFIR 240
           ENWVKI   A+ K++    DLS++D+C  CEKEAVNVSLGNLL+YPFVR  V+   L IR
Sbjct: 203 ENWVKIGASARNKIKEEHKDLSYDDQCNKCEKEAVNVSLGNLLSYPFVRAEVVKNTLAIR 262

Query: 241 GAHYNFVSGDFELWNLDFNISPS 264
           G HYNFV G F+LW LDF  +P+
Sbjct: 263 GGHYNFVKGTFDLWELDFKTTPA 276

BLAST of Cp4.1LG01g15310 vs. TAIR10
Match: AT1G23730.1 (AT1G23730.1 beta carbonic anhydrase 3)

HSP 1 Score: 330.5 bits (846), Expect = 9.6e-91
Identity = 162/266 (60.90%), Positives = 203/266 (76.32%), Query Frame = 1

Query: 1   MAEESYEEAIAGLTKLLSEKANLQEAAAVRIRQITAELAGSNAESDGFDAVDRIKTGFTH 60
           M+ ESYE+AI  L +LLS+K++L   AA +I+++T EL     +S+  DAV+RIK+GF H
Sbjct: 1   MSTESYEDAIKRLGELLSKKSDLGNVAAAKIKKLTDEL--EELDSNKLDAVERIKSGFLH 60

Query: 61  FKKSKFETNPDLFGQLAKGQSPKFLVFACSDSRVCPSHILNFQPGEAFVVRNIASMVPPF 120
           FK + +E NP L+  LAK Q+PKFLVFAC+DSRV PSHILNFQ GEAF+VRNIA+MVPP+
Sbjct: 61  FKTNNYEKNPTLYNSLAKSQTPKFLVFACADSRVSPSHILNFQLGEAFIVRNIANMVPPY 120

Query: 121 DTTKYSGTGAAIEYAILHLKVENIVVIGHSCCGGIKGLMSIPTDGTLSRQFLFLSLIEFI 180
           D TK+S  GAA+EY I  L VENI+VIGHSCCGGIKGLM+I  D T   +       EFI
Sbjct: 121 DKTKHSNVGAALEYPITVLNVENILVIGHSCCGGIKGLMAI-EDNTAPTK------TEFI 180

Query: 181 ENWVKICTPAKAKVQLGFTDLSFEDKCTNCEKEAVNVSLGNLLTYPFVREAVINKQLFIR 240
           ENW++IC PAK +++    DLSFED+CTNCEKEAVNVSLGNLL+YPFVRE V+  +L IR
Sbjct: 181 ENWIQICAPAKNRIKQDCKDLSFEDQCTNCEKEAVNVSLGNLLSYPFVRERVVKNKLAIR 240

Query: 241 GAHYNFVSGDFELWNLDFNISPSLVV 267
           GAHY+FV G F+LW LDF  +P+  +
Sbjct: 241 GAHYDFVKGTFDLWELDFKTTPAFAL 257

BLAST of Cp4.1LG01g15310 vs. TAIR10
Match: AT4G33580.2 (AT4G33580.2 beta carbonic anhydrase 5)

HSP 1 Score: 186.8 bits (473), Expect = 1.7e-47
Identity = 103/242 (42.56%), Positives = 146/242 (60.33%), Query Frame = 1

Query: 22  NLQEAAAVRIRQITAELAGSNAE-SDGFDAVDRIKTGFTHFKKSKFETNP-DLFGQLAKG 81
           NLQ  A+ +   +T E  G   +  +  D  D +K  F  FKK K+  +  + +  LA  
Sbjct: 52  NLQVMASGKTPGLTQEANGVAIDRQNNTDVFDDMKQRFLAFKKLKYIRDDFEHYKNLADA 111

Query: 82  QSPKFLVFACSDSRVCPSHILNFQPGEAFVVRNIASMVPPFDTTKYSGTGAAIEYAILHL 141
           Q+PKFLV AC+DSRVCPS +L FQPG+AF VRNIA++VPP+++   + T AA+E+++  L
Sbjct: 112 QAPKFLVIACADSRVCPSAVLGFQPGDAFTVRNIANLVPPYESGP-TETKAALEFSVNTL 171

Query: 142 KVENIVVIGHSCCGGIKGLMSIPTDGTLSRQFLFLSLIEFIENWVKICTPAKAKVQLGFT 201
            VENI+VIGHS CGGI+ LM +  +G  SR         FI NWV +   AK   +   +
Sbjct: 172 NVENILVIGHSRCGGIQALMKMEDEGD-SR--------SFIHNWVVVGKKAKESTKAVAS 231

Query: 202 DLSFEDKCTNCEKEAVNVSLGNLLTYPFVREAVINKQLFIRGAHYNFVSGDFELWNLDFN 261
           +L F+ +C +CEK ++N SL  LL YP++ E V    L + G +YNFV   FE W +D+ 
Sbjct: 232 NLHFDHQCQHCEKASINHSLERLLGYPWIEEKVRQGSLSLHGGYYNFVDCTFEKWTVDYA 283

BLAST of Cp4.1LG01g15310 vs. NCBI nr
Match: gi|449434923|ref|XP_004135245.1| (PREDICTED: carbonic anhydrase 2-like isoform X2 [Cucumis sativus])

HSP 1 Score: 458.4 bits (1178), Expect = 8.7e-126
Identity = 225/266 (84.59%), Positives = 242/266 (90.98%), Query Frame = 1

Query: 1   MAEESYEEAIAGLTKLLSEKANLQEAAAVRIRQITAELAGSNAESDGFDAVDRIKTGFTH 60
           MA+ESYEEAIAGL+KLLSEKA+LQ+AAA +IRQITAELAGS+A S+GFD VDRIKTGFTH
Sbjct: 1   MAQESYEEAIAGLSKLLSEKADLQDAAAAKIRQITAELAGSSACSNGFDPVDRIKTGFTH 60

Query: 61  FKKSKFETNPDLFGQLAKGQSPKFLVFACSDSRVCPSHILNFQPGEAFVVRNIASMVPPF 120
           FKKSKFETNP+++G LAKGQSPKFLVFACSDSRVCPSHILNFQPGEAF+VRNIA+MVPPF
Sbjct: 61  FKKSKFETNPEVYGALAKGQSPKFLVFACSDSRVCPSHILNFQPGEAFMVRNIANMVPPF 120

Query: 121 DTTKYSGTGAAIEYAILHLKVENIVVIGHSCCGGIKGLMSIPTDGTLSRQFLFLSLIEFI 180
           D TKYSG GAAIEYAILHLKVENIVVIGHSCCGGIKGLMSIP DG +S         +FI
Sbjct: 121 DKTKYSGAGAAIEYAILHLKVENIVVIGHSCCGGIKGLMSIPDDGAISS--------DFI 180

Query: 181 ENWVKICTPAKAKVQLGFTDLSFEDKCTNCEKEAVNVSLGNLLTYPFVREAVINKQLFIR 240
           ENWVKICTPAK K Q   TDLSFEDKCTNCEKEAVNVSLGNLL+YPFVREAV+NK+LFIR
Sbjct: 181 ENWVKICTPAKNKTQSDCTDLSFEDKCTNCEKEAVNVSLGNLLSYPFVREAVVNKRLFIR 240

Query: 241 GAHYNFVSGDFELWNLDFNISPSLVV 267
           GAHYNFVSG FELWNLDFNISPSL V
Sbjct: 241 GAHYNFVSGAFELWNLDFNISPSLAV 258

BLAST of Cp4.1LG01g15310 vs. NCBI nr
Match: gi|449434921|ref|XP_004135244.1| (PREDICTED: carbonic anhydrase 2-like isoform X1 [Cucumis sativus])

HSP 1 Score: 458.4 bits (1178), Expect = 8.7e-126
Identity = 225/266 (84.59%), Positives = 242/266 (90.98%), Query Frame = 1

Query: 1   MAEESYEEAIAGLTKLLSEKANLQEAAAVRIRQITAELAGSNAESDGFDAVDRIKTGFTH 60
           MA+ESYEEAIAGL+KLLSEKA+LQ+AAA +IRQITAELAGS+A S+GFD VDRIKTGFTH
Sbjct: 21  MAQESYEEAIAGLSKLLSEKADLQDAAAAKIRQITAELAGSSACSNGFDPVDRIKTGFTH 80

Query: 61  FKKSKFETNPDLFGQLAKGQSPKFLVFACSDSRVCPSHILNFQPGEAFVVRNIASMVPPF 120
           FKKSKFETNP+++G LAKGQSPKFLVFACSDSRVCPSHILNFQPGEAF+VRNIA+MVPPF
Sbjct: 81  FKKSKFETNPEVYGALAKGQSPKFLVFACSDSRVCPSHILNFQPGEAFMVRNIANMVPPF 140

Query: 121 DTTKYSGTGAAIEYAILHLKVENIVVIGHSCCGGIKGLMSIPTDGTLSRQFLFLSLIEFI 180
           D TKYSG GAAIEYAILHLKVENIVVIGHSCCGGIKGLMSIP DG +S         +FI
Sbjct: 141 DKTKYSGAGAAIEYAILHLKVENIVVIGHSCCGGIKGLMSIPDDGAISS--------DFI 200

Query: 181 ENWVKICTPAKAKVQLGFTDLSFEDKCTNCEKEAVNVSLGNLLTYPFVREAVINKQLFIR 240
           ENWVKICTPAK K Q   TDLSFEDKCTNCEKEAVNVSLGNLL+YPFVREAV+NK+LFIR
Sbjct: 201 ENWVKICTPAKNKTQSDCTDLSFEDKCTNCEKEAVNVSLGNLLSYPFVREAVVNKRLFIR 260

Query: 241 GAHYNFVSGDFELWNLDFNISPSLVV 267
           GAHYNFVSG FELWNLDFNISPSL V
Sbjct: 261 GAHYNFVSGAFELWNLDFNISPSLAV 278

BLAST of Cp4.1LG01g15310 vs. NCBI nr
Match: gi|659090800|ref|XP_008446208.1| (PREDICTED: carbonic anhydrase 2-like [Cucumis melo])

HSP 1 Score: 453.8 bits (1166), Expect = 2.1e-124
Identity = 222/266 (83.46%), Positives = 238/266 (89.47%), Query Frame = 1

Query: 1   MAEESYEEAIAGLTKLLSEKANLQEAAAVRIRQITAELAGSNAESDGFDAVDRIKTGFTH 60
           MA+ESYEEAIAGL+KLLSEKA+LQ+AAA +IRQITAEL G+ A S+GFD VDRIKTGFTH
Sbjct: 1   MAQESYEEAIAGLSKLLSEKADLQDAAAAKIRQITAELGGTTACSNGFDPVDRIKTGFTH 60

Query: 61  FKKSKFETNPDLFGQLAKGQSPKFLVFACSDSRVCPSHILNFQPGEAFVVRNIASMVPPF 120
           FKKSKFETNPDL+GQLAKGQSPKFLVFACSDSRVCPSHILNFQPGEAFVVRNIA+MVPPF
Sbjct: 61  FKKSKFETNPDLYGQLAKGQSPKFLVFACSDSRVCPSHILNFQPGEAFVVRNIANMVPPF 120

Query: 121 DTTKYSGTGAAIEYAILHLKVENIVVIGHSCCGGIKGLMSIPTDGTLSRQFLFLSLIEFI 180
           D TKYSG GAAIEYA+LHLKVENIVVIGHSCCGGIKGLMSIP DG  S         +FI
Sbjct: 121 DKTKYSGAGAAIEYAVLHLKVENIVVIGHSCCGGIKGLMSIPDDGAFSS--------DFI 180

Query: 181 ENWVKICTPAKAKVQLGFTDLSFEDKCTNCEKEAVNVSLGNLLTYPFVREAVINKQLFIR 240
           ENWV+ICTPAK K Q    DLSFEDKCT CEKEAVNVSLGNLL+YPFVREAV+NK++FIR
Sbjct: 181 ENWVQICTPAKNKTQSNCNDLSFEDKCTECEKEAVNVSLGNLLSYPFVREAVVNKKVFIR 240

Query: 241 GAHYNFVSGDFELWNLDFNISPSLVV 267
           GAHYNFVSG FELWNLDFNISPSL V
Sbjct: 241 GAHYNFVSGAFELWNLDFNISPSLAV 258

BLAST of Cp4.1LG01g15310 vs. NCBI nr
Match: gi|590694582|ref|XP_007044648.1| (Carbonic anhydrase 2, CA2 isoform 2 [Theobroma cacao])

HSP 1 Score: 395.6 bits (1015), Expect = 6.9e-107
Identity = 192/266 (72.18%), Positives = 220/266 (82.71%), Query Frame = 1

Query: 1   MAEESYEEAIAGLTKLLSEKANLQEAAAVRIRQITAELAGSNAESDGFDAVDRIKTGFTH 60
           M   SYEEAIA L+KLLS+KA+LQ  AA +I QITAEL  + A+ + FD V RI+TGF H
Sbjct: 1   MGSGSYEEAIAALSKLLSDKADLQSVAAAKIMQITAELEAA-ADPNQFDPVKRIETGFLH 60

Query: 61  FKKSKFETNPDLFGQLAKGQSPKFLVFACSDSRVCPSHILNFQPGEAFVVRNIASMVPPF 120
           FKK K+E NPDL+G+LAKGQSPKFLVFACSDSRVCPSHIL+FQPGEAF+VRNIA+MVPP+
Sbjct: 61  FKKEKYEKNPDLYGELAKGQSPKFLVFACSDSRVCPSHILDFQPGEAFMVRNIANMVPPY 120

Query: 121 DTTKYSGTGAAIEYAILHLKVENIVVIGHSCCGGIKGLMSIPTDGTLSRQFLFLSLIEFI 180
           D TKYSG GAAIEYA+LHLKVENIVVIGHSCCGGIKGLMSIP DGT +         +FI
Sbjct: 121 DKTKYSGVGAAIEYAVLHLKVENIVVIGHSCCGGIKGLMSIPDDGTTAS--------DFI 180

Query: 181 ENWVKICTPAKAKVQLGFTDLSFEDKCTNCEKEAVNVSLGNLLTYPFVREAVINKQLFIR 240
           E WV IC PAK KV+    DLSF ++CTNCEKEAVNVSLGNLLTYPFVREAV+ K L ++
Sbjct: 181 EQWVSICAPAKTKVKSECNDLSFSEQCTNCEKEAVNVSLGNLLTYPFVREAVVKKSLVLK 240

Query: 241 GAHYNFVSGDFELWNLDFNISPSLVV 267
           GAHY+FV G F+LWNLDFNI+P+L V
Sbjct: 241 GAHYDFVDGKFDLWNLDFNITPTLAV 257

BLAST of Cp4.1LG01g15310 vs. NCBI nr
Match: gi|590694578|ref|XP_007044647.1| (Carbonic anhydrase 2, CA2 isoform 1 [Theobroma cacao])

HSP 1 Score: 395.6 bits (1015), Expect = 6.9e-107
Identity = 192/266 (72.18%), Positives = 220/266 (82.71%), Query Frame = 1

Query: 1   MAEESYEEAIAGLTKLLSEKANLQEAAAVRIRQITAELAGSNAESDGFDAVDRIKTGFTH 60
           M   SYEEAIA L+KLLS+KA+LQ  AA +I QITAEL  + A+ + FD V RI+TGF H
Sbjct: 23  MGSGSYEEAIAALSKLLSDKADLQSVAAAKIMQITAELEAA-ADPNQFDPVKRIETGFLH 82

Query: 61  FKKSKFETNPDLFGQLAKGQSPKFLVFACSDSRVCPSHILNFQPGEAFVVRNIASMVPPF 120
           FKK K+E NPDL+G+LAKGQSPKFLVFACSDSRVCPSHIL+FQPGEAF+VRNIA+MVPP+
Sbjct: 83  FKKEKYEKNPDLYGELAKGQSPKFLVFACSDSRVCPSHILDFQPGEAFMVRNIANMVPPY 142

Query: 121 DTTKYSGTGAAIEYAILHLKVENIVVIGHSCCGGIKGLMSIPTDGTLSRQFLFLSLIEFI 180
           D TKYSG GAAIEYA+LHLKVENIVVIGHSCCGGIKGLMSIP DGT +         +FI
Sbjct: 143 DKTKYSGVGAAIEYAVLHLKVENIVVIGHSCCGGIKGLMSIPDDGTTAS--------DFI 202

Query: 181 ENWVKICTPAKAKVQLGFTDLSFEDKCTNCEKEAVNVSLGNLLTYPFVREAVINKQLFIR 240
           E WV IC PAK KV+    DLSF ++CTNCEKEAVNVSLGNLLTYPFVREAV+ K L ++
Sbjct: 203 EQWVSICAPAKTKVKSECNDLSFSEQCTNCEKEAVNVSLGNLLTYPFVREAVVKKSLVLK 262

Query: 241 GAHYNFVSGDFELWNLDFNISPSLVV 267
           GAHY+FV G F+LWNLDFNI+P+L V
Sbjct: 263 GAHYDFVDGKFDLWNLDFNITPTLAV 279

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)   Description
CAHC_TOBAC   5.3e-99   67.67          Carbonic anhydrase, chloroplastic OS=Nicotiana tabacum PE=2 SV=1
BCA2_ARATH   1.6e-95   64.64          Beta carbonic anhydrase 2, chloroplastic OS=Arabidopsis thaliana GN=BCA2 PE=1 SV...
CAHC_PEA     7.4e-93   62.41          Carbonic anhydrase, chloroplastic OS=Pisum sativum PE=1 SV=1
BCA1_ARATH   1.7e-92   62.98          Beta carbonic anhydrase 1, chloroplastic OS=Arabidopsis thaliana GN=BCA1 PE=1 SV...
BCA4_ARATH   2.4e-91   63.12          Beta carbonic anhydrase 4 OS=Arabidopsis thaliana GN=BCA4 PE=1 SV=1
Match Name         E-value    Identity (%)   Description
A0A0A0KVM4_CUCSA   6.1e-126   84.59          Carbonic anhydrase OS=Cucumis sativus GN=Csa_5G601560 PE=3 SV=1
E5GBJ5_CUCME       1.5e-124   83.46          Carbonic anhydrase OS=Cucumis melo subsp. melo PE=3 SV=1
A0A061EDW0_THECC   4.8e-107   72.18          Carbonic anhydrase OS=Theobroma cacao GN=TCM_010364 PE=3 SV=1
A0A061E630_THECC   4.8e-107   72.18          Carbonic anhydrase OS=Theobroma cacao GN=TCM_010364 PE=3 SV=1
A0A0D2RHC0_GOSRA   5.3e-106   71.43          Carbonic anhydrase OS=Gossypium raimondii GN=B456_005G182300 PE=3 SV=1
Match Name    E-value   Identity (%)   Description
AT5G14740.1   9.0e-97   64.64          carbonic anhydrase 2
AT3G01500.2   9.3e-94   62.98          carbonic anhydrase 1
AT1G70410.2   1.3e-92   63.12          beta carbonic anhydrase 4
AT1G23730.1   9.6e-91   60.90          beta carbonic anhydrase 3
AT4G33580.2   1.7e-47   42.56          beta carbonic anhydrase 5
Match Name                         E-value    Identity (%)   Description
gi|449434923|ref|XP_004135245.1|   8.7e-126   84.59          PREDICTED: carbonic anhydrase 2-like isoform X2 [Cucumis sativus]
gi|449434921|ref|XP_004135244.1|   8.7e-126   84.59          PREDICTED: carbonic anhydrase 2-like isoform X1 [Cucumis sativus]
gi|659090800|ref|XP_008446208.1|   2.1e-124   83.46          PREDICTED: carbonic anhydrase 2-like [Cucumis melo]
gi|590694582|ref|XP_007044648.1|   6.9e-107   72.18          Carbonic anhydrase 2, CA2 isoform 2 [Theobroma cacao]
gi|590694578|ref|XP_007044647.1|   6.9e-107   72.18          Carbonic anhydrase 2, CA2 isoform 1 [Theobroma cacao]
The following terms have been associated with this gene:
Vocabulary: Biological Process
Term         Definition
GO:0015976   carbon utilization
Vocabulary: Molecular Function
Term         Definition
GO:0008270   zinc ion binding
GO:0004089   carbonate dehydratase activity
Vocabulary: INTERPRO
Term        Definition
IPR015892   Carbonic_anhydrase_CS
IPR001765   Carbonic_anhydrase
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0015976       carbon utilization
biological_process   GO:0006807       nitrogen compound metabolic process
biological_process   GO:0006730       one-carbon metabolic process
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0004089       carbonate dehydratase activity
molecular_function   GO:0008270       zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
Cp4.1LG01g15310.1   Cp4.1LG01g15310.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                                       Source   Source Term         Source Description            Alignment
IPR001765  Carbonic anhydrase                                    GENE3D   G3DSA:3.40.1050.10  -                             coord: 46..258, score: 4.8
IPR001765  Carbonic anhydrase                                    PANTHER  PTHR11002           CARBONIC ANHYDRASE            coord: 1..266, score: 1.4E
IPR001765  Carbonic anhydrase                                    PFAM     PF00484             Pro_CA                        coord: 85..250, score: 4.3
IPR001765  Carbonic anhydrase                                    SMART    SM00947             Pro_CA_2                      coord: 77..255, score: 4.3
IPR001765  Carbonic anhydrase                                    unknown  SSF53056            beta-carbonic anhydrase, cab  coord: 47..262, score: 1.3
IPR015892  Carbonic anhydrase, prokaryotic-like, conserved site  PROSITE  PS00704             PROK_CO2_ANHYDRASE_1          coord: 89..96, scor
IPR015892  Carbonic anhydrase, prokaryotic-like, conserved site  PROSITE  PS00705             PROK_CO2_ANHYDRASE_2          coord: 133..153, scor
None       No IPR available                                      unknown  Coil                Coil                          coord: 6..26, scor
None       No IPR available                                      PANTHER  PTHR11002:SF15      CARBONIC ANHYDRASE            coord: 1..266, score: 1.4E
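
The `coord: start..end` values in the InterPro table can be compared directly to see how the predicted domains nest. The sketch below assumes these are 1-based, inclusive positions on the predicted protein, which the page itself does not state. The helper names are hypothetical.

```python
import re

# Matches the "coord: 85..250" notation used in the Alignment column.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Extract (start, end) from a 'coord: start..end' string."""
    match = COORD_RE.search(text)
    if match is None:
        raise ValueError("no coord field in %r" % text)
    return int(match.group(1)), int(match.group(2))


def span_length(start, end):
    """Residue count, assuming 1-based inclusive coordinates."""
    return end - start + 1


def contains(outer, inner):
    """True if the inner span lies entirely within the outer span."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


pfam = parse_coord("coord: 85..250")       # PF00484 Pro_CA
smart = parse_coord("coord: 77..255")      # SM00947 Pro_CA_2
prosite_1 = parse_coord("coord: 89..96")   # PS00704
prosite_2 = parse_coord("coord: 133..153") # PS00705

print(span_length(*pfam))
print(contains(smart, pfam))
print(contains(pfam, prosite_1), contains(pfam, prosite_2))
```

Under that assumption, both PROSITE conserved sites fall inside the Pfam domain, which in turn falls inside the SMART domain.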

The following gene(s) are paralogous to this gene:
Gene              Paralogue         Organism                    Block
Cp4.1LG01g15310   Cp4.1LG09g05050   Cucurbita pepo (Zucchini)   cpecpeB033
The following block(s) are covering this gene:
Gene              Organism                    Block
Cp4.1LG01g15310   Cucurbita pepo (Zucchini)   cpecpeB116
Cp4.1LG01g15310   Cucurbita pepo (Zucchini)   cpecpeB406
Cp4.1LG01g15310   Cucurbita maxima (Rimu)     cmacpeB186
Cp4.1LG01g15310   Cucurbita maxima (Rimu)     cmacpeB431
Cp4.1LG01g15310   Cucurbita maxima (Rimu)     cmacpeB776
Cp4.1LG01g15310   Cucurbita moschata (Rifu)   cmocpeB162
Cp4.1LG01g15310   Cucurbita moschata (Rifu)   cmocpeB731
Cp4.1LG01g15310   Silver-seed gourd           carcpeB0710
Cp4.1LG01g15310   Silver-seed gourd           carcpeB0888