Cp4.1LG01g06820 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g06820
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Gamma carbonic anhydrase
Location: Cp4.1LG01 : 3995882 .. 3999690 (-)
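
The location above can be read as a sequence ID, a start, an end, and a strand. The following is a minimal Python sketch for parsing it and computing the span, assuming the coordinates are 1-based and inclusive (a convention this record does not state). The helper names `parse_location` and `span_length` are hypothetical and not part of any database API.

```python
import re


def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into (seqid, start, end, strand)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1


seqid, start, end, strand = parse_location("Cp4.1LG01 : 3995882 .. 3999690 (-)")
print(seqid, strand, span_length(start, end))  # Cp4.1LG01 - 3809
```

Under that assumption the gene spans 3,809 bases on the minus strand of Cp4.1LG01.
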
The following sequences are available for this feature:

Gene sequence (with intron)

TTTTTAAAAATTAAAAAAAAAAAAAAAAAAAAAAACTTTTAGTCTTGGCCACAAATAAAATTGATAGGAAAAGATGAAATTAAAAAAAAAAAAAAAAAAAAAAATTATGAATTTAGGCACTATTCCTCCAGAATTCTACATTTCCCGCATCTGTAATTGAAGCTGAAGATTGAAGGGTGGAGGACGACGACGAACAAGTGTAGAAGATGGGAACATTGGGAAAGGCAATCTACACCGTCGGATTCTGGATCCGAGAGACCGGCCAAGCCCTCGATCGCCTTGGTTGCCGCCTCCAAGGAAACCACTACTTTCAAGAACAACGTAACCTTCTTAATTTCCACATTCTTCTTTACGATCTTCGAAGTTTCACCATTTTAATTGCTATTAGTGTTCTATTTTCCCGCTGATTTCAATTTGGTGTAATCTTTTGAGCTAATCGATGAAATGGAAAGACGTCATTTTTCATAACTTCTGTTTGGCATTTTTCTGGGCAGAAATAAAACCGCAATGTATATTTTGTAGGCTTTTCTAGGAAATTTGATTTCGCATGCCCTTTTACATCTTGACTAGATTTCAAGGAATTCACATTCGAACGAGGATCTTTTTTGGTTGGACGAGATATGTAGAAGGCATGAACTTAGATATTGTAATGTGATGTTTCGGCTTGTAGATATGCGTAAGAGGAATGTGCATCTGTGTACATGTTTATCTGGAGAAACTAGTATTTTTATGACATGAGGGTGTTGGTTTAATTAGGTTTCCATATTCTTCAGTTTAGGCTGTTCATTAGCTAATCTTTCAGAAAATAGCACCATGAATTTAGAGTGCTAAAGTTAGATATTAATTGTAGAGAACTTCCACTTCAGGAAGCTGTAGAACTTCTTCATCCTCTATTGCTCCATGATTTGGGATTATAGTAGTTTGTGATCGATGAAGATTCTGGATGTTCATTATAGTAGTTTGTGATTGATAATACATCCTTTTGTCCGTCTTCTTTTCTTTTGAACTAGTACCTCCATTACTGAGAACCCCAATGTCAAAGAAAATAGATTACAAATTAATGGCTGTTCCCAAACACTTTGCAAGCAAGGTCTGACGCAAAATGCTTCTCAAAAGGACATGAAAAATGGAAATAGTGGAACATTGATGGTGCATTATGTTATTCTATGACTTCCCACCACCTCTTGAGGGGCATCTTTGCCATCCCTTTCACCACACCTATCACAAATAGCCATCACATCCACAAGTCACGATCATCTCTATGTGCACTTCTACCCCCCACAATGGTGTTTACCGTCATTTTTAAATTCAATTTTGGGCATGTATGTATTTCGTACTTGATTGAGGTTTTATTATATACACTGCAAAACTGTACATGCCTTGGAAATTCATCTGTTGAACCCTTGCTTTTTTTGTGCTAAGGTTGAGTCATGCAAATATTGAAAGGCTTTACGTAAATTATTCAAATGACATTAGCTAATAAAGAGTCTTATTTGTTTAGTATCTAGGCACAGGACGCTCATGAACGTATTTGACAAGGCTCCTGTAGTTGACAGTGATGCATTTGTGGCGCCTAGTGCATCTATCATTGGTGATGTGCAGGTGGGACGGGGGTCCTCCATTTGGTATGGCTGTGTTTTGAGAGGTAAACTTAACATTATTTTTCTGAGCGGAGAAACAATTTCATTGATGAATTAGCTTAAATTTTTCCACTTTTGATAGTTTCTTGGATTTGGTTATTTCTAATGCTCTCTATTGGTGTAAATGTGTTAGTCTTTCTTCTTTAATATCCAATTAGAAATCTTTCATGTAATTCACTTTTTGGTGTTAGAGGTAAGCTTAACAGTAGCCTCTCTGAATTTGAAGATGTTCATGTTATTTTTGATCACCCACCTACCCACCCACTAAGTTAGTTTTTCTGCTTTTCTACTTTCCAGAAATCCTGATTATGGGCTTTACGTCTTATGCAAATTGTTTAAGAGTACCTCATCCATAATCTT
AACCCAAAATTGCCATTTTTAGGAAAAATATTATTGGCGTTTGAAATTCATTTGATTGAATGATTAGTGAGCTTGTTGAAGTAATGGTCACGCTGGAGAAAATATGATGAACGCATGTTTGGCACATCATGTTAAAAAATATGATAATCTGAAATTTCTTTGAGGCAAGAATATGAGAAGATCTCAAAGATGAAACTCATTGTAGAAGTTGTAAAAAAAATAAGTACAATTTTAGAAAATCAAAAGCCCAATGGTCATCAAATGGCGCCGTAAGTTTTAGTTTGTGATTAAGTATGAGGTCATGCAGATATTGAACTGTTTTTCCTGCATGCTTGCTTGTTCGTATTTACACATGTTTTGTGTTCCTAGGACTGATCTCTACCAGGACTCGTACTTTAATAGCTTTAAAGTTTAGAGGAACCTGCCAGTTTTTTTTTTTTTTTATGAGCAAAATGACCACTTCGTTGTTTGGCTGATGAGTATTTTAATGTAAATAGGACAATACTCCTGTTTATGCCAAAATTTGTTTGCGTGAGAACCAGTTGTTAAACAAGATTGCTTTTCTTCCTTACCTATCTTCTTCTGTATGTTTATATTTGTCTTAACTGCAAAATATTATGAAGGTGATGTTAACAGCATCAGTGTTGGCACTGGGACCAATATACAAGACAACTCATTAGTTCACGTAGCAAAATCCAATTTGAGTGGAAAGGTTTTACCAACTATCATTGGTGATAATGTTACTGTCGGTGAGTGATATAATCTCATTTCAAATTATGTATCCATCCCTTGAGAGGTGTGTGGTTATGGTTTCGAATTTGAACTAGACAGGTCACAGTGCTGTGTTACATGGATGTACTATCGAGGACGAGGCATTTGTGGGTATGGGAGCAACGTTGCTTGATGGAGTCTACGTAGAAAAACATGCAATGGTTGCTGCTGGAGCACTTGTGAGACAGAATACAAGGATCCCATGTGGAGAGGTCTCCCTATCTTATCATTTTACTTCGATATCTAGTGCTACCTTCTATGTTATCAATACTGCTTGGGTTCTGTCATGATTGAAGTAAAATATTCCGTGTTGCTAATGCAATCACAGGTATGGGGAGGAAATCCAGCAAGGTTCCTGAGGAAGCTTACAGAAGAAGAGATGGCCTTTTTCTCCCAGTCAGCCATTAATTATTCCAACTTATCACAGGTTCATGCAGCTGAAAATGTTAAGAGCTTTGATGAAATTGAATTTGAAAAGGTTCTTCGCAAGAAGTTTGCTCGTCGCGATGAAGAGTATGATTCGATGTTGGGCGTTGTTCGTGAAACCCCTCCAGAGCTTGTCCTTCCAGATAACATACTGGCTGATAAAGTACCAAGAGCTTCTTAAGACGTAATTTTTTAATGGAGGTGTTGCCCTTTGAAATATTTTGCTGATATTCATAAAAGACCCGACCTTCATCGCAAATAACATATGGAGGTTGGATGGATGAGCAAACGCCAAAGGTCACGAACAATGTCCTCTGGTTACAGCTCTAACAATCATCCTCCCCGAAACACCATCACTTTAGATGCTTCAATATTCTTGAAGCCATCCTTGGATTTGTTTCCAGGTTTTGAATAATTTTTTCACTGTTCCTGTTCGGAAATAAAGAACTTTTGGAAACAGGTGTATTTAAGGATTAAATCTATTTCGCTTCAACAGGCTTCTTGTTCTTATGTGTATCATATTTTTATAGATCTAACATGCAGTTCTAAACCTTAAAATTGTATTGGGAGTAGCAAAAGCCAACATTGGCATTGGCATTGGCATTGGCACCC

mRNA sequence

TTTTTAAAAATTAAAAAAAAAAAAAAAAAAAAAAACTTTTAGTCTTGGCCACAAATAAAATTGATAGGAAAAGATGAAATTAAAAAAAAAAAAAAAAAAAAAAATTATGAATTTAGGCACTATTCCTCCAGAATTCTACATTTCCCGCATCTGTAATTGAAGCTGAAGATTGAAGGGTGGAGGACGACGACGAACAAGTGTAGAAGATGGGAACATTGGGAAAGGCAATCTACACCGTCGGATTCTGGATCCGAGAGACCGGCCAAGCCCTCGATCGCCTTGGTTGCCGCCTCCAAGGAAACCACTACTTTCAAGAACAACTATCTAGGCACAGGACGCTCATGAACGTATTTGACAAGGCTCCTGTAGTTGACAGTGATGCATTTGTGGCGCCTAGTGCATCTATCATTGGTGATGTTAACAGCATCAGTGTTGGCACTGGGACCAATATACAAGACAACTCATTAGTTCACGTAGCAAAATCCAATTTGAGTGGAAAGGTTTTACCAACTATCATTGGTGATAATGTTACTGTCGGTCACAGTGCTGTGTTACATGGATGTACTATCGAGGACGAGGCATTTGTGGGTATGGGAGCAACGTTGCTTGATGGAGTCTACGTAGAAAAACATGCAATGGTTGCTGCTGGAGCACTTGTGAGACAGAATACAAGGATCCCATGTGGAGAGGTATGGGGAGGAAATCCAGCAAGGTTCCTGAGGAAGCTTACAGAAGAAGAGATGGCCTTTTTCTCCCAGTCAGCCATTAATTATTCCAACTTATCACAGGTTCATGCAGCTGAAAATGTTAAGAGCTTTGATGAAATTGAATTTGAAAAGGTTCTTCGCAAGAAGTTTGCTCGTCGCGATGAAGAGTATGATTCGATGTTGGGCGTTGTTCGTGAAACCCCTCCAGAGCTTGTCCTTCCAGATAACATACTGGCTGATAAAGTACCAAGAGCTTCTTAAGACGTAATTTTTTAATGGAGGTGTTGCCCTTTGAAATATTTTGCTGATATTCATAAAAGACCCGACCTTCATCGCAAATAACATATGGAGGTTGGATGGATGAGCAAACGCCAAAGGTCACGAACAATGTCCTCTGGTTACAGCTCTAACAATCATCCTCCCCGAAACACCATCACTTTAGATGCTTCAATATTCTTGAAGCCATCCTTGGATTTGTTTCCAGGTTTTGAATAATTTTTTCACTGTTCCTGTTCGGAAATAAAGAACTTTTGGAAACAGGTGTATTTAAGGATTAAATCTATTTCGCTTCAACAGGCTTCTTGTTCTTATGTGTATCATATTTTTATAGATCTAACATGCAGTTCTAAACCTTAAAATTGTATTGGGAGTAGCAAAAGCCAACATTGGCATTGGCATTGGCATTGGCACCC

Coding sequence (CDS)

ATGGGAACATTGGGAAAGGCAATCTACACCGTCGGATTCTGGATCCGAGAGACCGGCCAAGCCCTCGATCGCCTTGGTTGCCGCCTCCAAGGAAACCACTACTTTCAAGAACAACTATCTAGGCACAGGACGCTCATGAACGTATTTGACAAGGCTCCTGTAGTTGACAGTGATGCATTTGTGGCGCCTAGTGCATCTATCATTGGTGATGTTAACAGCATCAGTGTTGGCACTGGGACCAATATACAAGACAACTCATTAGTTCACGTAGCAAAATCCAATTTGAGTGGAAAGGTTTTACCAACTATCATTGGTGATAATGTTACTGTCGGTCACAGTGCTGTGTTACATGGATGTACTATCGAGGACGAGGCATTTGTGGGTATGGGAGCAACGTTGCTTGATGGAGTCTACGTAGAAAAACATGCAATGGTTGCTGCTGGAGCACTTGTGAGACAGAATACAAGGATCCCATGTGGAGAGGTATGGGGAGGAAATCCAGCAAGGTTCCTGAGGAAGCTTACAGAAGAAGAGATGGCCTTTTTCTCCCAGTCAGCCATTAATTATTCCAACTTATCACAGGTTCATGCAGCTGAAAATGTTAAGAGCTTTGATGAAATTGAATTTGAAAAGGTTCTTCGCAAGAAGTTTGCTCGTCGCGATGAAGAGTATGATTCGATGTTGGGCGTTGTTCGTGAAACCCCTCCAGAGCTTGTCCTTCCAGATAACATACTGGCTGATAAAGTACCAAGAGCTTCTTAA

Protein sequence

MGTLGKAIYTVGFWIRETGQALDRLGCRLQGNHYFQEQLSRHRTLMNVFDKAPVVDSDAFVAPSASIIGDVNSISVGTGTNIQDNSLVHVAKSNLSGKVLPTIIGDNVTVGHSAVLHGCTIEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEVWGGNPARFLRKLTEEEMAFFSQSAINYSNLSQVHAAENVKSFDEIEFEKVLRKKFARRDEEYDSMLGVVRETPPELVLPDNILADKVPRAS
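
The protein sequence should follow from the CDS by codon-by-codon translation. Below is a small Python sketch of that check, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. Only the first 30 and last 18 nucleotides of the CDS above are used to keep the example short, and `translate` is an illustrative helper, not a library function.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGGAACATTGGGAAAGGCAATCTACACC"  # first 30 nt of the CDS
cds_end = "GTACCAAGAGCTTCTTAA"                # last 18 nt, including TAA stop
print(translate(cds_start))  # MGTLGKAIYT
print(translate(cds_end))    # VPRAS
```

Both fragments agree with the start (MGTLGKAIYT) and end (VPRAS) of the protein sequence listed above.
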
BLAST of Cp4.1LG01g06820 vs. Swiss-Prot
Match: GCA1_ARATH (Gamma carbonic anhydrase 1, mitochondrial OS=Arabidopsis thaliana GN=GAMMACA1 PE=1 SV=1)

HSP 1 Score: 417.5 bits (1072), Expect = 1.0e-115
Identity = 209/266 (78.57%), Positives = 229/266 (86.09%), Query Frame = 1

Query: 1   MGTLGKAIYTVGFWIRETGQALDRLGCRLQGNHYFQEQLSRHRTLMNVFDKAPVVDSDAF 60
           MGTLG+A Y+VGFWIRETGQALDRLGCRLQG +YF+EQLSRHRTLMNVFDKAP+VD +AF
Sbjct: 1   MGTLGRAFYSVGFWIRETGQALDRLGCRLQGKNYFREQLSRHRTLMNVFDKAPIVDKEAF 60

Query: 61  VAPSASIIGD------------------VNSISVGTGTNIQDNSLVHVAKSNLSGKVLPT 120
           VAPSAS+IGD                  VN++SVG+GTNIQDNSLVHVAKSNLSGKV PT
Sbjct: 61  VAPSASVIGDVHIGRGSSIWYGCVLRGDVNTVSVGSGTNIQDNSLVHVAKSNLSGKVHPT 120

Query: 121 IIGDNVTVGHSAVLHGCTIEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEV 180
           IIGDNVT+GHSAVLHGCT+EDE F+GMGATLLDGV VEKH MVAAGALVRQNTRIP GEV
Sbjct: 121 IIGDNVTIGHSAVLHGCTVEDETFIGMGATLLDGVVVEKHGMVAAGALVRQNTRIPSGEV 180

Query: 181 WGGNPARFLRKLTEEEMAFFSQSAINYSNLSQVHAAENVKSFDEIEFEKVLRKKFARRDE 240
           WGGNPARFLRKLT+EE+AF SQSA NYSNL+Q HAAEN K  + IEFEKVLRKK A +DE
Sbjct: 181 WGGNPARFLRKLTDEEIAFISQSATNYSNLAQAHAAENAKPLNVIEFEKVLRKKHALKDE 240

Query: 241 EYDSMLGVVRETPPELVLPDNILADK 249
           EYDSMLG+VRETPPEL LP+NIL DK
Sbjct: 241 EYDSMLGIVRETPPELNLPNNILPDK 266
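
The percentages in the HSP summary for this hit are the match counts divided by the alignment length. A minimal Python sketch that recomputes them from the summary text follows; `percent_from_counts` is a hypothetical helper, and the input strings are copied from the GCA1_ARATH hit above.

```python
import re


def percent_from_counts(line, label):
    """Recompute a percentage from a 'Label = matches/length' fragment."""
    m = re.search(rf"{re.escape(label)}\s*=\s*(\d+)/(\d+)", line)
    if m is None:
        raise ValueError(f"{label!r} not found in {line!r}")
    matches, length = int(m.group(1)), int(m.group(2))
    return round(100.0 * matches / length, 2)


print(percent_from_counts("Identity = 209/266 (78.57%)", "Identity"))  # 78.57
print(percent_from_counts("Positives = 229/266 (86.09%)", "Positives"))  # 86.09
```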

BLAST of Cp4.1LG01g06820 vs. Swiss-Prot
Match: GCA2_ARATH (Gamma carbonic anhydrase 2, mitochondrial OS=Arabidopsis thaliana GN=GAMMACA2 PE=1 SV=1)

HSP 1 Score: 394.0 bits (1011), Expect = 1.2e-108
Identity = 193/270 (71.48%), Positives = 228/270 (84.44%), Query Frame = 1

Query: 1   MGTLGKAIYTVGFWIRETGQALDRLGCRLQGNHYFQEQLSRHRTLMNVFDKAPVVDSDAF 60
           MGTLG+AIYTVG WIR TGQALDR+G  LQG+H  +E LSRHRTLMNVFDK+P+VD D F
Sbjct: 1   MGTLGRAIYTVGNWIRGTGQALDRVGSLLQGSHRIEEHLSRHRTLMNVFDKSPLVDKDVF 60

Query: 61  VAPSASIIGDV------------------NSISVGTGTNIQDNSLVHVAKSNLSGKVLPT 120
           VAPSAS+IGDV                  N+ISVG+GTNIQDN+LVHVAK+N+SGKVLPT
Sbjct: 61  VAPSASVIGDVQIGKGSSIWYGCVLRGDVNNISVGSGTNIQDNTLVHVAKTNISGKVLPT 120

Query: 121 IIGDNVTVGHSAVLHGCTIEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEV 180
           +IGDNVTVGHSAV+HGCT+ED+AFVGMGATLLDGV VEKHAMVAAG+LV+QNTRIP GEV
Sbjct: 121 LIGDNVTVGHSAVIHGCTVEDDAFVGMGATLLDGVVVEKHAMVAAGSLVKQNTRIPSGEV 180

Query: 181 WGGNPARFLRKLTEEEMAFFSQSAINYSNLSQVHAAENVKSFDEIEFEKVLRKKFARRDE 240
           WGGNPA+F+RKLT+EE+ + SQSA NY NL+Q+HA+EN KSF++IE E+ LRKK+AR+DE
Sbjct: 181 WGGNPAKFMRKLTDEEIVYISQSAKNYINLAQIHASENSKSFEQIEVERALRKKYARKDE 240

Query: 241 EYDSMLGVVRETPPELVLPDNILADKVPRA 253
           +YDSMLG+ RETPPEL+LPDN+L    P A
Sbjct: 241 DYDSMLGITRETPPELILPDNVLPGGKPVA 270

BLAST of Cp4.1LG01g06820 vs. Swiss-Prot
Match: GCA3_ARATH (Gamma carbonic anhydrase 3, mitochondrial OS=Arabidopsis thaliana GN=GAMMACA3 PE=1 SV=1)

HSP 1 Score: 359.8 bits (922), Expect = 2.5e-98
Identity = 182/262 (69.47%), Positives = 210/262 (80.15%), Query Frame = 1

Query: 1   MGTLGKAIYTVGFWIRETGQALDRLGCRLQGNHYFQEQLSRHRTLMNVFDKAPVVDSDAF 60
           MGT+GKA Y+VGFWIRETGQALDRLGCRLQG ++F+EQLSRHRTLMNVFDK P VD  AF
Sbjct: 1   MGTMGKAFYSVGFWIRETGQALDRLGCRLQGKNHFREQLSRHRTLMNVFDKTPNVDKGAF 60

Query: 61  VAPSASIIGDV------------------NSISVGTGTNIQDNSLVHVAKSNLSGKVLPT 120
           VAP+AS+ GDV                  NSISVG GTNIQDN+LVHVAK+NLSGKVLPT
Sbjct: 61  VAPNASLSGDVHVGRGSSIWYGCVLRGDANSISVGAGTNIQDNALVHVAKTNLSGKVLPT 120

Query: 121 IIGDNVTVGHSAVLHGCTIEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEV 180
           +IGDNVT+GHSAVLHGCT+EDEA++G  AT+LDG +VEKHAMVA+GALVRQNTRIP GEV
Sbjct: 121 VIGDNVTIGHSAVLHGCTVEDEAYIGTSATVLDGAHVEKHAMVASGALVRQNTRIPSGEV 180

Query: 181 WGGNPARFLRKLTEEEMAFFSQSAINYSNLSQVHAAENVKSFDEIEFEKVLRKKFARRDE 240
           WGGNPA+FLRK+TEEE  FFS SA+ YSNL+Q HA EN K+ DE EF+K+L KK A RD 
Sbjct: 181 WGGNPAKFLRKVTEEERVFFSSSAVEYSNLAQAHATENAKNLDEAEFKKLLNKKNA-RDT 240

Query: 241 EYDSMLGVVRETPPELVLPDNI 245
           EYDS+L        +L LP+N+
Sbjct: 241 EYDSVL-------DDLTLPENV 254

BLAST of Cp4.1LG01g06820 vs. Swiss-Prot
Match: Y2881_DICDI (Uncharacterized protein DDB_G0288155 OS=Dictyostelium discoideum GN=DDB_G0288155 PE=1 SV=1)

HSP 1 Score: 145.2 bits (365), Expect = 9.7e-34
Identity = 83/236 (35.17%), Positives = 132/236 (55.93%), Query Frame = 1

Query: 11  VGFWIRETGQALDRLGCRLQGNHYFQEQLSRHRTLMNVFDKAPVVDSDAFVAPSASIIGD 70
           +G  ++ TG  L R GC++QG++ + E+L+RH  L    D AP+V   +F+AP+ASIIGD
Sbjct: 10  LGEVVKNTGLILHRTGCKMQGDYAYVEKLNRHTRLTAFGDNAPIVGQKSFIAPNASIIGD 69

Query: 71  V------------------NSISVGTGTNIQDNSLVHVAKSNLSGKVLPTIIGDNVTVGH 130
           V                  NSI +G  T + D ++VH + +   G   PT IGD V +G 
Sbjct: 70  VVIGKESSIWYNAVLRGDVNSIHIGDKTVVSDRTVVHCSSNGPLGPK-PTQIGDKVYIGP 129

Query: 131 SAVLHGCTIEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEVWGGNPARFLR 190
            +++H  TI  E+F+G G+TL DG  VEK+  + AG+L+     I  GE WGG+PA+F+R
Sbjct: 130 GSIVHAATILGESFIGTGSTLCDGSVVEKNGFLEAGSLLTAGKTIKSGEYWGGSPAKFIR 189

Query: 191 KLTEEEMAFFSQSAINYSNLSQVHAAENVKSFDEIEFEKVLRKKFARRDEEYDSML 229
           ++T+++ +   +      NLS+ H  +  KS  E+  +  L +K+ +     D +L
Sbjct: 190 QVTKDDESQLEKIIEQNINLSEQHEKQTSKSAKELNND--LLQKYVKNRTRSDHIL 242

BLAST of Cp4.1LG01g06820 vs. Swiss-Prot
Match: YRDA_SHIFL (Protein YrdA OS=Shigella flexneri GN=yrdA PE=3 SV=1)

HSP 1 Score: 114.4 bits (285), Expect = 1.8e-24
Identity = 61/133 (45.86%), Positives = 84/133 (63.16%), Query Frame = 1

Query: 58  DAFVAPSASIIGDVNSISVGTGTNIQDNSLVHVA-KSNLSGKVLPTIIGDNVTVGHSAVL 117
           D  + P   I GDV+ + +G  TNIQD S++HV  KS+ +    P  IG++VTVGH  +L
Sbjct: 36  DVGIWPLVVIRGDVHYVQIGARTNIQDGSMLHVTHKSSYNPDGNPLTIGEDVTVGHKVML 95

Query: 118 HGCTIEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEVWGGNPARFLRKLTE 177
           HGCTI +   VGMG+ LLDG  VE   M+ AG+LV QN R+  G ++ G+P + +R L++
Sbjct: 96  HGCTIGNRVLVGMGSILLDGAIVEDDVMIGAGSLVPQNKRLESGYLYLGSPVKQIRPLSD 155

Query: 178 EEMAFFSQSAINY 190
           EE A    SA NY
Sbjct: 156 EEKAGLRYSANNY 168

BLAST of Cp4.1LG01g06820 vs. TrEMBL
Match: A0A0A0KWN9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G046620 PE=4 SV=1)

HSP 1 Score: 471.9 bits (1213), Expect = 5.0e-130
Identity = 240/271 (88.56%), Positives = 248/271 (91.51%), Query Frame = 1

Query: 1   MGTLGKAIYTVGFWIRETGQALDRLGCRLQGNHYFQEQLSRHRTLMNVFDKAPVVDSDAF 60
           MGTLGKAIYTVGFWIRETGQALDRLGCRLQGN+YFQEQLSRHRTLMN+FDKAPVVD DAF
Sbjct: 1   MGTLGKAIYTVGFWIRETGQALDRLGCRLQGNYYFQEQLSRHRTLMNIFDKAPVVDKDAF 60

Query: 61  VAPSASIIGD------------------VNSISVGTGTNIQDNSLVHVAKSNLSGKVLPT 120
           VAPSASIIGD                  VNSISVG+GTNIQDNSLVHVAKSNLSGKVLPT
Sbjct: 61  VAPSASIIGDVQVGRGSSIWYGCVLRGDVNSISVGSGTNIQDNSLVHVAKSNLSGKVLPT 120

Query: 121 IIGDNVTVGHSAVLHGCTIEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEV 180
           IIGDNVTVGHSAVLHGCTIEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTR+PCGEV
Sbjct: 121 IIGDNVTVGHSAVLHGCTIEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRVPCGEV 180

Query: 181 WGGNPARFLRKLTEEEMAFFSQSAINYSNLSQVHAAENVKSFDEIEFEKVLRKKFARRDE 240
           WGGNPA+FLRKLTEEEM F SQSAINYSNLSQVHAAENVKSFDEIE EKVLRKKFARRDE
Sbjct: 181 WGGNPAKFLRKLTEEEMVFISQSAINYSNLSQVHAAENVKSFDEIELEKVLRKKFARRDE 240

Query: 241 EYDSMLGVVRETPPELVLPDNILADKVPRAS 254
           +YDSMLGVVRETPPELVLPDNILADKV ++S
Sbjct: 241 DYDSMLGVVRETPPELVLPDNILADKVAKSS 271

BLAST of Cp4.1LG01g06820 vs. TrEMBL
Match: A0A0B0PEC8_GOSAR (Uncharacterized protein OS=Gossypium arboreum GN=F383_08437 PE=4 SV=1)

HSP 1 Score: 463.8 bits (1192), Expect = 1.4e-127
Identity = 232/271 (85.61%), Positives = 246/271 (90.77%), Query Frame = 1

Query: 1   MGTLGKAIYTVGFWIRETGQALDRLGCRLQGNHYFQEQLSRHRTLMNVFDKAPVVDSDAF 60
           MGTLGKAIYTVGFWIRETGQ LDRLGCRLQGN+YFQEQLSRHRTLMNVFDKAPVVD DAF
Sbjct: 1   MGTLGKAIYTVGFWIRETGQTLDRLGCRLQGNYYFQEQLSRHRTLMNVFDKAPVVDRDAF 60

Query: 61  VAPSASIIGDV------------------NSISVGTGTNIQDNSLVHVAKSNLSGKVLPT 120
           VAPSAS+IGDV                  N+IS+G+GTNIQDNSLVHVAKSNLSGKVLPT
Sbjct: 61  VAPSASVIGDVQVGRGSSIWYGCVLRGDVNNISIGSGTNIQDNSLVHVAKSNLSGKVLPT 120

Query: 121 IIGDNVTVGHSAVLHGCTIEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEV 180
           IIG NVTVGHSAVLHGCT+EDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEV
Sbjct: 121 IIGSNVTVGHSAVLHGCTVEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEV 180

Query: 181 WGGNPARFLRKLTEEEMAFFSQSAINYSNLSQVHAAENVKSFDEIEFEKVLRKKFARRDE 240
           WGGNPA+FLRKLTEEEMAF SQSA+NYSNL+QVHAAEN KSFDEIEFEK+LRKKFARRDE
Sbjct: 181 WGGNPAKFLRKLTEEEMAFISQSALNYSNLAQVHAAENAKSFDEIEFEKMLRKKFARRDE 240

Query: 241 EYDSMLGVVRETPPELVLPDNILADKVPRAS 254
           EYDSMLGVVRETPPEL+LPDNIL +KVP+ +
Sbjct: 241 EYDSMLGVVRETPPELILPDNILPNKVPKTA 271

BLAST of Cp4.1LG01g06820 vs. TrEMBL
Match: A0A061DFL1_THECC (Gamma carbonic anhydrase 1, CA1 OS=Theobroma cacao GN=TCM_000346 PE=4 SV=1)

HSP 1 Score: 463.0 bits (1190), Expect = 2.3e-127
Identity = 234/271 (86.35%), Positives = 246/271 (90.77%), Query Frame = 1

Query: 1   MGTLGKAIYTVGFWIRETGQALDRLGCRLQGNHYFQEQLSRHRTLMNVFDKAPVVDSDAF 60
           MGTLGKAIY+VGFWIRETGQALDRLGCRLQGN+YFQEQLSRHRTLMNVF+KAPVVD DAF
Sbjct: 1   MGTLGKAIYSVGFWIRETGQALDRLGCRLQGNYYFQEQLSRHRTLMNVFNKAPVVDRDAF 60

Query: 61  VAPSASIIGD------------------VNSISVGTGTNIQDNSLVHVAKSNLSGKVLPT 120
           VAPSASIIGD                  VNSIS+G+GTNIQDNSLVHVAKSNLSGKVLPT
Sbjct: 61  VAPSASIIGDVQVGRGSSIWYGCVLRGDVNSISIGSGTNIQDNSLVHVAKSNLSGKVLPT 120

Query: 121 IIGDNVTVGHSAVLHGCTIEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEV 180
           IIG NVTVGHSAVLHGCT+EDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEV
Sbjct: 121 IIGSNVTVGHSAVLHGCTVEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEV 180

Query: 181 WGGNPARFLRKLTEEEMAFFSQSAINYSNLSQVHAAENVKSFDEIEFEKVLRKKFARRDE 240
           WGGNPA+FLRKLTEEEMAF SQSA+NYSNL+QVHAAEN KSFDEIEFEKVLRKKFARRDE
Sbjct: 181 WGGNPAKFLRKLTEEEMAFISQSALNYSNLAQVHAAENAKSFDEIEFEKVLRKKFARRDE 240

Query: 241 EYDSMLGVVRETPPELVLPDNILADKVPRAS 254
           EYDSMLGVVRE PPEL+LPDNILADKV + +
Sbjct: 241 EYDSMLGVVREMPPELILPDNILADKVSKTA 271

BLAST of Cp4.1LG01g06820 vs. TrEMBL
Match: A0A059ATF4_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_I02417 PE=4 SV=1)

HSP 1 Score: 463.0 bits (1190), Expect = 2.3e-127
Identity = 231/271 (85.24%), Positives = 246/271 (90.77%), Query Frame = 1

Query: 1   MGTLGKAIYTVGFWIRETGQALDRLGCRLQGNHYFQEQLSRHRTLMNVFDKAPVVDSDAF 60
           MGTLGKAIY+VGFWIRETGQALDRLGCRLQG++YFQEQLSRHRTLMNVFDKAPV+D D F
Sbjct: 1   MGTLGKAIYSVGFWIRETGQALDRLGCRLQGSYYFQEQLSRHRTLMNVFDKAPVIDKDVF 60

Query: 61  VAPSASIIGD------------------VNSISVGTGTNIQDNSLVHVAKSNLSGKVLPT 120
           VAPSASIIGD                  VN+IS+G GTNIQDNSLVHVAKSNLSGKVLPT
Sbjct: 61  VAPSASIIGDIQIGRGSSIWYGCVLRGDVNNISIGAGTNIQDNSLVHVAKSNLSGKVLPT 120

Query: 121 IIGDNVTVGHSAVLHGCTIEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEV 180
           IIGDNVTVGHSAVLHGCT+EDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTR+PCGEV
Sbjct: 121 IIGDNVTVGHSAVLHGCTVEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRVPCGEV 180

Query: 181 WGGNPARFLRKLTEEEMAFFSQSAINYSNLSQVHAAENVKSFDEIEFEKVLRKKFARRDE 240
           WGGNPA+FLRKLTEEEMAF SQSA+NYSNL+QVHAAEN KSFDEIEFEKVLRKKFARRDE
Sbjct: 181 WGGNPAKFLRKLTEEEMAFISQSAVNYSNLAQVHAAENAKSFDEIEFEKVLRKKFARRDE 240

Query: 241 EYDSMLGVVRETPPELVLPDNILADKVPRAS 254
           EYDSMLGVVRETPPEL+LPDNIL DK+ +A+
Sbjct: 241 EYDSMLGVVRETPPELILPDNILQDKLQKAT 271

BLAST of Cp4.1LG01g06820 vs. TrEMBL
Match: A0A0D2MHW9_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_003G013500 PE=4 SV=1)

HSP 1 Score: 462.6 bits (1189), Expect = 3.1e-127
Identity = 231/271 (85.24%), Positives = 246/271 (90.77%), Query Frame = 1

Query: 1   MGTLGKAIYTVGFWIRETGQALDRLGCRLQGNHYFQEQLSRHRTLMNVFDKAPVVDSDAF 60
           MGTLGKAIYTVGFW+RETGQALDRLGCRLQGN+YFQEQLSRHRTLMNVFDKAP VD DAF
Sbjct: 1   MGTLGKAIYTVGFWVRETGQALDRLGCRLQGNYYFQEQLSRHRTLMNVFDKAPFVDRDAF 60

Query: 61  VAPSASIIGD------------------VNSISVGTGTNIQDNSLVHVAKSNLSGKVLPT 120
           VAPSAS+IGD                  VNSIS+G+GTNIQDNSLVHVAKSNLSGKVLPT
Sbjct: 61  VAPSASVIGDVQVGRGSSIWYGCVLRGDVNSISIGSGTNIQDNSLVHVAKSNLSGKVLPT 120

Query: 121 IIGDNVTVGHSAVLHGCTIEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEV 180
           IIG NVTVGHSAVLHGCT+EDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEV
Sbjct: 121 IIGSNVTVGHSAVLHGCTVEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEV 180

Query: 181 WGGNPARFLRKLTEEEMAFFSQSAINYSNLSQVHAAENVKSFDEIEFEKVLRKKFARRDE 240
           WGGNPA+FLRKLTEEEMAF SQSA+NYSNL+QVHAAEN KSFDEIEFEK+LRKKFARRDE
Sbjct: 181 WGGNPAKFLRKLTEEEMAFISQSALNYSNLAQVHAAENGKSFDEIEFEKMLRKKFARRDE 240

Query: 241 EYDSMLGVVRETPPELVLPDNILADKVPRAS 254
           EYDSMLGVVRETPPEL+LPDN+L +KVP+ +
Sbjct: 241 EYDSMLGVVRETPPELILPDNVLPNKVPKTA 271

BLAST of Cp4.1LG01g06820 vs. TAIR10
Match: AT1G19580.1 (AT1G19580.1 gamma carbonic anhydrase 1)

HSP 1 Score: 417.5 bits (1072), Expect = 5.7e-117
Identity = 209/266 (78.57%), Positives = 229/266 (86.09%), Query Frame = 1

Query: 1   MGTLGKAIYTVGFWIRETGQALDRLGCRLQGNHYFQEQLSRHRTLMNVFDKAPVVDSDAF 60
           MGTLG+A Y+VGFWIRETGQALDRLGCRLQG +YF+EQLSRHRTLMNVFDKAP+VD +AF
Sbjct: 1   MGTLGRAFYSVGFWIRETGQALDRLGCRLQGKNYFREQLSRHRTLMNVFDKAPIVDKEAF 60

Query: 61  VAPSASIIGD------------------VNSISVGTGTNIQDNSLVHVAKSNLSGKVLPT 120
           VAPSAS+IGD                  VN++SVG+GTNIQDNSLVHVAKSNLSGKV PT
Sbjct: 61  VAPSASVIGDVHIGRGSSIWYGCVLRGDVNTVSVGSGTNIQDNSLVHVAKSNLSGKVHPT 120

Query: 121 IIGDNVTVGHSAVLHGCTIEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEV 180
           IIGDNVT+GHSAVLHGCT+EDE F+GMGATLLDGV VEKH MVAAGALVRQNTRIP GEV
Sbjct: 121 IIGDNVTIGHSAVLHGCTVEDETFIGMGATLLDGVVVEKHGMVAAGALVRQNTRIPSGEV 180

Query: 181 WGGNPARFLRKLTEEEMAFFSQSAINYSNLSQVHAAENVKSFDEIEFEKVLRKKFARRDE 240
           WGGNPARFLRKLT+EE+AF SQSA NYSNL+Q HAAEN K  + IEFEKVLRKK A +DE
Sbjct: 181 WGGNPARFLRKLTDEEIAFISQSATNYSNLAQAHAAENAKPLNVIEFEKVLRKKHALKDE 240

Query: 241 EYDSMLGVVRETPPELVLPDNILADK 249
           EYDSMLG+VRETPPEL LP+NIL DK
Sbjct: 241 EYDSMLGIVRETPPELNLPNNILPDK 266

BLAST of Cp4.1LG01g06820 vs. TAIR10
Match: AT1G47260.1 (AT1G47260.1 gamma carbonic anhydrase 2)

HSP 1 Score: 394.0 bits (1011), Expect = 6.8e-110
Identity = 193/270 (71.48%), Positives = 228/270 (84.44%), Query Frame = 1

Query: 1   MGTLGKAIYTVGFWIRETGQALDRLGCRLQGNHYFQEQLSRHRTLMNVFDKAPVVDSDAF 60
           MGTLG+AIYTVG WIR TGQALDR+G  LQG+H  +E LSRHRTLMNVFDK+P+VD D F
Sbjct: 1   MGTLGRAIYTVGNWIRGTGQALDRVGSLLQGSHRIEEHLSRHRTLMNVFDKSPLVDKDVF 60

Query: 61  VAPSASIIGDV------------------NSISVGTGTNIQDNSLVHVAKSNLSGKVLPT 120
           VAPSAS+IGDV                  N+ISVG+GTNIQDN+LVHVAK+N+SGKVLPT
Sbjct: 61  VAPSASVIGDVQIGKGSSIWYGCVLRGDVNNISVGSGTNIQDNTLVHVAKTNISGKVLPT 120

Query: 121 IIGDNVTVGHSAVLHGCTIEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEV 180
           +IGDNVTVGHSAV+HGCT+ED+AFVGMGATLLDGV VEKHAMVAAG+LV+QNTRIP GEV
Sbjct: 121 LIGDNVTVGHSAVIHGCTVEDDAFVGMGATLLDGVVVEKHAMVAAGSLVKQNTRIPSGEV 180

Query: 181 WGGNPARFLRKLTEEEMAFFSQSAINYSNLSQVHAAENVKSFDEIEFEKVLRKKFARRDE 240
           WGGNPA+F+RKLT+EE+ + SQSA NY NL+Q+HA+EN KSF++IE E+ LRKK+AR+DE
Sbjct: 181 WGGNPAKFMRKLTDEEIVYISQSAKNYINLAQIHASENSKSFEQIEVERALRKKYARKDE 240

Query: 241 EYDSMLGVVRETPPELVLPDNILADKVPRA 253
           +YDSMLG+ RETPPEL+LPDN+L    P A
Sbjct: 241 DYDSMLGITRETPPELILPDNVLPGGKPVA 270

BLAST of Cp4.1LG01g06820 vs. TAIR10
Match: AT5G66510.2 (AT5G66510.2 gamma carbonic anhydrase 3)

HSP 1 Score: 355.5 bits (911), Expect = 2.7e-98
Identity = 182/273 (66.67%), Positives = 210/273 (76.92%), Query Frame = 1

Query: 1   MGTLGKAIYTVGFWIRETGQALDRLGCRLQGNHYFQEQLSRHRTLMNVFDKAPVVDSDAF 60
           MGT+GKA Y+VGFWIRETGQALDRLGCRLQG ++F+EQLSRHRTLMNVFDK P VD  AF
Sbjct: 1   MGTMGKAFYSVGFWIRETGQALDRLGCRLQGKNHFREQLSRHRTLMNVFDKTPNVDKGAF 60

Query: 61  VAPSASIIGDV-----------------------------NSISVGTGTNIQDNSLVHVA 120
           VAP+AS+ GDV                             NSISVG GTNIQDN+LVHVA
Sbjct: 61  VAPNASLSGDVHVGRGSSIWYGCVLRDIPFDLMTDSAGDANSISVGAGTNIQDNALVHVA 120

Query: 121 KSNLSGKVLPTIIGDNVTVGHSAVLHGCTIEDEAFVGMGATLLDGVYVEKHAMVAAGALV 180
           K+NLSGKVLPT+IGDNVT+GHSAVLHGCT+EDEA++G  AT+LDG +VEKHAMVA+GALV
Sbjct: 121 KTNLSGKVLPTVIGDNVTIGHSAVLHGCTVEDEAYIGTSATVLDGAHVEKHAMVASGALV 180

Query: 181 RQNTRIPCGEVWGGNPARFLRKLTEEEMAFFSQSAINYSNLSQVHAAENVKSFDEIEFEK 240
           RQNTRIP GEVWGGNPA+FLRK+TEEE  FFS SA+ YSNL+Q HA EN K+ DE EF+K
Sbjct: 181 RQNTRIPSGEVWGGNPAKFLRKVTEEERVFFSSSAVEYSNLAQAHATENAKNLDEAEFKK 240

Query: 241 VLRKKFARRDEEYDSMLGVVRETPPELVLPDNI 245
           +L KK A RD EYDS+L        +L LP+N+
Sbjct: 241 LLNKKNA-RDTEYDSVL-------DDLTLPENV 265

BLAST of Cp4.1LG01g06820 vs. TAIR10
Match: AT3G48680.1 (AT3G48680.1 gamma carbonic anhydrase-like 2)

HSP 1 Score: 106.7 bits (265), Expect = 2.2e-23
Identity = 62/165 (37.58%), Positives = 94/165 (56.97%), Query Frame = 1

Query: 53  PVVDSDAFVAPS------------------ASIIGDVNSISVGTGTNIQDNSLVHVAKSN 112
           P V  DA+VAP+                  A + GD+N I+VG  +N+Q+  +VH A S+
Sbjct: 70  PKVAVDAYVAPNVVLAGQVTVWDGSSVWNGAVLRGDLNKITVGFCSNVQERCVVHAAWSS 129

Query: 113 LSGKVLPTIIGDNVTVGHSAVLHGCTIEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQN 172
            +G    T+I   VTVG  ++L  CTIE E  +G  + L++G  VE  +++ AG+++   
Sbjct: 130 PTGLPAQTLIDRYVTVGAYSLLRSCTIEPECIIGQHSILMEGSLVETRSILEAGSVLPPG 189

Query: 173 TRIPCGEVWGGNPARFLRKLTEEEMAFFSQSAINYSNLSQVHAAE 200
            RIP GE+WGGNPARF+R LT EE     + A+  ++LS  + +E
Sbjct: 190 RRIPSGELWGGNPARFIRTLTNEETLEIPKLAVAINHLSGDYFSE 234

BLAST of Cp4.1LG01g06820 vs. TAIR10
Match: AT5G63510.2 (AT5G63510.2 gamma carbonic anhydrase like 1)

HSP 1 Score: 93.6 bits (231), Expect = 1.9e-19
Identity = 67/192 (34.90%), Positives = 97/192 (50.52%), Query Frame = 1

Query: 53  PVVDSDAFVAPS------------------ASIIGDVNSISVGTGTNIQDNSLVHVAKSN 112
           P V  DA+VAP+                  A + GD+N I+VG  +N+Q+  +VH A S+
Sbjct: 66  PKVAVDAYVAPNVVLAGQVTVWDGSSVWNGAVLRGDLNKITVGFCSNVQERCVVHAAWSS 125

Query: 113 LS-----------------------GKV--LP--TIIGDNVTVGHSAVLHGCTIEDEAFV 172
            +                       GK   LP  TII   VTVG  ++L  CTIE E  +
Sbjct: 126 PTVGCNGDKAVSHGCELVFAPRFRQGKFSRLPAATIIDRYVTVGAYSLLRSCTIEPECII 185

Query: 173 GMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEVWGGNPARFLRKLTEEEMAFFSQSAI 200
           G  + L++G  VE  +++ AG++V    RIP GE+WGGNPARF+R LT EE     + A+
Sbjct: 186 GQHSILMEGSLVETRSILEAGSVVPPGRRIPSGELWGGNPARFIRTLTNEETLEIPKLAV 245

BLAST of Cp4.1LG01g06820 vs. NCBI nr
Match: gi|449457524|ref|XP_004146498.1| (PREDICTED: gamma carbonic anhydrase 1, mitochondrial [Cucumis sativus])

HSP 1 Score: 471.9 bits (1213), Expect = 7.2e-130
Identity = 240/271 (88.56%), Positives = 248/271 (91.51%), Query Frame = 1

Query: 1   MGTLGKAIYTVGFWIRETGQALDRLGCRLQGNHYFQEQLSRHRTLMNVFDKAPVVDSDAF 60
           MGTLGKAIYTVGFWIRETGQALDRLGCRLQGN+YFQEQLSRHRTLMN+FDKAPVVD DAF
Sbjct: 1   MGTLGKAIYTVGFWIRETGQALDRLGCRLQGNYYFQEQLSRHRTLMNIFDKAPVVDKDAF 60

Query: 61  VAPSASIIGD------------------VNSISVGTGTNIQDNSLVHVAKSNLSGKVLPT 120
           VAPSASIIGD                  VNSISVG+GTNIQDNSLVHVAKSNLSGKVLPT
Sbjct: 61  VAPSASIIGDVQVGRGSSIWYGCVLRGDVNSISVGSGTNIQDNSLVHVAKSNLSGKVLPT 120

Query: 121 IIGDNVTVGHSAVLHGCTIEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEV 180
           IIGDNVTVGHSAVLHGCTIEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTR+PCGEV
Sbjct: 121 IIGDNVTVGHSAVLHGCTIEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRVPCGEV 180

Query: 181 WGGNPARFLRKLTEEEMAFFSQSAINYSNLSQVHAAENVKSFDEIEFEKVLRKKFARRDE 240
           WGGNPA+FLRKLTEEEM F SQSAINYSNLSQVHAAENVKSFDEIE EKVLRKKFARRDE
Sbjct: 181 WGGNPAKFLRKLTEEEMVFISQSAINYSNLSQVHAAENVKSFDEIELEKVLRKKFARRDE 240

Query: 241 EYDSMLGVVRETPPELVLPDNILADKVPRAS 254
           +YDSMLGVVRETPPELVLPDNILADKV ++S
Sbjct: 241 DYDSMLGVVRETPPELVLPDNILADKVAKSS 271

BLAST of Cp4.1LG01g06820 vs. NCBI nr
Match: gi|659102369|ref|XP_008452092.1| (PREDICTED: gamma carbonic anhydrase 1, mitochondrial [Cucumis melo])

HSP 1 Score: 470.3 bits (1209), Expect = 2.1e-129
Identity = 239/271 (88.19%), Positives = 249/271 (91.88%), Query Frame = 1

Query: 1   MGTLGKAIYTVGFWIRETGQALDRLGCRLQGNHYFQEQLSRHRTLMNVFDKAPVVDSDAF 60
           MGTLGKAIYTVGFWIRETGQALDRLGCRLQGN+YFQEQLSRHRTLMN+FDKAPV++ DAF
Sbjct: 1   MGTLGKAIYTVGFWIRETGQALDRLGCRLQGNYYFQEQLSRHRTLMNIFDKAPVINKDAF 60

Query: 61  VAPSASIIGD------------------VNSISVGTGTNIQDNSLVHVAKSNLSGKVLPT 120
           VAPSASIIGD                  VNSISVG+GTNIQDNSLVHVAKSNLSGKVLPT
Sbjct: 61  VAPSASIIGDVQVGRMSSIWYGCVLRGDVNSISVGSGTNIQDNSLVHVAKSNLSGKVLPT 120

Query: 121 IIGDNVTVGHSAVLHGCTIEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEV 180
           IIGDNVTVGHSAVLHGCTIEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEV
Sbjct: 121 IIGDNVTVGHSAVLHGCTIEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEV 180

Query: 181 WGGNPARFLRKLTEEEMAFFSQSAINYSNLSQVHAAENVKSFDEIEFEKVLRKKFARRDE 240
           WGGNPA+FLRKLTEEEMAF SQSAINYSNLSQVHAAENVKSFDEIEFEKVLRKKFARRDE
Sbjct: 181 WGGNPAKFLRKLTEEEMAFISQSAINYSNLSQVHAAENVKSFDEIEFEKVLRKKFARRDE 240

Query: 241 EYDSMLGVVRETPPELVLPDNILADKVPRAS 254
           +YDSMLGVVRETP EL+LPDNILADKV ++S
Sbjct: 241 DYDSMLGVVRETPSELILPDNILADKVAKSS 271

BLAST of Cp4.1LG01g06820 vs. NCBI nr
Match: gi|728843857|gb|KHG23300.1| (hypothetical protein F383_08437 [Gossypium arboreum])

HSP 1 Score: 463.8 bits (1192), Expect = 2.0e-127
Identity = 232/271 (85.61%), Positives = 246/271 (90.77%), Query Frame = 1

Query: 1   MGTLGKAIYTVGFWIRETGQALDRLGCRLQGNHYFQEQLSRHRTLMNVFDKAPVVDSDAF 60
           MGTLGKAIYTVGFWIRETGQ LDRLGCRLQGN+YFQEQLSRHRTLMNVFDKAPVVD DAF
Sbjct: 1   MGTLGKAIYTVGFWIRETGQTLDRLGCRLQGNYYFQEQLSRHRTLMNVFDKAPVVDRDAF 60

Query: 61  VAPSASIIGDV------------------NSISVGTGTNIQDNSLVHVAKSNLSGKVLPT 120
           VAPSAS+IGDV                  N+IS+G+GTNIQDNSLVHVAKSNLSGKVLPT
Sbjct: 61  VAPSASVIGDVQVGRGSSIWYGCVLRGDVNNISIGSGTNIQDNSLVHVAKSNLSGKVLPT 120

Query: 121 IIGDNVTVGHSAVLHGCTIEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEV 180
           IIG NVTVGHSAVLHGCT+EDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEV
Sbjct: 121 IIGSNVTVGHSAVLHGCTVEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEV 180

Query: 181 WGGNPARFLRKLTEEEMAFFSQSAINYSNLSQVHAAENVKSFDEIEFEKVLRKKFARRDE 240
           WGGNPA+FLRKLTEEEMAF SQSA+NYSNL+QVHAAEN KSFDEIEFEK+LRKKFARRDE
Sbjct: 181 WGGNPAKFLRKLTEEEMAFISQSALNYSNLAQVHAAENAKSFDEIEFEKMLRKKFARRDE 240

Query: 241 EYDSMLGVVRETPPELVLPDNILADKVPRAS 254
           EYDSMLGVVRETPPEL+LPDNIL +KVP+ +
Sbjct: 241 EYDSMLGVVRETPPELILPDNILPNKVPKTA 271

BLAST of Cp4.1LG01g06820 vs. NCBI nr
Match: gi|702467604|ref|XP_010029770.1| (PREDICTED: gamma carbonic anhydrase 1, mitochondrial [Eucalyptus grandis])

HSP 1 Score: 463.0 bits (1190), Expect = 3.4e-127
Identity = 231/271 (85.24%), Positives = 246/271 (90.77%), Query Frame = 1

Query: 1   MGTLGKAIYTVGFWIRETGQALDRLGCRLQGNHYFQEQLSRHRTLMNVFDKAPVVDSDAF 60
           MGTLGKAIY+VGFWIRETGQALDRLGCRLQG++YFQEQLSRHRTLMNVFDKAPV+D D F
Sbjct: 1   MGTLGKAIYSVGFWIRETGQALDRLGCRLQGSYYFQEQLSRHRTLMNVFDKAPVIDKDVF 60

Query: 61  VAPSASIIGD------------------VNSISVGTGTNIQDNSLVHVAKSNLSGKVLPT 120
           VAPSASIIGD                  VN+IS+G GTNIQDNSLVHVAKSNLSGKVLPT
Sbjct: 61  VAPSASIIGDIQIGRGSSIWYGCVLRGDVNNISIGAGTNIQDNSLVHVAKSNLSGKVLPT 120

Query: 121 IIGDNVTVGHSAVLHGCTIEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEV 180
           IIGDNVTVGHSAVLHGCT+EDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTR+PCGEV
Sbjct: 121 IIGDNVTVGHSAVLHGCTVEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRVPCGEV 180

Query: 181 WGGNPARFLRKLTEEEMAFFSQSAINYSNLSQVHAAENVKSFDEIEFEKVLRKKFARRDE 240
           WGGNPA+FLRKLTEEEMAF SQSA+NYSNL+QVHAAEN KSFDEIEFEKVLRKKFARRDE
Sbjct: 181 WGGNPAKFLRKLTEEEMAFISQSAVNYSNLAQVHAAENAKSFDEIEFEKVLRKKFARRDE 240

Query: 241 EYDSMLGVVRETPPELVLPDNILADKVPRAS 254
           EYDSMLGVVRETPPEL+LPDNIL DK+ +A+
Sbjct: 241 EYDSMLGVVRETPPELILPDNILQDKLQKAT 271

BLAST of Cp4.1LG01g06820 vs. NCBI nr
Match: gi|590703469|ref|XP_007046882.1| (Gamma carbonic anhydrase 1, CA1 [Theobroma cacao])

HSP 1 Score: 463.0 bits (1190), Expect = 3.4e-127
Identity = 234/271 (86.35%), Positives = 246/271 (90.77%), Query Frame = 1

Query: 1   MGTLGKAIYTVGFWIRETGQALDRLGCRLQGNHYFQEQLSRHRTLMNVFDKAPVVDSDAF 60
           MGTLGKAIY+VGFWIRETGQALDRLGCRLQGN+YFQEQLSRHRTLMNVF+KAPVVD DAF
Sbjct: 1   MGTLGKAIYSVGFWIRETGQALDRLGCRLQGNYYFQEQLSRHRTLMNVFNKAPVVDRDAF 60

Query: 61  VAPSASIIGD------------------VNSISVGTGTNIQDNSLVHVAKSNLSGKVLPT 120
           VAPSASIIGD                  VNSIS+G+GTNIQDNSLVHVAKSNLSGKVLPT
Sbjct: 61  VAPSASIIGDVQVGRGSSIWYGCVLRGDVNSISIGSGTNIQDNSLVHVAKSNLSGKVLPT 120

Query: 121 IIGDNVTVGHSAVLHGCTIEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEV 180
           IIG NVTVGHSAVLHGCT+EDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEV
Sbjct: 121 IIGSNVTVGHSAVLHGCTVEDEAFVGMGATLLDGVYVEKHAMVAAGALVRQNTRIPCGEV 180

Query: 181 WGGNPARFLRKLTEEEMAFFSQSAINYSNLSQVHAAENVKSFDEIEFEKVLRKKFARRDE 240
           WGGNPA+FLRKLTEEEMAF SQSA+NYSNL+QVHAAEN KSFDEIEFEKVLRKKFARRDE
Sbjct: 181 WGGNPAKFLRKLTEEEMAFISQSALNYSNLAQVHAAENAKSFDEIEFEKVLRKKFARRDE 240

Query: 241 EYDSMLGVVRETPPELVLPDNILADKVPRAS 254
           EYDSMLGVVRE PPEL+LPDNILADKV + +
Sbjct: 241 EYDSMLGVVREMPPELILPDNILADKVSKTA 271

The following BLAST results are available for this feature:

Swiss-Prot
Match Name   E-value   Identity (%)  Description
GCA1_ARATH   1.0e-115  78.57         Gamma carbonic anhydrase 1, mitochondrial OS=Arabidopsis thaliana GN=GAMMACA1 PE...
GCA2_ARATH   1.2e-108  71.48         Gamma carbonic anhydrase 2, mitochondrial OS=Arabidopsis thaliana GN=GAMMACA2 PE...
GCA3_ARATH   2.5e-98   69.47         Gamma carbonic anhydrase 3, mitochondrial OS=Arabidopsis thaliana GN=GAMMACA3 PE...
Y2881_DICDI  9.7e-34   35.17         Uncharacterized protein DDB_G0288155 OS=Dictyostelium discoideum GN=DDB_G0288155...
YRDA_SHIFL   1.8e-24   45.86         Protein YrdA OS=Shigella flexneri GN=yrdA PE=3 SV=1

TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0KWN9_CUCSA  5.0e-130  88.56         Uncharacterized protein OS=Cucumis sativus GN=Csa_4G046620 PE=4 SV=1
A0A0B0PEC8_GOSAR  1.4e-127  85.61         Uncharacterized protein OS=Gossypium arboreum GN=F383_08437 PE=4 SV=1
A0A061DFL1_THECC  2.3e-127  86.35         Gamma carbonic anhydrase 1, CA1 OS=Theobroma cacao GN=TCM_000346 PE=4 SV=1
A0A059ATF4_EUCGR  2.3e-127  85.24         Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_I02417 PE=4 SV=1
A0A0D2MHW9_GOSRA  3.1e-127  85.24         Uncharacterized protein OS=Gossypium raimondii GN=B456_003G013500 PE=4 SV=1

TAIR10
Match Name   E-value   Identity (%)  Description
AT1G19580.1  5.7e-117  78.57         gamma carbonic anhydrase 1
AT1G47260.1  6.8e-110  71.48         gamma carbonic anhydrase 2
AT5G66510.2  2.7e-98   66.67         gamma carbonic anhydrase 3
AT3G48680.1  2.2e-23   37.58         gamma carbonic anhydrase-like 2
AT5G63510.2  1.9e-19   34.90         gamma carbonic anhydrase like 1

NCBI nr
Match Name                        E-value   Identity (%)  Description
gi|449457524|ref|XP_004146498.1|  7.2e-130  88.56         PREDICTED: gamma carbonic anhydrase 1, mitochondrial [Cucumis sativus]
gi|659102369|ref|XP_008452092.1|  2.1e-129  88.19         PREDICTED: gamma carbonic anhydrase 1, mitochondrial [Cucumis melo]
gi|728843857|gb|KHG23300.1|       2.0e-127  85.61         hypothetical protein F383_08437 [Gossypium arboreum]
gi|702467604|ref|XP_010029770.1|  3.4e-127  85.24         PREDICTED: gamma carbonic anhydrase 1, mitochondrial [Eucalyptus grandis]
gi|590703469|ref|XP_007046882.1|  3.4e-127  86.35         Gamma carbonic anhydrase 1, CA1 [Theobroma cacao]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR011004  Trimer_LpxA-like_sf
IPR001451  Hexapep
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005747      mitochondrial respiratory chain complex I
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0016740      transferase activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG01g06820.1  Cp4.1LG01g06820.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description     Source   Source Term        Source Description          Alignment
IPR001451  Hexapeptide repeat  PFAM     PF00132            Hexapep                     coord: 102..133; score: 2.
IPR011004  Trimeric LpxA-like  unknown  SSF51161           Trimeric LpxA-like enzymes  coord: 54..192; score: 2.5
None       No IPR available    GENE3D   G3DSA:2.160.10.10                              coord: 55..194; score: 8.3
None       No IPR available    PANTHER  PTHR13061          DYNACTIN SUBUNIT P25        coord: 1..252; score: 4.1E
None       No IPR available    PANTHER  PTHR13061:SF9      SUBFAMILY NOT NAMED         coord: 1..252; score: 4.1E

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG01g06820  Cp4.1LG14g01980  Cucurbita pepo (Zucchini)  cpecpeB233