Cla022885 (gene) Watermelon (97103) v1

Name: Cla022885
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Alpha-galactosidase 2 (AHRD V1 ***- D7M1B4_ARALL); contains InterPro domain(s) IPR013785 Aldolase-type TIM barrel
Location: Chr11 : 15265038 .. 15270145 (+)
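
The location field can be read programmatically. The sketch below is only an illustration: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re

# Pattern for a location such as "Chr11 : 15265038 .. 15270145 (+)".
LOCATION = re.compile(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")

def parse_location(text):
    """Return (chromosome, start, end, strand) from a location string."""
    m = LOCATION.search(text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("Chr11 : 15265038 .. 15270145 (+)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the gene spans 5108 bp on the forward strand of Chr11.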
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCTTCTTCCCCACTCCCGCTTTCATCTCCACCTTCTTTTATGGTTCTACTCTACTCCGTCCTTTTCTGGATGTTCTTGTTAGCTAGCAATGGCAGTGGCAATGGCGGTGCGGTGACCGGAGCGACTCGATCAAGACCTCTACGGTTTGCCGCTGAGTTTGATTCTGTTTCCACTCGAAGAGTTCTCCTCAACAATGGCCTCGCTCTAACTCCTCCGATGGGGTAAGTATTATGCTCCGATCATGATTTTTAAGTCTTCTTCCTGTTTCTAGAGGATCGATTCGATTGAATGAAATGTTGTTTGCTTGTTTGATTTTCGATCTTCTCGCTTTCGTTCAATAGTGGTTATTGAAAATTTGATGATGTTGGATTGCTGATGTTTGTAATGTGTTACTTCCGGCTTTTGTAATCAACGCAGATGGAATAGTTGGAACCATTTTCAATGTAATATAAACGAGAATTTAATCAAGGAAACAGGTGATAATTTTGATTTGAAGGCATCTTTCTGATTGGATTTGTGTACTGTGATTATGTGACTGAGATACACATCCTATTTCTTTTGCTAAGCGGATGCAATGGTATCCAGCGGCCTTGCTGCATTAGGATATGAGTATATCAATTTAGGTGATGATTTTATTTCTACTTTATACATTTCTATGATTTCCCAACTTTATGCTTTCTGTTTTTGACATTAATATGTTCTTATTTCCATATGAATTAGATGATTGTTGGGCTGAACTTAACAGAGACTCTAAGGTAAATTTCAATGTGCTATTACCTGTCTAGAATGTTTATGAAATCTCTATATTGCATTAGAAGATTCCATCAATTTTTTCAGGGTAATTTGGTTGCTAAAGCTTCAACATTCCCTTCTGGTATTAAGGCCCTAGCAGATTATGTTCACAGAAAAGGGCTTAAGCTCGGAATTTATTCGGATGCTGGGTAAGTTTATACACTCATCAGATCAGCTTAAACATCTTCCTCTTCCATTTCAATTGTTGCTGTTCATCAAAGTGCTGCAGATGTTTTTTTGGCTTAAAGAAATTCTACTATCAACACTGTTTTAGTTAAATCTTCTTTACGTGAGTCATATAATTTCTTCAGGATCCAGACTTGTAGTAAAAAGATGCCAGGTTCCCTTGGTCACGAAGAACAAGATGCAAAAACTTTTGCCGCATGGGTAATTTTGCTGCTGTACACTCTTAGTCTTAACTCCTTTAATTCTTGTTTATAATAATGATTTTTCGTTTAGAAACTGATATATACACAATTGTGTAGGGGATTGATTATTTGAAGTATGATAACTGTGAAAATACTGGTACAAGCCCAAAAGAAAGGTAGGATCAGCTTTCAACATTCAATTATCAAAATTTCCAAATGACTGCAACAATTCCAGGGGTATAGGTACCCAAAGATGAGTAAAGCTTTGCAACGATCTGAGAGACCGATTCTTTTTTCGTTATGCGAATGGTTAGCAATCTTTATACACAAAAGAAAAAACATGGGATTGTTGTTCTAATGCTTCCCCATTCTAAAGTACAAACCGAGTTCTTATCTTAAGTGTTTCCTCATTTAGGGGACAGGAAGATCCAGCAACTTGGGCAATAAATGTAGGAAATAGTTGGAGAACAACATCGGATATTCAAGATAACTGGACTAGGTAGTGAGCTTATATCCTTAAAAGCATATATTTTATCGTCATTTCTGGAATCATGCAACATTTTTCAAAAACTTTGTCTACATTCTTTAATTAGTTTCTTTGTCCCACTGGCCATATCTATGAATATGCCTGTACTGAAGTATGATCTTTCAAGTCCAAATTATTTTCTTTAATATGCCTGTACTAATGCTCCTACTTTATTATTACAGCATGACAACCATTGCTGATCAGAATGACAAATGGGCTTCTTACGCAAAACCAGGAGGATGGAATGGTAAAAATTCGTAAATGTAGTTCAATATGAAAATTAAACAATGTATCCTTGTGTTTAGAGAAAT
ATTGGTTTGTTAGCTTGTCAGTACTAATTAAATTATAGATTCGAATGATTGAAGAAGTCGATCCAATCCAATCCTGTATCCTGAACAGAGGGTAGACGATTTCTACTGCTGCCATTAGAGCTTCCAGCTGTTTAAACTCAATCTTATAGTTATTACCTTATTATCAAATAATTTAGTTTGATATAAAAGTCCAATACTCATAGTTATTACCTTATTATCAAATAATTTAGTTTTATGATTGTGACTAAATGACTTTCCCTTTGGCTAATCTCTATGCAAGACATGGACATTGTAAAACAATCTAAAAAATGCTAAGGAAAATAATCTTAGTTTAAAATTTCTTTTGTTCAGATCCCGACATGCTTGAAGTGGGGAACGGAGGGATGACCACAGTAGAATACCGTTCTCATTTTAGCATTTGGGCACTGGCAAAAGTAAGTACCATTGAGTAAAATTTTTCCAGAGTCTAGGTTTTAGTTCAGTAGGTAATTTCTTATACCATATTAAGACATATTTGGAATTCAAGTTCATTAAATAACATAATCATTTAAAAAATTTATATTTAGATAAAATTGAATATCGTCTGTTCCTTGATTTGTGGGTTGAGTTTATAATTTTTTGTGCTTCATAAAATATCTAAATCAAGTTATAATCAATTCCTTCCTGTTTCATATTTGAAGGCTCCATTGCTGATTGGATGTGACATCAGATCAATGGACAACGTCACCTTGAAATTGTTAAGTAATAAGGAGGTTATTGCAGTTAATCAGGGTAAGTAAAAACCTCAGCTCATTCATAGAGTATAAACTGTGAAACAGATAACCAAACATAGTAATAATTTCTCTCAATAAAATGAAAAGATTAAGTTTCGAAAAATATTTGGGTGCATGGACTGCATCGATCATTAACTTAATATGACACCATGTTCGTCCATCTAAAAGTTTTTTTTATCCAAGTAGATAAGCTTGGAGTCCAAGGCAAGAAGGTCTATAAATATGGAGACCTCGAGGTAATTTTTTCATCATCTAACGTATCCCGAAAGGAAACTAAAAGAGAGAGAGAAAAAAAAAAAAAAAAAAAAAAAAAAACTGAAAATGTTGCTTTACTCTTGACTTTTTACACAAAACCAACGCAGGTTTGGGCAGGGCCGCTGAGTGGTAAAAGAGTCGCCGTGGTTCTATGGAACAGAGGTTTGTGGCGAGCCAATATCACTGCATCTTGGTCTGATATAGGTTTGAGTCCATCAACTGTAGTTACTGCTCGAGACTTATGGGAGGTAAGATATGAATTACCTTATAACTCAACTTTTATAAAAGTAAGAATTTAATCGAACATGGTTAGTGATTTAGTTCGTATACTTTTAAATTGGTAGAAATTTAGTCATTTAACTTTAGTATTAACAATTTAGTTTTTATACTCTAAATGTTACATTGATTTAGTCTCCATCACGATGGGGTGTTTATTGGTTGGGTTGAGTTGAAATACTTTTTAAACTTAACCCAATTGTTCAAGTTATAAATTTCTTCAATCCAAATAAAGCTTATTAGAGGATGAACCCAACCCAACTCAACCATGAAACATCTTGGTTGGGTTGATTTGGGTTACTTGGGTCATTTTTAAATGTTTGTTCTAAAAATAAGTAAAATGTTAATATGTAAAAAAAATTTATTTAATTATTTTTGTATATTGAATTAAGATTAACAACTCAATCTAAATTTATAATATGAAAATTGTCATTCCAAGGTGCTAAAAAATTATTTTTCAGAGTTGTTAGGGAGAATAAATTATCAAAAAAATAATAAAATTAAAATAAATAATATGTATATATAATTGTATCTTTAAGTAAAAATATATGCTTTTTAAAATTAAAATAATTTCGGGTTGGTTCGGGTTGGGTTGGGTTATTTTATGTGAACCCATGAATCAACTCAGCCCATAAAATTTTCATTTATTTGAACCCAACTCGACCTAAATGAGTTTATAACCCAACCCAAACTGTA
CGGGTTGGGTTCGGTTGATCGGATTTTTTGAACACCCCTATATCACGAAAAATATTGTTAAAATTTGATCAGATTTATTATATATGTAGATTGATAGACTAGTTAGGGACCAAATTTGTTTATAAACAATGATAACTAAATAGTTACATTCACACTTAACTTTGATGAAATTTCACGGTAGAGATTAAATTACTACAAATTTGAAAGTACGTAGACTAAACTGACACTAAAATTCACTAAATAATTAGAAATTTAACACTGGAAATCCATGTTTTTGAAGTTAAGCACATATGTAATACAATTTTTGTAAACAAATGTAAACTCTTATCCTATTTTATATGAGTGCATATTTGTTCATCCTTCAAAAGACCTACTTATAGGGATTTTTGTTTTAATATCTTTTTGGTCCTTAGATTTTGGGTCTAATTTCTATTTAGTCATCAGGTTTCAAAATATTACACATTTAGTTCTTTAGTTTTGAATTTGATATCAATGTATTCCCTGGCTTTCAAAATGTTACAATTTTATTCTTGAGATTTGAGTTTTGTTTCAATTTGGTCGCTCGATTTTCAAGATTTACATTTTTTACCTTGATTTTTCATTAAATACTCACTTTTCGTCTTTAGTTTTAATGTCAATTAATTAATTTAAATGAATTATAATTAATTGAGTTTCACTATTTTTCATCACTATTAAAATTAAATTCATAATGTCACTCTATAATTATTTTAATTTAATTCATTGATATTAACGCCAAATCAACGTCTAAAAATGAGTATTTAATTAAAAATTGAGATTAAAAGTGTAAATCTTGAAACATATGGACTAAATTGACATAGGACTCAAATATCATGAGTAAAATTATAACATTTCGAGACTTACTAACTAAATACTCAAAACTTAAGGATTAAATGTGTAACATTTTGAAATTAGAAACTAAATGAGACTGAAAATCGAAAAAGATATTCTTTATTATTATTTTTTTTTCTTTCTGACTGATTGATTATATTCATTTCTACGACAGCACTCCAGGCTGGTGGTTCAACATCACCTTACTGCTGAAGTAGACTCTCATGGCTGTAAGATGTACGTTCTCACACCCCACTAA

mRNA sequence

ATGTCTTCTTCCCCACTCCCGCTTTCATCTCCACCTTCTTTTATGGTTCTACTCTACTCCGTCCTTTTCTGGATGTTCTTGTTAGCTAGCAATGGCAGTGGCAATGGCGGTGCGGTGACCGGAGCGACTCGATCAAGACCTCTACGGTTTGCCGCTGAGTTTGATTCTGTTTCCACTCGAAGAGTTCTCCTCAACAATGGCCTCGCTCTAACTCCTCCGATGGGATGGAATAGTTGGAACCATTTTCAATGTAATATAAACGAGAATTTAATCAAGGAAACAGCGGATGCAATGGTATCCAGCGGCCTTGCTGCATTAGGATATGAGTATATCAATTTAGATGATTGTTGGGCTGAACTTAACAGAGACTCTAAGGGTAATTTGGTTGCTAAAGCTTCAACATTCCCTTCTGGTATTAAGGCCCTAGCAGATTATGTTCACAGAAAAGGGCTTAAGCTCGGAATTTATTCGGATGCTGGGATCCAGACTTGTAGTAAAAAGATGCCAGGTTCCCTTGGTCACGAAGAACAAGATGCAAAAACTTTTGCCGCATGGGGGATTGATTATTTGAAGTATGATAACTGTGAAAATACTGGTACAAGCCCAAAAGAAAGGTACCCAAAGATGAGTAAAGCTTTGCAACGATCTGAGAGACCGATTCTTTTTTCGTTATGCGAATGGGGACAGGAAGATCCAGCAACTTGGGCAATAAATGTAGGAAATAGTTGGAGAACAACATCGGATATTCAAGATAACTGGACTAGCATGACAACCATTGCTGATCAGAATGACAAATGGGCTTCTTACGCAAAACCAGGAGGATGGAATGATCCCGACATGCTTGAAGTGGGGAACGGAGGGATGACCACAGTAGAATACCGTTCTCATTTTAGCATTTGGGCACTGGCAAAAGCTCCATTGCTGATTGGATGTGACATCAGATCAATGGACAACGTCACCTTGAAATTGTTAAGTAATAAGGAGGTTATTGCAGTTAATCAGGATAAGCTTGGAGTCCAAGGCAAGAAGGTCTATAAATATGGAGACCTCGAGGTTTGGGCAGGGCCGCTGAGTGGTAAAAGAGTCGCCGTGGTTCTATGGAACAGAGGTTTGTGGCGAGCCAATATCACTGCATCTTGGTCTGATATAGGTTTGAGTCCATCAACTGTAGTTACTGCTCGAGACTTATGGGAGCACTCCAGGCTGGTGGTTCAACATCACCTTACTGCTGAAGTAGACTCTCATGGCTGTAAGATGTACGTTCTCACACCCCACTAA

Coding sequence (CDS)

ATGTCTTCTTCCCCACTCCCGCTTTCATCTCCACCTTCTTTTATGGTTCTACTCTACTCCGTCCTTTTCTGGATGTTCTTGTTAGCTAGCAATGGCAGTGGCAATGGCGGTGCGGTGACCGGAGCGACTCGATCAAGACCTCTACGGTTTGCCGCTGAGTTTGATTCTGTTTCCACTCGAAGAGTTCTCCTCAACAATGGCCTCGCTCTAACTCCTCCGATGGGATGGAATAGTTGGAACCATTTTCAATGTAATATAAACGAGAATTTAATCAAGGAAACAGCGGATGCAATGGTATCCAGCGGCCTTGCTGCATTAGGATATGAGTATATCAATTTAGATGATTGTTGGGCTGAACTTAACAGAGACTCTAAGGGTAATTTGGTTGCTAAAGCTTCAACATTCCCTTCTGGTATTAAGGCCCTAGCAGATTATGTTCACAGAAAAGGGCTTAAGCTCGGAATTTATTCGGATGCTGGGATCCAGACTTGTAGTAAAAAGATGCCAGGTTCCCTTGGTCACGAAGAACAAGATGCAAAAACTTTTGCCGCATGGGGGATTGATTATTTGAAGTATGATAACTGTGAAAATACTGGTACAAGCCCAAAAGAAAGGTACCCAAAGATGAGTAAAGCTTTGCAACGATCTGAGAGACCGATTCTTTTTTCGTTATGCGAATGGGGACAGGAAGATCCAGCAACTTGGGCAATAAATGTAGGAAATAGTTGGAGAACAACATCGGATATTCAAGATAACTGGACTAGCATGACAACCATTGCTGATCAGAATGACAAATGGGCTTCTTACGCAAAACCAGGAGGATGGAATGATCCCGACATGCTTGAAGTGGGGAACGGAGGGATGACCACAGTAGAATACCGTTCTCATTTTAGCATTTGGGCACTGGCAAAAGCTCCATTGCTGATTGGATGTGACATCAGATCAATGGACAACGTCACCTTGAAATTGTTAAGTAATAAGGAGGTTATTGCAGTTAATCAGGATAAGCTTGGAGTCCAAGGCAAGAAGGTCTATAAATATGGAGACCTCGAGGTTTGGGCAGGGCCGCTGAGTGGTAAAAGAGTCGCCGTGGTTCTATGGAACAGAGGTTTGTGGCGAGCCAATATCACTGCATCTTGGTCTGATATAGGTTTGAGTCCATCAACTGTAGTTACTGCTCGAGACTTATGGGAGCACTCCAGGCTGGTGGTTCAACATCACCTTACTGCTGAAGTAGACTCTCATGGCTGTAAGATGTACGTTCTCACACCCCACTAA

Protein sequence

MSSSPLPLSSPPSFMVLLYSVLFWMFLLASNGSGNGGAVTGATRSRPLRFAAEFDSVSTRRVLLNNGLALTPPMGWNSWNHFQCNINENLIKETADAMVSSGLAALGYEYINLDDCWAELNRDSKGNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAKTFAAWGIDYLKYDNCENTGTSPKERYPKMSKALQRSERPILFSLCEWGQEDPATWAINVGNSWRTTSDIQDNWTSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTVEYRSHFSIWALAKAPLLIGCDIRSMDNVTLKLLSNKEVIAVNQDKLGVQGKKVYKYGDLEVWAGPLSGKRVAVVLWNRGLWRANITASWSDIGLSPSTVVTARDLWEHSRLVVQHHLTAEVDSHGCKMYVLTPH
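
The protein sequence corresponds to a codon-by-codon translation of the CDS. The following is a minimal sketch of that step, assuming the standard genetic code; the function name `translate` is hypothetical and nothing here is taken from the database's own pipeline. It is shown on the first 45 nucleotides and the last 9 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 15 codons of the CDS, then its last three codons (ending in TAA).
print(translate("ATGTCTTCTTCCCCACTCCCGCTTTCATCTCCACCTTCTTTTATG"))
print(translate("CCCCACTAA"))
```

The first call reproduces the opening residues of the protein (MSSSPLPLSSPPSFM), and the second shows the final residues PH followed by the TAA stop codon.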
BLAST of Cla022885 vs. Swiss-Prot
Match: AGAL2_ARATH (Alpha-galactosidase 2 OS=Arabidopsis thaliana GN=AGAL2 PE=1 SV=1)

HSP 1 Score: 617.8 bits (1592), Expect = 8.6e-176
Identity = 292/364 (80.22%), Positives = 319/364 (87.64%), Query Frame = 1

Query: 61  RVLLNNGLALTPPMGWNSWNHFQCNINENLIKETADAMVSSGLAALGYEYINLDDCWAEL 120
           R+L+NNGLAL+P MGWNSWNHFQCNINE LIK+TADAMVSSGL+A+GY+YIN+DDCW EL
Sbjct: 29  RMLMNNGLALSPQMGWNSWNHFQCNINETLIKQTADAMVSSGLSAIGYKYINIDDCWGEL 88

Query: 121 NRDSKGNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAK 180
            RDS+G+LVAKASTFPSGIKAL+DYVH KGLKLGIYSDAG  TCS+ MPGSLGHEEQDAK
Sbjct: 89  KRDSQGSLVAKASTFPSGIKALSDYVHSKGLKLGIYSDAGTLTCSQTMPGSLGHEEQDAK 148

Query: 181 TFAAWGIDYLKYDNCENTGTSPKERYPKMSKALQRSERPILFSLCEWGQEDPATWAINVG 240
           TFA+WGIDYLKYDNCENTGTSP+ERYPKMSKAL  S R I FSLCEWGQEDPATWA ++G
Sbjct: 149 TFASWGIDYLKYDNCENTGTSPRERYPKMSKALLNSGRSIFFSLCEWGQEDPATWAGDIG 208

Query: 241 NSWRTTSDIQDNWTSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTVEYRSHFSIW 300
           NSWRTT DIQDNW SMT IADQND+WASYA+PG WNDPDMLEVGNGGMT  EY SHFSIW
Sbjct: 209 NSWRTTGDIQDNWKSMTLIADQNDRWASYARPGSWNDPDMLEVGNGGMTKEEYMSHFSIW 268

Query: 301 ALAKAPLLIGCDIRSMDNVTLKLLSNKEVIAVNQDKLGVQGKKVYKYGDLEVWAGPLSGK 360
           ALAKAPLLIGCD+RSMD VT +LLSNKEVIAVNQDKLG+QGKKV K GDLEVWAGPLS K
Sbjct: 269 ALAKAPLLIGCDLRSMDKVTFELLSNKEVIAVNQDKLGIQGKKVKKEGDLEVWAGPLSKK 328

Query: 361 RVAVVLWNRGLWRANITASWSDIGLSPSTVVTARDLWEHSRL-VVQHHLTAEVDSHGCKM 420
           RVAV+LWNRG   ANITA W++IGL+ S +V ARDLWEHS    V+  L+A V+ H CKM
Sbjct: 329 RVAVILWNRGSASANITARWAEIGLNSSDIVNARDLWEHSTYSCVKKQLSALVEPHACKM 388

Query: 421 YVLT 424
           Y LT
Sbjct: 389 YTLT 392
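
The percentages on each HSP statistics line are simple ratios of the counts over the alignment length. A small sketch of how they could be recomputed is given below; the helper name `hsp_percentages` is hypothetical, and the pattern accepts both the "Positives" and the "Postives" spelling of the field label.

```python
import re

# Matches e.g. "Identity = 292/364 (80.22%), Positives = 319/364 (87.64%)".
STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+)"
)

def hsp_percentages(stats_line):
    """Recompute percent identity and percent positives from the counts."""
    m = STATS.search(stats_line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {stats_line!r}")
    identical, ident_len, positive, pos_len = map(int, m.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )

line = "Identity = 292/364 (80.22%), Positives = 319/364 (87.64%), Query Frame = 1"
print(hsp_percentages(line))
```

For the AGAL2_ARATH hit this gives 80.22 and 87.64, in agreement with the reported values.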

BLAST of Cla022885 vs. Swiss-Prot
Match: AGAL_COFAR (Alpha-galactosidase OS=Coffea arabica PE=1 SV=1)

HSP 1 Score: 610.9 bits (1574), Expect = 1.0e-173
Identity = 293/367 (79.84%), Positives = 310/367 (84.47%), Query Frame = 1

Query: 59  TRRVLLNNGLALTPPMGWNSWNHFQCNINENLIKETADAMVSSGLAALGYEYINLDDCWA 118
           TRR LL NGL LTPPMGWNSWNHF+CN++E LI+ETADAMVS GLAALGY+YINLDDCWA
Sbjct: 11  TRRSLLANGLGLTPPMGWNSWNHFRCNLDEKLIRETADAMVSKGLAALGYKYINLDDCWA 70

Query: 119 ELNRDSKGNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQD 178
           ELNRDS+GNLV K STFPSGIKALADYVH KGLKLGIYSDAG QTCSK MPGSLGHEEQD
Sbjct: 71  ELNRDSQGNLVPKGSTFPSGIKALADYVHSKGLKLGIYSDAGTQTCSKTMPGSLGHEEQD 130

Query: 179 AKTFAAWGIDYLKYDNCENTGTSPKERYPKMSKALQRSERPILFSLCEWGQEDPATWAIN 238
           AKTFA+WG+DYLKYDNC N   SPKERYP MSKAL  S R I FSLCEWG+EDPATWA  
Sbjct: 131 AKTFASWGVDYLKYDNCNNNNISPKERYPIMSKALLNSGRSIFFSLCEWGEEDPATWAKE 190

Query: 239 VGNSWRTTSDIQDNWTSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTVEYRSHFS 298
           VGNSWRTT DI D+W+SMT+ AD NDKWASYA PGGWNDPDMLEVGNGGMTT EYRSHFS
Sbjct: 191 VGNSWRTTGDIDDSWSSMTSRADMNDKWASYAGPGGWNDPDMLEVGNGGMTTTEYRSHFS 250

Query: 299 IWALAKAPLLIGCDIRSMDNVTLKLLSNKEVIAVNQDKLGVQGKKVYKYGDLEVWAGPLS 358
           IWALAKAPLLIGCDIRSMD  T +LLSN EVIAVNQDKLGVQG KV  YGDLEVWAGPLS
Sbjct: 251 IWALAKAPLLIGCDIRSMDGATFQLLSNAEVIAVNQDKLGVQGNKVKTYGDLEVWAGPLS 310

Query: 359 GKRVAVVLWNRGLWRANITASWSDIGLSPSTVVTARDLWEHS-RLVVQHHLTAEVDSHGC 418
           GKRVAV LWNRG   A ITA WSD+GL  + VV ARDLW HS    V+  ++A VD+H  
Sbjct: 311 GKRVAVALWNRGSSTATITAYWSDVGLPSTAVVNARDLWAHSTEKSVKGQISAAVDAHDS 370

Query: 419 KMYVLTP 425
           KMYVLTP
Sbjct: 371 KMYVLTP 377
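
Each HSP line gives a bit score followed by the raw alignment score in parentheses. The two are related by the Karlin-Altschul normalisation S' = (lambda * S - ln K) / ln 2. The sketch below assumes lambda = 0.267 and K = 0.041, the values commonly quoted for gapped BLOSUM62 searches with gap costs 11/1; the page does not state which scoring parameters were used, so treat these constants and the function name `bit_score` as assumptions.

```python
import math

# Assumed Karlin-Altschul parameters (gapped BLOSUM62, gap open 11, extend 1).
# They are not given on the page; different settings would change the result.
LAMBDA = 0.267
K = 0.041

def bit_score(raw_score):
    """Convert a raw alignment score to a normalised bit score."""
    return (LAMBDA * raw_score - math.log(K)) / math.log(2)

for raw in (1592, 1574):
    print(raw, round(bit_score(raw), 1))
```

Under these assumed parameters, raw scores of 1592 and 1574 come out close to the reported 617.8 and 610.9 bits.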

BLAST of Cla022885 vs. Swiss-Prot
Match: AGAL_CYATE (Alpha-galactosidase OS=Cyamopsis tetragonoloba PE=1 SV=1)

HSP 1 Score: 610.5 bits (1573), Expect = 1.4e-173
Identity = 287/389 (73.78%), Positives = 321/389 (82.52%), Query Frame = 1

Query: 37  GAVTGATRSRPLRFAAEFDSVSTRRVLLNNGLALTPPMGWNSWNHFQCNINENLIKETAD 96
           G+  G    +  R +AE +  + RR L  NGL  TPPMGWNSWNHF C+INEN+++ETAD
Sbjct: 21  GSEGGRLLEKKNRTSAEAEHYNVRRYLAENGLGQTPPMGWNSWNHFGCDINENVVRETAD 80

Query: 97  AMVSSGLAALGYEYINLDDCWAELNRDSKGNLVAKASTFPSGIKALADYVHRKGLKLGIY 156
           AMVS+GLAALGY+YINLDDCWAELNRDS+GN+V  A+ FPSGIKALADYVH KGLKLG+Y
Sbjct: 81  AMVSTGLAALGYQYINLDDCWAELNRDSEGNMVPNAAAFPSGIKALADYVHSKGLKLGVY 140

Query: 157 SDAGIQTCSKKMPGSLGHEEQDAKTFAAWGIDYLKYDNCENTGTSPKERYPKMSKALQRS 216
           SDAG QTCSK+MPGSLGHEEQDAKTFA+WG+DYLKYDNCEN G S KERYP M KAL  S
Sbjct: 141 SDAGNQTCSKRMPGSLGHEEQDAKTFASWGVDYLKYDNCENLGISVKERYPPMGKALLSS 200

Query: 217 ERPILFSLCEWGQEDPATWAINVGNSWRTTSDIQDNWTSMTTIADQNDKWASYAKPGGWN 276
            RPI FS+CEWG EDP  WA ++GNSWRTT DI+DNW SMT+IAD NDKWASYA PGGWN
Sbjct: 201 GRPIFFSMCEWGWEDPQIWAKSIGNSWRTTGDIEDNWNSMTSIADSNDKWASYAGPGGWN 260

Query: 277 DPDMLEVGNGGMTTVEYRSHFSIWALAKAPLLIGCDIRSMDNVTLKLLSNKEVIAVNQDK 336
           DPDMLEVGNGGMTT EYRSHFSIWALAKAPLL+GCDIR+MD+ T +L+SN EVIAVNQDK
Sbjct: 261 DPDMLEVGNGGMTTEEYRSHFSIWALAKAPLLVGCDIRAMDDTTHELISNAEVIAVNQDK 320

Query: 337 LGVQGKKVYKYGDLEVWAGPLSGKRVAVVLWNRGLWRANITASWSDIGLSPSTVVTARDL 396
           LGVQGKKV    DLEVWAGPLS  +VAV+LWNR   RA +TASWSDIGL   T V ARDL
Sbjct: 321 LGVQGKKVKSTNDLEVWAGPLSDNKVAVILWNRSSSRATVTASWSDIGLQQGTTVDARDL 380

Query: 397 WEHS-RLVVQHHLTAEVDSHGCKMYVLTP 425
           WEHS + +V   ++AE+DSH CKMYVLTP
Sbjct: 381 WEHSTQSLVSGEISAEIDSHACKMYVLTP 409
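
The identity count of an HSP is the number of alignment columns in which query and subject carry the same residue, and a "-" marks a gap. The sketch below counts both for one alignment row pair; `column_counts` is a hypothetical helper. Positives are not recomputed here because they depend on the substitution matrix used by the search.

```python
def column_counts(query_row, sbjct_row):
    """Count identical columns and gap columns in one aligned row pair."""
    if len(query_row) != len(sbjct_row):
        raise ValueError("aligned rows must have equal length")
    identical = sum(
        q == s and q != "-" for q, s in zip(query_row, sbjct_row)
    )
    gaps = sum(q == "-" or s == "-" for q, s in zip(query_row, sbjct_row))
    return identical, gaps

# Last row pair of the AGAL_CYATE alignment (query 397-425, subject 381-409).
query = "WEHS-RLVVQHHLTAEVDSHGCKMYVLTP"
sbjct = "WEHSTQSLVSGEISAEIDSHACKMYVLTP"
print(column_counts(query, sbjct))
```

Summing these per-row counts over all rows of an HSP would give the numerator of its Identity field.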

BLAST of Cla022885 vs. Swiss-Prot
Match: AGAL_ORYSJ (Alpha-galactosidase OS=Oryza sativa subsp. japonica GN=Os10g0493600 PE=1 SV=1)

HSP 1 Score: 552.4 bits (1422), Expect = 4.4e-156
Identity = 270/419 (64.44%), Positives = 307/419 (73.27%), Query Frame = 1

Query: 9   SSPPS--FMVLLYSVLFWMFLLASNGSGNGGAVTGATRSRPLRFAAEFDSVSTRRVLLNN 68
           SSPPS   ++LL   +    L  +   GN  A +   R R  R          RR    N
Sbjct: 8   SSPPSPRLLLLLLVAVAATLLPEAAALGNFTAESRGARWRSRR---------ARRRAFEN 67

Query: 69  GLALTPPMGWNSWNHFQCNINENLIKETADAMVSSGLAALGYEYINLDDCWAELNRDSKG 128
           GL  TP MGWNSWNHF C INE +I+ETADA+V++GLA LGY+Y+N+DDCWAE +RDS+G
Sbjct: 68  GLGRTPQMGWNSWNHFYCGINEQIIRETADALVNTGLAKLGYQYVNIDDCWAEYSRDSQG 127

Query: 129 NLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAKTFAAWG 188
           N V    TFPSGIKALADYVH KGLKLGIYSDAG QTCS KMPGSL HEEQD KTFA+WG
Sbjct: 128 NFVPNRQTFPSGIKALADYVHAKGLKLGIYSDAGSQTCSNKMPGSLDHEEQDVKTFASWG 187

Query: 189 IDYLKYDNCENTGTSPKERYPKMSKALQRSERPILFSLCEWGQEDPATWAINVGNSWRTT 248
           +DYLKYDNC + G S  ERY +MS A++   + I FSLCEWG+E+PATWA  +GNSWRTT
Sbjct: 188 VDYLKYDNCNDAGRSVMERYTRMSNAMKTYGKNIFFSLCEWGKENPATWAGRMGNSWRTT 247

Query: 249 SDIQDNWTSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTVEYRSHFSIWALAKAP 308
            DI DNW SMT+ AD+ND+WA+YA PGGWNDPDMLEVGNGGM+  EYRSHFSIWALAKAP
Sbjct: 248 GDIADNWGSMTSRADENDQWAAYAGPGGWNDPDMLEVGNGGMSEAEYRSHFSIWALAKAP 307

Query: 309 LLIGCDIRSMDNVTLKLLSNKEVIAVNQDKLGVQGKKVYKYGDLEVWAGPLSGKRVAVVL 368
           LLIGCD+RSM   T  +LSN EVIAVNQD LGVQGKKV     LEVWAGPLS  R AVVL
Sbjct: 308 LLIGCDVRSMSQQTKNILSNSEVIAVNQDSLGVQGKKVQSDNGLEVWAGPLSNNRKAVVL 367

Query: 369 WNRGLWRANITASWSDIGLSPSTVVTARDLWEHSRLVVQHHLTAEVDSHGCKMYVLTPH 426
           WNR  ++A ITA WS+IGL+ S  VTARDLW HS    Q  ++A V  H CKMYVLTP+
Sbjct: 368 WNRQSYQATITAHWSNIGLAGSVAVTARDLWAHSSFAAQGQISASVAPHDCKMYVLTPN 417

BLAST of Cla022885 vs. Swiss-Prot
Match: AGAL1_ARATH (Alpha-galactosidase 1 OS=Arabidopsis thaliana GN=AGAL1 PE=2 SV=1)

HSP 1 Score: 533.5 bits (1373), Expect = 2.1e-150
Identity = 260/415 (62.65%), Positives = 311/415 (74.94%), Query Frame = 1

Query: 12  PSFMVLLYSVLFWMFLLASNGSGNGGAVTGATRSRPLRFAAEFDSVSTRRVLLNNGLALT 71
           P  M+L+ S++  M ++ S+ S N G                 DS   RR LL NGL +T
Sbjct: 11  PILMILISSMV--MTMVESSRSVNNG---------------HDDSEILRRHLLTNGLGVT 70

Query: 72  PPMGWNSWNHFQCNINENLIKETADAMVSSGLAALGYEYINLDDCWAELNRDSKGNLVAK 131
           PPMGWNSWNHF CNI+E +IKETADA+V++GL+ LGY Y+N+DDCWAE++RDSKG+LV K
Sbjct: 71  PPMGWNSWNHFSCNIDEKMIKETADALVTTGLSKLGYNYVNIDDCWAEISRDSKGSLVPK 130

Query: 132 ASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAKTFAAWGIDYLK 191
            STFPSGIKA+ADYVH KGLKLGIYSDAG  TCSK MPGSLG+EE DAKTFA WGIDYLK
Sbjct: 131 KSTFPSGIKAVADYVHSKGLKLGIYSDAGYFTCSKTMPGSLGYEEHDAKTFAEWGIDYLK 190

Query: 192 YDNCENTGTSPKERYPKMSKALQRSERPILFSLCEWGQEDPATWAINVGNSWRTTSDIQD 251
           YDNC + G+ P  RYP M++AL +S RPI  SLCEWG   PA W   VGNSWRTT+DI+D
Sbjct: 191 YDNCNSDGSKPTVRYPVMTRALMKSGRPIFHSLCEWGDMHPALWGSPVGNSWRTTNDIKD 250

Query: 252 NWTSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTVEYRSHFSIWALAKAPLLIGC 311
            W SM +IAD N+ +A +A+PGGWNDPDMLEVGNGGMT  EY  HFSIWA++KAPLL+GC
Sbjct: 251 TWLSMISIADMNEVYAEHARPGGWNDPDMLEVGNGGMTKDEYIVHFSIWAISKAPLLLGC 310

Query: 312 DIRSMDNVTLKLLSNKEVIAVNQDKLGVQGKKVYKYGDLEVWAGPLSGKRVAVVLWNRGL 371
           DIR+M   T+++++NKEVIA+NQD  GVQ KKV   GDLEVWAGPLSG RVA++L NRG 
Sbjct: 311 DIRNMTKETMEIVANKEVIAINQDPHGVQAKKVRMEGDLEVWAGPLSGYRVALLLLNRGP 370

Query: 372 WRANITASWSDIGLSPSTVVTARDLWEHSRLVVQH--HLTAEVDSHGCKMYVLTP 425
            R +ITA W DI +  +++V ARDLWEH  L  +   +LTA VDSH CK+YVL P
Sbjct: 371 SRTSITALWEDIEIPANSIVEARDLWEHQTLKQKFVGNLTATVDSHACKLYVLKP 408

BLAST of Cla022885 vs. TrEMBL
Match: A0A061GFP9_THECC (Alpha-galactosidase OS=Theobroma cacao GN=TCM_027241 PE=3 SV=1)

HSP 1 Score: 628.2 bits (1619), Expect = 7.1e-177
Identity = 308/416 (74.04%), Positives = 337/416 (81.01%), Query Frame = 1

Query: 10  SPPSFMVLLYSVLFWMFLLASNGSGNGGAVTGATRSRPLRFAAEFDSVSTRRVLLNNGLA 69
           S PS     +     + LL S+ + +G  +  A  +         DS S RR+LLNNGL 
Sbjct: 4   SLPSSRFSAFFCCILLLLLISSATVSGARLGNARLT---------DSSSIRRILLNNGLG 63

Query: 70  LTPPMGWNSWNHFQCNINENLIKETADAMVSSGLAALGYEYINLDDCWAELNRDSKGNLV 129
           LTP MGWNSWN F CNINE LIKETADAMVSSGLAALGY YINLDDCW ELNRD++GNLV
Sbjct: 64  LTPQMGWNSWNRFHCNINETLIKETADAMVSSGLAALGYTYINLDDCWGELNRDTQGNLV 123

Query: 130 AKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAKTFAAWGIDY 189
            KASTFPSGIKALADYVH KGLKLGIY+DAG QTCSK MPGSLG+EEQDAKTFA+WG+DY
Sbjct: 124 PKASTFPSGIKALADYVHSKGLKLGIYADAGTQTCSKTMPGSLGYEEQDAKTFASWGVDY 183

Query: 190 LKYDNCENTGTSPKERYPKMSKALQRSERPILFSLCEWGQEDPATWAINVGNSWRTTSDI 249
           LKYDNC NTG SPKERYPKMSKAL  S R + FSLCEWG EDPATWA ++GNSWRTT DI
Sbjct: 184 LKYDNCANTGISPKERYPKMSKALLSSGRTMFFSLCEWGNEDPATWAPSIGNSWRTTGDI 243

Query: 250 QDNWTSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTVEYRSHFSIWALAKAPLLI 309
           +DNW  MT+IADQNDKWASYA+PG WNDPDMLEVGNGGMTT EYRSHFSIW+LAKAPLLI
Sbjct: 244 KDNWDRMTSIADQNDKWASYAQPGSWNDPDMLEVGNGGMTTEEYRSHFSIWSLAKAPLLI 303

Query: 310 GCDIRSMDNVTLKLLSNKEVIAVNQDKLGVQGKKVYKYGDLEVWAGPLSGKRVAVVLWNR 369
           GCDIRSMDNVT +LLSNKEVIAVNQDKLGVQGKKV K GDLEVWAGPL+  RVAVVLWNR
Sbjct: 304 GCDIRSMDNVTFELLSNKEVIAVNQDKLGVQGKKVKKDGDLEVWAGPLTDNRVAVVLWNR 363

Query: 370 GLWRANITASWSDIGLSPSTVVTARDLWEHS-RLVVQHHLTAEVDSHGCKMYVLTP 425
           G   ANITA WSDIGL PSTV   + LW HS  L VQ  L+A+VD+H C+MY +TP
Sbjct: 364 GSSSANITAYWSDIGLKPSTVCDVQHLWAHSTELSVQDQLSAQVDAHACRMYTITP 410

BLAST of Cla022885 vs. TrEMBL
Match: A0A061E9L0_THECC (Alpha-galactosidase OS=Theobroma cacao GN=TCM_007627 PE=3 SV=1)

HSP 1 Score: 624.4 bits (1609), Expect = 1.0e-175
Identity = 311/415 (74.94%), Positives = 339/415 (81.69%), Query Frame = 1

Query: 9   SSPPSFMVLLYSVLFWMFLLASNGSGNGGAVTGATRSRPLRFAAEFDSVSTRRVLLNNGL 68
           SS   F  +++ +L  MF+ +S    N   V G TR   +      DS + RR LL+NGL
Sbjct: 4   SSSSGFFPVVFGILCLMFI-SSLLLANSAKVKG-TRVGKIELT---DSATIRRNLLDNGL 63

Query: 69  ALTPPMGWNSWNHFQCNINENLIKETADAMVSSGLAALGYEYINLDDCWAELNRDSKGNL 128
            LTP MGWNSWNHF C+INE LIKETADAMVS+GLAA+GY YINLDDCW ELNRDS+GNL
Sbjct: 64  GLTPQMGWNSWNHFHCDINETLIKETADAMVSTGLAAVGYTYINLDDCWGELNRDSQGNL 123

Query: 129 VAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAKTFAAWGID 188
           V KASTFPSGIKALA YVH KGLKLGIYSDAG QTCSK MPGSLGHEEQDAKTFA+WGID
Sbjct: 124 VPKASTFPSGIKALARYVHSKGLKLGIYSDAGTQTCSKTMPGSLGHEEQDAKTFASWGID 183

Query: 189 YLKYDNCENTGTSPKERYPKMSKALQRSERPILFSLCEWGQEDPATWAINVGNSWRTTSD 248
           YLKYDNC NTG SPK+RYPKMSKAL  S RPI FSLCEWGQEDPATWA N+GNSWRTT D
Sbjct: 184 YLKYDNCANTGASPKQRYPKMSKALLDSGRPIFFSLCEWGQEDPATWAPNIGNSWRTTGD 243

Query: 249 IQDNWTSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTVEYRSHFSIWALAKAPLL 308
           I+D W SMT+IADQNDKWASYA+PG WNDPDMLEVGNGGMTT EYR HFSIWALAKAPLL
Sbjct: 244 IEDKWESMTSIADQNDKWASYAQPGAWNDPDMLEVGNGGMTTEEYRCHFSIWALAKAPLL 303

Query: 309 IGCDIRSMDNVTLKLLSNKEVIAVNQDKLGVQGKKVYKYGDLEVWAGPLSGKRVAVVLWN 368
           IGCD+RSMDNVT +LLSNKEVIAVNQDKLGVQGKKV K GDLEVWAGPL+  +VAVVLWN
Sbjct: 304 IGCDVRSMDNVTFELLSNKEVIAVNQDKLGVQGKKVKKDGDLEVWAGPLTNHKVAVVLWN 363

Query: 369 RGLWRANITASWSDIGLSPSTVVTARDLW---EHS-RLVVQHHLTAEVDSHGCKM 420
           RG   ANITA WSDIGL PST+V ARDLW   +HS     Q  ++AEVDSH CK+
Sbjct: 364 RGSSLANITAYWSDIGLKPSTIVDARDLWANEQHSTERSAQKQISAEVDSHACKI 413

BLAST of Cla022885 vs. TrEMBL
Match: A0A061E9L0_THECC (Alpha-galactosidase OS=Theobroma cacao GN=TCM_007627 PE=3 SV=1)

HSP 1 Score: 211.8 bits (538), Expect = 1.6e-51
Identity = 129/289 (44.64%), Positives = 164/289 (56.75%), Query Frame = 1

Query: 136 PSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAKTFAAWGIDYLKYDNC 195
           PSG+ AL             Y   G+ TCSK MPGSLGHE+QDA TFA+WGIDYLKYDNC
Sbjct: 437 PSGLSALGYE----------YIKLGLLTCSKTMPGSLGHEQQDANTFASWGIDYLKYDNC 496

Query: 196 ENTGTSPKERYPKMSKALQRSERPILFSLCEWGQEDPATWAINVGNSWRTTSDIQDNWTS 255
            N G SP+ER+ ++ K  +         LC           + V  + +T   +Q  + +
Sbjct: 497 HNQGVSPQERWVRLFKTPE--------DLCS---------TLFVNGALKT---LQLGYLA 556

Query: 256 MTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTVEYRSHFSIWALAKAPLLIGCDIRS 315
           + T+ +Q +      +     D DMLEVGNGGM+T EYRSHFS WAL KAP ++GCD RS
Sbjct: 557 LGTVGEQLETSRILGR-----DLDMLEVGNGGMSTEEYRSHFSSWALVKAPRILGCDTRS 616

Query: 316 MDNVTLKLLSNKEVIAVNQDKLGVQGKKVYKYGDLEVWAGPLSGKRVAVVLWNRGLWRAN 375
           MDN T +LLSN EVIAVNQD+LGVQGKKV K GDLEVWA  ++                 
Sbjct: 617 MDNDTFELLSNNEVIAVNQDELGVQGKKVRKIGDLEVWADGMT----------------- 663

Query: 376 ITASWSDIGLSPSTVVTARDLWEHSRLVVQHHLTAEVDSHGCKMYVLTP 425
                  +G   ST++    L  + R  V++ + A + SH CKMYVLTP
Sbjct: 677 -------LG---STLLQLLMLETYGRHSVRNQIKATLVSHACKMYVLTP 663


HSP 2 Score: 51.6 bits (122), Expect = 2.7e-03
Identity = 35/76 (46.05%), Positives = 45/76 (59.21%), Query Frame = 1

Query: 80  NHFQCNINENLIKETADAMVSSGLAALGYEYINLD--DCWAELNRDSKGNLVAKASTFPS 139
           +HF C + E LI +TAD MV SGL+ALGYEYI L    C ++    S G+    A+TF S
Sbjct: 417 SHFPCKLTEELIHQTADGMVPSGLSALGYEYIKLGLLTC-SKTMPGSLGHEQQDANTFAS 476

Query: 140 -GIKALA-DYVHRKGL 152
            GI  L  D  H +G+
Sbjct: 477 WGIDYLKYDNCHNQGV 491


HSP 3 Score: 622.1 bits (1603), Expect = 5.1e-175
Identity = 307/432 (71.06%), Positives = 341/432 (78.94%), Query Frame = 1

Query: 2   SSSPLPLSSPPSFMVLLYSVLFWMFL-LASNGSGNGGAVTGATRSRPLRFAAEFDS---- 61
           SSS   L    S +V L    F+ FL L SN    G         RP+    ++ +    
Sbjct: 7   SSSQSMLGKLGSGVVALSICFFFFFLRLVSNADAAG---------RPINMGKQYSNSSHD 66

Query: 62  ---VSTRRVLLNNGLALTPPMGWNSWNHFQCNINENLIKETADAMVSSGLAALGYEYINL 121
              +S  R L  NGL L PPMGWNSWNHF CNI E LI++TADAMVSSGLAALGYE++NL
Sbjct: 67  DRQLSRMRGLSANGLGLAPPMGWNSWNHFHCNIEEKLIRDTADAMVSSGLAALGYEHVNL 126

Query: 122 DDCWAELNRDSKGNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLG 181
           DDCWAELNRDS+GNLV KASTFPSGIKALADY+H KGLKLGIYSDAG QTCS  MPGSLG
Sbjct: 127 DDCWAELNRDSEGNLVPKASTFPSGIKALADYIHGKGLKLGIYSDAGSQTCSGTMPGSLG 186

Query: 182 HEEQDAKTFAAWGIDYLKYDNCENTGTSPKERYPKMSKALQRSERPILFSLCEWGQEDPA 241
           HEEQDAKTFA+WG+DYLKYDNC N GTSPKERYP MSKAL  S RPI FSLCEWGQEDPA
Sbjct: 187 HEEQDAKTFASWGVDYLKYDNCNNDGTSPKERYPVMSKALLNSGRPIFFSLCEWGQEDPA 246

Query: 242 TWAINVGNSWRTTSDIQDNWTSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTVEY 301
           TWA NVGNSWRTT DI DNW SMT+ ADQND+WASYA PGGWNDPDMLEVGNGGMTT EY
Sbjct: 247 TWASNVGNSWRTTGDISDNWDSMTSRADQNDQWASYAAPGGWNDPDMLEVGNGGMTTEEY 306

Query: 302 RSHFSIWALAKAPLLIGCDIRSMDNVTLKLLSNKEVIAVNQDKLGVQGKKVYKYGDLEVW 361
           RSHFSIWALAKAPLLIGCD+R+M + T+++LSN+EVIAVNQDKLGVQGKKV   GDLEVW
Sbjct: 307 RSHFSIWALAKAPLLIGCDVRTMSDETIEILSNREVIAVNQDKLGVQGKKVKNNGDLEVW 366

Query: 362 AGPLSGKRVAVVLWNRGLWRANITASWSDIGLSPSTVVTARDLWEHS-RLVVQHHLTAEV 421
           AGPLS  ++AVVLWNRG  RA +TA WSDIGL P+T V ARDLW HS +  V+  ++A++
Sbjct: 367 AGPLSNNKIAVVLWNRGSSRATVTAYWSDIGLDPTTTVNARDLWAHSNQPSVKGQISADL 426

Query: 422 DSHGCKMYVLTP 425
           DSH CKMYVLTP
Sbjct: 427 DSHACKMYVLTP 429

BLAST of Cla022885 vs. TrEMBL
Match: A0A0D2W3G5_GOSRA (Alpha-galactosidase OS=Gossypium raimondii GN=B456_013G066400 PE=3 SV=1)

HSP 1 Score: 618.2 bits (1593), Expect = 7.3e-174
Identity = 305/418 (72.97%), Positives = 334/418 (79.90%), Query Frame = 1

Query: 12  PSFMVLLYSVLFWMF--LLASNGSGNGGAVTGATRSRPLRFAAEFDSVSTRRVLLNNGLA 71
           PS   L +  L  +F     S+ +  G  V G+ R R             RR L++NGL 
Sbjct: 3   PSSSTLCFLFLLGIFSSTFTSSFTEIGARVEGSGRIR-------------RRTLMDNGLG 62

Query: 72  LTPPMGWNSWNHFQCNINENLIKETADAMVSSGLAALGYEYINLDDCWAELNRDSKGNLV 131
           LTP MGWNSWNHF C+INE LIKETADAMVS+GL+A+GY YINLDDCW ELNRDS+GNLV
Sbjct: 63  LTPQMGWNSWNHFHCDINETLIKETADAMVSTGLSAVGYIYINLDDCWGELNRDSQGNLV 122

Query: 132 AKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAKTFAAWGIDY 191
            KAS+FPSGIKALADY H KGLKLGIYSDAG QTCSK MPGSLGHEEQDAKTFA WG+DY
Sbjct: 123 PKASSFPSGIKALADYAHSKGLKLGIYSDAGTQTCSKTMPGSLGHEEQDAKTFALWGVDY 182

Query: 192 LKYDNCENTG-TSPKERYPKMSKALQRSERPILFSLCEWGQEDPATWAINVGNSWRTTSD 251
           LKYDNCE+TG  SPKERYPKMS+AL  S RP+ FSLCEWGQEDPATWA ++GNSWRTT D
Sbjct: 183 LKYDNCEDTGGLSPKERYPKMSEALLNSGRPMFFSLCEWGQEDPATWAPSIGNSWRTTGD 242

Query: 252 IQDNWTSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTVEYRSHFSIWALAKAPLL 311
           I+DNW SMT IADQND+WASYAKPG WNDPDMLEVGNGGMTT EYR HFSIWALAKAPLL
Sbjct: 243 IEDNWDSMTGIADQNDQWASYAKPGAWNDPDMLEVGNGGMTTEEYRCHFSIWALAKAPLL 302

Query: 312 IGCDIRSMDNVTLKLLSNKEVIAVNQDKLGVQGKKVYKYGDLEVWAGPLSGKRVAVVLWN 371
           IGCD+RSMDNVT +L++NKEVI VNQDKLGVQGKKV K GDLEVWAGPL+   VAVVLWN
Sbjct: 303 IGCDVRSMDNVTFELVANKEVIDVNQDKLGVQGKKVKKEGDLEVWAGPLANNMVAVVLWN 362

Query: 372 RGLWRANITASWSDIGLSPSTVVTARDLWEHS-RLVVQHHLTAEVDSHGCKMYVLTPH 426
           RG   ANITA WSDIGL PSTVV  RDLW HS    V+  ++AEVDSH CKMY L PH
Sbjct: 363 RGSSSANITAYWSDIGLKPSTVVDCRDLWAHSTETGVEDEISAEVDSHACKMYTLKPH 407

BLAST of Cla022885 vs. TrEMBL
Match: W5RW77_CAMSI (Alpha-galactosidase OS=Camellia sinensis GN=AGAL1 PE=2 SV=1)

HSP 1 Score: 616.3 bits (1588), Expect = 2.8e-173
Identity = 295/367 (80.38%), Positives = 319/367 (86.92%), Query Frame = 1

Query: 59  TRRVLLNNGLALTPPMGWNSWNHFQCNINENLIKETADAMVSSGLAALGYEYINLDDCWA 118
           TRR L++NGL LTP MGWNSWNHFQCNI E LIK+TADAMVS+GLA LGY+YINLDDCW 
Sbjct: 50  TRRTLMDNGLGLTPQMGWNSWNHFQCNIEEKLIKQTADAMVSTGLADLGYKYINLDDCWG 109

Query: 119 ELNRDSKGNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQD 178
           ELNRDS+GNLV KASTFP GIKALADYVH KGLKLGIYSDAG QTCSK MPGSLGHEEQD
Sbjct: 110 ELNRDSEGNLVPKASTFPHGIKALADYVHSKGLKLGIYSDAGTQTCSKTMPGSLGHEEQD 169

Query: 179 AKTFAAWGIDYLKYDNCENTGTSPKERYPKMSKALQRSERPILFSLCEWGQEDPATWAIN 238
           AKTFA+WGIDYLKYDNC + GTSPKERYPKMS AL +S R I FSLCEWG+EDPATWA  
Sbjct: 170 AKTFASWGIDYLKYDNCNDDGTSPKERYPKMSNALLKSGRSIFFSLCEWGREDPATWAPA 229

Query: 239 VGNSWRTTSDIQDNWTSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTVEYRSHFS 298
           +GNSWRTT DI DNW SMT+ ADQNDKWASYA PGGWNDPDMLEVGNGGMT  EYRSHFS
Sbjct: 230 IGNSWRTTGDISDNWNSMTSRADQNDKWASYAGPGGWNDPDMLEVGNGGMTIAEYRSHFS 289

Query: 299 IWALAKAPLLIGCDIRSMDNVTLKLLSNKEVIAVNQDKLGVQGKKVYKYGDLEVWAGPLS 358
           IWALAKAPLLIGCDIR++DNVTL LLSN EVIAVNQDKLGVQGKKV K GDLEVWAGPL+
Sbjct: 290 IWALAKAPLLIGCDIRAIDNVTLGLLSNNEVIAVNQDKLGVQGKKVKKDGDLEVWAGPLN 349

Query: 359 GKRVAVVLWNRGLWRANITASWSDIGLSPSTVVTARDLWEHS-RLVVQHHLTAEVDSHGC 418
            K+VAV+LWNRG   A ITA W DIGL+ +T+V ARDLWEHS +  V+  L+A+V SH C
Sbjct: 350 HKKVAVILWNRGPSTATITAHWFDIGLTRTTIVNARDLWEHSTKKSVKGQLSADVASHDC 409

Query: 419 KMYVLTP 425
           KMYVLTP
Sbjct: 410 KMYVLTP 416

BLAST of Cla022885 vs. NCBI nr
Match: gi|659090245|ref|XP_008445911.1| (PREDICTED: alpha-galactosidase [Cucumis melo])

HSP 1 Score: 791.6 bits (2043), Expect = 6.9e-226
Identity = 380/416 (91.35%), Positives = 395/416 (94.95%), Query Frame = 1

Query: 10  SPPSFMVLLYSVLFWMFLLASNGSGNGGAVTGATRSRPLRFAAEFDSVSTRRVLLNNGLA 69
           S PS MVLLY VL W FLL   G+GNG AV GA+RS  LRFAAEFDSVS+RRVLLNNGLA
Sbjct: 6   SSPSVMVLLYFVLSWTFLLG--GNGNGVAVAGASRSTALRFAAEFDSVSSRRVLLNNGLA 65

Query: 70  LTPPMGWNSWNHFQCNINENLIKETADAMVSSGLAALGYEYINLDDCWAELNRDSKGNLV 129
           LTPPMGWNSWNHFQCN+NENLIKETADAMVSSGLAALGYEYINLDDCWAEL+RDSKGNLV
Sbjct: 66  LTPPMGWNSWNHFQCNLNENLIKETADAMVSSGLAALGYEYINLDDCWAELDRDSKGNLV 125

Query: 130 AKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAKTFAAWGIDY 189
           AKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLG+EEQDAKTFA+WGIDY
Sbjct: 126 AKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGNEEQDAKTFASWGIDY 185

Query: 190 LKYDNCENTGTSPKERYPKMSKALQRSERPILFSLCEWGQEDPATWAINVGNSWRTTSDI 249
           LKYDNCENTGTSPKERYPKM+KALQ+S RPILFSLCEWGQEDPATWA+NVGNSWRTTSDI
Sbjct: 186 LKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVGNSWRTTSDI 245

Query: 250 QDNWTSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTVEYRSHFSIWALAKAPLLI 309
           QDNW SMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTT EYRSHFSIWALAKAPLLI
Sbjct: 246 QDNWISMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALAKAPLLI 305

Query: 310 GCDIRSMDNVTLKLLSNKEVIAVNQDKLGVQGKKVYKYGDLEVWAGPLSGKRVAVVLWNR 369
           GCDIRSMDN+T+KLLSNKEVIAVNQDKLGVQGKKVYKYGDLEVWAGPLSGKRVAVVLWNR
Sbjct: 306 GCDIRSMDNITMKLLSNKEVIAVNQDKLGVQGKKVYKYGDLEVWAGPLSGKRVAVVLWNR 365

Query: 370 GLWRANITASWSDIGLSPSTVVTARDLWEHSRLVVQHHLTAEVDSHGCKMYVLTPH 426
           GLWRANITASWSDIGL  ST VTARDLW+HS  VVQHHLTA+VDSH CKM+VLTPH
Sbjct: 366 GLWRANITASWSDIGLCSSTTVTARDLWQHSSQVVQHHLTAQVDSHDCKMFVLTPH 419

BLAST of Cla022885 vs. NCBI nr
Match: gi|525506969|ref|NP_001267530.1| (alpha-galactosidase-like precursor [Cucumis sativus])

HSP 1 Score: 776.9 bits (2005), Expect = 1.8e-221
Identity = 371/420 (88.33%), Positives = 392/420 (93.33%), Query Frame = 1

Query: 6   LPLSSPPSFMVLLYSVLFWMFLLASNGSGNGGAVTGATRSRPLRFAAEFDSVSTRRVLLN 65
           LP SSP S M+LLY +LFW FLL  NG+GNG  V  A+RS  LRFA EFDS S+RR+LLN
Sbjct: 3   LPPSSP-SVMLLLYFLLFWTFLLGGNGNGNGVVVAAASRSTALRFATEFDSASSRRILLN 62

Query: 66  NGLALTPPMGWNSWNHFQCNINENLIKETADAMVSSGLAALGYEYINLDDCWAELNRDSK 125
           NGLALTPPMGWNSWNHFQCN+NENLIKETADAMVS+GLAALGY+YINLDDCWAEL+RDSK
Sbjct: 63  NGLALTPPMGWNSWNHFQCNLNENLIKETADAMVSTGLAALGYQYINLDDCWAELDRDSK 122

Query: 126 GNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAKTFAAW 185
           GNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGI+TCSK+MPGSLGHEEQDAKTFA+W
Sbjct: 123 GNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIRTCSKRMPGSLGHEEQDAKTFASW 182

Query: 186 GIDYLKYDNCENTGTSPKERYPKMSKALQRSERPILFSLCEWGQEDPATWAINVGNSWRT 245
           GIDYLKYDNCENTGTSPKERYPKM+KALQ+S RPILFSLCEWGQEDPATWA+NVGNSWRT
Sbjct: 183 GIDYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVGNSWRT 242

Query: 246 TSDIQDNWTSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTVEYRSHFSIWALAKA 305
           TSDIQDNW SMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMT  EYRSHFSIWALAKA
Sbjct: 243 TSDIQDNWISMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTIAEYRSHFSIWALAKA 302

Query: 306 PLLIGCDIRSMDNVTLKLLSNKEVIAVNQDKLGVQGKKVYKYGDLEVWAGPLSGKRVAVV 365
           PLLIGCDIRSMDN T+KLLSNKEVIAVNQDKLGVQGKKV+KYGDLEVWAG LSGKRVAVV
Sbjct: 303 PLLIGCDIRSMDNNTMKLLSNKEVIAVNQDKLGVQGKKVHKYGDLEVWAGLLSGKRVAVV 362

Query: 366 LWNRGLWRANITASWSDIGLSPSTVVTARDLWEHSRLVVQHHLTAEVDSHGCKMYVLTPH 425
           LWNR LWRANITA WSDIGLS ST VTARDLWEHS  VV+HHLTA+VDSH CKM+VLTPH
Sbjct: 363 LWNRSLWRANITAYWSDIGLSSSTTVTARDLWEHSSQVVRHHLTAQVDSHDCKMFVLTPH 421

BLAST of Cla022885 vs. NCBI nr
Match: gi|590615546|ref|XP_007023252.1| (Alpha-galactosidase 2 [Theobroma cacao])

HSP 1 Score: 628.2 bits (1619), Expect = 1.0e-176
Identity = 308/416 (74.04%), Positives = 337/416 (81.01%), Query Frame = 1

Query: 10  SPPSFMVLLYSVLFWMFLLASNGSGNGGAVTGATRSRPLRFAAEFDSVSTRRVLLNNGLA 69
           S PS     +     + LL S+ + +G  +  A  +         DS S RR+LLNNGL 
Sbjct: 4   SLPSSRFSAFFCCILLLLLISSATVSGARLGNARLT---------DSSSIRRILLNNGLG 63

Query: 70  LTPPMGWNSWNHFQCNINENLIKETADAMVSSGLAALGYEYINLDDCWAELNRDSKGNLV 129
           LTP MGWNSWN F CNINE LIKETADAMVSSGLAALGY YINLDDCW ELNRD++GNLV
Sbjct: 64  LTPQMGWNSWNRFHCNINETLIKETADAMVSSGLAALGYTYINLDDCWGELNRDTQGNLV 123

Query: 130 AKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAKTFAAWGIDY 189
            KASTFPSGIKALADYVH KGLKLGIY+DAG QTCSK MPGSLG+EEQDAKTFA+WG+DY
Sbjct: 124 PKASTFPSGIKALADYVHSKGLKLGIYADAGTQTCSKTMPGSLGYEEQDAKTFASWGVDY 183

Query: 190 LKYDNCENTGTSPKERYPKMSKALQRSERPILFSLCEWGQEDPATWAINVGNSWRTTSDI 249
           LKYDNC NTG SPKERYPKMSKAL  S R + FSLCEWG EDPATWA ++GNSWRTT DI
Sbjct: 184 LKYDNCANTGISPKERYPKMSKALLSSGRTMFFSLCEWGNEDPATWAPSIGNSWRTTGDI 243

Query: 250 QDNWTSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTVEYRSHFSIWALAKAPLLI 309
           +DNW  MT+IADQNDKWASYA+PG WNDPDMLEVGNGGMTT EYRSHFSIW+LAKAPLLI
Sbjct: 244 KDNWDRMTSIADQNDKWASYAQPGSWNDPDMLEVGNGGMTTEEYRSHFSIWSLAKAPLLI 303

Query: 310 GCDIRSMDNVTLKLLSNKEVIAVNQDKLGVQGKKVYKYGDLEVWAGPLSGKRVAVVLWNR 369
           GCDIRSMDNVT +LLSNKEVIAVNQDKLGVQGKKV K GDLEVWAGPL+  RVAVVLWNR
Sbjct: 304 GCDIRSMDNVTFELLSNKEVIAVNQDKLGVQGKKVKKDGDLEVWAGPLTDNRVAVVLWNR 363

Query: 370 GLWRANITASWSDIGLSPSTVVTARDLWEHS-RLVVQHHLTAEVDSHGCKMYVLTP 425
           G   ANITA WSDIGL PSTV   + LW HS  L VQ  L+A+VD+H C+MY +TP
Sbjct: 364 GSSSANITAYWSDIGLKPSTVCDVQHLWAHSTELSVQDQLSAQVDAHACRMYTITP 410
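
The Identity and Positives percentages in the HSP header above are match counts divided by the alignment length (for example 308/416). The following minimal Python sketch reproduces that arithmetic. The helper names `percent` and `midline_counts` are illustrative and not part of any BLAST tool, and the sketch assumes the usual BLAST text layout in which a letter on the midline marks an identical residue and `+` marks a conservative substitution.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches / aligned_length as a percentage rounded to 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


def midline_counts(midline: str) -> tuple[int, int]:
    """Count (identities, positives) on one BLAST midline string.

    Assumption: letters are identical residues, '+' is a conservative
    substitution, and spaces are mismatches or gaps.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    return identical, identical + conservative


# Values taken from the HSP header above (Identity = 308/416, Positives = 337/416).
print(percent(308, 416))  # 74.04
print(percent(337, 416))  # 81.01
```

To recompute the totals for a whole HSP, `midline_counts` would be applied to every midline row and the per-row counts summed before calling `percent` with the full alignment length.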

BLAST of Cla022885 vs. NCBI nr
Match: gi|590689149|ref|XP_007043147.1| (Alpha-galactosidase 2 [Theobroma cacao])

HSP 1 Score: 624.4 bits (1609), Expect = 1.5e-175
Identity = 311/415 (74.94%), Positives = 339/415 (81.69%), Query Frame = 1

Query: 9   SSPPSFMVLLYSVLFWMFLLASNGSGNGGAVTGATRSRPLRFAAEFDSVSTRRVLLNNGL 68
           SS   F  +++ +L  MF+ +S    N   V G TR   +      DS + RR LL+NGL
Sbjct: 4   SSSSGFFPVVFGILCLMFI-SSLLLANSAKVKG-TRVGKIELT---DSATIRRNLLDNGL 63

Query: 69  ALTPPMGWNSWNHFQCNINENLIKETADAMVSSGLAALGYEYINLDDCWAELNRDSKGNL 128
            LTP MGWNSWNHF C+INE LIKETADAMVS+GLAA+GY YINLDDCW ELNRDS+GNL
Sbjct: 64  GLTPQMGWNSWNHFHCDINETLIKETADAMVSTGLAAVGYTYINLDDCWGELNRDSQGNL 123

Query: 129 VAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAKTFAAWGID 188
           V KASTFPSGIKALA YVH KGLKLGIYSDAG QTCSK MPGSLGHEEQDAKTFA+WGID
Sbjct: 124 VPKASTFPSGIKALARYVHSKGLKLGIYSDAGTQTCSKTMPGSLGHEEQDAKTFASWGID 183

Query: 189 YLKYDNCENTGTSPKERYPKMSKALQRSERPILFSLCEWGQEDPATWAINVGNSWRTTSD 248
           YLKYDNC NTG SPK+RYPKMSKAL  S RPI FSLCEWGQEDPATWA N+GNSWRTT D
Sbjct: 184 YLKYDNCANTGASPKQRYPKMSKALLDSGRPIFFSLCEWGQEDPATWAPNIGNSWRTTGD 243

Query: 249 IQDNWTSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTVEYRSHFSIWALAKAPLL 308
           I+D W SMT+IADQNDKWASYA+PG WNDPDMLEVGNGGMTT EYR HFSIWALAKAPLL
Sbjct: 244 IEDKWESMTSIADQNDKWASYAQPGAWNDPDMLEVGNGGMTTEEYRCHFSIWALAKAPLL 303

Query: 309 IGCDIRSMDNVTLKLLSNKEVIAVNQDKLGVQGKKVYKYGDLEVWAGPLSGKRVAVVLWN 368
           IGCD+RSMDNVT +LLSNKEVIAVNQDKLGVQGKKV K GDLEVWAGPL+  +VAVVLWN
Sbjct: 304 IGCDVRSMDNVTFELLSNKEVIAVNQDKLGVQGKKVKKDGDLEVWAGPLTNHKVAVVLWN 363

Query: 369 RGLWRANITASWSDIGLSPSTVVTARDLW---EHS-RLVVQHHLTAEVDSHGCKM 420
           RG   ANITA WSDIGL PST+V ARDLW   +HS     Q  ++AEVDSH CK+
Sbjct: 364 RGSSLANITAYWSDIGLKPSTIVDARDLWANEQHSTERSAQKQISAEVDSHACKI 413

BLAST of Cla022885 vs. NCBI nr
Match: gi|590689149|ref|XP_007043147.1| (Alpha-galactosidase 2 [Theobroma cacao])

HSP 1 Score: 211.8 bits (538), Expect = 2.3e-51
Identity = 129/289 (44.64%), Positives = 164/289 (56.75%), Query Frame = 1

Query: 136 PSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAKTFAAWGIDYLKYDNC 195
           PSG+ AL             Y   G+ TCSK MPGSLGHE+QDA TFA+WGIDYLKYDNC
Sbjct: 437 PSGLSALGYE----------YIKLGLLTCSKTMPGSLGHEQQDANTFASWGIDYLKYDNC 496

Query: 196 ENTGTSPKERYPKMSKALQRSERPILFSLCEWGQEDPATWAINVGNSWRTTSDIQDNWTS 255
            N G SP+ER+ ++ K  +         LC           + V  + +T   +Q  + +
Sbjct: 497 HNQGVSPQERWVRLFKTPE--------DLCS---------TLFVNGALKT---LQLGYLA 556

Query: 256 MTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTVEYRSHFSIWALAKAPLLIGCDIRS 315
           + T+ +Q +      +     D DMLEVGNGGM+T EYRSHFS WAL KAP ++GCD RS
Sbjct: 557 LGTVGEQLETSRILGR-----DLDMLEVGNGGMSTEEYRSHFSSWALVKAPRILGCDTRS 616

Query: 316 MDNVTLKLLSNKEVIAVNQDKLGVQGKKVYKYGDLEVWAGPLSGKRVAVVLWNRGLWRAN 375
           MDN T +LLSN EVIAVNQD+LGVQGKKV K GDLEVWA  ++                 
Sbjct: 617 MDNDTFELLSNNEVIAVNQDELGVQGKKVRKIGDLEVWADGMT----------------- 663

Query: 376 ITASWSDIGLSPSTVVTARDLWEHSRLVVQHHLTAEVDSHGCKMYVLTP 425
                  +G   ST++    L  + R  V++ + A + SH CKMYVLTP
Sbjct: 677 -------LG---STLLQLLMLETYGRHSVRNQIKATLVSHACKMYVLTP 663


HSP 2 Score: 51.6 bits (122), Expect = 3.9e-03
Identity = 35/76 (46.05%), Positives = 45/76 (59.21%), Query Frame = 1

Query: 80  NHFQCNINENLIKETADAMVSSGLAALGYEYINLD--DCWAELNRDSKGNLVAKASTFPS 139
           +HF C + E LI +TAD MV SGL+ALGYEYI L    C ++    S G+    A+TF S
Sbjct: 417 SHFPCKLTEELIHQTADGMVPSGLSALGYEYIKLGLLTC-SKTMPGSLGHEQQDANTFAS 476

Query: 140 -GIKALA-DYVHRKGL 152
            GI  L  D  H +G+
Sbjct: 477 WGIDYLKYDNCHNQGV 491


HSP 3 Score: 622.1 bits (1603), Expect = 7.3e-175
Identity = 307/432 (71.06%), Positives = 341/432 (78.94%), Query Frame = 1

Query: 2   SSSPLPLSSPPSFMVLLYSVLFWMFL-LASNGSGNGGAVTGATRSRPLRFAAEFDS---- 61
           SSS   L    S +V L    F+ FL L SN    G         RP+    ++ +    
Sbjct: 7   SSSQSMLGKLGSGVVALSICFFFFFLRLVSNADAAG---------RPINMGKQYSNSSHD 66

Query: 62  ---VSTRRVLLNNGLALTPPMGWNSWNHFQCNINENLIKETADAMVSSGLAALGYEYINL 121
              +S  R L  NGL L PPMGWNSWNHF CNI E LI++TADAMVSSGLAALGYE++NL
Sbjct: 67  DRQLSRMRGLSANGLGLAPPMGWNSWNHFHCNIEEKLIRDTADAMVSSGLAALGYEHVNL 126

Query: 122 DDCWAELNRDSKGNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLG 181
           DDCWAELNRDS+GNLV KASTFPSGIKALADY+H KGLKLGIYSDAG QTCS  MPGSLG
Sbjct: 127 DDCWAELNRDSEGNLVPKASTFPSGIKALADYIHGKGLKLGIYSDAGSQTCSGTMPGSLG 186

Query: 182 HEEQDAKTFAAWGIDYLKYDNCENTGTSPKERYPKMSKALQRSERPILFSLCEWGQEDPA 241
           HEEQDAKTFA+WG+DYLKYDNC N GTSPKERYP MSKAL  S RPI FSLCEWGQEDPA
Sbjct: 187 HEEQDAKTFASWGVDYLKYDNCNNDGTSPKERYPVMSKALLNSGRPIFFSLCEWGQEDPA 246

Query: 242 TWAINVGNSWRTTSDIQDNWTSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTVEY 301
           TWA NVGNSWRTT DI DNW SMT+ ADQND+WASYA PGGWNDPDMLEVGNGGMTT EY
Sbjct: 247 TWASNVGNSWRTTGDISDNWDSMTSRADQNDQWASYAAPGGWNDPDMLEVGNGGMTTEEY 306

Query: 302 RSHFSIWALAKAPLLIGCDIRSMDNVTLKLLSNKEVIAVNQDKLGVQGKKVYKYGDLEVW 361
           RSHFSIWALAKAPLLIGCD+R+M + T+++LSN+EVIAVNQDKLGVQGKKV   GDLEVW
Sbjct: 307 RSHFSIWALAKAPLLIGCDVRTMSDETIEILSNREVIAVNQDKLGVQGKKVKNNGDLEVW 366

Query: 362 AGPLSGKRVAVVLWNRGLWRANITASWSDIGLSPSTVVTARDLWEHS-RLVVQHHLTAEV 421
           AGPLS  ++AVVLWNRG  RA +TA WSDIGL P+T V ARDLW HS +  V+  ++A++
Sbjct: 367 AGPLSNNKIAVVLWNRGSSRATVTAYWSDIGLDPTTTVNARDLWAHSNQPSVKGQISADL 426

Query: 422 DSHGCKMYVLTP 425
           DSH CKMYVLTP
Sbjct: 427 DSHACKMYVLTP 429

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
AGAL2_ARATH  8.6e-176  80.22     Alpha-galactosidase 2 OS=Arabidopsis thaliana GN=AGAL2 PE=1 SV=1
AGAL_COFAR   1.0e-173  79.84     Alpha-galactosidase OS=Coffea arabica PE=1 SV=1
AGAL_CYATE   1.4e-173  73.78     Alpha-galactosidase OS=Cyamopsis tetragonoloba PE=1 SV=1
AGAL_ORYSJ   4.4e-156  64.44     Alpha-galactosidase OS=Oryza sativa subsp. japonica GN=Os10g0493600 PE=1 SV=1
AGAL1_ARATH  2.1e-150  62.65     Alpha-galactosidase 1 OS=Arabidopsis thaliana GN=AGAL1 PE=2 SV=1

Match Name        E-value   Identity  Description
A0A061GFP9_THECC  7.1e-177  74.04     Alpha-galactosidase OS=Theobroma cacao GN=TCM_027241 PE=3 SV=1
A0A061E9L0_THECC  1.0e-175  74.94     Alpha-galactosidase OS=Theobroma cacao GN=TCM_007627 PE=3 SV=1
A0A061E9L0_THECC  1.6e-51   44.64     Alpha-galactosidase OS=Theobroma cacao GN=TCM_007627 PE=3 SV=1
A0A0D2W3G5_GOSRA  7.3e-174  72.97     Alpha-galactosidase OS=Gossypium raimondii GN=B456_013G066400 PE=3 SV=1
W5RW77_CAMSI      2.8e-173  80.38     Alpha-galactosidase OS=Camellia sinensis GN=AGAL1 PE=2 SV=1

Match Name                        E-value   Identity  Description
gi|659090245|ref|XP_008445911.1|  6.9e-226  91.35     PREDICTED: alpha-galactosidase [Cucumis melo]
gi|525506969|ref|NP_001267530.1|  1.8e-221  88.33     alpha-galactosidase-like precursor [Cucumis sativus]
gi|590615546|ref|XP_007023252.1|  1.0e-176  74.04     Alpha-galactosidase 2 [Theobroma cacao]
gi|590689149|ref|XP_007043147.1|  1.5e-175  74.94     Alpha-galactosidase 2 [Theobroma cacao]
gi|590689149|ref|XP_007043147.1|  2.3e-51   44.64     Alpha-galactosidase 2 [Theobroma cacao]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR000111  Glyco_hydro_27/36_CS
IPR002241  Glyco_hydro_27
IPR013780  Glyco_hydro_b
IPR013785  Aldolase_TIM
IPR017853  Glycoside_hydrolase_SF

Vocabulary: Molecular Function
Term        Definition
GO:0004553  hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0003824  catalytic activity

Vocabulary: Biological Process
Term        Definition
GO:0005975  carbohydrate metabolic process

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009965 leaf morphogenesis
biological_process GO:0000023 maltose metabolic process
biological_process GO:0007020 microtubule nucleation
biological_process GO:0043085 positive regulation of catalytic activity
biological_process GO:0009911 positive regulation of flower development
biological_process GO:0019252 starch biosynthetic process
biological_process GO:0005975 carbohydrate metabolic process
cellular_component GO:0009505 plant-type cell wall
molecular_function GO:0052692 raffinose alpha-galactosidase activity
molecular_function GO:0003824 catalytic activity
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds
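
The GO assignment rows above follow a fixed "category, accession, term name" layout, so they can be grouped by category with a few lines of Python. This is only a sketch: it assumes that the fields are separated by single spaces and that the term name is everything after the accession, and `group_go_terms` is a hypothetical helper name rather than part of any database API.

```python
from collections import defaultdict

# Rows copied from the GO Assignments table above.
GO_ROWS = """\
biological_process GO:0009965 leaf morphogenesis
biological_process GO:0000023 maltose metabolic process
biological_process GO:0007020 microtubule nucleation
biological_process GO:0043085 positive regulation of catalytic activity
biological_process GO:0009911 positive regulation of flower development
biological_process GO:0019252 starch biosynthetic process
biological_process GO:0005975 carbohydrate metabolic process
cellular_component GO:0009505 plant-type cell wall
molecular_function GO:0052692 raffinose alpha-galactosidase activity
molecular_function GO:0003824 catalytic activity
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds
"""


def group_go_terms(text: str) -> dict[str, list[tuple[str, str]]]:
    """Group 'category accession name' rows into {category: [(accession, name)]}."""
    grouped: dict[str, list[tuple[str, str]]] = defaultdict(list)
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Split on the first two spaces only; the term name may contain spaces.
        category, accession, name = line.split(" ", 2)
        grouped[category].append((accession, name))
    return dict(grouped)


terms = group_go_terms(GO_ROWS)
for category, entries in terms.items():
    print(category, len(entries))
# biological_process 7
# cellular_component 1
# molecular_function 3
```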
This gene is associated with the following unigenes:
Unigene Name  Analysis Name                          Sequence type in Unigene
WMU79022      watermelon EST collection version 2.0  transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Cla022885     Cla022885.1  mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name  Unique Name  Type
WMU79022      WMU79022     transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term   IPR Description                                   Source   Source Term         Source Description                                   Alignment
IPR000111  Glycoside hydrolase family 27/36, conserved site  PROSITE  PS00512             ALPHA_GALACTOSIDASE                                  coord: 107..123, scor
IPR002241  Glycoside hydrolase, family 27                    PRINTS   PR00740             GLHYDRLASE27                                         coord: 143..164, score: 2.2E-66; coord: 102..117, score: 2.2E-66; coord: 205..223, score: 2.2E-66; coord: 291..312, score: 2.2E-66; coord: 270..289, score: 2.2E-66; coord: 66..85, score: 2.2E-66; coord: 178..195, score: 2.2
IPR002241  Glycoside hydrolase, family 27                    PFAM     PF16499             Melibiase_2                                          coord: 71..379, score: 5.9
IPR013780  Glycosyl hydrolase, all-beta                      GENE3D   G3DSA:2.60.40.1180                                                       coord: 330..424, score: 4.4
IPR013785  Aldolase-type TIM barrel                          GENE3D   G3DSA:3.20.20.70                                                         coord: 62..329, score: 6.4E
IPR017853  Glycoside hydrolase superfamily                   unknown  SSF51445            (Trans)glycosidases                                  coord: 62..335, score: 6.32E
None       No IPR available                                  PANTHER  PTHR11452           ALPHA-GALACTOSIDASE/ALPHA-N-ACETYLGALACTOSAMINIDASE  coord: 59..424, score: 4.7E
None       No IPR available                                  PANTHER  PTHR11452:SF33      SUBFAMILY NOT NAMED                                  coord: 59..424, score: 4.7E
None       No IPR available                                  unknown  SSF51011            Glycosyl hydrolase domain                            coord: 330..423, score: 1.42