Tan0010655.4 (mRNA) Snake gourd v1

Overview
Name: Tan0010655.4
Type: mRNA
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Alpha-galactosidase
Location: LG04: 3771945 .. 3777115 (+)
Sequence length: 2939
RNA-Seq Expression: Tan0010655.4
Synteny: Tan0010655.4
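
The Location and Sequence length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (as in GFF3) and that the listed sequence length refers to the mRNA; under those assumptions, the difference between the genomic span and the mRNA length would be the sequence removed by splicing. The function and variable names are illustrative, not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval (assumed convention)."""
    return end - start + 1


# Values copied from the Overview fields above.
locus_start, locus_end = 3771945, 3777115
mrna_length = 2939

genomic_span = span_length(locus_start, locus_end)
# Assumption: everything in the span that is not in the mRNA is intronic.
spliced_out = genomic_span - mrna_length

print(f"genomic span: {genomic_span} bp")
print(f"not in mRNA:  {spliced_out} bp")
```

If the database instead used half-open coordinates, the span would be one base shorter, so the result is only as reliable as the stated convention.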
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TTATTCGGGGTAATTCGATGGAATTATTGGGGTACTGCCAAGTTTGCTGTGTTCTTTAATTATGGTGGAACCACCATTGATGGCACTTTCTCCATAATTGTTTGTTGTGCAGTGGAGAAGCATTACCAAATCCTGACGAATTCAATGCTCTGAGGTAAAGAATCAACCAGTTCTGATCTAAAACTTACTCTGTAAATGGTACATTTACCATTAATGCTCAATCACAGGCTACAGCCGTCCGCTATGGCTTTTCCCTCGCTCCCGCTTTCATCTCCAGCTTCGATTATGGTTCTCTTCTACTCCGTCCTTTTCTGGATGTTCATGTTAGGTAGCATTACCGACGGCAATGGCGGTCCGGTGGCCGGAGCGACTCGATCGAGACCTCTGCGGTTTGCCACTGAGTTTGATTCTGATTCCACCAGGAGAGTTCTCCTCAATAATGGCCTCGCTCTAACTCCTCCGATGGGGTAAGTCTTACGCTTCGATCATGAGTTTACTCTCTTTCTCACGAGGATCGATCAGATTGAATGTAATGTAGTCTCTCCTCAATAATCGATCGATCATGATTTTTCTAATGTTTTGCTTCCGGTTTGTTTTTTTTTTTTTTTTTTGATAAGCACAGATGGAATAGCTGGAACCATTTTCAATGCAATTTAAACGAGAATTTAATCAAGGAAACAGGTGATATTTTGATTTGAAGGCGTCTTTAATTTCTGTTTGGATTTTGTGTACAGTGACTGAGATACACATCCTATTTCTTTTGCTAAGCGGATGCAATGGTATCCAGCGCCCTTGCTGCGTTAGGATATGAGTATATCAATTTAGGTGATCATTATATTTCTTTCTTTTTACATTTCTTTGGAATTTTACCCTTTCTCTTTTTGATATTAATATGATCTTATTTCTATTTGAGTTAGATGATTGTTGGGCTGAACTGAATAGAGATTCTAAGGTAAATTTCAATGTCCAATTTCTTTTCTCGGGTGTCTATGAAAATCTCAATATTTGCATTATATTATGAAGTTTATTGTTGAAGATTCCATCGATTTTTTCAGGGTAATTTGGTTGCTAAAGCTTCAACATTCCCTTCTGGTATTAAGGCCCTAGCAGATTATGTTCATAGAAAAGGGCTCAAGCTCGGAATTTACTCTGATGCTGGGTAAATCTATACACTCATCACATCAGCTTAAACAACTTCCTCGTTCATTTTCATTGTCATAGTTCATCGAGATGATGCTGATAGTTTTATTTGCTGATTTTCTCCCTACTTTTGGCTTGCAAATTCCATTATCAACCCTGTTTCACTAAGATCTTCTTTACCCAAGTTAATATAATTTCTTCAGGATCCAGACTTGCAGTAAAAAGATGCCAGGTTCCCTCGGTCACGAAGAACAAGATGCAAAAACTTTTGCCTCATGGGTATTTTTTCTGCTTTACACTCTTAAATGTTAACTCCTTTATTTATTGTTTAGATCGCTATAAATTATCAGGAATAGTTGATATATAATACAATTGTGTTAAGAGAACTCATTAAGTGCAACCATATATTTCTGTTACTAACCATAGTTGAGTTAGTTGTACATATGAGAGTTTATAGTTGAGTCTGTTATTTGATTAAATATCATTTTCTTTATAAATGATTATGTACTCTCTTGATTAATGAATGAGAAAGTACAACAACCATTCATTATAGTTGAGTCTGTTGTTTGATTAAATATCTTTTAATATTTTGTTAAATTGTATAGGGGATCGATTACTTGAAGTATGATAACTGTGAAAATACTGGTACAAGCCCCAAAGAAAGGTAGGATCAACTTTCTTTGTTTAATTGTCAAAATTTTCAAGTAACTGCAACCATCCGTAGGACACAGATACCCGAAGATGACTAAAGCTTTACAACAATCTGGGAGACCCATACTTTTTTCTTTATGCGAATGGTTAGCAATTCTTAGAAGACAGACTTGCTCTATTTTACTCTAATGTTTCTCCATTCTATAGTA
CAAACAGAGTTCATTTCTTAAGTTTTTCTTCATTTAGGGGACAAGAAGATCCAGCAACTTGGGCAGTAAATGTAGGAAATAGTTGGAGAACGACATCAGATATTCAAGATAACTGGAATAGGTAGTGAGCATAAATTTTGTCAACATTTCAGGAAACATGCAACATTTTCAAAAAATTTCTCTACACTATTTGATTAGTTTCTTTTCCCAGTATCTATGAATATGCCTGCATTGAAGTATATGATATTTCATGTCCAAAGTACTTTCTTTAGCAAGAATCTGATGCAGCTACTTAATGATACAGCATGACAACCATTGCTGATCAAAATGACAAATGGGCATCTTATGCTAAACCTGGAGGATGGAATGGTAATTTCATAATTATAGTTTGATATGAAAATTAAACACGTACCCTTGTGTATAGAGAAATATAGGCTGGTTAAGCTTGACGGTTGGCCTTGGCCCCTTTTCATATCTCTCAGAACTAATTAAAGATCGATAATGATTGAAGAAGCGAATCCAATGCTGATTCTGAACAGAGGACTAGACGATCTCTACTTCTGCCATTAGCGACCAACTGTTTTAACTCAACCTTAAACAATCTTCTATTGAAAGGAAAAGTAAAATATAAACTTCTGAGTATTCATAGTTATTATCAAATAGCTTCTTTTTATGATTATGATTAAATAGCTTTTTCTATGCATTCAAAATTCTTTCCCCATGGCCAATCTCTATGGAAATTGTACAACAATCTCAGAAATGTTAAAGAAAATAAGCTTAGTTTAAATTTTCATTTGTTCAGATCCTGACATGCTTGAAGTTGGGAATGGAGGGATGACAACAGCAGAATACCGTTCTCATTTTAGCATTTGGGCACTGGCAAAAGTAAGTTCCATTAAAACTTTTCCAGAATCTACTTTTTAGTTTAGATTGGTAATTTATCGGATCGTATTAAGACATATTCGGTATTCAAGTTCATTAAACGACTTAGCCATTTCAAATAACTATATTAAGATAGGGTTGAATATCTTTGCTTGATTTGTGGGTTGAGTATATAATTTTGCGTGCTTCATAAACTACCTTATCAATTTATTATCGATTCCTTTCTGTTCTTCAAATGTGAAGGCTCCATTGCTGATTGGATGTGACATCAGATCAATGGACAACATCACCTTGAAATTGTTAAGTAATAAGGAGGTTATTGCAGTTAATCAGGGTAAGTAAAAGCCAAGCTGTTTCTGCTAATTCATAGAGAACTATGAAATAGATAACATAGAAATAATTCTCTCAATAAAATGGAAGATTAATCATTAACCAAATATGACATTATGCTCCATCTAAAAGTTTTTTATCCAAGTAGATAAGCTTGGAGTTCAAGGGAAGAAGGTCTATAAATTTGGAGATCTTGAGGTAATTTTTCATCATCTAACACATCCCAAAAAAAAAAACAAAAGGATAGAAGAAAAAAAAAAGGAAATTTTTTACTTTACTCTGAAATTTTCTACACAAAAAAAAAAAAACAATGCAGGTTTGGGCAGGGCCACTGAGTGGTAAAAGAGTTGCTGTTGTTCTATGGAACAGAGGCTTCTCGAGAGTCAATATCACTGCATCTTGGTCAGCTATAGGTTTGAGTCCATCAACTACTGTTACTGCTAGAGACTTATGGGAGGTAAGATATGAAGCGTGTAATTAAAAGTACTTTTCAAAGTTTTCATGACATTAGGGGGTTGCTGGTGAAAAAAAAGTGTTTTTCAAAAATGCATTTTTTTAAAACATGTTTAATAAAAACAGTTTAAAATTAGAAAACTCAATTGAGATAATCATTTTAGTTTTTGAAAATTAAACCTAGAAATACTACTCCTATTTAAGGATTTGAGTGTCTTGTTATCTACTTTTTACCTATATTTTCAAAAACCAAGCTATGTTTTGAAAACTAAATAAAGTAGTTTTTAAAAACTTGTTTTTGTTTTTAGAATTTAGCTAAGAATCCGAATGTTTTC
CTTAAGGGACATGAAAACCATAGTAGAGAAGTTGGGAGAAAACAAGCTTAATTTTCAAAAATCAAATAGTTATCAAATGGAGTCTCATATCTTTGGTTACATTTTCTAAAATGTGTTTTTATACACAAGTTTAATTGGTTTCTCTACCATTAAAATACTTTTATTAATGCCAAATTACCAAAAATAACGACTACTAAAGATTATGAACTAATAAGTTCAAATTTGTATTTAAAATACTATGATCAGCCGACTGCAATTAGAGGAGTTGTCGGAAATGGTTGCCGGTGGTGGTCACCGGAGATTATCGCTGGTGATGGGTAGCAATATTGTGAGGATATTTTGGTCACTTCCATATATCATGAAAATTTGGGAGAGATTCTAAATTAAACACTTACAAATCCTTTTTCAAAAAGCAGAGTGTTAAGCCGAAAATAACTGATTTCTCACTCATTTGTTAAAAAATCATTTAGAATAGTGGTTATATCCTACCATTTTTATTAAAATGACTTTTTTTTTTTTGAAAAATCAAACCAATGGACCTTAGATTACAAGGTAAGAAGTACTTTTAAAAACTCTAATAATTAATAGTTGAAATGAATTATTATGAAATAAGCACATTATTAGTACTTTCTTAAAAAAATCGTTTCCCTAAAAAGTACTATTCTCAAAAGTTCTTCTAGACTCACTCGAAGAAATGTGCATTATTTCTATGTATGTACATTGCAAAAATTTTCTGACTGATTATTCATTCCTACAATAGCACTCCAGTCAGATGGTTCAAAATCAACTTACTGCTCAAGTAGACTCTCATGATTGTAAGATGTATGTTCTCACACCCCATTAAAGCCCAATGTGAAAAATTGCCAATGTTCTTGTGAATGATTATTGTCTTCACTTCATCTACTTGTCCTTCCACTTTATGCTGTGATTTCTGGGATTTTACATGTTTATTTGTATAAGAAGTTGCCTTTCTTTCTTCTCAAGGTGGATGAACAAATGCAACCAGATTCTTGGGTTTCTTGTTTTTCCTTTTTCTTTTTGTCTAATTTATTAGGAACTTTATAGTTGTTCAATGTTGAGAATAACATACTTTATATGAATAATTTCACACTTTTTTCTCACTAATGTATTTCCCTTTATACACACCACTTTCATTAATCCTTTTCTAATC

mRNA sequence

TTATTCGGGGTAATTCGATGGAATTATTGGGGTACTGCCAAGTTTGCTGTGTTCTTTAATTATGGTGGAACCACCATTGATGGCACTTTCTCCATAATTGTTTGTTGTGCAGTGGAGAAGCATTACCAAATCCTGACGAATTCAATGCTCTGAGGTAAAGAATCAACCAGTTCTGATCTAAAACTTACTCTGTAAATGGTACATTTACCATTAATGCTCAATCACAGGCTACAGCCGTCCGCTATGGCTTTTCCCTCGCTCCCGCTTTCATCTCCAGCTTCGATTATGGTTCTCTTCTACTCCGTCCTTTTCTGGATGTTCATGTTAGGTAGCATTACCGACGGCAATGGCGGTCCGGTGGCCGGAGCGACTCGATCGAGACCTCTGCGGTTTGCCACTGAGTTTGATTCTGATTCCACCAGGAGAGTTCTCCTCAATAATGGCCTCGCTCTAACTCCTCCGATGGGATGGAATAGCTGGAACCATTTTCAATGCAATTTAAACGAGAATTTAATCAAGGAAACAGCGGATGCAATGGTATCCAGCGCCCTTGCTGCGTTAGGATATGAGTATATCAATTTAGATGATTGTTGGGCTGAACTGAATAGAGATTCTAAGGGTAATTTGGTTGCTAAAGCTTCAACATTCCCTTCTGGTATTAAGGCCCTAGCAGATTATGTTCATAGAAAAGGGCTCAAGCTCGGAATTTACTCTGATGCTGGGATCCAGACTTGCAGTAAAAAGATGCCAGGTTCCCTCGGTCACGAAGAACAAGATGCAAAAACTTTTGCCTCATGGGGGATCGATTACTTGAAGTATGATAACTGTGAAAATACTGGTACAAGCCCCAAAGAAAGATACCCGAAGATGACTAAAGCTTTACAACAATCTGGGAGACCCATACTTTTTTCTTTATGCGAATGGGGACAAGAAGATCCAGCAACTTGGGCAGTAAATGTAGGAAATAGTTGGAGAACGACATCAGATATTCAAGATAACTGGAATAGCATGACAACCATTGCTGATCAAAATGACAAATGGGCATCTTATGCTAAACCTGGAGGATGGAATGATCCTGACATGCTTGAAGTTGGGAATGGAGGGATGACAACAGCAGAATACCGTTCTCATTTTAGCATTTGGGCACTGGCAAAAGCTCCATTGCTGATTGGATGTGACATCAGATCAATGGACAACATCACCTTGAAATTGTTAAGTAATAAGGAGGTTATTGCAGTTAATCAGGATAAGCTTGGAGTTCAAGGGAAGAAGGTCTATAAATTTGGAGATCTTGAGGTTTGGGCAGGGCCACTGAGTGGTAAAAGAGTTGCTGTTGTTCTATGGAACAGAGGCTTCTCGAGAGTCAATATCACTGCATCTTGGTCAGCTATAGGTTTGAGTCCATCAACTACTGTTACTGCTAGAGACTTATGGGAGGTAAGATATGAAGCGTGTAATTAAAAGTACTTTTCAAAGTTTTCATGACATTAGGGGGTTGCTGGTGAAAAAAAAGTGTTTTTCAAAAATGCATTTTTTTAAAACATGTTTAATAAAAACAGTTTAAAATTAGAAAACTCAATTGAGATAATCATTTTAGTTTTTGAAAATTAAACCTAGAAATACTACTCCTATTTAAGGATTTGAGTGTCTTGTTATCTACTTTTTACCTATATTTTCAAAAACCAAGCTATGTTTTGAAAACTAAATAAAGTAGTTTTTAAAAACTTGTTTTTGTTTTTAGAATTTAGCTAAGAATCCGAATGTTTTCCTTAAGGGACATGAAAACCATAGTAGAGAAGTTGGGAGAAAACAAGCTTAATTTTCAAAAATCAAATAGTTATCAAATGGAGTCTCATATCTTTGGTTACATTTTCTAAAATGTGTTTTTATACACAAGTTTAATTGGTTTCTCTACCATTAAAATACTTTTATTAATGCCAAATTACCAAAAATAACGACTACTAAAGATTATGAACTAATAAGTTCAAATTTGTATTTAA
AATACTATGATCAGCCGACTGCAATTAGAGGAGTTGTCGGAAATGGTTGCCGGTGGTGGTCACCGGAGATTATCGCTGGTGATGGGTAGCAATATTGTGAGGATATTTTGGTCACTTCCATATATCATGAAAATTTGGGAGAGATTCTAAATTAAACACTTACAAATCCTTTTTCAAAAAGCAGAGTGTTAAGCCGAAAATAACTGATTTCTCACTCATTTGTTAAAAAATCATTTAGAATAGTGGTTATATCCTACCATTTTTATTAAAATGACTTTTTTTTTTTTGAAAAATCAAACCAATGGACCTTAGATTACAAGGTAAGAAGTACTTTTAAAAACTCTAATAATTAATAGTTGAAATGAATTATTATGAAATAAGCACATTATTAGTACTTTCTTAAAAAAATCGTTTCCCTAAAAAGTACTATTCTCAAAAGTTCTTCTAGACTCACTCGAAGAAATGTGCATTATTTCTATGTATGTACATTGCAAAAATTTTCTGACTGATTATTCATTCCTACAATAGCACTCCAGTCAGATGGTTCAAAATCAACTTACTGCTCAAGTAGACTCTCATGATTGTAAGATGTATGTTCTCACACCCCATTAAAGCCCAATGTGAAAAATTGCCAATGTTCTTGTGAATGATTATTGTCTTCACTTCATCTACTTGTCCTTCCACTTTATGCTGTGATTTCTGGGATTTTACATGTTTATTTGTATAAGAAGTTGCCTTTCTTTCTTCTCAAGGTGGATGAACAAATGCAACCAGATTCTTGGGTTTCTTGTTTTTCCTTTTTCTTTTTGTCTAATTTATTAGGAACTTTATAGTTGTTCAATGTTGAGAATAACATACTTTATATGAATAATTTCACACTTTTTTCTCACTAATGTATTTCCCTTTATACACACCACTTTCATTAATCCTTTTCTAATC

Coding sequence (CDS)

ATGGTACATTTACCATTAATGCTCAATCACAGGCTACAGCCGTCCGCTATGGCTTTTCCCTCGCTCCCGCTTTCATCTCCAGCTTCGATTATGGTTCTCTTCTACTCCGTCCTTTTCTGGATGTTCATGTTAGGTAGCATTACCGACGGCAATGGCGGTCCGGTGGCCGGAGCGACTCGATCGAGACCTCTGCGGTTTGCCACTGAGTTTGATTCTGATTCCACCAGGAGAGTTCTCCTCAATAATGGCCTCGCTCTAACTCCTCCGATGGGATGGAATAGCTGGAACCATTTTCAATGCAATTTAAACGAGAATTTAATCAAGGAAACAGCGGATGCAATGGTATCCAGCGCCCTTGCTGCGTTAGGATATGAGTATATCAATTTAGATGATTGTTGGGCTGAACTGAATAGAGATTCTAAGGGTAATTTGGTTGCTAAAGCTTCAACATTCCCTTCTGGTATTAAGGCCCTAGCAGATTATGTTCATAGAAAAGGGCTCAAGCTCGGAATTTACTCTGATGCTGGGATCCAGACTTGCAGTAAAAAGATGCCAGGTTCCCTCGGTCACGAAGAACAAGATGCAAAAACTTTTGCCTCATGGGGGATCGATTACTTGAAGTATGATAACTGTGAAAATACTGGTACAAGCCCCAAAGAAAGATACCCGAAGATGACTAAAGCTTTACAACAATCTGGGAGACCCATACTTTTTTCTTTATGCGAATGGGGACAAGAAGATCCAGCAACTTGGGCAGTAAATGTAGGAAATAGTTGGAGAACGACATCAGATATTCAAGATAACTGGAATAGCATGACAACCATTGCTGATCAAAATGACAAATGGGCATCTTATGCTAAACCTGGAGGATGGAATGATCCTGACATGCTTGAAGTTGGGAATGGAGGGATGACAACAGCAGAATACCGTTCTCATTTTAGCATTTGGGCACTGGCAAAAGCTCCATTGCTGATTGGATGTGACATCAGATCAATGGACAACATCACCTTGAAATTGTTAAGTAATAAGGAGGTTATTGCAGTTAATCAGGATAAGCTTGGAGTTCAAGGGAAGAAGGTCTATAAATTTGGAGATCTTGAGGTTTGGGCAGGGCCACTGAGTGGTAAAAGAGTTGCTGTTGTTCTATGGAACAGAGGCTTCTCGAGAGTCAATATCACTGCATCTTGGTCAGCTATAGGTTTGAGTCCATCAACTACTGTTACTGCTAGAGACTTATGGGAGGTAAGATATGAAGCGTGTAATTAA

Protein sequence

MVHLPLMLNHRLQPSAMAFPSLPLSSPASIMVLFYSVLFWMFMLGSITDGNGGPVAGATRSRPLRFATEFDSDSTRRVLLNNGLALTPPMGWNSWNHFQCNLNENLIKETADAMVSSALAALGYEYINLDDCWAELNRDSKGNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAKTFASWGIDYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVGNSWRTTSDIQDNWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALAKAPLLIGCDIRSMDNITLKLLSNKEVIAVNQDKLGVQGKKVYKFGDLEVWAGPLSGKRVAVVLWNRGFSRVNITASWSAIGLSPSTTVTARDLWEVRYEACN
Homology
BLAST of Tan0010655.4 vs. ExPASy Swiss-Prot
Match: Q8RX86 (Alpha-galactosidase 2 OS=Arabidopsis thaliana OX=3702 GN=AGAL2 PE=1 SV=1)

HSP 1 Score: 602.8 bits (1553), Expect = 2.9e-171
Identity = 277/344 (80.52%), Positives = 305/344 (88.66%), Query Frame = 0

Query: 77  RVLLNNGLALTPPMGWNSWNHFQCNLNENLIKETADAMVSSALAALGYEYINLDDCWAEL 136
           R+L+NNGLAL+P MGWNSWNHFQCN+NE LIK+TADAMVSS L+A+GY+YIN+DDCW EL
Sbjct: 29  RMLMNNGLALSPQMGWNSWNHFQCNINETLIKQTADAMVSSGLSAIGYKYINIDDCWGEL 88

Query: 137 NRDSKGNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAK 196
            RDS+G+LVAKASTFPSGIKAL+DYVH KGLKLGIYSDAG  TCS+ MPGSLGHEEQDAK
Sbjct: 89  KRDSQGSLVAKASTFPSGIKALSDYVHSKGLKLGIYSDAGTLTCSQTMPGSLGHEEQDAK 148

Query: 197 TFASWGIDYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVG 256
           TFASWGIDYLKYDNCENTGTSP+ERYPKM+KAL  SGR I FSLCEWGQEDPATWA ++G
Sbjct: 149 TFASWGIDYLKYDNCENTGTSPRERYPKMSKALLNSGRSIFFSLCEWGQEDPATWAGDIG 208

Query: 257 NSWRTTSDIQDNWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSHFSIW 316
           NSWRTT DIQDNW SMT IADQND+WASYA+PG WNDPDMLEVGNGGMT  EY SHFSIW
Sbjct: 209 NSWRTTGDIQDNWKSMTLIADQNDRWASYARPGSWNDPDMLEVGNGGMTKEEYMSHFSIW 268

Query: 317 ALAKAPLLIGCDIRSMDNITLKLLSNKEVIAVNQDKLGVQGKKVYKFGDLEVWAGPLSGK 376
           ALAKAPLLIGCD+RSMD +T +LLSNKEVIAVNQDKLG+QGKKV K GDLEVWAGPLS K
Sbjct: 269 ALAKAPLLIGCDLRSMDKVTFELLSNKEVIAVNQDKLGIQGKKVKKEGDLEVWAGPLSKK 328

Query: 377 RVAVVLWNRGFSRVNITASWSAIGLSPSTTVTARDLWEVRYEAC 421
           RVAV+LWNRG +  NITA W+ IGL+ S  V ARDLWE    +C
Sbjct: 329 RVAVILWNRGSASANITARWAEIGLNSSDIVNARDLWEHSTYSC 372
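
The percentages on each HSP summary line are simply the match counts divided by the alignment length. As a hedged sketch, the snippet below parses a summary line in the format used on this page and recomputes the percentages; the regular expression and function name are assumptions made for illustration, not part of any BLAST tool.

```python
import re

# Matches fields of the form "Label = matches/length (pct%)".
HSP_FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def check_hsp_line(line: str) -> dict:
    """Map each label to (matches, length, reported_pct, recomputed_pct)."""
    fields = {}
    for label, matches, length, reported in HSP_FIELD.findall(line):
        n, d = int(matches), int(length)
        fields[label] = (n, d, float(reported), round(100 * n / d, 2))
    return fields


# Summary line of the first Swiss-Prot hit (Q8RX86) above.
line = "Identity = 277/344 (80.52%), Positives = 305/344 (88.66%), Query Frame = 0"
stats = check_hsp_line(line)

for label, (n, d, reported, recomputed) in stats.items():
    print(f"{label}: {n}/{d} reported {reported}% recomputed {recomputed}%")
```

The same check can be run on any other HSP summary line in this report; the "Query Frame" field is skipped because it has no fraction.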

BLAST of Tan0010655.4 vs. ExPASy Swiss-Prot
Match: Q42656 (Alpha-galactosidase OS=Coffea arabica OX=13443 PE=1 SV=1)

HSP 1 Score: 593.6 bits (1529), Expect = 1.8e-168
Identity = 278/341 (81.52%), Positives = 293/341 (85.92%), Query Frame = 0

Query: 73  DSTRRVLLNNGLALTPPMGWNSWNHFQCNLNENLIKETADAMVSSALAALGYEYINLDDC 132
           D TRR LL NGL LTPPMGWNSWNHF+CNL+E LI+ETADAMVS  LAALGY+YINLDDC
Sbjct: 9   DYTRRSLLANGLGLTPPMGWNSWNHFRCNLDEKLIRETADAMVSKGLAALGYKYINLDDC 68

Query: 133 WAELNRDSKGNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEE 192
           WAELNRDS+GNLV K STFPSGIKALADYVH KGLKLGIYSDAG QTCSK MPGSLGHEE
Sbjct: 69  WAELNRDSQGNLVPKGSTFPSGIKALADYVHSKGLKLGIYSDAGTQTCSKTMPGSLGHEE 128

Query: 193 QDAKTFASWGIDYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWA 252
           QDAKTFASWG+DYLKYDNC N   SPKERYP M+KAL  SGR I FSLCEWG+EDPATWA
Sbjct: 129 QDAKTFASWGVDYLKYDNCNNNNISPKERYPIMSKALLNSGRSIFFSLCEWGEEDPATWA 188

Query: 253 VNVGNSWRTTSDIQDNWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSH 312
             VGNSWRTT DI D+W+SMT+ AD NDKWASYA PGGWNDPDMLEVGNGGMTT EYRSH
Sbjct: 189 KEVGNSWRTTGDIDDSWSSMTSRADMNDKWASYAGPGGWNDPDMLEVGNGGMTTTEYRSH 248

Query: 313 FSIWALAKAPLLIGCDIRSMDNITLKLLSNKEVIAVNQDKLGVQGKKVYKFGDLEVWAGP 372
           FSIWALAKAPLLIGCDIRSMD  T +LLSN EVIAVNQDKLGVQG KV  +GDLEVWAGP
Sbjct: 249 FSIWALAKAPLLIGCDIRSMDGATFQLLSNAEVIAVNQDKLGVQGNKVKTYGDLEVWAGP 308

Query: 373 LSGKRVAVVLWNRGFSRVNITASWSAIGLSPSTTVTARDLW 414
           LSGKRVAV LWNRG S   ITA WS +GL  +  V ARDLW
Sbjct: 309 LSGKRVAVALWNRGSSTATITAYWSDVGLPSTAVVNARDLW 349

BLAST of Tan0010655.4 vs. ExPASy Swiss-Prot
Match: P14749 (Alpha-galactosidase OS=Cyamopsis tetragonoloba OX=3832 PE=1 SV=1)

HSP 1 Score: 588.2 bits (1515), Expect = 7.4e-167
Identity = 271/362 (74.86%), Positives = 301/362 (83.15%), Query Frame = 0

Query: 53  GPVAGATRSRPLRFATEFDSDSTRRVLLNNGLALTPPMGWNSWNHFQCNLNENLIKETAD 112
           G   G    +  R + E +  + RR L  NGL  TPPMGWNSWNHF C++NEN+++ETAD
Sbjct: 21  GSEGGRLLEKKNRTSAEAEHYNVRRYLAENGLGQTPPMGWNSWNHFGCDINENVVRETAD 80

Query: 113 AMVSSALAALGYEYINLDDCWAELNRDSKGNLVAKASTFPSGIKALADYVHRKGLKLGIY 172
           AMVS+ LAALGY+YINLDDCWAELNRDS+GN+V  A+ FPSGIKALADYVH KGLKLG+Y
Sbjct: 81  AMVSTGLAALGYQYINLDDCWAELNRDSEGNMVPNAAAFPSGIKALADYVHSKGLKLGVY 140

Query: 173 SDAGIQTCSKKMPGSLGHEEQDAKTFASWGIDYLKYDNCENTGTSPKERYPKMTKALQQS 232
           SDAG QTCSK+MPGSLGHEEQDAKTFASWG+DYLKYDNCEN G S KERYP M KAL  S
Sbjct: 141 SDAGNQTCSKRMPGSLGHEEQDAKTFASWGVDYLKYDNCENLGISVKERYPPMGKALLSS 200

Query: 233 GRPILFSLCEWGQEDPATWAVNVGNSWRTTSDIQDNWNSMTTIADQNDKWASYAKPGGWN 292
           GRPI FS+CEWG EDP  WA ++GNSWRTT DI+DNWNSMT+IAD NDKWASYA PGGWN
Sbjct: 201 GRPIFFSMCEWGWEDPQIWAKSIGNSWRTTGDIEDNWNSMTSIADSNDKWASYAGPGGWN 260

Query: 293 DPDMLEVGNGGMTTAEYRSHFSIWALAKAPLLIGCDIRSMDNITLKLLSNKEVIAVNQDK 352
           DPDMLEVGNGGMTT EYRSHFSIWALAKAPLL+GCDIR+MD+ T +L+SN EVIAVNQDK
Sbjct: 261 DPDMLEVGNGGMTTEEYRSHFSIWALAKAPLLVGCDIRAMDDTTHELISNAEVIAVNQDK 320

Query: 353 LGVQGKKVYKFGDLEVWAGPLSGKRVAVVLWNRGFSRVNITASWSAIGLSPSTTVTARDL 412
           LGVQGKKV    DLEVWAGPLS  +VAV+LWNR  SR  +TASWS IGL   TTV ARDL
Sbjct: 321 LGVQGKKVKSTNDLEVWAGPLSDNKVAVILWNRSSSRATVTASWSDIGLQQGTTVDARDL 380

Query: 413 WE 415
           WE
Sbjct: 381 WE 382

BLAST of Tan0010655.4 vs. ExPASy Swiss-Prot
Match: Q9FXT4 (Alpha-galactosidase OS=Oryza sativa subsp. japonica OX=39947 GN=Os10g0493600 PE=1 SV=1)

HSP 1 Score: 525.0 bits (1351), Expect = 7.7e-148
Identity = 242/342 (70.76%), Positives = 272/342 (79.53%), Query Frame = 0

Query: 72  SDSTRRVLLNNGLALTPPMGWNSWNHFQCNLNENLIKETADAMVSSALAALGYEYINLDD 131
           S   RR    NGL  TP MGWNSWNHF C +NE +I+ETADA+V++ LA LGY+Y+N+DD
Sbjct: 48  SRRARRRAFENGLGRTPQMGWNSWNHFYCGINEQIIRETADALVNTGLAKLGYQYVNIDD 107

Query: 132 CWAELNRDSKGNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHE 191
           CWAE +RDS+GN V    TFPSGIKALADYVH KGLKLGIYSDAG QTCS KMPGSL HE
Sbjct: 108 CWAEYSRDSQGNFVPNRQTFPSGIKALADYVHAKGLKLGIYSDAGSQTCSNKMPGSLDHE 167

Query: 192 EQDAKTFASWGIDYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATW 251
           EQD KTFASWG+DYLKYDNC + G S  ERY +M+ A++  G+ I FSLCEWG+E+PATW
Sbjct: 168 EQDVKTFASWGVDYLKYDNCNDAGRSVMERYTRMSNAMKTYGKNIFFSLCEWGKENPATW 227

Query: 252 AVNVGNSWRTTSDIQDNWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRS 311
           A  +GNSWRTT DI DNW SMT+ AD+ND+WA+YA PGGWNDPDMLEVGNGGM+ AEYRS
Sbjct: 228 AGRMGNSWRTTGDIADNWGSMTSRADENDQWAAYAGPGGWNDPDMLEVGNGGMSEAEYRS 287

Query: 312 HFSIWALAKAPLLIGCDIRSMDNITLKLLSNKEVIAVNQDKLGVQGKKVYKFGDLEVWAG 371
           HFSIWALAKAPLLIGCD+RSM   T  +LSN EVIAVNQD LGVQGKKV     LEVWAG
Sbjct: 288 HFSIWALAKAPLLIGCDVRSMSQQTKNILSNSEVIAVNQDSLGVQGKKVQSDNGLEVWAG 347

Query: 372 PLSGKRVAVVLWNRGFSRVNITASWSAIGLSPSTTVTARDLW 414
           PLS  R AVVLWNR   +  ITA WS IGL+ S  VTARDLW
Sbjct: 348 PLSNNRKAVVLWNRQSYQATITAHWSNIGLAGSVAVTARDLW 389

BLAST of Tan0010655.4 vs. ExPASy Swiss-Prot
Match: Q9FT97 (Alpha-galactosidase 1 OS=Arabidopsis thaliana OX=3702 GN=AGAL1 PE=2 SV=1)

HSP 1 Score: 513.8 bits (1322), Expect = 1.8e-144
Identity = 236/344 (68.60%), Positives = 279/344 (81.10%), Query Frame = 0

Query: 71  DSDSTRRVLLNNGLALTPPMGWNSWNHFQCNLNENLIKETADAMVSSALAALGYEYINLD 130
           DS+  RR LL NGL +TPPMGWNSWNHF CN++E +IKETADA+V++ L+ LGY Y+N+D
Sbjct: 37  DSEILRRHLLTNGLGVTPPMGWNSWNHFSCNIDEKMIKETADALVTTGLSKLGYNYVNID 96

Query: 131 DCWAELNRDSKGNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGH 190
           DCWAE++RDSKG+LV K STFPSGIKA+ADYVH KGLKLGIYSDAG  TCSK MPGSLG+
Sbjct: 97  DCWAEISRDSKGSLVPKKSTFPSGIKAVADYVHSKGLKLGIYSDAGYFTCSKTMPGSLGY 156

Query: 191 EEQDAKTFASWGIDYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPAT 250
           EE DAKTFA WGIDYLKYDNC + G+ P  RYP MT+AL +SGRPI  SLCEWG   PA 
Sbjct: 157 EEHDAKTFAEWGIDYLKYDNCNSDGSKPTVRYPVMTRALMKSGRPIFHSLCEWGDMHPAL 216

Query: 251 WAVNVGNSWRTTSDIQDNWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYR 310
           W   VGNSWRTT+DI+D W SM +IAD N+ +A +A+PGGWNDPDMLEVGNGGMT  EY 
Sbjct: 217 WGSPVGNSWRTTNDIKDTWLSMISIADMNEVYAEHARPGGWNDPDMLEVGNGGMTKDEYI 276

Query: 311 SHFSIWALAKAPLLIGCDIRSMDNITLKLLSNKEVIAVNQDKLGVQGKKVYKFGDLEVWA 370
            HFSIWA++KAPLL+GCDIR+M   T+++++NKEVIA+NQD  GVQ KKV   GDLEVWA
Sbjct: 277 VHFSIWAISKAPLLLGCDIRNMTKETMEIVANKEVIAINQDPHGVQAKKVRMEGDLEVWA 336

Query: 371 GPLSGKRVAVVLWNRGFSRVNITASWSAIGLSPSTTVTARDLWE 415
           GPLSG RVA++L NRG SR +ITA W  I +  ++ V ARDLWE
Sbjct: 337 GPLSGYRVALLLLNRGPSRTSITALWEDIEIPANSIVEARDLWE 380

BLAST of Tan0010655.4 vs. NCBI nr
Match: KAG6573784.1 (Alpha-galactosidase 2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 763.5 bits (1970), Expect = 9.9e-217
Identity = 368/404 (91.09%), Positives = 380/404 (94.06%), Query Frame = 0

Query: 11  RLQPSAMAFPSLPLSSPASIMVLFYSVLFWMFMLGSITDGNGGPVAGATRSRPLRFATEF 70
           RLQPSAM      +SSPASIM+  YSVLFW F+  S  +GNGG +AGATRSRPLRFA EF
Sbjct: 268 RLQPSAM------VSSPASIMLFLYSVLFWTFLFSS--NGNGGAMAGATRSRPLRFAAEF 327

Query: 71  DSDSTRRVLLNNGLALTPPMGWNSWNHFQCNLNENLIKETADAMVSSALAALGYEYINLD 130
           D DS RRVLLNNGLALTPPMGWNSWNHFQCN+NENLIKETADAMVSS LAALGY+YINLD
Sbjct: 328 DYDSPRRVLLNNGLALTPPMGWNSWNHFQCNINENLIKETADAMVSSGLAALGYQYINLD 387

Query: 131 DCWAELNRDSKGNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGH 190
           DCWAELNRD+KGNLV K+STFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGH
Sbjct: 388 DCWAELNRDAKGNLVPKSSTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGH 447

Query: 191 EEQDAKTFASWGIDYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPAT 250
           EEQDAKTFASWGIDYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPAT
Sbjct: 448 EEQDAKTFASWGIDYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPAT 507

Query: 251 WAVNVGNSWRTTSDIQDNWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYR 310
           WAVNVGNSWRTTSDIQD+WNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYR
Sbjct: 508 WAVNVGNSWRTTSDIQDDWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYR 567

Query: 311 SHFSIWALAKAPLLIGCDIRSMDNITLKLLSNKEVIAVNQDKLGVQGKKVYKFGDLEVWA 370
           SHFSIWALAKAPLLIGCDIRSMDNITLKLLSNKEVIAVNQDKLGVQGKKV+K GDLEVWA
Sbjct: 568 SHFSIWALAKAPLLIGCDIRSMDNITLKLLSNKEVIAVNQDKLGVQGKKVFKLGDLEVWA 627

Query: 371 GPLSGKRVAVVLWNRGFSRVNITASWSAIGLSPSTTVTARDLWE 415
           G LSGKRVAVVLWNRGF R  ITASWSAIGLSPST+VTARDLWE
Sbjct: 628 GALSGKRVAVVLWNRGFYRAKITASWSAIGLSPSTSVTARDLWE 663

BLAST of Tan0010655.4 vs. NCBI nr
Match: XP_022966963.1 (alpha-galactosidase-like [Cucurbita maxima])

HSP 1 Score: 762.3 bits (1967), Expect = 2.2e-216
Identity = 365/391 (93.35%), Positives = 373/391 (95.40%), Query Frame = 0

Query: 24  LSSPASIMVLFYSVLFWMFMLGSITDGNGGPVAGATRSRPLRFATEFDSDSTRRVLLNNG 83
           +SSPASIM+L YSVLFW F   S  +GNGG +AGATRSRPLRFATEFD DS RRVLLNNG
Sbjct: 2   VSSPASIMLLLYSVLFWTFFFSS--NGNGGAMAGATRSRPLRFATEFDYDSPRRVLLNNG 61

Query: 84  LALTPPMGWNSWNHFQCNLNENLIKETADAMVSSALAALGYEYINLDDCWAELNRDSKGN 143
           LALTPPMGWNSWNHFQCN+NENLIKETADAMVSS LAALGY+YINLDDCWAELNRD+KGN
Sbjct: 62  LALTPPMGWNSWNHFQCNINENLIKETADAMVSSGLAALGYQYINLDDCWAELNRDAKGN 121

Query: 144 LVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAKTFASWGI 203
           LV K+STFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSL HEEQDAKTFASWGI
Sbjct: 122 LVPKSSTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLDHEEQDAKTFASWGI 181

Query: 204 DYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVGNSWRTTS 263
           DYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVGNSWRTTS
Sbjct: 182 DYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVGNSWRTTS 241

Query: 264 DIQDNWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALAKAPL 323
           DIQDNWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALAKAPL
Sbjct: 242 DIQDNWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALAKAPL 301

Query: 324 LIGCDIRSMDNITLKLLSNKEVIAVNQDKLGVQGKKVYKFGDLEVWAGPLSGKRVAVVLW 383
           LIGCDIRSMDNITLKLLSNKEVIAVNQDKLGVQGKKVYK GDLEVWAG LSGKRVAVVLW
Sbjct: 302 LIGCDIRSMDNITLKLLSNKEVIAVNQDKLGVQGKKVYKLGDLEVWAGTLSGKRVAVVLW 361

Query: 384 NRGFSRVNITASWSAIGLSPSTTVTARDLWE 415
           NRGF R  ITASWSAIGLSPSTTVTARDLWE
Sbjct: 362 NRGFYRAKITASWSAIGLSPSTTVTARDLWE 390

BLAST of Tan0010655.4 vs. NCBI nr
Match: XP_023541567.1 (alpha-galactosidase-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 760.8 bits (1963), Expect = 6.4e-216
Identity = 364/391 (93.09%), Positives = 373/391 (95.40%), Query Frame = 0

Query: 24  LSSPASIMVLFYSVLFWMFMLGSITDGNGGPVAGATRSRPLRFATEFDSDSTRRVLLNNG 83
           +SSPASIM+L YSVLFW F+  S  +GNGG +AGATRSRPLRFA EFD DS RRVLLNNG
Sbjct: 2   VSSPASIMLLLYSVLFWTFLFSS--NGNGGALAGATRSRPLRFAAEFDYDSPRRVLLNNG 61

Query: 84  LALTPPMGWNSWNHFQCNLNENLIKETADAMVSSALAALGYEYINLDDCWAELNRDSKGN 143
           LALTPPMGWNSWNHFQCN+NENLIKETADAMVSS LAALGY+YINLDDCWAELNRD+KGN
Sbjct: 62  LALTPPMGWNSWNHFQCNINENLIKETADAMVSSGLAALGYQYINLDDCWAELNRDAKGN 121

Query: 144 LVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAKTFASWGI 203
           LV K+STFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAKTFASWGI
Sbjct: 122 LVPKSSTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAKTFASWGI 181

Query: 204 DYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVGNSWRTTS 263
           DYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVGNSWRTTS
Sbjct: 182 DYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVGNSWRTTS 241

Query: 264 DIQDNWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALAKAPL 323
           DIQDNWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALAKAPL
Sbjct: 242 DIQDNWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALAKAPL 301

Query: 324 LIGCDIRSMDNITLKLLSNKEVIAVNQDKLGVQGKKVYKFGDLEVWAGPLSGKRVAVVLW 383
           LIGCDIRSMDNITLKLLSNKEVIAVNQDKLGVQGKKVY  GDLEVWAG LSGKRVAVVLW
Sbjct: 302 LIGCDIRSMDNITLKLLSNKEVIAVNQDKLGVQGKKVYILGDLEVWAGALSGKRVAVVLW 361

Query: 384 NRGFSRVNITASWSAIGLSPSTTVTARDLWE 415
           NRGF R  ITASWSAIGLSPSTTVTARDLWE
Sbjct: 362 NRGFYRAKITASWSAIGLSPSTTVTARDLWE 390

BLAST of Tan0010655.4 vs. NCBI nr
Match: XP_022944991.1 (alpha-galactosidase-like [Cucurbita moschata])

HSP 1 Score: 759.2 bits (1959), Expect = 1.9e-215
Identity = 363/391 (92.84%), Positives = 372/391 (95.14%), Query Frame = 0

Query: 24  LSSPASIMVLFYSVLFWMFMLGSITDGNGGPVAGATRSRPLRFATEFDSDSTRRVLLNNG 83
           +SSPASIM+  YSVLFW F+  S  +GNGG +AGATRSRPLRFA EFD DS RRVLLNNG
Sbjct: 2   VSSPASIMLFLYSVLFWTFLFSS--NGNGGAMAGATRSRPLRFAAEFDYDSPRRVLLNNG 61

Query: 84  LALTPPMGWNSWNHFQCNLNENLIKETADAMVSSALAALGYEYINLDDCWAELNRDSKGN 143
           LALTPPMGWNSWNHFQCN+NENLIKETADAMVSS LAALGY+YINLDDCWAELNRD+KGN
Sbjct: 62  LALTPPMGWNSWNHFQCNINENLIKETADAMVSSGLAALGYQYINLDDCWAELNRDAKGN 121

Query: 144 LVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAKTFASWGI 203
           LV K+STFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAKTFASWGI
Sbjct: 122 LVPKSSTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAKTFASWGI 181

Query: 204 DYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVGNSWRTTS 263
           DYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVGNSWRTTS
Sbjct: 182 DYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVGNSWRTTS 241

Query: 264 DIQDNWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALAKAPL 323
           DIQDNWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALAKAPL
Sbjct: 242 DIQDNWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALAKAPL 301

Query: 324 LIGCDIRSMDNITLKLLSNKEVIAVNQDKLGVQGKKVYKFGDLEVWAGPLSGKRVAVVLW 383
           LIGCDIRSMDNITLKLLSNK VIAVNQDKLGVQGKKVYK GDLEVWAG LSGKRVAVVLW
Sbjct: 302 LIGCDIRSMDNITLKLLSNKLVIAVNQDKLGVQGKKVYKLGDLEVWAGALSGKRVAVVLW 361

Query: 384 NRGFSRVNITASWSAIGLSPSTTVTARDLWE 415
           NRGF R  ITASWSAIGLSPSTTVTARDLWE
Sbjct: 362 NRGFYRAKITASWSAIGLSPSTTVTARDLWE 390

BLAST of Tan0010655.4 vs. NCBI nr
Match: XP_038892679.1 (alpha-galactosidase-like isoform X2 [Benincasa hispida])

HSP 1 Score: 746.1 bits (1925), Expect = 1.6e-211
Identity = 361/398 (90.70%), Positives = 370/398 (92.96%), Query Frame = 0

Query: 17  MAFPSLPLSSPASIMVLFYSVLFWMFMLGSITDGNGGPVAGATRSRPLRFATEFDSDSTR 76
           MA P   LSSP S + L  SVLFWMF+LGS    NGG VAGA RSR LRFA  FDS STR
Sbjct: 1   MASPPFLLSSPPSTIFLLCSVLFWMFLLGS----NGGAVAGAIRSRSLRFAAGFDSVSTR 60

Query: 77  RVLLNNGLALTPPMGWNSWNHFQCNLNENLIKETADAMVSSALAALGYEYINLDDCWAEL 136
           RVLLNNGLALTPPMGWNSWNHFQCN+NENLIKETADAMVSS LA+LGYEYINLDDCWAEL
Sbjct: 61  RVLLNNGLALTPPMGWNSWNHFQCNINENLIKETADAMVSSGLASLGYEYINLDDCWAEL 120

Query: 137 NRDSKGNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAK 196
           NRDSKGNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHE+QDA+
Sbjct: 121 NRDSKGNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEQQDAE 180

Query: 197 TFASWGIDYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVG 256
           TFASWGIDYLKYDNCENTGTSPKERYPKM+KALQQSGRPILFSLCEWGQEDPATWA+NVG
Sbjct: 181 TFASWGIDYLKYDNCENTGTSPKERYPKMSKALQQSGRPILFSLCEWGQEDPATWAINVG 240

Query: 257 NSWRTTSDIQDNWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSHFSIW 316
           NSWRTTSDIQDNW SMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTT EYRSHFSIW
Sbjct: 241 NSWRTTSDIQDNWISMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTVEYRSHFSIW 300

Query: 317 ALAKAPLLIGCDIRSMDNITLKLLSNKEVIAVNQDKLGVQGKKVYKFGDLEVWAGPLSGK 376
           ALAKAPLLIGCDIRSMDNI LKLLSNKEVIAVNQDKLGVQGKKVYK+GDLEVWAGPLSGK
Sbjct: 301 ALAKAPLLIGCDIRSMDNIALKLLSNKEVIAVNQDKLGVQGKKVYKYGDLEVWAGPLSGK 360

Query: 377 RVAVVLWNRGFSRVNITASWSAIGLSPSTTVTARDLWE 415
           RVAVVLWNRG  R NITASWS IGLS STTVTARDLWE
Sbjct: 361 RVAVVLWNRGLWRANITASWSDIGLSTSTTVTARDLWE 394

BLAST of Tan0010655.4 vs. ExPASy TrEMBL
Match: A0A6J1HTR9 (Alpha-galactosidase OS=Cucurbita maxima OX=3661 GN=LOC111466515 PE=3 SV=1)

HSP 1 Score: 762.3 bits (1967), Expect = 1.1e-216
Identity = 365/391 (93.35%), Positives = 373/391 (95.40%), Query Frame = 0

Query: 24  LSSPASIMVLFYSVLFWMFMLGSITDGNGGPVAGATRSRPLRFATEFDSDSTRRVLLNNG 83
           +SSPASIM+L YSVLFW F   S  +GNGG +AGATRSRPLRFATEFD DS RRVLLNNG
Sbjct: 2   VSSPASIMLLLYSVLFWTFFFSS--NGNGGAMAGATRSRPLRFATEFDYDSPRRVLLNNG 61

Query: 84  LALTPPMGWNSWNHFQCNLNENLIKETADAMVSSALAALGYEYINLDDCWAELNRDSKGN 143
           LALTPPMGWNSWNHFQCN+NENLIKETADAMVSS LAALGY+YINLDDCWAELNRD+KGN
Sbjct: 62  LALTPPMGWNSWNHFQCNINENLIKETADAMVSSGLAALGYQYINLDDCWAELNRDAKGN 121

Query: 144 LVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAKTFASWGI 203
           LV K+STFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSL HEEQDAKTFASWGI
Sbjct: 122 LVPKSSTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLDHEEQDAKTFASWGI 181

Query: 204 DYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVGNSWRTTS 263
           DYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVGNSWRTTS
Sbjct: 182 DYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVGNSWRTTS 241

Query: 264 DIQDNWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALAKAPL 323
           DIQDNWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALAKAPL
Sbjct: 242 DIQDNWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALAKAPL 301

Query: 324 LIGCDIRSMDNITLKLLSNKEVIAVNQDKLGVQGKKVYKFGDLEVWAGPLSGKRVAVVLW 383
           LIGCDIRSMDNITLKLLSNKEVIAVNQDKLGVQGKKVYK GDLEVWAG LSGKRVAVVLW
Sbjct: 302 LIGCDIRSMDNITLKLLSNKEVIAVNQDKLGVQGKKVYKLGDLEVWAGTLSGKRVAVVLW 361

Query: 384 NRGFSRVNITASWSAIGLSPSTTVTARDLWE 415
           NRGF R  ITASWSAIGLSPSTTVTARDLWE
Sbjct: 362 NRGFYRAKITASWSAIGLSPSTTVTARDLWE 390

BLAST of Tan0010655.4 vs. ExPASy TrEMBL
Match: A0A6J1FZL3 (Alpha-galactosidase OS=Cucurbita moschata OX=3662 GN=LOC111449362 PE=3 SV=1)

HSP 1 Score: 759.2 bits (1959), Expect = 9.0e-216
Identity = 363/391 (92.84%), Positives = 372/391 (95.14%), Query Frame = 0

Query: 24  LSSPASIMVLFYSVLFWMFMLGSITDGNGGPVAGATRSRPLRFATEFDSDSTRRVLLNNG 83
           +SSPASIM+  YSVLFW F+  S  +GNGG +AGATRSRPLRFA EFD DS RRVLLNNG
Sbjct: 2   VSSPASIMLFLYSVLFWTFLFSS--NGNGGAMAGATRSRPLRFAAEFDYDSPRRVLLNNG 61

Query: 84  LALTPPMGWNSWNHFQCNLNENLIKETADAMVSSALAALGYEYINLDDCWAELNRDSKGN 143
           LALTPPMGWNSWNHFQCN+NENLIKETADAMVSS LAALGY+YINLDDCWAELNRD+KGN
Sbjct: 62  LALTPPMGWNSWNHFQCNINENLIKETADAMVSSGLAALGYQYINLDDCWAELNRDAKGN 121

Query: 144 LVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAKTFASWGI 203
           LV K+STFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAKTFASWGI
Sbjct: 122 LVPKSSTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAKTFASWGI 181

Query: 204 DYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVGNSWRTTS 263
           DYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVGNSWRTTS
Sbjct: 182 DYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVGNSWRTTS 241

Query: 264 DIQDNWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALAKAPL 323
           DIQDNWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALAKAPL
Sbjct: 242 DIQDNWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALAKAPL 301

Query: 324 LIGCDIRSMDNITLKLLSNKEVIAVNQDKLGVQGKKVYKFGDLEVWAGPLSGKRVAVVLW 383
           LIGCDIRSMDNITLKLLSNK VIAVNQDKLGVQGKKVYK GDLEVWAG LSGKRVAVVLW
Sbjct: 302 LIGCDIRSMDNITLKLLSNKLVIAVNQDKLGVQGKKVYKLGDLEVWAGALSGKRVAVVLW 361

Query: 384 NRGFSRVNITASWSAIGLSPSTTVTARDLWE 415
           NRGF R  ITASWSAIGLSPSTTVTARDLWE
Sbjct: 362 NRGFYRAKITASWSAIGLSPSTTVTARDLWE 390

BLAST of Tan0010655.4 vs. ExPASy TrEMBL
Match: A0A1S3BEH7 (Alpha-galactosidase OS=Cucumis melo OX=3656 GN=LOC103488794 PE=3 SV=1)

HSP 1 Score: 738.0 bits (1904), Expect = 2.1e-209
Identity = 359/394 (91.12%), Positives = 370/394 (93.91%), Query Frame = 0

Query: 21  SLPLSSPASIMVLFYSVLFWMFMLGSITDGNGGPVAGATRSRPLRFATEFDSDSTRRVLL 80
           +L  SSP S+MVL Y VL W F+LG   +GNG  VAGA+RS  LRFA EFDS S+RRVLL
Sbjct: 2   ALSPSSP-SVMVLLYFVLSWTFLLGG--NGNGVAVAGASRSTALRFAAEFDSVSSRRVLL 61

Query: 81  NNGLALTPPMGWNSWNHFQCNLNENLIKETADAMVSSALAALGYEYINLDDCWAELNRDS 140
           NNGLALTPPMGWNSWNHFQCNLNENLIKETADAMVSS LAALGYEYINLDDCWAEL+RDS
Sbjct: 62  NNGLALTPPMGWNSWNHFQCNLNENLIKETADAMVSSGLAALGYEYINLDDCWAELDRDS 121

Query: 141 KGNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAKTFAS 200
           KGNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLG+EEQDAKTFAS
Sbjct: 122 KGNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGNEEQDAKTFAS 181

Query: 201 WGIDYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVGNSWR 260
           WGIDYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVGNSWR
Sbjct: 182 WGIDYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVGNSWR 241

Query: 261 TTSDIQDNWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALAK 320
           TTSDIQDNW SMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALAK
Sbjct: 242 TTSDIQDNWISMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALAK 301

Query: 321 APLLIGCDIRSMDNITLKLLSNKEVIAVNQDKLGVQGKKVYKFGDLEVWAGPLSGKRVAV 380
           APLLIGCDIRSMDNIT+KLLSNKEVIAVNQDKLGVQGKKVYK+GDLEVWAGPLSGKRVAV
Sbjct: 302 APLLIGCDIRSMDNITMKLLSNKEVIAVNQDKLGVQGKKVYKYGDLEVWAGPLSGKRVAV 361

Query: 381 VLWNRGFSRVNITASWSAIGLSPSTTVTARDLWE 415
           VLWNRG  R NITASWS IGL  STTVTARDLW+
Sbjct: 362 VLWNRGLWRANITASWSDIGLCSSTTVTARDLWQ 392

BLAST of Tan0010655.4 vs. ExPASy TrEMBL
Match: A0A5A7SVZ2 (Alpha-galactosidase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold35G003280 PE=3 SV=1)

HSP 1 Score: 733.4 bits (1892), Expect = 5.3e-208
Identity = 359/395 (90.89%), Positives = 370/395 (93.67%), Query Frame = 0

Query: 21  SLPLSSPASIMVLFYSVLFWMFMLGSITDGNGGPVAGATRSRPLRFATEFDSDSTRRVLL 80
           +L  SSP S+MVL Y VL W F+LG   +GNG  VAGA+RS  LRFA EFDS S+RRVLL
Sbjct: 2   ALSPSSP-SVMVLLYFVLSWTFLLGG--NGNGVAVAGASRSTALRFAAEFDSVSSRRVLL 61

Query: 81  NNGLALTPPMGWNSWNHFQCNLNENLIKETADAMVSSALAALGYEYINLDDCWAELNRDS 140
           NNGLALTPPMGWNSWNHFQCNLNENLIKETADAMVSS LAALGYEYINLDDCWAEL+RDS
Sbjct: 62  NNGLALTPPMGWNSWNHFQCNLNENLIKETADAMVSSGLAALGYEYINLDDCWAELDRDS 121

Query: 141 KGNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAKTFAS 200
           KGNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLG+EEQDAKTFAS
Sbjct: 122 KGNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGNEEQDAKTFAS 181

Query: 201 WGIDYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVGNSWR 260
           WGIDYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVGNSWR
Sbjct: 182 WGIDYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVGNSWR 241

Query: 261 TTSDIQDNWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALAK 320
           TTSDIQDNW SMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALAK
Sbjct: 242 TTSDIQDNWISMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALAK 301

Query: 321 APLLIGCDIRSMDNITLKLLSNKEVIAVNQ-DKLGVQGKKVYKFGDLEVWAGPLSGKRVA 380
           APLLIGCDIRSMDNIT+KLLSNKEVIAVNQ DKLGVQGKKVYK+GDLEVWAGPLSGKRVA
Sbjct: 302 APLLIGCDIRSMDNITMKLLSNKEVIAVNQVDKLGVQGKKVYKYGDLEVWAGPLSGKRVA 361

Query: 381 VVLWNRGFSRVNITASWSAIGLSPSTTVTARDLWE 415
           VVLWNRG  R NITASWS IGL  STTVTARDLW+
Sbjct: 362 VVLWNRGLWRANITASWSDIGLCSSTTVTARDLWQ 393

BLAST of Tan0010655.4 vs. ExPASy TrEMBL
Match: Q2HYY3 (Alpha-galactosidase OS=Cucumis sativus OX=3659 GN=Csa_5G580630 PE=2 SV=1)

HSP 1 Score: 729.6 bits (1882), Expect = 7.6e-207
Identity = 351/394 (89.09%), Positives = 368/394 (93.40%), Query Frame = 0

Query: 21  SLPLSSPASIMVLFYSVLFWMFMLGSITDGNGGPVAGATRSRPLRFATEFDSDSTRRVLL 80
           +LP SSP S+M+L Y +LFW F+LG   +GNG  VA A+RS  LRFATEFDS S+RR+LL
Sbjct: 2   ALPPSSP-SVMLLLYFLLFWTFLLGGNGNGNGVVVAAASRSTALRFATEFDSASSRRILL 61

Query: 81  NNGLALTPPMGWNSWNHFQCNLNENLIKETADAMVSSALAALGYEYINLDDCWAELNRDS 140
           NNGLALTPPMGWNSWNHFQCNLNENLIKETADAMVS+ LAALGY+YINLDDCWAEL+RDS
Sbjct: 62  NNGLALTPPMGWNSWNHFQCNLNENLIKETADAMVSTGLAALGYQYINLDDCWAELDRDS 121

Query: 141 KGNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAKTFAS 200
           KGNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGI+TCSK+MPGSLGHEEQDAKTFAS
Sbjct: 122 KGNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIRTCSKRMPGSLGHEEQDAKTFAS 181

Query: 201 WGIDYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVGNSWR 260
           WGIDYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVGNSWR
Sbjct: 182 WGIDYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVGNSWR 241

Query: 261 TTSDIQDNWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALAK 320
           TTSDIQDNW SMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMT AEYRSHFSIWALAK
Sbjct: 242 TTSDIQDNWISMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTIAEYRSHFSIWALAK 301

Query: 321 APLLIGCDIRSMDNITLKLLSNKEVIAVNQDKLGVQGKKVYKFGDLEVWAGPLSGKRVAV 380
           APLLIGCDIRSMDN T+KLLSNKEVIAVNQDKLGVQGKKV+K+GDLEVWAG LSGKRVAV
Sbjct: 302 APLLIGCDIRSMDNNTMKLLSNKEVIAVNQDKLGVQGKKVHKYGDLEVWAGLLSGKRVAV 361

Query: 381 VLWNRGFSRVNITASWSAIGLSPSTTVTARDLWE 415
           VLWNR   R NITA WS IGLS STTVTARDLWE
Sbjct: 362 VLWNRSLWRANITAYWSDIGLSSSTTVTARDLWE 394

BLAST of Tan0010655.4 vs. TAIR 10
Match: AT5G08370.1 (alpha-galactosidase 2 )

HSP 1 Score: 602.8 bits (1553), Expect = 2.1e-172
Identity = 277/344 (80.52%), Positives = 305/344 (88.66%), Query Frame = 0

Query: 77  RVLLNNGLALTPPMGWNSWNHFQCNLNENLIKETADAMVSSALAALGYEYINLDDCWAEL 136
           R+L+NNGLAL+P MGWNSWNHFQCN+NE LIK+TADAMVSS L+A+GY+YIN+DDCW EL
Sbjct: 29  RMLMNNGLALSPQMGWNSWNHFQCNINETLIKQTADAMVSSGLSAIGYKYINIDDCWGEL 88

Query: 137 NRDSKGNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAK 196
            RDS+G+LVAKASTFPSGIKAL+DYVH KGLKLGIYSDAG  TCS+ MPGSLGHEEQDAK
Sbjct: 89  KRDSQGSLVAKASTFPSGIKALSDYVHSKGLKLGIYSDAGTLTCSQTMPGSLGHEEQDAK 148

Query: 197 TFASWGIDYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVG 256
           TFASWGIDYLKYDNCENTGTSP+ERYPKM+KAL  SGR I FSLCEWGQEDPATWA ++G
Sbjct: 149 TFASWGIDYLKYDNCENTGTSPRERYPKMSKALLNSGRSIFFSLCEWGQEDPATWAGDIG 208

Query: 257 NSWRTTSDIQDNWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSHFSIW 316
           NSWRTT DIQDNW SMT IADQND+WASYA+PG WNDPDMLEVGNGGMT  EY SHFSIW
Sbjct: 209 NSWRTTGDIQDNWKSMTLIADQNDRWASYARPGSWNDPDMLEVGNGGMTKEEYMSHFSIW 268

Query: 317 ALAKAPLLIGCDIRSMDNITLKLLSNKEVIAVNQDKLGVQGKKVYKFGDLEVWAGPLSGK 376
           ALAKAPLLIGCD+RSMD +T +LLSNKEVIAVNQDKLG+QGKKV K GDLEVWAGPLS K
Sbjct: 269 ALAKAPLLIGCDLRSMDKVTFELLSNKEVIAVNQDKLGIQGKKVKKEGDLEVWAGPLSKK 328

Query: 377 RVAVVLWNRGFSRVNITASWSAIGLSPSTTVTARDLWEVRYEAC 421
           RVAV+LWNRG +  NITA W+ IGL+ S  V ARDLWE    +C
Sbjct: 329 RVAVILWNRGSASANITARWAEIGLNSSDIVNARDLWEHSTYSC 372

BLAST of Tan0010655.4 vs. TAIR 10
Match: AT5G08370.2 (alpha-galactosidase 2 )

HSP 1 Score: 602.8 bits (1553), Expect = 2.1e-172
Identity = 277/344 (80.52%), Positives = 305/344 (88.66%), Query Frame = 0

Query: 77  RVLLNNGLALTPPMGWNSWNHFQCNLNENLIKETADAMVSSALAALGYEYINLDDCWAEL 136
           R+L+NNGLAL+P MGWNSWNHFQCN+NE LIK+TADAMVSS L+A+GY+YIN+DDCW EL
Sbjct: 3   RMLMNNGLALSPQMGWNSWNHFQCNINETLIKQTADAMVSSGLSAIGYKYINIDDCWGEL 62

Query: 137 NRDSKGNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAK 196
            RDS+G+LVAKASTFPSGIKAL+DYVH KGLKLGIYSDAG  TCS+ MPGSLGHEEQDAK
Sbjct: 63  KRDSQGSLVAKASTFPSGIKALSDYVHSKGLKLGIYSDAGTLTCSQTMPGSLGHEEQDAK 122

Query: 197 TFASWGIDYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVG 256
           TFASWGIDYLKYDNCENTGTSP+ERYPKM+KAL  SGR I FSLCEWGQEDPATWA ++G
Sbjct: 123 TFASWGIDYLKYDNCENTGTSPRERYPKMSKALLNSGRSIFFSLCEWGQEDPATWAGDIG 182

Query: 257 NSWRTTSDIQDNWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSHFSIW 316
           NSWRTT DIQDNW SMT IADQND+WASYA+PG WNDPDMLEVGNGGMT  EY SHFSIW
Sbjct: 183 NSWRTTGDIQDNWKSMTLIADQNDRWASYARPGSWNDPDMLEVGNGGMTKEEYMSHFSIW 242

Query: 317 ALAKAPLLIGCDIRSMDNITLKLLSNKEVIAVNQDKLGVQGKKVYKFGDLEVWAGPLSGK 376
           ALAKAPLLIGCD+RSMD +T +LLSNKEVIAVNQDKLG+QGKKV K GDLEVWAGPLS K
Sbjct: 243 ALAKAPLLIGCDLRSMDKVTFELLSNKEVIAVNQDKLGIQGKKVKKEGDLEVWAGPLSKK 302

Query: 377 RVAVVLWNRGFSRVNITASWSAIGLSPSTTVTARDLWEVRYEAC 421
           RVAV+LWNRG +  NITA W+ IGL+ S  V ARDLWE    +C
Sbjct: 303 RVAVILWNRGSASANITARWAEIGLNSSDIVNARDLWEHSTYSC 346

BLAST of Tan0010655.4 vs. TAIR 10
Match: AT5G08380.1 (alpha-galactosidase 1 )

HSP 1 Score: 513.8 bits (1322), Expect = 1.3e-145
Identity = 236/344 (68.60%), Positives = 279/344 (81.10%), Query Frame = 0

Query: 71  DSDSTRRVLLNNGLALTPPMGWNSWNHFQCNLNENLIKETADAMVSSALAALGYEYINLD 130
           DS+  RR LL NGL +TPPMGWNSWNHF CN++E +IKETADA+V++ L+ LGY Y+N+D
Sbjct: 37  DSEILRRHLLTNGLGVTPPMGWNSWNHFSCNIDEKMIKETADALVTTGLSKLGYNYVNID 96

Query: 131 DCWAELNRDSKGNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGH 190
           DCWAE++RDSKG+LV K STFPSGIKA+ADYVH KGLKLGIYSDAG  TCSK MPGSLG+
Sbjct: 97  DCWAEISRDSKGSLVPKKSTFPSGIKAVADYVHSKGLKLGIYSDAGYFTCSKTMPGSLGY 156

Query: 191 EEQDAKTFASWGIDYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPAT 250
           EE DAKTFA WGIDYLKYDNC + G+ P  RYP MT+AL +SGRPI  SLCEWG   PA 
Sbjct: 157 EEHDAKTFAEWGIDYLKYDNCNSDGSKPTVRYPVMTRALMKSGRPIFHSLCEWGDMHPAL 216

Query: 251 WAVNVGNSWRTTSDIQDNWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYR 310
           W   VGNSWRTT+DI+D W SM +IAD N+ +A +A+PGGWNDPDMLEVGNGGMT  EY 
Sbjct: 217 WGSPVGNSWRTTNDIKDTWLSMISIADMNEVYAEHARPGGWNDPDMLEVGNGGMTKDEYI 276

Query: 311 SHFSIWALAKAPLLIGCDIRSMDNITLKLLSNKEVIAVNQDKLGVQGKKVYKFGDLEVWA 370
            HFSIWA++KAPLL+GCDIR+M   T+++++NKEVIA+NQD  GVQ KKV   GDLEVWA
Sbjct: 277 VHFSIWAISKAPLLLGCDIRNMTKETMEIVANKEVIAINQDPHGVQAKKVRMEGDLEVWA 336

Query: 371 GPLSGKRVAVVLWNRGFSRVNITASWSAIGLSPSTTVTARDLWE 415
           GPLSG RVA++L NRG SR +ITA W  I +  ++ V ARDLWE
Sbjct: 337 GPLSGYRVALLLLNRGPSRTSITALWEDIEIPANSIVEARDLWE 380

BLAST of Tan0010655.4 vs. TAIR 10
Match: AT3G56310.1 (Melibiase family protein )

HSP 1 Score: 491.5 bits (1264), Expect = 6.7e-139
Identity = 240/393 (61.07%), Positives = 283/393 (72.01%), Query Frame = 0

Query: 36  SVLFWMFMLGSITDGNGGPVAGATRSRPLR-----------FATEFDSDSTRRVLLNNGL 95
           SVLF +  L S++      +AG  ++  L+           F + +D+    R+ LNNGL
Sbjct: 10  SVLFLVVGLFSLSVLVSQSIAGRVKAPLLQSNTGGLVFSKSFNSIYDTSMYGRLQLNNGL 69

Query: 96  ALTPPMGWNSWNHFQCNLNENLIKETADAMVSSALAALGYEYINLDDCWAELNRDSKGNL 155
           A TP MGWNSWN F CN+NE +IKETADA+VSS LA LGY ++N+DDCW+ L RDS+G L
Sbjct: 70  ARTPQMGWNSWNFFACNINETVIKETADALVSSGLADLGYIHVNIDDCWSNLLRDSEGQL 129

Query: 156 VAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAKTFASWGID 215
           V    TFPSGIK LADYVH KGLKLGIYSDAG+ TC +  PGSL HE  DA  FASWG+D
Sbjct: 130 VPHPETFPSGIKLLADYVHSKGLKLGIYSDAGVFTC-EVHPGSLFHEVDDADIFASWGVD 189

Query: 216 YLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVGNSWRTTSD 275
           YLKYDNC N G  P ERYP M  AL  +GR I +SLCEWG +DPA WA  VGNSWRTT D
Sbjct: 190 YLKYDNCFNLGIKPIERYPPMRDALNATGRSIFYSLCEWGVDDPALWAKEVGNSWRTTDD 249

Query: 276 IQDNWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALAKAPLL 335
           I D W SMTTIAD N+KWA+YA PGGWNDPDMLE+GNGGMT  EYR HFSIWAL KAPLL
Sbjct: 250 INDTWASMTTIADLNNKWAAYAGPGGWNDPDMLEIGNGGMTYEEYRGHFSIWALMKAPLL 309

Query: 336 IGCDIRSMDNITLKLLSNKEVIAVNQDKLGVQGKKVYKFGD---LEVWAGPLSGKRVAVV 395
           IGCD+R+M   TL++LSNKE+IAVNQD LGVQG+K+   G+    +VW+GPLSG R+ V 
Sbjct: 310 IGCDVRNMTAETLEILSNKEIIAVNQDPLGVQGRKIQANGENDCQQVWSGPLSGDRMVVA 369

Query: 396 LWNRGFSRVNITASWSAIGLSPSTTVTARDLWE 415
           LWNR      ITASW  IGL  + +V+ RDLW+
Sbjct: 370 LWNRCSEPATITASWDMIGLESTISVSVRDLWQ 401

BLAST of Tan0010655.4 vs. TAIR 10
Match: AT3G56310.2 (Melibiase family protein )

HSP 1 Score: 444.9 bits (1143), Expect = 7.2e-125
Identity = 225/393 (57.25%), Positives = 266/393 (67.68%), Query Frame = 0

Query: 36  SVLFWMFMLGSITDGNGGPVAGATRSRPLR-----------FATEFDSDSTRRVLLNNGL 95
           SVLF +  L S++      +AG  ++  L+           F + +D+    R+ LNNGL
Sbjct: 10  SVLFLVVGLFSLSVLVSQSIAGRVKAPLLQSNTGGLVFSKSFNSIYDTSMYGRLQLNNGL 69

Query: 96  ALTPPMGWNSWNHFQCNLNENLIKETADAMVSSALAALGYEYINLDDCWAELNRDSKGNL 155
           A TP MGWNSWN F CN+NE +IKETADA+VSS LA LGY ++N+               
Sbjct: 70  ARTPQMGWNSWNFFACNINETVIKETADALVSSGLADLGYIHVNI--------------- 129

Query: 156 VAKASTFPSGIKALADYVHRKGLKLGIYSDAGIQTCSKKMPGSLGHEEQDAKTFASWGID 215
                    GIK LADYVH KGLKLGIYSDAG+ TC +  PGSL HE  DA  FASWG+D
Sbjct: 130 ---------GIKLLADYVHSKGLKLGIYSDAGVFTC-EVHPGSLFHEVDDADIFASWGVD 189

Query: 216 YLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQEDPATWAVNVGNSWRTTSD 275
           YLKYDNC N G  P ERYP M  AL  +GR I +SLCEWG +DPA WA  VGNSWRTT D
Sbjct: 190 YLKYDNCFNLGIKPIERYPPMRDALNATGRSIFYSLCEWGVDDPALWAKEVGNSWRTTDD 249

Query: 276 IQDNWNSMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALAKAPLL 335
           I D W SMTTIAD N+KWA+YA PGGWNDPDMLE+GNGGMT  EYR HFSIWAL KAPLL
Sbjct: 250 INDTWASMTTIADLNNKWAAYAGPGGWNDPDMLEIGNGGMTYEEYRGHFSIWALMKAPLL 309

Query: 336 IGCDIRSMDNITLKLLSNKEVIAVNQDKLGVQGKKVYKFGD---LEVWAGPLSGKRVAVV 395
           IGCD+R+M   TL++LSNKE+IAVNQD LGVQG+K+   G+    +VW+GPLSG R+ V 
Sbjct: 310 IGCDVRNMTAETLEILSNKEIIAVNQDPLGVQGRKIQANGENDCQQVWSGPLSGDRMVVA 369

Query: 396 LWNRGFSRVNITASWSAIGLSPSTTVTARDLWE 415
           LWNR      ITASW  IGL  + +V+ RDLW+
Sbjct: 370 LWNRCSEPATITASWDMIGLESTISVSVRDLWQ 377

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q8RX86 | 2.9e-171 | 80.52 | Alpha-galactosidase 2 OS=Arabidopsis thaliana OX=3702 GN=AGAL2 PE=1 SV=1
Q42656 | 1.8e-168 | 81.52 | Alpha-galactosidase OS=Coffea arabica OX=13443 PE=1 SV=1
P14749 | 7.4e-167 | 74.86 | Alpha-galactosidase OS=Cyamopsis tetragonoloba OX=3832 PE=1 SV=1
Q9FXT4 | 7.7e-148 | 70.76 | Alpha-galactosidase OS=Oryza sativa subsp. japonica OX=39947 GN=Os10g0493600 PE=...
Q9FT97 | 1.8e-144 | 68.60 | Alpha-galactosidase 1 OS=Arabidopsis thaliana OX=3702 GN=AGAL1 PE=2 SV=1

Match Name | E-value | Identity | Description
KAG6573784.1 | 9.9e-217 | 91.09 | Alpha-galactosidase 2, partial [Cucurbita argyrosperma subsp. sororia]
XP_022966963.1 | 2.2e-216 | 93.35 | alpha-galactosidase-like [Cucurbita maxima]
XP_023541567.1 | 6.4e-216 | 93.09 | alpha-galactosidase-like [Cucurbita pepo subsp. pepo]
XP_022944991.1 | 1.9e-215 | 92.84 | alpha-galactosidase-like [Cucurbita moschata]
XP_038892679.1 | 1.6e-211 | 90.70 | alpha-galactosidase-like isoform X2 [Benincasa hispida]

Match Name | E-value | Identity | Description
A0A6J1HTR9 | 1.1e-216 | 93.35 | Alpha-galactosidase OS=Cucurbita maxima OX=3661 GN=LOC111466515 PE=3 SV=1
A0A6J1FZL3 | 9.0e-216 | 92.84 | Alpha-galactosidase OS=Cucurbita moschata OX=3662 GN=LOC111449362 PE=3 SV=1
A0A1S3BEH7 | 2.1e-209 | 91.12 | Alpha-galactosidase OS=Cucumis melo OX=3656 GN=LOC103488794 PE=3 SV=1
A0A5A7SVZ2 | 5.3e-208 | 90.89 | Alpha-galactosidase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold35G0...
Q2HYY3 | 7.6e-207 | 89.09 | Alpha-galactosidase OS=Cucumis sativus OX=3659 GN=Csa_5G580630 PE=2 SV=1

Match Name | E-value | Identity | Description
AT5G08370.1 | 2.1e-172 | 80.52 | alpha-galactosidase 2
AT5G08370.2 | 2.1e-172 | 80.52 | alpha-galactosidase 2
AT5G08380.1 | 1.3e-145 | 68.60 | alpha-galactosidase 1
AT3G56310.1 | 6.7e-139 | 61.07 | Melibiase family protein
AT3G56310.2 | 7.2e-125 | 57.25 | Melibiase family protein
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002241 | Glycoside hydrolase, family 27 | PRINTS | PR00740 | GLHYDRLASE27 | coord: 159..180 (score: 62.5); coord: 307..328 (score: 55.91); coord: 82..101 (score: 66.5); coord: 118..133 (score: 62.81); coord: 221..239 (score: 52.37); coord: 286..305 (score: 69.5); coord: 194..211 (score: 76.39)
IPR002241 | Glycoside hydrolase, family 27 | PFAM | PF16499 | Melibiase_2 | coord: 87..351 (e-value: 9.9E-81; score: 271.0)
IPR002241 | Glycoside hydrolase, family 27 | PANTHER | PTHR11452 | ALPHA-GALACTOSIDASE/ALPHA-N-ACETYLGALACTOSAMINIDASE | coord: 71..416
IPR002241 | Glycoside hydrolase, family 27 | CDD | cd14792 | GH27 | coord: 88..351 (e-value: 6.1621E-161; score: 452.01)
IPR041233 | Alpha galactosidase, C-terminal beta sandwich domain | PFAM | PF17801 | Melibiase_C | coord: 364..414 (e-value: 4.2E-13; score: 49.2)
IPR013785 | Aldolase-type TIM barrel | GENE3D | 3.20.20.70 | Aldolase class I | coord: 80..353 (e-value: 2.9E-139; score: 465.3)
IPR013780 | Glycosyl hydrolase, all-beta | GENE3D | 2.60.40.1180 | - | coord: 354..418 (e-value: 6.7E-20; score: 72.8)
None | No IPR available | PANTHER | PTHR11452:SF33 | ALPHA-GALACTOSIDASE 2 | coord: 71..416
None | No IPR available | SUPERFAMILY | 51011 | Glycosyl hydrolase domain | coord: 346..414
IPR000111 | Glycoside hydrolase family 27/36, conserved site | PROSITE | PS00512 | ALPHA_GALACTOSIDASE | coord: 123..139
IPR017853 | Glycoside hydrolase superfamily | SUPERFAMILY | 51445 | (Trans)glycosidases | coord: 78..351

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name | Unique Name | Type
Tan0010655 | Tan0010655 | gene


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
Tan0010655.4-five_prime_utr | Tan0010655.4-five_prime_utr-LG04:3771945..3772139 | five_prime_UTR


The following exon feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
Tan0010655.4-exon | Tan0010655.4-exon-LG04:3771945..3772411 | exon
Tan0010655.4-exon | Tan0010655.4-exon-LG04:3772567..3772625 | exon
Tan0010655.4-exon | Tan0010655.4-exon-LG04:3772713..3772769 | exon
Tan0010655.4-exon | Tan0010655.4-exon-LG04:3772862..3772896 | exon
Tan0010655.4-exon | Tan0010655.4-exon-LG04:3773000..3773103 | exon
Tan0010655.4-exon | Tan0010655.4-exon-LG04:3773288..3773363 | exon
Tan0010655.4-exon | Tan0010655.4-exon-LG04:3773690..3773748 | exon
Tan0010655.4-exon | Tan0010655.4-exon-LG04:3773816..3773881 | exon
Tan0010655.4-exon | Tan0010655.4-exon-LG04:3773982..3774065 | exon
Tan0010655.4-exon | Tan0010655.4-exon-LG04:3774249..3774313 | exon
Tan0010655.4-exon | Tan0010655.4-exon-LG04:3774747..3774829 | exon
Tan0010655.4-exon | Tan0010655.4-exon-LG04:3775070..3775160 | exon
Tan0010655.4-exon | Tan0010655.4-exon-LG04:3775305..3775354 | exon
Tan0010655.4-exon | Tan0010655.4-exon-LG04:3775473..3777115 | exon
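
As a consistency check, the exon coordinates listed above can be summed and compared with the sequence length of 2939 reported in the Overview. The following is a minimal Python sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive (the usual GFF3 convention, which the page itself does not state), and the names `exons` and `spliced_length` are illustrative.

```python
# Exon coordinates copied from the exon table (LG04, + strand).
# Assumption: coordinates are 1-based and inclusive, as in GFF3.
exons = [
    (3771945, 3772411),
    (3772567, 3772625),
    (3772713, 3772769),
    (3772862, 3772896),
    (3773000, 3773103),
    (3773288, 3773363),
    (3773690, 3773748),
    (3773816, 3773881),
    (3773982, 3774065),
    (3774249, 3774313),
    (3774747, 3774829),
    (3775070, 3775160),
    (3775305, 3775354),
    (3775473, 3777115),
]


def spliced_length(intervals):
    """Sum the lengths of inclusive (start, end) intervals."""
    return sum(end - start + 1 for start, end in intervals)


mrna_length = spliced_length(exons)
print(mrna_length)  # 2939
```

Under that assumption, the 14 exons add up to the 2939 nt given as the mRNA sequence length.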


The following CDS feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
Tan0010655.4-cds | Tan0010655.4-cds-LG04:3772140..3772411 | CDS
Tan0010655.4-cds | Tan0010655.4-cds-LG04:3772567..3772625 | CDS
Tan0010655.4-cds | Tan0010655.4-cds-LG04:3772713..3772769 | CDS
Tan0010655.4-cds | Tan0010655.4-cds-LG04:3772862..3772896 | CDS
Tan0010655.4-cds | Tan0010655.4-cds-LG04:3773000..3773103 | CDS
Tan0010655.4-cds | Tan0010655.4-cds-LG04:3773288..3773363 | CDS
Tan0010655.4-cds | Tan0010655.4-cds-LG04:3773690..3773748 | CDS
Tan0010655.4-cds | Tan0010655.4-cds-LG04:3773816..3773881 | CDS
Tan0010655.4-cds | Tan0010655.4-cds-LG04:3773982..3774065 | CDS
Tan0010655.4-cds | Tan0010655.4-cds-LG04:3774249..3774313 | CDS
Tan0010655.4-cds | Tan0010655.4-cds-LG04:3774747..3774829 | CDS
Tan0010655.4-cds | Tan0010655.4-cds-LG04:3775070..3775160 | CDS
Tan0010655.4-cds | Tan0010655.4-cds-LG04:3775305..3775354 | CDS
Tan0010655.4-cds | Tan0010655.4-cds-LG04:3775473..3775637 | CDS
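
The CDS segments above also determine the expected protein length. The sketch below is a hedged illustration, not part of the original record. It assumes 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; neither is stated on the page. The names `cds_segments`, `cds_length`, and `protein_length` are hypothetical.

```python
# CDS segment coordinates copied from the CDS table (LG04, + strand).
# Assumption: coordinates are 1-based and inclusive, as in GFF3.
cds_segments = [
    (3772140, 3772411),
    (3772567, 3772625),
    (3772713, 3772769),
    (3772862, 3772896),
    (3773000, 3773103),
    (3773288, 3773363),
    (3773690, 3773748),
    (3773816, 3773881),
    (3773982, 3774065),
    (3774249, 3774313),
    (3774747, 3774829),
    (3775070, 3775160),
    (3775305, 3775354),
    (3775473, 3775637),
]

# Total coding length in nucleotides.
cds_length = sum(end - start + 1 for start, end in cds_segments)

# A complete CDS should be a whole number of codons.
if cds_length % 3 != 0:
    raise ValueError(f"CDS length {cds_length} is not a multiple of 3")

# Assumption: the last codon is a stop codon and is not translated.
protein_length = cds_length // 3 - 1

print(cds_length, protein_length)  # 1266 421
```

The resulting 421 residues agree with the highest query coordinate (421) that appears in the BLAST alignments of this record.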


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
Tan0010655.4-three_prime_utr | Tan0010655.4-three_prime_utr-LG04:3775638..3777115 | three_prime_UTR


The following polypeptide feature(s) derive from this mRNA:

Feature Name | Unique Name | Type
Tan0010655.4 | Tan0010655.4-protein | polypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0005975 | carbohydrate metabolic process
biological_process | GO:0009965 | leaf morphogenesis
biological_process | GO:0009911 | positive regulation of flower development
biological_process | GO:0009620 | response to fungus
cellular_component | GO:0048046 | apoplast
cellular_component | GO:0016021 | integral component of membrane
cellular_component | GO:0009505 | plant-type cell wall
molecular_function | GO:0003824 | catalytic activity
molecular_function | GO:0004553 | hydrolase activity, hydrolyzing O-glycosyl compounds
molecular_function | GO:0052692 | raffinose alpha-galactosidase activity