Csa5G220910 (gene) Cucumber (Chinese Long) v2

NameCsa5G220910
Typegene
OrganismCucumis. sativus (Cucumber (Chinese Long) v2)
DescriptionAlpha-galactosidase; contains IPR000111 (Glycoside hydrolase, clan GH-D), IPR013780 (Glycosyl hydrolase, family 13, all-beta)
LocationChr5 : 10009011 .. 10018111 (+)
   



The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGAACAGGATAAGGTTTGGATCAGTGTTTTTACTGATTTTGTTACTCTCGGCGGCATATGTGGCGATTGCAGAAAGAAGAGTGTCACTATTGAATGGTTACGATAAATCGATTTTCAAGTCATCATTTCATCGGATTTTTGATACCTCCAAGTACGGCATACTGCAGCTTCAAAATGGGTTGGCTCGAACGCCTCAGATGGGGTATTCTTCTTTTTCCATACCTCTCAATTTTCTAAGTCTCCTATGATTTTCCTCATCTTTTGAACAACTGGGTTATAGTTCATTTCTAATTTTGAGTCCATTTGGAGTTTTATATGGGAAGAACTAGTAGAAACTTTACAATTTTTTGACGATTGGGAATATCGAGTTTTGTTTGCTGGCTTTGCCAAACTGTTTATGGCCTTTATGTTTTCAATGATCAGTGGTTTTCGGGTTTGTTTTGAAGATTATGATTGTTTTTATGTGTCCTTTCAGATGGAATAGCTGGAATTTTTTCGCCTGTGATATTAATGAAACTCTAATCAAGGAAACTGGTAATCCTTCTTCCTTGCTTTCTTTCCCTCATATTCTGTGTATTGGCTGATTTGAAAAAGTGGCAGTGAAAAGCTTCTTATGCATAGTACATCTTTAAGTTTAATGGTTACACTTTGCAGGTTAATCTGCTTTTAAAAGCAATATTTTCTGTTTGAAATGGAAGACTGAATATAAACTGAAGTTTTCTTAGAGGATGAGAAGTGTTCTTTGAGTTAATGTTCCACTGCTTTCCTTCTTAAGATTGTCGGTTTATCCTGATGAAATTAATTTATTTGTGAAGTTTATGACTTATGTTAATTCATCGGGTTATTGTGCATACAAGCTGCGATACATTGAGTTCTTTAGTTCATATATGAAAATCATCACTTTGCAATGGTTGAGTTGTGTGTCAGTTACTCCTCAATAATAATGGTATTTCTCTTTCTTGCTTCTGTTTCAACATTATTTGGAAATTGGTTTTGACAGCGGATGCACTCGTTTCTACGGGTTTGGCTGAGTTAGGTTATGTGTACGTCAACATAGGTACATTTCCTGCAGTTTGAAGTTCCAACTTTTCTATATGTTTTGTTTTAATTTAGTTTTGTTTTAATTTAGTATTTACTATAGTTTAAATAACTGCAGATGATTGCTGGAACACTCAAAAGAGAGACTCAAAGGTTATTCCAATTCCATTTTGTGGAAGAAGATTTGACTGTTCAATGTTTGAAGATTGTTTTAACAAAAAACATTGGATGGAGTTGTAATTCTCCAATTTAATATTTATCCTACTCTGCCTGCGAACTTTCTTGAGGCCCATTGTGTAAATTATGAATGTGAAGTGACAAACTATTCAAACTTATTGATGCTATCATAGAAACTACCAGCAATATCTTAGAAGAAAAGAATTCTACATGATAAACCATGGAATAGGGCTTTTTGGTGACATTGTACTAAGTTCCTTGTTAAGGCTCCCATCCTTGTATGCAGGATCAACTGGTTCCTGATCCCAAGGGTTTCCCGTCAGGAATTAAACCTCTTGCTGATTATGTTCATAGTAAAGACTTGAAGCTTGGAATATATTCTGATGCTGGGTAGGAAGAAATGTCAATTTAACTTCTAAAGCTTAAGAGTTAAATTTTATCTCACTCTTCTCGTATATTACTTTATAGAAAGGCATGCTGATAACTTTTATTTTGATTCTCACATTTGACAGTCTTTTCACTTGTCAAGTTCGAGCTGGGTCACTTTACCATGAAAATGATGATGCACAGTTGTTTGCTTCTTGGGTGAGTAATTGTAATTTAACTTTTGATAATGGAATGGTGTATTCAATACTTTTCCTAAGAAGCAGTATGTATAGTTTACATATGGGTTTGCTTAGTGGTTCTTGGAGCAAGTATAGAAAAAGTAAAAGGATGGCTTTGAGAAAGTGAGTTCAAAACTTCAAACCATGGTGGCTATCTACCCAAGACATTAAAGTCATACCAGTTTTGTACTGACCAAATTTCAGGGTTTGAAAAATTGTGTAGTGAGATTATCAAATGTGTGCATAAGCTTGTCTAGACACTCAAGGTTATCAAAAGGGAAAAAAAGAAAAGAAGAAAAAAAAAAGAAAAGTAATCTACAGACATGTGTTTTATGATCCTTTTTCTCTGGCCCCATCATCCTCAGTCGGCAAAGAATTTTGGCGATGTTTTTGTTCTGAACAATTTTTAATCCTCCCTTGTAGTCATATTCCTACCTTTCCGAGGGCTGGACATCTTTTCTCCCTCCAAGATGGAATTTGATTGGCTCATTCCACCTTATTTCTTCATAACTCATTGGCCTATGGAATGTACGCTTTTTTCTTTTTTGAATGACTTATTGGAGGGTGCTTCCTCAAGGCAGAATATGTAGGTTATAGATCGACATTCAAGCACTTTTGTTTAATGAATAAACATTCATTTGTTTTGGAATTCTAAGCTGTCCATAGAATTCTACTATCACTTCAGTTTTTAAGTTCATTTTATTCCATCTGATTTACCATTTGTTTCCATATTTTTTTTTCTACTAGGGGGTTGATTATCTGAAGTATGATAACTGTTTCAATCTAGGAATTAAACCAATAAAGCGGTTTGTACCTCAGTAAACTCAATAGTTTATTTGATTGTAAATTGGAAGTTTCTTACTGTTCTGAAATTAACTTGTGACTTACAGATACCCACCTATGCGTGATGCACTAAATGCAACTGGGCGGAGTATTTTCTATTCACTTTGTGAATGGTACTCTTGTTCTTCTAAACATCTTTGCTGATTTTTAAACTTTCTTTGTAATTTTTTTTCCTTATCGTCTTTGAATTTATTGGGCTATGATTTTGCTGCTGCCAGGGGAGTTGATGATCCAGCATTATGGGCCGGCAAGGTTGGAAACAGTTGGCGTACGACAGACGACATTAATGATACATGGGCAAGGTCAGGTTTTAGGCCTTCCCATCAATCATTCTTCTCTACTGGAGATTTTTAATCTTCTGTAATTTGCCTCAAGTCATTCTCTGTTGACACCATATTTTAGCGCATTGCTCGAACTTCTGAGCAGAAAATCTTTTGAACTGGTTAGAAAAAATTGGAAGTTATTTTTAGTTTTATCACTTTGCCTGCATAGATTTGCGTTTCCACTTTTTTTATGAAACTTAGAATTAAGACAGTTTTCCTTTGTATTGATTTAAATTCTTAAAAAATTAATTGTTGGTTTTAATGCTAAATCTTTGTGACATGCTTTGAAATCCATACCGCTACCATATCTTTCATCCCTTTCCTTTTCAGGTAGTGCTAATGCTAATTCTTATCTATTATTATGCTGATAGGTATTTTATTTTGTGTTTTCTTTTCTTGTTCAAGCATGACTACTCTTGCAGATCTCAACAATAAGTGGGCAGCCTATGCCGGACCTGGTGGCTGGAATGGTAGGAATATGATTTTATGCAAAGGTGTTGATTTCTCTGTTTGTTATGAGGCTTGTTATTTATTTTAACCTTTCATGGACAGATCCAGATATGTTGGAAGTTGGTAATGGAGGCATGACTTATCAGGAATATCGTGCTCATTTTAGCATATGGGCTCTGATGAAGGTTTCTCCATCCCTTTCTCTTTATAATTTCACTGTATGAAAAAGTCCAATACAGTTAATCTACCTCTACTGAGTTATTTTTTTGACAGCCCTACATATGCACATTCATGTAGAAATGCCAATTCATTGACCATCCTAAGCTTGTTTTCTTTTTAGTTTTAAACGCCTTATTCTCTCAAGTCTCATTCCTCATTGGGGAGAAACTACATTAAAAAAAAAAAAAGACTTGAAGGCACATACGTAAATGATAAGTTGTATAGAATTCTATGTCAATCAGGTAACTTATCAGCGTATAATTCAGTAGATGATTATACTAATACATGCTCTGTTTTTATAAAAAAGGAGACAAACTTCTTTATTATTAATAAACTCAATGTACAAGAGATCTAGACAATGAAAATAATAGCGAAGCCAAGAAAACTAATACATGCTCTGGTTTAGAGTGACAGATAAGGTAGTTAGGGAAATTGAAAGGTTAAGATAATTAATATGGTTAGAAATAGTGGGCTTAGGCCTCGATGAAAGACTGTGCGTAGTTATGATTTACTTTGTCCTATTTGTTTTTCTTTTAATTAGGCTTGTGTTGCCTATACATTTTTTTCCTTTTTATACCTTTATGATTATCAAAAAATAATAAGAAACATATATTGTGGTTTTCTTCTTGTACTGGGGTTTCCACTTAGTTCTTCTGTTCTTTCTTCGTTTTTATTTTTCAATATGGTATCAGAGTGGGATATTAATGAAACCCTAGACGGCTTGACACTTGATGGAACTCAAACGGATGGAACTCATGTACAAATTTCCTCTCTAGGAATAGCCCAGAAACAATTGGACATGCTTTGACAGCAAATGCGTATTGTTGAAGGTGGAGGAGCAACGATGAAAAGAATCGAGATCGACGGAAAAATGAATAAAGTCGAAGGAGACAAGAACTCGAATGGTCATAGTAAGTTTAAGAAGGTCGAGATGCCTATGTTTAGTGGCACTGATACAAATTCGTGGCTGTTTGGAGCAGATTGGTATTTTCAGGTCCATAAATTGACGGATTTCAAGAGAAGTTGGTGAATCTGGAGAATAATCGAGGAAGGAGAAAAATATGGACAAAGGAGAGCATAGGTAGCTGAAGAGAGATCGAGATCTTATTGAAACTGGAGGGGCAAAGAGATGAGCATGTTCGCATGATGAAAAAGAAAACCAGGTGTGGGTGGAGGAAGAAATTTATAAAGAAATATCTAATACCATCGAAGATATGTCAAAACAAGAGGAAGAGAGAACAACAAAAACTAGCATGGTCGAAGATAATATCTTGAGGAAAACGTGGTTGCAACTACAGAGAAGGTGAAGAGGATGGCGAAGAGGGACCAAGAAGAAACTATAGAGGAAGGTCAGTGAGGATTGCCTGTGAGGCCTGAAATGGAGGCTCGAGTGAAGAAAAGGTTGCTGCTCAAAGAAGCAGAAAATGGCTGGTTGCAATGGAGAAAGAAGACGCTGGCAGTTCAAGGATTGGGGCAAAGAGTACGCAAATCGAATTGTTGCAAACTGCACCGGAAGGAGTTAAAGGGGAAGTCAATGAAAGTGGAAGCATTGATTATGGCAATCAAGGCCGAGTTACGGCGCTTGGCATCGTTGTTAATAGCGAGAAAAGCCGATCTACGGTGGCCCGAGAAGGTGGTTTGGAATAGGGGTGAAATGTGTGTAGAGGAAGAAGTTGCTGCCTATGTGAAATGTCATGAAATGGTGTCAATGACGTGGCAGAAATTGGTAGGAAAGTGGGTTAGGGTAAACGAAACAATTTATCTTGGGTTTCACTAGTTCAAATGGGCTTCAACGTATTCTAAATAAAACATTAGAGGCCTATCATTTTGTAGGTGAAGTGGGTGGTGAGTTCCAAAAATGGGAGAGAAGAAAAACGTGTCTGGAGGTGGGTCCTGGAAGCCCATGGCTGTTTTTAAGGAGGGTAAAGGTGTGCTGGCCCAAGATAACTGGCGTTTGGCACAACAATTTGATTATAGACCTAGCCTTCTAGAGAAGAAGAGGTTGAAAATTAAACACTTGGTGTTGGTGATATGTAATGAGTGAACCATGACAACGTGTTTGGAAAAAGGAAAATTCAAGAACATTGGGCTGCCTTGTGGGCGATTAAACACTTGGTGTTGGTGATATGTAATGAGTTAACCATGACAACGTGTTTGGAAAAAGGAAAATTCAAGAACATTGGGCTGCCTTGTGGGCGGAAGGAAGATGATTCTGGACTATGGCCTGAATTCAATGGACCTTATTGGAGCCCAGAGTTTTTCAAAGCTTAAGGGGAGAATTTATTTTTCAACATTGAAGAGCAGAATGATGGGGATGGTTAGTAAATTAAAAAAAAATGGCCAATTTTGTGACCATGTTTTTAAATACACCTTGAGGACAAGGTGTTTTGAAGTGCAGGATAATGTTAGATACCTAGATTAGTATAGGATTAGGGGTATAAGGGTACTTAGATATTTAGGCAGTTACTAGTAGTAGTAGTGGTTATAAATAAGAAATTGGTGAGTGAAGAAGGAGGACTAGGAGGTTGTTGAGTAGTTAGGTCTTGAGTAGTATTCTCAAGAGAGGGTGTTTCAAGTACCATAAACTTGGATCATATTGTAGTTCTTCATTAATTCTCAATATATTTCAGATTTTGATTCTTGTTAGAAAGTGGTGTTCTAACAGAATCCAGCATCTTGTTATGTGGAACTATAAATACAAAGATATTTGGTCCTTTGACATCATTCTGCTGTGTGTGTTTCTATGTGTATGTTTAACAATTTTCAAACTTTGTTGGTTCGGACCCTTGTCACACTTTGTACATGTTTACCAACTGATCTAATTTGTTTTTCCTTTTTATATTCTGCTTACAAGTGAGTGGGAAATAATTTGCTGCCTGTAAGAAAACTGGAATTAAGGCTGGTTATGTACAAACAAAATCTTCAACCAGTTTTTCGTCCTTTAATATCTCAAATTTTTGGCGTCAATGTGCAGTCGCCTCTTTTGATCGGTTGCGATGTTCGAAACATGACCAAAGAAACTTCAGAAATTCTTATGAATAAGGAGGTGATTGCAGTAAACCAAGGTGAGGCTTATCTTTAATCCTTCTTTCATATCTCATTGTCTTGAGCTGAAGCTGTCTTTTCGTAGTTTGTTATCTTTGTAGTTTTTCTTGTAGTTTCGGCCTTTGTGGTTTCGTTTTCTTCTTTGTATTGCCTTTGTGTATTCTTTCCTCTCATTGCCATAAAAGTTGGTTTAATCATATTAAAAGTTATTTAAATAAAAATGAGAATATCCACCAATAAAACTGTTGATCCAATTAATTGATCTATTCTCAACATTGATAGCAGTGATATTTTCTCTCGTATACATGGTCCATGATTAATTGTTTCCAAAACTTAAAATTGATCACTGAGATAACCTAGAATGTTTTAACTTAAACTATCTCGATAACAATTGCCTTTTAGTTTTTTAAAAATGAGCATGTTTTCTCATGCTTTCTTTACCACGATTTTCATTCAAGTATAATTCAGTGTTTTAAATACTACTTTTTTTAGATTTCAAAACTTGGTTTAGATTTTTTTAAAAAATATCTCTAATAAATAGATAACAAAACAAAGCAATCCATAGTTAAAACTAAACTTCAAATGGTTACCAAATGGGGCCTTAATTATTTGTCTATGGAGAGTGGACATAATGTATTAAAACTTATTTTATGCAGATCCTCTTGGGGTCCAGGGAAGGAAGGTTAAAGTTTTTGGAAAGGATGGTTGTCTTCAGGTATTCCAGTTTTTGCATCATCTTATAACCCATAAATATGGCAATATTTACTTATATTTGATGTCGAATAATGTCAAGTCAATAACTGGAAATATCTTTACCTTACAAACGTGCTGTCAGTGTTCAAATGAAAACTGAGAAGGCAAATTTGAACTATTTGAGTTTTGTTACTGTTATGGTTTGTACATTATTAACTAACCAGTTCAGTTAGCTGAGTTAGGTGTAAATATTTCAGAATTTAAAGAGTTGTCCTACAGGTTTGGGCAGGTCCTCTATCTGGAAGCCGCTTGGCTGTTGTTCTTTGGAATCGATGCTCGGTTGCATCAACAATCACCACGGATTGGAACGCACTTGGGCTTAAACCTAACACCAGCGTCTCAGTAAGAGACTTGTGGCTGGTAAGGGATCTAATCACTTGAGAAAGAATTTTCCTACAAGTGATCACTGAATTTTTTATGTTGAGATTATACACGGACACTCTTTACTTTTGATGGATTTACAAGTAGGATGTTAGGTTATTACCTAACAAGAAAACTCAAGAACAACCAAGAACACAAATATATTGAAAGACAGAAACATATAGTGCAATATCAATGAAGAAATTACAGTTTCATAGCTTTTGAGGAAGCTACGCTCTTCCAAATTCCAAAGTCACACATAGGAAATTCCTACCAAATTGATTCGCACTTTGTTCAATTGACTGACTATTTATTACCATAAGACCCTAACAAACTCACTAGCTAGCTACTAACATGTCCCTTACTAATGACCATAAAATAATCCTAATATCTCCCTAACTAGGAGTGTTACATAGGAGTAGTAGAAGCGTGTACAATGGACACGTGATAGAGCAACTGGCGTCTGACAAAACCCAACTTAATAATCTGGCCATGTTCTTCTCTAGTTTCTTTTCGTTAGGTAGGCGAGGAGATTCAATGGTTGGGTCATTTGATGCCTTCATTTTAGTTGCAATGAGCTCATAATCATACGGATAGTAGTGTTAAATTTGATAGTACGAGGTTCCATCTCAAAACCAATTGGCAATGAGAGGAGTAGCCTATCTATCTTATATGGAGTATGAGTCTCCTTGGTTTTTTCAACGTGGAACTCTCAACTTCTCCAACAAGTAGCTAACCTATCTACACGTCTTTCAATTTGTTTTTTTTACCACCTAACCTATTTTGTGAACTAACTCAGTAACTTGATTTTTGCAGCACGAAGATGTTGCAGGAGATGCAATGTCATCATTTGGAGCAGAAGTCGACCCTCATGACTGCAAAATGTTCGTATTCACACCTGTAGCCACGTTCCGTGCTGAAATGTAAATTATCCATCCTCAACTGCTTCAAAGCTGGAAAGGATAGCAAACTCACAAGTGATCCTCCTATTTCAAGTATCATTGGTGATAGTTGATCCATCAGACTGAAGAAGTTAATAAATCAATAATTGGCTCGCCCCATTTGGCGCATTGTAGCTATTTTGTGCAAGTTGTATGGTTTCTGATGGTTCAACTTTGTAAAAGCACTAAACAATGTTTGTGAAAGTACTAAACTATGTGTTTGAATTAAGTACTGAAATGAAAAACAGGGTAAAGGGATTTCAGTTGCTGTGCCTTGGATGACAAAAGATCACTTGTAAAGTATTTGTTTTTGTGTTCAATGTTATTTTTCTGGTGTTTTAAA

mRNA sequence

ATGGCGAACAGGATAAGGTTTGGATCAGTGTTTTTACTGATTTTGTTACTCTCGGCGGCATATGTGGCGATTGCAGAAAGAAGAGTGTCACTATTGAATGGTTACGATAAATCGATTTTCAAGTCATCATTTCATCGGATTTTTGATACCTCCAAGTACGGCATACTGCAGCTTCAAAATGGGTTGGCTCGAACGCCTCAGATGGGATGGAATAGCTGGAATTTTTTCGCCTGTGATATTAATGAAACTCTAATCAAGGAAACTGCGGATGCACTCGTTTCTACGGGTTTGGCTGAGTTAGGTTATGTGTACGTCAACATAGATGATTGCTGGAACACTCAAAAGAGAGACTCAAAGGATCAACTGGTTCCTGATCCCAAGGGTTTCCCGTCAGGAATTAAACCTCTTGCTGATTATGTTCATAGTAAAGACTTGAAGCTTGGAATATATTCTGATGCTGGTCTTTTCACTTGTCAAGTTCGAGCTGGGTCACTTTACCATGAAAATGATGATGCACAGTTGTTTGCTTCTTGGGGGGTTGATTATCTGAAGTATGATAACTGTTTCAATCTAGGAATTAAACCAATAAAGCGATACCCACCTATGCGTGATGCACTAAATGCAACTGGGCGGAGTATTTTCTATTCACTTTGTGAATGGGGAGTTGATGATCCAGCATTATGGGCCGGCAAGGTTGGAAACAGTTGGCGTACGACAGACGACATTAATGATACATGGGCAAGCATGACTACTCTTGCAGATCTCAACAATAAGTGGGCAGCCTATGCCGGACCTGGTGGCTGGAATGATCCAGATATGTTGGAAGTTGGTAATGGAGGCATGACTTATCAGGAATATCGTGCTCATTTTAGCATATGGGCTCTGATGAAGTCGCCTCTTTTGATCGGTTGCGATGTTCGAAACATGACCAAAGAAACTTCAGAAATTCTTATGAATAAGGAGGTGATTGCAGTAAACCAAGATCCTCTTGGGGTCCAGGGAAGGAAGGTTAAAGTTTTTGGAAAGGATGGTTGTCTTCAGGTTTGGGCAGGTCCTCTATCTGGAAGCCGCTTGGCTGTTGTTCTTTGGAATCGATGCTCGGTTGCATCAACAATCACCACGGATTGGAACGCACTTGGGCTTAAACCTAACACCAGCGTCTCAGTAAGAGACTTGTGGCTGCACGAAGATGTTGCAGGAGATGCAATGTCATCATTTGGAGCAGAAGTCGACCCTCATGACTGCAAAATGTTCGTATTCACACCTGTAGCCACGTTCCGTGCTGAAATGTAA

Coding sequence (CDS)

ATGGCGAACAGGATAAGGTTTGGATCAGTGTTTTTACTGATTTTGTTACTCTCGGCGGCATATGTGGCGATTGCAGAAAGAAGAGTGTCACTATTGAATGGTTACGATAAATCGATTTTCAAGTCATCATTTCATCGGATTTTTGATACCTCCAAGTACGGCATACTGCAGCTTCAAAATGGGTTGGCTCGAACGCCTCAGATGGGATGGAATAGCTGGAATTTTTTCGCCTGTGATATTAATGAAACTCTAATCAAGGAAACTGCGGATGCACTCGTTTCTACGGGTTTGGCTGAGTTAGGTTATGTGTACGTCAACATAGATGATTGCTGGAACACTCAAAAGAGAGACTCAAAGGATCAACTGGTTCCTGATCCCAAGGGTTTCCCGTCAGGAATTAAACCTCTTGCTGATTATGTTCATAGTAAAGACTTGAAGCTTGGAATATATTCTGATGCTGGTCTTTTCACTTGTCAAGTTCGAGCTGGGTCACTTTACCATGAAAATGATGATGCACAGTTGTTTGCTTCTTGGGGGGTTGATTATCTGAAGTATGATAACTGTTTCAATCTAGGAATTAAACCAATAAAGCGATACCCACCTATGCGTGATGCACTAAATGCAACTGGGCGGAGTATTTTCTATTCACTTTGTGAATGGGGAGTTGATGATCCAGCATTATGGGCCGGCAAGGTTGGAAACAGTTGGCGTACGACAGACGACATTAATGATACATGGGCAAGCATGACTACTCTTGCAGATCTCAACAATAAGTGGGCAGCCTATGCCGGACCTGGTGGCTGGAATGATCCAGATATGTTGGAAGTTGGTAATGGAGGCATGACTTATCAGGAATATCGTGCTCATTTTAGCATATGGGCTCTGATGAAGTCGCCTCTTTTGATCGGTTGCGATGTTCGAAACATGACCAAAGAAACTTCAGAAATTCTTATGAATAAGGAGGTGATTGCAGTAAACCAAGATCCTCTTGGGGTCCAGGGAAGGAAGGTTAAAGTTTTTGGAAAGGATGGTTGTCTTCAGGTTTGGGCAGGTCCTCTATCTGGAAGCCGCTTGGCTGTTGTTCTTTGGAATCGATGCTCGGTTGCATCAACAATCACCACGGATTGGAACGCACTTGGGCTTAAACCTAACACCAGCGTCTCAGTAAGAGACTTGTGGCTGCACGAAGATGTTGCAGGAGATGCAATGTCATCATTTGGAGCAGAAGTCGACCCTCATGACTGCAAAATGTTCGTATTCACACCTGTAGCCACGTTCCGTGCTGAAATGTAA

Protein sequence

MANRIRFGSVFLLILLLSAAYVAIAERRVSLLNGYDKSIFKSSFHRIFDTSKYGILQLQNGLARTPQMGWNSWNFFACDINETLIKETADALVSTGLAELGYVYVNIDDCWNTQKRDSKDQLVPDPKGFPSGIKPLADYVHSKDLKLGIYSDAGLFTCQVRAGSLYHENDDAQLFASWGVDYLKYDNCFNLGIKPIKRYPPMRDALNATGRSIFYSLCEWGVDDPALWAGKVGNSWRTTDDINDTWASMTTLADLNNKWAAYAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALMKSPLLIGCDVRNMTKETSEILMNKEVIAVNQDPLGVQGRKVKVFGKDGCLQVWAGPLSGSRLAVVLWNRCSVASTITTDWNALGLKPNTSVSVRDLWLHEDVAGDAMSSFGAEVDPHDCKMFVFTPVATFRAEM*
BLAST of Csa5G220910 vs. Swiss-Prot
Match: AGAL3_ARATH (Alpha-galactosidase 3 OS=Arabidopsis thaliana GN=AGAL3 PE=1 SV=1)

HSP 1 Score: 652.1 bits (1681), Expect = 4.2e-186
Identity = 299/384 (77.86%), Postives = 334/384 (86.98%), Query Frame = 1

Query: 39  IFKSSFHRIFDTSKYGILQLQNGLARTPQMGWNSWNFFACDINETLIKETADALVSTGLA 98
           +F  SF+ I+DTS YG LQL NGLARTPQMGWNSWNFFAC+INET+IKETADALVS+GLA
Sbjct: 46  VFSKSFNSIYDTSMYGRLQLNNGLARTPQMGWNSWNFFACNINETVIKETADALVSSGLA 105

Query: 99  ELGYVYVNIDDCWNTQKRDSKDQLVPDPKGFPSGIKPLADYVHSKDLKLGIYSDAGLFTC 158
           +LGY++VNIDDCW+   RDS+ QLVP P+ FPSGIK LADYVHSK LKLGIYSDAG+FTC
Sbjct: 106 DLGYIHVNIDDCWSNLLRDSEGQLVPHPETFPSGIKLLADYVHSKGLKLGIYSDAGVFTC 165

Query: 159 QVRAGSLYHENDDAQLFASWGVDYLKYDNCFNLGIKPIKRYPPMRDALNATGRSIFYSLC 218
           +V  GSL+HE DDA +FASWGVDYLKYDNCFNLGIKPI+RYPPMRDALNATGRSIFYSLC
Sbjct: 166 EVHPGSLFHEVDDADIFASWGVDYLKYDNCFNLGIKPIERYPPMRDALNATGRSIFYSLC 225

Query: 219 EWGVDDPALWAGKVGNSWRTTDDINDTWASMTTLADLNNKWAAYAGPGGWNDPDMLEVGN 278
           EWGVDDPALWA +VGNSWRTTDDINDTWASMTT+ADLNNKWAAYAGPGGWNDPDMLE+GN
Sbjct: 226 EWGVDDPALWAKEVGNSWRTTDDINDTWASMTTIADLNNKWAAYAGPGGWNDPDMLEIGN 285

Query: 279 GGMTYQEYRAHFSIWALMKSPLLIGCDVRNMTKETSEILMNKEVIAVNQDPLGVQGRKVK 338
           GGMTY+EYR HFSIWALMK+PLLIGCDVRNMT ET EIL NKE+IAVNQDPLGVQGRK++
Sbjct: 286 GGMTYEEYRGHFSIWALMKAPLLIGCDVRNMTAETLEILSNKEIIAVNQDPLGVQGRKIQ 345

Query: 339 VFGKDGCLQVWAGPLSGSRLAVVLWNRCSVASTITTDWNALGLKPNTSVSVRDLWLHEDV 398
             G++ C QVW+GPLSG R+ V LWNRCS  +TIT  W+ +GL+   SVSVRDLW H+DV
Sbjct: 346 ANGENDCQQVWSGPLSGDRMVVALWNRCSEPATITASWDMIGLESTISVSVRDLWQHKDV 405

Query: 399 AGDAMSSFGAEVDPHDCKMFVFTP 423
             +   SF A+VD HDC M+V TP
Sbjct: 406 TENTSGSFEAQVDAHDCHMYVLTP 429

BLAST of Csa5G220910 vs. Swiss-Prot
Match: AGAL_CYATE (Alpha-galactosidase OS=Cyamopsis tetragonoloba PE=1 SV=1)

HSP 1 Score: 494.6 bits (1272), Expect = 1.1e-138
Identity = 232/368 (63.04%), Postives = 279/368 (75.82%), Query Frame = 1

Query: 59  QNGLARTPQMGWNSWNFFACDINETLIKETADALVSTGLAELGYVYVNIDDCWNTQKRDS 118
           +NGL +TP MGWNSWN F CDINE +++ETADA+VSTGLA LGY Y+N+DDCW    RDS
Sbjct: 49  ENGLGQTPPMGWNSWNHFGCDINENVVRETADAMVSTGLAALGYQYINLDDCWAELNRDS 108

Query: 119 KDQLVPDPKGFPSGIKPLADYVHSKDLKLGIYSDAGLFTCQVRA-GSLYHENDDAQLFAS 178
           +  +VP+   FPSGIK LADYVHSK LKLG+YSDAG  TC  R  GSL HE  DA+ FAS
Sbjct: 109 EGNMVPNAAAFPSGIKALADYVHSKGLKLGVYSDAGNQTCSKRMPGSLGHEEQDAKTFAS 168

Query: 179 WGVDYLKYDNCFNLGIKPIKRYPPMRDALNATGRSIFYSLCEWGVDDPALWAGKVGNSWR 238
           WGVDYLKYDNC NLGI   +RYPPM  AL ++GR IF+S+CEWG +DP +WA  +GNSWR
Sbjct: 169 WGVDYLKYDNCENLGISVKERYPPMGKALLSSGRPIFFSMCEWGWEDPQIWAKSIGNSWR 228

Query: 239 TTDDINDTWASMTTLADLNNKWAAYAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALMK 298
           TT DI D W SMT++AD N+KWA+YAGPGGWNDPDMLEVGNGGMT +EYR+HFSIWAL K
Sbjct: 229 TTGDIEDNWNSMTSIADSNDKWASYAGPGGWNDPDMLEVGNGGMTTEEYRSHFSIWALAK 288

Query: 299 SPLLIGCDVRNMTKETSEILMNKEVIAVNQDPLGVQGRKVKVFGKDGCLQVWAGPLSGSR 358
           +PLL+GCD+R M   T E++ N EVIAVNQD LGVQG+KVK       L+VWAGPLS ++
Sbjct: 289 APLLVGCDIRAMDDTTHELISNAEVIAVNQDKLGVQGKKVK---STNDLEVWAGPLSDNK 348

Query: 359 LAVVLWNRCSVASTITTDWNALGLKPNTSVSVRDLWLHED---VAGDAMSSFGAEVDPHD 418
           +AV+LWNR S  +T+T  W+ +GL+  T+V  RDLW H     V+G+      AE+D H 
Sbjct: 349 VAVILWNRSSSRATVTASWSDIGLQQGTTVDARDLWEHSTQSLVSGE----ISAEIDSHA 408

Query: 419 CKMFVFTP 423
           CKM+V TP
Sbjct: 409 CKMYVLTP 409

BLAST of Csa5G220910 vs. Swiss-Prot
Match: AGAL_ORYSJ (Alpha-galactosidase OS=Oryza sativa subsp. japonica GN=Os10g0493600 PE=1 SV=1)

HSP 1 Score: 489.6 bits (1259), Expect = 3.6e-137
Identity = 234/365 (64.11%), Postives = 276/365 (75.62%), Query Frame = 1

Query: 59  QNGLARTPQMGWNSWNFFACDINETLIKETADALVSTGLAELGYVYVNIDDCWNTQKRDS 118
           +NGL RTPQMGWNSWN F C INE +I+ETADALV+TGLA+LGY YVNIDDCW    RDS
Sbjct: 57  ENGLGRTPQMGWNSWNHFYCGINEQIIRETADALVNTGLAKLGYQYVNIDDCWAEYSRDS 116

Query: 119 KDQLVPDPKGFPSGIKPLADYVHSKDLKLGIYSDAGLFTCQVRA-GSLYHENDDAQLFAS 178
           +   VP+ + FPSGIK LADYVH+K LKLGIYSDAG  TC  +  GSL HE  D + FAS
Sbjct: 117 QGNFVPNRQTFPSGIKALADYVHAKGLKLGIYSDAGSQTCSNKMPGSLDHEEQDVKTFAS 176

Query: 179 WGVDYLKYDNCFNLGIKPIKRYPPMRDALNATGRSIFYSLCEWGVDDPALWAGKVGNSWR 238
           WGVDYLKYDNC + G   ++RY  M +A+   G++IF+SLCEWG ++PA WAG++GNSWR
Sbjct: 177 WGVDYLKYDNCNDAGRSVMERYTRMSNAMKTYGKNIFFSLCEWGKENPATWAGRMGNSWR 236

Query: 239 TTDDINDTWASMTTLADLNNKWAAYAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALMK 298
           TT DI D W SMT+ AD N++WAAYAGPGGWNDPDMLEVGNGGM+  EYR+HFSIWAL K
Sbjct: 237 TTGDIADNWGSMTSRADENDQWAAYAGPGGWNDPDMLEVGNGGMSEAEYRSHFSIWALAK 296

Query: 299 SPLLIGCDVRNMTKETSEILMNKEVIAVNQDPLGVQGRKVKVFGKDGCLQVWAGPLSGSR 358
           +PLLIGCDVR+M+++T  IL N EVIAVNQD LGVQG+KV+    D  L+VWAGPLS +R
Sbjct: 297 APLLIGCDVRSMSQQTKNILSNSEVIAVNQDSLGVQGKKVQ---SDNGLEVWAGPLSNNR 356

Query: 359 LAVVLWNRCSVASTITTDWNALGLKPNTSVSVRDLWLHEDVAGDAMSSFGAEVDPHDCKM 418
            AVVLWNR S  +TIT  W+ +GL  + +V+ RDLW H   A  A     A V PHDCKM
Sbjct: 357 KAVVLWNRQSYQATITAHWSNIGLAGSVAVTARDLWAHSSFA--AQGQISASVAPHDCKM 416

Query: 419 FVFTP 423
           +V TP
Sbjct: 417 YVLTP 416

BLAST of Csa5G220910 vs. Swiss-Prot
Match: AGAL2_ARATH (Alpha-galactosidase 2 OS=Arabidopsis thaliana GN=AGAL2 PE=1 SV=1)

HSP 1 Score: 487.6 bits (1254), Expect = 1.4e-136
Identity = 232/365 (63.56%), Postives = 273/365 (74.79%), Query Frame = 1

Query: 58  LQNGLARTPQMGWNSWNFFACDINETLIKETADALVSTGLAELGYVYVNIDDCWNTQKRD 117
           + NGLA +PQMGWNSWN F C+INETLIK+TADA+VS+GL+ +GY Y+NIDDCW   KRD
Sbjct: 32  MNNGLALSPQMGWNSWNHFQCNINETLIKQTADAMVSSGLSAIGYKYINIDDCWGELKRD 91

Query: 118 SKDQLVPDPKGFPSGIKPLADYVHSKDLKLGIYSDAGLFTC-QVRAGSLYHENDDAQLFA 177
           S+  LV     FPSGIK L+DYVHSK LKLGIYSDAG  TC Q   GSL HE  DA+ FA
Sbjct: 92  SQGSLVAKASTFPSGIKALSDYVHSKGLKLGIYSDAGTLTCSQTMPGSLGHEEQDAKTFA 151

Query: 178 SWGVDYLKYDNCFNLGIKPIKRYPPMRDALNATGRSIFYSLCEWGVDDPALWAGKVGNSW 237
           SWG+DYLKYDNC N G  P +RYP M  AL  +GRSIF+SLCEWG +DPA WAG +GNSW
Sbjct: 152 SWGIDYLKYDNCENTGTSPRERYPKMSKALLNSGRSIFFSLCEWGQEDPATWAGDIGNSW 211

Query: 238 RTTDDINDTWASMTTLADLNNKWAAYAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALM 297
           RTT DI D W SMT +AD N++WA+YA PG WNDPDMLEVGNGGMT +EY +HFSIWAL 
Sbjct: 212 RTTGDIQDNWKSMTLIADQNDRWASYARPGSWNDPDMLEVGNGGMTKEEYMSHFSIWALA 271

Query: 298 KSPLLIGCDVRNMTKETSEILMNKEVIAVNQDPLGVQGRKVKVFGKDGCLQVWAGPLSGS 357
           K+PLLIGCD+R+M K T E+L NKEVIAVNQD LG+QG+KVK   K+G L+VWAGPLS  
Sbjct: 272 KAPLLIGCDLRSMDKVTFELLSNKEVIAVNQDKLGIQGKKVK---KEGDLEVWAGPLSKK 331

Query: 358 RLAVVLWNRCSVASTITTDWNALGLKPNTSVSVRDLWLHEDVAGDAMSSFGAEVDPHDCK 417
           R+AV+LWNR S ++ IT  W  +GL  +  V+ RDLW H   +        A V+PH CK
Sbjct: 332 RVAVILWNRGSASANITARWAEIGLNSSDIVNARDLWEHSTYS-CVKKQLSALVEPHACK 391

Query: 418 MFVFT 422
           M+  T
Sbjct: 392 MYTLT 392

BLAST of Csa5G220910 vs. Swiss-Prot
Match: AGAL_COFAR (Alpha-galactosidase OS=Coffea arabica PE=1 SV=1)

HSP 1 Score: 486.1 bits (1250), Expect = 3.9e-136
Identity = 236/366 (64.48%), Postives = 271/366 (74.04%), Query Frame = 1

Query: 58  LQNGLARTPQMGWNSWNFFACDINETLIKETADALVSTGLAELGYVYVNIDDCWNTQKRD 117
           L NGL  TP MGWNSWN F C+++E LI+ETADA+VS GLA LGY Y+N+DDCW    RD
Sbjct: 16  LANGLGLTPPMGWNSWNHFRCNLDEKLIRETADAMVSKGLAALGYKYINLDDCWAELNRD 75

Query: 118 SKDQLVPDPKGFPSGIKPLADYVHSKDLKLGIYSDAGLFTC-QVRAGSLYHENDDAQLFA 177
           S+  LVP    FPSGIK LADYVHSK LKLGIYSDAG  TC +   GSL HE  DA+ FA
Sbjct: 76  SQGNLVPKGSTFPSGIKALADYVHSKGLKLGIYSDAGTQTCSKTMPGSLGHEEQDAKTFA 135

Query: 178 SWGVDYLKYDNCFNLGIKPIKRYPPMRDALNATGRSIFYSLCEWGVDDPALWAGKVGNSW 237
           SWGVDYLKYDNC N  I P +RYP M  AL  +GRSIF+SLCEWG +DPA WA +VGNSW
Sbjct: 136 SWGVDYLKYDNCNNNNISPKERYPIMSKALLNSGRSIFFSLCEWGEEDPATWAKEVGNSW 195

Query: 238 RTTDDINDTWASMTTLADLNNKWAAYAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALM 297
           RTT DI+D+W+SMT+ AD+N+KWA+YAGPGGWNDPDMLEVGNGGMT  EYR+HFSIWAL 
Sbjct: 196 RTTGDIDDSWSSMTSRADMNDKWASYAGPGGWNDPDMLEVGNGGMTTTEYRSHFSIWALA 255

Query: 298 KSPLLIGCDVRNMTKETSEILMNKEVIAVNQDPLGVQGRKVKVFGKDGCLQVWAGPLSGS 357
           K+PLLIGCD+R+M   T ++L N EVIAVNQD LGVQG KVK +G    L+VWAGPLSG 
Sbjct: 256 KAPLLIGCDIRSMDGATFQLLSNAEVIAVNQDKLGVQGNKVKTYGD---LEVWAGPLSGK 315

Query: 358 RLAVVLWNRCSVASTITTDWNALGLKPNTSVSVRDLWLHEDVAGDAMSSFGAEVDPHDCK 417
           R+AV LWNR S  +TIT  W+ +GL     V+ RDLW H            A VD HD K
Sbjct: 316 RVAVALWNRGSSTATITAYWSDVGLPSTAVVNARDLWAH-STEKSVKGQISAAVDAHDSK 375

Query: 418 MFVFTP 423
           M+V TP
Sbjct: 376 MYVLTP 377

BLAST of Csa5G220910 vs. TrEMBL
Match: A0A067JNU2_JATCU (Alpha-galactosidase OS=Jatropha curcas GN=JCGZ_03336 PE=3 SV=1)

HSP 1 Score: 694.9 bits (1792), Expect = 6.2e-197
Identity = 327/412 (79.37%), Postives = 353/412 (85.68%), Query Frame = 1

Query: 13  LILLLSAAYVAIAERRVSLLNGYDKSIF--KSSFHRIFDTSKYGILQLQNGLARTPQMGW 72
           L L L    VAIA R  SLL  Y+K  F    SFH IFDTSKYGI QL NGL R PQMGW
Sbjct: 15  LSLSLWVIQVAIAAREGSLLQSYEKGRFGYSKSFHTIFDTSKYGIFQLNNGLGRAPQMGW 74

Query: 73  NSWNFFACDINETLIKETADALVSTGLAELGYVYVNIDDCWNTQKRDSKDQLVPDPKGFP 132
           NSWNFFAC+INET+IK+TADALVSTGLA+LGYVY+NIDDCW+   RD+K QLVPDPK FP
Sbjct: 75  NSWNFFACNINETVIKQTADALVSTGLADLGYVYINIDDCWSASMRDAKGQLVPDPKTFP 134

Query: 133 SGIKPLADYVHSKDLKLGIYSDAGLFTCQVRAGSLYHENDDAQLFASWGVDYLKYDNCFN 192
           SGIK LADYVH K LKLGIYSDAG+FTCQVR GSLYHE DDA+LFASWGVDYLKYDNCFN
Sbjct: 135 SGIKALADYVHGKGLKLGIYSDAGVFTCQVRPGSLYHEKDDAELFASWGVDYLKYDNCFN 194

Query: 193 LGIKPIKRYPPMRDALNATGRSIFYSLCEWGVDDPALWAGKVGNSWRTTDDINDTWASMT 252
           LGI+P KRYPPMRDALN TGR+IFYSLCEWGVDDP LWAGKVGNSWRTTDDINDTWASMT
Sbjct: 195 LGIEPKKRYPPMRDALNETGRTIFYSLCEWGVDDPPLWAGKVGNSWRTTDDINDTWASMT 254

Query: 253 TLADLNNKWAAYAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALMKSPLLIGCDVRNMT 312
           T+AD+N+KWA YAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALMK+PLLIGCDVRNMT
Sbjct: 255 TIADVNDKWATYAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALMKAPLLIGCDVRNMT 314

Query: 313 KETSEILMNKEVIAVNQDPLGVQGRKVKVFGKDGCLQVWAGPLSGSRLAVVLWNRCSVAS 372
            E  EIL NKEVIA+NQDPLGVQGRKV   G +GC QVWAGPLSG RLAV LWNRCS  +
Sbjct: 315 AEAYEILTNKEVIAINQDPLGVQGRKVHTVGSEGCQQVWAGPLSGHRLAVALWNRCSKKA 374

Query: 373 TITTDWNALGLKPNTSVSVRDLWLHEDVAGDAMSSFGAEVDPHDCKMFVFTP 423
           TIT  W+ALGLK  TSVSVRDLW H+D+ GDA++SF A VD HDC M++FTP
Sbjct: 375 TITAPWDALGLKSGTSVSVRDLWQHKDLTGDAVASFDARVDAHDCAMYIFTP 426

BLAST of Csa5G220910 vs. TrEMBL
Match: D7T4E7_VITVI (Alpha-galactosidase OS=Vitis vinifera GN=VIT_13s0067g02640 PE=3 SV=1)

HSP 1 Score: 691.4 bits (1783), Expect = 6.9e-196
Identity = 327/422 (77.49%), Postives = 367/422 (86.97%), Query Frame = 1

Query: 8   GSVFLLILL-LSAAYVAIAERRVSLLNGYDKSIFKSSFHRIFDTSKYGILQLQNGLARTP 67
           GSV LL+ L LSA  V IA R V L   +DKS    SF  IFD SKYGILQL NGLARTP
Sbjct: 4   GSVHLLLFLYLSAISVGIAGRVVPLHEPFDKSTSSRSFSSIFDNSKYGILQLNNGLARTP 63

Query: 68  QMGWNSWNFFACDINETLIKETADALVSTGLAELGYVYVNIDDCWNTQKRDSKDQLVPDP 127
           QMGWNSWNFFAC+INET+IKETADALVSTGLA+LGYVYVNIDDCW++ +RDSK QLVPDP
Sbjct: 64  QMGWNSWNFFACNINETVIKETADALVSTGLADLGYVYVNIDDCWSSLERDSKGQLVPDP 123

Query: 128 KGFPSGIKPLADYVHSKDLKLGIYSDAGLFTCQVRAGSLYHENDDAQLFASWGVDYLKYD 187
           K FPSGIK LADYVH+K LKLGIYSDAG+FTCQVR GS+YHE DDA+LFASWGVDYLKYD
Sbjct: 124 KTFPSGIKALADYVHAKGLKLGIYSDAGIFTCQVRPGSIYHERDDAELFASWGVDYLKYD 183

Query: 188 NCFNLGIKPIKRYPPMRDALNATGRSIFYSLCEWGVDDPALWAGKVGNSWRTTDDINDTW 247
           NC+NLGIKP +RYPPMR+ALNATGR+IFYSLCEWGVDDPALWAGKVGNSWRTTDDIND+W
Sbjct: 184 NCYNLGIKPEERYPPMRNALNATGRTIFYSLCEWGVDDPALWAGKVGNSWRTTDDINDSW 243

Query: 248 ASMTTLADLNNKWAAYAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALMKSPLLIGCDV 307
           ASMTT+ADLN++WAAYAGPGGWNDPDMLEVGNGGMT +EYRAHFSIWALMK+PLL+GCDV
Sbjct: 244 ASMTTIADLNDEWAAYAGPGGWNDPDMLEVGNGGMTLEEYRAHFSIWALMKAPLLVGCDV 303

Query: 308 RNMTKETSEILMNKEVIAVNQDPLGVQGRKVKVFGKDGCLQVWAGPLSGSRLAVVLWNRC 367
           RN+T ET EI+ NKEVI +NQD LG+QGRKV V GKDGC QVWAGPLSG RL V LWNRC
Sbjct: 304 RNITAETFEIIGNKEVIDINQDSLGIQGRKVHVSGKDGCRQVWAGPLSGHRLVVALWNRC 363

Query: 368 SVASTITTDWNALGLKPNTSVSVRDLWLHEDVAGDAMSSFGAEVDPHDCKMFVFTPVATF 427
           S A+TIT  W  LGL+ + SVS+RDLW H D++GDA++SFG+ V  HDC M++FTPV+  
Sbjct: 364 SKAATITVGWEVLGLESSMSVSIRDLWKHVDLSGDAVASFGSLVASHDCGMYIFTPVSAS 423

Query: 428 RA 429
           R+
Sbjct: 424 RS 425

BLAST of Csa5G220910 vs. TrEMBL
Match: B9IQ46_POPTR (Alpha-galactosidase OS=Populus trichocarpa GN=POPTR_0019s08450g PE=3 SV=2)

HSP 1 Score: 685.6 bits (1768), Expect = 3.8e-194
Identity = 319/410 (77.80%), Postives = 354/410 (86.34%), Query Frame = 1

Query: 22  VAIAERRVSLLNGYDKSIF--KSSFHRIFDTSKYGILQLQNGLARTPQMGWNSWNFFACD 81
           VA A     LL GY++  +    SF+ +F TS YGI QL NGLARTPQMGWNSWNFFAC+
Sbjct: 27  VAFAGSIAPLLQGYEEGSYGYSRSFNNVFSTSNYGIFQLNNGLARTPQMGWNSWNFFACN 86

Query: 82  INETLIKETADALVSTGLAELGYVYVNIDDCWNTQKRDSKDQLVPDPKGFPSGIKPLADY 141
           INET+IKETADAL+STGLAELGYVYVNIDDCW++ KRDSK QL+PDPK FPSGIK LADY
Sbjct: 87  INETVIKETADALISTGLAELGYVYVNIDDCWSSTKRDSKGQLIPDPKTFPSGIKALADY 146

Query: 142 VHSKDLKLGIYSDAGLFTCQVRAGSLYHENDDAQLFASWGVDYLKYDNCFNLGIKPIKRY 201
           VH K LKLGIYSDAG FTCQVR GSL HE DDA+LFASWGVDYLKYDNCFNLGI P +RY
Sbjct: 147 VHEKGLKLGIYSDAGAFTCQVRPGSLLHEKDDAELFASWGVDYLKYDNCFNLGINPKERY 206

Query: 202 PPMRDALNATGRSIFYSLCEWGVDDPALWAGKVGNSWRTTDDINDTWASMTTLADLNNKW 261
           PPMRDALN+TGR++FYSLCEWGVDDPALWAGKVGNSWRTTDDIND+WASMTT ADLN+KW
Sbjct: 207 PPMRDALNSTGRTVFYSLCEWGVDDPALWAGKVGNSWRTTDDINDSWASMTTTADLNDKW 266

Query: 262 AAYAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALMKSPLLIGCDVRNMTKETSEILMN 321
           A+YAGPGGWNDPDMLEVGNGGMTY EYRAHFSIWALMK+PLLIGCDVRNMT ET EIL N
Sbjct: 267 ASYAGPGGWNDPDMLEVGNGGMTYHEYRAHFSIWALMKAPLLIGCDVRNMTAETIEILTN 326

Query: 322 KEVIAVNQDPLGVQGRKVKVFGKDGCLQVWAGPLSGSRLAVVLWNRCSVASTITTDWNAL 381
           KE+IAVNQDPLG+QGRKV   G DGCLQVWAGPLSG R+ V LWNRCS A+TIT  W AL
Sbjct: 327 KEIIAVNQDPLGIQGRKVYSTGTDGCLQVWAGPLSGHRIVVALWNRCSKAATITAGWGAL 386

Query: 382 GLKPNTSVSVRDLWLHEDVAGDAMSSFGAEVDPHDCKMFVFTPVATFRAE 430
           GL+ +TSVSVRDLW  +D+ GDA++SFGA VD HDC +F+FTP + + +E
Sbjct: 387 GLESSTSVSVRDLWQGKDIVGDAVASFGARVDAHDCLIFIFTPHSVYHSE 436

BLAST of Csa5G220910 vs. TrEMBL
Match: M5XCJ0_PRUPE (Alpha-galactosidase OS=Prunus persica GN=PRUPE_ppa005977mg PE=3 SV=1)

HSP 1 Score: 683.7 bits (1763), Expect = 1.4e-193
Identity = 324/423 (76.60%), Postives = 367/423 (86.76%), Query Frame = 1

Query: 1   MANRIRFGSVFLLILLLSAAYVAIAERRVSLLNGYDKSIF-KSSFHRIFDTSKYGILQLQ 60
           MA R    S++ L+L LSA  +A A R V L   ++K  F + +F+RIFDTS YGILQL 
Sbjct: 1   MAKRKITLSLYTLVLTLSAVSLAFAVRVVPLTQPHEKPTFLRPTFNRIFDTSLYGILQLN 60

Query: 61  NGLARTPQMGWNSWNFFACDINETLIKETADALVSTGLAELGYVYVNIDDCWNTQKRDSK 120
           NGLA+TPQMGWNSWNFFAC+INET+IKETADAL+STGLA+LGYVYVNIDDCW  Q R+S+
Sbjct: 61  NGLAQTPQMGWNSWNFFACNINETVIKETADALISTGLADLGYVYVNIDDCW-CQTRNSE 120

Query: 121 DQLVPDPKGFPSGIKPLADYVHSKDLKLGIYSDAGLFTCQVRAGSLYHENDDAQLFASWG 180
            QLVPDPK FPSGIK LA+Y+H K LKLGIYSDAG+FTCQVR GSLYHENDDA+LFASW 
Sbjct: 121 GQLVPDPKTFPSGIKALAEYLHRKGLKLGIYSDAGVFTCQVRPGSLYHENDDAKLFASWD 180

Query: 181 VDYLKYDNCFNLGIKPIKRYPPMRDALNATGRSIFYSLCEWGVDDPALWAGKVGNSWRTT 240
           VDYLKYDNC+NLGI P +RYPPMR+ALNA+GR+IFYS+CEWGVDDPALWAGK+GNSWRTT
Sbjct: 181 VDYLKYDNCYNLGIPPKERYPPMREALNASGRTIFYSICEWGVDDPALWAGKLGNSWRTT 240

Query: 241 DDINDTWASMTTLADLNNKWAAYAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALMKSP 300
           DDIND+WASMTT+ADLN+KWAAYAGPGGWNDPDMLEVGNGGM+YQEYRAHFSIWALMK+P
Sbjct: 241 DDINDSWASMTTIADLNDKWAAYAGPGGWNDPDMLEVGNGGMSYQEYRAHFSIWALMKAP 300

Query: 301 LLIGCDVRNMTKETSEILMNKEVIAVNQDPLGVQGRKVKVFGKDGCLQVWAGPLSGSRLA 360
           LL+GCDVRNMT ET EIL N+EVIAVNQDPLGVQGRKV V G DGC QVWAGPLSG RL 
Sbjct: 301 LLVGCDVRNMTAETFEILSNEEVIAVNQDPLGVQGRKVSVSGTDGCYQVWAGPLSGHRLT 360

Query: 361 VVLWNRCSVASTITTDWNALGLKPNTSVSVRDLWLHEDVAGDAMSSFGAEVDPHDCKMFV 420
           V LWNRCS A TIT  W ALGL+ + SVS+RDLW H++VA D +SSFGA VD HDC+M++
Sbjct: 361 VALWNRCSKAKTITVTWEALGLQSSISVSIRDLWEHKEVAVDTVSSFGARVDAHDCRMYI 420

Query: 421 FTP 423
           FTP
Sbjct: 421 FTP 422

BLAST of Csa5G220910 vs. TrEMBL
Match: A0A0J8CXF8_BETVU (Alpha-galactosidase OS=Beta vulgaris subsp. vulgaris GN=BVRB_2g042760 PE=3 SV=1)

HSP 1 Score: 673.3 bits (1736), Expect = 1.9e-190
Identity = 318/415 (76.63%), Postives = 358/415 (86.27%), Query Frame = 1

Query: 10  VFLLILLLSAAYVAIAERRVSLLNGYD--KSIFKSSFHRIFDTSKYGILQLQNGLARTPQ 69
           ++L I LLS +  AIA R   L + ++  K +F S+F+ IFDTSKYG +QL NGLA TPQ
Sbjct: 7   LYLCICLLSIS-TAIASRARPLQDNFNDNKLVFTSTFNNIFDTSKYGKVQLFNGLALTPQ 66

Query: 70  MGWNSWNFFACDINETLIKETADALVSTGLAELGYVYVNIDDCWNTQKRDSKDQLVPDPK 129
           MGWNSWNFFAC+INET+IKETADALVSTGLA+LGYVYVNIDDCW++  RDSKDQLVPDPK
Sbjct: 67  MGWNSWNFFACNINETVIKETADALVSTGLADLGYVYVNIDDCWSSATRDSKDQLVPDPK 126

Query: 130 GFPSGIKPLADYVHSKDLKLGIYSDAGLFTCQVRAGSLYHENDDAQLFASWGVDYLKYDN 189
            FPSGIK LADYVHSKDLKLGIYSDAG FTCQVR GS++HENDDA+LFASWGVDYLKYDN
Sbjct: 127 TFPSGIKALADYVHSKDLKLGIYSDAGAFTCQVRPGSIFHENDDAKLFASWGVDYLKYDN 186

Query: 190 CFNLGIKPIKRYPPMRDALNATGRSIFYSLCEWGVDDPALWAGKVGNSWRTTDDINDTWA 249
           CFNLGI P KRYPPMRDALNAT R IFYSLCEWGVDDPALWAG+VGNSWRTT+DIND+WA
Sbjct: 187 CFNLGIPPKKRYPPMRDALNATERPIFYSLCEWGVDDPALWAGEVGNSWRTTEDINDSWA 246

Query: 250 SMTTLADLNNKWAAYAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALMKSPLLIGCDVR 309
           SMTT+ADLN+KWA+YAGPGGWNDPDMLEVGNGGMTY EYRAHFSIWALMK+PLLIGCD+R
Sbjct: 247 SMTTIADLNDKWASYAGPGGWNDPDMLEVGNGGMTYHEYRAHFSIWALMKAPLLIGCDIR 306

Query: 310 NMTKETSEILMNKEVIAVNQDPLGVQGRKVKVFGKDGCLQVWAGPLSGSRLAVVLWNRCS 369
           NMT ET EIL N EVI VNQDPLGVQGRKV   G + C QVWAGPLSG+R+AV LWNRC 
Sbjct: 307 NMTAETLEILSNTEVIGVNQDPLGVQGRKVHASGPNDCQQVWAGPLSGNRIAVALWNRCP 366

Query: 370 VASTITTDWNALGLKPNTSVSVRDLWLHEDVAGDAMSSFGAEVDPHDCKMFVFTP 423
            A+ IT  WN LGL+ + SVS++DLW H+ +A DA+SSFG  V+ HDC M++FTP
Sbjct: 367 SAAVITAGWNVLGLQSSVSVSIQDLWQHKLIAKDAVSSFGVRVESHDCAMYIFTP 420

BLAST of Csa5G220910 vs. TAIR10
Match: AT3G56310.1 (AT3G56310.1 Melibiase family protein)

HSP 1 Score: 652.1 bits (1681), Expect = 2.3e-187
Identity = 299/384 (77.86%), Postives = 334/384 (86.98%), Query Frame = 1

Query: 39  IFKSSFHRIFDTSKYGILQLQNGLARTPQMGWNSWNFFACDINETLIKETADALVSTGLA 98
           +F  SF+ I+DTS YG LQL NGLARTPQMGWNSWNFFAC+INET+IKETADALVS+GLA
Sbjct: 46  VFSKSFNSIYDTSMYGRLQLNNGLARTPQMGWNSWNFFACNINETVIKETADALVSSGLA 105

Query: 99  ELGYVYVNIDDCWNTQKRDSKDQLVPDPKGFPSGIKPLADYVHSKDLKLGIYSDAGLFTC 158
           +LGY++VNIDDCW+   RDS+ QLVP P+ FPSGIK LADYVHSK LKLGIYSDAG+FTC
Sbjct: 106 DLGYIHVNIDDCWSNLLRDSEGQLVPHPETFPSGIKLLADYVHSKGLKLGIYSDAGVFTC 165

Query: 159 QVRAGSLYHENDDAQLFASWGVDYLKYDNCFNLGIKPIKRYPPMRDALNATGRSIFYSLC 218
           +V  GSL+HE DDA +FASWGVDYLKYDNCFNLGIKPI+RYPPMRDALNATGRSIFYSLC
Sbjct: 166 EVHPGSLFHEVDDADIFASWGVDYLKYDNCFNLGIKPIERYPPMRDALNATGRSIFYSLC 225

Query: 219 EWGVDDPALWAGKVGNSWRTTDDINDTWASMTTLADLNNKWAAYAGPGGWNDPDMLEVGN 278
           EWGVDDPALWA +VGNSWRTTDDINDTWASMTT+ADLNNKWAAYAGPGGWNDPDMLE+GN
Sbjct: 226 EWGVDDPALWAKEVGNSWRTTDDINDTWASMTTIADLNNKWAAYAGPGGWNDPDMLEIGN 285

Query: 279 GGMTYQEYRAHFSIWALMKSPLLIGCDVRNMTKETSEILMNKEVIAVNQDPLGVQGRKVK 338
           GGMTY+EYR HFSIWALMK+PLLIGCDVRNMT ET EIL NKE+IAVNQDPLGVQGRK++
Sbjct: 286 GGMTYEEYRGHFSIWALMKAPLLIGCDVRNMTAETLEILSNKEIIAVNQDPLGVQGRKIQ 345

Query: 339 VFGKDGCLQVWAGPLSGSRLAVVLWNRCSVASTITTDWNALGLKPNTSVSVRDLWLHEDV 398
             G++ C QVW+GPLSG R+ V LWNRCS  +TIT  W+ +GL+   SVSVRDLW H+DV
Sbjct: 346 ANGENDCQQVWSGPLSGDRMVVALWNRCSEPATITASWDMIGLESTISVSVRDLWQHKDV 405

Query: 399 AGDAMSSFGAEVDPHDCKMFVFTP 423
             +   SF A+VD HDC M+V TP
Sbjct: 406 TENTSGSFEAQVDAHDCHMYVLTP 429

BLAST of Csa5G220910 vs. TAIR10
Match: AT5G08370.1 (AT5G08370.1 alpha-galactosidase 2)

HSP 1 Score: 487.6 bits (1254), Expect = 7.6e-138
Identity = 232/365 (63.56%), Postives = 273/365 (74.79%), Query Frame = 1

Query: 58  LQNGLARTPQMGWNSWNFFACDINETLIKETADALVSTGLAELGYVYVNIDDCWNTQKRD 117
           + NGLA +PQMGWNSWN F C+INETLIK+TADA+VS+GL+ +GY Y+NIDDCW   KRD
Sbjct: 32  MNNGLALSPQMGWNSWNHFQCNINETLIKQTADAMVSSGLSAIGYKYINIDDCWGELKRD 91

Query: 118 SKDQLVPDPKGFPSGIKPLADYVHSKDLKLGIYSDAGLFTC-QVRAGSLYHENDDAQLFA 177
           S+  LV     FPSGIK L+DYVHSK LKLGIYSDAG  TC Q   GSL HE  DA+ FA
Sbjct: 92  SQGSLVAKASTFPSGIKALSDYVHSKGLKLGIYSDAGTLTCSQTMPGSLGHEEQDAKTFA 151

Query: 178 SWGVDYLKYDNCFNLGIKPIKRYPPMRDALNATGRSIFYSLCEWGVDDPALWAGKVGNSW 237
           SWG+DYLKYDNC N G  P +RYP M  AL  +GRSIF+SLCEWG +DPA WAG +GNSW
Sbjct: 152 SWGIDYLKYDNCENTGTSPRERYPKMSKALLNSGRSIFFSLCEWGQEDPATWAGDIGNSW 211

Query: 238 RTTDDINDTWASMTTLADLNNKWAAYAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALM 297
           RTT DI D W SMT +AD N++WA+YA PG WNDPDMLEVGNGGMT +EY +HFSIWAL 
Sbjct: 212 RTTGDIQDNWKSMTLIADQNDRWASYARPGSWNDPDMLEVGNGGMTKEEYMSHFSIWALA 271

Query: 298 KSPLLIGCDVRNMTKETSEILMNKEVIAVNQDPLGVQGRKVKVFGKDGCLQVWAGPLSGS 357
           K+PLLIGCD+R+M K T E+L NKEVIAVNQD LG+QG+KVK   K+G L+VWAGPLS  
Sbjct: 272 KAPLLIGCDLRSMDKVTFELLSNKEVIAVNQDKLGIQGKKVK---KEGDLEVWAGPLSKK 331

Query: 358 RLAVVLWNRCSVASTITTDWNALGLKPNTSVSVRDLWLHEDVAGDAMSSFGAEVDPHDCK 417
           R+AV+LWNR S ++ IT  W  +GL  +  V+ RDLW H   +        A V+PH CK
Sbjct: 332 RVAVILWNRGSASANITARWAEIGLNSSDIVNARDLWEHSTYS-CVKKQLSALVEPHACK 391

Query: 418 MFVFT 422
           M+  T
Sbjct: 392 MYTLT 392

BLAST of Csa5G220910 vs. TAIR10
Match: AT5G08380.1 (AT5G08380.1 alpha-galactosidase 1)

HSP 1 Score: 472.2 bits (1214), Expect = 3.3e-133
Identity = 238/426 (55.87%), Postives = 294/426 (69.01%), Query Frame = 1

Query: 1   MANRIRFGSVFLLILLLSAAYVAIAERRVSLLNGYDKS-IFKSSFHRIFDTSKYGILQLQ 60
           M+ R     + +L++L+S+  + + E   S+ NG+D S I +                L 
Sbjct: 1   MSRRAMVIKMPILMILISSMVMTMVESSRSVNNGHDDSEILRRHL-------------LT 60

Query: 61  NGLARTPQMGWNSWNFFACDINETLIKETADALVSTGLAELGYVYVNIDDCWNTQKRDSK 120
           NGL  TP MGWNSWN F+C+I+E +IKETADALV+TGL++LGY YVNIDDCW    RDSK
Sbjct: 61  NGLGVTPPMGWNSWNHFSCNIDEKMIKETADALVTTGLSKLGYNYVNIDDCWAEISRDSK 120

Query: 121 DQLVPDPKGFPSGIKPLADYVHSKDLKLGIYSDAGLFTC-QVRAGSLYHENDDAQLFASW 180
             LVP    FPSGIK +ADYVHSK LKLGIYSDAG FTC +   GSL +E  DA+ FA W
Sbjct: 121 GSLVPKKSTFPSGIKAVADYVHSKGLKLGIYSDAGYFTCSKTMPGSLGYEEHDAKTFAEW 180

Query: 181 GVDYLKYDNCFNLGIKPIKRYPPMRDALNATGRSIFYSLCEWGVDDPALWAGKVGNSWRT 240
           G+DYLKYDNC + G KP  RYP M  AL  +GR IF+SLCEWG   PALW   VGNSWRT
Sbjct: 181 GIDYLKYDNCNSDGSKPTVRYPVMTRALMKSGRPIFHSLCEWGDMHPALWGSPVGNSWRT 240

Query: 241 TDDINDTWASMTTLADLNNKWAAYAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALMKS 300
           T+DI DTW SM ++AD+N  +A +A PGGWNDPDMLEVGNGGMT  EY  HFSIWA+ K+
Sbjct: 241 TNDIKDTWLSMISIADMNEVYAEHARPGGWNDPDMLEVGNGGMTKDEYIVHFSIWAISKA 300

Query: 301 PLLIGCDVRNMTKETSEILMNKEVIAVNQDPLGVQGRKVKVFGKDGCLQVWAGPLSGSRL 360
           PLL+GCD+RNMTKET EI+ NKEVIA+NQDP GVQ +KV++   +G L+VWAGPLSG R+
Sbjct: 301 PLLLGCDIRNMTKETMEIVANKEVIAINQDPHGVQAKKVRM---EGDLEVWAGPLSGYRV 360

Query: 361 AVVLWNRCSVASTITTDWNALGLKPNTSVSVRDLWLHEDVAGDAMSSFGAEVDPHDCKMF 420
           A++L NR    ++IT  W  + +  N+ V  RDLW H+ +    + +  A VD H CK++
Sbjct: 361 ALLLLNRGPSRTSITALWEDIEIPANSIVEARDLWEHQTLKQKFVGNLTATVDSHACKLY 410

Query: 421 VFTPVA 425
           V  PVA
Sbjct: 421 VLKPVA 410

BLAST of Csa5G220910 vs. TAIR10
Match: AT3G26380.1 (AT3G26380.1 Melibiase family protein)

HSP 1 Score: 70.1 bits (170), Expect = 3.8e-12
Identity = 47/174 (27.01%), Postives = 79/174 (45.40%), Query Frame = 1

Query: 175 FASWGVDYLKYDNCFNLGIKPIKRYPPMRDALNATGRSIFYSLCEWGVDDPALW--AGKV 234
           +A WGVD++K+D  F      I+    + + L    R + YS+       P +     ++
Sbjct: 215 YAEWGVDFIKHDCVFGTDFN-IEEITYVSEVLKELDRPVLYSISPGTSVTPTMAKEVSQL 274

Query: 235 GNSWRTTDDINDTWASMTTLADLNNKWAAYAGPGG-------WNDPDMLEVG-------N 294
            N +R T D  DTW  +T   D++   +A +  G        W D DML +G       N
Sbjct: 275 VNMYRITGDDWDTWKDVTAHFDISRDLSASSMIGARGLQGKSWPDLDMLPLGWLTDQGSN 334

Query: 295 GG------MTYQEYRAHFSIWALMKSPLLIGCDVRNMTKETSEILMNKEVIAVN 327
            G      +  +E ++  ++W++ KSPL+ G DVRN+   T  ++ N  ++ +N
Sbjct: 335 VGPHRACNLNLEEQKSQMTLWSIAKSPLMFGGDVRNLDATTYNLITNPTLLEIN 387

BLAST of Csa5G220910 vs. NCBI nr
Match: gi|700195551|gb|KGN50728.1| (hypothetical protein Csa_5G220910 [Cucumis sativus])

HSP 1 Score: 894.8 bits (2311), Expect = 5.9e-257
Identity = 430/430 (100.00%), Postives = 430/430 (100.00%), Query Frame = 1

Query: 1   MANRIRFGSVFLLILLLSAAYVAIAERRVSLLNGYDKSIFKSSFHRIFDTSKYGILQLQN 60
           MANRIRFGSVFLLILLLSAAYVAIAERRVSLLNGYDKSIFKSSFHRIFDTSKYGILQLQN
Sbjct: 1   MANRIRFGSVFLLILLLSAAYVAIAERRVSLLNGYDKSIFKSSFHRIFDTSKYGILQLQN 60

Query: 61  GLARTPQMGWNSWNFFACDINETLIKETADALVSTGLAELGYVYVNIDDCWNTQKRDSKD 120
           GLARTPQMGWNSWNFFACDINETLIKETADALVSTGLAELGYVYVNIDDCWNTQKRDSKD
Sbjct: 61  GLARTPQMGWNSWNFFACDINETLIKETADALVSTGLAELGYVYVNIDDCWNTQKRDSKD 120

Query: 121 QLVPDPKGFPSGIKPLADYVHSKDLKLGIYSDAGLFTCQVRAGSLYHENDDAQLFASWGV 180
           QLVPDPKGFPSGIKPLADYVHSKDLKLGIYSDAGLFTCQVRAGSLYHENDDAQLFASWGV
Sbjct: 121 QLVPDPKGFPSGIKPLADYVHSKDLKLGIYSDAGLFTCQVRAGSLYHENDDAQLFASWGV 180

Query: 181 DYLKYDNCFNLGIKPIKRYPPMRDALNATGRSIFYSLCEWGVDDPALWAGKVGNSWRTTD 240
           DYLKYDNCFNLGIKPIKRYPPMRDALNATGRSIFYSLCEWGVDDPALWAGKVGNSWRTTD
Sbjct: 181 DYLKYDNCFNLGIKPIKRYPPMRDALNATGRSIFYSLCEWGVDDPALWAGKVGNSWRTTD 240

Query: 241 DINDTWASMTTLADLNNKWAAYAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALMKSPL 300
           DINDTWASMTTLADLNNKWAAYAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALMKSPL
Sbjct: 241 DINDTWASMTTLADLNNKWAAYAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALMKSPL 300

Query: 301 LIGCDVRNMTKETSEILMNKEVIAVNQDPLGVQGRKVKVFGKDGCLQVWAGPLSGSRLAV 360
           LIGCDVRNMTKETSEILMNKEVIAVNQDPLGVQGRKVKVFGKDGCLQVWAGPLSGSRLAV
Sbjct: 301 LIGCDVRNMTKETSEILMNKEVIAVNQDPLGVQGRKVKVFGKDGCLQVWAGPLSGSRLAV 360

Query: 361 VLWNRCSVASTITTDWNALGLKPNTSVSVRDLWLHEDVAGDAMSSFGAEVDPHDCKMFVF 420
           VLWNRCSVASTITTDWNALGLKPNTSVSVRDLWLHEDVAGDAMSSFGAEVDPHDCKMFVF
Sbjct: 361 VLWNRCSVASTITTDWNALGLKPNTSVSVRDLWLHEDVAGDAMSSFGAEVDPHDCKMFVF 420

Query: 421 TPVATFRAEM 431
           TPVATFRAEM
Sbjct: 421 TPVATFRAEM 430

BLAST of Csa5G220910 vs. NCBI nr
Match: gi|793417859|ref|NP_001292680.1| (alpha-galactosidase 3 precursor [Cucumis sativus])

HSP 1 Score: 894.4 bits (2310), Expect = 7.7e-257
Identity = 429/430 (99.77%), Postives = 430/430 (100.00%), Query Frame = 1

Query: 1   MANRIRFGSVFLLILLLSAAYVAIAERRVSLLNGYDKSIFKSSFHRIFDTSKYGILQLQN 60
           MANRIRFGSVFLLILLLSAAYVAIAERRVSLLNGYDKSIFKSSFHRIFDTSKYGILQLQN
Sbjct: 1   MANRIRFGSVFLLILLLSAAYVAIAERRVSLLNGYDKSIFKSSFHRIFDTSKYGILQLQN 60

Query: 61  GLARTPQMGWNSWNFFACDINETLIKETADALVSTGLAELGYVYVNIDDCWNTQKRDSKD 120
           GLARTPQMGWNSWNFFACDINETLIKETADALVSTGLAELGYVYVNIDDCWNTQKRDSKD
Sbjct: 61  GLARTPQMGWNSWNFFACDINETLIKETADALVSTGLAELGYVYVNIDDCWNTQKRDSKD 120

Query: 121 QLVPDPKGFPSGIKPLADYVHSKDLKLGIYSDAGLFTCQVRAGSLYHENDDAQLFASWGV 180
           QLVPDPKGFPSGIKPLADYVHSKDLKLGIYSDAGLFTCQVRAGSLYHENDDAQLFASWGV
Sbjct: 121 QLVPDPKGFPSGIKPLADYVHSKDLKLGIYSDAGLFTCQVRAGSLYHENDDAQLFASWGV 180

Query: 181 DYLKYDNCFNLGIKPIKRYPPMRDALNATGRSIFYSLCEWGVDDPALWAGKVGNSWRTTD 240
           DYLKYDNCFNLGIKPIKRYPPMRDALNATGRSIFYSLCEWGVDDPALWAGKVGNSWRTTD
Sbjct: 181 DYLKYDNCFNLGIKPIKRYPPMRDALNATGRSIFYSLCEWGVDDPALWAGKVGNSWRTTD 240

Query: 241 DINDTWASMTTLADLNNKWAAYAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALMKSPL 300
           DINDTWASMTTLADLNNKWAAYAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALMKSPL
Sbjct: 241 DINDTWASMTTLADLNNKWAAYAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALMKSPL 300

Query: 301 LIGCDVRNMTKETSEILMNKEVIAVNQDPLGVQGRKVKVFGKDGCLQVWAGPLSGSRLAV 360
           LIGCDVRNMTKETSEILMNKEVIAVNQDPLGVQGRKVKVFGKDGCLQ+WAGPLSGSRLAV
Sbjct: 301 LIGCDVRNMTKETSEILMNKEVIAVNQDPLGVQGRKVKVFGKDGCLQIWAGPLSGSRLAV 360

Query: 361 VLWNRCSVASTITTDWNALGLKPNTSVSVRDLWLHEDVAGDAMSSFGAEVDPHDCKMFVF 420
           VLWNRCSVASTITTDWNALGLKPNTSVSVRDLWLHEDVAGDAMSSFGAEVDPHDCKMFVF
Sbjct: 361 VLWNRCSVASTITTDWNALGLKPNTSVSVRDLWLHEDVAGDAMSSFGAEVDPHDCKMFVF 420

Query: 421 TPVATFRAEM 431
           TPVATFRAEM
Sbjct: 421 TPVATFRAEM 430

BLAST of Csa5G220910 vs. NCBI nr
Match: gi|659114169|ref|XP_008456938.1| (PREDICTED: alpha-galactosidase isoform X1 [Cucumis melo])

HSP 1 Score: 868.2 bits (2242), Expect = 5.9e-249
Identity = 413/430 (96.05%), Postives = 423/430 (98.37%), Query Frame = 1

Query: 1   MANRIRFGSVFLLILLLSAAYVAIAERRVSLLNGYDKSIFKSSFHRIFDTSKYGILQLQN 60
           MANRIRFGSVFLLILL S AYVAIAERRVSLLNGYDKSIFKSSFHRIFDTSKYGILQLQN
Sbjct: 1   MANRIRFGSVFLLILLHSVAYVAIAERRVSLLNGYDKSIFKSSFHRIFDTSKYGILQLQN 60

Query: 61  GLARTPQMGWNSWNFFACDINETLIKETADALVSTGLAELGYVYVNIDDCWNTQKRDSKD 120
           GLARTPQMGWNSWNFFACDIN+TLI+ETADALVSTGLAELGYVYVNIDDCWNTQKRDSKD
Sbjct: 61  GLARTPQMGWNSWNFFACDINDTLIQETADALVSTGLAELGYVYVNIDDCWNTQKRDSKD 120

Query: 121 QLVPDPKGFPSGIKPLADYVHSKDLKLGIYSDAGLFTCQVRAGSLYHENDDAQLFASWGV 180
           QLVPDPKGFPSGIKP+ADYVHSK LKLGIYSDAGLFTCQVRAGSLYHENDDAQLFASWG+
Sbjct: 121 QLVPDPKGFPSGIKPVADYVHSKGLKLGIYSDAGLFTCQVRAGSLYHENDDAQLFASWGI 180

Query: 181 DYLKYDNCFNLGIKPIKRYPPMRDALNATGRSIFYSLCEWGVDDPALWAGKVGNSWRTTD 240
           DYLKYDNC+NLGIKPIKR+PPMRDALNATGRSIFYS+CEWGVDDPALWAGKVGNSWRTTD
Sbjct: 181 DYLKYDNCYNLGIKPIKRFPPMRDALNATGRSIFYSICEWGVDDPALWAGKVGNSWRTTD 240

Query: 241 DINDTWASMTTLADLNNKWAAYAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALMKSPL 300
           DINDTWASMTTLAD+NNKWAAYAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALMKSPL
Sbjct: 241 DINDTWASMTTLADINNKWAAYAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALMKSPL 300

Query: 301 LIGCDVRNMTKETSEILMNKEVIAVNQDPLGVQGRKVKVFGKDGCLQVWAGPLSGSRLAV 360
           LIGCDVRNMTKETSEILMNKEVIAVNQDPLGVQGRKVK FGKDGCLQVWAGPLSGSRLAV
Sbjct: 301 LIGCDVRNMTKETSEILMNKEVIAVNQDPLGVQGRKVKDFGKDGCLQVWAGPLSGSRLAV 360

Query: 361 VLWNRCSVASTITTDWNALGLKPNTSVSVRDLWLHEDVAGDAMSSFGAEVDPHDCKMFVF 420
           VLWNRCSVASTITTDWN LGLKPNTSVSVRDLWLHEDV GDA+SSFGAEVDPHDCKMF+F
Sbjct: 361 VLWNRCSVASTITTDWNVLGLKPNTSVSVRDLWLHEDVEGDAVSSFGAEVDPHDCKMFIF 420

Query: 421 TPVATFRAEM 431
           TPVAT RAEM
Sbjct: 421 TPVATSRAEM 430

BLAST of Csa5G220910 vs. NCBI nr
Match: gi|659114171|ref|XP_008456939.1| (PREDICTED: alpha-galactosidase isoform X2 [Cucumis melo])

HSP 1 Score: 707.6 bits (1825), Expect = 1.3e-200
Identity = 340/368 (92.39%), Postives = 352/368 (95.65%), Query Frame = 1

Query: 1   MANRIRFGSVFLLILLLSAAYVAIAERRVSLLNGYDKSIFKSSFHRIFDTSKYGILQLQN 60
           MANRIRFGSVFLLILL S AYVAIAERRVSLLNGYDKSIFKSSFHRIFDTSKYGILQLQN
Sbjct: 1   MANRIRFGSVFLLILLHSVAYVAIAERRVSLLNGYDKSIFKSSFHRIFDTSKYGILQLQN 60

Query: 61  GLARTPQMGWNSWNFFACDINETLIKETADALVSTGLAELGYVYVNIDDCWNTQKRDSKD 120
           GLARTPQMGWNSWNFFACDIN+TLI+ETADALVSTGLAELGYVYVNIDDCWNTQKRDSKD
Sbjct: 61  GLARTPQMGWNSWNFFACDINDTLIQETADALVSTGLAELGYVYVNIDDCWNTQKRDSKD 120

Query: 121 QLVPDPKGFPSGIKPLADYVHSKDLKLGIYSDAGLFTCQVRAGSLYHENDDAQLFASWGV 180
           QLVPDPKGFPSGIKP+ADYVHSK LKLGIYSDAGLFTCQVRAGSLYHENDDAQLFASWG+
Sbjct: 121 QLVPDPKGFPSGIKPVADYVHSKGLKLGIYSDAGLFTCQVRAGSLYHENDDAQLFASWGI 180

Query: 181 DYLKYDNCFNLGIKPIKRYPPMRDALNATGRSIFYSLCEWGVDDPALWAGKVGNSWRTTD 240
           DYLKYDNC+NLGIKPIKR+PPMRDALNATGRSIFYS+CEWGVDDPALWAGKVGNSWRTTD
Sbjct: 181 DYLKYDNCYNLGIKPIKRFPPMRDALNATGRSIFYSICEWGVDDPALWAGKVGNSWRTTD 240

Query: 241 DINDTWASMTTLADLNNKWAAYAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALMKSPL 300
           DINDTWASMTTLAD+NNKWAAYAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALMKSPL
Sbjct: 241 DINDTWASMTTLADINNKWAAYAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALMKSPL 300

Query: 301 LIGCDVRNMTKETSEILMNKEVIAVNQDPLGVQGRKVKVFGKDGCLQVWAGPLSGSRLAV 360
           LIGCDVRNMTKETSEILMNKEVIAVNQDPLGVQGRKVK FGKDGCLQ     ++G+   +
Sbjct: 301 LIGCDVRNMTKETSEILMNKEVIAVNQDPLGVQGRKVKDFGKDGCLQNNLKSITGNIFTL 360

Query: 361 VLWNRCSV 369
            L N  SV
Sbjct: 361 HLANVLSV 368

BLAST of Csa5G220910 vs. NCBI nr
Match: gi|1000962381|ref|XP_015575755.1| (PREDICTED: alpha-galactosidase 3 [Ricinus communis])

HSP 1 Score: 702.6 bits (1812), Expect = 4.3e-199
Identity = 335/433 (77.37%), Postives = 366/433 (84.53%), Query Frame = 1

Query: 1   MANRIRFGSVFL--LILLLSAAYVAIAERRVSLLNGYDKSIF--KSSFHRIFDTSKYGIL 60
           M NR    S++L  + L L A  VAIA R V LL  Y K  F     FH IFDTSKYG  
Sbjct: 1   MENRSNSCSLYLVGISLFLWALRVAIAGREVPLLQSYQKGSFGYNKLFHNIFDTSKYGTF 60

Query: 61  QLQNGLARTPQMGWNSWNFFACDINETLIKETADALVSTGLAELGYVYVNIDDCWNTQKR 120
           QL NGLARTPQMGWNSWNFFAC+INET+IKETADAL+STGLA+LGYVYVNIDDCW+   R
Sbjct: 61  QLNNGLARTPQMGWNSWNFFACNINETVIKETADALISTGLADLGYVYVNIDDCWSAATR 120

Query: 121 DSKDQLVPDPKGFPSGIKPLADYVHSKDLKLGIYSDAGLFTCQVRAGSLYHENDDAQLFA 180
           D+K QLVPDPK FPSGIK LADY+H K LKLGIYSDAG+FTCQVR GSL+HE DDA LFA
Sbjct: 121 DAKGQLVPDPKTFPSGIKALADYIHGKGLKLGIYSDAGIFTCQVRPGSLHHEEDDADLFA 180

Query: 181 SWGVDYLKYDNCFNLGIKPIKRYPPMRDALNATGRSIFYSLCEWGVDDPALWAGKVGNSW 240
           SWGVDYLKYDNCFNLGIKP +RYPPMRDALNA+GR+IFYSLCEWGVDDPALWAGKVGNSW
Sbjct: 181 SWGVDYLKYDNCFNLGIKPKERYPPMRDALNASGRTIFYSLCEWGVDDPALWAGKVGNSW 240

Query: 241 RTTDDINDTWASMTTLADLNNKWAAYAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALM 300
           RTTDDIND+W SMTT+ADLN+KWAAYAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALM
Sbjct: 241 RTTDDINDSWVSMTTIADLNDKWAAYAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALM 300

Query: 301 KSPLLIGCDVRNMTKETSEILMNKEVIAVNQDPLGVQGRKVKVFGKDGCLQVWAGPLSGS 360
           K+PLLIGCDVRNMT ET EIL NKEVIAVNQD LGVQGRKV+  G DGCLQVWAGPLSG 
Sbjct: 301 KAPLLIGCDVRNMTAETYEILTNKEVIAVNQDSLGVQGRKVQASGTDGCLQVWAGPLSGH 360

Query: 361 RLAVVLWNRCSVASTITTDWNALGLKPNTSVSVRDLWLHEDVAGDAMSSFGAEVDPHDCK 420
           R+AVVLWNRCS A+TIT  W+ALGL+  TSV+VRDLW H+D+ GD+++SFG  VD HDC 
Sbjct: 361 RMAVVLWNRCSKAATITARWDALGLESGTSVAVRDLWQHKDITGDSVASFGTRVDAHDCA 420

Query: 421 MFVFTPVATFRAE 430
           M+ FTP     AE
Sbjct: 421 MYTFTPKTVLPAE 433

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
AGAL3_ARATH4.2e-18677.86Alpha-galactosidase 3 OS=Arabidopsis thaliana GN=AGAL3 PE=1 SV=1[more]
AGAL_CYATE1.1e-13863.04Alpha-galactosidase OS=Cyamopsis tetragonoloba PE=1 SV=1[more]
AGAL_ORYSJ3.6e-13764.11Alpha-galactosidase OS=Oryza sativa subsp. japonica GN=Os10g0493600 PE=1 SV=1[more]
AGAL2_ARATH1.4e-13663.56Alpha-galactosidase 2 OS=Arabidopsis thaliana GN=AGAL2 PE=1 SV=1[more]
AGAL_COFAR3.9e-13664.48Alpha-galactosidase OS=Coffea arabica PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A067JNU2_JATCU6.2e-19779.37Alpha-galactosidase OS=Jatropha curcas GN=JCGZ_03336 PE=3 SV=1[more]
D7T4E7_VITVI6.9e-19677.49Alpha-galactosidase OS=Vitis vinifera GN=VIT_13s0067g02640 PE=3 SV=1[more]
B9IQ46_POPTR3.8e-19477.80Alpha-galactosidase OS=Populus trichocarpa GN=POPTR_0019s08450g PE=3 SV=2[more]
M5XCJ0_PRUPE1.4e-19376.60Alpha-galactosidase OS=Prunus persica GN=PRUPE_ppa005977mg PE=3 SV=1[more]
A0A0J8CXF8_BETVU1.9e-19076.63Alpha-galactosidase OS=Beta vulgaris subsp. vulgaris GN=BVRB_2g042760 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT3G56310.12.3e-18777.86 Melibiase family protein[more]
AT5G08370.17.6e-13863.56 alpha-galactosidase 2[more]
AT5G08380.13.3e-13355.87 alpha-galactosidase 1[more]
AT3G26380.13.8e-1227.01 Melibiase family protein[more]
Match NameE-valueIdentityDescription
gi|700195551|gb|KGN50728.1|5.9e-257100.00hypothetical protein Csa_5G220910 [Cucumis sativus][more]
gi|793417859|ref|NP_001292680.1|7.7e-25799.77alpha-galactosidase 3 precursor [Cucumis sativus][more]
gi|659114169|ref|XP_008456938.1|5.9e-24996.05PREDICTED: alpha-galactosidase isoform X1 [Cucumis melo][more]
gi|659114171|ref|XP_008456939.1|1.3e-20092.39PREDICTED: alpha-galactosidase isoform X2 [Cucumis melo][more]
gi|1000962381|ref|XP_015575755.1|4.3e-19977.37PREDICTED: alpha-galactosidase 3 [Ricinus communis][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR000111Glyco_hydro_27/36_CS
IPR002241Glyco_hydro_27
IPR013780Glyco_hydro_b
IPR013785Aldolase_TIM
IPR017853Glycoside_hydrolase_SF
Vocabulary: Molecular Function
TermDefinition
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0003824catalytic activity
Vocabulary: Biological Process
TermDefinition
GO:0005975carbohydrate metabolic process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0005975 carbohydrate metabolic process
biological_process GO:0006635 fatty acid beta-oxidation
biological_process GO:0009755 hormone-mediated signaling pathway
biological_process GO:0006891 intra-Golgi vesicle-mediated transport
biological_process GO:0006869 lipid transport
biological_process GO:0010351 lithium ion transport
biological_process GO:0016558 protein import into peroxisome matrix
biological_process GO:0048767 root hair elongation
biological_process GO:0044763 single-organism cellular process
biological_process GO:0044765 single-organism transport
cellular_component GO:0005773 vacuole
cellular_component GO:0005575 cellular_component
molecular_function GO:0052692 raffinose alpha-galactosidase activity
molecular_function GO:0003824 catalytic activity
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds
This gene is associated with the following unigenes:
Unigene NameAnalysis NameSequence type in Unigene
CU107431cucumber EST collection version 3.0transcribed_cluster
CU143627cucumber EST collection version 3.0transcribed_cluster
CU157833cucumber EST collection version 3.0transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Csa5G220910.1Csa5G220910.1mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature NameUnique NameType
CU107431CU107431transcribed_cluster
CU157833CU157833transcribed_cluster
CU143627CU143627transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000111Glycoside hydrolase family 27/36, conserved sitePROSITEPS00512ALPHA_GALACTOSIDASEcoord: 101..117
scor
IPR002241Glycoside hydrolase, family 27PRINTSPR00740GLHYDRLASE27coord: 137..158
score: 7.1E-70coord: 96..111
score: 7.1E-70coord: 284..305
score: 7.1E-70coord: 198..216
score: 7.1E-70coord: 60..79
score: 7.1E-70coord: 263..282
score: 7.1E-70coord: 171..188
score: 7.1
IPR002241Glycoside hydrolase, family 27PFAMPF16499Melibiase_2coord: 65..395
score: 6.5
IPR013780Glycosyl hydrolase, all-betaGENE3DG3DSA:2.60.40.1180coord: 323..423
score: 1.6
IPR013785Aldolase-type TIM barrelGENE3DG3DSA:3.20.20.70coord: 58..322
score: 2.6E
IPR017853Glycoside hydrolase superfamilyunknownSSF51445(Trans)glycosidasescoord: 57..329
score: 2.18E
NoneNo IPR availablePANTHERPTHR11452ALPHA-GALACTOSIDASE/ALPHA-N-ACETYLGALACTOSAMINIDASEcoord: 58..422
score: 9.7E
NoneNo IPR availablePANTHERPTHR11452:SF33SUBFAMILY NOT NAMEDcoord: 58..422
score: 9.7E
NoneNo IPR availableunknownSSF51011Glycosyl hydrolase domaincoord: 323..421
score: 7.37