Cp4.1LG02g12820 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG02g12820
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionBeta-galactosidase
LocationCp4.1LG02 : 11484854 .. 11490786 (-)
   



The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
CAATGACGGGAGTTCGATATGCGCTAGTGGTTGTTTTGTTAGTTTTAGGCGTTTTAGACTCATTTTCGCTTGCGGCCAATGTGACGTACGATCATCGGGCGCTGGTGATCGACGGCAAGCGGAGAGTGTTGGTTTCTGGATCCATACACTATCCTCGCAGCACTCCTGAGGTTTGTCTCCTTCAGCCTTGGAACTTTGACGTTCTTTCTCTGAGGATTTTTCTATCTGAATCGGTTAGTGGTGTTTGTACTCGGCAGATGTGGCCGGACCTTATTCAGAAATCTAAGGATGGAGGTCTGGATGTGATTGAAACTTACGTGTTCTGGAATCTACACGAACCTGTTCGAAACCAGGTACATTTTTGCGACTTCGTTGATCCAAGAGCGGCGTTTGGTTCCAAGGAAATTGTTGGATTGGGAGATATTTGAGAAATCTGTTTACCGTTGAGTTGTGTTTTTCATCCCCTGAAAAAGCTCAATGATGGATGGAACCTGTCTTCTGTTATCATCTTTTCGGTCTTCTTACGTTTTCTCGTCTACCAAACAGTGGCAATTATTATTCATTTTTACATGAGTACTACATAATATGTTGCGATTTCTCAATCAACAGTATGACTTCGAGGGAAGGAAGGATTTAGTTAAATTCGTAAAGCTGGTAGGAGCTGCTGGTCTATATGTACATATACGAATTGGTCCTTATGTGTGCGCGGAATGGAATTACGGGTACATCTCTTTTCCATTCTCAATTCGTCAGTTGCTTTCCATTTCCGTTCTCTCTGGGTTTTACACCTCTTCCCTTTTGTGCAATGCCCATAACGTTTTTTATATTTTATTTTCCCCCCGGAAACTGTTTGATGTAAAAACAGAGGTTTTCCAGTTTGGTTGCATTTCGTACCTGGCATTAAATTCCGCACAGATAATGAACCATTCAAGGTTAAGTATAACTTTACCATTTCGGTGCTTTTGCTTTTTTTTCTTCTCCAGTGTCGTGTTTATCCTCATATTAAGAAAAAATAAATAATGGTTTTGTATAGGCTGAAATGAAGCGATTTACAGCCAAGATTGTTGATGTATTGAAGCAGGAGAAATTATATGCCTCTCAGGGCGGACCAGTTATTTTATCTCAGGTGGATTTCAATTTTCAAGTACCTTCTTCTATTTCTCCCTCCTCTCCCCATTTAGCTTGCTAAATGAAAGCAATCTGGAATGAGGGTGGATTGGAAGGGAGTCCAACTAATTTAGAGAATAATCACGAGTGTATAAATAAATAAATACATAATTGGTATTATAGTTTGTGAGAAGTCCAAAGCAAAATTACTACCGGTTATGCTAGAAGTTGACAAGCTCATCCTACCATTCTGGAAAGTCCTAACAATGTGTTAACACCAATATTTTTTACTCGCATATGTCCTGGTGATGGTGTCTCAGGACTATCACACATTTGAAGTAAGAGTACTTATAAGGTTAATATAATGGCTGCTTTTCTTCTAAAAACAGATTGAAAATGAATATGGAAACGTTCAGTCTTCTTATGGATCTGCTGCTAAATCCTATATCCAATGGGCGGCAACCATGGCTACGTCTTTGAACACGGGAGTTCCTTGGGTTATGTGCAACCAACCTGATGCCCCCGATCCCATTGTGAGGAATCTTTCCGTATCATGACCATAAATTGTAGACGACAAATCTTAATTGCTCTAGTTACACATTACAGCTAGGCCAAATTTTGATTGAATATAAACTTACCTTATTATATACAACTTCCCCGTGCTTGTCATGTGGTCCCACAGATTAACACTTGCAATGGATTCTACTGTGATCAATTCACGCCAAATTCTAAGAATAAGCCCAAAATCTGGACTGAGAATTGGACTGGATGGTAGACTTTCGCATTTGCTGTTTCCTGCATTTTTTTCTCTGTCAACTCGATTTCTTAATTACGCACGAGTCATTTACGCTTTTGACAATTTCGATTGTTTTATCATAGGTTTCTTTCCTTTGGTGGAGCCTCGCCATACAGACCCGTGGAAGATCTTGCATATGCTGTGGCACGCTTTTATCAGAATGGTGGAACTTTACAGAATTATTACATGGTACTCCACTTGTGATATGCTCATTATGGTTGACAGAATAAATGATTTATTTACGTTTCCTTGAAACGTATGAACCCATGATAGATGGCATCGGTTTTTCCTTATTTTCTCTAATACAGTACCACGGTGGGACAAACTTTGGTAGGACTACTGGTGGACCGTTTATTTCTACTAGTTATGATTATGATGCCCCTATAGATGAGTATGGTATGGTTTCCGTCTATTAAATATTGTTCTTAATTGATTTGTTATGATGTAAAAATCTTCTCAAGTATTGCAATTATATCTTCAAATTTTGTGCAAAGAATGAAGATGCTGGTTGGAATTGTAGTGATGTTGTGAACTTGATCATAAAAATGGAAAAAAAGAAACGATATGGATACACGATCATCCTAATTCTTTACAATCTACTTCTCTTCTATACGATTATTCTTCTATTCTGCAGGACTGGTTAGGCAACCTAATTGGGGTCACTTGAGGGAAGTCCACGAGGCTATAAAGATGTGCGAAGAAGCATTAGTGAGCACAGAACCCGCAGTTAGTTCTCTTGGTCAAAATTTGGAGGTAGGTTATTCCGATAACTGTCTTTTCGTCTTAAAACTATCAAGAGCTTACCCAGTACAATAATAAGTTGAAATGTACGTCTAGTTTGGATTCCTTTTAGTAGGGGAAACAATTAGAGTTTGACTTGGTATACGAAAGAACGACTACTATTTCTGGCTTTAGTTTAACATGGAACCTATCGGAATTATGCATCAGAATGTATGGTAAGAAAATCAGAAGAAACTCCTCGCAGAAATTTTGAAGAGAGTATCCAAGGAAAAGTTGTCCAGATCACAATTATAAAACTGTGCTCTTACCTTAAATAATCATATTCCTTCGTGTGCTGTGCATTCATTGAATAAAGATGGTCAAAAATGGTTAAGATGTTATTCTTTGTACTAATTTCAGGCGACTGTTTATAAGTCCGGTTCTCAATGTTGTGCTTTTCTTGCCAATGTGGATACCCAATCTGATGCGACGGTGACTTTCAATGGTAATACATATCATTTGCCAGCATGGTCTGTTAGCATCTTGCCTGACTGCAAGAATGTGGTGCTCAATACTGCAAAGGTTATATTGTTTTCTTCTCAGCAAATGTATTTGCAGCCGAGAACTAAAATTCTTATCTTTTGTTTGTGACTTGTCATTGTTGGTACTCTAGTCAAGTGAATATTTCCTTTCAATTCTTATCGCTACTTTTATTTGCTCCTAATTGACTTAATTGTTGCACGGATGAAGTATACCTCATTTTATGGCAGATTAATTCGGTAACAATGAGGCCATCCTTTTCAAACCAACCTTTGAAAGTTGATGCTAGTGCTTCTGAAGCATTTGACTCAGGATGGGGTTGGATAGACGAACCAGTTGGTATCTCAAAGGCTAATTCTTTTGCGAAACTTGGACTTTCAGAGCAAATAAATACTACAGCAGATCAAAGCGATTACTTGTGGTATTCTTTAAGGTACGTACTCTATTTATTATTAAATCTTCAGTTTCTTTATTCAACGGACTGCCAACCAGTTTCTAAAAATAGTTTTGTCAACTTTTGCTACTACCATTTGGCAGCACTGATATAAAAGGTGACGAACCTTTCCTTGAAAATGGAACGGAAACTGTCCTCCATGTTGAATCCCTTGGCCATGCTCTTCACGTTTTTATTAATAAAAAGCTTGCAGGTGATTGGAATTAATTATTTGGTTCTGTGCGTTTGAATGTCTTTAAAATTACTCCTGCGTTGTTGCAATGGTGCTGCTTTATAGATGTCGTTTGAATTTCTTTGTAGGAAGTGGAAGAGGTGGTAAAGGCAATTCTAAGGTTTCTTTGGAGATCCCCATCACACTGGTACCTGGGAAAAATACAATTGACCTCCTGAGTTTGACAGTGGGACTTCAGGTTCTGCTTCACTTAAATTTGTGCCGGCATATATCTTGCGAGAAATCATTACCTGAATCTCAAAAAGAAATATGCAATGTGAATTTCTTTTATTGGATGAATTTTTTATATATTTTTTAACCCACAAACTCTGTAATGCAATATATCAGCATTATGGAGCATTTTTTGAGACGAAAGGGGCAGGGGTCACAGGGGTAAAACTGGAAAGCCAAAAAAATGGCATCACTGTTGATATCTCTTCTGGACAATGGACATATCAGGTTGGTTTCTTCAAACATCCTGGAGTTTAATTGCCTTATACTGCTACCGCAGTCGTTGTTTCATGATTTGATTAAAGTCATCGTAGTATGCCTTTAGTTTCATTTTTTTATTATTAGTTTCTTTTGTAGTCTCTTTCATACGCATCAATGCTGTAATTTAACCTTTTTTCATAAATTTTTTTTTTTTGTCGTATGATGGTAGATTGGACTTAAAGGTGAAGATTTAGGACTGTCGAGTGGAAGCTCTTCACAATGGCTTTCACAACCAAGCTTGCCTAAGAATAAACCTTTGACGTGGTACAAGGTATGGTTATTTCATGTATGATTTATAAGAATAAGAAAATAGTTAAGAGTTGTTTCCTACTAGAGAGCGAGGGGACTCCTTGTACCTTGTTTTACAAGATTGGTCTAGATTATGTTGAACATTTTCTCGGTTATATAAATATGATTTTGGTCTTGTGTTTTGAATTTTCTCTATACTTTCAAATGTTCAATTTAGTCCTCTATTTTGAATAAAGTTTAAAATTGATTCTGGTTAACTCTTCTAAAAATATTTAATCTCTATTAGTCTTTTCTTTATAAAGGTTGAAAATACGTTTTCTATTCGTCTTTCTTTTATAAAGGTTGGGAATACGTTTGCTAGGCTCGAAGATACCTCTCATACAAGTTTGTTAGATGTTGTTCTGCTAAGACAATTGCTTGCATTTTAACATTTCATCATCTTATTCAAGTATGCACCATAATTATGAATGACGTGTAAGTGAGTTAAACCACAAAAGGTAAAGTATGAGGCACATGGATTTCCACTAACAATCTGTTGATTCAGACCACGTTCGATGCCCCTGATGGTAGTGACCCTGTTGCATTAGACTTCACAGGCTTTGGAAAGGGCGAAGCATGGATAAACGGACAAAGTATTGGTCGTTATTGGCCATCATATATTGCCTCCGGTCATTGTACCGCATATTGCAATTACAGAGGAGCTTATAGGTCAAGCAAATGCCTTAAGAACTGTGACAAACCATCCCAAACTCTGTAAGTTTTCACTAACTTGAGGATAAGGCTTCCAAGTAACTTCAAATGTCCTTGTATCTTATCTTCCTAATTCTCAGATACCATGTACCTCAATCCTGGCTGAAACCTACCGGCAACACCCTAGTACTTTTTGAGGAAATTGGCAGCGATCCAACTCGATTGTCGTTCGCTTCGAAACAGATCGAATCTGTGTGTGCCCATGTATCTGAGTCCCATCCACCACCTGTAGATATGTGGAGCTCAGACACCAAACTACAGAAATCAGGACCCGTACTCTCTCTTGAGTGTCCATCTCCCAATCAGATCATTTCTTCTATAAAATTTGCAAGTTTTGGCACTCCTCTTGGAACTTGTGGGAGCTTTAGCCAAGGTCAATGCAGCAGCCAAAATGCACTCTCCACTGTACAAAAGGTTTTATATTTCCTGATACAGTAGAAATTTATGGTGTTTCTGTGCATGTGTTAGAACTGGATTGTGTACTCATTGATGCTTACTCTTGTCTACAGGCTTGCATTGGATCGAAAAGTTGTAGCGTTCAAGTGTCGATTAAAGCATTCGGCGATCCTTGTAGAGGAAAAACAAAGAGCTTGGCTGTGGAAGCCTCTTGTGAA

mRNA sequence

CAATGACGGGAGTTCGATATGCGCTAGTGGTTGTTTTGTTAGTTTTAGGCGTTTTAGACTCATTTTCGCTTGCGGCCAATGTGACGTACGATCATCGGGCGCTGGTGATCGACGGCAAGCGGAGAGTGTTGGTTTCTGGATCCATACACTATCCTCGCAGCACTCCTGAGATGTGGCCGGACCTTATTCAGAAATCTAAGGATGGAGGTCTGGATGTGATTGAAACTTACGTGTTCTGGAATCTACACGAACCTGTTCGAAACCAGGTACATTTTTGCGACTTCGTTGATCCAAGAGCGGCGTTTGGTTCCAAGGAAATTCTCAATGATGGATGGAACCTGTCTTCTGTTATCATCTTTTCGGTCTTCTTACTACTACATAATATGTTGCGATTTCTCAATCAACAGTATGACTTCGAGGGAAGGAAGGATTTAGTTAAATTCGTAAAGCTGGTAGGAGCTGCTGGTCTATATGTACATATACGAATTGGTCCTTATGTGTGCGCGGAATGGAATTACGGGTACATCTCTTTTCCATTCTCAATTCTTTGGTTGCATTTCGTACCTGGCATTAAATTCCGCACAGATAATGAACCATTCAAGGTTAAGTATAACTTTACCATTTCGGTGCTTTTGCTTTTTTTTCTTCTCCAGTGTCGTGCTGAAATGAAGCGATTTACAGCCAAGATTGTTGATGTATTGAAGCAGGAGAAATTATATGCCTCTCAGGGCGGACCAGTTATTTTATCTCAGATTGAAAATGAATATGGAAACGTTCAGTCTTCTTATGGATCTGCTGCTAAATCCTATATCCAATGGGCGGCAACCATGGCTACGTCTTTGAACACGGGAGTTCCTTGGGTTATGTGCAACCAACCTGATGCCCCCGATCCCATTATTAACACTTGCAATGGATTCTACTGTGATCAATTCACGCCAAATTCTAAGAATAAGCCCAAAATCTGGACTGAGAATTGGACTGGATGGTTTCTTTCCTTTGGTGGAGCCTCGCCATACAGACCCGTGGAAGATCTTGCATATGCTGTGGCACGCTTTTATCAGAATGGTGGAACTTTACAGAATTATTACATGTACCACGGTGGGACAAACTTTGGTAGGACTACTGGTGGACCGTTTATTTCTACTAGTTATGATTATGATGCCCCTATAGATGAGTATGGACTGGTTAGGCAACCTAATTGGGGTCACTTGAGGGAAGTCCACGAGGCTATAAAGATGTGCGAAGAAGCATTAGTGAGCACAGAACCCGCAGTTAGTTCTCTTGGTCAAAATTTGGAGATTAATTCGGTAACAATGAGGCCATCCTTTTCAAACCAACCTTTGAAAGTTGATGCTAGTGCTTCTGAAGCATTTGACTCAGGATGGGGAAGTGGAAGAGGTGGTAAAGGCAATTCTAAGGTTTCTTTGGAGATCCCCATCACACTGGTACCTGGGAAAAATACAATTGACCTCCTGAGTTTGACAGTGGGACTTCAGCATTATGGAGCATTTTTTGAGACGAAAGGGGCAGGGGTCACAGGGGTAAAACTGGAAAGCCAAAAAAATGGCATCACTGTTGATATCTCTTCTGGACAATGGACATATCAGATTGGACTTAAAGGTGAAGATTTAGGACTGTCGAGTGGAAGCTCTTCACAATGGCTTTCACAACCAAGCTTGCCTAAGAATAAACCTTTGACGTGGTACAAGACCACGTTCGATGCCCCTGATGGTAGTGACCCTGTTGCATTAGACTTCACAGGCTTTGGAAAGGGCGAAGCATGGATAAACGGACAAAGTATTGGTCGTTATTGGCCATCATATATTGCCTCCGGTCATTGTACCGCATATTGCAATTACAGAGGAGCTTATAGTTTTGGCACTCCTCTTGGAACTTGTGGGAGCTTTAGCCAAGGTCAATGCAGCAGCCAAAATGCACTCTCCACTGTACAAAAGGCTTGCATTGGATCGAAAAGTTGTAGCGTTCAAGTGTCGATTAAAGCATTCGGCGATCCTTGTAGAGGAAAAACAAAGAGCTTGGCTGTGGAAGCCTCTTGTGAA

Coding sequence (CDS)

ATGACGGGAGTTCGATATGCGCTAGTGGTTGTTTTGTTAGTTTTAGGCGTTTTAGACTCATTTTCGCTTGCGGCCAATGTGACGTACGATCATCGGGCGCTGGTGATCGACGGCAAGCGGAGAGTGTTGGTTTCTGGATCCATACACTATCCTCGCAGCACTCCTGAGATGTGGCCGGACCTTATTCAGAAATCTAAGGATGGAGGTCTGGATGTGATTGAAACTTACGTGTTCTGGAATCTACACGAACCTGTTCGAAACCAGGTACATTTTTGCGACTTCGTTGATCCAAGAGCGGCGTTTGGTTCCAAGGAAATTCTCAATGATGGATGGAACCTGTCTTCTGTTATCATCTTTTCGGTCTTCTTACTACTACATAATATGTTGCGATTTCTCAATCAACAGTATGACTTCGAGGGAAGGAAGGATTTAGTTAAATTCGTAAAGCTGGTAGGAGCTGCTGGTCTATATGTACATATACGAATTGGTCCTTATGTGTGCGCGGAATGGAATTACGGGTACATCTCTTTTCCATTCTCAATTCTTTGGTTGCATTTCGTACCTGGCATTAAATTCCGCACAGATAATGAACCATTCAAGGTTAAGTATAACTTTACCATTTCGGTGCTTTTGCTTTTTTTTCTTCTCCAGTGTCGTGCTGAAATGAAGCGATTTACAGCCAAGATTGTTGATGTATTGAAGCAGGAGAAATTATATGCCTCTCAGGGCGGACCAGTTATTTTATCTCAGATTGAAAATGAATATGGAAACGTTCAGTCTTCTTATGGATCTGCTGCTAAATCCTATATCCAATGGGCGGCAACCATGGCTACGTCTTTGAACACGGGAGTTCCTTGGGTTATGTGCAACCAACCTGATGCCCCCGATCCCATTATTAACACTTGCAATGGATTCTACTGTGATCAATTCACGCCAAATTCTAAGAATAAGCCCAAAATCTGGACTGAGAATTGGACTGGATGGTTTCTTTCCTTTGGTGGAGCCTCGCCATACAGACCCGTGGAAGATCTTGCATATGCTGTGGCACGCTTTTATCAGAATGGTGGAACTTTACAGAATTATTACATGTACCACGGTGGGACAAACTTTGGTAGGACTACTGGTGGACCGTTTATTTCTACTAGTTATGATTATGATGCCCCTATAGATGAGTATGGACTGGTTAGGCAACCTAATTGGGGTCACTTGAGGGAAGTCCACGAGGCTATAAAGATGTGCGAAGAAGCATTAGTGAGCACAGAACCCGCAGTTAGTTCTCTTGGTCAAAATTTGGAGATTAATTCGGTAACAATGAGGCCATCCTTTTCAAACCAACCTTTGAAAGTTGATGCTAGTGCTTCTGAAGCATTTGACTCAGGATGGGGAAGTGGAAGAGGTGGTAAAGGCAATTCTAAGGTTTCTTTGGAGATCCCCATCACACTGGTACCTGGGAAAAATACAATTGACCTCCTGAGTTTGACAGTGGGACTTCAGCATTATGGAGCATTTTTTGAGACGAAAGGGGCAGGGGTCACAGGGGTAAAACTGGAAAGCCAAAAAAATGGCATCACTGTTGATATCTCTTCTGGACAATGGACATATCAGATTGGACTTAAAGGTGAAGATTTAGGACTGTCGAGTGGAAGCTCTTCACAATGGCTTTCACAACCAAGCTTGCCTAAGAATAAACCTTTGACGTGGTACAAGACCACGTTCGATGCCCCTGATGGTAGTGACCCTGTTGCATTAGACTTCACAGGCTTTGGAAAGGGCGAAGCATGGATAAACGGACAAAGTATTGGTCGTTATTGGCCATCATATATTGCCTCCGGTCATTGTACCGCATATTGCAATTACAGAGGAGCTTATAGTTTTGGCACTCCTCTTGGAACTTGTGGGAGCTTTAGCCAAGGTCAATGCAGCAGCCAAAATGCACTCTCCACTGTACAAAAGGCTTGCATTGGATCGAAAAGTTGTAGCGTTCAAGTGTCGATTAAAGCATTCGGCGATCCTTGTAGAGGAAAAACAAAGAGCTTGGCTGTGGAAGCCTCTTGTGAA

Protein sequence

MTGVRYALVVVLLVLGVLDSFSLAANVTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQVHFCDFVDPRAAFGSKEILNDGWNLSSVIIFSVFLLLHNMLRFLNQQYDFEGRKDLVKFVKLVGAAGLYVHIRIGPYVCAEWNYGYISFPFSILWLHFVPGIKFRTDNEPFKVKYNFTISVLLLFFLLQCRAEMKRFTAKIVDVLKQEKLYASQGGPVILSQIENEYGNVQSSYGSAAKSYIQWAATMATSLNTGVPWVMCNQPDAPDPIINTCNGFYCDQFTPNSKNKPKIWTENWTGWFLSFGGASPYRPVEDLAYAVARFYQNGGTLQNYYMYHGGTNFGRTTGGPFISTSYDYDAPIDEYGLVRQPNWGHLREVHEAIKMCEEALVSTEPAVSSLGQNLEINSVTMRPSFSNQPLKVDASASEAFDSGWGSGRGGKGNSKVSLEIPITLVPGKNTIDLLSLTVGLQHYGAFFETKGAGVTGVKLESQKNGITVDISSGQWTYQIGLKGEDLGLSSGSSSQWLSQPSLPKNKPLTWYKTTFDAPDGSDPVALDFTGFGKGEAWINGQSIGRYWPSYIASGHCTAYCNYRGAYSFGTPLGTCGSFSQGQCSSQNALSTVQKACIGSKSCSVQVSIKAFGDPCRGKTKSLAVEASCE
BLAST of Cp4.1LG02g12820 vs. Swiss-Prot
Match: BGAL8_ARATH (Beta-galactosidase 8 OS=Arabidopsis thaliana GN=BGAL8 PE=2 SV=2)

HSP 1 Score: 481.5 bits (1238), Expect = 1.5e-134
Identity = 249/432 (57.64%), Postives = 296/432 (68.52%), Query Frame = 1

Query: 1   MTGVRYALVVVLLVLGVLDSFSLAANVTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPD 60
           M  VR   +++LL+L ++ + + AANVTYDHRALV                         
Sbjct: 7   MVKVRKMEMILLLILVIVVA-ATAANVTYDHRALV------------------------- 66

Query: 61  LIQKSKDGGLDVIETYVFWNLHEPVRNQVHFCDFVDPRAAFGSKEILNDGWNLSSVIIFS 120
                 DG   V+   +  ++H P      + + +      G             VI   
Sbjct: 67  -----IDGKRKVL---ISGSIHYPRSTPEMWPELIQKSKDGGL-----------DVIETY 126

Query: 121 VFLLLHNMLRFLNQQYDFEGRKDLVKFVKLVGAAGLYVHIRIGPYVCAEWNYGYISFPFS 180
           VF   H   +    +Y+FEGR DLVKFVKL   AGLYVH+RIGPYVCAEWNYG   FP  
Sbjct: 127 VFWSGHEPEK---NKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYG--GFP-- 186

Query: 181 ILWLHFVPGIKFRTDNEPFKVKYNFTISVLLLFFLLQCRAEMKRFTAKIVDVLKQEKLYA 240
            +WLHFVPGIKFRTDNEPFK                    EM+RFT KIVD++KQEKLYA
Sbjct: 187 -VWLHFVPGIKFRTDNEPFK-------------------EEMQRFTTKIVDLMKQEKLYA 246

Query: 241 SQGGPVILSQIENEYGNVQSSYGSAAKSYIQWAATMATSLNTGVPWVMCNQPDAPDPIIN 300
           SQGGP+ILSQIENEYGN+ S+YG+AAKSYI+W+A+MA SL+TGVPW MC Q DAPDP+IN
Sbjct: 247 SQGGPIILSQIENEYGNIDSAYGAAAKSYIKWSASMALSLDTGVPWNMCQQTDAPDPMIN 306

Query: 301 TCNGFYCDQFTPNSKNKPKIWTENWTGWFLSFGGASPYRPVEDLAYAVARFYQNGGTLQN 360
           TCNGFYCDQFTPNS NKPK+WTENW+GWFL FG  SPYRPVEDLA+AVARFYQ GGT QN
Sbjct: 307 TCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFGDPSPYRPVEDLAFAVARFYQRGGTFQN 366

Query: 361 YYMYHGGTNFGRTTGGPFISTSYDYDAPIDEYGLVRQPNWGHLREVHEAIKMCEEALVST 420
           YYMYHGGTNF RT+GGP ISTSYDYDAPIDEYGL+RQP WGHLR++H+AIK+CE+AL++T
Sbjct: 367 YYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKLCEDALIAT 366

Query: 421 EPAVSSLGQNLE 433
           +P ++SLG NLE
Sbjct: 427 DPTITSLGSNLE 366

BLAST of Cp4.1LG02g12820 vs. Swiss-Prot
Match: BGAL6_ORYSJ (Beta-galactosidase 6 OS=Oryza sativa subsp. japonica GN=Os03g0255100 PE=1 SV=2)

HSP 1 Score: 480.7 bits (1236), Expect = 2.6e-134
Identity = 252/461 (54.66%), Postives = 315/461 (68.33%), Query Frame = 1

Query: 116 VIIFSVFLLLHNMLRFLNQQYDFEGRKDLVKFVKLVGAAGLYVHIRIGPYVCAEWNYGYI 175
           VI   VF  +H  +R    QYDFEGRKDLV+FVK V  AGLYVH+RIGPYVCAEWNYG  
Sbjct: 78  VIETYVFWDIHEAVR---GQYDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYG-- 137

Query: 176 SFPFSILWLHFVPGIKFRTDNEPFKVKYNFTISVLLLFFLLQCRAEMKRFTAKIVDVLKQ 235
            FP   +WLHFVPGIKFRTDNE FK                   AEM+RFT K+VD +K 
Sbjct: 138 GFP---VWLHFVPGIKFRTDNEAFK-------------------AEMQRFTEKVVDTMKG 197

Query: 236 EKLYASQGGPVILSQIENEYGNVQSSYGSAAKSYIQWAATMATSLNTGVPWVMCNQPDAP 295
             LYASQGGP+ILSQIENEYGN+ S+YG+A K+Y++WAA MA SL+TGVPWVMC Q DAP
Sbjct: 198 AGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMRWAAGMAVSLDTGVPWVMCQQSDAP 257

Query: 296 DPIINTCNGFYCDQFTPNSKNKPKIWTENWTGWFLSFGGASPYRPVEDLAYAVARFYQNG 355
           DP+INTCNGFYCDQFTPNSK+KPK+WTENW+GWFLSFGGA PYRP EDLA+AVARFYQ G
Sbjct: 258 DPLINTCNGFYCDQFTPNSKSKPKMWTENWSGWFLSFGGAVPYRPAEDLAFAVARFYQRG 317

Query: 356 GTLQNYYMYHGGTNFGRTTGGPFISTSYDYDAPIDEYGLVRQPNWGHLREVHEAIKMCEE 415
           GT QNYYMYHGGTNFGR+TGGPFI+TSYDYDAPIDEYG+VRQP WGHLR+VH+AIK+CE 
Sbjct: 318 GTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDEYGMVRQPKWGHLRDVHKAIKLCEP 377

Query: 416 ALVSTEPAVSSLGQNLEINSVTMRPSFSNQPLKVDASASEAFDSGWGSGRGGKGNSKVSL 475
           AL++ EP+ SSLGQN E    T+  +  N    + A+     D+         GN+    
Sbjct: 378 ALIAAEPSYSSLGQNTE---ATVYQTADN---SICAAFLANVDAQSDKTVKFNGNTYKLP 437

Query: 476 EIPITLVPGKNTIDLLSLTVGLQHYGAFFETKGAGVTGVKLESQKNGITVDISSGQWTYQ 535
              ++++P    + L +  +  Q   +   + G+ +     ++  + IT ++++  W+Y 
Sbjct: 438 AWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSSIQ----DTDDSLITPELATAGWSYA 489

Query: 536 IGLKGEDLGLSSGSSSQWLSQPSLPKNKPLTWYKTTFDAPD 577
           I    E +G++  ++   L++P L     +    TT DA D
Sbjct: 498 I----EPVGITKENA---LTKPGL-----MEQINTTADASD 489

BLAST of Cp4.1LG02g12820 vs. Swiss-Prot
Match: BGAL_ASPOF (Beta-galactosidase OS=Asparagus officinalis PE=2 SV=1)

HSP 1 Score: 424.1 bits (1089), Expect = 2.9e-117
Identity = 200/298 (67.11%), Postives = 226/298 (75.84%), Query Frame = 1

Query: 135 QYDFEGRKDLVKFVKLVGAAGLYVHIRIGPYVCAEWNYGYISFPFSILWLHFVPGIKFRT 194
           QY F GR DLV+F+KLV  AGLY H+RIGPYVCAEWN+G   FP   +WL +VPGI FRT
Sbjct: 88  QYYFGGRYDLVRFLKLVKQAGLYAHLRIGPYVCAEWNFG--GFP---VWLKYVPGIHFRT 147

Query: 195 DNEPFKVKYNFTISVLLLFFLLQCRAEMKRFTAKIVDVLKQEKLYASQGGPVILSQIENE 254
           DN PFK                   A M +FT KIV ++K E LY +QGGP+ILSQIENE
Sbjct: 148 DNGPFK-------------------AAMGKFTEKIVSMMKAEGLYETQGGPIILSQIENE 207

Query: 255 YGNVQSSYGSAAKSYIQWAATMATSLNTGVPWVMCNQPDAPDPIINTCNGFYCDQFTPNS 314
           YG V+   G+A KSY  WAA MA  LNTGVPWVMC Q DAPDP+INTCNGFYCD F+PN 
Sbjct: 208 YGPVEYYDGAAGKSYTNWAAKMAVGLNTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNK 267

Query: 315 KNKPKIWTENWTGWFLSFGGASPYRPVEDLAYAVARFYQNGGTLQNYYMYHGGTNFGRTT 374
            NKPK+WTE WTGWF  FGGA P RP ED+A+AVARF Q GG+  NYYMYHGGTNFGRT 
Sbjct: 268 DNKPKMWTEAWTGWFTGFGGAVPQRPAEDMAFAVARFIQKGGSFINYYMYHGGTNFGRTA 327

Query: 375 GGPFISTSYDYDAPIDEYGLVRQPNWGHLREVHEAIKMCEEALVSTEPAVSSLGQNLE 433
           GGPFISTSYDYDAPIDEYGL+RQP WGHLR++H+AIK+CE ALVS EP ++SLGQN E
Sbjct: 328 GGPFISTSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKLCEPALVSGEPTITSLGQNQE 361

BLAST of Cp4.1LG02g12820 vs. Swiss-Prot
Match: BGAL_SOLLC (Beta-galactosidase OS=Solanum lycopersicum PE=1 SV=1)

HSP 1 Score: 411.0 bits (1055), Expect = 2.6e-113
Identity = 189/298 (63.42%), Postives = 222/298 (74.50%), Query Frame = 1

Query: 135 QYDFEGRKDLVKFVKLVGAAGLYVHIRIGPYVCAEWNYGYISFPFSILWLHFVPGIKFRT 194
           +Y FE R DLVKF+K+V  AGLYVH+RIGPY CAEWN+G   FP   +WL +VPGI FRT
Sbjct: 85  KYYFEERYDLVKFIKVVQEAGLYVHLRIGPYACAEWNFG--GFP---VWLKYVPGISFRT 144

Query: 195 DNEPFKVKYNFTISVLLLFFLLQCRAEMKRFTAKIVDVLKQEKLYASQGGPVILSQIENE 254
           +NEPFK                   A M++FT KIVD++K EKLY +QGGP+ILSQIENE
Sbjct: 145 NNEPFK-------------------AAMQKFTTKIVDMMKAEKLYETQGGPIILSQIENE 204

Query: 255 YGNVQSSYGSAAKSYIQWAATMATSLNTGVPWVMCNQPDAPDPIINTCNGFYCDQFTPNS 314
           YG ++   G   K Y +WAA MA  L TGVPW+MC Q D PDPIINTCNGFYCD FTPN 
Sbjct: 205 YGPMEWELGEPGKVYSEWAAKMAVDLGTGVPWIMCKQDDVPDPIINTCNGFYCDYFTPNK 264

Query: 315 KNKPKIWTENWTGWFLSFGGASPYRPVEDLAYAVARFYQNGGTLQNYYMYHGGTNFGRTT 374
            NKPK+WTE WT WF  FGG  PYRP ED+A+AVARF Q GG+  NYYMYHGGTNFGRT+
Sbjct: 265 ANKPKMWTEAWTAWFTEFGGPVPYRPAEDMAFAVARFIQTGGSFINYYMYHGGTNFGRTS 324

Query: 375 GGPFISTSYDYDAPIDEYGLVRQPNWGHLREVHEAIKMCEEALVSTEPAVSSLGQNLE 433
           GGPFI+TSYDYDAP+DE+G +RQP WGHL+++H AIK+CE ALVS +P V+SLG   E
Sbjct: 325 GGPFIATSYDYDAPLDEFGSLRQPKWGHLKDLHRAIKLCEPALVSVDPTVTSLGNYQE 358

BLAST of Cp4.1LG02g12820 vs. Swiss-Prot
Match: BGAL1_ARATH (Beta-galactosidase 1 OS=Arabidopsis thaliana GN=BGAL1 PE=2 SV=1)

HSP 1 Score: 406.0 bits (1042), Expect = 8.3e-112
Identity = 188/298 (63.09%), Postives = 220/298 (73.83%), Query Frame = 1

Query: 135 QYDFEGRKDLVKFVKLVGAAGLYVHIRIGPYVCAEWNYGYISFPFSILWLHFVPGIKFRT 194
           +Y FEG  DLVKFVKLV  +GLY+H+RIGPYVCAEWN+G   FP   +WL ++PGI FRT
Sbjct: 95  KYYFEGNYDLVKFVKLVQQSGLYLHLRIGPYVCAEWNFG--GFP---VWLKYIPGISFRT 154

Query: 195 DNEPFKVKYNFTISVLLLFFLLQCRAEMKRFTAKIVDVLKQEKLYASQGGPVILSQIENE 254
           DN PFK                   A+M+RFT KIV+++K E+L+ SQGGP+ILSQIENE
Sbjct: 155 DNGPFK-------------------AQMQRFTTKIVNMMKAERLFESQGGPIILSQIENE 214

Query: 255 YGNVQSSYGSAAKSYIQWAATMATSLNTGVPWVMCNQPDAPDPIINTCNGFYCDQFTPNS 314
           YG ++   G+  +SY  WAA MA  L TGVPWVMC Q DAPDPIIN CNGFYCD F+PN 
Sbjct: 215 YGPMEYELGAPGRSYTNWAAKMAVGLGTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNK 274

Query: 315 KNKPKIWTENWTGWFLSFGGASPYRPVEDLAYAVARFYQNGGTLQNYYMYHGGTNFGRTT 374
             KPK+WTE WTGWF  FGG  PYRP ED+A++VARF Q GG+  NYYMYHGGTNFGRT 
Sbjct: 275 AYKPKMWTEAWTGWFTKFGGPVPYRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTA 334

Query: 375 GGPFISTSYDYDAPIDEYGLVRQPNWGHLREVHEAIKMCEEALVSTEPAVSSLGQNLE 433
           GGPFI+TSYDYDAP+DEYGL RQP WGHL+++H AIK+CE ALVS EP    LG   E
Sbjct: 335 GGPFIATSYDYDAPLDEYGLERQPKWGHLKDLHRAIKLCEPALVSGEPTRMPLGNYQE 368

BLAST of Cp4.1LG02g12820 vs. TrEMBL
Match: A0A0E0GJW0_ORYNI (Beta-galactosidase OS=Oryza nivara PE=3 SV=1)

HSP 1 Score: 664.8 bits (1714), Expect = 1.1e-187
Identity = 339/551 (61.52%), Postives = 394/551 (71.51%), Query Frame = 1

Query: 116 VIIFSVFLLLHNMLRFLNQQYDFEGRKDLVKFVKLVGAAGLYVHIRIGPYVCAEWNYGYI 175
           VI   VF  +H  +R   QQYDFEGRKDLV+FVK V  AGLYVH+RIGPYVCAEWNYG  
Sbjct: 178 VIETYVFWDIHEPVRGQAQQYDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYG-- 237

Query: 176 SFPFSILWLHFVPGIKFRTDNEPFKVKYNFTISVLLLFFLLQCRAEMKRFTAKIVDVLKQ 235
            FP   +WLHFVPGIKFRTDNE FK                   AEM+RFT K+VD +K 
Sbjct: 238 GFP---VWLHFVPGIKFRTDNEAFK-------------------AEMQRFTEKVVDTMKG 297

Query: 236 EKLYASQGGPVILSQIENEYGNVQSSYGSAAKSYIQWAATMATSLNTGVPWVMCNQPDAP 295
             LYASQGGP+ILSQIENEYGN+ S+YG+A K+Y++WAA MA SL+TGVPWVMC Q DAP
Sbjct: 298 AGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMRWAAGMAVSLDTGVPWVMCQQSDAP 357

Query: 296 DPIINTCNGFYCDQFTPNSKNKPKIWTENWTGWFLSFGGASPYRPVEDLAYAVARFYQNG 355
           DP+INTCNGFYCDQFTPNSK+KPK+WTENW+GWFLSFGGA PYRP EDLA+AVARFYQ G
Sbjct: 358 DPLINTCNGFYCDQFTPNSKSKPKMWTENWSGWFLSFGGAVPYRPAEDLAFAVARFYQRG 417

Query: 356 GTLQNYYMYHGGTNFGRTTGGPFISTSYDYDAPIDEYGLVRQPNWGHLREVHEAIKMCEE 415
           GT QNYYMYHGGTNFGR+TGGPFI+TSYDYDAPIDEYG+VRQP WGHLR+VH+AIK+CE 
Sbjct: 418 GTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDEYGMVRQPKWGHLRDVHKAIKLCEP 477

Query: 416 ALVSTEPAVSSLGQNLEINSVTMRPSFSNQ-----PLKVDASASEAFD--------SGW- 475
           AL++ EP+ SSLGQN E    T+  +  N         VDA + +A            W 
Sbjct: 478 ALIAAEPSYSSLGQNTE---ATVYQTADNSICAAFLANVDAQSDKAVKFNGNTYKLPAWS 537

Query: 476 ------------GSGRGGKGNSKVSLEIPITLVPGKNTIDLLSLTVGLQHYGAFFETKGA 535
                        + +G   +S +SL+ P+TLVPGKN IDLLS TVGL +YGAFF+  GA
Sbjct: 538 VSILPDCKNVVLNTAQGSASSSLISLQTPVTLVPGKNKIDLLSTTVGLSNYGAFFDLIGA 597

Query: 536 GVTGVKLESQKNGITVDISSGQWTYQIGLKGEDLGL--SSGSSSQWLSQPSLPKNKPLTW 595
           GVTG    S  NG  +++SS  WTYQIGL+GEDL L   S +S +W+S  + P N+PL W
Sbjct: 598 GVTGPVKLSGPNG-ALNLSSTDWTYQIGLRGEDLHLYNPSEASPEWVSDNAYPTNQPLIW 657

Query: 596 YKTTFDAPDGSDPVALDFTGFGKGEAWINGQSIGRYWPSYIA-SGHCTAYCNYRGAYSFG 638
           YKT F AP G DPVA+DFTG GKGEAW+NGQSIGRYWP+ +A    C   CNYRGAYS  
Sbjct: 658 YKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQSGCVNSCNYRGAYSSN 700

BLAST of Cp4.1LG02g12820 vs. TrEMBL
Match: A0A0E0CY77_9ORYZ (Beta-galactosidase OS=Oryza meridionalis PE=3 SV=1)

HSP 1 Score: 593.2 bits (1528), Expect = 4.1e-166
Identity = 314/575 (54.61%), Postives = 390/575 (67.83%), Query Frame = 1

Query: 4   VRYALVVVLLVLGVLDSFSLAANVTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLIQ 63
           +R  L+ VL+V+ +L   S AANVTYDHRA+VIDG RRVLVSGSIHYPRSTP+MWP LIQ
Sbjct: 14  LRLRLLPVLVVVSLLVGASRAANVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQ 73

Query: 64  KSKDGGLDVIETYVFWNLHEPVRNQVHFCDFVDPRAAFGSKEILNDGWNLSSVIIFSVF- 123
           KSKDGGLDVIETYVFW++HEPVR Q                        LS +++ +   
Sbjct: 74  KSKDGGLDVIETYVFWDIHEPVRGQART--------------------TLSQLLLSTTAR 133

Query: 124 -LLLHNMLRFLNQQYDFEGRKDLVKFVKLVGAAGLYVHIRIGPYVCAEWNYGYISFPFSI 183
            ++LH +       YDFEGRKDLV+FVK V  AGLYVH+RIGPYVCAEWNYG   FP   
Sbjct: 134 SVVLHLI-------YDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYG--GFP--- 193

Query: 184 LWLHFVPGIKFRTDNEPFKVKYNFTISVLLLFFLLQCRAEMKRFTAKIVDVLKQEKLYAS 243
           +WLHFVPGIKFRTDNE FK                   AEM+RFT K+VD +K   LYAS
Sbjct: 194 VWLHFVPGIKFRTDNEAFK-------------------AEMQRFTEKVVDTMKGAGLYAS 253

Query: 244 QGGPVILSQIENEYGNVQSSYGSAAKSYIQWAATMATSLNTGVPWVMCNQPDAPDPIINT 303
           QGGP+ILSQIENEYGN+ S+YG+A K+Y++WAA MA SL+TGVPWVMC Q DAPDP+INT
Sbjct: 254 QGGPIILSQIENEYGNIDSAYGAAGKAYMRWAAGMAVSLDTGVPWVMCQQSDAPDPLINT 313

Query: 304 CNGFYCDQFTPNSKNKPKIWTENWTGWFLSFGGASPYRPVEDLAYAVARFYQNGGTLQNY 363
           CNGFYCDQFTPNSK+KPK+WTENW+GWFLSFGGA PYRP EDLA+AVARFYQ GGT QNY
Sbjct: 314 CNGFYCDQFTPNSKSKPKMWTENWSGWFLSFGGAVPYRPAEDLAFAVARFYQRGGTFQNY 373

Query: 364 YMYHGGTNFGRTTGGPFISTSYDYDAPIDEYGLVRQPNWGHLREVHEAIKMCEEALVSTE 423
           YMYHGGTNFGR+TGGPFI+TSYDYDAPIDEYG+VRQP WGHLR+VH+AIK+CE AL++ E
Sbjct: 374 YMYHGGTNFGRSTGGPFIATSYDYDAPIDEYGMVRQPKWGHLRDVHKAIKLCEPALIAAE 433

Query: 424 PAVSSLGQNLEINSVTMRPSFSNQPLKVDASASEAFDSGWGSGRGGKGNSKVSLEIPITL 483
           P+ SSLGQN E    T+  +  N    + A+     D+         GN+       +++
Sbjct: 434 PSYSSLGQNTE---ATVYQTADN---SICAAFLANVDAQSDKTVKFNGNTYKLPAWSVSI 493

Query: 484 VPGKNTIDLLSLTVGLQHYGAFFETKGAGVTGVKLESQKNGITVDISSGQWTYQIGLKGE 543
           +P    + L +  +  Q   +   + G+       ++  + IT ++++  W+Y I    E
Sbjct: 494 LPDCKNVVLNTAQINSQVTTSEMRSLGSSAQ----DTDDSSITPELATAGWSYAI----E 515

Query: 544 DLGLSSGSSSQWLSQPSLPKNKPLTWYKTTFDAPD 577
            +G++  ++   L++P L     +    TT DA D
Sbjct: 554 PVGITKENA---LTKPGL-----MEQINTTADASD 515

BLAST of Cp4.1LG02g12820 vs. TrEMBL
Match: A0A0E0CY76_9ORYZ (Beta-galactosidase OS=Oryza meridionalis PE=3 SV=1)

HSP 1 Score: 593.2 bits (1528), Expect = 4.1e-166
Identity = 314/575 (54.61%), Postives = 390/575 (67.83%), Query Frame = 1

Query: 4   VRYALVVVLLVLGVLDSFSLAANVTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLIQ 63
           +R  L+ VL+V+ +L   S AANVTYDHRA+VIDG RRVLVSGSIHYPRSTP+MWP LIQ
Sbjct: 14  LRLRLLPVLVVVSLLVGASRAANVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQ 73

Query: 64  KSKDGGLDVIETYVFWNLHEPVRNQVHFCDFVDPRAAFGSKEILNDGWNLSSVIIFSVF- 123
           KSKDGGLDVIETYVFW++HEPVR Q                        LS +++ +   
Sbjct: 74  KSKDGGLDVIETYVFWDIHEPVRGQART--------------------TLSQLLLSTTAR 133

Query: 124 -LLLHNMLRFLNQQYDFEGRKDLVKFVKLVGAAGLYVHIRIGPYVCAEWNYGYISFPFSI 183
            ++LH +       YDFEGRKDLV+FVK V  AGLYVH+RIGPYVCAEWNYG   FP   
Sbjct: 134 SVVLHLI-------YDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYG--GFP--- 193

Query: 184 LWLHFVPGIKFRTDNEPFKVKYNFTISVLLLFFLLQCRAEMKRFTAKIVDVLKQEKLYAS 243
           +WLHFVPGIKFRTDNE FK                   AEM+RFT K+VD +K   LYAS
Sbjct: 194 VWLHFVPGIKFRTDNEAFK-------------------AEMQRFTEKVVDTMKGAGLYAS 253

Query: 244 QGGPVILSQIENEYGNVQSSYGSAAKSYIQWAATMATSLNTGVPWVMCNQPDAPDPIINT 303
           QGGP+ILSQIENEYGN+ S+YG+A K+Y++WAA MA SL+TGVPWVMC Q DAPDP+INT
Sbjct: 254 QGGPIILSQIENEYGNIDSAYGAAGKAYMRWAAGMAVSLDTGVPWVMCQQSDAPDPLINT 313

Query: 304 CNGFYCDQFTPNSKNKPKIWTENWTGWFLSFGGASPYRPVEDLAYAVARFYQNGGTLQNY 363
           CNGFYCDQFTPNSK+KPK+WTENW+GWFLSFGGA PYRP EDLA+AVARFYQ GGT QNY
Sbjct: 314 CNGFYCDQFTPNSKSKPKMWTENWSGWFLSFGGAVPYRPAEDLAFAVARFYQRGGTFQNY 373

Query: 364 YMYHGGTNFGRTTGGPFISTSYDYDAPIDEYGLVRQPNWGHLREVHEAIKMCEEALVSTE 423
           YMYHGGTNFGR+TGGPFI+TSYDYDAPIDEYG+VRQP WGHLR+VH+AIK+CE AL++ E
Sbjct: 374 YMYHGGTNFGRSTGGPFIATSYDYDAPIDEYGMVRQPKWGHLRDVHKAIKLCEPALIAAE 433

Query: 424 PAVSSLGQNLEINSVTMRPSFSNQPLKVDASASEAFDSGWGSGRGGKGNSKVSLEIPITL 483
           P+ SSLGQN E    T+  +  N    + A+     D+         GN+       +++
Sbjct: 434 PSYSSLGQNTE---ATVYQTADN---SICAAFLANVDAQSDKTVKFNGNTYKLPAWSVSI 493

Query: 484 VPGKNTIDLLSLTVGLQHYGAFFETKGAGVTGVKLESQKNGITVDISSGQWTYQIGLKGE 543
           +P    + L +  +  Q   +   + G+       ++  + IT ++++  W+Y I    E
Sbjct: 494 LPDCKNVVLNTAQINSQVTTSEMRSLGSSAQ----DTDDSSITPELATAGWSYAI----E 515

Query: 544 DLGLSSGSSSQWLSQPSLPKNKPLTWYKTTFDAPD 577
            +G++  ++   L++P L     +    TT DA D
Sbjct: 554 PVGITKENA---LTKPGL-----MEQINTTADASD 515

BLAST of Cp4.1LG02g12820 vs. TrEMBL
Match: A0A0E0CY79_9ORYZ (Beta-galactosidase OS=Oryza meridionalis PE=3 SV=1)

HSP 1 Score: 593.2 bits (1528), Expect = 4.1e-166
Identity = 314/575 (54.61%), Postives = 390/575 (67.83%), Query Frame = 1

Query: 4   VRYALVVVLLVLGVLDSFSLAANVTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLIQ 63
           +R  L+ VL+V+ +L   S AANVTYDHRA+VIDG RRVLVSGSIHYPRSTP+MWP LIQ
Sbjct: 14  LRLRLLPVLVVVSLLVGASRAANVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQ 73

Query: 64  KSKDGGLDVIETYVFWNLHEPVRNQVHFCDFVDPRAAFGSKEILNDGWNLSSVIIFSVF- 123
           KSKDGGLDVIETYVFW++HEPVR Q                        LS +++ +   
Sbjct: 74  KSKDGGLDVIETYVFWDIHEPVRGQART--------------------TLSQLLLSTTAR 133

Query: 124 -LLLHNMLRFLNQQYDFEGRKDLVKFVKLVGAAGLYVHIRIGPYVCAEWNYGYISFPFSI 183
            ++LH +       YDFEGRKDLV+FVK V  AGLYVH+RIGPYVCAEWNYG   FP   
Sbjct: 134 SVVLHLI-------YDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYG--GFP--- 193

Query: 184 LWLHFVPGIKFRTDNEPFKVKYNFTISVLLLFFLLQCRAEMKRFTAKIVDVLKQEKLYAS 243
           +WLHFVPGIKFRTDNE FK                   AEM+RFT K+VD +K   LYAS
Sbjct: 194 VWLHFVPGIKFRTDNEAFK-------------------AEMQRFTEKVVDTMKGAGLYAS 253

Query: 244 QGGPVILSQIENEYGNVQSSYGSAAKSYIQWAATMATSLNTGVPWVMCNQPDAPDPIINT 303
           QGGP+ILSQIENEYGN+ S+YG+A K+Y++WAA MA SL+TGVPWVMC Q DAPDP+INT
Sbjct: 254 QGGPIILSQIENEYGNIDSAYGAAGKAYMRWAAGMAVSLDTGVPWVMCQQSDAPDPLINT 313

Query: 304 CNGFYCDQFTPNSKNKPKIWTENWTGWFLSFGGASPYRPVEDLAYAVARFYQNGGTLQNY 363
           CNGFYCDQFTPNSK+KPK+WTENW+GWFLSFGGA PYRP EDLA+AVARFYQ GGT QNY
Sbjct: 314 CNGFYCDQFTPNSKSKPKMWTENWSGWFLSFGGAVPYRPAEDLAFAVARFYQRGGTFQNY 373

Query: 364 YMYHGGTNFGRTTGGPFISTSYDYDAPIDEYGLVRQPNWGHLREVHEAIKMCEEALVSTE 423
           YMYHGGTNFGR+TGGPFI+TSYDYDAPIDEYG+VRQP WGHLR+VH+AIK+CE AL++ E
Sbjct: 374 YMYHGGTNFGRSTGGPFIATSYDYDAPIDEYGMVRQPKWGHLRDVHKAIKLCEPALIAAE 433

Query: 424 PAVSSLGQNLEINSVTMRPSFSNQPLKVDASASEAFDSGWGSGRGGKGNSKVSLEIPITL 483
           P+ SSLGQN E    T+  +  N    + A+     D+         GN+       +++
Sbjct: 434 PSYSSLGQNTE---ATVYQTADN---SICAAFLANVDAQSDKTVKFNGNTYKLPAWSVSI 493

Query: 484 VPGKNTIDLLSLTVGLQHYGAFFETKGAGVTGVKLESQKNGITVDISSGQWTYQIGLKGE 543
           +P    + L +  +  Q   +   + G+       ++  + IT ++++  W+Y I    E
Sbjct: 494 LPDCKNVVLNTAQINSQVTTSEMRSLGSSAQ----DTDDSSITPELATAGWSYAI----E 515

Query: 544 DLGLSSGSSSQWLSQPSLPKNKPLTWYKTTFDAPD 577
            +G++  ++   L++P L     +    TT DA D
Sbjct: 554 PVGITKENA---LTKPGL-----MEQINTTADASD 515

BLAST of Cp4.1LG02g12820 vs. TrEMBL
Match: M0SQP6_MUSAM (Beta-galactosidase OS=Musa acuminata subsp. malaccensis PE=3 SV=1)

HSP 1 Score: 533.5 bits (1373), Expect = 3.8e-148
Identity = 267/425 (62.82%), Postives = 304/425 (71.53%), Query Frame = 1

Query: 8   LVVVLLVLGVLDSFSLAANVTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLIQKSKD 67
           LV+ L  L        AA VTYDHRALVIDG RRVL+SGSIHYPRSTP            
Sbjct: 21  LVIFLCFLCGCSHLCAAATVTYDHRALVIDGTRRVLISGSIHYPRSTP------------ 80

Query: 68  GGLDVIETYVFWNLHEPVRNQVHFCDFVDPRAAFGSKEILNDGWNLSSVIIFSVFLLLHN 127
                       NL   V     + D ++           N G ++    +F       N
Sbjct: 81  -----------ENLQPSVAVLQMWPDLIEKSK--------NGGLDVVETYVF------WN 140

Query: 128 MLRFLNQQYDFEGRKDLVKFVKLVGAAGLYVHIRIGPYVCAEWNYGYISFPFSILWLHFV 187
           +   +  QYDFEGRKDLV+FVK V  AGLYVH+RIGPYVCAEWNYG   FP   LWLHF+
Sbjct: 141 LHEPVQGQYDFEGRKDLVRFVKTVAEAGLYVHLRIGPYVCAEWNYG--GFP---LWLHFI 200

Query: 188 PGIKFRTDNEPFKVKYNFTISVLLLFFLLQCRAEMKRFTAKIVDVLKQEKLYASQGGPVI 247
           PGIKFRTDNEPFK                    EM+RFT KIV+++KQEKLYASQGGP+I
Sbjct: 201 PGIKFRTDNEPFK-------------------REMQRFTTKIVEMMKQEKLYASQGGPII 260

Query: 248 LSQIENEYGNVQSSYGSAAKSYIQWAATMATSLNTGVPWVMCNQPDAPDPIINTCNGFYC 307
           LSQIENEYGN+ SSYG+AAK+YI W+A+MATSL+TGVPWVMC Q DAPDPIINTCNGFYC
Sbjct: 261 LSQIENEYGNIDSSYGAAAKTYINWSASMATSLDTGVPWVMCQQADAPDPIINTCNGFYC 320

Query: 308 DQFTPNSKNKPKIWTENWTGWFLSFGGASPYRPVEDLAYAVARFYQNGGTLQNYYMYHGG 367
           DQFTPNS  KPK+WTENWTGWFLSFGG  PYRPVEDLA+AVARF+Q GGT QNYYMYHGG
Sbjct: 321 DQFTPNSNKKPKMWTENWTGWFLSFGGGVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGG 380

Query: 368 TNFGRTTGGPFISTSYDYDAPIDEYGLVRQPNWGHLREVHEAIKMCEEALVSTEPAVSSL 427
           TNFGRTTGGPFI+TSYDYDAPIDEYG++RQP WGHLR++H+ IK+CE ALV+T+P  +SL
Sbjct: 381 TNFGRTTGGPFIATSYDYDAPIDEYGILRQPKWGHLRDLHKVIKLCEGALVATDPTYTSL 384

Query: 428 GQNLE 433
           GQNLE
Sbjct: 441 GQNLE 384

BLAST of Cp4.1LG02g12820 vs. TAIR10
Match: AT2G28470.1 (AT2G28470.1 beta-galactosidase 8)

HSP 1 Score: 481.5 bits (1238), Expect = 8.7e-136
Identity = 249/432 (57.64%), Postives = 296/432 (68.52%), Query Frame = 1

Query: 1   MTGVRYALVVVLLVLGVLDSFSLAANVTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPD 60
           M  VR   +++LL+L ++ + + AANVTYDHRALV                         
Sbjct: 7   MVKVRKMEMILLLILVIVVA-ATAANVTYDHRALV------------------------- 66

Query: 61  LIQKSKDGGLDVIETYVFWNLHEPVRNQVHFCDFVDPRAAFGSKEILNDGWNLSSVIIFS 120
                 DG   V+   +  ++H P      + + +      G             VI   
Sbjct: 67  -----IDGKRKVL---ISGSIHYPRSTPEMWPELIQKSKDGGL-----------DVIETY 126

Query: 121 VFLLLHNMLRFLNQQYDFEGRKDLVKFVKLVGAAGLYVHIRIGPYVCAEWNYGYISFPFS 180
           VF   H   +    +Y+FEGR DLVKFVKL   AGLYVH+RIGPYVCAEWNYG   FP  
Sbjct: 127 VFWSGHEPEK---NKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYG--GFP-- 186

Query: 181 ILWLHFVPGIKFRTDNEPFKVKYNFTISVLLLFFLLQCRAEMKRFTAKIVDVLKQEKLYA 240
            +WLHFVPGIKFRTDNEPFK                    EM+RFT KIVD++KQEKLYA
Sbjct: 187 -VWLHFVPGIKFRTDNEPFK-------------------EEMQRFTTKIVDLMKQEKLYA 246

Query: 241 SQGGPVILSQIENEYGNVQSSYGSAAKSYIQWAATMATSLNTGVPWVMCNQPDAPDPIIN 300
           SQGGP+ILSQIENEYGN+ S+YG+AAKSYI+W+A+MA SL+TGVPW MC Q DAPDP+IN
Sbjct: 247 SQGGPIILSQIENEYGNIDSAYGAAAKSYIKWSASMALSLDTGVPWNMCQQTDAPDPMIN 306

Query: 301 TCNGFYCDQFTPNSKNKPKIWTENWTGWFLSFGGASPYRPVEDLAYAVARFYQNGGTLQN 360
           TCNGFYCDQFTPNS NKPK+WTENW+GWFL FG  SPYRPVEDLA+AVARFYQ GGT QN
Sbjct: 307 TCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFGDPSPYRPVEDLAFAVARFYQRGGTFQN 366

Query: 361 YYMYHGGTNFGRTTGGPFISTSYDYDAPIDEYGLVRQPNWGHLREVHEAIKMCEEALVST 420
           YYMYHGGTNF RT+GGP ISTSYDYDAPIDEYGL+RQP WGHLR++H+AIK+CE+AL++T
Sbjct: 367 YYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKLCEDALIAT 366

Query: 421 EPAVSSLGQNLE 433
           +P ++SLG NLE
Sbjct: 427 DPTITSLGSNLE 366

BLAST of Cp4.1LG02g12820 vs. TAIR10
Match: AT3G13750.1 (AT3G13750.1 beta galactosidase 1)

HSP 1 Score: 406.0 bits (1042), Expect = 4.7e-113
Identity = 188/298 (63.09%), Postives = 220/298 (73.83%), Query Frame = 1

Query: 135 QYDFEGRKDLVKFVKLVGAAGLYVHIRIGPYVCAEWNYGYISFPFSILWLHFVPGIKFRT 194
           +Y FEG  DLVKFVKLV  +GLY+H+RIGPYVCAEWN+G   FP   +WL ++PGI FRT
Sbjct: 95  KYYFEGNYDLVKFVKLVQQSGLYLHLRIGPYVCAEWNFG--GFP---VWLKYIPGISFRT 154

Query: 195 DNEPFKVKYNFTISVLLLFFLLQCRAEMKRFTAKIVDVLKQEKLYASQGGPVILSQIENE 254
           DN PFK                   A+M+RFT KIV+++K E+L+ SQGGP+ILSQIENE
Sbjct: 155 DNGPFK-------------------AQMQRFTTKIVNMMKAERLFESQGGPIILSQIENE 214

Query: 255 YGNVQSSYGSAAKSYIQWAATMATSLNTGVPWVMCNQPDAPDPIINTCNGFYCDQFTPNS 314
           YG ++   G+  +SY  WAA MA  L TGVPWVMC Q DAPDPIIN CNGFYCD F+PN 
Sbjct: 215 YGPMEYELGAPGRSYTNWAAKMAVGLGTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNK 274

Query: 315 KNKPKIWTENWTGWFLSFGGASPYRPVEDLAYAVARFYQNGGTLQNYYMYHGGTNFGRTT 374
             KPK+WTE WTGWF  FGG  PYRP ED+A++VARF Q GG+  NYYMYHGGTNFGRT 
Sbjct: 275 AYKPKMWTEAWTGWFTKFGGPVPYRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTA 334

Query: 375 GGPFISTSYDYDAPIDEYGLVRQPNWGHLREVHEAIKMCEEALVSTEPAVSSLGQNLE 433
           GGPFI+TSYDYDAP+DEYGL RQP WGHL+++H AIK+CE ALVS EP    LG   E
Sbjct: 335 GGPFIATSYDYDAPLDEYGLERQPKWGHLKDLHRAIKLCEPALVSGEPTRMPLGNYQE 368

BLAST of Cp4.1LG02g12820 vs. TAIR10
Match: AT4G36360.1 (AT4G36360.1 beta-galactosidase 3)

HSP 1 Score: 400.2 bits (1027), Expect = 2.6e-111
Identity = 185/294 (62.93%), Postives = 219/294 (74.49%), Query Frame = 1

Query: 135 QYDFEGRKDLVKFVKLVGAAGLYVHIRIGPYVCAEWNYGYISFPFSILWLHFVPGIKFRT 194
           +YDFEGR DLV+FVK +  AGLY H+RIGPYVCAEWN+G   FP   +WL +VPGI FRT
Sbjct: 94  KYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAEWNFG--GFP---VWLKYVPGISFRT 153

Query: 195 DNEPFKVKYNFTISVLLLFFLLQCRAEMKRFTAKIVDVLKQEKLYASQGGPVILSQIENE 254
           DNEPFK                  RA MK FT +IV+++K E L+ SQGGP+ILSQIENE
Sbjct: 154 DNEPFK------------------RA-MKGFTERIVELMKSENLFESQGGPIILSQIENE 213

Query: 255 YGNVQSSYGSAAKSYIQWAATMATSLNTGVPWVMCNQPDAPDPIINTCNGFYCDQFTPNS 314
           YG      G+   +Y+ WAA MA +  TGVPWVMC + DAPDP+INTCNGFYCD F PN 
Sbjct: 214 YGRQGQLLGAEGHNYMTWAAKMAIATETGVPWVMCKEDDAPDPVINTCNGFYCDSFAPNK 273

Query: 315 KNKPKIWTENWTGWFLSFGGASPYRPVEDLAYAVARFYQNGGTLQNYYMYHGGTNFGRTT 374
             KP IWTE W+GWF  FGG   +RPV+DLA+ VARF Q GG+  NYYMYHGGTNFGRT 
Sbjct: 274 PYKPLIWTEAWSGWFTEFGGPMHHRPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTA 333

Query: 375 GGPFISTSYDYDAPIDEYGLVRQPNWGHLREVHEAIKMCEEALVSTEPAVSSLG 429
           GGPF++TSYDYDAPIDEYGL+RQP +GHL+E+H AIKMCE+ALVS +P V+S+G
Sbjct: 334 GGPFVTTSYDYDAPIDEYGLIRQPKYGHLKELHRAIKMCEKALVSADPVVTSIG 363

BLAST of Cp4.1LG02g12820 vs. TAIR10
Match: AT4G26140.1 (AT4G26140.1 beta-galactosidase 12)

HSP 1 Score: 394.4 bits (1012), Expect = 1.4e-109
Identity = 184/298 (61.74%), Postives = 222/298 (74.50%), Query Frame = 1

Query: 135 QYDFEGRKDLVKFVKLVGAAGLYVHIRIGPYVCAEWNYGYISFPFSILWLHFVPGIKFRT 194
           QY FE R DLVKF+K+V  AGLYVH+RIGPYVCAEWN+G   FP   +WL +VPG+ FRT
Sbjct: 90  QYYFEDRYDLVKFIKVVQQAGLYVHLRIGPYVCAEWNFG--GFP---VWLKYVPGMVFRT 149

Query: 195 DNEPFKVKYNFTISVLLLFFLLQCRAEMKRFTAKIVDVLKQEKLYASQGGPVILSQIENE 254
           DNEPFK                   A M++FT KIV ++K+EKL+ +QGGP+ILSQIENE
Sbjct: 150 DNEPFK-------------------AAMQKFTEKIVRMMKEEKLFETQGGPIILSQIENE 209

Query: 255 YGNVQSSYGSAAKSYIQWAATMATSLNTGVPWVMCNQPDAPDPIINTCNGFYCDQFTPNS 314
           YG ++   G+  K+Y +W A MA  L+TGVPW+MC Q DAP+ IINTCNGFYC+ F PNS
Sbjct: 210 YGPIEWEIGAPGKAYTKWVAEMAQGLSTGVPWIMCKQDDAPNSIINTCNGFYCENFKPNS 269

Query: 315 KNKPKIWTENWTGWFLSFGGASPYRPVEDLAYAVARFYQNGGTLQNYYMYHGGTNFGRTT 374
            NKPK+WTENWTGWF  FGGA PYRP ED+A +VARF QNGG+  NYYMYHGGTNF R T
Sbjct: 270 DNKPKMWTENWTGWFTEFGGAVPYRPAEDIALSVARFIQNGGSFINYYMYHGGTNFDR-T 329

Query: 375 GGPFISTSYDYDAPIDEYGLVRQPNWGHLREVHEAIKMCEEALVSTEPAVSSLGQNLE 433
            G FI+TSYDYDAP+DEYGL R+P + HL+ +H+ IK+CE ALVS +P V+SLG   E
Sbjct: 330 AGEFIATSYDYDAPLDEYGLPREPKYSHLKRLHKVIKLCEPALVSADPTVTSLGDKQE 362

BLAST of Cp4.1LG02g12820 vs. TAIR10
Match: AT2G32810.1 (AT2G32810.1 beta galactosidase 9)

HSP 1 Score: 394.0 bits (1011), Expect = 1.8e-109
Identity = 178/299 (59.53%), Postives = 223/299 (74.58%), Query Frame = 1

Query: 135 QYDFEGRKDLVKFVKLVGAAGLYVHIRIGPYVCAEWNYGYISFPFSILWLHFVPGIKFRT 194
           QY+FEGR DLVKFVKL+G++GLY+H+RIGPYVCAEWN+G   FP   +WL  +PGI+FRT
Sbjct: 99  QYNFEGRYDLVKFVKLIGSSGLYLHLRIGPYVCAEWNFG--GFP---VWLRDIPGIEFRT 158

Query: 195 DNEPFKVKYNFTISVLLLFFLLQCRAEMKRFTAKIVDVLKQEKLYASQGGPVILSQIENE 254
           DNEPFK                    EM++F  KIVD++++ KL+  QGGP+I+ QIENE
Sbjct: 159 DNEPFK-------------------KEMQKFVTKIVDLMREAKLFCWQGGPIIMLQIENE 218

Query: 255 YGNVQSSYGSAAKSYIQWAATMATSLNTGVPWVMCNQPDAPDPIINTCNGFYCDQFTPNS 314
           YG+V+ SYG   K Y++WAA+MA  L  GVPWVMC Q DAP+ II+ CNG+YCD F PNS
Sbjct: 219 YGDVEKSYGQKGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGFKPNS 278

Query: 315 KNKPKIWTENWTGWFLSFGGASPYRPVEDLAYAVARFYQNGGTLQNYYMYHGGTNFGRTT 374
           + KP +WTE+W GW+  +GG+ P+RP EDLA+AVARFYQ GG+ QNYYMY GGTNFGRT+
Sbjct: 279 RTKPVLWTEDWDGWYTKWGGSLPHRPAEDLAFAVARFYQRGGSFQNYYMYFGGTNFGRTS 338

Query: 375 GGPFISTSYDYDAPIDEYGLVRQPNWGHLREVHEAIKMCEEALVSTE-PAVSSLGQNLE 433
           GGPF  TSYDYDAP+DEYGL  +P WGHL+++H AIK+CE ALV+ + P    LG   E
Sbjct: 339 GGPFYITSYDYDAPLDEYGLRSEPKWGHLKDLHAAIKLCEPALVAADAPQYRKLGSKQE 373

BLAST of Cp4.1LG02g12820 vs. NCBI nr
Match: gi|449462081|ref|XP_004148770.1| (PREDICTED: beta-galactosidase 8 [Cucumis sativus])

HSP 1 Score: 564.7 bits (1454), Expect = 2.2e-157
Identity = 291/432 (67.36%), Postives = 323/432 (74.77%), Query Frame = 1

Query: 1   MTGVRYALVVVLLVLGVLDSFSLAANVTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPD 60
           M G+R+A+V VLL+LGVL SFSLA NVTYDHRALV                         
Sbjct: 1   MKGLRFAVVFVLLLLGVLHSFSLAVNVTYDHRALV------------------------- 60

Query: 61  LIQKSKDGGLDVIETYVFWNLHEPVRNQVHFCDFVDPRAAFGSKEILNDGWNLSSVIIFS 120
                 DG   V+   V  +LH P R+       +  ++  G  +++             
Sbjct: 61  -----IDGKRKVL---VSGSLHYP-RSTPEMWPGIIQKSKDGGLDVIET----------Y 120

Query: 121 VFLLLHNMLRFLNQQYDFEGRKDLVKFVKLVGAAGLYVHIRIGPYVCAEWNYGYISFPFS 180
           VF  LH  +R    QYDFEGRKDLVKF+KLVGAAGLYVH+RIGPYVCAEWNYG   FP  
Sbjct: 121 VFWNLHEPVR---NQYDFEGRKDLVKFIKLVGAAGLYVHVRIGPYVCAEWNYG--GFP-- 180

Query: 181 ILWLHFVPGIKFRTDNEPFKVKYNFTISVLLLFFLLQCRAEMKRFTAKIVDVLKQEKLYA 240
            +WLHFVPG++FRTDNEPFK                   AEMKRFTAKIVDVLKQEKLYA
Sbjct: 181 -VWLHFVPGVQFRTDNEPFK-------------------AEMKRFTAKIVDVLKQEKLYA 240

Query: 241 SQGGPVILSQIENEYGNVQSSYGSAAKSYIQWAATMATSLNTGVPWVMCNQPDAPDPIIN 300
           SQGGP+ILSQIENEYGNVQSS+GSAAKSY+QWAATMATSLNTGVPWVMCNQPDAPDPIIN
Sbjct: 241 SQGGPIILSQIENEYGNVQSSFGSAAKSYVQWAATMATSLNTGVPWVMCNQPDAPDPIIN 300

Query: 301 TCNGFYCDQFTPNSKNKPKIWTENWTGWFLSFGGASPYRPVEDLAYAVARFYQNGGTLQN 360
           TCNGFYCDQFTPNS NKPK+WTENW+GWFLSFGGA PYRPVEDLA+AVARFYQ GG+LQN
Sbjct: 301 TCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGALPYRPVEDLAFAVARFYQTGGSLQN 360

Query: 361 YYMYHGGTNFGRTTGGPFISTSYDYDAPIDEYGLVRQPNWGHLREVHEAIKMCEEALVST 420
           YYMYHGGTNFGRT+GGPFI+TSYDYDAPIDEYGLVRQP WGHLR+VH+AIKMCEEALVST
Sbjct: 361 YYMYHGGTNFGRTSGGPFIATSYDYDAPIDEYGLVRQPKWGHLRDVHKAIKMCEEALVST 361

Query: 421 EPAVSSLGQNLE 433
           +PAV+SLG NLE
Sbjct: 421 DPAVTSLGPNLE 361

BLAST of Cp4.1LG02g12820 vs. NCBI nr
Match: gi|848865818|ref|XP_012833596.1| (PREDICTED: beta-galactosidase 8-like [Erythranthe guttata])

HSP 1 Score: 557.4 bits (1435), Expect = 3.5e-155
Identity = 276/433 (63.74%), Postives = 321/433 (74.13%), Query Frame = 1

Query: 1   MTGVRYALVVVLLVLGVLDSFSLAANVTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPD 60
           M G R A VV LL L    S   A +V YD RALVIDGKRRVLVSGSIHYPRSTP+MWPD
Sbjct: 1   MAGSRTAAVVFLL-LAASASLCTAVSVDYDGRALVIDGKRRVLVSGSIHYPRSTPDMWPD 60

Query: 61  LIQKSKDGGLDVIETYVFWNLHEPVRNQVHFCDFVDPRAAFGSKEILNDGWNLSSVIIFS 120
           LI+KSKDGGLD+IETYVFW++HEPVR Q +                 ND  +        
Sbjct: 61  LIKKSKDGGLDIIETYVFWDMHEPVRGQNN-----------------NDNDDDDD----- 120

Query: 121 VFLLLHNMLRFLNQQYDFEGRKDLVKFVKLVGAAGLYVHIRIGPYVCAEWNYGYISFPFS 180
                +N ++F   QYDF  RKDLVKFVKLV  AGLY+H+RIGPYVCAEWNYG   FP  
Sbjct: 121 -----NNQVKFW--QYDFTERKDLVKFVKLVAEAGLYLHLRIGPYVCAEWNYG--GFP-- 180

Query: 181 ILWLHFVPGIKFRTDNEPFKVKYNFTISVLLLFFLLQCRAEMKRFTAKIVDVLKQEKLYA 240
            LWLHF+PGI+ RTDN+ +K                   AEM+RFTAKIV ++KQ  LYA
Sbjct: 181 -LWLHFLPGIQLRTDNDVYK-------------------AEMQRFTAKIVGMMKQNNLYA 240

Query: 241 SQGGPVILSQIENEYGNVQSSYGSAAKSYIQWAATMATSLNTGVPWVMCNQPDAPDPIIN 300
           SQGGP++LSQIENEYGN+   YGS+AK YI WAA MATS+NTGVPWVMC Q +APDP+IN
Sbjct: 241 SQGGPIVLSQIENEYGNIDWQYGSSAKPYIDWAAQMATSMNTGVPWVMCQQNNAPDPMIN 300

Query: 301 TCNGFYCDQFTPNSKNKPKIWTENWTGWFLSFGGASPYRPVEDLAYAVARFYQNGGTLQN 360
           TCNGFYCDQFTPNS +KPK WTE W+GWF ++G   PYRPVED+A+A ARFYQN GTL N
Sbjct: 301 TCNGFYCDQFTPNSPSKPKFWTELWSGWFSAWGNPVPYRPVEDVAFAAARFYQNNGTLLN 360

Query: 361 YYMYHGGTNFGRTTGGPFISTSYDYDAPIDEYGLVRQPNWGHLREVHEAIKMCEEALVST 420
           YYMYHGGTNF RT+GGPFI+TSYDYD+PIDEYGL+RQP WGHL+++H+AIK+CEEA+VST
Sbjct: 361 YYMYHGGTNFARTSGGPFITTSYDYDSPIDEYGLLRQPKWGHLKDLHKAIKLCEEAMVST 379

Query: 421 EPAVSSLGQNLEI 434
               +SLGQNLE+
Sbjct: 421 VGNTTSLGQNLEV 379

BLAST of Cp4.1LG02g12820 vs. NCBI nr
Match: gi|659072190|ref|XP_008463829.1| (PREDICTED: beta-galactosidase 8 [Cucumis melo])

HSP 1 Score: 536.2 bits (1380), Expect = 8.5e-149
Identity = 258/317 (81.39%), Postives = 276/317 (87.07%), Query Frame = 1

Query: 116 VIIFSVFLLLHNMLRFLNQQYDFEGRKDLVKFVKLVGAAGLYVHIRIGPYVCAEWNYGYI 175
           VI   VF  LH  +R    QYDFEGRKDLVKF+KLVGAAGLYVH+RIGPYVCAEWNYG  
Sbjct: 72  VIETYVFWNLHEPVR---NQYDFEGRKDLVKFIKLVGAAGLYVHVRIGPYVCAEWNYG-- 131

Query: 176 SFPFSILWLHFVPGIKFRTDNEPFKVKYNFTISVLLLFFLLQCRAEMKRFTAKIVDVLKQ 235
            FP   +WLHFVPGIKFRTDNEPFK                   AEMKRFTAKIVDVLKQ
Sbjct: 132 GFP---VWLHFVPGIKFRTDNEPFK-------------------AEMKRFTAKIVDVLKQ 191

Query: 236 EKLYASQGGPVILSQIENEYGNVQSSYGSAAKSYIQWAATMATSLNTGVPWVMCNQPDAP 295
           E LYASQGGP+ILSQIENEYGNVQSS+GSAAKSY+QWAATMATSLNTGVPWVMCNQPDAP
Sbjct: 192 ENLYASQGGPIILSQIENEYGNVQSSFGSAAKSYVQWAATMATSLNTGVPWVMCNQPDAP 251

Query: 296 DPIINTCNGFYCDQFTPNSKNKPKIWTENWTGWFLSFGGASPYRPVEDLAYAVARFYQNG 355
           DPIINTCNGFYCDQFTPNSKNKPK+WTENW+GWFLSFGGA PYRPVEDLA+AVARFYQ G
Sbjct: 252 DPIINTCNGFYCDQFTPNSKNKPKMWTENWSGWFLSFGGALPYRPVEDLAFAVARFYQTG 311

Query: 356 GTLQNYYMYHGGTNFGRTTGGPFISTSYDYDAPIDEYGLVRQPNWGHLREVHEAIKMCEE 415
           G+LQNYYMYHGGTNFGRT+GGPFI+TSYDYDAPIDEYGLVRQP WGHLR+VH+AIKMCEE
Sbjct: 312 GSLQNYYMYHGGTNFGRTSGGPFIATSYDYDAPIDEYGLVRQPKWGHLRDVHKAIKMCEE 361

Query: 416 ALVSTEPAVSSLGQNLE 433
           AL+ST+PAV+SLG NLE
Sbjct: 372 ALISTDPAVTSLGPNLE 361

BLAST of Cp4.1LG02g12820 vs. NCBI nr
Match: gi|743905143|ref|XP_011045964.1| (PREDICTED: beta-galactosidase-like isoform X1 [Populus euphratica])

HSP 1 Score: 512.3 bits (1318), Expect = 1.3e-141
Identity = 248/440 (56.36%), Postives = 309/440 (70.23%), Query Frame = 1

Query: 10  VVLLVLGVLDSFS-----LAANVTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLIQK 69
           ++LL+L +L  FS     + A+V+YDH+A++I+G+RR+L+SGSIHYPRSTPEMWPDLIQK
Sbjct: 6   LLLLLLQLLFFFSSRISTVTASVSYDHKAVIINGQRRILISGSIHYPRSTPEMWPDLIQK 65

Query: 70  SKDGGLDVIETYVFWNLHEPVRNQVHFCDFVDPRAAFGSKEILNDGWNLSSVIIFSVFLL 129
           +KDGG+DVI+TYVFWN HEP    V    F                        F +F L
Sbjct: 66  AKDGGVDVIQTYVFWNGHEPSPGNVMQIPFYS---------------GFCVCCAFFLFFL 125

Query: 130 LHNMLRFLNQ----------QYDFEGRKDLVKFVKLVGAAGLYVHIRIGPYVCAEWNYGY 189
                +FL +          QY FE R DLVKF+KLV  AGLY+H+RIGPY+CAEWN+G 
Sbjct: 126 FCFHFKFLTRLIFTLWGKKNQYYFEDRYDLVKFIKLVQQAGLYLHLRIGPYICAEWNFG- 185

Query: 190 ISFPFSILWLHFVPGIKFRTDNEPFKVKYNFTISVLLLFFLLQCRAEMKRFTAKIVDVLK 249
             FP   +WL +VPGI+FRTDN PFK                   A M++FT KIV ++K
Sbjct: 186 -GFP---VWLKYVPGIEFRTDNGPFK-------------------AAMQKFTEKIVGMMK 245

Query: 250 QEKLYASQGGPVILSQIENEYGNVQSSYGSAAKSYIQWAATMATSLNTGVPWVMCNQPDA 309
            EKL+ +QGGP+ILSQIENEYG V+   G+  K+Y +WAA MA  L TGVPW+MC Q DA
Sbjct: 246 SEKLFENQGGPIILSQIENEYGPVEWEIGAPGKAYTKWAADMAVKLGTGVPWIMCKQEDA 305

Query: 310 PDPIINTCNGFYCDQFTPNSKNKPKIWTENWTGWFLSFGGASPYRPVEDLAYAVARFYQN 369
           PDP+I+TCNGFYC+ F PN   KPKIWTE WTGW+  FGGA P+RP ED+A++VARF QN
Sbjct: 306 PDPMIDTCNGFYCENFKPNKDYKPKIWTEAWTGWYTEFGGAVPHRPAEDMAFSVARFIQN 365

Query: 370 GGTLQNYYMYHGGTNFGRTTGGPFISTSYDYDAPIDEYGLVRQPNWGHLREVHEAIKMCE 429
           GG+  NYYMYHGGTNFGRT GGPFI+TSYDYDAP+DE+GL R+P WGHLR++H+AIK+CE
Sbjct: 366 GGSYINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEFGLPREPKWGHLRDLHKAIKLCE 406

Query: 430 EALVSTEPAVSSLGQNLEIN 435
            ALVS +P V+SLG N E++
Sbjct: 426 PALVSVDPTVTSLGSNQEVH 406

BLAST of Cp4.1LG02g12820 vs. NCBI nr
Match: gi|474448519|gb|EMS68743.1| (Beta-galactosidase 4 [Triticum urartu])

HSP 1 Score: 511.9 bits (1317), Expect = 1.7e-141
Identity = 269/553 (48.64%), Postives = 330/553 (59.67%), Query Frame = 1

Query: 135 QYDFEGRKDLVKFVKLVGAAGLYVHIRIGPYVCAEWNYGYISFPFSILWLHFVPGIKFRT 194
           QY F  R DLV+FVKL G AGL+VH+RIGPYVCAEWN+G   FP   +WL +VPGI FRT
Sbjct: 86  QYHFADRYDLVRFVKLAGQAGLFVHLRIGPYVCAEWNFG--GFP---VWLKYVPGISFRT 145

Query: 195 DNEPFKVKYNFTISVLLLFFLLQCRAEMKRFTAKIVDVLKQEKLYASQGGPVILSQIENE 254
           DN PFK                   AEM+RF  KIV ++K E L+  QGGP+IL+Q+ENE
Sbjct: 146 DNGPFK-------------------AEMQRFVEKIVSMMKSEGLFEWQGGPIILAQVENE 205

Query: 255 YGNVQSSYGSAAKSYIQWAATMATSLNTGVPWVMCNQPDAPDPIINTCNGFYCDQFTPNS 314
           YG ++S+ G+ AK Y  WAA MA +   GVPWVMC Q DAPDP+INTCNGFYCD F+PNS
Sbjct: 206 YGPMESAMGAGAKPYASWAANMAVATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNS 265

Query: 315 KNKPKIWTENWTGWFLSFGGASPYRPVEDLAYAVARFYQNGGTLQNYYMYHGGTNFGRTT 374
             KP +WTE WTGWF +FGG  P+RPVED+A+AVARF Q GG+  NYYMYHGGTNF RT 
Sbjct: 266 NGKPTMWTEAWTGWFTAFGGPVPHRPVEDMAFAVARFVQKGGSFVNYYMYHGGTNFDRTA 325

Query: 375 GGPFISTSYDYDAPIDEYGLVRQPNWGHLREVHEAIKMCEEALVSTEPAVSSLGQNLEI- 434
           GGPFI+TSYDYDAPIDEYGL+RQP WGHLR++H+AIK  E ALVS +P V  +G   +  
Sbjct: 326 GGPFIATSYDYDAPIDEYGLIRQPKWGHLRDLHKAIKQAEPALVSGDPTVQRIGNYEKAY 385

Query: 435 -----------------NSVTMRPSFSNQPLKVDASASEAFDSGW--------------- 494
                             S   R  ++ + + +D+S        W               
Sbjct: 386 VFKSSAGACAAFLSNYHTSAAARVVYNGRRVNIDSSEQFLKSGQWPQLTINSAGHSVQVF 445

Query: 495 ------GSGRGGKGNSKVSLEIPITLVPGKNTIDLLSLTVGL-----------QHYGAFF 554
                 G   GG  N K++   P+ +  G N I +LS  +GL           Q+ G  +
Sbjct: 446 VNGQSFGVAYGGYNNPKLTYSKPVKMWQGSNKISILSSAMGLPAISDMVTLWRQNQGTHY 505

Query: 555 ETKGAGVTGVKLESQKNGITVDISSGQWTYQIGLKGEDLGLSSGSSSQWLSQPSLPKNKP 614
           E    GV G    S  N    D+S+ +WTYQIGLKGE LG++S S S  +   S    +P
Sbjct: 506 EAWNVGVLGPVTLSGLNQGKRDLSNQKWTYQIGLKGESLGVNSVSGSSSVEWGSATGAQP 565

Query: 615 LTWYKTTFDAPDGSDPVALDFTGFGKGEAWINGQSIGRYWPSYIASGHCTAYCNYRGAYS 638
           LTW+K  F AP GS PVALD    GKG+ W+NG + GRYW SY ASG C A C+Y G +S
Sbjct: 566 LTWHKAYFAAPAGSAPVALDMGSMGKGQIWVNGNNAGRYW-SYRASGSCGA-CSYAGTFS 612

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
BGAL8_ARATH1.5e-13457.64Beta-galactosidase 8 OS=Arabidopsis thaliana GN=BGAL8 PE=2 SV=2[more]
BGAL6_ORYSJ2.6e-13454.66Beta-galactosidase 6 OS=Oryza sativa subsp. japonica GN=Os03g0255100 PE=1 SV=2[more]
BGAL_ASPOF2.9e-11767.11Beta-galactosidase OS=Asparagus officinalis PE=2 SV=1[more]
BGAL_SOLLC2.6e-11363.42Beta-galactosidase OS=Solanum lycopersicum PE=1 SV=1[more]
BGAL1_ARATH8.3e-11263.09Beta-galactosidase 1 OS=Arabidopsis thaliana GN=BGAL1 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A0E0GJW0_ORYNI1.1e-18761.52Beta-galactosidase OS=Oryza nivara PE=3 SV=1[more]
A0A0E0CY77_9ORYZ4.1e-16654.61Beta-galactosidase OS=Oryza meridionalis PE=3 SV=1[more]
A0A0E0CY76_9ORYZ4.1e-16654.61Beta-galactosidase OS=Oryza meridionalis PE=3 SV=1[more]
A0A0E0CY79_9ORYZ4.1e-16654.61Beta-galactosidase OS=Oryza meridionalis PE=3 SV=1[more]
M0SQP6_MUSAM3.8e-14862.82Beta-galactosidase OS=Musa acuminata subsp. malaccensis PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT2G28470.18.7e-13657.64 beta-galactosidase 8[more]
AT3G13750.14.7e-11363.09 beta galactosidase 1[more]
AT4G36360.12.6e-11162.93 beta-galactosidase 3[more]
AT4G26140.11.4e-10961.74 beta-galactosidase 12[more]
AT2G32810.11.8e-10959.53 beta galactosidase 9[more]
Match NameE-valueIdentityDescription
gi|449462081|ref|XP_004148770.1|2.2e-15767.36PREDICTED: beta-galactosidase 8 [Cucumis sativus][more]
gi|848865818|ref|XP_012833596.1|3.5e-15563.74PREDICTED: beta-galactosidase 8-like [Erythranthe guttata][more]
gi|659072190|ref|XP_008463829.1|8.5e-14981.39PREDICTED: beta-galactosidase 8 [Cucumis melo][more]
gi|743905143|ref|XP_011045964.1|1.3e-14156.36PREDICTED: beta-galactosidase-like isoform X1 [Populus euphratica][more]
gi|474448519|gb|EMS68743.1|1.7e-14148.64Beta-galactosidase 4 [Triticum urartu][more]
The following terms have been associated with this gene:
Vocabulary: Biological Process
TermDefinition
GO:0005975carbohydrate metabolic process
Vocabulary: Molecular Function
TermDefinition
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0030246carbohydrate binding
Vocabulary: INTERPRO
TermDefinition
IPR019801Glyco_hydro_35_CS
IPR017853Glycoside_hydrolase_SF
IPR013781Glycoside hydrolase, catalytic domain
IPR008979Galactose-bd-like_sf
IPR001944Glycoside_Hdrlase_35
IPR000922Lectin_gal-bd_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0005975 carbohydrate metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0030246 carbohydrate binding
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG02g12820.1Cp4.1LG02g12820.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000922D-galactoside/L-rhamnose binding SUEL lectin domainPFAMPF02140Gal_Lectincoord: 624..685
score: 3.3
IPR000922D-galactoside/L-rhamnose binding SUEL lectin domainPROFILEPS50228SUEL_LECTINcoord: 624..686
score: 13
IPR001944Glycoside hydrolase, family 35PRINTSPR00742GLHYDRLASE35coord: 37..54
score: 7.2E-32coord: 590..606
score: 7.2E-32coord: 563..577
score: 7.2E-32coord: 58..76
score: 7.2
IPR001944Glycoside hydrolase, family 35PANTHERPTHR23421BETA-GALACTOSIDASE RELATEDcoord: 135..200
score: 0.0coord: 220..671
score: 0.0coord: 4..87
score:
IPR008979Galactose-binding domain-likeGENE3DG3DSA:2.60.120.260coord: 563..605
score: 8.
IPR008979Galactose-binding domain-likeunknownSSF49785Galactose-binding domain-likecoord: 531..606
score: 5.95
IPR013781Glycoside hydrolase, catalytic domainGENE3DG3DSA:3.20.20.80coord: 25..92
score: 9.8E-26coord: 132..414
score: 9.6
IPR017853Glycoside hydrolase superfamilyunknownSSF51445(Trans)glycosidasescoord: 26..93
score: 4.0E-20coord: 134..414
score: 6.33
IPR019801Glycoside hydrolase, family 35, conserved sitePROSITEPS01182GLYCOSYL_HYDROL_F35coord: 243..255
scor
NoneNo IPR availablePANTHERPTHR23421:SF58BETA-GALACTOSIDASE 8coord: 135..200
score: 0.0coord: 220..671
score: 0.0coord: 4..87
score:

The following gene(s) are paralogous to this gene:
GeneParalogueOrganismBlock
Cp4.1LG02g12820Cp4.1LG20g09030Cucurbita pepo (Zucchini)cpecpeB432