Tan0013897 (gene) Snake gourd v1

Overview
Name: Tan0013897
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Glycosyl hydrolase family protein
Location: LG04: 6370037 .. 6377502 (-)
RNA-Seq Expression: Tan0013897
Synteny: Tan0013897
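
The location given above can be read as a linkage group, a start, an end, and a strand. The sketch below parses that string and computes the genomic span. It assumes the coordinates are 1-based and inclusive, as in GFF-style annotation; the page itself does not state the convention, and the helper name `parse_location` is hypothetical.

```python
import re

def parse_location(location: str) -> dict:
    """Parse a string such as 'LG04: 6370037 .. 6377502 (-)'.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(
        r"\s*([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", location
    )
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("LG04: 6370037 .. 6377502 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 7466 bp on the minus strand of LG04.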
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATCTTATTTTTAAAGGTACGTGGCCAAATAAGAACAAATCTAAAAATTCAAATATTAAAATTATAATTTTACCTTTTCTTTTCTATTTGGTTTGGGAAAAGCCAAAATATTTGTATAGCTAGTTGGTAGTATTTTTTTTCTTTTTGGTAATGATTTATTTGTATTTTCAAGAAAAAAAGTAATGGCGGGGCTTTTGTTTTTGTTTTTTATTTTTTTTTTCTTTTTGGTAATGATTTATTTGTATTTTCAAGAAAAAATAAAGATAAATTGTGAGAAAACAAACATAATTTTTAAGAAAGATGGAAACCATTGTAGGGAAGATAAAAACCATGATAGATAAATTGTGAGAAAACAAACACAATTTTTAAAAACCAAAAACAAAAAATCAAATGGTTATCAAACAAGACCTTAATTTTTTGTTTTTAAAAATTGTATTTGTTTTCTCACTATTTTTCTCTAATAGTTTCATATTCTCTACCATAACTTCAATATTTCTTAAAGAAATATTTAAGTTATGGTAGAGAATATCAAAACTTTCGGTCAAATTGCAAAAAGAAAAACAAAATTTTGAAAACTATTATTTTTAGTTTTCAAAACTTGTCTTAATTTTGAAAAACATATTTAATAAAAAGATACAAATTCATATGTGAAATTAGTATTTATGGTTAAAAAAATTAAATAGTTATAAAAAATGCTTAAATTATTACTTATATGAAACTTCATGTACTAGATTTTTAACTTAACTTCAAAAGAAGAATGAACCTATTGAAAAGTATTAGGGGGATAGTTTTGCAAATTTTCCATGAGAAAAATCGGGCGCACCATGAATGAATTTGCCACTCAACTGTGACTCAAAAGACAAAGTTCAGTAATGGCGTAAAAACCCTCAGGAATCTCTTCTCTCTCTCTCTCTCTCTCACTTCTTTTTCCAAGAACAGCCCATAATAAACCCACGATTTAGCTTTCTCCTTCTTCACCTACCTCTCCACTTTTACGGATTCACCATTCCCCAGCTTTCACCCACCACTGCAAAACCCATTTTTGGTTTTGTAAAGCTCCTCCACTCTGAAACATTATCTCCATTTCCAAGTTTCCATTGGTTTAACTTATCTCCACTCTTCCTCTTCTTTTATTCTGCTTTCTGCTATTCCCACTATCATTTACACTTTCAGGTAACCTCCTCTCTCTCTCTCTCTCTCTCTCTCCGCTATCTGATTCTGGTAACTGCTGAAGGTGGGTTTGAGCGGCAACAGAAAATAGTGTTTCAATCTTCGCTCACACTGTAATTTGGGAATTGGGTTTGCTTTTATTCTGTTTCATGCTTGATTGTTCTTCAATTTACTTCAAGAAACTGATTGCTGCGTATTGAAACTGAGAGGGAATGAACAAAAACTTAGAGAAAATGGAGGCAGCGATTCGGGATTGAGAGGGGTTCAGAATAGAGAAAGAAGGACGAAAAAACAGAGCTAGGAGATCTTTGAGTCTAATCACTAATTTTAACATTCAAATATCTGATATAGCTTTCGGAACACCGATTGTATGAGACAGTCTTCTTTCTTCTTTCTTCCATTTCCTGCATTTTTCTTGGCGTTTAATTACAGTGTACAACAAGTGGGTTCAATTTATGTATTTGCTCTTTTCAGTGTGAGTTGGGCTCTCGAGGAAAGAAGATGGCGAAGATTTTTGTTCAGCTGTTTGTGATTTTGTGCTTGGGTTGGTGGTGGTGGGCAATAATGGTGGGCGCGGAGAACTTGAAGTACAAAGACCCTAAGCAACCGGTTGCTGTGCGAGTTAAGGACCTTCTTGGCCGAATGACGCTGGAAGAGAAAATTGGTCAGATGGTTCAGATTGACAGAAGCGTTGCCAATGCTACAGTTATGAAAGATTATTTTATTGGTAAAGCAATAGTCTCCTGCTCTCTCTTTCTTTTACACTCTCAATTTTATCTATATGCTATTGAAGATCTGCATACTTGATGATAATATGTCTCATGCTT
AACTTTTTAGTTTTGATTTAGAGTGTTGGAGTTTGATATAACTAAATTTACCATAACCTACCGGTTTAAGCTTTTAGGTTGATTGTTGATTTAACATAAATTAGTTCATAGTAAAGTTTGATGTTATTTTCAGTTTTGTGAGTGCTATGTTCCTGATCTGATTTATGTCTACCCTGAATGGAAGTTGTTCTAGGCTCAATTACACTTGATTCTTGTGACAAACAGTTGAGATGGCAAATGGGTTCTGCTTGCGCCTTTGTCACTAAATCATTACAGTTTTTTTTAAGTAATATTCAGTTCTAAGGAGAAGAAAACAGAAGATTAATTGATATCATTTTCATCATAATCAACACCTTTTTGTGTAATGAAAATCTTTTACTCAAGGGAATGTTTTAAGAGTTTGATCGTTGCATGCTATGAGAATCATATTTGAAGCATTATAGCTGTATATGTGTAGTGTCAATTTTGTTGAAACAATCTCAAATTCACGGTCAGGGGCACATAATCTAGACTCTAGAGGTTATTGAGTTCCATTATGACTTGTCACAAATAGCAAATGATTGAAGAGCTTCTCATTCATGCGAAAGCTAACCTTTACTATTCAAAATTTGATTATCAGAGTCAATTGACAGAAGTTTTCTAGGAGCATGAACTGTGAAATTTGTACAGGAAGTGTGCTAAGTGGTGGTGGAAGTGTGCCGCTTCCAGATGCTCGTGCTCAAGATTGGGTTGACATGATAAATGATTTCCAGAAGGGTTCTCTTTCTAGTCGATTGGGTATCCCAATGATTTATGGCATTGATGCTGTTCATGGCCATAACAACGTTTACAATGCTACTGTATTCCCTCATAATGTTGGACTTGGAGCTACCAGGTAAGTATTTTTCTTTTCCATGTAGCCGTGCACATAAAAGTTGTACTATACTATAGTCAGTGTTTATATTGATCCTTTGCTTCATATATCTCTTTGATGTGGTCTTAAGTTTAACCAAAACTTATATGCTCAATTAAGCTGTGTTGAAGTTCGTCCACCTAAAATGATCCAATAAAAATTTGTTTTGATGGAGCATGGTAGAATCAAGAATGTTCTCCAGCTCCTCAGTCGTCCAGTGGTTGGATCAAATTCATATTGTATATATTATTTCATTGTGATGTATTTTATAAATTATAATTATAGAGCCACCTGCAAATGTACAACTTCTTTACATTTGTTTCTTGCCTCTCCTACACAGAAACCCTGACCTGGTTCGAAGGATTGGTGCAGCAACTGCACTTGAAGTTAGAGCTACAGGGATTTCTTATACATTTGCTCCATGCCTTGCGGTAAACACATGTTTCTTATTTCCAGAAAGAGATGTTTGATTGAATATCTCACTATTTGTTTTAGTTGTCAGCTAGATATATAACAGTCTCTTTCCCTGGAAACTTTCTTTAGGTTTGTAGGGACCCGAGGTGGGGTCGATGTTATGAAAGCTACAGTGAGGATCCAAGAATTGTGCAAAATATGACTGAGATTATAATTGGTCTGCAAGGAGAGCCTCCTGCAAATTTTCGGAAGGGGATTCCATATGTTGGTGGAACGTATGGCCTTCGAAGATGTAGAATTTTGTTTTTGTGTTTTTTAATATAAGACTGAATCCATACAAAATGTTTCCTAAAAGCCTTTGCAGGCCAGTTGGGAAATGTTTTCATCTAGTCAAGAGAAATCATATGGCATATAAATGTGTGGGTCCATTTTATATAACATTCTGGAAGTTGAATATGGAAGTGTTGCACATTTAGAAATAGAGCATTGCATTCTGCAAAATTGTTATTCTTCAACCAAATTACTGAGATACTATCTGATTTCTTTGTAAACTCTCTTCCTCGTCCCTGCAAAATTGTAAAATTACACTCGAACTTTTCTTGTCTTGCAGTAAAAAGGTTATCGCCTGTGCGAAGCACTTTGTTGGAGATGGTGGGACAACTCATGGCATCAATGAGAATAATACCGTTATTGA
CAGGCATGGACTGCTCAGCATTCACATGCCTGCTTATTTAGATTCGATCATCAAGGGTGTTTCATCGGTAATGGCTTCCTATTCTAGTTGGAATGGAGTAAAGATGCACGCAAACCGTGAGCTGATTACTGGTTTCCTCAAGGGTACCCTTAAATTTAAGGTATGTCTTGACAATTTAAATTTGTCTTGAAAGCCATTTGCTGATATTTAACTGCAATTATGCTACTGTCTGATGTTTACTAATTTTACTATGTGCATTTTCAGGGTTTTGTCATCTCAGATTGGGAGGGTCTGGACAGATTGACTTCTACACCACATTCTAATTACACGTACTCTGTCAAAGCTGCAATTTTAGCTGGCATTGACATGGTTGGTGATTCTCTTTAACTTTAAGGATACCTTAAATCAAACTTGACTGAATATATGACTGATGAGTTGTGTTTGAAATGTTGGAATTGGGTGTAATATTTGATGACGGAAGTTGGGATTATTTAGGCTTTGGTTCTGTGTTCTGCAGGTCATGATTCCTTACAAGTATGCAGAGTTCATTGATGATCTTACGTTTCTAGTGAAGAGCAATGTCATCCCAATGGATCGTATTGATGATGCTGTTGGGAGAATTTTGTCTGTCAAGTTCACAATGGGTCTTTTTGAAAGCCCTTTGGGTGATTACAGCCTTGTCAATGAGCTTGGGAGCCAGGTCTGACTCCAACTATTGCTAAATATATGACTGATTTATTCGGATTCCATTCGTGGGCTTGATGTTCTGGATGCCCTTTCATTCCTTCTTCTTCTTCTTCTTTTTTTTTTTTTTTAATATTTATTATTATTATTTTTTTATAATTTAAAAAAAAAACTTTAATTTTTAATTGGCTATTCAATTTAAGTGAAGTTCATCTTTAGTAAGAATATCAGATGTCGCACTTATCTTCTTTTCATTTCCATTGTCCTTGAAACATTTCTCATTGGGGCATTAAGAAAAATCAAAAGTTTCTTTCCACTAAGCTCCTCTTGAAACTTTTATCTTAAATTTAAATATAAGCTTCTCTCGTCATTTAAGCTTTATATTTTTCTTAAGAAATGAATATTTTTTTATTCTCTAGCTAAAGAAATCCCATGAATGTCTCTTACACCTCTGTTAGTTGGAGTGAGAAGTCTCATGGACATAATTTTGCATCTATAGGAACATAGAGACTTGGCAAGAGATGCTGTGAGGCAGTCACTCGTACTGCTGAAGAATGGGAAAAATGACAGCGATCCGTTGCTACCCCTTTCAAAGAAGGCCCCAAAGATCCTTGTTGCTGGCACTCACGCTGATAATTTAGGATATCAATGTGGTGGATGGACAATTGCATGGCAAGGATTCAGTGGCAACAATGCTACAAGGGGTATGTTTCTTTTCACTCATTTTTCAGAACCATAAATCAAGAGTTAGAATTTAAGATCTCTGGGGTAAGTTTTGAAACTGAAGTATCCAATTCTGGCGTTGTTCAAATAATTTGGAGATGTATGTTAGTTGGGATTCAGAAAATGACCCTTGTTTGCTTGTCTAAAAGTGTTCATACGGTCATTCTTATAAAATTTACATATCTGTAGTCAATGAAGAGTTTTCTTTTAAATTGAGAGTTTAGACCTACAAGTGTTGTACCATAATAAGTAGAGTAGAGAACAAAGAGTTTTCCTCTTGACTTCTTAGATACCAAAACTTCCTCAGATAAGCCAGATTATTCTCTCTGTTTATGTTCATCCTTCAAGACCTTCTGTTTGATATTTAGATTTCCTTTTCCCAATTTGCAGGAACTAGCATCCTCGCTGCCATCAAATCAACAGTTGATCCAAGCACAGAGGTAGTATTCCGTGAGGATCCTGACAGTGATTTTGTTAAGTCCAATGACTTCTCATACGCCATTGTTGTTATTGGGGAAGCCCCATATGCCGAGACTGAAGGGGATAGTACAACACTTACCATGTTGGATCCTGGTCCAAGCATCGTAAAAAAT
GTTTGTGATTCTGTAAAGTGTGTGGTGGTTGTCATTTCTGGAAGGCCAATTGTGATGGAACCATATGTTTCATCAATGGATGCTCTTGTAGCAGCTTGGTTACCTGGTACTGAAGGCCTAGGAGTCACTGATGCCCTTTATGGAGACCATGGTTTTAGTGGGAAGCTTCCAAGAACTTGGTTTAAATCTGTAGATCAACTGCCAATGAACGTTGGAGATCGACACTATGATCCACTTTTCCCGCTTGGTTTCGGACTCACAACTGGATCCGTTAAGGACGTTGTCGCGAGGTAAGTACTAATATTACTCCATATCAACTACTGATATCCTATGATTTCATTTCCCTTACTCTTCATTCACTCTTGTTTACTTATCCCAGAAGGAAATAAATTTGATTTTTTTTTTGCTTATGTATAGGTCAACATCGGCGGGAATTAAAGGAACACCATCCTTCATTGCAATGATCATTGCTACAATTGCCATTTGTATATTGCAGGTACACTTGTAGTTCTAACCAGTAGTACAATTGCCATTTAGGAAAGTTGAAGCTCTGAGAGCCTCAAATGCTTAGCCATTTCATTTAGTGACGGTGGTTTTATGGGATTTGTAGTAGCAAAGACATCTCATTTCCATTCATCTAAACCTTATTTATTTTCTTTATCATGATGCCCTTCGGCACTTTAAAACTAACTCTTCGAATGATTCTCCATTGATAGAGAGGGAGAGAGACAAAGGCAAAAGAATTTAGAATGAAAAAAATGTTACAAGAGATAAGTTCTTGTTAGAAGAGATATAAGTTCTAGCTTAAATTACAAGTTTAAGCAGAGCTTAATAGACAAATTCAAAAATTTTAAAACTTTTATATCTAATAGGACTTTAAACTTTCAAACCAAGTGAATGACTTATTAGACCGAAAATTAAAAATTCAAGACTTGATAAGTTAGAATTTTCGAAAATTCAAGAACTTAAAAGACAAACTCTTTTGTAATTACATCATTAACTTAGATTAGAAGGTTTTCTTGTAATTCCTTTTAGTTGTGGGGACCTCTTTGTCCCTTTGCTTGCTAGCTTGTATAGCTGTTTTTTTAAGAATGAAGTGGTTCTCCTTTCTTATATATAAAAAAAGTTAAGGCAAGAGCAAAAAAAAGGGAAGCTTAACTTGAAAGGATTGAAAAAAGGGAAAGTTCTAAACTGAAAATTTAAAGGGTTTTTTTGCTTTAATCAGTGAAAAGTTAAAAAGAGAGAAAAAGTAGGAAAAAGAAATTAGAAAGTGAAGTGAGGAAGAAAAATTATGGAGATAAAACAAAGAAAGACAAAAATGATAGGAAGAGATATTAAGAGAGAGAAGTATATAGTTTTAGACAGAAATCTTATTAAATTATTATTTTTTTTAAAAAAGGAAACTCAAAAAACAAAAATGATTATGGAGGTAAAAGGAGTTTCTTCCCATTCAATTTCATGAATGA

mRNA sequence

ATGGATCTTATTTTTAAAGGTACGTGGCCAAATAAGAACAAATCTAAAAATTCAAATATTAAAATTATAATTTTACCTTTTCTTTTCTATTTGTGTGAGTTGGGCTCTCGAGGAAAGAAGATGGCGAAGATTTTTGTTCAGCTGTTTGTGATTTTGTGCTTGGGTTGGTGGTGGTGGGCAATAATGGTGGGCGCGGAGAACTTGAAGTACAAAGACCCTAAGCAACCGGTTGCTGTGCGAGTTAAGGACCTTCTTGGCCGAATGACGCTGGAAGAGAAAATTGGTCAGATGGTTCAGATTGACAGAAGCGTTGCCAATGCTACAGTTATGAAAGATTATTTTATTGGAAGTGTGCTAAGTGGTGGTGGAAGTGTGCCGCTTCCAGATGCTCGTGCTCAAGATTGGGTTGACATGATAAATGATTTCCAGAAGGGTTCTCTTTCTAGTCGATTGGGTATCCCAATGATTTATGGCATTGATGCTGTTCATGGCCATAACAACGTTTACAATGCTACTGTATTCCCTCATAATGTTGGACTTGGAGCTACCAGAAACCCTGACCTGGTTCGAAGGATTGGTGCAGCAACTGCACTTGAAGTTAGAGCTACAGGGATTTCTTATACATTTGCTCCATGCCTTGCGGTTTGTAGGGACCCGAGGTGGGGTCGATGTTATGAAAGCTACAGTGAGGATCCAAGAATTGTGCAAAATATGACTGAGATTATAATTGGTCTGCAAGGAGAGCCTCCTGCAAATTTTCGGAAGGGGATTCCATATGTTGGTGGAACTAAAAAGGTTATCGCCTGTGCGAAGCACTTTGTTGGAGATGGTGGGACAACTCATGGCATCAATGAGAATAATACCGTTATTGACAGGCATGGACTGCTCAGCATTCACATGCCTGCTTATTTAGATTCGATCATCAAGGGTGTTTCATCGGTAATGGCTTCCTATTCTAGTTGGAATGGAGTAAAGATGCACGCAAACCGTGAGCTGATTACTGGTTTCCTCAAGGGTACCCTTAAATTTAAGGGTTTTGTCATCTCAGATTGGGAGGGTCTGGACAGATTGACTTCTACACCACATTCTAATTACACGTACTCTGTCAAAGCTGCAATTTTAGCTGGCATTGACATGGTCATGATTCCTTACAAGTATGCAGAGTTCATTGATGATCTTACGTTTCTAGTGAAGAGCAATGTCATCCCAATGGATCGTATTGATGATGCTGTTGGGAGAATTTTGTCTGTCAAGTTCACAATGGGTCTTTTTGAAAGCCCTTTGGGTGATTACAGCCTTGTCAATGAGCTTGGGAGCCAGGAACATAGAGACTTGGCAAGAGATGCTGTGAGGCAGTCACTCGTACTGCTGAAGAATGGGAAAAATGACAGCGATCCGTTGCTACCCCTTTCAAAGAAGGCCCCAAAGATCCTTGTTGCTGGCACTCACGCTGATAATTTAGGATATCAATGTGGTGGATGGACAATTGCATGGCAAGGATTCAGTGGCAACAATGCTACAAGGGGAACTAGCATCCTCGCTGCCATCAAATCAACAGTTGATCCAAGCACAGAGGTAGTATTCCGTGAGGATCCTGACAGTGATTTTGTTAAGTCCAATGACTTCTCATACGCCATTGTTGTTATTGGGGAAGCCCCATATGCCGAGACTGAAGGGGATAGTACAACACTTACCATGTTGGATCCTGGTCCAAGCATCGTAAAAAATGTTTGTGATTCTGTAAAGTGTGTGGTGGTTGTCATTTCTGGAAGGCCAATTGTGATGGAACCATATGTTTCATCAATGGATGCTCTTGTAGCAGCTTGGTTACCTGGTACTGAAGGCCTAGGAGTCACTGATGCCCTTTATGGAGACCATGGTTTTAGTGGGAAGCTTCCAAGAACTTGGTTTAAATCTGTAGATCAACTGCCAATGAACGTTGGAGATCGACACTATGATCCACTTTTCCCGCTTGGTTTCGGACTCACAACTGGATCCGT
TAAGGACGTTGTCGCGAGGTCAACATCGGCGGGAATTAAAGGAACACCATCCTTCATTGCAATGATCATTGCTACAATTGCCATTTGTATATTGCAGGAGTTTCTTCCCATTCAATTTCATGAATGA

Coding sequence (CDS)

ATGGATCTTATTTTTAAAGGTACGTGGCCAAATAAGAACAAATCTAAAAATTCAAATATTAAAATTATAATTTTACCTTTTCTTTTCTATTTGTGTGAGTTGGGCTCTCGAGGAAAGAAGATGGCGAAGATTTTTGTTCAGCTGTTTGTGATTTTGTGCTTGGGTTGGTGGTGGTGGGCAATAATGGTGGGCGCGGAGAACTTGAAGTACAAAGACCCTAAGCAACCGGTTGCTGTGCGAGTTAAGGACCTTCTTGGCCGAATGACGCTGGAAGAGAAAATTGGTCAGATGGTTCAGATTGACAGAAGCGTTGCCAATGCTACAGTTATGAAAGATTATTTTATTGGAAGTGTGCTAAGTGGTGGTGGAAGTGTGCCGCTTCCAGATGCTCGTGCTCAAGATTGGGTTGACATGATAAATGATTTCCAGAAGGGTTCTCTTTCTAGTCGATTGGGTATCCCAATGATTTATGGCATTGATGCTGTTCATGGCCATAACAACGTTTACAATGCTACTGTATTCCCTCATAATGTTGGACTTGGAGCTACCAGAAACCCTGACCTGGTTCGAAGGATTGGTGCAGCAACTGCACTTGAAGTTAGAGCTACAGGGATTTCTTATACATTTGCTCCATGCCTTGCGGTTTGTAGGGACCCGAGGTGGGGTCGATGTTATGAAAGCTACAGTGAGGATCCAAGAATTGTGCAAAATATGACTGAGATTATAATTGGTCTGCAAGGAGAGCCTCCTGCAAATTTTCGGAAGGGGATTCCATATGTTGGTGGAACTAAAAAGGTTATCGCCTGTGCGAAGCACTTTGTTGGAGATGGTGGGACAACTCATGGCATCAATGAGAATAATACCGTTATTGACAGGCATGGACTGCTCAGCATTCACATGCCTGCTTATTTAGATTCGATCATCAAGGGTGTTTCATCGGTAATGGCTTCCTATTCTAGTTGGAATGGAGTAAAGATGCACGCAAACCGTGAGCTGATTACTGGTTTCCTCAAGGGTACCCTTAAATTTAAGGGTTTTGTCATCTCAGATTGGGAGGGTCTGGACAGATTGACTTCTACACCACATTCTAATTACACGTACTCTGTCAAAGCTGCAATTTTAGCTGGCATTGACATGGTCATGATTCCTTACAAGTATGCAGAGTTCATTGATGATCTTACGTTTCTAGTGAAGAGCAATGTCATCCCAATGGATCGTATTGATGATGCTGTTGGGAGAATTTTGTCTGTCAAGTTCACAATGGGTCTTTTTGAAAGCCCTTTGGGTGATTACAGCCTTGTCAATGAGCTTGGGAGCCAGGAACATAGAGACTTGGCAAGAGATGCTGTGAGGCAGTCACTCGTACTGCTGAAGAATGGGAAAAATGACAGCGATCCGTTGCTACCCCTTTCAAAGAAGGCCCCAAAGATCCTTGTTGCTGGCACTCACGCTGATAATTTAGGATATCAATGTGGTGGATGGACAATTGCATGGCAAGGATTCAGTGGCAACAATGCTACAAGGGGAACTAGCATCCTCGCTGCCATCAAATCAACAGTTGATCCAAGCACAGAGGTAGTATTCCGTGAGGATCCTGACAGTGATTTTGTTAAGTCCAATGACTTCTCATACGCCATTGTTGTTATTGGGGAAGCCCCATATGCCGAGACTGAAGGGGATAGTACAACACTTACCATGTTGGATCCTGGTCCAAGCATCGTAAAAAATGTTTGTGATTCTGTAAAGTGTGTGGTGGTTGTCATTTCTGGAAGGCCAATTGTGATGGAACCATATGTTTCATCAATGGATGCTCTTGTAGCAGCTTGGTTACCTGGTACTGAAGGCCTAGGAGTCACTGATGCCCTTTATGGAGACCATGGTTTTAGTGGGAAGCTTCCAAGAACTTGGTTTAAATCTGTAGATCAACTGCCAATGAACGTTGGAGATCGACACTATGATCCACTTTTCCCGCTTGGTTTCGGACTCACAACTGGATCCGT
TAAGGACGTTGTCGCGAGGTCAACATCGGCGGGAATTAAAGGAACACCATCCTTCATTGCAATGATCATTGCTACAATTGCCATTTGTATATTGCAGGAGTTTCTTCCCATTCAATTTCATGAATGA

Protein sequence

MDLIFKGTWPNKNKSKNSNIKIIILPFLFYLCELGSRGKKMAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNMTEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGTLKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQEHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFREDPDSDFVKSNDFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPSFIAMIIATIAICILQEFLPIQFHE
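
The protein sequence above corresponds to the CDS read codon by codon. As a rough consistency check, the sketch below translates the first and last few codons of the CDS and compares them with the ends of the protein. It assumes the standard nuclear genetic code; the function name `translate` is hypothetical and not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons and last five codons of the CDS shown above.
print(translate("ATGGATCTTATTTTTAAAGGTACGTGGCCA"))
print(translate("CAATTTCATGAATGA"))
```

The two fragments translate to MDLIFKGTWP and QFHE, which match the start and end of the listed protein; the final TGA is the stop codon.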
Homology
BLAST of Tan0013897 vs. ExPASy Swiss-Prot
Match: A7LXU3 (Beta-glucosidase BoGH3B OS=Bacteroides ovatus (strain ATCC 8483 / DSM 1896 / JCM 5824 / NCTC 11153) OX=411476 GN=BACOVA_02659 PE=1 SV=1)

HSP 1 Score: 307.4 bits (786), Expect = 4.3e-82
Identity = 213/653 (32.62%), Positives = 330/653 (50.54%), Query Frame = 0

Query: 73  PKQP-VAVRVKDLLGRMTLEEKIGQMVQIDRSVAN-----------------ATVMKDYF 132
           P  P +   +++ L +MTLE+KIGQM +I   V +                  TV+  Y 
Sbjct: 30  PTDPAIETHIREWLQKMTLEQKIGQMCEITIDVVSDLETSRKKGFCLSEAMLDTVIGKYK 89

Query: 133 IGSVLSGGGSVPLPDARAQD-WVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATV 192
           +GS+L    +VPL  A+ ++ W + I   Q+ S+   +GIP IYG+D +HG     + T+
Sbjct: 90  VGSLL----NVPLGVAQKKEKWAEAIKQIQEKSM-KEIGIPCIYGVDQIHGTTYTLDGTM 149

Query: 193 FPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPR 252
           FP  + +GAT N +L RR    +A E +A  I +TFAP + + RDPRW R +E+Y ED  
Sbjct: 150 FPQGINMGATFNRELTRRGAKISAYETKAGCIPWTFAPVVDLGRDPRWARMWENYGEDCY 209

Query: 253 IVQNM-TEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDR 312
           +   M    + G QGE P           G   V AC KH++G G    G +   + I R
Sbjct: 210 VNAEMGVSAVKGFQGEDPNRI--------GEYNVAACMKHYMGYGVPVSGKDRTPSSISR 269

Query: 313 HGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGTLKFKGFVISDWE 372
             +   H   +L ++ +G  SVM +    NG+  HANREL+T +LK  L + G +++DW 
Sbjct: 270 SDMREKHFAPFLAAVRQGALSVMVNSGVDNGLPFHANRELLTEWLKEDLNWDGLIVTDWA 329

Query: 373 GLDRLTSTPHSNYT--YSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDA 432
            ++ L +  H   T   +VK  I AGIDM M+PY+   F D L  LV+   + M+RIDDA
Sbjct: 330 DINNLCTRDHIAATKKEAVKIVINAGIDMSMVPYE-VSFCDYLKELVEEGEVSMERIDDA 389

Query: 433 VGRILSVKFTMGLFESPLGDYSLVNELGSQEHRDLARDAVRQSLVLLKNGKNDSDPLLPL 492
           V R+L +K+ +GLF+ P  D    ++ GS+E   +A  A  +S VLLKN  N    +LP+
Sbjct: 390 VARVLRLKYRLGLFDHPYWDIKKYDKFGSKEFAAVALQAAEESEVLLKNDGN----ILPI 449

Query: 493 SKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNN-ATRGTSILAAI-----KSTVDPST 552
           + K  KIL+ G +A+++    GGW+ +WQG   +  A    +I  A+     K  +    
Sbjct: 450 A-KGKKILLTGPNANSMRCLNGGWSYSWQGHVADEYAQAYHTIYEALCEKYGKENIIYEP 509

Query: 553 EVVFREDPDSDFVKSN------------DFSYAIVVIGEAPYAETEGDSTTLTMLDPGPS 612
            V +    + ++ + N                 I  IGE  Y ET G+ T LT+ +   +
Sbjct: 510 GVTYASYKNDNWWEENKPETEKPVAAAAQADIIITCIGENSYCETPGNLTDLTLSENQRN 569

Query: 613 IVKNVCDSVKCVVVVIS-GRPIVMEPYVSSMDALVAAWLPGT-EGLGVTDALYGDHGFSG 669
           +VK +  + K +V+V++ GRP ++   V    A+V   LP    G  + + L GD  FSG
Sbjct: 570 LVKALAATGKPIVLVLNQGRPRIINDIVPLAKAVVNIMLPSNYGGDALANLLAGDANFSG 629
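
The percentages in an HSP summary line are the matched positions divided by the alignment length, for example 213/653 for identity in the hit above. The sketch below parses such a line and recomputes the percentages. The regular expression and the name `hsp_fractions` are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Matches fields of the form "Label = hits/length (percent%)".
FRACTION = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+(?:\.\d+)?)%\)")

def hsp_fractions(summary: str) -> dict:
    """Return {label: (hits, length, reported_percent, recomputed_percent)}."""
    result = {}
    for label, hits, length, reported in FRACTION.findall(summary):
        hits, length = int(hits), int(length)
        recomputed = round(100 * hits / length, 2)
        result[label] = (hits, length, float(reported), recomputed)
    return result

line = "Identity = 213/653 (32.62%), Positives = 330/653 (50.54%), Query Frame = 0"
for label, (hits, length, reported, recomputed) in hsp_fractions(line).items():
    print(f"{label}: {hits}/{length} reported {reported:.2f}% recomputed {recomputed:.2f}%")
```

The alignment length (653 here) counts gap columns, so it can exceed the number of query residues covered by the HSP.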

BLAST of Tan0013897 vs. ExPASy Swiss-Prot
Match: Q23892 (Lysosomal beta glucosidase OS=Dictyostelium discoideum OX=44689 GN=gluA PE=1 SV=2)

HSP 1 Score: 275.8 bits (704), Expect = 1.4e-72
Identity = 197/636 (30.97%), Positives = 322/636 (50.63%), Query Frame = 0

Query: 81  VKDLLGRMTLEEKIGQMVQIDRS--------VANATVM----KDYFIGSVL----SGGGS 140
           V +L+ +M++ EKIGQM Q+D +          N T +    K Y+IGS L    SGG +
Sbjct: 80  VDNLMSKMSITEKIGQMTQLDITTLTSPNTITINETTLAYYAKTYYIGSYLNSPVSGGLA 139

Query: 141 VPLPDARAQDWVDMINDFQKGSL-SSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGAT 200
             +    +  W+DMIN  Q   +  S   IPMIYG+D+VHG N V+ AT+FPHN GL AT
Sbjct: 140 GDIHHINSSVWLDMINTIQTIVIEGSPNKIPMIYGLDSVHGANYVHKATLFPHNTGLAAT 199

Query: 201 RNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNM-TEII 260
            N +        T+ +  A GI + FAP L +   P W R YE++ EDP +   M    +
Sbjct: 200 FNIEHATTAAQITSKDTVAVGIPWVFAPVLGIGVQPLWSRIYETFGEDPYVASMMGAAAV 259

Query: 261 IGLQG-----EPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLS 320
            G QG     + P N              +  AKH+ G    T G +     I    L  
Sbjct: 260 RGFQGGNNSFDGPIN----------APSAVCTAKHYFGYSDPTSGKDRTAAWIPERMLRR 319

Query: 321 IHMPAYLDSII-KGVSSVMASYSSWNGVKMHANRELITGFLKGTLKFKGFVISDWEGLDR 380
             +P++ ++I   G  ++M +    NGV MH + + +T  L+G L+F+G  ++DW+ +++
Sbjct: 320 YFLPSFAEAITGAGAGTIMINSGEVNGVPMHTSYKYLTEVLRGELQFEGVAVTDWQDIEK 379

Query: 381 LTSTPHS--NYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRI 440
           L    H+  +   ++  A+ AGIDM M+P   + F   L  +V +  +P  R+D +V RI
Sbjct: 380 LVYFHHTAGSAEEAILQALDAGIDMSMVPLDLS-FPIILAEMVAAGTVPESRLDLSVRRI 439

Query: 441 LSVKFTMGLFESPL--GDYSLVNELGSQEHRDLARDAVRQSLVLLKNGKNDSDPLLPLSK 500
           L++K+ +GLF +P    + ++V+ +G  + R+ A     +S+ LL+N  N    +LPL+ 
Sbjct: 440 LNLKYALGLFSNPYPNPNAAIVDTIGQVQDREAAAATAEESITLLQNKNN----ILPLNT 499

Query: 501 KAPK-ILVAGTHADNLGYQCGGWTIAWQG-FSGNNATRGTSILAAIKSTVDPSTEVVFRE 560
              K +L+ G  AD++    GGW++ WQG +  +    GTSIL  ++   + + +   + 
Sbjct: 500 NTIKNVLLTGPSADSIRNLNGGWSVHWQGAYEDSEFPFGTSILTGLREITNDTADFNIQY 559

Query: 561 ---------------DPDSDFVKSNDFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVK 620
                          D   +  +S+D    +VVIGE P AET GD   L+M      +++
Sbjct: 560 TIGHEIGVPTNQTSIDEAVELAQSSD--VVVVVIGELPEAETPGDIYDLSMDPNEVLLLQ 619

Query: 621 NVCDSVKCVV-VVISGRPIVMEP-YVSSMDALVAAWLPGTE-GLGVTDALYGDHGFSGKL 664
            + D+ K VV +++  RP ++ P  V S  A++ A+LPG+E G  + + L G+   SG+L
Sbjct: 620 QLVDTGKPVVLILVEARPRILPPDLVYSCAAVLMAYLPGSEGGKPIANILMGNVNPSGRL 679

BLAST of Tan0013897 vs. ExPASy Swiss-Prot
Match: Q56078 (Periplasmic beta-glucosidase OS=Salmonella typhimurium (strain LT2 / SGSC1412 / ATCC 700720) OX=99287 GN=bglX PE=3 SV=2)

HSP 1 Score: 241.9 bits (616), Expect = 2.2e-62
Identity = 203/685 (29.64%), Positives = 312/685 (45.55%), Query Frame = 0

Query: 65  AENLKYKDPKQPVA--VRVKDLLGRMTLEEKIGQMVQIDRSVAN-----ATVMKDYFIGS 124
           AENL    P  P A    V DLL +MT++EKIGQ+  I     N       ++KD  +G+
Sbjct: 20  AENLFGNHPLTPEARDAFVTDLLKKMTVDEKIGQLRLISVGPDNPKEAIREMIKDGQVGA 79

Query: 125 VLSGGGSVPLPDAR-AQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPH 184
           + +   +V   D R  QD V  +         SRL IP+ +  D VHG       TVFP 
Sbjct: 80  IFN---TVTRQDIRQMQDQVMAL---------SRLKIPLFFAYDVVHGQR-----TVFPI 139

Query: 185 NVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQ 244
           ++GL ++ N D VR +G  +A E    G++ T+AP + V RDPRWGR  E + ED  +  
Sbjct: 140 SLGLASSFNLDAVRTVGRVSAYEAADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTS 199

Query: 245 NMTEIII-GLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGL 304
            M E ++  +QG+ PA+             V+   KHF   G    G   N   +    L
Sbjct: 200 IMGETMVKAMQGKSPAD----------RYSVMTSVKHFAAYGAVEGGKEYNTVDMSSQRL 259

Query: 305 LSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGTLKFKGFVISDWEGLD 364
            + +MP Y   +  G  +VM + +S NG    ++  L+   L+    FKG  +SD   + 
Sbjct: 260 FNDYMPPYKAGLDAGSGAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGITVSDHGAIK 319

Query: 365 RLTS-TPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRI 424
            L      ++   +V+ A+ AG+DM M    Y+++   L  L+KS  + M  +DDA   +
Sbjct: 320 ELIKHGTAADPEDAVRVALKAGVDMSMADEYYSKY---LPGLIKSGKVTMAELDDATRHV 379

Query: 425 LSVKFTMGLFESPLGDYS------LVNELGSQEHRDLARDAVRQSLVLLKNGKNDSDPLL 484
           L+VK+ MGLF  P           +     S+ HR  AR+  R+S+VLLKN        L
Sbjct: 380 LNVKYDMGLFNDPYSHLGPKESDPVDTNAESRLHRKEAREVARESVVLLKNRLE----TL 439

Query: 485 PLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVF 544
           PL KK+  I V G  AD+     G W+ A        A +  ++LA I++ V    ++++
Sbjct: 440 PL-KKSGTIAVVGPLADSQRDVMGSWSAA------GVANQSVTVLAGIQNAVGDGAKILY 499

Query: 545 RE-----------------------DP-------DSDFVKSNDFSYAIVVIGEAPYAETE 604
            +                       DP       D     +      + V+GE+     E
Sbjct: 500 AKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQGMAHE 559

Query: 605 GDSTTLTMLDPGPSIVKNVCDSVKC-----VVVVISGRPIVMEPYVSSMDALVAAWLPGT 664
             S T   +   P   +++  ++K      V+V+++GRP+ +       DA++  W  GT
Sbjct: 560 ASSRTNITI---PQSQRDLITALKATGKPLVLVLMNGRPLALVKEDQQADAILETWFAGT 619

Query: 665 E-GLGVTDALYGDHGFSGKLPRTWFKSVDQLP-----MNVG------------DRHYD-- 675
           E G  + D L+GD+  SGKLP ++ +SV Q+P     +N G             R++D  
Sbjct: 620 EGGNAIADVLFGDYNPSGKLPISFPRSVGQIPVYYSHLNTGRPYNPEKPNKYTSRYFDEA 660

BLAST of Tan0013897 vs. ExPASy Swiss-Prot
Match: P33363 (Periplasmic beta-glucosidase OS=Escherichia coli (strain K12) OX=83333 GN=bglX PE=3 SV=2)

HSP 1 Score: 231.5 bits (589), Expect = 3.0e-59
Identity = 196/680 (28.82%), Positives = 310/680 (45.59%), Query Frame = 0

Query: 65  AENLKYKDPKQPVA--VRVKDLLGRMTLEEKIGQMVQIDRSVAN-----ATVMKDYFIGS 124
           A++L    P  P A    V +LL +MT++EKIGQ+  I     N       ++KD  +G+
Sbjct: 20  ADDLFGNHPLTPEARDAFVTELLKKMTVDEKIGQLRLISVGPDNPKEAIREMIKDGQVGA 79

Query: 125 VLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHN 184
           + +   +V   D RA    D + +       SRL IP+ +  D +HG       TVFP +
Sbjct: 80  IFN---TVTRQDIRAMQ--DQVMEL------SRLKIPLFFAYDVLHGQR-----TVFPIS 139

Query: 185 VGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQN 244
           +GL ++ N D V+ +G  +A E    G++ T+AP + V RDPRWGR  E + ED  +   
Sbjct: 140 LGLASSFNLDAVKTVGRVSAYEAADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTST 199

Query: 245 MTEIII-GLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLL 304
           M + ++  +QG+ PA+             V+   KHF   G    G   N   +    L 
Sbjct: 200 MGKTMVEAMQGKSPAD----------RYSVMTSVKHFAAYGAVEGGKEYNTVDMSPQRLF 259

Query: 305 SIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGTLKFKGFVISDWEGLDR 364
           + +MP Y   +  G  +VM + +S NG    ++  L+   L+    FKG  +SD   +  
Sbjct: 260 NDYMPPYKAGLDAGSGAVMVALNSLNGTPATSDSWLLKDVLRDQWGFKGITVSDHGAIKE 319

Query: 365 LTS-TPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRIL 424
           L      ++   +V+ A+ +GI+M M    Y+++   L  L+KS  + M  +DDA   +L
Sbjct: 320 LIKHGTAADPEDAVRVALKSGINMSMSDEYYSKY---LPGLIKSGKVTMAELDDAARHVL 379

Query: 425 SVKFTMGLFESPLGDYS------LVNELGSQEHRDLARDAVRQSLVLLKNGKNDSDPLLP 484
           +VK+ MGLF  P           +     S+ HR  AR+  R+SLVLLKN        LP
Sbjct: 380 NVKYDMGLFNDPYSHLGPKESDPVDTNAESRLHRKEAREVARESLVLLKNRLE----TLP 439

Query: 485 LSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVF- 544
           L KK+  I V G  AD+     G W+ A        A +  ++L  IK+ V  + +V++ 
Sbjct: 440 L-KKSATIAVVGPLADSKRDVMGSWSAA------GVADQSVTVLTGIKNAVGENGKVLYA 499

Query: 545 ----------------------REDP-------DSDFVKSNDFSYAIVVIGEAPYAETEG 604
                                 + DP       D     +      + V+GEA     E 
Sbjct: 500 KGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQTAKQSDVVVAVVGEAQGMAHEA 559

Query: 605 DSTTLTMLDPGPSIVKNVCDSVKC-----VVVVISGRPIVMEPYVSSMDALVAAWLPGTE 664
            S T   +   P   +++  ++K      V+V+++GRP+ +       DA++  W  GTE
Sbjct: 560 SSRTDITI---PQSQRDLIAALKATGKPLVLVLMNGRPLALVKEDQQADAILETWFAGTE 619

Query: 665 -GLGVTDALYGDHGFSGKLPRTWFKSVDQLP-----MNVG------------DRHYD--- 671
            G  + D L+GD+  SGKLP ++ +SV Q+P     +N G             R++D   
Sbjct: 620 GGNAIADVLFGDYNPSGKLPMSFPRSVGQIPVYYSHLNTGRPYNADKPNKYTSRYFDEAN 656

BLAST of Tan0013897 vs. ExPASy Swiss-Prot
Match: T2KMH0 (Beta-xylosidase OS=Formosa agariphila (strain DSM 15362 / KCTC 12365 / LMG 23005 / KMM 3901) OX=1347342 GN=BN863_22130 PE=1 SV=1)

HSP 1 Score: 209.9 bits (533), Expect = 9.3e-53
Identity = 178/585 (30.43%), Positives = 277/585 (47.35%), Query Frame = 0

Query: 143 QKGSLSSRLGIPMIYGIDAVHGHNNVY----NATVFPHNVGLGATRNPDLVRRIGAATAL 202
           Q    + RLGIP +   +A+HG   V     N TV+P  V   +T  P+L++++ + TA 
Sbjct: 55  QDAPANERLGIPSMKYGEALHGLWLVLDYYGNTTVYPQAVAAASTWEPELIKKMASQTAR 114

Query: 203 EVRATGISYTFAPCLAV-CRDPRWGRCYESYSEDPRIVQNM-TEIIIGLQGEPPANFRKG 262
           E RA G+++ ++P L V   D R+GR  ESY EDP +V  M    I GLQG     F + 
Sbjct: 115 EARALGVTHCYSPNLDVYAGDARYGRVEESYGEDPYLVSRMGVAFIEGLQGTGEEQFDE- 174

Query: 263 IPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIK-GVSSVM 322
                    VIA AKHFVG      GIN   + +    L  +++P +  ++ + GV SVM
Sbjct: 175 -------NHVIATAKHFVGYPENRRGINGGFSDMSERRLREVYLPPFEAAVKEAGVGSVM 234

Query: 323 ASYSSWNGVKMHANRELITGFLKGTLKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAIL- 382
             +  +NGV  H N  L+   L+  L F GF++SD   + RL  T H       +AAIL 
Sbjct: 235 PGHQDFNGVPCHMNTWLLKDILRDELGFDGFIVSDNNDVGRL-ETMHFIAENRTEAAILG 294

Query: 383 --AGIDMVMIPYKYAEFIDDLTFLVKSNVIP----MDRIDDAVGRILSVKFTMGLFES-P 442
             AG+DM ++  K  E     T ++K  ++     M  ID A  RIL+ K+ +GLF++ P
Sbjct: 295 LKAGVDMDLVIGKNVELATYHTNILKDTILKNPALMKYIDQATSRILTAKYKLGLFDAKP 354

Query: 443 LGDYSLVNELGSQEHRDLARDAVRQSLVLLKNGKNDSDPLLPLS-KKAPKILVAGTHADN 502
               +   E G+ EHR+ A +   +S+++LKN  N    LLPL   K   + V G +A  
Sbjct: 355 KKIDTETVETGTDEHREFALELAEKSIIMLKNDNN----LLPLDVSKIKSLAVIGPNAHE 414

Query: 503 LGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFREDPDSDFVKSNDFSYAI 562
              + G + +   G+SG       S+L  +K  V    ++ + +  D D      F  AI
Sbjct: 415 ERPKKGTYKLL-GGYSG-LPPYYVSVLDGLKKKVGEHVKINYAKGCDIDSFSKEGFPEAI 474

Query: 563 ----------VVIGEAPYAETE-GDSTTLTMLDPGPSIVKNVCDSVKCVVVV-ISGRPIV 622
                     +V+G +     E GD   L +      +V+ +  + K V+VV I+GRP+ 
Sbjct: 475 SAAKNSDAVVLVVGSSHKTCGEGGDRADLDLYGVQKELVEAIHKTGKPVIVVLINGRPLS 534

Query: 623 MEPYVSSMDALVAAWLPGTE-GLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNVGDR--- 682
           +     ++ +++  W  G   G  V + ++GD    GKL  ++ + V Q+P+   +R   
Sbjct: 535 INYIAENIPSILETWYGGMRAGDAVANVIFGDVNPGGKLTMSFPRDVGQVPVTYLERPDF 594

BLAST of Tan0013897 vs. NCBI nr
Match: XP_022968400.1 (uncharacterized protein LOC111467651 isoform X2 [Cucurbita maxima])

HSP 1 Score: 1299.6 bits (3362), Expect = 0.0e+00
Identity = 630/675 (93.33%), Positives = 654/675 (96.89%), Query Frame = 0

Query: 25  LPFLFYLCELGSRGKKMAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDL 84
           LP+ +  C L S GKKMAKIFVQ+ VILCLGWWWWAIMV AENLKYKDPKQPV+VRVKDL
Sbjct: 6   LPWKWKECGLNSSGKKMAKIFVQVVVILCLGWWWWAIMVDAENLKYKDPKQPVSVRVKDL 65

Query: 85  LGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQK 144
           LGRMTLEEKIGQMVQIDRSVANATVMK+YFIGSVLSGGGSVPLPDARAQDWVDMINDFQK
Sbjct: 66  LGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMINDFQK 125

Query: 145 GSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATG 204
           GSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDL+RRIGAATALEVRATG
Sbjct: 126 GSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATG 185

Query: 205 ISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNMTEIIIGLQGEPPANFRKGIPYVGGTK 264
           ISYTFAPCLAVCRDPRWGRCYESYSEDP++VQNMTEIIIGLQGEPPAN+RKGIPYVGGTK
Sbjct: 186 ISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTEIIIGLQGEPPANYRKGIPYVGGTK 245

Query: 265 KVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGV 324
           KVIACAKHFVGDGGTTHGINENNTVIDRHGLL IHMPAYLDSIIKGVSSVM SYSSWNGV
Sbjct: 246 KVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGV 305

Query: 325 KMHANRELITGFLKGTLKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPY 384
           KMHANR+LIT FLKGTLKFKGFVISDWEGLDR+TSTPHSNYTYSV+AAI AGIDMVM+PY
Sbjct: 306 KMHANRDLITRFLKGTLKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPY 365

Query: 385 KYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQEHRD 444
           KYAEFIDDL  LVK+NV+PMDRIDDAV RILSVKFTMGLFESPLGDYSLVNELGSQ HRD
Sbjct: 366 KYAEFIDDLKLLVKNNVVPMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQAHRD 425

Query: 445 LARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGN 504
           LARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILV GTHADNLGYQCGGWTIAWQGFSGN
Sbjct: 426 LARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVVGTHADNLGYQCGGWTIAWQGFSGN 485

Query: 505 NATRGTSILAAIKSTVDPSTEVVFREDPDSDFVKSNDFSYAIVVIGEAPYAETEGDSTTL 564
           NATRGT+ILAAIKSTVDPSTEVVFREDPDSDFVKSN FSYAIVVIGEAPYAET GDSTTL
Sbjct: 486 NATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSNGFSYAIVVIGEAPYAETGGDSTTL 545

Query: 565 TMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYG 624
           TMLDPGPSI+KNVC+SVKCVVVVISGRPIVMEPY+SSMDALVAAWLPGTEGLGVTDALYG
Sbjct: 546 TMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSMDALVAAWLPGTEGLGVTDALYG 605

Query: 625 DHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPS 684
           DHGFSGKLPRTWFKSVDQLPMN GDRHYDPLFPLGFGLTTGSVKD+VARSTSAG +GTPS
Sbjct: 606 DHGFSGKLPRTWFKSVDQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGTRGTPS 665

Query: 685 FIAMIIATIAICILQ 700
           FIAMI+ATIA+C+LQ
Sbjct: 666 FIAMIVATIAVCVLQ 680

BLAST of Tan0013897 vs. NCBI nr
Match: XP_023541708.1 (uncharacterized protein LOC111801784 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1298.5 bits (3359), Expect = 0.0e+00
Identity = 629/668 (94.16%), Positives = 650/668 (97.31%), Query Frame = 0

Query: 32  CELGSRGKKMAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLE 91
           C   S GKKMAKIFVQ+ VILCLGWWWWAIMVGAENLKYKDPKQPV+VRVKDLLGRMTLE
Sbjct: 15  CGFNSSGKKMAKIFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLE 74

Query: 92  EKIGQMVQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRL 151
           EKIGQMVQIDRSVANATVMK+YFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRL
Sbjct: 75  EKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRL 134

Query: 152 GIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAP 211
           GIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDL+RRIGAATALEVRATGISYTFAP
Sbjct: 135 GIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAP 194

Query: 212 CLAVCRDPRWGRCYESYSEDPRIVQNMTEIIIGLQGEPPANFRKGIPYVGGTKKVIACAK 271
           CLAVCRDPRWGRCYESYSEDP++VQNMTEIIIGLQGEPPAN+RKGIPYVGGTKKVIACAK
Sbjct: 195 CLAVCRDPRWGRCYESYSEDPKLVQNMTEIIIGLQGEPPANYRKGIPYVGGTKKVIACAK 254

Query: 272 HFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRE 331
           HFVGDGGTTHGINENNTVIDRHGLL +HMPAYLDSIIKGVSSVM SYSSWNGVKMHANRE
Sbjct: 255 HFVGDGGTTHGINENNTVIDRHGLLGVHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRE 314

Query: 332 LITGFLKGTLKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFID 391
           LIT FLKGTLKFKGFVISDWEGLDR+TSTPHSNYTYSV+AAI AGIDMVM+PYKY EFID
Sbjct: 315 LITRFLKGTLKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYVEFID 374

Query: 392 DLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQEHRDLARDAVR 451
           DLT LVK+NV+ MDRIDDAV RILSVKFTMGLFESPLGDYSLVNELGSQ HRDLARDAVR
Sbjct: 375 DLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQAHRDLARDAVR 434

Query: 452 QSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTS 511
           QSLVLLKNGKNDSDPLLPLSKKAPKILV+GTHADNLGYQCGGWTIAWQGFSGNNATRGT+
Sbjct: 435 QSLVLLKNGKNDSDPLLPLSKKAPKILVSGTHADNLGYQCGGWTIAWQGFSGNNATRGTT 494

Query: 512 ILAAIKSTVDPSTEVVFREDPDSDFVKSNDFSYAIVVIGEAPYAETEGDSTTLTMLDPGP 571
           ILAAIKSTVDPSTEVVFREDPDSDFVKSN FSYAIVVIGEAPYAETEGDSTTLTMLDPGP
Sbjct: 495 ILAAIKSTVDPSTEVVFREDPDSDFVKSNGFSYAIVVIGEAPYAETEGDSTTLTMLDPGP 554

Query: 572 SIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGK 631
           SI+KNVC+SVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGK
Sbjct: 555 SIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGK 614

Query: 632 LPRTWFKSVDQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPSFIAMIIA 691
           LPRTWFKSVDQLPMN GDRHYDPLFPLGFGLTTGSVKD+VARSTSAG + TPSFIAMI+A
Sbjct: 615 LPRTWFKSVDQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVA 674

Query: 692 TIAICILQ 700
           TIAIC+LQ
Sbjct: 675 TIAICVLQ 682

BLAST of Tan0013897 vs. NCBI nr
Match: XP_022945501.1 (uncharacterized protein LOC111449719 isoform X1 [Cucurbita moschata])

HSP 1 Score: 1295.0 bits (3350), Expect = 0.0e+00
Identity = 631/675 (93.48%), Positives = 651/675 (96.44%), Query Frame = 0

Query: 25  LPFLFYLCELGSRGKKMAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDL 84
           LP+ +  C   S GKKMAK FVQ+ VILCLGWWWWAIMVGAENLKYKDPKQPV+VRVKDL
Sbjct: 6   LPWKWKECGFNSSGKKMAKNFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDL 65

Query: 85  LGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQK 144
           LGRMTLEEKIGQMVQIDRSVANATVMK+YFIGSVLSGGGSVPLPDARAQDWVDMINDFQK
Sbjct: 66  LGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMINDFQK 125

Query: 145 GSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATG 204
           GSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDL+RRIGAATALEVRATG
Sbjct: 126 GSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATG 185

Query: 205 ISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNMTEIIIGLQGEPPANFRKGIPYVGGTK 264
           ISYTFAPCLAVCRDPRWGRCYESYSEDP++VQNMTEIIIGLQGEPPANFRKGIPYVGGTK
Sbjct: 186 ISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTEIIIGLQGEPPANFRKGIPYVGGTK 245

Query: 265 KVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGV 324
           KVIACAKHFVGDGGTTHGINENNTVIDRHGLL IHMPAYLDSIIKGVSSVM SYSSWNGV
Sbjct: 246 KVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGV 305

Query: 325 KMHANRELITGFLKGTLKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPY 384
           KMHANRELIT FLK TLKFKGFVISDWEGLDR+TSTPHSNYTYSV+AAI AGIDMVM+PY
Sbjct: 306 KMHANRELITRFLKSTLKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPY 365

Query: 385 KYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQEHRD 444
           KYAEFIDDLT LVK+NV+ MDRIDDAV RILSVKFTMGLFESPLGDYSLVNELGSQ HRD
Sbjct: 366 KYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQAHRD 425

Query: 445 LARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGN 504
           LARDAVRQSLVLLKNGKNDSD LLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGN
Sbjct: 426 LARDAVRQSLVLLKNGKNDSDSLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGN 485

Query: 505 NATRGTSILAAIKSTVDPSTEVVFREDPDSDFVKSNDFSYAIVVIGEAPYAETEGDSTTL 564
           NATRGT+IL AIKSTVDPSTEVVFREDPDSDFVKSN FSYAIVVIGEAPYAETEGDSTTL
Sbjct: 486 NATRGTTILTAIKSTVDPSTEVVFREDPDSDFVKSNGFSYAIVVIGEAPYAETEGDSTTL 545

Query: 565 TMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYG 624
           TMLDPGPSI+KNVC+SVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYG
Sbjct: 546 TMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYG 605

Query: 625 DHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPS 684
           DHGFSGKLPRTWFKSVDQLPMN GDRHYDPLFPLGFGLTTGSVKD+VARSTSAG + TPS
Sbjct: 606 DHGFSGKLPRTWFKSVDQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPS 665

Query: 685 FIAMIIATIAICILQ 700
           FIAMI+ATIAIC+LQ
Sbjct: 666 FIAMIVATIAICVLQ 680

BLAST of Tan0013897 vs. NCBI nr
Match: KAG7012970.1 (hypothetical protein SDJN02_25724 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1288.5 bits (3333), Expect = 0.0e+00
Identity = 627/659 (95.14%), Positives = 645/659 (97.88%), Query Frame = 0

Query: 41  MAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQI 100
           MAKIFVQ+ VILCLGWWWWAIMVGAENLKYKDPKQPV+VRVKDLLGRMTLEEKIGQMVQI
Sbjct: 1   MAKIFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQI 60

Query: 101 DRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGID 160
           DRSVANATVMK+YFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGID
Sbjct: 61  DRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGID 120

Query: 161 AVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPR 220
           AVHGHNNVYNATVFPHNVGLGATRNPDL+RRIGAATALEVRATGISYTFAPCLAVCRDPR
Sbjct: 121 AVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPR 180

Query: 221 WGRCYESYSEDPRIVQNMTEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTT 280
           WGRCYESYSEDP++VQNMTEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTT
Sbjct: 181 WGRCYESYSEDPKLVQNMTEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTT 240

Query: 281 HGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGT 340
           HGINENNTVIDRHGLL IHMPAYLDSIIKGVSSVM SYSSWNGVKMHANRELIT FLK T
Sbjct: 241 HGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKST 300

Query: 341 LKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSN 400
           LKFKGFVISDWEGLDR+TSTPHSNYTYSV+AAI AGIDMVM+PYKYAEFIDDLT LVK+N
Sbjct: 301 LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNN 360

Query: 401 VIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQEHRDLARDAVRQSLVLLKNG 460
           V+ MDRIDDAV RILSVKFTMGLFESPLGDYSLVNELGSQ HRDLARDAVRQSLVLLKNG
Sbjct: 361 VVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQAHRDLARDAVRQSLVLLKNG 420

Query: 461 KNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTV 520
           KNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGT+ILAAIKSTV
Sbjct: 421 KNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTV 480

Query: 521 DPSTEVVFREDPDSDFVKSNDFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDS 580
           DPSTEVVFREDPDSDFVKSN FSYAIVVIGEAPYAETEGDSTTLTMLDPGPSI+KNVC+S
Sbjct: 481 DPSTEVVFREDPDSDFVKSNGFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCES 540

Query: 581 VKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV 640
           VKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
Sbjct: 541 VKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV 600

Query: 641 DQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPSFIAMIIATIAICILQ 700
           DQLPMN GDRHYDPLFPLGFGLTTGSVKD+VARSTSAG + TPSFIAMI+ATIAIC+LQ
Sbjct: 601 DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVLQ 659

BLAST of Tan0013897 vs. NCBI nr
Match: XP_023541709.1 (uncharacterized protein LOC111801784 isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1286.6 bits (3328), Expect = 0.0e+00
Identity = 624/659 (94.69%), Positives = 645/659 (97.88%), Query Frame = 0

Query: 41  MAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQI 100
           MAKIFVQ+ VILCLGWWWWAIMVGAENLKYKDPKQPV+VRVKDLLGRMTLEEKIGQMVQI
Sbjct: 1   MAKIFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQI 60

Query: 101 DRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGID 160
           DRSVANATVMK+YFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGID
Sbjct: 61  DRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGID 120

Query: 161 AVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPR 220
           AVHGHNNVYNATVFPHNVGLGATRNPDL+RRIGAATALEVRATGISYTFAPCLAVCRDPR
Sbjct: 121 AVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPR 180

Query: 221 WGRCYESYSEDPRIVQNMTEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTT 280
           WGRCYESYSEDP++VQNMTEIIIGLQGEPPAN+RKGIPYVGGTKKVIACAKHFVGDGGTT
Sbjct: 181 WGRCYESYSEDPKLVQNMTEIIIGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTT 240

Query: 281 HGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGT 340
           HGINENNTVIDRHGLL +HMPAYLDSIIKGVSSVM SYSSWNGVKMHANRELIT FLKGT
Sbjct: 241 HGINENNTVIDRHGLLGVHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKGT 300

Query: 341 LKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSN 400
           LKFKGFVISDWEGLDR+TSTPHSNYTYSV+AAI AGIDMVM+PYKY EFIDDLT LVK+N
Sbjct: 301 LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYVEFIDDLTLLVKNN 360

Query: 401 VIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQEHRDLARDAVRQSLVLLKNG 460
           V+ MDRIDDAV RILSVKFTMGLFESPLGDYSLVNELGSQ HRDLARDAVRQSLVLLKNG
Sbjct: 361 VVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQAHRDLARDAVRQSLVLLKNG 420

Query: 461 KNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTV 520
           KNDSDPLLPLSKKAPKILV+GTHADNLGYQCGGWTIAWQGFSGNNATRGT+ILAAIKSTV
Sbjct: 421 KNDSDPLLPLSKKAPKILVSGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTV 480

Query: 521 DPSTEVVFREDPDSDFVKSNDFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDS 580
           DPSTEVVFREDPDSDFVKSN FSYAIVVIGEAPYAETEGDSTTLTMLDPGPSI+KNVC+S
Sbjct: 481 DPSTEVVFREDPDSDFVKSNGFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCES 540

Query: 581 VKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV 640
           VKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
Sbjct: 541 VKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV 600

Query: 641 DQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPSFIAMIIATIAICILQ 700
           DQLPMN GDRHYDPLFPLGFGLTTGSVKD+VARSTSAG + TPSFIAMI+ATIAIC+LQ
Sbjct: 601 DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVLQ 659

BLAST of Tan0013897 vs. ExPASy TrEMBL
Match: A0A6J1HX36 (uncharacterized protein LOC111467651 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111467651 PE=3 SV=1)

HSP 1 Score: 1299.6 bits (3362), Expect = 0.0e+00
Identity = 630/675 (93.33%), Positives = 654/675 (96.89%), Query Frame = 0

Query: 25  LPFLFYLCELGSRGKKMAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDL 84
           LP+ +  C L S GKKMAKIFVQ+ VILCLGWWWWAIMV AENLKYKDPKQPV+VRVKDL
Sbjct: 6   LPWKWKECGLNSSGKKMAKIFVQVVVILCLGWWWWAIMVDAENLKYKDPKQPVSVRVKDL 65

Query: 85  LGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQK 144
           LGRMTLEEKIGQMVQIDRSVANATVMK+YFIGSVLSGGGSVPLPDARAQDWVDMINDFQK
Sbjct: 66  LGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMINDFQK 125

Query: 145 GSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATG 204
           GSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDL+RRIGAATALEVRATG
Sbjct: 126 GSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATG 185

Query: 205 ISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNMTEIIIGLQGEPPANFRKGIPYVGGTK 264
           ISYTFAPCLAVCRDPRWGRCYESYSEDP++VQNMTEIIIGLQGEPPAN+RKGIPYVGGTK
Sbjct: 186 ISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTEIIIGLQGEPPANYRKGIPYVGGTK 245

Query: 265 KVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGV 324
           KVIACAKHFVGDGGTTHGINENNTVIDRHGLL IHMPAYLDSIIKGVSSVM SYSSWNGV
Sbjct: 246 KVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGV 305

Query: 325 KMHANRELITGFLKGTLKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPY 384
           KMHANR+LIT FLKGTLKFKGFVISDWEGLDR+TSTPHSNYTYSV+AAI AGIDMVM+PY
Sbjct: 306 KMHANRDLITRFLKGTLKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPY 365

Query: 385 KYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQEHRD 444
           KYAEFIDDL  LVK+NV+PMDRIDDAV RILSVKFTMGLFESPLGDYSLVNELGSQ HRD
Sbjct: 366 KYAEFIDDLKLLVKNNVVPMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQAHRD 425

Query: 445 LARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGN 504
           LARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILV GTHADNLGYQCGGWTIAWQGFSGN
Sbjct: 426 LARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVVGTHADNLGYQCGGWTIAWQGFSGN 485

Query: 505 NATRGTSILAAIKSTVDPSTEVVFREDPDSDFVKSNDFSYAIVVIGEAPYAETEGDSTTL 564
           NATRGT+ILAAIKSTVDPSTEVVFREDPDSDFVKSN FSYAIVVIGEAPYAET GDSTTL
Sbjct: 486 NATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSNGFSYAIVVIGEAPYAETGGDSTTL 545

Query: 565 TMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYG 624
           TMLDPGPSI+KNVC+SVKCVVVVISGRPIVMEPY+SSMDALVAAWLPGTEGLGVTDALYG
Sbjct: 546 TMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSMDALVAAWLPGTEGLGVTDALYG 605

Query: 625 DHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPS 684
           DHGFSGKLPRTWFKSVDQLPMN GDRHYDPLFPLGFGLTTGSVKD+VARSTSAG +GTPS
Sbjct: 606 DHGFSGKLPRTWFKSVDQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGTRGTPS 665

Query: 685 FIAMIIATIAICILQ 700
           FIAMI+ATIA+C+LQ
Sbjct: 666 FIAMIVATIAVCVLQ 680

BLAST of Tan0013897 vs. ExPASy TrEMBL
Match: A0A6J1G118 (uncharacterized protein LOC111449719 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111449719 PE=3 SV=1)

HSP 1 Score: 1295.0 bits (3350), Expect = 0.0e+00
Identity = 631/675 (93.48%), Positives = 651/675 (96.44%), Query Frame = 0

Query: 25  LPFLFYLCELGSRGKKMAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDL 84
           LP+ +  C   S GKKMAK FVQ+ VILCLGWWWWAIMVGAENLKYKDPKQPV+VRVKDL
Sbjct: 6   LPWKWKECGFNSSGKKMAKNFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDL 65

Query: 85  LGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQK 144
           LGRMTLEEKIGQMVQIDRSVANATVMK+YFIGSVLSGGGSVPLPDARAQDWVDMINDFQK
Sbjct: 66  LGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMINDFQK 125

Query: 145 GSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATG 204
           GSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDL+RRIGAATALEVRATG
Sbjct: 126 GSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATG 185

Query: 205 ISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNMTEIIIGLQGEPPANFRKGIPYVGGTK 264
           ISYTFAPCLAVCRDPRWGRCYESYSEDP++VQNMTEIIIGLQGEPPANFRKGIPYVGGTK
Sbjct: 186 ISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTEIIIGLQGEPPANFRKGIPYVGGTK 245

Query: 265 KVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGV 324
           KVIACAKHFVGDGGTTHGINENNTVIDRHGLL IHMPAYLDSIIKGVSSVM SYSSWNGV
Sbjct: 246 KVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGV 305

Query: 325 KMHANRELITGFLKGTLKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPY 384
           KMHANRELIT FLK TLKFKGFVISDWEGLDR+TSTPHSNYTYSV+AAI AGIDMVM+PY
Sbjct: 306 KMHANRELITRFLKSTLKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPY 365

Query: 385 KYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQEHRD 444
           KYAEFIDDLT LVK+NV+ MDRIDDAV RILSVKFTMGLFESPLGDYSLVNELGSQ HRD
Sbjct: 366 KYAEFIDDLTLLVKNNVVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQAHRD 425

Query: 445 LARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGN 504
           LARDAVRQSLVLLKNGKNDSD LLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGN
Sbjct: 426 LARDAVRQSLVLLKNGKNDSDSLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGN 485

Query: 505 NATRGTSILAAIKSTVDPSTEVVFREDPDSDFVKSNDFSYAIVVIGEAPYAETEGDSTTL 564
           NATRGT+IL AIKSTVDPSTEVVFREDPDSDFVKSN FSYAIVVIGEAPYAETEGDSTTL
Sbjct: 486 NATRGTTILTAIKSTVDPSTEVVFREDPDSDFVKSNGFSYAIVVIGEAPYAETEGDSTTL 545

Query: 565 TMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYG 624
           TMLDPGPSI+KNVC+SVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYG
Sbjct: 546 TMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYG 605

Query: 625 DHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPS 684
           DHGFSGKLPRTWFKSVDQLPMN GDRHYDPLFPLGFGLTTGSVKD+VARSTSAG + TPS
Sbjct: 606 DHGFSGKLPRTWFKSVDQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPS 665

Query: 685 FIAMIIATIAICILQ 700
           FIAMI+ATIAIC+LQ
Sbjct: 666 FIAMIVATIAICVLQ 680

BLAST of Tan0013897 vs. ExPASy TrEMBL
Match: A0A6J1HZJ2 (uncharacterized protein LOC111467651 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111467651 PE=3 SV=1)

HSP 1 Score: 1284.2 bits (3322), Expect = 0.0e+00
Identity = 622/659 (94.39%), Positives = 644/659 (97.72%), Query Frame = 0

Query: 41  MAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQI 100
           MAKIFVQ+ VILCLGWWWWAIMV AENLKYKDPKQPV+VRVKDLLGRMTLEEKIGQMVQI
Sbjct: 1   MAKIFVQVVVILCLGWWWWAIMVDAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQI 60

Query: 101 DRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGID 160
           DRSVANATVMK+YFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGID
Sbjct: 61  DRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGID 120

Query: 161 AVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPR 220
           AVHGHNNVYNATVFPHNVGLGATRNPDL+RRIGAATALEVRATGISYTFAPCLAVCRDPR
Sbjct: 121 AVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPR 180

Query: 221 WGRCYESYSEDPRIVQNMTEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTT 280
           WGRCYESYSEDP++VQNMTEIIIGLQGEPPAN+RKGIPYVGGTKKVIACAKHFVGDGGTT
Sbjct: 181 WGRCYESYSEDPKLVQNMTEIIIGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTT 240

Query: 281 HGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGT 340
           HGINENNTVIDRHGLL IHMPAYLDSIIKGVSSVM SYSSWNGVKMHANR+LIT FLKGT
Sbjct: 241 HGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRDLITRFLKGT 300

Query: 341 LKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSN 400
           LKFKGFVISDWEGLDR+TSTPHSNYTYSV+AAI AGIDMVM+PYKYAEFIDDL  LVK+N
Sbjct: 301 LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLKLLVKNN 360

Query: 401 VIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQEHRDLARDAVRQSLVLLKNG 460
           V+PMDRIDDAV RILSVKFTMGLFESPLGDYSLVNELGSQ HRDLARDAVRQSLVLLKNG
Sbjct: 361 VVPMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQAHRDLARDAVRQSLVLLKNG 420

Query: 461 KNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTV 520
           KNDSDPLLPLSKKAPKILV GTHADNLGYQCGGWTIAWQGFSGNNATRGT+ILAAIKSTV
Sbjct: 421 KNDSDPLLPLSKKAPKILVVGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILAAIKSTV 480

Query: 521 DPSTEVVFREDPDSDFVKSNDFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDS 580
           DPSTEVVFREDPDSDFVKSN FSYAIVVIGEAPYAET GDSTTLTMLDPGPSI+KNVC+S
Sbjct: 481 DPSTEVVFREDPDSDFVKSNGFSYAIVVIGEAPYAETGGDSTTLTMLDPGPSIIKNVCES 540

Query: 581 VKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV 640
           VKCVVVVISGRPIVMEPY+SSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
Sbjct: 541 VKCVVVVISGRPIVMEPYISSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV 600

Query: 641 DQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPSFIAMIIATIAICILQ 700
           DQLPMN GDRHYDPLFPLGFGLTTGSVKD+VARSTSAG +GTPSFIAMI+ATIA+C+LQ
Sbjct: 601 DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGTRGTPSFIAMIVATIAVCVLQ 659

BLAST of Tan0013897 vs. ExPASy TrEMBL
Match: A0A6J1G143 (uncharacterized protein LOC111449719 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111449719 PE=3 SV=1)

HSP 1 Score: 1281.2 bits (3314), Expect = 0.0e+00
Identity = 624/659 (94.69%), Positives = 642/659 (97.42%), Query Frame = 0

Query: 41  MAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQI 100
           MAK FVQ+ VILCLGWWWWAIMVGAENLKYKDPKQPV+VRVKDLLGRMTLEEKIGQMVQI
Sbjct: 1   MAKNFVQVVVILCLGWWWWAIMVGAENLKYKDPKQPVSVRVKDLLGRMTLEEKIGQMVQI 60

Query: 101 DRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGID 160
           DRSVANATVMK+YFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGID
Sbjct: 61  DRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGID 120

Query: 161 AVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPR 220
           AVHGHNNVYNATVFPHNVGLGATRNPDL+RRIGAATALEVRATGISYTFAPCLAVCRDPR
Sbjct: 121 AVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALEVRATGISYTFAPCLAVCRDPR 180

Query: 221 WGRCYESYSEDPRIVQNMTEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTT 280
           WGRCYESYSEDP++VQNMTEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTT
Sbjct: 181 WGRCYESYSEDPKLVQNMTEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTT 240

Query: 281 HGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGT 340
           HGINENNTVIDRHGLL IHMPAYLDSIIKGVSSVM SYSSWNGVKMHANRELIT FLK T
Sbjct: 241 HGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYSSWNGVKMHANRELITRFLKST 300

Query: 341 LKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSN 400
           LKFKGFVISDWEGLDR+TSTPHSNYTYSV+AAI AGIDMVM+PYKYAEFIDDLT LVK+N
Sbjct: 301 LKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDMVMVPYKYAEFIDDLTLLVKNN 360

Query: 401 VIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQEHRDLARDAVRQSLVLLKNG 460
           V+ MDRIDDAV RILSVKFTMGLFESPLGDYSLVNELGSQ HRDLARDAVRQSLVLLKNG
Sbjct: 361 VVSMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGSQAHRDLARDAVRQSLVLLKNG 420

Query: 461 KNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTV 520
           KNDSD LLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGT+IL AIKSTV
Sbjct: 421 KNDSDSLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTTILTAIKSTV 480

Query: 521 DPSTEVVFREDPDSDFVKSNDFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDS 580
           DPSTEVVFREDPDSDFVKSN FSYAIVVIGEAPYAETEGDSTTLTMLDPGPSI+KNVC+S
Sbjct: 481 DPSTEVVFREDPDSDFVKSNGFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIIKNVCES 540

Query: 581 VKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV 640
           VKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV
Sbjct: 541 VKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV 600

Query: 641 DQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPSFIAMIIATIAICILQ 700
           DQLPMN GDRHYDPLFPLGFGLTTGSVKD+VARSTSAG + TPSFIAMI+ATIAIC+LQ
Sbjct: 601 DQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGAQATPSFIAMIVATIAICVLQ 659

BLAST of Tan0013897 vs. ExPASy TrEMBL
Match: A0A1S3BGE4 (beta-glucosidase BoGH3B isoform X1 OS=Cucumis melo OX=3656 GN=LOC103489355 PE=3 SV=1)

HSP 1 Score: 1262.7 bits (3266), Expect = 0.0e+00
Identity = 610/667 (91.45%), Positives = 638/667 (95.65%), Query Frame = 0

Query: 32  CELGSRGKKMAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLE 91
           C L ++ KKMAKIFVQ+ VILCLGW WWA MV AENLKYKDPKQPV VRVKDLLGRMTLE
Sbjct: 111 CGLNTQAKKMAKIFVQVVVILCLGWLWWATMVDAENLKYKDPKQPVGVRVKDLLGRMTLE 170

Query: 92  EKIGQMVQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRL 151
           EKIGQMVQIDRSVANATVMKDYFIGS+LSGGGSVPLPDARA+DWVDMINDFQKGSLSSRL
Sbjct: 171 EKIGQMVQIDRSVANATVMKDYFIGSILSGGGSVPLPDARAEDWVDMINDFQKGSLSSRL 230

Query: 152 GIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAP 211
           GIPM YGIDAVHGHNNVYNATVFPHNVGLGATRNPDL RRIGAATALEVRATGISYTFAP
Sbjct: 231 GIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLARRIGAATALEVRATGISYTFAP 290

Query: 212 CLAVCRDPRWGRCYESYSEDPRIVQNMTEIIIGLQGEPPANFRKGIPYVGGTKKVIACAK 271
           CLAVCRDPRWGRCYESYSEDP+IV+ MTEIIIGLQGEPPAN+RKG PYVGGTKKVIACAK
Sbjct: 291 CLAVCRDPRWGRCYESYSEDPKIVKEMTEIIIGLQGEPPANYRKGTPYVGGTKKVIACAK 350

Query: 272 HFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRE 331
           HFVGDGGTTHGINENNTVI+RHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRE
Sbjct: 351 HFVGDGGTTHGINENNTVINRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRE 410

Query: 332 LITGFLKGTLKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFID 391
           LIT FLKG LKFKGFVISDWEGLDR+TSTPHSNYTYSV+AAILAGIDMVMIPYKYAEFID
Sbjct: 411 LITDFLKGALKFKGFVISDWEGLDRITSTPHSNYTYSVQAAILAGIDMVMIPYKYAEFID 470

Query: 392 DLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQEHRDLARDAVR 451
           DL FLVKSNVIPMDRIDDAVGRIL+VKFTMGLFESP+ DYSLVNELGSQ HRDLARDAVR
Sbjct: 471 DLKFLVKSNVIPMDRIDDAVGRILTVKFTMGLFESPMADYSLVNELGSQAHRDLARDAVR 530

Query: 452 QSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTS 511
           QSLVLLKNGKNDS PLLPLSKK+PKILVAGTHADNLGYQCGGWTIAWQGFSGNN TRGT+
Sbjct: 531 QSLVLLKNGKNDSKPLLPLSKKSPKILVAGTHADNLGYQCGGWTIAWQGFSGNNGTRGTT 590

Query: 512 ILAAIKSTVDPSTEVVFREDPDSDFVKSNDFSYAIVVIGEAPYAETEGDSTTLTMLDPGP 571
           ILAAIKSTVDPSTEVVFREDPDSDFVKSNDFSYAIVVIGEAPYAET GDSTTLTMLDPGP
Sbjct: 591 ILAAIKSTVDPSTEVVFREDPDSDFVKSNDFSYAIVVIGEAPYAETGGDSTTLTMLDPGP 650

Query: 572 SIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGK 631
           +I+KNVCD V+CVV++ISGRPIV+EPY+SS+DALVAAWLPGTEG GVTDALYGDHGFSGK
Sbjct: 651 NIIKNVCDHVECVVILISGRPIVIEPYISSIDALVAAWLPGTEGQGVTDALYGDHGFSGK 710

Query: 632 LPRTWFKSVDQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPSFIAMIIA 691
           LPRTWFKSVDQLPMNVGD HYDPLFP GFGLTTGSVKD++ARSTSAGI+GTPS IA I+ 
Sbjct: 711 LPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTGSVKDIIARSTSAGIRGTPSLIASIVV 770

Query: 692 TIAICIL 699
            I +CIL
Sbjct: 771 AITLCIL 777

BLAST of Tan0013897 vs. TAIR 10
Match: AT5G04885.1 (Glycosyl hydrolase family protein)

HSP 1 Score: 991.9 bits (2563), Expect = 2.7e-289
Identity = 466/657 (70.93%), Positives = 559/657 (85.08%), Query Frame = 0

Query: 41  MAKIFVQLFVILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQI 100
           M++  V++  +L     W       E L YKDPKQ V+ RV DL GRMTLEEKIGQMVQI
Sbjct: 1   MSRDSVRIVGVLLWMCMWVCCYGDGEYLLYKDPKQTVSDRVADLFGRMTLEEKIGQMVQI 60

Query: 101 DRSVANATVMKDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGID 160
           DRSVA   +M+DYFIGSVLSGGGS PLP+A AQ+WVDMIN++QKG+L SRLGIPMIYGID
Sbjct: 61  DRSVATVNIMRDYFIGSVLSGGGSAPLPEASAQNWVDMINEYQKGALVSRLGIPMIYGID 120

Query: 161 AVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPR 220
           AVHGHNNVYNAT+FPHNVGLGATR+PDLV+RIGAATA+EVRATGI YTFAPC+AVCRDPR
Sbjct: 121 AVHGHNNVYNATIFPHNVGLGATRDPDLVKRIGAATAVEVRATGIPYTFAPCIAVCRDPR 180

Query: 221 WGRCYESYSEDPRIVQNMTEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTT 280
           WGRCYESYSED ++V++MT++I+GLQGEPP+N++ G+P+VGG  KV ACAKH+VGDGGTT
Sbjct: 181 WGRCYESYSEDHKVVEDMTDVILGLQGEPPSNYKHGVPFVGGRDKVAACAKHYVGDGGTT 240

Query: 281 HGINENNTVIDRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGT 340
            G+NENNTV D HGLLS+HMPAY D++ KGVS+VM SYSSWNG KMHAN ELITG+LKGT
Sbjct: 241 RGVNENNTVTDLHGLLSVHMPAYADAVYKGVSTVMVSYSSWNGEKMHANTELITGYLKGT 300

Query: 341 LKFKGFVISDWEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSN 400
           LKFKGFVISDW+G+D++++ PH++YT SV+AAI AGIDMVM+P+ + EF++DLT LVK+N
Sbjct: 301 LKFKGFVISDWQGVDKISTPPHTHYTASVRAAIQAGIDMVMVPFNFTEFVNDLTTLVKNN 360

Query: 401 VIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQEHRDLARDAVRQSLVLLKNG 460
            IP+ RIDDAV RIL VKFTMGLFE+PL DYS  +ELGSQ HRDLAR+AVR+SLVLLKNG
Sbjct: 361 SIPVTRIDDAVRRILLVKFTMGLFENPLADYSFSSELGSQAHRDLAREAVRKSLVLLKNG 420

Query: 461 KNDSDPLLPLSKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTV 520
            N ++P+LPL +K  KILVAGTHADNLGYQCGGWTI WQGFSGN  TRGT++L+A+KS V
Sbjct: 421 -NKTNPMLPLPRKTSKILVAGTHADNLGYQCGGWTITWQGFSGNKNTRGTTLLSAVKSAV 480

Query: 521 DPSTEVVFREDPDSDFVKSNDFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDS 580
           D STEVVFRE+PD++F+KSN+F+YAI+ +GE PYAET GDS  LTMLDPGP+I+ + C +
Sbjct: 481 DQSTEVVFRENPDAEFIKSNNFAYAIIAVGEPPYAETAGDSDKLTMLDPGPAIISSTCQA 540

Query: 581 VKCVVVVISGRPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSV 640
           VKCVVVVISGRP+VMEPYV+S+DALVAAWLPGTEG G+TDAL+GDHGFSGKLP TWF++ 
Sbjct: 541 VKCVVVVISGRPLVMEPYVASIDALVAAWLPGTEGQGITDALFGDHGFSGKLPVTWFRNT 600

Query: 641 DQLPMNVGDRHYDPLFPLGFGLTTGSVKDVVARSTSAGIKGTPSFIAMIIATIAICI 698
           +QLPM+ GD HYDPLF  G GL T SV  +VARSTSA    T   +  ++ +  +C+
Sbjct: 601 EQLPMSYGDTHYDPLFAYGSGLETESVASIVARSTSASATNTKPCLYTVLVSATLCL 656

BLAST of Tan0013897 vs. TAIR 10
Match: AT5G20950.1 (Glycosyl hydrolase family protein)

HSP 1 Score: 923.7 bits (2386), Expect = 8.9e-269
Identity = 434/618 (70.23%), Positives = 516/618 (83.50%), Query Frame = 0

Query: 51  ILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVM 110
           +LCL      +      LKYKDPKQP+  R++DL+ RMTL+EKIGQMVQI+RSVA   VM
Sbjct: 7   VLCLMLLCCIVAAAEGTLKYKDPKQPLGARIRDLMNRMTLQEKIGQMVQIERSVATPEVM 66

Query: 111 KDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYN 170
           K YFIGSVLSGGGSVP   A  + WV+M+N+ QK SLS+RLGIPMIYGIDAVHGHNNVY 
Sbjct: 67  KKYFIGSVLSGGGSVPSEKATPETWVNMVNEIQKASLSTRLGIPMIYGIDAVHGHNNVYG 126

Query: 171 ATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSE 230
           AT+FPHNVGLG TR+P+LV+RIGAATALEVRATGI Y FAPC+AVCRDPRWGRCYESYSE
Sbjct: 127 ATIFPHNVGLGVTRDPNLVKRIGAATALEVRATGIPYAFAPCIAVCRDPRWGRCYESYSE 186

Query: 231 DPRIVQNMTEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVI 290
           D RIVQ MTEII GLQG+ P   RKG+P+VGG  KV ACAKHFVGDGGT  GI+ENNTVI
Sbjct: 187 DYRIVQQMTEIIPGLQGDLPTK-RKGVPFVGGKTKVAACAKHFVGDGGTVRGIDENNTVI 246

Query: 291 DRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGTLKFKGFVISD 350
           D  GL  IHMP Y +++ KGV+++M SYS+WNG++MHAN+EL+TGFLK  LKF+GFVISD
Sbjct: 247 DSKGLFGIHMPGYYNAVNKGVATIMVSYSAWNGLRMHANKELVTGFLKNKLKFRGFVISD 306

Query: 351 WEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDA 410
           W+G+DR+T+ PH NY+YSV A I AGIDM+M+PY Y EFID+++  ++  +IP+ RIDDA
Sbjct: 307 WQGIDRITTPPHLNYSYSVYAGISAGIDMIMVPYNYTEFIDEISSQIQKKLIPISRIDDA 366

Query: 411 VGRILSVKFTMGLFESPLGDYSLVNELGSQEHRDLARDAVRQSLVLLKNGKNDSDPLLPL 470
           + RIL VKFTMGLFE PL D S  N+LGS+EHR+LAR+AVR+SLVLLKNGK  + PLLPL
Sbjct: 367 LKRILRVKFTMGLFEEPLADLSFANQLGSKEHRELAREAVRKSLVLLKNGKTGAKPLLPL 426

Query: 471 SKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFRE 530
            KK+ KILVAG HADNLGYQCGGWTI WQG +GN+ T GT+ILAA+K+TV P+T+VV+ +
Sbjct: 427 PKKSGKILVAGAHADNLGYQCGGWTITWQGLNGNDHTVGTTILAAVKNTVAPTTQVVYSQ 486

Query: 531 DPDSDFVKSNDFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISG 590
           +PD++FVKS  F YAIVV+GE PYAE  GD+T LT+ DPGPSI+ NVC SVKCVVVV+SG
Sbjct: 487 NPDANFVKSGKFDYAIVVVGEPPYAEMFGDTTNLTISDPGPSIIGNVCGSVKCVVVVVSG 546

Query: 591 RPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNVGDR 650
           RP+V++PYVS++DALVAAWLPGTEG GV DAL+GD+GF+GKL RTWFKSV QLPMNVGDR
Sbjct: 547 RPVVIQPYVSTIDALVAAWLPGTEGQGVADALFGDYGFTGKLARTWFKSVKQLPMNVGDR 606

Query: 651 HYDPLFPLGFGLTTGSVK 669
           HYDPL+P GFGLTT   K
Sbjct: 607 HYDPLYPFGFGLTTKPYK 623

BLAST of Tan0013897 vs. TAIR 10
Match: AT5G20950.2 (Glycosyl hydrolase family protein)

HSP 1 Score: 923.7 bits (2386), Expect = 8.9e-269
Identity = 434/618 (70.23%), Positives = 516/618 (83.50%), Query Frame = 0

Query: 51  ILCLGWWWWAIMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVM 110
           +LCL      +      LKYKDPKQP+  R++DL+ RMTL+EKIGQMVQI+RSVA   VM
Sbjct: 7   VLCLMLLCCIVAAAEGTLKYKDPKQPLGARIRDLMNRMTLQEKIGQMVQIERSVATPEVM 66

Query: 111 KDYFIGSVLSGGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYN 170
           K YFIGSVLSGGGSVP   A  + WV+M+N+ QK SLS+RLGIPMIYGIDAVHGHNNVY 
Sbjct: 67  KKYFIGSVLSGGGSVPSEKATPETWVNMVNEIQKASLSTRLGIPMIYGIDAVHGHNNVYG 126

Query: 171 ATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSE 230
           AT+FPHNVGLG TR+P+LV+RIGAATALEVRATGI Y FAPC+AVCRDPRWGRCYESYSE
Sbjct: 127 ATIFPHNVGLGVTRDPNLVKRIGAATALEVRATGIPYAFAPCIAVCRDPRWGRCYESYSE 186

Query: 231 DPRIVQNMTEIIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVI 290
           D RIVQ MTEII GLQG+ P   RKG+P+VGG  KV ACAKHFVGDGGT  GI+ENNTVI
Sbjct: 187 DYRIVQQMTEIIPGLQGDLPTK-RKGVPFVGGKTKVAACAKHFVGDGGTVRGIDENNTVI 246

Query: 291 DRHGLLSIHMPAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGTLKFKGFVISD 350
           D  GL  IHMP Y +++ KGV+++M SYS+WNG++MHAN+EL+TGFLK  LKF+GFVISD
Sbjct: 247 DSKGLFGIHMPGYYNAVNKGVATIMVSYSAWNGLRMHANKELVTGFLKNKLKFRGFVISD 306

Query: 351 WEGLDRLTSTPHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDA 410
           W+G+DR+T+ PH NY+YSV A I AGIDM+M+PY Y EFID+++  ++  +IP+ RIDDA
Sbjct: 307 WQGIDRITTPPHLNYSYSVYAGISAGIDMIMVPYNYTEFIDEISSQIQKKLIPISRIDDA 366

Query: 411 VGRILSVKFTMGLFESPLGDYSLVNELGSQEHRDLARDAVRQSLVLLKNGKNDSDPLLPL 470
           + RIL VKFTMGLFE PL D S  N+LGS+EHR+LAR+AVR+SLVLLKNGK  + PLLPL
Sbjct: 367 LKRILRVKFTMGLFEEPLADLSFANQLGSKEHRELAREAVRKSLVLLKNGKTGAKPLLPL 426

Query: 471 SKKAPKILVAGTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFRE 530
            KK+ KILVAG HADNLGYQCGGWTI WQG +GN+ T GT+ILAA+K+TV P+T+VV+ +
Sbjct: 427 PKKSGKILVAGAHADNLGYQCGGWTITWQGLNGNDHTVGTTILAAVKNTVAPTTQVVYSQ 486

Query: 531 DPDSDFVKSNDFSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISG 590
           +PD++FVKS  F YAIVV+GE PYAE  GD+T LT+ DPGPSI+ NVC SVKCVVVV+SG
Sbjct: 487 NPDANFVKSGKFDYAIVVVGEPPYAEMFGDTTNLTISDPGPSIIGNVCGSVKCVVVVVSG 546

Query: 591 RPIVMEPYVSSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNVGDR 650
           RP+V++PYVS++DALVAAWLPGTEG GV DAL+GD+GF+GKL RTWFKSV QLPMNVGDR
Sbjct: 547 RPVVIQPYVSTIDALVAAWLPGTEGQGVADALFGDYGFTGKLARTWFKSVKQLPMNVGDR 606

Query: 651 HYDPLFPLGFGLTTGSVK 669
           HYDPL+P GFGLTT   K
Sbjct: 607 HYDPLYPFGFGLTTKPYK 623

BLAST of Tan0013897 vs. TAIR 10
Match: AT5G20940.1 (Glycosyl hydrolase family protein)

HSP 1 Score: 855.9 bits (2210), Expect = 2.3e-248
Identity = 409/598 (68.39%), Positives = 491/598 (82.11%), Query Frame = 0

Query: 67  NLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLSGGGSVP 126
           N KYKDPK+P+ VR+K+L+  MTLEEKIGQMVQ++R  A   VM+ YF+GSV SGGGSVP
Sbjct: 29  NAKYKDPKEPLGVRIKNLMSHMTLEEKIGQMVQVERVNATTEVMQKYFVGSVFSGGGSVP 88

Query: 127 LPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNP 186
            P    + WV+M+N+ QK +LS+RLGIP+IYGIDAVHGHN VYNAT+FPHNVGLG TR+P
Sbjct: 89  KPYIGPEAWVNMVNEVQKKALSTRLGIPIIYGIDAVHGHNTVYNATIFPHNVGLGVTRDP 148

Query: 187 DLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNMTEIIIGLQ 246
            LV+RIG ATALEVRATGI Y FAPC+AVCRDPRWGRCYESYSED +IVQ MTEII GLQ
Sbjct: 149 GLVKRIGEATALEVRATGIQYVFAPCIAVCRDPRWGRCYESYSEDHKIVQQMTEIIPGLQ 208

Query: 247 GEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHMPAYLDS 306
           G+ P   +KG+P+V G  KV ACAKHFVGDGGT  G+N NNTVI+ +GLL IHMPAY D+
Sbjct: 209 GDLPTG-QKGVPFVAGKTKVAACAKHFVGDGGTLRGMNANNTVINSNGLLGIHMPAYHDA 268

Query: 307 IIKGVSSVMASYSSWNGVKMHANRELITGFLKGTLKFKGFVISDWEGLDRLTSTPHSNYT 366
           + KGV++VM SYSS NG+KMHAN++LITGFLK  LKF+G VISD+ G+D++ +   +NY+
Sbjct: 269 VNKGVATVMVSYSSINGLKMHANKKLITGFLKNKLKFRGIVISDYLGVDQINTPLGANYS 328

Query: 367 YSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFES 426
           +SV AA  AG+DM M      + ID+LT  VK   IPM RIDDAV RIL VKFTMGLFE+
Sbjct: 329 HSVYAATTAGLDMFMGSSNLTKLIDELTSQVKRKFIPMSRIDDAVKRILRVKFTMGLFEN 388

Query: 427 PLGDYSLVNELGSQEHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVAGTHADN 486
           P+ D+SL  +LGS+EHR+LAR+AVR+SLVLLKNG+N   PLLPL KKA KILVAGTHADN
Sbjct: 389 PIADHSLAKKLGSKEHRELAREAVRKSLVLLKNGENADKPLLPLPKKANKILVAGTHADN 448

Query: 487 LGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFREDPDSDFVKSNDFSYAI 546
           LGYQCGGWTI WQG +GNN T GT+ILAA+K TVDP T+V++ ++PD++FVK+ DF YAI
Sbjct: 449 LGYQCGGWTITWQGLNGNNLTIGTTILAAVKKTVDPKTQVIYNQNPDTNFVKAGDFDYAI 508

Query: 547 VVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYVSSMDALV 606
           V +GE PYAE  GDST LT+ +PGPS + NVC SVKCVVVV+SGRP+VM+  +S++DALV
Sbjct: 509 VAVGEKPYAEGFGDSTNLTISEPGPSTIGNVCASVKCVVVVVSGRPVVMQ--ISNIDALV 568

Query: 607 AAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLFPLGFGLTT 665
           AAWLPGTEG GV D L+GD+GF+GKL RTWFK+VDQLPMNVGD HYDPL+P GFGL T
Sbjct: 569 AAWLPGTEGQGVADVLFGDYGFTGKLARTWFKTVDQLPMNVGDPHYDPLYPFGFGLIT 623

BLAST of Tan0013897 vs. TAIR 10
Match: AT3G47000.1 (Glycosyl hydrolase family protein)

HSP 1 Score: 734.9 bits (1896), Expect = 5.9e-212
Identity = 358/609 (58.78%), Positives = 447/609 (73.40%), Query Frame = 0

Query: 61  IMVGAENLKYKDPKQPVAVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSVLS 120
           ++V   +  YK+   PV  RVKDLL RMTL EKIGQM QI+R VA+ +   D+FIGSVL+
Sbjct: 1   MVVEESSCVYKNGDAPVEARVKDLLSRMTLPEKIGQMTQIERRVASPSAFTDFFIGSVLN 60

Query: 121 GGGSVPLPDARAQDWVDMINDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGL 180
            GGSVP  DA++ DW DMI+ FQ+ +L+SRLGIP+IYG DAVHG+NNVY ATVFPHN+GL
Sbjct: 61  AGGSVPFEDAKSSDWADMIDGFQRSALASRLGIPIIYGTDAVHGNNNVYGATVFPHNIGL 120

Query: 181 GATRNPDLVRRIGAATALEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPRIVQNMTE 240
           GATR+ DLVRRIGAATALEVRA+G+ + F+PC+AV RDPRWGRCYESY EDP +V  MT 
Sbjct: 121 GATRDADLVRRIGAATALEVRASGVHWAFSPCVAVLRDPRWGRCYESYGEDPELVCEMTS 180

Query: 241 IIIGLQGEPPANFRKGIPYVGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLSIHM 300
           ++ GLQG PP     G P+V G   V+AC KHFVGDGGT  GINE NT+     L  IH+
Sbjct: 181 LVSGLQGVPPEEHPNGYPFVAGRNNVVACVKHFVGDGGTDKGINEGNTIASYEELEKIHI 240

Query: 301 PAYLDSIIKGVSSVMASYSSWNGVKMHANRELITGFLKGTLKFKGFVISDWEGLDRLTST 360
           P YL  + +GVS+VMASYSSWNG ++HA+R L+T  LK  L FKGF++SDWEGLDRL+  
Sbjct: 241 PPYLKCLAQGVSTVMASYSSWNGTRLHADRFLLTEILKEKLGFKGFLVSDWEGLDRLSEP 300

Query: 361 PHSNYTYSVKAAILAGIDMVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFT 420
             SNY Y +K A+ AGIDMVM+P+KY +FI D+T LV+S  IPM RI+DAV RIL VKF 
Sbjct: 301 QGSNYRYCIKTAVNAGIDMVMVPFKYEQFIQDMTDLVESGEIPMARINDAVERILRVKFV 360

Query: 421 MGLFESPLGDYSLVNELGSQEHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVA 480
            GLF  PL D SL+  +G +EHR+LA++AVR+SLVLLK+GKN   P LPL + A +ILV 
Sbjct: 361 AGLFGHPLTDRSLLPTVGCKEHRELAQEAVRKSLVLLKSGKNADKPFLPLDRNAKRILVT 420

Query: 481 GTHADNLGYQCGGWTIAWQGFSGNNATRGTSILAAIKSTVDPSTEVVFREDPDSDFVKSN 540
           GTHAD+LGYQCGGWT  W G SG   T GT++L AIK  V   TEV++ + P  + + S+
Sbjct: 421 GTHADDLGYQCGGWTKTWFGLSG-RITIGTTLLDAIKEAVGDETEVIYEKTPSKETLASS 480

Query: 541 D-FSYAIVVIGEAPYAETEGDSTTLTMLDPGPSIVKNVCDSVKCVVVVISGRPIVMEPYV 600
           + FSYAIV +GE PYAET GD++ L +   G  IV  V + +  +V++ISGRP+V+EP V
Sbjct: 481 EGFSYAIVAVGEPPYAETMGDNSELRIPFNGTDIVTAVAEIIPTLVILISGRPVVLEPTV 540

Query: 601 -SSMDALVAAWLPGTEGLGVTDALYGDHGFSGKLPRTWFKSVDQLPMNVGDRHYDPLFPL 660
               +ALVAAWLPGTEG GV D ++GD+ F GKLP +WFK V+ LP++     YDPLFP 
Sbjct: 541 LEKTEALVAAWLPGTEGQGVADVVFGDYDFKGKLPVSWFKHVEHLPLDAHANSYDPLFPF 600

Query: 661 GFGLTTGSV 668
           GFGL +  V
Sbjct: 601 GFGLNSKPV 608
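
The percentages in the HSP summary line follow directly from the raw counts (matches divided by alignment length). The sketch below recomputes them for the AT3G47000.1 hit; the function name `hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool, and the pattern accepts either spelling of the positives label.

```python
import re

# HSP summary line for the AT3G47000.1 hit, with counts as reported above.
HSP_LINE = "Identity = 358/609 (58.78%), Positives = 447/609 (73.40%), Query Frame = 0"

def hsp_percentages(line):
    """Recompute identity/positives percentages from the 'matches/length' counts."""
    result = {}
    # "Pos\w*" matches the label whether it is spelled "Positives" or "Postives".
    for label, hits, length in re.findall(r"(Identity|Pos\w*) = (\d+)/(\d+)", line):
        key = "Identity" if label == "Identity" else "Positives"
        result[key] = round(100 * int(hits) / int(length), 2)
    return result

print(hsp_percentages(HSP_LINE))
```

Both recomputed values agree with the percentages printed in the summary line (58.78 and 73.40).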

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
A7LXU3 | 4.3e-82 | 32.62 | Beta-glucosidase BoGH3B OS=Bacteroides ovatus (strain ATCC 8483 / DSM 1896 / JCM...
Q23892 | 1.4e-72 | 30.97 | Lysosomal beta glucosidase OS=Dictyostelium discoideum OX=44689 GN=gluA PE=1 SV=...
Q56078 | 2.2e-62 | 29.64 | Periplasmic beta-glucosidase OS=Salmonella typhimurium (strain LT2 / SGSC1412 / ...
P33363 | 3.0e-59 | 28.82 | Periplasmic beta-glucosidase OS=Escherichia coli (strain K12) OX=83333 GN=bglX P...
T2KMH0 | 9.3e-53 | 30.43 | Beta-xylosidase OS=Formosa agariphila (strain DSM 15362 / KCTC 12365 / LMG 23005...

Match Name | E-value | Identity (%) | Description
XP_022968400.1 | 0.0e+00 | 93.33 | uncharacterized protein LOC111467651 isoform X2 [Cucurbita maxima]
XP_023541708.1 | 0.0e+00 | 94.16 | uncharacterized protein LOC111801784 isoform X1 [Cucurbita pepo subsp. pepo]
XP_022945501.1 | 0.0e+00 | 93.48 | uncharacterized protein LOC111449719 isoform X1 [Cucurbita moschata]
KAG7012970.1 | 0.0e+00 | 95.14 | hypothetical protein SDJN02_25724 [Cucurbita argyrosperma subsp. argyrosperma]
XP_023541709.1 | 0.0e+00 | 94.69 | uncharacterized protein LOC111801784 isoform X2 [Cucurbita pepo subsp. pepo]

Match Name | E-value | Identity (%) | Description
A0A6J1HX36 | 0.0e+00 | 93.33 | uncharacterized protein LOC111467651 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1G118 | 0.0e+00 | 93.48 | uncharacterized protein LOC111449719 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1HZJ2 | 0.0e+00 | 94.39 | uncharacterized protein LOC111467651 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1G143 | 0.0e+00 | 94.69 | uncharacterized protein LOC111449719 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A1S3BGE4 | 0.0e+00 | 91.45 | beta-glucosidase BoGH3B isoform X1 OS=Cucumis melo OX=3656 GN=LOC103489355 PE=3 ...

Match Name | E-value | Identity (%) | Description
AT5G04885.1 | 2.7e-289 | 70.93 | Glycosyl hydrolase family protein
AT5G20950.1 | 8.9e-269 | 70.23 | Glycosyl hydrolase family protein
AT5G20950.2 | 8.9e-269 | 70.23 | Glycosyl hydrolase family protein
AT5G20940.1 | 2.3e-248 | 68.39 | Glycosyl hydrolase family protein
AT3G47000.1 | 5.9e-212 | 58.78 | Glycosyl hydrolase family protein
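
Hit tables like the Arabidopsis one above are often filtered by identity and ranked by E-value. The following is a minimal sketch using the five rows listed; the helper `best_hits` and the 70% identity cutoff are arbitrary choices made for illustration, not thresholds used by the database.

```python
# (match name, E-value, identity %) copied from the Arabidopsis (AT...) hit rows.
TAIR_HITS = [
    ("AT5G04885.1", "2.7e-289", 70.93),
    ("AT5G20950.1", "8.9e-269", 70.23),
    ("AT5G20950.2", "8.9e-269", 70.23),
    ("AT5G20940.1", "2.3e-248", 68.39),
    ("AT3G47000.1", "5.9e-212", 58.78),
]

def best_hits(hits, min_identity=70.0):
    """Keep hits at or above min_identity, smallest E-value first.

    min_identity=70.0 is a hypothetical cutoff chosen only for this example.
    """
    kept = [h for h in hits if h[2] >= min_identity]
    # E-values this small still fit in a Python float, so they sort numerically.
    return sorted(kept, key=lambda h: float(h[1]))

for name, evalue, identity in best_hits(TAIR_HITS):
    print(name, evalue, identity)
```

Ties in E-value (the two AT5G20950 isoforms) keep their original order because Python's sort is stable.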
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001764 | Glycoside hydrolase, family 3, N-terminal | PRINTS | PR00133 | GLHYDRLASE3 | coord: 266..282, score: 40.06; coord: 174..193, score: 38.33; coord: 150..166, score: 40.76; coord: 220..236, score: 44.12; coord: 336..354, score: 45.49
IPR001764 | Glycoside hydrolase, family 3, N-terminal | PFAM | PF00933 | Glyco_hydro_3 | coord: 89..417, e-value: 3.0E-70, score: 237.2
IPR002772 | Glycoside hydrolase family 3 C-terminal domain | PFAM | PF01915 | Glyco_hydro_3_C | coord: 455..663, e-value: 4.7E-34, score: 118.1
IPR036881 | Glycoside hydrolase family 3 C-terminal domain superfamily | GENE3D | 3.40.50.1700 | - | coord: 439..665, e-value: 1.4E-69, score: 236.4
IPR036881 | Glycoside hydrolase family 3 C-terminal domain superfamily | SUPERFAMILY | 52279 | Beta-D-glucan exohydrolase, C-terminal domain | coord: 454..663
IPR036962 | Glycoside hydrolase, family 3, N-terminal domain superfamily | GENE3D | 3.20.20.300 | - | coord: 63..438, e-value: 1.7E-133, score: 447.1
None | No IPR available | PANTHER | PTHR30620:SF35 | GLYCOSYL HYDROLASE FAMILY PROTEIN | coord: 56..689
None | No IPR available | PANTHER | PTHR30620 | PERIPLASMIC BETA-GLUCOSIDASE-RELATED | coord: 56..689
IPR017853 | Glycoside hydrolase superfamily | SUPERFAMILY | 51445 | (Trans)glycosidases | coord: 69..453
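
The `coord: start..end` values give the span of each predicted domain on the protein. As a rough sketch, the code below parses the two Pfam spans and derives their lengths and the linker between them. It assumes the coordinates are 1-based and inclusive (the usual convention for domain annotations, though not stated on this page); `parse_coord` and `span_length` are hypothetical helper names.

```python
import re

# Pfam alignment strings as listed in the InterPro table.
PFAM_ALIGNMENTS = {
    "PF00933 Glyco_hydro_3": "coord: 89..417",
    "PF01915 Glyco_hydro_3_C": "coord: 455..663",
}

def parse_coord(text):
    """Extract (start, end) from a 'coord: start..end' string."""
    start, end = re.search(r"coord:\s*(\d+)\.\.(\d+)", text).groups()
    return int(start), int(end)

def span_length(text):
    """Length in residues, assuming 1-based inclusive coordinates."""
    start, end = parse_coord(text)
    return end - start + 1

n_start, n_end = parse_coord(PFAM_ALIGNMENTS["PF00933 Glyco_hydro_3"])
c_start, c_end = parse_coord(PFAM_ALIGNMENTS["PF01915 Glyco_hydro_3_C"])
linker = c_start - n_end - 1  # residues lying between the two Pfam domains

for name, coord in PFAM_ALIGNMENTS.items():
    print(name, span_length(coord))
print("linker residues:", linker)
```

Under that assumption the N-terminal domain covers 329 residues, the C-terminal domain 209, and 37 residues separate them.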

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Tan0013897.1 | Tan0013897.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0009251 | glucan catabolic process
biological_process | GO:0005975 | carbohydrate metabolic process
cellular_component | GO:0005576 | extracellular region
cellular_component | GO:0016021 | integral component of membrane
molecular_function | GO:0008422 | beta-glucosidase activity
molecular_function | GO:0004553 | hydrolase activity, hydrolyzing O-glycosyl compounds