HG10013515 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10013515
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Sarcosine oxidase
Location: Chr02: 2259883 .. 2261115 (+)
RNA-Seq Expression: HG10013515
Synteny: HG10013515
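
The Location field can be turned into a feature length with a few lines of Python. This is a minimal sketch that assumes the coordinates are 1-based and inclusive, which is the usual convention for records of this kind but is not stated on the page; the helper names `parse_location` and `span_length` are made up for illustration.

```python
import re


def parse_location(text):
    """Split a string such as 'Chr02: 2259883 .. 2261115 (+)' into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Chr02: 2259883 .. 2261115 (+)")
length = span_length(start, end)
print(seqid, strand, length)  # Chr02 + 1233
print(length % 3 == 0)        # True: the span is a whole number of codons
```

Under that assumption the span is 1233 bp, or 411 codons, which would be consistent with the 410-residue protein listed under Sequences plus one stop codon if the gene model is a single exon.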
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTGATTCCGGCACTCAATTCGATGTAATCGTCGTTGGCGCCGGCGTAATGGGTAGTTCCACGGCGTACCATCTCGCTAAAACAGGGAACAGAATTCTGATTCTCGAGCAATTCGATTTTCTTCATCACAGAGGCTCTTCTCATGGCGAATCTCGTACCATACGCGCCACTTATCCGGAGGATTACTACCACGGCCTCGTTATGGAATCTTACGAGCTCTGGCGGATGACGGAGGCGGAAATTGGCTACAGAGTTTATTTTCCGGCGGAGCAGCTCGATATCGGCCCTTCCGACGACAAAAGTCTCGCCGCCGTCGTCGAGACCTGCCGGAAGCATTCGATCCCTCATCTGGTCCTGGATCGAGGACAACTGGCGGAGAAGTACTCCGGCAGAGTGGAGATTCCGGCGGACTGGGTGGCGGTGTGGAGCAAGTACGGCGGCGTAATTAAGCCAACGAAGGCGGTTTCGATGTTCCAAAGTTTGGCTTACAAAAACGGCGCCGTTTTGAAGGACAATGCGGAAGTGGTGGAGATTAAGAGAGATGGGAGTAGTGGAGGAATAGTTGTTTCAACAGCCAATGGGCAGAGTTTTAGAGGAAAAAAATGTGTGGTGACAGTTGGAGCTTGGGCTAGAAAGTTATTTAAATCAGTTGGTGGGATTGAATTGCCAATTCAGCCATTGGAGGCTACTGTATCCTACTGGAGGATCAAGGAAGGGGTGGAGGCGGCCGAGTATGCAATCGGAGGGGACTTTCCGACATTTGCTAGCTATGGTGACCCATATATTTACGGGACACCATCGTTGGAGTTTCCAGGATTGATCAAAGTGGCTATACACGGCGGGCATCAGTGCGACCCGGACAAGCGATCGTGGGGGAATGGGGAGCGGCTACCGATAGCTACATTAAAGGAGTGGATAGAGGAGAGGTTTGGGGGGAGGGTGGATTCAAGTGAGCCGGTGTCAACGCAGTTGTGTATGTACTCAATGACGCCAGATGAGGACTTTGTGATTGATTTTTTGGGAGGGGAATTTGAAAAGGATGTAGTGATCGGTGGCGGGTTTTCAGGGCATGGGTTCAAAATGTCACCGGCGGTCGGGAGGATACTGGCGGAGCTTGCATTGAAGGGAGAAGCGGAAGGAACGGAGCTGAAGTATTTTAAGATGGCAAGGTTTGAGGAGAATCCAAAAGGTAATGTCAAGAGCTTTGCAGATCAAGTAAAGCTTCACTAG

mRNA sequence

ATGGCTGATTCCGGCACTCAATTCGATGTAATCGTCGTTGGCGCCGGCGTAATGGGTAGTTCCACGGCGTACCATCTCGCTAAAACAGGGAACAGAATTCTGATTCTCGAGCAATTCGATTTTCTTCATCACAGAGGCTCTTCTCATGGCGAATCTCGTACCATACGCGCCACTTATCCGGAGGATTACTACCACGGCCTCGTTATGGAATCTTACGAGCTCTGGCGGATGACGGAGGCGGAAATTGGCTACAGAGTTTATTTTCCGGCGGAGCAGCTCGATATCGGCCCTTCCGACGACAAAAGTCTCGCCGCCGTCGTCGAGACCTGCCGGAAGCATTCGATCCCTCATCTGGTCCTGGATCGAGGACAACTGGCGGAGAAGTACTCCGGCAGAGTGGAGATTCCGGCGGACTGGGTGGCGGTGTGGAGCAAGTACGGCGGCGTAATTAAGCCAACGAAGGCGGTTTCGATGTTCCAAAGTTTGGCTTACAAAAACGGCGCCGTTTTGAAGGACAATGCGGAAGTGGTGGAGATTAAGAGAGATGGGAGTAGTGGAGGAATAGTTGTTTCAACAGCCAATGGGCAGAGTTTTAGAGGAAAAAAATGTGTGGTGACAGTTGGAGCTTGGGCTAGAAAGTTATTTAAATCAGTTGGTGGGATTGAATTGCCAATTCAGCCATTGGAGGCTACTGTATCCTACTGGAGGATCAAGGAAGGGGTGGAGGCGGCCGAGTATGCAATCGGAGGGGACTTTCCGACATTTGCTAGCTATGGTGACCCATATATTTACGGGACACCATCGTTGGAGTTTCCAGGATTGATCAAAGTGGCTATACACGGCGGGCATCAGTGCGACCCGGACAAGCGATCGTGGGGGAATGGGGAGCGGCTACCGATAGCTACATTAAAGGAGTGGATAGAGGAGAGGTTTGGGGGGAGGGTGGATTCAAGTGAGCCGGTGTCAACGCAGTTGTGTATGTACTCAATGACGCCAGATGAGGACTTTGTGATTGATTTTTTGGGAGGGGAATTTGAAAAGGATGTAGTGATCGGTGGCGGGTTTTCAGGGCATGGGTTCAAAATGTCACCGGCGGTCGGGAGGATACTGGCGGAGCTTGCATTGAAGGGAGAAGCGGAAGGAACGGAGCTGAAGTATTTTAAGATGGCAAGGTTTGAGGAGAATCCAAAAGGTAATGTCAAGAGCTTTGCAGATCAAGTAAAGCTTCACTAG

Coding sequence (CDS)

ATGGCTGATTCCGGCACTCAATTCGATGTAATCGTCGTTGGCGCCGGCGTAATGGGTAGTTCCACGGCGTACCATCTCGCTAAAACAGGGAACAGAATTCTGATTCTCGAGCAATTCGATTTTCTTCATCACAGAGGCTCTTCTCATGGCGAATCTCGTACCATACGCGCCACTTATCCGGAGGATTACTACCACGGCCTCGTTATGGAATCTTACGAGCTCTGGCGGATGACGGAGGCGGAAATTGGCTACAGAGTTTATTTTCCGGCGGAGCAGCTCGATATCGGCCCTTCCGACGACAAAAGTCTCGCCGCCGTCGTCGAGACCTGCCGGAAGCATTCGATCCCTCATCTGGTCCTGGATCGAGGACAACTGGCGGAGAAGTACTCCGGCAGAGTGGAGATTCCGGCGGACTGGGTGGCGGTGTGGAGCAAGTACGGCGGCGTAATTAAGCCAACGAAGGCGGTTTCGATGTTCCAAAGTTTGGCTTACAAAAACGGCGCCGTTTTGAAGGACAATGCGGAAGTGGTGGAGATTAAGAGAGATGGGAGTAGTGGAGGAATAGTTGTTTCAACAGCCAATGGGCAGAGTTTTAGAGGAAAAAAATGTGTGGTGACAGTTGGAGCTTGGGCTAGAAAGTTATTTAAATCAGTTGGTGGGATTGAATTGCCAATTCAGCCATTGGAGGCTACTGTATCCTACTGGAGGATCAAGGAAGGGGTGGAGGCGGCCGAGTATGCAATCGGAGGGGACTTTCCGACATTTGCTAGCTATGGTGACCCATATATTTACGGGACACCATCGTTGGAGTTTCCAGGATTGATCAAAGTGGCTATACACGGCGGGCATCAGTGCGACCCGGACAAGCGATCGTGGGGGAATGGGGAGCGGCTACCGATAGCTACATTAAAGGAGTGGATAGAGGAGAGGTTTGGGGGGAGGGTGGATTCAAGTGAGCCGGTGTCAACGCAGTTGTGTATGTACTCAATGACGCCAGATGAGGACTTTGTGATTGATTTTTTGGGAGGGGAATTTGAAAAGGATGTAGTGATCGGTGGCGGGTTTTCAGGGCATGGGTTCAAAATGTCACCGGCGGTCGGGAGGATACTGGCGGAGCTTGCATTGAAGGGAGAAGCGGAAGGAACGGAGCTGAAGTATTTTAAGATGGCAAGGTTTGAGGAGAATCCAAAAGGTAATGTCAAGAGCTTTGCAGATCAAGTAAAGCTTCACTAG

Protein sequence

MADSGTQFDVIVVGAGVMGSSTAYHLAKTGNRILILEQFDFLHHRGSSHGESRTIRATYPEDYYHGLVMESYELWRMTEAEIGYRVYFPAEQLDIGPSDDKSLAAVVETCRKHSIPHLVLDRGQLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVEIKRDGSSGGIVVSTANGQSFRGKKCVVTVGAWARKLFKSVGGIELPIQPLEATVSYWRIKEGVEAAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKVAIHGGHQCDPDKRSWGNGERLPIATLKEWIEERFGGRVDSSEPVSTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGFKMSPAVGRILAELALKGEAEGTELKYFKMARFEENPKGNVKSFADQVKLH
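
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1), which the page does not state explicitly, and only translates the first 30 and last 12 nucleotides of the CDS shown above to keep the example short. The function name `translate` is illustrative.

```python
from itertools import product

# Standard genetic code (NCBI table 1), laid out in the conventional
# T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA sequence; '*' marks a stop codon.

    Raises KeyError on ambiguous bases such as N.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGGCTGATTCCGGCACTCAATTCGATGTA"  # first 30 nt of the CDS
cds_end = "AAGCTTCACTAG"                      # last 12 nt of the CDS

print(translate(cds_start))  # MADSGTQFDV
print(translate(cds_end))    # KLH*
```

Both fragments agree with the protein sequence, which starts with MADSGTQFDV and ends with KLH; the trailing TAG stop codon is not part of the listed polypeptide.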
Homology
BLAST of HG10013515 vs. NCBI nr
Match: XP_038896765.1 (probable sarcosine oxidase [Benincasa hispida])

HSP 1 Score: 781.6 bits (2017), Expect = 3.4e-222
Identity = 376/409 (91.93%), Positives = 394/409 (96.33%), Query Frame = 0

Query: 1   MADSGTQFDVIVVGAGVMGSSTAYHLAKTGNRILILEQFDFLHHRGSSHGESRTIRATYP 60
           MADSGTQFDVIVVGAGVMGSST YHLAKTGNR+LILEQFDFLHHRGSSHGESRTIRATYP
Sbjct: 1   MADSGTQFDVIVVGAGVMGSSTGYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYP 60

Query: 61  EDYYHGLVMESYELWRMTEAEIGYRVYFPAEQLDIGPSDDKSLAAVVETCRKHSIPHLVL 120
           EDYYHGLVMESYELWRM EAEIGYRVYFPAEQLDIGPSDDKSLAAVV+TCRKHSIPH+VL
Sbjct: 61  EDYYHGLVMESYELWRMAEAEIGYRVYFPAEQLDIGPSDDKSLAAVVDTCRKHSIPHMVL 120

Query: 121 DRGQLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVEIK 180
           DRGQLAEKYSGRVEIP+DWVAVWSKYGGVIKPTKAVSMFQ LAYKNGAVLKDNAEVVEIK
Sbjct: 121 DRGQLAEKYSGRVEIPSDWVAVWSKYGGVIKPTKAVSMFQCLAYKNGAVLKDNAEVVEIK 180

Query: 181 RDGSSGGIVVSTANGQSFRGKKCVVTVGAWARKLFKSVGGIELPIQPLEATVSYWRIKEG 240
           RD S+G IVVS ANG+SF GKKCVVTVGAWARKL KSV  IELPIQPLEATVSYWRIKEG
Sbjct: 181 RDESNGRIVVSIANGESFSGKKCVVTVGAWARKLVKSVSRIELPIQPLEATVSYWRIKEG 240

Query: 241 VEAAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKVAIHGGHQCDPDKRSWGNGERLPI 300
            E AEYAIGGDFPTFASYG+PY+YGTPSLEFPGLIKVA+HGGHQC+PDKRSWG+G RLPI
Sbjct: 241 AE-AEYAIGGDFPTFASYGNPYVYGTPSLEFPGLIKVAMHGGHQCNPDKRSWGSGGRLPI 300

Query: 301 ATLKEWIEERFGGRVDSSEPVSTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGF 360
           + LKEWIEERFGGRVDSS+P++TQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGF
Sbjct: 301 SALKEWIEERFGGRVDSSDPIATQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGF 360

Query: 361 KMSPAVGRILAELALKGEAEGTELKYFKMARFEENPKGNVKSFADQVKL 410
           KMSPA+GRILAELALKGEAEG ELKYFK+ARFEENPKGN+KSFADQVKL
Sbjct: 361 KMSPAIGRILAELALKGEAEGVELKYFKIARFEENPKGNIKSFADQVKL 408
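
The percentages in an HSP summary line are the ratio of matching columns to alignment length. The sketch below parses such a line and recomputes both values; the regular expression and the helper name `parse_hsp_stats` are assumptions made for this example and are not part of any BLAST tool. The pattern accepts any label beginning with "Pos" so that spelling variants of the second label are tolerated.

```python
import re

HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w* = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract identity/positive counts and the reported percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


line = "Identity = 376/409 (91.93%), Positives = 394/409 (96.33%), Query Frame = 0"
stats = parse_hsp_stats(line)

# Recompute the percentages from the raw counts.
identity_pct = round(100 * stats["identities"] / stats["alignment_length"], 2)
positive_pct = round(100 * stats["positives"] / stats["alignment_length"], 2)
print(identity_pct, positive_pct)  # 91.93 96.33
```

The recomputed values match the reported ones for the Benincasa hispida hit, so the percentages are taken over the 409 aligned columns rather than over the full query length.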

BLAST of HG10013515 vs. NCBI nr
Match: XP_022936028.1 (probable sarcosine oxidase [Cucurbita moschata])

HSP 1 Score: 767.7 bits (1981), Expect = 5.1e-218
Identity = 370/410 (90.24%), Positives = 388/410 (94.63%), Query Frame = 0

Query: 1   MADSGTQFDVIVVGAGVMGSSTAYHLAKTGNRILILEQFDFLHHRGSSHGESRTIRATYP 60
           MADSGTQ+DVIVVGAGVMGSSTAYHLAKTGN++L LEQFDFLHHRGSSHGESRTIRATYP
Sbjct: 1   MADSGTQYDVIVVGAGVMGSSTAYHLAKTGNKVLTLEQFDFLHHRGSSHGESRTIRATYP 60

Query: 61  EDYYHGLVMESYELWRMTEAEIGYRVYFPAEQLDIGPSDDKSLAAVVETCRKHSIPHLVL 120
           EDYYH LVMESYELWR  EAEIGYRVYFPAEQLDIG SDDKSLAA V+TC+KHSIPHLVL
Sbjct: 61  EDYYHSLVMESYELWRTAEAEIGYRVYFPAEQLDIGHSDDKSLAAAVDTCKKHSIPHLVL 120

Query: 121 DRGQLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVEIK 180
           DR QLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQ+LA+KNGA LKDNAEVVEIK
Sbjct: 121 DREQLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQTLAFKNGADLKDNAEVVEIK 180

Query: 181 RDGSSGGIVVSTANGQSFRGKKCVVTVGAWARKLFKSVGGIELPIQPLEATVSYWRIKEG 240
           R+GSSGG+VVSTANG+ FRGKKCVVTVGAWARKL KSVGGIELPIQPLEATVSYWRIKEG
Sbjct: 181 REGSSGGVVVSTANGERFRGKKCVVTVGAWARKLVKSVGGIELPIQPLEATVSYWRIKEG 240

Query: 241 VEAAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKVAIHGGHQCDPDKRSWGNGERLPI 300
            E  EYAIGGDFPTFASYGD YIYGTPSLEFPGLIKVA+HGGH+CDPDKRSWG+G  LPI
Sbjct: 241 AE-GEYAIGGDFPTFASYGDTYIYGTPSLEFPGLIKVAVHGGHRCDPDKRSWGSGGELPI 300

Query: 301 ATLKEWIEERFGGRVDSSEPVSTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGF 360
             LKEWIE RFGGRVDSSEPVSTQLCMYSMTPDEDFVIDFLGGEF KDVVIGGGFSGHGF
Sbjct: 301 TALKEWIEGRFGGRVDSSEPVSTQLCMYSMTPDEDFVIDFLGGEFGKDVVIGGGFSGHGF 360

Query: 361 KMSPAVGRILAELALKGEAEGTELKYFKMARFEENPKGNVKSFADQVKLH 411
           KMSPA+GRILA+LALKG AEG ELKYF++ARFEENPKGNVKSFADQV+LH
Sbjct: 361 KMSPAIGRILADLALKGVAEGLELKYFRIARFEENPKGNVKSFADQVRLH 409

BLAST of HG10013515 vs. NCBI nr
Match: XP_022975268.1 (probable sarcosine oxidase [Cucurbita maxima])

HSP 1 Score: 764.6 bits (1973), Expect = 4.3e-217
Identity = 368/410 (89.76%), Positives = 388/410 (94.63%), Query Frame = 0

Query: 1   MADSGTQFDVIVVGAGVMGSSTAYHLAKTGNRILILEQFDFLHHRGSSHGESRTIRATYP 60
           MADSGTQ+DVIVVGAGVMGSSTAYHLAKTGN++LILEQFDFLHHRGSSHGESRTIRATYP
Sbjct: 1   MADSGTQYDVIVVGAGVMGSSTAYHLAKTGNKVLILEQFDFLHHRGSSHGESRTIRATYP 60

Query: 61  EDYYHGLVMESYELWRMTEAEIGYRVYFPAEQLDIGPSDDKSLAAVVETCRKHSIPHLVL 120
           EDYYH LVMESYELWR  EAEIGYRVYFPAEQLDIG SDDKSLAA V+TC+KHSIPHLVL
Sbjct: 61  EDYYHSLVMESYELWRTAEAEIGYRVYFPAEQLDIGHSDDKSLAAAVDTCKKHSIPHLVL 120

Query: 121 DRGQLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVEIK 180
           DR QLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQ+LA+KNGA LKDNAEVVEIK
Sbjct: 121 DREQLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQTLAFKNGADLKDNAEVVEIK 180

Query: 181 RDGSSGGIVVSTANGQSFRGKKCVVTVGAWARKLFKSVGGIELPIQPLEATVSYWRIKEG 240
           RDGSSGG+VVS ANG+ FRGKKCVVTVGAWARKL KSVGGIELPIQPLEATVSYWRIKEG
Sbjct: 181 RDGSSGGVVVSVANGERFRGKKCVVTVGAWARKLVKSVGGIELPIQPLEATVSYWRIKEG 240

Query: 241 VEAAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKVAIHGGHQCDPDKRSWGNGERLPI 300
            E  EYAIGG+FPTFASYGD YIYGTPSLEFPGLIKVA+HGGH+CDPDKRSWG+G +LPI
Sbjct: 241 AE-GEYAIGGNFPTFASYGDTYIYGTPSLEFPGLIKVAVHGGHRCDPDKRSWGSGGQLPI 300

Query: 301 ATLKEWIEERFGGRVDSSEPVSTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGF 360
             LK+WIE RFGGRVDSSEPVSTQLCMYSMTPDEDFVIDFLGGEF KDVVIGGGFSGHGF
Sbjct: 301 TALKKWIEGRFGGRVDSSEPVSTQLCMYSMTPDEDFVIDFLGGEFGKDVVIGGGFSGHGF 360

Query: 361 KMSPAVGRILAELALKGEAEGTELKYFKMARFEENPKGNVKSFADQVKLH 411
           KMSPA+GRILA+LALKG AEG ELKYF++ARFEENPKGNVK FADQV+LH
Sbjct: 361 KMSPAIGRILADLALKGVAEGLELKYFRIARFEENPKGNVKGFADQVRLH 409

BLAST of HG10013515 vs. NCBI nr
Match: KAG7024517.1 (putative sarcosine oxidase, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 764.2 bits (1972), Expect = 5.6e-217
Identity = 368/410 (89.76%), Positives = 388/410 (94.63%), Query Frame = 0

Query: 1   MADSGTQFDVIVVGAGVMGSSTAYHLAKTGNRILILEQFDFLHHRGSSHGESRTIRATYP 60
           MADSGTQ+DVIVVGAGVMGSSTAYHLAKTGN++LILEQFDFLHHRGSSHGESRTIRATYP
Sbjct: 1   MADSGTQYDVIVVGAGVMGSSTAYHLAKTGNKVLILEQFDFLHHRGSSHGESRTIRATYP 60

Query: 61  EDYYHGLVMESYELWRMTEAEIGYRVYFPAEQLDIGPSDDKSLAAVVETCRKHSIPHLVL 120
           EDYYH LVMESYELW+  EAEIGYRVYFPAEQLDIG SDDKSLAA V+TC+KHSIPHLVL
Sbjct: 61  EDYYHSLVMESYELWQTAEAEIGYRVYFPAEQLDIGHSDDKSLAAAVDTCKKHSIPHLVL 120

Query: 121 DRGQLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVEIK 180
           DR QLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQ+LA++NGA LKDNAEVVEIK
Sbjct: 121 DREQLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQTLAFRNGADLKDNAEVVEIK 180

Query: 181 RDGSSGGIVVSTANGQSFRGKKCVVTVGAWARKLFKSVGGIELPIQPLEATVSYWRIKEG 240
           RDGSSGG+VVS ANG+ FRGKKCVVTVGAWARKL KSVGGIELPIQPLEATVSYWRIKEG
Sbjct: 181 RDGSSGGVVVSIANGERFRGKKCVVTVGAWARKLVKSVGGIELPIQPLEATVSYWRIKEG 240

Query: 241 VEAAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKVAIHGGHQCDPDKRSWGNGERLPI 300
            E  EYAIGGDFPTFASYGD YIYGTPSLEFPGLIKVA+HGGH+CDP+KRSWG+G  LPI
Sbjct: 241 AE-GEYAIGGDFPTFASYGDTYIYGTPSLEFPGLIKVAVHGGHRCDPNKRSWGSGGELPI 300

Query: 301 ATLKEWIEERFGGRVDSSEPVSTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGF 360
             LKEWIE RFGGRVDSSEPVSTQLCMYSMTPDEDFVIDFLGGEF KDVVIGGGFSGHGF
Sbjct: 301 TALKEWIEGRFGGRVDSSEPVSTQLCMYSMTPDEDFVIDFLGGEFGKDVVIGGGFSGHGF 360

Query: 361 KMSPAVGRILAELALKGEAEGTELKYFKMARFEENPKGNVKSFADQVKLH 411
           KMSPA+GRILA+LALKG AEG ELKYF++ARFEENPKGNVKSFADQV+LH
Sbjct: 361 KMSPAIGRILADLALKGVAEGLELKYFRIARFEENPKGNVKSFADQVRLH 409

BLAST of HG10013515 vs. NCBI nr
Match: XP_023534889.1 (probable sarcosine oxidase [Cucurbita pepo subsp. pepo])

HSP 1 Score: 762.3 bits (1967), Expect = 2.1e-216
Identity = 368/410 (89.76%), Positives = 385/410 (93.90%), Query Frame = 0

Query: 1   MADSGTQFDVIVVGAGVMGSSTAYHLAKTGNRILILEQFDFLHHRGSSHGESRTIRATYP 60
           MADSGTQ+DVIVVGAGVMGSSTAYHLAKTGN++LILEQFDFLHHRGSSHGESRTIRATYP
Sbjct: 1   MADSGTQYDVIVVGAGVMGSSTAYHLAKTGNKVLILEQFDFLHHRGSSHGESRTIRATYP 60

Query: 61  EDYYHGLVMESYELWRMTEAEIGYRVYFPAEQLDIGPSDDKSLAAVVETCRKHSIPHLVL 120
           EDYYH LVMESYELWR  EAEIGYRVYFPAEQLDIG SDDKSLAA V+TC+KHSIPHLVL
Sbjct: 61  EDYYHSLVMESYELWRTAEAEIGYRVYFPAEQLDIGHSDDKSLAAAVDTCKKHSIPHLVL 120

Query: 121 DRGQLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVEIK 180
           DR QLAEKY GRVEIPADWVAVWSKYGGVIKPTKAVSMFQ+LA+KNGA LKDNAEVVEIK
Sbjct: 121 DREQLAEKYYGRVEIPADWVAVWSKYGGVIKPTKAVSMFQTLAFKNGADLKDNAEVVEIK 180

Query: 181 RDGSSGGIVVSTANGQSFRGKKCVVTVGAWARKLFKSVGGIELPIQPLEATVSYWRIKEG 240
           RDGSSGG+VVS ANG+ FRGKKCVVTVGAWARKL KSVGGIELPIQPLEATVSYWRIKEG
Sbjct: 181 RDGSSGGVVVSIANGERFRGKKCVVTVGAWARKLVKSVGGIELPIQPLEATVSYWRIKEG 240

Query: 241 VEAAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKVAIHGGHQCDPDKRSWGNGERLPI 300
            E  EYAIGGDFPTFASYGD YIYGTPSLEFPGLIKVA+HGGH+CDPDKRSWG+G  LPI
Sbjct: 241 AE-GEYAIGGDFPTFASYGDTYIYGTPSLEFPGLIKVAVHGGHRCDPDKRSWGSGGELPI 300

Query: 301 ATLKEWIEERFGGRVDSSEPVSTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGF 360
             LKEWIE RFGGRVDSSEPVSTQLCMYSMTPDEDFVIDFLGGEF KDVVIGGGFSGHGF
Sbjct: 301 TALKEWIEGRFGGRVDSSEPVSTQLCMYSMTPDEDFVIDFLGGEFGKDVVIGGGFSGHGF 360

Query: 361 KMSPAVGRILAELALKGEAEGTELKYFKMARFEENPKGNVKSFADQVKLH 411
           KMSPA+GRILA+LALKG AEG ELKYF++ARFE NPKGNVK FADQV+LH
Sbjct: 361 KMSPAIGRILADLALKGVAEGMELKYFRIARFEGNPKGNVKDFADQVRLH 409

BLAST of HG10013515 vs. ExPASy Swiss-Prot
Match: Q9SJA7 (Probable sarcosine oxidase OS=Arabidopsis thaliana OX=3702 GN=At2g24580 PE=2 SV=1)

HSP 1 Score: 569.3 bits (1466), Expect = 3.5e-161
Identity = 269/411 (65.45%), Positives = 329/411 (80.05%), Query Frame = 0

Query: 2   ADSGTQFDVIVVGAGVMGSSTAYHLAKTGNRILILEQFDFLHHRGSSHGESRTIRATYPE 61
           +D G +FDVIVVGAGVMGSS AY LAK G + L+LEQFDFLHHRGSSHGESRTIRATYPE
Sbjct: 4   SDDG-RFDVIVVGAGVMGSSAAYQLAKRGQKTLLLEQFDFLHHRGSSHGESRTIRATYPE 63

Query: 62  DYYHGLVMESYELWRMTEAEIGYRVYFPAEQLDIGPSDDKSLAAVVETCRKHSIPHLVLD 121
           DYY+ +V ES  LW   ++EIGY+V+FP +Q D+GP+D +SL +VV TC+KH + H V+D
Sbjct: 64  DYYYSMVSESTRLWAAAQSEIGYKVHFPTQQFDMGPADQQSLLSVVATCQKHGLAHRVMD 123

Query: 122 RGQLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVEIKR 181
              ++E +SGR+ IP +W+ V ++ GG+IKPTKAVSMFQ+LA  +GA+L+DN +V  IKR
Sbjct: 124 SHAVSEHFSGRISIPENWIGVSTELGGIIKPTKAVSMFQTLAIGHGAILRDNTKVANIKR 183

Query: 182 DGSSG-GIVVSTANGQSFRGKKCVVTVGAWARKLFKSVGGIELPIQPLEATVSYWRIKEG 241
           DG SG G++V T  G  F GKKC+VT GAW  KL K+V GI+ P++PLE TV YWRIKEG
Sbjct: 184 DGESGEGVIVCTVKGDKFYGKKCIVTAGAWISKLVKTVAGIDFPVEPLETTVCYWRIKEG 243

Query: 242 VEAAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKVAIHGGHQCDPDKRSWGNGERLPI 301
            E  ++ I G+FPTFASYG PY+YGTPSLE+PGLIKVA+HGG+ CDPDKR WG G +L  
Sbjct: 244 HE-EKFTIDGEFPTFASYGAPYVYGTPSLEYPGLIKVAVHGGYWCDPDKRPWGPGVKL-- 303

Query: 302 ATLKEWIEERFGGRVDSSEPVSTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGF 361
             LKEWI+ERFGG VDS  PV+TQLCMYSMTPDEDFVIDFLGGEF +DVV+GGGFSGHGF
Sbjct: 304 EELKEWIKERFGGMVDSEGPVATQLCMYSMTPDEDFVIDFLGGEFGRDVVVGGGFSGHGF 363

Query: 362 KMSPAVGRILAELALKGEA--EGTELKYFKMARFEENPKGNVKSFADQVKL 410
           KM+PAVGRILA++A++ EA   G E+K F + RFE+NPKGN K + DQV L
Sbjct: 364 KMAPAVGRILADMAMEVEAGGGGVEMKQFSLRRFEDNPKGNAKEYPDQVIL 410

BLAST of HG10013515 vs. ExPASy Swiss-Prot
Match: Q29RU9 (Peroxisomal sarcosine oxidase OS=Bos taurus OX=9913 GN=PIPOX PE=2 SV=2)

HSP 1 Score: 243.0 bits (619), Expect = 5.7e-63
Identity = 143/406 (35.22%), Positives = 213/406 (52.46%), Query Frame = 0

Query: 1   MADSGTQFDVIVVGAGVMGSSTAYHLAKTGNRILILEQFDFLHHRGSSHGESRTIRATYP 60
           MA     +D IV+GAG+ G  TAYHLAK   ++L+LEQF   H RGSSHG+SR IR  YP
Sbjct: 1   MAAQRELYDAIVIGAGIQGCFTAYHLAKHSKKVLLLEQFFLPHSRGSSHGQSRIIRRAYP 60

Query: 61  EDYYHGLVMESYELWRMTEAEIGYRVYFPAEQLDIGPSDDKSLAAVVETCRKHSIPHLVL 120
           ED+Y  ++ E Y LW   E E G ++Y     L +G  ++  L  +  T  +  + H  L
Sbjct: 61  EDFYTQMMAECYSLWAQLEHEAGTQLYRQTGLLLLGMKENPELKIIQATLSRQGVEHQCL 120

Query: 121 DRGQLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVEIK 180
              +L +++   + +    V +    GGV+   KA+   Q    + G ++ D  +VVEIK
Sbjct: 121 SSEELKQRFP-NIRLARGEVGLLEVSGGVLYADKALRALQDAIRQLGGIVHDGEKVVEIK 180

Query: 181 RDGSSGGIVVSTANGQSFRGKKCVVTVGAWARKLFKSVGGIELPIQPLEATVSYWRIKEG 240
               SG  V+     +S++ K  ++T G W  +L + +G  ELP+Q L   V YW+ K  
Sbjct: 181 ----SGLPVMVKTTSRSYQAKSLIITAGPWTNRLLRPLGA-ELPLQTLRINVCYWQEK-- 240

Query: 241 VEAAEYAIGGDFPTFASYG----DPYIYGTPSLEFPGLIKVAIHGGHQCDPDKRSW--GN 300
                Y++   FP F   G      +IYG PS E+PGL+KV  H G+  DP++R      
Sbjct: 241 -VPGSYSVSQAFPCFMGLGLSLAPHHIYGLPSREYPGLMKVCYHHGNNADPEERDCPAAF 300

Query: 301 GERLPIATLKEWIEERFGGRVDSSEPVSTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGG 360
            +   +  L  ++ +         EP   + CMY+ TPD  FV+D        ++VIG G
Sbjct: 301 SDIQDVHILSGFVRDHLPDL--QPEPAVMEHCMYTNTPDGHFVLD--RHPKYDNIVIGAG 360

Query: 361 FSGHGFKMSPAVGRILAELALKGEAEGTELKYFKMARFEENPKGNV 401
           FSGHGFK+SP VG+IL EL++K      +L  F+++RF    K ++
Sbjct: 361 FSGHGFKLSPVVGKILYELSMK-LTPSYDLTPFRISRFPSLGKAHL 392

BLAST of HG10013515 vs. ExPASy Swiss-Prot
Match: Q9P0Z9 (Peroxisomal sarcosine oxidase OS=Homo sapiens OX=9606 GN=PIPOX PE=1 SV=2)

HSP 1 Score: 240.4 bits (612), Expect = 3.7e-62
Identity = 140/404 (34.65%), Positives = 216/404 (53.47%), Query Frame = 0

Query: 1   MADSGTQFDVIVVGAGVMGSSTAYHLAKTGNRILILEQFDFLHHRGSSHGESRTIRATYP 60
           MA     +D IV+GAG+ G  TAYHLAK   RIL+LEQF   H RGSSHG+SR IR  Y 
Sbjct: 1   MAAQKDLWDAIVIGAGIQGCFTAYHLAKHRKRILLLEQFFLPHSRGSSHGQSRIIRKAYL 60

Query: 61  EDYYHGLVMESYELWRMTEAEIGYRVYFPAEQLDIGPSDDKSLAAVVETCRKHSIPHLVL 120
           ED+Y  ++ E Y++W   E E G +++     L +G  +++ L  +     +  + H  L
Sbjct: 61  EDFYTRMMHECYQIWAQLEHEAGTQLHRQTGLLLLGMKENQELKTIQANLSRQRVEHQCL 120

Query: 121 DRGQLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVEIK 180
              +L +++   + +P   V +    GGVI   KA+   Q    + G +++D  +VVEI 
Sbjct: 121 SSEELKQRFP-NIRLPRGEVGLLDNSGGVIYAYKALRALQDAIRQLGGIVRDGEKVVEI- 180

Query: 181 RDGSSGGIVVSTANGQSFRGKKCVVTVGAWARKLFKSVGGIELPIQPLEATVSYWRIKEG 240
              + G +V      +S++ K  V+T G W  +L + + GIE+P+Q L   V YWR    
Sbjct: 181 ---NPGLLVTVKTTSRSYQAKSLVITAGPWTNQLLRPL-GIEMPLQTLRINVCYWR---E 240

Query: 241 VEAAEYAIGGDFPTFASYG--DPYIYGTPSLEFPGLIKVAIHGGHQCDPDKRS--WGNGE 300
           +    Y +   FP F   G    +IYG P+ E+PGL+KV+ H G+  DP++R       +
Sbjct: 241 MVPGSYGVSQAFPCFLWLGLCPHHIYGLPTGEYPGLMKVSYHHGNHADPEERDCPTARTD 300

Query: 301 RLPIATLKEWIEERFGGRVDSSEPVSTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFS 360
              +  L  ++ +         EP   + CMY+ TPDE F++D        ++VIG GFS
Sbjct: 301 IGDVQILSSFVRDHLPDL--KPEPAVIESCMYTNTPDEQFILD--RHPKYDNIVIGAGFS 360

Query: 361 GHGFKMSPAVGRILAELALKGEAEGTELKYFKMARFEENPKGNV 401
           GHGFK++P VG+IL EL++K      +L  F+++RF    K ++
Sbjct: 361 GHGFKLAPVVGKILYELSMK-LTPSYDLAPFRISRFPSLGKAHL 390

BLAST of HG10013515 vs. ExPASy Swiss-Prot
Match: P79371 (Peroxisomal sarcosine oxidase OS=Oryctolagus cuniculus OX=9986 GN=PIPOX PE=1 SV=1)

HSP 1 Score: 237.3 bits (604), Expect = 3.1e-61
Identity = 138/404 (34.16%), Positives = 214/404 (52.97%), Query Frame = 0

Query: 1   MADSGTQFDVIVVGAGVMGSSTAYHLAKTGNRILILEQFDFLHHRGSSHGESRTIRATYP 60
           MA     +D IV+GAG+ G  T YHL K   RIL+LEQF   H RGSSHG+SR IR  Y 
Sbjct: 1   MAAQKDLWDAIVIGAGIQGCFTVYHLVKHRKRILLLEQFFLPHSRGSSHGQSRIIRKAYL 60

Query: 61  EDYYHGLVMESYELWRMTEAEIGYRVYFPAEQLDIGPSDDKSLAAVVETCRKHSIPHLVL 120
           ED+Y  ++ E Y++W   E E G +++     L +G  +++ L  +     +  + H  L
Sbjct: 61  EDFYTRMMHECYQIWAQLEHEAGTQLHRQTGLLLLGMKENQELKTIQANLSRQRVEHQCL 120

Query: 121 DRGQLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVEIK 180
              +L +++   + +P   V +    GGVI   KA+   Q    + G +++D  +VVEI 
Sbjct: 121 SSEELKQRFP-NIRLPRGEVGLLDNSGGVIYAYKALRALQDAIRQLGGIVRDGEKVVEI- 180

Query: 181 RDGSSGGIVVSTANGQSFRGKKCVVTVGAWARKLFKSVGGIELPIQPLEATVSYWRIKEG 240
              + G +V      +S++ K  V+T G W  +L + + GIE+P+Q L   V YWR    
Sbjct: 181 ---NPGLLVTVKTTSRSYQAKSLVITAGPWTNQLLRPL-GIEMPLQTLRINVCYWR---E 240

Query: 241 VEAAEYAIGGDFPTFASYG--DPYIYGTPSLEFPGLIKVAIHGGHQCDPDKRS--WGNGE 300
           +    Y +   FP F   G    +IYG P+ E+PGL+KV+ H G+  DP++R       +
Sbjct: 241 MVPGSYGVSQAFPCFLWLGLCPHHIYGLPTGEYPGLMKVSYHHGNHADPEERDCPTARTD 300

Query: 301 RLPIATLKEWIEERFGGRVDSSEPVSTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFS 360
              +  L  ++ +         EP   + CMY+ TPDE F++D        ++VIG GFS
Sbjct: 301 IGDVQILSSFVRDHLPDL--KPEPAVIESCMYTNTPDEQFILD--RHPKYDNIVIGAGFS 360

Query: 361 GHGFKMSPAVGRILAELALKGEAEGTELKYFKMARFEENPKGNV 401
           GHGFK++P VG+IL EL++K      +L  F+++RF    K ++
Sbjct: 361 GHGFKLAPVVGKILYELSMK-LTPSYDLAPFRISRFPSLGKAHL 390

BLAST of HG10013515 vs. ExPASy Swiss-Prot
Match: Q9D826 (Peroxisomal sarcosine oxidase OS=Mus musculus OX=10090 GN=Pipox PE=1 SV=1)

HSP 1 Score: 235.0 bits (598), Expect = 1.6e-60
Identity = 141/404 (34.90%), Positives = 211/404 (52.23%), Query Frame = 0

Query: 1   MADSGTQFDVIVVGAGVMGSSTAYHLAKTGNRILILEQFDFLHHRGSSHGESRTIRATYP 60
           MA     +D IV+GAG+ G  TAYHLAK    +L+LEQF   H RGSSHG+SR IR  YP
Sbjct: 1   MAAQTDFWDAIVIGAGIQGCFTAYHLAKHSKSVLLLEQFFLPHSRGSSHGQSRIIRKAYP 60

Query: 61  EDYYHGLVMESYELWRMTEAEIGYRVYFPAEQLDIGPSDDKSLAAVVETCRKHSIPHLVL 120
           ED+Y  ++ E Y+ W   E E G +++   E L +G  ++  L  +  T  +  I H  L
Sbjct: 61  EDFYTMMMKECYQTWAQLEREAGTQLHRQTELLLLGTKENPGLKTIQATLSRQGIDHEYL 120

Query: 121 DRGQLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVEIK 180
               L +++   +      V +  K GGV+   KA+   Q +  + G  + D  +VVEI+
Sbjct: 121 SSVDLKQRFP-NIRFTRGEVGLLDKTGGVLYADKALRALQHIICQLGGTVCDGEKVVEIR 180

Query: 181 RDGSSGGIVVSTANGQSFRGKKCVVTVGAWARKLFKSVGGIELPIQPLEATVSYWRIKEG 240
                G  V      +S++    V+T G W  +L   + GIELP+Q L   V YWR K  
Sbjct: 181 ----PGLPVTVKTTLKSYQANSLVITAGPWTNRLLHPL-GIELPLQTLRINVCYWREK-- 240

Query: 241 VEAAEYAIGGDFPTF--ASYGDPYIYGTPSLEFPGLIKVAIHGGHQCDPDKRSWGN--GE 300
                Y +   FP          +IYG P+ E+PGL+K+  H G   DP++R       +
Sbjct: 241 -VPGSYGVSQAFPCILGLDLAPHHIYGLPASEYPGLMKICYHHGDNVDPEERDCPKTFSD 300

Query: 301 RLPIATLKEWIEERFGGRVDSSEPVSTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFS 360
              +  L  ++ +   G    +EP   + CMY+ TPDE F++D    +++ ++VIG GFS
Sbjct: 301 IQDVQILCHFVRDHLPGL--RAEPDIMERCMYTNTPDEHFILD-CHPKYD-NIVIGAGFS 360

Query: 361 GHGFKMSPAVGRILAELALKGEAEGTELKYFKMARFEENPKGNV 401
           GHGFK++P VG+IL EL++K      +L  F+M+RF    K ++
Sbjct: 361 GHGFKLAPVVGKILYELSMK-LPPSYDLAPFRMSRFSTLSKAHL 390

BLAST of HG10013515 vs. ExPASy TrEMBL
Match: A0A6J1FC48 (Sarcosine oxidase OS=Cucurbita moschata OX=3662 GN=LOC111442750 PE=3 SV=1)

HSP 1 Score: 767.7 bits (1981), Expect = 2.5e-218
Identity = 370/410 (90.24%), Positives = 388/410 (94.63%), Query Frame = 0

Query: 1   MADSGTQFDVIVVGAGVMGSSTAYHLAKTGNRILILEQFDFLHHRGSSHGESRTIRATYP 60
           MADSGTQ+DVIVVGAGVMGSSTAYHLAKTGN++L LEQFDFLHHRGSSHGESRTIRATYP
Sbjct: 1   MADSGTQYDVIVVGAGVMGSSTAYHLAKTGNKVLTLEQFDFLHHRGSSHGESRTIRATYP 60

Query: 61  EDYYHGLVMESYELWRMTEAEIGYRVYFPAEQLDIGPSDDKSLAAVVETCRKHSIPHLVL 120
           EDYYH LVMESYELWR  EAEIGYRVYFPAEQLDIG SDDKSLAA V+TC+KHSIPHLVL
Sbjct: 61  EDYYHSLVMESYELWRTAEAEIGYRVYFPAEQLDIGHSDDKSLAAAVDTCKKHSIPHLVL 120

Query: 121 DRGQLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVEIK 180
           DR QLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQ+LA+KNGA LKDNAEVVEIK
Sbjct: 121 DREQLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQTLAFKNGADLKDNAEVVEIK 180

Query: 181 RDGSSGGIVVSTANGQSFRGKKCVVTVGAWARKLFKSVGGIELPIQPLEATVSYWRIKEG 240
           R+GSSGG+VVSTANG+ FRGKKCVVTVGAWARKL KSVGGIELPIQPLEATVSYWRIKEG
Sbjct: 181 REGSSGGVVVSTANGERFRGKKCVVTVGAWARKLVKSVGGIELPIQPLEATVSYWRIKEG 240

Query: 241 VEAAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKVAIHGGHQCDPDKRSWGNGERLPI 300
            E  EYAIGGDFPTFASYGD YIYGTPSLEFPGLIKVA+HGGH+CDPDKRSWG+G  LPI
Sbjct: 241 AE-GEYAIGGDFPTFASYGDTYIYGTPSLEFPGLIKVAVHGGHRCDPDKRSWGSGGELPI 300

Query: 301 ATLKEWIEERFGGRVDSSEPVSTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGF 360
             LKEWIE RFGGRVDSSEPVSTQLCMYSMTPDEDFVIDFLGGEF KDVVIGGGFSGHGF
Sbjct: 301 TALKEWIEGRFGGRVDSSEPVSTQLCMYSMTPDEDFVIDFLGGEFGKDVVIGGGFSGHGF 360

Query: 361 KMSPAVGRILAELALKGEAEGTELKYFKMARFEENPKGNVKSFADQVKLH 411
           KMSPA+GRILA+LALKG AEG ELKYF++ARFEENPKGNVKSFADQV+LH
Sbjct: 361 KMSPAIGRILADLALKGVAEGLELKYFRIARFEENPKGNVKSFADQVRLH 409

BLAST of HG10013515 vs. ExPASy TrEMBL
Match: A0A6J1IIQ6 (Sarcosine oxidase OS=Cucurbita maxima OX=3661 GN=LOC111474391 PE=3 SV=1)

HSP 1 Score: 764.6 bits (1973), Expect = 2.1e-217
Identity = 368/410 (89.76%), Positives = 388/410 (94.63%), Query Frame = 0

Query: 1   MADSGTQFDVIVVGAGVMGSSTAYHLAKTGNRILILEQFDFLHHRGSSHGESRTIRATYP 60
           MADSGTQ+DVIVVGAGVMGSSTAYHLAKTGN++LILEQFDFLHHRGSSHGESRTIRATYP
Sbjct: 1   MADSGTQYDVIVVGAGVMGSSTAYHLAKTGNKVLILEQFDFLHHRGSSHGESRTIRATYP 60

Query: 61  EDYYHGLVMESYELWRMTEAEIGYRVYFPAEQLDIGPSDDKSLAAVVETCRKHSIPHLVL 120
           EDYYH LVMESYELWR  EAEIGYRVYFPAEQLDIG SDDKSLAA V+TC+KHSIPHLVL
Sbjct: 61  EDYYHSLVMESYELWRTAEAEIGYRVYFPAEQLDIGHSDDKSLAAAVDTCKKHSIPHLVL 120

Query: 121 DRGQLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVEIK 180
           DR QLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQ+LA+KNGA LKDNAEVVEIK
Sbjct: 121 DREQLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQTLAFKNGADLKDNAEVVEIK 180

Query: 181 RDGSSGGIVVSTANGQSFRGKKCVVTVGAWARKLFKSVGGIELPIQPLEATVSYWRIKEG 240
           RDGSSGG+VVS ANG+ FRGKKCVVTVGAWARKL KSVGGIELPIQPLEATVSYWRIKEG
Sbjct: 181 RDGSSGGVVVSVANGERFRGKKCVVTVGAWARKLVKSVGGIELPIQPLEATVSYWRIKEG 240

Query: 241 VEAAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKVAIHGGHQCDPDKRSWGNGERLPI 300
            E  EYAIGG+FPTFASYGD YIYGTPSLEFPGLIKVA+HGGH+CDPDKRSWG+G +LPI
Sbjct: 241 AE-GEYAIGGNFPTFASYGDTYIYGTPSLEFPGLIKVAVHGGHRCDPDKRSWGSGGQLPI 300

Query: 301 ATLKEWIEERFGGRVDSSEPVSTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGF 360
             LK+WIE RFGGRVDSSEPVSTQLCMYSMTPDEDFVIDFLGGEF KDVVIGGGFSGHGF
Sbjct: 301 TALKKWIEGRFGGRVDSSEPVSTQLCMYSMTPDEDFVIDFLGGEFGKDVVIGGGFSGHGF 360

Query: 361 KMSPAVGRILAELALKGEAEGTELKYFKMARFEENPKGNVKSFADQVKLH 411
           KMSPA+GRILA+LALKG AEG ELKYF++ARFEENPKGNVK FADQV+LH
Sbjct: 361 KMSPAIGRILADLALKGVAEGLELKYFRIARFEENPKGNVKGFADQVRLH 409

BLAST of HG10013515 vs. ExPASy TrEMBL
Match: A0A0A0LJG6 (Sarcosine oxidase OS=Cucumis sativus OX=3659 GN=Csa_3G881890 PE=3 SV=1)

HSP 1 Score: 741.9 bits (1914), Expect = 1.4e-210
Identity = 357/410 (87.07%), Positives = 379/410 (92.44%), Query Frame = 0

Query: 1   MADSGTQFDVIVVGAGVMGSSTAYHLAKTGNRILILEQFDFLHHRGSSHGESRTIRATYP 60
           MA S T FDVIVVGAGVMGSSTAYHLAKTGNR+LILEQFDFLHHRGSSHGESRTIRATYP
Sbjct: 1   MAASDTLFDVIVVGAGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYP 60

Query: 61  EDYYHGLVMESYELWRMTEAEIGYRVYFPAEQLDIGPSDDKSLAAVVETCRKHSIPHLVL 120
           EDYY+GLVMESYELWRM E EIGY+VYFP EQLDIG  DDKSL AVV+TCRKHSIPHLVL
Sbjct: 61  EDYYYGLVMESYELWRMAEEEIGYKVYFPTEQLDIGSPDDKSLTAVVDTCRKHSIPHLVL 120

Query: 121 DRGQLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVEIK 180
           D G+L EKYSGRVEIPADWV VWSKYGGVIKPTKAVSM+Q+LAYKNGAV+KDNAEVVEIK
Sbjct: 121 DSGELREKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMYQTLAYKNGAVMKDNAEVVEIK 180

Query: 181 RDGSSGGIVVSTANGQSFRGKKCVVTVGAWARKLFKSVGGIELPIQPLEATVSYWRIKEG 240
           RD S+G IVVS ANG+SFRGKKCVVTVGAW++KL KSVGGIELPI+PLE +VSYWRIKEG
Sbjct: 181 RDESNGRIVVSIANGESFRGKKCVVTVGAWSKKLVKSVGGIELPIRPLEVSVSYWRIKEG 240

Query: 241 VEAAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKVAIHGGHQCDPDKRSWGNGERLPI 300
            E AEYAIGG FPT ASYG+PY+YGTPSLEFPGLIKVAIHGGH+C+PDKRSWG G RLPI
Sbjct: 241 FE-AEYAIGGGFPTIASYGEPYVYGTPSLEFPGLIKVAIHGGHECNPDKRSWGKGGRLPI 300

Query: 301 ATLKEWIEERFGGRVDSSEPVSTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGF 360
           A LKEWI+E+FGGRVDSS PVSTQ CMYSMTPD DFVIDFLGGEFEKDVVIGGGFSGHGF
Sbjct: 301 AALKEWIDEKFGGRVDSSGPVSTQSCMYSMTPDGDFVIDFLGGEFEKDVVIGGGFSGHGF 360

Query: 361 KMSPAVGRILAELALKGEAEGTELKYFKMARFEENPKGNVKSFADQVKLH 411
           KMSP +GRILAELAL G AEG ELKYFK+ARFEENPKGNVKSFADQVKLH
Sbjct: 361 KMSPTIGRILAELALDGAAEGVELKYFKLARFEENPKGNVKSFADQVKLH 409

BLAST of HG10013515 vs. ExPASy TrEMBL
Match: A0A5D3E5F8 (Sarcosine oxidase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold455G006680 PE=3 SV=1)

HSP 1 Score: 734.2 bits (1894), Expect = 3.0e-208
Identity = 355/410 (86.59%), Positives = 380/410 (92.68%), Query Frame = 0

Query: 1   MADSGTQFDVIVVGAGVMGSSTAYHLAKTGNRILILEQFDFLHHRGSSHGESRTIRATYP 60
           MA S TQFDVIVVGAGVMGSSTAYHLAKTGNR+L+LEQFDFLHHRGSSHGESRTIRATYP
Sbjct: 1   MAASDTQFDVIVVGAGVMGSSTAYHLAKTGNRVLLLEQFDFLHHRGSSHGESRTIRATYP 60

Query: 61  EDYYHGLVMESYELWRMTEAEIGYRVYFPAEQLDIGPSDDKSLAAVVETCRKHSIPHLVL 120
           EDYYH LVMESYELWRM EAEIG++VY+PAEQLDIGPS+ +SL AVV+TC KHSIPHLVL
Sbjct: 61  EDYYHDLVMESYELWRMAEAEIGFKVYYPAEQLDIGPSNSESLTAVVDTCLKHSIPHLVL 120

Query: 121 DRGQLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVEIK 180
           DRG+L EKYSGRVEIPA+WVAV SKYGGVIKPTKAVSMFQ+LAYKNG VLKDNAEVVEIK
Sbjct: 121 DRGELMEKYSGRVEIPANWVAVCSKYGGVIKPTKAVSMFQTLAYKNGTVLKDNAEVVEIK 180

Query: 181 RDGSSGGIVVSTANGQSFRGKKCVVTVGAWARKLFKSVGGIELPIQPLEATVSYWRIKEG 240
           RD S+G IVVSTANG+SF GKKCVVTVGAW++KL KSVGGIELPI PLE +VSYWRIKEG
Sbjct: 181 RDESNGRIVVSTANGESFHGKKCVVTVGAWSKKLVKSVGGIELPILPLEVSVSYWRIKEG 240

Query: 241 VEAAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKVAIHGGHQCDPDKRSWGNGERLPI 300
            E AEYAI G FPT ASYG+PY+YGTPSLEFPGLIKVAIH G+ C+PDKRSWG   RLPI
Sbjct: 241 FE-AEYAIEGGFPTMASYGEPYVYGTPSLEFPGLIKVAIHAGYSCNPDKRSWGRDGRLPI 300

Query: 301 ATLKEWIEERFGGRVDSSEPVSTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGF 360
           A LKEWI+E+FGGRVDSS PVS+QLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGF
Sbjct: 301 AALKEWIDEKFGGRVDSSRPVSSQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGF 360

Query: 361 KMSPAVGRILAELALKGEAEGTELKYFKMARFEENPKGNVKSFADQVKLH 411
           KMSPA+GRILAELALKG AEG ELKYFK+ARFEENPKGNVKSFADQVKLH
Sbjct: 361 KMSPAIGRILAELALKGAAEGVELKYFKLARFEENPKGNVKSFADQVKLH 409

BLAST of HG10013515 vs. ExPASy TrEMBL
Match: A0A1S3CR44 (Sarcosine oxidase OS=Cucumis melo OX=3656 GN=LOC103503750 PE=3 SV=1)

HSP 1 Score: 733.8 bits (1893), Expect = 3.9e-208
Identity = 355/410 (86.59%), Positives = 380/410 (92.68%), Query Frame = 0

Query: 1   MADSGTQFDVIVVGAGVMGSSTAYHLAKTGNRILILEQFDFLHHRGSSHGESRTIRATYP 60
           MA S TQFDVIVVGAGVMGSSTAYHLAKTGNR+L+LEQFDFLHHRGSSHGESRTIRATYP
Sbjct: 1   MAASDTQFDVIVVGAGVMGSSTAYHLAKTGNRVLLLEQFDFLHHRGSSHGESRTIRATYP 60

Query: 61  EDYYHGLVMESYELWRMTEAEIGYRVYFPAEQLDIGPSDDKSLAAVVETCRKHSIPHLVL 120
           EDYYH LVMESYELWRM EAEIG++VY+PAEQLDIGPS+ +SL AVV+TC KHSIPHLVL
Sbjct: 61  EDYYHDLVMESYELWRMAEAEIGFKVYYPAEQLDIGPSNSESLTAVVDTCLKHSIPHLVL 120

Query: 121 DRGQLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVEIK 180
           DRG+L EKYSGRVEIPA+WVAV SKYGGVIKPTKAVSMFQ+LAYKNG VLKDNAEVVEIK
Sbjct: 121 DRGELMEKYSGRVEIPANWVAVCSKYGGVIKPTKAVSMFQTLAYKNGTVLKDNAEVVEIK 180

Query: 181 RDGSSGGIVVSTANGQSFRGKKCVVTVGAWARKLFKSVGGIELPIQPLEATVSYWRIKEG 240
           RD S+G IVVSTANG+SF GKKCVVTVGAW++KL KSVGGIELPI PLE +VSYWRIKEG
Sbjct: 181 RDESNGRIVVSTANGESFHGKKCVVTVGAWSKKLVKSVGGIELPILPLEVSVSYWRIKEG 240

Query: 241 VEAAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKVAIHGGHQCDPDKRSWGNGERLPI 300
            E AEYAI G FPT ASYG+PY+YGTPSLEFPGLIKVAIH G+ C+PDKRSWG   RLPI
Sbjct: 241 FE-AEYAIEGGFPTMASYGEPYVYGTPSLEFPGLIKVAIHAGYSCNPDKRSWGREGRLPI 300

Query: 301 ATLKEWIEERFGGRVDSSEPVSTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGF 360
           A LKEWI+E+FGGRVDSS PVS+QLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGF
Sbjct: 301 AALKEWIDEKFGGRVDSSRPVSSQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGF 360

Query: 361 KMSPAVGRILAELALKGEAEGTELKYFKMARFEENPKGNVKSFADQVKLH 411
           KMSPA+GRILAELALKG AEG ELKYFK+ARFEENPKGNVKSFADQVKLH
Sbjct: 361 KMSPAIGRILAELALKGAAEGVELKYFKLARFEENPKGNVKSFADQVKLH 409

BLAST of HG10013515 vs. TAIR 10
Match: AT2G24580.1 (FAD-dependent oxidoreductase family protein)

HSP 1 Score: 569.3 bits (1466), Expect = 2.5e-162
Identity = 269/411 (65.45%), Positives = 329/411 (80.05%), Query Frame = 0

Query: 2   ADSGTQFDVIVVGAGVMGSSTAYHLAKTGNRILILEQFDFLHHRGSSHGESRTIRATYPE 61
           +D G +FDVIVVGAGVMGSS AY LAK G + L+LEQFDFLHHRGSSHGESRTIRATYPE
Sbjct: 4   SDDG-RFDVIVVGAGVMGSSAAYQLAKRGQKTLLLEQFDFLHHRGSSHGESRTIRATYPE 63

Query: 62  DYYHGLVMESYELWRMTEAEIGYRVYFPAEQLDIGPSDDKSLAAVVETCRKHSIPHLVLD 121
           DYY+ +V ES  LW   ++EIGY+V+FP +Q D+GP+D +SL +VV TC+KH + H V+D
Sbjct: 64  DYYYSMVSESTRLWAAAQSEIGYKVHFPTQQFDMGPADQQSLLSVVATCQKHGLAHRVMD 123

Query: 122 RGQLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVEIKR 181
              ++E +SGR+ IP +W+ V ++ GG+IKPTKAVSMFQ+LA  +GA+L+DN +V  IKR
Sbjct: 124 SHAVSEHFSGRISIPENWIGVSTELGGIIKPTKAVSMFQTLAIGHGAILRDNTKVANIKR 183

Query: 182 DGSSG-GIVVSTANGQSFRGKKCVVTVGAWARKLFKSVGGIELPIQPLEATVSYWRIKEG 241
           DG SG G++V T  G  F GKKC+VT GAW  KL K+V GI+ P++PLE TV YWRIKEG
Sbjct: 184 DGESGEGVIVCTVKGDKFYGKKCIVTAGAWISKLVKTVAGIDFPVEPLETTVCYWRIKEG 243

Query: 242 VEAAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKVAIHGGHQCDPDKRSWGNGERLPI 301
            E  ++ I G+FPTFASYG PY+YGTPSLE+PGLIKVA+HGG+ CDPDKR WG G +L  
Sbjct: 244 HE-EKFTIDGEFPTFASYGAPYVYGTPSLEYPGLIKVAVHGGYWCDPDKRPWGPGVKL-- 303

Query: 302 ATLKEWIEERFGGRVDSSEPVSTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGF 361
             LKEWI+ERFGG VDS  PV+TQLCMYSMTPDEDFVIDFLGGEF +DVV+GGGFSGHGF
Sbjct: 304 EELKEWIKERFGGMVDSEGPVATQLCMYSMTPDEDFVIDFLGGEFGRDVVVGGGFSGHGF 363

Query: 362 KMSPAVGRILAELALKGEA--EGTELKYFKMARFEENPKGNVKSFADQVKL 410
           KM+PAVGRILA++A++ EA   G E+K F + RFE+NPKGN K + DQV L
Sbjct: 364 KMAPAVGRILADMAMEVEAGGGGVEMKQFSLRRFEDNPKGNAKEYPDQVIL 410
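
The identity and positives percentages reported for this HSP follow directly from the raw counts over the 411 alignment columns (gap columns included). A minimal sketch of that arithmetic, where the helper name `percent` is hypothetical and not part of any BLAST tool:

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of alignment columns, rounded to 2 decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    return round(100.0 * matches / alignment_length, 2)


# Counts taken from the HSP header of the TAIR 10 match AT2G24580.1.
identity = percent(269, 411)
positives = percent(329, 411)
print(f"Identity = {identity:.2f}%, Positives = {positives:.2f}%")
```

The same function can be applied to any other HSP header on the page to check that the reported percentage matches its counts.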

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038896765.1  3.4e-222  91.93     probable sarcosine oxidase [Benincasa hispida]
XP_022936028.1  5.1e-218  90.24     probable sarcosine oxidase [Cucurbita moschata]
XP_022975268.1  4.3e-217  89.76     probable sarcosine oxidase [Cucurbita maxima]
KAG7024517.1    5.6e-217  89.76     putative sarcosine oxidase, partial [Cucurbita argyrosperma subsp. argyrosperma]
XP_023534889.1  2.1e-216  89.76     probable sarcosine oxidase [Cucurbita pepo subsp. pepo]
Match Name  E-value   Identity  Description
Q9SJA7      3.5e-161  65.45     Probable sarcosine oxidase OS=Arabidopsis thaliana OX=3702 GN=At2g24580 PE=2 SV=...
Q29RU9      5.7e-63   35.22     Peroxisomal sarcosine oxidase OS=Bos taurus OX=9913 GN=PIPOX PE=2 SV=2
Q9P0Z9      3.7e-62   34.65     Peroxisomal sarcosine oxidase OS=Homo sapiens OX=9606 GN=PIPOX PE=1 SV=2
P79371      3.1e-61   34.16     Peroxisomal sarcosine oxidase OS=Oryctolagus cuniculus OX=9986 GN=PIPOX PE=1 SV=...
Q9D826      1.6e-60   34.90     Peroxisomal sarcosine oxidase OS=Mus musculus OX=10090 GN=Pipox PE=1 SV=1
Match Name  E-value   Identity  Description
A0A6J1FC48  2.5e-218  90.24     Sarcosine oxidase OS=Cucurbita moschata OX=3662 GN=LOC111442750 PE=3 SV=1
A0A6J1IIQ6  2.1e-217  89.76     Sarcosine oxidase OS=Cucurbita maxima OX=3661 GN=LOC111474391 PE=3 SV=1
A0A0A0LJG6  1.4e-210  87.07     Sarcosine oxidase OS=Cucumis sativus OX=3659 GN=Csa_3G881890 PE=3 SV=1
A0A5D3E5F8  3.0e-208  86.59     Sarcosine oxidase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold455G00...
A0A1S3CR44  3.9e-208  86.59     Sarcosine oxidase OS=Cucumis melo OX=3656 GN=LOC103503750 PE=3 SV=1
Match Name   E-value   Identity  Description
AT2G24580.1  2.5e-162  65.45     FAD-dependent oxidoreductase family protein
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                        Source       Source Term    Source Description                        Alignment
IPR006281  Sarcosine oxidase, monomeric           TIGRFAM      TIGR01377      TIGR01377                                 coord: 8..398; e-value: 3.0E-153; score: 508.0
IPR006076  FAD dependent oxidoreductase           PFAM         PF01266        DAO                                       coord: 9..373; e-value: 2.7E-47; score: 162.1
IPR036188  FAD/NAD(P)-binding domain superfamily  GENE3D       3.50.50.60     -                                         coord: 8..378; e-value: 1.9E-103; score: 348.8
IPR036188  FAD/NAD(P)-binding domain superfamily  SUPERFAMILY  51905          FAD/NAD(P)-binding domain                 coord: 7..393
None       No IPR available                       GENE3D       3.30.9.10      -                                         coord: 91..330; e-value: 1.9E-103; score: 348.8
None       No IPR available                       PANTHER      PTHR10961      PEROXISOMAL SARCOSINE OXIDASE             coord: 6..402
None       No IPR available                       PANTHER      PTHR10961:SF7  PEROXISOMAL SARCOSINE OXIDASE             coord: 6..402
None       No IPR available                       SUPERFAMILY  54373          FAD-linked reductases, C-terminal domain  coord: 226..332
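
The coord ranges in the InterPro table are inclusive, 1-based residue positions, so each hit's length and its share of the protein can be worked out directly. The sketch below assumes a protein length of 410 residues, inferred from the 1233-nt coding sequence listed for this gene (411 codons including the stop); that length is an assumption and is not given in the table. The names `HITS`, `span_length`, and `coverage` are illustrative only.

```python
# Assumption: the protein is 410 residues long (1233-nt coding sequence / 3,
# minus the stop codon). This value is not stated in the InterPro table.
PROTEIN_LENGTH = 410

# Coordinates copied from the "coord:" fields of the InterPro table.
HITS = {
    "TIGR01377 (TIGRFAM)": (8, 398),
    "PF01266 (PFAM)": (9, 373),
    "3.50.50.60 (GENE3D)": (8, 378),
    "51905 (SUPERFAMILY)": (7, 393),
    "3.30.9.10 (GENE3D)": (91, 330),
    "PTHR10961 (PANTHER)": (6, 402),
    "54373 (SUPERFAMILY)": (226, 332),
}


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based residue range."""
    return end - start + 1


def coverage(start: int, end: int, total: int = PROTEIN_LENGTH) -> float:
    """Percentage of the protein covered by the inclusive range."""
    return 100.0 * span_length(start, end) / total


for name, (start, end) in HITS.items():
    print(f"{name}: {span_length(start, end)} aa, "
          f"{coverage(start, end):.1f}% of protein")
```

Under that length assumption, the TIGR01377 hit (residues 8 to 398) spans 391 residues, which is most of the protein and is consistent with the gene's description as a monomeric sarcosine oxidase.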

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10013515.1  HG10013515.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0046653      tetrahydrofolate metabolic process
molecular_function  GO:0008115      sarcosine oxidase activity
molecular_function  GO:0016491      oxidoreductase activity