ClCG05G026450 (gene) Watermelon (Charleston Gray)

Name: ClCG05G026450
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: Sarcosine oxidase family protein
Location: CG_Chr05 : 37692366 .. 37693595 (+)
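The location field can be turned into a feature length with a few lines of code. The sketch below is illustrative only: the helper names `parse_location` and `feature_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def feature_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

seqid, start, end, strand = parse_location("CG_Chr05 : 37692366 .. 37693595 (+)")
print(seqid, strand, feature_length(start, end))  # CG_Chr05 + 1230
```

Under that assumption the feature spans 1230 bp, which is a whole number of codons (1230 / 3 = 410).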
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTTGATTCCGACACTCAATTCGATGTTATCGTCGTCGGCGCCGGTGTAATGGGCAGTTCGACGGCGTACCATCTCGCTAAAACAGGGAACAGAGTTCTGATTCTGGAGCAATTCGATTTTCTTCATCACAGAGGCTCTTCTCATGGCGAATCTCGTACCATACGCGCCACCTATCCGGAGGATTACTACCACGACCTCGTCATGGAATCTTACGAGCTCTGGAAGATGGCGGAGGCGGAAATTGGCTACAGAGTCTATTTTCCGGCGGAACAGCTCGATATCGGTCCTTTCGACGACAAAAGTCTCGTTGCCGTCGTCGAGACCTGCCGGAAGCATTCGATCCCTCATCTGGTCCTGGATCGTGGACAACTGGCGGAGAAGTACTCCGGCAGAGTGGAGATTCCGGCGGACTGGGTGGCGGTGTGGAGCAAGTACGGCGGCGTAATTAAGCCGACGAAGGCGGTTTCGATGTTCCAAACTTTGGCGTACAAAAACGGCGCCGTTCTGAAGGACAATGCGGAAGTGGTGGAGATTAAGAGAGATGGGAGTAGTGGAGGAATAGTTGTTTCAATAGCCAATGGGGAGAGTTTTAGGGGAAAAAAATGTGTGGTGACAGTTGGCGCTTGGGCTAGAAAGTTAGTTAAATCAGTTAGTGGGATTGAATTGCCAATTCAGCCATTGGAGGTTGCTGTGTCTTATTGGAGGATCAAGGAAGGGGCCGAGGCCGAGTATGCAATCGGAGGGGGCTTTCCGACACTTGCTAGCTATGGCAACCCATATATTTACGGGACACCATCGTTGGAGTTTCCAGGATTGATCAAAGTGGCTATTCACGGTGGGCATCTGTGTGACCCGAACAAGCGGACGTGGGGGAGTGGGGAACAACTACCCATAGCTGTATTAAAAGAGTGGATAGAGGGGAGGTTTGGAGGGAGGGTGGATTCAAGTGAGCCAGCGGTAACGCAACTGTGTATGTACTCAATGACGCCAGATGAGGACTTTGTGATTGATTTTTTAGGAGGGGAATTTGAAAAGGATGTGGTGATCGGTGGCGGGTTTTCAGGGCATGGGTTCAAAATGTCACCAGTGGTCGGGAGGATACTGTCGGAGCTTGCATTGAAGGGGGAAGCAGAAGGGGTGGAGCTAAAGTATTTCAAGATAGCAAGGTTTGAGGAGAATCCGAAAGGTAATGTCAAGAGCTTTGCAGATCAAGTAAAACTTCACTAG

mRNA sequence

ATGGTTGATTCCGACACTCAATTCGATGTTATCGTCGTCGGCGCCGGTGTAATGGGCAGTTCGACGGCGTACCATCTCGCTAAAACAGGGAACAGAGTTCTGATTCTGGAGCAATTCGATTTTCTTCATCACAGAGGCTCTTCTCATGGCGAATCTCGTACCATACGCGCCACCTATCCGGAGGATTACTACCACGACCTCGTCATGGAATCTTACGAGCTCTGGAAGATGGCGGAGGCGGAAATTGGCTACAGAGTCTATTTTCCGGCGGAACAGCTCGATATCGGTCCTTTCGACGACAAAAGTCTCGTTGCCGTCGTCGAGACCTGCCGGAAGCATTCGATCCCTCATCTGGTCCTGGATCGTGGACAACTGGCGGAGAAGTACTCCGGCAGAGTGGAGATTCCGGCGGACTGGGTGGCGGTGTGGAGCAAGTACGGCGGCGTAATTAAGCCGACGAAGGCGGTTTCGATGTTCCAAACTTTGGCGTACAAAAACGGCGCCGTTCTGAAGGACAATGCGGAAGTGGTGGAGATTAAGAGAGATGGGAGTAGTGGAGGAATAGTTGTTTCAATAGCCAATGGGGAGAGTTTTAGGGGAAAAAAATGTGTGGTGACAGTTGGCGCTTGGGCTAGAAAGTTAGTTAAATCAGTTAGTGGGATTGAATTGCCAATTCAGCCATTGGAGGTTGCTGTGTCTTATTGGAGGATCAAGGAAGGGGCCGAGGCCGAGTATGCAATCGGAGGGGGCTTTCCGACACTTGCTAGCTATGGCAACCCATATATTTACGGGACACCATCGTTGGAGTTTCCAGGATTGATCAAAGTGGCTATTCACGGTGGGCATCTGTGTGACCCGAACAAGCGGACGTGGGGGAGTGGGGAACAACTACCCATAGCTGTATTAAAAGAGTGGATAGAGGGGAGGTTTGGAGGGAGGGTGGATTCAAGTGAGCCAGCGGTAACGCAACTGTGTATGTACTCAATGACGCCAGATGAGGACTTTGTGATTGATTTTTTAGGAGGGGAATTTGAAAAGGATGTGGTGATCGGTGGCGGGTTTTCAGGGCATGGGTTCAAAATGTCACCAGTGGTCGGGAGGATACTGTCGGAGCTTGCATTGAAGGGGGAAGCAGAAGGGGTGGAGCTAAAGTATTTCAAGATAGCAAGGTTTGAGGAGAATCCGAAAGGTAATGTCAAGAGCTTTGCAGATCAAGTAAAACTTCACTAG

Coding sequence (CDS)

ATGGTTGATTCCGACACTCAATTCGATGTTATCGTCGTCGGCGCCGGTGTAATGGGCAGTTCGACGGCGTACCATCTCGCTAAAACAGGGAACAGAGTTCTGATTCTGGAGCAATTCGATTTTCTTCATCACAGAGGCTCTTCTCATGGCGAATCTCGTACCATACGCGCCACCTATCCGGAGGATTACTACCACGACCTCGTCATGGAATCTTACGAGCTCTGGAAGATGGCGGAGGCGGAAATTGGCTACAGAGTCTATTTTCCGGCGGAACAGCTCGATATCGGTCCTTTCGACGACAAAAGTCTCGTTGCCGTCGTCGAGACCTGCCGGAAGCATTCGATCCCTCATCTGGTCCTGGATCGTGGACAACTGGCGGAGAAGTACTCCGGCAGAGTGGAGATTCCGGCGGACTGGGTGGCGGTGTGGAGCAAGTACGGCGGCGTAATTAAGCCGACGAAGGCGGTTTCGATGTTCCAAACTTTGGCGTACAAAAACGGCGCCGTTCTGAAGGACAATGCGGAAGTGGTGGAGATTAAGAGAGATGGGAGTAGTGGAGGAATAGTTGTTTCAATAGCCAATGGGGAGAGTTTTAGGGGAAAAAAATGTGTGGTGACAGTTGGCGCTTGGGCTAGAAAGTTAGTTAAATCAGTTAGTGGGATTGAATTGCCAATTCAGCCATTGGAGGTTGCTGTGTCTTATTGGAGGATCAAGGAAGGGGCCGAGGCCGAGTATGCAATCGGAGGGGGCTTTCCGACACTTGCTAGCTATGGCAACCCATATATTTACGGGACACCATCGTTGGAGTTTCCAGGATTGATCAAAGTGGCTATTCACGGTGGGCATCTGTGTGACCCGAACAAGCGGACGTGGGGGAGTGGGGAACAACTACCCATAGCTGTATTAAAAGAGTGGATAGAGGGGAGGTTTGGAGGGAGGGTGGATTCAAGTGAGCCAGCGGTAACGCAACTGTGTATGTACTCAATGACGCCAGATGAGGACTTTGTGATTGATTTTTTAGGAGGGGAATTTGAAAAGGATGTGGTGATCGGTGGCGGGTTTTCAGGGCATGGGTTCAAAATGTCACCAGTGGTCGGGAGGATACTGTCGGAGCTTGCATTGAAGGGGGAAGCAGAAGGGGTGGAGCTAAAGTATTTCAAGATAGCAAGGTTTGAGGAGAATCCGAAAGGTAATGTCAAGAGCTTTGCAGATCAAGTAAAACTTCACTAG

Protein sequence

MVDSDTQFDVIVVGAGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHDLVMESYELWKMAEAEIGYRVYFPAEQLDIGPFDDKSLVAVVETCRKHSIPHLVLDRGQLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQTLAYKNGAVLKDNAEVVEIKRDGSSGGIVVSIANGESFRGKKCVVTVGAWARKLVKSVSGIELPIQPLEVAVSYWRIKEGAEAEYAIGGGFPTLASYGNPYIYGTPSLEFPGLIKVAIHGGHLCDPNKRTWGSGEQLPIAVLKEWIEGRFGGRVDSSEPAVTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGFKMSPVVGRILSELALKGEAEGVELKYFKIARFEENPKGNVKSFADQVKLH
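The protein sequence above should follow from the coding sequence by codon-wise translation. A minimal sketch of that check, assuming the standard genetic code and an unambiguous A/C/G/T sequence (the `translate` helper and the table construction are illustrative, not part of this record):

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 and last 12 nucleotides of the CDS listed in this record.
print(translate("ATGGTTGATTCCGACACTCAATTCGATGTT"))  # MVDSDTQFDV
print(translate("AAACTTCACTAG"))                    # KLH
```

Both fragments agree with the start (MVDSDTQFDV) and end (KLH) of the listed protein sequence; passing the full CDS string to `translate` would check the rest.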
BLAST of ClCG05G026450 vs. Swiss-Prot
Match: SOX_ARATH (Probable sarcosine oxidase OS=Arabidopsis thaliana GN=At2g24580 PE=2 SV=1)

HSP 1 Score: 559.3 bits (1440), Expect = 3.5e-158
Identity = 264/407 (64.86%), Positives = 322/407 (79.12%), Query Frame = 1

Query: 5   DTQFDVIVVGAGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYY 64
           D +FDVIVVGAGVMGSS AY LAK G + L+LEQFDFLHHRGSSHGESRTIRATYPEDYY
Sbjct: 6   DGRFDVIVVGAGVMGSSAAYQLAKRGQKTLLLEQFDFLHHRGSSHGESRTIRATYPEDYY 65

Query: 65  HDLVMESYELWKMAEAEIGYRVYFPAEQLDIGPFDDKSLVAVVETCRKHSIPHLVLDRGQ 124
           + +V ES  LW  A++EIGY+V+FP +Q D+GP D +SL++VV TC+KH + H V+D   
Sbjct: 66  YSMVSESTRLWAAAQSEIGYKVHFPTQQFDMGPADQQSLLSVVATCQKHGLAHRVMDSHA 125

Query: 125 LAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQTLAYKNGAVLKDNAEVVEIKRDGS 184
           ++E +SGR+ IP +W+ V ++ GG+IKPTKAVSMFQTLA  +GA+L+DN +V  IKRDG 
Sbjct: 126 VSEHFSGRISIPENWIGVSTELGGIIKPTKAVSMFQTLAIGHGAILRDNTKVANIKRDGE 185

Query: 185 SG-GIVVSIANGESFRGKKCVVTVGAWARKLVKSVSGIELPIQPLEVAVSYWRIKEGAEA 244
           SG G++V    G+ F GKKC+VT GAW  KLVK+V+GI+ P++PLE  V YWRIKEG E 
Sbjct: 186 SGEGVIVCTVKGDKFYGKKCIVTAGAWISKLVKTVAGIDFPVEPLETTVCYWRIKEGHEE 245

Query: 245 EYAIGGGFPTLASYGNPYIYGTPSLEFPGLIKVAIHGGHLCDPNKRTWGSGEQLPIAVLK 304
           ++ I G FPT ASYG PY+YGTPSLE+PGLIKVA+HGG+ CDP+KR WG G +L    LK
Sbjct: 246 KFTIDGEFPTFASYGAPYVYGTPSLEYPGLIKVAVHGGYWCDPDKRPWGPGVKL--EELK 305

Query: 305 EWIEGRFGGRVDSSEPAVTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGFKMSP 364
           EWI+ RFGG VDS  P  TQLCMYSMTPDEDFVIDFLGGEF +DVV+GGGFSGHGFKM+P
Sbjct: 306 EWIKERFGGMVDSEGPVATQLCMYSMTPDEDFVIDFLGGEFGRDVVVGGGFSGHGFKMAP 365

Query: 365 VVGRILSELALKGEA--EGVELKYFKIARFEENPKGNVKSFADQVKL 409
            VGRIL+++A++ EA   GVE+K F + RFE+NPKGN K + DQV L
Sbjct: 366 AVGRILADMAMEVEAGGGGVEMKQFSLRRFEDNPKGNAKEYPDQVIL 410
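The identity figures reported for each HSP are counts of matching columns divided by the alignment length. The following sketch reproduces those numbers for the alignment above; the function names are hypothetical and the gap handling (a column of two gap characters is not counted as an identity) is an assumption.

```python
def count_identities(query, sbjct):
    """Count alignment columns where both rows carry the same residue."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have equal length")
    return sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")

def percent(matches, aligned_length):
    """Express a match count as a percentage, rounded to two decimals."""
    return round(100.0 * matches / aligned_length, 2)

# First 60-column block of the SOX_ARATH alignment (Query 5-64, Sbjct 6-65).
query = "DTQFDVIVVGAGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYY"
sbjct = "DGRFDVIVVGAGVMGSSAAYQLAKRGQKTLLLEQFDFLHHRGSSHGESRTIRATYPEDYY"
print(count_identities(query, sbjct))  # 51

# Whole-HSP totals as reported in the header line of this match.
print(percent(264, 407))  # 64.86
print(percent(322, 407))  # 79.12
```

Summing `count_identities` over every block of an HSP should give the numerator of the reported identity, here 264 of 407 aligned columns.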

BLAST of ClCG05G026450 vs. Swiss-Prot
Match: SOX_BOVIN (Peroxisomal sarcosine oxidase OS=Bos taurus GN=PIPOX PE=2 SV=2)

HSP 1 Score: 240.0 bits (611), Expect = 4.7e-62
Identity = 145/398 (36.43%), Positives = 212/398 (53.27%), Query Frame = 1

Query: 8   FDVIVVGAGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHDL 67
           +D IV+GAG+ G  TAYHLAK   +VL+LEQF   H RGSSHG+SR IR  YPED+Y  +
Sbjct: 8   YDAIVIGAGIQGCFTAYHLAKHSKKVLLLEQFFLPHSRGSSHGQSRIIRRAYPEDFYTQM 67

Query: 68  VMESYELWKMAEAEIGYRVYFPAEQLDIGPFDDKSLVAVVETCRKHSIPHLVLDRGQLAE 127
           + E Y LW   E E G ++Y     L +G  ++  L  +  T  +  + H  L   +L +
Sbjct: 68  MAECYSLWAQLEHEAGTQLYRQTGLLLLGMKENPELKIIQATLSRQGVEHQCLSSEELKQ 127

Query: 128 KYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQTLAYKNGAVLKDNAEVVEIKRDGSSGG 187
           ++   + +    V +    GGV+   KA+   Q    + G ++ D  +VVEIK    SG 
Sbjct: 128 RFP-NIRLARGEVGLLEVSGGVLYADKALRALQDAIRQLGGIVHDGEKVVEIK----SGL 187

Query: 188 IVVSIANGESFRGKKCVVTVGAWARKLVKSVSGIELPIQPLEVAVSYWRIKEGAEAEYAI 247
            V+      S++ K  ++T G W  +L++ + G ELP+Q L + V YW  +E     Y++
Sbjct: 188 PVMVKTTSRSYQAKSLIITAGPWTNRLLRPL-GAELPLQTLRINVCYW--QEKVPGSYSV 247

Query: 248 GGGFPTLASYG----NPYIYGTPSLEFPGLIKVAIHGGHLCDPNKRTWGS--GEQLPIAV 307
              FP     G      +IYG PS E+PGL+KV  H G+  DP +R   +   +   + +
Sbjct: 248 SQAFPCFMGLGLSLAPHHIYGLPSREYPGLMKVCYHHGNNADPEERDCPAAFSDIQDVHI 307

Query: 308 LKEWIEGRFGGRVDSSEPAVTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGFKM 367
           L  ++           EPAV + CMY+ TPD  FV+D        ++VIG GFSGHGFK+
Sbjct: 308 LSGFVRDHLPDL--QPEPAVMEHCMYTNTPDGHFVLD--RHPKYDNIVIGAGFSGHGFKL 367

Query: 368 SPVVGRILSELALKGEAEGVELKYFKIARFEENPKGNV 400
           SPVVG+IL EL++K      +L  F+I+RF    K ++
Sbjct: 368 SPVVGKILYELSMK-LTPSYDLTPFRISRFPSLGKAHL 392

BLAST of ClCG05G026450 vs. Swiss-Prot
Match: SOX_HUMAN (Peroxisomal sarcosine oxidase OS=Homo sapiens GN=PIPOX PE=1 SV=2)

HSP 1 Score: 236.9 bits (603), Expect = 4.0e-61
Identity = 141/396 (35.61%), Positives = 215/396 (54.29%), Query Frame = 1

Query: 8   FDVIVVGAGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHDL 67
           +D IV+GAG+ G  TAYHLAK   R+L+LEQF   H RGSSHG+SR IR  Y ED+Y  +
Sbjct: 8   WDAIVIGAGIQGCFTAYHLAKHRKRILLLEQFFLPHSRGSSHGQSRIIRKAYLEDFYTRM 67

Query: 68  VMESYELWKMAEAEIGYRVYFPAEQLDIGPFDDKSLVAVVETCRKHSIPHLVLDRGQLAE 127
           + E Y++W   E E G +++     L +G  +++ L  +     +  + H  L   +L +
Sbjct: 68  MHECYQIWAQLEHEAGTQLHRQTGLLLLGMKENQELKTIQANLSRQRVEHQCLSSEELKQ 127

Query: 128 KYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQTLAYKNGAVLKDNAEVVEIKRDGSSGG 187
           ++   + +P   V +    GGVI   KA+   Q    + G +++D  +VVEI    + G 
Sbjct: 128 RFP-NIRLPRGEVGLLDNSGGVIYAYKALRALQDAIRQLGGIVRDGEKVVEI----NPGL 187

Query: 188 IVVSIANGESFRGKKCVVTVGAWARKLVKSVSGIELPIQPLEVAVSYWRIKEGAEAEYAI 247
           +V       S++ K  V+T G W  +L++ + GIE+P+Q L + V YWR  E     Y +
Sbjct: 188 LVTVKTTSRSYQAKSLVITAGPWTNQLLRPL-GIEMPLQTLRINVCYWR--EMVPGSYGV 247

Query: 248 GGGFPTLASYG--NPYIYGTPSLEFPGLIKVAIHGGHLCDPNKRTWGSG--EQLPIAVLK 307
              FP     G    +IYG P+ E+PGL+KV+ H G+  DP +R   +   +   + +L 
Sbjct: 248 SQAFPCFLWLGLCPHHIYGLPTGEYPGLMKVSYHHGNHADPEERDCPTARTDIGDVQILS 307

Query: 308 EWIEGRFGGRVDSSEPAVTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGFKMSP 367
            ++           EPAV + CMY+ TPDE F++D        ++VIG GFSGHGFK++P
Sbjct: 308 SFVRDHLPDL--KPEPAVIESCMYTNTPDEQFILD--RHPKYDNIVIGAGFSGHGFKLAP 367

Query: 368 VVGRILSELALKGEAEGVELKYFKIARFEENPKGNV 400
           VVG+IL EL++K      +L  F+I+RF    K ++
Sbjct: 368 VVGKILYELSMK-LTPSYDLAPFRISRFPSLGKAHL 390

BLAST of ClCG05G026450 vs. Swiss-Prot
Match: SOX_RABIT (Peroxisomal sarcosine oxidase OS=Oryctolagus cuniculus GN=PIPOX PE=1 SV=1)

HSP 1 Score: 233.8 bits (595), Expect = 3.3e-60
Identity = 139/396 (35.10%), Positives = 213/396 (53.79%), Query Frame = 1

Query: 8   FDVIVVGAGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHDL 67
           +D IV+GAG+ G  T YHL K   R+L+LEQF   H RGSSHG+SR IR  Y ED+Y  +
Sbjct: 8   WDAIVIGAGIQGCFTVYHLVKHRKRILLLEQFFLPHSRGSSHGQSRIIRKAYLEDFYTRM 67

Query: 68  VMESYELWKMAEAEIGYRVYFPAEQLDIGPFDDKSLVAVVETCRKHSIPHLVLDRGQLAE 127
           + E Y++W   E E G +++     L +G  +++ L  +     +  + H  L   +L +
Sbjct: 68  MHECYQIWAQLEHEAGTQLHRQTGLLLLGMKENQELKTIQANLSRQRVEHQCLSSEELKQ 127

Query: 128 KYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQTLAYKNGAVLKDNAEVVEIKRDGSSGG 187
           ++   + +P   V +    GGVI   KA+   Q    + G +++D  +VVEI    + G 
Sbjct: 128 RFP-NIRLPRGEVGLLDNSGGVIYAYKALRALQDAIRQLGGIVRDGEKVVEI----NPGL 187

Query: 188 IVVSIANGESFRGKKCVVTVGAWARKLVKSVSGIELPIQPLEVAVSYWRIKEGAEAEYAI 247
           +V       S++ K  V+T G W  +L++ + GIE+P+Q L + V YWR  E     Y +
Sbjct: 188 LVTVKTTSRSYQAKSLVITAGPWTNQLLRPL-GIEMPLQTLRINVCYWR--EMVPGSYGV 247

Query: 248 GGGFPTLASYG--NPYIYGTPSLEFPGLIKVAIHGGHLCDPNKRTWGSG--EQLPIAVLK 307
              FP     G    +IYG P+ E+PGL+KV+ H G+  DP +R   +   +   + +L 
Sbjct: 248 SQAFPCFLWLGLCPHHIYGLPTGEYPGLMKVSYHHGNHADPEERDCPTARTDIGDVQILS 307

Query: 308 EWIEGRFGGRVDSSEPAVTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGFKMSP 367
            ++           EPAV + CMY+ TPDE F++D        ++VIG GFSGHGFK++P
Sbjct: 308 SFVRDHLPDL--KPEPAVIESCMYTNTPDEQFILD--RHPKYDNIVIGAGFSGHGFKLAP 367

Query: 368 VVGRILSELALKGEAEGVELKYFKIARFEENPKGNV 400
           VVG+IL EL++K      +L  F+I+RF    K ++
Sbjct: 368 VVGKILYELSMK-LTPSYDLAPFRISRFPSLGKAHL 390

BLAST of ClCG05G026450 vs. Swiss-Prot
Match: SOX_MOUSE (Peroxisomal sarcosine oxidase OS=Mus musculus GN=Pipox PE=1 SV=1)

HSP 1 Score: 231.5 bits (589), Expect = 1.7e-59
Identity = 140/397 (35.26%), Positives = 214/397 (53.90%), Query Frame = 1

Query: 8   FDVIVVGAGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHDL 67
           +D IV+GAG+ G  TAYHLAK    VL+LEQF   H RGSSHG+SR IR  YPED+Y  +
Sbjct: 8   WDAIVIGAGIQGCFTAYHLAKHSKSVLLLEQFFLPHSRGSSHGQSRIIRKAYPEDFYTMM 67

Query: 68  VMESYELWKMAEAEIGYRVYFPAEQLDIGPFDDKSLVAVVETCRKHSIPHLVLDRGQLAE 127
           + E Y+ W   E E G +++   E L +G  ++  L  +  T  +  I H  L    L +
Sbjct: 68  MKECYQTWAQLEREAGTQLHRQTELLLLGTKENPGLKTIQATLSRQGIDHEYLSSVDLKQ 127

Query: 128 KYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQTLAYKNGAVLKDNAEVVEIKRDGSSGG 187
           ++   +      V +  K GGV+   KA+   Q +  + G  + D  +VVEI+      G
Sbjct: 128 RFP-NIRFTRGEVGLLDKTGGVLYADKALRALQHIICQLGGTVCDGEKVVEIR-----PG 187

Query: 188 IVVSIANG-ESFRGKKCVVTVGAWARKLVKSVSGIELPIQPLEVAVSYWRIKEGAEAEYA 247
           + V++    +S++    V+T G W  +L+  + GIELP+Q L + V YWR  E     Y 
Sbjct: 188 LPVTVKTTLKSYQANSLVITAGPWTNRLLHPL-GIELPLQTLRINVCYWR--EKVPGSYG 247

Query: 248 IGGGFPTL--ASYGNPYIYGTPSLEFPGLIKVAIHGGHLCDPNKRTWGS--GEQLPIAVL 307
           +   FP +        +IYG P+ E+PGL+K+  H G   DP +R       +   + +L
Sbjct: 248 VSQAFPCILGLDLAPHHIYGLPASEYPGLMKICYHHGDNVDPEERDCPKTFSDIQDVQIL 307

Query: 308 KEWIEGRFGGRVDSSEPAVTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGFKMS 367
             ++     G    +EP + + CMY+ TPDE F++D    +++ ++VIG GFSGHGFK++
Sbjct: 308 CHFVRDHLPGL--RAEPDIMERCMYTNTPDEHFILD-CHPKYD-NIVIGAGFSGHGFKLA 367

Query: 368 PVVGRILSELALKGEAEGVELKYFKIARFEENPKGNV 400
           PVVG+IL EL++K      +L  F+++RF    K ++
Sbjct: 368 PVVGKILYELSMK-LPPSYDLAPFRMSRFSTLSKAHL 390

BLAST of ClCG05G026450 vs. TrEMBL
Match: A0A0A0LJG6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G881890 PE=4 SV=1)

HSP 1 Score: 733.4 bits (1892), Expect = 1.5e-208
Identity = 356/409 (87.04%), Positives = 379/409 (92.67%), Query Frame = 1

Query: 1   MVDSDTQFDVIVVGAGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYP 60
           M  SDT FDVIVVGAGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYP
Sbjct: 1   MAASDTLFDVIVVGAGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYP 60

Query: 61  EDYYHDLVMESYELWKMAEAEIGYRVYFPAEQLDIGPFDDKSLVAVVETCRKHSIPHLVL 120
           EDYY+ LVMESYELW+MAE EIGY+VYFP EQLDIG  DDKSL AVV+TCRKHSIPHLVL
Sbjct: 61  EDYYYGLVMESYELWRMAEEEIGYKVYFPTEQLDIGSPDDKSLTAVVDTCRKHSIPHLVL 120

Query: 121 DRGQLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQTLAYKNGAVLKDNAEVVEIK 180
           D G+L EKYSGRVEIPADWV VWSKYGGVIKPTKAVSM+QTLAYKNGAV+KDNAEVVEIK
Sbjct: 121 DSGELREKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMYQTLAYKNGAVMKDNAEVVEIK 180

Query: 181 RDGSSGGIVVSIANGESFRGKKCVVTVGAWARKLVKSVSGIELPIQPLEVAVSYWRIKEG 240
           RD S+G IVVSIANGESFRGKKCVVTVGAW++KLVKSV GIELPI+PLEV+VSYWRIKEG
Sbjct: 181 RDESNGRIVVSIANGESFRGKKCVVTVGAWSKKLVKSVGGIELPIRPLEVSVSYWRIKEG 240

Query: 241 AEAEYAIGGGFPTLASYGNPYIYGTPSLEFPGLIKVAIHGGHLCDPNKRTWGSGEQLPIA 300
            EAEYAIGGGFPT+ASYG PY+YGTPSLEFPGLIKVAIHGGH C+P+KR+WG G +LPIA
Sbjct: 241 FEAEYAIGGGFPTIASYGEPYVYGTPSLEFPGLIKVAIHGGHECNPDKRSWGKGGRLPIA 300

Query: 301 VLKEWIEGRFGGRVDSSEPAVTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGFK 360
            LKEWI+ +FGGRVDSS P  TQ CMYSMTPD DFVIDFLGGEFEKDVVIGGGFSGHGFK
Sbjct: 301 ALKEWIDEKFGGRVDSSGPVSTQSCMYSMTPDGDFVIDFLGGEFEKDVVIGGGFSGHGFK 360

Query: 361 MSPVVGRILSELALKGEAEGVELKYFKIARFEENPKGNVKSFADQVKLH 410
           MSP +GRIL+ELAL G AEGVELKYFK+ARFEENPKGNVKSFADQVKLH
Sbjct: 361 MSPTIGRILAELALDGAAEGVELKYFKLARFEENPKGNVKSFADQVKLH 409

BLAST of ClCG05G026450 vs. TrEMBL
Match: A0A0D2SRF3_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_010G121600 PE=4 SV=1)

HSP 1 Score: 599.4 bits (1544), Expect = 3.4e-168
Identity = 288/405 (71.11%), Positives = 340/405 (83.95%), Query Frame = 1

Query: 4   SDTQFDVIVVGAGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDY 63
           SD +FDVIVVGAGVMGSSTAY LAK G + L+LEQFDFLHHRGSSHGESRTIRATYPEDY
Sbjct: 4   SDNEFDVIVVGAGVMGSSTAYQLAKRGQKTLLLEQFDFLHHRGSSHGESRTIRATYPEDY 63

Query: 64  YHDLVMESYELWKMAEAEIGYRVYFPAEQLDIGPFDDKSLVAVVETCRKHSIPHLVLDRG 123
           Y+ LV ESY LW+ A++EIG++VYF A+Q D+GP D KSL++V+ TCRK+ IP+ VLD  
Sbjct: 64  YYGLVDESYRLWEQAQSEIGFKVYFKAQQFDMGPSDAKSLLSVISTCRKNGIPYQVLDHR 123

Query: 124 QLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQTLAYKNGAVLKDNAEVVEIKRDG 183
           Q+AE++SGR++IP DW+ V  + GG+IKPTKAVSMFQ LA+KNGA LKDN +VV I +DG
Sbjct: 124 QVAERFSGRIDIPEDWIGVSCELGGIIKPTKAVSMFQMLAFKNGACLKDNIKVVSINKDG 183

Query: 184 SSGGIVVSIANGESFRGKKCVVTVGAWARKLVKSVSGIELPIQPLEVAVSYWRIKEGAEA 243
              G+ V+ +NGE F GKKCVVTVG W R LVK V GIELPIQPLE  V YWRIK+G E 
Sbjct: 184 DR-GLKVAASNGEIFWGKKCVVTVGGWMRNLVKMVCGIELPIQPLETNVCYWRIKDGHEV 243

Query: 244 EYAIGGGFPTLASYGNPYIYGTPSLEFPGLIKVAIHGGHLCDPNKRTWGSGEQLPIAVLK 303
           EYAIG  FPT ASYG+PYIYGTPSLE+PGLIKVA+HGG+ C+P+KR WG G  L    LK
Sbjct: 244 EYAIGNDFPTFASYGHPYIYGTPSLEYPGLIKVAVHGGYQCNPDKRPWGPG--LVPDSLK 303

Query: 304 EWIEGRFGGRVDSSEPAVTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGFKMSP 363
           +W+E RF G+VDSS+PA+TQLC+YSMTPDEDFVIDFLGGEF KDVVIGGGFSGHGFKMSP
Sbjct: 304 QWVEQRFKGKVDSSKPAMTQLCVYSMTPDEDFVIDFLGGEFGKDVVIGGGFSGHGFKMSP 363

Query: 364 VVGRILSELALKGEAEGVELKYFKIARFEENPKGNVKSFADQVKL 409
            +GRIL++LAL GEA+GVELK F+IARFEENP+GN+K + DQV+L
Sbjct: 364 AIGRILADLALIGEAKGVELKQFRIARFEENPRGNIKEYEDQVEL 405

BLAST of ClCG05G026450 vs. TrEMBL
Match: A0A061GJC3_THECC (FAD-dependent oxidoreductase family protein OS=Theobroma cacao GN=TCM_036834 PE=4 SV=1)

HSP 1 Score: 595.5 bits (1534), Expect = 4.9e-167
Identity = 285/405 (70.37%), Positives = 338/405 (83.46%), Query Frame = 1

Query: 4   SDTQFDVIVVGAGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDY 63
           S  +FDVIVVGAGVMGSSTAY LAK G + L+LEQFDFLHHRGSSHGESRTIRATYPEDY
Sbjct: 4   SADEFDVIVVGAGVMGSSTAYQLAKRGQKTLLLEQFDFLHHRGSSHGESRTIRATYPEDY 63

Query: 64  YHDLVMESYELWKMAEAEIGYRVYFPAEQLDIGPFDDKSLVAVVETCRKHSIPHLVLDRG 123
           YHD+V ESY++W+ A++EIG+RVYF A  +D+GP D KSL+AV+ TC++ S+PH VLDR 
Sbjct: 64  YHDMVNESYQMWEQAQSEIGFRVYFKARHVDMGPADAKSLLAVISTCQRKSMPHQVLDRQ 123

Query: 124 QLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQTLAYKNGAVLKDNAEVVEIKRDG 183
           Q+ EK+SGR++IP  W+ V  ++GGVIKPTKAVSMFQ LA K+GA L DN EV  + RDG
Sbjct: 124 QVTEKFSGRIDIPEGWIGVSCEHGGVIKPTKAVSMFQMLALKHGAFLWDNTEVNGVTRDG 183

Query: 184 SSGGIVVSIANGESFRGKKCVVTVGAWARKLVKSVSGIELPIQPLEVAVSYWRIKEGAEA 243
             GG++VS +NG+ F GKKCVVT G+W RKLVK VSG+ELPIQPLE  V YWRIKEG EA
Sbjct: 184 VKGGVIVSTSNGDKFWGKKCVVTAGSWMRKLVKKVSGVELPIQPLETNVCYWRIKEGHEA 243

Query: 244 EYAIGGGFPTLASYGNPYIYGTPSLEFPGLIKVAIHGGHLCDPNKRTWGSGEQLPIAVLK 303
           +YAI   FPT ASYG PY+YGTPSLE+PGLIKVA+HGG+ CDP+KRTWG G  +P + LK
Sbjct: 244 KYAIESDFPTFASYGKPYMYGTPSLEYPGLIKVAVHGGYPCDPDKRTWGPG-VIP-SSLK 303

Query: 304 EWIEGRFGGRVDSSEPAVTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGFKMSP 363
           +WIE  F G VDSS PA TQLC+YSMTPDEDFV+DFLGGEF KDVVIGGGFSGHGFKM+P
Sbjct: 304 QWIEETFRGSVDSSGPAATQLCVYSMTPDEDFVLDFLGGEFGKDVVIGGGFSGHGFKMAP 363

Query: 364 VVGRILSELALKGEAEGVELKYFKIARFEENPKGNVKSFADQVKL 409
           V+GRIL++L L G+AEG+ELK+F+IARF+E+P GNVK F DQV L
Sbjct: 364 VIGRILADLVLTGKAEGIELKHFRIARFKEHPGGNVKDFEDQVGL 406

BLAST of ClCG05G026450 vs. TrEMBL
Match: M5XD12_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006663mg PE=4 SV=1)

HSP 1 Score: 594.0 bits (1530), Expect = 1.4e-166
Identity = 287/399 (71.93%), Positives = 332/399 (83.21%), Query Frame = 1

Query: 4   SDTQFDVIVVGAGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDY 63
           S  +FDVIVVGAG+MGSSTAY  AK G + L+LEQFDFLHHRGSSHGESRTIRATYPEDY
Sbjct: 4   SADEFDVIVVGAGIMGSSTAYQTAKRGQKTLLLEQFDFLHHRGSSHGESRTIRATYPEDY 63

Query: 64  YHDLVMESYELWKMAEAEIGYRVYFPAEQLDIGPFDDKSLVAVVETCRKHSIPHLVLDRG 123
           Y  LV++SY+LW+ AE+EIGY VYF A QLD+ P +DK L AVVE+CRK+ +P   ++R 
Sbjct: 64  YTPLVLQSYKLWQQAESEIGYNVYFKAHQLDMAPANDKVLHAVVESCRKNLVPFRFMNRD 123

Query: 124 QLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQTLAYKNGAVLKDNAEVVEIKRDG 183
           QL  ++SGR+ IP DWVAV +++GGVIKPTKAVSMFQTLA +NGAVL+DN  V  ++RDG
Sbjct: 124 QLDREFSGRIRIPEDWVAVATEHGGVIKPTKAVSMFQTLALQNGAVLRDNMGVKGVERDG 183

Query: 184 SSGGIVVSIANGESFRGKKCVVTVGAWARKLVKSVSGIELPIQPLEVAVSYWRIKEGAEA 243
             GG+ V   NGE F GKKCVVTVGAW  KLVK+V+GIELPI+PLE  V YWRIKEG E 
Sbjct: 184 VRGGVWVCTENGERFWGKKCVVTVGAWTTKLVKTVAGIELPIKPLETTVCYWRIKEGHEG 243

Query: 244 EYAIGGGFPTLASYGNPYIYGTPSLEFPGLIKVAIHGGHLCDPNKRTWGSGEQLPIAVLK 303
            +AIGG FPT ASYG+ YIYGTPSLE+PGLIKVA+HGG+ CDP+KR WG G   P+A LK
Sbjct: 244 GFAIGGDFPTFASYGDTYIYGTPSLEYPGLIKVAVHGGYPCDPDKRPWGPGN--PLAPLK 303

Query: 304 EWIEGRFGGRVDSSEPAVTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGFKMSP 363
           EWIEGRF G VDS  P  TQLCMYSMTPDEDFVIDFLGGEF KDVV+GGGFSGHGFK+SP
Sbjct: 304 EWIEGRFSGVVDSGGPVATQLCMYSMTPDEDFVIDFLGGEFGKDVVVGGGFSGHGFKLSP 363

Query: 364 VVGRILSELALKGEAEGVELKYFKIARFEENPKGNVKSF 403
           VVGRIL++LAL GEA+GVELK+F+IARF+ENPKGNVK F
Sbjct: 364 VVGRILADLALSGEAQGVELKHFRIARFQENPKGNVKDF 400

BLAST of ClCG05G026450 vs. TrEMBL
Match: B9ILE2_POPTR (Putative sarcosine oxidase family protein OS=Populus trichocarpa GN=POPTR_0018s03430g PE=4 SV=1)

HSP 1 Score: 585.9 bits (1509), Expect = 3.9e-164
Identity = 278/403 (68.98%), Positives = 331/403 (82.13%), Query Frame = 1

Query: 4   SDTQFDVIVVGAGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDY 63
           S   FDVIVVGAG+MGSSTAY LAK G + L+LEQFDFLHHRGSSHGESRTIRATYPEDY
Sbjct: 4   SSHHFDVIVVGAGIMGSSTAYQLAKRGQKTLLLEQFDFLHHRGSSHGESRTIRATYPEDY 63

Query: 64  YHDLVMESYELWKMAEAEIGYRVYFPAEQLDIGPFDDKSLVAVVETCRKHSIPHLVLDRG 123
           Y D+VMES + W+ A++EIGY+VYF A+Q D+GP D+KSL++V+ +C + S+PH VLD  
Sbjct: 64  YCDMVMESSQSWEQAQSEIGYKVYFKAQQFDMGPSDNKSLLSVISSCERKSLPHQVLDGQ 123

Query: 124 QLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQTLAYKNGAVLKDNAEVVEIKRDG 183
           Q+A+++SGR+ IP  WV V ++ GGVIKPTKAVSMFQ LA++ GAVL+DN EV  I +D 
Sbjct: 124 QVADRFSGRINIPESWVGVLTEVGGVIKPTKAVSMFQALAFQKGAVLRDNMEVKNIVKDE 183

Query: 184 SSGGIVVSIANGESFRGKKCVVTVGAWARKLVKSVSGIELPIQPLEVAVSYWRIKEGAEA 243
           + GG+ V +ANGE + GKKCVVT GAW  KLVK+VSG+ELPIQ LE  V YWRIKEG EA
Sbjct: 184 ARGGVNVVVANGEEYWGKKCVVTAGAWMGKLVKTVSGLELPIQALETTVCYWRIKEGHEA 243

Query: 244 EYAIGGGFPTLASYGNPYIYGTPSLEFPGLIKVAIHGGHLCDPNKRTWGSGEQLPIAVLK 303
           ++AIG  FPT ASYG PYIYGTPSLEFPGLIK+A+HGG+ CDP+KR WG G  +    +K
Sbjct: 244 KFAIGSDFPTFASYGEPYIYGTPSLEFPGLIKIAVHGGYTCDPDKRPWGPG--ISSDSMK 303

Query: 304 EWIEGRFGGRVDSSEPAVTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGFKMSP 363
           EWIEGRF G VD   P  TQLCMYSMTPD DFVIDFLGGEF KDVV+GGGFSGHGFKM+P
Sbjct: 304 EWIEGRFSGLVDYGGPVATQLCMYSMTPDGDFVIDFLGGEFGKDVVVGGGFSGHGFKMAP 363

Query: 364 VVGRILSELALKGEAEGVELKYFKIARFEENPKGNVKSFADQV 407
           VVGRIL++LAL GEA+GV+LK+F+I RF+ENPKGNVK + DQV
Sbjct: 364 VVGRILADLALSGEAKGVDLKHFRIQRFQENPKGNVKDYEDQV 404

BLAST of ClCG05G026450 vs. TAIR10
Match: AT2G24580.1 (AT2G24580.1 FAD-dependent oxidoreductase family protein)

HSP 1 Score: 559.3 bits (1440), Expect = 2.0e-159
Identity = 264/407 (64.86%), Positives = 322/407 (79.12%), Query Frame = 1

Query: 5   DTQFDVIVVGAGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYY 64
           D +FDVIVVGAGVMGSS AY LAK G + L+LEQFDFLHHRGSSHGESRTIRATYPEDYY
Sbjct: 6   DGRFDVIVVGAGVMGSSAAYQLAKRGQKTLLLEQFDFLHHRGSSHGESRTIRATYPEDYY 65

Query: 65  HDLVMESYELWKMAEAEIGYRVYFPAEQLDIGPFDDKSLVAVVETCRKHSIPHLVLDRGQ 124
           + +V ES  LW  A++EIGY+V+FP +Q D+GP D +SL++VV TC+KH + H V+D   
Sbjct: 66  YSMVSESTRLWAAAQSEIGYKVHFPTQQFDMGPADQQSLLSVVATCQKHGLAHRVMDSHA 125

Query: 125 LAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQTLAYKNGAVLKDNAEVVEIKRDGS 184
           ++E +SGR+ IP +W+ V ++ GG+IKPTKAVSMFQTLA  +GA+L+DN +V  IKRDG 
Sbjct: 126 VSEHFSGRISIPENWIGVSTELGGIIKPTKAVSMFQTLAIGHGAILRDNTKVANIKRDGE 185

Query: 185 SG-GIVVSIANGESFRGKKCVVTVGAWARKLVKSVSGIELPIQPLEVAVSYWRIKEGAEA 244
           SG G++V    G+ F GKKC+VT GAW  KLVK+V+GI+ P++PLE  V YWRIKEG E 
Sbjct: 186 SGEGVIVCTVKGDKFYGKKCIVTAGAWISKLVKTVAGIDFPVEPLETTVCYWRIKEGHEE 245

Query: 245 EYAIGGGFPTLASYGNPYIYGTPSLEFPGLIKVAIHGGHLCDPNKRTWGSGEQLPIAVLK 304
           ++ I G FPT ASYG PY+YGTPSLE+PGLIKVA+HGG+ CDP+KR WG G +L    LK
Sbjct: 246 KFTIDGEFPTFASYGAPYVYGTPSLEYPGLIKVAVHGGYWCDPDKRPWGPGVKL--EELK 305

Query: 305 EWIEGRFGGRVDSSEPAVTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGFKMSP 364
           EWI+ RFGG VDS  P  TQLCMYSMTPDEDFVIDFLGGEF +DVV+GGGFSGHGFKM+P
Sbjct: 306 EWIKERFGGMVDSEGPVATQLCMYSMTPDEDFVIDFLGGEFGRDVVVGGGFSGHGFKMAP 365

Query: 365 VVGRILSELALKGEA--EGVELKYFKIARFEENPKGNVKSFADQVKL 409
            VGRIL+++A++ EA   GVE+K F + RFE+NPKGN K + DQV L
Sbjct: 366 AVGRILADMAMEVEAGGGGVEMKQFSLRRFEDNPKGNAKEYPDQVIL 410

BLAST of ClCG05G026450 vs. NCBI nr
Match: gi|449437334|ref|XP_004136447.1| (PREDICTED: probable sarcosine oxidase [Cucumis sativus])

HSP 1 Score: 733.4 bits (1892), Expect = 2.2e-208
Identity = 356/409 (87.04%), Positives = 379/409 (92.67%), Query Frame = 1

Query: 1   MVDSDTQFDVIVVGAGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYP 60
           M  SDT FDVIVVGAGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYP
Sbjct: 1   MAASDTLFDVIVVGAGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYP 60

Query: 61  EDYYHDLVMESYELWKMAEAEIGYRVYFPAEQLDIGPFDDKSLVAVVETCRKHSIPHLVL 120
           EDYY+ LVMESYELW+MAE EIGY+VYFP EQLDIG  DDKSL AVV+TCRKHSIPHLVL
Sbjct: 61  EDYYYGLVMESYELWRMAEEEIGYKVYFPTEQLDIGSPDDKSLTAVVDTCRKHSIPHLVL 120

Query: 121 DRGQLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQTLAYKNGAVLKDNAEVVEIK 180
           D G+L EKYSGRVEIPADWV VWSKYGGVIKPTKAVSM+QTLAYKNGAV+KDNAEVVEIK
Sbjct: 121 DSGELREKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMYQTLAYKNGAVMKDNAEVVEIK 180

Query: 181 RDGSSGGIVVSIANGESFRGKKCVVTVGAWARKLVKSVSGIELPIQPLEVAVSYWRIKEG 240
           RD S+G IVVSIANGESFRGKKCVVTVGAW++KLVKSV GIELPI+PLEV+VSYWRIKEG
Sbjct: 181 RDESNGRIVVSIANGESFRGKKCVVTVGAWSKKLVKSVGGIELPIRPLEVSVSYWRIKEG 240

Query: 241 AEAEYAIGGGFPTLASYGNPYIYGTPSLEFPGLIKVAIHGGHLCDPNKRTWGSGEQLPIA 300
            EAEYAIGGGFPT+ASYG PY+YGTPSLEFPGLIKVAIHGGH C+P+KR+WG G +LPIA
Sbjct: 241 FEAEYAIGGGFPTIASYGEPYVYGTPSLEFPGLIKVAIHGGHECNPDKRSWGKGGRLPIA 300

Query: 301 VLKEWIEGRFGGRVDSSEPAVTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGFK 360
            LKEWI+ +FGGRVDSS P  TQ CMYSMTPD DFVIDFLGGEFEKDVVIGGGFSGHGFK
Sbjct: 301 ALKEWIDEKFGGRVDSSGPVSTQSCMYSMTPDGDFVIDFLGGEFEKDVVIGGGFSGHGFK 360

Query: 361 MSPVVGRILSELALKGEAEGVELKYFKIARFEENPKGNVKSFADQVKLH 410
           MSP +GRIL+ELAL G AEGVELKYFK+ARFEENPKGNVKSFADQVKLH
Sbjct: 361 MSPTIGRILAELALDGAAEGVELKYFKLARFEENPKGNVKSFADQVKLH 409

BLAST of ClCG05G026450 vs. NCBI nr
Match: gi|659132624|ref|XP_008466296.1| (PREDICTED: probable sarcosine oxidase [Cucumis melo])

HSP 1 Score: 724.5 bits (1869), Expect = 1.0e-205
Identity = 352/409 (86.06%), Positives = 379/409 (92.67%), Query Frame = 1

Query: 1   MVDSDTQFDVIVVGAGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYP 60
           M  SDTQFDVIVVGAGVMGSSTAYHLAKTGNRVL+LEQFDFLHHRGSSHGESRTIRATYP
Sbjct: 1   MAASDTQFDVIVVGAGVMGSSTAYHLAKTGNRVLLLEQFDFLHHRGSSHGESRTIRATYP 60

Query: 61  EDYYHDLVMESYELWKMAEAEIGYRVYFPAEQLDIGPFDDKSLVAVVETCRKHSIPHLVL 120
           EDYYHDLVMESYELW+MAEAEIG++VY+PAEQLDIGP + +SL AVV+TC KHSIPHLVL
Sbjct: 61  EDYYHDLVMESYELWRMAEAEIGFKVYYPAEQLDIGPSNSESLTAVVDTCLKHSIPHLVL 120

Query: 121 DRGQLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQTLAYKNGAVLKDNAEVVEIK 180
           DRG+L EKYSGRVEIPA+WVAV SKYGGVIKPTKAVSMFQTLAYKNG VLKDNAEVVEIK
Sbjct: 121 DRGELMEKYSGRVEIPANWVAVCSKYGGVIKPTKAVSMFQTLAYKNGTVLKDNAEVVEIK 180

Query: 181 RDGSSGGIVVSIANGESFRGKKCVVTVGAWARKLVKSVSGIELPIQPLEVAVSYWRIKEG 240
           RD S+G IVVS ANGESF GKKCVVTVGAW++KLVKSV GIELPI PLEV+VSYWRIKEG
Sbjct: 181 RDESNGRIVVSTANGESFHGKKCVVTVGAWSKKLVKSVGGIELPILPLEVSVSYWRIKEG 240

Query: 241 AEAEYAIGGGFPTLASYGNPYIYGTPSLEFPGLIKVAIHGGHLCDPNKRTWGSGEQLPIA 300
            EAEYAI GGFPT+ASYG PY+YGTPSLEFPGLIKVAIH G+ C+P+KR+WG   +LPIA
Sbjct: 241 FEAEYAIEGGFPTMASYGEPYVYGTPSLEFPGLIKVAIHAGYSCNPDKRSWGREGRLPIA 300

Query: 301 VLKEWIEGRFGGRVDSSEPAVTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGFK 360
            LKEWI+ +FGGRVDSS P  +QLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGFK
Sbjct: 301 ALKEWIDEKFGGRVDSSRPVSSQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGFK 360

Query: 361 MSPVVGRILSELALKGEAEGVELKYFKIARFEENPKGNVKSFADQVKLH 410
           MSP +GRIL+ELALKG AEGVELKYFK+ARFEENPKGNVKSFADQVKLH
Sbjct: 361 MSPAIGRILAELALKGAAEGVELKYFKLARFEENPKGNVKSFADQVKLH 409

BLAST of ClCG05G026450 vs. NCBI nr
Match: gi|1009156308|ref|XP_015896180.1| (PREDICTED: probable sarcosine oxidase [Ziziphus jujuba])

HSP 1 Score: 613.2 bits (1580), Expect = 3.2e-172
Identity = 287/403 (71.22%), Positives = 339/403 (84.12%), Query Frame = 1

Query: 4   SDTQFDVIVVGAGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDY 63
           S   FDVIVVGAG+MGSSTAY  +K G++ L+LEQFDFLHHRGSSHGESRTIR TYP+DY
Sbjct: 5   SGEDFDVIVVGAGIMGSSTAYQTSKRGHKTLLLEQFDFLHHRGSSHGESRTIRPTYPQDY 64

Query: 64  YHDLVMESYELWKMAEAEIGYRVYFPAEQLDIGPFDDKSLVAVVETCRKHSIPHLVLDRG 123
           Y  +V+ESY LW+ AE+EIGY+V F A Q D+GP D  +  A++ +CRK+S+PH VLD+ 
Sbjct: 65  YCSMVLESYTLWEQAESEIGYKVSFKASQFDMGPADATNFKALISSCRKNSLPHQVLDKA 124

Query: 124 QLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQTLAYKNGAVLKDNAEVVEIKRDG 183
           Q+A+K+SGR+ IP DWV V ++YGGVIKPTKAVSMFQTLA KNGAVL+DN EV +IKRDG
Sbjct: 125 QVAQKFSGRIHIPEDWVGVSTEYGGVIKPTKAVSMFQTLALKNGAVLRDNMEVKDIKRDG 184

Query: 184 SSGGIVVSIANGESFRGKKCVVTVGAWARKLVKSVSGIELPIQPLEVAVSYWRIKEGAEA 243
             GG+VV  +NGE FRGKKCVVTVGAW +KLVK+V G+E+PIQPLE  V YWRI EG E 
Sbjct: 185 EGGGLVVFTSNGEKFRGKKCVVTVGAWMKKLVKTVIGVEIPIQPLETTVCYWRINEGHET 244

Query: 244 EYAIGGGFPTLASYGNPYIYGTPSLEFPGLIKVAIHGGHLCDPNKRTWGSGEQLPIAVLK 303
           +YAIGG FPT ASYG PY+YGTPSLEFPGLIKVA+HGG+ CDP+KR WG G  + +A LK
Sbjct: 245 DYAIGGDFPTFASYGEPYVYGTPSLEFPGLIKVAVHGGYACDPDKRPWGPG--ISLAALK 304

Query: 304 EWIEGRFGGRVDSSEPAVTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGFKMSP 363
           EWI+GRF G VDS+ P  TQLCMYSMTPDEDFV+DFLGGEF KD+VIGGGFSGHGFKM+P
Sbjct: 305 EWIQGRFSGLVDSAGPVATQLCMYSMTPDEDFVMDFLGGEFGKDLVIGGGFSGHGFKMAP 364

Query: 364 VVGRILSELALKGEAEGVELKYFKIARFEENPKGNVKSFADQV 407
           VVGRIL++L L GEAEGV+LK+F++ARF ENPKGNVK+F DQV
Sbjct: 365 VVGRILADLVLSGEAEGVDLKHFRVARFNENPKGNVKAFEDQV 405

BLAST of ClCG05G026450 vs. NCBI nr
Match: gi|1009172411|ref|XP_015867257.1| (PREDICTED: probable sarcosine oxidase [Ziziphus jujuba])

HSP 1 Score: 610.5 bits (1573), Expect = 2.1e-171
Identity = 286/403 (70.97%), Positives = 338/403 (83.87%), Query Frame = 1

Query: 4   SDTQFDVIVVGAGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDY 63
           S   FDVIVVGAG+MGSSTAY  +K G++ L+LEQFDFLHHRGSSHGESRTIR TYP+DY
Sbjct: 5   SGEDFDVIVVGAGIMGSSTAYQTSKRGHKTLLLEQFDFLHHRGSSHGESRTIRPTYPQDY 64

Query: 64  YHDLVMESYELWKMAEAEIGYRVYFPAEQLDIGPFDDKSLVAVVETCRKHSIPHLVLDRG 123
           Y  +V+ESY LW+ AE+EIGY+V F A Q D+GP D  +  A++ +CRK+S+PH VLD+ 
Sbjct: 65  YCSMVLESYTLWEQAESEIGYKVSFKASQFDMGPADATNFKALISSCRKNSLPHQVLDKA 124

Query: 124 QLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQTLAYKNGAVLKDNAEVVEIKRDG 183
           Q+A+K+SGR+ IP DWV V ++YGG IKPTKAVSMFQTLA KNGAVL+DN EV +IKRDG
Sbjct: 125 QVAQKFSGRIHIPEDWVGVSTEYGGGIKPTKAVSMFQTLALKNGAVLRDNMEVKDIKRDG 184

Query: 184 SSGGIVVSIANGESFRGKKCVVTVGAWARKLVKSVSGIELPIQPLEVAVSYWRIKEGAEA 243
             GG+VV  +NGE FRGKKCVVTVGAW +KLVK+V G+E+PIQPLE  V YWRI EG E 
Sbjct: 185 EGGGLVVFTSNGEKFRGKKCVVTVGAWMKKLVKTVIGVEIPIQPLETTVCYWRINEGHET 244

Query: 244 EYAIGGGFPTLASYGNPYIYGTPSLEFPGLIKVAIHGGHLCDPNKRTWGSGEQLPIAVLK 303
           +YAIGG FPT ASYG PY+YGTPSLEFPGLIKVA+HGG+ CDP+KR WG G  + +A LK
Sbjct: 245 DYAIGGDFPTFASYGEPYVYGTPSLEFPGLIKVAVHGGYACDPDKRPWGPG--ISLAALK 304

Query: 304 EWIEGRFGGRVDSSEPAVTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGFKMSP 363
           EWI+GRF G VDS+ P  TQLCMYSMTPDEDFV+DFLGGEF KD+VIGGGFSGHGFKM+P
Sbjct: 305 EWIQGRFSGLVDSAGPVATQLCMYSMTPDEDFVMDFLGGEFGKDLVIGGGFSGHGFKMAP 364

Query: 364 VVGRILSELALKGEAEGVELKYFKIARFEENPKGNVKSFADQV 407
           VVGRIL++L L GEAEGV+LK+F++ARF ENPKGNVK+F DQV
Sbjct: 365 VVGRILADLVLSGEAEGVDLKHFRVARFNENPKGNVKAFEDQV 405

BLAST of ClCG05G026450 vs. NCBI nr
Match: gi|694324599|ref|XP_009353308.1| (PREDICTED: probable sarcosine oxidase [Pyrus x bretschneideri])

HSP 1 Score: 601.3 bits (1549), Expect = 1.3e-168
Identity = 293/399 (73.43%), Positives = 334/399 (83.71%), Query Frame = 1

Query: 4   SDTQFDVIVVGAGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDY 63
           S  +FDVIVVGAGVMGSSTAY  AK G++ L+LEQFDFLHHRGSSHGESRTIRATYPEDY
Sbjct: 4   SGDEFDVIVVGAGVMGSSTAYQTAKRGHKTLLLEQFDFLHHRGSSHGESRTIRATYPEDY 63

Query: 64  YHDLVMESYELWKMAEAEIGYRVYFPAEQLDIGPFDDKSLVAVVETCRKHSIPHLVLDRG 123
           Y  LV+ESY+LW+ AE+EIGY VYF A QLD+   +DK L+AVVE+CRK+S+   V++R 
Sbjct: 64  YTPLVLESYKLWQQAESEIGYNVYFKATQLDMALANDKLLLAVVESCRKNSVAFSVMNRD 123

Query: 124 QLAEKYSGRVEIPADWVAVWSKYGGVIKPTKAVSMFQTLAYKNGAVLKDNAEVVEIKRDG 183
           QL +++SGRV IP DWV V +++GGVIKPTKAVSMFQTLA +NGAVL+DN EV  ++RDG
Sbjct: 124 QLHQEFSGRVMIPEDWVGVVTEHGGVIKPTKAVSMFQTLALQNGAVLRDNMEVKGVERDG 183

Query: 184 SSGGIVVSIANGESFRGKKCVVTVGAWARKLVKSVSGIELPIQPLEVAVSYWRIKEGAEA 243
             GG+ VS A GE F GKKCVVTVGAW  KLVK+V GIELPIQPLE  V YWRIKEG E 
Sbjct: 184 VRGGVWVSTAKGERFWGKKCVVTVGAWTTKLVKTVGGIELPIQPLETTVCYWRIKEGHEG 243

Query: 244 EYAIGGGFPTLASYGNPYIYGTPSLEFPGLIKVAIHGGHLCDPNKRTWGSGEQLPIAVLK 303
            +AIGG FPT ASYGNPYIYGTPSLE+PGLIKVA+HGG+ CDP+KR WG G   P+A LK
Sbjct: 244 AFAIGGDFPTFASYGNPYIYGTPSLEYPGLIKVAVHGGYPCDPDKRPWGPGN--PLAPLK 303

Query: 304 EWIEGRFGGRVDSSEPAVTQLCMYSMTPDEDFVIDFLGGEFEKDVVIGGGFSGHGFKMSP 363
           EWIEG F G VDS  P  TQLCMYSMTPDEDFVIDFLGGEF KDVV+GGGFSGHGFKMSP
Sbjct: 304 EWIEGMFSGVVDSGGPVATQLCMYSMTPDEDFVIDFLGGEFGKDVVVGGGFSGHGFKMSP 363

Query: 364 VVGRILSELALKGEAEGVELKYFKIARFEENPKGNVKSF 403
           VVGRIL++LAL GEAEGVELK+F++ARF+ENPKGN K F
Sbjct: 364 VVGRILADLALTGEAEGVELKHFRMARFQENPKGNAKDF 400
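
The percentages in the HSP header above are the identical and positive-scoring position counts divided by the alignment length (399 columns for this HSP). A minimal Python sketch of that arithmetic follows; `percent_of_alignment` is a hypothetical helper name introduced for illustration and is not part of any BLAST tool.

```python
def percent_of_alignment(count: int, alignment_length: int) -> float:
    """Return count as a percentage of the alignment length, rounded to 2 decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    return round(100.0 * count / alignment_length, 2)


# Counts copied from the HSP header of the Pyrus x bretschneideri match.
identity = percent_of_alignment(293, 399)
positives = percent_of_alignment(334, 399)
print(f"Identity = {identity}%, Positives = {positives}%")
# prints: Identity = 73.43%, Positives = 83.71%
```

The same function applies to any HSP header once the two counts and the alignment length are read from it.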

The following BLAST results are available for this feature:
Match Name         E-value   Identity (%)  Description
SOX_ARATH          3.5e-158  64.86         Probable sarcosine oxidase OS=Arabidopsis thaliana GN=At2g24580 PE=2 SV=1
SOX_BOVIN          4.7e-62   36.43         Peroxisomal sarcosine oxidase OS=Bos taurus GN=PIPOX PE=2 SV=2
SOX_HUMAN          4.0e-61   35.61         Peroxisomal sarcosine oxidase OS=Homo sapiens GN=PIPOX PE=1 SV=2
SOX_RABIT          3.3e-60   35.10         Peroxisomal sarcosine oxidase OS=Oryctolagus cuniculus GN=PIPOX PE=1 SV=1
SOX_MOUSE          1.7e-59   35.26         Peroxisomal sarcosine oxidase OS=Mus musculus GN=Pipox PE=1 SV=1

Match Name         E-value   Identity (%)  Description
A0A0A0LJG6_CUCSA   1.5e-208  87.04         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G881890 PE=4 SV=1
A0A0D2SRF3_GOSRA   3.4e-168  71.11         Uncharacterized protein OS=Gossypium raimondii GN=B456_010G121600 PE=4 SV=1
A0A061GJC3_THECC   4.9e-167  70.37         FAD-dependent oxidoreductase family protein OS=Theobroma cacao GN=TCM_036834 PE=...
M5XD12_PRUPE       1.4e-166  71.93         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006663mg PE=4 SV=1
B9ILE2_POPTR       3.9e-164  68.98         Putative sarcosine oxidase family protein OS=Populus trichocarpa GN=POPTR_0018s0...

Match Name         E-value   Identity (%)  Description
AT2G24580.1        2.0e-159  64.86         FAD-dependent oxidoreductase family protein

Match Name                         E-value   Identity (%)  Description
gi|449437334|ref|XP_004136447.1|   2.2e-208  87.04         PREDICTED: probable sarcosine oxidase [Cucumis sativus]
gi|659132624|ref|XP_008466296.1|   1.0e-205  86.06         PREDICTED: probable sarcosine oxidase [Cucumis melo]
gi|1009156308|ref|XP_015896180.1|  3.2e-172  71.22         PREDICTED: probable sarcosine oxidase [Ziziphus jujuba]
gi|1009172411|ref|XP_015867257.1|  2.1e-171  70.97         PREDICTED: probable sarcosine oxidase [Ziziphus jujuba]
gi|694324599|ref|XP_009353308.1|   1.3e-168  73.43         PREDICTED: probable sarcosine oxidase [Pyrus x bretschneideri]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR006076   FAD-dep_OxRdtase
IPR006281   SoxA_mon
IPR023753   FAD/NAD-binding_dom

Vocabulary: Molecular Function
Term        Definition
GO:0016491  oxidoreductase activity
GO:0008115  sarcosine oxidase activity

Vocabulary: Biological Process
Term        Definition
GO:0055114  oxidation-reduction process
GO:0046653  tetrahydrofolate metabolic process

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006544      glycine metabolic process
biological_process  GO:0006563      L-serine metabolic process
biological_process  GO:0055114      oxidation-reduction process
biological_process  GO:0046653      tetrahydrofolate metabolic process
biological_process  GO:0006566      threonine metabolic process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0008115      sarcosine oxidase activity
molecular_function  GO:0016491      oxidoreductase activity

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG05G026450.1  ClCG05G026450.1  mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term   IPR Description               Source    Source Term       Source Description                       Alignment
IPR006076  FAD dependent oxidoreductase  PFAM      PF01266           DAO                                      coord: 9..372, score: 2.2
IPR006281  Sarcosine oxidase, monomeric  TIGRFAMs  TIGR01377         TIGR01377                                coord: 8..397, score: 3.4E
IPR023753  FAD/NAD(P)-binding domain     GENE3D    G3DSA:3.50.50.60  -                                        coord: 6..62, score: 1.9E-56; coord: 152..216, score: 1.9E-56; coord: 334..395, score: 1.9
IPR023753  FAD/NAD(P)-binding domain     unknown   SSF51905          FAD/NAD(P)-binding domain                coord: 7..225, score: 3.11E-44; coord: 331..392, score: 3.11
None       No IPR available              GENE3D    G3DSA:3.30.9.10   -                                        coord: 63..151, score: 1.3E-34; coord: 223..331, score: 1.3
None       No IPR available              PANTHER   PTHR10961         PEROXISOMAL SARCOSINE OXIDASE            coord: 1..406, score: 7.4E
None       No IPR available              PANTHER   PTHR10961:SF7     PEROXISOMAL SARCOSINE OXIDASE            coord: 1..406, score: 7.4E
None       No IPR available              unknown   SSF54373          FAD-linked reductases, C-terminal domain  coord: 227..331, score: 4.32

The following gene(s) are paralogous to this gene:

None