Cla021877 (gene) Watermelon (97103) v1

NameCla021877
Typegene
OrganismCitrullus. lanatus (Watermelon (97103) v1)
DescriptionStrictosidine synthase family protein (AHRD V1 **-- Q58L86_BRANA); contains Interpro domain(s) IPR004141 Strictosidine synthase
LocationChr5 : 7189113 .. 7192612 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGACCCATTTGAGATGAGCCCTGTGGGGGGCCATGATTTCAGGCCCGTGAAGCATGACATTGCACCTTACAGCCAGGTTATGGGCCGTTGGCCCAAGGACAATGAGAGCCGTTTGGGCCTTGGAAATTTGGAATTTGAAGATGAGGTTTTTGGGCCTGAGTCTTTGGAGTTTGATGCATTGGGACGTGGGCCTTACACAGGGCTTGCTGATGGCCGCATTGTTAGATGGATGGGGGAGGAAGTTGGGTGGGAGACTTTTGCAATTGTCACAACAAATTGGTAGAATTTTAGAATATAGTTTTTTCTTTTAAATTCTAAAATTCAATTTATCATATATCCCTTATTTTTTTCAAATAAATTGTTATTGCCATTTAAAATTGAACCCTAGACTTACAAATTGTTGCAATGCTTATTAATTTCTACTAGATACACCCCTAATTTTAACTAAAAAAAAAAAAAAGAAAAAATCTTTCATAACTCTCTAATTTCAACCAAATTAAAGTTTGGCACAAATAATTAACCTTGATTAAAGTAAAAGATCACCCATTAAAAAGTAAAATAAAACCTATAGATTTTTGTATAATAATAGTAACATTTTTTTTCGAAGTTTCATTAATTTTGTACTAAACATATTCTTTTAACAAAGATTTATCAGACCAAAGTTGAAACAATTTATTGTAATTGAGTGTAGAATTTAATTAAATTAAAAGTCTAGGGTAAAAAAAACCTAATGCCAAACAATTGAAGTTCAAATAAACTAGGAAAATTATCTCAAAAAGCAAAACTACTGAAAATATTTACAACTAATAGCAAAATACACAGTTTATCTCAATCTGTCGCAGATAGACAGTGAAATTTTGTTATATTTGTAAATATTTTGGTTACTTTTTCTATATTTGAAAAGAGTCCAATAAACTATTTTTATTTTATTATTATTATTTTATTTTTAATGAAATTGAAATTCAGGTCAGAGAAGGTATGTGCACAAGGGGTTGATTCAACCACAGCAAAACAATGGAAACATGAGAAGAAATGTGGAAGGCCACTTGGGCTAAGATTTGAGAAAGAAAGTGGGAATTTGTACATTGCTGATGCATATTATGGGCTTCTTGTGGTTGGCCCTCAAGGAGGAATTGCCACTCCCTTAGCCACTCATGTTGAAGGAACTCCCATTCTCTTTGCTAATGATCTTGACATTCACAACAATGGCTCCATCTTCTTCACTGACACTAGCAAGAGATATAATAGAGTGTAAGTATTCATTTACTAATCAAGTGGTGATATCATCAAATTTCTATAACTAATTAACCCCATCCACTTAATCAACTTTTAATCAACTTTTGGATATTTATTCATCATTTAACACGATATCATAGACATAGACGCAACATAGATCTTAAATTAATTCATTTATTGTAAATTATGGTTTTGAAGTTATAGTTTAGGTCATATGGTTTATATAACTAGAATTTAGTACCCCTATGGTTTGATAAAATCCTCTTTGACAAAATTATATTATGTGAGACATATTTATAAAACTATAGGGATTATTTATGAGGTTTTATCAAACTTTAGGGACTACATTCTAATTTTTTCCAAATCATGAGAACCAATTTTGTAATTTCATCTTGAATATCTCAATATTGTCAAAGTATTGTTTTAATCATATTTATGATCTTAATTTGTCAGTTTAAGCTTTTTGGTTAATTAGAAACGCAAAAGATCTTATATACAAACTGATTTATAATGTCATTTCTTTCTCAATCAATTAATTTATGGTAACCCATACGACGATTATGTTTCGTTGTTGTTATAGGAATCATTTCTTCATATTATTGGAAGGAGAAGCTTCAGGTAGGCTTCTTAGATATGATCCTTCAACTAAAACAACTCATGTTGTGTTGAATGGATTGGCCTTTCCAAATGGGGTGCAATTGTCTAAAGACCATACCTTCCTTCTTTATACAGAAACTACCAATTGCAGGTATTTTAAATGCTATTAGAGAGAAAATTTCTAATACTCCTTCCATTTAGTGCTTGCCACATAAGAAACTTTTTAAGAATTATATGAGATCGATAATAAATATTTAATAAATTAGTAAATCATTTAATGATGTGTCAAATAATCAATATAGGTAGTGTGTACTATTGTAACACTAGAAATTTCTCATTATTGGATGATATTTCATATATCAATACTCATTATTTCACTTGTGCATTAATTTACATCTTCATTGGTTGTTAATTAGTTGTGTTATTATATTCTCACTTTTATTCATTTTAGTCTTAGTATTAAATTTTCGTTGATATTTCTACATTTTTGTGGGGACATCGACATAGGTGAATTGACACATTTCCATTTTTGTACAAAATTTTCATGAAACATAAAAAATAAACCCTAAAATGTCACATATCACTAATGTGAACATAATATAAAGATATAATTAACATTTTATCTAAGTTAAAGGAGTAAAATGTTTATTTATTAATTTACACTTTAAAATTTTCTTTTTATAACTTTTTTAAAAATATCCATCGACATCAACATTTATCGACATTTCCATAAAATTGAGACTTCAACATCGAAATTTTGACCATTGCCTTACTTTCAAATAGCGACTATCTTGGTTCTTGTTTTATACAATATTCTAACACAAAGTCTATACACACAATGAAAACCCTTGAATACGCGGCTACATTTCAAAAAAAAAAAAAAAAAACTATTGAGTATAGATTTGTGTCCAATTTGTGCTTGGAAGATAAAAATATTCTAGGACCAAAAGGCATAACGGTTTAAAAGTACACGAAAACCAAAAAAACAAAAGTACAAAAACAAAAACAAACATTACATTTTAAACTTTACATGTTTTTTTTAATTTTTATTTTTATGTACAACAATTGTAAGTGATGTATTGAACTTTCAACCTCTAAGTACAGAGGACATGCGAATACACCATTGAGCCAATTTGACCTCTTGAACCTTATATGTTGATACTCCAATGATGTAGATGATATATTTTCATATATATTAACAGCATTGTGATTATACACACATACAGATTAATGAAGCTGTGGCTAGAAGGTGCAAAGAATGGAAAAGTTGAAGTAGTAGCCAATCTTCCAGGGTTTCCAGACAATGTAAGAAGGAATGAAAGGGATGAATATTGGGTAGCCATTGATTGTTGTAGAACTAAAACACAAGAAGTTTTAACTCACAATCCATGGATAAGAAACATTTATTTTAGGTTACCATTCAGAATGAGTTTGTTGGCCAGATTAATGGGGATGAAAATGTACACTGTGATCTCACTTTTTAGTGAAAATGGAGAGATTTTAGAAGTTCTTGAAGATCAAAAGGGTGAAGTTATGAAGCTAATGAGTGAAGTTAGAGAAGTGAAAGGAAAGCTTTGGATTGGAACTGTGGCTCATAATCATATTGCTACATTGACTTACCCTTTACAAATAAACAACAACAATAATAATAATGTAACACTTTAA

mRNA sequence

ATGGACCCATTTGAGATGAGCCCTGTGGGGGGCCATGATTTCAGGCCCGTGAAGCATGACATTGCACCTTACAGCCAGGTTATGGGCCGTTGGCCCAAGGACAATGAGAGCCGTTTGGGCCTTGGAAATTTGGAATTTGAAGATGAGGTTTTTGGGCCTGAGTCTTTGGAGTTTGATGCATTGGGACGTGGGCCTTACACAGGGCTTGCTGATGGCCGCATTGTTAGATGGATGGGGGAGGAAGTTGGGTGGGAGACTTTTGCAATTGTCACAACAAATTGGTCAGAGAAGGTATGTGCACAAGGGGTTGATTCAACCACAGCAAAACAATGGAAACATGAGAAGAAATGTGGAAGGCCACTTGGGCTAAGATTTGAGAAAGAAAGTGGGAATTTGTACATTGCTGATGCATATTATGGGCTTCTTGTGGTTGGCCCTCAAGGAGGAATTGCCACTCCCTTAGCCACTCATGTTGAAGGAACTCCCATTCTCTTTGCTAATGATCTTGACATTCACAACAATGGCTCCATCTTCTTCACTGACACTAGCAAGAGATATAATAGAGTGAATCATTTCTTCATATTATTGGAAGGAGAAGCTTCAGGTAGGCTTCTTAGATATGATCCTTCAACTAAAACAACTCATGTTGTGTTGAATGGATTGGCCTTTCCAAATGGGGTGCAATTGTCTAAAGACCATACCTTCCTTCTTTATACAGAAACTACCAATTGCAGATTAATGAAGCTGTGGCTAGAAGGTGCAAAGAATGGAAAAGTTGAAGTAGTAGCCAATCTTCCAGGGTTTCCAGACAATGTAAGAAGGAATGAAAGGGATGAATATTGGGTAGCCATTGATTGTTGTAGAACTAAAACACAAGAAGTTTTAACTCACAATCCATGGATAAGAAACATTTATTTTAGGTTACCATTCAGAATGAGTTTGTTGGCCAGATTAATGGGGATGAAAATGTACACTGTGATCTCACTTTTTAGTGAAAATGGAGAGATTTTAGAAGTTCTTGAAGATCAAAAGGGTGAAGTTATGAAGCTAATGAGTGAAGTTAGAGAAGTGAAAGGAAAGCTTTGGATTGGAACTGTGGCTCATAATCATATTGCTACATTGACTTACCCTTTACAAATAAACAACAACAATAATAATAATGTAACACTTTAA

Coding sequence (CDS)

ATGGACCCATTTGAGATGAGCCCTGTGGGGGGCCATGATTTCAGGCCCGTGAAGCATGACATTGCACCTTACAGCCAGGTTATGGGCCGTTGGCCCAAGGACAATGAGAGCCGTTTGGGCCTTGGAAATTTGGAATTTGAAGATGAGGTTTTTGGGCCTGAGTCTTTGGAGTTTGATGCATTGGGACGTGGGCCTTACACAGGGCTTGCTGATGGCCGCATTGTTAGATGGATGGGGGAGGAAGTTGGGTGGGAGACTTTTGCAATTGTCACAACAAATTGGTCAGAGAAGGTATGTGCACAAGGGGTTGATTCAACCACAGCAAAACAATGGAAACATGAGAAGAAATGTGGAAGGCCACTTGGGCTAAGATTTGAGAAAGAAAGTGGGAATTTGTACATTGCTGATGCATATTATGGGCTTCTTGTGGTTGGCCCTCAAGGAGGAATTGCCACTCCCTTAGCCACTCATGTTGAAGGAACTCCCATTCTCTTTGCTAATGATCTTGACATTCACAACAATGGCTCCATCTTCTTCACTGACACTAGCAAGAGATATAATAGAGTGAATCATTTCTTCATATTATTGGAAGGAGAAGCTTCAGGTAGGCTTCTTAGATATGATCCTTCAACTAAAACAACTCATGTTGTGTTGAATGGATTGGCCTTTCCAAATGGGGTGCAATTGTCTAAAGACCATACCTTCCTTCTTTATACAGAAACTACCAATTGCAGATTAATGAAGCTGTGGCTAGAAGGTGCAAAGAATGGAAAAGTTGAAGTAGTAGCCAATCTTCCAGGGTTTCCAGACAATGTAAGAAGGAATGAAAGGGATGAATATTGGGTAGCCATTGATTGTTGTAGAACTAAAACACAAGAAGTTTTAACTCACAATCCATGGATAAGAAACATTTATTTTAGGTTACCATTCAGAATGAGTTTGTTGGCCAGATTAATGGGGATGAAAATGTACACTGTGATCTCACTTTTTAGTGAAAATGGAGAGATTTTAGAAGTTCTTGAAGATCAAAAGGGTGAAGTTATGAAGCTAATGAGTGAAGTTAGAGAAGTGAAAGGAAAGCTTTGGATTGGAACTGTGGCTCATAATCATATTGCTACATTGACTTACCCTTTACAAATAAACAACAACAATAATAATAATGTAACACTTTAA

Protein sequence

MDPFEMSPVGGHDFRPVKHDIAPYSQVMGRWPKDNESRLGLGNLEFEDEVFGPESLEFDALGRGPYTGLADGRIVRWMGEEVGWETFAIVTTNWSEKVCAQGVDSTTAKQWKHEKKCGRPLGLRFEKESGNLYIADAYYGLLVVGPQGGIATPLATHVEGTPILFANDLDIHNNGSIFFTDTSKRYNRVNHFFILLEGEASGRLLRYDPSTKTTHVVLNGLAFPNGVQLSKDHTFLLYTETTNCRLMKLWLEGAKNGKVEVVANLPGFPDNVRRNERDEYWVAIDCCRTKTQEVLTHNPWIRNIYFRLPFRMSLLARLMGMKMYTVISLFSENGEILEVLEDQKGEVMKLMSEVREVKGKLWIGTVAHNHIATLTYPLQINNNNNNNVTL
BLAST of Cla021877 vs. Swiss-Prot
Match: SSL13_ARATH (Protein STRICTOSIDINE SYNTHASE-LIKE 13 OS=Arabidopsis thaliana GN=SSL13 PE=1 SV=1)

HSP 1 Score: 622.1 bits (1603), Expect = 4.2e-177
Identity = 291/382 (76.18%), Postives = 330/382 (86.39%), Query Frame = 1

Query: 1   MDPFEMSPVGGHDFRPVKHDIAPYSQVMGRWPKDNESRLGL-GNLEFEDEVFGPESLEFD 60
           +DPF MSP+GG +F+PVKH++APY +VMG WP+DN SRLG  G LEF D+VFGPESLEFD
Sbjct: 32  IDPFHMSPIGGREFKPVKHEVAPYKEVMGSWPRDNLSRLGNHGKLEFVDQVFGPESLEFD 91

Query: 61  ALGRGPYTGLADGRIVRWMGEEVGWETFAIVTTNWSEKVCAQGVDSTTAKQWKHEKKCGR 120
           +LGRGPYTGLADGR+VRWMGE +GWETF++VT+ WSE+ C +GVDSTT KQWKHEK CGR
Sbjct: 92  SLGRGPYTGLADGRVVRWMGEAIGWETFSVVTSKWSEEACVRGVDSTTNKQWKHEKLCGR 151

Query: 121 PLGLRFEKESGNLYIADAYYGLLVVGPQGGIATPLATHVEGTPILFANDLDIHNNGSIFF 180
           PLGLRF KE+GNLYIADAYYGLLVVGP+GGIATPLATHVEG PILFANDLDIH NGSIFF
Sbjct: 152 PLGLRFHKETGNLYIADAYYGLLVVGPEGGIATPLATHVEGKPILFANDLDIHRNGSIFF 211

Query: 181 TDTSKRYNRVNHFFILLEGEASGRLLRYDPSTKTTHVVLNGLAFPNGVQLSKDHTFLLYT 240
           TDTSKRY+R NHFFILLEGE++GRLLRYDP TKTTH+VL GLAFPNG+QLSKD +FLL+T
Sbjct: 212 TDTSKRYDRANHFFILLEGESTGRLLRYDPPTKTTHIVLEGLAFPNGIQLSKDQSFLLFT 271

Query: 241 ETTNCRLMKLWLEGAKNGKVEVVANLPGFPDNVRRNERDEYWVAIDCCRTKTQEVLTHNP 300
           ETTNCRL+K WLEG K G+VEVVA+LPGFPDNVR NE  ++WVAIDCCRT  QEVLT+NP
Sbjct: 272 ETTNCRLVKYWLEGPKMGEVEVVADLPGFPDNVRINEEGQFWVAIDCCRTPAQEVLTNNP 331

Query: 301 WIRNIYFRLPFRMSLLARLMGMKMYTVISLFSENGEILEVLEDQKGEVMKLMSEVREVKG 360
           WIR+IYFRLP  M LLA+ MGM+MYTVIS F E G++LEVLED++G+VM           
Sbjct: 332 WIRSIYFRLPIPMKLLAKTMGMRMYTVISRFDEEGKVLEVLEDRQGKVM----------- 391

Query: 361 KLWIGTVAHNHIATLTYPLQIN 382
           KLWIGTVAHNHIATL YPL +N
Sbjct: 392 KLWIGTVAHNHIATLPYPLTMN 402

BLAST of Cla021877 vs. Swiss-Prot
Match: SSL3_ARATH (Protein STRICTOSIDINE SYNTHASE-LIKE 3 OS=Arabidopsis thaliana GN=SSL3 PE=2 SV=1)

HSP 1 Score: 307.4 bits (786), Expect = 2.3e-82
Identity = 163/379 (43.01%), Postives = 231/379 (60.95%), Query Frame = 1

Query: 1   MDPFEMSPVGGH-DFRPVKHDIAPYSQVMGRWPKDNESRLGLGNLEFEDEVFGPESLEFD 60
           +DPF  S +    DF+  K D+ P S +     +D ++ L    + F +EV GPES+ FD
Sbjct: 19  IDPFSHSSISKFPDFKTYKIDMPPLSSLPKE--RDRQNLLQNSEIRFLNEVQGPESIAFD 78

Query: 61  ALGRGPYTGLADGRIVRWMGEEVGWETFAIVTTNWSEKVCAQGVDSTTAKQWKHEKKCGR 120
             GRGPYTG+ADGRI+ W G    W  FA  + N SE +C      +     K E  CGR
Sbjct: 79  PQGRGPYTGVADGRILFWNGTR--WTDFAYTSNNRSE-LCDP--KPSLLDYLKDEDICGR 138

Query: 121 PLGLRFEKESGNLYIADAYYGLLVVGPQGGIATPLATHVEGTPILFANDLDIHNNGSIFF 180
           PLGLRF+K++G+LYIADAY G++ VGP+GG+AT +    +G P+ F NDLDI + G+++F
Sbjct: 139 PLGLRFDKKNGDLYIADAYLGIMKVGPEGGLATSVTNEADGVPLRFTNDLDIDDEGNVYF 198

Query: 181 TDTSKRYNRVNHFFILLEGEASGRLLRYDPSTKTTHVVLNGLAFPNGVQLSKDHTFLLYT 240
           TD+S  + R     +++ GE SGR+L+Y+P TK T  ++  L FPNG+ L KD +F ++ 
Sbjct: 199 TDSSSFFQRRKFMLLIVSGEDSGRVLKYNPKTKETTTLVRNLQFPNGLSLGKDGSFFIFC 258

Query: 241 ETTNCRLMKLWLEGAKNGKVEVVANLPGFPDNVRRNERDEYWVAIDCCRTKTQEVLTHNP 300
           E +  RL K WL+G K G  EVVA L GFPDN+R N+  ++WVA+ C R     ++ H P
Sbjct: 259 EGSIGRLRKYWLKGEKAGTSEVVALLHGFPDNIRTNKDGDFWVAVHCHRNIFTHLMAHYP 318

Query: 301 WIRNIYFRLPFRMSLLARL-MGMKMYTVISLFSENGEILEVLEDQKGEVMKLMSEVREVK 360
            +R  + +LP  +     L +G   + V   +SE G++L+VLED KG+V+K +SEV E  
Sbjct: 319 RVRKFFLKLPISVKFQYLLQVGGWPHAVAVKYSEEGKVLKVLEDSKGKVVKAVSEVEEKD 378

Query: 361 GKLWIGTVAHNHIATLTYP 378
           GKLW+G+V  + IA    P
Sbjct: 379 GKLWMGSVLMSFIAVYDLP 390

BLAST of Cla021877 vs. Swiss-Prot
Match: SSL10_ARATH (Protein STRICTOSIDINE SYNTHASE-LIKE 10 OS=Arabidopsis thaliana GN=SSL10 PE=2 SV=1)

HSP 1 Score: 297.7 bits (761), Expect = 1.8e-79
Identity = 155/318 (48.74%), Postives = 205/318 (64.47%), Query Frame = 1

Query: 52  GPESLEFDALGRGPYTGLADGRIVRWMGEEVGWETFAIVTTNWSEKVCAQGVDSTTAKQW 111
           GPES+ FD  G GPY G++DGRI++W GE +GW  FA  ++N  E  CA+      A + 
Sbjct: 56  GPESIAFDPAGEGPYVGVSDGRILKWRGEPLGWSDFAHTSSNRQE--CARPF----APEL 115

Query: 112 KHEKKCGRPLGLRFEKESGNLYIADAYYGLLVVGPQGGIATPLATHVEGTPILFANDLDI 171
           +H   CGRPLGLRF+K++G+LYIADAY+GLLVVGP GG+A PL T  EG P  F NDLDI
Sbjct: 116 EHV--CGRPLGLRFDKKTGDLYIADAYFGLLVVGPAGGLAKPLVTEAEGQPFRFTNDLDI 175

Query: 172 HNNGS-IFFTDTSKRYNRVNHFFILLEGEASGRLLRYDPSTKTTHVVLNGLAFPNGVQLS 231
                 I+FTDTS R+ R      +L  + +GR ++YD S+K   V+L GLAF NGV LS
Sbjct: 176 DEQEDVIYFTDTSARFQRRQFLAAVLNVDKTGRFIKYDRSSKKATVLLQGLAFANGVALS 235

Query: 232 KDHTFLLYTETTNCRLMKLWLEGAKNGKVEVVANLPGFPDNVRRNERDEYWVAIDCCRTK 291
           KD +F+L  ETT C++++LWL G   G  +V A LPGFPDN+RRN   E+WVA+   +  
Sbjct: 236 KDRSFVLVVETTTCKILRLWLSGPNAGTHQVFAELPGFPDNIRRNSNGEFWVALHSKKGL 295

Query: 292 TQEVLTHNPWIRNIYFRLPFRMSLLARLM--GMKMYTVISLFSENGEILEVLEDQKGEVM 351
             ++     W R++  RLP     L  L   G+   T I L SE+G++LEVLED++G+ +
Sbjct: 296 FAKLSLTQTWFRDLVLRLPISPQRLHSLFTGGIPHATAIKL-SESGKVLEVLEDKEGKTL 355

Query: 352 KLMSEVREVKGKLWIGTV 367
           + +SEV E  GKLWIG+V
Sbjct: 356 RFISEVEEKDGKLWIGSV 364

BLAST of Cla021877 vs. Swiss-Prot
Match: SSL8_ARATH (Protein STRICTOSIDINE SYNTHASE-LIKE 8 OS=Arabidopsis thaliana GN=SSL8 PE=2 SV=1)

HSP 1 Score: 245.0 bits (624), Expect = 1.4e-63
Identity = 134/322 (41.61%), Postives = 190/322 (59.01%), Query Frame = 1

Query: 50  VFGPESLEFDALGRGPYTGLADGRIVRWMGEEVGWETFAIVTTNWSEKVCAQGVDSTTAK 109
           V GPESLEFD  G GPY G+ DGRI++W GEE+GW  FA  + +       + V S    
Sbjct: 52  VDGPESLEFDPQGEGPYVGVTDGRILKWRGEELGWVDFAYTSPHRDNCSSHEVVPS---- 111

Query: 110 QWKHEKKCGRPLGLRFEKESGNLYIADAYYGLLVVGPQGGIATPLATHVEGTPILFANDL 169
                  CGRPLGL FE+++G+LYI D Y+G++ VGP+GG+A  +    EG  ++FAN  
Sbjct: 112 -------CGRPLGLSFERKTGDLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQG 171

Query: 170 DIHNNGSIF-FTDTSKRYNRVNHFFILLEGEASGRLLRYDPSTKTTHVVLNGLAFPNGVQ 229
           DI     IF F D+S  Y+  + F++ L G   GR++RYD   K   V+++ L  PNG+ 
Sbjct: 172 DIDEEEDIFYFNDSSDTYHFRDVFYVSLSGTKVGRVIRYDMKKKEAKVIMDKLRLPNGLA 231

Query: 230 LSKDHTFLLYTETTNCRLMKLWLEGAKNGKVEVVANLPGFPDNVRRNERDEYWVAIDCCR 289
           LSK+ +F++  E++     ++W++G K+G  EV A LPG PDN+RR    ++WVA+ C +
Sbjct: 232 LSKNGSFVVTCESSTNICHRIWVKGPKSGTNEVFATLPGSPDNIRRTPTGDFWVALHCKK 291

Query: 290 TK-TQEVLTHNPWIRNIYFRLPFRMSLLARLM--GMKMYTVISLFSENGEILEVLEDQKG 349
              T+ VL H  W+   +F    +M  +   M  G     V+ L  E GEILE+LED +G
Sbjct: 292 NLFTRAVLIHT-WVGR-FFMNTMKMETVIHFMNGGKPHGIVVKLSGETGEILEILEDSEG 351

Query: 350 EVMKLMSEVREVK-GKLWIGTV 367
           + +K +SE  E K GKLWIG+V
Sbjct: 352 KTVKYVSEAYETKDGKLWIGSV 360

BLAST of Cla021877 vs. Swiss-Prot
Match: SSL9_ARATH (Protein STRICTOSIDINE SYNTHASE-LIKE 9 OS=Arabidopsis thaliana GN=SSL9 PE=2 SV=1)

HSP 1 Score: 237.3 bits (604), Expect = 2.9e-61
Identity = 119/320 (37.19%), Postives = 186/320 (58.13%), Query Frame = 1

Query: 50  VFGPESLEFDALGRGPYTGLADGRIVRWMGEEVGWETFAIVTTNWSEKVCAQGVDSTTAK 109
           V GPES+EFD  G GPY  + DGRI++W G+++GW  FA  + +       + V +    
Sbjct: 51  VAGPESIEFDPKGEGPYAAVVDGRILKWRGDDLGWVDFAYTSPHRGNCSKTEVVPT---- 110

Query: 110 QWKHEKKCGRPLGLRFEKESGNLYIADAYYGLLVVGPQGGIATPLATHVEGTPILFANDL 169
                  CGRPLGL FEK++G+LYI D Y GL+ VGP+GG+A  +    EG  ++FAN  
Sbjct: 111 -------CGRPLGLTFEKKTGDLYICDGYLGLMKVGPEGGLAELIVDEAEGRKVMFANQG 170

Query: 170 DIHNNGSIF-FTDTSKRYNRVNHFFILLEGEASGRLLRYDPSTKTTHVVLNGLAFPNGVQ 229
           DI     +F F D+S +Y+  + FF+ + GE SGR++RYD  TK   V+++ L   NG+ 
Sbjct: 171 DIDEEEDVFYFNDSSDKYHFRDVFFVAVSGERSGRVIRYDKKTKEAKVIMDNLVCNNGLA 230

Query: 230 LSKDHTFLLYTETTNCRLMKLWLEGAKNGKVEVVANLPGFPDNVRRNERDEYWVAIDCCR 289
           L+KD +FL+  E+    + + W++G K G  ++ A +PG+PDN+R     ++W+ + C +
Sbjct: 231 LNKDRSFLITCESGTSLVHRYWIKGPKAGTRDIFAKVPGYPDNIRLTSTGDFWIGLHCKK 290

Query: 290 TKTQEVLTHNPWIRNIYFRLPFRMSLLARLMGMKMYTV-ISLFSENGEILEVLEDQKGEV 349
                ++    W+  +  +      ++A + G K + V + +  E GE+LE+LED++G+ 
Sbjct: 291 NLIGRLIVKYKWLGKLVEKTMKLEYVIAFINGFKPHGVAVKISGETGEVLELLEDKEGKT 350

Query: 350 MKLMSEVRE-VKGKLWIGTV 367
           MK +SE  E   GKLW G+V
Sbjct: 351 MKYVSEAYERDDGKLWFGSV 359

BLAST of Cla021877 vs. TrEMBL
Match: A0A0A0L9D9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G174550 PE=4 SV=1)

HSP 1 Score: 766.5 bits (1978), Expect = 1.5e-218
Identity = 368/392 (93.88%), Postives = 380/392 (96.94%), Query Frame = 1

Query: 1   MDPFEMSPVGGHDFRPVKHDIAPYSQVMGRWPKDNESRLGLGNLEFEDEVFGPESLEFDA 60
           MDPFEMSPVGG+DFRPVKHDIAPYSQVMG WPKDNESRLGLGNLEFEDEVFGPESLEFDA
Sbjct: 30  MDPFEMSPVGGYDFRPVKHDIAPYSQVMGHWPKDNESRLGLGNLEFEDEVFGPESLEFDA 89

Query: 61  LGRGPYTGLADGRIVRWMGEEVGWETFAIVTTNWSEKVCAQGVDSTTAKQWKHEKKCGRP 120
           LGRGPYTGLADGRIVRWMGEE+GWETFAIVT NWSEKVCA+GVDSTTAKQWK+EKKCGRP
Sbjct: 90  LGRGPYTGLADGRIVRWMGEEIGWETFAIVTPNWSEKVCAKGVDSTTAKQWKNEKKCGRP 149

Query: 121 LGLRFEKESGNLYIADAYYGLLVVGPQGGIATPLATHVEGTPILFANDLDIHNNGSIFFT 180
           LGLRFEK+SGNLYIADAYYGLLVVGPQGG ATPLATHVEGTPILFANDLDIHNNGSIFFT
Sbjct: 150 LGLRFEKQSGNLYIADAYYGLLVVGPQGGTATPLATHVEGTPILFANDLDIHNNGSIFFT 209

Query: 181 DTSKRYNRVNHFFILLEGEASGRLLRYDPSTKTTHVVLNGLAFPNGVQLSKDHTFLLYTE 240
           DTSKRYNRV HFFILLEGEASGRLLRYDPSTKTTHVVLNGLAFPNGVQLSKDHTFLLYTE
Sbjct: 210 DTSKRYNRVEHFFILLEGEASGRLLRYDPSTKTTHVVLNGLAFPNGVQLSKDHTFLLYTE 269

Query: 241 TTNCRLMKLWLEGAKNGKVEVVANLPGFPDNVRRNERDEYWVAIDCCRTKTQEVLTHNPW 300
           TTNCRLMKLWLEGA+NGKVEVVANLPGFPDNVRRN+R+EYWVAIDCCRTK QEVLTHNPW
Sbjct: 270 TTNCRLMKLWLEGARNGKVEVVANLPGFPDNVRRNDRNEYWVAIDCCRTKAQEVLTHNPW 329

Query: 301 IRNIYFRLPFRMSLLARLMGMKMYTVISLFSENGEILEVLEDQKGEVMKLMSEVREVKGK 360
           IR+IYFRLP RMS LARL+GMKMYTVISLFSENGEILEVLEDQKGEVM+LMSEVREV+GK
Sbjct: 330 IRSIYFRLPLRMSFLARLIGMKMYTVISLFSENGEILEVLEDQKGEVMELMSEVREVQGK 389

Query: 361 LWIGTVAHNHIATLTYPLQINNN--NNNNVTL 391
           LWIGTVAHNHIATLTYPLQ  NN  NNNN TL
Sbjct: 390 LWIGTVAHNHIATLTYPLQSKNNDHNNNNATL 421

BLAST of Cla021877 vs. TrEMBL
Match: M5XQY7_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006506mg PE=4 SV=1)

HSP 1 Score: 680.2 bits (1754), Expect = 1.4e-192
Identity = 314/377 (83.29%), Postives = 348/377 (92.31%), Query Frame = 1

Query: 1   MDPFEMSPVGGHDFRPVKHDIAPYSQVMGRWPKDNESRLGLGNLEFEDEVFGPESLEFDA 60
           MDP +M P+G H+FRPVKH+IAPY QVM RWP+DNESRLG G LEFEDEVFGPESLEFDA
Sbjct: 32  MDPLQMGPLGDHEFRPVKHNIAPYKQVMERWPRDNESRLGFGKLEFEDEVFGPESLEFDA 91

Query: 61  LGRGPYTGLADGRIVRWMGEEVGWETFAIVTTNWSEKVCAQGVDSTTAKQWKHEKKCGRP 120
            GRGPYTGLADGRIVRWMG ++GWETFA+VT+NWS++VCA+G+DSTT KQWKHEKKCGRP
Sbjct: 92  FGRGPYTGLADGRIVRWMGNDLGWETFALVTSNWSKQVCAKGIDSTTHKQWKHEKKCGRP 151

Query: 121 LGLRFEKESGNLYIADAYYGLLVVGPQGGIATPLATHVEGTPILFANDLDIHNNGSIFFT 180
           LGLRF+KESG+LYIADAYYGLLVVGPQGG+ATPL+THVEG PILFANDLDIH NGSIFFT
Sbjct: 152 LGLRFDKESGDLYIADAYYGLLVVGPQGGLATPLSTHVEGKPILFANDLDIHKNGSIFFT 211

Query: 181 DTSKRYNRVNHFFILLEGEASGRLLRYDPSTKTTHVVLNGLAFPNGVQLSKDHTFLLYTE 240
           DTSKRYNRVNHFFILLEGE++GRLLRYDP TKTTH+VL GLAFPNG+QLSKD TFLL+TE
Sbjct: 212 DTSKRYNRVNHFFILLEGESTGRLLRYDPPTKTTHIVLEGLAFPNGLQLSKDQTFLLFTE 271

Query: 241 TTNCRLMKLWLEGAKNGKVEVVANLPGFPDNVRRNERDEYWVAIDCCRTKTQEVLTHNPW 300
           TTNCRLMK WLEG KNG VE+VANLPGFPDN+R NE+ ++WVAIDCCRT  QEVL+HNPW
Sbjct: 272 TTNCRLMKYWLEGPKNGTVELVANLPGFPDNIRINEKGQFWVAIDCCRTPAQEVLSHNPW 331

Query: 301 IRNIYFRLPFRMSLLARLMGMKMYTVISLFSENGEILEVLEDQKGEVMKLMSEVREVKGK 360
           IR++YFRLP RM+ LAR MGMKMYT+ISLF+E GEILEVLEDQKG VMKL+SEVRE KGK
Sbjct: 332 IRSVYFRLPIRMTYLARFMGMKMYTLISLFNEKGEILEVLEDQKGAVMKLVSEVREAKGK 391

Query: 361 LWIGTVAHNHIATLTYP 378
           LWIGTVAHNHIATL YP
Sbjct: 392 LWIGTVAHNHIATLPYP 408

BLAST of Cla021877 vs. TrEMBL
Match: B9MY61_POPTR (Strictosidine synthase family protein OS=Populus trichocarpa GN=POPTR_0017s05630g PE=4 SV=1)

HSP 1 Score: 666.0 bits (1717), Expect = 2.8e-188
Identity = 306/377 (81.17%), Postives = 345/377 (91.51%), Query Frame = 1

Query: 1   MDPFEMSPVGGHDFRPVKHDIAPYSQVMGRWPKDNESRLGLGNLEFEDEVFGPESLEFDA 60
           MDPF M P+G HDF+P KHD+APY QVM  WP+DN SRLG GNLEF DEVFGPESLEFD+
Sbjct: 30  MDPFRMGPLGDHDFKPFKHDLAPYKQVMENWPRDNRSRLGSGNLEFVDEVFGPESLEFDS 89

Query: 61  LGRGPYTGLADGRIVRWMGEEVGWETFAIVTTNWSEKVCAQGVDSTTAKQWKHEKKCGRP 120
           LGRGPY GLADGR+VRWMG++VGWETFA+VTTNWSEK+CA+GVDSTT+KQWKHEK CGRP
Sbjct: 90  LGRGPYAGLADGRVVRWMGQDVGWETFALVTTNWSEKLCARGVDSTTSKQWKHEKLCGRP 149

Query: 121 LGLRFEKESGNLYIADAYYGLLVVGPQGGIATPLATHVEGTPILFANDLDIHNNGSIFFT 180
           LGLR  KESGNLYIADAYYGLLVVGP+GG+ATPLATH+ G PILFANDLDIH NGSIFFT
Sbjct: 150 LGLRLHKESGNLYIADAYYGLLVVGPEGGLATPLATHLGGDPILFANDLDIHKNGSIFFT 209

Query: 181 DTSKRYNRVNHFFILLEGEASGRLLRYDPSTKTTHVVLNGLAFPNGVQLSKDHTFLLYTE 240
           DTSKRY+RV+HFFILLEGE++GRLLRYDP TKTTHVVL+GLAFPNGVQLS+D TF+++TE
Sbjct: 210 DTSKRYDRVDHFFILLEGESTGRLLRYDPPTKTTHVVLDGLAFPNGVQLSRDQTFIVFTE 269

Query: 241 TTNCRLMKLWLEGAKNGKVEVVANLPGFPDNVRRNERDEYWVAIDCCRTKTQEVLTHNPW 300
           TTNCRLMK WLEG K G+VE+VANLPGFPDNVR N+R ++WVAIDCCRT  QEVLT NPW
Sbjct: 270 TTNCRLMKYWLEGPKTGRVELVANLPGFPDNVRLNDRGQFWVAIDCCRTAAQEVLTQNPW 329

Query: 301 IRNIYFRLPFRMSLLARLMGMKMYTVISLFSENGEILEVLEDQKGEVMKLMSEVREVKGK 360
           ++++YFRLP +M  LAR+MGMKMYTV+SLF+ENGEILEVLED KGEVMKL+SEVREV+GK
Sbjct: 330 MKSVYFRLPIQMRYLARMMGMKMYTVVSLFNENGEILEVLEDPKGEVMKLVSEVREVEGK 389

Query: 361 LWIGTVAHNHIATLTYP 378
           LWIGTVAHNHIATL YP
Sbjct: 390 LWIGTVAHNHIATLPYP 406

BLAST of Cla021877 vs. TrEMBL
Match: F6HYV0_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0102g00880 PE=4 SV=1)

HSP 1 Score: 662.9 bits (1709), Expect = 2.4e-187
Identity = 307/377 (81.43%), Postives = 345/377 (91.51%), Query Frame = 1

Query: 1   MDPFEMSPVGGHDFRPVKHDIAPYSQVMGRWPKDNESRLGLGNLEFEDEVFGPESLEFDA 60
           MDPF M P+GGH+F PVKHDIAPY +VM  WP+DN SRLG G LEF DEVFGPESLEFD 
Sbjct: 30  MDPFHMGPIGGHEFMPVKHDIAPYKRVMEDWPRDNRSRLGQGKLEFVDEVFGPESLEFDI 89

Query: 61  LGRGPYTGLADGRIVRWMGEEVGWETFAIVTTNWSEKVCAQGVDSTTAKQWKHEKKCGRP 120
            GRGPYTGLADGRIVRWMG+ VGWETFA+VT NWSEK+CA+G+DSTT+KQWK E++CGRP
Sbjct: 90  FGRGPYTGLADGRIVRWMGDSVGWETFALVTPNWSEKLCAKGIDSTTSKQWKVEQRCGRP 149

Query: 121 LGLRFEKESGNLYIADAYYGLLVVGPQGGIATPLATHVEGTPILFANDLDIHNNGSIFFT 180
           LGLRF KE+G+LYIADAYYGLLVVGP+GG+ATPL THV+G PILFANDLDIH NGSIFFT
Sbjct: 150 LGLRFHKETGDLYIADAYYGLLVVGPEGGLATPLVTHVQGKPILFANDLDIHKNGSIFFT 209

Query: 181 DTSKRYNRVNHFFILLEGEASGRLLRYDPSTKTTHVVLNGLAFPNGVQLSKDHTFLLYTE 240
           DTSKRYNR+NHFFILLEGEA+GRLLRYDP T+TTH+VL+GLAFPNGVQLS D +FLL+TE
Sbjct: 210 DTSKRYNRMNHFFILLEGEATGRLLRYDPPTRTTHLVLDGLAFPNGVQLSGDQSFLLFTE 269

Query: 241 TTNCRLMKLWLEGAKNGKVEVVANLPGFPDNVRRNERDEYWVAIDCCRTKTQEVLTHNPW 300
           TTNCRLMK WLEG K+G VE+VANLPGFPDNVR NER ++WVAIDCCRT  QEVLTHNPW
Sbjct: 270 TTNCRLMKYWLEGPKSGIVELVANLPGFPDNVRLNERGQFWVAIDCCRTPAQEVLTHNPW 329

Query: 301 IRNIYFRLPFRMSLLARLMGMKMYTVISLFSENGEILEVLEDQKGEVMKLMSEVREVKGK 360
           ++NIYFRLP ++S+LARLMGMKMYTVISLF+E GEILEVLED+KG VM+L+SEVREVKGK
Sbjct: 330 LKNIYFRLPVKLSMLARLMGMKMYTVISLFNEKGEILEVLEDRKGLVMRLVSEVREVKGK 389

Query: 361 LWIGTVAHNHIATLTYP 378
           LWIGTVAHNHIATL+YP
Sbjct: 390 LWIGTVAHNHIATLSYP 406

BLAST of Cla021877 vs. TrEMBL
Match: B9SU13_RICCO (Strictosidine synthase, putative OS=Ricinus communis GN=RCOM_0454640 PE=4 SV=1)

HSP 1 Score: 658.3 bits (1697), Expect = 5.8e-186
Identity = 301/372 (80.91%), Postives = 339/372 (91.13%), Query Frame = 1

Query: 6   MSPVGGHDFRPVKHDIAPYSQVMGRWPKDNESRLGLGNLEFEDEVFGPESLEFDALGRGP 65
           M P+G HDFRPV+HD+APY QVM  WP+D E+RLG G LEF DEVFGPESLEFD+LGRGP
Sbjct: 1   MGPLGRHDFRPVEHDVAPYKQVMENWPRDKEARLGTGKLEFVDEVFGPESLEFDSLGRGP 60

Query: 66  YTGLADGRIVRWMGEEVGWETFAIVTTNWSEKVCAQGVDSTTAKQWKHEKKCGRPLGLRF 125
           Y GLADGRIVRWMGE VGWETFA+VTTNWSEK+CA+GVDSTTAKQWKHEK+CGRPLGLRF
Sbjct: 61  YAGLADGRIVRWMGEAVGWETFAVVTTNWSEKICAKGVDSTTAKQWKHEKRCGRPLGLRF 120

Query: 126 EKESGNLYIADAYYGLLVVGPQGGIATPLATHVEGTPILFANDLDIHNNGSIFFTDTSKR 185
           +K +GNLY+AD+YYGLLV+GP+GG+A PLAT V G PILFANDLDIH NGSIFFTDTSKR
Sbjct: 121 DKNTGNLYVADSYYGLLVIGPEGGLAKPLATQVAGKPILFANDLDIHENGSIFFTDTSKR 180

Query: 186 YNRVNHFFILLEGEASGRLLRYDPSTKTTHVVLNGLAFPNGVQLSKDHTFLLYTETTNCR 245
           Y+RVNHFFILLEGE++GRLLRYDP T TTH+VL+GLAFPNGVQLSKD  FLL+TETTNCR
Sbjct: 181 YDRVNHFFILLEGESTGRLLRYDPPTGTTHIVLDGLAFPNGVQLSKDQKFLLFTETTNCR 240

Query: 246 LMKLWLEGAKNGKVEVVANLPGFPDNVRRNERDEYWVAIDCCRTKTQEVLTHNPWIRNIY 305
           +MK W+EG K G VE+VANLPGFPDN+R N++  YWVAIDCCRT+ QE+LTHNPWIR++Y
Sbjct: 241 IMKYWIEGPKTGNVELVANLPGFPDNIRVNDKGHYWVAIDCCRTRAQEILTHNPWIRSVY 300

Query: 306 FRLPFRMSLLARLMGMKMYTVISLFSENGEILEVLEDQKGEVMKLMSEVREVKGKLWIGT 365
           FRLP RMS+LARLMGMKMYTV+SLF+ENGEILEVLED KG VMKL+SEVREV+GKLWIGT
Sbjct: 301 FRLPIRMSILARLMGMKMYTVVSLFNENGEILEVLEDPKGVVMKLVSEVREVQGKLWIGT 360

Query: 366 VAHNHIATLTYP 378
           VAHNHIATL YP
Sbjct: 361 VAHNHIATLPYP 372

BLAST of Cla021877 vs. NCBI nr
Match: gi|659077181|ref|XP_008439075.1| (PREDICTED: strictosidine synthase [Cucumis melo])

HSP 1 Score: 767.3 bits (1980), Expect = 1.3e-218
Identity = 368/391 (94.12%), Postives = 382/391 (97.70%), Query Frame = 1

Query: 1   MDPFEMSPVGGHDFRPVKHDIAPYSQVMGRWPKDNESRLGLGNLEFEDEVFGPESLEFDA 60
           MDPFEMSPVGG+DFRPVKHDIAPYSQVMG WPKDNESRLGLGNLEFEDEVFGPESLEFDA
Sbjct: 30  MDPFEMSPVGGYDFRPVKHDIAPYSQVMGHWPKDNESRLGLGNLEFEDEVFGPESLEFDA 89

Query: 61  LGRGPYTGLADGRIVRWMGEEVGWETFAIVTTNWSEKVCAQGVDSTTAKQWKHEKKCGRP 120
           LGRGPYTGLADGRIVRWMGEE+GWETFAIVT NWSEKVCA+GVDSTTAKQWK+EKKCGRP
Sbjct: 90  LGRGPYTGLADGRIVRWMGEEIGWETFAIVTPNWSEKVCAKGVDSTTAKQWKNEKKCGRP 149

Query: 121 LGLRFEKESGNLYIADAYYGLLVVGPQGGIATPLATHVEGTPILFANDLDIHNNGSIFFT 180
           LGLRFEK+SGNLYIADAYYGLLVVGPQGG ATPLATHVEGTPILFANDLDIHNNGSIFFT
Sbjct: 150 LGLRFEKQSGNLYIADAYYGLLVVGPQGGTATPLATHVEGTPILFANDLDIHNNGSIFFT 209

Query: 181 DTSKRYNRVNHFFILLEGEASGRLLRYDPSTKTTHVVLNGLAFPNGVQLSKDHTFLLYTE 240
           DTSKRYNRV HFFILLEGEASGRLLRYDPSTKTTHVVLNGLAFPNGVQLSKDHTFLLYTE
Sbjct: 210 DTSKRYNRVEHFFILLEGEASGRLLRYDPSTKTTHVVLNGLAFPNGVQLSKDHTFLLYTE 269

Query: 241 TTNCRLMKLWLEGAKNGKVEVVANLPGFPDNVRRNERDEYWVAIDCCRTKTQEVLTHNPW 300
           TTNCRLMKLWLEGA+NGKVEVVANLPGFPDNVRRNER+EYWVAIDCCRTK QEVLTHNPW
Sbjct: 270 TTNCRLMKLWLEGARNGKVEVVANLPGFPDNVRRNERNEYWVAIDCCRTKAQEVLTHNPW 329

Query: 301 IRNIYFRLPFRMSLLARLMGMKMYTVISLFSENGEILEVLEDQKGEVMKLMSEVREVKGK 360
           IR+IYFRLP RMS LARL+GMKMYTVISLFSENGEILEVLEDQKGEVM+LMSEVREV+GK
Sbjct: 330 IRSIYFRLPLRMSFLARLIGMKMYTVISLFSENGEILEVLEDQKGEVMELMSEVREVQGK 389

Query: 361 LWIGTVAHNHIATLTYPLQI-NNNNNNNVTL 391
           LWIGTVAHNHIATLTYPLQ+ +N+ NNNVTL
Sbjct: 390 LWIGTVAHNHIATLTYPLQLKDNDRNNNVTL 420

BLAST of Cla021877 vs. NCBI nr
Match: gi|700202119|gb|KGN57252.1| (hypothetical protein Csa_3G174550 [Cucumis sativus])

HSP 1 Score: 766.5 bits (1978), Expect = 2.2e-218
Identity = 368/392 (93.88%), Postives = 380/392 (96.94%), Query Frame = 1

Query: 1   MDPFEMSPVGGHDFRPVKHDIAPYSQVMGRWPKDNESRLGLGNLEFEDEVFGPESLEFDA 60
           MDPFEMSPVGG+DFRPVKHDIAPYSQVMG WPKDNESRLGLGNLEFEDEVFGPESLEFDA
Sbjct: 30  MDPFEMSPVGGYDFRPVKHDIAPYSQVMGHWPKDNESRLGLGNLEFEDEVFGPESLEFDA 89

Query: 61  LGRGPYTGLADGRIVRWMGEEVGWETFAIVTTNWSEKVCAQGVDSTTAKQWKHEKKCGRP 120
           LGRGPYTGLADGRIVRWMGEE+GWETFAIVT NWSEKVCA+GVDSTTAKQWK+EKKCGRP
Sbjct: 90  LGRGPYTGLADGRIVRWMGEEIGWETFAIVTPNWSEKVCAKGVDSTTAKQWKNEKKCGRP 149

Query: 121 LGLRFEKESGNLYIADAYYGLLVVGPQGGIATPLATHVEGTPILFANDLDIHNNGSIFFT 180
           LGLRFEK+SGNLYIADAYYGLLVVGPQGG ATPLATHVEGTPILFANDLDIHNNGSIFFT
Sbjct: 150 LGLRFEKQSGNLYIADAYYGLLVVGPQGGTATPLATHVEGTPILFANDLDIHNNGSIFFT 209

Query: 181 DTSKRYNRVNHFFILLEGEASGRLLRYDPSTKTTHVVLNGLAFPNGVQLSKDHTFLLYTE 240
           DTSKRYNRV HFFILLEGEASGRLLRYDPSTKTTHVVLNGLAFPNGVQLSKDHTFLLYTE
Sbjct: 210 DTSKRYNRVEHFFILLEGEASGRLLRYDPSTKTTHVVLNGLAFPNGVQLSKDHTFLLYTE 269

Query: 241 TTNCRLMKLWLEGAKNGKVEVVANLPGFPDNVRRNERDEYWVAIDCCRTKTQEVLTHNPW 300
           TTNCRLMKLWLEGA+NGKVEVVANLPGFPDNVRRN+R+EYWVAIDCCRTK QEVLTHNPW
Sbjct: 270 TTNCRLMKLWLEGARNGKVEVVANLPGFPDNVRRNDRNEYWVAIDCCRTKAQEVLTHNPW 329

Query: 301 IRNIYFRLPFRMSLLARLMGMKMYTVISLFSENGEILEVLEDQKGEVMKLMSEVREVKGK 360
           IR+IYFRLP RMS LARL+GMKMYTVISLFSENGEILEVLEDQKGEVM+LMSEVREV+GK
Sbjct: 330 IRSIYFRLPLRMSFLARLIGMKMYTVISLFSENGEILEVLEDQKGEVMELMSEVREVQGK 389

Query: 361 LWIGTVAHNHIATLTYPLQINNN--NNNNVTL 391
           LWIGTVAHNHIATLTYPLQ  NN  NNNN TL
Sbjct: 390 LWIGTVAHNHIATLTYPLQSKNNDHNNNNATL 421

BLAST of Cla021877 vs. NCBI nr
Match: gi|449459884|ref|XP_004147676.1| (PREDICTED: protein STRICTOSIDINE SYNTHASE-LIKE 13 [Cucumis sativus])

HSP 1 Score: 764.2 bits (1972), Expect = 1.1e-217
Identity = 364/387 (94.06%), Postives = 378/387 (97.67%), Query Frame = 1

Query: 1   MDPFEMSPVGGHDFRPVKHDIAPYSQVMGRWPKDNESRLGLGNLEFEDEVFGPESLEFDA 60
           MDPFEMSPVGG+DFRPVKHDIAPYSQVMG WPKDNESRLGLGNLEFEDEVFGPESLEFDA
Sbjct: 30  MDPFEMSPVGGYDFRPVKHDIAPYSQVMGHWPKDNESRLGLGNLEFEDEVFGPESLEFDA 89

Query: 61  LGRGPYTGLADGRIVRWMGEEVGWETFAIVTTNWSEKVCAQGVDSTTAKQWKHEKKCGRP 120
           LGRGPYTGLADGRIVRWMGEE+GWETFAIVT NWSEKVCA+GVDSTTAKQWK+EKKCGRP
Sbjct: 90  LGRGPYTGLADGRIVRWMGEEIGWETFAIVTPNWSEKVCAKGVDSTTAKQWKNEKKCGRP 149

Query: 121 LGLRFEKESGNLYIADAYYGLLVVGPQGGIATPLATHVEGTPILFANDLDIHNNGSIFFT 180
           LGLRFEK+SGNLYIADAYYGLLVVGPQGG ATPLATHVEGTPILFANDLDIHNNGSIFFT
Sbjct: 150 LGLRFEKQSGNLYIADAYYGLLVVGPQGGTATPLATHVEGTPILFANDLDIHNNGSIFFT 209

Query: 181 DTSKRYNRVNHFFILLEGEASGRLLRYDPSTKTTHVVLNGLAFPNGVQLSKDHTFLLYTE 240
           DTSKRYNRV HFFILLEGEASGRLLRYDPSTKTTHVVLNGLAFPNGVQLSKDHTFLLYTE
Sbjct: 210 DTSKRYNRVEHFFILLEGEASGRLLRYDPSTKTTHVVLNGLAFPNGVQLSKDHTFLLYTE 269

Query: 241 TTNCRLMKLWLEGAKNGKVEVVANLPGFPDNVRRNERDEYWVAIDCCRTKTQEVLTHNPW 300
           TTNCRLMKLWLEGA+NGKVEVVANLPGFPDNVRRN+R+EYWVAIDCCRTK QEVLTHNPW
Sbjct: 270 TTNCRLMKLWLEGARNGKVEVVANLPGFPDNVRRNDRNEYWVAIDCCRTKAQEVLTHNPW 329

Query: 301 IRNIYFRLPFRMSLLARLMGMKMYTVISLFSENGEILEVLEDQKGEVMKLMSEVREVKGK 360
           IR+IYFRLP RMS LARL+GMKMYTVISLFSENGEILEVLEDQKGEVM+LMSEVREV+GK
Sbjct: 330 IRSIYFRLPLRMSFLARLIGMKMYTVISLFSENGEILEVLEDQKGEVMELMSEVREVQGK 389

Query: 361 LWIGTVAHNHIATLTYPLQINNNNNNN 388
           LWIGTVAHNHIATLTYPLQ  NN++NN
Sbjct: 390 LWIGTVAHNHIATLTYPLQSKNNDHNN 416

BLAST of Cla021877 vs. NCBI nr
Match: gi|596148609|ref|XP_007222749.1| (hypothetical protein PRUPE_ppa006506mg [Prunus persica])

HSP 1 Score: 680.2 bits (1754), Expect = 2.1e-192
Identity = 314/377 (83.29%), Postives = 348/377 (92.31%), Query Frame = 1

Query: 1   MDPFEMSPVGGHDFRPVKHDIAPYSQVMGRWPKDNESRLGLGNLEFEDEVFGPESLEFDA 60
           MDP +M P+G H+FRPVKH+IAPY QVM RWP+DNESRLG G LEFEDEVFGPESLEFDA
Sbjct: 32  MDPLQMGPLGDHEFRPVKHNIAPYKQVMERWPRDNESRLGFGKLEFEDEVFGPESLEFDA 91

Query: 61  LGRGPYTGLADGRIVRWMGEEVGWETFAIVTTNWSEKVCAQGVDSTTAKQWKHEKKCGRP 120
            GRGPYTGLADGRIVRWMG ++GWETFA+VT+NWS++VCA+G+DSTT KQWKHEKKCGRP
Sbjct: 92  FGRGPYTGLADGRIVRWMGNDLGWETFALVTSNWSKQVCAKGIDSTTHKQWKHEKKCGRP 151

Query: 121 LGLRFEKESGNLYIADAYYGLLVVGPQGGIATPLATHVEGTPILFANDLDIHNNGSIFFT 180
           LGLRF+KESG+LYIADAYYGLLVVGPQGG+ATPL+THVEG PILFANDLDIH NGSIFFT
Sbjct: 152 LGLRFDKESGDLYIADAYYGLLVVGPQGGLATPLSTHVEGKPILFANDLDIHKNGSIFFT 211

Query: 181 DTSKRYNRVNHFFILLEGEASGRLLRYDPSTKTTHVVLNGLAFPNGVQLSKDHTFLLYTE 240
           DTSKRYNRVNHFFILLEGE++GRLLRYDP TKTTH+VL GLAFPNG+QLSKD TFLL+TE
Sbjct: 212 DTSKRYNRVNHFFILLEGESTGRLLRYDPPTKTTHIVLEGLAFPNGLQLSKDQTFLLFTE 271

Query: 241 TTNCRLMKLWLEGAKNGKVEVVANLPGFPDNVRRNERDEYWVAIDCCRTKTQEVLTHNPW 300
           TTNCRLMK WLEG KNG VE+VANLPGFPDN+R NE+ ++WVAIDCCRT  QEVL+HNPW
Sbjct: 272 TTNCRLMKYWLEGPKNGTVELVANLPGFPDNIRINEKGQFWVAIDCCRTPAQEVLSHNPW 331

Query: 301 IRNIYFRLPFRMSLLARLMGMKMYTVISLFSENGEILEVLEDQKGEVMKLMSEVREVKGK 360
           IR++YFRLP RM+ LAR MGMKMYT+ISLF+E GEILEVLEDQKG VMKL+SEVRE KGK
Sbjct: 332 IRSVYFRLPIRMTYLARFMGMKMYTLISLFNEKGEILEVLEDQKGAVMKLVSEVREAKGK 391

Query: 361 LWIGTVAHNHIATLTYP 378
           LWIGTVAHNHIATL YP
Sbjct: 392 LWIGTVAHNHIATLPYP 408

BLAST of Cla021877 vs. NCBI nr
Match: gi|645231726|ref|XP_008222532.1| (PREDICTED: strictosidine synthase 1-like [Prunus mume])

HSP 1 Score: 678.3 bits (1749), Expect = 7.8e-192
Identity = 314/377 (83.29%), Postives = 347/377 (92.04%), Query Frame = 1

Query: 1   MDPFEMSPVGGHDFRPVKHDIAPYSQVMGRWPKDNESRLGLGNLEFEDEVFGPESLEFDA 60
           MDP +M P+G H+FRPVKH+IAPY QVM RWP+DNESRLG G LEFEDEVFGPESLEFDA
Sbjct: 32  MDPLQMGPLGDHEFRPVKHNIAPYKQVMERWPRDNESRLGFGKLEFEDEVFGPESLEFDA 91

Query: 61  LGRGPYTGLADGRIVRWMGEEVGWETFAIVTTNWSEKVCAQGVDSTTAKQWKHEKKCGRP 120
           LGRGPYTGLADGRIVRWMG ++GWETFA+VTTNWS+ VCA+G+DSTT KQWKHEKKCGRP
Sbjct: 92  LGRGPYTGLADGRIVRWMGNDLGWETFALVTTNWSKPVCAKGIDSTTHKQWKHEKKCGRP 151

Query: 121 LGLRFEKESGNLYIADAYYGLLVVGPQGGIATPLATHVEGTPILFANDLDIHNNGSIFFT 180
           LGLRF+KESG+LYIADAYYGLLVVGPQGG+ATPL+THVEG PILFANDLDIH NGSIFFT
Sbjct: 152 LGLRFDKESGDLYIADAYYGLLVVGPQGGLATPLSTHVEGKPILFANDLDIHKNGSIFFT 211

Query: 181 DTSKRYNRVNHFFILLEGEASGRLLRYDPSTKTTHVVLNGLAFPNGVQLSKDHTFLLYTE 240
           DTSKRYNRVNHFFILLEGE++GRLLRYDP TKTTH+VL GLAFPNG+QLSKD TFL +TE
Sbjct: 212 DTSKRYNRVNHFFILLEGESTGRLLRYDPPTKTTHIVLEGLAFPNGLQLSKDQTFLFFTE 271

Query: 241 TTNCRLMKLWLEGAKNGKVEVVANLPGFPDNVRRNERDEYWVAIDCCRTKTQEVLTHNPW 300
           TTNCRLMK WLEG KNG VE+VA+LPGFPDN+R NE+ ++WVAIDCCRT  QEVL+HNPW
Sbjct: 272 TTNCRLMKYWLEGPKNGTVELVADLPGFPDNIRINEKGQFWVAIDCCRTPAQEVLSHNPW 331

Query: 301 IRNIYFRLPFRMSLLARLMGMKMYTVISLFSENGEILEVLEDQKGEVMKLMSEVREVKGK 360
           IR++YFRLP RM+ LAR MGMKMYT+ISLF+E GEILEVLEDQKG VMKL+SEVRE KGK
Sbjct: 332 IRSVYFRLPIRMTYLARFMGMKMYTLISLFNEKGEILEVLEDQKGAVMKLVSEVRESKGK 391

Query: 361 LWIGTVAHNHIATLTYP 378
           LWIGTVAHNHIATL YP
Sbjct: 392 LWIGTVAHNHIATLPYP 408

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
SSL13_ARATH4.2e-17776.18Protein STRICTOSIDINE SYNTHASE-LIKE 13 OS=Arabidopsis thaliana GN=SSL13 PE=1 SV=... [more]
SSL3_ARATH2.3e-8243.01Protein STRICTOSIDINE SYNTHASE-LIKE 3 OS=Arabidopsis thaliana GN=SSL3 PE=2 SV=1[more]
SSL10_ARATH1.8e-7948.74Protein STRICTOSIDINE SYNTHASE-LIKE 10 OS=Arabidopsis thaliana GN=SSL10 PE=2 SV=... [more]
SSL8_ARATH1.4e-6341.61Protein STRICTOSIDINE SYNTHASE-LIKE 8 OS=Arabidopsis thaliana GN=SSL8 PE=2 SV=1[more]
SSL9_ARATH2.9e-6137.19Protein STRICTOSIDINE SYNTHASE-LIKE 9 OS=Arabidopsis thaliana GN=SSL9 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0L9D9_CUCSA1.5e-21893.88Uncharacterized protein OS=Cucumis sativus GN=Csa_3G174550 PE=4 SV=1[more]
M5XQY7_PRUPE1.4e-19283.29Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006506mg PE=4 SV=1[more]
B9MY61_POPTR2.8e-18881.17Strictosidine synthase family protein OS=Populus trichocarpa GN=POPTR_0017s05630... [more]
F6HYV0_VITVI2.4e-18781.43Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0102g00880 PE=4 SV=... [more]
B9SU13_RICCO5.8e-18680.91Strictosidine synthase, putative OS=Ricinus communis GN=RCOM_0454640 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
gi|659077181|ref|XP_008439075.1|1.3e-21894.12PREDICTED: strictosidine synthase [Cucumis melo][more]
gi|700202119|gb|KGN57252.1|2.2e-21893.88hypothetical protein Csa_3G174550 [Cucumis sativus][more]
gi|449459884|ref|XP_004147676.1|1.1e-21794.06PREDICTED: protein STRICTOSIDINE SYNTHASE-LIKE 13 [Cucumis sativus][more]
gi|596148609|ref|XP_007222749.1|2.1e-19283.29hypothetical protein PRUPE_ppa006506mg [Prunus persica][more]
gi|645231726|ref|XP_008222532.1|7.8e-19283.29PREDICTED: strictosidine synthase 1-like [Prunus mume][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR0110426-blade_b-propeller_TolB-like
IPR018119Strictosidine_synth_cons-reg
Vocabulary: Biological Process
TermDefinition
GO:0009058biosynthetic process
Vocabulary: Molecular Function
TermDefinition
GO:0016844strictosidine synthase activity
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0042432 indole biosynthetic process
biological_process GO:0009827 plant-type cell wall modification
biological_process GO:0010584 pollen exine formation
biological_process GO:0009860 pollen tube growth
biological_process GO:0016114 terpenoid biosynthetic process
biological_process GO:0009058 biosynthetic process
cellular_component GO:0005575 cellular_component
cellular_component GO:0005783 endoplasmic reticulum
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0016020 membrane
molecular_function GO:0016844 strictosidine synthase activity
molecular_function GO:0016788 hydrolase activity, acting on ester bonds

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla021877Cla021877.1mRNA


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011042Six-bladed beta-propeller, TolB-likeGENE3DG3DSA:2.120.10.30coord: 46..373
score: 1.0
IPR018119Strictosidine synthase, conserved regionPFAMPF03088Str_synthcoord: 167..253
score: 3.1
NoneNo IPR availableunknownCoilCoilcoord: 340..360
scor
NoneNo IPR availablePANTHERPTHR10426:SF21POLLEN DEVELOPMENT PROTEIN LAP3coord: 1..377
score: 3.2E
NoneNo IPR availableunknownSSF63829Calcium-dependent phosphotriesterasecoord: 117..286
score: 1.31E-44coord: 324..375
score: 1.31E-44coord: 45..77
score: 1.31