Tan0001786 (gene) Snake gourd v1

Overview
NameTan0001786
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein SET DOMAIN GROUP 41
LocationLG10: 54087597 .. 54090003 (+)
RNA-Seq ExpressionTan0001786
SyntenyTan0001786
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAGATGGAAATGAGGGCAATGGAAGACATAGAAATGGCGGAAGACATTACTCCGCCATCGCCTCCCCTCACCTCCGCCCTCCACGATTCCTTCCTCCTTACACACTGCTCCTCCTGCTTCTTCCCTCTCCCAATTTCCCCAATTTCTCACTCCAATCTCCTCCGCTATTGCTCCCCCAAATGCTCCGATTCCGATTCCGCCACCGCCGCCTTCTTCTCCGCCAATCATCTCTCCTTCTCCGACACCGCGGACCTCCGCGCCTCGCTTCGCCTCCTCCATCTCCTCTCCGATCCCTCGGCTTGGCGCTCTGCTCCTCCCGAGCGCATCTTTGGCCTTCTCACCAACCGCGAGAAATTGATGCTCGTCGAAGACGACGACGAGGTTCTCGTCAGGATTCGGAAAGGAGCCGACGCCTTGGCCGCTTCCAGAAGGACGAACTCTGCCGATATTCACCATGGAAACGCCTTGGAAGAGGCCGTCCTATGCCTCGTTATTACCAACGCTGTGGAGGTTCAGGATTCGAACGGCCGTACCATTGGAATCGCTGTGTATGATCCTACCTTCTGCTGGATCAATCACAGTTGTTCTCCCAACGCTTGTTACAGATTTGAAACTTCGTCGGATTCCTTGAAGACAAGGATGCAGATTTCCCCCAAATGCACTGACCTTGGCACTGGTGAAGGAAGTTGTCGTCAAGTAATTGTTTGAACTGCGAATAATTTTGATGTTTGTGAACATTTTTCTCCCTTTCTCGTGAAACTAGTGAGAGCTTTTGCATGAATGTTTTTGGCAGATGGGTACTGTGGGTAGCAACCTTTCGGATTTCATAAGAAAAGGTGTGTTTCTTCACCTTTGCATTCAGCTAATGTTTGTATATTTATTTGAATTTTGATGAAGATTTTCAGGGTTATGGTCCAAGAATTGTGGTTAGGAGTATAAAGAGTATAAGGAAAGGTGAAGCAGTAACAATCGCATACTGTGACTTGTTACAACCTAAGGTATTATAGCTTAACTAAGTTTCACATAAGCATGTGCATGCCATGTTTCATTATTACCATAGTATATTTTTCAACTTTGAAGATTTGCAGAGTAAATGAATGTGATGCTTCTTTAATATTGTTGTTAATTGCCTCTTCTTGTTGAAGAATTATGATGAAATGACTGGCAATAGTTGTCTGAGAAATTTGTTGGTTTTGTCTCAGGAAATGAGGCAGACAGAGTTGTGTTCTAGATATCAATTTATCTGTAGTTGCCACCGATGTAGTGCCAAGCCCCCAACTTATGTGGACCATGCTTTGCAAGTAAGAATTTGGCGACTTAATCGTGGGTTTCTAGATTTCTTTATTGCCATGATCTCAAAAATGGTATGCATTTTAAATTTCTTCAGGAAATTTCTGCTGTCAAAGAGAAATTGTTTGTTGGTTCAACTTCCATTAGCAACTTTGATAATGACAATGCAGTGAGAAGAATAAAAGATTATGTTGATAATGCAATCTACGAGTATCTATCTATTGGTTCTCCCGAATCATGTTGTGAGAAGCTTGAAAACTTGCTTACTTTAGGTTTCTGTGACGAGCAAGTGGAAGAGGAGGAAGGAAAACAGTTGCATAATTTGAGGCTGCATCCCTTGAACTACCTGTCACTGAATGCATACACAGCTCTCGCATCGTCTTATAAAGTCCGTTCATGTGATTTATTGGCTTCCGATTCCAAAATGGGCGATGGCAACGGCGATGAACATCGACAAAATGCATCTACCACAACCAAAACAAGTGCAGCATACTCCTTGTTCCTTGCAGGTGCTACCCACCATCTTTTTCTTGCTGAACCATCTTTGATTGCTTCTGCTGCAAACTGTTGGGTTGTTGCTGGAGAATCTTTGCTTATTCTTGCTAGAAGCAGCTCATCATGTGCTACTAACACATCAAAATGTAGTTTCCCTCTGCGCAAAAGAATGTGTTCTAATTGCTCATGGGTCGATAAGTTCAATGCGAGTAGAATCCATGGTCGATCTTTCAAAGCCAATTTTTGCGAGTTTTCAAGTGGTATTTCAAATTGCATTGCTAATATTTCACAAAAATCTTGGAGCTTTCTGACTGATGGCTGCCCATATTTGAAGGCTTTCACTGACCCCTTTGATTTCAGCTGGCCAAAGACGACCACAGCGTATACGAATAACCAAGATATACGGGCTCATAGCATCGATCATTCGAGTGCTTATAGTGAAACTAAAGATATTGTTCCTCAGTGTGAACCTCAGGTGCATTCTGACCAAGAGTGGCAATCTATCTTTGAGCTTGGCATCCATTGCTTATGCTTTGGGGGCTATTTAGCAAGTATTTGTTATGGCCACCATTCACTTCTGGCATCTCAGATTCAAAACATTTTAGATAAGATGAACTGA

mRNA sequence

ATGGAGATGGAAATGAGGGCAATGGAAGACATAGAAATGGCGGAAGACATTACTCCGCCATCGCCTCCCCTCACCTCCGCCCTCCACGATTCCTTCCTCCTTACACACTGCTCCTCCTGCTTCTTCCCTCTCCCAATTTCCCCAATTTCTCACTCCAATCTCCTCCGCTATTGCTCCCCCAAATGCTCCGATTCCGATTCCGCCACCGCCGCCTTCTTCTCCGCCAATCATCTCTCCTTCTCCGACACCGCGGACCTCCGCGCCTCGCTTCGCCTCCTCCATCTCCTCTCCGATCCCTCGGCTTGGCGCTCTGCTCCTCCCGAGCGCATCTTTGGCCTTCTCACCAACCGCGAGAAATTGATGCTCGTCGAAGACGACGACGAGGTTCTCGTCAGGATTCGGAAAGGAGCCGACGCCTTGGCCGCTTCCAGAAGGACGAACTCTGCCGATATTCACCATGGAAACGCCTTGGAAGAGGCCGTCCTATGCCTCGTTATTACCAACGCTGTGGAGGTTCAGGATTCGAACGGCCGTACCATTGGAATCGCTGTGTATGATCCTACCTTCTGCTGGATCAATCACAGTTGTTCTCCCAACGCTTGTTACAGATTTGAAACTTCGTCGGATTCCTTGAAGACAAGGATGCAGATTTCCCCCAAATGCACTGACCTTGGCACTGGTGAAGGAAGTTGTCGTCAAATGGGTACTGTGGGTAGCAACCTTTCGGATTTCATAAGAAAAGATTTTCAGGGTTATGGTCCAAGAATTGTGGTTAGGAGTATAAAGAGTATAAGGAAAGGTGAAGCAGTAACAATCGCATACTGTGACTTGTTACAACCTAAGGAAATGAGGCAGACAGAGTTGTGTTCTAGATATCAATTTATCTGTAGTTGCCACCGATGTAGTGCCAAGCCCCCAACTTATGTGGACCATGCTTTGCAAGAAATTTCTGCTGTCAAAGAGAAATTGTTTGTTGGTTCAACTTCCATTAGCAACTTTGATAATGACAATGCAGTGAGAAGAATAAAAGATTATGTTGATAATGCAATCTACGAGTATCTATCTATTGGTTCTCCCGAATCATGTTGTGAGAAGCTTGAAAACTTGCTTACTTTAGGTTTCTGTGACGAGCAAGTGGAAGAGGAGGAAGGAAAACAGTTGCATAATTTGAGGCTGCATCCCTTGAACTACCTGTCACTGAATGCATACACAGCTCTCGCATCGTCTTATAAAGTCCGTTCATGTGATTTATTGGCTTCCGATTCCAAAATGGGCGATGGCAACGGCGATGAACATCGACAAAATGCATCTACCACAACCAAAACAAGTGCAGCATACTCCTTGTTCCTTGCAGGTGCTACCCACCATCTTTTTCTTGCTGAACCATCTTTGATTGCTTCTGCTGCAAACTGTTGGGTTGTTGCTGGAGAATCTTTGCTTATTCTTGCTAGAAGCAGCTCATCATGTGCTACTAACACATCAAAATGTAGTTTCCCTCTGCGCAAAAGAATGTGTTCTAATTGCTCATGGGTCGATAAGTTCAATGCGAGTAGAATCCATGGTCGATCTTTCAAAGCCAATTTTTGCGAGTTTTCAAGTGGTATTTCAAATTGCATTGCTAATATTTCACAAAAATCTTGGAGCTTTCTGACTGATGGCTGCCCATATTTGAAGGCTTTCACTGACCCCTTTGATTTCAGCTGGCCAAAGACGACCACAGCGTATACGAATAACCAAGATATACGGGCTCATAGCATCGATCATTCGAGTGCTTATAGTGAAACTAAAGATATTGTTCCTCAGTGTGAACCTCAGGTGCATTCTGACCAAGAGTGGCAATCTATCTTTGAGCTTGGCATCCATTGCTTATGCTTTGGGGGCTATTTAGCAAGTATTTGTTATGGCCACCATTCACTTCTGGCATCTCAGATTCAAAACATTTTAGATAAGATGAACTGA

Coding sequence (CDS)

ATGGAGATGGAAATGAGGGCAATGGAAGACATAGAAATGGCGGAAGACATTACTCCGCCATCGCCTCCCCTCACCTCCGCCCTCCACGATTCCTTCCTCCTTACACACTGCTCCTCCTGCTTCTTCCCTCTCCCAATTTCCCCAATTTCTCACTCCAATCTCCTCCGCTATTGCTCCCCCAAATGCTCCGATTCCGATTCCGCCACCGCCGCCTTCTTCTCCGCCAATCATCTCTCCTTCTCCGACACCGCGGACCTCCGCGCCTCGCTTCGCCTCCTCCATCTCCTCTCCGATCCCTCGGCTTGGCGCTCTGCTCCTCCCGAGCGCATCTTTGGCCTTCTCACCAACCGCGAGAAATTGATGCTCGTCGAAGACGACGACGAGGTTCTCGTCAGGATTCGGAAAGGAGCCGACGCCTTGGCCGCTTCCAGAAGGACGAACTCTGCCGATATTCACCATGGAAACGCCTTGGAAGAGGCCGTCCTATGCCTCGTTATTACCAACGCTGTGGAGGTTCAGGATTCGAACGGCCGTACCATTGGAATCGCTGTGTATGATCCTACCTTCTGCTGGATCAATCACAGTTGTTCTCCCAACGCTTGTTACAGATTTGAAACTTCGTCGGATTCCTTGAAGACAAGGATGCAGATTTCCCCCAAATGCACTGACCTTGGCACTGGTGAAGGAAGTTGTCGTCAAATGGGTACTGTGGGTAGCAACCTTTCGGATTTCATAAGAAAAGATTTTCAGGGTTATGGTCCAAGAATTGTGGTTAGGAGTATAAAGAGTATAAGGAAAGGTGAAGCAGTAACAATCGCATACTGTGACTTGTTACAACCTAAGGAAATGAGGCAGACAGAGTTGTGTTCTAGATATCAATTTATCTGTAGTTGCCACCGATGTAGTGCCAAGCCCCCAACTTATGTGGACCATGCTTTGCAAGAAATTTCTGCTGTCAAAGAGAAATTGTTTGTTGGTTCAACTTCCATTAGCAACTTTGATAATGACAATGCAGTGAGAAGAATAAAAGATTATGTTGATAATGCAATCTACGAGTATCTATCTATTGGTTCTCCCGAATCATGTTGTGAGAAGCTTGAAAACTTGCTTACTTTAGGTTTCTGTGACGAGCAAGTGGAAGAGGAGGAAGGAAAACAGTTGCATAATTTGAGGCTGCATCCCTTGAACTACCTGTCACTGAATGCATACACAGCTCTCGCATCGTCTTATAAAGTCCGTTCATGTGATTTATTGGCTTCCGATTCCAAAATGGGCGATGGCAACGGCGATGAACATCGACAAAATGCATCTACCACAACCAAAACAAGTGCAGCATACTCCTTGTTCCTTGCAGGTGCTACCCACCATCTTTTTCTTGCTGAACCATCTTTGATTGCTTCTGCTGCAAACTGTTGGGTTGTTGCTGGAGAATCTTTGCTTATTCTTGCTAGAAGCAGCTCATCATGTGCTACTAACACATCAAAATGTAGTTTCCCTCTGCGCAAAAGAATGTGTTCTAATTGCTCATGGGTCGATAAGTTCAATGCGAGTAGAATCCATGGTCGATCTTTCAAAGCCAATTTTTGCGAGTTTTCAAGTGGTATTTCAAATTGCATTGCTAATATTTCACAAAAATCTTGGAGCTTTCTGACTGATGGCTGCCCATATTTGAAGGCTTTCACTGACCCCTTTGATTTCAGCTGGCCAAAGACGACCACAGCGTATACGAATAACCAAGATATACGGGCTCATAGCATCGATCATTCGAGTGCTTATAGTGAAACTAAAGATATTGTTCCTCAGTGTGAACCTCAGGTGCATTCTGACCAAGAGTGGCAATCTATCTTTGAGCTTGGCATCCATTGCTTATGCTTTGGGGGCTATTTAGCAAGTATTTGTTATGGCCACCATTCACTTCTGGCATCTCAGATTCAAAACATTTTAGATAAGATGAACTGA

Protein sequence

MEMEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSPKCSDSDSATAAFFSANHLSFSDTADLRASLRLLHLLSDPSAWRSAPPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHSCSPNACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCHRCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEYLSIGSPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPLNYLSLNAYTALASSYKVRSCDLLASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESLLILARSSSSCATNTSKCSFPLRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISNCIANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYSETKDIVPQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQNILDKMN
Homology
BLAST of Tan0001786 vs. ExPASy Swiss-Prot
Match: Q3ECY6 (Protein SET DOMAIN GROUP 41 OS=Arabidopsis thaliana OX=3702 GN=SDG41 PE=2 SV=1)

HSP 1 Score: 360.1 bits (923), Expect = 5.1e-98
Identity = 242/648 (37.35%), Postives = 341/648 (52.62%), Query Frame = 0

Query: 3   MEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSPKC 62
           ME+RA EDIE+  D+ PP  PL S+L+DSFL +HCSSCF  LP SP        YCS  C
Sbjct: 1   MEIRAAEDIEIRTDLFPPLSPLASSLYDSFLSSHCSSCFSLLPPSPPQP----LYCSAAC 60

Query: 63  SDSDSATAAFFSANHLSFSDTADLRASLRLLHLLSDPSAWRSAPPERIFGLLTNREKLML 122
           S +DS T +      ++    +D+R S   LHLL+  +   S+ P R+  LLTN   LM 
Sbjct: 61  SLTDSFTNSPQFPPEITPILPSDIRTS---LHLLNSTAVDTSSSPHRLNNLLTNHHLLMA 120

Query: 123 VEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGI 182
              D  + V I   A+ +A   R+N         LEEA +C V+TNAVEV DSNG  +GI
Sbjct: 121 ---DPSISVAIHHAANFIATVIRSN----RKNTELEEAAICAVLTNAVEVHDSNGLALGI 180

Query: 183 AVYDPTFCWINHSCSPNACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLS 242
           A+Y+ +F WINHSCSPN+CYRF  +  S           T+  T      Q    G++L+
Sbjct: 181 ALYNSSFSWINHSCSPNSCYRFVNNRTSYH-----DVHVTNTETSSNLELQEQVCGTSLN 240

Query: 243 DFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCHRCS 302
                   G GP+++VRSIK I+ GE +T++Y DLLQP  +RQ++L S+Y+F+C+C RC+
Sbjct: 241 -----SGNGNGPKLIVRSIKRIKSGEEITVSYIDLLQPTGLRQSDLWSKYRFMCNCGRCA 300

Query: 303 AKPPTYVDHALQEISAVKEKLFVGSTSISNFD----NDNAVRRIKDYVDNAIYEYLSIG- 362
           A PP YVD  L+ +  ++ +     T++ +FD     D AV ++ DY+  AI ++LS   
Sbjct: 301 ASPPAYVDSILEGVLTLESE----KTTVGHFDGSTNKDEAVGKMNDYIQEAIDDFLSDNI 360

Query: 363 SPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPLNYLSLNAYTALASSYKVRSCDL 422
            P++CCE +E++L  G     ++ +E  Q H LRLH  +Y++LNAY  LA++Y++RS   
Sbjct: 361 DPKTCCEMIESVLHHG-----IQFKEDSQPHCLRLHACHYVALNAYITLATAYRIRSI-- 420

Query: 423 LASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGE 482
              DS+ G              ++ SAAYSLFLAG +HHLF AE S   SAA  W  AGE
Sbjct: 421 ---DSETG---------IVCDMSRISAAYSLFLAGVSHHLFCAERSFAISAAKFWKNAGE 480

Query: 483 SLLILARSSSSCATNTSKCSFPLRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISN 542
            L  LA       +  S          C+ C  ++  N+ R        +  E S  I +
Sbjct: 481 LLFDLAPKLLMELSVESDVK-------CTKCLMLETSNSHR--------DIKEKSRQILS 540

Query: 543 CIANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYSETKD 602
           C+ +ISQ +WSFLT GCPYL+ F  P DFS  +T                          
Sbjct: 541 CVRDISQVTWSFLTRGCPYLEKFRSPVDFSLTRTNG------------------------ 557

Query: 603 IVPQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQ 646
                E +  S  +  ++  L  HCL +   L  +CYG  S L S+ +
Sbjct: 601 -----EREESSKDQTVNVLLLSSHCLLYADLLTDLCYGQKSHLVSRFR 557

BLAST of Tan0001786 vs. ExPASy Swiss-Prot
Match: Q9CWR2 (Histone-lysine N-methyltransferase SMYD3 OS=Mus musculus OX=10090 GN=Smyd3 PE=2 SV=1)

HSP 1 Score: 53.9 bits (128), Expect = 7.8e-06
Identity = 58/260 (22.31%), Postives = 94/260 (36.15%), Query Frame = 0

Query: 50  SHSNLLRYCSPKCS-----DSDSATAAFFSANHLSFSDTADLRASLRLLHLLSDPSAWRS 109
           S   + +YCS KC      D     +   S       D+  L    R++  L D     S
Sbjct: 63  SQCRIAKYCSAKCQKKAWPDHRRECSCLKSCKPRYPPDSVRLLG--RVIVKLMDEKPSES 122

Query: 110 APPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCL 169
                 + L +N  K  L ED  E L ++             +++ +     L EA    
Sbjct: 123 EKLYSFYDLESNISK--LTEDKKEGLRQLAMTFQHFMREEIQDASQLPPSFDLFEA-FAK 182

Query: 170 VITNAVEVQDSNGRTIGIAVYDPTFCWINHSCSPNACYRFETSSDSLKTRMQISPKCTDL 229
           VI N+  + ++  + +G+ +Y P+   +NHSC PN    F                    
Sbjct: 183 VICNSFTICNAEMQEVGVGLY-PSMSLLNHSCDPNCSIVFN------------------- 242

Query: 230 GTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMR 289
                                       GP +++R+++ I  GE +TI Y D+L   E R
Sbjct: 243 ----------------------------GPHLLLRAVREIEAGEELTICYLDMLMTSEER 269

Query: 290 QTELCSRYQFICSCHRCSAK 305
           + +L  +Y F C C RC  +
Sbjct: 303 RKQLRDQYCFECDCIRCQTQ 269

BLAST of Tan0001786 vs. ExPASy Swiss-Prot
Match: Q9NRG4 (N-lysine methyltransferase SMYD2 OS=Homo sapiens OX=9606 GN=SMYD2 PE=1 SV=2)

HSP 1 Score: 53.1 bits (126), Expect = 1.3e-05
Identity = 76/342 (22.22%), Postives = 118/342 (34.50%), Query Frame = 0

Query: 36  HCSSCFFPLP-ISPISHSNLLRYCSPKCSDSDSATAAFFSANHLSFSDTADLRASLRLLH 95
           HC  CF     +S         YC+ +C   D        +  + F +  +   ++RL  
Sbjct: 51  HCEYCFTRKEGLSKCGRCKQAFYCNVECQKEDWPMHKLECSPMVVFGENWNPSETVRLTA 110

Query: 96  LLSDPSAWRSAPPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHH- 155
            +    A +   PER     T  EKL+ V++ +  L ++      L  S   + A +HH 
Sbjct: 111 RI---LAKQKIHPER-----TPSEKLLAVKEFESHLDKLDNEKKDLIQS---DIAALHHF 170

Query: 156 --------GNALEEAVLCLVITNAVEVQDSNGRTIGIAVYDPTFCWINHSCSPNACYRFE 215
                    N     +   V  N   ++D     +G A++ P    +NHSC PN    ++
Sbjct: 171 YSKHLGFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIF-PDVALMNHSCCPNVIVTYK 230

Query: 216 TSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIR 275
                                        GT+                    VR+++ I+
Sbjct: 231 -----------------------------GTLAE------------------VRAVQEIK 290

Query: 276 KGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCHRCSAKPPTYVDHALQEISAVKEKLFV 335
            GE V  +Y DLL P E R   L   Y F C C  C+ K               K+K  V
Sbjct: 291 PGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD--------------KDKAKV 319

Query: 336 GSTSISNFDNDNAVRRIKDYVDNAIYEYLSIGSPESCCEKLE 368
               +S+     A+R +  Y  N I E+      +S  E LE
Sbjct: 351 EIRKLSDPPKAEAIRDMVRYARNVIEEFRRAKHYKSPSELLE 319

BLAST of Tan0001786 vs. ExPASy Swiss-Prot
Match: Q9H7B4 (Histone-lysine N-methyltransferase SMYD3 OS=Homo sapiens OX=9606 GN=SMYD3 PE=1 SV=4)

HSP 1 Score: 53.1 bits (126), Expect = 1.3e-05
Identity = 58/260 (22.31%), Postives = 94/260 (36.15%), Query Frame = 0

Query: 50  SHSNLLRYCSPKCS-----DSDSATAAFFSANHLSFSDTADLRASLRLLHLLSDPSAWRS 109
           S   + +YCS KC      D         S       D+  L    R++  L D +   S
Sbjct: 63  SQCRVAKYCSAKCQKKAWPDHKRECKCLKSCKPRYPPDSVRLLG--RVVFKLMDGAPSES 122

Query: 110 APPERIFGLLTNREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCL 169
                 + L +N  K  L ED  E L ++             +++ +     L EA    
Sbjct: 123 EKLYSFYDLESNINK--LTEDKKEGLRQLVMTFQHFMREEIQDASQLPPAFDLFEA-FAK 182

Query: 170 VITNAVEVQDSNGRTIGIAVYDPTFCWINHSCSPNACYRFETSSDSLKTRMQISPKCTDL 229
           VI N+  + ++  + +G+ +Y P+   +NHSC PN    F                    
Sbjct: 183 VICNSFTICNAEMQEVGVGLY-PSISLLNHSCDPNCSIVFN------------------- 242

Query: 230 GTGEGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMR 289
                                       GP +++R+++ I  GE +TI Y D+L   E R
Sbjct: 243 ----------------------------GPHLLLRAVRDIEVGEELTICYLDMLMTSEER 269

Query: 290 QTELCSRYQFICSCHRCSAK 305
           + +L  +Y F C C RC  +
Sbjct: 303 RKQLRDQYCFECDCFRCQTQ 269

BLAST of Tan0001786 vs. ExPASy Swiss-Prot
Match: Q557F7 (SET and MYND domain-containing protein DDB_G0273589 OS=Dictyostelium discoideum OX=44689 GN=DDB_G0273589 PE=3 SV=1)

HSP 1 Score: 48.5 bits (114), Expect = 3.3e-04
Identity = 38/135 (28.15%), Postives = 56/135 (41.48%), Query Frame = 0

Query: 168 NAVEVQDSNGRTIGIAVYDPTFCWINHSCSPNACYRFETSSDSLKTRMQISPKCTDLGTG 227
           N   +   N + IG+AV  P+  + NHSC PN                     CTD+   
Sbjct: 235 NQFGIWTKNDKCIGVAV-SPSSSYFNHSCIPN---------------------CTDVRD- 294

Query: 228 EGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTE 287
                     GSN++                +S+  I+KG+ +TI+Y +L QP + R+ E
Sbjct: 295 ----------GSNMT---------------FKSLYPIKKGDQLTISYIELDQPIQDRKDE 321

Query: 288 LCSRYQFICSCHRCS 303
           L   Y F C C RC+
Sbjct: 355 LKYGYYFDCICPRCN 321

BLAST of Tan0001786 vs. NCBI nr
Match: XP_023520942.1 (protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 935.6 bits (2417), Expect = 2.2e-268
Identity = 489/653 (74.89%), Postives = 537/653 (82.24%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSP 60
           MEMEMRAMEDIEMAEDITPP PPLT+ALHD+FLLTHCSSCF PLP S ISHSNLLRYCSP
Sbjct: 1   MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFLLTHCSSCFSPLPNSSISHSNLLRYCSP 60

Query: 61  KCSDSDSATAAFFSANHLSFSDTADLRASLRLLH-LLSDPSAWRSAPPERIFGLLTNREK 120
            CS SDS TAA FS     FSDT+DLRASLRLLH LLSDPSAWRSAPPERIFGLLTNREK
Sbjct: 61  ICSHSDSLTAAVFSTGQFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNREK 120

Query: 121 LMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRT 180
           LML +DD EV V+IR+G+DA+AASRRTNSADI + NALEEA+LCLV+TNAVEVQDS GRT
Sbjct: 121 LMLADDDSEVFVKIREGSDAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGRT 180

Query: 181 IGIAVYDPTFCWINHSCSPNACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGS 240
           IGIAVY PTFCWINHSCSPNACYRFET SDS+KTR++ISP CTD+GTGEGSC QM TV  
Sbjct: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSIKTRLRISPFCTDIGTGEGSCSQMSTVRR 240

Query: 241 NLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCH 300
           N S FI KDFQGYGPR++VRSIKSIR GEAVTIAYCDLLQPK MRQ+EL SRY+F+CSC 
Sbjct: 241 NFSHFITKDFQGYGPRVMVRSIKSIRNGEAVTIAYCDLLQPKAMRQSELRSRYKFVCSCQ 300

Query: 301 RCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEYLSIGSP 360
           RCSAKPPTYVDHALQEISAV  +L + STSISNFD D A+ RI DYV+NAI EYLSIGS 
Sbjct: 301 RCSAKPPTYVDHALQEISAVNVEL-LDSTSISNFDYDTAIARIDDYVNNAIAEYLSIGSS 360

Query: 361 ESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPLNYLSLNAYTALASSYKVRSCDLLA 420
           ESCCEKL+NLLTLGF DEQ E+ +GKQL NLRLHP+++L LNAYTALAS+YKVRS     
Sbjct: 361 ESCCEKLQNLLTLGFYDEQAEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRS----- 420

Query: 421 SDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESL 480
                   NGDE++ NA T +KTSAAYSLFLAGATHHLFL+EPSLIASAANCWVVAGESL
Sbjct: 421 -------WNGDENQCNA-TMSKTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESL 480

Query: 481 LILARSSSSCATNTSKCSFPLRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISNCI 540
           LIL + SS   +NTSK S P+ +  C NCSWVDKFN SRIHGRS +A+F EFS GISNCI
Sbjct: 481 LILVKHSSLWGSNTSKSSSPMGEITCLNCSWVDKFNTSRIHGRSIEADFREFSIGISNCI 540

Query: 541 ANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYSETKDIV 600
           ANISQK WSFL   C YLKAFTDPFDFSWPKT T  +N +       D S   S+ +D+ 
Sbjct: 541 ANISQKYWSFLAHECSYLKAFTDPFDFSWPKTITTCSNYR-------DRSCDCSKIQDV- 600

Query: 601 PQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQNILDKMN 653
                   SDQ+ QSIFELGIHCL +GGYLASICYGHHS LASQIQ IL  MN
Sbjct: 601 --------SDQDRQSIFELGIHCLFYGGYLASICYGHHSHLASQIQCILHDMN 623

BLAST of Tan0001786 vs. NCBI nr
Match: XP_022974027.1 (protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita maxima])

HSP 1 Score: 931.0 bits (2405), Expect = 5.5e-267
Identity = 481/653 (73.66%), Postives = 531/653 (81.32%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSP 60
           MEME+RAMEDIEMAEDITPP PPLT+ALHDSFLLTHCSSCF PLP SPISHSNLLRYCSP
Sbjct: 1   MEMELRAMEDIEMAEDITPPLPPLTAALHDSFLLTHCSSCFSPLPNSPISHSNLLRYCSP 60

Query: 61  KCSDSDSATAAFFSANHLSFSDTADLRASLRLLH-LLSDPSAWRSAPPERIFGLLTNREK 120
            CS SDS TAA FS +H  FSDT+DLRASLRLLH LLSD SAWRS PPERIFGLLTNREK
Sbjct: 61  ICSYSDSLTAAVFSTDHFLFSDTSDLRASLRLLHLLLSDTSAWRSTPPERIFGLLTNREK 120

Query: 121 LMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRT 180
           LML +DD EV  +IRKGADA+A SRRTNSADI + NALEEA++CLV+TNAVEVQDS G+T
Sbjct: 121 LMLADDDSEVFAKIRKGADAIATSRRTNSADIRYDNALEEAIMCLVLTNAVEVQDSVGQT 180

Query: 181 IGIAVYDPTFCWINHSCSPNACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGS 240
           IGIAVY PTFCWINHSCSPNACYRFET SDS+KTR++ISP CTD+GTGEGSC QM TV  
Sbjct: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSIKTRLRISPFCTDIGTGEGSCSQMSTVRR 240

Query: 241 NLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCH 300
           N S FI KDFQGYGPR++VRSIKSIRKGEAVTIAYCDLLQPK MRQ+EL SRY+F+CSC 
Sbjct: 241 NFSHFITKDFQGYGPRVMVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELRSRYKFVCSCQ 300

Query: 301 RCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEYLSIGSP 360
           RCSAKPPTYVDHALQEI AV  +  + STSISNFD D A+ RI DYV+NAI EYLSIGSP
Sbjct: 301 RCSAKPPTYVDHALQEIFAVNVEELLDSTSISNFDYDTAITRIDDYVNNAIAEYLSIGSP 360

Query: 361 ESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPLNYLSLNAYTALASSYKVRSCDLLA 420
           ESCCEKL+NLLTLGF DEQ ++ +GKQL NLRLHP+++L LN YTALAS+YKVRS     
Sbjct: 361 ESCCEKLQNLLTLGFYDEQADDGDGKQLLNLRLHPVHFLLLNVYTALASAYKVRS----- 420

Query: 421 SDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESL 480
                   N +E++ N ST +KTSAAYSLFLAGATHHLFL EPSLIASAANCWVVAGESL
Sbjct: 421 -------WNDNENQCNTSTMSKTSAAYSLFLAGATHHLFLNEPSLIASAANCWVVAGESL 480

Query: 481 LILARSSSSCATNTSKCSFPLRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISNCI 540
           L L R SS   +NTSK S P+ +  C NCSWVDKFN SRIHGRS + +F EFS GISNCI
Sbjct: 481 LRLVRHSSLWGSNTSKSSSPMGEITCLNCSWVDKFNTSRIHGRSIEVDFQEFSIGISNCI 540

Query: 541 ANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYSETKDIV 600
           ANIS K WSFLT  CPYLKAFTDPFDFSWPKT T  +N +       D    YS+ +D+ 
Sbjct: 541 ANISHKYWSFLTHECPYLKAFTDPFDFSWPKTITTCSNYR-------DRLCDYSKIQDV- 600

Query: 601 PQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQNILDKMN 653
                   SDQ+ QSIFELGIHCL +GGYLASICYGH S L+SQIQ IL  MN
Sbjct: 601 --------SDQDRQSIFELGIHCLFYGGYLASICYGHPSHLSSQIQCILQDMN 625

BLAST of Tan0001786 vs. NCBI nr
Match: XP_022932824.1 (protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita moschata])

HSP 1 Score: 917.1 bits (2369), Expect = 8.3e-263
Identity = 480/653 (73.51%), Postives = 530/653 (81.16%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSP 60
           MEMEMRAMEDIEMAEDITPP PPLT+ALHD+F LTHCSSCF PLP S ISHSNLLRYCSP
Sbjct: 1   MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSPLPNSSISHSNLLRYCSP 60

Query: 61  KCSDSDSATAAFFSANHLSFSDTADLRASLRLLH-LLSDPSAWRSAPPERIFGLLTNREK 120
            CS SDS TAA FS +H  FSDT+DLRASLRLLH LLSD SAWRSAPPERIFGLLTNREK
Sbjct: 61  ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDSSAWRSAPPERIFGLLTNREK 120

Query: 121 LMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRT 180
           LML EDD EV V+IRKGADA+AASRRTNSADI + NALEEA+LCLV+TNAVEVQDS G+T
Sbjct: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180

Query: 181 IGIAVYDPTFCWINHSCSPNACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGS 240
           IGIAVY PTFCWINHSCSPNACYRFET SDS+ TR++ISP CTD+GTGEGSC QM TV  
Sbjct: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240

Query: 241 NLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCH 300
           N S FI KDFQGYGPR++VRSIKS+RKGEAVTIAYCDLLQPK +RQ+EL SRY+F+CSC 
Sbjct: 241 NFSHFITKDFQGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAVRQSELLSRYKFVCSCQ 300

Query: 301 RCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEYLSIGSP 360
           RCSAKPPTYVDHALQEISA   +L + STSISNFD D A+RRI DYV+NAI EYLSIGSP
Sbjct: 301 RCSAKPPTYVDHALQEISAFNVEL-LDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSP 360

Query: 361 ESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPLNYLSLNAYTALASSYKVRSCDLLA 420
           ESCCEKL+NLLTLGF DEQ E+ +GKQL NLRLHP+++L LN YTALAS+YKVRS     
Sbjct: 361 ESCCEKLQNLLTLGFYDEQAEDGDGKQLLNLRLHPVHFLLLNTYTALASAYKVRS----- 420

Query: 421 SDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESL 480
                   N DE++ NA T +KTSAAYSLFLAGATHHLFL EPSLIASAANCWVVAGESL
Sbjct: 421 -------WNDDENQCNA-TMSKTSAAYSLFLAGATHHLFLNEPSLIASAANCWVVAGESL 480

Query: 481 LILARSSSSCATNTSKCSFPLRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISNCI 540
           LIL + SS   +NTSK S P+ +  C NCSWVDKFN +RIHGRS +A+F EFS GISNCI
Sbjct: 481 LILVKHSSLWGSNTSKSSSPMGEITCLNCSWVDKFNTNRIHGRSIEADFREFSIGISNCI 540

Query: 541 ANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYSETKDIV 600
           A+IS K WSFL   C YLKAFTDPFDFSWPKT T   N           S   S+ +D+ 
Sbjct: 541 ADISHKYWSFLAHECSYLKAFTDPFDFSWPKTITTCLNYH-------GRSCDCSKIQDV- 600

Query: 601 PQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQNILDKMN 653
                   S+Q+ QSIFELGIHCL +GGYLASICYGH S LASQI+ IL  MN
Sbjct: 601 --------SEQDRQSIFELGIHCLFYGGYLASICYGHDSHLASQIECILHDMN 623

BLAST of Tan0001786 vs. NCBI nr
Match: XP_038886411.1 (protein SET DOMAIN GROUP 41 [Benincasa hispida])

HSP 1 Score: 906.7 bits (2342), Expect = 1.1e-259
Identity = 483/658 (73.40%), Postives = 533/658 (81.00%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSP 60
           MEMEM AMEDIEMAEDITPP  PLTSALHDSFL THCSSCF  LP  PISHSNLLRYCSP
Sbjct: 1   MEMEMIAMEDIEMAEDITPPLLPLTSALHDSFLFTHCSSCFSLLPNPPISHSNLLRYCSP 60

Query: 61  KC--SDSDSATAAFFSANHL--SFSDTADLRASLRLLH-LLSDPSAWRSAPPERIFGLLT 120
           KC  S SD  TAAFFS +     FS T+DLRASLRLLH LLS P A  S PPERIFGLLT
Sbjct: 61  KCSLSHSDPLTAAFFSTHPFPSPFSYTSDLRASLRLLHLLLSHPPASLSPPPERIFGLLT 120

Query: 121 NREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDS 180
           NR KLM  + D E+  ++R+G DA+AA     SADI HG+ L EA LCLV TNAV+V DS
Sbjct: 121 NRHKLMFPQHDAELFPKLREGVDAIAA---LLSADIPHGHTLAEAALCLVFTNAVDVHDS 180

Query: 181 NGRTIGIAVYDPTFCWINHSCSPNACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMG 240
            GRTIGIAVY PTFCWINHSCSPNACYRFETSS S  TR +I+P CTDL TG+GSC QMG
Sbjct: 181 TGRTIGIAVYPPTFCWINHSCSPNACYRFETSSASTTTRSRIAPSCTDLLTGQGSCSQMG 240

Query: 241 TVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFI 300
           TV SNLSDFI +DFQG GPR++VRSIKSIR+GEAVTIAYCDLLQPK MRQ+EL SRYQF+
Sbjct: 241 TVRSNLSDFITEDFQGNGPRVMVRSIKSIRRGEAVTIAYCDLLQPKAMRQSELWSRYQFV 300

Query: 301 CSCHRCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEYLS 360
           CSC RCSAKP TYVDHALQE+SA K +L   STSISNFD+D AVRRI DYV++AI EYLS
Sbjct: 301 CSCQRCSAKPLTYVDHALQELSASKVELH-DSTSISNFDHDKAVRRIDDYVNSAITEYLS 360

Query: 361 IGSPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPLNYLSLNAYTALASSYKVRSC 420
           IGSPESCCEKL NLLTLGF DEQ E+ E KQ  NLRLHPL++LSLN YTALAS+YKVRSC
Sbjct: 361 IGSPESCCEKLRNLLTLGFYDEQAEDGEQKQPVNLRLHPLHFLSLNVYTALASAYKVRSC 420

Query: 421 DLLASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVA 480
           DLLA  S+M   N D+   NAST  K SAAYSLFLAGATHHLFL+EPSLI SA+ CWV+A
Sbjct: 421 DLLALSSEMDCDNEDQ--CNASTMCKASAAYSLFLAGATHHLFLSEPSLIVSASTCWVLA 480

Query: 481 GESLLILARSSSSCA-TNTSKCSFPLRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSG 540
           GESLL LAR S   A TNTSK  FP+ KRMCS CSWVDKFNASRIHG+  +A+F EFS G
Sbjct: 481 GESLLTLARHSLLWATTNTSKWGFPVGKRMCSTCSWVDKFNASRIHGQPIEADFREFSIG 540

Query: 541 ISNCIANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYSE 600
           ISNCIAN+S+KSWSFLT GCPYLKAFTDPF+FSWPK    Y++++DIRAHSID   A S 
Sbjct: 541 ISNCIANMSRKSWSFLTHGCPYLKAFTDPFNFSWPKMIPMYSSDRDIRAHSIDRLCACSN 600

Query: 601 TKDIVPQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQNILDKMN 653
           +KD+  QCEPQ HS+QE +SI  LGIHCL +GGYLASICYGHHS LASQIQNIL  +N
Sbjct: 601 SKDVCFQCEPQ-HSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILYDLN 651

BLAST of Tan0001786 vs. NCBI nr
Match: XP_008463080.1 (PREDICTED: protein SET DOMAIN GROUP 41 isoform X1 [Cucumis melo])

HSP 1 Score: 884.8 bits (2285), Expect = 4.5e-253
Identity = 474/659 (71.93%), Postives = 529/659 (80.27%), Query Frame = 0

Query: 3   MEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSPKC 62
           MEMRA+EDIEMAEDITPP  PLTSALHDSFL THCSSCF  LP  PISHS LL YCS KC
Sbjct: 1   MEMRALEDIEMAEDITPPLFPLTSALHDSFLSTHCSSCFSLLPNPPISHSPLLHYCSLKC 60

Query: 63  --SDSDSATAAFFSANHL--SFSDTADLRASLRLLH---LLSDPSAWRSAPPERIFGLLT 122
             S SD  TAAFFS + L  + SDT+DLRASLRLLH   LLS PS   S PP RIFGLLT
Sbjct: 61  SLSHSDPLTAAFFSIHPLPDASSDTSDLRASLRLLHLHLLLSHPSPSLSPPPHRIFGLLT 120

Query: 123 NREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDS 182
           NR KLM  ++  EV +++R+ A+A+AA RR N ADI  G ALEEAVLCLV+TNAV+VQDS
Sbjct: 121 NRHKLMTPQNGSEVFLKLREAANAIAALRRKNYADISPGTALEEAVLCLVLTNAVDVQDS 180

Query: 183 NGRTIGIAVYDPTFCWINHSCSPNACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMG 242
            G+TIGIAVY PTF WINHSCSPNACYRFET SD   TR +I+P CTD  + EG+CRQMG
Sbjct: 181 IGQTIGIAVYAPTFSWINHSCSPNACYRFETPSDFFTTRFRIAPSCTDFVSDEGTCRQMG 240

Query: 243 TVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFI 302
            V SN+ DF+R+DFQG GPR+VVRSIK I+KGEAVTIAYCDLLQPK  RQ+EL SRYQF+
Sbjct: 241 NVRSNILDFMREDFQGNGPRVVVRSIKRIKKGEAVTIAYCDLLQPKARRQSELWSRYQFV 300

Query: 303 CSCHRCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEYLS 362
           CSC RCSA P TYVDHALQEISAVK +L + S  ISNFD+D AVRRI +YVDNAI EYLS
Sbjct: 301 CSCQRCSAVPLTYVDHALQEISAVKVEL-LDSAPISNFDHDTAVRRIDEYVDNAITEYLS 360

Query: 363 IGSPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPLNYLSLNAYTALASSYKVRSC 422
           IGSPESCCEKL+NLLT GF DEQVE+ EGKQ  +LRLHP ++L LNAYTAL S+YKVRSC
Sbjct: 361 IGSPESCCEKLQNLLTFGFRDEQVEDGEGKQPVSLRLHPSHFLLLNAYTALTSAYKVRSC 420

Query: 423 DLLASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVA 482
           DLLA  S+M   N  E+R NA T +KTSAAY+LFLAGATHHLFL EPSLIASAANCWVVA
Sbjct: 421 DLLALSSEMDKDN--ENRHNALTMSKTSAAYALFLAGATHHLFLFEPSLIASAANCWVVA 480

Query: 483 GESLLILARSSS--SCATNTSKCSFPLRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSS 542
           GESLLILAR SS  +  TNTS   FPL KRMCSNCSWVD+FN SRIHGR  +A+F EFS 
Sbjct: 481 GESLLILARHSSLWATTTNTSDWGFPLGKRMCSNCSWVDEFNGSRIHGRRIQADFREFSI 540

Query: 543 GISNCIANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYS 602
           GISNCIA+IS+K WSFLT GCPYLKAFTDPFDFSWPK     TN+ DI  H ID S A S
Sbjct: 541 GISNCIASISRKCWSFLTHGCPYLKAFTDPFDFSWPK-----TNDGDIGGHGIDRSCACS 600

Query: 603 ETKDIVPQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQNILDKMN 653
           +TKDI  +CEPQ  S+QE +SI  LGIHCL +GGYLASICYG+HS LASQIQNIL+ +N
Sbjct: 601 KTKDICFECEPQ-DSNQERESISGLGIHCLYYGGYLASICYGYHSHLASQIQNILNDLN 650

BLAST of Tan0001786 vs. ExPASy TrEMBL
Match: A0A6J1I954 (protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111472647 PE=4 SV=1)

HSP 1 Score: 931.0 bits (2405), Expect = 2.7e-267
Identity = 481/653 (73.66%), Postives = 531/653 (81.32%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSP 60
           MEME+RAMEDIEMAEDITPP PPLT+ALHDSFLLTHCSSCF PLP SPISHSNLLRYCSP
Sbjct: 1   MEMELRAMEDIEMAEDITPPLPPLTAALHDSFLLTHCSSCFSPLPNSPISHSNLLRYCSP 60

Query: 61  KCSDSDSATAAFFSANHLSFSDTADLRASLRLLH-LLSDPSAWRSAPPERIFGLLTNREK 120
            CS SDS TAA FS +H  FSDT+DLRASLRLLH LLSD SAWRS PPERIFGLLTNREK
Sbjct: 61  ICSYSDSLTAAVFSTDHFLFSDTSDLRASLRLLHLLLSDTSAWRSTPPERIFGLLTNREK 120

Query: 121 LMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRT 180
           LML +DD EV  +IRKGADA+A SRRTNSADI + NALEEA++CLV+TNAVEVQDS G+T
Sbjct: 121 LMLADDDSEVFAKIRKGADAIATSRRTNSADIRYDNALEEAIMCLVLTNAVEVQDSVGQT 180

Query: 181 IGIAVYDPTFCWINHSCSPNACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGS 240
           IGIAVY PTFCWINHSCSPNACYRFET SDS+KTR++ISP CTD+GTGEGSC QM TV  
Sbjct: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSIKTRLRISPFCTDIGTGEGSCSQMSTVRR 240

Query: 241 NLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCH 300
           N S FI KDFQGYGPR++VRSIKSIRKGEAVTIAYCDLLQPK MRQ+EL SRY+F+CSC 
Sbjct: 241 NFSHFITKDFQGYGPRVMVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELRSRYKFVCSCQ 300

Query: 301 RCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEYLSIGSP 360
           RCSAKPPTYVDHALQEI AV  +  + STSISNFD D A+ RI DYV+NAI EYLSIGSP
Sbjct: 301 RCSAKPPTYVDHALQEIFAVNVEELLDSTSISNFDYDTAITRIDDYVNNAIAEYLSIGSP 360

Query: 361 ESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPLNYLSLNAYTALASSYKVRSCDLLA 420
           ESCCEKL+NLLTLGF DEQ ++ +GKQL NLRLHP+++L LN YTALAS+YKVRS     
Sbjct: 361 ESCCEKLQNLLTLGFYDEQADDGDGKQLLNLRLHPVHFLLLNVYTALASAYKVRS----- 420

Query: 421 SDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESL 480
                   N +E++ N ST +KTSAAYSLFLAGATHHLFL EPSLIASAANCWVVAGESL
Sbjct: 421 -------WNDNENQCNTSTMSKTSAAYSLFLAGATHHLFLNEPSLIASAANCWVVAGESL 480

Query: 481 LILARSSSSCATNTSKCSFPLRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISNCI 540
           L L R SS   +NTSK S P+ +  C NCSWVDKFN SRIHGRS + +F EFS GISNCI
Sbjct: 481 LRLVRHSSLWGSNTSKSSSPMGEITCLNCSWVDKFNTSRIHGRSIEVDFQEFSIGISNCI 540

Query: 541 ANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYSETKDIV 600
           ANIS K WSFLT  CPYLKAFTDPFDFSWPKT T  +N +       D    YS+ +D+ 
Sbjct: 541 ANISHKYWSFLTHECPYLKAFTDPFDFSWPKTITTCSNYR-------DRLCDYSKIQDV- 600

Query: 601 PQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQNILDKMN 653
                   SDQ+ QSIFELGIHCL +GGYLASICYGH S L+SQIQ IL  MN
Sbjct: 601 --------SDQDRQSIFELGIHCLFYGGYLASICYGHPSHLSSQIQCILQDMN 625

BLAST of Tan0001786 vs. ExPASy TrEMBL
Match: A0A6J1EY39 (protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111439283 PE=4 SV=1)

HSP 1 Score: 917.1 bits (2369), Expect = 4.0e-263
Identity = 480/653 (73.51%), Postives = 530/653 (81.16%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSP 60
           MEMEMRAMEDIEMAEDITPP PPLT+ALHD+F LTHCSSCF PLP S ISHSNLLRYCSP
Sbjct: 1   MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSPLPNSSISHSNLLRYCSP 60

Query: 61  KCSDSDSATAAFFSANHLSFSDTADLRASLRLLH-LLSDPSAWRSAPPERIFGLLTNREK 120
            CS SDS TAA FS +H  FSDT+DLRASLRLLH LLSD SAWRSAPPERIFGLLTNREK
Sbjct: 61  ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDSSAWRSAPPERIFGLLTNREK 120

Query: 121 LMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRT 180
           LML EDD EV V+IRKGADA+AASRRTNSADI + NALEEA+LCLV+TNAVEVQDS G+T
Sbjct: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180

Query: 181 IGIAVYDPTFCWINHSCSPNACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGS 240
           IGIAVY PTFCWINHSCSPNACYRFET SDS+ TR++ISP CTD+GTGEGSC QM TV  
Sbjct: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240

Query: 241 NLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCH 300
           N S FI KDFQGYGPR++VRSIKS+RKGEAVTIAYCDLLQPK +RQ+EL SRY+F+CSC 
Sbjct: 241 NFSHFITKDFQGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAVRQSELLSRYKFVCSCQ 300

Query: 301 RCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEYLSIGSP 360
           RCSAKPPTYVDHALQEISA   +L + STSISNFD D A+RRI DYV+NAI EYLSIGSP
Sbjct: 301 RCSAKPPTYVDHALQEISAFNVEL-LDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSP 360

Query: 361 ESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPLNYLSLNAYTALASSYKVRSCDLLA 420
           ESCCEKL+NLLTLGF DEQ E+ +GKQL NLRLHP+++L LN YTALAS+YKVRS     
Sbjct: 361 ESCCEKLQNLLTLGFYDEQAEDGDGKQLLNLRLHPVHFLLLNTYTALASAYKVRS----- 420

Query: 421 SDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESL 480
                   N DE++ NA T +KTSAAYSLFLAGATHHLFL EPSLIASAANCWVVAGESL
Sbjct: 421 -------WNDDENQCNA-TMSKTSAAYSLFLAGATHHLFLNEPSLIASAANCWVVAGESL 480

Query: 481 LILARSSSSCATNTSKCSFPLRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISNCI 540
           LIL + SS   +NTSK S P+ +  C NCSWVDKFN +RIHGRS +A+F EFS GISNCI
Sbjct: 481 LILVKHSSLWGSNTSKSSSPMGEITCLNCSWVDKFNTNRIHGRSIEADFREFSIGISNCI 540

Query: 541 ANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYSETKDIV 600
           A+IS K WSFL   C YLKAFTDPFDFSWPKT T   N           S   S+ +D+ 
Sbjct: 541 ADISHKYWSFLAHECSYLKAFTDPFDFSWPKTITTCLNYH-------GRSCDCSKIQDV- 600

Query: 601 PQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQNILDKMN 653
                   S+Q+ QSIFELGIHCL +GGYLASICYGH S LASQI+ IL  MN
Sbjct: 601 --------SEQDRQSIFELGIHCLFYGGYLASICYGHDSHLASQIECILHDMN 623

BLAST of Tan0001786 vs. ExPASy TrEMBL
Match: A0A1S3CIT0 (protein SET DOMAIN GROUP 41 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103501316 PE=4 SV=1)

HSP 1 Score: 884.8 bits (2285), Expect = 2.2e-253
Identity = 474/659 (71.93%), Postives = 529/659 (80.27%), Query Frame = 0

Query: 3   MEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSPKC 62
           MEMRA+EDIEMAEDITPP  PLTSALHDSFL THCSSCF  LP  PISHS LL YCS KC
Sbjct: 1   MEMRALEDIEMAEDITPPLFPLTSALHDSFLSTHCSSCFSLLPNPPISHSPLLHYCSLKC 60

Query: 63  --SDSDSATAAFFSANHL--SFSDTADLRASLRLLH---LLSDPSAWRSAPPERIFGLLT 122
             S SD  TAAFFS + L  + SDT+DLRASLRLLH   LLS PS   S PP RIFGLLT
Sbjct: 61  SLSHSDPLTAAFFSIHPLPDASSDTSDLRASLRLLHLHLLLSHPSPSLSPPPHRIFGLLT 120

Query: 123 NREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDS 182
           NR KLM  ++  EV +++R+ A+A+AA RR N ADI  G ALEEAVLCLV+TNAV+VQDS
Sbjct: 121 NRHKLMTPQNGSEVFLKLREAANAIAALRRKNYADISPGTALEEAVLCLVLTNAVDVQDS 180

Query: 183 NGRTIGIAVYDPTFCWINHSCSPNACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMG 242
            G+TIGIAVY PTF WINHSCSPNACYRFET SD   TR +I+P CTD  + EG+CRQMG
Sbjct: 181 IGQTIGIAVYAPTFSWINHSCSPNACYRFETPSDFFTTRFRIAPSCTDFVSDEGTCRQMG 240

Query: 243 TVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFI 302
            V SN+ DF+R+DFQG GPR+VVRSIK I+KGEAVTIAYCDLLQPK  RQ+EL SRYQF+
Sbjct: 241 NVRSNILDFMREDFQGNGPRVVVRSIKRIKKGEAVTIAYCDLLQPKARRQSELWSRYQFV 300

Query: 303 CSCHRCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEYLS 362
           CSC RCSA P TYVDHALQEISAVK +L + S  ISNFD+D AVRRI +YVDNAI EYLS
Sbjct: 301 CSCQRCSAVPLTYVDHALQEISAVKVEL-LDSAPISNFDHDTAVRRIDEYVDNAITEYLS 360

Query: 363 IGSPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPLNYLSLNAYTALASSYKVRSC 422
           IGSPESCCEKL+NLLT GF DEQVE+ EGKQ  +LRLHP ++L LNAYTAL S+YKVRSC
Sbjct: 361 IGSPESCCEKLQNLLTFGFRDEQVEDGEGKQPVSLRLHPSHFLLLNAYTALTSAYKVRSC 420

Query: 423 DLLASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVA 482
           DLLA  S+M   N  E+R NA T +KTSAAY+LFLAGATHHLFL EPSLIASAANCWVVA
Sbjct: 421 DLLALSSEMDKDN--ENRHNALTMSKTSAAYALFLAGATHHLFLFEPSLIASAANCWVVA 480

Query: 483 GESLLILARSSS--SCATNTSKCSFPLRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSS 542
           GESLLILAR SS  +  TNTS   FPL KRMCSNCSWVD+FN SRIHGR  +A+F EFS 
Sbjct: 481 GESLLILARHSSLWATTTNTSDWGFPLGKRMCSNCSWVDEFNGSRIHGRRIQADFREFSI 540

Query: 543 GISNCIANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYS 602
           GISNCIA+IS+K WSFLT GCPYLKAFTDPFDFSWPK     TN+ DI  H ID S A S
Sbjct: 541 GISNCIASISRKCWSFLTHGCPYLKAFTDPFDFSWPK-----TNDGDIGGHGIDRSCACS 600

Query: 603 ETKDIVPQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQNILDKMN 653
           +TKDI  +CEPQ  S+QE +SI  LGIHCL +GGYLASICYG+HS LASQIQNIL+ +N
Sbjct: 601 KTKDICFECEPQ-DSNQERESISGLGIHCLYYGGYLASICYGYHSHLASQIQNILNDLN 650

BLAST of Tan0001786 vs. ExPASy TrEMBL
Match: A0A0A0KAK3 (SET domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G014840 PE=4 SV=1)

HSP 1 Score: 860.1 bits (2221), Expect = 5.8e-246
Identity = 464/661 (70.20%), Postives = 524/661 (79.27%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSP 60
           MEMEM A+EDIEMAEDI+PP  PLTSALHDSFL THCSSCF  LP  PISHS  L YCS 
Sbjct: 1   MEMEMIAVEDIEMAEDISPPLFPLTSALHDSFLFTHCSSCFSLLPNPPISHSIPLHYCSL 60

Query: 61  KC--SDSDSATAAFFSANHL--SFSDTADLRASLRLLH-LLSDPSAWRSAPPERIFGLLT 120
           KC  S SD  T AFFS +    + SDT+DLRASLRLLH LLS PS   S PP+RI+GLLT
Sbjct: 61  KCSLSHSDPLTDAFFSIHPFPDASSDTSDLRASLRLLHLLLSHPSPSLSPPPDRIYGLLT 120

Query: 121 NREKLMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDS 180
           NR KLM  ++D EV +++R+GA+A+AA RR N ADI  G ALEEAVLCLV+TNAV+VQDS
Sbjct: 121 NRHKLMTPQNDSEVFLKLREGANAIAALRRKNYADIPPGTALEEAVLCLVLTNAVDVQDS 180

Query: 181 NGRTIGIAVYDPTFCWINHSCSPNACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMG 240
            G+TIGIAVY  TF WINHSCSPNACYRFET SDS+ TR +I+P CTD  + EGSCRQMG
Sbjct: 181 IGQTIGIAVYASTFSWINHSCSPNACYRFETPSDSVTTRFRIAPSCTDFMSDEGSCRQMG 240

Query: 241 TVGSNLSDFIRKD--FQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQ 300
            V SN+ DFIR+     G GPR+VVRSIK I+KGEAVTIAYCDLLQPK  RQ+EL SRYQ
Sbjct: 241 NVRSNILDFIREGALLNGNGPRVVVRSIKRIKKGEAVTIAYCDLLQPKARRQSELWSRYQ 300

Query: 301 FICSCHRCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEY 360
           F+CSC RCSA P TYVDHALQEIS+VK +L + ST ISNFD+D AVRRI +YVDNAI EY
Sbjct: 301 FVCSCQRCSAVPLTYVDHALQEISSVKVEL-LDSTPISNFDHDTAVRRIDEYVDNAITEY 360

Query: 361 LSIGSPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPLNYLSLNAYTALASSYKVR 420
           LS  SPESCCEKL+NLLT GF DEQVE+ EGKQ  +LRLHPL++L LNAYTAL S+YKVR
Sbjct: 361 LSTSSPESCCEKLQNLLTFGFHDEQVEDGEGKQHVSLRLHPLHFLLLNAYTALTSAYKVR 420

Query: 421 SCDLLASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWV 480
           SCDL+A  S+M   NG+ H  NA T  KTSAAY+LFLAGATH LFL EPSL+ASAANCWV
Sbjct: 421 SCDLVALSSEMDKDNGNRH--NALTMGKTSAAYALFLAGATHRLFLFEPSLVASAANCWV 480

Query: 481 VAGESLLILARSSS--SCATNTSKCSFPLRKRMCSNCSWVDKFNASRIHGRSFKANFCEF 540
           VAGESLLILAR SS  +  TNTS   FPL KRMC NCSWVD+FNASRIHG+  +A+F EF
Sbjct: 481 VAGESLLILARHSSLWATTTNTSNWVFPLGKRMCYNCSWVDEFNASRIHGQPVQADFREF 540

Query: 541 SSGISNCIANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSA 600
           S GISNCIA+ISQK WS LT GCPYLKAFT PFDFSWPK     TN QDI    IDHS A
Sbjct: 541 SIGISNCIASISQKCWSSLTHGCPYLKAFTGPFDFSWPK-----TNEQDICGRGIDHSCA 600

Query: 601 YSETKDIVPQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQNILDKM 653
            S+T+D+  +C+PQ  S+QE +SI  LGIHCL +GGYLASICYGHHS LASQIQNIL+ +
Sbjct: 601 CSKTQDVCLECKPQ-DSNQERESISGLGIHCLYYGGYLASICYGHHSHLASQIQNILNDL 652

BLAST of Tan0001786 vs. ExPASy TrEMBL
Match: A0A6J1IF01 (protein SET DOMAIN GROUP 41 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111472647 PE=4 SV=1)

HSP 1 Score: 794.7 bits (2051), Expect = 3.0e-226
Identity = 406/533 (76.17%), Postives = 448/533 (84.05%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSP 60
           MEME+RAMEDIEMAEDITPP PPLT+ALHDSFLLTHCSSCF PLP SPISHSNLLRYCSP
Sbjct: 1   MEMELRAMEDIEMAEDITPPLPPLTAALHDSFLLTHCSSCFSPLPNSPISHSNLLRYCSP 60

Query: 61  KCSDSDSATAAFFSANHLSFSDTADLRASLRLLH-LLSDPSAWRSAPPERIFGLLTNREK 120
            CS SDS TAA FS +H  FSDT+DLRASLRLLH LLSD SAWRS PPERIFGLLTNREK
Sbjct: 61  ICSYSDSLTAAVFSTDHFLFSDTSDLRASLRLLHLLLSDTSAWRSTPPERIFGLLTNREK 120

Query: 121 LMLVEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRT 180
           LML +DD EV  +IRKGADA+A SRRTNSADI + NALEEA++CLV+TNAVEVQDS G+T
Sbjct: 121 LMLADDDSEVFAKIRKGADAIATSRRTNSADIRYDNALEEAIMCLVLTNAVEVQDSVGQT 180

Query: 181 IGIAVYDPTFCWINHSCSPNACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGS 240
           IGIAVY PTFCWINHSCSPNACYRFET SDS+KTR++ISP CTD+GTGEGSC QM TV  
Sbjct: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSIKTRLRISPFCTDIGTGEGSCSQMSTVRR 240

Query: 241 NLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCH 300
           N S FI KDFQGYGPR++VRSIKSIRKGEAVTIAYCDLLQPK MRQ+EL SRY+F+CSC 
Sbjct: 241 NFSHFITKDFQGYGPRVMVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELRSRYKFVCSCQ 300

Query: 301 RCSAKPPTYVDHALQEISAVKEKLFVGSTSISNFDNDNAVRRIKDYVDNAIYEYLSIGSP 360
           RCSAKPPTYVDHALQEI AV  +  + STSISNFD D A+ RI DYV+NAI EYLSIGSP
Sbjct: 301 RCSAKPPTYVDHALQEIFAVNVEELLDSTSISNFDYDTAITRIDDYVNNAIAEYLSIGSP 360

Query: 361 ESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPLNYLSLNAYTALASSYKVRSCDLLA 420
           ESCCEKL+NLLTLGF DEQ ++ +GKQL NLRLHP+++L LN YTALAS+YKVRS     
Sbjct: 361 ESCCEKLQNLLTLGFYDEQADDGDGKQLLNLRLHPVHFLLLNVYTALASAYKVRS----- 420

Query: 421 SDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGESL 480
                   N +E++ N ST +KTSAAYSLFLAGATHHLFL EPSLIASAANCWVVAGESL
Sbjct: 421 -------WNDNENQCNTSTMSKTSAAYSLFLAGATHHLFLNEPSLIASAANCWVVAGESL 480

Query: 481 LILARSSSSCATNTSKCSFPLRKRMCSNCSWVDKFNASRIHGRSFKANFCEFS 533
           L L R SS   +NTSK S P+ +  C NCSWVDKFN SRIHGRS + +F EFS
Sbjct: 481 LRLVRHSSLWGSNTSKSSSPMGEITCLNCSWVDKFNTSRIHGRSIEVDFQEFS 521

BLAST of Tan0001786 vs. TAIR 10
Match: AT1G43245.1 (SET domain-containing protein )

HSP 1 Score: 360.1 bits (923), Expect = 3.6e-99
Identity = 242/648 (37.35%), Postives = 341/648 (52.62%), Query Frame = 0

Query: 3   MEMRAMEDIEMAEDITPPSPPLTSALHDSFLLTHCSSCFFPLPISPISHSNLLRYCSPKC 62
           ME+RA EDIE+  D+ PP  PL S+L+DSFL +HCSSCF  LP SP        YCS  C
Sbjct: 1   MEIRAAEDIEIRTDLFPPLSPLASSLYDSFLSSHCSSCFSLLPPSPPQP----LYCSAAC 60

Query: 63  SDSDSATAAFFSANHLSFSDTADLRASLRLLHLLSDPSAWRSAPPERIFGLLTNREKLML 122
           S +DS T +      ++    +D+R S   LHLL+  +   S+ P R+  LLTN   LM 
Sbjct: 61  SLTDSFTNSPQFPPEITPILPSDIRTS---LHLLNSTAVDTSSSPHRLNNLLTNHHLLMA 120

Query: 123 VEDDDEVLVRIRKGADALAASRRTNSADIHHGNALEEAVLCLVITNAVEVQDSNGRTIGI 182
              D  + V I   A+ +A   R+N         LEEA +C V+TNAVEV DSNG  +GI
Sbjct: 121 ---DPSISVAIHHAANFIATVIRSN----RKNTELEEAAICAVLTNAVEVHDSNGLALGI 180

Query: 183 AVYDPTFCWINHSCSPNACYRFETSSDSLKTRMQISPKCTDLGTGEGSCRQMGTVGSNLS 242
           A+Y+ +F WINHSCSPN+CYRF  +  S           T+  T      Q    G++L+
Sbjct: 181 ALYNSSFSWINHSCSPNSCYRFVNNRTSYH-----DVHVTNTETSSNLELQEQVCGTSLN 240

Query: 243 DFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCHRCS 302
                   G GP+++VRSIK I+ GE +T++Y DLLQP  +RQ++L S+Y+F+C+C RC+
Sbjct: 241 -----SGNGNGPKLIVRSIKRIKSGEEITVSYIDLLQPTGLRQSDLWSKYRFMCNCGRCA 300

Query: 303 AKPPTYVDHALQEISAVKEKLFVGSTSISNFD----NDNAVRRIKDYVDNAIYEYLSIG- 362
           A PP YVD  L+ +  ++ +     T++ +FD     D AV ++ DY+  AI ++LS   
Sbjct: 301 ASPPAYVDSILEGVLTLESE----KTTVGHFDGSTNKDEAVGKMNDYIQEAIDDFLSDNI 360

Query: 363 SPESCCEKLENLLTLGFCDEQVEEEEGKQLHNLRLHPLNYLSLNAYTALASSYKVRSCDL 422
            P++CCE +E++L  G     ++ +E  Q H LRLH  +Y++LNAY  LA++Y++RS   
Sbjct: 361 DPKTCCEMIESVLHHG-----IQFKEDSQPHCLRLHACHYVALNAYITLATAYRIRSI-- 420

Query: 423 LASDSKMGDGNGDEHRQNASTTTKTSAAYSLFLAGATHHLFLAEPSLIASAANCWVVAGE 482
              DS+ G              ++ SAAYSLFLAG +HHLF AE S   SAA  W  AGE
Sbjct: 421 ---DSETG---------IVCDMSRISAAYSLFLAGVSHHLFCAERSFAISAAKFWKNAGE 480

Query: 483 SLLILARSSSSCATNTSKCSFPLRKRMCSNCSWVDKFNASRIHGRSFKANFCEFSSGISN 542
            L  LA       +  S          C+ C  ++  N+ R        +  E S  I +
Sbjct: 481 LLFDLAPKLLMELSVESDVK-------CTKCLMLETSNSHR--------DIKEKSRQILS 540

Query: 543 CIANISQKSWSFLTDGCPYLKAFTDPFDFSWPKTTTAYTNNQDIRAHSIDHSSAYSETKD 602
           C+ +ISQ +WSFLT GCPYL+ F  P DFS  +T                          
Sbjct: 541 CVRDISQVTWSFLTRGCPYLEKFRSPVDFSLTRTNG------------------------ 557

Query: 603 IVPQCEPQVHSDQEWQSIFELGIHCLCFGGYLASICYGHHSLLASQIQ 646
                E +  S  +  ++  L  HCL +   L  +CYG  S L S+ +
Sbjct: 601 -----EREESSKDQTVNVLLLSSHCLLYADLLTDLCYGQKSHLVSRFR 557

BLAST of Tan0001786 vs. TAIR 10
Match: AT2G17900.1 (SET domain group 37 )

HSP 1 Score: 46.2 bits (108), Expect = 1.2e-04
Identity = 43/150 (28.67%), Postives = 55/150 (36.67%), Query Frame = 0

Query: 168 NAVEVQDSNGRTIGIAVYDPTFCWINHSCSPNACYRFETSSDSLKTRMQISPKCTDLGTG 227
           NA  + DS  R  GI ++ P    INHSCSPNA   FE                      
Sbjct: 189 NAHSICDSELRPQGIGLF-PLVSIINHSCSPNAVLVFE---------------------- 248

Query: 228 EGSCRQMGTVGSNLSDFIRKDFQGYGPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTE 287
                QM                      VVR++ +I K   +TI+Y +       RQ  
Sbjct: 249 ----EQMA---------------------VVRAMDNISKDSEITISYIETAGSTLTRQKS 290

Query: 288 LCSRYQFICSCHRCS--AKPPTYVDHALQE 316
           L  +Y F C C RCS   KP    + A+ E
Sbjct: 309 LKEQYLFHCQCARCSNFGKPHDIEESAILE 290

BLAST of Tan0001786 vs. TAIR 10
Match: AT1G26760.1 (SET domain protein 35 )

HSP 1 Score: 43.1 bits (100), Expect = 9.8e-04
Identity = 19/56 (33.93%), Postives = 31/56 (55.36%), Query Frame = 0

Query: 253 GPRIVVRSIKSIRKGEAVTIAYCDLLQPKEMRQTELCSRYQFICSCHRCSAKPPTY 309
           G  ++V + + I+ GE ++ AY D+L P E R+ E+   + F C C RC  +   Y
Sbjct: 354 GDYVIVHASRDIKTGEEISFAYFDVLSPLEKRK-EMAESWGFCCGCSRCKFESVLY 408

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q3ECY65.1e-9837.35Protein SET DOMAIN GROUP 41 OS=Arabidopsis thaliana OX=3702 GN=SDG41 PE=2 SV=1[more]
Q9CWR27.8e-0622.31Histone-lysine N-methyltransferase SMYD3 OS=Mus musculus OX=10090 GN=Smyd3 PE=2 ... [more]
Q9NRG41.3e-0522.22N-lysine methyltransferase SMYD2 OS=Homo sapiens OX=9606 GN=SMYD2 PE=1 SV=2[more]
Q9H7B41.3e-0522.31Histone-lysine N-methyltransferase SMYD3 OS=Homo sapiens OX=9606 GN=SMYD3 PE=1 S... [more]
Q557F73.3e-0428.15SET and MYND domain-containing protein DDB_G0273589 OS=Dictyostelium discoideum ... [more]
Match NameE-valueIdentityDescription
XP_023520942.12.2e-26874.89protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita pepo subsp. pepo][more]
XP_022974027.15.5e-26773.66protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita maxima][more]
XP_022932824.18.3e-26373.51protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita moschata][more]
XP_038886411.11.1e-25973.40protein SET DOMAIN GROUP 41 [Benincasa hispida][more]
XP_008463080.14.5e-25371.93PREDICTED: protein SET DOMAIN GROUP 41 isoform X1 [Cucumis melo][more]
Match NameE-valueIdentityDescription
A0A6J1I9542.7e-26773.66protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC1114726... [more]
A0A6J1EY394.0e-26373.51protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC11143... [more]
A0A1S3CIT02.2e-25371.93protein SET DOMAIN GROUP 41 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103501316 P... [more]
A0A0A0KAK35.8e-24670.20SET domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G014840 PE=4 SV... [more]
A0A6J1IF013.0e-22676.17protein SET DOMAIN GROUP 41 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC1114726... [more]
Match NameE-valueIdentityDescription
AT1G43245.13.6e-9937.35SET domain-containing protein [more]
AT2G17900.11.2e-0428.67SET domain group 37 [more]
AT1G26760.19.8e-0433.93SET domain protein 35 [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3D1.10.220.160coord: 80..169
e-value: 3.5E-17
score: 65.0
NoneNo IPR availableGENE3D6.10.140.2220coord: 36..79
e-value: 3.5E-17
score: 65.0
NoneNo IPR availableGENE3D2.170.270.10SET domaincoord: 6..216
e-value: 3.5E-17
score: 65.0
NoneNo IPR availableGENE3D2.170.270.10SET domaincoord: 253..301
e-value: 2.3E-13
score: 52.6
NoneNo IPR availablePANTHERPTHR47780PROTEIN SET DOMAIN GROUP 41coord: 3..650
NoneNo IPR availableCDDcd20071SET_SMYDcoord: 157..301
e-value: 1.87909E-17
score: 76.6475
NoneNo IPR availableSUPERFAMILY82199SET domaincoord: 181..299
IPR001214SET domainPFAMPF00856SETcoord: 5..274
e-value: 4.1E-6
score: 27.3

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0001786.1Tan0001786.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0005515 protein binding