CmaCh14G008320 (gene) Cucurbita maxima (Rimu)

Name: CmaCh14G008320
Type: gene
Organism: Cucurbita maxima (Rimu)
Description: Protein SET DOMAIN GROUP, putative
Location: Cma_Chr14 : 4271660 .. 4276300 (+)
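
The location field can be split into a sequence ID, coordinates and strand, which gives the length of the genomic span. The following is a minimal sketch and not part of the database record: `parse_location` is a hypothetical helper name, and the code assumes the coordinates are 1-based and inclusive, as is usual for GFF-style annotation.

```python
import re

# Matches strings such as "Cma_Chr14 : 4271660 .. 4276300 (+)".
LOCATION_RE = re.compile(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Return sequence ID, start, end, strand and span length of a location string."""
    match = LOCATION_RE.search(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: 1-based inclusive coordinates, so both end points count.
        "length": end - start + 1,
    }


loc = parse_location("Cma_Chr14 : 4271660 .. 4276300 (+)")
print(loc["seqid"], loc["strand"], loc["length"])  # Cma_Chr14 + 4641
```

Under that assumption the gene spans 4641 bp on the forward strand of Cma_Chr14.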
The following sequences are available for this feature:

Gene sequence (with intron)

AATTAATACCTCGACAAAATGGTGAAGAAGCAGAGGCTTAAACCTTGTGTGTAAGGGAGCACAGAGGTTTTGATGGTGGGTTTGTGAAATGGGAACCGAAGGAAGTTTTGAAAGCCTGCTGAGATGGGCGGCGGATCACGGAATTTCAGATTCTGTCGACAAACAGAGTTCACATTCTTGTTTGGGTCGTTCTTTATGCGTCTGTTTCTTCCCTGATGCAGGCGGGTATGCTCTGATTTCTTGTTTTCGAGTTATATCACTCTTTTTCTTTCTTTAGCTGTATGTATTCAATTTTGAGGGAAGTTTTGCTTCAGGAGAGGTTTGGGGGCTGTTCGTCATCTTACCAAAGGAGAATTAGTGCTAAAAGTTCCAAAATCTGTCTTGTTGACGACCCAAAGTTTGTCGTTGCAAGATGAGAAGTTATCCATGGCTCTGAAGAGATACCCATCTCTTTCTTCCACTCAGGTTCTTCCTTCGCTAACATGGACTTTTCCTCTTCTTTATACATCACTTATGCATAATTAGCGCGAACTTTTGTGAGAAAAAGGTGCAAAATTAAAGTTTGAAACCCTACAAATACATGGTACATTAAACAAATCAAGAGCTTAAAAATGTCTTTTGATGTTAAGAATACGTATAGTTTAGTCTTGAACACAAATTGTTCCCAATCCTAGTGCAGTTGGTAAGCTGCCTATTAATGTGCAACAGTTATACAAGAGTGTGATCATTATAGTGTAGACAGTGATTTCTGTTGGTGTTTGTACAGAGGCTGATGAGAATCAACGAATTTCCCTATATATTTTTAGGATTAAGGTTGAGTATTCAAGATAAGACCTCCATCCTATTACAAATTTAACTGATGCAAACCTGGCTCATAAGATTGAAAAACATCTAGAAGAGTTTCTAGAAGAATCAGTCATCTAAACACAGGGGGAGTAATGGATGATCAAAGAATTTCCAAATTGGGAGTCCTACCTTCAATAATCAAGGATTTAAAGGAAAACCCCCAATGGCCCAACCAGTAACGTATCAGGACCTAAGGGTAGAGGAAACCTTACTTGAAGGAAGTCCAATAATCTTGAAACAGGTACCTTTGGAAAATATTTTAGGTGTGGGCAAACTTGATGAGGGGGAAACAACCATCTTGTTCAGTACAAAGAATCCTCAAAGACGCTCCGTGTCCGATCAGTAAATAGATTTGTAATGTTCTTATAGACGTTGGGAATTGAATATTGTAGCTTTATAACTACATTCTAACCTCACCCTAAAGCCCATAGAAGGTAGGGTAGGAATTGAGAAGTGGAGAAACAGCTTTTCACACTCAATTGTGGAGAAGATTTTGTTTGAGCGTTGGTAATTACGCAAAGGCAAAAAGAGACCGGGAAAAAGTTAGAACCAAAAGAACTACATTTTCTTATGGAATTTCTAGCTTAATTTCTGGCTCTGCCTTATTGCACCTGGCGTACTATTAAATAAGTCCTAATGAGTAACAAATAAAGGACAAAAGGCCAAGGTTGAACCCTTGTGCAATCTCGGACCTGGATGAAACTTAGCGCAATTGCATGGATAGCACAAGCATAAACCAAATTACAATGAAATGTACGTTCCTAATCCCTCCGTTATTAGATTTGTTAGATGAAAGAGAAGGTGCCAAGGCCTTCTCAAGGTAGATTTGAAAATGGTCAAATAGATACCGTTTGTTTCACGTTTGTTGGGTATAAATAGGTGATGGCTTTCATTACTAGTGAAGGCATATTTGATTGGATGGTGGTGCCATTTGAGCCATCAGATACCTGAAGCACCTTCATGTGCCTAATGAGCTAGGTTTTGACAAGTGATTTCATTTCAAAGAAAGAGAGACCTGGGAAGAAACTTTAAGAATTACAAGCATGCCTGTCCCTTGAAATGTAAAAAGAGAAGGGGTGTTATTCAAAGTTCCTTTTAGGTATGTTTAAAGATGTTTTATATTCCATTCTTATTAATCTTTAATAGTTCTTACC
TTAGTGCAACTGCGACAAAGATCTTTGGGATTTAACTTAATTTAATTCTCTTTTAGATTTAAAGTTGTTTATCTAAAACATTTCTGGTGTGTTTATCTGAGACATTACCCATTCTCCCATGTTAGAATGTTCTAAAGGTTTACTAATATTAATTCCCCCAAACTCAAATGCAGAAGTTGACCTTCTGTTTACTCTACGAAATTGGTAAAGGAAGCAGTTCTTGGTGGTTCCCTTACTTCAAGCATTTGCCCACGACTTACGAAACACTGGCAACTTTTGGAGAATTTGAAAAGCAAGCCCTGCAGGTTCAATTTTCGTTTTGTGGCTTTTTTTATTTATTTATTTCTTAGGATTATATTCGTATTCATGAACCATATTCAATAATGATTTTTACCTGTGATTTCGATAGTTTTTTTTTTTTTTTTTTATAATAAATAGTTGAAGTCTATTAACAGGTGGATTATGCTCTCTGGGAAGCAGAGAAGGCTGCTTCGAAGTCTCATACGGAGTGGAGAGGAGTTAAAGGACTAATGGAAGAATCTAATATTAAAAACCAACTCCAAACATTCAAGGCATGGCTTTGGGCCTCTGCAACTGTAAGCACACTGGATTCATGCTAGAAGCATGTACTGCATCGTTAGATAAGACCTGGATTTATACAGAGTAGTCAAACCCACATTTTCTTATAAGATTTCAAGTTTTTTTTTTTTTTTCATTCATTTTTGTTTGGTGAAAATTTAAATGCATCATTTAAAGGTGGTTTAGCTGGAGGTCTCGTGTGGGTAAATTATGCAAAACAACTGTCCAAGGTCAGTGCAAATATGTTTTTCTTATAGAAGAAAAGACCACCAAATGACACCCATTTGCTGATTATATGTCCTTCCATTACCCATTACCAGCCTGTAAGCGTCATTAAGAATCCTATTCTATCTAATGGTGCCACCCTGCCTCTAATAGTTCACTGGTTTTCTTCCGACTTATGAAGCGGAAGCTTGCACAAAGGCTTTAGAATCATCCCGAACAATAAGGATACTAGGACGTTCATTCAAGTACGAGTGAATTTTAGTATTCACATCCCATCAAATTTCCCATAGATTTTTGTTGCAACAGAATATAACCCATCCTCTTGTTTATGAATGCCAGTTGTTTGAGAGCTTGTTTTGTAATTTTCAATTAGCCCTTCTTGTGGAGGTTTTAAGTATTGTCCTGTTCTGTACATTATTCGATGACTCACTAAACATGAGAATGTTTTGGTCCGATGTAAATTTTCTCAATCTTAATCTCAGTTTATTTAACTTATTTCGACGTTTAATAGTTCAATACTAAAAATATATGTTTCCTTTGAGAGATATCATCTAGGGCATTGTATGTACCATGGGATGAGGCCGGATGTTTATGTCCAGTTGGTGACCTGTTCAATTATGCTGCACCTGAAGGGGAGTCCCTTGATATTATGGATGTTTCATCATTTTCCCAACATGCTTCTTTGAATGGAAACATAACTACTGATGGGTTACACAAAGAGGAACAAGATACTCAGCGGGCTTTGACAGATGGTGGATTTGAGGAAAATGTTTCTGCATATTGCTTCTATGCCCGGGAAAGTTATAAAAGAGGAGAGCAGGTATTTTATATTTTCTTGATGTCATTTAATATTTTTCACCCATATGCGAAGTGGAGTTGAAATCTTTCTATTGTTTGACGCAGGTTCTTTTAAGCTATGGTACATATTCAAACTTAGAGCTTCTTCAATATTATGGGTTTCTTCTACAGGAAAATCCAAATGACAGAGTTTTCATTCCTTTGGAACATGAAATTTATAGCTCCAGTTCTTGGCCTAAGGAGTCTCTTTTTATTCACCAAAATGGAAACCCATCTTTTGCTCTACTTTCTGCTCTGCGATTATGGGCAACCCACCCGAACAAGCGTAGAGGTGTCGGGCATCTTGCTTATGCCGGGTCTCAACTCTCGGTCAAGAATGAAGTATTAGTCATGCAATG
GTTATCAAAGAACTGCCATGCTGTTTTAAACAATCTGCCAACGTCGGTTGAAGAAGACAATCAGCTTTTGTGCAACATCTGCAAAATCCAGGATTTGCAAGGACCAACGGAGCTCGGGAAGATGCTATTGACTGTCGGAGGCGAGTTTTGTGCTTTCTTGGAGACCTATGGCCTGGTGAATAGAGAGGAAACCGAGTTACACTTAACTGGGAAAATTAAACGTTCTCTGGAGAGATGGAAACTGGCAGTGCAGTGGAGGATCTTGTACAAGAAGGCGTTGGTTGATTGCACAAGTTACTGCACCAGAACTACTTGTTCTCTATCTTCTTGATCTGGTTGGCTGTGATTACCAGGTACTCAAAACCTATCTCTTTTACATTCAAAGAAACTACTTGACATATTTTATGAAAAGAAGTTATTAGATCACTGACGAATGTTGTAACATTGTGTCATGATGAATGTTTAAACATTGAGTGAGTGGTGTCGAGATGAATGTTAACATTGAGTGATTTATGTCACCAAGTTGAACTCTCTATAGATCTTATAAGCGGTCAACTCATGATTAGCGTTTTGACTCGTGGACCGGGTCAATAGTAGTTATAAATCCTTTTCAGTATATCTCAAAATAATACTAATGATAC

mRNA sequence

AATTAATACCTCGACAAAATGGTGAAGAAGCAGAGGCTTAAACCTTGTGTGTAAGGGAGCACAGAGGTTTTGATGGTGGGTTTGTGAAATGGGAACCGAAGGAAGTTTTGAAAGCCTGCTGAGATGGGCGGCGGATCACGGAATTTCAGATTCTGTCGACAAACAGAGTTCACATTCTTGTTTGGGTCGTTCTTTATGCGTCTGTTTCTTCCCTGATGCAGGCGGGAGAGGTTTGGGGGCTGTTCGTCATCTTACCAAAGGAGAATTAGTGCTAAAAGTTCCAAAATCTGTCTTGTTGACGACCCAAAGTTTGTCGTTGCAAGATGAGAAGTTATCCATGGCTCTGAAGAGATACCCATCTCTTTCTTCCACTCAGAAGTTGACCTTCTGTTTACTCTACGAAATTGGTAAAGGAAGCAGTTCTTGGTGGTTCCCTTACTTCAAGCATTTGCCCACGACTTACGAAACACTGGCAACTTTTGGAGAATTTGAAAAGCAAGCCCTGCAGGTGGATTATGCTCTCTGGGAAGCAGAGAAGGCTGCTTCGAAGTCTCATACGGAGTGGAGAGGAGTTAAAGGACTAATGGAAGAATCTAATATTAAAAACCAACTCCAAACATTCAAGGCATGGCTTTGGGCCTCTGCAACTATATCATCTAGGGCATTGTATGTACCATGGGATGAGGCCGGATGTTTATGTCCAGTTGGTGACCTGTTCAATTATGCTGCACCTGAAGGGGAGTCCCTTGATATTATGGATGTTTCATCATTTTCCCAACATGCTTCTTTGAATGGAAACATAACTACTGATGGGTTACACAAAGAGGAACAAGATACTCAGCGGGCTTTGACAGATGGTGGATTTGAGGAAAATGTTTCTGCATATTGCTTCTATGCCCGGGAAAGTTATAAAAGAGGAGAGCAGGTTCTTTTAAGCTATGGTACATATTCAAACTTAGAGCTTCTTCAATATTATGGGTTTCTTCTACAGGAAAATCCAAATGACAGAGTTTTCATTCCTTTGGAACATGAAATTTATAGCTCCAGTTCTTGGCCTAAGGAGTCTCTTTTTATTCACCAAAATGGAAACCCATCTTTTGCTCTACTTTCTGCTCTGCGATTATGGGCAACCCACCCGAACAAGCGTAGAGGTGTCGGGCATCTTGCTTATGCCGGGTCTCAACTCTCGGTCAAGAATGAAGTATTAGTCATGCAATGGTTATCAAAGAACTGCCATGCTGTTTTAAACAATCTGCCAACGTCGGTTGAAGAAGACAATCAGCTTTTGTGCAACATCTGCAAAATCCAGGATTTGCAAGGACCAACGGAGCTCGGGAAGATGCTATTGACTGTCGGAGGCGAGTTTTGTGCTTTCTTGGAGACCTATGGCCTGGTGAATAGAGAGGAAACCGAGTTACACTTAACTGGGAAAATTAAACGTTCTCTGGAGAGATGGAAACTGGCAGTGCAGTGGAGGATCTTGTACAAGAAGGCGTTGGTTGATTGCACAAGTTACTGCACCAGAACTACTTGTTCTCTATCTTCTTGATCTGGTTGGCTGTGATTACCAGGTACTCAAAACCTATCTCTTTTACATTCAAAGAAACTACTTGACATATTTTATGAAAAGAAGTTATTAGATCACTGACGAATGTTGTAACATTGTGTCATGATGAATGTTTAAACATTGAGTGAGTGGTGTCGAGATGAATGTTAACATTGAGTGATTTATGTCACCAAGTTGAACTCTCTATAGATCTTATAAGCGGTCAACTCATGATTAGCGTTTTGACTCGTGGACCGGGTCAATAGTAGTTATAAATCCTTTTCAGTATATCTCAAAATAATACTAATGATAC

Coding sequence (CDS)

ATGGGAACCGAAGGAAGTTTTGAAAGCCTGCTGAGATGGGCGGCGGATCACGGAATTTCAGATTCTGTCGACAAACAGAGTTCACATTCTTGTTTGGGTCGTTCTTTATGCGTCTGTTTCTTCCCTGATGCAGGCGGGAGAGGTTTGGGGGCTGTTCGTCATCTTACCAAAGGAGAATTAGTGCTAAAAGTTCCAAAATCTGTCTTGTTGACGACCCAAAGTTTGTCGTTGCAAGATGAGAAGTTATCCATGGCTCTGAAGAGATACCCATCTCTTTCTTCCACTCAGAAGTTGACCTTCTGTTTACTCTACGAAATTGGTAAAGGAAGCAGTTCTTGGTGGTTCCCTTACTTCAAGCATTTGCCCACGACTTACGAAACACTGGCAACTTTTGGAGAATTTGAAAAGCAAGCCCTGCAGGTGGATTATGCTCTCTGGGAAGCAGAGAAGGCTGCTTCGAAGTCTCATACGGAGTGGAGAGGAGTTAAAGGACTAATGGAAGAATCTAATATTAAAAACCAACTCCAAACATTCAAGGCATGGCTTTGGGCCTCTGCAACTATATCATCTAGGGCATTGTATGTACCATGGGATGAGGCCGGATGTTTATGTCCAGTTGGTGACCTGTTCAATTATGCTGCACCTGAAGGGGAGTCCCTTGATATTATGGATGTTTCATCATTTTCCCAACATGCTTCTTTGAATGGAAACATAACTACTGATGGGTTACACAAAGAGGAACAAGATACTCAGCGGGCTTTGACAGATGGTGGATTTGAGGAAAATGTTTCTGCATATTGCTTCTATGCCCGGGAAAGTTATAAAAGAGGAGAGCAGGTTCTTTTAAGCTATGGTACATATTCAAACTTAGAGCTTCTTCAATATTATGGGTTTCTTCTACAGGAAAATCCAAATGACAGAGTTTTCATTCCTTTGGAACATGAAATTTATAGCTCCAGTTCTTGGCCTAAGGAGTCTCTTTTTATTCACCAAAATGGAAACCCATCTTTTGCTCTACTTTCTGCTCTGCGATTATGGGCAACCCACCCGAACAAGCGTAGAGGTGTCGGGCATCTTGCTTATGCCGGGTCTCAACTCTCGGTCAAGAATGAAGTATTAGTCATGCAATGGTTATCAAAGAACTGCCATGCTGTTTTAAACAATCTGCCAACGTCGGTTGAAGAAGACAATCAGCTTTTGTGCAACATCTGCAAAATCCAGGATTTGCAAGGACCAACGGAGCTCGGGAAGATGCTATTGACTGTCGGAGGCGAGTTTTGTGCTTTCTTGGAGACCTATGGCCTGGTGAATAGAGAGGAAACCGAGTTACACTTAACTGGGAAAATTAAACGTTCTCTGGAGAGATGGAAACTGGCAGTGCAGTGGAGGATCTTGTACAAGAAGGCGTTGGTTGATTGCACAAGTTACTGCACCAGAACTACTTGTTCTCTATCTTCTTGA

Protein sequence

MGTEGSFESLLRWAADHGISDSVDKQSSHSCLGRSLCVCFFPDAGGRGLGAVRHLTKGELVLKVPKSVLLTTQSLSLQDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFPYFKHLPTTYETLATFGEFEKQALQVDYALWEAEKAASKSHTEWRGVKGLMEESNIKNQLQTFKAWLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDIMDVSSFSQHASLNGNITTDGLHKEEQDTQRALTDGGFEENVSAYCFYARESYKRGEQVLLSYGTYSNLELLQYYGFLLQENPNDRVFIPLEHEIYSSSSWPKESLFIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSVKNEVLVMQWLSKNCHAVLNNLPTSVEEDNQLLCNICKIQDLQGPTELGKMLLTVGGEFCAFLETYGLVNREETELHLTGKIKRSLERWKLAVQWRILYKKALVDCTSYCTRTTCSLSS
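
The protein sequence is the conceptual translation of the CDS above. As a hedged illustration, the sketch below translates a nucleotide string with the standard genetic code (NCBI translation table 1) using only the Python standard library. `translate` is a hypothetical helper name, and the choice of the standard nuclear code is an assumption here, since the record does not state which table was used.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated with bases in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        # Codons with ambiguous bases (e.g. N) raise KeyError in this sketch.
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 27 nucleotides of the CDS listed above.
print(translate("ATGGGAACCGAAGGAAGTTTTGAAAGC"))  # MGTEGSFES
```

The output agrees with the first nine residues of the protein sequence shown above.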
BLAST of CmaCh14G008320 vs. Swiss-Prot
Match: SDG40_ARATH (Protein SET DOMAIN GROUP 40 OS=Arabidopsis thaliana GN=SDG40 PE=2 SV=1)

HSP 1 Score: 491.1 bits (1263), Expect = 1.4e-137
Identity = 259/478 (54.18%), Positives = 330/478 (69.04%), Query Frame = 1

Query: 6   SFESLLRWAADHGISDSVDKQSSH-SCLGRSLCVCFFPDAGGRGLGAVRHLTKGELVLKV 65
           + E+ LRWAA+ GISDS+D      SCLG SL V  FPDAGGRGLGA R L KGELVLKV
Sbjct: 7   TMETFLRWAAEIGISDSIDSSRFRDSCLGHSLSVSDFPDAGGRGLGAARELKKGELVLKV 66

Query: 66  PKSVLLTTQSLSLQDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFPYFKHLPTT 125
           P+  L+TT+S+  +D KLS A+  + SLSSTQ L+ CLLYE+ K   S+W+PY  H+P  
Sbjct: 67  PRKALMTTESIIAKDLKLSDAVNLHNSLSSTQILSVCLLYEMSKEKKSFWYPYLFHIPRD 126

Query: 126 YETLATFGEFEKQALQVDYALWEAEKAASKSHTEWRGVKGLMEESNIKNQLQTFKAWLWA 185
           Y+ LATFG FEKQALQV+ A+W  EKA +K  +EW+    LM+E  +K + ++F+AWLWA
Sbjct: 127 YDLLATFGNFEKQALQVEDAVWATEKATAKCQSEWKEAGSLMKELELKPKFRSFQAWLWA 186

Query: 186 SATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDIMDVSSFSQHASLNGNITTDGLH 245
           SATISSR L+VPWD AGCLCPVGDLFNY AP        D S+  Q      N+   GL 
Sbjct: 187 SATISSRTLHVPWDSAGCLCPVGDLFNYDAPG-------DYSNTPQGPESANNVEEAGLV 246

Query: 246 KEEQDTQRALTDGGFEENVSAYCFYARESYKRGEQVLLSYGTYSNLELLQYYGFLLQENP 305
            E    +  LTDGGFEE+V+AYC YAR +Y+ GEQVLL YGTY+NLELL++YGF+L+EN 
Sbjct: 247 VETHSER--LTDGGFEEDVNAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFMLEENS 306

Query: 306 NDRVFIPLEHEIYS-SSSWPKESLFIHQNGNPSFALLSALRLWATHPNKR-RGVGHLAYA 365
           ND+VFIPLE  ++S +SSWPK+SL+IHQ+G  SFAL+S LRLW    ++R + V  L YA
Sbjct: 307 NDKVFIPLETSLFSLASSWPKDSLYIHQDGKLSFALISTLRLWLIPQSQRDKSVMRLVYA 366

Query: 366 GSQLSVKNEVLVMQWLSKNCHAVLNNLPTSVEEDNQLLCNICKIQDLQGPTELGKMLLTV 425
           GSQ+SVKNE+LVM+W+S+ C +VL +LPTSV ED  LL NI K+QD +   E  K     
Sbjct: 367 GSQISVKNEILVMKWMSEKCGSVLRDLPTSVTEDTVLLHNIDKLQDPELRLE-QKETEAF 426

Query: 426 GGEFCAFLET---YGLVNREETELHLTGKIKRSLERWKLAVQWRILYKKALVDCTSYC 478
           G E  AFL+    + +       +  + K  R L +W+ +VQWR+ YK+ L DC SYC
Sbjct: 427 GSEVRAFLDANCLWDVTVLSGKPIEFSRKTSRMLSKWRWSVQWRLSYKRTLADCISYC 474
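
The percentages in each HSP summary line are plain ratios over the alignment length (gaps included): identical positions for Identity, and positions with a positive substitution score for Positives. The sketch below recomputes them from a summary line. `hsp_percentages` is a hypothetical helper name, and the regular expression is an assumption about this report's layout rather than a general BLAST parser.

```python
import re

# Captures "Identity = 259/478" and the positives field; "Pos\w*" tolerates
# spelling variants of that label.
FRACTION_RE = re.compile(r"(Identity|Pos\w*)\s*=\s*(\d+)/(\d+)")


def hsp_percentages(summary):
    """Recompute identity and positives percentages from an HSP summary line."""
    result = {}
    for label, matched, length in FRACTION_RE.findall(summary):
        key = "identity" if label == "Identity" else "positives"
        result[key] = round(100 * int(matched) / int(length), 2)
    return result


line = "Identity = 259/478 (54.18%), Positives = 330/478 (69.04%), Query Frame = 1"
print(hsp_percentages(line))  # {'identity': 54.18, 'positives': 69.04}
```

Recomputing the ratios this way is a quick consistency check when the reported percentages have been copied or reformatted.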

BLAST of CmaCh14G008320 vs. Swiss-Prot
Match: SETD3_DANRE (Histone-lysine N-methyltransferase setd3 OS=Danio rerio GN=setd3 PE=1 SV=1)

HSP 1 Score: 74.7 bits (182), Expect = 3.1e-12
Identity = 100/425 (23.53%), Positives = 165/425 (38.82%), Query Frame = 1

Query: 4   EGSFESLLRWAADHGISDSVDKQSSHSCLGRSLCVCFFPDAGGRGLGAVRHLTKGELVLK 63
           E  F  L+ WAA+             SC G    +  F D G  GL A + +   EL L 
Sbjct: 76  EDFFSELMAWAAE----------CRASCDGFE--ISNFADEG-YGLKATKDIKAEELFLW 135

Query: 64  VPKSVLLTTQSLSLQDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFPYFKHLPT 123
           +P+ +L+T +S          +  R         L   LL E    SS W  PY K LP+
Sbjct: 136 IPRKMLMTVESAKNSVLGPLYSQDRILQAMGNVTLALHLLCERANPSSPW-LPYIKTLPS 195

Query: 124 TYETLATFGEFEKQALQVDYALWEAEKAASKSHTEWRGVKGLMEESNIKNQLQ-----TF 183
            Y+T   F E E + L    A+ +       +  ++     ++      ++L      TF
Sbjct: 196 EYDTPLYFEEEEVRHLLATQAIQDVLSQYKNTARQYAYFYKVIHTHPNASKLPLKDAFTF 255

Query: 184 KAWLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDIMDVSSFSQHASLNGNI 243
             + WA +++ +R   +P                   +G  + +  +  +      NG I
Sbjct: 256 DDYRWAVSSVMTRQNQIP-----------------TADGSRVTLALIPLWDMCNHTNGLI 315

Query: 244 TTDGLHKEEQDTQRALTDGGFEENVSAYCFYARESYKRGEQVLLSYGTYSNLELLQYYGF 303
           TT    ++++    AL D                 YK GEQ+ + YGT SN E + + GF
Sbjct: 316 TTGYNLEDDRCECVALKD-----------------YKEGEQIYIFYGTRSNAEFVIHNGF 375

Query: 304 LLQENPNDRVFIPL-----------EHEIYSSSSWPKESLF-IHQNGNP-SFALLSALRL 363
             ++N +DRV I L           + E+ + +  P  S+F +H +  P S  LL+ LR+
Sbjct: 376 FFEDNAHDRVKIKLGVSKGERLYAMKAEVLARAGIPASSIFALHCSEPPISAQLLAFLRV 435

Query: 364 WATHPNKRRG----------VGHLAYAGSQLSVKNEVLVMQWLSKNCHAVLNNLPTSVEE 401
           +     + R           +  L      +S +NE+ +  +L      +L    T+ EE
Sbjct: 436 FCMTEEELRDYLVGDHAINKIFTLGNTEFPVSWENEIKLWTFLETRAALLLKTYKTASEE 452

BLAST of CmaCh14G008320 vs. Swiss-Prot
Match: SETD4_HUMAN (SET domain-containing protein 4 OS=Homo sapiens GN=SETD4 PE=2 SV=1)

HSP 1 Score: 67.4 bits (163), Expect = 4.9e-10
Identity = 89/398 (22.36%), Positives = 156/398 (39.20%), Query Frame = 1

Query: 35  SLCVCFFPDAGGRGLGAVRHLTKGELVLKVPKSVLLTTQSLSLQDEKLSMALKRYPSLSS 94
           +L    FP  G RGL +   L +G++++ +P+S LLTT ++ ++    +   K  P  S 
Sbjct: 49  NLAPACFPGTG-RGLMSQTSLQEGQMIISLPESCLLTTDTV-IRSYLGAYITKWKPPPSP 108

Query: 95  TQKLTFCLLYEIGKGSSSWWFPYFKHLPTTYETLATFGEFEKQALQVDYALWEAEKAASK 154
              L   L+ E   G  S W PY + LP  Y         E + + +     +A+    +
Sbjct: 109 LLALCTFLVSEKHAGHRSLWKPYLEILPKAYTCPVCL---EPEVVNLLPKSLKAKAEEQR 168

Query: 155 SHTE---------WRGVKGLMEESNIKNQLQTFKAWLWASATISSRALYVPWDEAGCLCP 214
           +H +         +  ++ L  E+   + + ++ A LWA  T+++RA+Y+   +  CL  
Sbjct: 169 AHVQEFFASSRDFFSSLQPLFAEA--VDSIFSYSALLWAWCTVNTRAVYLRPRQRECLSA 228

Query: 215 VGDLFNYAAPEGESLDIMDVSSFSQHASLNGNITTDGLHKEEQDTQRALTDGGFEENVSA 274
             D    A         +D+ + S H  +                        F E   +
Sbjct: 229 EPDTCALAP-------YLDLLNHSPHVQVKA---------------------AFNEETHS 288

Query: 275 YCFYARESYKRGEQVLLSYGTYSNLELLQYYGFLLQENPNDRVFIPLEHEIYSSSSWPK- 334
           Y       +++ E+V + YG + N  L   YGF+   NP+  V++  E  +    S  K 
Sbjct: 289 YEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQ 348

Query: 335 --------------ESLFIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSVKN 394
                         E+L    +G PS+ LL+AL+L      K          G  +S  N
Sbjct: 349 MDKKISILKDHGYIENLTFGWDG-PSWRLLTALKLLCLEAEKFT-CWKKVLLGEVISDTN 402

Query: 395 EVLVMQWLSKNCHAVLNNLPTSVEEDNQLLCNICKIQD 409
           E   +    K C+         +EE N +L  +  ++D
Sbjct: 409 EKTSLDIAQKICYYF-------IEETNAVLQKVSHMKD 402

BLAST of CmaCh14G008320 vs. Swiss-Prot
Match: SETD4_MOUSE (SET domain-containing protein 4 OS=Mus musculus GN=Setd4 PE=2 SV=1)

HSP 1 Score: 58.5 bits (140), Expect = 2.3e-07
Identity = 76/323 (23.53%), Positives = 132/323 (40.87%), Query Frame = 1

Query: 46  GRGLGAVRHLTKGELVLKVPKSVLLTTQSLSLQDEKLSMALKRY-PSLSSTQKLTFCLLY 105
           GRGL +   L +G++++ +P+S LLTT ++      L   +K++ P +S    L   L+ 
Sbjct: 58  GRGLMSKASLQEGQVMISLPESCLLTTDTVIRSS--LGPYIKKWKPPVSPLLALCTFLVS 117

Query: 106 EIGKGSSSWWFPYFKHLPTTYETLATFGEFEKQALQVDYALWEAEKAASKSHTEWRGVKG 165
           E   G  S W  Y   LP +Y T     E E   L       +AE+  ++    +   +G
Sbjct: 118 EKHAGCRSLWKSYLDILPKSY-TCPVCLEPEVVDLLPSPLKAKAEEQRARVQDLFTSARG 177

Query: 166 LMEE-----SNIKNQLQTFKAWLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGES 225
                    +   + + +++A+LWA  T+++RA+Y+      CL    D    A      
Sbjct: 178 FFSTLQPLFAEPVDSVFSYRAFLWAWCTVNTRAVYLRSRRQECLSAEPDTCALAP----- 237

Query: 226 LDIMDVSSFSQHASLNGNITTDGLHKEEQDTQRALTDGGFEENVSAYCFYARESYKRGEQ 285
              +D+ + S H  +                        F E    Y        ++ ++
Sbjct: 238 --FLDLLNHSPHVQVKA---------------------AFNEKTRCYEIRTASRCRKHQE 297

Query: 286 VLLSYGTYSNLELLQYYGFLLQENPNDRV---------FIP-LEHEIYSSSSWPKESLFI 345
           V + YG + N  LL  YGF+   NP+  V         F+P  + +++   +  K+  F 
Sbjct: 298 VFICYGPHDNQRLLLEYGFVSVRNPHACVPVSADMLVKFLPAADKQLHRKITILKDHGF- 346

BLAST of CmaCh14G008320 vs. TrEMBL
Match: A0A0A0L7L4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G307670 PE=4 SV=1)

HSP 1 Score: 806.6 bits (2082), Expect = 1.7e-230
Identity = 388/473 (82.03%), Positives = 424/473 (89.64%), Query Frame = 1

Query: 1   MGTEGSFESLLRWAADHGISDSVDKQSSHSCLGRSLCVCFFPDAGGRGLGAVRHLTKGEL 60
           M TEGS  SLLRWAADHGISDSVD+ +SHSCLG SLCV FFPD GGRGL AVR L KGEL
Sbjct: 1   METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL 60

Query: 61  VLKVPKSVLLTTQSLSLQDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFPYFKH 120
           VL+ PKS+LLTTQSLSL+DEKL MALKRYPSLSSTQKLTFCLLYEI KG SSWWFPY KH
Sbjct: 61  VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH 120

Query: 121 LPTTYETLATFGEFEKQALQVDYALWEAEKAASKSHTEWRGVKGLMEESNIKNQLQTFKA 180
           LP +Y+ LATFGEFEKQALQVDYA+W  EKAA KS T+WRGV+GLM+ESNIK+QLQTFKA
Sbjct: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA 180

Query: 181 WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDIMDVSSFSQHASLNGNITT 240
           WLWASATISSR LYVPWDEAGCLCPVGDLFNYAAPEGES + +DV SF  HASLN  +  
Sbjct: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELE- 240

Query: 241 DGLHKEEQDTQRALTDGGFEENVSAYCFYARESYKRGEQVLLSYGTYSNLELLQYYGFLL 300
             L +E++D+Q ALTDGGFEEN SAYCFYARESY++GEQVLLSYGTY+NLELL+YYGFLL
Sbjct: 241 --LLEEQRDSQWALTDGGFEENASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYGFLL 300

Query: 301 QENPNDRVFIPLEHEIYSSSSWPKESLFIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360
           QENPND+VFIP+EH+IY SSSWPKESL+IHQNGNPSFALLSALRLWATHPNKRRGVGHLA
Sbjct: 301 QENPNDKVFIPIEHDIYGSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360

Query: 361 YAGSQLSVKNEVLVMQWLSKNCHAVLNNLPTSVEEDNQLLCNICKIQDLQGPTELGKMLL 420
           YAGSQLSVKNE+LVMQWLSKNCH VLNNLPTS+EEDNQLLCNI K+QDLQ P EL K LL
Sbjct: 361 YAGSQLSVKNEILVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLL 420

Query: 421 TVGGEFCAFLETYGLVNREETELHLTGKIKRSLERWKLAVQWRILYKKALVDC 474
           T GGEFCAFLET G+VNR+E E H + K+KRSL+RWKLAVQWR+LYKKALVDC
Sbjct: 421 TYGGEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDC 470

BLAST of CmaCh14G008320 vs. TrEMBL
Match: A0A067KHN9_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_10483 PE=4 SV=1)

HSP 1 Score: 575.9 bits (1483), Expect = 4.8e-161
Identity = 294/488 (60.25%), Positives = 361/488 (73.98%), Query Frame = 1

Query: 8   ESLLRWAADHGISDS---VDKQSSHSCLGRSLCVCFFPDAGGRGLGAVRHLTKGELVLKV 67
           E  L WAA+ GISDS      ++ +SC G SL +  FPDAGGRGLGA R L KGELVL+V
Sbjct: 11  EGFLEWAAELGISDSPYNFQSRNPNSCFGNSLTLSHFPDAGGRGLGAARDLWKGELVLRV 70

Query: 68  PKSVLLTTQSLSLQDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFPYFKHLPTT 127
           PK  LLT  SL L+D  LS  +  +PSLS TQ LT CLLYE+GKG SS+W+PY  HLP +
Sbjct: 71  PKPALLTRDSL-LKDGLLSSFVNGHPSLSPTQILTVCLLYEMGKGKSSFWYPYLMHLPRS 130

Query: 128 YETLATFGEFEKQALQVDYALWEAEKAASKSHTEWRGVKGLMEESNIKNQLQTFKAWLWA 187
           YETLATF EFEKQA QVD A+W  EKA SK+ +EW+    LM+E  +K +  T +AW+WA
Sbjct: 131 YETLATFSEFEKQAFQVDDAVWTTEKAISKAESEWKEANLLMQELKLKPRFLTLRAWIWA 190

Query: 188 SATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDIMDVSSFSQHASLNGNIT----T 247
           SATISSR L++PWDEAGCLCPVGDLFNYAAP  ES  +    S   ++S  G+++    T
Sbjct: 191 SATISSRTLHIPWDEAGCLCPVGDLFNYAAPGEESTGLESAESCMLNSSPQGSLSCGHPT 250

Query: 248 DGLHKEEQDTQ-RALTDGGFEENVSAYCFYARESYKRGEQVLLSYGTYSNLELLQYYGFL 307
           D L++   D   + LTDGGF+E++ AYCFYAR++YK+GEQVLLSYGTY+NLELL++YGF+
Sbjct: 251 DYLYEGRFDAHLQRLTDGGFDEDLDAYCFYARKNYKKGEQVLLSYGTYTNLELLEHYGFV 310

Query: 308 LQENPNDRVFIPLEHEIYSSSSWPKESLFIHQNGNPSFALLSALRLWATHPNKRRGVGHL 367
           L ENPND+VFIPLE  +YSS+SWPKES++IHQ+G PSFALLSALRLWAT PN+RR VGHL
Sbjct: 311 LDENPNDKVFIPLEPSMYSSNSWPKESMYIHQDGKPSFALLSALRLWATPPNQRRSVGHL 370

Query: 368 AYAGSQLSVKNEVLVMQWLSKNCHAVLNNLPTSVEEDNQLLCNICKIQDLQGPTELGKML 427
           AY+GSQLSV+NE  V++W+SK+CH +LNNLPT VEED+ LL  I KIQ+L  P ELG+ML
Sbjct: 371 AYSGSQLSVENETWVLKWISKSCHEILNNLPTKVEEDHLLLSTIDKIQNLYNPMELGQML 430

Query: 428 LTVGGEFCAFLETYGL-VNREETELHLTGKIKRSLERWKLAVQWRILYKKALVDCTSYCT 487
               GEF  FLE   +   +   EL L+ K K+++ERWKLAVQWR  YKK +VDC S CT
Sbjct: 431 CQFKGEFRDFLEASSIGKGKNGDELMLSSKTKQAIERWKLAVQWRFRYKKIVVDCISSCT 490

BLAST of CmaCh14G008320 vs. TrEMBL
Match: A0A061EFC1_THECC (SET domain group 40, putative isoform 1 OS=Theobroma cacao GN=TCM_017553 PE=4 SV=1)

HSP 1 Score: 562.0 bits (1447), Expect = 7.1e-157
Identity = 285/482 (59.13%), Positives = 354/482 (73.44%), Query Frame = 1

Query: 5   GSFESLLRWAADHGISDSVDKQSSHSCLGRSLCVCFFPDAGGRGLGAVRHLTKGELVLKV 64
           GS +S L+WAA  G+SDS +  S  SCLG SL V +FPDAGGRGLGAVR +T+GEL+LKV
Sbjct: 27  GSLDSFLKWAAGLGVSDSPNPDSC-SCLGHSLGVSYFPDAGGRGLGAVRDITRGELLLKV 86

Query: 65  PKSVLLTTQSLSLQDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFPYFKHLPTT 124
           PKS L+TT SL L DE+LS ALK +PSLS  Q LT C LYE+ KG +S W PY  HLP +
Sbjct: 87  PKSALITTHSL-LNDERLSTALKAHPSLSPAQVLTICFLYEMSKGKASPWHPYLLHLPRS 146

Query: 125 YETLATFGEFEKQALQVDYALWEAEKAASKSHTEWRGVKGLMEESNIKNQLQTFKAWLWA 184
           Y  LA FGEFEKQALQVDYA+W A+KA SK+  EW+    LM+E  +K Q  TF+AW+WA
Sbjct: 147 YGILAAFGEFEKQALQVDYAIWAAQKALSKAEYEWKKATPLMKELKLKLQFLTFRAWIWA 206

Query: 185 SATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDIMDVSSFSQHASLNGNITTDGLH 244
           + TISSR L++PWDEAGCLCPVGDLFNYAAP GE     D++ F    +L      D L 
Sbjct: 207 TGTISSRTLHIPWDEAGCLCPVGDLFNYAAP-GE-----DLNGFDNVDNLQNGYALDDL- 266

Query: 245 KEEQDTQRALTDGGFEENVSAYCFYARESYKRGEQVLLSYGTYSNLELLQYYGFLLQENP 304
            + Q +QR LTDG FEE+ +AYCFYA+ +YK+GEQVLLSYGTY+NLELL+YYGFLL++NP
Sbjct: 267 -DTQHSQR-LTDGAFEEDAAAYCFYAKTNYKKGEQVLLSYGTYTNLELLEYYGFLLEDNP 326

Query: 305 NDRVFIPLEHEIYSSSSWPKESLFIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGS 364
           N++VFIPLE +I+SSSSWP +SL+IHQNG PSFAL++ALR+WAT P +R+ + H AY+GS
Sbjct: 327 NEKVFIPLEPDIHSSSSWPNDSLYIHQNGRPSFALMAALRVWATPPYQRKSIRHQAYSGS 386

Query: 365 QLSVKNEVLVMQWLSKNCHAVLNNLPTSVEEDNQLLCNICKIQDLQGPTELGKMLLTVGG 424
           QLS  NE+ VM W++K CHA L  +PTS+E+DN LL    KIQ+     E GK +   GG
Sbjct: 387 QLSQDNEISVMTWIAKKCHATLKAMPTSIEDDNLLLSFTDKIQEFDNLWEWGKAMPAFGG 446

Query: 425 EFCAFLETYGLVNREETELHLTGKIKRSLERWKLAVQWRILYKKALVDCTSYCTRTTCSL 484
           EFC  L+   L   +E+    + + K  ++RWKLAV WR++YKK LVDC SYCT T  SL
Sbjct: 447 EFCNLLQATNLKRNDES--FASRRAKMLIDRWKLAVHWRLIYKKVLVDCISYCTDTINSL 495

Query: 485 SS 487
           SS
Sbjct: 507 SS 495

BLAST of CmaCh14G008320 vs. TrEMBL
Match: V4SX96_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10011537mg PE=4 SV=1)

HSP 1 Score: 560.5 bits (1443), Expect = 2.1e-156
Identity = 292/482 (60.58%), Positives = 350/482 (72.61%), Query Frame = 1

Query: 6   SFESLLRWAADHGISDSVDKQSSHS--CLGRSLCVCFFPDAGGRGLGAVRHLTKGELVLK 65
           S E LL+WAA+ GI+DS  +  S S  CLG SL V  FP+AGGRGL A R LTKGEL+L+
Sbjct: 7   SLEKLLKWAAEMGITDSTIQNPSRSRNCLGHSLTVSHFPEAGGRGLAAARDLTKGELILR 66

Query: 66  VPKSVLLTTQSLSLQDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFPYFKHLPT 125
           VPK+ L TT+ L   D+K S+A+ R+  LS +Q L  CLLYE+GKG SS W+ Y   LP 
Sbjct: 67  VPKTALFTTECLLKSDQKRSLAVNRHLFLSPSQILIVCLLYEVGKGKSSRWYTYLMLLPR 126

Query: 126 TYETLATFGEFEKQALQVDYALWEAEKAASKSHTEWRGVKGLMEESNIKNQLQTFKAWLW 185
            YE LATFG FEKQALQVD A+W AEKA SK+ +EW+    LMEE  +K QL +FKAWLW
Sbjct: 127 CYEILATFGPFEKQALQVDDAIWAAEKAVSKAESEWKQAIKLMEELKLKPQLLSFKAWLW 186

Query: 186 ASATISSRALYVPWDEAGCLCPVGDLFNYAAP-EGE--SLDIMDVSSFSQHASLNGNITT 245
           ASAT+SSR +++ WDEAGCLCPVGDLFNYAAP EGE  ++ I DV  +     L    TT
Sbjct: 187 ASATVSSRTMHISWDEAGCLCPVGDLFNYAAPGEGEESNIGIEDVEGWMPAPCLPKGDTT 246

Query: 246 DGLHKEEQDTQ-RALTDGGFEENVSAYCFYARESYKRGEQVLLSYGTYSNLELLQYYGFL 305
           D L  E+ +   R LTDG FEE+V++YCFYAR +YKRGEQVLLSYGTY+NLELL++YGFL
Sbjct: 247 DVLDSEKFNGHLRRLTDGRFEEDVNSYCFYARNNYKRGEQVLLSYGTYTNLELLEHYGFL 306

Query: 306 LQENPNDRVFIPLEHEIYSSSSWPKESLFIHQNGNPSFALLSALRLWATHPNKRRGVGHL 365
           L ENPND+VFI LE  +YS  SWP+ES +I QNG PSFALLSALRLW T  N+RR VGHL
Sbjct: 307 LNENPNDKVFISLEPGMYSCCSWPRESQYIDQNGKPSFALLSALRLWMTPANQRRSVGHL 366

Query: 366 AYAGSQLSVKNEVLVMQWLSKNCHAVLNNLPTSVEEDNQLLCNICKIQDLQGPTELGKML 425
           AY+G QLSV NE+ VM+WLS N   +LN+LPTS EED  LLC I KIQD+    EL K+L
Sbjct: 367 AYSGHQLSVDNEISVMKWLSNNSRVMLNSLPTSKEEDALLLCAIDKIQDIYTAMELKKVL 426

Query: 426 LTVGGEFCAFLETYGLVNREE-TELHLTGKIKRSLERWKLAVQWRILYKKALVDCTSYCT 481
              GGE C FLE YG+  R+   +L L+ K K S++RWKLA+QWR+ YKK L DC SYC 
Sbjct: 427 SDFGGEVCTFLENYGVQCRQRGAKLSLSRKTKLSMQRWKLAIQWRLRYKKTLADCISYCD 486

BLAST of CmaCh14G008320 vs. TrEMBL
Match: B9T3H1_RICCO (Protein SET DOMAIN GROUP, putative OS=Ricinus communis GN=RCOM_1123320 PE=4 SV=1)

HSP 1 Score: 553.5 bits (1425), Expect = 2.5e-154
Identity = 287/491 (58.45%), Positives = 356/491 (72.51%), Query Frame = 1

Query: 8   ESLLRWAA-DHGISDSVDKQSS----HSCLGRSLCVCFFPDAGGRGLGAVRHLTKGELVL 67
           E  L+WAA + GISDS +   S    +SCLG SL V  FPDAGGRGLGA R L KGELVL
Sbjct: 11  EGFLKWAAAELGISDSSNSSQSLEEPNSCLGISLTVSHFPDAGGRGLGAARDLKKGELVL 70

Query: 68  KVPKSVLLTTQSLSLQDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFPYFKHLP 127
           +VPKS LLT  S  L+D  L  A+  + +LS TQ LT CLLYE+ KG SS+W+PY  HLP
Sbjct: 71  RVPKSALLTKDSF-LKDGLLLSAINNHSALSPTQTLTVCLLYEMSKGQSSFWYPYLMHLP 130

Query: 128 TTYETLATFGEFEKQALQVDYALWEAEKAASKSHTEWRGVKGLMEESNIKNQLQTFKAWL 187
            +YE LATF EFEKQALQVD A+W AEKA SK+  + +    LM+E  +K Q  T +AW+
Sbjct: 131 RSYEILATFSEFEKQALQVDDAIWTAEKAISKAELDRKEAYSLMQELRLKPQFLTLRAWI 190

Query: 188 WASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDIMDVSSFS-----QHASLNGN 247
           WA ATISSR +++PWDEAGCLCPVGD FNYAAP  ES    +  S+      + ASL+  
Sbjct: 191 WACATISSRTMHIPWDEAGCLCPVGDFFNYAAPGEESSSPENDESWKPASCLEDASLSSE 250

Query: 248 ITTDGLHKEEQDTQ-RALTDGGFEENVSAYCFYARESYKRGEQVLLSYGTYSNLELLQYY 307
            +T     E  D Q ++LTDGGF+E+ +AYCFYAR++YK+G QVLLSYGTY+NLELL++Y
Sbjct: 251 RSTSNFCSETFDVQLKSLTDGGFDEDKAAYCFYARQNYKKGAQVLLSYGTYTNLELLEHY 310

Query: 308 GFLLQENPNDRVFIPLEHEIYSSSSWPKESLFIHQNGNPSFALLSALRLWATHPNKRRGV 367
           GFLL ENPND+VFIPLE  + SS++WPKES++IHQ+G PSF+LL ALRLWAT  N+RR +
Sbjct: 311 GFLLNENPNDKVFIPLELSMQSSNTWPKESMYIHQDGKPSFSLLCALRLWATPSNRRRSM 370

Query: 368 GHLAYAGSQLSVKNEVLVMQWLSKNCHAVLNNLPTSVEEDNQLLCNICKIQDLQGPTELG 427
           GHLAY+GSQLSV+NEV +++W+S+ CHAVL  LPT+VEED+ LL  I KIQ+   P ELG
Sbjct: 371 GHLAYSGSQLSVENEVSILKWISRKCHAVLKKLPTTVEEDSLLLSAIDKIQNCHSPLELG 430

Query: 428 KMLLTVGGEFCAFLETYGLVNRE--ETELHLTGKIKRSLERWKLAVQWRILYKKALVDCT 486
           KML    G+  AF+E + L+N +       L GK KRS+ERWKLAV+WR+ YKK L+DC 
Sbjct: 431 KMLHGFEGQASAFVEAHNLLNIKIGTESTMLCGKAKRSMERWKLAVKWRLSYKKTLIDCI 490

BLAST of CmaCh14G008320 vs. TAIR10
Match: AT5G17240.1 (AT5G17240.1 SET domain group 40)

HSP 1 Score: 491.1 bits (1263), Expect = 7.8e-139
Identity = 259/478 (54.18%), Positives = 330/478 (69.04%), Query Frame = 1

Query: 6   SFESLLRWAADHGISDSVDKQSSH-SCLGRSLCVCFFPDAGGRGLGAVRHLTKGELVLKV 65
           + E+ LRWAA+ GISDS+D      SCLG SL V  FPDAGGRGLGA R L KGELVLKV
Sbjct: 7   TMETFLRWAAEIGISDSIDSSRFRDSCLGHSLSVSDFPDAGGRGLGAARELKKGELVLKV 66

Query: 66  PKSVLLTTQSLSLQDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFPYFKHLPTT 125
           P+  L+TT+S+  +D KLS A+  + SLSSTQ L+ CLLYE+ K   S+W+PY  H+P  
Sbjct: 67  PRKALMTTESIIAKDLKLSDAVNLHNSLSSTQILSVCLLYEMSKEKKSFWYPYLFHIPRD 126

Query: 126 YETLATFGEFEKQALQVDYALWEAEKAASKSHTEWRGVKGLMEESNIKNQLQTFKAWLWA 185
           Y+ LATFG FEKQALQV+ A+W  EKA +K  +EW+    LM+E  +K + ++F+AWLWA
Sbjct: 127 YDLLATFGNFEKQALQVEDAVWATEKATAKCQSEWKEAGSLMKELELKPKFRSFQAWLWA 186

Query: 186 SATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDIMDVSSFSQHASLNGNITTDGLH 245
           SATISSR L+VPWD AGCLCPVGDLFNY AP        D S+  Q      N+   GL 
Sbjct: 187 SATISSRTLHVPWDSAGCLCPVGDLFNYDAPG-------DYSNTPQGPESANNVEEAGLV 246

Query: 246 KEEQDTQRALTDGGFEENVSAYCFYARESYKRGEQVLLSYGTYSNLELLQYYGFLLQENP 305
            E    +  LTDGGFEE+V+AYC YAR +Y+ GEQVLL YGTY+NLELL++YGF+L+EN 
Sbjct: 247 VETHSER--LTDGGFEEDVNAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFMLEENS 306

Query: 306 NDRVFIPLEHEIYS-SSSWPKESLFIHQNGNPSFALLSALRLWATHPNKR-RGVGHLAYA 365
           ND+VFIPLE  ++S +SSWPK+SL+IHQ+G  SFAL+S LRLW    ++R + V  L YA
Sbjct: 307 NDKVFIPLETSLFSLASSWPKDSLYIHQDGKLSFALISTLRLWLIPQSQRDKSVMRLVYA 366

Query: 366 GSQLSVKNEVLVMQWLSKNCHAVLNNLPTSVEEDNQLLCNICKIQDLQGPTELGKMLLTV 425
           GSQ+SVKNE+LVM+W+S+ C +VL +LPTSV ED  LL NI K+QD +   E  K     
Sbjct: 367 GSQISVKNEILVMKWMSEKCGSVLRDLPTSVTEDTVLLHNIDKLQDPELRLE-QKETEAF 426

Query: 426 GGEFCAFLET---YGLVNREETELHLTGKIKRSLERWKLAVQWRILYKKALVDCTSYC 478
           G E  AFL+    + +       +  + K  R L +W+ +VQWR+ YK+ L DC SYC
Sbjct: 427 GSEVRAFLDANCLWDVTVLSGKPIEFSRKTSRMLSKWRWSVQWRLSYKRTLADCISYC 474

BLAST of CmaCh14G008320 vs. TAIR10
Match: AT3G07670.1 (AT3G07670.1 Rubisco methyltransferase family protein)

HSP 1 Score: 56.6 bits (135), Expect = 4.9e-08
Identity = 95/401 (23.69%), Positives = 154/401 (38.40%), Query Frame = 1

Query: 43  DAGGRGLGAVRHLTKGELVLKVPKSVLLTTQSLSLQDEKLSMALKRYPSLSSTQKLTFCL 102
           D G RGL A ++L KGE +L VP S++++  S    + +    +KRY  +     L   L
Sbjct: 97  DIGERGLVASQNLRKGEKLLFVPPSLVISADS-EWTNAEAGEVMKRY-DVPDWPLLATYL 156

Query: 103 LYEIGKGSSSWWFPYFKHLPTTYETLATFGEFEKQALQVDYALWEAEKAASKSHTEWRGV 162
           + E     SS WF Y   LP    +L  +   E     +D  L EA +   ++      V
Sbjct: 157 ISEASLQKSSRWFNYISALPRQPYSLLYWTRTE-----LDMYL-EASQIRERAIERITNV 216

Query: 163 KGLMEESNIK----------NQLQTFKAWLWASATISSRALYVP-WDEAGCLCPVGDLFN 222
            G  E+   +           ++   + + W+   + SR + +P  D    L P  D+ N
Sbjct: 217 VGTYEDLRSRIFSKHPQLFPKEVFNDETFKWSFGILFSRLVRLPSMDGRFALVPWADMLN 276

Query: 223 YAAPEGESLDIMDVSSFSQHASLNGNITTDGLHKEEQDTQRALTDGGFEENVSAYCFYAR 282
           +       LD          +S     TTD                              
Sbjct: 277 HNCEVETFLDY-------DKSSKGVVFTTDR----------------------------- 336

Query: 283 ESYKRGEQVLLSYGTYSNLELLQYYGFLLQE--NPNDRVFIPLEHEIYSSSSWPK-ESLF 342
             Y+ GEQV +SYG  SN ELL  YGF+ +E  NP+D V + L           K ++L 
Sbjct: 337 -PYQPGEQVFISYGNKSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYEEKLDALK 396

Query: 343 IHQNGNPS----------FALLSALRLWATHPNKRRGVGHLAYAGS-QLSVKNEVLVMQW 402
            H    P             L++   L  + P+ R     +A A S + S KN++   + 
Sbjct: 397 KHGLSTPQCFPVRITGWPMELMAYAYLVVSPPDMRNNFEEMAKAASNKTSTKNDLKYPEI 452

Query: 403 LSKNCHAVLNNLPTSVEEDNQLLCNICKIQ-DLQGPTELGK 418
                  +L++  TS+ + ++ L     +  D+  P +L +
Sbjct: 457 EEDALQFILDSCETSISKYSRFLKESGSMDLDITSPKQLNR 452

BLAST of CmaCh14G008320 vs. NCBI nr
Match: gi|449456212|ref|XP_004145844.1| (PREDICTED: protein SET DOMAIN GROUP 40 [Cucumis sativus])

HSP 1 Score: 823.2 bits (2125), Expect = 2.5e-235
Identity = 397/486 (81.69%), Positives = 433/486 (89.09%), Query Frame = 1

Query: 1   MGTEGSFESLLRWAADHGISDSVDKQSSHSCLGRSLCVCFFPDAGGRGLGAVRHLTKGEL 60
           M TEGS  SLLRWAADHGISDSVD+ +SHSCLG SLCV FFPD GGRGL AVR L KGEL
Sbjct: 1   METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL 60

Query: 61  VLKVPKSVLLTTQSLSLQDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFPYFKH 120
           VL+ PKS+LLTTQSLSL+DEKL MALKRYPSLSSTQKLTFCLLYEI KG SSWWFPY KH
Sbjct: 61  VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH 120

Query: 121 LPTTYETLATFGEFEKQALQVDYALWEAEKAASKSHTEWRGVKGLMEESNIKNQLQTFKA 180
           LP +Y+ LATFGEFEKQALQVDYA+W  EKAA KS T+WRGV+GLM+ESNIK+QLQTFKA
Sbjct: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA 180

Query: 181 WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDIMDVSSFSQHASLNGNITT 240
           WLWASATISSR LYVPWDEAGCLCPVGDLFNYAAPEGES + +DV SF  HASLN  +  
Sbjct: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELE- 240

Query: 241 DGLHKEEQDTQRALTDGGFEENVSAYCFYARESYKRGEQVLLSYGTYSNLELLQYYGFLL 300
             L +E++D+Q ALTDGGFEEN SAYCFYARESY++GEQVLLSYGTY+NLELL+YYGFLL
Sbjct: 241 --LLEEQRDSQWALTDGGFEENASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYGFLL 300

Query: 301 QENPNDRVFIPLEHEIYSSSSWPKESLFIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360
           QENPND+VFIP+EH+IY SSSWPKESL+IHQNGNPSFALLSALRLWATHPNKRRGVGHLA
Sbjct: 301 QENPNDKVFIPIEHDIYGSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360

Query: 361 YAGSQLSVKNEVLVMQWLSKNCHAVLNNLPTSVEEDNQLLCNICKIQDLQGPTELGKMLL 420
           YAGSQLSVKNE+LVMQWLSKNCH VLNNLPTS+EEDNQLLCNI K+QDLQ P EL K LL
Sbjct: 361 YAGSQLSVKNEILVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLL 420

Query: 421 TVGGEFCAFLETYGLVNREETELHLTGKIKRSLERWKLAVQWRILYKKALVDCTSYCTRT 480
           T GGEFCAFLET G+VNR+E E H + K+KRSL+RWKLAVQWR+LYKKALVDC  YCT T
Sbjct: 421 TYGGEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGYCTTT 480

Query: 481 TCSLSS 487
            CSLSS
Sbjct: 481 ICSLSS 483
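
The percentages in each HSP header are the match count divided by the alignment length. A minimal sketch for recomputing them from a header line follows; the function name `hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool, and the pattern accepts either spelling of the positives label.

```python
import re

# Matches fields such as "Identity = 397/486"; the optional "i" lets the
# pattern accept both "Positives" and the "Postives" spelling.
FIELD = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")

def hsp_percentages(line):
    """Recompute percentages from the matched/length counts in an HSP header."""
    result = {}
    for name, matched, length in FIELD.findall(line):
        key = "identity" if name == "Identity" else "positives"
        result[key] = round(100 * int(matched) / int(length), 2)
    return result

header = "Identity = 397/486 (81.69%), Positives = 433/486 (89.09%), Query Frame = 1"
print(hsp_percentages(header))  # {'identity': 81.69, 'positives': 89.09}
```

Comparing the recomputed values with the printed ones is a quick consistency check when the alignments are copied out of the page.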

BLAST of CmaCh14G008320 vs. NCBI nr
Match: gi|659114359|ref|XP_008457030.1| (PREDICTED: protein SET DOMAIN GROUP 40 isoform X2 [Cucumis melo])

HSP 1 Score: 810.4 bits (2092), Expect = 1.6e-231
Identity = 394/486 (81.07%), Positives = 432/486 (88.89%), Query Frame = 1

Query: 1   MGTEGSFESLLRWAADHGISDSVDKQSSHSCLGRSLCVCFFPDAGGRGLGAVRHLTKGEL 60
           M TEGSF SLLRWAADHGISDS+D+ +S SCLGRSLCV FFPD+GGRGL AVR L KGEL
Sbjct: 1   METEGSFGSLLRWAADHGISDSIDQHTSRSCLGRSLCVSFFPDSGGRGLAAVRQLNKGEL 60

Query: 61  VLKVPKSVLLTTQSLSLQDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFPYFKH 120
           +L+ PKSVLLTTQSLSL+DEKL+MALK +PSLSSTQKLTFCLL EI KG+SS WFPY KH
Sbjct: 61  ILRAPKSVLLTTQSLSLEDEKLAMALKIFPSLSSTQKLTFCLLNEISKGASSRWFPYLKH 120

Query: 121 LPTTYETLATFGEFEKQALQVDYALWEAEKAASKSHTEWRGVKGLMEESNIKNQLQTFKA 180
           LP +Y+ LATFGEFEKQALQVDYA+W  EKAA KS  +WRGVKGLM+ESNIKNQLQTFKA
Sbjct: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRMDWRGVKGLMQESNIKNQLQTFKA 180

Query: 181 WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDIMDVSSFSQHASLNGNITT 240
           WLWASATISSR LYVPWDEAGCLCPVGDLFNYAAPEGES + MDV SF  HASLN  + +
Sbjct: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAMDVLSFPSHASLNDELES 240

Query: 241 DGLHKEEQDTQRALTDGGFEENVSAYCFYARESYKRGEQVLLSYGTYSNLELLQYYGFLL 300
               +E++D+Q  LTDGGFEEN SAYCFYARESYK+GEQVLLSYGTY+N+ELL+YYGFLL
Sbjct: 241 ---LEEQRDSQWDLTDGGFEENASAYCFYARESYKKGEQVLLSYGTYTNIELLEYYGFLL 300

Query: 301 QENPNDRVFIPLEHEIYSSSSWPKESLFIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360
           QENPND+VFIP+EH+IY SSSWPKESL+IHQNGNPSFALLSALRLWATHPNKRRGVGHLA
Sbjct: 301 QENPNDKVFIPIEHDIYVSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360

Query: 361 YAGSQLSVKNEVLVMQWLSKNCHAVLNNLPTSVEEDNQLLCNICKIQDLQGPTELGKMLL 420
           YAGSQLSVKNE LVMQWLSKNCH VLNNLPTS+EED+QLLCNI K+QDLQ   EL KMLL
Sbjct: 361 YAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDDQLLCNIAKVQDLQVQRELRKMLL 420

Query: 421 TVGGEFCAFLETYGLVNREETELHLTGKIKRSLERWKLAVQWRILYKKALVDCTSYCTRT 480
           T GGE CAFLET G+VNR+E E HL+ K+KRSLERWKLAVQWR+LYKKALVDC  YCTRT
Sbjct: 421 TYGGECCAFLETNGVVNRDEAESHLSEKLKRSLERWKLAVQWRLLYKKALVDCIGYCTRT 480

Query: 481 TCSLSS 487
            CSLSS
Sbjct: 481 ICSLSS 483

BLAST of CmaCh14G008320 vs. NCBI nr
Match: gi|700202665|gb|KGN57798.1| (hypothetical protein Csa_3G307670 [Cucumis sativus])

HSP 1 Score: 806.6 bits (2082), Expect = 2.4e-230
Identity = 388/473 (82.03%), Positives = 424/473 (89.64%), Query Frame = 1

Query: 1   MGTEGSFESLLRWAADHGISDSVDKQSSHSCLGRSLCVCFFPDAGGRGLGAVRHLTKGEL 60
           M TEGS  SLLRWAADHGISDSVD+ +SHSCLG SLCV FFPD GGRGL AVR L KGEL
Sbjct: 1   METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL 60

Query: 61  VLKVPKSVLLTTQSLSLQDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFPYFKH 120
           VL+ PKS+LLTTQSLSL+DEKL MALKRYPSLSSTQKLTFCLLYEI KG SSWWFPY KH
Sbjct: 61  VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH 120

Query: 121 LPTTYETLATFGEFEKQALQVDYALWEAEKAASKSHTEWRGVKGLMEESNIKNQLQTFKA 180
           LP +Y+ LATFGEFEKQALQVDYA+W  EKAA KS T+WRGV+GLM+ESNIK+QLQTFKA
Sbjct: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA 180

Query: 181 WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDIMDVSSFSQHASLNGNITT 240
           WLWASATISSR LYVPWDEAGCLCPVGDLFNYAAPEGES + +DV SF  HASLN  +  
Sbjct: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELE- 240

Query: 241 DGLHKEEQDTQRALTDGGFEENVSAYCFYARESYKRGEQVLLSYGTYSNLELLQYYGFLL 300
             L +E++D+Q ALTDGGFEEN SAYCFYARESY++GEQVLLSYGTY+NLELL+YYGFLL
Sbjct: 241 --LLEEQRDSQWALTDGGFEENASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYGFLL 300

Query: 301 QENPNDRVFIPLEHEIYSSSSWPKESLFIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360
           QENPND+VFIP+EH+IY SSSWPKESL+IHQNGNPSFALLSALRLWATHPNKRRGVGHLA
Sbjct: 301 QENPNDKVFIPIEHDIYGSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360

Query: 361 YAGSQLSVKNEVLVMQWLSKNCHAVLNNLPTSVEEDNQLLCNICKIQDLQGPTELGKMLL 420
           YAGSQLSVKNE+LVMQWLSKNCH VLNNLPTS+EEDNQLLCNI K+QDLQ P EL K LL
Sbjct: 361 YAGSQLSVKNEILVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLL 420

Query: 421 TVGGEFCAFLETYGLVNREETELHLTGKIKRSLERWKLAVQWRILYKKALVDC 474
           T GGEFCAFLET G+VNR+E E H + K+KRSL+RWKLAVQWR+LYKKALVDC
Sbjct: 421 TYGGEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDC 470

BLAST of CmaCh14G008320 vs. NCBI nr
Match: gi|659114357|ref|XP_008457029.1| (PREDICTED: protein SET DOMAIN GROUP 40 isoform X1 [Cucumis melo])

HSP 1 Score: 804.3 bits (2076), Expect = 1.2e-229
Identity = 394/491 (80.24%), Positives = 432/491 (87.98%), Query Frame = 1

Query: 1   MGTEGSFESLLRWAADHGISDSVDKQSSHSCLGRSLCVCFFPDAGG-----RGLGAVRHL 60
           M TEGSF SLLRWAADHGISDS+D+ +S SCLGRSLCV FFPD+GG     RGL AVR L
Sbjct: 1   METEGSFGSLLRWAADHGISDSIDQHTSRSCLGRSLCVSFFPDSGGKLCFRRGLAAVRQL 60

Query: 61  TKGELVLKVPKSVLLTTQSLSLQDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWF 120
            KGEL+L+ PKSVLLTTQSLSL+DEKL+MALK +PSLSSTQKLTFCLL EI KG+SS WF
Sbjct: 61  NKGELILRAPKSVLLTTQSLSLEDEKLAMALKIFPSLSSTQKLTFCLLNEISKGASSRWF 120

Query: 121 PYFKHLPTTYETLATFGEFEKQALQVDYALWEAEKAASKSHTEWRGVKGLMEESNIKNQL 180
           PY KHLP +Y+ LATFGEFEKQALQVDYA+W  EKAA KS  +WRGVKGLM+ESNIKNQL
Sbjct: 121 PYLKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRMDWRGVKGLMQESNIKNQL 180

Query: 181 QTFKAWLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDIMDVSSFSQHASLN 240
           QTFKAWLWASATISSR LYVPWDEAGCLCPVGDLFNYAAPEGES + MDV SF  HASLN
Sbjct: 181 QTFKAWLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAMDVLSFPSHASLN 240

Query: 241 GNITTDGLHKEEQDTQRALTDGGFEENVSAYCFYARESYKRGEQVLLSYGTYSNLELLQY 300
             + +    +E++D+Q  LTDGGFEEN SAYCFYARESYK+GEQVLLSYGTY+N+ELL+Y
Sbjct: 241 DELES---LEEQRDSQWDLTDGGFEENASAYCFYARESYKKGEQVLLSYGTYTNIELLEY 300

Query: 301 YGFLLQENPNDRVFIPLEHEIYSSSSWPKESLFIHQNGNPSFALLSALRLWATHPNKRRG 360
           YGFLLQENPND+VFIP+EH+IY SSSWPKESL+IHQNGNPSFALLSALRLWATHPNKRRG
Sbjct: 301 YGFLLQENPNDKVFIPIEHDIYVSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRG 360

Query: 361 VGHLAYAGSQLSVKNEVLVMQWLSKNCHAVLNNLPTSVEEDNQLLCNICKIQDLQGPTEL 420
           VGHLAYAGSQLSVKNE LVMQWLSKNCH VLNNLPTS+EED+QLLCNI K+QDLQ   EL
Sbjct: 361 VGHLAYAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDDQLLCNIAKVQDLQVQREL 420

Query: 421 GKMLLTVGGEFCAFLETYGLVNREETELHLTGKIKRSLERWKLAVQWRILYKKALVDCTS 480
            KMLLT GGE CAFLET G+VNR+E E HL+ K+KRSLERWKLAVQWR+LYKKALVDC  
Sbjct: 421 RKMLLTYGGECCAFLETNGVVNRDEAESHLSEKLKRSLERWKLAVQWRLLYKKALVDCIG 480

Query: 481 YCTRTTCSLSS 487
           YCTRT CSLSS
Sbjct: 481 YCTRTICSLSS 488

BLAST of CmaCh14G008320 vs. NCBI nr
Match: gi|659114393|ref|XP_008457032.1| (PREDICTED: protein SET DOMAIN GROUP 40 isoform X4 [Cucumis melo])

HSP 1 Score: 588.6 bits (1516), Expect = 1.0e-164
Identity = 283/347 (81.56%), Positives = 309/347 (89.05%), Query Frame = 1

Query: 140 QVDYALWEAEKAASKSHTEWRGVKGLMEESNIKNQLQTFKAWLWASATISSRALYVPWDE 199
           QVDYA+W  EKAA KS  +WRGVKGLM+ESNIKNQLQTFKAWLWASATISSR LYVPWDE
Sbjct: 96  QVDYAIWATEKAALKSRMDWRGVKGLMQESNIKNQLQTFKAWLWASATISSRTLYVPWDE 155

Query: 200 AGCLCPVGDLFNYAAPEGESLDIMDVSSFSQHASLNGNITTDGLHKEEQDTQRALTDGGF 259
           AGCLCPVGDLFNYAAPEGES + MDV SF  HASLN  + +    +E++D+Q  LTDGGF
Sbjct: 156 AGCLCPVGDLFNYAAPEGESFNAMDVLSFPSHASLNDELES---LEEQRDSQWDLTDGGF 215

Query: 260 EENVSAYCFYARESYKRGEQVLLSYGTYSNLELLQYYGFLLQENPNDRVFIPLEHEIYSS 319
           EEN SAYCFYARESYK+GEQVLLSYGTY+N+ELL+YYGFLLQENPND+VFIP+EH+IY S
Sbjct: 216 EENASAYCFYARESYKKGEQVLLSYGTYTNIELLEYYGFLLQENPNDKVFIPIEHDIYVS 275

Query: 320 SSWPKESLFIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSVKNEVLVMQWLS 379
           SSWPKESL+IHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSVKNE LVMQWLS
Sbjct: 276 SSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSVKNETLVMQWLS 335

Query: 380 KNCHAVLNNLPTSVEEDNQLLCNICKIQDLQGPTELGKMLLTVGGEFCAFLETYGLVNRE 439
           KNCH VLNNLPTS+EED+QLLCNI K+QDLQ   EL KMLLT GGE CAFLET G+VNR+
Sbjct: 336 KNCHTVLNNLPTSIEEDDQLLCNIAKVQDLQVQRELRKMLLTYGGECCAFLETNGVVNRD 395

Query: 440 ETELHLTGKIKRSLERWKLAVQWRILYKKALVDCTSYCTRTTCSLSS 487
           E E HL+ K+KRSLERWKLAVQWR+LYKKALVDC  YCTRT CSLSS
Sbjct: 396 EAESHLSEKLKRSLERWKLAVQWRLLYKKALVDCIGYCTRTICSLSS 439

The following BLAST results are available for this feature:
Match Name    E-value   Identity (%)  Description
SDG40_ARATH   1.4e-137  54.18         Protein SET DOMAIN GROUP 40 OS=Arabidopsis thaliana GN=SDG40 PE=2 SV=1
SETD3_DANRE   3.1e-12   23.53         Histone-lysine N-methyltransferase setd3 OS=Danio rerio GN=setd3 PE=1 SV=1
SETD4_HUMAN   4.9e-10   22.36         SET domain-containing protein 4 OS=Homo sapiens GN=SETD4 PE=2 SV=1
SETD4_MOUSE   2.3e-07   23.53         SET domain-containing protein 4 OS=Mus musculus GN=Setd4 PE=2 SV=1

Match Name         E-value   Identity (%)  Description
A0A0A0L7L4_CUCSA   1.7e-230  82.03         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G307670 PE=4 SV=1
A0A067KHN9_JATCU   4.8e-161  60.25         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_10483 PE=4 SV=1
A0A061EFC1_THECC   7.1e-157  59.13         SET domain group 40, putative isoform 1 OS=Theobroma cacao GN=TCM_017553 PE=4 SV...
V4SX96_9ROSI       2.1e-156  60.58         Uncharacterized protein OS=Citrus clementina GN=CICLE_v10011537mg PE=4 SV=1
B9T3H1_RICCO       2.5e-154  58.45         Protein SET DOMAIN GROUP, putative OS=Ricinus communis GN=RCOM_1123320 PE=4 SV=1

Match Name    E-value   Identity (%)  Description
AT5G17240.1   7.8e-139  54.18         SET domain group 40
AT3G07670.1   4.9e-08   23.69         Rubisco methyltransferase family protein

Match Name                         E-value   Identity (%)  Description
gi|449456212|ref|XP_004145844.1|   2.5e-235  81.69         PREDICTED: protein SET DOMAIN GROUP 40 [Cucumis sativus]
gi|659114359|ref|XP_008457030.1|   1.6e-231  81.07         PREDICTED: protein SET DOMAIN GROUP 40 isoform X2 [Cucumis melo]
gi|700202665|gb|KGN57798.1|        2.4e-230  82.03         hypothetical protein Csa_3G307670 [Cucumis sativus]
gi|659114357|ref|XP_008457029.1|   1.2e-229  80.24         PREDICTED: protein SET DOMAIN GROUP 40 isoform X1 [Cucumis melo]
gi|659114393|ref|XP_008457032.1|   1.0e-164  81.56         PREDICTED: protein SET DOMAIN GROUP 40 isoform X4 [Cucumis melo]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR001214   SET_dom
IPR015353   Rubisco_LSMT_subst-bd
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0005515      protein binding
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh14G008320.1  CmaCh14G008320.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description                         Source   Source Term         Source Description                                 Alignment
IPR001214  SET domain                              PFAM     PF00856             SET                                                coord: 46..285, score: 1.
IPR001214  SET domain                              PROFILE  PS50280             SET                                                coord: 34..285, score: 11
IPR015353  Rubisco LSMT, substrate-binding domain  GENE3D   G3DSA:3.90.1420.10  -                                                  coord: 325..410, score: 8.5E-10; coord: 454..476, score: 8.5
IPR015353  Rubisco LSMT, substrate-binding domain  PFAM     PF09273             Rubis-subs-bind                                    coord: 336..401, score: 1.
IPR015353  Rubisco LSMT, substrate-binding domain  unknown  SSF81822            RuBisCo LSMT C-terminal, substrate-binding domain  coord: 295..402, score: 6.5
None       No IPR available                        GENE3D   G3DSA:3.90.1410.10  -                                                  coord: 6..298, score: 4.2
None       No IPR available                        PANTHER  PTHR13271           UNCHARACTERIZED PUTATIVE METHYLTRANSFERASE         coord: 7..220, score: 9.2E-195; coord: 452..486, score: 9.2E-195; coord: 261..405, score: 9.2E
None       No IPR available                        PANTHER  PTHR13271:SF19      PROTEIN SET DOMAIN GROUP 40                        coord: 7..220, score: 9.2E-195; coord: 261..405, score: 9.2E-195; coord: 452..486, score: 9.2E
None       No IPR available                        unknown  SSF82199            SET domain                                         coord: 7..303, score: 1.53