CSPI03G21300 (gene) Wild cucumber (PI 183967)

NameCSPI03G21300
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionSET domain group 40
LocationChr3 : 17506623 .. 17515433 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAAAAATAAAATAGAGCAATGAGAAAAGTTATGATAAATGACGTGTGAAATTAAAAGGAAAAAAGAAGAAGAAAAAGATAAAGAAAAAGAAGGTGAATTGCGTCTGTTGAAAGGTTTTGATGGAGGGTTTATGAGATGGAAACCGAAGGTAGTTTGGGAAGCCTGCTGAGATGGGCAGCCGATCATGGAATTTCAGATTCTGTAGATCAACCCACTTCACATTCTTGTTTGGGTCATTCTTTGTGCGTCTCTTTCTTCCCTGATACCGGCGGGTATGCTTTGATTTCTCGTTTTCTAGTTACATCACTCTTTTTCTCTTCTTTCGCCGCTTGTAAGTTGTTCATTTTGACGGATTGTTCTTCAGGAGAGGTTTGGCCGCTGTTCGTCAACTTAAGAAAGGAGAGTTAGTGCTGAGAGCTCCAAAATCTATCTTGTTGACCACCCAAAGTTTGTCGTTGGAAGATGAGAAGCTCGACATGGCTCTAAAGAGATACCCATCTCTTTCATCGACACAGGTTATTCCCTTGCTAACCTAATTTGCGCGAACTTTTGTTAGGAAGTAGGAGAGCTAGGAAGAAATTTGGGTGTTTAGGGCGCTAGGTTTAGTTATGATGCAACGACTACGTGCATGCATTACTCGAAAATAGATCTTTTGCAAATTCTAAAACAAAACATTATTAAATTACAAACGAAGTCCTTGCTAACCTTTAACTAATTGATCTTTTTTACGAGGAATTACAACTTTAGAAAAAAATGAAAGAATACAAGGGCATACAAATAAACCAAACCACAAATAATACCCCAACTAAAGGAAGGGGACTAACTGAAAAGGATATCACCTATGGAATAATTACAAAATGATCTTGAAATCGAAGTCTAAGGGGACACATGAAATCTAATCAAAGACCAAACCTCAATAAGATCCCTTTCCTTTCCCTACCACAAAACACACAATCATTCCTCTCCCTCCAAACGTCCCACAAGATAGCACGCACCCTAATGATCCATAGAAAACCTCCCTTGAGAAAGTGGATGGAGGTGAAACTCCTCGATGGTCATACGAACCCCTCTAAGATCAACATATCTCATGCCAAACTCCTAAAACAACATACTTCACAAAGCAAGCGCATAATGACAATCCCAAAAGAGGTGACCAAGATCTTTCTTTGCCCTTCGACAAAGCAAGCAACAAAAAGGCCTAACAAGAGAAGATCTCCTCCTCACAAGCCCATCCACAGTGTTAATCCGACTAACAGAACTTGCTAGGTGAAGAACCTATCTTTCATAGGAACCTTGGTCCTCCAAACCACATTAAAAACAGACCCCCTATAGGGGGAGAGATCCAATAACAAACTAAAAAAAGATTTACATAAGAATCCCTAACTTGGATGAGGACTCCAAACACAAACATCCTCCTTCCCTCCTTAAGACCGACCCTTTCAACCAGAAAAAGAAGAGAGACCACCTCCATCGTTTTTCTATCGGTCAAATTACGACAGAACCAGAAAAAGAAGGATACATAATTATCGAACCTGACCAAAAAGTCCAATATTGTACAATTTTTTTGAATTGGATAAATGATAAAGATGCAGAAACAAGGAGCAAAGAGAAAGCATTGATCCTCCCAGAAAATCATGTCCTTATCCTCCCTCACAAAACAACGGATGAAATGGGAAAAAAGAACTCAAGACAATCATCTTTCCAAGAATTGCGTTTTGTGCGTATAACCCTTTTCGTCAACCACTCAAAAGGATGGGTACCGTACTTACTCTCAATAATCCTATGCCATAAGTAGTCGGACACGAGGTGAAAGCGCCAAATCCACTTCGGCAACAAGGCTCTGTTTCAGAGACTTAGATTACCGATCTCTAACCCCCCTGACCAACAGGGTGCCCCACTGCAATCCTATTAACCAGATGGGGGCCATGACCTTCCTCCACCCCTTCCAAGAAAATTTCTCATGTATTTTTCTAGGCTCTTGAACACTGACCTAGGAGCCCTGAAGAGAGACAAATAATAAACCAGAATCCCACTAAGCACAGACCTTATTAGGGTTAATCTACCAGCTCTTGAAAAGAACCCTCTTCCATACTGCTAGTCTCTTACGAATTTTATCATAAAGAGGATCTTAAAAAGTCAAAGCTTTCAGATTTCCACCAAGGGGAAGCCCAAGATACGTCGAAGGGAACAAGCCAACCTTACAATCAAACATCTCAGCCCAACTTAGCAACTTAGCTTGATTAGTGTTAATACCAAAAATGGAATATTTTCTTCTGATAATTTTTAACTTGAATAGATCTTCAAAGAAAGCCACAATATGGTTGATAATAAGGAAGGACTCCTCTTTTCCAGAACGAAGAAGCATAGTTATCATCCGCGAATTGAAGATGCGATAAGGCAACTTCATTCCTCCCACACTAAAAGGGTCAACAATGTTTCCTCCCACCCCTTCATAAATAATCCGACTTAAAACATCCACAACCAATAAAAACAGGGAGGGGATAGAGGATCCTTGTCTTAACTCTTTGGAGGCTTGAATAGGCCCTCTGGGGGTACCATTGATGAGAATATAATGCTTCACATTCCTCACAACCCCCCACATCCACATTATCCATTTAGAGCCAAACCCTTTTTTAATTAGAACTCTATCCAAAAATCCCAATCCACATGATCGTATGCCTTTTCAAAATCAAGTTTAAGCAACACACCTCTTTTCCTAGATCTGTACTCCTCGATAGCTTCATTTGCAACAAGGACTTGATCCGGAATCTGCCTTCCTGCCACCAAGGCACCTTGGGCCTCAGAAATAGTAGATGACATAACTTTCCTTAAATGATTACCAAGAACCTTAGTCAAAATCTTATAAACACTGGCAACAAGCCTAATAGACCTAAAATCCTAACTCTGGCGCACTATCCTTTTTAGGAATCAAACATACAAAGGTTTCAACCAGCGAACTATTTAAAATACCTCTTGGTTTACTCGTTGCTTTGTGGGGGATGGTAAGGACACGTGCTTTTGGGAGGATCAGTGGGTGGGGGAAAATTTCATTTGTTCTTTATTTCCACATCTTTATTATTTATCTTCTTCCAAAAAAGGTATGATATTGGATCTTTTGGTTGGGTTCGAGAATCCCATGTTTATTTCTTTCGGGTTTGTCGTAATTTGACCATTAAAGAAACGATGGAGGTGACCTCTCTTCTTGCTTTGGTTGAGGGGTGTAGTTTTAGGGAGGGGAGAAGGGATGTTTGTGTTTTTTGGAATCGTAATCCAAGTTAGGGTTATCTGCATGGGTTTAGTCCTTTGTTAGATCCCTACCCCCCTAGGGAGTCGGTTTATGATGCGTTTGGAGGACTAAGGTTCCTAAGAAAGTTAGGTTATTTATCTGGCAAGTCTTGCTTGGTCGGGTTAACATTGTTGATAAGAAAGTTAAGTTATTTATCTGGCAAGTCTTGCTTGGTCGGGTTAACATTGTTGATAGGCTTCTTAGGAGTACTTTGCTTGTTAGACCTTTTTGTTGCATGCTTTGTCAGAAGGCAGAGGAAGATCTTGATCATCTTTTTTGGGATTGCCCTTATGTGGGCTGAGTGAATTTTCTTTTCAGGAGTTTGGTGTTAGTTATGCCGGCCTTCAGAGTGTCAAAGTGACGATTGAGGAGTTCCTCCTCCATCCGTTCTTCAAAGACAAAAGGGGTTTTTTTTATGGCTAGCCGAGGTGTGTGCCTTTAACTAGGACATTTGGGTAAGAGGAATGATCGGGTGTTTCTTGGTAGGGATTTGGTTGTTGGTTGGATTTCACGTGTTTCTTTGGGCTTTGATTTTGAAGACCTTTCAATAATTATTCAATTGGGCTTTGTTTTGGTGGGTTGGTTATTTTGTATGCTCTTATATTCTTTCATTTGTTCTCAATTAAAGTTGTTTTCATAAAGAAAAACGAAAAAAGAGATTGAGTTCTTTGATAAAGATAGGTACGACGTGAGATGCACCTTAGTATTTTCTCTTTAAGAATTACAAGCCTGCAGCCACCTAAATGGAAAAAGAGGAGATCTTTATACATGAATGACCATAGTTATGCTTAAAGATGTTTTATATTCTATTCTATGTTAATGTTTAAATGTTCTTAGTGCAATTGGGACAGATTATCTAAAACCAAATGGATCCTTTTCAGCACATTGGGATAACCCAATATCCTCTTGGTCCATAACTTTTAGAAGCCTTCTTAAAGATAACGAGATTCTGGATTTTCAAAATTTGATGGCTCAACTGTCAATTCAAACCTTGGTAGAAGACCATGGTCTTTGGAAGCTTCTGGCAGTTTCTCGGTCAAATCTCTCACATAACACCTCTCAGCTTTATCTCTTTTGGATAGAGACCTCTAAAAGCTTTGTGGAATCTCATATTACACTTCTCGGCTTTATCTCTTTTGGATAGAGGCCTCTAAAAGATTTTTGGAAATCTAAATGCCCGAGAAGAGTAAACATTCTTATAACCCATGTTGGACTTCAAAACTGCTCTTTAGTCATGCAAAGAAAGTTACCAAACAGCTGCCTTTCACCCTCAATATGCCCACTGTGCAAGCAAGAAGGAGAAGATTTGCAGCATTTGTTCTTTTCCTGTTTCTATTCTGCTAATTGTTGGTGGAAGTTCTTCTCAATATTTGAAGTTGCGTGGGTTTTTAGAGAATCATTTAGCCTAAATGTACAGAAAGTTTTATGGGGCCCATTTTAAAGAAAGGAATGAGACTAATTGGGCAAATTTATCAAAGGTCGTGTTGGCAGAAATTTGGTTTGAACGAAATCAAAAGAGAGATGTATTGGTTAGACCTCTTTGAGAAGGAATGCTGCAGCTTGGTGTACTTTACAGATCATTGGAAGCCACCTTCTTCATGACTCATGTTTTTGCAACCATCGTTGAAAGCTCTTTGTCTTCACATCTTCAGGCCATTCGTGCCTCACTTGTGCCGTTTTCTTGTTTCTTATTTTAGTTGCTTTACAGCCCCTACTTGTTGGATTCTTGTAAGATCTTTTTATGTATCTTAGGCTGTATTTTTGTTGGATATGACTAGGGCGCTAGGGGGGATGTAAACCTAGTTGAGATGTTGGGCTACGCCAGCTGATTCTTTGGTCTCTTTTTGTTTCGCTCTTTGTACGAACCTTTTGTAGTTTGAGCTTTATTCTTATCAATAAAGAGGTTGGTTTCCTTTTCAGAAAGAAAAAACATCACATTTTCTCTTGTTAGTTCTAAAGGTGTACTGATATTAATTTCACAATCCCTCCCCCTACTCAAATGCAGAAGTTGACCTTTTGTTTGCTCTATGAGATCAGTAAAGGACCCAGTTCTTGGTGGTTTCCTTACTTAAAGCATTTGCCCCAGAGCTATGACATACTGGCAACTTTTGGAGAATTTGAAAAGCAAGCCCTGCAGGTTCAATTTTCCGTTTGTGGCAATTTGATTTATTTCTTAGGATCATATTCCTATTCGTGAATCATATTCAATGAGAATATTTTACATGTGTTTTTGACATTTGTTTTATAATAAAATAGTTAAAGTCATTAGCAGGTTGATTATGCAATCTGGGCAACAGAGAAGGCTGCTTTGAAGTCTCGTACCGATTGGAGAGGAGTCGAAGGATTAATGCAAGAATCCAATATTAAAAGCCAACTCCAAACATTCAAGGCATGGCTTTGGGCCTCTGCAACTGTAAGCACACTGGATTCATGCTAGAAGATTGTATATTGCATGGTTAGATTGGACCTGGATTTTAAAATTTATTTTTATTTTTGGTAAGAAACCAAACTTTTATTGAGAAAAAAAATGAGTGAAAGAGGGCCCCCAACTAACTTTGAAAAGAGTTCCAATCTAAGAAATTCCAAATTCTTAATTACGATATAGGATTAATAGACATTTTCTTATTAGTTATTACCAAAAACCTGGGCTTAGGTTTCAATTTTTTTAGGTTTCAATTTTTTTTTAACAATTAATTAATTTGTTTGTTTGGTGAAAAAATTATGCATCATTTAAAGGTGGTTTAGCTGGAGGTCTCGTATGGGTAAATTATCCAAGACCACTATCCAAGGTCACTGCAAATATGCTTTTCTTATCGAGGAAAGATTCCACTAATTGACACCCATTTGCTGATTATCTATCCTTCCATTACCCACTACCTTGTAATTGTAGTCCTGTACGCATTTCCCATTACCAGCCAGTAAGCATAATTTAGACTCCTATTCTATCTAATATGCTCTCTCTCCCATCCCACCTTGCTCTCCTCCTATAGTCCTATCCCCTTCACTTAAGGTTCCCCTTTAGGTATTCTTTGTCTCAAACTACTGTCACTGACCTTTCGTGATAGGACCTAATTATTGAAGTTGAGGGTGGATGGTTATAGATGAAGCAAACCAAAACTCTCTAGGATTGGAAAGGATAACTTTTCCTCCATCTCTCTCATCAATGGGGTCCTCTCGTGGATTAAAACTTGCTACATAGGCGTGCTCCTCCACTAGAACTCTATAGGGAGGATTATGTGTTTGCATTGAATAAATCTCGTATAGGAAAGAGACTACTGTTGAGATCATTAAGCTGAACTCAAAGACACATTTTATTCCATGTAGGTGAGAATTATTGAGGTCAGTTCTTCTCCTTGTGAAAGATTACCTTGTGCCACCCTGCCTCTAATAGTTCATTGTTTTCCTTGACTTGTGAAGTGGAAGTTGCTTGCACAGAGGCTTTTGGATCGTCCCATACAAGAATACTTGGATGCTCCTTCAAGCACGGTTGCATTTTGTACTAAAATCCCTTCAAAGTCTCAATAGCTTTTTGTTGCCATAGAATAGAACTATAGAATTCATCCTCTTTTTTATGAATGTGAATTGTTGAGACTCGAGAACTTGTTTTGTAATTTTCTCTTACCCTTCTTTTGGAGGATATATTATTGACCTTACTTGTATCTATTTTTCTTATGGAAAAACAAGTCTTTGTTTGTCTTATGCATTGTATGATGAGTCTAAATGTGAGCATGGTTTGGTCCGATGTAAACTTTTTCAATCTTAACCTCAGTTTGTTGAATTCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGACGTTAGTAGCTAAAAACTAAAAGTATATGTTGCCCTTTGAGAGATATCATCTAGGACATTGTATGTACCATGGGATGAGGCCGGATGTTTATGTCCAGTTGGTGACCTGTTTAATTATGCTGCACCTGAAGGGGAATCTTTTAATGCTGTGGATGTTTTGTCCTTTCCATCACATGCTTCTTTGAATGATGAGTTAGAGTTACTTGAAGAGCAAAGAGATAGTCAATGGGCTTTGACAGATGGCGGATTTGAGGAAAATGCTTCTGCCTACTGCTTCTATGCTCGGGAAAGTTATAGGAAGGGAGAGCAGGTATTTTATATTTTCTTAGTGTCATTTATATATTTTGTACTCATATGGAAAGAGGAGTTAATGCAAAATACAATAAACATAAGGTTAACATTTTATTTATGGAGTTTGATTTCTATTTGAAGAAGGTTGAAATTTGCATTAACATTTTTTATTTGTTCTCTGATGCATTCCTATTAATCTGCACCTTGAGATTTCCATAATAAAACTCGAGATTGTTTCTTCATGTCTCAAGGCCGAGATTGTTTCTTCATGTCTCAAGGCATGAATATCAATAAGTCCATTTTTAATTTGAAAAGAAAAATAAATGTTCTCTAAAAAATAATGCTTTGCACCAGTCTCTAAAAAAGAATGTGTGTGTGGATGAGTAACAATTGAAACCTTAAGAGGCCATGGGTTCAATCCATGGTAGGCATCTTTGATTTCCAATGAGTTTTCTTAACACCCAAATGTTGTAGGGTTAGACGAGTTGTCTTGTGAGATTAGTCGAGTTGCACGTAAGCTGATCCAGACACTCATGGATATCAGAAGAAAGTAACGATTAAAACATTTCTATTGTTTGCTGCAGGTTCTTTTAAGCTATGGTACATACACAAACTTAGAGCTTCTTGAATACTATGGGTTTCTTCTGCAGGAAAATCCAAATGACAAAGTTTTTATTCCTATCGAACATGATATCTATGGTTCCAGTTCTTGGCCCAAGGAATCTCTTTATATTCATCAAAATGGAAACCCATCATTTGCTCTACTTTCTGCTCTGAGATTATGGGCAACCCATCCGAACAAGCGTAGAGGTGTCGGGCATCTTGCTTATGCCGGATCACAACTCTCTGTCAAGAATGAAACATTAGTCATGCAGTGGTTATCCAAGAACTGCCATACTGTTTTAAACAATCTGCCAACATCAATTGAAGAAGACAATCAGCTTCTGTGCAATATCGCCAAAGTCCAAGACCTGCAGGTACCAAGGGAGCTCCAGAAGACATTGTTGACCTATGGAGGCGAGTTTTGTGCTTTCTTGGAAACCAATGGTGTGGTGAATAGAGATGAAGCAGAGTCACATTCATCCCAGAAACTAAAACGTTCTCTAGACAGATGGAAACTAGCAGTCCAGTGGAGGCTTTTGTACAAGAAGGCGTTGGTTGATTGCATAGGTTACTGCACCACAACTATTTGCTCTCTTTCTTCTTAATCTGGTTCAGGTTGTTATTTTCAGGTTTTAATTTAAATTAGCTATTTAATGAATTTTATAGGTGTTAAAAAGATTAGAATGGTATGAGTACGATAGCATTGAACCGACTGCCTCCTTTTGCATTTGCTTTATGCTCCTAATTTGCCAAC

mRNA sequence

ATGGAAACCGAAGGTAGTTTGGGAAGCCTGCTGAGATGGGCAGCCGATCATGGAATTTCAGATTCTGTAGATCAACCCACTTCACATTCTTGTTTGGGTCATTCTTTGTGCGTCTCTTTCTTCCCTGATACCGGCGGGAGAGGTTTGGCCGCTGTTCGTCAACTTAAGAAAGGAGAGTTAGTGCTGAGAGCTCCAAAATCTATCTTGTTGACCACCCAAAGTTTGTCGTTGGAAGATGAGAAGCTCGACATGGCTCTAAAGAGATACCCATCTCTTTCATCGACACAGAAGTTGACCTTTTGTTTGCTCTATGAGATCAGTAAAGGACCCAGTTCTTGGTGGTTTCCTTACTTAAAGCATTTGCCCCAGAGCTATGACATACTGGCAACTTTTGGAGAATTTGAAAAGCAAGCCCTGCAGGTTGATTATGCAATCTGGGCAACAGAGAAGGCTGCTTTGAAGTCTCGTACCGATTGGAGAGGAGTCGAAGGATTAATGCAAGAATCCAATATTAAAAGCCAACTCCAAACATTCAAGGCATGGCTTTGGGCCTCTGCAACTATATCATCTAGGACATTGTATGTACCATGGGATGAGGCCGGATGTTTATGTCCAGTTGGTGACCTGTTTAATTATGCTGCACCTGAAGGGGAATCTTTTAATGCTGTGGATGTTTTGTCCTTTCCATCACATGCTTCTTTGAATGATGAGTTAGAGTTACTTGAAGAGCAAAGAGATAGTCAATGGGCTTTGACAGATGGCGGATTTGAGGAAAATGCTTCTGCCTACTGCTTCTATGCTCGGGAAAGTTATAGGAAGGGAGAGCAGGTTCTTTTAAGCTATGGTACATACACAAACTTAGAGCTTCTTGAATACTATGGGTTTCTTCTGCAGGAAAATCCAAATGACAAAGTTTTTATTCCTATCGAACATGATATCTATGGTTCCAGTTCTTGGCCCAAGGAATCTCTTTATATTCATCAAAATGGAAACCCATCATTTGCTCTACTTTCTGCTCTGAGATTATGGGCAACCCATCCGAACAAGCGTAGAGGTGTCGGGCATCTTGCTTATGCCGGATCACAACTCTCTGTCAAGAATGAAACATTAGTCATGCAGTGGTTATCCAAGAACTGCCATACTGTTTTAAACAATCTGCCAACATCAATTGAAGAAGACAATCAGCTTCTGTGCAATATCGCCAAAGTCCAAGACCTGCAGGTACCAAGGGAGCTCCAGAAGACATTGTTGACCTATGGAGGCGAGTTTTGTGCTTTCTTGGAAACCAATGGTGTGGTGAATAGAGATGAAGCAGAGTCACATTCATCCCAGAAACTAAAACGTTCTCTAGACAGATGGAAACTAGCAGTCCAGTGGAGGCTTTTGTACAAGAAGGCGTTGGTTGATTGCATAGGTTACTGCACCACAACTATTTGCTCTCTTTCTTCTTAA

Coding sequence (CDS)

ATGGAAACCGAAGGTAGTTTGGGAAGCCTGCTGAGATGGGCAGCCGATCATGGAATTTCAGATTCTGTAGATCAACCCACTTCACATTCTTGTTTGGGTCATTCTTTGTGCGTCTCTTTCTTCCCTGATACCGGCGGGAGAGGTTTGGCCGCTGTTCGTCAACTTAAGAAAGGAGAGTTAGTGCTGAGAGCTCCAAAATCTATCTTGTTGACCACCCAAAGTTTGTCGTTGGAAGATGAGAAGCTCGACATGGCTCTAAAGAGATACCCATCTCTTTCATCGACACAGAAGTTGACCTTTTGTTTGCTCTATGAGATCAGTAAAGGACCCAGTTCTTGGTGGTTTCCTTACTTAAAGCATTTGCCCCAGAGCTATGACATACTGGCAACTTTTGGAGAATTTGAAAAGCAAGCCCTGCAGGTTGATTATGCAATCTGGGCAACAGAGAAGGCTGCTTTGAAGTCTCGTACCGATTGGAGAGGAGTCGAAGGATTAATGCAAGAATCCAATATTAAAAGCCAACTCCAAACATTCAAGGCATGGCTTTGGGCCTCTGCAACTATATCATCTAGGACATTGTATGTACCATGGGATGAGGCCGGATGTTTATGTCCAGTTGGTGACCTGTTTAATTATGCTGCACCTGAAGGGGAATCTTTTAATGCTGTGGATGTTTTGTCCTTTCCATCACATGCTTCTTTGAATGATGAGTTAGAGTTACTTGAAGAGCAAAGAGATAGTCAATGGGCTTTGACAGATGGCGGATTTGAGGAAAATGCTTCTGCCTACTGCTTCTATGCTCGGGAAAGTTATAGGAAGGGAGAGCAGGTTCTTTTAAGCTATGGTACATACACAAACTTAGAGCTTCTTGAATACTATGGGTTTCTTCTGCAGGAAAATCCAAATGACAAAGTTTTTATTCCTATCGAACATGATATCTATGGTTCCAGTTCTTGGCCCAAGGAATCTCTTTATATTCATCAAAATGGAAACCCATCATTTGCTCTACTTTCTGCTCTGAGATTATGGGCAACCCATCCGAACAAGCGTAGAGGTGTCGGGCATCTTGCTTATGCCGGATCACAACTCTCTGTCAAGAATGAAACATTAGTCATGCAGTGGTTATCCAAGAACTGCCATACTGTTTTAAACAATCTGCCAACATCAATTGAAGAAGACAATCAGCTTCTGTGCAATATCGCCAAAGTCCAAGACCTGCAGGTACCAAGGGAGCTCCAGAAGACATTGTTGACCTATGGAGGCGAGTTTTGTGCTTTCTTGGAAACCAATGGTGTGGTGAATAGAGATGAAGCAGAGTCACATTCATCCCAGAAACTAAAACGTTCTCTAGACAGATGGAAACTAGCAGTCCAGTGGAGGCTTTTGTACAAGAAGGCGTTGGTTGATTGCATAGGTTACTGCACCACAACTATTTGCTCTCTTTCTTCTTAA
BLAST of CSPI03G21300 vs. Swiss-Prot
Match: SDG40_ARATH (Protein SET DOMAIN GROUP 40 OS=Arabidopsis thaliana GN=SDG40 PE=2 SV=1)

HSP 1 Score: 510.8 bits (1314), Expect = 1.7e-143
Identity = 266/482 (55.19%), Postives = 341/482 (70.75%), Query Frame = 1

Query: 6   SLGSLLRWAADHGISDSVDQPTSH-SCLGHSLCVSFFPDTGGRGLAAVRQLKKGELVLRA 65
           ++ + LRWAA+ GISDS+D      SCLGHSL VS FPD GGRGL A R+LKKGELVL+ 
Sbjct: 7   TMETFLRWAAEIGISDSIDSSRFRDSCLGHSLSVSDFPDAGGRGLGAARELKKGELVLKV 66

Query: 66  PKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKHLPQS 125
           P+  L+TT+S+  +D KL  A+  + SLSSTQ L+ CLLYE+SK   S+W+PYL H+P+ 
Sbjct: 67  PRKALMTTESIIAKDLKLSDAVNLHNSLSSTQILSVCLLYEMSKEKKSFWYPYLFHIPRD 126

Query: 126 YDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKAWLWA 185
           YD+LATFG FEKQALQV+ A+WATEKA  K +++W+    LM+E  +K + ++F+AWLWA
Sbjct: 127 YDLLATFGNFEKQALQVEDAVWATEKATAKCQSEWKEAGSLMKELELKPKFRSFQAWLWA 186

Query: 186 SATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELELLEEQ 245
           SATISSRTL+VPWD AGCLCPVGDLFNY AP   S    +    P  A+  +E  L+ E 
Sbjct: 187 SATISSRTLHVPWDSAGCLCPVGDLFNYDAPGDYS----NTPQGPESANNVEEAGLVVET 246

Query: 246 RDSQWALTDGGFEENASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYGFLLQENPNDK 305
              +  LTDGGFEE+ +AYC YAR +Y+ GEQVLL YGTYTNLELLE+YGF+L+EN NDK
Sbjct: 247 HSER--LTDGGFEEDVNAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFMLEENSNDK 306

Query: 306 VFIPIEHDIYG-SSSWPKESLYIHQNGNPSFALLSALRLWATHPNKR-RGVGHLAYAGSQ 365
           VFIP+E  ++  +SSWPK+SLYIHQ+G  SFAL+S LRLW    ++R + V  L YAGSQ
Sbjct: 307 VFIPLETSLFSLASSWPKDSLYIHQDGKLSFALISTLRLWLIPQSQRDKSVMRLVYAGSQ 366

Query: 366 LSVKNETLVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYGGE 425
           +SVKNE LVM+W+S+ C +VL +LPTS+ ED  LL NI K+QD ++  E QK    +G E
Sbjct: 367 ISVKNEILVMKWMSEKCGSVLRDLPTSVTEDTVLLHNIDKLQDPELRLE-QKETEAFGSE 426

Query: 426 FCAFLETN---GVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGYCTTTIC 482
             AFL+ N    V          S+K  R L +W+ +VQWRL YK+ L DCI YC   + 
Sbjct: 427 VRAFLDANCLWDVTVLSGKPIEFSRKTSRMLSKWRWSVQWRLSYKRTLADCISYCNEKMN 481

BLAST of CSPI03G21300 vs. Swiss-Prot
Match: SETD3_XENTR (Histone-lysine N-methyltransferase setd3 OS=Xenopus tropicalis GN=setd3 PE=2 SV=1)

HSP 1 Score: 83.6 bits (205), Expect = 6.6e-15
Identity = 93/389 (23.91%), Postives = 161/389 (41.39%), Query Frame = 1

Query: 41  FPDTGGRGLAAVRQLKKGELVLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTF 100
           FP+ G  GL A R++K  EL L  P+ +L+T +S          +  R         L F
Sbjct: 101 FPEEGF-GLKATREIKAEELFLWVPRKLLMTVESAKGSVLGPLYSQDRILQAMGNITLAF 160

Query: 101 CLLYEISKGPSSWWFPYLKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWR 160
            LL E    P+S+W PY+K LP  YD    F E E Q LQ   AI         +   + 
Sbjct: 161 HLLCE-RADPNSFWLPYIKTLPNEYDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQYA 220

Query: 161 GVEGLMQESNIKSQLQ-----TFKAWLWASATISSRTLYVPWDEAG----CLCPVGDLFN 220
               ++Q     ++L      TF  + WA +++ +R   +P ++       L P+ D+ N
Sbjct: 221 YFYKVIQTHPNANKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCN 280

Query: 221 YAAPEGESFNAVDVLSFPSHASLNDELELLEEQRDSQWALTDGGFEENASAYCFYARESY 280
           +        N +    +            LE+ R    AL D                 +
Sbjct: 281 HT-------NGLITTGYN-----------LEDDRCECVALQD-----------------F 340

Query: 281 RKGEQVLLSYGTYTNLELLEYYGFLLQENPNDKVFIPI-----------EHDIYGSSSWP 340
           + GEQ+ + YGT +N E + + GF  + N +D+V I +           + ++   +  P
Sbjct: 341 KSGEQIYIFYGTRSNAEFVIHNGFFFENNLHDRVKIKLGVSKSDRLYAMKAEVLARAGIP 400

Query: 341 KESLY-IHQNGNP-SFALLSALRLWATHPNKRRG----------VGHLAYAGSQLSVKNE 398
             S++ +H    P S  LL+ LR++  + ++ +G          +  L  +   +S +NE
Sbjct: 401 TSSVFALHVTEPPISAQLLAFLRVFCMNEDELKGHLIGDHAIDKIFTLGNSEFPVSWENE 452

BLAST of CSPI03G21300 vs. Swiss-Prot
Match: SETD4_HUMAN (SET domain-containing protein 4 OS=Homo sapiens GN=SETD4 PE=2 SV=1)

HSP 1 Score: 77.0 bits (188), Expect = 6.2e-13
Identity = 96/392 (24.49%), Postives = 154/392 (39.29%), Query Frame = 1

Query: 35  SLCVSFFPDTGGRGLAAVRQLKKGELVLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSS 94
           +L  + FP TG RGL +   L++G++++  P+S LLTT ++ +         K  P  S 
Sbjct: 49  NLAPACFPGTG-RGLMSQTSLQEGQMIISLPESCLLTTDTV-IRSYLGAYITKWKPPPSP 108

Query: 95  TQKLTFCLLYEISKGPSSWWFPYLKHLPQSYDILATFGEFEKQALQVDYAIWATEKAA-- 154
              L   L+ E   G  S W PYL+ LP++Y             L       A E+ A  
Sbjct: 109 LLALCTFLVSEKHAGHRSLWKPYLEILPKAYTCPVCLEPEVVNLLPKSLKAKAEEQRAHV 168

Query: 155 ----LKSRTDWRGVEGLMQESNIKSQLQTFKAWLWASATISSRTLYVPWDEAGCLCPVGD 214
                 SR  +  ++ L  E+     + ++ A LWA  T+++R +Y+   +  CL    D
Sbjct: 169 QEFFASSRDFFSSLQPLFAEA--VDSIFSYSALLWAWCTVNTRAVYLRPRQRECLSAEPD 228

Query: 215 LFNYAAPEGESFNAVDVLSFPSHASLNDELELLEEQRDSQWALTDGGFEENASAYCFYAR 274
               A         +D+L+   H  +                     F E   +Y     
Sbjct: 229 TCALAP-------YLDLLNHSPHVQVK------------------AAFNEETHSYEIRTT 288

Query: 275 ESYRKGEQVLLSYGTYTNLELLEYYGFLLQENPNDKVFIPIEHDIYGSSSWPK------- 334
             +RK E+V + YG + N  L   YGF+   NP+  V++  E  +    S  K       
Sbjct: 289 SRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKIS 348

Query: 335 --------ESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSVKNETLVMQ 394
                   E+L    +G PS+ LL+AL+L      K          G  +S  NE   + 
Sbjct: 349 ILKDHGYIENLTFGWDG-PSWRLLTALKLLCLEAEKFT-CWKKVLLGEVISDTNEKTSLD 402

Query: 395 WLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQD 406
              K C+         IEE N +L  ++ ++D
Sbjct: 409 IAQKICYYF-------IEETNAVLQKVSHMKD 402

BLAST of CSPI03G21300 vs. Swiss-Prot
Match: SETD3_CHICK (Histone-lysine N-methyltransferase setd3 OS=Gallus gallus GN=SETD3 PE=2 SV=1)

HSP 1 Score: 75.5 bits (184), Expect = 1.8e-12
Identity = 93/420 (22.14%), Postives = 162/420 (38.57%), Query Frame = 1

Query: 10  LLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGELVLRAPKSIL 69
           L++WA ++G S    +  +              +  G GL A R++K  EL L  P+ +L
Sbjct: 82  LIKWATENGASTEGFEIANF-------------EEEGFGLKATREIKAEELFLWVPRKLL 141

Query: 70  LTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKHLPQSYDILA 129
           +T +S          +  R         L F LL E    P+S+W PY++ LP  YD   
Sbjct: 142 MTVESAKNSVLGSLYSQDRILQAMGNITLAFHLLCE-RANPNSFWLPYIQTLPSEYDTPL 201

Query: 130 TFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQ-----TFKAWLWA 189
            F E E Q L+   AI         +   +     ++Q     S+L      T+  + WA
Sbjct: 202 YFEEDEVQYLRSTQAIHDVFSQYKNTARQYAYFYKVIQTHPNASKLPLKDSFTYDDYRWA 261

Query: 190 SATISSRTLYVPWDEAG----CLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELEL 249
            +++ +R   +P ++       L P+ D+ N+        N +    +            
Sbjct: 262 VSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHT-------NGLITTGYN----------- 321

Query: 250 LEEQRDSQWALTDGGFEENASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYGFLLQEN 309
           LE+ R    AL D                 ++ GEQ+ + YGT +N E + + GF    N
Sbjct: 322 LEDDRCECVALQD-----------------FKAGEQIYIFYGTRSNAEFVIHSGFFFDNN 381

Query: 310 PNDKVFIPI-----------EHDIYGSSSWPKESLYIHQNGNP--SFALLSALRLWATHP 369
            +D+V I +           + ++   +  P  S++   +  P  S  LL+ LR++  + 
Sbjct: 382 SHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHSIEPPISAQLLAFLRVFCMNE 441

Query: 370 N--KRRGVGH--------LAYAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDNQLL 398
              K   +G         L  +   +S  NE  +  +L      +L    T++E+D   L
Sbjct: 442 EELKEHLIGEHAIDKIFTLGNSEFPISWDNEVKLWTFLEARASLLLKTYKTTVEDDKSFL 452

BLAST of CSPI03G21300 vs. Swiss-Prot
Match: SETD4_MOUSE (SET domain-containing protein 4 OS=Mus musculus GN=Setd4 PE=2 SV=1)

HSP 1 Score: 68.9 bits (167), Expect = 1.7e-10
Identity = 97/399 (24.31%), Postives = 156/399 (39.10%), Query Frame = 1

Query: 41  FPDTGGRGLAAVRQLKKGELVLRAPKSILLTTQSLSLEDEKLDMALKRY-PSLSSTQKLT 100
           FP TG RGL +   L++G++++  P+S LLTT ++      L   +K++ P +S    L 
Sbjct: 54  FPGTG-RGLMSKASLQEGQVMISLPESCLLTTDTVIRSS--LGPYIKKWKPPVSPLLALC 113

Query: 101 FCLLYEISKGPSSWWFPYLKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSR--T 160
             L+ E   G  S W  YL  LP+SY             L       A E+ A      T
Sbjct: 114 TFLVSEKHAGCRSLWKSYLDILPKSYTCPVCLEPEVVDLLPSPLKAKAEEQRARVQDLFT 173

Query: 161 DWRGVEGLMQE--SNIKSQLQTFKAWLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAP 220
             RG    +Q   +     + +++A+LWA  T+++R +Y+      CL    D    A  
Sbjct: 174 SARGFFSTLQPLFAEPVDSVFSYRAFLWAWCTVNTRAVYLRSRRQECLSAEPDTCALAP- 233

Query: 221 EGESFNAVDVLSFPSHASLNDELELLEEQRDSQWALTDGGFEENASAYCFYARESYRKGE 280
                  +D+L+   H  +                     F E    Y        RK +
Sbjct: 234 ------FLDLLNHSPHVQVK------------------AAFNEKTRCYEIRTASRCRKHQ 293

Query: 281 QVLLSYGTYTNLELLEYYGFLLQENPNDKVFIPIEHDIYGSSSWPKESLYIHQ------- 340
           +V + YG + N  LL  YGF+   NP+    +P+  D+      P     +H+       
Sbjct: 294 EVFICYGPHDNQRLLLEYGFVSVRNPH--ACVPVSADML-VKFLPAADKQLHRKITILKD 353

Query: 341 ---NGN-------PSFALLSALRLWATHPNKRRGVGHLAYAGSQLSVKNETLVMQWLSKN 400
               GN       PS+ LL+AL+L      +R         G  +S  NE   +    K 
Sbjct: 354 HGFTGNLTFGWDGPSWRLLTALKLLCLEA-ERFTSWKKVLLGEVISDTNEKTSLGVAQKI 413

Query: 401 CHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLL 418
           C  V       IEE + +L  ++ +++  V    Q +L+
Sbjct: 414 CSDV-------IEETHAVLRKVSDMKEGTVSLRNQLSLV 413

BLAST of CSPI03G21300 vs. TrEMBL
Match: A0A0A0L7L4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G307670 PE=4 SV=1)

HSP 1 Score: 963.4 bits (2489), Expect = 1.1e-277
Identity = 471/472 (99.79%), Postives = 471/472 (99.79%), Query Frame = 1

Query: 1   METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL 60
           METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL
Sbjct: 1   METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL 60

Query: 61  VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH 120
           VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH
Sbjct: 61  VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH 120

Query: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA 180
           LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA
Sbjct: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA 180

Query: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELEL 240
           WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELEL
Sbjct: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELEL 240

Query: 241 LEEQRDSQWALTDGGFEENASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYGFLLQEN 300
           LEEQRDSQWALTDGGFEENASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYGFLLQEN
Sbjct: 241 LEEQRDSQWALTDGGFEENASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYGFLLQEN 300

Query: 301 PNDKVFIPIEHDIYGSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAG 360
           PNDKVFIPIEHDIYGSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAG
Sbjct: 301 PNDKVFIPIEHDIYGSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAG 360

Query: 361 SQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYG 420
           SQLSVKNE LVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYG
Sbjct: 361 SQLSVKNEILVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYG 420

Query: 421 GEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIG 473
           GEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIG
Sbjct: 421 GEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIG 472

BLAST of CSPI03G21300 vs. TrEMBL
Match: A0A061EFC1_THECC (SET domain group 40, putative isoform 1 OS=Theobroma cacao GN=TCM_017553 PE=4 SV=1)

HSP 1 Score: 585.1 bits (1507), Expect = 7.8e-164
Identity = 293/482 (60.79%), Postives = 355/482 (73.65%), Query Frame = 1

Query: 2   ETEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGELV 61
           E  GSL S L+WAA  G+SDS + P S SCLGHSL VS+FPD GGRGL AVR + +GEL+
Sbjct: 24  EERGSLDSFLKWAAGLGVSDSPN-PDSCSCLGHSLGVSYFPDAGGRGLGAVRDITRGELL 83

Query: 62  LRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKHL 121
           L+ PKS L+TT SL L DE+L  ALK +PSLS  Q LT C LYE+SKG +S W PYL HL
Sbjct: 84  LKVPKSALITTHSL-LNDERLSTALKAHPSLSPAQVLTICFLYEMSKGKASPWHPYLLHL 143

Query: 122 PQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKAW 181
           P+SY ILA FGEFEKQALQVDYAIWA +KA  K+  +W+    LM+E  +K Q  TF+AW
Sbjct: 144 PRSYGILAAFGEFEKQALQVDYAIWAAQKALSKAEYEWKKATPLMKELKLKLQFLTFRAW 203

Query: 182 LWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELELL 241
           +WA+ TISSRTL++PWDEAGCLCPVGDLFNYAAP GE  N  D +    +    D+L+  
Sbjct: 204 IWATGTISSRTLHIPWDEAGCLCPVGDLFNYAAP-GEDLNGFDNVDNLQNGYALDDLDTQ 263

Query: 242 EEQRDSQWALTDGGFEENASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYGFLLQENP 301
             QR     LTDG FEE+A+AYCFYA+ +Y+KGEQVLLSYGTYTNLELLEYYGFLL++NP
Sbjct: 264 HSQR-----LTDGAFEEDAAAYCFYAKTNYKKGEQVLLSYGTYTNLELLEYYGFLLEDNP 323

Query: 302 NDKVFIPIEHDIYGSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGS 361
           N+KVFIP+E DI+ SSSWP +SLYIHQNG PSFAL++ALR+WAT P +R+ + H AY+GS
Sbjct: 324 NEKVFIPLEPDIHSSSSWPNDSLYIHQNGRPSFALMAALRVWATPPYQRKSIRHQAYSGS 383

Query: 362 QLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYGG 421
           QLS  NE  VM W++K CH  L  +PTSIE+DN LL    K+Q+     E  K +  +GG
Sbjct: 384 QLSQDNEISVMTWIAKKCHATLKAMPTSIEDDNLLLSFTDKIQEFDNLWEWGKAMPAFGG 443

Query: 422 EFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGYCTTTICSL 481
           EFC  L+   +   D  ES +S++ K  +DRWKLAV WRL+YKK LVDCI YCT TI SL
Sbjct: 444 EFCNLLQATNLKRND--ESFASRRAKMLIDRWKLAVHWRLIYKKVLVDCISYCTDTINSL 495

Query: 482 SS 484
           SS
Sbjct: 504 SS 495

BLAST of CSPI03G21300 vs. TrEMBL
Match: A0A067KHN9_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_10483 PE=4 SV=1)

HSP 1 Score: 561.2 bits (1445), Expect = 1.2e-156
Identity = 289/489 (59.10%), Postives = 351/489 (71.78%), Query Frame = 1

Query: 7   LGSLLRWAADHGISDS---VDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGELVLR 66
           L   L WAA+ GISDS         +SC G+SL +S FPD GGRGL A R L KGELVLR
Sbjct: 10  LEGFLEWAAELGISDSPYNFQSRNPNSCFGNSLTLSHFPDAGGRGLGAARDLWKGELVLR 69

Query: 67  APKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKHLPQ 126
            PK  LLT  SL L+D  L   +  +PSLS TQ LT CLLYE+ KG SS+W+PYL HLP+
Sbjct: 70  VPKPALLTRDSL-LKDGLLSSFVNGHPSLSPTQILTVCLLYEMGKGKSSFWYPYLMHLPR 129

Query: 127 SYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKAWLW 186
           SY+ LATF EFEKQA QVD A+W TEKA  K+ ++W+    LMQE  +K +  T +AW+W
Sbjct: 130 SYETLATFSEFEKQAFQVDDAVWTTEKAISKAESEWKEANLLMQELKLKPRFLTLRAWIW 189

Query: 187 ASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELE---- 246
           ASATISSRTL++PWDEAGCLCPVGDLFNYAAP  ES       S   ++S    L     
Sbjct: 190 ASATISSRTLHIPWDEAGCLCPVGDLFNYAAPGEESTGLESAESCMLNSSPQGSLSCGHP 249

Query: 247 ---LLEEQRDSQWA-LTDGGFEENASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYGF 306
              L E + D+    LTDGGF+E+  AYCFYAR++Y+KGEQVLLSYGTYTNLELLE+YGF
Sbjct: 250 TDYLYEGRFDAHLQRLTDGGFDEDLDAYCFYARKNYKKGEQVLLSYGTYTNLELLEHYGF 309

Query: 307 LLQENPNDKVFIPIEHDIYGSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGH 366
           +L ENPNDKVFIP+E  +Y S+SWPKES+YIHQ+G PSFALLSALRLWAT PN+RR VGH
Sbjct: 310 VLDENPNDKVFIPLEPSMYSSNSWPKESMYIHQDGKPSFALLSALRLWATPPNQRRSVGH 369

Query: 367 LAYAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKT 426
           LAY+GSQLSV+NET V++W+SK+CH +LNNLPT +EED+ LL  I K+Q+L  P EL + 
Sbjct: 370 LAYSGSQLSVENETWVLKWISKSCHEILNNLPTKVEEDHLLLSTIDKIQNLYNPMELGQM 429

Query: 427 LLTYGGEFCAFLETNGV-VNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGYC 484
           L  + GEF  FLE + +   ++  E   S K K++++RWKLAVQWR  YKK +VDCI  C
Sbjct: 430 LCQFKGEFRDFLEASSIGKGKNGDELMLSSKTKQAIERWKLAVQWRFRYKKIVVDCISSC 489

BLAST of CSPI03G21300 vs. TrEMBL
Match: G7KFS1_MEDTR (SET domain group 40 protein OS=Medicago truncatula GN=MTR_5g076640 PE=4 SV=2)

HSP 1 Score: 560.8 bits (1444), Expect = 1.6e-156
Identity = 288/481 (59.88%), Postives = 354/481 (73.60%), Query Frame = 1

Query: 1   METE-GSLGSLLRWAADHGISDSVDQPT-----SHSCLGHSLCVSFFPDTGGRGLAAVRQ 60
           ME E GS    L W +  GISDS    T     S S LGHSLCVS FP +GGRGL AVR 
Sbjct: 1   MEQEHGSFERFLTWTSHLGISDSPTTNTDQSQHSLSSLGHSLCVSTFPHSGGRGLGAVRD 60

Query: 61  LKKGELVLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWW 120
           LK+GE++LR PKS L+T++S+ +ED+KL +A+ R+ SLSS Q LT CLLYE+ KG +S W
Sbjct: 61  LKRGEIILRVPKSALMTSESVIMEDKKLCLAVNRHSSLSSVQILTVCLLYEVGKGKTSRW 120

Query: 121 FPYLKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQ 180
            PYL HLPQSYD+LA FGEFEKQALQVD A+W TEKA  K++++W+    LM++   K Q
Sbjct: 121 HPYLVHLPQSYDLLAMFGEFEKQALQVDEAMWVTEKAVQKAKSEWKEAHALMEDLMFKPQ 180

Query: 181 LQTFKAWLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASL 240
           L TFKAW+WA+ATISSRTL++PWDEAGCLCPVGDLFNY AP  E     DV  F S+  +
Sbjct: 181 LLTFKAWVWAAATISSRTLHIPWDEAGCLCPVGDLFNYDAPGEELSGVEDVDHFLSNGDM 240

Query: 241 NDELELLEEQRDSQWALTDGGFEENASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYG 300
           N  ++  +   +SQ  LTDGGFEE+A+AYCFYAR +Y+KG+QVLL YGTYTNLELLE+YG
Sbjct: 241 NVVIDEGQIDFNSQ-RLTDGGFEEDANAYCFYARTNYKKGDQVLLCYGTYTNLELLEHYG 300

Query: 301 FLLQENPNDKVFIPIEHDIYGSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVG 360
           FLLQENPNDK+FIP+E  +Y S+SW KESLYIH NG PSFALL+ALRLWAT  NKRR +G
Sbjct: 301 FLLQENPNDKIFIPLEPAMYTSTSWSKESLYIHPNGKPSFALLAALRLWATPHNKRRSIG 360

Query: 361 HLAYAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQK 420
           HLAY+GSQLS  NE +VM+WLSK C  VL N+PTSIE+D  LL  +   QD     ++ K
Sbjct: 361 HLAYSGSQLSADNEIIVMKWLSKTCDAVLKNMPTSIEDDTLLLNALDCSQDFITFMKIVK 420

Query: 421 TLLTYGGEFCAFLETNGVVNR-DEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGY 475
            L++   E   FLE + + +     ++ SS+K +RS+DRWKLAV WRL YK+ LVDCI Y
Sbjct: 421 -LMSSRDEVYTFLEAHNITDALSFCDTISSKKTRRSMDRWKLAVLWRLRYKRVLVDCISY 479

BLAST of CSPI03G21300 vs. TrEMBL
Match: M5XBQ3_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004975mg PE=4 SV=1)

HSP 1 Score: 560.5 bits (1443), Expect = 2.1e-156
Identity = 283/483 (58.59%), Postives = 349/483 (72.26%), Query Frame = 1

Query: 4   EGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGELVLR 63
           +G L  LL+WAA+ GISDS       SCLGHSL VS+FP  GGRGL A R L++GEL+L+
Sbjct: 5   QGYLERLLKWAAEIGISDSTC--CGDSCLGHSLDVSYFPSAGGRGLGAARDLREGELLLK 64

Query: 64  APKSILLTTQSLSLEDEKLDMALKRYP--SLSSTQKLTFCLLYEISKGPSSWWFPYLKHL 123
            PKS+L+T +SL L+DEKL +++  Y   SLS TQ L  CLLYE+ KG  SWW PYL +L
Sbjct: 65  VPKSVLMTKESLLLKDEKLSLSVNDYAHHSLSPTQILAVCLLYEMGKGKISWWHPYLMNL 124

Query: 124 PQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKAW 183
           P+SYDILATFGEFEKQALQVD AIWA EKA LK+  +W+    LM++  +K QL TFKAW
Sbjct: 125 PRSYDILATFGEFEKQALQVDDAIWAAEKATLKAEYEWKEANALMKQLKLKPQLLTFKAW 184

Query: 184 LWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDE---L 243
           LWASATISSRTL++PWD AGCLCPVGDLFNY+AP GE  +  + +    H  +N++   +
Sbjct: 185 LWASATISSRTLHIPWDAAGCLCPVGDLFNYSAP-GEEPSRCESMEHTMHDLVNEDTSGM 244

Query: 244 ELLEEQRDSQWALTDGGFEENASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYGFLLQ 303
             +E+       LTDGGFE++  AYCFYA++SY+KGEQVLLSYGTYTNLELLE+YGFLL 
Sbjct: 245 ADVEQLVSDSRRLTDGGFEKDVDAYCFYAKKSYKKGEQVLLSYGTYTNLELLEHYGFLLN 304

Query: 304 ENPNDKVFIPIEHDIYGSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAY 363
           ENPNDKV+IP+E +IY S SWPKESL+IHQNG PSFALLS LRLWAT  N+RR VGHL Y
Sbjct: 305 ENPNDKVYIPLEPEIYSSCSWPKESLFIHQNGKPSFALLSTLRLWATPQNQRRSVGHLVY 364

Query: 364 AGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLT 423
           +G  LS++NE  +++W+SK C T+L NL TS E+D+ LL  I K+Q+L  P EL     T
Sbjct: 365 SGLHLSIQNEMFILRWISKKCTTILKNLSTSFEDDSLLLSAIDKIQNLDAPLELNNVSST 424

Query: 424 YGGEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGYCTTTI 482
              E CAF     V+ + E  S  S+      +RW+LAV+WRL YKK LVDCI YC   +
Sbjct: 425 CRDEICAF--KANVLQKGERSSMESK------ERWRLAVEWRLSYKKILVDCISYCDEIV 476

BLAST of CSPI03G21300 vs. TAIR10
Match: AT5G17240.1 (AT5G17240.1 SET domain group 40)

HSP 1 Score: 510.8 bits (1314), Expect = 9.5e-145
Identity = 266/482 (55.19%), Postives = 341/482 (70.75%), Query Frame = 1

Query: 6   SLGSLLRWAADHGISDSVDQPTSH-SCLGHSLCVSFFPDTGGRGLAAVRQLKKGELVLRA 65
           ++ + LRWAA+ GISDS+D      SCLGHSL VS FPD GGRGL A R+LKKGELVL+ 
Sbjct: 7   TMETFLRWAAEIGISDSIDSSRFRDSCLGHSLSVSDFPDAGGRGLGAARELKKGELVLKV 66

Query: 66  PKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKHLPQS 125
           P+  L+TT+S+  +D KL  A+  + SLSSTQ L+ CLLYE+SK   S+W+PYL H+P+ 
Sbjct: 67  PRKALMTTESIIAKDLKLSDAVNLHNSLSSTQILSVCLLYEMSKEKKSFWYPYLFHIPRD 126

Query: 126 YDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKAWLWA 185
           YD+LATFG FEKQALQV+ A+WATEKA  K +++W+    LM+E  +K + ++F+AWLWA
Sbjct: 127 YDLLATFGNFEKQALQVEDAVWATEKATAKCQSEWKEAGSLMKELELKPKFRSFQAWLWA 186

Query: 186 SATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELELLEEQ 245
           SATISSRTL+VPWD AGCLCPVGDLFNY AP   S    +    P  A+  +E  L+ E 
Sbjct: 187 SATISSRTLHVPWDSAGCLCPVGDLFNYDAPGDYS----NTPQGPESANNVEEAGLVVET 246

Query: 246 RDSQWALTDGGFEENASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYGFLLQENPNDK 305
              +  LTDGGFEE+ +AYC YAR +Y+ GEQVLL YGTYTNLELLE+YGF+L+EN NDK
Sbjct: 247 HSER--LTDGGFEEDVNAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFMLEENSNDK 306

Query: 306 VFIPIEHDIYG-SSSWPKESLYIHQNGNPSFALLSALRLWATHPNKR-RGVGHLAYAGSQ 365
           VFIP+E  ++  +SSWPK+SLYIHQ+G  SFAL+S LRLW    ++R + V  L YAGSQ
Sbjct: 307 VFIPLETSLFSLASSWPKDSLYIHQDGKLSFALISTLRLWLIPQSQRDKSVMRLVYAGSQ 366

Query: 366 LSVKNETLVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYGGE 425
           +SVKNE LVM+W+S+ C +VL +LPTS+ ED  LL NI K+QD ++  E QK    +G E
Sbjct: 367 ISVKNEILVMKWMSEKCGSVLRDLPTSVTEDTVLLHNIDKLQDPELRLE-QKETEAFGSE 426

Query: 426 FCAFLETN---GVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGYCTTTIC 482
             AFL+ N    V          S+K  R L +W+ +VQWRL YK+ L DCI YC   + 
Sbjct: 427 VRAFLDANCLWDVTVLSGKPIEFSRKTSRMLSKWRWSVQWRLSYKRTLADCISYCNEKMN 481

BLAST of CSPI03G21300 vs. NCBI nr
Match: gi|449456212|ref|XP_004145844.1| (PREDICTED: protein SET DOMAIN GROUP 40 [Cucumis sativus])

HSP 1 Score: 986.1 bits (2548), Expect = 2.2e-284
Identity = 482/483 (99.79%), Postives = 482/483 (99.79%), Query Frame = 1

Query: 1   METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL 60
           METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL
Sbjct: 1   METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL 60

Query: 61  VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH 120
           VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH
Sbjct: 61  VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH 120

Query: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA 180
           LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA
Sbjct: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA 180

Query: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELEL 240
           WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELEL
Sbjct: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELEL 240

Query: 241 LEEQRDSQWALTDGGFEENASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYGFLLQEN 300
           LEEQRDSQWALTDGGFEENASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYGFLLQEN
Sbjct: 241 LEEQRDSQWALTDGGFEENASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYGFLLQEN 300

Query: 301 PNDKVFIPIEHDIYGSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAG 360
           PNDKVFIPIEHDIYGSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAG
Sbjct: 301 PNDKVFIPIEHDIYGSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAG 360

Query: 361 SQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYG 420
           SQLSVKNE LVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYG
Sbjct: 361 SQLSVKNEILVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYG 420

Query: 421 GEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGYCTTTICS 480
           GEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGYCTTTICS
Sbjct: 421 GEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGYCTTTICS 480

Query: 481 LSS 484
           LSS
Sbjct: 481 LSS 483

BLAST of CSPI03G21300 vs. NCBI nr
Match: gi|700202665|gb|KGN57798.1| (hypothetical protein Csa_3G307670 [Cucumis sativus])

HSP 1 Score: 963.4 bits (2489), Expect = 1.5e-277
Identity = 471/472 (99.79%), Postives = 471/472 (99.79%), Query Frame = 1

Query: 1   METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL 60
           METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL
Sbjct: 1   METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL 60

Query: 61  VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH 120
           VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH
Sbjct: 61  VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH 120

Query: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA 180
           LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA
Sbjct: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA 180

Query: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELEL 240
           WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELEL
Sbjct: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELEL 240

Query: 241 LEEQRDSQWALTDGGFEENASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYGFLLQEN 300
           LEEQRDSQWALTDGGFEENASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYGFLLQEN
Sbjct: 241 LEEQRDSQWALTDGGFEENASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYGFLLQEN 300

Query: 301 PNDKVFIPIEHDIYGSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAG 360
           PNDKVFIPIEHDIYGSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAG
Sbjct: 301 PNDKVFIPIEHDIYGSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAG 360

Query: 361 SQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYG 420
           SQLSVKNE LVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYG
Sbjct: 361 SQLSVKNEILVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYG 420

Query: 421 GEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIG 473
           GEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIG
Sbjct: 421 GEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIG 472

BLAST of CSPI03G21300 vs. NCBI nr
Match: gi|659114359|ref|XP_008457030.1| (PREDICTED: protein SET DOMAIN GROUP 40 isoform X2 [Cucumis melo])

HSP 1 Score: 917.5 bits (2370), Expect = 9.5e-264
Identity = 450/483 (93.17%), Postives = 464/483 (96.07%), Query Frame = 1

Query: 1   METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL 60
           METEGS GSLLRWAADHGISDS+DQ TS SCLG SLCVSFFPD+GGRGLAAVRQL KGEL
Sbjct: 1   METEGSFGSLLRWAADHGISDSIDQHTSRSCLGRSLCVSFFPDSGGRGLAAVRQLNKGEL 60

Query: 61  VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH 120
           +LRAPKS+LLTTQSLSLEDEKL MALK +PSLSSTQKLTFCLL EISKG SS WFPYLKH
Sbjct: 61  ILRAPKSVLLTTQSLSLEDEKLAMALKIFPSLSSTQKLTFCLLNEISKGASSRWFPYLKH 120

Query: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA 180
           LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSR DWRGV+GLMQESNIK+QLQTFKA
Sbjct: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRMDWRGVKGLMQESNIKNQLQTFKA 180

Query: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELEL 240
           WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNA+DVLSFPSHASLNDELE 
Sbjct: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAMDVLSFPSHASLNDELES 240

Query: 241 LEEQRDSQWALTDGGFEENASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYGFLLQEN 300
           LEEQRDSQW LTDGGFEENASAYCFYARESY+KGEQVLLSYGTYTN+ELLEYYGFLLQEN
Sbjct: 241 LEEQRDSQWDLTDGGFEENASAYCFYARESYKKGEQVLLSYGTYTNIELLEYYGFLLQEN 300

Query: 301 PNDKVFIPIEHDIYGSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAG 360
           PNDKVFIPIEHDIY SSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAG
Sbjct: 301 PNDKVFIPIEHDIYVSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAG 360

Query: 361 SQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYG 420
           SQLSVKNETLVMQWLSKNCHTVLNNLPTSIEED+QLLCNIAKVQDLQV REL+K LLTYG
Sbjct: 361 SQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDDQLLCNIAKVQDLQVQRELRKMLLTYG 420

Query: 421 GEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGYCTTTICS 480
           GE CAFLETNGVVNRDEAESH S+KLKRSL+RWKLAVQWRLLYKKALVDCIGYCT TICS
Sbjct: 421 GECCAFLETNGVVNRDEAESHLSEKLKRSLERWKLAVQWRLLYKKALVDCIGYCTRTICS 480

Query: 481 LSS 484
           LSS
Sbjct: 481 LSS 483

BLAST of CSPI03G21300 vs. NCBI nr
Match: gi|659114357|ref|XP_008457029.1| (PREDICTED: protein SET DOMAIN GROUP 40 isoform X1 [Cucumis melo])

HSP 1 Score: 911.4 bits (2354), Expect = 6.8e-262
Identity = 450/488 (92.21%), Postives = 464/488 (95.08%), Query Frame = 1

Query: 1   METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGG-----RGLAAVRQL 60
           METEGS GSLLRWAADHGISDS+DQ TS SCLG SLCVSFFPD+GG     RGLAAVRQL
Sbjct: 1   METEGSFGSLLRWAADHGISDSIDQHTSRSCLGRSLCVSFFPDSGGKLCFRRGLAAVRQL 60

Query: 61  KKGELVLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWF 120
            KGEL+LRAPKS+LLTTQSLSLEDEKL MALK +PSLSSTQKLTFCLL EISKG SS WF
Sbjct: 61  NKGELILRAPKSVLLTTQSLSLEDEKLAMALKIFPSLSSTQKLTFCLLNEISKGASSRWF 120

Query: 121 PYLKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQL 180
           PYLKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSR DWRGV+GLMQESNIK+QL
Sbjct: 121 PYLKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRMDWRGVKGLMQESNIKNQL 180

Query: 181 QTFKAWLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLN 240
           QTFKAWLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNA+DVLSFPSHASLN
Sbjct: 181 QTFKAWLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAMDVLSFPSHASLN 240

Query: 241 DELELLEEQRDSQWALTDGGFEENASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYGF 300
           DELE LEEQRDSQW LTDGGFEENASAYCFYARESY+KGEQVLLSYGTYTN+ELLEYYGF
Sbjct: 241 DELESLEEQRDSQWDLTDGGFEENASAYCFYARESYKKGEQVLLSYGTYTNIELLEYYGF 300

Query: 301 LLQENPNDKVFIPIEHDIYGSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGH 360
           LLQENPNDKVFIPIEHDIY SSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGH
Sbjct: 301 LLQENPNDKVFIPIEHDIYVSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGH 360

Query: 361 LAYAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKT 420
           LAYAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEED+QLLCNIAKVQDLQV REL+K 
Sbjct: 361 LAYAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDDQLLCNIAKVQDLQVQRELRKM 420

Query: 421 LLTYGGEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGYCT 480
           LLTYGGE CAFLETNGVVNRDEAESH S+KLKRSL+RWKLAVQWRLLYKKALVDCIGYCT
Sbjct: 421 LLTYGGECCAFLETNGVVNRDEAESHLSEKLKRSLERWKLAVQWRLLYKKALVDCIGYCT 480

Query: 481 TTICSLSS 484
            TICSLSS
Sbjct: 481 RTICSLSS 488

BLAST of CSPI03G21300 vs. NCBI nr
Match: gi|659114393|ref|XP_008457032.1| (PREDICTED: protein SET DOMAIN GROUP 40 isoform X4 [Cucumis melo])

HSP 1 Score: 673.7 bits (1737), Expect = 2.4e-190
Identity = 326/344 (94.77%), Postives = 335/344 (97.38%), Query Frame = 1

Query: 140 QVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKAWLWASATISSRTLYVPWDE 199
           QVDYAIWATEKAALKSR DWRGV+GLMQESNIK+QLQTFKAWLWASATISSRTLYVPWDE
Sbjct: 96  QVDYAIWATEKAALKSRMDWRGVKGLMQESNIKNQLQTFKAWLWASATISSRTLYVPWDE 155

Query: 200 AGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELELLEEQRDSQWALTDGGFEEN 259
           AGCLCPVGDLFNYAAPEGESFNA+DVLSFPSHASLNDELE LEEQRDSQW LTDGGFEEN
Sbjct: 156 AGCLCPVGDLFNYAAPEGESFNAMDVLSFPSHASLNDELESLEEQRDSQWDLTDGGFEEN 215

Query: 260 ASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYGFLLQENPNDKVFIPIEHDIYGSSSW 319
           ASAYCFYARESY+KGEQVLLSYGTYTN+ELLEYYGFLLQENPNDKVFIPIEHDIY SSSW
Sbjct: 216 ASAYCFYARESYKKGEQVLLSYGTYTNIELLEYYGFLLQENPNDKVFIPIEHDIYVSSSW 275

Query: 320 PKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSVKNETLVMQWLSKNC 379
           PKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSVKNETLVMQWLSKNC
Sbjct: 276 PKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSVKNETLVMQWLSKNC 335

Query: 380 HTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYGGEFCAFLETNGVVNRDEAE 439
           HTVLNNLPTSIEED+QLLCNIAKVQDLQV REL+K LLTYGGE CAFLETNGVVNRDEAE
Sbjct: 336 HTVLNNLPTSIEEDDQLLCNIAKVQDLQVQRELRKMLLTYGGECCAFLETNGVVNRDEAE 395

Query: 440 SHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGYCTTTICSLSS 484
           SH S+KLKRSL+RWKLAVQWRLLYKKALVDCIGYCT TICSLSS
Sbjct: 396 SHLSEKLKRSLERWKLAVQWRLLYKKALVDCIGYCTRTICSLSS 439

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
SDG40_ARATH1.7e-14355.19Protein SET DOMAIN GROUP 40 OS=Arabidopsis thaliana GN=SDG40 PE=2 SV=1[more]
SETD3_XENTR6.6e-1523.91Histone-lysine N-methyltransferase setd3 OS=Xenopus tropicalis GN=setd3 PE=2 SV=... [more]
SETD4_HUMAN6.2e-1324.49SET domain-containing protein 4 OS=Homo sapiens GN=SETD4 PE=2 SV=1[more]
SETD3_CHICK1.8e-1222.14Histone-lysine N-methyltransferase setd3 OS=Gallus gallus GN=SETD3 PE=2 SV=1[more]
SETD4_MOUSE1.7e-1024.31SET domain-containing protein 4 OS=Mus musculus GN=Setd4 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0L7L4_CUCSA1.1e-27799.79Uncharacterized protein OS=Cucumis sativus GN=Csa_3G307670 PE=4 SV=1[more]
A0A061EFC1_THECC7.8e-16460.79SET domain group 40, putative isoform 1 OS=Theobroma cacao GN=TCM_017553 PE=4 SV... [more]
A0A067KHN9_JATCU1.2e-15659.10Uncharacterized protein OS=Jatropha curcas GN=JCGZ_10483 PE=4 SV=1[more]
G7KFS1_MEDTR1.6e-15659.88SET domain group 40 protein OS=Medicago truncatula GN=MTR_5g076640 PE=4 SV=2[more]
M5XBQ3_PRUPE2.1e-15658.59Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004975mg PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G17240.19.5e-14555.19 SET domain group 40[more]
Match NameE-valueIdentityDescription
gi|449456212|ref|XP_004145844.1|2.2e-28499.79PREDICTED: protein SET DOMAIN GROUP 40 [Cucumis sativus][more]
gi|700202665|gb|KGN57798.1|1.5e-27799.79hypothetical protein Csa_3G307670 [Cucumis sativus][more]
gi|659114359|ref|XP_008457030.1|9.5e-26493.17PREDICTED: protein SET DOMAIN GROUP 40 isoform X2 [Cucumis melo][more]
gi|659114357|ref|XP_008457029.1|6.8e-26292.21PREDICTED: protein SET DOMAIN GROUP 40 isoform X1 [Cucumis melo][more]
gi|659114393|ref|XP_008457032.1|2.4e-19094.77PREDICTED: protein SET DOMAIN GROUP 40 isoform X4 [Cucumis melo][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001214SET_dom
IPR015353Rubisco_LSMT_subst-bd
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI03G21300.1CSPI03G21300.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001214SET domainPFAMPF00856SETcoord: 46..282
score: 9.1
IPR001214SET domainPROFILEPS50280SETcoord: 34..282
score: 10
IPR015353Rubisco LSMT, substrate-binding domainGENE3DG3DSA:3.90.1420.10coord: 451..474
score: 1.3E-9coord: 322..407
score: 1.
IPR015353Rubisco LSMT, substrate-binding domainPFAMPF09273Rubis-subs-bindcoord: 333..461
score: 4.
IPR015353Rubisco LSMT, substrate-binding domainunknownSSF81822RuBisCo LSMT C-terminal, substrate-binding domaincoord: 292..399
score: 8.7
NoneNo IPR availableGENE3DG3DSA:3.90.1410.10coord: 254..295
score: 5.0E-47coord: 9..222
score: 5.0
NoneNo IPR availablePANTHERPTHR13271UNCHARACTERIZED PUTATIVE METHYLTRANSFERASEcoord: 2..220
score: 1.5E-201coord: 258..402
score: 1.5E-201coord: 449..483
score: 1.5E
NoneNo IPR availablePANTHERPTHR13271:SF19PROTEIN SET DOMAIN GROUP 40coord: 2..220
score: 1.5E-201coord: 449..483
score: 1.5E-201coord: 258..402
score: 1.5E
NoneNo IPR availableunknownSSF82199SET domaincoord: 252..300
score: 7.19E-40coord: 7..212
score: 7.19