Lsi05G003440 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi05G003440
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: Protein SET DOMAIN GROUP 40
Location: chr05:4354513..4360738 (-)
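
The span of the gene on the chromosome follows directly from the Location coordinates. A minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention; the record itself does not state this), with `feature_length` being a hypothetical helper name:

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


# Coordinates copied from the Location field of this record.
gene_length = feature_length(4354513, 4360738)
print(f"Lsi05G003440 spans {gene_length} bp on chr05 (minus strand)")
```

The strand sign does not change the length; it only indicates that the listed sequences are presumably given as the reverse complement of the reference strand.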
The following sequences are available for this feature:

Gene sequence (with intron)

AAAAAGTTATTTGGCATTTGAAAAAAAAGGGCCAATTGATATTTTGGACCCAAATAGCAGGTCATTCGTATAAATTTCCCAAATATTTTGAACTTAAAAGCCAAATCTAAATTTGACTCAAAGCTTAAGGGATGGAAACGTATTTTTCCCTCAAGGGGCAGAGGTTTTGATGGAGGGTTTGTGAAATGGAAACTGAAGGAAGTTTTGGAAGCCTGCTGAGATGGGCGGCGGATCATGGAATTTCAGATCCTGTCGACCAACAGACTTCACATTCTTGTTTGGGTCGTTCTTTGTGCGTCTGTTTCTTCCCTGATGCCGGCGGGTACGCTCTGATTTCTCCTTTTCTAGTTACATCGCTCTTTTCTTTCTTTCGCCGCTTGTAAGTTGTTCATTTTGAGGGAAATTTTGCTTCAGGAGAGGTTTAGGGGCTGTTCGTCAGCTTAACAAAGGAGAGTTAGTGCTGAGAGTTCCAAAATCTGTCTTGTTAACGACCCAAAGTTTGTCGTTGGAAGATGAGAAGCTCGCCATGGCTCTGAAGAGATACCCATCTCTTTCTTCTACTCAGGTTCTTCCTTTGCTAACATAATTAGAACGAACTTTTGTGAGAAAAAGGATGCTTCCTTTTGTAGGTTTGGTTTTTTTTTTTTTCTTTTTTTTTTCTTTTTTTTTTTTTTTTTGTATGCCTCGTAAGTTGGTCTTTGAGGGAGGGGACTATCAGATACATGAGGCACCTCCCTGTGCCTAATGAGGTAGGTTTTGAAAGTGATTACAGCCCAAAAAGTGATTACATTTCAAAGTAGGAGAGATATGGAATGGAAAATAGAAGGGATCTTTATACACGAGCGACCATAGGTATGCTTTTTTTTTTTTTTTTCTTCCTGATAGAAACAACTGTCTTCATTTAGAAAAAAAAAATGAAAGAATACAAGGGCATACAAAAAACCAAGCCCACCAAAGACCAAAGACCAAACCTTGCTAGGGTTTCTCTCTACACCTTGAAACACTCTATTGTTCCTCTCCCCCCAAAGATTCCACAATAACGCACACACCCCTGCTAACCAAAGAAAACGACCTTTCTCTTTGAAGTGCGGATGGAGGAGGAACTTCCTGATGATCAACTGAACATCCCTCTGACGAGCAAGCAAAAAATCAAACGCTTGAAAGAAACCATTCCATACAGATCTCGCAAACTGACACTCCCATAGGAGGTGATCCATGTTTTCCTCCACCTTTCAACACAGGATACAACAAAAAGGACTCATTAAAGAAGATATCTTTCTCAAAATCCTATCCATCATGTTCACACAACCAAGCAAAACTTGCTTCGGAAAGAACTTAAACTTTCTTTGGAATTTTAATCCTCCATAAAACATCAAAGACTGACTCAACAAAAGGAGAGAGATCCAATAAAATCCAAAGAATGACTTACAAAAAAATCCCTCCAAAGGGTTAGGACTCCAAACACGAACATCCCTTCTCCCAAGCCTAAAGTTAAAAACCCTCAACAAGGAAAGAAGAGAAGCCACTTTCGTCGTTTCCCTATTGGACAACAGTCGACAGAAACCAAAGGAGAAAGAAACACAAGTTCCCCGACCAAACCAAAAAGTCCGACACGAAACAATTATTCAAGAAGGAAAAATGATGTAACCTAGAAAACACAGAAGAGAGAGGTCTATCCCATACCCAATGATCTTCCCAAAAATACGCTTCCTTACCATCTCCCACAACACAAAAAACAAGATGGAAAAAAGAAGGGAGCTGAATAGAACCACAAATTTCGGTGCGTGCCATTTAACCCCTCGCCACCCACTCACAGCGATGGGGACCATGTTTGCTCACAATAGTTCTCTGCCGTAGGTATGCTTAAAGATGTTTTATATTCCATTCTTTGTTAATGGTTAATCGTCCTTAATGCAACTGCGACAAAGATCTTAACTGATTTTAATTCTCTTTTATATTTAAGGTTTATCTAAAACATTTCTTGTGTGTTTATCTAAGAC
ATTACCCATTTTCCCCTGTTAGTTCTAAAGGTGTACTAATATTAATTTCACAATCCCTCCCCCCCACTCAAATGCAGAAGTTGACTTTTTGTTTACTCTATGAGATTGGTAAAGGAACTAGTTCTTTGTGGCTCCCTTACTTAAAGCATTTGCCCCAGAGTTACGACATACTGGCAACTTTTGGAGAATTTGAAAAGCAAGCCCTGCAGGTTCAATTTTCCGTTTGTGGCATTTTTATTTATTTCTTAGGATTATATTCGTATTCGTGAACCATATTTAATTAGGATTTTTTACATGTGCTTTTGACATTTGTTTTATAATAAAATAGTCAAAGTCATTAACAGGTTGATTATGCTATCTGGGCAACAGAGAAGGCTGCTTTGAAATCTCGTACGGAGTGGAGAGGAGTCAAAGGACTAATGCAAGAGTCCAATATTAAAAACCAACTCCAAACATTCAAGGCATGGCTTTGGGCCTCTGCAACTGTAAGCACACTGGATTCATGCTAGAAGAATGTATTCCATAGCTAGATTGGACCTGGATTTGTTTTTTTTTTTGGGTAAGAAACCAAACTTTCATTGAGAAAAAAATGAGAGAAAGAAGGCATATGAAAAACAGCCAACAAAAAAAGGAGGATCCCCAACTAACTACAAAGAGGGCTCCAATCTAACAAATGCCAAATTCATAATTACAAAAGAGTATTAACAGACATATACTTATTACAAAAAACCTGCGTTTAGGTTTCTATTTTTTTTTAATAATTAATTATTTATTTGTTTGGTGAAAATTTTATGCATCATTTGAAGGTGGTTTAGCTGGATGTCCAAGGTCACTGCAAATATGCTTTTCTTATAGAAGAAAACACCACCAAGTGACACCCATTTGCTGATTATCTATCTTCCATTACCCATTACCTAAGAAGCACGGATACGGATACAAGACATGGATACGACACGGACACGGTGACACGCCATATTTTAAATATCTAGGACACGATACGACAATGACACGTTTAATAAAATATACATTTTTAAAAATATATATCATTTTCATACTAGAATAAAATTAAAATAAATGGGTTGATGCATTTATATGCTTAAAAGACTTAACTTGATGTATTTCACACTCAAAAGTTATTATTATTCTCATATATGTGTCTTCTTGGTCTACTCAACAAGTGTTCAATGCATGTCTAACATATTTGTTGTACTAACAAGTGTCCGATACGTGTCCAACAAGTGTCAGAGTGTCCAAGTGTCTGACACGTGTCGGACATGGACATGCTAGCCAAATTAAAGTGTCCGTGCTTCTTAGCCCATTACCTTGTAGTCCTGTACACATTACTCATTACCAGCCAGTAAGCATCATGACTTCTATTCGATCTAATATGCTCCCTCTCCCATCCCACCTTGCCCTCCCTAAAGTCCTGACCCCCTCACTTAAGGGCCCCCTTAACTTAGGTATTCTTTGTCTCCTTGAAATCTATGGACTGTAACTACTGTCATTGAACTTTGGTGGTATATAAAGGGACCTATATTACTGAAGTTGATGGTGGATGTCTATAAATGAAGCAAACCAAAGCTCTCTAATATTGGAAAGGATAAGTTTTTCTTCATCTCTCGTCAATGGAGCACTCTCGTGGATTCAAAATTGCTATGTAGGCCTGCTCCTCTTCCCTCTTAATGGTTTTTTCCCCCAAGAATTCTATAGTGAGGATTATGTGTTTGCATTGAATAAATCTCCTATAGGAAACACTAAAGAGACTACTGTTGATATCATTAAGCTGAACTCAAGAGACACATTTTATTCCATGTAGGACAGAATTGCTCAGGTCAGTTCTCCTCCTCATGAAAGATTACCCTTGTGCCACCCTGCCTCTAATAGTGGACTGTTTTTCTTCTGACTTGTGAAGTGGAAGTTGCTTGCACAAAGGCTTTTGGATCAGCCCAGACAAGGATACTAGGATGTTCGTTTAAGTACGATTGAATTTAGTACTCACA
TCCCTTCAATTTTTCCCATAGCTTTTTGTTGCCATAGAATATAGAATTCATCCTCTTTTTTATGGATGAGAATTGTCGAGACTTGAGAGTTTGTTTTGTAATTTTCTCTTAGCCCGTCTTTTGGAGGATATATTATCGTCCTCACTTGTATCTATTTTTAATGTTAATGCAATTATTTGGTTCTTATAGAAAAACGAGTGTTTCTTTATTTGTCTTCATGTGTTGTATGATGACTCTAAATGGCATGGTTTGGTCCGATGTACATTTTTTCAATCCTAATCTCAGTTTGTTGAACTTCCTTCTACGTGTAGTAGTTAAAAACTAAAAATTTATGTTGCCCTTTGAGAGATATCATCTAGGGCATTGTATGTACCATGGGATGAGGCCGGATGTTTATGTCCAGTTGGTGACTTGTTTAATTATGCTGCACCTGAAGGGGAGTCCCTTGATGTTATGGATGTTTCGTCTTTTTCACCACATGCTTCTTTGAATGGAGACATGACTACTGATGAGTTACATGAAGAGAAAAGAGATACTCAATGGGCTTTGACAGATGGTGGATTTGAGGAAAATGTTTCTGCCTACTGCTTCTATGCTCGGGAAAGTTATAAGAAGGGAGAGCAGGTATTTTATATTTTCTTGGTGTCATTTAATATTTTGTACTCGTGAAAAGTGGAGTTGATGCAAAATACATTAATTGGAAGTTTAACATTTTAGTTATTGAGTTTGATTTCCTATTATGAAGAGGGTTGAAATTTACATTAACCTTTTTTATTTGCTCTCTGATGCATTCTTGTTAATCTGCACCTTGAGATTTCCATAGTAAAACGAAACCAGAGATTGTTTCTTTGTGTCTTAGACTGTTATTGCACTGATGTTGAACTTTTTCTTTTTTCATCTTTTTATTTTCTGTTATTACTGTTACTTTTTCTGCCCCAAAGGTATTTTATTCTCATTATAGTTCGTAGGTTAGTTGCTTTTTTGGGTCTTATCCATCTCTCCAATTCTTTGTGTTACTAACACATCCATCAAACGTGACTCTATACCTCACCATTTTTTTTTAAAAAAAAAGCAACAGCTGAAAACGAATTAATTTGATGTTTTGGTGGGTAGCAATATTTTCAAAATGCAAGGCGCCCCCAAAGTGTCCCGTCTTTTGATTGTTTGTATATGAGTGACCTAGAAAATTAACACCTCAGTGAAGCAAAACCATACTTTATAAATTAATAAACAAATACAAAATATTTAAGACATGAATATCAATAAGTCCATTTAAAATTTCAAAAGAAAAAAAATGGTCTGTCTCTCAAACATAATTGCTTTGCATGAATCTTTCGTTCTCAAAAAAGAATGTGTGGATGAGTAACGATTGAAACCTTTCTATTTTTTACTGCAGGTTCTTTTAAGCTATGGTACATACTCAAATTTAGAGCTTCTTGAATATTATGGGTTTCTTCTACAGGAAAATCCAAATGACAAAGTTTTCATTCCTATGGAACATGACATTTATAGTTCCAGTTCTTGGCCCAAGGAGTCTCTTTATGTTCATCAAAATGGAAACCCATCTTTTGCTCTACTTTCTGCTCTACGATTATGGGCAACCCACCCGAACAAGCGCAGAGGTGTCGGGCACCTTGCTTATGCTGGGTCACAACTCTCCATCAAGAATGAAATATTAGTCATGCAGTGGTTATCCAAGAACTGCCATACTGTTCTAAACAATCTGCCAACATCAGTTGAAGAAGACAATCAGCTTCTGTGCAACATCTGCAAAATCCAGGATCTACAGGTACCAAGGGAGCTCCAGAAGATGCTGTTGACTTATGGAGGTGAATTTTGTGCTTTCTTGGAGACCAATGGTCTGGTGAATAGAAATGAAGCCGAGTTACATTTATCCGGGAAAATAAAACGATCTCTGGAGAGATGGAAGCTAGCAGTCCAGTGGAGGCTCTTGTACAAGAAGGCTTTGGTTGATTGCATAAGTTACTGCACAAGAACTA
TTTGTTCTCTATCTTCTTAATCTGGTTCAGGTTGTGATTAGCAGGTTCTAACTTAAGTTACCTATTAAATGAACTTTTTAGGAATCAAAAGATAAGAGAATGGTATGAATATGATAGCATTGAACCATCTAGTAAGGATGGTGTTGGGAGAAGTTGTATATGTAATATCAGTATTTCATAACTCAAAGAGAATCCCTTATCAAGCTCTTTACTCGATCTCTTTTAC

mRNA sequence

AAAAAGTTATTTGGCATTTGAAAAAAAAGGGCCAATTGATATTTTGGACCCAAATAGCAGGTCATTCGTATAAATTTCCCAAATATTTTGAACTTAAAAGCCAAATCTAAATTTGACTCAAAGCTTAAGGGATGGAAACGTATTTTTCCCTCAAGGGGCAGAGGTTTTGATGGAGGGTTTGTGAAATGGAAACTGAAGGAAGTTTTGGAAGCCTGCTGAGATGGGCGGCGGATCATGGAATTTCAGATCCTGTCGACCAACAGACTTCACATTCTTGTTTGGGTCGTTCTTTGTGCGTCTGTTTCTTCCCTGATGCCGGCGGGAGAGGTTTAGGGGCTGTTCGTCAGCTTAACAAAGGAGAGTTAGTGCTGAGAGTTCCAAAATCTGTCTTGTTAACGACCCAAAGTTTGTCGTTGGAAGATGAGAAGCTCGCCATGGCTCTGAAGAGATACCCATCTCTTTCTTCTACTCAGGTTGATTATGCTATCTGGGCAACAGAGAAGGCTGCTTTGAAATCTCGTACGGAGTGGAGAGGAGTCAAAGGACTAATGCAAGAGTCCAATATTAAAAACCAACTCCAAACATTCAAGGCATGGCTTTGGGCCTCTGCAACTTGGACTGTTTTTCTTCTGACTTGTGAAGTGGAAGTTGCTTGCACAAAGGCTTTTGGATCAGCCCAGACAAGGATACTAGGATGTTCGTTTAAGGCATTGTATGTACCATGGGATGAGGCCGGATGTTTATGTCCAGTTGGTGACTTGTTTAATTATGCTGCACCTGAAGGGGAGTCCCTTGATGTTATGGATGTTTCGTCTTTTTCACCACATGCTTCTTTGAATGGAGACATGACTACTGATGAGTTACATGAAGAGAAAAGAGATACTCAATGGGCTTTGACAGATGGTGGATTTGAGGAAAATGTTTCTGCCTACTGCTTCTATGCTCGGGAAAGTTATAAGAAGGGAGAGCAGGTTCTTTTAAGCTATGGTACATACTCAAATTTAGAGCTTCTTGAATATTATGGGTTTCTTCTACAGGAAAATCCAAATGACAAAGTTTTCATTCCTATGGAACATGACATTTATAGTTCCAGTTCTTGGCCCAAGGAGTCTCTTTATGTTCATCAAAATGGAAACCCATCTTTTGCTCTACTTTCTGCTCTACGATTATGGGCAACCCACCCGAACAAGCGCAGAGGTGTCGGGCACCTTGCTTATGCTGGGTCACAACTCTCCATCAAGAATGAAATATTAGTCATGCAGTGGTTATCCAAGAACTGCCATACTGTTCTAAACAATCTGCCAACATCAGTTGAAGAAGACAATCAGCTTCTGTGCAACATCTGCAAAATCCAGGATCTACAGGTACCAAGGGAGCTCCAGAAGATGCTGTTGACTTATGGAGGTGAATTTTGTGCTTTCTTGGAGACCAATGGTCTGGTGAATAGAAATGAAGCCGAGTTACATTTATCCGGGAAAATAAAACGATCTCTGGAGAGATGGAAGCTAGCAGTCCAGTGGAGGCTCTTGTACAAGAAGGCTTTGGTTGATTGCATAAGTTACTGCACAAGAACTATTTGTTCTCTATCTTCTTAATCTGGTTCAGGTTGTGATTAGCAGGTTCTAACTTAAGTTACCTATTAAATGAACTTTTTAGGAATCAAAAGATAAGAGAATGGTATGAATATGATAGCATTGAACCATCTAGTAAGGATGGTGTTGGGAGAAGTTGTATATGTAATATCAGTATTTCATAACTCAAAGAGAATCCCTTATCAAGCTCTTTACTCGATCTCTTTTAC

Coding sequence (CDS)

ATGGAAACTGAAGGAAGTTTTGGAAGCCTGCTGAGATGGGCGGCGGATCATGGAATTTCAGATCCTGTCGACCAACAGACTTCACATTCTTGTTTGGGTCGTTCTTTGTGCGTCTGTTTCTTCCCTGATGCCGGCGGGAGAGGTTTAGGGGCTGTTCGTCAGCTTAACAAAGGAGAGTTAGTGCTGAGAGTTCCAAAATCTGTCTTGTTAACGACCCAAAGTTTGTCGTTGGAAGATGAGAAGCTCGCCATGGCTCTGAAGAGATACCCATCTCTTTCTTCTACTCAGGTTGATTATGCTATCTGGGCAACAGAGAAGGCTGCTTTGAAATCTCGTACGGAGTGGAGAGGAGTCAAAGGACTAATGCAAGAGTCCAATATTAAAAACCAACTCCAAACATTCAAGGCATGGCTTTGGGCCTCTGCAACTTGGACTGTTTTTCTTCTGACTTGTGAAGTGGAAGTTGCTTGCACAAAGGCTTTTGGATCAGCCCAGACAAGGATACTAGGATGTTCGTTTAAGGCATTGTATGTACCATGGGATGAGGCCGGATGTTTATGTCCAGTTGGTGACTTGTTTAATTATGCTGCACCTGAAGGGGAGTCCCTTGATGTTATGGATGTTTCGTCTTTTTCACCACATGCTTCTTTGAATGGAGACATGACTACTGATGAGTTACATGAAGAGAAAAGAGATACTCAATGGGCTTTGACAGATGGTGGATTTGAGGAAAATGTTTCTGCCTACTGCTTCTATGCTCGGGAAAGTTATAAGAAGGGAGAGCAGGTTCTTTTAAGCTATGGTACATACTCAAATTTAGAGCTTCTTGAATATTATGGGTTTCTTCTACAGGAAAATCCAAATGACAAAGTTTTCATTCCTATGGAACATGACATTTATAGTTCCAGTTCTTGGCCCAAGGAGTCTCTTTATGTTCATCAAAATGGAAACCCATCTTTTGCTCTACTTTCTGCTCTACGATTATGGGCAACCCACCCGAACAAGCGCAGAGGTGTCGGGCACCTTGCTTATGCTGGGTCACAACTCTCCATCAAGAATGAAATATTAGTCATGCAGTGGTTATCCAAGAACTGCCATACTGTTCTAAACAATCTGCCAACATCAGTTGAAGAAGACAATCAGCTTCTGTGCAACATCTGCAAAATCCAGGATCTACAGGTACCAAGGGAGCTCCAGAAGATGCTGTTGACTTATGGAGGTGAATTTTGTGCTTTCTTGGAGACCAATGGTCTGGTGAATAGAAATGAAGCCGAGTTACATTTATCCGGGAAAATAAAACGATCTCTGGAGAGATGGAAGCTAGCAGTCCAGTGGAGGCTCTTGTACAAGAAGGCTTTGGTTGATTGCATAAGTTACTGCACAAGAACTATTTGTTCTCTATCTTCTTAA

Protein sequence

METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSSTQVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVACTKAFGSAQTRILGCSFKALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTICSLSS
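
The protein sequence corresponds to the coding sequence read codon by codon. A minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1); the `translate` function and the short test fragment are illustrative and are not part of the database record:

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS listed above.
print(translate("ATGGAAACTGAAGGAAGT"))
```

Passing the full CDS string from this record to `translate` should reproduce the protein sequence shown above, provided the standard code applies.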
BLAST of Lsi05G003440 vs. Swiss-Prot
Match: SDG40_ARATH (Protein SET DOMAIN GROUP 40 OS=Arabidopsis thaliana GN=SDG40 PE=2 SV=1)

HSP 1 Score: 371.7 bits (953), Expect = 1.2e-101
Identity = 223/489 (45.60%), Positives = 296/489 (60.53%), Query Frame = 1

Query: 9   SLLRWAADHGISDPVDQQTSH-SCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKS 68
           + LRWAA+ GISD +D      SCLG SL V  FPDAGGRGLGA R+L KGELVL+VP+ 
Sbjct: 10  TFLRWAAEIGISDSIDSSRFRDSCLGHSLSVSDFPDAGGRGLGAARELKKGELVLKVPRK 69

Query: 69  VLLTTQSLSLEDEKLAMALKRYPSLSSTQVDYAIWATEKAALKSRTEW--------RGVK 128
            L+TT+S+  +D KL+ A+  + SLSSTQ+  ++    + + + ++ W        R   
Sbjct: 70  ALMTTESIIAKDLKLSDAVNLHNSLSSTQI-LSVCLLYEMSKEKKSFWYPYLFHIPRDYD 129

Query: 129 GLMQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVACTKAFGSAQTRI----------- 188
            L    N + Q    +  +WA+   T     C+ E    K  GS    +           
Sbjct: 130 LLATFGNFEKQALQVEDAVWATEKATA---KCQSE---WKEAGSLMKELELKPKFRSFQA 189

Query: 189 -----LGCSFKALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTT 248
                   S + L+VPWD AGCLCPVGDLFNY AP   S       S +   ++      
Sbjct: 190 WLWASATISSRTLHVPWDSAGCLCPVGDLFNYDAPGDYSNTPQGPESAN---NVEEAGLV 249

Query: 249 DELHEEKRDTQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLL 308
            E H E+      LTDGGFEE+V+AYC YAR +Y+ GEQVLL YGTY+NLELLE+YGF+L
Sbjct: 250 VETHSER------LTDGGFEEDVNAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFML 309

Query: 309 QENPNDKVFIPMEHDIYS-SSSWPKESLYVHQNGNPSFALLSALRLWATHPNKR-RGVGH 368
           +EN NDKVFIP+E  ++S +SSWPK+SLY+HQ+G  SFAL+S LRLW    ++R + V  
Sbjct: 310 EENSNDKVFIPLETSLFSLASSWPKDSLYIHQDGKLSFALISTLRLWLIPQSQRDKSVMR 369

Query: 369 LAYAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKM 428
           L YAGSQ+S+KNEILVM+W+S+ C +VL +LPTSV ED  LL NI K+QD ++  E QK 
Sbjct: 370 LVYAGSQISVKNEILVMKWMSEKCGSVLRDLPTSVTEDTVLLHNIDKLQDPELRLE-QKE 429

Query: 429 LLTYGGEFCAFLETNGLVN---RNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDCIS 468
              +G E  AFL+ N L +    +   +  S K  R L +W+ +VQWRL YK+ L DCIS
Sbjct: 430 TEAFGSEVRAFLDANCLWDVTVLSGKPIEFSRKTSRMLSKWRWSVQWRLSYKRTLADCIS 481
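
The Identity and Positives percentages in each HSP header are simple ratios of matching columns to alignment length. A short sketch of that arithmetic, using the counts from the Swiss-Prot hit above; `percent` is a hypothetical helper, and rounding to two decimals is an assumption based on how the values are displayed:

```python
def percent(matches: int, alignment_length: int) -> str:
    """Format matches / alignment_length as a percentage with two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    return f"{100 * matches / alignment_length:.2f}%"


# Counts taken from the SDG40_ARATH HSP header.
identity = percent(223, 489)
positives = percent(296, 489)
print(f"Identity = {identity}, Positives = {positives}")
```

The denominator is the alignment length including gap columns, not the length of the query protein, which is why it differs between hits.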

BLAST of Lsi05G003440 vs. TrEMBL
Match: A0A0A0L7L4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G307670 PE=4 SV=1)

HSP 1 Score: 715.3 bits (1845), Expect = 4.8e-203
Identity = 367/508 (72.24%), Positives = 393/508 (77.36%), Query Frame = 1

Query: 1   METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGEL 60
           METEGS GSLLRWAADHGISD VDQ TSHSCLG SLCV FFPD GGRGL AVRQL KGEL
Sbjct: 1   METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL 60

Query: 61  VLRVPKSVLLTTQSLSLED--------------------EKLAMALKRYPSL-------- 120
           VLR PKS+LLTTQSLSLED                      L   + + PS         
Sbjct: 61  VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH 120

Query: 121 ----------------SSTQVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKA 180
                            + QVDYAIWATEKAALKSRT+WRGV+GLMQESNIK+QLQTFKA
Sbjct: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA 180

Query: 181 WLWASATWTVFLLTCEVEVACTKAFGSAQTRILGCSFKALYVPWDEAGCLCPVGDLFNYA 240
           WLWASAT                            S + LYVPWDEAGCLCPVGDLFNYA
Sbjct: 181 WLWASAT---------------------------ISSRTLYVPWDEAGCLCPVGDLFNYA 240

Query: 241 APEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARES 300
           APEGES + +DV SF  HASLN ++   EL EE+RD+QWALTDGGFEEN SAYCFYARES
Sbjct: 241 APEGESFNAVDVLSFPSHASLNDEL---ELLEEQRDSQWALTDGGFEENASAYCFYARES 300

Query: 301 YKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIYSSSSWPKESLYVHQNG 360
           Y+KGEQVLLSYGTY+NLELLEYYGFLLQENPNDKVFIP+EHDIY SSSWPKESLY+HQNG
Sbjct: 301 YRKGEQVLLSYGTYTNLELLEYYGFLLQENPNDKVFIPIEHDIYGSSSWPKESLYIHQNG 360

Query: 361 NPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSV 420
           NPSFALLSALRLWATHPNKRRGVGHLAYAGSQLS+KNEILVMQWLSKNCHTVLNNLPTS+
Sbjct: 361 NPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSVKNEILVMQWLSKNCHTVLNNLPTSI 420

Query: 421 EEDNQLLCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLVNRNEAELHLSGKIKRSL 465
           EEDNQLLCNI K+QDLQVPRELQK LLTYGGEFCAFLETNG+VNR+EAE H S K+KRSL
Sbjct: 421 EEDNQLLCNIAKVQDLQVPRELQKTLLTYGGEFCAFLETNGVVNRDEAESHSSQKLKRSL 478

BLAST of Lsi05G003440 vs. TrEMBL
Match: A0A061ELB5_THECC (Set domain group 40, putative isoform 3 OS=Theobroma cacao GN=TCM_017553 PE=4 SV=1)

HSP 1 Score: 484.6 bits (1246), Expect = 1.4e-133
Identity = 259/470 (55.11%), Positives = 320/470 (68.09%), Query Frame = 1

Query: 2   ETEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGELV 61
           E  GS  S L+WAA  G+SD  +   S SCLG SL V +FPDAGGRGLGAVR + +GEL+
Sbjct: 24  EERGSLDSFLKWAAGLGVSDSPNPD-SCSCLGHSLGVSYFPDAGGRGLGAVRDITRGELL 83

Query: 62  LRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSSTQVDYAIWATEKAALKSRTEWRGVKGL 121
           L+VPKS L+TT SL L DE+L+ ALK +PSLS  QVDYAIWA +KA  K+  EW+    L
Sbjct: 84  LKVPKSALITTHSL-LNDERLSTALKAHPSLSPAQVDYAIWAAQKALSKAEYEWKKATPL 143

Query: 122 MQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVACTKAFGSAQTRILGCSFKALYVPWD 181
           M+E  +K Q  TF+AW+WA+ T                            S + L++PWD
Sbjct: 144 MKELKLKLQFLTFRAWIWATGT---------------------------ISSRTLHIPWD 203

Query: 182 EAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWA--LTD 241
           EAGCLCPVGDLFNYAAP GE     D++ F    +L      D+L     DTQ +  LTD
Sbjct: 204 EAGCLCPVGDLFNYAAP-GE-----DLNGFDNVDNLQNGYALDDL-----DTQHSQRLTD 263

Query: 242 GGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDI 301
           G FEE+ +AYCFYA+ +YKKGEQVLLSYGTY+NLELLEYYGFLL++NPN+KVFIP+E DI
Sbjct: 264 GAFEEDAAAYCFYAKTNYKKGEQVLLSYGTYTNLELLEYYGFLLEDNPNEKVFIPLEPDI 323

Query: 302 YSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQ 361
           +SSSSWP +SLY+HQNG PSFAL++ALR+WAT P +R+ + H AY+GSQLS  NEI VM 
Sbjct: 324 HSSSSWPNDSLYIHQNGRPSFALMAALRVWATPPYQRKSIRHQAYSGSQLSQDNEISVMT 383

Query: 362 WLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLV 421
           W++K CH  L  +PTS+E+DN LL    KIQ+     E  K +  +GGEFC  L+   L 
Sbjct: 384 WIAKKCHATLKAMPTSIEDDNLLLSFTDKIQEFDNLWEWGKAMPAFGGEFCNLLQATNL- 443

Query: 422 NRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTICSLSS 470
            RN+ E   S + K  ++RWKLAV WRL+YKK LVDCISYCT TI SLSS
Sbjct: 444 KRND-ESFASRRAKMLIDRWKLAVHWRLIYKKVLVDCISYCTDTINSLSS 451

BLAST of Lsi05G003440 vs. TrEMBL
Match: B9H3F8_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0004s07950g PE=4 SV=2)

HSP 1 Score: 471.9 bits (1213), Expect = 9.3e-130
Identity = 265/504 (52.58%), Positives = 324/504 (64.29%), Query Frame = 1

Query: 7   FGSLLRWAADHGISDPVDQQTSH-----SCLGRSLCVCFFPDAGGRGLGAVRQLNKGELV 66
           F   L+WAA+ GISD     + H     SCLG SL V  FPDAGGRGL AVR L KGELV
Sbjct: 37  FERFLKWAANLGISDCTTNLSLHPQSPTSCLGHSLTVSHFPDAGGRGLAAVRDLKKGELV 96

Query: 67  LRVPKSVLLTTQSLSLEDEKLAMALKR--YPSLSSTQV---------------------- 126
           LRVPKSVL+T  SL L+DEKL   +    Y SLS TQ+                      
Sbjct: 97  LRVPKSVLITRDSL-LKDEKLCSFVNNNTYSSLSPTQILAVCLLYEMGKGKSSWWYPYLM 156

Query: 127 ----DYAIWAT-EKAALKSRTEWRGVKGLMQESNIKNQLQTFKAWLWASATWTVFLLTCE 186
                Y + A+ +KA  K+++EW+    LM    +K QL TF+AW+WASAT         
Sbjct: 157 HLPRSYDVLASFKKAVSKAKSEWKEANSLMDALKLKPQLLTFRAWIWASAT--------- 216

Query: 187 VEVACTKAFGSAQTRILGCSFKALYVPWDEAGCLCPVGDLFNYAAPEGESLDV------M 246
                              S +AL++PWDEAGCLCPVGDLFNYAAP  ES D+      M
Sbjct: 217 ------------------ISSRALHIPWDEAGCLCPVGDLFNYAAPGEESNDLENVVHLM 276

Query: 247 DVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARESYKKGEQVLLS 306
           + SS    +  NG+ T D + ++       LTDGGF EN++AYCFYAR++YKKG QVLL 
Sbjct: 277 NASSLEDTSLSNGETTDDFIGDQPDIGLERLTDGGFNENMAAYCFYARKNYKKGTQVLLG 336

Query: 307 YGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFALLSAL 366
           YGTY+NLELLE+YGFLL ENPNDKVFIP+E  +YS  SWPK S+Y+HQ+G PSFALLSAL
Sbjct: 337 YGTYTNLELLEHYGFLLNENPNDKVFIPLEPSMYSFISWPKVSMYIHQDGKPSFALLSAL 396

Query: 367 RLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLLCNI 426
           RLWAT PN+RR + HL Y+GS+LS+ NEI V++W+SKNC  +L+NLPT +EED+ LL  I
Sbjct: 397 RLWATPPNQRRSISHLVYSGSRLSVYNEISVLKWISKNCALILSNLPTVIEEDSLLLSTI 456

Query: 427 CKIQDLQVPRELQKMLLTYGGEFCAFLETNGL-VNRNEAELHLSGKIKRSLERWKLAVQW 470
            KI++   P EL   + T GGE  AFLE + L   +N +EL  SGK KR +ERWKLAVQW
Sbjct: 457 NKIENFDKPTEL---VCTSGGEARAFLEASDLQKGKNGSELMFSGKTKRVIERWKLAVQW 509

BLAST of Lsi05G003440 vs. TrEMBL
Match: A0A067KHN9_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_10483 PE=4 SV=1)

HSP 1 Score: 444.1 bits (1141), Expect = 2.1e-121
Identity = 258/488 (52.87%), Positives = 313/488 (64.14%), Query Frame = 1

Query: 11  LRWAADHGISDP---VDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKS 70
           L WAA+ GISD       +  +SC G SL +  FPDAGGRGLGA R L KGELVLRVPK 
Sbjct: 14  LEWAAELGISDSPYNFQSRNPNSCFGNSLTLSHFPDAGGRGLGAARDLWKGELVLRVPKP 73

Query: 71  VLLTTQSLSLEDEKLAMALKRYPSLSSTQVDYAIWATEKAALKSRTEWRGVKGLMQESNI 130
            LLT  SL L+D  L+  +  +PSLS TQ+       E    KS + W     LM     
Sbjct: 74  ALLTRDSL-LKDGLLSSFVNGHPSLSPTQILTVCLLYEMGKGKS-SFW--YPYLMHLPRS 133

Query: 131 KNQLQTF-----KAWLWASATWTVFLLTCEVEVACTKAFGSAQTRILGCSF--------- 190
              L TF     +A+    A WT      + E    +A    Q   L   F         
Sbjct: 134 YETLATFSEFEKQAFQVDDAVWTTEKAISKAESEWKEANLLMQELKLKPRFLTLRAWIWA 193

Query: 191 ------KALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSS----FSPHASLNGDMTT 250
                 + L++PWDEAGCLCPVGDLFNYAAP  ES  +    S     SP  SL+    T
Sbjct: 194 SATISSRTLHIPWDEAGCLCPVGDLFNYAAPGEESTGLESAESCMLNSSPQGSLSCGHPT 253

Query: 251 DELHEEKRDTQ-WALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFL 310
           D L+E + D     LTDGGF+E++ AYCFYAR++YKKGEQVLLSYGTY+NLELLE+YGF+
Sbjct: 254 DYLYEGRFDAHLQRLTDGGFDEDLDAYCFYARKNYKKGEQVLLSYGTYTNLELLEHYGFV 313

Query: 311 LQENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHL 370
           L ENPNDKVFIP+E  +YSS+SWPKES+Y+HQ+G PSFALLSALRLWAT PN+RR VGHL
Sbjct: 314 LDENPNDKVFIPLEPSMYSSNSWPKESMYIHQDGKPSFALLSALRLWATPPNQRRSVGHL 373

Query: 371 AYAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKML 430
           AY+GSQLS++NE  V++W+SK+CH +LNNLPT VEED+ LL  I KIQ+L  P EL +ML
Sbjct: 374 AYSGSQLSVENETWVLKWISKSCHEILNNLPTKVEEDHLLLSTIDKIQNLYNPMELGQML 433

Query: 431 LTYGGEFCAFLETNGL-VNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDCISYCT 470
             + GEF  FLE + +   +N  EL LS K K+++ERWKLAVQWR  YKK +VDCIS CT
Sbjct: 434 CQFKGEFRDFLEASSIGKGKNGDELMLSSKTKQAIERWKLAVQWRFRYKKIVVDCISSCT 493

BLAST of Lsi05G003440 vs. TrEMBL
Match: V4SX96_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10011537mg PE=4 SV=1)

HSP 1 Score: 432.6 bits (1111), Expect = 6.3e-118
Identity = 250/490 (51.02%), Positives = 310/490 (63.27%), Query Frame = 1

Query: 2   ETEGSFGSLLRWAADHGISDPVDQQTSHS--CLGRSLCVCFFPDAGGRGLGAVRQLNKGE 61
           E + S   LL+WAA+ GI+D   Q  S S  CLG SL V  FP+AGGRGL A R L KGE
Sbjct: 3   EEDESLEKLLKWAAEMGITDSTIQNPSRSRNCLGHSLTVSHFPEAGGRGLAAARDLTKGE 62

Query: 62  LVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSSTQVDYAIWATEKAALKSRTEWRGVK 121
           L+LRVPK+ L TT+ L   D+K ++A+ R+  LS +Q+       E    KS   +  + 
Sbjct: 63  LILRVPKTALFTTECLLKSDQKRSLAVNRHLFLSPSQILIVCLLYEVGKGKSSRWYTYLM 122

Query: 122 GLMQESNI--------KNQLQTFKA-WL-----------WASATWTVFLLTCEVEVACTK 181
            L +   I        K  LQ   A W            W  A   +  L  + ++   K
Sbjct: 123 LLPRCYEILATFGPFEKQALQVDDAIWAAEKAVSKAESEWKQAIKLMEELKLKPQLLSFK 182

Query: 182 AFGSAQTRILGCSFKALYVPWDEAGCLCPVGDLFNYAAP-EGE--SLDVMDVSSFSPHAS 241
           A+  A   +   S + +++ WDEAGCLCPVGDLFNYAAP EGE  ++ + DV  + P   
Sbjct: 183 AWLWASATV---SSRTMHISWDEAGCLCPVGDLFNYAAPGEGEESNIGIEDVEGWMPAPC 242

Query: 242 LNGDMTTDELHEEKRDTQW-ALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLEL 301
           L    TTD L  EK +     LTDG FEE+V++YCFYAR +YK+GEQVLLSYGTY+NLEL
Sbjct: 243 LPKGDTTDVLDSEKFNGHLRRLTDGRFEEDVNSYCFYARNNYKRGEQVLLSYGTYTNLEL 302

Query: 302 LEYYGFLLQENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNK 361
           LE+YGFLL ENPNDKVFI +E  +YS  SWP+ES Y+ QNG PSFALLSALRLW T  N+
Sbjct: 303 LEHYGFLLNENPNDKVFISLEPGMYSCCSWPRESQYIDQNGKPSFALLSALRLWMTPANQ 362

Query: 362 RRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVP 421
           RR VGHLAY+G QLS+ NEI VM+WLS N   +LN+LPTS EED  LLC I KIQD+   
Sbjct: 363 RRSVGHLAYSGHQLSVDNEISVMKWLSNNSRVMLNSLPTSKEEDALLLCAIDKIQDIYTA 422

Query: 422 RELQKMLLTYGGEFCAFLETNGLVNRNE-AELHLSGKIKRSLERWKLAVQWRLLYKKALV 465
            EL+K+L  +GGE C FLE  G+  R   A+L LS K K S++RWKLA+QWRL YKK L 
Sbjct: 423 MELKKVLSDFGGEVCTFLENYGVQCRQRGAKLSLSRKTKLSMQRWKLAIQWRLRYKKTLA 482

BLAST of Lsi05G003440 vs. TAIR10
Match: AT5G17240.1 (AT5G17240.1 SET domain group 40)

HSP 1 Score: 371.7 bits (953), Expect = 6.7e-103
Identity = 223/489 (45.60%), Positives = 296/489 (60.53%), Query Frame = 1

Query: 9   SLLRWAADHGISDPVDQQTSH-SCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKS 68
           + LRWAA+ GISD +D      SCLG SL V  FPDAGGRGLGA R+L KGELVL+VP+ 
Sbjct: 10  TFLRWAAEIGISDSIDSSRFRDSCLGHSLSVSDFPDAGGRGLGAARELKKGELVLKVPRK 69

Query: 69  VLLTTQSLSLEDEKLAMALKRYPSLSSTQVDYAIWATEKAALKSRTEW--------RGVK 128
            L+TT+S+  +D KL+ A+  + SLSSTQ+  ++    + + + ++ W        R   
Sbjct: 70  ALMTTESIIAKDLKLSDAVNLHNSLSSTQI-LSVCLLYEMSKEKKSFWYPYLFHIPRDYD 129

Query: 129 GLMQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVACTKAFGSAQTRI----------- 188
            L    N + Q    +  +WA+   T     C+ E    K  GS    +           
Sbjct: 130 LLATFGNFEKQALQVEDAVWATEKATA---KCQSE---WKEAGSLMKELELKPKFRSFQA 189

Query: 189 -----LGCSFKALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTT 248
                   S + L+VPWD AGCLCPVGDLFNY AP   S       S +   ++      
Sbjct: 190 WLWASATISSRTLHVPWDSAGCLCPVGDLFNYDAPGDYSNTPQGPESAN---NVEEAGLV 249

Query: 249 DELHEEKRDTQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLL 308
            E H E+      LTDGGFEE+V+AYC YAR +Y+ GEQVLL YGTY+NLELLE+YGF+L
Sbjct: 250 VETHSER------LTDGGFEEDVNAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFML 309

Query: 309 QENPNDKVFIPMEHDIYS-SSSWPKESLYVHQNGNPSFALLSALRLWATHPNKR-RGVGH 368
           +EN NDKVFIP+E  ++S +SSWPK+SLY+HQ+G  SFAL+S LRLW    ++R + V  
Sbjct: 310 EENSNDKVFIPLETSLFSLASSWPKDSLYIHQDGKLSFALISTLRLWLIPQSQRDKSVMR 369

Query: 369 LAYAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKM 428
           L YAGSQ+S+KNEILVM+W+S+ C +VL +LPTSV ED  LL NI K+QD ++  E QK 
Sbjct: 370 LVYAGSQISVKNEILVMKWMSEKCGSVLRDLPTSVTEDTVLLHNIDKLQDPELRLE-QKE 429

Query: 429 LLTYGGEFCAFLETNGLVN---RNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDCIS 468
              +G E  AFL+ N L +    +   +  S K  R L +W+ +VQWRL YK+ L DCIS
Sbjct: 430 TEAFGSEVRAFLDANCLWDVTVLSGKPIEFSRKTSRMLSKWRWSVQWRLSYKRTLADCIS 481

BLAST of Lsi05G003440 vs. NCBI nr
Match: gi|659114393|ref|XP_008457032.1| (PREDICTED: protein SET DOMAIN GROUP 40 isoform X4 [Cucumis melo])

HSP 1 Score: 781.2 bits (2016), Expect = 1.0e-222
Identity = 387/469 (82.52%), Positives = 410/469 (87.42%), Query Frame = 1

Query: 1   METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGEL 60
           METEGSFGSLLRWAADHGISD +DQ TS SCLGRSLCV FFPD+GGRGL AVRQLNKGEL
Sbjct: 1   METEGSFGSLLRWAADHGISDSIDQHTSRSCLGRSLCVSFFPDSGGRGLAAVRQLNKGEL 60

Query: 61  VLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSSTQVDYAIWATEKAALKSRTEWRGVKG 120
           +LR PKSVLLTTQSLSLEDEKLAMALK +PSLSSTQVDYAIWATEKAALKSR +WRGVKG
Sbjct: 61  ILRAPKSVLLTTQSLSLEDEKLAMALKIFPSLSSTQVDYAIWATEKAALKSRMDWRGVKG 120

Query: 121 LMQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVACTKAFGSAQTRILGCSFKALYVPW 180
           LMQESNIKNQLQTFKAWLWASAT                            S + LYVPW
Sbjct: 121 LMQESNIKNQLQTFKAWLWASAT---------------------------ISSRTLYVPW 180

Query: 181 DEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDG 240
           DEAGCLCPVGDLFNYAAPEGES + MDV SF  HASLN ++   E  EE+RD+QW LTDG
Sbjct: 181 DEAGCLCPVGDLFNYAAPEGESFNAMDVLSFPSHASLNDEL---ESLEEQRDSQWDLTDG 240

Query: 241 GFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIY 300
           GFEEN SAYCFYARESYKKGEQVLLSYGTY+N+ELLEYYGFLLQENPNDKVFIP+EHDIY
Sbjct: 241 GFEENASAYCFYARESYKKGEQVLLSYGTYTNIELLEYYGFLLQENPNDKVFIPIEHDIY 300

Query: 301 SSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQW 360
            SSSWPKESLY+HQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLS+KNE LVMQW
Sbjct: 301 VSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSVKNETLVMQW 360

Query: 361 LSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLVN 420
           LSKNCHTVLNNLPTS+EED+QLLCNI K+QDLQV REL+KMLLTYGGE CAFLETNG+VN
Sbjct: 361 LSKNCHTVLNNLPTSIEEDDQLLCNIAKVQDLQVQRELRKMLLTYGGECCAFLETNGVVN 420

Query: 421 RNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTICSLSS 470
           R+EAE HLS K+KRSLERWKLAVQWRLLYKKALVDCI YCTRTICSLSS
Sbjct: 421 RDEAESHLSEKLKRSLERWKLAVQWRLLYKKALVDCIGYCTRTICSLSS 439

BLAST of Lsi05G003440 vs. NCBI nr
Match: gi|659114361|ref|XP_008457031.1| (PREDICTED: protein SET DOMAIN GROUP 40 isoform X3 [Cucumis melo])

HSP 1 Score: 775.0 bits (2000), Expect = 7.4e-221
Identity = 387/474 (81.65%), Positives = 410/474 (86.50%), Query Frame = 1

Query: 1   METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGG-----RGLGAVRQL 60
           METEGSFGSLLRWAADHGISD +DQ TS SCLGRSLCV FFPD+GG     RGL AVRQL
Sbjct: 1   METEGSFGSLLRWAADHGISDSIDQHTSRSCLGRSLCVSFFPDSGGKLCFRRGLAAVRQL 60

Query: 61  NKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSSTQVDYAIWATEKAALKSRTEW 120
           NKGEL+LR PKSVLLTTQSLSLEDEKLAMALK +PSLSSTQVDYAIWATEKAALKSR +W
Sbjct: 61  NKGELILRAPKSVLLTTQSLSLEDEKLAMALKIFPSLSSTQVDYAIWATEKAALKSRMDW 120

Query: 121 RGVKGLMQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVACTKAFGSAQTRILGCSFKA 180
           RGVKGLMQESNIKNQLQTFKAWLWASAT                            S + 
Sbjct: 121 RGVKGLMQESNIKNQLQTFKAWLWASAT---------------------------ISSRT 180

Query: 181 LYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQW 240
           LYVPWDEAGCLCPVGDLFNYAAPEGES + MDV SF  HASLN ++   E  EE+RD+QW
Sbjct: 181 LYVPWDEAGCLCPVGDLFNYAAPEGESFNAMDVLSFPSHASLNDEL---ESLEEQRDSQW 240

Query: 241 ALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIPM 300
            LTDGGFEEN SAYCFYARESYKKGEQVLLSYGTY+N+ELLEYYGFLLQENPNDKVFIP+
Sbjct: 241 DLTDGGFEENASAYCFYARESYKKGEQVLLSYGTYTNIELLEYYGFLLQENPNDKVFIPI 300

Query: 301 EHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEI 360
           EHDIY SSSWPKESLY+HQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLS+KNE 
Sbjct: 301 EHDIYVSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSVKNET 360

Query: 361 LVMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLLTYGGEFCAFLET 420
           LVMQWLSKNCHTVLNNLPTS+EED+QLLCNI K+QDLQV REL+KMLLTYGGE CAFLET
Sbjct: 361 LVMQWLSKNCHTVLNNLPTSIEEDDQLLCNIAKVQDLQVQRELRKMLLTYGGECCAFLET 420

Query: 421 NGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTICSLSS 470
           NG+VNR+EAE HLS K+KRSLERWKLAVQWRLLYKKALVDCI YCTRTICSLSS
Sbjct: 421 NGVVNRDEAESHLSEKLKRSLERWKLAVQWRLLYKKALVDCIGYCTRTICSLSS 444

BLAST of Lsi05G003440 vs. NCBI nr
Match: gi|449456212|ref|XP_004145844.1| (PREDICTED: protein SET DOMAIN GROUP 40 [Cucumis sativus])

HSP 1 Score: 734.9 bits (1896), Expect = 8.5e-209
Identity = 376/513 (73.29%), Positives = 401/513 (78.17%), Query Frame = 1

Query: 1   METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGEL 60
           METEGS GSLLRWAADHGISD VDQ TSHSCLG SLCV FFPD GGRGL AVRQL KGEL
Sbjct: 1   METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL 60

Query: 61  VLRVPKSVLLTTQSLSLED--------------------EKLAMALKRYPSL-------- 120
           VLR PKS+LLTTQSLSLED                      L   + + PS         
Sbjct: 61  VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH 120

Query: 121 ----------------SSTQVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKA 180
                            + QVDYAIWATEKAALKSRT+WRGV+GLMQESNIK+QLQTFKA
Sbjct: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA 180

Query: 181 WLWASATWTVFLLTCEVEVACTKAFGSAQTRILGCSFKALYVPWDEAGCLCPVGDLFNYA 240
           WLWASAT                            S + LYVPWDEAGCLCPVGDLFNYA
Sbjct: 181 WLWASAT---------------------------ISSRTLYVPWDEAGCLCPVGDLFNYA 240

Query: 241 APEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARES 300
           APEGES + +DV SF  HASLN ++   EL EE+RD+QWALTDGGFEEN SAYCFYARES
Sbjct: 241 APEGESFNAVDVLSFPSHASLNDEL---ELLEEQRDSQWALTDGGFEENASAYCFYARES 300

Query: 301 YKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIYSSSSWPKESLYVHQNG 360
           Y+KGEQVLLSYGTY+NLELLEYYGFLLQENPNDKVFIP+EHDIY SSSWPKESLY+HQNG
Sbjct: 301 YRKGEQVLLSYGTYTNLELLEYYGFLLQENPNDKVFIPIEHDIYGSSSWPKESLYIHQNG 360

Query: 361 NPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSV 420
           NPSFALLSALRLWATHPNKRRGVGHLAYAGSQLS+KNEILVMQWLSKNCHTVLNNLPTS+
Sbjct: 361 NPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSVKNEILVMQWLSKNCHTVLNNLPTSI 420

Query: 421 EEDNQLLCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLVNRNEAELHLSGKIKRSL 470
           EEDNQLLCNI K+QDLQVPRELQK LLTYGGEFCAFLETNG+VNR+EAE H S K+KRSL
Sbjct: 421 EEDNQLLCNIAKVQDLQVPRELQKTLLTYGGEFCAFLETNGVVNRDEAESHSSQKLKRSL 480

BLAST of Lsi05G003440 vs. NCBI nr
Match: gi|659114359|ref|XP_008457030.1| (PREDICTED: protein SET DOMAIN GROUP 40 isoform X2 [Cucumis melo])

HSP 1 Score: 733.8 bits (1893), Expect = 1.9e-208
Identity = 378/513 (73.68%), Positives = 402/513 (78.36%), Query Frame = 1

Query: 1   METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGEL 60
           METEGSFGSLLRWAADHGISD +DQ TS SCLGRSLCV FFPD+GGRGL AVRQLNKGEL
Sbjct: 1   METEGSFGSLLRWAADHGISDSIDQHTSRSCLGRSLCVSFFPDSGGRGLAAVRQLNKGEL 60

Query: 61  VLRVPKSVLLTTQSLSLED---------------------------EKLAMA-----LKR 120
           +LR PKSVLLTTQSLSLED                            K A +     LK 
Sbjct: 61  ILRAPKSVLLTTQSLSLEDEKLAMALKIFPSLSSTQKLTFCLLNEISKGASSRWFPYLKH 120

Query: 121 YPSL------------SSTQVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKA 180
            P               + QVDYAIWATEKAALKSR +WRGVKGLMQESNIKNQLQTFKA
Sbjct: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRMDWRGVKGLMQESNIKNQLQTFKA 180

Query: 181 WLWASATWTVFLLTCEVEVACTKAFGSAQTRILGCSFKALYVPWDEAGCLCPVGDLFNYA 240
           WLWASAT                            S + LYVPWDEAGCLCPVGDLFNYA
Sbjct: 181 WLWASAT---------------------------ISSRTLYVPWDEAGCLCPVGDLFNYA 240

Query: 241 APEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARES 300
           APEGES + MDV SF  HASLN ++   E  EE+RD+QW LTDGGFEEN SAYCFYARES
Sbjct: 241 APEGESFNAMDVLSFPSHASLNDEL---ESLEEQRDSQWDLTDGGFEENASAYCFYARES 300

Query: 301 YKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIYSSSSWPKESLYVHQNG 360
           YKKGEQVLLSYGTY+N+ELLEYYGFLLQENPNDKVFIP+EHDIY SSSWPKESLY+HQNG
Sbjct: 301 YKKGEQVLLSYGTYTNIELLEYYGFLLQENPNDKVFIPIEHDIYVSSSWPKESLYIHQNG 360

Query: 361 NPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSV 420
           NPSFALLSALRLWATHPNKRRGVGHLAYAGSQLS+KNE LVMQWLSKNCHTVLNNLPTS+
Sbjct: 361 NPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSI 420

Query: 421 EEDNQLLCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLVNRNEAELHLSGKIKRSL 470
           EED+QLLCNI K+QDLQV REL+KMLLTYGGE CAFLETNG+VNR+EAE HLS K+KRSL
Sbjct: 421 EEDDQLLCNIAKVQDLQVQRELRKMLLTYGGECCAFLETNGVVNRDEAESHLSEKLKRSL 480
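
The percentages on the HSP statistics lines in this record follow directly from the reported counts (for example, 378 identical positions over an alignment length of 513 gives 73.68%). As a minimal sketch, assuming the line layout shown in this record, the counts can be parsed and the percentages recomputed as below. The helper name `parse_hsp_stats` and the regular expression are illustrative assumptions, not part of the database or of any BLAST tool.

```python
import re

# Hypothetical helper, not part of the source page: it parses an HSP
# statistics line laid out like the ones in this record.
# The second label is matched loosely (\w+) so spelling variants still parse.
HSP_STATS = re.compile(
    r"Identity = (?P<ident>\d+)/(?P<length>\d+) \([\d.]+%\), "
    r"\w+ = (?P<pos>\d+)/\d+ \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident = int(match["ident"])
    length = int(match["length"])
    pos = int(match["pos"])
    return {
        "identities": ident,
        "positives": pos,
        "alignment_length": length,
        # Percentages are recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * ident / length, 2),
        "positive_pct": round(100 * pos / length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 378/513 (73.68%), Positives = 402/513 (78.36%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positive_pct"])  # prints 73.68 78.36
```

Recomputing from the counts, rather than trusting the printed percentage, makes it easy to spot a line whose counts and percentage disagree.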

BLAST of Lsi05G003440 vs. NCBI nr
Match: gi|659114357|ref|XP_008457029.1| (PREDICTED: protein SET DOMAIN GROUP 40 isoform X1 [Cucumis melo])

HSP 1 Score: 727.6 bits (1877), Expect = 1.4e-206
Identity = 378/518 (72.97%), Positives = 402/518 (77.61%), Query Frame = 1

Query: 1   METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGG-----RGLGAVRQL 60
           METEGSFGSLLRWAADHGISD +DQ TS SCLGRSLCV FFPD+GG     RGL AVRQL
Sbjct: 1   METEGSFGSLLRWAADHGISDSIDQHTSRSCLGRSLCVSFFPDSGGKLCFRRGLAAVRQL 60

Query: 61  NKGELVLRVPKSVLLTTQSLSLED---------------------------EKLAMA--- 120
           NKGEL+LR PKSVLLTTQSLSLED                            K A +   
Sbjct: 61  NKGELILRAPKSVLLTTQSLSLEDEKLAMALKIFPSLSSTQKLTFCLLNEISKGASSRWF 120

Query: 121 --LKRYPSL------------SSTQVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQL 180
             LK  P               + QVDYAIWATEKAALKSR +WRGVKGLMQESNIKNQL
Sbjct: 121 PYLKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRMDWRGVKGLMQESNIKNQL 180

Query: 181 QTFKAWLWASATWTVFLLTCEVEVACTKAFGSAQTRILGCSFKALYVPWDEAGCLCPVGD 240
           QTFKAWLWASAT                            S + LYVPWDEAGCLCPVGD
Sbjct: 181 QTFKAWLWASAT---------------------------ISSRTLYVPWDEAGCLCPVGD 240

Query: 241 LFNYAAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCF 300
           LFNYAAPEGES + MDV SF  HASLN ++   E  EE+RD+QW LTDGGFEEN SAYCF
Sbjct: 241 LFNYAAPEGESFNAMDVLSFPSHASLNDEL---ESLEEQRDSQWDLTDGGFEENASAYCF 300

Query: 301 YARESYKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIYSSSSWPKESLY 360
           YARESYKKGEQVLLSYGTY+N+ELLEYYGFLLQENPNDKVFIP+EHDIY SSSWPKESLY
Sbjct: 301 YARESYKKGEQVLLSYGTYTNIELLEYYGFLLQENPNDKVFIPIEHDIYVSSSWPKESLY 360

Query: 361 VHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNN 420
           +HQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLS+KNE LVMQWLSKNCHTVLNN
Sbjct: 361 IHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSVKNETLVMQWLSKNCHTVLNN 420

Query: 421 LPTSVEEDNQLLCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLVNRNEAELHLSGK 470
           LPTS+EED+QLLCNI K+QDLQV REL+KMLLTYGGE CAFLETNG+VNR+EAE HLS K
Sbjct: 421 LPTSIEEDDQLLCNIAKVQDLQVQRELRKMLLTYGGECCAFLETNGVVNRDEAESHLSEK 480

The following BLAST results are available for this feature:
Match Name                        E-value   Identity  Description
SDG40_ARATH                       1.2e-101  45.60     Protein SET DOMAIN GROUP 40 OS=Arabidopsis thaliana GN=SDG40 PE=2 SV=1

Match Name                        E-value   Identity  Description
A0A0A0L7L4_CUCSA                  4.8e-203  72.24     Uncharacterized protein OS=Cucumis sativus GN=Csa_3G307670 PE=4 SV=1
A0A061ELB5_THECC                  1.4e-133  55.11     Set domain group 40, putative isoform 3 OS=Theobroma cacao GN=TCM_017553 PE=4 SV...
B9H3F8_POPTR                      9.3e-130  52.58     Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0004s07950g PE=4 SV=2
A0A067KHN9_JATCU                  2.1e-121  52.87     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_10483 PE=4 SV=1
V4SX96_9ROSI                      6.3e-118  51.02     Uncharacterized protein OS=Citrus clementina GN=CICLE_v10011537mg PE=4 SV=1

Match Name                        E-value   Identity  Description
AT5G17240.1                       6.7e-103  45.60     SET domain group 40

Match Name                        E-value   Identity  Description
gi|659114393|ref|XP_008457032.1|  1.0e-222  82.52     PREDICTED: protein SET DOMAIN GROUP 40 isoform X4 [Cucumis melo]
gi|659114361|ref|XP_008457031.1|  7.4e-221  81.65     PREDICTED: protein SET DOMAIN GROUP 40 isoform X3 [Cucumis melo]
gi|449456212|ref|XP_004145844.1|  8.5e-209  73.29     PREDICTED: protein SET DOMAIN GROUP 40 [Cucumis sativus]
gi|659114359|ref|XP_008457030.1|  1.9e-208  73.68     PREDICTED: protein SET DOMAIN GROUP 40 isoform X2 [Cucumis melo]
gi|659114357|ref|XP_008457029.1|  1.4e-206  72.97     PREDICTED: protein SET DOMAIN GROUP 40 isoform X1 [Cucumis melo]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term        Definition
IPR015353   Rubisco_LSMT_subst-bd
IPR001214   SET_dom
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0005515      protein binding
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Lsi05G003440.1  Lsi05G003440.1  mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term   IPR Description                         Source   Source Term         Source Description                                 Alignment
IPR001214  SET domain                              PFAM     PF00856             SET                                                coord: 46..268, score: 9.
IPR001214  SET domain                              PROFILE  PS50280             SET                                                coord: 34..268, score: 10
IPR015353  Rubisco LSMT, substrate-binding domain  GENE3D   G3DSA:3.90.1420.10                                                     coord: 308..393, score: 4.1E-11; coord: 437..463, score: 4.1
IPR015353  Rubisco LSMT, substrate-binding domain  PFAM     PF09273             Rubis-subs-bind                                    coord: 319..384, score: 2.
IPR015353  Rubisco LSMT, substrate-binding domain  unknown  SSF81822            RuBisCo LSMT C-terminal, substrate-binding domain  coord: 278..391, score: 3.0
None       No IPR available                        GENE3D   G3DSA:3.90.1410.10                                                     coord: 9..113, score: 1.7E-6; coord: 114..281, score: 8.2
None       No IPR available                        PANTHER  PTHR13271           UNCHARACTERIZED PUTATIVE METHYLTRANSFERASE         coord: 435..469, score: 1.1E-141; coord: 244..388, score: 1.1E-141; coord: 2..215, score: 1.1E
None       No IPR available                        PANTHER  PTHR13271:SF19      PROTEIN SET DOMAIN GROUP 40                        coord: 435..469, score: 1.1E-141; coord: 244..388, score: 1.1E-141; coord: 2..215, score: 1.1E
None       No IPR available                        unknown  SSF82199            SET domain                                         coord: 8..149, score: 5.76E-21; coord: 177..286, score: 5.76