Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TGAAATTTTTAGTTGGAGGAGGACAGGAGAGACAGAAATGGAGATGGAAATGAGAGCAATGGAAGACATAGAAATGGCGGAAGACATTACTCCGCCATTGTTTCCCCTCACCGCCGCTCTCCATGATTCCTTCCTCCTCACTCATTGTTCCTCCTGCTTCTCCCCTCTCCCAAATCCCCCAATTTCTCACTCCAATCTCCTCCGCTACTGCTCCCTCAAATGCTCCCTTTCCCATTCTGATCCCCTCACCGCCGCCTTCTTCTCCGCCCATCCCTTCCCCTCCTCCGACACCTCCGACCTCCGCGCCTCCCTCCGCCTCCTCCACCTCCTCCTCTCCCATCCCTCCGCTTTCCACTCCGCTCCTCCCGAACGCATCTTTGGCCTTCTCACCAATCGACACAAATTGATGATCCCCCAAGACGATTCCCAAGTCTTCCTCAAGCTTCGGGAAGGGGTAGACGCAATAGCCGCTTCCAGAAGGAAGAACTCTGCCGATATTCGCCATGGAAACGCCTTGGAAGAGGCTGTTCTCTGCCTTGTCTTGACCAACGCCGTCGATGTTCAGGATTCGATCGGTCGCACCATTGGAATCGCTGTGTACGCTCCTACCTTCTGCTGGATTAATCACAGTTGTTCTCCCAACGCTTGTTACAGATTTGAAACTCCTTCGGATTCCACCACTACGAGGTTACGCATCGCCCCTTCCTGTACTGATCTTGTGACGAATGAAGGAAGTTGTAATCAAGTAATTGTTTGAACTACGAATAATTTTTGTGTTTATGGGGTTTTTTTCTCCCTTTCTGGTGAAATTAGTGAGAGCTTTTGCATGAATGTTTTTGGCAGATGGGTACTGTTCGTAGCAACCTTTCGGATTTCATAAGAGAAGGTGCGTTTCTTCACGTTTGCATTCAATTTAGTGCTGTAAGTGTTGTCTCTGTTTGTATATTTATGTGAAGTTTGATGAAGATTTTCAGGGTTATGGTCCAAGAGTTGTGGTTAGGAGTATAAAGGGTATAAGGAAAGGTGAAGCAGTCACAATCGCATACTGTGACTTGTTGCAACCTAAGGTATTGTAGCTATATCTTAGTTTTGTTAACTTCCCTTCAGGATATTTCAGAGGGAATTATTTTCTGAAGTAAGTTCCACATTAGCATGCCAAGTTTCATTATTACCTTAGCCCCATTTGATAAATATGGTTTTTTGTTTTTGAAAATTTAGCTTACTTCCACCTATAAGTTTCTATGGTTTGTTTTCTACTTTCTATGGTTTGTTTTCTACTTTCTACTCATGTTTTAAAAAATGGAGCCAAGTTTTAAAAGATAATAGAGTAGTTTTCAAACACATGTTCTTGTATTTAGATAATGAAATTGAGGGAAACAAGCATAAAATTTAATATATAGAAAACTACAAAACGAAATTGTTATCAAATGGCTTCTTAGTATATATTTTAACTCAAAGATTTTCAGAGTAAATGTATTTGGTTCTTGTGGAATATCGTTGTTAATTGCCTCTTCTTGTTGAGGAATTATGATGAAATGACTGGCTATAGATTTGTGAGAAATTTGTTGGTTTTGTCTCAGGCAATGAGGCAGTCAGAGTTGTGGTCAAGATATCAATTTGTCTGTAGTTGCCAGCGATGTAGTGCCGAGCCCCTAACTTATGTGGACCATGCTTTGCAAGTAAGAAAAACTAAAAGATGAGTAGTTTTTGAAAACTTGTTTTTGTTTTTGAAATTTGGCTAAGAATTCAAATGTTTCCTTTAACAAAGACAAAAACCATTGTAAAAGGAGGGGAGAAACAAACACAATTACCAATTTGTTAGGACTCTTCAGAAAACCCCAACAAAAACACAAATCTGGCAAAAACTTCCTTGATTTTTATATTTATATTCTGGAAAAGTATCACAGCAAAGTCAACAACAAGGGAAATAGCAAGAACAAGCAAAACCAATCAAAGCTAACCCAACACATTCGGGAGAGTTGGGAAATTCTCTCCTAGAAGACTACTCTCCTCTACTAAAAACTTCACACCCAAAACACTATTGAAAAACCCTACTTAAGCCCTCAGACATCCAATCTCTCTTAGCCATTCTTGTGGTCACTTCCCTTTTTGCTTAGCATAACAACCCTCCACTTCCACTAACAGTTTGCCCCCTTGGAAAAGATTCTCTTGCCCTTCCTTTCATATGTGTGAATAATTGGTGGCCTAACGATTCCCTTGGGTTTGAAACTCACCTTATCCTGGTGGAACCTAGGGAACGATCTATTCATCATCTCCGTGTCTTCCCAAGTGGCTTCACCCTTAGGTAGATTTGTCCACTTTATGAACCATTCATCACGAGCTCTATCACTGTTCCACCTCTTACCCAACACCACAACTGGTGTTACTTGCAATTCAAATTCATTAGACAAACCCAGGGGGCACTCTTGTGCAGCCACATTCAACCCATCACCTTCTTCAACTGTGAAACATGAAAAACATCATGGATCTTTGCCTCTGGGGTAACTCCAGCCTATAAGCTACTTCTCTTATACATTCTTTAATCACCTTGGCGACAATACACTTGGCAAATGATGCCAAGTTGTGGTGGAGGTCCTGTTACATGGACATCCAATAAGGTTGTTGTACCATCAACGCATGGGAAAAACTGAAACAAGAATGCGCAGCGGTTGGTGGGTGATCGTATGGGAGAAGTAGCTTATAGGTTGGAGTTACCTCCAAAGGCAAAGATTCCTGATGTTTTCCATGTCTCACAGTTGAAGAAGGTGATAGGATCGAATGTGGTTACACAAAAGTGCCCCCGGATTTGTCTGATAAATTTGAATTGCAAGTAACACTAGCTGTGGTGTTGGGTAAGAGGTGGAACAACGAAAGAACAAAAGAGCTCGTGATGAATGCCTTATAAAGTGGACTAAGCTATCTGAGGATGAAGCCACCTTCAGGACCAAGTGAGTTTCAAACCCGAGGGAATTGTTAGGCCACCAATTATTCACACATATGAAAGGAAGGGCAAGAGAGTCTTTTCCAAGGGGCATTCAATTAGTAGAAGTGAAGGATTGTTAAGCGAAAAATGGAGGGGACCAGGAGAATGGCTGAGAGCGCCTGGACGTCTGAGGGCTTAAGTAGGGTTTTTTCAATAGTGTTTTGGGTGTGAAGTTTTTAGTAGAAGAGAGACTAGGAGAGAATTTCCAAGCTTTCTTTAATGTTCTGGGTTAGCTTTGGTTAATGGTTTTACTTCTTGTTATTTCTCTTATTGACTTTGCTGTGATAGTTTTTCCAGAATATAAATATATAAATAAGGGAAGTTCTTGCCCTCTTTTCTGGGATTTGTGTGTTTTTGTTGGGATTTTCTGTATGCAGGTGGAAGATTCCTAACATTATCAAAGTGGGCCTAGCAATTAGGATGAATTTGGTATCACTCTTGACATAATGAAAAATGTATCTGGCCTTTTGAAAACTTTCAAGATGTTTGATTTAAAAGATTCTGACTATGCATAGAACACTTGGAAAAAAAAAAGAACTTAAGCATTTAAAAGCAGTTTAGAAGGAGAGTTTGGTATAGTATTTGGGAGAGTGAAAAGTGCTTTTAATTAATCAAAAGCACTTTTCCAAATTTGTATGGTGGATTGTAACATAAACAGTGTGATTTAAAAAAACTCTAAAAACACTTGAAGGCATTTTAAAGTTTTCCCCTTAGAAATGATCATTTTAGAAAACTCATTACAAACTCAAACCTATTCTCAAATTGTCGGTGATTAGATTTCTGGTTATATATGTTGTTCTTTTGAATATTTTGTCGTTGTTTTTTGGAGCTCTACCTTATTTTTTCTCTTATGAGGGAAGATAAATTTCTTCAGGAAATCTCTGCTGTCAAGGTGGAATTTCTTGATTCAACTTCCATTAGCAACTTTGATCATGACCAAGCAGTGAGAAGAATAGATGATTATGTCGACAATGCTATCACCGAGTACCTGTCTATCAGTTCTCCTGAATCGTGTTATGAGAAGCTTCAAAACTTGCTTACTTTAGGTTTCTGTGATGAGCAAGCGGAAGACGAGGAAGAAAAACAGCCAGTTAACCTGAGGCTGCATCCTTTGCACTTCCTGTCGCTGAATGCATACACTGCTCTCGCATCAGCTTACAAAGTCCGTTCATGTGATTTATTGGCTTTGAGTTCCAAAATGGACAATGACGATGAAAATCAACGTGGAGCATCTATCATGAGCAGAACAAGTGCAGCATACTCCTTGTTCCTTGCAGGTGCTACTCATCATCTTTTTCTTTCTGACCCATCTTTGATTGCGTCTGCTGCAAATTGTTGGGTTGTTGCTGGAGAGTCTTTGCTTATTCTTGCTAGTCACAGCTCATTATGGGCTACTACTAACTCTTCAAAATGGGGTTTACCTGTTGGAAAAAGAATGTGCTCTAACTGCTCATGGGTCGATAAGTTCAATGCGAGTAGAATCCTTGGTCGATCTATCGAAGCTGATTTTCACGAGTTTTCAATTGGTATTTCAAATTGTATTGCTAATATGTCACAAAAATCTTGGAGCTTTCTGACTCATGGATGCCCATATTTGAAGGCTTTCACTGACCCCTCTGATTTCAGCTGGCCAAAGGCTATCACAACATATTCGAATTACCAAGATCTACAGGCTCATAGCATCGATCCTTCGTGTGCTTGTAGTAAAACTAAGGATGTTTGTTTTCAGAGTGAACCTCAGCATTCTAACCAAGAGAGAGAATCTATCCTTGGGCTTGGCATCCATTGCTTATTCTATGGGGGCTATTTAGCAAGTATTTGTTATGGTCACCATTCACATTTGGCATCTCAGATTCAAAATATTTTACATGACTTGGATTAATACAGTTATCAAGTAGAAATGTAAATTATTCTGAGATTGAAACTTTTTTCTCCCACCCTACCCATGGTATAAAATTAGGCCTCGTTTCAT
mRNA sequence
TGAAATTTTTAGTTGGAGGAGGACAGGAGAGACAGAAATGGAGATGGAAATGAGAGCAATGGAAGACATAGAAATGGCGGAAGACATTACTCCGCCATTGTTTCCCCTCACCGCCGCTCTCCATGATTCCTTCCTCCTCACTCATTGTTCCTCCTGCTTCTCCCCTCTCCCAAATCCCCCAATTTCTCACTCCAATCTCCTCCGCTACTGCTCCCTCAAATGCTCCCTTTCCCATTCTGATCCCCTCACCGCCGCCTTCTTCTCCGCCCATCCCTTCCCCTCCTCCGACACCTCCGACCTCCGCGCCTCCCTCCGCCTCCTCCACCTCCTCCTCTCCCATCCCTCCGCTTTCCACTCCGCTCCTCCCGAACGCATCTTTGGCCTTCTCACCAATCGACACAAATTGATGATCCCCCAAGACGATTCCCAAGTCTTCCTCAAGCTTCGGGAAGGGGTAGACGCAATAGCCGCTTCCAGAAGGAAGAACTCTGCCGATATTCGCCATGGAAACGCCTTGGAAGAGGCTGTTCTCTGCCTTGTCTTGACCAACGCCGTCGATGTTCAGGATTCGATCGGTCGCACCATTGGAATCGCTGTGTACGCTCCTACCTTCTGCTGGATTAATCACAGTTGTTCTCCCAACGCTTGTTACAGATTTGAAACTCCTTCGGATTCCACCACTACGAGGTTACGCATCGCCCCTTCCTGTACTGATCTTGTGACGAATGAAGGAAGTTGTAATCAAATGGGTACTGTTCGTAGCAACCTTTCGGATTTCATAAGAGAAGGTGCGTTTCTTCACGGTTATGGTCCAAGAGTTGTGGTTAGGAGTATAAAGGGTATAAGGAAAGGTGAAGCAGTCACAATCGCATACTGTGACTTGTTGCAACCTAAGGAAATCTCTGCTGTCAAGGTGGAATTTCTTGATTCAACTTCCATTAGCAACTTTGATCATGACCAAGCAGTGAGAAGAATAGATGATTATGTCGACAATGCTATCACCGAGTACCTGTCTATCAGTTCTCCTGAATCGTGTTATGAGAAGCTTCAAAACTTGCTTACTTTAGGTTTCTGTGATGAGCAAGCGGAAGACGAGGAAGAAAAACAGCCAGTTAACCTGAGGCTGCATCCTTTGCACTTCCTGTCGCTGAATGCATACACTGCTCTCGCATCAGCTTACAAAGTCCGTTCATGTGATTTATTGGCTTTGAGTTCCAAAATGGACAATGACGATGAAAATCAACGTGGAGCATCTATCATGAGCAGAACAAGTGCAGCATACTCCTTGTTCCTTGCAGGTGCTACTCATCATCTTTTTCTTTCTGACCCATCTTTGATTGCGTCTGCTGCAAATTGTTGGGTTGTTGCTGGAGAGTCTTTGCTTATTCTTGCTAGTCACAGCTCATTATGGGCTACTACTAACTCTTCAAAATGGGGTTTACCTGTTGGAAAAAGAATGTGCTCTAACTGCTCATGGGTCGATAAGTTCAATGCGAGTAGAATCCTTGGTCGATCTATCGAAGCTGATTTTCACGAGTTTTCAATTGGTATTTCAAATTGTATTGCTAATATGTCACAAAAATCTTGGAGCTTTCTGACTCATGGATGCCCATATTTGAAGGCTTTCACTGACCCCTCTGATTTCAGCTGGCCAAAGGCTATCACAACATATTCGAATTACCAAGATCTACAGGCTCATAGCATCGATCCTTCGTGTGCTTGTAGTAAAACTAAGGATGTTTGTTTTCAGAGTGAACCTCAGCATTCTAACCAAGAGAGAGAATCTATCCTTGGGCTTGGCATCCATTGCTTATTCTATGGGGGCTATTTAGCAAGTATTTGTTATGGTCACCATTCACATTTGGCATCTCAGATTCAAAATATTTTACATGACTTGGATTAATACAGTTATCAAGTAGAAATGTAAATTATTCTGAGATTGAAACTTTTTTCTCCCACCCTACCCATGGTATAAAATTAGGCCTCGTTTCAT
Coding sequence (CDS)
ATGGAGATGGAAATGAGAGCAATGGAAGACATAGAAATGGCGGAAGACATTACTCCGCCATTGTTTCCCCTCACCGCCGCTCTCCATGATTCCTTCCTCCTCACTCATTGTTCCTCCTGCTTCTCCCCTCTCCCAAATCCCCCAATTTCTCACTCCAATCTCCTCCGCTACTGCTCCCTCAAATGCTCCCTTTCCCATTCTGATCCCCTCACCGCCGCCTTCTTCTCCGCCCATCCCTTCCCCTCCTCCGACACCTCCGACCTCCGCGCCTCCCTCCGCCTCCTCCACCTCCTCCTCTCCCATCCCTCCGCTTTCCACTCCGCTCCTCCCGAACGCATCTTTGGCCTTCTCACCAATCGACACAAATTGATGATCCCCCAAGACGATTCCCAAGTCTTCCTCAAGCTTCGGGAAGGGGTAGACGCAATAGCCGCTTCCAGAAGGAAGAACTCTGCCGATATTCGCCATGGAAACGCCTTGGAAGAGGCTGTTCTCTGCCTTGTCTTGACCAACGCCGTCGATGTTCAGGATTCGATCGGTCGCACCATTGGAATCGCTGTGTACGCTCCTACCTTCTGCTGGATTAATCACAGTTGTTCTCCCAACGCTTGTTACAGATTTGAAACTCCTTCGGATTCCACCACTACGAGGTTACGCATCGCCCCTTCCTGTACTGATCTTGTGACGAATGAAGGAAGTTGTAATCAAATGGGTACTGTTCGTAGCAACCTTTCGGATTTCATAAGAGAAGGTGCGTTTCTTCACGGTTATGGTCCAAGAGTTGTGGTTAGGAGTATAAAGGGTATAAGGAAAGGTGAAGCAGTCACAATCGCATACTGTGACTTGTTGCAACCTAAGGAAATCTCTGCTGTCAAGGTGGAATTTCTTGATTCAACTTCCATTAGCAACTTTGATCATGACCAAGCAGTGAGAAGAATAGATGATTATGTCGACAATGCTATCACCGAGTACCTGTCTATCAGTTCTCCTGAATCGTGTTATGAGAAGCTTCAAAACTTGCTTACTTTAGGTTTCTGTGATGAGCAAGCGGAAGACGAGGAAGAAAAACAGCCAGTTAACCTGAGGCTGCATCCTTTGCACTTCCTGTCGCTGAATGCATACACTGCTCTCGCATCAGCTTACAAAGTCCGTTCATGTGATTTATTGGCTTTGAGTTCCAAAATGGACAATGACGATGAAAATCAACGTGGAGCATCTATCATGAGCAGAACAAGTGCAGCATACTCCTTGTTCCTTGCAGGTGCTACTCATCATCTTTTTCTTTCTGACCCATCTTTGATTGCGTCTGCTGCAAATTGTTGGGTTGTTGCTGGAGAGTCTTTGCTTATTCTTGCTAGTCACAGCTCATTATGGGCTACTACTAACTCTTCAAAATGGGGTTTACCTGTTGGAAAAAGAATGTGCTCTAACTGCTCATGGGTCGATAAGTTCAATGCGAGTAGAATCCTTGGTCGATCTATCGAAGCTGATTTTCACGAGTTTTCAATTGGTATTTCAAATTGTATTGCTAATATGTCACAAAAATCTTGGAGCTTTCTGACTCATGGATGCCCATATTTGAAGGCTTTCACTGACCCCTCTGATTTCAGCTGGCCAAAGGCTATCACAACATATTCGAATTACCAAGATCTACAGGCTCATAGCATCGATCCTTCGTGTGCTTGTAGTAAAACTAAGGATGTTTGTTTTCAGAGTGAACCTCAGCATTCTAACCAAGAGAGAGAATCTATCCTTGGGCTTGGCATCCATTGCTTATTCTATGGGGGCTATTTAGCAAGTATTTGTTATGGTCACCATTCACATTTGGCATCTCAGATTCAAAATATTTTACATGACTTGGATTAA
Protein sequence
MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKCSLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLSHPSAFHSAPPERIFGLLTNRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKEISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSISSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHPLHFLSLNAYTALASAYKVRSCDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLILASHSSLWATTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSKTKDVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD
Homology
BLAST of Lsi08G002110 vs. ExPASy Swiss-Prot
Match:
Q3ECY6 (Protein SET DOMAIN GROUP 41 OS=Arabidopsis thaliana OX=3702 GN=SDG41 PE=2 SV=1)
HSP 1 Score: 321.6 bits (823), Expect = 1.9e-86
Identity = 229/649 (35.29%), Postives = 325/649 (50.08%), Query Frame = 0
Query: 3 MEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKC 62
ME+RA EDIE+ D+ PPL PL ++L+DSFL +HCSSCFS LP P YCS C
Sbjct: 1 MEIRAAEDIEIRTDLFPPLSPLASSLYDSFLSSHCSSCFSLLPPSPPQP----LYCSAAC 60
Query: 63 SLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLSHPSAFHSAPPERIFGLLTNRHK 122
S LT +F ++ FP T L + +R LL+ + S+ P R+ LLTN H
Sbjct: 61 S------LTDSFTNSPQFPPEITPILPSDIRTSLHLLNSTAVDTSSSPHRLNNLLTNHHL 120
Query: 123 LMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRT 182
LM D + + + + IA R N R LEEA +C VLTNAV+V DS G
Sbjct: 121 LMA---DPSISVAIHHAANFIATVIRSN----RKNTELEEAAICAVLTNAVEVHDSNGLA 180
Query: 183 IGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDL-VTNEGSCNQMGTVR 242
+GIA+Y +F WINHSCSPN+CYRF + S D+ VTN + + +
Sbjct: 181 LGIALYNSSFSWINHSCSPNSCYRF----------VNNRTSYHDVHVTNTETSSNLELQE 240
Query: 243 SNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKEI------------- 302
+ G +G GP+++VRSIK I+ GE +T++Y DLLQP +
Sbjct: 241 QVCGTSLNSG---NGNGPKLIVRSIKRIKSGEEITVSYIDLLQPTGLRQSDLWSKYRFMC 300
Query: 303 -----SAVKVEFLDS------------TSISNFD----HDQAVRRIDDYVDNAITEYLSI 362
+A ++DS T++ +FD D+AV +++DY+ AI ++LS
Sbjct: 301 NCGRCAASPPAYVDSILEGVLTLESEKTTVGHFDGSTNKDEAVGKMNDYIQEAIDDFLSD 360
Query: 363 S-SPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHPLHFLSLNAYTALASAYKVRSC 422
+ P++C E ++++L G + +E+ QP LRLH H+++LNAY LA+AY++RS
Sbjct: 361 NIDPKTCCEMIESVLHHGI-----QFKEDSQPHCLRLHACHYVALNAYITLATAYRIRSI 420
Query: 423 DLLALSSKMDNDDENQRG-ASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAG 482
D ++ G MSR SAAYSLFLAG +HHLF ++ S SAA W AG
Sbjct: 421 D-------------SETGIVCDMSRISAAYSLFLAGVSHHLFCAERSFAISAAKFWKNAG 480
Query: 483 ESLLILASHSSLWATTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGI 542
E L LA + + S C+ C ++ N+ R D E S I
Sbjct: 481 ELLFDLAPKLLMELSVESDV--------KCTKCLMLETSNSHR--------DIKEKSRQI 540
Query: 543 SNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSKT 602
+C+ ++SQ +WSFLT GCPYL+ F P DFS T +N +
Sbjct: 541 LSCVRDISQVTWSFLTRGCPYLEKFRSPVDFS-----LTRTNGE---------------- 557
Query: 603 KDVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQ 615
+ S + ++L L HCL Y L +CYG SHL S+ +
Sbjct: 601 -------REESSKDQTVNVLLLSSHCLLYADLLTDLCYGQKSHLVSRFR 557
BLAST of Lsi08G002110 vs. ExPASy TrEMBL
Match:
A0A1S3CIT0 (protein SET DOMAIN GROUP 41 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103501316 PE=4 SV=1)
HSP 1 Score: 978.4 bits (2528), Expect = 1.4e-281
Identity = 506/657 (77.02%), Postives = 541/657 (82.34%), Query Frame = 0
Query: 3 MEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKC 62
MEMRA+EDIEMAEDITPPLFPLT+ALHDSFL THCSSCFS LPNPPISHS LL YCSLKC
Sbjct: 1 MEMRALEDIEMAEDITPPLFPLTSALHDSFLSTHCSSCFSLLPNPPISHSPLLHYCSLKC 60
Query: 63 SLSHSDPLTAAFFSAHPFP--SSDTSDLRASLRL--LHLLLSHPSAFHSAPPERIFGLLT 122
SLSHSDPLTAAFFS HP P SSDTSDLRASLRL LHLLLSHPS S PP RIFGLLT
Sbjct: 61 SLSHSDPLTAAFFSIHPLPDASSDTSDLRASLRLLHLHLLLSHPSPSLSPPPHRIFGLLT 120
Query: 123 NRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDS 182
NRHKLM PQ+ S+VFLKLRE +AIAA RRKN ADI G ALEEAVLCLVLTNAVDVQDS
Sbjct: 121 NRHKLMTPQNGSEVFLKLREAANAIAALRRKNYADISPGTALEEAVLCLVLTNAVDVQDS 180
Query: 183 IGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMG 242
IG+TIGIAVYAPTF WINHSCSPNACYRFETPSD TTR RIAPSCTD V++EG+C QMG
Sbjct: 181 IGQTIGIAVYAPTFSWINHSCSPNACYRFETPSDFFTTRFRIAPSCTDFVSDEGTCRQMG 240
Query: 243 TVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPK------------ 302
VRSN+ DF+RE G GPRVVVRSIK I+KGEAVTIAYCDLLQPK
Sbjct: 241 NVRSNILDFMRED--FQGNGPRVVVRSIKRIKKGEAVTIAYCDLLQPKARRQSELWSRYQ 300
Query: 303 ---------------------EISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYL 362
EISAVKVE LDS ISNFDHD AVRRID+YVDNAITEYL
Sbjct: 301 FVCSCQRCSAVPLTYVDHALQEISAVKVELLDSAPISNFDHDTAVRRIDEYVDNAITEYL 360
Query: 363 SISSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHPLHFLSLNAYTALASAYKVRS 422
SI SPESC EKLQNLLT GF DEQ ED E KQPV+LRLHP HFL LNAYTAL SAYKVRS
Sbjct: 361 SIGSPESCCEKLQNLLTFGFRDEQVEDGEGKQPVSLRLHPSHFLLLNAYTALTSAYKVRS 420
Query: 423 CDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAG 482
CDLLALSS+MD D+EN+ A MS+TSAAY+LFLAGATHHLFL +PSLIASAANCWVVAG
Sbjct: 421 CDLLALSSEMDKDNENRHNALTMSKTSAAYALFLAGATHHLFLFEPSLIASAANCWVVAG 480
Query: 483 ESLLILASHSSLWA-TTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIG 542
ESLLILA HSSLWA TTN+S WG P+GKRMCSNCSWVD+FN SRI GR I+ADF EFSIG
Sbjct: 481 ESLLILARHSSLWATTTNTSDWGFPLGKRMCSNCSWVDEFNGSRIHGRRIQADFREFSIG 540
Query: 543 ISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSK 602
ISNCIA++S+K WSFLTHGCPYLKAFTDP DFSWPK +N D+ H ID SCACSK
Sbjct: 541 ISNCIASISRKCWSFLTHGCPYLKAFTDPFDFSWPK-----TNDGDIGGHGIDRSCACSK 600
Query: 603 TKDVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD 622
TKD+CF+ EPQ SNQERESI GLGIHCL+YGGYLASICYG+HSHLASQIQNIL+DL+
Sbjct: 601 TKDICFECEPQDSNQERESISGLGIHCLYYGGYLASICYGYHSHLASQIQNILNDLN 650
BLAST of Lsi08G002110 vs. ExPASy TrEMBL
Match:
A0A0A0KAK3 (SET domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G014840 PE=4 SV=1)
HSP 1 Score: 973.4 bits (2515), Expect = 4.5e-280
Identity = 500/657 (76.10%), Postives = 542/657 (82.50%), Query Frame = 0
Query: 1 MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSL 60
MEMEM A+EDIEMAEDI+PPLFPLT+ALHDSFL THCSSCFS LPNPPISHS L YCSL
Sbjct: 1 MEMEMIAVEDIEMAEDISPPLFPLTSALHDSFLFTHCSSCFSLLPNPPISHSIPLHYCSL 60
Query: 61 KCSLSHSDPLTAAFFSAHPFP--SSDTSDLRASLRLLHLLLSHPSAFHSAPPERIFGLLT 120
KCSLSHSDPLT AFFS HPFP SSDTSDLRASLRLLHLLLSHPS S PP+RI+GLLT
Sbjct: 61 KCSLSHSDPLTDAFFSIHPFPDASSDTSDLRASLRLLHLLLSHPSPSLSPPPDRIYGLLT 120
Query: 121 NRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDS 180
NRHKLM PQ+DS+VFLKLREG +AIAA RRKN ADI G ALEEAVLCLVLTNAVDVQDS
Sbjct: 121 NRHKLMTPQNDSEVFLKLREGANAIAALRRKNYADIPPGTALEEAVLCLVLTNAVDVQDS 180
Query: 181 IGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMG 240
IG+TIGIAVYA TF WINHSCSPNACYRFETPSDS TTR RIAPSCTD +++EGSC QMG
Sbjct: 181 IGQTIGIAVYASTFSWINHSCSPNACYRFETPSDSVTTRFRIAPSCTDFMSDEGSCRQMG 240
Query: 241 TVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPK------------ 300
VRSN+ DFIREGA L+G GPRVVVRSIK I+KGEAVTIAYCDLLQPK
Sbjct: 241 NVRSNILDFIREGALLNGNGPRVVVRSIKRIKKGEAVTIAYCDLLQPKARRQSELWSRYQ 300
Query: 301 ---------------------EISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYL 360
EIS+VKVE LDST ISNFDHD AVRRID+YVDNAITEYL
Sbjct: 301 FVCSCQRCSAVPLTYVDHALQEISSVKVELLDSTPISNFDHDTAVRRIDEYVDNAITEYL 360
Query: 361 SISSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHPLHFLSLNAYTALASAYKVRS 420
S SSPESC EKLQNLLT GF DEQ ED E KQ V+LRLHPLHFL LNAYTAL SAYKVRS
Sbjct: 361 STSSPESCCEKLQNLLTFGFHDEQVEDGEGKQHVSLRLHPLHFLLLNAYTALTSAYKVRS 420
Query: 421 CDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAG 480
CDL+ALSS+MD D+ N+ A M +TSAAY+LFLAGATH LFL +PSL+ASAANCWVVAG
Sbjct: 421 CDLVALSSEMDKDNGNRHNALTMGKTSAAYALFLAGATHRLFLFEPSLVASAANCWVVAG 480
Query: 481 ESLLILASHSSLWA-TTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIG 540
ESLLILA HSSLWA TTN+S W P+GKRMC NCSWVD+FNASRI G+ ++ADF EFSIG
Sbjct: 481 ESLLILARHSSLWATTTNTSNWVFPLGKRMCYNCSWVDEFNASRIHGQPVQADFREFSIG 540
Query: 541 ISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSK 600
ISNCIA++SQK WS LTHGCPYLKAFT P DFSWPK +N QD+ ID SCACSK
Sbjct: 541 ISNCIASISQKCWSSLTHGCPYLKAFTGPFDFSWPK-----TNEQDICGRGIDHSCACSK 600
Query: 601 TKDVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD 622
T+DVC + +PQ SNQERESI GLGIHCL+YGGYLASICYGHHSHLASQIQNIL+DL+
Sbjct: 601 TQDVCLECKPQDSNQERESISGLGIHCLYYGGYLASICYGHHSHLASQIQNILNDLN 652
BLAST of Lsi08G002110 vs. ExPASy TrEMBL
Match:
A0A6J1EY39 (protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111439283 PE=4 SV=1)
HSP 1 Score: 872.5 bits (2253), Expect = 1.1e-249
Identity = 465/654 (71.10%), Postives = 507/654 (77.52%), Query Frame = 0
Query: 1 MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSL 60
MEMEMRAMEDIEMAEDITPPL PLTAALHD+F LTHCSSCFSPLPN ISHSNLLRYCS
Sbjct: 1 MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSPLPNSSISHSNLLRYCSP 60
Query: 61 KCSLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLSHPSAFHSAPPERIFGLLTNR 120
C S SD LTAA FS FP SDTSDLRASLRLLHLLLS SA+ SAPPERIFGLLTNR
Sbjct: 61 IC--SRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDSSAWRSAPPERIFGLLTNR 120
Query: 121 HKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIG 180
KLM+ +DDS+VF+K+R+G DA+AASRR NSADIR+ NALEEA+LCLVLTNAV+VQDS+G
Sbjct: 121 EKLMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVG 180
Query: 181 RTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTV 240
+TIGIAVY PTFCWINHSCSPNACYRFETPSDS TRLRI+P CTD+ T EGSCNQM TV
Sbjct: 181 QTIGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTV 240
Query: 241 RSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPK-------------- 300
R N S FI + GYGPRV+VRSIK +RKGEAVTIAYCDLLQPK
Sbjct: 241 RRNFSHFITKD--FQGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAVRQSELLSRYKFV 300
Query: 301 -------------------EISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSI 360
EISA VE LDSTSISNFD+D A+RRIDDYV+NAI EYLSI
Sbjct: 301 CSCQRCSAKPPTYVDHALQEISAFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSI 360
Query: 361 SSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHPLHFLSLNAYTALASAYKVRSCD 420
SPESC EKLQNLLTLGF DEQAED + KQ +NLRLHP+HFL LN YTALASAYKVRS
Sbjct: 361 GSPESCCEKLQNLLTLGFYDEQAEDGDGKQLLNLRLHPVHFLLLNTYTALASAYKVRSW- 420
Query: 421 LLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGES 480
NDDENQ A+ MS+TSAAYSLFLAGATHHLFL++PSLIASAANCWVVAGES
Sbjct: 421 ---------NDDENQCNAT-MSKTSAAYSLFLAGATHHLFLNEPSLIASAANCWVVAGES 480
Query: 481 LLILASHSSLWATTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISN 540
LLIL HSSLW +N+SK P+G+ C NCSWVDKFN +RI GRSIEADF EFSIGISN
Sbjct: 481 LLILVKHSSLWG-SNTSKSSSPMGEITCLNCSWVDKFNTNRIHGRSIEADFREFSIGISN 540
Query: 541 CIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSKTKD 600
CIA++S K WSFL H C YLKAFTDP DFSWPK ITT NY SC CSK +D
Sbjct: 541 CIADISHKYWSFLAHECSYLKAFTDPFDFSWPKTITTCLNYH-------GRSCDCSKIQD 600
Query: 601 VCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD 622
V S Q+R+SI LGIHCLFYGGYLASICYGH SHLASQI+ ILHD++
Sbjct: 601 V--------SEQDRQSIFELGIHCLFYGGYLASICYGHDSHLASQIECILHDMN 623
BLAST of Lsi08G002110 vs. ExPASy TrEMBL
Match:
A0A6J1I954 (protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111472647 PE=4 SV=1)
HSP 1 Score: 862.1 bits (2226), Expect = 1.5e-246
Identity = 462/655 (70.53%), Postives = 504/655 (76.95%), Query Frame = 0
Query: 1 MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSL 60
MEME+RAMEDIEMAEDITPPL PLTAALHDSFLLTHCSSCFSPLPN PISHSNLLRYCS
Sbjct: 1 MEMELRAMEDIEMAEDITPPLPPLTAALHDSFLLTHCSSCFSPLPNSPISHSNLLRYCSP 60
Query: 61 KCSLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLSHPSAFHSAPPERIFGLLTNR 120
C S+SD LTAA FS F SDTSDLRASLRLLHLLLS SA+ S PPERIFGLLTNR
Sbjct: 61 IC--SYSDSLTAAVFSTDHFLFSDTSDLRASLRLLHLLLSDTSAWRSTPPERIFGLLTNR 120
Query: 121 HKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIG 180
KLM+ DDS+VF K+R+G DAIA SRR NSADIR+ NALEEA++CLVLTNAV+VQDS+G
Sbjct: 121 EKLMLADDDSEVFAKIRKGADAIATSRRTNSADIRYDNALEEAIMCLVLTNAVEVQDSVG 180
Query: 181 RTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTV 240
+TIGIAVY PTFCWINHSCSPNACYRFETPSDS TRLRI+P CTD+ T EGSC+QM TV
Sbjct: 181 QTIGIAVYHPTFCWINHSCSPNACYRFETPSDSIKTRLRISPFCTDIGTGEGSCSQMSTV 240
Query: 241 RSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPK-------------- 300
R N S FI + GYGPRV+VRSIK IRKGEAVTIAYCDLLQPK
Sbjct: 241 RRNFSHFITKD--FQGYGPRVMVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELRSRYKFV 300
Query: 301 -------------------EISAVKV-EFLDSTSISNFDHDQAVRRIDDYVDNAITEYLS 360
EI AV V E LDSTSISNFD+D A+ RIDDYV+NAI EYLS
Sbjct: 301 CSCQRCSAKPPTYVDHALQEIFAVNVEELLDSTSISNFDYDTAITRIDDYVNNAIAEYLS 360
Query: 361 ISSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHPLHFLSLNAYTALASAYKVRSC 420
I SPESC EKLQNLLTLGF DEQA+D + KQ +NLRLHP+HFL LN YTALASAYKVRS
Sbjct: 361 IGSPESCCEKLQNLLTLGFYDEQADDGDGKQLLNLRLHPVHFLLLNVYTALASAYKVRSW 420
Query: 421 DLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGE 480
ND+ENQ S MS+TSAAYSLFLAGATHHLFL++PSLIASAANCWVVAGE
Sbjct: 421 ----------NDNENQCNTSTMSKTSAAYSLFLAGATHHLFLNEPSLIASAANCWVVAGE 480
Query: 481 SLLILASHSSLWATTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGIS 540
SLL L HSSLW +N+SK P+G+ C NCSWVDKFN SRI GRSIE DF EFSIGIS
Sbjct: 481 SLLRLVRHSSLWG-SNTSKSSSPMGEITCLNCSWVDKFNTSRIHGRSIEVDFQEFSIGIS 540
Query: 541 NCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSKTK 600
NCIAN+S K WSFLTH CPYLKAFTDP DFSWPK ITT SNY+ D C SK +
Sbjct: 541 NCIANISHKYWSFLTHECPYLKAFTDPFDFSWPKTITTCSNYR-------DRLCDYSKIQ 600
Query: 601 DVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD 622
DV S+Q+R+SI LGIHCLFYGGYLASICYGH SHL+SQIQ IL D++
Sbjct: 601 DV--------SDQDRQSIFELGIHCLFYGGYLASICYGHPSHLSSQIQCILQDMN 625
BLAST of Lsi08G002110 vs. ExPASy TrEMBL
Match:
A0A1S3CJZ3 (protein SET DOMAIN GROUP 41 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103501316 PE=4 SV=1)
HSP 1 Score: 777.7 bits (2007), Expect = 3.6e-221
Identity = 411/534 (76.97%), Postives = 435/534 (81.46%), Query Frame = 0
Query: 3 MEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKC 62
MEMRA+EDIEMAEDITPPLFPLT+ALHDSFL THCSSCFS LPNPPISHS LL YCSLKC
Sbjct: 1 MEMRALEDIEMAEDITPPLFPLTSALHDSFLSTHCSSCFSLLPNPPISHSPLLHYCSLKC 60
Query: 63 SLSHSDPLTAAFFSAHPFP--SSDTSDLRASLRL--LHLLLSHPSAFHSAPPERIFGLLT 122
SLSHSDPLTAAFFS HP P SSDTSDLRASLRL LHLLLSHPS S PP RIFGLLT
Sbjct: 61 SLSHSDPLTAAFFSIHPLPDASSDTSDLRASLRLLHLHLLLSHPSPSLSPPPHRIFGLLT 120
Query: 123 NRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDS 182
NRHKLM PQ+ S+VFLKLRE +AIAA RRKN ADI G ALEEAVLCLVLTNAVDVQDS
Sbjct: 121 NRHKLMTPQNGSEVFLKLREAANAIAALRRKNYADISPGTALEEAVLCLVLTNAVDVQDS 180
Query: 183 IGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMG 242
IG+TIGIAVYAPTF WINHSCSPNACYRFETPSD TTR RIAPSCTD V++EG+C QMG
Sbjct: 181 IGQTIGIAVYAPTFSWINHSCSPNACYRFETPSDFFTTRFRIAPSCTDFVSDEGTCRQMG 240
Query: 243 TVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPK------------ 302
VRSN+ DF+RE G GPRVVVRSIK I+KGEAVTIAYCDLLQPK
Sbjct: 241 NVRSNILDFMRED--FQGNGPRVVVRSIKRIKKGEAVTIAYCDLLQPKARRQSELWSRYQ 300
Query: 303 ---------------------EISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYL 362
EISAVKVE LDS ISNFDHD AVRRID+YVDNAITEYL
Sbjct: 301 FVCSCQRCSAVPLTYVDHALQEISAVKVELLDSAPISNFDHDTAVRRIDEYVDNAITEYL 360
Query: 363 SISSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHPLHFLSLNAYTALASAYKVRS 422
SI SPESC EKLQNLLT GF DEQ ED E KQPV+LRLHP HFL LNAYTAL SAYKVRS
Sbjct: 361 SIGSPESCCEKLQNLLTFGFRDEQVEDGEGKQPVSLRLHPSHFLLLNAYTALTSAYKVRS 420
Query: 423 CDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAG 482
CDLLALSS+MD D+EN+ A MS+TSAAY+LFLAGATHHLFL +PSLIASAANCWVVAG
Sbjct: 421 CDLLALSSEMDKDNENRHNALTMSKTSAAYALFLAGATHHLFLFEPSLIASAANCWVVAG 480
Query: 483 ESLLILASHSSLWA-TTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADF 499
ESLLILA HSSLWA TTN+S WG P+GKRMCSNCSWVD+FN SRI GR I+ADF
Sbjct: 481 ESLLILARHSSLWATTTNTSDWGFPLGKRMCSNCSWVDEFNGSRIHGRRIQADF 532
BLAST of Lsi08G002110 vs. NCBI nr
Match:
XP_038886411.1 (protein SET DOMAIN GROUP 41 [Benincasa hispida])
HSP 1 Score: 1004.6 bits (2596), Expect = 3.8e-289
Identity = 515/656 (78.51%), Postives = 549/656 (83.69%), Query Frame = 0
Query: 1 MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSL 60
MEMEM AMEDIEMAEDITPPL PLT+ALHDSFL THCSSCFS LPNPPISHSNLLRYCS
Sbjct: 1 MEMEMIAMEDIEMAEDITPPLLPLTSALHDSFLFTHCSSCFSLLPNPPISHSNLLRYCSP 60
Query: 61 KCSLSHSDPLTAAFFSAHPFPS--SDTSDLRASLRLLHLLLSHPSAFHSAPPERIFGLLT 120
KCSLSHSDPLTAAFFS HPFPS S TSDLRASLRLLHLLLSHP A S PPERIFGLLT
Sbjct: 61 KCSLSHSDPLTAAFFSTHPFPSPFSYTSDLRASLRLLHLLLSHPPASLSPPPERIFGLLT 120
Query: 121 NRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDS 180
NRHKLM PQ D+++F KLREGVDAIAA SADI HG+ L EA LCLV TNAVDV DS
Sbjct: 121 NRHKLMFPQHDAELFPKLREGVDAIAALL---SADIPHGHTLAEAALCLVFTNAVDVHDS 180
Query: 181 IGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMG 240
GRTIGIAVY PTFCWINHSCSPNACYRFET S STTTR RIAPSCTDL+T +GSC+QMG
Sbjct: 181 TGRTIGIAVYPPTFCWINHSCSPNACYRFETSSASTTTRSRIAPSCTDLLTGQGSCSQMG 240
Query: 241 TVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPK------------ 300
TVRSNLSDFI E G GPRV+VRSIK IR+GEAVTIAYCDLLQPK
Sbjct: 241 TVRSNLSDFITED--FQGNGPRVMVRSIKSIRRGEAVTIAYCDLLQPKAMRQSELWSRYQ 300
Query: 301 ---------------------EISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYL 360
E+SA KVE DSTSISNFDHD+AVRRIDDYV++AITEYL
Sbjct: 301 FVCSCQRCSAKPLTYVDHALQELSASKVELHDSTSISNFDHDKAVRRIDDYVNSAITEYL 360
Query: 361 SISSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHPLHFLSLNAYTALASAYKVRS 420
SI SPESC EKL+NLLTLGF DEQAED E+KQPVNLRLHPLHFLSLN YTALASAYKVRS
Sbjct: 361 SIGSPESCCEKLRNLLTLGFYDEQAEDGEQKQPVNLRLHPLHFLSLNVYTALASAYKVRS 420
Query: 421 CDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAG 480
CDLLALSS+MD D+E+Q AS M + SAAYSLFLAGATHHLFLS+PSLI SA+ CWV+AG
Sbjct: 421 CDLLALSSEMDCDNEDQCNASTMCKASAAYSLFLAGATHHLFLSEPSLIVSASTCWVLAG 480
Query: 481 ESLLILASHSSLWATTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGI 540
ESLL LA HS LWATTN+SKWG PVGKRMCS CSWVDKFNASRI G+ IEADF EFSIGI
Sbjct: 481 ESLLTLARHSLLWATTNTSKWGFPVGKRMCSTCSWVDKFNASRIHGQPIEADFREFSIGI 540
Query: 541 SNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSKT 600
SNCIANMS+KSWSFLTHGCPYLKAFTDP +FSWPK I YS+ +D++AHSID CACS +
Sbjct: 541 SNCIANMSRKSWSFLTHGCPYLKAFTDPFNFSWPKMIPMYSSDRDIRAHSIDRLCACSNS 600
Query: 601 KDVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD 622
KDVCFQ EPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNIL+DL+
Sbjct: 601 KDVCFQCEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILYDLN 651
BLAST of Lsi08G002110 vs. NCBI nr
Match:
XP_008463080.1 (PREDICTED: protein SET DOMAIN GROUP 41 isoform X1 [Cucumis melo])
HSP 1 Score: 978.4 bits (2528), Expect = 2.9e-281
Identity = 506/657 (77.02%), Postives = 541/657 (82.34%), Query Frame = 0
Query: 3 MEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKC 62
MEMRA+EDIEMAEDITPPLFPLT+ALHDSFL THCSSCFS LPNPPISHS LL YCSLKC
Sbjct: 1 MEMRALEDIEMAEDITPPLFPLTSALHDSFLSTHCSSCFSLLPNPPISHSPLLHYCSLKC 60
Query: 63 SLSHSDPLTAAFFSAHPFP--SSDTSDLRASLRL--LHLLLSHPSAFHSAPPERIFGLLT 122
SLSHSDPLTAAFFS HP P SSDTSDLRASLRL LHLLLSHPS S PP RIFGLLT
Sbjct: 61 SLSHSDPLTAAFFSIHPLPDASSDTSDLRASLRLLHLHLLLSHPSPSLSPPPHRIFGLLT 120
Query: 123 NRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDS 182
NRHKLM PQ+ S+VFLKLRE +AIAA RRKN ADI G ALEEAVLCLVLTNAVDVQDS
Sbjct: 121 NRHKLMTPQNGSEVFLKLREAANAIAALRRKNYADISPGTALEEAVLCLVLTNAVDVQDS 180
Query: 183 IGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMG 242
IG+TIGIAVYAPTF WINHSCSPNACYRFETPSD TTR RIAPSCTD V++EG+C QMG
Sbjct: 181 IGQTIGIAVYAPTFSWINHSCSPNACYRFETPSDFFTTRFRIAPSCTDFVSDEGTCRQMG 240
Query: 243 TVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPK------------ 302
VRSN+ DF+RE G GPRVVVRSIK I+KGEAVTIAYCDLLQPK
Sbjct: 241 NVRSNILDFMRED--FQGNGPRVVVRSIKRIKKGEAVTIAYCDLLQPKARRQSELWSRYQ 300
Query: 303 ---------------------EISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYL 362
EISAVKVE LDS ISNFDHD AVRRID+YVDNAITEYL
Sbjct: 301 FVCSCQRCSAVPLTYVDHALQEISAVKVELLDSAPISNFDHDTAVRRIDEYVDNAITEYL 360
Query: 363 SISSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHPLHFLSLNAYTALASAYKVRS 422
SI SPESC EKLQNLLT GF DEQ ED E KQPV+LRLHP HFL LNAYTAL SAYKVRS
Sbjct: 361 SIGSPESCCEKLQNLLTFGFRDEQVEDGEGKQPVSLRLHPSHFLLLNAYTALTSAYKVRS 420
Query: 423 CDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAG 482
CDLLALSS+MD D+EN+ A MS+TSAAY+LFLAGATHHLFL +PSLIASAANCWVVAG
Sbjct: 421 CDLLALSSEMDKDNENRHNALTMSKTSAAYALFLAGATHHLFLFEPSLIASAANCWVVAG 480
Query: 483 ESLLILASHSSLWA-TTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIG 542
ESLLILA HSSLWA TTN+S WG P+GKRMCSNCSWVD+FN SRI GR I+ADF EFSIG
Sbjct: 481 ESLLILARHSSLWATTTNTSDWGFPLGKRMCSNCSWVDEFNGSRIHGRRIQADFREFSIG 540
Query: 543 ISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSK 602
ISNCIA++S+K WSFLTHGCPYLKAFTDP DFSWPK +N D+ H ID SCACSK
Sbjct: 541 ISNCIASISRKCWSFLTHGCPYLKAFTDPFDFSWPK-----TNDGDIGGHGIDRSCACSK 600
Query: 603 TKDVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD 622
TKD+CF+ EPQ SNQERESI GLGIHCL+YGGYLASICYG+HSHLASQIQNIL+DL+
Sbjct: 601 TKDICFECEPQDSNQERESISGLGIHCLYYGGYLASICYGYHSHLASQIQNILNDLN 650
BLAST of Lsi08G002110 vs. NCBI nr
Match:
XP_011656459.1 (protein SET DOMAIN GROUP 41 [Cucumis sativus])
HSP 1 Score: 961.8 bits (2485), Expect = 2.8e-276
Identity = 497/657 (75.65%), Postives = 538/657 (81.89%), Query Frame = 0
Query: 1 MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSL 60
MEMEM A+EDIEMAEDI+PPLFPLT+ALHDSFL THCSSCFS LPNPPISHS L YCSL
Sbjct: 1 MEMEMIAVEDIEMAEDISPPLFPLTSALHDSFLFTHCSSCFSLLPNPPISHSIPLHYCSL 60
Query: 61 KCSLSHSDPLTAAFFSAHPFP--SSDTSDLRASLRLLHLLLSHPSAFHSAPPERIFGLLT 120
KCSLSHSDPLT AFFS HPFP SSDTSDLRASLRLLHLLLSHPS S PP+RI+GLLT
Sbjct: 61 KCSLSHSDPLTDAFFSIHPFPDASSDTSDLRASLRLLHLLLSHPSPSLSPPPDRIYGLLT 120
Query: 121 NRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDS 180
NRHKLM PQ+DS+VFLKLREG +AIAA RRKN ADI G ALEEAVLCLVLTNAVDVQDS
Sbjct: 121 NRHKLMTPQNDSEVFLKLREGANAIAALRRKNYADIPPGTALEEAVLCLVLTNAVDVQDS 180
Query: 181 IGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMG 240
IG+TIGIAVYA TF WINHSCSPNACYRFETPSDS TTR RIAPSCTD +++EGSC QMG
Sbjct: 181 IGQTIGIAVYASTFSWINHSCSPNACYRFETPSDSVTTRFRIAPSCTDFMSDEGSCRQMG 240
Query: 241 TVRSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPK------------ 300
VRSN+ DFIRE G GPRVVVRSIK I+KGEAVTIAYCDLLQPK
Sbjct: 241 NVRSNILDFIRED--FQGNGPRVVVRSIKRIKKGEAVTIAYCDLLQPKARRQSELWSRYQ 300
Query: 301 ---------------------EISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYL 360
EIS+VKVE LDST ISNFDHD AVRRID+YVDNAITEYL
Sbjct: 301 FVCSCQRCSAVPLTYVDHALQEISSVKVELLDSTPISNFDHDTAVRRIDEYVDNAITEYL 360
Query: 361 SISSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHPLHFLSLNAYTALASAYKVRS 420
S SSPESC EKLQNLLT GF DEQ ED E KQ V+LRLHPLHFL LNAYTAL SAYKVRS
Sbjct: 361 STSSPESCCEKLQNLLTFGFHDEQVEDGEGKQHVSLRLHPLHFLLLNAYTALTSAYKVRS 420
Query: 421 CDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAG 480
CDL+ALSS+MD D+ N+ A M +TSAAY+LFLAGATH LFL +PSL+ASAANCWVVAG
Sbjct: 421 CDLVALSSEMDKDNGNRHNALTMGKTSAAYALFLAGATHRLFLFEPSLVASAANCWVVAG 480
Query: 481 ESLLILASHSSLWA-TTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIG 540
ESLLILA HSSLWA TTN+S W P+GKRMC NCSWVD+FNASRI G+ ++ADF EFSIG
Sbjct: 481 ESLLILARHSSLWATTTNTSNWVFPLGKRMCYNCSWVDEFNASRIHGQPVQADFREFSIG 540
Query: 541 ISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSK 600
ISNCIA++SQK WS LTHGCPYLKAFT P DFSWPK +N QD+ ID SCACSK
Sbjct: 541 ISNCIASISQKCWSSLTHGCPYLKAFTGPFDFSWPK-----TNEQDICGRGIDHSCACSK 600
Query: 601 TKDVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD 622
T+DVC + +PQ SNQERESI GLGIHCL+YGGYLASICYGHHSHLASQIQNIL+DL+
Sbjct: 601 TQDVCLECKPQDSNQERESISGLGIHCLYYGGYLASICYGHHSHLASQIQNILNDLN 650
BLAST of Lsi08G002110 vs. NCBI nr
Match:
XP_023520942.1 (protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 891.3 bits (2302), Expect = 4.6e-255
Identity = 476/654 (72.78%), Postives = 513/654 (78.44%), Query Frame = 0
Query: 1 MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSL 60
MEMEMRAMEDIEMAEDITPPL PLTAALHD+FLLTHCSSCFSPLPN ISHSNLLRYCS
Sbjct: 1 MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFLLTHCSSCFSPLPNSSISHSNLLRYCSP 60
Query: 61 KCSLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLSHPSAFHSAPPERIFGLLTNR 120
C SHSD LTAA FS FP SDTSDLRASLRLLHLLLS PSA+ SAPPERIFGLLTNR
Sbjct: 61 IC--SHSDSLTAAVFSTGQFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNR 120
Query: 121 HKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIG 180
KLM+ DDS+VF+K+REG DA+AASRR NSADIR+ NALEEA+LCLVLTNAV+VQDS+G
Sbjct: 121 EKLMLADDDSEVFVKIREGSDAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVG 180
Query: 181 RTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTV 240
RTIGIAVY PTFCWINHSCSPNACYRFETPSDS TRLRI+P CTD+ T EGSC+QM TV
Sbjct: 181 RTIGIAVYHPTFCWINHSCSPNACYRFETPSDSIKTRLRISPFCTDIGTGEGSCSQMSTV 240
Query: 241 RSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPK-------------- 300
R N S FI + GYGPRV+VRSIK IR GEAVTIAYCDLLQPK
Sbjct: 241 RRNFSHFITKD--FQGYGPRVMVRSIKSIRNGEAVTIAYCDLLQPKAMRQSELRSRYKFV 300
Query: 301 -------------------EISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSI 360
EISAV VE LDSTSISNFD+D A+ RIDDYV+NAI EYLSI
Sbjct: 301 CSCQRCSAKPPTYVDHALQEISAVNVELLDSTSISNFDYDTAIARIDDYVNNAIAEYLSI 360
Query: 361 SSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHPLHFLSLNAYTALASAYKVRSCD 420
S ESC EKLQNLLTLGF DEQAED + KQ +NLRLHP+HFL LNAYTALASAYKVRS
Sbjct: 361 GSSESCCEKLQNLLTLGFYDEQAEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSW- 420
Query: 421 LLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGES 480
N DENQ A+ MS+TSAAYSLFLAGATHHLFLS+PSLIASAANCWVVAGES
Sbjct: 421 ---------NGDENQCNAT-MSKTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGES 480
Query: 481 LLILASHSSLWATTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISN 540
LLIL HSSLW +N+SK P+G+ C NCSWVDKFN SRI GRSIEADF EFSIGISN
Sbjct: 481 LLILVKHSSLWG-SNTSKSSSPMGEITCLNCSWVDKFNTSRIHGRSIEADFREFSIGISN 540
Query: 541 CIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSKTKD 600
CIAN+SQK WSFL H C YLKAFTDP DFSWPK ITT SNY+ D SC CSK +D
Sbjct: 541 CIANISQKYWSFLAHECSYLKAFTDPFDFSWPKTITTCSNYR-------DRSCDCSKIQD 600
Query: 601 VCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD 622
V S+Q+R+SI LGIHCLFYGGYLASICYGHHSHLASQIQ ILHD++
Sbjct: 601 V--------SDQDRQSIFELGIHCLFYGGYLASICYGHHSHLASQIQCILHDMN 623
BLAST of Lsi08G002110 vs. NCBI nr
Match:
XP_022932824.1 (protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita moschata])
HSP 1 Score: 872.5 bits (2253), Expect = 2.2e-249
Identity = 465/654 (71.10%), Postives = 507/654 (77.52%), Query Frame = 0
Query: 1 MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSL 60
MEMEMRAMEDIEMAEDITPPL PLTAALHD+F LTHCSSCFSPLPN ISHSNLLRYCS
Sbjct: 1 MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSPLPNSSISHSNLLRYCSP 60
Query: 61 KCSLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLSHPSAFHSAPPERIFGLLTNR 120
C S SD LTAA FS FP SDTSDLRASLRLLHLLLS SA+ SAPPERIFGLLTNR
Sbjct: 61 IC--SRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDSSAWRSAPPERIFGLLTNR 120
Query: 121 HKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIG 180
KLM+ +DDS+VF+K+R+G DA+AASRR NSADIR+ NALEEA+LCLVLTNAV+VQDS+G
Sbjct: 121 EKLMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVG 180
Query: 181 RTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTV 240
+TIGIAVY PTFCWINHSCSPNACYRFETPSDS TRLRI+P CTD+ T EGSCNQM TV
Sbjct: 181 QTIGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTV 240
Query: 241 RSNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPK-------------- 300
R N S FI + GYGPRV+VRSIK +RKGEAVTIAYCDLLQPK
Sbjct: 241 RRNFSHFITKD--FQGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAVRQSELLSRYKFV 300
Query: 301 -------------------EISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSI 360
EISA VE LDSTSISNFD+D A+RRIDDYV+NAI EYLSI
Sbjct: 301 CSCQRCSAKPPTYVDHALQEISAFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSI 360
Query: 361 SSPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHPLHFLSLNAYTALASAYKVRSCD 420
SPESC EKLQNLLTLGF DEQAED + KQ +NLRLHP+HFL LN YTALASAYKVRS
Sbjct: 361 GSPESCCEKLQNLLTLGFYDEQAEDGDGKQLLNLRLHPVHFLLLNTYTALASAYKVRSW- 420
Query: 421 LLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGES 480
NDDENQ A+ MS+TSAAYSLFLAGATHHLFL++PSLIASAANCWVVAGES
Sbjct: 421 ---------NDDENQCNAT-MSKTSAAYSLFLAGATHHLFLNEPSLIASAANCWVVAGES 480
Query: 481 LLILASHSSLWATTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISN 540
LLIL HSSLW +N+SK P+G+ C NCSWVDKFN +RI GRSIEADF EFSIGISN
Sbjct: 481 LLILVKHSSLWG-SNTSKSSSPMGEITCLNCSWVDKFNTNRIHGRSIEADFREFSIGISN 540
Query: 541 CIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSKTKD 600
CIA++S K WSFL H C YLKAFTDP DFSWPK ITT NY SC CSK +D
Sbjct: 541 CIADISHKYWSFLAHECSYLKAFTDPFDFSWPKTITTCLNYH-------GRSCDCSKIQD 600
Query: 601 VCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD 622
V S Q+R+SI LGIHCLFYGGYLASICYGH SHLASQI+ ILHD++
Sbjct: 601 V--------SEQDRQSIFELGIHCLFYGGYLASICYGHDSHLASQIECILHDMN 623
BLAST of Lsi08G002110 vs. TAIR 10
Match:
AT1G43245.1 (SET domain-containing protein )
HSP 1 Score: 321.6 bits (823), Expect = 1.4e-87
Identity = 229/649 (35.29%), Postives = 325/649 (50.08%), Query Frame = 0
Query: 3 MEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKC 62
ME+RA EDIE+ D+ PPL PL ++L+DSFL +HCSSCFS LP P YCS C
Sbjct: 1 MEIRAAEDIEIRTDLFPPLSPLASSLYDSFLSSHCSSCFSLLPPSPPQP----LYCSAAC 60
Query: 63 SLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLSHPSAFHSAPPERIFGLLTNRHK 122
S LT +F ++ FP T L + +R LL+ + S+ P R+ LLTN H
Sbjct: 61 S------LTDSFTNSPQFPPEITPILPSDIRTSLHLLNSTAVDTSSSPHRLNNLLTNHHL 120
Query: 123 LMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRT 182
LM D + + + + IA R N R LEEA +C VLTNAV+V DS G
Sbjct: 121 LMA---DPSISVAIHHAANFIATVIRSN----RKNTELEEAAICAVLTNAVEVHDSNGLA 180
Query: 183 IGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDL-VTNEGSCNQMGTVR 242
+GIA+Y +F WINHSCSPN+CYRF + S D+ VTN + + +
Sbjct: 181 LGIALYNSSFSWINHSCSPNSCYRF----------VNNRTSYHDVHVTNTETSSNLELQE 240
Query: 243 SNLSDFIREGAFLHGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKEI------------- 302
+ G +G GP+++VRSIK I+ GE +T++Y DLLQP +
Sbjct: 241 QVCGTSLNSG---NGNGPKLIVRSIKRIKSGEEITVSYIDLLQPTGLRQSDLWSKYRFMC 300
Query: 303 -----SAVKVEFLDS------------TSISNFD----HDQAVRRIDDYVDNAITEYLSI 362
+A ++DS T++ +FD D+AV +++DY+ AI ++LS
Sbjct: 301 NCGRCAASPPAYVDSILEGVLTLESEKTTVGHFDGSTNKDEAVGKMNDYIQEAIDDFLSD 360
Query: 363 S-SPESCYEKLQNLLTLGFCDEQAEDEEEKQPVNLRLHPLHFLSLNAYTALASAYKVRSC 422
+ P++C E ++++L G + +E+ QP LRLH H+++LNAY LA+AY++RS
Sbjct: 361 NIDPKTCCEMIESVLHHGI-----QFKEDSQPHCLRLHACHYVALNAYITLATAYRIRSI 420
Query: 423 DLLALSSKMDNDDENQRG-ASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAG 482
D ++ G MSR SAAYSLFLAG +HHLF ++ S SAA W AG
Sbjct: 421 D-------------SETGIVCDMSRISAAYSLFLAGVSHHLFCAERSFAISAAKFWKNAG 480
Query: 483 ESLLILASHSSLWATTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGI 542
E L LA + + S C+ C ++ N+ R D E S I
Sbjct: 481 ELLFDLAPKLLMELSVESDV--------KCTKCLMLETSNSHR--------DIKEKSRQI 540
Query: 543 SNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYQDLQAHSIDPSCACSKT 602
+C+ ++SQ +WSFLT GCPYL+ F P DFS T +N +
Sbjct: 541 LSCVRDISQVTWSFLTRGCPYLEKFRSPVDFS-----LTRTNGE---------------- 557
Query: 603 KDVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQ 615
+ S + ++L L HCL Y L +CYG SHL S+ +
Sbjct: 601 -------REESSKDQTVNVLLLSSHCLLYADLLTDLCYGQKSHLVSRFR 557
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q3ECY6 | 1.9e-86 | 35.29 | Protein SET DOMAIN GROUP 41 OS=Arabidopsis thaliana OX=3702 GN=SDG41 PE=2 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
A0A1S3CIT0 | 1.4e-281 | 77.02 | protein SET DOMAIN GROUP 41 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103501316 P... | [more] |
A0A0A0KAK3 | 4.5e-280 | 76.10 | SET domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G014840 PE=4 SV... | [more] |
A0A6J1EY39 | 1.1e-249 | 71.10 | protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC11143... | [more] |
A0A6J1I954 | 1.5e-246 | 70.53 | protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC1114726... | [more] |
A0A1S3CJZ3 | 3.6e-221 | 76.97 | protein SET DOMAIN GROUP 41 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103501316 P... | [more] |
Match Name | E-value | Identity | Description | |
AT1G43245.1 | 1.4e-87 | 35.29 | SET domain-containing protein | [more] |