Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGAACTGGAAAGAAAGGAGTGGTGATAATAGGTCTCGGTCTCCGTCGCTTCGACGGAGAACTTCAGAACCTCAGGTTGAAGAAAACCGGCATTGTCATTCTCACTGGTTTTCGGGTTCTGCACGAGAACGACCGGTGACGAATGGACATGCGGGTTCTTCTGTCAGAGACCATTACTACGAAAGCCGTCTTTATGAGGATAAAGACGAACATTTTCGTAAACTCTCTCAGTTTTGCGAGAATTTGCAGCGTAGGGAATCACCGTCGAAAAAGTTTCGATGGGAAATTTTGTTTGCCAAAAGTCCCGCCAATGCGAATTCGAAATCGAGTCTGGGGTTGAAACATGTAAATGGATGTGATGATGATAATCAAGGACTTGGGGTTTCCGGTTCTCATGTGATTCCAGAATCGTCGTCCAAGGATATTTTGGAAGCTAATAATTTGCGCACATTCCATATGAACATTGGGGCAACTAAAGACAGTAACGTCAACAATGGGGATGCTTCCAGAAGTTTTGGAATCAATGATTGTAGGCATTTGTCTTCATCTAGAACGTTTGATGGGCCCATATACGAGACCAGTGATGTTCATGTTCGGGACCCTCCGATGTTTGAATCAGCAAGAAATTCCCATAAAGGAAAACGAAGCGGAACTTCTTCACATGGGGCACAGGCGTCTCATCCGCACTCCAGTGCACGTGCTACTGAATCTAAAGGCATTTCGCAAGATGAATTTCATGGTTTTTATGAGGGTCGTTTACCTCTAGCCTCTCCGGACTCCACTTGGAAGAAAGAAACGCTTAGAGAACCAGTTGAAACTGAACTGAGTATGGAAAAACTTCTGGAGTGTAAACAAGCTCGGGGGAATCATGTCGAGCACTTTGACGATTGCAATCAGTATTTCAAAGACCAACCATGTAAGAGGAGTGACATTGGTGCTGCTCTCAACAGTCCTTTCTCTCAGCAGATGGTTTGTATCCCACAAGACGACTTCTATCAAGATTCTACGCGGACCAGTGTTGTAATGGATCCAGTTGTCGAGGGATATGAAGAAACTGAAAGCTATGTCATGGGTGATATGGAAGAGAGCCGGCCAAGCGACCACTATGGTCTTTTTAAGGAGCCGTTCAGCATTGAAGGTTCTTATGTGGGCAACGCCCCTTTTGCGATGGAGCGGGATGGCGAAGTTTTGGGTTCTGGAACTGGAAGTCCGCTGAAGTGTGAAAAAGAAGCATATGCAGGTTGCGAGAAGTTGTTCTGGGCAGAAGATGGTTATAAGACAAATTCTGGGAAATGGTCGCATGAGGATGGGTTAAAAGAAACATTTGTATCAAAACATGAACAAGATTTGGGCGTCATGGAAGACAGTAGAAAGCTGAGATGGAAAGCCGCACATTCAACAAAACCGAGGGTCAAGGGAAAATGCTTTGTATCTTCAAGATGCGGAATGCATTATCCTGGGTCTGATCCATCTAGAAAACGTAATGTGTTTAGCAGAATCCATTATTTTAGTCATGGAGATGAAATGAGTACTGTTAAAGATATTGACATCAATCTAAACTGTAGAAACAAGCCGTTGAATGACGAGGATACTTCCATTACCTTCACCTCCTACAAACGGCCGTTACCTTGGCTAAATCATGCCTCTCAGCGTCTAAAGTCTAAACGCAGAGACAGAAAGAAACGTTTGCGGACCTCCTTGAGGGATCCCAGTTCAAACCCTTTAGTTAGAGAACGTAAAAGAAATAAGCGTCTCAGGAACACAAATGTCAATCGCGGGTGCCTTGATGTTCAAGCAGGTGACTGCTTTGAAGAGAAGACAAAAAGTTCAACAAGTAGGCCACCTGAAGATCCCTTGGAGTTGAACCAGCTAATAAAGAATGCCTTTTTCAAGTTTATCAAAGTTCTGAATGAGAATCCAGCCCGACGGAAGAAGTTCACTGAGCCAGGGTCTGGTATTATAAAGTGCATTGTCTGCGGCAGGTGTGTTTTCCCCAATTT
GCCATGTTTTCCACCATTAAGCTTTTGATGTCCGTATCCTTGTTAGAAGTCTAATTTTAGCCTGATCGTCATTTAGGGTGACTGGGTGAGTTTGGTGAGGATGACAGATACTTTTAGCCAATTAAAAACTCTTTTCAATATTTTCATGCTGTTTGGTTGCATAGTGATTTGAAAAATTCGAAAAACACTTAGAACACTTTAAATACTTCTAGAGGAAACACTTAATTCTTCTAGAAGTGTTTAGAGAAAAAGTGTTTTATAGAAACACTATTCTCAAAAGCCATCCTAAACTCATTTTTCATATTTTCTCAACGTCAAAAGTGTTCAAGCAGGTGACTGTCTCATTGATTTTATGAGCTATAATATGTTTGTCTGATCAGTTATATCTTTCTTGTCTCAAAGAAATGGAAAATGATTTTGATCCCTTGTAAGTAGAAGTAACAGTAGAGGCCTAGTATATGACTAACAGATTGGCAATCGTAGGTTTGACTGGATAGACAGACAATATACATAGTGCATGCTTTTACAAACATGTTCTTTGATTGTGGACTTCTCATATAAGCTTGAACTGGGGAAGATTTATTGTATGCAAGTGCTATTTCATATATCTCTCTTTTCATAGTTTTTGATGGAAGCTTTGTCTTGAAATGTAATTGATGGGACTTTCTGTAGCTGGTTTAGGCACTTGCACGTCTTGGAATCAGTCTCTCATTCCTTTCCTTTCAAATGATCAGTGAAAGAGGTGTCTTTGTTTTGATTTTCAATTGAGAAATCATAGTGATTTAAAGCATGGATGGAGGAGAATAGAGTAGAAAAACAGGCAATCAGAGAGAAGAAGAGTGTAACGCAAGCTCAAAAAATTTGTTGCAATCAGTTCTCTAGTTTGAGTTTTACTCTGTAGGAAGGTTTCAATCGTTAGAAAGTTCTTTGCCTTTCTGCTTTTCGACCCTCCCTTGACTACATCATTTCTTGCGTGTATTGTTCTTTTCCTATCTTCTTTAGAAGATATTTACTGAAGTTTGACAGATGCTGTATACAATCTCTTCATATAAGTTTGTAACGAGTGAATAATTTGACAGTCATTACTATATATTCTGGTATATAAGACTTTTTCCATGGAACCCAGTTTTGGCCAAGCAAAACCTCTGTATCTTGCTGCAGATGTTTCAGGGTTCTGTATATGTATTATTGCTTAATCTACTCACATTTTTATCAGCAAGTCCAGGGAGTTTGCAGATGCACTAAGCTTATCACAACATGCCTTTGAGTCGCTGGAAGGATCAAGGGCGGAACACTTGGGTCTTCACAAAGCACTTTGTTGGCTCATGGGATGGAGCAGTGAAACGGCGTCGAAGGTTCTATGGGTTCGAAGCATATTGCCCCATGCTGAAGCCTGTGTTTTGAAGGAGGATCTCATTATATGGCCTCCTGTTCTCATCATTCATAACAGTTCTATTGCAATTGATAATACGTCTGAACGGGTAACCATAAGTTATGAAGAGCTTGAGGTTGTTATTAGAGGTAAACCTTGTCCTTGCAGTAAATATGAATTGCTGGGTTTTACATATTTTTTAATACTCAAGTTGCATGTGGATGGCACATTATTCAAGTCCTCTTTTTAAATTAACCACGATATGTTGTGATATAAAGATGTTCTCAAAATCTATTTTAAAAAGTGGTCCATTGATTTTTTTTATGGTATCCAAACTGGGACTAATTATATATATTTTTTTCTTTAAGATAATTACAATCACACTTTGTGACTTGCCCATTTTGAAAATTATATAATTGTATTTTGAAAATAGTTTTAAATACTCTTGTGATTTGCAAAATCTAAACATTGTGCCTTTCTTTTAAGTTTTGCCGTTAATTTTAATTGAATTGGTAGATGGGTTTCTTCTAGGTATGTTATGCGTACCAACTTCACTAAAGTTGACTCACATAAGGGTATGATTTGCAAATTTCTGAAGTATATGGGTTTGTTTGAAACCATTTTTCAG
ATTAAGGGGTATGATTTGTAAAATAGGCAAATTATAAGAGTGTAATTTGCAATTATTAATTTGCAATTATTCTTTTTCTTTTCAGTTAATATGATCCTCCTGTTAGGCCAGACTTTACCAGCTTTTAAGATGAATAATGCATGTTTGTCAGACCGTCTGACATTTGCTAGTCGATAGAATCTTTGTTGTTTAGTAGCTTTTGGTTTCACCATAGTTTTGGTTTCTCCCAAAACAAATTAGCAGGAATGGGTTTTGGAGGGAAGATCAAAGTGGTACGTGGTAAACCTGCAAATCAGAGTATTATGGTAGTAACTTTCGGTGCAATGTTTTCCGGTTTGCAAGAAGCAAAAAGACTTCACAAAAACTTTGCAGATAACAGTCATGGTAGAGACGAGTTCCAGAAAATCAATTCGAGTCATCTCTTTGACAGCCATAGGGATCTGCATAAAGCGGGAGCAAACACGATGGAAAGCATTCTTTATGGCTACTTGGGCCTCGCAGAGGACTTGGATAAACTTGACTTGGAGACCAAGAAGAGGTCTGTGGTGAAAAGCAAGAAAGAAATCCAGGCCATCGTGGATGCATCTCTTCACTGTTAA
mRNA sequence
ATGAACTGGAAAGAAAGGAGTGGTGATAATAGGTCTCGGTCTCCGTCGCTTCGACGGAGAACTTCAGAACCTCAGGTTGAAGAAAACCGGCATTGTCATTCTCACTGGTTTTCGGGTTCTGCACGAGAACGACCGGTGACGAATGGACATGCGGGTTCTTCTGTCAGAGACCATTACTACGAAAGCCGTCTTTATGAGGATAAAGACGAACATTTTCGTAAACTCTCTCAGTTTTGCGAGAATTTGCAGCGTAGGGAATCACCGTCGAAAAAGTTTCGATGGGAAATTTTGTTTGCCAAAAGTCCCGCCAATGCGAATTCGAAATCGAGTCTGGGGTTGAAACATGTAAATGGATGTGATGATGATAATCAAGGACTTGGGGTTTCCGGTTCTCATGTGATTCCAGAATCGTCGTCCAAGGATATTTTGGAAGCTAATAATTTGCGCACATTCCATATGAACATTGGGGCAACTAAAGACAGTAACGTCAACAATGGGGATGCTTCCAGAAGTTTTGGAATCAATGATTGTAGGCATTTGTCTTCATCTAGAACGTTTGATGGGCCCATATACGAGACCAGTGATGTTCATGTTCGGGACCCTCCGATGTTTGAATCAGCAAGAAATTCCCATAAAGGAAAACGAAGCGGAACTTCTTCACATGGGGCACAGGCGTCTCATCCGCACTCCAGTGCACGTGCTACTGAATCTAAAGGCATTTCGCAAGATGAATTTCATGGTTTTTATGAGGGTCGTTTACCTCTAGCCTCTCCGGACTCCACTTGGAAGAAAGAAACGCTTAGAGAACCAGTTGAAACTGAACTGAGTATGGAAAAACTTCTGGAGTGTAAACAAGCTCGGGGGAATCATGTCGAGCACTTTGACGATTGCAATCAGTATTTCAAAGACCAACCATGTAAGAGGAGTGACATTGGTGCTGCTCTCAACAGTCCTTTCTCTCAGCAGATGGTTTGTATCCCACAAGACGACTTCTATCAAGATTCTACGCGGACCAGTGTTGTAATGGATCCAGTTGTCGAGGGATATGAAGAAACTGAAAGCTATGTCATGGGTGATATGGAAGAGAGCCGGCCAAGCGACCACTATGGTCTTTTTAAGGAGCCGTTCAGCATTGAAGGTTCTTATGTGGGCAACGCCCCTTTTGCGATGGAGCGGGATGGCGAAGTTTTGGGTTCTGGAACTGGAAGTCCGCTGAAGTGTGAAAAAGAAGCATATGCAGGTTGCGAGAAGTTGTTCTGGGCAGAAGATGGTTATAAGACAAATTCTGGGAAATGGTCGCATGAGGATGGGTTAAAAGAAACATTTGTATCAAAACATGAACAAGATTTGGGCGTCATGGAAGACAGTAGAAAGCTGAGATGGAAAGCCGCACATTCAACAAAACCGAGGGTCAAGGGAAAATGCTTTGTATCTTCAAGATGCGGAATGCATTATCCTGGGTCTGATCCATCTAGAAAACGTAATGTGTTTAGCAGAATCCATTATTTTAGTCATGGAGATGAAATGAGTACTGTTAAAGATATTGACATCAATCTAAACTGTAGAAACAAGCCGTTGAATGACGAGGATACTTCCATTACCTTCACCTCCTACAAACGGCCGTTACCTTGGCTAAATCATGCCTCTCAGCGTCTAAAGTCTAAACGCAGAGACAGAAAGAAACGTTTGCGGACCTCCTTGAGGGATCCCAGTTCAAACCCTTTAGTTAGAGAACGTAAAAGAAATAAGCGTCTCAGGAACACAAATGTCAATCGCGGGTGCCTTGATGTTCAAGCAGGTGACTGCTTTGAAGAGAAGACAAAAAGTTCAACAAGTAGGCCACCTGAAGATCCCTTGGAGTTGAACCAGCTAATAAAGAATGCCTTTTTCAAGTTTATCAAAGTTCTGAATGAGAATCCAGCCCGACGGAAGAAGTTCACTGAGCCAGGGTCTGGTATTATAAAGTGCATTGTCTGCGGCAGCAAGTCCAGGGAGTTTGC
AGATGCACTAAGCTTATCACAACATGCCTTTGAGTCGCTGGAAGGATCAAGGGCGGAACACTTGGGTCTTCACAAAGCACTTTGTTGGCTCATGGGATGGAGCAGTGAAACGGCGTCGAAGGTTCTATGGGTTCGAAGCATATTGCCCCATGCTGAAGCCTGTGTTTTGAAGGAGGATCTCATTATATGGCCTCCTGTTCTCATCATTCATAACAGTTCTATTGCAATTGATAATACGTCTGAACGGGTAACCATAAGTTATGAAGAGCTTGAGGTTGTTATTAGAGCAGGAATGGGTTTTGGAGGGAAGATCAAAGTGGTACGTGGTAAACCTGCAAATCAGAGTATTATGGTAGTAACTTTCGGTGCAATGTTTTCCGGTTTGCAAGAAGCAAAAAGACTTCACAAAAACTTTGCAGATAACAGTCATGGTAGAGACGAGTTCCAGAAAATCAATTCGAGTCATCTCTTTGACAGCCATAGGGATCTGCATAAAGCGGGAGCAAACACGATGGAAAGCATTCTTTATGGCTACTTGGGCCTCGCAGAGGACTTGGATAAACTTGACTTGGAGACCAAGAAGAGGTCTGTGGTGAAAAGCAAGAAAGAAATCCAGGCCATCGTGGATGCATCTCTTCACTGTTAA
Coding sequence (CDS)
ATGAACTGGAAAGAAAGGAGTGGTGATAATAGGTCTCGGTCTCCGTCGCTTCGACGGAGAACTTCAGAACCTCAGGTTGAAGAAAACCGGCATTGTCATTCTCACTGGTTTTCGGGTTCTGCACGAGAACGACCGGTGACGAATGGACATGCGGGTTCTTCTGTCAGAGACCATTACTACGAAAGCCGTCTTTATGAGGATAAAGACGAACATTTTCGTAAACTCTCTCAGTTTTGCGAGAATTTGCAGCGTAGGGAATCACCGTCGAAAAAGTTTCGATGGGAAATTTTGTTTGCCAAAAGTCCCGCCAATGCGAATTCGAAATCGAGTCTGGGGTTGAAACATGTAAATGGATGTGATGATGATAATCAAGGACTTGGGGTTTCCGGTTCTCATGTGATTCCAGAATCGTCGTCCAAGGATATTTTGGAAGCTAATAATTTGCGCACATTCCATATGAACATTGGGGCAACTAAAGACAGTAACGTCAACAATGGGGATGCTTCCAGAAGTTTTGGAATCAATGATTGTAGGCATTTGTCTTCATCTAGAACGTTTGATGGGCCCATATACGAGACCAGTGATGTTCATGTTCGGGACCCTCCGATGTTTGAATCAGCAAGAAATTCCCATAAAGGAAAACGAAGCGGAACTTCTTCACATGGGGCACAGGCGTCTCATCCGCACTCCAGTGCACGTGCTACTGAATCTAAAGGCATTTCGCAAGATGAATTTCATGGTTTTTATGAGGGTCGTTTACCTCTAGCCTCTCCGGACTCCACTTGGAAGAAAGAAACGCTTAGAGAACCAGTTGAAACTGAACTGAGTATGGAAAAACTTCTGGAGTGTAAACAAGCTCGGGGGAATCATGTCGAGCACTTTGACGATTGCAATCAGTATTTCAAAGACCAACCATGTAAGAGGAGTGACATTGGTGCTGCTCTCAACAGTCCTTTCTCTCAGCAGATGGTTTGTATCCCACAAGACGACTTCTATCAAGATTCTACGCGGACCAGTGTTGTAATGGATCCAGTTGTCGAGGGATATGAAGAAACTGAAAGCTATGTCATGGGTGATATGGAAGAGAGCCGGCCAAGCGACCACTATGGTCTTTTTAAGGAGCCGTTCAGCATTGAAGGTTCTTATGTGGGCAACGCCCCTTTTGCGATGGAGCGGGATGGCGAAGTTTTGGGTTCTGGAACTGGAAGTCCGCTGAAGTGTGAAAAAGAAGCATATGCAGGTTGCGAGAAGTTGTTCTGGGCAGAAGATGGTTATAAGACAAATTCTGGGAAATGGTCGCATGAGGATGGGTTAAAAGAAACATTTGTATCAAAACATGAACAAGATTTGGGCGTCATGGAAGACAGTAGAAAGCTGAGATGGAAAGCCGCACATTCAACAAAACCGAGGGTCAAGGGAAAATGCTTTGTATCTTCAAGATGCGGAATGCATTATCCTGGGTCTGATCCATCTAGAAAACGTAATGTGTTTAGCAGAATCCATTATTTTAGTCATGGAGATGAAATGAGTACTGTTAAAGATATTGACATCAATCTAAACTGTAGAAACAAGCCGTTGAATGACGAGGATACTTCCATTACCTTCACCTCCTACAAACGGCCGTTACCTTGGCTAAATCATGCCTCTCAGCGTCTAAAGTCTAAACGCAGAGACAGAAAGAAACGTTTGCGGACCTCCTTGAGGGATCCCAGTTCAAACCCTTTAGTTAGAGAACGTAAAAGAAATAAGCGTCTCAGGAACACAAATGTCAATCGCGGGTGCCTTGATGTTCAAGCAGGTGACTGCTTTGAAGAGAAGACAAAAAGTTCAACAAGTAGGCCACCTGAAGATCCCTTGGAGTTGAACCAGCTAATAAAGAATGCCTTTTTCAAGTTTATCAAAGTTCTGAATGAGAATCCAGCCCGACGGAAGAAGTTCACTGAGCCAGGGTCTGGTATTATAAAGTGCATTGTCTGCGGCAGCAAGTCCAGGGAGTTTGC
AGATGCACTAAGCTTATCACAACATGCCTTTGAGTCGCTGGAAGGATCAAGGGCGGAACACTTGGGTCTTCACAAAGCACTTTGTTGGCTCATGGGATGGAGCAGTGAAACGGCGTCGAAGGTTCTATGGGTTCGAAGCATATTGCCCCATGCTGAAGCCTGTGTTTTGAAGGAGGATCTCATTATATGGCCTCCTGTTCTCATCATTCATAACAGTTCTATTGCAATTGATAATACGTCTGAACGGGTAACCATAAGTTATGAAGAGCTTGAGGTTGTTATTAGAGCAGGAATGGGTTTTGGAGGGAAGATCAAAGTGGTACGTGGTAAACCTGCAAATCAGAGTATTATGGTAGTAACTTTCGGTGCAATGTTTTCCGGTTTGCAAGAAGCAAAAAGACTTCACAAAAACTTTGCAGATAACAGTCATGGTAGAGACGAGTTCCAGAAAATCAATTCGAGTCATCTCTTTGACAGCCATAGGGATCTGCATAAAGCGGGAGCAAACACGATGGAAAGCATTCTTTATGGCTACTTGGGCCTCGCAGAGGACTTGGATAAACTTGACTTGGAGACCAAGAAGAGGTCTGTGGTGAAAAGCAAGAAAGAAATCCAGGCCATCGTGGATGCATCTCTTCACTGTTAA
Protein sequence
MNWKERSGDNRSRSPSLRRRTSEPQVEENRHCHSHWFSGSARERPVTNGHAGSSVRDHYYESRLYEDKDEHFRKLSQFCENLQRRESPSKKFRWEILFAKSPANANSKSSLGLKHVNGCDDDNQGLGVSGSHVIPESSSKDILEANNLRTFHMNIGATKDSNVNNGDASRSFGINDCRHLSSSRTFDGPIYETSDVHVRDPPMFESARNSHKGKRSGTSSHGAQASHPHSSARATESKGISQDEFHGFYEGRLPLASPDSTWKKETLREPVETELSMEKLLECKQARGNHVEHFDDCNQYFKDQPCKRSDIGAALNSPFSQQMVCIPQDDFYQDSTRTSVVMDPVVEGYEETESYVMGDMEESRPSDHYGLFKEPFSIEGSYVGNAPFAMERDGEVLGSGTGSPLKCEKEAYAGCEKLFWAEDGYKTNSGKWSHEDGLKETFVSKHEQDLGVMEDSRKLRWKAAHSTKPRVKGKCFVSSRCGMHYPGSDPSRKRNVFSRIHYFSHGDEMSTVKDIDINLNCRNKPLNDEDTSITFTSYKRPLPWLNHASQRLKSKRRDRKKRLRTSLRDPSSNPLVRERKRNKRLRNTNVNRGCLDVQAGDCFEEKTKSSTSRPPEDPLELNQLIKNAFFKFIKVLNENPARRKKFTEPGSGIIKCIVCGSKSREFADALSLSQHAFESLEGSRAEHLGLHKALCWLMGWSSETASKVLWVRSILPHAEACVLKEDLIIWPPVLIIHNSSIAIDNTSERVTISYEELEVVIRAGMGFGGKIKVVRGKPANQSIMVVTFGAMFSGLQEAKRLHKNFADNSHGRDEFQKINSSHLFDSHRDLHKAGANTMESILYGYLGLAEDLDKLDLETKKRSVVKSKKEIQAIVDASLHC
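The protein sequence above should correspond to a translation of the coding sequence (CDS) listed before it. A minimal Python sketch of that check follows, assuming the standard genetic code (NCBI translation table 1) and assuming the CDS has first been joined into a single string, since it is wrapped over two lines on this page. The names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each codon to its one-letter amino acid ("*" marks a stop codon).
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First six codons and last three codons of the CDS shown on this page.
print(translate("ATGAACTGGAAAGAAAGG"))  # MNWKER
print(translate("CACTGTTAA"))           # HC
```

The two short fragments reproduce the start (`MNWKER`) and the end (`HC`) of the listed protein; running the full joined CDS through `translate` would be the complete check.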
Homology
BLAST of Sgr023297 vs. NCBI nr
Match:
XP_038900433.1 (uncharacterized protein LOC120087658 [Benincasa hispida])
HSP 1 Score: 1133.2 bits (2930), Expect = 0.0e+00
Identity = 625/888 (70.38%), Positives = 690/888 (77.70%), Query Frame = 0
Query: 1 MNWKERSGDNRSRSP-SLRRRTSEPQVEENRHCHSHWFSGSARERPVTNGHAGSSVRDHY 60
MN++E S D RS+SP S RRTSEP+VEEN HCHS WFS S+RE PVTNG AGSS+RDHY
Sbjct: 1 MNYRETSCDKRSQSPSSFGRRTSEPRVEENPHCHSLWFSRSSREVPVTNGLAGSSIRDHY 60
Query: 61 YESRLYEDKDEHFRKLSQFCENLQRRESPSKKFRWEILFAKSPANANSKSSLGLKHVNGC 120
SRLYE+ DEHFRKLSQ CENLQ RESPSKKFRWE LFA +PANANSKSS+GLKH N C
Sbjct: 61 NGSRLYENTDEHFRKLSQLCENLQ-RESPSKKFRWENLFANNPANANSKSSMGLKHENIC 120
Query: 121 DDDNQGLGVSGSHVIPESSSKDILEANNLRTFHMNIGATKDSNV-NNGDASRSFGINDCR 180
D N+G+ VSGSH+ +SS +IL +NLRTFHMNIG TKDSNV NNGD SRSFGI+DC
Sbjct: 121 DGYNRGIRVSGSHL--GTSSNNILGGSNLRTFHMNIGETKDSNVKNNGDISRSFGIDDCS 180
Query: 181 HLSSSRTFDGPIYETSDVHVRDPPMFESARNSHKGKRSGTSSHGAQASHPHSSARATESK 240
HLSSSR FDGP+YETSDVHVRD P+FESA NSH+G+R+ SSHG QAS+ SSA TESK
Sbjct: 181 HLSSSRKFDGPLYETSDVHVRDRPIFESAENSHRGRRNVASSHGLQASNLQSSAPVTESK 240
Query: 241 GISQDEFHGFYEGRLPLASPDSTWKKETLREPVETELSMEKLLECKQARGNHVEHFDDCN 300
GISQDEFH F LE K+AR N++E FDD N
Sbjct: 241 GISQDEFHDF--------------------------------LEYKRARRNNIEQFDDSN 300
Query: 301 QYFKDQPCKRSDIGAALNSPFSQQMVCIPQDDFYQDSTRTSVVMDPVVEGYEETESYVMG 360
QYF QP KRSDI A LNS FSQQMV IPQDDFYQDSTRTSVVMD VVEG+++TES++
Sbjct: 301 QYFSVQPGKRSDIDATLNSTFSQQMVRIPQDDFYQDSTRTSVVMDSVVEGFKDTESHL-- 360
Query: 361 DMEESRPSDHYGLFKEPFSIEGSYVGNAPFAMERDGEVLGSGTGSPLKCEKEAYAGCEKL 420
E +RP D Y FKEPF IEGSY+G APF ME GE LGSG S +K E+EAY EKL
Sbjct: 361 -EETTRPRDRYDSFKEPFVIEGSYMGTAPFEMELYGEGLGSGAESSMKGEREAYISSEKL 420
Query: 421 FWA-EDGYKTNSGKWSHEDGLKETFVSKHEQDLGVMEDSRKLRWKAAHSTKPRVKGKCFV 480
A EDGY+T GKW HEDG+ + VSKH+QDL ME SRKLRWKA +STK RV+G
Sbjct: 421 LLAEEDGYRTYYGKWLHEDGVNGSLVSKHKQDLSDMEGSRKLRWKATNSTKLRVEG---- 480
Query: 481 SSRCGMHYPGSDPSRKRNVFSRIHYFSHGDEMSTVKDIDINLNCRNKPLNDEDTSITFTS 540
+RC MH PGS SRK NVFSRI + SHGDE VKD DINLNCR+K N+EDTSI TS
Sbjct: 481 -TRCIMHEPGSCSSRKPNVFSRIQFLSHGDENIAVKDTDINLNCRSKWWNEEDTSIYLTS 540
Query: 541 YKRPLPW-LNHASQRLKSKRRDRKKRLRTSLRDPSSNPLVRERKR--NKRLRNTNVNRGC 600
KRPLPW +NHAS K KRRD +KRL LRDPSS+PLVR+RKR NKRLR NVN C
Sbjct: 541 SKRPLPWVINHASPHSKLKRRDLRKRLGFPLRDPSSSPLVRDRKRKKNKRLRKRNVNHSC 600
Query: 601 LDVQAGDCFEEKTKSSTSRPPEDPLELNQLIKNAFFKFIKVLNENPARRKKFTEPGSGII 660
LDVQ D EEK +S TSR ED ELNQLIK+AF KF+KVL+ENPARRKKFTEPG GII
Sbjct: 601 LDVQTDDYMEEKVQSPTSRLLEDQEELNQLIKSAFLKFVKVLSENPARRKKFTEPGCGII 660
Query: 661 KCIVCGSKSREFADALSLSQHAFESLEGSRAEHLGLHKALCWLMGWSSETASKVLWVRSI 720
KCIVCGSKS+EFADALSLSQHA ++LEGSRAEHLGL KALCWLMGWSSE A WVR I
Sbjct: 661 KCIVCGSKSKEFADALSLSQHASQTLEGSRAEHLGLQKALCWLMGWSSEAAPDGRWVRRI 720
Query: 721 LPHAEACVLKEDLIIWPPVLIIHNSSIAIDNTSERVTISYEELEVVIRAGMGFGGKIKVV 780
LP E LKEDLIIWPPVLIIHNSSIAID+ SERV IS EELEVVIR GMG GGKIKVV
Sbjct: 721 LPLEEVLALKEDLIIWPPVLIIHNSSIAIDSPSERVAISCEELEVVIR-GMGCGGKIKVV 780
Query: 781 RGKPANQSIMVVTFGAMFSGLQEAKRLHKNFADNSHGRDEFQKINSSHLFDSHRDLHKA- 840
RGKP NQSIM+VTF AMFSGLQEA+RLHK+FAD SHGRDEFQKI SSHL DSH+DLHKA
Sbjct: 781 RGKPGNQSIMIVTFDAMFSGLQEAERLHKSFADKSHGRDEFQKIYSSHLIDSHKDLHKAT 840
Query: 841 GANTMESILYGYLGLAEDLDKLDLETKKRSVVKSKKEIQAIVDASLHC 882
GANT++++LYGYLGL EDLDKLD ETKKRSVVKSKKEIQAIV+ASLHC
Sbjct: 841 GANTLDNVLYGYLGLTEDLDKLDFETKKRSVVKSKKEIQAIVNASLHC 844
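The percentages reported in each hit header are the match counts divided by the alignment length. A small Python sketch of that arithmetic, using the counts from the first hit (625 identical and 690 similar positions over 888 aligned columns); the helper name `percent` is hypothetical.

```python
def percent(matches: int, aligned_length: int) -> str:
    """Return matches / aligned_length as a percentage with two decimals."""
    return f"{100.0 * matches / aligned_length:.2f}"


# Counts taken from the XP_038900433.1 hit header.
print(percent(625, 888))  # 70.38
print(percent(690, 888))  # 77.70
```

The same calculation applies to the other hits, for example 594/890 and 662/890 for the Cucumis melo match.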
BLAST of Sgr023297 vs. NCBI nr
Match:
XP_008458617.1 (PREDICTED: uncharacterized protein LOC103497964 [Cucumis melo] >KAA0033382.1 uncharacterized protein E6C27_scaffold111G00320 [Cucumis melo var. makuwa])
HSP 1 Score: 1047.3 bits (2707), Expect = 7.2e-302
Identity = 594/890 (66.74%), Positives = 662/890 (74.38%), Query Frame = 0
Query: 1 MNWKERSGDNRSRSPSL-RRRTSEPQVEENRHCHSHWFSGSARERPVTNGHAGSSVRDHY 60
MN +E + D RS+SPSL RRTSEP+VEE HC+SHWFS S+RERP+TN GSS+RDHY
Sbjct: 1 MNSREMNRDKRSQSPSLFGRRTSEPRVEEYPHCYSHWFSRSSRERPMTNELPGSSIRDHY 60
Query: 61 YESRLYEDKDEHFRKLSQFCENLQRRESPSKKFRWEILFAKSP-ANANSKSSLGLKHVNG 120
SRLY KDEHFRKLSQFCENLQ ESP+KKF+WE LF + AN NSK+S+GLKHVNG
Sbjct: 61 NGSRLYFHKDEHFRKLSQFCENLQ-GESPAKKFQWENLFVNNNLANGNSKASMGLKHVNG 120
Query: 121 CDDDNQGLGVSGSHVIPESSSKDILEANNLRTFHMNIGATKDSNV-NNGDASRSFGINDC 180
D DN+G+ VSGSH+ +SSK IL NLRTFHMNIGATKDSNV NNGD SRS GINDC
Sbjct: 121 SDGDNRGIRVSGSHL--GTSSKSIL-GGNLRTFHMNIGATKDSNVKNNGDTSRSVGINDC 180
Query: 181 RHLSSSRTFDGPIYETSDVHVRDPPMFESARNSHKGKRSGTSSHGAQASHPHSSARATES 240
HLSSSR +DGP+++ ++VHVRD P+FE NSH+G+R+ TSS G QASH HSSA ES
Sbjct: 181 NHLSSSRKYDGPLHDINEVHVRDRPIFELVENSHRGRRNETSSRGIQASHLHSSAPVAES 240
Query: 241 KGISQDEFHGFYEGRLPLASPDSTWKKETLREPVETELSMEKLLECKQARGNHVEHFDDC 300
KGISQ EFH LLE K+AR NH+EHFDD
Sbjct: 241 KGISQGEFH--------------------------------DLLEYKRARRNHIEHFDDS 300
Query: 301 NQYFKDQPCKRSDIGAALNSPFSQQMVCIPQDDFYQDSTRTSVVMDPVVEGYEETESYVM 360
NQYF QPCKR+DI A + PFSQ MV IPQDDFY+DSTRTSVVMD VVEG+++TES+
Sbjct: 301 NQYFSVQPCKRTDIDAGPSRPFSQHMVRIPQDDFYRDSTRTSVVMDSVVEGFQDTESHF- 360
Query: 361 GDMEESRPSDHYGLFKEPFSIEGSYVGNAPFAMERDGEVLGSGTGSPLKCEKEAYAGCEK 420
E +RP DH IEGS + APFAME+ EVLGSGT S E+EAY EK
Sbjct: 361 --EETTRPRDHNAF------IEGSCMSTAPFAMEQYVEVLGSGTESSQDGEREAYISSEK 420
Query: 421 LFWA-EDGYKTNSGKWSHEDGLKETFVSKHEQDLGVMEDSRKLRWKAAHSTKPRVKGKCF 480
L EDGY+TN GKW+ EDG+ + VSKH+QDLG MED RKL WKA HSTKPRV+G
Sbjct: 421 LLLVEEDGYRTNFGKWTLEDGVNGSSVSKHKQDLGDMEDRRKLTWKAQHSTKPRVEG--- 480
Query: 481 VSSRCGMHYPGSDPSRKRNVFSRIHYFSHGDEMSTVKDIDINLNCRNKPLNDEDTSITFT 540
+R MH PG +K NVFSRI + +HGD VKD D NLNCRN DEDTS
Sbjct: 481 --ARSKMHDPGPGSFKKPNVFSRIQFLNHGD----VKDTDFNLNCRNNWQVDEDTSF--- 540
Query: 541 SYKRPLPW-LNHASQRLKSKRRDRKKRLRTSLRDPSSNPLV--RERKRNKRLRNTNVNRG 600
S KR LPW +NH S R K KRR+ KKRL L DP+SN LV RERKRNKRLR TNV+ G
Sbjct: 541 SSKRQLPWVVNHVSPRSKLKRRNLKKRLGLPLGDPNSNSLVRERERKRNKRLRKTNVDHG 600
Query: 601 CLDVQAGDCFEEKTKSSTSRPP-EDPLELNQLIKNAFFKFIKVLNENPARRKKFTEPGSG 660
CLDVQ GD EEK +S TSRPP EDP ELNQLIK+AF KF+KVL+ENPARRKK TEPG G
Sbjct: 601 CLDVQTGDYLEEKVQSPTSRPPLEDPEELNQLIKSAFLKFVKVLSENPARRKKLTEPGCG 660
Query: 661 IIKCIVCGSKSREFADALSLSQHAFESLEGSRAEHLGLHKALCWLMGWSSETASKVLWVR 720
II CIVCGSKS+EF DALSLSQHA +LEGSRAEHLGLHKALCWLMGWSSETA LWVR
Sbjct: 661 IITCIVCGSKSKEFVDALSLSQHASRTLEGSRAEHLGLHKALCWLMGWSSETAPNGLWVR 720
Query: 721 SILPHAEACVLKEDLIIWPPVLIIHNSSIAIDNTSERVTISYEELEVVIRAGMGFGGKIK 780
ILP E LKEDLIIWPPVLIIHNSSIAID S+ V IS EELE VIR GMG GGKIK
Sbjct: 721 RILPLEEVLALKEDLIIWPPVLIIHNSSIAIDKLSDGVAISCEELEAVIR-GMGCGGKIK 780
Query: 781 VVRGKPANQSIMVVTFGAMFSGLQEAKRLHKNFADNSHGRDEFQKINSSHLFDSHRDLHK 840
VVRG+P NQSIMVVTFGAMFSGLQEA+RLHK+FAD SHGRDE KIN HL DS+ DLHK
Sbjct: 781 VVRGEPGNQSIMVVTFGAMFSGLQEAERLHKSFADKSHGRDEVHKINLRHLIDSNVDLHK 832
Query: 841 A-GANTMESILYGYLGLAEDLDKLDLETKKRSVVKSKKEIQAIVDASLHC 882
A GANT+ES+LYGYLGLAEDL KLD ETKKRSVVKSKKEIQAIV+ASL C
Sbjct: 841 ATGANTLESVLYGYLGLAEDLVKLDFETKKRSVVKSKKEIQAIVNASLQC 832
BLAST of Sgr023297 vs. NCBI nr
Match:
XP_028110286.1 (uncharacterized protein LOC114308815 isoform X2 [Camellia sinensis] >THG03513.1 hypothetical protein TEA_013723 [Camellia sinensis var. sinensis])
HSP 1 Score: 320.5 bits (820), Expect = 4.6e-83
Identity = 296/924 (32.03%), Positives = 434/924 (46.97%), Query Frame = 0
Query: 6 RSGDNRSRSPSLRRRTSEPQVEENRHCHSHWFSG-SARE---RPVTNGHAGSSVRDHYYE 65
RS D+R+ SP R + + + R HSH+ G + RE R + + H S + Y
Sbjct: 33 RSRDHRTGSPI--RSSVQGRDYSWRERHSHFVPGLNEREKSRRVLGSEHPSGSSQRRDYS 92
Query: 66 SRLYEDKDEHFRKLSQFCENLQRRESPSKKFRWEILFAKSPANANSKSSLGLKHVNGCDD 125
L D D +KLSQF E L +R+SPS KF+W+ L + N + K+ C
Sbjct: 93 KHLDGDGDGELQKLSQFNEALSQRDSPSLKFQWKYLLDEH-KRVNDNVNHSSKNFPDC-- 152
Query: 126 DNQGLG-VSGSHVIPESSSKDILEANNLRT-------FHMNIGATKDSN---VNNGDASR 185
G+G +S + V E + + I A+ R+ HM IG + +G
Sbjct: 153 ---GVGIISSTRVNTERNYQGIGFADTARSGMMVAKPIHMEIGRDRTFPGYLPPSGTWRS 212
Query: 186 SFGINDCRHLSSSRTFDGPIYETSDVHVRD--PPMFESARNSHK--------GKRSGTSS 245
S + L S+ + + + D+ + P AR K + T
Sbjct: 213 SVDTDSGGLLLPSQKLNSTLDKDEDMRFQAHLPADKLPARELLKEEDISKFYSREEKTHC 272
Query: 246 HGAQASH-----PHSSARATESKGISQDEFHGFYEGRLPLASPDSTWKKETLREPVETE- 305
H SH S T G S +++H P+ + L +P+ E
Sbjct: 273 HSRDTSHYAIPSSQSKTSTTVPFGSSMNDYHYSGGNGYPIRLDGFSRSSGLLNDPISHEA 332
Query: 306 --------------LSMEKLLECKQARGNHVEHFDDCNQYFKDQPCKRSDIGAALNSPFS 365
L +E + + + + E Y + Q ++SD+G +L
Sbjct: 333 YTHDSHSNSSRDPTLHLEDMTNYMKVQLSPKEGTQRGYAYSEVQRSEKSDMG-SLTDKLY 392
Query: 366 QQMVCIPQDDFYQDSTRTSVVMDPVVEGYEETESYVMGDMEESRPSDHYGLFKEPFSIEG 425
+MV I D +++S +V +P+++ TES + E R D++ +E
Sbjct: 393 GKMVSIEDDYGHRESLGPRIV-EPIIDRIVVTESSGRERLIEGRLRDYHRSSQEQPISNY 452
Query: 426 SYVGNAPFAMERDGEVLGSGTGSPLKCEKEAYAGCEKLFWAED-GYKTNSGKWSHEDGLK 485
+ +A +++GE LG L+ E+E Y G E ++ +ED GY + S E+ L
Sbjct: 453 PDAARSSYASKKEGEDLGP-RSVHLEYERELYRGHENIYSSEDHGYGIDDHLHSDEERLN 512
Query: 486 ETFVSKHEQDLGVMEDSRKLRW-----KAAHSTKPRVKGKCFVSSRCGMHYPGSDPSRKR 545
+ + ++ L ++D+ + R+ + K +K K V + PGS S R
Sbjct: 513 MSLMEDYDLWLDGVDDNHQDRFIVEELGSLEHPKRMLKRKWDVDKKLIRQNPGSKFSSNR 572
Query: 546 NVFSRIHYFSHGDEM---STVKDIDINLNCRNKPLNDEDTSITFTSYKRPLPWLNHASQR 605
+ +IH + S +++ N N + S P +R
Sbjct: 573 KITGKIHDTKSNKPLLSSSRYRNVGKTFNGMVGHRNCNSNASGSLSACNP--------RR 632
Query: 606 LKSKRRDRKKRLRTSLRDPS-SNPLVRERKRNKRLRNTNVN-RGCLDVQAGDCFEEKTKS 665
LK RD KKRL + P ++ + K LRN G + + G E K
Sbjct: 633 LKYSFRDIKKRLGPVPQHVHIPQPSAKKIRLQKSLRNNQDGYHGSIHAEEGGPSEVKLTP 692
Query: 666 STSRPPEDPLELNQLIKNAFFKFIKVLNENPARRKKFTEPGS-GIIKCIVCGSKSREFAD 725
+ S PPE+ E QL+ +AFFKF+K LNENPA+R++ E G G +KC VCGS S+ F D
Sbjct: 693 AKSEPPENSKEFKQLVHSAFFKFVKQLNENPAQRRRLKEQGKIGSLKCSVCGSDSKVFLD 752
Query: 726 ALSLSQHAFESLE-GSRAEHLGLHKALCWLMGWSSETASKVLWVRSILPHAEACVLKEDL 785
SL HA + + G RA HLG HKA+C LMGW S WV +LP+AE +KED
Sbjct: 753 TKSLVMHACTAPKVGFRARHLGFHKAVCSLMGWKSAEILNSQWVHQVLPNAETVAVKEDF 812
Query: 786 IIWPPVLIIHNSSIAIDNTSERVTISYEELEVVIRAGMGFGGKIKVVRGKPANQSIMVVT 845
IIWPPV++IHNSS+ N RV +S EELE +++ MGFGGK KV RGKPANQSIMVV
Sbjct: 813 IIWPPVVVIHNSSVGNINPDGRVIVSIEELEAILK-DMGFGGKTKVCRGKPANQSIMVVK 872
Query: 846 FGAMFSGLQEAKRLHKNFADNSHGRDEFQKINSSHLFDSHRDLHKAGANTMESILYGYLG 872
F FSGLQEA+RLHK F+ N GR E +++ + +S+ D AN +E +LYGYLG
Sbjct: 873 FSGTFSGLQEAERLHKFFSGNERGRTELKQVTPKN--NSNADDKTQKANKVERVLYGYLG 932
BLAST of Sgr023297 vs. NCBI nr
Match:
XP_034687202.1 (uncharacterized protein LOC117915679 [Vitis riparia])
HSP 1 Score: 320.1 bits (819), Expect = 6.0e-83
Identity = 250/685 (36.50%), Positives = 356/685 (51.97%), Query Frame = 0
Query: 227 HPHSSARATESKGISQDEFHGFYEGRLPLASPDSTWKKETLREPV----ETELSMEKLLE 286
H A++ S +S+D+FH Y+ P + +ET EP + +S +
Sbjct: 325 HMMPLAQSEASSSVSKDDFHRPYKNGPTF--PSDGFSRETNGEPFSWGGDGRMSGFRSPA 384
Query: 287 CKQARGNHVEHFDDCNQYFKDQPC------KRSDIGAALNSPFSQQMVCIPQDDFYQDST 346
+ R F D PC +R D+G + F +M + +D +QD
Sbjct: 385 KPELRPKRQVQFIPTECKIWDHPCPELWRSERGDLGMVYDDQFYGRMANVWRDCDHQDFV 444
Query: 347 RTSVVMDPVVEGYEETESYVMGDMEESRPSDHYGLFKEPFSIEGSYVGNAPFAMERDGEV 406
R SV+ D VV+ ++TES +++SR DH+ +E + + + DGEV
Sbjct: 445 RGSVI-DSVVDRIDDTESSYSNYIKDSRLGDHHNSSQESPIHKYLDASKTQYGIRLDGEV 504
Query: 407 LGS-GTGSPLKCEKEAYAGCEKLFWAEDGY--KTNSGKWSHEDGLKETFVSKHEQDLGVM 466
LGS GT C ++ CE + E GY + ++ W +E+ L H+ GV
Sbjct: 505 LGSRGT-----CRQD----CESMH-QEKGYDFERDADPWPYEEKLP---ALDHDPASGVC 564
Query: 467 ED-SRKLRWKAAHSTKPR-VKGKCFVSSRCGMHYPGSDPSRKRNVFSRIHYFSHGDEMST 526
S L H +K K + + G H P S S R ++I
Sbjct: 565 PQLSLTLEEPGMHEVSENCLKRKRSMDKKMGNHNPRSKLSSNRKTSTKI----------- 624
Query: 527 VKDIDINLNCRNKPLNDEDTSITFTSYK-------RPLPWL-NHASQRLKSKRRDRKKRL 586
NL+ +N+ ED F S + R L + N SQ K +D KKR
Sbjct: 625 -----CNLSNKNEGWASEDIGEIFWSKRLACIHSSRNLSGIQNRLSQPNKPGGKDIKKR- 684
Query: 587 RTSLRDPS----SNPLVRERKRNKRL-RNTNVNRGCLDVQAGDCFEEKTKSSTSRPPEDP 646
S+ P S P+VR+ K +K L R+ + + G L ++ G + K ++ + PE
Sbjct: 685 --SVPGPQNVHISCPVVRKHKSHKFLKRSLDGSHGSLHIE-GVPLKTKVSAAINELPEGS 744
Query: 647 LELNQLIKNAFFKFIKVLNENPARRKKFTEPG-SGIIKCIVCGSKSREFADALSLSQHAF 706
E Q + + F KF+K+LNENPA+R+ +TE G + +KC +CGS S+EF + + L H
Sbjct: 745 EEFKQQVHSMFLKFVKLLNENPAQRRIYTEQGKASNLKCSICGSNSKEFMNTIGLVMHTI 804
Query: 707 ESLE-GSRAEHLGLHKALCWLMGWSSETASKVLWVRSILPHAEACVLKEDLIIWPPVLII 766
S + G R +HLGL KALC LMGW+SE W +LP AE+ LKEDLIIWPPV+I+
Sbjct: 805 MSPKVGLRVQHLGLFKALCLLMGWNSEVTPNKPWAHQVLPAAESLALKEDLIIWPPVVIV 864
Query: 767 HNSSIAIDNTSERVTISYEELEVVIRAGMGF-GGKIKVVRGKPANQSIMVVTFGAMFSGL 826
HNSSI + ER+ ++ + L ++R MGF GGK K+ RGKPANQSIMVV F A FSGL
Sbjct: 865 HNSSIGNSDPDERMIVTIDMLVTILR-DMGFDGGKTKICRGKPANQSIMVVRFNATFSGL 924
Query: 827 QEAKRLHKNFADNSHGRDEFQKIN-SSHLFDSHRDLHKAGANTMESILYGYLGLAEDLDK 880
Q+A++LH +A+N HGR EFQ+IN ++ S R+ KA A+ +E +LYGYLG+A DLDK
Sbjct: 925 QKAEKLHNMYAENQHGRAEFQQINFNNGKTSSCRENRKAQADKVEHVLYGYLGIAGDLDK 972
BLAST of Sgr023297 vs. NCBI nr
Match:
XP_011657058.1 (uncharacterized protein LOC105435801 [Cucumis sativus] >KGN46926.1 hypothetical protein Csa_020974 [Cucumis sativus])
HSP 1 Score: 318.2 bits (814), Expect = 2.3e-82
Identity = 169/222 (76.13%), Positives = 179/222 (80.63%), Query Frame = 0
Query: 661 SKSREFADALSLSQHAFESLEGSRAEHLGLHKALCWLMGWSSETASKVLWVRSILPHAEA 720
SKS+EF DALSL QHA +LEGSRAEHLGLHKALCWLMGWSSE A LWVR ILP E
Sbjct: 34 SKSKEFVDALSLPQHASRTLEGSRAEHLGLHKALCWLMGWSSEIAPNGLWVRMILPPVEV 93
Query: 721 CVLKEDLIIWPPVLIIHNSSIAIDNTSERVTISYEELEVVIRAGMGFGGKIKVVRGKPAN 780
LKEDLIIWP VLIIHNSSIAID E V IS E+LE +RA MG GGK KVVRGK N
Sbjct: 94 LALKEDLIIWPSVLIIHNSSIAIDKRYEGVAISCEKLEAAVRA-MGCGGKFKVVRGKAVN 153
Query: 781 QSIMVVTFGAMFSGLQEAKRLHKNFADNSHGRDEFQKINSSHLFDSHRDLHKA-GANTME 840
QSIMVVTFGAMF GLQEA+RLH NFAD SHGRDEF KIN L DS+ D+HKA GANT+E
Sbjct: 154 QSIMVVTFGAMFYGLQEAERLHNNFADKSHGRDEFHKINLRCLVDSNVDMHKATGANTLE 213
Query: 841 SILYGYLGLAEDLDKLDLETKKRSVVKSKKEIQAIVDASLHC 882
S+ YGYLGL EDLDKLD ETKKRSVV+SKKEIQAIV ASL C
Sbjct: 214 SVRYGYLGLVEDLDKLDFETKKRSVVRSKKEIQAIVHASLQC 254
BLAST of Sgr023297 vs. ExPASy TrEMBL
Match:
A0A5A7SQC0 (XS domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold111G00320 PE=4 SV=1)
HSP 1 Score: 1047.3 bits (2707), Expect = 3.5e-302
Identity = 594/890 (66.74%), Positives = 662/890 (74.38%), Query Frame = 0
Query: 1 MNWKERSGDNRSRSPSL-RRRTSEPQVEENRHCHSHWFSGSARERPVTNGHAGSSVRDHY 60
MN +E + D RS+SPSL RRTSEP+VEE HC+SHWFS S+RERP+TN GSS+RDHY
Sbjct: 1 MNSREMNRDKRSQSPSLFGRRTSEPRVEEYPHCYSHWFSRSSRERPMTNELPGSSIRDHY 60
Query: 61 YESRLYEDKDEHFRKLSQFCENLQRRESPSKKFRWEILFAKSP-ANANSKSSLGLKHVNG 120
SRLY KDEHFRKLSQFCENLQ ESP+KKF+WE LF + AN NSK+S+GLKHVNG
Sbjct: 61 NGSRLYFHKDEHFRKLSQFCENLQ-GESPAKKFQWENLFVNNNLANGNSKASMGLKHVNG 120
Query: 121 CDDDNQGLGVSGSHVIPESSSKDILEANNLRTFHMNIGATKDSNV-NNGDASRSFGINDC 180
D DN+G+ VSGSH+ +SSK IL NLRTFHMNIGATKDSNV NNGD SRS GINDC
Sbjct: 121 SDGDNRGIRVSGSHL--GTSSKSIL-GGNLRTFHMNIGATKDSNVKNNGDTSRSVGINDC 180
Query: 181 RHLSSSRTFDGPIYETSDVHVRDPPMFESARNSHKGKRSGTSSHGAQASHPHSSARATES 240
HLSSSR +DGP+++ ++VHVRD P+FE NSH+G+R+ TSS G QASH HSSA ES
Sbjct: 181 NHLSSSRKYDGPLHDINEVHVRDRPIFELVENSHRGRRNETSSRGIQASHLHSSAPVAES 240
Query: 241 KGISQDEFHGFYEGRLPLASPDSTWKKETLREPVETELSMEKLLECKQARGNHVEHFDDC 300
KGISQ EFH LLE K+AR NH+EHFDD
Sbjct: 241 KGISQGEFH--------------------------------DLLEYKRARRNHIEHFDDS 300
Query: 301 NQYFKDQPCKRSDIGAALNSPFSQQMVCIPQDDFYQDSTRTSVVMDPVVEGYEETESYVM 360
NQYF QPCKR+DI A + PFSQ MV IPQDDFY+DSTRTSVVMD VVEG+++TES+
Sbjct: 301 NQYFSVQPCKRTDIDAGPSRPFSQHMVRIPQDDFYRDSTRTSVVMDSVVEGFQDTESHF- 360
Query: 361 GDMEESRPSDHYGLFKEPFSIEGSYVGNAPFAMERDGEVLGSGTGSPLKCEKEAYAGCEK 420
E +RP DH IEGS + APFAME+ EVLGSGT S E+EAY EK
Sbjct: 361 --EETTRPRDHNAF------IEGSCMSTAPFAMEQYVEVLGSGTESSQDGEREAYISSEK 420
Query: 421 LFWA-EDGYKTNSGKWSHEDGLKETFVSKHEQDLGVMEDSRKLRWKAAHSTKPRVKGKCF 480
L EDGY+TN GKW+ EDG+ + VSKH+QDLG MED RKL WKA HSTKPRV+G
Sbjct: 421 LLLVEEDGYRTNFGKWTLEDGVNGSSVSKHKQDLGDMEDRRKLTWKAQHSTKPRVEG--- 480
Query: 481 VSSRCGMHYPGSDPSRKRNVFSRIHYFSHGDEMSTVKDIDINLNCRNKPLNDEDTSITFT 540
+R MH PG +K NVFSRI + +HGD VKD D NLNCRN DEDTS
Sbjct: 481 --ARSKMHDPGPGSFKKPNVFSRIQFLNHGD----VKDTDFNLNCRNNWQVDEDTSF--- 540
Query: 541 SYKRPLPW-LNHASQRLKSKRRDRKKRLRTSLRDPSSNPLV--RERKRNKRLRNTNVNRG 600
S KR LPW +NH S R K KRR+ KKRL L DP+SN LV RERKRNKRLR TNV+ G
Sbjct: 541 SSKRQLPWVVNHVSPRSKLKRRNLKKRLGLPLGDPNSNSLVRERERKRNKRLRKTNVDHG 600
Query: 601 CLDVQAGDCFEEKTKSSTSRPP-EDPLELNQLIKNAFFKFIKVLNENPARRKKFTEPGSG 660
CLDVQ GD EEK +S TSRPP EDP ELNQLIK+AF KF+KVL+ENPARRKK TEPG G
Sbjct: 601 CLDVQTGDYLEEKVQSPTSRPPLEDPEELNQLIKSAFLKFVKVLSENPARRKKLTEPGCG 660
Query: 661 IIKCIVCGSKSREFADALSLSQHAFESLEGSRAEHLGLHKALCWLMGWSSETASKVLWVR 720
II CIVCGSKS+EF DALSLSQHA +LEGSRAEHLGLHKALCWLMGWSSETA LWVR
Sbjct: 661 IITCIVCGSKSKEFVDALSLSQHASRTLEGSRAEHLGLHKALCWLMGWSSETAPNGLWVR 720
Query: 721 SILPHAEACVLKEDLIIWPPVLIIHNSSIAIDNTSERVTISYEELEVVIRAGMGFGGKIK 780
ILP E LKEDLIIWPPVLIIHNSSIAID S+ V IS EELE VIR GMG GGKIK
Sbjct: 721 RILPLEEVLALKEDLIIWPPVLIIHNSSIAIDKLSDGVAISCEELEAVIR-GMGCGGKIK 780
Query: 781 VVRGKPANQSIMVVTFGAMFSGLQEAKRLHKNFADNSHGRDEFQKINSSHLFDSHRDLHK 840
VVRG+P NQSIMVVTFGAMFSGLQEA+RLHK+FAD SHGRDE KIN HL DS+ DLHK
Sbjct: 781 VVRGEPGNQSIMVVTFGAMFSGLQEAERLHKSFADKSHGRDEVHKINLRHLIDSNVDLHK 832
Query: 841 A-GANTMESILYGYLGLAEDLDKLDLETKKRSVVKSKKEIQAIVDASLHC 882
A GANT+ES+LYGYLGLAEDL KLD ETKKRSVVKSKKEIQAIV+ASL C
Sbjct: 841 ATGANTLESVLYGYLGLAEDLVKLDFETKKRSVVKSKKEIQAIVNASLQC 832
BLAST of Sgr023297 vs. ExPASy TrEMBL
Match:
A0A1S3C894 (uncharacterized protein LOC103497964 OS=Cucumis melo OX=3656 GN=LOC103497964 PE=4 SV=1)
HSP 1 Score: 1047.3 bits (2707), Expect = 3.5e-302
Identity = 594/890 (66.74%), Positives = 662/890 (74.38%), Query Frame = 0
Query: 1 MNWKERSGDNRSRSPSL-RRRTSEPQVEENRHCHSHWFSGSARERPVTNGHAGSSVRDHY 60
MN +E + D RS+SPSL RRTSEP+VEE HC+SHWFS S+RERP+TN GSS+RDHY
Sbjct: 1 MNSREMNRDKRSQSPSLFGRRTSEPRVEEYPHCYSHWFSRSSRERPMTNELPGSSIRDHY 60
Query: 61 YESRLYEDKDEHFRKLSQFCENLQRRESPSKKFRWEILFAKSP-ANANSKSSLGLKHVNG 120
SRLY KDEHFRKLSQFCENLQ ESP+KKF+WE LF + AN NSK+S+GLKHVNG
Sbjct: 61 NGSRLYFHKDEHFRKLSQFCENLQ-GESPAKKFQWENLFVNNNLANGNSKASMGLKHVNG 120
Query: 121 CDDDNQGLGVSGSHVIPESSSKDILEANNLRTFHMNIGATKDSNV-NNGDASRSFGINDC 180
D DN+G+ VSGSH+ +SSK IL NLRTFHMNIGATKDSNV NNGD SRS GINDC
Sbjct: 121 SDGDNRGIRVSGSHL--GTSSKSIL-GGNLRTFHMNIGATKDSNVKNNGDTSRSVGINDC 180
Query: 181 RHLSSSRTFDGPIYETSDVHVRDPPMFESARNSHKGKRSGTSSHGAQASHPHSSARATES 240
HLSSSR +DGP+++ ++VHVRD P+FE NSH+G+R+ TSS G QASH HSSA ES
Sbjct: 181 NHLSSSRKYDGPLHDINEVHVRDRPIFELVENSHRGRRNETSSRGIQASHLHSSAPVAES 240
Query: 241 KGISQDEFHGFYEGRLPLASPDSTWKKETLREPVETELSMEKLLECKQARGNHVEHFDDC 300
KGISQ EFH LLE K+AR NH+EHFDD
Sbjct: 241 KGISQGEFH--------------------------------DLLEYKRARRNHIEHFDDS 300
Query: 301 NQYFKDQPCKRSDIGAALNSPFSQQMVCIPQDDFYQDSTRTSVVMDPVVEGYEETESYVM 360
NQYF QPCKR+DI A + PFSQ MV IPQDDFY+DSTRTSVVMD VVEG+++TES+
Sbjct: 301 NQYFSVQPCKRTDIDAGPSRPFSQHMVRIPQDDFYRDSTRTSVVMDSVVEGFQDTESHF- 360
Query: 361 GDMEESRPSDHYGLFKEPFSIEGSYVGNAPFAMERDGEVLGSGTGSPLKCEKEAYAGCEK 420
E +RP DH IEGS + APFAME+ EVLGSGT S E+EAY EK
Sbjct: 361 --EETTRPRDHNAF------IEGSCMSTAPFAMEQYVEVLGSGTESSQDGEREAYISSEK 420
Query: 421 LFWA-EDGYKTNSGKWSHEDGLKETFVSKHEQDLGVMEDSRKLRWKAAHSTKPRVKGKCF 480
L EDGY+TN GKW+ EDG+ + VSKH+QDLG MED RKL WKA HSTKPRV+G
Sbjct: 421 LLLVEEDGYRTNFGKWTLEDGVNGSSVSKHKQDLGDMEDRRKLTWKAQHSTKPRVEG--- 480
Query: 481 VSSRCGMHYPGSDPSRKRNVFSRIHYFSHGDEMSTVKDIDINLNCRNKPLNDEDTSITFT 540
+R MH PG +K NVFSRI + +HGD VKD D NLNCRN DEDTS
Sbjct: 481 --ARSKMHDPGPGSFKKPNVFSRIQFLNHGD----VKDTDFNLNCRNNWQVDEDTSF--- 540
Query: 541 SYKRPLPW-LNHASQRLKSKRRDRKKRLRTSLRDPSSNPLV--RERKRNKRLRNTNVNRG 600
S KR LPW +NH S R K KRR+ KKRL L DP+SN LV RERKRNKRLR TNV+ G
Sbjct: 541 SSKRQLPWVVNHVSPRSKLKRRNLKKRLGLPLGDPNSNSLVRERERKRNKRLRKTNVDHG 600
Query: 601 CLDVQAGDCFEEKTKSSTSRPP-EDPLELNQLIKNAFFKFIKVLNENPARRKKFTEPGSG 660
CLDVQ GD EEK +S TSRPP EDP ELNQLIK+AF KF+KVL+ENPARRKK TEPG G
Sbjct: 601 CLDVQTGDYLEEKVQSPTSRPPLEDPEELNQLIKSAFLKFVKVLSENPARRKKLTEPGCG 660
Query: 661 IIKCIVCGSKSREFADALSLSQHAFESLEGSRAEHLGLHKALCWLMGWSSETASKVLWVR 720
II CIVCGSKS+EF DALSLSQHA +LEGSRAEHLGLHKALCWLMGWSSETA LWVR
Sbjct: 661 IITCIVCGSKSKEFVDALSLSQHASRTLEGSRAEHLGLHKALCWLMGWSSETAPNGLWVR 720
Query: 721 SILPHAEACVLKEDLIIWPPVLIIHNSSIAIDNTSERVTISYEELEVVIRAGMGFGGKIK 780
ILP E LKEDLIIWPPVLIIHNSSIAID S+ V IS EELE VIR GMG GGKIK
Sbjct: 721 RILPLEEVLALKEDLIIWPPVLIIHNSSIAIDKLSDGVAISCEELEAVIR-GMGCGGKIK 780
Query: 781 VVRGKPANQSIMVVTFGAMFSGLQEAKRLHKNFADNSHGRDEFQKINSSHLFDSHRDLHK 840
VVRG+P NQSIMVVTFGAMFSGLQEA+RLHK+FAD SHGRDE KIN HL DS+ DLHK
Sbjct: 781 VVRGEPGNQSIMVVTFGAMFSGLQEAERLHKSFADKSHGRDEVHKINLRHLIDSNVDLHK 832
Query: 841 A-GANTMESILYGYLGLAEDLDKLDLETKKRSVVKSKKEIQAIVDASLHC 882
A GANT+ES+LYGYLGLAEDL KLD ETKKRSVVKSKKEIQAIV+ASL C
Sbjct: 841 ATGANTLESVLYGYLGLAEDLVKLDFETKKRSVVKSKKEIQAIVNASLQC 832
BLAST of Sgr023297 vs. ExPASy TrEMBL
Match:
A0A5B7BIG5 (XS domain-containing protein OS=Davidia involucrata OX=16924 GN=Din_036972 PE=4 SV=1)
HSP 1 Score: 321.6 bits (823), Expect = 1.0e-83
Identity = 313/958 (32.67%), Positives = 446/958 (46.56%), Query Frame = 0
Query: 8 GDNRSRSP------SLRRRTSEPQVEENRHCHSHWFSGSARERP----VTNGHAGS---- 67
G+ R+R P +R + ENR HS S S+RE +G +GS
Sbjct: 9 GEIRTRFPRQDPWARVRDYENSSSRHENRERHSRHVSSSSREPERSGRFLSGESGSRGLV 68
Query: 68 SVRDHYY----ESRLYEDKDEHFRKLSQFCENLQRRESPSKKFRWEILFA---KSPANAN 127
RDH +S D+D R+LSQF E+ + RES S KF+WE L + KS N N
Sbjct: 69 ERRDHDRNVDGDSDRDRDRDSQQRRLSQFNEDSKCRESTSMKFQWEHLLSEPLKSNVNMN 128
Query: 128 SKS------SLGLKHVNGCDDDNQGLGVS----GSHVIPESSSKDILEANNLRTFHMNIG 187
S L V+ + + QG G S ++ + SS +I A + +G
Sbjct: 129 QTSKHSTDDGLSASRVS-TERNYQGSGFSDVGRSGVLVEKPSSVEIERARIHSPYPPEVG 188
Query: 188 ATKDSNVNNGDASRSFGINDCRHLSSSRTFDGPIYETSDVHVRD-------PPMFESARN 247
++ S +G SR+ + G +T D+H +D P SAR
Sbjct: 189 ISRSSGPLDGGFSRNMNV-------------GLHKDTEDLHFQDHLHMNKLPADRLSARE 248
Query: 248 SHKGKRSGTSSHGAQASHPHSSARATESKG----ISQDEFHGFYEGRLPLASPDSTWKKE 307
K + S P S A AT S I D F L + + +
Sbjct: 249 EEKPNLYLRDTCHYMISSPQSKAFATGSYNDGPHIHSDGFSQSSGMITKLVARNGHCQTL 308
Query: 308 TLREPVETELSMEKLLECKQARGNHVEH-FDDCNQYFKDQPCKRSDIGAALNSPFSQQMV 367
++ P E E+ + ++ + + + E + D F R++ G + S
Sbjct: 309 HMKSPAEIEVPLNDVMNYRHPQPSPSERKYRD----FMYPEVGRNEKGGKMTS------- 368
Query: 368 CIPQDDFYQDSTRTSVVMDPVVEGYEETESYVMGDMEESRPSDHYGLFKEPFSIEGSYVG 427
Q++F + ++ ++DP+ TE + ESR H +E
Sbjct: 369 --VQEEFGHRDSLSAGIVDPIFNRINVTEGSHRSHLSESRQRVHSRSSQEQPISNYLDAS 428
Query: 428 NAPFAMERDGEVLGSGTGSPLKCEKEAYAGCEKLFWAED-GYKTNSGKWSHEDGLKETFV 487
+ ++DGEV+ + L E++ + G E L + ED G+ ++ S E+ L +
Sbjct: 429 GTSHSRQQDGEVV-VYRNTHLDYERQVHLGHECLPFGEDYGHGKDAVPRSLEERLDMLPM 488
Query: 488 SKHE----------------QDLGVMEDSRKLRWKAAHSTKPRV-------------KGK 547
++ ++LG+ E SRK+ K H +V K
Sbjct: 489 IDYDPHLNGIDGGSRKRSTVEELGLHEPSRKM-LKQKHGVDKKVTRHHPINKFLSNGKTT 548
Query: 548 CFVSSRCGM--HYPGS--DP--SRKRNVFSRIHYFSHG---DEMSTVKDIDINLNCRNKP 607
C + G + G DP K++ FS Y G DEM K ++
Sbjct: 549 CKIQELGGAAEQWTGEEMDPLFLPKKSRFSHTQYRKAGSTSDEMGRQK-----ISYSGNR 608
Query: 608 LNDEDTSITFTSYKRPLPWLNHASQRLKSKRRDRKKRLRTSLRDPS-SNPLVRERKRNKR 667
L+ + S++ H S+ K RD KKRLR + S+PLV++ K NK
Sbjct: 609 LSSNNLSVSMP---------RHLSKPHKIGARDIKKRLRPGPPNVHISHPLVKKYKPNKF 668
Query: 668 LRNTNVN-RGCLDVQAGDCFEEKTKSSTSRPPEDPLELNQLIKNAFFKFIKVLNENPARR 727
R + + G L VQ G E + + PPE + QL+ +AFFKF+K LNEN A+R
Sbjct: 669 QRRIHGDFHGNLHVQRGGPTEVVVTPAKTEPPESSEDFKQLVHSAFFKFVKQLNENLAQR 728
Query: 728 KKFTEPGSGI-IKCIVCGSKSREFADALSLSQHAFESLE-GSRAEHLGLHKALCWLMGWS 787
++F G +KC +CGS S EF D SL HA S + G +A+HLG HKALC LMGW
Sbjct: 729 RRFMVQGKADGLKCSICGSNSNEFVDTESLVMHACTSPKVGFKAQHLGFHKALCVLMGWK 788
Query: 788 SETASKVLWVRSILPHAEACVLKEDLIIWPPVLIIHNSSIAIDNTSERVTISYEELEVVI 847
S A W+ +LP E+ LKEDLIIWPPV++IHNSSI+ N ERV +S E LE ++
Sbjct: 789 SAVALNRSWICQVLPDNESLALKEDLIIWPPVVVIHNSSISNYNPDERVIVSIEVLEALL 848
Query: 848 RAGMGFGGKIKVVRGKPANQSIMVVTFGAMFSGLQEAKRLHKNFADNSHGRDEFQKINSS 880
MGFG K KV RGKPANQSIMVV F + FSGLQEA+RLHK + +N GR EFQ++N +
Sbjct: 849 -GDMGFGEKTKVCRGKPANQSIMVVKFNSTFSGLQEAERLHKFYTENKRGRAEFQQVNQN 908
BLAST of Sgr023297 vs. ExPASy TrEMBL
Match:
A0A4S4DKT0 (XS domain-containing protein OS=Camellia sinensis var. sinensis OX=542762 GN=TEA_013723 PE=4 SV=1)
HSP 1 Score: 320.5 bits (820), Expect = 2.2e-83
Identity = 296/924 (32.03%), Positives = 434/924 (46.97%), Query Frame = 0
Query: 6 RSGDNRSRSPSLRRRTSEPQVEENRHCHSHWFSG-SARE---RPVTNGHAGSSVRDHYYE 65
RS D+R+ SP R + + + R HSH+ G + RE R + + H S + Y
Sbjct: 33 RSRDHRTGSPI--RSSVQGRDYSWRERHSHFVPGLNEREKSRRVLGSEHPSGSSQRRDYS 92
Query: 66 SRLYEDKDEHFRKLSQFCENLQRRESPSKKFRWEILFAKSPANANSKSSLGLKHVNGCDD 125
L D D +KLSQF E L +R+SPS KF+W+ L + N + K+ C
Sbjct: 93 KHLDGDGDGELQKLSQFNEALSQRDSPSLKFQWKYLLDEH-KRVNDNVNHSSKNFPDC-- 152
Query: 126 DNQGLG-VSGSHVIPESSSKDILEANNLRT-------FHMNIGATKDSN---VNNGDASR 185
G+G +S + V E + + I A+ R+ HM IG + +G
Sbjct: 153 ---GVGIISSTRVNTERNYQGIGFADTARSGMMVAKPIHMEIGRDRTFPGYLPPSGTWRS 212
Query: 186 SFGINDCRHLSSSRTFDGPIYETSDVHVRD--PPMFESARNSHK--------GKRSGTSS 245
S + L S+ + + + D+ + P AR K + T
Sbjct: 213 SVDTDSGGLLLPSQKLNSTLDKDEDMRFQAHLPADKLPARELLKEEDISKFYSREEKTHC 272
Query: 246 HGAQASH-----PHSSARATESKGISQDEFHGFYEGRLPLASPDSTWKKETLREPVETE- 305
H SH S T G S +++H P+ + L +P+ E
Sbjct: 273 HSRDTSHYAIPSSQSKTSTTVPFGSSMNDYHYSGGNGYPIRLDGFSRSSGLLNDPISHEA 332
Query: 306 --------------LSMEKLLECKQARGNHVEHFDDCNQYFKDQPCKRSDIGAALNSPFS 365
L +E + + + + E Y + Q ++SD+G +L
Sbjct: 333 YTHDSHSNSSRDPTLHLEDMTNYMKVQLSPKEGTQRGYAYSEVQRSEKSDMG-SLTDKLY 392
Query: 366 QQMVCIPQDDFYQDSTRTSVVMDPVVEGYEETESYVMGDMEESRPSDHYGLFKEPFSIEG 425
+MV I D +++S +V +P+++ TES + E R D++ +E
Sbjct: 393 GKMVSIEDDYGHRESLGPRIV-EPIIDRIVVTESSGRERLIEGRLRDYHRSSQEQPISNY 452
Query: 426 SYVGNAPFAMERDGEVLGSGTGSPLKCEKEAYAGCEKLFWAED-GYKTNSGKWSHEDGLK 485
+ +A +++GE LG L+ E+E Y G E ++ +ED GY + S E+ L
Sbjct: 453 PDAARSSYASKKEGEDLGP-RSVHLEYERELYRGHENIYSSEDHGYGIDDHLHSDEERLN 512
Query: 486 ETFVSKHEQDLGVMEDSRKLRW-----KAAHSTKPRVKGKCFVSSRCGMHYPGSDPSRKR 545
+ + ++ L ++D+ + R+ + K +K K V + PGS S R
Sbjct: 513 MSLMEDYDLWLDGVDDNHQDRFIVEELGSLEHPKRMLKRKWDVDKKLIRQNPGSKFSSNR 572
Query: 546 NVFSRIHYFSHGDEM---STVKDIDINLNCRNKPLNDEDTSITFTSYKRPLPWLNHASQR 605
+ +IH + S +++ N N + S P +R
Sbjct: 573 KITGKIHDTKSNKPLLSSSRYRNVGKTFNGMVGHRNCNSNASGSLSACNP--------RR 632
Query: 606 LKSKRRDRKKRLRTSLRDPS-SNPLVRERKRNKRLRNTNVN-RGCLDVQAGDCFEEKTKS 665
LK RD KKRL + P ++ + K LRN G + + G E K
Sbjct: 633 LKYSFRDIKKRLGPVPQHVHIPQPSAKKIRLQKSLRNNQDGYHGSIHAEEGGPSEVKLTP 692
Query: 666 STSRPPEDPLELNQLIKNAFFKFIKVLNENPARRKKFTEPGS-GIIKCIVCGSKSREFAD 725
+ S PPE+ E QL+ +AFFKF+K LNENPA+R++ E G G +KC VCGS S+ F D
Sbjct: 693 AKSEPPENSKEFKQLVHSAFFKFVKQLNENPAQRRRLKEQGKIGSLKCSVCGSDSKVFLD 752
Query: 726 ALSLSQHAFESLE-GSRAEHLGLHKALCWLMGWSSETASKVLWVRSILPHAEACVLKEDL 785
SL HA + + G RA HLG HKA+C LMGW S WV +LP+AE +KED
Sbjct: 753 TKSLVMHACTAPKVGFRARHLGFHKAVCSLMGWKSAEILNSQWVHQVLPNAETVAVKEDF 812
Query: 786 IIWPPVLIIHNSSIAIDNTSERVTISYEELEVVIRAGMGFGGKIKVVRGKPANQSIMVVT 845
IIWPPV++IHNSS+ N RV +S EELE +++ MGFGGK KV RGKPANQSIMVV
Sbjct: 813 IIWPPVVVIHNSSVGNINPDGRVIVSIEELEAILK-DMGFGGKTKVCRGKPANQSIMVVK 872
Query: 846 FGAMFSGLQEAKRLHKNFADNSHGRDEFQKINSSHLFDSHRDLHKAGANTMESILYGYLG 872
F FSGLQEA+RLHK F+ N GR E +++ + +S+ D AN +E +LYGYLG
Sbjct: 873 FSGTFSGLQEAERLHKFFSGNERGRTELKQVTPKN--NSNADDKTQKANKVERVLYGYLG 932
BLAST of Sgr023297 vs. ExPASy TrEMBL
Match:
A0A0A0KGN5 (XS domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G151630 PE=4 SV=1)
HSP 1 Score: 318.2 bits (814), Expect = 1.1e-82
Identity = 169/222 (76.13%), Positives = 179/222 (80.63%), Query Frame = 0
Query: 661 SKSREFADALSLSQHAFESLEGSRAEHLGLHKALCWLMGWSSETASKVLWVRSILPHAEA 720
SKS+EF DALSL QHA +LEGSRAEHLGLHKALCWLMGWSSE A LWVR ILP E
Sbjct: 34 SKSKEFVDALSLPQHASRTLEGSRAEHLGLHKALCWLMGWSSEIAPNGLWVRMILPPVEV 93
Query: 721 CVLKEDLIIWPPVLIIHNSSIAIDNTSERVTISYEELEVVIRAGMGFGGKIKVVRGKPAN 780
LKEDLIIWP VLIIHNSSIAID E V IS E+LE +RA MG GGK KVVRGK N
Sbjct: 94 LALKEDLIIWPSVLIIHNSSIAIDKRYEGVAISCEKLEAAVRA-MGCGGKFKVVRGKAVN 153
Query: 781 QSIMVVTFGAMFSGLQEAKRLHKNFADNSHGRDEFQKINSSHLFDSHRDLHKA-GANTME 840
QSIMVVTFGAMF GLQEA+RLH NFAD SHGRDEF KIN L DS+ D+HKA GANT+E
Sbjct: 154 QSIMVVTFGAMFYGLQEAERLHNNFADKSHGRDEFHKINLRCLVDSNVDMHKATGANTLE 213
Query: 841 SILYGYLGLAEDLDKLDLETKKRSVVKSKKEIQAIVDASLHC 882
S+ YGYLGL EDLDKLD ETKKRSVV+SKKEIQAIV ASL C
Sbjct: 214 SVRYGYLGLVEDLDKLDFETKKRSVVRSKKEIQAIVHASLQC 254
BLAST of Sgr023297 vs. TAIR 10
Match:
AT3G22430.1 (CONTAINS InterPro DOMAIN/s: Domain of unknown function XS (InterPro:IPR005380); BEST Arabidopsis thaliana protein match is: XS domain-containing protein / XS zinc finger domain-containing protein-related (TAIR:AT5G23570.1); Has 565 Blast hits to 510 proteins in 121 species: Archae - 2; Bacteria - 90; Metazoa - 191; Fungi - 32; Plants - 51; Viruses - 4; Other Eukaryotes - 195 (source: NCBI BLink). )
HSP 1 Score: 136.0 bits (341), Expect = 1.5e-31
Identity = 90/269 (33.46%), Positives = 140/269 (52.04%), Query Frame = 0
Query: 611 TSRPPEDPLELNQL-IKNAFFKFIKVLNENPARRKKFTEPG-SGIIKCIVCGSKSREFAD 670
+SR +++Q+ +K +F F+K + E+P +K + E G G ++C+VCG S++ D
Sbjct: 238 SSRHDNGGFQVDQVALKKSFLGFVKRVFEDPMEKKNYLENGRKGRLQCLVCGRSSKDVQD 297
Query: 671 ALSLSQHAFESLE-GSRAEHLGLHKALCWLMGWSSETASKVLWVRSILPHAEACVLKEDL 730
SL H + S + SR HLGLHKALC LMGW+ A LP EA + + L
Sbjct: 298 THSLVMHTYCSDDSSSRVHHLGLHKALCVLMGWNFSKAPDNSKAYQNLPADEAAINQAQL 357
Query: 731 IIWPPVLIIHNSSIAIDNTSERVTISYEELEVVIRAGMGFGGKIKVVRGKPANQSIMVVT 790
IIWPP +I+ N+S + ++ IR GGK K + G+ + I +
Sbjct: 358 IIWPPHVIVQNTSTGKGKEGRMEGFGNKTMDNRIRELGLTGGKSKSLYGREGHLGITLFK 417
Query: 791 FGAMFSGLQEAKRLHKNFADNSHGRDEF---QKINSSHLFDSHRDLHKAGANTMES--IL 850
F SGL++A R+ + F + GR + Q + S + + L + T E I
Sbjct: 418 FAGDDSGLRDAMRMAEYFEKINRGRKSWGRVQPLTPSKDDEKNPGLVEVDGRTGEKKRIF 477
Query: 851 YGYLGLAEDLDKLDLETKKRSVVKSKKEI 872
YGYL DLDK+D+ETKK++ ++S +E+
Sbjct: 478 YGYLATVTDLDKVDVETKKKTTIESLREL 506
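The Identity and Positives figures in each HSP header above are ratios over the alignment length: identical columns, or columns with a positive substitution score, divided by the number of aligned columns (gap columns included). The following is a minimal Python sketch, assuming the header line is available as a plain string, that re-derives the percentages from the raw counts. The function name `parse_hsp_stats` and the regular expression are illustrative and not part of any BLAST tool.

```python
import re

# Matches e.g. "Identity = 313/958 (32.67%), Positives = 446/958 (46.56%)".
# "Posi?tives" also accepts the misspelling "Postives" that some report
# generators emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from one HSP header line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    identical, length, _, positives, pos_length, _ = match.groups()
    identical, length = int(identical), int(length)
    positives, pos_length = int(positives), int(pos_length)
    if length != pos_length:
        raise ValueError("identity and positives use different lengths")
    return {
        "identical": identical,
        "positives": positives,
        "length": length,
        # Percentages are recomputed from the counts, not copied from the text.
        "identity_pct": round(100.0 * identical / length, 2),
        "positives_pct": round(100.0 * positives / length, 2),
    }


if __name__ == "__main__":
    header = "Identity = 169/222 (76.13%), Positives = 179/222 (80.63%), Query Frame = 0"
    print(parse_hsp_stats(header))
```

Recomputing the percentages from the counts, rather than trusting the printed values, gives a quick consistency check when hits from several databases are compared side by side.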
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038900433.1 | 0.0e+00 | 70.38 | uncharacterized protein LOC120087658 [Benincasa hispida]
XP_008458617.1 | 7.2e-302 | 66.74 | PREDICTED: uncharacterized protein LOC103497964 [Cucumis melo] >KAA0033382.1 unc...
XP_028110286.1 | 4.6e-83 | 32.03 | uncharacterized protein LOC114308815 isoform X2 [Camellia sinensis] >THG03513.1 ...
XP_034687202.1 | 6.0e-83 | 36.50 | uncharacterized protein LOC117915679 [Vitis riparia]
XP_011657058.1 | 2.3e-82 | 76.13 | uncharacterized protein LOC105435801 [Cucumis sativus] >KGN46926.1 hypothetical ...
Match Name | E-value | Identity (%) | Description
A0A5A7SQC0 | 3.5e-302 | 66.74 | XS domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_sca...
A0A1S3C894 | 3.5e-302 | 66.74 | uncharacterized protein LOC103497964 OS=Cucumis melo OX=3656 GN=LOC103497964 PE=...
A0A5B7BIG5 | 1.0e-83 | 32.67 | XS domain-containing protein OS=Davidia involucrata OX=16924 GN=Din_036972 PE=4 ...
A0A4S4DKT0 | 2.2e-83 | 32.03 | XS domain-containing protein OS=Camellia sinensis var. sinensis OX=542762 GN=TEA...
A0A0A0KGN5 | 1.1e-82 | 76.13 | XS domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G151630 PE=4 SV=...
Match Name | E-value | Identity (%) | Description
AT3G22430.1 | 1.5e-31 | 33.46 | CONTAINS InterPro DOMAIN/s: Domain of unknown function XS (InterPro:IPR005380); ...