BLAST of CmoCh02G004750 vs. Swiss-Prot
Match:
ATX10_BOVIN (Ataxin-10 OS=Bos taurus GN=ATXN10 PE=2 SV=1)
HSP 1 Score: 110.2 bits (274), Expect = 6.8e-23
Identity = 106/428 (24.77%), Postives = 196/428 (45.79%), Query Frame = 1
Query: 70 NALQLSS--LRLLRNLCAGEIRNQNVFIEQNGVGVVLSILQNAMLLFDPDRV-------I 129
++LQL + R LRN C NQN +GV + ++ LLF RV
Sbjct: 76 SSLQLITECFRCLRNACIECSVNQNSIRNLGTIGVAVDLI----LLFRELRVEQDSLLTA 135
Query: 130 IRLGLQVLANVSLAGEEHQQAIWHGLFPDKFVSLARIRYCEISDPLSMILYNLCSTNSEL 189
R GLQ L N++ E+ Q +W FP+ F+S +I SMIL+ S NSE
Sbjct: 136 FRCGLQFLGNIASRNEDSQSVVWMHAFPELFLSCLNHPDRKIVAYSSMILFT--SLNSER 195
Query: 190 VASLCSD--VGLPILEEITRTTTLVGFKEDWVKLLLSRICLEEPYFPRLFSALRPIDTSK 249
+ L + + + ++E + + +W L+++ L+ P + A
Sbjct: 196 MKELEENLNIAIDVVEAHQKQP-----ESEWPFLIITDHFLKSPELVKAMYA-------- 255
Query: 250 DGGKDMSFSSEQAFLLTIISEILNERIGDISIPKDFASCIHRIFQSSIPIISSTPICERS 309
S+++ +T++ ++ + +GD + KD A +F S +I+ST + +
Sbjct: 256 ------KMSNQER--VTLLDLMIAKIVGDEPLTKDDAP----VFLSHAELIASTFVDQCK 315
Query: 310 LPTGTTAVDVLG-----YSLNILRDICAQEDGKEGGHKDVSKDAVDVLLSL-GLIDLLLG 369
+ T+ ++ +L +C K + D + L GL++ ++
Sbjct: 316 IVLKLTSEQHTDDEEALATIRLLDVLC---------EKTANTDLLGYLQVFPGLLERVID 375
Query: 370 ILRDIEPPAIVKKAIQQAENENRTDLPNTSKSCPCPYKGFRRDIVAVIANCLYRKKHVQD 429
+LR I I A + D +S + +GF+ ++ +I N Y+ K QD
Sbjct: 376 LLRLIHVAGNDSTNIFSACASIKADGDVSSVA-----EGFKSHLIRLIGNLCYKNKDNQD 435
Query: 430 DIRKKNGVFVLLQQCVVDENNPFLREWGIWAVRNLLEGNLENKKLVAELEVQGPVNMPEI 481
+ + +G+ ++L C +D++NPFL +W ++A+RNL E N +N+ L+A++E QG + +
Sbjct: 436 KVNELDGIPLILDSCGLDDSNPFLTQWVVYAIRNLTEDNSQNQDLIAKMEEQGLADASLL 458
BLAST of CmoCh02G004750 vs. Swiss-Prot
Match:
ATX10_RAT (Ataxin-10 OS=Rattus norvegicus GN=Atxn10 PE=1 SV=1)
HSP 1 Score: 104.8 bits (260), Expect = 2.9e-21
Identity = 116/481 (24.12%), Postives = 213/481 (44.28%), Query Frame = 1
Query: 31 SLEALIEASKSVEGRSNFASQNILPCVLELIQCLDYTSNNALQLSSL-----------RL 90
+L AL + ++ E Q +L + + Q ++ + Q+ L R
Sbjct: 28 ALTALFKEQRNRETAPRTIFQRVLDILKKSTQAVELACRDPSQVEHLASSLQLITECFRC 87
Query: 91 LRNLCAGEIRNQNVFIEQNGVGVVLSILQNAMLLFDPDRV-------IIRLGLQVLANVS 150
LRN C NQN + +GV + ++ LLF RV R GLQ L NV+
Sbjct: 88 LRNACIECSVNQNSIRNLDTIGVAVDLV----LLFRELRVEQDSLLTAFRCGLQFLGNVA 147
Query: 151 LAGEEHQQAIWHGLFPDKFVSLARIRYCEISDPLSMILYNLCSTNSELVASLCSDVGLPI 210
E+ Q +W FP+ F+S +I SMIL+ S NSE + L ++ + I
Sbjct: 148 SRNEDSQSIVWVHAFPELFMSCLNHPDKKIVAYCSMILFT--SLNSERMKDLEENLNIAI 207
Query: 211 --LEEITRTTTLVGFKEDWVKLLLSRICLEEPYFPRLFSALRPIDTSKDGGKDMSFSSEQ 270
+E + + +W L+++ L+ P L A+ GK S+++
Sbjct: 208 NVIEAHQKHP-----ESEWPFLIITDHFLKSP---ELVEAMY--------GK---LSNQE 267
Query: 271 AFLLTIISEILNERIGDISIPKDFASCIHRIFQSSIPIISSTPICERSLPTGTTAVDVLG 330
+T++ ++ + +GD + KD S IF +I+++ + +
Sbjct: 268 R--VTLLDIMIAKIVGDEQLTKDDIS----IFLRHAELIANSFVDQ-------------- 327
Query: 331 YSLNILRDICAQEDGKEGGHKDVSKDAVDVLLSLGLIDLLLGILRDIEPPAIVKKAIQ-- 390
N+L+ E E V+ +DVL + LLG L+ P ++++ I
Sbjct: 328 -CRNVLK--LTSEPQTEDKEALVTIRLLDVLCEMTSNTELLGYLQVF--PGLMERVIDVL 387
Query: 391 ---QAENENRTDLPNTSKSCPCP------YKGFRRDIVAVIANCLYRKKHVQDDIRKKNG 450
+ ++ T++ + S S +GF+ ++ +I N Y+ K QD + + +G
Sbjct: 388 RVIHSVGKDSTNIFSPSDSLKAEGDIEHMTEGFKSHLIRLIGNLCYKNKENQDKVNELDG 447
Query: 451 VFVLLQQCVVDENNPFLREWGIWAVRNLLEGNLENKKLVAELEVQGPVNMPEIAELGLQV 481
+ ++L +D+NNPF+ +W ++AVRNL E N +N+ +A++E QG + + ++G +V
Sbjct: 448 IPLILDSSNIDDNNPFMMQWVVYAVRNLTEDNSQNQDFIAKMEEQGLADASLLKKMGFEV 458
BLAST of CmoCh02G004750 vs. Swiss-Prot
Match:
ATX10_MOUSE (Ataxin-10 OS=Mus musculus GN=Atxn10 PE=1 SV=2)
HSP 1 Score: 103.6 bits (257), Expect = 6.4e-21
Identity = 108/433 (24.94%), Postives = 194/433 (44.80%), Query Frame = 1
Query: 70 NALQLSS--LRLLRNLCAGEIRNQNVFIEQNGVGVVLSILQNAMLLFDPDRV-------I 129
++LQL + R LRN C NQN + +GV + ++ LLF RV
Sbjct: 76 SSLQLITECFRCLRNACIECSVNQNSIRNLDTIGVAVDLV----LLFRELRVEQDSLLTA 135
Query: 130 IRLGLQVLANVSLAGEEHQQAIWHGLFPDKFVSLARIRYCEISDPLSMILYNLCSTNSEL 189
R GLQ L NV+ EE Q +W FP+ F+S +I SMIL+ S N+E
Sbjct: 136 FRCGLQFLGNVASRNEESQSIVWVHAFPELFMSCLNHPDKKIVAYCSMILFT--SLNAER 195
Query: 190 VASLCSDVGLPI--LEEITRTTTLVGFKEDWVKLLLSRICLEEPYFPRLFSALRPIDTSK 249
+ L ++ + I +E + +W L++S L+ P L A+
Sbjct: 196 MKDLEENLNIAINVIEAHQKHPA-----SEWPFLIISDHFLKSP---ELVEAMY------ 255
Query: 250 DGGKDMSFSSEQAFLLTIISEILNERIGDISIPKDFASCIHRIFQSSIPIISSTPICERS 309
GK S+++ +T++ ++ + +G+ + KD S IF +I+++
Sbjct: 256 --GK---LSNQER--ITLLDIVIAKLVGEEQLTKDDIS----IFVRHAELIANS------ 315
Query: 310 LPTGTTAVDVLGYSLNILRDICAQEDGKEGGHKDVSKDAVDVLLSLGLIDLLLGILRDIE 369
+ N+L+ E E V+ +DVL + LLG L+
Sbjct: 316 ---------FMDQCRNVLK--LTSEPHTEDKEALVTIRLLDVLCEMTSNTELLGYLQVF- 375
Query: 370 PPAIVKKAIQQAENENRTDLPNTSKSCPCPY-----------KGFRRDIVAVIANCLYRK 429
P ++++ I + +T+ P +GF+ ++ +I N Y+
Sbjct: 376 -PGLMERVIDVLRVIHEVGKESTNIFSPSDSLKAEGDIEHMTEGFKSHLIRLIGNLCYKN 435
Query: 430 KHVQDDIRKKNGVFVLLQQCVVDENNPFLREWGIWAVRNLLEGNLENKKLVAELEVQGPV 481
K QD + + +G+ ++L +D+NNPF+ +W ++AVRNL E N +N+ ++A++E QG
Sbjct: 436 KENQDKVNELDGIPLILDSSNIDDNNPFMMQWVVYAVRNLTEDNSQNQDVIAKMEEQGLA 458
BLAST of CmoCh02G004750 vs. Swiss-Prot
Match:
ATX10_DICDI (Ataxin-10 homolog OS=Dictyostelium discoideum GN=atxn10 PE=3 SV=1)
HSP 1 Score: 94.0 bits (232), Expect = 5.1e-18
Identity = 41/111 (36.94%), Postives = 71/111 (63.96%), Query Frame = 1
Query: 390 KGFRRDIVAVIANCLYRKKHVQDDIRKKNGVFVLLQQCVVDENNPFLREWGIWAVRNLLE 449
KGF+ +++ ++ N Y+ + QD+IR+ G+ ++L C D NNP+++EW ++A+RNL E
Sbjct: 498 KGFKIELIRILGNLSYKNRGNQDEIRELGGIEIILNHCRFDVNNPYIKEWSVFAIRNLCE 557
Query: 450 GNLENKKLVAELEVQGPVNMPEIAELGLQVEVDPKTKAAKLVNASRPFKDN 501
N+EN+ L+ L+V+G N E+ +LGL+V V + K N + K+N
Sbjct: 558 DNVENQNLIESLKVKGVANNDELKDLGLEVGV-TENGTIKFKNVPKKEKEN 607
BLAST of CmoCh02G004750 vs. Swiss-Prot
Match:
ATX10_XENTR (Ataxin-10 OS=Xenopus tropicalis GN=atxn10 PE=2 SV=1)
HSP 1 Score: 83.2 bits (204), Expect = 8.9e-15
Identity = 33/90 (36.67%), Postives = 59/90 (65.56%), Query Frame = 1
Query: 391 GFRRDIVAVIANCLYRKKHVQDDIRKKNGVFVLLQQCVVDENNPFLREWGIWAVRNLLEG 450
GF+ ++ +I N Y+ K Q+ + + +G+ ++L C +D+NNPFL +W ++A+RNL E
Sbjct: 379 GFKAHLIRLIGNLCYQNKENQEKVYQLDGIALILDNCSIDDNNPFLNQWAVFAIRNLTEN 438
Query: 451 NLENKKLVAELEVQGPVNMPEIAELGLQVE 481
N +N++L+A +E QG + + +GLQ E
Sbjct: 439 NDKNQELIASMERQGLADSSLLKSMGLQAE 468
BLAST of CmoCh02G004750 vs. TrEMBL
Match:
A0A0A0LFC4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G913990 PE=4 SV=1)
HSP 1 Score: 749.2 bits (1933), Expect = 3.2e-213
Identity = 389/505 (77.03%), Postives = 434/505 (85.94%), Query Frame = 1
Query: 1 MKNSASFEQSIPERIIQPLLSASNSCTLEASLEALIEASKSVEGRSNFASQNILPCVLEL 60
MKNS+ FE SIPERI Q L AS+S TLEASLE LIEAS+S EGRSN ASQNILPCVLEL
Sbjct: 1 MKNSSPFELSIPERISQQLFLASSSNTLEASLETLIEASRSSEGRSNLASQNILPCVLEL 60
Query: 61 IQCLDYTSNNALQLSSLRLLRNLCAGEIRNQNVFIEQNGVGVVLSILQNAMLLFDPDRVI 120
IQCL YTS + L LSSL+LLRNLCAGEIRNQN+FIEQNGV VV ILQ+AML+ DPDRV
Sbjct: 61 IQCLIYTSGDVLLLSSLKLLRNLCAGEIRNQNIFIEQNGVRVVSKILQDAMLINDPDRVT 120
Query: 121 IRLGLQVLANVSLAGEEHQQAIWHGLFPDKFVSLARIRYCEISDPLSMILYNLCSTNSEL 180
IRLGLQVLANVSLAGEEHQQAIWH LFPD F+ LAR+ +CEISDPL MI+YNLCS +SEL
Sbjct: 121 IRLGLQVLANVSLAGEEHQQAIWHELFPDNFLLLARLPFCEISDPLCMIIYNLCSGHSEL 180
Query: 181 VASLCSDVGLPILEEITRTTTLVGFKEDWVKLLLSRICLEEPYFPRLFSALRPIDTSKDG 240
VASLC D+GLPI+EEI RT + VGF EDWVKLLLSRICLEE YFP LFS LRPIDT KD
Sbjct: 181 VASLCGDLGLPIIEEIVRTVSSVGFVEDWVKLLLSRICLEELYFPMLFSGLRPIDTYKDS 240
Query: 241 ----GKDMSFSSEQAFLLTIISEILNERIGDISIPKDFASCIHRIFQSSIPIISSTPICE 300
+D+SFSSEQA+LLT+ISEILNE+IGDI +PKDFASC++RIFQSSI II STP+ +
Sbjct: 241 NIAESRDISFSSEQAYLLTVISEILNEQIGDIVVPKDFASCVYRIFQSSISIIDSTPVSK 300
Query: 301 RSLPTGTTAVDVLGYSLNILRDICAQEDGKEGGHKDVSKDAVDVLLSLGLIDLLLGILRD 360
LPTG A DV+GYSL ILRDICAQ+ K G KDV +DAVDVLLSLGLIDLLL IL D
Sbjct: 301 SGLPTGRIAGDVVGYSLTILRDICAQDSNK--GDKDVYEDAVDVLLSLGLIDLLLSILHD 360
Query: 361 IEPPAIVKKAIQQAEN-ENRTDLPNTSKSCPCPYKGFRRDIVAVIANCLYRKKHVQDDIR 420
IEPPAI+KKA+QQ EN E+ T LPN K PCPYKGFRRDIVAVIANCLYR+KHVQDDIR
Sbjct: 361 IEPPAILKKALQQVENEEDGTSLPNAVK--PCPYKGFRRDIVAVIANCLYRRKHVQDDIR 420
Query: 421 KKNGVFVLLQQCVVDENNPFLREWGIWAVRNLLEGNLENKKLVAELEVQGPVNMPEIAEL 480
+KNGVFVLLQQCV D+NNPFLREWGIWAVRNLLEGNLEN++LV+ELEVQG ++PEIAEL
Sbjct: 421 QKNGVFVLLQQCVADKNNPFLREWGIWAVRNLLEGNLENQRLVSELEVQGSAHVPEIAEL 480
Query: 481 GLQVEVDPKTKAAKLVNASRPFKDN 501
GL+VEVD KT+ AKLVNASRPF+++
Sbjct: 481 GLRVEVDAKTRRAKLVNASRPFQNS 501
BLAST of CmoCh02G004750 vs. TrEMBL
Match:
M5XFG6_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004765mg PE=4 SV=1)
HSP 1 Score: 528.1 bits (1359), Expect = 1.2e-146
Identity = 272/497 (54.73%), Postives = 363/497 (73.04%), Query Frame = 1
Query: 1 MKNSASFEQSIPERIIQPLLSASNSCTLEASLEALIEASKSVEGRSNFASQNILPCVLEL 60
M +A E +PE ++Q LLSASNS TL SLE LI+ ++ +GR++ AS++ILP V++L
Sbjct: 1 MDKTALQEFFVPEDVLQILLSASNSSTLIDSLETLIQVCRAADGRADLASKSILPSVVQL 60
Query: 61 IQCLDYTSNNALQLSSLRLLRNLCAGEIRNQNVFIEQNGVGVVLSILQNAMLLFDPDRVI 120
IQ L Y S L SL+LLRNLCAGE+ NQ F+EQ+GV ++ ++L +A + +PD +
Sbjct: 61 IQSLPYPSGRHLLTLSLKLLRNLCAGEVSNQKSFLEQSGVAIISNVLNSANISLEPDSGV 120
Query: 121 IRLGLQVLANVSLAGEEHQQAIWHGLFPDKFVSLARIRYCEISDPLSMILYNLCSTNSEL 180
IR+GLQVLANVSLAGE HQ IW LFP +F++LAR++ E DPL M+++ C + EL
Sbjct: 121 IRMGLQVLANVSLAGERHQHEIWQQLFPKEFLALARVQSRETCDPLCMVIFACCDGSPEL 180
Query: 181 VASLCSDVGLPILEEITRTTTLVGFKEDWVKLLLSRICLEEPYFPRLFSAL---RPIDTS 240
LC D G+ I++EI RTT VGF EDWVKLLLSRICLE PYF LFS L +
Sbjct: 181 FEKLCGDGGITIMKEIVRTTAAVGFGEDWVKLLLSRICLEGPYFSSLFSNLGFATSENVE 240
Query: 241 KDGGKDMSFSSEQAFLLTIISEILNERIGDISIPKDFASCIHRIFQSSIPIISSTPICER 300
++ FSS+QAF L IIS+ILNER+ +I++P+DFA C+ IF+ S+ ++ +
Sbjct: 241 DTEFREDLFSSDQAFFLRIISDILNERLREITVPRDFALCVFGIFKKSVGALNCVTRGQS 300
Query: 301 SLPTGTTAVDVLGYSLNILRDICAQEDGKEGGHKDVSKDAVDVLLSLGLIDLLLGILRDI 360
LPTGT+ +DVLGYSL ILRD+CAQ+ + G ++ DAVDVLLS GLI+L+L +LRD+
Sbjct: 301 GLPTGTSMIDVLGYSLTILRDVCAQKTLR--GFQEDLGDAVDVLLSHGLIELILCLLRDL 360
Query: 361 EPPAIVKKAIQQAENENRTDLPNTSKSCPCPYKGFRRDIVAVIANCLYRKKHVQDDIRKK 420
EPPAI++KAI+Q E ++ T N+ S PCPYKGFRRDIVAVI NC Y++K VQD+IR++
Sbjct: 361 EPPAIIRKAIKQGEGQDGT---NSGSSKPCPYKGFRRDIVAVIGNCTYQRKPVQDEIRQR 420
Query: 421 NGVFVLLQQCVVDENNPFLREWGIWAVRNLLEGNLENKKLVAELEVQGPVNMPEIAELGL 480
+G+ +LLQQC +DE+NPFL+EWGIW VRNLLEGN +NK++V ELE+QG V+ PEIA LG
Sbjct: 421 DGILLLLQQCGLDEDNPFLKEWGIWCVRNLLEGNEDNKRVVTELELQGSVDAPEIAGLGF 480
Query: 481 QVEVDPKTKAAKLVNAS 495
+VEV+P+T KLVN S
Sbjct: 481 RVEVNPETGRPKLVNVS 492
BLAST of CmoCh02G004750 vs. TrEMBL
Match:
A0A061FBW8_THECC (ARM repeat superfamily protein, putative isoform 2 OS=Theobroma cacao GN=TCM_033454 PE=4 SV=1)
HSP 1 Score: 521.9 bits (1343), Expect = 8.4e-145
Identity = 270/481 (56.13%), Postives = 355/481 (73.80%), Query Frame = 1
Query: 13 ERIIQPLLSASNSCTLEASLEALIEASKSVEGRSNFASQNILPCVLELIQCLDYTSNNAL 72
E ++QPLLSASNS +L+ +LE LI+ S++ R+ A +NILP VL+L++ TS+
Sbjct: 13 EGVLQPLLSASNSSSLKEALEILIKVSRTAAARAELALRNILPTVLKLVESFHQTSSREY 72
Query: 73 QLSSLRLLRNLCAGEIRNQNVFIEQNGVGVVLSILQNAMLLFDPDRVIIRLGLQVLANVS 132
++SL+LLRNLCAGE+ NQN F EQNGV VVLS+L++A LL +PD +IR+ LQVLANVS
Sbjct: 73 LVNSLKLLRNLCAGEVANQNAFFEQNGVEVVLSVLRSAALLSNPDSGVIRVSLQVLANVS 132
Query: 133 LAGEEHQQAIWHGLFPDKFVSLARIRYCEISDPLSMILYNLCSTNSELVASLCSDVGLPI 192
LAGE+HQQAIW FP++F LAR+R E +DPL MILY C LVA LC D+GLPI
Sbjct: 133 LAGEDHQQAIWLKFFPNEFSVLARVRSQETNDPLCMILYTCCDRRPGLVAELCRDMGLPI 192
Query: 193 LEEITRTTTLVGFKEDWVKLLLSRICLEEPYFPRLFSALRPIDTSKDGGK----DMSFSS 252
+ I RT VGF EDW KLLLSR+CLE+ +FP +FS +S++ G D F S
Sbjct: 193 VVGIIRTVASVGFGEDWFKLLLSRLCLEDIHFPLVFSKSCEGSSSENSGNTDSGDDLFLS 252
Query: 253 EQAFLLTIISEILNERIGDISIPKDFASCIHRIFQSSIPIISSTPICERSLPTGTTAVDV 312
EQAFLL IISEILNERI +I + +FA C+ IF+ S+ ++ SLPTG T++DV
Sbjct: 253 EQAFLLRIISEILNERIEEIQVSSEFALCVLGIFKRSVRVVDFASRGMSSLPTGCTSIDV 312
Query: 313 LGYSLNILRDICAQEDGKEGGHKDVSKDAVDVLLSLGLIDLLLGILRDIEPPAIVKKAIQ 372
+GYSL ILRDICA+E G K+ S D VD+LLS LID+LL +LRD++PPAI++K ++
Sbjct: 313 MGYSLIILRDICAREG--VGDLKNDSLDVVDMLLSHELIDILLSLLRDLDPPAIIRKVLK 372
Query: 373 QAENENRTDLPNTSKSCPCPYKGFRRDIVAVIANCLYRKKHVQDDIRKKNGVFVLLQQCV 432
+ +N+ N S S CPYKGFRRD++AVI NC YR+KHVQD+IR+KNG+ +LLQQCV
Sbjct: 373 EGDNQGL----NLSASKLCPYKGFRRDMIAVIGNCAYRRKHVQDEIRQKNGILLLLQQCV 432
Query: 433 VDENNPFLREWGIWAVRNLLEGNLENKKLVAELEVQGPVNMPEIAELGLQVEVDPKTKAA 490
D++NP+LREWGIW++RNLLEG+ EN++ VA+LE+QG V+MPE++ LGL+VEVD KT+ A
Sbjct: 433 TDDDNPYLREWGIWSLRNLLEGHAENQQAVADLELQGSVDMPELSRLGLRVEVDQKTRRA 487
BLAST of CmoCh02G004750 vs. TrEMBL
Match:
A0A061FA36_THECC (ARM repeat superfamily protein, putative isoform 4 OS=Theobroma cacao GN=TCM_033454 PE=4 SV=1)
HSP 1 Score: 521.9 bits (1343), Expect = 8.4e-145
Identity = 270/481 (56.13%), Postives = 355/481 (73.80%), Query Frame = 1
Query: 13 ERIIQPLLSASNSCTLEASLEALIEASKSVEGRSNFASQNILPCVLELIQCLDYTSNNAL 72
E ++QPLLSASNS +L+ +LE LI+ S++ R+ A +NILP VL+L++ TS+
Sbjct: 25 EGVLQPLLSASNSSSLKEALEILIKVSRTAAARAELALRNILPTVLKLVESFHQTSSREY 84
Query: 73 QLSSLRLLRNLCAGEIRNQNVFIEQNGVGVVLSILQNAMLLFDPDRVIIRLGLQVLANVS 132
++SL+LLRNLCAGE+ NQN F EQNGV VVLS+L++A LL +PD +IR+ LQVLANVS
Sbjct: 85 LVNSLKLLRNLCAGEVANQNAFFEQNGVEVVLSVLRSAALLSNPDSGVIRVSLQVLANVS 144
Query: 133 LAGEEHQQAIWHGLFPDKFVSLARIRYCEISDPLSMILYNLCSTNSELVASLCSDVGLPI 192
LAGE+HQQAIW FP++F LAR+R E +DPL MILY C LVA LC D+GLPI
Sbjct: 145 LAGEDHQQAIWLKFFPNEFSVLARVRSQETNDPLCMILYTCCDRRPGLVAELCRDMGLPI 204
Query: 193 LEEITRTTTLVGFKEDWVKLLLSRICLEEPYFPRLFSALRPIDTSKDGGK----DMSFSS 252
+ I RT VGF EDW KLLLSR+CLE+ +FP +FS +S++ G D F S
Sbjct: 205 VVGIIRTVASVGFGEDWFKLLLSRLCLEDIHFPLVFSKSCEGSSSENSGNTDSGDDLFLS 264
Query: 253 EQAFLLTIISEILNERIGDISIPKDFASCIHRIFQSSIPIISSTPICERSLPTGTTAVDV 312
EQAFLL IISEILNERI +I + +FA C+ IF+ S+ ++ SLPTG T++DV
Sbjct: 265 EQAFLLRIISEILNERIEEIQVSSEFALCVLGIFKRSVRVVDFASRGMSSLPTGCTSIDV 324
Query: 313 LGYSLNILRDICAQEDGKEGGHKDVSKDAVDVLLSLGLIDLLLGILRDIEPPAIVKKAIQ 372
+GYSL ILRDICA+E G K+ S D VD+LLS LID+LL +LRD++PPAI++K ++
Sbjct: 325 MGYSLIILRDICAREG--VGDLKNDSLDVVDMLLSHELIDILLSLLRDLDPPAIIRKVLK 384
Query: 373 QAENENRTDLPNTSKSCPCPYKGFRRDIVAVIANCLYRKKHVQDDIRKKNGVFVLLQQCV 432
+ +N+ N S S CPYKGFRRD++AVI NC YR+KHVQD+IR+KNG+ +LLQQCV
Sbjct: 385 EGDNQGL----NLSASKLCPYKGFRRDMIAVIGNCAYRRKHVQDEIRQKNGILLLLQQCV 444
Query: 433 VDENNPFLREWGIWAVRNLLEGNLENKKLVAELEVQGPVNMPEIAELGLQVEVDPKTKAA 490
D++NP+LREWGIW++RNLLEG+ EN++ VA+LE+QG V+MPE++ LGL+VEVD KT+ A
Sbjct: 445 TDDDNPYLREWGIWSLRNLLEGHAENQQAVADLELQGSVDMPELSRLGLRVEVDQKTRRA 499
BLAST of CmoCh02G004750 vs. TrEMBL
Match:
A0A061FB09_THECC (ARM repeat superfamily protein, putative isoform 5 OS=Theobroma cacao GN=TCM_033454 PE=4 SV=1)
HSP 1 Score: 521.9 bits (1343), Expect = 8.4e-145
Identity = 270/481 (56.13%), Postives = 355/481 (73.80%), Query Frame = 1
Query: 13 ERIIQPLLSASNSCTLEASLEALIEASKSVEGRSNFASQNILPCVLELIQCLDYTSNNAL 72
E ++QPLLSASNS +L+ +LE LI+ S++ R+ A +NILP VL+L++ TS+
Sbjct: 13 EGVLQPLLSASNSSSLKEALEILIKVSRTAAARAELALRNILPTVLKLVESFHQTSSREY 72
Query: 73 QLSSLRLLRNLCAGEIRNQNVFIEQNGVGVVLSILQNAMLLFDPDRVIIRLGLQVLANVS 132
++SL+LLRNLCAGE+ NQN F EQNGV VVLS+L++A LL +PD +IR+ LQVLANVS
Sbjct: 73 LVNSLKLLRNLCAGEVANQNAFFEQNGVEVVLSVLRSAALLSNPDSGVIRVSLQVLANVS 132
Query: 133 LAGEEHQQAIWHGLFPDKFVSLARIRYCEISDPLSMILYNLCSTNSELVASLCSDVGLPI 192
LAGE+HQQAIW FP++F LAR+R E +DPL MILY C LVA LC D+GLPI
Sbjct: 133 LAGEDHQQAIWLKFFPNEFSVLARVRSQETNDPLCMILYTCCDRRPGLVAELCRDMGLPI 192
Query: 193 LEEITRTTTLVGFKEDWVKLLLSRICLEEPYFPRLFSALRPIDTSKDGGK----DMSFSS 252
+ I RT VGF EDW KLLLSR+CLE+ +FP +FS +S++ G D F S
Sbjct: 193 VVGIIRTVASVGFGEDWFKLLLSRLCLEDIHFPLVFSKSCEGSSSENSGNTDSGDDLFLS 252
Query: 253 EQAFLLTIISEILNERIGDISIPKDFASCIHRIFQSSIPIISSTPICERSLPTGTTAVDV 312
EQAFLL IISEILNERI +I + +FA C+ IF+ S+ ++ SLPTG T++DV
Sbjct: 253 EQAFLLRIISEILNERIEEIQVSSEFALCVLGIFKRSVRVVDFASRGMSSLPTGCTSIDV 312
Query: 313 LGYSLNILRDICAQEDGKEGGHKDVSKDAVDVLLSLGLIDLLLGILRDIEPPAIVKKAIQ 372
+GYSL ILRDICA+E G K+ S D VD+LLS LID+LL +LRD++PPAI++K ++
Sbjct: 313 MGYSLIILRDICAREG--VGDLKNDSLDVVDMLLSHELIDILLSLLRDLDPPAIIRKVLK 372
Query: 373 QAENENRTDLPNTSKSCPCPYKGFRRDIVAVIANCLYRKKHVQDDIRKKNGVFVLLQQCV 432
+ +N+ N S S CPYKGFRRD++AVI NC YR+KHVQD+IR+KNG+ +LLQQCV
Sbjct: 373 EGDNQGL----NLSASKLCPYKGFRRDMIAVIGNCAYRRKHVQDEIRQKNGILLLLQQCV 432
Query: 433 VDENNPFLREWGIWAVRNLLEGNLENKKLVAELEVQGPVNMPEIAELGLQVEVDPKTKAA 490
D++NP+LREWGIW++RNLLEG+ EN++ VA+LE+QG V+MPE++ LGL+VEVD KT+ A
Sbjct: 433 TDDDNPYLREWGIWSLRNLLEGHAENQQAVADLELQGSVDMPELSRLGLRVEVDQKTRRA 487
BLAST of CmoCh02G004750 vs. TAIR10
Match:
AT4G00231.1 (AT4G00231.1 ARM repeat superfamily protein)
HSP 1 Score: 459.9 bits (1182), Expect = 2.0e-129
Identity = 243/494 (49.19%), Postives = 344/494 (69.64%), Query Frame = 1
Query: 8 EQSIPERIIQPLLSASN-SCTLEASLEALIEASKSVEGRSNFASQNILPCVLELIQCLDY 67
E S+PE ++QPLL AS+ S +LE L+ L+E+SK+ GRS+ AS++ILP +L L+Q L Y
Sbjct: 2 EASLPEEVLQPLLHASDLSYSLEDCLKFLLESSKTDSGRSDLASKSILPSILRLLQLLPY 61
Query: 68 TSNNALQLSSLRLLRNLCAGEIRNQNVFIEQNGVGVVLSILQNAMLLFDPDRVIIRLGLQ 127
S+ SL++LRNLCAGE+ NQN F++ +G +V +L +A+ F+ +R GLQ
Sbjct: 62 PSSRHYLNLSLKVLRNLCAGEVSNQNSFVDHDGSAIVSDLLDSAIADFET----VRFGLQ 121
Query: 128 VLANVSLAGEEHQQAIWHGLFPDKFVSLARIRYCEISDPLSMILYNLCSTNSELVASLCS 187
VLANV L GE+ Q+ +W +P++F+S+A+IR E DPL MILY +SE+ + LCS
Sbjct: 122 VLANVVLFGEKRQRDVWLRFYPERFLSIAKIRKRETFDPLCMILYTCVDGSSEIASELCS 181
Query: 188 DVGLPILEEITRTTTLVGFKED-WVKLLLSRICLEEPYFPRLFSALRPIDTSKDGGKDMS 247
GL I+ E RT++ VG ED W+KLL+SRIC+E+ YF +LFS L + ++
Sbjct: 182 CQGLTIIAETLRTSSSVGSVEDYWLKLLVSRICVEDGYFLKLFSKLY------EDAENEI 241
Query: 248 FSSEQAFLLTIISEILNERIGDISIPKDFASCIHRIFQSSIPIISSTPICERSLPTGTTA 307
FSSEQAFL+ ++S+I NERIG +SIPKD A I +F+ S+ + LPTG+T
Sbjct: 242 FSSEQAFLVRMVSDIANERIGKVSIPKDTACSILGLFRQSVDVFDFVSGERSELPTGSTI 301
Query: 308 VDVLGYSLNILRDICA-------QEDGKEGGHKDVSKDAVDVLLSLGLIDLLLGILRDIE 367
VDV+GYSL I+RD CA +ED K+ G D V++LLS GLI+LLL +L ++
Sbjct: 302 VDVMGYSLVIIRDACAGGRLEELKEDNKDSG------DTVELLLSSGLIELLLDLLSKLD 361
Query: 368 PPAIVKKAIQQAENENRTDLPNTSKSCPCPYKGFRRDIVAVIANCLYRKKHVQDDIRKKN 427
PP +KKA+ Q+ + + + L PCPY+GFRRDIV+VI NC YR+K VQD+IR+++
Sbjct: 362 PPTTIKKALNQSPSSSSSSLK------PCPYRGFRRDIVSVIGNCAYRRKEVQDEIRERD 421
Query: 428 GVFVLLQQCVVDENNPFLREWGIWAVRNLLEGNLENKKLVAELEVQGPVNMPEIAELGLQ 487
G+F++LQQCV D+ NPFLREWG+W +RNLLEGN EN+++VAELE++G V++P++ E+GL+
Sbjct: 422 GLFLMLQQCVTDDENPFLREWGLWCIRNLLEGNPENQEVVAELEIKGSVDVPQLREIGLR 473
Query: 488 VEVDPKTKAAKLVN 493
VE+DPKT KLVN
Sbjct: 482 VEIDPKTARPKLVN 473
BLAST of CmoCh02G004750 vs. NCBI nr
Match:
gi|659131835|ref|XP_008465880.1| (PREDICTED: ataxin-10 [Cucumis melo])
HSP 1 Score: 773.1 bits (1995), Expect = 3.0e-220
Identity = 396/505 (78.42%), Postives = 442/505 (87.52%), Query Frame = 1
Query: 1 MKNSASFEQSIPERIIQPLLSASNSCTLEASLEALIEASKSVEGRSNFASQNILPCVLEL 60
MKNS+ FE SIP+RIIQPL ASNS TLEASLE LIEASKS EGRSN ASQNILPCVLEL
Sbjct: 1 MKNSSPFELSIPKRIIQPLFLASNSNTLEASLETLIEASKSSEGRSNLASQNILPCVLEL 60
Query: 61 IQCLDYTSNNALQLSSLRLLRNLCAGEIRNQNVFIEQNGVGVVLSILQNAMLLFDPDRVI 120
IQC+ YTS + L LSSL+LLRNLCAGEIRNQN+FIEQNGVGVV +LQ+AM++ DPDRV
Sbjct: 61 IQCVVYTSGDVLLLSSLKLLRNLCAGEIRNQNIFIEQNGVGVVSKVLQDAMVMNDPDRVT 120
Query: 121 IRLGLQVLANVSLAGEEHQQAIWHGLFPDKFVSLARIRYCEISDPLSMILYNLCSTNSEL 180
IRLGLQVLANVSLAGE+HQQAIWHGLFPDKF+ LAR+ +CEISDPLSMILYN+CS +SEL
Sbjct: 121 IRLGLQVLANVSLAGEKHQQAIWHGLFPDKFLLLARLPFCEISDPLSMILYNICSGHSEL 180
Query: 181 VASLCSDVGLPILEEITRTTTLVGFKEDWVKLLLSRICLEEPYFPRLFSALRPIDTSKDG 240
VASLC D+GLPI+EEI RT + VGF EDWVKLLLSRICLEEPYFP LFS LRPIDT KD
Sbjct: 181 VASLCGDIGLPIIEEIVRTVSSVGFVEDWVKLLLSRICLEEPYFPMLFSQLRPIDTYKDS 240
Query: 241 GK----DMSFSSEQAFLLTIISEILNERIGDISIPKDFASCIHRIFQSSIPIISSTPICE 300
K D+SFSSEQA+LLT++SEILNE+IGDI +PKDFA C++R FQSSI II STP+ +
Sbjct: 241 NKAESRDVSFSSEQAYLLTVVSEILNEQIGDIVVPKDFAMCVYRTFQSSISIIDSTPVSK 300
Query: 301 RSLPTGTTAVDVLGYSLNILRDICAQEDGKEGGHKDVSKDAVDVLLSLGLIDLLLGILRD 360
SLPTGT A DVLGYSL ILRDICAQ+ K G KD+ +DAVDVLLSLGLIDLLL IL D
Sbjct: 301 CSLPTGTIAGDVLGYSLTILRDICAQDSSK--GDKDIYEDAVDVLLSLGLIDLLLSILHD 360
Query: 361 IEPPAIVKKAIQQAEN-ENRTDLPNTSKSCPCPYKGFRRDIVAVIANCLYRKKHVQDDIR 420
IEPPAI+KKA+QQ EN E+RT LP KS CPYKGFRRDIVAVIANCLYR+KHVQDDIR
Sbjct: 361 IEPPAILKKALQQVENEEDRTSLPKALKS--CPYKGFRRDIVAVIANCLYRRKHVQDDIR 420
Query: 421 KKNGVFVLLQQCVVDENNPFLREWGIWAVRNLLEGNLENKKLVAELEVQGPVNMPEIAEL 480
+KNGVFVLLQQCV DENNPFLREWGIWAVRNLLEGNLENK+LV+ELEVQG ++PEIAEL
Sbjct: 421 QKNGVFVLLQQCVADENNPFLREWGIWAVRNLLEGNLENKRLVSELEVQGSAHVPEIAEL 480
Query: 481 GLQVEVDPKTKAAKLVNASRPFKDN 501
GL+VEVDPKT+ AKLVN+SRPF+D+
Sbjct: 481 GLRVEVDPKTRRAKLVNSSRPFQDS 501
BLAST of CmoCh02G004750 vs. NCBI nr
Match:
gi|778688201|ref|XP_011652695.1| (PREDICTED: ataxin-10 homolog [Cucumis sativus])
HSP 1 Score: 749.2 bits (1933), Expect = 4.6e-213
Identity = 389/505 (77.03%), Postives = 434/505 (85.94%), Query Frame = 1
Query: 1 MKNSASFEQSIPERIIQPLLSASNSCTLEASLEALIEASKSVEGRSNFASQNILPCVLEL 60
MKNS+ FE SIPERI Q L AS+S TLEASLE LIEAS+S EGRSN ASQNILPCVLEL
Sbjct: 1 MKNSSPFELSIPERISQQLFLASSSNTLEASLETLIEASRSSEGRSNLASQNILPCVLEL 60
Query: 61 IQCLDYTSNNALQLSSLRLLRNLCAGEIRNQNVFIEQNGVGVVLSILQNAMLLFDPDRVI 120
IQCL YTS + L LSSL+LLRNLCAGEIRNQN+FIEQNGV VV ILQ+AML+ DPDRV
Sbjct: 61 IQCLIYTSGDVLLLSSLKLLRNLCAGEIRNQNIFIEQNGVRVVSKILQDAMLINDPDRVT 120
Query: 121 IRLGLQVLANVSLAGEEHQQAIWHGLFPDKFVSLARIRYCEISDPLSMILYNLCSTNSEL 180
IRLGLQVLANVSLAGEEHQQAIWH LFPD F+ LAR+ +CEISDPL MI+YNLCS +SEL
Sbjct: 121 IRLGLQVLANVSLAGEEHQQAIWHELFPDNFLLLARLPFCEISDPLCMIIYNLCSGHSEL 180
Query: 181 VASLCSDVGLPILEEITRTTTLVGFKEDWVKLLLSRICLEEPYFPRLFSALRPIDTSKDG 240
VASLC D+GLPI+EEI RT + VGF EDWVKLLLSRICLEE YFP LFS LRPIDT KD
Sbjct: 181 VASLCGDLGLPIIEEIVRTVSSVGFVEDWVKLLLSRICLEELYFPMLFSGLRPIDTYKDS 240
Query: 241 ----GKDMSFSSEQAFLLTIISEILNERIGDISIPKDFASCIHRIFQSSIPIISSTPICE 300
+D+SFSSEQA+LLT+ISEILNE+IGDI +PKDFASC++RIFQSSI II STP+ +
Sbjct: 241 NIAESRDISFSSEQAYLLTVISEILNEQIGDIVVPKDFASCVYRIFQSSISIIDSTPVSK 300
Query: 301 RSLPTGTTAVDVLGYSLNILRDICAQEDGKEGGHKDVSKDAVDVLLSLGLIDLLLGILRD 360
LPTG A DV+GYSL ILRDICAQ+ K G KDV +DAVDVLLSLGLIDLLL IL D
Sbjct: 301 SGLPTGRIAGDVVGYSLTILRDICAQDSNK--GDKDVYEDAVDVLLSLGLIDLLLSILHD 360
Query: 361 IEPPAIVKKAIQQAEN-ENRTDLPNTSKSCPCPYKGFRRDIVAVIANCLYRKKHVQDDIR 420
IEPPAI+KKA+QQ EN E+ T LPN K PCPYKGFRRDIVAVIANCLYR+KHVQDDIR
Sbjct: 361 IEPPAILKKALQQVENEEDGTSLPNAVK--PCPYKGFRRDIVAVIANCLYRRKHVQDDIR 420
Query: 421 KKNGVFVLLQQCVVDENNPFLREWGIWAVRNLLEGNLENKKLVAELEVQGPVNMPEIAEL 480
+KNGVFVLLQQCV D+NNPFLREWGIWAVRNLLEGNLEN++LV+ELEVQG ++PEIAEL
Sbjct: 421 QKNGVFVLLQQCVADKNNPFLREWGIWAVRNLLEGNLENQRLVSELEVQGSAHVPEIAEL 480
Query: 481 GLQVEVDPKTKAAKLVNASRPFKDN 501
GL+VEVD KT+ AKLVNASRPF+++
Sbjct: 481 GLRVEVDAKTRRAKLVNASRPFQNS 501
BLAST of CmoCh02G004750 vs. NCBI nr
Match:
gi|645254021|ref|XP_008232844.1| (PREDICTED: ataxin-10 [Prunus mume])
HSP 1 Score: 532.7 bits (1371), Expect = 6.8e-148
Identity = 276/498 (55.42%), Postives = 366/498 (73.49%), Query Frame = 1
Query: 1 MKNSASFEQSIPERIIQPLLSASNSCTLEASLEALIEASKSVEGRSNFASQNILPCVLEL 60
M N+A E +PE ++Q LSASNS TL SLE LI+ ++ +GR++ AS+++LP V++L
Sbjct: 1 MDNTALQEFFVPEDVLQIFLSASNSSTLVDSLETLIQVCRTADGRADLASKSVLPSVVQL 60
Query: 61 IQCLDYTSNNALQLSSLRLLRNLCAGEIRNQNVFIEQNGVGVVLSILQNAMLLFDPDRVI 120
IQ L Y S L SL+LLRNLCAGE NQ F+EQ+GV ++ ++L +A L +PD I
Sbjct: 61 IQSLPYPSGRHLLTLSLKLLRNLCAGEGSNQKSFLEQSGVAIISNVLNSANLSLEPDSGI 120
Query: 121 IRLGLQVLANVSLAGEEHQQAIWHGLFPDKFVSLARIRYCEISDPLSMILYNLCSTNSEL 180
IR+GLQVLANVSLAGE HQ AIW LFP +F++LAR++ E DPL M+++ C + EL
Sbjct: 121 IRMGLQVLANVSLAGERHQHAIWQQLFPKEFLALARVQSRETCDPLCMVIFACCDGSPEL 180
Query: 181 VASLCSDVGLPILEEITRTTTLVGFKEDWVKLLLSRICLEEPYFPRLFSALRPIDTSKD- 240
LC D G+ I++EI RTT VGF EDW KLLLSRICLE PYF LFS L + T+++
Sbjct: 181 FEKLCGDGGITIMKEIVRTTAAVGFGEDWFKLLLSRICLEGPYFSSLFSNLGFVSTTENV 240
Query: 241 ---GGKDMSFSSEQAFLLTIISEILNERIGDISIPKDFASCIHRIFQSSIPIISSTPICE 300
++ FSSEQAF L IIS+ILNER+ +I++P DFA C+ IF+ S+ +++ +
Sbjct: 241 EDTEFREDLFSSEQAFFLRIISDILNERLREITVPSDFALCVFGIFKKSVGVLNCVTRGQ 300
Query: 301 RSLPTGTTAVDVLGYSLNILRDICAQEDGKEGGHKDVSKDAVDVLLSLGLIDLLLGILRD 360
LPTG++ +DVLGYSL ILRD CAQ+ + G ++ DAVDVLLS GLI+L+L +LRD
Sbjct: 301 SGLPTGSSMIDVLGYSLTILRDACAQKTLR--GFQEDLGDAVDVLLSHGLIELILCLLRD 360
Query: 361 IEPPAIVKKAIQQAENENRTDLPNTSKSCPCPYKGFRRDIVAVIANCLYRKKHVQDDIRK 420
+EPPAI++KAI+Q E ++ T N+ S PCPYKGFRRDIVAVI NC Y++K VQD+IR+
Sbjct: 361 LEPPAIIRKAIKQGEGQDGT---NSGSSKPCPYKGFRRDIVAVIGNCTYQRKPVQDEIRQ 420
Query: 421 KNGVFVLLQQCVVDENNPFLREWGIWAVRNLLEGNLENKKLVAELEVQGPVNMPEIAELG 480
K+G+ +LLQQC +DE+NPFL+EWGIW VRNLLEGN +NK++V ELE+QG V+ PEIA LG
Sbjct: 421 KDGILLLLQQCGLDEDNPFLKEWGIWCVRNLLEGNEDNKRVVTELELQGSVDAPEIAGLG 480
Query: 481 LQVEVDPKTKAAKLVNAS 495
L+VEV+P+T KLVN S
Sbjct: 481 LRVEVNPETGRPKLVNVS 493
BLAST of CmoCh02G004750 vs. NCBI nr
Match:
gi|596021914|ref|XP_007219054.1| (hypothetical protein PRUPE_ppa004765mg [Prunus persica])
HSP 1 Score: 528.1 bits (1359), Expect = 1.7e-146
Identity = 272/497 (54.73%), Postives = 363/497 (73.04%), Query Frame = 1
Query: 1 MKNSASFEQSIPERIIQPLLSASNSCTLEASLEALIEASKSVEGRSNFASQNILPCVLEL 60
M +A E +PE ++Q LLSASNS TL SLE LI+ ++ +GR++ AS++ILP V++L
Sbjct: 1 MDKTALQEFFVPEDVLQILLSASNSSTLIDSLETLIQVCRAADGRADLASKSILPSVVQL 60
Query: 61 IQCLDYTSNNALQLSSLRLLRNLCAGEIRNQNVFIEQNGVGVVLSILQNAMLLFDPDRVI 120
IQ L Y S L SL+LLRNLCAGE+ NQ F+EQ+GV ++ ++L +A + +PD +
Sbjct: 61 IQSLPYPSGRHLLTLSLKLLRNLCAGEVSNQKSFLEQSGVAIISNVLNSANISLEPDSGV 120
Query: 121 IRLGLQVLANVSLAGEEHQQAIWHGLFPDKFVSLARIRYCEISDPLSMILYNLCSTNSEL 180
IR+GLQVLANVSLAGE HQ IW LFP +F++LAR++ E DPL M+++ C + EL
Sbjct: 121 IRMGLQVLANVSLAGERHQHEIWQQLFPKEFLALARVQSRETCDPLCMVIFACCDGSPEL 180
Query: 181 VASLCSDVGLPILEEITRTTTLVGFKEDWVKLLLSRICLEEPYFPRLFSAL---RPIDTS 240
LC D G+ I++EI RTT VGF EDWVKLLLSRICLE PYF LFS L +
Sbjct: 181 FEKLCGDGGITIMKEIVRTTAAVGFGEDWVKLLLSRICLEGPYFSSLFSNLGFATSENVE 240
Query: 241 KDGGKDMSFSSEQAFLLTIISEILNERIGDISIPKDFASCIHRIFQSSIPIISSTPICER 300
++ FSS+QAF L IIS+ILNER+ +I++P+DFA C+ IF+ S+ ++ +
Sbjct: 241 DTEFREDLFSSDQAFFLRIISDILNERLREITVPRDFALCVFGIFKKSVGALNCVTRGQS 300
Query: 301 SLPTGTTAVDVLGYSLNILRDICAQEDGKEGGHKDVSKDAVDVLLSLGLIDLLLGILRDI 360
LPTGT+ +DVLGYSL ILRD+CAQ+ + G ++ DAVDVLLS GLI+L+L +LRD+
Sbjct: 301 GLPTGTSMIDVLGYSLTILRDVCAQKTLR--GFQEDLGDAVDVLLSHGLIELILCLLRDL 360
Query: 361 EPPAIVKKAIQQAENENRTDLPNTSKSCPCPYKGFRRDIVAVIANCLYRKKHVQDDIRKK 420
EPPAI++KAI+Q E ++ T N+ S PCPYKGFRRDIVAVI NC Y++K VQD+IR++
Sbjct: 361 EPPAIIRKAIKQGEGQDGT---NSGSSKPCPYKGFRRDIVAVIGNCTYQRKPVQDEIRQR 420
Query: 421 NGVFVLLQQCVVDENNPFLREWGIWAVRNLLEGNLENKKLVAELEVQGPVNMPEIAELGL 480
+G+ +LLQQC +DE+NPFL+EWGIW VRNLLEGN +NK++V ELE+QG V+ PEIA LG
Sbjct: 421 DGILLLLQQCGLDEDNPFLKEWGIWCVRNLLEGNEDNKRVVTELELQGSVDAPEIAGLGF 480
Query: 481 QVEVDPKTKAAKLVNAS 495
+VEV+P+T KLVN S
Sbjct: 481 RVEVNPETGRPKLVNVS 492
BLAST of CmoCh02G004750 vs. NCBI nr
Match:
gi|590613387|ref|XP_007022650.1| (ARM repeat superfamily protein, putative isoform 4 [Theobroma cacao])
HSP 1 Score: 521.9 bits (1343), Expect = 1.2e-144
Identity = 270/481 (56.13%), Postives = 355/481 (73.80%), Query Frame = 1
Query: 13 ERIIQPLLSASNSCTLEASLEALIEASKSVEGRSNFASQNILPCVLELIQCLDYTSNNAL 72
E ++QPLLSASNS +L+ +LE LI+ S++ R+ A +NILP VL+L++ TS+
Sbjct: 25 EGVLQPLLSASNSSSLKEALEILIKVSRTAAARAELALRNILPTVLKLVESFHQTSSREY 84
Query: 73 QLSSLRLLRNLCAGEIRNQNVFIEQNGVGVVLSILQNAMLLFDPDRVIIRLGLQVLANVS 132
++SL+LLRNLCAGE+ NQN F EQNGV VVLS+L++A LL +PD +IR+ LQVLANVS
Sbjct: 85 LVNSLKLLRNLCAGEVANQNAFFEQNGVEVVLSVLRSAALLSNPDSGVIRVSLQVLANVS 144
Query: 133 LAGEEHQQAIWHGLFPDKFVSLARIRYCEISDPLSMILYNLCSTNSELVASLCSDVGLPI 192
LAGE+HQQAIW FP++F LAR+R E +DPL MILY C LVA LC D+GLPI
Sbjct: 145 LAGEDHQQAIWLKFFPNEFSVLARVRSQETNDPLCMILYTCCDRRPGLVAELCRDMGLPI 204
Query: 193 LEEITRTTTLVGFKEDWVKLLLSRICLEEPYFPRLFSALRPIDTSKDGGK----DMSFSS 252
+ I RT VGF EDW KLLLSR+CLE+ +FP +FS +S++ G D F S
Sbjct: 205 VVGIIRTVASVGFGEDWFKLLLSRLCLEDIHFPLVFSKSCEGSSSENSGNTDSGDDLFLS 264
Query: 253 EQAFLLTIISEILNERIGDISIPKDFASCIHRIFQSSIPIISSTPICERSLPTGTTAVDV 312
EQAFLL IISEILNERI +I + +FA C+ IF+ S+ ++ SLPTG T++DV
Sbjct: 265 EQAFLLRIISEILNERIEEIQVSSEFALCVLGIFKRSVRVVDFASRGMSSLPTGCTSIDV 324
Query: 313 LGYSLNILRDICAQEDGKEGGHKDVSKDAVDVLLSLGLIDLLLGILRDIEPPAIVKKAIQ 372
+GYSL ILRDICA+E G K+ S D VD+LLS LID+LL +LRD++PPAI++K ++
Sbjct: 325 MGYSLIILRDICAREG--VGDLKNDSLDVVDMLLSHELIDILLSLLRDLDPPAIIRKVLK 384
Query: 373 QAENENRTDLPNTSKSCPCPYKGFRRDIVAVIANCLYRKKHVQDDIRKKNGVFVLLQQCV 432
+ +N+ N S S CPYKGFRRD++AVI NC YR+KHVQD+IR+KNG+ +LLQQCV
Sbjct: 385 EGDNQGL----NLSASKLCPYKGFRRDMIAVIGNCAYRRKHVQDEIRQKNGILLLLQQCV 444
Query: 433 VDENNPFLREWGIWAVRNLLEGNLENKKLVAELEVQGPVNMPEIAELGLQVEVDPKTKAA 490
D++NP+LREWGIW++RNLLEG+ EN++ VA+LE+QG V+MPE++ LGL+VEVD KT+ A
Sbjct: 445 TDDDNPYLREWGIWSLRNLLEGHAENQQAVADLELQGSVDMPELSRLGLRVEVDQKTRRA 499
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
ATX10_BOVIN | 6.8e-23 | 24.77 | Ataxin-10 OS=Bos taurus GN=ATXN10 PE=2 SV=1 | [more] |
ATX10_RAT | 2.9e-21 | 24.12 | Ataxin-10 OS=Rattus norvegicus GN=Atxn10 PE=1 SV=1 | [more] |
ATX10_MOUSE | 6.4e-21 | 24.94 | Ataxin-10 OS=Mus musculus GN=Atxn10 PE=1 SV=2 | [more] |
ATX10_DICDI | 5.1e-18 | 36.94 | Ataxin-10 homolog OS=Dictyostelium discoideum GN=atxn10 PE=3 SV=1 | [more] |
ATX10_XENTR | 8.9e-15 | 36.67 | Ataxin-10 OS=Xenopus tropicalis GN=atxn10 PE=2 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
A0A0A0LFC4_CUCSA | 3.2e-213 | 77.03 | Uncharacterized protein OS=Cucumis sativus GN=Csa_3G913990 PE=4 SV=1 | [more] |
M5XFG6_PRUPE | 1.2e-146 | 54.73 | Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004765mg PE=4 SV=1 | [more] |
A0A061FBW8_THECC | 8.4e-145 | 56.13 | ARM repeat superfamily protein, putative isoform 2 OS=Theobroma cacao GN=TCM_033... | [more] |
A0A061FA36_THECC | 8.4e-145 | 56.13 | ARM repeat superfamily protein, putative isoform 4 OS=Theobroma cacao GN=TCM_033... | [more] |
A0A061FB09_THECC | 8.4e-145 | 56.13 | ARM repeat superfamily protein, putative isoform 5 OS=Theobroma cacao GN=TCM_033... | [more] |
Match Name | E-value | Identity | Description | |
AT4G00231.1 | 2.0e-129 | 49.19 | ARM repeat superfamily protein | [more] |