BLAST of CmoCh16G002000 vs. Swiss-Prot
Match:
PP443_ARATH (Pentatricopeptide repeat-containing protein At5g62370 OS=Arabidopsis thaliana GN=At5g62370 PE=2 SV=1)
HSP 1 Score: 720.7 bits (1859), Expect = 2.0e-206
Identity = 394/884 (44.57%), Postives = 559/884 (63.24%), Query Frame = 1
Query: 20 TTCTVPLDP-PVTSS---SSSASEHKTLCYSLVEQLIRRGLFLPAQQVIQRIVTQSSSIS 79
TTC + + P TS+ S+++ +H++ C SL+ +L RRGL A++VI+R++ SSSIS
Sbjct: 18 TTCALSSELFPSTSAAVFSAASGDHRSRCLSLIVKLGRRGLLDSAREVIRRVIDGSSSIS 77
Query: 80 EAISIVDFAAERGLELDLDTHGVFWRQLV-YSRPQLAELLYDKKFTFRGAEPDASVLDSM 139
EA + DFA + G+ELD +G R+L +P +AE Y+++ G PD+SVLDSM
Sbjct: 78 EAALVADFAVDNGIELDSSCYGALIRKLTEMGQPGVAETFYNQRVIGNGIVPDSSVLDSM 137
Query: 140 VICFCRLGKFEKALAYFNQLLSLNYVPSKTSFNAIFRELCAQERVLEAFDYFVRVNGGGV 199
V C +L +F++A A+ +++++ Y PS+ S + + ELC Q+R LEAF F +V G
Sbjct: 138 VFCLVKLRRFDEARAHLDRIIASGYAPSRNSSSLVVDELCNQDRFLEAFHCFEQVKERGS 197
Query: 200 HLGYWCFNVLIDGLCNKGHMEEALELFDIMQNTNGYPPSLHLFKSLFYGLCKRKWLVEAE 259
L WC L GLC GH+ EA+ + D + P ++L+KSLFY CKR EAE
Sbjct: 198 GLWLWCCKRLFKGLCGHGHLNEAIGMLDTLCGMTRMPLPVNLYKSLFYCFCKRGCAAEAE 257
Query: 260 LLIREMEFRSLYPDKTMYTSLVHEYCKDKKMKMAMQAFFRMIKIGCEPDNYTLNTLIHGF 319
L ME Y DK MYT L+ EYCKD M MAM+ + RM++ E D NTLIHGF
Sbjct: 258 ALFDHMEVDGYYVDKVMYTCLMKEYCKDNNMTMAMRLYLRMVERSFELDPCIFNTLIHGF 317
Query: 320 VKLGLVDKGWLVYNLMAEWGIQPDVVTFHIMISQYCQEGKVDFALTI-LNNMVSCNFSPS 379
+KLG++DKG ++++ M + G+Q +V T+HIMI YC+EG VD+AL + +NN S + S +
Sbjct: 318 MKLGMLDKGRVMFSQMIKKGVQSNVFTYHIMIGSYCKEGNVDYALRLFVNNTGSEDISRN 377
Query: 380 LHCYTVLINALHRDDRLEEVSELLRSILDNGIVPDHVLFFTLMKMYPKGHELQLALNFLE 439
+HCYT LI ++ +++ +LL +LDNGIVPDH+ +F L+KM PK HEL+ A+ L+
Sbjct: 378 VHCYTNLIFGFYKKGGMDKAVDLLMRMLDNGIVPDHITYFVLLKMLPKCHELKYAMVILQ 437
Query: 440 AILKNGCGCDPSVILASTKLQTSSNLEQKIETLLQEIFNSNLNLAGVAFSIVICALCETE 499
+IL NGCG +P VI N+E K+E+LL EI + NLA V ++V ALC
Sbjct: 438 SILDNGCGINPPVI------DDLGNIEVKVESLLGEIARKDANLAAVGLAVVTTALCSQR 497
Query: 500 NLDCALDYFHKMASLGCKPLLFTYNSLIKCLCKEGLFEDALSLIDHMQECSLLPDTTTYL 559
N AL KM +LGC PL F+YNS+IKCL +E + ED SL++ +QE +PD TYL
Sbjct: 498 NYIAALSRIEKMVNLGCTPLPFSYNSVIKCLFQENIIEDLASLVNIIQELDFVPDVDTYL 557
Query: 560 IIINEHCRKGNVSSAHYIHRKMRQRGLKPSVAIYDSIIGCLSRKKRIFEVKGVFKKMLKA 619
I++NE C+K + +A I M + GL+P+VAIY SIIG L ++ R+ E + F KML++
Sbjct: 558 IVVNELCKKNDRDAAFAIIDAMEELGLRPTVAIYSSIIGSLGKQGRVVEAEETFAKMLES 617
Query: 620 GVDPDKNLYLTMINGYGKNGKLLEARKLFEQMVENSIPPSSHIYTALISGLVKKNMTDQG 679
G+ PD+ Y+ MIN Y +NG++ EA +L E++V++ + PSS YT LISG VK M ++G
Sbjct: 618 GIQPDEIAYMIMINTYARNGRIDEANELVEEVVKHFLRPSSFTYTVLISGFVKMGMMEKG 677
Query: 680 CLYLGKMLRDGFSPNSVLYSSLINHYLKIGEVEYAFRLVDLMERSHIEPDVIFYITLVSG 739
C YL KML DG SPN VLY++LI H+LK G+ +++F L LM + I+ D I YITL+SG
Sbjct: 678 CQYLDKMLEDGLSPNVVLYTALIGHFLKKGDFKFSFTLFGLMGENDIKHDHIAYITLLSG 737
Query: 740 ICKNLIVDKKKWFLLEKENQKAKSTLFRMLHETTLVPRDNNMIVSANSTEEMKSLALKLI 799
+ + + KK+ ++E +K L R++ LV I S+ KS A+++I
Sbjct: 738 LWRAMARKKKRQVIVEPGKEKL---LQRLIRTKPLVS-----IPSSLGNYGSKSFAMEVI 797
Query: 800 QKVKDVCIVPNLHLYNSIICGYCRTDRMLDANHQLELMQKEGLHPNQVTFTILMD----- 859
KVK I+PNL+L+N+II GYC R+ +A + LE MQKEG+ PN VT+TILM
Sbjct: 798 GKVKK-SIIPNLYLHNTIITGYCAAGRLDEAYNHLESMQKEGIVPNLVTYTILMKSHIEA 857
Query: 860 GDVNSAIGLFNKMNVDGCIPDEVAYNTLLKGLSQGGRLSDALAL 893
GD+ SAI LF N C PD+V Y+TLLKGL R DALAL
Sbjct: 858 GDIESAIDLFEGTN---CEPDQVMYSTLLKGLCDFKRPLDALAL 883
BLAST of CmoCh16G002000 vs. Swiss-Prot
Match:
PP437_ARATH (Putative pentatricopeptide repeat-containing protein At5g59900 OS=Arabidopsis thaliana GN=At5g59900 PE=3 SV=1)
HSP 1 Score: 241.1 bits (614), Expect = 4.7e-62
Identity = 188/742 (25.34%), Postives = 343/742 (46.23%), Query Frame = 1
Query: 162 SKTSFNAIFRELCAQERVLEAFDYF-VRVNGGGVHLGYWCFNVLIDGLCNKGHMEEALEL 221
S +SF+ + + RVL+ F + + + + L+ GL H A+EL
Sbjct: 155 SSSSFDLLIQHYVRSRRVLDGVLVFKMMITKVSLLPEVRTLSALLHGLVKFRHFGLAMEL 214
Query: 222 FDIMQNTNGYPPSLHLFKSLFYGLCKRKWLVEAELLIREMEFRSLYPDKTMYTSLVHEYC 281
F+ M + G P ++++ + LC+ K L A+ +I ME + Y L+ C
Sbjct: 215 FNDMVSV-GIRPDVYIYTGVIRSLCELKDLSRAKEMIAHMEATGCDVNIVPYNVLIDGLC 274
Query: 282 KDKKMKMAMQAFFRMIKIGCEPDNYTLNTLIHGFVKLGLVDKGWLVYNLMAEWGIQPDVV 341
K +K+ A+ + +PD T TL++G K+ + G + + M P
Sbjct: 275 KKQKVWEAVGIKKDLAGKDLKPDVVTYCTLVYGLCKVQEFEIGLEMMDEMLCLRFSPSEA 334
Query: 342 TFHIMISQYCQEGKVDFALTILNNMVSCNFSPSLHCYTVLINALHRDDRLEEVSELLRSI 401
++ + GK++ AL ++ +V SP+L Y LI++L + + E L +
Sbjct: 335 AVSSLVEGLRKRGKIEEALNLVKRVVDFGVSPNLFVYNALIDSLCKGRKFHEAELLFDRM 394
Query: 402 LDNGIVPDHVLFFTLMKMYPKGHELQLALNFLEAILKNGCGCDP----SVILASTKLQTS 461
G+ P+ V + L+ M+ + +L AL+FL ++ G S+I K
Sbjct: 395 GKIGLRPNDVTYSILIDMFCRRGKLDTALSFLGEMVDTGLKLSVYPYNSLINGHCKFGDI 454
Query: 462 SNLEQKIETLLQEIFNSNLNLAGVAFSIVICALCETENLDCALDYFHKMASLGCKPLLFT 521
S E + E+ N L V ++ ++ C ++ AL +H+M G P ++T
Sbjct: 455 S----AAEGFMAEMINKKLEPTVVTYTSLMGGYCSKGKINKALRLYHEMTGKGIAPSIYT 514
Query: 522 YNSLIKCLCKEGLFEDALSLIDHMQECSLLPDTTTYLIIINEHCRKGNVSSAHYIHRKMR 581
+ +L+ L + GL DA+ L + M E ++ P+ TY ++I +C +G++S A ++M
Sbjct: 515 FTTLLSGLFRAGLIRDAVKLFNEMAEWNVKPNRVTYNVMIEGYCEEGDMSKAFEFLKEMT 574
Query: 582 QRGLKPSVAIYDSIIGCLSRKKRIFEVKGVFKKMLKAGVDPDKNLYLTMINGYGKNGKLL 641
++G+ P Y +I L + E K + K + ++ Y +++G+ + GKL
Sbjct: 575 EKGIVPDTYSYRPLIHGLCLTGQASEAKVFVDGLHKGNCELNEICYTGLLHGFCREGKLE 634
Query: 642 EARKLFEQMVENSIPPSSHIYTALISGLVKKNMTDQGCLYLGKMLRDGFSPNSVLYSSLI 701
EA + ++MV+ + Y LI G +K L +M G P+ V+Y+S+I
Sbjct: 635 EALSVCQEMVQRGVDLDLVCYGVLIDGSLKHKDRKLFFGLLKEMHDRGLKPDDVIYTSMI 694
Query: 702 NHYLKIGEVEYAFRLVDLMERSHIEPDVIFYITLVSGICKNLIVDKKKWFLLEKENQKAK 761
+ K G+ + AF + DLM P+ + Y +++G+CK V++ + +L + Q
Sbjct: 695 DAKSKTGDFKEAFGIWDLMINEGCVPNEVTYTAVINGLCKAGFVNEAE--VLCSKMQPVS 754
Query: 762 STLFRMLHETTLVPRDNNMIVSANSTEEMKSLAL-KLIQKVKDVCIVPNLHLYNSIICGY 821
S ++ + L I++ + K++ L I K ++ N YN +I G+
Sbjct: 755 SVPNQVTYGCFL------DILTKGEVDMQKAVELHNAILK----GLLANTATYNMLIRGF 814
Query: 822 CRTDRMLDANHQLELMQKEGLHPNQVTFTILMD-----GDVNSAIGLFNKMNVDGCIPDE 881
CR R+ +A+ + M +G+ P+ +T+T +++ DV AI L+N M G PD
Sbjct: 815 CRQGRIEEASELITRMIGDGVSPDCITYTTMINELCRRNDVKKAIELWNSMTEKGIRPDR 874
Query: 882 VAYNTLLKGLSQGGRLSDALAL 893
VAYNTL+ G G + A L
Sbjct: 875 VAYNTLIHGCCVAGEMGKATEL 879
BLAST of CmoCh16G002000 vs. Swiss-Prot
Match:
PP445_ARATH (Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana GN=At5g65560 PE=2 SV=1)
HSP 1 Score: 228.8 bits (582), Expect = 2.4e-58
Identity = 171/706 (24.22%), Postives = 309/706 (43.77%), Query Frame = 1
Query: 200 CFNVLIDGLCNKGHMEEALELFDIMQNTNGYPPSLHLFKSLFYGLCKRKWLVEAELLIRE 259
C+N L++ L G ++E +++ M P+++ + + G CK + EA + +
Sbjct: 185 CYNTLLNSLARFGLVDEMKQVYMEMLEDK-VCPNIYTYNKMVNGYCKLGNVEEANQYVSK 244
Query: 260 MEFRSLYPDKTMYTSLVHEYCKDKKMKMAMQAFFRMIKIGCEPDNYTLNTLIHGFVKLGL 319
+ L PD YTSL+ YC+ K + A + F M GC + LIHG
Sbjct: 245 IVEAGLDPDFFTYTSLIMGYCQRKDLDSAFKVFNEMPLKGCRRNEVAYTHLIHGLCVARR 304
Query: 320 VDKGWLVYNLMAEWGIQPDVVTFHIMISQYCQEGKVDFALTILNNMVSCNFSPSLHCYTV 379
+D+ ++ M + P V T+ ++I C + AL ++ M P++H YTV
Sbjct: 305 IDEAMDLFVKMKDDECFPTVRTYTVLIKSLCGSERKSEALNLVKEMEETGIKPNIHTYTV 364
Query: 380 LINALHRDDRLEEVSELLRSILDNGIVPDHVLFFTLMKMYPKGHELQLALNFLEAILKNG 439
LI++L + E+ ELL +L+ G++P+ + + L+ Y K ++ A++ +E +
Sbjct: 365 LIDSLCSQCKFEKARELLGQMLEKGLMPNVITYNALINGYCKRGMIEDAVDVVELMESRK 424
Query: 440 CGCDPSVILASTKLQTSSNLEQKIETLLQEIFNSNLNLAGVAFSIVICALCETENLDCAL 499
+ K SN+ K +L ++ + V ++ +I C + N D A
Sbjct: 425 LSPNTRTYNELIKGYCKSNVH-KAMGVLNKMLERKVLPDVVTYNSLIDGQCRSGNFDSAY 484
Query: 500 DYFHKMASLGCKPLLFTYNSLIKCLCKEGLFEDALSLIDHMQECSLLPDTTTYLIIINEH 559
M G P +TY S+I LCK E+A L D +++ + P+ Y +I+ +
Sbjct: 485 RLLSLMNDRGLVPDQWTYTSMIDSLCKSKRVEEACDLFDSLEQKGVNPNVVMYTALIDGY 544
Query: 560 CRKGNVSSAHYIHRKMRQRGLKPSVAIYDSIIGCLSRKKRIFEVKGVFKKMLKAGVDPDK 619
C+ G V AH + KM + P+ ++++I L ++ E + +KM+K G+ P
Sbjct: 545 CKAGKVDEAHLMLEKMLSKNCLPNSLTFNALIHGLCADGKLKEATLLEEKMVKIGLQPTV 604
Query: 620 NLYLTMINGYGKNGKLLEARKLFEQMVENSIPPSSHIYTALISGLVKKNMTDQGCLYLGK 679
+ +I+ K+G A F+QM+ + P +H YT I ++ + K
Sbjct: 605 STDTILIHRLLKDGDFDHAYSRFQQMLSSGTKPDAHTYTTFIQTYCREGRLLDAEDMMAK 664
Query: 680 MLRDGFSPNSVLYSSLINHYLKIGEVEYAFRLVDLMERSHIEPDVIFYITLVSGICKNLI 739
M +G SP+ YSSLI Y +G+ +AF ++ M + EP +++L+
Sbjct: 665 MRENGVSPDLFTYSSLIKGYGDLGQTNFAFDVLKRMRDTGCEPSQHTFLSLIK------- 724
Query: 740 VDKKKWFLLEKENQKAKSTLFRMLHETTLVPRDNNMIVSANSTEEMKSLALKLIQKVKDV 799
LLE + K K + E L N M ++L++K+ +
Sbjct: 725 ------HLLEMKYGKQKGS------EPELCAMSNMMEFDT---------VVELLEKMVEH 784
Query: 800 CIVPNLHLYNSIICGYCRTDRMLDANHQLELMQK-EGLHPNQVTFTILMD-----GDVNS 859
+ PN Y +I G C + A + MQ+ EG+ P+++ F L+ N
Sbjct: 785 SVTPNAKSYEKLILGICEVGNLRVAEKVFDHMQRNEGISPSELVFNALLSCCCKLKKHNE 844
Query: 860 AIGLFNKMNVDGCIPDEVAYNTLLKGLSQGGRLSDALALHMFICSC 900
A + + M G +P + L+ GL + G ++ + C
Sbjct: 845 AAKVVDDMICVGHLPQLESCKVLICGLYKKGEKERGTSVFQNLLQC 860
BLAST of CmoCh16G002000 vs. Swiss-Prot
Match:
PP442_ARATH (Pentatricopeptide repeat-containing protein At5g61990, mitochondrial OS=Arabidopsis thaliana GN=At5g61990 PE=2 SV=1)
HSP 1 Score: 228.8 bits (582), Expect = 2.4e-58
Identity = 186/780 (23.85%), Postives = 337/780 (43.21%), Query Frame = 1
Query: 121 FRGAEPDASVLDSMVICFCRLGKFEKALAYFNQLLSLNYVPSKTSFNAIFRELCAQERVL 180
F G D + + + G E+A+ F+ + L VP + + L R+
Sbjct: 144 FVGKSDDGVLFGILFDGYIAKGYIEEAVFVFSSSMGLELVPRLSRCKVLLDALLRWNRLD 203
Query: 181 EAFDYFVRVNGGGVHLGYWCFNVLIDGLCNKGHMEEALELFDIMQNTNGYPPSLHLFKSL 240
+D + + V +++LI C G+++ D++ T F++
Sbjct: 204 LFWDVYKGMVERNVVFDVKTYHMLIIAHCRAGNVQLGK---DVLFKTEKE------FRTA 263
Query: 241 FYGLCKRKWLVEAELLIRE-MEFRSLYPDKTMYTSLVHEYCKDKKMKMAMQAFFRMIKIG 300
V+ L ++E M + L P K Y L+ CK K+++ A M +G
Sbjct: 264 TLN-------VDGALKLKESMICKGLVPLKYTYDVLIDGLCKIKRLEDAKSLLVEMDSLG 323
Query: 301 CEPDNYTLNTLIHGFVKLGLVDKGWLVYNLMAEWGIQPDVVTFHIMISQYCQEGKVDFAL 360
DN+T + LI G +K D + + M GI + I +EG ++ A
Sbjct: 324 VSLDNHTYSLLIDGLLKGRNADAAKGLVHEMVSHGINIKPYMYDCCICVMSKEGVMEKAK 383
Query: 361 TILNNMVSCNFSPSLHCYTVLINALHRDDRLEEVSELLRSILDNGIVPDHVLFFTLMKMY 420
+ + M++ P Y LI R+ + + ELL + IV + T++K
Sbjct: 384 ALFDGMIASGLIPQAQAYASLIEGYCREKNVRQGYELLVEMKKRNIVISPYTYGTVVKGM 443
Query: 421 PKGHELQLALNFLEAILKNGCGCDPSVILASTKLQTSSNLEQKIETL--LQEIFNSNLNL 480
+L A N ++ ++ +GC P+V++ +T ++T + + + L+E+ +
Sbjct: 444 CSSGDLDGAYNIVKEMIASGCR--PNVVIYTTLIKTFLQNSRFGDAMRVLKEMKEQGIAP 503
Query: 481 AGVAFSIVICALCETENLDCALDYFHKMASLGCKPLLFTYNSLIKCLCKEGLFEDALSLI 540
++ +I L + + +D A + +M G KP FTY + I + F A +
Sbjct: 504 DIFCYNSLIIGLSKAKRMDEARSFLVEMVENGLKPNAFTYGAFISGYIEASEFASADKYV 563
Query: 541 DHMQECSLLPDTTTYLIIINEHCRKGNVSSAHYIHRKMRQRGLKPSVAIYDSIIGCLSRK 600
M+EC +LP+ +INE+C+KG V A +R M +G+ Y ++ L +
Sbjct: 564 KEMRECGVLPNKVLCTGLINEYCKKGKVIEACSAYRSMVDQGILGDAKTYTVLMNGLFKN 623
Query: 601 KRIFEVKGVFKKMLKAGVDPDKNLYLTMINGYGKNGKLLEARKLFEQMVENSIPPSSHIY 660
++ + + +F++M G+ PD Y +ING+ K G + +A +F++MVE + P+ IY
Sbjct: 624 DKVDDAEEIFREMRGKGIAPDVFSYGVLINGFSKLGNMQKASSIFDEMVEEGLTPNVIIY 683
Query: 661 TALISGLVKKNMTDQGCLYLGKMLRDGFSPNSVLYSSLINHYLKIGEVEYAFRLVDLMER 720
L+ G + ++ L +M G PN+V Y ++I+ Y K G++ AFRL D M+
Sbjct: 684 NMLLGGFCRSGEIEKAKELLDEMSVKGLHPNAVTYCTIIDGYCKSGDLAEAFRLFDEMKL 743
Query: 721 SHIEPDVIFYITLVSGICKNLIVDKKKWFLLEKENQKAKSTLFRMLHETTLVPRDNNMIV 780
+ PD Y TLV G C+ + D ++ + N+K ++ T N +
Sbjct: 744 KGLVPDSFVYTTLVDGCCR--LNDVERAITIFGTNKKGCAS------STAPFNALINWVF 803
Query: 781 SANSTEEMKSLALKLIQKVKDVCIVPNLHLYNSIICGYCRTDRMLDANHQLELMQKEGLH 840
TE + +L+ D PN YN +I C+ + A MQ L
Sbjct: 804 KFGKTELKTEVLNRLMDGSFDRFGKPNDVTYNIMIDYLCKEGNLEAAKELFHQMQNANLM 863
Query: 841 PNQVTFTILMDGDVN-----SAIGLFNKMNVDGCIPDEVAYNTLLKGLSQGGRLSDALAL 893
P +T+T L++G +F++ G PD + Y+ ++ + G + AL L
Sbjct: 864 PTVITYTSLLNGYDKMGRRAEMFPVFDEAIAAGIEPDHIMYSVIINAFLKEGMTTKALVL 897
BLAST of CmoCh16G002000 vs. Swiss-Prot
Match:
PPR67_ARATH (Putative pentatricopeptide repeat-containing protein At1g31840 OS=Arabidopsis thaliana GN=At1g31840 PE=2 SV=2)
HSP 1 Score: 226.9 bits (577), Expect = 9.2e-58
Identity = 167/647 (25.81%), Postives = 291/647 (44.98%), Query Frame = 1
Query: 110 LAELLYDKKFTFRGAE-----------PDASVLDSMVICFCRLGKFEKALAYFNQLLSLN 169
+A+ ++D+ T RG + DA V ++ C CR G +KAL F L
Sbjct: 117 VADKVFDEMITNRGKDFNVLGSIRDRSLDADVCKFLMECCCRYGMVDKALEIFVYSTQLG 176
Query: 170 YVPSKTSFNAIFRELCAQERVLEAFDYFVRVNGGGVH-LGYWCFNVLIDGLCNKGHMEEA 229
V + S + L +RV D+F ++ GG+ G ++D L KG + +A
Sbjct: 177 VVIPQDSVYRMLNSLIGSDRVDLIADHFDKLCRGGIEPSGVSAHGFVLDALFCKGEVTKA 236
Query: 230 LELFDIMQNTNGYPPSLHLFKSLFYGLCKRKWLVEAELLIREMEFRSLYPDKTMYTSLVH 289
L+ ++ G+ + + GL + V + LL ++ P+ + +L++
Sbjct: 237 LDFHRLVME-RGFRVGIVSCNKVLKGLSVDQIEVASRLLSLVLDCGPA-PNVVTFCTLIN 296
Query: 290 EYCKDKKMKMAMQAFFRMIKIGCEPDNYTLNTLIHGFVKLGLVDKGWLVYNLMAEWGIQP 349
+CK +M A F M + G EPD +TLI G+ K G++ G +++ G++
Sbjct: 297 GFCKRGEMDRAFDLFKVMEQRGIEPDLIAYSTLIDGYFKAGMLGMGHKLFSQALHKGVKL 356
Query: 350 DVVTFHIMISQYCQEGKVDFALTILNNMVSCNFSPSLHCYTVLINALHRDDRLEEVSELL 409
DVV F I Y + G + A + M+ SP++ YT+LI L +D R+ E +
Sbjct: 357 DVVVFSSTIDVYVKSGDLATASVVYKRMLCQGISPNVVTYTILIKGLCQDGRIYEAFGMY 416
Query: 410 RSILDNGIVPDHVLFFTLMKMYPKGHELQLALNFLEAILKNGCGCDPSVILASTKLQTSS 469
IL G+ P V + +L+ + K L+ E ++K G P V++ + S
Sbjct: 417 GQILKRGMEPSIVTYSSLIDGFCKCGNLRSGFALYEDMIK--MGYPPDVVIYGVLVDGLS 476
Query: 470 NLEQKIETLLQEI--FNSNLNLAGVAFSIVICALCETENLDCALDYFHKMASLGCKPLLF 529
+ + + ++ L V F+ +I C D AL F M G KP +
Sbjct: 477 KQGLMLHAMRFSVKMLGQSIRLNVVVFNSLIDGWCRLNRFDEALKVFRLMGIYGIKPDVA 536
Query: 530 TYNSLIKCLCKEGLFEDALSLIDHMQECSLLPDTTTYLIIINEHCRKGNVSSAHYIHRKM 589
T+ ++++ EG E+AL L M + L PD Y +I+ C+ + + M
Sbjct: 537 TFTTVMRVSIMEGRLEEALFLFFRMFKMGLEPDALAYCTLIDAFCKHMKPTIGLQLFDLM 596
Query: 590 RQRGLKPSVAIYDSIIGCLSRKKRIFEVKGVFKKMLKAGVDPDKNLYLTMINGYGKNGKL 649
++ + +A+ + +I L + RI + F +++ ++PD Y TMI GY +L
Sbjct: 597 QRNKISADIAVCNVVIHLLFKCHRIEDASKFFNNLIEGKMEPDIVTYNTMICGYCSLRRL 656
Query: 650 LEARKLFEQMVENSIPPSSHIYTALISGLVKKNMTDQGCLYLGKMLRDGFSPNSVLYSSL 709
EA ++FE + P++ T LI L K N D M G PN+V Y L
Sbjct: 657 DEAERIFELLKVTPFGPNTVTLTILIHVLCKNNDMDGAIRMFSIMAEKGSKPNAVTYGCL 716
Query: 710 INHYLKIGEVEYAFRLVDLMERSHIEPDVIFYITLVSGICKNLIVDK 743
++ + K ++E +F+L + M+ I P ++ Y ++ G+CK VD+
Sbjct: 717 MDWFSKSVDIEGSFKLFEEMQEKGISPSIVSYSIIIDGLCKRGRVDE 759
BLAST of CmoCh16G002000 vs. TrEMBL
Match:
A0A0A0LB22_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G175715 PE=4 SV=1)
HSP 1 Score: 1134.0 bits (2932), Expect = 0.0e+00
Identity = 556/706 (78.75%), Postives = 626/706 (88.67%), Query Frame = 1
Query: 1 MIRGRP-CKYYLSVNFRNLVTTCTVPLDPPVTSSSSSASEHKTLCYSLVEQLIRRGLFLP 60
MIRGRP CKYYLS+NFRNLVTTCTVPLDPP TSS SSASEHK LC+SLVEQLIRRG F
Sbjct: 1 MIRGRPSCKYYLSMNFRNLVTTCTVPLDPPTTSSFSSASEHKNLCFSLVEQLIRRGFFFQ 60
Query: 61 AQQVIQRIVTQSSSISEAISIVDFAAERGLELDLDTHGVFWRQLVYSRPQLAELLYDKKF 120
AQQVIQRIVTQSSSISEAISIV+FAAE GLELDL THG+ RQLV+S+PQL+E LY++KF
Sbjct: 61 AQQVIQRIVTQSSSISEAISIVNFAAEWGLELDLATHGLLCRQLVFSKPQLSEFLYNRKF 120
Query: 121 TFRGAEPDASVLDSMVICFCRLGKFEKALAYFNQLLSLNYVPSKTSFNAIFRELCAQERV 180
GAEPD +LDSMV CFCRLGKFE+AL++FN+LLSLNYVPSK SFNAIFRELCAQ RV
Sbjct: 121 VVGGAEPDVLLLDSMVSCFCRLGKFEEALSHFNRLLSLNYVPSKVSFNAIFRELCAQGRV 180
Query: 181 LEAFDYFVRVNGGGVHLGYWCFNVLIDGLCNKGHMEEALELFDIMQNTNGYPPSLHLFKS 240
LEAF+YFVRVNG G++LG WCFNVL+DGLCN+G M EALELFDIMQ+TNGYPP+LHLFK+
Sbjct: 181 LEAFNYFVRVNGAGIYLGCWCFNVLMDGLCNQGFMGEALELFDIMQSTNGYPPTLHLFKT 240
Query: 241 LFYGLCKRKWLVEAELLIREMEFRSLYPDKTMYTSLVHEYCKDKKMKMAMQAFFRMIKIG 300
LFYGLCK WLVEAELLIREMEFRSLYPDKTMYTSL+H YC+D+KMKMAMQA FRM+KIG
Sbjct: 241 LFYGLCKSGWLVEAELLIREMEFRSLYPDKTMYTSLIHGYCRDRKMKMAMQALFRMVKIG 300
Query: 301 CEPDNYTLNTLIHGFVKLGLVDKGWLVYNLMAEWGIQPDVVTFHIMISQYCQEGKVDFAL 360
C+PD +TLN+LIHGFVKLGLV+KGWLVY LM +WGIQPDVVTFHIMI +YCQEGKVD AL
Sbjct: 301 CKPDTFTLNSLIHGFVKLGLVEKGWLVYKLMEDWGIQPDVVTFHIMIGKYCQEGKVDSAL 360
Query: 361 TILNNMVSCNFSPSLHCYTVLINALHRDDRLEEVSELLRSILDNGIVPDHVLFFTLMKMY 420
ILN+MVS N SPS+HCYTVL +AL+R+ RLEEV L + +LDNGI+PDHVLF TLMKMY
Sbjct: 361 MILNSMVSSNLSPSVHCYTVLSSALYRNGRLEEVDGLFKGMLDNGIIPDHVLFLTLMKMY 420
Query: 421 PKGHELQLALNFLEAILKNGCGCDPSVILASTKLQTSSNLEQKIETLLQEIFNSNLNLAG 480
PKGHELQLALN LE I+KNGCGCDPSVILAS + QTSSNLEQK E +L+EI S+LNLAG
Sbjct: 421 PKGHELQLALNILETIVKNGCGCDPSVILASAEWQTSSNLEQKFEIVLKEISISDLNLAG 480
Query: 481 VAFSIVICALCETENLDCALDYFHKMASLGCKPLLFTYNSLIKCLCKEGLFEDALSLIDH 540
VAFSIVI ALCETEN ALDY H M SLGCKPLLFTYNSLI+ LCKE LFEDA+SLIDH
Sbjct: 481 VAFSIVISALCETENFCYALDYLHNMVSLGCKPLLFTYNSLIRRLCKERLFEDAMSLIDH 540
Query: 541 MQECSLLPDTTTYLIIINEHCRKGNVSSAHYIHRKMRQRGLKPSVAIYDSIIGCLSRKKR 600
M++ SL P+TTTYLII+NE+CR+GNV++A++I RKMRQ GLKPSVAIYDSII CLSR+KR
Sbjct: 541 MKDYSLFPNTTTYLIIVNEYCRQGNVTAAYHILRKMRQVGLKPSVAIYDSIIRCLSREKR 600
Query: 601 IFEVKGVFKKMLKAGVDPDKNLYLTMINGYGKNGKLLEARKLFEQMVENSIPPSSHIYTA 660
I E + VFK ML+AG+DPDK YLTMI GY KNG++LEA +LFEQMVENSIPPSSHIYTA
Sbjct: 601 ICEAEVVFKMMLEAGMDPDKKFYLTMIKGYSKNGRILEACELFEQMVENSIPPSSHIYTA 660
Query: 661 LISGLVKKNMTDQGCLYLGKMLRDGFSPNSVLYSSLINHYLKIGEV 706
LI GL KNMTD+GCLYLGKM R+GF PN VLYS+L+NHYL++GEV
Sbjct: 661 LIRGLGMKNMTDKGCLYLGKMSRNGFLPNVVLYSTLMNHYLRVGEV 706
BLAST of CmoCh16G002000 vs. TrEMBL
Match:
A0A061G037_THECC (Tetratricopeptide repeat-like superfamily protein, putative isoform 1 OS=Theobroma cacao GN=TCM_014940 PE=4 SV=1)
HSP 1 Score: 1036.2 bits (2678), Expect = 2.4e-299
Identity = 513/898 (57.13%), Postives = 676/898 (75.28%), Query Frame = 1
Query: 1 MIRGRPCKYYLSVNFRNLVTTCTVPLDPPVTSSSSSASEHKTLCYSLVEQLIRRGLFLPA 60
MI+ R +L R +TT T+PLDP + SS ++HK+ C SL EQLI+RGL A
Sbjct: 1 MIKKRLLSCHLFFKTRRAITTSTLPLDPSFAAVSSICTDHKSFCLSLTEQLIKRGLLSSA 60
Query: 61 QQVIQRIVTQSSSISEAISIVDFAAERGLELDLDTHGVFWRQLVYSR-PQLAELLYDKKF 120
QQ+IQRI++QSSS+S+AI+ VDF RGL+LDL T G ++LV S PQLA LY
Sbjct: 61 QQLIQRIISQSSSVSDAITAVDFVTARGLDLDLSTFGALIKKLVRSGYPQLAYSLYSDNI 120
Query: 121 TFRGAEPDASVLDSMVICFCRLGKFEKALAYFNQLLSLNYVPSKTSFNAIFRELCAQERV 180
RG PD +++SMVIC C+LGK E+A F++LL +N K +FNA+ REL AQER
Sbjct: 121 IRRGINPDPFIVNSMVICLCKLGKLEEASTLFDRLL-MNNSSEKPAFNALVRELFAQERF 180
Query: 181 LEAFDYFVRVNGGGVHLGYWCFNVLIDGLCNKGHMEEALELFDIMQNTNGYPPSLHLFKS 240
L+ FDYFV ++ GV+LG W +N LIDGLC KG++EEA+++FD+M+ T G P+LHL+KS
Sbjct: 181 LDVFDYFVAMSDIGVNLGCWYYNGLIDGLCQKGNLEEAIQMFDLMRETAGLSPTLHLYKS 240
Query: 241 LFYGLCKRKWLVEAELLIREMEFRSLYPDKTMYTSLVHEYCKDKKMKMAMQAFFRMIKIG 300
LFYGLCK W++EAE LI E+E + Y D+TMYTSL+ EYCKD+KMKMAM+ + RM+K G
Sbjct: 241 LFYGLCKHGWVLEAEFLIGEIESQGFYVDRTMYTSLIKEYCKDRKMKMAMRIYLRMLKTG 300
Query: 301 CEPDNYTLNTLIHGFVKLGLVDKGWLVYNLMAEWGIQPDVVTFHIMISQYCQEGKVDFAL 360
CEPD+YT NTLIHGFVK+GL D+GW++YN M E G+QPDV+T+H+MIS YC+EGK + A
Sbjct: 301 CEPDSYTYNTLIHGFVKMGLFDQGWVLYNQMMEKGLQPDVITYHVMISNYCREGKANCAS 360
Query: 361 TILNNMVSCNFSPSLHCYTVLINALHRDDRLEEVSELLRSILDNGIVPDHVLFFTLMKMY 420
+LN+MVS N +PS+HCYTVLI + ++++RL E EL +S+L GIVPDHVLFFTLMKMY
Sbjct: 361 MLLNSMVSNNLAPSVHCYTVLITSFYKENRLMEAGELYKSMLTGGIVPDHVLFFTLMKMY 420
Query: 421 PKGHELQLALNFLEAILKNGCGCDPSVILASTKLQTSSNLEQKIETLLQEIFNSNLNLAG 480
PKG+EL LAL ++AI NGCG DP ++ S S +LEQKIE L+ +I +NL+LA
Sbjct: 421 PKGYELHLALMIVQAIAVNGCGFDPLLLAVS----DSEDLEQKIELLIGKIEKTNLSLAN 480
Query: 481 VAFSIVICALCETENLDCALDYFHKMASLGCKPLLFTYNSLIKCLCKEGLFEDALSLIDH 540
VAF+I+I AL E LD A+ + K+ +LGC PLLFTYNSL+KCL +EGLFEDA SL+D
Sbjct: 481 VAFTILISALSEGRKLDTAVHFMDKLMNLGCMPLLFTYNSLVKCLSQEGLFEDAKSLVDL 540
Query: 541 MQECSLLPDTTTYLIIINEHCRKGNVSSAHYIHRKMRQRGLKPSVAIYDSIIGCLSRKKR 600
MQ+ + PD TYLI++NEHC+ G+++SA I +M RG+KP VAIYD IIG L R+KR
Sbjct: 541 MQDRGIFPDQATYLIMVNEHCKHGDLASAFDILDQMEDRGMKPGVAIYDCIIGSLCRQKR 600
Query: 601 IFEVKGVFKKMLKAGVDPDKNLYLTMINGYGKNGKLLEARKLFEQMVENSIPPSSHIYTA 660
+FE + +F +ML++G DPD+ +Y+TMINGY KNG+L+EAR+LFE+M+E++I P+SH YTA
Sbjct: 601 LFEAEDMFIRMLESGEDPDEIVYMTMINGYAKNGRLIEARQLFEKMIEDAIRPTSHSYTA 660
Query: 661 LISGLVKKNMTDQGCLYLGKMLRDGFSPNSVLYSSLINHYLKIGEVEYAFRLVDLMERSH 720
LISGLVKK+MTD+GC+YL +ML DG PN VLY+SLIN++L+ GE E+AFRLVDLM+R+
Sbjct: 661 LISGLVKKDMTDKGCMYLDRMLGDGLVPNVVLYTSLINNFLRKGEFEFAFRLVDLMDRNQ 720
Query: 721 IEPDVIFYITLVSGICKNLIVDKKKWFLLEKENQKAKSTLFRMLHETTLVPRDNNMIVSA 780
IE D+I YI LVSG+C+N I +K+W +++ +++A+ LFR+LH L+PR+ + VS
Sbjct: 721 IEHDLITYIALVSGVCRN-ITSRKRWCSIKRSSERAREMLFRLLHYRCLLPREKKLRVSD 780
Query: 781 NSTEEMKSLALKLIQKVKDVCIVPNLHLYNSIICGYCRTDRMLDANHQLELMQKEGLHPN 840
+S E MK ALKL+QKVK+ +PNL+LYN II G+C DRM DA ELMQKEG+ PN
Sbjct: 781 SSPEAMKCFALKLMQKVKETRFMPNLYLYNGIISGFCWADRMQDAYDHFELMQKEGVRPN 840
Query: 841 QVTFTILMD-----GDVNSAIGLFNKMNVDGCIPDEVAYNTLLKGLSQGGRLSDALAL 893
QVT TILM G+++ AI LFNKMN D C PD++AYNTL+KGL Q GRL +AL+L
Sbjct: 841 QVTLTILMGGHIKAGEIDHAIDLFNKMNADDCTPDKIAYNTLIKGLCQAGRLLEALSL 892
BLAST of CmoCh16G002000 vs. TrEMBL
Match:
F6HAK9_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_16s0022g01780 PE=4 SV=1)
HSP 1 Score: 985.7 bits (2547), Expect = 3.8e-284
Identity = 483/880 (54.89%), Postives = 650/880 (73.86%), Query Frame = 1
Query: 19 VTTCTVPLDPPVTSSSSSASEHKTLCYSLVEQLIRRGLFLPAQQVIQRIVTQSSSISEAI 78
+ TC+ LDPP SS+ + H LC++L ++LIRRG+ QQV++R++ QS S+S+AI
Sbjct: 19 LATCSPALDPP-PSSAPTTEHHNKLCFTLTDRLIRRGVLSLGQQVVRRMIKQSPSVSDAI 78
Query: 79 SIVDFAAERGLELDLDTHGVFWRQLVYS-RPQLAELLYDKKFTFRGAEPDASVLDSMVIC 138
V+FAA RGLELD +GV R+LV S + AE +Y RG PD+ L+SMVIC
Sbjct: 79 LAVEFAAARGLELDSCGYGVLLRKLVGSGEHRFAEAVYRDYVIARGIIPDSETLNSMVIC 138
Query: 139 FCRLGKFEKALAYFNQLLSLNYVPSKTSFNAIFRELCAQERVLEAFDYFVRVNGGGVHLG 198
+C LGK E+A+A+F++L ++ P K + NA+ RELCA+ERVLEAFDYFVR+N G+ +G
Sbjct: 139 YCNLGKLEEAMAHFDRLFEVDSFPCKPACNAMLRELCARERVLEAFDYFVRINDVGILMG 198
Query: 199 YWCFNVLIDGLCNKGHMEEALELFDIMQNTNGYPPSLHLFKSLFYGLCKRKWLVEAELLI 258
WCFN LIDGLC+KGH++EA +FD M+ G P ++HL+K+LFYGLC+++ + EAEL +
Sbjct: 199 LWCFNRLIDGLCDKGHVDEAFYMFDTMRERTGLPATIHLYKTLFYGLCRQERVEEAELFV 258
Query: 259 REMEFRSLYPDKTMYTSLVHEYCKDKKMKMAMQAFFRMIKIGCEPDNYTLNTLIHGFVKL 318
EME + DK MYTSL+H YC+ KKM+ AM+ F RM+K+GC+PD YT NTLIHGFVKL
Sbjct: 259 GEMESEGHFIDKMMYTSLIHGYCRGKKMRTAMRVFLRMLKMGCDPDTYTYNTLIHGFVKL 318
Query: 319 GLVDKGWLVYNLMAEWGIQPDVVTFHIMISQYCQEGKVDFALTILNNMVSCNFSPSLHCY 378
GL DKGW+++N M+EWG+QP+VVT+HIMI +YC+EGKVD ALT+L++M S N +PS+H Y
Sbjct: 319 GLFDKGWILHNQMSEWGLQPNVVTYHIMIRRYCEEGKVDCALTLLSSMSSFNLTPSVHSY 378
Query: 379 TVLINALHRDDRLEEVSELLRSILDNGIVPDHVLFFTLMKMYPKGHELQLALNFLEAILK 438
TVLI AL++++RL EV EL + +LD G+VPDHVLFFTLM+ PKGHEL LAL L+AI K
Sbjct: 379 TVLITALYKENRLVEVEELYKKMLDIGVVPDHVLFFTLMQKQPKGHELHLALKILQAIAK 438
Query: 439 NGCGCDPSVILASTKLQTSSNLEQKIETLLQEIFNSNLNLAGVAFSIVICALCETENLDC 498
NGC D ++ S + ++EQ+IE LL EI N LA VAF I I ALC D
Sbjct: 439 NGCNLDLCLLSTSATHSPTQDVEQEIECLLGEIVRRNFALADVAFGIFISALCAAGKTDA 498
Query: 499 ALDYFHKMASLGCKPLLFTYNSLIKCLCKEGLFEDALSLIDHMQECSLLPDTTTYLIIIN 558
AL + KM SLGC+PLL TYNSLIKCL +E L EDA SLID MQE ++PD TYLI+++
Sbjct: 499 ALLFMDKMVSLGCRPLLSTYNSLIKCLFQERLVEDAKSLIDLMQENGIVPDLATYLIMVH 558
Query: 559 EHCRKGNVSSAHYIHRKMRQRGLKPSVAIYDSIIGCLSRKKRIFEVKGVFKKMLKAGVDP 618
EHC G+++SA + +M +RGLKPSVAIYDSIIGCLSR+KRI E + VFK ML+AGVDP
Sbjct: 559 EHCNHGDLASAFGLLDQMNERGLKPSVAIYDSIIGCLSRRKRILEAENVFKMMLEAGVDP 618
Query: 619 DKNLYLTMINGYGKNGKLLEARKLFEQMVENSIPPSSHIYTALISGLVKKNMTDQGCLYL 678
D +Y+TMI+GY KN + +EAR+LF++M+E+ PSSH YTA+ISGLVK+NM D+GC YL
Sbjct: 619 DAIIYVTMISGYSKNRRAIEARQLFDKMIEHGFQPSSHSYTAVISGLVKENMIDKGCSYL 678
Query: 679 GKMLRDGFSPNSVLYSSLINHYLKIGEVEYAFRLVDLMERSHIEPDVIFYITLVSGICKN 738
ML+DGF PN+VLY+SLIN +L+ GE+E+AFRLVDLM+R+ IE D+I I LVSG+ +N
Sbjct: 679 SDMLKDGFVPNTVLYTSLINQFLRKGELEFAFRLVDLMDRNQIECDMITCIALVSGVSRN 738
Query: 739 LIVDKKKWFLLEKENQKAKSTLFRMLHETTLVPRDNNMIVSANSTEEMKSLALKLIQKVK 798
+ +++W+ ++ + + + L +LH++ ++PR+NN+ S ++K AL L+QK+K
Sbjct: 739 ITPVRRRWYHVKSGSARVREILLHLLHQSFVIPRENNLSFPRGSPRKIKYFALNLMQKIK 798
Query: 799 DVCIVPNLHLYNSIICGYCRTDRMLDANHQLELMQKEGLHPNQVTFTILMD-----GDVN 858
+PNL+LYN II G+CR + + DA + ELMQ EG+ PNQVTFTIL++ G+++
Sbjct: 799 GSSFMPNLYLYNGIISGFCRANMIQDAYNHFELMQTEGVCPNQVTFTILINGHTRFGEID 858
Query: 859 SAIGLFNKMNVDGCIPDEVAYNTLLKGLSQGGRLSDALAL 893
AIGLFNKMN DG PD + YN L+KGL + GRL DAL++
Sbjct: 859 HAIGLFNKMNADGLAPDGITYNALIKGLCKAGRLLDALSV 897
BLAST of CmoCh16G002000 vs. TrEMBL
Match:
A0A0B0MFC3_GOSAR (Uncharacterized protein OS=Gossypium arboreum GN=F383_22354 PE=4 SV=1)
HSP 1 Score: 959.5 bits (2479), Expect = 2.9e-276
Identity = 479/883 (54.25%), Postives = 642/883 (72.71%), Query Frame = 1
Query: 16 RNLVTTCTVPLDPPVTSSSSSASEHKTLCYSLVEQLIRRGLFLPAQQVIQRIVTQSSSIS 75
R + T+ +PLDP + SS ++H +LC S EQLI RGL A+++ QR+V+ SS +S
Sbjct: 17 RAVTTSAALPLDPSYATISSIPADHFSLCLSFSEQLINRGLLSSARKLFQRVVSNSSPVS 76
Query: 76 EAISIVDFAAERGLELDLDTHGVFWRQLVYS-RPQLAELLYDKKFTFRGAEPDASVLDSM 135
+A+S VDF RGL+LDL T+ V ++LV S LA Y RG PD+S+ +S+
Sbjct: 77 DALSTVDFVTSRGLDLDLSTYAVLIKKLVQSGHLLLAYSFYSDYIIGRGIIPDSSIANSI 136
Query: 136 VICFCRLGKFEKALAYFNQLLSLNYVPSKTSFNAIFRELCAQERVLEAFDYFVRVNGGGV 195
VIC C+LGK E+A F++L++ N K +FNA+ R LC+QER L+AFDYF+++ V
Sbjct: 137 VICLCKLGKLEEATILFDRLVTDNSC-EKPAFNALVRLLCSQERFLDAFDYFIKMININV 196
Query: 196 HLGYWCFNVLIDGLCNKGHMEEALELFDIMQNTNGYPPSLHLFKSLFYGLCKRKWLVEAE 255
+LG W +NVLIDGLC KG++EEA+++FD+M P+LHL+KSLFYGLC++ W+VEAE
Sbjct: 197 NLGCWYYNVLIDGLCQKGYLEEAIQMFDLMPERTESLPTLHLYKSLFYGLCRQGWVVEAE 256
Query: 256 LLIREMEFRSLYPDKTMYTSLVHEYCKDKKMKMAMQAFFRMIKIGCEPDNYTLNTLIHGF 315
L ++E + + DKTMYTSL++ YCK +KMKMA++ ++RM+K GC PD+YT NTLIHGF
Sbjct: 257 SLFGKIESQGFFVDKTMYTSLINVYCKGRKMKMALRVYYRMLKTGCRPDSYTYNTLIHGF 316
Query: 316 VKLGLVDKGWLVYNLMAEWGIQPDVVTFHIMISQYCQEGKVDFALTILNNMVSCNFSPSL 375
VK+GL D GW+++N M G+QP VVTFH+MIS YC+EGKVD A +LNNM+S N +P+
Sbjct: 317 VKMGLFDYGWVLFNQMMGQGLQPSVVTFHVMISNYCREGKVDCASMLLNNMISKNLAPNA 376
Query: 376 HCYTVLINALHRDDRLEEVSELLRSILDNGIVPDHVLFFTLMKMYPKGHELQLALNFLEA 435
HCYTVLI +L++++R+ E E +L+ G+VPDHVLFF LMKMYPKG+EL +A L+A
Sbjct: 377 HCYTVLITSLYKENRITEAEEFYERMLNGGLVPDHVLFFKLMKMYPKGYELDIAFMVLKA 436
Query: 436 ILKNGCGCDPSVILASTKLQTSSNLEQKIETLLQEIFNSNLNLAGVAFSIVICALCETEN 495
I NGCG DP ++ S + LEQKI L++EI SNL+LA VAF+++I ALCE
Sbjct: 437 IALNGCGFDPLLLPVS----ANEELEQKIVILIEEILKSNLHLAKVAFNVLISALCEQAQ 496
Query: 496 LDCALDYFHKMASLGCKPLLFTYNSLIKCLCKEGLFEDALSLIDHMQECSLLPDTTTYLI 555
D A + KM SLGC PLLFTYNSLIKCL ++GLFEDA SL++ MQ + PD T LI
Sbjct: 497 QDSASYFMDKMESLGCMPLLFTYNSLIKCLSQKGLFEDAESLLNRMQAQGIFPDQATCLI 556
Query: 556 IINEHCRKGNVSSAHYIHRKMRQRGLKPSVAIYDSIIGCLSRKKRIFEVKGVFKKMLKAG 615
IINEHC+ GN++ A I +M RG+KP VAIYD II L RKK++ E K +F +MLK+G
Sbjct: 557 IINEHCKHGNLAPAFDILDQMEDRGMKPGVAIYDCIIRSLFRKKKVSEAKDMFVRMLKSG 616
Query: 616 VDPDKNLYLTMINGYGKNGKLLEARKLFEQMVENSIPPSSHIYTALISGLVKKNMTDQGC 675
VDPD+ +YLTMING+ NG+++EAR+LF +M+E +I P+SH YTALISGLVKK+MTD+GC
Sbjct: 617 VDPDEIIYLTMINGFSNNGRVIEARRLFHEMIEAAIRPTSHSYTALISGLVKKDMTDKGC 676
Query: 676 LYLGKMLRDGFSPNSVLYSSLINHYLKIGEVEYAFRLVDLMERSHIEPDVIFYITLVSGI 735
+YL KML DG PN+VLY+SLIN++L+ GE E+AFRLVDLM+R+ IE D+I YI+LVS
Sbjct: 677 MYLEKMLDDGLVPNAVLYTSLINNFLQKGEFEFAFRLVDLMDRNQIELDLISYISLVSRF 736
Query: 736 CKNLIVDKKKWFLLEKENQKAKSTLFRMLHETTLVPRDNNMIVSANSTEEMKSLALKLIQ 795
++ I +K+WF + + +++A+ LF++LH +L+P++ N+ VS +S E MK ALKLIQ
Sbjct: 737 YRS-ISSRKRWFAMRRGSERAREKLFQLLHRQSLLPKEKNLRVSDSSPEAMKCFALKLIQ 796
Query: 796 KVKDVCIVPNLHLYNSIICGYCRTDRMLDANHQLELMQKEGLHPNQVTFTILMD-----G 855
KVK +PNL+LYN II G+C DRM DA ELMQKEG+ PNQVTFTILM G
Sbjct: 797 KVKQTRFMPNLYLYNVIISGFCEADRMQDAYDHFELMQKEGVLPNQVTFTILMGGHIKAG 856
Query: 856 DVNSAIGLFNKMNVDGCIPDEVAYNTLLKGLSQGGRLSDALAL 893
+++ AIGLFNKMN DGC PD + Y L+ GL Q RL +AL+L
Sbjct: 857 EIDHAIGLFNKMNADGCTPDGIVYKILVNGLCQASRLLEALSL 893
BLAST of CmoCh16G002000 vs. TrEMBL
Match:
A0A0D2QJ46_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_003G114200 PE=4 SV=1)
HSP 1 Score: 958.0 bits (2475), Expect = 8.4e-276
Identity = 479/883 (54.25%), Postives = 640/883 (72.48%), Query Frame = 1
Query: 16 RNLVTTCTVPLDPPVTSSSSSASEHKTLCYSLVEQLIRRGLFLPAQQVIQRIVTQSSSIS 75
R + T+ +PLDP + SS ++H +LC S EQLI RGL A+++ QR+V+ SS +S
Sbjct: 17 RAVTTSAALPLDPSYATVSSIPADHFSLCLSFSEQLINRGLLSSARKLFQRVVSNSSPVS 76
Query: 76 EAISIVDFAAERGLELDLDTHGVFWRQLVYS-RPQLAELLYDKKFTFRGAEPDASVLDSM 135
+A+S VDF RGL+LDL T+ V ++LV S LA Y RG PD+S+ +S+
Sbjct: 77 DALSTVDFVTSRGLDLDLSTYAVLIKKLVQSGHLPLAYSFYSDYIIGRGIIPDSSIANSI 136
Query: 136 VICFCRLGKFEKALAYFNQLLSLNYVPSKTSFNAIFRELCAQERVLEAFDYFVRVNGGGV 195
VIC C+LGK E+A F++L++ N K +FNA+ R LC+QER L+AFDYF+++ V
Sbjct: 137 VICLCKLGKLEEATILFDRLVTDNSC-EKPAFNALVRLLCSQERFLDAFDYFIKMININV 196
Query: 196 HLGYWCFNVLIDGLCNKGHMEEALELFDIMQNTNGYPPSLHLFKSLFYGLCKRKWLVEAE 255
+LG W +N+LIDGLC KG++EEA+++FD+M P+LHL+KSLFYGLCK+ W+VEAE
Sbjct: 197 NLGCWYYNMLIDGLCQKGYLEEAIQMFDLMPERTESLPTLHLYKSLFYGLCKQGWVVEAE 256
Query: 256 LLIREMEFRSLYPDKTMYTSLVHEYCKDKKMKMAMQAFFRMIKIGCEPDNYTLNTLIHGF 315
L +ME + + DKTMYTSL++ YCK +KMKMA++ ++RM+K+GC PD+YT NTLIHGF
Sbjct: 257 SLFGKMESQGFFVDKTMYTSLINVYCKGRKMKMALRVYYRMLKMGCRPDSYTYNTLIHGF 316
Query: 316 VKLGLVDKGWLVYNLMAEWGIQPDVVTFHIMISQYCQEGKVDFALTILNNMVSCNFSPSL 375
VK+GL D GW+++N M E G+QP VVTFH+MIS YC+EGKVD A +LNNM+S N +P+
Sbjct: 317 VKMGLFDYGWVLFNQMMEQGLQPSVVTFHVMISNYCREGKVDCASMLLNNMISKNLAPNA 376
Query: 376 HCYTVLINALHRDDRLEEVSELLRSILDNGIVPDHVLFFTLMKMYPKGHELQLALNFLEA 435
HCYTVLI +L +++R+ E E +L+ G+VPDHVLFF LMKMYPKG+EL +A L+A
Sbjct: 377 HCYTVLITSLCKENRIMEAEEFYERMLNGGLVPDHVLFFKLMKMYPKGYELDIAFMVLKA 436
Query: 436 ILKNGCGCDPSVILASTKLQTSSNLEQKIETLLQEIFNSNLNLAGVAFSIVICALCETEN 495
I NGCG DP ++ S + LEQKI L++EI SNL+LA VAF+I+I ALCE
Sbjct: 437 IALNGCGFDPLLLPVS----ANEELEQKIVILIEEILKSNLHLAKVAFNILISALCEQAQ 496
Query: 496 LDCALDYFHKMASLGCKPLLFTYNSLIKCLCKEGLFEDALSLIDHMQECSLLPDTTTYLI 555
D AL + KM SLGC PLLFTYNSLIKCL ++ LFEDA SL++ MQ + PD T LI
Sbjct: 497 QDSALHFMDKMESLGCMPLLFTYNSLIKCLSQKSLFEDAESLLNRMQAQGIFPDQATCLI 556
Query: 556 IINEHCRKGNVSSAHYIHRKMRQRGLKPSVAIYDSIIGCLSRKKRIFEVKGVFKKMLKAG 615
IINEHC+ GN+ A I +M RG+KP VAIYD IIG L R+K++ E +F +ML++G
Sbjct: 557 IINEHCKHGNLEPAFDILDQMEDRGMKPGVAIYDCIIGSLFRQKKVSEATAMFIRMLESG 616
Query: 616 VDPDKNLYLTMINGYGKNGKLLEARKLFEQMVENSIPPSSHIYTALISGLVKKNMTDQGC 675
VDPD+ +YLTMING+ NG+++EA +LF +M+ +I P+SH YTALISGLVKKNMTD+GC
Sbjct: 617 VDPDEIIYLTMINGFSNNGRVIEADQLFHEMIGAAIRPTSHSYTALISGLVKKNMTDKGC 676
Query: 676 LYLGKMLRDGFSPNSVLYSSLINHYLKIGEVEYAFRLVDLMERSHIEPDVIFYITLVSGI 735
YL KML DG PN+VLY+SLI+++L+ E E+AFRLVDLM+R+ IE D+IFYI+LVSG
Sbjct: 677 TYLEKMLDDGLVPNAVLYTSLISNFLQKREFEFAFRLVDLMDRNQIERDLIFYISLVSGF 736
Query: 736 CKNLIVDKKKWFLLEKENQKAKSTLFRMLHETTLVPRDNNMIVSANSTEEMKSLALKLIQ 795
++ I +K+WF + + +++A+ LF++LH +L+P++ N+ VS +S E MK ALKLIQ
Sbjct: 737 YRS-ISSRKRWFSMRRGSERAREKLFQLLHRQSLLPKEKNLRVSDSSPEAMKCFALKLIQ 796
Query: 796 KVKDVCIVPNLHLYNSIICGYCRTDRMLDANHQLELMQKEGLHPNQVTFTILMD-----G 855
KVK +PNL+LYN II G+C DRM DA ELMQKEG+ PNQVTFTILM G
Sbjct: 797 KVKQTRFMPNLYLYNGIISGFCEADRMQDAYDHFELMQKEGVLPNQVTFTILMGGHIKAG 856
Query: 856 DVNSAIGLFNKMNVDGCIPDEVAYNTLLKGLSQGGRLSDALAL 893
+++ AIGLFNKMN DGC PD + Y L+ GL Q RL +AL+L
Sbjct: 857 EIDHAIGLFNKMNADGCTPDGIVYKILVNGLCQASRLLEALSL 893
BLAST of CmoCh16G002000 vs. TAIR10
Match:
AT5G62370.1 (AT5G62370.1 Tetratricopeptide repeat (TPR)-like superfamily protein)
HSP 1 Score: 720.7 bits (1859), Expect = 1.1e-207
Identity = 394/884 (44.57%), Postives = 559/884 (63.24%), Query Frame = 1
Query: 20 TTCTVPLDP-PVTSS---SSSASEHKTLCYSLVEQLIRRGLFLPAQQVIQRIVTQSSSIS 79
TTC + + P TS+ S+++ +H++ C SL+ +L RRGL A++VI+R++ SSSIS
Sbjct: 18 TTCALSSELFPSTSAAVFSAASGDHRSRCLSLIVKLGRRGLLDSAREVIRRVIDGSSSIS 77
Query: 80 EAISIVDFAAERGLELDLDTHGVFWRQLV-YSRPQLAELLYDKKFTFRGAEPDASVLDSM 139
EA + DFA + G+ELD +G R+L +P +AE Y+++ G PD+SVLDSM
Sbjct: 78 EAALVADFAVDNGIELDSSCYGALIRKLTEMGQPGVAETFYNQRVIGNGIVPDSSVLDSM 137
Query: 140 VICFCRLGKFEKALAYFNQLLSLNYVPSKTSFNAIFRELCAQERVLEAFDYFVRVNGGGV 199
V C +L +F++A A+ +++++ Y PS+ S + + ELC Q+R LEAF F +V G
Sbjct: 138 VFCLVKLRRFDEARAHLDRIIASGYAPSRNSSSLVVDELCNQDRFLEAFHCFEQVKERGS 197
Query: 200 HLGYWCFNVLIDGLCNKGHMEEALELFDIMQNTNGYPPSLHLFKSLFYGLCKRKWLVEAE 259
L WC L GLC GH+ EA+ + D + P ++L+KSLFY CKR EAE
Sbjct: 198 GLWLWCCKRLFKGLCGHGHLNEAIGMLDTLCGMTRMPLPVNLYKSLFYCFCKRGCAAEAE 257
Query: 260 LLIREMEFRSLYPDKTMYTSLVHEYCKDKKMKMAMQAFFRMIKIGCEPDNYTLNTLIHGF 319
L ME Y DK MYT L+ EYCKD M MAM+ + RM++ E D NTLIHGF
Sbjct: 258 ALFDHMEVDGYYVDKVMYTCLMKEYCKDNNMTMAMRLYLRMVERSFELDPCIFNTLIHGF 317
Query: 320 VKLGLVDKGWLVYNLMAEWGIQPDVVTFHIMISQYCQEGKVDFALTI-LNNMVSCNFSPS 379
+KLG++DKG ++++ M + G+Q +V T+HIMI YC+EG VD+AL + +NN S + S +
Sbjct: 318 MKLGMLDKGRVMFSQMIKKGVQSNVFTYHIMIGSYCKEGNVDYALRLFVNNTGSEDISRN 377
Query: 380 LHCYTVLINALHRDDRLEEVSELLRSILDNGIVPDHVLFFTLMKMYPKGHELQLALNFLE 439
+HCYT LI ++ +++ +LL +LDNGIVPDH+ +F L+KM PK HEL+ A+ L+
Sbjct: 378 VHCYTNLIFGFYKKGGMDKAVDLLMRMLDNGIVPDHITYFVLLKMLPKCHELKYAMVILQ 437
Query: 440 AILKNGCGCDPSVILASTKLQTSSNLEQKIETLLQEIFNSNLNLAGVAFSIVICALCETE 499
+IL NGCG +P VI N+E K+E+LL EI + NLA V ++V ALC
Sbjct: 438 SILDNGCGINPPVI------DDLGNIEVKVESLLGEIARKDANLAAVGLAVVTTALCSQR 497
Query: 500 NLDCALDYFHKMASLGCKPLLFTYNSLIKCLCKEGLFEDALSLIDHMQECSLLPDTTTYL 559
N AL KM +LGC PL F+YNS+IKCL +E + ED SL++ +QE +PD TYL
Sbjct: 498 NYIAALSRIEKMVNLGCTPLPFSYNSVIKCLFQENIIEDLASLVNIIQELDFVPDVDTYL 557
Query: 560 IIINEHCRKGNVSSAHYIHRKMRQRGLKPSVAIYDSIIGCLSRKKRIFEVKGVFKKMLKA 619
I++NE C+K + +A I M + GL+P+VAIY SIIG L ++ R+ E + F KML++
Sbjct: 558 IVVNELCKKNDRDAAFAIIDAMEELGLRPTVAIYSSIIGSLGKQGRVVEAEETFAKMLES 617
Query: 620 GVDPDKNLYLTMINGYGKNGKLLEARKLFEQMVENSIPPSSHIYTALISGLVKKNMTDQG 679
G+ PD+ Y+ MIN Y +NG++ EA +L E++V++ + PSS YT LISG VK M ++G
Sbjct: 618 GIQPDEIAYMIMINTYARNGRIDEANELVEEVVKHFLRPSSFTYTVLISGFVKMGMMEKG 677
Query: 680 CLYLGKMLRDGFSPNSVLYSSLINHYLKIGEVEYAFRLVDLMERSHIEPDVIFYITLVSG 739
C YL KML DG SPN VLY++LI H+LK G+ +++F L LM + I+ D I YITL+SG
Sbjct: 678 CQYLDKMLEDGLSPNVVLYTALIGHFLKKGDFKFSFTLFGLMGENDIKHDHIAYITLLSG 737
Query: 740 ICKNLIVDKKKWFLLEKENQKAKSTLFRMLHETTLVPRDNNMIVSANSTEEMKSLALKLI 799
+ + + KK+ ++E +K L R++ LV I S+ KS A+++I
Sbjct: 738 LWRAMARKKKRQVIVEPGKEKL---LQRLIRTKPLVS-----IPSSLGNYGSKSFAMEVI 797
Query: 800 QKVKDVCIVPNLHLYNSIICGYCRTDRMLDANHQLELMQKEGLHPNQVTFTILMD----- 859
KVK I+PNL+L+N+II GYC R+ +A + LE MQKEG+ PN VT+TILM
Sbjct: 798 GKVKK-SIIPNLYLHNTIITGYCAAGRLDEAYNHLESMQKEGIVPNLVTYTILMKSHIEA 857
Query: 860 GDVNSAIGLFNKMNVDGCIPDEVAYNTLLKGLSQGGRLSDALAL 893
GD+ SAI LF N C PD+V Y+TLLKGL R DALAL
Sbjct: 858 GDIESAIDLFEGTN---CEPDQVMYSTLLKGLCDFKRPLDALAL 883
BLAST of CmoCh16G002000 vs. TAIR10
Match:
AT5G59900.1 (AT5G59900.1 Pentatricopeptide repeat (PPR) superfamily protein)
HSP 1 Score: 241.1 bits (614), Expect = 2.7e-63
Identity = 188/742 (25.34%), Postives = 343/742 (46.23%), Query Frame = 1
Query: 162 SKTSFNAIFRELCAQERVLEAFDYF-VRVNGGGVHLGYWCFNVLIDGLCNKGHMEEALEL 221
S +SF+ + + RVL+ F + + + + L+ GL H A+EL
Sbjct: 155 SSSSFDLLIQHYVRSRRVLDGVLVFKMMITKVSLLPEVRTLSALLHGLVKFRHFGLAMEL 214
Query: 222 FDIMQNTNGYPPSLHLFKSLFYGLCKRKWLVEAELLIREMEFRSLYPDKTMYTSLVHEYC 281
F+ M + G P ++++ + LC+ K L A+ +I ME + Y L+ C
Sbjct: 215 FNDMVSV-GIRPDVYIYTGVIRSLCELKDLSRAKEMIAHMEATGCDVNIVPYNVLIDGLC 274
Query: 282 KDKKMKMAMQAFFRMIKIGCEPDNYTLNTLIHGFVKLGLVDKGWLVYNLMAEWGIQPDVV 341
K +K+ A+ + +PD T TL++G K+ + G + + M P
Sbjct: 275 KKQKVWEAVGIKKDLAGKDLKPDVVTYCTLVYGLCKVQEFEIGLEMMDEMLCLRFSPSEA 334
Query: 342 TFHIMISQYCQEGKVDFALTILNNMVSCNFSPSLHCYTVLINALHRDDRLEEVSELLRSI 401
++ + GK++ AL ++ +V SP+L Y LI++L + + E L +
Sbjct: 335 AVSSLVEGLRKRGKIEEALNLVKRVVDFGVSPNLFVYNALIDSLCKGRKFHEAELLFDRM 394
Query: 402 LDNGIVPDHVLFFTLMKMYPKGHELQLALNFLEAILKNGCGCDP----SVILASTKLQTS 461
G+ P+ V + L+ M+ + +L AL+FL ++ G S+I K
Sbjct: 395 GKIGLRPNDVTYSILIDMFCRRGKLDTALSFLGEMVDTGLKLSVYPYNSLINGHCKFGDI 454
Query: 462 SNLEQKIETLLQEIFNSNLNLAGVAFSIVICALCETENLDCALDYFHKMASLGCKPLLFT 521
S E + E+ N L V ++ ++ C ++ AL +H+M G P ++T
Sbjct: 455 S----AAEGFMAEMINKKLEPTVVTYTSLMGGYCSKGKINKALRLYHEMTGKGIAPSIYT 514
Query: 522 YNSLIKCLCKEGLFEDALSLIDHMQECSLLPDTTTYLIIINEHCRKGNVSSAHYIHRKMR 581
+ +L+ L + GL DA+ L + M E ++ P+ TY ++I +C +G++S A ++M
Sbjct: 515 FTTLLSGLFRAGLIRDAVKLFNEMAEWNVKPNRVTYNVMIEGYCEEGDMSKAFEFLKEMT 574
Query: 582 QRGLKPSVAIYDSIIGCLSRKKRIFEVKGVFKKMLKAGVDPDKNLYLTMINGYGKNGKLL 641
++G+ P Y +I L + E K + K + ++ Y +++G+ + GKL
Sbjct: 575 EKGIVPDTYSYRPLIHGLCLTGQASEAKVFVDGLHKGNCELNEICYTGLLHGFCREGKLE 634
Query: 642 EARKLFEQMVENSIPPSSHIYTALISGLVKKNMTDQGCLYLGKMLRDGFSPNSVLYSSLI 701
EA + ++MV+ + Y LI G +K L +M G P+ V+Y+S+I
Sbjct: 635 EALSVCQEMVQRGVDLDLVCYGVLIDGSLKHKDRKLFFGLLKEMHDRGLKPDDVIYTSMI 694
Query: 702 NHYLKIGEVEYAFRLVDLMERSHIEPDVIFYITLVSGICKNLIVDKKKWFLLEKENQKAK 761
+ K G+ + AF + DLM P+ + Y +++G+CK V++ + +L + Q
Sbjct: 695 DAKSKTGDFKEAFGIWDLMINEGCVPNEVTYTAVINGLCKAGFVNEAE--VLCSKMQPVS 754
Query: 762 STLFRMLHETTLVPRDNNMIVSANSTEEMKSLAL-KLIQKVKDVCIVPNLHLYNSIICGY 821
S ++ + L I++ + K++ L I K ++ N YN +I G+
Sbjct: 755 SVPNQVTYGCFL------DILTKGEVDMQKAVELHNAILK----GLLANTATYNMLIRGF 814
Query: 822 CRTDRMLDANHQLELMQKEGLHPNQVTFTILMD-----GDVNSAIGLFNKMNVDGCIPDE 881
CR R+ +A+ + M +G+ P+ +T+T +++ DV AI L+N M G PD
Sbjct: 815 CRQGRIEEASELITRMIGDGVSPDCITYTTMINELCRRNDVKKAIELWNSMTEKGIRPDR 874
Query: 882 VAYNTLLKGLSQGGRLSDALAL 893
VAYNTL+ G G + A L
Sbjct: 875 VAYNTLIHGCCVAGEMGKATEL 879
BLAST of CmoCh16G002000 vs. TAIR10
Match:
AT5G65560.1 (AT5G65560.1 Pentatricopeptide repeat (PPR) superfamily protein)
HSP 1 Score: 228.8 bits (582), Expect = 1.4e-59
Identity = 171/706 (24.22%), Postives = 309/706 (43.77%), Query Frame = 1
Query: 200 CFNVLIDGLCNKGHMEEALELFDIMQNTNGYPPSLHLFKSLFYGLCKRKWLVEAELLIRE 259
C+N L++ L G ++E +++ M P+++ + + G CK + EA + +
Sbjct: 185 CYNTLLNSLARFGLVDEMKQVYMEMLEDK-VCPNIYTYNKMVNGYCKLGNVEEANQYVSK 244
Query: 260 MEFRSLYPDKTMYTSLVHEYCKDKKMKMAMQAFFRMIKIGCEPDNYTLNTLIHGFVKLGL 319
+ L PD YTSL+ YC+ K + A + F M GC + LIHG
Sbjct: 245 IVEAGLDPDFFTYTSLIMGYCQRKDLDSAFKVFNEMPLKGCRRNEVAYTHLIHGLCVARR 304
Query: 320 VDKGWLVYNLMAEWGIQPDVVTFHIMISQYCQEGKVDFALTILNNMVSCNFSPSLHCYTV 379
+D+ ++ M + P V T+ ++I C + AL ++ M P++H YTV
Sbjct: 305 IDEAMDLFVKMKDDECFPTVRTYTVLIKSLCGSERKSEALNLVKEMEETGIKPNIHTYTV 364
Query: 380 LINALHRDDRLEEVSELLRSILDNGIVPDHVLFFTLMKMYPKGHELQLALNFLEAILKNG 439
LI++L + E+ ELL +L+ G++P+ + + L+ Y K ++ A++ +E +
Sbjct: 365 LIDSLCSQCKFEKARELLGQMLEKGLMPNVITYNALINGYCKRGMIEDAVDVVELMESRK 424
Query: 440 CGCDPSVILASTKLQTSSNLEQKIETLLQEIFNSNLNLAGVAFSIVICALCETENLDCAL 499
+ K SN+ K +L ++ + V ++ +I C + N D A
Sbjct: 425 LSPNTRTYNELIKGYCKSNVH-KAMGVLNKMLERKVLPDVVTYNSLIDGQCRSGNFDSAY 484
Query: 500 DYFHKMASLGCKPLLFTYNSLIKCLCKEGLFEDALSLIDHMQECSLLPDTTTYLIIINEH 559
M G P +TY S+I LCK E+A L D +++ + P+ Y +I+ +
Sbjct: 485 RLLSLMNDRGLVPDQWTYTSMIDSLCKSKRVEEACDLFDSLEQKGVNPNVVMYTALIDGY 544
Query: 560 CRKGNVSSAHYIHRKMRQRGLKPSVAIYDSIIGCLSRKKRIFEVKGVFKKMLKAGVDPDK 619
C+ G V AH + KM + P+ ++++I L ++ E + +KM+K G+ P
Sbjct: 545 CKAGKVDEAHLMLEKMLSKNCLPNSLTFNALIHGLCADGKLKEATLLEEKMVKIGLQPTV 604
Query: 620 NLYLTMINGYGKNGKLLEARKLFEQMVENSIPPSSHIYTALISGLVKKNMTDQGCLYLGK 679
+ +I+ K+G A F+QM+ + P +H YT I ++ + K
Sbjct: 605 STDTILIHRLLKDGDFDHAYSRFQQMLSSGTKPDAHTYTTFIQTYCREGRLLDAEDMMAK 664
Query: 680 MLRDGFSPNSVLYSSLINHYLKIGEVEYAFRLVDLMERSHIEPDVIFYITLVSGICKNLI 739
M +G SP+ YSSLI Y +G+ +AF ++ M + EP +++L+
Sbjct: 665 MRENGVSPDLFTYSSLIKGYGDLGQTNFAFDVLKRMRDTGCEPSQHTFLSLIK------- 724
Query: 740 VDKKKWFLLEKENQKAKSTLFRMLHETTLVPRDNNMIVSANSTEEMKSLALKLIQKVKDV 799
LLE + K K + E L N M ++L++K+ +
Sbjct: 725 ------HLLEMKYGKQKGS------EPELCAMSNMMEFDT---------VVELLEKMVEH 784
Query: 800 CIVPNLHLYNSIICGYCRTDRMLDANHQLELMQK-EGLHPNQVTFTILMD-----GDVNS 859
+ PN Y +I G C + A + MQ+ EG+ P+++ F L+ N
Sbjct: 785 SVTPNAKSYEKLILGICEVGNLRVAEKVFDHMQRNEGISPSELVFNALLSCCCKLKKHNE 844
Query: 860 AIGLFNKMNVDGCIPDEVAYNTLLKGLSQGGRLSDALALHMFICSC 900
A + + M G +P + L+ GL + G ++ + C
Sbjct: 845 AAKVVDDMICVGHLPQLESCKVLICGLYKKGEKERGTSVFQNLLQC 860
BLAST of CmoCh16G002000 vs. TAIR10
Match:
AT5G61990.1 (AT5G61990.1 Pentatricopeptide repeat (PPR) superfamily protein)
HSP 1 Score: 228.8 bits (582), Expect = 1.4e-59
Identity = 186/780 (23.85%), Postives = 337/780 (43.21%), Query Frame = 1
Query: 121 FRGAEPDASVLDSMVICFCRLGKFEKALAYFNQLLSLNYVPSKTSFNAIFRELCAQERVL 180
F G D + + + G E+A+ F+ + L VP + + L R+
Sbjct: 144 FVGKSDDGVLFGILFDGYIAKGYIEEAVFVFSSSMGLELVPRLSRCKVLLDALLRWNRLD 203
Query: 181 EAFDYFVRVNGGGVHLGYWCFNVLIDGLCNKGHMEEALELFDIMQNTNGYPPSLHLFKSL 240
+D + + V +++LI C G+++ D++ T F++
Sbjct: 204 LFWDVYKGMVERNVVFDVKTYHMLIIAHCRAGNVQLGK---DVLFKTEKE------FRTA 263
Query: 241 FYGLCKRKWLVEAELLIRE-MEFRSLYPDKTMYTSLVHEYCKDKKMKMAMQAFFRMIKIG 300
V+ L ++E M + L P K Y L+ CK K+++ A M +G
Sbjct: 264 TLN-------VDGALKLKESMICKGLVPLKYTYDVLIDGLCKIKRLEDAKSLLVEMDSLG 323
Query: 301 CEPDNYTLNTLIHGFVKLGLVDKGWLVYNLMAEWGIQPDVVTFHIMISQYCQEGKVDFAL 360
DN+T + LI G +K D + + M GI + I +EG ++ A
Sbjct: 324 VSLDNHTYSLLIDGLLKGRNADAAKGLVHEMVSHGINIKPYMYDCCICVMSKEGVMEKAK 383
Query: 361 TILNNMVSCNFSPSLHCYTVLINALHRDDRLEEVSELLRSILDNGIVPDHVLFFTLMKMY 420
+ + M++ P Y LI R+ + + ELL + IV + T++K
Sbjct: 384 ALFDGMIASGLIPQAQAYASLIEGYCREKNVRQGYELLVEMKKRNIVISPYTYGTVVKGM 443
Query: 421 PKGHELQLALNFLEAILKNGCGCDPSVILASTKLQTSSNLEQKIETL--LQEIFNSNLNL 480
+L A N ++ ++ +GC P+V++ +T ++T + + + L+E+ +
Sbjct: 444 CSSGDLDGAYNIVKEMIASGCR--PNVVIYTTLIKTFLQNSRFGDAMRVLKEMKEQGIAP 503
Query: 481 AGVAFSIVICALCETENLDCALDYFHKMASLGCKPLLFTYNSLIKCLCKEGLFEDALSLI 540
++ +I L + + +D A + +M G KP FTY + I + F A +
Sbjct: 504 DIFCYNSLIIGLSKAKRMDEARSFLVEMVENGLKPNAFTYGAFISGYIEASEFASADKYV 563
Query: 541 DHMQECSLLPDTTTYLIIINEHCRKGNVSSAHYIHRKMRQRGLKPSVAIYDSIIGCLSRK 600
M+EC +LP+ +INE+C+KG V A +R M +G+ Y ++ L +
Sbjct: 564 KEMRECGVLPNKVLCTGLINEYCKKGKVIEACSAYRSMVDQGILGDAKTYTVLMNGLFKN 623
Query: 601 KRIFEVKGVFKKMLKAGVDPDKNLYLTMINGYGKNGKLLEARKLFEQMVENSIPPSSHIY 660
++ + + +F++M G+ PD Y +ING+ K G + +A +F++MVE + P+ IY
Sbjct: 624 DKVDDAEEIFREMRGKGIAPDVFSYGVLINGFSKLGNMQKASSIFDEMVEEGLTPNVIIY 683
Query: 661 TALISGLVKKNMTDQGCLYLGKMLRDGFSPNSVLYSSLINHYLKIGEVEYAFRLVDLMER 720
L+ G + ++ L +M G PN+V Y ++I+ Y K G++ AFRL D M+
Sbjct: 684 NMLLGGFCRSGEIEKAKELLDEMSVKGLHPNAVTYCTIIDGYCKSGDLAEAFRLFDEMKL 743
Query: 721 SHIEPDVIFYITLVSGICKNLIVDKKKWFLLEKENQKAKSTLFRMLHETTLVPRDNNMIV 780
+ PD Y TLV G C+ + D ++ + N+K ++ T N +
Sbjct: 744 KGLVPDSFVYTTLVDGCCR--LNDVERAITIFGTNKKGCAS------STAPFNALINWVF 803
Query: 781 SANSTEEMKSLALKLIQKVKDVCIVPNLHLYNSIICGYCRTDRMLDANHQLELMQKEGLH 840
TE + +L+ D PN YN +I C+ + A MQ L
Sbjct: 804 KFGKTELKTEVLNRLMDGSFDRFGKPNDVTYNIMIDYLCKEGNLEAAKELFHQMQNANLM 863
Query: 841 PNQVTFTILMDGDVN-----SAIGLFNKMNVDGCIPDEVAYNTLLKGLSQGGRLSDALAL 893
P +T+T L++G +F++ G PD + Y+ ++ + G + AL L
Sbjct: 864 PTVITYTSLLNGYDKMGRRAEMFPVFDEAIAAGIEPDHIMYSVIINAFLKEGMTTKALVL 897
BLAST of CmoCh16G002000 vs. TAIR10
Match:
AT1G31840.1 (AT1G31840.1 Tetratricopeptide repeat (TPR)-like superfamily protein)
HSP 1 Score: 226.9 bits (577), Expect = 5.2e-59
Identity = 167/647 (25.81%), Postives = 291/647 (44.98%), Query Frame = 1
Query: 110 LAELLYDKKFTFRGAE-----------PDASVLDSMVICFCRLGKFEKALAYFNQLLSLN 169
+A+ ++D+ T RG + DA V ++ C CR G +KAL F L
Sbjct: 117 VADKVFDEMITNRGKDFNVLGSIRDRSLDADVCKFLMECCCRYGMVDKALEIFVYSTQLG 176
Query: 170 YVPSKTSFNAIFRELCAQERVLEAFDYFVRVNGGGVH-LGYWCFNVLIDGLCNKGHMEEA 229
V + S + L +RV D+F ++ GG+ G ++D L KG + +A
Sbjct: 177 VVIPQDSVYRMLNSLIGSDRVDLIADHFDKLCRGGIEPSGVSAHGFVLDALFCKGEVTKA 236
Query: 230 LELFDIMQNTNGYPPSLHLFKSLFYGLCKRKWLVEAELLIREMEFRSLYPDKTMYTSLVH 289
L+ ++ G+ + + GL + V + LL ++ P+ + +L++
Sbjct: 237 LDFHRLVME-RGFRVGIVSCNKVLKGLSVDQIEVASRLLSLVLDCGPA-PNVVTFCTLIN 296
Query: 290 EYCKDKKMKMAMQAFFRMIKIGCEPDNYTLNTLIHGFVKLGLVDKGWLVYNLMAEWGIQP 349
+CK +M A F M + G EPD +TLI G+ K G++ G +++ G++
Sbjct: 297 GFCKRGEMDRAFDLFKVMEQRGIEPDLIAYSTLIDGYFKAGMLGMGHKLFSQALHKGVKL 356
Query: 350 DVVTFHIMISQYCQEGKVDFALTILNNMVSCNFSPSLHCYTVLINALHRDDRLEEVSELL 409
DVV F I Y + G + A + M+ SP++ YT+LI L +D R+ E +
Sbjct: 357 DVVVFSSTIDVYVKSGDLATASVVYKRMLCQGISPNVVTYTILIKGLCQDGRIYEAFGMY 416
Query: 410 RSILDNGIVPDHVLFFTLMKMYPKGHELQLALNFLEAILKNGCGCDPSVILASTKLQTSS 469
IL G+ P V + +L+ + K L+ E ++K G P V++ + S
Sbjct: 417 GQILKRGMEPSIVTYSSLIDGFCKCGNLRSGFALYEDMIK--MGYPPDVVIYGVLVDGLS 476
Query: 470 NLEQKIETLLQEI--FNSNLNLAGVAFSIVICALCETENLDCALDYFHKMASLGCKPLLF 529
+ + + ++ L V F+ +I C D AL F M G KP +
Sbjct: 477 KQGLMLHAMRFSVKMLGQSIRLNVVVFNSLIDGWCRLNRFDEALKVFRLMGIYGIKPDVA 536
Query: 530 TYNSLIKCLCKEGLFEDALSLIDHMQECSLLPDTTTYLIIINEHCRKGNVSSAHYIHRKM 589
T+ ++++ EG E+AL L M + L PD Y +I+ C+ + + M
Sbjct: 537 TFTTVMRVSIMEGRLEEALFLFFRMFKMGLEPDALAYCTLIDAFCKHMKPTIGLQLFDLM 596
Query: 590 RQRGLKPSVAIYDSIIGCLSRKKRIFEVKGVFKKMLKAGVDPDKNLYLTMINGYGKNGKL 649
++ + +A+ + +I L + RI + F +++ ++PD Y TMI GY +L
Sbjct: 597 QRNKISADIAVCNVVIHLLFKCHRIEDASKFFNNLIEGKMEPDIVTYNTMICGYCSLRRL 656
Query: 650 LEARKLFEQMVENSIPPSSHIYTALISGLVKKNMTDQGCLYLGKMLRDGFSPNSVLYSSL 709
EA ++FE + P++ T LI L K N D M G PN+V Y L
Sbjct: 657 DEAERIFELLKVTPFGPNTVTLTILIHVLCKNNDMDGAIRMFSIMAEKGSKPNAVTYGCL 716
Query: 710 INHYLKIGEVEYAFRLVDLMERSHIEPDVIFYITLVSGICKNLIVDK 743
++ + K ++E +F+L + M+ I P ++ Y ++ G+CK VD+
Sbjct: 717 MDWFSKSVDIEGSFKLFEEMQEKGISPSIVSYSIIIDGLCKRGRVDE 759
BLAST of CmoCh16G002000 vs. NCBI nr
Match:
gi|659077232|ref|XP_008439096.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g62370 [Cucumis melo])
HSP 1 Score: 1155.6 bits (2988), Expect = 0.0e+00
Identity = 569/706 (80.59%), Postives = 631/706 (89.38%), Query Frame = 1
Query: 1 MIRGRP-CKYYLSVNFRNLVTTCTVPLDPPVTSSSSSASEHKTLCYSLVEQLIRRGLFLP 60
MIRGRP CKYYLS+NFRNLVTTCTVPLDPP TSS SSASEHK LC+SLVEQLIRRGLF
Sbjct: 1 MIRGRPSCKYYLSLNFRNLVTTCTVPLDPPTTSSFSSASEHKNLCFSLVEQLIRRGLFFQ 60
Query: 61 AQQVIQRIVTQSSSISEAISIVDFAAERGLELDLDTHGVFWRQLVYSRPQLAELLYDKKF 120
AQQVIQRIVTQSSSISEAISIV+FAAE GLELDL THG+ RQLVYS+PQL+E LY++KF
Sbjct: 61 AQQVIQRIVTQSSSISEAISIVNFAAEWGLELDLATHGLLCRQLVYSKPQLSEFLYNRKF 120
Query: 121 TFRGAEPDASVLDSMVICFCRLGKFEKALAYFNQLLSLNYVPSKTSFNAIFRELCAQERV 180
GAEPD +LDSMV CFCRLGKFE+AL++FN+LLSLNYVPSK SFNAIFRELCAQERV
Sbjct: 121 VVGGAEPDVLLLDSMVSCFCRLGKFEEALSHFNRLLSLNYVPSKVSFNAIFRELCAQERV 180
Query: 181 LEAFDYFVRVNGGGVHLGYWCFNVLIDGLCNKGHMEEALELFDIMQNTNGYPPSLHLFKS 240
LEAFDYFVRVNG G++LG WCFNVL+DGLCN+G M EALELFDIMQ+TNGYPP+LHLFK+
Sbjct: 181 LEAFDYFVRVNGAGIYLGCWCFNVLMDGLCNQGFMGEALELFDIMQSTNGYPPTLHLFKT 240
Query: 241 LFYGLCKRKWLVEAELLIREMEFRSLYPDKTMYTSLVHEYCKDKKMKMAMQAFFRMIKIG 300
LFYGLCK WL EAELLIREMEFRSLYPDKTMYTSL+H YC+DKKMKMAMQA FRM+KIG
Sbjct: 241 LFYGLCKSGWLGEAELLIREMEFRSLYPDKTMYTSLIHGYCRDKKMKMAMQALFRMVKIG 300
Query: 301 CEPDNYTLNTLIHGFVKLGLVDKGWLVYNLMAEWGIQPDVVTFHIMISQYCQEGKVDFAL 360
C+PD +TLN+LIHGF KLGLV+KGWLVY LM +WGIQPDVVTFHIMI +YCQ GKVD AL
Sbjct: 301 CKPDTFTLNSLIHGFAKLGLVEKGWLVYKLMEDWGIQPDVVTFHIMIVKYCQVGKVDSAL 360
Query: 361 TILNNMVSCNFSPSLHCYTVLINALHRDDRLEEVSELLRSILDNGIVPDHVLFFTLMKMY 420
ILN+MVS N SPS+HCYTVL +AL+R+ RLEEV+ LL+S+LDNGI+PDHVLF TLMKMY
Sbjct: 361 MILNSMVSSNLSPSVHCYTVLSSALYRNGRLEEVNGLLKSMLDNGIIPDHVLFLTLMKMY 420
Query: 421 PKGHELQLALNFLEAILKNGCGCDPSVILASTKLQTSSNLEQKIETLLQEIFNSNLNLAG 480
PKGHELQLALN LE I+KN GCDPSVILAST+ QTSSNLEQKIE LL+EI NS+LNLA
Sbjct: 421 PKGHELQLALNILETIVKNERGCDPSVILASTEWQTSSNLEQKIEILLKEISNSDLNLAA 480
Query: 481 VAFSIVICALCETENLDCALDYFHKMASLGCKPLLFTYNSLIKCLCKEGLFEDALSLIDH 540
VAFSIVICALCETEN ALDY H M SLGCKPLLFTYNSLI+ LCKE LFEDA+SLIDH
Sbjct: 481 VAFSIVICALCETENFGYALDYLHDMVSLGCKPLLFTYNSLIRRLCKERLFEDAMSLIDH 540
Query: 541 MQECSLLPDTTTYLIIINEHCRKGNVSSAHYIHRKMRQRGLKPSVAIYDSIIGCLSRKKR 600
M++ SL P+TTTYLII+NE+CR+GNV++A+Y RKMRQ GLKPSVAIYDSII CLSR+KR
Sbjct: 541 MKDYSLFPNTTTYLIIVNEYCRQGNVTAAYYTLRKMRQGGLKPSVAIYDSIIRCLSREKR 600
Query: 601 IFEVKGVFKKMLKAGVDPDKNLYLTMINGYGKNGKLLEARKLFEQMVENSIPPSSHIYTA 660
IFE + VFK ML+AGVDPDK Y TMINGY KNG++LEA +LFEQMVENS+PPSSHIYTA
Sbjct: 601 IFEAEVVFKMMLEAGVDPDKKFYSTMINGYSKNGRILEACELFEQMVENSVPPSSHIYTA 660
Query: 661 LISGLVKKNMTDQGCLYLGKMLRDGFSPNSVLYSSLINHYLKIGEV 706
LI GLV KNMTD+GCLYLGKMLRDGF PN VLYSSLINHYLK+GEV
Sbjct: 661 LIRGLVMKNMTDKGCLYLGKMLRDGFLPNVVLYSSLINHYLKVGEV 706
BLAST of CmoCh16G002000 vs. NCBI nr
Match:
gi|778679316|ref|XP_004148164.2| (PREDICTED: pentatricopeptide repeat-containing protein At5g62370 [Cucumis sativus])
HSP 1 Score: 1134.0 bits (2932), Expect = 0.0e+00
Identity = 556/706 (78.75%), Postives = 626/706 (88.67%), Query Frame = 1
Query: 1 MIRGRP-CKYYLSVNFRNLVTTCTVPLDPPVTSSSSSASEHKTLCYSLVEQLIRRGLFLP 60
MIRGRP CKYYLS+NFRNLVTTCTVPLDPP TSS SSASEHK LC+SLVEQLIRRG F
Sbjct: 1 MIRGRPSCKYYLSMNFRNLVTTCTVPLDPPTTSSFSSASEHKNLCFSLVEQLIRRGFFFQ 60
Query: 61 AQQVIQRIVTQSSSISEAISIVDFAAERGLELDLDTHGVFWRQLVYSRPQLAELLYDKKF 120
AQQVIQRIVTQSSSISEAISIV+FAAE GLELDL THG+ RQLV+S+PQL+E LY++KF
Sbjct: 61 AQQVIQRIVTQSSSISEAISIVNFAAEWGLELDLATHGLLCRQLVFSKPQLSEFLYNRKF 120
Query: 121 TFRGAEPDASVLDSMVICFCRLGKFEKALAYFNQLLSLNYVPSKTSFNAIFRELCAQERV 180
GAEPD +LDSMV CFCRLGKFE+AL++FN+LLSLNYVPSK SFNAIFRELCAQ RV
Sbjct: 121 VVGGAEPDVLLLDSMVSCFCRLGKFEEALSHFNRLLSLNYVPSKVSFNAIFRELCAQGRV 180
Query: 181 LEAFDYFVRVNGGGVHLGYWCFNVLIDGLCNKGHMEEALELFDIMQNTNGYPPSLHLFKS 240
LEAF+YFVRVNG G++LG WCFNVL+DGLCN+G M EALELFDIMQ+TNGYPP+LHLFK+
Sbjct: 181 LEAFNYFVRVNGAGIYLGCWCFNVLMDGLCNQGFMGEALELFDIMQSTNGYPPTLHLFKT 240
Query: 241 LFYGLCKRKWLVEAELLIREMEFRSLYPDKTMYTSLVHEYCKDKKMKMAMQAFFRMIKIG 300
LFYGLCK WLVEAELLIREMEFRSLYPDKTMYTSL+H YC+D+KMKMAMQA FRM+KIG
Sbjct: 241 LFYGLCKSGWLVEAELLIREMEFRSLYPDKTMYTSLIHGYCRDRKMKMAMQALFRMVKIG 300
Query: 301 CEPDNYTLNTLIHGFVKLGLVDKGWLVYNLMAEWGIQPDVVTFHIMISQYCQEGKVDFAL 360
C+PD +TLN+LIHGFVKLGLV+KGWLVY LM +WGIQPDVVTFHIMI +YCQEGKVD AL
Sbjct: 301 CKPDTFTLNSLIHGFVKLGLVEKGWLVYKLMEDWGIQPDVVTFHIMIGKYCQEGKVDSAL 360
Query: 361 TILNNMVSCNFSPSLHCYTVLINALHRDDRLEEVSELLRSILDNGIVPDHVLFFTLMKMY 420
ILN+MVS N SPS+HCYTVL +AL+R+ RLEEV L + +LDNGI+PDHVLF TLMKMY
Sbjct: 361 MILNSMVSSNLSPSVHCYTVLSSALYRNGRLEEVDGLFKGMLDNGIIPDHVLFLTLMKMY 420
Query: 421 PKGHELQLALNFLEAILKNGCGCDPSVILASTKLQTSSNLEQKIETLLQEIFNSNLNLAG 480
PKGHELQLALN LE I+KNGCGCDPSVILAS + QTSSNLEQK E +L+EI S+LNLAG
Sbjct: 421 PKGHELQLALNILETIVKNGCGCDPSVILASAEWQTSSNLEQKFEIVLKEISISDLNLAG 480
Query: 481 VAFSIVICALCETENLDCALDYFHKMASLGCKPLLFTYNSLIKCLCKEGLFEDALSLIDH 540
VAFSIVI ALCETEN ALDY H M SLGCKPLLFTYNSLI+ LCKE LFEDA+SLIDH
Sbjct: 481 VAFSIVISALCETENFCYALDYLHNMVSLGCKPLLFTYNSLIRRLCKERLFEDAMSLIDH 540
Query: 541 MQECSLLPDTTTYLIIINEHCRKGNVSSAHYIHRKMRQRGLKPSVAIYDSIIGCLSRKKR 600
M++ SL P+TTTYLII+NE+CR+GNV++A++I RKMRQ GLKPSVAIYDSII CLSR+KR
Sbjct: 541 MKDYSLFPNTTTYLIIVNEYCRQGNVTAAYHILRKMRQVGLKPSVAIYDSIIRCLSREKR 600
Query: 601 IFEVKGVFKKMLKAGVDPDKNLYLTMINGYGKNGKLLEARKLFEQMVENSIPPSSHIYTA 660
I E + VFK ML+AG+DPDK YLTMI GY KNG++LEA +LFEQMVENSIPPSSHIYTA
Sbjct: 601 ICEAEVVFKMMLEAGMDPDKKFYLTMIKGYSKNGRILEACELFEQMVENSIPPSSHIYTA 660
Query: 661 LISGLVKKNMTDQGCLYLGKMLRDGFSPNSVLYSSLINHYLKIGEV 706
LI GL KNMTD+GCLYLGKM R+GF PN VLYS+L+NHYL++GEV
Sbjct: 661 LIRGLGMKNMTDKGCLYLGKMSRNGFLPNVVLYSTLMNHYLRVGEV 706
BLAST of CmoCh16G002000 vs. NCBI nr
Match:
gi|590671717|ref|XP_007038409.1| (Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao])
HSP 1 Score: 1036.2 bits (2678), Expect = 3.5e-299
Identity = 513/898 (57.13%), Postives = 676/898 (75.28%), Query Frame = 1
Query: 1 MIRGRPCKYYLSVNFRNLVTTCTVPLDPPVTSSSSSASEHKTLCYSLVEQLIRRGLFLPA 60
MI+ R +L R +TT T+PLDP + SS ++HK+ C SL EQLI+RGL A
Sbjct: 1 MIKKRLLSCHLFFKTRRAITTSTLPLDPSFAAVSSICTDHKSFCLSLTEQLIKRGLLSSA 60
Query: 61 QQVIQRIVTQSSSISEAISIVDFAAERGLELDLDTHGVFWRQLVYSR-PQLAELLYDKKF 120
QQ+IQRI++QSSS+S+AI+ VDF RGL+LDL T G ++LV S PQLA LY
Sbjct: 61 QQLIQRIISQSSSVSDAITAVDFVTARGLDLDLSTFGALIKKLVRSGYPQLAYSLYSDNI 120
Query: 121 TFRGAEPDASVLDSMVICFCRLGKFEKALAYFNQLLSLNYVPSKTSFNAIFRELCAQERV 180
RG PD +++SMVIC C+LGK E+A F++LL +N K +FNA+ REL AQER
Sbjct: 121 IRRGINPDPFIVNSMVICLCKLGKLEEASTLFDRLL-MNNSSEKPAFNALVRELFAQERF 180
Query: 181 LEAFDYFVRVNGGGVHLGYWCFNVLIDGLCNKGHMEEALELFDIMQNTNGYPPSLHLFKS 240
L+ FDYFV ++ GV+LG W +N LIDGLC KG++EEA+++FD+M+ T G P+LHL+KS
Sbjct: 181 LDVFDYFVAMSDIGVNLGCWYYNGLIDGLCQKGNLEEAIQMFDLMRETAGLSPTLHLYKS 240
Query: 241 LFYGLCKRKWLVEAELLIREMEFRSLYPDKTMYTSLVHEYCKDKKMKMAMQAFFRMIKIG 300
LFYGLCK W++EAE LI E+E + Y D+TMYTSL+ EYCKD+KMKMAM+ + RM+K G
Sbjct: 241 LFYGLCKHGWVLEAEFLIGEIESQGFYVDRTMYTSLIKEYCKDRKMKMAMRIYLRMLKTG 300
Query: 301 CEPDNYTLNTLIHGFVKLGLVDKGWLVYNLMAEWGIQPDVVTFHIMISQYCQEGKVDFAL 360
CEPD+YT NTLIHGFVK+GL D+GW++YN M E G+QPDV+T+H+MIS YC+EGK + A
Sbjct: 301 CEPDSYTYNTLIHGFVKMGLFDQGWVLYNQMMEKGLQPDVITYHVMISNYCREGKANCAS 360
Query: 361 TILNNMVSCNFSPSLHCYTVLINALHRDDRLEEVSELLRSILDNGIVPDHVLFFTLMKMY 420
+LN+MVS N +PS+HCYTVLI + ++++RL E EL +S+L GIVPDHVLFFTLMKMY
Sbjct: 361 MLLNSMVSNNLAPSVHCYTVLITSFYKENRLMEAGELYKSMLTGGIVPDHVLFFTLMKMY 420
Query: 421 PKGHELQLALNFLEAILKNGCGCDPSVILASTKLQTSSNLEQKIETLLQEIFNSNLNLAG 480
PKG+EL LAL ++AI NGCG DP ++ S S +LEQKIE L+ +I +NL+LA
Sbjct: 421 PKGYELHLALMIVQAIAVNGCGFDPLLLAVS----DSEDLEQKIELLIGKIEKTNLSLAN 480
Query: 481 VAFSIVICALCETENLDCALDYFHKMASLGCKPLLFTYNSLIKCLCKEGLFEDALSLIDH 540
VAF+I+I AL E LD A+ + K+ +LGC PLLFTYNSL+KCL +EGLFEDA SL+D
Sbjct: 481 VAFTILISALSEGRKLDTAVHFMDKLMNLGCMPLLFTYNSLVKCLSQEGLFEDAKSLVDL 540
Query: 541 MQECSLLPDTTTYLIIINEHCRKGNVSSAHYIHRKMRQRGLKPSVAIYDSIIGCLSRKKR 600
MQ+ + PD TYLI++NEHC+ G+++SA I +M RG+KP VAIYD IIG L R+KR
Sbjct: 541 MQDRGIFPDQATYLIMVNEHCKHGDLASAFDILDQMEDRGMKPGVAIYDCIIGSLCRQKR 600
Query: 601 IFEVKGVFKKMLKAGVDPDKNLYLTMINGYGKNGKLLEARKLFEQMVENSIPPSSHIYTA 660
+FE + +F +ML++G DPD+ +Y+TMINGY KNG+L+EAR+LFE+M+E++I P+SH YTA
Sbjct: 601 LFEAEDMFIRMLESGEDPDEIVYMTMINGYAKNGRLIEARQLFEKMIEDAIRPTSHSYTA 660
Query: 661 LISGLVKKNMTDQGCLYLGKMLRDGFSPNSVLYSSLINHYLKIGEVEYAFRLVDLMERSH 720
LISGLVKK+MTD+GC+YL +ML DG PN VLY+SLIN++L+ GE E+AFRLVDLM+R+
Sbjct: 661 LISGLVKKDMTDKGCMYLDRMLGDGLVPNVVLYTSLINNFLRKGEFEFAFRLVDLMDRNQ 720
Query: 721 IEPDVIFYITLVSGICKNLIVDKKKWFLLEKENQKAKSTLFRMLHETTLVPRDNNMIVSA 780
IE D+I YI LVSG+C+N I +K+W +++ +++A+ LFR+LH L+PR+ + VS
Sbjct: 721 IEHDLITYIALVSGVCRN-ITSRKRWCSIKRSSERAREMLFRLLHYRCLLPREKKLRVSD 780
Query: 781 NSTEEMKSLALKLIQKVKDVCIVPNLHLYNSIICGYCRTDRMLDANHQLELMQKEGLHPN 840
+S E MK ALKL+QKVK+ +PNL+LYN II G+C DRM DA ELMQKEG+ PN
Sbjct: 781 SSPEAMKCFALKLMQKVKETRFMPNLYLYNGIISGFCWADRMQDAYDHFELMQKEGVRPN 840
Query: 841 QVTFTILMD-----GDVNSAIGLFNKMNVDGCIPDEVAYNTLLKGLSQGGRLSDALAL 893
QVT TILM G+++ AI LFNKMN D C PD++AYNTL+KGL Q GRL +AL+L
Sbjct: 841 QVTLTILMGGHIKAGEIDHAIDLFNKMNADDCTPDKIAYNTLIKGLCQAGRLLEALSL 892
BLAST of CmoCh16G002000 vs. NCBI nr
Match:
gi|731423136|ref|XP_010662380.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g62370 [Vitis vinifera])
HSP 1 Score: 985.7 bits (2547), Expect = 5.4e-284
Identity = 483/880 (54.89%), Postives = 650/880 (73.86%), Query Frame = 1
Query: 19 VTTCTVPLDPPVTSSSSSASEHKTLCYSLVEQLIRRGLFLPAQQVIQRIVTQSSSISEAI 78
+ TC+ LDPP SS+ + H LC++L ++LIRRG+ QQV++R++ QS S+S+AI
Sbjct: 19 LATCSPALDPP-PSSAPTTEHHNKLCFTLTDRLIRRGVLSLGQQVVRRMIKQSPSVSDAI 78
Query: 79 SIVDFAAERGLELDLDTHGVFWRQLVYS-RPQLAELLYDKKFTFRGAEPDASVLDSMVIC 138
V+FAA RGLELD +GV R+LV S + AE +Y RG PD+ L+SMVIC
Sbjct: 79 LAVEFAAARGLELDSCGYGVLLRKLVGSGEHRFAEAVYRDYVIARGIIPDSETLNSMVIC 138
Query: 139 FCRLGKFEKALAYFNQLLSLNYVPSKTSFNAIFRELCAQERVLEAFDYFVRVNGGGVHLG 198
+C LGK E+A+A+F++L ++ P K + NA+ RELCA+ERVLEAFDYFVR+N G+ +G
Sbjct: 139 YCNLGKLEEAMAHFDRLFEVDSFPCKPACNAMLRELCARERVLEAFDYFVRINDVGILMG 198
Query: 199 YWCFNVLIDGLCNKGHMEEALELFDIMQNTNGYPPSLHLFKSLFYGLCKRKWLVEAELLI 258
WCFN LIDGLC+KGH++EA +FD M+ G P ++HL+K+LFYGLC+++ + EAEL +
Sbjct: 199 LWCFNRLIDGLCDKGHVDEAFYMFDTMRERTGLPATIHLYKTLFYGLCRQERVEEAELFV 258
Query: 259 REMEFRSLYPDKTMYTSLVHEYCKDKKMKMAMQAFFRMIKIGCEPDNYTLNTLIHGFVKL 318
EME + DK MYTSL+H YC+ KKM+ AM+ F RM+K+GC+PD YT NTLIHGFVKL
Sbjct: 259 GEMESEGHFIDKMMYTSLIHGYCRGKKMRTAMRVFLRMLKMGCDPDTYTYNTLIHGFVKL 318
Query: 319 GLVDKGWLVYNLMAEWGIQPDVVTFHIMISQYCQEGKVDFALTILNNMVSCNFSPSLHCY 378
GL DKGW+++N M+EWG+QP+VVT+HIMI +YC+EGKVD ALT+L++M S N +PS+H Y
Sbjct: 319 GLFDKGWILHNQMSEWGLQPNVVTYHIMIRRYCEEGKVDCALTLLSSMSSFNLTPSVHSY 378
Query: 379 TVLINALHRDDRLEEVSELLRSILDNGIVPDHVLFFTLMKMYPKGHELQLALNFLEAILK 438
TVLI AL++++RL EV EL + +LD G+VPDHVLFFTLM+ PKGHEL LAL L+AI K
Sbjct: 379 TVLITALYKENRLVEVEELYKKMLDIGVVPDHVLFFTLMQKQPKGHELHLALKILQAIAK 438
Query: 439 NGCGCDPSVILASTKLQTSSNLEQKIETLLQEIFNSNLNLAGVAFSIVICALCETENLDC 498
NGC D ++ S + ++EQ+IE LL EI N LA VAF I I ALC D
Sbjct: 439 NGCNLDLCLLSTSATHSPTQDVEQEIECLLGEIVRRNFALADVAFGIFISALCAAGKTDA 498
Query: 499 ALDYFHKMASLGCKPLLFTYNSLIKCLCKEGLFEDALSLIDHMQECSLLPDTTTYLIIIN 558
AL + KM SLGC+PLL TYNSLIKCL +E L EDA SLID MQE ++PD TYLI+++
Sbjct: 499 ALLFMDKMVSLGCRPLLSTYNSLIKCLFQERLVEDAKSLIDLMQENGIVPDLATYLIMVH 558
Query: 559 EHCRKGNVSSAHYIHRKMRQRGLKPSVAIYDSIIGCLSRKKRIFEVKGVFKKMLKAGVDP 618
EHC G+++SA + +M +RGLKPSVAIYDSIIGCLSR+KRI E + VFK ML+AGVDP
Sbjct: 559 EHCNHGDLASAFGLLDQMNERGLKPSVAIYDSIIGCLSRRKRILEAENVFKMMLEAGVDP 618
Query: 619 DKNLYLTMINGYGKNGKLLEARKLFEQMVENSIPPSSHIYTALISGLVKKNMTDQGCLYL 678
D +Y+TMI+GY KN + +EAR+LF++M+E+ PSSH YTA+ISGLVK+NM D+GC YL
Sbjct: 619 DAIIYVTMISGYSKNRRAIEARQLFDKMIEHGFQPSSHSYTAVISGLVKENMIDKGCSYL 678
Query: 679 GKMLRDGFSPNSVLYSSLINHYLKIGEVEYAFRLVDLMERSHIEPDVIFYITLVSGICKN 738
ML+DGF PN+VLY+SLIN +L+ GE+E+AFRLVDLM+R+ IE D+I I LVSG+ +N
Sbjct: 679 SDMLKDGFVPNTVLYTSLINQFLRKGELEFAFRLVDLMDRNQIECDMITCIALVSGVSRN 738
Query: 739 LIVDKKKWFLLEKENQKAKSTLFRMLHETTLVPRDNNMIVSANSTEEMKSLALKLIQKVK 798
+ +++W+ ++ + + + L +LH++ ++PR+NN+ S ++K AL L+QK+K
Sbjct: 739 ITPVRRRWYHVKSGSARVREILLHLLHQSFVIPRENNLSFPRGSPRKIKYFALNLMQKIK 798
Query: 799 DVCIVPNLHLYNSIICGYCRTDRMLDANHQLELMQKEGLHPNQVTFTILMD-----GDVN 858
+PNL+LYN II G+CR + + DA + ELMQ EG+ PNQVTFTIL++ G+++
Sbjct: 799 GSSFMPNLYLYNGIISGFCRANMIQDAYNHFELMQTEGVCPNQVTFTILINGHTRFGEID 858
Query: 859 SAIGLFNKMNVDGCIPDEVAYNTLLKGLSQGGRLSDALAL 893
AIGLFNKMN DG PD + YN L+KGL + GRL DAL++
Sbjct: 859 HAIGLFNKMNADGLAPDGITYNALIKGLCKAGRLLDALSV 897
BLAST of CmoCh16G002000 vs. NCBI nr
Match:
gi|1009114466|ref|XP_015873703.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g62370 [Ziziphus jujuba])
HSP 1 Score: 977.2 bits (2525), Expect = 1.9e-281
Identity = 491/899 (54.62%), Postives = 658/899 (73.19%), Query Frame = 1
Query: 1 MIRGRPCKYYLSVNFRNLVTTCTVPLDPPVTSSSSSASEHKTLCYSLVEQLIRRGLFLPA 60
M++ R Y R +T+C +P P +S S+ A++H +LC S EQLIRRGL A
Sbjct: 1 MLKKRHNFCYFFFKARRKITSCALPFVPSNSSISTVANDHISLCLSSAEQLIRRGLLSHA 60
Query: 61 QQVIQRIVTQSSSISEAISIVDFAAERGLELDLDTHGVFWRQLV-YSRPQLAELLYDKKF 120
QQ ++RIV SSS S+A+ + +FA+ RGLELDLD++GV R+LV R QLAE +Y K
Sbjct: 61 QQFMKRIVMHSSSDSDALLVFNFASSRGLELDLDSYGVLLRKLVSLGRYQLAEYIYCKFI 120
Query: 121 TFRGAEPDASVLDSMVICFCRLGKFEKALAYFNQLLSLNYVPSKTSFNAIFRELCAQERV 180
RG D S+L+SMVICFC+LGK E+A + +++ ++N +P K + N + RELC+QE +
Sbjct: 121 GSRGMYNDLSILNSMVICFCKLGKLEEARIHLDRIFTMNSIPCKAACNTLIRELCSQEMI 180
Query: 181 LEAFDYFVRVNGGGVHLGYWCFNVLIDGLCNKGHMEEALELFDIMQNTNGYPPSLHLFKS 240
LEAF +FVR++ + LG+W FNVLIDGLC+KG+M+EAL++F+I+ + +G P+ HL+K+
Sbjct: 181 LEAFAHFVRISDARLFLGFWSFNVLIDGLCSKGYMDEALQVFNILCHRHGRLPTTHLYKT 240
Query: 241 LFYGLCKRKWLVEAELLIREMEFRSLYPDKTMYTSLVHEYCKDKKMKMAMQAFFRMIKIG 300
LFYG C R +VEAELL EME + LY DK MYTSL++EYCK+K+MKMAM+ F RM+K+G
Sbjct: 241 LFYGHCNRGKVVEAELLFIEMESKGLYIDKVMYTSLINEYCKNKEMKMAMRVFLRMLKMG 300
Query: 301 CEPDNYTLNTLIHGFVKLGLVDKGWLVYNLMAEWGIQPDVVTFHIMISQYCQEGKVDFAL 360
C+PD +T NTLI G++KL + DKG + LM EWG+QP+V F IMIS+YC+ G++D+ L
Sbjct: 301 CDPDAFTCNTLIQGYMKLCMFDKGLAINKLMTEWGVQPNVSAFGIMISEYCKNGEIDYGL 360
Query: 361 TILNNMVSCNFSPSLHCYTVLINALHRDDRLEEVSELLRSILDNGIVPDHVLFFTLMKMY 420
+LN MVS N +PS+HCYT+LI AL +RL EV EL SILD G+VPDH+LFF L+K
Sbjct: 361 MLLNKMVSFNLTPSVHCYTILIKALLEKNRLSEVDELYNSILDRGVVPDHILFFVLVKKC 420
Query: 421 PKGHELQLALNFLEAILKNGCGCDPSVILASTKLQTSSNLEQKIETLLQEIFNSNLNLAG 480
PK H L+LAL L AI KNGCG D S+IL S ++EQ+I LL EI SNLNLA
Sbjct: 421 PKVHYLELALKILRAIAKNGCGFDLSLILYPASQNPSQDVEQEIHVLLGEIATSNLNLAT 480
Query: 481 VAFSIVICALCETENLDCALDYFHKMASLGCKPLLFTYNSLIKCLCKEGLFEDALSLIDH 540
+A ++ I ALC NLD AL +F +M +LGC P LFTYN+LIKC C+E LFE A+SLID
Sbjct: 481 MAVNVYIHALCMDGNLDVALHWFDRMRNLGCLPSLFTYNTLIKCFCQEELFEYAVSLIDL 540
Query: 541 MQECSLLPDTTTYLIIINEHCRKGNVSSAHYIHRKMRQRGLKPSVAIYDSIIGCLSRKKR 600
M+ ++PD TYL+IINE C++G+ A ++ M RG+KP VAIYDSIIGCLSR+KR
Sbjct: 541 MEGKGIVPDQATYLVIINECCKRGDPELAFHVMDDMDGRGMKPGVAIYDSIIGCLSRRKR 600
Query: 601 IFEVKGVFKKMLKAGVDPDKNLYLTMINGYGKNGKLLEARKLFEQMVENSIPPSSHIYTA 660
I + + +FK+ML+AGV PD+ +Y TMINGY NG+ EA +LF++MV+NSI PS H YTA
Sbjct: 601 ILDAENMFKRMLEAGVGPDEVVYSTMINGYLNNGRATEAHQLFKKMVDNSIWPSLHCYTA 660
Query: 661 LISGLVKKNMTDQGCLYLGKMLRDGFSPNSVLYSSLINHYLKIGEVEYAFRLVDLMERSH 720
LISGLVK+NMTD+GC +L +ML+D PN+VLY+SLIN+YLK G +E+AFRLVDLM +
Sbjct: 661 LISGLVKRNMTDKGCEHLDRMLKDDLLPNAVLYTSLINNYLKKGRLEFAFRLVDLMCKCQ 720
Query: 721 IEPDVIFYITLVSGICKNLIVDKKKWFLLEKENQKAKSTLFRMLHETTLVPRDNNMIVSA 780
D I I+LVSG+C+N++ + KW L +E+ A+ LF +LH+ T +P++N++ VSA
Sbjct: 721 FAFDHIMCISLVSGVCRNIMSTRGKWHLQSRESDMAREKLFGLLHKNTHMPKENSLRVSA 780
Query: 781 NSTEEMKSLALKLIQK-VKDVCIVPNLHLYNSIICGYCRTDRMLDANHQLELMQKEGLHP 840
+S EE K LA+KLIQ ++ + NL+LYNSII GYC ++M +A ELMQ+EGLHP
Sbjct: 781 SSFEEKKCLAMKLIQTIIEKTSSMQNLYLYNSIISGYCYAEKMQEAYGHFELMQREGLHP 840
Query: 841 NQVTFTILMD-----GDVNSAIGLFNKMNVDGCIPDEVAYNTLLKGLSQGGRLSDALAL 893
NQVT+TILMD GD++SAIG+FNKMN DGC+PD +AYNTLL+GL + GRL +AL+L
Sbjct: 841 NQVTYTILMDGHLRSGDIDSAIGIFNKMNADGCLPDRIAYNTLLRGLCKAGRLLEALSL 899
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
PP443_ARATH | 2.0e-206 | 44.57 | Pentatricopeptide repeat-containing protein At5g62370 OS=Arabidopsis thaliana GN... | [more] |
PP437_ARATH | 4.7e-62 | 25.34 | Putative pentatricopeptide repeat-containing protein At5g59900 OS=Arabidopsis th... | [more] |
PP445_ARATH | 2.4e-58 | 24.22 | Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana GN... | [more] |
PP442_ARATH | 2.4e-58 | 23.85 | Pentatricopeptide repeat-containing protein At5g61990, mitochondrial OS=Arabidop... | [more] |
PPR67_ARATH | 9.2e-58 | 25.81 | Putative pentatricopeptide repeat-containing protein At1g31840 OS=Arabidopsis th... | [more] |
Match Name | E-value | Identity | Description | |
A0A0A0LB22_CUCSA | 0.0e+00 | 78.75 | Uncharacterized protein OS=Cucumis sativus GN=Csa_3G175715 PE=4 SV=1 | [more] |
A0A061G037_THECC | 2.4e-299 | 57.13 | Tetratricopeptide repeat-like superfamily protein, putative isoform 1 OS=Theobro... | [more] |
F6HAK9_VITVI | 3.8e-284 | 54.89 | Putative uncharacterized protein OS=Vitis vinifera GN=VIT_16s0022g01780 PE=4 SV=... | [more] |
A0A0B0MFC3_GOSAR | 2.9e-276 | 54.25 | Uncharacterized protein OS=Gossypium arboreum GN=F383_22354 PE=4 SV=1 | [more] |
A0A0D2QJ46_GOSRA | 8.4e-276 | 54.25 | Uncharacterized protein OS=Gossypium raimondii GN=B456_003G114200 PE=4 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
AT5G62370.1 | 1.1e-207 | 44.57 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |
AT5G59900.1 | 2.7e-63 | 25.34 | Pentatricopeptide repeat (PPR) superfamily protein | [more] |
AT5G65560.1 | 1.4e-59 | 24.22 | Pentatricopeptide repeat (PPR) superfamily protein | [more] |
AT5G61990.1 | 1.4e-59 | 23.85 | Pentatricopeptide repeat (PPR) superfamily protein | [more] |
AT1G31840.1 | 5.2e-59 | 25.81 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |