BLAST of Cp4.1LG01g16670 vs. Swiss-Prot
Match:
PP146_ARATH (Pentatricopeptide repeat-containing protein At2g03380, mitochondrial OS=Arabidopsis thaliana GN=PCMP-E47 PE=3 SV=1)
HSP 1 Score: 424.1 bits (1089), Expect = 2.3e-117
Identity = 212/421 (50.36%), Positives = 289/421 (68.65%), Query Frame = 1
Query: 45 ASVQFISLHSCFYFMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVGSAR 104
+S+ + + CF + C NID+L + HG+L +GL+G++ TKLV +YG G AR
Sbjct: 37 SSLHYAASSPCFLLLSKCTNIDSLRQSHGVLTGNGLMGDISIATKLVSLYGFFGYTKDAR 96
Query: 105 MVFDQMRDPDFYAWKVMIRWYFLNDMFASIIPFYNRMRMSFKECDNIIFSIILKACSELR 164
+VFDQ+ +PDFY WKVM+R Y LN ++ Y+ + D+I+FS LKAC+EL+
Sbjct: 97 LVFDQIPEPDFYLWKVMLRCYCLNKESVEVVKLYDLLMKHGFRYDDIVFSKALKACTELQ 156
Query: 165 EIDEGRKVHCQIVKVGGPDSFVLTGLIDMYGKCGQIEWSSAVFEGIIDKNVVSWTTMIAG 224
++D G+K+HCQ+VKV D+ VLTGL+DMY KCG+I+ + VF I +NVV WT+MIAG
Sbjct: 157 DLDNGKKIHCQLVKVPSFDNVVLTGLLDMYAKCGEIKSAHKVFNDITLRNVVCWTSMIAG 216
Query: 225 YVQNDCAEEGLVLFNRMRESLVESNQFTLGSIITACTRLRALHQGKWVHGYAIKNVIIEL 284
YV+ND EEGLVLFNRMRE+ V N++T G++I ACT+L ALHQGKW HG +K+ IEL
Sbjct: 217 YVKNDLCEEGLVLFNRMRENNVLGNEYTYGTLIMACTKLSALHQGKWFHGCLVKSG-IEL 276
Query: 285 NSFLATAFLDMYVKCGQTRDAHMIFDELPSIDLVSWTAMIVGYSQAGQPNEALRLFTDKI 344
+S L T+ LDMYVKCG +A +F+E +DLV WTAMIVGY+ G NEAL LF
Sbjct: 277 SSCLVTSLLDMYVKCGDISNARRVFNEHSHVDLVMWTAMIVGYTHNGSVNEALSLFQKMK 336
Query: 345 RSGLLPNSVTAASILSACSVSGNLSIGMLVHGLGIKLGLEECAVKNALIDMYAKCHMIDD 404
+ PN VT AS+LS C + NL +G VHGL IK+G+ + V NAL+ MYAKC+ D
Sbjct: 337 GVEIKPNCVTIASVLSGCGLIENLELGRSVHGLSIKVGIWDTNVANALVHMYAKCYQNRD 396
Query: 405 AYVVFLGVLEKDVITWNSMISGYAQSGSAYDALRLFNQMRSDSLAPDAITLVSALSASAI 464
A VF EKD++ WNS+ISG++Q+GS ++AL LF++M S+S+ P+ +T+ S SA A
Sbjct: 397 AKYVFEMESEKDIVAWNSIISGFSQNGSIHEALFLFHRMNSESVTPNGVTVASLFSACAS 456
Query: 465 L 466
L
Sbjct: 457 L 456
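
The two header lines of each HSP carry the figures most readers want to tabulate: bit score, raw score, E-value, and the identity and positive counts. A minimal Python sketch of pulling them out and recomputing the percentages is shown here. The function name `parse_hsp` and the regular expressions are illustrative assumptions, not part of any BLAST tool, and the pattern accepts the label spelled either "Positives" or "Postives", since both occur in exported reports.

```python
import re

# Hypothetical patterns written for the header layout used in this report.
HSP_RE = re.compile(r"HSP (\d+) Score: ([\d.]+) bits \((\d+)\), Expect = (\S+)")
IDENT_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp(score_line, identity_line):
    """Return the numeric fields of one HSP header as a dict."""
    s = HSP_RE.search(score_line)
    i = IDENT_RE.search(identity_line)
    if s is None or i is None:
        raise ValueError("line does not match the expected HSP header layout")
    identical, length = int(i.group(1)), int(i.group(2))
    positives = int(i.group(3))
    return {
        "hsp": int(s.group(1)),
        "bits": float(s.group(2)),
        "raw_score": int(s.group(3)),
        "evalue": float(s.group(4)),
        "align_len": length,
        # Percentages are counts divided by the alignment length.
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
    }


# Header lines of the PP146_ARATH hit above.
hit = parse_hsp(
    "HSP 1 Score: 424.1 bits (1089), Expect = 2.3e-117",
    "Identity = 212/421 (50.36%), Positives = 289/421 (68.65%), Query Frame = 1",
)
print(hit["identity_pct"], hit["positives_pct"])
```

Both percentages are taken over the alignment length (421 columns here), not over the full query or subject length.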
BLAST of Cp4.1LG01g16670 vs. Swiss-Prot
Match:
PP224_ARATH (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana GN=PCMP-H43 PE=2 SV=1)
HSP 1 Score: 246.5 bits (628), Expect = 6.6e-64
Identity = 143/434 (32.95%), Positives = 242/434 (55.76%), Query Frame = 1
Query: 68 LIKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVGSARMVFDQMRDPDFYAWKVMIRWYFL 127
L + H L+V GL + TKL+ + GD+ AR VFD + P + W +IR Y
Sbjct: 37 LKQIHARLLVLGLQFSGFLITKLIHASSSFGDITFARQVFDDLPRPQIFPWNAIIRGYSR 96
Query: 128 NDMFASIIPFYNRMRMSFKECDNIIFSIILKACSELREIDEGRKVHCQIVKVG-GPDSFV 187
N+ F + Y+ M+++ D+ F +LKACS L + GR VH Q+ ++G D FV
Sbjct: 97 NNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVHAQVFRLGFDADVFV 156
Query: 188 LTGLIDMYGKCGQIEWSSAVFEG--IIDKNVVSWTTMIAGYVQNDCAEEGLVLFNRMRES 247
GLI +Y KC ++ + VFEG + ++ +VSWT +++ Y QN E L +F++MR+
Sbjct: 157 QNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPMEALEIFSQMRKM 216
Query: 248 LVESNQFTLGSIITACTRLRALHQGKWVHGYAIKNVIIELNSFLATAFLDMYVKCGQTRD 307
V+ + L S++ A T L+ L QG+ +H +K + +E+ L + MY KCGQ
Sbjct: 217 DVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVK-MGLEIEPDLLISLNTMYAKCGQVAT 276
Query: 308 AHMIFDELPSIDLVSWTAMIVGYSQAGQPNEALRLFTDKIRSGLLPNSVTAASILSACSV 367
A ++FD++ S +L+ W AMI GY++ G EA+ +F + I + P++++ S +SAC+
Sbjct: 277 AKILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMINKDVRPDTISITSAISACAQ 336
Query: 368 SGNLSIGMLVHG-LGIKLGLEECAVKNALIDMYAKCHMIDDAYVVFLGVLEKDVITWNSM 427
G+L ++ +G ++ + +ALIDM+AKC ++ A +VF L++DV+ W++M
Sbjct: 337 VGSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAM 396
Query: 428 ISGYAQSGSAYDALRLFNQMRSDSLAPDAITLVSALSA---SAILETSFYLSNSIANSEI 487
I GY G A +A+ L+ M + P+ +T + L A S ++ ++ N +A+ +I
Sbjct: 397 IVGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLMACNHSGMVREGWWFFNRMADHKI 456
Query: 488 MLTKQQLEFLCGIN 495
QQ + C I+
Sbjct: 457 --NPQQQHYACVID 467
BLAST of Cp4.1LG01g16670 vs. Swiss-Prot
Match:
PP319_ARATH (Pentatricopeptide repeat-containing protein At4g18520 OS=Arabidopsis thaliana GN=PCMP-A2 PE=2 SV=1)
HSP 1 Score: 245.7 bits (626), Expect = 1.1e-63
Identity = 146/397 (36.78%), Positives = 221/397 (55.67%), Query Frame = 1
Query: 72 HGLLIVHGLVGNLLCDTKLVGVYGALGDVGSARMVFDQMRDPDFYAWKVMIRWYFLNDMF 131
HG ++ G VGNL+ ++ LV Y G++ SA FD M + D +W +I
Sbjct: 207 HGNMVKVG-VGNLIVESSLVYFYAQCGELTSALRAFDMMEEKDVISWTAVISACSRKGHG 266
Query: 132 ASIIPFYNRMRMSFKECDNIIFSIILKACSELREIDEGRKVHCQIVK-VGGPDSFVLTGL 191
I + M + + ILKACSE + + GR+VH +VK + D FV T L
Sbjct: 267 IKAIGMFIGMLNHWFLPNEFTVCSILKACSEEKALRFGRQVHSLVVKRMIKTDVFVGTSL 326
Query: 192 IDMYGKCGQIEWSSAVFEGIIDKNVVSWTTMIAGYVQNDCAEEGLVLFNRMRESLVESNQ 251
+DMY KCG+I VF+G+ ++N V+WT++IA + + EE + LF M+ + +N
Sbjct: 327 MDMYAKCGEISDCRKVFDGMSNRNTVTWTSIIAAHAREGFGEEAISLFRIMKRRHLIANN 386
Query: 252 FTLGSIITACTRLRALHQGKWVHGYAIKNVIIELNSFLATAFLDMYVKCGQTRDAHMIFD 311
T+ SI+ AC + AL GK +H IKN I E N ++ + + +Y KCG++RDA +
Sbjct: 387 LTVVSILRACGSVGALLLGKELHAQIIKNSI-EKNVYIGSTLVWLYCKCGESRDAFNVLQ 446
Query: 312 ELPSIDLVSWTAMIVGYSQAGQPNEALRLFTDKIRSGLLPNSVTAASILSACSVSGNLSI 371
+LPS D+VSWTAMI G S G +EAL + I+ G+ PN T +S L AC+ S +L I
Sbjct: 447 QLPSRDVVSWTAMISGCSSLGHESEALDFLKEMIQEGVEPNPFTYSSALKACANSESLLI 506
Query: 372 GMLVHGLGIK-LGLEECAVKNALIDMYAKCHMIDDAYVVFLGVLEKDVITWNSMISGYAQ 431
G +H + K L V +ALI MYAKC + +A+ VF + EK++++W +MI GYA+
Sbjct: 507 GRSIHSIAKKNHALSNVFVGSALIHMYAKCGFVSEAFRVFDSMPEKNLVSWKAMIMGYAR 566
Query: 432 SGSAYDALRLFNQMRSDSLAPDAITLVSALSASAILE 467
+G +AL+L +M ++ D + LS +E
Sbjct: 567 NGFCREALKLMYRMEAEGFEVDDYIFATILSTCGDIE 601
BLAST of Cp4.1LG01g16670 vs. Swiss-Prot
Match:
PPR32_ARATH (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H40 PE=2 SV=1)
HSP 1 Score: 241.5 bits (615), Expect = 2.1e-62
Identity = 139/420 (33.10%), Positives = 227/420 (54.05%), Query Frame = 1
Query: 53 HSCFYFMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVGSARMVFDQMRD 112
H + C ++ L + L+ +GL TKLV ++ G V A VF+ +
Sbjct: 38 HPAALLLERCSSLKELRQILPLVFKNGLYQEHFFQTKLVSLFCRYGSVDEAARVFEPIDS 97
Query: 113 PDFYAWKVMIRWYFLNDMFASIIPFYNRMRMSFKECDNIIFSIILKACSELREIDEGRKV 172
+ M++ + + F+ RMR E F+ +LK C + E+ G+++
Sbjct: 98 KLNVLYHTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEI 157
Query: 173 HCQIVKVG-GPDSFVLTGLIDMYGKCGQIEWSSAVFEGIIDKNVVSWTTMIAGYVQNDCA 232
H +VK G D F +TGL +MY KC Q+ + VF+ + ++++VSW T++AGY QN A
Sbjct: 158 HGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMA 217
Query: 233 EEGLVLFNRMRESLVESNQFTLGSIITACTRLRALHQGKWVHGYAIKNVIIELNSFLATA 292
L + M E ++ + T+ S++ A + LR + GK +HGYA+++ L + ++TA
Sbjct: 218 RMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVN-ISTA 277
Query: 293 FLDMYVKCGQTRDAHMIFDELPSIDLVSWTAMIVGYSQAGQPNEALRLFTDKIRSGLLPN 352
+DMY KCG A +FD + ++VSW +MI Y Q P EA+ +F + G+ P
Sbjct: 278 LVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPT 337
Query: 353 SVTAASILSACSVSGNLSIGMLVHGLGIKLGLE-ECAVKNALIDMYAKCHMIDDAYVVFL 412
V+ L AC+ G+L G +H L ++LGL+ +V N+LI MY KC +D A +F
Sbjct: 338 DVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFG 397
Query: 413 GVLEKDVITWNSMISGYAQSGSAYDALRLFNQMRSDSLAPDAITLVSALSASAILETSFY 471
+ + +++WN+MI G+AQ+G DAL F+QMRS ++ PD T VS ++A A L + +
Sbjct: 398 KLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHH 456
BLAST of Cp4.1LG01g16670 vs. Swiss-Prot
Match:
PP214_ARATH (Putative pentatricopeptide repeat-containing protein At3g05240 OS=Arabidopsis thaliana GN=PCMP-E82 PE=3 SV=2)
HSP 1 Score: 233.4 bits (594), Expect = 5.8e-60
Identity = 139/413 (33.66%), Positives = 215/413 (52.06%), Query Frame = 1
Query: 62 CRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALGD---VGSARMVFDQMRDPDFYAW 121
CR++ L + HGL+I ++ N++ ++L+ + + AR VF+ + P Y W
Sbjct: 16 CRSLVELNQLHGLMIKSSVIRNVIPLSRLIDFCTTCPETMNLSYARSVFESIDCPSVYIW 75
Query: 122 KVMIRWYFLNDMFASIIPFYNRMRMSFKECDNIIFSIILKACSELREIDEGRKVHCQIVK 181
MIR Y + + FY M D F +LKACS LR+I G VH +VK
Sbjct: 76 NSMIRGYSNSPNPDKALIFYQEMLRKGYSPDYFTFPYVLKACSGLRDIQFGSCVHGFVVK 135
Query: 182 VGGP-DSFVLTGLIDMYGKCGQIEWSSAVFEGIIDKNVVSWTTMIAGYVQNDCAEEGLVL 241
G + +V T L+ MY CG++ + VFE I NVV+W ++I+G+V N+ + +
Sbjct: 136 TGFEVNMYVSTCLLHMYMCCGEVNYGLRVFEDIPQWNVVAWGSLISGFVNNNRFSDAIEA 195
Query: 242 FNRMRESLVESNQFTLGSIITACTRLRALHQGKWVHGYA-------IKNVIIELNSFLAT 301
F M+ + V++N+ + ++ AC R + + GKW HG+ + N LAT
Sbjct: 196 FREMQSNGVKANETIMVDLLVACGRCKDIVTGKWFHGFLQGLGFDPYFQSKVGFNVILAT 255
Query: 302 AFLDMYVKCGQTRDAHMIFDELPSIDLVSWTAMIVGYSQAGQPNEALRLFTDKIRSGLLP 361
+ +DMY KCG R A +FD +P LVSW ++I GYSQ G EAL +F D + G+ P
Sbjct: 256 SLIDMYAKCGDLRTARYLFDGMPERTLVSWNSIITGYSQNGDAEEALCMFLDMLDLGIAP 315
Query: 362 NSVTAASILSACSVSGNLSIGMLVHGLGIKLG-LEECAVKNALIDMYAKCHMIDDAYVVF 421
+ VT S++ A + G +G +H K G +++ A+ AL++MYAK + A F
Sbjct: 316 DKVTFLSVIRASMIQGCSQLGQSIHAYVSKTGFVKDAAIVCALVNMYAKTGDAESAKKAF 375
Query: 422 LGVLEKDVITWNSMISGYAQSGSAYDALRLFNQMRSDSLA-PDAITLVSALSA 462
+ +KD I W +I G A G +AL +F +M+ A PD IT + L A
Sbjct: 376 EDLEKKDTIAWTVVIIGLASHGHGNEALSIFQRMQEKGNATPDGITYLGVLYA 428
BLAST of Cp4.1LG01g16670 vs. TrEMBL
Match:
A0A0A0KLZ1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G189930 PE=4 SV=1)
HSP 1 Score: 786.9 bits (2031), Expect = 1.5e-224
Identity = 393/465 (84.52%), Positives = 420/465 (90.32%), Query Frame = 1
Query: 1 MLQRFFSLPRIFSRLAGPLLEMGHHVSYSTYASQPPLSDLDQTMASVQFISLHSCFYFMG 60
MLQRF PR F RLA PLL+MGH +SYSTYAS PPLSDL QTM SVQFISL C Y MG
Sbjct: 13 MLQRF---PRAFFRLASPLLQMGHRMSYSTYASHPPLSDLHQTMPSVQFISLPLCCYLMG 72
Query: 61 LCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVGSARMVFDQMRDPDFYAWKV 120
L RNIDTLIKFHGLLIVHGL+GNLLCDTKLVGVYGALGDV SARMVFDQM +PDFYAWKV
Sbjct: 73 LFRNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKV 132
Query: 121 MIRWYFLNDMFASIIPFYNRMRMSFKECDNIIFSIILKACSELREIDEGRKVHCQIVKVG 180
MIRWYFLND+F +IPFYNRMRMSF+ECDNIIFSIILKACSELREI EGRKVHCQIVKVG
Sbjct: 133 MIRWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVKVG 192
Query: 181 GPDSFVLTGLIDMYGKCGQIEWSSAVFEGIIDKNVVSWTTMIAGYVQNDCAEEGLVLFNR 240
GPDSFV+TGLIDMYGKCGQ+E SSAVFE I+DKNVVSWT+MIAGYVQN+CAEEGLVLFNR
Sbjct: 193 GPDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNR 252
Query: 241 MRESLVESNQFTLGSIITACTRLRALHQGKWVHGYAIKNVIIELNSFLATAFLDMYVKCG 300
MR++LVESN FTLGSII ACT+LRALHQGKWVHGYAIKN I EL+SFLAT FLDMYVKCG
Sbjct: 253 MRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKN-IAELSSFLATTFLDMYVKCG 312
Query: 301 QTRDAHMIFDELPSIDLVSWTAMIVGYSQAGQPNEALRLFTDKIRSGLLPNSVTAASILS 360
QTRDA MI+DELP+IDLVSWTAMIVGY+QA QPN+ LRLF D+IRS LLPNSVTAAS+LS
Sbjct: 313 QTRDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLS 372
Query: 361 ACSVSGNLSIGMLVHGLGIKLGLEECAVKNALIDMYAKCHMIDDAYVVFLGVLEKDVITW 420
ACSVSGNL++GM VHGLGIK+GLEEC VKNALIDMYAKCH I DAY +F GVLEKDVITW
Sbjct: 373 ACSVSGNLNLGMSVHGLGIKIGLEECVVKNALIDMYAKCHKIGDAYAIFHGVLEKDVITW 432
Query: 421 NSMISGYAQSGSAYDALRLFNQMRSDSLAPDAITLVSALSASAIL 466
NSMISGYAQ+GSAYDALRLFNQMR LAPD ITLVS LSA A L
Sbjct: 433 NSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSTLSACATL 473
BLAST of Cp4.1LG01g16670 vs. TrEMBL
Match:
A0A061EAZ8_THECC (Pentatricopeptide repeat superfamily protein OS=Theobroma cacao GN=TCM_011604 PE=4 SV=1)
HSP 1 Score: 597.4 bits (1539), Expect = 1.7e-167
Identity = 287/438 (65.53%), Positives = 353/438 (80.59%), Query Frame = 1
Query: 30 TYASQPPLS--DLDQTMASVQFISLHSCFYFMGLCRNIDTLIKFHGLLIVHGLVGNLLCD 89
+Y + PL +D+T+AS+ ISL+ CF +GLCRNID+L K H L +++G+ G+LLCD
Sbjct: 30 SYTTDHPLEYPSMDRTLASMHSISLNPCFALLGLCRNIDSLKKVHALFVINGIKGDLLCD 89
Query: 90 TKLVGVYGALGDVGSARMVFDQMRDPDFYAWKVMIRWYFLNDMFASIIPFYNRMRMSFKE 149
TKLV +YG G +G AR++FDQ+ DPDFY+WKVMIRWYFLND+ II FY RMRMS +
Sbjct: 90 TKLVSLYGLFGHIGCARLMFDQIPDPDFYSWKVMIRWYFLNDLCMEIIGFYARMRMSVRM 149
Query: 150 CDNIIFSIILKACSELREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYGKCGQIEWSSAVF 209
CDN++FS++LKACSE+R+IDEGRKVHCQIVK G PDSFV TGL+DMY KCG+IE S VF
Sbjct: 150 CDNVVFSVVLKACSEMRDIDEGRKVHCQIVKAGNPDSFVQTGLVDMYAKCGEIECSRKVF 209
Query: 210 EGIIDKNVVSWTTMIAGYVQNDCAEEGLVLFNRMRESLVESNQFTLGSIITACTRLRALH 269
IID+NVVSWT+MIAGYVQNDCAE+ LVLFNRMRE++VE N+FTLGS++TAC +L ALH
Sbjct: 210 SEIIDRNVVSWTSMIAGYVQNDCAEDALVLFNRMREAMVEGNEFTLGSLVTACGKLGALH 269
Query: 270 QGKWVHGYAIKNVIIELNSFLATAFLDMYVKCGQTRDAHMIFDELPSIDLVSWTAMIVGY 329
QGKWVHGY IKN IELNS+ T LDMYVKCG RDA +FDEL S+DLVSWTAMIVGY
Sbjct: 270 QGKWVHGYVIKNG-IELNSYSVTTLLDMYVKCGSIRDARSVFDELSSVDLVSWTAMIVGY 329
Query: 330 SQAGQPNEALRLFTDKIRSGLLPNSVTAASILSACSVSGNLSIGMLVHGLGIKLGLEECA 389
SQ+G P+EAL+LF DK G+LPN+VT AS+LSAC+ NLS G LVH LGI+LGL++
Sbjct: 330 SQSGFPDEALKLFIDKKWFGILPNAVTIASLLSACAQLSNLSFGRLVHALGIQLGLKDST 389
Query: 390 VKNALIDMYAKCHMIDDAYVVFLGVLEKDVITWNSMISGYAQSGSAYDALRLFNQMRSDS 449
V NAL+DMYAKC MI DA +F V +K++I WNS+ISGY+Q+GSAY+A LF+QMRS S
Sbjct: 390 VINALVDMYAKCGMIGDARYIFETVSDKNIIAWNSIISGYSQNGSAYEAFELFHQMRSKS 449
Query: 450 LAPDAITLVSALSASAIL 466
++PDA+T+VS SA A L
Sbjct: 450 VSPDAVTVVSIFSACASL 466
BLAST of Cp4.1LG01g16670 vs. TrEMBL
Match:
A5AY98_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g00900 PE=4 SV=1)
HSP 1 Score: 577.8 bits (1488), Expect = 1.4e-161
Identity = 276/426 (64.79%), Positives = 343/426 (80.52%), Query Frame = 1
Query: 39 DLDQTMASVQFISLHSCFYFMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALG 98
++D+T+AS+Q IS + CF +G+C+ + +L K H LL+VHGL +LLC+TKLV +YG+ G
Sbjct: 26 EIDRTIASIQSISSNPCFSLLGICKTVSSLRKIHALLVVHGLSEDLLCETKLVSLYGSFG 85
Query: 99 DVGSARMVFDQMRDPDFYAWKVMIRWYFLNDMFASIIPFYN-RMRMSFKECDNIIFSIIL 158
V AR++FD++R+PD Y+WKVMIRWYFLND ++ I+ FYN R+R E DN++FSI+L
Sbjct: 86 HVECARLMFDRIRNPDLYSWKVMIRWYFLNDSYSEIVQFYNTRLRKCLNEYDNVVFSIVL 145
Query: 159 KACSELREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYGKCGQIEWSSAVFEGIIDKNVVS 218
KACSELRE DEGRK+HCQIVKVG PDSFVLTGL+DMY KC ++E S VF+ I+D+NVV
Sbjct: 146 KACSELRETDEGRKLHCQIVKVGSPDSFVLTGLVDMYAKCREVEDSRRVFDEILDRNVVC 205
Query: 219 WTTMIAGYVQNDCAEEGLVLFNRMRESLVESNQFTLGSIITACTRLRALHQGKWVHGYAI 278
WT+MI GYVQNDC +EGLVLFNRMRE LVE NQ+TLGS++TACT+L ALHQGKWVHGY I
Sbjct: 206 WTSMIVGYVQNDCLKEGLVLFNRMREGLVEGNQYTLGSLVTACTKLGALHQGKWVHGYVI 265
Query: 279 KNVIIELNSFLATAFLDMYVKCGQTRDAHMIFDELPSIDLVSWTAMIVGYSQAGQPNEAL 338
K+ +LNSFL T LD+Y KCG RDA +FDEL +IDLVSWTAMIVGY+Q G P EAL
Sbjct: 266 KSGF-DLNSFLVTPLLDLYFKCGDIRDAFSVFDELSTIDLVSWTAMIVGYAQRGYPREAL 325
Query: 339 RLFTDKIRSGLLPNSVTAASILSACSVSGNLSIGMLVHGLGIKLGLEECAVKNALIDMYA 398
+LFTD+ LLPN+VT +S+LSAC+ +G+L++G VH LGIKLG E+ +NAL+DMYA
Sbjct: 326 KLFTDERWKDLLPNTVTTSSVLSACAQTGSLNMGRSVHCLGIKLGSEDATFENALVDMYA 385
Query: 399 KCHMIDDAYVVFLGVLEKDVITWNSMISGYAQSGSAYDALRLFNQMRSDSLAPDAITLVS 458
KCHMI DA VF V +KDVI WNS+ISGY Q+G AY+AL LF+QMRSDS+ PDAITLVS
Sbjct: 386 KCHMIGDARYVFETVFDKDVIAWNSIISGYTQNGYAYEALELFDQMRSDSVYPDAITLVS 445
Query: 459 ALSASA 464
LSA A
Sbjct: 446 VLSACA 450
BLAST of Cp4.1LG01g16670 vs. TrEMBL
Match:
A0A0D2RH88_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G178400 PE=4 SV=1)
HSP 1 Score: 562.8 bits (1449), Expect = 4.6e-157
Identity = 283/488 (57.99%), Positives = 364/488 (74.59%), Query Frame = 1
Query: 27 SYSTYASQP-PLSDLDQTMASVQFISLHSCFYFMGLCRNIDTLIKFHGLLIVHGLVGNLL 86
S S + QP LD ++AS+ +S + CF + C+NID L + H L I++G+ G+LL
Sbjct: 44 SLSYFTDQPFEYPSLDPSLASLHSVSSNPCFALLSFCKNIDCLKEVHALFIINGIKGDLL 103
Query: 87 CDTKLVGVYGALGDVGSARMVFDQMRDPDFYAWKVMIRWYFLNDMFASIIPFYNRMRMSF 146
CDTKLV +YG+ G VG A VFD++ +PDFY+WKVMIRWYFLND++ II FY RMRMS
Sbjct: 104 CDTKLVSLYGSFGHVGYAGSVFDRIPEPDFYSWKVMIRWYFLNDLYTEIIGFYGRMRMSV 163
Query: 147 KECDNIIFSIILKACSELREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYGKCGQIEWSSA 206
+ DN++FS++LKACSEL++I+EGRKVHC +VKVG PDSFV TGL+DMY KC QI+ +
Sbjct: 164 RGFDNVVFSVVLKACSELQDINEGRKVHCDVVKVGNPDSFVQTGLVDMYAKCRQIKCARK 223
Query: 207 VFEGIIDKNVVSWTTMIAGYVQNDCAEEGLVLFNRMRESLVESNQFTLGSIITACTRLRA 266
VF I +NVVSWT+M+AGYVQN+C++E LVLFNRMRE++VESNQFTLGS++TAC +L A
Sbjct: 224 VFGEIFYRNVVSWTSMLAGYVQNNCSKEALVLFNRMREAMVESNQFTLGSLVTACGKLGA 283
Query: 267 LHQGKWVHGYAIKNVIIELNSFLATAFLDMYVKCGQTRDAHMIFDELPSIDLVSWTAMIV 326
LHQGKWVHGY IK IELNS+L TA LDMYVKCG RDA FD LPS+DLVSWTAMIV
Sbjct: 284 LHQGKWVHGYIIKTG-IELNSYLVTAILDMYVKCGSLRDARSAFDALPSVDLVSWTAMIV 343
Query: 327 GYSQAGQPNEALRLFTDKIRSGLLPNSVTAASILSACSVSGNLSIGMLVHGLGIKLGLEE 386
GYSQ+G P+EAL+LF DK R G+LPN+VT AS+LSAC+ NLS G LVH LGI+LGL +
Sbjct: 344 GYSQSGFPDEALKLFVDKRRFGILPNAVTIASLLSACAQLSNLSAGRLVHSLGIQLGLID 403
Query: 387 CAVKNALIDMYAKCHMIDDAYVVFLGVLEKDVITWNSMISGYAQSGSAYDALRLFNQMRS 446
V NAL+DMYAKC +I A +F V +K++I WNS++SGY+Q+G AYDAL LF+QMRS
Sbjct: 404 PTVINALVDMYAKCGVIRAASYIFETVSDKNLIAWNSILSGYSQNGLAYDALELFHQMRS 463
Query: 447 DSLAPDAITLVSALSASAILET--------SFYLSNSIANSEIMLTKQQLEFL--CGINS 504
+S++PDA+TLVS SA A + ++ + N + +S + + L F CG +
Sbjct: 464 NSVSPDAVTLVSIFSACASVGAFQVGSSLHAYTMKNGLLSSSVYVGTALLNFYAKCGDSK 523
BLAST of Cp4.1LG01g16670 vs. TrEMBL
Match:
A0A067L9V5_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_15672 PE=4 SV=1)
HSP 1 Score: 562.0 bits (1447), Expect = 7.9e-157
Identity = 280/487 (57.49%), Positives = 359/487 (73.72%), Query Frame = 1
Query: 27 SYSTYASQPPLSDLDQTMASVQFISLHSCFYFMGLCRNIDTLIKFHGLLIVHGLVGNLLC 86
SY T+ D +AS+ +I H CF +G C+NI +L K HGLLIV GL G+LLC
Sbjct: 32 SYLTHQLPLDPPQFDHNIASIHYIFSHPCFNLLGFCKNIYSLKKVHGLLIVDGLDGDLLC 91
Query: 87 DTKLVGVYGALGDVGSARMVFDQMRDPDFYAWKVMIRWYFLNDMFASIIPFYNRMRMSFK 146
+TKLV +YG+ GD+ +AR+VFD++ +PD Y+WKVM+RWYFL+D++ I Y+RM++ K
Sbjct: 92 NTKLVSLYGSFGDIDAARVVFDRIPNPDLYSWKVMLRWYFLSDLYWEIFGLYSRMKICVK 151
Query: 147 ECDNIIFSIILKACSELREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYGKCGQIEWSSAV 206
E DN++FSI+LKACSELR IDEGRK+HCQ+VKVG PDSFVLTGL+DMY KCG+IE S V
Sbjct: 152 EYDNVMFSIVLKACSELRCIDEGRKIHCQVVKVGDPDSFVLTGLVDMYAKCGEIESSRHV 211
Query: 207 FEGIIDKNVVSWTTMIAGYVQNDCAEEGLVLFNRMRESLVESNQFTLGSIITACTRLRAL 266
F+ +D+NVVSWT+MIAGYVQNDC EGL LFNRMRE V NQFTLGS++TACT+L AL
Sbjct: 212 FDENLDRNVVSWTSMIAGYVQNDCPAEGLTLFNRMREGFVGGNQFTLGSLVTACTKLGAL 271
Query: 267 HQGKWVHGYAIKNVIIELNSFLATAFLDMYVKCGQTRDAHMIFDELPSIDLVSWTAMIVG 326
HQGKWVHG+AIK+ +ELNS+L TA LDMYVKCG +DA +FDEL S+DLVSWTAMIVG
Sbjct: 272 HQGKWVHGFAIKSG-VELNSYLVTALLDMYVKCGVIKDARSVFDELSSVDLVSWTAMIVG 331
Query: 327 YSQAGQPNEALRLFTDKIRSGLLPNSVTAASILSACSVSGNLSIGMLVHGLGIKLGLEEC 386
Y+Q+G +EAL+LF D+ + LPN VT ++LSAC+ GNL++G VHGLGIKLG +
Sbjct: 332 YTQSGLFHEALKLFMDE-KFDALPNDVTIVTVLSACAQLGNLNLGRSVHGLGIKLGFRQS 391
Query: 387 AVKNALIDMYAKCHMIDDAYVVFLGVLEKDVITWNSMISGYAQSGSAYDALRLFNQMRSD 446
V NAL+DMYAKCHM DA +F + KDV+ WNS+ISGY Q+GSAY+AL LF+QMR +
Sbjct: 392 TVANALVDMYAKCHMNRDASFIFERISHKDVVAWNSIISGYYQNGSAYEALELFHQMRME 451
Query: 447 SLAPDAITLVSALSASAILET--------SFYLSNSIANSEIMLTKQQLEFL--CGINSS 504
+ PDA+TLVS SA A+L ++ + + +S + + L F CG +S
Sbjct: 452 LVLPDAVTLVSVFSACALLGALRAGSSLHAYSIKEGLLSSNVYVGTALLTFYAKCGDANS 511
BLAST of Cp4.1LG01g16670 vs. TAIR10
Match:
AT2G03380.1 (AT2G03380.1 Pentatricopeptide repeat (PPR) superfamily protein)
HSP 1 Score: 424.1 bits (1089), Expect = 1.3e-118
Identity = 212/421 (50.36%), Positives = 289/421 (68.65%), Query Frame = 1
Query: 45 ASVQFISLHSCFYFMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVGSAR 104
+S+ + + CF + C NID+L + HG+L +GL+G++ TKLV +YG G AR
Sbjct: 37 SSLHYAASSPCFLLLSKCTNIDSLRQSHGVLTGNGLMGDISIATKLVSLYGFFGYTKDAR 96
Query: 105 MVFDQMRDPDFYAWKVMIRWYFLNDMFASIIPFYNRMRMSFKECDNIIFSIILKACSELR 164
+VFDQ+ +PDFY WKVM+R Y LN ++ Y+ + D+I+FS LKAC+EL+
Sbjct: 97 LVFDQIPEPDFYLWKVMLRCYCLNKESVEVVKLYDLLMKHGFRYDDIVFSKALKACTELQ 156
Query: 165 EIDEGRKVHCQIVKVGGPDSFVLTGLIDMYGKCGQIEWSSAVFEGIIDKNVVSWTTMIAG 224
++D G+K+HCQ+VKV D+ VLTGL+DMY KCG+I+ + VF I +NVV WT+MIAG
Sbjct: 157 DLDNGKKIHCQLVKVPSFDNVVLTGLLDMYAKCGEIKSAHKVFNDITLRNVVCWTSMIAG 216
Query: 225 YVQNDCAEEGLVLFNRMRESLVESNQFTLGSIITACTRLRALHQGKWVHGYAIKNVIIEL 284
YV+ND EEGLVLFNRMRE+ V N++T G++I ACT+L ALHQGKW HG +K+ IEL
Sbjct: 217 YVKNDLCEEGLVLFNRMRENNVLGNEYTYGTLIMACTKLSALHQGKWFHGCLVKSG-IEL 276
Query: 285 NSFLATAFLDMYVKCGQTRDAHMIFDELPSIDLVSWTAMIVGYSQAGQPNEALRLFTDKI 344
+S L T+ LDMYVKCG +A +F+E +DLV WTAMIVGY+ G NEAL LF
Sbjct: 277 SSCLVTSLLDMYVKCGDISNARRVFNEHSHVDLVMWTAMIVGYTHNGSVNEALSLFQKMK 336
Query: 345 RSGLLPNSVTAASILSACSVSGNLSIGMLVHGLGIKLGLEECAVKNALIDMYAKCHMIDD 404
+ PN VT AS+LS C + NL +G VHGL IK+G+ + V NAL+ MYAKC+ D
Sbjct: 337 GVEIKPNCVTIASVLSGCGLIENLELGRSVHGLSIKVGIWDTNVANALVHMYAKCYQNRD 396
Query: 405 AYVVFLGVLEKDVITWNSMISGYAQSGSAYDALRLFNQMRSDSLAPDAITLVSALSASAI 464
A VF EKD++ WNS+ISG++Q+GS ++AL LF++M S+S+ P+ +T+ S SA A
Sbjct: 397 AKYVFEMESEKDIVAWNSIISGFSQNGSIHEALFLFHRMNSESVTPNGVTVASLFSACAS 456
Query: 465 L 466
L
Sbjct: 457 L 456
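
The AT2G03380 alignment above has the same bit score (424.1) as the PP146_ARATH alignment from the Swiss-Prot search, but a different Expect value (1.3e-118 versus 2.3e-117). Under the standard relation E = m·n·2^(−S′), where S′ is the bit score and m·n is the search space, the E-value scales with database size, so the same alignment scores as more significant against a smaller database. The sketch below, whose function name is a hypothetical helper and which ignores the edge-effect corrections BLAST applies, backs out the implied search space from each reported pair of values.

```python
def implied_search_space(bits, evalue):
    """Solve E = m*n * 2**(-bits) for the search space m*n (approximate)."""
    return evalue * 2.0 ** bits


# Same alignment, reported against two different databases.
swissprot = implied_search_space(424.1, 2.3e-117)
tair10 = implied_search_space(424.1, 1.3e-118)

# Ratio of the two search spaces; with identical bit scores this is
# simply the ratio of the two E-values.
print(swissprot / tair10)
```

Because the reported bit scores and E-values are rounded, the result is only a rough estimate of the relative database sizes.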
BLAST of Cp4.1LG01g16670 vs. TAIR10
Match:
AT3G12770.1 (AT3G12770.1 mitochondrial editing factor 22)
HSP 1 Score: 246.5 bits (628), Expect = 3.7e-65
Identity = 143/434 (32.95%), Positives = 242/434 (55.76%), Query Frame = 1
Query: 68 LIKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVGSARMVFDQMRDPDFYAWKVMIRWYFL 127
L + H L+V GL + TKL+ + GD+ AR VFD + P + W +IR Y
Sbjct: 37 LKQIHARLLVLGLQFSGFLITKLIHASSSFGDITFARQVFDDLPRPQIFPWNAIIRGYSR 96
Query: 128 NDMFASIIPFYNRMRMSFKECDNIIFSIILKACSELREIDEGRKVHCQIVKVG-GPDSFV 187
N+ F + Y+ M+++ D+ F +LKACS L + GR VH Q+ ++G D FV
Sbjct: 97 NNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVHAQVFRLGFDADVFV 156
Query: 188 LTGLIDMYGKCGQIEWSSAVFEG--IIDKNVVSWTTMIAGYVQNDCAEEGLVLFNRMRES 247
GLI +Y KC ++ + VFEG + ++ +VSWT +++ Y QN E L +F++MR+
Sbjct: 157 QNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPMEALEIFSQMRKM 216
Query: 248 LVESNQFTLGSIITACTRLRALHQGKWVHGYAIKNVIIELNSFLATAFLDMYVKCGQTRD 307
V+ + L S++ A T L+ L QG+ +H +K + +E+ L + MY KCGQ
Sbjct: 217 DVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVK-MGLEIEPDLLISLNTMYAKCGQVAT 276
Query: 308 AHMIFDELPSIDLVSWTAMIVGYSQAGQPNEALRLFTDKIRSGLLPNSVTAASILSACSV 367
A ++FD++ S +L+ W AMI GY++ G EA+ +F + I + P++++ S +SAC+
Sbjct: 277 AKILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMINKDVRPDTISITSAISACAQ 336
Query: 368 SGNLSIGMLVHG-LGIKLGLEECAVKNALIDMYAKCHMIDDAYVVFLGVLEKDVITWNSM 427
G+L ++ +G ++ + +ALIDM+AKC ++ A +VF L++DV+ W++M
Sbjct: 337 VGSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAM 396
Query: 428 ISGYAQSGSAYDALRLFNQMRSDSLAPDAITLVSALSA---SAILETSFYLSNSIANSEI 487
I GY G A +A+ L+ M + P+ +T + L A S ++ ++ N +A+ +I
Sbjct: 397 IVGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLMACNHSGMVREGWWFFNRMADHKI 456
Query: 488 MLTKQQLEFLCGIN 495
QQ + C I+
Sbjct: 457 --NPQQQHYACVID 467
BLAST of Cp4.1LG01g16670 vs. TAIR10
Match:
AT4G18520.1 (AT4G18520.1 Pentatricopeptide repeat (PPR) superfamily protein)
HSP 1 Score: 245.7 bits (626), Expect = 6.3e-65
Identity = 146/397 (36.78%), Positives = 221/397 (55.67%), Query Frame = 1
Query: 72 HGLLIVHGLVGNLLCDTKLVGVYGALGDVGSARMVFDQMRDPDFYAWKVMIRWYFLNDMF 131
HG ++ G VGNL+ ++ LV Y G++ SA FD M + D +W +I
Sbjct: 207 HGNMVKVG-VGNLIVESSLVYFYAQCGELTSALRAFDMMEEKDVISWTAVISACSRKGHG 266
Query: 132 ASIIPFYNRMRMSFKECDNIIFSIILKACSELREIDEGRKVHCQIVK-VGGPDSFVLTGL 191
I + M + + ILKACSE + + GR+VH +VK + D FV T L
Sbjct: 267 IKAIGMFIGMLNHWFLPNEFTVCSILKACSEEKALRFGRQVHSLVVKRMIKTDVFVGTSL 326
Query: 192 IDMYGKCGQIEWSSAVFEGIIDKNVVSWTTMIAGYVQNDCAEEGLVLFNRMRESLVESNQ 251
+DMY KCG+I VF+G+ ++N V+WT++IA + + EE + LF M+ + +N
Sbjct: 327 MDMYAKCGEISDCRKVFDGMSNRNTVTWTSIIAAHAREGFGEEAISLFRIMKRRHLIANN 386
Query: 252 FTLGSIITACTRLRALHQGKWVHGYAIKNVIIELNSFLATAFLDMYVKCGQTRDAHMIFD 311
T+ SI+ AC + AL GK +H IKN I E N ++ + + +Y KCG++RDA +
Sbjct: 387 LTVVSILRACGSVGALLLGKELHAQIIKNSI-EKNVYIGSTLVWLYCKCGESRDAFNVLQ 446
Query: 312 ELPSIDLVSWTAMIVGYSQAGQPNEALRLFTDKIRSGLLPNSVTAASILSACSVSGNLSI 371
+LPS D+VSWTAMI G S G +EAL + I+ G+ PN T +S L AC+ S +L I
Sbjct: 447 QLPSRDVVSWTAMISGCSSLGHESEALDFLKEMIQEGVEPNPFTYSSALKACANSESLLI 506
Query: 372 GMLVHGLGIK-LGLEECAVKNALIDMYAKCHMIDDAYVVFLGVLEKDVITWNSMISGYAQ 431
G +H + K L V +ALI MYAKC + +A+ VF + EK++++W +MI GYA+
Sbjct: 507 GRSIHSIAKKNHALSNVFVGSALIHMYAKCGFVSEAFRVFDSMPEKNLVSWKAMIMGYAR 566
Query: 432 SGSAYDALRLFNQMRSDSLAPDAITLVSALSASAILE 467
+G +AL+L +M ++ D + LS +E
Sbjct: 567 NGFCREALKLMYRMEAEGFEVDDYIFATILSTCGDIE 601
BLAST of Cp4.1LG01g16670 vs. TAIR10
Match:
AT1G11290.1 (AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein)
HSP 1 Score: 241.5 bits (615), Expect = 1.2e-63
Identity = 139/420 (33.10%), Positives = 227/420 (54.05%), Query Frame = 1
Query: 53 HSCFYFMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVGSARMVFDQMRD 112
H + C ++ L + L+ +GL TKLV ++ G V A VF+ +
Sbjct: 38 HPAALLLERCSSLKELRQILPLVFKNGLYQEHFFQTKLVSLFCRYGSVDEAARVFEPIDS 97
Query: 113 PDFYAWKVMIRWYFLNDMFASIIPFYNRMRMSFKECDNIIFSIILKACSELREIDEGRKV 172
+ M++ + + F+ RMR E F+ +LK C + E+ G+++
Sbjct: 98 KLNVLYHTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEI 157
Query: 173 HCQIVKVG-GPDSFVLTGLIDMYGKCGQIEWSSAVFEGIIDKNVVSWTTMIAGYVQNDCA 232
H +VK G D F +TGL +MY KC Q+ + VF+ + ++++VSW T++AGY QN A
Sbjct: 158 HGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMA 217
Query: 233 EEGLVLFNRMRESLVESNQFTLGSIITACTRLRALHQGKWVHGYAIKNVIIELNSFLATA 292
L + M E ++ + T+ S++ A + LR + GK +HGYA+++ L + ++TA
Sbjct: 218 RMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVN-ISTA 277
Query: 293 FLDMYVKCGQTRDAHMIFDELPSIDLVSWTAMIVGYSQAGQPNEALRLFTDKIRSGLLPN 352
+DMY KCG A +FD + ++VSW +MI Y Q P EA+ +F + G+ P
Sbjct: 278 LVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPT 337
Query: 353 SVTAASILSACSVSGNLSIGMLVHGLGIKLGLE-ECAVKNALIDMYAKCHMIDDAYVVFL 412
V+ L AC+ G+L G +H L ++LGL+ +V N+LI MY KC +D A +F
Sbjct: 338 DVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFG 397
Query: 413 GVLEKDVITWNSMISGYAQSGSAYDALRLFNQMRSDSLAPDAITLVSALSASAILETSFY 471
+ + +++WN+MI G+AQ+G DAL F+QMRS ++ PD T VS ++A A L + +
Sbjct: 398 KLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHH 456
BLAST of Cp4.1LG01g16670 vs. TAIR10
Match:
AT3G05240.1 (AT3G05240.1 mitochondrial editing factor 19)
HSP 1 Score: 233.4 bits (594), Expect = 3.3e-61
Identity = 139/413 (33.66%), Positives = 215/413 (52.06%), Query Frame = 1
Query: 62 CRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALGD---VGSARMVFDQMRDPDFYAW 121
CR++ L + HGL+I ++ N++ ++L+ + + AR VF+ + P Y W
Sbjct: 16 CRSLVELNQLHGLMIKSSVIRNVIPLSRLIDFCTTCPETMNLSYARSVFESIDCPSVYIW 75
Query: 122 KVMIRWYFLNDMFASIIPFYNRMRMSFKECDNIIFSIILKACSELREIDEGRKVHCQIVK 181
MIR Y + + FY M D F +LKACS LR+I G VH +VK
Sbjct: 76 NSMIRGYSNSPNPDKALIFYQEMLRKGYSPDYFTFPYVLKACSGLRDIQFGSCVHGFVVK 135
Query: 182 VGGP-DSFVLTGLIDMYGKCGQIEWSSAVFEGIIDKNVVSWTTMIAGYVQNDCAEEGLVL 241
G + +V T L+ MY CG++ + VFE I NVV+W ++I+G+V N+ + +
Sbjct: 136 TGFEVNMYVSTCLLHMYMCCGEVNYGLRVFEDIPQWNVVAWGSLISGFVNNNRFSDAIEA 195
Query: 242 FNRMRESLVESNQFTLGSIITACTRLRALHQGKWVHGYA-------IKNVIIELNSFLAT 301
F M+ + V++N+ + ++ AC R + + GKW HG+ + N LAT
Sbjct: 196 FREMQSNGVKANETIMVDLLVACGRCKDIVTGKWFHGFLQGLGFDPYFQSKVGFNVILAT 255
Query: 302 AFLDMYVKCGQTRDAHMIFDELPSIDLVSWTAMIVGYSQAGQPNEALRLFTDKIRSGLLP 361
+ +DMY KCG R A +FD +P LVSW ++I GYSQ G EAL +F D + G+ P
Sbjct: 256 SLIDMYAKCGDLRTARYLFDGMPERTLVSWNSIITGYSQNGDAEEALCMFLDMLDLGIAP 315
Query: 362 NSVTAASILSACSVSGNLSIGMLVHGLGIKLG-LEECAVKNALIDMYAKCHMIDDAYVVF 421
+ VT S++ A + G +G +H K G +++ A+ AL++MYAK + A F
Sbjct: 316 DKVTFLSVIRASMIQGCSQLGQSIHAYVSKTGFVKDAAIVCALVNMYAKTGDAESAKKAF 375
Query: 422 LGVLEKDVITWNSMISGYAQSGSAYDALRLFNQMRSDSLA-PDAITLVSALSA 462
+ +KD I W +I G A G +AL +F +M+ A PD IT + L A
Sbjct: 376 EDLEKKDTIAWTVVIIGLASHGHGNEALSIFQRMQEKGNATPDGITYLGVLYA 428
BLAST of Cp4.1LG01g16670 vs. NCBI nr
Match:
gi|659114052|ref|XP_008456885.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g03380, mitochondrial [Cucumis melo])
HSP 1 Score: 814.7 bits (2103), Expect = 9.7e-233
Identity = 413/513 (80.51%), Positives = 446/513 (86.94%), Query Frame = 1
Query: 1 MLQRFFSLPRIFSRLAGPLLEMGHHVSYSTYASQPPLSDLDQTMASVQFISLHSCFYFMG 60
MLQRFFSLPR FS L G LL+MGH +SYSTYAS PPLSDL QTM SVQFISLHSC Y MG
Sbjct: 1 MLQRFFSLPRAFSHLTGSLLQMGHRMSYSTYASHPPLSDLHQTMPSVQFISLHSCCYLMG 60
Query: 61 LCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVGSARMVFDQMRDPDFYAWKV 120
L RNIDTLIKFHGLLIVHGL+GNLLCDTKLVGVYGALGDV SARMVFDQM DPDFYAWKV
Sbjct: 61 LFRNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPDPDFYAWKV 120
Query: 121 MIRWYFLNDMFASIIPFYNRMRMSFKECDNIIFSIILKACSELREIDEGRKVHCQIVKVG 180
MIRWYFLND+F +IPFYN MRMSF+ECDNIIFSIILKACSELREIDEGRKVHCQIVKVG
Sbjct: 121 MIRWYFLNDLFVDVIPFYNCMRMSFRECDNIIFSIILKACSELREIDEGRKVHCQIVKVG 180
Query: 181 GPDSFVLTGLIDMYGKCGQIEWSSAVFEGIIDKNVVSWTTMIAGYVQNDCAEEGLVLFNR 240
GPDSFV+TGLIDMYGKC Q+E SSAVFE I+DKNVVSWT+MIAGYVQN+CAEEGLVLFNR
Sbjct: 181 GPDSFVMTGLIDMYGKCRQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNR 240
Query: 241 MRESLVESNQFTLGSIITACTRLRALHQGKWVHGYAIKNVIIELNSFLATAFLDMYVKCG 300
MR++LVESN FTLGSII ACT+LRALHQGKWVHGYAIKN I+E +SFLAT FLDMYVKCG
Sbjct: 241 MRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKN-IVEFSSFLATTFLDMYVKCG 300
Query: 301 QTRDAHMIFDELPSIDLVSWTAMIVGYSQAGQPNEALRLFTDKIRSGLLPNSVTAASILS 360
QTRDA MIFDELP+IDLVSWTAMIVGY+QA QPN+ LRLF D+IRS LLPNSVTAAS+LS
Sbjct: 301 QTRDARMIFDELPTIDLVSWTAMIVGYTQASQPNDGLRLFADEIRSDLLPNSVTAASVLS 360
Query: 361 ACSVSGNLSIGMLVHGLGIKLGLEECAVKNALIDMYAKCHMIDDAYVVFLGVLEKDVITW 420
ACSVSGNL++GM VHGLGIKLGLEECAVKNALIDMYAKCH I DAYV+F GVLEKDVITW
Sbjct: 361 ACSVSGNLNLGMSVHGLGIKLGLEECAVKNALIDMYAKCHKIGDAYVIFHGVLEKDVITW 420
Query: 421 NSMISGYAQSGSAYDALRLFNQMRSDSLAPDAITLVSALSASAILET--------SFYLS 480
NSMISGYAQ+GSAYDALRLFNQMRS SLAPDAITLVS LSASA L ++ +
Sbjct: 421 NSMISGYAQNGSAYDALRLFNQMRSYSLAPDAITLVSTLSASATLGAVQVGSSLHAYSVK 480
Query: 481 NSIANSEIMLTKQQLEFL--CGINSSAQNTNDS 504
+ +S + + L F CG SA+ DS
Sbjct: 481 EGLFSSNLYIGTALLNFYAKCGDAKSARTVFDS 512
BLAST of Cp4.1LG01g16670 vs. NCBI nr
Match:
gi|700195421|gb|KGN50598.1| (hypothetical protein Csa_5G189930 [Cucumis sativus])
HSP 1 Score: 786.9 bits (2031), Expect = 2.2e-224
Identity = 393/465 (84.52%), Positives = 420/465 (90.32%), Query Frame = 1
Query: 1 MLQRFFSLPRIFSRLAGPLLEMGHHVSYSTYASQPPLSDLDQTMASVQFISLHSCFYFMG 60
MLQRF PR F RLA PLL+MGH +SYSTYAS PPLSDL QTM SVQFISL C Y MG
Sbjct: 13 MLQRF---PRAFFRLASPLLQMGHRMSYSTYASHPPLSDLHQTMPSVQFISLPLCCYLMG 72
Query: 61 LCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVGSARMVFDQMRDPDFYAWKV 120
L RNIDTLIKFHGLLIVHGL+GNLLCDTKLVGVYGALGDV SARMVFDQM +PDFYAWKV
Sbjct: 73 LFRNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKV 132
Query: 121 MIRWYFLNDMFASIIPFYNRMRMSFKECDNIIFSIILKACSELREIDEGRKVHCQIVKVG 180
MIRWYFLND+F +IPFYNRMRMSF+ECDNIIFSIILKACSELREI EGRKVHCQIVKVG
Sbjct: 133 MIRWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVKVG 192
Query: 181 GPDSFVLTGLIDMYGKCGQIEWSSAVFEGIIDKNVVSWTTMIAGYVQNDCAEEGLVLFNR 240
GPDSFV+TGLIDMYGKCGQ+E SSAVFE I+DKNVVSWT+MIAGYVQN+CAEEGLVLFNR
Sbjct: 193 GPDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNR 252
Query: 241 MRESLVESNQFTLGSIITACTRLRALHQGKWVHGYAIKNVIIELNSFLATAFLDMYVKCG 300
MR++LVESN FTLGSII ACT+LRALHQGKWVHGYAIKN I EL+SFLAT FLDMYVKCG
Sbjct: 253 MRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKN-IAELSSFLATTFLDMYVKCG 312
Query: 301 QTRDAHMIFDELPSIDLVSWTAMIVGYSQAGQPNEALRLFTDKIRSGLLPNSVTAASILS 360
QTRDA MI+DELP+IDLVSWTAMIVGY+QA QPN+ LRLF D+IRS LLPNSVTAAS+LS
Sbjct: 313 QTRDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLS 372
Query: 361 ACSVSGNLSIGMLVHGLGIKLGLEECAVKNALIDMYAKCHMIDDAYVVFLGVLEKDVITW 420
ACSVSGNL++GM VHGLGIK+GLEEC VKNALIDMYAKCH I DAY +F GVLEKDVITW
Sbjct: 373 ACSVSGNLNLGMSVHGLGIKIGLEECVVKNALIDMYAKCHKIGDAYAIFHGVLEKDVITW 432
Query: 421 NSMISGYAQSGSAYDALRLFNQMRSDSLAPDAITLVSALSASAIL 466
NSMISGYAQ+GSAYDALRLFNQMR LAPD ITLVS LSA A L
Sbjct: 433 NSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSTLSACATL 473
BLAST of Cp4.1LG01g16670 vs. NCBI nr
Match:
gi|778708407|ref|XP_011656184.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g03380, mitochondrial-like [Cucumis sativus])
HSP 1 Score: 782.3 bits (2019), Expect = 5.3e-223
Identity = 390/465 (83.87%), Positives = 420/465 (90.32%), Query Frame = 1
Query: 1 MLQRFFSLPRIFSRLAGPLLEMGHHVSYSTYASQPPLSDLDQTMASVQFISLHSCFYFMG 60
MLQRF PR F RLA PLL+MGH +SYSTYAS PPLSDL QTM SVQFISL C Y MG
Sbjct: 13 MLQRF---PRAFFRLASPLLQMGHRMSYSTYASHPPLSDLHQTMPSVQFISLPLCCYLMG 72
Query: 61 LCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVGSARMVFDQMRDPDFYAWKV 120
L RNIDTLIKFHGLLIVHGL+GNLLCDTKLVGVYGALGDV SARMVFDQM +PDFYAWKV
Sbjct: 73 LFRNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKV 132
Query: 121 MIRWYFLNDMFASIIPFYNRMRMSFKECDNIIFSIILKACSELREIDEGRKVHCQIVKVG 180
MIRWYFLND+F +IPFYNRMRMSF+ECDNIIFSIILKACSELREI EGRKVHCQIVKVG
Sbjct: 133 MIRWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVKVG 192
Query: 181 GPDSFVLTGLIDMYGKCGQIEWSSAVFEGIIDKNVVSWTTMIAGYVQNDCAEEGLVLFNR 240
GPDSFV+TGLIDMYGKCGQ+E SSAVFE I+DKNVVSWT+MIAGYVQN+CAEEGLVLFNR
Sbjct: 193 GPDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNR 252
Query: 241 MRESLVESNQFTLGSIITACTRLRALHQGKWVHGYAIKNVIIELNSFLATAFLDMYVKCG 300
MR++LVESN FTLGSII ACT+LRALHQGKWVHGYAIKN I EL+SFLAT FLDMYVKCG
Sbjct: 253 MRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKN-IAELSSFLATTFLDMYVKCG 312
Query: 301 QTRDAHMIFDELPSIDLVSWTAMIVGYSQAGQPNEALRLFTDKIRSGLLPNSVTAASILS 360
QTRDA MI+DELP+IDLVSWTAMIVGY+QA QPN+ LRLF D+IRS LLPNSVTAAS+LS
Sbjct: 313 QTRDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLS 372
Query: 361 ACSVSGNLSIGMLVHGLGIKLGLEECAVKNALIDMYAKCHMIDDAYVVFLGVLEKDVITW 420
ACSVSGNL++GM VHGLGIK+GLEEC VKNALIDMYAKCH I DAY +F GVLEKDVITW
Sbjct: 373 ACSVSGNLNLGMSVHGLGIKIGLEECVVKNALIDMYAKCHKIGDAYAIFHGVLEKDVITW 432
Query: 421 NSMISGYAQSGSAYDALRLFNQMRSDSLAPDAITLVSALSASAIL 466
NSMISGYAQ+GSAYDALRLFNQMR LAPD ITLVS + SA++
Sbjct: 433 NSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSWFTWSAMI 473
BLAST of Cp4.1LG01g16670 vs. NCBI nr
Match:
gi|590699556|ref|XP_007045956.1| (Pentatricopeptide repeat superfamily protein [Theobroma cacao])
HSP 1 Score: 597.4 bits (1539), Expect = 2.4e-167
Identity = 287/438 (65.53%), Positives = 353/438 (80.59%), Query Frame = 1
Query: 30 TYASQPPLS--DLDQTMASVQFISLHSCFYFMGLCRNIDTLIKFHGLLIVHGLVGNLLCD 89
+Y + PL +D+T+AS+ ISL+ CF +GLCRNID+L K H L +++G+ G+LLCD
Sbjct: 30 SYTTDHPLEYPSMDRTLASMHSISLNPCFALLGLCRNIDSLKKVHALFVINGIKGDLLCD 89
Query: 90 TKLVGVYGALGDVGSARMVFDQMRDPDFYAWKVMIRWYFLNDMFASIIPFYNRMRMSFKE 149
TKLV +YG G +G AR++FDQ+ DPDFY+WKVMIRWYFLND+ II FY RMRMS +
Sbjct: 90 TKLVSLYGLFGHIGCARLMFDQIPDPDFYSWKVMIRWYFLNDLCMEIIGFYARMRMSVRM 149
Query: 150 CDNIIFSIILKACSELREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYGKCGQIEWSSAVF 209
CDN++FS++LKACSE+R+IDEGRKVHCQIVK G PDSFV TGL+DMY KCG+IE S VF
Sbjct: 150 CDNVVFSVVLKACSEMRDIDEGRKVHCQIVKAGNPDSFVQTGLVDMYAKCGEIECSRKVF 209
Query: 210 EGIIDKNVVSWTTMIAGYVQNDCAEEGLVLFNRMRESLVESNQFTLGSIITACTRLRALH 269
IID+NVVSWT+MIAGYVQNDCAE+ LVLFNRMRE++VE N+FTLGS++TAC +L ALH
Sbjct: 210 SEIIDRNVVSWTSMIAGYVQNDCAEDALVLFNRMREAMVEGNEFTLGSLVTACGKLGALH 269
Query: 270 QGKWVHGYAIKNVIIELNSFLATAFLDMYVKCGQTRDAHMIFDELPSIDLVSWTAMIVGY 329
QGKWVHGY IKN IELNS+ T LDMYVKCG RDA +FDEL S+DLVSWTAMIVGY
Sbjct: 270 QGKWVHGYVIKNG-IELNSYSVTTLLDMYVKCGSIRDARSVFDELSSVDLVSWTAMIVGY 329
Query: 330 SQAGQPNEALRLFTDKIRSGLLPNSVTAASILSACSVSGNLSIGMLVHGLGIKLGLEECA 389
SQ+G P+EAL+LF DK G+LPN+VT AS+LSAC+ NLS G LVH LGI+LGL++
Sbjct: 330 SQSGFPDEALKLFIDKKWFGILPNAVTIASLLSACAQLSNLSFGRLVHALGIQLGLKDST 389
Query: 390 VKNALIDMYAKCHMIDDAYVVFLGVLEKDVITWNSMISGYAQSGSAYDALRLFNQMRSDS 449
V NAL+DMYAKC MI DA +F V +K++I WNS+ISGY+Q+GSAY+A LF+QMRS S
Sbjct: 390 VINALVDMYAKCGMIGDARYIFETVSDKNIIAWNSIISGYSQNGSAYEAFELFHQMRSKS 449
Query: 450 LAPDAITLVSALSASAIL 466
++PDA+T+VS SA A L
Sbjct: 450 VSPDAVTVVSIFSACASL 466
BLAST of Cp4.1LG01g16670 vs. NCBI nr
Match:
gi|645229621|ref|XP_008221546.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g03380, mitochondrial [Prunus mume])
HSP 1 Score: 583.9 bits (1504), Expect = 2.8e-163
Identity = 276/441 (62.59%), Positives = 356/441 (80.73%), Query Frame = 1
Query: 25 HVSYSTYASQPPLSDLDQTMASVQFISLHSCFYFMGLCRNIDTLIKFHGLLIVHGLVGNL 84
H Y +S+PP DL +T+AS + + + CF + LCRNID+L K H LL++HGL +L
Sbjct: 28 HAIYQLPSSEPP--DLSETLASTRSVFSNPCFNLLVLCRNIDSLKKVHSLLVLHGLSDDL 87
Query: 85 LCDTKLVGVYGALGDVGSARMVFDQMRDPDFYAWKVMIRWYFLNDMFASIIPFYNRMRMS 144
LC TKL+ +YG+ G V AR++FDQM PDFY+WKVM+RWYF+++++A ++ FY MR+
Sbjct: 88 LCRTKLISLYGSFGYVKCARLLFDQMPSPDFYSWKVMLRWYFMHNLYAEVMGFYTHMRIC 147
Query: 145 FKECDNIIFSIILKACSELREIDEGRKVHCQIVKVGGPDSFVLTGLIDMYGKCGQIEWSS 204
+E DN++FSI+LKACSELR+ +EGRKVHCQ+VKV PDSFVLTGL+D+Y KCG IE S
Sbjct: 148 VREHDNVVFSIVLKACSELRDFNEGRKVHCQVVKVASPDSFVLTGLVDVYAKCGWIECSR 207
Query: 205 AVFEGIIDKNVVSWTTMIAGYVQNDCAEEGLVLFNRMRESLVESNQFTLGSIITACTRLR 264
AVF+GI+D+NVV WT+MI GYVQNDC ++GLVLFNRMRE L++ NQFTLGS++TACT+LR
Sbjct: 208 AVFDGIVDRNVVCWTSMIVGYVQNDCPQDGLVLFNRMREELIKGNQFTLGSVLTACTKLR 267
Query: 265 ALHQGKWVHGYAIKNVIIELNSFLATAFLDMYVKCGQTRDAHMIFDELPSIDLVSWTAMI 324
ALHQGKW+HG+ IK IE++SFL T+ LDMYVKCG R A IFDELP+IDLVSWTAMI
Sbjct: 268 ALHQGKWIHGHLIKTG-IEVSSFLVTSLLDMYVKCGDIRYARSIFDELPAIDLVSWTAMI 327
Query: 325 VGYSQAGQPNEALRLFTDKIRSGLLPNSVTAASILSACSVSGNLSIGMLVHGLGIKLGLE 384
VGY+Q+G P+EAL+LFTD+ GLLPNS+T AS+LS+C+ S NL++G +HGLGIKLGLE
Sbjct: 328 VGYTQSGCPDEALKLFTDEKWVGLLPNSITTASVLSSCAQSYNLNLGRSIHGLGIKLGLE 387
Query: 385 ECAVKNALIDMYAKCHMIDDAYVVFLGVLEKDVITWNSMISGYAQSGSAYDALRLFNQMR 444
+ V+NAL+DMYAKCHMI DA +F +L+K+VI WNS+ISGY+Q+GSA +AL+LF+QMR
Sbjct: 388 DSTVRNALVDMYAKCHMIGDARYIFETILDKNVIAWNSIISGYSQNGSACEALQLFHQMR 447
Query: 445 SDSLAPDAITLVSALSASAIL 466
S+S + DA TL S LSA L
Sbjct: 448 SESFSHDAFTLASVLSACTTL 465
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
PP146_ARATH | 2.3e-117 | 50.36 | Pentatricopeptide repeat-containing protein At2g03380, mitochondrial OS=Arabidop...
PP224_ARATH | 6.6e-64 | 32.95 | Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana GN...
PP319_ARATH | 1.1e-63 | 36.78 | Pentatricopeptide repeat-containing protein At4g18520 OS=Arabidopsis thaliana GN...
PPR32_ARATH | 2.1e-62 | 33.10 | Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
PP214_ARATH | 5.8e-60 | 33.66 | Putative pentatricopeptide repeat-containing protein At3g05240 OS=Arabidopsis th...
Match Name | E-value | Identity (%) | Description
A0A0A0KLZ1_CUCSA | 1.5e-224 | 84.52 | Uncharacterized protein OS=Cucumis sativus GN=Csa_5G189930 PE=4 SV=1
A0A061EAZ8_THECC | 1.7e-167 | 65.53 | Pentatricopeptide repeat superfamily protein OS=Theobroma cacao GN=TCM_011604 PE...
A5AY98_VITVI | 1.4e-161 | 64.79 | Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g00900 PE=4 SV=...
A0A0D2RH88_GOSRA | 4.6e-157 | 57.99 | Uncharacterized protein OS=Gossypium raimondii GN=B456_008G178400 PE=4 SV=1
A0A067L9V5_JATCU | 7.9e-157 | 57.49 | Uncharacterized protein OS=Jatropha curcas GN=JCGZ_15672 PE=4 SV=1
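
The pipe-delimited hit rows above can be read programmatically. The following is a minimal sketch, assuming each row has at least four fields in the order match name, E-value, percent identity, description. The names `BlastHit` and `parse_hit_row` are hypothetical and not part of any BLAST tool. Any trailing fields are ignored.

```python
from typing import NamedTuple


class BlastHit(NamedTuple):
    """One row of the summary table (hypothetical container)."""
    name: str
    evalue: float
    identity_pct: float
    description: str


def parse_hit_row(row: str) -> BlastHit:
    """Parse 'name | e-value | identity | description [| ...]' into a BlastHit."""
    fields = [field.strip() for field in row.split("|")]
    if len(fields) < 4:
        raise ValueError(f"expected at least 4 fields, got {len(fields)}")
    name, evalue, identity, description = fields[:4]
    return BlastHit(name, float(evalue), float(identity), description)


# Two rows copied from the tables above (descriptions kept as truncated there).
rows = [
    "PP146_ARATH | 2.3e-117 | 50.36 | Pentatricopeptide repeat-containing protein At2g03380, mitochondrial OS=Arabidop...",
    "A0A0A0KLZ1_CUCSA | 1.5e-224 | 84.52 | Uncharacterized protein OS=Cucumis sativus GN=Csa_5G189930 PE=4 SV=1",
]

# Sort by E-value so the most significant hit comes first.
hits = sorted((parse_hit_row(r) for r in rows), key=lambda h: h.evalue)
for hit in hits:
    print(hit.name, hit.evalue, hit.identity_pct)
```

Sorting on the numeric E-value rather than on the string avoids misordering exponents such as e-64 and e-117.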