Homology
BLAST of HG10021841 vs. NCBI nr
Match:
XP_038893523.1 (pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Benincasa hispida])
HSP 1 Score: 1418.7 bits (3671), Expect = 0.0e+00
Identity = 699/733 (95.36%), Positives = 715/733 (97.54%), Query Frame = 0
Query: 65 MEALSVPLISLQNFPTPNNNLPFRNHQILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFSA 124
MEALSVPLISLQNFPTPN+NLPFRNHQILSTIDQCSS KQLKQVHAHMLRTGLFFDPFSA
Sbjct: 1 MEALSVPLISLQNFPTPNDNLPFRNHQILSTIDQCSSPKQLKQVHAHMLRTGLFFDPFSA 60
Query: 125 SKLFTASALSSFSTLDYARDVFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 184
SKLFTASALSSFSTLDYA +VFDQI PNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC
Sbjct: 61 SKLFTASALSSFSTLDYALNVFDQISHPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 120
Query: 185 EDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGDLNMA 244
+DLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYG CGDLNMA
Sbjct: 121 DDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGTCGDLNMA 180
Query: 245 ERLFEGISCKDVVSWNSMISAFAQGNFPEDALDLFLKMEGENVMPNSVTMVGVLSACAKK 304
ERLFEGISCKDVVSWNSMISAFAQGN PEDALDLFLKME ENVMPNSVTMVGVLSACAKK
Sbjct: 181 ERLFEGISCKDVVSWNSMISAFAQGNCPEDALDLFLKMERENVMPNSVTMVGVLSACAKK 240
Query: 305 LDLEFGRWVCSYIERKEIKVDLTLCNAMLDMYSKCGSIDVAQKLFDEMPERDVFSWTTML 364
LDLEFGRWVCSYIERKEIKVDLTLCNAMLDMY+KCGSID AQKLFDEMPERDVFSWTTML
Sbjct: 241 LDLEFGRWVCSYIERKEIKVDLTLCNAMLDMYTKCGSIDDAQKLFDEMPERDVFSWTTML 300
Query: 365 DGYAKMGDFDAARQVFDAMPVKEIAAWNVLISACEQNGKPKEALATFNELQVSKIAKPDE 424
DGYAKMGDFDAAR+VFDAMPVKEIAAWNVLISA EQNG PKEALATFNELQ+SKIAKPDE
Sbjct: 301 DGYAKMGDFDAARRVFDAMPVKEIAAWNVLISAYEQNGNPKEALATFNELQLSKIAKPDE 360
Query: 425 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 484
VTLVSTLSAC+QLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS
Sbjct: 361 VTLVSTLSACSQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 420
Query: 485 VEEKDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEG 544
VE +DVYVWSAMIAGLGMHGRGKAAI+LFFEMQEAKVKPN+VTF NVLCACSHAGLVDEG
Sbjct: 421 VEVRDVYVWSAMIAGLGMHGRGKAAINLFFEMQEAKVKPNSVTFMNVLCACSHAGLVDEG 480
Query: 545 RAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGACS 604
RAF HEMEP+YGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSAS+WGALLGACS
Sbjct: 481 RAFLHEMEPIYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASIWGALLGACS 540
Query: 605 LHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGC 664
LHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDS+LKKEPGC
Sbjct: 541 LHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDSKLKKEPGC 600
Query: 665 SSIEVNGNVHEFLVGDNSHLLSSDIYSKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQ 724
SSIEV+GNVHEFLVGDNSH LSS IY KLDEIATKLKSVGYEPNKSHLLQ IE+DDLKEQ
Sbjct: 601 SSIEVDGNVHEFLVGDNSHPLSSKIYLKLDEIATKLKSVGYEPNKSHLLQFIEEDDLKEQ 660
Query: 725 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFH 784
ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDR+ILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDRDILLRDRYRFH 720
Query: 785 HFRDGHCSCMDYW 798
HFRDGHCSC DYW
Sbjct: 721 HFRDGHCSCRDYW 733
BLAST of HG10021841 vs. NCBI nr
Match:
KAA0031814.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])
HSP 1 Score: 1393.6 bits (3606), Expect = 0.0e+00
Identity = 689/733 (94.00%), Positives = 709/733 (96.73%), Query Frame = 0
Query: 65 MEALSVPLISLQNFPTPNNNLPFRNHQILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFSA 124
MEALSVPLISLQNF T NNNLPFRNHQILS ID+CSSSKQLK+VHA MLRTGLFFDPFSA
Sbjct: 1 MEALSVPLISLQNFSTLNNNLPFRNHQILSAIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60
Query: 125 SKLFTASALSSFSTLDYARDVFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 184
SKLFTASALSSFSTLDYAR+VFDQIPQPNLYTWN LIRAYASSSDPFQSFVIFLDLLDKC
Sbjct: 61 SKLFTASALSSFSTLDYARNVFDQIPQPNLYTWNILIRAYASSSDPFQSFVIFLDLLDKC 120
Query: 185 EDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGDLNMA 244
EDLPNNFTFPFVIKAASELKASRVG AVHGMAIKLSFGMDLYILNSLVRFYGACGDL+MA
Sbjct: 121 EDLPNNFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180
Query: 245 ERLFEGISCKDVVSWNSMISAFAQGNFPEDALDLFLKMEGENVMPNSVTMVGVLSACAKK 304
ERLF+GISCKDVVSWNSMISAFAQGN PEDAL+LFLKME ENVMPNSVTMV VLSACAKK
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVCVLSACAKK 240
Query: 305 LDLEFGRWVCSYIERKEIKVDLTLCNAMLDMYSKCGSIDVAQKLFDEMPERDVFSWTTML 364
LDLEFGRWVCSYIERK IK+DLTL NAMLDMY+KCGS+D AQKLFDEMPERDVFSWT ML
Sbjct: 241 LDLEFGRWVCSYIERKGIKMDLTLRNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300
Query: 365 DGYAKMGDFDAARQVFDAMPVKEIAAWNVLISACEQNGKPKEALATFNELQVSKIAKPDE 424
DGYAKMGD+DAAR VF+AMPVKEIAAWNVLISA EQNGKPKEALATFNELQ+SKIAKPDE
Sbjct: 301 DGYAKMGDYDAARGVFNAMPVKEIAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDE 360
Query: 425 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 484
VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 420
Query: 485 VEEKDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEG 544
VEE+DVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPN+VTFTNVLCACSH GLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHTGLVDEG 480
Query: 545 RAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGACS 604
R FFHEMEPVYGVVP TKHYACMVDILGRAGFLEEAMELINEM ITPSASVWGALLGACS
Sbjct: 481 RVFFHEMEPVYGVVPETKHYACMVDILGRAGFLEEAMELINEMSITPSASVWGALLGACS 540
Query: 605 LHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGC 664
LHMNVEL ELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRD+ELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600
Query: 665 SSIEVNGNVHEFLVGDNSHLLSSDIYSKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQ 724
SSIEVNGNVHEFLVGDN H LSS+IYSKLD+IATKLK VGYEPNKSHLLQLIE+DDLKEQ
Sbjct: 601 SSIEVNGNVHEFLVGDNLHPLSSNIYSKLDDIATKLKPVGYEPNKSHLLQLIEEDDLKEQ 660
Query: 725 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFH 784
ALSLHSEKLAIAFGL+SLAPSQPIRVVKNLRICGDCHE AKLVSRVYDR+ILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVSLAPSQPIRVVKNLRICGDCHEFAKLVSRVYDRDILLRDRYRFH 720
Query: 785 HFRDGHCSCMDYW 798
HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733
BLAST of HG10021841 vs. NCBI nr
Match:
XP_008457379.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Cucumis melo] >TYJ97320.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])
HSP 1 Score: 1391.3 bits (3600), Expect = 0.0e+00
Identity = 688/733 (93.86%), Positives = 708/733 (96.59%), Query Frame = 0
Query: 65 MEALSVPLISLQNFPTPNNNLPFRNHQILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFSA 124
MEALSVPLISLQNF T NNNLPFRNHQILS ID+CSSSKQLK+VHA MLRTGLFFDPFSA
Sbjct: 1 MEALSVPLISLQNFSTLNNNLPFRNHQILSAIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60
Query: 125 SKLFTASALSSFSTLDYARDVFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 184
SKLFTASALSSFSTLDYAR+VFDQIPQPNLYTWN LIRAYASSSDPFQSFVIFLDLLDKC
Sbjct: 61 SKLFTASALSSFSTLDYARNVFDQIPQPNLYTWNILIRAYASSSDPFQSFVIFLDLLDKC 120
Query: 185 EDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGDLNMA 244
EDLPNNFTFPFVIKAASELKASRVG AVHGMAIKLSFGMDLYILNSLVRFYGACGDL+MA
Sbjct: 121 EDLPNNFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180
Query: 245 ERLFEGISCKDVVSWNSMISAFAQGNFPEDALDLFLKMEGENVMPNSVTMVGVLSACAKK 304
ERLF+GISCKDVVSWNSMISAFAQGN PEDAL+LFLKME ENVMPNSVTMV VLSACAKK
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVCVLSACAKK 240
Query: 305 LDLEFGRWVCSYIERKEIKVDLTLCNAMLDMYSKCGSIDVAQKLFDEMPERDVFSWTTML 364
LDLEFGRWVCSYIERK IK+DLTL NAMLDMY+KCGS+D AQKLFDEMPERDVFSWT ML
Sbjct: 241 LDLEFGRWVCSYIERKGIKMDLTLRNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300
Query: 365 DGYAKMGDFDAARQVFDAMPVKEIAAWNVLISACEQNGKPKEALATFNELQVSKIAKPDE 424
DGYAKMGD+DAAR VF+AMPVKEIAAWNVLISA EQNGKPKEALA FNELQ+SKIAKPDE
Sbjct: 301 DGYAKMGDYDAARGVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360
Query: 425 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 484
VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 420
Query: 485 VEEKDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEG 544
VEE+DVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPN+VTFTNVLCACSH GLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHTGLVDEG 480
Query: 545 RAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGACS 604
R FFHEMEPVYGVVP TKHYACMVDILGRAGFLEEAMELINEM ITPSASVWGALLGACS
Sbjct: 481 RVFFHEMEPVYGVVPETKHYACMVDILGRAGFLEEAMELINEMSITPSASVWGALLGACS 540
Query: 605 LHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGC 664
LHMNVEL ELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRD+ELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600
Query: 665 SSIEVNGNVHEFLVGDNSHLLSSDIYSKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQ 724
SSIEVNGNVHEFLVGDN H LSS+IYSKLD+IATKLK VGYEPNKSHLLQLIE+DDLKEQ
Sbjct: 601 SSIEVNGNVHEFLVGDNLHPLSSNIYSKLDDIATKLKPVGYEPNKSHLLQLIEEDDLKEQ 660
Query: 725 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFH 784
ALSLHSEKLAIAFGL+SLAPSQPIRVVKNLRICGDCHE AKLVSRVYDR+ILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVSLAPSQPIRVVKNLRICGDCHEFAKLVSRVYDRDILLRDRYRFH 720
Query: 785 HFRDGHCSCMDYW 798
HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733
BLAST of HG10021841 vs. NCBI nr
Match:
XP_004145320.1 (pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Cucumis sativus] >KGN65801.1 hypothetical protein Csa_023315 [Cucumis sativus])
HSP 1 Score: 1385.5 bits (3585), Expect = 0.0e+00
Identity = 684/733 (93.32%), Positives = 707/733 (96.45%), Query Frame = 0
Query: 65 MEALSVPLISLQNFPTPNNNLPFRNHQILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFSA 124
MEALSVP ISLQNF T NNNL FRNHQILSTID+CSSSKQLK+VHA MLRTGLFFDPFSA
Sbjct: 1 MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60
Query: 125 SKLFTASALSSFSTLDYARDVFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 184
SKLFTASALSSFSTLDYAR++FDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC
Sbjct: 61 SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 120
Query: 185 EDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGDLNMA 244
EDLPN FTFPFVIKAASELKASRVG AVHGMAIKLSFGMDLYILNSLVRFYGACGDL+MA
Sbjct: 121 EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180
Query: 245 ERLFEGISCKDVVSWNSMISAFAQGNFPEDALDLFLKMEGENVMPNSVTMVGVLSACAKK 304
ERLF+GISCKDVVSWNSMISAFAQGN PEDAL+LFLKME ENVMPNSVTMVGVLSACAKK
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK 240
Query: 305 LDLEFGRWVCSYIERKEIKVDLTLCNAMLDMYSKCGSIDVAQKLFDEMPERDVFSWTTML 364
LDLEFGRWVCSYIERK IKVDLTLCNAMLDMY+KCGS+D AQKLFDEMPERDVFSWT ML
Sbjct: 241 LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300
Query: 365 DGYAKMGDFDAARQVFDAMPVKEIAAWNVLISACEQNGKPKEALATFNELQVSKIAKPDE 424
DGYAKMGD+DAAR VF+AMPVKEIAAWNVLISA EQNGKPKEALA FNELQ+SKIAKPDE
Sbjct: 301 DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360
Query: 425 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 484
VTLVSTLSACAQLGAIDLGGWIHVYIKREGI LNCHLISSLVDMYAKCG+LEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS 420
Query: 485 VEEKDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEG 544
VEE+DVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPN+VTFTNVLCACSHAGLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480
Query: 545 RAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGACS 604
R FFHEMEPVYGVVP KHYACMVDILGRAGFLEEAMELINEM TPSASVWGALLGACS
Sbjct: 481 RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS 540
Query: 605 LHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGC 664
LHMNVEL ELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRD+ELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600
Query: 665 SSIEVNGNVHEFLVGDNSHLLSSDIYSKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQ 724
SSIE NGNVHEFLVGDN+H LSS+IYSKL+EIATKLKSVGYEPNKSHLLQLIE+DDLKEQ
Sbjct: 601 SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ 660
Query: 725 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFH 784
ALSLHSEKLAIAFGL++LAPSQPIRVVKNLRICGDCH AKLVSRVYDR+ILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH 720
Query: 785 HFRDGHCSCMDYW 798
HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733
BLAST of HG10021841 vs. NCBI nr
Match:
XP_022964665.1 (pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Cucurbita moschata])
HSP 1 Score: 1359.4 bits (3517), Expect = 0.0e+00
Identity = 666/733 (90.86%), Positives = 699/733 (95.36%), Query Frame = 0
Query: 65 MEALSVPLISLQNFPTPNNNLPFRNHQILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFSA 124
ME LS PL+SL N +NNL FRNHQILSTIDQCSS KQLKQVHA MLRTGLFFDPFSA
Sbjct: 1 METLSAPLVSLPNRSIADNNLHFRNHQILSTIDQCSSGKQLKQVHAQMLRTGLFFDPFSA 60
Query: 125 SKLFTASALSSFSTLDYARDVFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 184
SKL ASAL S STL+YARDVFDQIP PNLYTWNTLIRAYASS+DPFQSFVIFL LLD+C
Sbjct: 61 SKLIAASALKSSSTLEYARDVFDQIPHPNLYTWNTLIRAYASSADPFQSFVIFLALLDEC 120
Query: 185 EDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGDLNMA 244
+DLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLS GMD YILNSLVRFYGACGDLNMA
Sbjct: 121 DDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSLGMDQYILNSLVRFYGACGDLNMA 180
Query: 245 ERLFEGISCKDVVSWNSMISAFAQGNFPEDALDLFLKMEGENVMPNSVTMVGVLSACAKK 304
ERLFEGISCKDVVSWNSMISAFAQGN PEDAL+LFLKMEG NVMPNSVTMVGVLSACAKK
Sbjct: 181 ERLFEGISCKDVVSWNSMISAFAQGNCPEDALELFLKMEGANVMPNSVTMVGVLSACAKK 240
Query: 305 LDLEFGRWVCSYIERKEIKVDLTLCNAMLDMYSKCGSIDVAQKLFDEMPERDVFSWTTML 364
LDLEFGRWVCSYIERKEI VDLTLCNAMLDMY+KCGSI A+KLFDEMPERDVFSWTTML
Sbjct: 241 LDLEFGRWVCSYIERKEISVDLTLCNAMLDMYTKCGSIGDAEKLFDEMPERDVFSWTTML 300
Query: 365 DGYAKMGDFDAARQVFDAMPVKEIAAWNVLISACEQNGKPKEALATFNELQVSKIAKPDE 424
DGYAKMGDF+AAR+VFD MPVKEIAAWN LISA E+NGKPKEALATFNELQ+SKIAKPDE
Sbjct: 301 DGYAKMGDFNAARKVFDEMPVKEIAAWNALISAYERNGKPKEALATFNELQLSKIAKPDE 360
Query: 425 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 484
VTLVS+LSACAQLGAIDLGGWIHVYIKREGI+LN HLI+SL+DMYAKCGALEKALEVFY+
Sbjct: 361 VTLVSSLSACAQLGAIDLGGWIHVYIKREGINLNGHLITSLIDMYAKCGALEKALEVFYA 420
Query: 485 VEEKDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEG 544
VEEKDVYVWSAMIAGLGMHGRGKAAI+LFF+MQEAKVKPN+VTFTN+LCACSHAGLVDEG
Sbjct: 421 VEEKDVYVWSAMIAGLGMHGRGKAAIELFFKMQEAKVKPNDVTFTNLLCACSHAGLVDEG 480
Query: 545 RAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGACS 604
RA FHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMP TPSASVWGALLGACS
Sbjct: 481 RALFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPTTPSASVWGALLGACS 540
Query: 605 LHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGC 664
LHMNVELAELASDQLLKLEPRNHGAI+LLSN+YAKTGRW+KVSELRKLMRDSELKKEPGC
Sbjct: 541 LHMNVELAELASDQLLKLEPRNHGAIILLSNVYAKTGRWDKVSELRKLMRDSELKKEPGC 600
Query: 665 SSIEVNGNVHEFLVGDNSHLLSSDIYSKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQ 724
SS+EVNG VHEFLVGDNSH LS DIYSKLDEIA KLKSVGYEPNKSHLLQLIE+DD+KE
Sbjct: 601 SSVEVNGIVHEFLVGDNSHPLSRDIYSKLDEIAAKLKSVGYEPNKSHLLQLIEEDDVKEH 660
Query: 725 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFH 784
ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKL+SRVY+R+IL++DRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLISRVYNRDILVQDRYRFH 720
Query: 785 HFRDGHCSCMDYW 798
HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733
BLAST of HG10021841 vs. ExPASy Swiss-Prot
Match:
O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)
HSP 1 Score: 939.9 bits (2428), Expect = 1.9e-272
Identity = 447/727 (61.49%), Positives = 569/727 (78.27%), Query Frame = 0
Query: 71 PLISLQNFPTPNNNLPFRNHQILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFSASKLFTA 130
P S N PT NN + +S I++C S +QLKQ H HM+RTG F DP+SASKLF
Sbjct: 16 PNFSNPNQPTTNN----ERSRHISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAM 75
Query: 131 SALSSFSTLDYARDVFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNN 190
+ALSSF++L+YAR VFD+IP+PN + WNTLIRAYAS DP S FLD++ + + PN
Sbjct: 76 AALSSFASLEYARKVFDEIPKPNSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNK 135
Query: 191 FTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGDLNMAERLFEG 250
+TFPF+IKAA+E+ + +G+++HGMA+K + G D+++ NSL+ Y +CGDL+ A ++F
Sbjct: 136 YTFPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTT 195
Query: 251 ISCKDVVSWNSMISAFAQGNFPEDALDLFLKMEGENVMPNSVTMVGVLSACAKKLDLEFG 310
I KDVVSWNSMI+ F Q P+ AL+LF KME E+V + VTMVGVLSACAK +LEFG
Sbjct: 196 IKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFG 255
Query: 311 RWVCSYIERKEIKVDLTLCNAMLDMYSKCGSIDVAQKLFDEMPERDVFSWTTMLDGYAKM 370
R VCSYIE + V+LTL NAMLDMY+KCGSI+ A++LFD M E+D +WTTMLDGYA
Sbjct: 256 RQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAIS 315
Query: 371 GDFDAARQVFDAMPVKEIAAWNVLISACEQNGKPKEALATFNELQVSKIAKPDEVTLVST 430
D++AAR+V ++MP K+I AWN LISA EQNGKP EAL F+ELQ+ K K +++TLVST
Sbjct: 316 EDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVST 375
Query: 431 LSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYSVEEKDV 490
LSACAQ+GA++LG WIH YIK+ GI +N H+ S+L+ MY+KCG LEK+ EVF SVE++DV
Sbjct: 376 LSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDV 435
Query: 491 YVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEGRAFFHE 550
+VWSAMI GL MHG G A+D+F++MQEA VKPN VTFTNV CACSH GLVDE + FH+
Sbjct: 436 FVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQ 495
Query: 551 MEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGACSLHMNVE 610
ME YG+VP KHYAC+VD+LGR+G+LE+A++ I MPI PS SVWGALLGAC +H N+
Sbjct: 496 MESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLN 555
Query: 611 LAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGCSSIEVN 670
LAE+A +LL+LEPRN GA VLLSNIYAK G+WE VSELRK MR + LKKEPGCSSIE++
Sbjct: 556 LAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEID 615
Query: 671 GNVHEFLVGDNSHLLSSDIYSKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQALSLHS 730
G +HEFL GDN+H +S +Y KL E+ KLKS GYEP S +LQ+IE++++KEQ+L+LHS
Sbjct: 616 GMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHS 675
Query: 731 EKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFHHFRDGH 790
EKLAI +GLIS + IRV+KNLR+CGDCH VAKL+S++YDREI++RDRYRFHHFR+G
Sbjct: 676 EKLAICYGLISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQ 735
Query: 791 CSCMDYW 798
CSC D+W
Sbjct: 736 CSCNDFW 738
BLAST of HG10021841 vs. ExPASy Swiss-Prot
Match:
Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)
HSP 1 Score: 605.1 bits (1559), Expect = 1.1e-171
Identity = 321/768 (41.80%), Positives = 459/768 (59.77%), Query Frame = 0
Query: 68 LSVPLISLQNFPTPNNNLP----FRNHQILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFS 127
L+VP S P+++ P RNH LS + C + + L+ +HA M++ GL ++
Sbjct: 8 LTVPSSSYPFHFLPSSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYA 67
Query: 128 ASKLFTASALS-SFSTLDYARDVFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLD 187
SKL LS F L YA VF I +PNL WNT+ R +A SSDP + +++ ++
Sbjct: 68 LSKLIEFCILSPHFEGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMI- 127
Query: 188 KCEDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNS------------ 247
LPN++TFPFV+K+ ++ KA + G+ +HG +KL +DLY+ S
Sbjct: 128 SLGLLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLE 187
Query: 248 -------------------LVRFYGACGDLNMAERLFEGISCKDVVSWNSMISAFAQGNF 307
L++ Y + G + A++LF+ I KDVVSWN+MIS +A+
Sbjct: 188 DAHKVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGN 247
Query: 308 PEDALDLFLKMEGENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKEIKVDLTLCNA 367
++AL+LF M NV P+ TMV V+SACA+ +E GR V +I+ +L + NA
Sbjct: 248 YKEALELFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNA 307
Query: 368 MLDMYSKCGSIDVAQKLFDEMPERDVFSWTTMLDGYAKMGDFDAARQVFDAMPVKEIAAW 427
++D+YSKCG ++ A LF+ +P +DV SW T++ GY M +
Sbjct: 308 LIDLYSKCGELETACGLFERLPYKDVISWNTLIGGYTHMNLY------------------ 367
Query: 428 NVLISACEQNGKPKEALATFNELQVSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYIK 487
KEAL F E+ S P++VT++S L ACA LGAID+G WIHVYI
Sbjct: 368 -------------KEALLLFQEMLRSG-ETPNDVTMLSILPACAHLGAIDIGRWIHVYID 427
Query: 488 R--EGIDLNCHLISSLVDMYAKCGALEKALEVFYSVEEKDVYVWSAMIAGLGMHGRGKAA 547
+ +G+ L +SL+DMYAKCG +E A +VF S+ K + W+AMI G MHGR A+
Sbjct: 428 KRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADAS 487
Query: 548 IDLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEGRAFFHEMEPVYGVVPGTKHYACMVD 607
DLF M++ ++P+++TF +L ACSH+G++D GR F M Y + P +HY CM+D
Sbjct: 488 FDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMID 547
Query: 608 ILGRAGFLEEAMELINEMPITPSASVWGALLGACSLHMNVELAELASDQLLKLEPRNHGA 667
+LG +G +EA E+IN M + P +W +LL AC +H NVEL E ++ L+K+EP N G+
Sbjct: 548 LLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGS 607
Query: 668 IVLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGCSSIEVNGNVHEFLVGDNSHLLSSDI 727
VLLSNIYA GRW +V++ R L+ D +KK PGCSSIE++ VHEF++GD H + +I
Sbjct: 608 YVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREI 667
Query: 728 YSKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQALSLHSEKLAIAFGLISLAPSQPIR 787
Y L+E+ L+ G+ P+ S +LQ +E ++ KE AL HSEKLAIAFGLIS P +
Sbjct: 668 YGMLEEMEVLLEKAGFVPDTSEVLQEME-EEWKEGALRHHSEKLAIAFGLISTKPGTKLT 727
Query: 788 VVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFHHFRDGHCSCMDYW 798
+VKNLR+C +CHE KL+S++Y REI+ RDR RFHHFRDG CSC DYW
Sbjct: 728 IVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741
BLAST of HG10021841 vs. ExPASy Swiss-Prot
Match:
O23337 (Pentatricopeptide repeat-containing protein At4g14820 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H3 PE=2 SV=1)
HSP 1 Score: 566.2 bits (1458), Expect = 5.7e-160
Identity = 281/713 (39.41%), Positives = 438/713 (61.43%), Query Frame = 0
Query: 92 ILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFSASKLFTASALSSFSTLDYARDVFDQIPQ 151
IL + C S +KQ+HAH+LRT + S LF S SS L YA +VF IP
Sbjct: 15 ILEKLSFCKSLNHIKQLHAHILRT--VINHKLNSFLFNLSVSSSSINLSYALNVFSSIPS 74
Query: 152 -PNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNNFTFPFVIKAASELKASRVGR 211
P +N +R + SS+P ++ ++F + + F+F ++KA S++ A G
Sbjct: 75 PPESIVFNPFLRDLSRSSEP-RATILFYQRIRHVGGRLDQFSFLPILKAVSKVSALFEGM 134
Query: 212 AVHGMAIKLSFGMDLYILNSLVRFYGACGDLNMAERLFEGISCKDVVSWNSMISAFAQGN 271
+HG+A K++ D ++ + Y +CG +N A +F+ +S +DVV+WN+MI + +
Sbjct: 135 ELHGVAFKIATLCDPFVETGFMDMYASCGRINYARNVFDEMSHRDVVTWNTMIERYCRFG 194
Query: 272 FPEDALDLFLKMEGENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKEIKVDLTLCN 331
++A LF +M+ NVMP+ + + ++SAC + ++ + R + ++ ++++D L
Sbjct: 195 LVDEAFKLFEEMKDSNVMPDEMILCNIVSACGRTGNMRYNRAIYEFLIENDVRMDTHLLT 254
Query: 332 AMLDMYSKCGSIDVAQKLFDEMPERDVFSWTTMLDGYAKMGDFDAARQVFDAMPVKEIAA 391
A++ MY+ G +D+A++ F +M R++F T M+ GY+K G D A+ +FD K++
Sbjct: 255 ALVTMYAGAGCMDMAREFFRKMSVRNLFVSTAMVSGYSKCGRLDDAQVIFDQTEKKDLVC 314
Query: 392 WNVLISACEQNGKPKEALATFNELQVSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYI 451
W +ISA ++ P+EAL F E+ S I KPD V++ S +SACA LG +D W+H I
Sbjct: 315 WTTMISAYVESDYPQEALRVFEEMCCSGI-KPDVVSMFSVISACANLGILDKAKWVHSCI 374
Query: 452 KREGIDLNCHLISSLVDMYAKCGALEKALEVFYSVEEKDVYVWSAMIAGLGMHGRGKAAI 511
G++ + ++L++MYAKCG L+ +VF + ++V WS+MI L MHG A+
Sbjct: 375 HVNGLESELSINNALINMYAKCGGLDATRDVFEKMPRRNVVSWSSMINALSMHGEASDAL 434
Query: 512 DLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEGRAFFHEMEPVYGVVPGTKHYACMVDI 571
LF M++ V+PN VTF VL CSH+GLV+EG+ F M Y + P +HY CMVD+
Sbjct: 435 SLFARMKQENVEPNEVTFVGVLYGCSHSGLVEEGKKIFASMTDEYNITPKLEHYGCMVDL 494
Query: 572 LGRAGFLEEAMELINEMPITPSASVWGALLGACSLHMNVELAELASDQLLKLEPRNHGAI 631
GRA L EA+E+I MP+ + +WG+L+ AC +H +EL + A+ ++L+LEP + GA+
Sbjct: 495 FGRANLLREALEVIESMPVASNVVIWGSLMSACRIHGELELGKFAAKRILELEPDHDGAL 554
Query: 632 VLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGCSSIEVNGNVHEFLVGDNSHLLSSDIY 691
VL+SNIYA+ RWE V +R++M + + KE G S I+ NG HEFL+GD H S++IY
Sbjct: 555 VLMSNIYAREQRWEDVRNIRRVMEEKNVFKEKGLSRIDQNGKSHEFLIGDKRHKQSNEIY 614
Query: 692 SKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQALSLHSEKLAIAFGLISLAPSQP--- 751
+KLDE+ +KLK GY P+ +L +E+++ K+ L HSEKLA+ FGL++ +
Sbjct: 615 AKLDEVVSKLKLAGYVPDCGSVLVDVEEEEKKDLVL-WHSEKLALCFGLMNEEKEEEKDS 674
Query: 752 ---IRVVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFHHFRDGHCSCMDYW 798
IR+VKNLR+C DCH KLVS+VY+REI++RDR RFH +++G CSC DYW
Sbjct: 675 CGVIRIVKNLRVCEDCHLFFKLVSKVYEREIIVRDRTRFHCYKNGLCSCRDYW 722
BLAST of HG10021841 vs. ExPASy Swiss-Prot
Match:
Q9LTV8 (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H43 PE=2 SV=1)
HSP 1 Score: 543.5 bits (1399), Expect = 4.0e-153
Identity = 279/706 (39.52%), Positives = 425/706 (60.20%), Query Frame = 0
Query: 94 STIDQCSSSKQLKQVHAHMLRTGLFFDPFSASKLFTASALSSFSTLDYARDVFDQIPQPN 153
S ID + QLKQ+HA +L GL F F +KL AS SSF + +AR VFD +P+P
Sbjct: 26 SLIDSATHKAQLKQIHARLLVLGLQFSGFLITKLIHAS--SSFGDITFARQVFDDLPRPQ 85
Query: 154 LYTWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNNFTFPFVIKAASELKASRVGRAVH 213
++ WN +IR Y S ++ FQ ++ + P++FTFP ++KA S L ++GR VH
Sbjct: 86 IFPWNAIIRGY-SRNNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVH 145
Query: 214 GMAIKLSFGMDLYILNSLVRFYGACGDLNMAERLFEGISC--KDVVSWNSMISAFAQGNF 273
+L F D+++ N L+ Y C L A +FEG+ + +VSW +++SA+AQ
Sbjct: 146 AQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGE 205
Query: 274 PEDALDLFLKMEGENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKEIKVDLTLCNA 333
P +AL++F +M +V P+ V +V VL+A DL+ GR + + + + ++++ L +
Sbjct: 206 PMEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLIS 265
Query: 334 MLDMYSKCGSIDVAQKLFDEMPERDVFSWTTMLDGYAKMGDFDAARQVFDAMPVKEIAAW 393
+ MY+KCG + A+ LFD+M ++ W M+ GYAK
Sbjct: 266 LNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAK---------------------- 325
Query: 394 NVLISACEQNGKPKEALATFNELQVSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYIK 453
NG +EA+ F+E+ ++K +PD +++ S +SACAQ+G+++ ++ Y+
Sbjct: 326 ---------NGYAREAIDMFHEM-INKDVRPDTISITSAISACAQVGSLEQARSMYEYVG 385
Query: 454 REGIDLNCHLISSLVDMYAKCGALEKALEVFYSVEEKDVYVWSAMIAGLGMHGRGKAAID 513
R + + S+L+DM+AKCG++E A VF ++DV VWSAMI G G+HGR + AI
Sbjct: 386 RSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAIS 445
Query: 514 LFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEGRAFFHEMEPVYGVVPGTKHYACMVDIL 573
L+ M+ V PN+VTF +L AC+H+G+V EG FF+ M + + P +HYAC++D+L
Sbjct: 446 LYRAMERGGVHPNDVTFLGLLMACNHSGMVREGWWFFNRMAD-HKINPQQQHYACVIDLL 505
Query: 574 GRAGFLEEAMELINEMPITPSASVWGALLGACSLHMNVELAELASDQLLKLEPRNHGAIV 633
GRAG L++A E+I MP+ P +VWGALL AC H +VEL E A+ QL ++P N G V
Sbjct: 506 GRAGHLDQAYEVIKCMPVQPGVTVWGALLSACKKHRHVELGEYAAQQLFSIDPSNTGHYV 565
Query: 634 LLSNIYAKTGRWEKVSELRKLMRDSELKKEPGCSSIEVNGNVHEFLVGDNSHLLSSDIYS 693
LSN+YA W++V+E+R M++ L K+ GCS +EV G + F VGD SH +I
Sbjct: 566 QLSNLYAAARLWDRVAEVRVRMKEKGLNKDVGCSWVEVRGRLEAFRVGDKSHPRYEEIER 625
Query: 694 KLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQALSLHSEKLAIAFGLISLAPSQPIRVV 753
+++ I ++LK G+ NK L + D++ E+ L HSE++AIA+GLIS P+R+
Sbjct: 626 QVEWIESRLKEGGFVANKDASLHDLNDEE-AEETLCSHSERIAIAYGLISTPQGTPLRIT 685
Query: 754 KNLRICGDCHEVAKLVSRVYDREILLRDRYRFHHFRDGHCSCMDYW 798
KNLR C +CH KL+S++ DREI++RD RFHHF+DG CSC DYW
Sbjct: 686 KNLRACVNCHAATKLISKLVDREIVVRDTNRFHHFKDGVCSCGDYW 694
BLAST of HG10021841 vs. ExPASy Swiss-Prot
Match:
Q9LUJ2 (Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H56 PE=3 SV=1)
HSP 1 Score: 525.8 bits (1353), Expect = 8.6e-148
Identity = 264/695 (37.99%), Positives = 414/695 (59.57%), Query Frame = 0
Query: 107 QVHAHMLRTGLFFDPFSASKLFTASALSSF----STLDYARDVFDQIPQPNLYTWNTLIR 166
Q+H +++ G A LF ++L F LD AR VFD++ + N+ +W ++I
Sbjct: 155 QIHGLIVKMGY------AKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVVSWTSMIC 214
Query: 167 AYASSSDPFQSFVIFLDLLDKCEDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFG 226
YA + +F ++ E PN+ T VI A ++L+ G V+
Sbjct: 215 GYARRDFAKDAVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLETGEKVYAFIRNSGIE 274
Query: 227 MDLYILNSLVRFYGACGDLNMAERLFEGISCKDVVSWNSMISAFAQGNFPEDALDLFLKM 286
++ ++++LV Y C +++A+RLF+ ++ N+M S + + +AL +F M
Sbjct: 275 VNDLMVSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLM 334
Query: 287 EGENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKEIKVDLTLCNAMLDMYSKCGSI 346
V P+ ++M+ +S+C++ ++ +G+ Y+ R + +CNA++DMY KC
Sbjct: 335 MDSGVRPDRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHRQ 394
Query: 347 DVAQKLFDEMPERDVFSWTTMLDGYAKMGDFDAARQVFDAMPVKEIAAWNVLISACEQNG 406
D A ++FD M + V +W +++ GY + G+ DAA + F+ MP K I +WN +IS Q
Sbjct: 395 DTAFRIFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGS 454
Query: 407 KPKEALATFNELQVSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLI 466
+EA+ F +Q + D VT++S SAC LGA+DL WI+ YI++ GI L+ L
Sbjct: 455 LFEEAIEVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRLG 514
Query: 467 SSLVDMYAKCGALEKALEVFYSVEEKDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVK 526
++LVDM+++CG E A+ +F S+ +DV W+A I + M G + AI+LF +M E +K
Sbjct: 515 TTLVDMFSRCGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLK 574
Query: 527 PNNVTFTNVLCACSHAGLVDEGRAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAME 586
P+ V F L ACSH GLV +G+ F+ M ++GV P HY CMVD+LGRAG LEEA++
Sbjct: 575 PDGVAFVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQ 634
Query: 587 LINEMPITPSASVWGALLGACSLHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGR 646
LI +MP+ P+ +W +LL AC + NVE+A A++++ L P G+ VLLSN+YA GR
Sbjct: 635 LIEDMPMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAGR 694
Query: 647 WEKVSELRKLMRDSELKKEPGCSSIEVNGNVHEFLVGDNSHLLSSDIYSKLDEIATKLKS 706
W ++++R M++ L+K PG SSI++ G HEF GD SH +I + LDE++ +
Sbjct: 695 WNDMAKVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAMLDEVSQRASH 754
Query: 707 VGYEPNKSHLLQLIEDDDLKEQALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHE 766
+G+ P+ S++L +++ + K LS HSEKLA+A+GLIS IR+VKNLR+C DCH
Sbjct: 755 LGHVPDLSNVLMDVDEKE-KIFMLSRHSEKLAMAYGLISSNKGTTIRIVKNLRVCSDCHS 814
Query: 767 VAKLVSRVYDREILLRDRYRFHHFRDGHCSCMDYW 798
AK S+VY+REI+LRD RFH+ R G CSC D+W
Sbjct: 815 FAKFASKVYNREIILRDNNRFHYIRQGKCSCGDFW 842
BLAST of HG10021841 vs. ExPASy TrEMBL
Match:
A0A5A7SKX2 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold848G00570 PE=3 SV=1)
HSP 1 Score: 1393.6 bits (3606), Expect = 0.0e+00
Identity = 689/733 (94.00%), Positives = 709/733 (96.73%), Query Frame = 0
Query: 65 MEALSVPLISLQNFPTPNNNLPFRNHQILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFSA 124
MEALSVPLISLQNF T NNNLPFRNHQILS ID+CSSSKQLK+VHA MLRTGLFFDPFSA
Sbjct: 1 MEALSVPLISLQNFSTLNNNLPFRNHQILSAIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60
Query: 125 SKLFTASALSSFSTLDYARDVFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 184
SKLFTASALSSFSTLDYAR+VFDQIPQPNLYTWN LIRAYASSSDPFQSFVIFLDLLDKC
Sbjct: 61 SKLFTASALSSFSTLDYARNVFDQIPQPNLYTWNILIRAYASSSDPFQSFVIFLDLLDKC 120
Query: 185 EDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGDLNMA 244
EDLPNNFTFPFVIKAASELKASRVG AVHGMAIKLSFGMDLYILNSLVRFYGACGDL+MA
Sbjct: 121 EDLPNNFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180
Query: 245 ERLFEGISCKDVVSWNSMISAFAQGNFPEDALDLFLKMEGENVMPNSVTMVGVLSACAKK 304
ERLF+GISCKDVVSWNSMISAFAQGN PEDAL+LFLKME ENVMPNSVTMV VLSACAKK
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVCVLSACAKK 240
Query: 305 LDLEFGRWVCSYIERKEIKVDLTLCNAMLDMYSKCGSIDVAQKLFDEMPERDVFSWTTML 364
LDLEFGRWVCSYIERK IK+DLTL NAMLDMY+KCGS+D AQKLFDEMPERDVFSWT ML
Sbjct: 241 LDLEFGRWVCSYIERKGIKMDLTLRNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300
Query: 365 DGYAKMGDFDAARQVFDAMPVKEIAAWNVLISACEQNGKPKEALATFNELQVSKIAKPDE 424
DGYAKMGD+DAAR VF+AMPVKEIAAWNVLISA EQNGKPKEALATFNELQ+SKIAKPDE
Sbjct: 301 DGYAKMGDYDAARGVFNAMPVKEIAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDE 360
Query: 425 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 484
VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 420
Query: 485 VEEKDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEG 544
VEE+DVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPN+VTFTNVLCACSH GLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHTGLVDEG 480
Query: 545 RAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGACS 604
R FFHEMEPVYGVVP TKHYACMVDILGRAGFLEEAMELINEM ITPSASVWGALLGACS
Sbjct: 481 RVFFHEMEPVYGVVPETKHYACMVDILGRAGFLEEAMELINEMSITPSASVWGALLGACS 540
Query: 605 LHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGC 664
LHMNVEL ELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRD+ELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600
Query: 665 SSIEVNGNVHEFLVGDNSHLLSSDIYSKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQ 724
SSIEVNGNVHEFLVGDN H LSS+IYSKLD+IATKLK VGYEPNKSHLLQLIE+DDLKEQ
Sbjct: 601 SSIEVNGNVHEFLVGDNLHPLSSNIYSKLDDIATKLKPVGYEPNKSHLLQLIEEDDLKEQ 660
Query: 725 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFH 784
ALSLHSEKLAIAFGL+SLAPSQPIRVVKNLRICGDCHE AKLVSRVYDR+ILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVSLAPSQPIRVVKNLRICGDCHEFAKLVSRVYDRDILLRDRYRFH 720
Query: 785 HFRDGHCSCMDYW 798
HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733
BLAST of HG10021841 vs. ExPASy TrEMBL
Match:
A0A5D3BBW6 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold194G001260 PE=3 SV=1)
HSP 1 Score: 1391.3 bits (3600), Expect = 0.0e+00
Identity = 688/733 (93.86%), Positives = 708/733 (96.59%), Query Frame = 0
Query: 65 MEALSVPLISLQNFPTPNNNLPFRNHQILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFSA 124
MEALSVPLISLQNF T NNNLPFRNHQILS ID+CSSSKQLK+VHA MLRTGLFFDPFSA
Sbjct: 1 MEALSVPLISLQNFSTLNNNLPFRNHQILSAIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60
Query: 125 SKLFTASALSSFSTLDYARDVFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 184
SKLFTASALSSFSTLDYAR+VFDQIPQPNLYTWN LIRAYASSSDPFQSFVIFLDLLDKC
Sbjct: 61 SKLFTASALSSFSTLDYARNVFDQIPQPNLYTWNILIRAYASSSDPFQSFVIFLDLLDKC 120
Query: 185 EDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGDLNMA 244
EDLPNNFTFPFVIKAASELKASRVG AVHGMAIKLSFGMDLYILNSLVRFYGACGDL+MA
Sbjct: 121 EDLPNNFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180
Query: 245 ERLFEGISCKDVVSWNSMISAFAQGNFPEDALDLFLKMEGENVMPNSVTMVGVLSACAKK 304
ERLF+GISCKDVVSWNSMISAFAQGN PEDAL+LFLKME ENVMPNSVTMV VLSACAKK
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVCVLSACAKK 240
Query: 305 LDLEFGRWVCSYIERKEIKVDLTLCNAMLDMYSKCGSIDVAQKLFDEMPERDVFSWTTML 364
LDLEFGRWVCSYIERK IK+DLTL NAMLDMY+KCGS+D AQKLFDEMPERDVFSWT ML
Sbjct: 241 LDLEFGRWVCSYIERKGIKMDLTLRNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300
Query: 365 DGYAKMGDFDAARQVFDAMPVKEIAAWNVLISACEQNGKPKEALATFNELQVSKIAKPDE 424
DGYAKMGD+DAAR VF+AMPVKEIAAWNVLISA EQNGKPKEALA FNELQ+SKIAKPDE
Sbjct: 301 DGYAKMGDYDAARGVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360
Query: 425 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 484
VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 420
Query: 485 VEEKDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEG 544
VEE+DVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPN+VTFTNVLCACSH GLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHTGLVDEG 480
Query: 545 RAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGACS 604
R FFHEMEPVYGVVP TKHYACMVDILGRAGFLEEAMELINEM ITPSASVWGALLGACS
Sbjct: 481 RVFFHEMEPVYGVVPETKHYACMVDILGRAGFLEEAMELINEMSITPSASVWGALLGACS 540
Query: 605 LHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGC 664
LHMNVEL ELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRD+ELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600
Query: 665 SSIEVNGNVHEFLVGDNSHLLSSDIYSKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQ 724
SSIEVNGNVHEFLVGDN H LSS+IYSKLD+IATKLK VGYEPNKSHLLQLIE+DDLKEQ
Sbjct: 601 SSIEVNGNVHEFLVGDNLHPLSSNIYSKLDDIATKLKPVGYEPNKSHLLQLIEEDDLKEQ 660
Query: 725 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFH 784
ALSLHSEKLAIAFGL+SLAPSQPIRVVKNLRICGDCHE AKLVSRVYDR+ILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVSLAPSQPIRVVKNLRICGDCHEFAKLVSRVYDRDILLRDRYRFH 720
Query: 785 HFRDGHCSCMDYW 798
HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733
BLAST of HG10021841 vs. ExPASy TrEMBL
Match:
A0A1S3C623 (pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103497087 PE=3 SV=1)
HSP 1 Score: 1391.3 bits (3600), Expect = 0.0e+00
Identity = 688/733 (93.86%), Positives = 708/733 (96.59%), Query Frame = 0
Query: 65 MEALSVPLISLQNFPTPNNNLPFRNHQILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFSA 124
MEALSVPLISLQNF T NNNLPFRNHQILS ID+CSSSKQLK+VHA MLRTGLFFDPFSA
Sbjct: 1 MEALSVPLISLQNFSTLNNNLPFRNHQILSAIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60
Query: 125 SKLFTASALSSFSTLDYARDVFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 184
SKLFTASALSSFSTLDYAR+VFDQIPQPNLYTWN LIRAYASSSDPFQSFVIFLDLLDKC
Sbjct: 61 SKLFTASALSSFSTLDYARNVFDQIPQPNLYTWNILIRAYASSSDPFQSFVIFLDLLDKC 120
Query: 185 EDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGDLNMA 244
EDLPNNFTFPFVIKAASELKASRVG AVHGMAIKLSFGMDLYILNSLVRFYGACGDL+MA
Sbjct: 121 EDLPNNFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180
Query: 245 ERLFEGISCKDVVSWNSMISAFAQGNFPEDALDLFLKMEGENVMPNSVTMVGVLSACAKK 304
ERLF+GISCKDVVSWNSMISAFAQGN PEDAL+LFLKME ENVMPNSVTMV VLSACAKK
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVCVLSACAKK 240
Query: 305 LDLEFGRWVCSYIERKEIKVDLTLCNAMLDMYSKCGSIDVAQKLFDEMPERDVFSWTTML 364
LDLEFGRWVCSYIERK IK+DLTL NAMLDMY+KCGS+D AQKLFDEMPERDVFSWT ML
Sbjct: 241 LDLEFGRWVCSYIERKGIKMDLTLRNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300
Query: 365 DGYAKMGDFDAARQVFDAMPVKEIAAWNVLISACEQNGKPKEALATFNELQVSKIAKPDE 424
DGYAKMGD+DAAR VF+AMPVKEIAAWNVLISA EQNGKPKEALA FNELQ+SKIAKPDE
Sbjct: 301 DGYAKMGDYDAARGVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360
Query: 425 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 484
VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 420
Query: 485 VEEKDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEG 544
VEE+DVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPN+VTFTNVLCACSH GLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHTGLVDEG 480
Query: 545 RAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGACS 604
R FFHEMEPVYGVVP TKHYACMVDILGRAGFLEEAMELINEM ITPSASVWGALLGACS
Sbjct: 481 RVFFHEMEPVYGVVPETKHYACMVDILGRAGFLEEAMELINEMSITPSASVWGALLGACS 540
Query: 605 LHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGC 664
LHMNVEL ELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRD+ELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600
Query: 665 SSIEVNGNVHEFLVGDNSHLLSSDIYSKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQ 724
SSIEVNGNVHEFLVGDN H LSS+IYSKLD+IATKLK VGYEPNKSHLLQLIE+DDLKEQ
Sbjct: 601 SSIEVNGNVHEFLVGDNLHPLSSNIYSKLDDIATKLKPVGYEPNKSHLLQLIEEDDLKEQ 660
Query: 725 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFH 784
ALSLHSEKLAIAFGL+SLAPSQPIRVVKNLRICGDCHE AKLVSRVYDR+ILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVSLAPSQPIRVVKNLRICGDCHEFAKLVSRVYDRDILLRDRYRFH 720
Query: 785 HFRDGHCSCMDYW 798
HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733
BLAST of HG10021841 vs. ExPASy TrEMBL
Match:
A0A0A0M0R9 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G530130 PE=3 SV=1)
HSP 1 Score: 1385.5 bits (3585), Expect = 0.0e+00
Identity = 684/733 (93.32%), Positives = 707/733 (96.45%), Query Frame = 0
Query: 65 MEALSVPLISLQNFPTPNNNLPFRNHQILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFSA 124
MEALSVP ISLQNF T NNNL FRNHQILSTID+CSSSKQLK+VHA MLRTGLFFDPFSA
Sbjct: 1 MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60
Query: 125 SKLFTASALSSFSTLDYARDVFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 184
SKLFTASALSSFSTLDYAR++FDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC
Sbjct: 61 SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 120
Query: 185 EDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGDLNMA 244
EDLPN FTFPFVIKAASELKASRVG AVHGMAIKLSFGMDLYILNSLVRFYGACGDL+MA
Sbjct: 121 EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180
Query: 245 ERLFEGISCKDVVSWNSMISAFAQGNFPEDALDLFLKMEGENVMPNSVTMVGVLSACAKK 304
ERLF+GISCKDVVSWNSMISAFAQGN PEDAL+LFLKME ENVMPNSVTMVGVLSACAKK
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK 240
Query: 305 LDLEFGRWVCSYIERKEIKVDLTLCNAMLDMYSKCGSIDVAQKLFDEMPERDVFSWTTML 364
LDLEFGRWVCSYIERK IKVDLTLCNAMLDMY+KCGS+D AQKLFDEMPERDVFSWT ML
Sbjct: 241 LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300
Query: 365 DGYAKMGDFDAARQVFDAMPVKEIAAWNVLISACEQNGKPKEALATFNELQVSKIAKPDE 424
DGYAKMGD+DAAR VF+AMPVKEIAAWNVLISA EQNGKPKEALA FNELQ+SKIAKPDE
Sbjct: 301 DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360
Query: 425 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 484
VTLVSTLSACAQLGAIDLGGWIHVYIKREGI LNCHLISSLVDMYAKCG+LEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS 420
Query: 485 VEEKDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEG 544
VEE+DVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPN+VTFTNVLCACSHAGLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480
Query: 545 RAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGACS 604
R FFHEMEPVYGVVP KHYACMVDILGRAGFLEEAMELINEM TPSASVWGALLGACS
Sbjct: 481 RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS 540
Query: 605 LHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGC 664
LHMNVEL ELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRD+ELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600
Query: 665 SSIEVNGNVHEFLVGDNSHLLSSDIYSKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQ 724
SSIE NGNVHEFLVGDN+H LSS+IYSKL+EIATKLKSVGYEPNKSHLLQLIE+DDLKEQ
Sbjct: 601 SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ 660
Query: 725 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFH 784
ALSLHSEKLAIAFGL++LAPSQPIRVVKNLRICGDCH AKLVSRVYDR+ILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH 720
Query: 785 HFRDGHCSCMDYW 798
HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733
BLAST of HG10021841 vs. ExPASy TrEMBL
Match:
A0A6J1HLG4 (pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111464676 PE=3 SV=1)
HSP 1 Score: 1359.4 bits (3517), Expect = 0.0e+00
Identity = 666/733 (90.86%), Positives = 699/733 (95.36%), Query Frame = 0
Query: 65 MEALSVPLISLQNFPTPNNNLPFRNHQILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFSA 124
ME LS PL+SL N +NNL FRNHQILSTIDQCSS KQLKQVHA MLRTGLFFDPFSA
Sbjct: 1 METLSAPLVSLPNRSIADNNLHFRNHQILSTIDQCSSGKQLKQVHAQMLRTGLFFDPFSA 60
Query: 125 SKLFTASALSSFSTLDYARDVFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 184
SKL ASAL S STL+YARDVFDQIP PNLYTWNTLIRAYASS+DPFQSFVIFL LLD+C
Sbjct: 61 SKLIAASALKSSSTLEYARDVFDQIPHPNLYTWNTLIRAYASSADPFQSFVIFLALLDEC 120
Query: 185 EDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGDLNMA 244
+DLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLS GMD YILNSLVRFYGACGDLNMA
Sbjct: 121 DDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSLGMDQYILNSLVRFYGACGDLNMA 180
Query: 245 ERLFEGISCKDVVSWNSMISAFAQGNFPEDALDLFLKMEGENVMPNSVTMVGVLSACAKK 304
ERLFEGISCKDVVSWNSMISAFAQGN PEDAL+LFLKMEG NVMPNSVTMVGVLSACAKK
Sbjct: 181 ERLFEGISCKDVVSWNSMISAFAQGNCPEDALELFLKMEGANVMPNSVTMVGVLSACAKK 240
Query: 305 LDLEFGRWVCSYIERKEIKVDLTLCNAMLDMYSKCGSIDVAQKLFDEMPERDVFSWTTML 364
LDLEFGRWVCSYIERKEI VDLTLCNAMLDMY+KCGSI A+KLFDEMPERDVFSWTTML
Sbjct: 241 LDLEFGRWVCSYIERKEISVDLTLCNAMLDMYTKCGSIGDAEKLFDEMPERDVFSWTTML 300
Query: 365 DGYAKMGDFDAARQVFDAMPVKEIAAWNVLISACEQNGKPKEALATFNELQVSKIAKPDE 424
DGYAKMGDF+AAR+VFD MPVKEIAAWN LISA E+NGKPKEALATFNELQ+SKIAKPDE
Sbjct: 301 DGYAKMGDFNAARKVFDEMPVKEIAAWNALISAYERNGKPKEALATFNELQLSKIAKPDE 360
Query: 425 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 484
VTLVS+LSACAQLGAIDLGGWIHVYIKREGI+LN HLI+SL+DMYAKCGALEKALEVFY+
Sbjct: 361 VTLVSSLSACAQLGAIDLGGWIHVYIKREGINLNGHLITSLIDMYAKCGALEKALEVFYA 420
Query: 485 VEEKDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEG 544
VEEKDVYVWSAMIAGLGMHGRGKAAI+LFF+MQEAKVKPN+VTFTN+LCACSHAGLVDEG
Sbjct: 421 VEEKDVYVWSAMIAGLGMHGRGKAAIELFFKMQEAKVKPNDVTFTNLLCACSHAGLVDEG 480
Query: 545 RAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGACS 604
RA FHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMP TPSASVWGALLGACS
Sbjct: 481 RALFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPTTPSASVWGALLGACS 540
Query: 605 LHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGC 664
LHMNVELAELASDQLLKLEPRNHGAI+LLSN+YAKTGRW+KVSELRKLMRDSELKKEPGC
Sbjct: 541 LHMNVELAELASDQLLKLEPRNHGAIILLSNVYAKTGRWDKVSELRKLMRDSELKKEPGC 600
Query: 665 SSIEVNGNVHEFLVGDNSHLLSSDIYSKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQ 724
SS+EVNG VHEFLVGDNSH LS DIYSKLDEIA KLKSVGYEPNKSHLLQLIE+DD+KE
Sbjct: 601 SSVEVNGIVHEFLVGDNSHPLSRDIYSKLDEIAAKLKSVGYEPNKSHLLQLIEEDDVKEH 660
Query: 725 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFH 784
ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKL+SRVY+R+IL++DRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLISRVYNRDILVQDRYRFH 720
Query: 785 HFRDGHCSCMDYW 798
HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733
BLAST of HG10021841 vs. TAIR 10
Match:
AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 939.9 bits (2428), Expect = 1.4e-273
Identity = 447/727 (61.49%), Positives = 569/727 (78.27%), Query Frame = 0
Query: 71 PLISLQNFPTPNNNLPFRNHQILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFSASKLFTA 130
P S N PT NN + +S I++C S +QLKQ H HM+RTG F DP+SASKLF
Sbjct: 16 PNFSNPNQPTTNN----ERSRHISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAM 75
Query: 131 SALSSFSTLDYARDVFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNN 190
+ALSSF++L+YAR VFD+IP+PN + WNTLIRAYAS DP S FLD++ + + PN
Sbjct: 76 AALSSFASLEYARKVFDEIPKPNSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNK 135
Query: 191 FTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGDLNMAERLFEG 250
+TFPF+IKAA+E+ + +G+++HGMA+K + G D+++ NSL+ Y +CGDL+ A ++F
Sbjct: 136 YTFPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTT 195
Query: 251 ISCKDVVSWNSMISAFAQGNFPEDALDLFLKMEGENVMPNSVTMVGVLSACAKKLDLEFG 310
I KDVVSWNSMI+ F Q P+ AL+LF KME E+V + VTMVGVLSACAK +LEFG
Sbjct: 196 IKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFG 255
Query: 311 RWVCSYIERKEIKVDLTLCNAMLDMYSKCGSIDVAQKLFDEMPERDVFSWTTMLDGYAKM 370
R VCSYIE + V+LTL NAMLDMY+KCGSI+ A++LFD M E+D +WTTMLDGYA
Sbjct: 256 RQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAIS 315
Query: 371 GDFDAARQVFDAMPVKEIAAWNVLISACEQNGKPKEALATFNELQVSKIAKPDEVTLVST 430
D++AAR+V ++MP K+I AWN LISA EQNGKP EAL F+ELQ+ K K +++TLVST
Sbjct: 316 EDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVST 375
Query: 431 LSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYSVEEKDV 490
LSACAQ+GA++LG WIH YIK+ GI +N H+ S+L+ MY+KCG LEK+ EVF SVE++DV
Sbjct: 376 LSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDV 435
Query: 491 YVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEGRAFFHE 550
+VWSAMI GL MHG G A+D+F++MQEA VKPN VTFTNV CACSH GLVDE + FH+
Sbjct: 436 FVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQ 495
Query: 551 MEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGACSLHMNVE 610
ME YG+VP KHYAC+VD+LGR+G+LE+A++ I MPI PS SVWGALLGAC +H N+
Sbjct: 496 MESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLN 555
Query: 611 LAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGCSSIEVN 670
LAE+A +LL+LEPRN GA VLLSNIYAK G+WE VSELRK MR + LKKEPGCSSIE++
Sbjct: 556 LAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEID 615
Query: 671 GNVHEFLVGDNSHLLSSDIYSKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQALSLHS 730
G +HEFL GDN+H +S +Y KL E+ KLKS GYEP S +LQ+IE++++KEQ+L+LHS
Sbjct: 616 GMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHS 675
Query: 731 EKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFHHFRDGH 790
EKLAI +GLIS + IRV+KNLR+CGDCH VAKL+S++YDREI++RDRYRFHHFR+G
Sbjct: 676 EKLAICYGLISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQ 735
Query: 791 CSCMDYW 798
CSC D+W
Sbjct: 736 CSCNDFW 738
BLAST of HG10021841 vs. TAIR 10
Match:
AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 605.1 bits (1559), Expect = 7.9e-173
Identity = 321/768 (41.80%), Positives = 459/768 (59.77%), Query Frame = 0
Query: 68 LSVPLISLQNFPTPNNNLP----FRNHQILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFS 127
L+VP S P+++ P RNH LS + C + + L+ +HA M++ GL ++
Sbjct: 8 LTVPSSSYPFHFLPSSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYA 67
Query: 128 ASKLFTASALS-SFSTLDYARDVFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLD 187
SKL LS F L YA VF I +PNL WNT+ R +A SSDP + +++ ++
Sbjct: 68 LSKLIEFCILSPHFEGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMI- 127
Query: 188 KCEDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNS------------ 247
LPN++TFPFV+K+ ++ KA + G+ +HG +KL +DLY+ S
Sbjct: 128 SLGLLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLE 187
Query: 248 -------------------LVRFYGACGDLNMAERLFEGISCKDVVSWNSMISAFAQGNF 307
L++ Y + G + A++LF+ I KDVVSWN+MIS +A+
Sbjct: 188 DAHKVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGN 247
Query: 308 PEDALDLFLKMEGENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKEIKVDLTLCNA 367
++AL+LF M NV P+ TMV V+SACA+ +E GR V +I+ +L + NA
Sbjct: 248 YKEALELFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNA 307
Query: 368 MLDMYSKCGSIDVAQKLFDEMPERDVFSWTTMLDGYAKMGDFDAARQVFDAMPVKEIAAW 427
++D+YSKCG ++ A LF+ +P +DV SW T++ GY M +
Sbjct: 308 LIDLYSKCGELETACGLFERLPYKDVISWNTLIGGYTHMNLY------------------ 367
Query: 428 NVLISACEQNGKPKEALATFNELQVSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYIK 487
KEAL F E+ S P++VT++S L ACA LGAID+G WIHVYI
Sbjct: 368 -------------KEALLLFQEMLRSG-ETPNDVTMLSILPACAHLGAIDIGRWIHVYID 427
Query: 488 R--EGIDLNCHLISSLVDMYAKCGALEKALEVFYSVEEKDVYVWSAMIAGLGMHGRGKAA 547
+ +G+ L +SL+DMYAKCG +E A +VF S+ K + W+AMI G MHGR A+
Sbjct: 428 KRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADAS 487
Query: 548 IDLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEGRAFFHEMEPVYGVVPGTKHYACMVD 607
DLF M++ ++P+++TF +L ACSH+G++D GR F M Y + P +HY CM+D
Sbjct: 488 FDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMID 547
Query: 608 ILGRAGFLEEAMELINEMPITPSASVWGALLGACSLHMNVELAELASDQLLKLEPRNHGA 667
+LG +G +EA E+IN M + P +W +LL AC +H NVEL E ++ L+K+EP N G+
Sbjct: 548 LLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGS 607
Query: 668 IVLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGCSSIEVNGNVHEFLVGDNSHLLSSDI 727
VLLSNIYA GRW +V++ R L+ D +KK PGCSSIE++ VHEF++GD H + +I
Sbjct: 608 YVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREI 667
Query: 728 YSKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQALSLHSEKLAIAFGLISLAPSQPIR 787
Y L+E+ L+ G+ P+ S +LQ +E ++ KE AL HSEKLAIAFGLIS P +
Sbjct: 668 YGMLEEMEVLLEKAGFVPDTSEVLQEME-EEWKEGALRHHSEKLAIAFGLISTKPGTKLT 727
Query: 788 VVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFHHFRDGHCSCMDYW 798
+VKNLR+C +CHE KL+S++Y REI+ RDR RFHHFRDG CSC DYW
Sbjct: 728 IVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741
BLAST of HG10021841 vs. TAIR 10
Match:
AT4G14820.1 (Pentatricopeptide repeat (PPR) superfamily protein )
HSP 1 Score: 566.2 bits (1458), Expect = 4.1e-161
Identity = 281/713 (39.41%), Positives = 438/713 (61.43%), Query Frame = 0
Query: 92 ILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFSASKLFTASALSSFSTLDYARDVFDQIPQ 151
IL + C S +KQ+HAH+LRT + S LF S SS L YA +VF IP
Sbjct: 15 ILEKLSFCKSLNHIKQLHAHILRT--VINHKLNSFLFNLSVSSSSINLSYALNVFSSIPS 74
Query: 152 -PNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNNFTFPFVIKAASELKASRVGR 211
P +N +R + SS+P ++ ++F + + F+F ++KA S++ A G
Sbjct: 75 PPESIVFNPFLRDLSRSSEP-RATILFYQRIRHVGGRLDQFSFLPILKAVSKVSALFEGM 134
Query: 212 AVHGMAIKLSFGMDLYILNSLVRFYGACGDLNMAERLFEGISCKDVVSWNSMISAFAQGN 271
+HG+A K++ D ++ + Y +CG +N A +F+ +S +DVV+WN+MI + +
Sbjct: 135 ELHGVAFKIATLCDPFVETGFMDMYASCGRINYARNVFDEMSHRDVVTWNTMIERYCRFG 194
Query: 272 FPEDALDLFLKMEGENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKEIKVDLTLCN 331
++A LF +M+ NVMP+ + + ++SAC + ++ + R + ++ ++++D L
Sbjct: 195 LVDEAFKLFEEMKDSNVMPDEMILCNIVSACGRTGNMRYNRAIYEFLIENDVRMDTHLLT 254
Query: 332 AMLDMYSKCGSIDVAQKLFDEMPERDVFSWTTMLDGYAKMGDFDAARQVFDAMPVKEIAA 391
A++ MY+ G +D+A++ F +M R++F T M+ GY+K G D A+ +FD K++
Sbjct: 255 ALVTMYAGAGCMDMAREFFRKMSVRNLFVSTAMVSGYSKCGRLDDAQVIFDQTEKKDLVC 314
Query: 392 WNVLISACEQNGKPKEALATFNELQVSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYI 451
W +ISA ++ P+EAL F E+ S I KPD V++ S +SACA LG +D W+H I
Sbjct: 315 WTTMISAYVESDYPQEALRVFEEMCCSGI-KPDVVSMFSVISACANLGILDKAKWVHSCI 374
Query: 452 KREGIDLNCHLISSLVDMYAKCGALEKALEVFYSVEEKDVYVWSAMIAGLGMHGRGKAAI 511
G++ + ++L++MYAKCG L+ +VF + ++V WS+MI L MHG A+
Sbjct: 375 HVNGLESELSINNALINMYAKCGGLDATRDVFEKMPRRNVVSWSSMINALSMHGEASDAL 434
Query: 512 DLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEGRAFFHEMEPVYGVVPGTKHYACMVDI 571
LF M++ V+PN VTF VL CSH+GLV+EG+ F M Y + P +HY CMVD+
Sbjct: 435 SLFARMKQENVEPNEVTFVGVLYGCSHSGLVEEGKKIFASMTDEYNITPKLEHYGCMVDL 494
Query: 572 LGRAGFLEEAMELINEMPITPSASVWGALLGACSLHMNVELAELASDQLLKLEPRNHGAI 631
GRA L EA+E+I MP+ + +WG+L+ AC +H +EL + A+ ++L+LEP + GA+
Sbjct: 495 FGRANLLREALEVIESMPVASNVVIWGSLMSACRIHGELELGKFAAKRILELEPDHDGAL 554
Query: 632 VLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGCSSIEVNGNVHEFLVGDNSHLLSSDIY 691
VL+SNIYA+ RWE V +R++M + + KE G S I+ NG HEFL+GD H S++IY
Sbjct: 555 VLMSNIYAREQRWEDVRNIRRVMEEKNVFKEKGLSRIDQNGKSHEFLIGDKRHKQSNEIY 614
Query: 692 SKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQALSLHSEKLAIAFGLISLAPSQP--- 751
+KLDE+ +KLK GY P+ +L +E+++ K+ L HSEKLA+ FGL++ +
Sbjct: 615 AKLDEVVSKLKLAGYVPDCGSVLVDVEEEEKKDLVL-WHSEKLALCFGLMNEEKEEEKDS 674
Query: 752 ---IRVVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFHHFRDGHCSCMDYW 798
IR+VKNLR+C DCH KLVS+VY+REI++RDR RFH +++G CSC DYW
Sbjct: 675 CGVIRIVKNLRVCEDCHLFFKLVSKVYEREIIVRDRTRFHCYKNGLCSCRDYW 722
BLAST of HG10021841 vs. TAIR 10
Match:
AT3G12770.1 (mitochondrial editing factor 22 )
HSP 1 Score: 543.5 bits (1399), Expect = 2.8e-154
Identity = 279/706 (39.52%), Positives = 425/706 (60.20%), Query Frame = 0
Query: 94 STIDQCSSSKQLKQVHAHMLRTGLFFDPFSASKLFTASALSSFSTLDYARDVFDQIPQPN 153
S ID + QLKQ+HA +L GL F F +KL AS SSF + +AR VFD +P+P
Sbjct: 26 SLIDSATHKAQLKQIHARLLVLGLQFSGFLITKLIHAS--SSFGDITFARQVFDDLPRPQ 85
Query: 154 LYTWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNNFTFPFVIKAASELKASRVGRAVH 213
++ WN +IR Y S ++ FQ ++ + P++FTFP ++KA S L ++GR VH
Sbjct: 86 IFPWNAIIRGY-SRNNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVH 145
Query: 214 GMAIKLSFGMDLYILNSLVRFYGACGDLNMAERLFEGISC--KDVVSWNSMISAFAQGNF 273
+L F D+++ N L+ Y C L A +FEG+ + +VSW +++SA+AQ
Sbjct: 146 AQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGE 205
Query: 274 PEDALDLFLKMEGENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKEIKVDLTLCNA 333
P +AL++F +M +V P+ V +V VL+A DL+ GR + + + + ++++ L +
Sbjct: 206 PMEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLIS 265
Query: 334 MLDMYSKCGSIDVAQKLFDEMPERDVFSWTTMLDGYAKMGDFDAARQVFDAMPVKEIAAW 393
+ MY+KCG + A+ LFD+M ++ W M+ GYAK
Sbjct: 266 LNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAK---------------------- 325
Query: 394 NVLISACEQNGKPKEALATFNELQVSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYIK 453
NG +EA+ F+E+ ++K +PD +++ S +SACAQ+G+++ ++ Y+
Sbjct: 326 ---------NGYAREAIDMFHEM-INKDVRPDTISITSAISACAQVGSLEQARSMYEYVG 385
Query: 454 REGIDLNCHLISSLVDMYAKCGALEKALEVFYSVEEKDVYVWSAMIAGLGMHGRGKAAID 513
R + + S+L+DM+AKCG++E A VF ++DV VWSAMI G G+HGR + AI
Sbjct: 386 RSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAIS 445
Query: 514 LFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEGRAFFHEMEPVYGVVPGTKHYACMVDIL 573
L+ M+ V PN+VTF +L AC+H+G+V EG FF+ M + + P +HYAC++D+L
Sbjct: 446 LYRAMERGGVHPNDVTFLGLLMACNHSGMVREGWWFFNRMAD-HKINPQQQHYACVIDLL 505
Query: 574 GRAGFLEEAMELINEMPITPSASVWGALLGACSLHMNVELAELASDQLLKLEPRNHGAIV 633
GRAG L++A E+I MP+ P +VWGALL AC H +VEL E A+ QL ++P N G V
Sbjct: 506 GRAGHLDQAYEVIKCMPVQPGVTVWGALLSACKKHRHVELGEYAAQQLFSIDPSNTGHYV 565
Query: 634 LLSNIYAKTGRWEKVSELRKLMRDSELKKEPGCSSIEVNGNVHEFLVGDNSHLLSSDIYS 693
LSN+YA W++V+E+R M++ L K+ GCS +EV G + F VGD SH +I
Sbjct: 566 QLSNLYAAARLWDRVAEVRVRMKEKGLNKDVGCSWVEVRGRLEAFRVGDKSHPRYEEIER 625
Query: 694 KLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQALSLHSEKLAIAFGLISLAPSQPIRVV 753
+++ I ++LK G+ NK L + D++ E+ L HSE++AIA+GLIS P+R+
Sbjct: 626 QVEWIESRLKEGGFVANKDASLHDLNDEE-AEETLCSHSERIAIAYGLISTPQGTPLRIT 685
Query: 754 KNLRICGDCHEVAKLVSRVYDREILLRDRYRFHHFRDGHCSCMDYW 798
KNLR C +CH KL+S++ DREI++RD RFHHF+DG CSC DYW
Sbjct: 686 KNLRACVNCHAATKLISKLVDREIVVRDTNRFHHFKDGVCSCGDYW 694
BLAST of HG10021841 vs. TAIR 10
Match:
AT3G22690.2 (INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification; LOCATED IN: chloroplast; EXPRESSED IN: 13 plant structures; EXPRESSED DURING: LP.04 four leaves visible, 4 anthesis, petal differentiation and expansion stage, E expanded cotyledon stage, D bilateral stage; CONTAINS InterPro DOMAIN/s: Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT2G29760.1). )
HSP 1 Score: 525.8 bits (1353), Expect = 6.1e-149
Identity = 264/695 (37.99%), Positives = 414/695 (59.57%), Query Frame = 0
Query: 107 QVHAHMLRTGLFFDPFSASKLFTASALSSF----STLDYARDVFDQIPQPNLYTWNTLIR 166
Q+H +++ G A LF ++L F LD AR VFD++ + N+ +W ++I
Sbjct: 155 QIHGLIVKMGY------AKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVVSWTSMIC 214
Query: 167 AYASSSDPFQSFVIFLDLLDKCEDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFG 226
YA + +F ++ E PN+ T VI A ++L+ G V+
Sbjct: 215 GYARRDFAKDAVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLETGEKVYAFIRNSGIE 274
Query: 227 MDLYILNSLVRFYGACGDLNMAERLFEGISCKDVVSWNSMISAFAQGNFPEDALDLFLKM 286
++ ++++LV Y C +++A+RLF+ ++ N+M S + + +AL +F M
Sbjct: 275 VNDLMVSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLM 334
Query: 287 EGENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKEIKVDLTLCNAMLDMYSKCGSI 346
V P+ ++M+ +S+C++ ++ +G+ Y+ R + +CNA++DMY KC
Sbjct: 335 MDSGVRPDRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHRQ 394
Query: 347 DVAQKLFDEMPERDVFSWTTMLDGYAKMGDFDAARQVFDAMPVKEIAAWNVLISACEQNG 406
D A ++FD M + V +W +++ GY + G+ DAA + F+ MP K I +WN +IS Q
Sbjct: 395 DTAFRIFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGS 454
Query: 407 KPKEALATFNELQVSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLI 466
+EA+ F +Q + D VT++S SAC LGA+DL WI+ YI++ GI L+ L
Sbjct: 455 LFEEAIEVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRLG 514
Query: 467 SSLVDMYAKCGALEKALEVFYSVEEKDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVK 526
++LVDM+++CG E A+ +F S+ +DV W+A I + M G + AI+LF +M E +K
Sbjct: 515 TTLVDMFSRCGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLK 574
Query: 527 PNNVTFTNVLCACSHAGLVDEGRAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAME 586
P+ V F L ACSH GLV +G+ F+ M ++GV P HY CMVD+LGRAG LEEA++
Sbjct: 575 PDGVAFVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQ 634
Query: 587 LINEMPITPSASVWGALLGACSLHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGR 646
LI +MP+ P+ +W +LL AC + NVE+A A++++ L P G+ VLLSN+YA GR
Sbjct: 635 LIEDMPMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAGR 694
Query: 647 WEKVSELRKLMRDSELKKEPGCSSIEVNGNVHEFLVGDNSHLLSSDIYSKLDEIATKLKS 706
W ++++R M++ L+K PG SSI++ G HEF GD SH +I + LDE++ +
Sbjct: 695 WNDMAKVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAMLDEVSQRASH 754
Query: 707 VGYEPNKSHLLQLIEDDDLKEQALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHE 766
+G+ P+ S++L +++ + K LS HSEKLA+A+GLIS IR+VKNLR+C DCH
Sbjct: 755 LGHVPDLSNVLMDVDEKE-KIFMLSRHSEKLAMAYGLISSNKGTTIRIVKNLRVCSDCHS 814
Query: 767 VAKLVSRVYDREILLRDRYRFHHFRDGHCSCMDYW 798
AK S+VY+REI+LRD RFH+ R G CSC D+W
Sbjct: 815 FAKFASKVYNREIILRDNNRFHYIRQGKCSCGDFW 842
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_038893523.1 | 0.0e+00 | 95.36 | pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Benincasa ...
KAA0031814.1 | 0.0e+00 | 94.00 | pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]
XP_008457379.1 | 0.0e+00 | 93.86 | PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic ...
XP_004145320.1 | 0.0e+00 | 93.32 | pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Cucumis sa...
XP_022964665.1 | 0.0e+00 | 90.86 | pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Cucurbita ...
Match Name | E-value | Identity | Description
O82380 | 1.9e-272 | 61.49 | Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...
Q9LN01 | 1.1e-171 | 41.80 | Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
O23337 | 5.7e-160 | 39.41 | Pentatricopeptide repeat-containing protein At4g14820 OS=Arabidopsis thaliana OX...
Q9LTV8 | 4.0e-153 | 39.52 | Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX...
Q9LUJ2 | 8.6e-148 | 37.99 | Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX...
Match Name | E-value | Identity | Description
A0A5A7SKX2 | 0.0e+00 | 94.00 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A5D3BBW6 | 0.0e+00 | 93.86 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3C623 | 0.0e+00 | 93.86 | pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Cucumis ...
A0A0A0M0R9 | 0.0e+00 | 93.32 | DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G5301...
A0A6J1HLG4 | 0.0e+00 | 90.86 | pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Cucurbit...
Match Name | E-value | Identity | Description
AT2G29760.1 | 1.4e-273 | 61.49 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G08070.1 | 7.9e-173 | 41.80 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G14820.1 | 4.1e-161 | 39.41 | Pentatricopeptide repeat (PPR) superfamily protein
AT3G12770.1 | 2.8e-154 | 39.52 | mitochondrial editing factor 22
AT3G22690.2 | 6.1e-149 | 37.99 | INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic pro...
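
The Identity column in the tables above repeats the percentage given on each HSP statistics line, which is the count of identical residues divided by the aligned length (for example 688/733 = 93.86%). A minimal Python sketch of that arithmetic follows. The function name parse_hsp_stats and the returned dictionary keys are hypothetical and not part of the BLAST report; the pattern assumes the line layout shown on this page.

```python
import re

# Hypothetical helper, not part of the BLAST report: it reads one HSP
# statistics line in the layout used on this page. "Posi?tives" accepts
# both the spelling "Positives" and the misspelling "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return residue counts and recomputed percentages, or None if no match."""
    match = HSP_STATS.search(line)
    if match is None:
        return None
    identities, ident_len, positives, pos_len = (int(g) for g in match.groups())
    return {
        "identities": identities,
        "positives": positives,
        "aligned_length": ident_len,
        # Percentages are recomputed from the counts rather than copied.
        "identity_pct": round(100 * identities / ident_len, 2),
        "positive_pct": round(100 * positives / pos_len, 2),
    }


if __name__ == "__main__":
    line = "Identity = 688/733 (93.86%), Positives = 708/733 (96.59%), Query Frame = 0"
    print(parse_hsp_stats(line))
```

Recomputing the percentages from the counts, instead of copying the printed values, gives a quick consistency check between an alignment block and its row in the summary table.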