Homology
BLAST of HG10021470.1 vs. NCBI nr
Match:
XP_038894404.1 (pentatricopeptide repeat-containing protein At1g30610, chloroplastic isoform X1 [Benincasa hispida])
HSP 1 Score: 1686.4 bits (4366), Expect = 0.0e+00
Identity = 841/918 (91.61%), Positives = 876/918 (95.42%), Query Frame = 0
Query: 1 MVGVIMANVNLCIPNCERNGFPALHCTQNSHNFFGFSFFPSSVSGTDLNFGDAKNRVLRH 60
MVGVIMANVNLCIP+CERNGFPALHCTQNSHNFFGFSFFPSSVSG DLNFGDAK+RVLRH
Sbjct: 1 MVGVIMANVNLCIPSCERNGFPALHCTQNSHNFFGFSFFPSSVSGPDLNFGDAKHRVLRH 60
Query: 61 RGHKCGAIKASSNGESDIRLSSGNILENDFQFKPSFDEYVRVMETVRTRRYKRQSDDPNK 120
R HKCG+IKASSNGESDIRL S N+LENDFQFKPSFDEYVRVMETVRTRRYKRQSDDPNK
Sbjct: 61 RVHKCGSIKASSNGESDIRLPSENLLENDFQFKPSFDEYVRVMETVRTRRYKRQSDDPNK 120
Query: 121 LTMKENASAKSAESTSISEVDNGKNKVTDVQGNMDVKNLFKRVDRKDLSNNSERITRRKD 180
LTMKENAS KSAE TSIS++DNGKNKVTDVQGN+DVKN+FKRVDRKDL NN+ERITR +D
Sbjct: 121 LTMKENASVKSAEITSISKIDNGKNKVTDVQGNVDVKNMFKRVDRKDLFNNTERITRERD 180
Query: 181 LSGNKFDSKRKGVTRSNDEVKGKVTPFYSQANDKQHEEKRIGNWSSYIETKVPRSYNEKL 240
LSGNK DSKRKG++RSNDEVKGKVTPF SQ NDKQHEEKR N S+Y E KVPR YNEK
Sbjct: 181 LSGNKIDSKRKGISRSNDEVKGKVTPFDSQVNDKQHEEKRNINRSNYTEPKVPRLYNEKR 240
Query: 241 INSKANTLDVKRESHRVCDGSSMRISEKIWADDDTKPAKGILKAGKYSVQLERNYIPGDR 300
IN KANTLD+KRESHR +GSSMRIS KIWA+DDTKPAK IL A KYSVQLERNYI GD+
Sbjct: 241 INFKANTLDIKRESHRASNGSSMRISGKIWANDDTKPAKDILNAVKYSVQLERNYISGDK 300
Query: 301 VGRKKTEQSYGGSSKSGKRLLEFTEESSLEIEHAAFNNFDALDIMDKPRVSKMEMEERIQ 360
VGRKKTEQSY SSKSGKR LEFTE+SSLE+EHAAFNNFDALDIMDKPRVSKMEMEERIQ
Sbjct: 301 VGRKKTEQSYRESSKSGKRFLEFTEDSSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQ 360
Query: 361 MLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMR 420
ML KRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMR
Sbjct: 361 MLCKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMR 420
Query: 421 ERFKSHKLRFIYTTALDVLGKARRPVEALNVFNAMQQHFSSYPDLVAYHSIAVTLGQAGY 480
ERFKSHKLRFIYTTALDVLGKARRPVEALN+F+AMQQHF+SYPDLVAYHSIAVTLGQAGY
Sbjct: 421 ERFKSHKLRFIYTTALDVLGKARRPVEALNLFHAMQQHFTSYPDLVAYHSIAVTLGQAGY 480
Query: 481 MRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQE 540
M+ELFDVIDSMRSPPKKKFKTG LEKWDPRL+PDIVIYNAVLNACVKRKNLEGAFWVLQE
Sbjct: 481 MKELFDVIDSMRSPPKKKFKTGVLEKWDPRLEPDIVIYNAVLNACVKRKNLEGAFWVLQE 540
Query: 541 LKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKT 600
LKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKT
Sbjct: 541 LKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKT 600
Query: 601 DEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGL 660
DEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVA KPLVVTYTGL
Sbjct: 601 DEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVATKPLVVTYTGL 660
Query: 661 IQACLDSNDLQNAVYIFNHMKTFCSPNLVTYNILLKGYLDHGMFEEARELFQNLSEHGRN 720
IQACLDS D+++AVYIFNHMKTFCSPNLVTYN+LLKGYL+HGMFEEARELFQNLSEHGRN
Sbjct: 661 IQACLDSKDIRSAVYIFNHMKTFCSPNLVTYNMLLKGYLEHGMFEEARELFQNLSEHGRN 720
Query: 721 ISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMLLYGYHFNPKRHLRMILEA 780
ISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDF YFY+QMLLYGYHFNPKRHLRMILEA
Sbjct: 721 ISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFYDQMLLYGYHFNPKRHLRMILEA 780
Query: 781 ARAGKDELLETTWKHLAQADRTPPPALLKERFCMKLARGDYSEALSCISNHDSNDAHHFS 840
ARAGKDELLETTWKHLAQADRTPPP LLKERFCMKLARGDYSEALSCISNHDS+D HHFS
Sbjct: 781 ARAGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSCISNHDSSDVHHFS 840
Query: 841 ESAWLNLLKEKRLPKDTVIQLIHMVSMLLTRNDSPNPVFQNLLFSCKEFCRSRISVADHR 900
ES WLNLLKEKR PKDTVIQLI+ VSMLLTRND PNPVF+NLL SCKEFCR+RISVADHR
Sbjct: 841 ESGWLNLLKEKRFPKDTVIQLINKVSMLLTRNDLPNPVFKNLLLSCKEFCRTRISVADHR 900
Query: 901 LEETVCTNETQSAAVMHI 919
LEETVCTNETQSAAV+ I
Sbjct: 901 LEETVCTNETQSAAVVRI 918
BLAST of HG10021470.1 vs. NCBI nr
Match:
XP_031741862.1 (pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucumis sativus] >XP_031741863.1 pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucumis sativus] >KGN65965.1 hypothetical protein Csa_023210 [Cucumis sativus])
HSP 1 Score: 1597.4 bits (4135), Expect = 0.0e+00
Identity = 800/907 (88.20%), Positives = 845/907 (93.16%), Query Frame = 0
Query: 1 MVGVIMANVNLCIPNCERNGFPALHCTQNSHNFFGFSFFPSSVSGTDLNFGDAKNRVLRH 60
MVGVIMAN+NLCIPNCER GFP LHCT NSHN F SFFPSSVSGTD + DAKNRVLRH
Sbjct: 1 MVGVIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSVSGTDSSLSDAKNRVLRH 60
Query: 61 RGHKCGAIKASSNGESDIRLSSGNILENDFQFKPSFDEYVRVMETVRTRRYKRQSDDPNK 120
R HKCG+IKA SNGESDI L SGN+LE+DFQFKPSFDEYV+VMETVRTRRYKRQ DDPNK
Sbjct: 61 RVHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNK 120
Query: 121 LTMKENASAKSAESTSISEVDNGKNKVTDVQGNMDVKNLFKRVDRKDLSNNSERITRRKD 180
LTMKEN SAKSAESTSIS++DNGKNKVTDVQ N+DVKN+FKRVD+KDL NN+ERI KD
Sbjct: 121 LTMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIAPEKD 180
Query: 181 LSGNKFDSKRKGVTRSNDEVKGKVTPFYSQANDKQHEEKRIGNWSSYIETKVPRSYNEKL 240
LSGNKFD +RK VTRSND+VKGK+TPF S NDKQHEEKR NWSSYIE +V RS ++K
Sbjct: 181 LSGNKFD-RRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNENWSSYIEPRVTRSNSKKP 240
Query: 241 INSKANTLDVKRESHRVCDGSSMRISEKIWA--DDDTKPAKGILKAGKYSVQLERNYIPG 300
I+ KANTL+VK+ES RV DG+SM+ SEKIWA DDD KPAKG+LKAGKY +QLER+Y PG
Sbjct: 241 IHFKANTLEVKKESSRVSDGNSMKTSEKIWAWGDDDAKPAKGVLKAGKYGIQLERSYNPG 300
Query: 301 DRVGRKKTEQSYGGSSKSGKRLLEFTEESSLEIEHAAFNNFDALDIMDKPRVSKMEMEER 360
D+VGRKKTEQSY G+S SGKR LEF E++SLE+EHAAFNNFDA DIMDKPRVSKMEMEER
Sbjct: 301 DKVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAFDIMDKPRVSKMEMEER 360
Query: 361 IQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQ 420
IQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQ+IEWLQ
Sbjct: 361 IQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEWLQ 420
Query: 421 MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFNAMQQHFSSYPDLVAYHSIAVTLGQA 480
MRERFKSHKLRFIYTTALDVLGKARRPVEALNVF+AMQ+HFSSYPDLVAYHSIAVTLGQA
Sbjct: 421 MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQA 480
Query: 481 GYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVL 540
GYMRELFDVIDSMRSPPKKKFKTG LEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVL
Sbjct: 481 GYMRELFDVIDSMRSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVL 540
Query: 541 QELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEG 600
QELKKQ LQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEG
Sbjct: 541 QELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEG 600
Query: 601 KTDEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYT 660
KTDEAVLAIENME RGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYT
Sbjct: 601 KTDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYT 660
Query: 661 GLIQACLDSNDLQNAVYIFNHMKTFCSPNLVTYNILLKGYLDHGMFEEARELFQNLSEHG 720
GLIQACLDS DLQ+AVYIFNHMK FCSPNLVTYNILLKGYL+HGMFEEARELFQNLSE
Sbjct: 661 GLIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQR 720
Query: 721 RNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMLLYGYHFNPKRHLRMIL 780
RNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQM LYGYHFNPKRHLRMIL
Sbjct: 721 RNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMIL 780
Query: 781 EAARAGKDELLETTWKHLAQADRTPPPALLKERFCMKLARGDYSEALSCISNHDSNDAHH 840
EAAR GKDELLETTWKHLAQADRTPPP LLKERFCMKLARGDYSEALS I +H+S DAHH
Sbjct: 781 EAARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSGDAHH 840
Query: 841 FSESAWLNLLKEKRLPKDTVIQLIHMVSMLLTRNDSPNPVFQNLLFSCKEFCRSRISVAD 900
FSESAWLNLLKEKR P+DTVI+LIH V M+LTRN+SPNPVF+NLL SCKEFCR+RIS+AD
Sbjct: 841 FSESAWLNLLKEKRFPRDTVIELIHKVGMVLTRNESPNPVFKNLLLSCKEFCRTRISLAD 900
Query: 901 HRLEETV 906
HRLEETV
Sbjct: 901 HRLEETV 906
BLAST of HG10021470.1 vs. NCBI nr
Match:
XP_008459122.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucumis melo])
HSP 1 Score: 1582.4 bits (4096), Expect = 0.0e+00
Identity = 796/913 (87.19%), Positives = 840/913 (92.00%), Query Frame = 0
Query: 1 MVGVIMANVNLCIPNCERNGFPALHCTQNSHNFFGFSFFPSSVS--GTDLNFGDAKNRVL 60
MVGVIMANVNL IPNCER GFP LHCT NSH F SFFPSSVS GTDLNF DAKNRVL
Sbjct: 1 MVGVIMANVNLSIPNCERYGFPTLHCTHNSHTSFWVSFFPSSVSGGGTDLNFSDAKNRVL 60
Query: 61 RHRGHKCGAIKASSNGESDIRLSSGNILENDFQFKPSFDEYVRVMETVRTRRYKRQSDDP 120
RHR HKCG+IKA SNGESDI L +GN+LE+DFQFKPSFDEYV+VMETVRTRRYKRQ D P
Sbjct: 61 RHRIHKCGSIKALSNGESDISLPNGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDYP 120
Query: 121 NKLTMKENASAKSAESTSISEVDNGKNKVTDVQGNMDVKNLFKRVDRKDLSNNSERITRR 180
NKLTMKEN SAKSAESTSIS++DNGKNKVTDVQ N++VKN+FKRVD+KDL NN+ERI R
Sbjct: 121 NKLTMKENCSAKSAESTSISKIDNGKNKVTDVQHNVEVKNMFKRVDKKDLFNNTERIARE 180
Query: 181 KDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQANDKQHEEKRIGNWSSYIETKVPRSYNE 240
K LSGNKFD + KGVTRSND+VKGK+TPF S NDKQHEEK+ GNWSSYIE KV RS E
Sbjct: 181 KHLSGNKFD-RSKGVTRSNDKVKGKMTPFGSLVNDKQHEEKKNGNWSSYIEPKVTRSNCE 240
Query: 241 KLINSKANTLDVKRESHRVCDGSSMRISEKIWA--DDDTKPAKGILKAGKYSVQLERNYI 300
K I+ KAN L+ K+E RV G+SM+ SEKIWA +DD KPAK +LKAGKY +QLER+Y
Sbjct: 241 KPIHFKANALEFKKEGSRVSYGNSMKTSEKIWAWGEDDAKPAKDVLKAGKYGIQLERSYS 300
Query: 301 PGDRVGRKKTEQSYGGSSKSGKRLLEFTEESSLEIEHAAFNNFDALDIMDKPRVSKMEME 360
PGD+VGRKKTEQSY G+S SGKR LEFTEE+SLE+EHAAFNNFDALDIMDKPRVSKMEME
Sbjct: 301 PGDKVGRKKTEQSYRGTSTSGKRFLEFTEENSLEVEHAAFNNFDALDIMDKPRVSKMEME 360
Query: 361 ERIQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEW 420
ERIQMLSKRLNGADIDMPEWMFSQMMR AKIRYSDHSILRVIQVLGKLGNWRRVLQVIEW
Sbjct: 361 ERIQMLSKRLNGADIDMPEWMFSQMMRGAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEW 420
Query: 421 LQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFNAMQQHFSSYPDLVAYHSIAVTLG 480
LQMRERFKSHK RFIYTTALDVLGKARRPVEALNVF+AMQ+HFSSYPDLVAYHSIAVTLG
Sbjct: 421 LQMRERFKSHKPRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLG 480
Query: 481 QAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFW 540
QAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFW
Sbjct: 481 QAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFW 540
Query: 541 VLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWK 600
VLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWK
Sbjct: 541 VLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWK 600
Query: 601 EGKTDEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVT 660
EGKTDEAVLAIENME RG+VGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVT
Sbjct: 601 EGKTDEAVLAIENMEMRGVVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVT 660
Query: 661 YTGLIQACLDSNDLQNAVYIFNHMKTFCSPNLVTYNILLKGYLDHGMFEEARELFQNLSE 720
YTGLIQACLDS DLQ+AVY+FN MK FCSPNLVTYNILLKGYL+HGMFEEAREL QNLSE
Sbjct: 661 YTGLIQACLDSKDLQSAVYVFNQMKAFCSPNLVTYNILLKGYLEHGMFEEARELLQNLSE 720
Query: 721 HGRNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMLLYGYHFNPKRHLRM 780
+NISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQM LYGYHFNPKRHLRM
Sbjct: 721 QRQNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRM 780
Query: 781 ILEAARAGKDELLETTWKHLAQADRTPPPALLKERFCMKLARGDYSEALSCISNHDSNDA 840
ILEAAR GKDELLETTWKHLAQADRTPPP LLKERFCMK+ARGDY+EAL CISNH+S DA
Sbjct: 781 ILEAARVGKDELLETTWKHLAQADRTPPPPLLKERFCMKVARGDYTEALRCISNHNSGDA 840
Query: 841 HHFSESAWLNLLKEKRLPKDTVIQLIHMVSMLLTRNDSPNPVFQNLLFSCKEFCRSRISV 900
HHFSESAWLNLLKEKR PKDTVI+LIH V M+ N+SPNPVF+NLL SCKEFCR+RISV
Sbjct: 841 HHFSESAWLNLLKEKRFPKDTVIELIHKVGMVFATNESPNPVFKNLLLSCKEFCRTRISV 900
Query: 901 ADHRLEETVCTNE 910
ADHRLEETV TNE
Sbjct: 901 ADHRLEETVHTNE 912
BLAST of HG10021470.1 vs. NCBI nr
Match:
KAG7019446.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 1577.0 bits (4082), Expect = 0.0e+00
Identity = 794/918 (86.49%), Positives = 844/918 (91.94%), Query Frame = 0
Query: 1 MVGVIMANVNLCIPNCERNGFPALHCTQNSHNFFGFSFFPSSVSGTDLNFGDAKNRVLRH 60
MVGVIMAN NLCIP CE NGFPAL+CTQNSH GFSFFPSSVSG+ LNFG AK+RVLRH
Sbjct: 1 MVGVIMANANLCIPCCEGNGFPALYCTQNSHYLLGFSFFPSSVSGSGLNFGSAKSRVLRH 60
Query: 61 RGHKCGAIKASSNGESDIRLSSGNILENDFQFKPSFDEYVRVMETVRTRRYKRQSDDPNK 120
RGHKCGAIKASS GESDI+L+SGN+LE DFQFKPSFDEYVRVME+VR+RRYKRQSDDPNK
Sbjct: 61 RGHKCGAIKASSKGESDIQLASGNLLEKDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK 120
Query: 121 LTMKENASAKSAESTSISEVDNGKNKVTDVQGNMDVKNLFKRVDRKDLSNNSERITRRKD 180
MKENASAKSAESTSIS N VTDVQGNMDVKN VD +DL +NSE+ITR+ D
Sbjct: 121 --MKENASAKSAESTSIS------NIVTDVQGNMDVKNKVVCVDGEDLFDNSEKITRKTD 180
Query: 181 LSGNKFDSKRKGVTRSNDEVKGKVTPFYSQANDKQHEEKRIGNWSSYIETKVPRSYNEKL 240
LSGNKFDSKRKGVTRS DE+KGKVTPF SQ NDKQHEEKR GNWS+YIE K RS ++K
Sbjct: 181 LSGNKFDSKRKGVTRSKDELKGKVTPFDSQVNDKQHEEKRNGNWSNYIEPKATRSNHDKR 240
Query: 241 INSKANTLDVKRESHRVCDGSSMRISEKIWADDDTKPAKGILKAGKYSVQLERNYIPGDR 300
++ KANTLDVK ESH V GSSM+IS+KIWADDDTKP K +LK GKY VQLE NYIPGD+
Sbjct: 241 LHFKANTLDVKSESHGVRYGSSMKISDKIWADDDTKPTKDVLKVGKYGVQLEGNYIPGDK 300
Query: 301 VGRKKTEQSYGGSSKSGKRLLEFTEESSLEIEHAAFNNFDALDIMDKPRVSKMEMEERIQ 360
VGRKKTEQSY G SKSGKR EFTEESSLE+EHAAFN+FDA DIMDKPRVSKMEMEERIQ
Sbjct: 301 VGRKKTEQSYRGLSKSGKRFHEFTEESSLEVEHAAFNSFDAEDIMDKPRVSKMEMEERIQ 360
Query: 361 MLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMR 420
MLSKRLNGADIDMPEWMF+QMMRSAKIRYSDHSILRVIQVLGKLGNW+RVLQVIEWLQMR
Sbjct: 361 MLSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMR 420
Query: 421 ERFKSHKLRFIYTTALDVLGKARRPVEALNVFNAMQQHFSSYPDLVAYHSIAVTLGQAGY 480
ERFKSHKLRFIYTTALDVLGKARRPVEALNVF+AMQQHFSSYPDLVAYHSIAVTLGQAGY
Sbjct: 421 ERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGY 480
Query: 481 MRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQE 540
MRELFDVIDSMRSPPKKKFKTGA EKWDPRLQPDIVIYNAVLNACVKRKN EGAFWVLQE
Sbjct: 481 MRELFDVIDSMRSPPKKKFKTGAFEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQE 540
Query: 541 LKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKT 600
LK+QGLQPST+TYGLVMEVML+CGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKT
Sbjct: 541 LKEQGLQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKT 600
Query: 601 DEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGL 660
DEAVLAI+ ME+RGIVGSAALYYDFARCLCSAGRC+EALMQMEKICKVANKPLVVTYTGL
Sbjct: 601 DEAVLAIQTMEKRGIVGSAALYYDFARCLCSAGRCEEALMQMEKICKVANKPLVVTYTGL 660
Query: 661 IQACLDSNDLQNAVYIFNHMKTFCSPNLVTYNILLKGYLDHGMFEEARELFQNLSEHGRN 720
IQACLDS +LQ+AVYIFNHMK FCSPNLVT NILLKGYLDHGMF+EA+ELFQN+SE+GRN
Sbjct: 661 IQACLDSKNLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENGRN 720
Query: 721 ISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMLLYGYHFNPKRHLRMILEA 780
IS VSDYRDRVLPDIY FNTMLDASFAEKRWDDFS+FYNQMLLYGYHFNPKRHLRMI+EA
Sbjct: 721 ISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEA 780
Query: 781 ARAGKDELLETTWKHLAQADRTPPPALLKERFCMKLARGDYSEALSCISNHDSNDAHHFS 840
AR GKDELLETTWKHLAQADRT PP L+KERFC+ LARGDYSEALSCIS H S+D HHFS
Sbjct: 781 ARGGKDELLETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHHFS 840
Query: 841 ESAWLNLLKEKRLPKDTVIQLIHMVSMLLTRNDSPNPVFQNLLFSCKEFCRSRISVADHR 900
+SAWLNLLKEKR PKD+VI+LIH VSMLL RNDSPNPV QNLL S KEFCRSRI+VAD R
Sbjct: 841 KSAWLNLLKEKRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRITVADPR 900
Query: 901 LEETVCTNETQSAAVMHI 919
LEE VCTNE+QSA VMH+
Sbjct: 901 LEEVVCTNESQSATVMHV 910
BLAST of HG10021470.1 vs. NCBI nr
Match:
XP_038894405.1 (pentatricopeptide repeat-containing protein At1g30610, chloroplastic isoform X2 [Benincasa hispida])
HSP 1 Score: 1571.2 bits (4067), Expect = 0.0e+00
Identity = 796/918 (86.71%), Positives = 831/918 (90.52%), Query Frame = 0
Query: 1 MVGVIMANVNLCIPNCERNGFPALHCTQNSHNFFGFSFFPSSVSGTDLNFGDAKNRVLRH 60
MVGVIMANVNLCIP+CERNGFPALHCTQNSHNFFGFSFFPSSVSG DLNFGDAK+RVLRH
Sbjct: 1 MVGVIMANVNLCIPSCERNGFPALHCTQNSHNFFGFSFFPSSVSGPDLNFGDAKHRVLRH 60
Query: 61 RGHKCGAIKASSNGESDIRLSSGNILENDFQFKPSFDEYVRVMETVRTRRYKRQSDDPNK 120
R HKCG+IKASSNGESDIRL S N+LENDFQFKPSFDEYVRVMETVRTRRYKRQSDDPNK
Sbjct: 61 RVHKCGSIKASSNGESDIRLPSENLLENDFQFKPSFDEYVRVMETVRTRRYKRQSDDPNK 120
Query: 121 LTMKENASAKSAESTSISEVDNGKNKVTDVQGNMDVKNLFKRVDRKDLSNNSERITRRKD 180
LTMKENAS KSAE TSIS++DNGKNKVTDVQGN+DVKN+FKRVDRKDL NN+ERITR +D
Sbjct: 121 LTMKENASVKSAEITSISKIDNGKNKVTDVQGNVDVKNMFKRVDRKDLFNNTERITRERD 180
Query: 181 LSGNKFDSKRKGVTRSNDEVKGKVTPFYSQANDKQHEEKRIGNWSSYIETKVPRSYNEKL 240
LSGNK DSKRKG++RSNDEVKGKVTPF SQ NDKQHEEKR N S+Y E KVPR YNEK
Sbjct: 181 LSGNKIDSKRKGISRSNDEVKGKVTPFDSQVNDKQHEEKRNINRSNYTEPKVPRLYNEKR 240
Query: 241 INSKANTLDVKRESHRVCDGSSMRISEKIWADDDTKPAKGILKAGKYSVQLERNYIPGDR 300
IN KANTLD+KRESHR +GSSMRIS KIWA+DDTKPAK IL A KYSVQLERNYI GD+
Sbjct: 241 INFKANTLDIKRESHRASNGSSMRISGKIWANDDTKPAKDILNAVKYSVQLERNYISGDK 300
Query: 301 VGRKKTEQSYGGSSKSGKRLLEFTEESSLEIEHAAFNNFDALDIMDKPRVSKMEMEERIQ 360
VGRKKTEQSY SSKSGKR LEFTE+SSLE+EHAAFNNFDALDIMDKPRVSKMEMEERIQ
Sbjct: 301 VGRKKTEQSYRESSKSGKRFLEFTEDSSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQ 360
Query: 361 MLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMR 420
ML KRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMR
Sbjct: 361 MLCKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMR 420
Query: 421 ERFKSHKLRFIYTTALDVLGKARRPVEALNVFNAMQQHFSSYPDLVAYHSIAVTLGQAGY 480
ERFKSHKL + + G + AGY
Sbjct: 421 ERFKSHKL------SEETCGGTQ-----------------------------FIPCNAGY 480
Query: 481 MRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQE 540
M+ELFDVIDSMRSPPKKKFKTG LEKWDPRL+PDIVIYNAVLNACVKRKNLEGAFWVLQE
Sbjct: 481 MKELFDVIDSMRSPPKKKFKTGVLEKWDPRLEPDIVIYNAVLNACVKRKNLEGAFWVLQE 540
Query: 541 LKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKT 600
LKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKT
Sbjct: 541 LKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKT 600
Query: 601 DEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGL 660
DEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVA KPLVVTYTGL
Sbjct: 601 DEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVATKPLVVTYTGL 660
Query: 661 IQACLDSNDLQNAVYIFNHMKTFCSPNLVTYNILLKGYLDHGMFEEARELFQNLSEHGRN 720
IQACLDS D+++AVYIFNHMKTFCSPNLVTYN+LLKGYL+HGMFEEARELFQNLSEHGRN
Sbjct: 661 IQACLDSKDIRSAVYIFNHMKTFCSPNLVTYNMLLKGYLEHGMFEEARELFQNLSEHGRN 720
Query: 721 ISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMLLYGYHFNPKRHLRMILEA 780
ISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDF YFY+QMLLYGYHFNPKRHLRMILEA
Sbjct: 721 ISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFYDQMLLYGYHFNPKRHLRMILEA 780
Query: 781 ARAGKDELLETTWKHLAQADRTPPPALLKERFCMKLARGDYSEALSCISNHDSNDAHHFS 840
ARAGKDELLETTWKHLAQADRTPPP LLKERFCMKLARGDYSEALSCISNHDS+D HHFS
Sbjct: 781 ARAGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSCISNHDSSDVHHFS 840
Query: 841 ESAWLNLLKEKRLPKDTVIQLIHMVSMLLTRNDSPNPVFQNLLFSCKEFCRSRISVADHR 900
ES WLNLLKEKR PKDTVIQLI+ VSMLLTRND PNPVF+NLL SCKEFCR+RISVADHR
Sbjct: 841 ESGWLNLLKEKRFPKDTVIQLINKVSMLLTRNDLPNPVFKNLLLSCKEFCRTRISVADHR 883
Query: 901 LEETVCTNETQSAAVMHI 919
LEETVCTNETQSAAV+ I
Sbjct: 901 LEETVCTNETQSAAVVRI 883
BLAST of HG10021470.1 vs. ExPASy Swiss-Prot
Match:
Q9SA76 (Pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=EMB2279 PE=3 SV=1)
HSP 1 Score: 724.9 bits (1870), Expect = 1.1e-207
Identity = 421/878 (47.95%), Positives = 560/878 (63.78%), Query Frame = 0
Query: 62 GHKCGAIKASSNGESDIRLSSGNILENDFQFKPSFDEYVRVMETVRTRRYKRQSDDPNKL 121
G A+K S +GES + + + F+ + S EY R +T R + D+ + L
Sbjct: 172 GESSVALKLSKSGESSVTVPE----DESFRKRYSKQEYHRSSDTSRGIERGSRGDELD-L 231
Query: 122 TMKENASAKSAESTSISEVDNGKNKVTDVQGNMDVKNLFKRVDRKDLSNNSERITRRKDL 181
++E + A+ S K+ V K ++ +T KD
Sbjct: 232 VVEERRVQRIAKDARWS------------------KSRESSVAVKWSNSGESSVTMPKDE 291
Query: 182 SGNKFDSKRKGVTRSNDEVKGKVTPFYSQANDKQHEEKRIG------NWSSYIETKVPRS 241
S + SK++ RS+D +G + EE+R+ WS E+ VP S
Sbjct: 292 SFRRRYSKQEH-HRSSDTSRGIARGSKGDELELVVEERRVQRIAKDVRWSKSDESLVPVS 351
Query: 242 YNEKLINSKANTLDVKRESHRVCDGSSMRISEKIWADDDTKPAKGILKAGK---YSVQLE 301
+E + N RV D S +GI + K + E
Sbjct: 352 EDESF--RRGNPKQEMVRYQRVSDTS-----------------RGIERGSKGDGLDLLAE 411
Query: 302 RNYIPGDRVGRKKTE---QSYGGSSKSGKRLLEFTEESSLEIEHAAFNNFD-ALDIMDKP 361
I +R+ ++ E G+ + G + + ++S +E AF D + DI+DKP
Sbjct: 412 ERRI--ERLANERHEIRSSKLSGTRRIGAKRNDDDDDSLFAMETPAFRFSDESSDIVDKP 471
Query: 362 RVSKMEMEERIQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWR 421
S++EME+RI+ L+K LNGADI+MPEW FS+ +RSAKIRY+D++++R+I LGKLGNWR
Sbjct: 472 ATSRVEMEDRIEKLAKVLNGADINMPEWQFSKAIRSAKIRYTDYTVMRLIHFLGKLGNWR 531
Query: 422 RVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFNAMQQHFSSYPDLVAY 481
RVLQVIEWLQ ++R+KS+K+R IYTTAL+VLGK+RRPVEALNVF+AM SSYPD+VAY
Sbjct: 532 RVLQVIEWLQRQDRYKSNKIRIIYTTALNVLGKSRRPVEALNVFHAMLLQISSYPDMVAY 591
Query: 482 HSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKR 541
SIAVTLGQAG+++ELF VID+MRSPPKKKFK LEKWDPRL+PD+V+YNAVLNACV+R
Sbjct: 592 RSIAVTLGQAGHIKELFYVIDTMRSPPKKKFKPTTLEKWDPRLEPDVVVYNAVLNACVQR 651
Query: 542 KNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYK 601
K EGAFWVLQ+LK++G +PS TYGL+MEVML C KYNLVHEFFRK+QKSSIPNAL Y+
Sbjct: 652 KQWEGAFWVLQQLKQRGQKPSPVTYGLIMEVMLACEKYNLVHEFFRKMQKSSIPNALAYR 711
Query: 602 VLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEAL--------- 661
VLVNTLWKEGK+DEAV +E+ME RGIVGSAALYYD ARCLCSAGRC E L
Sbjct: 712 VLVNTLWKEGKSDEAVHTVEDMESRGIVGSAALYYDLARCLCSAGRCNEGLNMVNFVNPV 771
Query: 662 -------------------MQMEKICKVANKPLVVTYTGLIQACLDSNDLQNAVYIFNHM 721
Q++KIC+VANKPLVVTYTGLIQAC+DS +++NA YIF+ M
Sbjct: 772 VLKLIENLIYKADLVHTIQFQLKKICRVANKPLVVTYTGLIQACVDSGNIKNAAYIFDQM 831
Query: 722 KTFCSPNLVTYNILLKGYLDHGMFEEARELFQNLSEHGRNISTVSDYRDRVLPDIYMFNT 781
K CSPNLVT NI+LK YL G+FEEARELFQ +SE G +I SD+ RVLPD Y FNT
Sbjct: 832 KKVCSPNLVTCNIMLKAYLQGGLFEEARELFQKMSEDGNHIKNSSDFESRVLPDTYTFNT 891
Query: 782 MLDASFAEKRWDDFSYFYNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLAQAD 841
MLD +++WDDF Y Y +ML +GYHFN KRHLRM+LEA+RAGK+E++E TW+H+ +++
Sbjct: 892 MLDTCAEQEKWDDFGYAYREMLRHGYHFNAKRHLRMVLEASRAGKEEVMEATWEHMRRSN 951
Query: 842 RTPPPALLKERFCMKLARGDYSEALSCISN----HDSNDAHHFSESAWLNLLKEKRLPKD 894
R PP L+KERF KL +GD+ A+S +++ + + FS SAW +L R +D
Sbjct: 952 RIPPSPLIKERFFRKLEKGDHISAISSLADLNGKIEETELRAFSTSAWSRVL--SRFEQD 1002
BLAST of HG10021470.1 vs. ExPASy Swiss-Prot
Match:
Q9FJW6 (Pentatricopeptide repeat-containing protein At5g67570, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DG1 PE=1 SV=2)
HSP 1 Score: 406.0 bits (1042), Expect = 1.1e-111
Identity = 219/556 (39.39%), Positives = 334/556 (60.07%), Query Frame = 0
Query: 357 ERIQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEW 416
E +++L RL+G +I+ W F +MM + +++++ +L+++ LG+ +W++ V+ W
Sbjct: 183 EAVRVLVDRLSGREINEKHWKFVRMMNQSGLQFTEDQMLKIVDRLGRKQSWKQASAVVHW 242
Query: 417 LQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFNAMQQHFSSYPDLVAYHSIAVTLG 476
+ ++ K + RF+YT L VLG ARRP EAL +FN M YPD+ AYH IAVTLG
Sbjct: 243 VYSDKKRKHLRSRFVYTKLLSVLGFARRPQEALQIFNQMLGDRQLYPDMAAYHCIAVTLG 302
Query: 477 QAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFW 536
QAG ++EL VI+ MR P K K + WDP L+PD+V+YNA+LNACV + W
Sbjct: 303 QAGLLKELLKVIERMRQKPTKLTKNLRQKNWDPVLEPDLVVYNAILNACVPTLQWKAVSW 362
Query: 537 VLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKS-SIPNALTYKVLVNTLW 596
V EL+K GL+P+ +TYGL MEVMLE GK++ VH+FFRK++ S P A+TYKVLV LW
Sbjct: 363 VFVELRKNGLRPNGATYGLAMEVMLESGKFDRVHDFFRKMKSSGEAPKAITYKVLVRALW 422
Query: 597 KEGKTDEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVAN-KPLV 656
+EGK +EAV A+ +ME++G++G+ ++YY+ A CLC+ GR +A++++ ++ ++ N +PL
Sbjct: 423 REGKIEEAVEAVRDMEQKGVIGTGSVYYELACCLCNNGRWCDAMLEVGRMKRLENCRPLE 482
Query: 657 VTYTGLIQACLDSNDLQNAVYIFNHMKTFCSPNLVTYNILLKGYLDHGMFEEARELFQNL 716
+T+TGLI A L+ + + + IF +MK C PN+ T N++LK Y + MF EA+ELF+ +
Sbjct: 483 ITFTGLIAASLNGGHVDDCMAIFQYMKDKCDPNIGTANMMLKVYGRNDMFSEAKELFEEI 542
Query: 717 SEHGRNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMLLYGYHFNPKRHL 776
VS ++P+ Y ++ ML+AS +W+ F + Y M+L GY + +H
Sbjct: 543 ---------VSRKETHLVPNEYTYSFMLEASARSLQWEYFEHVYQTMVLSGYQMDQTKHA 602
Query: 777 RMILEAARAGKDELLETTWKHLAQADRTPPPALLKERFCMKLARGDYSEALSCISNHDSN 836
M++EA+RAGK LLE + + + P P E C A+GD+ A++ I N +
Sbjct: 603 SMLIEASRAGKWSLLEHAFDAVLEDGEIPHPLFFTELLCHATAKGDFQRAITLI-NTVAL 662
Query: 837 DAHHFSESAWLNLLKEKR--LPKDTVIQLIHMVSMLLTRND-SPNPVFQNLLFSCKEFCR 896
+ SE W +L +E + L +D +H +S L D P NL S K C
Sbjct: 663 ASFQISEEEWTDLFEEHQDWLTQDN----LHKLSDHLIECDYVSEPTVSNLSKSLKSRCG 722
Query: 897 SRISVADHRLEETVCT 908
S S A L V T
Sbjct: 723 SSSSSAQPLLAVDVTT 724
BLAST of HG10021470.1 vs. ExPASy Swiss-Prot
Match:
Q0WPZ6 (Pentatricopeptide repeat-containing protein At2g17140 OS=Arabidopsis thaliana OX=3702 GN=At2g17140 PE=2 SV=1)
HSP 1 Score: 112.5 bits (280), Expect = 2.6e-23
Identity = 67/264 (25.38%), Positives = 127/264 (48.11%), Query Frame = 0
Query: 509 PRLQPDIVIYNAVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNL 568
P +P + +YN +L +C+K + +E W+ +++ G+ P T T+ L++ + + +
Sbjct: 106 PENKPSVYLYNLLLESCIKERRVEFVSWLYKDMVLCGIAPQTYTFNLLIRALCDSSCVDA 165
Query: 569 VHEFFRKV-QKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFAR 628
E F ++ +K PN T+ +LV K G TD+ + + ME G++ + +Y
Sbjct: 166 ARELFDEMPEKGCKPNEFTFGILVRGYCKAGLTDKGLELLNAMESFGVLPNKVIYNTIVS 225
Query: 629 CLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSNDLQNAVYIFNHMKT----- 688
C GR ++ +EK+ + P +VT+ I A + +A IF+ M+
Sbjct: 226 SFCREGRNDDSEKMVEKMREEGLVPDIVTFNSRISALCKEGKVLDASRIFSDMELDEYLG 285
Query: 689 FCSPNLVTYNILLKGYLDHGMFEEARELFQNLSE-------------------HGRNI-- 744
PN +TYN++LKG+ G+ E+A+ LF+++ E HG+ I
Sbjct: 286 LPRPNSITYNLMLKGFCKVGLLEDAKTLFESIRENDDLASLQSYNIWLQGLVRHGKFIEA 345
BLAST of HG10021470.1 vs. ExPASy Swiss-Prot
Match:
Q9SR00 (Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At3g04760 PE=2 SV=1)
HSP 1 Score: 111.3 bits (277), Expect = 5.8e-23
Identity = 102/421 (24.23%), Positives = 179/421 (42.52%), Query Frame = 0
Query: 490 SMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQELKKQGLQPS 549
++R+ PK LEK+ QPD+ YNA++N K ++ A VL ++ + P
Sbjct: 136 TLRNIPKAVRVMEILEKFG---QPDVFAYNALINGFCKMNRIDDATRVLDRMRSKDFSPD 195
Query: 550 TSTYGLVMEVMLECGKYNLVHEFFRKVQKSSI-PNALTYKVLVNTLWKEGKTDEAVLAIE 609
T TY +++ + GK +L + ++ + P +TY +L+ EG DEA+ ++
Sbjct: 196 TVTYNIMIGSLCSRGKLDLALKVLNQLLSDNCQPTVITYTILIEATMLEGGVDEALKLMD 255
Query: 610 NMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSN 669
M RG+ Y R +C G A + + +P V++Y L++A L+
Sbjct: 256 EMLSRGLKPDMFTYNTIIRGMCKEGMVDRAFEMVRNLELKGCEPDVISYNILLRALLNQG 315
Query: 670 DLQNAVYIFNHM-KTFCSPNLVTYNILLKGYLDHGMFEEARELFQNLSEHGRNISTVSDY 729
+ + M C PN+VTY+IL+ G EEA L + + E G
Sbjct: 316 KWEEGEKLMTKMFSEKCDPNVVTYSILITTLCRDGKIEEAMNLLKLMKEKG--------- 375
Query: 730 RDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMLLYGYHFNPKRHLRMILEAARAGK-D 789
+ PD Y ++ ++ A E R D F M+ G + + ++ + GK D
Sbjct: 376 ---LTPDAYSYDPLIAAFCREGRLDVAIEFLETMISDGCLPDIVNYNTVLATLCKNGKAD 435
Query: 790 ELLETTWKHLAQADRTPPPALLKERFCMKLARGDYSEALSCISNHDSN--DAHHFSESAW 849
+ LE K L + +P + F + GD AL I SN D + ++
Sbjct: 436 QALEIFGK-LGEVGCSPNSSSYNTMFSALWSSGDKIRALHMILEMMSNGIDPDEITYNSM 495
Query: 850 LNLLKEKRLPKDTVIQLIHMVSMLLTRNDSPNPVFQNLLFSCKEFCRSRISVADHRLEET 906
++ L + + + L+ M S P+ V N++ FC++ HR+E+
Sbjct: 496 ISCLCREGMVDEAFELLVDMRSC----EFHPSVVTYNIVL--LGFCKA------HRIEDA 528
BLAST of HG10021470.1 vs. ExPASy Swiss-Prot
Match:
Q9S7R4 (Pentatricopeptide repeat-containing protein At1g74900, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=OTP43 PE=2 SV=1)
HSP 1 Score: 109.4 bits (272), Expect = 2.2e-22
Identity = 72/314 (22.93%), Positives = 141/314 (44.90%), Query Frame = 0
Query: 442 ARRPVEALNVFNAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKT 501
A +P +A+ +F M +H + DL ++++I L ++ + + +++ ++R
Sbjct: 139 AGKPDKAVKLFLNMHEH-GCFQDLASFNTILDVLCKSKRVEKAYELFRALRG-------- 198
Query: 502 GALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVML 561
R D V YN +LN K A VL+E+ ++G+ P+ +TY +++
Sbjct: 199 --------RFSVDTVTYNVILNGWCLIKRTPKALEVLKEMVERGINPNLTTYNTMLKGFF 258
Query: 562 ECGKYNLVHEFFRKVQKSSIP-NALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAA 621
G+ EFF +++K + +TY +V+ G+ A + M R G++ S A
Sbjct: 259 RAGQIRHAWEFFLEMKKRDCEIDVVTYTTVVHGFGVAGEIKRARNVFDEMIREGVLPSVA 318
Query: 622 LYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSNDLQNAVYIFNHM 681
Y + LC + A++ E++ + +P V TY LI+ + + + M
Sbjct: 319 TYNAMIQVLCKKDNVENAVVMFEEMVRRGYEPNVTTYNVLIRGLFHAGEFSRGEELMQRM 378
Query: 682 KT-FCSPNLVTYNILLKGYLDHGMFEEARELFQNLSEHGRNISTVSDYRDRVLPDIYMFN 741
+ C PN TYN++++ Y + E+A LF+ + LP++ +N
Sbjct: 379 ENEGCEPNFQTYNMMIRYYSECSEVEKALGLFEKMGS------------GDCLPNLDTYN 423
Query: 742 TMLDASFAEKRWDD 754
++ F KR +D
Sbjct: 439 ILISGMFVRKRSED 423
BLAST of HG10021470.1 vs. ExPASy TrEMBL
Match:
A0A0A0LVN7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G553530 PE=4 SV=1)
HSP 1 Score: 1597.4 bits (4135), Expect = 0.0e+00
Identity = 800/907 (88.20%), Positives = 845/907 (93.16%), Query Frame = 0
Query: 1 MVGVIMANVNLCIPNCERNGFPALHCTQNSHNFFGFSFFPSSVSGTDLNFGDAKNRVLRH 60
MVGVIMAN+NLCIPNCER GFP LHCT NSHN F SFFPSSVSGTD + DAKNRVLRH
Sbjct: 1 MVGVIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSVSGTDSSLSDAKNRVLRH 60
Query: 61 RGHKCGAIKASSNGESDIRLSSGNILENDFQFKPSFDEYVRVMETVRTRRYKRQSDDPNK 120
R HKCG+IKA SNGESDI L SGN+LE+DFQFKPSFDEYV+VMETVRTRRYKRQ DDPNK
Sbjct: 61 RVHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNK 120
Query: 121 LTMKENASAKSAESTSISEVDNGKNKVTDVQGNMDVKNLFKRVDRKDLSNNSERITRRKD 180
LTMKEN SAKSAESTSIS++DNGKNKVTDVQ N+DVKN+FKRVD+KDL NN+ERI KD
Sbjct: 121 LTMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIAPEKD 180
Query: 181 LSGNKFDSKRKGVTRSNDEVKGKVTPFYSQANDKQHEEKRIGNWSSYIETKVPRSYNEKL 240
LSGNKFD +RK VTRSND+VKGK+TPF S NDKQHEEKR NWSSYIE +V RS ++K
Sbjct: 181 LSGNKFD-RRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNENWSSYIEPRVTRSNSKKP 240
Query: 241 INSKANTLDVKRESHRVCDGSSMRISEKIWA--DDDTKPAKGILKAGKYSVQLERNYIPG 300
I+ KANTL+VK+ES RV DG+SM+ SEKIWA DDD KPAKG+LKAGKY +QLER+Y PG
Sbjct: 241 IHFKANTLEVKKESSRVSDGNSMKTSEKIWAWGDDDAKPAKGVLKAGKYGIQLERSYNPG 300
Query: 301 DRVGRKKTEQSYGGSSKSGKRLLEFTEESSLEIEHAAFNNFDALDIMDKPRVSKMEMEER 360
D+VGRKKTEQSY G+S SGKR LEF E++SLE+EHAAFNNFDA DIMDKPRVSKMEMEER
Sbjct: 301 DKVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAFDIMDKPRVSKMEMEER 360
Query: 361 IQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQ 420
IQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQ+IEWLQ
Sbjct: 361 IQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEWLQ 420
Query: 421 MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFNAMQQHFSSYPDLVAYHSIAVTLGQA 480
MRERFKSHKLRFIYTTALDVLGKARRPVEALNVF+AMQ+HFSSYPDLVAYHSIAVTLGQA
Sbjct: 421 MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQA 480
Query: 481 GYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVL 540
GYMRELFDVIDSMRSPPKKKFKTG LEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVL
Sbjct: 481 GYMRELFDVIDSMRSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVL 540
Query: 541 QELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEG 600
QELKKQ LQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEG
Sbjct: 541 QELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEG 600
Query: 601 KTDEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYT 660
KTDEAVLAIENME RGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYT
Sbjct: 601 KTDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYT 660
Query: 661 GLIQACLDSNDLQNAVYIFNHMKTFCSPNLVTYNILLKGYLDHGMFEEARELFQNLSEHG 720
GLIQACLDS DLQ+AVYIFNHMK FCSPNLVTYNILLKGYL+HGMFEEARELFQNLSE
Sbjct: 661 GLIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQR 720
Query: 721 RNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMLLYGYHFNPKRHLRMIL 780
RNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQM LYGYHFNPKRHLRMIL
Sbjct: 721 RNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMIL 780
Query: 781 EAARAGKDELLETTWKHLAQADRTPPPALLKERFCMKLARGDYSEALSCISNHDSNDAHH 840
EAAR GKDELLETTWKHLAQADRTPPP LLKERFCMKLARGDYSEALS I +H+S DAHH
Sbjct: 781 EAARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSGDAHH 840
Query: 841 FSESAWLNLLKEKRLPKDTVIQLIHMVSMLLTRNDSPNPVFQNLLFSCKEFCRSRISVAD 900
FSESAWLNLLKEKR P+DTVI+LIH V M+LTRN+SPNPVF+NLL SCKEFCR+RIS+AD
Sbjct: 841 FSESAWLNLLKEKRFPRDTVIELIHKVGMVLTRNESPNPVFKNLLLSCKEFCRTRISLAD 900
Query: 901 HRLEETV 906
HRLEETV
Sbjct: 901 HRLEETV 906
BLAST of HG10021470.1 vs. ExPASy TrEMBL
Match:
A0A1S3C8Z0 (pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103498323 PE=4 SV=1)
HSP 1 Score: 1582.4 bits (4096), Expect = 0.0e+00
Identity = 796/913 (87.19%), Positives = 840/913 (92.00%), Query Frame = 0
Query: 1 MVGVIMANVNLCIPNCERNGFPALHCTQNSHNFFGFSFFPSSVS--GTDLNFGDAKNRVL 60
MVGVIMANVNL IPNCER GFP LHCT NSH F SFFPSSVS GTDLNF DAKNRVL
Sbjct: 1 MVGVIMANVNLSIPNCERYGFPTLHCTHNSHTSFWVSFFPSSVSGGGTDLNFSDAKNRVL 60
Query: 61 RHRGHKCGAIKASSNGESDIRLSSGNILENDFQFKPSFDEYVRVMETVRTRRYKRQSDDP 120
RHR HKCG+IKA SNGESDI L +GN+LE+DFQFKPSFDEYV+VMETVRTRRYKRQ D P
Sbjct: 61 RHRIHKCGSIKALSNGESDISLPNGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDYP 120
Query: 121 NKLTMKENASAKSAESTSISEVDNGKNKVTDVQGNMDVKNLFKRVDRKDLSNNSERITRR 180
NKLTMKEN SAKSAESTSIS++DNGKNKVTDVQ N++VKN+FKRVD+KDL NN+ERI R
Sbjct: 121 NKLTMKENCSAKSAESTSISKIDNGKNKVTDVQHNVEVKNMFKRVDKKDLFNNTERIARE 180
Query: 181 KDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQANDKQHEEKRIGNWSSYIETKVPRSYNE 240
K LSGNKFD + KGVTRSND+VKGK+TPF S NDKQHEEK+ GNWSSYIE KV RS E
Sbjct: 181 KHLSGNKFD-RSKGVTRSNDKVKGKMTPFGSLVNDKQHEEKKNGNWSSYIEPKVTRSNCE 240
Query: 241 KLINSKANTLDVKRESHRVCDGSSMRISEKIWA--DDDTKPAKGILKAGKYSVQLERNYI 300
K I+ KAN L+ K+E RV G+SM+ SEKIWA +DD KPAK +LKAGKY +QLER+Y
Sbjct: 241 KPIHFKANALEFKKEGSRVSYGNSMKTSEKIWAWGEDDAKPAKDVLKAGKYGIQLERSYS 300
Query: 301 PGDRVGRKKTEQSYGGSSKSGKRLLEFTEESSLEIEHAAFNNFDALDIMDKPRVSKMEME 360
PGD+VGRKKTEQSY G+S SGKR LEFTEE+SLE+EHAAFNNFDALDIMDKPRVSKMEME
Sbjct: 301 PGDKVGRKKTEQSYRGTSTSGKRFLEFTEENSLEVEHAAFNNFDALDIMDKPRVSKMEME 360
Query: 361 ERIQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEW 420
ERIQMLSKRLNGADIDMPEWMFSQMMR AKIRYSDHSILRVIQVLGKLGNWRRVLQVIEW
Sbjct: 361 ERIQMLSKRLNGADIDMPEWMFSQMMRGAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEW 420
Query: 421 LQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFNAMQQHFSSYPDLVAYHSIAVTLG 480
LQMRERFKSHK RFIYTTALDVLGKARRPVEALNVF+AMQ+HFSSYPDLVAYHSIAVTLG
Sbjct: 421 LQMRERFKSHKPRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLG 480
Query: 481 QAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFW 540
QAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFW
Sbjct: 481 QAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFW 540
Query: 541 VLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWK 600
VLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWK
Sbjct: 541 VLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWK 600
Query: 601 EGKTDEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVT 660
EGKTDEAVLAIENME RG+VGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVT
Sbjct: 601 EGKTDEAVLAIENMEMRGVVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVT 660
Query: 661 YTGLIQACLDSNDLQNAVYIFNHMKTFCSPNLVTYNILLKGYLDHGMFEEARELFQNLSE 720
YTGLIQACLDS DLQ+AVY+FN MK FCSPNLVTYNILLKGYL+HGMFEEAREL QNLSE
Sbjct: 661 YTGLIQACLDSKDLQSAVYVFNQMKAFCSPNLVTYNILLKGYLEHGMFEEARELLQNLSE 720
Query: 721 HGRNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMLLYGYHFNPKRHLRM 780
+NISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQM LYGYHFNPKRHLRM
Sbjct: 721 QRQNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRM 780
Query: 781 ILEAARAGKDELLETTWKHLAQADRTPPPALLKERFCMKLARGDYSEALSCISNHDSNDA 840
ILEAAR GKDELLETTWKHLAQADRTPPP LLKERFCMK+ARGDY+EAL CISNH+S DA
Sbjct: 781 ILEAARVGKDELLETTWKHLAQADRTPPPPLLKERFCMKVARGDYTEALRCISNHNSGDA 840
Query: 841 HHFSESAWLNLLKEKRLPKDTVIQLIHMVSMLLTRNDSPNPVFQNLLFSCKEFCRSRISV 900
HHFSESAWLNLLKEKR PKDTVI+LIH V M+ N+SPNPVF+NLL SCKEFCR+RISV
Sbjct: 841 HHFSESAWLNLLKEKRFPKDTVIELIHKVGMVFATNESPNPVFKNLLLSCKEFCRTRISV 900
Query: 901 ADHRLEETVCTNE 910
ADHRLEETV TNE
Sbjct: 901 ADHRLEETVHTNE 912
BLAST of HG10021470.1 vs. ExPASy TrEMBL
Match:
A0A6J1EH18 (LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111434226 PE=4 SV=1)
HSP 1 Score: 1567.0 bits (4056), Expect = 0.0e+00
Identity = 790/918 (86.06%), Positives = 840/918 (91.50%), Query Frame = 0
Query: 1 MVGVIMANVNLCIPNCERNGFPALHCTQNSHNFFGFSFFPSSVSGTDLNFGDAKNRVLRH 60
MVGVIMAN NLCIP CE NGFPAL+CTQNSH GFS FPSSVSG+ LNFG AK+RVLRH
Sbjct: 1 MVGVIMANANLCIPCCEGNGFPALYCTQNSHYLLGFSVFPSSVSGSGLNFGSAKSRVLRH 60
Query: 61 RGHKCGAIKASSNGESDIRLSSGNILENDFQFKPSFDEYVRVMETVRTRRYKRQSDDPNK 120
RGHKCGAIKASS GESDI+L+SGN+LE DFQFKPSFDEYVRVME+VR+RRYKRQSDDPNK
Sbjct: 61 RGHKCGAIKASSKGESDIQLASGNLLEKDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK 120
Query: 121 LTMKENASAKSAESTSISEVDNGKNKVTDVQGNMDVKNLFKRVDRKDLSNNSERITRRKD 180
MKENASAKSAEST IS N VTDVQGNMDVKN VD +DL +NSE+ITR+ D
Sbjct: 121 --MKENASAKSAESTFIS------NIVTDVQGNMDVKNKVVCVDGEDLFDNSEKITRKTD 180
Query: 181 LSGNKFDSKRKGVTRSNDEVKGKVTPFYSQANDKQHEEKRIGNWSSYIETKVPRSYNEKL 240
LSGNKFDSKRKGVTRS DE+KGKVTPF SQ NDKQHEEKR GNWS+YIE K RS ++K
Sbjct: 181 LSGNKFDSKRKGVTRSKDELKGKVTPFESQVNDKQHEEKRNGNWSNYIEPKATRSNHDKR 240
Query: 241 INSKANTLDVKRESHRVCDGSSMRISEKIWADDDTKPAKGILKAGKYSVQLERNYIPGDR 300
++ KANTLDVK ESH V GSSM+IS+KIWADDD+KP K +LK GKY VQLE NYIPGD+
Sbjct: 241 LHFKANTLDVKSESHGVRYGSSMKISDKIWADDDSKPTKDVLKVGKYGVQLEGNYIPGDK 300
Query: 301 VGRKKTEQSYGGSSKSGKRLLEFTEESSLEIEHAAFNNFDALDIMDKPRVSKMEMEERIQ 360
VGRKKTEQSY G SKSGKR EFTEESSLE+EHAAFN+ DA DIMDKPRVSKMEMEERIQ
Sbjct: 301 VGRKKTEQSYRGLSKSGKRFHEFTEESSLEVEHAAFNSCDAEDIMDKPRVSKMEMEERIQ 360
Query: 361 MLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMR 420
MLS RLNGADIDMPEWMF+QMMRSAKIRYSDHSILRVIQVLGKLGNW+RVLQVIEWLQMR
Sbjct: 361 MLSNRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMR 420
Query: 421 ERFKSHKLRFIYTTALDVLGKARRPVEALNVFNAMQQHFSSYPDLVAYHSIAVTLGQAGY 480
ERFKSHKLRFIYTTALDVLGKARRPVEALNVF+AMQQHFSSYPDLVAYHSIAVTLGQAGY
Sbjct: 421 ERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGY 480
Query: 481 MRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQE 540
MRELFDVIDSMRSPPKKKFKTGA EKWDPRLQPDIVIYNAVLNACVKRKN EGAFWVLQE
Sbjct: 481 MRELFDVIDSMRSPPKKKFKTGAFEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQE 540
Query: 541 LKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKT 600
LK+QGLQPST+TYGLVMEVML+CGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKT
Sbjct: 541 LKEQGLQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKT 600
Query: 601 DEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGL 660
DEAVLAI+ ME+RGIVGSAALYYDFARCLCSAGRC+EALMQMEKICKVANKPLVVTYTGL
Sbjct: 601 DEAVLAIQTMEKRGIVGSAALYYDFARCLCSAGRCEEALMQMEKICKVANKPLVVTYTGL 660
Query: 661 IQACLDSNDLQNAVYIFNHMKTFCSPNLVTYNILLKGYLDHGMFEEARELFQNLSEHGRN 720
IQACLDS +LQ+AVYIFNHMK FCSPNLVT NILLKGYLDHGMF+EA+ELFQN+SE+GRN
Sbjct: 661 IQACLDSKNLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENGRN 720
Query: 721 ISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMLLYGYHFNPKRHLRMILEA 780
IS VSDYRDRVLPDIY FNTMLDASFAEKRWDDFS+FYNQMLLYGYHFNPKRHLRMI+EA
Sbjct: 721 ISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEA 780
Query: 781 ARAGKDELLETTWKHLAQADRTPPPALLKERFCMKLARGDYSEALSCISNHDSNDAHHFS 840
AR GKDELLETTWKHLAQADRT PP L+KERFC+ LARGDYSEALSCIS H S+D HHFS
Sbjct: 781 ARGGKDELLETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHHFS 840
Query: 841 ESAWLNLLKEKRLPKDTVIQLIHMVSMLLTRNDSPNPVFQNLLFSCKEFCRSRISVADHR 900
+SAWLNLLKEKR PKD+VI+LIH VSMLL RNDSPNPV QNLL S KEFCRSRISVAD R
Sbjct: 841 KSAWLNLLKEKRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRISVADPR 900
Query: 901 LEETVCTNETQSAAVMHI 919
LEE VCTNE+QSA VMH+
Sbjct: 901 LEEVVCTNESQSATVMHV 910
BLAST of HG10021470.1 vs. ExPASy TrEMBL
Match:
A0A6J1KEH7 (pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111495096 PE=4 SV=1)
HSP 1 Score: 1565.8 bits (4053), Expect = 0.0e+00
Identity = 793/918 (86.38%), Positives = 839/918 (91.39%), Query Frame = 0
Query: 1 MVGVIMANVNLCIPNCERNGFPALHCTQNSHNFFGFSFFPSSVSGTDLNFGDAKNRVLRH 60
MVGVIMAN NLCIP CE NGF AL+CTQNSH G SFFPSSVSG+ LNFG AK+RVLRH
Sbjct: 1 MVGVIMANANLCIPCCEGNGFSALYCTQNSHYLLGLSFFPSSVSGSGLNFGSAKSRVLRH 60
Query: 61 RGHKCGAIKASSNGESDIRLSSGNILENDFQFKPSFDEYVRVMETVRTRRYKRQSDDPNK 120
RGHKCGAIKASS GESDI+L+SGN+LE DFQFKPSFDEYVRVME+VR+RRYKRQSDDPNK
Sbjct: 61 RGHKCGAIKASSKGESDIQLASGNLLEKDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK 120
Query: 121 LTMKENASAKSAESTSISEVDNGKNKVTDVQGNMDVKNLFKRVDRKDLSNNSERITRRKD 180
MKENASAKSAESTSIS N VTDVQGNMDVKN VD +DL +NSERITR+ D
Sbjct: 121 --MKENASAKSAESTSIS------NIVTDVQGNMDVKNKVVYVDGEDLFDNSERITRKTD 180
Query: 181 LSGNKFDSKRKGVTRSNDEVKGKVTPFYSQANDKQHEEKRIGNWSSYIETKVPRSYNEKL 240
LSGNKFDSKRKGVTRS DE+KGKVTPF SQ NDKQHEEKR GNWS+YIE KV RS ++K
Sbjct: 181 LSGNKFDSKRKGVTRSKDELKGKVTPFDSQINDKQHEEKRNGNWSNYIEPKVTRSNHDKR 240
Query: 241 INSKANTLDVKRESHRVCDGSSMRISEKIWADDDTKPAKGILKAGKYSVQLERNYIPGDR 300
++ KANTLDVK ESH V GSSM+ISEKIWADDD KP K +LK GKY VQL+ NYIPGD+
Sbjct: 241 LHFKANTLDVKSESHGVRYGSSMKISEKIWADDDIKPTKDVLKVGKYGVQLKGNYIPGDK 300
Query: 301 VGRKKTEQSYGGSSKSGKRLLEFTEESSLEIEHAAFNNFDALDIMDKPRVSKMEMEERIQ 360
VGRKKTEQSY G SKSGKR EFTEESSLE+EHAAFN+ DA DIMDKPRVSKMEMEERIQ
Sbjct: 301 VGRKKTEQSYRGLSKSGKRFHEFTEESSLEVEHAAFNSCDAADIMDKPRVSKMEMEERIQ 360
Query: 361 MLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMR 420
MLSKRLNGADIDMPEWMF+QMMRSAKIRYSDHSILRVIQVLGKLGNW+RVLQVIEWLQMR
Sbjct: 361 MLSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMR 420
Query: 421 ERFKSHKLRFIYTTALDVLGKARRPVEALNVFNAMQQHFSSYPDLVAYHSIAVTLGQAGY 480
ERFKSHKLRFIYTTALDVLGKARRPVEALNVF+AMQQHFSSYPDLVAYHSIAVTLGQAGY
Sbjct: 421 ERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGY 480
Query: 481 MRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQE 540
MRELFDVIDSMRSPPKKKFKTGA EKWDPRLQPDIVIYNAVLNACVKRKN EGAFWVLQE
Sbjct: 481 MRELFDVIDSMRSPPKKKFKTGAFEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQE 540
Query: 541 LKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKT 600
LK+QGLQPST+TYGLVMEVML+CGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKT
Sbjct: 541 LKEQGLQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKT 600
Query: 601 DEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGL 660
DEAVLAI+ ME+RGIVGSAALYYDFARCLCSAGR +EALMQMEKICKVANKPLVVTYTGL
Sbjct: 601 DEAVLAIQTMEKRGIVGSAALYYDFARCLCSAGRWEEALMQMEKICKVANKPLVVTYTGL 660
Query: 661 IQACLDSNDLQNAVYIFNHMKTFCSPNLVTYNILLKGYLDHGMFEEARELFQNLSEHGRN 720
IQACLDS +LQ+AVYIFNHMK FCSPNLVT NILLKGYLDHGMF EA+ELFQN+SE+GRN
Sbjct: 661 IQACLDSKNLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFNEAKELFQNMSENGRN 720
Query: 721 ISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMLLYGYHFNPKRHLRMILEA 780
IS VSDYRDRVLPDIY FNTMLDASFAEKRWDDFS+FYNQMLLYGYHFNPKRHLRMI+EA
Sbjct: 721 ISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEA 780
Query: 781 ARAGKDELLETTWKHLAQADRTPPPALLKERFCMKLARGDYSEALSCISNHDSNDAHHFS 840
AR GKDELLETTWKHLAQADR PP L+KERFC+ LARGDYSEALSCIS H S+D HHFS
Sbjct: 781 ARGGKDELLETTWKHLAQADRILPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHHFS 840
Query: 841 ESAWLNLLKEKRLPKDTVIQLIHMVSMLLTRNDSPNPVFQNLLFSCKEFCRSRISVADHR 900
+SAWLNLLKEKR PKD+VIQLIH VSMLL RNDSPNPV QNLL S KEFCRSRISVAD R
Sbjct: 841 KSAWLNLLKEKRFPKDSVIQLIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRISVADPR 900
Query: 901 LEETVCTNETQSAAVMHI 919
LEE VCTNE+QSAAVMH+
Sbjct: 901 LEEVVCTNESQSAAVMHV 910
BLAST of HG10021470.1 vs. ExPASy TrEMBL
Match:
A0A6J1CLQ9 (pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111012614 PE=4 SV=1)
HSP 1 Score: 1535.8 bits (3975), Expect = 0.0e+00
Identity = 770/918 (83.88%), Positives = 826/918 (89.98%), Query Frame = 0
Query: 1 MVGVIMANVNLCIPNCERNGFPALHCTQNSHNFFGFSFFPSSVSGTDLNFGDAKNRVLRH 60
MVGVIMAN N+CIP CERNGF ALHCTQ+SHN FGFS FPS +SG LN G KNR+ R+
Sbjct: 1 MVGVIMANANMCIPCCERNGFRALHCTQSSHNLFGFSLFPSPISGIGLNVGYEKNRIFRY 60
Query: 61 RGHKCGAIKASSNGESDIRLSSGNILENDFQFKPSFDEYVRVMETVRTRRYKRQSDDPNK 120
RG+KCGAI+ SS GESDIRL +GN+LENDF FKPSFDEYVRVME+VRT RYK+Q DDPNK
Sbjct: 61 RGNKCGAIRVSSKGESDIRLQNGNVLENDFLFKPSFDEYVRVMESVRTSRYKKQPDDPNK 120
Query: 121 LTMKENASAKSAESTSISEVDNGKNKVTDVQGNMDVKNLFKRVDRKDLSNNSERITRRKD 180
L MKENASAKSAES+S+SE+DN K KVTDVQGN+DVKN+FKRVD+K L NN+ER+TR+KD
Sbjct: 121 LKMKENASAKSAESSSVSEIDNEKTKVTDVQGNVDVKNMFKRVDQKKLFNNAERVTRKKD 180
Query: 181 LSGNKFDSKRKGVTRSNDEVKGKVTPFYSQANDKQHEEKRIGNWSSYIETKVPRSYNEKL 240
L NKFD+KRKG+TR+ DE +GKVT F SQ NDKQHEE+R N IE KV R NE L
Sbjct: 181 LLENKFDNKRKGITRTKDEFRGKVTHFDSQVNDKQHEEQRKRNRLDCIEPKVRRLNNEAL 240
Query: 241 INSKANTLDVKRESHRVCDGSSMRISEKIWADDDTKPAKGILKAGKYSVQLERNYIPGDR 300
+ SKANTLD+KR+ RVCD SSM+ E+IWAD DTK AKG L+ GK VQL RNY+PG++
Sbjct: 241 VCSKANTLDIKRQRQRVCDESSMKTVERIWADGDTKLAKGDLEVGKSGVQLARNYVPGEK 300
Query: 301 VGRKKTEQSYGGSSKSGKRLLEFTEESSLEIEHAAFNNFDALDIMDKPRVSKMEMEERIQ 360
V KKT QSY G SKSGK +E TEESSLE+E AA NNFDALDIMDKPRVSKMEMEERIQ
Sbjct: 301 VSGKKTGQSYQGLSKSGKPFIESTEESSLEVERAALNNFDALDIMDKPRVSKMEMEERIQ 360
Query: 361 MLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMR 420
MLSKRLNGADIDMPEWMF+QMMRSAKIRYSDHSILRVIQVLGKLGNW+RVLQVIEWLQMR
Sbjct: 361 MLSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMR 420
Query: 421 ERFKSHKLRFIYTTALDVLGKARRPVEALNVFNAMQQHFSSYPDLVAYHSIAVTLGQAGY 480
ERFKSHKLRFIYTTALDVLGKARRPVEALNVF+AMQQHFSSYPDLVAYHSIAVTLGQAGY
Sbjct: 421 ERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGY 480
Query: 481 MRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQE 540
MRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKN EGAFWVLQE
Sbjct: 481 MRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQE 540
Query: 541 LKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKT 600
LKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQ+SSIPNALTYKVLVNTL KEGKT
Sbjct: 541 LKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLSKEGKT 600
Query: 601 DEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGL 660
DEAVLAI+NMERRGIVGSAALYYDFARCLCSAGRCKEALMQ+EKICKVANKPLVVTYTGL
Sbjct: 601 DEAVLAIQNMERRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGL 660
Query: 661 IQACLDSNDLQNAVYIFNHMKTFCSPNLVTYNILLKGYLDHGMFEEARELFQNLSEHGRN 720
IQACLDS +L +AVYIFNHMK FCSPNLVTYNILLKGYLDHGMFEEARELFQNLSE G++
Sbjct: 661 IQACLDSKNLDSAVYIFNHMKAFCSPNLVTYNILLKGYLDHGMFEEARELFQNLSESGQS 720
Query: 721 ISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMLLYGYHFNPKRHLRMILEA 780
IST+SDY+DRVLPDIY FN MLDA FA KRWDDF YFYNQM LYGYHFNPKRHLRMILEA
Sbjct: 721 ISTISDYKDRVLPDIYTFNIMLDAFFAVKRWDDFGYFYNQMFLYGYHFNPKRHLRMILEA 780
Query: 781 ARAGKDELLETTWKHLAQADRTPPPALLKERFCMKLARGDYSEALSCISNHDSNDAHHFS 840
RAGKDE+LETTWKHLAQ DRT PP L+KERFCMKLARGDYSEALSCISNH S+DAHHFS
Sbjct: 781 GRAGKDEILETTWKHLAQTDRTLPPPLVKERFCMKLARGDYSEALSCISNHHSSDAHHFS 840
Query: 841 ESAWLNLLKEKRLPKDTVIQLIHMVSMLLTRNDSPNPVFQNLLFSCKEFCRSRISVADHR 900
ESAWLNLLKEK PKDTVI LIH VSMLLT N PNPVFQNLL SCKEFCR+RI+VAD +
Sbjct: 841 ESAWLNLLKEKGFPKDTVILLIHKVSMLLTGNHPPNPVFQNLLSSCKEFCRTRITVADSK 900
Query: 901 LEETVCTNETQSAAVMHI 919
LE+ VC +ETQSAAVMHI
Sbjct: 901 LEQIVCRDETQSAAVMHI 918
BLAST of HG10021470.1 vs. TAIR 10
Match:
AT1G30610.2 (pentatricopeptide (PPR) repeat-containing protein )
HSP 1 Score: 734.6 bits (1895), Expect = 1.0e-211
Identity = 422/852 (49.53%), Positives = 562/852 (65.96%), Query Frame = 0
Query: 68 IKASSNGESDIRL-------SSGNILEND-FQFKPSFDEYVRVMETVRTRRYKRQSDDPN 127
+K S +GES + L SS + E++ F+ + S EY R +T R + D+ +
Sbjct: 166 LKWSKSGESSVALKLSKSGESSVTVPEDESFRKRYSKQEYHRSSDTSRGIERGSRGDELD 225
Query: 128 KLTMKENASAKSAESTSISEVDNGKNKVTDVQGNMDVKNLFKRVDRKDLSNNSERITRRK 187
L ++E + A+ S K+ V K ++ +T K
Sbjct: 226 -LVVEERRVQRIAKDARWS------------------KSRESSVAVKWSNSGESSVTMPK 285
Query: 188 DLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQANDKQHEEKRIG------NWSSYIETKVP 247
D S + SK++ RS+D +G + EE+R+ WS E+ VP
Sbjct: 286 DESFRRRYSKQEH-HRSSDTSRGIARGSKGDELELVVEERRVQRIAKDVRWSKSDESLVP 345
Query: 248 RSYNEKLINSKANTLDVKRESHRVCDGSSMRISEKIWADDDTKPAKGILKAGK---YSVQ 307
S +E + N RV D S +GI + K +
Sbjct: 346 VSEDESF--RRGNPKQEMVRYQRVSDTS-----------------RGIERGSKGDGLDLL 405
Query: 308 LERNYIPGDRVGRKKTE---QSYGGSSKSGKRLLEFTEESSLEIEHAAFNNFD-ALDIMD 367
E I +R+ ++ E G+ + G + + ++S +E AF D + DI+D
Sbjct: 406 AEERRI--ERLANERHEIRSSKLSGTRRIGAKRNDDDDDSLFAMETPAFRFSDESSDIVD 465
Query: 368 KPRVSKMEMEERIQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGN 427
KP S++EME+RI+ L+K LNGADI+MPEW FS+ +RSAKIRY+D++++R+I LGKLGN
Sbjct: 466 KPATSRVEMEDRIEKLAKVLNGADINMPEWQFSKAIRSAKIRYTDYTVMRLIHFLGKLGN 525
Query: 428 WRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFNAMQQHFSSYPDLV 487
WRRVLQVIEWLQ ++R+KS+K+R IYTTAL+VLGK+RRPVEALNVF+AM SSYPD+V
Sbjct: 526 WRRVLQVIEWLQRQDRYKSNKIRIIYTTALNVLGKSRRPVEALNVFHAMLLQISSYPDMV 585
Query: 488 AYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACV 547
AY SIAVTLGQAG+++ELF VID+MRSPPKKKFK LEKWDPRL+PD+V+YNAVLNACV
Sbjct: 586 AYRSIAVTLGQAGHIKELFYVIDTMRSPPKKKFKPTTLEKWDPRLEPDVVVYNAVLNACV 645
Query: 548 KRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALT 607
+RK EGAFWVLQ+LK++G +PS TYGL+MEVML C KYNLVHEFFRK+QKSSIPNAL
Sbjct: 646 QRKQWEGAFWVLQQLKQRGQKPSPVTYGLIMEVMLACEKYNLVHEFFRKMQKSSIPNALA 705
Query: 608 YKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKIC 667
Y+VLVNTLWKEGK+DEAV +E+ME RGIVGSAALYYD ARCLCSAGRC E L ++KIC
Sbjct: 706 YRVLVNTLWKEGKSDEAVHTVEDMESRGIVGSAALYYDLARCLCSAGRCNEGLNMLKKIC 765
Query: 668 KVANKPLVVTYTGLIQACLDSNDLQNAVYIFNHMKTFCSPNLVTYNILLKGYLDHGMFEE 727
+VANKPLVVTYTGLIQAC+DS +++NA YIF+ MK CSPNLVT NI+LK YL G+FEE
Sbjct: 766 RVANKPLVVTYTGLIQACVDSGNIKNAAYIFDQMKKVCSPNLVTCNIMLKAYLQGGLFEE 825
Query: 728 ARELFQNLSEHGRNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMLLYGY 787
ARELFQ +SE G +I SD+ RVLPD Y FNTMLD +++WDDF Y Y +ML +GY
Sbjct: 826 ARELFQKMSEDGNHIKNSSDFESRVLPDTYTFNTMLDTCAEQEKWDDFGYAYREMLRHGY 885
Query: 788 HFNPKRHLRMILEAARAGKDELLETTWKHLAQADRTPPPALLKERFCMKLARGDYSEALS 847
HFN KRHLRM+LEA+RAGK+E++E TW+H+ +++R PP L+KERF KL +GD+ A+S
Sbjct: 886 HFNAKRHLRMVLEASRAGKEEVMEATWEHMRRSNRIPPSPLIKERFFRKLEKGDHISAIS 945
Query: 848 CISN----HDSNDAHHFSESAWLNLLKEKRLPKDTVIQLIHMVSMLL-TRNDSPNPVFQN 894
+++ + + FS SAW +L R +D+V++L+ V+ L +R++S + V N
Sbjct: 946 SLADLNGKIEETELRAFSTSAWSRVL--SRFEQDSVLRLMDDVNRRLGSRSESSDSVLGN 974
HSP 2 Score: 36.2 bits (82), Expect = 1.7e-01
Identity = 29/94 (30.85%), Positives = 53/94 (56.38%), Query Frame = 0
Query: 87 ENDFQFKPSFDEYVRVMETVRTRRYKRQSDDPNKLTMKENASAKSAESTSISEVDNGKNK 146
+ F+FKPSFD+Y+++ME+V+T R K++ D +L ++E+ S+ EV + K K
Sbjct: 68 DKGFEFKPSFDQYLQIMESVKTARKKKKFD---RLKVEED-DGGGGNGDSVYEVKDMKIK 127
Query: 147 VTDVQGNMDVKNLFKRVDRKDL--SNNSERITRR 179
G + + KR R+++ +ER+ +R
Sbjct: 128 ----SGELKDETFRKRYSRQEIVSDKRNERVFKR 153
BLAST of HG10021470.1 vs. TAIR 10
Match:
AT1G30610.1 (pentatricopeptide (PPR) repeat-containing protein )
HSP 1 Score: 724.9 bits (1870), Expect = 7.9e-209
Identity = 421/878 (47.95%), Positives = 560/878 (63.78%), Query Frame = 0
Query: 62 GHKCGAIKASSNGESDIRLSSGNILENDFQFKPSFDEYVRVMETVRTRRYKRQSDDPNKL 121
G A+K S +GES + + + F+ + S EY R +T R + D+ + L
Sbjct: 172 GESSVALKLSKSGESSVTVPE----DESFRKRYSKQEYHRSSDTSRGIERGSRGDELD-L 231
Query: 122 TMKENASAKSAESTSISEVDNGKNKVTDVQGNMDVKNLFKRVDRKDLSNNSERITRRKDL 181
++E + A+ S K+ V K ++ +T KD
Sbjct: 232 VVEERRVQRIAKDARWS------------------KSRESSVAVKWSNSGESSVTMPKDE 291
Query: 182 SGNKFDSKRKGVTRSNDEVKGKVTPFYSQANDKQHEEKRIG------NWSSYIETKVPRS 241
S + SK++ RS+D +G + EE+R+ WS E+ VP S
Sbjct: 292 SFRRRYSKQEH-HRSSDTSRGIARGSKGDELELVVEERRVQRIAKDVRWSKSDESLVPVS 351
Query: 242 YNEKLINSKANTLDVKRESHRVCDGSSMRISEKIWADDDTKPAKGILKAGK---YSVQLE 301
+E + N RV D S +GI + K + E
Sbjct: 352 EDESF--RRGNPKQEMVRYQRVSDTS-----------------RGIERGSKGDGLDLLAE 411
Query: 302 RNYIPGDRVGRKKTE---QSYGGSSKSGKRLLEFTEESSLEIEHAAFNNFD-ALDIMDKP 361
I +R+ ++ E G+ + G + + ++S +E AF D + DI+DKP
Sbjct: 412 ERRI--ERLANERHEIRSSKLSGTRRIGAKRNDDDDDSLFAMETPAFRFSDESSDIVDKP 471
Query: 362 RVSKMEMEERIQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWR 421
S++EME+RI+ L+K LNGADI+MPEW FS+ +RSAKIRY+D++++R+I LGKLGNWR
Sbjct: 472 ATSRVEMEDRIEKLAKVLNGADINMPEWQFSKAIRSAKIRYTDYTVMRLIHFLGKLGNWR 531
Query: 422 RVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFNAMQQHFSSYPDLVAY 481
RVLQVIEWLQ ++R+KS+K+R IYTTAL+VLGK+RRPVEALNVF+AM SSYPD+VAY
Sbjct: 532 RVLQVIEWLQRQDRYKSNKIRIIYTTALNVLGKSRRPVEALNVFHAMLLQISSYPDMVAY 591
Query: 482 HSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKR 541
SIAVTLGQAG+++ELF VID+MRSPPKKKFK LEKWDPRL+PD+V+YNAVLNACV+R
Sbjct: 592 RSIAVTLGQAGHIKELFYVIDTMRSPPKKKFKPTTLEKWDPRLEPDVVVYNAVLNACVQR 651
Query: 542 KNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYK 601
K EGAFWVLQ+LK++G +PS TYGL+MEVML C KYNLVHEFFRK+QKSSIPNAL Y+
Sbjct: 652 KQWEGAFWVLQQLKQRGQKPSPVTYGLIMEVMLACEKYNLVHEFFRKMQKSSIPNALAYR 711
Query: 602 VLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEAL--------- 661
VLVNTLWKEGK+DEAV +E+ME RGIVGSAALYYD ARCLCSAGRC E L
Sbjct: 712 VLVNTLWKEGKSDEAVHTVEDMESRGIVGSAALYYDLARCLCSAGRCNEGLNMVNFVNPV 771
Query: 662 -------------------MQMEKICKVANKPLVVTYTGLIQACLDSNDLQNAVYIFNHM 721
Q++KIC+VANKPLVVTYTGLIQAC+DS +++NA YIF+ M
Sbjct: 772 VLKLIENLIYKADLVHTIQFQLKKICRVANKPLVVTYTGLIQACVDSGNIKNAAYIFDQM 831
Query: 722 KTFCSPNLVTYNILLKGYLDHGMFEEARELFQNLSEHGRNISTVSDYRDRVLPDIYMFNT 781
K CSPNLVT NI+LK YL G+FEEARELFQ +SE G +I SD+ RVLPD Y FNT
Sbjct: 832 KKVCSPNLVTCNIMLKAYLQGGLFEEARELFQKMSEDGNHIKNSSDFESRVLPDTYTFNT 891
Query: 782 MLDASFAEKRWDDFSYFYNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLAQAD 841
MLD +++WDDF Y Y +ML +GYHFN KRHLRM+LEA+RAGK+E++E TW+H+ +++
Sbjct: 892 MLDTCAEQEKWDDFGYAYREMLRHGYHFNAKRHLRMVLEASRAGKEEVMEATWEHMRRSN 951
Query: 842 RTPPPALLKERFCMKLARGDYSEALSCISN----HDSNDAHHFSESAWLNLLKEKRLPKD 894
R PP L+KERF KL +GD+ A+S +++ + + FS SAW +L R +D
Sbjct: 952 RIPPSPLIKERFFRKLEKGDHISAISSLADLNGKIEETELRAFSTSAWSRVL--SRFEQD 1002
HSP 2 Score: 36.2 bits (82), Expect = 1.7e-01
Identity = 29/94 (30.85%), Positives = 53/94 (56.38%), Query Frame = 0
Query: 87 ENDFQFKPSFDEYVRVMETVRTRRYKRQSDDPNKLTMKENASAKSAESTSISEVDNGKNK 146
+ F+FKPSFD+Y+++ME+V+T R K++ D +L ++E+ S+ EV + K K
Sbjct: 68 DKGFEFKPSFDQYLQIMESVKTARKKKKFD---RLKVEED-DGGGGNGDSVYEVKDMKIK 127
Query: 147 VTDVQGNMDVKNLFKRVDRKDL--SNNSERITRR 179
G + + KR R+++ +ER+ +R
Sbjct: 128 ----SGELKDETFRKRYSRQEIVSDKRNERVFKR 153
BLAST of HG10021470.1 vs. TAIR 10
Match:
AT5G67570.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 406.0 bits (1042), Expect = 8.1e-113
Identity = 219/556 (39.39%), Positives = 334/556 (60.07%), Query Frame = 0
Query: 357 ERIQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEW 416
E +++L RL+G +I+ W F +MM + +++++ +L+++ LG+ +W++ V+ W
Sbjct: 183 EAVRVLVDRLSGREINEKHWKFVRMMNQSGLQFTEDQMLKIVDRLGRKQSWKQASAVVHW 242
Query: 417 LQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFNAMQQHFSSYPDLVAYHSIAVTLG 476
+ ++ K + RF+YT L VLG ARRP EAL +FN M YPD+ AYH IAVTLG
Sbjct: 243 VYSDKKRKHLRSRFVYTKLLSVLGFARRPQEALQIFNQMLGDRQLYPDMAAYHCIAVTLG 302
Query: 477 QAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFW 536
QAG ++EL VI+ MR P K K + WDP L+PD+V+YNA+LNACV + W
Sbjct: 303 QAGLLKELLKVIERMRQKPTKLTKNLRQKNWDPVLEPDLVVYNAILNACVPTLQWKAVSW 362
Query: 537 VLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKS-SIPNALTYKVLVNTLW 596
V EL+K GL+P+ +TYGL MEVMLE GK++ VH+FFRK++ S P A+TYKVLV LW
Sbjct: 363 VFVELRKNGLRPNGATYGLAMEVMLESGKFDRVHDFFRKMKSSGEAPKAITYKVLVRALW 422
Query: 597 KEGKTDEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVAN-KPLV 656
+EGK +EAV A+ +ME++G++G+ ++YY+ A CLC+ GR +A++++ ++ ++ N +PL
Sbjct: 423 REGKIEEAVEAVRDMEQKGVIGTGSVYYELACCLCNNGRWCDAMLEVGRMKRLENCRPLE 482
Query: 657 VTYTGLIQACLDSNDLQNAVYIFNHMKTFCSPNLVTYNILLKGYLDHGMFEEARELFQNL 716
+T+TGLI A L+ + + + IF +MK C PN+ T N++LK Y + MF EA+ELF+ +
Sbjct: 483 ITFTGLIAASLNGGHVDDCMAIFQYMKDKCDPNIGTANMMLKVYGRNDMFSEAKELFEEI 542
Query: 717 SEHGRNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMLLYGYHFNPKRHL 776
VS ++P+ Y ++ ML+AS +W+ F + Y M+L GY + +H
Sbjct: 543 ---------VSRKETHLVPNEYTYSFMLEASARSLQWEYFEHVYQTMVLSGYQMDQTKHA 602
Query: 777 RMILEAARAGKDELLETTWKHLAQADRTPPPALLKERFCMKLARGDYSEALSCISNHDSN 836
M++EA+RAGK LLE + + + P P E C A+GD+ A++ I N +
Sbjct: 603 SMLIEASRAGKWSLLEHAFDAVLEDGEIPHPLFFTELLCHATAKGDFQRAITLI-NTVAL 662
Query: 837 DAHHFSESAWLNLLKEKR--LPKDTVIQLIHMVSMLLTRND-SPNPVFQNLLFSCKEFCR 896
+ SE W +L +E + L +D +H +S L D P NL S K C
Sbjct: 663 ASFQISEEEWTDLFEEHQDWLTQDN----LHKLSDHLIECDYVSEPTVSNLSKSLKSRCG 722
Query: 897 SRISVADHRLEETVCT 908
S S A L V T
Sbjct: 723 SSSSSAQPLLAVDVTT 724
BLAST of HG10021470.1 vs. TAIR 10
Match:
AT2G17140.1 (Pentatricopeptide repeat (PPR) superfamily protein )
HSP 1 Score: 112.5 bits (280), Expect = 1.9e-24
Identity = 67/264 (25.38%), Positives = 127/264 (48.11%), Query Frame = 0
Query: 509 PRLQPDIVIYNAVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNL 568
P +P + +YN +L +C+K + +E W+ +++ G+ P T T+ L++ + + +
Sbjct: 106 PENKPSVYLYNLLLESCIKERRVEFVSWLYKDMVLCGIAPQTYTFNLLIRALCDSSCVDA 165
Query: 569 VHEFFRKV-QKSSIPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFAR 628
E F ++ +K PN T+ +LV K G TD+ + + ME G++ + +Y
Sbjct: 166 ARELFDEMPEKGCKPNEFTFGILVRGYCKAGLTDKGLELLNAMESFGVLPNKVIYNTIVS 225
Query: 629 CLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSNDLQNAVYIFNHMKT----- 688
C GR ++ +EK+ + P +VT+ I A + +A IF+ M+
Sbjct: 226 SFCREGRNDDSEKMVEKMREEGLVPDIVTFNSRISALCKEGKVLDASRIFSDMELDEYLG 285
Query: 689 FCSPNLVTYNILLKGYLDHGMFEEARELFQNLSE-------------------HGRNI-- 744
PN +TYN++LKG+ G+ E+A+ LF+++ E HG+ I
Sbjct: 286 LPRPNSITYNLMLKGFCKVGLLEDAKTLFESIRENDDLASLQSYNIWLQGLVRHGKFIEA 345
BLAST of HG10021470.1 vs. TAIR 10
Match:
AT3G04760.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )
HSP 1 Score: 111.3 bits (277), Expect = 4.1e-24
Identity = 102/421 (24.23%), Positives = 179/421 (42.52%), Query Frame = 0
Query: 490 SMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVLQELKKQGLQPS 549
++R+ PK LEK+ QPD+ YNA++N K ++ A VL ++ + P
Sbjct: 136 TLRNIPKAVRVMEILEKFG---QPDVFAYNALINGFCKMNRIDDATRVLDRMRSKDFSPD 195
Query: 550 TSTYGLVMEVMLECGKYNLVHEFFRKVQKSSI-PNALTYKVLVNTLWKEGKTDEAVLAIE 609
T TY +++ + GK +L + ++ + P +TY +L+ EG DEA+ ++
Sbjct: 196 TVTYNIMIGSLCSRGKLDLALKVLNQLLSDNCQPTVITYTILIEATMLEGGVDEALKLMD 255
Query: 610 NMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSN 669
M RG+ Y R +C G A + + +P V++Y L++A L+
Sbjct: 256 EMLSRGLKPDMFTYNTIIRGMCKEGMVDRAFEMVRNLELKGCEPDVISYNILLRALLNQG 315
Query: 670 DLQNAVYIFNHM-KTFCSPNLVTYNILLKGYLDHGMFEEARELFQNLSEHGRNISTVSDY 729
+ + M C PN+VTY+IL+ G EEA L + + E G
Sbjct: 316 KWEEGEKLMTKMFSEKCDPNVVTYSILITTLCRDGKIEEAMNLLKLMKEKG--------- 375
Query: 730 RDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMLLYGYHFNPKRHLRMILEAARAGK-D 789
+ PD Y ++ ++ A E R D F M+ G + + ++ + GK D
Sbjct: 376 ---LTPDAYSYDPLIAAFCREGRLDVAIEFLETMISDGCLPDIVNYNTVLATLCKNGKAD 435
Query: 790 ELLETTWKHLAQADRTPPPALLKERFCMKLARGDYSEALSCISNHDSN--DAHHFSESAW 849
+ LE K L + +P + F + GD AL I SN D + ++
Sbjct: 436 QALEIFGK-LGEVGCSPNSSSYNTMFSALWSSGDKIRALHMILEMMSNGIDPDEITYNSM 495
Query: 850 LNLLKEKRLPKDTVIQLIHMVSMLLTRNDSPNPVFQNLLFSCKEFCRSRISVADHRLEET 906
++ L + + + L+ M S P+ V N++ FC++ HR+E+
Sbjct: 496 ISCLCREGMVDEAFELLVDMRSC----EFHPSVVTYNIVL--LGFCKA------HRIEDA 528
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_038894404.1 | 0.0e+00 | 91.61 | pentatricopeptide repeat-containing protein At1g30610, chloroplastic isoform X1 ...
XP_031741862.1 | 0.0e+00 | 88.20 | pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucumis sa...
XP_008459122.1 | 0.0e+00 | 87.19 | PREDICTED: pentatricopeptide repeat-containing protein At1g30610, chloroplastic ...
KAG7019446.1 | 0.0e+00 | 86.49 | Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a...
XP_038894405.1 | 0.0e+00 | 86.71 | pentatricopeptide repeat-containing protein At1g30610, chloroplastic isoform X2 ...
Match Name | E-value | Identity | Description
Q9SA76 | 1.1e-207 | 47.95 | Pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Arabidop...
Q9FJW6 | 1.1e-111 | 39.39 | Pentatricopeptide repeat-containing protein At5g67570, chloroplastic OS=Arabidop...
Q0WPZ6 | 2.6e-23 | 25.38 | Pentatricopeptide repeat-containing protein At2g17140 OS=Arabidopsis thaliana OX...
Q9SR00 | 5.8e-23 | 24.23 | Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidop...
Q9S7R4 | 2.2e-22 | 22.93 | Pentatricopeptide repeat-containing protein At1g74900, mitochondrial OS=Arabidop...
Match Name | E-value | Identity | Description
A0A0A0LVN7 | 0.0e+00 | 88.20 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G553530 PE=4 SV=1
A0A1S3C8Z0 | 0.0e+00 | 87.19 | pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Cucumis ...
A0A6J1EH18 | 0.0e+00 | 86.06 | LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g30610, chlo...
A0A6J1KEH7 | 0.0e+00 | 86.38 | pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Cucurbit...
A0A6J1CLQ9 | 0.0e+00 | 83.88 | pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Momordic...
Match Name | E-value | Identity | Description
AT1G30610.2 | 1.0e-211 | 49.53 | pentatricopeptide (PPR) repeat-containing protein
AT1G30610.1 | 7.9e-209 | 47.95 | pentatricopeptide (PPR) repeat-containing protein
AT5G67570.1 | 8.1e-113 | 39.39 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G17140.1 | 1.9e-24 | 25.38 | Pentatricopeptide repeat (PPR) superfamily protein
AT3G04760.1 | 4.1e-24 | 24.23 | Pentatricopeptide repeat (PPR-like) superfamily protein
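The summary rows above can be loaded for filtering with a few lines of Python. This is a minimal sketch that assumes the pipe-delimited layout shown here (match name, E-value, percent identity, description); `parse_summary_row` is a hypothetical helper, and the 1e-100 cutoff is an arbitrary illustration rather than a recommended threshold.

```python
def parse_summary_row(line):
    """Split one 'Match Name | E-value | Identity | Description' row into a dict.

    Only the first four cells are used, so any extra trailing cells are ignored.
    """
    fields = [f.strip() for f in line.split("|")]
    name, evalue, identity, description = fields[:4]
    return {
        "name": name,
        "evalue": float(evalue),      # e.g. "1.0e-211" -> 1e-211
        "identity": float(identity),  # percent identity
        "description": description,
    }

# Rows copied from the TAIR 10 summary table above.
rows = [
    "AT1G30610.2 | 1.0e-211 | 49.53 | pentatricopeptide (PPR) repeat-containing protein",
    "AT1G30610.1 | 7.9e-209 | 47.95 | pentatricopeptide (PPR) repeat-containing protein",
    "AT5G67570.1 | 8.1e-113 | 39.39 | Tetratricopeptide repeat (TPR)-like superfamily protein",
    "AT2G17140.1 | 1.9e-24 | 25.38 | Pentatricopeptide repeat (PPR) superfamily protein",
    "AT3G04760.1 | 4.1e-24 | 24.23 | Pentatricopeptide repeat (PPR-like) superfamily protein",
]

hits = [parse_summary_row(r) for r in rows]

# Keep only hits below the illustrative E-value cutoff.
strong = [h["name"] for h in hits if h["evalue"] < 1e-100]
```

With this cutoff, `strong` keeps the two AT1G30610 isoforms and AT5G67570.1, and drops the two hits whose E-values are around 1e-24.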