Homology
BLAST of HG10018167 vs. NCBI nr
Match:
XP_038893977.1 (pentatricopeptide repeat-containing protein At2g01860 [Benincasa hispida] >XP_038893978.1 pentatricopeptide repeat-containing protein At2g01860 [Benincasa hispida])
HSP 1 Score: 894.8 bits (2311), Expect = 3.3e-256
Identity = 454/494 (91.90%), Postives = 468/494 (94.74%), Query Frame = 0
Query: 1 MDSIFSATLVSSILVKGNGGIGCQTMMAHFKTYSRRRPPKNLLCPRRNKLPPDPAVNQFL 60
MDSIFSAT VSSILVKGNGGIGCQ MAHFKT SRRR PKNLLCPRR KLPPDPAVNQFL
Sbjct: 1 MDSIFSATSVSSILVKGNGGIGCQATMAHFKTNSRRRLPKNLLCPRRAKLPPDPAVNQFL 60
Query: 61 NNKTSAPSP-FTDLISSETFQLPEGEDDEHEEIHAYDCRDNGVVWDSEEIEAISSLFQGR 120
NKTSAPSP TDLISSE FQLP+GEDDEHEEIHAYD +D VVWDS+EIEAISSLFQGR
Sbjct: 61 KNKTSAPSPSLTDLISSEIFQLPKGEDDEHEEIHAYDYKDTDVVWDSDEIEAISSLFQGR 120
Query: 121 IPQKPGKLNRDRPLPLPLPHKLRPPGLPNPKIRPRVVVSSRALLSKQVYKCPDFLIGLAR 180
IPQKPGKLNRDRPLPLPLPHKLRP GLP+PKIRPR++VSSRALLSKQVYK PDFLIGLAR
Sbjct: 121 IPQKPGKLNRDRPLPLPLPHKLRPSGLPDPKIRPRIMVSSRALLSKQVYKRPDFLIGLAR 180
Query: 181 EIRYLSPEENVSKVLNRWGPFLQKGSLSLTIKELGRMGLPDRALKTFCWAQEQPRLFPDD 240
IR LSPEENVSKVLNRWGPFLQKGSLSLTIKELG MGLPDRALKTF WAQEQPRLFPDD
Sbjct: 181 AIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALKTFSWAQEQPRLFPDD 240
Query: 241 RVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKLLVASKKG 300
RVLASTVEVLARNHELKVPL+LEEFTKLASRGVLEAMVRGFIKGGSLNLAWKLLVA+KK
Sbjct: 241 RVLASTVEVLARNHELKVPLDLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKLLVAAKKR 300
Query: 301 KRMLDPSVYVKLILELGKNPDKNVLVLTLLDELGQREALKLNQQDTTAIIKACTRLGKFE 360
KR+LDPSVYVKLILELGKNPDKNVLVLTLLDELGQREAL LNQQDTT IIK CTRLGKFE
Sbjct: 301 KRLLDPSVYVKLILELGKNPDKNVLVLTLLDELGQREALTLNQQDTTTIIKVCTRLGKFE 360
Query: 361 IAEKLYSWYVESGHEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDLPAYCVVI 420
IAEKLYSWYVESGHEPSVVMYTALVHSRYSD+KYREALSLVWEMEAANCPFDLPAY V+I
Sbjct: 361 IAEKLYSWYVESGHEPSVVMYTALVHSRYSDRKYREALSLVWEMEAANCPFDLPAYSVMI 420
Query: 421 KLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAGFV 480
KLFV LGDLSRAVRYFAKLKEAGFAPTYDVYR +ITIYLVSGRLAKCKEIYKEAENAGF+
Sbjct: 421 KLFVTLGDLSRAVRYFAKLKEAGFAPTYDVYRKMITIYLVSGRLAKCKEIYKEAENAGFI 480
Query: 481 MDKQITSMLLQAKR 494
MDKQITSMLLQ+KR
Sbjct: 481 MDKQITSMLLQSKR 494
BLAST of HG10018167 vs. NCBI nr
Match:
XP_004139567.1 (pentatricopeptide repeat-containing protein At2g01860 [Cucumis sativus] >XP_011654198.1 pentatricopeptide repeat-containing protein At2g01860 [Cucumis sativus] >XP_031739920.1 pentatricopeptide repeat-containing protein At2g01860 [Cucumis sativus] >XP_031739926.1 pentatricopeptide repeat-containing protein At2g01860 [Cucumis sativus] >KGN64877.1 hypothetical protein Csa_022712 [Cucumis sativus])
HSP 1 Score: 862.1 bits (2226), Expect = 2.4e-246
Identity = 439/495 (88.69%), Postives = 460/495 (92.93%), Query Frame = 0
Query: 1 MDSIFSATLVSSILVKGNGGIGCQTMMAHFKTYSRRRPPKNLLCPRRNKLPPDPAVNQFL 60
MDSI SAT VSSILVKGNGGIGCQ M HFK SRRRPPKNLLCPRR KLPP+PAVNQF
Sbjct: 1 MDSIVSATSVSSILVKGNGGIGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPNPAVNQFF 60
Query: 61 NNKTSAPS-PFTDLISSETFQLPEGEDDEHEEIHAYD-CRDNGVVWDSEEIEAISSLFQG 120
NNKTSAPS PFTDLISS+ FQ DEHEEIHA+D +D VVWDS+EIEAISSLFQG
Sbjct: 61 NNKTSAPSPPFTDLISSKIFQ------DEHEEIHAHDYTKDTDVVWDSDEIEAISSLFQG 120
Query: 121 RIPQKPGKLNRDRPLPLPLPHKLRPPGLPNPKIRPRVVVSSRALLSKQVYKCPDFLIGLA 180
RIPQKPGKLNR+RPLPLPLPHKLRPP LPNPKIRP VVSSRALLSKQVYK PDFLIGLA
Sbjct: 121 RIPQKPGKLNRERPLPLPLPHKLRPPRLPNPKIRPTTVVSSRALLSKQVYKRPDFLIGLA 180
Query: 181 REIRYLSPEENVSKVLNRWGPFLQKGSLSLTIKELGRMGLPDRALKTFCWAQEQPRLFPD 240
REIR LSPEENVSKVLNRWGPFLQKGSLSLTIKELG MGLPDRAL TFCWAQEQ RLFPD
Sbjct: 181 REIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALNTFCWAQEQHRLFPD 240
Query: 241 DRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKLLVASKK 300
DRVLASTVEVL+RNHELKV +NLEEFTKLASRGVLEAM+RGFI+GGSLNLAWKLLVA+KK
Sbjct: 241 DRVLASTVEVLSRNHELKVAVNLEEFTKLASRGVLEAMMRGFIRGGSLNLAWKLLVAAKK 300
Query: 301 GKRMLDPSVYVKLILELGKNPDKNVLVLTLLDELGQREALKLNQQDTTAIIKACTRLGKF 360
GKRMLDPSVYVKLILELGKNPDKN+LVLTLL+ELGQREALKLNQQD T I+K CTRLGKF
Sbjct: 301 GKRMLDPSVYVKLILELGKNPDKNMLVLTLLEELGQREALKLNQQDCTTIVKVCTRLGKF 360
Query: 361 EIAEKLYSWYVESGHEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDLPAYCVV 420
EIAEKLYSWYVESGHEPS+VMYTALVHSRYSD+KYREALSLVWEME+ NCPFDLPAY VV
Sbjct: 361 EIAEKLYSWYVESGHEPSIVMYTALVHSRYSDRKYREALSLVWEMESGNCPFDLPAYSVV 420
Query: 421 IKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAGF 480
IKLFVALGDLSRAVRYFAKLKEAGF+PTY+VYRN+ITIYLVSGRLAKCKEIYKEAENAGF
Sbjct: 421 IKLFVALGDLSRAVRYFAKLKEAGFSPTYNVYRNMITIYLVSGRLAKCKEIYKEAENAGF 480
Query: 481 VMDKQITSMLLQAKR 494
+MDKQITSMLLQAKR
Sbjct: 481 MMDKQITSMLLQAKR 489
BLAST of HG10018167 vs. NCBI nr
Match:
XP_008462173.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo] >XP_008462181.1 PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo] >XP_008462189.1 PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo] >XP_016902994.1 PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo] >XP_016902996.1 PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo])
HSP 1 Score: 859.8 bits (2220), Expect = 1.2e-245
Identity = 441/495 (89.09%), Postives = 458/495 (92.53%), Query Frame = 0
Query: 1 MDSIFSATLVSSILVKGNGGIGCQTMMAHFKTYSRRRPPKNLLCPRRNKLPPDPAVNQFL 60
M SI SAT VSSILVKGNGGIGCQ M HFK SRRRPPKNLLCPRR KLPPDPAVNQFL
Sbjct: 1 MHSIVSATSVSSILVKGNGGIGCQITMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFL 60
Query: 61 NNKTSAPSP-FTDLISSETFQLPEGEDDEHEEIHAYD-CRDNGVVWDSEEIEAISSLFQG 120
NNKTSAPSP FTDLISS+ FQ DEHEEIHAYD +D VVWDS+EIEAISSLFQG
Sbjct: 61 NNKTSAPSPSFTDLISSKIFQ------DEHEEIHAYDYTKDTDVVWDSDEIEAISSLFQG 120
Query: 121 RIPQKPGKLNRDRPLPLPLPHKLRPPGLPNPKIRPRVVVSSRALLSKQVYKCPDFLIGLA 180
RIPQKPGKLNR+RPLPLPLPHKLRPP LPNPKIRP VSSRALLSK+VYK PDFLIGLA
Sbjct: 121 RIPQKPGKLNRERPLPLPLPHKLRPPRLPNPKIRPTTTVSSRALLSKKVYKRPDFLIGLA 180
Query: 181 REIRYLSPEENVSKVLNRWGPFLQKGSLSLTIKELGRMGLPDRALKTFCWAQEQPRLFPD 240
R IR LSPEENVSKVLNRWGPFLQKGSLSLTIKELG MGLPDRALKTFCW QEQ RLFPD
Sbjct: 181 RAIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALKTFCWVQEQRRLFPD 240
Query: 241 DRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKLLVASKK 300
DRVLASTVEVL+RNHELKVP+NLEEFTKLASRGVLEAM+RGFIKGGSLNLAWKLLVA+KK
Sbjct: 241 DRVLASTVEVLSRNHELKVPVNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVAAKK 300
Query: 301 GKRMLDPSVYVKLILELGKNPDKNVLVLTLLDELGQREALKLNQQDTTAIIKACTRLGKF 360
GKRMLDPSVYVKLILELGKNPDKNVLVLTLL+ELGQREALKLNQQD+T IIK CTRL KF
Sbjct: 301 GKRMLDPSVYVKLILELGKNPDKNVLVLTLLEELGQREALKLNQQDSTTIIKVCTRLRKF 360
Query: 361 EIAEKLYSWYVESGHEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDLPAYCVV 420
EIAEKLY WYVESGHEPS+VMYTALVHSRYSD+KYREALSLVWEME+ANCPFDLPAY VV
Sbjct: 361 EIAEKLYCWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEMESANCPFDLPAYNVV 420
Query: 421 IKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAGF 480
IKLFVALGDLSRAVRYFAKLKEAGF+PTYDVYRN+ITIYLVSGRLAK KEIYKEAENAGF
Sbjct: 421 IKLFVALGDLSRAVRYFAKLKEAGFSPTYDVYRNMITIYLVSGRLAKSKEIYKEAENAGF 480
Query: 481 VMDKQITSMLLQAKR 494
+MDKQITSMLLQAKR
Sbjct: 481 IMDKQITSMLLQAKR 489
BLAST of HG10018167 vs. NCBI nr
Match:
KAG7020726.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 817.8 bits (2111), Expect = 5.2e-233
Identity = 416/501 (83.03%), Postives = 445/501 (88.82%), Query Frame = 0
Query: 1 MDSIFSATLVSSILVKGNGGIGCQTMMAHFKTYSRRRPPKNLLCPRRNKLPPDPAVNQFL 60
MDS+FS T +SSILVK NGGI CQ +AHF+T SRRRPPKNLL PRR KLPPDP VNQFL
Sbjct: 1 MDSLFSTTTISSILVKRNGGISCQIPVAHFQTNSRRRPPKNLLYPRRTKLPPDPGVNQFL 60
Query: 61 NNKTSAPSP---FTDLISSETFQLPEGEDDEHEE-----IHAYDCRDNGVVWDSEEIEAI 120
+TS P P F DLISSE LPE E DE EE A D D+ VVWDSEEIEAI
Sbjct: 61 KKRTSGPQPDTSFPDLISSEKIGLPEEELDEIEETAADNYFANDDNDSDVVWDSEEIEAI 120
Query: 121 SSLFQGRIPQKPGKLNRDRPLPLPLPHKLRPPGLPNPKIRPRVVVSSRALLSKQVYKCPD 180
+SLF+GRIPQKPGKLNR+RPLPLPLPHKLRPPGLPNPKIRPR VSSRAL+SKQVYK PD
Sbjct: 121 TSLFRGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRPRTAVSSRALMSKQVYKRPD 180
Query: 181 FLIGLAREIRYLSPEENVSKVLNRWGPFLQKGSLSLTIKELGRMGLPDRALKTFCWAQEQ 240
FLIGLAR IR L PEEN+SKVLNRW PFLQKGSLSLTIKELG MGL DRALKTFCW QEQ
Sbjct: 181 FLIGLARAIRDLKPEENMSKVLNRWAPFLQKGSLSLTIKELGHMGLADRALKTFCWVQEQ 240
Query: 241 PRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKL 300
PRL+PDDRVLASTVEVLARNHELK+P NL+EFTKLASRGVLEAM+RGFIKGG L+LAWKL
Sbjct: 241 PRLYPDDRVLASTVEVLARNHELKIPFNLDEFTKLASRGVLEAMMRGFIKGGRLSLAWKL 300
Query: 301 LVASKKGKRMLDPSVYVKLILELGKNPDKNVLVLTLLDELGQREALKLNQQDTTAIIKAC 360
LVA+K GKRMLDPSVYVKLILE+GKNPDKN+LVL LLDELGQREAL LNQQDT+AIIK
Sbjct: 301 LVAAKNGKRMLDPSVYVKLILEIGKNPDKNMLVLALLDELGQREALNLNQQDTSAIIKVS 360
Query: 361 TRLGKFEIAEKLYSWYVESGHEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDL 420
TRLGKFEIAE+LYSWYVESGHEPSVVMYTALVH+RYS++KYREALS+VWEMEAANCPFDL
Sbjct: 361 TRLGKFEIAERLYSWYVESGHEPSVVMYTALVHNRYSERKYREALSVVWEMEAANCPFDL 420
Query: 421 PAYCVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKE 480
PAY VV+KLFVALGDLSRAVRYFAKLKEAGF PTY +YRNLITIYL +GRLAKCKEIYKE
Sbjct: 421 PAYSVVMKLFVALGDLSRAVRYFAKLKEAGFTPTYCIYRNLITIYLAAGRLAKCKEIYKE 480
Query: 481 AENAGFVMDKQITSMLLQAKR 494
AENAG+VMDKQITSMLLQAKR
Sbjct: 481 AENAGYVMDKQITSMLLQAKR 501
BLAST of HG10018167 vs. NCBI nr
Match:
XP_022951807.1 (pentatricopeptide repeat-containing protein At2g01860 isoform X1 [Cucurbita moschata])
HSP 1 Score: 815.1 bits (2104), Expect = 3.3e-232
Identity = 416/501 (83.03%), Postives = 444/501 (88.62%), Query Frame = 0
Query: 1 MDSIFSATLVSSILVKGNGGIGCQTMMAHFKTYSRRRPPKNLLCPRRNKLPPDPAVNQFL 60
MDS+FS T +SSILVK NGGI CQ +AHF+T SRRRPPKNLL PRR KLPPDP VNQFL
Sbjct: 1 MDSLFSTTTISSILVKRNGGISCQIPVAHFQTNSRRRPPKNLLYPRRTKLPPDPGVNQFL 60
Query: 61 NNKTSAPSP---FTDLISSETFQLPEGEDDEHEE-----IHAYDCRDNGVVWDSEEIEAI 120
+TS P P F DLISSE LPE E DE EE A D D+ VVWDSEEIEAI
Sbjct: 61 KKRTSGPQPDTSFPDLISSEKIGLPEEELDEIEETAADNYFANDDNDSDVVWDSEEIEAI 120
Query: 121 SSLFQGRIPQKPGKLNRDRPLPLPLPHKLRPPGLPNPKIRPRVVVSSRALLSKQVYKCPD 180
+SLF+GRIPQKPGKLNR+RPLPLPLPHKLRPPGLPNPKIRPR VSSRAL+SKQVYK PD
Sbjct: 121 TSLFRGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRPRTAVSSRALMSKQVYKRPD 180
Query: 181 FLIGLAREIRYLSPEENVSKVLNRWGPFLQKGSLSLTIKELGRMGLPDRALKTFCWAQEQ 240
FLIGLAR IR L PEENVSKVLNRW PFLQKGSLSLTIKELG MGL DRALKTFCW QEQ
Sbjct: 181 FLIGLARAIRDLKPEENVSKVLNRWAPFLQKGSLSLTIKELGHMGLADRALKTFCWVQEQ 240
Query: 241 PRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKL 300
PRL+PDDRVLASTVEVLARNHELK+P NL+EFTKLASRGVLEAM+RGFIKGG L+LAWKL
Sbjct: 241 PRLYPDDRVLASTVEVLARNHELKIPFNLDEFTKLASRGVLEAMMRGFIKGGRLSLAWKL 300
Query: 301 LVASKKGKRMLDPSVYVKLILELGKNPDKNVLVLTLLDELGQREALKLNQQDTTAIIKAC 360
LVA+K GKRMLDPSVYVKLILE+GKNPDKN+LVL LLDELGQREAL LNQQDT+AIIK
Sbjct: 301 LVAAKNGKRMLDPSVYVKLILEIGKNPDKNMLVLALLDELGQREALNLNQQDTSAIIKVS 360
Query: 361 TRLGKFEIAEKLYSWYVESGHEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDL 420
TRLGKFEIAE+LYSWYVESGHEPSVVMYTALVH+RYS++KYREALS+VWEMEAAN PFDL
Sbjct: 361 TRLGKFEIAERLYSWYVESGHEPSVVMYTALVHNRYSERKYREALSVVWEMEAANSPFDL 420
Query: 421 PAYCVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKE 480
PAY VV+KLFVALGDLSRAVRYFAKLKEAGF PTY +YRNLITIYL +GRLAKCKEIYKE
Sbjct: 421 PAYSVVMKLFVALGDLSRAVRYFAKLKEAGFTPTYCIYRNLITIYLAAGRLAKCKEIYKE 480
Query: 481 AENAGFVMDKQITSMLLQAKR 494
AENAG+VMDKQITSMLLQAKR
Sbjct: 481 AENAGYVMDKQITSMLLQAKR 501
BLAST of HG10018167 vs. ExPASy Swiss-Prot
Match:
Q5XET4 (Pentatricopeptide repeat-containing protein At2g01860 OS=Arabidopsis thaliana OX=3702 GN=EMB975 PE=2 SV=1)
HSP 1 Score: 502.3 bits (1292), Expect = 6.3e-141
Identity = 271/478 (56.69%), Postives = 349/478 (73.01%), Query Frame = 0
Query: 19 GGIGCQTMMAHFKTYSRRRPPKNLLCPRRNKLPPDPAVNQFLNNKTSAPSPFTDLISSET 78
G IG + A + +S++ KNL PRR KLPPD VN FL P +I +
Sbjct: 23 GNIGVTRVNASQRNHSKKL-TKNLRNPRRTKLPPDFGVNLFLRKPKIEPL----VIDDDD 82
Query: 79 FQLPEGEDDEHEEIHAYDCRDNGVVWDSEEIEAISSLFQGRIPQKPGKLNRDRPLPLPLP 138
Q+ E +D+ D+ VVW+ EEIEAISSLFQ RIPQKP K +R RPLPLP P
Sbjct: 83 EQVQESVNDD----------DDAVVWEPEEIEAISSLFQKRIPQKPDKPSRVRPLPLPQP 142
Query: 139 HKLRPPGLPNPKIRPRVVVSSRAL--LSKQVYKCPDFLIGLAREIRYL-SPEENVSKVLN 198
HKLRP GLP PK + ++ S AL +SKQVYK P FLIGLAREI+ L S + +VS VLN
Sbjct: 143 HKLRPLGLPTPK---KNIIRSPALSSVSKQVYKDPSFLIGLAREIKSLPSSDADVSLVLN 202
Query: 199 RWGPFLQKGSLSLTIKELGRMGLPDRALKTFCWAQEQPRLFPDDRVLASTVEVLARNHEL 258
+W FL+KGSLS TI+ELG MGLP+RAL+T+ WA++ L PD+R+LAST++VLA++HEL
Sbjct: 203 KWVSFLRKGSLSTTIRELGHMGLPERALQTYHWAEKHSHLVPDNRILASTIQVLAKHHEL 262
Query: 259 KVPLNLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKLLVASKKGKRMLDPSVYVKLILEL 318
K+ L+ LAS+ V+EAM++G I+GG LNLA KL++ SK R+LD SVYVK+ILE+
Sbjct: 263 KL---LKFDNSLASKNVIEAMIKGCIEGGWLNLARKLILISKSNNRILDSSVYVKMILEI 322
Query: 319 GKNPDKNVLVLTLLDELGQREALKLNQQDTTAIIKACTRLGKFEIAEKLYSWYVESGHEP 378
KNPDK LV+ LL+EL +RE LKL+QQD T+I+K C +LG+FE+ E L+ W+ S EP
Sbjct: 323 AKNPDKYHLVVALLEELKKREDLKLSQQDCTSIMKICVKLGEFELVESLFDWFKASNREP 382
Query: 379 SVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDLPAYCVVIKLFVALGDLSRAVRYF 438
SVVMYT ++HSRYS++KYREA+S+VWEME +NC DLPAY VVIKLFVAL DL RA+RY+
Sbjct: 383 SVVMYTTMIHSRYSEQKYREAMSVVWEMEESNCLLDLPAYRVVIKLFVALDDLGRAMRYY 442
Query: 439 AKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAGFVMDKQITSMLLQAKR 494
+KLKEAGF+PTYD+YR++I++Y SGRL KCKEI KE E+AG +DK + LLQ ++
Sbjct: 443 SKLKEAGFSPTYDIYRDMISVYTASGRLTKCKEICKEVEDAGLRLDKDTSFRLLQLEK 479
BLAST of HG10018167 vs. ExPASy Swiss-Prot
Match:
Q9S7Q2 (Pentatricopeptide repeat-containing protein At1g74850, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PTAC2 PE=2 SV=1)
HSP 1 Score: 75.5 bits (184), Expect = 1.9e-12
Identity = 68/322 (21.12%), Postives = 145/322 (45.03%), Query Frame = 0
Query: 177 LAREIRYLSPEENVSKVLNRWGPFLQKGSLSLTIKELGRMGLPDRALKTFCWAQEQPRLF 236
L ++ L P ++++ L+ + L +L KE G R+L+ F + Q Q
Sbjct: 79 LINKLSSLPPRGSIARCLDIFKNKLSLNDFALVFKEFAGRGDWQRSLRLFKYMQRQIWCK 138
Query: 237 PDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGV------LEAMVRGFIKGGSLNLAW 296
P++ + + +L R E + LE F ++ S+GV A++ + + G +
Sbjct: 139 PNEHIYTIMISLLGR--EGLLDKCLEVFDEMPSQGVSRSVFSYTALINAYGRNGRYETSL 198
Query: 297 KLLVASKKGKRMLDPSV--YVKLILELGKNPDKNVLVLTLLDELGQREALKLNQQDTTAI 356
+LL K K + PS+ Y +I + +L L E+ + E ++ + +
Sbjct: 199 ELLDRMKNEK--ISPSILTYNTVINACARGGLDWEGLLGLFAEM-RHEGIQPDIVTYNTL 258
Query: 357 IKACTRLGKFEIAEKLYSWYVESGHEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANC 416
+ AC G + AE ++ + G P + Y+ LV + ++ + L+ EM +
Sbjct: 259 LSACAIRGLGDEAEMVFRTMNDGGIVPDLTTYSHLVETFGKLRRLEKVCDLLGEMASGGS 318
Query: 417 PFDLPAYCVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKE 476
D+ +Y V+++ + G + A+ F +++ AG P + Y L+ ++ SGR ++
Sbjct: 319 LPDITSYNVLLEAYAKSGSIKEAMGVFHQMQAAGCTPNANTYSVLLNLFGQSGRYDDVRQ 378
Query: 477 IYKEAENAGFVMDKQITSMLLQ 491
++ E +++ D ++L++
Sbjct: 379 LFLEMKSSNTDPDAATYNILIE 395
BLAST of HG10018167 vs. ExPASy Swiss-Prot
Match:
O64624 (Pentatricopeptide repeat-containing protein At2g18940, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At2g18940 PE=2 SV=1)
HSP 1 Score: 72.0 bits (175), Expect = 2.1e-11
Identity = 68/291 (23.37%), Postives = 126/291 (43.30%), Query Frame = 0
Query: 201 LQKGSLSLTIKELGRMGLPDRALKTFCW---AQEQPRLFPDDRVLASTVEVLARNHELKV 260
L + L +K L G +RA+ F W + L D +V+ V +L R + V
Sbjct: 134 LLRTDLVSLVKGLDDSGHWERAVFLFEWLVLSSNSGALKLDHQVIEIFVRILGRESQYSV 193
Query: 261 ------PLNLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKLLVASKK---GKRMLDPSVY 320
+ L+E+ L ++ + + G A L K+ ++ +V
Sbjct: 194 AAKLLDKIPLQEY--LLDVRAYTTILHAYSRTGKYEKAIDLFERMKEMGPSPTLVTYNVI 253
Query: 321 VKLILELGKNPDKNVLVLTLLDELGQREALKLNQQDTTAIIKACTRLGKFEIAEKLYSWY 380
+ + ++G++ K +L +LDE+ + + LK ++ + ++ AC R G A++ ++
Sbjct: 254 LDVFGKMGRSWRK---ILGVLDEM-RSKGLKFDEFTCSTVLSACAREGLLREAKEFFAEL 313
Query: 381 VESGHEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDLPAYCVVIKLFVALGDL 440
G+EP V Y AL+ Y EALS++ EME +CP D Y ++ +V G
Sbjct: 314 KSCGYEPGTVTYNALLQVFGKAGVYTEALSVLKEMEENSCPADSVTYNELVAAYVRAGFS 373
Query: 441 SRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAGFV 480
A + + G P Y +I Y +G+ + +++ + AG V
Sbjct: 374 KEAAGVIEMMTKKGVMPNAITYTTVIDAYGKAGKEDEALKLFYSMKEAGCV 418
BLAST of HG10018167 vs. ExPASy Swiss-Prot
Match:
Q9ZUA2 (Pentatricopeptide repeat-containing protein At2g01740 OS=Arabidopsis thaliana OX=3702 GN=At2g01740 PE=3 SV=1)
HSP 1 Score: 67.4 bits (163), Expect = 5.2e-10
Identity = 53/210 (25.24%), Postives = 91/210 (43.33%), Query Frame = 0
Query: 280 FIKGGSLNLAWKLLVASKKGKRMLDPSVYVKLILELGKNPDKNVLVLTLLDELGQREALK 339
F K G L LA K + K+ + + LI K D V V +L E+ +R +
Sbjct: 173 FCKSGELQLALKSFHSMKRDALSPNVVTFTCLIDGYCKAGDLEVAV-SLYKEM-RRVRMS 232
Query: 340 LNQQDTTAIIKACTRLGKFEIAEKLYSWYVESGHEPSVVMYTALVHSRYSDKKYREALSL 399
LN TA+I + G+ + AE++YS VE EP+ ++YT ++ + A+
Sbjct: 233 LNVVTYTALIDGFCKKGEMQRAEEMYSRMVEDRVEPNSLVYTTIIDGFFQRGDSDNAMKF 292
Query: 400 VWEMEAANCPFDLPAYCVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLV 459
+ +M D+ AY V+I G L A ++++ P ++ ++ Y
Sbjct: 293 LAKMLNQGMRLDITAYGVIISGLCGNGKLKEATEIVEDMEKSDLVPDMVIFTTMMNAYFK 352
Query: 460 SGRLAKCKEIYKEAENAGFVMDKQITSMLL 490
SGR+ +Y + GF D S ++
Sbjct: 353 SGRMKAAVNMYHKLIERGFEPDVVALSTMI 380
BLAST of HG10018167 vs. ExPASy Swiss-Prot
Match:
Q66GP4 (Pentatricopeptide repeat-containing protein At5g13770, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At5g13770 PE=2 SV=1)
HSP 1 Score: 67.4 bits (163), Expect = 5.2e-10
Identity = 58/237 (24.47%), Postives = 112/237 (47.26%), Query Frame = 0
Query: 261 LEEFTKLASRGVLEA------MVRGFIKGGSLNLAWKLLVASKKGKRMLDPSVYVKLILE 320
LE ++ +G+ E+ ++R F + + + KL + K + DP + +K++L
Sbjct: 268 LEVLEEMKDKGIPESSELYSMLIRAFAEAREVVITEKLFKEAGGKKLLKDPEMCLKVVLM 327
Query: 321 LGK--NPDKNVLVLTLLDELGQREALKLNQQDTTAIIKACTRLGKFEIAEKLYSWYVESG 380
+ N + + V+ + ++ LK+ AI+ ++ F A K+Y W ++
Sbjct: 328 YVREGNMETTLEVVAAM----RKAELKVTDCILCAIVNGFSKQRGFAEAVKVYEWAMKEE 387
Query: 381 HEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDLPAYCVVIKLFVALGDLSRAV 440
E V Y +++ +KY +A L EM + AY ++ ++ LS AV
Sbjct: 388 CEAGQVTYAIAINAYCRLEKYNKAEMLFDEMVKKGFDKCVVAYSNIMDMYGKTRRLSDAV 447
Query: 441 RYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAGFVMDK-QITSML 489
R AK+K+ G P +Y +LI ++ + L + ++I+KE + A + DK TSM+
Sbjct: 448 RLMAKMKQRGCKPNIWIYNSLIDMHGRAMDLRRAEKIWKEMKRAKVLPDKVSYTSMI 500
BLAST of HG10018167 vs. ExPASy TrEMBL
Match:
A0A0A0LVM0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G144300 PE=4 SV=1)
HSP 1 Score: 862.1 bits (2226), Expect = 1.2e-246
Identity = 439/495 (88.69%), Postives = 460/495 (92.93%), Query Frame = 0
Query: 1 MDSIFSATLVSSILVKGNGGIGCQTMMAHFKTYSRRRPPKNLLCPRRNKLPPDPAVNQFL 60
MDSI SAT VSSILVKGNGGIGCQ M HFK SRRRPPKNLLCPRR KLPP+PAVNQF
Sbjct: 1 MDSIVSATSVSSILVKGNGGIGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPNPAVNQFF 60
Query: 61 NNKTSAPS-PFTDLISSETFQLPEGEDDEHEEIHAYD-CRDNGVVWDSEEIEAISSLFQG 120
NNKTSAPS PFTDLISS+ FQ DEHEEIHA+D +D VVWDS+EIEAISSLFQG
Sbjct: 61 NNKTSAPSPPFTDLISSKIFQ------DEHEEIHAHDYTKDTDVVWDSDEIEAISSLFQG 120
Query: 121 RIPQKPGKLNRDRPLPLPLPHKLRPPGLPNPKIRPRVVVSSRALLSKQVYKCPDFLIGLA 180
RIPQKPGKLNR+RPLPLPLPHKLRPP LPNPKIRP VVSSRALLSKQVYK PDFLIGLA
Sbjct: 121 RIPQKPGKLNRERPLPLPLPHKLRPPRLPNPKIRPTTVVSSRALLSKQVYKRPDFLIGLA 180
Query: 181 REIRYLSPEENVSKVLNRWGPFLQKGSLSLTIKELGRMGLPDRALKTFCWAQEQPRLFPD 240
REIR LSPEENVSKVLNRWGPFLQKGSLSLTIKELG MGLPDRAL TFCWAQEQ RLFPD
Sbjct: 181 REIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALNTFCWAQEQHRLFPD 240
Query: 241 DRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKLLVASKK 300
DRVLASTVEVL+RNHELKV +NLEEFTKLASRGVLEAM+RGFI+GGSLNLAWKLLVA+KK
Sbjct: 241 DRVLASTVEVLSRNHELKVAVNLEEFTKLASRGVLEAMMRGFIRGGSLNLAWKLLVAAKK 300
Query: 301 GKRMLDPSVYVKLILELGKNPDKNVLVLTLLDELGQREALKLNQQDTTAIIKACTRLGKF 360
GKRMLDPSVYVKLILELGKNPDKN+LVLTLL+ELGQREALKLNQQD T I+K CTRLGKF
Sbjct: 301 GKRMLDPSVYVKLILELGKNPDKNMLVLTLLEELGQREALKLNQQDCTTIVKVCTRLGKF 360
Query: 361 EIAEKLYSWYVESGHEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDLPAYCVV 420
EIAEKLYSWYVESGHEPS+VMYTALVHSRYSD+KYREALSLVWEME+ NCPFDLPAY VV
Sbjct: 361 EIAEKLYSWYVESGHEPSIVMYTALVHSRYSDRKYREALSLVWEMESGNCPFDLPAYSVV 420
Query: 421 IKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAGF 480
IKLFVALGDLSRAVRYFAKLKEAGF+PTY+VYRN+ITIYLVSGRLAKCKEIYKEAENAGF
Sbjct: 421 IKLFVALGDLSRAVRYFAKLKEAGFSPTYNVYRNMITIYLVSGRLAKCKEIYKEAENAGF 480
Query: 481 VMDKQITSMLLQAKR 494
+MDKQITSMLLQAKR
Sbjct: 481 MMDKQITSMLLQAKR 489
BLAST of HG10018167 vs. ExPASy TrEMBL
Match:
A0A1S3CGD0 (pentatricopeptide repeat-containing protein At2g01860 OS=Cucumis melo OX=3656 GN=LOC103500594 PE=4 SV=1)
HSP 1 Score: 859.8 bits (2220), Expect = 5.7e-246
Identity = 441/495 (89.09%), Postives = 458/495 (92.53%), Query Frame = 0
Query: 1 MDSIFSATLVSSILVKGNGGIGCQTMMAHFKTYSRRRPPKNLLCPRRNKLPPDPAVNQFL 60
M SI SAT VSSILVKGNGGIGCQ M HFK SRRRPPKNLLCPRR KLPPDPAVNQFL
Sbjct: 1 MHSIVSATSVSSILVKGNGGIGCQITMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFL 60
Query: 61 NNKTSAPSP-FTDLISSETFQLPEGEDDEHEEIHAYD-CRDNGVVWDSEEIEAISSLFQG 120
NNKTSAPSP FTDLISS+ FQ DEHEEIHAYD +D VVWDS+EIEAISSLFQG
Sbjct: 61 NNKTSAPSPSFTDLISSKIFQ------DEHEEIHAYDYTKDTDVVWDSDEIEAISSLFQG 120
Query: 121 RIPQKPGKLNRDRPLPLPLPHKLRPPGLPNPKIRPRVVVSSRALLSKQVYKCPDFLIGLA 180
RIPQKPGKLNR+RPLPLPLPHKLRPP LPNPKIRP VSSRALLSK+VYK PDFLIGLA
Sbjct: 121 RIPQKPGKLNRERPLPLPLPHKLRPPRLPNPKIRPTTTVSSRALLSKKVYKRPDFLIGLA 180
Query: 181 REIRYLSPEENVSKVLNRWGPFLQKGSLSLTIKELGRMGLPDRALKTFCWAQEQPRLFPD 240
R IR LSPEENVSKVLNRWGPFLQKGSLSLTIKELG MGLPDRALKTFCW QEQ RLFPD
Sbjct: 181 RAIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALKTFCWVQEQRRLFPD 240
Query: 241 DRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKLLVASKK 300
DRVLASTVEVL+RNHELKVP+NLEEFTKLASRGVLEAM+RGFIKGGSLNLAWKLLVA+KK
Sbjct: 241 DRVLASTVEVLSRNHELKVPVNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVAAKK 300
Query: 301 GKRMLDPSVYVKLILELGKNPDKNVLVLTLLDELGQREALKLNQQDTTAIIKACTRLGKF 360
GKRMLDPSVYVKLILELGKNPDKNVLVLTLL+ELGQREALKLNQQD+T IIK CTRL KF
Sbjct: 301 GKRMLDPSVYVKLILELGKNPDKNVLVLTLLEELGQREALKLNQQDSTTIIKVCTRLRKF 360
Query: 361 EIAEKLYSWYVESGHEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDLPAYCVV 420
EIAEKLY WYVESGHEPS+VMYTALVHSRYSD+KYREALSLVWEME+ANCPFDLPAY VV
Sbjct: 361 EIAEKLYCWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEMESANCPFDLPAYNVV 420
Query: 421 IKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAGF 480
IKLFVALGDLSRAVRYFAKLKEAGF+PTYDVYRN+ITIYLVSGRLAK KEIYKEAENAGF
Sbjct: 421 IKLFVALGDLSRAVRYFAKLKEAGFSPTYDVYRNMITIYLVSGRLAKSKEIYKEAENAGF 480
Query: 481 VMDKQITSMLLQAKR 494
+MDKQITSMLLQAKR
Sbjct: 481 IMDKQITSMLLQAKR 489
BLAST of HG10018167 vs. ExPASy TrEMBL
Match:
A0A6J1GIP2 (pentatricopeptide repeat-containing protein At2g01860 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111454537 PE=4 SV=1)
HSP 1 Score: 815.1 bits (2104), Expect = 1.6e-232
Identity = 416/501 (83.03%), Postives = 444/501 (88.62%), Query Frame = 0
Query: 1 MDSIFSATLVSSILVKGNGGIGCQTMMAHFKTYSRRRPPKNLLCPRRNKLPPDPAVNQFL 60
MDS+FS T +SSILVK NGGI CQ +AHF+T SRRRPPKNLL PRR KLPPDP VNQFL
Sbjct: 1 MDSLFSTTTISSILVKRNGGISCQIPVAHFQTNSRRRPPKNLLYPRRTKLPPDPGVNQFL 60
Query: 61 NNKTSAPSP---FTDLISSETFQLPEGEDDEHEE-----IHAYDCRDNGVVWDSEEIEAI 120
+TS P P F DLISSE LPE E DE EE A D D+ VVWDSEEIEAI
Sbjct: 61 KKRTSGPQPDTSFPDLISSEKIGLPEEELDEIEETAADNYFANDDNDSDVVWDSEEIEAI 120
Query: 121 SSLFQGRIPQKPGKLNRDRPLPLPLPHKLRPPGLPNPKIRPRVVVSSRALLSKQVYKCPD 180
+SLF+GRIPQKPGKLNR+RPLPLPLPHKLRPPGLPNPKIRPR VSSRAL+SKQVYK PD
Sbjct: 121 TSLFRGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRPRTAVSSRALMSKQVYKRPD 180
Query: 181 FLIGLAREIRYLSPEENVSKVLNRWGPFLQKGSLSLTIKELGRMGLPDRALKTFCWAQEQ 240
FLIGLAR IR L PEENVSKVLNRW PFLQKGSLSLTIKELG MGL DRALKTFCW QEQ
Sbjct: 181 FLIGLARAIRDLKPEENVSKVLNRWAPFLQKGSLSLTIKELGHMGLADRALKTFCWVQEQ 240
Query: 241 PRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKL 300
PRL+PDDRVLASTVEVLARNHELK+P NL+EFTKLASRGVLEAM+RGFIKGG L+LAWKL
Sbjct: 241 PRLYPDDRVLASTVEVLARNHELKIPFNLDEFTKLASRGVLEAMMRGFIKGGRLSLAWKL 300
Query: 301 LVASKKGKRMLDPSVYVKLILELGKNPDKNVLVLTLLDELGQREALKLNQQDTTAIIKAC 360
LVA+K GKRMLDPSVYVKLILE+GKNPDKN+LVL LLDELGQREAL LNQQDT+AIIK
Sbjct: 301 LVAAKNGKRMLDPSVYVKLILEIGKNPDKNMLVLALLDELGQREALNLNQQDTSAIIKVS 360
Query: 361 TRLGKFEIAEKLYSWYVESGHEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDL 420
TRLGKFEIAE+LYSWYVESGHEPSVVMYTALVH+RYS++KYREALS+VWEMEAAN PFDL
Sbjct: 361 TRLGKFEIAERLYSWYVESGHEPSVVMYTALVHNRYSERKYREALSVVWEMEAANSPFDL 420
Query: 421 PAYCVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKE 480
PAY VV+KLFVALGDLSRAVRYFAKLKEAGF PTY +YRNLITIYL +GRLAKCKEIYKE
Sbjct: 421 PAYSVVMKLFVALGDLSRAVRYFAKLKEAGFTPTYCIYRNLITIYLAAGRLAKCKEIYKE 480
Query: 481 AENAGFVMDKQITSMLLQAKR 494
AENAG+VMDKQITSMLLQAKR
Sbjct: 481 AENAGYVMDKQITSMLLQAKR 501
BLAST of HG10018167 vs. ExPASy TrEMBL
Match:
A0A6J1KK31 (pentatricopeptide repeat-containing protein At2g01860 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111495984 PE=4 SV=1)
HSP 1 Score: 810.4 bits (2092), Expect = 4.0e-231
Identity = 414/501 (82.63%), Postives = 442/501 (88.22%), Query Frame = 0
Query: 1 MDSIFSATLVSSILVKGNGGIGCQTMMAHFKTYSRRRPPKNLLCPRRNKLPPDPAVNQFL 60
MDS+FS T VSSILVK NGGI CQ MAHF T S+RRPPKNLL PRR KLPPDP VNQFL
Sbjct: 1 MDSLFSTTAVSSILVKRNGGISCQIPMAHFLTNSKRRPPKNLLYPRRTKLPPDPGVNQFL 60
Query: 61 NNKTSAPSP---FTDLISSETFQLPEGEDDEHEE-----IHAYDCRDNGVVWDSEEIEAI 120
+TS P P + DLI SE LPE E DE EE A D D+ +VWD EEIEAI
Sbjct: 61 KKRTSDPHPDTSYPDLIPSEKIGLPEEELDELEETAADNYFANDDNDSDIVWDPEEIEAI 120
Query: 121 SSLFQGRIPQKPGKLNRDRPLPLPLPHKLRPPGLPNPKIRPRVVVSSRALLSKQVYKCPD 180
+SLF+GRIPQKPGKLNR+RPLPLPLPHKLRPPGLPNPKIRPR VSSRAL+SKQVYK PD
Sbjct: 121 TSLFRGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRPRTAVSSRALMSKQVYKRPD 180
Query: 181 FLIGLAREIRYLSPEENVSKVLNRWGPFLQKGSLSLTIKELGRMGLPDRALKTFCWAQEQ 240
FLIGLAR IR L PEENVSKVLNRW PFLQKGSLSLTIKELG MGL DRALKTFCW QEQ
Sbjct: 181 FLIGLARAIRDLQPEENVSKVLNRWAPFLQKGSLSLTIKELGHMGLADRALKTFCWVQEQ 240
Query: 241 PRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKL 300
PRL+PDDRVLASTVEVLARNHELK+P NL+EFTKLASRGVLEAM+RGFIKGG L+LAWKL
Sbjct: 241 PRLYPDDRVLASTVEVLARNHELKIPFNLDEFTKLASRGVLEAMMRGFIKGGRLSLAWKL 300
Query: 301 LVASKKGKRMLDPSVYVKLILELGKNPDKNVLVLTLLDELGQREALKLNQQDTTAIIKAC 360
LVA+K GKRMLDPSV+VKLILE+GKNPDKN+LVL LLDELGQREAL L+QQDT+AIIK
Sbjct: 301 LVAAKNGKRMLDPSVHVKLILEIGKNPDKNMLVLALLDELGQREALNLSQQDTSAIIKVS 360
Query: 361 TRLGKFEIAEKLYSWYVESGHEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDL 420
TRLGKFEIAEKLYSWYVESGHEPSVVMYTALVH+RYS++KYREALS+VWEMEAANCPFDL
Sbjct: 361 TRLGKFEIAEKLYSWYVESGHEPSVVMYTALVHNRYSERKYREALSVVWEMEAANCPFDL 420
Query: 421 PAYCVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKE 480
PAY VVIKLFVALGDLSRAVRYFAKLKEAGF PTY +YRNLITIYL +GRLAKCKEIYKE
Sbjct: 421 PAYSVVIKLFVALGDLSRAVRYFAKLKEAGFTPTYCIYRNLITIYLAAGRLAKCKEIYKE 480
Query: 481 AENAGFVMDKQITSMLLQAKR 494
AENAG+VMDKQITSMLLQAKR
Sbjct: 481 AENAGYVMDKQITSMLLQAKR 501
BLAST of HG10018167 vs. ExPASy TrEMBL
Match:
A0A6J1DI37 (pentatricopeptide repeat-containing protein At2g01860 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111020698 PE=4 SV=1)
HSP 1 Score: 805.8 bits (2080), Expect = 9.8e-230
Identity = 409/495 (82.63%), Postives = 439/495 (88.69%), Query Frame = 0
Query: 1 MDSIFSATLVSSILVKGNGGIGCQTMMAHFKTYSRRRPPKNLLCPRRNKLPPDPAVNQFL 60
MD IFS+++VSSI+VKGNGGI CQ MA F +RRR PKNLL PRR KLPPDP VNQFL
Sbjct: 1 MDCIFSSSIVSSIMVKGNGGISCQISMARFMANARRRLPKNLLNPRRTKLPPDPGVNQFL 60
Query: 61 NNKTSAPSP-FTDLISSETFQLPEGEDDEHEEI----HAYDCRDNGVVWDSEEIEAISSL 120
N TS P FTD SSE + PE E D+HEE + D +D ++WDS+EIEAISSL
Sbjct: 61 KNTTSGSGPSFTDFTSSEKIEFPEEEHDDHEEADTENYFVDDKDGEIIWDSDEIEAISSL 120
Query: 121 FQGRIPQKPGKLNRDRPLPLPLPHKLRPPGLPNPKIRPRVVVSSRALLSKQVYKCPDFLI 180
FQGRIPQKPGKLNR+RPLPLPLPHKLRPPGLPNPKIR R V SRA LSKQVYK PDFLI
Sbjct: 121 FQGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRARTGVPSRASLSKQVYKRPDFLI 180
Query: 181 GLAREIRYLSPEENVSKVLNRWGPFLQKGSLSLTIKELGRMGLPDRALKTFCWAQEQPRL 240
GLAR IR LS EENVSKVLNRW PFL KGSLSLTI+ELG MGL DRAL++FCWAQEQPRL
Sbjct: 181 GLARAIRDLSREENVSKVLNRWAPFLLKGSLSLTIRELGHMGLADRALQSFCWAQEQPRL 240
Query: 241 FPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKLLVA 300
FPDDRVLASTVEVL+RNHELKVPLNLEEFT+LASRGVLEAM+RGFIKGGSLNLAWKLLV
Sbjct: 241 FPDDRVLASTVEVLSRNHELKVPLNLEEFTRLASRGVLEAMIRGFIKGGSLNLAWKLLVV 300
Query: 301 SKKGKRMLDPSVYVKLILELGKNPDKNVLVLTLLDELGQREALKLNQQDTTAIIKACTRL 360
+KKG RMLDPSVYVKLILELGKNPDKN+LVLTLLDELGQREALKLNQQDTTAI+K CTRL
Sbjct: 301 AKKGNRMLDPSVYVKLILELGKNPDKNMLVLTLLDELGQREALKLNQQDTTAIMKVCTRL 360
Query: 361 GKFEIAEKLYSWYVESGHEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDLPAY 420
GKFEIAE+LY WYVES HEPSVVMYTAL+HSRYS+KKYREALS+VWEMEAANCPFDLPAY
Sbjct: 361 GKFEIAERLYGWYVESVHEPSVVMYTALIHSRYSEKKYREALSVVWEMEAANCPFDLPAY 420
Query: 421 CVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAEN 480
VVIKLFVALGDLSRA RYFAKLKEAGFAPTYD+YRNLITIYLVSGRLAKCKEIYKEA+N
Sbjct: 421 NVVIKLFVALGDLSRAARYFAKLKEAGFAPTYDIYRNLITIYLVSGRLAKCKEIYKEAKN 480
Query: 481 AGFVMDKQITSMLLQ 491
AGF++DKQITS LLQ
Sbjct: 481 AGFIIDKQITSRLLQ 495
BLAST of HG10018167 vs. TAIR 10
Match:
AT2G01860.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 502.3 bits (1292), Expect = 4.5e-142
Identity = 271/478 (56.69%), Postives = 349/478 (73.01%), Query Frame = 0
Query: 19 GGIGCQTMMAHFKTYSRRRPPKNLLCPRRNKLPPDPAVNQFLNNKTSAPSPFTDLISSET 78
G IG + A + +S++ KNL PRR KLPPD VN FL P +I +
Sbjct: 23 GNIGVTRVNASQRNHSKKL-TKNLRNPRRTKLPPDFGVNLFLRKPKIEPL----VIDDDD 82
Query: 79 FQLPEGEDDEHEEIHAYDCRDNGVVWDSEEIEAISSLFQGRIPQKPGKLNRDRPLPLPLP 138
Q+ E +D+ D+ VVW+ EEIEAISSLFQ RIPQKP K +R RPLPLP P
Sbjct: 83 EQVQESVNDD----------DDAVVWEPEEIEAISSLFQKRIPQKPDKPSRVRPLPLPQP 142
Query: 139 HKLRPPGLPNPKIRPRVVVSSRAL--LSKQVYKCPDFLIGLAREIRYL-SPEENVSKVLN 198
HKLRP GLP PK + ++ S AL +SKQVYK P FLIGLAREI+ L S + +VS VLN
Sbjct: 143 HKLRPLGLPTPK---KNIIRSPALSSVSKQVYKDPSFLIGLAREIKSLPSSDADVSLVLN 202
Query: 199 RWGPFLQKGSLSLTIKELGRMGLPDRALKTFCWAQEQPRLFPDDRVLASTVEVLARNHEL 258
+W FL+KGSLS TI+ELG MGLP+RAL+T+ WA++ L PD+R+LAST++VLA++HEL
Sbjct: 203 KWVSFLRKGSLSTTIRELGHMGLPERALQTYHWAEKHSHLVPDNRILASTIQVLAKHHEL 262
Query: 259 KVPLNLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKLLVASKKGKRMLDPSVYVKLILEL 318
K+ L+ LAS+ V+EAM++G I+GG LNLA KL++ SK R+LD SVYVK+ILE+
Sbjct: 263 KL---LKFDNSLASKNVIEAMIKGCIEGGWLNLARKLILISKSNNRILDSSVYVKMILEI 322
Query: 319 GKNPDKNVLVLTLLDELGQREALKLNQQDTTAIIKACTRLGKFEIAEKLYSWYVESGHEP 378
KNPDK LV+ LL+EL +RE LKL+QQD T+I+K C +LG+FE+ E L+ W+ S EP
Sbjct: 323 AKNPDKYHLVVALLEELKKREDLKLSQQDCTSIMKICVKLGEFELVESLFDWFKASNREP 382
Query: 379 SVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDLPAYCVVIKLFVALGDLSRAVRYF 438
SVVMYT ++HSRYS++KYREA+S+VWEME +NC DLPAY VVIKLFVAL DL RA+RY+
Sbjct: 383 SVVMYTTMIHSRYSEQKYREAMSVVWEMEESNCLLDLPAYRVVIKLFVALDDLGRAMRYY 442
Query: 439 AKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAGFVMDKQITSMLLQAKR 494
+KLKEAGF+PTYD+YR++I++Y SGRL KCKEI KE E+AG +DK + LLQ ++
Sbjct: 443 SKLKEAGFSPTYDIYRDMISVYTASGRLTKCKEICKEVEDAGLRLDKDTSFRLLQLEK 479
BLAST of HG10018167 vs. TAIR 10
Match:
AT1G74850.1 (plastid transcriptionally active 2 )
HSP 1 Score: 75.5 bits (184), Expect = 1.4e-13
Identity = 68/322 (21.12%), Postives = 145/322 (45.03%), Query Frame = 0
Query: 177 LAREIRYLSPEENVSKVLNRWGPFLQKGSLSLTIKELGRMGLPDRALKTFCWAQEQPRLF 236
L ++ L P ++++ L+ + L +L KE G R+L+ F + Q Q
Sbjct: 79 LINKLSSLPPRGSIARCLDIFKNKLSLNDFALVFKEFAGRGDWQRSLRLFKYMQRQIWCK 138
Query: 237 PDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGV------LEAMVRGFIKGGSLNLAW 296
P++ + + +L R E + LE F ++ S+GV A++ + + G +
Sbjct: 139 PNEHIYTIMISLLGR--EGLLDKCLEVFDEMPSQGVSRSVFSYTALINAYGRNGRYETSL 198
Query: 297 KLLVASKKGKRMLDPSV--YVKLILELGKNPDKNVLVLTLLDELGQREALKLNQQDTTAI 356
+LL K K + PS+ Y +I + +L L E+ + E ++ + +
Sbjct: 199 ELLDRMKNEK--ISPSILTYNTVINACARGGLDWEGLLGLFAEM-RHEGIQPDIVTYNTL 258
Query: 357 IKACTRLGKFEIAEKLYSWYVESGHEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANC 416
+ AC G + AE ++ + G P + Y+ LV + ++ + L+ EM +
Sbjct: 259 LSACAIRGLGDEAEMVFRTMNDGGIVPDLTTYSHLVETFGKLRRLEKVCDLLGEMASGGS 318
Query: 417 PFDLPAYCVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKE 476
D+ +Y V+++ + G + A+ F +++ AG P + Y L+ ++ SGR ++
Sbjct: 319 LPDITSYNVLLEAYAKSGSIKEAMGVFHQMQAAGCTPNANTYSVLLNLFGQSGRYDDVRQ 378
Query: 477 IYKEAENAGFVMDKQITSMLLQ 491
++ E +++ D ++L++
Sbjct: 379 LFLEMKSSNTDPDAATYNILIE 395
BLAST of HG10018167 vs. TAIR 10
Match:
AT2G18940.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 72.0 bits (175), Expect = 1.5e-12
Identity = 68/291 (23.37%), Postives = 126/291 (43.30%), Query Frame = 0
Query: 201 LQKGSLSLTIKELGRMGLPDRALKTFCW---AQEQPRLFPDDRVLASTVEVLARNHELKV 260
L + L +K L G +RA+ F W + L D +V+ V +L R + V
Sbjct: 134 LLRTDLVSLVKGLDDSGHWERAVFLFEWLVLSSNSGALKLDHQVIEIFVRILGRESQYSV 193
Query: 261 ------PLNLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKLLVASKK---GKRMLDPSVY 320
+ L+E+ L ++ + + G A L K+ ++ +V
Sbjct: 194 AAKLLDKIPLQEY--LLDVRAYTTILHAYSRTGKYEKAIDLFERMKEMGPSPTLVTYNVI 253
Query: 321 VKLILELGKNPDKNVLVLTLLDELGQREALKLNQQDTTAIIKACTRLGKFEIAEKLYSWY 380
+ + ++G++ K +L +LDE+ + + LK ++ + ++ AC R G A++ ++
Sbjct: 254 LDVFGKMGRSWRK---ILGVLDEM-RSKGLKFDEFTCSTVLSACAREGLLREAKEFFAEL 313
Query: 381 VESGHEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDLPAYCVVIKLFVALGDL 440
G+EP V Y AL+ Y EALS++ EME +CP D Y ++ +V G
Sbjct: 314 KSCGYEPGTVTYNALLQVFGKAGVYTEALSVLKEMEENSCPADSVTYNELVAAYVRAGFS 373
Query: 441 SRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAGFV 480
A + + G P Y +I Y +G+ + +++ + AG V
Sbjct: 374 KEAAGVIEMMTKKGVMPNAITYTTVIDAYGKAGKEDEALKLFYSMKEAGCV 418
BLAST of HG10018167 vs. TAIR 10
Match:
AT2G01740.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 67.4 bits (163), Expect = 3.7e-11
Identity = 53/210 (25.24%), Postives = 91/210 (43.33%), Query Frame = 0
Query: 280 FIKGGSLNLAWKLLVASKKGKRMLDPSVYVKLILELGKNPDKNVLVLTLLDELGQREALK 339
F K G L LA K + K+ + + LI K D V V +L E+ +R +
Sbjct: 173 FCKSGELQLALKSFHSMKRDALSPNVVTFTCLIDGYCKAGDLEVAV-SLYKEM-RRVRMS 232
Query: 340 LNQQDTTAIIKACTRLGKFEIAEKLYSWYVESGHEPSVVMYTALVHSRYSDKKYREALSL 399
LN TA+I + G+ + AE++YS VE EP+ ++YT ++ + A+
Sbjct: 233 LNVVTYTALIDGFCKKGEMQRAEEMYSRMVEDRVEPNSLVYTTIIDGFFQRGDSDNAMKF 292
Query: 400 VWEMEAANCPFDLPAYCVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLV 459
+ +M D+ AY V+I G L A ++++ P ++ ++ Y
Sbjct: 293 LAKMLNQGMRLDITAYGVIISGLCGNGKLKEATEIVEDMEKSDLVPDMVIFTTMMNAYFK 352
Query: 460 SGRLAKCKEIYKEAENAGFVMDKQITSMLL 490
SGR+ +Y + GF D S ++
Sbjct: 353 SGRMKAAVNMYHKLIERGFEPDVVALSTMI 380
BLAST of HG10018167 vs. TAIR 10
Match:
AT5G13770.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )
HSP 1 Score: 67.4 bits (163), Expect = 3.7e-11
Identity = 58/237 (24.47%), Postives = 112/237 (47.26%), Query Frame = 0
Query: 261 LEEFTKLASRGVLEA------MVRGFIKGGSLNLAWKLLVASKKGKRMLDPSVYVKLILE 320
LE ++ +G+ E+ ++R F + + + KL + K + DP + +K++L
Sbjct: 268 LEVLEEMKDKGIPESSELYSMLIRAFAEAREVVITEKLFKEAGGKKLLKDPEMCLKVVLM 327
Query: 321 LGK--NPDKNVLVLTLLDELGQREALKLNQQDTTAIIKACTRLGKFEIAEKLYSWYVESG 380
+ N + + V+ + ++ LK+ AI+ ++ F A K+Y W ++
Sbjct: 328 YVREGNMETTLEVVAAM----RKAELKVTDCILCAIVNGFSKQRGFAEAVKVYEWAMKEE 387
Query: 381 HEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDLPAYCVVIKLFVALGDLSRAV 440
E V Y +++ +KY +A L EM + AY ++ ++ LS AV
Sbjct: 388 CEAGQVTYAIAINAYCRLEKYNKAEMLFDEMVKKGFDKCVVAYSNIMDMYGKTRRLSDAV 447
Query: 441 RYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAGFVMDK-QITSML 489
R AK+K+ G P +Y +LI ++ + L + ++I+KE + A + DK TSM+
Sbjct: 448 RLMAKMKQRGCKPNIWIYNSLIDMHGRAMDLRRAEKIWKEMKRAKVLPDKVSYTSMI 500
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038893977.1 | 3.3e-256 | 91.90 | pentatricopeptide repeat-containing protein At2g01860 [Benincasa hispida] >XP_03... | [more] |
XP_004139567.1 | 2.4e-246 | 88.69 | pentatricopeptide repeat-containing protein At2g01860 [Cucumis sativus] >XP_0116... | [more] |
XP_008462173.1 | 1.2e-245 | 89.09 | PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo] ... | [more] |
KAG7020726.1 | 5.2e-233 | 83.03 | Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... | [more] |
XP_022951807.1 | 3.3e-232 | 83.03 | pentatricopeptide repeat-containing protein At2g01860 isoform X1 [Cucurbita mosc... | [more] |
Match Name | E-value | Identity | Description | |
Q5XET4 | 6.3e-141 | 56.69 | Pentatricopeptide repeat-containing protein At2g01860 OS=Arabidopsis thaliana OX... | [more] |
Q9S7Q2 | 1.9e-12 | 21.12 | Pentatricopeptide repeat-containing protein At1g74850, chloroplastic OS=Arabidop... | [more] |
O64624 | 2.1e-11 | 23.37 | Pentatricopeptide repeat-containing protein At2g18940, chloroplastic OS=Arabidop... | [more] |
Q9ZUA2 | 5.2e-10 | 25.24 | Pentatricopeptide repeat-containing protein At2g01740 OS=Arabidopsis thaliana OX... | [more] |
Q66GP4 | 5.2e-10 | 24.47 | Pentatricopeptide repeat-containing protein At5g13770, chloroplastic OS=Arabidop... | [more] |
Match Name | E-value | Identity | Description | |
A0A0A0LVM0 | 1.2e-246 | 88.69 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G144300 PE=4 SV=1 | [more] |
A0A1S3CGD0 | 5.7e-246 | 89.09 | pentatricopeptide repeat-containing protein At2g01860 OS=Cucumis melo OX=3656 GN... | [more] |
A0A6J1GIP2 | 1.6e-232 | 83.03 | pentatricopeptide repeat-containing protein At2g01860 isoform X1 OS=Cucurbita mo... | [more] |
A0A6J1KK31 | 4.0e-231 | 82.63 | pentatricopeptide repeat-containing protein At2g01860 isoform X1 OS=Cucurbita ma... | [more] |
A0A6J1DI37 | 9.8e-230 | 82.63 | pentatricopeptide repeat-containing protein At2g01860 isoform X1 OS=Momordica ch... | [more] |
Match Name | E-value | Identity | Description | |
AT2G01860.1 | 4.5e-142 | 56.69 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |
AT1G74850.1 | 1.4e-13 | 21.12 | plastid transcriptionally active 2 | [more] |
AT2G18940.1 | 1.5e-12 | 23.37 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |
AT2G01740.1 | 3.7e-11 | 25.24 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |
AT5G13770.1 | 3.7e-11 | 24.47 | Pentatricopeptide repeat (PPR-like) superfamily protein | [more] |