HG10018167 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10018167
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pentatricopeptide repeat-containing protein
Location: Chr04: 1210241 .. 1211722 (-)
RNA-Seq Expression: HG10018167
Synteny: HG10018167
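
The location field in the overview can be split into its parts and used to derive the length of the feature. The sketch below is only an illustration: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual GFF3-style convention), which this page does not state.

```python
import re

def parse_location(text):
    """Split a 'Chr: start .. end (strand)' string into its fields.

    Assumes 1-based, inclusive coordinates (an assumption, not stated on the page).
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,  # inclusive span
    }

loc = parse_location("Chr04: 1210241 .. 1211722 (-)")
print(loc["chrom"], loc["strand"], loc["length"])  # Chr04 - 1482
```

Under that assumption the feature spans 1482 bp on the minus strand of Chr04.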
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGACTCCATTTTCTCAGCAACTTTAGTCTCTTCCATTCTTGTGAAAGGAAATGGAGGAATTGGCTGCCAGACTATGATGGCTCATTTCAAGACCTACTCTAGAAGACGCCCACCCAAAAACCTCCTCTGTCCACGACGGAACAAGCTTCCTCCTGACCCCGCCGTCAACCAATTCTTGAACAATAAAACCTCTGCCCCTTCCCCATTTACCGATTTGATTTCCTCGGAGACTTTCCAACTCCCCGAAGGTGAAGACGATGAGCATGAAGAAATCCACGCTTATGACTGTAGGGATAATGGTGTTGTTTGGGATTCAGAAGAAATTGAAGCTATTTCATCACTCTTCCAAGGGAGAATTCCTCAGAAACCTGGTAAATTGAATCGGGACAGACCTCTTCCTCTCCCACTTCCTCACAAGCTACGACCACCAGGACTTCCTAACCCTAAAATTCGCCCAAGAGTAGTGGTTTCTTCGCGTGCTTTGCTGTCTAAGCAAGTCTACAAGTGTCCTGATTTTCTTATTGGCCTTGCCAGGGAGATTAGATATCTGTCCCCGGAGGAAAATGTGTCCAAGGTTCTCAATCGGTGGGGTCCTTTTTTGCAGAAGGGCTCTCTGTCATTGACGATCAAGGAACTGGGTCGTATGGGTCTTCCTGATAGAGCTCTAAAGACATTCTGTTGGGCACAGGAACAACCTCGACTCTTTCCAGATGATCGTGTTTTGGCCTCAACCGTTGAGGTCCTTGCAAGGAACCATGAACTGAAGGTACCTCTAAACTTGGAAGAGTTCACTAAACTTGCTAGTCGTGGTGTGCTCGAGGCAATGGTGAGAGGGTTTATCAAAGGTGGGAGCTTAAATCTTGCTTGGAAGCTTCTTGTAGCTTCGAAGAAGGGCAAGAGAATGTTGGATCCCAGCGTTTATGTGAAGTTGATATTGGAGCTTGGGAAAAACCCTGATAAAAACGTATTGGTTCTTACCTTACTGGATGAGCTAGGACAAAGAGAAGCCTTGAAGTTAAACCAGCAAGATACTACAGCTATAATTAAGGCCTGCACAAGGCTTGGTAAATTTGAAATTGCTGAGAAACTTTATAGCTGGTATGTTGAATCTGGACATGAACCTAGTGTGGTTATGTATACTGCCTTAGTTCATAGTCGCTACTCAGACAAGAAATACAGGGAGGCATTATCTTTAGTGTGGGAAATGGAGGCTGCAAACTGTCCTTTTGATCTTCCTGCTTATTGTGTAGTGATAAAGCTTTTTGTTGCTCTTGGTGATCTTTCAAGGGCTGTTAGATACTTTGCAAAGCTTAAGGAAGCTGGTTTTGCCCCAACGTATGATGTATATAGGAATCTGATTACCATATATTTAGTTTCAGGGAGGTTAGCCAAGTGTAAGGAAATTTACAAGGAAGCAGAGAATGCTGGATTTGTCATGGATAAACAAATTACTTCAATGCTGTTGCAAGCAAAAAGATGA

mRNA sequence

ATGGACTCCATTTTCTCAGCAACTTTAGTCTCTTCCATTCTTGTGAAAGGAAATGGAGGAATTGGCTGCCAGACTATGATGGCTCATTTCAAGACCTACTCTAGAAGACGCCCACCCAAAAACCTCCTCTGTCCACGACGGAACAAGCTTCCTCCTGACCCCGCCGTCAACCAATTCTTGAACAATAAAACCTCTGCCCCTTCCCCATTTACCGATTTGATTTCCTCGGAGACTTTCCAACTCCCCGAAGGTGAAGACGATGAGCATGAAGAAATCCACGCTTATGACTGTAGGGATAATGGTGTTGTTTGGGATTCAGAAGAAATTGAAGCTATTTCATCACTCTTCCAAGGGAGAATTCCTCAGAAACCTGGTAAATTGAATCGGGACAGACCTCTTCCTCTCCCACTTCCTCACAAGCTACGACCACCAGGACTTCCTAACCCTAAAATTCGCCCAAGAGTAGTGGTTTCTTCGCGTGCTTTGCTGTCTAAGCAAGTCTACAAGTGTCCTGATTTTCTTATTGGCCTTGCCAGGGAGATTAGATATCTGTCCCCGGAGGAAAATGTGTCCAAGGTTCTCAATCGGTGGGGTCCTTTTTTGCAGAAGGGCTCTCTGTCATTGACGATCAAGGAACTGGGTCGTATGGGTCTTCCTGATAGAGCTCTAAAGACATTCTGTTGGGCACAGGAACAACCTCGACTCTTTCCAGATGATCGTGTTTTGGCCTCAACCGTTGAGGTCCTTGCAAGGAACCATGAACTGAAGGTACCTCTAAACTTGGAAGAGTTCACTAAACTTGCTAGTCGTGGTGTGCTCGAGGCAATGGTGAGAGGGTTTATCAAAGGTGGGAGCTTAAATCTTGCTTGGAAGCTTCTTGTAGCTTCGAAGAAGGGCAAGAGAATGTTGGATCCCAGCGTTTATGTGAAGTTGATATTGGAGCTTGGGAAAAACCCTGATAAAAACGTATTGGTTCTTACCTTACTGGATGAGCTAGGACAAAGAGAAGCCTTGAAGTTAAACCAGCAAGATACTACAGCTATAATTAAGGCCTGCACAAGGCTTGGTAAATTTGAAATTGCTGAGAAACTTTATAGCTGGTATGTTGAATCTGGACATGAACCTAGTGTGGTTATGTATACTGCCTTAGTTCATAGTCGCTACTCAGACAAGAAATACAGGGAGGCATTATCTTTAGTGTGGGAAATGGAGGCTGCAAACTGTCCTTTTGATCTTCCTGCTTATTGTGTAGTGATAAAGCTTTTTGTTGCTCTTGGTGATCTTTCAAGGGCTGTTAGATACTTTGCAAAGCTTAAGGAAGCTGGTTTTGCCCCAACGTATGATGTATATAGGAATCTGATTACCATATATTTAGTTTCAGGGAGGTTAGCCAAGTGTAAGGAAATTTACAAGGAAGCAGAGAATGCTGGATTTGTCATGGATAAACAAATTACTTCAATGCTGTTGCAAGCAAAAAGATGA

Coding sequence (CDS)

ATGGACTCCATTTTCTCAGCAACTTTAGTCTCTTCCATTCTTGTGAAAGGAAATGGAGGAATTGGCTGCCAGACTATGATGGCTCATTTCAAGACCTACTCTAGAAGACGCCCACCCAAAAACCTCCTCTGTCCACGACGGAACAAGCTTCCTCCTGACCCCGCCGTCAACCAATTCTTGAACAATAAAACCTCTGCCCCTTCCCCATTTACCGATTTGATTTCCTCGGAGACTTTCCAACTCCCCGAAGGTGAAGACGATGAGCATGAAGAAATCCACGCTTATGACTGTAGGGATAATGGTGTTGTTTGGGATTCAGAAGAAATTGAAGCTATTTCATCACTCTTCCAAGGGAGAATTCCTCAGAAACCTGGTAAATTGAATCGGGACAGACCTCTTCCTCTCCCACTTCCTCACAAGCTACGACCACCAGGACTTCCTAACCCTAAAATTCGCCCAAGAGTAGTGGTTTCTTCGCGTGCTTTGCTGTCTAAGCAAGTCTACAAGTGTCCTGATTTTCTTATTGGCCTTGCCAGGGAGATTAGATATCTGTCCCCGGAGGAAAATGTGTCCAAGGTTCTCAATCGGTGGGGTCCTTTTTTGCAGAAGGGCTCTCTGTCATTGACGATCAAGGAACTGGGTCGTATGGGTCTTCCTGATAGAGCTCTAAAGACATTCTGTTGGGCACAGGAACAACCTCGACTCTTTCCAGATGATCGTGTTTTGGCCTCAACCGTTGAGGTCCTTGCAAGGAACCATGAACTGAAGGTACCTCTAAACTTGGAAGAGTTCACTAAACTTGCTAGTCGTGGTGTGCTCGAGGCAATGGTGAGAGGGTTTATCAAAGGTGGGAGCTTAAATCTTGCTTGGAAGCTTCTTGTAGCTTCGAAGAAGGGCAAGAGAATGTTGGATCCCAGCGTTTATGTGAAGTTGATATTGGAGCTTGGGAAAAACCCTGATAAAAACGTATTGGTTCTTACCTTACTGGATGAGCTAGGACAAAGAGAAGCCTTGAAGTTAAACCAGCAAGATACTACAGCTATAATTAAGGCCTGCACAAGGCTTGGTAAATTTGAAATTGCTGAGAAACTTTATAGCTGGTATGTTGAATCTGGACATGAACCTAGTGTGGTTATGTATACTGCCTTAGTTCATAGTCGCTACTCAGACAAGAAATACAGGGAGGCATTATCTTTAGTGTGGGAAATGGAGGCTGCAAACTGTCCTTTTGATCTTCCTGCTTATTGTGTAGTGATAAAGCTTTTTGTTGCTCTTGGTGATCTTTCAAGGGCTGTTAGATACTTTGCAAAGCTTAAGGAAGCTGGTTTTGCCCCAACGTATGATGTATATAGGAATCTGATTACCATATATTTAGTTTCAGGGAGGTTAGCCAAGTGTAAGGAAATTTACAAGGAAGCAGAGAATGCTGGATTTGTCATGGATAAACAAATTACTTCAATGCTGTTGCAAGCAAAAAGATGA

Protein sequence

MDSIFSATLVSSILVKGNGGIGCQTMMAHFKTYSRRRPPKNLLCPRRNKLPPDPAVNQFLNNKTSAPSPFTDLISSETFQLPEGEDDEHEEIHAYDCRDNGVVWDSEEIEAISSLFQGRIPQKPGKLNRDRPLPLPLPHKLRPPGLPNPKIRPRVVVSSRALLSKQVYKCPDFLIGLAREIRYLSPEENVSKVLNRWGPFLQKGSLSLTIKELGRMGLPDRALKTFCWAQEQPRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKLLVASKKGKRMLDPSVYVKLILELGKNPDKNVLVLTLLDELGQREALKLNQQDTTAIIKACTRLGKFEIAEKLYSWYVESGHEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDLPAYCVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAGFVMDKQITSMLLQAKR
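
The relationship between the coding sequence and the protein sequence can be checked by translating codons with the standard genetic code. The following is a minimal sketch, not the database's own pipeline: the function name `translate` is hypothetical, and it assumes the CDS is already given in the reading orientation of the gene (it starts with ATG even though the gene lies on the minus strand). Only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

cds_start = "ATGGACTCCATTTTCTCA"  # first six codons of the CDS above
cds_end = "GCAAAAAGATGA"          # last four codons of the CDS above

print(translate(cds_start))  # MDSIFS
print(translate(cds_end))    # AKR*
```

The translated fragments match the start (MDSIFS) and the end (AKR, followed by a stop codon) of the protein sequence listed above.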
Homology
BLAST of HG10018167 vs. NCBI nr
Match: XP_038893977.1 (pentatricopeptide repeat-containing protein At2g01860 [Benincasa hispida] >XP_038893978.1 pentatricopeptide repeat-containing protein At2g01860 [Benincasa hispida])

HSP 1 Score: 894.8 bits (2311), Expect = 3.3e-256
Identity = 454/494 (91.90%), Positives = 468/494 (94.74%), Query Frame = 0

Query: 1   MDSIFSATLVSSILVKGNGGIGCQTMMAHFKTYSRRRPPKNLLCPRRNKLPPDPAVNQFL 60
           MDSIFSAT VSSILVKGNGGIGCQ  MAHFKT SRRR PKNLLCPRR KLPPDPAVNQFL
Sbjct: 1   MDSIFSATSVSSILVKGNGGIGCQATMAHFKTNSRRRLPKNLLCPRRAKLPPDPAVNQFL 60

Query: 61  NNKTSAPSP-FTDLISSETFQLPEGEDDEHEEIHAYDCRDNGVVWDSEEIEAISSLFQGR 120
            NKTSAPSP  TDLISSE FQLP+GEDDEHEEIHAYD +D  VVWDS+EIEAISSLFQGR
Sbjct: 61  KNKTSAPSPSLTDLISSEIFQLPKGEDDEHEEIHAYDYKDTDVVWDSDEIEAISSLFQGR 120

Query: 121 IPQKPGKLNRDRPLPLPLPHKLRPPGLPNPKIRPRVVVSSRALLSKQVYKCPDFLIGLAR 180
           IPQKPGKLNRDRPLPLPLPHKLRP GLP+PKIRPR++VSSRALLSKQVYK PDFLIGLAR
Sbjct: 121 IPQKPGKLNRDRPLPLPLPHKLRPSGLPDPKIRPRIMVSSRALLSKQVYKRPDFLIGLAR 180

Query: 181 EIRYLSPEENVSKVLNRWGPFLQKGSLSLTIKELGRMGLPDRALKTFCWAQEQPRLFPDD 240
            IR LSPEENVSKVLNRWGPFLQKGSLSLTIKELG MGLPDRALKTF WAQEQPRLFPDD
Sbjct: 181 AIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALKTFSWAQEQPRLFPDD 240

Query: 241 RVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKLLVASKKG 300
           RVLASTVEVLARNHELKVPL+LEEFTKLASRGVLEAMVRGFIKGGSLNLAWKLLVA+KK 
Sbjct: 241 RVLASTVEVLARNHELKVPLDLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKLLVAAKKR 300

Query: 301 KRMLDPSVYVKLILELGKNPDKNVLVLTLLDELGQREALKLNQQDTTAIIKACTRLGKFE 360
           KR+LDPSVYVKLILELGKNPDKNVLVLTLLDELGQREAL LNQQDTT IIK CTRLGKFE
Sbjct: 301 KRLLDPSVYVKLILELGKNPDKNVLVLTLLDELGQREALTLNQQDTTTIIKVCTRLGKFE 360

Query: 361 IAEKLYSWYVESGHEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDLPAYCVVI 420
           IAEKLYSWYVESGHEPSVVMYTALVHSRYSD+KYREALSLVWEMEAANCPFDLPAY V+I
Sbjct: 361 IAEKLYSWYVESGHEPSVVMYTALVHSRYSDRKYREALSLVWEMEAANCPFDLPAYSVMI 420

Query: 421 KLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAGFV 480
           KLFV LGDLSRAVRYFAKLKEAGFAPTYDVYR +ITIYLVSGRLAKCKEIYKEAENAGF+
Sbjct: 421 KLFVTLGDLSRAVRYFAKLKEAGFAPTYDVYRKMITIYLVSGRLAKCKEIYKEAENAGFI 480

Query: 481 MDKQITSMLLQAKR 494
           MDKQITSMLLQ+KR
Sbjct: 481 MDKQITSMLLQSKR 494
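
The percentages in each HSP header are simple ratios of matching columns to alignment length. As a small sketch (the helper name `percent` is hypothetical), they can be recomputed from the reported counts, here using the figures of the Benincasa hispida hit above and of the Arabidopsis Q5XET4 hit further down:

```python
def percent(matches, aligned_length):
    """Return matches / aligned_length as a percentage string with two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned length must be positive")
    return f"{100.0 * matches / aligned_length:.2f}"

# Counts copied from the HSP headers on this page.
identity_bh = percent(454, 494)   # identical residues, XP_038893977.1
positives_bh = percent(468, 494)  # identical or conservative residues, XP_038893977.1
identity_at = percent(271, 478)   # identical residues, Q5XET4

print(identity_bh, positives_bh, identity_at)  # 91.90 94.74 56.69
```

The denominator is the alignment length including gap columns, which is why it can exceed the query length for hits whose alignment contains gaps.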

BLAST of HG10018167 vs. NCBI nr
Match: XP_004139567.1 (pentatricopeptide repeat-containing protein At2g01860 [Cucumis sativus] >XP_011654198.1 pentatricopeptide repeat-containing protein At2g01860 [Cucumis sativus] >XP_031739920.1 pentatricopeptide repeat-containing protein At2g01860 [Cucumis sativus] >XP_031739926.1 pentatricopeptide repeat-containing protein At2g01860 [Cucumis sativus] >KGN64877.1 hypothetical protein Csa_022712 [Cucumis sativus])

HSP 1 Score: 862.1 bits (2226), Expect = 2.4e-246
Identity = 439/495 (88.69%), Positives = 460/495 (92.93%), Query Frame = 0

Query: 1   MDSIFSATLVSSILVKGNGGIGCQTMMAHFKTYSRRRPPKNLLCPRRNKLPPDPAVNQFL 60
           MDSI SAT VSSILVKGNGGIGCQ  M HFK  SRRRPPKNLLCPRR KLPP+PAVNQF 
Sbjct: 1   MDSIVSATSVSSILVKGNGGIGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPNPAVNQFF 60

Query: 61  NNKTSAPS-PFTDLISSETFQLPEGEDDEHEEIHAYD-CRDNGVVWDSEEIEAISSLFQG 120
           NNKTSAPS PFTDLISS+ FQ      DEHEEIHA+D  +D  VVWDS+EIEAISSLFQG
Sbjct: 61  NNKTSAPSPPFTDLISSKIFQ------DEHEEIHAHDYTKDTDVVWDSDEIEAISSLFQG 120

Query: 121 RIPQKPGKLNRDRPLPLPLPHKLRPPGLPNPKIRPRVVVSSRALLSKQVYKCPDFLIGLA 180
           RIPQKPGKLNR+RPLPLPLPHKLRPP LPNPKIRP  VVSSRALLSKQVYK PDFLIGLA
Sbjct: 121 RIPQKPGKLNRERPLPLPLPHKLRPPRLPNPKIRPTTVVSSRALLSKQVYKRPDFLIGLA 180

Query: 181 REIRYLSPEENVSKVLNRWGPFLQKGSLSLTIKELGRMGLPDRALKTFCWAQEQPRLFPD 240
           REIR LSPEENVSKVLNRWGPFLQKGSLSLTIKELG MGLPDRAL TFCWAQEQ RLFPD
Sbjct: 181 REIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALNTFCWAQEQHRLFPD 240

Query: 241 DRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKLLVASKK 300
           DRVLASTVEVL+RNHELKV +NLEEFTKLASRGVLEAM+RGFI+GGSLNLAWKLLVA+KK
Sbjct: 241 DRVLASTVEVLSRNHELKVAVNLEEFTKLASRGVLEAMMRGFIRGGSLNLAWKLLVAAKK 300

Query: 301 GKRMLDPSVYVKLILELGKNPDKNVLVLTLLDELGQREALKLNQQDTTAIIKACTRLGKF 360
           GKRMLDPSVYVKLILELGKNPDKN+LVLTLL+ELGQREALKLNQQD T I+K CTRLGKF
Sbjct: 301 GKRMLDPSVYVKLILELGKNPDKNMLVLTLLEELGQREALKLNQQDCTTIVKVCTRLGKF 360

Query: 361 EIAEKLYSWYVESGHEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDLPAYCVV 420
           EIAEKLYSWYVESGHEPS+VMYTALVHSRYSD+KYREALSLVWEME+ NCPFDLPAY VV
Sbjct: 361 EIAEKLYSWYVESGHEPSIVMYTALVHSRYSDRKYREALSLVWEMESGNCPFDLPAYSVV 420

Query: 421 IKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAGF 480
           IKLFVALGDLSRAVRYFAKLKEAGF+PTY+VYRN+ITIYLVSGRLAKCKEIYKEAENAGF
Sbjct: 421 IKLFVALGDLSRAVRYFAKLKEAGFSPTYNVYRNMITIYLVSGRLAKCKEIYKEAENAGF 480

Query: 481 VMDKQITSMLLQAKR 494
           +MDKQITSMLLQAKR
Sbjct: 481 MMDKQITSMLLQAKR 489

BLAST of HG10018167 vs. NCBI nr
Match: XP_008462173.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo] >XP_008462181.1 PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo] >XP_008462189.1 PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo] >XP_016902994.1 PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo] >XP_016902996.1 PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo])

HSP 1 Score: 859.8 bits (2220), Expect = 1.2e-245
Identity = 441/495 (89.09%), Positives = 458/495 (92.53%), Query Frame = 0

Query: 1   MDSIFSATLVSSILVKGNGGIGCQTMMAHFKTYSRRRPPKNLLCPRRNKLPPDPAVNQFL 60
           M SI SAT VSSILVKGNGGIGCQ  M HFK  SRRRPPKNLLCPRR KLPPDPAVNQFL
Sbjct: 1   MHSIVSATSVSSILVKGNGGIGCQITMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFL 60

Query: 61  NNKTSAPSP-FTDLISSETFQLPEGEDDEHEEIHAYD-CRDNGVVWDSEEIEAISSLFQG 120
           NNKTSAPSP FTDLISS+ FQ      DEHEEIHAYD  +D  VVWDS+EIEAISSLFQG
Sbjct: 61  NNKTSAPSPSFTDLISSKIFQ------DEHEEIHAYDYTKDTDVVWDSDEIEAISSLFQG 120

Query: 121 RIPQKPGKLNRDRPLPLPLPHKLRPPGLPNPKIRPRVVVSSRALLSKQVYKCPDFLIGLA 180
           RIPQKPGKLNR+RPLPLPLPHKLRPP LPNPKIRP   VSSRALLSK+VYK PDFLIGLA
Sbjct: 121 RIPQKPGKLNRERPLPLPLPHKLRPPRLPNPKIRPTTTVSSRALLSKKVYKRPDFLIGLA 180

Query: 181 REIRYLSPEENVSKVLNRWGPFLQKGSLSLTIKELGRMGLPDRALKTFCWAQEQPRLFPD 240
           R IR LSPEENVSKVLNRWGPFLQKGSLSLTIKELG MGLPDRALKTFCW QEQ RLFPD
Sbjct: 181 RAIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALKTFCWVQEQRRLFPD 240

Query: 241 DRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKLLVASKK 300
           DRVLASTVEVL+RNHELKVP+NLEEFTKLASRGVLEAM+RGFIKGGSLNLAWKLLVA+KK
Sbjct: 241 DRVLASTVEVLSRNHELKVPVNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVAAKK 300

Query: 301 GKRMLDPSVYVKLILELGKNPDKNVLVLTLLDELGQREALKLNQQDTTAIIKACTRLGKF 360
           GKRMLDPSVYVKLILELGKNPDKNVLVLTLL+ELGQREALKLNQQD+T IIK CTRL KF
Sbjct: 301 GKRMLDPSVYVKLILELGKNPDKNVLVLTLLEELGQREALKLNQQDSTTIIKVCTRLRKF 360

Query: 361 EIAEKLYSWYVESGHEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDLPAYCVV 420
           EIAEKLY WYVESGHEPS+VMYTALVHSRYSD+KYREALSLVWEME+ANCPFDLPAY VV
Sbjct: 361 EIAEKLYCWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEMESANCPFDLPAYNVV 420

Query: 421 IKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAGF 480
           IKLFVALGDLSRAVRYFAKLKEAGF+PTYDVYRN+ITIYLVSGRLAK KEIYKEAENAGF
Sbjct: 421 IKLFVALGDLSRAVRYFAKLKEAGFSPTYDVYRNMITIYLVSGRLAKSKEIYKEAENAGF 480

Query: 481 VMDKQITSMLLQAKR 494
           +MDKQITSMLLQAKR
Sbjct: 481 IMDKQITSMLLQAKR 489

BLAST of HG10018167 vs. NCBI nr
Match: KAG7020726.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 817.8 bits (2111), Expect = 5.2e-233
Identity = 416/501 (83.03%), Positives = 445/501 (88.82%), Query Frame = 0

Query: 1   MDSIFSATLVSSILVKGNGGIGCQTMMAHFKTYSRRRPPKNLLCPRRNKLPPDPAVNQFL 60
           MDS+FS T +SSILVK NGGI CQ  +AHF+T SRRRPPKNLL PRR KLPPDP VNQFL
Sbjct: 1   MDSLFSTTTISSILVKRNGGISCQIPVAHFQTNSRRRPPKNLLYPRRTKLPPDPGVNQFL 60

Query: 61  NNKTSAPSP---FTDLISSETFQLPEGEDDEHEE-----IHAYDCRDNGVVWDSEEIEAI 120
             +TS P P   F DLISSE   LPE E DE EE       A D  D+ VVWDSEEIEAI
Sbjct: 61  KKRTSGPQPDTSFPDLISSEKIGLPEEELDEIEETAADNYFANDDNDSDVVWDSEEIEAI 120

Query: 121 SSLFQGRIPQKPGKLNRDRPLPLPLPHKLRPPGLPNPKIRPRVVVSSRALLSKQVYKCPD 180
           +SLF+GRIPQKPGKLNR+RPLPLPLPHKLRPPGLPNPKIRPR  VSSRAL+SKQVYK PD
Sbjct: 121 TSLFRGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRPRTAVSSRALMSKQVYKRPD 180

Query: 181 FLIGLAREIRYLSPEENVSKVLNRWGPFLQKGSLSLTIKELGRMGLPDRALKTFCWAQEQ 240
           FLIGLAR IR L PEEN+SKVLNRW PFLQKGSLSLTIKELG MGL DRALKTFCW QEQ
Sbjct: 181 FLIGLARAIRDLKPEENMSKVLNRWAPFLQKGSLSLTIKELGHMGLADRALKTFCWVQEQ 240

Query: 241 PRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKL 300
           PRL+PDDRVLASTVEVLARNHELK+P NL+EFTKLASRGVLEAM+RGFIKGG L+LAWKL
Sbjct: 241 PRLYPDDRVLASTVEVLARNHELKIPFNLDEFTKLASRGVLEAMMRGFIKGGRLSLAWKL 300

Query: 301 LVASKKGKRMLDPSVYVKLILELGKNPDKNVLVLTLLDELGQREALKLNQQDTTAIIKAC 360
           LVA+K GKRMLDPSVYVKLILE+GKNPDKN+LVL LLDELGQREAL LNQQDT+AIIK  
Sbjct: 301 LVAAKNGKRMLDPSVYVKLILEIGKNPDKNMLVLALLDELGQREALNLNQQDTSAIIKVS 360

Query: 361 TRLGKFEIAEKLYSWYVESGHEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDL 420
           TRLGKFEIAE+LYSWYVESGHEPSVVMYTALVH+RYS++KYREALS+VWEMEAANCPFDL
Sbjct: 361 TRLGKFEIAERLYSWYVESGHEPSVVMYTALVHNRYSERKYREALSVVWEMEAANCPFDL 420

Query: 421 PAYCVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKE 480
           PAY VV+KLFVALGDLSRAVRYFAKLKEAGF PTY +YRNLITIYL +GRLAKCKEIYKE
Sbjct: 421 PAYSVVMKLFVALGDLSRAVRYFAKLKEAGFTPTYCIYRNLITIYLAAGRLAKCKEIYKE 480

Query: 481 AENAGFVMDKQITSMLLQAKR 494
           AENAG+VMDKQITSMLLQAKR
Sbjct: 481 AENAGYVMDKQITSMLLQAKR 501

BLAST of HG10018167 vs. NCBI nr
Match: XP_022951807.1 (pentatricopeptide repeat-containing protein At2g01860 isoform X1 [Cucurbita moschata])

HSP 1 Score: 815.1 bits (2104), Expect = 3.3e-232
Identity = 416/501 (83.03%), Positives = 444/501 (88.62%), Query Frame = 0

Query: 1   MDSIFSATLVSSILVKGNGGIGCQTMMAHFKTYSRRRPPKNLLCPRRNKLPPDPAVNQFL 60
           MDS+FS T +SSILVK NGGI CQ  +AHF+T SRRRPPKNLL PRR KLPPDP VNQFL
Sbjct: 1   MDSLFSTTTISSILVKRNGGISCQIPVAHFQTNSRRRPPKNLLYPRRTKLPPDPGVNQFL 60

Query: 61  NNKTSAPSP---FTDLISSETFQLPEGEDDEHEE-----IHAYDCRDNGVVWDSEEIEAI 120
             +TS P P   F DLISSE   LPE E DE EE       A D  D+ VVWDSEEIEAI
Sbjct: 61  KKRTSGPQPDTSFPDLISSEKIGLPEEELDEIEETAADNYFANDDNDSDVVWDSEEIEAI 120

Query: 121 SSLFQGRIPQKPGKLNRDRPLPLPLPHKLRPPGLPNPKIRPRVVVSSRALLSKQVYKCPD 180
           +SLF+GRIPQKPGKLNR+RPLPLPLPHKLRPPGLPNPKIRPR  VSSRAL+SKQVYK PD
Sbjct: 121 TSLFRGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRPRTAVSSRALMSKQVYKRPD 180

Query: 181 FLIGLAREIRYLSPEENVSKVLNRWGPFLQKGSLSLTIKELGRMGLPDRALKTFCWAQEQ 240
           FLIGLAR IR L PEENVSKVLNRW PFLQKGSLSLTIKELG MGL DRALKTFCW QEQ
Sbjct: 181 FLIGLARAIRDLKPEENVSKVLNRWAPFLQKGSLSLTIKELGHMGLADRALKTFCWVQEQ 240

Query: 241 PRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKL 300
           PRL+PDDRVLASTVEVLARNHELK+P NL+EFTKLASRGVLEAM+RGFIKGG L+LAWKL
Sbjct: 241 PRLYPDDRVLASTVEVLARNHELKIPFNLDEFTKLASRGVLEAMMRGFIKGGRLSLAWKL 300

Query: 301 LVASKKGKRMLDPSVYVKLILELGKNPDKNVLVLTLLDELGQREALKLNQQDTTAIIKAC 360
           LVA+K GKRMLDPSVYVKLILE+GKNPDKN+LVL LLDELGQREAL LNQQDT+AIIK  
Sbjct: 301 LVAAKNGKRMLDPSVYVKLILEIGKNPDKNMLVLALLDELGQREALNLNQQDTSAIIKVS 360

Query: 361 TRLGKFEIAEKLYSWYVESGHEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDL 420
           TRLGKFEIAE+LYSWYVESGHEPSVVMYTALVH+RYS++KYREALS+VWEMEAAN PFDL
Sbjct: 361 TRLGKFEIAERLYSWYVESGHEPSVVMYTALVHNRYSERKYREALSVVWEMEAANSPFDL 420

Query: 421 PAYCVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKE 480
           PAY VV+KLFVALGDLSRAVRYFAKLKEAGF PTY +YRNLITIYL +GRLAKCKEIYKE
Sbjct: 421 PAYSVVMKLFVALGDLSRAVRYFAKLKEAGFTPTYCIYRNLITIYLAAGRLAKCKEIYKE 480

Query: 481 AENAGFVMDKQITSMLLQAKR 494
           AENAG+VMDKQITSMLLQAKR
Sbjct: 481 AENAGYVMDKQITSMLLQAKR 501

BLAST of HG10018167 vs. ExPASy Swiss-Prot
Match: Q5XET4 (Pentatricopeptide repeat-containing protein At2g01860 OS=Arabidopsis thaliana OX=3702 GN=EMB975 PE=2 SV=1)

HSP 1 Score: 502.3 bits (1292), Expect = 6.3e-141
Identity = 271/478 (56.69%), Positives = 349/478 (73.01%), Query Frame = 0

Query: 19  GGIGCQTMMAHFKTYSRRRPPKNLLCPRRNKLPPDPAVNQFLNNKTSAPSPFTDLISSET 78
           G IG   + A  + +S++   KNL  PRR KLPPD  VN FL      P     +I  + 
Sbjct: 23  GNIGVTRVNASQRNHSKKL-TKNLRNPRRTKLPPDFGVNLFLRKPKIEPL----VIDDDD 82

Query: 79  FQLPEGEDDEHEEIHAYDCRDNGVVWDSEEIEAISSLFQGRIPQKPGKLNRDRPLPLPLP 138
            Q+ E  +D+          D+ VVW+ EEIEAISSLFQ RIPQKP K +R RPLPLP P
Sbjct: 83  EQVQESVNDD----------DDAVVWEPEEIEAISSLFQKRIPQKPDKPSRVRPLPLPQP 142

Query: 139 HKLRPPGLPNPKIRPRVVVSSRAL--LSKQVYKCPDFLIGLAREIRYL-SPEENVSKVLN 198
           HKLRP GLP PK   + ++ S AL  +SKQVYK P FLIGLAREI+ L S + +VS VLN
Sbjct: 143 HKLRPLGLPTPK---KNIIRSPALSSVSKQVYKDPSFLIGLAREIKSLPSSDADVSLVLN 202

Query: 199 RWGPFLQKGSLSLTIKELGRMGLPDRALKTFCWAQEQPRLFPDDRVLASTVEVLARNHEL 258
           +W  FL+KGSLS TI+ELG MGLP+RAL+T+ WA++   L PD+R+LAST++VLA++HEL
Sbjct: 203 KWVSFLRKGSLSTTIRELGHMGLPERALQTYHWAEKHSHLVPDNRILASTIQVLAKHHEL 262

Query: 259 KVPLNLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKLLVASKKGKRMLDPSVYVKLILEL 318
           K+   L+    LAS+ V+EAM++G I+GG LNLA KL++ SK   R+LD SVYVK+ILE+
Sbjct: 263 KL---LKFDNSLASKNVIEAMIKGCIEGGWLNLARKLILISKSNNRILDSSVYVKMILEI 322

Query: 319 GKNPDKNVLVLTLLDELGQREALKLNQQDTTAIIKACTRLGKFEIAEKLYSWYVESGHEP 378
            KNPDK  LV+ LL+EL +RE LKL+QQD T+I+K C +LG+FE+ E L+ W+  S  EP
Sbjct: 323 AKNPDKYHLVVALLEELKKREDLKLSQQDCTSIMKICVKLGEFELVESLFDWFKASNREP 382

Query: 379 SVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDLPAYCVVIKLFVALGDLSRAVRYF 438
           SVVMYT ++HSRYS++KYREA+S+VWEME +NC  DLPAY VVIKLFVAL DL RA+RY+
Sbjct: 383 SVVMYTTMIHSRYSEQKYREAMSVVWEMEESNCLLDLPAYRVVIKLFVALDDLGRAMRYY 442

Query: 439 AKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAGFVMDKQITSMLLQAKR 494
           +KLKEAGF+PTYD+YR++I++Y  SGRL KCKEI KE E+AG  +DK  +  LLQ ++
Sbjct: 443 SKLKEAGFSPTYDIYRDMISVYTASGRLTKCKEICKEVEDAGLRLDKDTSFRLLQLEK 479

BLAST of HG10018167 vs. ExPASy Swiss-Prot
Match: Q9S7Q2 (Pentatricopeptide repeat-containing protein At1g74850, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PTAC2 PE=2 SV=1)

HSP 1 Score: 75.5 bits (184), Expect = 1.9e-12
Identity = 68/322 (21.12%), Positives = 145/322 (45.03%), Query Frame = 0

Query: 177 LAREIRYLSPEENVSKVLNRWGPFLQKGSLSLTIKELGRMGLPDRALKTFCWAQEQPRLF 236
           L  ++  L P  ++++ L+ +   L     +L  KE    G   R+L+ F + Q Q    
Sbjct: 79  LINKLSSLPPRGSIARCLDIFKNKLSLNDFALVFKEFAGRGDWQRSLRLFKYMQRQIWCK 138

Query: 237 PDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGV------LEAMVRGFIKGGSLNLAW 296
           P++ +    + +L R  E  +   LE F ++ S+GV        A++  + + G    + 
Sbjct: 139 PNEHIYTIMISLLGR--EGLLDKCLEVFDEMPSQGVSRSVFSYTALINAYGRNGRYETSL 198

Query: 297 KLLVASKKGKRMLDPSV--YVKLILELGKNPDKNVLVLTLLDELGQREALKLNQQDTTAI 356
           +LL   K  K  + PS+  Y  +I    +       +L L  E+ + E ++ +      +
Sbjct: 199 ELLDRMKNEK--ISPSILTYNTVINACARGGLDWEGLLGLFAEM-RHEGIQPDIVTYNTL 258

Query: 357 IKACTRLGKFEIAEKLYSWYVESGHEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANC 416
           + AC   G  + AE ++    + G  P +  Y+ LV +    ++  +   L+ EM +   
Sbjct: 259 LSACAIRGLGDEAEMVFRTMNDGGIVPDLTTYSHLVETFGKLRRLEKVCDLLGEMASGGS 318

Query: 417 PFDLPAYCVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKE 476
             D+ +Y V+++ +   G +  A+  F +++ AG  P  + Y  L+ ++  SGR    ++
Sbjct: 319 LPDITSYNVLLEAYAKSGSIKEAMGVFHQMQAAGCTPNANTYSVLLNLFGQSGRYDDVRQ 378

Query: 477 IYKEAENAGFVMDKQITSMLLQ 491
           ++ E +++    D    ++L++
Sbjct: 379 LFLEMKSSNTDPDAATYNILIE 395

BLAST of HG10018167 vs. ExPASy Swiss-Prot
Match: O64624 (Pentatricopeptide repeat-containing protein At2g18940, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At2g18940 PE=2 SV=1)

HSP 1 Score: 72.0 bits (175), Expect = 2.1e-11
Identity = 68/291 (23.37%), Positives = 126/291 (43.30%), Query Frame = 0

Query: 201 LQKGSLSLTIKELGRMGLPDRALKTFCW---AQEQPRLFPDDRVLASTVEVLARNHELKV 260
           L +  L   +K L   G  +RA+  F W   +     L  D +V+   V +L R  +  V
Sbjct: 134 LLRTDLVSLVKGLDDSGHWERAVFLFEWLVLSSNSGALKLDHQVIEIFVRILGRESQYSV 193

Query: 261 ------PLNLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKLLVASKK---GKRMLDPSVY 320
                  + L+E+  L        ++  + + G    A  L    K+      ++  +V 
Sbjct: 194 AAKLLDKIPLQEY--LLDVRAYTTILHAYSRTGKYEKAIDLFERMKEMGPSPTLVTYNVI 253

Query: 321 VKLILELGKNPDKNVLVLTLLDELGQREALKLNQQDTTAIIKACTRLGKFEIAEKLYSWY 380
           + +  ++G++  K   +L +LDE+ + + LK ++   + ++ AC R G    A++ ++  
Sbjct: 254 LDVFGKMGRSWRK---ILGVLDEM-RSKGLKFDEFTCSTVLSACAREGLLREAKEFFAEL 313

Query: 381 VESGHEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDLPAYCVVIKLFVALGDL 440
              G+EP  V Y AL+        Y EALS++ EME  +CP D   Y  ++  +V  G  
Sbjct: 314 KSCGYEPGTVTYNALLQVFGKAGVYTEALSVLKEMEENSCPADSVTYNELVAAYVRAGFS 373

Query: 441 SRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAGFV 480
             A      + + G  P    Y  +I  Y  +G+  +  +++   + AG V
Sbjct: 374 KEAAGVIEMMTKKGVMPNAITYTTVIDAYGKAGKEDEALKLFYSMKEAGCV 418

BLAST of HG10018167 vs. ExPASy Swiss-Prot
Match: Q9ZUA2 (Pentatricopeptide repeat-containing protein At2g01740 OS=Arabidopsis thaliana OX=3702 GN=At2g01740 PE=3 SV=1)

HSP 1 Score: 67.4 bits (163), Expect = 5.2e-10
Identity = 53/210 (25.24%), Positives = 91/210 (43.33%), Query Frame = 0

Query: 280 FIKGGSLNLAWKLLVASKKGKRMLDPSVYVKLILELGKNPDKNVLVLTLLDELGQREALK 339
           F K G L LA K   + K+     +   +  LI    K  D  V V +L  E+ +R  + 
Sbjct: 173 FCKSGELQLALKSFHSMKRDALSPNVVTFTCLIDGYCKAGDLEVAV-SLYKEM-RRVRMS 232

Query: 340 LNQQDTTAIIKACTRLGKFEIAEKLYSWYVESGHEPSVVMYTALVHSRYSDKKYREALSL 399
           LN    TA+I    + G+ + AE++YS  VE   EP+ ++YT ++   +       A+  
Sbjct: 233 LNVVTYTALIDGFCKKGEMQRAEEMYSRMVEDRVEPNSLVYTTIIDGFFQRGDSDNAMKF 292

Query: 400 VWEMEAANCPFDLPAYCVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLV 459
           + +M       D+ AY V+I      G L  A      ++++   P   ++  ++  Y  
Sbjct: 293 LAKMLNQGMRLDITAYGVIISGLCGNGKLKEATEIVEDMEKSDLVPDMVIFTTMMNAYFK 352

Query: 460 SGRLAKCKEIYKEAENAGFVMDKQITSMLL 490
           SGR+     +Y +    GF  D    S ++
Sbjct: 353 SGRMKAAVNMYHKLIERGFEPDVVALSTMI 380

BLAST of HG10018167 vs. ExPASy Swiss-Prot
Match: Q66GP4 (Pentatricopeptide repeat-containing protein At5g13770, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At5g13770 PE=2 SV=1)

HSP 1 Score: 67.4 bits (163), Expect = 5.2e-10
Identity = 58/237 (24.47%), Positives = 112/237 (47.26%), Query Frame = 0

Query: 261 LEEFTKLASRGVLEA------MVRGFIKGGSLNLAWKLLVASKKGKRMLDPSVYVKLILE 320
           LE   ++  +G+ E+      ++R F +   + +  KL   +   K + DP + +K++L 
Sbjct: 268 LEVLEEMKDKGIPESSELYSMLIRAFAEAREVVITEKLFKEAGGKKLLKDPEMCLKVVLM 327

Query: 321 LGK--NPDKNVLVLTLLDELGQREALKLNQQDTTAIIKACTRLGKFEIAEKLYSWYVESG 380
             +  N +  + V+  +    ++  LK+      AI+   ++   F  A K+Y W ++  
Sbjct: 328 YVREGNMETTLEVVAAM----RKAELKVTDCILCAIVNGFSKQRGFAEAVKVYEWAMKEE 387

Query: 381 HEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDLPAYCVVIKLFVALGDLSRAV 440
            E   V Y   +++    +KY +A  L  EM        + AY  ++ ++     LS AV
Sbjct: 388 CEAGQVTYAIAINAYCRLEKYNKAEMLFDEMVKKGFDKCVVAYSNIMDMYGKTRRLSDAV 447

Query: 441 RYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAGFVMDK-QITSML 489
           R  AK+K+ G  P   +Y +LI ++  +  L + ++I+KE + A  + DK   TSM+
Sbjct: 448 RLMAKMKQRGCKPNIWIYNSLIDMHGRAMDLRRAEKIWKEMKRAKVLPDKVSYTSMI 500

BLAST of HG10018167 vs. ExPASy TrEMBL
Match: A0A0A0LVM0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G144300 PE=4 SV=1)

HSP 1 Score: 862.1 bits (2226), Expect = 1.2e-246
Identity = 439/495 (88.69%), Positives = 460/495 (92.93%), Query Frame = 0

Query: 1   MDSIFSATLVSSILVKGNGGIGCQTMMAHFKTYSRRRPPKNLLCPRRNKLPPDPAVNQFL 60
           MDSI SAT VSSILVKGNGGIGCQ  M HFK  SRRRPPKNLLCPRR KLPP+PAVNQF 
Sbjct: 1   MDSIVSATSVSSILVKGNGGIGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPNPAVNQFF 60

Query: 61  NNKTSAPS-PFTDLISSETFQLPEGEDDEHEEIHAYD-CRDNGVVWDSEEIEAISSLFQG 120
           NNKTSAPS PFTDLISS+ FQ      DEHEEIHA+D  +D  VVWDS+EIEAISSLFQG
Sbjct: 61  NNKTSAPSPPFTDLISSKIFQ------DEHEEIHAHDYTKDTDVVWDSDEIEAISSLFQG 120

Query: 121 RIPQKPGKLNRDRPLPLPLPHKLRPPGLPNPKIRPRVVVSSRALLSKQVYKCPDFLIGLA 180
           RIPQKPGKLNR+RPLPLPLPHKLRPP LPNPKIRP  VVSSRALLSKQVYK PDFLIGLA
Sbjct: 121 RIPQKPGKLNRERPLPLPLPHKLRPPRLPNPKIRPTTVVSSRALLSKQVYKRPDFLIGLA 180

Query: 181 REIRYLSPEENVSKVLNRWGPFLQKGSLSLTIKELGRMGLPDRALKTFCWAQEQPRLFPD 240
           REIR LSPEENVSKVLNRWGPFLQKGSLSLTIKELG MGLPDRAL TFCWAQEQ RLFPD
Sbjct: 181 REIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALNTFCWAQEQHRLFPD 240

Query: 241 DRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKLLVASKK 300
           DRVLASTVEVL+RNHELKV +NLEEFTKLASRGVLEAM+RGFI+GGSLNLAWKLLVA+KK
Sbjct: 241 DRVLASTVEVLSRNHELKVAVNLEEFTKLASRGVLEAMMRGFIRGGSLNLAWKLLVAAKK 300

Query: 301 GKRMLDPSVYVKLILELGKNPDKNVLVLTLLDELGQREALKLNQQDTTAIIKACTRLGKF 360
           GKRMLDPSVYVKLILELGKNPDKN+LVLTLL+ELGQREALKLNQQD T I+K CTRLGKF
Sbjct: 301 GKRMLDPSVYVKLILELGKNPDKNMLVLTLLEELGQREALKLNQQDCTTIVKVCTRLGKF 360

Query: 361 EIAEKLYSWYVESGHEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDLPAYCVV 420
           EIAEKLYSWYVESGHEPS+VMYTALVHSRYSD+KYREALSLVWEME+ NCPFDLPAY VV
Sbjct: 361 EIAEKLYSWYVESGHEPSIVMYTALVHSRYSDRKYREALSLVWEMESGNCPFDLPAYSVV 420

Query: 421 IKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAGF 480
           IKLFVALGDLSRAVRYFAKLKEAGF+PTY+VYRN+ITIYLVSGRLAKCKEIYKEAENAGF
Sbjct: 421 IKLFVALGDLSRAVRYFAKLKEAGFSPTYNVYRNMITIYLVSGRLAKCKEIYKEAENAGF 480

Query: 481 VMDKQITSMLLQAKR 494
           +MDKQITSMLLQAKR
Sbjct: 481 MMDKQITSMLLQAKR 489

BLAST of HG10018167 vs. ExPASy TrEMBL
Match: A0A1S3CGD0 (pentatricopeptide repeat-containing protein At2g01860 OS=Cucumis melo OX=3656 GN=LOC103500594 PE=4 SV=1)

HSP 1 Score: 859.8 bits (2220), Expect = 5.7e-246
Identity = 441/495 (89.09%), Positives = 458/495 (92.53%), Query Frame = 0

Query: 1   MDSIFSATLVSSILVKGNGGIGCQTMMAHFKTYSRRRPPKNLLCPRRNKLPPDPAVNQFL 60
           M SI SAT VSSILVKGNGGIGCQ  M HFK  SRRRPPKNLLCPRR KLPPDPAVNQFL
Sbjct: 1   MHSIVSATSVSSILVKGNGGIGCQITMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFL 60

Query: 61  NNKTSAPSP-FTDLISSETFQLPEGEDDEHEEIHAYD-CRDNGVVWDSEEIEAISSLFQG 120
           NNKTSAPSP FTDLISS+ FQ      DEHEEIHAYD  +D  VVWDS+EIEAISSLFQG
Sbjct: 61  NNKTSAPSPSFTDLISSKIFQ------DEHEEIHAYDYTKDTDVVWDSDEIEAISSLFQG 120

Query: 121 RIPQKPGKLNRDRPLPLPLPHKLRPPGLPNPKIRPRVVVSSRALLSKQVYKCPDFLIGLA 180
           RIPQKPGKLNR+RPLPLPLPHKLRPP LPNPKIRP   VSSRALLSK+VYK PDFLIGLA
Sbjct: 121 RIPQKPGKLNRERPLPLPLPHKLRPPRLPNPKIRPTTTVSSRALLSKKVYKRPDFLIGLA 180

Query: 181 REIRYLSPEENVSKVLNRWGPFLQKGSLSLTIKELGRMGLPDRALKTFCWAQEQPRLFPD 240
           R IR LSPEENVSKVLNRWGPFLQKGSLSLTIKELG MGLPDRALKTFCW QEQ RLFPD
Sbjct: 181 RAIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALKTFCWVQEQRRLFPD 240

Query: 241 DRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKLLVASKK 300
           DRVLASTVEVL+RNHELKVP+NLEEFTKLASRGVLEAM+RGFIKGGSLNLAWKLLVA+KK
Sbjct: 241 DRVLASTVEVLSRNHELKVPVNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVAAKK 300

Query: 301 GKRMLDPSVYVKLILELGKNPDKNVLVLTLLDELGQREALKLNQQDTTAIIKACTRLGKF 360
           GKRMLDPSVYVKLILELGKNPDKNVLVLTLL+ELGQREALKLNQQD+T IIK CTRL KF
Sbjct: 301 GKRMLDPSVYVKLILELGKNPDKNVLVLTLLEELGQREALKLNQQDSTTIIKVCTRLRKF 360

Query: 361 EIAEKLYSWYVESGHEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDLPAYCVV 420
           EIAEKLY WYVESGHEPS+VMYTALVHSRYSD+KYREALSLVWEME+ANCPFDLPAY VV
Sbjct: 361 EIAEKLYCWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEMESANCPFDLPAYNVV 420

Query: 421 IKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAGF 480
           IKLFVALGDLSRAVRYFAKLKEAGF+PTYDVYRN+ITIYLVSGRLAK KEIYKEAENAGF
Sbjct: 421 IKLFVALGDLSRAVRYFAKLKEAGFSPTYDVYRNMITIYLVSGRLAKSKEIYKEAENAGF 480

Query: 481 VMDKQITSMLLQAKR 494
           +MDKQITSMLLQAKR
Sbjct: 481 IMDKQITSMLLQAKR 489

BLAST of HG10018167 vs. ExPASy TrEMBL
Match: A0A6J1GIP2 (pentatricopeptide repeat-containing protein At2g01860 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111454537 PE=4 SV=1)

HSP 1 Score: 815.1 bits (2104), Expect = 1.6e-232
Identity = 416/501 (83.03%), Positives = 444/501 (88.62%), Query Frame = 0

Query: 1   MDSIFSATLVSSILVKGNGGIGCQTMMAHFKTYSRRRPPKNLLCPRRNKLPPDPAVNQFL 60
           MDS+FS T +SSILVK NGGI CQ  +AHF+T SRRRPPKNLL PRR KLPPDP VNQFL
Sbjct: 1   MDSLFSTTTISSILVKRNGGISCQIPVAHFQTNSRRRPPKNLLYPRRTKLPPDPGVNQFL 60

Query: 61  NNKTSAPSP---FTDLISSETFQLPEGEDDEHEE-----IHAYDCRDNGVVWDSEEIEAI 120
             +TS P P   F DLISSE   LPE E DE EE       A D  D+ VVWDSEEIEAI
Sbjct: 61  KKRTSGPQPDTSFPDLISSEKIGLPEEELDEIEETAADNYFANDDNDSDVVWDSEEIEAI 120

Query: 121 SSLFQGRIPQKPGKLNRDRPLPLPLPHKLRPPGLPNPKIRPRVVVSSRALLSKQVYKCPD 180
           +SLF+GRIPQKPGKLNR+RPLPLPLPHKLRPPGLPNPKIRPR  VSSRAL+SKQVYK PD
Sbjct: 121 TSLFRGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRPRTAVSSRALMSKQVYKRPD 180

Query: 181 FLIGLAREIRYLSPEENVSKVLNRWGPFLQKGSLSLTIKELGRMGLPDRALKTFCWAQEQ 240
           FLIGLAR IR L PEENVSKVLNRW PFLQKGSLSLTIKELG MGL DRALKTFCW QEQ
Sbjct: 181 FLIGLARAIRDLKPEENVSKVLNRWAPFLQKGSLSLTIKELGHMGLADRALKTFCWVQEQ 240

Query: 241 PRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKL 300
           PRL+PDDRVLASTVEVLARNHELK+P NL+EFTKLASRGVLEAM+RGFIKGG L+LAWKL
Sbjct: 241 PRLYPDDRVLASTVEVLARNHELKIPFNLDEFTKLASRGVLEAMMRGFIKGGRLSLAWKL 300

Query: 301 LVASKKGKRMLDPSVYVKLILELGKNPDKNVLVLTLLDELGQREALKLNQQDTTAIIKAC 360
           LVA+K GKRMLDPSVYVKLILE+GKNPDKN+LVL LLDELGQREAL LNQQDT+AIIK  
Sbjct: 301 LVAAKNGKRMLDPSVYVKLILEIGKNPDKNMLVLALLDELGQREALNLNQQDTSAIIKVS 360

Query: 361 TRLGKFEIAEKLYSWYVESGHEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDL 420
           TRLGKFEIAE+LYSWYVESGHEPSVVMYTALVH+RYS++KYREALS+VWEMEAAN PFDL
Sbjct: 361 TRLGKFEIAERLYSWYVESGHEPSVVMYTALVHNRYSERKYREALSVVWEMEAANSPFDL 420

Query: 421 PAYCVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKE 480
           PAY VV+KLFVALGDLSRAVRYFAKLKEAGF PTY +YRNLITIYL +GRLAKCKEIYKE
Sbjct: 421 PAYSVVMKLFVALGDLSRAVRYFAKLKEAGFTPTYCIYRNLITIYLAAGRLAKCKEIYKE 480

Query: 481 AENAGFVMDKQITSMLLQAKR 494
           AENAG+VMDKQITSMLLQAKR
Sbjct: 481 AENAGYVMDKQITSMLLQAKR 501

BLAST of HG10018167 vs. ExPASy TrEMBL
Match: A0A6J1KK31 (pentatricopeptide repeat-containing protein At2g01860 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111495984 PE=4 SV=1)

HSP 1 Score: 810.4 bits (2092), Expect = 4.0e-231
Identity = 414/501 (82.63%), Positives = 442/501 (88.22%), Query Frame = 0

Query: 1   MDSIFSATLVSSILVKGNGGIGCQTMMAHFKTYSRRRPPKNLLCPRRNKLPPDPAVNQFL 60
           MDS+FS T VSSILVK NGGI CQ  MAHF T S+RRPPKNLL PRR KLPPDP VNQFL
Sbjct: 1   MDSLFSTTAVSSILVKRNGGISCQIPMAHFLTNSKRRPPKNLLYPRRTKLPPDPGVNQFL 60

Query: 61  NNKTSAPSP---FTDLISSETFQLPEGEDDEHEE-----IHAYDCRDNGVVWDSEEIEAI 120
             +TS P P   + DLI SE   LPE E DE EE       A D  D+ +VWD EEIEAI
Sbjct: 61  KKRTSDPHPDTSYPDLIPSEKIGLPEEELDELEETAADNYFANDDNDSDIVWDPEEIEAI 120

Query: 121 SSLFQGRIPQKPGKLNRDRPLPLPLPHKLRPPGLPNPKIRPRVVVSSRALLSKQVYKCPD 180
           +SLF+GRIPQKPGKLNR+RPLPLPLPHKLRPPGLPNPKIRPR  VSSRAL+SKQVYK PD
Sbjct: 121 TSLFRGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRPRTAVSSRALMSKQVYKRPD 180

Query: 181 FLIGLAREIRYLSPEENVSKVLNRWGPFLQKGSLSLTIKELGRMGLPDRALKTFCWAQEQ 240
           FLIGLAR IR L PEENVSKVLNRW PFLQKGSLSLTIKELG MGL DRALKTFCW QEQ
Sbjct: 181 FLIGLARAIRDLQPEENVSKVLNRWAPFLQKGSLSLTIKELGHMGLADRALKTFCWVQEQ 240

Query: 241 PRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKL 300
           PRL+PDDRVLASTVEVLARNHELK+P NL+EFTKLASRGVLEAM+RGFIKGG L+LAWKL
Sbjct: 241 PRLYPDDRVLASTVEVLARNHELKIPFNLDEFTKLASRGVLEAMMRGFIKGGRLSLAWKL 300

Query: 301 LVASKKGKRMLDPSVYVKLILELGKNPDKNVLVLTLLDELGQREALKLNQQDTTAIIKAC 360
           LVA+K GKRMLDPSV+VKLILE+GKNPDKN+LVL LLDELGQREAL L+QQDT+AIIK  
Sbjct: 301 LVAAKNGKRMLDPSVHVKLILEIGKNPDKNMLVLALLDELGQREALNLSQQDTSAIIKVS 360

Query: 361 TRLGKFEIAEKLYSWYVESGHEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDL 420
           TRLGKFEIAEKLYSWYVESGHEPSVVMYTALVH+RYS++KYREALS+VWEMEAANCPFDL
Sbjct: 361 TRLGKFEIAEKLYSWYVESGHEPSVVMYTALVHNRYSERKYREALSVVWEMEAANCPFDL 420

Query: 421 PAYCVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKE 480
           PAY VVIKLFVALGDLSRAVRYFAKLKEAGF PTY +YRNLITIYL +GRLAKCKEIYKE
Sbjct: 421 PAYSVVIKLFVALGDLSRAVRYFAKLKEAGFTPTYCIYRNLITIYLAAGRLAKCKEIYKE 480

Query: 481 AENAGFVMDKQITSMLLQAKR 494
           AENAG+VMDKQITSMLLQAKR
Sbjct: 481 AENAGYVMDKQITSMLLQAKR 501
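
The HSP header lines in these reports follow a fixed pattern, so their fields can be extracted mechanically. The following is a minimal Python sketch, not part of the database's own tooling: the function name `parse_hsp` and both regular expressions are illustrative assumptions. It recomputes the identity and positives percentages from the raw counts as a consistency check on the printed values.

```python
import re

# Patterns for the two HSP header lines as they appear in this report.
# "Posi?tives" also accepts the "Postives" misspelling some viewers emit.
HSP_SCORE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp(score_line, stats_line):
    """Return the numeric fields of one HSP header (hypothetical helper)."""
    s = HSP_SCORE.search(score_line)
    t = HSP_STATS.search(stats_line)
    if s is None or t is None:
        raise ValueError("unrecognised HSP header")
    identical, aln_len = int(t.group(1)), int(t.group(2))
    positive = int(t.group(4))
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(s.group(3)),
        "alignment_length": aln_len,
        # Percentages recomputed from the counts, not copied from the text.
        "identity_pct": round(100 * identical / aln_len, 2),
        "positives_pct": round(100 * positive / aln_len, 2),
    }


hsp = parse_hsp(
    "HSP 1 Score: 810.4 bits (2092), Expect = 4.0e-231",
    "Identity = 414/501 (82.63%), Positives = 442/501 (88.22%), Query Frame = 0",
)
print(hsp["identity_pct"], hsp["positives_pct"])
```

For the Cucurbita maxima hit, the recomputed values agree with the printed 82.63% and 88.22%.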

BLAST of HG10018167 vs. ExPASy TrEMBL
Match: A0A6J1DI37 (pentatricopeptide repeat-containing protein At2g01860 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111020698 PE=4 SV=1)

HSP 1 Score: 805.8 bits (2080), Expect = 9.8e-230
Identity = 409/495 (82.63%), Positives = 439/495 (88.69%), Query Frame = 0

Query: 1   MDSIFSATLVSSILVKGNGGIGCQTMMAHFKTYSRRRPPKNLLCPRRNKLPPDPAVNQFL 60
           MD IFS+++VSSI+VKGNGGI CQ  MA F   +RRR PKNLL PRR KLPPDP VNQFL
Sbjct: 1   MDCIFSSSIVSSIMVKGNGGISCQISMARFMANARRRLPKNLLNPRRTKLPPDPGVNQFL 60

Query: 61  NNKTSAPSP-FTDLISSETFQLPEGEDDEHEEI----HAYDCRDNGVVWDSEEIEAISSL 120
            N TS   P FTD  SSE  + PE E D+HEE     +  D +D  ++WDS+EIEAISSL
Sbjct: 61  KNTTSGSGPSFTDFTSSEKIEFPEEEHDDHEEADTENYFVDDKDGEIIWDSDEIEAISSL 120

Query: 121 FQGRIPQKPGKLNRDRPLPLPLPHKLRPPGLPNPKIRPRVVVSSRALLSKQVYKCPDFLI 180
           FQGRIPQKPGKLNR+RPLPLPLPHKLRPPGLPNPKIR R  V SRA LSKQVYK PDFLI
Sbjct: 121 FQGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRARTGVPSRASLSKQVYKRPDFLI 180

Query: 181 GLAREIRYLSPEENVSKVLNRWGPFLQKGSLSLTIKELGRMGLPDRALKTFCWAQEQPRL 240
           GLAR IR LS EENVSKVLNRW PFL KGSLSLTI+ELG MGL DRAL++FCWAQEQPRL
Sbjct: 181 GLARAIRDLSREENVSKVLNRWAPFLLKGSLSLTIRELGHMGLADRALQSFCWAQEQPRL 240

Query: 241 FPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKLLVA 300
           FPDDRVLASTVEVL+RNHELKVPLNLEEFT+LASRGVLEAM+RGFIKGGSLNLAWKLLV 
Sbjct: 241 FPDDRVLASTVEVLSRNHELKVPLNLEEFTRLASRGVLEAMIRGFIKGGSLNLAWKLLVV 300

Query: 301 SKKGKRMLDPSVYVKLILELGKNPDKNVLVLTLLDELGQREALKLNQQDTTAIIKACTRL 360
           +KKG RMLDPSVYVKLILELGKNPDKN+LVLTLLDELGQREALKLNQQDTTAI+K CTRL
Sbjct: 301 AKKGNRMLDPSVYVKLILELGKNPDKNMLVLTLLDELGQREALKLNQQDTTAIMKVCTRL 360

Query: 361 GKFEIAEKLYSWYVESGHEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDLPAY 420
           GKFEIAE+LY WYVES HEPSVVMYTAL+HSRYS+KKYREALS+VWEMEAANCPFDLPAY
Sbjct: 361 GKFEIAERLYGWYVESVHEPSVVMYTALIHSRYSEKKYREALSVVWEMEAANCPFDLPAY 420

Query: 421 CVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAEN 480
            VVIKLFVALGDLSRA RYFAKLKEAGFAPTYD+YRNLITIYLVSGRLAKCKEIYKEA+N
Sbjct: 421 NVVIKLFVALGDLSRAARYFAKLKEAGFAPTYDIYRNLITIYLVSGRLAKCKEIYKEAKN 480

Query: 481 AGFVMDKQITSMLLQ 491
           AGF++DKQITS LLQ
Sbjct: 481 AGFIIDKQITSRLLQ 495
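
In each alignment block, the middle line repeats the residue letter where query and subject are identical, shows `+` for a conservative substitution, and leaves a space for a mismatch or gap. The short Python sketch below illustrates how the identity and positives counts relate to that middle line; `midline_counts` is a hypothetical helper name, and the example uses only the final block of the Momordica charantia alignment rather than the full HSP.

```python
def midline_counts(midline):
    """Count identities and positives in one BLAST midline string.

    Letters mark identical residues; '+' marks conservative substitutions.
    Positives include identities, as in the report's summary line.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives


# Final block of the alignment above (query residues 481 onward).
query = "AGFVMDKQITSMLLQ"
midline = "AGF++DKQITS LLQ"
sbjct = "AGFIIDKQITSRLLQ"

identities, positives = midline_counts(midline)
print(identities, positives, len(query))
```

Summing these per-block counts over every block of an HSP should give the numerators reported on its `Identity` and `Positives` line.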

BLAST of HG10018167 vs. TAIR 10
Match: AT2G01860.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 502.3 bits (1292), Expect = 4.5e-142
Identity = 271/478 (56.69%), Positives = 349/478 (73.01%), Query Frame = 0

Query: 19  GGIGCQTMMAHFKTYSRRRPPKNLLCPRRNKLPPDPAVNQFLNNKTSAPSPFTDLISSET 78
           G IG   + A  + +S++   KNL  PRR KLPPD  VN FL      P     +I  + 
Sbjct: 23  GNIGVTRVNASQRNHSKKL-TKNLRNPRRTKLPPDFGVNLFLRKPKIEPL----VIDDDD 82

Query: 79  FQLPEGEDDEHEEIHAYDCRDNGVVWDSEEIEAISSLFQGRIPQKPGKLNRDRPLPLPLP 138
            Q+ E  +D+          D+ VVW+ EEIEAISSLFQ RIPQKP K +R RPLPLP P
Sbjct: 83  EQVQESVNDD----------DDAVVWEPEEIEAISSLFQKRIPQKPDKPSRVRPLPLPQP 142

Query: 139 HKLRPPGLPNPKIRPRVVVSSRAL--LSKQVYKCPDFLIGLAREIRYL-SPEENVSKVLN 198
           HKLRP GLP PK   + ++ S AL  +SKQVYK P FLIGLAREI+ L S + +VS VLN
Sbjct: 143 HKLRPLGLPTPK---KNIIRSPALSSVSKQVYKDPSFLIGLAREIKSLPSSDADVSLVLN 202

Query: 199 RWGPFLQKGSLSLTIKELGRMGLPDRALKTFCWAQEQPRLFPDDRVLASTVEVLARNHEL 258
           +W  FL+KGSLS TI+ELG MGLP+RAL+T+ WA++   L PD+R+LAST++VLA++HEL
Sbjct: 203 KWVSFLRKGSLSTTIRELGHMGLPERALQTYHWAEKHSHLVPDNRILASTIQVLAKHHEL 262

Query: 259 KVPLNLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKLLVASKKGKRMLDPSVYVKLILEL 318
           K+   L+    LAS+ V+EAM++G I+GG LNLA KL++ SK   R+LD SVYVK+ILE+
Sbjct: 263 KL---LKFDNSLASKNVIEAMIKGCIEGGWLNLARKLILISKSNNRILDSSVYVKMILEI 322

Query: 319 GKNPDKNVLVLTLLDELGQREALKLNQQDTTAIIKACTRLGKFEIAEKLYSWYVESGHEP 378
            KNPDK  LV+ LL+EL +RE LKL+QQD T+I+K C +LG+FE+ E L+ W+  S  EP
Sbjct: 323 AKNPDKYHLVVALLEELKKREDLKLSQQDCTSIMKICVKLGEFELVESLFDWFKASNREP 382

Query: 379 SVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDLPAYCVVIKLFVALGDLSRAVRYF 438
           SVVMYT ++HSRYS++KYREA+S+VWEME +NC  DLPAY VVIKLFVAL DL RA+RY+
Sbjct: 383 SVVMYTTMIHSRYSEQKYREAMSVVWEMEESNCLLDLPAYRVVIKLFVALDDLGRAMRYY 442

Query: 439 AKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAGFVMDKQITSMLLQAKR 494
           +KLKEAGF+PTYD+YR++I++Y  SGRL KCKEI KE E+AG  +DK  +  LLQ ++
Sbjct: 443 SKLKEAGFSPTYDIYRDMISVYTASGRLTKCKEICKEVEDAGLRLDKDTSFRLLQLEK 479

BLAST of HG10018167 vs. TAIR 10
Match: AT1G74850.1 (plastid transcriptionally active 2 )

HSP 1 Score: 75.5 bits (184), Expect = 1.4e-13
Identity = 68/322 (21.12%), Positives = 145/322 (45.03%), Query Frame = 0

Query: 177 LAREIRYLSPEENVSKVLNRWGPFLQKGSLSLTIKELGRMGLPDRALKTFCWAQEQPRLF 236
           L  ++  L P  ++++ L+ +   L     +L  KE    G   R+L+ F + Q Q    
Sbjct: 79  LINKLSSLPPRGSIARCLDIFKNKLSLNDFALVFKEFAGRGDWQRSLRLFKYMQRQIWCK 138

Query: 237 PDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGV------LEAMVRGFIKGGSLNLAW 296
           P++ +    + +L R  E  +   LE F ++ S+GV        A++  + + G    + 
Sbjct: 139 PNEHIYTIMISLLGR--EGLLDKCLEVFDEMPSQGVSRSVFSYTALINAYGRNGRYETSL 198

Query: 297 KLLVASKKGKRMLDPSV--YVKLILELGKNPDKNVLVLTLLDELGQREALKLNQQDTTAI 356
           +LL   K  K  + PS+  Y  +I    +       +L L  E+ + E ++ +      +
Sbjct: 199 ELLDRMKNEK--ISPSILTYNTVINACARGGLDWEGLLGLFAEM-RHEGIQPDIVTYNTL 258

Query: 357 IKACTRLGKFEIAEKLYSWYVESGHEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANC 416
           + AC   G  + AE ++    + G  P +  Y+ LV +    ++  +   L+ EM +   
Sbjct: 259 LSACAIRGLGDEAEMVFRTMNDGGIVPDLTTYSHLVETFGKLRRLEKVCDLLGEMASGGS 318

Query: 417 PFDLPAYCVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKE 476
             D+ +Y V+++ +   G +  A+  F +++ AG  P  + Y  L+ ++  SGR    ++
Sbjct: 319 LPDITSYNVLLEAYAKSGSIKEAMGVFHQMQAAGCTPNANTYSVLLNLFGQSGRYDDVRQ 378

Query: 477 IYKEAENAGFVMDKQITSMLLQ 491
           ++ E +++    D    ++L++
Sbjct: 379 LFLEMKSSNTDPDAATYNILIE 395

BLAST of HG10018167 vs. TAIR 10
Match: AT2G18940.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 72.0 bits (175), Expect = 1.5e-12
Identity = 68/291 (23.37%), Positives = 126/291 (43.30%), Query Frame = 0

Query: 201 LQKGSLSLTIKELGRMGLPDRALKTFCW---AQEQPRLFPDDRVLASTVEVLARNHELKV 260
           L +  L   +K L   G  +RA+  F W   +     L  D +V+   V +L R  +  V
Sbjct: 134 LLRTDLVSLVKGLDDSGHWERAVFLFEWLVLSSNSGALKLDHQVIEIFVRILGRESQYSV 193

Query: 261 ------PLNLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKLLVASKK---GKRMLDPSVY 320
                  + L+E+  L        ++  + + G    A  L    K+      ++  +V 
Sbjct: 194 AAKLLDKIPLQEY--LLDVRAYTTILHAYSRTGKYEKAIDLFERMKEMGPSPTLVTYNVI 253

Query: 321 VKLILELGKNPDKNVLVLTLLDELGQREALKLNQQDTTAIIKACTRLGKFEIAEKLYSWY 380
           + +  ++G++  K   +L +LDE+ + + LK ++   + ++ AC R G    A++ ++  
Sbjct: 254 LDVFGKMGRSWRK---ILGVLDEM-RSKGLKFDEFTCSTVLSACAREGLLREAKEFFAEL 313

Query: 381 VESGHEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDLPAYCVVIKLFVALGDL 440
              G+EP  V Y AL+        Y EALS++ EME  +CP D   Y  ++  +V  G  
Sbjct: 314 KSCGYEPGTVTYNALLQVFGKAGVYTEALSVLKEMEENSCPADSVTYNELVAAYVRAGFS 373

Query: 441 SRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAGFV 480
             A      + + G  P    Y  +I  Y  +G+  +  +++   + AG V
Sbjct: 374 KEAAGVIEMMTKKGVMPNAITYTTVIDAYGKAGKEDEALKLFYSMKEAGCV 418

BLAST of HG10018167 vs. TAIR 10
Match: AT2G01740.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 67.4 bits (163), Expect = 3.7e-11
Identity = 53/210 (25.24%), Positives = 91/210 (43.33%), Query Frame = 0

Query: 280 FIKGGSLNLAWKLLVASKKGKRMLDPSVYVKLILELGKNPDKNVLVLTLLDELGQREALK 339
           F K G L LA K   + K+     +   +  LI    K  D  V V +L  E+ +R  + 
Sbjct: 173 FCKSGELQLALKSFHSMKRDALSPNVVTFTCLIDGYCKAGDLEVAV-SLYKEM-RRVRMS 232

Query: 340 LNQQDTTAIIKACTRLGKFEIAEKLYSWYVESGHEPSVVMYTALVHSRYSDKKYREALSL 399
           LN    TA+I    + G+ + AE++YS  VE   EP+ ++YT ++   +       A+  
Sbjct: 233 LNVVTYTALIDGFCKKGEMQRAEEMYSRMVEDRVEPNSLVYTTIIDGFFQRGDSDNAMKF 292

Query: 400 VWEMEAANCPFDLPAYCVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLV 459
           + +M       D+ AY V+I      G L  A      ++++   P   ++  ++  Y  
Sbjct: 293 LAKMLNQGMRLDITAYGVIISGLCGNGKLKEATEIVEDMEKSDLVPDMVIFTTMMNAYFK 352

Query: 460 SGRLAKCKEIYKEAENAGFVMDKQITSMLL 490
           SGR+     +Y +    GF  D    S ++
Sbjct: 353 SGRMKAAVNMYHKLIERGFEPDVVALSTMI 380

BLAST of HG10018167 vs. TAIR 10
Match: AT5G13770.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 67.4 bits (163), Expect = 3.7e-11
Identity = 58/237 (24.47%), Positives = 112/237 (47.26%), Query Frame = 0

Query: 261 LEEFTKLASRGVLEA------MVRGFIKGGSLNLAWKLLVASKKGKRMLDPSVYVKLILE 320
           LE   ++  +G+ E+      ++R F +   + +  KL   +   K + DP + +K++L 
Sbjct: 268 LEVLEEMKDKGIPESSELYSMLIRAFAEAREVVITEKLFKEAGGKKLLKDPEMCLKVVLM 327

Query: 321 LGK--NPDKNVLVLTLLDELGQREALKLNQQDTTAIIKACTRLGKFEIAEKLYSWYVESG 380
             +  N +  + V+  +    ++  LK+      AI+   ++   F  A K+Y W ++  
Sbjct: 328 YVREGNMETTLEVVAAM----RKAELKVTDCILCAIVNGFSKQRGFAEAVKVYEWAMKEE 387

Query: 381 HEPSVVMYTALVHSRYSDKKYREALSLVWEMEAANCPFDLPAYCVVIKLFVALGDLSRAV 440
            E   V Y   +++    +KY +A  L  EM        + AY  ++ ++     LS AV
Sbjct: 388 CEAGQVTYAIAINAYCRLEKYNKAEMLFDEMVKKGFDKCVVAYSNIMDMYGKTRRLSDAV 447

Query: 441 RYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAGFVMDK-QITSML 489
           R  AK+K+ G  P   +Y +LI ++  +  L + ++I+KE + A  + DK   TSM+
Sbjct: 448 RLMAKMKQRGCKPNIWIYNSLIDMHGRAMDLRRAEKIWKEMKRAKVLPDKVSYTSMI 500

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038893977.1  3.3e-256  91.90     pentatricopeptide repeat-containing protein At2g01860 [Benincasa hispida] >XP_03...
XP_004139567.1  2.4e-246  88.69     pentatricopeptide repeat-containing protein At2g01860 [Cucumis sativus] >XP_0116...
XP_008462173.1  1.2e-245  89.09     PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo] ...
KAG7020726.1    5.2e-233  83.03     Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
XP_022951807.1  3.3e-232  83.03     pentatricopeptide repeat-containing protein At2g01860 isoform X1 [Cucurbita mosc...

Match Name  E-value   Identity  Description
Q5XET4      6.3e-141  56.69     Pentatricopeptide repeat-containing protein At2g01860 OS=Arabidopsis thaliana OX...
Q9S7Q2      1.9e-12   21.12     Pentatricopeptide repeat-containing protein At1g74850, chloroplastic OS=Arabidop...
O64624      2.1e-11   23.37     Pentatricopeptide repeat-containing protein At2g18940, chloroplastic OS=Arabidop...
Q9ZUA2      5.2e-10   25.24     Pentatricopeptide repeat-containing protein At2g01740 OS=Arabidopsis thaliana OX...
Q66GP4      5.2e-10   24.47     Pentatricopeptide repeat-containing protein At5g13770, chloroplastic OS=Arabidop...

Match Name  E-value   Identity  Description
A0A0A0LVM0  1.2e-246  88.69     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G144300 PE=4 SV=1
A0A1S3CGD0  5.7e-246  89.09     pentatricopeptide repeat-containing protein At2g01860 OS=Cucumis melo OX=3656 GN...
A0A6J1GIP2  1.6e-232  83.03     pentatricopeptide repeat-containing protein At2g01860 isoform X1 OS=Cucurbita mo...
A0A6J1KK31  4.0e-231  82.63     pentatricopeptide repeat-containing protein At2g01860 isoform X1 OS=Cucurbita ma...
A0A6J1DI37  9.8e-230  82.63     pentatricopeptide repeat-containing protein At2g01860 isoform X1 OS=Momordica ch...

Match Name   E-value   Identity  Description
AT2G01860.1  4.5e-142  56.69     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G74850.1  1.4e-13   21.12     plastid transcriptionally active 2
AT2G18940.1  1.5e-12   23.37     Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G01740.1  3.7e-11   25.24     Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G13770.1  3.7e-11   24.47     Pentatricopeptide repeat (PPR-like) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                                    Source       Source Term      Source Description                                 Alignment
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10       Tetratricopeptide repeat domain                    coord: 178..318; e-value: 5.4E-6; score: 27.8
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10       Tetratricopeptide repeat domain                    coord: 319..493; e-value: 3.2E-26; score: 94.4
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756        TIGR00756                                          coord: 414..446; e-value: 0.0033; score: 15.5
IPR002885  Pentatricopeptide repeat                           PFAM         PF01535          PPR                                                coord: 346..365; e-value: 0.96; score: 9.8
IPR002885  Pentatricopeptide repeat                           PFAM         PF13812          PPR_3                                              coord: 399..457; e-value: 6.6E-5; score: 22.9
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375          PPR                                                coord: 341..375; score: 8.769097
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375          PPR                                                coord: 411..445; score: 9.54735
None       No IPR available                                   MOBIDB_LITE  mobidb-lite      disorder_prediction                                coord: 120..148
None       No IPR available                                   PANTHER      PTHR46128        MITOCHONDRIAL GROUP I INTRON SPLICING FACTOR CCM1  coord: 72..468
None       No IPR available                                   PANTHER      PTHR46128:SF179  TETRATRICOPEPTIDE REPEAT-LIKE SUPERFAMILY PROTEIN  coord: 72..468
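
Several of the IPR002885 (pentatricopeptide repeat) hits above cover overlapping stretches of the protein. As a rough illustration of how these coordinates might be combined, the Python sketch below merges the five reported ranges into non-overlapping regions. The names `PPR_HITS` and `merge_intervals` are hypothetical, and treating touching or overlapping hits as a single region is an assumption made for this example, not something the InterPro analysis itself does.

```python
# (source, source term, start, end) for the IPR002885 rows listed above.
PPR_HITS = [
    ("TIGRFAM", "TIGR00756", 414, 446),
    ("PFAM", "PF01535", 346, 365),
    ("PFAM", "PF13812", 399, 457),
    ("PROSITE", "PS51375", 341, 375),
    ("PROSITE", "PS51375", 411, 445),
]


def merge_intervals(intervals):
    """Merge inclusive (start, end) ranges that overlap or touch."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Extend the previous region instead of starting a new one.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


regions = merge_intervals((start, end) for _, _, start, end in PPR_HITS)
covered = sum(end - start + 1 for start, end in regions)
print(regions, covered)
```

Under these assumptions the five hits collapse into two regions, 341..375 and 399..457, covering 94 residues in total.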

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10018167.1  HG10018167.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0005515      protein binding