Moc08g33000 (gene) Bitter gourd (OHB3-1) v2

Overview
Name: Moc08g33000
Type: gene
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Pentatricopeptide repeat-containing protein
Location: chr8: 23963285 .. 23964799 (+)
RNA-Seq Expression: Moc08g33000
Synteny: Moc08g33000
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATTGCATATTCTCGAGCTCTATCGTCTCTTCCATTATGGTGAAAGGAAATGGAGGAATCAGCTGCCAGATTTCGATGGCTCGTTTCATGGCGAATGCCAGAAGACGACTGCCCAAGAACCTTCTCAATCCACGACGGACCAAGCTTCCCCCTGATCCCGGCGTCAATCAATTCTTGAAGAACACAACCTCCGGCTCTGGCCCCTCCTTCACAGATTTCACTTCCAGCGAGAAAATCGAGTTTCCCGAGGAAGAACACGACGACCACGAAGAAGCTGACACTGAGAATTATTTCGTCGATGATAAGGACGGTGAAATTATTTGGGATTCGGATGAAATTGAAGCTATTTCATCACTCTTCCAAGGCAGAATCCCTCAGAAACCTGGTAAGTTGAACCGGGAGAGGCCTCTTCCGCTCCCACTTCCTCACAAGCTACGACCACCAGGACTTCCTAACCCTAAAATCCGAGCAAGAACGGGCGTTCCTTCGCGTGCATCGCTATCTAAGCAAGTCTACAAGCGCCCCGATTTTCTTATCGGCCTTGCCAGAGCGATTAGAGATCTGTCTCGGGAGGAAAATGTGTCCAAAGTTCTCAATAGGTGGGCTCCGTTTTTGCTGAAGGGGTCCCTGTCATTGACGATCAGGGAACTGGGTCATATGGGTCTCGCCGATAGAGCTTTACAGTCGTTCTGTTGGGCGCAAGAACAACCTCGACTCTTTCCAGATGATCGTGTTTTGGCCTCAACCGTTGAGGTCCTCTCGAGGAACCATGAACTGAAAGTACCACTGAACTTGGAAGAGTTCACTAGACTGGCTAGTCGTGGTGTGCTCGAGGCGATGATAAGAGGGTTTATCAAAGGTGGGAGCTTAAACCTTGCTTGGAAGCTTCTTGTAGTTGCCAAGAAGGGTAATAGAATGTTGGATCCCAGCGTTTATGTGAAATTGATATTGGAGCTAGGGAAGAACCCTGATAAAAACATGCTGGTACTTACCTTACTGGATGAACTAGGACAAAGAGAAGCCTTGAAATTAAACCAGCAAGACACAACAGCTATCATGAAGGTCTGCACTAGGCTTGGTAAATTTGAAATTGCAGAGAGACTTTATGGCTGGTATGTAGAATCTGTACATGAACCTAGTGTGGTTATGTACACTGCTTTAATTCATAGTCGCTACTCAGAGAAGAAATATAGAGAGGCATTATCTGTGGTGTGGGAAATGGAGGCTGCAAACTGTCCTTTTGATCTTCCTGCTTATAATGTAGTGATAAAGCTTTTTGTTGCTCTTGGTGATCTCTCAAGGGCTGCTAGATACTTTGCAAAGCTTAAGGAAGCTGGTTTTGCCCCTACATATGACATATATAGGAATCTGATTACCATTTATTTAGTTTCAGGGAGGTTAGCAAAGTGTAAGGAAATTTATAAGGAAGCCAAGAATGCAGGATTTATTATTGATAAACAAATTACTTCGAGGCTGTTGCAATGTGCAAGCAGAAAGATGATACGTTCCTAG

mRNA sequence

ATGGATTGCATATTCTCGAGCTCTATCGTCTCTTCCATTATGGTGAAAGGAAATGGAGGAATCAGCTGCCAGATTTCGATGGCTCGTTTCATGGCGAATGCCAGAAGACGACTGCCCAAGAACCTTCTCAATCCACGACGGACCAAGCTTCCCCCTGATCCCGGCGTCAATCAATTCTTGAAGAACACAACCTCCGGCTCTGGCCCCTCCTTCACAGATTTCACTTCCAGCGAGAAAATCGAGTTTCCCGAGGAAGAACACGACGACCACGAAGAAGCTGACACTGAGAATTATTTCGTCGATGATAAGGACGGTGAAATTATTTGGGATTCGGATGAAATTGAAGCTATTTCATCACTCTTCCAAGGCAGAATCCCTCAGAAACCTGGTAAGTTGAACCGGGAGAGGCCTCTTCCGCTCCCACTTCCTCACAAGCTACGACCACCAGGACTTCCTAACCCTAAAATCCGAGCAAGAACGGGCGTTCCTTCGCGTGCATCGCTATCTAAGCAAGTCTACAAGCGCCCCGATTTTCTTATCGGCCTTGCCAGAGCGATTAGAGATCTGTCTCGGGAGGAAAATGTGTCCAAAGTTCTCAATAGGTGGGCTCCGTTTTTGCTGAAGGGGTCCCTGTCATTGACGATCAGGGAACTGGGTCATATGGGTCTCGCCGATAGAGCTTTACAGTCGTTCTGTTGGGCGCAAGAACAACCTCGACTCTTTCCAGATGATCGTGTTTTGGCCTCAACCGTTGAGGTCCTCTCGAGGAACCATGAACTGAAAGTACCACTGAACTTGGAAGAGTTCACTAGACTGGCTAGTCGTGGTGTGCTCGAGGCGATGATAAGAGGGTTTATCAAAGGTGGGAGCTTAAACCTTGCTTGGAAGCTTCTTGTAGTTGCCAAGAAGGGTAATAGAATGTTGGATCCCAGCGTTTATGTGAAATTGATATTGGAGCTAGGGAAGAACCCTGATAAAAACATGCTGGTACTTACCTTACTGGATGAACTAGGACAAAGAGAAGCCTTGAAATTAAACCAGCAAGACACAACAGCTATCATGAAGGTCTGCACTAGGCTTGGTAAATTTGAAATTGCAGAGAGACTTTATGGCTGGTATGTAGAATCTGTACATGAACCTAGTGTGGTTATGTACACTGCTTTAATTCATAGTCGCTACTCAGAGAAGAAATATAGAGAGGCATTATCTGTGGTGTGGGAAATGGAGGCTGCAAACTGTCCTTTTGATCTTCCTGCTTATAATGTAGTGATAAAGCTTTTTGTTGCTCTTGGTGATCTCTCAAGGGCTGCTAGATACTTTGCAAAGCTTAAGGAAGCTGGTTTTGCCCCTACATATGACATATATAGGAATCTGATTACCATTTATTTAGTTTCAGGGAGGTTAGCAAAGTGTAAGGAAATTTATAAGGAAGCCAAGAATGCAGGATTTATTATTGATAAACAAATTACTTCGAGGCTGTTGCAATGTGCAAGCAGAAAGATGATACGTTCCTAG

Coding sequence (CDS)

ATGGATTGCATATTCTCGAGCTCTATCGTCTCTTCCATTATGGTGAAAGGAAATGGAGGAATCAGCTGCCAGATTTCGATGGCTCGTTTCATGGCGAATGCCAGAAGACGACTGCCCAAGAACCTTCTCAATCCACGACGGACCAAGCTTCCCCCTGATCCCGGCGTCAATCAATTCTTGAAGAACACAACCTCCGGCTCTGGCCCCTCCTTCACAGATTTCACTTCCAGCGAGAAAATCGAGTTTCCCGAGGAAGAACACGACGACCACGAAGAAGCTGACACTGAGAATTATTTCGTCGATGATAAGGACGGTGAAATTATTTGGGATTCGGATGAAATTGAAGCTATTTCATCACTCTTCCAAGGCAGAATCCCTCAGAAACCTGGTAAGTTGAACCGGGAGAGGCCTCTTCCGCTCCCACTTCCTCACAAGCTACGACCACCAGGACTTCCTAACCCTAAAATCCGAGCAAGAACGGGCGTTCCTTCGCGTGCATCGCTATCTAAGCAAGTCTACAAGCGCCCCGATTTTCTTATCGGCCTTGCCAGAGCGATTAGAGATCTGTCTCGGGAGGAAAATGTGTCCAAAGTTCTCAATAGGTGGGCTCCGTTTTTGCTGAAGGGGTCCCTGTCATTGACGATCAGGGAACTGGGTCATATGGGTCTCGCCGATAGAGCTTTACAGTCGTTCTGTTGGGCGCAAGAACAACCTCGACTCTTTCCAGATGATCGTGTTTTGGCCTCAACCGTTGAGGTCCTCTCGAGGAACCATGAACTGAAAGTACCACTGAACTTGGAAGAGTTCACTAGACTGGCTAGTCGTGGTGTGCTCGAGGCGATGATAAGAGGGTTTATCAAAGGTGGGAGCTTAAACCTTGCTTGGAAGCTTCTTGTAGTTGCCAAGAAGGGTAATAGAATGTTGGATCCCAGCGTTTATGTGAAATTGATATTGGAGCTAGGGAAGAACCCTGATAAAAACATGCTGGTACTTACCTTACTGGATGAACTAGGACAAAGAGAAGCCTTGAAATTAAACCAGCAAGACACAACAGCTATCATGAAGGTCTGCACTAGGCTTGGTAAATTTGAAATTGCAGAGAGACTTTATGGCTGGTATGTAGAATCTGTACATGAACCTAGTGTGGTTATGTACACTGCTTTAATTCATAGTCGCTACTCAGAGAAGAAATATAGAGAGGCATTATCTGTGGTGTGGGAAATGGAGGCTGCAAACTGTCCTTTTGATCTTCCTGCTTATAATGTAGTGATAAAGCTTTTTGTTGCTCTTGGTGATCTCTCAAGGGCTGCTAGATACTTTGCAAAGCTTAAGGAAGCTGGTTTTGCCCCTACATATGACATATATAGGAATCTGATTACCATTTATTTAGTTTCAGGGAGGTTAGCAAAGTGTAAGGAAATTTATAAGGAAGCCAAGAATGCAGGATTTATTATTGATAAACAAATTACTTCGAGGCTGTTGCAATGTGCAAGCAGAAAGATGATACGTTCCTAG

Protein sequence

MDCIFSSSIVSSIMVKGNGGISCQISMARFMANARRRLPKNLLNPRRTKLPPDPGVNQFLKNTTSGSGPSFTDFTSSEKIEFPEEEHDDHEEADTENYFVDDKDGEIIWDSDEIEAISSLFQGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRARTGVPSRASLSKQVYKRPDFLIGLARAIRDLSREENVSKVLNRWAPFLLKGSLSLTIRELGHMGLADRALQSFCWAQEQPRLFPDDRVLASTVEVLSRNHELKVPLNLEEFTRLASRGVLEAMIRGFIKGGSLNLAWKLLVVAKKGNRMLDPSVYVKLILELGKNPDKNMLVLTLLDELGQREALKLNQQDTTAIMKVCTRLGKFEIAERLYGWYVESVHEPSVVMYTALIHSRYSEKKYREALSVVWEMEAANCPFDLPAYNVVIKLFVALGDLSRAARYFAKLKEAGFAPTYDIYRNLITIYLVSGRLAKCKEIYKEAKNAGFIIDKQITSRLLQCASRKMIRS
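
The location span and the protein length reported on this page can be cross-checked with a little arithmetic. The sketch below is only illustrative: it assumes the location uses 1-based inclusive coordinates, and that the gene span equals the CDS length, which seems plausible here because the gene, mRNA, and CDS sequences appear to be identical. The helper names `span_length` and `protein_length` are hypothetical and not part of any database tool.

```python
# Coordinates copied from the Location field (chr8, + strand).
# Assumption: 1-based, inclusive coordinates.
START, END = 23963285, 23964799


def span_length(start: int, end: int) -> int:
    """Length in bases of a 1-based, inclusive interval."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS that ends in exactly one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


gene_len = span_length(START, END)
print(gene_len)                  # 1515
print(protein_length(gene_len))  # 504
```

The result of 504 residues agrees with the alignment length of the self-hit reported in the Homology section (504/504).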
Homology
BLAST of Moc08g33000 vs. NCBI nr
Match: XP_022153119.1 (pentatricopeptide repeat-containing protein At2g01860 isoform X1 [Momordica charantia] >XP_022153120.1 pentatricopeptide repeat-containing protein At2g01860 isoform X2 [Momordica charantia])

HSP 1 Score: 989.9 bits (2558), Expect = 7.8e-285
Identity = 504/504 (100.00%), Positives = 504/504 (100.00%), Query Frame = 0

Query: 1   MDCIFSSSIVSSIMVKGNGGISCQISMARFMANARRRLPKNLLNPRRTKLPPDPGVNQFL 60
           MDCIFSSSIVSSIMVKGNGGISCQISMARFMANARRRLPKNLLNPRRTKLPPDPGVNQFL
Sbjct: 1   MDCIFSSSIVSSIMVKGNGGISCQISMARFMANARRRLPKNLLNPRRTKLPPDPGVNQFL 60

Query: 61  KNTTSGSGPSFTDFTSSEKIEFPEEEHDDHEEADTENYFVDDKDGEIIWDSDEIEAISSL 120
           KNTTSGSGPSFTDFTSSEKIEFPEEEHDDHEEADTENYFVDDKDGEIIWDSDEIEAISSL
Sbjct: 61  KNTTSGSGPSFTDFTSSEKIEFPEEEHDDHEEADTENYFVDDKDGEIIWDSDEIEAISSL 120

Query: 121 FQGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRARTGVPSRASLSKQVYKRPDFLI 180
           FQGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRARTGVPSRASLSKQVYKRPDFLI
Sbjct: 121 FQGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRARTGVPSRASLSKQVYKRPDFLI 180

Query: 181 GLARAIRDLSREENVSKVLNRWAPFLLKGSLSLTIRELGHMGLADRALQSFCWAQEQPRL 240
           GLARAIRDLSREENVSKVLNRWAPFLLKGSLSLTIRELGHMGLADRALQSFCWAQEQPRL
Sbjct: 181 GLARAIRDLSREENVSKVLNRWAPFLLKGSLSLTIRELGHMGLADRALQSFCWAQEQPRL 240

Query: 241 FPDDRVLASTVEVLSRNHELKVPLNLEEFTRLASRGVLEAMIRGFIKGGSLNLAWKLLVV 300
           FPDDRVLASTVEVLSRNHELKVPLNLEEFTRLASRGVLEAMIRGFIKGGSLNLAWKLLVV
Sbjct: 241 FPDDRVLASTVEVLSRNHELKVPLNLEEFTRLASRGVLEAMIRGFIKGGSLNLAWKLLVV 300

Query: 301 AKKGNRMLDPSVYVKLILELGKNPDKNMLVLTLLDELGQREALKLNQQDTTAIMKVCTRL 360
           AKKGNRMLDPSVYVKLILELGKNPDKNMLVLTLLDELGQREALKLNQQDTTAIMKVCTRL
Sbjct: 301 AKKGNRMLDPSVYVKLILELGKNPDKNMLVLTLLDELGQREALKLNQQDTTAIMKVCTRL 360

Query: 361 GKFEIAERLYGWYVESVHEPSVVMYTALIHSRYSEKKYREALSVVWEMEAANCPFDLPAY 420
           GKFEIAERLYGWYVESVHEPSVVMYTALIHSRYSEKKYREALSVVWEMEAANCPFDLPAY
Sbjct: 361 GKFEIAERLYGWYVESVHEPSVVMYTALIHSRYSEKKYREALSVVWEMEAANCPFDLPAY 420

Query: 421 NVVIKLFVALGDLSRAARYFAKLKEAGFAPTYDIYRNLITIYLVSGRLAKCKEIYKEAKN 480
           NVVIKLFVALGDLSRAARYFAKLKEAGFAPTYDIYRNLITIYLVSGRLAKCKEIYKEAKN
Sbjct: 421 NVVIKLFVALGDLSRAARYFAKLKEAGFAPTYDIYRNLITIYLVSGRLAKCKEIYKEAKN 480

Query: 481 AGFIIDKQITSRLLQCASRKMIRS 505
           AGFIIDKQITSRLLQCASRKMIRS
Sbjct: 481 AGFIIDKQITSRLLQCASRKMIRS 504

BLAST of Moc08g33000 vs. NCBI nr
Match: KAG7020726.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 803.9 bits (2075), Expect = 7.9e-229
Identity = 406/498 (81.53%), Positives = 446/498 (89.56%), Query Frame = 0

Query: 1   MDCIFSSSIVSSIMVKGNGGISCQISMARFMANARRRLPKNLLNPRRTKLPPDPGVNQFL 60
           MD +FS++ +SSI+VK NGGISCQI +A F  N+RRR PKNLL PRRTKLPPDPGVNQFL
Sbjct: 1   MDSLFSTTTISSILVKRNGGISCQIPVAHFQTNSRRRPPKNLLYPRRTKLPPDPGVNQFL 60

Query: 61  KNTTSGSGP--SFTDFTSSEKIEFPEEEHDDHEEADTENYFV-DDKDGEIIWDSDEIEAI 120
           K  TSG  P  SF D  SSEKI  PEEE D+ EE   +NYF  DD D +++WDS+EIEAI
Sbjct: 61  KKRTSGPQPDTSFPDLISSEKIGLPEEELDEIEETAADNYFANDDNDSDVVWDSEEIEAI 120

Query: 121 SSLFQGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRARTGVPSRASLSKQVYKRPD 180
           +SLF+GRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIR RT V SRA +SKQVYKRPD
Sbjct: 121 TSLFRGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRPRTAVSSRALMSKQVYKRPD 180

Query: 181 FLIGLARAIRDLSREENVSKVLNRWAPFLLKGSLSLTIRELGHMGLADRALQSFCWAQEQ 240
           FLIGLARAIRDL  EEN+SKVLNRWAPFL KGSLSLTI+ELGHMGLADRAL++FCW QEQ
Sbjct: 181 FLIGLARAIRDLKPEENMSKVLNRWAPFLQKGSLSLTIKELGHMGLADRALKTFCWVQEQ 240

Query: 241 PRLFPDDRVLASTVEVLSRNHELKVPLNLEEFTRLASRGVLEAMIRGFIKGGSLNLAWKL 300
           PRL+PDDRVLASTVEVL+RNHELK+P NL+EFT+LASRGVLEAM+RGFIKGG L+LAWKL
Sbjct: 241 PRLYPDDRVLASTVEVLARNHELKIPFNLDEFTKLASRGVLEAMMRGFIKGGRLSLAWKL 300

Query: 301 LVVAKKGNRMLDPSVYVKLILELGKNPDKNMLVLTLLDELGQREALKLNQQDTTAIMKVC 360
           LV AK G RMLDPSVYVKLILE+GKNPDKNMLVL LLDELGQREAL LNQQDT+AI+KV 
Sbjct: 301 LVAAKNGKRMLDPSVYVKLILEIGKNPDKNMLVLALLDELGQREALNLNQQDTSAIIKVS 360

Query: 361 TRLGKFEIAERLYGWYVESVHEPSVVMYTALIHSRYSEKKYREALSVVWEMEAANCPFDL 420
           TRLGKFEIAERLY WYVES HEPSVVMYTAL+H+RYSE+KYREALSVVWEMEAANCPFDL
Sbjct: 361 TRLGKFEIAERLYSWYVESGHEPSVVMYTALVHNRYSERKYREALSVVWEMEAANCPFDL 420

Query: 421 PAYNVVIKLFVALGDLSRAARYFAKLKEAGFAPTYDIYRNLITIYLVSGRLAKCKEIYKE 480
           PAY+VV+KLFVALGDLSRA RYFAKLKEAGF PTY IYRNLITIYL +GRLAKCKEIYKE
Sbjct: 421 PAYSVVMKLFVALGDLSRAVRYFAKLKEAGFTPTYCIYRNLITIYLAAGRLAKCKEIYKE 480

Query: 481 AKNAGFIIDKQITSRLLQ 496
           A+NAG+++DKQITS LLQ
Sbjct: 481 AENAGYVMDKQITSMLLQ 498
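
The percentages in the Identity and Positives fields are the match counts divided by the alignment length, which includes gap columns. A minimal sketch of that arithmetic, using the counts reported for this hit (the function name `percent` is hypothetical):

```python
def percent(matches: int, alignment_length: int) -> float:
    """Share of alignment columns that match, as a percentage with two decimals."""
    return round(100.0 * matches / alignment_length, 2)


# Counts from the HSP header of this hit: 406 identical and 446 positive
# columns out of 498 aligned columns.
print(percent(406, 498))  # 81.53
print(percent(446, 498))  # 89.56
```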

BLAST of Moc08g33000 vs. NCBI nr
Match: XP_038893977.1 (pentatricopeptide repeat-containing protein At2g01860 [Benincasa hispida] >XP_038893978.1 pentatricopeptide repeat-containing protein At2g01860 [Benincasa hispida])

HSP 1 Score: 801.6 bits (2069), Expect = 3.9e-228
Identity = 407/495 (82.22%), Positives = 442/495 (89.29%), Query Frame = 0

Query: 1   MDCIFSSSIVSSIMVKGNGGISCQISMARFMANARRRLPKNLLNPRRTKLPPDPGVNQFL 60
           MD IFS++ VSSI+VKGNGGI CQ +MA F  N+RRRLPKNLL PRR KLPPDP VNQFL
Sbjct: 1   MDSIFSATSVSSILVKGNGGIGCQATMAHFKTNSRRRLPKNLLCPRRAKLPPDPAVNQFL 60

Query: 61  KNTTSGSGPSFTDFTSSEKIEFPEEEHDDHEEADTENYFVDDKDGEIIWDSDEIEAISSL 120
           KN TS   PS TD  SSE  + P+ E D+HEE    +Y    KD +++WDSDEIEAISSL
Sbjct: 61  KNKTSAPSPSLTDLISSEIFQLPKGEDDEHEEIHAYDY----KDTDVVWDSDEIEAISSL 120

Query: 121 FQGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRARTGVPSRASLSKQVYKRPDFLI 180
           FQGRIPQKPGKLNR+RPLPLPLPHKLRP GLP+PKIR R  V SRA LSKQVYKRPDFLI
Sbjct: 121 FQGRIPQKPGKLNRDRPLPLPLPHKLRPSGLPDPKIRPRIMVSSRALLSKQVYKRPDFLI 180

Query: 181 GLARAIRDLSREENVSKVLNRWAPFLLKGSLSLTIRELGHMGLADRALQSFCWAQEQPRL 240
           GLARAIRDLS EENVSKVLNRW PFL KGSLSLTI+ELGHMGL DRAL++F WAQEQPRL
Sbjct: 181 GLARAIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALKTFSWAQEQPRL 240

Query: 241 FPDDRVLASTVEVLSRNHELKVPLNLEEFTRLASRGVLEAMIRGFIKGGSLNLAWKLLVV 300
           FPDDRVLASTVEVL+RNHELKVPL+LEEFT+LASRGVLEAM+RGFIKGGSLNLAWKLLV 
Sbjct: 241 FPDDRVLASTVEVLARNHELKVPLDLEEFTKLASRGVLEAMVRGFIKGGSLNLAWKLLVA 300

Query: 301 AKKGNRMLDPSVYVKLILELGKNPDKNMLVLTLLDELGQREALKLNQQDTTAIMKVCTRL 360
           AKK  R+LDPSVYVKLILELGKNPDKN+LVLTLLDELGQREAL LNQQDTT I+KVCTRL
Sbjct: 301 AKKRKRLLDPSVYVKLILELGKNPDKNVLVLTLLDELGQREALTLNQQDTTTIIKVCTRL 360

Query: 361 GKFEIAERLYGWYVESVHEPSVVMYTALIHSRYSEKKYREALSVVWEMEAANCPFDLPAY 420
           GKFEIAE+LY WYVES HEPSVVMYTAL+HSRYS++KYREALS+VWEMEAANCPFDLPAY
Sbjct: 361 GKFEIAEKLYSWYVESGHEPSVVMYTALVHSRYSDRKYREALSLVWEMEAANCPFDLPAY 420

Query: 421 NVVIKLFVALGDLSRAARYFAKLKEAGFAPTYDIYRNLITIYLVSGRLAKCKEIYKEAKN 480
           +V+IKLFV LGDLSRA RYFAKLKEAGFAPTYD+YR +ITIYLVSGRLAKCKEIYKEA+N
Sbjct: 421 SVMIKLFVTLGDLSRAVRYFAKLKEAGFAPTYDVYRKMITIYLVSGRLAKCKEIYKEAEN 480

Query: 481 AGFIIDKQITSRLLQ 496
           AGFI+DKQITS LLQ
Sbjct: 481 AGFIMDKQITSMLLQ 491

BLAST of Moc08g33000 vs. NCBI nr
Match: XP_022951807.1 (pentatricopeptide repeat-containing protein At2g01860 isoform X1 [Cucurbita moschata])

HSP 1 Score: 801.2 bits (2068), Expect = 5.1e-228
Identity = 406/498 (81.53%), Positives = 445/498 (89.36%), Query Frame = 0

Query: 1   MDCIFSSSIVSSIMVKGNGGISCQISMARFMANARRRLPKNLLNPRRTKLPPDPGVNQFL 60
           MD +FS++ +SSI+VK NGGISCQI +A F  N+RRR PKNLL PRRTKLPPDPGVNQFL
Sbjct: 1   MDSLFSTTTISSILVKRNGGISCQIPVAHFQTNSRRRPPKNLLYPRRTKLPPDPGVNQFL 60

Query: 61  KNTTSGSGP--SFTDFTSSEKIEFPEEEHDDHEEADTENYFV-DDKDGEIIWDSDEIEAI 120
           K  TSG  P  SF D  SSEKI  PEEE D+ EE   +NYF  DD D +++WDS+EIEAI
Sbjct: 61  KKRTSGPQPDTSFPDLISSEKIGLPEEELDEIEETAADNYFANDDNDSDVVWDSEEIEAI 120

Query: 121 SSLFQGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRARTGVPSRASLSKQVYKRPD 180
           +SLF+GRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIR RT V SRA +SKQVYKRPD
Sbjct: 121 TSLFRGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRPRTAVSSRALMSKQVYKRPD 180

Query: 181 FLIGLARAIRDLSREENVSKVLNRWAPFLLKGSLSLTIRELGHMGLADRALQSFCWAQEQ 240
           FLIGLARAIRDL  EENVSKVLNRWAPFL KGSLSLTI+ELGHMGLADRAL++FCW QEQ
Sbjct: 181 FLIGLARAIRDLKPEENVSKVLNRWAPFLQKGSLSLTIKELGHMGLADRALKTFCWVQEQ 240

Query: 241 PRLFPDDRVLASTVEVLSRNHELKVPLNLEEFTRLASRGVLEAMIRGFIKGGSLNLAWKL 300
           PRL+PDDRVLASTVEVL+RNHELK+P NL+EFT+LASRGVLEAM+RGFIKGG L+LAWKL
Sbjct: 241 PRLYPDDRVLASTVEVLARNHELKIPFNLDEFTKLASRGVLEAMMRGFIKGGRLSLAWKL 300

Query: 301 LVVAKKGNRMLDPSVYVKLILELGKNPDKNMLVLTLLDELGQREALKLNQQDTTAIMKVC 360
           LV AK G RMLDPSVYVKLILE+GKNPDKNMLVL LLDELGQREAL LNQQDT+AI+KV 
Sbjct: 301 LVAAKNGKRMLDPSVYVKLILEIGKNPDKNMLVLALLDELGQREALNLNQQDTSAIIKVS 360

Query: 361 TRLGKFEIAERLYGWYVESVHEPSVVMYTALIHSRYSEKKYREALSVVWEMEAANCPFDL 420
           TRLGKFEIAERLY WYVES HEPSVVMYTAL+H+RYSE+KYREALSVVWEMEAAN PFDL
Sbjct: 361 TRLGKFEIAERLYSWYVESGHEPSVVMYTALVHNRYSERKYREALSVVWEMEAANSPFDL 420

Query: 421 PAYNVVIKLFVALGDLSRAARYFAKLKEAGFAPTYDIYRNLITIYLVSGRLAKCKEIYKE 480
           PAY+VV+KLFVALGDLSRA RYFAKLKEAGF PTY IYRNLITIYL +GRLAKCKEIYKE
Sbjct: 421 PAYSVVMKLFVALGDLSRAVRYFAKLKEAGFTPTYCIYRNLITIYLAAGRLAKCKEIYKE 480

Query: 481 AKNAGFIIDKQITSRLLQ 496
           A+NAG+++DKQITS LLQ
Sbjct: 481 AENAGYVMDKQITSMLLQ 498

BLAST of Moc08g33000 vs. NCBI nr
Match: KAG6585791.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 800.4 bits (2066), Expect = 8.7e-228
Identity = 405/498 (81.33%), Positives = 445/498 (89.36%), Query Frame = 0

Query: 1   MDCIFSSSIVSSIMVKGNGGISCQISMARFMANARRRLPKNLLNPRRTKLPPDPGVNQFL 60
           MD +FS++ +SSI+VK NGGISCQI +A F  N+RRR PKNLL PRRTKLPPDPGVNQFL
Sbjct: 1   MDSLFSTTTISSILVKRNGGISCQIPVAHFQTNSRRRPPKNLLYPRRTKLPPDPGVNQFL 60

Query: 61  KNTTSGSGP--SFTDFTSSEKIEFPEEEHDDHEEADTENYFV-DDKDGEIIWDSDEIEAI 120
           K  TSG  P  SF D  SSEKI  PEEE D+ EE   +NYF  DD D +++WDS+EIEAI
Sbjct: 61  KKRTSGPQPDTSFPDLISSEKIGLPEEELDEIEETAADNYFANDDNDSDVVWDSEEIEAI 120

Query: 121 SSLFQGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRARTGVPSRASLSKQVYKRPD 180
           +SLF+GRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPK+R RT V SRA +SKQVYKRPD
Sbjct: 121 TSLFRGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKVRPRTAVSSRALMSKQVYKRPD 180

Query: 181 FLIGLARAIRDLSREENVSKVLNRWAPFLLKGSLSLTIRELGHMGLADRALQSFCWAQEQ 240
           FLIGLARAIRDL  EENVSKVLNRWAPFL KGSLSLTI+ELGHMGLADRAL++FCW QEQ
Sbjct: 181 FLIGLARAIRDLKPEENVSKVLNRWAPFLQKGSLSLTIKELGHMGLADRALKTFCWVQEQ 240

Query: 241 PRLFPDDRVLASTVEVLSRNHELKVPLNLEEFTRLASRGVLEAMIRGFIKGGSLNLAWKL 300
           PRL+PDDRVLASTVEVL+RNHELK+P NL+EFT+LASRGVLEAM+RGFIKGG L+LAWKL
Sbjct: 241 PRLYPDDRVLASTVEVLARNHELKIPFNLDEFTKLASRGVLEAMMRGFIKGGRLSLAWKL 300

Query: 301 LVVAKKGNRMLDPSVYVKLILELGKNPDKNMLVLTLLDELGQREALKLNQQDTTAIMKVC 360
           LV AK G RMLDPSVYVKLILE+GKNPDKNMLVL LLDELGQREAL LNQQDT+AI+KV 
Sbjct: 301 LVAAKNGKRMLDPSVYVKLILEIGKNPDKNMLVLALLDELGQREALNLNQQDTSAIIKVS 360

Query: 361 TRLGKFEIAERLYGWYVESVHEPSVVMYTALIHSRYSEKKYREALSVVWEMEAANCPFDL 420
           TRLGKFEIAERLY WYVES HEPSVVMYTAL+H+RYSE+KYREALSVVWEMEAANCPFDL
Sbjct: 361 TRLGKFEIAERLYSWYVESGHEPSVVMYTALVHNRYSERKYREALSVVWEMEAANCPFDL 420

Query: 421 PAYNVVIKLFVALGDLSRAARYFAKLKEAGFAPTYDIYRNLITIYLVSGRLAKCKEIYKE 480
           PAY+VV+KLFVALGDLSRA RYFAKLKEAGF PTY IYRNLITIYL +GRLAK KEIYKE
Sbjct: 421 PAYSVVMKLFVALGDLSRAVRYFAKLKEAGFTPTYCIYRNLITIYLAAGRLAKWKEIYKE 480

Query: 481 AKNAGFIIDKQITSRLLQ 496
           A+NAG+++DKQITS LLQ
Sbjct: 481 AENAGYVMDKQITSMLLQ 498

BLAST of Moc08g33000 vs. ExPASy Swiss-Prot
Match: Q5XET4 (Pentatricopeptide repeat-containing protein At2g01860 OS=Arabidopsis thaliana OX=3702 GN=EMB975 PE=2 SV=1)

HSP 1 Score: 511.1 bits (1315), Expect = 1.4e-143
Identity = 279/473 (58.99%), Positives = 349/473 (73.78%), Query Frame = 0

Query: 33  NARRRLPKNLLNPRRTKLPPDPGVNQFLKNTTSGSGPSFTDFTSSEKIEFPEEEHDDHEE 92
           N  ++L KNL NPRRTKLPPD GVN FL+                 KIE P    DD E+
Sbjct: 36  NHSKKLTKNLRNPRRTKLPPDFGVNLFLR---------------KPKIE-PLVIDDDDEQ 95

Query: 93  ADTENYFVDDKDGEIIWDSDEIEAISSLFQGRIPQKPGKLNRERPLPLPLPHKLRPPGLP 152
                  V+D D  ++W+ +EIEAISSLFQ RIPQKP K +R RPLPLP PHKLRP GLP
Sbjct: 96  VQES---VNDDDDAVVWEPEEIEAISSLFQKRIPQKPDKPSRVRPLPLPQPHKLRPLGLP 155

Query: 153 NPK---IRARTGVPSRASLSKQVYKRPDFLIGLARAIRDL-SREENVSKVLNRWAPFLLK 212
            PK   IR+    P+ +S+SKQVYK P FLIGLAR I+ L S + +VS VLN+W  FL K
Sbjct: 156 TPKKNIIRS----PALSSVSKQVYKDPSFLIGLAREIKSLPSSDADVSLVLNKWVSFLRK 215

Query: 213 GSLSLTIRELGHMGLADRALQSFCWAQEQPRLFPDDRVLASTVEVLSRNHELKVPLNLEE 272
           GSLS TIRELGHMGL +RALQ++ WA++   L PD+R+LAST++VL+++HELK+   L+ 
Sbjct: 216 GSLSTTIRELGHMGLPERALQTYHWAEKHSHLVPDNRILASTIQVLAKHHELKL---LKF 275

Query: 273 FTRLASRGVLEAMIRGFIKGGSLNLAWKLLVVAKKGNRMLDPSVYVKLILELGKNPDKNM 332
              LAS+ V+EAMI+G I+GG LNLA KL++++K  NR+LD SVYVK+ILE+ KNPDK  
Sbjct: 276 DNSLASKNVIEAMIKGCIEGGWLNLARKLILISKSNNRILDSSVYVKMILEIAKNPDKYH 335

Query: 333 LVLTLLDELGQREALKLNQQDTTAIMKVCTRLGKFEIAERLYGWYVESVHEPSVVMYTAL 392
           LV+ LL+EL +RE LKL+QQD T+IMK+C +LG+FE+ E L+ W+  S  EPSVVMYT +
Sbjct: 336 LVVALLEELKKREDLKLSQQDCTSIMKICVKLGEFELVESLFDWFKASNREPSVVMYTTM 395

Query: 393 IHSRYSEKKYREALSVVWEMEAANCPFDLPAYNVVIKLFVALGDLSRAARYFAKLKEAGF 452
           IHSRYSE+KYREA+SVVWEME +NC  DLPAY VVIKLFVAL DL RA RY++KLKEAGF
Sbjct: 396 IHSRYSEQKYREAMSVVWEMEESNCLLDLPAYRVVIKLFVALDDLGRAMRYYSKLKEAGF 455

Query: 453 APTYDIYRNLITIYLVSGRLAKCKEIYKEAKNAGFIIDKQITSRLLQCASRKM 502
           +PTYDIYR++I++Y  SGRL KCKEI KE ++AG  +DK  + RLLQ   + M
Sbjct: 456 SPTYDIYRDMISVYTASGRLTKCKEICKEVEDAGLRLDKDTSFRLLQLEKQTM 482

BLAST of Moc08g33000 vs. ExPASy Swiss-Prot
Match: O64624 (Pentatricopeptide repeat-containing protein At2g18940, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At2g18940 PE=2 SV=1)

HSP 1 Score: 75.9 bits (185), Expect = 1.5e-12
Identity = 71/304 (23.36%), Positives = 124/304 (40.79%), Query Frame = 0

Query: 206 LLKGSLSLTIRELGHMGLADRALQSFCW---AQEQPRLFPDDRVLASTVEVLSRNHELKV 265
           LL+  L   ++ L   G  +RA+  F W   +     L  D +V+   V +L R  +  V
Sbjct: 134 LLRTDLVSLVKGLDDSGHWERAVFLFEWLVLSSNSGALKLDHQVIEIFVRILGRESQYSV 193

Query: 266 ------PLNLEEFTRLASRGVLEAMIRGFIKGGSLNLAWKLLVVAKKGNRMLDPSVYVKL 325
                  + L+E+  L        ++  + + G    A  L    K+         Y  +
Sbjct: 194 AAKLLDKIPLQEY--LLDVRAYTTILHAYSRTGKYEKAIDLFERMKEMGPSPTLVTYNVI 253

Query: 326 ILELGKNPDKNMLVLTLLDELGQREALKLNQQDTTAIMKVCTRLGKFEIAERLYGWYVES 385
           +   GK       +L +LDE+ + + LK ++   + ++  C R G    A+  +      
Sbjct: 254 LDVFGKMGRSWRKILGVLDEM-RSKGLKFDEFTCSTVLSACAREGLLREAKEFFAELKSC 313

Query: 386 VHEPSVVMYTALIHSRYSEKKYREALSVVWEMEAANCPFDLPAYNVVIKLFVALGDLSRA 445
            +EP  V Y AL+        Y EALSV+ EME  +CP D   YN ++  +V  G    A
Sbjct: 314 GYEPGTVTYNALLQVFGKAGVYTEALSVLKEMEENSCPADSVTYNELVAAYVRAGFSKEA 373

Query: 446 ARYFAKLKEAGFAPTYDIYRNLITIYLVSGRLAKCKEIYKEAKNAGFIIDKQITSRLLQC 501
           A     + + G  P    Y  +I  Y  +G+  +  +++   K AG + +    + +L  
Sbjct: 374 AGVIEMMTKKGVMPNAITYTTVIDAYGKAGKEDEALKLFYSMKEAGCVPNTCTYNAVLSL 433

BLAST of Moc08g33000 vs. ExPASy Swiss-Prot
Match: Q8GZ63 (Pentatricopeptide repeat-containing protein At5g25630 OS=Arabidopsis thaliana OX=3702 GN=At5g25630 PE=2 SV=2)

HSP 1 Score: 68.6 bits (166), Expect = 2.4e-10
Identity = 37/124 (29.84%), Positives = 62/124 (50.00%), Query Frame = 0

Query: 351 TAIMKVCTRLGKFEIAERLYGWYVESVHEPSVVMYTALIHSRYSEKKYREALSVVWEMEA 410
           T +M V    G+   A+ ++    E+ H PS++ YT L+ +   +K+Y    S+V E+E 
Sbjct: 49  TKLMNVLIERGRPHEAQTVFKTLAETGHRPSLISYTTLLAAMTVQKQYGSISSIVSEVEQ 108

Query: 411 ANCPFDLPAYNVVIKLFVALGDLSRAARYFAKLKEAGFAPTYDIYRNLITIYLVSGRLAK 470
           +    D   +N VI  F   G++  A +   K+KE G  PT   Y  LI  Y ++G+  +
Sbjct: 109 SGTKLDSIFFNAVINAFSESGNMEDAVQALLKMKELGLNPTTSTYNTLIKGYGIAGKPER 168

Query: 471 CKEI 475
             E+
Sbjct: 169 SSEL 172

BLAST of Moc08g33000 vs. ExPASy Swiss-Prot
Match: Q9S7Q2 (Pentatricopeptide repeat-containing protein At1g74850, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PTAC2 PE=2 SV=1)

HSP 1 Score: 66.6 bits (161), Expect = 9.0e-10
Identity = 73/345 (21.16%), Positives = 150/345 (43.48%), Query Frame = 0

Query: 147 RPPGLPNPKIRARTG--VPSRASLSKQVYKRPDFLIGLARAIRDLSREENVSKVLNRWAP 206
           R P   + KI+A+T   V    S+S +  K    +  L   +  L    ++++ L+ +  
Sbjct: 42  RKPCSFSGKIKAKTKDLVLGNPSVSVEKGKYSYDVESLINKLSSLPPRGSIARCLDIFKN 101

Query: 207 FLLKGSLSLTIRELGHMGLADRALQSFCWAQEQPRLFPDDRVLASTVEVLSRNHELKVPL 266
            L     +L  +E    G   R+L+ F + Q Q    P++ +    + +L R  E  +  
Sbjct: 102 KLSLNDFALVFKEFAGRGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGR--EGLLDK 161

Query: 267 NLEEFTRLASRGV------LEAMIRGFIKGGSLNLAWKLLVVAKKGNRMLDPSV--YVKL 326
            LE F  + S+GV        A+I  + + G    + +LL   K  N  + PS+  Y  +
Sbjct: 162 CLEVFDEMPSQGVSRSVFSYTALINAYGRNGRYETSLELLDRMK--NEKISPSILTYNTV 221

Query: 327 ILELGKNPDKNMLVLTLLDELGQREALKLNQQDTTAIMKVCTRLGKFEIAERLYGWYVES 386
           I    +       +L L  E+ + E ++ +      ++  C   G  + AE ++    + 
Sbjct: 222 INACARGGLDWEGLLGLFAEM-RHEGIQPDIVTYNTLLSACAIRGLGDEAEMVFRTMNDG 281

Query: 387 VHEPSVVMYTALIHSRYSEKKYREALSVVWEMEAANCPFDLPAYNVVIKLFVALGDLSRA 446
              P +  Y+ L+ +    ++  +   ++ EM +     D+ +YNV+++ +   G +  A
Sbjct: 282 GIVPDLTTYSHLVETFGKLRRLEKVCDLLGEMASGGSLPDITSYNVLLEAYAKSGSIKEA 341

Query: 447 ARYFAKLKEAGFAPTYDIYRNLITIYLVSGRLAKCKEIYKEAKNA 482
              F +++ AG  P  + Y  L+ ++  SGR    ++++ E K++
Sbjct: 342 MGVFHQMQAAGCTPNANTYSVLLNLFGQSGRYDDVRQLFLEMKSS 381

BLAST of Moc08g33000 vs. ExPASy Swiss-Prot
Match: Q8L6Y7 (Pentatricopeptide repeat-containing protein At2g38420, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At2g38420 PE=2 SV=2)

HSP 1 Score: 58.5 bits (140), Expect = 2.5e-07
Identity = 62/275 (22.55%), Positives = 120/275 (43.64%), Query Frame = 0

Query: 215 IRELGHMGLADRALQSFCWAQEQPRLFPDDRVLASTVEVLSRNHEL--KVPLNLEEFTRL 274
           I   G  G  + A++ F +     R  P    L + + VL R  +    VP  L +  R+
Sbjct: 115 IAAYGFSGRIEEAIEVF-FKIPNFRCVPSAYTLNALLLVLVRKRQSLELVPEILVKACRM 174

Query: 275 ASR---GVLEAMIRGFIKGGSLNLAWKLLVVAKKGNRMLDPSVYVKLILELGKNPDKNML 334
             R        +I    + G ++ A +L+    + + ++DP +Y +L+  + K+ D +  
Sbjct: 175 GVRLEESTFGILIDALCRIGEVDCATELVRYMSQDSVIVDPRLYSRLLSSVCKHKDSSCF 234

Query: 335 -VLTLLDELGQREALKLNQQDTTAIMKVCTRLGK-FEIAERLYGWYVESVHEPSVVMYTA 394
            V+  L++L ++       +D T +M+     G+  E+   L     + V EP +V YT 
Sbjct: 235 DVIGYLEDL-RKTRFSPGLRDYTVVMRFLVEGGRGKEVVSVLNQMKCDRV-EPDLVCYTI 294

Query: 395 LIHSRYSEKKYREALSVVWEMEAANCPFDLPAYNVVIKLFVALGDLSRAARYFAKLKEAG 454
           ++    +++ Y +A  +  E+       D+  YNV I       D+  A +  + + + G
Sbjct: 295 VLQGVIADEDYPKADKLFDELLLLGLAPDVYTYNVYINGLCKQNDIEGALKMMSSMNKLG 354

Query: 455 FAPTYDIYRNLITIYLVSGRLAKCKEIYKEAKNAG 483
             P    Y  LI   + +G L++ K ++KE +  G
Sbjct: 355 SEPNVVTYNILIKALVKAGDLSRAKTLWKEMETNG 386

BLAST of Moc08g33000 vs. ExPASy TrEMBL
Match: A0A6J1DI37 (pentatricopeptide repeat-containing protein At2g01860 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111020698 PE=4 SV=1)

HSP 1 Score: 989.9 bits (2558), Expect = 3.8e-285
Identity = 504/504 (100.00%), Positives = 504/504 (100.00%), Query Frame = 0

Query: 1   MDCIFSSSIVSSIMVKGNGGISCQISMARFMANARRRLPKNLLNPRRTKLPPDPGVNQFL 60
           MDCIFSSSIVSSIMVKGNGGISCQISMARFMANARRRLPKNLLNPRRTKLPPDPGVNQFL
Sbjct: 1   MDCIFSSSIVSSIMVKGNGGISCQISMARFMANARRRLPKNLLNPRRTKLPPDPGVNQFL 60

Query: 61  KNTTSGSGPSFTDFTSSEKIEFPEEEHDDHEEADTENYFVDDKDGEIIWDSDEIEAISSL 120
           KNTTSGSGPSFTDFTSSEKIEFPEEEHDDHEEADTENYFVDDKDGEIIWDSDEIEAISSL
Sbjct: 61  KNTTSGSGPSFTDFTSSEKIEFPEEEHDDHEEADTENYFVDDKDGEIIWDSDEIEAISSL 120

Query: 121 FQGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRARTGVPSRASLSKQVYKRPDFLI 180
           FQGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRARTGVPSRASLSKQVYKRPDFLI
Sbjct: 121 FQGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRARTGVPSRASLSKQVYKRPDFLI 180

Query: 181 GLARAIRDLSREENVSKVLNRWAPFLLKGSLSLTIRELGHMGLADRALQSFCWAQEQPRL 240
           GLARAIRDLSREENVSKVLNRWAPFLLKGSLSLTIRELGHMGLADRALQSFCWAQEQPRL
Sbjct: 181 GLARAIRDLSREENVSKVLNRWAPFLLKGSLSLTIRELGHMGLADRALQSFCWAQEQPRL 240

Query: 241 FPDDRVLASTVEVLSRNHELKVPLNLEEFTRLASRGVLEAMIRGFIKGGSLNLAWKLLVV 300
           FPDDRVLASTVEVLSRNHELKVPLNLEEFTRLASRGVLEAMIRGFIKGGSLNLAWKLLVV
Sbjct: 241 FPDDRVLASTVEVLSRNHELKVPLNLEEFTRLASRGVLEAMIRGFIKGGSLNLAWKLLVV 300

Query: 301 AKKGNRMLDPSVYVKLILELGKNPDKNMLVLTLLDELGQREALKLNQQDTTAIMKVCTRL 360
           AKKGNRMLDPSVYVKLILELGKNPDKNMLVLTLLDELGQREALKLNQQDTTAIMKVCTRL
Sbjct: 301 AKKGNRMLDPSVYVKLILELGKNPDKNMLVLTLLDELGQREALKLNQQDTTAIMKVCTRL 360

Query: 361 GKFEIAERLYGWYVESVHEPSVVMYTALIHSRYSEKKYREALSVVWEMEAANCPFDLPAY 420
           GKFEIAERLYGWYVESVHEPSVVMYTALIHSRYSEKKYREALSVVWEMEAANCPFDLPAY
Sbjct: 361 GKFEIAERLYGWYVESVHEPSVVMYTALIHSRYSEKKYREALSVVWEMEAANCPFDLPAY 420

Query: 421 NVVIKLFVALGDLSRAARYFAKLKEAGFAPTYDIYRNLITIYLVSGRLAKCKEIYKEAKN 480
           NVVIKLFVALGDLSRAARYFAKLKEAGFAPTYDIYRNLITIYLVSGRLAKCKEIYKEAKN
Sbjct: 421 NVVIKLFVALGDLSRAARYFAKLKEAGFAPTYDIYRNLITIYLVSGRLAKCKEIYKEAKN 480

Query: 481 AGFIIDKQITSRLLQCASRKMIRS 505
           AGFIIDKQITSRLLQCASRKMIRS
Sbjct: 481 AGFIIDKQITSRLLQCASRKMIRS 504

BLAST of Moc08g33000 vs. ExPASy TrEMBL
Match: A0A6J1GIP2 (pentatricopeptide repeat-containing protein At2g01860 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111454537 PE=4 SV=1)

HSP 1 Score: 801.2 bits (2068), Expect = 2.5e-228
Identity = 406/498 (81.53%), Positives = 445/498 (89.36%), Query Frame = 0

Query: 1   MDCIFSSSIVSSIMVKGNGGISCQISMARFMANARRRLPKNLLNPRRTKLPPDPGVNQFL 60
           MD +FS++ +SSI+VK NGGISCQI +A F  N+RRR PKNLL PRRTKLPPDPGVNQFL
Sbjct: 1   MDSLFSTTTISSILVKRNGGISCQIPVAHFQTNSRRRPPKNLLYPRRTKLPPDPGVNQFL 60

Query: 61  KNTTSGSGP--SFTDFTSSEKIEFPEEEHDDHEEADTENYFV-DDKDGEIIWDSDEIEAI 120
           K  TSG  P  SF D  SSEKI  PEEE D+ EE   +NYF  DD D +++WDS+EIEAI
Sbjct: 61  KKRTSGPQPDTSFPDLISSEKIGLPEEELDEIEETAADNYFANDDNDSDVVWDSEEIEAI 120

Query: 121 SSLFQGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRARTGVPSRASLSKQVYKRPD 180
           +SLF+GRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIR RT V SRA +SKQVYKRPD
Sbjct: 121 TSLFRGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRPRTAVSSRALMSKQVYKRPD 180

Query: 181 FLIGLARAIRDLSREENVSKVLNRWAPFLLKGSLSLTIRELGHMGLADRALQSFCWAQEQ 240
           FLIGLARAIRDL  EENVSKVLNRWAPFL KGSLSLTI+ELGHMGLADRAL++FCW QEQ
Sbjct: 181 FLIGLARAIRDLKPEENVSKVLNRWAPFLQKGSLSLTIKELGHMGLADRALKTFCWVQEQ 240

Query: 241 PRLFPDDRVLASTVEVLSRNHELKVPLNLEEFTRLASRGVLEAMIRGFIKGGSLNLAWKL 300
           PRL+PDDRVLASTVEVL+RNHELK+P NL+EFT+LASRGVLEAM+RGFIKGG L+LAWKL
Sbjct: 241 PRLYPDDRVLASTVEVLARNHELKIPFNLDEFTKLASRGVLEAMMRGFIKGGRLSLAWKL 300

Query: 301 LVVAKKGNRMLDPSVYVKLILELGKNPDKNMLVLTLLDELGQREALKLNQQDTTAIMKVC 360
           LV AK G RMLDPSVYVKLILE+GKNPDKNMLVL LLDELGQREAL LNQQDT+AI+KV 
Sbjct: 301 LVAAKNGKRMLDPSVYVKLILEIGKNPDKNMLVLALLDELGQREALNLNQQDTSAIIKVS 360

Query: 361 TRLGKFEIAERLYGWYVESVHEPSVVMYTALIHSRYSEKKYREALSVVWEMEAANCPFDL 420
           TRLGKFEIAERLY WYVES HEPSVVMYTAL+H+RYSE+KYREALSVVWEMEAAN PFDL
Sbjct: 361 TRLGKFEIAERLYSWYVESGHEPSVVMYTALVHNRYSERKYREALSVVWEMEAANSPFDL 420

Query: 421 PAYNVVIKLFVALGDLSRAARYFAKLKEAGFAPTYDIYRNLITIYLVSGRLAKCKEIYKE 480
           PAY+VV+KLFVALGDLSRA RYFAKLKEAGF PTY IYRNLITIYL +GRLAKCKEIYKE
Sbjct: 421 PAYSVVMKLFVALGDLSRAVRYFAKLKEAGFTPTYCIYRNLITIYLAAGRLAKCKEIYKE 480

Query: 481 AKNAGFIIDKQITSRLLQ 496
           A+NAG+++DKQITS LLQ
Sbjct: 481 AENAGYVMDKQITSMLLQ 498

BLAST of Moc08g33000 vs. ExPASy TrEMBL
Match: A0A6J1KK31 (pentatricopeptide repeat-containing protein At2g01860 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111495984 PE=4 SV=1)

HSP 1 Score: 795.4 bits (2053), Expect = 1.4e-226
Identity = 403/498 (80.92%), Positives = 444/498 (89.16%), Query Frame = 0

Query: 1   MDCIFSSSIVSSIMVKGNGGISCQISMARFMANARRRLPKNLLNPRRTKLPPDPGVNQFL 60
           MD +FS++ VSSI+VK NGGISCQI MA F+ N++RR PKNLL PRRTKLPPDPGVNQFL
Sbjct: 1   MDSLFSTTAVSSILVKRNGGISCQIPMAHFLTNSKRRPPKNLLYPRRTKLPPDPGVNQFL 60

Query: 61  KNTTSGSGP--SFTDFTSSEKIEFPEEEHDDHEEADTENYFV-DDKDGEIIWDSDEIEAI 120
           K  TS   P  S+ D   SEKI  PEEE D+ EE   +NYF  DD D +I+WD +EIEAI
Sbjct: 61  KKRTSDPHPDTSYPDLIPSEKIGLPEEELDELEETAADNYFANDDNDSDIVWDPEEIEAI 120

Query: 121 SSLFQGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRARTGVPSRASLSKQVYKRPD 180
           +SLF+GRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIR RT V SRA +SKQVYKRPD
Sbjct: 121 TSLFRGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRPRTAVSSRALMSKQVYKRPD 180

Query: 181 FLIGLARAIRDLSREENVSKVLNRWAPFLLKGSLSLTIRELGHMGLADRALQSFCWAQEQ 240
           FLIGLARAIRDL  EENVSKVLNRWAPFL KGSLSLTI+ELGHMGLADRAL++FCW QEQ
Sbjct: 181 FLIGLARAIRDLQPEENVSKVLNRWAPFLQKGSLSLTIKELGHMGLADRALKTFCWVQEQ 240

Query: 241 PRLFPDDRVLASTVEVLSRNHELKVPLNLEEFTRLASRGVLEAMIRGFIKGGSLNLAWKL 300
           PRL+PDDRVLASTVEVL+RNHELK+P NL+EFT+LASRGVLEAM+RGFIKGG L+LAWKL
Sbjct: 241 PRLYPDDRVLASTVEVLARNHELKIPFNLDEFTKLASRGVLEAMMRGFIKGGRLSLAWKL 300

Query: 301 LVVAKKGNRMLDPSVYVKLILELGKNPDKNMLVLTLLDELGQREALKLNQQDTTAIMKVC 360
           LV AK G RMLDPSV+VKLILE+GKNPDKNMLVL LLDELGQREAL L+QQDT+AI+KV 
Sbjct: 301 LVAAKNGKRMLDPSVHVKLILEIGKNPDKNMLVLALLDELGQREALNLSQQDTSAIIKVS 360

Query: 361 TRLGKFEIAERLYGWYVESVHEPSVVMYTALIHSRYSEKKYREALSVVWEMEAANCPFDL 420
           TRLGKFEIAE+LY WYVES HEPSVVMYTAL+H+RYSE+KYREALSVVWEMEAANCPFDL
Sbjct: 361 TRLGKFEIAEKLYSWYVESGHEPSVVMYTALVHNRYSERKYREALSVVWEMEAANCPFDL 420

Query: 421 PAYNVVIKLFVALGDLSRAARYFAKLKEAGFAPTYDIYRNLITIYLVSGRLAKCKEIYKE 480
           PAY+VVIKLFVALGDLSRA RYFAKLKEAGF PTY IYRNLITIYL +GRLAKCKEIYKE
Sbjct: 421 PAYSVVIKLFVALGDLSRAVRYFAKLKEAGFTPTYCIYRNLITIYLAAGRLAKCKEIYKE 480

Query: 481 AKNAGFIIDKQITSRLLQ 496
           A+NAG+++DKQITS LLQ
Sbjct: 481 AENAGYVMDKQITSMLLQ 498

BLAST of Moc08g33000 vs. ExPASy TrEMBL
Match: A0A1S3CGD0 (pentatricopeptide repeat-containing protein At2g01860 OS=Cucumis melo OX=3656 GN=LOC103500594 PE=4 SV=1)

HSP 1 Score: 785.8 bits (2028), Expect = 1.1e-223
Identity = 403/495 (81.41%), Positives = 438/495 (88.48%), Query Frame = 0

Query: 1   MDCIFSSSIVSSIMVKGNGGISCQISMARFMANARRRLPKNLLNPRRTKLPPDPGVNQFL 60
           M  I S++ VSSI+VKGNGGI CQI+M  F AN+RRR PKNLL PRR KLPPDP VNQFL
Sbjct: 1   MHSIVSATSVSSILVKGNGGIGCQITMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFL 60

Query: 61  KNTTSGSGPSFTDFTSSEKIEFPEEEHDDHEEADTENYFVDDKDGEIIWDSDEIEAISSL 120
            N TS   PSFTD  SS+  +      D+HEE    +Y    KD +++WDSDEIEAISSL
Sbjct: 61  NNKTSAPSPSFTDLISSKIFQ------DEHEEIHAYDY---TKDTDVVWDSDEIEAISSL 120

Query: 121 FQGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRARTGVPSRASLSKQVYKRPDFLI 180
           FQGRIPQKPGKLNRERPLPLPLPHKLRPP LPNPKIR  T V SRA LSK+VYKRPDFLI
Sbjct: 121 FQGRIPQKPGKLNRERPLPLPLPHKLRPPRLPNPKIRPTTTVSSRALLSKKVYKRPDFLI 180

Query: 181 GLARAIRDLSREENVSKVLNRWAPFLLKGSLSLTIRELGHMGLADRALQSFCWAQEQPRL 240
           GLARAIRDLS EENVSKVLNRW PFL KGSLSLTI+ELGHMGL DRAL++FCW QEQ RL
Sbjct: 181 GLARAIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALKTFCWVQEQRRL 240

Query: 241 FPDDRVLASTVEVLSRNHELKVPLNLEEFTRLASRGVLEAMIRGFIKGGSLNLAWKLLVV 300
           FPDDRVLASTVEVLSRNHELKVP+NLEEFT+LASRGVLEAM+RGFIKGGSLNLAWKLLV 
Sbjct: 241 FPDDRVLASTVEVLSRNHELKVPVNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVA 300

Query: 301 AKKGNRMLDPSVYVKLILELGKNPDKNMLVLTLLDELGQREALKLNQQDTTAIMKVCTRL 360
           AKKG RMLDPSVYVKLILELGKNPDKN+LVLTLL+ELGQREALKLNQQD+T I+KVCTRL
Sbjct: 301 AKKGKRMLDPSVYVKLILELGKNPDKNVLVLTLLEELGQREALKLNQQDSTTIIKVCTRL 360

Query: 361 GKFEIAERLYGWYVESVHEPSVVMYTALIHSRYSEKKYREALSVVWEMEAANCPFDLPAY 420
            KFEIAE+LY WYVES HEPS+VMYTAL+HSRYS++KYREALS+VWEME+ANCPFDLPAY
Sbjct: 361 RKFEIAEKLYCWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEMESANCPFDLPAY 420

Query: 421 NVVIKLFVALGDLSRAARYFAKLKEAGFAPTYDIYRNLITIYLVSGRLAKCKEIYKEAKN 480
           NVVIKLFVALGDLSRA RYFAKLKEAGF+PTYD+YRN+ITIYLVSGRLAK KEIYKEA+N
Sbjct: 421 NVVIKLFVALGDLSRAVRYFAKLKEAGFSPTYDVYRNMITIYLVSGRLAKSKEIYKEAEN 480

Query: 481 AGFIIDKQITSRLLQ 496
           AGFI+DKQITS LLQ
Sbjct: 481 AGFIMDKQITSMLLQ 486
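
The Identity and Positives percentages in each HSP header are the matched counts divided by the alignment length (gap columns included), where positives are identical residues plus substitutions that score above zero in the substitution matrix. A minimal Python sketch, assuming the header layout shown on this page, that re-derives the percentages for the Cucumis melo hit (A0A1S3CGD0). The names `parse_hsp_stats` and `percent` are hypothetical helpers, not part of any BLAST tool, and the pattern also accepts the "Postives" spelling that this page uses.

```python
import re

# Matches the statistics line of an HSP header as laid out on this page.
# "Posi?tives" accepts both "Positives" and the page's "Postives" spelling.
HSP_STATS_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract match counts and reported percentages from an HSP header line."""
    m = HSP_STATS_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, ident_pct, pos, _pos_len, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


def percent(count, length):
    """Percentage of alignment columns, rounded to two decimals."""
    return round(100.0 * count / length, 2)


line = "Identity = 403/495 (81.41%), Positives = 438/495 (88.48%), Query Frame = 0"
stats = parse_hsp_stats(line)
identity_pct = percent(stats["identities"], stats["alignment_length"])
positives_pct = percent(stats["positives"], stats["alignment_length"])
print(identity_pct, positives_pct)
```

The recomputed values can be compared with the reported ones as a quick consistency check when hits are copied by hand from pages like this.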

BLAST of Moc08g33000 vs. ExPASy TrEMBL
Match: A0A0A0LVM0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G144300 PE=4 SV=1)

HSP 1 Score: 780.4 bits (2014), Expect = 4.5e-222
Identity = 398/495 (80.40%), Positives = 434/495 (87.68%), Query Frame = 0

Query: 1   MDCIFSSSIVSSIMVKGNGGISCQISMARFMANARRRLPKNLLNPRRTKLPPDPGVNQFL 60
           MD I S++ VSSI+VKGNGGI CQ +M  F AN+RRR PKNLL PRR KLPP+P VNQF 
Sbjct: 1   MDSIVSATSVSSILVKGNGGIGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPNPAVNQFF 60

Query: 61  KNTTSGSGPSFTDFTSSEKIEFPEEEHDDHEEADTENYFVDDKDGEIIWDSDEIEAISSL 120
            N TS   P FTD  SS+  +      D+HEE    +Y    KD +++WDSDEIEAISSL
Sbjct: 61  NNKTSAPSPPFTDLISSKIFQ------DEHEEIHAHDY---TKDTDVVWDSDEIEAISSL 120

Query: 121 FQGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRARTGVPSRASLSKQVYKRPDFLI 180
           FQGRIPQKPGKLNRERPLPLPLPHKLRPP LPNPKIR  T V SRA LSKQVYKRPDFLI
Sbjct: 121 FQGRIPQKPGKLNRERPLPLPLPHKLRPPRLPNPKIRPTTVVSSRALLSKQVYKRPDFLI 180

Query: 181 GLARAIRDLSREENVSKVLNRWAPFLLKGSLSLTIRELGHMGLADRALQSFCWAQEQPRL 240
           GLAR IRDLS EENVSKVLNRW PFL KGSLSLTI+ELGHMGL DRAL +FCWAQEQ RL
Sbjct: 181 GLAREIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALNTFCWAQEQHRL 240

Query: 241 FPDDRVLASTVEVLSRNHELKVPLNLEEFTRLASRGVLEAMIRGFIKGGSLNLAWKLLVV 300
           FPDDRVLASTVEVLSRNHELKV +NLEEFT+LASRGVLEAM+RGFI+GGSLNLAWKLLV 
Sbjct: 241 FPDDRVLASTVEVLSRNHELKVAVNLEEFTKLASRGVLEAMMRGFIRGGSLNLAWKLLVA 300

Query: 301 AKKGNRMLDPSVYVKLILELGKNPDKNMLVLTLLDELGQREALKLNQQDTTAIMKVCTRL 360
           AKKG RMLDPSVYVKLILELGKNPDKNMLVLTLL+ELGQREALKLNQQD T I+KVCTRL
Sbjct: 301 AKKGKRMLDPSVYVKLILELGKNPDKNMLVLTLLEELGQREALKLNQQDCTTIVKVCTRL 360

Query: 361 GKFEIAERLYGWYVESVHEPSVVMYTALIHSRYSEKKYREALSVVWEMEAANCPFDLPAY 420
           GKFEIAE+LY WYVES HEPS+VMYTAL+HSRYS++KYREALS+VWEME+ NCPFDLPAY
Sbjct: 361 GKFEIAEKLYSWYVESGHEPSIVMYTALVHSRYSDRKYREALSLVWEMESGNCPFDLPAY 420

Query: 421 NVVIKLFVALGDLSRAARYFAKLKEAGFAPTYDIYRNLITIYLVSGRLAKCKEIYKEAKN 480
           +VVIKLFVALGDLSRA RYFAKLKEAGF+PTY++YRN+ITIYLVSGRLAKCKEIYKEA+N
Sbjct: 421 SVVIKLFVALGDLSRAVRYFAKLKEAGFSPTYNVYRNMITIYLVSGRLAKCKEIYKEAEN 480

Query: 481 AGFIIDKQITSRLLQ 496
           AGF++DKQITS LLQ
Sbjct: 481 AGFMMDKQITSMLLQ 486

BLAST of Moc08g33000 vs. TAIR 10
Match: AT2G01860.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 511.1 bits (1315), Expect = 9.8e-145
Identity = 279/473 (58.99%), Positives = 349/473 (73.78%), Query Frame = 0

Query: 33  NARRRLPKNLLNPRRTKLPPDPGVNQFLKNTTSGSGPSFTDFTSSEKIEFPEEEHDDHEE 92
           N  ++L KNL NPRRTKLPPD GVN FL+                 KIE P    DD E+
Sbjct: 36  NHSKKLTKNLRNPRRTKLPPDFGVNLFLR---------------KPKIE-PLVIDDDDEQ 95

Query: 93  ADTENYFVDDKDGEIIWDSDEIEAISSLFQGRIPQKPGKLNRERPLPLPLPHKLRPPGLP 152
                  V+D D  ++W+ +EIEAISSLFQ RIPQKP K +R RPLPLP PHKLRP GLP
Sbjct: 96  VQES---VNDDDDAVVWEPEEIEAISSLFQKRIPQKPDKPSRVRPLPLPQPHKLRPLGLP 155

Query: 153 NPK---IRARTGVPSRASLSKQVYKRPDFLIGLARAIRDL-SREENVSKVLNRWAPFLLK 212
            PK   IR+    P+ +S+SKQVYK P FLIGLAR I+ L S + +VS VLN+W  FL K
Sbjct: 156 TPKKNIIRS----PALSSVSKQVYKDPSFLIGLAREIKSLPSSDADVSLVLNKWVSFLRK 215

Query: 213 GSLSLTIRELGHMGLADRALQSFCWAQEQPRLFPDDRVLASTVEVLSRNHELKVPLNLEE 272
           GSLS TIRELGHMGL +RALQ++ WA++   L PD+R+LAST++VL+++HELK+   L+ 
Sbjct: 216 GSLSTTIRELGHMGLPERALQTYHWAEKHSHLVPDNRILASTIQVLAKHHELKL---LKF 275

Query: 273 FTRLASRGVLEAMIRGFIKGGSLNLAWKLLVVAKKGNRMLDPSVYVKLILELGKNPDKNM 332
              LAS+ V+EAMI+G I+GG LNLA KL++++K  NR+LD SVYVK+ILE+ KNPDK  
Sbjct: 276 DNSLASKNVIEAMIKGCIEGGWLNLARKLILISKSNNRILDSSVYVKMILEIAKNPDKYH 335

Query: 333 LVLTLLDELGQREALKLNQQDTTAIMKVCTRLGKFEIAERLYGWYVESVHEPSVVMYTAL 392
           LV+ LL+EL +RE LKL+QQD T+IMK+C +LG+FE+ E L+ W+  S  EPSVVMYT +
Sbjct: 336 LVVALLEELKKREDLKLSQQDCTSIMKICVKLGEFELVESLFDWFKASNREPSVVMYTTM 395

Query: 393 IHSRYSEKKYREALSVVWEMEAANCPFDLPAYNVVIKLFVALGDLSRAARYFAKLKEAGF 452
           IHSRYSE+KYREA+SVVWEME +NC  DLPAY VVIKLFVAL DL RA RY++KLKEAGF
Sbjct: 396 IHSRYSEQKYREAMSVVWEMEESNCLLDLPAYRVVIKLFVALDDLGRAMRYYSKLKEAGF 455

Query: 453 APTYDIYRNLITIYLVSGRLAKCKEIYKEAKNAGFIIDKQITSRLLQCASRKM 502
           +PTYDIYR++I++Y  SGRL KCKEI KE ++AG  +DK  + RLLQ   + M
Sbjct: 456 SPTYDIYRDMISVYTASGRLTKCKEICKEVEDAGLRLDKDTSFRLLQLEKQTM 482

BLAST of Moc08g33000 vs. TAIR 10
Match: AT2G18940.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 75.9 bits (185), Expect = 1.1e-13
Identity = 71/304 (23.36%), Positives = 124/304 (40.79%), Query Frame = 0

Query: 206 LLKGSLSLTIRELGHMGLADRALQSFCW---AQEQPRLFPDDRVLASTVEVLSRNHELKV 265
           LL+  L   ++ L   G  +RA+  F W   +     L  D +V+   V +L R  +  V
Sbjct: 134 LLRTDLVSLVKGLDDSGHWERAVFLFEWLVLSSNSGALKLDHQVIEIFVRILGRESQYSV 193

Query: 266 ------PLNLEEFTRLASRGVLEAMIRGFIKGGSLNLAWKLLVVAKKGNRMLDPSVYVKL 325
                  + L+E+  L        ++  + + G    A  L    K+         Y  +
Sbjct: 194 AAKLLDKIPLQEY--LLDVRAYTTILHAYSRTGKYEKAIDLFERMKEMGPSPTLVTYNVI 253

Query: 326 ILELGKNPDKNMLVLTLLDELGQREALKLNQQDTTAIMKVCTRLGKFEIAERLYGWYVES 385
           +   GK       +L +LDE+ + + LK ++   + ++  C R G    A+  +      
Sbjct: 254 LDVFGKMGRSWRKILGVLDEM-RSKGLKFDEFTCSTVLSACAREGLLREAKEFFAELKSC 313

Query: 386 VHEPSVVMYTALIHSRYSEKKYREALSVVWEMEAANCPFDLPAYNVVIKLFVALGDLSRA 445
            +EP  V Y AL+        Y EALSV+ EME  +CP D   YN ++  +V  G    A
Sbjct: 314 GYEPGTVTYNALLQVFGKAGVYTEALSVLKEMEENSCPADSVTYNELVAAYVRAGFSKEA 373

Query: 446 ARYFAKLKEAGFAPTYDIYRNLITIYLVSGRLAKCKEIYKEAKNAGFIIDKQITSRLLQC 501
           A     + + G  P    Y  +I  Y  +G+  +  +++   K AG + +    + +L  
Sbjct: 374 AGVIEMMTKKGVMPNAITYTTVIDAYGKAGKEDEALKLFYSMKEAGCVPNTCTYNAVLSL 433

BLAST of Moc08g33000 vs. TAIR 10
Match: AT5G25630.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 68.6 bits (166), Expect = 1.7e-11
Identity = 37/124 (29.84%), Positives = 62/124 (50.00%), Query Frame = 0

Query: 351 TAIMKVCTRLGKFEIAERLYGWYVESVHEPSVVMYTALIHSRYSEKKYREALSVVWEMEA 410
           T +M V    G+   A+ ++    E+ H PS++ YT L+ +   +K+Y    S+V E+E 
Sbjct: 49  TKLMNVLIERGRPHEAQTVFKTLAETGHRPSLISYTTLLAAMTVQKQYGSISSIVSEVEQ 108

Query: 411 ANCPFDLPAYNVVIKLFVALGDLSRAARYFAKLKEAGFAPTYDIYRNLITIYLVSGRLAK 470
           +    D   +N VI  F   G++  A +   K+KE G  PT   Y  LI  Y ++G+  +
Sbjct: 109 SGTKLDSIFFNAVINAFSESGNMEDAVQALLKMKELGLNPTTSTYNTLIKGYGIAGKPER 168

Query: 471 CKEI 475
             E+
Sbjct: 169 SSEL 172

BLAST of Moc08g33000 vs. TAIR 10
Match: AT5G25630.2 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 68.6 bits (166), Expect = 1.7e-11
Identity = 37/124 (29.84%), Positives = 62/124 (50.00%), Query Frame = 0

Query: 351 TAIMKVCTRLGKFEIAERLYGWYVESVHEPSVVMYTALIHSRYSEKKYREALSVVWEMEA 410
           T +M V    G+   A+ ++    E+ H PS++ YT L+ +   +K+Y    S+V E+E 
Sbjct: 49  TKLMNVLIERGRPHEAQTVFKTLAETGHRPSLISYTTLLAAMTVQKQYGSISSIVSEVEQ 108

Query: 411 ANCPFDLPAYNVVIKLFVALGDLSRAARYFAKLKEAGFAPTYDIYRNLITIYLVSGRLAK 470
           +    D   +N VI  F   G++  A +   K+KE G  PT   Y  LI  Y ++G+  +
Sbjct: 109 SGTKLDSIFFNAVINAFSESGNMEDAVQALLKMKELGLNPTTSTYNTLIKGYGIAGKPER 168

Query: 471 CKEI 475
             E+
Sbjct: 169 SSEL 172

BLAST of Moc08g33000 vs. TAIR 10
Match: AT1G74850.1 (plastid transcriptionally active 2 )

HSP 1 Score: 66.6 bits (161), Expect = 6.4e-11
Identity = 73/345 (21.16%), Positives = 150/345 (43.48%), Query Frame = 0

Query: 147 RPPGLPNPKIRARTG--VPSRASLSKQVYKRPDFLIGLARAIRDLSREENVSKVLNRWAP 206
           R P   + KI+A+T   V    S+S +  K    +  L   +  L    ++++ L+ +  
Sbjct: 42  RKPCSFSGKIKAKTKDLVLGNPSVSVEKGKYSYDVESLINKLSSLPPRGSIARCLDIFKN 101

Query: 207 FLLKGSLSLTIRELGHMGLADRALQSFCWAQEQPRLFPDDRVLASTVEVLSRNHELKVPL 266
            L     +L  +E    G   R+L+ F + Q Q    P++ +    + +L R  E  +  
Sbjct: 102 KLSLNDFALVFKEFAGRGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGR--EGLLDK 161

Query: 267 NLEEFTRLASRGV------LEAMIRGFIKGGSLNLAWKLLVVAKKGNRMLDPSV--YVKL 326
            LE F  + S+GV        A+I  + + G    + +LL   K  N  + PS+  Y  +
Sbjct: 162 CLEVFDEMPSQGVSRSVFSYTALINAYGRNGRYETSLELLDRMK--NEKISPSILTYNTV 221

Query: 327 ILELGKNPDKNMLVLTLLDELGQREALKLNQQDTTAIMKVCTRLGKFEIAERLYGWYVES 386
           I    +       +L L  E+ + E ++ +      ++  C   G  + AE ++    + 
Sbjct: 222 INACARGGLDWEGLLGLFAEM-RHEGIQPDIVTYNTLLSACAIRGLGDEAEMVFRTMNDG 281

Query: 387 VHEPSVVMYTALIHSRYSEKKYREALSVVWEMEAANCPFDLPAYNVVIKLFVALGDLSRA 446
              P +  Y+ L+ +    ++  +   ++ EM +     D+ +YNV+++ +   G +  A
Sbjct: 282 GIVPDLTTYSHLVETFGKLRRLEKVCDLLGEMASGGSLPDITSYNVLLEAYAKSGSIKEA 341

Query: 447 ARYFAKLKEAGFAPTYDIYRNLITIYLVSGRLAKCKEIYKEAKNA 482
              F +++ AG  P  + Y  L+ ++  SGR    ++++ E K++
Sbjct: 342 MGVFHQMQAAGCTPNANTYSVLLNLFGQSGRYDDVRQLFLEMKSS 381

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_022153119.1 | 7.8e-285 | 100.00 | pentatricopeptide repeat-containing protein At2g01860 isoform X1 [Momordica char...
KAG7020726.1 | 7.9e-229 | 81.53 | Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
XP_038893977.1 | 3.9e-228 | 82.22 | pentatricopeptide repeat-containing protein At2g01860 [Benincasa hispida] >XP_03...
XP_022951807.1 | 5.1e-228 | 81.53 | pentatricopeptide repeat-containing protein At2g01860 isoform X1 [Cucurbita mosc...
KAG6585791.1 | 8.7e-228 | 81.33 | Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...

Match Name | E-value | Identity | Description
Q5XET4 | 1.4e-143 | 58.99 | Pentatricopeptide repeat-containing protein At2g01860 OS=Arabidopsis thaliana OX...
O64624 | 1.5e-12 | 23.36 | Pentatricopeptide repeat-containing protein At2g18940, chloroplastic OS=Arabidop...
Q8GZ63 | 2.4e-10 | 29.84 | Pentatricopeptide repeat-containing protein At5g25630 OS=Arabidopsis thaliana OX...
Q9S7Q2 | 9.0e-10 | 21.16 | Pentatricopeptide repeat-containing protein At1g74850, chloroplastic OS=Arabidop...
Q8L6Y7 | 2.5e-07 | 22.55 | Pentatricopeptide repeat-containing protein At2g38420, mitochondrial OS=Arabidop...

Match Name | E-value | Identity | Description
A0A6J1DI37 | 3.8e-285 | 100.00 | pentatricopeptide repeat-containing protein At2g01860 isoform X1 OS=Momordica ch...
A0A6J1GIP2 | 2.5e-228 | 81.53 | pentatricopeptide repeat-containing protein At2g01860 isoform X1 OS=Cucurbita mo...
A0A6J1KK31 | 1.4e-226 | 80.92 | pentatricopeptide repeat-containing protein At2g01860 isoform X1 OS=Cucurbita ma...
A0A1S3CGD0 | 1.1e-223 | 81.41 | pentatricopeptide repeat-containing protein At2g01860 OS=Cucumis melo OX=3656 GN...
A0A0A0LVM0 | 4.5e-222 | 80.40 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G144300 PE=4 SV=1

Match Name | E-value | Identity | Description
AT2G01860.1 | 9.8e-145 | 58.99 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G18940.1 | 1.1e-13 | 23.36 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G25630.1 | 1.7e-11 | 29.84 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G25630.2 | 1.7e-11 | 29.84 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G74850.1 | 6.4e-11 | 21.16 | plastid transcriptionally active 2
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 179..304; e-value: 3.3E-6; score: 28.7
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 323..503; e-value: 7.5E-25; score: 90.0
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 419..451; e-value: 0.0012; score: 16.8
IPR002885 | Pentatricopeptide repeat | PFAM | PF13812 | PPR_3 | coord: 379..428; e-value: 7.7E-6; score: 25.9
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 454..483; e-value: 0.55; score: 10.6
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 416..450; score: 10.13926
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 125..162
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 42..72
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 57..72
None | No IPR available | PANTHER | PTHR46128 | MITOCHONDRIAL GROUP I INTRON SPLICING FACTOR CCM1 | coord: 73..474
None | No IPR available | PANTHER | PTHR46128:SF179 | TETRATRICOPEPTIDE REPEAT-LIKE SUPERFAMILY PROTEIN | coord: 73..474
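
Several of the pentatricopeptide repeat (IPR002885) hits above overlap on the protein. A minimal Python sketch, assuming the `coord: start..end` values are 1-based inclusive residue positions (the page does not state this), that parses those fields and merges overlapping hits into distinct regions. `parse_coord` and `merge_intervals` are hypothetical helper names introduced here for illustration.

```python
import re

COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Return (start, end) from a 'coord: start..end' field."""
    m = COORD_RE.search(text)
    if m is None:
        raise ValueError(f"no coordinate field in: {text!r}")
    return int(m.group(1)), int(m.group(2))


def merge_intervals(intervals):
    """Merge intervals that share at least one residue (inclusive ends assumed)."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if this hit reaches further.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# The four IPR002885 hits listed above (TIGRFAM, two PFAM, PROSITE).
ppr_hits = [
    "coord: 419..451",
    "coord: 379..428",
    "coord: 454..483",
    "coord: 416..450",
]
ppr_regions = merge_intervals(parse_coord(hit) for hit in ppr_hits)
print(ppr_regions)
```

Under that assumption, the four hits collapse into two regions, which is a quick way to see how much of the sequence the signature databases agree on.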

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Moc08g33000.1 | Moc08g33000.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding