Moc01g28310 (gene) Bitter gourd (OHB3-1) v2

Overview
Name: Moc01g28310
Type: gene
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Pentatricopeptide repeat-containing protein
Location: chr1: 20101083 .. 20103215 (-)
RNA-Seq Expression: Moc01g28310
Synteny: Moc01g28310
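
The location field packs the sequence ID, start, end, and strand into one string. As a minimal sketch (the helper names `parse_location` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which this record does not state), the field can be split and the genomic span computed like this:

```python
import re

def parse_location(text):
    """Split a location string such as 'chr1: 20101083 .. 20103215 (-)'.

    Returns (seqid, start, end, strand). The layout is taken from the
    Location field of this record; other records may differ.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive (GFF-style).
    return end - start + 1

seqid, start, end, strand = parse_location("chr1: 20101083 .. 20103215 (-)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the span is 2,133 bp on the minus strand of chr1.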
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTCACGCTAACAGCAGAATCACCATGGGCTCAGACCATGTCTCTGCTCGAAAACTGTTCCACCATGAAGCACCTGAAGCAGATCCACGCTCAAATGATCAAGACAGAGACGGCCACAGACCCCAAATTAGCCACCAAGCTTCTAACGCTCTGCAGCACACCTCATTTCGGAAATCTCCATTACGCGCAAATGGTTTTCAATGGAATCACCAGGCCCACCACTTTCATGTGGAACGCCATCATTAGAGCGTACTCAGACAGTAACGAACCAAAAATAGCGTTTCTCTTATATTGTCAGATGCTTTCTTCTTCCGTGCCGCCCAATTCTCACACCTTCCCTTTCTTGCTCAAATCGTGCCGCGATTTGTCTGCCATGGGGGAGGCCCTTCAGGTCCACGGACTGATTGTGAAATTGGGATTTGGGTCGGATGTTTTCGCCATGAATTCTCTGATTCGTGCTTATACGTTATGTGGTGACATTCAGTATGCACGCCAACTGTTTGATCATATTCCTGAACGAGATGTCGTTTCTTGGAACACGATGGTTGATGGGTATATCAAATTTGGGGACGTAAAATCAGCGTATGGGGTTTTCTTGGAGATGCCGTTGAAGAATGTTGTCTCGTGGACGTCGATGATCTCGGGGCTAGTTGAGGCAGGACTGGGCGTGGAAGCTTTGGAACTTTGCTGCGAAATGCAGAGGGCGGGATTTGAACTTGATGCGGTTGCTGTTGCGAGTATGCTCACTGCTTGTGCAAATCTTGGAGCGTTGGAGCAAGGAAGATGGCTCCATTTCTATGTACTCAACAATGGAGTCCATCTTGATCGAGTCATTGCCTGTGCTCTAGTAAATATGTACGTAAAATGCGGCGACTTGGAAGAAGCCTTGCAGGTTTTTGCGGAGTTGAAGGCCTCTTTGGTATGCAATTCAGATTCAGCATCAGTTTATTGTTCTTACACTTAGTGGATATTTGGTTGGTGAGCTTATTAAGTGAAGAGTACTGAACACAGTTTCAGCAACTTCTTAAATCGAGATATAATTTCTGAATATGCTACCAACCATATTTAAAAACAACCTGAGATGATAACTCTATGTTTTTTTTATTAAATCCCTGTTAGAAAGATGTCCATGTATGGACGGCCATGATTGAAGGTTTTGCTATTAATGGGCGTGGAGTGGAAGCACTGGAATGGTTCAACAAAATGCAGAGAGAAGGAACAAGACCAAATTCCATCACTTTCACTGCAATTCTGACGGCCTGCAGCTACGGAGGATTGGTTGAAGAGGGAAAATCTTTATTCAAGAGCATGAAAAGTCTCTATAACTTGAGTCCATGTATTGAGCATTACGGGTGCATGGTTGATCTTCTGGGTCGAGCCGGGCTGTTGCAGGAAGCGAAGGAGTTGATCGAGATGATGCCGATGAAACCGAATGCTGTGATATGGGGAGCTCTACTGAAGGCTTGTCAGATTCATGGGGATTTTTTTCTTGCTGGCCAAATTGGAACCCATTTGGTGGAAGTTGATTCAGATCACAGCGGGCGGTACATTCAATTGGCAACCATCTTGGCTGCAGAAGGTAAATGGAAAGAAGCAGCTGAATTGAGGTTGAAGATGAAGAAGCTGAGGGTCCTAATTCCTCCGGGAAAGAGTTTGATAACTTTAGATGGCGTTGTTCATGAATTTCTAGCTGGGCATCAAGATCATCCACAGATGAAGCAGATTCACCAGAAGTTGAACCAGGTTGCAGAGAGATTACGCAAAGAAGGGTATTCTAAAATCACCATTGAATTCAAAAACTTACATCGATAAGTTATGGTATATTTGAAATTTTATCATATTCTTTAACAGGTACAAACCTGTGACTAAAGATTTATGGCTTGATCTTGAGAATGATGAGAAAGAGACAGTGATGGCGCAGCACAGTGAGAAGCTGGCGATTGCATTCGGATTGATCAATATGAAACCAGGAACGACGATTCGAGTTGTTAAGAATCT
AAGGGTATGTGGAGATTGCCATGACGTTGCGAAGCTCGTATCTAGGATCTATAAAAGAGACATTGTAATGCGGGATAGAGTTCGTTTCCACCATTTTAGAGATGGGAGATGTTCTTGCAGAGATTACTGGTAG

mRNA sequence

ATGTTCACGCTAACAGCAGAATCACCATGGGCTCAGACCATGTCTCTGCTCGAAAACTGTTCCACCATGAAGCACCTGAAGCAGATCCACGCTCAAATGATCAAGACAGAGACGGCCACAGACCCCAAATTAGCCACCAAGCTTCTAACGCTCTGCAGCACACCTCATTTCGGAAATCTCCATTACGCGCAAATGGTTTTCAATGGAATCACCAGGCCCACCACTTTCATGTGGAACGCCATCATTAGAGCGTACTCAGACAGTAACGAACCAAAAATAGCGTTTCTCTTATATTGTCAGATGCTTTCTTCTTCCGTGCCGCCCAATTCTCACACCTTCCCTTTCTTGCTCAAATCGTGCCGCGATTTGTCTGCCATGGGGGAGGCCCTTCAGGTCCACGGACTGATTGTGAAATTGGGATTTGGGTCGGATGTTTTCGCCATGAATTCTCTGATTCGTGCTTATACGTTATGTGGTGACATTCAGTATGCACGCCAACTGTTTGATCATATTCCTGAACGAGATGTCGTTTCTTGGAACACGATGGTTGATGGGTATATCAAATTTGGGGACGTAAAATCAGCGTATGGGGTTTTCTTGGAGATGCCGTTGAAGAATGTTGTCTCGTGGACGTCGATGATCTCGGGGCTAGTTGAGGCAGGACTGGGCGTGGAAGCTTTGGAACTTTGCTGCGAAATGCAGAGGGCGGGATTTGAACTTGATGCGGTTGCTGTTGCGAGTATGCTCACTGCTTGTGCAAATCTTGGAGCGTTGGAGCAAGGAAGATGGCTCCATTTCTATGTACTCAACAATGGAGTCCATCTTGATCGAGTCATTGCCTGTGCTCTAGTAAATATGTACGTAAAATGCGGCGACTTGGAAGAAGCCTTGCAGGTTTTTGCGGAGTTGAAGGCCTCTTTGAAAGATGTCCATGTATGGACGGCCATGATTGAAGGTTTTGCTATTAATGGGCGTGGAGTGGAAGCACTGGAATGGTTCAACAAAATGCAGAGAGAAGGAACAAGACCAAATTCCATCACTTTCACTGCAATTCTGACGGCCTGCAGCTACGGAGGATTGGTTGAAGAGGGAAAATCTTTATTCAAGAGCATGAAAAGTCTCTATAACTTGAGTCCATGTATTGAGCATTACGGGTGCATGGTTGATCTTCTGGGTCGAGCCGGGCTGTTGCAGGAAGCGAAGGAGTTGATCGAGATGATGCCGATGAAACCGAATGCTGTGATATGGGGAGCTCTACTGAAGGCTTGTCAGATTCATGGGGATTTTTTTCTTGCTGGCCAAATTGGAACCCATTTGGTGGAAGTTGATTCAGATCACAGCGGGCGGTACATTCAATTGGCAACCATCTTGGCTGCAGAAGGTAAATGGAAAGAAGCAGCTGAATTGAGGTTGAAGATGAAGAAGCTGAGGGTCCTAATTCCTCCGGGAAAGAGTTTGATAACTTTAGATGGCGTTGTTCATGAATTTCTAGCTGGGCATCAAGATCATCCACAGATGAAGCAGATTCACCAGAAGTTGAACCAGGTTGCAGAGAGATTACGCAAAGAAGGGTACAAACCTGTGACTAAAGATTTATGGCTTGATCTTGAGAATGATGAGAAAGAGACAGTGATGGCGCAGCACAGTGAGAAGCTGGCGATTGCATTCGGATTGATCAATATGAAACCAGGAACGACGATTCGAGTTGTTAAGAATCTAAGGGTATGTGGAGATTGCCATGACGTTGCGAAGCTCGTATCTAGGATCTATAAAAGAGACATTGTAATGCGGGATAGAGTTCGTTTCCACCATTTTAGAGATGGGAGATGTTCTTGCAGAGATTACTGGTAG

Coding sequence (CDS)

ATGTTCACGCTAACAGCAGAATCACCATGGGCTCAGACCATGTCTCTGCTCGAAAACTGTTCCACCATGAAGCACCTGAAGCAGATCCACGCTCAAATGATCAAGACAGAGACGGCCACAGACCCCAAATTAGCCACCAAGCTTCTAACGCTCTGCAGCACACCTCATTTCGGAAATCTCCATTACGCGCAAATGGTTTTCAATGGAATCACCAGGCCCACCACTTTCATGTGGAACGCCATCATTAGAGCGTACTCAGACAGTAACGAACCAAAAATAGCGTTTCTCTTATATTGTCAGATGCTTTCTTCTTCCGTGCCGCCCAATTCTCACACCTTCCCTTTCTTGCTCAAATCGTGCCGCGATTTGTCTGCCATGGGGGAGGCCCTTCAGGTCCACGGACTGATTGTGAAATTGGGATTTGGGTCGGATGTTTTCGCCATGAATTCTCTGATTCGTGCTTATACGTTATGTGGTGACATTCAGTATGCACGCCAACTGTTTGATCATATTCCTGAACGAGATGTCGTTTCTTGGAACACGATGGTTGATGGGTATATCAAATTTGGGGACGTAAAATCAGCGTATGGGGTTTTCTTGGAGATGCCGTTGAAGAATGTTGTCTCGTGGACGTCGATGATCTCGGGGCTAGTTGAGGCAGGACTGGGCGTGGAAGCTTTGGAACTTTGCTGCGAAATGCAGAGGGCGGGATTTGAACTTGATGCGGTTGCTGTTGCGAGTATGCTCACTGCTTGTGCAAATCTTGGAGCGTTGGAGCAAGGAAGATGGCTCCATTTCTATGTACTCAACAATGGAGTCCATCTTGATCGAGTCATTGCCTGTGCTCTAGTAAATATGTACGTAAAATGCGGCGACTTGGAAGAAGCCTTGCAGGTTTTTGCGGAGTTGAAGGCCTCTTTGAAAGATGTCCATGTATGGACGGCCATGATTGAAGGTTTTGCTATTAATGGGCGTGGAGTGGAAGCACTGGAATGGTTCAACAAAATGCAGAGAGAAGGAACAAGACCAAATTCCATCACTTTCACTGCAATTCTGACGGCCTGCAGCTACGGAGGATTGGTTGAAGAGGGAAAATCTTTATTCAAGAGCATGAAAAGTCTCTATAACTTGAGTCCATGTATTGAGCATTACGGGTGCATGGTTGATCTTCTGGGTCGAGCCGGGCTGTTGCAGGAAGCGAAGGAGTTGATCGAGATGATGCCGATGAAACCGAATGCTGTGATATGGGGAGCTCTACTGAAGGCTTGTCAGATTCATGGGGATTTTTTTCTTGCTGGCCAAATTGGAACCCATTTGGTGGAAGTTGATTCAGATCACAGCGGGCGGTACATTCAATTGGCAACCATCTTGGCTGCAGAAGGTAAATGGAAAGAAGCAGCTGAATTGAGGTTGAAGATGAAGAAGCTGAGGGTCCTAATTCCTCCGGGAAAGAGTTTGATAACTTTAGATGGCGTTGTTCATGAATTTCTAGCTGGGCATCAAGATCATCCACAGATGAAGCAGATTCACCAGAAGTTGAACCAGGTTGCAGAGAGATTACGCAAAGAAGGGTACAAACCTGTGACTAAAGATTTATGGCTTGATCTTGAGAATGATGAGAAAGAGACAGTGATGGCGCAGCACAGTGAGAAGCTGGCGATTGCATTCGGATTGATCAATATGAAACCAGGAACGACGATTCGAGTTGTTAAGAATCTAAGGGTATGTGGAGATTGCCATGACGTTGCGAAGCTCGTATCTAGGATCTATAAAAGAGACATTGTAATGCGGGATAGAGTTCGTTTCCACCATTTTAGAGATGGGAGATGTTCTTGCAGAGATTACTGGTAG

Protein sequence

MFTLTAESPWAQTMSLLENCSTMKHLKQIHAQMIKTETATDPKLATKLLTLCSTPHFGNLHYAQMVFNGITRPTTFMWNAIIRAYSDSNEPKIAFLLYCQMLSSSVPPNSHTFPFLLKSCRDLSAMGEALQVHGLIVKLGFGSDVFAMNSLIRAYTLCGDIQYARQLFDHIPERDVVSWNTMVDGYIKFGDVKSAYGVFLEMPLKNVVSWTSMISGLVEAGLGVEALELCCEMQRAGFELDAVAVASMLTACANLGALEQGRWLHFYVLNNGVHLDRVIACALVNMYVKCGDLEEALQVFAELKASLKDVHVWTAMIEGFAINGRGVEALEWFNKMQREGTRPNSITFTAILTACSYGGLVEEGKSLFKSMKSLYNLSPCIEHYGCMVDLLGRAGLLQEAKELIEMMPMKPNAVIWGALLKACQIHGDFFLAGQIGTHLVEVDSDHSGRYIQLATILAAEGKWKEAAELRLKMKKLRVLIPPGKSLITLDGVVHEFLAGHQDHPQMKQIHQKLNQVAERLRKEGYKPVTKDLWLDLENDEKETVMAQHSEKLAIAFGLINMKPGTTIRVVKNLRVCGDCHDVAKLVSRIYKRDIVMRDRVRFHHFRDGRCSCRDYW
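
The protein sequence above corresponds to the CDS read codon by codon. The following is a small sketch, assuming the standard genetic code (NCBI translation table 1) applies to this gene; the function name `translate` is hypothetical and the two test strings are the first six and last four codons of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGTTCACGCTAACAGCA"))  # start of the CDS
print(translate("GATTACTGGTAG"))        # end of the CDS, including the stop codon
```

The two fragments translate to MFTLTA and DYW, matching the first six and last three residues of the protein sequence shown above.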
Homology
BLAST of Moc01g28310 vs. NCBI nr
Match: XP_022132422.1 (pentatricopeptide repeat-containing protein At5g66520 [Momordica charantia])

HSP 1 Score: 1256.9 bits (3251), Expect = 0.0e+00
Identity = 616/616 (100.00%), Positives = 616/616 (100.00%), Query Frame = 0

Query: 1   MFTLTAESPWAQTMSLLENCSTMKHLKQIHAQMIKTETATDPKLATKLLTLCSTPHFGNL 60
           MFTLTAESPWAQTMSLLENCSTMKHLKQIHAQMIKTETATDPKLATKLLTLCSTPHFGNL
Sbjct: 1   MFTLTAESPWAQTMSLLENCSTMKHLKQIHAQMIKTETATDPKLATKLLTLCSTPHFGNL 60

Query: 61  HYAQMVFNGITRPTTFMWNAIIRAYSDSNEPKIAFLLYCQMLSSSVPPNSHTFPFLLKSC 120
           HYAQMVFNGITRPTTFMWNAIIRAYSDSNEPKIAFLLYCQMLSSSVPPNSHTFPFLLKSC
Sbjct: 61  HYAQMVFNGITRPTTFMWNAIIRAYSDSNEPKIAFLLYCQMLSSSVPPNSHTFPFLLKSC 120

Query: 121 RDLSAMGEALQVHGLIVKLGFGSDVFAMNSLIRAYTLCGDIQYARQLFDHIPERDVVSWN 180
           RDLSAMGEALQVHGLIVKLGFGSDVFAMNSLIRAYTLCGDIQYARQLFDHIPERDVVSWN
Sbjct: 121 RDLSAMGEALQVHGLIVKLGFGSDVFAMNSLIRAYTLCGDIQYARQLFDHIPERDVVSWN 180

Query: 181 TMVDGYIKFGDVKSAYGVFLEMPLKNVVSWTSMISGLVEAGLGVEALELCCEMQRAGFEL 240
           TMVDGYIKFGDVKSAYGVFLEMPLKNVVSWTSMISGLVEAGLGVEALELCCEMQRAGFEL
Sbjct: 181 TMVDGYIKFGDVKSAYGVFLEMPLKNVVSWTSMISGLVEAGLGVEALELCCEMQRAGFEL 240

Query: 241 DAVAVASMLTACANLGALEQGRWLHFYVLNNGVHLDRVIACALVNMYVKCGDLEEALQVF 300
           DAVAVASMLTACANLGALEQGRWLHFYVLNNGVHLDRVIACALVNMYVKCGDLEEALQVF
Sbjct: 241 DAVAVASMLTACANLGALEQGRWLHFYVLNNGVHLDRVIACALVNMYVKCGDLEEALQVF 300

Query: 301 AELKASLKDVHVWTAMIEGFAINGRGVEALEWFNKMQREGTRPNSITFTAILTACSYGGL 360
           AELKASLKDVHVWTAMIEGFAINGRGVEALEWFNKMQREGTRPNSITFTAILTACSYGGL
Sbjct: 301 AELKASLKDVHVWTAMIEGFAINGRGVEALEWFNKMQREGTRPNSITFTAILTACSYGGL 360

Query: 361 VEEGKSLFKSMKSLYNLSPCIEHYGCMVDLLGRAGLLQEAKELIEMMPMKPNAVIWGALL 420
           VEEGKSLFKSMKSLYNLSPCIEHYGCMVDLLGRAGLLQEAKELIEMMPMKPNAVIWGALL
Sbjct: 361 VEEGKSLFKSMKSLYNLSPCIEHYGCMVDLLGRAGLLQEAKELIEMMPMKPNAVIWGALL 420

Query: 421 KACQIHGDFFLAGQIGTHLVEVDSDHSGRYIQLATILAAEGKWKEAAELRLKMKKLRVLI 480
           KACQIHGDFFLAGQIGTHLVEVDSDHSGRYIQLATILAAEGKWKEAAELRLKMKKLRVLI
Sbjct: 421 KACQIHGDFFLAGQIGTHLVEVDSDHSGRYIQLATILAAEGKWKEAAELRLKMKKLRVLI 480

Query: 481 PPGKSLITLDGVVHEFLAGHQDHPQMKQIHQKLNQVAERLRKEGYKPVTKDLWLDLENDE 540
           PPGKSLITLDGVVHEFLAGHQDHPQMKQIHQKLNQVAERLRKEGYKPVTKDLWLDLENDE
Sbjct: 481 PPGKSLITLDGVVHEFLAGHQDHPQMKQIHQKLNQVAERLRKEGYKPVTKDLWLDLENDE 540

Query: 541 KETVMAQHSEKLAIAFGLINMKPGTTIRVVKNLRVCGDCHDVAKLVSRIYKRDIVMRDRV 600
           KETVMAQHSEKLAIAFGLINMKPGTTIRVVKNLRVCGDCHDVAKLVSRIYKRDIVMRDRV
Sbjct: 541 KETVMAQHSEKLAIAFGLINMKPGTTIRVVKNLRVCGDCHDVAKLVSRIYKRDIVMRDRV 600

Query: 601 RFHHFRDGRCSCRDYW 617
           RFHHFRDGRCSCRDYW
Sbjct: 601 RFHHFRDGRCSCRDYW 616
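
The percentages in each HSP summary line are the identical (or positive-scoring) positions divided by the alignment length, which includes gap columns. A hedged sketch of how such a line could be parsed and the percentages recomputed follows; `parse_hsp_stats` and `percent` are hypothetical helper names, and the pattern accepts both the "Postives" and "Positives" spellings of the label.

```python
import re

# Matches e.g. "Identity = 503/621 (81.00%), Positives = 560/621 (90.18%)"
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos(?:i)?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Extract identical/positive counts and the alignment length from an HSP line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, aligned, ident_pct, pos, _aligned2, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned": int(aligned),
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }

def percent(count, total):
    # Percentage rounded to two decimals, as in the summary lines.
    return round(100.0 * count / total, 2)

stats = parse_hsp_stats(
    "Identity = 503/621 (81.00%), Positives = 560/621 (90.18%), Query Frame = 0"
)
print(percent(stats["identical"], stats["aligned"]),
      percent(stats["positives"], stats["aligned"]))
```

For the Cucurbita maxima hit, 503/621 and 560/621 recompute to the reported 81.00% and 90.18%.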

BLAST of Moc01g28310 vs. NCBI nr
Match: XP_022978438.1 (pentatricopeptide repeat-containing protein At5g66520 [Cucurbita maxima])

HSP 1 Score: 1038.9 bits (2685), Expect = 1.8e-299
Identity = 503/621 (81.00%), Positives = 560/621 (90.18%), Query Frame = 0

Query: 1   MFTLTAESP----WAQTMSLLENCSTMKHLKQIHAQMIKTETATDPKLATKLLTLCSTPH 60
           MF L AESP    WAQTMSLLENCS MK LK+IHAQMI+T TAT+PKLATKLLTLC++PH
Sbjct: 1   MFALKAESPMQSTWAQTMSLLENCSNMKQLKEIHAQMIRTGTATEPKLATKLLTLCTSPH 60

Query: 61  FGNLHYAQMVFNGITRPTTFMWNAIIRAYSDSNEPKIAFLLYCQMLSSSVPPNSHTFPFL 120
           FG+LHYAQ VFNGI+ PTTFMWNA+IRAYS+SNEP++AFLLY QMLSSSVP NS+TFPFL
Sbjct: 61  FGDLHYAQRVFNGISSPTTFMWNAMIRAYSNSNEPELAFLLYRQMLSSSVPHNSYTFPFL 120

Query: 121 LKSCRDLSAMGEALQVHGLIVKLGFGSDVFAMNSLIRAYTLCGDIQYARQLFDHIPERDV 180
           LK+CR+ SAM EALQVHGL++KLGFGSDVFA+N+L+  Y LCGDIQYARQLFD+IPERD+
Sbjct: 121 LKACRNFSAMSEALQVHGLVIKLGFGSDVFALNALLHVYALCGDIQYARQLFDNIPERDI 180

Query: 181 VSWNTMVDGYIKFGDVKSAYGVFLEMPLKNVVSWTSMISGLVEAGLGVEALELCCEMQRA 240
           VSWN M+DGYIK GDVK+AYGVFL+MPLKNVVSWTS+ISGLVEAGL VEAL LC EMQ A
Sbjct: 181 VSWNIMIDGYIKSGDVKTAYGVFLDMPLKNVVSWTSLISGLVEAGLNVEALSLCHEMQNA 240

Query: 241 GFELDAVAVASMLTACANLGALEQGRWLHFYVLNNGVHLDRVIACALVNMYVKCGDLEEA 300
           GFELD VA+AS+LTACANLGAL+QGRWLHFYVLNNGVH+DRVI CALVNMY+KCGD+EEA
Sbjct: 241 GFELDGVAIASLLTACANLGALDQGRWLHFYVLNNGVHVDRVIGCALVNMYLKCGDMEEA 300

Query: 301 LQVFAELKASLKDVHVWTAMIEGFAINGRGVEALEWFNKMQREGTRPNSITFTAILTACS 360
           LQ F +LK   KDV+VWTAMI+GFAI+GRGVEALEWF +M REG RPNSITFTA+L ACS
Sbjct: 301 LQEFGKLKGDQKDVYVWTAMIDGFAIHGRGVEALEWFKRMLREGIRPNSITFTAVLRACS 360

Query: 361 YGGLVEEGKSLFKSMKSLYNLSPCIEHYGCMVDLLGRAGLLQEAKELIEMMPMKPNAVIW 420
           Y GLVEEGK LF+SM S+Y LSP IEHYGCMVDLLGRAGLL+EAKELI+ MPMKPNA+IW
Sbjct: 361 YAGLVEEGKVLFESMMSVYILSPSIEHYGCMVDLLGRAGLLEEAKELIKTMPMKPNAIIW 420

Query: 421 GALLKACQIHGDFFLAGQIGTHLVEVDSDHSGRYIQLATILAAEGKWKEAAELRLKMKKL 480
           GALLKAC+IH DF + GQIG HLVEVDSDHSGRYIQLATILAAEGKWKEAAE+RLKMK L
Sbjct: 421 GALLKACRIHRDFLVGGQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNL 480

Query: 481 RVLIPPGKSLITLDGVVHEFLAGHQDHPQMKQIHQKLNQVAERLRK-EGYKPVTKDLWLD 540
           RV IPPGKS ITL+GVVHEFLAGHQDHPQM+QI  KLNQV ERLR+ EGY+P TKDL LD
Sbjct: 481 RVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQILHKLNQVVERLRQHEGYEPATKDLLLD 540

Query: 541 LENDEKETVMAQHSEKLAIAFGLINMKPGTTIRVVKNLRVCGDCHDVAKLVSRIYKRDIV 600
           LEN+ KET +AQHSEKLAIAFGLIN KPG+TIRVVKNLRVC DCH VAKL+SRIY+R+I+
Sbjct: 541 LENEAKETAVAQHSEKLAIAFGLINTKPGSTIRVVKNLRVCEDCHVVAKLISRIYRREII 600

Query: 601 MRDRVRFHHFRDGRCSCRDYW 617
           MRDRVRFHHFR G CSC+DYW
Sbjct: 601 MRDRVRFHHFRGGSCSCKDYW 621

BLAST of Moc01g28310 vs. NCBI nr
Match: XP_023543056.1 (pentatricopeptide repeat-containing protein At5g66520 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1033.1 bits (2670), Expect = 9.8e-298
Identity = 498/621 (80.19%), Positives = 559/621 (90.02%), Query Frame = 0

Query: 1   MFTLTAESP----WAQTMSLLENCSTMKHLKQIHAQMIKTETATDPKLATKLLTLCSTPH 60
           MF L AESP    WAQTMSLL+NCS MK LK+IHAQMI+T TAT+PKLATKLLTLC++PH
Sbjct: 1   MFALKAESPMQSTWAQTMSLLDNCSNMKQLKEIHAQMIRTGTATEPKLATKLLTLCTSPH 60

Query: 61  FGNLHYAQMVFNGITRPTTFMWNAIIRAYSDSNEPKIAFLLYCQMLSSSVPPNSHTFPFL 120
            G+LHYAQ VFNGI+ PTTFMWNA+IRAYS+SNEP++AFLLY +MLSSSVP NS+TFPFL
Sbjct: 61  LGDLHYAQRVFNGISSPTTFMWNAMIRAYSNSNEPELAFLLYRRMLSSSVPHNSYTFPFL 120

Query: 121 LKSCRDLSAMGEALQVHGLIVKLGFGSDVFAMNSLIRAYTLCGDIQYARQLFDHIPERDV 180
           LK+CR+ SAM EALQVHGL++KLGFGSDVFA+N+L+  Y LCGDIQYARQLFD+IPERD+
Sbjct: 121 LKACRNFSAMSEALQVHGLVIKLGFGSDVFALNALLHVYALCGDIQYARQLFDNIPERDI 180

Query: 181 VSWNTMVDGYIKFGDVKSAYGVFLEMPLKNVVSWTSMISGLVEAGLGVEALELCCEMQRA 240
           VSWN M+DGYIK GDVK+AYGVFL+MPLKNVVSWTS+ISGLVEAGL VEAL LC EMQ A
Sbjct: 181 VSWNIMIDGYIKSGDVKTAYGVFLDMPLKNVVSWTSLISGLVEAGLNVEALSLCHEMQNA 240

Query: 241 GFELDAVAVASMLTACANLGALEQGRWLHFYVLNNGVHLDRVIACALVNMYVKCGDLEEA 300
           GFELD VA+AS+LTACANLGAL+QGRWLHFYVLNNGVH+DRVI CALVNMY+KCGD+EEA
Sbjct: 241 GFELDGVAIASLLTACANLGALDQGRWLHFYVLNNGVHVDRVIGCALVNMYLKCGDMEEA 300

Query: 301 LQVFAELKASLKDVHVWTAMIEGFAINGRGVEALEWFNKMQREGTRPNSITFTAILTACS 360
           L+ F +LK   KDV+VWTAMI+GFAI+GRGVEALEWF +M REG RPNSITFTA+L ACS
Sbjct: 301 LREFGKLKGDQKDVYVWTAMIDGFAIHGRGVEALEWFKRMLREGIRPNSITFTAVLRACS 360

Query: 361 YGGLVEEGKSLFKSMKSLYNLSPCIEHYGCMVDLLGRAGLLQEAKELIEMMPMKPNAVIW 420
           Y GLVEEGK LF+SM S+YNLSP IEHYGCMVDLLGRAGLL+EAKELI+ MPMKPNA+IW
Sbjct: 361 YAGLVEEGKVLFESMMSVYNLSPSIEHYGCMVDLLGRAGLLEEAKELIKTMPMKPNAIIW 420

Query: 421 GALLKACQIHGDFFLAGQIGTHLVEVDSDHSGRYIQLATILAAEGKWKEAAELRLKMKKL 480
           GALLKAC+IH DF + GQIG HLVEVDSDHSGRYIQLATILAAEGKWKEAAE+RLKMK L
Sbjct: 421 GALLKACRIHRDFLVGGQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNL 480

Query: 481 RVLIPPGKSLITLDGVVHEFLAGHQDHPQMKQIHQKLNQVAERLRK-EGYKPVTKDLWLD 540
           R+ IPPGKS ITL+GVVH+FLAGHQDHPQM+QI  KLNQV ERLR+ EGY+P TKDL LD
Sbjct: 481 RLPIPPGKSSITLNGVVHQFLAGHQDHPQMEQILHKLNQVVERLRQHEGYEPATKDLLLD 540

Query: 541 LENDEKETVMAQHSEKLAIAFGLINMKPGTTIRVVKNLRVCGDCHDVAKLVSRIYKRDIV 600
           LEN+ KET +AQHSEKLAIAFGLIN KPG+TIRVVKNLRVC DCH VAKL+SRIY+R+I+
Sbjct: 541 LENEAKETAVAQHSEKLAIAFGLINTKPGSTIRVVKNLRVCEDCHVVAKLISRIYRREII 600

Query: 601 MRDRVRFHHFRDGRCSCRDYW 617
           MRDRVRFHHFR G CSC DYW
Sbjct: 601 MRDRVRFHHFRGGNCSCNDYW 621

BLAST of Moc01g28310 vs. NCBI nr
Match: XP_022949774.1 (pentatricopeptide repeat-containing protein At5g66520 [Cucurbita moschata])

HSP 1 Score: 1033.1 bits (2670), Expect = 9.8e-298
Identity = 499/621 (80.35%), Positives = 559/621 (90.02%), Query Frame = 0

Query: 1   MFTLTAESP----WAQTMSLLENCSTMKHLKQIHAQMIKTETATDPKLATKLLTLCSTPH 60
           MF L AESP    WAQTMSLLENCS MK LK+IHAQMI+T TAT+PKLATKLLTLC +PH
Sbjct: 1   MFALKAESPVQSTWAQTMSLLENCSNMKQLKEIHAQMIRTGTATEPKLATKLLTLCISPH 60

Query: 61  FGNLHYAQMVFNGITRPTTFMWNAIIRAYSDSNEPKIAFLLYCQMLSSSVPPNSHTFPFL 120
           FG+LHYAQ VFNGI+ PTTFMWNA+IRAYS+SNEP++AFLLY QMLSSSVP NS+TFPFL
Sbjct: 61  FGDLHYAQRVFNGISSPTTFMWNAMIRAYSNSNEPELAFLLYRQMLSSSVPHNSYTFPFL 120

Query: 121 LKSCRDLSAMGEALQVHGLIVKLGFGSDVFAMNSLIRAYTLCGDIQYARQLFDHIPERDV 180
           LK+CR+ SAM EALQVHGL++KLGFGSDVFA+N+L+  Y LCGDIQYARQLFD+IPERD+
Sbjct: 121 LKACRNFSAMSEALQVHGLVIKLGFGSDVFALNALLHVYALCGDIQYARQLFDNIPERDI 180

Query: 181 VSWNTMVDGYIKFGDVKSAYGVFLEMPLKNVVSWTSMISGLVEAGLGVEALELCCEMQRA 240
           VSWN M+DGYIK GDVK+AYGVFL+MPLKNVVSWTS+ISGLVEAGL VEAL LC EMQ A
Sbjct: 181 VSWNIMIDGYIKSGDVKTAYGVFLDMPLKNVVSWTSLISGLVEAGLNVEALSLCHEMQNA 240

Query: 241 GFELDAVAVASMLTACANLGALEQGRWLHFYVLNNGVHLDRVIACALVNMYVKCGDLEEA 300
           GFELD VA+AS+LTACANLGAL+QGRWLHFYVLNNGVH+DRVI CALVNMY+KCGD+EEA
Sbjct: 241 GFELDGVAIASLLTACANLGALDQGRWLHFYVLNNGVHVDRVIGCALVNMYLKCGDMEEA 300

Query: 301 LQVFAELKASLKDVHVWTAMIEGFAINGRGVEALEWFNKMQREGTRPNSITFTAILTACS 360
           L+ F +LK   KDV+VWTAMI+GFAI+GRGVEALEWF +M REG RPNSITFTA+L ACS
Sbjct: 301 LREFGKLKGDQKDVYVWTAMIDGFAIHGRGVEALEWFKRMLREGIRPNSITFTAVLRACS 360

Query: 361 YGGLVEEGKSLFKSMKSLYNLSPCIEHYGCMVDLLGRAGLLQEAKELIEMMPMKPNAVIW 420
           Y GLVEEGK LF+SM S+YNLSP IEHYGCMVDLLGRAGLL+EAKELI+ MPM+PNA+IW
Sbjct: 361 YAGLVEEGKVLFESMMSVYNLSPSIEHYGCMVDLLGRAGLLEEAKELIKTMPMEPNAIIW 420

Query: 421 GALLKACQIHGDFFLAGQIGTHLVEVDSDHSGRYIQLATILAAEGKWKEAAELRLKMKKL 480
           GALLKAC+IH DF + GQIG HLVEVDSDHSGRYIQLATILAAEGKWKEAAE+RLKMK L
Sbjct: 421 GALLKACRIHRDFLVGGQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNL 480

Query: 481 RVLIPPGKSLITLDGVVHEFLAGHQDHPQMKQIHQKLNQVAERLRK-EGYKPVTKDLWLD 540
           R+ IPPGKS ITL+GVVHEFLAGHQDHPQM+QI  KLNQV ERLR+ EGY+P TKDL LD
Sbjct: 481 RLPIPPGKSSITLNGVVHEFLAGHQDHPQMEQILHKLNQVVERLRQHEGYEPATKDLLLD 540

Query: 541 LENDEKETVMAQHSEKLAIAFGLINMKPGTTIRVVKNLRVCGDCHDVAKLVSRIYKRDIV 600
           LE++ KET +AQHSEKLAIAFGLIN KPG+TIRVVKNLRVC DCH VAKL+S+IY+R+I+
Sbjct: 541 LESEAKETAVAQHSEKLAIAFGLINTKPGSTIRVVKNLRVCEDCHVVAKLISQIYRREII 600

Query: 601 MRDRVRFHHFRDGRCSCRDYW 617
           MRDRVRFHHFR G CSC DYW
Sbjct: 601 MRDRVRFHHFRGGNCSCNDYW 621

BLAST of Moc01g28310 vs. NCBI nr
Match: KAG6603968.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1030.0 bits (2662), Expect = 8.3e-297
Identity = 498/621 (80.19%), Positives = 558/621 (89.86%), Query Frame = 0

Query: 1   MFTLTAESP----WAQTMSLLENCSTMKHLKQIHAQMIKTETATDPKLATKLLTLCSTPH 60
           MF L AESP    WAQTMSLLENCS MK LK+IHAQMI+T TAT+PKLATKLLTLC +PH
Sbjct: 1   MFALKAESPVQSTWAQTMSLLENCSNMKQLKEIHAQMIRTGTATEPKLATKLLTLCISPH 60

Query: 61  FGNLHYAQMVFNGITRPTTFMWNAIIRAYSDSNEPKIAFLLYCQMLSSSVPPNSHTFPFL 120
           FG+LHYAQ VFNGI+ PTTFMWNA+IRAYS+SNEP++AFLLY QMLSSSVP NS+TFPFL
Sbjct: 61  FGDLHYAQRVFNGISSPTTFMWNAMIRAYSNSNEPELAFLLYRQMLSSSVPHNSYTFPFL 120

Query: 121 LKSCRDLSAMGEALQVHGLIVKLGFGSDVFAMNSLIRAYTLCGDIQYARQLFDHIPERDV 180
           LK+CR+ SAM EALQVHGL++KLGFGSDVFA+N+L+  Y LCGDIQYARQLFD+IPERD+
Sbjct: 121 LKACRNFSAMSEALQVHGLVIKLGFGSDVFALNALLHVYALCGDIQYARQLFDNIPERDI 180

Query: 181 VSWNTMVDGYIKFGDVKSAYGVFLEMPLKNVVSWTSMISGLVEAGLGVEALELCCEMQRA 240
           VSWN M+DGYIK GDVK+AYGVFL+MPLKNVVSWTS+ISGLVEAGL VEAL LC EMQ A
Sbjct: 181 VSWNIMIDGYIKSGDVKTAYGVFLDMPLKNVVSWTSLISGLVEAGLNVEALSLCHEMQNA 240

Query: 241 GFELDAVAVASMLTACANLGALEQGRWLHFYVLNNGVHLDRVIACALVNMYVKCGDLEEA 300
           G ELD VA+AS+LTACANLGAL+QGRWLHFYVLNNGVH+DRVI CALVNMY+KCGD+EEA
Sbjct: 241 GCELDGVAIASLLTACANLGALDQGRWLHFYVLNNGVHVDRVIGCALVNMYLKCGDMEEA 300

Query: 301 LQVFAELKASLKDVHVWTAMIEGFAINGRGVEALEWFNKMQREGTRPNSITFTAILTACS 360
           L+ F +LK   KDV+VWTAMI+GFAI+GRGVEALEWF +M REG RPNSITFTA+L ACS
Sbjct: 301 LREFGKLKGDQKDVYVWTAMIDGFAIHGRGVEALEWFKRMLREGIRPNSITFTAVLRACS 360

Query: 361 YGGLVEEGKSLFKSMKSLYNLSPCIEHYGCMVDLLGRAGLLQEAKELIEMMPMKPNAVIW 420
           Y GLVEEGK LF+SM S+YNLSP IEHYGCMVDLLGRAGLL+EAKELI+ MPM+PNA+IW
Sbjct: 361 YAGLVEEGKVLFESMMSVYNLSPSIEHYGCMVDLLGRAGLLEEAKELIKTMPMEPNAIIW 420

Query: 421 GALLKACQIHGDFFLAGQIGTHLVEVDSDHSGRYIQLATILAAEGKWKEAAELRLKMKKL 480
           GALLKAC+IH DF + GQIG HLVEVDSDHSGRYIQLATILAAEGKWKEAAE+RLKMK L
Sbjct: 421 GALLKACRIHRDFLVGGQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNL 480

Query: 481 RVLIPPGKSLITLDGVVHEFLAGHQDHPQMKQIHQKLNQVAERLRK-EGYKPVTKDLWLD 540
           R+ IPPGKS ITL+GVVHEFLAGHQDHPQM+QI  KLNQV ERLR+ EGY+P TKDL LD
Sbjct: 481 RLPIPPGKSSITLNGVVHEFLAGHQDHPQMEQILHKLNQVVERLRQHEGYEPATKDLLLD 540

Query: 541 LENDEKETVMAQHSEKLAIAFGLINMKPGTTIRVVKNLRVCGDCHDVAKLVSRIYKRDIV 600
           LE++ KET +AQHSEKLAIAFGLIN KPG+TIRVVKNLRVC DCH VAKL+S+IY+R+I+
Sbjct: 541 LESEAKETAVAQHSEKLAIAFGLINTKPGSTIRVVKNLRVCEDCHVVAKLISQIYRREII 600

Query: 601 MRDRVRFHHFRDGRCSCRDYW 617
           MRDRVRFHHFR G CSC DYW
Sbjct: 601 MRDRVRFHHFRGGNCSCNDYW 621

BLAST of Moc01g28310 vs. ExPASy Swiss-Prot
Match: Q9FJY7 (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 670.2 bits (1728), Expect = 2.2e-191
Identity = 329/607 (54.20%), Positives = 433/607 (71.33%), Query Frame = 0

Query: 12  QTMSLLENCSTMKHLKQIHAQMIKTETATDPKLATKLLTLC-STPHFGNLHYAQMVFNGI 71
           +TMS L+ CS  + LKQIHA+M+KT    D    TK L+ C S+     L YAQ+VF+G 
Sbjct: 16  ETMSCLQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGF 75

Query: 72  TRPTTFMWNAIIRAYSDSNEPKIAFLLYCQMLSSSVPPNSHTFPFLLKSCRDLSAMGEAL 131
            RP TF+WN +IR +S S+EP+ + LLY +ML SS P N++TFP LLK+C +LSA  E  
Sbjct: 76  DRPDTFLWNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETT 135

Query: 132 QVHGLIVKLGFGSDVFAMNSLIRAYTLCGDIQYARQLFDHIPERDVVSWNTMVDGYIKFG 191
           Q+H  I KLG+ +DV+A+NSLI +Y + G+ + A  LFD IPE D VSWN+++ GY+K G
Sbjct: 136 QIHAQITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKAG 195

Query: 192 DVKSAYGVFLEMPLKNVVSWTSMISGLVEAGLGVEALELCCEMQRAGFELDAVAVASMLT 251
            +  A  +F +M  KN +SWT+MISG V+A +  EAL+L  EMQ +  E D V++A+ L+
Sbjct: 196 KMDIALTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLANALS 255

Query: 252 ACANLGALEQGRWLHFYVLNNGVHLDRVIACALVNMYVKCGDLEEALQVFAELKASLKDV 311
           ACA LGALEQG+W+H Y+    + +D V+ C L++MY KCG++EEAL+VF  +K   K V
Sbjct: 256 ACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKK--KSV 315

Query: 312 HVWTAMIEGFAINGRGVEALEWFNKMQREGTRPNSITFTAILTACSYGGLVEEGKSLFKS 371
             WTA+I G+A +G G EA+  F +MQ+ G +PN ITFTA+LTACSY GLVEEGK +F S
Sbjct: 316 QAWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYS 375

Query: 372 MKSLYNLSPCIEHYGCMVDLLGRAGLLQEAKELIEMMPMKPNAVIWGALLKACQIHGDFF 431
           M+  YNL P IEHYGC+VDLLGRAGLL EAK  I+ MP+KPNAVIWGALLKAC+IH +  
Sbjct: 376 MERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIE 435

Query: 432 LAGQIGTHLVEVDSDHSGRYIQLATILAAEGKWKEAAELRLKMKKLRVLIPPGKSLITLD 491
           L  +IG  L+ +D  H GRY+  A I A + KW +AAE R  MK+  V   PG S I+L+
Sbjct: 436 LGEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTISLE 495

Query: 492 GVVHEFLAGHQDHPQMKQIHQKLNQVAERLRKEGYKPVTKDLWLDL-ENDEKETVMAQHS 551
           G  HEFLAG + HP++++I  K   +  +L + GY P  +++ LDL ++DE+E ++ QHS
Sbjct: 496 GTTHEFLAGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIVHQHS 555

Query: 552 EKLAIAFGLINMKPGTTIRVVKNLRVCGDCHDVAKLVSRIYKRDIVMRDRVRFHHFRDGR 611
           EKLAI +GLI  KPGT IR++KNLRVC DCH V KL+S+IYKRDIVMRDR RFHHFRDG+
Sbjct: 556 EKLAITYGLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRFHHFRDGK 615

Query: 612 CSCRDYW 617
           CSC DYW
Sbjct: 616 CSCGDYW 620

BLAST of Moc01g28310 vs. ExPASy Swiss-Prot
Match: Q9FI80 (Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H38 PE=2 SV=1)

HSP 1 Score: 559.7 bits (1441), Expect = 4.2e-158
Identity = 283/633 (44.71%), Positives = 411/633 (64.93%), Query Frame = 0

Query: 6   AESPWAQTMSL---LENCSTMKHLKQIHAQMIKTETATDPKLATKLLTLCSTP--HFGNL 65
           A SP +   SL   + NC T++ L QIHA  IK+    D   A ++L  C+T   H  +L
Sbjct: 16  ASSPASHPSSLFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHHRDL 75

Query: 66  HYAQMVFNGITRPTTFMWNAIIRAYSDSNEPK--IAFLLYCQMLSSS-VPPNSHTFPFLL 125
            YA  +FN + +   F WN IIR +S+S+E K  IA  L+ +M+S   V PN  TFP +L
Sbjct: 76  DYAHKIFNQMPQRNCFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVL 135

Query: 126 KSCRDLSAMGEALQVHGLIVKLGFGSDVFAMNSLIRAYTLCGDIQYARQLF-DHIPERD- 185
           K+C     + E  Q+HGL +K GFG D F M++L+R Y +CG ++ AR LF  +I E+D 
Sbjct: 136 KACAKTGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDM 195

Query: 186 ------------VVSWNTMVDGYIKFGDVKSAYGVFLEMPLKNVVSWTSMISGLVEAGLG 245
                       +V WN M+DGY++ GD K+A  +F +M  ++VVSW +MISG    G  
Sbjct: 196 VVMTDRRKRDGEIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVSWNTMISGYSLNGFF 255

Query: 246 VEALELCCEMQRAGFELDAVAVASMLTACANLGALEQGRWLHFYVLNNGVHLDRVIACAL 305
            +A+E+  EM++     + V + S+L A + LG+LE G WLH Y  ++G+ +D V+  AL
Sbjct: 256 KDAVEVFREMKKGDIRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGSAL 315

Query: 306 VNMYVKCGDLEEALQVFAELKASLKDVHVWTAMIEGFAINGRGVEALEWFNKMQREGTRP 365
           ++MY KCG +E+A+ VF  L    ++V  W+AMI GFAI+G+  +A++ F KM++ G RP
Sbjct: 316 IDMYSKCGIIEKAIHVFERLPR--ENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRP 375

Query: 366 NSITFTAILTACSYGGLVEEGKSLFKSMKSLYNLSPCIEHYGCMVDLLGRAGLLQEAKEL 425
           + + +  +LTACS+GGLVEEG+  F  M S+  L P IEHYGCMVDLLGR+GLL EA+E 
Sbjct: 376 SDVAYINLLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEF 435

Query: 426 IEMMPMKPNAVIWGALLKACQIHGDFFLAGQIGTHLVEVDSDHSGRYIQLATILAAEGKW 485
           I  MP+KP+ VIW ALL AC++ G+  +  ++   L+++    SG Y+ L+ + A++G W
Sbjct: 436 ILNMPIKPDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNW 495

Query: 486 KEAAELRLKMKKLRVLIPPGKSLITLDGVVHEFLAGHQDHPQMKQIHQKLNQVAERLRKE 545
            E +E+RL+MK+  +   PG SLI +DGV+HEF+     HP+ K+I+  L +++++LR  
Sbjct: 496 SEVSEMRLRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLA 555

Query: 546 GYKPVTKDLWLDLENDEKETVMAQHSEKLAIAFGLINMKPGTTIRVVKNLRVCGDCHDVA 605
           GY+P+T  + L+LE ++KE V+  HSEK+A AFGLI+  PG  IR+VKNLR+C DCH   
Sbjct: 556 GYRPITTQVLLNLEEEDKENVLHYHSEKIATAFGLISTSPGKPIRIVKNLRICEDCHSSI 615

Query: 606 KLVSRIYKRDIVMRDRVRFHHFRDGRCSCRDYW 617
           KL+S++YKR I +RDR RFHHF+DG CSC DYW
Sbjct: 616 KLISKVYKRKITVRDRKRFHHFQDGSCSCMDYW 646

BLAST of Moc01g28310 vs. ExPASy Swiss-Prot
Match: Q9FG16 (Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H88 PE=3 SV=1)

HSP 1 Score: 515.0 bits (1325), Expect = 1.2e-144
Identity = 261/609 (42.86%), Positives = 399/609 (65.52%), Query Frame = 0

Query: 14  MSLLENCSTMKHLKQIHAQMIKTETATDPKLATKLLTLCSTPHFGN-----LHYAQMVFN 73
           ++LL++CS+   LK IH  +++T   +D  +A++LL LC      N     L YA  +F+
Sbjct: 16  LALLQSCSSFSDLKIIHGFLLRTHLISDVFVASRLLALCVDDSTFNKPTNLLGYAYGIFS 75

Query: 74  GITRPTTFMWNAIIRAYSDSNEPKIAFLLYCQMLSSSVPPNSHTFPFLLKSCRDLSAMGE 133
            I  P  F++N +IR +S   EP  AF  Y QML S + P++ TFPFL+K+  ++  +  
Sbjct: 76  QIQNPNLFVFNLLIRCFSTGAEPSKAFGFYTQMLKSRIWPDNITFPFLIKASSEMECVLV 135

Query: 134 ALQVHGLIVKLGFGSDVFAMNSLIRAYTLCGDIQYARQLFDHIPERDVVSWNTMVDGYIK 193
             Q H  IV+ GF +DV+  NSL+  Y  CG I  A ++F  +  RDVVSW +MV GY K
Sbjct: 136 GEQTHSQIVRFGFQNDVYVENSLVHMYANCGFIAAAGRIFGQMGFRDVVSWTSMVAGYCK 195

Query: 194 FGDVKSAYGVFLEMPLKNVVSWTSMISGLVEAGLGVEALELCCEMQRAGFELDAVAVASM 253
            G V++A  +F EMP +N+ +W+ MI+G  +     +A++L   M+R G   +   + S+
Sbjct: 196 CGMVENAREMFDEMPHRNLFTWSIMINGYAKNNCFEKAIDLFEFMKREGVVANETVMVSV 255

Query: 254 LTACANLGALEQGRWLHFYVLNNGVHLDRVIACALVNMYVKCGDLEEALQVFAELKASLK 313
           +++CA+LGALE G   + YV+ + + ++ ++  ALV+M+ +CGD+E+A+ VF  L  +  
Sbjct: 256 ISSCAHLGALEFGERAYEYVVKSHMTVNLILGTALVDMFWRCGDIEKAIHVFEGLPET-- 315

Query: 314 DVHVWTAMIEGFAINGRGVEALEWFNKMQREGTRPNSITFTAILTACSYGGLVEEGKSLF 373
           D   W+++I+G A++G   +A+ +F++M   G  P  +TFTA+L+ACS+GGLVE+G  ++
Sbjct: 316 DSLSWSSIIKGLAVHGHAHKAMHYFSQMISLGFIPRDVTFTAVLSACSHGGLVEKGLEIY 375

Query: 374 KSMKSLYNLSPCIEHYGCMVDLLGRAGLLQEAKELIEMMPMKPNAVIWGALLKACQIHGD 433
           ++MK  + + P +EHYGC+VD+LGRAG L EA+  I  M +KPNA I GALL AC+I+ +
Sbjct: 376 ENMKKDHGIEPRLEHYGCIVDMLGRAGKLAEAENFILKMHVKPNAPILGALLGACKIYKN 435

Query: 434 FFLAGQIGTHLVEVDSDHSGRYIQLATILAAEGKWKEAAELRLKMKKLRVLIPPGKSLIT 493
             +A ++G  L++V  +HSG Y+ L+ I A  G+W +   LR  MK+  V  PPG SLI 
Sbjct: 436 TEVAERVGNMLIKVKPEHSGYYVLLSNIYACAGQWDKIESLRDMMKEKLVKKPPGWSLIE 495

Query: 494 LDGVVHEFLAG-HQDHPQMKQIHQKLNQVAERLRKEGYKPVTKDLWLDLENDEKETVMAQ 553
           +DG +++F  G  Q HP+M +I +K  ++  ++R  GYK  T D + D++ +EKE+ +  
Sbjct: 496 IDGKINKFTMGDDQKHPEMGKIRRKWEEILGKIRLIGYKGNTGDAFFDVDEEEKESSIHM 555

Query: 554 HSEKLAIAFGLINMKPGTTIRVVKNLRVCGDCHDVAKLVSRIYKRDIVMRDRVRFHHFRD 613
           HSEKLAIA+G++  KPGTTIR+VKNLRVC DCH V KL+S +Y R++++RDR RFHHFR+
Sbjct: 556 HSEKLAIAYGMMKTKPGTTIRIVKNLRVCEDCHTVTKLISEVYGRELIVRDRNRFHHFRN 615

Query: 614 GRCSCRDYW 617
           G CSCRDYW
Sbjct: 616 GVCSCRDYW 622

BLAST of Moc01g28310 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 512.7 bits (1319), Expect = 5.8e-144
Identity = 268/716 (37.43%), Positives = 412/716 (57.54%), Query Frame = 0

Query: 5   TAESPWAQTMSLLENCSTMKHLKQIHAQMIKTETATDPKLATKLLTLCSTPHFGNLHYAQ 64
           T  +  ++ +SL+E C +++ LKQ H  MI+T T +DP  A+KL  + +   F +L YA+
Sbjct: 25  TTNNERSRHISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYAR 84

Query: 65  MVFNGITRPTTFMWNAIIRAYSDSNEPKIAFLLYCQMLS-SSVPPNSHTFPFLLKSCRDL 124
            VF+ I +P +F WN +IRAY+   +P ++   +  M+S S   PN +TFPFL+K+  ++
Sbjct: 85  KVFDEIPKPNSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEV 144

Query: 125 SAMGEALQVHGLIVKLGFGSDVFAMNSLIRAYTLCGDIQYARQLFDHIPERDVVSWNTMV 184
           S++     +HG+ VK   GSDVF  NSLI  Y  CGD+  A ++F  I E+DVVSWN+M+
Sbjct: 145 SSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMI 204

Query: 185 DGYIKFG----------------------------------------------------- 244
           +G+++ G                                                     
Sbjct: 205 NGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVN 264

Query: 245 ------------------------------------------------DVKSAYGVFLEM 304
                                                           D ++A  V   M
Sbjct: 265 VNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSM 324

Query: 305 PLKNVVSWTSMISGLVEAGLGVEALELCCEMQ-RAGFELDAVAVASMLTACANLGALEQG 364
           P K++V+W ++IS   + G   EAL +  E+Q +   +L+ + + S L+ACA +GALE G
Sbjct: 325 PQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELG 384

Query: 365 RWLHFYVLNNGVHLDRVIACALVNMYVKCGDLEEALQVFAELKASLKDVHVWTAMIEGFA 424
           RW+H Y+  +G+ ++  +  AL++MY KCGDLE++ +VF  ++   +DV VW+AMI G A
Sbjct: 385 RWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEK--RDVFVWSAMIGGLA 444

Query: 425 INGRGVEALEWFNKMQREGTRPNSITFTAILTACSYGGLVEEGKSLFKSMKSLYNLSPCI 484
           ++G G EA++ F KMQ    +PN +TFT +  ACS+ GLV+E +SLF  M+S Y + P  
Sbjct: 445 MHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEE 504

Query: 485 EHYGCMVDLLGRAGLLQEAKELIEMMPMKPNAVIWGALLKACQIHGDFFLAGQIGTHLVE 544
           +HY C+VD+LGR+G L++A + IE MP+ P+  +WGALL AC+IH +  LA    T L+E
Sbjct: 505 KHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLE 564

Query: 545 VDSDHSGRYIQLATILAAEGKWKEAAELRLKMKKLRVLIPPGKSLITLDGVVHEFLAGHQ 604
           ++  + G ++ L+ I A  GKW+  +ELR  M+   +   PG S I +DG++HEFL+G  
Sbjct: 565 LEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDN 624

Query: 605 DHPQMKQIHQKLNQVAERLRKEGYKP-VTKDLWLDLENDEKETVMAQHSEKLAIAFGLIN 617
            HP  ++++ KL++V E+L+  GY+P +++ L +  E + KE  +  HSEKLAI +GLI+
Sbjct: 625 AHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLIS 684

BLAST of Moc01g28310 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 503.4 bits (1295), Expect = 3.5e-141
Identity = 269/708 (37.99%), Positives = 397/708 (56.07%), Query Frame = 0

Query: 13  TMSLLENCSTMKHLKQIHAQMIKTETATDPKLATKLLTLC-STPHFGNLHYAQMVFNGIT 72
           ++SLL NC T++ L+ IHAQMIK          +KL+  C  +PHF  L YA  VF  I 
Sbjct: 36  SLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQ 95

Query: 73  RPTTFMWNAIIRAYSDSNEPKIAFLLYCQMLSSSVPPNSHTFPFLLKSCRDLSAMGEALQ 132
            P   +WN + R ++ S++P  A  LY  M+S  + PNS+TFPF+LKSC    A  E  Q
Sbjct: 96  EPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQ 155

Query: 133 VHGLIVKLGFGSDVFAMNS-------------------------------LIRAYTLCGD 192
           +HG ++KLG   D++   S                               LI+ Y   G 
Sbjct: 156 IHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGY 215

Query: 193 IQYARQLFDHIPERDVVSWNTMVDGYI--------------------------------- 252
           I+ A++LFD IP +DVVSWN M+ GY                                  
Sbjct: 216 IENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSA 275

Query: 253 -------------------------------------KFGDVKSAYGVFLEMPLKNVVSW 312
                                                K G++++A G+F  +P K+V+SW
Sbjct: 276 CAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISW 335

Query: 313 TSMISGLVEAGLGVEALELCCEMQRAGFELDAVAVASMLTACANLGALEQGRWLHFYVLN 372
            ++I G     L  EAL L  EM R+G   + V + S+L ACA+LGA++ GRW+H Y+  
Sbjct: 336 NTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDK 395

Query: 373 --NGVHLDRVIACALVNMYVKCGDLEEALQVFAELKASLKDVHVWTAMIEGFAINGRGVE 432
              GV     +  +L++MY KCGD+E A QVF  +    K +  W AMI GFA++GR   
Sbjct: 396 RLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILH--KSLSSWNAMIFGFAMHGRADA 455

Query: 433 ALEWFNKMQREGTRPNSITFTAILTACSYGGLVEEGKSLFKSMKSLYNLSPCIEHYGCMV 492
           + + F++M++ G +P+ ITF  +L+ACS+ G+++ G+ +F++M   Y ++P +EHYGCM+
Sbjct: 456 SFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMI 515

Query: 493 DLLGRAGLLQEAKELIEMMPMKPNAVIWGALLKACQIHGDFFLAGQIGTHLVEVDSDHSG 552
           DLLG +GL +EA+E+I MM M+P+ VIW +LLKAC++HG+  L      +L++++ ++ G
Sbjct: 516 DLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPG 575

Query: 553 RYIQLATILAAEGKWKEAAELRLKMKKLRVLIPPGKSLITLDGVVHEFLAGHQDHPQMKQ 612
            Y+ L+ I A+ G+W E A+ R  +    +   PG S I +D VVHEF+ G + HP+ ++
Sbjct: 576 SYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNRE 635

Query: 613 IHQKLNQVAERLRKEGYKPVTKDLWLDLENDEKETVMAQHSEKLAIAFGLINMKPGTTIR 617
           I+  L ++   L K G+ P T ++  ++E + KE  +  HSEKLAIAFGLI+ KPGT + 
Sbjct: 636 IYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTKLT 695
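
The percentages in each HSP header are the match counts divided by the alignment length. The following is a minimal Python sketch, not part of the database output, that parses the `count/length (pct%)` fields from the Q9LN01 header line and recomputes the percentages. The function names `parse_hsp_fractions` and `recompute_percent` are hypothetical helpers introduced for illustration, and the pattern does not depend on how a field label is spelled.

```python
import re

# Matches fields such as "Identity = 269/708 (37.99%)" in an HSP header line.
# "Query Frame = 0" has no count/length pair, so it is ignored.
FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+\.\d+)%\)")


def parse_hsp_fractions(header):
    """Return {label: (matches, alignment_length, reported_percent)}."""
    return {
        label: (int(n), int(length), float(pct))
        for label, n, length, pct in FIELD.findall(header)
    }


def recompute_percent(matches, length):
    """Percentage of aligned columns, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


# Header values copied from the Q9LN01 hit.
header = "Identity = 269/708 (37.99%), Positives = 397/708 (56.07%), Query Frame = 0"
fields = parse_hsp_fractions(header)
for label, (n, length, reported) in fields.items():
    print(label, recompute_percent(n, length), reported)
```

Because the alignment length (708) includes gap columns, it can exceed the query length, which is why gapped hits report lower identity than ungapped ones of similar quality.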

BLAST of Moc01g28310 vs. ExPASy TrEMBL
Match: A0A6J1BTS5 (pentatricopeptide repeat-containing protein At5g66520 OS=Momordica charantia OX=3673 GN=LOC111005281 PE=3 SV=1)

HSP 1 Score: 1256.9 bits (3251), Expect = 0.0e+00
Identity = 616/616 (100.00%), Positives = 616/616 (100.00%), Query Frame = 0

Query: 1   MFTLTAESPWAQTMSLLENCSTMKHLKQIHAQMIKTETATDPKLATKLLTLCSTPHFGNL 60
           MFTLTAESPWAQTMSLLENCSTMKHLKQIHAQMIKTETATDPKLATKLLTLCSTPHFGNL
Sbjct: 1   MFTLTAESPWAQTMSLLENCSTMKHLKQIHAQMIKTETATDPKLATKLLTLCSTPHFGNL 60

Query: 61  HYAQMVFNGITRPTTFMWNAIIRAYSDSNEPKIAFLLYCQMLSSSVPPNSHTFPFLLKSC 120
           HYAQMVFNGITRPTTFMWNAIIRAYSDSNEPKIAFLLYCQMLSSSVPPNSHTFPFLLKSC
Sbjct: 61  HYAQMVFNGITRPTTFMWNAIIRAYSDSNEPKIAFLLYCQMLSSSVPPNSHTFPFLLKSC 120

Query: 121 RDLSAMGEALQVHGLIVKLGFGSDVFAMNSLIRAYTLCGDIQYARQLFDHIPERDVVSWN 180
           RDLSAMGEALQVHGLIVKLGFGSDVFAMNSLIRAYTLCGDIQYARQLFDHIPERDVVSWN
Sbjct: 121 RDLSAMGEALQVHGLIVKLGFGSDVFAMNSLIRAYTLCGDIQYARQLFDHIPERDVVSWN 180

Query: 181 TMVDGYIKFGDVKSAYGVFLEMPLKNVVSWTSMISGLVEAGLGVEALELCCEMQRAGFEL 240
           TMVDGYIKFGDVKSAYGVFLEMPLKNVVSWTSMISGLVEAGLGVEALELCCEMQRAGFEL
Sbjct: 181 TMVDGYIKFGDVKSAYGVFLEMPLKNVVSWTSMISGLVEAGLGVEALELCCEMQRAGFEL 240

Query: 241 DAVAVASMLTACANLGALEQGRWLHFYVLNNGVHLDRVIACALVNMYVKCGDLEEALQVF 300
           DAVAVASMLTACANLGALEQGRWLHFYVLNNGVHLDRVIACALVNMYVKCGDLEEALQVF
Sbjct: 241 DAVAVASMLTACANLGALEQGRWLHFYVLNNGVHLDRVIACALVNMYVKCGDLEEALQVF 300

Query: 301 AELKASLKDVHVWTAMIEGFAINGRGVEALEWFNKMQREGTRPNSITFTAILTACSYGGL 360
           AELKASLKDVHVWTAMIEGFAINGRGVEALEWFNKMQREGTRPNSITFTAILTACSYGGL
Sbjct: 301 AELKASLKDVHVWTAMIEGFAINGRGVEALEWFNKMQREGTRPNSITFTAILTACSYGGL 360

Query: 361 VEEGKSLFKSMKSLYNLSPCIEHYGCMVDLLGRAGLLQEAKELIEMMPMKPNAVIWGALL 420
           VEEGKSLFKSMKSLYNLSPCIEHYGCMVDLLGRAGLLQEAKELIEMMPMKPNAVIWGALL
Sbjct: 361 VEEGKSLFKSMKSLYNLSPCIEHYGCMVDLLGRAGLLQEAKELIEMMPMKPNAVIWGALL 420

Query: 421 KACQIHGDFFLAGQIGTHLVEVDSDHSGRYIQLATILAAEGKWKEAAELRLKMKKLRVLI 480
           KACQIHGDFFLAGQIGTHLVEVDSDHSGRYIQLATILAAEGKWKEAAELRLKMKKLRVLI
Sbjct: 421 KACQIHGDFFLAGQIGTHLVEVDSDHSGRYIQLATILAAEGKWKEAAELRLKMKKLRVLI 480

Query: 481 PPGKSLITLDGVVHEFLAGHQDHPQMKQIHQKLNQVAERLRKEGYKPVTKDLWLDLENDE 540
           PPGKSLITLDGVVHEFLAGHQDHPQMKQIHQKLNQVAERLRKEGYKPVTKDLWLDLENDE
Sbjct: 481 PPGKSLITLDGVVHEFLAGHQDHPQMKQIHQKLNQVAERLRKEGYKPVTKDLWLDLENDE 540

Query: 541 KETVMAQHSEKLAIAFGLINMKPGTTIRVVKNLRVCGDCHDVAKLVSRIYKRDIVMRDRV 600
           KETVMAQHSEKLAIAFGLINMKPGTTIRVVKNLRVCGDCHDVAKLVSRIYKRDIVMRDRV
Sbjct: 541 KETVMAQHSEKLAIAFGLINMKPGTTIRVVKNLRVCGDCHDVAKLVSRIYKRDIVMRDRV 600

Query: 601 RFHHFRDGRCSCRDYW 617
           RFHHFRDGRCSCRDYW
Sbjct: 601 RFHHFRDGRCSCRDYW 616

BLAST of Moc01g28310 vs. ExPASy TrEMBL
Match: A0A6J1IT43 (pentatricopeptide repeat-containing protein At5g66520 OS=Cucurbita maxima OX=3661 GN=LOC111478422 PE=3 SV=1)

HSP 1 Score: 1038.9 bits (2685), Expect = 8.6e-300
Identity = 503/621 (81.00%), Positives = 560/621 (90.18%), Query Frame = 0

Query: 1   MFTLTAESP----WAQTMSLLENCSTMKHLKQIHAQMIKTETATDPKLATKLLTLCSTPH 60
           MF L AESP    WAQTMSLLENCS MK LK+IHAQMI+T TAT+PKLATKLLTLC++PH
Sbjct: 1   MFALKAESPMQSTWAQTMSLLENCSNMKQLKEIHAQMIRTGTATEPKLATKLLTLCTSPH 60

Query: 61  FGNLHYAQMVFNGITRPTTFMWNAIIRAYSDSNEPKIAFLLYCQMLSSSVPPNSHTFPFL 120
           FG+LHYAQ VFNGI+ PTTFMWNA+IRAYS+SNEP++AFLLY QMLSSSVP NS+TFPFL
Sbjct: 61  FGDLHYAQRVFNGISSPTTFMWNAMIRAYSNSNEPELAFLLYRQMLSSSVPHNSYTFPFL 120

Query: 121 LKSCRDLSAMGEALQVHGLIVKLGFGSDVFAMNSLIRAYTLCGDIQYARQLFDHIPERDV 180
           LK+CR+ SAM EALQVHGL++KLGFGSDVFA+N+L+  Y LCGDIQYARQLFD+IPERD+
Sbjct: 121 LKACRNFSAMSEALQVHGLVIKLGFGSDVFALNALLHVYALCGDIQYARQLFDNIPERDI 180

Query: 181 VSWNTMVDGYIKFGDVKSAYGVFLEMPLKNVVSWTSMISGLVEAGLGVEALELCCEMQRA 240
           VSWN M+DGYIK GDVK+AYGVFL+MPLKNVVSWTS+ISGLVEAGL VEAL LC EMQ A
Sbjct: 181 VSWNIMIDGYIKSGDVKTAYGVFLDMPLKNVVSWTSLISGLVEAGLNVEALSLCHEMQNA 240

Query: 241 GFELDAVAVASMLTACANLGALEQGRWLHFYVLNNGVHLDRVIACALVNMYVKCGDLEEA 300
           GFELD VA+AS+LTACANLGAL+QGRWLHFYVLNNGVH+DRVI CALVNMY+KCGD+EEA
Sbjct: 241 GFELDGVAIASLLTACANLGALDQGRWLHFYVLNNGVHVDRVIGCALVNMYLKCGDMEEA 300

Query: 301 LQVFAELKASLKDVHVWTAMIEGFAINGRGVEALEWFNKMQREGTRPNSITFTAILTACS 360
           LQ F +LK   KDV+VWTAMI+GFAI+GRGVEALEWF +M REG RPNSITFTA+L ACS
Sbjct: 301 LQEFGKLKGDQKDVYVWTAMIDGFAIHGRGVEALEWFKRMLREGIRPNSITFTAVLRACS 360

Query: 361 YGGLVEEGKSLFKSMKSLYNLSPCIEHYGCMVDLLGRAGLLQEAKELIEMMPMKPNAVIW 420
           Y GLVEEGK LF+SM S+Y LSP IEHYGCMVDLLGRAGLL+EAKELI+ MPMKPNA+IW
Sbjct: 361 YAGLVEEGKVLFESMMSVYILSPSIEHYGCMVDLLGRAGLLEEAKELIKTMPMKPNAIIW 420

Query: 421 GALLKACQIHGDFFLAGQIGTHLVEVDSDHSGRYIQLATILAAEGKWKEAAELRLKMKKL 480
           GALLKAC+IH DF + GQIG HLVEVDSDHSGRYIQLATILAAEGKWKEAAE+RLKMK L
Sbjct: 421 GALLKACRIHRDFLVGGQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNL 480

Query: 481 RVLIPPGKSLITLDGVVHEFLAGHQDHPQMKQIHQKLNQVAERLRK-EGYKPVTKDLWLD 540
           RV IPPGKS ITL+GVVHEFLAGHQDHPQM+QI  KLNQV ERLR+ EGY+P TKDL LD
Sbjct: 481 RVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQILHKLNQVVERLRQHEGYEPATKDLLLD 540

Query: 541 LENDEKETVMAQHSEKLAIAFGLINMKPGTTIRVVKNLRVCGDCHDVAKLVSRIYKRDIV 600
           LEN+ KET +AQHSEKLAIAFGLIN KPG+TIRVVKNLRVC DCH VAKL+SRIY+R+I+
Sbjct: 541 LENEAKETAVAQHSEKLAIAFGLINTKPGSTIRVVKNLRVCEDCHVVAKLISRIYRREII 600

Query: 601 MRDRVRFHHFRDGRCSCRDYW 617
           MRDRVRFHHFR G CSC+DYW
Sbjct: 601 MRDRVRFHHFRGGSCSCKDYW 621

BLAST of Moc01g28310 vs. ExPASy TrEMBL
Match: A0A6J1GDX2 (pentatricopeptide repeat-containing protein At5g66520 OS=Cucurbita moschata OX=3662 GN=LOC111453066 PE=3 SV=1)

HSP 1 Score: 1033.1 bits (2670), Expect = 4.7e-298
Identity = 499/621 (80.35%), Positives = 559/621 (90.02%), Query Frame = 0

Query: 1   MFTLTAESP----WAQTMSLLENCSTMKHLKQIHAQMIKTETATDPKLATKLLTLCSTPH 60
           MF L AESP    WAQTMSLLENCS MK LK+IHAQMI+T TAT+PKLATKLLTLC +PH
Sbjct: 1   MFALKAESPVQSTWAQTMSLLENCSNMKQLKEIHAQMIRTGTATEPKLATKLLTLCISPH 60

Query: 61  FGNLHYAQMVFNGITRPTTFMWNAIIRAYSDSNEPKIAFLLYCQMLSSSVPPNSHTFPFL 120
           FG+LHYAQ VFNGI+ PTTFMWNA+IRAYS+SNEP++AFLLY QMLSSSVP NS+TFPFL
Sbjct: 61  FGDLHYAQRVFNGISSPTTFMWNAMIRAYSNSNEPELAFLLYRQMLSSSVPHNSYTFPFL 120

Query: 121 LKSCRDLSAMGEALQVHGLIVKLGFGSDVFAMNSLIRAYTLCGDIQYARQLFDHIPERDV 180
           LK+CR+ SAM EALQVHGL++KLGFGSDVFA+N+L+  Y LCGDIQYARQLFD+IPERD+
Sbjct: 121 LKACRNFSAMSEALQVHGLVIKLGFGSDVFALNALLHVYALCGDIQYARQLFDNIPERDI 180

Query: 181 VSWNTMVDGYIKFGDVKSAYGVFLEMPLKNVVSWTSMISGLVEAGLGVEALELCCEMQRA 240
           VSWN M+DGYIK GDVK+AYGVFL+MPLKNVVSWTS+ISGLVEAGL VEAL LC EMQ A
Sbjct: 181 VSWNIMIDGYIKSGDVKTAYGVFLDMPLKNVVSWTSLISGLVEAGLNVEALSLCHEMQNA 240

Query: 241 GFELDAVAVASMLTACANLGALEQGRWLHFYVLNNGVHLDRVIACALVNMYVKCGDLEEA 300
           GFELD VA+AS+LTACANLGAL+QGRWLHFYVLNNGVH+DRVI CALVNMY+KCGD+EEA
Sbjct: 241 GFELDGVAIASLLTACANLGALDQGRWLHFYVLNNGVHVDRVIGCALVNMYLKCGDMEEA 300

Query: 301 LQVFAELKASLKDVHVWTAMIEGFAINGRGVEALEWFNKMQREGTRPNSITFTAILTACS 360
           L+ F +LK   KDV+VWTAMI+GFAI+GRGVEALEWF +M REG RPNSITFTA+L ACS
Sbjct: 301 LREFGKLKGDQKDVYVWTAMIDGFAIHGRGVEALEWFKRMLREGIRPNSITFTAVLRACS 360

Query: 361 YGGLVEEGKSLFKSMKSLYNLSPCIEHYGCMVDLLGRAGLLQEAKELIEMMPMKPNAVIW 420
           Y GLVEEGK LF+SM S+YNLSP IEHYGCMVDLLGRAGLL+EAKELI+ MPM+PNA+IW
Sbjct: 361 YAGLVEEGKVLFESMMSVYNLSPSIEHYGCMVDLLGRAGLLEEAKELIKTMPMEPNAIIW 420

Query: 421 GALLKACQIHGDFFLAGQIGTHLVEVDSDHSGRYIQLATILAAEGKWKEAAELRLKMKKL 480
           GALLKAC+IH DF + GQIG HLVEVDSDHSGRYIQLATILAAEGKWKEAAE+RLKMK L
Sbjct: 421 GALLKACRIHRDFLVGGQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNL 480

Query: 481 RVLIPPGKSLITLDGVVHEFLAGHQDHPQMKQIHQKLNQVAERLRK-EGYKPVTKDLWLD 540
           R+ IPPGKS ITL+GVVHEFLAGHQDHPQM+QI  KLNQV ERLR+ EGY+P TKDL LD
Sbjct: 481 RLPIPPGKSSITLNGVVHEFLAGHQDHPQMEQILHKLNQVVERLRQHEGYEPATKDLLLD 540

Query: 541 LENDEKETVMAQHSEKLAIAFGLINMKPGTTIRVVKNLRVCGDCHDVAKLVSRIYKRDIV 600
           LE++ KET +AQHSEKLAIAFGLIN KPG+TIRVVKNLRVC DCH VAKL+S+IY+R+I+
Sbjct: 541 LESEAKETAVAQHSEKLAIAFGLINTKPGSTIRVVKNLRVCEDCHVVAKLISQIYRREII 600

Query: 601 MRDRVRFHHFRDGRCSCRDYW 617
           MRDRVRFHHFR G CSC DYW
Sbjct: 601 MRDRVRFHHFRGGNCSCNDYW 621

BLAST of Moc01g28310 vs. ExPASy TrEMBL
Match: A0A0A0KKE0 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G502750 PE=3 SV=1)

HSP 1 Score: 1006.1 bits (2600), Expect = 6.2e-290
Identity = 485/617 (78.61%), Positives = 544/617 (88.17%), Query Frame = 0

Query: 1   MFTLTAESPWAQTMSLLENCSTMKHLKQIHAQMIKTETATDPKLATKLLTLCSTPHFGNL 60
           MFTL AESP   T +LLENCS MK LKQI AQMIKT   T+PKLATK LTLC++PH G+L
Sbjct: 1   MFTLNAESPLQSTWALLENCSNMKQLKQIQAQMIKTAIITEPKLATKFLTLCTSPHVGDL 60

Query: 61  HYAQMVFNGITRPTTFMWNAIIRAYSDSNEPKIAFLLYCQMLSSSVPPNSHTFPFLLKSC 120
            YAQ VFNGIT P TFMWNAIIRAYS+S+EP++AFL Y QMLSSSVP NS+TFPFLL++C
Sbjct: 61  LYAQRVFNGITSPNTFMWNAIIRAYSNSDEPELAFLSYQQMLSSSVPHNSYTFPFLLRAC 120

Query: 121 RDLSAMGEALQVHGLIVKLGFGSDVFAMNSLIRAYTLCGDIQYARQLFDHIPERDVVSWN 180
           R+L AMGEALQVHGL++KLGFGSDVFA+N+L+  Y LCG+I  ARQLFD+IPERD VSWN
Sbjct: 121 RNLLAMGEALQVHGLVIKLGFGSDVFALNALLHVYALCGEIHCARQLFDNIPERDAVSWN 180

Query: 181 TMVDGYIKFGDVKSAYGVFLEMPLKNVVSWTSMISGLVEAGLGVEALELCCEMQRAGFEL 240
            M+DGYIK GDVK+AYGVFL+MPLKNVVSWTS+ISGLVEAG  VEAL LC EMQ AGFEL
Sbjct: 181 IMIDGYIKSGDVKTAYGVFLDMPLKNVVSWTSLISGLVEAGQSVEALSLCYEMQNAGFEL 240

Query: 241 DAVAVASMLTACANLGALEQGRWLHFYVLNNGVHLDRVIACALVNMYVKCGDLEEALQVF 300
           D VA+AS+LTACANLGAL+QGRWLHFYVLNNGV +DRVI CALVNMYVKCGD+EEAL VF
Sbjct: 241 DGVAIASLLTACANLGALDQGRWLHFYVLNNGVDVDRVIGCALVNMYVKCGDMEEALSVF 300

Query: 301 AELKASLKDVHVWTAMIEGFAINGRGVEALEWFNKMQREGTRPNSITFTAILTACSYGGL 360
            +LK + KDV++WTAMI+GFAI+GRGVEALEWFN+M+REG RPNSITFTA+L ACSYGGL
Sbjct: 301 GKLKGNQKDVYIWTAMIDGFAIHGRGVEALEWFNRMRREGIRPNSITFTAVLRACSYGGL 360

Query: 361 VEEGKSLFKSMKSLYNLSPCIEHYGCMVDLLGRAGLLQEAKELIEMMPMKPNAVIWGALL 420
           VEEGK LFKSMK  YN++P IEHYGCMVDLLGR+G L EAKELI+ MPMKP+AVIWGALL
Sbjct: 361 VEEGKELFKSMKCFYNVNPSIEHYGCMVDLLGRSGRLDEAKELIKKMPMKPSAVIWGALL 420

Query: 421 KACQIHGDFFLAGQIGTHLVEVDSDHSGRYIQLATILAAEGKWKEAAELRLKMKKLRVLI 480
           KAC IH DF L  Q+G HLVEVDSDHSGRYIQLATILAAEGKWKEAAE+RLKMK L V I
Sbjct: 421 KACWIHRDFLLGSQVGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKSLGVPI 480

Query: 481 PPGKSLITLDGVVHEFLAGHQDHPQMKQIHQKLNQVAERLRK-EGYKPVTKDLWLDLEND 540
            PGKS +TL+G+VHEFLAGHQDHPQM+QI  KL Q+AERLR+ EGY+P TKDL LDLEN+
Sbjct: 481 SPGKSSVTLNGIVHEFLAGHQDHPQMEQIQLKLKQIAERLRQDEGYEPATKDLLLDLENE 540

Query: 541 EKETVMAQHSEKLAIAFGLINMKPGTTIRVVKNLRVCGDCHDVAKLVSRIYKRDIVMRDR 600
           EKET MAQHSEKLAIAFGLIN KPGTTIRV+KNLR+C DCH VAKLVS+IY R+I+MRDR
Sbjct: 541 EKETAMAQHSEKLAIAFGLINTKPGTTIRVIKNLRICRDCHTVAKLVSQIYSREIIMRDR 600

Query: 601 VRFHHFRDGRCSCRDYW 617
           VRFHHFRDG CSC+DYW
Sbjct: 601 VRFHHFRDGSCSCKDYW 617

BLAST of Moc01g28310 vs. ExPASy TrEMBL
Match: A0A5D3CKZ8 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G002030 PE=3 SV=1)

HSP 1 Score: 1000.0 bits (2584), Expect = 4.4e-288
Identity = 484/617 (78.44%), Positives = 543/617 (88.01%), Query Frame = 0

Query: 1   MFTLTAESPWAQTMSLLENCSTMKHLKQIHAQMIKTETATDPKLATKLLTLCSTPHFGNL 60
           MFTL AESP   T +LLENCS MK LKQI AQMIKT   ++PKLATK LTLC++PH G+L
Sbjct: 1   MFTLKAESPLQSTWTLLENCSNMKQLKQIQAQMIKTAILSEPKLATKFLTLCTSPHVGDL 60

Query: 61  HYAQMVFNGITRPTTFMWNAIIRAYSDSNEPKIAFLLYCQMLSSSVPPNSHTFPFLLKSC 120
            YAQ VFNGIT P T MWNAIIRAYS+S EP++AFLLY QMLSSSVP NS+TFPFLLK+C
Sbjct: 61  LYAQRVFNGITSPNTVMWNAIIRAYSNSKEPELAFLLYQQMLSSSVPHNSYTFPFLLKAC 120

Query: 121 RDLSAMGEALQVHGLIVKLGFGSDVFAMNSLIRAYTLCGDIQYARQLFDHIPERDVVSWN 180
           R+LSA+GEALQVHGL++KLGFGSDVFA+N+L+  Y LCG+I+YARQ+FD+IPERD VSWN
Sbjct: 121 RNLSALGEALQVHGLVIKLGFGSDVFALNALLHVYALCGEIRYARQMFDNIPERDAVSWN 180

Query: 181 TMVDGYIKFGDVKSAYGVFLEMPLKNVVSWTSMISGLVEAGLGVEALELCCEMQRAGFEL 240
            M+DGYIK GDVK+AYG+FL+MP KNVVSWTS+ISGLV AGL V+AL LC EMQ AGFEL
Sbjct: 181 IMIDGYIKSGDVKTAYGIFLDMPSKNVVSWTSLISGLVGAGLSVKALSLCYEMQNAGFEL 240

Query: 241 DAVAVASMLTACANLGALEQGRWLHFYVLNNGVHLDRVIACALVNMYVKCGDLEEALQVF 300
           D VA+A +LTACANLGAL+QGRWLHFYVLNNGV +DRVI CALVNMYVKCGD+EEAL+VF
Sbjct: 241 DGVAIACLLTACANLGALDQGRWLHFYVLNNGVDVDRVIGCALVNMYVKCGDMEEALRVF 300

Query: 301 AELKASLKDVHVWTAMIEGFAINGRGVEALEWFNKMQREGTRPNSITFTAILTACSYGGL 360
            +LK   KDV +WTAMI+GFAI+GRGVEALEWF+ M+REG RPNSITFTA+L ACSYGGL
Sbjct: 301 GKLKGDQKDVCIWTAMIDGFAIHGRGVEALEWFDLMRREGIRPNSITFTAVLRACSYGGL 360

Query: 361 VEEGKSLFKSMKSLYNLSPCIEHYGCMVDLLGRAGLLQEAKELIEMMPMKPNAVIWGALL 420
           VEEGK LFKSMK LYNLSP IEHYGCMVDLLGR+G L EAKELI+ MPMKPNAVIWGA L
Sbjct: 361 VEEGKELFKSMKCLYNLSPSIEHYGCMVDLLGRSGRLNEAKELIKNMPMKPNAVIWGAFL 420

Query: 421 KACQIHGDFFLAGQIGTHLVEVDSDHSGRYIQLATILAAEGKWKEAAELRLKMKKLRVLI 480
           KAC IH DF +  QIG HLVEVDSDHSGRYIQLATILAA+GKWKEAAE+RLKMK L V I
Sbjct: 421 KACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAQGKWKEAAEVRLKMKNLGVPI 480

Query: 481 PPGKSLITLDGVVHEFLAGHQDHPQMKQIHQKLNQVAERLRK-EGYKPVTKDLWLDLEND 540
            PGKS ITL+G+VHEFLAGHQDHPQM+QIH KL Q+AERLR+ EGY+P TKDL LDLEN+
Sbjct: 481 SPGKSSITLNGIVHEFLAGHQDHPQMEQIHLKLKQIAERLRQDEGYEPATKDLLLDLENE 540

Query: 541 EKETVMAQHSEKLAIAFGLINMKPGTTIRVVKNLRVCGDCHDVAKLVSRIYKRDIVMRDR 600
           EKET +AQHSEKLAIAFGLIN KPGTTIRVVKNLR+C DCH VAKLVS+IY R+I+MRDR
Sbjct: 541 EKETAIAQHSEKLAIAFGLINTKPGTTIRVVKNLRICRDCHTVAKLVSQIYCREIIMRDR 600

Query: 601 VRFHHFRDGRCSCRDYW 617
           VRFHHFRDG CSC+DYW
Sbjct: 601 VRFHHFRDGSCSCKDYW 617

BLAST of Moc01g28310 vs. TAIR 10
Match: AT5G66520.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 670.2 bits (1728), Expect = 1.5e-192
Identity = 329/607 (54.20%), Positives = 433/607 (71.33%), Query Frame = 0

Query: 12  QTMSLLENCSTMKHLKQIHAQMIKTETATDPKLATKLLTLC-STPHFGNLHYAQMVFNGI 71
           +TMS L+ CS  + LKQIHA+M+KT    D    TK L+ C S+     L YAQ+VF+G 
Sbjct: 16  ETMSCLQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGF 75

Query: 72  TRPTTFMWNAIIRAYSDSNEPKIAFLLYCQMLSSSVPPNSHTFPFLLKSCRDLSAMGEAL 131
            RP TF+WN +IR +S S+EP+ + LLY +ML SS P N++TFP LLK+C +LSA  E  
Sbjct: 76  DRPDTFLWNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETT 135

Query: 132 QVHGLIVKLGFGSDVFAMNSLIRAYTLCGDIQYARQLFDHIPERDVVSWNTMVDGYIKFG 191
           Q+H  I KLG+ +DV+A+NSLI +Y + G+ + A  LFD IPE D VSWN+++ GY+K G
Sbjct: 136 QIHAQITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKAG 195

Query: 192 DVKSAYGVFLEMPLKNVVSWTSMISGLVEAGLGVEALELCCEMQRAGFELDAVAVASMLT 251
            +  A  +F +M  KN +SWT+MISG V+A +  EAL+L  EMQ +  E D V++A+ L+
Sbjct: 196 KMDIALTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLANALS 255

Query: 252 ACANLGALEQGRWLHFYVLNNGVHLDRVIACALVNMYVKCGDLEEALQVFAELKASLKDV 311
           ACA LGALEQG+W+H Y+    + +D V+ C L++MY KCG++EEAL+VF  +K   K V
Sbjct: 256 ACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKK--KSV 315

Query: 312 HVWTAMIEGFAINGRGVEALEWFNKMQREGTRPNSITFTAILTACSYGGLVEEGKSLFKS 371
             WTA+I G+A +G G EA+  F +MQ+ G +PN ITFTA+LTACSY GLVEEGK +F S
Sbjct: 316 QAWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYS 375

Query: 372 MKSLYNLSPCIEHYGCMVDLLGRAGLLQEAKELIEMMPMKPNAVIWGALLKACQIHGDFF 431
           M+  YNL P IEHYGC+VDLLGRAGLL EAK  I+ MP+KPNAVIWGALLKAC+IH +  
Sbjct: 376 MERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIE 435

Query: 432 LAGQIGTHLVEVDSDHSGRYIQLATILAAEGKWKEAAELRLKMKKLRVLIPPGKSLITLD 491
           L  +IG  L+ +D  H GRY+  A I A + KW +AAE R  MK+  V   PG S I+L+
Sbjct: 436 LGEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTISLE 495

Query: 492 GVVHEFLAGHQDHPQMKQIHQKLNQVAERLRKEGYKPVTKDLWLDL-ENDEKETVMAQHS 551
           G  HEFLAG + HP++++I  K   +  +L + GY P  +++ LDL ++DE+E ++ QHS
Sbjct: 496 GTTHEFLAGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIVHQHS 555

Query: 552 EKLAIAFGLINMKPGTTIRVVKNLRVCGDCHDVAKLVSRIYKRDIVMRDRVRFHHFRDGR 611
           EKLAI +GLI  KPGT IR++KNLRVC DCH V KL+S+IYKRDIVMRDR RFHHFRDG+
Sbjct: 556 EKLAITYGLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRFHHFRDGK 615

Query: 612 CSCRDYW 617
           CSC DYW
Sbjct: 616 CSCGDYW 620

BLAST of Moc01g28310 vs. TAIR 10
Match: AT5G48910.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 559.7 bits (1441), Expect = 2.9e-159
Identity = 283/633 (44.71%), Positives = 411/633 (64.93%), Query Frame = 0

Query: 6   AESPWAQTMSL---LENCSTMKHLKQIHAQMIKTETATDPKLATKLLTLCSTP--HFGNL 65
           A SP +   SL   + NC T++ L QIHA  IK+    D   A ++L  C+T   H  +L
Sbjct: 16  ASSPASHPSSLFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHHRDL 75

Query: 66  HYAQMVFNGITRPTTFMWNAIIRAYSDSNEPK--IAFLLYCQMLSSS-VPPNSHTFPFLL 125
            YA  +FN + +   F WN IIR +S+S+E K  IA  L+ +M+S   V PN  TFP +L
Sbjct: 76  DYAHKIFNQMPQRNCFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVL 135

Query: 126 KSCRDLSAMGEALQVHGLIVKLGFGSDVFAMNSLIRAYTLCGDIQYARQLF-DHIPERD- 185
           K+C     + E  Q+HGL +K GFG D F M++L+R Y +CG ++ AR LF  +I E+D 
Sbjct: 136 KACAKTGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDM 195

Query: 186 ------------VVSWNTMVDGYIKFGDVKSAYGVFLEMPLKNVVSWTSMISGLVEAGLG 245
                       +V WN M+DGY++ GD K+A  +F +M  ++VVSW +MISG    G  
Sbjct: 196 VVMTDRRKRDGEIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVSWNTMISGYSLNGFF 255

Query: 246 VEALELCCEMQRAGFELDAVAVASMLTACANLGALEQGRWLHFYVLNNGVHLDRVIACAL 305
            +A+E+  EM++     + V + S+L A + LG+LE G WLH Y  ++G+ +D V+  AL
Sbjct: 256 KDAVEVFREMKKGDIRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGSAL 315

Query: 306 VNMYVKCGDLEEALQVFAELKASLKDVHVWTAMIEGFAINGRGVEALEWFNKMQREGTRP 365
           ++MY KCG +E+A+ VF  L    ++V  W+AMI GFAI+G+  +A++ F KM++ G RP
Sbjct: 316 IDMYSKCGIIEKAIHVFERLPR--ENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRP 375

Query: 366 NSITFTAILTACSYGGLVEEGKSLFKSMKSLYNLSPCIEHYGCMVDLLGRAGLLQEAKEL 425
           + + +  +LTACS+GGLVEEG+  F  M S+  L P IEHYGCMVDLLGR+GLL EA+E 
Sbjct: 376 SDVAYINLLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEF 435

Query: 426 IEMMPMKPNAVIWGALLKACQIHGDFFLAGQIGTHLVEVDSDHSGRYIQLATILAAEGKW 485
           I  MP+KP+ VIW ALL AC++ G+  +  ++   L+++    SG Y+ L+ + A++G W
Sbjct: 436 ILNMPIKPDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNW 495

Query: 486 KEAAELRLKMKKLRVLIPPGKSLITLDGVVHEFLAGHQDHPQMKQIHQKLNQVAERLRKE 545
            E +E+RL+MK+  +   PG SLI +DGV+HEF+     HP+ K+I+  L +++++LR  
Sbjct: 496 SEVSEMRLRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLA 555

Query: 546 GYKPVTKDLWLDLENDEKETVMAQHSEKLAIAFGLINMKPGTTIRVVKNLRVCGDCHDVA 605
           GY+P+T  + L+LE ++KE V+  HSEK+A AFGLI+  PG  IR+VKNLR+C DCH   
Sbjct: 556 GYRPITTQVLLNLEEEDKENVLHYHSEKIATAFGLISTSPGKPIRIVKNLRICEDCHSSI 615

Query: 606 KLVSRIYKRDIVMRDRVRFHHFRDGRCSCRDYW 617
           KL+S++YKR I +RDR RFHHF+DG CSC DYW
Sbjct: 616 KLISKVYKRKITVRDRKRFHHFQDGSCSCMDYW 646

BLAST of Moc01g28310 vs. TAIR 10
Match: AT5G06540.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 515.0 bits (1325), Expect = 8.3e-146
Identity = 261/609 (42.86%), Positives = 399/609 (65.52%), Query Frame = 0

Query: 14  MSLLENCSTMKHLKQIHAQMIKTETATDPKLATKLLTLCSTPHFGN-----LHYAQMVFN 73
           ++LL++CS+   LK IH  +++T   +D  +A++LL LC      N     L YA  +F+
Sbjct: 16  LALLQSCSSFSDLKIIHGFLLRTHLISDVFVASRLLALCVDDSTFNKPTNLLGYAYGIFS 75

Query: 74  GITRPTTFMWNAIIRAYSDSNEPKIAFLLYCQMLSSSVPPNSHTFPFLLKSCRDLSAMGE 133
            I  P  F++N +IR +S   EP  AF  Y QML S + P++ TFPFL+K+  ++  +  
Sbjct: 76  QIQNPNLFVFNLLIRCFSTGAEPSKAFGFYTQMLKSRIWPDNITFPFLIKASSEMECVLV 135

Query: 134 ALQVHGLIVKLGFGSDVFAMNSLIRAYTLCGDIQYARQLFDHIPERDVVSWNTMVDGYIK 193
             Q H  IV+ GF +DV+  NSL+  Y  CG I  A ++F  +  RDVVSW +MV GY K
Sbjct: 136 GEQTHSQIVRFGFQNDVYVENSLVHMYANCGFIAAAGRIFGQMGFRDVVSWTSMVAGYCK 195

Query: 194 FGDVKSAYGVFLEMPLKNVVSWTSMISGLVEAGLGVEALELCCEMQRAGFELDAVAVASM 253
            G V++A  +F EMP +N+ +W+ MI+G  +     +A++L   M+R G   +   + S+
Sbjct: 196 CGMVENAREMFDEMPHRNLFTWSIMINGYAKNNCFEKAIDLFEFMKREGVVANETVMVSV 255

Query: 254 LTACANLGALEQGRWLHFYVLNNGVHLDRVIACALVNMYVKCGDLEEALQVFAELKASLK 313
           +++CA+LGALE G   + YV+ + + ++ ++  ALV+M+ +CGD+E+A+ VF  L  +  
Sbjct: 256 ISSCAHLGALEFGERAYEYVVKSHMTVNLILGTALVDMFWRCGDIEKAIHVFEGLPET-- 315

Query: 314 DVHVWTAMIEGFAINGRGVEALEWFNKMQREGTRPNSITFTAILTACSYGGLVEEGKSLF 373
           D   W+++I+G A++G   +A+ +F++M   G  P  +TFTA+L+ACS+GGLVE+G  ++
Sbjct: 316 DSLSWSSIIKGLAVHGHAHKAMHYFSQMISLGFIPRDVTFTAVLSACSHGGLVEKGLEIY 375

Query: 374 KSMKSLYNLSPCIEHYGCMVDLLGRAGLLQEAKELIEMMPMKPNAVIWGALLKACQIHGD 433
           ++MK  + + P +EHYGC+VD+LGRAG L EA+  I  M +KPNA I GALL AC+I+ +
Sbjct: 376 ENMKKDHGIEPRLEHYGCIVDMLGRAGKLAEAENFILKMHVKPNAPILGALLGACKIYKN 435

Query: 434 FFLAGQIGTHLVEVDSDHSGRYIQLATILAAEGKWKEAAELRLKMKKLRVLIPPGKSLIT 493
             +A ++G  L++V  +HSG Y+ L+ I A  G+W +   LR  MK+  V  PPG SLI 
Sbjct: 436 TEVAERVGNMLIKVKPEHSGYYVLLSNIYACAGQWDKIESLRDMMKEKLVKKPPGWSLIE 495

Query: 494 LDGVVHEFLAG-HQDHPQMKQIHQKLNQVAERLRKEGYKPVTKDLWLDLENDEKETVMAQ 553
           +DG +++F  G  Q HP+M +I +K  ++  ++R  GYK  T D + D++ +EKE+ +  
Sbjct: 496 IDGKINKFTMGDDQKHPEMGKIRRKWEEILGKIRLIGYKGNTGDAFFDVDEEEKESSIHM 555

Query: 554 HSEKLAIAFGLINMKPGTTIRVVKNLRVCGDCHDVAKLVSRIYKRDIVMRDRVRFHHFRD 613
           HSEKLAIA+G++  KPGTTIR+VKNLRVC DCH V KL+S +Y R++++RDR RFHHFR+
Sbjct: 556 HSEKLAIAYGMMKTKPGTTIRIVKNLRVCEDCHTVTKLISEVYGRELIVRDRNRFHHFRN 615

Query: 614 GRCSCRDYW 617
           G CSCRDYW
Sbjct: 616 GVCSCRDYW 622

BLAST of Moc01g28310 vs. TAIR 10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 512.7 bits (1319), Expect = 4.1e-145
Identity = 268/716 (37.43%), Positives = 412/716 (57.54%), Query Frame = 0

Query: 5   TAESPWAQTMSLLENCSTMKHLKQIHAQMIKTETATDPKLATKLLTLCSTPHFGNLHYAQ 64
           T  +  ++ +SL+E C +++ LKQ H  MI+T T +DP  A+KL  + +   F +L YA+
Sbjct: 25  TTNNERSRHISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYAR 84

Query: 65  MVFNGITRPTTFMWNAIIRAYSDSNEPKIAFLLYCQMLS-SSVPPNSHTFPFLLKSCRDL 124
            VF+ I +P +F WN +IRAY+   +P ++   +  M+S S   PN +TFPFL+K+  ++
Sbjct: 85  KVFDEIPKPNSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEV 144

Query: 125 SAMGEALQVHGLIVKLGFGSDVFAMNSLIRAYTLCGDIQYARQLFDHIPERDVVSWNTMV 184
           S++     +HG+ VK   GSDVF  NSLI  Y  CGD+  A ++F  I E+DVVSWN+M+
Sbjct: 145 SSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMI 204

Query: 185 DGYIKFG----------------------------------------------------- 244
           +G+++ G                                                     
Sbjct: 205 NGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVN 264

Query: 245 ------------------------------------------------DVKSAYGVFLEM 304
                                                           D ++A  V   M
Sbjct: 265 VNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSM 324

Query: 305 PLKNVVSWTSMISGLVEAGLGVEALELCCEMQ-RAGFELDAVAVASMLTACANLGALEQG 364
           P K++V+W ++IS   + G   EAL +  E+Q +   +L+ + + S L+ACA +GALE G
Sbjct: 325 PQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELG 384

Query: 365 RWLHFYVLNNGVHLDRVIACALVNMYVKCGDLEEALQVFAELKASLKDVHVWTAMIEGFA 424
           RW+H Y+  +G+ ++  +  AL++MY KCGDLE++ +VF  ++   +DV VW+AMI G A
Sbjct: 385 RWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEK--RDVFVWSAMIGGLA 444

Query: 425 INGRGVEALEWFNKMQREGTRPNSITFTAILTACSYGGLVEEGKSLFKSMKSLYNLSPCI 484
           ++G G EA++ F KMQ    +PN +TFT +  ACS+ GLV+E +SLF  M+S Y + P  
Sbjct: 445 MHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEE 504

Query: 485 EHYGCMVDLLGRAGLLQEAKELIEMMPMKPNAVIWGALLKACQIHGDFFLAGQIGTHLVE 544
           +HY C+VD+LGR+G L++A + IE MP+ P+  +WGALL AC+IH +  LA    T L+E
Sbjct: 505 KHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLE 564

Query: 545 VDSDHSGRYIQLATILAAEGKWKEAAELRLKMKKLRVLIPPGKSLITLDGVVHEFLAGHQ 604
           ++  + G ++ L+ I A  GKW+  +ELR  M+   +   PG S I +DG++HEFL+G  
Sbjct: 565 LEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDN 624

Query: 605 DHPQMKQIHQKLNQVAERLRKEGYKP-VTKDLWLDLENDEKETVMAQHSEKLAIAFGLIN 617
            HP  ++++ KL++V E+L+  GY+P +++ L +  E + KE  +  HSEKLAI +GLI+
Sbjct: 625 AHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLIS 684

BLAST of Moc01g28310 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 503.4 bits (1295), Expect = 2.5e-142
Identity = 269/708 (37.99%), Positives = 397/708 (56.07%), Query Frame = 0

Query: 13  TMSLLENCSTMKHLKQIHAQMIKTETATDPKLATKLLTLC-STPHFGNLHYAQMVFNGIT 72
           ++SLL NC T++ L+ IHAQMIK          +KL+  C  +PHF  L YA  VF  I 
Sbjct: 36  SLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQ 95

Query: 73  RPTTFMWNAIIRAYSDSNEPKIAFLLYCQMLSSSVPPNSHTFPFLLKSCRDLSAMGEALQ 132
            P   +WN + R ++ S++P  A  LY  M+S  + PNS+TFPF+LKSC    A  E  Q
Sbjct: 96  EPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQ 155

Query: 133 VHGLIVKLGFGSDVFAMNS-------------------------------LIRAYTLCGD 192
           +HG ++KLG   D++   S                               LI+ Y   G 
Sbjct: 156 IHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGY 215

Query: 193 IQYARQLFDHIPERDVVSWNTMVDGYI--------------------------------- 252
           I+ A++LFD IP +DVVSWN M+ GY                                  
Sbjct: 216 IENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSA 275

Query: 253 -------------------------------------KFGDVKSAYGVFLEMPLKNVVSW 312
                                                K G++++A G+F  +P K+V+SW
Sbjct: 276 CAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISW 335

Query: 313 TSMISGLVEAGLGVEALELCCEMQRAGFELDAVAVASMLTACANLGALEQGRWLHFYVLN 372
            ++I G     L  EAL L  EM R+G   + V + S+L ACA+LGA++ GRW+H Y+  
Sbjct: 336 NTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDK 395

Query: 373 --NGVHLDRVIACALVNMYVKCGDLEEALQVFAELKASLKDVHVWTAMIEGFAINGRGVE 432
              GV     +  +L++MY KCGD+E A QVF  +    K +  W AMI GFA++GR   
Sbjct: 396 RLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILH--KSLSSWNAMIFGFAMHGRADA 455

Query: 433 ALEWFNKMQREGTRPNSITFTAILTACSYGGLVEEGKSLFKSMKSLYNLSPCIEHYGCMV 492
           + + F++M++ G +P+ ITF  +L+ACS+ G+++ G+ +F++M   Y ++P +EHYGCM+
Sbjct: 456 SFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMI 515

Query: 493 DLLGRAGLLQEAKELIEMMPMKPNAVIWGALLKACQIHGDFFLAGQIGTHLVEVDSDHSG 552
           DLLG +GL +EA+E+I MM M+P+ VIW +LLKAC++HG+  L      +L++++ ++ G
Sbjct: 516 DLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPG 575

Query: 553 RYIQLATILAAEGKWKEAAELRLKMKKLRVLIPPGKSLITLDGVVHEFLAGHQDHPQMKQ 612
            Y+ L+ I A+ G+W E A+ R  +    +   PG S I +D VVHEF+ G + HP+ ++
Sbjct: 576 SYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNRE 635

Query: 613 IHQKLNQVAERLRKEGYKPVTKDLWLDLENDEKETVMAQHSEKLAIAFGLINMKPGTTIR 617
           I+  L ++   L K G+ P T ++  ++E + KE  +  HSEKLAIAFGLI+ KPGT + 
Sbjct: 636 IYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTKLT 695

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_022132422.1 | 0.0e+00 | 100.00 | pentatricopeptide repeat-containing protein At5g66520 [Momordica charantia]
XP_022978438.1 | 1.8e-299 | 81.00 | pentatricopeptide repeat-containing protein At5g66520 [Cucurbita maxima]
XP_023543056.1 | 9.8e-298 | 80.19 | pentatricopeptide repeat-containing protein At5g66520 [Cucurbita pepo subsp. pep...
XP_022949774.1 | 9.8e-298 | 80.35 | pentatricopeptide repeat-containing protein At5g66520 [Cucurbita moschata]
KAG6603968.1 | 8.3e-297 | 80.19 | Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
Match Name | E-value | Identity | Description
Q9FJY7 | 2.2e-191 | 54.20 | Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX...
Q9FI80 | 4.2e-158 | 44.71 | Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX...
Q9FG16 | 1.2e-144 | 42.86 | Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana OX...
O82380 | 5.8e-144 | 37.43 | Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...
Q9LN01 | 3.5e-141 | 37.99 | Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
Match Name | E-value | Identity | Description
A0A6J1BTS5 | 0.0e+00 | 100.00 | pentatricopeptide repeat-containing protein At5g66520 OS=Momordica charantia OX=...
A0A6J1IT43 | 8.6e-300 | 81.00 | pentatricopeptide repeat-containing protein At5g66520 OS=Cucurbita maxima OX=366...
A0A6J1GDX2 | 4.7e-298 | 80.35 | pentatricopeptide repeat-containing protein At5g66520 OS=Cucurbita moschata OX=3...
A0A0A0KKE0 | 6.2e-290 | 78.61 | DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G5027...
A0A5D3CKZ8 | 4.4e-288 | 78.44 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
Match Name | E-value | Identity | Description
AT5G66520.1 | 1.5e-192 | 54.20 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G48910.1 | 2.9e-159 | 44.71 | Pentatricopeptide repeat (PPR) superfamily protein
AT5G06540.1 | 8.3e-146 | 42.86 | Pentatricopeptide repeat (PPR) superfamily protein
AT2G29760.1 | 4.1e-145 | 37.43 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G08070.1 | 2.5e-142 | 37.99 | Tetratricopeptide repeat (TPR)-like superfamily protein
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 147..175, e-value: 0.036, score: 14.3; coord: 383..408, e-value: 0.0045, score: 17.1; coord: 282..304, e-value: 0.0016, score: 18.5
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 77..109, e-value: 1.1E-4, score: 20.2; coord: 177..206, e-value: 1.0E-4, score: 20.2; coord: 281..305, e-value: 0.0018, score: 16.3; coord: 312..344, e-value: 4.6E-7, score: 27.6; coord: 208..242, e-value: 2.5E-7, score: 28.4
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 73..120, e-value: 9.2E-10, score: 38.6; coord: 205..254, e-value: 5.2E-9, score: 36.2; coord: 308..355, e-value: 1.5E-12, score: 47.5
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 276..306, score: 8.780059
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 74..108, score: 9.941957
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 175..209, score: 10.764054
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 309..343, score: 11.586152
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 7..133, e-value: 5.7E-13, score: 50.9
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 262..360, e-value: 1.9E-23, score: 84.8; coord: 146..261, e-value: 1.1E-29, score: 105.0
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 361..518, e-value: 2.7E-14, score: 54.9
IPR032867 | DYW domain | PFAM | PF14432 | DYW_deaminase | coord: 482..606, e-value: 3.5E-37, score: 127.1
None | No IPR available | PANTHER | PTHR47928 | REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED | coord: 12..590
None | No IPR available | PANTHER | PTHR47928:SF136 | PPR CONTAINING PLANT-LIKE PROTEIN | coord: 12..590
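
Several member databases report overlapping coordinates for the same repeat units, so it can help to collapse the hits into non-overlapping residue ranges. The following is a minimal Python sketch, not part of the InterPro output, that extracts `coord: start..end` pairs and merges overlapping ranges. It uses the PF13041 (PPR_2) and PS51375 (PPR) coordinates listed above; `parse_coords` and `merge_ranges` are hypothetical helper names, and treating ranges as 1-based inclusive residue positions is an assumption.

```python
import re

# Captures the start and end of each "coord: 73..120" entry.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Return a list of (start, end) tuples found in the text."""
    return [(int(a), int(b)) for a, b in COORD.findall(text)]


def merge_ranges(ranges):
    """Merge overlapping inclusive ranges; ranges that only abut stay separate."""
    merged = []
    for start, end in sorted(ranges):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous range: extend it if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# PF13041 (PPR_2) hits followed by PS51375 (PPR) hits from the table.
hits = (
    "coord: 73..120 coord: 205..254 coord: 308..355 "
    "coord: 276..306 coord: 74..108 coord: 175..209 coord: 309..343"
)
merged = merge_ranges(parse_coords(hits))
covered = sum(end - start + 1 for start, end in merged)
print(merged, covered)
```

The same two helpers can be applied to any other row of the table, for example to compare the PPR-covered region with the DYW domain at 482..606.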

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Moc01g28310.1 | Moc01g28310.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:1900865 | chloroplast RNA modification
biological_process | GO:0016554 | cytidine to uridine editing
cellular_component | GO:0009507 | chloroplast
molecular_function | GO:0003729 | mRNA binding
molecular_function | GO:0005515 | protein binding
molecular_function | GO:0008270 | zinc ion binding