MS004694.1 (mRNA) Bitter gourd (TR) v1

Overview
Name: MS004694.1
Type: mRNA
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Pentatricopeptide repeat-containing protein
Location: scaffold741: 94296 .. 96332 (+)
Sequence length: 2037
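
The location and the sequence length reported in the overview can be cross-checked against each other. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and the helper names `parse_location` and `feature_length` are hypothetical, not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'seqid: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def feature_length(start, end):
    # Assumption: both coordinates are 1-based and inclusive.
    return end - start + 1

seqid, start, end, strand = parse_location("scaffold741: 94296 .. 96332 (+)")
print(seqid, strand, feature_length(start, end))  # scaffold741 + 2037
```

Under that assumption the computed span equals the listed sequence length of 2037, which is consistent with the gene, mRNA and CDS sequences on this page all having the same extent.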
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATCTCATCTTCTTCAATGCAATTAGAATCAACTTCACCCTTTTCTTCCGCCATCCCCTTGTTTCTTCTGGGAACCAGAACTCACACCCCAATACGCCTCAACAAGATCAAACGAAAGCCCACAACTCCAATTATAATCCTCAGAGCCTCAGCTTCTTCGAAATCAAAAGATGTATGGAGAAGAAGAACCCCGTCTGGAGATTCAACGGCATCGCCATTTCCCCAAAAGTACCAACGCAGCGCGAGAAGGCAGCGAGAGCCCCCGCATTTGGACCACAGCGTCGACATGGACGAGCTTTTATCGTCAATTGGGCAGACGAAGAACGAACAGGAGCTGTACTCGGTGCTGTCCCCGTATAAAGGGCGCCAGCTTTCGATCCGGTTCATGGTGTCGCTTCTGTCGCGCGAACCGGACTGGCAACGGTCGCTCGCGATTCTTGATTGGGTCAACGAAGAGGCTCTTTACACGCCCTCGGTGTTCGCTTACAATGTTGTTATCCGCAATGTACTGCGCGCGAAGCAGTGGGAGATTGCACACGGCCTGTTCGACGAAATGCGCCAGAGAGCTCTTGCGGCTGATAGGTATACTTATTCCACTCTCATCACTTGTTTTGGGAAAGAGGGGTTGTTTGATGCTGCCCTCTATTGGCTTCAGCAAATGGAGCAAGATCGGGTCTCAGGGGACCTTGTTTTGTACAGTAATTTGATTGAGCTTTCTCGTAAACTCTGTGATTATTCAAAGGCCATTTCCATTTTTTCAAGATTAAAGAGATCCGGGATTACTCCAGATATTGTAGCCTATAATTCCATGATAAATGTGTTTGGAAAAGCTAAGCTTTTCAGAGAGGCTCGTTTTTTGTTGAAGGAGATGAGGGCTGTGGGTGTTGTGCCGGATACTGTTAGCTACTCGACTTTACTCAGTATGTTTGTTGAGAATGAGAAGTTTTTGGAGGCTCTGTCTGTGTTTTCTGAGATGATGGAGGTCAACTGCCCGCTTGATCTTACTACTTGTAATGTTATGATTGATGTTTATGGTCAGCTGGATATGGTGAAGGAGGCTGACCGGCTGTTTTGGAGCATGAGGAAGATGGGAATTGAGCCGAATGTTGTGAGTTATAATACCATTTTGAGGGTCTATGGTGAGGCTGAGCTTTTCGGGGAAGCGATACACCTTTTCCGCTTGATGCAGAGGAAGGAGATTGAGCAGAATGTGGTGACATATAACACCATGATCAAGATATATGGGAAGTCTCTGGAGCATGAAAAGGCAACGAATCTTGTGCAAGAGATGCAAAATAGAGGGATTGAACCGAACGCGATTACATACTCGACAATAATTTCGATATGGGGGAAAGCAGGAAAGTTGGAGAGAGCTGCAATGCTGTTTCAGAAGCTGAGAAGCTCTGGGGCTGAGATTGATCAGGTTCTTTACCAGACCATGATTGTGGCCTATGAGAGGGCAGGCTTGGTTGCTCACGCCAAGCGTTTGCTTCACGAGCTGAAGCAACCAGACAACATTCCGAGGAAGACAGCAATTACCATTCTCGCTAGAGCTGGTCGAATCGAAGAAGCGACATGGGTTTTTCGACAGGCGTTCGATGCAGGAGAGTTGAAGGACATATCCATTTTTGGGTGCATGATTGATCTGTTTTCGAGGAACAAGAAGCACAAAAATGTTGTGGAGGTATTTGAGAAGATGAGAAGTGTGGGACATTTCCCCAACTCTAATGTCATTGCCCTTGTCCTAAATGCTTATGGGAAGTTGAGGGATTTTGACAAGGCTGATGTTGTGTACATGGAGATGCAGGAAGAGGGCTGTGTTTTTCCCGACGAAGTACACTTCCAGATGCTCGGTCTCTACGGCGCAAGAAAGGATTACAAGAGATTGGACTCGTTGTTCGAAAGGCTCGACTCTGATCCCAATATAAATAAGAAGGAATTGCATCTCGTTGTTGCGAGCATTTACGAAAGGGCTAACAGACTTAACGATGCATCTCGGAT
TATGAGCAGGATGAACGAAATGGCAATCTCGAGATCA

mRNA sequence

ATCTCATCTTCTTCAATGCAATTAGAATCAACTTCACCCTTTTCTTCCGCCATCCCCTTGTTTCTTCTGGGAACCAGAACTCACACCCCAATACGCCTCAACAAGATCAAACGAAAGCCCACAACTCCAATTATAATCCTCAGAGCCTCAGCTTCTTCGAAATCAAAAGATGTATGGAGAAGAAGAACCCCGTCTGGAGATTCAACGGCATCGCCATTTCCCCAAAAGTACCAACGCAGCGCGAGAAGGCAGCGAGAGCCCCCGCATTTGGACCACAGCGTCGACATGGACGAGCTTTTATCGTCAATTGGGCAGACGAAGAACGAACAGGAGCTGTACTCGGTGCTGTCCCCGTATAAAGGGCGCCAGCTTTCGATCCGGTTCATGGTGTCGCTTCTGTCGCGCGAACCGGACTGGCAACGGTCGCTCGCGATTCTTGATTGGGTCAACGAAGAGGCTCTTTACACGCCCTCGGTGTTCGCTTACAATGTTGTTATCCGCAATGTACTGCGCGCGAAGCAGTGGGAGATTGCACACGGCCTGTTCGACGAAATGCGCCAGAGAGCTCTTGCGGCTGATAGGTATACTTATTCCACTCTCATCACTTGTTTTGGGAAAGAGGGGTTGTTTGATGCTGCCCTCTATTGGCTTCAGCAAATGGAGCAAGATCGGGTCTCAGGGGACCTTGTTTTGTACAGTAATTTGATTGAGCTTTCTCGTAAACTCTGTGATTATTCAAAGGCCATTTCCATTTTTTCAAGATTAAAGAGATCCGGGATTACTCCAGATATTGTAGCCTATAATTCCATGATAAATGTGTTTGGAAAAGCTAAGCTTTTCAGAGAGGCTCGTTTTTTGTTGAAGGAGATGAGGGCTGTGGGTGTTGTGCCGGATACTGTTAGCTACTCGACTTTACTCAGTATGTTTGTTGAGAATGAGAAGTTTTTGGAGGCTCTGTCTGTGTTTTCTGAGATGATGGAGGTCAACTGCCCGCTTGATCTTACTACTTGTAATGTTATGATTGATGTTTATGGTCAGCTGGATATGGTGAAGGAGGCTGACCGGCTGTTTTGGAGCATGAGGAAGATGGGAATTGAGCCGAATGTTGTGAGTTATAATACCATTTTGAGGGTCTATGGTGAGGCTGAGCTTTTCGGGGAAGCGATACACCTTTTCCGCTTGATGCAGAGGAAGGAGATTGAGCAGAATGTGGTGACATATAACACCATGATCAAGATATATGGGAAGTCTCTGGAGCATGAAAAGGCAACGAATCTTGTGCAAGAGATGCAAAATAGAGGGATTGAACCGAACGCGATTACATACTCGACAATAATTTCGATATGGGGGAAAGCAGGAAAGTTGGAGAGAGCTGCAATGCTGTTTCAGAAGCTGAGAAGCTCTGGGGCTGAGATTGATCAGGTTCTTTACCAGACCATGATTGTGGCCTATGAGAGGGCAGGCTTGGTTGCTCACGCCAAGCGTTTGCTTCACGAGCTGAAGCAACCAGACAACATTCCGAGGAAGACAGCAATTACCATTCTCGCTAGAGCTGGTCGAATCGAAGAAGCGACATGGGTTTTTCGACAGGCGTTCGATGCAGGAGAGTTGAAGGACATATCCATTTTTGGGTGCATGATTGATCTGTTTTCGAGGAACAAGAAGCACAAAAATGTTGTGGAGGTATTTGAGAAGATGAGAAGTGTGGGACATTTCCCCAACTCTAATGTCATTGCCCTTGTCCTAAATGCTTATGGGAAGTTGAGGGATTTTGACAAGGCTGATGTTGTGTACATGGAGATGCAGGAAGAGGGCTGTGTTTTTCCCGACGAAGTACACTTCCAGATGCTCGGTCTCTACGGCGCAAGAAAGGATTACAAGAGATTGGACTCGTTGTTCGAAAGGCTCGACTCTGATCCCAATATAAATAAGAAGGAATTGCATCTCGTTGTTGCGAGCATTTACGAAAGGGCTAACAGACTTAACGATGCATCTCGGAT
TATGAGCAGGATGAACGAAATGGCAATCTCGAGATCA

Coding sequence (CDS)

ATCTCATCTTCTTCAATGCAATTAGAATCAACTTCACCCTTTTCTTCCGCCATCCCCTTGTTTCTTCTGGGAACCAGAACTCACACCCCAATACGCCTCAACAAGATCAAACGAAAGCCCACAACTCCAATTATAATCCTCAGAGCCTCAGCTTCTTCGAAATCAAAAGATGTATGGAGAAGAAGAACCCCGTCTGGAGATTCAACGGCATCGCCATTTCCCCAAAAGTACCAACGCAGCGCGAGAAGGCAGCGAGAGCCCCCGCATTTGGACCACAGCGTCGACATGGACGAGCTTTTATCGTCAATTGGGCAGACGAAGAACGAACAGGAGCTGTACTCGGTGCTGTCCCCGTATAAAGGGCGCCAGCTTTCGATCCGGTTCATGGTGTCGCTTCTGTCGCGCGAACCGGACTGGCAACGGTCGCTCGCGATTCTTGATTGGGTCAACGAAGAGGCTCTTTACACGCCCTCGGTGTTCGCTTACAATGTTGTTATCCGCAATGTACTGCGCGCGAAGCAGTGGGAGATTGCACACGGCCTGTTCGACGAAATGCGCCAGAGAGCTCTTGCGGCTGATAGGTATACTTATTCCACTCTCATCACTTGTTTTGGGAAAGAGGGGTTGTTTGATGCTGCCCTCTATTGGCTTCAGCAAATGGAGCAAGATCGGGTCTCAGGGGACCTTGTTTTGTACAGTAATTTGATTGAGCTTTCTCGTAAACTCTGTGATTATTCAAAGGCCATTTCCATTTTTTCAAGATTAAAGAGATCCGGGATTACTCCAGATATTGTAGCCTATAATTCCATGATAAATGTGTTTGGAAAAGCTAAGCTTTTCAGAGAGGCTCGTTTTTTGTTGAAGGAGATGAGGGCTGTGGGTGTTGTGCCGGATACTGTTAGCTACTCGACTTTACTCAGTATGTTTGTTGAGAATGAGAAGTTTTTGGAGGCTCTGTCTGTGTTTTCTGAGATGATGGAGGTCAACTGCCCGCTTGATCTTACTACTTGTAATGTTATGATTGATGTTTATGGTCAGCTGGATATGGTGAAGGAGGCTGACCGGCTGTTTTGGAGCATGAGGAAGATGGGAATTGAGCCGAATGTTGTGAGTTATAATACCATTTTGAGGGTCTATGGTGAGGCTGAGCTTTTCGGGGAAGCGATACACCTTTTCCGCTTGATGCAGAGGAAGGAGATTGAGCAGAATGTGGTGACATATAACACCATGATCAAGATATATGGGAAGTCTCTGGAGCATGAAAAGGCAACGAATCTTGTGCAAGAGATGCAAAATAGAGGGATTGAACCGAACGCGATTACATACTCGACAATAATTTCGATATGGGGGAAAGCAGGAAAGTTGGAGAGAGCTGCAATGCTGTTTCAGAAGCTGAGAAGCTCTGGGGCTGAGATTGATCAGGTTCTTTACCAGACCATGATTGTGGCCTATGAGAGGGCAGGCTTGGTTGCTCACGCCAAGCGTTTGCTTCACGAGCTGAAGCAACCAGACAACATTCCGAGGAAGACAGCAATTACCATTCTCGCTAGAGCTGGTCGAATCGAAGAAGCGACATGGGTTTTTCGACAGGCGTTCGATGCAGGAGAGTTGAAGGACATATCCATTTTTGGGTGCATGATTGATCTGTTTTCGAGGAACAAGAAGCACAAAAATGTTGTGGAGGTATTTGAGAAGATGAGAAGTGTGGGACATTTCCCCAACTCTAATGTCATTGCCCTTGTCCTAAATGCTTATGGGAAGTTGAGGGATTTTGACAAGGCTGATGTTGTGTACATGGAGATGCAGGAAGAGGGCTGTGTTTTTCCCGACGAAGTACACTTCCAGATGCTCGGTCTCTACGGCGCAAGAAAGGATTACAAGAGATTGGACTCGTTGTTCGAAAGGCTCGACTCTGATCCCAATATAAATAAGAAGGAATTGCATCTCGTTGTTGCGAGCATTTACGAAAGGGCTAACAGACTTAACGATGCATCTCGGAT
TATGAGCAGGATGAACGAAATGGCAATCTCGAGATCA

Protein sequence

ISSSSMQLESTSPFSSAIPLFLLGTRTHTPIRLNKIKRKPTTPIIILRASASSKSKDVWRRRTPSGDSTASPFPQKYQRSARRQREPPHLDHSVDMDELLSSIGQTKNEQELYSVLSPYKGRQLSIRFMVSLLSREPDWQRSLAILDWVNEEALYTPSVFAYNVVIRNVLRAKQWEIAHGLFDEMRQRALAADRYTYSTLITCFGKEGLFDAALYWLQQMEQDRVSGDLVLYSNLIELSRKLCDYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAVGVVPDTVSYSTLLSMFVENEKFLEALSVFSEMMEVNCPLDLTTCNVMIDVYGQLDMVKEADRLFWSMRKMGIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEHEKATNLVQEMQNRGIEPNAITYSTIISIWGKAGKLERAAMLFQKLRSSGAEIDQVLYQTMIVAYERAGLVAHAKRLLHELKQPDNIPRKTAITILARAGRIEEATWVFRQAFDAGELKDISIFGCMIDLFSRNKKHKNVVEVFEKMRSVGHFPNSNVIALVLNAYGKLRDFDKADVVYMEMQEEGCVFPDEVHFQMLGLYGARKDYKRLDSLFERLDSDPNINKKELHLVVASIYERANRLNDASRIMSRMNEMAISRS
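
The protein sequence above corresponds to a frame-0 translation of the coding sequence. A minimal sketch of that translation follows, assuming the standard genetic code; the `translate` helper is hypothetical and written with the standard library only, and any codon containing a character outside A, C, G and T is reported as `X`.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()      # drop line breaks and spaces
    usable = len(cds) - len(cds) % 3        # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X")
                   for i in range(0, usable, 3))

# First and last codons of the CDS listed on this page.
print(translate("ATCTCATCTTCTTCAATGCAATTAGAATCA"))  # ISSSSMQLES
print(translate("GAAATGGCAATCTCGAGATCA"))           # EMAISRS
```

The two fragments reproduce the start (ISSSSMQLES) and the end (EMAISRS) of the listed protein, and 2037 nucleotides divided by 3 gives the 679 residues used as the alignment length in the homology section.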
Homology
BLAST of MS004694.1 vs. NCBI nr
Match: XP_022157661.1 (pentatricopeptide repeat-containing protein At5g39980, chloroplastic [Momordica charantia])

HSP 1 Score: 1318.9 bits (3412), Expect = 0.0e+00
Identity = 674/679 (99.26%), Positives = 675/679 (99.41%), Query Frame = 0

Query: 1   ISSSSMQLESTSPFSSAIPLFLLGTRTHTPIRLNKIKRKPTTPIIILRASASSKSKDVWR 60
           ISSSSMQLESTSPFSSAIPLFLLGTR H PIRLNKIKRKPTTPIIILRASASSKSKDVWR
Sbjct: 6   ISSSSMQLESTSPFSSAIPLFLLGTRNHAPIRLNKIKRKPTTPIIILRASASSKSKDVWR 65

Query: 61  RRTPSGDSTASPFPQKYQRSARRQREPPHLDHSVDMDELLSSIGQTKNEQELYSVLSPYK 120
           RRTPSGDSTASPFPQKYQRSARRQREPPHLDHSVDMDELLSSIGQTKNEQELYSVLSPYK
Sbjct: 66  RRTPSGDSTASPFPQKYQRSARRQREPPHLDHSVDMDELLSSIGQTKNEQELYSVLSPYK 125

Query: 121 GRQLSIRFMVSLLSREPDWQRSLAILDWVNEEALYTPSVFAYNVVIRNVLRAKQWEIAHG 180
           GRQLSIRFMVSLLSREPDWQRSLAILDWVNEEALYTPSVFAYNVVIRNVLRAKQWEIAHG
Sbjct: 126 GRQLSIRFMVSLLSREPDWQRSLAILDWVNEEALYTPSVFAYNVVIRNVLRAKQWEIAHG 185

Query: 181 LFDEMRQRALAADRYTYSTLITCFGKEGLFDAALYWLQQMEQDRVSGDLVLYSNLIELSR 240
           LFDEMRQRALAADRYTYSTLITCFGKEGLFDAALYWLQQMEQDRVSGDLVLYSNLIELSR
Sbjct: 186 LFDEMRQRALAADRYTYSTLITCFGKEGLFDAALYWLQQMEQDRVSGDLVLYSNLIELSR 245

Query: 241 KLCDYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAVGVVPDTV 300
           KLCDYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAVGVVPDTV
Sbjct: 246 KLCDYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAVGVVPDTV 305

Query: 301 SYSTLLSMFVENEKFLEALSVFSEMMEVNCPLDLTTCNVMIDVYGQLDMVKEADRLFWSM 360
           SYSTLLSMFVENEKFLEALSV+SEMMEVNCPLDLTTCNVMIDVYGQLDMVKEADRLFWSM
Sbjct: 306 SYSTLLSMFVENEKFLEALSVYSEMMEVNCPLDLTTCNVMIDVYGQLDMVKEADRLFWSM 365

Query: 361 RKMGIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEH 420
           RKMGIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEH
Sbjct: 366 RKMGIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEH 425

Query: 421 EKATNLVQEMQNRGIEPNAITYSTIISIWGKAGKLERAAMLFQKLRSSGAEIDQVLYQTM 480
           EKATNLVQEMQNRGIEPNAITYSTIISIWGKAGKLERAAMLFQKLRSSGAEIDQVLYQTM
Sbjct: 426 EKATNLVQEMQNRGIEPNAITYSTIISIWGKAGKLERAAMLFQKLRSSGAEIDQVLYQTM 485

Query: 481 IVAYERAGLVAHAKRLLHELKQPDNIPRKTAITILARAGRIEEATWVFRQAFDAGELKDI 540
           IVAYERAGLVAHAKRLLHELKQPDNIPRKTAITILARAGRIEEATWVFRQAFDAGELKDI
Sbjct: 486 IVAYERAGLVAHAKRLLHELKQPDNIPRKTAITILARAGRIEEATWVFRQAFDAGELKDI 545

Query: 541 SIFGCMIDLFSRNKKHKNVVEVFEKMRSVGHFPNSNVIALVLNAYGKLRDFDKADVVYME 600
           SIFGCMIDLFSRNKKHKNVVEVFEKMRSVGHFPNSNVIALVLNAYGKLRDFDKAD VYME
Sbjct: 546 SIFGCMIDLFSRNKKHKNVVEVFEKMRSVGHFPNSNVIALVLNAYGKLRDFDKADAVYME 605

Query: 601 MQEEGCVFPDEVHFQMLGLYGARKDYKRLDSLFERLDSDPNINKKELHLVVASIYERANR 660
           MQEEGCVF DEVHFQMLGLYGARKDYKRLDSLFERLDSDPNINKKELHLVVASIYERANR
Sbjct: 606 MQEEGCVFSDEVHFQMLGLYGARKDYKRLDSLFERLDSDPNINKKELHLVVASIYERANR 665

Query: 661 LNDASRIMSRMNEMAISRS 680
           LNDASRIMSRMNEMAISRS
Sbjct: 666 LNDASRIMSRMNEMAISRS 684
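
The percentages in each HSP summary are the match count divided by the alignment length. The sketch below recomputes them from the summary lines of this hit; the function names `parse_identity` and `parse_expect` are hypothetical, and the regular expressions assume the exact "Identity = a/b" and "Expect = x" layout shown on this page.

```python
import re

def parse_identity(line):
    """Return (matches, alignment_length, percent) from an 'Identity = a/b' field."""
    m = re.search(r"Identity\s*=\s*(\d+)/(\d+)", line)
    if m is None:
        raise ValueError("no Identity field found")
    matches, length = int(m.group(1)), int(m.group(2))
    return matches, length, round(100 * matches / length, 2)

def parse_expect(line):
    """Return the E-value from an 'Expect = x' field as a float."""
    m = re.search(r"Expect\s*=\s*([0-9.eE+-]+)", line)
    if m is None:
        raise ValueError("no Expect field found")
    return float(m.group(1))

print(parse_identity("Identity = 674/679 (99.26%)"))                      # (674, 679, 99.26)
print(parse_expect("HSP 1 Score: 1318.9 bits (3412), Expect = 0.0e+00"))  # 0.0
```

An E-value printed as 0.0e+00 means the value fell below the precision of the report, not that it is exactly zero.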

BLAST of MS004694.1 vs. NCBI nr
Match: XP_008443439.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g39980, chloroplastic [Cucumis melo] >KAA0053737.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK25666.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1210.3 bits (3130), Expect = 0.0e+00
Identity = 615/676 (90.98%), Positives = 643/676 (95.12%), Query Frame = 0

Query: 4   SSMQLESTSPFSSAIPLFLLGTRTHTPIRLNKIKRKPTTPIIILRASASSKSKDVWRRRT 63
           SS+QL+ST  FSS IPLFL+ TR +  IR NKIK KP T I I RAS+SS SKD+WRR+T
Sbjct: 4   SSIQLQSTLSFSSPIPLFLIETRDYPKIRFNKIKTKPRTRIPIFRASSSSASKDIWRRKT 63

Query: 64  PSGDSTASPFPQKYQRSARRQREPPHLDHSVDMDELLSSIGQTKNEQELYSVLSPYKGRQ 123
           PS  ST +  PQKYQRSARRQRE  HLDHS+DMDELL+SIGQTKNEQELYSVLSPYKGRQ
Sbjct: 64  PSEKSTTTLLPQKYQRSARRQRESSHLDHSIDMDELLASIGQTKNEQELYSVLSPYKGRQ 123

Query: 124 LSIRFMVSLLSREPDWQRSLAILDWVNEEALYTPSVFAYNVVIRNVLRAKQWEIAHGLFD 183
           LS+RFMVSLLSRE DWQRSLAILDW+NEEALYTPSV+AYNVV+RNVLRAKQWE+AHGLFD
Sbjct: 124 LSMRFMVSLLSRESDWQRSLAILDWINEEALYTPSVYAYNVVLRNVLRAKQWELAHGLFD 183

Query: 184 EMRQRALAADRYTYSTLITCFGKEGLFDAALYWLQQMEQDRVSGDLVLYSNLIELSRKLC 243
           EMRQRALAADRYTYSTLIT FGKEG+FDAAL WLQ+MEQDRVSGDLVLYSNLIELSRKLC
Sbjct: 184 EMRQRALAADRYTYSTLITYFGKEGMFDAALSWLQKMEQDRVSGDLVLYSNLIELSRKLC 243

Query: 244 DYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAVGVVPDTVSYS 303
           DYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAV V+PDTVSYS
Sbjct: 244 DYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAVDVMPDTVSYS 303

Query: 304 TLLSMFVENEKFLEALSVFSEMMEVNCPLDLTTCNVMIDVYGQLDMVKEADRLFWSMRKM 363
           TLL+MFVENEKFLEALSVFSEM EVNCPLDLTTCN+MIDVYGQLDMVKEADRLFW MRKM
Sbjct: 304 TLLNMFVENEKFLEALSVFSEMTEVNCPLDLTTCNIMIDVYGQLDMVKEADRLFWRMRKM 363

Query: 364 GIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEHEKA 423
           GIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEHEKA
Sbjct: 364 GIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEHEKA 423

Query: 424 TNLVQEMQNRGIEPNAITYSTIISIWGKAGKLERAAMLFQKLRSSGAEIDQVLYQTMIVA 483
           TNLVQEMQNRGIEPNAITYSTIISIWGKAGKLERAAMLFQKLRSS AEIDQVLYQTMIVA
Sbjct: 424 TNLVQEMQNRGIEPNAITYSTIISIWGKAGKLERAAMLFQKLRSSRAEIDQVLYQTMIVA 483

Query: 484 YERAGLVAHAKRLLHELKQPDNIPRKTAITILARAGRIEEATWVFRQAFDAGELKDISIF 543
           YE+AGLVAHAKRLLHELKQPDNIPRKTAITILA+AGRIEEATWVFRQAFDAGELKDIS+F
Sbjct: 484 YEKAGLVAHAKRLLHELKQPDNIPRKTAITILAKAGRIEEATWVFRQAFDAGELKDISVF 543

Query: 544 GCMIDLFSRNKKHKNVVEVFEKMRSVGHFPNSNVIALVLNAYGKLRDFDKADVVYMEMQE 603
           GCMIDLFSRNKKHKNVVEVFEKMRSVGHFPNSNVIALVLNAYGKLRDFD AD VYMEMQE
Sbjct: 544 GCMIDLFSRNKKHKNVVEVFEKMRSVGHFPNSNVIALVLNAYGKLRDFDTADAVYMEMQE 603

Query: 604 EGCVFPDEVHFQMLGLYGARKDYKRLDSLFERLDSDPNINKKELHLVVASIYERANRLND 663
           +GC+FPDEVHFQML LYGARKDYKRL+SLFERLDSDPNINKKELHLVVASIYER NRLN+
Sbjct: 604 KGCIFPDEVHFQMLSLYGARKDYKRLESLFERLDSDPNINKKELHLVVASIYERGNRLNN 663

Query: 664 ASRIMSRMNEMAISRS 680
           ASRIM+RMNE AISRS
Sbjct: 664 ASRIMNRMNETAISRS 679
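
In these protein alignments the middle line carries the per-column result: a letter marks an identical residue, `+` marks a substitution with a positive score, and a space marks a mismatch or a gap. The sketch below tallies the columns of one alignment block. The `classify_columns` helper is hypothetical; it assumes the three strings have already been cut out of the "Query:" / "Sbjct:" lines and that any trailing spaces lost from the middle line can be restored by padding.

```python
def classify_columns(query, midline, sbjct):
    """Count identities, positives and gap columns in one alignment block."""
    midline = midline.ljust(len(query))   # restore stripped trailing spaces
    if not (len(query) == len(midline) == len(sbjct)):
        raise ValueError("alignment rows must have equal length")
    counts = {"identities": 0, "positives": 0, "gaps": 0, "length": len(query)}
    for q, m, s in zip(query, midline, sbjct):
        if q == "-" or s == "-":
            counts["gaps"] += 1
        elif m == "+":
            counts["positives"] += 1      # conservative substitution
        elif m != " ":
            counts["identities"] += 1
            counts["positives"] += 1      # positives include identities
    return counts

# Final block of the alignment above (query residues 664 onward).
print(classify_columns("ASRIMSRMNEMAISRS",
                       "ASRIM+RMNE AISRS",
                       "ASRIMNRMNETAISRS"))
# {'identities': 14, 'positives': 15, 'gaps': 0, 'length': 16}
```

Summing these counts over every block of an HSP should give the Identity and Positives numerators reported in its summary line.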

BLAST of MS004694.1 vs. NCBI nr
Match: XP_023521621.1 (pentatricopeptide repeat-containing protein At5g39980, chloroplastic-like [Cucurbita pepo subsp. pepo] >XP_023527980.1 pentatricopeptide repeat-containing protein At5g39980, chloroplastic-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1208.7 bits (3126), Expect = 0.0e+00
Identity = 614/679 (90.43%), Positives = 646/679 (95.14%), Query Frame = 0

Query: 1   ISSSSMQLESTSPFSSAIPLFLLGTRTHTPIRLNKIKRKPTTPIIILRASASSKSKDVWR 60
           IS SSMQL+STS FSS + L L+ TR +  + L++ K KP   I + RAS+SS SKDVWR
Sbjct: 7   ISPSSMQLQSTSSFSSPVFLCLIETRNNLKVCLSEFKSKPRIRISVPRASSSSTSKDVWR 66

Query: 61  RRTPSGDSTASPFPQKYQRSARRQREPPHLDHSVDMDELLSSIGQTKNEQELYSVLSPYK 120
           R+TPS  ST +PFPQKY+RSARRQRE  HLDHSVDMDELL+SIGQTKNEQELYS+LSPYK
Sbjct: 67  RKTPSEKSTTTPFPQKYERSARRQRESSHLDHSVDMDELLASIGQTKNEQELYSILSPYK 126

Query: 121 GRQLSIRFMVSLLSREPDWQRSLAILDWVNEEALYTPSVFAYNVVIRNVLRAKQWEIAHG 180
           GRQLS+RFMVS+LSRE DWQRSLAILDW+NEEALYTPSVFAYNVV+RNVLRAKQWE+AHG
Sbjct: 127 GRQLSMRFMVSILSRESDWQRSLAILDWINEEALYTPSVFAYNVVLRNVLRAKQWELAHG 186

Query: 181 LFDEMRQRALAADRYTYSTLITCFGKEGLFDAALYWLQQMEQDRVSGDLVLYSNLIELSR 240
           LFDEMRQRALAADRYTYSTLIT FGKEG+FDAAL WLQ+MEQDRVSGDLVLYSNLIELSR
Sbjct: 187 LFDEMRQRALAADRYTYSTLITYFGKEGMFDAALSWLQKMEQDRVSGDLVLYSNLIELSR 246

Query: 241 KLCDYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAVGVVPDTV 300
           KLCDYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAV VVPDTV
Sbjct: 247 KLCDYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAVDVVPDTV 306

Query: 301 SYSTLLSMFVENEKFLEALSVFSEMMEVNCPLDLTTCNVMIDVYGQLDMVKEADRLFWSM 360
           SYSTLLSMFVENEKFLEALSVFSEM EVNCPL+LTTCN+MIDVYGQLDMVKEADRLFWSM
Sbjct: 307 SYSTLLSMFVENEKFLEALSVFSEMTEVNCPLNLTTCNIMIDVYGQLDMVKEADRLFWSM 366

Query: 361 RKMGIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEH 420
           RKMGIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEH
Sbjct: 367 RKMGIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEH 426

Query: 421 EKATNLVQEMQNRGIEPNAITYSTIISIWGKAGKLERAAMLFQKLRSSGAEIDQVLYQTM 480
           EKATNL+QEMQNRGIEPNAITYSTIISIWGKAGKL+RAAMLFQKLRSSGAEIDQVLYQTM
Sbjct: 427 EKATNLIQEMQNRGIEPNAITYSTIISIWGKAGKLDRAAMLFQKLRSSGAEIDQVLYQTM 486

Query: 481 IVAYERAGLVAHAKRLLHELKQPDNIPRKTAITILARAGRIEEATWVFRQAFDAGELKDI 540
           IVAYERAGLVAHAKRL+ ELKQPDNIPRKTAITILARAGR+EEATWVFRQAFDAGELKDI
Sbjct: 487 IVAYERAGLVAHAKRLIQELKQPDNIPRKTAITILARAGRVEEATWVFRQAFDAGELKDI 546

Query: 541 SIFGCMIDLFSRNKKHKNVVEVFEKMRSVGHFPNSNVIALVLNAYGKLRDFDKADVVYME 600
           S+FGCMIDLFSRNKKHKNVVEVFEKMRSVGHFP+SNVIALVLNAYGKLRDFDKAD VY E
Sbjct: 547 SVFGCMIDLFSRNKKHKNVVEVFEKMRSVGHFPSSNVIALVLNAYGKLRDFDKADAVYTE 606

Query: 601 MQEEGCVFPDEVHFQMLGLYGARKDYKRLDSLFERLDSDPNINKKELHLVVASIYERANR 660
           MQEEGCVFPDEVHFQML LYGARKDYKRL+ LFERLDSDPNINKKELHLVVASIYERANR
Sbjct: 607 MQEEGCVFPDEVHFQMLSLYGARKDYKRLEELFERLDSDPNINKKELHLVVASIYERANR 666

Query: 661 LNDASRIMSRMNEMAISRS 680
           LNDASRIM+RMNEM+ISRS
Sbjct: 667 LNDASRIMNRMNEMSISRS 685

BLAST of MS004694.1 vs. NCBI nr
Match: XP_022934402.1 (pentatricopeptide repeat-containing protein At5g39980, chloroplastic [Cucurbita moschata])

HSP 1 Score: 1206.8 bits (3121), Expect = 0.0e+00
Identity = 613/679 (90.28%), Positives = 646/679 (95.14%), Query Frame = 0

Query: 1   ISSSSMQLESTSPFSSAIPLFLLGTRTHTPIRLNKIKRKPTTPIIILRASASSKSKDVWR 60
           IS SSMQL+STS FSS + L L+ TR +  + L++ K KP   I I RAS+SS SKD+WR
Sbjct: 6   ISPSSMQLQSTSSFSSPVFLCLIETRNNLKVCLSEFKSKPRIRISIPRASSSSTSKDIWR 65

Query: 61  RRTPSGDSTASPFPQKYQRSARRQREPPHLDHSVDMDELLSSIGQTKNEQELYSVLSPYK 120
           R+T S  ST +PFPQKY+RSARRQRE  HLDHSVDMDELL+SIGQTKNEQELYS+LSPYK
Sbjct: 66  RKTSSEKSTTTPFPQKYERSARRQRESSHLDHSVDMDELLASIGQTKNEQELYSILSPYK 125

Query: 121 GRQLSIRFMVSLLSREPDWQRSLAILDWVNEEALYTPSVFAYNVVIRNVLRAKQWEIAHG 180
           GRQLS+RFMVS+LSRE DWQRSLAILDW+NEEALYTPSVFAYNVV+RNVLRAKQWE+AHG
Sbjct: 126 GRQLSMRFMVSILSRESDWQRSLAILDWINEEALYTPSVFAYNVVLRNVLRAKQWELAHG 185

Query: 181 LFDEMRQRALAADRYTYSTLITCFGKEGLFDAALYWLQQMEQDRVSGDLVLYSNLIELSR 240
           LFDEMRQRALAADRYTYSTLIT FGKEG+FDAAL WLQ+M+QDRVSGDLVLYSNLIELSR
Sbjct: 186 LFDEMRQRALAADRYTYSTLITYFGKEGMFDAALSWLQKMDQDRVSGDLVLYSNLIELSR 245

Query: 241 KLCDYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAVGVVPDTV 300
           KLCDYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAV VVPDTV
Sbjct: 246 KLCDYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAVDVVPDTV 305

Query: 301 SYSTLLSMFVENEKFLEALSVFSEMMEVNCPLDLTTCNVMIDVYGQLDMVKEADRLFWSM 360
           SYSTLLSMFVENEKFLEALSVFSEM EVNCPL+LTTCN+MIDVYGQLDMVKEADRLFWSM
Sbjct: 306 SYSTLLSMFVENEKFLEALSVFSEMTEVNCPLNLTTCNIMIDVYGQLDMVKEADRLFWSM 365

Query: 361 RKMGIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEH 420
           RKMGIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEH
Sbjct: 366 RKMGIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEH 425

Query: 421 EKATNLVQEMQNRGIEPNAITYSTIISIWGKAGKLERAAMLFQKLRSSGAEIDQVLYQTM 480
           EKATNL+QEMQNRGIEPNAITYSTIISIWGKAGKL+RAAMLFQKLRSSGAEIDQVLYQTM
Sbjct: 426 EKATNLIQEMQNRGIEPNAITYSTIISIWGKAGKLDRAAMLFQKLRSSGAEIDQVLYQTM 485

Query: 481 IVAYERAGLVAHAKRLLHELKQPDNIPRKTAITILARAGRIEEATWVFRQAFDAGELKDI 540
           IVAYERAGLVAHAKRL+ ELKQPDNIPRKTAITILARAGR+EEATWVFRQAFDAGELKDI
Sbjct: 486 IVAYERAGLVAHAKRLIQELKQPDNIPRKTAITILARAGRVEEATWVFRQAFDAGELKDI 545

Query: 541 SIFGCMIDLFSRNKKHKNVVEVFEKMRSVGHFPNSNVIALVLNAYGKLRDFDKADVVYME 600
           S+FGCMIDLFSRNKKHKNVVEVFEKMRSVGHFP+SNVIALVLNAYGKLRDFDKAD VYME
Sbjct: 546 SVFGCMIDLFSRNKKHKNVVEVFEKMRSVGHFPSSNVIALVLNAYGKLRDFDKADAVYME 605

Query: 601 MQEEGCVFPDEVHFQMLGLYGARKDYKRLDSLFERLDSDPNINKKELHLVVASIYERANR 660
           MQEEGCVFPDEVHFQML LYGARKDYKRL+ LFERLDSDPNINKKELHLVVASIYERANR
Sbjct: 606 MQEEGCVFPDEVHFQMLSLYGARKDYKRLEELFERLDSDPNINKKELHLVVASIYERANR 665

Query: 661 LNDASRIMSRMNEMAISRS 680
           LNDASRIM+RMNEM+ISRS
Sbjct: 666 LNDASRIMNRMNEMSISRS 684

BLAST of MS004694.1 vs. NCBI nr
Match: KAG6580983.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1206.8 bits (3121), Expect = 0.0e+00
Identity = 613/679 (90.28%), Positives = 646/679 (95.14%), Query Frame = 0

Query: 1   ISSSSMQLESTSPFSSAIPLFLLGTRTHTPIRLNKIKRKPTTPIIILRASASSKSKDVWR 60
           IS SSMQL+STS FSS + L L+ TR +  + L++ K KP   I I RAS+SS SKD+WR
Sbjct: 6   ISPSSMQLQSTSSFSSPVFLCLIETRNNLKVCLSEFKSKPRIRISIPRASSSSTSKDIWR 65

Query: 61  RRTPSGDSTASPFPQKYQRSARRQREPPHLDHSVDMDELLSSIGQTKNEQELYSVLSPYK 120
           R+TPS  STA+PFPQKY+RSARRQRE  HLDHSVDMDELL+SIGQTKNEQELYS+LSPYK
Sbjct: 66  RKTPSEKSTATPFPQKYERSARRQRESSHLDHSVDMDELLASIGQTKNEQELYSILSPYK 125

Query: 121 GRQLSIRFMVSLLSREPDWQRSLAILDWVNEEALYTPSVFAYNVVIRNVLRAKQWEIAHG 180
           GRQLS+RFMVS+LSRE DWQRSLAILDW+NEEALYTPSVFAYNVV+RNVLRAKQWE+AHG
Sbjct: 126 GRQLSMRFMVSILSRESDWQRSLAILDWINEEALYTPSVFAYNVVLRNVLRAKQWELAHG 185

Query: 181 LFDEMRQRALAADRYTYSTLITCFGKEGLFDAALYWLQQMEQDRVSGDLVLYSNLIELSR 240
           LFDEMRQRALAADRYTYSTLIT FGKEG+FDAAL WLQ+MEQDRVSGDLVLYSNLIELSR
Sbjct: 186 LFDEMRQRALAADRYTYSTLITYFGKEGMFDAALSWLQKMEQDRVSGDLVLYSNLIELSR 245

Query: 241 KLCDYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAVGVVPDTV 300
           KLCDYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAV VVPDTV
Sbjct: 246 KLCDYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAVDVVPDTV 305

Query: 301 SYSTLLSMFVENEKFLEALSVFSEMMEVNCPLDLTTCNVMIDVYGQLDMVKEADRLFWSM 360
           SYSTLLSMFVENEKFLEALSVFSEM EVNCPL+LTTCN+MIDVYGQLDMVKEADRLFWSM
Sbjct: 306 SYSTLLSMFVENEKFLEALSVFSEMTEVNCPLNLTTCNIMIDVYGQLDMVKEADRLFWSM 365

Query: 361 RKMGIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEH 420
           RKMGIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEH
Sbjct: 366 RKMGIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEH 425

Query: 421 EKATNLVQEMQNRGIEPNAITYSTIISIWGKAGKLERAAMLFQKLRSSGAEIDQVLYQTM 480
           EKATNL+QEMQNRGIEPNAITYSTIISIWGKAGKL+RAAMLFQKLRSSGAEIDQVLYQTM
Sbjct: 426 EKATNLIQEMQNRGIEPNAITYSTIISIWGKAGKLDRAAMLFQKLRSSGAEIDQVLYQTM 485

Query: 481 IVAYERAGLVAHAKRLLHELKQPDNIPRKTAITILARAGRIEEATWVFRQAFDAGELKDI 540
           IVAYERAGLVAHAKRL+ ELKQPDNIPRKTAITILARAGR+EE+TWVFRQAFDAGELKDI
Sbjct: 486 IVAYERAGLVAHAKRLIQELKQPDNIPRKTAITILARAGRVEESTWVFRQAFDAGELKDI 545

Query: 541 SIFGCMIDLFSRNKKHKNVVEVFEKMRSVGHFPNSNVIALVLNAYGKLRDFDKADVVYME 600
           S+FGCMIDLFSRNKKHKNVVEVFEKMRSVGHFP+SNVIALVLNAYGKLRDFDKAD VYME
Sbjct: 546 SVFGCMIDLFSRNKKHKNVVEVFEKMRSVGHFPSSNVIALVLNAYGKLRDFDKADAVYME 605

Query: 601 MQEEGCVFPDEVHFQMLGLYGARKDYKRLDSLFERLDSDPNINKKELHLVVASIYERANR 660
           MQEEGCVFPDEVHFQML LY ARKDYKRL+  FERLDSDPNINKKELHLVVASIYERANR
Sbjct: 606 MQEEGCVFPDEVHFQMLSLYSARKDYKRLEESFERLDSDPNINKKELHLVVASIYERANR 665

Query: 661 LNDASRIMSRMNEMAISRS 680
           LNDASRIM+RMNEM+ISRS
Sbjct: 666 LNDASRIMNRMNEMSISRS 684

BLAST of MS004694.1 vs. ExPASy Swiss-Prot
Match: Q9FLD8 (Pentatricopeptide repeat-containing protein At5g39980, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At5g39980 PE=2 SV=1)

HSP 1 Score: 1008.8 bits (2607), Expect = 2.9e-293
Identity = 518/677 (76.51%), Positives = 598/677 (88.33%), Query Frame = 0

Query: 6   MQLESTSPFSSAIPLFLLGTRTH--TPIRLNKI-KRKPTTPIIILRASASSKS---KDVW 65
           M +E  S  S ++PL  L TR H  T I  + I + +    I  + AS+SS+S   K VW
Sbjct: 1   MYIEIASSSSLSLPLLPL-TRPHIYTSIPFSTIPEARQRNLIFTVSASSSSESTQNKKVW 60

Query: 66  RRRTPSGDSTASPFPQKYQRSARRQREPPHLDHSVDMDELLSSIGQTKNEQELYSVLSPY 125
           R++ P  ++T+S    +  R  RR +    LDH+VDMDELL+SI QT+NE+EL+S+LS Y
Sbjct: 61  RKQ-PEKNTTSS---FQALRKHRRYQRSAFLDHNVDMDELLASIHQTQNEKELFSLLSTY 120

Query: 126 KGRQLSIRFMVSLLSREPDWQRSLAILDWVNEEALYTPSVFAYNVVIRNVLRAKQWEIAH 185
           K RQLSIRFMVSLLSRE DWQRSLA+LDWV+EEA YTPSVFAYNVV+RNVLRAKQ++IAH
Sbjct: 121 KDRQLSIRFMVSLLSRENDWQRSLALLDWVHEEAKYTPSVFAYNVVLRNVLRAKQFDIAH 180

Query: 186 GLFDEMRQRALAADRYTYSTLITCFGKEGLFDAALYWLQQMEQDRVSGDLVLYSNLIELS 245
           GLFDEMRQRALA DRYTYSTLIT FGKEG+FD+AL WLQ+MEQDRVSGDLVLYSNLIELS
Sbjct: 181 GLFDEMRQRALAPDRYTYSTLITSFGKEGMFDSALSWLQKMEQDRVSGDLVLYSNLIELS 240

Query: 246 RKLCDYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAVGVVPDT 305
           R+LCDYSKAISIFSRLKRSGITPD+VAYNSMINV+GKAKLFREAR L+KEM   GV+P+T
Sbjct: 241 RRLCDYSKAISIFSRLKRSGITPDLVAYNSMINVYGKAKLFREARLLIKEMNEAGVLPNT 300

Query: 306 VSYSTLLSMFVENEKFLEALSVFSEMMEVNCPLDLTTCNVMIDVYGQLDMVKEADRLFWS 365
           VSYSTLLS++VEN KFLEALSVF+EM EVNC LDLTTCN+MIDVYGQLDMVKEADRLFWS
Sbjct: 301 VSYSTLLSVYVENHKFLEALSVFAEMKEVNCALDLTTCNIMIDVYGQLDMVKEADRLFWS 360

Query: 366 MRKMGIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLE 425
           +RKM IEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRK+IEQNVVTYNTMIKIYGK++E
Sbjct: 361 LRKMDIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKDIEQNVVTYNTMIKIYGKTME 420

Query: 426 HEKATNLVQEMQNRGIEPNAITYSTIISIWGKAGKLERAAMLFQKLRSSGAEIDQVLYQT 485
           HEKATNLVQEMQ+RGIEPNAITYSTIISIWGKAGKL+RAA LFQKLRSSG EIDQVLYQT
Sbjct: 421 HEKATNLVQEMQSRGIEPNAITYSTIISIWGKAGKLDRAATLFQKLRSSGVEIDQVLYQT 480

Query: 486 MIVAYERAGLVAHAKRLLHELKQPDNIPRKTAITILARAGRIEEATWVFRQAFDAGELKD 545
           MIVAYER GL+ HAKRLLHELK PDNIPR+TAITILA+AGR EEATWVFRQAF++GE+KD
Sbjct: 481 MIVAYERVGLMGHAKRLLHELKLPDNIPRETAITILAKAGRTEEATWVFRQAFESGEVKD 540

Query: 546 ISIFGCMIDLFSRNKKHKNVVEVFEKMRSVGHFPNSNVIALVLNAYGKLRDFDKADVVYM 605
           IS+FGCMI+L+SRN+++ NV+EVFEKMR+ G+FP+SNVIA+VLNAYGK R+F+KAD VY 
Sbjct: 541 ISVFGCMINLYSRNQRYVNVIEVFEKMRTAGYFPDSNVIAMVLNAYGKQREFEKADTVYR 600

Query: 606 EMQEEGCVFPDEVHFQMLGLYGARKDYKRLDSLFERLDSDPNINKKELHLVVASIYERAN 665
           EMQEEGCVFPDEVHFQML LY ++KD++ ++SLF+RL+SDPN+N KELHLVVA++YERA+
Sbjct: 601 EMQEEGCVFPDEVHFQMLSLYSSKKDFEMVESLFQRLESDPNVNSKELHLVVAALYERAD 660

Query: 666 RLNDASRIMSRMNEMAI 677
           +LNDASR+M+RM E  I
Sbjct: 661 KLNDASRVMNRMRERGI 672
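
Swiss-Prot and TrEMBL match descriptions follow the UniProt header convention, where the protein name is followed by OS (organism name), OX (taxonomy identifier), GN (gene name), PE (protein existence level) and SV (sequence version) fields. A small sketch for splitting such a description into named fields is given below; `parse_uniprot_description` is a hypothetical helper, not a UniProt or BLAST API.

```python
import re

def parse_uniprot_description(desc):
    """Split 'Name OS=... OX=... GN=... PE=... SV=...' into a dict of fields."""
    # Split only at whitespace that is directly followed by a known KEY= tag.
    parts = re.split(r"\s+(?=(?:OS|OX|GN|PE|SV)=)", desc.strip())
    fields = {"name": parts[0]}
    for part in parts[1:]:
        key, _, value = part.partition("=")
        fields[key] = value
    return fields

desc = ("Pentatricopeptide repeat-containing protein At5g39980, chloroplastic "
        "OS=Arabidopsis thaliana OX=3702 GN=At5g39980 PE=2 SV=1")
info = parse_uniprot_description(desc)
print(info["OS"], info["OX"], info["GN"])  # Arabidopsis thaliana 3702 At5g39980
```

Splitting on the tag names keeps multi-word values such as the organism name intact.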

BLAST of MS004694.1 vs. ExPASy Swiss-Prot
Match: Q9S7Q2 (Pentatricopeptide repeat-containing protein At1g74850, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PTAC2 PE=2 SV=1)

HSP 1 Score: 207.6 bits (527), Expect = 4.4e-52
Identity = 134/571 (23.47%), Positives = 270/571 (47.29%), Query Frame = 0

Query: 92  HSVDMDELLSSIGQTKNEQELYSVLSPYKGRQLSIR---FMVSLLSREPDWQRSLAILDW 151
           +S D++ L++ +        +   L  +K + LS+     +    +   DWQRSL +  +
Sbjct: 72  YSYDVESLINKLSSLPPRGSIARCLDIFKNK-LSLNDFALVFKEFAGRGDWQRSLRLFKY 131

Query: 152 VNEEALYTPSVFAYNVVIRNVLRAKQWEIAHGLFDEMRQRALAADRYTYSTLITCFGKEG 211
           +  +    P+   Y ++I  + R    +    +FDEM  + ++   ++Y+ LI  +G+ G
Sbjct: 132 MQRQIWCKPNEHIYTIMISLLGREGLLDKCLEVFDEMPSQGVSRSVFSYTALINAYGRNG 191

Query: 212 LFDAALYWLQQMEQDRVSGDLVLYSNLIE-LSRKLCDYSKAISIFSRLKRSGITPDIVAY 271
            ++ +L  L +M+ +++S  ++ Y+ +I   +R   D+   + +F+ ++  GI PDIV Y
Sbjct: 192 RYETSLELLDRMKNEKISPSILTYNTVINACARGGLDWEGLLGLFAEMRHEGIQPDIVTY 251

Query: 272 NSMINVFGKAKLFREARFLLKEMRAVGVVPDTVSYSTLLSMFVENEKFLEALSVFSEMME 331
           N++++      L  EA  + + M   G+VPD  +YS L+  F +  +  +   +  EM  
Sbjct: 252 NTLLSACAIRGLGDEAEMVFRTMNDGGIVPDLTTYSHLVETFGKLRRLEKVCDLLGEMAS 311

Query: 332 VNCPLDLTTCNVMIDVYGQLDMVKEADRLFWSMRKMGIEPNVVSYNTILRVYGEAELFGE 391
                D+T+ NV+++ Y +   +KEA  +F  M+  G  PN  +Y+ +L ++G++  + +
Sbjct: 312 GGSLPDITSYNVLLEAYAKSGSIKEAMGVFHQMQAAGCTPNANTYSVLLNLFGQSGRYDD 371

Query: 392 AIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEHEKATNLVQEMQNRGIEPNAITYSTIIS 451
              LF  M+    + +  TYN +I+++G+    ++   L  +M    IEP+  TY  II 
Sbjct: 372 VRQLFLEMKSSNTDPDAATYNILIEVFGEGGYFKEVVTLFHDMVEENIEPDMETYEGIIF 431

Query: 452 IWGKAGKLERAAMLFQKLRSSGAEIDQVLYQTMIVAYERAGLVAHAK---RLLHELKQPD 511
             GK G  E A  + Q + ++        Y  +I A+ +A L   A      +HE+    
Sbjct: 432 ACGKGGLHEDARKILQYMTANDIVPSSKAYTGVIEAFGQAALYEEALVAFNTMHEVGSNP 491

Query: 512 NIPR-KTAITILARAGRIEEATWVFRQAFDAGELKDISIFGCMIDLFSRNKKHKNVVEVF 571
           +I    + +   AR G ++E+  +  +  D+G  ++   F   I+ + +  K +  V+ +
Sbjct: 492 SIETFHSLLYSFARGGLVKESEAILSRLVDSGIPRNRDTFNAQIEAYKQGGKFEEAVKTY 551

Query: 572 EKMRSVGHFPNSNVIALVLNAYGKLRDFDKADVVYMEMQEEGCVFPDEVHFQMLGLYGAR 631
             M      P+   +  VL+ Y   R  D+    + EM+    +     +  ML +YG  
Sbjct: 552 VDMEKSRCDPDERTLEAVLSVYSFARLVDECREQFEEMKASDILPSIMCYCMMLAVYGKT 611

Query: 632 KDYKRLDSLFERLDSDPNINKKELHLVVASI 655
           + +  ++ L E + S+   N   +H V+  +
Sbjct: 612 ERWDDVNELLEEMLSNRVSN---IHQVIGQM 638

BLAST of MS004694.1 vs. ExPASy Swiss-Prot
Match: Q9LQ14 (Pentatricopeptide repeat-containing protein At1g62930, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At1g62930 PE=2 SV=2)

HSP 1 Score: 204.5 bits (519), Expect = 3.7e-51
Identity = 131/436 (30.05%), Positives = 231/436 (52.98%), Query Frame = 0

Query: 155 YTPSVFAYNVVIRNVLRAKQWEIAHGLFDEMRQRALAADRYTYSTLITCFGKEGLFDAAL 214
           Y P+   +N +I  +    +   A  L D M  R    D +TY T++    K G  D AL
Sbjct: 181 YQPNTVTFNTLIHGLFLHNKASEAVALIDRMVARGCQPDLFTYGTVVNGLCKRGDIDLAL 240

Query: 215 YWLQQMEQDRVSGDLVLYSNLIELSRKLCDY---SKAISIFSRLKRSGITPDIVAYNSMI 274
             L++ME+ ++  D+V+Y+ +I+    LC+Y   + A+++F+ +   GI P++V YNS+I
Sbjct: 241 SLLKKMEKGKIEADVVIYTTIID---ALCNYKNVNDALNLFTEMDNKGIRPNVVTYNSLI 300

Query: 275 NVFGKAKLFREARFLLKEMRAVGVVPDTVSYSTLLSMFVENEKFLEALSVFSEMMEVNCP 334
                   + +A  LL +M    + P+ V++S L+  FV+  K +EA  ++ EM++ +  
Sbjct: 301 RCLCNYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLVEAEKLYDEMIKRSID 360

Query: 335 LDLTTCNVMIDVYGQLDMVKEADRLFWSMRKMGIEPNVVSYNTILRVYGEAELFGEAIHL 394
            D+ T + +I+ +   D + EA  +F  M      PNVV+YNT+++ + +A+   E + L
Sbjct: 361 PDIFTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYNTLIKGFCKAKRVEEGMEL 420

Query: 395 FRLMQRKEIEQNVVTYNTMIKIYGKSLEHEKATNLVQEMQNRGIEPNAITYSTIISIWGK 454
           FR M ++ +  N VTYNT+I+   ++ + + A  + ++M + G+ P+ ITYS ++    K
Sbjct: 421 FREMSQRGLVGNTVTYNTLIQGLFQAGDCDMAQKIFKKMVSDGVPPDIITYSILLDGLCK 480

Query: 455 AGKLERAAMLFQKLRSSGAEIDQVLYQTMIVAYERAGLVAHAKRLLHELK----QPDNIP 514
            GKLE+A ++F+ L+ S  E D   Y  MI    +AG V     L   L     +P+ I 
Sbjct: 481 YGKLEKALVVFEYLQKSKMEPDIYTYNIMIEGMCKAGKVEDGWDLFCSLSLKGVKPNVII 540

Query: 515 RKTAITILARAGRIEEATWVFRQAFDAGELKDISIFGCMIDLFSRNKKHKNVVEVFEKMR 574
             T I+   R G  EEA  +FR+  + G L +   +  +I    R+       E+ ++MR
Sbjct: 541 YTTMISGFCRKGLKEEADALFREMKEDGTLPNSGTYNTLIRARLRDGDKAASAELIKEMR 600

Query: 575 SVGHFPNSNVIALVLN 584
           S G   +++ I++V+N
Sbjct: 601 SCGFVGDASTISMVIN 613

BLAST of MS004694.1 vs. ExPASy Swiss-Prot
Match: Q9LYZ9 (Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana OX=3702 GN=At5g02860 PE=2 SV=1)

HSP 1 Score: 199.1 bits (505), Expect = 1.6e-49
Identity = 133/519 (25.63%), Positives = 250/519 (48.17%), Query Frame = 0

Query: 173 KQWEIAHGLFD-EMRQRALAA--DRYTYSTLITCFGKEGLFDAALYWLQQMEQDRVSGDL 232
           K++++A   FD  M+Q+   +  D    + +I+  GKEG   +A      +++D  S D+
Sbjct: 149 KKFDLALRAFDWFMKQKDYQSMLDNSVVAIIISMLGKEGRVSSAANMFNGLQEDGFSLDV 208

Query: 233 VLYSNLIELSRKLCDYSKAISIFSRLKRSGITPDIVAYNSMINVFGK-AKLFREARFLLK 292
             Y++LI        Y +A+++F +++  G  P ++ YN ++NVFGK    + +   L++
Sbjct: 209 YSYTSLISAFANSGRYREAVNVFKKMEEDGCKPTLITYNVILNVFGKMGTPWNKITSLVE 268

Query: 293 EMRAVGVVPDTVSYSTLLSMFVENEKFLEALSVFSEMMEVNCPLDLTTCNVMIDVYGQLD 352
           +M++ G+ PD  +Y+TL++         EA  VF EM       D  T N ++DVYG+  
Sbjct: 269 KMKSDGIAPDAYTYNTLITCCKRGSLHQEAAQVFEEMKAAGFSYDKVTYNALLDVYGKSH 328

Query: 353 MVKEADRLFWSMRKMGIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYN 412
             KEA ++   M   G  P++V+YN+++  Y    +  EA+ L   M  K  + +V TY 
Sbjct: 329 RPKEAMKVLNEMVLNGFSPSIVTYNSLISAYARDGMLDEAMELKNQMAEKGTKPDVFTYT 388

Query: 413 TMIKIYGKSLEHEKATNLVQEMQNRGIEPNAITYSTIISIWGKAGKLERAAMLFQKLRSS 472
           T++  + ++ + E A ++ +EM+N G +PN  T++  I ++G  GK      +F ++   
Sbjct: 389 TLLSGFERAGKVESAMSIFEEMRNAGCKPNICTFNAFIKMYGNRGKFTEMMKIFDEINVC 448

Query: 473 GAEIDQVLYQTMIVAYERAGLVAHAKRLLHELKQPDNIPRK----TAITILARAGRIEEA 532
           G   D V + T++  + + G+ +    +  E+K+   +P +    T I+  +R G  E+A
Sbjct: 449 GLSPDIVTWNTLLAVFGQNGMDSEVSGVFKEMKRAGFVPERETFNTLISAYSRCGSFEQA 508

Query: 533 TWVFRQAFDAGELKDISIFGCMIDLFSRNKKHKNVVEVFEKMRSVGHFPNSNVIALVLNA 592
             V+R+  DAG   D+S +  ++   +R    +   +V  +M      PN      +L+A
Sbjct: 509 MTVYRRMLDAGVTPDLSTYNTVLAALARGGMWEQSEKVLAEMEDGRCKPNELTYCSLLHA 568

Query: 593 YGKLRDFDKADVVYMEMQEEGCVFPDEVHFQMLGLYGARKDY----KRLDSLFERLDSDP 652
           Y   ++      +  E+   G + P  V  + L L  ++ D     +R  S  +     P
Sbjct: 569 YANGKEIGLMHSLAEEVY-SGVIEPRAVLLKTLVLVCSKCDLLPEAERAFSELKERGFSP 628

Query: 653 NINKKELHLVVASIYERANRLNDASRIMSRMNEMAISRS 680
           +I        + SIY R   +  A+ ++  M E   + S
Sbjct: 629 DITTLN---SMVSIYGRRQMVAKANGVLDYMKERGFTPS 663

BLAST of MS004694.1 vs. ExPASy Swiss-Prot
Match: Q9LQ16 (Pentatricopeptide repeat-containing protein At1g62910 OS=Arabidopsis thaliana OX=3702 GN=At1g62910 PE=2 SV=1)

HSP 1 Score: 196.1 bits (497), Expect = 1.3e-48
Identity = 127/446 (28.48%), Positives = 230/446 (51.57%), Query Frame = 0

Query: 142 SLAILDWVNEEALYTPSVFAYNVVIRNVLRAKQWEIAHGLFDEMRQRALAADRYTYSTLI 201
           ++A++D + E   Y P  F +  +I  +    +   A  L D+M QR    D  TY T++
Sbjct: 172 AVALVDQMVEMG-YKPDTFTFTTLIHGLFLHNKASEAVALVDQMVQRGCQPDLVTYGTVV 231

Query: 202 TCFGKEGLFDAALYWLQQMEQDRVSGDLVLYSNLIELSRKLCDYSKAISIFSRLKRSGIT 261
               K G  D AL  L++ME+ ++  D+V+Y+ +I+   K      A+++F+ +   GI 
Sbjct: 232 NGLCKRGDIDLALSLLKKMEKGKIEADVVIYNTIIDGLCKYKHMDDALNLFTEMDNKGIR 291

Query: 262 PDIVAYNSMINVFGKAKLFREARFLLKEMRAVGVVPDTVSYSTLLSMFVENEKFLEALSV 321
           PD+  Y+S+I+       + +A  LL +M    + P+ V++S L+  FV+  K +EA  +
Sbjct: 292 PDVFTYSSLISCLCNYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLVEAEKL 351

Query: 322 FSEMMEVNCPLDLTTCNVMIDVYGQLDMVKEADRLFWSMRKMGIEPNVVSYNTILRVYGE 381
           + EM++ +   D+ T + +I+ +   D + EA  +F  M      PNVV+Y+T+++ + +
Sbjct: 352 YDEMIKRSIDPDIFTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYSTLIKGFCK 411

Query: 382 AELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEHEKATNLVQEMQNRGIEPNAIT 441
           A+   E + LFR M ++ +  N VTY T+I  + ++ + + A  + ++M + G+ PN +T
Sbjct: 412 AKRVEEGMELFREMSQRGLVGNTVTYTTLIHGFFQARDCDNAQMVFKQMVSVGVHPNILT 471

Query: 442 YSTIISIWGKAGKLERAAMLFQKLRSSGAEIDQVLYQTMIVAYERAGLVAHAKRLLHELK 501
           Y+ ++    K GKL +A ++F+ L+ S  E D   Y  MI    +AG V     L   L 
Sbjct: 472 YNILLDGLCKNGKLAKAMVVFEYLQRSTMEPDIYTYNIMIEGMCKAGKVEDGWELFCNLS 531

Query: 502 ----QPDNIPRKTAITILARAGRIEEATWVFRQAFDAGELKDISIFGCMIDLFSRNKKHK 561
                P+ I   T I+   R G  EEA  + ++  + G L +   +  +I    R+   +
Sbjct: 532 LKGVSPNVIAYNTMISGFCRKGSKEEADSLLKKMKEDGPLPNSGTYNTLIRARLRDGDRE 591

Query: 562 NVVEVFEKMRSVGHFPNSNVIALVLN 584
              E+ ++MRS G   +++ I LV N
Sbjct: 592 ASAELIKEMRSCGFAGDASTIGLVTN 616
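
The Identity and Positives percentages in an HSP header are match counts divided by the alignment length, and the counts themselves can be read off the middle line of each alignment segment: a letter marks an identical residue, a `+` marks a conservative substitution, and a blank marks a mismatch or gap. A minimal Python sketch of that arithmetic follows, using the counts reported for the Q9LQ16 hit above. The helper names `percent` and `count_midline` are hypothetical and are not part of any BLAST tool.

```python
def percent(count: int, length: int) -> float:
    """Return count/length as a percentage rounded to two decimals."""
    return round(100.0 * count / length, 2)


def count_midline(midline: str) -> tuple[int, int]:
    """Count (identities, positives) in one BLAST midline segment.

    Letters are identical residues; '+' is a conservative substitution.
    Positives include identities, as in the BLAST header.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    return identities, identities + conservative


# Counts taken from the HSP header: 127/446 identical, 230/446 positive.
identity_pct = percent(127, 446)
positives_pct = percent(230, 446)
print(identity_pct, positives_pct)  # 28.48 51.57

# Midline of the final segment (query 562..584), leading blanks dropped.
last_segment = "E+ ++MRS G   +++ I LV N"
print(count_midline(last_segment))  # (9, 15)
```

Summing the per-segment counts over every segment of an HSP should reproduce the header totals, provided the midline whitespace has survived extraction intact.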

BLAST of MS004694.1 vs. ExPASy TrEMBL
Match: A0A6J1DYU7 (pentatricopeptide repeat-containing protein At5g39980, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111024318 PE=4 SV=1)

HSP 1 Score: 1318.9 bits (3412), Expect = 0.0e+00
Identity = 674/679 (99.26%), Positives = 675/679 (99.41%), Query Frame = 0

Query: 1   ISSSSMQLESTSPFSSAIPLFLLGTRTHTPIRLNKIKRKPTTPIIILRASASSKSKDVWR 60
           ISSSSMQLESTSPFSSAIPLFLLGTR H PIRLNKIKRKPTTPIIILRASASSKSKDVWR
Sbjct: 6   ISSSSMQLESTSPFSSAIPLFLLGTRNHAPIRLNKIKRKPTTPIIILRASASSKSKDVWR 65

Query: 61  RRTPSGDSTASPFPQKYQRSARRQREPPHLDHSVDMDELLSSIGQTKNEQELYSVLSPYK 120
           RRTPSGDSTASPFPQKYQRSARRQREPPHLDHSVDMDELLSSIGQTKNEQELYSVLSPYK
Sbjct: 66  RRTPSGDSTASPFPQKYQRSARRQREPPHLDHSVDMDELLSSIGQTKNEQELYSVLSPYK 125

Query: 121 GRQLSIRFMVSLLSREPDWQRSLAILDWVNEEALYTPSVFAYNVVIRNVLRAKQWEIAHG 180
           GRQLSIRFMVSLLSREPDWQRSLAILDWVNEEALYTPSVFAYNVVIRNVLRAKQWEIAHG
Sbjct: 126 GRQLSIRFMVSLLSREPDWQRSLAILDWVNEEALYTPSVFAYNVVIRNVLRAKQWEIAHG 185

Query: 181 LFDEMRQRALAADRYTYSTLITCFGKEGLFDAALYWLQQMEQDRVSGDLVLYSNLIELSR 240
           LFDEMRQRALAADRYTYSTLITCFGKEGLFDAALYWLQQMEQDRVSGDLVLYSNLIELSR
Sbjct: 186 LFDEMRQRALAADRYTYSTLITCFGKEGLFDAALYWLQQMEQDRVSGDLVLYSNLIELSR 245

Query: 241 KLCDYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAVGVVPDTV 300
           KLCDYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAVGVVPDTV
Sbjct: 246 KLCDYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAVGVVPDTV 305

Query: 301 SYSTLLSMFVENEKFLEALSVFSEMMEVNCPLDLTTCNVMIDVYGQLDMVKEADRLFWSM 360
           SYSTLLSMFVENEKFLEALSV+SEMMEVNCPLDLTTCNVMIDVYGQLDMVKEADRLFWSM
Sbjct: 306 SYSTLLSMFVENEKFLEALSVYSEMMEVNCPLDLTTCNVMIDVYGQLDMVKEADRLFWSM 365

Query: 361 RKMGIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEH 420
           RKMGIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEH
Sbjct: 366 RKMGIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEH 425

Query: 421 EKATNLVQEMQNRGIEPNAITYSTIISIWGKAGKLERAAMLFQKLRSSGAEIDQVLYQTM 480
           EKATNLVQEMQNRGIEPNAITYSTIISIWGKAGKLERAAMLFQKLRSSGAEIDQVLYQTM
Sbjct: 426 EKATNLVQEMQNRGIEPNAITYSTIISIWGKAGKLERAAMLFQKLRSSGAEIDQVLYQTM 485

Query: 481 IVAYERAGLVAHAKRLLHELKQPDNIPRKTAITILARAGRIEEATWVFRQAFDAGELKDI 540
           IVAYERAGLVAHAKRLLHELKQPDNIPRKTAITILARAGRIEEATWVFRQAFDAGELKDI
Sbjct: 486 IVAYERAGLVAHAKRLLHELKQPDNIPRKTAITILARAGRIEEATWVFRQAFDAGELKDI 545

Query: 541 SIFGCMIDLFSRNKKHKNVVEVFEKMRSVGHFPNSNVIALVLNAYGKLRDFDKADVVYME 600
           SIFGCMIDLFSRNKKHKNVVEVFEKMRSVGHFPNSNVIALVLNAYGKLRDFDKAD VYME
Sbjct: 546 SIFGCMIDLFSRNKKHKNVVEVFEKMRSVGHFPNSNVIALVLNAYGKLRDFDKADAVYME 605

Query: 601 MQEEGCVFPDEVHFQMLGLYGARKDYKRLDSLFERLDSDPNINKKELHLVVASIYERANR 660
           MQEEGCVF DEVHFQMLGLYGARKDYKRLDSLFERLDSDPNINKKELHLVVASIYERANR
Sbjct: 606 MQEEGCVFSDEVHFQMLGLYGARKDYKRLDSLFERLDSDPNINKKELHLVVASIYERANR 665

Query: 661 LNDASRIMSRMNEMAISRS 680
           LNDASRIMSRMNEMAISRS
Sbjct: 666 LNDASRIMSRMNEMAISRS 684

BLAST of MS004694.1 vs. ExPASy TrEMBL
Match: A0A5A7UGV7 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G008300 PE=4 SV=1)

HSP 1 Score: 1210.3 bits (3130), Expect = 0.0e+00
Identity = 615/676 (90.98%), Positives = 643/676 (95.12%), Query Frame = 0

Query: 4   SSMQLESTSPFSSAIPLFLLGTRTHTPIRLNKIKRKPTTPIIILRASASSKSKDVWRRRT 63
           SS+QL+ST  FSS IPLFL+ TR +  IR NKIK KP T I I RAS+SS SKD+WRR+T
Sbjct: 4   SSIQLQSTLSFSSPIPLFLIETRDYPKIRFNKIKTKPRTRIPIFRASSSSASKDIWRRKT 63

Query: 64  PSGDSTASPFPQKYQRSARRQREPPHLDHSVDMDELLSSIGQTKNEQELYSVLSPYKGRQ 123
           PS  ST +  PQKYQRSARRQRE  HLDHS+DMDELL+SIGQTKNEQELYSVLSPYKGRQ
Sbjct: 64  PSEKSTTTLLPQKYQRSARRQRESSHLDHSIDMDELLASIGQTKNEQELYSVLSPYKGRQ 123

Query: 124 LSIRFMVSLLSREPDWQRSLAILDWVNEEALYTPSVFAYNVVIRNVLRAKQWEIAHGLFD 183
           LS+RFMVSLLSRE DWQRSLAILDW+NEEALYTPSV+AYNVV+RNVLRAKQWE+AHGLFD
Sbjct: 124 LSMRFMVSLLSRESDWQRSLAILDWINEEALYTPSVYAYNVVLRNVLRAKQWELAHGLFD 183

Query: 184 EMRQRALAADRYTYSTLITCFGKEGLFDAALYWLQQMEQDRVSGDLVLYSNLIELSRKLC 243
           EMRQRALAADRYTYSTLIT FGKEG+FDAAL WLQ+MEQDRVSGDLVLYSNLIELSRKLC
Sbjct: 184 EMRQRALAADRYTYSTLITYFGKEGMFDAALSWLQKMEQDRVSGDLVLYSNLIELSRKLC 243

Query: 244 DYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAVGVVPDTVSYS 303
           DYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAV V+PDTVSYS
Sbjct: 244 DYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAVDVMPDTVSYS 303

Query: 304 TLLSMFVENEKFLEALSVFSEMMEVNCPLDLTTCNVMIDVYGQLDMVKEADRLFWSMRKM 363
           TLL+MFVENEKFLEALSVFSEM EVNCPLDLTTCN+MIDVYGQLDMVKEADRLFW MRKM
Sbjct: 304 TLLNMFVENEKFLEALSVFSEMTEVNCPLDLTTCNIMIDVYGQLDMVKEADRLFWRMRKM 363

Query: 364 GIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEHEKA 423
           GIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEHEKA
Sbjct: 364 GIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEHEKA 423

Query: 424 TNLVQEMQNRGIEPNAITYSTIISIWGKAGKLERAAMLFQKLRSSGAEIDQVLYQTMIVA 483
           TNLVQEMQNRGIEPNAITYSTIISIWGKAGKLERAAMLFQKLRSS AEIDQVLYQTMIVA
Sbjct: 424 TNLVQEMQNRGIEPNAITYSTIISIWGKAGKLERAAMLFQKLRSSRAEIDQVLYQTMIVA 483

Query: 484 YERAGLVAHAKRLLHELKQPDNIPRKTAITILARAGRIEEATWVFRQAFDAGELKDISIF 543
           YE+AGLVAHAKRLLHELKQPDNIPRKTAITILA+AGRIEEATWVFRQAFDAGELKDIS+F
Sbjct: 484 YEKAGLVAHAKRLLHELKQPDNIPRKTAITILAKAGRIEEATWVFRQAFDAGELKDISVF 543

Query: 544 GCMIDLFSRNKKHKNVVEVFEKMRSVGHFPNSNVIALVLNAYGKLRDFDKADVVYMEMQE 603
           GCMIDLFSRNKKHKNVVEVFEKMRSVGHFPNSNVIALVLNAYGKLRDFD AD VYMEMQE
Sbjct: 544 GCMIDLFSRNKKHKNVVEVFEKMRSVGHFPNSNVIALVLNAYGKLRDFDTADAVYMEMQE 603

Query: 604 EGCVFPDEVHFQMLGLYGARKDYKRLDSLFERLDSDPNINKKELHLVVASIYERANRLND 663
           +GC+FPDEVHFQML LYGARKDYKRL+SLFERLDSDPNINKKELHLVVASIYER NRLN+
Sbjct: 604 KGCIFPDEVHFQMLSLYGARKDYKRLESLFERLDSDPNINKKELHLVVASIYERGNRLNN 663

Query: 664 ASRIMSRMNEMAISRS 680
           ASRIM+RMNE AISRS
Sbjct: 664 ASRIMNRMNETAISRS 679

BLAST of MS004694.1 vs. ExPASy TrEMBL
Match: A0A1S3B8T9 (pentatricopeptide repeat-containing protein At5g39980, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103487030 PE=4 SV=1)

HSP 1 Score: 1210.3 bits (3130), Expect = 0.0e+00
Identity = 615/676 (90.98%), Positives = 643/676 (95.12%), Query Frame = 0

Query: 4   SSMQLESTSPFSSAIPLFLLGTRTHTPIRLNKIKRKPTTPIIILRASASSKSKDVWRRRT 63
           SS+QL+ST  FSS IPLFL+ TR +  IR NKIK KP T I I RAS+SS SKD+WRR+T
Sbjct: 4   SSIQLQSTLSFSSPIPLFLIETRDYPKIRFNKIKTKPRTRIPIFRASSSSASKDIWRRKT 63

Query: 64  PSGDSTASPFPQKYQRSARRQREPPHLDHSVDMDELLSSIGQTKNEQELYSVLSPYKGRQ 123
           PS  ST +  PQKYQRSARRQRE  HLDHS+DMDELL+SIGQTKNEQELYSVLSPYKGRQ
Sbjct: 64  PSEKSTTTLLPQKYQRSARRQRESSHLDHSIDMDELLASIGQTKNEQELYSVLSPYKGRQ 123

Query: 124 LSIRFMVSLLSREPDWQRSLAILDWVNEEALYTPSVFAYNVVIRNVLRAKQWEIAHGLFD 183
           LS+RFMVSLLSRE DWQRSLAILDW+NEEALYTPSV+AYNVV+RNVLRAKQWE+AHGLFD
Sbjct: 124 LSMRFMVSLLSRESDWQRSLAILDWINEEALYTPSVYAYNVVLRNVLRAKQWELAHGLFD 183

Query: 184 EMRQRALAADRYTYSTLITCFGKEGLFDAALYWLQQMEQDRVSGDLVLYSNLIELSRKLC 243
           EMRQRALAADRYTYSTLIT FGKEG+FDAAL WLQ+MEQDRVSGDLVLYSNLIELSRKLC
Sbjct: 184 EMRQRALAADRYTYSTLITYFGKEGMFDAALSWLQKMEQDRVSGDLVLYSNLIELSRKLC 243

Query: 244 DYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAVGVVPDTVSYS 303
           DYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAV V+PDTVSYS
Sbjct: 244 DYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAVDVMPDTVSYS 303

Query: 304 TLLSMFVENEKFLEALSVFSEMMEVNCPLDLTTCNVMIDVYGQLDMVKEADRLFWSMRKM 363
           TLL+MFVENEKFLEALSVFSEM EVNCPLDLTTCN+MIDVYGQLDMVKEADRLFW MRKM
Sbjct: 304 TLLNMFVENEKFLEALSVFSEMTEVNCPLDLTTCNIMIDVYGQLDMVKEADRLFWRMRKM 363

Query: 364 GIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEHEKA 423
           GIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEHEKA
Sbjct: 364 GIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEHEKA 423

Query: 424 TNLVQEMQNRGIEPNAITYSTIISIWGKAGKLERAAMLFQKLRSSGAEIDQVLYQTMIVA 483
           TNLVQEMQNRGIEPNAITYSTIISIWGKAGKLERAAMLFQKLRSS AEIDQVLYQTMIVA
Sbjct: 424 TNLVQEMQNRGIEPNAITYSTIISIWGKAGKLERAAMLFQKLRSSRAEIDQVLYQTMIVA 483

Query: 484 YERAGLVAHAKRLLHELKQPDNIPRKTAITILARAGRIEEATWVFRQAFDAGELKDISIF 543
           YE+AGLVAHAKRLLHELKQPDNIPRKTAITILA+AGRIEEATWVFRQAFDAGELKDIS+F
Sbjct: 484 YEKAGLVAHAKRLLHELKQPDNIPRKTAITILAKAGRIEEATWVFRQAFDAGELKDISVF 543

Query: 544 GCMIDLFSRNKKHKNVVEVFEKMRSVGHFPNSNVIALVLNAYGKLRDFDKADVVYMEMQE 603
           GCMIDLFSRNKKHKNVVEVFEKMRSVGHFPNSNVIALVLNAYGKLRDFD AD VYMEMQE
Sbjct: 544 GCMIDLFSRNKKHKNVVEVFEKMRSVGHFPNSNVIALVLNAYGKLRDFDTADAVYMEMQE 603

Query: 604 EGCVFPDEVHFQMLGLYGARKDYKRLDSLFERLDSDPNINKKELHLVVASIYERANRLND 663
           +GC+FPDEVHFQML LYGARKDYKRL+SLFERLDSDPNINKKELHLVVASIYER NRLN+
Sbjct: 604 KGCIFPDEVHFQMLSLYGARKDYKRLESLFERLDSDPNINKKELHLVVASIYERGNRLNN 663

Query: 664 ASRIMSRMNEMAISRS 680
           ASRIM+RMNE AISRS
Sbjct: 664 ASRIMNRMNETAISRS 679

BLAST of MS004694.1 vs. ExPASy TrEMBL
Match: A0A6J1F2M9 (pentatricopeptide repeat-containing protein At5g39980, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111441589 PE=4 SV=1)

HSP 1 Score: 1206.8 bits (3121), Expect = 0.0e+00
Identity = 613/679 (90.28%), Positives = 646/679 (95.14%), Query Frame = 0

Query: 1   ISSSSMQLESTSPFSSAIPLFLLGTRTHTPIRLNKIKRKPTTPIIILRASASSKSKDVWR 60
           IS SSMQL+STS FSS + L L+ TR +  + L++ K KP   I I RAS+SS SKD+WR
Sbjct: 6   ISPSSMQLQSTSSFSSPVFLCLIETRNNLKVCLSEFKSKPRIRISIPRASSSSTSKDIWR 65

Query: 61  RRTPSGDSTASPFPQKYQRSARRQREPPHLDHSVDMDELLSSIGQTKNEQELYSVLSPYK 120
           R+T S  ST +PFPQKY+RSARRQRE  HLDHSVDMDELL+SIGQTKNEQELYS+LSPYK
Sbjct: 66  RKTSSEKSTTTPFPQKYERSARRQRESSHLDHSVDMDELLASIGQTKNEQELYSILSPYK 125

Query: 121 GRQLSIRFMVSLLSREPDWQRSLAILDWVNEEALYTPSVFAYNVVIRNVLRAKQWEIAHG 180
           GRQLS+RFMVS+LSRE DWQRSLAILDW+NEEALYTPSVFAYNVV+RNVLRAKQWE+AHG
Sbjct: 126 GRQLSMRFMVSILSRESDWQRSLAILDWINEEALYTPSVFAYNVVLRNVLRAKQWELAHG 185

Query: 181 LFDEMRQRALAADRYTYSTLITCFGKEGLFDAALYWLQQMEQDRVSGDLVLYSNLIELSR 240
           LFDEMRQRALAADRYTYSTLIT FGKEG+FDAAL WLQ+M+QDRVSGDLVLYSNLIELSR
Sbjct: 186 LFDEMRQRALAADRYTYSTLITYFGKEGMFDAALSWLQKMDQDRVSGDLVLYSNLIELSR 245

Query: 241 KLCDYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAVGVVPDTV 300
           KLCDYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAV VVPDTV
Sbjct: 246 KLCDYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAVDVVPDTV 305

Query: 301 SYSTLLSMFVENEKFLEALSVFSEMMEVNCPLDLTTCNVMIDVYGQLDMVKEADRLFWSM 360
           SYSTLLSMFVENEKFLEALSVFSEM EVNCPL+LTTCN+MIDVYGQLDMVKEADRLFWSM
Sbjct: 306 SYSTLLSMFVENEKFLEALSVFSEMTEVNCPLNLTTCNIMIDVYGQLDMVKEADRLFWSM 365

Query: 361 RKMGIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEH 420
           RKMGIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEH
Sbjct: 366 RKMGIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEH 425

Query: 421 EKATNLVQEMQNRGIEPNAITYSTIISIWGKAGKLERAAMLFQKLRSSGAEIDQVLYQTM 480
           EKATNL+QEMQNRGIEPNAITYSTIISIWGKAGKL+RAAMLFQKLRSSGAEIDQVLYQTM
Sbjct: 426 EKATNLIQEMQNRGIEPNAITYSTIISIWGKAGKLDRAAMLFQKLRSSGAEIDQVLYQTM 485

Query: 481 IVAYERAGLVAHAKRLLHELKQPDNIPRKTAITILARAGRIEEATWVFRQAFDAGELKDI 540
           IVAYERAGLVAHAKRL+ ELKQPDNIPRKTAITILARAGR+EEATWVFRQAFDAGELKDI
Sbjct: 486 IVAYERAGLVAHAKRLIQELKQPDNIPRKTAITILARAGRVEEATWVFRQAFDAGELKDI 545

Query: 541 SIFGCMIDLFSRNKKHKNVVEVFEKMRSVGHFPNSNVIALVLNAYGKLRDFDKADVVYME 600
           S+FGCMIDLFSRNKKHKNVVEVFEKMRSVGHFP+SNVIALVLNAYGKLRDFDKAD VYME
Sbjct: 546 SVFGCMIDLFSRNKKHKNVVEVFEKMRSVGHFPSSNVIALVLNAYGKLRDFDKADAVYME 605

Query: 601 MQEEGCVFPDEVHFQMLGLYGARKDYKRLDSLFERLDSDPNINKKELHLVVASIYERANR 660
           MQEEGCVFPDEVHFQML LYGARKDYKRL+ LFERLDSDPNINKKELHLVVASIYERANR
Sbjct: 606 MQEEGCVFPDEVHFQMLSLYGARKDYKRLEELFERLDSDPNINKKELHLVVASIYERANR 665

Query: 661 LNDASRIMSRMNEMAISRS 680
           LNDASRIM+RMNEM+ISRS
Sbjct: 666 LNDASRIMNRMNEMSISRS 684

BLAST of MS004694.1 vs. ExPASy TrEMBL
Match: A0A6J1J8J4 (pentatricopeptide repeat-containing protein At5g39980, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111482255 PE=4 SV=1)

HSP 1 Score: 1204.5 bits (3115), Expect = 0.0e+00
Identity = 612/679 (90.13%), Positives = 645/679 (94.99%), Query Frame = 0

Query: 1   ISSSSMQLESTSPFSSAIPLFLLGTRTHTPIRLNKIKRKPTTPIIILRASASSKSKDVWR 60
           IS SSMQL+STS FSS + L L  TR +  + L++ K KP   I I RAS+SS SKD+WR
Sbjct: 6   ISPSSMQLQSTSSFSSPMFLCLNETRNNLKVCLSEFKSKPRIRISIPRASSSSTSKDIWR 65

Query: 61  RRTPSGDSTASPFPQKYQRSARRQREPPHLDHSVDMDELLSSIGQTKNEQELYSVLSPYK 120
           R+TPS  ST +PFPQKY+RSARRQRE  HLDHSVDMDELL+SIGQTKNEQELYS+LSPYK
Sbjct: 66  RKTPSEKSTTTPFPQKYERSARRQRESSHLDHSVDMDELLASIGQTKNEQELYSILSPYK 125

Query: 121 GRQLSIRFMVSLLSREPDWQRSLAILDWVNEEALYTPSVFAYNVVIRNVLRAKQWEIAHG 180
           GRQLS+RFMVS+LSRE DWQRSLAILDW+NEEALYTPSVFAYNVV+RNVLRAKQWE+AHG
Sbjct: 126 GRQLSMRFMVSILSRESDWQRSLAILDWINEEALYTPSVFAYNVVLRNVLRAKQWELAHG 185

Query: 181 LFDEMRQRALAADRYTYSTLITCFGKEGLFDAALYWLQQMEQDRVSGDLVLYSNLIELSR 240
           LFDEMRQRALAADRYTYSTLIT FGKEG+FDAAL WLQ+MEQDRVSGDLVLYSNLIELSR
Sbjct: 186 LFDEMRQRALAADRYTYSTLITYFGKEGMFDAALSWLQKMEQDRVSGDLVLYSNLIELSR 245

Query: 241 KLCDYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAVGVVPDTV 300
           KLCDYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAV VVPDTV
Sbjct: 246 KLCDYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAVDVVPDTV 305

Query: 301 SYSTLLSMFVENEKFLEALSVFSEMMEVNCPLDLTTCNVMIDVYGQLDMVKEADRLFWSM 360
           SYSTLLSMFVENEKFLEALSVFSEM EVNCPL+LTTCN+MIDVYGQLDMVKEADRLFWSM
Sbjct: 306 SYSTLLSMFVENEKFLEALSVFSEMTEVNCPLNLTTCNIMIDVYGQLDMVKEADRLFWSM 365

Query: 361 RKMGIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEH 420
           RKMGIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEH
Sbjct: 366 RKMGIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEH 425

Query: 421 EKATNLVQEMQNRGIEPNAITYSTIISIWGKAGKLERAAMLFQKLRSSGAEIDQVLYQTM 480
           EKATNL+QEMQNRGIEPNAITYSTIISIWGKAGKL+RAAMLFQKLRSSGAEIDQVLYQTM
Sbjct: 426 EKATNLIQEMQNRGIEPNAITYSTIISIWGKAGKLDRAAMLFQKLRSSGAEIDQVLYQTM 485

Query: 481 IVAYERAGLVAHAKRLLHELKQPDNIPRKTAITILARAGRIEEATWVFRQAFDAGELKDI 540
           IVAYERAGLVAHAKRL+ ELKQPDNIPRKTAITILARAGR+EEATWVFRQAFDAGELKDI
Sbjct: 486 IVAYERAGLVAHAKRLIQELKQPDNIPRKTAITILARAGRVEEATWVFRQAFDAGELKDI 545

Query: 541 SIFGCMIDLFSRNKKHKNVVEVFEKMRSVGHFPNSNVIALVLNAYGKLRDFDKADVVYME 600
           S+FGCMIDLFSRNK+HKNVVEVFEKMRSVGHFP+SNVIALVLNAYGKLRDFDKAD VYME
Sbjct: 546 SVFGCMIDLFSRNKRHKNVVEVFEKMRSVGHFPSSNVIALVLNAYGKLRDFDKADAVYME 605

Query: 601 MQEEGCVFPDEVHFQMLGLYGARKDYKRLDSLFERLDSDPNINKKELHLVVASIYERANR 660
           MQE+GCVFPDEVHFQML LYGARKDYKRL+ LFERLDSDPNINKKELHLVVA IYERANR
Sbjct: 606 MQEQGCVFPDEVHFQMLSLYGARKDYKRLEELFERLDSDPNINKKELHLVVAGIYERANR 665

Query: 661 LNDASRIMSRMNEMAISRS 680
           LNDASRIM+RMNEM+ISRS
Sbjct: 666 LNDASRIMNRMNEMSISRS 684

BLAST of MS004694.1 vs. TAIR 10
Match: AT5G39980.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 1008.8 bits (2607), Expect = 2.0e-294
Identity = 518/677 (76.51%), Positives = 598/677 (88.33%), Query Frame = 0

Query: 6   MQLESTSPFSSAIPLFLLGTRTH--TPIRLNKI-KRKPTTPIIILRASASSKS---KDVW 65
           M +E  S  S ++PL  L TR H  T I  + I + +    I  + AS+SS+S   K VW
Sbjct: 1   MYIEIASSSSLSLPLLPL-TRPHIYTSIPFSTIPEARQRNLIFTVSASSSSESTQNKKVW 60

Query: 66  RRRTPSGDSTASPFPQKYQRSARRQREPPHLDHSVDMDELLSSIGQTKNEQELYSVLSPY 125
           R++ P  ++T+S    +  R  RR +    LDH+VDMDELL+SI QT+NE+EL+S+LS Y
Sbjct: 61  RKQ-PEKNTTSS---FQALRKHRRYQRSAFLDHNVDMDELLASIHQTQNEKELFSLLSTY 120

Query: 126 KGRQLSIRFMVSLLSREPDWQRSLAILDWVNEEALYTPSVFAYNVVIRNVLRAKQWEIAH 185
           K RQLSIRFMVSLLSRE DWQRSLA+LDWV+EEA YTPSVFAYNVV+RNVLRAKQ++IAH
Sbjct: 121 KDRQLSIRFMVSLLSRENDWQRSLALLDWVHEEAKYTPSVFAYNVVLRNVLRAKQFDIAH 180

Query: 186 GLFDEMRQRALAADRYTYSTLITCFGKEGLFDAALYWLQQMEQDRVSGDLVLYSNLIELS 245
           GLFDEMRQRALA DRYTYSTLIT FGKEG+FD+AL WLQ+MEQDRVSGDLVLYSNLIELS
Sbjct: 181 GLFDEMRQRALAPDRYTYSTLITSFGKEGMFDSALSWLQKMEQDRVSGDLVLYSNLIELS 240

Query: 246 RKLCDYSKAISIFSRLKRSGITPDIVAYNSMINVFGKAKLFREARFLLKEMRAVGVVPDT 305
           R+LCDYSKAISIFSRLKRSGITPD+VAYNSMINV+GKAKLFREAR L+KEM   GV+P+T
Sbjct: 241 RRLCDYSKAISIFSRLKRSGITPDLVAYNSMINVYGKAKLFREARLLIKEMNEAGVLPNT 300

Query: 306 VSYSTLLSMFVENEKFLEALSVFSEMMEVNCPLDLTTCNVMIDVYGQLDMVKEADRLFWS 365
           VSYSTLLS++VEN KFLEALSVF+EM EVNC LDLTTCN+MIDVYGQLDMVKEADRLFWS
Sbjct: 301 VSYSTLLSVYVENHKFLEALSVFAEMKEVNCALDLTTCNIMIDVYGQLDMVKEADRLFWS 360

Query: 366 MRKMGIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLE 425
           +RKM IEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRK+IEQNVVTYNTMIKIYGK++E
Sbjct: 361 LRKMDIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKDIEQNVVTYNTMIKIYGKTME 420

Query: 426 HEKATNLVQEMQNRGIEPNAITYSTIISIWGKAGKLERAAMLFQKLRSSGAEIDQVLYQT 485
           HEKATNLVQEMQ+RGIEPNAITYSTIISIWGKAGKL+RAA LFQKLRSSG EIDQVLYQT
Sbjct: 421 HEKATNLVQEMQSRGIEPNAITYSTIISIWGKAGKLDRAATLFQKLRSSGVEIDQVLYQT 480

Query: 486 MIVAYERAGLVAHAKRLLHELKQPDNIPRKTAITILARAGRIEEATWVFRQAFDAGELKD 545
           MIVAYER GL+ HAKRLLHELK PDNIPR+TAITILA+AGR EEATWVFRQAF++GE+KD
Sbjct: 481 MIVAYERVGLMGHAKRLLHELKLPDNIPRETAITILAKAGRTEEATWVFRQAFESGEVKD 540

Query: 546 ISIFGCMIDLFSRNKKHKNVVEVFEKMRSVGHFPNSNVIALVLNAYGKLRDFDKADVVYM 605
           IS+FGCMI+L+SRN+++ NV+EVFEKMR+ G+FP+SNVIA+VLNAYGK R+F+KAD VY 
Sbjct: 541 ISVFGCMINLYSRNQRYVNVIEVFEKMRTAGYFPDSNVIAMVLNAYGKQREFEKADTVYR 600

Query: 606 EMQEEGCVFPDEVHFQMLGLYGARKDYKRLDSLFERLDSDPNINKKELHLVVASIYERAN 665
           EMQEEGCVFPDEVHFQML LY ++KD++ ++SLF+RL+SDPN+N KELHLVVA++YERA+
Sbjct: 601 EMQEEGCVFPDEVHFQMLSLYSSKKDFEMVESLFQRLESDPNVNSKELHLVVAALYERAD 660

Query: 666 RLNDASRIMSRMNEMAI 677
           +LNDASR+M+RM E  I
Sbjct: 661 KLNDASRVMNRMRERGI 672

BLAST of MS004694.1 vs. TAIR 10
Match: AT1G74850.1 (plastid transcriptionally active 2 )

HSP 1 Score: 207.6 bits (527), Expect = 3.1e-53
Identity = 134/571 (23.47%), Positives = 270/571 (47.29%), Query Frame = 0

Query: 92  HSVDMDELLSSIGQTKNEQELYSVLSPYKGRQLSIR---FMVSLLSREPDWQRSLAILDW 151
           +S D++ L++ +        +   L  +K + LS+     +    +   DWQRSL +  +
Sbjct: 72  YSYDVESLINKLSSLPPRGSIARCLDIFKNK-LSLNDFALVFKEFAGRGDWQRSLRLFKY 131

Query: 152 VNEEALYTPSVFAYNVVIRNVLRAKQWEIAHGLFDEMRQRALAADRYTYSTLITCFGKEG 211
           +  +    P+   Y ++I  + R    +    +FDEM  + ++   ++Y+ LI  +G+ G
Sbjct: 132 MQRQIWCKPNEHIYTIMISLLGREGLLDKCLEVFDEMPSQGVSRSVFSYTALINAYGRNG 191

Query: 212 LFDAALYWLQQMEQDRVSGDLVLYSNLIE-LSRKLCDYSKAISIFSRLKRSGITPDIVAY 271
            ++ +L  L +M+ +++S  ++ Y+ +I   +R   D+   + +F+ ++  GI PDIV Y
Sbjct: 192 RYETSLELLDRMKNEKISPSILTYNTVINACARGGLDWEGLLGLFAEMRHEGIQPDIVTY 251

Query: 272 NSMINVFGKAKLFREARFLLKEMRAVGVVPDTVSYSTLLSMFVENEKFLEALSVFSEMME 331
           N++++      L  EA  + + M   G+VPD  +YS L+  F +  +  +   +  EM  
Sbjct: 252 NTLLSACAIRGLGDEAEMVFRTMNDGGIVPDLTTYSHLVETFGKLRRLEKVCDLLGEMAS 311

Query: 332 VNCPLDLTTCNVMIDVYGQLDMVKEADRLFWSMRKMGIEPNVVSYNTILRVYGEAELFGE 391
                D+T+ NV+++ Y +   +KEA  +F  M+  G  PN  +Y+ +L ++G++  + +
Sbjct: 312 GGSLPDITSYNVLLEAYAKSGSIKEAMGVFHQMQAAGCTPNANTYSVLLNLFGQSGRYDD 371

Query: 392 AIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEHEKATNLVQEMQNRGIEPNAITYSTIIS 451
              LF  M+    + +  TYN +I+++G+    ++   L  +M    IEP+  TY  II 
Sbjct: 372 VRQLFLEMKSSNTDPDAATYNILIEVFGEGGYFKEVVTLFHDMVEENIEPDMETYEGIIF 431

Query: 452 IWGKAGKLERAAMLFQKLRSSGAEIDQVLYQTMIVAYERAGLVAHAK---RLLHELKQPD 511
             GK G  E A  + Q + ++        Y  +I A+ +A L   A      +HE+    
Sbjct: 432 ACGKGGLHEDARKILQYMTANDIVPSSKAYTGVIEAFGQAALYEEALVAFNTMHEVGSNP 491

Query: 512 NIPR-KTAITILARAGRIEEATWVFRQAFDAGELKDISIFGCMIDLFSRNKKHKNVVEVF 571
           +I    + +   AR G ++E+  +  +  D+G  ++   F   I+ + +  K +  V+ +
Sbjct: 492 SIETFHSLLYSFARGGLVKESEAILSRLVDSGIPRNRDTFNAQIEAYKQGGKFEEAVKTY 551

Query: 572 EKMRSVGHFPNSNVIALVLNAYGKLRDFDKADVVYMEMQEEGCVFPDEVHFQMLGLYGAR 631
             M      P+   +  VL+ Y   R  D+    + EM+    +     +  ML +YG  
Sbjct: 552 VDMEKSRCDPDERTLEAVLSVYSFARLVDECREQFEEMKASDILPSIMCYCMMLAVYGKT 611

Query: 632 KDYKRLDSLFERLDSDPNINKKELHLVVASI 655
           + +  ++ L E + S+   N   +H V+  +
Sbjct: 612 ERWDDVNELLEEMLSNRVSN---IHQVIGQM 638

BLAST of MS004694.1 vs. TAIR 10
Match: AT1G62930.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 204.5 bits (519), Expect = 2.7e-52
Identity = 131/436 (30.05%), Positives = 231/436 (52.98%), Query Frame = 0

Query: 155 YTPSVFAYNVVIRNVLRAKQWEIAHGLFDEMRQRALAADRYTYSTLITCFGKEGLFDAAL 214
           Y P+   +N +I  +    +   A  L D M  R    D +TY T++    K G  D AL
Sbjct: 181 YQPNTVTFNTLIHGLFLHNKASEAVALIDRMVARGCQPDLFTYGTVVNGLCKRGDIDLAL 240

Query: 215 YWLQQMEQDRVSGDLVLYSNLIELSRKLCDY---SKAISIFSRLKRSGITPDIVAYNSMI 274
             L++ME+ ++  D+V+Y+ +I+    LC+Y   + A+++F+ +   GI P++V YNS+I
Sbjct: 241 SLLKKMEKGKIEADVVIYTTIID---ALCNYKNVNDALNLFTEMDNKGIRPNVVTYNSLI 300

Query: 275 NVFGKAKLFREARFLLKEMRAVGVVPDTVSYSTLLSMFVENEKFLEALSVFSEMMEVNCP 334
                   + +A  LL +M    + P+ V++S L+  FV+  K +EA  ++ EM++ +  
Sbjct: 301 RCLCNYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLVEAEKLYDEMIKRSID 360

Query: 335 LDLTTCNVMIDVYGQLDMVKEADRLFWSMRKMGIEPNVVSYNTILRVYGEAELFGEAIHL 394
            D+ T + +I+ +   D + EA  +F  M      PNVV+YNT+++ + +A+   E + L
Sbjct: 361 PDIFTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYNTLIKGFCKAKRVEEGMEL 420

Query: 395 FRLMQRKEIEQNVVTYNTMIKIYGKSLEHEKATNLVQEMQNRGIEPNAITYSTIISIWGK 454
           FR M ++ +  N VTYNT+I+   ++ + + A  + ++M + G+ P+ ITYS ++    K
Sbjct: 421 FREMSQRGLVGNTVTYNTLIQGLFQAGDCDMAQKIFKKMVSDGVPPDIITYSILLDGLCK 480

Query: 455 AGKLERAAMLFQKLRSSGAEIDQVLYQTMIVAYERAGLVAHAKRLLHELK----QPDNIP 514
            GKLE+A ++F+ L+ S  E D   Y  MI    +AG V     L   L     +P+ I 
Sbjct: 481 YGKLEKALVVFEYLQKSKMEPDIYTYNIMIEGMCKAGKVEDGWDLFCSLSLKGVKPNVII 540

Query: 515 RKTAITILARAGRIEEATWVFRQAFDAGELKDISIFGCMIDLFSRNKKHKNVVEVFEKMR 574
             T I+   R G  EEA  +FR+  + G L +   +  +I    R+       E+ ++MR
Sbjct: 541 YTTMISGFCRKGLKEEADALFREMKEDGTLPNSGTYNTLIRARLRDGDKAASAELIKEMR 600

Query: 575 SVGHFPNSNVIALVLN 584
           S G   +++ I++V+N
Sbjct: 601 SCGFVGDASTISMVIN 613

BLAST of MS004694.1 vs. TAIR 10
Match: AT5G02860.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 199.1 bits (505), Expect = 1.1e-50
Identity = 133/519 (25.63%), Positives = 250/519 (48.17%), Query Frame = 0

Query: 173 KQWEIAHGLFD-EMRQRALAA--DRYTYSTLITCFGKEGLFDAALYWLQQMEQDRVSGDL 232
           K++++A   FD  M+Q+   +  D    + +I+  GKEG   +A      +++D  S D+
Sbjct: 149 KKFDLALRAFDWFMKQKDYQSMLDNSVVAIIISMLGKEGRVSSAANMFNGLQEDGFSLDV 208

Query: 233 VLYSNLIELSRKLCDYSKAISIFSRLKRSGITPDIVAYNSMINVFGK-AKLFREARFLLK 292
             Y++LI        Y +A+++F +++  G  P ++ YN ++NVFGK    + +   L++
Sbjct: 209 YSYTSLISAFANSGRYREAVNVFKKMEEDGCKPTLITYNVILNVFGKMGTPWNKITSLVE 268

Query: 293 EMRAVGVVPDTVSYSTLLSMFVENEKFLEALSVFSEMMEVNCPLDLTTCNVMIDVYGQLD 352
           +M++ G+ PD  +Y+TL++         EA  VF EM       D  T N ++DVYG+  
Sbjct: 269 KMKSDGIAPDAYTYNTLITCCKRGSLHQEAAQVFEEMKAAGFSYDKVTYNALLDVYGKSH 328

Query: 353 MVKEADRLFWSMRKMGIEPNVVSYNTILRVYGEAELFGEAIHLFRLMQRKEIEQNVVTYN 412
             KEA ++   M   G  P++V+YN+++  Y    +  EA+ L   M  K  + +V TY 
Sbjct: 329 RPKEAMKVLNEMVLNGFSPSIVTYNSLISAYARDGMLDEAMELKNQMAEKGTKPDVFTYT 388

Query: 413 TMIKIYGKSLEHEKATNLVQEMQNRGIEPNAITYSTIISIWGKAGKLERAAMLFQKLRSS 472
           T++  + ++ + E A ++ +EM+N G +PN  T++  I ++G  GK      +F ++   
Sbjct: 389 TLLSGFERAGKVESAMSIFEEMRNAGCKPNICTFNAFIKMYGNRGKFTEMMKIFDEINVC 448

Query: 473 GAEIDQVLYQTMIVAYERAGLVAHAKRLLHELKQPDNIPRK----TAITILARAGRIEEA 532
           G   D V + T++  + + G+ +    +  E+K+   +P +    T I+  +R G  E+A
Sbjct: 449 GLSPDIVTWNTLLAVFGQNGMDSEVSGVFKEMKRAGFVPERETFNTLISAYSRCGSFEQA 508

Query: 533 TWVFRQAFDAGELKDISIFGCMIDLFSRNKKHKNVVEVFEKMRSVGHFPNSNVIALVLNA 592
             V+R+  DAG   D+S +  ++   +R    +   +V  +M      PN      +L+A
Sbjct: 509 MTVYRRMLDAGVTPDLSTYNTVLAALARGGMWEQSEKVLAEMEDGRCKPNELTYCSLLHA 568

Query: 593 YGKLRDFDKADVVYMEMQEEGCVFPDEVHFQMLGLYGARKDY----KRLDSLFERLDSDP 652
           Y   ++      +  E+   G + P  V  + L L  ++ D     +R  S  +     P
Sbjct: 569 YANGKEIGLMHSLAEEVY-SGVIEPRAVLLKTLVLVCSKCDLLPEAERAFSELKERGFSP 628

Query: 653 NINKKELHLVVASIYERANRLNDASRIMSRMNEMAISRS 680
           +I        + SIY R   +  A+ ++  M E   + S
Sbjct: 629 DITTLN---SMVSIYGRRQMVAKANGVLDYMKERGFTPS 663

BLAST of MS004694.1 vs. TAIR 10
Match: AT1G62910.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 196.1 bits (497), Expect = 9.4e-50
Identity = 127/446 (28.48%), Positives = 230/446 (51.57%), Query Frame = 0

Query: 142 SLAILDWVNEEALYTPSVFAYNVVIRNVLRAKQWEIAHGLFDEMRQRALAADRYTYSTLI 201
           ++A++D + E   Y P  F +  +I  +    +   A  L D+M QR    D  TY T++
Sbjct: 172 AVALVDQMVEMG-YKPDTFTFTTLIHGLFLHNKASEAVALVDQMVQRGCQPDLVTYGTVV 231

Query: 202 TCFGKEGLFDAALYWLQQMEQDRVSGDLVLYSNLIELSRKLCDYSKAISIFSRLKRSGIT 261
               K G  D AL  L++ME+ ++  D+V+Y+ +I+   K      A+++F+ +   GI 
Sbjct: 232 NGLCKRGDIDLALSLLKKMEKGKIEADVVIYNTIIDGLCKYKHMDDALNLFTEMDNKGIR 291

Query: 262 PDIVAYNSMINVFGKAKLFREARFLLKEMRAVGVVPDTVSYSTLLSMFVENEKFLEALSV 321
           PD+  Y+S+I+       + +A  LL +M    + P+ V++S L+  FV+  K +EA  +
Sbjct: 292 PDVFTYSSLISCLCNYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLVEAEKL 351

Query: 322 FSEMMEVNCPLDLTTCNVMIDVYGQLDMVKEADRLFWSMRKMGIEPNVVSYNTILRVYGE 381
           + EM++ +   D+ T + +I+ +   D + EA  +F  M      PNVV+Y+T+++ + +
Sbjct: 352 YDEMIKRSIDPDIFTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYSTLIKGFCK 411

Query: 382 AELFGEAIHLFRLMQRKEIEQNVVTYNTMIKIYGKSLEHEKATNLVQEMQNRGIEPNAIT 441
           A+   E + LFR M ++ +  N VTY T+I  + ++ + + A  + ++M + G+ PN +T
Sbjct: 412 AKRVEEGMELFREMSQRGLVGNTVTYTTLIHGFFQARDCDNAQMVFKQMVSVGVHPNILT 471

Query: 442 YSTIISIWGKAGKLERAAMLFQKLRSSGAEIDQVLYQTMIVAYERAGLVAHAKRLLHELK 501
           Y+ ++    K GKL +A ++F+ L+ S  E D   Y  MI    +AG V     L   L 
Sbjct: 472 YNILLDGLCKNGKLAKAMVVFEYLQRSTMEPDIYTYNIMIEGMCKAGKVEDGWELFCNLS 531

Query: 502 ----QPDNIPRKTAITILARAGRIEEATWVFRQAFDAGELKDISIFGCMIDLFSRNKKHK 561
                P+ I   T I+   R G  EEA  + ++  + G L +   +  +I    R+   +
Sbjct: 532 LKGVSPNVIAYNTMISGFCRKGSKEEADSLLKKMKEDGPLPNSGTYNTLIRARLRDGDRE 591

Query: 562 NVVEVFEKMRSVGHFPNSNVIALVLN 584
              E+ ++MRS G   +++ I LV N
Sbjct: 592 ASAELIKEMRSCGFAGDASTIGLVTN 616

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_022157661.1 | 0.0e+00 | 99.26 | pentatricopeptide repeat-containing protein At5g39980, chloroplastic [Momordica ...
XP_008443439.1 | 0.0e+00 | 90.98 | PREDICTED: pentatricopeptide repeat-containing protein At5g39980, chloroplastic ...
XP_023521621.1 | 0.0e+00 | 90.43 | pentatricopeptide repeat-containing protein At5g39980, chloroplastic-like [Cucur...
XP_022934402.1 | 0.0e+00 | 90.28 | pentatricopeptide repeat-containing protein At5g39980, chloroplastic [Cucurbita ...
KAG6580983.1 | 0.0e+00 | 90.28 | Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a...

Match Name | E-value | Identity (%) | Description
Q9FLD8 | 2.9e-293 | 76.51 | Pentatricopeptide repeat-containing protein At5g39980, chloroplastic OS=Arabidop...
Q9S7Q2 | 4.4e-52 | 23.47 | Pentatricopeptide repeat-containing protein At1g74850, chloroplastic OS=Arabidop...
Q9LQ14 | 3.7e-51 | 30.05 | Pentatricopeptide repeat-containing protein At1g62930, chloroplastic OS=Arabidop...
Q9LYZ9 | 1.6e-49 | 25.63 | Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana OX...
Q9LQ16 | 1.3e-48 | 28.48 | Pentatricopeptide repeat-containing protein At1g62910 OS=Arabidopsis thaliana OX...

Match Name | E-value | Identity (%) | Description
A0A6J1DYU7 | 0.0e+00 | 99.26 | pentatricopeptide repeat-containing protein At5g39980, chloroplastic OS=Momordic...
A0A5A7UGV7 | 0.0e+00 | 90.98 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3B8T9 | 0.0e+00 | 90.98 | pentatricopeptide repeat-containing protein At5g39980, chloroplastic OS=Cucumis ...
A0A6J1F2M9 | 0.0e+00 | 90.28 | pentatricopeptide repeat-containing protein At5g39980, chloroplastic OS=Cucurbit...
A0A6J1J8J4 | 0.0e+00 | 90.13 | pentatricopeptide repeat-containing protein At5g39980, chloroplastic OS=Cucurbit...

Match Name | E-value | Identity (%) | Description
AT5G39980.1 | 2.0e-294 | 76.51 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G74850.1 | 3.1e-53 | 23.47 | plastid transcriptionally active 2
AT1G62930.1 | 2.7e-52 | 30.05 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G02860.1 | 1.1e-50 | 25.63 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G62910.1 | 9.4e-50 | 28.48 | Pentatricopeptide repeat (PPR) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Bitter gourd (TR) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 361..430; e-value: 4.0E-18; score: 67.6
    coord: 243..290; e-value: 2.5E-9; score: 38.9
    coord: 431..503; e-value: 3.0E-16; score: 61.5
    coord: 291..360; e-value: 1.3E-17; score: 66.0
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 504..678; e-value: 4.4E-19; score: 71.0
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 105..242; e-value: 4.4E-23; score: 83.6
IPR011990 | Tetratricopeptide-like helical domain superfamily | SUPERFAMILY | 48452 | TPR-like | coord: 384..667
IPR002885 | Pentatricopeptide repeat | PFAM | PF13812 | PPR_3 | coord: 320..381; e-value: 1.7E-9; score: 37.6
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 581..606; e-value: 0.0013; score: 18.9
    coord: 542..570; e-value: 0.0018; score: 18.4
    coord: 230..260; e-value: 0.88; score: 10.0
    coord: 475..501; e-value: 0.13; score: 12.6
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 157..204; e-value: 1.2E-11; score: 44.6
    coord: 403..446; e-value: 1.2E-14; score: 54.3
    coord: 262..309; e-value: 7.3E-11; score: 42.1
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 405..439; e-value: 3.3E-8; score: 31.2
    coord: 336..369; e-value: 6.8E-7; score: 27.1
    coord: 300..333; e-value: 2.1E-6; score: 25.5
    coord: 195..227; e-value: 5.8E-7; score: 27.3
    coord: 440..473; e-value: 1.1E-4; score: 20.1
    coord: 265..299; e-value: 3.7E-8; score: 31.0
    coord: 542..574; e-value: 1.4E-4; score: 19.8
    coord: 161..188; e-value: 0.0026; score: 15.8
    coord: 370..403; e-value: 3.7E-6; score: 24.8
    coord: 581..610; e-value: 9.4E-7; score: 26.6
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 574..608; score: 8.966401
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 539..573; score: 9.930995
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 228..262; score: 9.229472
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 333..367; score: 11.662881
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 298..332; score: 10.522905
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 438..472; score: 10.215989
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 403..437; score: 12.298636
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 158..192; score: 9.404853
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 263..297; score: 12.002681
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 473..507; score: 8.823904
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 193..227; score: 9.898111
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 368..402; score: 10.215989
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 56..93
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 79..93
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 64..78
None | No IPR available | PANTHER | PTHR47447:SF7 | PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN, CHLOROPLASTIC | coord: 8..675
None | No IPR available | PANTHER | PTHR47447 | OS03G0856100 PROTEIN | coord: 8..675
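
The PROSITE PS51375 coordinates listed above are easier to read once sorted by position. The following Python sketch, offered only as an illustration, sorts the twelve intervals, reports their lengths and finds any residues left uncovered between consecutive intervals. The coordinates are copied from the InterPro table; the function names `repeat_lengths` and `gaps` are hypothetical.

```python
# PROSITE PS51375 (PPR) coordinates copied from the InterPro table above.
ppr_coords = [
    (574, 608), (539, 573), (228, 262), (333, 367), (298, 332), (438, 472),
    (403, 437), (158, 192), (263, 297), (473, 507), (193, 227), (368, 402),
]


def repeat_lengths(coords):
    """Length of each interval, treating coordinates as 1-based and inclusive."""
    return [end - start + 1 for start, end in sorted(coords)]


def gaps(coords):
    """Return (first, last) residue of every stretch between consecutive intervals."""
    ordered = sorted(coords)
    uncovered = []
    for (_, prev_end), (next_start, _) in zip(ordered, ordered[1:]):
        if next_start > prev_end + 1:
            uncovered.append((prev_end + 1, next_start - 1))
    return uncovered


lengths = repeat_lengths(ppr_coords)
print(len(lengths), set(lengths))  # 12 {35}
print(gaps(ppr_coords))            # [(508, 538)]
```

Read this way, the PROSITE profile hits form ten back-to-back 35-residue units from 158 to 507, followed by two more from 539 to 608, with residues 508..538 not covered by any PS51375 hit.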

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name | Unique Name | Type
MS004694 | MS004694 | gene


The following CDS feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
MS004694.1-cds | MS004694.1-cds-scaffold741:94296..96332 | CDS


The following polypeptide feature(s) derive from this mRNA:

Feature Name | Unique Name | Type
MS004694.1 | MS004694.1-protein | polypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding