MS004611.1 (mRNA) Bitter gourd (TR) v1

Overview
NameMS004611.1
TypemRNA
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPentatricopeptide repeat-containing protein
Locationscaffold995: 678114 .. 679814 (-)
Sequence length1701
RNA-Seq ExpressionMS004611.1
SyntenyMS004611.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGACTCTGCTCAATACAGTGTCTCCAATAGCAAACCCGTCACCAGAAACTTCAAGAAGAGGGTGTGGGTTCTTCTCCCACATCCCAAATCTCCACAAACTTTCACTCAACAAGGGATTTTCCAAAGTTTTAGCATCTACCCACATCACCATTTCCCCAAAGGACACCATTTTCACACTGCCAAATTGGAGGACTGGGAAGGTTGATCAGAAGAGCAGAGACCTCAGGCTCAATGATGCATTTCTCCACTTAGAGTTCATGGTGAGGAAGGGCCAAAAGCCAGATGTGTTTCAAGCCACTCAGCTGTTGTATGATCTGTGCAAGGCAAGCAAGATGAGGAAAGCCATTAAGGTGATGGAGATGATGATTGGGTCTGGAATCATTCCTGATGCATCTTCCTATACTTTTTTGGTAAGTTGTTTGTGTAAAAGAGGGAATGTTGGGTATGCAATGCAATTAGTGGACAAAATGGAGGAATATGGTTATCCTACAAACACTGTTACTTATAATTCACTTGTGAGAGGGCTGTGTATGCATGGAAATTTGAGCCAGAGTTTGCAGCTTTTAGACAGGTTAATCCACAAGGGGCTGGTCCCAAATGCTTACACTTACTCTTTTTTGCTTGAAGCAGCTTACAAGGAAAGAGGAGCTGATGAAGCCATTAAGCTTTTGGATGAGATAATTGCCAAGGGTGGGAAGCCTAATTTGGTTAGCTACAATGTTTTGTTAACTGGTTTGTGTAAAGAAGGCAGGACAGAAGATGCCATTCAGTTGTTTAGAGAATTGCCTTCAAAGGGATTTAGTCCAAATGTTGTTAGTTGCAATATTTTGCTGAGGAGTCTGTGCTATGAAGGAAGGTGGGAGGAGGCAAATGAGCTTTTAGCTGAAATGGACGGTGACGATCGCGCCCCTTCGATCGTCACATACAATATATTGATTGGTTCGCTTACACTCCATGGCAGAACAGAACATGCGCTTGAGGTTTTGGAAGAGATGATTAGAGCGCGGTTCAAGCCCACAGCTTCTAGTTACAACCCGATAATTGCTCGCCTTTGCAAAGATCGGAAGGTAGATCTTGTTGTGAAGTGTTTGGACCAAATGATGTATCGACATTGCAATCCGAATGAAGGAACTTACAATGCCATTGCTACGCTTTGTGAAGAGGGAATGGTTCAAGAGGCATTCTCCATCATACAAAGTTTGGGCAATAGACAACAATCCTCTACTCAAGAATTTTATAAAATGGTTGTTACCAGCTTGTGTCGAAAAGGGAACACATATCCAGCATTCCAGCTTCTCTATGAAATGACGAAGTACGGGTTTACTCCCGACTCTTTCACGTATTCATCTTTGATCCGGGGATTATGCATGGAGGGAATGCTGGATGAGGCAATTGAAATATTCAGTGTAATGGAAGAGAATTGCTACAAGCCTGATACTGAAAATTACAATGCGCTTATTCTTGGTTGTTGTAAATCTCAAAGAACCGATTTGGCATTGGAGGTTTTTGAAGTAATGGTCGATAAAGGATATCTGCCTAACGAAACAACGTACACCATTCTCGTGGAAGGGATCATCCACGAGAAGGAGATAGATCTAGCTGCCAAAGTACTGAAGGAGTTGCAACTCAGAGATGTTATAAGCCAAAGCACAGTGGATAGACTTGTTATGCAGTATGACCTAAATGATTTACCATTG

mRNA sequence

ATGGCGACTCTGCTCAATACAGTGTCTCCAATAGCAAACCCGTCACCAGAAACTTCAAGAAGAGGGTGTGGGTTCTTCTCCCACATCCCAAATCTCCACAAACTTTCACTCAACAAGGGATTTTCCAAAGTTTTAGCATCTACCCACATCACCATTTCCCCAAAGGACACCATTTTCACACTGCCAAATTGGAGGACTGGGAAGGTTGATCAGAAGAGCAGAGACCTCAGGCTCAATGATGCATTTCTCCACTTAGAGTTCATGGTGAGGAAGGGCCAAAAGCCAGATGTGTTTCAAGCCACTCAGCTGTTGTATGATCTGTGCAAGGCAAGCAAGATGAGGAAAGCCATTAAGGTGATGGAGATGATGATTGGGTCTGGAATCATTCCTGATGCATCTTCCTATACTTTTTTGGTAAGTTGTTTGTGTAAAAGAGGGAATGTTGGGTATGCAATGCAATTAGTGGACAAAATGGAGGAATATGGTTATCCTACAAACACTGTTACTTATAATTCACTTGTGAGAGGGCTGTGTATGCATGGAAATTTGAGCCAGAGTTTGCAGCTTTTAGACAGGTTAATCCACAAGGGGCTGGTCCCAAATGCTTACACTTACTCTTTTTTGCTTGAAGCAGCTTACAAGGAAAGAGGAGCTGATGAAGCCATTAAGCTTTTGGATGAGATAATTGCCAAGGGTGGGAAGCCTAATTTGGTTAGCTACAATGTTTTGTTAACTGGTTTGTGTAAAGAAGGCAGGACAGAAGATGCCATTCAGTTGTTTAGAGAATTGCCTTCAAAGGGATTTAGTCCAAATGTTGTTAGTTGCAATATTTTGCTGAGGAGTCTGTGCTATGAAGGAAGGTGGGAGGAGGCAAATGAGCTTTTAGCTGAAATGGACGGTGACGATCGCGCCCCTTCGATCGTCACATACAATATATTGATTGGTTCGCTTACACTCCATGGCAGAACAGAACATGCGCTTGAGGTTTTGGAAGAGATGATTAGAGCGCGGTTCAAGCCCACAGCTTCTAGTTACAACCCGATAATTGCTCGCCTTTGCAAAGATCGGAAGGTAGATCTTGTTGTGAAGTGTTTGGACCAAATGATGTATCGACATTGCAATCCGAATGAAGGAACTTACAATGCCATTGCTACGCTTTGTGAAGAGGGAATGGTTCAAGAGGCATTCTCCATCATACAAAGTTTGGGCAATAGACAACAATCCTCTACTCAAGAATTTTATAAAATGGTTGTTACCAGCTTGTGTCGAAAAGGGAACACATATCCAGCATTCCAGCTTCTCTATGAAATGACGAAGTACGGGTTTACTCCCGACTCTTTCACGTATTCATCTTTGATCCGGGGATTATGCATGGAGGGAATGCTGGATGAGGCAATTGAAATATTCAGTGTAATGGAAGAGAATTGCTACAAGCCTGATACTGAAAATTACAATGCGCTTATTCTTGGTTGTTGTAAATCTCAAAGAACCGATTTGGCATTGGAGGTTTTTGAAGTAATGGTCGATAAAGGATATCTGCCTAACGAAACAACGTACACCATTCTCGTGGAAGGGATCATCCACGAGAAGGAGATAGATCTAGCTGCCAAAGTACTGAAGGAGTTGCAACTCAGAGATGTTATAAGCCAAAGCACAGTGGATAGACTTGTTATGCAGTATGACCTAAATGATTTACCATTG

Coding sequence (CDS)

ATGGCGACTCTGCTCAATACAGTGTCTCCAATAGCAAACCCGTCACCAGAAACTTCAAGAAGAGGGTGTGGGTTCTTCTCCCACATCCCAAATCTCCACAAACTTTCACTCAACAAGGGATTTTCCAAAGTTTTAGCATCTACCCACATCACCATTTCCCCAAAGGACACCATTTTCACACTGCCAAATTGGAGGACTGGGAAGGTTGATCAGAAGAGCAGAGACCTCAGGCTCAATGATGCATTTCTCCACTTAGAGTTCATGGTGAGGAAGGGCCAAAAGCCAGATGTGTTTCAAGCCACTCAGCTGTTGTATGATCTGTGCAAGGCAAGCAAGATGAGGAAAGCCATTAAGGTGATGGAGATGATGATTGGGTCTGGAATCATTCCTGATGCATCTTCCTATACTTTTTTGGTAAGTTGTTTGTGTAAAAGAGGGAATGTTGGGTATGCAATGCAATTAGTGGACAAAATGGAGGAATATGGTTATCCTACAAACACTGTTACTTATAATTCACTTGTGAGAGGGCTGTGTATGCATGGAAATTTGAGCCAGAGTTTGCAGCTTTTAGACAGGTTAATCCACAAGGGGCTGGTCCCAAATGCTTACACTTACTCTTTTTTGCTTGAAGCAGCTTACAAGGAAAGAGGAGCTGATGAAGCCATTAAGCTTTTGGATGAGATAATTGCCAAGGGTGGGAAGCCTAATTTGGTTAGCTACAATGTTTTGTTAACTGGTTTGTGTAAAGAAGGCAGGACAGAAGATGCCATTCAGTTGTTTAGAGAATTGCCTTCAAAGGGATTTAGTCCAAATGTTGTTAGTTGCAATATTTTGCTGAGGAGTCTGTGCTATGAAGGAAGGTGGGAGGAGGCAAATGAGCTTTTAGCTGAAATGGACGGTGACGATCGCGCCCCTTCGATCGTCACATACAATATATTGATTGGTTCGCTTACACTCCATGGCAGAACAGAACATGCGCTTGAGGTTTTGGAAGAGATGATTAGAGCGCGGTTCAAGCCCACAGCTTCTAGTTACAACCCGATAATTGCTCGCCTTTGCAAAGATCGGAAGGTAGATCTTGTTGTGAAGTGTTTGGACCAAATGATGTATCGACATTGCAATCCGAATGAAGGAACTTACAATGCCATTGCTACGCTTTGTGAAGAGGGAATGGTTCAAGAGGCATTCTCCATCATACAAAGTTTGGGCAATAGACAACAATCCTCTACTCAAGAATTTTATAAAATGGTTGTTACCAGCTTGTGTCGAAAAGGGAACACATATCCAGCATTCCAGCTTCTCTATGAAATGACGAAGTACGGGTTTACTCCCGACTCTTTCACGTATTCATCTTTGATCCGGGGATTATGCATGGAGGGAATGCTGGATGAGGCAATTGAAATATTCAGTGTAATGGAAGAGAATTGCTACAAGCCTGATACTGAAAATTACAATGCGCTTATTCTTGGTTGTTGTAAATCTCAAAGAACCGATTTGGCATTGGAGGTTTTTGAAGTAATGGTCGATAAAGGATATCTGCCTAACGAAACAACGTACACCATTCTCGTGGAAGGGATCATCCACGAGAAGGAGATAGATCTAGCTGCCAAAGTACTGAAGGAGTTGCAACTCAGAGATGTTATAAGCCAAAGCACAGTGGATAGACTTGTTATGCAGTATGACCTAAATGATTTACCATTG

Protein sequence

MATLLNTVSPIANPSPETSRRGCGFFSHIPNLHKLSLNKGFSKVLASTHITISPKDTIFTLPNWRTGKVDQKSRDLRLNDAFLHLEFMVRKGQKPDVFQATQLLYDLCKASKMRKAIKVMEMMIGSGIIPDASSYTFLVSCLCKRGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMHGNLSQSLQLLDRLIHKGLVPNAYTYSFLLEAAYKERGADEAIKLLDEIIAKGGKPNLVSYNVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSCNILLRSLCYEGRWEEANELLAEMDGDDRAPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDLVVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNRQQSSTQEFYKMVVTSLCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENCYKPDTENYNALILGCCKSQRTDLALEVFEVMVDKGYLPNETTYTILVEGIIHEKEIDLAAKVLKELQLRDVISQSTVDRLVMQYDLNDLPL
Homology
BLAST of MS004611.1 vs. NCBI nr
Match: XP_022141778.1 (pentatricopeptide repeat-containing protein At1g79080, chloroplastic [Momordica charantia])

HSP 1 Score: 1144.4 bits (2959), Expect = 0.0e+00
Identity = 567/567 (100.00%), Postives = 567/567 (100.00%), Query Frame = 0

Query: 1   MATLLNTVSPIANPSPETSRRGCGFFSHIPNLHKLSLNKGFSKVLASTHITISPKDTIFT 60
           MATLLNTVSPIANPSPETSRRGCGFFSHIPNLHKLSLNKGFSKVLASTHITISPKDTIFT
Sbjct: 1   MATLLNTVSPIANPSPETSRRGCGFFSHIPNLHKLSLNKGFSKVLASTHITISPKDTIFT 60

Query: 61  LPNWRTGKVDQKSRDLRLNDAFLHLEFMVRKGQKPDVFQATQLLYDLCKASKMRKAIKVM 120
           LPNWRTGKVDQKSRDLRLNDAFLHLEFMVRKGQKPDVFQATQLLYDLCKASKMRKAIKVM
Sbjct: 61  LPNWRTGKVDQKSRDLRLNDAFLHLEFMVRKGQKPDVFQATQLLYDLCKASKMRKAIKVM 120

Query: 121 EMMIGSGIIPDASSYTFLVSCLCKRGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180
           EMMIGSGIIPDASSYTFLVSCLCKRGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH
Sbjct: 121 EMMIGSGIIPDASSYTFLVSCLCKRGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180

Query: 181 GNLSQSLQLLDRLIHKGLVPNAYTYSFLLEAAYKERGADEAIKLLDEIIAKGGKPNLVSY 240
           GNLSQSLQLLDRLIHKGLVPNAYTYSFLLEAAYKERGADEAIKLLDEIIAKGGKPNLVSY
Sbjct: 181 GNLSQSLQLLDRLIHKGLVPNAYTYSFLLEAAYKERGADEAIKLLDEIIAKGGKPNLVSY 240

Query: 241 NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSCNILLRSLCYEGRWEEANELLAEMDG 300
           NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSCNILLRSLCYEGRWEEANELLAEMDG
Sbjct: 241 NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSCNILLRSLCYEGRWEEANELLAEMDG 300

Query: 301 DDRAPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360
           DDRAPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL
Sbjct: 301 DDRAPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360

Query: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNRQQSSTQEFYKMVVTS 420
           VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNRQQSSTQEFYKMVVTS
Sbjct: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNRQQSSTQEFYKMVVTS 420

Query: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENCYKPD 480
           LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENCYKPD
Sbjct: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENCYKPD 480

Query: 481 TENYNALILGCCKSQRTDLALEVFEVMVDKGYLPNETTYTILVEGIIHEKEIDLAAKVLK 540
           TENYNALILGCCKSQRTDLALEVFEVMVDKGYLPNETTYTILVEGIIHEKEIDLAAKVLK
Sbjct: 481 TENYNALILGCCKSQRTDLALEVFEVMVDKGYLPNETTYTILVEGIIHEKEIDLAAKVLK 540

Query: 541 ELQLRDVISQSTVDRLVMQYDLNDLPL 568
           ELQLRDVISQSTVDRLVMQYDLNDLPL
Sbjct: 541 ELQLRDVISQSTVDRLVMQYDLNDLPL 567

BLAST of MS004611.1 vs. NCBI nr
Match: XP_038899825.1 (pentatricopeptide repeat-containing protein At1g79080, chloroplastic [Benincasa hispida])

HSP 1 Score: 1080.1 bits (2792), Expect = 0.0e+00
Identity = 533/567 (94.00%), Postives = 551/567 (97.18%), Query Frame = 0

Query: 1   MATLLNTVSPIANPSPETSRRGCGFFSHIPNLHKLSLNKGFSKVLASTHITISPKDTIFT 60
           MATLLNTVSPI NP PET+RRGCGFFSHIPNLHKLSL+KGFSKVLAST ITISPKDTIFT
Sbjct: 1   MATLLNTVSPITNPLPETTRRGCGFFSHIPNLHKLSLSKGFSKVLASTQITISPKDTIFT 60

Query: 61  LPNWRTGKVDQKSRDLRLNDAFLHLEFMVRKGQKPDVFQATQLLYDLCKASKMRKAIKVM 120
           LPNWRT K DQKSR+LRLNDAF HLEF+V KGQKPDVFQATQLLYDLCKA KMRKAIKVM
Sbjct: 61  LPNWRTAKGDQKSRELRLNDAFFHLEFLVEKGQKPDVFQATQLLYDLCKACKMRKAIKVM 120

Query: 121 EMMIGSGIIPDASSYTFLVSCLCKRGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180
           EMMIGSGIIPDASSYTFLVS LCK+GNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH
Sbjct: 121 EMMIGSGIIPDASSYTFLVSSLCKKGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180

Query: 181 GNLSQSLQLLDRLIHKGLVPNAYTYSFLLEAAYKERGADEAIKLLDEIIAKGGKPNLVSY 240
           GNL+QSLQLLDRLI KGLVPNAYTYSFLLEAAYKERGADEAIKLLDEIIAKGGKPNLVSY
Sbjct: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEAIKLLDEIIAKGGKPNLVSY 240

Query: 241 NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSCNILLRSLCYEGRWEEANELLAEMDG 300
           NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVS NILLRSLCYEGRWEEAN LLAEMDG
Sbjct: 241 NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSYNILLRSLCYEGRWEEANVLLAEMDG 300

Query: 301 DDRAPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360
           D+R+PSIVTYNILIGSLTL GRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL
Sbjct: 301 DERSPSIVTYNILIGSLTLSGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360

Query: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNRQQSSTQEFYKMVVTS 420
           VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGN+Q SSTQEFYK+V+TS
Sbjct: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITS 420

Query: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENCYKPD 480
           LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEEN YKPD
Sbjct: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENNYKPD 480

Query: 481 TENYNALILGCCKSQRTDLALEVFEVMVDKGYLPNETTYTILVEGIIHEKEIDLAAKVLK 540
           TENYN+LILGCCKS+RTDLALEVFE+MVDKGYLPNETTYTILVEGI+HEKEIDLA KVL+
Sbjct: 481 TENYNSLILGCCKSRRTDLALEVFEIMVDKGYLPNETTYTILVEGIVHEKEIDLATKVLR 540

Query: 541 ELQLRDVISQSTVDRLVMQYDLNDLPL 568
           ELQLRDVISQSTV+RLVMQYDLN+LPL
Sbjct: 541 ELQLRDVISQSTVERLVMQYDLNELPL 567

BLAST of MS004611.1 vs. NCBI nr
Match: XP_008444287.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g79080, chloroplastic [Cucumis melo] >ADN33755.1 pentatricopeptide repeat-containing protein [Cucumis melo subsp. melo] >KAA0052323.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK01883.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1045.4 bits (2702), Expect = 1.8e-301
Identity = 517/567 (91.18%), Postives = 544/567 (95.94%), Query Frame = 0

Query: 1   MATLLNTVSPIANPSPETSRRGCGFFSHIPNLHKLSLNKGFSKVLASTHITISPKDTIFT 60
           MATLLNTVSPI N SPET+RRGCGFFSHIPNL KLSLNKGFSKVLAST ITISPKDTIFT
Sbjct: 1   MATLLNTVSPITNTSPETTRRGCGFFSHIPNLQKLSLNKGFSKVLASTQITISPKDTIFT 60

Query: 61  LPNWRTGKVDQKSRDLRLNDAFLHLEFMVRKGQKPDVFQATQLLYDLCKASKMRKAIKVM 120
           LPNW+TGKV+QKS++LRL DAF HLEFMV KGQKPDVFQATQLLYDLCKA KMRKAIKVM
Sbjct: 61  LPNWKTGKVEQKSKELRLTDAFFHLEFMVEKGQKPDVFQATQLLYDLCKACKMRKAIKVM 120

Query: 121 EMMIGSGIIPDASSYTFLVSCLCKRGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180
           EMMIGSGIIPDASSYTFLVS LC++GNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH
Sbjct: 121 EMMIGSGIIPDASSYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180

Query: 181 GNLSQSLQLLDRLIHKGLVPNAYTYSFLLEAAYKERGADEAIKLLDEIIAKGGKPNLVSY 240
           GNL+QSLQLLDRLI KGLVPNAYTYSFLLEAAYKERGADEA KLLDEIIAKGG+PNLVSY
Sbjct: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGGEPNLVSY 240

Query: 241 NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSCNILLRSLCYEGRWEEANELLAEMDG 300
           NVLLTGLCKEGRTEDAI+LFRELPSKGFSPNVVS NILLRSLC EGRWEEAN LLAEM+G
Sbjct: 241 NVLLTGLCKEGRTEDAIRLFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMNG 300

Query: 301 DDRAPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360
           D+R+PS VTYNILIGSL LHGRTEHALEVLEEMIRARFKPTASSYNPIIA LCKD K+DL
Sbjct: 301 DERSPSTVTYNILIGSLALHGRTEHALEVLEEMIRARFKPTASSYNPIIAHLCKDGKLDL 360

Query: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNRQQSSTQEFYKMVVTS 420
           VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGN+Q SSTQEFYK+V+TS
Sbjct: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITS 420

Query: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENCYKPD 480
           LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGML+EAIEIFSVMEEN  KPD
Sbjct: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEEN-NKPD 480

Query: 481 TENYNALILGCCKSQRTDLALEVFEVMVDKGYLPNETTYTILVEGIIHEKEIDLAAKVLK 540
           TENYN+LILGCCKS+RTDLAL+VFE+MV KGYLPNETTYTILVEGIIHEKE+DLA KVL+
Sbjct: 481 TENYNSLILGCCKSRRTDLALDVFEIMVGKGYLPNETTYTILVEGIIHEKEMDLATKVLR 540

Query: 541 ELQLRDVISQSTVDRLVMQYDLNDLPL 568
           ELQLRDVISQST++RLVMQYDLN+LPL
Sbjct: 541 ELQLRDVISQSTLERLVMQYDLNELPL 566

BLAST of MS004611.1 vs. NCBI nr
Match: XP_011653982.1 (pentatricopeptide repeat-containing protein At1g79080, chloroplastic [Cucumis sativus] >KGN54942.1 hypothetical protein Csa_012601 [Cucumis sativus])

HSP 1 Score: 1045.0 bits (2701), Expect = 2.3e-301
Identity = 516/567 (91.01%), Postives = 543/567 (95.77%), Query Frame = 0

Query: 1   MATLLNTVSPIANPSPETSRRGCGFFSHIPNLHKLSLNKGFSKVLASTHITISPKDTIFT 60
           MATLLNTVSPI NPSPET+RRGCGFFSHIPN+ KLSLNKGFSKVLAST ITISPKDTIFT
Sbjct: 1   MATLLNTVSPITNPSPETTRRGCGFFSHIPNIQKLSLNKGFSKVLASTQITISPKDTIFT 60

Query: 61  LPNWRTGKVDQKSRDLRLNDAFLHLEFMVRKGQKPDVFQATQLLYDLCKASKMRKAIKVM 120
           LPNW+ GK+DQKS++LRLNDAF HLEFMV KGQKPDVFQATQLLYDLCK  KMRKAIKVM
Sbjct: 61  LPNWKIGKLDQKSKELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKTCKMRKAIKVM 120

Query: 121 EMMIGSGIIPDASSYTFLVSCLCKRGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180
           EMMIGSGIIPDA+SYTFLVS LC++GNVGYAMQLVDKMEEYGYPTNT TYNSLVRGLCMH
Sbjct: 121 EMMIGSGIIPDAASYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTATYNSLVRGLCMH 180

Query: 181 GNLSQSLQLLDRLIHKGLVPNAYTYSFLLEAAYKERGADEAIKLLDEIIAKGGKPNLVSY 240
           GNL+QSLQLLDRLI KGLVPNAYTYSFLLEAAYKERGADEA KLLDEIIAKGGKPNLVSY
Sbjct: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGGKPNLVSY 240

Query: 241 NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSCNILLRSLCYEGRWEEANELLAEMDG 300
           NVLLTGLCKEGRTEDA+QLFRELPSKGFSPNVVS NILLRSLC EGRWEEAN LLAEMDG
Sbjct: 241 NVLLTGLCKEGRTEDAMQLFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMDG 300

Query: 301 DDRAPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360
           D+R+PS VTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL
Sbjct: 301 DERSPSTVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360

Query: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNRQQSSTQEFYKMVVTS 420
           VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGN+Q  STQEFYK+V+TS
Sbjct: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHFSTQEFYKIVITS 420

Query: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENCYKPD 480
           LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGML+EAIEIFSVMEEN  K D
Sbjct: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEEN-IKLD 480

Query: 481 TENYNALILGCCKSQRTDLALEVFEVMVDKGYLPNETTYTILVEGIIHEKEIDLAAKVLK 540
           TENYN+LILGCCKS+RTDLAL+VFE+MV KGYL NETTYTILVEGIIHEKE+DLA +VL+
Sbjct: 481 TENYNSLILGCCKSRRTDLALDVFEIMVGKGYLANETTYTILVEGIIHEKEMDLATEVLR 540

Query: 541 ELQLRDVISQSTVDRLVMQYDLNDLPL 568
           ELQLRDVI+QSTV+RLVMQYDLN+LPL
Sbjct: 541 ELQLRDVINQSTVERLVMQYDLNELPL 566

BLAST of MS004611.1 vs. NCBI nr
Match: KAG6607968.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] >KAG7037481.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1041.6 bits (2692), Expect = 2.5e-300
Identity = 511/567 (90.12%), Postives = 540/567 (95.24%), Query Frame = 0

Query: 1   MATLLNTVSPIANPSPETSRRGCGFFSHIPNLHKLSLNKGFSKVLASTHITISPKDTIFT 60
           MATLLNTVSPI NPSPET+RRG GFFSHIPNLHKLSL+KGFSKVLAST +TISPKDTIFT
Sbjct: 1   MATLLNTVSPITNPSPETTRRGYGFFSHIPNLHKLSLSKGFSKVLASTQVTISPKDTIFT 60

Query: 61  LPNWRTGKVDQKSRDLRLNDAFLHLEFMVRKGQKPDVFQATQLLYDLCKASKMRKAIKVM 120
           LPNWR GK DQK+R+ RLNDAFL+LE++V KGQKPDVFQATQLLYDLCKASK+R+AIKVM
Sbjct: 61  LPNWRIGKGDQKNREHRLNDAFLNLEYLVGKGQKPDVFQATQLLYDLCKASKLRRAIKVM 120

Query: 121 EMMIGSGIIPDASSYTFLVSCLCKRGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180
           EMMIGSGIIPDASSYTFLVS LCKRGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH
Sbjct: 121 EMMIGSGIIPDASSYTFLVSSLCKRGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180

Query: 181 GNLSQSLQLLDRLIHKGLVPNAYTYSFLLEAAYKERGADEAIKLLDEIIAKGGKPNLVSY 240
           GNL+ SLQLLDRLI KGLVPNAYTYSFLLEAAYKERGADEA+KLLDEIIAKGGKPNLVSY
Sbjct: 181 GNLTHSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEAVKLLDEIIAKGGKPNLVSY 240

Query: 241 NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSCNILLRSLCYEGRWEEANELLAEMDG 300
           NVLLTGLCKEGRT+DAIQLFRELPSKGF+PNVVS NILLRSLCYEGRWEEAN LLAEMDG
Sbjct: 241 NVLLTGLCKEGRTDDAIQLFRELPSKGFNPNVVSYNILLRSLCYEGRWEEANVLLAEMDG 300

Query: 301 DDRAPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360
           DDRAPS VTYN LIGSL  HGRTEHALEVLEEMIRARFKPTASSYNPIIARLC+D+KVDL
Sbjct: 301 DDRAPSAVTYNTLIGSLAFHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCRDKKVDL 360

Query: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNRQQSSTQEFYKMVVTS 420
           VVKCLDQM+YRHCNPNEGTYNAIATLCE GMVQEAFSI+QSLGN+Q  STQEFYK V+TS
Sbjct: 361 VVKCLDQMIYRHCNPNEGTYNAIATLCEVGMVQEAFSILQSLGNKQHYSTQEFYKSVITS 420

Query: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENCYKPD 480
           LCRKGNTY AFQLLYEMTKYGFTPDSF YSSLIRGLCMEGMLDEAIEIFSVMEEN YKPD
Sbjct: 421 LCRKGNTYSAFQLLYEMTKYGFTPDSFAYSSLIRGLCMEGMLDEAIEIFSVMEENNYKPD 480

Query: 481 TENYNALILGCCKSQRTDLALEVFEVMVDKGYLPNETTYTILVEGIIHEKEIDLAAKVLK 540
           TENYN+LI GCCKS+RTDLALEVFE MV+KGYLPNETTYT LVEGI+HEK+IDLA KVL+
Sbjct: 481 TENYNSLIFGCCKSRRTDLALEVFETMVNKGYLPNETTYTTLVEGIVHEKQIDLATKVLR 540

Query: 541 ELQLRDVISQSTVDRLVMQYDLNDLPL 568
           ELQL+DVISQSTV+RLVMQYDLN+LPL
Sbjct: 541 ELQLKDVISQSTVERLVMQYDLNELPL 567

BLAST of MS004611.1 vs. ExPASy Swiss-Prot
Match: A3KPF8 (Pentatricopeptide repeat-containing protein At1g79080, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At1g79080 PE=2 SV=1)

HSP 1 Score: 760.8 bits (1963), Expect = 1.1e-218
Identity = 382/578 (66.09%), Postives = 474/578 (82.01%), Query Frame = 0

Query: 1   MATLLNTVSPIANPSPETSRRGCGFFSHIPN--LHKLSLNKGFSKVLASTHITISPKDTI 60
           M+TLLN+V  +A+P   + R+  GF SHIP+  LH  S++KG ++VLAST IT+SPKD+ 
Sbjct: 1   MSTLLNSVLSMASPE-SSPRKAVGFVSHIPSGFLHFSSVSKGVARVLASTQITLSPKDSA 60

Query: 61  FTL------PNWRTGKV--DQKSRDLRLNDAFLHLEFMVRKGQKPDVFQATQLLYDLCKA 120
           FT+      P+  +G    D +S +  L+D+F HLE +V  G KP+V  +TQLLYDLCKA
Sbjct: 61  FTITGSSWKPDLDSGSFSDDPRSDEPNLSDSFSHLESLVTGGHKPNVAHSTQLLYDLCKA 120

Query: 121 SKMRKAIKVMEMMIGSGIIPDASSYTFLVSCLCKRGNVGYAMQLVDKMEEYGYPTNTVTY 180
           ++++KAI+V+E+M+ SGIIPDAS+YT+LV+ LCKRGNVGYAMQLV+KME++GYP+NTVTY
Sbjct: 121 NRLKKAIRVIELMVSSGIIPDASAYTYLVNQLCKRGNVGYAMQLVEKMEDHGYPSNTVTY 180

Query: 181 NSLVRGLCMHGNLSQSLQLLDRLIHKGLVPNAYTYSFLLEAAYKERGADEAIKLLDEIIA 240
           N+LVRGLCM G+L+QSLQ ++RL+ KGL PNA+TYSFLLEAAYKERG DEA+KLLDEII 
Sbjct: 181 NALVRGLCMLGSLNQSLQFVERLMQKGLAPNAFTYSFLLEAAYKERGTDEAVKLLDEIIV 240

Query: 241 KGGKPNLVSYNVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSCNILLRSLCYEGRWEE 300
           KGG+PNLVSYNVLLTG CKEGRT+DA+ LFRELP+KGF  NVVS NILLR LC +GRWEE
Sbjct: 241 KGGEPNLVSYNVLLTGFCKEGRTDDAMALFRELPAKGFKANVVSYNILLRCLCCDGRWEE 300

Query: 301 ANELLAEMDGDDRAPSIVTYNILIGSLTLHGRTEHALEVLEEMIRA--RFKPTASSYNPI 360
           AN LLAEMDG DRAPS+VTYNILI SL  HGRTE AL+VL+EM +   +F+ TA+SYNP+
Sbjct: 301 ANSLLAEMDGGDRAPSVVTYNILINSLAFHGRTEQALQVLKEMSKGNHQFRVTATSYNPV 360

Query: 361 IARLCKDRKVDLVVKCLDQMMYRHCNPNEGTYNAIATLCE-EGMVQEAFSIIQSLGNRQQ 420
           IARLCK+ KVDLVVKCLD+M+YR C PNEGTYNAI +LCE    VQEAF IIQSL N+Q+
Sbjct: 361 IARLCKEGKVDLVVKCLDEMIYRRCKPNEGTYNAIGSLCEHNSKVQEAFYIIQSLSNKQK 420

Query: 421 SSTQEFYKMVVTSLCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIE 480
             T +FYK V+TSLCRKGNT+ AFQLLYEMT+ GF PD+ TYS+LIRGLC+EGM   A+E
Sbjct: 421 CCTHDFYKSVITSLCRKGNTFAAFQLLYEMTRCGFDPDAHTYSALIRGLCLEGMFTGAME 480

Query: 481 IFSVME--ENCYKPDTENYNALILGCCKSQRTDLALEVFEVMVDKGYLPNETTYTILVEG 540
           + S+ME  ENC KP  +N+NA+ILG CK +RTDLA+EVFE+MV+K  +PNETTY ILVEG
Sbjct: 481 VLSIMEESENC-KPTVDNFNAMILGLCKIRRTDLAMEVFEMMVEKKRMPNETTYAILVEG 540

Query: 541 IIHEKEIDLAAKVLKELQLRDVISQSTVDRLVMQYDLN 564
           I HE E++LA +VL EL+LR VI Q+ VDR+VMQ++L+
Sbjct: 541 IAHEDELELAKEVLDELRLRKVIGQNAVDRIVMQFNLD 576

BLAST of MS004611.1 vs. ExPASy Swiss-Prot
Match: Q3EDF8 (Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana OX=3702 GN=At1g09900 PE=2 SV=1)

HSP 1 Score: 384.8 bits (987), Expect = 1.7e-105
Identity = 196/523 (37.48%), Postives = 321/523 (61.38%), Query Frame = 0

Query: 36  SLNKGFSKVLASTHITISPKDTIFTLPNWRTGK-VDQKSRDLRLNDAFLHLEFMVRKGQK 95
           +L+ G+S    + H   S  ++ F L +  +   + Q  R   L + F  LE MV  G  
Sbjct: 77  TLSSGYSNSNGNGH--YSSVNSSFALEDVESNNHLRQMVRTGELEEGFKFLENMVYHGNV 136

Query: 96  PDVFQATQLLYDLCKASKMRKAIKVMEMMIGSGIIPDASSYTFLVSCLCKRGNVGYAMQL 155
           PD+   T L+   C+  K RKA K++E++ GSG +PD  +Y  ++S  CK G +  A+ +
Sbjct: 137 PDIIPCTTLIRGFCRLGKTRKAAKILEILEGSGAVPDVITYNVMISGYCKAGEINNALSV 196

Query: 156 VDKMEEYGYPTNTVTYNSLVRGLCMHGNLSQSLQLLDRLIHKGLVPNAYTYSFLLEAAYK 215
           +D+M       + VTYN+++R LC  G L Q++++LDR++ +   P+  TY+ L+EA  +
Sbjct: 197 LDRM---SVSPDVVTYNTILRSLCDSGKLKQAMEVLDRMLQRDCYPDVITYTILIEATCR 256

Query: 216 ERGADEAIKLLDEIIAKGGKPNLVSYNVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVS 275
           + G   A+KLLDE+  +G  P++V+YNVL+ G+CKEGR ++AI+   ++PS G  PNV++
Sbjct: 257 DSGVGHAMKLLDEMRDRGCTPDVVTYNVLVNGICKEGRLDEAIKFLNDMPSSGCQPNVIT 316

Query: 276 CNILLRSLCYEGRWEEANELLAEMDGDDRAPSIVTYNILIGSLTLHGRTEHALEVLEEMI 335
            NI+LRS+C  GRW +A +LLA+M     +PS+VT+NILI  L   G    A+++LE+M 
Sbjct: 317 HNIILRSMCSTGRWMDAEKLLADMLRKGFSPSVVTFNILINFLCRKGLLGRAIDILEKMP 376

Query: 336 RARFKPTASSYNPIIARLCKDRKVDLVVKCLDQMMYRHCNPNEGTYNAIAT-LCEEGMVQ 395
           +   +P + SYNP++   CK++K+D  ++ L++M+ R C P+  TYN + T LC++G V+
Sbjct: 377 QHGCQPNSLSYNPLLHGFCKEKKMDRAIEYLERMVSRGCYPDIVTYNTMLTALCKDGKVE 436

Query: 396 EAFSIIQSLGNRQQSSTQEFYKMVVTSLCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLI 455
           +A  I+  L ++  S     Y  V+  L + G T  A +LL EM      PD+ TYSSL+
Sbjct: 437 DAVEILNQLSSKGCSPVLITYNTVIDGLAKAGKTGKAIKLLDEMRAKDLKPDTITYSSLV 496

Query: 456 RGLCMEGMLDEAIEIFSVMEENCYKPDTENYNALILGCCKSQRTDLALEVFEVMVDKGYL 515
            GL  EG +DEAI+ F   E    +P+   +N+++LG CKS++TD A++    M+++G  
Sbjct: 497 GGLSREGKVDEAIKFFHEFERMGIRPNAVTFNSIMLGLCKSRQTDRAIDFLVFMINRGCK 556

Query: 516 PNETTYTILVEGIIHEKEIDLAAKVLKELQLRDVISQSTVDRL 557
           PNET+YTIL+EG+ +E     A ++L EL  + ++ +S+ +++
Sbjct: 557 PNETSYTILIEGLAYEGMAKEALELLNELCNKGLMKKSSAEQV 594

BLAST of MS004611.1 vs. ExPASy Swiss-Prot
Match: Q9SR00 (Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At3g04760 PE=2 SV=1)

HSP 1 Score: 338.6 bits (867), Expect = 1.4e-91
Identity = 178/479 (37.16%), Postives = 276/479 (57.62%), Query Frame = 0

Query: 85  LEFMVRKGQKPDVFQATQLLYDLCKASKMRKAIKVMEMMIGSGIIPDASSYTFLVSCLCK 144
           LE MVRKG  PDV   T+L+        + KA++VME++   G  PD  +Y  L++  CK
Sbjct: 112 LETMVRKGYNPDVILCTKLIKGFFTLRNIPKAVRVMEILEKFG-QPDVFAYNALINGFCK 171

Query: 145 RGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMHGNLSQSLQLLDRLIHKGLVPNAYT 204
              +  A +++D+M    +  +TVTYN ++  LC  G L  +L++L++L+     P   T
Sbjct: 172 MNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALKVLNQLLSDNCQPTVIT 231

Query: 205 YSFLLEAAYKERGADEAIKLLDEIIAKGGKPNLVSYNVLLTGLCKEGRTEDAIQLFRELP 264
           Y+ L+EA   E G DEA+KL+DE++++G KP++ +YN ++ G+CKEG  + A ++ R L 
Sbjct: 232 YTILIEATMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGMCKEGMVDRAFEMVRNLE 291

Query: 265 SKGFSPNVVSCNILLRSLCYEGRWEEANELLAEMDGDDRAPSIVTYNILIGSLTLHGRTE 324
            KG  P+V+S NILLR+L  +G+WEE  +L+ +M  +   P++VTY+ILI +L   G+ E
Sbjct: 292 LKGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNVVTYSILITTLCRDGKIE 351

Query: 325 HALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDLVVKCLDQMMYRHCNPNEGTYNAI- 384
            A+ +L+ M      P A SY+P+IA  C++ ++D+ ++ L+ M+   C P+   YN + 
Sbjct: 352 EAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLETMISDGCLPDIVNYNTVL 411

Query: 385 ATLCEEGMVQEAFSIIQSLGNRQQSSTQEFYKMVVTSLCRKGNTYPAFQLLYEMTKYGFT 444
           ATLC+ G   +A  I   LG    S     Y  + ++L   G+   A  ++ EM   G  
Sbjct: 412 ATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGDKIRALHMILEMMSNGID 471

Query: 445 PDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENCYKPDTENYNALILGCCKSQRTDLALEV 504
           PD  TY+S+I  LC EGM+DEA E+   M    + P    YN ++LG CK+ R + A+ V
Sbjct: 472 PDEITYNSMISCLCREGMVDEAFELLVDMRSCEFHPSVVTYNIVLLGFCKAHRIEDAINV 531

Query: 505 FEVMVDKGYLPNETTYTILVEGIIHEKEIDLAAKVLKELQLRDVISQSTVDRLVMQYDL 563
            E MV  G  PNETTYT+L+EGI        A ++  +L   D IS+ +  RL   + L
Sbjct: 532 LESMVGNGCRPNETTYTVLIEGIGFAGYRAEAMELANDLVRIDAISEYSFKRLHRTFPL 589

BLAST of MS004611.1 vs. ExPASy Swiss-Prot
Match: Q9SXD1 (Pentatricopeptide repeat-containing protein At1g62670, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g62670 PE=3 SV=2)

HSP 1 Score: 260.4 bits (664), Expect = 4.8e-68
Identity = 154/571 (26.97%), Postives = 278/571 (48.69%), Query Frame = 0

Query: 42  SKVLASTHITISPKDTIFTLPNWR---TGKVDQKSR---------DLRLNDAFLHLEFMV 101
           S V+     T+SP  + F    WR   +GK     R         +L+L+DA      MV
Sbjct: 18  SLVVRGNAATVSPSFSFF----WRRAFSGKTSYDYREKLSRNGLSELKLDDAVALFGEMV 77

Query: 102 RKGQKPDVFQATQLLYDLCKASKMRKAIKVMEMMIGSGIIPDASSYTFLVSCLCKRGNVG 161
           +    P + + ++LL  + K +K    I + E M   GI  +  +Y+ L++C C+R  + 
Sbjct: 78  KSRPFPSIIEFSKLLSAIAKMNKFDVVISLGEQMQNLGIPHNHYTYSILINCFCRRSQLP 137

Query: 162 YAMQLVDKMEEYGYPTNTVTYNSLVRGLCMHGNLSQSLQLLDRLIHKGLVPNAYTYSFLL 221
            A+ ++ KM + GY  N VT +SL+ G C    +S+++ L+D++   G  PN  T++ L+
Sbjct: 138 LALAVLGKMMKLGYEPNIVTLSSLLNGYCHSKRISEAVALVDQMFVTGYQPNTVTFNTLI 197

Query: 222 EAAYKERGADEAIKLLDEIIAKGGKPNLVSYNVLLTGLCKEGRT---------------- 281
              +    A EA+ L+D ++AKG +P+LV+Y V++ GLCK G T                
Sbjct: 198 HGLFLHNKASEAMALIDRMVAKGCQPDLVTYGVVVNGLCKRGDTDLAFNLLNKMEQGKLE 257

Query: 282 -------------------EDAIQLFRELPSKGFSPNVVSCNILLRSLCYEGRWEEANEL 341
                              +DA+ LF+E+ +KG  PNVV+ + L+  LC  GRW +A+ L
Sbjct: 258 PGVLIYNTIIDGLCKYKHMDDALNLFKEMETKGIRPNVVTYSSLISCLCNYGRWSDASRL 317

Query: 342 LAEMDGDDRAPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCK 401
           L++M      P + T++ LI +    G+   A ++ +EM++    P+  +Y+ +I   C 
Sbjct: 318 LSDMIERKINPDVFTFSALIDAFVKEGKLVEAEKLYDEMVKRSIDPSIVTYSSLINGFCM 377

Query: 402 DRKVDLVVKCLDQMMYRHCNPNEGTYNA-IATLCEEGMVQEAFSIIQSLGNRQQSSTQEF 461
             ++D   +  + M+ +HC P+  TYN  I   C+   V+E   + + +  R        
Sbjct: 378 HDRLDEAKQMFEFMVSKHCFPDVVTYNTLIKGFCKYKRVEEGMEVFREMSQRGLVGNTVT 437

Query: 462 YKMVVTSLCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVME 521
           Y +++  L + G+   A ++  EM   G  P+  TY++L+ GLC  G L++A+ +F  ++
Sbjct: 438 YNILIQGLFQAGDCDMAQEIFKEMVSDGVPPNIMTYNTLLDGLCKNGKLEKAMVVFEYLQ 497

Query: 522 ENCYKPDTENYNALILGCCKSQRTDLALEVFEVMVDKGYLPNETTYTILVEGIIHEKEID 565
            +  +P    YN +I G CK+ + +   ++F  +  KG  P+   Y  ++ G   +   +
Sbjct: 498 RSKMEPTIYTYNIMIEGMCKAGKVEDGWDLFCNLSLKGVKPDVVAYNTMISGFCRKGSKE 557

BLAST of MS004611.1 vs. ExPASy Swiss-Prot
Match: Q9SXD8 (Pentatricopeptide repeat-containing protein At1g62590 OS=Arabidopsis thaliana OX=3702 GN=At1g62590 PE=2 SV=1)

HSP 1 Score: 260.0 bits (663), Expect = 6.3e-68
Identity = 146/526 (27.76%), Postives = 262/526 (49.81%), Query Frame = 0

Query: 75  DLRLNDAFLHLEFMVRKGQKPDVFQATQLLYDLCKASKMRKAIKVMEMMIGSGIIPDASS 134
           D++L+DA      MV+    P + +  +LL  + K  K    I + E M    I+    +
Sbjct: 63  DMKLDDAIGLFGGMVKSRPLPSIVEFNKLLSAIAKMKKFDVVISLGEKMQRLEIVHGLYT 122

Query: 135 YTFLVSCLCKRGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMHGNLSQSLQLLDRLI 194
           Y  L++C C+R  +  A+ L+ KM + GY  + VT +SL+ G C    +S ++ L+D+++
Sbjct: 123 YNILINCFCRRSQISLALALLGKMMKLGYEPSIVTLSSLLNGYCHGKRISDAVALVDQMV 182

Query: 195 HKGLVPNAYTYSFLLEAAYKERGADEAIKLLDEIIAKGGKPNLVSYNVLLTGLCKEGRT- 254
             G  P+  T++ L+   +    A EA+ L+D ++ +G +PNLV+Y V++ GLCK G T 
Sbjct: 183 EMGYRPDTITFTTLIHGLFLHNKASEAVALVDRMVQRGCQPNLVTYGVVVNGLCKRGDTD 242

Query: 255 ----------------------------------EDAIQLFRELPSKGFSPNVVSCNILL 314
                                             +DA+ LF+E+ +KG  PNVV+ + L+
Sbjct: 243 LALNLLNKMEAAKIEADVVIFNTIIDSLCKYRHVDDALNLFKEMETKGIRPNVVTYSSLI 302

Query: 315 RSLCYEGRWEEANELLAEMDGDDRAPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFK 374
             LC  GRW +A++LL++M      P++VT+N LI +    G+   A ++ ++MI+    
Sbjct: 303 SCLCSYGRWSDASQLLSDMIEKKINPNLVTFNALIDAFVKEGKFVEAEKLYDDMIKRSID 362

Query: 375 PTASSYNPIIARLCKDRKVDLVVKCLDQMMYRHCNPNEGTYNA-IATLCEEGMVQEAFSI 434
           P   +YN ++   C   ++D   +  + M+ + C P+  TYN  I   C+   V++   +
Sbjct: 363 PDIFTYNSLVNGFCMHDRLDKAKQMFEFMVSKDCFPDVVTYNTLIKGFCKSKRVEDGTEL 422

Query: 435 IQSLGNRQQSSTQEFYKMVVTSLCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCM 494
            + + +R        Y  ++  L   G+   A ++  +M   G  PD  TYS L+ GLC 
Sbjct: 423 FREMSHRGLVGDTVTYTTLIQGLFHDGDCDNAQKVFKQMVSDGVPPDIMTYSILLDGLCN 482

Query: 495 EGMLDEAIEIFSVMEENCYKPDTENYNALILGCCKSQRTDLALEVFEVMVDKGYLPNETT 554
            G L++A+E+F  M+++  K D   Y  +I G CK+ + D   ++F  +  KG  PN  T
Sbjct: 483 NGKLEKALEVFDYMQKSEIKLDIYIYTTMIEGMCKAGKVDDGWDLFCSLSLKGVKPNVVT 542

Query: 555 YTILVEGIIHEKEIDLAAKVLKELQLRDVISQSTVDRLVMQYDLND 565
           Y  ++ G+  ++ +  A  +LK+++    +  S     +++  L D
Sbjct: 543 YNTMISGLCSKRLLQEAYALLKKMKEDGPLPNSGTYNTLIRAHLRD 588

BLAST of MS004611.1 vs. ExPASy TrEMBL
Match: A0A6J1CLJ3 (pentatricopeptide repeat-containing protein At1g79080, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111012056 PE=4 SV=1)

HSP 1 Score: 1144.4 bits (2959), Expect = 0.0e+00
Identity = 567/567 (100.00%), Postives = 567/567 (100.00%), Query Frame = 0

Query: 1   MATLLNTVSPIANPSPETSRRGCGFFSHIPNLHKLSLNKGFSKVLASTHITISPKDTIFT 60
           MATLLNTVSPIANPSPETSRRGCGFFSHIPNLHKLSLNKGFSKVLASTHITISPKDTIFT
Sbjct: 1   MATLLNTVSPIANPSPETSRRGCGFFSHIPNLHKLSLNKGFSKVLASTHITISPKDTIFT 60

Query: 61  LPNWRTGKVDQKSRDLRLNDAFLHLEFMVRKGQKPDVFQATQLLYDLCKASKMRKAIKVM 120
           LPNWRTGKVDQKSRDLRLNDAFLHLEFMVRKGQKPDVFQATQLLYDLCKASKMRKAIKVM
Sbjct: 61  LPNWRTGKVDQKSRDLRLNDAFLHLEFMVRKGQKPDVFQATQLLYDLCKASKMRKAIKVM 120

Query: 121 EMMIGSGIIPDASSYTFLVSCLCKRGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180
           EMMIGSGIIPDASSYTFLVSCLCKRGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH
Sbjct: 121 EMMIGSGIIPDASSYTFLVSCLCKRGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180

Query: 181 GNLSQSLQLLDRLIHKGLVPNAYTYSFLLEAAYKERGADEAIKLLDEIIAKGGKPNLVSY 240
           GNLSQSLQLLDRLIHKGLVPNAYTYSFLLEAAYKERGADEAIKLLDEIIAKGGKPNLVSY
Sbjct: 181 GNLSQSLQLLDRLIHKGLVPNAYTYSFLLEAAYKERGADEAIKLLDEIIAKGGKPNLVSY 240

Query: 241 NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSCNILLRSLCYEGRWEEANELLAEMDG 300
           NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSCNILLRSLCYEGRWEEANELLAEMDG
Sbjct: 241 NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSCNILLRSLCYEGRWEEANELLAEMDG 300

Query: 301 DDRAPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360
           DDRAPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL
Sbjct: 301 DDRAPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360

Query: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNRQQSSTQEFYKMVVTS 420
           VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNRQQSSTQEFYKMVVTS
Sbjct: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNRQQSSTQEFYKMVVTS 420

Query: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENCYKPD 480
           LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENCYKPD
Sbjct: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENCYKPD 480

Query: 481 TENYNALILGCCKSQRTDLALEVFEVMVDKGYLPNETTYTILVEGIIHEKEIDLAAKVLK 540
           TENYNALILGCCKSQRTDLALEVFEVMVDKGYLPNETTYTILVEGIIHEKEIDLAAKVLK
Sbjct: 481 TENYNALILGCCKSQRTDLALEVFEVMVDKGYLPNETTYTILVEGIIHEKEIDLAAKVLK 540

Query: 541 ELQLRDVISQSTVDRLVMQYDLNDLPL 568
           ELQLRDVISQSTVDRLVMQYDLNDLPL
Sbjct: 541 ELQLRDVISQSTVDRLVMQYDLNDLPL 567

BLAST of MS004611.1 vs. ExPASy TrEMBL
Match: A0A5A7U8T1 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold113G001460 PE=4 SV=1)

HSP 1 Score: 1045.4 bits (2702), Expect = 8.5e-302
Identity = 517/567 (91.18%), Postives = 544/567 (95.94%), Query Frame = 0

Query: 1   MATLLNTVSPIANPSPETSRRGCGFFSHIPNLHKLSLNKGFSKVLASTHITISPKDTIFT 60
           MATLLNTVSPI N SPET+RRGCGFFSHIPNL KLSLNKGFSKVLAST ITISPKDTIFT
Sbjct: 1   MATLLNTVSPITNTSPETTRRGCGFFSHIPNLQKLSLNKGFSKVLASTQITISPKDTIFT 60

Query: 61  LPNWRTGKVDQKSRDLRLNDAFLHLEFMVRKGQKPDVFQATQLLYDLCKASKMRKAIKVM 120
           LPNW+TGKV+QKS++LRL DAF HLEFMV KGQKPDVFQATQLLYDLCKA KMRKAIKVM
Sbjct: 61  LPNWKTGKVEQKSKELRLTDAFFHLEFMVEKGQKPDVFQATQLLYDLCKACKMRKAIKVM 120

Query: 121 EMMIGSGIIPDASSYTFLVSCLCKRGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180
           EMMIGSGIIPDASSYTFLVS LC++GNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH
Sbjct: 121 EMMIGSGIIPDASSYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180

Query: 181 GNLSQSLQLLDRLIHKGLVPNAYTYSFLLEAAYKERGADEAIKLLDEIIAKGGKPNLVSY 240
           GNL+QSLQLLDRLI KGLVPNAYTYSFLLEAAYKERGADEA KLLDEIIAKGG+PNLVSY
Sbjct: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGGEPNLVSY 240

Query: 241 NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSCNILLRSLCYEGRWEEANELLAEMDG 300
           NVLLTGLCKEGRTEDAI+LFRELPSKGFSPNVVS NILLRSLC EGRWEEAN LLAEM+G
Sbjct: 241 NVLLTGLCKEGRTEDAIRLFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMNG 300

Query: 301 DDRAPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360
           D+R+PS VTYNILIGSL LHGRTEHALEVLEEMIRARFKPTASSYNPIIA LCKD K+DL
Sbjct: 301 DERSPSTVTYNILIGSLALHGRTEHALEVLEEMIRARFKPTASSYNPIIAHLCKDGKLDL 360

Query: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNRQQSSTQEFYKMVVTS 420
           VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGN+Q SSTQEFYK+V+TS
Sbjct: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITS 420

Query: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENCYKPD 480
           LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGML+EAIEIFSVMEEN  KPD
Sbjct: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEEN-NKPD 480

Query: 481 TENYNALILGCCKSQRTDLALEVFEVMVDKGYLPNETTYTILVEGIIHEKEIDLAAKVLK 540
           TENYN+LILGCCKS+RTDLAL+VFE+MV KGYLPNETTYTILVEGIIHEKE+DLA KVL+
Sbjct: 481 TENYNSLILGCCKSRRTDLALDVFEIMVGKGYLPNETTYTILVEGIIHEKEMDLATKVLR 540

Query: 541 ELQLRDVISQSTVDRLVMQYDLNDLPL 568
           ELQLRDVISQST++RLVMQYDLN+LPL
Sbjct: 541 ELQLRDVISQSTLERLVMQYDLNELPL 566

BLAST of MS004611.1 vs. ExPASy TrEMBL
Match: E5GBB3 (Pentatricopeptide repeat-containing protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1)

HSP 1 Score: 1045.4 bits (2702), Expect = 8.5e-302
Identity = 517/567 (91.18%), Postives = 544/567 (95.94%), Query Frame = 0

Query: 1   MATLLNTVSPIANPSPETSRRGCGFFSHIPNLHKLSLNKGFSKVLASTHITISPKDTIFT 60
           MATLLNTVSPI N SPET+RRGCGFFSHIPNL KLSLNKGFSKVLAST ITISPKDTIFT
Sbjct: 1   MATLLNTVSPITNTSPETTRRGCGFFSHIPNLQKLSLNKGFSKVLASTQITISPKDTIFT 60

Query: 61  LPNWRTGKVDQKSRDLRLNDAFLHLEFMVRKGQKPDVFQATQLLYDLCKASKMRKAIKVM 120
           LPNW+TGKV+QKS++LRL DAF HLEFMV KGQKPDVFQATQLLYDLCKA KMRKAIKVM
Sbjct: 61  LPNWKTGKVEQKSKELRLTDAFFHLEFMVEKGQKPDVFQATQLLYDLCKACKMRKAIKVM 120

Query: 121 EMMIGSGIIPDASSYTFLVSCLCKRGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180
           EMMIGSGIIPDASSYTFLVS LC++GNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH
Sbjct: 121 EMMIGSGIIPDASSYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180

Query: 181 GNLSQSLQLLDRLIHKGLVPNAYTYSFLLEAAYKERGADEAIKLLDEIIAKGGKPNLVSY 240
           GNL+QSLQLLDRLI KGLVPNAYTYSFLLEAAYKERGADEA KLLDEIIAKGG+PNLVSY
Sbjct: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGGEPNLVSY 240

Query: 241 NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSCNILLRSLCYEGRWEEANELLAEMDG 300
           NVLLTGLCKEGRTEDAI+LFRELPSKGFSPNVVS NILLRSLC EGRWEEAN LLAEM+G
Sbjct: 241 NVLLTGLCKEGRTEDAIRLFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMNG 300

Query: 301 DDRAPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360
           D+R+PS VTYNILIGSL LHGRTEHALEVLEEMIRARFKPTASSYNPIIA LCKD K+DL
Sbjct: 301 DERSPSTVTYNILIGSLALHGRTEHALEVLEEMIRARFKPTASSYNPIIAHLCKDGKLDL 360

Query: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNRQQSSTQEFYKMVVTS 420
           VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGN+Q SSTQEFYK+V+TS
Sbjct: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITS 420

Query: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENCYKPD 480
           LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGML+EAIEIFSVMEEN  KPD
Sbjct: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEEN-NKPD 480

Query: 481 TENYNALILGCCKSQRTDLALEVFEVMVDKGYLPNETTYTILVEGIIHEKEIDLAAKVLK 540
           TENYN+LILGCCKS+RTDLAL+VFE+MV KGYLPNETTYTILVEGIIHEKE+DLA KVL+
Sbjct: 481 TENYNSLILGCCKSRRTDLALDVFEIMVGKGYLPNETTYTILVEGIIHEKEMDLATKVLR 540

Query: 541 ELQLRDVISQSTVDRLVMQYDLNDLPL 568
           ELQLRDVISQST++RLVMQYDLN+LPL
Sbjct: 541 ELQLRDVISQSTLERLVMQYDLNELPL 566

BLAST of MS004611.1 vs. ExPASy TrEMBL
Match: A0A1S3BA04 (pentatricopeptide repeat-containing protein At1g79080, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103487653 PE=4 SV=1)

HSP 1 Score: 1045.4 bits (2702), Expect = 8.5e-302
Identity = 517/567 (91.18%), Postives = 544/567 (95.94%), Query Frame = 0

Query: 1   MATLLNTVSPIANPSPETSRRGCGFFSHIPNLHKLSLNKGFSKVLASTHITISPKDTIFT 60
           MATLLNTVSPI N SPET+RRGCGFFSHIPNL KLSLNKGFSKVLAST ITISPKDTIFT
Sbjct: 1   MATLLNTVSPITNTSPETTRRGCGFFSHIPNLQKLSLNKGFSKVLASTQITISPKDTIFT 60

Query: 61  LPNWRTGKVDQKSRDLRLNDAFLHLEFMVRKGQKPDVFQATQLLYDLCKASKMRKAIKVM 120
           LPNW+TGKV+QKS++LRL DAF HLEFMV KGQKPDVFQATQLLYDLCKA KMRKAIKVM
Sbjct: 61  LPNWKTGKVEQKSKELRLTDAFFHLEFMVEKGQKPDVFQATQLLYDLCKACKMRKAIKVM 120

Query: 121 EMMIGSGIIPDASSYTFLVSCLCKRGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180
           EMMIGSGIIPDASSYTFLVS LC++GNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH
Sbjct: 121 EMMIGSGIIPDASSYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180

Query: 181 GNLSQSLQLLDRLIHKGLVPNAYTYSFLLEAAYKERGADEAIKLLDEIIAKGGKPNLVSY 240
           GNL+QSLQLLDRLI KGLVPNAYTYSFLLEAAYKERGADEA KLLDEIIAKGG+PNLVSY
Sbjct: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGGEPNLVSY 240

Query: 241 NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSCNILLRSLCYEGRWEEANELLAEMDG 300
           NVLLTGLCKEGRTEDAI+LFRELPSKGFSPNVVS NILLRSLC EGRWEEAN LLAEM+G
Sbjct: 241 NVLLTGLCKEGRTEDAIRLFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMNG 300

Query: 301 DDRAPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360
           D+R+PS VTYNILIGSL LHGRTEHALEVLEEMIRARFKPTASSYNPIIA LCKD K+DL
Sbjct: 301 DERSPSTVTYNILIGSLALHGRTEHALEVLEEMIRARFKPTASSYNPIIAHLCKDGKLDL 360

Query: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNRQQSSTQEFYKMVVTS 420
           VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGN+Q SSTQEFYK+V+TS
Sbjct: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITS 420

Query: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENCYKPD 480
           LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGML+EAIEIFSVMEEN  KPD
Sbjct: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEEN-NKPD 480

Query: 481 TENYNALILGCCKSQRTDLALEVFEVMVDKGYLPNETTYTILVEGIIHEKEIDLAAKVLK 540
           TENYN+LILGCCKS+RTDLAL+VFE+MV KGYLPNETTYTILVEGIIHEKE+DLA KVL+
Sbjct: 481 TENYNSLILGCCKSRRTDLALDVFEIMVGKGYLPNETTYTILVEGIIHEKEMDLATKVLR 540

Query: 541 ELQLRDVISQSTVDRLVMQYDLNDLPL 568
           ELQLRDVISQST++RLVMQYDLN+LPL
Sbjct: 541 ELQLRDVISQSTLERLVMQYDLNELPL 566

BLAST of MS004611.1 vs. ExPASy TrEMBL
Match: A0A0A0L2W8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G613170 PE=4 SV=1)

HSP 1 Score: 1045.0 bits (2701), Expect = 1.1e-301
Identity = 516/567 (91.01%), Postives = 543/567 (95.77%), Query Frame = 0

Query: 1   MATLLNTVSPIANPSPETSRRGCGFFSHIPNLHKLSLNKGFSKVLASTHITISPKDTIFT 60
           MATLLNTVSPI NPSPET+RRGCGFFSHIPN+ KLSLNKGFSKVLAST ITISPKDTIFT
Sbjct: 1   MATLLNTVSPITNPSPETTRRGCGFFSHIPNIQKLSLNKGFSKVLASTQITISPKDTIFT 60

Query: 61  LPNWRTGKVDQKSRDLRLNDAFLHLEFMVRKGQKPDVFQATQLLYDLCKASKMRKAIKVM 120
           LPNW+ GK+DQKS++LRLNDAF HLEFMV KGQKPDVFQATQLLYDLCK  KMRKAIKVM
Sbjct: 61  LPNWKIGKLDQKSKELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKTCKMRKAIKVM 120

Query: 121 EMMIGSGIIPDASSYTFLVSCLCKRGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180
           EMMIGSGIIPDA+SYTFLVS LC++GNVGYAMQLVDKMEEYGYPTNT TYNSLVRGLCMH
Sbjct: 121 EMMIGSGIIPDAASYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTATYNSLVRGLCMH 180

Query: 181 GNLSQSLQLLDRLIHKGLVPNAYTYSFLLEAAYKERGADEAIKLLDEIIAKGGKPNLVSY 240
           GNL+QSLQLLDRLI KGLVPNAYTYSFLLEAAYKERGADEA KLLDEIIAKGGKPNLVSY
Sbjct: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGGKPNLVSY 240

Query: 241 NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSCNILLRSLCYEGRWEEANELLAEMDG 300
           NVLLTGLCKEGRTEDA+QLFRELPSKGFSPNVVS NILLRSLC EGRWEEAN LLAEMDG
Sbjct: 241 NVLLTGLCKEGRTEDAMQLFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMDG 300

Query: 301 DDRAPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360
           D+R+PS VTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL
Sbjct: 301 DERSPSTVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360

Query: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNRQQSSTQEFYKMVVTS 420
           VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGN+Q  STQEFYK+V+TS
Sbjct: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHFSTQEFYKIVITS 420

Query: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENCYKPD 480
           LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGML+EAIEIFSVMEEN  K D
Sbjct: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEEN-IKLD 480

Query: 481 TENYNALILGCCKSQRTDLALEVFEVMVDKGYLPNETTYTILVEGIIHEKEIDLAAKVLK 540
           TENYN+LILGCCKS+RTDLAL+VFE+MV KGYL NETTYTILVEGIIHEKE+DLA +VL+
Sbjct: 481 TENYNSLILGCCKSRRTDLALDVFEIMVGKGYLANETTYTILVEGIIHEKEMDLATEVLR 540

Query: 541 ELQLRDVISQSTVDRLVMQYDLNDLPL 568
           ELQLRDVI+QSTV+RLVMQYDLN+LPL
Sbjct: 541 ELQLRDVINQSTVERLVMQYDLNELPL 566

BLAST of MS004611.1 vs. TAIR 10
Match: AT1G79080.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 760.8 bits (1963), Expect = 8.0e-220
Identity = 382/578 (66.09%), Postives = 474/578 (82.01%), Query Frame = 0

Query: 1   MATLLNTVSPIANPSPETSRRGCGFFSHIPN--LHKLSLNKGFSKVLASTHITISPKDTI 60
           M+TLLN+V  +A+P   + R+  GF SHIP+  LH  S++KG ++VLAST IT+SPKD+ 
Sbjct: 1   MSTLLNSVLSMASPE-SSPRKAVGFVSHIPSGFLHFSSVSKGVARVLASTQITLSPKDSA 60

Query: 61  FTL------PNWRTGKV--DQKSRDLRLNDAFLHLEFMVRKGQKPDVFQATQLLYDLCKA 120
           FT+      P+  +G    D +S +  L+D+F HLE +V  G KP+V  +TQLLYDLCKA
Sbjct: 61  FTITGSSWKPDLDSGSFSDDPRSDEPNLSDSFSHLESLVTGGHKPNVAHSTQLLYDLCKA 120

Query: 121 SKMRKAIKVMEMMIGSGIIPDASSYTFLVSCLCKRGNVGYAMQLVDKMEEYGYPTNTVTY 180
           ++++KAI+V+E+M+ SGIIPDAS+YT+LV+ LCKRGNVGYAMQLV+KME++GYP+NTVTY
Sbjct: 121 NRLKKAIRVIELMVSSGIIPDASAYTYLVNQLCKRGNVGYAMQLVEKMEDHGYPSNTVTY 180

Query: 181 NSLVRGLCMHGNLSQSLQLLDRLIHKGLVPNAYTYSFLLEAAYKERGADEAIKLLDEIIA 240
           N+LVRGLCM G+L+QSLQ ++RL+ KGL PNA+TYSFLLEAAYKERG DEA+KLLDEII 
Sbjct: 181 NALVRGLCMLGSLNQSLQFVERLMQKGLAPNAFTYSFLLEAAYKERGTDEAVKLLDEIIV 240

Query: 241 KGGKPNLVSYNVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSCNILLRSLCYEGRWEE 300
           KGG+PNLVSYNVLLTG CKEGRT+DA+ LFRELP+KGF  NVVS NILLR LC +GRWEE
Sbjct: 241 KGGEPNLVSYNVLLTGFCKEGRTDDAMALFRELPAKGFKANVVSYNILLRCLCCDGRWEE 300

Query: 301 ANELLAEMDGDDRAPSIVTYNILIGSLTLHGRTEHALEVLEEMIRA--RFKPTASSYNPI 360
           AN LLAEMDG DRAPS+VTYNILI SL  HGRTE AL+VL+EM +   +F+ TA+SYNP+
Sbjct: 301 ANSLLAEMDGGDRAPSVVTYNILINSLAFHGRTEQALQVLKEMSKGNHQFRVTATSYNPV 360

Query: 361 IARLCKDRKVDLVVKCLDQMMYRHCNPNEGTYNAIATLCE-EGMVQEAFSIIQSLGNRQQ 420
           IARLCK+ KVDLVVKCLD+M+YR C PNEGTYNAI +LCE    VQEAF IIQSL N+Q+
Sbjct: 361 IARLCKEGKVDLVVKCLDEMIYRRCKPNEGTYNAIGSLCEHNSKVQEAFYIIQSLSNKQK 420

Query: 421 SSTQEFYKMVVTSLCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIE 480
             T +FYK V+TSLCRKGNT+ AFQLLYEMT+ GF PD+ TYS+LIRGLC+EGM   A+E
Sbjct: 421 CCTHDFYKSVITSLCRKGNTFAAFQLLYEMTRCGFDPDAHTYSALIRGLCLEGMFTGAME 480

Query: 481 IFSVME--ENCYKPDTENYNALILGCCKSQRTDLALEVFEVMVDKGYLPNETTYTILVEG 540
           + S+ME  ENC KP  +N+NA+ILG CK +RTDLA+EVFE+MV+K  +PNETTY ILVEG
Sbjct: 481 VLSIMEESENC-KPTVDNFNAMILGLCKIRRTDLAMEVFEMMVEKKRMPNETTYAILVEG 540

Query: 541 IIHEKEIDLAAKVLKELQLRDVISQSTVDRLVMQYDLN 564
           I HE E++LA +VL EL+LR VI Q+ VDR+VMQ++L+
Sbjct: 541 IAHEDELELAKEVLDELRLRKVIGQNAVDRIVMQFNLD 576

BLAST of MS004611.1 vs. TAIR 10
Match: AT1G09900.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 384.8 bits (987), Expect = 1.2e-106
Identity = 196/523 (37.48%), Postives = 321/523 (61.38%), Query Frame = 0

Query: 36  SLNKGFSKVLASTHITISPKDTIFTLPNWRTGK-VDQKSRDLRLNDAFLHLEFMVRKGQK 95
           +L+ G+S    + H   S  ++ F L +  +   + Q  R   L + F  LE MV  G  
Sbjct: 77  TLSSGYSNSNGNGH--YSSVNSSFALEDVESNNHLRQMVRTGELEEGFKFLENMVYHGNV 136

Query: 96  PDVFQATQLLYDLCKASKMRKAIKVMEMMIGSGIIPDASSYTFLVSCLCKRGNVGYAMQL 155
           PD+   T L+   C+  K RKA K++E++ GSG +PD  +Y  ++S  CK G +  A+ +
Sbjct: 137 PDIIPCTTLIRGFCRLGKTRKAAKILEILEGSGAVPDVITYNVMISGYCKAGEINNALSV 196

Query: 156 VDKMEEYGYPTNTVTYNSLVRGLCMHGNLSQSLQLLDRLIHKGLVPNAYTYSFLLEAAYK 215
           +D+M       + VTYN+++R LC  G L Q++++LDR++ +   P+  TY+ L+EA  +
Sbjct: 197 LDRM---SVSPDVVTYNTILRSLCDSGKLKQAMEVLDRMLQRDCYPDVITYTILIEATCR 256

Query: 216 ERGADEAIKLLDEIIAKGGKPNLVSYNVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVS 275
           + G   A+KLLDE+  +G  P++V+YNVL+ G+CKEGR ++AI+   ++PS G  PNV++
Sbjct: 257 DSGVGHAMKLLDEMRDRGCTPDVVTYNVLVNGICKEGRLDEAIKFLNDMPSSGCQPNVIT 316

Query: 276 CNILLRSLCYEGRWEEANELLAEMDGDDRAPSIVTYNILIGSLTLHGRTEHALEVLEEMI 335
            NI+LRS+C  GRW +A +LLA+M     +PS+VT+NILI  L   G    A+++LE+M 
Sbjct: 317 HNIILRSMCSTGRWMDAEKLLADMLRKGFSPSVVTFNILINFLCRKGLLGRAIDILEKMP 376

Query: 336 RARFKPTASSYNPIIARLCKDRKVDLVVKCLDQMMYRHCNPNEGTYNAIAT-LCEEGMVQ 395
           +   +P + SYNP++   CK++K+D  ++ L++M+ R C P+  TYN + T LC++G V+
Sbjct: 377 QHGCQPNSLSYNPLLHGFCKEKKMDRAIEYLERMVSRGCYPDIVTYNTMLTALCKDGKVE 436

Query: 396 EAFSIIQSLGNRQQSSTQEFYKMVVTSLCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLI 455
           +A  I+  L ++  S     Y  V+  L + G T  A +LL EM      PD+ TYSSL+
Sbjct: 437 DAVEILNQLSSKGCSPVLITYNTVIDGLAKAGKTGKAIKLLDEMRAKDLKPDTITYSSLV 496

Query: 456 RGLCMEGMLDEAIEIFSVMEENCYKPDTENYNALILGCCKSQRTDLALEVFEVMVDKGYL 515
            GL  EG +DEAI+ F   E    +P+   +N+++LG CKS++TD A++    M+++G  
Sbjct: 497 GGLSREGKVDEAIKFFHEFERMGIRPNAVTFNSIMLGLCKSRQTDRAIDFLVFMINRGCK 556

Query: 516 PNETTYTILVEGIIHEKEIDLAAKVLKELQLRDVISQSTVDRL 557
           PNET+YTIL+EG+ +E     A ++L EL  + ++ +S+ +++
Sbjct: 557 PNETSYTILIEGLAYEGMAKEALELLNELCNKGLMKKSSAEQV 594

BLAST of MS004611.1 vs. TAIR 10
Match: AT3G04760.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 338.6 bits (867), Expect = 9.8e-93
Identity = 178/479 (37.16%), Postives = 276/479 (57.62%), Query Frame = 0

Query: 85  LEFMVRKGQKPDVFQATQLLYDLCKASKMRKAIKVMEMMIGSGIIPDASSYTFLVSCLCK 144
           LE MVRKG  PDV   T+L+        + KA++VME++   G  PD  +Y  L++  CK
Sbjct: 112 LETMVRKGYNPDVILCTKLIKGFFTLRNIPKAVRVMEILEKFG-QPDVFAYNALINGFCK 171

Query: 145 RGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMHGNLSQSLQLLDRLIHKGLVPNAYT 204
              +  A +++D+M    +  +TVTYN ++  LC  G L  +L++L++L+     P   T
Sbjct: 172 MNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALKVLNQLLSDNCQPTVIT 231

Query: 205 YSFLLEAAYKERGADEAIKLLDEIIAKGGKPNLVSYNVLLTGLCKEGRTEDAIQLFRELP 264
           Y+ L+EA   E G DEA+KL+DE++++G KP++ +YN ++ G+CKEG  + A ++ R L 
Sbjct: 232 YTILIEATMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGMCKEGMVDRAFEMVRNLE 291

Query: 265 SKGFSPNVVSCNILLRSLCYEGRWEEANELLAEMDGDDRAPSIVTYNILIGSLTLHGRTE 324
            KG  P+V+S NILLR+L  +G+WEE  +L+ +M  +   P++VTY+ILI +L   G+ E
Sbjct: 292 LKGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNVVTYSILITTLCRDGKIE 351

Query: 325 HALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDLVVKCLDQMMYRHCNPNEGTYNAI- 384
            A+ +L+ M      P A SY+P+IA  C++ ++D+ ++ L+ M+   C P+   YN + 
Sbjct: 352 EAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLETMISDGCLPDIVNYNTVL 411

Query: 385 ATLCEEGMVQEAFSIIQSLGNRQQSSTQEFYKMVVTSLCRKGNTYPAFQLLYEMTKYGFT 444
           ATLC+ G   +A  I   LG    S     Y  + ++L   G+   A  ++ EM   G  
Sbjct: 412 ATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGDKIRALHMILEMMSNGID 471

Query: 445 PDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENCYKPDTENYNALILGCCKSQRTDLALEV 504
           PD  TY+S+I  LC EGM+DEA E+   M    + P    YN ++LG CK+ R + A+ V
Sbjct: 472 PDEITYNSMISCLCREGMVDEAFELLVDMRSCEFHPSVVTYNIVLLGFCKAHRIEDAINV 531

Query: 505 FEVMVDKGYLPNETTYTILVEGIIHEKEIDLAAKVLKELQLRDVISQSTVDRLVMQYDL 563
            E MV  G  PNETTYT+L+EGI        A ++  +L   D IS+ +  RL   + L
Sbjct: 532 LESMVGNGCRPNETTYTVLIEGIGFAGYRAEAMELANDLVRIDAISEYSFKRLHRTFPL 589

BLAST of MS004611.1 vs. TAIR 10
Match: AT1G62670.1 (rna processing factor 2 )

HSP 1 Score: 260.4 bits (664), Expect = 3.4e-69
Identity = 154/571 (26.97%), Postives = 278/571 (48.69%), Query Frame = 0

Query: 42  SKVLASTHITISPKDTIFTLPNWR---TGKVDQKSR---------DLRLNDAFLHLEFMV 101
           S V+     T+SP  + F    WR   +GK     R         +L+L+DA      MV
Sbjct: 18  SLVVRGNAATVSPSFSFF----WRRAFSGKTSYDYREKLSRNGLSELKLDDAVALFGEMV 77

Query: 102 RKGQKPDVFQATQLLYDLCKASKMRKAIKVMEMMIGSGIIPDASSYTFLVSCLCKRGNVG 161
           +    P + + ++LL  + K +K    I + E M   GI  +  +Y+ L++C C+R  + 
Sbjct: 78  KSRPFPSIIEFSKLLSAIAKMNKFDVVISLGEQMQNLGIPHNHYTYSILINCFCRRSQLP 137

Query: 162 YAMQLVDKMEEYGYPTNTVTYNSLVRGLCMHGNLSQSLQLLDRLIHKGLVPNAYTYSFLL 221
            A+ ++ KM + GY  N VT +SL+ G C    +S+++ L+D++   G  PN  T++ L+
Sbjct: 138 LALAVLGKMMKLGYEPNIVTLSSLLNGYCHSKRISEAVALVDQMFVTGYQPNTVTFNTLI 197

Query: 222 EAAYKERGADEAIKLLDEIIAKGGKPNLVSYNVLLTGLCKEGRT---------------- 281
              +    A EA+ L+D ++AKG +P+LV+Y V++ GLCK G T                
Sbjct: 198 HGLFLHNKASEAMALIDRMVAKGCQPDLVTYGVVVNGLCKRGDTDLAFNLLNKMEQGKLE 257

Query: 282 -------------------EDAIQLFRELPSKGFSPNVVSCNILLRSLCYEGRWEEANEL 341
                              +DA+ LF+E+ +KG  PNVV+ + L+  LC  GRW +A+ L
Sbjct: 258 PGVLIYNTIIDGLCKYKHMDDALNLFKEMETKGIRPNVVTYSSLISCLCNYGRWSDASRL 317

Query: 342 LAEMDGDDRAPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCK 401
           L++M      P + T++ LI +    G+   A ++ +EM++    P+  +Y+ +I   C 
Sbjct: 318 LSDMIERKINPDVFTFSALIDAFVKEGKLVEAEKLYDEMVKRSIDPSIVTYSSLINGFCM 377

Query: 402 DRKVDLVVKCLDQMMYRHCNPNEGTYNA-IATLCEEGMVQEAFSIIQSLGNRQQSSTQEF 461
             ++D   +  + M+ +HC P+  TYN  I   C+   V+E   + + +  R        
Sbjct: 378 HDRLDEAKQMFEFMVSKHCFPDVVTYNTLIKGFCKYKRVEEGMEVFREMSQRGLVGNTVT 437

Query: 462 YKMVVTSLCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVME 521
           Y +++  L + G+   A ++  EM   G  P+  TY++L+ GLC  G L++A+ +F  ++
Sbjct: 438 YNILIQGLFQAGDCDMAQEIFKEMVSDGVPPNIMTYNTLLDGLCKNGKLEKAMVVFEYLQ 497

Query: 522 ENCYKPDTENYNALILGCCKSQRTDLALEVFEVMVDKGYLPNETTYTILVEGIIHEKEID 565
            +  +P    YN +I G CK+ + +   ++F  +  KG  P+   Y  ++ G   +   +
Sbjct: 498 RSKMEPTIYTYNIMIEGMCKAGKVEDGWDLFCNLSLKGVKPDVVAYNTMISGFCRKGSKE 557

BLAST of MS004611.1 vs. TAIR 10
Match: AT1G62590.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 260.0 bits (663), Expect = 4.4e-69
Identity = 146/526 (27.76%), Postives = 262/526 (49.81%), Query Frame = 0

Query: 75  DLRLNDAFLHLEFMVRKGQKPDVFQATQLLYDLCKASKMRKAIKVMEMMIGSGIIPDASS 134
           D++L+DA      MV+    P + +  +LL  + K  K    I + E M    I+    +
Sbjct: 63  DMKLDDAIGLFGGMVKSRPLPSIVEFNKLLSAIAKMKKFDVVISLGEKMQRLEIVHGLYT 122

Query: 135 YTFLVSCLCKRGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMHGNLSQSLQLLDRLI 194
           Y  L++C C+R  +  A+ L+ KM + GY  + VT +SL+ G C    +S ++ L+D+++
Sbjct: 123 YNILINCFCRRSQISLALALLGKMMKLGYEPSIVTLSSLLNGYCHGKRISDAVALVDQMV 182

Query: 195 HKGLVPNAYTYSFLLEAAYKERGADEAIKLLDEIIAKGGKPNLVSYNVLLTGLCKEGRT- 254
             G  P+  T++ L+   +    A EA+ L+D ++ +G +PNLV+Y V++ GLCK G T 
Sbjct: 183 EMGYRPDTITFTTLIHGLFLHNKASEAVALVDRMVQRGCQPNLVTYGVVVNGLCKRGDTD 242

Query: 255 ----------------------------------EDAIQLFRELPSKGFSPNVVSCNILL 314
                                             +DA+ LF+E+ +KG  PNVV+ + L+
Sbjct: 243 LALNLLNKMEAAKIEADVVIFNTIIDSLCKYRHVDDALNLFKEMETKGIRPNVVTYSSLI 302

Query: 315 RSLCYEGRWEEANELLAEMDGDDRAPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFK 374
             LC  GRW +A++LL++M      P++VT+N LI +    G+   A ++ ++MI+    
Sbjct: 303 SCLCSYGRWSDASQLLSDMIEKKINPNLVTFNALIDAFVKEGKFVEAEKLYDDMIKRSID 362

Query: 375 PTASSYNPIIARLCKDRKVDLVVKCLDQMMYRHCNPNEGTYNA-IATLCEEGMVQEAFSI 434
           P   +YN ++   C   ++D   +  + M+ + C P+  TYN  I   C+   V++   +
Sbjct: 363 PDIFTYNSLVNGFCMHDRLDKAKQMFEFMVSKDCFPDVVTYNTLIKGFCKSKRVEDGTEL 422

Query: 435 IQSLGNRQQSSTQEFYKMVVTSLCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCM 494
            + + +R        Y  ++  L   G+   A ++  +M   G  PD  TYS L+ GLC 
Sbjct: 423 FREMSHRGLVGDTVTYTTLIQGLFHDGDCDNAQKVFKQMVSDGVPPDIMTYSILLDGLCN 482

Query: 495 EGMLDEAIEIFSVMEENCYKPDTENYNALILGCCKSQRTDLALEVFEVMVDKGYLPNETT 554
            G L++A+E+F  M+++  K D   Y  +I G CK+ + D   ++F  +  KG  PN  T
Sbjct: 483 NGKLEKALEVFDYMQKSEIKLDIYIYTTMIEGMCKAGKVDDGWDLFCSLSLKGVKPNVVT 542

Query: 555 YTILVEGIIHEKEIDLAAKVLKELQLRDVISQSTVDRLVMQYDLND 565
           Y  ++ G+  ++ +  A  +LK+++    +  S     +++  L D
Sbjct: 543 YNTMISGLCSKRLLQEAYALLKKMKEDGPLPNSGTYNTLIRAHLRD 588

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022141778.10.0e+00100.00pentatricopeptide repeat-containing protein At1g79080, chloroplastic [Momordica ... [more]
XP_038899825.10.0e+0094.00pentatricopeptide repeat-containing protein At1g79080, chloroplastic [Benincasa ... [more]
XP_008444287.11.8e-30191.18PREDICTED: pentatricopeptide repeat-containing protein At1g79080, chloroplastic ... [more]
XP_011653982.12.3e-30191.01pentatricopeptide repeat-containing protein At1g79080, chloroplastic [Cucumis sa... [more]
KAG6607968.12.5e-30090.12Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a... [more]
Match NameE-valueIdentityDescription
A3KPF81.1e-21866.09Pentatricopeptide repeat-containing protein At1g79080, chloroplastic OS=Arabidop... [more]
Q3EDF81.7e-10537.48Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana OX... [more]
Q9SR001.4e-9137.16Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidop... [more]
Q9SXD14.8e-6826.97Pentatricopeptide repeat-containing protein At1g62670, mitochondrial OS=Arabidop... [more]
Q9SXD86.3e-6827.76Pentatricopeptide repeat-containing protein At1g62590 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A6J1CLJ30.0e+00100.00pentatricopeptide repeat-containing protein At1g79080, chloroplastic OS=Momordic... [more]
A0A5A7U8T18.5e-30291.18Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
E5GBB38.5e-30291.18Pentatricopeptide repeat-containing protein OS=Cucumis melo subsp. melo OX=41267... [more]
A0A1S3BA048.5e-30291.18pentatricopeptide repeat-containing protein At1g79080, chloroplastic OS=Cucumis ... [more]
A0A0A0L2W81.1e-30191.01Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G613170 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G79080.18.0e-22066.09Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G09900.11.2e-10637.48Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT3G04760.19.8e-9337.16Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT1G62670.13.4e-6926.97rna processing factor 2 [more]
AT1G62590.14.4e-6927.76pentatricopeptide (PPR) repeat-containing protein [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (TR) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 64..160
e-value: 1.0E-15
score: 59.8
coord: 231..297
e-value: 7.2E-22
score: 79.8
coord: 161..230
e-value: 3.5E-14
score: 54.8
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 423..563
e-value: 1.8E-36
score: 128.2
coord: 298..422
e-value: 2.0E-24
score: 88.6
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 340..383
e-value: 4.1E-8
score: 33.3
coord: 479..525
e-value: 4.2E-12
score: 46.1
coord: 235..283
e-value: 8.6E-15
score: 54.7
coord: 166..211
e-value: 1.4E-10
score: 41.2
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 107..132
e-value: 9.7E-4
score: 17.2
coord: 238..272
e-value: 2.9E-10
score: 37.7
coord: 447..481
e-value: 7.3E-9
score: 33.3
coord: 344..377
e-value: 1.3E-4
score: 19.9
coord: 134..164
e-value: 1.7E-5
score: 22.7
coord: 308..341
e-value: 1.8E-8
score: 32.1
coord: 484..516
e-value: 2.1E-7
score: 28.7
coord: 168..202
e-value: 2.5E-7
score: 28.4
coord: 273..304
e-value: 1.0E-5
score: 23.4
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 127..159
e-value: 1.2E-8
score: 34.6
coord: 441..473
e-value: 7.4E-12
score: 44.8
coord: 305..333
e-value: 9.5E-7
score: 28.4
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 410..444
score: 9.218511
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 96..130
score: 8.988323
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 480..514
score: 12.452094
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 236..270
score: 13.888024
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 131..165
score: 10.753093
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 445..479
score: 12.967276
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 341..375
score: 9.13082
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 306..340
score: 11.684803
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 166..200
score: 12.430172
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 271..305
score: 10.950397
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 201..235
score: 9.536388
NoneNo IPR availablePANTHERPTHR45613PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 1..565
NoneNo IPR availablePANTHERPTHR45613:SF297PPR CONTAINING PLANT-LIKE PROTEINcoord: 1..565
NoneNo IPR availableSUPERFAMILY81901HCP-likecoord: 93..265

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
MS004611MS004611gene


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
MS004611.1-cdsMS004611.1-cds-scaffold995:678114..679814CDS


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
MS004611.1MS004611.1-proteinpolypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding