Moc03g31300 (gene) Bitter gourd (OHB3-1) v2

Overview
NameMoc03g31300
Typegene
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionPentatricopeptide repeat-containing protein
Locationchr3: 22205917 .. 22207719 (-)
RNA-Seq ExpressionMoc03g31300
SyntenyMoc03g31300
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCAGGCTCATAGGTTCTCCCCATCTTCCGGTTTTATCAAAAGACCTCTTTATCTATGGAACTTGATGATTCGAAGCTCCGTCAATGGCGGTTTCTTCGCCGAAACTCTCGAAACCTACTCCTTTATGCGCCACTCTGGAATTCATGGTAACAACTTCACTTTTCCTCTCCTCCTCAAGGCCTGCGCCAATCTTGCTTCGATTGGCGATGGCACGATGCTCCACGCTCACCTCATCCGCGTCGGGTTCGAAACAGATATCTTCGTTCAAACCTCACTCGTGGATATGTACTCCAAATGCTTCGACTTGGCTTCTTCACGCCAAGTGTTCGACGAAATGTCTATGAGAAGTGTCATCTCTTGGAATTCGATGATTGCTGCTTATTCTCGCGCTTTTCGGGTTAATGAAGGGTTTAAGCTATTCCGAGAGATGTGGGGGTTTGGGTTTGAGCCAAATTCCTCGACATTTGTGAGCTTATTGTCCGGTTTTGCTAACCCAGTTCACGGGTCTCTCTTTCAGTGCCTTCTGATACAGGGTTGCCTAACAAAGTTTCGACTTCAAAATGACACGCCCGTGGCAAATTCTCTCATGAAAATGTATGTAAACTTTGGTCAAATTGATGCTGCTCGCTCTGTTTTTTATGCCATAGGTGGCAAGACAGTAATTTCTTGGACAATAATGCTTGGTGGATACTTGAAATCTGGGGCTGTTGCCGAAGTGTTCAGAATCTTTAGTCAAATGAGACTAAATAATGTGGTATTGGATAAAGTCGTTTTCGTGGACATCATTTCCTCTTGTATACAACTTGGAAATTTGCTTTTAGCTTCTTCGCTTCATTCTCTCCTACTGAAAATTGGGCTCAATAGTGAGGATCCTATTGGCTGCTTGCTTATTAGCATGTATTCAAAATGCGGCGACCACTTGTCTGCTCGAGCAGTATTTGATATGTTACCTGAAAAAGGCATCTTCTTGTGGACGTCGGTGATAAGTGGATATGCCAATGCTGGGTACCCTGGAGAAGCGTTACATCTATTTACAATGGCGACACAGAATAATATTAGACCAAATGGAGCAATGCTAGCTACTGCAGTTTCTGCTTGTGCTGATTCAGGATCGCTGAGCATGTGCAAGGAGATGGAAGCATACATACAGCTGAATGGTGTAGCGGTAGATTGTCAAGTTTCGACATCATTGATACATATGTATTGTAAATGTGAAAGTATTAAGAAGGCAGAAGGAGTTTTTAAAAGTATGATAAGTAGAGACCTGGCAGCTTGGAGTGCTATGATGAATGGCTACGCTGTGTATGGGATGGGAGAAGAGGCTATAAATCTGTTTCATGAGATGCGAAGAACCGGAATAAAACCAGATGCTTCTGTTTATGCAAGCATTTTATTGGCTTGCAGTCACTCAGGTCTAGTAGAAGATGGGCTGAATCATTTCAAGAACATGCAATTGGATTTTGGAATAGAACCTACTGTGGAACACTATACTTGTTTGGTAGATATCCTAAGCAGAGCTGGTCATCTAGAATTGGCTTTGAACGCAATTCAAGAGATGCCTGCCCAATTTCAAGCTCAAGCTTGGGTTCCTTTTCTCAGTGCTTGCAGGACTTACTGCGATGTCGAACTTGGAGAAGTTGCAAATAAAAACATATCAGGTTCAAATCCTGGAAACCCAGTAAATCATGTTTTGGTGGCTAATTTATATACATCTGTGGGTAAGTGGAAAGAAGCAGCTGTAGTGAGAAGTTTGATCGGTGATAAAGGTCTGGTCAAAGAACCAGGATGTAGCCAGCTTTAA

mRNA sequence

ATGCAGGCTCATAGGTTCTCCCCATCTTCCGGTTTTATCAAAAGACCTCTTTATCTATGGAACTTGATGATTCGAAGCTCCGTCAATGGCGGTTTCTTCGCCGAAACTCTCGAAACCTACTCCTTTATGCGCCACTCTGGAATTCATGGTAACAACTTCACTTTTCCTCTCCTCCTCAAGGCCTGCGCCAATCTTGCTTCGATTGGCGATGGCACGATGCTCCACGCTCACCTCATCCGCGTCGGGTTCGAAACAGATATCTTCGTTCAAACCTCACTCGTGGATATGTACTCCAAATGCTTCGACTTGGCTTCTTCACGCCAAGTGTTCGACGAAATGTCTATGAGAAGTGTCATCTCTTGGAATTCGATGATTGCTGCTTATTCTCGCGCTTTTCGGGTTAATGAAGGGTTTAAGCTATTCCGAGAGATGTGGGGGTTTGGGTTTGAGCCAAATTCCTCGACATTTGTGAGCTTATTGTCCGGTTTTGCTAACCCAGTTCACGGGTCTCTCTTTCAGTGCCTTCTGATACAGGGTTGCCTAACAAAGTTTCGACTTCAAAATGACACGCCCGTGGCAAATTCTCTCATGAAAATGTATGTAAACTTTGGTCAAATTGATGCTGCTCGCTCTGTTTTTTATGCCATAGGTGGCAAGACAGTAATTTCTTGGACAATAATGCTTGGTGGATACTTGAAATCTGGGGCTGTTGCCGAAGTGTTCAGAATCTTTAGTCAAATGAGACTAAATAATGTGGTATTGGATAAAGTCGTTTTCGTGGACATCATTTCCTCTTGTATACAACTTGGAAATTTGCTTTTAGCTTCTTCGCTTCATTCTCTCCTACTGAAAATTGGGCTCAATAGTGAGGATCCTATTGGCTGCTTGCTTATTAGCATGTATTCAAAATGCGGCGACCACTTGTCTGCTCGAGCAGTATTTGATATGTTACCTGAAAAAGGCATCTTCTTGTGGACGTCGGTGATAAGTGGATATGCCAATGCTGGGTACCCTGGAGAAGCGTTACATCTATTTACAATGGCGACACAGAATAATATTAGACCAAATGGAGCAATGCTAGCTACTGCAGTTTCTGCTTGTGCTGATTCAGGATCGCTGAGCATGTGCAAGGAGATGGAAGCATACATACAGCTGAATGGTGTAGCGGTAGATTGTCAAGTTTCGACATCATTGATACATATGTATTGTAAATGTGAAAGTATTAAGAAGGCAGAAGGAGTTTTTAAAAGTATGATAAGTAGAGACCTGGCAGCTTGGAGTGCTATGATGAATGGCTACGCTGTGTATGGGATGGGAGAAGAGGCTATAAATCTGTTTCATGAGATGCGAAGAACCGGAATAAAACCAGATGCTTCTGTTTATGCAAGCATTTTATTGGCTTGCAGTCACTCAGGTCTAGTAGAAGATGGGCTGAATCATTTCAAGAACATGCAATTGGATTTTGGAATAGAACCTACTGTGGAACACTATACTTGTTTGGTAGATATCCTAAGCAGAGCTGGTCATCTAGAATTGGCTTTGAACGCAATTCAAGAGATGCCTGCCCAATTTCAAGCTCAAGCTTGGGTTCCTTTTCTCAGTGCTTGCAGGACTTACTGCGATGTCGAACTTGGAGAAGTTGCAAATAAAAACATATCAGGTTCAAATCCTGGAAACCCAGTAAATCATGTTTTGGTGGCTAATTTATATACATCTGTGGGTAAGTGGAAAGAAGCAGCTGTAGTGAGAAGTTTGATCGGTGATAAAGGTCTGGTCAAAGAACCAGGATGTAGCCAGCTTTAA

Coding sequence (CDS)

ATGCAGGCTCATAGGTTCTCCCCATCTTCCGGTTTTATCAAAAGACCTCTTTATCTATGGAACTTGATGATTCGAAGCTCCGTCAATGGCGGTTTCTTCGCCGAAACTCTCGAAACCTACTCCTTTATGCGCCACTCTGGAATTCATGGTAACAACTTCACTTTTCCTCTCCTCCTCAAGGCCTGCGCCAATCTTGCTTCGATTGGCGATGGCACGATGCTCCACGCTCACCTCATCCGCGTCGGGTTCGAAACAGATATCTTCGTTCAAACCTCACTCGTGGATATGTACTCCAAATGCTTCGACTTGGCTTCTTCACGCCAAGTGTTCGACGAAATGTCTATGAGAAGTGTCATCTCTTGGAATTCGATGATTGCTGCTTATTCTCGCGCTTTTCGGGTTAATGAAGGGTTTAAGCTATTCCGAGAGATGTGGGGGTTTGGGTTTGAGCCAAATTCCTCGACATTTGTGAGCTTATTGTCCGGTTTTGCTAACCCAGTTCACGGGTCTCTCTTTCAGTGCCTTCTGATACAGGGTTGCCTAACAAAGTTTCGACTTCAAAATGACACGCCCGTGGCAAATTCTCTCATGAAAATGTATGTAAACTTTGGTCAAATTGATGCTGCTCGCTCTGTTTTTTATGCCATAGGTGGCAAGACAGTAATTTCTTGGACAATAATGCTTGGTGGATACTTGAAATCTGGGGCTGTTGCCGAAGTGTTCAGAATCTTTAGTCAAATGAGACTAAATAATGTGGTATTGGATAAAGTCGTTTTCGTGGACATCATTTCCTCTTGTATACAACTTGGAAATTTGCTTTTAGCTTCTTCGCTTCATTCTCTCCTACTGAAAATTGGGCTCAATAGTGAGGATCCTATTGGCTGCTTGCTTATTAGCATGTATTCAAAATGCGGCGACCACTTGTCTGCTCGAGCAGTATTTGATATGTTACCTGAAAAAGGCATCTTCTTGTGGACGTCGGTGATAAGTGGATATGCCAATGCTGGGTACCCTGGAGAAGCGTTACATCTATTTACAATGGCGACACAGAATAATATTAGACCAAATGGAGCAATGCTAGCTACTGCAGTTTCTGCTTGTGCTGATTCAGGATCGCTGAGCATGTGCAAGGAGATGGAAGCATACATACAGCTGAATGGTGTAGCGGTAGATTGTCAAGTTTCGACATCATTGATACATATGTATTGTAAATGTGAAAGTATTAAGAAGGCAGAAGGAGTTTTTAAAAGTATGATAAGTAGAGACCTGGCAGCTTGGAGTGCTATGATGAATGGCTACGCTGTGTATGGGATGGGAGAAGAGGCTATAAATCTGTTTCATGAGATGCGAAGAACCGGAATAAAACCAGATGCTTCTGTTTATGCAAGCATTTTATTGGCTTGCAGTCACTCAGGTCTAGTAGAAGATGGGCTGAATCATTTCAAGAACATGCAATTGGATTTTGGAATAGAACCTACTGTGGAACACTATACTTGTTTGGTAGATATCCTAAGCAGAGCTGGTCATCTAGAATTGGCTTTGAACGCAATTCAAGAGATGCCTGCCCAATTTCAAGCTCAAGCTTGGGTTCCTTTTCTCAGTGCTTGCAGGACTTACTGCGATGTCGAACTTGGAGAAGTTGCAAATAAAAACATATCAGGTTCAAATCCTGGAAACCCAGTAAATCATGTTTTGGTGGCTAATTTATATACATCTGTGGGTAAGTGGAAAGAAGCAGCTGTAGTGAGAAGTTTGATCGGTGATAAAGGTCTGGTCAAAGAACCAGGATGTAGCCAGCTTTAA

Protein sequence

MQAHRFSPSSGFIKRPLYLWNLMIRSSVNGGFFAETLETYSFMRHSGIHGNNFTFPLLLKACANLASIGDGTMLHAHLIRVGFETDIFVQTSLVDMYSKCFDLASSRQVFDEMSMRSVISWNSMIAAYSRAFRVNEGFKLFREMWGFGFEPNSSTFVSLLSGFANPVHGSLFQCLLIQGCLTKFRLQNDTPVANSLMKMYVNFGQIDAARSVFYAIGGKTVISWTIMLGGYLKSGAVAEVFRIFSQMRLNNVVLDKVVFVDIISSCIQLGNLLLASSLHSLLLKIGLNSEDPIGCLLISMYSKCGDHLSARAVFDMLPEKGIFLWTSVISGYANAGYPGEALHLFTMATQNNIRPNGAMLATAVSACADSGSLSMCKEMEAYIQLNGVAVDCQVSTSLIHMYCKCESIKKAEGVFKSMISRDLAAWSAMMNGYAVYGMGEEAINLFHEMRRTGIKPDASVYASILLACSHSGLVEDGLNHFKNMQLDFGIEPTVEHYTCLVDILSRAGHLELALNAIQEMPAQFQAQAWVPFLSACRTYCDVELGEVANKNISGSNPGNPVNHVLVANLYTSVGKWKEAAVVRSLIGDKGLVKEPGCSQL
Homology
BLAST of Moc03g31300 vs. NCBI nr
Match: XP_022137264.1 (pentatricopeptide repeat-containing protein At2g13600-like [Momordica charantia])

HSP 1 Score: 1206.0 bits (3119), Expect = 0.0e+00
Identity = 600/600 (100.00%), Postives = 600/600 (100.00%), Query Frame = 0

Query: 1   MQAHRFSPSSGFIKRPLYLWNLMIRSSVNGGFFAETLETYSFMRHSGIHGNNFTFPLLLK 60
           MQAHRFSPSSGFIKRPLYLWNLMIRSSVNGGFFAETLETYSFMRHSGIHGNNFTFPLLLK
Sbjct: 1   MQAHRFSPSSGFIKRPLYLWNLMIRSSVNGGFFAETLETYSFMRHSGIHGNNFTFPLLLK 60

Query: 61  ACANLASIGDGTMLHAHLIRVGFETDIFVQTSLVDMYSKCFDLASSRQVFDEMSMRSVIS 120
           ACANLASIGDGTMLHAHLIRVGFETDIFVQTSLVDMYSKCFDLASSRQVFDEMSMRSVIS
Sbjct: 61  ACANLASIGDGTMLHAHLIRVGFETDIFVQTSLVDMYSKCFDLASSRQVFDEMSMRSVIS 120

Query: 121 WNSMIAAYSRAFRVNEGFKLFREMWGFGFEPNSSTFVSLLSGFANPVHGSLFQCLLIQGC 180
           WNSMIAAYSRAFRVNEGFKLFREMWGFGFEPNSSTFVSLLSGFANPVHGSLFQCLLIQGC
Sbjct: 121 WNSMIAAYSRAFRVNEGFKLFREMWGFGFEPNSSTFVSLLSGFANPVHGSLFQCLLIQGC 180

Query: 181 LTKFRLQNDTPVANSLMKMYVNFGQIDAARSVFYAIGGKTVISWTIMLGGYLKSGAVAEV 240
           LTKFRLQNDTPVANSLMKMYVNFGQIDAARSVFYAIGGKTVISWTIMLGGYLKSGAVAEV
Sbjct: 181 LTKFRLQNDTPVANSLMKMYVNFGQIDAARSVFYAIGGKTVISWTIMLGGYLKSGAVAEV 240

Query: 241 FRIFSQMRLNNVVLDKVVFVDIISSCIQLGNLLLASSLHSLLLKIGLNSEDPIGCLLISM 300
           FRIFSQMRLNNVVLDKVVFVDIISSCIQLGNLLLASSLHSLLLKIGLNSEDPIGCLLISM
Sbjct: 241 FRIFSQMRLNNVVLDKVVFVDIISSCIQLGNLLLASSLHSLLLKIGLNSEDPIGCLLISM 300

Query: 301 YSKCGDHLSARAVFDMLPEKGIFLWTSVISGYANAGYPGEALHLFTMATQNNIRPNGAML 360
           YSKCGDHLSARAVFDMLPEKGIFLWTSVISGYANAGYPGEALHLFTMATQNNIRPNGAML
Sbjct: 301 YSKCGDHLSARAVFDMLPEKGIFLWTSVISGYANAGYPGEALHLFTMATQNNIRPNGAML 360

Query: 361 ATAVSACADSGSLSMCKEMEAYIQLNGVAVDCQVSTSLIHMYCKCESIKKAEGVFKSMIS 420
           ATAVSACADSGSLSMCKEMEAYIQLNGVAVDCQVSTSLIHMYCKCESIKKAEGVFKSMIS
Sbjct: 361 ATAVSACADSGSLSMCKEMEAYIQLNGVAVDCQVSTSLIHMYCKCESIKKAEGVFKSMIS 420

Query: 421 RDLAAWSAMMNGYAVYGMGEEAINLFHEMRRTGIKPDASVYASILLACSHSGLVEDGLNH 480
           RDLAAWSAMMNGYAVYGMGEEAINLFHEMRRTGIKPDASVYASILLACSHSGLVEDGLNH
Sbjct: 421 RDLAAWSAMMNGYAVYGMGEEAINLFHEMRRTGIKPDASVYASILLACSHSGLVEDGLNH 480

Query: 481 FKNMQLDFGIEPTVEHYTCLVDILSRAGHLELALNAIQEMPAQFQAQAWVPFLSACRTYC 540
           FKNMQLDFGIEPTVEHYTCLVDILSRAGHLELALNAIQEMPAQFQAQAWVPFLSACRTYC
Sbjct: 481 FKNMQLDFGIEPTVEHYTCLVDILSRAGHLELALNAIQEMPAQFQAQAWVPFLSACRTYC 540

Query: 541 DVELGEVANKNISGSNPGNPVNHVLVANLYTSVGKWKEAAVVRSLIGDKGLVKEPGCSQL 600
           DVELGEVANKNISGSNPGNPVNHVLVANLYTSVGKWKEAAVVRSLIGDKGLVKEPGCSQL
Sbjct: 541 DVELGEVANKNISGSNPGNPVNHVLVANLYTSVGKWKEAAVVRSLIGDKGLVKEPGCSQL 600

BLAST of Moc03g31300 vs. NCBI nr
Match: XP_004137641.1 (pentatricopeptide repeat-containing protein At3g12770 [Cucumis sativus])

HSP 1 Score: 984.6 bits (2544), Expect = 3.9e-283
Identity = 494/601 (82.20%), Postives = 537/601 (89.35%), Query Frame = 0

Query: 1   MQAHRFSPSSGFI-KRPLYLWNLMIRSSVNGGFFAETLETYSFMRHSGIHGNNFTFPLLL 60
           MQ HRFS SS  I K+PLYLWNL IRSSVNGGFFA++LETYSFMRHSGIHGNNFTFPLLL
Sbjct: 1   MQIHRFSSSSTLITKKPLYLWNLTIRSSVNGGFFAQSLETYSFMRHSGIHGNNFTFPLLL 60

Query: 61  KACANLASIGDGTMLHAHLIRVGFETDIFVQTSLVDMYSKCFDLASSRQVFDEMSMRSVI 120
           KACANLASIGDGTMLHAHLI VGFE+D+FVQTSLVDMYSK  +L +SRQVFDE S RSVI
Sbjct: 61  KACANLASIGDGTMLHAHLIHVGFESDVFVQTSLVDMYSKFSNLRASRQVFDETSTRSVI 120

Query: 121 SWNSMIAAYSRAFRVNEGFKLFREMWGFGFEPNSSTFVSLLSGFANPVHGSLFQCLLIQG 180
           SWNSMIAAYSR+FRVNE  KLFREM G GFEPNSSTFVSLLSGFA+P HGSLFQ  L+ G
Sbjct: 121 SWNSMIAAYSRSFRVNEALKLFREMLGGGFEPNSSTFVSLLSGFADPTHGSLFQGRLLHG 180

Query: 181 CLTKFRLQNDTPVANSLMKMYVNFGQIDAARSVFYAIGGKTVISWTIMLGGYLKSGAVAE 240
           CLTKF+L +DTPV NSL++MYVNFGQID+A SVFYAI  KTVISWTIMLGGYLK+GAVA+
Sbjct: 181 CLTKFQLHDDTPVENSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAK 240

Query: 241 VFRIFSQMRLNNVVLDKVVFVDIISSCIQLGNLLLASSLHSLLLKIGLNSEDPIGCLLIS 300
           VF  FSQMR NNVVLDK VFVDIISSCIQLGNL L SSLHSLLLK GL  EDPIGCLLIS
Sbjct: 241 VFETFSQMRQNNVVLDKFVFVDIISSCIQLGNLFLGSSLHSLLLKTGLKYEDPIGCLLIS 300

Query: 301 MYSKCGDHLSARAVFDMLPEKGIFLWTSVISGYANAGYPGEALHLFTMATQNNIRPNGAM 360
           MYSKCGD LSARAVFD+L EK I+ WTS+ISGYANAGYP EAL LF+MATQNN+RPNGAM
Sbjct: 301 MYSKCGDLLSARAVFDLLSEKSIYSWTSMISGYANAGYPREALSLFSMATQNNVRPNGAM 360

Query: 361 LATAVSACADSGSLSMCKEMEAYIQLNGVAVDCQVSTSLIHMYCKCESIKKAEGVFKSMI 420
           LATA+SACAD GSLSM +E+EA+IQ +G+A D QVSTSLIH+YCK  SI+KAE VF SMI
Sbjct: 361 LATAISACADLGSLSMRREIEAFIQQDGLASDSQVSTSLIHLYCKFGSIEKAEKVFNSMI 420

Query: 421 SRDLAAWSAMMNGYAVYGMGEEAINLFHEMRRTGIKPDASVYASILLACSHSGLVEDGLN 480
            RDLAAWS+MMNGYAV+GMGE+ +NLFHEM+R+GIKPD SVYASILLACSHSGLVEDGL 
Sbjct: 421 HRDLAAWSSMMNGYAVHGMGEKTMNLFHEMQRSGIKPDGSVYASILLACSHSGLVEDGLE 480

Query: 481 HFKNMQLDFGIEPTVEHYTCLVDILSRAGHLELALNAIQEMPAQFQAQAWVPFLSACRTY 540
           HFKNMQLD+GI PT+ HYTCLVDILSRAGHLELALN IQEMP QFQ+QAW PFLSACRTY
Sbjct: 481 HFKNMQLDYGIVPTMVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSACRTY 540

Query: 541 CDVELGEVANKNISGSNPGNPVNHVLVANLYTSVGKWKEAAVVRSLIGDKGLVKEPGCSQ 600
           CDVELGEVAN+ +  SNP NPVNHVL+ANLYTS+GKWKEAA VRSLI DKGLVKEPGCSQ
Sbjct: 541 CDVELGEVANRCLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKEPGCSQ 600

BLAST of Moc03g31300 vs. NCBI nr
Match: XP_038893873.1 (LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Benincasa hispida])

HSP 1 Score: 980.3 bits (2533), Expect = 7.3e-282
Identity = 494/601 (82.20%), Postives = 533/601 (88.69%), Query Frame = 0

Query: 1   MQAHRFSPSSG-FIKRPLYLWNLMIRSSVNGGFFAETLETYSFMRHSGIHGNNFTFPLLL 60
           MQ HRFSPSS   IKRPLYLWNL IR SVNGGFF E LETYSFMRHSGIHGNNFTFPLLL
Sbjct: 1   MQIHRFSPSSTLIIKRPLYLWNLTIRISVNGGFFTEXLETYSFMRHSGIHGNNFTFPLLL 60

Query: 61  KACANLASIGDGTMLHAHLIRVGFETDIFVQTSLVDMYSKCFDLASSRQVFDEMSMRSVI 120
           KAC+NLASIGDGTMLHAHLIRV FE+DIFVQTSLVDM SKC DLASSRQ+FDEMS RSVI
Sbjct: 61  KACSNLASIGDGTMLHAHLIRVRFESDIFVQTSLVDMCSKCSDLASSRQMFDEMSTRSVI 120

Query: 121 SWNSMIAAYSRAFRVNEGFKLFREMWGFGFEPNSSTFVSLLSGFANPVHGSLFQCLLIQG 180
           SWNSMIAAYSR F VNE  KLFREM G GFE NSSTFVSLLSGFA+P HGSLFQ   + G
Sbjct: 121 SWNSMIAAYSRDFGVNEALKLFREMLGVGFEANSSTFVSLLSGFADPTHGSLFQGRSVHG 180

Query: 181 CLTKFRLQNDTPVANSLMKMYVNFGQIDAARSVFYAIGGKTVISWTIMLGGYLKSGAVAE 240
           C+TKF+L +DTPVANSLM+MYVNFGQID+A SVFY I  KTVISWTIMLGGYL++GAVA+
Sbjct: 181 CITKFQLLDDTPVANSLMQMYVNFGQIDSACSVFYTISDKTVISWTIMLGGYLRAGAVAK 240

Query: 241 VFRIFSQMRLNNVVLDKVVFVDIISSCIQLGNLLLASSLHSLLLKIGLNSEDPIGCLLIS 300
           VF IF+QMR NNVVLDKVVFVDIISSC+QLGNL LASSLHSLLLK GLN+EDPIGCLLIS
Sbjct: 241 VFEIFNQMRKNNVVLDKVVFVDIISSCVQLGNLFLASSLHSLLLKTGLNNEDPIGCLLIS 300

Query: 301 MYSKCGDHLSARAVFDMLPEKGIFLWTSVISGYANAGYPGEALHLFTMATQNNIRPNGAM 360
           MY K GD LSAR VFD+L EK I+ WTS+ISGYANAGYP EAL  FTMATQNN+RPNGAM
Sbjct: 301 MYLKRGDLLSARVVFDLLTEKSIYSWTSMISGYANAGYPREALRFFTMATQNNVRPNGAM 360

Query: 361 LATAVSACADSGSLSMCKEMEAYIQLNGVAVDCQVSTSLIHMYCKCESIKKAEGVFKSMI 420
           LATAVSACAD GSL+MC+E+EA+I LN +A D QVSTSLIH+YCKC SI+KAE  F SMI
Sbjct: 361 LATAVSACADLGSLNMCREIEAFIPLNDLASDYQVSTSLIHLYCKCGSIEKAEIFFNSMI 420

Query: 421 SRDLAAWSAMMNGYAVYGMGEEAINLFHEMRRTGIKPDASVYASILLACSHSGLVEDGLN 480
            RDLAAWS+MMNGYA++ MGEEAINLFH+M+R+G+KPDASVYASILLACSHSGLVEDGL 
Sbjct: 421 HRDLAAWSSMMNGYAMHCMGEEAINLFHKMQRSGMKPDASVYASILLACSHSGLVEDGLK 480

Query: 481 HFKNMQLDFGIEPTVEHYTCLVDILSRAGHLELALNAIQEMPAQFQAQAWVPFLSACRTY 540
           HFKNMQLDFGI PTV HYTCLVDILSR G LELALN IQEMP QFQAQAW PFLSACRTY
Sbjct: 481 HFKNMQLDFGIVPTVVHYTCLVDILSRTGRLELALNTIQEMPTQFQAQAWAPFLSACRTY 540

Query: 541 CDVELGEVANKNISGSNPGNPVNHVLVANLYTSVGKWKEAAVVRSLIGDKGLVKEPGCSQ 600
           CDV LGEVAN++I  SNP NPVNHVL+ANLYTS+ KWKEAA+VRSLIGDKGL KEPGCSQ
Sbjct: 541 CDVGLGEVANRSILCSNPRNPVNHVLMANLYTSMDKWKEAAMVRSLIGDKGLFKEPGCSQ 600

BLAST of Moc03g31300 vs. NCBI nr
Match: TYJ99083.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 969.1 bits (2504), Expect = 1.7e-278
Identity = 487/601 (81.03%), Postives = 529/601 (88.02%), Query Frame = 0

Query: 1   MQAHRFSPSSGFI-KRPLYLWNLMIRSSVNGGFFAETLETYSFMRHSGIHGNNFTFPLLL 60
           MQ HRFS SS  I K+PLYLWNL IRSSVNGGFFA+TLETYSFMR SGIHGNNFTFPLLL
Sbjct: 1   MQIHRFSSSSTLITKKPLYLWNLTIRSSVNGGFFAQTLETYSFMRQSGIHGNNFTFPLLL 60

Query: 61  KACANLASIGDGTMLHAHLIRVGFETDIFVQTSLVDMYSKCFDLASSRQVFDEMSMRSVI 120
           KACANLASIGDGTMLHAHLI VGFE+D+FVQTSLVDMYSK  DL +SRQVFDE S RSVI
Sbjct: 61  KACANLASIGDGTMLHAHLIHVGFESDVFVQTSLVDMYSKISDLRASRQVFDETSTRSVI 120

Query: 121 SWNSMIAAYSRAFRVNEGFKLFREMWGFGFEPNSSTFVSLLSGFANPVHGSLFQCLLIQG 180
            WNSMIAAYSR FRVNE  KLFREM G GFEPNSSTFVSLLSGFA+P HGSLFQ  L+ G
Sbjct: 121 FWNSMIAAYSRGFRVNEALKLFREMLGGGFEPNSSTFVSLLSGFADPTHGSLFQGRLLHG 180

Query: 181 CLTKFRLQNDTPVANSLMKMYVNFGQIDAARSVFYAIGGKTVISWTIMLGGYLKSGAVAE 240
            +TKF+  +DTPV NSL++MYVNFGQID+A SVFYAI  KTVISWTIMLGGYLK+GAVA+
Sbjct: 181 FMTKFQFHDDTPVQNSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAK 240

Query: 241 VFRIFSQMRLNNVVLDKVVFVDIISSCIQLGNLLLASSLHSLLLKIGLNSEDPIGCLLIS 300
           VF  FSQMR NNVVLDK VFVDIISSCIQLGNL L SSLHSLLLK  L  EDPIGCLLIS
Sbjct: 241 VFETFSQMRQNNVVLDKFVFVDIISSCIQLGNLSLGSSLHSLLLKTALKYEDPIGCLLIS 300

Query: 301 MYSKCGDHLSARAVFDMLPEKGIFLWTSVISGYANAGYPGEALHLFTMATQNNIRPNGAM 360
           MYSKCGD LSARAVFD+L EK I+ WTS+ISGYANAGYP EAL LFTMATQNN+RPNGAM
Sbjct: 301 MYSKCGDLLSARAVFDLLSEKSIYSWTSMISGYANAGYPREALSLFTMATQNNVRPNGAM 360

Query: 361 LATAVSACADSGSLSMCKEMEAYIQLNGVAVDCQVSTSLIHMYCKCESIKKAEGVFKSMI 420
           LATA+SACAD GSLSM +E+EA+IQ +G+A D QVSTSLIH+YCK  S +KAE VF SMI
Sbjct: 361 LATAISACADLGSLSMLREIEAFIQQDGLASDYQVSTSLIHLYCKFGSFEKAEKVFSSMI 420

Query: 421 SRDLAAWSAMMNGYAVYGMGEEAINLFHEMRRTGIKPDASVYASILLACSHSGLVEDGLN 480
            RDLAAWS+MMNGYA++GMGE+ +NLFHEM+R+GIKPD SVYASILLACSHSGLVEDGL 
Sbjct: 421 HRDLAAWSSMMNGYAMHGMGEKTMNLFHEMQRSGIKPDGSVYASILLACSHSGLVEDGLE 480

Query: 481 HFKNMQLDFGIEPTVEHYTCLVDILSRAGHLELALNAIQEMPAQFQAQAWVPFLSACRTY 540
           HFKNMQLD+GI P + HYTCLVDILSRAGHLELALN IQEMP QFQ+QAW PFLSACRTY
Sbjct: 481 HFKNMQLDYGIVPNMVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSACRTY 540

Query: 541 CDVELGEVANKNISGSNPGNPVNHVLVANLYTSVGKWKEAAVVRSLIGDKGLVKEPGCSQ 600
           CDVELGEVAN+ +  SNP NPVNHVL+ANLYTS+GKWKEAA VRSLI DKGLVK+PGCSQ
Sbjct: 541 CDVELGEVANRCLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKQPGCSQ 600

BLAST of Moc03g31300 vs. NCBI nr
Match: XP_008446053.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic-like [Cucumis melo])

HSP 1 Score: 964.9 bits (2493), Expect = 3.2e-277
Identity = 485/601 (80.70%), Postives = 528/601 (87.85%), Query Frame = 0

Query: 1   MQAHRFSPSSGFI-KRPLYLWNLMIRSSVNGGFFAETLETYSFMRHSGIHGNNFTFPLLL 60
           MQ HRFS SS  I K+PLYLWNL IRSSVNGGFFA+TLETYSFMR SGIHGNNFTFPLLL
Sbjct: 1   MQIHRFSSSSTLITKKPLYLWNLTIRSSVNGGFFAQTLETYSFMRQSGIHGNNFTFPLLL 60

Query: 61  KACANLASIGDGTMLHAHLIRVGFETDIFVQTSLVDMYSKCFDLASSRQVFDEMSMRSVI 120
           KACANLASIGDGTMLHAHLI VGFE+D+FVQTSLVDMYSK  DL +SRQVFDE S RSVI
Sbjct: 61  KACANLASIGDGTMLHAHLIHVGFESDVFVQTSLVDMYSKISDLRASRQVFDETSTRSVI 120

Query: 121 SWNSMIAAYSRAFRVNEGFKLFREMWGFGFEPNSSTFVSLLSGFANPVHGSLFQCLLIQG 180
            WNSMIAAYSR FRVNE  KLFREM G GFEPNSSTFVSLLSGFA+P HGSLFQ  L+ G
Sbjct: 121 FWNSMIAAYSRGFRVNEALKLFREMLGGGFEPNSSTFVSLLSGFADPTHGSLFQGRLLHG 180

Query: 181 CLTKFRLQNDTPVANSLMKMYVNFGQIDAARSVFYAIGGKTVISWTIMLGGYLKSGAVAE 240
            +TKF+  +DTPV NSL++MYVNFGQID+A SVFYAI  KTVISWTIMLGGYLK+GAVA+
Sbjct: 181 FMTKFQFHDDTPVQNSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAK 240

Query: 241 VFRIFSQMRLNNVVLDKVVFVDIISSCIQLGNLLLASSLHSLLLKIGLNSEDPIGCLLIS 300
           VF  FSQMR NNVVLDK VFVDIISSCIQLGNL L SSLHSLLLK  L  +DPIGCLLIS
Sbjct: 241 VFETFSQMRQNNVVLDKFVFVDIISSCIQLGNLSLGSSLHSLLLKTALKYQDPIGCLLIS 300

Query: 301 MYSKCGDHLSARAVFDMLPEKGIFLWTSVISGYANAGYPGEALHLFTMATQNNIRPNGAM 360
           MYSKCGD LSARAVFD+L EK I+ WTS+IS YANAGYP EAL LFTMATQNN+RPNGAM
Sbjct: 301 MYSKCGDLLSARAVFDLLSEKSIYSWTSMISEYANAGYPREALSLFTMATQNNVRPNGAM 360

Query: 361 LATAVSACADSGSLSMCKEMEAYIQLNGVAVDCQVSTSLIHMYCKCESIKKAEGVFKSMI 420
           LATA+SACAD GSLSM +E+EA+IQ +G+A D QVSTSLIH+YCK  S +KAE VF SMI
Sbjct: 361 LATAISACADLGSLSMLREIEAFIQQDGLASDYQVSTSLIHLYCKFGSFEKAEKVFSSMI 420

Query: 421 SRDLAAWSAMMNGYAVYGMGEEAINLFHEMRRTGIKPDASVYASILLACSHSGLVEDGLN 480
            RDLAAWS+MMNGYA++GMGE+ +NLFHEM+R+GIKPD SVYASILLACSHSGLVEDGL 
Sbjct: 421 HRDLAAWSSMMNGYAMHGMGEKTMNLFHEMQRSGIKPDGSVYASILLACSHSGLVEDGLQ 480

Query: 481 HFKNMQLDFGIEPTVEHYTCLVDILSRAGHLELALNAIQEMPAQFQAQAWVPFLSACRTY 540
           HFKNMQLD+GI P + HYTCLVDILSRAGHLELALN IQEMP QFQ+QAW PFLSACRTY
Sbjct: 481 HFKNMQLDYGIVPNMVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSACRTY 540

Query: 541 CDVELGEVANKNISGSNPGNPVNHVLVANLYTSVGKWKEAAVVRSLIGDKGLVKEPGCSQ 600
           CDVELGEVAN+ +  SNP NPVNHVL+ANLYTS+GKWKEAA VRSLI DKGLVK+PGCSQ
Sbjct: 541 CDVELGEVANRCLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKQPGCSQ 600

BLAST of Moc03g31300 vs. ExPASy Swiss-Prot
Match: Q9LFL5 (Pentatricopeptide repeat-containing protein At5g16860 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H92 PE=2 SV=1)

HSP 1 Score: 373.2 bits (957), Expect = 5.4e-102
Identity = 224/645 (34.73%), Postives = 338/645 (52.40%), Query Frame = 0

Query: 5   RFSPSSGFIKRPLYLWNLMIRSSVNGGFFAETLETYSFMRHSGIHGNNFTFPLLLKACAN 64
           RF PS   +    Y WN +IRS  + G   + L  +  M       +N+TFP + KAC  
Sbjct: 84  RFPPSDAGV----YHWNSLIRSYGDNGCANKCLYLFGLMHSLSWTPDNYTFPFVFKACGE 143

Query: 65  LASIGDGTMLHAHLIRVGFETDIFVQTSLVDMYSKCFDLASSRQVFDEMSMRSVISWNSM 124
           ++S+  G   HA  +  GF +++FV  +LV MYS+C  L+ +R+VFDEMS+  V+SWNS+
Sbjct: 144 ISSVRCGESAHALSLVTGFISNVFVGNALVAMYSRCRSLSDARKVFDEMSVWDVVSWNSI 203

Query: 125 IAAYSRAFRVNEGFKLFREMWG-FGFEPNSSTFVSLLSGFAN-PVH--GSLFQCLLIQGC 184
           I +Y++  +     ++F  M   FG  P++ T V++L   A+   H  G    C  +   
Sbjct: 204 IESYAKLGKPKVALEMFSRMTNEFGCRPDNITLVNVLPPCASLGTHSLGKQLHCFAV--- 263

Query: 185 LTKFRLQNDTPVANSLMKMYVNFGQIDAARSVFYAIGGKTVISWTIMLGGYLKSGAVAEV 244
            T   +QN   V N L+ MY   G +D A +VF  +  K V+SW  M+ GY + G   + 
Sbjct: 264 -TSEMIQN-MFVGNCLVDMYAKCGMMDEANTVFSNMSVKDVVSWNAMVAGYSQIGRFEDA 323

Query: 245 FRIFSQMRLNNVVLD-----------------------------------KVVFVDIISS 304
            R+F +M+   + +D                                   +V  + ++S 
Sbjct: 324 VRLFEKMQEEKIKMDVVTWSAAISGYAQRGLGYEALGVCRQMLSSGIKPNEVTLISVLSG 383

Query: 305 CIQLGNLLLASSLHSLLLKI-------GLNSEDPIGCLLISMYSKCGDHLSARAVFDML- 364
           C  +G L+    +H   +K        G   E+ +   LI MY+KC    +ARA+FD L 
Sbjct: 384 CASVGALMHGKEIHCYAIKYPIDLRKNGHGDENMVINQLIDMYAKCKKVDTARAMFDSLS 443

Query: 365 -PEKGIFLWTSVISGYANAGYPGEALHLFTMATQNN--IRPNGAMLATAVSACADSGSLS 424
             E+ +  WT +I GY+  G   +AL L +   + +   RPN   ++ A+ ACA   +L 
Sbjct: 444 PKERDVVTWTVMIGGYSQHGDANKALELLSEMFEEDCQTRPNAFTISCALVACASLAALR 503

Query: 425 MCKEMEAYIQLNGV-AVDCQVSTSLIHMYCKCESIKKAEGVFKSMISRDLAAWSAMMNGY 484
           + K++ AY   N   AV   VS  LI MY KC SI  A  VF +M++++   W+++M GY
Sbjct: 504 IGKQIHAYALRNQQNAVPLFVSNCLIDMYAKCGSISDARLVFDNMMAKNEVTWTSLMTGY 563

Query: 485 AVYGMGEEAINLFHEMRRTGIKPDASVYASILLACSHSGLVEDGLNHFKNMQLDFGIEPT 544
            ++G GEEA+ +F EMRR G K D      +L ACSHSG+++ G+ +F  M+  FG+ P 
Sbjct: 564 GMHGYGEEALGIFDEMRRIGFKLDGVTLLVVLYACSHSGMIDQGMEYFNRMKTVFGVSPG 623

Query: 545 VEHYTCLVDILSRAGHLELALNAIQEMPAQFQAQAWVPFLSACRTYCDVELGEVANKNIS 599
            EHY CLVD+L RAG L  AL  I+EMP +     WV FLS CR +  VELGE A + I+
Sbjct: 624 PEHYACLVDLLGRAGRLNAALRLIEEMPMEPPPVVWVAFLSCCRIHGKVELGEYAAEKIT 683

BLAST of Moc03g31300 vs. ExPASy Swiss-Prot
Match: Q9ZQ74 (Pentatricopeptide repeat-containing protein At2g03380, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E47 PE=3 SV=1)

HSP 1 Score: 355.1 bits (910), Expect = 1.5e-96
Identity = 202/585 (34.53%), Postives = 318/585 (54.36%), Query Frame = 0

Query: 18  YLWNLMIRSSVNGGFFAETLETYSFMRHSGIHGNNFTFPLLLKACANLASIGDGTMLHAH 77
           YLW +M+R         E ++ Y  +   G   ++  F   LKAC  L  + +G  +H  
Sbjct: 108 YLWKVMLRCYCLNKESVEVVKLYDLLMKHGFRYDDIVFSKALKACTELQDLDNGKKIHCQ 167

Query: 78  LIRVGFETDIFVQTSLVDMYSKCFDLASSRQVFDEMSMRSVISWNSMIAAYSRAFRVNEG 137
           L++V    D  V T L+DMY+KC ++ S+ +VF+++++R+V+ W SMIA Y +     EG
Sbjct: 168 LVKVP-SFDNVVLTGLLDMYAKCGEIKSAHKVFNDITLRNVVCWTSMIAGYVKNDLCEEG 227

Query: 138 FKLFREMWGFGFEPNSSTFVSLLSGFANPVHGSLFQCLLIQGCLTKFRLQNDTPVANSLM 197
             LF  M       N  T+ +L+   A     +L Q     GCL K  ++  + +  SL+
Sbjct: 228 LVLFNRMRENNVLGNEYTYGTLI--MACTKLSALHQGKWFHGCLVKSGIELSSCLVTSLL 287

Query: 198 KMYVNFGQIDAARSVFYAIGGKTVISWTIMLGGYLKSGAVAEVFRIFSQMRLNNVVLDKV 257
            MYV  G I  AR VF       ++ WT M+ GY  +G+V E   +F +M+   +  + V
Sbjct: 288 DMYVKCGDISNARRVFNEHSHVDLVMWTAMIVGYTHNGSVNEALSLFQKMKGVEIKPNCV 347

Query: 258 VFVDIISSCIQLGNLLLASSLHSLLLKIGLNSEDPIGCLLISMYSKCGDHLSARAVFDML 317
               ++S C  + NL L  S+H L +K+G+  +  +   L+ MY+KC  +  A+ VF+M 
Sbjct: 348 TIASVLSGCGLIENLELGRSVHGLSIKVGI-WDTNVANALVHMYAKCYQNRDAKYVFEME 407

Query: 318 PEKGIFLWTSVISGYANAGYPGEALHLFTMATQNNIRPNGAMLATAVSACADSGSLSMCK 377
            EK I  W S+ISG++  G   EAL LF      ++ PNG  +A+  SACA  GSL++  
Sbjct: 408 SEKDIVAWNSIISGFSQNGSIHEALFLFHRMNSESVTPNGVTVASLFSACASLGSLAVGS 467

Query: 378 EMEAY-IQLNGVA-VDCQVSTSLIHMYCKCESIKKAEGVFKSMISRDLAAWSAMMNGYAV 437
            + AY ++L  +A     V T+L+  Y KC   + A  +F ++  ++   WSAM+ GY  
Sbjct: 468 SLHAYSVKLGFLASSSVHVGTALLDFYAKCGDPQSARLIFDTIEEKNTITWSAMIGGYGK 527

Query: 438 YGMGEEAINLFHEMRRTGIKPDASVYASILLACSHSGLVEDGLNHFKNMQLDFGIEPTVE 497
            G    ++ LF EM +   KP+ S + SIL AC H+G+V +G  +F +M  D+   P+ +
Sbjct: 528 QGDTIGSLELFEEMLKKQQKPNESTFTSILSACGHTGMVNEGKKYFSSMYKDYNFTPSTK 587

Query: 498 HYTCLVDILSRAGHLELALNAIQEMPAQFQAQAWVPFLSACRTYCDVELGEVANKNISGS 557
           HYTC+VD+L+RAG LE AL+ I++MP Q   + +  FL  C  +   +LGE+  K +   
Sbjct: 588 HYTCMVDMLARAGELEQALDIIEKMPIQPDVRCFGAFLHGCGMHSRFDLGEIVIKKMLDL 647

Query: 558 NPGNPVNHVLVANLYTSVGKWKEAAVVRSLIGDKGLVKEPGCSQL 601
           +P +   +VLV+NLY S G+W +A  VR+L+  +GL K  G S +
Sbjct: 648 HPDDASYYVLVSNLYASDGRWNQAKEVRNLMKQRGLSKIAGHSTM 688

BLAST of Moc03g31300 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 354.4 bits (908), Expect = 2.6e-96
Identity = 199/587 (33.90%), Postives = 315/587 (53.66%), Query Frame = 0

Query: 13  IKRPLYLWNLMIRSSVNGGFFAETLETYSFMRHSGIHGNNFTFPLLLKACANLASIGDGT 72
           I++ L+ WN+++      G F+ ++  +  M  SG+  +++TF  + K+ ++L S+  G 
Sbjct: 157 IEKALF-WNILMNELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGGE 216

Query: 73  MLHAHLIRVGFETDIFVQTSLVDMYSKCFDLASSRQVFDEMSMRSVISWNSMIAAYSRAF 132
            LH  +++ GF     V  SLV  Y K   + S+R+VFDEM+ R VISWNS+I  Y    
Sbjct: 217 QLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNG 276

Query: 133 RVNEGFKLFREMWGFGFEPNSSTFVSLLSGFANPVHGSLFQCLLIQGCLTKFRLQNDTPV 192
              +G  +F +M   G E + +T VS+ +G A+    SL + +   G   K     +   
Sbjct: 277 LAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIG--VKACFSREDRF 336

Query: 193 ANSLMKMYVNFGQIDAARSVFYAIGGKTVISWTIMLGGYLKSGAVAEVFRIFSQMRLNNV 252
            N+L+ MY   G +D+A++VF  +  ++V+S+T M+ GY + G   E  ++F +M    +
Sbjct: 337 CNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGI 396

Query: 253 VLDKVVFVDIISSCIQLGNLLLASSLHSLLLKIGLNSEDPIGCLLISMYSKCGDHLSARA 312
             D      +++ C +   L     +H  + +  L  +  +   L+ MY+KCG    A  
Sbjct: 397 SPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAEL 456

Query: 313 VFDMLPEKGIFLWTSVISGYANAGYPGEALHLFT-MATQNNIRPNGAMLATAVSACADSG 372
           VF  +  K I  W ++I GY+   Y  EAL LF  +  +    P+   +A  + ACA   
Sbjct: 457 VFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLS 516

Query: 373 SLSMCKEMEAYIQLNGVAVDCQVSTSLIHMYCKCESIKKAEGVFKSMISRDLAAWSAMMN 432
           +    +E+  YI  NG   D  V+ SL+ MY KC ++  A  +F  + S+DL +W+ M+ 
Sbjct: 517 AFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIA 576

Query: 433 GYAVYGMGEEAINLFHEMRRTGIKPDASVYASILLACSHSGLVEDGLNHFKNMQLDFGIE 492
           GY ++G G+EAI LF++MR+ GI+ D   + S+L ACSHSGLV++G   F  M+ +  IE
Sbjct: 577 GYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIE 636

Query: 493 PTVEHYTCLVDILSRAGHLELALNAIQEMPAQFQAQAWVPFLSACRTYCDVELGEVANKN 552
           PTVEHY C+VD+L+R G L  A   I+ MP    A  W   L  CR + DV+L E   + 
Sbjct: 637 PTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEK 696

Query: 553 ISGSNPGNPVNHVLVANLYTSVGKWKEAAVVRSLIGDKGLVKEPGCS 599
           +    P N   +VL+AN+Y    KW++   +R  IG +GL K PGCS
Sbjct: 697 VFELEPENTGYYVLMANIYAEAEKWEQVKRLRKRIGQRGLRKNPGCS 740

BLAST of Moc03g31300 vs. ExPASy Swiss-Prot
Match: Q9LUJ2 (Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H56 PE=3 SV=1)

HSP 1 Score: 346.3 bits (887), Expect = 7.0e-94
Identity = 203/616 (32.95%), Postives = 319/616 (51.79%), Query Frame = 0

Query: 18  YLWNLMIRSSVNGGFFAETLETYSFMRHSGIHGNNFTFPLLLKACANLASIGDGTMLHAH 77
           +++N +IR   + G   E +  +  M +SGI  + +TFP  L ACA   + G+G  +H  
Sbjct: 100 FMYNSLIRGYASSGLCNEAILLFLRMMNSGISPDKYTFPFGLSACAKSRAKGNGIQIHGL 159

Query: 78  LIRVGFETDIFVQTSLVDMYSKCFDLASSRQVFDEMSMRSVISWNSMIAAYSRAFRVNEG 137
           ++++G+  D+FVQ SLV  Y++C +L S+R+VFDEMS R+V+SW SMI  Y+R     + 
Sbjct: 160 IVKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVVSWTSMICGYARRDFAKDA 219

Query: 138 FKL-FREMWGFGFEPNSSTFVSLLSGFANPVHGSLFQCLLIQGCLTKFRLQNDTPVANSL 197
             L FR +      PNS T V ++S  A      L     +   +    ++ +  + ++L
Sbjct: 220 VDLFFRMVRDEEVTPNSVTMVCVISACAK--LEDLETGEKVYAFIRNSGIEVNDLMVSAL 279

Query: 198 MKMYVNFGQIDAARSVFYAIGGKTVISWTIMLGGYLKSGAVAEVFRIFSQMRLNNVVLDK 257
           + MY+    ID A+ +F   G   +     M   Y++ G   E   +F+ M  + V  D+
Sbjct: 280 VDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDSGVRPDR 339

Query: 258 VVFVDIISSCIQLGNLLLASSLHSLLLKIGLNSEDPIGCLLISMYSKCG---------DH 317
           +  +  ISSC QL N+L   S H  +L+ G  S D I   LI MY KC          D 
Sbjct: 340 ISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHRQDTAFRIFDR 399

Query: 318 LSARAV----------------------FDMLPEKGIFLWTSVISGYANAGYPGEALHLF 377
           +S + V                      F+ +PEK I  W ++ISG        EA+ +F
Sbjct: 400 MSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGSLFEEAIEVF 459

Query: 378 -TMATQNNIRPNGAMLATAVSACADSGSLSMCKEMEAYIQLNGVAVDCQVSTSLIHMYCK 437
            +M +Q  +  +G  + +  SAC   G+L + K +  YI+ NG+ +D ++ T+L+ M+ +
Sbjct: 460 CSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRLGTTLVDMFSR 519

Query: 438 CESIKKAEGVFKSMISRDLAAWSAMMNGYAVYGMGEEAINLFHEMRRTGIKPDASVYASI 497
           C   + A  +F S+ +RD++AW+A +   A+ G  E AI LF +M   G+KPD   +   
Sbjct: 520 CGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLKPDGVAFVGA 579

Query: 498 LLACSHSGLVEDGLNHFKNMQLDFGIEPTVEHYTCLVDILSRAGHLELALNAIQEMPAQF 557
           L ACSH GLV+ G   F +M    G+ P   HY C+VD+L RAG LE A+  I++MP + 
Sbjct: 580 LTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQLIEDMPMEP 639

Query: 558 QAQAWVPFLSACRTYCDVELGEVANKNISGSNPGNPVNHVLVANLYTSVGKWKEAAVVRS 601
               W   L+ACR   +VE+   A + I    P    ++VL++N+Y S G+W + A VR 
Sbjct: 640 NDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAGRWNDMAKVRL 699

BLAST of Moc03g31300 vs. ExPASy Swiss-Prot
Match: Q9M9E2 (Pentatricopeptide repeat-containing protein At1g15510, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H73 PE=1 SV=1)

HSP 1 Score: 345.9 bits (886), Expect = 9.2e-94
Identity = 194/586 (33.11%), Postives = 314/586 (53.58%), Query Frame = 0

Query: 14  KRPLYLWNLMIRSSVNGGFFAETLETYSFMRH-SGIHGNNFTFPLLLKACANLASIGDGT 73
           +R L+ WN+++      G+F E +  Y  M    G+  + +TFP +L+ C  +  +  G 
Sbjct: 157 ERNLFSWNVLVGGYAKQGYFDEAMCLYHRMLWVGGVKPDVYTFPCVLRTCGGIPDLARGK 216

Query: 74  MLHAHLIRVGFETDIFVQTSLVDMYSKCFDLASSRQVFDEMSMRSVISWNSMIAAYSRAF 133
            +H H++R G+E DI V  +L+ MY KC D+ S+R +FD M  R +ISWN+MI+ Y    
Sbjct: 217 EVHVHVVRYGYELDIDVVNALITMYVKCGDVKSARLLFDRMPRRDIISWNAMISGYFENG 276

Query: 134 RVNEGFKLFREMWGFGFEPNSSTFVSLLSGFANPVHGSLFQCLLIQGCLTKFRLQNDTPV 193
             +EG +LF  M G   +P+  T  S++S         L + +      T F +  D  V
Sbjct: 277 MCHEGLELFFAMRGLSVDPDLMTLTSVISACELLGDRRLGRDIHAYVITTGFAV--DISV 336

Query: 194 ANSLMKMYVNFGQIDAARSVFYAIGGKTVISWTIMLGGYLKSGAVAEVFRIFSQMRLNNV 253
            NSL +MY+N G    A  +F  +  K ++SWT M+ GY  +    +    +  M  ++V
Sbjct: 337 CNSLTQMYLNAGSWREAEKLFSRMERKDIVSWTTMISGYEYNFLPDKAIDTYRMMDQDSV 396

Query: 254 VLDKVVFVDIISSCIQLGNLLLASSLHSLLLKIGLNSEDPIGCLLISMYSKCGDHLSARA 313
             D++    ++S+C  LG+L     LH L +K  L S   +   LI+MYSKC     A  
Sbjct: 397 KPDEITVAAVLSACATLGDLDTGVELHKLAIKARLISYVIVANNLINMYSKCKCIDKALD 456

Query: 314 VFDMLPEKGIFLWTSVISGYANAGYPGEALHLFTMATQNNIRPNGAMLATAVSACADSGS 373
           +F  +P K +  WTS+I+G        EAL +F    +  ++PN   L  A++ACA  G+
Sbjct: 457 IFHNIPRKNVISWTSIIAGLRLNNRCFEAL-IFLRQMKMTLQPNAITLTAALAACARIGA 516

Query: 374 LSMCKEMEAYIQLNGVAVDCQVSTSLIHMYCKCESIKKAEGVFKSMISRDLAAWSAMMNG 433
           L   KE+ A++   GV +D  +  +L+ MY +C  +  A   F S   +D+ +W+ ++ G
Sbjct: 517 LMCGKEIHAHVLRTGVGLDDFLPNALLDMYVRCGRMNTAWSQFNSQ-KKDVTSWNILLTG 576

Query: 434 YAVYGMGEEAINLFHEMRRTGIKPDASVYASILLACSHSGLVEDGLNHFKNMQLDFGIEP 493
           Y+  G G   + LF  M ++ ++PD   + S+L  CS S +V  GL +F  M+ D+G+ P
Sbjct: 577 YSERGQGSMVVELFDRMVKSRVRPDEITFISLLCGCSKSQMVRQGLMYFSKME-DYGVTP 636

Query: 494 TVEHYTCLVDILSRAGHLELALNAIQEMPAQFQAQAWVPFLSACRTYCDVELGEVANKNI 553
            ++HY C+VD+L RAG L+ A   IQ+MP       W   L+ACR +  ++LGE++ ++I
Sbjct: 637 NLKHYACVVDLLGRAGELQEAHKFIQKMPVTPDPAVWGALLNACRIHHKIDLGELSAQHI 696

Query: 554 SGSNPGNPVNHVLVANLYTSVGKWKEAAVVRSLIGDKGLVKEPGCS 599
              +  +   ++L+ NLY   GKW+E A VR ++ + GL  + GCS
Sbjct: 697 FELDKKSVGYYILLCNLYADCGKWREVAKVRRMMKENGLTVDAGCS 737

BLAST of Moc03g31300 vs. ExPASy TrEMBL
Match: A0A6J1C6R4 (pentatricopeptide repeat-containing protein At2g13600-like OS=Momordica charantia OX=3673 GN=LOC111008767 PE=4 SV=1)

HSP 1 Score: 1206.0 bits (3119), Expect = 0.0e+00
Identity = 600/600 (100.00%), Postives = 600/600 (100.00%), Query Frame = 0

Query: 1   MQAHRFSPSSGFIKRPLYLWNLMIRSSVNGGFFAETLETYSFMRHSGIHGNNFTFPLLLK 60
           MQAHRFSPSSGFIKRPLYLWNLMIRSSVNGGFFAETLETYSFMRHSGIHGNNFTFPLLLK
Sbjct: 1   MQAHRFSPSSGFIKRPLYLWNLMIRSSVNGGFFAETLETYSFMRHSGIHGNNFTFPLLLK 60

Query: 61  ACANLASIGDGTMLHAHLIRVGFETDIFVQTSLVDMYSKCFDLASSRQVFDEMSMRSVIS 120
           ACANLASIGDGTMLHAHLIRVGFETDIFVQTSLVDMYSKCFDLASSRQVFDEMSMRSVIS
Sbjct: 61  ACANLASIGDGTMLHAHLIRVGFETDIFVQTSLVDMYSKCFDLASSRQVFDEMSMRSVIS 120

Query: 121 WNSMIAAYSRAFRVNEGFKLFREMWGFGFEPNSSTFVSLLSGFANPVHGSLFQCLLIQGC 180
           WNSMIAAYSRAFRVNEGFKLFREMWGFGFEPNSSTFVSLLSGFANPVHGSLFQCLLIQGC
Sbjct: 121 WNSMIAAYSRAFRVNEGFKLFREMWGFGFEPNSSTFVSLLSGFANPVHGSLFQCLLIQGC 180

Query: 181 LTKFRLQNDTPVANSLMKMYVNFGQIDAARSVFYAIGGKTVISWTIMLGGYLKSGAVAEV 240
           LTKFRLQNDTPVANSLMKMYVNFGQIDAARSVFYAIGGKTVISWTIMLGGYLKSGAVAEV
Sbjct: 181 LTKFRLQNDTPVANSLMKMYVNFGQIDAARSVFYAIGGKTVISWTIMLGGYLKSGAVAEV 240

Query: 241 FRIFSQMRLNNVVLDKVVFVDIISSCIQLGNLLLASSLHSLLLKIGLNSEDPIGCLLISM 300
           FRIFSQMRLNNVVLDKVVFVDIISSCIQLGNLLLASSLHSLLLKIGLNSEDPIGCLLISM
Sbjct: 241 FRIFSQMRLNNVVLDKVVFVDIISSCIQLGNLLLASSLHSLLLKIGLNSEDPIGCLLISM 300

Query: 301 YSKCGDHLSARAVFDMLPEKGIFLWTSVISGYANAGYPGEALHLFTMATQNNIRPNGAML 360
           YSKCGDHLSARAVFDMLPEKGIFLWTSVISGYANAGYPGEALHLFTMATQNNIRPNGAML
Sbjct: 301 YSKCGDHLSARAVFDMLPEKGIFLWTSVISGYANAGYPGEALHLFTMATQNNIRPNGAML 360

Query: 361 ATAVSACADSGSLSMCKEMEAYIQLNGVAVDCQVSTSLIHMYCKCESIKKAEGVFKSMIS 420
           ATAVSACADSGSLSMCKEMEAYIQLNGVAVDCQVSTSLIHMYCKCESIKKAEGVFKSMIS
Sbjct: 361 ATAVSACADSGSLSMCKEMEAYIQLNGVAVDCQVSTSLIHMYCKCESIKKAEGVFKSMIS 420

Query: 421 RDLAAWSAMMNGYAVYGMGEEAINLFHEMRRTGIKPDASVYASILLACSHSGLVEDGLNH 480
           RDLAAWSAMMNGYAVYGMGEEAINLFHEMRRTGIKPDASVYASILLACSHSGLVEDGLNH
Sbjct: 421 RDLAAWSAMMNGYAVYGMGEEAINLFHEMRRTGIKPDASVYASILLACSHSGLVEDGLNH 480

Query: 481 FKNMQLDFGIEPTVEHYTCLVDILSRAGHLELALNAIQEMPAQFQAQAWVPFLSACRTYC 540
           FKNMQLDFGIEPTVEHYTCLVDILSRAGHLELALNAIQEMPAQFQAQAWVPFLSACRTYC
Sbjct: 481 FKNMQLDFGIEPTVEHYTCLVDILSRAGHLELALNAIQEMPAQFQAQAWVPFLSACRTYC 540

Query: 541 DVELGEVANKNISGSNPGNPVNHVLVANLYTSVGKWKEAAVVRSLIGDKGLVKEPGCSQL 600
           DVELGEVANKNISGSNPGNPVNHVLVANLYTSVGKWKEAAVVRSLIGDKGLVKEPGCSQL
Sbjct: 541 DVELGEVANKNISGSNPGNPVNHVLVANLYTSVGKWKEAAVVRSLIGDKGLVKEPGCSQL 600

BLAST of Moc03g31300 vs. ExPASy TrEMBL
Match: A0A0A0LT91 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G043310 PE=4 SV=1)

HSP 1 Score: 984.6 bits (2544), Expect = 1.9e-283
Identity = 494/601 (82.20%), Postives = 537/601 (89.35%), Query Frame = 0

Query: 1   MQAHRFSPSSGFI-KRPLYLWNLMIRSSVNGGFFAETLETYSFMRHSGIHGNNFTFPLLL 60
           MQ HRFS SS  I K+PLYLWNL IRSSVNGGFFA++LETYSFMRHSGIHGNNFTFPLLL
Sbjct: 1   MQIHRFSSSSTLITKKPLYLWNLTIRSSVNGGFFAQSLETYSFMRHSGIHGNNFTFPLLL 60

Query: 61  KACANLASIGDGTMLHAHLIRVGFETDIFVQTSLVDMYSKCFDLASSRQVFDEMSMRSVI 120
           KACANLASIGDGTMLHAHLI VGFE+D+FVQTSLVDMYSK  +L +SRQVFDE S RSVI
Sbjct: 61  KACANLASIGDGTMLHAHLIHVGFESDVFVQTSLVDMYSKFSNLRASRQVFDETSTRSVI 120

Query: 121 SWNSMIAAYSRAFRVNEGFKLFREMWGFGFEPNSSTFVSLLSGFANPVHGSLFQCLLIQG 180
           SWNSMIAAYSR+FRVNE  KLFREM G GFEPNSSTFVSLLSGFA+P HGSLFQ  L+ G
Sbjct: 121 SWNSMIAAYSRSFRVNEALKLFREMLGGGFEPNSSTFVSLLSGFADPTHGSLFQGRLLHG 180

Query: 181 CLTKFRLQNDTPVANSLMKMYVNFGQIDAARSVFYAIGGKTVISWTIMLGGYLKSGAVAE 240
           CLTKF+L +DTPV NSL++MYVNFGQID+A SVFYAI  KTVISWTIMLGGYLK+GAVA+
Sbjct: 181 CLTKFQLHDDTPVENSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAK 240

Query: 241 VFRIFSQMRLNNVVLDKVVFVDIISSCIQLGNLLLASSLHSLLLKIGLNSEDPIGCLLIS 300
           VF  FSQMR NNVVLDK VFVDIISSCIQLGNL L SSLHSLLLK GL  EDPIGCLLIS
Sbjct: 241 VFETFSQMRQNNVVLDKFVFVDIISSCIQLGNLFLGSSLHSLLLKTGLKYEDPIGCLLIS 300

Query: 301 MYSKCGDHLSARAVFDMLPEKGIFLWTSVISGYANAGYPGEALHLFTMATQNNIRPNGAM 360
           MYSKCGD LSARAVFD+L EK I+ WTS+ISGYANAGYP EAL LF+MATQNN+RPNGAM
Sbjct: 301 MYSKCGDLLSARAVFDLLSEKSIYSWTSMISGYANAGYPREALSLFSMATQNNVRPNGAM 360

Query: 361 LATAVSACADSGSLSMCKEMEAYIQLNGVAVDCQVSTSLIHMYCKCESIKKAEGVFKSMI 420
           LATA+SACAD GSLSM +E+EA+IQ +G+A D QVSTSLIH+YCK  SI+KAE VF SMI
Sbjct: 361 LATAISACADLGSLSMRREIEAFIQQDGLASDSQVSTSLIHLYCKFGSIEKAEKVFNSMI 420

Query: 421 SRDLAAWSAMMNGYAVYGMGEEAINLFHEMRRTGIKPDASVYASILLACSHSGLVEDGLN 480
            RDLAAWS+MMNGYAV+GMGE+ +NLFHEM+R+GIKPD SVYASILLACSHSGLVEDGL 
Sbjct: 421 HRDLAAWSSMMNGYAVHGMGEKTMNLFHEMQRSGIKPDGSVYASILLACSHSGLVEDGLE 480

Query: 481 HFKNMQLDFGIEPTVEHYTCLVDILSRAGHLELALNAIQEMPAQFQAQAWVPFLSACRTY 540
           HFKNMQLD+GI PT+ HYTCLVDILSRAGHLELALN IQEMP QFQ+QAW PFLSACRTY
Sbjct: 481 HFKNMQLDYGIVPTMVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSACRTY 540

Query: 541 CDVELGEVANKNISGSNPGNPVNHVLVANLYTSVGKWKEAAVVRSLIGDKGLVKEPGCSQ 600
           CDVELGEVAN+ +  SNP NPVNHVL+ANLYTS+GKWKEAA VRSLI DKGLVKEPGCSQ
Sbjct: 541 CDVELGEVANRCLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKEPGCSQ 600

BLAST of Moc03g31300 vs. ExPASy TrEMBL
Match: A0A5D3BIG9 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G002870 PE=4 SV=1)

HSP 1 Score: 969.1 bits (2504), Expect = 8.2e-279
Identity = 487/601 (81.03%), Postives = 529/601 (88.02%), Query Frame = 0

Query: 1   MQAHRFSPSSGFI-KRPLYLWNLMIRSSVNGGFFAETLETYSFMRHSGIHGNNFTFPLLL 60
           MQ HRFS SS  I K+PLYLWNL IRSSVNGGFFA+TLETYSFMR SGIHGNNFTFPLLL
Sbjct: 1   MQIHRFSSSSTLITKKPLYLWNLTIRSSVNGGFFAQTLETYSFMRQSGIHGNNFTFPLLL 60

Query: 61  KACANLASIGDGTMLHAHLIRVGFETDIFVQTSLVDMYSKCFDLASSRQVFDEMSMRSVI 120
           KACANLASIGDGTMLHAHLI VGFE+D+FVQTSLVDMYSK  DL +SRQVFDE S RSVI
Sbjct: 61  KACANLASIGDGTMLHAHLIHVGFESDVFVQTSLVDMYSKISDLRASRQVFDETSTRSVI 120

Query: 121 SWNSMIAAYSRAFRVNEGFKLFREMWGFGFEPNSSTFVSLLSGFANPVHGSLFQCLLIQG 180
            WNSMIAAYSR FRVNE  KLFREM G GFEPNSSTFVSLLSGFA+P HGSLFQ  L+ G
Sbjct: 121 FWNSMIAAYSRGFRVNEALKLFREMLGGGFEPNSSTFVSLLSGFADPTHGSLFQGRLLHG 180

Query: 181 CLTKFRLQNDTPVANSLMKMYVNFGQIDAARSVFYAIGGKTVISWTIMLGGYLKSGAVAE 240
            +TKF+  +DTPV NSL++MYVNFGQID+A SVFYAI  KTVISWTIMLGGYLK+GAVA+
Sbjct: 181 FMTKFQFHDDTPVQNSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAK 240

Query: 241 VFRIFSQMRLNNVVLDKVVFVDIISSCIQLGNLLLASSLHSLLLKIGLNSEDPIGCLLIS 300
           VF  FSQMR NNVVLDK VFVDIISSCIQLGNL L SSLHSLLLK  L  EDPIGCLLIS
Sbjct: 241 VFETFSQMRQNNVVLDKFVFVDIISSCIQLGNLSLGSSLHSLLLKTALKYEDPIGCLLIS 300

Query: 301 MYSKCGDHLSARAVFDMLPEKGIFLWTSVISGYANAGYPGEALHLFTMATQNNIRPNGAM 360
           MYSKCGD LSARAVFD+L EK I+ WTS+ISGYANAGYP EAL LFTMATQNN+RPNGAM
Sbjct: 301 MYSKCGDLLSARAVFDLLSEKSIYSWTSMISGYANAGYPREALSLFTMATQNNVRPNGAM 360

Query: 361 LATAVSACADSGSLSMCKEMEAYIQLNGVAVDCQVSTSLIHMYCKCESIKKAEGVFKSMI 420
           LATA+SACAD GSLSM +E+EA+IQ +G+A D QVSTSLIH+YCK  S +KAE VF SMI
Sbjct: 361 LATAISACADLGSLSMLREIEAFIQQDGLASDYQVSTSLIHLYCKFGSFEKAEKVFSSMI 420

Query: 421 SRDLAAWSAMMNGYAVYGMGEEAINLFHEMRRTGIKPDASVYASILLACSHSGLVEDGLN 480
            RDLAAWS+MMNGYA++GMGE+ +NLFHEM+R+GIKPD SVYASILLACSHSGLVEDGL 
Sbjct: 421 HRDLAAWSSMMNGYAMHGMGEKTMNLFHEMQRSGIKPDGSVYASILLACSHSGLVEDGLE 480

Query: 481 HFKNMQLDFGIEPTVEHYTCLVDILSRAGHLELALNAIQEMPAQFQAQAWVPFLSACRTY 540
           HFKNMQLD+GI P + HYTCLVDILSRAGHLELALN IQEMP QFQ+QAW PFLSACRTY
Sbjct: 481 HFKNMQLDYGIVPNMVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSACRTY 540

Query: 541 CDVELGEVANKNISGSNPGNPVNHVLVANLYTSVGKWKEAAVVRSLIGDKGLVKEPGCSQ 600
           CDVELGEVAN+ +  SNP NPVNHVL+ANLYTS+GKWKEAA VRSLI DKGLVK+PGCSQ
Sbjct: 541 CDVELGEVANRCLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKQPGCSQ 600

BLAST of Moc03g31300 vs. ExPASy TrEMBL
Match: A0A1S3BDP0 (pentatricopeptide repeat-containing protein At2g29760, chloroplastic-like OS=Cucumis melo OX=3656 GN=LOC103488899 PE=4 SV=1)

HSP 1 Score: 964.9 bits (2493), Expect = 1.5e-277
Identity = 485/601 (80.70%), Postives = 528/601 (87.85%), Query Frame = 0

Query: 1   MQAHRFSPSSGFI-KRPLYLWNLMIRSSVNGGFFAETLETYSFMRHSGIHGNNFTFPLLL 60
           MQ HRFS SS  I K+PLYLWNL IRSSVNGGFFA+TLETYSFMR SGIHGNNFTFPLLL
Sbjct: 1   MQIHRFSSSSTLITKKPLYLWNLTIRSSVNGGFFAQTLETYSFMRQSGIHGNNFTFPLLL 60

Query: 61  KACANLASIGDGTMLHAHLIRVGFETDIFVQTSLVDMYSKCFDLASSRQVFDEMSMRSVI 120
           KACANLASIGDGTMLHAHLI VGFE+D+FVQTSLVDMYSK  DL +SRQVFDE S RSVI
Sbjct: 61  KACANLASIGDGTMLHAHLIHVGFESDVFVQTSLVDMYSKISDLRASRQVFDETSTRSVI 120

Query: 121 SWNSMIAAYSRAFRVNEGFKLFREMWGFGFEPNSSTFVSLLSGFANPVHGSLFQCLLIQG 180
            WNSMIAAYSR FRVNE  KLFREM G GFEPNSSTFVSLLSGFA+P HGSLFQ  L+ G
Sbjct: 121 FWNSMIAAYSRGFRVNEALKLFREMLGGGFEPNSSTFVSLLSGFADPTHGSLFQGRLLHG 180

Query: 181 CLTKFRLQNDTPVANSLMKMYVNFGQIDAARSVFYAIGGKTVISWTIMLGGYLKSGAVAE 240
            +TKF+  +DTPV NSL++MYVNFGQID+A SVFYAI  KTVISWTIMLGGYLK+GAVA+
Sbjct: 181 FMTKFQFHDDTPVQNSLVQMYVNFGQIDSACSVFYAISEKTVISWTIMLGGYLKAGAVAK 240

Query: 241 VFRIFSQMRLNNVVLDKVVFVDIISSCIQLGNLLLASSLHSLLLKIGLNSEDPIGCLLIS 300
           VF  FSQMR NNVVLDK VFVDIISSCIQLGNL L SSLHSLLLK  L  +DPIGCLLIS
Sbjct: 241 VFETFSQMRQNNVVLDKFVFVDIISSCIQLGNLSLGSSLHSLLLKTALKYQDPIGCLLIS 300

Query: 301 MYSKCGDHLSARAVFDMLPEKGIFLWTSVISGYANAGYPGEALHLFTMATQNNIRPNGAM 360
           MYSKCGD LSARAVFD+L EK I+ WTS+IS YANAGYP EAL LFTMATQNN+RPNGAM
Sbjct: 301 MYSKCGDLLSARAVFDLLSEKSIYSWTSMISEYANAGYPREALSLFTMATQNNVRPNGAM 360

Query: 361 LATAVSACADSGSLSMCKEMEAYIQLNGVAVDCQVSTSLIHMYCKCESIKKAEGVFKSMI 420
           LATA+SACAD GSLSM +E+EA+IQ +G+A D QVSTSLIH+YCK  S +KAE VF SMI
Sbjct: 361 LATAISACADLGSLSMLREIEAFIQQDGLASDYQVSTSLIHLYCKFGSFEKAEKVFSSMI 420

Query: 421 SRDLAAWSAMMNGYAVYGMGEEAINLFHEMRRTGIKPDASVYASILLACSHSGLVEDGLN 480
            RDLAAWS+MMNGYA++GMGE+ +NLFHEM+R+GIKPD SVYASILLACSHSGLVEDGL 
Sbjct: 421 HRDLAAWSSMMNGYAMHGMGEKTMNLFHEMQRSGIKPDGSVYASILLACSHSGLVEDGLQ 480

Query: 481 HFKNMQLDFGIEPTVEHYTCLVDILSRAGHLELALNAIQEMPAQFQAQAWVPFLSACRTY 540
           HFKNMQLD+GI P + HYTCLVDILSRAGHLELALN IQEMP QFQ+QAW PFLSACRTY
Sbjct: 481 HFKNMQLDYGIVPNMVHYTCLVDILSRAGHLELALNTIQEMPTQFQSQAWAPFLSACRTY 540

Query: 541 CDVELGEVANKNISGSNPGNPVNHVLVANLYTSVGKWKEAAVVRSLIGDKGLVKEPGCSQ 600
           CDVELGEVAN+ +  SNP NPVNHVL+ANLYTS+GKWKEAA VRSLI DKGLVK+PGCSQ
Sbjct: 541 CDVELGEVANRCLLSSNPRNPVNHVLMANLYTSMGKWKEAAKVRSLIDDKGLVKQPGCSQ 600

BLAST of Moc03g31300 vs. ExPASy TrEMBL
Match: A0A6P4ADT9 (pentatricopeptide repeat-containing protein At4g21065-like OS=Ziziphus jujuba OX=326968 GN=LOC107419613 PE=3 SV=1)

HSP 1 Score: 688.0 bits (1774), Expect = 3.6e-194
Identity = 334/589 (56.71%), Postives = 436/589 (74.02%), Query Frame = 0

Query: 13  IKRPLYLWNLMIRSSVNGGFFAETLETYSFMRHSGIHGNNFTFPLLLKACANLASIGDGT 72
           +KRPL+LWNLMIR S+N G F+ TL+ Y+ M H+G+HGN+FTFPL+ KAC+NL SI    
Sbjct: 3   LKRPLFLWNLMIRDSINHGLFSHTLQLYASMFHTGLHGNSFTFPLVFKACSNLTSIDFAI 62

Query: 73  MLHAHLIRVGFETDIFVQTSLVDMYSKCFDLASSRQVFDEMSMRSVISWNSMIAAYSRAF 132
            LH+H+ R GF  D+FVQT+L+DMYS C  L SSR+VFDEM MRS++SWNS+I+AYSRAF
Sbjct: 63  QLHSHVFRNGFHADLFVQTALIDMYSSCSRLGSSRKVFDEMPMRSLVSWNSIISAYSRAF 122

Query: 133 RVNEGFKLFREMWGFGFEPNSSTFVSLLSGFANPVHGSLFQCLLIQGCLTKFRLQN-DTP 192
           RVNE F L +E+W  G +P+SSTFVS+LSG  +P + SLF CL I GC  K  L N + P
Sbjct: 123 RVNEAFLLLKEVWVLGLQPSSSTFVSILSGCCHPDNHSLFHCLSIHGCAIKLGLTNCEIP 182

Query: 193 VANSLMKMYVNFGQIDAARSVFYAIGGKTVISWTIMLGGYLKSGAVAEVFRIFSQMRLNN 252
           +ANSL+  Y++FGQ+D AR +F  I  K++ISWT ++GGY + G V E F +F+QMR  +
Sbjct: 183 LANSLLNAYIHFGQMDRARFIFNNIEEKSLISWTTIIGGYFRVGNVDEAFSLFNQMRQTS 242

Query: 253 VVLDKVVFVDIISSCIQLGNLLLASSLHSLLLKIGLNSEDPIGCLLISMYSKCGDHLSAR 312
           + LD V+FV ++S C Q GN++LASS+HSL+LK G + E+PI  LL++MY+ CGD +SAR
Sbjct: 243 LSLDSVLFVILVSGCAQEGNIILASSVHSLVLKAGSDDEEPINHLLVTMYANCGDLVSAR 302

Query: 313 AVFDMLPEKGIFLWTSVISGYANAGYPGEALHLFTMATQNNIRPNGAMLATAVSACADSG 372
             F M  ++ I LWTS+I GY + GYP EA +LF        +P GA LA  +SA AD  
Sbjct: 303 KTFHMANDRSISLWTSMIGGYTHLGYPEEAFNLFRKLLSTATKPTGATLAIILSAYADLQ 362

Query: 373 SLSMCKEMEAYIQLNGVAVDCQVSTSLIHMYCKCESIKKAEGVFKSMISRDLAAWSAMMN 432
           SLSM KE+E YI +NG+  D +V TSLIHM+C+C +IKKA  +F+ + ++DL  WS+M+N
Sbjct: 363 SLSMGKEIEEYILMNGLGSDTRVQTSLIHMFCRCGAIKKARELFERVTNKDLVVWSSMIN 422

Query: 433 GYAVYGMGEEAINLFHEMRRTGIKPDASVYASILLACSHSGLVEDGLNHFKNMQLDFGIE 492
           GYA +GMGEEA++LFH M+ +GIKPD+ VY SIL ACSHSGLV DG+ +F +MQ DFGI+
Sbjct: 423 GYATHGMGEEALSLFHNMQSSGIKPDSVVYKSILTACSHSGLVADGMKYFHSMQKDFGIQ 482

Query: 493 PTVEHYTCLVDILSRAGHLELALNAIQEMPAQFQAQAWVPFLSACRTYCDVELGEVANKN 552
           PT EHY CLVD+L RAG L LA+  IQEMP + QA AW P LSACRTYC++ELGE+A K 
Sbjct: 483 PTSEHYACLVDLLGRAGQLNLAVRIIQEMPVEEQALAWGPLLSACRTYCNIELGELAAKK 542

Query: 553 ISGSNPGNPVNHVLVANLYTSVGKWKEAAVVRSLIGDKGLVKEPGCSQL 601
           +   NP +  N VLVANLYTSVGKW++AA  R LI ++ L+KE G S +
Sbjct: 543 LLDLNPESASNCVLVANLYTSVGKWEKAATTRRLIKEEQLIKERGWSHI 591

BLAST of Moc03g31300 vs. TAIR 10
Match: AT5G16860.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 373.2 bits (957), Expect = 3.8e-103
Identity = 224/645 (34.73%), Postives = 338/645 (52.40%), Query Frame = 0

Query: 5   RFSPSSGFIKRPLYLWNLMIRSSVNGGFFAETLETYSFMRHSGIHGNNFTFPLLLKACAN 64
           RF PS   +    Y WN +IRS  + G   + L  +  M       +N+TFP + KAC  
Sbjct: 84  RFPPSDAGV----YHWNSLIRSYGDNGCANKCLYLFGLMHSLSWTPDNYTFPFVFKACGE 143

Query: 65  LASIGDGTMLHAHLIRVGFETDIFVQTSLVDMYSKCFDLASSRQVFDEMSMRSVISWNSM 124
           ++S+  G   HA  +  GF +++FV  +LV MYS+C  L+ +R+VFDEMS+  V+SWNS+
Sbjct: 144 ISSVRCGESAHALSLVTGFISNVFVGNALVAMYSRCRSLSDARKVFDEMSVWDVVSWNSI 203

Query: 125 IAAYSRAFRVNEGFKLFREMWG-FGFEPNSSTFVSLLSGFAN-PVH--GSLFQCLLIQGC 184
           I +Y++  +     ++F  M   FG  P++ T V++L   A+   H  G    C  +   
Sbjct: 204 IESYAKLGKPKVALEMFSRMTNEFGCRPDNITLVNVLPPCASLGTHSLGKQLHCFAV--- 263

Query: 185 LTKFRLQNDTPVANSLMKMYVNFGQIDAARSVFYAIGGKTVISWTIMLGGYLKSGAVAEV 244
            T   +QN   V N L+ MY   G +D A +VF  +  K V+SW  M+ GY + G   + 
Sbjct: 264 -TSEMIQN-MFVGNCLVDMYAKCGMMDEANTVFSNMSVKDVVSWNAMVAGYSQIGRFEDA 323

Query: 245 FRIFSQMRLNNVVLD-----------------------------------KVVFVDIISS 304
            R+F +M+   + +D                                   +V  + ++S 
Sbjct: 324 VRLFEKMQEEKIKMDVVTWSAAISGYAQRGLGYEALGVCRQMLSSGIKPNEVTLISVLSG 383

Query: 305 CIQLGNLLLASSLHSLLLKI-------GLNSEDPIGCLLISMYSKCGDHLSARAVFDML- 364
           C  +G L+    +H   +K        G   E+ +   LI MY+KC    +ARA+FD L 
Sbjct: 384 CASVGALMHGKEIHCYAIKYPIDLRKNGHGDENMVINQLIDMYAKCKKVDTARAMFDSLS 443

Query: 365 -PEKGIFLWTSVISGYANAGYPGEALHLFTMATQNN--IRPNGAMLATAVSACADSGSLS 424
             E+ +  WT +I GY+  G   +AL L +   + +   RPN   ++ A+ ACA   +L 
Sbjct: 444 PKERDVVTWTVMIGGYSQHGDANKALELLSEMFEEDCQTRPNAFTISCALVACASLAALR 503

Query: 425 MCKEMEAYIQLNGV-AVDCQVSTSLIHMYCKCESIKKAEGVFKSMISRDLAAWSAMMNGY 484
           + K++ AY   N   AV   VS  LI MY KC SI  A  VF +M++++   W+++M GY
Sbjct: 504 IGKQIHAYALRNQQNAVPLFVSNCLIDMYAKCGSISDARLVFDNMMAKNEVTWTSLMTGY 563

Query: 485 AVYGMGEEAINLFHEMRRTGIKPDASVYASILLACSHSGLVEDGLNHFKNMQLDFGIEPT 544
            ++G GEEA+ +F EMRR G K D      +L ACSHSG+++ G+ +F  M+  FG+ P 
Sbjct: 564 GMHGYGEEALGIFDEMRRIGFKLDGVTLLVVLYACSHSGMIDQGMEYFNRMKTVFGVSPG 623

Query: 545 VEHYTCLVDILSRAGHLELALNAIQEMPAQFQAQAWVPFLSACRTYCDVELGEVANKNIS 599
            EHY CLVD+L RAG L  AL  I+EMP +     WV FLS CR +  VELGE A + I+
Sbjct: 624 PEHYACLVDLLGRAGRLNAALRLIEEMPMEPPPVVWVAFLSCCRIHGKVELGEYAAEKIT 683

BLAST of Moc03g31300 vs. TAIR 10
Match: AT2G03380.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 355.1 bits (910), Expect = 1.1e-97
Identity = 202/585 (34.53%), Postives = 318/585 (54.36%), Query Frame = 0

Query: 18  YLWNLMIRSSVNGGFFAETLETYSFMRHSGIHGNNFTFPLLLKACANLASIGDGTMLHAH 77
           YLW +M+R         E ++ Y  +   G   ++  F   LKAC  L  + +G  +H  
Sbjct: 108 YLWKVMLRCYCLNKESVEVVKLYDLLMKHGFRYDDIVFSKALKACTELQDLDNGKKIHCQ 167

Query: 78  LIRVGFETDIFVQTSLVDMYSKCFDLASSRQVFDEMSMRSVISWNSMIAAYSRAFRVNEG 137
           L++V    D  V T L+DMY+KC ++ S+ +VF+++++R+V+ W SMIA Y +     EG
Sbjct: 168 LVKVP-SFDNVVLTGLLDMYAKCGEIKSAHKVFNDITLRNVVCWTSMIAGYVKNDLCEEG 227

Query: 138 FKLFREMWGFGFEPNSSTFVSLLSGFANPVHGSLFQCLLIQGCLTKFRLQNDTPVANSLM 197
             LF  M       N  T+ +L+   A     +L Q     GCL K  ++  + +  SL+
Sbjct: 228 LVLFNRMRENNVLGNEYTYGTLI--MACTKLSALHQGKWFHGCLVKSGIELSSCLVTSLL 287

Query: 198 KMYVNFGQIDAARSVFYAIGGKTVISWTIMLGGYLKSGAVAEVFRIFSQMRLNNVVLDKV 257
            MYV  G I  AR VF       ++ WT M+ GY  +G+V E   +F +M+   +  + V
Sbjct: 288 DMYVKCGDISNARRVFNEHSHVDLVMWTAMIVGYTHNGSVNEALSLFQKMKGVEIKPNCV 347

Query: 258 VFVDIISSCIQLGNLLLASSLHSLLLKIGLNSEDPIGCLLISMYSKCGDHLSARAVFDML 317
               ++S C  + NL L  S+H L +K+G+  +  +   L+ MY+KC  +  A+ VF+M 
Sbjct: 348 TIASVLSGCGLIENLELGRSVHGLSIKVGI-WDTNVANALVHMYAKCYQNRDAKYVFEME 407

Query: 318 PEKGIFLWTSVISGYANAGYPGEALHLFTMATQNNIRPNGAMLATAVSACADSGSLSMCK 377
            EK I  W S+ISG++  G   EAL LF      ++ PNG  +A+  SACA  GSL++  
Sbjct: 408 SEKDIVAWNSIISGFSQNGSIHEALFLFHRMNSESVTPNGVTVASLFSACASLGSLAVGS 467

Query: 378 EMEAY-IQLNGVA-VDCQVSTSLIHMYCKCESIKKAEGVFKSMISRDLAAWSAMMNGYAV 437
            + AY ++L  +A     V T+L+  Y KC   + A  +F ++  ++   WSAM+ GY  
Sbjct: 468 SLHAYSVKLGFLASSSVHVGTALLDFYAKCGDPQSARLIFDTIEEKNTITWSAMIGGYGK 527

Query: 438 YGMGEEAINLFHEMRRTGIKPDASVYASILLACSHSGLVEDGLNHFKNMQLDFGIEPTVE 497
            G    ++ LF EM +   KP+ S + SIL AC H+G+V +G  +F +M  D+   P+ +
Sbjct: 528 QGDTIGSLELFEEMLKKQQKPNESTFTSILSACGHTGMVNEGKKYFSSMYKDYNFTPSTK 587

Query: 498 HYTCLVDILSRAGHLELALNAIQEMPAQFQAQAWVPFLSACRTYCDVELGEVANKNISGS 557
           HYTC+VD+L+RAG LE AL+ I++MP Q   + +  FL  C  +   +LGE+  K +   
Sbjct: 588 HYTCMVDMLARAGELEQALDIIEKMPIQPDVRCFGAFLHGCGMHSRFDLGEIVIKKMLDL 647

Query: 558 NPGNPVNHVLVANLYTSVGKWKEAAVVRSLIGDKGLVKEPGCSQL 601
           +P +   +VLV+NLY S G+W +A  VR+L+  +GL K  G S +
Sbjct: 648 HPDDASYYVLVSNLYASDGRWNQAKEVRNLMKQRGLSKIAGHSTM 688

BLAST of Moc03g31300 vs. TAIR 10
Match: AT4G18750.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 354.4 bits (908), Expect = 1.8e-97
Identity = 199/587 (33.90%), Postives = 315/587 (53.66%), Query Frame = 0

Query: 13  IKRPLYLWNLMIRSSVNGGFFAETLETYSFMRHSGIHGNNFTFPLLLKACANLASIGDGT 72
           I++ L+ WN+++      G F+ ++  +  M  SG+  +++TF  + K+ ++L S+  G 
Sbjct: 157 IEKALF-WNILMNELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGGE 216

Query: 73  MLHAHLIRVGFETDIFVQTSLVDMYSKCFDLASSRQVFDEMSMRSVISWNSMIAAYSRAF 132
            LH  +++ GF     V  SLV  Y K   + S+R+VFDEM+ R VISWNS+I  Y    
Sbjct: 217 QLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNG 276

Query: 133 RVNEGFKLFREMWGFGFEPNSSTFVSLLSGFANPVHGSLFQCLLIQGCLTKFRLQNDTPV 192
              +G  +F +M   G E + +T VS+ +G A+    SL + +   G   K     +   
Sbjct: 277 LAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIG--VKACFSREDRF 336

Query: 193 ANSLMKMYVNFGQIDAARSVFYAIGGKTVISWTIMLGGYLKSGAVAEVFRIFSQMRLNNV 252
            N+L+ MY   G +D+A++VF  +  ++V+S+T M+ GY + G   E  ++F +M    +
Sbjct: 337 CNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGI 396

Query: 253 VLDKVVFVDIISSCIQLGNLLLASSLHSLLLKIGLNSEDPIGCLLISMYSKCGDHLSARA 312
             D      +++ C +   L     +H  + +  L  +  +   L+ MY+KCG    A  
Sbjct: 397 SPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAEL 456

Query: 313 VFDMLPEKGIFLWTSVISGYANAGYPGEALHLFT-MATQNNIRPNGAMLATAVSACADSG 372
           VF  +  K I  W ++I GY+   Y  EAL LF  +  +    P+   +A  + ACA   
Sbjct: 457 VFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLS 516

Query: 373 SLSMCKEMEAYIQLNGVAVDCQVSTSLIHMYCKCESIKKAEGVFKSMISRDLAAWSAMMN 432
           +    +E+  YI  NG   D  V+ SL+ MY KC ++  A  +F  + S+DL +W+ M+ 
Sbjct: 517 AFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIA 576

Query: 433 GYAVYGMGEEAINLFHEMRRTGIKPDASVYASILLACSHSGLVEDGLNHFKNMQLDFGIE 492
           GY ++G G+EAI LF++MR+ GI+ D   + S+L ACSHSGLV++G   F  M+ +  IE
Sbjct: 577 GYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIE 636

Query: 493 PTVEHYTCLVDILSRAGHLELALNAIQEMPAQFQAQAWVPFLSACRTYCDVELGEVANKN 552
           PTVEHY C+VD+L+R G L  A   I+ MP    A  W   L  CR + DV+L E   + 
Sbjct: 637 PTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEK 696

Query: 553 ISGSNPGNPVNHVLVANLYTSVGKWKEAAVVRSLIGDKGLVKEPGCS 599
           +    P N   +VL+AN+Y    KW++   +R  IG +GL K PGCS
Sbjct: 697 VFELEPENTGYYVLMANIYAEAEKWEQVKRLRKRIGQRGLRKNPGCS 740

BLAST of Moc03g31300 vs. TAIR 10
Match: AT3G22690.1 (CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT2G29760.1); Has 49784 Blast hits to 14716 proteins in 280 species: Archae - 2; Bacteria - 10; Metazoa - 107; Fungi - 167; Plants - 48594; Viruses - 0; Other Eukaryotes - 904 (source: NCBI BLink). )

HSP 1 Score: 346.3 bits (887), Expect = 5.0e-95
Identity = 203/616 (32.95%), Postives = 319/616 (51.79%), Query Frame = 0

Query: 18  YLWNLMIRSSVNGGFFAETLETYSFMRHSGIHGNNFTFPLLLKACANLASIGDGTMLHAH 77
           +++N +IR   + G   E +  +  M +SGI  + +TFP  L ACA   + G+G  +H  
Sbjct: 100 FMYNSLIRGYASSGLCNEAILLFLRMMNSGISPDKYTFPFGLSACAKSRAKGNGIQIHGL 159

Query: 78  LIRVGFETDIFVQTSLVDMYSKCFDLASSRQVFDEMSMRSVISWNSMIAAYSRAFRVNEG 137
           ++++G+  D+FVQ SLV  Y++C +L S+R+VFDEMS R+V+SW SMI  Y+R     + 
Sbjct: 160 IVKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVVSWTSMICGYARRDFAKDA 219

Query: 138 FKL-FREMWGFGFEPNSSTFVSLLSGFANPVHGSLFQCLLIQGCLTKFRLQNDTPVANSL 197
             L FR +      PNS T V ++S  A      L     +   +    ++ +  + ++L
Sbjct: 220 VDLFFRMVRDEEVTPNSVTMVCVISACAK--LEDLETGEKVYAFIRNSGIEVNDLMVSAL 279

Query: 198 MKMYVNFGQIDAARSVFYAIGGKTVISWTIMLGGYLKSGAVAEVFRIFSQMRLNNVVLDK 257
           + MY+    ID A+ +F   G   +     M   Y++ G   E   +F+ M  + V  D+
Sbjct: 280 VDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDSGVRPDR 339

Query: 258 VVFVDIISSCIQLGNLLLASSLHSLLLKIGLNSEDPIGCLLISMYSKCG---------DH 317
           +  +  ISSC QL N+L   S H  +L+ G  S D I   LI MY KC          D 
Sbjct: 340 ISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHRQDTAFRIFDR 399

Query: 318 LSARAV----------------------FDMLPEKGIFLWTSVISGYANAGYPGEALHLF 377
           +S + V                      F+ +PEK I  W ++ISG        EA+ +F
Sbjct: 400 MSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGSLFEEAIEVF 459

Query: 378 -TMATQNNIRPNGAMLATAVSACADSGSLSMCKEMEAYIQLNGVAVDCQVSTSLIHMYCK 437
            +M +Q  +  +G  + +  SAC   G+L + K +  YI+ NG+ +D ++ T+L+ M+ +
Sbjct: 460 CSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRLGTTLVDMFSR 519

Query: 438 CESIKKAEGVFKSMISRDLAAWSAMMNGYAVYGMGEEAINLFHEMRRTGIKPDASVYASI 497
           C   + A  +F S+ +RD++AW+A +   A+ G  E AI LF +M   G+KPD   +   
Sbjct: 520 CGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLKPDGVAFVGA 579

Query: 498 LLACSHSGLVEDGLNHFKNMQLDFGIEPTVEHYTCLVDILSRAGHLELALNAIQEMPAQF 557
           L ACSH GLV+ G   F +M    G+ P   HY C+VD+L RAG LE A+  I++MP + 
Sbjct: 580 LTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQLIEDMPMEP 639

Query: 558 QAQAWVPFLSACRTYCDVELGEVANKNISGSNPGNPVNHVLVANLYTSVGKWKEAAVVRS 601
               W   L+ACR   +VE+   A + I    P    ++VL++N+Y S G+W + A VR 
Sbjct: 640 NDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAGRWNDMAKVRL 699

BLAST of Moc03g31300 vs. TAIR 10
Match: AT3G22690.2 (INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification; LOCATED IN: chloroplast; EXPRESSED IN: 13 plant structures; EXPRESSED DURING: LP.04 four leaves visible, 4 anthesis, petal differentiation and expansion stage, E expanded cotyledon stage, D bilateral stage; CONTAINS InterPro DOMAIN/s: Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT2G29760.1). )

HSP 1 Score: 346.3 bits (887), Expect = 5.0e-95
Identity = 203/616 (32.95%), Postives = 319/616 (51.79%), Query Frame = 0

Query: 18  YLWNLMIRSSVNGGFFAETLETYSFMRHSGIHGNNFTFPLLLKACANLASIGDGTMLHAH 77
           +++N +IR   + G   E +  +  M +SGI  + +TFP  L ACA   + G+G  +H  
Sbjct: 100 FMYNSLIRGYASSGLCNEAILLFLRMMNSGISPDKYTFPFGLSACAKSRAKGNGIQIHGL 159

Query: 78  LIRVGFETDIFVQTSLVDMYSKCFDLASSRQVFDEMSMRSVISWNSMIAAYSRAFRVNEG 137
           ++++G+  D+FVQ SLV  Y++C +L S+R+VFDEMS R+V+SW SMI  Y+R     + 
Sbjct: 160 IVKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVVSWTSMICGYARRDFAKDA 219

Query: 138 FKL-FREMWGFGFEPNSSTFVSLLSGFANPVHGSLFQCLLIQGCLTKFRLQNDTPVANSL 197
             L FR +      PNS T V ++S  A      L     +   +    ++ +  + ++L
Sbjct: 220 VDLFFRMVRDEEVTPNSVTMVCVISACAK--LEDLETGEKVYAFIRNSGIEVNDLMVSAL 279

Query: 198 MKMYVNFGQIDAARSVFYAIGGKTVISWTIMLGGYLKSGAVAEVFRIFSQMRLNNVVLDK 257
           + MY+    ID A+ +F   G   +     M   Y++ G   E   +F+ M  + V  D+
Sbjct: 280 VDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDSGVRPDR 339

Query: 258 VVFVDIISSCIQLGNLLLASSLHSLLLKIGLNSEDPIGCLLISMYSKCG---------DH 317
           +  +  ISSC QL N+L   S H  +L+ G  S D I   LI MY KC          D 
Sbjct: 340 ISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHRQDTAFRIFDR 399

Query: 318 LSARAV----------------------FDMLPEKGIFLWTSVISGYANAGYPGEALHLF 377
           +S + V                      F+ +PEK I  W ++ISG        EA+ +F
Sbjct: 400 MSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGSLFEEAIEVF 459

Query: 378 -TMATQNNIRPNGAMLATAVSACADSGSLSMCKEMEAYIQLNGVAVDCQVSTSLIHMYCK 437
            +M +Q  +  +G  + +  SAC   G+L + K +  YI+ NG+ +D ++ T+L+ M+ +
Sbjct: 460 CSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRLGTTLVDMFSR 519

Query: 438 CESIKKAEGVFKSMISRDLAAWSAMMNGYAVYGMGEEAINLFHEMRRTGIKPDASVYASI 497
           C   + A  +F S+ +RD++AW+A +   A+ G  E AI LF +M   G+KPD   +   
Sbjct: 520 CGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLKPDGVAFVGA 579

Query: 498 LLACSHSGLVEDGLNHFKNMQLDFGIEPTVEHYTCLVDILSRAGHLELALNAIQEMPAQF 557
           L ACSH GLV+ G   F +M    G+ P   HY C+VD+L RAG LE A+  I++MP + 
Sbjct: 580 LTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQLIEDMPMEP 639

Query: 558 QAQAWVPFLSACRTYCDVELGEVANKNISGSNPGNPVNHVLVANLYTSVGKWKEAAVVRS 601
               W   L+ACR   +VE+   A + I    P    ++VL++N+Y S G+W + A VR 
Sbjct: 640 NDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAGRWNDMAKVRL 699

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022137264.10.0e+00100.00pentatricopeptide repeat-containing protein At2g13600-like [Momordica charantia][more]
XP_004137641.13.9e-28382.20pentatricopeptide repeat-containing protein At3g12770 [Cucumis sativus][more]
XP_038893873.17.3e-28282.20LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein DOT4, chloropla... [more]
TYJ99083.11.7e-27881.03pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa][more]
XP_008446053.13.2e-27780.70PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic-... [more]
Match NameE-valueIdentityDescription
Q9LFL55.4e-10234.73Pentatricopeptide repeat-containing protein At5g16860 OS=Arabidopsis thaliana OX... [more]
Q9ZQ741.5e-9634.53Pentatricopeptide repeat-containing protein At2g03380, mitochondrial OS=Arabidop... [more]
Q9SN392.6e-9633.90Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t... [more]
Q9LUJ27.0e-9432.95Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX... [more]
Q9M9E29.2e-9433.11Pentatricopeptide repeat-containing protein At1g15510, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A6J1C6R40.0e+00100.00pentatricopeptide repeat-containing protein At2g13600-like OS=Momordica charanti... [more]
A0A0A0LT911.9e-28382.20Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G043310 PE=4 SV=1[more]
A0A5D3BIG98.2e-27981.03Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3BDP01.5e-27780.70pentatricopeptide repeat-containing protein At2g29760, chloroplastic-like OS=Cuc... [more]
A0A6P4ADT93.6e-19456.71pentatricopeptide repeat-containing protein At4g21065-like OS=Ziziphus jujuba OX... [more]
Match NameE-valueIdentityDescription
AT5G16860.13.8e-10334.73Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G03380.11.1e-9734.53Pentatricopeptide repeat (PPR) superfamily protein [more]
AT4G18750.11.8e-9733.90Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G22690.15.0e-9532.95CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012... [more]
AT3G22690.25.0e-9532.95INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic pro... [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 222..251
e-value: 4.5E-4
score: 20.3
coord: 496..520
e-value: 0.18
score: 12.1
coord: 324..346
e-value: 0.0016
score: 18.6
coord: 297..322
e-value: 0.03
score: 14.6
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 117..165
e-value: 3.5E-11
score: 43.1
coord: 422..468
e-value: 5.2E-9
score: 36.2
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 425..458
e-value: 7.3E-9
score: 33.3
coord: 396..423
e-value: 1.8E-4
score: 19.4
coord: 119..152
e-value: 4.5E-8
score: 30.8
coord: 222..255
e-value: 3.7E-4
score: 18.5
coord: 324..356
e-value: 8.6E-4
score: 17.3
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 321..355
score: 9.558311
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 220..254
score: 9.656963
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 422..456
score: 12.605553
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 117..151
score: 11.849223
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 382..590
e-value: 1.9E-29
score: 105.1
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 173..274
e-value: 7.0E-15
score: 56.8
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 275..381
e-value: 2.5E-15
score: 58.6
coord: 18..172
e-value: 1.3E-28
score: 102.2
NoneNo IPR availablePANTHERPTHR24015:SF1934PPR CONTAINING PLANT-LIKE PROTEINcoord: 14..600
NoneNo IPR availablePANTHERPTHR24015OS07G0578800 PROTEIN-RELATEDcoord: 14..600

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Moc03g31300.1Moc03g31300.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009451 RNA modification
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0004557 alpha-galactosidase activity
molecular_function GO:0005515 protein binding
molecular_function GO:0003723 RNA binding