Moc05g05620 (gene) Bitter gourd (OHB3-1) v2

Overview
NameMoc05g05620
Typegene
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionPentatricopeptide repeat-containing protein
Locationchr5: 3926146 .. 3928778 (-)
RNA-Seq ExpressionMoc05g05620
SyntenyMoc05g05620
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAACGCTCAGCCACTCAACCTCCGCCCTCCCACTTCAATATCACGCATTCTCCACCAGACCCACCACTCTCGCCGCCGCTCTCGCCTCCGCCTCCACTCTCCTACACCTCAAACAAGTCCACGTTCAAATCCTTCGCTCCAAATTCGAACGCTACGATTCCGATTCCCTTCTTTTCAAACTTGTCCTCTCATCTTGTGCTCTCTCCTCCAGCCTCGACTATGCCCTCTCTGTCTTCGATCAAATCCCCGAGCCCAAGACCCGTTTCTGCAATAAGCTCCTGCGCGAATTGTCTCGGGGTCCGAAGCCGGAGAATGCGCTTTTTGTGTACGAGAAGATGAGGGCCGAGGGTCTGAGTCTGGACCGGTTCAGCTTCCCGCCGATTTTGAAAGCGGCTTCTAGGAATCTGTCTTTGAGAACGGGGATGGAGATTCATGGACTTGCGTCGAAGTTGGGGTTTGGTCCAGACCCATTTGTGGAGACGGGGTTGGTGAGAATGTATGCGGCCTGTGGAAGGCTTATGGAAGCACGGTTGGTGTTTGATAAAATGTCTCACAGGGATGTCGTCACTTGGAGCATCATGATTGATGGGTACGCCAACTTCTATTCTGGATTGTAATTGTAACTGCATTTATCAACCAATTTCTTTTCAAAATATGAACTTAAATGTAAAAATTTTGCTCTTTACTTACTAGTGATTGTTGGATTGGCAAAATTCAGTAGTCAAGGTATTGGTAATTTTCCTTAGGCTCTTTATTTGAATGGTACTATTCTAAAAAGCCATGACTTCTATACTTACCATTTTATGATGGATTGTTACAGTTATAACTATGTTATAGTCATGGATATTGTTGCCACTTTTGGTAGCCTTCAAATATGGAAGAACATTATATTTATATTGGCCCCTTACATTGTTCATTTGGGAGAGATGCTACTTATGGCAGTAATTTTTCTATCTCATCCTTCTCTGGTTTAAATGCATAGCTATTTAGAAGAAAAATGACTATGGAACTTATATTTTTCCTGTAACCTCAAGCATACAGGTACTGCATAAGTGGCTGTTATGATCTTGCCTTTCAACTCTTCGAAGAAATGAAGAGAACAGACATGGAACCTGATGAGATGATTCTTTCAACCATTATTTCTGCCTGCGCTCGTGCTGGAAATTTGGATTATGGAACAAGAATACACGAGTTCATTACTAAGAAGAATATTGTCATGGATCCTCACTTACAAAGTGCTCTCATCACGATGTACGCAAGTTGTGGCTCCATGGATTTGGCTTGGGATTTGTATGAAAAGATATCCCCCAAAAACATGGTTGTTTCAACTGCCATGGTTTCTGGGCTTTCAAAATGTGGACAGATTGGTGATGCTCGCTATGTGTTCGATCAAATGGTGGAGAAGGACTTGATATGTTGGAGTGCAATGATTTCTGGATATACTGAGAGTGATTGCCCTCAAGAGGCTCTTGTATTGTTCAAGAAAATGCAACTACTGGGAATAAAACCTGATGTAGTGACCATGTTGAGTGTCATCTCAGCTTGTGCTCATCTTGGTGCATTAGAGCAAGCCAATTGGATCTGTACTTATGTTGATAAAAATGGGTTCGACAAGGCATTATCCGTCAATAATGCACTCATTGATATGTATGCCAAATGTGGGAGTCTAGAAGGAGCGAGGGAAGTTTTTAAAAAGATGCCAAAGAAGAATGTTATATCTTGGACATGTATGATTAATGCTTCTGCAATGCATGGAGATTCTCATAATGCTCTAAACCTATTTCATCAAATGAAGGATGAAAATGTTGAGCCTAATTGGATCACATTTGTTGGGGTGCTTTATGCTTGTAGCCATGGAGGTCTAGTTGAGGAAGGCCGAAAAATATTTCACTCAATGATCAATGACTATGGCATTAGTCCGAAACACGAACACTTCGGTTGCATGGTTGACCTCTTTGGCCGTGCAAATCTTCTGAGAGAAGCTCTTGAGATGATAGAGGCAATGCCCTTTGCTCCTAATGCTATTATTTGGGGATCCCTTATGGCTGCTTGTCAGGTCTATGGCGAGACTGAGTTAGGAGAGTTTGCTGCTAAACAAGTTCTAAAGCTCGAGCCAAATCACGATGGGGCCTTTGTTGTCTTATCGAATGTATACGCTAAAGAAAGGAGATGGGAAGACGTTGGGGAAGTTAGAAAACTAATGAATGAGATGGGCGTTGCCAAAGAGAGAGGATGCAGTAGAGTTGAATTGAACAATGAGGTGCATGAATTTCAAATGGCAGATAGAAAGCACAGGCAAGCAGATCAAATATATCAAAAGTTAGATGAGGTAGTTCAAAAGTTGAAGATGGCTGGTTATACGCCTCGCGTAGATTGCGTTCTTGTTGATTTAGACGAAGAAGAAAGGAAGGAAGCAATCCTCTGGCACAGCGAGAAACTGGCACTTTGCTATGCCCTCATGAACGAAGGGTCACACATTCGCATTATAAAGAACCTTAGGATTTGTGAAGATTGTCATACTTTTATGAAATTAGCCTCTAAGGTTTATGCCAGAGAGATTATCATTAGGGACAGAACTAGATTTCACCATTACAGAGACGGTTCGTGTTCATGTAACGACTATTGGTGA

mRNA sequence

ATGGAAACGCTCAGCCACTCAACCTCCGCCCTCCCACTTCAATATCACGCATTCTCCACCAGACCCACCACTCTCGCCGCCGCTCTCGCCTCCGCCTCCACTCTCCTACACCTCAAACAAGTCCACGTTCAAATCCTTCGCTCCAAATTCGAACGCTACGATTCCGATTCCCTTCTTTTCAAACTTGTCCTCTCATCTTGTGCTCTCTCCTCCAGCCTCGACTATGCCCTCTCTGTCTTCGATCAAATCCCCGAGCCCAAGACCCGTTTCTGCAATAAGCTCCTGCGCGAATTGTCTCGGGGTCCGAAGCCGGAGAATGCGCTTTTTGTGTACGAGAAGATGAGGGCCGAGGGTCTGAGTCTGGACCGGTTCAGCTTCCCGCCGATTTTGAAAGCGGCTTCTAGGAATCTGTCTTTGAGAACGGGGATGGAGATTCATGGACTTGCGTCGAAGTTGGGGTTTGGTCCAGACCCATTTGTGGAGACGGGGTTGGTGAGAATGTATGCGGCCTGTGGAAGGCTTATGGAAGCACGGTTGGTGTTTGATAAAATGTCTCACAGGGATGTCGTCACTTGGAGCATCATGATTGATGGGTACTGCATAAGTGGCTGTTATGATCTTGCCTTTCAACTCTTCGAAGAAATGAAGAGAACAGACATGGAACCTGATGAGATGATTCTTTCAACCATTATTTCTGCCTGCGCTCGTGCTGGAAATTTGGATTATGGAACAAGAATACACGAGTTCATTACTAAGAAGAATATTGTCATGGATCCTCACTTACAAAGTGCTCTCATCACGATGTACGCAAGTTGTGGCTCCATGGATTTGGCTTGGGATTTGTATGAAAAGATATCCCCCAAAAACATGGTTGTTTCAACTGCCATGGTTTCTGGGCTTTCAAAATGTGGACAGATTGGTGATGCTCGCTATGTGTTCGATCAAATGGTGGAGAAGGACTTGATATGTTGGAGTGCAATGATTTCTGGATATACTGAGAGTGATTGCCCTCAAGAGGCTCTTGTATTGTTCAAGAAAATGCAACTACTGGGAATAAAACCTGATGTAGTGACCATGTTGAGTGTCATCTCAGCTTGTGCTCATCTTGGTGCATTAGAGCAAGCCAATTGGATCTGTACTTATGTTGATAAAAATGGGTTCGACAAGGCATTATCCGTCAATAATGCACTCATTGATATGTATGCCAAATGTGGGAGTCTAGAAGGAGCGAGGGAAGTTTTTAAAAAGATGCCAAAGAAGAATGTTATATCTTGGACATGTATGATTAATGCTTCTGCAATGCATGGAGATTCTCATAATGCTCTAAACCTATTTCATCAAATGAAGGATGAAAATGTTGAGCCTAATTGGATCACATTTGTTGGGGTGCTTTATGCTTGTAGCCATGGAGGTCTAGTTGAGGAAGGCCGAAAAATATTTCACTCAATGATCAATGACTATGGCATTAGTCCGAAACACGAACACTTCGGTTGCATGGTTGACCTCTTTGGCCGTGCAAATCTTCTGAGAGAAGCTCTTGAGATGATAGAGGCAATGCCCTTTGCTCCTAATGCTATTATTTGGGGATCCCTTATGGCTGCTTGTCAGGTCTATGGCGAGACTGAGTTAGGAGAGTTTGCTGCTAAACAAGTTCTAAAGCTCGAGCCAAATCACGATGGGGCCTTTGTTGTCTTATCGAATGTATACGCTAAAGAAAGGAGATGGGAAGACGTTGGGGAAGTTAGAAAACTAATGAATGAGATGGGCGTTGCCAAAGAGAGAGGATGCAGTAGAGTTGAATTGAACAATGAGGTGCATGAATTTCAAATGGCAGATAGAAAGCACAGGCAAGCAGATCAAATATATCAAAAGTTAGATGAGGTAGTTCAAAAGTTGAAGATGGCTGGTTATACGCCTCGCGTAGATTGCGTTCTTGTTGATTTAGACGAAGAAGAAAGGAAGGAAGCAATCCTCTGGCACAGCGAGAAACTGGCACTTTGCTATGCCCTCATGAACGAAGGGTCACACATTCGCATTATAAAGAACCTTAGGATTTGTGAAGATTGTCATACTTTTATGAAATTAGCCTCTAAGGTTTATGCCAGAGAGATTATCATTAGGGACAGAACTAGATTTCACCATTACAGAGACGGTTCGTGTTCATGTAACGACTATTGGTGA

Coding sequence (CDS)

ATGGAAACGCTCAGCCACTCAACCTCCGCCCTCCCACTTCAATATCACGCATTCTCCACCAGACCCACCACTCTCGCCGCCGCTCTCGCCTCCGCCTCCACTCTCCTACACCTCAAACAAGTCCACGTTCAAATCCTTCGCTCCAAATTCGAACGCTACGATTCCGATTCCCTTCTTTTCAAACTTGTCCTCTCATCTTGTGCTCTCTCCTCCAGCCTCGACTATGCCCTCTCTGTCTTCGATCAAATCCCCGAGCCCAAGACCCGTTTCTGCAATAAGCTCCTGCGCGAATTGTCTCGGGGTCCGAAGCCGGAGAATGCGCTTTTTGTGTACGAGAAGATGAGGGCCGAGGGTCTGAGTCTGGACCGGTTCAGCTTCCCGCCGATTTTGAAAGCGGCTTCTAGGAATCTGTCTTTGAGAACGGGGATGGAGATTCATGGACTTGCGTCGAAGTTGGGGTTTGGTCCAGACCCATTTGTGGAGACGGGGTTGGTGAGAATGTATGCGGCCTGTGGAAGGCTTATGGAAGCACGGTTGGTGTTTGATAAAATGTCTCACAGGGATGTCGTCACTTGGAGCATCATGATTGATGGGTACTGCATAAGTGGCTGTTATGATCTTGCCTTTCAACTCTTCGAAGAAATGAAGAGAACAGACATGGAACCTGATGAGATGATTCTTTCAACCATTATTTCTGCCTGCGCTCGTGCTGGAAATTTGGATTATGGAACAAGAATACACGAGTTCATTACTAAGAAGAATATTGTCATGGATCCTCACTTACAAAGTGCTCTCATCACGATGTACGCAAGTTGTGGCTCCATGGATTTGGCTTGGGATTTGTATGAAAAGATATCCCCCAAAAACATGGTTGTTTCAACTGCCATGGTTTCTGGGCTTTCAAAATGTGGACAGATTGGTGATGCTCGCTATGTGTTCGATCAAATGGTGGAGAAGGACTTGATATGTTGGAGTGCAATGATTTCTGGATATACTGAGAGTGATTGCCCTCAAGAGGCTCTTGTATTGTTCAAGAAAATGCAACTACTGGGAATAAAACCTGATGTAGTGACCATGTTGAGTGTCATCTCAGCTTGTGCTCATCTTGGTGCATTAGAGCAAGCCAATTGGATCTGTACTTATGTTGATAAAAATGGGTTCGACAAGGCATTATCCGTCAATAATGCACTCATTGATATGTATGCCAAATGTGGGAGTCTAGAAGGAGCGAGGGAAGTTTTTAAAAAGATGCCAAAGAAGAATGTTATATCTTGGACATGTATGATTAATGCTTCTGCAATGCATGGAGATTCTCATAATGCTCTAAACCTATTTCATCAAATGAAGGATGAAAATGTTGAGCCTAATTGGATCACATTTGTTGGGGTGCTTTATGCTTGTAGCCATGGAGGTCTAGTTGAGGAAGGCCGAAAAATATTTCACTCAATGATCAATGACTATGGCATTAGTCCGAAACACGAACACTTCGGTTGCATGGTTGACCTCTTTGGCCGTGCAAATCTTCTGAGAGAAGCTCTTGAGATGATAGAGGCAATGCCCTTTGCTCCTAATGCTATTATTTGGGGATCCCTTATGGCTGCTTGTCAGGTCTATGGCGAGACTGAGTTAGGAGAGTTTGCTGCTAAACAAGTTCTAAAGCTCGAGCCAAATCACGATGGGGCCTTTGTTGTCTTATCGAATGTATACGCTAAAGAAAGGAGATGGGAAGACGTTGGGGAAGTTAGAAAACTAATGAATGAGATGGGCGTTGCCAAAGAGAGAGGATGCAGTAGAGTTGAATTGAACAATGAGGTGCATGAATTTCAAATGGCAGATAGAAAGCACAGGCAAGCAGATCAAATATATCAAAAGTTAGATGAGGTAGTTCAAAAGTTGAAGATGGCTGGTTATACGCCTCGCGTAGATTGCGTTCTTGTTGATTTAGACGAAGAAGAAAGGAAGGAAGCAATCCTCTGGCACAGCGAGAAACTGGCACTTTGCTATGCCCTCATGAACGAAGGGTCACACATTCGCATTATAAAGAACCTTAGGATTTGTGAAGATTGTCATACTTTTATGAAATTAGCCTCTAAGGTTTATGCCAGAGAGATTATCATTAGGGACAGAACTAGATTTCACCATTACAGAGACGGTTCGTGTTCATGTAACGACTATTGGTGA

Protein sequence

METLSHSTSALPLQYHAFSTRPTTLAAALASASTLLHLKQVHVQILRSKFERYDSDSLLFKLVLSSCALSSSLDYALSVFDQIPEPKTRFCNKLLRELSRGPKPENALFVYEKMRAEGLSLDRFSFPPILKAASRNLSLRTGMEIHGLASKLGFGPDPFVETGLVRMYAACGRLMEARLVFDKMSHRDVVTWSIMIDGYCISGCYDLAFQLFEEMKRTDMEPDEMILSTIISACARAGNLDYGTRIHEFITKKNIVMDPHLQSALITMYASCGSMDLAWDLYEKISPKNMVVSTAMVSGLSKCGQIGDARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQLLGIKPDVVTMLSVISACAHLGALEQANWICTYVDKNGFDKALSVNNALIDMYAKCGSLEGAREVFKKMPKKNVISWTCMINASAMHGDSHNALNLFHQMKDENVEPNWITFVGVLYACSHGGLVEEGRKIFHSMINDYGISPKHEHFGCMVDLFGRANLLREALEMIEAMPFAPNAIIWGSLMAACQVYGETELGEFAAKQVLKLEPNHDGAFVVLSNVYAKERRWEDVGEVRKLMNEMGVAKERGCSRVELNNEVHEFQMADRKHRQADQIYQKLDEVVQKLKMAGYTPRVDCVLVDLDEEERKEAILWHSEKLALCYALMNEGSHIRIIKNLRICEDCHTFMKLASKVYAREIIIRDRTRFHHYRDGSCSCNDYW
Homology
BLAST of Moc05g05620 vs. NCBI nr
Match: XP_022138165.1 (pentatricopeptide repeat-containing protein At4g14820 [Momordica charantia])

HSP 1 Score: 1481.5 bits (3834), Expect = 0.0e+00
Identity = 726/726 (100.00%), Postives = 726/726 (100.00%), Query Frame = 0

Query: 1   METLSHSTSALPLQYHAFSTRPTTLAAALASASTLLHLKQVHVQILRSKFERYDSDSLLF 60
           METLSHSTSALPLQYHAFSTRPTTLAAALASASTLLHLKQVHVQILRSKFERYDSDSLLF
Sbjct: 1   METLSHSTSALPLQYHAFSTRPTTLAAALASASTLLHLKQVHVQILRSKFERYDSDSLLF 60

Query: 61  KLVLSSCALSSSLDYALSVFDQIPEPKTRFCNKLLRELSRGPKPENALFVYEKMRAEGLS 120
           KLVLSSCALSSSLDYALSVFDQIPEPKTRFCNKLLRELSRGPKPENALFVYEKMRAEGLS
Sbjct: 61  KLVLSSCALSSSLDYALSVFDQIPEPKTRFCNKLLRELSRGPKPENALFVYEKMRAEGLS 120

Query: 121 LDRFSFPPILKAASRNLSLRTGMEIHGLASKLGFGPDPFVETGLVRMYAACGRLMEARLV 180
           LDRFSFPPILKAASRNLSLRTGMEIHGLASKLGFGPDPFVETGLVRMYAACGRLMEARLV
Sbjct: 121 LDRFSFPPILKAASRNLSLRTGMEIHGLASKLGFGPDPFVETGLVRMYAACGRLMEARLV 180

Query: 181 FDKMSHRDVVTWSIMIDGYCISGCYDLAFQLFEEMKRTDMEPDEMILSTIISACARAGNL 240
           FDKMSHRDVVTWSIMIDGYCISGCYDLAFQLFEEMKRTDMEPDEMILSTIISACARAGNL
Sbjct: 181 FDKMSHRDVVTWSIMIDGYCISGCYDLAFQLFEEMKRTDMEPDEMILSTIISACARAGNL 240

Query: 241 DYGTRIHEFITKKNIVMDPHLQSALITMYASCGSMDLAWDLYEKISPKNMVVSTAMVSGL 300
           DYGTRIHEFITKKNIVMDPHLQSALITMYASCGSMDLAWDLYEKISPKNMVVSTAMVSGL
Sbjct: 241 DYGTRIHEFITKKNIVMDPHLQSALITMYASCGSMDLAWDLYEKISPKNMVVSTAMVSGL 300

Query: 301 SKCGQIGDARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQLLGIKPDVVTML 360
           SKCGQIGDARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQLLGIKPDVVTML
Sbjct: 301 SKCGQIGDARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQLLGIKPDVVTML 360

Query: 361 SVISACAHLGALEQANWICTYVDKNGFDKALSVNNALIDMYAKCGSLEGAREVFKKMPKK 420
           SVISACAHLGALEQANWICTYVDKNGFDKALSVNNALIDMYAKCGSLEGAREVFKKMPKK
Sbjct: 361 SVISACAHLGALEQANWICTYVDKNGFDKALSVNNALIDMYAKCGSLEGAREVFKKMPKK 420

Query: 421 NVISWTCMINASAMHGDSHNALNLFHQMKDENVEPNWITFVGVLYACSHGGLVEEGRKIF 480
           NVISWTCMINASAMHGDSHNALNLFHQMKDENVEPNWITFVGVLYACSHGGLVEEGRKIF
Sbjct: 421 NVISWTCMINASAMHGDSHNALNLFHQMKDENVEPNWITFVGVLYACSHGGLVEEGRKIF 480

Query: 481 HSMINDYGISPKHEHFGCMVDLFGRANLLREALEMIEAMPFAPNAIIWGSLMAACQVYGE 540
           HSMINDYGISPKHEHFGCMVDLFGRANLLREALEMIEAMPFAPNAIIWGSLMAACQVYGE
Sbjct: 481 HSMINDYGISPKHEHFGCMVDLFGRANLLREALEMIEAMPFAPNAIIWGSLMAACQVYGE 540

Query: 541 TELGEFAAKQVLKLEPNHDGAFVVLSNVYAKERRWEDVGEVRKLMNEMGVAKERGCSRVE 600
           TELGEFAAKQVLKLEPNHDGAFVVLSNVYAKERRWEDVGEVRKLMNEMGVAKERGCSRVE
Sbjct: 541 TELGEFAAKQVLKLEPNHDGAFVVLSNVYAKERRWEDVGEVRKLMNEMGVAKERGCSRVE 600

Query: 601 LNNEVHEFQMADRKHRQADQIYQKLDEVVQKLKMAGYTPRVDCVLVDLDEEERKEAILWH 660
           LNNEVHEFQMADRKHRQADQIYQKLDEVVQKLKMAGYTPRVDCVLVDLDEEERKEAILWH
Sbjct: 601 LNNEVHEFQMADRKHRQADQIYQKLDEVVQKLKMAGYTPRVDCVLVDLDEEERKEAILWH 660

Query: 661 SEKLALCYALMNEGSHIRIIKNLRICEDCHTFMKLASKVYAREIIIRDRTRFHHYRDGSC 720
           SEKLALCYALMNEGSHIRIIKNLRICEDCHTFMKLASKVYAREIIIRDRTRFHHYRDGSC
Sbjct: 661 SEKLALCYALMNEGSHIRIIKNLRICEDCHTFMKLASKVYAREIIIRDRTRFHHYRDGSC 720

Query: 721 SCNDYW 727
           SCNDYW
Sbjct: 721 SCNDYW 726

BLAST of Moc05g05620 vs. NCBI nr
Match: XP_038902272.1 (pentatricopeptide repeat-containing protein At4g14820 [Benincasa hispida])

HSP 1 Score: 1331.2 bits (3444), Expect = 0.0e+00
Identity = 643/726 (88.57%), Postives = 687/726 (94.63%), Query Frame = 0

Query: 1   METLSHSTSALPLQYHAFSTRPTTLAAALASASTLLHLKQVHVQILRSKFERYDSDSLLF 60
           METLSHSTS LPLQ H + TRPT L+AAL+SAS+LLHLKQVH QILRSKFE YDS+SLLF
Sbjct: 1   METLSHSTSVLPLQIHTYPTRPTALSAALSSASSLLHLKQVHAQILRSKFECYDSNSLLF 60

Query: 61  KLVLSSCALSSSLDYALSVFDQIPEPKTRFCNKLLRELSRGPKPENALFVYEKMRAEGLS 120
           +L+LSSCAL  SLDYALSVFDQIP+PKTRFCNKLLRELSRG +PE AL +YEKMRAEGLS
Sbjct: 61  ELILSSCALLPSLDYALSVFDQIPQPKTRFCNKLLRELSRGSEPEVALLLYEKMRAEGLS 120

Query: 121 LDRFSFPPILKAASRNLSLRTGMEIHGLASKLGFGPDPFVETGLVRMYAACGRLMEARLV 180
           LDRF FPP+LKAASRNLSLRTGMEIHGLASKLGFG DPFVETGLV+MYAACGR+MEARLV
Sbjct: 121 LDRFCFPPLLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLVKMYAACGRIMEARLV 180

Query: 181 FDKMSHRDVVTWSIMIDGYCISGCYDLAFQLFEEMKRTDMEPDEMILSTIISACARAGNL 240
           FDKMSHRDVVTWSIMIDGYC SGCYDLAFQLFE+MKRTD+EPDEMILST++SACARAGNL
Sbjct: 181 FDKMSHRDVVTWSIMIDGYCSSGCYDLAFQLFEQMKRTDLEPDEMILSTVLSACARAGNL 240

Query: 241 DYGTRIHEFITKKNIVMDPHLQSALITMYASCGSMDLAWDLYEKISPKNMVVSTAMVSGL 300
           D+GT+IHEFITKKNIVMDPHLQSALITMYASCGSMDLAWDL+EKI PKNMVVSTAMVSGL
Sbjct: 241 DFGTKIHEFITKKNIVMDPHLQSALITMYASCGSMDLAWDLHEKIFPKNMVVSTAMVSGL 300

Query: 301 SKCGQIGDARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQLLGIKPDVVTML 360
           +K GQIGDARYVFDQMV KDLICWSAMISGYTESDCPQEAL+LFKKMQ  G+KPDVVTML
Sbjct: 301 AKGGQIGDARYVFDQMVVKDLICWSAMISGYTESDCPQEALILFKKMQQQGMKPDVVTML 360

Query: 361 SVISACAHLGALEQANWICTYVDKNGFDKALSVNNALIDMYAKCGSLEGAREVFKKMPKK 420
           SVISACAHLGAL+QANWI  YVDKNGF KALSVNNALIDMYAKCGSLEGAREVF KMPKK
Sbjct: 361 SVISACAHLGALDQANWIQNYVDKNGFCKALSVNNALIDMYAKCGSLEGAREVFGKMPKK 420

Query: 421 NVISWTCMINASAMHGDSHNALNLFHQMKDENVEPNWITFVGVLYACSHGGLVEEGRKIF 480
           NVISWT MINA AMHGD+H+A++LFHQMK ENVEPNWITFVGVLYACSHGGLVEEGR+IF
Sbjct: 421 NVISWTSMINALAMHGDAHSAMSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIF 480

Query: 481 HSMINDYGISPKHEHFGCMVDLFGRANLLREALEMIEAMPFAPNAIIWGSLMAACQVYGE 540
           HSM N+YGISPKHEHFGCMVDLFGRANLLREALE+IEAMPFAPNAIIWGSLMAACQV+GE
Sbjct: 481 HSMTNEYGISPKHEHFGCMVDLFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQVHGE 540

Query: 541 TELGEFAAKQVLKLEPNHDGAFVVLSNVYAKERRWEDVGEVRKLMNEMGVAKERGCSRVE 600
           TELGEFAAKQVLKLEP+HDGA VVLSN+YAKERRWEDVGEVRKLMN+MGV+KERGCSR+E
Sbjct: 541 TELGEFAAKQVLKLEPDHDGALVVLSNIYAKERRWEDVGEVRKLMNKMGVSKERGCSRIE 600

Query: 601 LNNEVHEFQMADRKHRQADQIYQKLDEVVQKLKMAGYTPRVDCVLVDLDEEERKEAILWH 660
           LNNEVHEFQMADRKH+QADQIYQKLDEVVQKL +AGYTP+ +CVL DLDEEE+KE +LWH
Sbjct: 601 LNNEVHEFQMADRKHKQADQIYQKLDEVVQKLNLAGYTPQTNCVLADLDEEEKKELVLWH 660

Query: 661 SEKLALCYALMNEGSHIRIIKNLRICEDCHTFMKLASKVYAREIIIRDRTRFHHYRDGSC 720
           SEKLA CYALMNEG  I IIKNLRICEDCH FMKLASKVYAREIIIRDR+RFHHYRDGSC
Sbjct: 661 SEKLAFCYALMNEGPRICIIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGSC 720

Query: 721 SCNDYW 727
           SC DYW
Sbjct: 721 SCKDYW 726

BLAST of Moc05g05620 vs. NCBI nr
Match: XP_022959359.1 (pentatricopeptide repeat-containing protein At4g14820 [Cucurbita moschata] >XP_022960069.1 pentatricopeptide repeat-containing protein At4g14820 [Cucurbita moschata])

HSP 1 Score: 1318.5 bits (3411), Expect = 0.0e+00
Identity = 634/726 (87.33%), Postives = 685/726 (94.35%), Query Frame = 0

Query: 1   METLSHSTSALPLQYHAFSTRPTTLAAALASASTLLHLKQVHVQILRSKFERYDSDSLLF 60
           METLSH+TS LPLQ   + T+P  L+AAL+SA++LLH+KQVH QILRSKFER DSDSLLF
Sbjct: 1   METLSHTTSILPLQLPPYPTKPNALSAALSSATSLLHIKQVHAQILRSKFERSDSDSLLF 60

Query: 61  KLVLSSCALSSSLDYALSVFDQIPEPKTRFCNKLLRELSRGPKPENALFVYEKMRAEGLS 120
           KL+LSSC+LS SLDYALSVFDQIPEPK+RFCNKLLRELSRG +PENALFVYEKMRAEGLS
Sbjct: 61  KLILSSCSLSPSLDYALSVFDQIPEPKSRFCNKLLRELSRGSEPENALFVYEKMRAEGLS 120

Query: 121 LDRFSFPPILKAASRNLSLRTGMEIHGLASKLGFGPDPFVETGLVRMYAACGRLMEARLV 180
           LDRF FPP+LKAASRNLSLRTGMEIHGLASKLGFG DPFVETGL+RMYAAC R+MEARLV
Sbjct: 121 LDRFCFPPLLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLIRMYAACRRIMEARLV 180

Query: 181 FDKMSHRDVVTWSIMIDGYCISGCYDLAFQLFEEMKRTDMEPDEMILSTIISACARAGNL 240
           FDKMS RDVVTWSIMIDGYCISG YDLAFQLFEEMKRT +EPDEMILSTI+SACARAGNL
Sbjct: 181 FDKMSQRDVVTWSIMIDGYCISGYYDLAFQLFEEMKRTGLEPDEMILSTILSACARAGNL 240

Query: 241 DYGTRIHEFITKKNIVMDPHLQSALITMYASCGSMDLAWDLYEKISPKNMVVSTAMVSGL 300
           D+GT++HEFITKKNIVMDPHLQSALI MYASCGS DLAWDLYEKISPKNMV+STAMVSGL
Sbjct: 241 DFGTKVHEFITKKNIVMDPHLQSALIKMYASCGSTDLAWDLYEKISPKNMVISTAMVSGL 300

Query: 301 SKCGQIGDARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQLLGIKPDVVTML 360
           +K GQIGDARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQ LG+KPDVVTML
Sbjct: 301 AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQQLGMKPDVVTML 360

Query: 361 SVISACAHLGALEQANWICTYVDKNGFDKALSVNNALIDMYAKCGSLEGAREVFKKMPKK 420
           SVISACAHLGAL+QA WI  YVDKNGF KALS+NNALIDMYAKCGSLEGARE+F KMPKK
Sbjct: 361 SVISACAHLGALDQAKWIQIYVDKNGFGKALSINNALIDMYAKCGSLEGAREIFGKMPKK 420

Query: 421 NVISWTCMINASAMHGDSHNALNLFHQMKDENVEPNWITFVGVLYACSHGGLVEEGRKIF 480
           NVISWT MINA AMHGD+H AL+LFHQMK ENVEPNWITFVG+LYACSHGGLVEEG++IF
Sbjct: 421 NVISWTSMINALAMHGDAHTALSLFHQMKVENVEPNWITFVGLLYACSHGGLVEEGQRIF 480

Query: 481 HSMINDYGISPKHEHFGCMVDLFGRANLLREALEMIEAMPFAPNAIIWGSLMAACQVYGE 540
           HSMIN+YGISPKHEHFGCMVDLFGRA LLREALE++EAMPFAPNAIIWGSLMAACQ++G+
Sbjct: 481 HSMINEYGISPKHEHFGCMVDLFGRAKLLREALEVVEAMPFAPNAIIWGSLMAACQLHGD 540

Query: 541 TELGEFAAKQVLKLEPNHDGAFVVLSNVYAKERRWEDVGEVRKLMNEMGVAKERGCSRVE 600
           TELGEFAAKQVLKLEP+HDGA VVLSN+YAKERRWED GEVRKLMNEMGV+KERGCSR+E
Sbjct: 541 TELGEFAAKQVLKLEPDHDGALVVLSNIYAKERRWEDAGEVRKLMNEMGVSKERGCSRIE 600

Query: 601 LNNEVHEFQMADRKHRQADQIYQKLDEVVQKLKMAGYTPRVDCVLVDLDEEERKEAILWH 660
           LNNEVHEFQMADRKH+QAD IYQKL+EVVQ LK+AGYTP+ +CVLVDLD+EE+KE +LWH
Sbjct: 601 LNNEVHEFQMADRKHKQADLIYQKLNEVVQTLKLAGYTPQTNCVLVDLDDEEKKELVLWH 660

Query: 661 SEKLALCYALMNEGSHIRIIKNLRICEDCHTFMKLASKVYAREIIIRDRTRFHHYRDGSC 720
           SEKLALCYALMNEGS I IIKNLRICEDCH FMKLASKVYAREI++RDRTRFHHYRDGSC
Sbjct: 661 SEKLALCYALMNEGSRICIIKNLRICEDCHAFMKLASKVYAREIVVRDRTRFHHYRDGSC 720

Query: 721 SCNDYW 727
           SC DYW
Sbjct: 721 SCKDYW 726

BLAST of Moc05g05620 vs. NCBI nr
Match: XP_023538947.1 (pentatricopeptide repeat-containing protein At4g14820 [Cucurbita pepo subsp. pepo] >XP_023538948.1 pentatricopeptide repeat-containing protein At4g14820 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1317.8 bits (3409), Expect = 0.0e+00
Identity = 635/726 (87.47%), Postives = 685/726 (94.35%), Query Frame = 0

Query: 1   METLSHSTSALPLQYHAFSTRPTTLAAALASASTLLHLKQVHVQILRSKFERYDSDSLLF 60
           METLSH+TS LPL    + TRPT L+AAL+SA++LLH+KQVH QILRSKFER DSDSLLF
Sbjct: 1   METLSHTTSILPLHLPPYPTRPTALSAALSSATSLLHIKQVHAQILRSKFERSDSDSLLF 60

Query: 61  KLVLSSCALSSSLDYALSVFDQIPEPKTRFCNKLLRELSRGPKPENALFVYEKMRAEGLS 120
           KL+LSSCALS SLDYALSVFDQIPEPKTRFCNKLLRELSRG +PENALF+YEKMRAEGLS
Sbjct: 61  KLILSSCALSPSLDYALSVFDQIPEPKTRFCNKLLRELSRGSEPENALFLYEKMRAEGLS 120

Query: 121 LDRFSFPPILKAASRNLSLRTGMEIHGLASKLGFGPDPFVETGLVRMYAACGRLMEARLV 180
           LDRF FPP+LKAASRNLSLRTGMEIHGLASKLGFG DPFVETGL+RMYAAC R+MEARLV
Sbjct: 121 LDRFCFPPLLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLIRMYAACRRIMEARLV 180

Query: 181 FDKMSHRDVVTWSIMIDGYCISGCYDLAFQLFEEMKRTDMEPDEMILSTIISACARAGNL 240
           FDKMS RDVVTWSIMIDGYCISG YDLAFQLFEEMKRT +EPDEMILSTI+SACARAGNL
Sbjct: 181 FDKMSQRDVVTWSIMIDGYCISGYYDLAFQLFEEMKRTGLEPDEMILSTILSACARAGNL 240

Query: 241 DYGTRIHEFITKKNIVMDPHLQSALITMYASCGSMDLAWDLYEKISPKNMVVSTAMVSGL 300
           D+GT+IHEFITK NIVMDPHLQSALI MYASCGS DLAWDLYEKI+PKNMV+STAMVSGL
Sbjct: 241 DFGTKIHEFITKNNIVMDPHLQSALIKMYASCGSTDLAWDLYEKITPKNMVISTAMVSGL 300

Query: 301 SKCGQIGDARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQLLGIKPDVVTML 360
           +K GQIGDAR VFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQ LG+KPDVVTML
Sbjct: 301 AKGGQIGDARCVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQQLGMKPDVVTML 360

Query: 361 SVISACAHLGALEQANWICTYVDKNGFDKALSVNNALIDMYAKCGSLEGAREVFKKMPKK 420
           SVISACAHLGAL+QA WI  YVDKNGF KALS+NNALIDMYAKCGSLEGARE+F KMPKK
Sbjct: 361 SVISACAHLGALDQAKWIQIYVDKNGFGKALSINNALIDMYAKCGSLEGAREIFGKMPKK 420

Query: 421 NVISWTCMINASAMHGDSHNALNLFHQMKDENVEPNWITFVGVLYACSHGGLVEEGRKIF 480
           NVISWT MINA AMHGD+HNAL+LFHQMK ENVEPNWITFVG+LYACSHGGLVEEG++IF
Sbjct: 421 NVISWTSMINALAMHGDAHNALSLFHQMKVENVEPNWITFVGLLYACSHGGLVEEGQRIF 480

Query: 481 HSMINDYGISPKHEHFGCMVDLFGRANLLREALEMIEAMPFAPNAIIWGSLMAACQVYGE 540
           HSMIN+YGISPKHEHFGCMVDLFGRA LLREALE++EAMPFAPNAIIWGSLMAACQ++G+
Sbjct: 481 HSMINEYGISPKHEHFGCMVDLFGRAKLLREALEVVEAMPFAPNAIIWGSLMAACQLHGD 540

Query: 541 TELGEFAAKQVLKLEPNHDGAFVVLSNVYAKERRWEDVGEVRKLMNEMGVAKERGCSRVE 600
           TELGEFAAKQVLKLEP+HDGA VVLSN+YAKERRWED G+VRKLMNEMGV+KERGCSR+E
Sbjct: 541 TELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWEDAGDVRKLMNEMGVSKERGCSRIE 600

Query: 601 LNNEVHEFQMADRKHRQADQIYQKLDEVVQKLKMAGYTPRVDCVLVDLDEEERKEAILWH 660
           LNNEVHEFQMADRKH+QAD IYQKL+EVVQ LK+AGYTP+++CVLVDLDEEE+KE +LWH
Sbjct: 601 LNNEVHEFQMADRKHKQADLIYQKLNEVVQTLKLAGYTPQINCVLVDLDEEEKKELVLWH 660

Query: 661 SEKLALCYALMNEGSHIRIIKNLRICEDCHTFMKLASKVYAREIIIRDRTRFHHYRDGSC 720
           SEKLALCYALMNEGS I IIKNLRICEDCH FMKLASKVYAREI++RDRTRFHHYRDGSC
Sbjct: 661 SEKLALCYALMNEGSRICIIKNLRICEDCHAFMKLASKVYAREIVVRDRTRFHHYRDGSC 720

Query: 721 SCNDYW 727
           SC DYW
Sbjct: 721 SCKDYW 726

BLAST of Moc05g05620 vs. NCBI nr
Match: XP_022974384.1 (pentatricopeptide repeat-containing protein At4g14820-like [Cucurbita maxima] >XP_022975405.1 pentatricopeptide repeat-containing protein At4g14820-like isoform X1 [Cucurbita maxima] >XP_022975406.1 pentatricopeptide repeat-containing protein At4g14820-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 1315.1 bits (3402), Expect = 0.0e+00
Identity = 634/726 (87.33%), Postives = 683/726 (94.08%), Query Frame = 0

Query: 1   METLSHSTSALPLQYHAFSTRPTTLAAALASASTLLHLKQVHVQILRSKFERYDSDSLLF 60
           METLSH+TS LPLQ   + TRP  L+AAL+SA++LLH+KQVH QILRSKFER DSDSLLF
Sbjct: 1   METLSHTTSILPLQLPPYPTRPNALSAALSSATSLLHIKQVHAQILRSKFERSDSDSLLF 60

Query: 61  KLVLSSCALSSSLDYALSVFDQIPEPKTRFCNKLLRELSRGPKPENALFVYEKMRAEGLS 120
           KL+LSSCALS SLDYALSVFDQIPEPKTRFCNKLLRELSRG +PENALFVYEKMRAEGLS
Sbjct: 61  KLILSSCALSPSLDYALSVFDQIPEPKTRFCNKLLRELSRGSEPENALFVYEKMRAEGLS 120

Query: 121 LDRFSFPPILKAASRNLSLRTGMEIHGLASKLGFGPDPFVETGLVRMYAACGRLMEARLV 180
           LDRF FPP+LKAASRNLSLRTGMEIHGLASKLGFG DPFVETGL+RMYAAC R+MEARLV
Sbjct: 121 LDRFCFPPLLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLIRMYAACRRIMEARLV 180

Query: 181 FDKMSHRDVVTWSIMIDGYCISGCYDLAFQLFEEMKRTDMEPDEMILSTIISACARAGNL 240
           FDKMS RDVVTWSIMIDGYC+SG YDLAFQLFEEMKRT +EPDEMILSTI+SACARAGNL
Sbjct: 181 FDKMSQRDVVTWSIMIDGYCLSGYYDLAFQLFEEMKRTGLEPDEMILSTILSACARAGNL 240

Query: 241 DYGTRIHEFITKKNIVMDPHLQSALITMYASCGSMDLAWDLYEKISPKNMVVSTAMVSGL 300
           D+GT+IHEFITKKNIVMDPHLQSALI MYAS GS DLAWDLYEKISPKNMV+STAMVSGL
Sbjct: 241 DFGTKIHEFITKKNIVMDPHLQSALIKMYASYGSTDLAWDLYEKISPKNMVISTAMVSGL 300

Query: 301 SKCGQIGDARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQLLGIKPDVVTML 360
           +K GQIGDARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQ +G+KPDVVTML
Sbjct: 301 AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQQMGMKPDVVTML 360

Query: 361 SVISACAHLGALEQANWICTYVDKNGFDKALSVNNALIDMYAKCGSLEGAREVFKKMPKK 420
           SVISACAHLGAL+QA WI  YVDKNGF KALS+NNALIDMYAKCGSLEGARE+F KMPKK
Sbjct: 361 SVISACAHLGALDQAKWIQIYVDKNGFGKALSINNALIDMYAKCGSLEGAREIFGKMPKK 420

Query: 421 NVISWTCMINASAMHGDSHNALNLFHQMKDENVEPNWITFVGVLYACSHGGLVEEGRKIF 480
           NVISWT MINA AMHGD+HNAL+LFHQMK ENVEPNWITFVG+LYACSHGGLV+EG++IF
Sbjct: 421 NVISWTSMINALAMHGDAHNALSLFHQMKVENVEPNWITFVGLLYACSHGGLVKEGQRIF 480

Query: 481 HSMINDYGISPKHEHFGCMVDLFGRANLLREALEMIEAMPFAPNAIIWGSLMAACQVYGE 540
           HSMIN+YGISPKHEHFGCMVDLFGRA LLREALE++EAMPFAPNAIIWGSLMAACQ++ +
Sbjct: 481 HSMINEYGISPKHEHFGCMVDLFGRAKLLREALEVVEAMPFAPNAIIWGSLMAACQLHSD 540

Query: 541 TELGEFAAKQVLKLEPNHDGAFVVLSNVYAKERRWEDVGEVRKLMNEMGVAKERGCSRVE 600
           TELGEFAAKQVLKLEP+HDGA VVLSN+YAKERRWED GEVRKLMNEMGV+KERGCSR+E
Sbjct: 541 TELGEFAAKQVLKLEPDHDGALVVLSNIYAKERRWEDAGEVRKLMNEMGVSKERGCSRIE 600

Query: 601 LNNEVHEFQMADRKHRQADQIYQKLDEVVQKLKMAGYTPRVDCVLVDLDEEERKEAILWH 660
           LNNEVHEFQMADRKH+QAD IY KL+EVVQKLK+AGYTP+ +CVLVDLDEEE+KE +LWH
Sbjct: 601 LNNEVHEFQMADRKHKQADLIYHKLNEVVQKLKLAGYTPQTNCVLVDLDEEEKKELVLWH 660

Query: 661 SEKLALCYALMNEGSHIRIIKNLRICEDCHTFMKLASKVYAREIIIRDRTRFHHYRDGSC 720
           SEKLALCYALMNEGS I I KNLRICEDCH FMKLASKVYAREI++RDRTRFHHYRDGSC
Sbjct: 661 SEKLALCYALMNEGSRICITKNLRICEDCHAFMKLASKVYAREIVVRDRTRFHHYRDGSC 720

Query: 721 SCNDYW 727
           SC DYW
Sbjct: 721 SCKDYW 726

BLAST of Moc05g05620 vs. ExPASy Swiss-Prot
Match: O23337 (Pentatricopeptide repeat-containing protein At4g14820 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H3 PE=2 SV=1)

HSP 1 Score: 857.4 bits (2214), Expect = 1.1e-247
Identity = 410/717 (57.18%), Postives = 544/717 (75.87%), Query Frame = 0

Query: 19  STRPTTLAAALASASTLLHLKQVHVQILRSKFERYDSDSLLFKLVLSSCALSSSLDYALS 78
           ST   T+   L+   +L H+KQ+H  ILR+    +  +S LF L +SS ++  +L YAL+
Sbjct: 9   STAANTILEKLSFCKSLNHIKQLHAHILRTVI-NHKLNSFLFNLSVSSSSI--NLSYALN 68

Query: 79  VFDQIPE-PKTRFCNKLLRELSRGPKPENALFVYEKMRAEGLSLDRFSFPPILKAASRNL 138
           VF  IP  P++   N  LR+LSR  +P   +  Y+++R  G  LD+FSF PILKA S+  
Sbjct: 69  VFSSIPSPPESIVFNPFLRDLSRSSEPRATILFYQRIRHVGGRLDQFSFLPILKAVSKVS 128

Query: 139 SLRTGMEIHGLASKLGFGPDPFVETGLVRMYAACGRLMEARLVFDKMSHRDVVTWSIMID 198
           +L  GME+HG+A K+    DPFVETG + MYA+CGR+  AR VFD+MSHRDVVTW+ MI+
Sbjct: 129 ALFEGMELHGVAFKIATLCDPFVETGFMDMYASCGRINYARNVFDEMSHRDVVTWNTMIE 188

Query: 199 GYCISGCYDLAFQLFEEMKRTDMEPDEMILSTIISACARAGNLDYGTRIHEFITKKNIVM 258
            YC  G  D AF+LFEEMK +++ PDEMIL  I+SAC R GN+ Y   I+EF+ + ++ M
Sbjct: 189 RYCRFGLVDEAFKLFEEMKDSNVMPDEMILCNIVSACGRTGNMRYNRAIYEFLIENDVRM 248

Query: 259 DPHLQSALITMYASCGSMDLAWDLYEKISPKNMVVSTAMVSGLSKCGQIGDARYVFDQMV 318
           D HL +AL+TMYA  G MD+A + + K+S +N+ VSTAMVSG SKCG++ DA+ +FDQ  
Sbjct: 249 DTHLLTALVTMYAGAGCMDMAREFFRKMSVRNLFVSTAMVSGYSKCGRLDDAQVIFDQTE 308

Query: 319 EKDLICWSAMISGYTESDCPQEALVLFKKMQLLGIKPDVVTMLSVISACAHLGALEQANW 378
           +KDL+CW+ MIS Y ESD PQEAL +F++M   GIKPDVV+M SVISACA+LG L++A W
Sbjct: 309 KKDLVCWTTMISAYVESDYPQEALRVFEEMCCSGIKPDVVSMFSVISACANLGILDKAKW 368

Query: 379 ICTYVDKNGFDKALSVNNALIDMYAKCGSLEGAREVFKKMPKKNVISWTCMINASAMHGD 438
           + + +  NG +  LS+NNALI+MYAKCG L+  R+VF+KMP++NV+SW+ MINA +MHG+
Sbjct: 369 VHSCIHVNGLESELSINNALINMYAKCGGLDATRDVFEKMPRRNVVSWSSMINALSMHGE 428

Query: 439 SHNALNLFHQMKDENVEPNWITFVGVLYACSHGGLVEEGRKIFHSMINDYGISPKHEHFG 498
           + +AL+LF +MK ENVEPN +TFVGVLY CSH GLVEEG+KIF SM ++Y I+PK EH+G
Sbjct: 429 ASDALSLFARMKQENVEPNEVTFVGVLYGCSHSGLVEEGKKIFASMTDEYNITPKLEHYG 488

Query: 499 CMVDLFGRANLLREALEMIEAMPFAPNAIIWGSLMAACQVYGETELGEFAAKQVLKLEPN 558
           CMVDLFGRANLLREALE+IE+MP A N +IWGSLM+AC+++GE ELG+FAAK++L+LEP+
Sbjct: 489 CMVDLFGRANLLREALEVIESMPVASNVVIWGSLMSACRIHGELELGKFAAKRILELEPD 548

Query: 559 HDGAFVVLSNVYAKERRWEDVGEVRKLMNEMGVAKERGCSRVELNNEVHEFQMADRKHRQ 618
           HDGA V++SN+YA+E+RWEDV  +R++M E  V KE+G SR++ N + HEF + D++H+Q
Sbjct: 549 HDGALVLMSNIYAREQRWEDVRNIRRVMEEKNVFKEKGLSRIDQNGKSHEFLIGDKRHKQ 608

Query: 619 ADQIYQKLDEVVQKLKMAGYTPRVDCVLVDLDEEERKEAILWHSEKLALCYALMNEGSH- 678
           +++IY KLDEVV KLK+AGY P    VLVD++EEE+K+ +LWHSEKLALC+ LMNE    
Sbjct: 609 SNEIYAKLDEVVSKLKLAGYVPDCGSVLVDVEEEEKKDLVLWHSEKLALCFGLMNEEKEE 668

Query: 679 -------IRIIKNLRICEDCHTFMKLASKVYAREIIIRDRTRFHHYRDGSCSCNDYW 727
                  IRI+KNLR+CEDCH F KL SKVY REII+RDRTRFH Y++G CSC DYW
Sbjct: 669 EKDSCGVIRIVKNLRVCEDCHLFFKLVSKVYEREIIVRDRTRFHCYKNGLCSCRDYW 722

BLAST of Moc05g05620 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 594.0 bits (1530), Expect = 2.3e-168
Identity = 290/700 (41.43%), Postives = 450/700 (64.29%), Query Frame = 0

Query: 34  TLLHLKQVHVQILRSK--FERYDSDSLLFKLVLSSCALSSSLDYALSVFDQIPEPKTRFC 93
           +L  LKQ H  ++R+    + Y +  L     LSS A   SL+YA  VFD+IP+P +   
Sbjct: 42  SLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFA---SLEYARKVFDEIPKPNSFAW 101

Query: 94  NKLLRELSRGPKPENALFVYEKMRAEGLSL-DRFSFPPILKAASRNLSLRTGMEIHGLAS 153
           N L+R  + GP P  +++ +  M +E     ++++FP ++KAA+   SL  G  +HG+A 
Sbjct: 102 NTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSLHGMAV 161

Query: 154 KLGFGPDPFVETGLVRMYAACGRLMEARLVFDKMSHRDVVTWSIMIDGYCISGCYDLAFQ 213
           K   G D FV   L+  Y +CG L  A  VF  +  +DVV+W+ MI+G+   G  D A +
Sbjct: 162 KSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSPDKALE 221

Query: 214 LFEEMKRTDMEPDEMILSTIISACARAGNLDYGTRIHEFITKKNIVMDPHLQSALITMYA 273
           LF++M+  D++   + +  ++SACA+  NL++G ++  +I +  + ++  L +A++ MY 
Sbjct: 222 LFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLANAMLDMYT 281

Query: 274 SCGSMDLAWDLYEKISPKNMVVSTAMVSGLSKCGQIGDARYVFDQMVEKDLICWSAMISG 333
            CGS++ A  L++ +  K+ V  T M+ G +       AR V + M +KD++ W+A+IS 
Sbjct: 282 KCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKDIVAWNALISA 341

Query: 334 YTESDCPQEALVLFKKMQL-LGIKPDVVTMLSVISACAHLGALEQANWICTYVDKNGFDK 393
           Y ++  P EAL++F ++QL   +K + +T++S +SACA +GALE   WI +Y+ K+G   
Sbjct: 342 YEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHSYIKKHGIRM 401

Query: 394 ALSVNNALIDMYAKCGSLEGAREVFKKMPKKNVISWTCMINASAMHGDSHNALNLFHQMK 453
              V +ALI MY+KCG LE +REVF  + K++V  W+ MI   AMHG  + A+++F++M+
Sbjct: 402 NFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGNEAVDMFYKMQ 461

Query: 454 DENVEPNWITFVGVLYACSHGGLVEEGRKIFHSMINDYGISPKHEHFGCMVDLFGRANLL 513
           + NV+PN +TF  V  ACSH GLV+E   +FH M ++YGI P+ +H+ C+VD+ GR+  L
Sbjct: 462 EANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACIVDVLGRSGYL 521

Query: 514 REALEMIEAMPFAPNAIIWGSLMAACQVYGETELGEFAAKQVLKLEPNHDGAFVVLSNVY 573
            +A++ IEAMP  P+  +WG+L+ AC+++    L E A  ++L+LEP +DGA V+LSN+Y
Sbjct: 522 EKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRNDGAHVLLSNIY 581

Query: 574 AKERRWEDVGEVRKLMNEMGVAKERGCSRVELNNEVHEFQMADRKHRQADQIYQKLDEVV 633
           AK  +WE+V E+RK M   G+ KE GCS +E++  +HEF   D  H  ++++Y KL EV+
Sbjct: 582 AKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSEKVYGKLHEVM 641

Query: 634 QKLKMAGYTPRVDCVLVDLDEEERKEAIL-WHSEKLALCYALMNEGSH--IRIIKNLRIC 693
           +KLK  GY P +  VL  ++EEE KE  L  HSEKLA+CY L++  +   IR+IKNLR+C
Sbjct: 642 EKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLISTEAPKVIRVIKNLRVC 701

Query: 694 EDCHTFMKLASKVYAREIIIRDRTRFHHYRDGSCSCNDYW 727
            DCH+  KL S++Y REII+RDR RFHH+R+G CSCND+W
Sbjct: 702 GDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCSCNDFW 738

BLAST of Moc05g05620 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 553.9 bits (1426), Expect = 2.7e-156
Identity = 289/736 (39.27%), Postives = 439/736 (59.65%), Query Frame = 0

Query: 29  LASASTLLHLKQVHVQILRSKFERYDSDSLLFKLVLSSCALS---SSLDYALSVFDQIPE 88
           L +  TL  L+ +H Q++  K   ++++  L KL+   C LS     L YA+SVF  I E
Sbjct: 40  LHNCKTLQSLRIIHAQMI--KIGLHNTNYALSKLI-EFCILSPHFEGLPYAISVFKTIQE 99

Query: 89  PKTRFCNKLLRELSRGPKPENALFVYEKMRAEGLSLDRFSFPPILKAASRNLSLRTGMEI 148
           P     N + R  +    P +AL +Y  M + GL  + ++FP +LK+ +++ + + G +I
Sbjct: 100 PNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQI 159

Query: 149 HGLASKLGFGPDPFVETGLVRMYAACGRLMEARLVFDKMSHR------------------ 208
           HG   KLG   D +V T L+ MY   GRL +A  VFDK  HR                  
Sbjct: 160 HGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGYI 219

Query: 209 -------------DVVTWSIMIDGYCISGCYDLAFQLFEEMKRTDMEPDEMILSTIISAC 268
                        DVV+W+ MI GY  +G Y  A +LF++M +T++ PDE  + T++SAC
Sbjct: 220 ENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSAC 279

Query: 269 ARAGNLDYGTRIHEFITKKNIVMDPHLQSALITMYASCGSMDLAWDLYEKISPKNMVVST 328
           A++G+++ G ++H +I       +  + +ALI +Y+ CG ++ A                
Sbjct: 280 AQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETA---------------- 339

Query: 329 AMVSGLSKCGQIGDARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQLLGIKP 388
                   CG       +F+++  KD+I W+ +I GYT  +  +EAL+LF++M   G  P
Sbjct: 340 --------CG-------LFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETP 399

Query: 389 DVVTMLSVISACAHLGALEQANWICTYVDK--NGFDKALSVNNALIDMYAKCGSLEGARE 448
           + VTMLS++ ACAHLGA++   WI  Y+DK   G   A S+  +LIDMYAKCG +E A +
Sbjct: 400 NDVTMLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQ 459

Query: 449 VFKKMPKKNVISWTCMINASAMHGDSHNALNLFHQMKDENVEPNWITFVGVLYACSHGGL 508
           VF  +  K++ SW  MI   AMHG +  + +LF +M+   ++P+ ITFVG+L ACSH G+
Sbjct: 460 VFNSILHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGM 519

Query: 509 VEEGRKIFHSMINDYGISPKHEHFGCMVDLFGRANLLREALEMIEAMPFAPNAIIWGSLM 568
           ++ GR IF +M  DY ++PK EH+GCM+DL G + L +EA EMI  M   P+ +IW SL+
Sbjct: 520 LDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLL 579

Query: 569 AACQVYGETELGEFAAKQVLKLEPNHDGAFVVLSNVYAKERRWEDVGEVRKLMNEMGVAK 628
            AC+++G  ELGE  A+ ++K+EP + G++V+LSN+YA   RW +V + R L+N+ G+ K
Sbjct: 580 KACKMHGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKK 639

Query: 629 ERGCSRVELNNEVHEFQMADRKHRQADQIYQKLDEVVQKLKMAGYTPRVDCVLVDLDEEE 688
             GCS +E+++ VHEF + D+ H +  +IY  L+E+   L+ AG+ P    VL +++EE 
Sbjct: 640 VPGCSSIEIDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEW 699

Query: 689 RKEAILWHSEKLALCYALMN--EGSHIRIIKNLRICEDCHTFMKLASKVYAREIIIRDRT 727
           ++ A+  HSEKLA+ + L++   G+ + I+KNLR+C +CH   KL SK+Y REII RDRT
Sbjct: 700 KEGALRHHSEKLAIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRT 741

BLAST of Moc05g05620 vs. ExPASy Swiss-Prot
Match: Q9LUJ2 (Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H56 PE=3 SV=1)

HSP 1 Score: 531.2 bits (1367), Expect = 1.9e-149
Identity = 289/809 (35.72%), Postives = 441/809 (54.51%), Query Frame = 0

Query: 27  AALASASTLLHLKQVHVQILRSKFERYDSD-SLLFKLVLSSCALSS--SLDYALSVFDQI 86
           ++L +  T+  LK  H  + +   +  D+D S + KLV  SC L +  SL +A  VF+  
Sbjct: 37  SSLKNCKTIDELKMFHRSLTK---QGLDNDVSTITKLVARSCELGTRESLSFAKEVFENS 96

Query: 87  PEPKTRFC-NKLLRELSRGPKPENALFVYEKMRAEGLSLDRFSFPPILKAASRNLSLRTG 146
               T F  N L+R  +       A+ ++ +M   G+S D+++FP  L A +++ +   G
Sbjct: 97  ESYGTCFMYNSLIRGYASSGLCNEAILLFLRMMNSGISPDKYTFPFGLSACAKSRAKGNG 156

Query: 147 MEIHGLASKLGFGPDPFVETGLVRMYAACGRLMEARLVFDKMSHRDVVTWSIMIDGYCIS 206
           ++IHGL  K+G+  D FV+  LV  YA CG L  AR VFD+MS R+VV+W+ MI GY   
Sbjct: 157 IQIHGLIVKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVVSWTSMICGYARR 216

Query: 207 GCYDLAFQLFEEMKR-TDMEPDEMILSTIISACARAGNLDYGTRIHEFITKKNIVMDPHL 266
                A  LF  M R  ++ P+ + +  +ISACA+  +L+ G +++ FI    I ++  +
Sbjct: 217 DFAKDAVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLETGEKVYAFIRNSGIEVNDLM 276

Query: 267 QSALITMYASCGSMDLA------------------------------------------- 326
            SAL+ MY  C ++D+A                                           
Sbjct: 277 VSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDSGV 336

Query: 327 -------------------------------------WD--------------------- 386
                                                WD                     
Sbjct: 337 RPDRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHRQDTAFR 396

Query: 387 LYEKISPKNMVVSTAMVSGLSKCGQIGDARYVFDQMVEKDLICWSAMISGYTESDCPQEA 446
           +++++S K +V   ++V+G  + G++  A   F+ M EK+++ W+ +ISG  +    +EA
Sbjct: 397 IFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGSLFEEA 456

Query: 447 LVLFKKMQLL-GIKPDVVTMLSVISACAHLGALEQANWICTYVDKNGFDKALSVNNALID 506
           + +F  MQ   G+  D VTM+S+ SAC HLGAL+ A WI  Y++KNG    + +   L+D
Sbjct: 457 IEVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRLGTTLVD 516

Query: 507 MYAKCGSLEGAREVFKKMPKKNVISWTCMINASAMHGDSHNALNLFHQMKDENVEPNWIT 566
           M+++CG  E A  +F  +  ++V +WT  I A AM G++  A+ LF  M ++ ++P+ + 
Sbjct: 517 MFSRCGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLKPDGVA 576

Query: 567 FVGVLYACSHGGLVEEGRKIFHSMINDYGISPKHEHFGCMVDLFGRANLLREALEMIEAM 626
           FVG L ACSHGGLV++G++IF+SM+  +G+SP+  H+GCMVDL GRA LL EA+++IE M
Sbjct: 577 FVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQLIEDM 636

Query: 627 PFAPNAIIWGSLMAACQVYGETELGEFAAKQVLKLEPNHDGAFVVLSNVYAKERRWEDVG 686
           P  PN +IW SL+AAC+V G  E+  +AA+++  L P   G++V+LSNVYA   RW D+ 
Sbjct: 637 PMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAGRWNDMA 696

Query: 687 EVRKLMNEMGVAKERGCSRVELNNEVHEFQMADRKHRQADQIYQKLDEVVQKLKMAGYTP 727
           +VR  M E G+ K  G S +++  + HEF   D  H +   I   LDEV Q+    G+ P
Sbjct: 697 KVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAMLDEVSQRASHLGHVP 756

BLAST of Moc05g05620 vs. ExPASy Swiss-Prot
Match: Q9LTV8 (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 510.0 bits (1312), Expect = 4.4e-143
Identity = 267/725 (36.83%), Postives = 430/725 (59.31%), Query Frame = 0

Query: 7   STSALPLQYHAFSTRPTTLAAALA-SASTLLHLKQVHVQILRSKFERYDSDSLLFKLVLS 66
           S  A PL Y        +  A+L  SA+    LKQ+H ++L    +   S  L+ KL+ +
Sbjct: 5   SCLASPLLYTNSGIHSDSFYASLIDSATHKAQLKQIHARLLVLGLQ--FSGFLITKLIHA 64

Query: 67  SCALSSSLDYALSVFDQIPEPKTRFCNKLLRELSRGPKPENALFVYEKMRAEGLSLDRFS 126
           S +    + +A  VFD +P P+    N ++R  SR    ++AL +Y  M+   +S D F+
Sbjct: 65  SSSF-GDITFARQVFDDLPRPQIFPWNAIIRGYSRNNHFQDALLMYSNMQLARVSPDSFT 124

Query: 127 FPPILKAASRNLSLRTGMEIHGLASKLGFGPDPFVETGLVRMYAACGRLMEARLVFD--K 186
           FP +LKA S    L+ G  +H    +LGF  D FV+ GL+ +YA C RL  AR VF+   
Sbjct: 125 FPHLLKACSGLSHLQMGRFVHAQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLP 184

Query: 187 MSHRDVVTWSIMIDGYCISGCYDLAFQLFEEMKRTDMEPDEMILSTIISACARAGNLDYG 246
           +  R +V+W+ ++  Y  +G    A ++F +M++ D++PD + L ++++A     +L  G
Sbjct: 185 LPERTIVSWTAIVSAYAQNGEPMEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQG 244

Query: 247 TRIHEFITKKNIVMDPHLQSALITMYASCGSMDLAWDLYEKISPKNMVVSTAMVSGLSKC 306
             IH  + K  + ++P L  +L TMYA                               KC
Sbjct: 245 RSIHASVVKMGLEIEPDLLISLNTMYA-------------------------------KC 304

Query: 307 GQIGDARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQLLGIKPDVVTMLSVI 366
           GQ+  A+ +FD+M   +LI W+AMISGY ++   +EA+ +F +M    ++PD +++ S I
Sbjct: 305 GQVATAKILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMINKDVRPDTISITSAI 364

Query: 367 SACAHLGALEQANWICTYVDKNGFDKALSVNNALIDMYAKCGSLEGAREVFKKMPKKNVI 426
           SACA +G+LEQA  +  YV ++ +   + +++ALIDM+AKCGS+EGAR VF +   ++V+
Sbjct: 365 SACAQVGSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTLDRDVV 424

Query: 427 SWTCMINASAMHGDSHNALNLFHQMKDENVEPNWITFVGVLYACSHGGLVEEGRKIFHSM 486
            W+ MI    +HG +  A++L+  M+   V PN +TF+G+L AC+H G+V EG   F+ M
Sbjct: 425 VWSAMIVGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLMACNHSGMVREGWWFFNRM 484

Query: 487 INDYGISPKHEHFGCMVDLFGRANLLREALEMIEAMPFAPNAIIWGSLMAACQVYGETEL 546
             D+ I+P+ +H+ C++DL GRA  L +A E+I+ MP  P   +WG+L++AC+ +   EL
Sbjct: 485 A-DHKINPQQQHYACVIDLLGRAGHLDQAYEVIKCMPVQPGVTVWGALLSACKKHRHVEL 544

Query: 547 GEFAAKQVLKLEPNHDGAFVVLSNVYAKERRWEDVGEVRKLMNEMGVAKERGCSRVELNN 606
           GE+AA+Q+  ++P++ G +V LSN+YA  R W+ V EVR  M E G+ K+ GCS VE+  
Sbjct: 545 GEYAAQQLFSIDPSNTGHYVQLSNLYAAARLWDRVAEVRVRMKEKGLNKDVGCSWVEVRG 604

Query: 607 EVHEFQMADRKHRQADQIYQKLDEVVQKLKMAGYTPRVDCVLVDLDEEERKEAILWHSEK 666
            +  F++ D+ H + ++I ++++ +  +LK  G+    D  L DL++EE +E +  HSE+
Sbjct: 605 RLEAFRVGDKSHPRYEEIERQVEWIESRLKEGGFVANKDASLHDLNDEEAEETLCSHSER 664

Query: 667 LALCYALMN--EGSHIRIIKNLRICEDCHTFMKLASKVYAREIIIRDRTRFHHYRDGSCS 726
           +A+ Y L++  +G+ +RI KNLR C +CH   KL SK+  REI++RD  RFHH++DG CS
Sbjct: 665 IAIAYGLISTPQGTPLRITKNLRACVNCHAATKLISKLVDREIVVRDTNRFHHFKDGVCS 694

BLAST of Moc05g05620 vs. ExPASy TrEMBL
Match: A0A6J1C8N9 (pentatricopeptide repeat-containing protein At4g14820 OS=Momordica charantia OX=3673 GN=LOC111009402 PE=3 SV=1)

HSP 1 Score: 1481.5 bits (3834), Expect = 0.0e+00
Identity = 726/726 (100.00%), Postives = 726/726 (100.00%), Query Frame = 0

Query: 1   METLSHSTSALPLQYHAFSTRPTTLAAALASASTLLHLKQVHVQILRSKFERYDSDSLLF 60
           METLSHSTSALPLQYHAFSTRPTTLAAALASASTLLHLKQVHVQILRSKFERYDSDSLLF
Sbjct: 1   METLSHSTSALPLQYHAFSTRPTTLAAALASASTLLHLKQVHVQILRSKFERYDSDSLLF 60

Query: 61  KLVLSSCALSSSLDYALSVFDQIPEPKTRFCNKLLRELSRGPKPENALFVYEKMRAEGLS 120
           KLVLSSCALSSSLDYALSVFDQIPEPKTRFCNKLLRELSRGPKPENALFVYEKMRAEGLS
Sbjct: 61  KLVLSSCALSSSLDYALSVFDQIPEPKTRFCNKLLRELSRGPKPENALFVYEKMRAEGLS 120

Query: 121 LDRFSFPPILKAASRNLSLRTGMEIHGLASKLGFGPDPFVETGLVRMYAACGRLMEARLV 180
           LDRFSFPPILKAASRNLSLRTGMEIHGLASKLGFGPDPFVETGLVRMYAACGRLMEARLV
Sbjct: 121 LDRFSFPPILKAASRNLSLRTGMEIHGLASKLGFGPDPFVETGLVRMYAACGRLMEARLV 180

Query: 181 FDKMSHRDVVTWSIMIDGYCISGCYDLAFQLFEEMKRTDMEPDEMILSTIISACARAGNL 240
           FDKMSHRDVVTWSIMIDGYCISGCYDLAFQLFEEMKRTDMEPDEMILSTIISACARAGNL
Sbjct: 181 FDKMSHRDVVTWSIMIDGYCISGCYDLAFQLFEEMKRTDMEPDEMILSTIISACARAGNL 240

Query: 241 DYGTRIHEFITKKNIVMDPHLQSALITMYASCGSMDLAWDLYEKISPKNMVVSTAMVSGL 300
           DYGTRIHEFITKKNIVMDPHLQSALITMYASCGSMDLAWDLYEKISPKNMVVSTAMVSGL
Sbjct: 241 DYGTRIHEFITKKNIVMDPHLQSALITMYASCGSMDLAWDLYEKISPKNMVVSTAMVSGL 300

Query: 301 SKCGQIGDARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQLLGIKPDVVTML 360
           SKCGQIGDARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQLLGIKPDVVTML
Sbjct: 301 SKCGQIGDARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQLLGIKPDVVTML 360

Query: 361 SVISACAHLGALEQANWICTYVDKNGFDKALSVNNALIDMYAKCGSLEGAREVFKKMPKK 420
           SVISACAHLGALEQANWICTYVDKNGFDKALSVNNALIDMYAKCGSLEGAREVFKKMPKK
Sbjct: 361 SVISACAHLGALEQANWICTYVDKNGFDKALSVNNALIDMYAKCGSLEGAREVFKKMPKK 420

Query: 421 NVISWTCMINASAMHGDSHNALNLFHQMKDENVEPNWITFVGVLYACSHGGLVEEGRKIF 480
           NVISWTCMINASAMHGDSHNALNLFHQMKDENVEPNWITFVGVLYACSHGGLVEEGRKIF
Sbjct: 421 NVISWTCMINASAMHGDSHNALNLFHQMKDENVEPNWITFVGVLYACSHGGLVEEGRKIF 480

Query: 481 HSMINDYGISPKHEHFGCMVDLFGRANLLREALEMIEAMPFAPNAIIWGSLMAACQVYGE 540
           HSMINDYGISPKHEHFGCMVDLFGRANLLREALEMIEAMPFAPNAIIWGSLMAACQVYGE
Sbjct: 481 HSMINDYGISPKHEHFGCMVDLFGRANLLREALEMIEAMPFAPNAIIWGSLMAACQVYGE 540

Query: 541 TELGEFAAKQVLKLEPNHDGAFVVLSNVYAKERRWEDVGEVRKLMNEMGVAKERGCSRVE 600
           TELGEFAAKQVLKLEPNHDGAFVVLSNVYAKERRWEDVGEVRKLMNEMGVAKERGCSRVE
Sbjct: 541 TELGEFAAKQVLKLEPNHDGAFVVLSNVYAKERRWEDVGEVRKLMNEMGVAKERGCSRVE 600

Query: 601 LNNEVHEFQMADRKHRQADQIYQKLDEVVQKLKMAGYTPRVDCVLVDLDEEERKEAILWH 660
           LNNEVHEFQMADRKHRQADQIYQKLDEVVQKLKMAGYTPRVDCVLVDLDEEERKEAILWH
Sbjct: 601 LNNEVHEFQMADRKHRQADQIYQKLDEVVQKLKMAGYTPRVDCVLVDLDEEERKEAILWH 660

Query: 661 SEKLALCYALMNEGSHIRIIKNLRICEDCHTFMKLASKVYAREIIIRDRTRFHHYRDGSC 720
           SEKLALCYALMNEGSHIRIIKNLRICEDCHTFMKLASKVYAREIIIRDRTRFHHYRDGSC
Sbjct: 661 SEKLALCYALMNEGSHIRIIKNLRICEDCHTFMKLASKVYAREIIIRDRTRFHHYRDGSC 720

Query: 721 SCNDYW 727
           SCNDYW
Sbjct: 721 SCNDYW 726

BLAST of Moc05g05620 vs. ExPASy TrEMBL
Match: A0A6J1H7U5 (pentatricopeptide repeat-containing protein At4g14820 OS=Cucurbita moschata OX=3662 GN=LOC111460103 PE=3 SV=1)

HSP 1 Score: 1318.5 bits (3411), Expect = 0.0e+00
Identity = 634/726 (87.33%), Postives = 685/726 (94.35%), Query Frame = 0

Query: 1   METLSHSTSALPLQYHAFSTRPTTLAAALASASTLLHLKQVHVQILRSKFERYDSDSLLF 60
           METLSH+TS LPLQ   + T+P  L+AAL+SA++LLH+KQVH QILRSKFER DSDSLLF
Sbjct: 1   METLSHTTSILPLQLPPYPTKPNALSAALSSATSLLHIKQVHAQILRSKFERSDSDSLLF 60

Query: 61  KLVLSSCALSSSLDYALSVFDQIPEPKTRFCNKLLRELSRGPKPENALFVYEKMRAEGLS 120
           KL+LSSC+LS SLDYALSVFDQIPEPK+RFCNKLLRELSRG +PENALFVYEKMRAEGLS
Sbjct: 61  KLILSSCSLSPSLDYALSVFDQIPEPKSRFCNKLLRELSRGSEPENALFVYEKMRAEGLS 120

Query: 121 LDRFSFPPILKAASRNLSLRTGMEIHGLASKLGFGPDPFVETGLVRMYAACGRLMEARLV 180
           LDRF FPP+LKAASRNLSLRTGMEIHGLASKLGFG DPFVETGL+RMYAAC R+MEARLV
Sbjct: 121 LDRFCFPPLLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLIRMYAACRRIMEARLV 180

Query: 181 FDKMSHRDVVTWSIMIDGYCISGCYDLAFQLFEEMKRTDMEPDEMILSTIISACARAGNL 240
           FDKMS RDVVTWSIMIDGYCISG YDLAFQLFEEMKRT +EPDEMILSTI+SACARAGNL
Sbjct: 181 FDKMSQRDVVTWSIMIDGYCISGYYDLAFQLFEEMKRTGLEPDEMILSTILSACARAGNL 240

Query: 241 DYGTRIHEFITKKNIVMDPHLQSALITMYASCGSMDLAWDLYEKISPKNMVVSTAMVSGL 300
           D+GT++HEFITKKNIVMDPHLQSALI MYASCGS DLAWDLYEKISPKNMV+STAMVSGL
Sbjct: 241 DFGTKVHEFITKKNIVMDPHLQSALIKMYASCGSTDLAWDLYEKISPKNMVISTAMVSGL 300

Query: 301 SKCGQIGDARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQLLGIKPDVVTML 360
           +K GQIGDARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQ LG+KPDVVTML
Sbjct: 301 AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQQLGMKPDVVTML 360

Query: 361 SVISACAHLGALEQANWICTYVDKNGFDKALSVNNALIDMYAKCGSLEGAREVFKKMPKK 420
           SVISACAHLGAL+QA WI  YVDKNGF KALS+NNALIDMYAKCGSLEGARE+F KMPKK
Sbjct: 361 SVISACAHLGALDQAKWIQIYVDKNGFGKALSINNALIDMYAKCGSLEGAREIFGKMPKK 420

Query: 421 NVISWTCMINASAMHGDSHNALNLFHQMKDENVEPNWITFVGVLYACSHGGLVEEGRKIF 480
           NVISWT MINA AMHGD+H AL+LFHQMK ENVEPNWITFVG+LYACSHGGLVEEG++IF
Sbjct: 421 NVISWTSMINALAMHGDAHTALSLFHQMKVENVEPNWITFVGLLYACSHGGLVEEGQRIF 480

Query: 481 HSMINDYGISPKHEHFGCMVDLFGRANLLREALEMIEAMPFAPNAIIWGSLMAACQVYGE 540
           HSMIN+YGISPKHEHFGCMVDLFGRA LLREALE++EAMPFAPNAIIWGSLMAACQ++G+
Sbjct: 481 HSMINEYGISPKHEHFGCMVDLFGRAKLLREALEVVEAMPFAPNAIIWGSLMAACQLHGD 540

Query: 541 TELGEFAAKQVLKLEPNHDGAFVVLSNVYAKERRWEDVGEVRKLMNEMGVAKERGCSRVE 600
           TELGEFAAKQVLKLEP+HDGA VVLSN+YAKERRWED GEVRKLMNEMGV+KERGCSR+E
Sbjct: 541 TELGEFAAKQVLKLEPDHDGALVVLSNIYAKERRWEDAGEVRKLMNEMGVSKERGCSRIE 600

Query: 601 LNNEVHEFQMADRKHRQADQIYQKLDEVVQKLKMAGYTPRVDCVLVDLDEEERKEAILWH 660
           LNNEVHEFQMADRKH+QAD IYQKL+EVVQ LK+AGYTP+ +CVLVDLD+EE+KE +LWH
Sbjct: 601 LNNEVHEFQMADRKHKQADLIYQKLNEVVQTLKLAGYTPQTNCVLVDLDDEEKKELVLWH 660

Query: 661 SEKLALCYALMNEGSHIRIIKNLRICEDCHTFMKLASKVYAREIIIRDRTRFHHYRDGSC 720
           SEKLALCYALMNEGS I IIKNLRICEDCH FMKLASKVYAREI++RDRTRFHHYRDGSC
Sbjct: 661 SEKLALCYALMNEGSRICIIKNLRICEDCHAFMKLASKVYAREIVVRDRTRFHHYRDGSC 720

Query: 721 SCNDYW 727
           SC DYW
Sbjct: 721 SCKDYW 726

BLAST of Moc05g05620 vs. ExPASy TrEMBL
Match: A0A6J1IE29 (pentatricopeptide repeat-containing protein At4g14820-like OS=Cucurbita maxima OX=3661 GN=LOC111474723 PE=3 SV=1)

HSP 1 Score: 1315.1 bits (3402), Expect = 0.0e+00
Identity = 634/726 (87.33%), Postives = 683/726 (94.08%), Query Frame = 0

Query: 1   METLSHSTSALPLQYHAFSTRPTTLAAALASASTLLHLKQVHVQILRSKFERYDSDSLLF 60
           METLSH+TS LPLQ   + TRP  L+AAL+SA++LLH+KQVH QILRSKFER DSDSLLF
Sbjct: 1   METLSHTTSILPLQLPPYPTRPNALSAALSSATSLLHIKQVHAQILRSKFERSDSDSLLF 60

Query: 61  KLVLSSCALSSSLDYALSVFDQIPEPKTRFCNKLLRELSRGPKPENALFVYEKMRAEGLS 120
           KL+LSSCALS SLDYALSVFDQIPEPKTRFCNKLLRELSRG +PENALFVYEKMRAEGLS
Sbjct: 61  KLILSSCALSPSLDYALSVFDQIPEPKTRFCNKLLRELSRGSEPENALFVYEKMRAEGLS 120

Query: 121 LDRFSFPPILKAASRNLSLRTGMEIHGLASKLGFGPDPFVETGLVRMYAACGRLMEARLV 180
           LDRF FPP+LKAASRNLSLRTGMEIHGLASKLGFG DPFVETGL+RMYAAC R+MEARLV
Sbjct: 121 LDRFCFPPLLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLIRMYAACRRIMEARLV 180

Query: 181 FDKMSHRDVVTWSIMIDGYCISGCYDLAFQLFEEMKRTDMEPDEMILSTIISACARAGNL 240
           FDKMS RDVVTWSIMIDGYC+SG YDLAFQLFEEMKRT +EPDEMILSTI+SACARAGNL
Sbjct: 181 FDKMSQRDVVTWSIMIDGYCLSGYYDLAFQLFEEMKRTGLEPDEMILSTILSACARAGNL 240

Query: 241 DYGTRIHEFITKKNIVMDPHLQSALITMYASCGSMDLAWDLYEKISPKNMVVSTAMVSGL 300
           D+GT+IHEFITKKNIVMDPHLQSALI MYAS GS DLAWDLYEKISPKNMV+STAMVSGL
Sbjct: 241 DFGTKIHEFITKKNIVMDPHLQSALIKMYASYGSTDLAWDLYEKISPKNMVISTAMVSGL 300

Query: 301 SKCGQIGDARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQLLGIKPDVVTML 360
           +K GQIGDARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQ +G+KPDVVTML
Sbjct: 301 AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQQMGMKPDVVTML 360

Query: 361 SVISACAHLGALEQANWICTYVDKNGFDKALSVNNALIDMYAKCGSLEGAREVFKKMPKK 420
           SVISACAHLGAL+QA WI  YVDKNGF KALS+NNALIDMYAKCGSLEGARE+F KMPKK
Sbjct: 361 SVISACAHLGALDQAKWIQIYVDKNGFGKALSINNALIDMYAKCGSLEGAREIFGKMPKK 420

Query: 421 NVISWTCMINASAMHGDSHNALNLFHQMKDENVEPNWITFVGVLYACSHGGLVEEGRKIF 480
           NVISWT MINA AMHGD+HNAL+LFHQMK ENVEPNWITFVG+LYACSHGGLV+EG++IF
Sbjct: 421 NVISWTSMINALAMHGDAHNALSLFHQMKVENVEPNWITFVGLLYACSHGGLVKEGQRIF 480

Query: 481 HSMINDYGISPKHEHFGCMVDLFGRANLLREALEMIEAMPFAPNAIIWGSLMAACQVYGE 540
           HSMIN+YGISPKHEHFGCMVDLFGRA LLREALE++EAMPFAPNAIIWGSLMAACQ++ +
Sbjct: 481 HSMINEYGISPKHEHFGCMVDLFGRAKLLREALEVVEAMPFAPNAIIWGSLMAACQLHSD 540

Query: 541 TELGEFAAKQVLKLEPNHDGAFVVLSNVYAKERRWEDVGEVRKLMNEMGVAKERGCSRVE 600
           TELGEFAAKQVLKLEP+HDGA VVLSN+YAKERRWED GEVRKLMNEMGV+KERGCSR+E
Sbjct: 541 TELGEFAAKQVLKLEPDHDGALVVLSNIYAKERRWEDAGEVRKLMNEMGVSKERGCSRIE 600

Query: 601 LNNEVHEFQMADRKHRQADQIYQKLDEVVQKLKMAGYTPRVDCVLVDLDEEERKEAILWH 660
           LNNEVHEFQMADRKH+QAD IY KL+EVVQKLK+AGYTP+ +CVLVDLDEEE+KE +LWH
Sbjct: 601 LNNEVHEFQMADRKHKQADLIYHKLNEVVQKLKLAGYTPQTNCVLVDLDEEEKKELVLWH 660

Query: 661 SEKLALCYALMNEGSHIRIIKNLRICEDCHTFMKLASKVYAREIIIRDRTRFHHYRDGSC 720
           SEKLALCYALMNEGS I I KNLRICEDCH FMKLASKVYAREI++RDRTRFHHYRDGSC
Sbjct: 661 SEKLALCYALMNEGSRICITKNLRICEDCHAFMKLASKVYAREIVVRDRTRFHHYRDGSC 720

Query: 721 SCNDYW 727
           SC DYW
Sbjct: 721 SCKDYW 726

BLAST of Moc05g05620 vs. ExPASy TrEMBL
Match: A0A6J1ICZ7 (pentatricopeptide repeat-containing protein At4g14820-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111474723 PE=3 SV=1)

HSP 1 Score: 1306.2 bits (3379), Expect = 0.0e+00
Identity = 634/738 (85.91%), Postives = 683/738 (92.55%), Query Frame = 0

Query: 1   METLSHSTSALPLQYHAFSTRPTTLAAALASASTLLHLKQVHVQILRSKFERYDSDSLLF 60
           METLSH+TS LPLQ   + TRP  L+AAL+SA++LLH+KQVH QILRSKFER DSDSLLF
Sbjct: 1   METLSHTTSILPLQLPPYPTRPNALSAALSSATSLLHIKQVHAQILRSKFERSDSDSLLF 60

Query: 61  KLVLSSCALSSSLDYALSVFDQIPEPKTRFCNKLLRELSRGPKPENALFVYEKMRAEGLS 120
           KL+LSSCALS SLDYALSVFDQIPEPKTRFCNKLLRELSRG +PENALFVYEKMRAEGLS
Sbjct: 61  KLILSSCALSPSLDYALSVFDQIPEPKTRFCNKLLRELSRGSEPENALFVYEKMRAEGLS 120

Query: 121 LDRFSFPPILKAASRNLSLRTGMEIHGLASKLGFGPDPFVETGLVRMYAACGRLMEARLV 180
           LDRF FPP+LKAASRNLSLRTGMEIHGLASKLGFG DPFVETGL+RMYAAC R+MEARLV
Sbjct: 121 LDRFCFPPLLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLIRMYAACRRIMEARLV 180

Query: 181 FDKMSHRDVVTWSIMIDGYCISGCYDLAFQLFEEMKRTDMEPDEMILSTIISACARAGNL 240
           FDKMS RDVVTWSIMIDGYC+SG YDLAFQLFEEMKRT +EPDEMILSTI+SACARAGNL
Sbjct: 181 FDKMSQRDVVTWSIMIDGYCLSGYYDLAFQLFEEMKRTGLEPDEMILSTILSACARAGNL 240

Query: 241 DYGTRIHEFITKKNIVMDPHLQSALITMYASCGSMDLAWDLYEKISPKNMVVSTAMVSGL 300
           D+GT+IHEFITKKNIVMDPHLQSALI MYAS GS DLAWDLYEKISPKNMV+STAMVSGL
Sbjct: 241 DFGTKIHEFITKKNIVMDPHLQSALIKMYASYGSTDLAWDLYEKISPKNMVISTAMVSGL 300

Query: 301 SKCGQIGDARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQLLGIKPDVVTML 360
           +K GQIGDARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQ +G+KPDVVTML
Sbjct: 301 AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQQMGMKPDVVTML 360

Query: 361 SVISACAHLGALEQANWICTYVDKNGFDKALSVNNALIDMYAKCGSLEGAREVFKKMPKK 420
           SVISACAHLGAL+QA WI  YVDKNGF KALS+NNALIDMYAKCGSLEGARE+F KMPKK
Sbjct: 361 SVISACAHLGALDQAKWIQIYVDKNGFGKALSINNALIDMYAKCGSLEGAREIFGKMPKK 420

Query: 421 NVISWTCMINASAMHGDSHNALNLFHQMKDENVEPNWITFVGVLYACSHGGLVEEGRKIF 480
           NVISWT MINA AMHGD+HNAL+LFHQMK ENVEPNWITFVG+LYACSHGGLV+EG++IF
Sbjct: 421 NVISWTSMINALAMHGDAHNALSLFHQMKVENVEPNWITFVGLLYACSHGGLVKEGQRIF 480

Query: 481 HSMINDYGISPKHEHFGCMVDLFGRANLLREALEMIEAMPFAPNAIIWGSLMAACQVYGE 540
           HSMIN+YGISPKHEHFGCMVDLFGRA LLREALE++EAMPFAPNAIIWGSLMAACQ++ +
Sbjct: 481 HSMINEYGISPKHEHFGCMVDLFGRAKLLREALEVVEAMPFAPNAIIWGSLMAACQLHSD 540

Query: 541 TELGEFAAKQVLKLEPNHDGAFVVLSNVYAKERRWEDVGEVRKLMNEMGVAKERGCSRVE 600
           TELGEFAAKQVLKLEP+HDGA VVLSN+YAKERRWED GEVRKLMNEMGV+KERGCSR+E
Sbjct: 541 TELGEFAAKQVLKLEPDHDGALVVLSNIYAKERRWEDAGEVRKLMNEMGVSKERGCSRIE 600

Query: 601 LNNEVHEFQMADRKHRQADQIYQKLDEVVQKLKMAGYTPRVDCVLVDLDEEERKEAILWH 660
           LNNEVHEFQMADRKH+QAD IY KL+EVVQKLK+AGYTP+ +CVLVDLDEEE+KE +LWH
Sbjct: 601 LNNEVHEFQMADRKHKQADLIYHKLNEVVQKLKLAGYTPQTNCVLVDLDEEEKKELVLWH 660

Query: 661 SEKLALCYALMNEGSH------------IRIIKNLRICEDCHTFMKLASKVYAREIIIRD 720
           SEKLALCYALMNEGS             I I KNLRICEDCH FMKLASKVYAREI++RD
Sbjct: 661 SEKLALCYALMNEGSRICITKNLRICEGICITKNLRICEDCHAFMKLASKVYAREIVVRD 720

Query: 721 RTRFHHYRDGSCSCNDYW 727
           RTRFHHYRDGSCSC DYW
Sbjct: 721 RTRFHHYRDGSCSCKDYW 738

BLAST of Moc05g05620 vs. ExPASy TrEMBL
Match: A0A438KLK6 (Pentatricopeptide repeat-containing protein OS=Vitis vinifera OX=29760 GN=PCMP-H3_3 PE=3 SV=1)

HSP 1 Score: 1080.1 bits (2792), Expect = 0.0e+00
Identity = 518/719 (72.04%), Postives = 612/719 (85.12%), Query Frame = 0

Query: 10  ALPLQYHAFSTRPTTLAAALASASTLLHLKQVHVQILRSKFERYDSDSLLFKLVLSSCAL 69
           A P  +H+      TL +AL+SA++L HLKQVH QILRSK +R  S SLL KLV+SSCAL
Sbjct: 15  ATPTTHHSHH----TLFSALSSATSLTHLKQVHAQILRSKLDR--STSLLVKLVISSCAL 74

Query: 70  SSSLDYALSVFDQIPEPKTRFCNKLLRELSRGPKPENALFVYEKMRAEGLSLDRFSFPPI 129
           SSSLDYALSVF+ IP+P+T  CN+ LRELSR  +PE  L VYE+MR +GL++DRFSFPP+
Sbjct: 75  SSSLDYALSVFNLIPKPETHLCNRFLRELSRSEEPEKTLLVYERMRKQGLAVDRFSFPPL 134

Query: 130 LKAASRNLSLRTGMEIHGLASKLGFGPDPFVETGLVRMYAACGRLMEARLVFDKMSHRDV 189
           LKA SR  SL  G+EIHGLA+KLGF  DPFV+TGLVRMYAACGR+ EARL+FDKM HRDV
Sbjct: 135 LKALSRVKSLVEGLEIHGLAAKLGFDSDPFVQTGLVRMYAACGRIAEARLMFDKMFHRDV 194

Query: 190 VTWSIMIDGYCISGCYDLAFQLFEEMKRTDMEPDEMILSTIISACARAGNLDYGTRIHEF 249
           VTWSIMIDGYC SG ++ A  LFEEMK  ++EPDEM+LST++SAC RAGNL YG  IH+F
Sbjct: 195 VTWSIMIDGYCQSGLFNDALLLFEEMKNYNVEPDEMMLSTVLSACGRAGNLSYGKMIHDF 254

Query: 250 ITKKNIVMDPHLQSALITMYASCGSMDLAWDLYEKISPKNMVVSTAMVSGLSKCGQIGDA 309
           I + NIV+DPHLQSAL+TMYASCGSMDLA +L+EK++PKN+V STAMV+G SK GQI +A
Sbjct: 255 IMENNIVVDPHLQSALVTMYASCGSMDLALNLFEKMTPKNLVASTAMVTGYSKLGQIENA 314

Query: 310 RYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQLLGIKPDVVTMLSVISACAHL 369
           R VF+QMV+KDL+CWSAMISGY ESD PQEAL LF +MQ LGIKPD VTMLSVI+ACAHL
Sbjct: 315 RSVFNQMVKKDLVCWSAMISGYAESDSPQEALNLFNEMQSLGIKPDQVTMLSVITACAHL 374

Query: 370 GALEQANWICTYVDKNGFDKALSVNNALIDMYAKCGSLEGAREVFKKMPKKNVISWTCMI 429
           GAL+QA WI  +VDKNGF  AL +NNALI+MYAKCGSLE AR +F KMP+KNVISWTCMI
Sbjct: 375 GALDQAKWIHLFVDKNGFGGALPINNALIEMYAKCGSLERARRIFDKMPRKNVISWTCMI 434

Query: 430 NASAMHGDSHNALNLFHQMKDENVEPNWITFVGVLYACSHGGLVEEGRKIFHSMINDYGI 489
           +A AMHGD+ +AL  FHQM+DEN+EPN ITFVGVLYACSH GLVEEGRKIF+SMIN++ I
Sbjct: 435 SAFAMHGDAGSALRFFHQMEDENIEPNGITFVGVLYACSHAGLVEEGRKIFYSMINEHNI 494

Query: 490 SPKHEHFGCMVDLFGRANLLREALEMIEAMPFAPNAIIWGSLMAACQVYGETELGEFAAK 549
           +PKH H+GCMVDLFGRANLLREALE++EAMP APN IIWGSLMAAC+V+GE ELGEFAAK
Sbjct: 495 TPKHVHYGCMVDLFGRANLLREALELVEAMPLAPNVIIWGSLMAACRVHGEVELGEFAAK 554

Query: 550 QVLKLEPNHDGAFVVLSNVYAKERRWEDVGEVRKLMNEMGVAKERGCSRVELNNEVHEFQ 609
           ++L+L+P+HDGA V LSN+YAK +RWEDVG+VRKLM   G++KERGCSR+ELNNE+HEF 
Sbjct: 555 RLLELDPDHDGAHVFLSNIYAKAKRWEDVGQVRKLMKHKGISKERGCSRIELNNEIHEFL 614

Query: 610 MADRKHRQADQIYQKLDEVVQKLKMAGYTPRVDCVLVDLDEEERKEAILWHSEKLALCYA 669
           +ADR H+ AD+IY+KLDEVV KLK+ GY+P    +LVDL+EEE+KE +LWHSEKLALCY 
Sbjct: 615 VADRSHKHADEIYEKLDEVVSKLKLVGYSPNTCSILVDLEEEEKKEVVLWHSEKLALCYG 674

Query: 670 LMNE--GSHIRIIKNLRICEDCHTFMKLASKVYAREIIIRDRTRFHHYRDGSCSCNDYW 727
           LM +  GS I I KNLR+CEDCHTF+KLASKVY REI++RDRTRFHHY+DG CSC DYW
Sbjct: 675 LMRDGTGSCIHINKNLRVCEDCHTFIKLASKVYEREIVVRDRTRFHHYKDGVCSCKDYW 727

BLAST of Moc05g05620 vs. TAIR 10
Match: AT4G14820.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 857.4 bits (2214), Expect = 8.1e-249
Identity = 410/717 (57.18%), Postives = 544/717 (75.87%), Query Frame = 0

Query: 19  STRPTTLAAALASASTLLHLKQVHVQILRSKFERYDSDSLLFKLVLSSCALSSSLDYALS 78
           ST   T+   L+   +L H+KQ+H  ILR+    +  +S LF L +SS ++  +L YAL+
Sbjct: 9   STAANTILEKLSFCKSLNHIKQLHAHILRTVI-NHKLNSFLFNLSVSSSSI--NLSYALN 68

Query: 79  VFDQIPE-PKTRFCNKLLRELSRGPKPENALFVYEKMRAEGLSLDRFSFPPILKAASRNL 138
           VF  IP  P++   N  LR+LSR  +P   +  Y+++R  G  LD+FSF PILKA S+  
Sbjct: 69  VFSSIPSPPESIVFNPFLRDLSRSSEPRATILFYQRIRHVGGRLDQFSFLPILKAVSKVS 128

Query: 139 SLRTGMEIHGLASKLGFGPDPFVETGLVRMYAACGRLMEARLVFDKMSHRDVVTWSIMID 198
           +L  GME+HG+A K+    DPFVETG + MYA+CGR+  AR VFD+MSHRDVVTW+ MI+
Sbjct: 129 ALFEGMELHGVAFKIATLCDPFVETGFMDMYASCGRINYARNVFDEMSHRDVVTWNTMIE 188

Query: 199 GYCISGCYDLAFQLFEEMKRTDMEPDEMILSTIISACARAGNLDYGTRIHEFITKKNIVM 258
            YC  G  D AF+LFEEMK +++ PDEMIL  I+SAC R GN+ Y   I+EF+ + ++ M
Sbjct: 189 RYCRFGLVDEAFKLFEEMKDSNVMPDEMILCNIVSACGRTGNMRYNRAIYEFLIENDVRM 248

Query: 259 DPHLQSALITMYASCGSMDLAWDLYEKISPKNMVVSTAMVSGLSKCGQIGDARYVFDQMV 318
           D HL +AL+TMYA  G MD+A + + K+S +N+ VSTAMVSG SKCG++ DA+ +FDQ  
Sbjct: 249 DTHLLTALVTMYAGAGCMDMAREFFRKMSVRNLFVSTAMVSGYSKCGRLDDAQVIFDQTE 308

Query: 319 EKDLICWSAMISGYTESDCPQEALVLFKKMQLLGIKPDVVTMLSVISACAHLGALEQANW 378
           +KDL+CW+ MIS Y ESD PQEAL +F++M   GIKPDVV+M SVISACA+LG L++A W
Sbjct: 309 KKDLVCWTTMISAYVESDYPQEALRVFEEMCCSGIKPDVVSMFSVISACANLGILDKAKW 368

Query: 379 ICTYVDKNGFDKALSVNNALIDMYAKCGSLEGAREVFKKMPKKNVISWTCMINASAMHGD 438
           + + +  NG +  LS+NNALI+MYAKCG L+  R+VF+KMP++NV+SW+ MINA +MHG+
Sbjct: 369 VHSCIHVNGLESELSINNALINMYAKCGGLDATRDVFEKMPRRNVVSWSSMINALSMHGE 428

Query: 439 SHNALNLFHQMKDENVEPNWITFVGVLYACSHGGLVEEGRKIFHSMINDYGISPKHEHFG 498
           + +AL+LF +MK ENVEPN +TFVGVLY CSH GLVEEG+KIF SM ++Y I+PK EH+G
Sbjct: 429 ASDALSLFARMKQENVEPNEVTFVGVLYGCSHSGLVEEGKKIFASMTDEYNITPKLEHYG 488

Query: 499 CMVDLFGRANLLREALEMIEAMPFAPNAIIWGSLMAACQVYGETELGEFAAKQVLKLEPN 558
           CMVDLFGRANLLREALE+IE+MP A N +IWGSLM+AC+++GE ELG+FAAK++L+LEP+
Sbjct: 489 CMVDLFGRANLLREALEVIESMPVASNVVIWGSLMSACRIHGELELGKFAAKRILELEPD 548

Query: 559 HDGAFVVLSNVYAKERRWEDVGEVRKLMNEMGVAKERGCSRVELNNEVHEFQMADRKHRQ 618
           HDGA V++SN+YA+E+RWEDV  +R++M E  V KE+G SR++ N + HEF + D++H+Q
Sbjct: 549 HDGALVLMSNIYAREQRWEDVRNIRRVMEEKNVFKEKGLSRIDQNGKSHEFLIGDKRHKQ 608

Query: 619 ADQIYQKLDEVVQKLKMAGYTPRVDCVLVDLDEEERKEAILWHSEKLALCYALMNEGSH- 678
           +++IY KLDEVV KLK+AGY P    VLVD++EEE+K+ +LWHSEKLALC+ LMNE    
Sbjct: 609 SNEIYAKLDEVVSKLKLAGYVPDCGSVLVDVEEEEKKDLVLWHSEKLALCFGLMNEEKEE 668

Query: 679 -------IRIIKNLRICEDCHTFMKLASKVYAREIIIRDRTRFHHYRDGSCSCNDYW 727
                  IRI+KNLR+CEDCH F KL SKVY REII+RDRTRFH Y++G CSC DYW
Sbjct: 669 EKDSCGVIRIVKNLRVCEDCHLFFKLVSKVYEREIIVRDRTRFHCYKNGLCSCRDYW 722

BLAST of Moc05g05620 vs. TAIR 10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 594.0 bits (1530), Expect = 1.7e-169
Identity = 290/700 (41.43%), Postives = 450/700 (64.29%), Query Frame = 0

Query: 34  TLLHLKQVHVQILRSK--FERYDSDSLLFKLVLSSCALSSSLDYALSVFDQIPEPKTRFC 93
           +L  LKQ H  ++R+    + Y +  L     LSS A   SL+YA  VFD+IP+P +   
Sbjct: 42  SLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFA---SLEYARKVFDEIPKPNSFAW 101

Query: 94  NKLLRELSRGPKPENALFVYEKMRAEGLSL-DRFSFPPILKAASRNLSLRTGMEIHGLAS 153
           N L+R  + GP P  +++ +  M +E     ++++FP ++KAA+   SL  G  +HG+A 
Sbjct: 102 NTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSLHGMAV 161

Query: 154 KLGFGPDPFVETGLVRMYAACGRLMEARLVFDKMSHRDVVTWSIMIDGYCISGCYDLAFQ 213
           K   G D FV   L+  Y +CG L  A  VF  +  +DVV+W+ MI+G+   G  D A +
Sbjct: 162 KSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSPDKALE 221

Query: 214 LFEEMKRTDMEPDEMILSTIISACARAGNLDYGTRIHEFITKKNIVMDPHLQSALITMYA 273
           LF++M+  D++   + +  ++SACA+  NL++G ++  +I +  + ++  L +A++ MY 
Sbjct: 222 LFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLANAMLDMYT 281

Query: 274 SCGSMDLAWDLYEKISPKNMVVSTAMVSGLSKCGQIGDARYVFDQMVEKDLICWSAMISG 333
            CGS++ A  L++ +  K+ V  T M+ G +       AR V + M +KD++ W+A+IS 
Sbjct: 282 KCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKDIVAWNALISA 341

Query: 334 YTESDCPQEALVLFKKMQL-LGIKPDVVTMLSVISACAHLGALEQANWICTYVDKNGFDK 393
           Y ++  P EAL++F ++QL   +K + +T++S +SACA +GALE   WI +Y+ K+G   
Sbjct: 342 YEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHSYIKKHGIRM 401

Query: 394 ALSVNNALIDMYAKCGSLEGAREVFKKMPKKNVISWTCMINASAMHGDSHNALNLFHQMK 453
              V +ALI MY+KCG LE +REVF  + K++V  W+ MI   AMHG  + A+++F++M+
Sbjct: 402 NFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGNEAVDMFYKMQ 461

Query: 454 DENVEPNWITFVGVLYACSHGGLVEEGRKIFHSMINDYGISPKHEHFGCMVDLFGRANLL 513
           + NV+PN +TF  V  ACSH GLV+E   +FH M ++YGI P+ +H+ C+VD+ GR+  L
Sbjct: 462 EANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACIVDVLGRSGYL 521

Query: 514 REALEMIEAMPFAPNAIIWGSLMAACQVYGETELGEFAAKQVLKLEPNHDGAFVVLSNVY 573
            +A++ IEAMP  P+  +WG+L+ AC+++    L E A  ++L+LEP +DGA V+LSN+Y
Sbjct: 522 EKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRNDGAHVLLSNIY 581

Query: 574 AKERRWEDVGEVRKLMNEMGVAKERGCSRVELNNEVHEFQMADRKHRQADQIYQKLDEVV 633
           AK  +WE+V E+RK M   G+ KE GCS +E++  +HEF   D  H  ++++Y KL EV+
Sbjct: 582 AKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSEKVYGKLHEVM 641

Query: 634 QKLKMAGYTPRVDCVLVDLDEEERKEAIL-WHSEKLALCYALMNEGSH--IRIIKNLRIC 693
           +KLK  GY P +  VL  ++EEE KE  L  HSEKLA+CY L++  +   IR+IKNLR+C
Sbjct: 642 EKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLISTEAPKVIRVIKNLRVC 701

Query: 694 EDCHTFMKLASKVYAREIIIRDRTRFHHYRDGSCSCNDYW 727
            DCH+  KL S++Y REII+RDR RFHH+R+G CSCND+W
Sbjct: 702 GDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCSCNDFW 738

BLAST of Moc05g05620 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 553.9 bits (1426), Expect = 1.9e-157
Identity = 289/736 (39.27%), Postives = 439/736 (59.65%), Query Frame = 0

Query: 29  LASASTLLHLKQVHVQILRSKFERYDSDSLLFKLVLSSCALS---SSLDYALSVFDQIPE 88
           L +  TL  L+ +H Q++  K   ++++  L KL+   C LS     L YA+SVF  I E
Sbjct: 40  LHNCKTLQSLRIIHAQMI--KIGLHNTNYALSKLI-EFCILSPHFEGLPYAISVFKTIQE 99

Query: 89  PKTRFCNKLLRELSRGPKPENALFVYEKMRAEGLSLDRFSFPPILKAASRNLSLRTGMEI 148
           P     N + R  +    P +AL +Y  M + GL  + ++FP +LK+ +++ + + G +I
Sbjct: 100 PNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQI 159

Query: 149 HGLASKLGFGPDPFVETGLVRMYAACGRLMEARLVFDKMSHR------------------ 208
           HG   KLG   D +V T L+ MY   GRL +A  VFDK  HR                  
Sbjct: 160 HGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGYI 219

Query: 209 -------------DVVTWSIMIDGYCISGCYDLAFQLFEEMKRTDMEPDEMILSTIISAC 268
                        DVV+W+ MI GY  +G Y  A +LF++M +T++ PDE  + T++SAC
Sbjct: 220 ENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSAC 279

Query: 269 ARAGNLDYGTRIHEFITKKNIVMDPHLQSALITMYASCGSMDLAWDLYEKISPKNMVVST 328
           A++G+++ G ++H +I       +  + +ALI +Y+ CG ++ A                
Sbjct: 280 AQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETA---------------- 339

Query: 329 AMVSGLSKCGQIGDARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQLLGIKP 388
                   CG       +F+++  KD+I W+ +I GYT  +  +EAL+LF++M   G  P
Sbjct: 340 --------CG-------LFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETP 399

Query: 389 DVVTMLSVISACAHLGALEQANWICTYVDK--NGFDKALSVNNALIDMYAKCGSLEGARE 448
           + VTMLS++ ACAHLGA++   WI  Y+DK   G   A S+  +LIDMYAKCG +E A +
Sbjct: 400 NDVTMLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQ 459

Query: 449 VFKKMPKKNVISWTCMINASAMHGDSHNALNLFHQMKDENVEPNWITFVGVLYACSHGGL 508
           VF  +  K++ SW  MI   AMHG +  + +LF +M+   ++P+ ITFVG+L ACSH G+
Sbjct: 460 VFNSILHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGM 519

Query: 509 VEEGRKIFHSMINDYGISPKHEHFGCMVDLFGRANLLREALEMIEAMPFAPNAIIWGSLM 568
           ++ GR IF +M  DY ++PK EH+GCM+DL G + L +EA EMI  M   P+ +IW SL+
Sbjct: 520 LDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLL 579

Query: 569 AACQVYGETELGEFAAKQVLKLEPNHDGAFVVLSNVYAKERRWEDVGEVRKLMNEMGVAK 628
            AC+++G  ELGE  A+ ++K+EP + G++V+LSN+YA   RW +V + R L+N+ G+ K
Sbjct: 580 KACKMHGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKK 639

Query: 629 ERGCSRVELNNEVHEFQMADRKHRQADQIYQKLDEVVQKLKMAGYTPRVDCVLVDLDEEE 688
             GCS +E+++ VHEF + D+ H +  +IY  L+E+   L+ AG+ P    VL +++EE 
Sbjct: 640 VPGCSSIEIDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEW 699

Query: 689 RKEAILWHSEKLALCYALMN--EGSHIRIIKNLRICEDCHTFMKLASKVYAREIIIRDRT 727
           ++ A+  HSEKLA+ + L++   G+ + I+KNLR+C +CH   KL SK+Y REII RDRT
Sbjct: 700 KEGALRHHSEKLAIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRT 741

BLAST of Moc05g05620 vs. TAIR 10
Match: AT3G22690.2 (INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification; LOCATED IN: chloroplast; EXPRESSED IN: 13 plant structures; EXPRESSED DURING: LP.04 four leaves visible, 4 anthesis, petal differentiation and expansion stage, E expanded cotyledon stage, D bilateral stage; CONTAINS InterPro DOMAIN/s: Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT2G29760.1). )

HSP 1 Score: 531.2 bits (1367), Expect = 1.3e-150
Identity = 289/809 (35.72%), Postives = 441/809 (54.51%), Query Frame = 0

Query: 27  AALASASTLLHLKQVHVQILRSKFERYDSD-SLLFKLVLSSCALSS--SLDYALSVFDQI 86
           ++L +  T+  LK  H  + +   +  D+D S + KLV  SC L +  SL +A  VF+  
Sbjct: 37  SSLKNCKTIDELKMFHRSLTK---QGLDNDVSTITKLVARSCELGTRESLSFAKEVFENS 96

Query: 87  PEPKTRFC-NKLLRELSRGPKPENALFVYEKMRAEGLSLDRFSFPPILKAASRNLSLRTG 146
               T F  N L+R  +       A+ ++ +M   G+S D+++FP  L A +++ +   G
Sbjct: 97  ESYGTCFMYNSLIRGYASSGLCNEAILLFLRMMNSGISPDKYTFPFGLSACAKSRAKGNG 156

Query: 147 MEIHGLASKLGFGPDPFVETGLVRMYAACGRLMEARLVFDKMSHRDVVTWSIMIDGYCIS 206
           ++IHGL  K+G+  D FV+  LV  YA CG L  AR VFD+MS R+VV+W+ MI GY   
Sbjct: 157 IQIHGLIVKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVVSWTSMICGYARR 216

Query: 207 GCYDLAFQLFEEMKR-TDMEPDEMILSTIISACARAGNLDYGTRIHEFITKKNIVMDPHL 266
                A  LF  M R  ++ P+ + +  +ISACA+  +L+ G +++ FI    I ++  +
Sbjct: 217 DFAKDAVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLETGEKVYAFIRNSGIEVNDLM 276

Query: 267 QSALITMYASCGSMDLA------------------------------------------- 326
            SAL+ MY  C ++D+A                                           
Sbjct: 277 VSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDSGV 336

Query: 327 -------------------------------------WD--------------------- 386
                                                WD                     
Sbjct: 337 RPDRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHRQDTAFR 396

Query: 387 LYEKISPKNMVVSTAMVSGLSKCGQIGDARYVFDQMVEKDLICWSAMISGYTESDCPQEA 446
           +++++S K +V   ++V+G  + G++  A   F+ M EK+++ W+ +ISG  +    +EA
Sbjct: 397 IFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGSLFEEA 456

Query: 447 LVLFKKMQLL-GIKPDVVTMLSVISACAHLGALEQANWICTYVDKNGFDKALSVNNALID 506
           + +F  MQ   G+  D VTM+S+ SAC HLGAL+ A WI  Y++KNG    + +   L+D
Sbjct: 457 IEVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRLGTTLVD 516

Query: 507 MYAKCGSLEGAREVFKKMPKKNVISWTCMINASAMHGDSHNALNLFHQMKDENVEPNWIT 566
           M+++CG  E A  +F  +  ++V +WT  I A AM G++  A+ LF  M ++ ++P+ + 
Sbjct: 517 MFSRCGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLKPDGVA 576

Query: 567 FVGVLYACSHGGLVEEGRKIFHSMINDYGISPKHEHFGCMVDLFGRANLLREALEMIEAM 626
           FVG L ACSHGGLV++G++IF+SM+  +G+SP+  H+GCMVDL GRA LL EA+++IE M
Sbjct: 577 FVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQLIEDM 636

Query: 627 PFAPNAIIWGSLMAACQVYGETELGEFAAKQVLKLEPNHDGAFVVLSNVYAKERRWEDVG 686
           P  PN +IW SL+AAC+V G  E+  +AA+++  L P   G++V+LSNVYA   RW D+ 
Sbjct: 637 PMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAGRWNDMA 696

Query: 687 EVRKLMNEMGVAKERGCSRVELNNEVHEFQMADRKHRQADQIYQKLDEVVQKLKMAGYTP 727
           +VR  M E G+ K  G S +++  + HEF   D  H +   I   LDEV Q+    G+ P
Sbjct: 697 KVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAMLDEVSQRASHLGHVP 756

BLAST of Moc05g05620 vs. TAIR 10
Match: AT3G22690.1 (CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT2G29760.1); Has 49784 Blast hits to 14716 proteins in 280 species: Archae - 2; Bacteria - 10; Metazoa - 107; Fungi - 167; Plants - 48594; Viruses - 0; Other Eukaryotes - 904 (source: NCBI BLink). )

HSP 1 Score: 526.9 bits (1356), Expect = 2.5e-149
Identity = 288/808 (35.64%), Postives = 440/808 (54.46%), Query Frame = 0

Query: 27  AALASASTLLHLKQVHVQILRSKFERYDSD-SLLFKLVLSSCALSS--SLDYALSVFDQI 86
           ++L +  T+  LK  H  + +   +  D+D S + KLV  SC L +  SL +A  VF+  
Sbjct: 37  SSLKNCKTIDELKMFHRSLTK---QGLDNDVSTITKLVARSCELGTRESLSFAKEVFENS 96

Query: 87  PEPKTRFC-NKLLRELSRGPKPENALFVYEKMRAEGLSLDRFSFPPILKAASRNLSLRTG 146
               T F  N L+R  +       A+ ++ +M   G+S D+++FP  L A +++ +   G
Sbjct: 97  ESYGTCFMYNSLIRGYASSGLCNEAILLFLRMMNSGISPDKYTFPFGLSACAKSRAKGNG 156

Query: 147 MEIHGLASKLGFGPDPFVETGLVRMYAACGRLMEARLVFDKMSHRDVVTWSIMIDGYCIS 206
           ++IHGL  K+G+  D FV+  LV  YA CG L  AR VFD+MS R+VV+W+ MI GY   
Sbjct: 157 IQIHGLIVKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVVSWTSMICGYARR 216

Query: 207 GCYDLAFQLFEEMKR-TDMEPDEMILSTIISACARAGNLDYGTRIHEFITKKNIVMDPHL 266
                A  LF  M R  ++ P+ + +  +ISACA+  +L+ G +++ FI    I ++  +
Sbjct: 217 DFAKDAVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLETGEKVYAFIRNSGIEVNDLM 276

Query: 267 QSALITMYASCGSMDLA------------------------------------------- 326
            SAL+ MY  C ++D+A                                           
Sbjct: 277 VSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDSGV 336

Query: 327 -------------------------------------WD--------------------- 386
                                                WD                     
Sbjct: 337 RPDRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHRQDTAFR 396

Query: 387 LYEKISPKNMVVSTAMVSGLSKCGQIGDARYVFDQMVEKDLICWSAMISGYTESDCPQEA 446
           +++++S K +V   ++V+G  + G++  A   F+ M EK+++ W+ +ISG  +    +EA
Sbjct: 397 IFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGSLFEEA 456

Query: 447 LVLFKKMQLL-GIKPDVVTMLSVISACAHLGALEQANWICTYVDKNGFDKALSVNNALID 506
           + +F  MQ   G+  D VTM+S+ SAC HLGAL+ A WI  Y++KNG    + +   L+D
Sbjct: 457 IEVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRLGTTLVD 516

Query: 507 MYAKCGSLEGAREVFKKMPKKNVISWTCMINASAMHGDSHNALNLFHQMKDENVEPNWIT 566
           M+++CG  E A  +F  +  ++V +WT  I A AM G++  A+ LF  M ++ ++P+ + 
Sbjct: 517 MFSRCGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLKPDGVA 576

Query: 567 FVGVLYACSHGGLVEEGRKIFHSMINDYGISPKHEHFGCMVDLFGRANLLREALEMIEAM 626
           FVG L ACSHGGLV++G++IF+SM+  +G+SP+  H+GCMVDL GRA LL EA+++IE M
Sbjct: 577 FVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQLIEDM 636

Query: 627 PFAPNAIIWGSLMAACQVYGETELGEFAAKQVLKLEPNHDGAFVVLSNVYAKERRWEDVG 686
           P  PN +IW SL+AAC+V G  E+  +AA+++  L P   G++V+LSNVYA   RW D+ 
Sbjct: 637 PMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAGRWNDMA 696

Query: 687 EVRKLMNEMGVAKERGCSRVELNNEVHEFQMADRKHRQADQIYQKLDEVVQKLKMAGYTP 726
           +VR  M E G+ K  G S +++  + HEF   D  H +   I   LDEV Q+    G+ P
Sbjct: 697 KVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAMLDEVSQRASHLGHVP 756

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022138165.10.0e+00100.00pentatricopeptide repeat-containing protein At4g14820 [Momordica charantia][more]
XP_038902272.10.0e+0088.57pentatricopeptide repeat-containing protein At4g14820 [Benincasa hispida][more]
XP_022959359.10.0e+0087.33pentatricopeptide repeat-containing protein At4g14820 [Cucurbita moschata] >XP_0... [more]
XP_023538947.10.0e+0087.47pentatricopeptide repeat-containing protein At4g14820 [Cucurbita pepo subsp. pep... [more]
XP_022974384.10.0e+0087.33pentatricopeptide repeat-containing protein At4g14820-like [Cucurbita maxima] >X... [more]
Match NameE-valueIdentityDescription
O233371.1e-24757.18Pentatricopeptide repeat-containing protein At4g14820 OS=Arabidopsis thaliana OX... [more]
O823802.3e-16841.43Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop... [more]
Q9LN012.7e-15639.27Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
Q9LUJ21.9e-14935.72Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX... [more]
Q9LTV84.4e-14336.83Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A6J1C8N90.0e+00100.00pentatricopeptide repeat-containing protein At4g14820 OS=Momordica charantia OX=... [more]
A0A6J1H7U50.0e+0087.33pentatricopeptide repeat-containing protein At4g14820 OS=Cucurbita moschata OX=3... [more]
A0A6J1IE290.0e+0087.33pentatricopeptide repeat-containing protein At4g14820-like OS=Cucurbita maxima O... [more]
A0A6J1ICZ70.0e+0085.91pentatricopeptide repeat-containing protein At4g14820-like isoform X2 OS=Cucurbi... [more]
A0A438KLK60.0e+0072.04Pentatricopeptide repeat-containing protein OS=Vitis vinifera OX=29760 GN=PCMP-H... [more]
Match NameE-valueIdentityDescription
AT4G14820.18.1e-24957.18Pentatricopeptide repeat (PPR) superfamily protein [more]
AT2G29760.11.7e-16941.43Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G08070.11.9e-15739.27Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G22690.21.3e-15035.72INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic pro... [more]
AT3G22690.12.5e-14935.64CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012... [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 294..321
e-value: 4.1E-5
score: 21.5
coord: 395..421
e-value: 8.2E-5
score: 20.5
coord: 190..224
e-value: 5.4E-10
score: 36.8
coord: 322..356
e-value: 8.9E-7
score: 26.7
coord: 459..491
e-value: 0.0028
score: 15.7
coord: 423..457
e-value: 3.5E-6
score: 24.8
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 420..467
e-value: 1.7E-10
score: 40.9
coord: 319..367
e-value: 5.8E-9
score: 36.0
coord: 187..236
e-value: 9.3E-12
score: 45.0
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 496..519
e-value: 0.28
score: 11.5
coord: 263..284
e-value: 1.2
score: 9.6
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 421..455
score: 10.928473
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 320..354
score: 11.049048
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 289..319
score: 8.758137
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 188..222
score: 13.372844
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 390..420
score: 8.61564
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 393..620
e-value: 5.6E-40
score: 139.6
coord: 21..136
e-value: 1.4E-7
score: 33.2
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 145..245
e-value: 4.5E-25
score: 90.0
coord: 246..378
e-value: 5.3E-29
score: 102.9
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 287..580
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 595..715
e-value: 7.8E-36
score: 122.7
NoneNo IPR availablePANTHERPTHR47924PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 20..217
NoneNo IPR availablePANTHERPTHR47924:SF1SUBFAMILY NOT NAMEDcoord: 302..719
NoneNo IPR availablePANTHERPTHR47924:SF1SUBFAMILY NOT NAMEDcoord: 20..217
NoneNo IPR availablePANTHERPTHR47924PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 208..300
NoneNo IPR availablePANTHERPTHR47924PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 302..719
NoneNo IPR availablePANTHERPTHR47924:SF1SUBFAMILY NOT NAMEDcoord: 208..300

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Moc05g05620.1Moc05g05620.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding