Cla97C05G081420 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C05G081420
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Pentatricopeptide repeat-containing protein
Location: Cla97Chr05: 1144435 .. 1147246 (-)
RNA-Seq Expression: Cla97C05G081420
Synteny: Cla97C05G081420
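The Location field can be read programmatically. The sketch below parses the location string and derives the length of the genomic span. It assumes the coordinates are 1-based and inclusive, which is common for genome browsers but is not stated on this page; the function names `parse_location` and `span_length` are illustrative only.

```python
import re

def parse_location(text):
    """Split 'SeqID: start .. end (strand)' into (seqid, start, end, strand)."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cla97Chr05: 1144435 .. 1147246 (-)")
length = span_length(start, end)
print(seqid, start, end, strand, length)
```

Under that assumption the gene spans 2812 bp on the minus strand of Cla97Chr05.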
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTAATATCTCATCTCTTATGATCTCCATTCGCCAAAACTCTCGATTTTTCAAGAACCTTAGGATCCACATCCGGAATCTCTCCGTGGAAACGAATGGAGGCAATAATGGATTTGAAGCGATTGAACCGAGTGAGAAACTATTGAGCCGTACTCACCGCCAAGACGTCAGTGAGATTGCGGAAAATGTTTGTAAGGTCATTAGGAGCAAACCCAGATGGGAGCAGACTCTGCTTTCTGATTACCCTTCTTTCAATTTCCATGACCCGTCTTTTTTTCGCGAGCTTTTGAAGCAGTTGAACAATGTTTTGCTTTCTTTGAGGTTTTTTCTTTGGTTGAGTTCGCAGCCTGAGTTCTTGCCCCATCCAGTCAGTTGCAACACGCTTTTTGATGCTCTTTTGGAGGCCAGGGCTTGTGTTCCCGCTAAATCTTTTCTTCATTCTTTCGGTTTTAGTCCCGAGCCTGCCTCTTTGGAGAATTACATTCGATGTGTTTGCGAGGGTGGTTTGGTTGAGGAAGCTGTTAATGTATTTGATGTGTTAAAAGAGACTGGATATCGTCCATCCATTGAGACGTGGAACTTTGCTTTACAGAGTTGTCTTAAGTTTGGGAGGACTGATCTTATTTGGAAACTGTATGAAGAGATGATTGAAGCTGGTGTGCAGAAGGATGTGGGGATAGAGACTGTGGGGTATCTTATCCAGGCATTTTGCAGCGATAACAAGGTTTCAAGAGCTTATGAACTTCTAAGACAGGCTTTAGAGGATGGATTGGCCCCTTGTAATGATGCTTTCAACAAATTGATTTCTGGGTTCTGCGAGGAGGAGGATTATGATAGAGTATCAGAACTTCTCCACACAATGATAGCTAGGAAACGTACTCCCGATATTTTTACCTACCAGAAAATCATTAACGGGCTCTGCAAGAAAGGGAAGCAGCTGGAGGCATTTGAGGTTTTCAATGTCCTTAAGGATCGGGGATATGCTCCTGATAAGGTCATGTATACAACAATGATTGTTGGCCTTTGTAAGATGAGGTGGCTTGGAGATGCTAAAAAGCTGTGGTTTGAGATGATTGATAAGGGATTTCTTCCAAATGAGTATACGTACAACACGTTGATTAATGGATTTTGTAAGATTGGAAAGCTGGATGAGGCCTCTAAGCTATGTAAGGAAATGCATGATAGAGGTTATAAAAAAACCACTCTCAGCTGCAACATAGTGTCTACACGGAAGGACAGATGAAGCATAGGACTTCTATTGAGAAATGCCTTCCAAGGATGTAAATTCGCGATGTTCTAACATTCAATACCCTGATTCATGGATTTTGGAGAAATAAGATATTGCAGAGGACAGACCTATTCAAAGAACTGCTAGAGCAGGGGTTGCAGCCTTCAACTGCGTTTTGTACTCATCACATTAAAAAGCTTTGCCAATTAGGTAGAGTGAAAGAATCAAAGAAAATGTGGAATGACTTGCATAATAGAGGTCTTCAGCCGATGGTCTGCACTCATGAGCACATAATTAATGGATTATGTAAACAAGGATGTGTGGTAGAGGAGATGGATGGGTTGATAATTATGTTGAAGAGCAATCTCAAGCTTCAAAAGCGGACTTTTGTTAAGGTGGTTCAGAGTTTCATTCAAATGGCTAAATTAGATGATTCTTTATCAGTCTTAGGCTCAATGCTTAGAGTACGTTATAAACTAGAAAAAGGCACTTCAGTTATCTCTTGGAAAAACTATGTGGAAACAAGTATCAGTTTGTTGAATCACAATTACAGGAAATCATAAATTCCAACCAGTAGGAGATTCTCTATATATTCTAGGGAGGCTAGTTTAAGATGGTTGTACTCCTGTCATCGATGTTCCTAAAGGCAGATCCGTTGAATGCTTCTTTTGTGGGGGGAGATGGAAGCATGTTAGAAGGGATCTTTTTAACCCAGAAAGCTGTTCTTGCATCGACCGAATAGCCTCTCTGATTCATCTCGTCAGCTGGGA
AAAATGGTTTCAACTTGGAAGAAGATTCAATGTGGAAACAGTGTTTATAACCTAGTGTATGTTGTTGGTGGAGAATTCTGTAGCCGAGTTGCAGTGGTTCTTAGGTCCAGGAGTCAATATTTCATTTATGAGAAGTTGATTGTTTAGATTTTCTCCACGCACGCATTGGGTTATAGAATTTCCTTGTCATGTAACTTATCTCCTACCATACGGGCTGCGTTATTGCTGAATAGTTTTAAATTTCACACTAAGTTTAATTCTCATTCCAAATATTAAAATCAGTATCAAAGCAATACATCTCATATGAAGTCTGGTTTAATAAAATTTTATAAATTTCATACACTGTTTCCTTTTTCTGCATTCTTGACCCCTTCCCCTTCCATTTCCAAAACCGTTTCTTTGGGCGAAAACAGCTGCTTCATCATCTTCTATTGTGGTCTGCAGAACTCCCGCATTTGGCAGGGCAGATAGGGAATCTCTTCACAACTTACAGTGTTTGCTTTGCTTTCATTCAGGCATTCATTCGCTGCAGTAATTTTTAAGATAGACAGCAAAACTTTGAAAAAAGTATATACCATTTGGCAGTGACTAAATCCTCTAGCAATTCCCCCCACTTTTTCTGATTGGTCAACTGCTCTTTGCTAAAGACAGGATATCCCTAGTTACAAAATATGTTGGATCATTCTCATCTTCGAAACACAGCTCCATATTTCCTTTTCCACCCTCCCATCAAGGAAGAAAAAGAACTTGTTTGCGAAGCCCATAGTTCATACAGGCCATCATCAGGGAACATTTATGAGCAATGCCTTTAA

mRNA sequence

ATGGCTAATATCTCATCTCTTATGATCTCCATTCGCCAAAACTCTCGATTTTTCAAGAACCTTAGGATCCACATCCGGAATCTCTCCGTGGAAACGAATGGAGGCAATAATGGATTTGAAGCGATTGAACCGAGTGAGAAACTATTGAGCCGTACTCACCGCCAAGACGTCAGTGAGATTGCGGAAAATGTTTGTAAGGTCATTAGGAGCAAACCCAGATGGGAGCAGACTCTGCTTTCTGATTACCCTTCTTTCAATTTCCATGACCCGTCTTTTTTTCGCGAGCTTTTGAAGCAGTTGAACAATGTTTTGCTTTCTTTGAGGTTTTTTCTTTGGTTGAGTTCGCAGCCTGAGTTCTTGCCCCATCCAGTCAGTTGCAACACGCTTTTTGATGCTCTTTTGGAGGCCAGGGCTTGTGTTCCCGCTAAATCTTTTCTTCATTCTTTCGGTTTTAGTCCCGAGCCTGCCTCTTTGGAGAATTACATTCGATGTGTTTGCGAGGGTGGTTTGGTTGAGGAAGCTGTTAATGTATTTGATGTGTTAAAAGAGACTGGATATCGTCCATCCATTGAGACGTGGAACTTTGCTTTACAGAGTTGTCTTAAGTTTGGGAGGACTGATCTTATTTGGAAACTGTATGAAGAGATGATTGAAGCTGGTGTGCAGAAGGATGTGGGGATAGAGACTGTGGGGTATCTTATCCAGGCATTTTGCAGCGATAACAAGGTTTCAAGAGCTTATGAACTTCTAAGACAGGCTTTAGAGGATGGATTGGCCCCTTGTAATGATGCTTTCAACAAATTGATTTCTGGGTTCTGCGAGGAGGAGGATTATGATAGAGTATCAGAACTTCTCCACACAATGATAGCTAGGAAACGTACTCCCGATATTTTTACCTACCAGAAAATCATTAACGGGCTCTGCAAGAAAGGGAAGCAGCTGGAGGCATTTGAGGTTTTCAATGTCCTTAAGGATCGGGGATATGCTCCTGATAAGGTCATGTATACAACAATGATTGTTGGCCTTTGTAAGATGAGGTGGCTTGGAGATGCTAAAAAGCTGTGGTTTGAGATGATTGATAAGGGATTTCTTCCAAATGAGTATACGTACAACACGTTGATTAATGGATTTTGTAAGATTGGAAAGCTGGATGAGGCCTCTAAGCTATGTAAGGAAATGCATGATAGAGACCTATTCAAAGAACTGCTAGAGCAGGGGTTGCAGCCTTCAACTGCGTTTTGTACTCATCACATTAAAAAGCTTTGCCAATTAGGTAGAGTGAAAGAATCAAAGAAAATGTGGAATGACTTGCATAATAGAGGTCTTCAGCCGATGGTCTGCACTCATGAGCACATAATTAATGGATTATGTAAACAAGGATGTGTGGTAGAGGAGATGGATGGGTTGATAATTATGTTGAAGAGCAATCTCAAGCTTCAAAAGCGGACTTTTGTTAAGGTGGTTCAGAGTTTCATTCAAATGGCTAAATTAGATGATTCTTTATCAGTCTTAGGCTCAATGCTTAGACTGCTTCATCATCTTCTATTGTGGTCTGCAGAACTCCCGCATTTGGCAGGGCAGATAGGGAATCTCTTCACAACTTACAGTGTTTGCTTTGCTTTCATTCAGGCATTCATTCGCTGCACTCCATATTTCCTTTTCCACCCTCCCATCAAGGAAGAAAAAGAACTTGTTTGCGAAGCCCATAGTTCATACAGGCCATCATCAGGGAACATTTATGAGCAATGCCTTTAA

Coding sequence (CDS)

ATGGCTAATATCTCATCTCTTATGATCTCCATTCGCCAAAACTCTCGATTTTTCAAGAACCTTAGGATCCACATCCGGAATCTCTCCGTGGAAACGAATGGAGGCAATAATGGATTTGAAGCGATTGAACCGAGTGAGAAACTATTGAGCCGTACTCACCGCCAAGACGTCAGTGAGATTGCGGAAAATGTTTGTAAGGTCATTAGGAGCAAACCCAGATGGGAGCAGACTCTGCTTTCTGATTACCCTTCTTTCAATTTCCATGACCCGTCTTTTTTTCGCGAGCTTTTGAAGCAGTTGAACAATGTTTTGCTTTCTTTGAGGTTTTTTCTTTGGTTGAGTTCGCAGCCTGAGTTCTTGCCCCATCCAGTCAGTTGCAACACGCTTTTTGATGCTCTTTTGGAGGCCAGGGCTTGTGTTCCCGCTAAATCTTTTCTTCATTCTTTCGGTTTTAGTCCCGAGCCTGCCTCTTTGGAGAATTACATTCGATGTGTTTGCGAGGGTGGTTTGGTTGAGGAAGCTGTTAATGTATTTGATGTGTTAAAAGAGACTGGATATCGTCCATCCATTGAGACGTGGAACTTTGCTTTACAGAGTTGTCTTAAGTTTGGGAGGACTGATCTTATTTGGAAACTGTATGAAGAGATGATTGAAGCTGGTGTGCAGAAGGATGTGGGGATAGAGACTGTGGGGTATCTTATCCAGGCATTTTGCAGCGATAACAAGGTTTCAAGAGCTTATGAACTTCTAAGACAGGCTTTAGAGGATGGATTGGCCCCTTGTAATGATGCTTTCAACAAATTGATTTCTGGGTTCTGCGAGGAGGAGGATTATGATAGAGTATCAGAACTTCTCCACACAATGATAGCTAGGAAACGTACTCCCGATATTTTTACCTACCAGAAAATCATTAACGGGCTCTGCAAGAAAGGGAAGCAGCTGGAGGCATTTGAGGTTTTCAATGTCCTTAAGGATCGGGGATATGCTCCTGATAAGGTCATGTATACAACAATGATTGTTGGCCTTTGTAAGATGAGGTGGCTTGGAGATGCTAAAAAGCTGTGGTTTGAGATGATTGATAAGGGATTTCTTCCAAATGAGTATACGTACAACACGTTGATTAATGGATTTTGTAAGATTGGAAAGCTGGATGAGGCCTCTAAGCTATGTAAGGAAATGCATGATAGAGACCTATTCAAAGAACTGCTAGAGCAGGGGTTGCAGCCTTCAACTGCGTTTTGTACTCATCACATTAAAAAGCTTTGCCAATTAGGTAGAGTGAAAGAATCAAAGAAAATGTGGAATGACTTGCATAATAGAGGTCTTCAGCCGATGGTCTGCACTCATGAGCACATAATTAATGGATTATGTAAACAAGGATGTGTGGTAGAGGAGATGGATGGGTTGATAATTATGTTGAAGAGCAATCTCAAGCTTCAAAAGCGGACTTTTGTTAAGGTGGTTCAGAGTTTCATTCAAATGGCTAAATTAGATGATTCTTTATCAGTCTTAGGCTCAATGCTTAGACTGCTTCATCATCTTCTATTGTGGTCTGCAGAACTCCCGCATTTGGCAGGGCAGATAGGGAATCTCTTCACAACTTACAGTGTTTGCTTTGCTTTCATTCAGGCATTCATTCGCTGCACTCCATATTTCCTTTTCCACCCTCCCATCAAGGAAGAAAAAGAACTTGTTTGCGAAGCCCATAGTTCATACAGGCCATCATCAGGGAACATTTATGAGCAATGCCTTTAA

Protein sequence

MANISSLMISIRQNSRFFKNLRIHIRNLSVETNGGNNGFEAIEPSEKLLSRTHRQDVSEIAENVCKVIRSKPRWEQTLLSDYPSFNFHDPSFFRELLKQLNNVLLSLRFFLWLSSQPEFLPHPVSCNTLFDALLEARACVPAKSFLHSFGFSPEPASLENYIRCVCEGGLVEEAVNVFDVLKETGYRPSIETWNFALQSCLKFGRTDLIWKLYEEMIEAGVQKDVGIETVGYLIQAFCSDNKVSRAYELLRQALEDGLAPCNDAFNKLISGFCEEEDYDRVSELLHTMIARKRTPDIFTYQKIINGLCKKGKQLEAFEVFNVLKDRGYAPDKVMYTTMIVGLCKMRWLGDAKKLWFEMIDKGFLPNEYTYNTLINGFCKIGKLDEASKLCKEMHDRDLFKELLEQGLQPSTAFCTHHIKKLCQLGRVKESKKMWNDLHNRGLQPMVCTHEHIINGLCKQGCVVEEMDGLIIMLKSNLKLQKRTFVKVVQSFIQMAKLDDSLSVLGSMLRLLHHLLLWSAELPHLAGQIGNLFTTYSVCFAFIQAFIRCTPYFLFHPPIKEEKELVCEAHSSYRPSSGNIYEQCL
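The protein sequence above corresponds to a translation of the coding sequence. As a minimal sketch, the following translates a CDS fragment with the standard genetic code; the page does not say which translation table was used, so the standard table is an assumption. Only the first 21 bases of the CDS are used here to keep the example short, and `translate` is an illustrative helper, not part of any database tooling.

```python
# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

cds_start = "ATGGCTAATATCTCATCTCTT"  # first 21 bases of the CDS above
print(translate(cds_start))  # MANISSL
```

The output matches the first seven residues of the protein sequence. Passing the full CDS string to `translate` would allow the same comparison over the entire protein.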
Homology
BLAST of Cla97C05G081420 vs. NCBI nr
Match: XP_038891677.1 (pentatricopeptide repeat-containing protein At5g18950-like isoform X1 [Benincasa hispida])

HSP 1 Score: 889.4 bits (2297), Expect = 1.7e-254
Identity = 447/575 (77.74%), Positives = 478/575 (83.13%), Query Frame = 0

Query: 1   MANISSLMISIRQNSRFFKNLRIHIRNLSVETNGGNNGFEAIEPSEKLLSRTHRQDVSEI 60
           MANI+SLMISIRQNSRF KNLRIHIRNLSVETNGGN G+E IEPSEKLL+ T RQDVSEI
Sbjct: 1   MANITSLMISIRQNSRFAKNLRIHIRNLSVETNGGNKGYEEIEPSEKLLNPTRRQDVSEI 60

Query: 61  AENVCKVIRSKPRWEQTLLSDYPSFNFHDPSFFRELLKQLNNVLLSLRFFLWLSSQPEFL 120
           AE VCKVIR+KPRWEQTLLSDYPSFNFHDPSFFRELLKQLNNVLLSLRFFLWLSSQP FL
Sbjct: 61  AEEVCKVIRNKPRWEQTLLSDYPSFNFHDPSFFRELLKQLNNVLLSLRFFLWLSSQPNFL 120

Query: 121 PHPVSCNTLFDALLEARACVPAKSFLHSFGFSPEPASLENYIRCVCEGGLVEEAVNVFDV 180
           PHPVSCNTLFDALLEA+ACVPAKSFLHSFGFSPEPASLENYIRCVCEGGLVEEAVNV DV
Sbjct: 121 PHPVSCNTLFDALLEAKACVPAKSFLHSFGFSPEPASLENYIRCVCEGGLVEEAVNVLDV 180

Query: 181 LKETGYRPSIETWNFALQSCLKFGRTDLIWKLYEEMIEAGVQKDVGIETVGYLIQAFCSD 240
           LK TGYRPSIETWNFAL+SCL+F RTDLIWKLYEEM++AGVQKDVGIETVGYLIQAFCSD
Sbjct: 181 LKGTGYRPSIETWNFALRSCLQFERTDLIWKLYEEMMKAGVQKDVGIETVGYLIQAFCSD 240

Query: 241 NKVSRAYELLRQALEDGLAPCNDAFNKLISGFCEEEDYDRVSELLHTMIARKRTPDIFTY 300
           NKVSRAYELLRQALEDGLAPCNDAFNKLIS FCEE++YDRVSELLHTMIA+ R PDIFTY
Sbjct: 241 NKVSRAYELLRQALEDGLAPCNDAFNKLISRFCEEKNYDRVSELLHTMIAKNRNPDIFTY 300

Query: 301 QKIINGLCKKGKQLEAFEVFNVLKDRGYAPDKVMYTTMIVGLCKMRWLGDAKKLWFEMID 360
           Q+IINGLCK+ KQL+AFEVFN LK RGYAPD VMYTTMI GLCKMRWLGDA+KLWFEMI 
Sbjct: 301 QEIINGLCKERKQLQAFEVFNALKGRGYAPDMVMYTTMIHGLCKMRWLGDARKLWFEMIV 360

Query: 361 KGFLPNEYTYNTLINGFCKIGKLDEASKLCKEMHDR------------------------ 420
           KGFLPNEYTYNTLI G+CKIG LDEA KL KEMHDR                        
Sbjct: 361 KGFLPNEYTYNTLIYGYCKIGNLDEALKLYKEMHDRGSKETTLSCNILIAGLCLHGRTDE 420

Query: 421 -------------------------------------DLFKELLEQGLQPSTAFCTHHIK 480
                                                DLFKELLEQGLQPST+  TH I+
Sbjct: 421 AYNFFREMPCKDVVCDIVTYNTLIQGFCREGKILQSTDLFKELLEQGLQPSTSSYTHLIQ 480

Query: 481 KLCQLGRVKESKKMWNDLHNRGLQPMVCTHEHIINGLCKQGCVVEEMDGLIIMLKSNLKL 515
           KLCQ+G V+E+KKMWNDLHNRGLQPMVCT +HIINGLC+QGCVVE M+ LI MLKSNLK 
Sbjct: 481 KLCQVGSVQEAKKMWNDLHNRGLQPMVCTRDHIINGLCEQGCVVEGMEWLITMLKSNLKP 540
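The percentages on an HSP summary line are the match counts divided by the alignment length. The sketch below recomputes them from the fractions reported for this hit, which can serve as a sanity check when parsing such output. The helper name `hsp_percentages` is hypothetical, and the regular expression is written only for the line layout shown on this page.

```python
import re

def hsp_percentages(line):
    """Recompute percentages from 'Label = hits/total' fractions on an HSP line."""
    result = {}
    for label, hits, total in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", line):
        result[label] = round(100 * int(hits) / int(total), 2)
    return result

hsp_line = "Identity = 447/575 (77.74%), Positives = 478/575 (83.13%), Query Frame = 0"
percentages = hsp_percentages(hsp_line)
print(percentages)
```

For this hit the recomputed values are 77.74 and 83.13, in agreement with the reported figures. Fields without a fraction, such as the query frame, are ignored by the pattern.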

BLAST of Cla97C05G081420 vs. NCBI nr
Match: XP_038891680.1 (pentatricopeptide repeat-containing protein At5g18950-like isoform X2 [Benincasa hispida])

HSP 1 Score: 865.1 bits (2234), Expect = 3.3e-247
Identity = 433/555 (78.02%), Positives = 461/555 (83.06%), Query Frame = 0

Query: 1   MANISSLMISIRQNSRFFKNLRIHIRNLSVETNGGNNGFEAIEPSEKLLSRTHRQDVSEI 60
           MANI+SLMISIRQNSRF KNLRIHIRNLSVETNGGN G+E IEPSEKLL+ T RQDVSEI
Sbjct: 1   MANITSLMISIRQNSRFAKNLRIHIRNLSVETNGGNKGYEEIEPSEKLLNPTRRQDVSEI 60

Query: 61  AENVCKVIRSKPRWEQTLLSDYPSFNFHDPSFFRELLKQLNNVLLSLRFFLWLSSQPEFL 120
           AE VCKVIR+KPRWEQTLLSDYPSFNFHDPSFFRELLKQLNNVLLSLRFFLWLSSQP FL
Sbjct: 61  AEEVCKVIRNKPRWEQTLLSDYPSFNFHDPSFFRELLKQLNNVLLSLRFFLWLSSQPNFL 120

Query: 121 PHPVSCNTLFDALLEARACVPAKSFLHSFGFSPEPASLENYIRCVCEGGLVEEAVNVFDV 180
           PHPVSCNTLFDALLEA+ACVPAKSFLHSFGFSPEPASLENYIRCVCEGGLVEEAVNV DV
Sbjct: 121 PHPVSCNTLFDALLEAKACVPAKSFLHSFGFSPEPASLENYIRCVCEGGLVEEAVNVLDV 180

Query: 181 LKETGYRPSIETWNFALQSCLKFGRTDLIWKLYEEMIEAGVQKDVGIETVGYLIQAFCSD 240
           LK TGYRPSIETWNFAL+SCL+F RTDLIWKLYEEM++AGVQKDVGIETVGYLIQAFCSD
Sbjct: 181 LKGTGYRPSIETWNFALRSCLQFERTDLIWKLYEEMMKAGVQKDVGIETVGYLIQAFCSD 240

Query: 241 NKVSRAYELLRQALEDGLAPCNDAFNKLISGFCEEEDYDRVSELLHTMIARKRTPDIFTY 300
           NKVSRAYELLRQALEDGLAPCNDAFNKLIS FCEE++YDRVSELLHTMIA+ R PDIFTY
Sbjct: 241 NKVSRAYELLRQALEDGLAPCNDAFNKLISRFCEEKNYDRVSELLHTMIAKNRNPDIFTY 300

Query: 301 QKIINGLCKKGKQLEAFEVFNVLKDRGYAPDKVMYTTMIVGLCKMRWLGDAKKLWFEMID 360
           Q+IINGLCK+ KQL+AFEVFN LK RGYAPD VMYTTMI GLCKMRWLGDA+KLWFEMI 
Sbjct: 301 QEIINGLCKERKQLQAFEVFNALKGRGYAPDMVMYTTMIHGLCKMRWLGDARKLWFEMIV 360

Query: 361 KGFLPNEYTYNTLINGFCKIGKLDEASKLCKEMHDR------------------------ 420
           KGFLPNEYTYNTLI G+CKIG LDEA KL KEMHDR                        
Sbjct: 361 KGFLPNEYTYNTLIYGYCKIGNLDEALKLYKEMHDRGSKETTLSCNILIAGLCLHGRTDE 420

Query: 421 -------------------------------------DLFKELLEQGLQPSTAFCTHHIK 480
                                                DLFKELLEQGLQPST+  TH I+
Sbjct: 421 AYNFFREMPCKDVVCDIVTYNTLIQGFCREGKILQSTDLFKELLEQGLQPSTSSYTHLIQ 480

Query: 481 KLCQLGRVKESKKMWNDLHNRGLQPMVCTHEHIINGLCKQGCVVEEMDGLIIMLKSNLKL 495
           KLCQ+G V+E+KKMWNDLHNRGLQPMVCT +HIINGLC+QGCVVE M+ LI MLKSNLK 
Sbjct: 481 KLCQVGSVQEAKKMWNDLHNRGLQPMVCTRDHIINGLCEQGCVVEGMEWLITMLKSNLKP 540

BLAST of Cla97C05G081420 vs. NCBI nr
Match: XP_038891688.1 (pentatricopeptide repeat-containing protein At5g18950-like isoform X3 [Benincasa hispida])

HSP 1 Score: 865.1 bits (2234), Expect = 3.3e-247
Identity = 433/555 (78.02%), Positives = 461/555 (83.06%), Query Frame = 0

Query: 1   MANISSLMISIRQNSRFFKNLRIHIRNLSVETNGGNNGFEAIEPSEKLLSRTHRQDVSEI 60
           MANI+SLMISIRQNSRF KNLRIHIRNLSVETNGGN G+E IEPSEKLL+ T RQDVSEI
Sbjct: 1   MANITSLMISIRQNSRFAKNLRIHIRNLSVETNGGNKGYEEIEPSEKLLNPTRRQDVSEI 60

Query: 61  AENVCKVIRSKPRWEQTLLSDYPSFNFHDPSFFRELLKQLNNVLLSLRFFLWLSSQPEFL 120
           AE VCKVIR+KPRWEQTLLSDYPSFNFHDPSFFRELLKQLNNVLLSLRFFLWLSSQP FL
Sbjct: 61  AEEVCKVIRNKPRWEQTLLSDYPSFNFHDPSFFRELLKQLNNVLLSLRFFLWLSSQPNFL 120

Query: 121 PHPVSCNTLFDALLEARACVPAKSFLHSFGFSPEPASLENYIRCVCEGGLVEEAVNVFDV 180
           PHPVSCNTLFDALLEA+ACVPAKSFLHSFGFSPEPASLENYIRCVCEGGLVEEAVNV DV
Sbjct: 121 PHPVSCNTLFDALLEAKACVPAKSFLHSFGFSPEPASLENYIRCVCEGGLVEEAVNVLDV 180

Query: 181 LKETGYRPSIETWNFALQSCLKFGRTDLIWKLYEEMIEAGVQKDVGIETVGYLIQAFCSD 240
           LK TGYRPSIETWNFAL+SCL+F RTDLIWKLYEEM++AGVQKDVGIETVGYLIQAFCSD
Sbjct: 181 LKGTGYRPSIETWNFALRSCLQFERTDLIWKLYEEMMKAGVQKDVGIETVGYLIQAFCSD 240

Query: 241 NKVSRAYELLRQALEDGLAPCNDAFNKLISGFCEEEDYDRVSELLHTMIARKRTPDIFTY 300
           NKVSRAYELLRQALEDGLAPCNDAFNKLIS FCEE++YDRVSELLHTMIA+ R PDIFTY
Sbjct: 241 NKVSRAYELLRQALEDGLAPCNDAFNKLISRFCEEKNYDRVSELLHTMIAKNRNPDIFTY 300

Query: 301 QKIINGLCKKGKQLEAFEVFNVLKDRGYAPDKVMYTTMIVGLCKMRWLGDAKKLWFEMID 360
           Q+IINGLCK+ KQL+AFEVFN LK RGYAPD VMYTTMI GLCKMRWLGDA+KLWFEMI 
Sbjct: 301 QEIINGLCKERKQLQAFEVFNALKGRGYAPDMVMYTTMIHGLCKMRWLGDARKLWFEMIV 360

Query: 361 KGFLPNEYTYNTLINGFCKIGKLDEASKLCKEMHDR------------------------ 420
           KGFLPNEYTYNTLI G+CKIG LDEA KL KEMHDR                        
Sbjct: 361 KGFLPNEYTYNTLIYGYCKIGNLDEALKLYKEMHDRGSKETTLSCNILIAGLCLHGRTDE 420

Query: 421 -------------------------------------DLFKELLEQGLQPSTAFCTHHIK 480
                                                DLFKELLEQGLQPST+  TH I+
Sbjct: 421 AYNFFREMPCKDVVCDIVTYNTLIQGFCREGKILQSTDLFKELLEQGLQPSTSSYTHLIQ 480

Query: 481 KLCQLGRVKESKKMWNDLHNRGLQPMVCTHEHIINGLCKQGCVVEEMDGLIIMLKSNLKL 495
           KLCQ+G V+E+KKMWNDLHNRGLQPMVCT +HIINGLC+QGCVVE M+ LI MLKSNLK 
Sbjct: 481 KLCQVGSVQEAKKMWNDLHNRGLQPMVCTRDHIINGLCEQGCVVEGMEWLITMLKSNLKP 540

BLAST of Cla97C05G081420 vs. NCBI nr
Match: XP_022924328.1 (pentatricopeptide repeat-containing protein At5g18950-like [Cucurbita moschata] >XP_022924329.1 pentatricopeptide repeat-containing protein At5g18950-like [Cucurbita moschata])

HSP 1 Score: 847.0 bits (2187), Expect = 9.4e-242
Identity = 426/575 (74.09%), Positives = 463/575 (80.52%), Query Frame = 0

Query: 1   MANISSLMISIRQNSRFFKNLRIHIRNLSVETNGGNNGFEAIEPSEKLLSRTHRQDVSEI 60
           MANI+SLMISIRQNSR  KNL IHIRNLSVE NGG +G E I  SEKLL+ TH QDV+EI
Sbjct: 1   MANITSLMISIRQNSRLVKNLGIHIRNLSVEMNGGKDGCEEINLSEKLLNPTHCQDVTEI 60

Query: 61  AENVCKVIRSKPRWEQTLLSDYPSFNFHDPSFFRELLKQLNNVLLSLRFFLWLSSQPEFL 120
           ++ VCKVIRSKP+WEQTLLSDYPSFNFHDPSFFRELLKQLNNVLLSLRFFLWLSSQPEFL
Sbjct: 61  SKEVCKVIRSKPKWEQTLLSDYPSFNFHDPSFFRELLKQLNNVLLSLRFFLWLSSQPEFL 120

Query: 121 PHPVSCNTLFDALLEARACVPAKSFLHSFGFSPEPASLENYIRCVCEGGLVEEAVNVFDV 180
           PHPVSCNTLFDALLEA+ACVPAKSFLHSFGFSPEPASLENYIRCVCEGGLVEEAVNVFDV
Sbjct: 121 PHPVSCNTLFDALLEAKACVPAKSFLHSFGFSPEPASLENYIRCVCEGGLVEEAVNVFDV 180

Query: 181 LKETGYRPSIETWNFALQSCLKFGRTDLIWKLYEEMIEAGVQKDVGIETVGYLIQAFCSD 240
           LKE GYRPSIETWNFA QSCLKFGRTDLIWKLY EM+E GVQ +VGIETVGYLIQA CSD
Sbjct: 181 LKEAGYRPSIETWNFAFQSCLKFGRTDLIWKLYAEMMETGVQANVGIETVGYLIQALCSD 240

Query: 241 NKVSRAYELLRQALEDGLAPCNDAFNKLISGFCEEEDYDRVSELLHTMIARKRTPDIFTY 300
           N VS+AYELLRQALEDGLAP NDAFNKLIS FCEE++YDRVSELLHTMIA+ R PDIFTY
Sbjct: 241 NNVSKAYELLRQALEDGLAPSNDAFNKLISLFCEEKNYDRVSELLHTMIAKNRNPDIFTY 300

Query: 301 QKIINGLCKKGKQLEAFEVFNVLKDRGYAPDKVMYTTMIVGLCKMRWLGDAKKLWFEMID 360
           Q+IINGLCKK +QL+AF+VFN LKDRGYAPD VMYTTMI GLCKM WLGDA+KLWFEMID
Sbjct: 301 QEIINGLCKKHRQLQAFQVFNGLKDRGYAPDMVMYTTMIHGLCKMSWLGDARKLWFEMID 360

Query: 361 KGFLPNEYTYNTLINGFCKIGKLDEASKLCKEMHD------------------------- 420
           KGFLPNEYTYNTLI GFCKIG LDEA KL KEMHD                         
Sbjct: 361 KGFLPNEYTYNTLIYGFCKIGNLDEALKLYKEMHDRGYKETTLSCNTVIAGLCLHGRTDE 420

Query: 421 ------------------------------------RDLFKELLEQGLQPSTAFCTHHIK 480
                                               RDL KELLE+GLQPST+  TH I+
Sbjct: 421 AYNFFREMPCKDVVCDVVTYNTLIQGFCREEKILQCRDLLKELLEKGLQPSTSSYTHLIQ 480

Query: 481 KLCQLGRVKESKKMWNDLHNRGLQPMVCTHEHIINGLCKQGCVVEEMDGLIIMLKSNLKL 515
           KLCQ+G V+E+KKMWND+HNRGLQPMVCT +HIINGLC+QGC VE ++ L+ MLKSNLK 
Sbjct: 481 KLCQVGDVQEAKKMWNDMHNRGLQPMVCTRDHIINGLCEQGCAVEGVEWLMTMLKSNLKP 540

BLAST of Cla97C05G081420 vs. NCBI nr
Match: XP_022979328.1 (pentatricopeptide repeat-containing protein At5g18950-like [Cucurbita maxima])

HSP 1 Score: 846.7 bits (2186), Expect = 1.2e-241
Identity = 425/575 (73.91%), Positives = 462/575 (80.35%), Query Frame = 0

Query: 1   MANISSLMISIRQNSRFFKNLRIHIRNLSVETNGGNNGFEAIEPSEKLLSRTHRQDVSEI 60
           MANI+SLMISIRQNSR  KNLRIHIRNLSVE NGG +G E I  SEKLL+ TH QDV+EI
Sbjct: 1   MANITSLMISIRQNSRLVKNLRIHIRNLSVEMNGGKDGCEEINLSEKLLNPTHSQDVTEI 60

Query: 61  AENVCKVIRSKPRWEQTLLSDYPSFNFHDPSFFRELLKQLNNVLLSLRFFLWLSSQPEFL 120
           ++ VCKVIRSKP+WEQTLLSDYPSFNFHDPSFFRELLKQLNNVLLSLRFFLWLSSQPEFL
Sbjct: 61  SKEVCKVIRSKPKWEQTLLSDYPSFNFHDPSFFRELLKQLNNVLLSLRFFLWLSSQPEFL 120

Query: 121 PHPVSCNTLFDALLEARACVPAKSFLHSFGFSPEPASLENYIRCVCEGGLVEEAVNVFDV 180
           PHPVSCNTLFDALLEA+ACVPAKSFLHSFGFSPEPASLENYIRCVCEGGLVEEAVNVFDV
Sbjct: 121 PHPVSCNTLFDALLEAKACVPAKSFLHSFGFSPEPASLENYIRCVCEGGLVEEAVNVFDV 180

Query: 181 LKETGYRPSIETWNFALQSCLKFGRTDLIWKLYEEMIEAGVQKDVGIETVGYLIQAFCSD 240
           LKE GYRPSIETWNFA QSCLKFGR DLIWKLY EM+E GVQ DVGIETVGYLIQA CSD
Sbjct: 181 LKEAGYRPSIETWNFAFQSCLKFGRIDLIWKLYAEMMETGVQADVGIETVGYLIQALCSD 240

Query: 241 NKVSRAYELLRQALEDGLAPCNDAFNKLISGFCEEEDYDRVSELLHTMIARKRTPDIFTY 300
           N V++AYELLRQALEDGLAP NDAFNKLIS FCEE++YDRVSELLHTMIA+ R PDIFTY
Sbjct: 241 NNVTKAYELLRQALEDGLAPSNDAFNKLISVFCEEKNYDRVSELLHTMIAKNRNPDIFTY 300

Query: 301 QKIINGLCKKGKQLEAFEVFNVLKDRGYAPDKVMYTTMIVGLCKMRWLGDAKKLWFEMID 360
           Q+IIN LCKK +QL+AF+VFN LKDRGYAPD VMYTTMI GLCKM WLGDA+KLWFEMID
Sbjct: 301 QEIINRLCKKHRQLQAFQVFNSLKDRGYAPDMVMYTTMIHGLCKMSWLGDARKLWFEMID 360

Query: 361 KGFLPNEYTYNTLINGFCKIGKLDEASKLCKEMHD------------------------- 420
           KGFLPNEYTYNTLI GFCKIG LDEA KL KEMHD                         
Sbjct: 361 KGFLPNEYTYNTLIYGFCKIGNLDEALKLYKEMHDRGYKETTLSCNTVIAGLCLHGRTDE 420

Query: 421 ------------------------------------RDLFKELLEQGLQPSTAFCTHHIK 480
                                               RDL KELLE+GLQPST+  TH I+
Sbjct: 421 AYNFFREMPCRDVVCDVVTYNTLIQGFCREGKILQCRDLLKELLEKGLQPSTSSYTHLIQ 480

Query: 481 KLCQLGRVKESKKMWNDLHNRGLQPMVCTHEHIINGLCKQGCVVEEMDGLIIMLKSNLKL 515
           KLCQ+G V+E+KKMWND+HNRGLQPMVCT +HIINGLC+QGC VE ++ L+ MLKSNLK 
Sbjct: 481 KLCQVGDVQEAKKMWNDMHNRGLQPMVCTRDHIINGLCEQGCAVEGVEWLMTMLKSNLKP 540

BLAST of Cla97C05G081420 vs. ExPASy Swiss-Prot
Match: Q8GYM2 (Pentatricopeptide repeat-containing protein At5g18950 OS=Arabidopsis thaliana OX=3702 GN=At5g18950 PE=2 SV=2)

HSP 1 Score: 380.2 bits (975), Expect = 4.3e-104
Identity = 202/462 (43.72%), Positives = 280/462 (60.61%), Query Frame = 0

Query: 1   MANISSLMISIRQNSRFFKNLRIHIRNLSVETNGGNNGFEAIEPSEKLLSRTHRQDVSEI 60
           M+   S +IS  +N R  KN    IR+L+VE+    +     +P E+  + ++    +E+
Sbjct: 1   MSRGQSYLISFFRN-RTRKNPNTQIRSLTVESRDCES-----KPDEQKSAVSY----TEM 60

Query: 61  AENVCKVIRSKPRWEQTLLSDYPSFNFHDPSFFRELLKQLNNVLLSLRFFLWLSSQPEFL 120
           A+ V  ++R + RW+QTL+SD+PSF+F DP FF ELLK  NNVL SL FF WL S  ++ 
Sbjct: 61  AKTVSTIMRERQRWQQTLVSDFPSFDFADPLFFGELLKSQNNVLFSLWFFRWLCSNYDYT 120

Query: 121 PHPVSCNTLFDALLEARACVPAKSFLHSFGFSPEPASLENYIRCVCEGGLVEEAVNVFDV 180
           P PVS N LF ALL+ +A   AKSFL + GF PEP  LE Y++C+ E GLVEEA+ V++V
Sbjct: 121 PGPVSLNILFGALLDGKAVKAAKSFLDTTGFKPEPTLLEQYVKCLSEEGLVEEAIEVYNV 180

Query: 181 LKETGYRPSIETWNFALQSCLKFGRTDLIWKLYEEMIEAGVQKDVGIETVGYLIQAFCSD 240
           LK+ G   S+ T N  L  CLK  + D  W+L++EM+E+    +   E +  LI+A C  
Sbjct: 181 LKDMGISSSVVTCNSVLLGCLKARKLDRFWELHKEMVES----EFDSERIRCLIRALCDG 240

Query: 241 NKVSRAYELLRQALEDGLAPCNDAFNKLISGFCEEEDYDRVSELLHTMIARKRTPDIFTY 300
             VS  YELL+Q L+ GL P    + KLISGFCE  +Y  +SE+LHTMIA    P ++ Y
Sbjct: 241 GDVSEGYELLKQGLKQGLDPGQYVYAKLISGFCEIGNYACMSEVLHTMIAWNHFPSMYIY 300

Query: 301 QKIINGLCKKGKQLEAFEVFNVLKDRGYAPDKVMYTTMIVGLCKMRWLGDAKKLWFEMID 360
           QKII GLC   KQLEA+ +F  LKD+GYAPD+V+YTTMI G C+  WLG A+KLWFEMI 
Sbjct: 301 QKIIKGLCMNKKQLEAYCIFKNLKDKGYAPDRVVYTTMIRGFCEKGWLGSARKLWFEMIK 360

Query: 361 KGFLPNEYTYNTLINGFCKIGKLDEASKLCKEMHDRDLFKELLEQGLQPSTAFCTHHIKK 420
           KG  PNE+ YN +I+G  K G++               + E+L  G   +   C   IK 
Sbjct: 361 KGMRPNEFAYNVMIHGHFKRGEISLVEA---------FYNEMLRNGYGGTMLSCNTMIKG 420

Query: 421 LCQLGRVKESKKMWNDLHNRGLQPMVCTHEHIINGLCKQGCV 463
            C  G+  E+ +++ ++   G+ P   T+  +I G CK+  V
Sbjct: 421 FCSHGKSDEAFEIFKNMSETGVTPNAITYNALIKGFCKENKV 439

BLAST of Cla97C05G081420 vs. ExPASy Swiss-Prot
Match: Q9FLL3 (Pentatricopeptide repeat-containing protein At5g41170, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g41170 PE=2 SV=1)

HSP 1 Score: 186.0 bits (471), Expect = 1.2e-45
Identity = 109/389 (28.02%), Positives = 197/389 (50.64%), Query Frame = 0

Query: 125 SCNTLFDALLEARACVPAKSFL---HSFGFSPEPASLENYIRCVCEGGLVEEAVNVFDVL 184
           +CN L +   ++     A SFL      GF P+  +  + I   C G  +EEA+++ + +
Sbjct: 109 TCNLLMNCFCQSSQPYLASSFLGKMMKLGFEPDIVTFTSLINGFCLGNRMEEAMSMVNQM 168

Query: 185 KETGYRPSIETWNFALQSCLKFGRTDLIWKLYEEMIEAGVQKDVGIETVGYLIQAFCSDN 244
            E G +P +  +   + S  K G  +    L+++M   G++ DV + T   L+   C+  
Sbjct: 169 VEMGIKPDVVMYTTIIDSLCKNGHVNYALSLFDQMENYGIRPDVVMYT--SLVNGLCNSG 228

Query: 245 KVSRAYELLRQALEDGLAPCNDAFNKLISGFCEEEDYDRVSELLHTMIARKRTPDIFTYQ 304
           +   A  LLR   +  + P    FN LI  F +E  +    EL + MI     P+IFTY 
Sbjct: 229 RWRDADSLLRGMTKRKIKPDVITFNALIDAFVKEGKFLDAEELYNEMIRMSIAPNIFTYT 288

Query: 305 KIINGLCKKGKQLEAFEVFNVLKDRGYAPDKVMYTTMIVGLCKMRWLGDAKKLWFEMIDK 364
            +ING C +G   EA ++F +++ +G  PD V YT++I G CK + + DA K+++EM  K
Sbjct: 289 SLINGFCMEGCVDEARQMFYLMETKGCFPDVVAYTSLINGFCKCKKVDDAMKIFYEMSQK 348

Query: 365 GFLPNEYTYNTLINGFCKIGKLDEASKLCKEMHDRDLFKELLEQGLQPSTAFCTHHIKKL 424
           G   N  TY TLI GF ++GK + A         +++F  ++ +G+ P+       +  L
Sbjct: 349 GLTGNTITYTTLIQGFGQVGKPNVA---------QEVFSHMVSRGVPPNIRTYNVLLHCL 408

Query: 425 CQLGRVKESKKMWNDLHNR---GLQPMVCTHEHIINGLCKQGCVVEEMDGLIIMLKSNLK 484
           C  G+VK++  ++ D+  R   G+ P + T+  +++GLC  G + + +     M K  + 
Sbjct: 409 CYNGKVKKALMIFEDMQKREMDGVAPNIWTYNVLLHGLCYNGKLEKALMVFEDMRKREMD 468

Query: 485 LQKRTFVKVVQSFIQMAKLDDSLSVLGSM 508
           +   T+  ++Q   +  K+ +++++  S+
Sbjct: 469 IGIITYTIIIQGMCKAGKVKNAVNLFCSL 486

BLAST of Cla97C05G081420 vs. ExPASy Swiss-Prot
Match: Q0WVK7 (Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g05670 PE=2 SV=1)

HSP 1 Score: 184.9 bits (468), Expect = 2.6e-45
Identity = 117/390 (30.00%), Positives = 188/390 (48.21%), Query Frame = 0

Query: 125 SCNTLFDALLEARACVPAKSFLHSFGFSPE------PASLENYIRCVCEGGLVEEAVNVF 184
           SCN     L  ++ C    + +  F   PE       AS    I  VC+ G ++EA ++ 
Sbjct: 212 SCNVYLTRL--SKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLL 271

Query: 185 DVLKETGYRPSIETWNFALQSCLKFGRTDLIWKLYEEMIEAGVQKDVGIETVGYLIQAFC 244
            +++  GY P + +++  +    +FG  D +WKL E M   G++ +  I   G +I   C
Sbjct: 272 LLMELKGYTPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYI--YGSIIGLLC 331

Query: 245 SDNKVSRAYELLRQALEDGLAPCNDAFNKLISGFCEEEDYDRVSELLHTMIARKRTPDIF 304
              K++ A E   + +  G+ P    +  LI GFC+  D    S+  + M +R  TPD+ 
Sbjct: 332 RICKLAEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVL 391

Query: 305 TYQKIINGLCKKGKQLEAFEVFNVLKDRGYAPDKVMYTTMIVGLCKMRWLGDAKKLWFEM 364
           TY  II+G C+ G  +EA ++F+ +  +G  PD V +T +I G CK   + DA ++   M
Sbjct: 392 TYTAIISGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHNHM 451

Query: 365 IDKGFLPNEYTYNTLINGFCKIGKLDEASKLCKEMHDRDLFKELLEQGLQPSTAFCTHHI 424
           I  G  PN  TY TLI+G CK G LD A++L  EM          + GLQP+       +
Sbjct: 452 IQAGCSPNVVTYTTLIDGLCKEGDLDSANELLHEM---------WKIGLQPNIFTYNSIV 511

Query: 425 KKLCQLGRVKESKKMWNDLHNRGLQPMVCTHEHIINGLCKQGCVVEEMDGLIIMLKSNLK 484
             LC+ G ++E+ K+  +    GL     T+  +++  CK G + +  + L  ML   L+
Sbjct: 512 NGLCKSGNIEEAVKLVGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQ 571

Query: 485 LQKRTFVKVVQSFIQMAKLDDSLSVLGSML 509
               TF  ++  F     L+D   +L  ML
Sbjct: 572 PTIVTFNVLMNGFCLHGMLEDGEKLLNWML 588

BLAST of Cla97C05G081420 vs. ExPASy Swiss-Prot
Match: Q9LPX2 (Pentatricopeptide repeat-containing protein At1g12775, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g12775 PE=2 SV=1)

HSP 1 Score: 182.6 bits (462), Expect = 1.3e-44
Identity = 107/390 (27.44%), Positives = 200/390 (51.28%), Query Frame = 0

Query: 121 PHPVSCNTLFDAL-LEARA--CVPAKSFLHSFGFSPEPASLENYIRCVCEGGLVEEAVNV 180
           P  V  NTL + L LE R    +     +   G  P   +L   +  +C  G V +AV +
Sbjct: 156 PDTVIFNTLLNGLCLECRVSEALELVDRMVEMGHKPTLITLNTLVNGLCLNGKVSDAVVL 215

Query: 181 FDVLKETGYRPSIETWNFALQSCLKFGRTDLIWKLYEEMIEAGVQKDVGIETVGYLIQAF 240
            D + ETG++P+  T+   L    K G+T L  +L  +M E  ++ D    ++  +I   
Sbjct: 216 IDRMVETGFQPNEVTYGPVLNVMCKSGQTALAMELLRKMEERNIKLDAVKYSI--IIDGL 275

Query: 241 CSDNKVSRAYELLRQALEDGLAPCNDAFNKLISGFCEEEDYDRVSELLHTMIARKRTPDI 300
           C D  +  A+ L  +    G       +N LI GFC    +D  ++LL  MI RK +P++
Sbjct: 276 CKDGSLDNAFNLFNEMEIKGFKADIITYNTLIGGFCNAGRWDDGAKLLRDMIKRKISPNV 335

Query: 301 FTYQKIINGLCKKGKQLEAFEVFNVLKDRGYAPDKVMYTTMIVGLCKMRWLGDAKKLWFE 360
            T+  +I+   K+GK  EA ++   +  RG AP+ + Y ++I G CK   L +A ++   
Sbjct: 336 VTFSVLIDSFVKEGKLREADQLLKEMMQRGIAPNTITYNSLIDGFCKENRLEEAIQMVDL 395

Query: 361 MIDKGFLPNEYTYNTLINGFCKIGKLDEASKLCKEMHDRDLFKELLEQGLQPSTAFCTHH 420
           MI KG  P+  T+N LING+CK  ++D+           +LF+E+  +G+  +T      
Sbjct: 396 MISKGCDPDIMTFNILINGYCKANRIDDG---------LELFREMSLRGVIANTVTYNTL 455

Query: 421 IKKLCQLGRVKESKKMWNDLHNRGLQPMVCTHEHIINGLCKQGCVVEEMDGLIIMLKSNL 480
           ++  CQ G+++ +KK++ ++ +R ++P + +++ +++GLC  G + + ++    + KS +
Sbjct: 456 VQGFCQSGKLEVAKKLFQEMVSRRVRPDIVSYKILLDGLCDNGELEKALEIFGKIEKSKM 515

Query: 481 KLQKRTFVKVVQSFIQMAKLDDSLSVLGSM 508
           +L    ++ ++      +K+DD+  +  S+
Sbjct: 516 ELDIGIYMIIIHGMCNASKVDDAWDLFCSL 534

BLAST of Cla97C05G081420 vs. ExPASy Swiss-Prot
Match: Q6NQ83 (Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g22470 PE=1 SV=1)

HSP 1 Score: 179.9 bits (455), Expect = 8.5e-44
Identity = 100/355 (28.17%), Positives = 183/355 (51.55%), Query Frame = 0

Query: 153 PEPASLENYIRCVCEGGLVEEAVNVFDVLKETGYRPSIETWNFALQSCLKFGRTDLIWKL 212
           P+  ++   I  +C  G V EA+ + D + E G++P   T+   L    K G + L   L
Sbjct: 173 PDLVTVSTLINGLCLKGRVSEALVLIDRMVEYGFQPDEVTYGPVLNRLCKSGNSALALDL 232

Query: 213 YEEMIEAGVQKDVGIETVGYLIQAFCSDNKVSRAYELLRQALEDGLAPCNDAFNKLISGF 272
           + +M E  ++  V   ++  +I + C D     A  L  +    G+      ++ LI G 
Sbjct: 233 FRKMEERNIKASVVQYSI--VIDSLCKDGSFDDALSLFNEMEMKGIKADVVTYSSLIGGL 292

Query: 273 CEEEDYDRVSELLHTMIARKRTPDIFTYQKIINGLCKKGKQLEAFEVFNVLKDRGYAPDK 332
           C +  +D  +++L  MI R   PD+ T+  +I+   K+GK LEA E++N +  RG APD 
Sbjct: 293 CNDGKWDDGAKMLREMIGRNIIPDVVTFSALIDVFVKEGKLLEAKELYNEMITRGIAPDT 352

Query: 333 VMYTTMIVGLCKMRWLGDAKKLWFEMIDKGFLPNEYTYNTLINGFCKIGKLDEASKLCKE 392
           + Y ++I G CK   L +A +++  M+ KG  P+  TY+ LIN +CK  ++D+  +    
Sbjct: 353 ITYNSLIDGFCKENCLHEANQMFDLMVSKGCEPDIVTYSILINSYCKAKRVDDGMR---- 412

Query: 393 MHDRDLFKELLEQGLQPSTAFCTHHIKKLCQLGRVKESKKMWNDLHNRGLQPMVCTHEHI 452
                LF+E+  +GL P+T      +   CQ G++  +K+++ ++ +RG+ P V T+  +
Sbjct: 413 -----LFREISSKGLIPNTITYNTLVLGFCQSGKLNAAKELFQEMVSRGVPPSVVTYGIL 472

Query: 453 INGLCKQGCVVEEMDGLIIMLKSNLKLQKRTFVKVVQSFIQMAKLDDSLSVLGSM 508
           ++GLC  G + + ++    M KS + L    +  ++      +K+DD+ S+  S+
Sbjct: 473 LDGLCDNGELNKALEIFEKMQKSRMTLGIGIYNIIIHGMCNASKVDDAWSLFCSL 516

BLAST of Cla97C05G081420 vs. ExPASy TrEMBL
Match: A0A6J1E8U0 (pentatricopeptide repeat-containing protein At5g18950-like OS=Cucurbita moschata OX=3662 GN=LOC111431855 PE=4 SV=1)

HSP 1 Score: 847.0 bits (2187), Expect = 4.6e-242
Identity = 426/575 (74.09%), Positives = 463/575 (80.52%), Query Frame = 0

Query: 1   MANISSLMISIRQNSRFFKNLRIHIRNLSVETNGGNNGFEAIEPSEKLLSRTHRQDVSEI 60
           MANI+SLMISIRQNSR  KNL IHIRNLSVE NGG +G E I  SEKLL+ TH QDV+EI
Sbjct: 1   MANITSLMISIRQNSRLVKNLGIHIRNLSVEMNGGKDGCEEINLSEKLLNPTHCQDVTEI 60

Query: 61  AENVCKVIRSKPRWEQTLLSDYPSFNFHDPSFFRELLKQLNNVLLSLRFFLWLSSQPEFL 120
           ++ VCKVIRSKP+WEQTLLSDYPSFNFHDPSFFRELLKQLNNVLLSLRFFLWLSSQPEFL
Sbjct: 61  SKEVCKVIRSKPKWEQTLLSDYPSFNFHDPSFFRELLKQLNNVLLSLRFFLWLSSQPEFL 120

Query: 121 PHPVSCNTLFDALLEARACVPAKSFLHSFGFSPEPASLENYIRCVCEGGLVEEAVNVFDV 180
           PHPVSCNTLFDALLEA+ACVPAKSFLHSFGFSPEPASLENYIRCVCEGGLVEEAVNVFDV
Sbjct: 121 PHPVSCNTLFDALLEAKACVPAKSFLHSFGFSPEPASLENYIRCVCEGGLVEEAVNVFDV 180

Query: 181 LKETGYRPSIETWNFALQSCLKFGRTDLIWKLYEEMIEAGVQKDVGIETVGYLIQAFCSD 240
           LKE GYRPSIETWNFA QSCLKFGRTDLIWKLY EM+E GVQ +VGIETVGYLIQA CSD
Sbjct: 181 LKEAGYRPSIETWNFAFQSCLKFGRTDLIWKLYAEMMETGVQANVGIETVGYLIQALCSD 240

Query: 241 NKVSRAYELLRQALEDGLAPCNDAFNKLISGFCEEEDYDRVSELLHTMIARKRTPDIFTY 300
           N VS+AYELLRQALEDGLAP NDAFNKLIS FCEE++YDRVSELLHTMIA+ R PDIFTY
Sbjct: 241 NNVSKAYELLRQALEDGLAPSNDAFNKLISLFCEEKNYDRVSELLHTMIAKNRNPDIFTY 300

Query: 301 QKIINGLCKKGKQLEAFEVFNVLKDRGYAPDKVMYTTMIVGLCKMRWLGDAKKLWFEMID 360
           Q+IINGLCKK +QL+AF+VFN LKDRGYAPD VMYTTMI GLCKM WLGDA+KLWFEMID
Sbjct: 301 QEIINGLCKKHRQLQAFQVFNGLKDRGYAPDMVMYTTMIHGLCKMSWLGDARKLWFEMID 360

Query: 361 KGFLPNEYTYNTLINGFCKIGKLDEASKLCKEMHD------------------------- 420
           KGFLPNEYTYNTLI GFCKIG LDEA KL KEMHD                         
Sbjct: 361 KGFLPNEYTYNTLIYGFCKIGNLDEALKLYKEMHDRGYKETTLSCNTVIAGLCLHGRTDE 420

Query: 421 ------------------------------------RDLFKELLEQGLQPSTAFCTHHIK 480
                                               RDL KELLE+GLQPST+  TH I+
Sbjct: 421 AYNFFREMPCKDVVCDVVTYNTLIQGFCREEKILQCRDLLKELLEKGLQPSTSSYTHLIQ 480

Query: 481 KLCQLGRVKESKKMWNDLHNRGLQPMVCTHEHIINGLCKQGCVVEEMDGLIIMLKSNLKL 515
           KLCQ+G V+E+KKMWND+HNRGLQPMVCT +HIINGLC+QGC VE ++ L+ MLKSNLK 
Sbjct: 481 KLCQVGDVQEAKKMWNDMHNRGLQPMVCTRDHIINGLCEQGCAVEGVEWLMTMLKSNLKP 540

BLAST of Cla97C05G081420 vs. ExPASy TrEMBL
Match: A0A6J1IVV9 (pentatricopeptide repeat-containing protein At5g18950-like OS=Cucurbita maxima OX=3661 GN=LOC111479084 PE=4 SV=1)

HSP 1 Score: 846.7 bits (2186), Expect = 6.0e-242
Identity = 425/575 (73.91%), Positives = 462/575 (80.35%), Query Frame = 0

Query: 1   MANISSLMISIRQNSRFFKNLRIHIRNLSVETNGGNNGFEAIEPSEKLLSRTHRQDVSEI 60
           MANI+SLMISIRQNSR  KNLRIHIRNLSVE NGG +G E I  SEKLL+ TH QDV+EI
Sbjct: 1   MANITSLMISIRQNSRLVKNLRIHIRNLSVEMNGGKDGCEEINLSEKLLNPTHSQDVTEI 60

Query: 61  AENVCKVIRSKPRWEQTLLSDYPSFNFHDPSFFRELLKQLNNVLLSLRFFLWLSSQPEFL 120
           ++ VCKVIRSKP+WEQTLLSDYPSFNFHDPSFFRELLKQLNNVLLSLRFFLWLSSQPEFL
Sbjct: 61  SKEVCKVIRSKPKWEQTLLSDYPSFNFHDPSFFRELLKQLNNVLLSLRFFLWLSSQPEFL 120

Query: 121 PHPVSCNTLFDALLEARACVPAKSFLHSFGFSPEPASLENYIRCVCEGGLVEEAVNVFDV 180
           PHPVSCNTLFDALLEA+ACVPAKSFLHSFGFSPEPASLENYIRCVCEGGLVEEAVNVFDV
Sbjct: 121 PHPVSCNTLFDALLEAKACVPAKSFLHSFGFSPEPASLENYIRCVCEGGLVEEAVNVFDV 180

Query: 181 LKETGYRPSIETWNFALQSCLKFGRTDLIWKLYEEMIEAGVQKDVGIETVGYLIQAFCSD 240
           LKE GYRPSIETWNFA QSCLKFGR DLIWKLY EM+E GVQ DVGIETVGYLIQA CSD
Sbjct: 181 LKEAGYRPSIETWNFAFQSCLKFGRIDLIWKLYAEMMETGVQADVGIETVGYLIQALCSD 240

Query: 241 NKVSRAYELLRQALEDGLAPCNDAFNKLISGFCEEEDYDRVSELLHTMIARKRTPDIFTY 300
           N V++AYELLRQALEDGLAP NDAFNKLIS FCEE++YDRVSELLHTMIA+ R PDIFTY
Sbjct: 241 NNVTKAYELLRQALEDGLAPSNDAFNKLISVFCEEKNYDRVSELLHTMIAKNRNPDIFTY 300

Query: 301 QKIINGLCKKGKQLEAFEVFNVLKDRGYAPDKVMYTTMIVGLCKMRWLGDAKKLWFEMID 360
           Q+IIN LCKK +QL+AF+VFN LKDRGYAPD VMYTTMI GLCKM WLGDA+KLWFEMID
Sbjct: 301 QEIINRLCKKHRQLQAFQVFNSLKDRGYAPDMVMYTTMIHGLCKMSWLGDARKLWFEMID 360

Query: 361 KGFLPNEYTYNTLINGFCKIGKLDEASKLCKEMHD------------------------- 420
           KGFLPNEYTYNTLI GFCKIG LDEA KL KEMHD                         
Sbjct: 361 KGFLPNEYTYNTLIYGFCKIGNLDEALKLYKEMHDRGYKETTLSCNTVIAGLCLHGRTDE 420

Query: 421 ------------------------------------RDLFKELLEQGLQPSTAFCTHHIK 480
                                               RDL KELLE+GLQPST+  TH I+
Sbjct: 421 AYNFFREMPCRDVVCDVVTYNTLIQGFCREGKILQCRDLLKELLEKGLQPSTSSYTHLIQ 480

Query: 481 KLCQLGRVKESKKMWNDLHNRGLQPMVCTHEHIINGLCKQGCVVEEMDGLIIMLKSNLKL 515
           KLCQ+G V+E+KKMWND+HNRGLQPMVCT +HIINGLC+QGC VE ++ L+ MLKSNLK 
Sbjct: 481 KLCQVGDVQEAKKMWNDMHNRGLQPMVCTRDHIINGLCEQGCAVEGVEWLMTMLKSNLKP 540

BLAST of Cla97C05G081420 vs. ExPASy TrEMBL
Match: A0A5D3D1Z5 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold434G005110 PE=4 SV=1)

HSP 1 Score: 830.9 bits (2145), Expect = 3.4e-237
Identity = 415/575 (72.17%), Positives = 456/575 (79.30%), Query Frame = 0

Query: 1   MANISSLMISIRQNSRFFKNLRIHIRNLSVETNGGNNGFEAIEPSEKLLSRTHRQDVSEI 60
           M N++SLMISIRQNSRF KNLRIHIRNLSVETNGGNNG E IE SEKLL+ THR+DVSEI
Sbjct: 1   MVNLTSLMISIRQNSRFVKNLRIHIRNLSVETNGGNNGREEIESSEKLLNLTHRKDVSEI 60

Query: 61  AENVCKVIRSKPRWEQTLLSDYPSFNFHDPSFFRELLKQLNNVLLSLRFFLWLSSQPEFL 120
           A  V KVIRSKPRWEQ+LLSDYPSFNFHDPSFF ELLKQLNNV LSLRFFLWLSSQPEFL
Sbjct: 61  AAEVGKVIRSKPRWEQSLLSDYPSFNFHDPSFFSELLKQLNNVFLSLRFFLWLSSQPEFL 120

Query: 121 PHPVSCNTLFDALLEARACVPAKSFLHSFGFSPEPASLENYIRCVCEGGLVEEAVNVFDV 180
           PHPVSCN LFDALLEA+AC PAKSFLHSF FSPEPASLENYIRCVCEGG+VEEAV +FDV
Sbjct: 121 PHPVSCNKLFDALLEAKACAPAKSFLHSFDFSPEPASLENYIRCVCEGGIVEEAVKIFDV 180

Query: 181 LKETGYRPSIETWNFALQSCLKFGRTDLIWKLYEEMIEAGVQKDVGIETVGYLIQAFCSD 240
           LKE GY PS+ETWNFA QSCLKFGRTDLIWKLYEEM+EAGVQKDV IETVGYLIQAFC+D
Sbjct: 181 LKEAGYHPSVETWNFAFQSCLKFGRTDLIWKLYEEMMEAGVQKDVDIETVGYLIQAFCND 240

Query: 241 NKVSRAYELLRQALEDGLAPCNDAFNKLISGFCEEEDYDRVSELLHTMIARKRTPDIFTY 300
           NKVSRAYE+LRQ LEDGLAPCNDAFNKLISGFCEE++Y RVSELLHTMIA+   PDIFTY
Sbjct: 241 NKVSRAYEILRQVLEDGLAPCNDAFNKLISGFCEEKNYHRVSELLHTMIAKNCNPDIFTY 300

Query: 301 QKIINGLCKKGKQLEAFEVFNVLKDRGYAPDKVMYTTMIVGLCKMRWLGDAKKLWFEMID 360
           Q+IING CK GK L+AFEVFN LKDRGYAPD VMYTTMI GLCKM  L DA++LWFEMID
Sbjct: 301 QEIINGFCKNGKTLQAFEVFNALKDRGYAPDMVMYTTMIHGLCKMSRLEDARRLWFEMID 360

Query: 361 KGFLPNEYTYNTLINGFCKIGKLDEASKLCKEMHDR------------------------ 420
           KGF PNEY+YNTLI GFCKIG LDEA KL KEM D                         
Sbjct: 361 KGFRPNEYSYNTLIYGFCKIGNLDEAMKLYKEMLDSGYKETTLSCNTLILGLCLHGRTDE 420

Query: 421 -------------------------------------DLFKELLEQGLQPSTAFCTHHIK 480
                                                DL KEL  +G+QPST+  TH I+
Sbjct: 421 AYDFFREMPCKSIVCDVITYNTLIQGFCREGKVLQSIDLLKELQAKGVQPSTSSYTHLIQ 480

Query: 481 KLCQLGRVKESKKMWNDLHNRGLQPMVCTHEHIINGLCKQGCVVEEMDGLIIMLKSNLKL 515
           KLCQLG V+E+K+MWND+HNRGLQPMVCT +HII+GLC++GCVVE M+ LI MLKSNLK 
Sbjct: 481 KLCQLGNVQEAKEMWNDMHNRGLQPMVCTRDHIISGLCEKGCVVEGMEWLITMLKSNLKP 540

BLAST of Cla97C05G081420 vs. ExPASy TrEMBL
Match: A0A1S3AVU1 (pentatricopeptide repeat-containing protein At5g18950 OS=Cucumis melo OX=3656 GN=LOC103483227 PE=4 SV=1)

HSP 1 Score: 830.9 bits (2145), Expect = 3.4e-237
Identity = 415/575 (72.17%), Positives = 456/575 (79.30%), Query Frame = 0

Query: 1   MANISSLMISIRQNSRFFKNLRIHIRNLSVETNGGNNGFEAIEPSEKLLSRTHRQDVSEI 60
           M N++SLMISIRQNSRF K+LRIHIRNLSVETNGGNNG E IE SEKLL+ THR+DVSEI
Sbjct: 1   MVNLTSLMISIRQNSRFVKSLRIHIRNLSVETNGGNNGREEIESSEKLLNLTHRKDVSEI 60

Query: 61  AENVCKVIRSKPRWEQTLLSDYPSFNFHDPSFFRELLKQLNNVLLSLRFFLWLSSQPEFL 120
           A  V KVIRSKPRWEQ+LLSDYPSFNFHDPSFF ELLKQLNNV LSLRFFLWLSSQPEFL
Sbjct: 61  AAEVGKVIRSKPRWEQSLLSDYPSFNFHDPSFFSELLKQLNNVFLSLRFFLWLSSQPEFL 120

Query: 121 PHPVSCNTLFDALLEARACVPAKSFLHSFGFSPEPASLENYIRCVCEGGLVEEAVNVFDV 180
           PHPVSCN LFDALLEA+AC PAKSFLHSF FSPEPASLENYIRCVCEGG+VEEAV +FDV
Sbjct: 121 PHPVSCNKLFDALLEAKACAPAKSFLHSFDFSPEPASLENYIRCVCEGGIVEEAVKIFDV 180

Query: 181 LKETGYRPSIETWNFALQSCLKFGRTDLIWKLYEEMIEAGVQKDVGIETVGYLIQAFCSD 240
           LKE GY PS+ETWNFA QSCLKFGRTDLIWKLYEEM+EAGVQKDV IETVGYLIQAFC+D
Sbjct: 181 LKEAGYHPSVETWNFAFQSCLKFGRTDLIWKLYEEMMEAGVQKDVDIETVGYLIQAFCND 240

Query: 241 NKVSRAYELLRQALEDGLAPCNDAFNKLISGFCEEEDYDRVSELLHTMIARKRTPDIFTY 300
           NKVSRAYE+LRQ LEDGLAPCNDAFNKLISGFCEE++Y RVSELLHTMIA+   PDIFTY
Sbjct: 241 NKVSRAYEILRQVLEDGLAPCNDAFNKLISGFCEEKNYHRVSELLHTMIAKNCNPDIFTY 300

Query: 301 QKIINGLCKKGKQLEAFEVFNVLKDRGYAPDKVMYTTMIVGLCKMRWLGDAKKLWFEMID 360
           Q+IING CK GK L+AFEVFN LKDRGYAPD VMYTTMI GLCKM  L DA+KLWFEMID
Sbjct: 301 QEIINGFCKNGKTLQAFEVFNALKDRGYAPDMVMYTTMIYGLCKMSRLEDARKLWFEMID 360

Query: 361 KGFLPNEYTYNTLINGFCKIGKLDEASKLCKEMHDR------------------------ 420
           KGF PNEY+YNTLI GFCKIG LDEA KL KEM D                         
Sbjct: 361 KGFRPNEYSYNTLIYGFCKIGNLDEAMKLYKEMLDSGYKETTLSCNTLILGLCLHGRTDE 420

Query: 421 -------------------------------------DLFKELLEQGLQPSTAFCTHHIK 480
                                                DL KEL  +G+QPST+  TH I+
Sbjct: 421 AYDFFREMPCKSIVCDVITYNTLIQGFCREGKVLQSIDLLKELQAKGVQPSTSSYTHLIQ 480

Query: 481 KLCQLGRVKESKKMWNDLHNRGLQPMVCTHEHIINGLCKQGCVVEEMDGLIIMLKSNLKL 515
           KLCQLG V+E+K+MWND+HNRGLQPMVCT +HII+GLC++GCVVE M+ LI MLKSNLK 
Sbjct: 481 KLCQLGNVQEAKEMWNDMHNRGLQPMVCTRDHIISGLCEKGCVVEGMEWLITMLKSNLKP 540

BLAST of Cla97C05G081420 vs. ExPASy TrEMBL
Match: A0A5A7U134 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold171G001480 PE=4 SV=1)

HSP 1 Score: 828.2 bits (2138), Expect = 2.2e-236
Identity = 414/575 (72.00%), Positives = 455/575 (79.13%), Query Frame = 0

Query: 1   MANISSLMISIRQNSRFFKNLRIHIRNLSVETNGGNNGFEAIEPSEKLLSRTHRQDVSEI 60
           M N++SLMISIRQNSRF KNLRIHIRNLSVETNGGNNG E IE SEKLL+ THR+DVSEI
Sbjct: 1   MVNLTSLMISIRQNSRFVKNLRIHIRNLSVETNGGNNGREEIESSEKLLNLTHRKDVSEI 60

Query: 61  AENVCKVIRSKPRWEQTLLSDYPSFNFHDPSFFRELLKQLNNVLLSLRFFLWLSSQPEFL 120
           A  V KVIRSKPRWEQ+LLSDYPSFNFHDPSFF ELLKQLNNV LSLRFFLWLSSQPEFL
Sbjct: 61  AAEVGKVIRSKPRWEQSLLSDYPSFNFHDPSFFSELLKQLNNVFLSLRFFLWLSSQPEFL 120

Query: 121 PHPVSCNTLFDALLEARACVPAKSFLHSFGFSPEPASLENYIRCVCEGGLVEEAVNVFDV 180
           PHPVSCN LFDALLEA+AC PAKSFLHSF FSPEPASLENYIRCVCEGG+VEEAV +FDV
Sbjct: 121 PHPVSCNKLFDALLEAKACAPAKSFLHSFDFSPEPASLENYIRCVCEGGIVEEAVKIFDV 180

Query: 181 LKETGYRPSIETWNFALQSCLKFGRTDLIWKLYEEMIEAGVQKDVGIETVGYLIQAFCSD 240
           LKE GY PS+ETWNFA QSCLKFGRTDLIWKLYEEM+EAGVQKDV IETVGYLIQAFC+D
Sbjct: 181 LKEAGYHPSVETWNFAFQSCLKFGRTDLIWKLYEEMMEAGVQKDVDIETVGYLIQAFCND 240

Query: 241 NKVSRAYELLRQALEDGLAPCNDAFNKLISGFCEEEDYDRVSELLHTMIARKRTPDIFTY 300
           NKVSRAYE+LRQ LEDGLAPCNDAFNKLISGFCEE++Y RVSELLHTMIA+   PDIFTY
Sbjct: 241 NKVSRAYEILRQVLEDGLAPCNDAFNKLISGFCEEKNYHRVSELLHTMIAKNCNPDIFTY 300

Query: 301 QKIINGLCKKGKQLEAFEVFNVLKDRGYAPDKVMYTTMIVGLCKMRWLGDAKKLWFEMID 360
           Q+IING CK GK L+AFEVF  LKDRGYAPD VMYTTMI GLCKM  L DA+KLWFEMID
Sbjct: 301 QEIINGFCKNGKTLQAFEVFTALKDRGYAPDMVMYTTMIHGLCKMSRLEDARKLWFEMID 360

Query: 361 KGFLPNEYTYNTLINGFCKIGKLDEASKLCKEMHDR------------------------ 420
           KGF PNEY+YNTLI GFCKIG LDEA KL KEM D                         
Sbjct: 361 KGFRPNEYSYNTLIYGFCKIGNLDEAMKLYKEMLDSGYKETTLSCNTLILGLCLHGRTDE 420

Query: 421 -------------------------------------DLFKELLEQGLQPSTAFCTHHIK 480
                                                DL KEL  +G+QPST+  TH I+
Sbjct: 421 AYDFFREMPCKSIVCDVITYNTLIQGFCREGKVLQSIDLLKELQAKGVQPSTSSYTHLIQ 480

Query: 481 KLCQLGRVKESKKMWNDLHNRGLQPMVCTHEHIINGLCKQGCVVEEMDGLIIMLKSNLKL 515
           KLCQLG V+E+K+MWND+HNRGLQPMVCT +HII+GLC++GCVVE M+ LI ML+SNLK 
Sbjct: 481 KLCQLGNVQEAKEMWNDMHNRGLQPMVCTRDHIISGLCEKGCVVEGMEWLITMLQSNLKP 540

BLAST of Cla97C05G081420 vs. TAIR 10
Match: AT5G18950.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 380.2 bits (975), Expect = 3.0e-105
Identity = 202/462 (43.72%), Positives = 280/462 (60.61%), Query Frame = 0

Query: 1   MANISSLMISIRQNSRFFKNLRIHIRNLSVETNGGNNGFEAIEPSEKLLSRTHRQDVSEI 60
           M+   S +IS  +N R  KN    IR+L+VE+    +     +P E+  + ++    +E+
Sbjct: 1   MSRGQSYLISFFRN-RTRKNPNTQIRSLTVESRDCES-----KPDEQKSAVSY----TEM 60

Query: 61  AENVCKVIRSKPRWEQTLLSDYPSFNFHDPSFFRELLKQLNNVLLSLRFFLWLSSQPEFL 120
           A+ V  ++R + RW+QTL+SD+PSF+F DP FF ELLK  NNVL SL FF WL S  ++ 
Sbjct: 61  AKTVSTIMRERQRWQQTLVSDFPSFDFADPLFFGELLKSQNNVLFSLWFFRWLCSNYDYT 120

Query: 121 PHPVSCNTLFDALLEARACVPAKSFLHSFGFSPEPASLENYIRCVCEGGLVEEAVNVFDV 180
           P PVS N LF ALL+ +A   AKSFL + GF PEP  LE Y++C+ E GLVEEA+ V++V
Sbjct: 121 PGPVSLNILFGALLDGKAVKAAKSFLDTTGFKPEPTLLEQYVKCLSEEGLVEEAIEVYNV 180

Query: 181 LKETGYRPSIETWNFALQSCLKFGRTDLIWKLYEEMIEAGVQKDVGIETVGYLIQAFCSD 240
           LK+ G   S+ T N  L  CLK  + D  W+L++EM+E+    +   E +  LI+A C  
Sbjct: 181 LKDMGISSSVVTCNSVLLGCLKARKLDRFWELHKEMVES----EFDSERIRCLIRALCDG 240

Query: 241 NKVSRAYELLRQALEDGLAPCNDAFNKLISGFCEEEDYDRVSELLHTMIARKRTPDIFTY 300
             VS  YELL+Q L+ GL P    + KLISGFCE  +Y  +SE+LHTMIA    P ++ Y
Sbjct: 241 GDVSEGYELLKQGLKQGLDPGQYVYAKLISGFCEIGNYACMSEVLHTMIAWNHFPSMYIY 300

Query: 301 QKIINGLCKKGKQLEAFEVFNVLKDRGYAPDKVMYTTMIVGLCKMRWLGDAKKLWFEMID 360
           QKII GLC   KQLEA+ +F  LKD+GYAPD+V+YTTMI G C+  WLG A+KLWFEMI 
Sbjct: 301 QKIIKGLCMNKKQLEAYCIFKNLKDKGYAPDRVVYTTMIRGFCEKGWLGSARKLWFEMIK 360

Query: 361 KGFLPNEYTYNTLINGFCKIGKLDEASKLCKEMHDRDLFKELLEQGLQPSTAFCTHHIKK 420
           KG  PNE+ YN +I+G  K G++               + E+L  G   +   C   IK 
Sbjct: 361 KGMRPNEFAYNVMIHGHFKRGEISLVEA---------FYNEMLRNGYGGTMLSCNTMIKG 420

Query: 421 LCQLGRVKESKKMWNDLHNRGLQPMVCTHEHIINGLCKQGCV 463
            C  G+  E+ +++ ++   G+ P   T+  +I G CK+  V
Sbjct: 421 FCSHGKSDEAFEIFKNMSETGVTPNAITYNALIKGFCKENKV 439

BLAST of Cla97C05G081420 vs. TAIR 10
Match: AT5G41170.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 186.0 bits (471), Expect = 8.4e-47
Identity = 109/389 (28.02%), Positives = 197/389 (50.64%), Query Frame = 0

Query: 125 SCNTLFDALLEARACVPAKSFL---HSFGFSPEPASLENYIRCVCEGGLVEEAVNVFDVL 184
           +CN L +   ++     A SFL      GF P+  +  + I   C G  +EEA+++ + +
Sbjct: 109 TCNLLMNCFCQSSQPYLASSFLGKMMKLGFEPDIVTFTSLINGFCLGNRMEEAMSMVNQM 168

Query: 185 KETGYRPSIETWNFALQSCLKFGRTDLIWKLYEEMIEAGVQKDVGIETVGYLIQAFCSDN 244
            E G +P +  +   + S  K G  +    L+++M   G++ DV + T   L+   C+  
Sbjct: 169 VEMGIKPDVVMYTTIIDSLCKNGHVNYALSLFDQMENYGIRPDVVMYT--SLVNGLCNSG 228

Query: 245 KVSRAYELLRQALEDGLAPCNDAFNKLISGFCEEEDYDRVSELLHTMIARKRTPDIFTYQ 304
           +   A  LLR   +  + P    FN LI  F +E  +    EL + MI     P+IFTY 
Sbjct: 229 RWRDADSLLRGMTKRKIKPDVITFNALIDAFVKEGKFLDAEELYNEMIRMSIAPNIFTYT 288

Query: 305 KIINGLCKKGKQLEAFEVFNVLKDRGYAPDKVMYTTMIVGLCKMRWLGDAKKLWFEMIDK 364
            +ING C +G   EA ++F +++ +G  PD V YT++I G CK + + DA K+++EM  K
Sbjct: 289 SLINGFCMEGCVDEARQMFYLMETKGCFPDVVAYTSLINGFCKCKKVDDAMKIFYEMSQK 348

Query: 365 GFLPNEYTYNTLINGFCKIGKLDEASKLCKEMHDRDLFKELLEQGLQPSTAFCTHHIKKL 424
           G   N  TY TLI GF ++GK + A         +++F  ++ +G+ P+       +  L
Sbjct: 349 GLTGNTITYTTLIQGFGQVGKPNVA---------QEVFSHMVSRGVPPNIRTYNVLLHCL 408

Query: 425 CQLGRVKESKKMWNDLHNR---GLQPMVCTHEHIINGLCKQGCVVEEMDGLIIMLKSNLK 484
           C  G+VK++  ++ D+  R   G+ P + T+  +++GLC  G + + +     M K  + 
Sbjct: 409 CYNGKVKKALMIFEDMQKREMDGVAPNIWTYNVLLHGLCYNGKLEKALMVFEDMRKREMD 468

Query: 485 LQKRTFVKVVQSFIQMAKLDDSLSVLGSM 508
           +   T+  ++Q   +  K+ +++++  S+
Sbjct: 469 IGIITYTIIIQGMCKAGKVKNAVNLFCSL 486

BLAST of Cla97C05G081420 vs. TAIR 10
Match: AT1G05670.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 184.9 bits (468), Expect = 1.9e-46
Identity = 117/390 (30.00%), Positives = 188/390 (48.21%), Query Frame = 0

Query: 125 SCNTLFDALLEARACVPAKSFLHSFGFSPE------PASLENYIRCVCEGGLVEEAVNVF 184
           SCN     L  ++ C    + +  F   PE       AS    I  VC+ G ++EA ++ 
Sbjct: 212 SCNVYLTRL--SKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLL 271

Query: 185 DVLKETGYRPSIETWNFALQSCLKFGRTDLIWKLYEEMIEAGVQKDVGIETVGYLIQAFC 244
            +++  GY P + +++  +    +FG  D +WKL E M   G++ +  I   G +I   C
Sbjct: 272 LLMELKGYTPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYI--YGSIIGLLC 331

Query: 245 SDNKVSRAYELLRQALEDGLAPCNDAFNKLISGFCEEEDYDRVSELLHTMIARKRTPDIF 304
              K++ A E   + +  G+ P    +  LI GFC+  D    S+  + M +R  TPD+ 
Sbjct: 332 RICKLAEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVL 391

Query: 305 TYQKIINGLCKKGKQLEAFEVFNVLKDRGYAPDKVMYTTMIVGLCKMRWLGDAKKLWFEM 364
           TY  II+G C+ G  +EA ++F+ +  +G  PD V +T +I G CK   + DA ++   M
Sbjct: 392 TYTAIISGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHNHM 451

Query: 365 IDKGFLPNEYTYNTLINGFCKIGKLDEASKLCKEMHDRDLFKELLEQGLQPSTAFCTHHI 424
           I  G  PN  TY TLI+G CK G LD A++L  EM          + GLQP+       +
Sbjct: 452 IQAGCSPNVVTYTTLIDGLCKEGDLDSANELLHEM---------WKIGLQPNIFTYNSIV 511

Query: 425 KKLCQLGRVKESKKMWNDLHNRGLQPMVCTHEHIINGLCKQGCVVEEMDGLIIMLKSNLK 484
             LC+ G ++E+ K+  +    GL     T+  +++  CK G + +  + L  ML   L+
Sbjct: 512 NGLCKSGNIEEAVKLVGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQ 571

Query: 485 LQKRTFVKVVQSFIQMAKLDDSLSVLGSML 509
               TF  ++  F     L+D   +L  ML
Sbjct: 572 PTIVTFNVLMNGFCLHGMLEDGEKLLNWML 588

BLAST of Cla97C05G081420 vs. TAIR 10
Match: AT1G05670.2 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 184.9 bits (468), Expect = 1.9e-46
Identity = 117/390 (30.00%), Positives = 188/390 (48.21%), Query Frame = 0

Query: 125 SCNTLFDALLEARACVPAKSFLHSFGFSPE------PASLENYIRCVCEGGLVEEAVNVF 184
           SCN     L  ++ C    + +  F   PE       AS    I  VC+ G ++EA ++ 
Sbjct: 212 SCNVYLTRL--SKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLL 271

Query: 185 DVLKETGYRPSIETWNFALQSCLKFGRTDLIWKLYEEMIEAGVQKDVGIETVGYLIQAFC 244
            +++  GY P + +++  +    +FG  D +WKL E M   G++ +  I   G +I   C
Sbjct: 272 LLMELKGYTPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYI--YGSIIGLLC 331

Query: 245 SDNKVSRAYELLRQALEDGLAPCNDAFNKLISGFCEEEDYDRVSELLHTMIARKRTPDIF 304
              K++ A E   + +  G+ P    +  LI GFC+  D    S+  + M +R  TPD+ 
Sbjct: 332 RICKLAEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVL 391

Query: 305 TYQKIINGLCKKGKQLEAFEVFNVLKDRGYAPDKVMYTTMIVGLCKMRWLGDAKKLWFEM 364
           TY  II+G C+ G  +EA ++F+ +  +G  PD V +T +I G CK   + DA ++   M
Sbjct: 392 TYTAIISGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHNHM 451

Query: 365 IDKGFLPNEYTYNTLINGFCKIGKLDEASKLCKEMHDRDLFKELLEQGLQPSTAFCTHHI 424
           I  G  PN  TY TLI+G CK G LD A++L  EM          + GLQP+       +
Sbjct: 452 IQAGCSPNVVTYTTLIDGLCKEGDLDSANELLHEM---------WKIGLQPNIFTYNSIV 511

Query: 425 KKLCQLGRVKESKKMWNDLHNRGLQPMVCTHEHIINGLCKQGCVVEEMDGLIIMLKSNLK 484
             LC+ G ++E+ K+  +    GL     T+  +++  CK G + +  + L  ML   L+
Sbjct: 512 NGLCKSGNIEEAVKLVGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQ 571

Query: 485 LQKRTFVKVVQSFIQMAKLDDSLSVLGSML 509
               TF  ++  F     L+D   +L  ML
Sbjct: 572 PTIVTFNVLMNGFCLHGMLEDGEKLLNWML 588

BLAST of Cla97C05G081420 vs. TAIR 10
Match: AT1G12775.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 182.6 bits (462), Expect = 9.3e-46
Identity = 107/390 (27.44%), Positives = 200/390 (51.28%), Query Frame = 0

Query: 121 PHPVSCNTLFDAL-LEARA--CVPAKSFLHSFGFSPEPASLENYIRCVCEGGLVEEAVNV 180
           P  V  NTL + L LE R    +     +   G  P   +L   +  +C  G V +AV +
Sbjct: 156 PDTVIFNTLLNGLCLECRVSEALELVDRMVEMGHKPTLITLNTLVNGLCLNGKVSDAVVL 215

Query: 181 FDVLKETGYRPSIETWNFALQSCLKFGRTDLIWKLYEEMIEAGVQKDVGIETVGYLIQAF 240
            D + ETG++P+  T+   L    K G+T L  +L  +M E  ++ D    ++  +I   
Sbjct: 216 IDRMVETGFQPNEVTYGPVLNVMCKSGQTALAMELLRKMEERNIKLDAVKYSI--IIDGL 275

Query: 241 CSDNKVSRAYELLRQALEDGLAPCNDAFNKLISGFCEEEDYDRVSELLHTMIARKRTPDI 300
           C D  +  A+ L  +    G       +N LI GFC    +D  ++LL  MI RK +P++
Sbjct: 276 CKDGSLDNAFNLFNEMEIKGFKADIITYNTLIGGFCNAGRWDDGAKLLRDMIKRKISPNV 335

Query: 301 FTYQKIINGLCKKGKQLEAFEVFNVLKDRGYAPDKVMYTTMIVGLCKMRWLGDAKKLWFE 360
            T+  +I+   K+GK  EA ++   +  RG AP+ + Y ++I G CK   L +A ++   
Sbjct: 336 VTFSVLIDSFVKEGKLREADQLLKEMMQRGIAPNTITYNSLIDGFCKENRLEEAIQMVDL 395

Query: 361 MIDKGFLPNEYTYNTLINGFCKIGKLDEASKLCKEMHDRDLFKELLEQGLQPSTAFCTHH 420
           MI KG  P+  T+N LING+CK  ++D+           +LF+E+  +G+  +T      
Sbjct: 396 MISKGCDPDIMTFNILINGYCKANRIDDG---------LELFREMSLRGVIANTVTYNTL 455

Query: 421 IKKLCQLGRVKESKKMWNDLHNRGLQPMVCTHEHIINGLCKQGCVVEEMDGLIIMLKSNL 480
           ++  CQ G+++ +KK++ ++ +R ++P + +++ +++GLC  G + + ++    + KS +
Sbjct: 456 VQGFCQSGKLEVAKKLFQEMVSRRVRPDIVSYKILLDGLCDNGELEKALEIFGKIEKSKM 515

Query: 481 KLQKRTFVKVVQSFIQMAKLDDSLSVLGSM 508
           +L    ++ ++      +K+DD+  +  S+
Sbjct: 516 ELDIGIYMIIIHGMCNASKVDDAWDLFCSL 534
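
The percentages in each HSP summary line above follow directly from the reported counts: identical (or positive) positions divided by the alignment length. The following is a minimal Python sketch that parses one such line and recomputes both values. The function name `parse_hsp_stats` and the returned keys are illustrative assumptions, not part of any BLAST tool, and the pattern only targets the line layout shown on this page.

```python
import re

# Pattern for an HSP summary line as laid out on this page. The optional "i"
# accepts both the spelling "Positives" and the variant "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: " + line)
    identical, length, _, positives, pos_length, _ = match.groups()
    identical, length = int(identical), int(length)
    positives, pos_length = int(positives), int(pos_length)
    if length != pos_length:
        raise ValueError("identity and positives use different lengths")
    return {
        "identical": identical,
        "positives": positives,
        "aligned_length": length,
        # Percentages are recomputed from the counts, rounded to 2 decimals.
        "identity_pct": round(100.0 * identical / length, 2),
        "positives_pct": round(100.0 * positives / length, 2),
    }


if __name__ == "__main__":
    example = "Identity = 202/462 (43.72%), Positives = 280/462 (60.61%), Query Frame = 0"
    print(parse_hsp_stats(example))
```

Recomputing the percentages rather than trusting the printed ones gives a quick consistency check when these lines are copied between reports.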

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038891677.1 | 1.7e-254 | 77.74 | pentatricopeptide repeat-containing protein At5g18950-like isoform X1 [Benincasa...
XP_038891680.1 | 3.3e-247 | 78.02 | pentatricopeptide repeat-containing protein At5g18950-like isoform X2 [Benincasa...
XP_038891688.1 | 3.3e-247 | 78.02 | pentatricopeptide repeat-containing protein At5g18950-like isoform X3 [Benincasa...
XP_022924328.1 | 9.4e-242 | 74.09 | pentatricopeptide repeat-containing protein At5g18950-like [Cucurbita moschata] ...
XP_022979328.1 | 1.2e-241 | 73.91 | pentatricopeptide repeat-containing protein At5g18950-like [Cucurbita maxima]

Match Name | E-value | Identity (%) | Description
Q8GYM2 | 4.3e-104 | 43.72 | Pentatricopeptide repeat-containing protein At5g18950 OS=Arabidopsis thaliana OX...
Q9FLL3 | 1.2e-45 | 28.02 | Pentatricopeptide repeat-containing protein At5g41170, mitochondrial OS=Arabidop...
Q0WVK7 | 2.6e-45 | 30.00 | Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidop...
Q9LPX2 | 1.3e-44 | 27.44 | Pentatricopeptide repeat-containing protein At1g12775, mitochondrial OS=Arabidop...
Q6NQ83 | 8.5e-44 | 28.17 | Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidop...

Match Name | E-value | Identity (%) | Description
A0A6J1E8U0 | 4.6e-242 | 74.09 | pentatricopeptide repeat-containing protein At5g18950-like OS=Cucurbita moschata...
A0A6J1IVV9 | 6.0e-242 | 73.91 | pentatricopeptide repeat-containing protein At5g18950-like OS=Cucurbita maxima O...
A0A5D3D1Z5 | 3.4e-237 | 72.17 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3AVU1 | 3.4e-237 | 72.17 | pentatricopeptide repeat-containing protein At5g18950 OS=Cucumis melo OX=3656 GN...
A0A5A7U134 | 2.2e-236 | 72.00 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...

Match Name | E-value | Identity (%) | Description
AT5G18950.1 | 3.0e-105 | 43.72 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G41170.1 | 8.4e-47 | 28.02 | Pentatricopeptide repeat (PPR-like) superfamily protein
AT1G05670.1 | 1.9e-46 | 30.00 | Pentatricopeptide repeat (PPR-like) superfamily protein
AT1G05670.2 | 1.9e-46 | 30.00 | Pentatricopeptide repeat (PPR-like) superfamily protein
AT1G12775.1 | 9.3e-46 | 27.44 | Pentatricopeptide repeat (PPR) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 83..240, e-value: 7.3E-18, score: 66.5
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 397..521, e-value: 1.9E-14, score: 55.8
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 277..396, e-value: 7.4E-33, score: 116.3
IPR002885 | Pentatricopeptide repeat | PFAM | PF12854 | PPR_1 | coord: 361..393, e-value: 7.1E-15, score: 54.4
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 298..331, e-value: 9.8E-5, score: 20.3
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 333..367, e-value: 2.7E-8, score: 31.5
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 368..398, e-value: 4.5E-9, score: 33.9
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 192..224, e-value: 5.6E-4, score: 17.9
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 264..296, e-value: 3.7E-4, score: 18.5
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 295..344, e-value: 7.7E-13, score: 48.4
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 192..221, e-value: 0.0065, score: 16.6
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 231..258, e-value: 0.25, score: 11.7
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 264..290, e-value: 0.088, score: 13.1
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 296..330, score: 11.860184
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 154..188, score: 8.900633
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 226..260, score: 8.823904
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 366..400, score: 12.375365
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 261..295, score: 8.659485
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 331..365, score: 11.476539
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 189..223, score: 10.018685
None | No IPR available | PANTHER | PTHR47939:SF4 | PPR CONTAINING PLANT-LIKE PROTEIN | coord: 397..510
None | No IPR available | PANTHER | PTHR47939:SF4 | PPR CONTAINING PLANT-LIKE PROTEIN | coord: 4..396
None | No IPR available | PANTHER | PTHR47939 | MEMBRANE-ASSOCIATED SALT-INDUCIBLE PROTEIN-LIKE | coord: 397..510
None | No IPR available | PANTHER | PTHR47939 | MEMBRANE-ASSOCIATED SALT-INDUCIBLE PROTEIN-LIKE | coord: 4..396
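
The `coord: start..end` entries in the InterPro annotations give the residue range of each match on the protein. The following is a small Python sketch for pulling those ranges out of text and computing their lengths. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the helper name `span_lengths` is hypothetical.

```python
import re

# Matches entries such as "coord: 296..330" as they appear on this page.
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")


def span_lengths(text):
    """Return (start, end, length) for every 'coord: a..b' entry in text.

    Assumption: coordinates are 1-based and inclusive, so the length is
    end - start + 1.
    """
    spans = []
    for start, end in COORD.findall(text):
        start, end = int(start), int(end)
        spans.append((start, end, end - start + 1))
    return spans


if __name__ == "__main__":
    # PROSITE PS51375 ranges copied from the annotations above.
    prosite_ppr = (
        "coord: 296..330 coord: 154..188 coord: 226..260 coord: 366..400 "
        "coord: 261..295 coord: 331..365 coord: 189..223"
    )
    for start, end, length in sorted(span_lengths(prosite_ppr)):
        print(start, end, length)
```

Sorting the spans by start position makes it easy to see where adjacent repeat matches abut and where gaps remain between them.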

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cla97C05G081420.2 | Cla97C05G081420.2 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding