Cla97C01G020740 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C01G020740
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Pentatricopeptide repeat-containing protein
Location: Cla97Chr01: 33268668 .. 33272467 (-)
RNA-Seq Expression: Cla97C01G020740
Synteny: Cla97C01G020740
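The Location field packs the chromosome, start, end and strand into one string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the string can be split into its parts and the genomic span computed as follows. The helper names `parse_location` and `span_length` are hypothetical and not part of any database API.

```python
import re


def parse_location(text):
    """Split 'Chrom: start .. end (strand)' into (chrom, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def span_length(start, end):
    # Assumption: both coordinates are 1-based and inclusive.
    return end - start + 1


chrom, start, end, strand = parse_location("Cla97Chr01: 33268668 .. 33272467 (-)")
print(chrom, strand, span_length(start, end))  # Cla97Chr01 - 3800
```

The `(-)` strand marker indicates that the feature is annotated on the reverse strand of Cla97Chr01.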
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCAGTTTACATCTTCGGTAGGTAACCTAGCAACGCTCTCTACCTTCCATTCTGCATTCAAATCTTACGTTGAAGGCAAAAATTTTACTCCCCCCTTGTTGATTTTCCGTCAGCTACTAAGGTATCGGGTTAAACCTAATGATTCTATCTTCTCCTTACTCATCAAAGCCTTCGTTGTCTCGTCTCCATCTTCTTCTTTTGCATCATCGTCCTGTTCTGAGAATGCAAAAGCGGAGGCGAATCAGCTCCAAACCCACTTCATTAAATGGGGATTTAACCAATTTTTGTATGTTAGTACTGCCTTTCTCGACTTGTACTCAAAATTGGGTTTTGTTAAAGCTGCTCGACGTTTGTTTGATGAATTTTCTGAAAAAGATGTTGTATCGTGGAATGCGTTGATTTCTGGGTACTCACGAAGTGGGTATAGCCATGATGCGTTCGAGCTATTTGTCGAAATGTGCAGAAGGGGGTTTGACCCTTCTCAGAGAACATTGGTAAGTTTAATTCCTTCCTGTGGTACCCAACAATTATTCGTCCAAGGAAAATGCATCCATGCGTTAGGTGTTAAGGCTGGCCTTGATTTAGACTCCCAAGTGAAGAATGCTCTTGCATCGATGTATGGTAAATGTGCAGATTTAGAAGGAGTGGAGCTCTTATTTGGAGAGATCATTGAAAAAAACGTAGTTTCTTGGAATACCATGATTGGGGCATTCGGCCAAAATGGGTTCTTTTTAGAGGCAATGCTTGTTCTCAAGCAAATGCTTGAGGAAAGATTCAATGCTAACTCGGTTACTATGGTGAGTATCTTGTCTGCAAATGCAAATCCAGGATCTATCCATTGTTATGCTACTAAAACTGGTCTTGTGGAAAATGTTTCCGTGGTTACCTCTCTAGTTTGCTCCTACGTAAGATGTGGGTGCATACAAATGGCAGAACTGATTTATATGTCAAAACTCAAGAGAAACTTGGTTGCATTAACTGCGATTATTTCTAGCTATGCTGAGAAAGGTGACATGGGATCTGTGGTGAAGCTATATTCCCGAGTACAGCATTTAGATATGAAACTGGATGCAGTTGCAATGGTTGGCATAATCCAAGGTCTTACATATCCTCATCACGTTGGCATTGGACTTGCTTTCCACGGTTATGGCCTAAAGAGTGGGCTAATTATTGATTGTTTGGTTGCAAATGGTTTCATAAGGATGTATTCAAAGTTCGATAATATTGATGCAGTGTTTTCTTTATTTCAAGAGATGCATGAAAAGACACAGAGCAGCTGGAATTCTGTGATATCTAGTTGTACACAGGCAGGAAGGTCAATTGATGCCATGGCTTTGTTTTCCCAAATGATGTTGTCAGGTTATGGGCCAGATTCAATTACACTTGCTAGTTTACTATCTGCTTGTTGCCAAAATGGGAATTTGCATTCTGGGGAGATACTTCATTGCTATATTCTAAGAAACAATCTGGACTTGGAGGGTTTTGTTGGGACTGCTCTTATAGACATGTACGTCAAGTGTGGAAGAATAGACTTAGCTGAAAAGGTGTTTAAGAGCATGAAAGAGCCATGTTTAGCTTCATGGAACTCGCTGATCTCTGGTTATGGTTTATTTGGGTTTGACAATCATGCTCTCCTTTGTTACACTAAAATGATGGAGAAGGGGATAAAACCCAATAAAATCACTTTCTCAGGAATTTTAGCTGCTTGTACTCATGGAGGACTTGTTGAAAACGGTAGAACATACTTCAAAACCATGAAGAAAGAATTTGGTATCGTGCCCGAATCACAGCATTGTGCATCCATGGTTGGCCTGCTTGGTCGGGCAGGATTATTTGAAGAGGCAATTGAATTTATCAAGAACATGGATATCAATCCAGATTCTGCAGTGTGGGGAGCATTGCTCAGTGCTTGTTGCATTCACCAGGAAGTTAAGCTTGGGGAATCTGTGGCGAAAAAGTTGCTTTTCTCTAACTGTAGAAATGGGGGTTTTTTTGTGTT
GATGTCAAATCTTTATGCAGCATCAGGGAGGTGGAATGATGTAGCAAGAATCAGAAAGATGATGCGAGAAATGGGAGAAGATGGTTGTTCAGGTGTTAGCCTTATGGAATAAATTTCTTTGGAAGACAGACAGATTTATACTTTTGAGATTTTCACTTGAATGGATACTCTTCGCAGTGGCTTGCATTTCACTTTAGAGTCCTCATTTTGAGCATACTAGGACTAGTGTAGGCATTTGACAAGTGTAGTGCAAGTGATGTAATTATTTAGCTTCTATTTAAAACCAAGTTATTCATTTGTTTAATCATAATAAAAGTTCTGGATAAAAGAATTAAATTACAATGAAACAAAAAATAGGTATGATAGAATTTTATTTATGAATAAGATGAATACTATCTCAAAATTTTGAAAAAATTATACAAATGCTTATGAAACAATTATTTTGTGTTATAAATGAGGTATGTTAGGATTGATTTTGTTTGGTAATTCAATCCCCATTTAAATTAGTTCGAATGTTTGTGAAGTTAGAAGTCAAAAGACAAGTACTTAGAAAGAAGGGGAAGAAAAGATGCAGAGTCCATGCACTGGAATGAGATTGTTTCGTATTAAAGTAATAGGTAATCTTATGTTGAACTTAAAATTTAAGAATTGTTCCTAATTTGTATAATCAGGAAGGGAATTCAAGTTTCTTCTCTCATTCCCAAGTAATGAAGATTGCCCACCAAAACCAATGGAGATCCAAAATTGTTTTCATCCAAAATTAGCAAGAATGAGGGGTGAATCCCCAAAAAACATCAATGGATTGTATAAGTTAAATCGTATCGTACCTTGAATGCCGGATCGCCAAAACCAAAGAAATCCAGCAAGCTGCAGCTCTTTAGGATTCCTTCCTTGCCTTGAAGGGTTAAAAACTTGATGAGTAAGTCACTCATGAAAAGCCCAAGCAGACTCAAAATCTTTTTGATGCAAAATTAGTAAAAATGATGAAAACAAAAAGAAAATCTGTATAGTTGTATCTTACCTTGAAGTTATCAGTGGGAAATTATGAGCTGTTAGTCTTGACAGTTAACAATTTACTGGATCACCAAGACCTACGAATCCAGCTAGCAGCAGCTTTATACGATTCCTTCCTCGCCTTGAAGGGTTAAAAAGTTTGTCGTGCAAGGCGGAAATGAACTTTGTAAGCTAGTCACAGTGACTTCTTACTTTTTCTCTTGAATACCAATAAAATTACAGTAATCAAAGATCACTGATCTGGTGGAACACAATTTTGATACAAAATGTTCTCCCGGCACATACCAACTTTCAAATTGGTCCTCTAGTCAGTCAGATTTCAATCACTAATCTAGTGGAACACAAAATTTGATAGAAATGATAAAATTAAGCATGCTGGGGGAAAACGAATCTTTTCTGAGAAAGCAATTCAACTTTAATAACTCGGTGGCCGGCAAAACGAGGAGAATATTCGTCCTAAATACAAAGCCACCAAGGAGATAAATTCCCACGGACAACAATTTATCTGGGATGAGTATAGTTTGTTCAATCACTTGCAGAGAACATTGTAGCTCATATGACTAACCTCAAGTGGTTAAGAAATCGAAAACGATGATTTGGCCATTCAGAGTGGAAGAGGAAGGGGAATTGAAAAACAGATACGAAAATACCGAATACTCTCCCGATCCAAGAGAATTACGCCTTCAAGAAGTCGAGCAAATCCCTGTCAACGGTGCATCGCCTGTGCAGGCGAGGACTGAGGTCCGGAATTCATTCTTCGTTTTCTTCTTCCTCACAAGAAGATGA

mRNA sequence

ATGCAGTTTACATCTTCGGTAGGTAACCTAGCAACGCTCTCTACCTTCCATTCTGCATTCAAATCTTACGTTGAAGGCAAAAATTTTACTCCCCCCTTGTTGATTTTCCGTCAGCTACTAAGGTATCGGGTTAAACCTAATGATTCTATCTTCTCCTTACTCATCAAAGCCTTCGTTGTCTCGTCTCCATCTTCTTCTTTTGCATCATCGTCCTGTTCTGAGAATGCAAAAGCGGAGGCGAATCAGCTCCAAACCCACTTCATTAAATGGGGATTTAACCAATTTTTGTATGTTAGTACTGCCTTTCTCGACTTGTACTCAAAATTGGGTTTTGTTAAAGCTGCTCGACGTTTGTTTGATGAATTTTCTGAAAAAGATGTTGTATCGTGGAATGCGTTGATTTCTGGGTACTCACGAAGTGGGTATAGCCATGATGCGTTCGAGCTATTTGTCGAAATGTGCAGAAGGGGGTTTGACCCTTCTCAGAGAACATTGGTAAGTTTAATTCCTTCCTGTGGTACCCAACAATTATTCGTCCAAGGAAAATGCATCCATGCGTTAGGTGTTAAGGCTGGCCTTGATTTAGACTCCCAAGTGAAGAATGCTCTTGCATCGATGTATGGTAAATGTGCAGATTTAGAAGGAGTGGAGCTCTTATTTGGAGAGATCATTGAAAAAAACGTAGTTTCTTGGAATACCATGATTGGGGCATTCGGCCAAAATGGGTTCTTTTTAGAGGCAATGCTTGTTCTCAAGCAAATGCTTGAGGAAAGATTCAATGCTAACTCGGTTACTATGGTGAGTATCTTGTCTGCAAATGCAAATCCAGGATCTATCCATTGTTATGCTACTAAAACTGGTCTTGTGGAAAATGTTTCCGTGGTTACCTCTCTAGTTTGCTCCTACGTAAGATGTGGGTGCATACAAATGGCAGAACTGATTTATATGTCAAAACTCAAGAGAAACTTGGTTGCATTAACTGCGATTATTTCTAGCTATGCTGAGAAAGGTGACATGGGATCTGTGGTGAAGCTATATTCCCGAGTACAGCATTTAGATATGAAACTGGATGCAGTTGCAATGGTTGGCATAATCCAAGGTCTTACATATCCTCATCACGTTGGCATTGGACTTGCTTTCCACGGTTATGGCCTAAAGAGTGGGCTAATTATTGATTGTTTGGTTGCAAATGGTTTCATAAGGATGTATTCAAAGTTCGATAATATTGATGCAGTGTTTTCTTTATTTCAAGAGATGCATGAAAAGACACAGAGCAGCTGGAATTCTGTGATATCTAGTTGTACACAGGCAGGAAGGTCAATTGATGCCATGGCTTTGTTTTCCCAAATGATGTTGTCAGGTTATGGGCCAGATTCAATTACACTTGCTAGTTTACTATCTGCTTGTTGCCAAAATGGGAATTTGCATTCTGGGGAGATACTTCATTGCTATATTCTAAGAAACAATCTGGACTTGGAGGGTTTTGTTGGGACTGCTCTTATAGACATGTACGTCAAGTGTGGAAGAATAGACTTAGCTGAAAAGGTGTTTAAGAGCATGAAAGAGCCATGTTTAGCTTCATGGAACTCGCTGATCTCTGGTTATGGTTTATTTGGGTTTGACAATCATGCTCTCCTTTGTTACACTAAAATGATGGAGAAGGGGATAAAACCCAATAAAATCACTTTCTCAGGAATTTTAGCTGCTTGTACTCATGGAGGACTTGTTGAAAACGGTAGAACATACTTCAAAACCATGAAGAAAGAATTTGGTATCGTGCCCGAATCACAGCATTGTGCATCCATGGTTGGCCTGCTTGGTCGGGCAGGATTATTTGAAGAGGCAATTGAATTTATCAAGAACATGGATATCAATCCAGATTCTGCAGTGTGGGGAGCATTGCTCAGTGCTTGTTGCATTCACCAGGAAGTTAAGCTTGGGGAATCTGTGGCGAAAAAGTTGCTTTTCTCTAACTGTAGAAATGGGGGTTTTTTTGTGTT
GATGTCAAATCTTTATGCAGCATCAGGGAGGTGGAATGATGTAGCAAGAATCAGAAAGATGATGCGAGAAATGGGAGAAGATGGTTGTTCAGTGGTTAAGAAATCGAAAACGATGATTTGGCCATTCAGAGTGGAAGAGGAAGGGGAATTGAAAAACAGATACGAAAATACCGAATACTCTCCCGATCCAAGAGAATTACGCCTTCAAGAAGTCGAGCAAATCCCTGTCAACGGTGCATCGCCTGTGCAGGCGAGGACTGAGGTCCGGAATTCATTCTTCGTTTTCTTCTTCCTCACAAGAAGATGA

Coding sequence (CDS)

ATGCAGTTTACATCTTCGGTAGGTAACCTAGCAACGCTCTCTACCTTCCATTCTGCATTCAAATCTTACGTTGAAGGCAAAAATTTTACTCCCCCCTTGTTGATTTTCCGTCAGCTACTAAGGTATCGGGTTAAACCTAATGATTCTATCTTCTCCTTACTCATCAAAGCCTTCGTTGTCTCGTCTCCATCTTCTTCTTTTGCATCATCGTCCTGTTCTGAGAATGCAAAAGCGGAGGCGAATCAGCTCCAAACCCACTTCATTAAATGGGGATTTAACCAATTTTTGTATGTTAGTACTGCCTTTCTCGACTTGTACTCAAAATTGGGTTTTGTTAAAGCTGCTCGACGTTTGTTTGATGAATTTTCTGAAAAAGATGTTGTATCGTGGAATGCGTTGATTTCTGGGTACTCACGAAGTGGGTATAGCCATGATGCGTTCGAGCTATTTGTCGAAATGTGCAGAAGGGGGTTTGACCCTTCTCAGAGAACATTGGTAAGTTTAATTCCTTCCTGTGGTACCCAACAATTATTCGTCCAAGGAAAATGCATCCATGCGTTAGGTGTTAAGGCTGGCCTTGATTTAGACTCCCAAGTGAAGAATGCTCTTGCATCGATGTATGGTAAATGTGCAGATTTAGAAGGAGTGGAGCTCTTATTTGGAGAGATCATTGAAAAAAACGTAGTTTCTTGGAATACCATGATTGGGGCATTCGGCCAAAATGGGTTCTTTTTAGAGGCAATGCTTGTTCTCAAGCAAATGCTTGAGGAAAGATTCAATGCTAACTCGGTTACTATGGTGAGTATCTTGTCTGCAAATGCAAATCCAGGATCTATCCATTGTTATGCTACTAAAACTGGTCTTGTGGAAAATGTTTCCGTGGTTACCTCTCTAGTTTGCTCCTACGTAAGATGTGGGTGCATACAAATGGCAGAACTGATTTATATGTCAAAACTCAAGAGAAACTTGGTTGCATTAACTGCGATTATTTCTAGCTATGCTGAGAAAGGTGACATGGGATCTGTGGTGAAGCTATATTCCCGAGTACAGCATTTAGATATGAAACTGGATGCAGTTGCAATGGTTGGCATAATCCAAGGTCTTACATATCCTCATCACGTTGGCATTGGACTTGCTTTCCACGGTTATGGCCTAAAGAGTGGGCTAATTATTGATTGTTTGGTTGCAAATGGTTTCATAAGGATGTATTCAAAGTTCGATAATATTGATGCAGTGTTTTCTTTATTTCAAGAGATGCATGAAAAGACACAGAGCAGCTGGAATTCTGTGATATCTAGTTGTACACAGGCAGGAAGGTCAATTGATGCCATGGCTTTGTTTTCCCAAATGATGTTGTCAGGTTATGGGCCAGATTCAATTACACTTGCTAGTTTACTATCTGCTTGTTGCCAAAATGGGAATTTGCATTCTGGGGAGATACTTCATTGCTATATTCTAAGAAACAATCTGGACTTGGAGGGTTTTGTTGGGACTGCTCTTATAGACATGTACGTCAAGTGTGGAAGAATAGACTTAGCTGAAAAGGTGTTTAAGAGCATGAAAGAGCCATGTTTAGCTTCATGGAACTCGCTGATCTCTGGTTATGGTTTATTTGGGTTTGACAATCATGCTCTCCTTTGTTACACTAAAATGATGGAGAAGGGGATAAAACCCAATAAAATCACTTTCTCAGGAATTTTAGCTGCTTGTACTCATGGAGGACTTGTTGAAAACGGTAGAACATACTTCAAAACCATGAAGAAAGAATTTGGTATCGTGCCCGAATCACAGCATTGTGCATCCATGGTTGGCCTGCTTGGTCGGGCAGGATTATTTGAAGAGGCAATTGAATTTATCAAGAACATGGATATCAATCCAGATTCTGCAGTGTGGGGAGCATTGCTCAGTGCTTGTTGCATTCACCAGGAAGTTAAGCTTGGGGAATCTGTGGCGAAAAAGTTGCTTTTCTCTAACTGTAGAAATGGGGGTTTTTTTGTGTT
GATGTCAAATCTTTATGCAGCATCAGGGAGGTGGAATGATGTAGCAAGAATCAGAAAGATGATGCGAGAAATGGGAGAAGATGGTTGTTCAGTGGTTAAGAAATCGAAAACGATGATTTGGCCATTCAGAGTGGAAGAGGAAGGGGAATTGAAAAACAGATACGAAAATACCGAATACTCTCCCGATCCAAGAGAATTACGCCTTCAAGAAGTCGAGCAAATCCCTGTCAACGGTGCATCGCCTGTGCAGGCGAGGACTGAGGTCCGGAATTCATTCTTCGTTTTCTTCTTCCTCACAAGAAGATGA

Protein sequence

MQFTSSVGNLATLSTFHSAFKSYVEGKNFTPPLLIFRQLLRYRVKPNDSIFSLLIKAFVVSSPSSSFASSSCSENAKAEANQLQTHFIKWGFNQFLYVSTAFLDLYSKLGFVKAARRLFDEFSEKDVVSWNALISGYSRSGYSHDAFELFVEMCRRGFDPSQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQVKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFLEAMLVLKQMLEERFNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQMAELIYMSKLKRNLVALTAIISSYAEKGDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGLTYPHHVGIGLAFHGYGLKSGLIIDCLVANGFIRMYSKFDNIDAVFSLFQEMHEKTQSSWNSVISSCTQAGRSIDAMALFSQMMLSGYGPDSITLASLLSACCQNGNLHSGEILHCYILRNNLDLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVENGRTYFKTMKKEFGIVPESQHCASMVGLLGRAGLFEEAIEFIKNMDINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSVVKKSKTMIWPFRVEEEGELKNRYENTEYSPDPRELRLQEVEQIPVNGASPVQARTEVRNSFFVFFFLTRR
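The protein sequence above corresponds to a codon-by-codon translation of the coding sequence. The following is a minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1) and a sequence that starts in frame. The function name `translate` is illustrative only, and only the first ten codons of the CDS are used here.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS listed above.
print(translate("ATGCAGTTTACATCTTCGGTAGGTAACCTA"))  # MQFTSSVGNL
```

The result matches the first ten residues of the protein sequence (MQFTSSVGNL).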
Homology
BLAST of Cla97C01G020740 vs. NCBI nr
Match: XP_038882792.1 (pentatricopeptide repeat-containing protein At2g04860 [Benincasa hispida])

HSP 1 Score: 1291.2 bits (3340), Expect = 0.0e+00
Identity = 648/701 (92.44%), Positives = 666/701 (95.01%), Query Frame = 0

Query: 1   MQFTSSVGNLAT--LSTFHSAFKSYVEGKNFTPPLLIFRQLLRYRVKPNDSIFSLLIKAF 60
           MQFTSSVG+LAT  LSTFHSAFKSYVEGKNFTPPLL+FRQLLRY +KPNDS FSLLIKAF
Sbjct: 1   MQFTSSVGHLATLSLSTFHSAFKSYVEGKNFTPPLLLFRQLLRYPIKPNDSTFSLLIKAF 60

Query: 61  VVSSPSSSFASSSCSENAKAEANQLQTHFIKWGFNQFLYVSTAFLDLYSKLGFVKAARRL 120
           VVSS SSSFA SSCSENAKAEANQLQTHFIKWGF+QFLYVSTAFLDLYSKLGFVKAA+ L
Sbjct: 61  VVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGFVKAAQHL 120

Query: 121 FDEFSEKDVVSWNALISGYSRSGYSHDAFELFVEMCRRGFDPSQRTLVSLIPSCGTQQLF 180
           FDEF EKDVVSWNALISGYSRSGYSHDAF+LFVEM RRGFDP QRTLVSLIPSCGTQQLF
Sbjct: 121 FDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLF 180

Query: 181 VQGKCIHALGVKAGLDLDSQVKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAF 240
           VQGKCIHALGVKAGLDLDSQVKN LASMYGKCADLE VELLFGE IEKNVVSWNTMIGAF
Sbjct: 181 VQGKCIHALGVKAGLDLDSQVKNVLASMYGKCADLEAVELLFGETIEKNVVSWNTMIGAF 240

Query: 241 GQNGFFLEAMLVLKQMLEERFNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL 300
            QNGFFLEAMLV KQMLEER NANSVTMVSILSANANPG IHCYATKTGLVEN+SVV SL
Sbjct: 241 DQNGFFLEAMLVFKQMLEERVNANSVTMVSILSANANPGCIHCYATKTGLVENLSVVVSL 300

Query: 301 VCSYVRCGCIQMAELIYMSKLKRNLVALTAIISSYAEKGDMGSVVKLYSRVQHLDMKLDA 360
           VCSYV CGCIQ+AELIYMSKL++NLVALTAIISSYAEKGDMGSVVKLYSR+QHLDMKLDA
Sbjct: 301 VCSYVSCGCIQIAELIYMSKLQKNLVALTAIISSYAEKGDMGSVVKLYSRMQHLDMKLDA 360

Query: 361 VAMVGIIQGLTYPHHVGIGLAFHGYGLKSGLIIDCLVANGFIRMYSKFDNIDAVFSLFQE 420
           VAMVGIIQG+TYP H GIGLAFHGYGLKSGLIIDCLVANGFI MYSKFDNIDAVFSLF E
Sbjct: 361 VAMVGIIQGITYPDHFGIGLAFHGYGLKSGLIIDCLVANGFISMYSKFDNIDAVFSLFLE 420

Query: 421 MHEKTQSSWNSVISSCTQAGRSIDAMALFSQMMLSGYGPDSITLASLLSACCQNGNLHSG 480
           MH KT SSWNSVISSC QAGRSIDAMALFSQMMLSGYGPDSITLASLLSACCQNGNLH G
Sbjct: 421 MHRKTLSSWNSVISSCAQAGRSIDAMALFSQMMLSGYGPDSITLASLLSACCQNGNLHFG 480

Query: 481 EILHCYILRNNLDLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLF 540
           EILHCYILRNNLDLE FVGTALIDMYVKCGRIDLAEKVFKSMK+PCLASWNSLISGYGLF
Sbjct: 481 EILHCYILRNNLDLENFVGTALIDMYVKCGRIDLAEKVFKSMKDPCLASWNSLISGYGLF 540

Query: 541 GFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVENGRTYFKTMKKEFGIVPESQH 600
           GFDNHAL CYTKMMEKGIKPNKITFSG+LAACTHGGLVE GRTYFK MKKEFGIVPESQH
Sbjct: 541 GFDNHALRCYTKMMEKGIKPNKITFSGLLAACTHGGLVEEGRTYFKIMKKEFGIVPESQH 600

Query: 601 CASMVGLLGRAGLFEEAIEFIKNMDINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSN 660
           CASMVGLLGRAGLFEEAI FIKNM+INPDSAVWGALLSACCIHQEVKLGESVAKKL FSN
Sbjct: 601 CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLFFSN 660

Query: 661 CRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSVV 700
           CRNGGFFVLMSNLYAASGRWNDVA++RKMMREMG+DG S V
Sbjct: 661 CRNGGFFVLMSNLYAASGRWNDVAKVRKMMREMGDDGYSGV 701

BLAST of Cla97C01G020740 vs. NCBI nr
Match: XP_023543683.1 (pentatricopeptide repeat-containing protein At2g04860 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1275.4 bits (3299), Expect = 0.0e+00
Identity = 639/701 (91.16%), Positives = 663/701 (94.58%), Query Frame = 0

Query: 1   MQFTSSVGNLATL--STFHSAFKSYVEGKNFTPPLLIFRQLLRYRVKPNDSIFSLLIKAF 60
           MQFTSSVG+LA+L  STFHSAFKSYVEGK  TPPLL+FRQLLRYRVKPNDS FSLLIKAF
Sbjct: 1   MQFTSSVGHLASLSHSTFHSAFKSYVEGKISTPPLLVFRQLLRYRVKPNDSTFSLLIKAF 60

Query: 61  VVSSPSSSFASSSCSENAKAEANQLQTHFIKWGFNQFLYVSTAFLDLYSKLGFVKAARRL 120
           VVSS SSSFA SSCSENAK EANQLQTHFIKWGF+QFLYVSTAFLDLYSKLGFVKAARRL
Sbjct: 61  VVSSSSSSFAPSSCSENAKVEANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGFVKAARRL 120

Query: 121 FDEFSEKDVVSWNALISGYSRSGYSHDAFELFVEMCRRGFDPSQRTLVSLIPSCGTQQLF 180
           FD+  EKDVVSWNALISGYSRSGY+HDAFELFVEM RRGF+P QRTLVSLIPSCGTQ LF
Sbjct: 121 FDDIPEKDVVSWNALISGYSRSGYNHDAFELFVEMRRRGFNPCQRTLVSLIPSCGTQHLF 180

Query: 181 VQGKCIHALGVKAGLDLDSQVKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAF 240
           VQGKCIHALGVKAGLDLDSQVKN+LASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAF
Sbjct: 181 VQGKCIHALGVKAGLDLDSQVKNSLASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAF 240

Query: 241 GQNGFFLEAMLVLKQMLEERFNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL 300
           GQNGFF+EAMLV KQMLE   N NSVTMVSILSANANP SIHCYATKTGL+ENVSVV SL
Sbjct: 241 GQNGFFVEAMLVFKQMLEGSINTNSVTMVSILSANANPRSIHCYATKTGLMENVSVVISL 300

Query: 301 VCSYVRCGCIQMAELIYMSKLKRNLVALTAIISSYAEKGDMGSVVKLYSRVQHLDMKLDA 360
           +CSYV+CGCIQ+AELIYMSKL++NLVALTAIIS YAEKGDMGSVVKLYSRVQHL+MKLDA
Sbjct: 301 ICSYVKCGCIQIAELIYMSKLQKNLVALTAIISGYAEKGDMGSVVKLYSRVQHLEMKLDA 360

Query: 361 VAMVGIIQGLTYPHHVGIGLAFHGYGLKSGLIIDCLVANGFIRMYSKFDNIDAVFSLFQE 420
           VAMVGIIQG+TYP H GIGLAFHGYGLKSGLIIDCLVANGFI MYSKFD+IDAVF+LFQE
Sbjct: 361 VAMVGIIQGITYPDHSGIGLAFHGYGLKSGLIIDCLVANGFISMYSKFDDIDAVFTLFQE 420

Query: 421 MHEKTQSSWNSVISSCTQAGRSIDAMALFSQMMLSGYGPDSITLASLLSACCQNGNLHSG 480
           MHEKT SSWNSVISSC QAGRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLH G
Sbjct: 421 MHEKTLSSWNSVISSCAQAGRSIDAMALFSQMKLSGYGPDSITLASLLSACCQNGNLHFG 480

Query: 481 EILHCYILRNNLDLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLF 540
           EILH YILRNNLDLEGFVGTALIDMYVKCGR+D AE VFKSMKEPCLASWNSLISGYGLF
Sbjct: 481 EILHSYILRNNLDLEGFVGTALIDMYVKCGRMDFAENVFKSMKEPCLASWNSLISGYGLF 540

Query: 541 GFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVENGRTYFKTMKKEFGIVPESQH 600
           GFDNHA LCYT MMEKGIKPNKITFSGILAACTHGGLVE GRTYF+ MKKE GIVPESQH
Sbjct: 541 GFDNHAFLCYTTMMEKGIKPNKITFSGILAACTHGGLVEEGRTYFEIMKKELGIVPESQH 600

Query: 601 CASMVGLLGRAGLFEEAIEFIKNMDINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSN 660
           CASMVGLLGRAGLFEEAI FIKNM++NPDSAVWGALLSACCIHQEVKLGESVAKKLLFSN
Sbjct: 601 CASMVGLLGRAGLFEEAILFIKNMEVNPDSAVWGALLSACCIHQEVKLGESVAKKLLFSN 660

Query: 661 CRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSVV 700
           CRNGGFFVLMSNLYAASGRWNDVAR+RKMMREMGEDGCS V
Sbjct: 661 CRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSGV 701

BLAST of Cla97C01G020740 vs. NCBI nr
Match: XP_022950030.1 (pentatricopeptide repeat-containing protein At2g04860 isoform X1 [Cucurbita moschata])

HSP 1 Score: 1266.1 bits (3275), Expect = 0.0e+00
Identity = 635/701 (90.58%), Positives = 663/701 (94.58%), Query Frame = 0

Query: 1   MQFTSSVGNLA--TLSTFHSAFKSYVEGKNFTPPLLIFRQLLRYRVKPNDSIFSLLIKAF 60
           MQFTSSVG+LA  +LSTFHSAFKSYVEGK  TPPLL+FRQLLR RVKPNDS FSLLIKAF
Sbjct: 1   MQFTSSVGHLASLSLSTFHSAFKSYVEGKISTPPLLVFRQLLRCRVKPNDSTFSLLIKAF 60

Query: 61  VVSSPSSSFASSSCSENAKAEANQLQTHFIKWGFNQFLYVSTAFLDLYSKLGFVKAARRL 120
           VVSS SSSFA  SCSENAKAEANQLQ HFIKWGF+QFLYVSTAFLDLYSKLGFVKAARRL
Sbjct: 61  VVSSSSSSFAPLSCSENAKAEANQLQIHFIKWGFDQFLYVSTAFLDLYSKLGFVKAARRL 120

Query: 121 FDEFSEKDVVSWNALISGYSRSGYSHDAFELFVEMCRRGFDPSQRTLVSLIPSCGTQQLF 180
           FD+  EKDVVSWNALISGYSRSGY+HDAFELFVEM RRGF+P QRTLVSLIPSCGTQ LF
Sbjct: 121 FDDIPEKDVVSWNALISGYSRSGYNHDAFELFVEMRRRGFNPCQRTLVSLIPSCGTQHLF 180

Query: 181 VQGKCIHALGVKAGLDLDSQVKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAF 240
            QGKCIHALGVKAGLDLDSQVKN+LASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAF
Sbjct: 181 AQGKCIHALGVKAGLDLDSQVKNSLASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAF 240

Query: 241 GQNGFFLEAMLVLKQMLEERFNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL 300
           GQNGFF+EAMLV KQMLEER + NSVTMVSILSANANP SIHCYATKTGLVENVSVVTSL
Sbjct: 241 GQNGFFVEAMLVFKQMLEERIDTNSVTMVSILSANANPRSIHCYATKTGLVENVSVVTSL 300

Query: 301 VCSYVRCGCIQMAELIYMSKLKRNLVALTAIISSYAEKGDMGSVVKLYSRVQHLDMKLDA 360
           +CSYVRCGCIQ+AELIYMSKL++NLVALTAIIS YAEKGDMGSVVKLYSRVQHL+MKLDA
Sbjct: 301 ICSYVRCGCIQIAELIYMSKLQKNLVALTAIISGYAEKGDMGSVVKLYSRVQHLEMKLDA 360

Query: 361 VAMVGIIQGLTYPHHVGIGLAFHGYGLKSGLIIDCLVANGFIRMYSKFDNIDAVFSLFQE 420
           VAMVGIIQG+TYP H GIGLAFHGYGLKSGLIIDCLVANGFI MYS+FD+IDAVFSLFQE
Sbjct: 361 VAMVGIIQGITYPDHSGIGLAFHGYGLKSGLIIDCLVANGFISMYSRFDDIDAVFSLFQE 420

Query: 421 MHEKTQSSWNSVISSCTQAGRSIDAMALFSQMMLSGYGPDSITLASLLSACCQNGNLHSG 480
           M EKT SSWNSVISSC QAGRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLH G
Sbjct: 421 MREKTLSSWNSVISSCAQAGRSIDAMALFSQMKLSGYGPDSITLASLLSACCQNGNLHFG 480

Query: 481 EILHCYILRNNLDLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLF 540
           EI+H YILRNNLDLEGFVGTALIDMYVKCGR+D AEKVFKSMKEPCLASWNSLISGYGLF
Sbjct: 481 EIIHSYILRNNLDLEGFVGTALIDMYVKCGRMDFAEKVFKSMKEPCLASWNSLISGYGLF 540

Query: 541 GFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVENGRTYFKTMKKEFGIVPESQH 600
           GF+NHA LCYTKM+EKGIKPNKITFSGILAACTHGGLVE GRTYF+ MKKE GIVPESQH
Sbjct: 541 GFNNHAFLCYTKMLEKGIKPNKITFSGILAACTHGGLVEEGRTYFEIMKKELGIVPESQH 600

Query: 601 CASMVGLLGRAGLFEEAIEFIKNMDINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSN 660
           CASMVGLLGRAGLFEEAI FIKNM++NPDSAVWGALL+ACCIHQEVKLGESVAK+LLFSN
Sbjct: 601 CASMVGLLGRAGLFEEAILFIKNMEVNPDSAVWGALLNACCIHQEVKLGESVAKRLLFSN 660

Query: 661 CRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSVV 700
            RNGGFFVLMSNLYAASGRWNDVAR+RKMMREMGEDGCS V
Sbjct: 661 SRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSGV 701

BLAST of Cla97C01G020740 vs. NCBI nr
Match: KAG6603840.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] >KAG7034021.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1265.8 bits (3274), Expect = 0.0e+00
Identity = 635/701 (90.58%), Positives = 663/701 (94.58%), Query Frame = 0

Query: 1   MQFTSSVGNLATL--STFHSAFKSYVEGKNFTPPLLIFRQLLRYRVKPNDSIFSLLIKAF 60
           MQFTSSVG+LA+L  STFHSAFKSYVEGK  TPPLL+FRQLLR RVKPNDS FSLLIKAF
Sbjct: 1   MQFTSSVGHLASLSHSTFHSAFKSYVEGKISTPPLLVFRQLLRCRVKPNDSTFSLLIKAF 60

Query: 61  VVSSPSSSFASSSCSENAKAEANQLQTHFIKWGFNQFLYVSTAFLDLYSKLGFVKAARRL 120
           VVSS SSSFA  SCSENAKAEANQLQ HFIKWGF++FLYVSTAFLDLYSKLGFVKAARRL
Sbjct: 61  VVSSSSSSFAPLSCSENAKAEANQLQIHFIKWGFDRFLYVSTAFLDLYSKLGFVKAARRL 120

Query: 121 FDEFSEKDVVSWNALISGYSRSGYSHDAFELFVEMCRRGFDPSQRTLVSLIPSCGTQQLF 180
           FD+  EKDVVSWNALISGYSRSGY+HDAFELFVEM RRGF+P QRTLVSLIPSCGTQ LF
Sbjct: 121 FDDIPEKDVVSWNALISGYSRSGYNHDAFELFVEMRRRGFNPCQRTLVSLIPSCGTQHLF 180

Query: 181 VQGKCIHALGVKAGLDLDSQVKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAF 240
           VQGKCIHALGVKAGLDLDSQVKN LASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAF
Sbjct: 181 VQGKCIHALGVKAGLDLDSQVKNTLASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAF 240

Query: 241 GQNGFFLEAMLVLKQMLEERFNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL 300
           GQNGFF+EAMLV KQMLEER + NSVTMVSILSANANP SIHCYATKTGLVENVSVVTSL
Sbjct: 241 GQNGFFVEAMLVFKQMLEERIDTNSVTMVSILSANANPRSIHCYATKTGLVENVSVVTSL 300

Query: 301 VCSYVRCGCIQMAELIYMSKLKRNLVALTAIISSYAEKGDMGSVVKLYSRVQHLDMKLDA 360
           +CSYVRCGCIQ+AELIYMSKL++NLVALTAIIS YAEKGDMGSVVKLYSRVQHL+MKLDA
Sbjct: 301 ICSYVRCGCIQIAELIYMSKLQKNLVALTAIISGYAEKGDMGSVVKLYSRVQHLEMKLDA 360

Query: 361 VAMVGIIQGLTYPHHVGIGLAFHGYGLKSGLIIDCLVANGFIRMYSKFDNIDAVFSLFQE 420
           VAMVGIIQG+TYP H GIGLAFHGYGLKSGLIIDCLVANGFI MYS+FD+IDAVFSLFQE
Sbjct: 361 VAMVGIIQGITYPDHSGIGLAFHGYGLKSGLIIDCLVANGFISMYSRFDDIDAVFSLFQE 420

Query: 421 MHEKTQSSWNSVISSCTQAGRSIDAMALFSQMMLSGYGPDSITLASLLSACCQNGNLHSG 480
           M EKT SSWNSVISSC QAGRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLH G
Sbjct: 421 MREKTLSSWNSVISSCAQAGRSIDAMALFSQMKLSGYGPDSITLASLLSACCQNGNLHFG 480

Query: 481 EILHCYILRNNLDLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLF 540
           EI+H YILRNNLDLEGFVGTALIDMYVKCGR+D AEKVFKSMKEPCLASWNSLISGYGLF
Sbjct: 481 EIIHSYILRNNLDLEGFVGTALIDMYVKCGRMDFAEKVFKSMKEPCLASWNSLISGYGLF 540

Query: 541 GFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVENGRTYFKTMKKEFGIVPESQH 600
           GF+NHA LCYTKM+EKGIKPNKITFSGILAACTHGGLVE GRTYF+ MKKE GIVPESQH
Sbjct: 541 GFNNHAFLCYTKMLEKGIKPNKITFSGILAACTHGGLVEEGRTYFEIMKKELGIVPESQH 600

Query: 601 CASMVGLLGRAGLFEEAIEFIKNMDINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSN 660
           CASMVGLLGRAGLFEEAI FIKNM++NPDSAVWGALL+ACCIHQEVKLGESVAK+LLFSN
Sbjct: 601 CASMVGLLGRAGLFEEAILFIKNMEVNPDSAVWGALLNACCIHQEVKLGESVAKRLLFSN 660

Query: 661 CRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSVV 700
            RNGGFFVLMSNLYAASGRWNDVAR+RKMMREMGEDGCS V
Sbjct: 661 SRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSGV 701

BLAST of Cla97C01G020740 vs. NCBI nr
Match: XP_022977696.1 (pentatricopeptide repeat-containing protein At2g04860 isoform X1 [Cucurbita maxima])

HSP 1 Score: 1264.6 bits (3271), Expect = 0.0e+00
Identity = 631/701 (90.01%), Positives = 662/701 (94.44%), Query Frame = 0

Query: 1   MQFTSSVGNLA--TLSTFHSAFKSYVEGKNFTPPLLIFRQLLRYRVKPNDSIFSLLIKAF 60
           MQFTSSVG+LA  +LSTFHSAFKSYVEGK  TPPLL+FRQLLR RVKPNDS FSLLIKAF
Sbjct: 1   MQFTSSVGHLASLSLSTFHSAFKSYVEGKISTPPLLVFRQLLRCRVKPNDSTFSLLIKAF 60

Query: 61  VVSSPSSSFASSSCSENAKAEANQLQTHFIKWGFNQFLYVSTAFLDLYSKLGFVKAARRL 120
           VVSS SSSFA SSCSENA+AEANQLQTHFIKWGF+QFLYVSTAFLDLYSKLGFVKAARRL
Sbjct: 61  VVSSSSSSFAPSSCSENAEAEANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGFVKAARRL 120

Query: 121 FDEFSEKDVVSWNALISGYSRSGYSHDAFELFVEMCRRGFDPSQRTLVSLIPSCGTQQLF 180
           FD+  EKDVVSWNALISGYSRSG++HD FELFVEM RRGF+P QRTLVSLIPSCGTQ LF
Sbjct: 121 FDDIPEKDVVSWNALISGYSRSGFNHDTFELFVEMRRRGFNPCQRTLVSLIPSCGTQHLF 180

Query: 181 VQGKCIHALGVKAGLDLDSQVKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAF 240
           VQGKCIHALGVKAGLDLDSQVKN+LASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAF
Sbjct: 181 VQGKCIHALGVKAGLDLDSQVKNSLASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAF 240

Query: 241 GQNGFFLEAMLVLKQMLEERFNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL 300
           GQNGFF+EAMLV KQMLEE  N +SVTMVSILSANANP SIHCYATKTGL+ENVSVVTSL
Sbjct: 241 GQNGFFVEAMLVFKQMLEESINTSSVTMVSILSANANPRSIHCYATKTGLMENVSVVTSL 300

Query: 301 VCSYVRCGCIQMAELIYMSKLKRNLVALTAIISSYAEKGDMGSVVKLYSRVQHLDMKLDA 360
           +CSYV+CGCI +AE IYMSKL++NLVALTAIIS YAEKGDMG+VVKLYSRVQHL+MKLDA
Sbjct: 301 ICSYVKCGCIHIAEQIYMSKLQKNLVALTAIISGYAEKGDMGAVVKLYSRVQHLEMKLDA 360

Query: 361 VAMVGIIQGLTYPHHVGIGLAFHGYGLKSGLIIDCLVANGFIRMYSKFDNIDAVFSLFQE 420
           VAMVGIIQG+TYP H GIGL+FHGYGLKSGLIIDCLVANGFI MYS+FD+IDAVFSLFQE
Sbjct: 361 VAMVGIIQGITYPDHSGIGLSFHGYGLKSGLIIDCLVANGFISMYSRFDDIDAVFSLFQE 420

Query: 421 MHEKTQSSWNSVISSCTQAGRSIDAMALFSQMMLSGYGPDSITLASLLSACCQNGNLHSG 480
           MHEKT SSWNSVISSC QAGRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLH G
Sbjct: 421 MHEKTLSSWNSVISSCAQAGRSIDAMALFSQMKLSGYGPDSITLASLLSACCQNGNLHFG 480

Query: 481 EILHCYILRNNLDLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLF 540
           EILH YILRNNLDLEGFVGTALIDMYVKCGR+D AEKVFKSMKEPCLASWNS+ISGYGLF
Sbjct: 481 EILHSYILRNNLDLEGFVGTALIDMYVKCGRMDFAEKVFKSMKEPCLASWNSMISGYGLF 540

Query: 541 GFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVENGRTYFKTMKKEFGIVPESQH 600
           GFDNH  LCYTKMMEKGIKPNKITFSGILAACTHGGLVE GRTYF+ MKKE GIVPESQH
Sbjct: 541 GFDNHVFLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRTYFEIMKKELGIVPESQH 600

Query: 601 CASMVGLLGRAGLFEEAIEFIKNMDINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSN 660
           CASMVGLLGRAGLFEEAI FIKNM++NPDSAVWGA LSACCIHQEVKLGESVAKKLLFSN
Sbjct: 601 CASMVGLLGRAGLFEEAILFIKNMEVNPDSAVWGAFLSACCIHQEVKLGESVAKKLLFSN 660

Query: 661 CRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSVV 700
           CRNGGFFVLMSNLYAASGRWNDVA++RKMMREMGEDGCS V
Sbjct: 661 CRNGGFFVLMSNLYAASGRWNDVAKVRKMMREMGEDGCSGV 701

BLAST of Cla97C01G020740 vs. ExPASy Swiss-Prot
Match: Q9SJ73 (Pentatricopeptide repeat-containing protein At2g04860 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E74 PE=2 SV=3)

HSP 1 Score: 696.8 bits (1797), Expect = 2.7e-199
Identity = 356/696 (51.15%), Positives = 472/696 (67.82%), Query Frame = 0

Query: 1   MQFTSSVGNLATLSTFHSAFKSYVEGKNFTPPLLIFRQLLRYRVKPNDSIFSLLIKAFVV 60
           M+ T  +     LS FHS  KS + G+  + P+ IFR LLR  + PN    S+ ++A   
Sbjct: 1   MRITKPITLYRDLSYFHSLLKSCIHGEISSSPITIFRDLLRSSLTPNHFTMSIFLQA--- 60

Query: 61  SSPSSSFASSSCSENAKAEANQLQTHFIKWGFNQFLYVSTAFLDLYSKLGFVKAARRLFD 120
              ++SF S       K +  Q+QTH  K G ++F+YV T+ L+LY K G V +A+ LFD
Sbjct: 61  --TTTSFNS------FKLQVEQVQTHLTKSGLDRFVYVKTSLLNLYLKKGCVTSAQMLFD 120

Query: 121 EFSEKDVVSWNALISGYSRSGYSHDAFELFVEMCRRGFDPSQRTLVSLIPSCGTQQLFVQ 180
           E  E+D V WNALI GYSR+GY  DA++LF+ M ++GF PS  TLV+L+P CG      Q
Sbjct: 121 EMPERDTVVWNALICGYSRNGYECDAWKLFIVMLQQGFSPSATTLVNLLPFCGQCGFVSQ 180

Query: 181 GKCIHALGVKAGLDLDSQVKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQ 240
           G+ +H +  K+GL+LDSQVKNAL S Y KCA+L   E+LF E+ +K+ VSWNTMIGA+ Q
Sbjct: 181 GRSVHGVAAKSGLELDSQVKNALISFYSKCAELGSAEVLFREMKDKSTVSWNTMIGAYSQ 240

Query: 241 NGFFLEAMLVLKQMLEERFNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSLVC 300
           +G   EA+ V K M E+    + VT++++LSA+ +   +HC   K G+V ++SVVTSLVC
Sbjct: 241 SGLQEEAITVFKNMFEKNVEISPVTIINLLSAHVSHEPLHCLVVKCGMVNDISVVTSLVC 300

Query: 301 SYVRCGCIQMAELIYMSKLKRNLVALTAIISSYAEKGDMGSVVKLYSRVQHLDMKLDAVA 360
           +Y RCGC+  AE +Y S  + ++V LT+I+S YAEKGDM   V  +S+ + L MK+DAVA
Sbjct: 301 AYSRCGCLVSAERLYASAKQDSIVGLTSIVSCYAEKGDMDIAVVYFSKTRQLCMKIDAVA 360

Query: 361 MVGIIQGLTYPHHVGIGLAFHGYGLKSGLIIDCLVANGFIRMYSKFDNIDAVFSLFQEMH 420
           +VGI+ G     H+ IG++ HGY +KSGL    LV NG I MYSKFD+++ V  LF+++ 
Sbjct: 361 LVGILHGCKKSSHIDIGMSLHGYAIKSGLCTKTLVVNGLITMYSKFDDVETVLFLFEQLQ 420

Query: 421 EKTQSSWNSVISSCTQAGRSIDAMALFSQMMLS-GYGPDSITLASLLSACCQNGNLHSGE 480
           E    SWNSVIS C Q+GR+  A  +F QMML+ G  PD+IT+ASLL+ C Q   L+ G+
Sbjct: 421 ETPLISWNSVISGCVQSGRASTAFEVFHQMMLTGGLLPDAITIASLLAGCSQLCCLNLGK 480

Query: 481 ILHCYILRNNLDLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFG 540
            LH Y LRNN + E FV TALIDMY KCG    AE VFKS+K PC A+WNS+ISGY L G
Sbjct: 481 ELHGYTLRNNFENENFVCTALIDMYAKCGNEVQAESVFKSIKAPCTATWNSMISGYSLSG 540

Query: 541 FDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVENGRTYFKTMKKEFGIVPESQHC 600
             + AL CY +M EKG+KP++ITF G+L+AC HGG V+ G+  F+ M KEFGI P  QH 
Sbjct: 541 LQHRALSCYLEMREKGLKPDEITFLGVLSACNHGGFVDEGKICFRAMIKEFGISPTLQHY 600

Query: 601 ASMVGLLGRAGLFEEAIEFIKNMDINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNC 660
           A MVGLLGRA LF EA+  I  MDI PDSAVWGALLSAC IH+E+++GE VA+K+   + 
Sbjct: 601 ALMVGLLGRACLFTEALYLIWKMDIKPDSAVWGALLSACIIHRELEVGEYVARKMFMLDY 660

Query: 661 RNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDG 696
           +NGG +VLMSNLYA    W+DV R+R MM++ G DG
Sbjct: 661 KNGGLYVLMSNLYATEAMWDDVVRVRNMMKDNGYDG 685

BLAST of Cla97C01G020740 vs. ExPASy Swiss-Prot
Match: Q9M9E2 (Pentatricopeptide repeat-containing protein At1g15510, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H73 PE=1 SV=1)

HSP 1 Score: 367.9 bits (943), Expect = 2.9e-100
Identity = 208/646 (32.20%), Positives = 349/646 (54.02%), Query Frame = 0

Query: 98  VSTAFLDLYSKLGFVKAARRLFDEFSEKDVVSWNALISGYSRSGYSHDAFELFVEMC-RR 157
           +  AFL ++ + G +  A  +F + SE+++ SWN L+ GY++ GY  +A  L+  M    
Sbjct: 131 LGNAFLAMFVRFGNLVDAWYVFGKMSERNLFSWNVLVGGYAKQGYFDEAMCLYHRMLWVG 190

Query: 158 GFDPSQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQVKNALASMYGKCADLEGV 217
           G  P   T   ++ +CG      +GK +H   V+ G +LD  V NAL +MY KC D++  
Sbjct: 191 GVKPDVYTFPCVLRTCGGIPDLARGKEVHVHVVRYGYELDIDVVNALITMYVKCGDVKSA 250

Query: 218 ELLFGEIIEKNVVSWNTMIGAFGQNGFFLEAMLVLKQMLEERFNANSVTMVSILSANANP 277
            LLF  +  ++++SWN MI  + +NG   E + +   M     + + +T+ S++SA    
Sbjct: 251 RLLFDRMPRRDIISWNAMISGYFENGMCHEGLELFFAMRGLSVDPDLMTLTSVISACELL 310

Query: 278 G------SIHCYATKTGLVENVSVVTSLVCSYVRCGCIQMAELIYMSKLKRNLVALTAII 337
           G       IH Y   TG   ++SV  SL   Y+  G  + AE ++    ++++V+ T +I
Sbjct: 311 GDRRLGRDIHAYVITTGFAVDISVCNSLTQMYLNAGSWREAEKLFSRMERKDIVSWTTMI 370

Query: 338 SSYAEKGDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGLTYPHHVGIGLAFHGYGLKSGLI 397
           S Y         +  Y  +    +K D + +  ++        +  G+  H   +K+ LI
Sbjct: 371 SGYEYNFLPDKAIDTYRMMDQDSVKPDEITVAAVLSACATLGDLDTGVELHKLAIKARLI 430

Query: 398 IDCLVANGFIRMYSKFDNIDAVFSLFQEMHEKTQSSWNSVISSCTQAGRSIDAMALFSQM 457
              +VAN  I MYSK   ID    +F  +  K   SW S+I+      R  +A+    QM
Sbjct: 431 SYVIVANNLINMYSKCKCIDKALDIFHNIPRKNVISWTSIIAGLRLNNRCFEALIFLRQM 490

Query: 458 MLSGYGPDSITLASLLSACCQNGNLHSGEILHCYILRNNLDLEGFVGTALIDMYVKCGRI 517
            ++   P++ITL + L+AC + G L  G+ +H ++LR  + L+ F+  AL+DMYV+CGR+
Sbjct: 491 KMT-LQPNAITLTAALAACARIGALMCGKEIHAHVLRTGVGLDDFLPNALLDMYVRCGRM 550

Query: 518 DLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAAC 577
           + A   F S K+  + SWN L++GY   G  +  +  + +M++  ++P++ITF  +L  C
Sbjct: 551 NTAWSQFNSQKKD-VTSWNILLTGYSERGQGSMVVELFDRMVKSRVRPDEITFISLLCGC 610

Query: 578 THGGLVENGRTYFKTMKKEFGIVPESQHCASMVGLLGRAGLFEEAIEFIKNMDINPDSAV 637
           +   +V  G  YF  M +++G+ P  +H A +V LLGRAG  +EA +FI+ M + PD AV
Sbjct: 611 SKSQMVRQGLMYFSKM-EDYGVTPNLKHYACVVDLLGRAGELQEAHKFIQKMPVTPDPAV 670

Query: 638 WGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMRE 697
           WGALL+AC IH ++ LGE  A+ +   + ++ G+++L+ NLYA  G+W +VA++R+MM+E
Sbjct: 671 WGALLNACRIHHKIDLGELSAQHIFELDKKSVGYYILLCNLYADCGKWREVAKVRRMMKE 730

Query: 698 MG---EDGCSVVKKSKTMIWPFRVEEEGELKNRYENTEYSPDPREL 734
            G   + GCS         W   VE +G++     + +Y P  +E+
Sbjct: 731 NGLTVDAGCS---------W---VEVKGKVHAFLSDDKYHPQTKEI 761

BLAST of Cla97C01G020740 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 366.7 bits (940), Expect = 6.4e-100
Identity = 207/631 (32.81%), Positives = 329/631 (52.14%), Query Frame = 0

Query: 79  EANQLQTHFIKWGFNQFLYVSTAFLDLYSKLGFVKAARRLFDEFSEKDVVSWNALISGYS 138
           E  Q+     K G  Q  +  T  + L+ + G V  A R+F+    K  V ++ ++ G++
Sbjct: 52  ELRQILPLVFKNGLYQEHFFQTKLVSLFCRYGSVDEAARVFEPIDSKLNVLYHTMLKGFA 111

Query: 139 RSGYSHDAFELFVEMCRRGFDPSQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ 198
           +      A + FV M     +P       L+  CG +     GK IH L VK+G  LD  
Sbjct: 112 KVSDLDKALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIHGLLVKSGFSLDLF 171

Query: 199 VKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFLEAMLVLKQMLEER 258
               L +MY KC  +     +F  + E+++VSWNT++  + QNG    A+ ++K M EE 
Sbjct: 172 AMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMALEMVKSMCEEN 231

Query: 259 FNANSVTMVSILSANAN------PGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQMAE 318
              + +T+VS+L A +          IH YA ++G    V++ T+LV  Y +CG ++ A 
Sbjct: 232 LKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETAR 291

Query: 319 LIYMSKLKRNLVALTAIISSYAEKGDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGLTYPH 378
            ++   L+RN+V+  ++I +Y +  +    + ++ ++    +K   V+++G +       
Sbjct: 292 QLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLG 351

Query: 379 HVGIGLAFHGYGLKSGLIIDCLVANGFIRMYSKFDNIDAVFSLFQEMHEKTQSSWNSVIS 438
            +  G   H   ++ GL  +  V N  I MY K   +D   S+F ++  +T  SWN++I 
Sbjct: 352 DLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMIL 411

Query: 439 SCTQAGRSIDAMALFSQMMLSGYGPDSITLASLLSACCQNGNLHSGEILHCYILRNNLDL 498
              Q GR IDA+  FSQM      PD+ T  S+++A  +    H  + +H  ++R+ LD 
Sbjct: 412 GFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDK 471

Query: 499 EGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMM 558
             FV TAL+DMY KCG I +A  +F  M E  + +WN++I GYG  GF   AL  + +M 
Sbjct: 472 NVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQ 531

Query: 559 EKGIKPNKITFSGILAACTHGGLVENGRTYFKTMKKEFGIVPESQHCASMVGLLGRAGLF 618
           +  IKPN +TF  +++AC+H GLVE G   F  MK+ + I     H  +MV LLGRAG  
Sbjct: 532 KGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRL 591

Query: 619 EEAIEFIKNMDINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLY 678
            EA +FI  M + P   V+GA+L AC IH+ V   E  A++L   N  +GG+ VL++N+Y
Sbjct: 592 NEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIY 651

Query: 679 AASGRWNDVARIRKMMREMG---EDGCSVVK 701
            A+  W  V ++R  M   G     GCS+V+
Sbjct: 652 RAASMWEKVGQVRVSMLRQGLRKTPGCSMVE 682
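
The two summary lines that open each hit (score/E-value, then identity/positives) can be read programmatically. The following is a minimal sketch, assuming the line layout shown in this report; the function name `parse_hsp_summary` and the returned dictionary keys are hypothetical and not part of any BLAST tool.

```python
import re

# Patterns assume the summary layout used in this report, e.g.
#   "HSP 1 Score: 366.7 bits (940), Expect = 6.4e-100"
#   "Identity = 207/631 (32.81%), Positives = 329/631 (52.14%), Query Frame = 0"
SCORE_RE = re.compile(r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)")
# "Pos\w*tives" also accepts the misspelling "Postives" seen in some exports.
FRACTION_RE = re.compile(r"(Identity|Pos\w*tives)\s*=\s*(\d+)/(\d+)")


def parse_hsp_summary(score_line, identity_line):
    """Return bit score, raw score, E-value and recomputed percentages."""
    match = SCORE_RE.search(score_line)
    if match is None:
        raise ValueError("unrecognised score line: %r" % score_line)
    result = {
        "bits": float(match.group(1)),
        "raw_score": int(match.group(2)),
        "evalue": float(match.group(3)),
    }
    for label, matched, length in FRACTION_RE.findall(identity_line):
        key = "identity" if label == "Identity" else "positives"
        # Recompute the percentage from the counts instead of trusting the text.
        result[key + "_pct"] = round(100.0 * int(matched) / int(length), 2)
        result["aligned_length"] = int(length)
    return result


if __name__ == "__main__":
    summary = parse_hsp_summary(
        "HSP 1 Score: 366.7 bits (940), Expect = 6.4e-100",
        "Identity = 207/631 (32.81%), Positives = 329/631 (52.14%), Query Frame = 0",
    )
    print(summary)
```

Recomputing the percentages from the matched/aligned counts gives a quick consistency check against the values printed in the report.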

BLAST of Cla97C01G020740 vs. ExPASy Swiss-Prot
Match: Q0WN60 (Pentatricopeptide repeat-containing protein At1g18485 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H8 PE=2 SV=2)

HSP 1 Score: 363.2 bits (931), Expect = 7.1e-99
Identity = 224/689 (32.51%), Positives = 354/689 (51.38%), Query Frame = 0

Query: 88  IKWGFNQFLYVSTAFLDLYSKLGFVKAARRLFDEFSEKDVVSWNALISGYSRSGYSHDAF 147
           +K G  + ++V  A +  Y   GFV  A +LFD   E+++VSWN++I  +S +G+S ++F
Sbjct: 214 VKTGLVEDVFVGNALVSFYGTHGFVTDALQLFDIMPERNLVSWNSMIRVFSDNGFSEESF 273

Query: 148 ELFVEMCRR----GFDPSQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQVKNAL 207
            L  EM        F P   TLV+++P C  ++    GK +H   VK  LD +  + NAL
Sbjct: 274 LLLGEMMEENGDGAFMPDVATLVTVLPVCAREREIGLGKGVHGWAVKLRLDKELVLNNAL 333

Query: 208 ASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFLEAMLVLKQMLE--ERFNA 267
             MY KC  +   +++F     KNVVSWNTM+G F   G       VL+QML   E   A
Sbjct: 334 MDMYSKCGCITNAQMIFKMNNNKNVVSWNTMVGGFSAEGDTHGTFDVLRQMLAGGEDVKA 393

Query: 268 NSVTMVSIL------SANANPGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQMAELIY 327
           + VT+++ +      S   +   +HCY+ K   V N  V  + V SY +CG +  A+ ++
Sbjct: 394 DEVTILNAVPVCFHESFLPSLKELHCYSLKQEFVYNELVANAFVASYAKCGSLSYAQRVF 453

Query: 328 MSKLKRNLVALTAIISSYAEKGDMGSVVKLYSRVQHLDMKL-----DAVAMVGIIQGLTY 387
                + + +  A+I  +A+  D        S   HL MK+     D+  +  ++   + 
Sbjct: 454 HGIRSKTVNSWNALIGGHAQSND-----PRLSLDAHLQMKISGLLPDSFTVCSLLSACSK 513

Query: 388 PHHVGIGLAFHGYGLKSGLIIDCLVANGFIRMYSKFDNIDAVFSLFQEMHEKTQSSWNSV 447
              + +G   HG+ +++ L  D  V    + +Y     +  V +LF  M +K+  SWN+V
Sbjct: 514 LKSLRLGKEVHGFIIRNWLERDLFVYLSVLSLYIHCGELCTVQALFDAMEDKSLVSWNTV 573

Query: 448 ISSCTQAGRSIDAMALFSQMMLSGYGPDSITLASLLSACCQNGNLHSGEILHCYILRNNL 507
           I+   Q G    A+ +F QM+L G     I++  +  AC    +L  G   H Y L++ L
Sbjct: 574 ITGYLQNGFPDRALGVFRQMVLYGIQLCGISMMPVFGACSLLPSLRLGREAHAYALKHLL 633

Query: 508 DLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTK 567
           + + F+  +LIDMY K G I  + KVF  +KE   ASWN++I GYG+ G    A+  + +
Sbjct: 634 EDDAFIACSLIDMYAKNGSITQSSKVFNGLKEKSTASWNAMIMGYGIHGLAKEAIKLFEE 693

Query: 568 MMEKGIKPNKITFSGILAACTHGGLVENGRTYFKTMKKEFGIVPESQHCASMVGLLGRAG 627
           M   G  P+ +TF G+L AC H GL+  G  Y   MK  FG+ P  +H A ++ +LGRAG
Sbjct: 694 MQRTGHNPDDLTFLGVLTACNHSGLIHEGLRYLDQMKSSFGLKPNLKHYACVIDMLGRAG 753

Query: 628 LFEEAIEFI-KNMDINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMS 687
             ++A+  + + M    D  +W +LLS+C IHQ +++GE VA KL          +VL+S
Sbjct: 754 QLDKALRVVAEEMSEEADVGIWKSLLSSCRIHQNLEMGEKVAAKLFELEPEKPENYVLLS 813

Query: 688 NLYAASGRWNDVARIRKMMREMG---EDGCSVVKKSKTMIWPFRVEEE-----GELKNRY 742
           NLYA  G+W DV ++R+ M EM    + GCS ++ ++  ++ F V E       E+K+ +
Sbjct: 814 NLYAGLGKWEDVRKVRQRMNEMSLRKDAGCSWIELNR-KVFSFVVGERFLDGFEEIKSLW 873

BLAST of Cla97C01G020740 vs. ExPASy Swiss-Prot
Match: Q9M1V3 (Pentatricopeptide repeat-containing protein At3g63370, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H83 PE=2 SV=2)

HSP 1 Score: 359.0 bits (920), Expect = 1.3e-97
Identity = 210/654 (32.11%), Positives = 348/654 (53.21%), Query Frame = 0

Query: 81  NQLQTHFIKWGFNQFLYVSTAFLDLYSKLGFVKAARRLFDEFSEK-DVVSWNALISGYSR 140
           ++L +  +K G++   ++  A + +Y+K   + AARRLFD F EK D V WN+++S YS 
Sbjct: 202 SELHSLLVKLGYHSTGFIVNALVSMYAKNDDLSAARRLFDGFQEKGDAVLWNSILSSYST 261

Query: 141 SGYSHDAFELFVEMCRRGFDPSQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGL-DLDSQ 200
           SG S +  ELF EM   G  P+  T+VS + +C        GK IHA  +K+     +  
Sbjct: 262 SGKSLETLELFREMHMTGPAPNSYTIVSALTACDGFSYAKLGKEIHASVLKSSTHSSELY 321

Query: 201 VKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFLEAMLVLKQMLEER 260
           V NAL +MY +C  +   E +  ++   +VV+WN++I  + QN  + EA+     M+   
Sbjct: 322 VCNALIAMYTRCGKMPQAERILRQMNNADVVTWNSLIKGYVQNLMYKEALEFFSDMIAAG 381

Query: 261 FNANSVTMVSILSANANPGS------IHCYATKTGLVENVSVVTSLVCSYVRCGCIQMAE 320
             ++ V+M SI++A+    +      +H Y  K G   N+ V  +L+  Y +C       
Sbjct: 382 HKSDEVSMTSIIAASGRLSNLLAGMELHAYVIKHGWDSNLQVGNTLIDMYSKCNLTCYMG 441

Query: 321 LIYMSKLKRNLVALTAIISSYAEKGDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGLTYPH 380
             ++    ++L++ T +I+ YA+       ++L+  V    M++D + +  I++  +   
Sbjct: 442 RAFLRMHDKDLISWTTVIAGYAQNDCHVEALELFRDVAKKRMEIDEMILGSILRASSVLK 501

Query: 381 HVGIGLAFHGYGLKSGLIIDCLVANGFIRMYSKFDNIDAVFSLFQEMHEKTQSSWNSVIS 440
            + I    H + L+ GL +D ++ N  + +Y K  N+     +F+ +  K   SW S+IS
Sbjct: 502 SMLIVKEIHCHILRKGL-LDTVIQNELVDVYGKCRNMGYATRVFESIKGKDVVSWTSMIS 561

Query: 441 SCTQAGRSIDAMALFSQMMLSGYGPDSITLASLLSACCQNGNLHSGEILHCYILRNNLDL 500
           S    G   +A+ LF +M+ +G   DS+ L  +LSA      L+ G  +HCY+LR    L
Sbjct: 562 SSALNGNESEAVELFRRMVETGLSADSVALLCILSAAASLSALNKGREIHCYLLRKGFCL 621

Query: 501 EGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMM 560
           EG +  A++DMY  CG +  A+ VF  ++   L  + S+I+ YG+ G    A+  + KM 
Sbjct: 622 EGSIAVAVVDMYACCGDLQSAKAVFDRIERKGLLQYTSMINAYGMHGCGKAAVELFDKMR 681

Query: 561 EKGIKPNKITFSGILAACTHGGLVENGRTYFKTMKKEFGIVPESQHCASMVGLLGRAGLF 620
            + + P+ I+F  +L AC+H GL++ GR + K M+ E+ + P  +H   +V +LGRA   
Sbjct: 682 HENVSPDHISFLALLYACSHAGLLDEGRGFLKIMEHEYELEPWPEHYVCLVDMLGRANCV 741

Query: 621 EEAIEFIKNMDINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLY 680
            EA EF+K M   P + VW ALL+AC  H E ++GE  A++LL    +N G  VL+SN++
Sbjct: 742 VEAFEFVKMMKTEPTAEVWCALLAACRSHSEKEIGEIAAQRLLELEPKNPGNLVLVSNVF 801

Query: 681 AASGRWNDVARIRKMMREMGED---GCSVVK-KSKTMIWPFRVEEEGELKNRYE 723
           A  GRWNDV ++R  M+  G +   GCS ++   K   +  R +   E K  YE
Sbjct: 802 AEQGRWNDVEKVRAKMKASGMEKHPGCSWIEMDGKVHKFTARDKSHPESKEIYE 854

BLAST of Cla97C01G020740 vs. ExPASy TrEMBL
Match: A0A6J1GEF9 (pentatricopeptide repeat-containing protein At2g04860 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111453241 PE=4 SV=1)

HSP 1 Score: 1266.1 bits (3275), Expect = 0.0e+00
Identity = 635/701 (90.58%), Positives = 663/701 (94.58%), Query Frame = 0

Query: 1   MQFTSSVGNLA--TLSTFHSAFKSYVEGKNFTPPLLIFRQLLRYRVKPNDSIFSLLIKAF 60
           MQFTSSVG+LA  +LSTFHSAFKSYVEGK  TPPLL+FRQLLR RVKPNDS FSLLIKAF
Sbjct: 1   MQFTSSVGHLASLSLSTFHSAFKSYVEGKISTPPLLVFRQLLRCRVKPNDSTFSLLIKAF 60

Query: 61  VVSSPSSSFASSSCSENAKAEANQLQTHFIKWGFNQFLYVSTAFLDLYSKLGFVKAARRL 120
           VVSS SSSFA  SCSENAKAEANQLQ HFIKWGF+QFLYVSTAFLDLYSKLGFVKAARRL
Sbjct: 61  VVSSSSSSFAPLSCSENAKAEANQLQIHFIKWGFDQFLYVSTAFLDLYSKLGFVKAARRL 120

Query: 121 FDEFSEKDVVSWNALISGYSRSGYSHDAFELFVEMCRRGFDPSQRTLVSLIPSCGTQQLF 180
           FD+  EKDVVSWNALISGYSRSGY+HDAFELFVEM RRGF+P QRTLVSLIPSCGTQ LF
Sbjct: 121 FDDIPEKDVVSWNALISGYSRSGYNHDAFELFVEMRRRGFNPCQRTLVSLIPSCGTQHLF 180

Query: 181 VQGKCIHALGVKAGLDLDSQVKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAF 240
            QGKCIHALGVKAGLDLDSQVKN+LASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAF
Sbjct: 181 AQGKCIHALGVKAGLDLDSQVKNSLASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAF 240

Query: 241 GQNGFFLEAMLVLKQMLEERFNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL 300
           GQNGFF+EAMLV KQMLEER + NSVTMVSILSANANP SIHCYATKTGLVENVSVVTSL
Sbjct: 241 GQNGFFVEAMLVFKQMLEERIDTNSVTMVSILSANANPRSIHCYATKTGLVENVSVVTSL 300

Query: 301 VCSYVRCGCIQMAELIYMSKLKRNLVALTAIISSYAEKGDMGSVVKLYSRVQHLDMKLDA 360
           +CSYVRCGCIQ+AELIYMSKL++NLVALTAIIS YAEKGDMGSVVKLYSRVQHL+MKLDA
Sbjct: 301 ICSYVRCGCIQIAELIYMSKLQKNLVALTAIISGYAEKGDMGSVVKLYSRVQHLEMKLDA 360

Query: 361 VAMVGIIQGLTYPHHVGIGLAFHGYGLKSGLIIDCLVANGFIRMYSKFDNIDAVFSLFQE 420
           VAMVGIIQG+TYP H GIGLAFHGYGLKSGLIIDCLVANGFI MYS+FD+IDAVFSLFQE
Sbjct: 361 VAMVGIIQGITYPDHSGIGLAFHGYGLKSGLIIDCLVANGFISMYSRFDDIDAVFSLFQE 420

Query: 421 MHEKTQSSWNSVISSCTQAGRSIDAMALFSQMMLSGYGPDSITLASLLSACCQNGNLHSG 480
           M EKT SSWNSVISSC QAGRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLH G
Sbjct: 421 MREKTLSSWNSVISSCAQAGRSIDAMALFSQMKLSGYGPDSITLASLLSACCQNGNLHFG 480

Query: 481 EILHCYILRNNLDLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLF 540
           EI+H YILRNNLDLEGFVGTALIDMYVKCGR+D AEKVFKSMKEPCLASWNSLISGYGLF
Sbjct: 481 EIIHSYILRNNLDLEGFVGTALIDMYVKCGRMDFAEKVFKSMKEPCLASWNSLISGYGLF 540

Query: 541 GFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVENGRTYFKTMKKEFGIVPESQH 600
           GF+NHA LCYTKM+EKGIKPNKITFSGILAACTHGGLVE GRTYF+ MKKE GIVPESQH
Sbjct: 541 GFNNHAFLCYTKMLEKGIKPNKITFSGILAACTHGGLVEEGRTYFEIMKKELGIVPESQH 600

Query: 601 CASMVGLLGRAGLFEEAIEFIKNMDINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSN 660
           CASMVGLLGRAGLFEEAI FIKNM++NPDSAVWGALL+ACCIHQEVKLGESVAK+LLFSN
Sbjct: 601 CASMVGLLGRAGLFEEAILFIKNMEVNPDSAVWGALLNACCIHQEVKLGESVAKRLLFSN 660

Query: 661 CRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSVV 700
            RNGGFFVLMSNLYAASGRWNDVAR+RKMMREMGEDGCS V
Sbjct: 661 SRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSGV 701

BLAST of Cla97C01G020740 vs. ExPASy TrEMBL
Match: A0A6J1IKP6 (pentatricopeptide repeat-containing protein At2g04860 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111477926 PE=4 SV=1)

HSP 1 Score: 1264.6 bits (3271), Expect = 0.0e+00
Identity = 631/701 (90.01%), Positives = 662/701 (94.44%), Query Frame = 0

Query: 1   MQFTSSVGNLA--TLSTFHSAFKSYVEGKNFTPPLLIFRQLLRYRVKPNDSIFSLLIKAF 60
           MQFTSSVG+LA  +LSTFHSAFKSYVEGK  TPPLL+FRQLLR RVKPNDS FSLLIKAF
Sbjct: 1   MQFTSSVGHLASLSLSTFHSAFKSYVEGKISTPPLLVFRQLLRCRVKPNDSTFSLLIKAF 60

Query: 61  VVSSPSSSFASSSCSENAKAEANQLQTHFIKWGFNQFLYVSTAFLDLYSKLGFVKAARRL 120
           VVSS SSSFA SSCSENA+AEANQLQTHFIKWGF+QFLYVSTAFLDLYSKLGFVKAARRL
Sbjct: 61  VVSSSSSSFAPSSCSENAEAEANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGFVKAARRL 120

Query: 121 FDEFSEKDVVSWNALISGYSRSGYSHDAFELFVEMCRRGFDPSQRTLVSLIPSCGTQQLF 180
           FD+  EKDVVSWNALISGYSRSG++HD FELFVEM RRGF+P QRTLVSLIPSCGTQ LF
Sbjct: 121 FDDIPEKDVVSWNALISGYSRSGFNHDTFELFVEMRRRGFNPCQRTLVSLIPSCGTQHLF 180

Query: 181 VQGKCIHALGVKAGLDLDSQVKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAF 240
           VQGKCIHALGVKAGLDLDSQVKN+LASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAF
Sbjct: 181 VQGKCIHALGVKAGLDLDSQVKNSLASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAF 240

Query: 241 GQNGFFLEAMLVLKQMLEERFNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL 300
           GQNGFF+EAMLV KQMLEE  N +SVTMVSILSANANP SIHCYATKTGL+ENVSVVTSL
Sbjct: 241 GQNGFFVEAMLVFKQMLEESINTSSVTMVSILSANANPRSIHCYATKTGLMENVSVVTSL 300

Query: 301 VCSYVRCGCIQMAELIYMSKLKRNLVALTAIISSYAEKGDMGSVVKLYSRVQHLDMKLDA 360
           +CSYV+CGCI +AE IYMSKL++NLVALTAIIS YAEKGDMG+VVKLYSRVQHL+MKLDA
Sbjct: 301 ICSYVKCGCIHIAEQIYMSKLQKNLVALTAIISGYAEKGDMGAVVKLYSRVQHLEMKLDA 360

Query: 361 VAMVGIIQGLTYPHHVGIGLAFHGYGLKSGLIIDCLVANGFIRMYSKFDNIDAVFSLFQE 420
           VAMVGIIQG+TYP H GIGL+FHGYGLKSGLIIDCLVANGFI MYS+FD+IDAVFSLFQE
Sbjct: 361 VAMVGIIQGITYPDHSGIGLSFHGYGLKSGLIIDCLVANGFISMYSRFDDIDAVFSLFQE 420

Query: 421 MHEKTQSSWNSVISSCTQAGRSIDAMALFSQMMLSGYGPDSITLASLLSACCQNGNLHSG 480
           MHEKT SSWNSVISSC QAGRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLH G
Sbjct: 421 MHEKTLSSWNSVISSCAQAGRSIDAMALFSQMKLSGYGPDSITLASLLSACCQNGNLHFG 480

Query: 481 EILHCYILRNNLDLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLF 540
           EILH YILRNNLDLEGFVGTALIDMYVKCGR+D AEKVFKSMKEPCLASWNS+ISGYGLF
Sbjct: 481 EILHSYILRNNLDLEGFVGTALIDMYVKCGRMDFAEKVFKSMKEPCLASWNSMISGYGLF 540

Query: 541 GFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVENGRTYFKTMKKEFGIVPESQH 600
           GFDNH  LCYTKMMEKGIKPNKITFSGILAACTHGGLVE GRTYF+ MKKE GIVPESQH
Sbjct: 541 GFDNHVFLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRTYFEIMKKELGIVPESQH 600

Query: 601 CASMVGLLGRAGLFEEAIEFIKNMDINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSN 660
           CASMVGLLGRAGLFEEAI FIKNM++NPDSAVWGA LSACCIHQEVKLGESVAKKLLFSN
Sbjct: 601 CASMVGLLGRAGLFEEAILFIKNMEVNPDSAVWGAFLSACCIHQEVKLGESVAKKLLFSN 660

Query: 661 CRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSVV 700
           CRNGGFFVLMSNLYAASGRWNDVA++RKMMREMGEDGCS V
Sbjct: 661 CRNGGFFVLMSNLYAASGRWNDVAKVRKMMREMGEDGCSGV 701

BLAST of Cla97C01G020740 vs. ExPASy TrEMBL
Match: A0A0A0KMV8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G510930 PE=4 SV=1)

HSP 1 Score: 1239.2 bits (3205), Expect = 0.0e+00
Identity = 623/701 (88.87%), Positives = 652/701 (93.01%), Query Frame = 0

Query: 1   MQFTSSVGNLATLS--TFHSAFKSYVEGKNFTPPLLIFRQLLRYRVKPNDSIFSLLIKAF 60
           MQFTSSVG+ ATLS  TFHSAFK YVEGK FTPPLL+FR+LLR+RVKPNDS FSLLIKAF
Sbjct: 1   MQFTSSVGHPATLSLTTFHSAFKFYVEGKCFTPPLLLFRELLRHRVKPNDSTFSLLIKAF 60

Query: 61  VVSSPSSSFASSSCSENAKAEANQLQTHFIKWGFNQFLYVSTAFLDLYSKLGFVKAARRL 120
           VVSS +SSFA S CSEN KAEANQLQTHFIKWGF+QFLYVSTAFLDLYSKLGFVKAA+RL
Sbjct: 61  VVSSSTSSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGFVKAAQRL 120

Query: 121 FDEFSEKDVVSWNALISGYSRSGYSHDAFELFVEMCRRGFDPSQRTLVSLIPSCGTQQLF 180
           FD+F EKDVVSWNALISGY+R G SHDAF+LFVEM RR FDP QRTLVSL+PSCGTQQLF
Sbjct: 121 FDDFPEKDVVSWNALISGYTRCGNSHDAFKLFVEMRRREFDPCQRTLVSLMPSCGTQQLF 180

Query: 181 VQGKCIHALGVKAGLDLDSQVKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAF 240
           VQGK IH LGVKAGLDLDSQVKNAL SMYGKCADL+GV+LLFGEI EK+VVSWNTMIGAF
Sbjct: 181 VQGKSIHGLGVKAGLDLDSQVKNALVSMYGKCADLDGVKLLFGEITEKSVVSWNTMIGAF 240

Query: 241 GQNGFFLEAMLVLKQMLEERFNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL 300
           GQNG F EAMLV KQMLEE  NANSVTMVSILSANAN G IHCYATK GLVENVSVVTSL
Sbjct: 241 GQNGLFSEAMLVFKQMLEESVNANSVTMVSILSANANTGCIHCYATKIGLVENVSVVTSL 300

Query: 301 VCSYVRCGCIQMAELIYMSKLKRNLVALTAIISSYAEKGDMGSVVKLYSRVQHLDMKLDA 360
           VCSYV+CG I++AELIYMSKLK+NLVALTAIIS YAEKGDMGSVV+LYS VQHLDMKLDA
Sbjct: 301 VCSYVKCGYIELAELIYMSKLKKNLVALTAIISHYAEKGDMGSVVRLYSIVQHLDMKLDA 360

Query: 361 VAMVGIIQGLTYPHHVGIGLAFHGYGLKSGLIIDCLVANGFIRMYSKFDNIDAVFSLFQE 420
           VAMVGIIQG TYP H+GIGLAFHGYG+KSGLIIDCLVANGFI MYSKFDNIDAVFSLFQE
Sbjct: 361 VAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVANGFISMYSKFDNIDAVFSLFQE 420

Query: 421 MHEKTQSSWNSVISSCTQAGRSIDAMALFSQMMLSGYGPDSITLASLLSACCQNGNLHSG 480
           MH+KT SSWNSVISSC QAGRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLH G
Sbjct: 421 MHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFG 480

Query: 481 EILHCYILRNNLDLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLF 540
           EILHCYILRNNLDLEGFVGTAL+DMYVKCGR+D AE VFKSMKEPCLASWNSLISGYGLF
Sbjct: 481 EILHCYILRNNLDLEGFVGTALVDMYVKCGRMDFAENVFKSMKEPCLASWNSLISGYGLF 540

Query: 541 GFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVENGRTYFKTMKKEFGIVPESQH 600
           GF NHALLCYT+MMEKGIKPNKITFSGILAACTHGGLVE GR YFK MKK+FGIVPESQH
Sbjct: 541 GFHNHALLCYTEMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKKFGIVPESQH 600

Query: 601 CASMVGLLGRAGLFEEAIEFIKNMDINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSN 660
           CASMVG+LGRAGLFEEAI FI+NM+ NPDSAVWGALLSACCIHQEVKLGESVAKKL FSN
Sbjct: 601 CASMVGMLGRAGLFEEAIVFIQNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSN 660

Query: 661 CRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSVV 700
           CRNGGFFVLMSNLYAAS RWNDVARIRKMMREMGEDGCS V
Sbjct: 661 CRNGGFFVLMSNLYAASRRWNDVARIRKMMREMGEDGCSGV 701

BLAST of Cla97C01G020740 vs. ExPASy TrEMBL
Match: A0A5D3CM04 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G00360 PE=4 SV=1)

HSP 1 Score: 1235.3 bits (3195), Expect = 0.0e+00
Identity = 622/701 (88.73%), Positives = 650/701 (92.72%), Query Frame = 0

Query: 1   MQFTSSVGNLATLS--TFHSAFKSYVEGKNFTPPLLIFRQLLRYRVKPNDSIFSLLIKAF 60
           MQFT SVG+ ATLS  TFHSAFK YVEGKNFTPPLL+FRQLLR++V+PNDS FSLLIKAF
Sbjct: 1   MQFTPSVGHPATLSLTTFHSAFKFYVEGKNFTPPLLLFRQLLRHQVRPNDSTFSLLIKAF 60

Query: 61  VVSSPSSSFASSSCSENAKAEANQLQTHFIKWGFNQFLYVSTAFLDLYSKLGFVKAARRL 120
           VVSS      SS CSEN KAEANQLQTHFIKWGF+QFLYVSTAFLDLYS+LGFVKAARRL
Sbjct: 61  VVSS------SSFCSENEKAEANQLQTHFIKWGFDQFLYVSTAFLDLYSRLGFVKAARRL 120

Query: 121 FDEFSEKDVVSWNALISGYSRSGYSHDAFELFVEMCRRGFDPSQRTLVSLIPSCGTQQLF 180
           FD+F EKDVVSWNALISGY+R GYSHDAF+LFVEM RRGFDP QRTLVSL+PSCGTQ+LF
Sbjct: 121 FDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQELF 180

Query: 181 VQGKCIHALGVKAGLDLDSQVKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAF 240
           VQGK IH LGVKAGLDLDSQVKN L SMYGKCADLEGV+LLFGEI EKNVVSWNTMIGAF
Sbjct: 181 VQGKSIHGLGVKAGLDLDSQVKNTLVSMYGKCADLEGVKLLFGEIGEKNVVSWNTMIGAF 240

Query: 241 GQNGFFLEAMLVLKQMLEERFNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL 300
           GQNGFFLEAMLV KQMLEE  +ANSVTMVSILSANAN G IHCYATK GLVENVSVVTSL
Sbjct: 241 GQNGFFLEAMLVFKQMLEESVSANSVTMVSILSANANAGCIHCYATKIGLVENVSVVTSL 300

Query: 301 VCSYVRCGCIQMAELIYMSKLKRNLVALTAIISSYAEKGDMGSVVKLYSRVQHLDMKLDA 360
           VCSYV+CG I++AELIYMSKL++NLVALTAIIS YAEKGDMGSVV+LYS VQHLDMKLDA
Sbjct: 301 VCSYVKCGYIEIAELIYMSKLQKNLVALTAIISRYAEKGDMGSVVRLYSLVQHLDMKLDA 360

Query: 361 VAMVGIIQGLTYPHHVGIGLAFHGYGLKSGLIIDCLVANGFIRMYSKFDNIDAVFSLFQE 420
           VAMVGIIQG TYP H GIGLAFHGYG+KSGLIIDCLVANGFI MYSKFDNIDAVFSLFQE
Sbjct: 361 VAMVGIIQGFTYPDHTGIGLAFHGYGVKSGLIIDCLVANGFISMYSKFDNIDAVFSLFQE 420

Query: 421 MHEKTQSSWNSVISSCTQAGRSIDAMALFSQMMLSGYGPDSITLASLLSACCQNGNLHSG 480
           MH+KT SSWNSVISS  QAGRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLH G
Sbjct: 421 MHKKTLSSWNSVISSSAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFG 480

Query: 481 EILHCYILRNNLDLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLF 540
           EILHCYILRN++DLEGFVGTAL+DMYVKCGRID AE VFKSMKEPCLASWNSLISGYGLF
Sbjct: 481 EILHCYILRNSVDLEGFVGTALVDMYVKCGRIDFAENVFKSMKEPCLASWNSLISGYGLF 540

Query: 541 GFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVENGRTYFKTMKKEFGIVPESQH 600
           GF N ALLCYTKMMEKGIKPNKITFSGILAACTHGGLVE GR YFKTMKKEFGIVPESQH
Sbjct: 541 GFHNRALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKTMKKEFGIVPESQH 600

Query: 601 CASMVGLLGRAGLFEEAIEFIKNMDINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSN 660
           CASMVGLLGRAGLFEEAI FIKNM+ NPDSAVWGALLSACCIHQE+KLGESVAKKL FSN
Sbjct: 601 CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEIKLGESVAKKLFFSN 660

Query: 661 CRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSVV 700
           CRNGGFFVLMSNLYAASGRWNDVA+IRKMMREMGEDGCS V
Sbjct: 661 CRNGGFFVLMSNLYAASGRWNDVAKIRKMMREMGEDGCSGV 695

BLAST of Cla97C01G020740 vs. ExPASy TrEMBL
Match: A0A1S4DTH7 (pentatricopeptide repeat-containing protein At2g04860 OS=Cucumis melo OX=3656 GN=LOC107990461 PE=4 SV=1)

HSP 1 Score: 1235.3 bits (3195), Expect = 0.0e+00
Identity = 622/701 (88.73%), Positives = 650/701 (92.72%), Query Frame = 0

Query: 1   MQFTSSVGNLATLS--TFHSAFKSYVEGKNFTPPLLIFRQLLRYRVKPNDSIFSLLIKAF 60
           MQFT SVG+ ATLS  TFHSAFK YVEGKNFTPPLL+FRQLLR++V+PNDS FSLLIKAF
Sbjct: 1   MQFTPSVGHPATLSLTTFHSAFKFYVEGKNFTPPLLLFRQLLRHQVRPNDSTFSLLIKAF 60

Query: 61  VVSSPSSSFASSSCSENAKAEANQLQTHFIKWGFNQFLYVSTAFLDLYSKLGFVKAARRL 120
           VVSS      SS CSEN KAEANQLQTHFIKWGF+QFLYVSTAFLDLYS+LGFVKAARRL
Sbjct: 61  VVSS------SSFCSENEKAEANQLQTHFIKWGFDQFLYVSTAFLDLYSRLGFVKAARRL 120

Query: 121 FDEFSEKDVVSWNALISGYSRSGYSHDAFELFVEMCRRGFDPSQRTLVSLIPSCGTQQLF 180
           FD+F EKDVVSWNALISGY+R GYSHDAF+LFVEM RRGFDP QRTLVSL+PSCGTQ+LF
Sbjct: 121 FDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQELF 180

Query: 181 VQGKCIHALGVKAGLDLDSQVKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAF 240
           VQGK IH LGVKAGLDLDSQVKN L SMYGKCADLEGV+LLFGEI EKNVVSWNTMIGAF
Sbjct: 181 VQGKSIHGLGVKAGLDLDSQVKNTLVSMYGKCADLEGVKLLFGEIGEKNVVSWNTMIGAF 240

Query: 241 GQNGFFLEAMLVLKQMLEERFNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL 300
           GQNGFFLEAMLV KQMLEE  +ANSVTMVSILSANAN G IHCYATK GLVENVSVVTSL
Sbjct: 241 GQNGFFLEAMLVFKQMLEESVSANSVTMVSILSANANAGCIHCYATKIGLVENVSVVTSL 300

Query: 301 VCSYVRCGCIQMAELIYMSKLKRNLVALTAIISSYAEKGDMGSVVKLYSRVQHLDMKLDA 360
           VCSYV+CG I++AELIYMSKL++NLVALTAIIS YAEKGDMGSVV+LYS VQHLDMKLDA
Sbjct: 301 VCSYVKCGYIEIAELIYMSKLQKNLVALTAIISRYAEKGDMGSVVRLYSLVQHLDMKLDA 360

Query: 361 VAMVGIIQGLTYPHHVGIGLAFHGYGLKSGLIIDCLVANGFIRMYSKFDNIDAVFSLFQE 420
           VAMVGIIQG TYP H GIGLAFHGYG+KSGLIIDCLVANGFI MYSKFDNIDAVFSLFQE
Sbjct: 361 VAMVGIIQGFTYPDHTGIGLAFHGYGVKSGLIIDCLVANGFISMYSKFDNIDAVFSLFQE 420

Query: 421 MHEKTQSSWNSVISSCTQAGRSIDAMALFSQMMLSGYGPDSITLASLLSACCQNGNLHSG 480
           MH+KT SSWNSVISS  QAGRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLH G
Sbjct: 421 MHKKTLSSWNSVISSSAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFG 480

Query: 481 EILHCYILRNNLDLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLF 540
           EILHCYILRN++DLEGFVGTAL+DMYVKCGRID AE VFKSMKEPCLASWNSLISGYGLF
Sbjct: 481 EILHCYILRNSVDLEGFVGTALVDMYVKCGRIDFAENVFKSMKEPCLASWNSLISGYGLF 540

Query: 541 GFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVENGRTYFKTMKKEFGIVPESQH 600
           GF N ALLCYTKMMEKGIKPNKITFSGILAACTHGGLVE GR YFKTMKKEFGIVPESQH
Sbjct: 541 GFHNRALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKTMKKEFGIVPESQH 600

Query: 601 CASMVGLLGRAGLFEEAIEFIKNMDINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSN 660
           CASMVGLLGRAGLFEEAI FIKNM+ NPDSAVWGALLSACCIHQE+KLGESVAKKL FSN
Sbjct: 601 CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEIKLGESVAKKLFFSN 660

Query: 661 CRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSVV 700
           CRNGGFFVLMSNLYAASGRWNDVA+IRKMMREMGEDGCS V
Sbjct: 661 CRNGGFFVLMSNLYAASGRWNDVAKIRKMMREMGEDGCSGV 695

BLAST of Cla97C01G020740 vs. TAIR 10
Match: AT2G04860.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 696.8 bits (1797), Expect = 1.9e-200
Identity = 356/696 (51.15%), Positives = 472/696 (67.82%), Query Frame = 0

Query: 1   MQFTSSVGNLATLSTFHSAFKSYVEGKNFTPPLLIFRQLLRYRVKPNDSIFSLLIKAFVV 60
           M+ T  +     LS FHS  KS + G+  + P+ IFR LLR  + PN    S+ ++A   
Sbjct: 1   MRITKPITLYRDLSYFHSLLKSCIHGEISSSPITIFRDLLRSSLTPNHFTMSIFLQA--- 60

Query: 61  SSPSSSFASSSCSENAKAEANQLQTHFIKWGFNQFLYVSTAFLDLYSKLGFVKAARRLFD 120
              ++SF S       K +  Q+QTH  K G ++F+YV T+ L+LY K G V +A+ LFD
Sbjct: 61  --TTTSFNS------FKLQVEQVQTHLTKSGLDRFVYVKTSLLNLYLKKGCVTSAQMLFD 120

Query: 121 EFSEKDVVSWNALISGYSRSGYSHDAFELFVEMCRRGFDPSQRTLVSLIPSCGTQQLFVQ 180
           E  E+D V WNALI GYSR+GY  DA++LF+ M ++GF PS  TLV+L+P CG      Q
Sbjct: 121 EMPERDTVVWNALICGYSRNGYECDAWKLFIVMLQQGFSPSATTLVNLLPFCGQCGFVSQ 180

Query: 181 GKCIHALGVKAGLDLDSQVKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQ 240
           G+ +H +  K+GL+LDSQVKNAL S Y KCA+L   E+LF E+ +K+ VSWNTMIGA+ Q
Sbjct: 181 GRSVHGVAAKSGLELDSQVKNALISFYSKCAELGSAEVLFREMKDKSTVSWNTMIGAYSQ 240

Query: 241 NGFFLEAMLVLKQMLEERFNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSLVC 300
           +G   EA+ V K M E+    + VT++++LSA+ +   +HC   K G+V ++SVVTSLVC
Sbjct: 241 SGLQEEAITVFKNMFEKNVEISPVTIINLLSAHVSHEPLHCLVVKCGMVNDISVVTSLVC 300

Query: 301 SYVRCGCIQMAELIYMSKLKRNLVALTAIISSYAEKGDMGSVVKLYSRVQHLDMKLDAVA 360
           +Y RCGC+  AE +Y S  + ++V LT+I+S YAEKGDM   V  +S+ + L MK+DAVA
Sbjct: 301 AYSRCGCLVSAERLYASAKQDSIVGLTSIVSCYAEKGDMDIAVVYFSKTRQLCMKIDAVA 360

Query: 361 MVGIIQGLTYPHHVGIGLAFHGYGLKSGLIIDCLVANGFIRMYSKFDNIDAVFSLFQEMH 420
           +VGI+ G     H+ IG++ HGY +KSGL    LV NG I MYSKFD+++ V  LF+++ 
Sbjct: 361 LVGILHGCKKSSHIDIGMSLHGYAIKSGLCTKTLVVNGLITMYSKFDDVETVLFLFEQLQ 420

Query: 421 EKTQSSWNSVISSCTQAGRSIDAMALFSQMMLS-GYGPDSITLASLLSACCQNGNLHSGE 480
           E    SWNSVIS C Q+GR+  A  +F QMML+ G  PD+IT+ASLL+ C Q   L+ G+
Sbjct: 421 ETPLISWNSVISGCVQSGRASTAFEVFHQMMLTGGLLPDAITIASLLAGCSQLCCLNLGK 480

Query: 481 ILHCYILRNNLDLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFG 540
            LH Y LRNN + E FV TALIDMY KCG    AE VFKS+K PC A+WNS+ISGY L G
Sbjct: 481 ELHGYTLRNNFENENFVCTALIDMYAKCGNEVQAESVFKSIKAPCTATWNSMISGYSLSG 540

Query: 541 FDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVENGRTYFKTMKKEFGIVPESQHC 600
             + AL CY +M EKG+KP++ITF G+L+AC HGG V+ G+  F+ M KEFGI P  QH 
Sbjct: 541 LQHRALSCYLEMREKGLKPDEITFLGVLSACNHGGFVDEGKICFRAMIKEFGISPTLQHY 600

Query: 601 ASMVGLLGRAGLFEEAIEFIKNMDINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNC 660
           A MVGLLGRA LF EA+  I  MDI PDSAVWGALLSAC IH+E+++GE VA+K+   + 
Sbjct: 601 ALMVGLLGRACLFTEALYLIWKMDIKPDSAVWGALLSACIIHRELEVGEYVARKMFMLDY 660

Query: 661 RNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDG 696
           +NGG +VLMSNLYA    W+DV R+R MM++ G DG
Sbjct: 661 KNGGLYVLMSNLYATEAMWDDVVRVRNMMKDNGYDG 685

BLAST of Cla97C01G020740 vs. TAIR 10
Match: AT1G15510.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 367.9 bits (943), Expect = 2.1e-101
Identity = 208/646 (32.20%), Positives = 349/646 (54.02%), Query Frame = 0

Query: 98  VSTAFLDLYSKLGFVKAARRLFDEFSEKDVVSWNALISGYSRSGYSHDAFELFVEMC-RR 157
           +  AFL ++ + G +  A  +F + SE+++ SWN L+ GY++ GY  +A  L+  M    
Sbjct: 131 LGNAFLAMFVRFGNLVDAWYVFGKMSERNLFSWNVLVGGYAKQGYFDEAMCLYHRMLWVG 190

Query: 158 GFDPSQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQVKNALASMYGKCADLEGV 217
           G  P   T   ++ +CG      +GK +H   V+ G +LD  V NAL +MY KC D++  
Sbjct: 191 GVKPDVYTFPCVLRTCGGIPDLARGKEVHVHVVRYGYELDIDVVNALITMYVKCGDVKSA 250

Query: 218 ELLFGEIIEKNVVSWNTMIGAFGQNGFFLEAMLVLKQMLEERFNANSVTMVSILSANANP 277
            LLF  +  ++++SWN MI  + +NG   E + +   M     + + +T+ S++SA    
Sbjct: 251 RLLFDRMPRRDIISWNAMISGYFENGMCHEGLELFFAMRGLSVDPDLMTLTSVISACELL 310

Query: 278 G------SIHCYATKTGLVENVSVVTSLVCSYVRCGCIQMAELIYMSKLKRNLVALTAII 337
           G       IH Y   TG   ++SV  SL   Y+  G  + AE ++    ++++V+ T +I
Sbjct: 311 GDRRLGRDIHAYVITTGFAVDISVCNSLTQMYLNAGSWREAEKLFSRMERKDIVSWTTMI 370

Query: 338 SSYAEKGDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGLTYPHHVGIGLAFHGYGLKSGLI 397
           S Y         +  Y  +    +K D + +  ++        +  G+  H   +K+ LI
Sbjct: 371 SGYEYNFLPDKAIDTYRMMDQDSVKPDEITVAAVLSACATLGDLDTGVELHKLAIKARLI 430

Query: 398 IDCLVANGFIRMYSKFDNIDAVFSLFQEMHEKTQSSWNSVISSCTQAGRSIDAMALFSQM 457
              +VAN  I MYSK   ID    +F  +  K   SW S+I+      R  +A+    QM
Sbjct: 431 SYVIVANNLINMYSKCKCIDKALDIFHNIPRKNVISWTSIIAGLRLNNRCFEALIFLRQM 490

Query: 458 MLSGYGPDSITLASLLSACCQNGNLHSGEILHCYILRNNLDLEGFVGTALIDMYVKCGRI 517
            ++   P++ITL + L+AC + G L  G+ +H ++LR  + L+ F+  AL+DMYV+CGR+
Sbjct: 491 KMT-LQPNAITLTAALAACARIGALMCGKEIHAHVLRTGVGLDDFLPNALLDMYVRCGRM 550

Query: 518 DLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAAC 577
           + A   F S K+  + SWN L++GY   G  +  +  + +M++  ++P++ITF  +L  C
Sbjct: 551 NTAWSQFNSQKKD-VTSWNILLTGYSERGQGSMVVELFDRMVKSRVRPDEITFISLLCGC 610

Query: 578 THGGLVENGRTYFKTMKKEFGIVPESQHCASMVGLLGRAGLFEEAIEFIKNMDINPDSAV 637
           +   +V  G  YF  M +++G+ P  +H A +V LLGRAG  +EA +FI+ M + PD AV
Sbjct: 611 SKSQMVRQGLMYFSKM-EDYGVTPNLKHYACVVDLLGRAGELQEAHKFIQKMPVTPDPAV 670

Query: 638 WGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMRE 697
           WGALL+AC IH ++ LGE  A+ +   + ++ G+++L+ NLYA  G+W +VA++R+MM+E
Sbjct: 671 WGALLNACRIHHKIDLGELSAQHIFELDKKSVGYYILLCNLYADCGKWREVAKVRRMMKE 730

Query: 698 MG---EDGCSVVKKSKTMIWPFRVEEEGELKNRYENTEYSPDPREL 734
            G   + GCS         W   VE +G++     + +Y P  +E+
Sbjct: 731 NGLTVDAGCS---------W---VEVKGKVHAFLSDDKYHPQTKEI 761

BLAST of Cla97C01G020740 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 366.7 bits (940), Expect = 4.6e-101
Identity = 207/631 (32.81%), Positives = 329/631 (52.14%), Query Frame = 0

Query: 79  EANQLQTHFIKWGFNQFLYVSTAFLDLYSKLGFVKAARRLFDEFSEKDVVSWNALISGYS 138
           E  Q+     K G  Q  +  T  + L+ + G V  A R+F+    K  V ++ ++ G++
Sbjct: 52  ELRQILPLVFKNGLYQEHFFQTKLVSLFCRYGSVDEAARVFEPIDSKLNVLYHTMLKGFA 111

Query: 139 RSGYSHDAFELFVEMCRRGFDPSQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ 198
           +      A + FV M     +P       L+  CG +     GK IH L VK+G  LD  
Sbjct: 112 KVSDLDKALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIHGLLVKSGFSLDLF 171

Query: 199 VKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFLEAMLVLKQMLEER 258
               L +MY KC  +     +F  + E+++VSWNT++  + QNG    A+ ++K M EE 
Sbjct: 172 AMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMALEMVKSMCEEN 231

Query: 259 FNANSVTMVSILSANAN------PGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQMAE 318
              + +T+VS+L A +          IH YA ++G    V++ T+LV  Y +CG ++ A 
Sbjct: 232 LKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETAR 291

Query: 319 LIYMSKLKRNLVALTAIISSYAEKGDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGLTYPH 378
            ++   L+RN+V+  ++I +Y +  +    + ++ ++    +K   V+++G +       
Sbjct: 292 QLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLG 351

Query: 379 HVGIGLAFHGYGLKSGLIIDCLVANGFIRMYSKFDNIDAVFSLFQEMHEKTQSSWNSVIS 438
            +  G   H   ++ GL  +  V N  I MY K   +D   S+F ++  +T  SWN++I 
Sbjct: 352 DLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMIL 411

Query: 439 SCTQAGRSIDAMALFSQMMLSGYGPDSITLASLLSACCQNGNLHSGEILHCYILRNNLDL 498
              Q GR IDA+  FSQM      PD+ T  S+++A  +    H  + +H  ++R+ LD 
Sbjct: 412 GFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDK 471

Query: 499 EGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMM 558
             FV TAL+DMY KCG I +A  +F  M E  + +WN++I GYG  GF   AL  + +M 
Sbjct: 472 NVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQ 531

Query: 559 EKGIKPNKITFSGILAACTHGGLVENGRTYFKTMKKEFGIVPESQHCASMVGLLGRAGLF 618
           +  IKPN +TF  +++AC+H GLVE G   F  MK+ + I     H  +MV LLGRAG  
Sbjct: 532 KGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRL 591

Query: 619 EEAIEFIKNMDINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLY 678
            EA +FI  M + P   V+GA+L AC IH+ V   E  A++L   N  +GG+ VL++N+Y
Sbjct: 592 NEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIY 651

Query: 679 AASGRWNDVARIRKMMREMG---EDGCSVVK 701
            A+  W  V ++R  M   G     GCS+V+
Sbjct: 652 RAASMWEKVGQVRVSMLRQGLRKTPGCSMVE 682

BLAST of Cla97C01G020740 vs. TAIR 10
Match: AT1G18485.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 363.2 bits (931), Expect = 5.1e-100
Identity = 224/689 (32.51%), Positives = 354/689 (51.38%), Query Frame = 0

Query: 88  IKWGFNQFLYVSTAFLDLYSKLGFVKAARRLFDEFSEKDVVSWNALISGYSRSGYSHDAF 147
           +K G  + ++V  A +  Y   GFV  A +LFD   E+++VSWN++I  +S +G+S ++F
Sbjct: 214 VKTGLVEDVFVGNALVSFYGTHGFVTDALQLFDIMPERNLVSWNSMIRVFSDNGFSEESF 273

Query: 148 ELFVEMCRR----GFDPSQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQVKNAL 207
            L  EM        F P   TLV+++P C  ++    GK +H   VK  LD +  + NAL
Sbjct: 274 LLLGEMMEENGDGAFMPDVATLVTVLPVCAREREIGLGKGVHGWAVKLRLDKELVLNNAL 333

Query: 208 ASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFLEAMLVLKQMLE--ERFNA 267
             MY KC  +   +++F     KNVVSWNTM+G F   G       VL+QML   E   A
Sbjct: 334 MDMYSKCGCITNAQMIFKMNNNKNVVSWNTMVGGFSAEGDTHGTFDVLRQMLAGGEDVKA 393

Query: 268 NSVTMVSIL------SANANPGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQMAELIY 327
           + VT+++ +      S   +   +HCY+ K   V N  V  + V SY +CG +  A+ ++
Sbjct: 394 DEVTILNAVPVCFHESFLPSLKELHCYSLKQEFVYNELVANAFVASYAKCGSLSYAQRVF 453

Query: 328 MSKLKRNLVALTAIISSYAEKGDMGSVVKLYSRVQHLDMKL-----DAVAMVGIIQGLTY 387
                + + +  A+I  +A+  D        S   HL MK+     D+  +  ++   + 
Sbjct: 454 HGIRSKTVNSWNALIGGHAQSND-----PRLSLDAHLQMKISGLLPDSFTVCSLLSACSK 513

Query: 388 PHHVGIGLAFHGYGLKSGLIIDCLVANGFIRMYSKFDNIDAVFSLFQEMHEKTQSSWNSV 447
              + +G   HG+ +++ L  D  V    + +Y     +  V +LF  M +K+  SWN+V
Sbjct: 514 LKSLRLGKEVHGFIIRNWLERDLFVYLSVLSLYIHCGELCTVQALFDAMEDKSLVSWNTV 573

Query: 448 ISSCTQAGRSIDAMALFSQMMLSGYGPDSITLASLLSACCQNGNLHSGEILHCYILRNNL 507
           I+   Q G    A+ +F QM+L G     I++  +  AC    +L  G   H Y L++ L
Sbjct: 574 ITGYLQNGFPDRALGVFRQMVLYGIQLCGISMMPVFGACSLLPSLRLGREAHAYALKHLL 633

Query: 508 DLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTK 567
           + + F+  +LIDMY K G I  + KVF  +KE   ASWN++I GYG+ G    A+  + +
Sbjct: 634 EDDAFIACSLIDMYAKNGSITQSSKVFNGLKEKSTASWNAMIMGYGIHGLAKEAIKLFEE 693

Query: 568 MMEKGIKPNKITFSGILAACTHGGLVENGRTYFKTMKKEFGIVPESQHCASMVGLLGRAG 627
           M   G  P+ +TF G+L AC H GL+  G  Y   MK  FG+ P  +H A ++ +LGRAG
Sbjct: 694 MQRTGHNPDDLTFLGVLTACNHSGLIHEGLRYLDQMKSSFGLKPNLKHYACVIDMLGRAG 753

Query: 628 LFEEAIEFI-KNMDINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMS 687
             ++A+  + + M    D  +W +LLS+C IHQ +++GE VA KL          +VL+S
Sbjct: 754 QLDKALRVVAEEMSEEADVGIWKSLLSSCRIHQNLEMGEKVAAKLFELEPEKPENYVLLS 813

Query: 688 NLYAASGRWNDVARIRKMMREMG---EDGCSVVKKSKTMIWPFRVEEE-----GELKNRY 742
           NLYA  G+W DV ++R+ M EM    + GCS ++ ++  ++ F V E       E+K+ +
Sbjct: 814 NLYAGLGKWEDVRKVRQRMNEMSLRKDAGCSWIELNR-KVFSFVVGERFLDGFEEIKSLW 873

BLAST of Cla97C01G020740 vs. TAIR 10
Match: AT3G63370.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 359.0 bits (920), Expect = 9.5e-99
Identity = 210/654 (32.11%), Positives = 348/654 (53.21%), Query Frame = 0

Query: 81  NQLQTHFIKWGFNQFLYVSTAFLDLYSKLGFVKAARRLFDEFSEK-DVVSWNALISGYSR 140
           ++L +  +K G++   ++  A + +Y+K   + AARRLFD F EK D V WN+++S YS 
Sbjct: 202 SELHSLLVKLGYHSTGFIVNALVSMYAKNDDLSAARRLFDGFQEKGDAVLWNSILSSYST 261

Query: 141 SGYSHDAFELFVEMCRRGFDPSQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGL-DLDSQ 200
           SG S +  ELF EM   G  P+  T+VS + +C        GK IHA  +K+     +  
Sbjct: 262 SGKSLETLELFREMHMTGPAPNSYTIVSALTACDGFSYAKLGKEIHASVLKSSTHSSELY 321

Query: 201 VKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFLEAMLVLKQMLEER 260
           V NAL +MY +C  +   E +  ++   +VV+WN++I  + QN  + EA+     M+   
Sbjct: 322 VCNALIAMYTRCGKMPQAERILRQMNNADVVTWNSLIKGYVQNLMYKEALEFFSDMIAAG 381

Query: 261 FNANSVTMVSILSANANPGS------IHCYATKTGLVENVSVVTSLVCSYVRCGCIQMAE 320
             ++ V+M SI++A+    +      +H Y  K G   N+ V  +L+  Y +C       
Sbjct: 382 HKSDEVSMTSIIAASGRLSNLLAGMELHAYVIKHGWDSNLQVGNTLIDMYSKCNLTCYMG 441

Query: 321 LIYMSKLKRNLVALTAIISSYAEKGDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGLTYPH 380
             ++    ++L++ T +I+ YA+       ++L+  V    M++D + +  I++  +   
Sbjct: 442 RAFLRMHDKDLISWTTVIAGYAQNDCHVEALELFRDVAKKRMEIDEMILGSILRASSVLK 501

Query: 381 HVGIGLAFHGYGLKSGLIIDCLVANGFIRMYSKFDNIDAVFSLFQEMHEKTQSSWNSVIS 440
            + I    H + L+ GL +D ++ N  + +Y K  N+     +F+ +  K   SW S+IS
Sbjct: 502 SMLIVKEIHCHILRKGL-LDTVIQNELVDVYGKCRNMGYATRVFESIKGKDVVSWTSMIS 561

Query: 441 SCTQAGRSIDAMALFSQMMLSGYGPDSITLASLLSACCQNGNLHSGEILHCYILRNNLDL 500
           S    G   +A+ LF +M+ +G   DS+ L  +LSA      L+ G  +HCY+LR    L
Sbjct: 562 SSALNGNESEAVELFRRMVETGLSADSVALLCILSAAASLSALNKGREIHCYLLRKGFCL 621

Query: 501 EGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMM 560
           EG +  A++DMY  CG +  A+ VF  ++   L  + S+I+ YG+ G    A+  + KM 
Sbjct: 622 EGSIAVAVVDMYACCGDLQSAKAVFDRIERKGLLQYTSMINAYGMHGCGKAAVELFDKMR 681

Query: 561 EKGIKPNKITFSGILAACTHGGLVENGRTYFKTMKKEFGIVPESQHCASMVGLLGRAGLF 620
            + + P+ I+F  +L AC+H GL++ GR + K M+ E+ + P  +H   +V +LGRA   
Sbjct: 682 HENVSPDHISFLALLYACSHAGLLDEGRGFLKIMEHEYELEPWPEHYVCLVDMLGRANCV 741

Query: 621 EEAIEFIKNMDINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLY 680
            EA EF+K M   P + VW ALL+AC  H E ++GE  A++LL    +N G  VL+SN++
Sbjct: 742 VEAFEFVKMMKTEPTAEVWCALLAACRSHSEKEIGEIAAQRLLELEPKNPGNLVLVSNVF 801

Query: 681 AASGRWNDVARIRKMMREMGED---GCSVVK-KSKTMIWPFRVEEEGELKNRYE 723
           A  GRWNDV ++R  M+  G +   GCS ++   K   +  R +   E K  YE
Sbjct: 802 AEQGRWNDVEKVRAKMKASGMEKHPGCSWIEMDGKVHKFTARDKSHPESKEIYE 854
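
The percentages in the HSP header for AT3G63370.1 are the match counts divided by the alignment length (654 columns, gaps included). As a minimal sketch, the Python snippet below recomputes them from the header line; the function name `hsp_percentages` and the regular expression are illustrative assumptions and not part of any BLAST tooling.

```python
import re

# Header line copied from the AT3G63370.1 HSP in this record.
HSP_LINE = "Identity = 210/654 (32.11%), Positives = 348/654 (53.21%), Query Frame = 0"


def hsp_percentages(line):
    """Recompute percentages from each 'Label = matches/length' pair (hypothetical helper)."""
    result = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        # Percentage = matched columns / total alignment columns, rounded to 2 decimals.
        result[label] = round(100 * int(matches) / int(length), 2)
    return result


percentages = hsp_percentages(HSP_LINE)
print(percentages)
```

Fields without a `matches/length` pair, such as `Query Frame = 0`, are skipped by the pattern.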

The following BLAST results are available for this feature:
Match Name        E-value    Identity  Description
XP_038882792.1    0.0e+00    92.44     pentatricopeptide repeat-containing protein At2g04860 [Benincasa hispida]
XP_023543683.1    0.0e+00    91.16     pentatricopeptide repeat-containing protein At2g04860 [Cucurbita pepo subsp. pep...
XP_022950030.1    0.0e+00    90.58     pentatricopeptide repeat-containing protein At2g04860 isoform X1 [Cucurbita mosc...
KAG6603840.1      0.0e+00    90.58     Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
XP_022977696.1    0.0e+00    90.01     pentatricopeptide repeat-containing protein At2g04860 isoform X1 [Cucurbita maxi...

Match Name        E-value    Identity  Description
Q9SJ73            2.7e-199   51.15     Pentatricopeptide repeat-containing protein At2g04860 OS=Arabidopsis thaliana OX...
Q9M9E2            2.9e-100   32.20     Pentatricopeptide repeat-containing protein At1g15510, chloroplastic OS=Arabidop...
Q3E6Q1            6.4e-100   32.81     Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
Q0WN60            7.1e-99    32.51     Pentatricopeptide repeat-containing protein At1g18485 OS=Arabidopsis thaliana OX...
Q9M1V3            1.3e-97    32.11     Pentatricopeptide repeat-containing protein At3g63370, chloroplastic OS=Arabidop...

Match Name        E-value    Identity  Description
A0A6J1GEF9        0.0e+00    90.58     pentatricopeptide repeat-containing protein At2g04860 isoform X1 OS=Cucurbita mo...
A0A6J1IKP6        0.0e+00    90.01     pentatricopeptide repeat-containing protein At2g04860 isoform X1 OS=Cucurbita ma...
A0A0A0KMV8        0.0e+00    88.87     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G510930 PE=4 SV=1
A0A5D3CM04        0.0e+00    88.73     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S4DTH7        0.0e+00    88.73     pentatricopeptide repeat-containing protein At2g04860 OS=Cucumis melo OX=3656 GN...

Match Name        E-value    Identity  Description
AT2G04860.1       1.9e-200   51.15     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G15510.1       2.1e-101   32.20     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G11290.1       4.6e-101   32.81     Pentatricopeptide repeat (PPR) superfamily protein
AT1G18485.1       5.1e-100   32.51     Pentatricopeptide repeat (PPR) superfamily protein
AT3G63370.1       9.5e-99    32.11     Tetratricopeptide repeat (TPR)-like superfamily protein
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 397..422  e-value: 0.058  score: 13.7
    coord: 601..622  e-value: 0.31  score: 11.4
    coord: 229..258  e-value: 5.0E-7  score: 29.5
    coord: 498..522  e-value: 1.0E-4  score: 22.3
    coord: 325..349  e-value: 1.3  score: 9.4
    coord: 527..556  e-value: 8.1E-4  score: 19.5
IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 498..522  e-value: 1.5E-4  score: 19.7
    coord: 229..262  e-value: 3.2E-6  score: 25.0
    coord: 527..559  e-value: 4.1E-5  score: 21.5
    coord: 128..161  e-value: 1.0E-8  score: 32.8
    coord: 426..458  e-value: 1.6E-6  score: 25.9
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 125..172  e-value: 1.7E-12  score: 47.4
    coord: 425..471  e-value: 4.0E-7  score: 30.1
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 126..160  score: 13.493418
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 524..558  score: 9.514466
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 227..261  score: 10.500983
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 423..457  score: 10.785976
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 13..183  e-value: 1.5E-23  score: 85.7
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 379..480  e-value: 5.5E-20  score: 73.5
    coord: 279..370  e-value: 1.6E-8  score: 36.0
    coord: 184..278  e-value: 1.8E-16  score: 62.0
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 481..589  e-value: 1.4E-20  score: 76.0
    coord: 590..727  e-value: 6.4E-12  score: 47.5
None  No IPR available  PANTHER  PTHR24015:SF502  PPR CONTAINING PLANT-LIKE PROTEIN
    coord: 317..701
None  No IPR available  PANTHER  PTHR24015  OS07G0578800 PROTEIN-RELATED
    coord: 317..701
    coord: 47..225
    coord: 225..315
None  No IPR available  PANTHER  PTHR24015:SF502  PPR CONTAINING PLANT-LIKE PROTEIN
    coord: 47..225
    coord: 225..315
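
Each `coord: start..end` entry in the InterPro table gives a residue range on the protein. As a rough sketch, assuming the ranges are 1-based and inclusive, the Python snippet below parses the four PROSITE PS51375 hits listed above and reports their lengths. The helper names `parse_coord` and `span_length` are hypothetical and do not belong to InterProScan.

```python
import re

# PROSITE PS51375 hits copied from the InterPro table in this record.
PROSITE_HITS = [
    "coord: 126..160",
    "coord: 524..558",
    "coord: 227..261",
    "coord: 423..457",
]


def parse_coord(text):
    """Turn 'coord: 126..160' into the tuple (126, 160) (hypothetical helper)."""
    match = re.fullmatch(r"coord:\s*(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised coordinate entry: {text!r}")
    return int(match.group(1)), int(match.group(2))


def span_length(start, end):
    """Length of a range, assuming both endpoints are included."""
    return end - start + 1


# Sort the hits by start position along the protein.
hits = sorted(parse_coord(entry) for entry in PROSITE_HITS)
for start, end in hits:
    print(f"{start}..{end}  length={span_length(start, end)}")
```

Under the inclusive assumption, every PROSITE hit spans 35 residues, which matches the length of a single pentatricopeptide repeat motif.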

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C01G020740.2  Cla97C01G020740.2  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009451 RNA modification
cellular_component GO:0005739 mitochondrion
molecular_function GO:0004556 alpha-amylase activity
molecular_function GO:0005515 protein binding
molecular_function GO:0003723 RNA binding