Cla97C05G105680 (gene) Watermelon (97103) v2

NameCla97C05G105680
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionPentatricopeptide repeat-containing family protein
LocationCla97Chr05 : 33205714 .. 33207681 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTGGAAAAGAGTTCTATGTTCAATTCCTCACAGGTCGTTTTCCTCTGCGCCAGAAACCCCATCTCTCTACTCCTTCCTCCAACCTTCTCTTTTTGCCCTAAAGAGAACCCCATTTTCGCCTTCTCAAGACTCCACCGACCTCCGTCAGGATCCAACTCCTCAAATTTTAACTCCAGATCGCGTCGCCGCCGTAGAAACGGCCCTCCACAAGTCCCTCCTCACCAGCGACACTGATGAGGCATGGAAATCCTTCAAATTGCTCACGAGAAGCTCTGTTTTCCCATGTAAGCCTCTTACCAATTCACTTATTGCTCACTTGTCCTCAATTGGGGACGTTCATAATCTGAAAAGAGCCTTTGCATCTGTGGTGTTTGTTATTGAGAAGAAACCTGAACTGTTGGATTTTGGATCTGTTAAAGCTTTATTGGCTTCTATGAAATGTGCCAACACTGCTGCCCCTGCTCTTTCTTTGATCAAATGCATGTTTAAGAATCGATGCTTCGTGCCTTTTAGTGTTTGGGGCAATGAACTTGTTGATATTTGCAGACAGAATGGGAGTTTGATTCCCTTTTTAAGAGTTTTTGAAGAGAACTGTAGGATTGCTTTAGATGAGAGCTTGGATTTTATGAAACCAGACCTTATTGCTTGTAATGCAGCACTTGAAGGGTGTTGCTATGAGCTTGAATCTGTAACAGATGCTGAGAAAGTCGTTGAAACGATGTCACTTTTGTATCTTCGGCCTGATGAAGTGAGTTTTGGTGCTCTTGCTTATTTGTATGCATTGAAGGGGCTTGAACAGAAGATAATAGAGTTAGAAGTCTTGATGGGAAATTTTGGTTTTAATCATAAAGATCTCTTTTTTAGTAGTTTGATTAGTGGATACGTTCATGCGAGCAACTTTGCTGTTGTTTCCAAGACTATGTTGCGTAGTTTAAAAGATGAATGTGGAGCACATGTACATTTTGGTGAAAAAACGTATTTGGAAATGGTTAAGGGGTTTATTCAAAGTGGAAATCTGAAGGAATTATCTGCATTGATTGTCAATGCTAAGAATTTGGAGTCTTCATCAGAAGTTGATGGATCTATTGGATTTGGTATCATTAATGCATGTGTTAATATTGGATGGTTAGATAAGGCGCATGACATTCTGAACGAAATCAATTCCCAGGGAGTTTCCCTGGGCCTTGGAGTCTATCTGCCAATCTTGAAAGCTTACAGGAAGGAGCATCGAACAGCTGAAGCCACCCGATTAATCATGGATATTAGCAGTTCTGGGCTTCAGTTGGACGCAGAGAGTTATGATGCTCTAATAGAGGCATCGATGTCGAACCAAGATTTTCAGTCAGCTTTTGCTTTGTTCAGGAATATGAGAGAAACAAGAAAATCTGACATGAAAGCTAGTTATCTAACTATTATGACTGGCTTAATGGAGAACCATAGGCCTGAGTTGATGGCTGCCTTCTTAGATGAAGTTGTTGAAGATCCTCTTGTGGAAGTTGGAACTCATGATTGGAACTCTATTATACATGCCTTTTGCAAAGCTGGAAGACTCGAGGATGCGAGGAGAACATTCCGAAGAATGAAATTTCTGCAGTTTGAACCAAATGAGCAGACCTTCTTGTCCCTAATTAATGGCTATGTGTCTGCAGAGAGATATTTCTGTGTTCTGATGCTGTGGAACGAACTTAAGTGGAAGATTACAGCAAATGGGGAGACAGGTATCAAACTTGACAACAACTTGGTTGATGCATTCCTGTATGCTTTGGTCAAGGGAGGTTTCTTCGATGCCGTGATGCAAGTCGTTGAAAAAACTAAGGATACGAAGATCTTTGTTGATAAGTGGAAGTATAAGCAAGCATTCATGGAGACTCATAAGAAACTCAAAGTGGCAAAGTTGAGGAGGAGGAACCACAGGAAAATGGAATCATTAATTGCTTTCAAGAACTGGGCTGGTCTGAATGCTTGA

mRNA sequence

ATGTGGAAAAGAGTTCTATGTTCAATTCCTCACAGGTCGTTTTCCTCTGCGCCAGAAACCCCATCTCTCTACTCCTTCCTCCAACCTTCTCTTTTTGCCCTAAAGAGAACCCCATTTTCGCCTTCTCAAGACTCCACCGACCTCCGTCAGGATCCAACTCCTCAAATTTTAACTCCAGATCGCGTCGCCGCCGTAGAAACGGCCCTCCACAAGTCCCTCCTCACCAGCGACACTGATGAGGCATGGAAATCCTTCAAATTGCTCACGAGAAGCTCTGTTTTCCCATGTAAGCCTCTTACCAATTCACTTATTGCTCACTTGTCCTCAATTGGGGACGTTCATAATCTGAAAAGAGCCTTTGCATCTGTGGTGTTTGTTATTGAGAAGAAACCTGAACTGTTGGATTTTGGATCTGTTAAAGCTTTATTGGCTTCTATGAAATGTGCCAACACTGCTGCCCCTGCTCTTTCTTTGATCAAATGCATGTTTAAGAATCGATGCTTCGTGCCTTTTAGTGTTTGGGGCAATGAACTTGTTGATATTTGCAGACAGAATGGGAGTTTGATTCCCTTTTTAAGAGTTTTTGAAGAGAACTGTAGGATTGCTTTAGATGAGAGCTTGGATTTTATGAAACCAGACCTTATTGCTTGTAATGCAGCACTTGAAGGGTGTTGCTATGAGCTTGAATCTGTAACAGATGCTGAGAAAGTCGTTGAAACGATGTCACTTTTGTATCTTCGGCCTGATGAAGTGAGTTTTGGTGCTCTTGCTTATTTGTATGCATTGAAGGGGCTTGAACAGAAGATAATAGAGTTAGAAGTCTTGATGGGAAATTTTGGTTTTAATCATAAAGATCTCTTTTTTAGTAGTTTGATTAGTGGATACGTTCATGCGAGCAACTTTGCTGTTGTTTCCAAGACTATGTTGCGTAGTTTAAAAGATGAATGTGGAGCACATGTACATTTTGGTGAAAAAACGTATTTGGAAATGGTTAAGGGGTTTATTCAAAGTGGAAATCTGAAGGAATTATCTGCATTGATTGTCAATGCTAAGAATTTGGAGTCTTCATCAGAAGTTGATGGATCTATTGGATTTGGTATCATTAATGCATGTGTTAATATTGGATGGTTAGATAAGGCGCATGACATTCTGAACGAAATCAATTCCCAGGGAGTTTCCCTGGGCCTTGGAGTCTATCTGCCAATCTTGAAAGCTTACAGGAAGGAGCATCGAACAGCTGAAGCCACCCGATTAATCATGGATATTAGCAGTTCTGGGCTTCAGTTGGACGCAGAGAGTTATGATGCTCTAATAGAGGCATCGATGTCGAACCAAGATTTTCAGTCAGCTTTTGCTTTGTTCAGGAATATGAGAGAAACAAGAAAATCTGACATGAAAGCTAGTTATCTAACTATTATGACTGGCTTAATGGAGAACCATAGGCCTGAGTTGATGGCTGCCTTCTTAGATGAAGTTGTTGAAGATCCTCTTGTGGAAGTTGGAACTCATGATTGGAACTCTATTATACATGCCTTTTGCAAAGCTGGAAGACTCGAGGATGCGAGGAGAACATTCCGAAGAATGAAATTTCTGCAGTTTGAACCAAATGAGCAGACCTTCTTGTCCCTAATTAATGGCTATGTGTCTGCAGAGAGATATTTCTGTGTTCTGATGCTGTGGAACGAACTTAAGTGGAAGATTACAGCAAATGGGGAGACAGGTATCAAACTTGACAACAACTTGGTTGATGCATTCCTGTATGCTTTGGTCAAGGGAGGTTTCTTCGATGCCGTGATGCAAGTCGTTGAAAAAACTAAGGATACGAAGATCTTTGTTGATAAGTGGAAGTATAAGCAAGCATTCATGGAGACTCATAAGAAACTCAAAGTGGCAAAGTTGAGGAGGAGGAACCACAGGAAAATGGAATCATTAATTGCTTTCAAGAACTGGGCTGGTCTGAATGCTTGA

Coding sequence (CDS)

ATGTGGAAAAGAGTTCTATGTTCAATTCCTCACAGGTCGTTTTCCTCTGCGCCAGAAACCCCATCTCTCTACTCCTTCCTCCAACCTTCTCTTTTTGCCCTAAAGAGAACCCCATTTTCGCCTTCTCAAGACTCCACCGACCTCCGTCAGGATCCAACTCCTCAAATTTTAACTCCAGATCGCGTCGCCGCCGTAGAAACGGCCCTCCACAAGTCCCTCCTCACCAGCGACACTGATGAGGCATGGAAATCCTTCAAATTGCTCACGAGAAGCTCTGTTTTCCCATGTAAGCCTCTTACCAATTCACTTATTGCTCACTTGTCCTCAATTGGGGACGTTCATAATCTGAAAAGAGCCTTTGCATCTGTGGTGTTTGTTATTGAGAAGAAACCTGAACTGTTGGATTTTGGATCTGTTAAAGCTTTATTGGCTTCTATGAAATGTGCCAACACTGCTGCCCCTGCTCTTTCTTTGATCAAATGCATGTTTAAGAATCGATGCTTCGTGCCTTTTAGTGTTTGGGGCAATGAACTTGTTGATATTTGCAGACAGAATGGGAGTTTGATTCCCTTTTTAAGAGTTTTTGAAGAGAACTGTAGGATTGCTTTAGATGAGAGCTTGGATTTTATGAAACCAGACCTTATTGCTTGTAATGCAGCACTTGAAGGGTGTTGCTATGAGCTTGAATCTGTAACAGATGCTGAGAAAGTCGTTGAAACGATGTCACTTTTGTATCTTCGGCCTGATGAAGTGAGTTTTGGTGCTCTTGCTTATTTGTATGCATTGAAGGGGCTTGAACAGAAGATAATAGAGTTAGAAGTCTTGATGGGAAATTTTGGTTTTAATCATAAAGATCTCTTTTTTAGTAGTTTGATTAGTGGATACGTTCATGCGAGCAACTTTGCTGTTGTTTCCAAGACTATGTTGCGTAGTTTAAAAGATGAATGTGGAGCACATGTACATTTTGGTGAAAAAACGTATTTGGAAATGGTTAAGGGGTTTATTCAAAGTGGAAATCTGAAGGAATTATCTGCATTGATTGTCAATGCTAAGAATTTGGAGTCTTCATCAGAAGTTGATGGATCTATTGGATTTGGTATCATTAATGCATGTGTTAATATTGGATGGTTAGATAAGGCGCATGACATTCTGAACGAAATCAATTCCCAGGGAGTTTCCCTGGGCCTTGGAGTCTATCTGCCAATCTTGAAAGCTTACAGGAAGGAGCATCGAACAGCTGAAGCCACCCGATTAATCATGGATATTAGCAGTTCTGGGCTTCAGTTGGACGCAGAGAGTTATGATGCTCTAATAGAGGCATCGATGTCGAACCAAGATTTTCAGTCAGCTTTTGCTTTGTTCAGGAATATGAGAGAAACAAGAAAATCTGACATGAAAGCTAGTTATCTAACTATTATGACTGGCTTAATGGAGAACCATAGGCCTGAGTTGATGGCTGCCTTCTTAGATGAAGTTGTTGAAGATCCTCTTGTGGAAGTTGGAACTCATGATTGGAACTCTATTATACATGCCTTTTGCAAAGCTGGAAGACTCGAGGATGCGAGGAGAACATTCCGAAGAATGAAATTTCTGCAGTTTGAACCAAATGAGCAGACCTTCTTGTCCCTAATTAATGGCTATGTGTCTGCAGAGAGATATTTCTGTGTTCTGATGCTGTGGAACGAACTTAAGTGGAAGATTACAGCAAATGGGGAGACAGGTATCAAACTTGACAACAACTTGGTTGATGCATTCCTGTATGCTTTGGTCAAGGGAGGTTTCTTCGATGCCGTGATGCAAGTCGTTGAAAAAACTAAGGATACGAAGATCTTTGTTGATAAGTGGAAGTATAAGCAAGCATTCATGGAGACTCATAAGAAACTCAAAGTGGCAAAGTTGAGGAGGAGGAACCACAGGAAAATGGAATCATTAATTGCTTTCAAGAACTGGGCTGGTCTGAATGCTTGA

Protein sequence

MWKRVLCSIPHRSFSSAPETPSLYSFLQPSLFALKRTPFSPSQDSTDLRQDPTPQILTPDRVAAVETALHKSLLTSDTDEAWKSFKLLTRSSVFPCKPLTNSLIAHLSSIGDVHNLKRAFASVVFVIEKKPELLDFGSVKALLASMKCANTAAPALSLIKCMFKNRCFVPFSVWGNELVDICRQNGSLIPFLRVFEENCRIALDESLDFMKPDLIACNAALEGCCYELESVTDAEKVVETMSLLYLRPDEVSFGALAYLYALKGLEQKIIELEVLMGNFGFNHKDLFFSSLISGYVHASNFAVVSKTMLRSLKDECGAHVHFGEKTYLEMVKGFIQSGNLKELSALIVNAKNLESSSEVDGSIGFGIINACVNIGWLDKAHDILNEINSQGVSLGLGVYLPILKAYRKEHRTAEATRLIMDISSSGLQLDAESYDALIEASMSNQDFQSAFALFRNMRETRKSDMKASYLTIMTGLMENHRPELMAAFLDEVVEDPLVEVGTHDWNSIIHAFCKAGRLEDARRTFRRMKFLQFEPNEQTFLSLINGYVSAERYFCVLMLWNELKWKITANGETGIKLDNNLVDAFLYALVKGGFFDAVMQVVEKTKDTKIFVDKWKYKQAFMETHKKLKVAKLRRRNHRKMESLIAFKNWAGLNA
BLAST of Cla97C05G105680 vs. NCBI nr
Match: XP_008446433.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g69290 [Cucumis melo])

HSP 1 Score: 1214.9 bits (3142), Expect = 0.0e+00
Identity = 605/655 (92.37%), Postives = 633/655 (96.64%), Query Frame = 0

Query: 1   MWKRVLCSIPHRSFSSAPETPSLYSFLQPSLFALKRTPFSPSQDSTDLRQDPTPQILTPD 60
           MWKRVLC IPHRSFSS PETPSLYSFLQPSLFA KRTPFSPSQDSTDLRQDPTPQ LTPD
Sbjct: 1   MWKRVLCLIPHRSFSSVPETPSLYSFLQPSLFAKKRTPFSPSQDSTDLRQDPTPQTLTPD 60

Query: 61  RVAAVETALHKSLLTSDTDEAWKSFKLLTRSSVFPCKPLTNSLIAHLSSIGDVHNLKRAF 120
           RVAAVETALHKSLLTSDTDEAWKSFKLLTRSS+FP K LTNSLIAHLSSIGDVHNLKRAF
Sbjct: 61  RVAAVETALHKSLLTSDTDEAWKSFKLLTRSSIFPSKSLTNSLIAHLSSIGDVHNLKRAF 120

Query: 121 ASVVFVIEKKPELLDFGSVKALLASMKCANTAAPALSLIKCMFKNRCFVPFSVWGNELVD 180
           ASVVFVIEKKPELLDFGSVKALLASMKCANTAAPALSLIKCMFKNRCFVPFSVWG ELVD
Sbjct: 121 ASVVFVIEKKPELLDFGSVKALLASMKCANTAAPALSLIKCMFKNRCFVPFSVWGKELVD 180

Query: 181 ICRQNGSLIPFLRVFEENCRIALDESLDFMKPDLIACNAALEGCCYELESVTDAEKVVET 240
           ICRQ+GSLIPFLRVFEENCRIALDE LDF+KPDLIACNAALEGCC+ELESVTDAEKVVET
Sbjct: 181 ICRQSGSLIPFLRVFEENCRIALDERLDFLKPDLIACNAALEGCCHELESVTDAEKVVET 240

Query: 241 MSLLYLRPDEVSFGALAYLYALKGLEQKIIELEVLMGNFGFNHKDLFFSSLISGYVHASN 300
           MSLLYLRPDEVSFGALAYLYALKGLEQKIIELEVLMG+FGF  KDL FS+L+SGYV+ASN
Sbjct: 241 MSLLYLRPDEVSFGALAYLYALKGLEQKIIELEVLMGSFGFTRKDLLFSNLVSGYVNASN 300

Query: 301 FAVVSKTMLRSLKDECGAHVHFGEKTYLEMVKGFIQSGNLKELSALIVNAKNLESSSEVD 360
           FA VSKTMLRSLKDECG+HVHFGEKTYLEMVKGFIQSGNLKELSALI++A+NLESSS VD
Sbjct: 301 FAAVSKTMLRSLKDECGSHVHFGEKTYLEMVKGFIQSGNLKELSALIIDAQNLESSSAVD 360

Query: 361 GSIGFGIINACVNIGWLDKAHDILNEINSQGVSLGLGVYLPILKAYRKEHRTAEATRLIM 420
           GSIG+GIINACVNIGWLDKA  +LNEINSQGVSLGLGVY+PILKAYR E RT EAT+L+M
Sbjct: 361 GSIGYGIINACVNIGWLDKAQYVLNEINSQGVSLGLGVYMPILKAYRTERRTTEATQLVM 420

Query: 421 DISSSGLQLDAESYDALIEASMSNQDFQSAFALFRNMRETRKSDMKASYLTIMTGLMENH 480
           DI++SG+QLDAESYD+LIEASMSNQDFQSAF LFRNMRETRKSD KASYLTIMTGLMENH
Sbjct: 421 DITNSGIQLDAESYDSLIEASMSNQDFQSAFTLFRNMRETRKSDTKASYLTIMTGLMENH 480

Query: 481 RPELMAAFLDEVVEDPLVEVGTHDWNSIIHAFCKAGRLEDARRTFRRMKFLQFEPNEQTF 540
           RPELMAAFLDE+VEDPLVEVGTHDWNSIIHAFCKAGRLEDARRT+RRMKFLQFEPNEQTF
Sbjct: 481 RPELMAAFLDEIVEDPLVEVGTHDWNSIIHAFCKAGRLEDARRTYRRMKFLQFEPNEQTF 540

Query: 541 LSLINGYVSAERYFCVLMLWNELKWKITANGETGIKLDNNLVDAFLYALVKGGFFDAVMQ 600
           LSLINGYVSAERYFCVLMLWNELKWK+T +GE+GIKLDNNLVDAFLYALVKGGFFDAVMQ
Sbjct: 541 LSLINGYVSAERYFCVLMLWNELKWKVTPDGESGIKLDNNLVDAFLYALVKGGFFDAVMQ 600

Query: 601 VVEKTKDTKIFVDKWKYKQAFMETHKKLKVAKLRRRNHRKMESLIAFKNWAGLNA 656
           VVEKTKDTKIF+DKWKYKQAFME HKKLKVAKLRRRNHRKMESLIAFKNWAGL+A
Sbjct: 601 VVEKTKDTKIFIDKWKYKQAFMENHKKLKVAKLRRRNHRKMESLIAFKNWAGLSA 655

BLAST of Cla97C05G105680 vs. NCBI nr
Match: XP_004135146.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g69290 [Cucumis sativus] >KGN51979.1 hypothetical protein Csa_5G606690 [Cucumis sativus])

HSP 1 Score: 1208.7 bits (3126), Expect = 0.0e+00
Identity = 607/656 (92.53%), Postives = 633/656 (96.49%), Query Frame = 0

Query: 1   MWKRVLCSIPHRSFSSAPET-PSLYSFLQPSLFALKRTPFSPSQDSTDLRQDPTPQILTP 60
           MWKRVLC IPHRSFSS PE  PSLYSFLQPSLFA KRTPFSPSQDSTDLRQDPTPQ LTP
Sbjct: 1   MWKRVLCLIPHRSFSSVPENPPSLYSFLQPSLFARKRTPFSPSQDSTDLRQDPTPQNLTP 60

Query: 61  DRVAAVETALHKSLLTSDTDEAWKSFKLLTRSSVFPCKPLTNSLIAHLSSIGDVHNLKRA 120
           D VA VETALHKSLLTSDTDEAWKSFKLLTRSS FP K LTNSLIAHLSSIGDVHNLKRA
Sbjct: 61  DGVAVVETALHKSLLTSDTDEAWKSFKLLTRSSAFPSKSLTNSLIAHLSSIGDVHNLKRA 120

Query: 121 FASVVFVIEKKPELLDFGSVKALLASMKCANTAAPALSLIKCMFKNRCFVPFSVWGNELV 180
           FASVVFVIEKKPELLDFGSVKALLASMKCANTAAPALSLIKCMFKNRCFVPFSVWG ELV
Sbjct: 121 FASVVFVIEKKPELLDFGSVKALLASMKCANTAAPALSLIKCMFKNRCFVPFSVWGKELV 180

Query: 181 DICRQNGSLIPFLRVFEENCRIALDESLDFMKPDLIACNAALEGCCYELESVTDAEKVVE 240
           DICRQ+GSLIPFLRVFEENCRIALDE LDF+KPDLIACNAALEGCC+ELESVTDAEKV+E
Sbjct: 181 DICRQSGSLIPFLRVFEENCRIALDERLDFLKPDLIACNAALEGCCHELESVTDAEKVIE 240

Query: 241 TMSLLYLRPDEVSFGALAYLYALKGLEQKIIELEVLMGNFGFNHKDLFFSSLISGYVHAS 300
           TMSLLYLRPDEVSFGALAYLYALKGL+QKIIELEVLMG+FGF  KDLFFS+L+SGYV+AS
Sbjct: 241 TMSLLYLRPDEVSFGALAYLYALKGLDQKIIELEVLMGSFGFTCKDLFFSNLVSGYVNAS 300

Query: 301 NFAVVSKTMLRSLKDECGAHVHFGEKTYLEMVKGFIQSGNLKELSALIVNAKNLESSSEV 360
           NFA VSKTMLRSLKDECG+HVHFGEKTYLEMVKGFIQSGNLKELSALI++A+NLESSS V
Sbjct: 301 NFAAVSKTMLRSLKDECGSHVHFGEKTYLEMVKGFIQSGNLKELSALIIDAQNLESSSAV 360

Query: 361 DGSIGFGIINACVNIGWLDKAHDILNEINSQGVSLGLGVYLPILKAYRKEHRTAEATRLI 420
           DGSIGFGIINACVNIGWLDKA  IL+E+NSQGVSLGLGVYLPILKAYRKEHRTA AT+LI
Sbjct: 361 DGSIGFGIINACVNIGWLDKAQYILDEMNSQGVSLGLGVYLPILKAYRKEHRTAAATQLI 420

Query: 421 MDISSSGLQLDAESYDALIEASMSNQDFQSAFALFRNMRETRKSDMKASYLTIMTGLMEN 480
           MDISSSG+QLDAE+YDALIEASMSNQDFQSAF LFR+MRETRKSD KASYLTIMTGLMEN
Sbjct: 421 MDISSSGIQLDAENYDALIEASMSNQDFQSAFTLFRSMRETRKSDTKASYLTIMTGLMEN 480

Query: 481 HRPELMAAFLDEVVEDPLVEVGTHDWNSIIHAFCKAGRLEDARRTFRRMKFLQFEPNEQT 540
           HRPELMAAFLDE+VEDPLVEVGTHDWNSIIHAFCKAGRLEDARRT+RRMKFLQFEPNEQT
Sbjct: 481 HRPELMAAFLDEIVEDPLVEVGTHDWNSIIHAFCKAGRLEDARRTYRRMKFLQFEPNEQT 540

Query: 541 FLSLINGYVSAERYFCVLMLWNELKWKITANGETGIKLDNNLVDAFLYALVKGGFFDAVM 600
           FLSLINGYVSAERYFCVLMLWNELKWK+T NGE+GIKLDNNLVDAFLYALVKGGFFDAVM
Sbjct: 541 FLSLINGYVSAERYFCVLMLWNELKWKVTPNGESGIKLDNNLVDAFLYALVKGGFFDAVM 600

Query: 601 QVVEKTKDTKIFVDKWKYKQAFMETHKKLKVAKLRRRNHRKMESLIAFKNWAGLNA 656
           QVVEKTKDTKIF+DKWKYKQAFMETHKKLKVAKLRRRN++KMESLIAFKNWAGLNA
Sbjct: 601 QVVEKTKDTKIFIDKWKYKQAFMETHKKLKVAKLRRRNYKKMESLIAFKNWAGLNA 656

BLAST of Cla97C05G105680 vs. NCBI nr
Match: XP_022149103.1 (pentatricopeptide repeat-containing protein At1g69290 [Momordica charantia])

HSP 1 Score: 1166.8 bits (3017), Expect = 0.0e+00
Identity = 588/655 (89.77%), Postives = 622/655 (94.96%), Query Frame = 0

Query: 1   MWKRVLCSIPHRSFSSAPETPSLYSFLQPSLFALKRTPFSPSQDSTDLRQDPTPQILTPD 60
           MWK  L SIP RSFSSAPE P+LYSFLQPSLFALKRTP S SQ+STDLRQ+PTPQ LTPD
Sbjct: 1   MWKTALYSIPRRSFSSAPEIPTLYSFLQPSLFALKRTPLSSSQESTDLRQNPTPQTLTPD 60

Query: 61  RVAAVETALHKSLLTSDTDEAWKSFKLLTRSSVFPCKPLTNSLIAHLSSIGDVHNLKRAF 120
           RVAAVET LHKSLLTSDTDEAWKSFKLLTRSS FPCK LTNSLIAHLSSIGDVHNLKRAF
Sbjct: 61  RVAAVETTLHKSLLTSDTDEAWKSFKLLTRSSAFPCKSLTNSLIAHLSSIGDVHNLKRAF 120

Query: 121 ASVVFVIEKKPELLDFGSVKALLASMKCANTAAPALSLIKCMFKNRCFVPFSVWGNELVD 180
           ASVVFVIEKKPELL+F SVK LLASMKCANTAAPALSLIKCMFKNRCFVPFSVWGNELVD
Sbjct: 121 ASVVFVIEKKPELLEFESVKTLLASMKCANTAAPALSLIKCMFKNRCFVPFSVWGNELVD 180

Query: 181 ICRQNGSLIPFLRVFEENCRIALDESLDFMKPDLIACNAALEGCCYELESVTDAEKVVET 240
           ICRQ+GSLIPFLRVFEENCRIALDE LDFMKPDLIACNAALEGCC+ELESV DAEKVVET
Sbjct: 181 ICRQSGSLIPFLRVFEENCRIALDERLDFMKPDLIACNAALEGCCHELESVMDAEKVVET 240

Query: 241 MSLLYLRPDEVSFGALAYLYALKGLEQKIIELEVLMGNFGFNHKDLFFSSLISGYVHASN 300
           MSLL LRPDE SFGALAYLYALKGLEQKI+ELE LMG+FGF  K  FF++L+  YV++ N
Sbjct: 241 MSLLNLRPDEASFGALAYLYALKGLEQKIMELEGLMGSFGFACKSFFFANLVGAYVNSGN 300

Query: 301 FAVVSKTMLRSLKDECGAHVHFGEKTYLEMVKGFIQSGNLKELSALIVNAKNLESSSEVD 360
           FA VS+TMLRSLKDE GAHV+FGE+TY+E+VKGF+QSGNLKELSALIV+A+NLESSSEVD
Sbjct: 301 FAAVSRTMLRSLKDERGAHVNFGERTYMEVVKGFVQSGNLKELSALIVDAQNLESSSEVD 360

Query: 361 GSIGFGIINACVNIGWLDKAHDILNEINSQGVSLGLGVYLPILKAYRKEHRTAEATRLIM 420
           GSIGFGIINACVNIG LDKAH ILNEINSQGV LGLGVYLPILKAY+KEHRTAEAT+LIM
Sbjct: 361 GSIGFGIINACVNIGRLDKAHSILNEINSQGVPLGLGVYLPILKAYQKEHRTAEATQLIM 420

Query: 421 DISSSGLQLDAESYDALIEASMSNQDFQSAFALFRNMRETRKSDMKASYLTIMTGLMENH 480
           DISSSGLQLDAESYDALIEASMS+QDFQSAFALFR+MRETRKSD +ASYLTIMTGLMENH
Sbjct: 421 DISSSGLQLDAESYDALIEASMSSQDFQSAFALFRSMRETRKSDTRASYLTIMTGLMENH 480

Query: 481 RPELMAAFLDEVVEDPLVEVGTHDWNSIIHAFCKAGRLEDARRTFRRMKFLQFEPNEQTF 540
           RPELMAAFLDEVVEDPLVEVGTHDWNSIIHAFCKAGRLEDARRTFRRMKFLQFEPNEQTF
Sbjct: 481 RPELMAAFLDEVVEDPLVEVGTHDWNSIIHAFCKAGRLEDARRTFRRMKFLQFEPNEQTF 540

Query: 541 LSLINGYVSAERYFCVLMLWNELKWKITANGETGIKLDNNLVDAFLYALVKGGFFDAVMQ 600
           LSLINGYVSAERYFCVLMLW+E+KWK+T +GE GIKLD+NLVDAFLYALVKGGFFD+VMQ
Sbjct: 541 LSLINGYVSAERYFCVLMLWHEVKWKVTTDGERGIKLDSNLVDAFLYALVKGGFFDSVMQ 600

Query: 601 VVEKTKDTKIFVDKWKYKQAFMETHKKLKVAKLRRRNHRKMESLIAFKNWAGLNA 656
           VVEKTKDTKIFVDKWKYKQAFMETHKKLKVAKLR+RN+RKMESLIAFKNWAGLNA
Sbjct: 601 VVEKTKDTKIFVDKWKYKQAFMETHKKLKVAKLRKRNYRKMESLIAFKNWAGLNA 655

BLAST of Cla97C05G105680 vs. NCBI nr
Match: XP_022968525.1 (pentatricopeptide repeat-containing protein At1g69290 [Cucurbita maxima])

HSP 1 Score: 1125.2 bits (2909), Expect = 0.0e+00
Identity = 565/655 (86.26%), Postives = 599/655 (91.45%), Query Frame = 0

Query: 1   MWKRVLCSIPHRSFSSAPETPSLYSFLQPSLFALKRTPFSPSQDSTDLRQDPTPQILTPD 60
           MWKR +CSIP R FSS PE  SLYSFLQPSLFA KR PFSPSQ+STDLRQ+ TPQ LT D
Sbjct: 1   MWKRAVCSIPRRLFSSTPEVSSLYSFLQPSLFATKRAPFSPSQESTDLRQNQTPQSLTTD 60

Query: 61  RVAAVETALHKSLLTSDTDEAWKSFKLLTRSSVFPCKPLTNSLIAHLSSIGDVHNLKRAF 120
           RVAAVET LHKSLLTSDTDEAWKSFKLLT+SSVFPCK LTNSLIAHLSSIGDVHNLKRAF
Sbjct: 61  RVAAVETTLHKSLLTSDTDEAWKSFKLLTKSSVFPCKSLTNSLIAHLSSIGDVHNLKRAF 120

Query: 121 ASVVFVIEKKPELLDFGSVKALLASMKCANTAAPALSLIKCMFKNRCFVPFSVWGNELVD 180
           AS VFVIEKKPELLDFGSVK LLASMKCANTAAPALSLIKCM KNRCFVPF  WGNELV 
Sbjct: 121 ASAVFVIEKKPELLDFGSVKTLLASMKCANTAAPALSLIKCMLKNRCFVPFECWGNELVS 180

Query: 181 ICRQNGSLIPFLRVFEENCRIALDESLDFMKPDLIACNAALEGCCYELESVTDAEKVVET 240
           ICRQ+GSLIPFLRVFEE CRI L+E LD MKPDL ACNAALEGCC+ELESVTDAE VVET
Sbjct: 181 ICRQSGSLIPFLRVFEEICRIVLNERLDSMKPDLNACNAALEGCCHELESVTDAEHVVET 240

Query: 241 MSLLYLRPDEVSFGALAYLYALKGLEQKIIELEVLMGNFGFNHKDLFFSSLISGYVHASN 300
           MSLL LRPDEV+ GALAYLYALKGLEQKIIEL+ LMG+FGF  K LFF++L+SGYV++ +
Sbjct: 241 MSLLNLRPDEVTIGALAYLYALKGLEQKIIELKCLMGSFGFTSKSLFFNNLVSGYVNSGD 300

Query: 301 FAVVSKTMLRSLKDECGAHVHFGEKTYLEMVKGFIQSGNLKELSALIVNAKNLESSSEVD 360
            A VSKTML  LKDECG HV F EKTYLE+VK F+QSGNLKELS+LIV+A+NLES ++VD
Sbjct: 301 LAAVSKTMLDGLKDECGEHVRFEEKTYLEVVKAFVQSGNLKELSSLIVDAQNLESLTDVD 360

Query: 361 GSIGFGIINACVNIGWLDKAHDILNEINSQGVSLGLGVYLPILKAYRKEHRTAEATRLIM 420
           GSIGFGIINACVNIGWLD  H IL EINSQGVS+GLGVY+PILKAY+KE RTAEAT+LIM
Sbjct: 361 GSIGFGIINACVNIGWLDNVHAILKEINSQGVSVGLGVYMPILKAYQKERRTAEATQLIM 420

Query: 421 DISSSGLQLDAESYDALIEASMSNQDFQSAFALFRNMRETRKSDMKASYLTIMTGLMENH 480
           D+SSSG+QLDAES+DALIEASMSNQDFQSAFALFR MRETRKSD  ASYLTIMTGLME+H
Sbjct: 421 DVSSSGIQLDAESFDALIEASMSNQDFQSAFALFRKMRETRKSDTNASYLTIMTGLMESH 480

Query: 481 RPELMAAFLDEVVEDPLVEVGTHDWNSIIHAFCKAGRLEDARRTFRRMKFLQFEPNEQTF 540
           RPELMAAFLDEVVEDPLVEVGTHDWNSIIHAFCKAGRLEDARRTFRRMKFLQFEPNEQTF
Sbjct: 481 RPELMAAFLDEVVEDPLVEVGTHDWNSIIHAFCKAGRLEDARRTFRRMKFLQFEPNEQTF 540

Query: 541 LSLINGYVSAERYFCVLMLWNELKWKITANGETGIKLDNNLVDAFLYALVKGGFFDAVMQ 600
           LSLI+GYVS ERYFCVLMLWNELKWKIT NGE G KLD+NLVDAFLYALVKGGFFDAVMQ
Sbjct: 541 LSLIHGYVSGERYFCVLMLWNELKWKITPNGEKGFKLDSNLVDAFLYALVKGGFFDAVMQ 600

Query: 601 VVEKTKDTKIFVDKWKYKQAFMETHKKLKVAKLRRRNHRKMESLIAFKNWAGLNA 656
           VVEKTKDTK FVDKWKYKQAFMETHKKLKVAKLRRRNHRKM+SLI FKNW GLNA
Sbjct: 601 VVEKTKDTKTFVDKWKYKQAFMETHKKLKVAKLRRRNHRKMQSLIDFKNWVGLNA 655

BLAST of Cla97C05G105680 vs. NCBI nr
Match: XP_023541058.1 (pentatricopeptide repeat-containing protein At1g69290 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1120.9 bits (2898), Expect = 0.0e+00
Identity = 564/655 (86.11%), Postives = 600/655 (91.60%), Query Frame = 0

Query: 1   MWKRVLCSIPHRSFSSAPETPSLYSFLQPSLFALKRTPFSPSQDSTDLRQDPTPQILTPD 60
           MWKR +CSIP R FSS PE  SLYSFLQPSLFA KR PFSPSQ+STDLRQ+ TPQILT D
Sbjct: 1   MWKRAVCSIPRRLFSSTPEVSSLYSFLQPSLFAPKRAPFSPSQESTDLRQNQTPQILTTD 60

Query: 61  RVAAVETALHKSLLTSDTDEAWKSFKLLTRSSVFPCKPLTNSLIAHLSSIGDVHNLKRAF 120
           RVAAVET LH SLLTSDTDEAWKSFKLLT+SSVFPCK LTNSLIAHLSSIGDVHNLKRAF
Sbjct: 61  RVAAVETTLHNSLLTSDTDEAWKSFKLLTKSSVFPCKSLTNSLIAHLSSIGDVHNLKRAF 120

Query: 121 ASVVFVIEKKPELLDFGSVKALLASMKCANTAAPALSLIKCMFKNRCFVPFSVWGNELVD 180
           AS VFVIEKKPELLDFGSVK LLASMKCANTAAPA+SLIKCM KNRCFVPF  WGNELV 
Sbjct: 121 ASAVFVIEKKPELLDFGSVKTLLASMKCANTAAPAVSLIKCMLKNRCFVPFEFWGNELVS 180

Query: 181 ICRQNGSLIPFLRVFEENCRIALDESLDFMKPDLIACNAALEGCCYELESVTDAEKVVET 240
           ICRQ+GSLIPFLRVFEE CRI L+E L  MKPDL ACNAALEGCC+ELESVTDAE VVET
Sbjct: 181 ICRQSGSLIPFLRVFEEICRIVLNERLYSMKPDLNACNAALEGCCHELESVTDAEHVVET 240

Query: 241 MSLLYLRPDEVSFGALAYLYALKGLEQKIIELEVLMGNFGFNHKDLFFSSLISGYVHASN 300
           MSLL LRPDEV+FG+LAYLYALKGLEQKIIEL+ LMG+FGF  K LFF++L+SGY ++ +
Sbjct: 241 MSLLNLRPDEVTFGSLAYLYALKGLEQKIIELKRLMGSFGFASKSLFFNNLVSGYGNSGD 300

Query: 301 FAVVSKTMLRSLKDECGAHVHFGEKTYLEMVKGFIQSGNLKELSALIVNAKNLESSSEVD 360
            A VSKTML  LKDECG HV F EKTYLE+VK F+QSGNLKELS+LIV+A+NLESS++VD
Sbjct: 301 LAAVSKTMLDGLKDECGEHVRFEEKTYLEVVKAFVQSGNLKELSSLIVDAQNLESSTDVD 360

Query: 361 GSIGFGIINACVNIGWLDKAHDILNEINSQGVSLGLGVYLPILKAYRKEHRTAEATRLIM 420
           GSIGFGIINACVNIGWLD  H IL EINSQGVS+GLGVY+PILKAY+KE RTAEAT+LIM
Sbjct: 361 GSIGFGIINACVNIGWLDNVHAILKEINSQGVSVGLGVYMPILKAYQKERRTAEATQLIM 420

Query: 421 DISSSGLQLDAESYDALIEASMSNQDFQSAFALFRNMRETRKSDMKASYLTIMTGLMENH 480
           D+SSSGLQLDAE++DALIEASMSNQDFQSAFALFR MRETRKS+  ASYLTIMTGLME+H
Sbjct: 421 DVSSSGLQLDAETFDALIEASMSNQDFQSAFALFRKMRETRKSNTNASYLTIMTGLMESH 480

Query: 481 RPELMAAFLDEVVEDPLVEVGTHDWNSIIHAFCKAGRLEDARRTFRRMKFLQFEPNEQTF 540
           RPELMAAFLDEVVEDPLVEVGTHDWNSIIHAFCKAGRLEDARRTFRRMKFLQFEPNEQTF
Sbjct: 481 RPELMAAFLDEVVEDPLVEVGTHDWNSIIHAFCKAGRLEDARRTFRRMKFLQFEPNEQTF 540

Query: 541 LSLINGYVSAERYFCVLMLWNELKWKITANGETGIKLDNNLVDAFLYALVKGGFFDAVMQ 600
           LSLINGYVS ERYFCVLMLWNELKWKITANGE G KLD+NLVDAFLYALVKGGFFDAVMQ
Sbjct: 541 LSLINGYVSGERYFCVLMLWNELKWKITANGERGFKLDSNLVDAFLYALVKGGFFDAVMQ 600

Query: 601 VVEKTKDTKIFVDKWKYKQAFMETHKKLKVAKLRRRNHRKMESLIAFKNWAGLNA 656
           VVEKTKDTK FVDKWKYKQAFMETHKKLKVAKLRRRNHRKM+SLI FKNW GLNA
Sbjct: 601 VVEKTKDTKTFVDKWKYKQAFMETHKKLKVAKLRRRNHRKMQSLIDFKNWVGLNA 655

BLAST of Cla97C05G105680 vs. TrEMBL
Match: tr|A0A1S3BF23|A0A1S3BF23_CUCME (pentatricopeptide repeat-containing protein At1g69290 OS=Cucumis melo OX=3656 GN=LOC103489182 PE=4 SV=1)

HSP 1 Score: 1214.9 bits (3142), Expect = 0.0e+00
Identity = 605/655 (92.37%), Postives = 633/655 (96.64%), Query Frame = 0

Query: 1   MWKRVLCSIPHRSFSSAPETPSLYSFLQPSLFALKRTPFSPSQDSTDLRQDPTPQILTPD 60
           MWKRVLC IPHRSFSS PETPSLYSFLQPSLFA KRTPFSPSQDSTDLRQDPTPQ LTPD
Sbjct: 1   MWKRVLCLIPHRSFSSVPETPSLYSFLQPSLFAKKRTPFSPSQDSTDLRQDPTPQTLTPD 60

Query: 61  RVAAVETALHKSLLTSDTDEAWKSFKLLTRSSVFPCKPLTNSLIAHLSSIGDVHNLKRAF 120
           RVAAVETALHKSLLTSDTDEAWKSFKLLTRSS+FP K LTNSLIAHLSSIGDVHNLKRAF
Sbjct: 61  RVAAVETALHKSLLTSDTDEAWKSFKLLTRSSIFPSKSLTNSLIAHLSSIGDVHNLKRAF 120

Query: 121 ASVVFVIEKKPELLDFGSVKALLASMKCANTAAPALSLIKCMFKNRCFVPFSVWGNELVD 180
           ASVVFVIEKKPELLDFGSVKALLASMKCANTAAPALSLIKCMFKNRCFVPFSVWG ELVD
Sbjct: 121 ASVVFVIEKKPELLDFGSVKALLASMKCANTAAPALSLIKCMFKNRCFVPFSVWGKELVD 180

Query: 181 ICRQNGSLIPFLRVFEENCRIALDESLDFMKPDLIACNAALEGCCYELESVTDAEKVVET 240
           ICRQ+GSLIPFLRVFEENCRIALDE LDF+KPDLIACNAALEGCC+ELESVTDAEKVVET
Sbjct: 181 ICRQSGSLIPFLRVFEENCRIALDERLDFLKPDLIACNAALEGCCHELESVTDAEKVVET 240

Query: 241 MSLLYLRPDEVSFGALAYLYALKGLEQKIIELEVLMGNFGFNHKDLFFSSLISGYVHASN 300
           MSLLYLRPDEVSFGALAYLYALKGLEQKIIELEVLMG+FGF  KDL FS+L+SGYV+ASN
Sbjct: 241 MSLLYLRPDEVSFGALAYLYALKGLEQKIIELEVLMGSFGFTRKDLLFSNLVSGYVNASN 300

Query: 301 FAVVSKTMLRSLKDECGAHVHFGEKTYLEMVKGFIQSGNLKELSALIVNAKNLESSSEVD 360
           FA VSKTMLRSLKDECG+HVHFGEKTYLEMVKGFIQSGNLKELSALI++A+NLESSS VD
Sbjct: 301 FAAVSKTMLRSLKDECGSHVHFGEKTYLEMVKGFIQSGNLKELSALIIDAQNLESSSAVD 360

Query: 361 GSIGFGIINACVNIGWLDKAHDILNEINSQGVSLGLGVYLPILKAYRKEHRTAEATRLIM 420
           GSIG+GIINACVNIGWLDKA  +LNEINSQGVSLGLGVY+PILKAYR E RT EAT+L+M
Sbjct: 361 GSIGYGIINACVNIGWLDKAQYVLNEINSQGVSLGLGVYMPILKAYRTERRTTEATQLVM 420

Query: 421 DISSSGLQLDAESYDALIEASMSNQDFQSAFALFRNMRETRKSDMKASYLTIMTGLMENH 480
           DI++SG+QLDAESYD+LIEASMSNQDFQSAF LFRNMRETRKSD KASYLTIMTGLMENH
Sbjct: 421 DITNSGIQLDAESYDSLIEASMSNQDFQSAFTLFRNMRETRKSDTKASYLTIMTGLMENH 480

Query: 481 RPELMAAFLDEVVEDPLVEVGTHDWNSIIHAFCKAGRLEDARRTFRRMKFLQFEPNEQTF 540
           RPELMAAFLDE+VEDPLVEVGTHDWNSIIHAFCKAGRLEDARRT+RRMKFLQFEPNEQTF
Sbjct: 481 RPELMAAFLDEIVEDPLVEVGTHDWNSIIHAFCKAGRLEDARRTYRRMKFLQFEPNEQTF 540

Query: 541 LSLINGYVSAERYFCVLMLWNELKWKITANGETGIKLDNNLVDAFLYALVKGGFFDAVMQ 600
           LSLINGYVSAERYFCVLMLWNELKWK+T +GE+GIKLDNNLVDAFLYALVKGGFFDAVMQ
Sbjct: 541 LSLINGYVSAERYFCVLMLWNELKWKVTPDGESGIKLDNNLVDAFLYALVKGGFFDAVMQ 600

Query: 601 VVEKTKDTKIFVDKWKYKQAFMETHKKLKVAKLRRRNHRKMESLIAFKNWAGLNA 656
           VVEKTKDTKIF+DKWKYKQAFME HKKLKVAKLRRRNHRKMESLIAFKNWAGL+A
Sbjct: 601 VVEKTKDTKIFIDKWKYKQAFMENHKKLKVAKLRRRNHRKMESLIAFKNWAGLSA 655

BLAST of Cla97C05G105680 vs. TrEMBL
Match: tr|A0A0A0KSW9|A0A0A0KSW9_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G606690 PE=4 SV=1)

HSP 1 Score: 1208.7 bits (3126), Expect = 0.0e+00
Identity = 607/656 (92.53%), Postives = 633/656 (96.49%), Query Frame = 0

Query: 1   MWKRVLCSIPHRSFSSAPET-PSLYSFLQPSLFALKRTPFSPSQDSTDLRQDPTPQILTP 60
           MWKRVLC IPHRSFSS PE  PSLYSFLQPSLFA KRTPFSPSQDSTDLRQDPTPQ LTP
Sbjct: 1   MWKRVLCLIPHRSFSSVPENPPSLYSFLQPSLFARKRTPFSPSQDSTDLRQDPTPQNLTP 60

Query: 61  DRVAAVETALHKSLLTSDTDEAWKSFKLLTRSSVFPCKPLTNSLIAHLSSIGDVHNLKRA 120
           D VA VETALHKSLLTSDTDEAWKSFKLLTRSS FP K LTNSLIAHLSSIGDVHNLKRA
Sbjct: 61  DGVAVVETALHKSLLTSDTDEAWKSFKLLTRSSAFPSKSLTNSLIAHLSSIGDVHNLKRA 120

Query: 121 FASVVFVIEKKPELLDFGSVKALLASMKCANTAAPALSLIKCMFKNRCFVPFSVWGNELV 180
           FASVVFVIEKKPELLDFGSVKALLASMKCANTAAPALSLIKCMFKNRCFVPFSVWG ELV
Sbjct: 121 FASVVFVIEKKPELLDFGSVKALLASMKCANTAAPALSLIKCMFKNRCFVPFSVWGKELV 180

Query: 181 DICRQNGSLIPFLRVFEENCRIALDESLDFMKPDLIACNAALEGCCYELESVTDAEKVVE 240
           DICRQ+GSLIPFLRVFEENCRIALDE LDF+KPDLIACNAALEGCC+ELESVTDAEKV+E
Sbjct: 181 DICRQSGSLIPFLRVFEENCRIALDERLDFLKPDLIACNAALEGCCHELESVTDAEKVIE 240

Query: 241 TMSLLYLRPDEVSFGALAYLYALKGLEQKIIELEVLMGNFGFNHKDLFFSSLISGYVHAS 300
           TMSLLYLRPDEVSFGALAYLYALKGL+QKIIELEVLMG+FGF  KDLFFS+L+SGYV+AS
Sbjct: 241 TMSLLYLRPDEVSFGALAYLYALKGLDQKIIELEVLMGSFGFTCKDLFFSNLVSGYVNAS 300

Query: 301 NFAVVSKTMLRSLKDECGAHVHFGEKTYLEMVKGFIQSGNLKELSALIVNAKNLESSSEV 360
           NFA VSKTMLRSLKDECG+HVHFGEKTYLEMVKGFIQSGNLKELSALI++A+NLESSS V
Sbjct: 301 NFAAVSKTMLRSLKDECGSHVHFGEKTYLEMVKGFIQSGNLKELSALIIDAQNLESSSAV 360

Query: 361 DGSIGFGIINACVNIGWLDKAHDILNEINSQGVSLGLGVYLPILKAYRKEHRTAEATRLI 420
           DGSIGFGIINACVNIGWLDKA  IL+E+NSQGVSLGLGVYLPILKAYRKEHRTA AT+LI
Sbjct: 361 DGSIGFGIINACVNIGWLDKAQYILDEMNSQGVSLGLGVYLPILKAYRKEHRTAAATQLI 420

Query: 421 MDISSSGLQLDAESYDALIEASMSNQDFQSAFALFRNMRETRKSDMKASYLTIMTGLMEN 480
           MDISSSG+QLDAE+YDALIEASMSNQDFQSAF LFR+MRETRKSD KASYLTIMTGLMEN
Sbjct: 421 MDISSSGIQLDAENYDALIEASMSNQDFQSAFTLFRSMRETRKSDTKASYLTIMTGLMEN 480

Query: 481 HRPELMAAFLDEVVEDPLVEVGTHDWNSIIHAFCKAGRLEDARRTFRRMKFLQFEPNEQT 540
           HRPELMAAFLDE+VEDPLVEVGTHDWNSIIHAFCKAGRLEDARRT+RRMKFLQFEPNEQT
Sbjct: 481 HRPELMAAFLDEIVEDPLVEVGTHDWNSIIHAFCKAGRLEDARRTYRRMKFLQFEPNEQT 540

Query: 541 FLSLINGYVSAERYFCVLMLWNELKWKITANGETGIKLDNNLVDAFLYALVKGGFFDAVM 600
           FLSLINGYVSAERYFCVLMLWNELKWK+T NGE+GIKLDNNLVDAFLYALVKGGFFDAVM
Sbjct: 541 FLSLINGYVSAERYFCVLMLWNELKWKVTPNGESGIKLDNNLVDAFLYALVKGGFFDAVM 600

Query: 601 QVVEKTKDTKIFVDKWKYKQAFMETHKKLKVAKLRRRNHRKMESLIAFKNWAGLNA 656
           QVVEKTKDTKIF+DKWKYKQAFMETHKKLKVAKLRRRN++KMESLIAFKNWAGLNA
Sbjct: 601 QVVEKTKDTKIFIDKWKYKQAFMETHKKLKVAKLRRRNYKKMESLIAFKNWAGLNA 656

BLAST of Cla97C05G105680 vs. TrEMBL
Match: tr|M5XK61|M5XK61_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_1G286800 PE=4 SV=1)

HSP 1 Score: 931.0 bits (2405), Expect = 1.4e-267
Identity = 463/656 (70.58%), Postives = 555/656 (84.60%), Query Frame = 0

Query: 1   MWKRVLCSIPHRSFSSAPETPSLYSFLQPSLFALKRTPFSPSQDSTDLRQDPTPQILTPD 60
           MW++ L  +PHR FSS PE P+LYSFLQPS+FALKR    PSQ S      P P+ L PD
Sbjct: 1   MWRKALTLLPHRPFSSTPEIPTLYSFLQPSVFALKR-DLPPSQKSHSDLPTPPPKTLAPD 60

Query: 61  RVAAVETALHKSLLTSDTDEAWKSFKLLTRSSVFPCKPLTNSLIAHLSSIGDVHNLKRAF 120
            +  +E  LHKSL+T +TDEAWKSFK LT SS FP K LTNSLI HLSS+GD+HNLKRAF
Sbjct: 61  HITTLEATLHKSLITHNTDEAWKSFKTLTGSSAFPSKSLTNSLITHLSSLGDIHNLKRAF 120

Query: 121 ASVVFVIEKKPELLDFGSVKALLASMKCANTAAPALSLIKCMFKNRCFVPFSVWGNELVD 180
           A+VV+V+EK P  LDF +V  LL +MKCANTAAPA +LIK +FKNR F+PFSVWGN L++
Sbjct: 121 ATVVYVVEKNPGFLDFETVGTLLDAMKCANTAAPAFALIKSVFKNRFFLPFSVWGNVLIE 180

Query: 181 ICRQNGSLIPFLRVFEENCRIALDESLDFMKPDLIACNAALEGCCYELESVTDAEKVVET 240
           I R+NG+ + FLRVFEENCRIALDE L+ MKPDL ACNAALEGCC ELESV+DAEKVVET
Sbjct: 181 ISRKNGNFVAFLRVFEENCRIALDEKLESMKPDLAACNAALEGCCRELESVSDAEKVVET 240

Query: 241 MSLLYLRPDEVSFGALAYLYALKGLEQKIIELEVLMGNFGFNHKDLFFSSLISGYVHASN 300
           M++L +RPDE SFG LAYLYALKGLE+KI ELE LMG FGF++K +F S+LI+GYV +  
Sbjct: 241 MAVLGVRPDESSFGFLAYLYALKGLEEKITELEGLMGGFGFSNKRVFQSNLINGYVKSGK 300

Query: 301 FAVVSKTMLRSLKDECGAHVHFGEKTYLEMVKGFIQSGNLKELSALIVNAKNLESSS-EV 360
              VS T+LR L++  G  ++ GE+TY E+VKG++ S ++KEL+ LI+ A+ LESS+  V
Sbjct: 301 LESVSATILRILREGDGDFLNLGEETYCEVVKGYLMSASVKELATLIIEAQKLESSTVVV 360

Query: 361 DGSIGFGIINACVNIGWLDKAHDILNEINSQGVSLGLGVYLPILKAYRKEHRTAEATRLI 420
           D S+G+GI+NACV+IG  DKAH IL+E+N+QG SLGLGVY+PILKAY KEHRTAEAT+L+
Sbjct: 361 DRSVGYGIVNACVHIGLSDKAHGILDEMNAQGGSLGLGVYVPILKAYCKEHRTAEATQLV 420

Query: 421 MDISSSGLQLDAESYDALIEASMSNQDFQSAFALFRNMRETRKSDMKASYLTIMTGLMEN 480
           MD+S+SGLQLD  +YDALIE+SMS+QDFQSAF+L+R+MRE R SD+K SYLTIMTGLMEN
Sbjct: 421 MDVSNSGLQLDTGTYDALIESSMSSQDFQSAFSLYRDMREARISDLKGSYLTIMTGLMEN 480

Query: 481 HRPELMAAFLDEVVEDPLVEVGTHDWNSIIHAFCKAGRLEDARRTFRRMKFLQFEPNEQT 540
           HRPELMAAFLDEVVEDP +EVGTHDWNSIIHAFCKAGRLEDARRTFRRM FLQ +PNEQT
Sbjct: 481 HRPELMAAFLDEVVEDPRIEVGTHDWNSIIHAFCKAGRLEDARRTFRRMIFLQHKPNEQT 540

Query: 541 FLSLINGYVSAERYFCVLMLWNELKWKITANGETGIKLDNNLVDAFLYALVKGGFFDAVM 600
           +LSLI+GYVS E+YFCVLMLW+E+K  ++ +GE GIK D+N+VDAFLYALVKGGFFDAVM
Sbjct: 541 YLSLISGYVSVEKYFCVLMLWHEVKRNVSVDGEKGIKFDHNMVDAFLYALVKGGFFDAVM 600

Query: 601 QVVEKTKDTKIFVDKWKYKQAFMETHKKLKVAKLRRRNHRKMESLIAFKNWAGLNA 656
           QVVEK+++ K+FVDKW+YKQAFMETHKKLKV+KLR+RN RKME+L+AFKNWAGLNA
Sbjct: 601 QVVEKSQEMKVFVDKWRYKQAFMETHKKLKVSKLRKRNFRKMEALVAFKNWAGLNA 655

BLAST of Cla97C05G105680 vs. TrEMBL
Match: tr|A0A2I4H895|A0A2I4H895_9ROSI (pentatricopeptide repeat-containing protein At1g69290 OS=Juglans regia OX=51240 GN=LOC109014373 PE=4 SV=1)

HSP 1 Score: 928.7 bits (2399), Expect = 7.2e-267
Identity = 463/655 (70.69%), Postives = 552/655 (84.27%), Query Frame = 0

Query: 1   MWKRVLCSIPHRSFSSAPETPSLYSFLQPSLFALKRTPFSPSQDSTDLRQDPTPQILTPD 60
           M +R L  +P R FSS PE PSLYSFLQPS+FALK+TP  P   S     D  P+ LTPD
Sbjct: 1   MLRRPLSFLPRRPFSSTPEVPSLYSFLQPSIFALKKTPSPPPPTS----PDQPPRALTPD 60

Query: 61  RVAAVETALHKSLLTSDTDEAWKSFKLLTRSSVFPCKPLTNSLIAHLSSIGDVHNLKRAF 120
            +A +ET LH+SLLTS+TDEAWKSFK LT ++VFP K LTNSL+ HLSS+ D+HNLKRAF
Sbjct: 61  HIATLETTLHQSLLTSNTDEAWKSFKALTSNTVFPSKSLTNSLVTHLSSLNDIHNLKRAF 120

Query: 121 ASVVFVIEKKPELLDFGSVKALLASMKCANTAAPALSLIKCMFKNRCFVPFSVWGNELVD 180
           ASVV++IEKKP+LLDF ++  LL++MKCANTAAPA +LIKCM KNR FVPF +WG+ LV+
Sbjct: 121 ASVVYLIEKKPKLLDFETLGNLLSAMKCANTAAPAFALIKCMLKNRYFVPFGLWGSALVE 180

Query: 181 ICRQNGSLIPFLRVFEENCRIALDESLDFMKPDLIACNAALEGCCYELESVTDAEKVVET 240
           I R+NG+ + FLRVFEENCRIALDE LDFMKPDL ACNAALEGCC ELESV+DAE VVET
Sbjct: 181 ISRKNGNFVAFLRVFEENCRIALDEKLDFMKPDLPACNAALEGCCRELESVSDAENVVET 240

Query: 241 MSLLYLRPDEVSFGALAYLYALKGLEQKIIELEVLMGNFGFNHKDLFFSSLISGYVHASN 300
           MS+L +RPDE SFG+LAYLYALKGL +KIIELE LM  FGF++K  FF +L+SGY+ + N
Sbjct: 241 MSILGIRPDESSFGSLAYLYALKGLGEKIIELEGLMDGFGFSNKSAFFINLVSGYIKSGN 300

Query: 301 FAVVSKTMLRSLKDECGAHVHFGEKTYLEMVKGFIQSGNLKELSALIVNAKNLESSSE-V 360
              VS T+L   +D       FGE+TY E+V GF+ +G++K L+ LI+ A+ LE SS  V
Sbjct: 301 LESVSATILHCCEDS-----KFGEETYREVVNGFLNNGSVKSLATLIIEAQKLEPSSVLV 360

Query: 361 DGSIGFGIINACVNIGWLDKAHDILNEINSQGVSLGLGVYLPILKAYRKEHRTAEATRLI 420
           D S+G+GI+NACVN+G  DKAH IL E+++QG S+GLGVY+PILKAYRKEHRTAEAT+L+
Sbjct: 361 DRSVGYGIVNACVNLGLSDKAHSILEEMDAQGGSVGLGVYVPILKAYRKEHRTAEATQLV 420

Query: 421 MDISSSGLQLDAESYDALIEASMSNQDFQSAFALFRNMRETRKSDMKASYLTIMTGLMEN 480
           M+I++SG+QLD E+YD+LIEASMS+QDFQSAF+LFR+MRE R  D+K SYLTIMTGLMEN
Sbjct: 421 MEITNSGIQLDVETYDSLIEASMSSQDFQSAFSLFRDMREARIPDLKGSYLTIMTGLMEN 480

Query: 481 HRPELMAAFLDEVVEDPLVEVGTHDWNSIIHAFCKAGRLEDARRTFRRMKFLQFEPNEQT 540
           HRPELMAAFLDEVVEDP +EV THDWNSIIHAFCKAGRLEDARRTFRRM FLQFEPN+QT
Sbjct: 481 HRPELMAAFLDEVVEDPRIEVATHDWNSIIHAFCKAGRLEDARRTFRRMIFLQFEPNDQT 540

Query: 541 FLSLINGYVSAERYFCVLMLWNELKWKITANGETGIKLDNNLVDAFLYALVKGGFFDAVM 600
           +LS INGYV+ E+YF VLMLWNE+K K++ +G+ G+K D+NLVDAFLYALVKGGFFDAVM
Sbjct: 541 YLSQINGYVTTEKYFSVLMLWNEVKRKVSNDGQKGVKFDHNLVDAFLYALVKGGFFDAVM 600

Query: 601 QVVEKTKDTKIFVDKWKYKQAFMETHKKLKVAKLRRRNHRKMESLIAFKNWAGLN 655
           QVVEKT++ KIFVDKW+YKQAFMETHKKLKVAKLR+RN RKME+L+AFKNWAGLN
Sbjct: 601 QVVEKTQEMKIFVDKWRYKQAFMETHKKLKVAKLRKRNFRKMEALVAFKNWAGLN 646

BLAST of Cla97C05G105680 vs. TrEMBL
Match: tr|A0A2P6R2Q5|A0A2P6R2Q5_ROSCH (Putative pentatricopeptide OS=Rosa chinensis OX=74649 GN=RchiOBHm_Chr4g0438351 PE=4 SV=1)

HSP 1 Score: 916.8 bits (2368), Expect = 2.8e-263
Identity = 457/656 (69.66%), Postives = 554/656 (84.45%), Query Frame = 0

Query: 1   MWKRVLCSIPHRSFSSAPETPSLYSFLQPSLFALKRTPFSPSQDSTDLRQDPTPQILTPD 60
           MW++    +  R FSS+PE P+LYSFLQPS+FALK  P S S  S DL   P P+ LTPD
Sbjct: 1   MWRKAFTLLQRRPFSSSPEIPTLYSFLQPSIFALKNPPSSSSSHS-DLPTSP-PKTLTPD 60

Query: 61  RVAAVETALHKSLLTSDTDEAWKSFKLLTRSSVFPCKPLTNSLIAHLSSIGDVHNLKRAF 120
            V  ++T LHKSLLT +TDEAWKSFK LT SSVFP K LTNSLI HL+S+GD+HNLKRAF
Sbjct: 61  HVTTLQTTLHKSLLTHNTDEAWKSFKSLTGSSVFPSKSLTNSLITHLASLGDIHNLKRAF 120

Query: 121 ASVVFVIEKKPELLDFGSVKALLASMKCANTAAPALSLIKCMFKNRCFVPFSVWGNELVD 180
           ASVV+V+EK PELL+F +V ++L +MKCANTAAPA +LI+CMFKNR F+PFSVWG+ +V+
Sbjct: 121 ASVVYVVEKSPELLEFETVGSVLGAMKCANTAAPAFALIQCMFKNRFFLPFSVWGSVVVE 180

Query: 181 ICRQNGSLIPFLRVFEENCRIALDESLDFMKPDLIACNAALEGCCYELESVTDAEKVVET 240
           I R+NG+   FLRVFEENCR+ALDE ++FMKPDL ACNAALEGCC ELESV+ AEKVVET
Sbjct: 181 ISRKNGNFAAFLRVFEENCRVALDEKMEFMKPDLAACNAALEGCCCELESVSGAEKVVET 240

Query: 241 MSLLYLRPDEVSFGALAYLYALKGLEQKIIELEVLMGNFGFNHKDLFFSSLISGYVHASN 300
           M++L +RPDE SFG LAYLYALKGL +KI ELE LMG FGF+ K +F ++LI+GYV +  
Sbjct: 241 MAVLGVRPDESSFGFLAYLYALKGLGEKISELEGLMGGFGFSDKRVFRNNLINGYVKSGK 300

Query: 301 FAVVSKTMLRSLKDECGAHVHFGEKTYLEMVKGFIQSGNLKELSALIVNAKNLESSS-EV 360
              VS T+L+ L++  G  +    +TY  +VKGF+ +G +KEL+ LI+ A+NLESS+  V
Sbjct: 301 LEFVSATILQGLREGDGECLDLDGETYCRVVKGFLDNGKVKELATLIIEAQNLESSTVVV 360

Query: 361 DGSIGFGIINACVNIGWLDKAHDILNEINSQGVSLGLGVYLPILKAYRKEHRTAEATRLI 420
           D S+G+GI+NACV IG  DKAH IL+E+N+QG +LGLGV++PILKAY KEHRTAEAT+L+
Sbjct: 361 DRSVGYGIVNACVGIGLSDKAHSILDEMNAQGGTLGLGVHVPILKAYCKEHRTAEATQLV 420

Query: 421 MDISSSGLQLDAESYDALIEASMSNQDFQSAFALFRNMRETRKSDMKASYLTIMTGLMEN 480
           MDIS+SGL+LD E+YD LIEASMS+QDFQSAF+LFR+MRE R  D+K SYLTIMTGLMEN
Sbjct: 421 MDISNSGLKLDMETYDTLIEASMSSQDFQSAFSLFRDMREARTPDLKGSYLTIMTGLMEN 480

Query: 481 HRPELMAAFLDEVVEDPLVEVGTHDWNSIIHAFCKAGRLEDARRTFRRMKFLQFEPNEQT 540
           HRPELMAAFLDEVVEDP +EVGTHDWNSIIHAFCKAGRLEDARRTFRRM FLQ++PN+QT
Sbjct: 481 HRPELMAAFLDEVVEDPRIEVGTHDWNSIIHAFCKAGRLEDARRTFRRMIFLQYKPNDQT 540

Query: 541 FLSLINGYVSAERYFCVLMLWNELKWKITANGETGIKLDNNLVDAFLYALVKGGFFDAVM 600
           +LSLI+GYVS E+YFCVLMLW+E+K  I+ +GE G+K D+N+VDAFLYALVKGGFFDAVM
Sbjct: 541 YLSLISGYVSVEKYFCVLMLWHEVKRNISVDGERGLKFDHNMVDAFLYALVKGGFFDAVM 600

Query: 601 QVVEKTKDTKIFVDKWKYKQAFMETHKKLKVAKLRRRNHRKMESLIAFKNWAGLNA 656
           QVVEK+++ KIFVDKW+YKQAFMETHKKLKV+KLR+R+ RKME+L+AFKNWAGLNA
Sbjct: 601 QVVEKSQEMKIFVDKWRYKQAFMETHKKLKVSKLRKRSFRKMEALVAFKNWAGLNA 654

BLAST of Cla97C05G105680 vs. Swiss-Prot
Match: sp|P0C7R4|PP110_ARATH (Pentatricopeptide repeat-containing protein At1g69290 OS=Arabidopsis thaliana OX=3702 GN=At1g69290 PE=2 SV=1)

HSP 1 Score: 802.4 bits (2071), Expect = 3.8e-231
Identity = 420/663 (63.35%), Postives = 521/663 (78.58%), Query Frame = 0

Query: 1   MWKRVLCSIPHRSF-SSAPETPSLYSFLQPSLFALKRTPFSPSQDSTDLRQDPTPQILTP 60
           M+++ L SI  R F SS+PE+PSLYSFL+PSLF+ K    SPS     L     P+ LTP
Sbjct: 1   MFRKTLNSISRRHFSSSSPESPSLYSFLKPSLFSHKPITLSPS-----LSPPQNPKTLTP 60

Query: 61  DRVAAVETALHKSLLTSDTDEAWKSFKLLTRSSVFPCKPLTNSLIAHLSSI-----GDVH 120
           D+ ++ E+ LH SL    TDEAWK+F+ LT +S  P K L NSLI HLS +        H
Sbjct: 61  DQKSSFESTLHDSLNAHYTDEAWKAFRSLTAASSLPEKRLINSLITHLSGVEGSGESISH 120

Query: 121 NLKRAFASVVFVIEKKPELLDFGSVKALLASMKCANTAAPALSLIKCMFKNRCFVPFSVW 180
            LKRAFAS  +VIEK P LL+F +V+ LL SMK A  A PAL+L+KCMFKNR FVPF +W
Sbjct: 121 RLKRAFASAAYVIEKDPILLEFETVRTLLESMKLAKAAGPALALVKCMFKNRYFVPFDLW 180

Query: 181 GNELVDICRQNGSLIPFLRVFEENCRIALDESLDFMKPDLIACNAALEGCCYELESVTDA 240
           G+ ++DICR+NGSL PFL+VF+E+CRI++DE L+FMKPDL+A NAALE CC ++ES+ DA
Sbjct: 181 GHLVIDICRENGSLAPFLKVFKESCRISVDEKLEFMKPDLVASNAALEACCRQMESLADA 240

Query: 241 EKVVETMSLLYLRPDEVSFGALAYLYALKGLEQKIIELEVLMGNFGFNHKDLFFSSLISG 300
           E V+E+M++L ++PDE+SFG LAYLYA KGL +KI ELE LM  FGF  + + +S++ISG
Sbjct: 241 ENVIESMAVLGVKPDELSFGFLAYLYARKGLREKISELENLMDGFGFASRRILYSNMISG 300

Query: 301 YVHASNFAVVSKTMLRSLKDECGAHVHFGEKTYLEMVKGFIQSGNLKELSALIVNAKNLE 360
           YV + +   VS  +L SLK E G    F  +TY E+VKGFI+S ++K L+ +I+ A+ LE
Sbjct: 301 YVKSGDLDSVSDVILHSLK-EGGEESSFSVETYCELVKGFIESKSVKSLAKVILEAQKLE 360

Query: 361 SS-SEVDGSIGFGIINACVNIGWLDKAHDILNEINSQ-GVSLGLGVYLPILKAYRKEHRT 420
           SS   VD S+GFGIINACVN+G+ DKAH IL E+ +Q G S+G+GVY+PILKAY KE+RT
Sbjct: 361 SSYVGVDSSVGFGIINACVNLGFSDKAHSILEEMIAQGGGSVGIGVYVPILKAYCKEYRT 420

Query: 421 AEATRLIMDISSSGLQLDAESYDALIEASMSNQDFQSAFALFRNMRETRKSDMKASYLTI 480
           AEAT+L+ +ISSSGLQLD E  +ALIEASM+NQDF SAF LFR+MRE R  D+K SYLTI
Sbjct: 421 AEATQLVTEISSSGLQLDVEISNALIEASMTNQDFISAFTLFRDMRENRVVDLKGSYLTI 480

Query: 481 MTGLMENHRPELMAAFLDEVVEDPLVEVGTHDWNSIIHAFCKAGRLEDARRTFRRMKFLQ 540
           MTGL+EN RPELMAAFLDEVVEDP VEV +HDWNSIIHAFCK+GRLEDARRTFRRM FL+
Sbjct: 481 MTGLLENQRPELMAAFLDEVVEDPRVEVNSHDWNSIIHAFCKSGRLEDARRTFRRMVFLR 540

Query: 541 FEPNEQTFLSLINGYVSAERYFCVLMLWNELKWKITA-NGETGIKLDNNLVDAFLYALVK 600
           +EPN QT+LSLINGYVS E+YF VL+LWNE+K KI++   E   +LD+ LVDAFLYALVK
Sbjct: 541 YEPNNQTYLSLINGYVSGEKYFNVLLLWNEIKGKISSVEAEKRSRLDHALVDAFLYALVK 600

Query: 601 GGFFDAVMQVVEKTKDTKIFVDKWKYKQAFMETHKKLKVAKLRRRNHRKMESLIAFKNWA 655
           GGFFDA MQVVEK+++ KIFVDKW+YKQAFMETHKKL++ KLR+RN++KMESL+AFKNWA
Sbjct: 601 GGFFDAAMQVVEKSQEMKIFVDKWRYKQAFMETHKKLRLPKLRKRNYKKMESLVAFKNWA 657

BLAST of Cla97C05G105680 vs. Swiss-Prot
Match: sp|Q9CAA5|PP109_ARATH (Pentatricopeptide repeat-containing protein At1g68980, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g68980 PE=2 SV=1)

HSP 1 Score: 681.8 bits (1758), Expect = 7.6e-195
Identity = 356/630 (56.51%), Postives = 463/630 (73.49%), Query Frame = 0

Query: 30  SLFALKRTPFSPSQDSTDLRQDPTPQILTPDRVAAVETALHKSLLTSDTDEAWKSFKLLT 89
           +L +L+R PFS     T          LTP + ++ E+ LH SL+T DTD+AWK F+   
Sbjct: 7   TLISLRR-PFSSIPSKT----------LTPHQKSSFESTLHHSLITHDTDQAWKVFRSFA 66

Query: 90  RSSVFPCKPLTNSLIAHLSSIGDV-------HNLKRAFASVVFVIEKKPELLDFGSVKAL 149
            +S  P K L NSLI HLSS  +        H LKRAF S  +VIEK P LL+F +V+ +
Sbjct: 67  AASSLPDKRLLNSLITHLSSFHNTDQNTSLRHRLKRAFVSTTYVIEKDPILLEFETVRTV 126

Query: 150 LASMKCANTAAPALSLIKCMFKNRCFVPFSVWGNELVDICRQNGSLIPFLRVFEENCRIA 209
           L SMK A  + PAL+L++CMFKNR FVPF +WG+ L+D+CR+NGSL  FL+VF E+CRIA
Sbjct: 127 LESMKLAKASGPALALVECMFKNRYFVPFDLWGDLLIDVCRENGSLAAFLKVFRESCRIA 186

Query: 210 LDESLDFMKPDLIACNAALEGCCYELESVTDAEKVVETMSLLYLRPDEVSFGALAYLYAL 269
           +DE LDFMKPDL+A NAALE CC ++ES+ DAE ++E+M +L ++PDE+SFG LAYLYA 
Sbjct: 187 VDEKLDFMKPDLVASNAALEACCRQMESLADAENLIESMDVLGVKPDELSFGFLAYLYAR 246

Query: 270 KGLEQKIIELEVLMGNFGFNHKDLFFSSLISGYVHASNFAVVSKTMLRSLKDECGAHVHF 329
           KGL +KI ELE LM   GF  + + +SS+ISGYV + +    S  +L SLK   G    F
Sbjct: 247 KGLREKISELEDLMDGLGFASRRILYSSMISGYVKSGDLDSASDVILCSLKG-VGEASSF 306

Query: 330 GEKTYLEMVKGFIQSGNLKELSALIVNAKNLES-SSEVDGSIGFGIINACVNIGWLDKAH 389
            E+TY E+V+GFI+S +++ L+ LI+ A+ LES S++V GS+GFGI+NACV +G+  K+ 
Sbjct: 307 SEETYCELVRGFIESKSVESLAKLIIEAQKLESMSTDVGGSVGFGIVNACVKLGFSGKS- 366

Query: 390 DILNEINSQGVSLGLGVYLPILKAYRKEHRTAEATRLIMDISSSGLQLDAESYDALIEAS 449
            IL+E+N+QG S G+GVY+PILKAY KE RT+EAT+L+ +ISSSGLQLD E+Y+ +IEAS
Sbjct: 367 -ILDELNAQGGSGGIGVYVPILKAYCKEGRTSEATQLVTEISSSGLQLDVETYNTMIEAS 426

Query: 450 MSNQDFQSAFALFRNMRETRKSDMKASYLTIMTGLMENHRPELMAAFLDEVVEDPLVEVG 509
           M+  DF SA  LFR+MRETR +D+K  YLTIMTGL+EN RPELMA F++EV+EDP VEV 
Sbjct: 427 MTKHDFLSALTLFRDMRETRVADLKRCYLTIMTGLLENQRPELMAEFVEEVMEDPRVEVK 486

Query: 510 THDWNSIIHAFCKAGRLEDARRTFRRMKFLQFEPNEQTFLSLINGYVSAERYFCVLMLWN 569
           +HDWNSIIHAFCK+GRL DA+ TFRRM FLQ+EPN QT+LSLINGYVS E+YF V+++W 
Sbjct: 487 SHDWNSIIHAFCKSGRLGDAKSTFRRMTFLQYEPNNQTYLSLINGYVSCEKYFEVVVIWK 546

Query: 570 ELKWKITANGETGIKLDNNLVDAFLYALVKGGFFDAVMQVVEKTKDTKIFVDKWKYKQAF 629
           E K       +   KL++ L DAFL ALVKGGFF   +QV+EK ++ KIFVDKW+YK  F
Sbjct: 547 EFK-------DKKAKLEHALADAFLNALVKGGFFGTALQVIEKCQEMKIFVDKWRYKATF 606

Query: 630 METHKKLKVAKLRRRNHRKMESLIAFKNWA 652
           MET K L++ KLR+R  +K+E L AFKNWA
Sbjct: 607 METQKNLRLPKLRKRKMKKIEFLDAFKNWA 615

BLAST of Cla97C05G105680 vs. Swiss-Prot
Match: sp|Q9SA60|PPR6_ARATH (Pentatricopeptide repeat-containing protein At1g03100, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g03100 PE=2 SV=1)

HSP 1 Score: 156.0 bits (393), Expect = 1.4e-36
Identity = 100/339 (29.50%), Postives = 160/339 (47.20%), Query Frame = 0

Query: 324 EKTYLEMVKGFIQSGNLKELSALIVNAKNLESSSEVDGSIGFGIINACVNIGWLDKAHDI 383
           E+ Y+++ K F++SG +KEL+  ++ A++ +S    D S+   +INAC+++G LD+AHD+
Sbjct: 459 EEIYVKLAKAFLESGKMKELAKFLLKAEHEDSPVSSDNSMLINVINACISLGMLDQAHDL 518

Query: 384 LNEINSQGVSLGLGVYLPILKAYRKEHRTAEATRLIMDISSSGLQLDAESYDALIEASMS 443
           L+E+   GV                                                   
Sbjct: 519 LDEMRMAGVRTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 578

Query: 444 NQDFQSAFALFRNMRE---TRKSDMKASYLTIMTGLMENHRPELMAAFLDEVVEDPLVEV 503
                               R  + K  +  ++ G   N    LM+  L E+ E   ++ 
Sbjct: 579 XXXXXXXXXXXXXXXXXXILRGGNQK--FEKLLKGCEGNAEAGLMSKLLREIREVQSLDA 638

Query: 504 GTHDWNSIIHAFCKAGRLEDARRTFRRMKFLQFEPNEQTFLSLINGYVS-AERYFCVLML 563
           G HDWN++IH F K G ++DA +  +RM+ L   PN QTF S++ GY +   +Y  V  L
Sbjct: 639 GVHDWNNVIHFFSKKGLMQDAEKALKRMRSLGHSPNAQTFHSMVTGYAAIGSKYTEVTEL 698

Query: 564 WNELKWKITANGETGIKLDNNLVDAFLYALVKGGFFDAVMQVVEKTKDTKIFVDKWKYKQ 623
           W E+  K  A   + +K D  L+DA LY  V+GGFF    +VVE  +   +FVDK+KY+ 
Sbjct: 699 WGEM--KSIAAATSSMKFDQELLDAVLYTFVRGGFFSRANEVVEMMEKKNMFVDKYKYRM 758

Query: 624 AFMETHK---KLKVAKLRRRNH-RKMESLIAFKNWAGLN 655
            F++ HK   K K  K++  +  +K E+ + FK W GL+
Sbjct: 759 LFLKYHKTAYKGKAPKVQSESQLKKREAGLVFKKWLGLS 793

BLAST of Cla97C05G105680 vs. Swiss-Prot
Match: sp|Q9SF38|PP222_ARATH (Pentatricopeptide repeat-containing protein At3g09650, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=HCF152 PE=2 SV=1)

HSP 1 Score: 148.7 bits (374), Expect = 2.3e-34
Identity = 97/349 (27.79%), Postives = 171/349 (49.00%), Query Frame = 0

Query: 325 KTYLEMVKGFIQSGNLKELSALIVNAKNLES-SSEVDGSIGFGIINACVNIGWLDKAHDI 384
           + Y  ++KG++++G + + + ++   +  +  +S  D      +++A VN G +D+A  +
Sbjct: 415 RIYTTLMKGYMKNGRVADTARMLEAMRRQDDRNSHPDEVTYTTVVSAFVNAGLMDRARQV 474

Query: 385 LNEINSQGVSLGLGVYLPILKAYRKEHRTAEATRLIMDIS-SSGLQLDAESYDALIEASM 444
           L E+   GV      Y  +LK Y K+ +   A  L+ +++  +G++ D  SY+ +I+  +
Sbjct: 475 LAEMARMGVPANRITYNVLLKGYCKQLQIDRAEDLLREMTEDAGIEPDVVSYNIIIDGCI 534

Query: 445 SNQDFQSAFALFRNMRETRKSDMKASYLTIMTGLMENHRPELMAAFLDEVVEDPLVEVGT 504
              D   A A F  MR    +  K SY T+M     + +P+L     DE++ DP V+V  
Sbjct: 535 LIDDSAGALAFFNEMRTRGIAPTKISYTTLMKAFAMSGQPKLANRVFDEMMNDPRVKVDL 594

Query: 505 HDWNSIIHAFCKAGRLEDARRTFRRMKFLQFEPNEQTFLSLINGYVSAERYFCVLMLWNE 564
             WN ++  +C+ G +EDA+R   RMK   F PN  T+ SL NG   A +    L+LW E
Sbjct: 595 IAWNMLVEGYCRLGLIEDAQRVVSRMKENGFYPNVATYGSLANGVSQARKPGDALLLWKE 654

Query: 565 LKWKITANGETG------------IKLDNNLVDAFLYALVKGGFFDAVMQVVEKTKDTKI 624
           +K +     +              +K D  L+D      V+  FF   ++++   ++  I
Sbjct: 655 IKERCAVKKKEAPSDSSSDPAPPMLKPDEGLLDTLADICVRAAFFKKALEIIACMEENGI 714

Query: 625 FVDKWKYKQAFMETHKKL------KVAKLRRRNHRKMESLIAFKNWAGL 654
             +K KYK+ ++E H ++        A++ RR  RK  +  AFK W GL
Sbjct: 715 PPNKTKYKKIYVEMHSRMFTSKHASQARIDRRVERK-RAAEAFKFWLGL 762

BLAST of Cla97C05G105680 vs. Swiss-Prot
Match: sp|B3H672|PP317_ARATH (Pentatricopeptide repeat-containing protein At4g17616 OS=Arabidopsis thaliana OX=3702 GN=At4g17616 PE=2 SV=1)

HSP 1 Score: 124.4 bits (311), Expect = 4.7e-27
Identity = 130/626 (20.77%), Postives = 242/626 (38.66%), Query Frame = 0

Query: 69  LHKSLLTSDTDEAWKSFKLLTRSSVFPCKPLTNSLIAHLSSIGDVHNLKRAFASVVFVIE 128
           L  +L     D+AW  FK   R   FP   + N  +  LS   D   L +A       ++
Sbjct: 61  LETALKDHRVDDAWDVFKDFKRLYGFPESVIMNRFVTVLSYSSDAGWLCKASDLTRLALK 120

Query: 129 KKPELLDFGSVKALLASMKCANTAAPALSLIKCMFKNRCFVPFSV-------WGNELVDI 188
           + P +L    +  L  S+  A     A S+++ M +    +   V            +  
Sbjct: 121 QNPGMLSGDVLTKLSLSLARAQMVESACSILRIMLEKGYVLTSDVLRLVVMHMVKTEIGT 180

Query: 189 CRQNGSLIPFL-RVFEENCRIALDESLDFMKPDLIACNAALEGCCYELESVTDAEKVVET 248
           C  +  L+    R  E N         + +KPD +  N  L G C         ++++E 
Sbjct: 181 CLASNYLVQVCDRFVEFNVGKRNSSPGNVVKPDTVLFNLVL-GSCVRFGFSLKGQELIEL 240

Query: 249 MSLLYLRPDEVSFGALAYLYALKGLEQKIIELEVLMGNFG---FNHKDLFFSSLISGYVH 308
           M+ + +  D  S   ++ +Y + G+  ++ + +  +G        H   FF +L+S    
Sbjct: 241 MAKVDVVADAYSIVIMSCIYEMNGMRDELRKFKEHIGQVPPQLLGHYQHFFDNLLSLEFK 300

Query: 309 ASNFAVVSKTMLRSLKDE--------------------------CGAHVHFG-------- 368
             +     +  L   K +                           G  +H          
Sbjct: 301 FDDIGSAGRLALDMCKSKVLVSVENLGFDSEKPRVLPVGSHHIRSGLKIHISPKLLQRDS 360

Query: 369 ---------------------EKTYLEMVKGFIQSGNLKELSALIVNAKNLESSSEVDGS 428
                                 KT  ++V G+ +  NL ELS L+ +       ++V   
Sbjct: 361 SLGVDTEATFVNYSNSKLGITNKTLAKLVYGYKRHDNLPELSKLLFSLGGSRLCADV--- 420

Query: 429 IGFGIINACVNIGWLDKAHDILNEINSQGVSLGLGVYLPILKAYRKEHRTAEATRLIMDI 488
                I+ACV IGWL+ AHDIL+++NS G  + L  Y  +L  Y K      A  L+  +
Sbjct: 421 -----IDACVAIGWLEAAHDILDDMNSAGYPMELATYRMVLSGYYKSKMLRNAEVLLKQM 480

Query: 489 SSSGLQLDAESYDALIEASMSNQDFQSAFALFRNMRETRKSDMKASYLTIMTGLMENHRP 548
           + +GL  D  S + ++      +D ++       +R+    ++ A          +   P
Sbjct: 481 TKAGLITD-PSNEIVVSPETEEKDSENT-----ELRDLLVQEINAG--------KQMKAP 540

Query: 549 ELMAAFLDEVVEDPLVEVGTHDWNSIIHAFCKAGRLEDARRTFRRMKFLQFEPNEQTFLS 608
            ++                 ++ NS ++ FCKA    DA  T+R++  ++  P  Q+F  
Sbjct: 541 SML-----------------YELNSSLYYFCKAKMQGDALITYRKIPKMKIPPTVQSFWI 600

Query: 609 LINGYVSAERYFCVLMLWNELKWKITANGETGIKLDNNLVDAFLYALVKGGFFDAVMQVV 629
           LI+ Y S   Y  + ++W ++K  I +     +K   +L++  +   ++GG+F+ VM+++
Sbjct: 601 LIDMYSSLGMYREITIVWGDIKRNIASK---NLKTTQDLLEKLVVNFLRGGYFERVMELI 643

BLAST of Cla97C05G105680 vs. TAIR10
Match: AT1G69290.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 802.4 bits (2071), Expect = 2.1e-232
Identity = 420/663 (63.35%), Postives = 521/663 (78.58%), Query Frame = 0

Query: 1   MWKRVLCSIPHRSF-SSAPETPSLYSFLQPSLFALKRTPFSPSQDSTDLRQDPTPQILTP 60
           M+++ L SI  R F SS+PE+PSLYSFL+PSLF+ K    SPS     L     P+ LTP
Sbjct: 1   MFRKTLNSISRRHFSSSSPESPSLYSFLKPSLFSHKPITLSPS-----LSPPQNPKTLTP 60

Query: 61  DRVAAVETALHKSLLTSDTDEAWKSFKLLTRSSVFPCKPLTNSLIAHLSSI-----GDVH 120
           D+ ++ E+ LH SL    TDEAWK+F+ LT +S  P K L NSLI HLS +        H
Sbjct: 61  DQKSSFESTLHDSLNAHYTDEAWKAFRSLTAASSLPEKRLINSLITHLSGVEGSGESISH 120

Query: 121 NLKRAFASVVFVIEKKPELLDFGSVKALLASMKCANTAAPALSLIKCMFKNRCFVPFSVW 180
            LKRAFAS  +VIEK P LL+F +V+ LL SMK A  A PAL+L+KCMFKNR FVPF +W
Sbjct: 121 RLKRAFASAAYVIEKDPILLEFETVRTLLESMKLAKAAGPALALVKCMFKNRYFVPFDLW 180

Query: 181 GNELVDICRQNGSLIPFLRVFEENCRIALDESLDFMKPDLIACNAALEGCCYELESVTDA 240
           G+ ++DICR+NGSL PFL+VF+E+CRI++DE L+FMKPDL+A NAALE CC ++ES+ DA
Sbjct: 181 GHLVIDICRENGSLAPFLKVFKESCRISVDEKLEFMKPDLVASNAALEACCRQMESLADA 240

Query: 241 EKVVETMSLLYLRPDEVSFGALAYLYALKGLEQKIIELEVLMGNFGFNHKDLFFSSLISG 300
           E V+E+M++L ++PDE+SFG LAYLYA KGL +KI ELE LM  FGF  + + +S++ISG
Sbjct: 241 ENVIESMAVLGVKPDELSFGFLAYLYARKGLREKISELENLMDGFGFASRRILYSNMISG 300

Query: 301 YVHASNFAVVSKTMLRSLKDECGAHVHFGEKTYLEMVKGFIQSGNLKELSALIVNAKNLE 360
           YV + +   VS  +L SLK E G    F  +TY E+VKGFI+S ++K L+ +I+ A+ LE
Sbjct: 301 YVKSGDLDSVSDVILHSLK-EGGEESSFSVETYCELVKGFIESKSVKSLAKVILEAQKLE 360

Query: 361 SS-SEVDGSIGFGIINACVNIGWLDKAHDILNEINSQ-GVSLGLGVYLPILKAYRKEHRT 420
           SS   VD S+GFGIINACVN+G+ DKAH IL E+ +Q G S+G+GVY+PILKAY KE+RT
Sbjct: 361 SSYVGVDSSVGFGIINACVNLGFSDKAHSILEEMIAQGGGSVGIGVYVPILKAYCKEYRT 420

Query: 421 AEATRLIMDISSSGLQLDAESYDALIEASMSNQDFQSAFALFRNMRETRKSDMKASYLTI 480
           AEAT+L+ +ISSSGLQLD E  +ALIEASM+NQDF SAF LFR+MRE R  D+K SYLTI
Sbjct: 421 AEATQLVTEISSSGLQLDVEISNALIEASMTNQDFISAFTLFRDMRENRVVDLKGSYLTI 480

Query: 481 MTGLMENHRPELMAAFLDEVVEDPLVEVGTHDWNSIIHAFCKAGRLEDARRTFRRMKFLQ 540
           MTGL+EN RPELMAAFLDEVVEDP VEV +HDWNSIIHAFCK+GRLEDARRTFRRM FL+
Sbjct: 481 MTGLLENQRPELMAAFLDEVVEDPRVEVNSHDWNSIIHAFCKSGRLEDARRTFRRMVFLR 540

Query: 541 FEPNEQTFLSLINGYVSAERYFCVLMLWNELKWKITA-NGETGIKLDNNLVDAFLYALVK 600
           +EPN QT+LSLINGYVS E+YF VL+LWNE+K KI++   E   +LD+ LVDAFLYALVK
Sbjct: 541 YEPNNQTYLSLINGYVSGEKYFNVLLLWNEIKGKISSVEAEKRSRLDHALVDAFLYALVK 600

Query: 601 GGFFDAVMQVVEKTKDTKIFVDKWKYKQAFMETHKKLKVAKLRRRNHRKMESLIAFKNWA 655
           GGFFDA MQVVEK+++ KIFVDKW+YKQAFMETHKKL++ KLR+RN++KMESL+AFKNWA
Sbjct: 601 GGFFDAAMQVVEKSQEMKIFVDKWRYKQAFMETHKKLRLPKLRKRNYKKMESLVAFKNWA 657

BLAST of Cla97C05G105680 vs. TAIR10
Match: AT1G68980.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 681.8 bits (1758), Expect = 4.2e-196
Identity = 356/630 (56.51%), Postives = 463/630 (73.49%), Query Frame = 0

Query: 30  SLFALKRTPFSPSQDSTDLRQDPTPQILTPDRVAAVETALHKSLLTSDTDEAWKSFKLLT 89
           +L +L+R PFS     T          LTP + ++ E+ LH SL+T DTD+AWK F+   
Sbjct: 7   TLISLRR-PFSSIPSKT----------LTPHQKSSFESTLHHSLITHDTDQAWKVFRSFA 66

Query: 90  RSSVFPCKPLTNSLIAHLSSIGDV-------HNLKRAFASVVFVIEKKPELLDFGSVKAL 149
            +S  P K L NSLI HLSS  +        H LKRAF S  +VIEK P LL+F +V+ +
Sbjct: 67  AASSLPDKRLLNSLITHLSSFHNTDQNTSLRHRLKRAFVSTTYVIEKDPILLEFETVRTV 126

Query: 150 LASMKCANTAAPALSLIKCMFKNRCFVPFSVWGNELVDICRQNGSLIPFLRVFEENCRIA 209
           L SMK A  + PAL+L++CMFKNR FVPF +WG+ L+D+CR+NGSL  FL+VF E+CRIA
Sbjct: 127 LESMKLAKASGPALALVECMFKNRYFVPFDLWGDLLIDVCRENGSLAAFLKVFRESCRIA 186

Query: 210 LDESLDFMKPDLIACNAALEGCCYELESVTDAEKVVETMSLLYLRPDEVSFGALAYLYAL 269
           +DE LDFMKPDL+A NAALE CC ++ES+ DAE ++E+M +L ++PDE+SFG LAYLYA 
Sbjct: 187 VDEKLDFMKPDLVASNAALEACCRQMESLADAENLIESMDVLGVKPDELSFGFLAYLYAR 246

Query: 270 KGLEQKIIELEVLMGNFGFNHKDLFFSSLISGYVHASNFAVVSKTMLRSLKDECGAHVHF 329
           KGL +KI ELE LM   GF  + + +SS+ISGYV + +    S  +L SLK   G    F
Sbjct: 247 KGLREKISELEDLMDGLGFASRRILYSSMISGYVKSGDLDSASDVILCSLKG-VGEASSF 306

Query: 330 GEKTYLEMVKGFIQSGNLKELSALIVNAKNLES-SSEVDGSIGFGIINACVNIGWLDKAH 389
            E+TY E+V+GFI+S +++ L+ LI+ A+ LES S++V GS+GFGI+NACV +G+  K+ 
Sbjct: 307 SEETYCELVRGFIESKSVESLAKLIIEAQKLESMSTDVGGSVGFGIVNACVKLGFSGKS- 366

Query: 390 DILNEINSQGVSLGLGVYLPILKAYRKEHRTAEATRLIMDISSSGLQLDAESYDALIEAS 449
            IL+E+N+QG S G+GVY+PILKAY KE RT+EAT+L+ +ISSSGLQLD E+Y+ +IEAS
Sbjct: 367 -ILDELNAQGGSGGIGVYVPILKAYCKEGRTSEATQLVTEISSSGLQLDVETYNTMIEAS 426

Query: 450 MSNQDFQSAFALFRNMRETRKSDMKASYLTIMTGLMENHRPELMAAFLDEVVEDPLVEVG 509
           M+  DF SA  LFR+MRETR +D+K  YLTIMTGL+EN RPELMA F++EV+EDP VEV 
Sbjct: 427 MTKHDFLSALTLFRDMRETRVADLKRCYLTIMTGLLENQRPELMAEFVEEVMEDPRVEVK 486

Query: 510 THDWNSIIHAFCKAGRLEDARRTFRRMKFLQFEPNEQTFLSLINGYVSAERYFCVLMLWN 569
           +HDWNSIIHAFCK+GRL DA+ TFRRM FLQ+EPN QT+LSLINGYVS E+YF V+++W 
Sbjct: 487 SHDWNSIIHAFCKSGRLGDAKSTFRRMTFLQYEPNNQTYLSLINGYVSCEKYFEVVVIWK 546

Query: 570 ELKWKITANGETGIKLDNNLVDAFLYALVKGGFFDAVMQVVEKTKDTKIFVDKWKYKQAF 629
           E K       +   KL++ L DAFL ALVKGGFF   +QV+EK ++ KIFVDKW+YK  F
Sbjct: 547 EFK-------DKKAKLEHALADAFLNALVKGGFFGTALQVIEKCQEMKIFVDKWRYKATF 606

Query: 630 METHKKLKVAKLRRRNHRKMESLIAFKNWA 652
           MET K L++ KLR+R  +K+E L AFKNWA
Sbjct: 607 METQKNLRLPKLRKRKMKKIEFLDAFKNWA 615

BLAST of Cla97C05G105680 vs. TAIR10
Match: AT1G03100.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 156.0 bits (393), Expect = 8.0e-38
Identity = 100/339 (29.50%), Postives = 160/339 (47.20%), Query Frame = 0

Query: 324 EKTYLEMVKGFIQSGNLKELSALIVNAKNLESSSEVDGSIGFGIINACVNIGWLDKAHDI 383
           E+ Y+++ K F++SG +KEL+  ++ A++ +S    D S+   +INAC+++G LD+AHD+
Sbjct: 459 EEIYVKLAKAFLESGKMKELAKFLLKAEHEDSPVSSDNSMLINVINACISLGMLDQAHDL 518

Query: 384 LNEINSQGVSLGLGVYLPILKAYRKEHRTAEATRLIMDISSSGLQLDAESYDALIEASMS 443
           L+E+   GV                                                   
Sbjct: 519 LDEMRMAGVRTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 578

Query: 444 NQDFQSAFALFRNMRE---TRKSDMKASYLTIMTGLMENHRPELMAAFLDEVVEDPLVEV 503
                               R  + K  +  ++ G   N    LM+  L E+ E   ++ 
Sbjct: 579 XXXXXXXXXXXXXXXXXXILRGGNQK--FEKLLKGCEGNAEAGLMSKLLREIREVQSLDA 638

Query: 504 GTHDWNSIIHAFCKAGRLEDARRTFRRMKFLQFEPNEQTFLSLINGYVS-AERYFCVLML 563
           G HDWN++IH F K G ++DA +  +RM+ L   PN QTF S++ GY +   +Y  V  L
Sbjct: 639 GVHDWNNVIHFFSKKGLMQDAEKALKRMRSLGHSPNAQTFHSMVTGYAAIGSKYTEVTEL 698

Query: 564 WNELKWKITANGETGIKLDNNLVDAFLYALVKGGFFDAVMQVVEKTKDTKIFVDKWKYKQ 623
           W E+  K  A   + +K D  L+DA LY  V+GGFF    +VVE  +   +FVDK+KY+ 
Sbjct: 699 WGEM--KSIAAATSSMKFDQELLDAVLYTFVRGGFFSRANEVVEMMEKKNMFVDKYKYRM 758

Query: 624 AFMETHK---KLKVAKLRRRNH-RKMESLIAFKNWAGLN 655
            F++ HK   K K  K++  +  +K E+ + FK W GL+
Sbjct: 759 LFLKYHKTAYKGKAPKVQSESQLKKREAGLVFKKWLGLS 793

BLAST of Cla97C05G105680 vs. TAIR10
Match: AT3G09650.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 148.7 bits (374), Expect = 1.3e-35
Identity = 97/349 (27.79%), Postives = 171/349 (49.00%), Query Frame = 0

Query: 325 KTYLEMVKGFIQSGNLKELSALIVNAKNLES-SSEVDGSIGFGIINACVNIGWLDKAHDI 384
           + Y  ++KG++++G + + + ++   +  +  +S  D      +++A VN G +D+A  +
Sbjct: 415 RIYTTLMKGYMKNGRVADTARMLEAMRRQDDRNSHPDEVTYTTVVSAFVNAGLMDRARQV 474

Query: 385 LNEINSQGVSLGLGVYLPILKAYRKEHRTAEATRLIMDIS-SSGLQLDAESYDALIEASM 444
           L E+   GV      Y  +LK Y K+ +   A  L+ +++  +G++ D  SY+ +I+  +
Sbjct: 475 LAEMARMGVPANRITYNVLLKGYCKQLQIDRAEDLLREMTEDAGIEPDVVSYNIIIDGCI 534

Query: 445 SNQDFQSAFALFRNMRETRKSDMKASYLTIMTGLMENHRPELMAAFLDEVVEDPLVEVGT 504
              D   A A F  MR    +  K SY T+M     + +P+L     DE++ DP V+V  
Sbjct: 535 LIDDSAGALAFFNEMRTRGIAPTKISYTTLMKAFAMSGQPKLANRVFDEMMNDPRVKVDL 594

Query: 505 HDWNSIIHAFCKAGRLEDARRTFRRMKFLQFEPNEQTFLSLINGYVSAERYFCVLMLWNE 564
             WN ++  +C+ G +EDA+R   RMK   F PN  T+ SL NG   A +    L+LW E
Sbjct: 595 IAWNMLVEGYCRLGLIEDAQRVVSRMKENGFYPNVATYGSLANGVSQARKPGDALLLWKE 654

Query: 565 LKWKITANGETG------------IKLDNNLVDAFLYALVKGGFFDAVMQVVEKTKDTKI 624
           +K +     +              +K D  L+D      V+  FF   ++++   ++  I
Sbjct: 655 IKERCAVKKKEAPSDSSSDPAPPMLKPDEGLLDTLADICVRAAFFKKALEIIACMEENGI 714

Query: 625 FVDKWKYKQAFMETHKKL------KVAKLRRRNHRKMESLIAFKNWAGL 654
             +K KYK+ ++E H ++        A++ RR  RK  +  AFK W GL
Sbjct: 715 PPNKTKYKKIYVEMHSRMFTSKHASQARIDRRVERK-RAAEAFKFWLGL 762

BLAST of Cla97C05G105680 vs. TAIR10
Match: AT4G17616.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 124.4 bits (311), Expect = 2.6e-28
Identity = 130/626 (20.77%), Postives = 242/626 (38.66%), Query Frame = 0

Query: 69  LHKSLLTSDTDEAWKSFKLLTRSSVFPCKPLTNSLIAHLSSIGDVHNLKRAFASVVFVIE 128
           L  +L     D+AW  FK   R   FP   + N  +  LS   D   L +A       ++
Sbjct: 61  LETALKDHRVDDAWDVFKDFKRLYGFPESVIMNRFVTVLSYSSDAGWLCKASDLTRLALK 120

Query: 129 KKPELLDFGSVKALLASMKCANTAAPALSLIKCMFKNRCFVPFSV-------WGNELVDI 188
           + P +L    +  L  S+  A     A S+++ M +    +   V            +  
Sbjct: 121 QNPGMLSGDVLTKLSLSLARAQMVESACSILRIMLEKGYVLTSDVLRLVVMHMVKTEIGT 180

Query: 189 CRQNGSLIPFL-RVFEENCRIALDESLDFMKPDLIACNAALEGCCYELESVTDAEKVVET 248
           C  +  L+    R  E N         + +KPD +  N  L G C         ++++E 
Sbjct: 181 CLASNYLVQVCDRFVEFNVGKRNSSPGNVVKPDTVLFNLVL-GSCVRFGFSLKGQELIEL 240

Query: 249 MSLLYLRPDEVSFGALAYLYALKGLEQKIIELEVLMGNFG---FNHKDLFFSSLISGYVH 308
           M+ + +  D  S   ++ +Y + G+  ++ + +  +G        H   FF +L+S    
Sbjct: 241 MAKVDVVADAYSIVIMSCIYEMNGMRDELRKFKEHIGQVPPQLLGHYQHFFDNLLSLEFK 300

Query: 309 ASNFAVVSKTMLRSLKDE--------------------------CGAHVHFG-------- 368
             +     +  L   K +                           G  +H          
Sbjct: 301 FDDIGSAGRLALDMCKSKVLVSVENLGFDSEKPRVLPVGSHHIRSGLKIHISPKLLQRDS 360

Query: 369 ---------------------EKTYLEMVKGFIQSGNLKELSALIVNAKNLESSSEVDGS 428
                                 KT  ++V G+ +  NL ELS L+ +       ++V   
Sbjct: 361 SLGVDTEATFVNYSNSKLGITNKTLAKLVYGYKRHDNLPELSKLLFSLGGSRLCADV--- 420

Query: 429 IGFGIINACVNIGWLDKAHDILNEINSQGVSLGLGVYLPILKAYRKEHRTAEATRLIMDI 488
                I+ACV IGWL+ AHDIL+++NS G  + L  Y  +L  Y K      A  L+  +
Sbjct: 421 -----IDACVAIGWLEAAHDILDDMNSAGYPMELATYRMVLSGYYKSKMLRNAEVLLKQM 480

Query: 489 SSSGLQLDAESYDALIEASMSNQDFQSAFALFRNMRETRKSDMKASYLTIMTGLMENHRP 548
           + +GL  D  S + ++      +D ++       +R+    ++ A          +   P
Sbjct: 481 TKAGLITD-PSNEIVVSPETEEKDSENT-----ELRDLLVQEINAG--------KQMKAP 540

Query: 549 ELMAAFLDEVVEDPLVEVGTHDWNSIIHAFCKAGRLEDARRTFRRMKFLQFEPNEQTFLS 608
            ++                 ++ NS ++ FCKA    DA  T+R++  ++  P  Q+F  
Sbjct: 541 SML-----------------YELNSSLYYFCKAKMQGDALITYRKIPKMKIPPTVQSFWI 600

Query: 609 LINGYVSAERYFCVLMLWNELKWKITANGETGIKLDNNLVDAFLYALVKGGFFDAVMQVV 629
           LI+ Y S   Y  + ++W ++K  I +     +K   +L++  +   ++GG+F+ VM+++
Sbjct: 601 LIDMYSSLGMYREITIVWGDIKRNIASK---NLKTTQDLLEKLVVNFLRGGYFERVMELI 643

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_008446433.10.0e+0092.37PREDICTED: pentatricopeptide repeat-containing protein At1g69290 [Cucumis melo][more]
XP_004135146.10.0e+0092.53PREDICTED: pentatricopeptide repeat-containing protein At1g69290 [Cucumis sativu... [more]
XP_022149103.10.0e+0089.77pentatricopeptide repeat-containing protein At1g69290 [Momordica charantia][more]
XP_022968525.10.0e+0086.26pentatricopeptide repeat-containing protein At1g69290 [Cucurbita maxima][more]
XP_023541058.10.0e+0086.11pentatricopeptide repeat-containing protein At1g69290 [Cucurbita pepo subsp. pep... [more]
Match NameE-valueIdentityDescription
tr|A0A1S3BF23|A0A1S3BF23_CUCME0.0e+0092.37pentatricopeptide repeat-containing protein At1g69290 OS=Cucumis melo OX=3656 GN... [more]
tr|A0A0A0KSW9|A0A0A0KSW9_CUCSA0.0e+0092.53Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G606690 PE=4 SV=1[more]
tr|M5XK61|M5XK61_PRUPE1.4e-26770.58Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_1G286800 PE=4 SV=1[more]
tr|A0A2I4H895|A0A2I4H895_9ROSI7.2e-26770.69pentatricopeptide repeat-containing protein At1g69290 OS=Juglans regia OX=51240 ... [more]
tr|A0A2P6R2Q5|A0A2P6R2Q5_ROSCH2.8e-26369.66Putative pentatricopeptide OS=Rosa chinensis OX=74649 GN=RchiOBHm_Chr4g0438351 P... [more]
Match NameE-valueIdentityDescription
sp|P0C7R4|PP110_ARATH3.8e-23163.35Pentatricopeptide repeat-containing protein At1g69290 OS=Arabidopsis thaliana OX... [more]
sp|Q9CAA5|PP109_ARATH7.6e-19556.51Pentatricopeptide repeat-containing protein At1g68980, mitochondrial OS=Arabidop... [more]
sp|Q9SA60|PPR6_ARATH1.4e-3629.50Pentatricopeptide repeat-containing protein At1g03100, mitochondrial OS=Arabidop... [more]
sp|Q9SF38|PP222_ARATH2.3e-3427.79Pentatricopeptide repeat-containing protein At3g09650, chloroplastic OS=Arabidop... [more]
sp|B3H672|PP317_ARATH4.7e-2720.77Pentatricopeptide repeat-containing protein At4g17616 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
AT1G69290.12.1e-23263.35Pentatricopeptide repeat (PPR) superfamily protein[more]
AT1G68980.14.2e-19656.51Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G03100.18.0e-3829.50Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G09650.11.3e-3527.79Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT4G17616.12.6e-2820.77Pentatricopeptide repeat (PPR) superfamily protein[more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0009451 RNA modification
biological_process GO:0008150 biological_process
cellular_component GO:0043231 intracellular membrane-bounded organelle
cellular_component GO:0005575 cellular_component
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding
molecular_function GO:0030247 polysaccharide binding
molecular_function GO:0003674 molecular_function
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C05G105680.1Cla97C05G105680.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 326..463
e-value: 3.2E-14
score: 55.2
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 176..315
e-value: 3.4E-8
score: 35.0
coord: 464..628
e-value: 3.6E-17
score: 64.2
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 418..461
e-value: 0.0011
score: 18.8
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 505..537
e-value: 2.8E-8
score: 31.5
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 505..548
e-value: 7.1E-12
score: 45.2
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 367..392
e-value: 0.49
score: 10.7
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 465..499
score: 5.031
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 360..394
score: 6.697
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 430..460
score: 7.903
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 249..283
score: 5.119
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 536..566
score: 5.952
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 213..248
score: 7.443
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 395..429
score: 6.566
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 578..612
score: 5.634
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 323..357
score: 5.382
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 501..535
score: 11.915
NoneNo IPR availablePANTHERPTHR24015:SF304SUBFAMILY NOT NAMEDcoord: 10..651
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 10..651

The following gene(s) are paralogous to this gene:

None