Cla97C04G070890.1 (mRNA) Watermelon (97103) v2

NameCla97C04G070890.1
TypemRNA
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionPentatricopeptide repeat-containing protein, putative
LocationCla97Chr04 : 15304631 .. 15306481 (-)
Sequence length1851
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTTTGGAACTCCATCATCAAGTCCCACTTTGACTCAGGTTTGTTCCTTTCTGCCCTTTTATTGTATAAAAACATGAGGGAGGTGGGAGTTGAGCATGATGGTTTCACGTTTCCGATCGTTAATCATGTCATTATGTCGATTTGGGTTGATGTAGTCTATGCGGGAATGGTCCATTGTGTTGGAATTCGAATGGGCTTTAGTGCTGATTTGTATTTCTGTAATACCATGATGGAGGTTTATGGGAAATGTGGGTGTTTGGTTTATGCTCGTAATGTGTTTGATGAAATGCCTAACAGAGACTTGGTTTCTTGGACGTCCATGATTTCGGCGTATGTTAATGGCGGTGATGCTGTTTGTGCCTTGGATCTTTATGAGGGAATGAGGAGGGAGTTGGAGCCGAACTCGGTGACAGTAATGGTGATGCTGCAAGCTTGTTGTGTGACTCGAAATTTGGTTCTAGGAAGGCTGCTTCAATGTCATGTGGTTAAGAATGGTTTATTGTTTGATATAGGTCTGCAGAATTCGTTCTTGCGAATGTATAGTCAACTGGGTGGGGAGGATGAAGTCGGAGTTATTTTCTCTGAAATTGATTGCAAGAATGTTGTTTCTTGGAATATTTTGATGTCTTTTTATTCCTCCGTGGGGGATATTTTGAAAGTTGTGGATATCTTCAACAAAATCATGCGTGAAGTTGCATTCAGCATTGAGACATTAACCATGCTTATATCAGCAACTGCGAGTTCTGATTCCGGGTGTCTGATCCTAGGTGAAAATCTACATTCCTTGGCAATTAAAAGTGGCCTTTATGATGGTATTCTGCGGACTTCTTTGTTGTATATGTATGCTAAGTTTGGGGAGTTGGAAAATTCAACTAGGTTGTTTAAGGAAATTTCCAATAGGAGCATCATTACTTGGGGGGCCATGATGTCTAGTTTTATTCAAAATGGACATTTTGATGAGGCAGTTGAGATCTTCAAGCAAATGCAAGCTGCTGGCTTGAAACCCAGTGTTGGAATTTTGAAACACTTAATTGACGCTTATGCCTGTCTGGGTGCTCTGCAGTTGGGGAAAGTAATACATTGTTACCTCATCCGAATCTATGGATTGGAGACATGTAATACACACTTAGAAACATCTCTCCTGAACATGTATGTAAGATGTGGGAGCATTCCTTCTGCCAGAAAATGTTTTGACTTGATCTTAATTAAAGATGTTGTGGCGTGGACTTCCATGATTGAGGGATATGGTTCTCATGGACTAGGTATTGATGCTCTCAATCTGTTCCATCAAATGATGAGTGAAGCAGTGATCCCAAATAATGTCACGTTCTTAAGTCTGTTATCTGCCTGTAGCCACTCTGGCCTTGTAAGTGAGGGCTGTGAAATATTTTATTCAATGAGGTCAAGGTTCAACATTAAGCCTGATTTAGAGCACTACACTTGTTTTGTTGATCTTTTGAGTAGATCAACAAGAGTAAGAGAGGCCTTTGCAATAACATTGAGAATGACAAATCTCTGTGATGGCAGGATTTGGGGTGCTCTTATGGGCGCCTGCCGGGTGTATGGAGACAATAAAATCGCTAACTATGCTGCACGCAGGCTTCTTGAATTAGAACCTAACAATGTAGGCTATTATACTTTGTTGAGCAATTCACAGGCCAGTGCTGGGCAGTGGCATGAAGTCGAAAAATTACGTAGCGTTGTGTATGAGAAAGATCTTATCAAGAAACCAGGTTGGAGCTTCATTGAGTTAAATGGAACGATTCATGGGTTTGTTTCAGGAGATACATCACACAACAAGACCGATGAGATTTATGATTTACTGGTATATATTAATAGGATAAAATAG

mRNA sequence

ATGCTTTGGAACTCCATCATCAAGTCCCACTTTGACTCAGGTTTGTTCCTTTCTGCCCTTTTATTGTATAAAAACATGAGGGAGGTGGGAGTTGAGCATGATGGTTTCACGTTTCCGATCGTTAATCATGTCATTATGTCGATTTGGGTTGATGTAGTCTATGCGGGAATGGTCCATTGTGTTGGAATTCGAATGGGCTTTAGTGCTGATTTGTATTTCTGTAATACCATGATGGAGGTTTATGGGAAATGTGGGTGTTTGGTTTATGCTCGTAATGTGTTTGATGAAATGCCTAACAGAGACTTGGTTTCTTGGACGTCCATGATTTCGGCGTATGTTAATGGCGGTGATGCTGTTTGTGCCTTGGATCTTTATGAGGGAATGAGGAGGGAGTTGGAGCCGAACTCGGTGACAGTAATGGTGATGCTGCAAGCTTGTTGTGTGACTCGAAATTTGGTTCTAGGAAGGCTGCTTCAATGTCATGTGGTTAAGAATGGTTTATTGTTTGATATAGGTCTGCAGAATTCGTTCTTGCGAATGTATAGTCAACTGGGTGGGGAGGATGAAGTCGGAGTTATTTTCTCTGAAATTGATTGCAAGAATGTTGTTTCTTGGAATATTTTGATGTCTTTTTATTCCTCCGTGGGGGATATTTTGAAAGTTGTGGATATCTTCAACAAAATCATGCGTGAAGTTGCATTCAGCATTGAGACATTAACCATGCTTATATCAGCAACTGCGAGTTCTGATTCCGGGTGTCTGATCCTAGGTGAAAATCTACATTCCTTGGCAATTAAAAGTGGCCTTTATGATGGTATTCTGCGGACTTCTTTGTTGTATATGTATGCTAAGTTTGGGGAGTTGGAAAATTCAACTAGGTTGTTTAAGGAAATTTCCAATAGGAGCATCATTACTTGGGGGGCCATGATGTCTAGTTTTATTCAAAATGGACATTTTGATGAGGCAGTTGAGATCTTCAAGCAAATGCAAGCTGCTGGCTTGAAACCCAGTGTTGGAATTTTGAAACACTTAATTGACGCTTATGCCTGTCTGGGTGCTCTGCAGTTGGGGAAAGTAATACATTGTTACCTCATCCGAATCTATGGATTGGAGACATGTAATACACACTTAGAAACATCTCTCCTGAACATGTATGTAAGATGTGGGAGCATTCCTTCTGCCAGAAAATGTTTTGACTTGATCTTAATTAAAGATGTTGTGGCGTGGACTTCCATGATTGAGGGATATGGTTCTCATGGACTAGGTATTGATGCTCTCAATCTGTTCCATCAAATGATGAGTGAAGCAGTGATCCCAAATAATGTCACGTTCTTAAGTCTGTTATCTGCCTGTAGCCACTCTGGCCTTGTAAGTGAGGGCTGTGAAATATTTTATTCAATGAGGTCAAGGTTCAACATTAAGCCTGATTTAGAGCACTACACTTGTTTTGTTGATCTTTTGAGTAGATCAACAAGAGTAAGAGAGGCCTTTGCAATAACATTGAGAATGACAAATCTCTGTGATGGCAGGATTTGGGGTGCTCTTATGGGCGCCTGCCGGGTGTATGGAGACAATAAAATCGCTAACTATGCTGCACGCAGGCTTCTTGAATTAGAACCTAACAATGTAGGCTATTATACTTTGTTGAGCAATTCACAGGCCAGTGCTGGGCAGTGGCATGAAGTCGAAAAATTACGTAGCGTTGTGTATGAGAAAGATCTTATCAAGAAACCAGGTTGGAGCTTCATTGAGTTAAATGGAACGATTCATGGGTTTGTTTCAGGAGATACATCACACAACAAGACCGATGAGATTTATGATTTACTGGTATATATTAATAGGATAAAATAG

Coding sequence (CDS)

ATGCTTTGGAACTCCATCATCAAGTCCCACTTTGACTCAGGTTTGTTCCTTTCTGCCCTTTTATTGTATAAAAACATGAGGGAGGTGGGAGTTGAGCATGATGGTTTCACGTTTCCGATCGTTAATCATGTCATTATGTCGATTTGGGTTGATGTAGTCTATGCGGGAATGGTCCATTGTGTTGGAATTCGAATGGGCTTTAGTGCTGATTTGTATTTCTGTAATACCATGATGGAGGTTTATGGGAAATGTGGGTGTTTGGTTTATGCTCGTAATGTGTTTGATGAAATGCCTAACAGAGACTTGGTTTCTTGGACGTCCATGATTTCGGCGTATGTTAATGGCGGTGATGCTGTTTGTGCCTTGGATCTTTATGAGGGAATGAGGAGGGAGTTGGAGCCGAACTCGGTGACAGTAATGGTGATGCTGCAAGCTTGTTGTGTGACTCGAAATTTGGTTCTAGGAAGGCTGCTTCAATGTCATGTGGTTAAGAATGGTTTATTGTTTGATATAGGTCTGCAGAATTCGTTCTTGCGAATGTATAGTCAACTGGGTGGGGAGGATGAAGTCGGAGTTATTTTCTCTGAAATTGATTGCAAGAATGTTGTTTCTTGGAATATTTTGATGTCTTTTTATTCCTCCGTGGGGGATATTTTGAAAGTTGTGGATATCTTCAACAAAATCATGCGTGAAGTTGCATTCAGCATTGAGACATTAACCATGCTTATATCAGCAACTGCGAGTTCTGATTCCGGGTGTCTGATCCTAGGTGAAAATCTACATTCCTTGGCAATTAAAAGTGGCCTTTATGATGGTATTCTGCGGACTTCTTTGTTGTATATGTATGCTAAGTTTGGGGAGTTGGAAAATTCAACTAGGTTGTTTAAGGAAATTTCCAATAGGAGCATCATTACTTGGGGGGCCATGATGTCTAGTTTTATTCAAAATGGACATTTTGATGAGGCAGTTGAGATCTTCAAGCAAATGCAAGCTGCTGGCTTGAAACCCAGTGTTGGAATTTTGAAACACTTAATTGACGCTTATGCCTGTCTGGGTGCTCTGCAGTTGGGGAAAGTAATACATTGTTACCTCATCCGAATCTATGGATTGGAGACATGTAATACACACTTAGAAACATCTCTCCTGAACATGTATGTAAGATGTGGGAGCATTCCTTCTGCCAGAAAATGTTTTGACTTGATCTTAATTAAAGATGTTGTGGCGTGGACTTCCATGATTGAGGGATATGGTTCTCATGGACTAGGTATTGATGCTCTCAATCTGTTCCATCAAATGATGAGTGAAGCAGTGATCCCAAATAATGTCACGTTCTTAAGTCTGTTATCTGCCTGTAGCCACTCTGGCCTTGTAAGTGAGGGCTGTGAAATATTTTATTCAATGAGGTCAAGGTTCAACATTAAGCCTGATTTAGAGCACTACACTTGTTTTGTTGATCTTTTGAGTAGATCAACAAGAGTAAGAGAGGCCTTTGCAATAACATTGAGAATGACAAATCTCTGTGATGGCAGGATTTGGGGTGCTCTTATGGGCGCCTGCCGGGTGTATGGAGACAATAAAATCGCTAACTATGCTGCACGCAGGCTTCTTGAATTAGAACCTAACAATGTAGGCTATTATACTTTGTTGAGCAATTCACAGGCCAGTGCTGGGCAGTGGCATGAAGTCGAAAAATTACGTAGCGTTGTGTATGAGAAAGATCTTATCAAGAAACCAGGTTGGAGCTTCATTGAGTTAAATGGAACGATTCATGGGTTTGTTTCAGGAGATACATCACACAACAAGACCGATGAGATTTATGATTTACTGGTATATATTAATAGGATAAAATAG

Protein sequence

MLWNSIIKSHFDSGLFLSALLLYKNMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMVHCVGIRMGFSADLYFCNTMMEVYGKCGCLVYARNVFDEMPNRDLVSWTSMISAYVNGGDAVCALDLYEGMRRELEPNSVTVMVMLQACCVTRNLVLGRLLQCHVVKNGLLFDIGLQNSFLRMYSQLGGEDEVGVIFSEIDCKNVVSWNILMSFYSSVGDILKVVDIFNKIMREVAFSIETLTMLISATASSDSGCLILGENLHSLAIKSGLYDGILRTSLLYMYAKFGELENSTRLFKEISNRSIITWGAMMSSFIQNGHFDEAVEIFKQMQAAGLKPSVGILKHLIDAYACLGALQLGKVIHCYLIRIYGLETCNTHLETSLLNMYVRCGSIPSARKCFDLILIKDVVAWTSMIEGYGSHGLGIDALNLFHQMMSEAVIPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLEHYTCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAARRLLELEPNNVGYYTLLSNSQASAGQWHEVEKLRSVVYEKDLIKKPGWSFIELNGTIHGFVSGDTSHNKTDEIYDLLVYINRIK
BLAST of Cla97C04G070890.1 vs. NCBI nr
Match: XP_008457591.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g35130, chloroplastic [Cucumis melo] >XP_008457593.1 PREDICTED: pentatricopeptide repeat-containing protein At4g35130, chloroplastic [Cucumis melo] >XP_016902177.1 PREDICTED: pentatricopeptide repeat-containing protein At4g35130, chloroplastic [Cucumis melo] >XP_016902178.1 PREDICTED: pentatricopeptide repeat-containing protein At4g35130, chloroplastic [Cucumis melo] >XP_016902179.1 PREDICTED: pentatricopeptide repeat-containing protein At4g35130, chloroplastic [Cucumis melo] >XP_016902180.1 PREDICTED: pentatricopeptide repeat-containing protein At4g35130, chloroplastic [Cucumis melo])

HSP 1 Score: 1105.5 bits (2858), Expect = 0.0e+00
Identity = 547/616 (88.80%), Postives = 571/616 (92.69%), Query Frame = 0

Query: 1   MLWNSIIKSHFDSGLFLSALLLYKNMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMVHC 60
           MLWN++IKSHFDSGLF SALLLYKNMREV VEHDGFT PIVN VI+SIWVDVVY GMVHC
Sbjct: 1   MLWNNVIKSHFDSGLFHSALLLYKNMREVRVEHDGFTLPIVNQVILSIWVDVVYGGMVHC 60

Query: 61  VGIRMGFSADLYFCNTMMEVYGKCGCLVYARNVFDEMPNRDLVSWTSMISAYVNGGDAVC 120
           VGIRMGFS+DLYFCNTMMEVYGKCGCLV AR+VFDEMPNRDLVSWTSMISAYV GGD  C
Sbjct: 61  VGIRMGFSSDLYFCNTMMEVYGKCGCLVSARDVFDEMPNRDLVSWTSMISAYVKGGDVFC 120

Query: 121 ALDLYEGMRRELEPNSVTVMVMLQACCVTRNLVLGRLLQCHVVKNGLLFDIGLQNSFLRM 180
           ALD++EGMRRELEPNSVTV+VMLQACC T+NLVLGRLLQC+VVKNGLLFD GLQNSFLRM
Sbjct: 121 ALDIFEGMRRELEPNSVTVIVMLQACCATQNLVLGRLLQCYVVKNGLLFDTGLQNSFLRM 180

Query: 181 YSQLGGEDEVGVIFSEIDCKNVVSWNILMSFYSSVGDILKVVDIFNKIMREVAFSIETLT 240
           YS+LGGEDEV   FSEID KNVVSWNILMSFYSS+GDI+KVVDI NKIM EV  SIETLT
Sbjct: 181 YSRLGGEDEVVAFFSEIDFKNVVSWNILMSFYSSMGDIVKVVDILNKIMGEVPLSIETLT 240

Query: 241 MLISATASSDSGCLILGENLHSLAIKSGLYDGILRTSLLYMYAKFGELENSTRLFKEISN 300
           +LIS  A+SDSGCLILGENLHSLAIKSGLYD IL TSLL MYAKFGELENSTRLFKEI N
Sbjct: 241 ILISGIATSDSGCLILGENLHSLAIKSGLYDDILCTSLLDMYAKFGELENSTRLFKEIPN 300

Query: 301 RSIITWGAMMSSFIQNGHFDEAVEIFKQMQAAGLKPSVGILKHLIDAYACLGALQLGKVI 360
           RSIITWGAMMSSFIQNGHFD+AV+IFKQMQ AGLKPSVGILKHLIDAYA LGALQLGK I
Sbjct: 301 RSIITWGAMMSSFIQNGHFDDAVDIFKQMQVAGLKPSVGILKHLIDAYAYLGALQLGKAI 360

Query: 361 HCYLIRIYGLETCNTHLETSLLNMYVRCGSIPSARKCFDLILIKDVVAWTSMIEGYGSHG 420
           HC+LIRIYGL  CNT LETS+LNMYVRCGSI SARKCFDLILIKDVVAWTSMIEGYG+HG
Sbjct: 361 HCHLIRIYGLVVCNTRLETSVLNMYVRCGSIASARKCFDLILIKDVVAWTSMIEGYGAHG 420

Query: 421 LGIDALNLFHQMMSEAVIPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLEHY 480
           LGIDALNLFHQM SE V PNNVTFLSLLSACSHSGLVSEGC IFYSMRSRFNIKPDLEHY
Sbjct: 421 LGIDALNLFHQMTSEEVTPNNVTFLSLLSACSHSGLVSEGCGIFYSMRSRFNIKPDLEHY 480

Query: 481 TCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAARRLLELEP 540
           TCFVDLLSRSTRVREAFAI LRMTNLCDGRIWGALMGACRVYGDNKIANYAA RLLELEP
Sbjct: 481 TCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDNKIANYAAHRLLELEP 540

Query: 541 NNVGYYTLLSNSQASAGQWHEVEKLRSVVYEKDLIKKPGWSFIELNGTIHGFVSGDTSHN 600
           +NVGYYTLLSNSQAS GQWHE EKLRS+VYEK+L KKPGWSFIELNGTIHGFVSGD SH 
Sbjct: 541 DNVGYYTLLSNSQASVGQWHEAEKLRSLVYEKNLAKKPGWSFIELNGTIHGFVSGDRSHY 600

Query: 601 KTDEIYDLLVYINRIK 617
           K +EIYDLLVYI RIK
Sbjct: 601 KANEIYDLLVYIYRIK 616

BLAST of Cla97C04G070890.1 vs. NCBI nr
Match: XP_023526509.1 (pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1089.7 bits (2817), Expect = 0.0e+00
Identity = 535/617 (86.71%), Postives = 571/617 (92.54%), Query Frame = 0

Query: 1   MLWNSIIKSHFDSGLFLSALLLYKNMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMVHC 60
           MLWNSIIKS FDSGLFLSA++LYKNMREVGVEHDGFTFPI+NHV+MSIWVDVVYAGMVHC
Sbjct: 65  MLWNSIIKSQFDSGLFLSAIMLYKNMREVGVEHDGFTFPILNHVVMSIWVDVVYAGMVHC 124

Query: 61  VGIRMGFSADLYFCNTMMEVYGKCGCLVYARNVFDEMPNRDLVSWTSMISAYVNGGDAVC 120
           VGIRMGF +DLYFCNTMMEVY KC CL +AR VFDEMPNRDLVSWTSMISAYVN GD VC
Sbjct: 125 VGIRMGFGSDLYFCNTMMEVYAKCECLGHARKVFDEMPNRDLVSWTSMISAYVNSGDIVC 184

Query: 121 ALDLYEGMRRELEPNSVTVMVMLQACCVTRNLVLGRLLQCHVVKNGLLFDIGLQNSFLRM 180
           AL+L+EGMRR LEPNSVT+M MLQACCVT +LVLGRL+QC VVKNGLLFD+GLQN FLRM
Sbjct: 185 ALNLFEGMRRVLEPNSVTMMAMLQACCVTEDLVLGRLIQCLVVKNGLLFDVGLQNWFLRM 244

Query: 181 YSQLGGEDEVGVIFSEIDCKNVVSWNILMSFYSSVGDILKVVDIFNKIM-REVAFSIETL 240
           YS+LGGEDE    FSEIDCKNVVSWNIL+SFYSSVGDI+K VDIF +IM  EV   IETL
Sbjct: 245 YSRLGGEDEFVCFFSEIDCKNVVSWNILISFYSSVGDIVKAVDIFKQIMGGEVPLIIETL 304

Query: 241 TMLISATASSDSGCLILGENLHSLAIKSGLYDGILRTSLLYMYAKFGELENSTRLFKEIS 300
           T+LISAT +S+S CLILGENLHSLAIK+GLYD ILRTSLL MYAKFGEL+NSTRLF EI 
Sbjct: 305 TILISATKTSESMCLILGENLHSLAIKTGLYDSILRTSLLDMYAKFGELDNSTRLFNEIP 364

Query: 301 NRSIITWGAMMSSFIQNGHFDEAVEIFKQMQAAGLKPSVGILKHLIDAYACLGALQLGKV 360
           NRSIITWGAMMSSFIQNGHFDEAVEIF QMQAAGLKPS+GILKHLIDAYA LGALQLG+ 
Sbjct: 365 NRSIITWGAMMSSFIQNGHFDEAVEIFSQMQAAGLKPSLGILKHLIDAYAHLGALQLGRG 424

Query: 361 IHCYLIRIYGLETCNTHLETSLLNMYVRCGSIPSARKCFDLILIKDVVAWTSMIEGYGSH 420
           IHCYLIRIYGLE CNTHLETSL+NMYVRCGSI SARKCFDLI++KDVVAWTSMIEGYG+H
Sbjct: 425 IHCYLIRIYGLEICNTHLETSLMNMYVRCGSIASARKCFDLIIVKDVVAWTSMIEGYGAH 484

Query: 421 GLGIDALNLFHQMMSEAVIPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLEH 480
           G GI+ALNL+H MMSE V PN+VTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLEH
Sbjct: 485 GQGINALNLYHHMMSEEVAPNSVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLEH 544

Query: 481 YTCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAARRLLELE 540
           YTCFVDLLSRSTRVREAFAI LRMTNLCDGRIWGALMGACRVYGD KIA YAA RLLELE
Sbjct: 545 YTCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDTKIAIYAAHRLLELE 604

Query: 541 PNNVGYYTLLSNSQASAGQWHEVEKLRSVVYEKDLIKKPGWSFIELNGTIHGFVSGDTSH 600
           P+NVGYYTLLSN+QAS GQWHEVEKLRSVVYEKDL+KKPGWSFIELNGTIHGFVSGD SH
Sbjct: 605 PDNVGYYTLLSNTQASVGQWHEVEKLRSVVYEKDLVKKPGWSFIELNGTIHGFVSGDRSH 664

Query: 601 NKTDEIYDLLVYINRIK 617
            KTD+IYDLLVY+NRI+
Sbjct: 665 GKTDQIYDLLVYLNRIE 681

BLAST of Cla97C04G070890.1 vs. NCBI nr
Match: XP_022932989.1 (pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like [Cucurbita moschata])

HSP 1 Score: 1083.9 bits (2802), Expect = 0.0e+00
Identity = 532/615 (86.50%), Postives = 568/615 (92.36%), Query Frame = 0

Query: 1   MLWNSIIKSHFDSGLFLSALLLYKNMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMVHC 60
           MLWNSIIKS FDSGLFLSA++LYKNMREVGVEHDGFTFPI+NHV+MSIWVDVVYAGMVHC
Sbjct: 1   MLWNSIIKSQFDSGLFLSAIMLYKNMREVGVEHDGFTFPILNHVVMSIWVDVVYAGMVHC 60

Query: 61  VGIRMGFSADLYFCNTMMEVYGKCGCLVYARNVFDEMPNRDLVSWTSMISAYVNGGDAVC 120
           VGIRMGF +DLYFCNTMMEVY KC CL +AR VFDEMPNRDLVSWTSMISAYVN GD VC
Sbjct: 61  VGIRMGFGSDLYFCNTMMEVYAKCECLGHARKVFDEMPNRDLVSWTSMISAYVNVGDIVC 120

Query: 121 ALDLYEGMRRELEPNSVTVMVMLQACCVTRNLVLGRLLQCHVVKNGLLFDIGLQNSFLRM 180
           AL+L+EGMRR  EPNSVT+M MLQACCVT +LVLGRL+QC VVKNGLLFD+GLQN FLRM
Sbjct: 121 ALNLFEGMRRVFEPNSVTMMAMLQACCVTEDLVLGRLIQCLVVKNGLLFDVGLQNWFLRM 180

Query: 181 YSQLGGEDEVGVIFSEIDCKNVVSWNILMSFYSSVGDILKVVDIFNKIMR-EVAFSIETL 240
           YS+LGGEDE    FSEIDCKNVVSW+IL+SFYSSVGDI+K VDIF +IM  EV   IETL
Sbjct: 181 YSRLGGEDEFVRFFSEIDCKNVVSWDILISFYSSVGDIVKAVDIFKQIMAGEVPLIIETL 240

Query: 241 TMLISATASSDSGCLILGENLHSLAIKSGLYDGILRTSLLYMYAKFGELENSTRLFKEIS 300
           T+LISAT +SDS CLILGENLHSLAIK+GLYD ILRTSLL MYAKFGEL+NSTRLF EI 
Sbjct: 241 TILISATKTSDSMCLILGENLHSLAIKTGLYDSILRTSLLDMYAKFGELDNSTRLFNEIP 300

Query: 301 NRSIITWGAMMSSFIQNGHFDEAVEIFKQMQAAGLKPSVGILKHLIDAYACLGALQLGKV 360
           NRSIITWGAMMSSFIQNGHFDEAVEIF QMQAAGLKPS+GILKHLIDAYA LGALQLG+ 
Sbjct: 301 NRSIITWGAMMSSFIQNGHFDEAVEIFSQMQAAGLKPSLGILKHLIDAYAHLGALQLGRG 360

Query: 361 IHCYLIRIYGLETCNTHLETSLLNMYVRCGSIPSARKCFDLILIKDVVAWTSMIEGYGSH 420
           IHCYLIRIYGLE CNTHLETSL+NMYVRCGSI SARKCFDLI++KDVVAWTSMIEGYGSH
Sbjct: 361 IHCYLIRIYGLEICNTHLETSLMNMYVRCGSIASARKCFDLIIVKDVVAWTSMIEGYGSH 420

Query: 421 GLGIDALNLFHQMMSEAVIPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLEH 480
           G GI+ALNL+H MMSE V PN+VTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLEH
Sbjct: 421 GQGINALNLYHHMMSEEVAPNSVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLEH 480

Query: 481 YTCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAARRLLELE 540
           YTCFVDLLSRSTRVREAFAI LRMTNLCDGRIWGALMGACRVYGDNKIA YAA RLLELE
Sbjct: 481 YTCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDNKIAIYAAHRLLELE 540

Query: 541 PNNVGYYTLLSNSQASAGQWHEVEKLRSVVYEKDLIKKPGWSFIELNGTIHGFVSGDTSH 600
           P+NVGYYTLLSN+QAS GQWHEVEKLRSVVYEKD +KKPGWSF+ELNGT+HGFVSGD SH
Sbjct: 541 PDNVGYYTLLSNTQASVGQWHEVEKLRSVVYEKDFVKKPGWSFVELNGTLHGFVSGDRSH 600

Query: 601 NKTDEIYDLLVYINR 615
            KTD+IYDLLVY+NR
Sbjct: 601 CKTDQIYDLLVYLNR 615

BLAST of Cla97C04G070890.1 vs. NCBI nr
Match: XP_022967941.1 (pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like [Cucurbita maxima])

HSP 1 Score: 1070.1 bits (2766), Expect = 2.8e-309
Identity = 527/617 (85.41%), Postives = 568/617 (92.06%), Query Frame = 0

Query: 1   MLWNSIIKSHFDSGLFLSALLLYKNMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMVHC 60
           MLWNSIIKS FDSGLF SA++LYKNMREVGVEHDGFTFPI+NHV+MSI VDVVYAGMVHC
Sbjct: 1   MLWNSIIKSQFDSGLFQSAIMLYKNMREVGVEHDGFTFPILNHVVMSICVDVVYAGMVHC 60

Query: 61  VGIRMGFSADLYFCNTMMEVYGKCGCLVYARNVFDEMPNRDLVSWTSMISAYVNGGDAVC 120
           VGIRMGF +DLYFCNTMMEVY KC CL +AR VFDEMPNRDLVSWTSMISAYVN G  VC
Sbjct: 61  VGIRMGFGSDLYFCNTMMEVYAKCECLGHARKVFDEMPNRDLVSWTSMISAYVNSGVIVC 120

Query: 121 ALDLYEGMRRELEPNSVTVMVMLQACCVTRNLVLGRLLQCHVVKNGLLFDIGLQNSFLRM 180
           AL+L+EGMRR LEPNSVT+M MLQACCVT +LVLGRL+QC VVKNGLLFD+GLQN FLRM
Sbjct: 121 ALNLFEGMRRVLEPNSVTMMAMLQACCVTEDLVLGRLIQCLVVKNGLLFDVGLQNWFLRM 180

Query: 181 YSQLGGEDEVGVIFSEIDCKNVVSWNILMSFYSSVGDILKVVDIFNKIMR-EVAFSIETL 240
           YS+LGGEDE   +FSEIDCKNVVSWNIL+SFY SVGDI+K VDIF +IM  EV   I+TL
Sbjct: 181 YSRLGGEDEFVRVFSEIDCKNVVSWNILISFYFSVGDIVKAVDIFKQIMSGEVPLIIDTL 240

Query: 241 TMLISATASSDSGCLILGENLHSLAIKSGLYDGILRTSLLYMYAKFGELENSTRLFKEIS 300
           T+LISAT +S+S CLILGENLHSLAIK+GLYD ILRTSLL MYAK GEL+NSTRLF EI 
Sbjct: 241 TILISATKTSESMCLILGENLHSLAIKTGLYDSILRTSLLDMYAKIGELDNSTRLFNEIP 300

Query: 301 NRSIITWGAMMSSFIQNGHFDEAVEIFKQMQAAGLKPSVGILKHLIDAYACLGALQLGKV 360
           NRSIITWGAMMSSFIQNGHFDEAV+IF QMQAAGLKPS+GILKHLIDAYA LGALQLG+ 
Sbjct: 301 NRSIITWGAMMSSFIQNGHFDEAVDIFSQMQAAGLKPSLGILKHLIDAYAHLGALQLGRG 360

Query: 361 IHCYLIRIYGLETCNTHLETSLLNMYVRCGSIPSARKCFDLILIKDVVAWTSMIEGYGSH 420
           IHCYLIRI+GLE CNTHLETSL+NMYVRCGSI SARKCFDLI++KDVVAWTSMIEGYG+H
Sbjct: 361 IHCYLIRIHGLEICNTHLETSLMNMYVRCGSIASARKCFDLIIVKDVVAWTSMIEGYGAH 420

Query: 421 GLGIDALNLFHQMMSEAVIPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLEH 480
           G GI+ALNL+H MMSE V PN+VTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLEH
Sbjct: 421 GQGINALNLYHHMMSEEVAPNSVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLEH 480

Query: 481 YTCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAARRLLELE 540
           YTCFVDLLSRSTRVREAFAI LRMTNLCDGRIWGALMGACRVYGDNKIA YAA RLLELE
Sbjct: 481 YTCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDNKIAIYAAHRLLELE 540

Query: 541 PNNVGYYTLLSNSQASAGQWHEVEKLRSVVYEKDLIKKPGWSFIELNGTIHGFVSGDTSH 600
           P+NVGYYTLLSN+QAS GQWHEVEKLRSVVYEK+L+KKPGWSFIELNGTIHGFVSGD SH
Sbjct: 541 PDNVGYYTLLSNTQASVGQWHEVEKLRSVVYEKNLVKKPGWSFIELNGTIHGFVSGDRSH 600

Query: 601 NKTDEIYDLLVYINRIK 617
            KTD+IYDLLVY+NRI+
Sbjct: 601 CKTDQIYDLLVYLNRIE 617

BLAST of Cla97C04G070890.1 vs. NCBI nr
Match: XP_022153922.1 (pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like [Momordica charantia])

HSP 1 Score: 1033.9 bits (2672), Expect = 2.3e-298
Identity = 512/610 (83.93%), Postives = 547/610 (89.67%), Query Frame = 0

Query: 1   MLWNSIIKSHFDSGLFLSALLLYKNMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMVHC 60
           MLWNSIIKSH +SGLF+SALLLYK MRE+GVEHDGFTFP+VN +IMSI +DVVYAGMVHC
Sbjct: 1   MLWNSIIKSHVESGLFVSALLLYKKMRELGVEHDGFTFPMVNRIIMSIQLDVVYAGMVHC 60

Query: 61  VGIRMGFSADLYFCNTMMEVYGKCGCLVYARNVFDEMPNRDLVSWTSMISAYVNGGDAVC 120
           VGIRMGF ADLYFCNTMMEVYGKCGCLV ARNVFDEMP+RDLVSWTSMIS YV  GD V 
Sbjct: 61  VGIRMGFGADLYFCNTMMEVYGKCGCLVSARNVFDEMPHRDLVSWTSMISVYVCRGDVVS 120

Query: 121 ALDLYEGMRRELEPNSVTVMVMLQACCVTRNLVLGRLLQCHVVKNGLLFDIGLQNSFLRM 180
            LDL+EGMRRELEPNSVT+MVM+QACC T NL LGR LQ HV KNGLLFDIGLQNS LRM
Sbjct: 121 GLDLFEGMRRELEPNSVTIMVMVQACCATGNLSLGRQLQSHVFKNGLLFDIGLQNSLLRM 180

Query: 181 YSQLGGEDEVGVIFSEIDCKNVVSWNILMSFYSSVGDILKVVDIFNKIMREVAFSIETLT 240
           Y++LGGEDEVGV FSE+D KNVVSWN+ +SFYSS GD +KVVDIFNKIM EV  S+ETLT
Sbjct: 181 YTRLGGEDEVGVFFSEVDRKNVVSWNVFISFYSSRGDFVKVVDIFNKIMGEVLLSVETLT 240

Query: 241 MLISATASSDSGCLILGENLHSLAIKSGLYDGILRTSLLYMYAKFGELENSTRLFKEISN 300
           +L+SATA+ DS  LILG+NLHSLAIKSGLYDGIL+TS L MYAKFGELENSTRLFKEI  
Sbjct: 241 ILVSATAAPDSEHLILGKNLHSLAIKSGLYDGILQTSFLDMYAKFGELENSTRLFKEIPR 300

Query: 301 RSIITWGAMMSSFIQNGHFDEAVEIFKQMQAAGLKPSVGILKHLIDAYACLGALQLGKVI 360
           +SIITWGAMMSSFIQNGHFD AVEIF QMQAAGLKPSVGILKHLIDAY  LG LQLGK I
Sbjct: 301 KSIITWGAMMSSFIQNGHFDGAVEIFNQMQAAGLKPSVGILKHLIDAYTHLGVLQLGKAI 360

Query: 361 HCYLIRIYGLETCNTHLETSLLNMYVRCGSIPSARKCFDLILIKDVVAWTSMIEGYGSHG 420
           HCYLIR+ GLE  NT L TS+LNMYVRCGS+ SA KCFDLILIKDVVAWTSMIEGYG+HG
Sbjct: 361 HCYLIRLNGLEIYNTQLGTSILNMYVRCGSLVSAIKCFDLILIKDVVAWTSMIEGYGAHG 420

Query: 421 LGIDALNLFHQMMSEAVIPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLEHY 480
           LG DALNLF QMM E V PNNVTFLSLLSACSHSGLVSEGC+IFYSMRSRFNI PDLEHY
Sbjct: 421 LGFDALNLFLQMMREEVTPNNVTFLSLLSACSHSGLVSEGCQIFYSMRSRFNINPDLEHY 480

Query: 481 TCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAARRLLELEP 540
           TCFVDLLSRSTRVREAFAI LRMTN  DGRIWGALMGACRVY DNKIANYAA RLLELEP
Sbjct: 481 TCFVDLLSRSTRVREAFAIILRMTNFRDGRIWGALMGACRVYEDNKIANYAAHRLLELEP 540

Query: 541 NNVGYYTLLSNSQASAGQWHEVEKLRSVVYEKDLIKKPGWSFIELNGTIHGFVSGDTSHN 600
           +NVGYYTLLSN+QA+ GQWH+VEKLRSVVYEKDL+KKPGWSFIEL G +HGFVSGD SH+
Sbjct: 541 DNVGYYTLLSNAQATVGQWHDVEKLRSVVYEKDLVKKPGWSFIELKGIVHGFVSGDRSHD 600

Query: 601 KTDEIYDLLV 611
           KT EIYDLLV
Sbjct: 601 KTKEIYDLLV 610

BLAST of Cla97C04G070890.1 vs. TrEMBL
Match: tr|A0A1S3C6I7|A0A1S3C6I7_CUCME (pentatricopeptide repeat-containing protein At4g35130, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103497256 PE=4 SV=1)

HSP 1 Score: 1105.5 bits (2858), Expect = 0.0e+00
Identity = 547/616 (88.80%), Postives = 571/616 (92.69%), Query Frame = 0

Query: 1   MLWNSIIKSHFDSGLFLSALLLYKNMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMVHC 60
           MLWN++IKSHFDSGLF SALLLYKNMREV VEHDGFT PIVN VI+SIWVDVVY GMVHC
Sbjct: 1   MLWNNVIKSHFDSGLFHSALLLYKNMREVRVEHDGFTLPIVNQVILSIWVDVVYGGMVHC 60

Query: 61  VGIRMGFSADLYFCNTMMEVYGKCGCLVYARNVFDEMPNRDLVSWTSMISAYVNGGDAVC 120
           VGIRMGFS+DLYFCNTMMEVYGKCGCLV AR+VFDEMPNRDLVSWTSMISAYV GGD  C
Sbjct: 61  VGIRMGFSSDLYFCNTMMEVYGKCGCLVSARDVFDEMPNRDLVSWTSMISAYVKGGDVFC 120

Query: 121 ALDLYEGMRRELEPNSVTVMVMLQACCVTRNLVLGRLLQCHVVKNGLLFDIGLQNSFLRM 180
           ALD++EGMRRELEPNSVTV+VMLQACC T+NLVLGRLLQC+VVKNGLLFD GLQNSFLRM
Sbjct: 121 ALDIFEGMRRELEPNSVTVIVMLQACCATQNLVLGRLLQCYVVKNGLLFDTGLQNSFLRM 180

Query: 181 YSQLGGEDEVGVIFSEIDCKNVVSWNILMSFYSSVGDILKVVDIFNKIMREVAFSIETLT 240
           YS+LGGEDEV   FSEID KNVVSWNILMSFYSS+GDI+KVVDI NKIM EV  SIETLT
Sbjct: 181 YSRLGGEDEVVAFFSEIDFKNVVSWNILMSFYSSMGDIVKVVDILNKIMGEVPLSIETLT 240

Query: 241 MLISATASSDSGCLILGENLHSLAIKSGLYDGILRTSLLYMYAKFGELENSTRLFKEISN 300
           +LIS  A+SDSGCLILGENLHSLAIKSGLYD IL TSLL MYAKFGELENSTRLFKEI N
Sbjct: 241 ILISGIATSDSGCLILGENLHSLAIKSGLYDDILCTSLLDMYAKFGELENSTRLFKEIPN 300

Query: 301 RSIITWGAMMSSFIQNGHFDEAVEIFKQMQAAGLKPSVGILKHLIDAYACLGALQLGKVI 360
           RSIITWGAMMSSFIQNGHFD+AV+IFKQMQ AGLKPSVGILKHLIDAYA LGALQLGK I
Sbjct: 301 RSIITWGAMMSSFIQNGHFDDAVDIFKQMQVAGLKPSVGILKHLIDAYAYLGALQLGKAI 360

Query: 361 HCYLIRIYGLETCNTHLETSLLNMYVRCGSIPSARKCFDLILIKDVVAWTSMIEGYGSHG 420
           HC+LIRIYGL  CNT LETS+LNMYVRCGSI SARKCFDLILIKDVVAWTSMIEGYG+HG
Sbjct: 361 HCHLIRIYGLVVCNTRLETSVLNMYVRCGSIASARKCFDLILIKDVVAWTSMIEGYGAHG 420

Query: 421 LGIDALNLFHQMMSEAVIPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLEHY 480
           LGIDALNLFHQM SE V PNNVTFLSLLSACSHSGLVSEGC IFYSMRSRFNIKPDLEHY
Sbjct: 421 LGIDALNLFHQMTSEEVTPNNVTFLSLLSACSHSGLVSEGCGIFYSMRSRFNIKPDLEHY 480

Query: 481 TCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAARRLLELEP 540
           TCFVDLLSRSTRVREAFAI LRMTNLCDGRIWGALMGACRVYGDNKIANYAA RLLELEP
Sbjct: 481 TCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDNKIANYAAHRLLELEP 540

Query: 541 NNVGYYTLLSNSQASAGQWHEVEKLRSVVYEKDLIKKPGWSFIELNGTIHGFVSGDTSHN 600
           +NVGYYTLLSNSQAS GQWHE EKLRS+VYEK+L KKPGWSFIELNGTIHGFVSGD SH 
Sbjct: 541 DNVGYYTLLSNSQASVGQWHEAEKLRSLVYEKNLAKKPGWSFIELNGTIHGFVSGDRSHY 600

Query: 601 KTDEIYDLLVYINRIK 617
           K +EIYDLLVYI RIK
Sbjct: 601 KANEIYDLLVYIYRIK 616

BLAST of Cla97C04G070890.1 vs. TrEMBL
Match: tr|A0A2N9FID5|A0A2N9FID5_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS14662 PE=4 SV=1)

HSP 1 Score: 708.0 bits (1826), Expect = 1.9e-200
Identity = 356/609 (58.46%), Postives = 449/609 (73.73%), Query Frame = 0

Query: 3   WNSIIKSHFDSGLFLSALLLYKNMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMVHCVG 62
           WN IIKSH D GLF SALLLYK MR +GV HD FTFPIVN  ++S+  DV+Y  MVHCV 
Sbjct: 28  WNLIIKSHLDLGLFDSALLLYKTMRHLGVAHDSFTFPIVNQAVLSLQSDVIYGEMVHCVS 87

Query: 63  IRMGFSADLYFCNTMMEVYGKCGCLVYARNVFDEMPNRDLVSWTSMISAYVNGGDAVCAL 122
            +MGF  ++YFCNTM+EVY KCGC+VYAR +FDEM  RDLVSWTSMIS YV  G    A 
Sbjct: 88  TKMGFGFEVYFCNTMIEVYVKCGCVVYARKLFDEMSQRDLVSWTSMISGYVCEGSVGSAF 147

Query: 123 DLYEGMRRELEPNSVTVMVMLQACCVTRNLVLGRLLQCHVVKNGLLFDIGLQNSFLRMYS 182
            L+  M  + EPNSVT+++MLQACC   +L+ G  L  + +K+GL  D  LQNS L+MY+
Sbjct: 148 YLFREMMVKSEPNSVTLIIMLQACCAGESLIHGMQLHGYAIKSGLESDGSLQNSVLKMYT 207

Query: 183 QLGGEDEVGVIFSEIDCKNVVSWNILMSFYSSVGDILKVVDIFNKIMREVAFSIETLTML 242
           + G  +EV + FS+ID K+ VSWNIL+SFYS  GDI K+V+ F+++    A SIETLT+L
Sbjct: 208 RTGSVEEVEIFFSKIDRKDDVSWNILISFYSMKGDIEKLVNRFSEMQGIAALSIETLTLL 267

Query: 243 ISATASSDSGCLILGENLHSLAIKSGLYDGILRTSLLYMYAKFGELENSTRLFKEISNRS 302
           ISA A S    L  GE +H LAIKSG  D +L TSLL  YAK G++E S +LF++IS R+
Sbjct: 268 ISAFAKSRD--LFQGEQIHCLAIKSGFCDDVLLTSLLDFYAKCGKIEISDQLFRKISYRN 327

Query: 303 IITWGAMMSSFIQNGHFDEAVEIFKQMQAAGLKPSVGILKHLIDAYACLGALQLGKVIHC 362
            +T+GAMMS F+QNG+  +A+ +F QMQAA ++P   IL+ ++DAY  LGALQLGK IH 
Sbjct: 328 NVTFGAMMSGFVQNGYVKDAINLFHQMQAANVEPGAEILRSILDAYTQLGALQLGKAIHG 387

Query: 363 YLIRIYGLETC--NTHLETSLLNMYVRCGSIPSARKCFDLILIKDVVAWTSMIEGYGSHG 422
           Y IR     T    T+LE S+LNMY+RCG+I SAR  F  IL+KD+V WT+MIEG+G+HG
Sbjct: 388 YFIRHIFCRTMEETTYLEASILNMYIRCGNISSARVSFHNILVKDLVIWTTMIEGFGTHG 447

Query: 423 LGIDALNLFHQMMSEAVIPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLEHY 482
           LG +AL LF  M+ E + PN+VTFLSLLSACSHSGLV EGCE++ SM+  F I+P+L+HY
Sbjct: 448 LGSEALELFGLMLKERIKPNSVTFLSLLSACSHSGLVREGCEVYNSMKWIFGIQPNLDHY 507

Query: 483 TCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAARRLLELEP 542
           TC VDLL R  +++EA AI ++M    DGRIWGAL+ ACRV+GD K+  Y A+RLLELEP
Sbjct: 508 TCMVDLLGRYGKLKEALAIIVKMVIFSDGRIWGALLAACRVHGDIKLGEYTAQRLLELEP 567

Query: 543 NNVGYYTLLSNSQASAGQWHEVEKLRSVVYEKDLIKKPGWSFIELNGTIHGFVSGDTSHN 602
           +NVGY+TLLSN QA  G+W EVE++R V+ EKDL KKPGWS  E  G IHGFVS D SH+
Sbjct: 568 DNVGYHTLLSNVQAGVGRWDEVEEVRRVMNEKDLKKKPGWSCFEAKGMIHGFVSADRSHH 627

Query: 603 KTDEIYDLL 610
           + +EIYD+L
Sbjct: 628 QVEEIYDIL 634

BLAST of Cla97C04G070890.1 vs. TrEMBL
Match: tr|A0A2N9H6Z6|A0A2N9H6Z6_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS35445 PE=4 SV=1)

HSP 1 Score: 708.0 bits (1826), Expect = 1.9e-200
Identity = 356/609 (58.46%), Postives = 449/609 (73.73%), Query Frame = 0

Query: 3   WNSIIKSHFDSGLFLSALLLYKNMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMVHCVG 62
           WN IIKSH D GLF SALLLYK MR +GV HD FTFPIVN  ++S+  DV+Y  MVHCV 
Sbjct: 28  WNLIIKSHLDLGLFDSALLLYKTMRHLGVAHDSFTFPIVNQAVLSLQSDVIYGEMVHCVS 87

Query: 63  IRMGFSADLYFCNTMMEVYGKCGCLVYARNVFDEMPNRDLVSWTSMISAYVNGGDAVCAL 122
            +MGF  ++YFCNTM+EVY KCGC+VYAR +FDEM  RDLVSWTSMIS YV  G    A 
Sbjct: 88  TKMGFGFEVYFCNTMIEVYVKCGCVVYARKLFDEMSQRDLVSWTSMISGYVCEGSVGSAF 147

Query: 123 DLYEGMRRELEPNSVTVMVMLQACCVTRNLVLGRLLQCHVVKNGLLFDIGLQNSFLRMYS 182
            L+  M  + EPNSVT+++MLQACC   +L+ G  L  + +K+GL  D  LQNS L+MY+
Sbjct: 148 YLFREMMVKSEPNSVTLIIMLQACCAGESLIHGMQLHGYAIKSGLESDGSLQNSVLKMYT 207

Query: 183 QLGGEDEVGVIFSEIDCKNVVSWNILMSFYSSVGDILKVVDIFNKIMREVAFSIETLTML 242
           + G  +EV + FS+ID K+ VSWNIL+SFYS  GDI K+V+ F+++    A SIETLT+L
Sbjct: 208 RTGSVEEVEIFFSKIDRKDDVSWNILISFYSMKGDIEKLVNRFSEMQGIAALSIETLTLL 267

Query: 243 ISATASSDSGCLILGENLHSLAIKSGLYDGILRTSLLYMYAKFGELENSTRLFKEISNRS 302
           ISA A S    L  GE +H LAIKSG  D +L TSLL  YAK G++E S +LF++IS R+
Sbjct: 268 ISAFAKSRD--LFQGEQIHCLAIKSGFCDDVLLTSLLDFYAKCGKIEISDQLFRKISYRN 327

Query: 303 IITWGAMMSSFIQNGHFDEAVEIFKQMQAAGLKPSVGILKHLIDAYACLGALQLGKVIHC 362
            +T+GAMMS F+QNG+  +A+ +F QMQAA ++P   IL+ ++DAY  LGALQLGK IH 
Sbjct: 328 NVTFGAMMSGFVQNGYVKDAINLFHQMQAANVEPGAEILRSILDAYTQLGALQLGKAIHG 387

Query: 363 YLIRIYGLETC--NTHLETSLLNMYVRCGSIPSARKCFDLILIKDVVAWTSMIEGYGSHG 422
           Y IR     T    T+LE S+LNMY+RCG+I SAR  F  IL+KD+V WT+MIEG+G+HG
Sbjct: 388 YFIRHIFCRTMEETTYLEASILNMYIRCGNISSARVSFHNILVKDLVIWTTMIEGFGTHG 447

Query: 423 LGIDALNLFHQMMSEAVIPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLEHY 482
           LG +AL LF  M+ E + PN+VTFLSLLSACSHSGLV EGCE++ SM+  F I+P+L+HY
Sbjct: 448 LGSEALELFGLMLKERIKPNSVTFLSLLSACSHSGLVREGCEVYNSMKWIFGIQPNLDHY 507

Query: 483 TCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAARRLLELEP 542
           TC VDLL R  +++EA AI ++M    DGRIWGAL+ ACRV+GD K+  Y A+RLLELEP
Sbjct: 508 TCMVDLLGRYGKLKEALAIIVKMVIFSDGRIWGALLAACRVHGDIKLGEYTAQRLLELEP 567

Query: 543 NNVGYYTLLSNSQASAGQWHEVEKLRSVVYEKDLIKKPGWSFIELNGTIHGFVSGDTSHN 602
           +NVGY+TLLSN QA  G+W EVE++R V+ EKDL KKPGWS  E  G IHGFVS D SH+
Sbjct: 568 DNVGYHTLLSNVQAGVGRWDEVEEVRRVMNEKDLKKKPGWSCFEAKGMIHGFVSADRSHH 627

Query: 603 KTDEIYDLL 610
           + +EIYD+L
Sbjct: 628 QVEEIYDIL 634

BLAST of Cla97C04G070890.1 vs. TrEMBL
Match: tr|A0A2N9GL16|A0A2N9GL16_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS28020 PE=4 SV=1)

HSP 1 Score: 706.8 bits (1823), Expect = 4.2e-200
Identity = 355/609 (58.29%), Postives = 449/609 (73.73%), Query Frame = 0

Query: 3   WNSIIKSHFDSGLFLSALLLYKNMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMVHCVG 62
           WN IIKSH D GLF SALLLYK MR +GV HD FTFPIVN  ++S+  DV+Y  MVHCV 
Sbjct: 28  WNLIIKSHLDLGLFDSALLLYKTMRHLGVAHDSFTFPIVNQAVLSLQSDVIYGEMVHCVS 87

Query: 63  IRMGFSADLYFCNTMMEVYGKCGCLVYARNVFDEMPNRDLVSWTSMISAYVNGGDAVCAL 122
            +MGF  ++YFCNTM+EVY KCGC+VYAR +FDEM  RDLVSWTSMIS YV  G    A 
Sbjct: 88  TKMGFGFEVYFCNTMIEVYVKCGCVVYARKLFDEMSQRDLVSWTSMISGYVCEGSVGSAF 147

Query: 123 DLYEGMRRELEPNSVTVMVMLQACCVTRNLVLGRLLQCHVVKNGLLFDIGLQNSFLRMYS 182
            L+  M  + EPNSVT+++MLQACC   +L+ G  L  + +K+GL  D  LQNS L+MY+
Sbjct: 148 YLFREMMVKSEPNSVTLIIMLQACCAGESLIHGMQLHGYAIKSGLESDGSLQNSVLKMYT 207

Query: 183 QLGGEDEVGVIFSEIDCKNVVSWNILMSFYSSVGDILKVVDIFNKIMREVAFSIETLTML 242
           + G  +EV + FS+ID K+ VSWNIL+SFYS  GDI K+V+ F+++    A SIETLT+L
Sbjct: 208 RTGSVEEVEIFFSKIDRKDDVSWNILISFYSMKGDIEKLVNRFSEMQGIAALSIETLTLL 267

Query: 243 ISATASSDSGCLILGENLHSLAIKSGLYDGILRTSLLYMYAKFGELENSTRLFKEISNRS 302
           ISA A S    L  GE +H +AIKSG  D +L TSLL  YAK G++E S +LF++IS R+
Sbjct: 268 ISAFAKSRD--LFQGEQIHCVAIKSGFCDDVLLTSLLDFYAKCGKIEISDQLFRKISYRN 327

Query: 303 IITWGAMMSSFIQNGHFDEAVEIFKQMQAAGLKPSVGILKHLIDAYACLGALQLGKVIHC 362
            +T+GAMMS F+QNG+  +A+ +F QMQAA ++P   IL+ ++DAY  LGALQLGK IH 
Sbjct: 328 NVTFGAMMSGFVQNGYVKDAINLFHQMQAANVEPGAEILRSILDAYTQLGALQLGKAIHG 387

Query: 363 YLIRIYGLETC--NTHLETSLLNMYVRCGSIPSARKCFDLILIKDVVAWTSMIEGYGSHG 422
           Y IR     T    T+LE S+LNMY+RCG+I SAR  F  IL+KD+V WT+MIEG+G+HG
Sbjct: 388 YFIRHIFCRTMEETTYLEASILNMYIRCGNISSARVSFHNILVKDLVIWTTMIEGFGTHG 447

Query: 423 LGIDALNLFHQMMSEAVIPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLEHY 482
           LG +AL LF  M+ E + PN+VTFLSLLSACSHSGLV EGCE++ SM+  F I+P+L+HY
Sbjct: 448 LGSEALELFGLMLKERIKPNSVTFLSLLSACSHSGLVREGCEVYNSMKWIFGIQPNLDHY 507

Query: 483 TCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAARRLLELEP 542
           TC VDLL R  +++EA AI ++M    DGRIWGAL+ ACRV+GD K+  Y A+RLLELEP
Sbjct: 508 TCMVDLLGRYGKLKEALAIIVKMVIFSDGRIWGALLAACRVHGDIKLGEYTAQRLLELEP 567

Query: 543 NNVGYYTLLSNSQASAGQWHEVEKLRSVVYEKDLIKKPGWSFIELNGTIHGFVSGDTSHN 602
           +NVGY+TLLSN QA  G+W EVE++R V+ EKDL KKPGWS  E  G IHGFVS D SH+
Sbjct: 568 DNVGYHTLLSNVQAGVGRWDEVEEVRRVMNEKDLKKKPGWSCFEAKGMIHGFVSADRSHH 627

Query: 603 KTDEIYDLL 610
           + +EIYD+L
Sbjct: 628 QVEEIYDIL 634

BLAST of Cla97C04G070890.1 vs. TrEMBL
Match: tr|A0A2P6RTB7|A0A2P6RTB7_ROSCH (Putative pentatricopeptide OS=Rosa chinensis OX=74649 GN=RchiOBHm_Chr2g0124611 PE=4 SV=1)

HSP 1 Score: 690.3 bits (1780), Expect = 4.0e-195
Identity = 350/617 (56.73%), Postives = 449/617 (72.77%), Query Frame = 0

Query: 1   MLWNSIIKSHFDSGLFLSALLLYKNMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMVHC 60
           MLWN II++H +SG   SALLLY+ MRE+GV HD FTFPIVN  ++ +  DV YAGMVH 
Sbjct: 46  MLWNLIIRTHIESGRLDSALLLYRKMRELGVSHDCFTFPIVNKAVLLVGGDVRYAGMVHS 105

Query: 61  VGIRMGFSADLYFCNTMMEVYGKCGCLVYARNVFDEMPNRDLVSWTSMISAYVNGGDAVC 120
           V I+MGF  D+YF NTM+EVY KCG L YAR +FDEMP+ DLVSWTSMIS YV+ G+   
Sbjct: 106 VAIQMGFGLDVYFGNTMIEVYVKCGNLSYARKLFDEMPDTDLVSWTSMISGYVSEGNVAS 165

Query: 121 ALDLYEGMRRELEPNSVTVMVMLQACCVTRNLVLGRLLQCHVVKNGLLFDIGLQNSFLRM 180
              L+  MR ELEPNSVT++VMLQACC  +  + GR +  +V+KNGLL +  +QNS LRM
Sbjct: 166 GFSLFSEMRMELEPNSVTMLVMLQACCGFQTSIYGRQVHGYVIKNGLLSNGAIQNSILRM 225

Query: 181 YSQLGGEDEVGVIFSEIDCKNVVSWNILMSFYSSVGDILKVVDIFNKIMREVAFSIETLT 240
           Y++LG  +EV   F E+D ++VVSWNI +S Y+S GD +KV D+FN++   VA SIETLT
Sbjct: 226 YAKLGTIEEVEDFFRELDRRDVVSWNICISSYTSRGDFVKVRDLFNEMQGGVAPSIETLT 285

Query: 241 MLISATASSDSGCLILGENLHSLAIKSGLYDGILRTSLLYMYAKFGELENSTRLFKEISN 300
           +++SA   +  G L  GE+LH LA+KSGL+D +L+TSLL  YAK G+LE+S +LF+E+ +
Sbjct: 286 IVLSAL--TKHGILSQGESLHGLAVKSGLHDDVLQTSLLDFYAKCGKLESSDKLFRELPD 345

Query: 301 RSIITWGAMMSSFIQNGHFDEAVEIFKQMQAAGLKPSVGILKHLIDAYACLGALQLGKVI 360
           R+ IT GAMM   I NG+F EAV +F+QMQAA                   GAL+LGK +
Sbjct: 346 RNCITCGAMMLGLIHNGYFTEAVGVFRQMQAA-------------------GALKLGKAV 405

Query: 361 HCYLIR--IYGLETCNTHLETSLLNMYVRCGSIPSARKCFDLILIKDVVAWTSMIEGYGS 420
           H Y+IR    G E   THLETS+LNMY+RCGSI ++R CF+ ++ KDVVAWTSMIEGYGS
Sbjct: 406 HGYIIRKSFCGTEEGLTHLETSILNMYIRCGSISTSRVCFNRMVFKDVVAWTSMIEGYGS 465

Query: 421 HGLGIDALNLFHQMMSEAVIPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLE 480
           HGLG +A  LF  M  E + PN+VTFLSLLSA SHSGLV+EGCE FY M+ RF I+PD++
Sbjct: 466 HGLGFEAAKLFDLMTREGIKPNSVTFLSLLSAYSHSGLVTEGCEAFYYMKWRFGIEPDID 525

Query: 481 HYTCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAARRLLEL 540
           HYTC VDLL RS +++EA  + L+M    D RIWGAL+   ++Y +  +  YAA+RLLEL
Sbjct: 526 HYTCVVDLLGRSGKLKEALVVILKMLAFPDSRIWGALLSGSKIYSNRALGQYAAQRLLEL 585

Query: 541 EPNNVGYYTLLSNSQASAGQWHEVEKLRSVVYEKDLIKKPGWSFIELNGTIHGFVSGDTS 600
           EP NVGY+TLLSN++AS G W EVE++R V+ E DL KKPGWS IE NG IHGFVSG  S
Sbjct: 586 EPGNVGYFTLLSNTEASVGHWDEVEEIRKVMKENDLKKKPGWSCIEANGVIHGFVSGGNS 641

Query: 601 HNKTDEIYDLLVYINRI 616
           H+  +EIY++L +++R+
Sbjct: 646 HHHIEEIYEVLGWLSRM 641

BLAST of Cla97C04G070890.1 vs. Swiss-Prot
Match: sp|P0C8Q2|PP323_ARATH (Pentatricopeptide repeat-containing protein At4g19191, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E1 PE=2 SV=1)

HSP 1 Score: 355.9 bits (912), Expect = 8.9e-97
Identity = 203/608 (33.39%), Postives = 329/608 (54.11%), Query Frame = 0

Query: 3   WNSIIKSHFDSGLFLSALLLYKNMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMVHCVG 62
           WN  I+   +    + +LLL++ M+  G E + FTFP V      +  DV    MVH   
Sbjct: 20  WNLQIREAVNRNDPVESLLLFREMKRGGFEPNNFTFPFVAKACARL-ADVGCCEMVHAHL 79

Query: 63  IRMGFSADLYFCNTMMEVYGKCGCLVYARNVFDEMPNRDLVSWTSMISAYVNGGDAVCAL 122
           I+  F +D++     ++++ KC  + YA  VF+ MP RD  +W +M+S +   G    A 
Sbjct: 80  IKSPFWSDVFVGTATVDMFVKCNSVDYAAKVFERMPERDATTWNAMLSGFCQSGHTDKAF 139

Query: 123 DLYEGMR-RELEPNSVTVMVMLQACCVTRNLVLGRLLQCHVVKNGLLFDIGLQNSFLRMY 182
            L+  MR  E+ P+SVTVM ++Q+    ++L L   +    ++ G+   + + N+++  Y
Sbjct: 140 SLFREMRLNEITPDSVTVMTLIQSASFEKSLKLLEAMHAVGIRLGVDVQVTVANTWISTY 199

Query: 183 SQLGGEDEVGVIFSEID--CKNVVSWNILMSFYSSVGDILKVVDIFNKIMREVAFSIETL 242
            + G  D   ++F  ID   + VVSWN +   YS  G+      ++  ++RE  F  +  
Sbjct: 200 GKCGDLDSAKLVFEAIDRGDRTVVSWNSMFKAYSVFGEAFDAFGLYCLMLRE-EFKPDLS 259

Query: 243 TMLISATASSDSGCLILGENLHSLAIKSGLYDGI-LRTSLLYMYAKFGELENSTRLFKEI 302
           T +  A +  +   L  G  +HS AI  G    I    + + MY+K  +  ++  LF  +
Sbjct: 260 TFINLAASCQNPETLTQGRLIHSHAIHLGTDQDIEAINTFISMYSKSEDTCSARLLFDIM 319

Query: 303 SNRSIITWGAMMSSFIQNGHFDEAVEIFKQMQAAGLKPSVGILKHLIDAYACLGALQLGK 362
           ++R+ ++W  M+S + + G  DEA+ +F  M  +G KP +  L  LI      G+L+ GK
Sbjct: 320 TSRTCVSWTVMISGYAEKGDMDEALALFHAMIKSGEKPDLVTLLSLISGCGKFGSLETGK 379

Query: 363 VIHCYLIRIYGLETCNTHLETSLLNMYVRCGSIPSARKCFDLILIKDVVAWTSMIEGYGS 422
            I      IYG +  N  +  +L++MY +CGSI  AR  FD    K VV WT+MI GY  
Sbjct: 380 WIDA-RADIYGCKRDNVMICNALIDMYSKCGSIHEARDIFDNTPEKTVVTWTTMIAGYAL 439

Query: 423 HGLGIDALNLFHQMMSEAVIPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLE 482
           +G+ ++AL LF +M+     PN++TFL++L AC+HSG + +G E F+ M+  +NI P L+
Sbjct: 440 NGIFLEALKLFSKMIDLDYKPNHITFLAVLQACAHSGSLEKGWEYFHIMKQVYNISPGLD 499

Query: 483 HYTCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAARRLLEL 542
           HY+C VDLL R  ++ EA  +   M+   D  IWGAL+ AC+++ + KIA  AA  L  L
Sbjct: 500 HYSCMVDLLGRKGKLEEALELIRNMSAKPDAGIWGALLNACKIHRNVKIAEQAAESLFNL 559

Query: 543 EPNNVGYYTLLSNSQASAGQWHEVEKLRSVVYEKDLIKKPGWSFIELNGTIHGFVSGDTS 602
           EP     Y  ++N  A+AG W    ++RS++ ++++ K PG S I++NG  H F  G+  
Sbjct: 560 EPQMAAPYVEMANIYAAAGMWDGFARIRSIMKQRNIKKYPGESVIQVNGKNHSFTVGEHG 619

Query: 603 HNKTDEIY 607
           H + + IY
Sbjct: 620 HVENEVIY 624

BLAST of Cla97C04G070890.1 vs. Swiss-Prot
Match: sp|O49619|PP350_ARATH (Pentatricopeptide repeat-containing protein At4g35130, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H27 PE=3 SV=1)

HSP 1 Score: 352.8 bits (904), Expect = 7.6e-96
Identity = 196/617 (31.77%), Postives = 336/617 (54.46%), Query Frame = 0

Query: 2   LWNSIIKSHFDSGLFLSALLLYKNMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMVHCV 61
           LWN +IK     GL++ A+  Y  M   GV+ D FT+P V   +  I   +     +H +
Sbjct: 97  LWNVMIKGFTSCGLYIEAVQFYSRMVFAGVKADTFTYPFVIKSVAGI-SSLEEGKKIHAM 156

Query: 62  GIRMGFSADLYFCNTMMEVYGKCGCLVYARNVFDEMPNR-DLVSWTSMISAYVNGGDAVC 121
            I++GF +D+Y CN+++ +Y K GC   A  VF+EMP R                     
Sbjct: 157 VIKLGFVSDVYVCNSLISLYMKLGCAWDAEKVFEEMPERXXXXXXXXXXXXXXXXXXXXX 216

Query: 122 ALDLYEGMRRELEPNSVTVMVMLQACCVTRNLVLGRLLQCHVVKNGL-LFDIGLQNSFLR 181
                  ++   +P+  + M  L AC    +  +G+ + CH V++ +   D+ +  S L 
Sbjct: 217 XXXXXXXLKCGFKPDRFSTMSALGACSHVYSPKMGKEIHCHAVRSRIETGDVMVMTSILD 276

Query: 182 MYSQLGGEDEVGVIFSEIDCKNVVSWNILMSFYSSVGDILKVVDIFNKIMREVAFSIETL 241
           MYS+ G       IF+ +  +N+V+WN+++  Y+  G +      F K+  +     + +
Sbjct: 277 MYSKYGEVSYAERIFNGMIQRNIVAWNVMIGCYARNGRVTDAFLCFQKMSEQNGLQPDVI 336

Query: 242 TMLISATASSDSGCLILGENLHSLAIKSG-LYDGILRTSLLYMYAKFGELENSTRLFKEI 301
           T +    AS+    ++ G  +H  A++ G L   +L T+L+ MY + G+L+++  +F  +
Sbjct: 337 TSINLLPASA----ILEGRTIHGYAMRRGFLPHMVLETALIDMYGECGQLKSAEVIFDRM 396

Query: 302 SNRSIITWGAMMSSFIQNGHFDEAVEIFKQMQAAGLKPSVGILKHLIDAYACLGALQLGK 361
           + +++I+W +++++++QNG    A+E+F+++  + L P    +  ++ AYA   +L  G+
Sbjct: 397 AEKNVISWNSIIAAYVQNGKNYSALELFQELWDSSLVPDSTTIASILPAYAESLSLSEGR 456

Query: 362 VIHCYLIRIYGLETCNTHLETSLLNMYVRCGSIPSARKCFDLILIKDVVAWTSMIEGYGS 421
            IH Y+++       NT +  SL++MY  CG +  ARKCF+ IL+KDVV+W S+I  Y  
Sbjct: 457 EIHAYIVK--SRYWSNTIILNSLVHMYAMCGDLEDARKCFNHILLKDVVSWNSIIMAYAV 516

Query: 422 HGLGIDALNLFHQMMSEAVIPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLE 481
           HG G  ++ LF +M++  V PN  TF SLL+ACS SG+V EG E F SM+  + I P +E
Sbjct: 517 HGFGRISVWLFSEMIASRVNPNKSTFASLLAACSISGMVDEGWEYFESMKREYGIDPGIE 576

Query: 482 HYTCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAARRLLEL 541
           HY C +DL+ R+     A      M  +   RIWG+L+ A R + D  IA +AA ++ ++
Sbjct: 577 HYGCMLDLIGRTGNFSAAKRFLEEMPFVPTARIWGSLLNASRNHKDITIAEFAAEQIFKM 636

Query: 542 EPNNVGYYTLLSNSQASAGQWHEVEKLRSVVYEKDLIKKPGWSFIELNGTIHGFVSGDTS 601
           E +N G Y LL N  A AG+W +V +++ ++  K + +    S +E  G  H F +GD S
Sbjct: 637 EHDNTGCYVLLLNMYAEAGRWEDVNRIKLLMESKGISRTSSRSTVEAKGKSHVFTNGDRS 696

Query: 602 HNKTDEIYDLLVYINRI 616
           H  T++IY++L  ++R+
Sbjct: 697 HVATNKIYEVLDVVSRM 706

BLAST of Cla97C04G070890.1 vs. Swiss-Prot
Match: sp|Q9M1V3|PP296_ARATH (Pentatricopeptide repeat-containing protein At3g63370, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H83 PE=2 SV=2)

HSP 1 Score: 344.7 bits (883), Expect = 2.1e-93
Identity = 201/618 (32.52%), Postives = 341/618 (55.18%), Query Frame = 0

Query: 1   MLWNSIIKSHFDSGLFLSALLLYKNMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGM-VH 60
           +LWNSI+ S+  SG  L  L L++ M   G   + +T  IV+ +           G  +H
Sbjct: 250 VLWNSILSSYSTSGKSLETLELFREMHMTGPAPNSYT--IVSALTACDGFSYAKLGKEIH 309

Query: 61  CVGIRMG-FSADLYFCNTMMEVYGKCGCLVYARNVFDEMPNRDLVSWTSMISAYVNGGDA 120
              ++    S++LY CN ++ +Y +CG +  A  +  +M N D+V+W S+I  YV     
Sbjct: 310 ASVLKSSTHSSELYVCNALIAMYTRCGKMPQAERILRQMNNADVVTWNSLIKGYVQNLMY 369

Query: 121 VCALDLYEGM-RRELEPNSVTVMVMLQACCVTRNLVLGRLLQCHVVKNGLLFDIGLQNSF 180
             AL+ +  M     + + V++  ++ A     NL+ G  L  +V+K+G   ++ + N+ 
Sbjct: 370 KEALEFFSDMIAAGHKSDEVSMTSIIAASGRLSNLLAGMELHAYVIKHGWDSNLQVGNTL 429

Query: 181 LRMYSQLGGEDEVGVIFSEIDCKNVVSWNILMSFYSSVGDILKVVDIFNKIMREVAFSIE 240
           + MYS+      +G  F  +  K+++SW  +++ Y+     ++ +++F  + ++    I+
Sbjct: 430 IDMYSKCNLTCYMGRAFLRMHDKDLISWTTVIAGYAQNDCHVEALELFRDVAKK-RMEID 489

Query: 241 TLTMLISATASSDSGCLILGENLHSLAIKSGLYDGILRTSLLYMYAKFGELENSTRLFKE 300
            + +     ASS    +++ + +H   ++ GL D +++  L+ +Y K   +  +TR+F+ 
Sbjct: 490 EMILGSILRASSVLKSMLIVKEIHCHILRKGLLDTVIQNELVDVYGKCRNMGYATRVFES 549

Query: 301 ISNRSIITWGAMMSSFIQNGHFDEAVEIFKQMQAAGLKPSVGILKHLIDAYACLGALQLG 360
           I  + +++W +M+SS   NG+  EAVE+F++M   GL      L  ++ A A L AL  G
Sbjct: 550 IKGKDVVSWTSMISSSALNGNESEAVELFRRMVETGLSADSVALLCILSAAASLSALNKG 609

Query: 361 KVIHCYLIRI-YGLETCNTHLETSLLNMYVRCGSIPSARKCFDLILIKDVVAWTSMIEGY 420
           + IHCYL+R  + LE     +  ++++MY  CG + SA+  FD I  K ++ +TSMI  Y
Sbjct: 610 REIHCYLLRKGFCLE---GSIAVAVVDMYACCGDLQSAKAVFDRIERKGLLQYTSMINAY 669

Query: 421 GSHGLGIDALNLFHQMMSEAVIPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPD 480
           G HG G  A+ LF +M  E V P++++FL+LL ACSH+GL+ EG      M   + ++P 
Sbjct: 670 GMHGCGKAAVELFDKMRHENVSPDHISFLALLYACSHAGLLDEGRGFLKIMEHEYELEPW 729

Query: 481 LEHYTCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAARRLL 540
            EHY C VD+L R+  V EAF     M       +W AL+ ACR + + +I   AA+RLL
Sbjct: 730 PEHYVCLVDMLGRANCVVEAFEFVKMMKTEPTAEVWCALLAACRSHSEKEIGEIAAQRLL 789

Query: 541 ELEPNNVGYYTLLSNSQASAGQWHEVEKLRSVVYEKDLIKKPGWSFIELNGTIHGFVSGD 600
           ELEP N G   L+SN  A  G+W++VEK+R+ +    + K PG S+IE++G +H F + D
Sbjct: 790 ELEPKNPGNLVLVSNVFAEQGRWNDVEKVRAKMKASGMEKHPGCSWIEMDGKVHKFTARD 849

Query: 601 TSHNKTDEIYDLLVYINR 615
            SH ++ EIY+ L  + R
Sbjct: 850 KSHPESKEIYEKLSEVTR 861

BLAST of Cla97C04G070890.1 vs. Swiss-Prot
Match: sp|Q3E6Q1|PPR32_ARATH (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 343.6 bits (880), Expect = 4.6e-93
Identity = 187/557 (33.57%), Postives = 319/557 (57.27%), Query Frame = 0

Query: 58  VHCVGIRMGFSADLYFCNTMMEVYGKCGCLVYARNVFDEMPNRDLVSWTSMISAYVNGGD 117
           +H + ++ GFS DL+    +  +Y KC  +  AR VFD MP RDLVSW ++++ Y   G 
Sbjct: 157 IHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGM 216

Query: 118 AVCALDLYEGMRRE-LEPNSVTVMVMLQACCVTRNLVLGRLLQCHVVKNGLLFDIGLQNS 177
           A  AL++ + M  E L+P+ +T++ +L A    R + +G+ +  + +++G    + +  +
Sbjct: 217 ARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTA 276

Query: 178 FLRMYSQLGGEDEVGVIFSEIDCKNVVSWNILMSFYSSVGDILKVVDIFNKIMREVAFSI 237
            + MY++ G  +    +F  +  +NVVSWN ++  Y    +  + + IF K++ E     
Sbjct: 277 LVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDE-GVKP 336

Query: 238 ETLTMLISATASSDSGCLILGENLHSLAIKSGLYDGI-LRTSLLYMYAKFGELENSTRLF 297
             ++++ +  A +D G L  G  +H L+++ GL   + +  SL+ MY K  E++ +  +F
Sbjct: 337 TDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMF 396

Query: 298 KEISNRSIITWGAMMSSFIQNGHFDEAVEIFKQMQAAGLKPSVGILKHLIDAYACLGALQ 357
            ++ +R++++W AM+  F QNG   +A+  F QM++  +KP       +I A A L    
Sbjct: 397 GKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITH 456

Query: 358 LGKVIHCYLIRIYGLETC---NTHLETSLLNMYVRCGSIPSARKCFDLILIKDVVAWTSM 417
             K IH  ++R     +C   N  + T+L++MY +CG+I  AR  FD++  + V  W +M
Sbjct: 457 HAKWIHGVVMR-----SCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAM 516

Query: 418 IEGYGSHGLGIDALNLFHQMMSEAVIPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFN 477
           I+GYG+HG G  AL LF +M    + PN VTFLS++SACSHSGLV  G + FY M+  ++
Sbjct: 517 IDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYS 576

Query: 478 IKPDLEHYTCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAA 537
           I+  ++HY   VDLL R+ R+ EA+   ++M       ++GA++GAC+++ +   A  AA
Sbjct: 577 IELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAA 636

Query: 538 RRLLELEPNNVGYYTLLSNSQASAGQWHEVEKLRSVVYEKDLIKKPGWSFIELNGTIHGF 597
            RL EL P++ GY+ LL+N   +A  W +V ++R  +  + L K PG S +E+   +H F
Sbjct: 637 ERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSF 696

Query: 598 VSGDTSHNKTDEIYDLL 610
            SG T+H  + +IY  L
Sbjct: 697 FSGSTAHPDSKKIYAFL 707

BLAST of Cla97C04G070890.1 vs. Swiss-Prot
Match: sp|O81767|PP348_ARATH (Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana OX=3702 GN=EMB2758 PE=3 SV=2)

HSP 1 Score: 339.3 bits (869), Expect = 8.7e-92
Identity = 196/608 (32.24%), Postives = 334/608 (54.93%), Query Frame = 0

Query: 3   WNSIIKSHFDSGLFLSALLLYK-NMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMVHCV 62
           WN +I  +  +G     +  +   M   G+  D  TFP V    +     V+    +HC+
Sbjct: 120 WNLMISGYGRAGNSSEVIRCFSLFMLSSGLTPDYRTFPSV----LKACRTVIDGNKIHCL 179

Query: 63  GIRMGFSADLYFCNTMMEVYGKCGCLVYARNVFDEMPNRDLVSWTSMISAYVNGGDAVCA 122
            ++ GF  D+Y   +++ +Y +   +  AR +FDEMP RD+ SW +MIS Y   G+A  A
Sbjct: 180 ALKFGFMWDVYVAASLIHLYSRYKAVGNARILFDEMPVRDMGSWNAMISGYCQSGNAKEA 239

Query: 123 LDLYEGMRRELEPNSVTVMVMLQACCVTRNLVLGRLLQCHVVKNGLLFDIGLQNSFLRMY 182
           L L  G+R     +SVTV+ +L AC    +   G  +  + +K+GL  ++ + N  + +Y
Sbjct: 240 LTLSNGLR---AMDSVTVVSLLSACTEAGDFNRGVTIHSYSIKHGLESELFVSNKLIDLY 299

Query: 183 SQLGGEDEVGVIFSEIDCKNVVSWNILMSFYSSVGDILKVVDIFNKIMREVAFSIETLTM 242
           ++ G   +   +F  +  ++++SWN ++  Y      L+ + +F + MR      + LT+
Sbjct: 300 AEFGRLRDCQKVFDRMYVRDLISWNSIIKAYELNEQPLRAISLFQE-MRLSRIQPDCLTL 359

Query: 243 LISATASSDSGCLILGENLHSLAIKSG--LYDGILRTSLLYMYAKFGELENSTRLFKEIS 302
           +  A+  S  G +    ++    ++ G  L D  +  +++ MYAK G ++++  +F  + 
Sbjct: 360 ISLASILSQLGDIRACRSVQGFTLRKGWFLEDITIGNAVVVMYAKLGLVDSARAVFNWLP 419

Query: 303 NRSIITWGAMMSSFIQNGHFDEAVEIFKQMQAAG-LKPSVGILKHLIDAYACLGALQLGK 362
           N  +I+W  ++S + QNG   EA+E++  M+  G +  + G    ++ A +  GAL+ G 
Sbjct: 420 NTDVISWNTIISGYAQNGFASEAIEMYNIMEEEGEIAANQGTWVSVLPACSQAGALRQGM 479

Query: 363 VIHCYLIRIYGLETCNTHLETSLLNMYVRCGSIPSARKCFDLILIKDVVAWTSMIEGYGS 422
            +H  L++  GL   +  + TSL +MY +CG +  A   F  I   + V W ++I  +G 
Sbjct: 480 KLHGRLLK-NGL-YLDVFVVTSLADMYGKCGRLEDALSLFYQIPRVNSVPWNTLIACHGF 539

Query: 423 HGLGIDALNLFHQMMSEAVIPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLE 482
           HG G  A+ LF +M+ E V P+++TF++LLSACSHSGLV EG   F  M++ + I P L+
Sbjct: 540 HGHGEKAVMLFKEMLDEGVKPDHITFVTLLSACSHSGLVDEGQWCFEMMQTDYGITPSLK 599

Query: 483 HYTCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAARRLLEL 542
           HY C VD+  R+ ++  A      M+   D  IWGAL+ ACRV+G+  +   A+  L E+
Sbjct: 600 HYGCMVDMYGRAGQLETALKFIKSMSLQPDASIWGALLSACRVHGNVDLGKIASEHLFEV 659

Query: 543 EPNNVGYYTLLSNSQASAGQWHEVEKLRSVVYEKDLIKKPGWSFIELNGTIHGFVSGDTS 602
           EP +VGY+ LLSN  ASAG+W  V+++RS+ + K L K PGWS +E++  +  F +G+ +
Sbjct: 660 EPEHVGYHVLLSNMYASAGKWEGVDEIRSIAHGKGLRKTPGWSSMEVDNKVEVFYTGNQT 717

Query: 603 HNKTDEIY 607
           H   +E+Y
Sbjct: 720 HPMYEEMY 717

BLAST of Cla97C04G070890.1 vs. TAIR10
Match: AT4G19191.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 355.9 bits (912), Expect = 5.0e-98
Identity = 203/608 (33.39%), Postives = 329/608 (54.11%), Query Frame = 0

Query: 3   WNSIIKSHFDSGLFLSALLLYKNMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMVHCVG 62
           WN  I+   +    + +LLL++ M+  G E + FTFP V      +  DV    MVH   
Sbjct: 20  WNLQIREAVNRNDPVESLLLFREMKRGGFEPNNFTFPFVAKACARL-ADVGCCEMVHAHL 79

Query: 63  IRMGFSADLYFCNTMMEVYGKCGCLVYARNVFDEMPNRDLVSWTSMISAYVNGGDAVCAL 122
           I+  F +D++     ++++ KC  + YA  VF+ MP RD  +W +M+S +   G    A 
Sbjct: 80  IKSPFWSDVFVGTATVDMFVKCNSVDYAAKVFERMPERDATTWNAMLSGFCQSGHTDKAF 139

Query: 123 DLYEGMR-RELEPNSVTVMVMLQACCVTRNLVLGRLLQCHVVKNGLLFDIGLQNSFLRMY 182
            L+  MR  E+ P+SVTVM ++Q+    ++L L   +    ++ G+   + + N+++  Y
Sbjct: 140 SLFREMRLNEITPDSVTVMTLIQSASFEKSLKLLEAMHAVGIRLGVDVQVTVANTWISTY 199

Query: 183 SQLGGEDEVGVIFSEID--CKNVVSWNILMSFYSSVGDILKVVDIFNKIMREVAFSIETL 242
            + G  D   ++F  ID   + VVSWN +   YS  G+      ++  ++RE  F  +  
Sbjct: 200 GKCGDLDSAKLVFEAIDRGDRTVVSWNSMFKAYSVFGEAFDAFGLYCLMLRE-EFKPDLS 259

Query: 243 TMLISATASSDSGCLILGENLHSLAIKSGLYDGI-LRTSLLYMYAKFGELENSTRLFKEI 302
           T +  A +  +   L  G  +HS AI  G    I    + + MY+K  +  ++  LF  +
Sbjct: 260 TFINLAASCQNPETLTQGRLIHSHAIHLGTDQDIEAINTFISMYSKSEDTCSARLLFDIM 319

Query: 303 SNRSIITWGAMMSSFIQNGHFDEAVEIFKQMQAAGLKPSVGILKHLIDAYACLGALQLGK 362
           ++R+ ++W  M+S + + G  DEA+ +F  M  +G KP +  L  LI      G+L+ GK
Sbjct: 320 TSRTCVSWTVMISGYAEKGDMDEALALFHAMIKSGEKPDLVTLLSLISGCGKFGSLETGK 379

Query: 363 VIHCYLIRIYGLETCNTHLETSLLNMYVRCGSIPSARKCFDLILIKDVVAWTSMIEGYGS 422
            I      IYG +  N  +  +L++MY +CGSI  AR  FD    K VV WT+MI GY  
Sbjct: 380 WIDA-RADIYGCKRDNVMICNALIDMYSKCGSIHEARDIFDNTPEKTVVTWTTMIAGYAL 439

Query: 423 HGLGIDALNLFHQMMSEAVIPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLE 482
           +G+ ++AL LF +M+     PN++TFL++L AC+HSG + +G E F+ M+  +NI P L+
Sbjct: 440 NGIFLEALKLFSKMIDLDYKPNHITFLAVLQACAHSGSLEKGWEYFHIMKQVYNISPGLD 499

Query: 483 HYTCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAARRLLEL 542
           HY+C VDLL R  ++ EA  +   M+   D  IWGAL+ AC+++ + KIA  AA  L  L
Sbjct: 500 HYSCMVDLLGRKGKLEEALELIRNMSAKPDAGIWGALLNACKIHRNVKIAEQAAESLFNL 559

Query: 543 EPNNVGYYTLLSNSQASAGQWHEVEKLRSVVYEKDLIKKPGWSFIELNGTIHGFVSGDTS 602
           EP     Y  ++N  A+AG W    ++RS++ ++++ K PG S I++NG  H F  G+  
Sbjct: 560 EPQMAAPYVEMANIYAAAGMWDGFARIRSIMKQRNIKKYPGESVIQVNGKNHSFTVGEHG 619

Query: 603 HNKTDEIY 607
           H + + IY
Sbjct: 620 HVENEVIY 624

BLAST of Cla97C04G070890.1 vs. TAIR10
Match: AT4G35130.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 352.8 bits (904), Expect = 4.2e-97
Identity = 196/617 (31.77%), Postives = 336/617 (54.46%), Query Frame = 0

Query: 2   LWNSIIKSHFDSGLFLSALLLYKNMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMVHCV 61
           LWN +IK     GL++ A+  Y  M   GV+ D FT+P V   +  I   +     +H +
Sbjct: 97  LWNVMIKGFTSCGLYIEAVQFYSRMVFAGVKADTFTYPFVIKSVAGI-SSLEEGKKIHAM 156

Query: 62  GIRMGFSADLYFCNTMMEVYGKCGCLVYARNVFDEMPNR-DLVSWTSMISAYVNGGDAVC 121
            I++GF +D+Y CN+++ +Y K GC   A  VF+EMP R                     
Sbjct: 157 VIKLGFVSDVYVCNSLISLYMKLGCAWDAEKVFEEMPERXXXXXXXXXXXXXXXXXXXXX 216

Query: 122 ALDLYEGMRRELEPNSVTVMVMLQACCVTRNLVLGRLLQCHVVKNGL-LFDIGLQNSFLR 181
                  ++   +P+  + M  L AC    +  +G+ + CH V++ +   D+ +  S L 
Sbjct: 217 XXXXXXXLKCGFKPDRFSTMSALGACSHVYSPKMGKEIHCHAVRSRIETGDVMVMTSILD 276

Query: 182 MYSQLGGEDEVGVIFSEIDCKNVVSWNILMSFYSSVGDILKVVDIFNKIMREVAFSIETL 241
           MYS+ G       IF+ +  +N+V+WN+++  Y+  G +      F K+  +     + +
Sbjct: 277 MYSKYGEVSYAERIFNGMIQRNIVAWNVMIGCYARNGRVTDAFLCFQKMSEQNGLQPDVI 336

Query: 242 TMLISATASSDSGCLILGENLHSLAIKSG-LYDGILRTSLLYMYAKFGELENSTRLFKEI 301
           T +    AS+    ++ G  +H  A++ G L   +L T+L+ MY + G+L+++  +F  +
Sbjct: 337 TSINLLPASA----ILEGRTIHGYAMRRGFLPHMVLETALIDMYGECGQLKSAEVIFDRM 396

Query: 302 SNRSIITWGAMMSSFIQNGHFDEAVEIFKQMQAAGLKPSVGILKHLIDAYACLGALQLGK 361
           + +++I+W +++++++QNG    A+E+F+++  + L P    +  ++ AYA   +L  G+
Sbjct: 397 AEKNVISWNSIIAAYVQNGKNYSALELFQELWDSSLVPDSTTIASILPAYAESLSLSEGR 456

Query: 362 VIHCYLIRIYGLETCNTHLETSLLNMYVRCGSIPSARKCFDLILIKDVVAWTSMIEGYGS 421
            IH Y+++       NT +  SL++MY  CG +  ARKCF+ IL+KDVV+W S+I  Y  
Sbjct: 457 EIHAYIVK--SRYWSNTIILNSLVHMYAMCGDLEDARKCFNHILLKDVVSWNSIIMAYAV 516

Query: 422 HGLGIDALNLFHQMMSEAVIPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLE 481
           HG G  ++ LF +M++  V PN  TF SLL+ACS SG+V EG E F SM+  + I P +E
Sbjct: 517 HGFGRISVWLFSEMIASRVNPNKSTFASLLAACSISGMVDEGWEYFESMKREYGIDPGIE 576

Query: 482 HYTCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAARRLLEL 541
           HY C +DL+ R+     A      M  +   RIWG+L+ A R + D  IA +AA ++ ++
Sbjct: 577 HYGCMLDLIGRTGNFSAAKRFLEEMPFVPTARIWGSLLNASRNHKDITIAEFAAEQIFKM 636

Query: 542 EPNNVGYYTLLSNSQASAGQWHEVEKLRSVVYEKDLIKKPGWSFIELNGTIHGFVSGDTS 601
           E +N G Y LL N  A AG+W +V +++ ++  K + +    S +E  G  H F +GD S
Sbjct: 637 EHDNTGCYVLLLNMYAEAGRWEDVNRIKLLMESKGISRTSSRSTVEAKGKSHVFTNGDRS 696

Query: 602 HNKTDEIYDLLVYINRI 616
           H  T++IY++L  ++R+
Sbjct: 697 HVATNKIYEVLDVVSRM 706

BLAST of Cla97C04G070890.1 vs. TAIR10
Match: AT3G63370.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 344.7 bits (883), Expect = 1.1e-94
Identity = 201/618 (32.52%), Postives = 341/618 (55.18%), Query Frame = 0

Query: 1   MLWNSIIKSHFDSGLFLSALLLYKNMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGM-VH 60
           +LWNSI+ S+  SG  L  L L++ M   G   + +T  IV+ +           G  +H
Sbjct: 250 VLWNSILSSYSTSGKSLETLELFREMHMTGPAPNSYT--IVSALTACDGFSYAKLGKEIH 309

Query: 61  CVGIRMG-FSADLYFCNTMMEVYGKCGCLVYARNVFDEMPNRDLVSWTSMISAYVNGGDA 120
              ++    S++LY CN ++ +Y +CG +  A  +  +M N D+V+W S+I  YV     
Sbjct: 310 ASVLKSSTHSSELYVCNALIAMYTRCGKMPQAERILRQMNNADVVTWNSLIKGYVQNLMY 369

Query: 121 VCALDLYEGM-RRELEPNSVTVMVMLQACCVTRNLVLGRLLQCHVVKNGLLFDIGLQNSF 180
             AL+ +  M     + + V++  ++ A     NL+ G  L  +V+K+G   ++ + N+ 
Sbjct: 370 KEALEFFSDMIAAGHKSDEVSMTSIIAASGRLSNLLAGMELHAYVIKHGWDSNLQVGNTL 429

Query: 181 LRMYSQLGGEDEVGVIFSEIDCKNVVSWNILMSFYSSVGDILKVVDIFNKIMREVAFSIE 240
           + MYS+      +G  F  +  K+++SW  +++ Y+     ++ +++F  + ++    I+
Sbjct: 430 IDMYSKCNLTCYMGRAFLRMHDKDLISWTTVIAGYAQNDCHVEALELFRDVAKK-RMEID 489

Query: 241 TLTMLISATASSDSGCLILGENLHSLAIKSGLYDGILRTSLLYMYAKFGELENSTRLFKE 300
            + +     ASS    +++ + +H   ++ GL D +++  L+ +Y K   +  +TR+F+ 
Sbjct: 490 EMILGSILRASSVLKSMLIVKEIHCHILRKGLLDTVIQNELVDVYGKCRNMGYATRVFES 549

Query: 301 ISNRSIITWGAMMSSFIQNGHFDEAVEIFKQMQAAGLKPSVGILKHLIDAYACLGALQLG 360
           I  + +++W +M+SS   NG+  EAVE+F++M   GL      L  ++ A A L AL  G
Sbjct: 550 IKGKDVVSWTSMISSSALNGNESEAVELFRRMVETGLSADSVALLCILSAAASLSALNKG 609

Query: 361 KVIHCYLIRI-YGLETCNTHLETSLLNMYVRCGSIPSARKCFDLILIKDVVAWTSMIEGY 420
           + IHCYL+R  + LE     +  ++++MY  CG + SA+  FD I  K ++ +TSMI  Y
Sbjct: 610 REIHCYLLRKGFCLE---GSIAVAVVDMYACCGDLQSAKAVFDRIERKGLLQYTSMINAY 669

Query: 421 GSHGLGIDALNLFHQMMSEAVIPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPD 480
           G HG G  A+ LF +M  E V P++++FL+LL ACSH+GL+ EG      M   + ++P 
Sbjct: 670 GMHGCGKAAVELFDKMRHENVSPDHISFLALLYACSHAGLLDEGRGFLKIMEHEYELEPW 729

Query: 481 LEHYTCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAARRLL 540
            EHY C VD+L R+  V EAF     M       +W AL+ ACR + + +I   AA+RLL
Sbjct: 730 PEHYVCLVDMLGRANCVVEAFEFVKMMKTEPTAEVWCALLAACRSHSEKEIGEIAAQRLL 789

Query: 541 ELEPNNVGYYTLLSNSQASAGQWHEVEKLRSVVYEKDLIKKPGWSFIELNGTIHGFVSGD 600
           ELEP N G   L+SN  A  G+W++VEK+R+ +    + K PG S+IE++G +H F + D
Sbjct: 790 ELEPKNPGNLVLVSNVFAEQGRWNDVEKVRAKMKASGMEKHPGCSWIEMDGKVHKFTARD 849

Query: 601 TSHNKTDEIYDLLVYINR 615
            SH ++ EIY+ L  + R
Sbjct: 850 KSHPESKEIYEKLSEVTR 861

BLAST of Cla97C04G070890.1 vs. TAIR10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 343.6 bits (880), Expect = 2.5e-94
Identity = 187/557 (33.57%), Postives = 319/557 (57.27%), Query Frame = 0

Query: 58  VHCVGIRMGFSADLYFCNTMMEVYGKCGCLVYARNVFDEMPNRDLVSWTSMISAYVNGGD 117
           +H + ++ GFS DL+    +  +Y KC  +  AR VFD MP RDLVSW ++++ Y   G 
Sbjct: 157 IHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGM 216

Query: 118 AVCALDLYEGMRRE-LEPNSVTVMVMLQACCVTRNLVLGRLLQCHVVKNGLLFDIGLQNS 177
           A  AL++ + M  E L+P+ +T++ +L A    R + +G+ +  + +++G    + +  +
Sbjct: 217 ARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTA 276

Query: 178 FLRMYSQLGGEDEVGVIFSEIDCKNVVSWNILMSFYSSVGDILKVVDIFNKIMREVAFSI 237
            + MY++ G  +    +F  +  +NVVSWN ++  Y    +  + + IF K++ E     
Sbjct: 277 LVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDE-GVKP 336

Query: 238 ETLTMLISATASSDSGCLILGENLHSLAIKSGLYDGI-LRTSLLYMYAKFGELENSTRLF 297
             ++++ +  A +D G L  G  +H L+++ GL   + +  SL+ MY K  E++ +  +F
Sbjct: 337 TDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMF 396

Query: 298 KEISNRSIITWGAMMSSFIQNGHFDEAVEIFKQMQAAGLKPSVGILKHLIDAYACLGALQ 357
            ++ +R++++W AM+  F QNG   +A+  F QM++  +KP       +I A A L    
Sbjct: 397 GKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITH 456

Query: 358 LGKVIHCYLIRIYGLETC---NTHLETSLLNMYVRCGSIPSARKCFDLILIKDVVAWTSM 417
             K IH  ++R     +C   N  + T+L++MY +CG+I  AR  FD++  + V  W +M
Sbjct: 457 HAKWIHGVVMR-----SCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAM 516

Query: 418 IEGYGSHGLGIDALNLFHQMMSEAVIPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFN 477
           I+GYG+HG G  AL LF +M    + PN VTFLS++SACSHSGLV  G + FY M+  ++
Sbjct: 517 IDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYS 576

Query: 478 IKPDLEHYTCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAA 537
           I+  ++HY   VDLL R+ R+ EA+   ++M       ++GA++GAC+++ +   A  AA
Sbjct: 577 IELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAA 636

Query: 538 RRLLELEPNNVGYYTLLSNSQASAGQWHEVEKLRSVVYEKDLIKKPGWSFIELNGTIHGF 597
            RL EL P++ GY+ LL+N   +A  W +V ++R  +  + L K PG S +E+   +H F
Sbjct: 637 ERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSF 696

Query: 598 VSGDTSHNKTDEIYDLL 610
            SG T+H  + +IY  L
Sbjct: 697 FSGSTAHPDSKKIYAFL 707

BLAST of Cla97C04G070890.1 vs. TAIR10
Match: AT4G33990.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 339.3 bits (869), Expect = 4.8e-93
Identity = 196/608 (32.24%), Postives = 334/608 (54.93%), Query Frame = 0

Query: 3   WNSIIKSHFDSGLFLSALLLYK-NMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMVHCV 62
           WN +I  +  +G     +  +   M   G+  D  TFP V    +     V+    +HC+
Sbjct: 120 WNLMISGYGRAGNSSEVIRCFSLFMLSSGLTPDYRTFPSV----LKACRTVIDGNKIHCL 179

Query: 63  GIRMGFSADLYFCNTMMEVYGKCGCLVYARNVFDEMPNRDLVSWTSMISAYVNGGDAVCA 122
            ++ GF  D+Y   +++ +Y +   +  AR +FDEMP RD+ SW +MIS Y   G+A  A
Sbjct: 180 ALKFGFMWDVYVAASLIHLYSRYKAVGNARILFDEMPVRDMGSWNAMISGYCQSGNAKEA 239

Query: 123 LDLYEGMRRELEPNSVTVMVMLQACCVTRNLVLGRLLQCHVVKNGLLFDIGLQNSFLRMY 182
           L L  G+R     +SVTV+ +L AC    +   G  +  + +K+GL  ++ + N  + +Y
Sbjct: 240 LTLSNGLR---AMDSVTVVSLLSACTEAGDFNRGVTIHSYSIKHGLESELFVSNKLIDLY 299

Query: 183 SQLGGEDEVGVIFSEIDCKNVVSWNILMSFYSSVGDILKVVDIFNKIMREVAFSIETLTM 242
           ++ G   +   +F  +  ++++SWN ++  Y      L+ + +F + MR      + LT+
Sbjct: 300 AEFGRLRDCQKVFDRMYVRDLISWNSIIKAYELNEQPLRAISLFQE-MRLSRIQPDCLTL 359

Query: 243 LISATASSDSGCLILGENLHSLAIKSG--LYDGILRTSLLYMYAKFGELENSTRLFKEIS 302
           +  A+  S  G +    ++    ++ G  L D  +  +++ MYAK G ++++  +F  + 
Sbjct: 360 ISLASILSQLGDIRACRSVQGFTLRKGWFLEDITIGNAVVVMYAKLGLVDSARAVFNWLP 419

Query: 303 NRSIITWGAMMSSFIQNGHFDEAVEIFKQMQAAG-LKPSVGILKHLIDAYACLGALQLGK 362
           N  +I+W  ++S + QNG   EA+E++  M+  G +  + G    ++ A +  GAL+ G 
Sbjct: 420 NTDVISWNTIISGYAQNGFASEAIEMYNIMEEEGEIAANQGTWVSVLPACSQAGALRQGM 479

Query: 363 VIHCYLIRIYGLETCNTHLETSLLNMYVRCGSIPSARKCFDLILIKDVVAWTSMIEGYGS 422
            +H  L++  GL   +  + TSL +MY +CG +  A   F  I   + V W ++I  +G 
Sbjct: 480 KLHGRLLK-NGL-YLDVFVVTSLADMYGKCGRLEDALSLFYQIPRVNSVPWNTLIACHGF 539

Query: 423 HGLGIDALNLFHQMMSEAVIPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLE 482
           HG G  A+ LF +M+ E V P+++TF++LLSACSHSGLV EG   F  M++ + I P L+
Sbjct: 540 HGHGEKAVMLFKEMLDEGVKPDHITFVTLLSACSHSGLVDEGQWCFEMMQTDYGITPSLK 599

Query: 483 HYTCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAARRLLEL 542
           HY C VD+  R+ ++  A      M+   D  IWGAL+ ACRV+G+  +   A+  L E+
Sbjct: 600 HYGCMVDMYGRAGQLETALKFIKSMSLQPDASIWGALLSACRVHGNVDLGKIASEHLFEV 659

Query: 543 EPNNVGYYTLLSNSQASAGQWHEVEKLRSVVYEKDLIKKPGWSFIELNGTIHGFVSGDTS 602
           EP +VGY+ LLSN  ASAG+W  V+++RS+ + K L K PGWS +E++  +  F +G+ +
Sbjct: 660 EPEHVGYHVLLSNMYASAGKWEGVDEIRSIAHGKGLRKTPGWSSMEVDNKVEVFYTGNQT 717

Query: 603 HNKTDEIY 607
           H   +E+Y
Sbjct: 720 HPMYEEMY 717

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_008457591.10.0e+0088.80PREDICTED: pentatricopeptide repeat-containing protein At4g35130, chloroplastic ... [more]
XP_023526509.10.0e+0086.71pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like [Cucur... [more]
XP_022932989.10.0e+0086.50pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like [Cucur... [more]
XP_022967941.12.8e-30985.41pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like [Cucur... [more]
XP_022153922.12.3e-29883.93pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like [Momor... [more]
Match NameE-valueIdentityDescription
tr|A0A1S3C6I7|A0A1S3C6I7_CUCME0.0e+0088.80pentatricopeptide repeat-containing protein At4g35130, chloroplastic OS=Cucumis ... [more]
tr|A0A2N9FID5|A0A2N9FID5_FAGSY1.9e-20058.46Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS14662 PE=4 SV=1[more]
tr|A0A2N9H6Z6|A0A2N9H6Z6_FAGSY1.9e-20058.46Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS35445 PE=4 SV=1[more]
tr|A0A2N9GL16|A0A2N9GL16_FAGSY4.2e-20058.29Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS28020 PE=4 SV=1[more]
tr|A0A2P6RTB7|A0A2P6RTB7_ROSCH4.0e-19556.73Putative pentatricopeptide OS=Rosa chinensis OX=74649 GN=RchiOBHm_Chr2g0124611 P... [more]
Match NameE-valueIdentityDescription
sp|P0C8Q2|PP323_ARATH8.9e-9733.39Pentatricopeptide repeat-containing protein At4g19191, mitochondrial OS=Arabidop... [more]
sp|O49619|PP350_ARATH7.6e-9631.77Pentatricopeptide repeat-containing protein At4g35130, chloroplastic OS=Arabidop... [more]
sp|Q9M1V3|PP296_ARATH2.1e-9332.52Pentatricopeptide repeat-containing protein At3g63370, chloroplastic OS=Arabidop... [more]
sp|Q3E6Q1|PPR32_ARATH4.6e-9333.57Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop... [more]
sp|O81767|PP348_ARATH8.7e-9232.24Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
AT4G19191.15.0e-9833.39Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT4G35130.14.2e-9731.77Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G63370.11.1e-9432.52Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G11290.12.5e-9433.57Pentatricopeptide repeat (PPR) superfamily protein[more]
AT4G33990.14.8e-9332.24Tetratricopeptide repeat (TPR)-like superfamily protein[more]
The following terms have been associated with this mRNA:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Cla97C04G070890Cla97C04G070890gene


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cla97C04G070890.1.CDS.1Cla97C04G070890.1.CDS.1CDS


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cla97C04G070890.1.exon.1Cla97C04G070890.1.exon.1exon


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Cla97C04G070890.1Cla97C04G070890.1-proteinpolypeptide


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 302..338
e-value: 9.8E-8
score: 32.0
coord: 101..147
e-value: 1.8E-8
score: 34.3
coord: 404..452
e-value: 2.2E-9
score: 37.3
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 407..440
e-value: 3.7E-5
score: 21.6
coord: 74..102
e-value: 2.9E-4
score: 18.8
coord: 2..34
e-value: 0.0022
score: 16.0
coord: 103..130
e-value: 1.7E-4
score: 19.5
coord: 203..230
e-value: 0.0013
score: 16.7
coord: 304..338
e-value: 2.9E-9
score: 34.5
coord: 442..476
e-value: 9.9E-5
score: 20.3
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 74..100
e-value: 7.5E-4
score: 19.5
coord: 203..230
e-value: 0.0051
score: 16.9
coord: 3..31
e-value: 0.018
score: 15.1
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 302..336
score: 12.923
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 1..33
score: 8.265
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 542..576
score: 5.525
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 201..236
score: 8.21
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 374..404
score: 5.733
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 105..131
score: 5.546
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 70..104
score: 9.229
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 271..301
score: 6.577
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 405..439
score: 10.852
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 135..169
score: 5.349
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 440..470
score: 8.418
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 476..506
score: 6.051
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 1..159
e-value: 6.8E-23
score: 83.5
coord: 371..582
e-value: 1.1E-31
score: 112.3
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 267..367
e-value: 3.6E-15
score: 57.8
coord: 160..252
e-value: 8.9E-7
score: 30.4
NoneNo IPR availablePANTHERPTHR24015:SF1617SUBFAMILY NOT NAMEDcoord: 60..584
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 60..584