Cla97C10G199660 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C10G199660
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Pentatricopeptide repeat-containing protein
Location: Cla97Chr10: 29632247 .. 29634070 (+)
RNA-Seq Expression: Cla97C10G199660
Synteny: Cla97C10G199660
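
The location given above can be turned into a feature length with a one-line calculation. The sketch below assumes the coordinates are 1-based and inclusive (the usual GFF-style convention; the page itself does not state this), and `feature_length` is a hypothetical helper name, not part of any database API.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field of this record.
length = feature_length(29632247, 29634070)
print(length)      # 1824
print(length % 3)  # 0
```

Under that assumption the span is 1824 bp, a multiple of three, which is what one would expect if the whole span is a single open reading frame.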
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAATTCCCAAGAAATCCACCTTCCTCACACCCTCCATTCCTGCAAATCCGTAACTCACCTCAAACAAATCCATGGCGTCGCCATTAAAACACCCTCTCTCTCTCTCCCCAACAAATTATTCTTTCCCAAACTTATCTCTCTCTCTTCCTCCTCCCCTGACCTTTTCTACATTCGCTCCATCCTTCTCACCCAATCCTCCGATGCTCAATTCCGCCTCAATCTCTGCAACGCCATCATCCGCAGCATTTCTGCCAACTCCACCCATCTCACGGCCATGGAATTCTTGAGGGAAATGCTTCTAATCGGCCTCGAACCCGATGGGTTCACATTGCCGCATGTTCTCAAGGCATTGGCTTGGATTCAGGGGATTAGAGAAGGCCAACAGATTCACGCTCGTTCAATCAAGACTGGAATGGTGCGATTCAATGTTTATGTGAGTAACACACTGATGAGACTCTATTCCGTCTGTGGCTCTATCGATGATGTCCAGAAGGTGTTCGACGAATGTCCTCACCGAGACTTAGTGTCTTGGACCACGCTCATTCAAGCATTTACGAAGGCTGGGCAATATAGGAGAGCAGTTGGAGCTTTTATGGAAATGTGTGATTTGAAACTAAGGGCCGATGGGCGGACTCTGGTGGTTGTCCTCTCAGCTTGCTCCAACTTGGGAGACGTGAATTTGGGTCGAAAGGTACATTCCTATATCCGTCATCACATTGACACGAAAGCAGATGTATTTGTTGGTAATGCCTTGATTGATATGTACTTGAAATGTGATGATTTGAACTCAGCTAACAAAGTGTTCAACGAAATGTCTGTGAGAAATGTGGTTACCTGGAATGCTATGATTTCGGGATTGGCTTTCCAAGGCCGGTATAGGGAAGCTCTAGATACGTTCCGTATGATGCAAAGCAAAGGGCCCAAGCCAGATGAGGTGACCTTAGTGGGGGTTCTGAACTCTTGCGCAAACCTTGGAGTTCTTGAGCTGGGTAAGTGGGTTCATGCATACATACGTAGAAATCATATTTTAGCTGATAAATTTGTTGGGAATGCGCTTCTAGATATGTATGCAAAATGTGGAAGAATAGACGAAGCTTTTAGGGTGTTTGAGAACATGAAAAGGAGGGATGTATATTCATACACTGCCATGATTGTTGGGTTGGCCTTGCATGGTGAAGCAAACTGGGCATTTCAGGTCTTCTCCGAGATGTTAAGAGTCGGTATCGAGCCAAATGAAGTAACATTTTTAGGTCTTCTTATGGCTTGTAGCCATGGCGGATTGGTTGCTGAAGGCAAGAAGTACTTTTTTGACATGTCAAATATATATAAGCTTAGACCTCATGTGGAGCATTATGGCTGCATGATTGACCTTCTTGGTCGTGCAGGGTTTGTGAAGGAAGCAGAAGAGATTGTCCACAAAATGGAAATCAGGCCAGATGCCTTTGCTTGGGGAGCTCTATTAGGGGCTTGCAAGATTCATGGAAATGTGGATATCGGCGAAAGTGTGATGCAAAAATTGACTGATTTAGATCCTGATGAAGATGCTACTTACATTCTTATGACGAATTTATATTCTTCAGTTCATAGATGGAAAGAAGCATTGAAATTAAGAAAGACGATGAAAAGTAAGAAGATGAGAAAGATTCCTGGATGTAGTTTGATTGAAGTTGATGGTGTTGTTCATGAGTTTCGAAAGGGTGATAAGTCACATCCAAAAAGCAAAGTTATATACTCAGTGTTGGAAGGAATTGCTACTCACTTAAAGAGCTCTGGGATTGTGGAACATAGTGCATTTTCCTTGCAGTCCAAACCATCATTTTAG

mRNA sequence

ATGAATTCCCAAGAAATCCACCTTCCTCACACCCTCCATTCCTGCAAATCCGTAACTCACCTCAAACAAATCCATGGCGTCGCCATTAAAACACCCTCTCTCTCTCTCCCCAACAAATTATTCTTTCCCAAACTTATCTCTCTCTCTTCCTCCTCCCCTGACCTTTTCTACATTCGCTCCATCCTTCTCACCCAATCCTCCGATGCTCAATTCCGCCTCAATCTCTGCAACGCCATCATCCGCAGCATTTCTGCCAACTCCACCCATCTCACGGCCATGGAATTCTTGAGGGAAATGCTTCTAATCGGCCTCGAACCCGATGGGTTCACATTGCCGCATGTTCTCAAGGCATTGGCTTGGATTCAGGGGATTAGAGAAGGCCAACAGATTCACGCTCGTTCAATCAAGACTGGAATGGTGCGATTCAATGTTTATGTGAGTAACACACTGATGAGACTCTATTCCGTCTGTGGCTCTATCGATGATGTCCAGAAGGTGTTCGACGAATGTCCTCACCGAGACTTAGTGTCTTGGACCACGCTCATTCAAGCATTTACGAAGGCTGGGCAATATAGGAGAGCAGTTGGAGCTTTTATGGAAATGTGTGATTTGAAACTAAGGGCCGATGGGCGGACTCTGGTGGTTGTCCTCTCAGCTTGCTCCAACTTGGGAGACGTGAATTTGGGTCGAAAGGTACATTCCTATATCCGTCATCACATTGACACGAAAGCAGATGTATTTGTTGGTAATGCCTTGATTGATATGTACTTGAAATGTGATGATTTGAACTCAGCTAACAAAGTGTTCAACGAAATGTCTGTGAGAAATGTGGTTACCTGGAATGCTATGATTTCGGGATTGGCTTTCCAAGGCCGGTATAGGGAAGCTCTAGATACGTTCCGTATGATGCAAAGCAAAGGGCCCAAGCCAGATGAGGTGACCTTAGTGGGGGTTCTGAACTCTTGCGCAAACCTTGGAGTTCTTGAGCTGGGTAAGTGGGTTCATGCATACATACGTAGAAATCATATTTTAGCTGATAAATTTGTTGGGAATGCGCTTCTAGATATGTATGCAAAATGTGGAAGAATAGACGAAGCTTTTAGGGTGTTTGAGAACATGAAAAGGAGGGATGTATATTCATACACTGCCATGATTGTTGGGTTGGCCTTGCATGGTGAAGCAAACTGGGCATTTCAGGTCTTCTCCGAGATGTTAAGAGTCGGTATCGAGCCAAATGAAGTAACATTTTTAGGTCTTCTTATGGCTTGTAGCCATGGCGGATTGGTTGCTGAAGGCAAGAAGTACTTTTTTGACATGTCAAATATATATAAGCTTAGACCTCATGTGGAGCATTATGGCTGCATGATTGACCTTCTTGGTCGTGCAGGGTTTGTGAAGGAAGCAGAAGAGATTGTCCACAAAATGGAAATCAGGCCAGATGCCTTTGCTTGGGGAGCTCTATTAGGGGCTTGCAAGATTCATGGAAATGTGGATATCGGCGAAAGTGTGATGCAAAAATTGACTGATTTAGATCCTGATGAAGATGCTACTTACATTCTTATGACGAATTTATATTCTTCAGTTCATAGATGGAAAGAAGCATTGAAATTAAGAAAGACGATGAAAAGTAAGAAGATGAGAAAGATTCCTGGATGTAGTTTGATTGAAGTTGATGGTGTTGTTCATGAGTTTCGAAAGGGTGATAAGTCACATCCAAAAAGCAAAGTTATATACTCAGTGTTGGAAGGAATTGCTACTCACTTAAAGAGCTCTGGGATTGTGGAACATAGTGCATTTTCCTTGCAGTCCAAACCATCATTTTAG

Coding sequence (CDS)

ATGAATTCCCAAGAAATCCACCTTCCTCACACCCTCCATTCCTGCAAATCCGTAACTCACCTCAAACAAATCCATGGCGTCGCCATTAAAACACCCTCTCTCTCTCTCCCCAACAAATTATTCTTTCCCAAACTTATCTCTCTCTCTTCCTCCTCCCCTGACCTTTTCTACATTCGCTCCATCCTTCTCACCCAATCCTCCGATGCTCAATTCCGCCTCAATCTCTGCAACGCCATCATCCGCAGCATTTCTGCCAACTCCACCCATCTCACGGCCATGGAATTCTTGAGGGAAATGCTTCTAATCGGCCTCGAACCCGATGGGTTCACATTGCCGCATGTTCTCAAGGCATTGGCTTGGATTCAGGGGATTAGAGAAGGCCAACAGATTCACGCTCGTTCAATCAAGACTGGAATGGTGCGATTCAATGTTTATGTGAGTAACACACTGATGAGACTCTATTCCGTCTGTGGCTCTATCGATGATGTCCAGAAGGTGTTCGACGAATGTCCTCACCGAGACTTAGTGTCTTGGACCACGCTCATTCAAGCATTTACGAAGGCTGGGCAATATAGGAGAGCAGTTGGAGCTTTTATGGAAATGTGTGATTTGAAACTAAGGGCCGATGGGCGGACTCTGGTGGTTGTCCTCTCAGCTTGCTCCAACTTGGGAGACGTGAATTTGGGTCGAAAGGTACATTCCTATATCCGTCATCACATTGACACGAAAGCAGATGTATTTGTTGGTAATGCCTTGATTGATATGTACTTGAAATGTGATGATTTGAACTCAGCTAACAAAGTGTTCAACGAAATGTCTGTGAGAAATGTGGTTACCTGGAATGCTATGATTTCGGGATTGGCTTTCCAAGGCCGGTATAGGGAAGCTCTAGATACGTTCCGTATGATGCAAAGCAAAGGGCCCAAGCCAGATGAGGTGACCTTAGTGGGGGTTCTGAACTCTTGCGCAAACCTTGGAGTTCTTGAGCTGGGTAAGTGGGTTCATGCATACATACGTAGAAATCATATTTTAGCTGATAAATTTGTTGGGAATGCGCTTCTAGATATGTATGCAAAATGTGGAAGAATAGACGAAGCTTTTAGGGTGTTTGAGAACATGAAAAGGAGGGATGTATATTCATACACTGCCATGATTGTTGGGTTGGCCTTGCATGGTGAAGCAAACTGGGCATTTCAGGTCTTCTCCGAGATGTTAAGAGTCGGTATCGAGCCAAATGAAGTAACATTTTTAGGTCTTCTTATGGCTTGTAGCCATGGCGGATTGGTTGCTGAAGGCAAGAAGTACTTTTTTGACATGTCAAATATATATAAGCTTAGACCTCATGTGGAGCATTATGGCTGCATGATTGACCTTCTTGGTCGTGCAGGGTTTGTGAAGGAAGCAGAAGAGATTGTCCACAAAATGGAAATCAGGCCAGATGCCTTTGCTTGGGGAGCTCTATTAGGGGCTTGCAAGATTCATGGAAATGTGGATATCGGCGAAAGTGTGATGCAAAAATTGACTGATTTAGATCCTGATGAAGATGCTACTTACATTCTTATGACGAATTTATATTCTTCAGTTCATAGATGGAAAGAAGCATTGAAATTAAGAAAGACGATGAAAAGTAAGAAGATGAGAAAGATTCCTGGATGTAGTTTGATTGAAGTTGATGGTGTTGTTCATGAGTTTCGAAAGGGTGATAAGTCACATCCAAAAAGCAAAGTTATATACTCAGTGTTGGAAGGAATTGCTACTCACTTAAAGAGCTCTGGGATTGTGGAACATAGTGCATTTTCCTTGCAGTCCAAACCATCATTTTAG

Protein sequence

MNSQEIHLPHTLHSCKSVTHLKQIHGVAIKTPSLSLPNKLFFPKLISLSSSSPDLFYIRSILLTQSSDAQFRLNLCNAIIRSISANSTHLTAMEFLREMLLIGLEPDGFTLPHVLKALAWIQGIREGQQIHARSIKTGMVRFNVYVSNTLMRLYSVCGSIDDVQKVFDECPHRDLVSWTTLIQAFTKAGQYRRAVGAFMEMCDLKLRADGRTLVVVLSACSNLGDVNLGRKVHSYIRHHIDTKADVFVGNALIDMYLKCDDLNSANKVFNEMSVRNVVTWNAMISGLAFQGRYREALDTFRMMQSKGPKPDEVTLVGVLNSCANLGVLELGKWVHAYIRRNHILADKFVGNALLDMYAKCGRIDEAFRVFENMKRRDVYSYTAMIVGLALHGEANWAFQVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNIYKLRPHVEHYGCMIDLLGRAGFVKEAEEIVHKMEIRPDAFAWGALLGACKIHGNVDIGESVMQKLTDLDPDEDATYILMTNLYSSVHRWKEALKLRKTMKSKKMRKIPGCSLIEVDGVVHEFRKGDKSHPKSKVIYSVLEGIATHLKSSGIVEHSAFSLQSKPSF
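
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a minimal sketch that assumes the standard nuclear genetic code; `translate` is a hypothetical helper, and only the first 30 and last 12 nucleotides of the CDS are embedded here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a full codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGAATTCCCAAGAAATCCACCTTCCTCAC"  # first 30 nt of the CDS
cds_end = "CCATCATTTTAG"                       # last 12 nt of the CDS

print(translate(cds_start))  # MNSQEIHLPH
print(translate(cds_end))    # PSF
```

The two fragments reproduce the first ten residues and the last three residues of the protein sequence listed above; passing the full CDS string to `translate` would be the natural next step.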
Homology
BLAST of Cla97C10G199660 vs. NCBI nr
Match: XP_038902993.1 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Benincasa hispida])

HSP 1 Score: 1104.4 bits (2855), Expect = 0.0e+00
Identity = 547/601 (91.01%), Positives = 569/601 (94.68%), Query Frame = 0

Query: 1   MNSQEIH-LPHTLHSCKSVTHLKQIHGVAIKTPSLSLPNKLFFPKLISLSSSSPDLFYIR 60
           MNSQE+H LPH+LHSCKS+THLKQIHGVAIK PSLSLPNK  FPKLISLSSS  DLFYIR
Sbjct: 1   MNSQELHLLPHSLHSCKSITHLKQIHGVAIKIPSLSLPNKFLFPKLISLSSSFSDLFYIR 60

Query: 61  SILLTQSSDAQFRLNLCNAIIRSISANSTHLTAMEFLREMLLIGLEPDGFTLPHVLKALA 120
           SILLT S DAQFRLNLCNAIIRSISANST+L AMEFL+EMLLIGLEPDGFTLPHVLKALA
Sbjct: 61  SILLTHSPDAQFRLNLCNAIIRSISANSTNLAAMEFLKEMLLIGLEPDGFTLPHVLKALA 120

Query: 121 WIQGIREGQQIHARSIKTGMVRFNVYVSNTLMRLYSVCGSIDDVQKVFDECPHRDLVSWT 180
            IQGIREGQQIHARSIKTGMV FNVYVSNTLMRLYSVCG IDDVQK+FDECPHRDLVSWT
Sbjct: 121 RIQGIREGQQIHARSIKTGMVGFNVYVSNTLMRLYSVCGFIDDVQKMFDECPHRDLVSWT 180

Query: 181 TLIQAFTKAGQYRRAVGAFMEMCDLKLRADGRTLVVVLSACSNLGDVNLGRKVHSYIRHH 240
           TLIQ FTKAG YRRAVGAF+EMCDLKLRADGRTLVVVLSACSNLGD+NLGRKVHSYIRH+
Sbjct: 181 TLIQGFTKAGLYRRAVGAFVEMCDLKLRADGRTLVVVLSACSNLGDLNLGRKVHSYIRHY 240

Query: 241 IDTKADVFVGNALIDMYLKCDDLNSANKVFNEMSVRNVVTWNAMISGLAFQGRYREALDT 300
           ID  ADVFVGNALIDMYLKCDDL SANKVF+EM VRNVVTWNAMISGLA+QGRYREALDT
Sbjct: 241 IDMNADVFVGNALIDMYLKCDDLISANKVFDEMPVRNVVTWNAMISGLAYQGRYREALDT 300

Query: 301 FRMMQSKGPKPDEVTLVGVLNSCANLGVLELGKWVHAYIRRNHILADKFVGNALLDMYAK 360
           FRMMQ+KGPKPDEVTLVGVLNSCANLGVLELGKWVHAYIRRNHILADKFVGNALLDMYAK
Sbjct: 301 FRMMQNKGPKPDEVTLVGVLNSCANLGVLELGKWVHAYIRRNHILADKFVGNALLDMYAK 360

Query: 361 CGRIDEAFRVFENMKRRDVYSYTAMIVGLALHGEANWAFQVFSEMLRVGIEPNEVTFLGL 420
           CGRIDE+F VFE+MKRRDVYSYTAMIVGLALHGEANWAFQVFSEM+ VGIEPNEVTFLGL
Sbjct: 361 CGRIDESFSVFESMKRRDVYSYTAMIVGLALHGEANWAFQVFSEMIGVGIEPNEVTFLGL 420

Query: 421 LMACSHGGLVAEGKKYFFDMSNIYKLRPHVEHYGCMIDLLGRAGFVKEAEEIVHKMEIRP 480
           LMACSHGGLVAEGKKYFF+MSN YKLRP  EHYGCMIDLLGRAG VKEAEEIVHKMEIRP
Sbjct: 421 LMACSHGGLVAEGKKYFFEMSNTYKLRPQTEHYGCMIDLLGRAGLVKEAEEIVHKMEIRP 480

Query: 481 DAFAWGALLGACKIHGNVDIGESVMQKLTDLDPDEDATYILMTNLYSSVHRWKEALKLRK 540
           DA AWGALLGACKI+GNVDIGESVMQKLTDLDP+E+ TYILMTNLYSSV RW++ALKLRK
Sbjct: 481 DAIAWGALLGACKIYGNVDIGESVMQKLTDLDPNENGTYILMTNLYSSVQRWRDALKLRK 540

Query: 541 TMKSKKMRKIPGCSLIEVDGVVHEFRKGDKSHPKSKVIYSVLEGIATHLKSSGIVEHSAF 600
           TMKSKKMRK PGCSLIEVDG VHEFRKGDKSHPKSKVIYSVLEGI THLKS GIVE+S F
Sbjct: 541 TMKSKKMRKSPGCSLIEVDGGVHEFRKGDKSHPKSKVIYSVLEGIGTHLKSYGIVEYSTF 600
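
The percentages on the HSP summary line are simply the match counts divided by the alignment length. The sketch below parses such a line and recomputes them; `parse_hsp_stats` is a hypothetical helper written for this page's text layout, not a function from any BLAST library, and the pattern also accepts the "Postives" spelling that some exports use.

```python
import re

# Matches e.g. "Identity = 547/601 (91.01%), Positives = 569/601 (94.68%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract counts from an HSP summary line and recompute percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP identity/positives line")
    identical, length, _, positive, pos_length, _ = match.groups()
    return {
        "identical": int(identical),
        "positive": int(positive),
        "alignment_length": int(length),
        "identity_pct": round(100 * int(identical) / int(length), 2),
        "positives_pct": round(100 * int(positive) / int(pos_length), 2),
    }


stats = parse_hsp_stats(
    "Identity = 547/601 (91.01%), Positives = 569/601 (94.68%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

For the Benincasa hispida hit above, the recomputed values agree with the percentages printed in the report.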

BLAST of Cla97C10G199660 vs. NCBI nr
Match: KAA0053939.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK25465.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1057.0 bits (2732), Expect = 6.2e-305
Identity = 528/605 (87.27%), Positives = 558/605 (92.23%), Query Frame = 0

Query: 1   MNSQEIHL-PHTLHSCKSVTHLKQIHGVAIKTPSLSLPNKLFFPKLI---SLSSSSPDLF 60
           MNS E+HL PH+LHSCKS++HLKQIHGVAIKTPSLSLPN    PKLI   S SSSSPDLF
Sbjct: 1   MNSLELHLFPHSLHSCKSLSHLKQIHGVAIKTPSLSLPN--LIPKLIFLSSSSSSSPDLF 60

Query: 61  YIRSILLTQSSDAQFRLNLCNAIIRSISANSTHLTAMEFLREMLLIGLEPDGFTLPHVLK 120
           YIRSILLT S DAQFRLNLCNAI+RSIS NST+LT MEFL EMLLIGLEPDGFTLP VLK
Sbjct: 61  YIRSILLTHSHDAQFRLNLCNAIVRSISRNSTNLTPMEFLNEMLLIGLEPDGFTLPLVLK 120

Query: 121 ALAWIQGIREGQQIHARSIKTGMVRFNVYVSNTLMRLYSVCGSIDDVQKVFDECPHRDLV 180
           ALA  +GIREGQQIHARSIKTGMV  NVYV+NTLMRLYSVCGSI DVQKVFDECPHRDLV
Sbjct: 121 ALARTRGIREGQQIHARSIKTGMVGLNVYVTNTLMRLYSVCGSIHDVQKVFDECPHRDLV 180

Query: 181 SWTTLIQAFTKAGQYRRAVGAFMEMCDLKLRADGRTLVVVLSACSNLGDVNLGRKVHSYI 240
           SWT LIQAFTKAG Y RAV AFMEMCDL+LRADGRTLVVVLSACSNLGD+NLG+KVHSYI
Sbjct: 181 SWTILIQAFTKAGLYSRAVEAFMEMCDLRLRADGRTLVVVLSACSNLGDLNLGQKVHSYI 240

Query: 241 RHHIDTKADVFVGNALIDMYLKCDDLNSANKVFNEMSVRNVVTWNAMISGLAFQGRYREA 300
           R++ID  ADVFVGNALIDMYLKCDDLNSANKVF+EM VRNVVTWNAMISGLA+QGRYREA
Sbjct: 241 RYYIDMNADVFVGNALIDMYLKCDDLNSANKVFDEMPVRNVVTWNAMISGLAYQGRYREA 300

Query: 301 LDTFRMMQSKGPKPDEVTLVGVLNSCANLGVLELGKWVHAYIRRNHILADKFVGNALLDM 360
           LDTFR+MQ+KG KPDEVTLVGVLNSCANLGVLE+GKWVHAY+RRNHILAD+FVGNALLDM
Sbjct: 301 LDTFRIMQNKGVKPDEVTLVGVLNSCANLGVLEIGKWVHAYMRRNHILADEFVGNALLDM 360

Query: 361 YAKCGRIDEAFRVFENMKRRDVYSYTAMIVGLALHGEANWAFQVFSEMLRVGIEPNEVTF 420
           YAKCG IDEAFRVFE+MK+RDVYSYTAMIVGLALHGEANWAFQVFSEM RVGIEPNEVTF
Sbjct: 361 YAKCGSIDEAFRVFESMKKRDVYSYTAMIVGLALHGEANWAFQVFSEMFRVGIEPNEVTF 420

Query: 421 LGLLMACSHGGLVAEGKKYFFDMSNIYKLRPHVEHYGCMIDLLGRAGFVKEAEEIVHKME 480
           LGLLMACSHGGLVAEGKKYFF+MS+ YKLRP  EHYGCMIDLLGR G VKEAEEIVHKME
Sbjct: 421 LGLLMACSHGGLVAEGKKYFFEMSDKYKLRPQSEHYGCMIDLLGRVGLVKEAEEIVHKME 480

Query: 481 IRPDAFAWGALLGACKIHGNVDIGESVMQKLTDLDPDEDATYILMTNLYSSVHRWKEALK 540
           IRPD FA GALLGAC+IHGNVDIGESVMQKLT++DPDED TYILMTNLYSSVHRWK+A K
Sbjct: 481 IRPDVFACGALLGACRIHGNVDIGESVMQKLTEIDPDEDGTYILMTNLYSSVHRWKDASK 540

Query: 541 LRKTMKSKKMRKIPGCSLIEVDGVVHEFRKGDKSHPKSKVIYSVLEGIATHLKSSGIVEH 600
           LRKTMK KKMRK PGCS IEVDGVVHEFRKGDKSHP+SKVIY VLEGIATHLKS GIVEH
Sbjct: 541 LRKTMKIKKMRKTPGCSSIEVDGVVHEFRKGDKSHPRSKVIYFVLEGIATHLKSYGIVEH 600

Query: 601 SAFSL 602
           S F +
Sbjct: 601 STFCI 603

BLAST of Cla97C10G199660 vs. NCBI nr
Match: XP_031737800.1 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic isoform X1 [Cucumis sativus] >KGN59527.2 hypothetical protein Csa_002607 [Cucumis sativus])

HSP 1 Score: 1053.5 bits (2723), Expect = 6.9e-304
Identity = 525/606 (86.63%), Positives = 558/606 (92.08%), Query Frame = 0

Query: 1   MNSQEIHL-PHTLHSCKSVTHLKQIHGVAIKTPSLSLPNKLFFPKLI----SLSSSSPDL 60
           MNS ++HL  H+L+SCKS++HLKQIHGVAIKTPSLSLPN    PKLI    S SSSSPDL
Sbjct: 1   MNSLQLHLFSHSLNSCKSISHLKQIHGVAIKTPSLSLPN--LIPKLIFLSSSSSSSSPDL 60

Query: 61  FYIRSILLTQSSDAQFRLNLCNAIIRSISANSTHLTAMEFLREMLLIGLEPDGFTLPHVL 120
           FYIRSILLT S DAQF L+LCNAIIRSIS NS +L+ MEFL EML++GLEPDGFT+P VL
Sbjct: 61  FYIRSILLTHSHDAQFCLSLCNAIIRSISRNSINLSPMEFLNEMLVVGLEPDGFTIPLVL 120

Query: 121 KALAWIQGIREGQQIHARSIKTGMVRFNVYVSNTLMRLYSVCGSIDDVQKVFDECPHRDL 180
           KALA IQGIREGQQIHARSIKTGMV FNVYVSNTLMRLYSVCGSI DVQKVFDECPHRDL
Sbjct: 121 KALALIQGIREGQQIHARSIKTGMVGFNVYVSNTLMRLYSVCGSIHDVQKVFDECPHRDL 180

Query: 181 VSWTTLIQAFTKAGQYRRAVGAFMEMCDLKLRADGRTLVVVLSACSNLGDVNLGRKVHSY 240
           VSWTTLIQAFTKAG Y RAV AFMEMCDL+LRADGRTLVVVLSACSNLGD+NLG+KVHSY
Sbjct: 181 VSWTTLIQAFTKAGLYSRAVEAFMEMCDLRLRADGRTLVVVLSACSNLGDLNLGQKVHSY 240

Query: 241 IRHHIDTKADVFVGNALIDMYLKCDDLNSANKVFNEMSVRNVVTWNAMISGLAFQGRYRE 300
           IRH+ID KADVFVGNAL+DMYLKCDDLNSA KVF+EM V+NVVTWNAMISGLA+QGRYRE
Sbjct: 241 IRHYIDMKADVFVGNALLDMYLKCDDLNSAYKVFDEMPVKNVVTWNAMISGLAYQGRYRE 300

Query: 301 ALDTFRMMQSKGPKPDEVTLVGVLNSCANLGVLELGKWVHAYIRRNHILADKFVGNALLD 360
           ALDTFRMMQ KG KPDEVTLVGVLNSCANLGVLE+GKWVHAY+RRNHILADKFVGNALLD
Sbjct: 301 ALDTFRMMQDKGVKPDEVTLVGVLNSCANLGVLEIGKWVHAYMRRNHILADKFVGNALLD 360

Query: 361 MYAKCGRIDEAFRVFENMKRRDVYSYTAMIVGLALHGEANWAFQVFSEMLRVGIEPNEVT 420
           MYAKCG IDEAFRVFE+MKRRDVYSYTAMI GLALHGEANWAFQVFSEM RVGIEPNEVT
Sbjct: 361 MYAKCGSIDEAFRVFESMKRRDVYSYTAMIFGLALHGEANWAFQVFSEMFRVGIEPNEVT 420

Query: 421 FLGLLMACSHGGLVAEGKKYFFDMSNIYKLRPHVEHYGCMIDLLGRAGFVKEAEEIVHKM 480
           FLGLLMACSHGGLVAEGKKYFF MS+ YKLRP  EHYGCMIDLLGRAG VKEAEEI+HKM
Sbjct: 421 FLGLLMACSHGGLVAEGKKYFFQMSDKYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHKM 480

Query: 481 EIRPDAFAWGALLGACKIHGNVDIGESVMQKLTDLDPDEDATYILMTNLYSSVHRWKEAL 540
           EIRPD FA GALLGAC+IHGNVDIGESVMQKLT+LDPDE+ TYILMTNLYSSVHRWK+AL
Sbjct: 481 EIRPDVFACGALLGACRIHGNVDIGESVMQKLTELDPDEEGTYILMTNLYSSVHRWKDAL 540

Query: 541 KLRKTMKSKKMRKIPGCSLIEVDGVVHEFRKGDKSHPKSKVIYSVLEGIATHLKSSGIVE 600
           K+RKTMK+KKMRK PGCSLIEVDGVVHEFRKGDKSHP+SKVIY VLEGIATHLKS GI E
Sbjct: 541 KIRKTMKNKKMRKTPGCSLIEVDGVVHEFRKGDKSHPRSKVIYLVLEGIATHLKSYGIEE 600

Query: 601 HSAFSL 602
           HS F +
Sbjct: 601 HSTFCI 604

BLAST of Cla97C10G199660 vs. NCBI nr
Match: XP_023521817.1 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1042.3 bits (2694), Expect = 1.6e-300
Identity = 516/608 (84.87%), Positives = 548/608 (90.13%), Query Frame = 0

Query: 1   MNSQEIH-LPHTLHSCKSVTHLKQIHGVAIKTPSLSLPNKLFFPKLISLSSSSPDLFYIR 60
           MNSQE+  LPH+L+SC S+ HLKQ+H VAIKTPSLSL N+L FPKLISLSSSSPDLFYIR
Sbjct: 1   MNSQELRLLPHSLNSCTSIAHLKQLHAVAIKTPSLSLHNQLLFPKLISLSSSSPDLFYIR 60

Query: 61  SILLTQSSDAQFRLNLCNAIIRSISA-------NSTHLTAMEFLREMLLIGLEPDGFTLP 120
           SILLT S+DAQFRLNLCNA I  ISA       NST L AMEFLREMLLIG++PDGFTLP
Sbjct: 61  SILLTSSADAQFRLNLCNAFIHRISANSSGESTNSTDLRAMEFLREMLLIGVQPDGFTLP 120

Query: 121 HVLKALAWIQGIREGQQIHARSIKTGMVRFNVYVSNTLMRLYSVCGSIDDVQKVFDECPH 180
           HVLKALA IQ IREGQQIHA SIK G+VRFNVYV NTLMRLYSVCGSID VQK+F ECPH
Sbjct: 121 HVLKALARIQRIREGQQIHAHSIKIGLVRFNVYVCNTLMRLYSVCGSIDAVQKLFGECPH 180

Query: 181 RDLVSWTTLIQAFTKAGQYRRAVGAFMEMCDLKLRADGRTLVVVLSACSNLGDVNLGRKV 240
           RDLVSWTTLIQAFTKAG YR+AVGAFMEMCDLKLR DGRTLVVVLSA SNLGD+NLGRKV
Sbjct: 181 RDLVSWTTLIQAFTKAGLYRKAVGAFMEMCDLKLRVDGRTLVVVLSAFSNLGDLNLGRKV 240

Query: 241 HSYIRHHIDTKADVFVGNALIDMYLKCDDLNSANKVFNEMSVRNVVTWNAMISGLAFQGR 300
           H+YI H+ID  ADVFVGNAL+DMYLKCDD NSA KVF+EM VRNVVTWNAMISGLA+QGR
Sbjct: 241 HAYIHHYIDVNADVFVGNALLDMYLKCDDSNSAYKVFDEMPVRNVVTWNAMISGLAYQGR 300

Query: 301 YREALDTFRMMQSKGPKPDEVTLVGVLNSCANLGVLELGKWVHAYIRRNHILADKFVGNA 360
           Y+EALD FR MQ  GPKPDEVTLVGVLNSCANLGVLELGKWVHAY+RRNHILADKFVGNA
Sbjct: 301 YKEALDMFRRMQRTGPKPDEVTLVGVLNSCANLGVLELGKWVHAYMRRNHILADKFVGNA 360

Query: 361 LLDMYAKCGRIDEAFRVFENMKRRDVYSYTAMIVGLALHGEANWAFQVFSEMLRVGIEPN 420
           LLDMYAKCGRIDEAFRVFE+MKRRDVYSYTAMIVGLALHGEANWAFQVFS MLR G+EPN
Sbjct: 361 LLDMYAKCGRIDEAFRVFESMKRRDVYSYTAMIVGLALHGEANWAFQVFSRMLREGVEPN 420

Query: 421 EVTFLGLLMACSHGGLVAEGKKYFFDMSNIYKLRPHVEHYGCMIDLLGRAGFVKEAEEIV 480
           EVTFLGLLMACSH GLV++GKKYFFDM N YKLRP  EHYGCMIDLLGRAG VKEAEEI+
Sbjct: 421 EVTFLGLLMACSHSGLVSDGKKYFFDMLNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEII 480

Query: 481 HKMEIRPDAFAWGALLGACKIHGNVDIGESVMQKLTDLDPDEDATYILMTNLYSSVHRWK 540
           H MEIRPDAFAWGALLGAC+IHGNV++GESVMQKL +LDP ED  YILMTNLYSS HRWK
Sbjct: 481 HSMEIRPDAFAWGALLGACRIHGNVNLGESVMQKLMNLDPGEDGNYILMTNLYSSAHRWK 540

Query: 541 EALKLRKTMKSKKMRKIPGCSLIEVDGVVHEFRKGDKSHPKSKVIYSVLEGIATHLKSSG 600
           +ALKLRK MKSKKMRK PGCSLIEVDGVVHEFRKGDKSHPK++VIYSVLEGIA HLKS G
Sbjct: 541 DALKLRKKMKSKKMRKTPGCSLIEVDGVVHEFRKGDKSHPKNRVIYSVLEGIACHLKSYG 600

BLAST of Cla97C10G199660 vs. NCBI nr
Match: KAG6596202.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1040.4 bits (2689), Expect = 6.0e-300
Identity = 516/609 (84.73%), Positives = 548/609 (89.98%), Query Frame = 0

Query: 1   MNSQEI-HLPHTLHSCKSVTHLKQIHGVAIKTPSLSLPNKLFFPKLISL-SSSSPDLFYI 60
           MNSQE+  LPH+L+SC S+ HLKQ+H VAIKTPSLSL N+  FPKLISL SSSSPDLFYI
Sbjct: 1   MNSQELCLLPHSLNSCTSIAHLKQLHAVAIKTPSLSLHNQFLFPKLISLSSSSSPDLFYI 60

Query: 61  RSILLTQSSDAQFRLNLCNAIIRSISA-------NSTHLTAMEFLREMLLIGLEPDGFTL 120
           RSILLT S+DAQFRLNLCNA I  ISA       NST L AMEFLREMLLIG++PDGFTL
Sbjct: 61  RSILLTSSADAQFRLNLCNAFIHRISANSNGESTNSTDLRAMEFLREMLLIGVQPDGFTL 120

Query: 121 PHVLKALAWIQGIREGQQIHARSIKTGMVRFNVYVSNTLMRLYSVCGSIDDVQKVFDECP 180
           PHVLKALA +Q IREGQQIHA SIK G+VRFNVYV NTLMRLYSVCGSID VQK+F ECP
Sbjct: 121 PHVLKALARVQRIREGQQIHAHSIKIGLVRFNVYVCNTLMRLYSVCGSIDAVQKLFGECP 180

Query: 181 HRDLVSWTTLIQAFTKAGQYRRAVGAFMEMCDLKLRADGRTLVVVLSACSNLGDVNLGRK 240
           HRDLVSWTTLIQAFTKAG YR+AVGAFMEMCDLKLRADGRTLVVVLSACSNLGD+NLGRK
Sbjct: 181 HRDLVSWTTLIQAFTKAGLYRKAVGAFMEMCDLKLRADGRTLVVVLSACSNLGDLNLGRK 240

Query: 241 VHSYIRHHIDTKADVFVGNALIDMYLKCDDLNSANKVFNEMSVRNVVTWNAMISGLAFQG 300
           VHSYI H+ID  ADVFVGNAL+DMYLKCDD NSA KVF+EM VRNVVTWNAMISGLA+QG
Sbjct: 241 VHSYIHHYIDVNADVFVGNALLDMYLKCDDSNSAYKVFDEMPVRNVVTWNAMISGLAYQG 300

Query: 301 RYREALDTFRMMQSKGPKPDEVTLVGVLNSCANLGVLELGKWVHAYIRRNHILADKFVGN 360
           RY+EALD FR MQ  GPKPDEVTLVGVLNSCANLGVLELGKWVHAY+RRNHIL DKFVGN
Sbjct: 301 RYKEALDMFRRMQRTGPKPDEVTLVGVLNSCANLGVLELGKWVHAYMRRNHILTDKFVGN 360

Query: 361 ALLDMYAKCGRIDEAFRVFENMKRRDVYSYTAMIVGLALHGEANWAFQVFSEMLRVGIEP 420
           ALLDMYAKCGRIDEAFRVFE+MKRRDVYSYTAMIVGLALHGEANWAFQVFS MLR G+EP
Sbjct: 361 ALLDMYAKCGRIDEAFRVFESMKRRDVYSYTAMIVGLALHGEANWAFQVFSRMLREGVEP 420

Query: 421 NEVTFLGLLMACSHGGLVAEGKKYFFDMSNIYKLRPHVEHYGCMIDLLGRAGFVKEAEEI 480
           NEVTFLGLLMACSH GLV++GKKYFFDM N YKLRP  EHYGCMIDLLGRAG VKEAEEI
Sbjct: 421 NEVTFLGLLMACSHSGLVSDGKKYFFDMLNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEI 480

Query: 481 VHKMEIRPDAFAWGALLGACKIHGNVDIGESVMQKLTDLDPDEDATYILMTNLYSSVHRW 540
           +H MEIRPDAFAWGALLGAC+IHGNV++GESVMQKL +LDP ED  YILMTNLYSS HRW
Sbjct: 481 IHSMEIRPDAFAWGALLGACRIHGNVNLGESVMQKLMNLDPGEDGNYILMTNLYSSAHRW 540

Query: 541 KEALKLRKTMKSKKMRKIPGCSLIEVDGVVHEFRKGDKSHPKSKVIYSVLEGIATHLKSS 600
           K+ALKLRK MKSKKMRK PGCSLIEVDGVVHEFRKGDKSHPK++VIYSVLEGIA HLKS 
Sbjct: 541 KDALKLRKKMKSKKMRKTPGCSLIEVDGVVHEFRKGDKSHPKNRVIYSVLEGIACHLKSH 600

BLAST of Cla97C10G199660 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 462.2 bits (1188), Expect = 8.9e-129
Identity = 234/625 (37.44%), Positives = 370/625 (59.20%), Query Frame = 0

Query: 12  LHSCKSVTHLKQIHGVAIKTPSLSLPNKLF-FPKLISLSSSSPDLFYIRSILLTQSSDAQ 71
           LH+CK++  L+ IH   IK   + L N  +   KLI     SP    +   +    +  +
Sbjct: 40  LHNCKTLQSLRIIHAQMIK---IGLHNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQE 99

Query: 72  FRLNLCNAIIRSISANSTHLTAMEFLREMLLIGLEPDGFTLPHVLKALAWIQGIREGQQI 131
             L + N + R  + +S  ++A++    M+ +GL P+ +T P VLK+ A  +  +EGQQI
Sbjct: 100 PNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQI 159

Query: 132 HARSIKTGMVRFNVYVSNTLMRLYSVCGSIDDVQKVFDECPHR----------------- 191
           H   +K G    ++YV  +L+ +Y   G ++D  KVFD+ PHR                 
Sbjct: 160 HGHVLKLG-CDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGY 219

Query: 192 --------------DLVSWTTLIQAFTKAGQYRRAVGAFMEMCDLKLRADGRTLVVVLSA 251
                         D+VSW  +I  + + G Y+ A+  F +M    +R D  T+V V+SA
Sbjct: 220 IENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSA 279

Query: 252 CSNLGDVNLGRKVHSYIRHHIDTKADVFVGNALIDMYLKCDDLNSANKVFNEMSVRNVVT 311
           C+  G + LGR+VH +I  H    +++ + NALID+Y KC +L +A  +F  +  ++V++
Sbjct: 280 CAQSGSIELGRQVHLWIDDH-GFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVIS 339

Query: 312 WNAMISGLAFQGRYREALDTFRMMQSKGPKPDEVTLVGVLNSCANLGVLELGKWVHAYI- 371
           WN +I G      Y+EAL  F+ M   G  P++VT++ +L +CA+LG +++G+W+H YI 
Sbjct: 340 WNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYID 399

Query: 372 -RRNHILADKFVGNALLDMYAKCGRIDEAFRVFENMKRRDVYSYTAMIVGLALHGEANWA 431
            R   +     +  +L+DMYAKCG I+ A +VF ++  + + S+ AMI G A+HG A+ +
Sbjct: 400 KRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADAS 459

Query: 432 FQVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNIYKLRPHVEHYGCMID 491
           F +FS M ++GI+P+++TF+GLL ACSH G++  G+  F  M+  YK+ P +EHYGCMID
Sbjct: 460 FDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMID 519

Query: 492 LLGRAGFVKEAEEIVHKMEIRPDAFAWGALLGACKIHGNVDIGESVMQKLTDLDPDEDAT 551
           LLG +G  KEAEE+++ ME+ PD   W +LL ACK+HGNV++GES  + L  ++P+   +
Sbjct: 520 LLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGS 579

Query: 552 YILMTNLYSSVHRWKEALKLRKTMKSKKMRKIPGCSLIEVDGVVHEFRKGDKSHPKSKVI 603
           Y+L++N+Y+S  RW E  K R  +  K M+K+PGCS IE+D VVHEF  GDK HP+++ I
Sbjct: 580 YVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREI 639
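
The Swiss-Prot match descriptions on this page follow the UniProt header layout, where the protein name is followed by `OS=` (organism), `OX=` (taxonomy ID), `GN=` (gene name), `PE=` (protein existence level) and `SV=` (sequence version). A minimal sketch for splitting such a description into fields is shown below; `parse_uniprot_description` is a hypothetical helper, and it assumes the keys appear as whitespace-separated `KEY=value` tokens after the name.

```python
import re

# Split at whitespace that is immediately followed by one of the known keys.
FIELD_BOUNDARY = re.compile(r"\s(?=(?:OS|OX|GN|PE|SV)=)")


def parse_uniprot_description(description: str) -> dict:
    """Split a UniProt-style description into its name and KEY=value fields."""
    parts = FIELD_BOUNDARY.split(description.strip())
    fields = {"name": parts[0]}
    for part in parts[1:]:
        key, _, value = part.partition("=")
        fields[key] = value
    return fields


desc = (
    "Pentatricopeptide repeat-containing protein At1g08070, chloroplastic "
    "OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1"
)
fields = parse_uniprot_description(desc)
print(fields["OS"])  # Arabidopsis thaliana
print(fields["GN"])  # PCMP-H12
```

Splitting on a lookahead keeps multi-word values such as the organism name intact, which a plain whitespace split would not.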

BLAST of Cla97C10G199660 vs. ExPASy Swiss-Prot
Match: Q9SJZ3 (Pentatricopeptide repeat-containing protein At2g22410, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E28 PE=2 SV=1)

HSP 1 Score: 416.4 bits (1069), Expect = 5.6e-115
Identity = 219/624 (35.10%), Positives = 352/624 (56.41%), Query Frame = 0

Query: 12  LHSCKSVTHLKQIHGVAIKTPSLSLPNKLFFPKLISLSSSSPDLFYIRSILLTQSSDAQF 71
           L  CK + HLKQI    I    +  P      +LI+  + S   +   S+ + +  +   
Sbjct: 60  LEKCKLLLHLKQIQAQMIINGLILDP--FASSRLIAFCALSESRYLDYSVKILKGIENP- 119

Query: 72  RLNLCNAIIRSISANSTHLTAMEFLREMLLIGL---EPDGFTLPHVLKALAWIQGIREGQ 131
            +   N  IR  S +     +    ++ML  G     PD FT P + K  A ++    G 
Sbjct: 120 NIFSWNVTIRGFSESENPKESFLLYKQMLRHGCCESRPDHFTYPVLFKVCADLRLSSLGH 179

Query: 132 QIHARSIKTGMVRFNVYVSNTLMRLYSVCGSIDDVQKVFDECPHRDLVSWTTLIQAFTKA 191
            I    +K  +   + +V N  + +++ CG +++ +KVFDE P RDLVSW  LI  + K 
Sbjct: 180 MILGHVLKLRLELVS-HVHNASIHMFASCGDMENARKVFDESPVRDLVSWNCLINGYKKI 239

Query: 192 GQYRRAVGAFMEMCDLKLRADGRTLVVVLSACSNLGDVNLGRKVHSYIRHHIDTKADVFV 251
           G+  +A+  +  M    ++ D  T++ ++S+CS LGD+N G++ + Y++ +   +  + +
Sbjct: 240 GEAEKAIYVYKLMESEGVKPDDVTMIGLVSSCSMLGDLNRGKEFYEYVKEN-GLRMTIPL 299

Query: 252 GNALIDMYLKCDDLNSANKVFNEMSVRNVVTWNAMISGLAFQG----------------- 311
            NAL+DM+ KC D++ A ++F+ +  R +V+W  MISG A  G                 
Sbjct: 300 VNALMDMFSKCGDIHEARRIFDNLEKRTIVSWTTMISGYARCGLLDVSRKLFDDMEEKDV 359

Query: 312 --------------RYREALDTFRMMQSKGPKPDEVTLVGVLNSCANLGVLELGKWVHAY 371
                         R ++AL  F+ MQ+   KPDE+T++  L++C+ LG L++G W+H Y
Sbjct: 360 VLWNAMIGGSVQAKRGQDALALFQEMQTSNTKPDEITMIHCLSACSQLGALDVGIWIHRY 419

Query: 372 IRRNHILADKFVGNALLDMYAKCGRIDEAFRVFENMKRRDVYSYTAMIVGLALHGEANWA 431
           I +  +  +  +G +L+DMYAKCG I EA  VF  ++ R+  +YTA+I GLALHG+A+ A
Sbjct: 420 IEKYSLSLNVALGTSLVDMYAKCGNISEALSVFHGIQTRNSLTYTAIIGGLALHGDASTA 479

Query: 432 FQVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNIYKLRPHVEHYGCMID 491
              F+EM+  GI P+E+TF+GLL AC HGG++  G+ YF  M + + L P ++HY  M+D
Sbjct: 480 ISYFNEMIDAGIAPDEITFIGLLSACCHGGMIQTGRDYFSQMKSRFNLNPQLKHYSIMVD 539

Query: 492 LLGRAGFVKEAEEIVHKMEIRPDAFAWGALLGACKIHGNVDIGESVMQKLTDLDPDEDAT 551
           LLGRAG ++EA+ ++  M +  DA  WGALL  C++HGNV++GE   +KL +LDP +   
Sbjct: 540 LLGRAGLLEEADRLMESMPMEADAAVWGALLFGCRMHGNVELGEKAAKKLLELDPSDSGI 599

Query: 552 YILMTNLYSSVHRWKEALKLRKTMKSKKMRKIPGCSLIEVDGVVHEFRKGDKSHPKSKVI 602
           Y+L+  +Y   + W++A + R+ M  + + KIPGCS IEV+G+V EF   DKS P+S+ I
Sbjct: 600 YVLLDGMYGEANMWEDAKRARRMMNERGVEKIPGCSSIEVNGIVCEFIVRDKSRPESEKI 659

BLAST of Cla97C10G199660 vs. ExPASy Swiss-Prot
Match: Q9C866 (Pentatricopeptide repeat-containing protein At1g31430 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E55 PE=2 SV=1)

HSP 1 Score: 408.3 bits (1048), Expect = 1.5e-112
Identity = 213/546 (39.01%), Positives = 321/546 (58.79%), Query Frame = 0

Query: 73  LNLCNAIIRSISANSTHLTAMEFLREMLLIGLEPDGFTLPHVLKALAWIQGIREGQQIHA 132
           L + N +++S++   +    +    E+   GL PD FTLP VLK++  ++ + EG+++H 
Sbjct: 11  LLMYNKMLKSLADGKSFTKVLALFGELRGQGLYPDNFTLPVVLKSIGRLRKVIEGEKVHG 70

Query: 133 RSIKTGMVRFNVYVSNTLMRLYSVCGSIDDVQKVFDECPHRDLVSWTTLIQAFTKAGQYR 192
            ++K G+  F+ YVSN+LM +Y+  G I+   KVFDE P RD+VSW  LI ++   G++ 
Sbjct: 71  YAVKAGL-EFDSYVSNSLMGMYASLGKIEITHKVFDEMPQRDVVSWNGLISSYVGNGRFE 130

Query: 193 RAVGAFMEMC-DLKLRADGRTLVVVLSACSNLGDVNLGRKVHSYIRHHIDTKADVFVGNA 252
            A+G F  M  +  L+ D  T+V  LSACS L ++ +G +++ ++    + +  V +GNA
Sbjct: 131 DAIGVFKRMSQESNLKFDEGTIVSTLSACSALKNLEIGERIYRFV--VTEFEMSVRIGNA 190

Query: 253 LIDMYLKCDDLNSANKVFNEM-------------------------------SVRNVVTW 312
           L+DM+ KC  L+ A  VF+ M                                V++VV W
Sbjct: 191 LVDMFCKCGCLDKARAVFDSMRDKNVKCWTSMVFGYVSTGRIDEARVLFERSPVKDVVLW 250

Query: 313 NAMISGLAFQGRYREALDTFRMMQSKGPKPDEVTLVGVLNSCANLGVLELGKWVHAYIRR 372
            AM++G     R+ EAL+ FR MQ+ G +PD   LV +L  CA  G LE GKW+H YI  
Sbjct: 251 TAMMNGYVQFNRFDEALELFRCMQTAGIRPDNFVLVSLLTGCAQTGALEQGKWIHGYINE 310

Query: 373 NHILADKFVGNALLDMYAKCGRIDEAFRVFENMKRRDVYSYTAMIVGLALHGEANWAFQV 432
           N +  DK VG AL+DMYAKCG I+ A  VF  +K RD  S+T++I GLA++G +  A  +
Sbjct: 311 NRVTVDKVVGTALVDMYAKCGCIETALEVFYEIKERDTASWTSLIYGLAMNGMSGRALDL 370

Query: 433 FSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNIYKLRPHVEHYGCMIDLLG 492
           + EM  VG+  + +TF+ +L AC+HGG VAEG+K F  M+  + ++P  EH  C+IDLL 
Sbjct: 371 YYEMENVGVRLDAITFVAVLTACNHGGFVAEGRKIFHSMTERHNVQPKSEHCSCLIDLLC 430

Query: 493 RAGFVKEAEEIVHKMEIRPDAF---AWGALLGACKIHGNVDIGESVMQKLTDLDPDEDAT 552
           RAG + EAEE++ KM    D      + +LL A + +GNV I E V +KL  ++  + + 
Sbjct: 431 RAGLLDEAEELIDKMRGESDETLVPVYCSLLSAARNYGNVKIAERVAEKLEKVEVSDSSA 490

Query: 553 YILMTNLYSSVHRWKEALKLRKTMKSKKMRKIPGCSLIEVDGVVHEFRKGDK--SHPKSK 582
           + L+ ++Y+S +RW++   +R+ MK   +RK PGCS IE+DGV HEF  GD   SHPK  
Sbjct: 491 HTLLASVYASANRWEDVTNVRRKMKDLGIRKFPGCSSIEIDGVGHEFIVGDDLLSHPKMD 550

BLAST of Cla97C10G199660 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 405.6 bits (1041), Expect = 9.9e-112
Identity = 217/617 (35.17%), Positives = 350/617 (56.73%), Query Frame = 0

Query: 12  LHSCKSVTHLKQIHGVAIKTPSLSLP---NKLFFPKLISLSSSSPDLFYIRSILLTQSSD 71
           +  C S+  LKQ HG  I+T + S P   +KLF    ++  SS   L Y R +       
Sbjct: 37  IERCVSLRQLKQTHGHMIRTGTFSDPYSASKLF---AMAALSSFASLEYARKVFDEIPKP 96

Query: 72  AQFRLNLCNAIIRS-ISANSTHLTAMEFLREMLLIGLEPDGFTLPHVLKALAWIQGIREG 131
             F     N +IR+  S     L+   FL  +      P+ +T P ++KA A +  +  G
Sbjct: 97  NSF---AWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLG 156

Query: 132 QQIHARSIKTGMVRFNVYVSNTLMRLYSVCGSIDDVQKVFDECPHRDLVSWTTLIQAFTK 191
           Q +H  ++K+  V  +V+V+N+L+  Y  CG +D   KVF     +D+VSW ++I  F +
Sbjct: 157 QSLHGMAVKSA-VGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQ 216

Query: 192 AGQYRRAVGAFMEMCDLKLRADGRTLVVVLSACSNLGDVNLGRKVHSYIRHHIDTKADVF 251
            G   +A+  F +M    ++A   T+V VLSAC+ + ++  GR+V SYI  +     ++ 
Sbjct: 217 KGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEEN-RVNVNLT 276

Query: 252 VGNALIDMYLKC-------------------------------DDLNSANKVFNEMSVRN 311
           + NA++DMY KC                               +D  +A +V N M  ++
Sbjct: 277 LANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKD 336

Query: 312 VVTWNAMISGLAFQGRYREALDTFRMMQ-SKGPKPDEVTLVGVLNSCANLGVLELGKWVH 371
           +V WNA+IS     G+  EAL  F  +Q  K  K +++TLV  L++CA +G LELG+W+H
Sbjct: 337 IVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIH 396

Query: 372 AYIRRNHILADKFVGNALLDMYAKCGRIDEAFRVFENMKRRDVYSYTAMIVGLALHGEAN 431
           +YI+++ I  +  V +AL+ MY+KCG ++++  VF ++++RDV+ ++AMI GLA+HG  N
Sbjct: 397 SYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGN 456

Query: 432 WAFQVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNIYKLRPHVEHYGCM 491
            A  +F +M    ++PN VTF  +  ACSH GLV E +  F  M + Y + P  +HY C+
Sbjct: 457 EAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACI 516

Query: 492 IDLLGRAGFVKEAEEIVHKMEIRPDAFAWGALLGACKIHGNVDIGESVMQKLTDLDPDED 551
           +D+LGR+G++++A + +  M I P    WGALLGACKIH N+++ E    +L +L+P  D
Sbjct: 517 VDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRND 576

Query: 552 ATYILMTNLYSSVHRWKEALKLRKTMKSKKMRKIPGCSLIEVDGVVHEFRKGDKSHPKSK 593
             ++L++N+Y+ + +W+   +LRK M+   ++K PGCS IE+DG++HEF  GD +HP S+
Sbjct: 577 GAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSE 636

BLAST of Cla97C10G199660 vs. ExPASy Swiss-Prot
Match: O23337 (Pentatricopeptide repeat-containing protein At4g14820 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H3 PE=2 SV=1)

HSP 1 Score: 404.8 bits (1039), Expect = 1.7e-111
Identity = 214/614 (34.85%), Positives = 346/614 (56.35%), Query Frame = 0

Query: 12  LHSCKSVTHLKQIHGVAIKTPSLSLPNKLFFPKLISLSSSSPDLFYIRSILLTQSSDAQF 71
           L  CKS+ H+KQ+H   ++T      N   F   +S+SSSS +L Y  ++  +  S  + 
Sbjct: 19  LSFCKSLNHIKQLHAHILRTVINHKLNSFLFN--LSVSSSSINLSYALNVFSSIPSPPE- 78

Query: 72  RLNLCNAIIRSISANSTHLTAMEFLREMLLIGLEPDGFTLPHVLKALAWIQGIREGQQIH 131
              + N  +R +S +S     + F + +  +G   D F+   +LKA++ +  + EG ++H
Sbjct: 79  -SIVFNPFLRDLSRSSEPRATILFYQRIRHVGGRLDQFSFLPILKAVSKVSALFEGMELH 138

Query: 132 ARSIKTGMVRFNVYVSNTLMRLYSVCGSIDDVQKVFDECPHRDLVSWTTLIQAFTKAGQY 191
             + K   +  + +V    M +Y+ CG I+  + VFDE  HRD+V+W T+I+ + + G  
Sbjct: 139 GVAFKIATL-CDPFVETGFMDMYASCGRINYARNVFDEMSHRDVVTWNTMIERYCRFGLV 198

Query: 192 RRAVGAFMEMCDLKLRADGRTLVVVLSACSNLGDVNLGRKVHSYIRHHIDTKADVFVGNA 251
             A   F EM D  +  D   L  ++SAC   G++   R ++ ++  + D + D  +  A
Sbjct: 199 DEAFKLFEEMKDSNVMPDEMILCNIVSACGRTGNMRYNRAIYEFLIEN-DVRMDTHLLTA 258

Query: 252 LIDMYLKCDDLNSANKVFNEMSVRNVVTWNAMISGLAFQGRY------------------ 311
           L+ MY     ++ A + F +MSVRN+    AM+SG +  GR                   
Sbjct: 259 LVTMYAGAGCMDMAREFFRKMSVRNLFVSTAMVSGYSKCGRLDDAQVIFDQTEKKDLVCW 318

Query: 312 -------------REALDTFRMMQSKGPKPDEVTLVGVLNSCANLGVLELGKWVHAYIRR 371
                        +EAL  F  M   G KPD V++  V+++CANLG+L+  KWVH+ I  
Sbjct: 319 TTMISAYVESDYPQEALRVFEEMCCSGIKPDVVSMFSVISACANLGILDKAKWVHSCIHV 378

Query: 372 NHILADKFVGNALLDMYAKCGRIDEAFRVFENMKRRDVYSYTAMIVGLALHGEANWAFQV 431
           N + ++  + NAL++MYAKCG +D    VFE M RR+V S+++MI  L++HGEA+ A  +
Sbjct: 379 NGLESELSINNALINMYAKCGGLDATRDVFEKMPRRNVVSWSSMINALSMHGEASDALSL 438

Query: 432 FSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNIYKLRPHVEHYGCMIDLLG 491
           F+ M +  +EPNEVTF+G+L  CSH GLV EGKK F  M++ Y + P +EHYGCM+DL G
Sbjct: 439 FARMKQENVEPNEVTFVGVLYGCSHSGLVEEGKKIFASMTDEYNITPKLEHYGCMVDLFG 498

Query: 492 RAGFVKEAEEIVHKMEIRPDAFAWGALLGACKIHGNVDIGESVMQKLTDLDPDEDATYIL 551
           RA  ++EA E++  M +  +   WG+L+ AC+IHG +++G+   +++ +L+PD D   +L
Sbjct: 499 RANLLREALEVIESMPVASNVVIWGSLMSACRIHGELELGKFAAKRILELEPDHDGALVL 558

Query: 552 MTNLYSSVHRWKEALKLRKTMKSKKMRKIPGCSLIEVDGVVHEFRKGDKSHPKSKVIYSV 595
           M+N+Y+   RW++   +R+ M+ K + K  G S I+ +G  HEF  GDK H +S  IY+ 
Sbjct: 559 MSNIYAREQRWEDVRNIRRVMEEKNVFKEKGLSRIDQNGKSHEFLIGDKRHKQSNEIYAK 618

BLAST of Cla97C10G199660 vs. ExPASy TrEMBL
Match: A0A5A7UKB6 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G006110 PE=4 SV=1)

HSP 1 Score: 1057.0 bits (2732), Expect = 3.0e-305
Identity = 528/605 (87.27%), Positives = 558/605 (92.23%), Query Frame = 0

Query: 1   MNSQEIHL-PHTLHSCKSVTHLKQIHGVAIKTPSLSLPNKLFFPKLI---SLSSSSPDLF 60
           MNS E+HL PH+LHSCKS++HLKQIHGVAIKTPSLSLPN    PKLI   S SSSSPDLF
Sbjct: 1   MNSLELHLFPHSLHSCKSLSHLKQIHGVAIKTPSLSLPN--LIPKLIFLSSSSSSSPDLF 60

Query: 61  YIRSILLTQSSDAQFRLNLCNAIIRSISANSTHLTAMEFLREMLLIGLEPDGFTLPHVLK 120
           YIRSILLT S DAQFRLNLCNAI+RSIS NST+LT MEFL EMLLIGLEPDGFTLP VLK
Sbjct: 61  YIRSILLTHSHDAQFRLNLCNAIVRSISRNSTNLTPMEFLNEMLLIGLEPDGFTLPLVLK 120

Query: 121 ALAWIQGIREGQQIHARSIKTGMVRFNVYVSNTLMRLYSVCGSIDDVQKVFDECPHRDLV 180
           ALA  +GIREGQQIHARSIKTGMV  NVYV+NTLMRLYSVCGSI DVQKVFDECPHRDLV
Sbjct: 121 ALARTRGIREGQQIHARSIKTGMVGLNVYVTNTLMRLYSVCGSIHDVQKVFDECPHRDLV 180

Query: 181 SWTTLIQAFTKAGQYRRAVGAFMEMCDLKLRADGRTLVVVLSACSNLGDVNLGRKVHSYI 240
           SWT LIQAFTKAG Y RAV AFMEMCDL+LRADGRTLVVVLSACSNLGD+NLG+KVHSYI
Sbjct: 181 SWTILIQAFTKAGLYSRAVEAFMEMCDLRLRADGRTLVVVLSACSNLGDLNLGQKVHSYI 240

Query: 241 RHHIDTKADVFVGNALIDMYLKCDDLNSANKVFNEMSVRNVVTWNAMISGLAFQGRYREA 300
           R++ID  ADVFVGNALIDMYLKCDDLNSANKVF+EM VRNVVTWNAMISGLA+QGRYREA
Sbjct: 241 RYYIDMNADVFVGNALIDMYLKCDDLNSANKVFDEMPVRNVVTWNAMISGLAYQGRYREA 300

Query: 301 LDTFRMMQSKGPKPDEVTLVGVLNSCANLGVLELGKWVHAYIRRNHILADKFVGNALLDM 360
           LDTFR+MQ+KG KPDEVTLVGVLNSCANLGVLE+GKWVHAY+RRNHILAD+FVGNALLDM
Sbjct: 301 LDTFRIMQNKGVKPDEVTLVGVLNSCANLGVLEIGKWVHAYMRRNHILADEFVGNALLDM 360

Query: 361 YAKCGRIDEAFRVFENMKRRDVYSYTAMIVGLALHGEANWAFQVFSEMLRVGIEPNEVTF 420
           YAKCG IDEAFRVFE+MK+RDVYSYTAMIVGLALHGEANWAFQVFSEM RVGIEPNEVTF
Sbjct: 361 YAKCGSIDEAFRVFESMKKRDVYSYTAMIVGLALHGEANWAFQVFSEMFRVGIEPNEVTF 420

Query: 421 LGLLMACSHGGLVAEGKKYFFDMSNIYKLRPHVEHYGCMIDLLGRAGFVKEAEEIVHKME 480
           LGLLMACSHGGLVAEGKKYFF+MS+ YKLRP  EHYGCMIDLLGR G VKEAEEIVHKME
Sbjct: 421 LGLLMACSHGGLVAEGKKYFFEMSDKYKLRPQSEHYGCMIDLLGRVGLVKEAEEIVHKME 480

Query: 481 IRPDAFAWGALLGACKIHGNVDIGESVMQKLTDLDPDEDATYILMTNLYSSVHRWKEALK 540
           IRPD FA GALLGAC+IHGNVDIGESVMQKLT++DPDED TYILMTNLYSSVHRWK+A K
Sbjct: 481 IRPDVFACGALLGACRIHGNVDIGESVMQKLTEIDPDEDGTYILMTNLYSSVHRWKDASK 540

Query: 541 LRKTMKSKKMRKIPGCSLIEVDGVVHEFRKGDKSHPKSKVIYSVLEGIATHLKSSGIVEH 600
           LRKTMK KKMRK PGCS IEVDGVVHEFRKGDKSHP+SKVIY VLEGIATHLKS GIVEH
Sbjct: 541 LRKTMKIKKMRKTPGCSSIEVDGVVHEFRKGDKSHPRSKVIYFVLEGIATHLKSYGIVEH 600

Query: 601 SAFSL 602
           S F +
Sbjct: 601 STFCI 603

BLAST of Cla97C10G199660 vs. ExPASy TrEMBL
Match: A0A6J1G7N7 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111451544 PE=4 SV=1)

HSP 1 Score: 1040.0 bits (2688), Expect = 3.8e-300
Identity = 518/610 (84.92%), Positives = 548/610 (89.84%), Query Frame = 0

Query: 1   MNSQEIH-LPHTLHSCKSVTHLKQIHGVAIKTPSLSLPNKLFFPKLISLSSS--SPDLFY 60
           MNSQE+  LPH+L+SC S+ HLKQ+H VAIKTPSLSL N+L FPKLISLSSS  SPDLFY
Sbjct: 1   MNSQELRLLPHSLNSCTSIAHLKQLHAVAIKTPSLSLHNQLLFPKLISLSSSSPSPDLFY 60

Query: 61  IRSILLTQSSDAQFRLNLCNAIIRSISA-------NSTHLTAMEFLREMLLIGLEPDGFT 120
           IRSILLT S+DAQFRLNLCNA I  ISA       NST L AMEFLREMLLIG++PDGFT
Sbjct: 61  IRSILLTSSADAQFRLNLCNAFIHRISANSNGESTNSTGLRAMEFLREMLLIGVQPDGFT 120

Query: 121 LPHVLKALAWIQGIREGQQIHARSIKTGMVRFNVYVSNTLMRLYSVCGSIDDVQKVFDEC 180
           LPHVLKALA IQ IREGQQIHA SIK G+VRFNVYV NTLMRLYSVCGSID VQK+F EC
Sbjct: 121 LPHVLKALARIQRIREGQQIHAHSIKIGLVRFNVYVCNTLMRLYSVCGSIDAVQKLFGEC 180

Query: 181 PHRDLVSWTTLIQAFTKAGQYRRAVGAFMEMCDLKLRADGRTLVVVLSACSNLGDVNLGR 240
           PHRDLVSWTTLIQAFTKAG YR+AVGAFMEMCDLKLRADGRTLVVVLSACSNLGD+NLGR
Sbjct: 181 PHRDLVSWTTLIQAFTKAGLYRKAVGAFMEMCDLKLRADGRTLVVVLSACSNLGDLNLGR 240

Query: 241 KVHSYIRHHIDTKADVFVGNALIDMYLKCDDLNSANKVFNEMSVRNVVTWNAMISGLAFQ 300
           KVHSYI H+ID  ADVFVGNAL+DMYLKCDD NSA KVF+EM VRNVVTWNAMI GLA+Q
Sbjct: 241 KVHSYIHHYIDVNADVFVGNALLDMYLKCDDSNSAYKVFDEMPVRNVVTWNAMILGLAYQ 300

Query: 301 GRYREALDTFRMMQSKGPKPDEVTLVGVLNSCANLGVLELGKWVHAYIRRNHILADKFVG 360
           GRY+EALD FR MQ  GPKPDEVTLVGVLNSCANLGVLELGKWVHAY+RRNHILADKFVG
Sbjct: 301 GRYKEALDMFRRMQRTGPKPDEVTLVGVLNSCANLGVLELGKWVHAYMRRNHILADKFVG 360

Query: 361 NALLDMYAKCGRIDEAFRVFENMKRRDVYSYTAMIVGLALHGEANWAFQVFSEMLRVGIE 420
           NALLDMYAKCGRIDEAFRVFE MKRRDVYSYTAMIVGLALHGEANWAFQVFS MLR G+E
Sbjct: 361 NALLDMYAKCGRIDEAFRVFEGMKRRDVYSYTAMIVGLALHGEANWAFQVFSRMLREGVE 420

Query: 421 PNEVTFLGLLMACSHGGLVAEGKKYFFDMSNIYKLRPHVEHYGCMIDLLGRAGFVKEAEE 480
           PNEVTFLGLLMACSH GLV++GKK FFDMSN YKLRP  EHYGCMIDLLGRAG VKEAEE
Sbjct: 421 PNEVTFLGLLMACSHSGLVSDGKKCFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEE 480

Query: 481 IVHKMEIRPDAFAWGALLGACKIHGNVDIGESVMQKLTDLDPDEDATYILMTNLYSSVHR 540
           I+H MEIRPDAFAWGALLGAC+IHGNV++GESVMQKL +LDP ED  YILMTNLYSS HR
Sbjct: 481 IIHSMEIRPDAFAWGALLGACRIHGNVNLGESVMQKLMNLDPGEDGNYILMTNLYSSAHR 540

Query: 541 WKEALKLRKTMKSKKMRKIPGCSLIEVDGVVHEFRKGDKSHPKSKVIYSVLEGIATHLKS 600
           WK+ALKLRK MKSKKMRK PGCSLIEVDGVVHEFRKGDKSHPK++VIYSVLEGIA HLKS
Sbjct: 541 WKDALKLRKKMKSKKMRKTPGCSLIEVDGVVHEFRKGDKSHPKNRVIYSVLEGIACHLKS 600

BLAST of Cla97C10G199660 vs. ExPASy TrEMBL
Match: A0A6J1I3M1 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111470240 PE=4 SV=1)

HSP 1 Score: 1012.7 bits (2617), Expect = 6.5e-292
Identity = 505/605 (83.47%), Positives = 539/605 (89.09%), Query Frame = 0

Query: 1   MNSQE-IHLPHTLHSCKSVTHLKQIHGVAIKTPSLSLPNKLFFPKLISL-SSSSPDLFYI 60
           MNSQE + LPH+L+SC S+ HLKQ+H VAIKTPSLSL N+  F KLISL SSSSPDLFYI
Sbjct: 1   MNSQELLLLPHSLNSCTSIAHLKQLHAVAIKTPSLSLHNQFLFRKLISLSSSSSPDLFYI 60

Query: 61  RSILLTQSSDAQFRLNLCNAIIRSISA-------NSTHLTAMEFLREMLLIGLEPDGFTL 120
           RSILLT  +DAQFRLNLCNA I  ISA       NST L AMEFLREMLLIG++PDGFTL
Sbjct: 61  RSILLTSLADAQFRLNLCNAFIHRISANSNGESTNSTGLRAMEFLREMLLIGVQPDGFTL 120

Query: 121 PHVLKALAWIQGIREGQQIHARSIKTGMVRFNVYVSNTLMRLYSVCGSIDDVQKVFDECP 180
           PHVLKALA +Q IREGQQIHA SIK G+VRFNVYV NTLMRLYSVCGSID VQK+F E P
Sbjct: 121 PHVLKALARVQRIREGQQIHAHSIKIGLVRFNVYVCNTLMRLYSVCGSIDAVQKLFGEYP 180

Query: 181 HRDLVSWTTLIQAFTKAGQYRRAVGAFMEMCDLKLRADGRTLVVVLSACSNLGDVNLGRK 240
           H DLVSWTTLIQAFTKAG YR+AVGAFMEMCDLKLRADGRTLVVVLSACSNLGD+NLGRK
Sbjct: 181 HPDLVSWTTLIQAFTKAGLYRKAVGAFMEMCDLKLRADGRTLVVVLSACSNLGDLNLGRK 240

Query: 241 VHSYIRHHIDTKADVFVGNALIDMYLKCDDLNSANKVFNEMSVRNVVTWNAMISGLAFQG 300
           +HSYI H+ID   DVFVGNAL+DMYLKCDD NSA KVF+EM VRNVVTWNAMI GLA+QG
Sbjct: 241 MHSYIHHYIDVNVDVFVGNALLDMYLKCDDSNSAYKVFDEMPVRNVVTWNAMILGLAYQG 300

Query: 301 RYREALDTFRMMQSKGPKPDEVTLVGVLNSCANLGVLELGKWVHAYIRRNHILADKFVGN 360
           RY+EALD FR MQ  GPKPDEVTLVGVLNSCANLGVLELG+WVHAY+RRN+ILADKFVGN
Sbjct: 301 RYKEALDMFRRMQRTGPKPDEVTLVGVLNSCANLGVLELGRWVHAYMRRNYILADKFVGN 360

Query: 361 ALLDMYAKCGRIDEAFRVFENMKRRDVYSYTAMIVGLALHGEANWAFQVFSEMLRVGIEP 420
           ALLDMYAKCG IDEAFRVFE+MKRRDVYSYTAMIVGLALHGEANWAFQVFS MLR G+EP
Sbjct: 361 ALLDMYAKCGGIDEAFRVFESMKRRDVYSYTAMIVGLALHGEANWAFQVFSRMLREGVEP 420

Query: 421 NEVTFLGLLMACSHGGLVAEGKKYFFDMSNIYKLRPHVEHYGCMIDLLGRAGFVKEAEEI 480
           NEVTFLGLLMACSH GLV++GKKYFFDMSN YKLRP  EHYGCMIDLLGRAG VKEAEEI
Sbjct: 421 NEVTFLGLLMACSHSGLVSDGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEI 480

Query: 481 VHKMEIRPDAFAWGALLGACKIHGNVDIGESVMQKLTDLDPDEDATYILMTNLYSSVHRW 540
           +H MEIRPDAFAWGALLGAC+IHGNV++GESVMQKL +LDP ED  YILMTNLYSS HRW
Sbjct: 481 IHSMEIRPDAFAWGALLGACRIHGNVNLGESVMQKLMNLDPVEDGNYILMTNLYSSAHRW 540

Query: 541 KEALKLRKTMKSKKMRKIPGCSLIEVDGVVHEFRKGDKSHPKSKVIYSVLEGIATHLKSS 597
           K+ LKLRKTMKSKKMRK PGCSLIEVDGVVHEFRKGD SHPKS+VIYSVLEGIA HLKS 
Sbjct: 541 KDTLKLRKTMKSKKMRKTPGCSLIEVDGVVHEFRKGDMSHPKSRVIYSVLEGIACHLKSF 600

BLAST of Cla97C10G199660 vs. ExPASy TrEMBL
Match: A0A6J1DJ70 (pentatricopeptide repeat-containing protein At1g31430-like OS=Momordica charantia OX=3673 GN=LOC111021570 PE=4 SV=1)

HSP 1 Score: 978.8 bits (2529), Expect = 1.0e-281
Identity = 490/612 (80.07%), Positives = 536/612 (87.58%), Query Frame = 0

Query: 1   MNSQEIH-LPHTLHSCKSVTHLKQIHGVAIKTPSLSLPNKLFFPKLISL---SSSSPDLF 60
           MNSQE+H LPH L+SCKS+T LKQIH VAIK  S SL  + F+PKLISL   SSSS DLF
Sbjct: 1   MNSQELHLLPHNLNSCKSITQLKQIHAVAIKAASSSLQKQFFYPKLISLSSASSSSRDLF 60

Query: 61  YIRSILLTQSSDAQFRLNLCNAIIRSISAN-------STHLTAMEFLREMLLIGLEPDGF 120
           YIRSI+L  S DAQF L+LCNAIIR I+AN       ST   AMEFLREMLL+GLEPD F
Sbjct: 61  YIRSIVLNHSDDAQFCLSLCNAIIRGITANSNDRASISTQPMAMEFLREMLLVGLEPDEF 120

Query: 121 TLPHVLKALAWIQGIREGQQIHARSIKTGMVRFNVYVSNTLMRLYSVCGSIDDVQKVFDE 180
           TLP+VLKALA I+G+REGQQIHARSIKTG++RFNVYV+NTLMRLYSVCG ID VQK+FD 
Sbjct: 121 TLPYVLKALARIRGMREGQQIHARSIKTGLLRFNVYVNNTLMRLYSVCGLIDAVQKLFDG 180

Query: 181 CPHRDLVSWTTLIQAFTKAGQYRRAVGAFMEMCDLKLRADGRTLVVVLSACSNLGDVNLG 240
            PHRDLVSW TLIQAFT+AG +RRA+GAF++MCDL LRADGR LVVVLSACSNLGD+NLG
Sbjct: 181 SPHRDLVSWATLIQAFTQAGLHRRAIGAFLKMCDLNLRADGRILVVVLSACSNLGDLNLG 240

Query: 241 RKVHSYIRHHIDTKADVFVGNALIDMYLKCDDLNSANKVFNEMSVRNVVTWNAMISGLAF 300
           RKVHSYIRH+ID  ADVF+GNALIDMYLKC+D NSA +VFNEM VRNVVTWNA+ISGLA+
Sbjct: 241 RKVHSYIRHYIDMNADVFLGNALIDMYLKCNDSNSAYEVFNEMPVRNVVTWNAVISGLAY 300

Query: 301 QGRYREALDTFRMMQSKGPKPDEVTLVGVLNSCANLGVLELGKWVHAYIRRNHILADKFV 360
           QGRYREALD FR MQS G KPDEVTLVGVLNSCANLGVLELGKWVH Y+RRN+ILADKFV
Sbjct: 301 QGRYREALDVFRGMQSAGLKPDEVTLVGVLNSCANLGVLELGKWVHEYMRRNNILADKFV 360

Query: 361 GNALLDMYAKCGRIDEAFRVFENMKRRDVYSYTAMIVGLALHGEANWAFQVFSEMLRVGI 420
           GNALLDMY KCGRIDEAFRVF+ MKRRDVYSYT+MIVGLALHG+AN AF++FSEM RVGI
Sbjct: 361 GNALLDMYGKCGRIDEAFRVFQGMKRRDVYSYTSMIVGLALHGKANSAFRIFSEMSRVGI 420

Query: 421 EPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNIYKLRPHVEHYGCMIDLLGRAGFVKEAE 480
           EPNEVTFLGLLMACSHGGLVAEGKKY FDMSN Y LRP  EHYGCMIDLLGRAG VKEAE
Sbjct: 421 EPNEVTFLGLLMACSHGGLVAEGKKYLFDMSNTYNLRPQAEHYGCMIDLLGRAGLVKEAE 480

Query: 481 EIVHKMEIRPDAFAWGALLGACKIHGNVDIGESVMQKLTDLDPDEDATYILMTNLYSSVH 540
           EI+ +M+I PD FAWGALLGAC+IHGNVD+GE VMQKL DLD +ED  +ILMTNLYSSVH
Sbjct: 481 EIILRMQISPDTFAWGALLGACRIHGNVDLGEGVMQKLMDLDRNEDGAFILMTNLYSSVH 540

Query: 541 RWKEALKLRKTMKSKKMRKIPGCSLIEVDGVVHEFRKGDKSHPKSKVIYSVLEGIATHLK 600
           RWK+AL+LRK MKSKKMRK PGCSLIEVDGVVHEFRKGDKSHPKS+VIY VLE IA+HLK
Sbjct: 541 RWKDALELRKMMKSKKMRKTPGCSLIEVDGVVHEFRKGDKSHPKSRVIYKVLERIASHLK 600

Query: 601 SSGIVEHSAFSL 602
           S GI EH  F L
Sbjct: 601 SHGIGEHGTFFL 612

BLAST of Cla97C10G199660 vs. ExPASy TrEMBL
Match: A0A1S4DVM9 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic isoform X1 OS=Cucumis melo OX=3656 GN=LOC103486837 PE=4 SV=1)

HSP 1 Score: 917.5 bits (2370), Expect = 2.9e-263
Identity = 475/605 (78.51%), Positives = 501/605 (82.81%), Query Frame = 0

Query: 1   MNSQEIHL-PHTLHSCKSVTHLKQIHGVAIKTPSLSLPNKLFFPKLI---SLSSSSPDLF 60
           MNS E+HL PH+LHSCKS++HLKQIHGVAIKTPSLSLPN    PKLI   S SSSSPDLF
Sbjct: 1   MNSLELHLFPHSLHSCKSLSHLKQIHGVAIKTPSLSLPN--LIPKLIFLSSSSSSSPDLF 60

Query: 61  YIRSILLTQSSDAQFRLNLCNAIIRSISANSTHLTAMEFLREMLLIGLEPDGFTLPHVLK 120
           YIRSILLT S DAQFRLNLCNAI+RSIS NST+LT MEFL EMLLIGLEPDGFTLP VLK
Sbjct: 61  YIRSILLTHSHDAQFRLNLCNAIVRSISRNSTNLTPMEFLNEMLLIGLEPDGFTLPLVLK 120

Query: 121 ALAWIQGIREGQQIHARSIKTGMVRFNVYVSNTLMRLYSVCGSIDDVQKVFDECPHRDLV 180
           ALA  +GIREGQQIHARSIKTGMV  NVYV+NTLMRLYSVCGSI DVQKVFDECPHRDLV
Sbjct: 121 ALARTRGIREGQQIHARSIKTGMVGLNVYVTNTLMRLYSVCGSIHDVQKVFDECPHRDLV 180

Query: 181 SWTTLIQAFTKAGQYRRAVGAFMEMCDLKLRADGRTLVVVLSACSNLGDVNLGRKVHSYI 240
           SWT LIQAFTKAG Y RAV AFMEMCDL+LRADGRTLVVVLSACSNLGD+NLG+KVHSYI
Sbjct: 181 SWTILIQAFTKAGLYSRAVEAFMEMCDLRLRADGRTLVVVLSACSNLGDLNLGQKVHSYI 240

Query: 241 RHHIDTKADVFVGNALIDMYLKCDDLNSANKVFNEMSVRNVVTWNAMISGLAFQGRYREA 300
           R++ID  ADVFVGNALIDMYLKCDDLNSANKVF+EM VRNVVTWNAMISGLA+QGRYREA
Sbjct: 241 RYYIDMNADVFVGNALIDMYLKCDDLNSANKVFDEMPVRNVVTWNAMISGLAYQGRYREA 300

Query: 301 LDTFRMMQSKGPKPDEVTLVGVLNSCANLGVLELGKWVHAYIRRNHILADKFVGNALLDM 360
           LDTFR+MQ+KG KPDEVTLVGVLNSCANLGVLE                           
Sbjct: 301 LDTFRIMQNKGVKPDEVTLVGVLNSCANLGVLE--------------------------- 360

Query: 361 YAKCGRIDEAFRVFENMKRRDVYSYTAMIVGLALHGEANWAFQVFSEMLRVGIEPNEVTF 420
                                          +ALHGEANWAFQVFSEM RVGIEPNEVTF
Sbjct: 361 -------------------------------IALHGEANWAFQVFSEMFRVGIEPNEVTF 420

Query: 421 LGLLMACSHGGLVAEGKKYFFDMSNIYKLRPHVEHYGCMIDLLGRAGFVKEAEEIVHKME 480
           LGLLMACSHGGLVAEGKKYFF+MS+ YKLRP  EHYGCMIDLLGR G VKEAEEIVHKME
Sbjct: 421 LGLLMACSHGGLVAEGKKYFFEMSDKYKLRPQSEHYGCMIDLLGRVGLVKEAEEIVHKME 480

Query: 481 IRPDAFAWGALLGACKIHGNVDIGESVMQKLTDLDPDEDATYILMTNLYSSVHRWKEALK 540
           IRPD FA GALLGAC+IHGNVDIGESVMQKLT++DPDED TYILMTNLYSSVHRWK+A K
Sbjct: 481 IRPDVFACGALLGACRIHGNVDIGESVMQKLTEIDPDEDGTYILMTNLYSSVHRWKDASK 540

Query: 541 LRKTMKSKKMRKIPGCSLIEVDGVVHEFRKGDKSHPKSKVIYSVLEGIATHLKSSGIVEH 600
           LRKTMK KKMRK PGCS IEVDGVVHEFRKGDKSHP+SKVIY VLEGIATHLKS GIVEH
Sbjct: 541 LRKTMKIKKMRKTPGCSSIEVDGVVHEFRKGDKSHPRSKVIYFVLEGIATHLKSYGIVEH 545

Query: 601 SAFSL 602
           S F +
Sbjct: 601 STFCI 545

BLAST of Cla97C10G199660 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 462.2 bits (1188), Expect = 6.3e-130
Identity = 234/625 (37.44%), Positives = 370/625 (59.20%), Query Frame = 0

Query: 12  LHSCKSVTHLKQIHGVAIKTPSLSLPNKLF-FPKLISLSSSSPDLFYIRSILLTQSSDAQ 71
           LH+CK++  L+ IH   IK   + L N  +   KLI     SP    +   +    +  +
Sbjct: 40  LHNCKTLQSLRIIHAQMIK---IGLHNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQE 99

Query: 72  FRLNLCNAIIRSISANSTHLTAMEFLREMLLIGLEPDGFTLPHVLKALAWIQGIREGQQI 131
             L + N + R  + +S  ++A++    M+ +GL P+ +T P VLK+ A  +  +EGQQI
Sbjct: 100 PNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQI 159

Query: 132 HARSIKTGMVRFNVYVSNTLMRLYSVCGSIDDVQKVFDECPHR----------------- 191
           H   +K G    ++YV  +L+ +Y   G ++D  KVFD+ PHR                 
Sbjct: 160 HGHVLKLG-CDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGY 219

Query: 192 --------------DLVSWTTLIQAFTKAGQYRRAVGAFMEMCDLKLRADGRTLVVVLSA 251
                         D+VSW  +I  + + G Y+ A+  F +M    +R D  T+V V+SA
Sbjct: 220 IENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSA 279

Query: 252 CSNLGDVNLGRKVHSYIRHHIDTKADVFVGNALIDMYLKCDDLNSANKVFNEMSVRNVVT 311
           C+  G + LGR+VH +I  H    +++ + NALID+Y KC +L +A  +F  +  ++V++
Sbjct: 280 CAQSGSIELGRQVHLWIDDH-GFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVIS 339

Query: 312 WNAMISGLAFQGRYREALDTFRMMQSKGPKPDEVTLVGVLNSCANLGVLELGKWVHAYI- 371
           WN +I G      Y+EAL  F+ M   G  P++VT++ +L +CA+LG +++G+W+H YI 
Sbjct: 340 WNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYID 399

Query: 372 -RRNHILADKFVGNALLDMYAKCGRIDEAFRVFENMKRRDVYSYTAMIVGLALHGEANWA 431
            R   +     +  +L+DMYAKCG I+ A +VF ++  + + S+ AMI G A+HG A+ +
Sbjct: 400 KRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADAS 459

Query: 432 FQVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNIYKLRPHVEHYGCMID 491
           F +FS M ++GI+P+++TF+GLL ACSH G++  G+  F  M+  YK+ P +EHYGCMID
Sbjct: 460 FDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMID 519

Query: 492 LLGRAGFVKEAEEIVHKMEIRPDAFAWGALLGACKIHGNVDIGESVMQKLTDLDPDEDAT 551
           LLG +G  KEAEE+++ ME+ PD   W +LL ACK+HGNV++GES  + L  ++P+   +
Sbjct: 520 LLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGS 579

Query: 552 YILMTNLYSSVHRWKEALKLRKTMKSKKMRKIPGCSLIEVDGVVHEFRKGDKSHPKSKVI 603
           Y+L++N+Y+S  RW E  K R  +  K M+K+PGCS IE+D VVHEF  GDK HP+++ I
Sbjct: 580 YVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREI 639

BLAST of Cla97C10G199660 vs. TAIR 10
Match: AT2G22410.1 (SLOW GROWTH 1 )

HSP 1 Score: 416.4 bits (1069), Expect = 4.0e-116
Identity = 219/624 (35.10%), Positives = 352/624 (56.41%), Query Frame = 0

Query: 12  LHSCKSVTHLKQIHGVAIKTPSLSLPNKLFFPKLISLSSSSPDLFYIRSILLTQSSDAQF 71
           L  CK + HLKQI    I    +  P      +LI+  + S   +   S+ + +  +   
Sbjct: 60  LEKCKLLLHLKQIQAQMIINGLILDP--FASSRLIAFCALSESRYLDYSVKILKGIENP- 119

Query: 72  RLNLCNAIIRSISANSTHLTAMEFLREMLLIGL---EPDGFTLPHVLKALAWIQGIREGQ 131
            +   N  IR  S +     +    ++ML  G     PD FT P + K  A ++    G 
Sbjct: 120 NIFSWNVTIRGFSESENPKESFLLYKQMLRHGCCESRPDHFTYPVLFKVCADLRLSSLGH 179

Query: 132 QIHARSIKTGMVRFNVYVSNTLMRLYSVCGSIDDVQKVFDECPHRDLVSWTTLIQAFTKA 191
            I    +K  +   + +V N  + +++ CG +++ +KVFDE P RDLVSW  LI  + K 
Sbjct: 180 MILGHVLKLRLELVS-HVHNASIHMFASCGDMENARKVFDESPVRDLVSWNCLINGYKKI 239

Query: 192 GQYRRAVGAFMEMCDLKLRADGRTLVVVLSACSNLGDVNLGRKVHSYIRHHIDTKADVFV 251
           G+  +A+  +  M    ++ D  T++ ++S+CS LGD+N G++ + Y++ +   +  + +
Sbjct: 240 GEAEKAIYVYKLMESEGVKPDDVTMIGLVSSCSMLGDLNRGKEFYEYVKEN-GLRMTIPL 299

Query: 252 GNALIDMYLKCDDLNSANKVFNEMSVRNVVTWNAMISGLAFQG----------------- 311
            NAL+DM+ KC D++ A ++F+ +  R +V+W  MISG A  G                 
Sbjct: 300 VNALMDMFSKCGDIHEARRIFDNLEKRTIVSWTTMISGYARCGLLDVSRKLFDDMEEKDV 359

Query: 312 --------------RYREALDTFRMMQSKGPKPDEVTLVGVLNSCANLGVLELGKWVHAY 371
                         R ++AL  F+ MQ+   KPDE+T++  L++C+ LG L++G W+H Y
Sbjct: 360 VLWNAMIGGSVQAKRGQDALALFQEMQTSNTKPDEITMIHCLSACSQLGALDVGIWIHRY 419

Query: 372 IRRNHILADKFVGNALLDMYAKCGRIDEAFRVFENMKRRDVYSYTAMIVGLALHGEANWA 431
           I +  +  +  +G +L+DMYAKCG I EA  VF  ++ R+  +YTA+I GLALHG+A+ A
Sbjct: 420 IEKYSLSLNVALGTSLVDMYAKCGNISEALSVFHGIQTRNSLTYTAIIGGLALHGDASTA 479

Query: 432 FQVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNIYKLRPHVEHYGCMID 491
              F+EM+  GI P+E+TF+GLL AC HGG++  G+ YF  M + + L P ++HY  M+D
Sbjct: 480 ISYFNEMIDAGIAPDEITFIGLLSACCHGGMIQTGRDYFSQMKSRFNLNPQLKHYSIMVD 539

Query: 492 LLGRAGFVKEAEEIVHKMEIRPDAFAWGALLGACKIHGNVDIGESVMQKLTDLDPDEDAT 551
           LLGRAG ++EA+ ++  M +  DA  WGALL  C++HGNV++GE   +KL +LDP +   
Sbjct: 540 LLGRAGLLEEADRLMESMPMEADAAVWGALLFGCRMHGNVELGEKAAKKLLELDPSDSGI 599

Query: 552 YILMTNLYSSVHRWKEALKLRKTMKSKKMRKIPGCSLIEVDGVVHEFRKGDKSHPKSKVI 602
           Y+L+  +Y   + W++A + R+ M  + + KIPGCS IEV+G+V EF   DKS P+S+ I
Sbjct: 600 YVLLDGMYGEANMWEDAKRARRMMNERGVEKIPGCSSIEVNGIVCEFIVRDKSRPESEKI 659

BLAST of Cla97C10G199660 vs. TAIR 10
Match: AT1G31430.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 408.3 bits (1048), Expect = 1.1e-113
Identity = 213/546 (39.01%), Positives = 321/546 (58.79%), Query Frame = 0

Query: 73  LNLCNAIIRSISANSTHLTAMEFLREMLLIGLEPDGFTLPHVLKALAWIQGIREGQQIHA 132
           L + N +++S++   +    +    E+   GL PD FTLP VLK++  ++ + EG+++H 
Sbjct: 11  LLMYNKMLKSLADGKSFTKVLALFGELRGQGLYPDNFTLPVVLKSIGRLRKVIEGEKVHG 70

Query: 133 RSIKTGMVRFNVYVSNTLMRLYSVCGSIDDVQKVFDECPHRDLVSWTTLIQAFTKAGQYR 192
            ++K G+  F+ YVSN+LM +Y+  G I+   KVFDE P RD+VSW  LI ++   G++ 
Sbjct: 71  YAVKAGL-EFDSYVSNSLMGMYASLGKIEITHKVFDEMPQRDVVSWNGLISSYVGNGRFE 130

Query: 193 RAVGAFMEMC-DLKLRADGRTLVVVLSACSNLGDVNLGRKVHSYIRHHIDTKADVFVGNA 252
            A+G F  M  +  L+ D  T+V  LSACS L ++ +G +++ ++    + +  V +GNA
Sbjct: 131 DAIGVFKRMSQESNLKFDEGTIVSTLSACSALKNLEIGERIYRFV--VTEFEMSVRIGNA 190

Query: 253 LIDMYLKCDDLNSANKVFNEM-------------------------------SVRNVVTW 312
           L+DM+ KC  L+ A  VF+ M                                V++VV W
Sbjct: 191 LVDMFCKCGCLDKARAVFDSMRDKNVKCWTSMVFGYVSTGRIDEARVLFERSPVKDVVLW 250

Query: 313 NAMISGLAFQGRYREALDTFRMMQSKGPKPDEVTLVGVLNSCANLGVLELGKWVHAYIRR 372
            AM++G     R+ EAL+ FR MQ+ G +PD   LV +L  CA  G LE GKW+H YI  
Sbjct: 251 TAMMNGYVQFNRFDEALELFRCMQTAGIRPDNFVLVSLLTGCAQTGALEQGKWIHGYINE 310

Query: 373 NHILADKFVGNALLDMYAKCGRIDEAFRVFENMKRRDVYSYTAMIVGLALHGEANWAFQV 432
           N +  DK VG AL+DMYAKCG I+ A  VF  +K RD  S+T++I GLA++G +  A  +
Sbjct: 311 NRVTVDKVVGTALVDMYAKCGCIETALEVFYEIKERDTASWTSLIYGLAMNGMSGRALDL 370

Query: 433 FSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNIYKLRPHVEHYGCMIDLLG 492
           + EM  VG+  + +TF+ +L AC+HGG VAEG+K F  M+  + ++P  EH  C+IDLL 
Sbjct: 371 YYEMENVGVRLDAITFVAVLTACNHGGFVAEGRKIFHSMTERHNVQPKSEHCSCLIDLLC 430

Query: 493 RAGFVKEAEEIVHKMEIRPDAF---AWGALLGACKIHGNVDIGESVMQKLTDLDPDEDAT 552
           RAG + EAEE++ KM    D      + +LL A + +GNV I E V +KL  ++  + + 
Sbjct: 431 RAGLLDEAEELIDKMRGESDETLVPVYCSLLSAARNYGNVKIAERVAEKLEKVEVSDSSA 490

Query: 553 YILMTNLYSSVHRWKEALKLRKTMKSKKMRKIPGCSLIEVDGVVHEFRKGDK--SHPKSK 582
           + L+ ++Y+S +RW++   +R+ MK   +RK PGCS IE+DGV HEF  GD   SHPK  
Sbjct: 491 HTLLASVYASANRWEDVTNVRRKMKDLGIRKFPGCSSIEIDGVGHEFIVGDDLLSHPKMD 550

BLAST of Cla97C10G199660 vs. TAIR 10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 405.6 bits (1041), Expect = 7.0e-113
Identity = 217/617 (35.17%), Positives = 350/617 (56.73%), Query Frame = 0

Query: 12  LHSCKSVTHLKQIHGVAIKTPSLSLP---NKLFFPKLISLSSSSPDLFYIRSILLTQSSD 71
           +  C S+  LKQ HG  I+T + S P   +KLF    ++  SS   L Y R +       
Sbjct: 37  IERCVSLRQLKQTHGHMIRTGTFSDPYSASKLF---AMAALSSFASLEYARKVFDEIPKP 96

Query: 72  AQFRLNLCNAIIRS-ISANSTHLTAMEFLREMLLIGLEPDGFTLPHVLKALAWIQGIREG 131
             F     N +IR+  S     L+   FL  +      P+ +T P ++KA A +  +  G
Sbjct: 97  NSF---AWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLG 156

Query: 132 QQIHARSIKTGMVRFNVYVSNTLMRLYSVCGSIDDVQKVFDECPHRDLVSWTTLIQAFTK 191
           Q +H  ++K+  V  +V+V+N+L+  Y  CG +D   KVF     +D+VSW ++I  F +
Sbjct: 157 QSLHGMAVKSA-VGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQ 216

Query: 192 AGQYRRAVGAFMEMCDLKLRADGRTLVVVLSACSNLGDVNLGRKVHSYIRHHIDTKADVF 251
            G   +A+  F +M    ++A   T+V VLSAC+ + ++  GR+V SYI  +     ++ 
Sbjct: 217 KGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEEN-RVNVNLT 276

Query: 252 VGNALIDMYLKC-------------------------------DDLNSANKVFNEMSVRN 311
           + NA++DMY KC                               +D  +A +V N M  ++
Sbjct: 277 LANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKD 336

Query: 312 VVTWNAMISGLAFQGRYREALDTFRMMQ-SKGPKPDEVTLVGVLNSCANLGVLELGKWVH 371
           +V WNA+IS     G+  EAL  F  +Q  K  K +++TLV  L++CA +G LELG+W+H
Sbjct: 337 IVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIH 396

Query: 372 AYIRRNHILADKFVGNALLDMYAKCGRIDEAFRVFENMKRRDVYSYTAMIVGLALHGEAN 431
           +YI+++ I  +  V +AL+ MY+KCG ++++  VF ++++RDV+ ++AMI GLA+HG  N
Sbjct: 397 SYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGN 456

Query: 432 WAFQVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNIYKLRPHVEHYGCM 491
            A  +F +M    ++PN VTF  +  ACSH GLV E +  F  M + Y + P  +HY C+
Sbjct: 457 EAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACI 516

Query: 492 IDLLGRAGFVKEAEEIVHKMEIRPDAFAWGALLGACKIHGNVDIGESVMQKLTDLDPDED 551
           +D+LGR+G++++A + +  M I P    WGALLGACKIH N+++ E    +L +L+P  D
Sbjct: 517 VDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRND 576

Query: 552 ATYILMTNLYSSVHRWKEALKLRKTMKSKKMRKIPGCSLIEVDGVVHEFRKGDKSHPKSK 593
             ++L++N+Y+ + +W+   +LRK M+   ++K PGCS IE+DG++HEF  GD +HP S+
Sbjct: 577 GAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSE 636

BLAST of Cla97C10G199660 vs. TAIR 10
Match: AT4G14820.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 404.8 bits (1039), Expect = 1.2e-112
Identity = 214/614 (34.85%), Positives = 346/614 (56.35%), Query Frame = 0

Query: 12  LHSCKSVTHLKQIHGVAIKTPSLSLPNKLFFPKLISLSSSSPDLFYIRSILLTQSSDAQF 71
           L  CKS+ H+KQ+H   ++T      N   F   +S+SSSS +L Y  ++  +  S  + 
Sbjct: 19  LSFCKSLNHIKQLHAHILRTVINHKLNSFLFN--LSVSSSSINLSYALNVFSSIPSPPE- 78

Query: 72  RLNLCNAIIRSISANSTHLTAMEFLREMLLIGLEPDGFTLPHVLKALAWIQGIREGQQIH 131
              + N  +R +S +S     + F + +  +G   D F+   +LKA++ +  + EG ++H
Sbjct: 79  -SIVFNPFLRDLSRSSEPRATILFYQRIRHVGGRLDQFSFLPILKAVSKVSALFEGMELH 138

Query: 132 ARSIKTGMVRFNVYVSNTLMRLYSVCGSIDDVQKVFDECPHRDLVSWTTLIQAFTKAGQY 191
             + K   +  + +V    M +Y+ CG I+  + VFDE  HRD+V+W T+I+ + + G  
Sbjct: 139 GVAFKIATL-CDPFVETGFMDMYASCGRINYARNVFDEMSHRDVVTWNTMIERYCRFGLV 198

Query: 192 RRAVGAFMEMCDLKLRADGRTLVVVLSACSNLGDVNLGRKVHSYIRHHIDTKADVFVGNA 251
             A   F EM D  +  D   L  ++SAC   G++   R ++ ++  + D + D  +  A
Sbjct: 199 DEAFKLFEEMKDSNVMPDEMILCNIVSACGRTGNMRYNRAIYEFLIEN-DVRMDTHLLTA 258

Query: 252 LIDMYLKCDDLNSANKVFNEMSVRNVVTWNAMISGLAFQGRY------------------ 311
           L+ MY     ++ A + F +MSVRN+    AM+SG +  GR                   
Sbjct: 259 LVTMYAGAGCMDMAREFFRKMSVRNLFVSTAMVSGYSKCGRLDDAQVIFDQTEKKDLVCW 318

Query: 312 -------------REALDTFRMMQSKGPKPDEVTLVGVLNSCANLGVLELGKWVHAYIRR 371
                        +EAL  F  M   G KPD V++  V+++CANLG+L+  KWVH+ I  
Sbjct: 319 TTMISAYVESDYPQEALRVFEEMCCSGIKPDVVSMFSVISACANLGILDKAKWVHSCIHV 378

Query: 372 NHILADKFVGNALLDMYAKCGRIDEAFRVFENMKRRDVYSYTAMIVGLALHGEANWAFQV 431
           N + ++  + NAL++MYAKCG +D    VFE M RR+V S+++MI  L++HGEA+ A  +
Sbjct: 379 NGLESELSINNALINMYAKCGGLDATRDVFEKMPRRNVVSWSSMINALSMHGEASDALSL 438

Query: 432 FSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNIYKLRPHVEHYGCMIDLLG 491
           F+ M +  +EPNEVTF+G+L  CSH GLV EGKK F  M++ Y + P +EHYGCM+DL G
Sbjct: 439 FARMKQENVEPNEVTFVGVLYGCSHSGLVEEGKKIFASMTDEYNITPKLEHYGCMVDLFG 498

Query: 492 RAGFVKEAEEIVHKMEIRPDAFAWGALLGACKIHGNVDIGESVMQKLTDLDPDEDATYIL 551
           RA  ++EA E++  M +  +   WG+L+ AC+IHG +++G+   +++ +L+PD D   +L
Sbjct: 499 RANLLREALEVIESMPVASNVVIWGSLMSACRIHGELELGKFAAKRILELEPDHDGALVL 558

Query: 552 MTNLYSSVHRWKEALKLRKTMKSKKMRKIPGCSLIEVDGVVHEFRKGDKSHPKSKVIYSV 595
           M+N+Y+   RW++   +R+ M+ K + K  G S I+ +G  HEF  GDK H +S  IY+ 
Sbjct: 559 MSNIYAREQRWEDVRNIRRVMEEKNVFKEKGLSRIDQNGKSHEFLIGDKRHKQSNEIYAK 618
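
The percentages on each HSP summary line above follow directly from the raw counts (matches divided by alignment length). The sketch below recomputes them from one such line, as a quick consistency check. The function name `hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Matches the per-HSP summary line; `Pos\w+` tolerates spelling variants
# of the "Positives" label.
HSP_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Pos\w+ = (\d+)/(\d+) \([\d.]+%\)"
)


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_LINE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)


# Summary line of the A0A5A7UKB6 hit listed above.
example = "Identity = 528/605 (87.27%), Positives = 558/605 (92.23%), Query Frame = 0"
print(hsp_percentages(example))
```

For the A0A5A7UKB6 hit this reproduces the reported 87.27% identity and 92.23% positives.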

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_038902993.1	0.0e+00	91.01	pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Benin...
KAA0053939.1	6.2e-305	87.27	pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK25465...
XP_031737800.1	6.9e-304	86.63	pentatricopeptide repeat-containing protein At1g08070, chloroplastic isoform X1 ...
XP_023521817.1	1.6e-300	84.87	pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isofor...
KAG6596202.1	6.0e-300	84.73	Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a...
Match Name	E-value	Identity	Description
Q9LN01	8.9e-129	37.44	Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
Q9SJZ3	5.6e-115	35.10	Pentatricopeptide repeat-containing protein At2g22410, mitochondrial OS=Arabidop...
Q9C866	1.5e-112	39.01	Pentatricopeptide repeat-containing protein At1g31430 OS=Arabidopsis thaliana OX...
O82380	9.9e-112	35.17	Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...
O23337	1.7e-111	34.85	Pentatricopeptide repeat-containing protein At4g14820 OS=Arabidopsis thaliana OX...
Match Name	E-value	Identity	Description
A0A5A7UKB6	3.0e-305	87.27	Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A6J1G7N7	3.8e-300	84.92	pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isofor...
A0A6J1I3M1	6.5e-292	83.47	pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isofor...
A0A6J1DJ70	1.0e-281	80.07	pentatricopeptide repeat-containing protein At1g31430-like OS=Momordica charanti...
A0A1S4DVM9	2.9e-263	78.51	pentatricopeptide repeat-containing protein At1g08070, chloroplastic isoform X1 ...
Match Name	E-value	Identity	Description
AT1G08070.1	6.3e-130	37.44	Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G22410.1	4.0e-116	35.10	SLOW GROWTH 1
AT1G31430.1	1.1e-113	39.01	Pentatricopeptide repeat (PPR-like) superfamily protein
AT2G29760.1	7.0e-113	35.17	Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G14820.1	1.2e-112	34.85	Pentatricopeptide repeat (PPR) superfamily protein
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 394..566; e-value: 6.9E-31; score: 109.8
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 75..241; e-value: 5.5E-26; score: 93.7
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 242..344; e-value: 2.0E-22; score: 82.1
IPR011990	Tetratricopeptide-like helical domain superfamily	SUPERFAMILY	48452	TPR-like	coord: 256..537
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 176..203; e-value: 4.4E-5; score: 23.4
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 517..545; e-value: 0.78; score: 10.1
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 148..170; e-value: 0.37; score: 11.1
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 451..476; e-value: 7.4E-4; score: 19.6
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 275..324; e-value: 1.1E-12; score: 47.9
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 377..423; e-value: 2.7E-9; score: 37.1
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 278..312; e-value: 9.9E-8; score: 29.7
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 452..476; e-value: 0.0011; score: 17.0
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 176..209; e-value: 1.2E-4; score: 20.0
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 379..413; e-value: 1.1E-6; score: 26.5
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 351..378; e-value: 3.7E-6; score: 24.8
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 346..376; score: 10.522905
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 276..310; score: 12.769972
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 174..208; score: 9.744654
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 377..411; score: 11.246351
None	No IPR available	PANTHER	PTHR47928	REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED	coord: 12..598
None	No IPR available	PANTHER	PTHR47928:SF107	REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED	coord: 12..598
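
The `coord: start..end` fields above give the residue range of each predicted domain or repeat on the 602-residue polypeptide. As a minimal sketch, the ranges can be pulled out of the text and ordered along the protein. It assumes the coordinates are 1-based and inclusive, which the page does not state. The helper name `motif_spans` is hypothetical.

```python
import re

# Captures the start and end residue of every "coord: a..b" field.
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")


def motif_spans(text):
    """Return sorted (start, end, length) tuples for each coordinate range.

    Length is computed as end - start + 1, i.e. assuming inclusive,
    1-based residue coordinates (an assumption, not stated by the source).
    """
    spans = [(int(a), int(b)) for a, b in COORD.findall(text)]
    return sorted((a, b, b - a + 1) for a, b in spans)


# The four PROSITE PS51375 (PPR) hits listed in the table above.
prosite_hits = """
coord: 346..376
coord: 276..310
coord: 174..208
coord: 377..411
"""

for start, end, length in motif_spans(prosite_hits):
    print(f"PPR repeat {start}-{end} ({length} residues)")
```

Under that assumption, three of the four PROSITE hits span 35 residues, the canonical length of a pentatricopeptide repeat, and the fourth spans 31.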

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cla97C10G199660.1	Cla97C10G199660.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
molecular_function	GO:0005515	protein binding