Clc09G17420 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc09G17420
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: Pentatricopeptide repeat-containing protein
Location: ClcChr09: 28477782 .. 28480548 (+)
RNA-Seq Expression: Clc09G17420
Synteny: Clc09G17420
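
The location string in the overview can be split into its parts and used to compute the length of the gene span. The following is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive (as in GFF3), which the page itself does not state.

```python
import re

def parse_location(text):
    """Split a location such as 'ClcChr09: 28477782 .. 28480548 (+)'.

    Returns (chromosome, start, end, strand, span_length).
    Assumption: start and end are 1-based, inclusive coordinates.
    """
    match = re.fullmatch(
        r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chromosome, start, end, strand = match.groups()
    start, end = int(start), int(end)
    # Inclusive coordinates: both endpoints count towards the length.
    return chromosome, start, end, strand, end - start + 1

print(parse_location("ClcChr09: 28477782 .. 28480548 (+)"))
# ('ClcChr09', 28477782, 28480548, '+', 2767)
```

Under the inclusive-coordinate assumption the gene spans 2767 bp on the forward strand of ClcChr09.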
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CGAAACTTAAAAACAAAATACAAGGAGTTGACAAGCAAACAGAAATTGATTTTTTCCAAATTTTTTTTGTTCCCTCACCTTCTCCCTCGTGAACGCGTCGCACAACACCCCTCCCTCCTTTGTTAACCTTTCTCCAGCCGCCGGCTTTTCTTCCTCTCCGGCGACGTCCAACGACTGTGCGCCGACTGTCCGCGGCGTGTTTTCTTCTGTCTCTTGGTAATAACCCCAACGGACCAAAGTTAATTGACTTGCCAGAACCATGCGTTTGCGTTCCCCGATCTGAATACCTATGGCCAGTAAACCTCAGATTTGTGAATTTGAACCTTATTTCTCTTCGGAGGCTGCGATGAGCTGCTTCAACGCCGTCGCCGCCGCCACTAGCCCCCTATAGATGAATGGGTTCTCTATCTCACATGGTGCCCCTCTGCAATGCCCTCTTTGTCTTCCCAAGCCCTCAAGAAACCCACTTTGTTTCGCCCCAAATCCGAGCAACGCCCAGATTCGACTCGCCTCTGTATCGTTCAGTCCCTTCTCAACCTCTCTTCTCAAGGTCACCTCCCGGAAGCCCTCTCCTACCTCGACCCATTGGCCCAAAGAGGCATACGCTTACCCACTAGCACTTTCGTCAACCTCTTGCGACTCTGTGCCAAAGCCAAGTTCTTTAAAGGAGGTAGATGCGTTCATCTACATTTGAAACACACGGGTTTCAAACGCCCTACCACTATTGTAGCCAACCACTTGATTGGTATGTACTTTGAATGTGGTAATGACATAGAGGCACGTAAGGTGTTTGATAAAATGTCTTTAAGAAATTTGTACTCTTGGAACCATATGCTTGCTGGGTATGCTAAGTTGGGAAATGTACATCATGCTAGGAATTTGTTTGATAGAATGACGGAGAAGGATGTTGTTTCGTGGAATACTATGGTTCTTGCTTATGCTAAGAAGGGGTATTTCAATGAAGCTATTGGGTTACATAGAGACTTCAGGAGACTCGATATGGGGTTTAATGAGTTTAGTTTTGCTGGTGTTTTGATTCTTTGCGTGAAGCTTAAGGAATTGCAGCTCACGAAGCAAGTTCACGGGCAGGTACTGGTTGTTGGATTCTTGTCTAATGTAGTTCTTTCTAGTTCAATTGTTGATGCGTATGCAAAATGTGGAGAGATGGGTTGTGCAAGGAGATTGTTTGACGAAATGCTCGTCAAAGATATCCTTGCATGGACCACAATGGTCTCTGGATATGCTAAATGGGGTGATATGAATTCTGCTAGCGAATTGTTTCACCAAATGCCTGAAAAGAATCCTGTCTCCTGGACAGCTCTGATATCAGGCTATGCAAGAAATAGTTTAGGGCATGAAGCACTTGATTACTTCACAAAAATGATGAAGTTTCGAATTAATCCTGACCAATATACATTCAGTAGTTGTCTCTGCGCTTGTGCCAGCATTGCTGCACTGAAGCATGGTAAACAATTACATGCCTATTTGATCAGAACCAACTTCAGATGCAACACAATAGTCGTTAGCTCTCTCATTGACATGTATTCGAAGTGTGGCATGTTAGAAGCTAGCTGCCGCATTTTTTACCTTATGGGAAATAAGCAGGATGTTGTCTTGTGGAATACAATGATATCTGCCCTAGCTCAGCATGGTCATGGGGAAGAGGCAATGCAGATGTTCAATGACATGGTTGAATCAGGATTGAAGCCTGATAGGATCACCTTCATTGTGATCCTTAGTGCGTGTAGTCATTCAGGTCTTGTGCAAGAAGGACTTCGGTTTTTCAAGGCCATGACCTATGATCATGGTGTTCTCCCAGATCAAGAACATTATGCATGCTTAATTGACCTCCTGGGTCGAGCTGGGTGTTTTATTGAGTTGGTAAATGAGCTAGAGAAGATGTCCTGTAAACCTGATGATCGGGTATGGAATGCATTACTCGGAGTCTGTAGGATACATGGTAATATAGAGCTTGGAAGAAAAGTGGCGGAGCATGTAAT
CGAGCTGGAGCCTCAATCTTCTGCAGCTTATGTGTCTCTTGCAAGTTTATATGCTTTTCTTGGGAAATGGGAGTCAGTAGAAAAGGTCAGGGAACTAATGGAAGAGAGATTTGTTAGGAAAGAACGTGCAATAAGTTGGATTGACATTGGAAATAAGGTACATTCTTTCATTGCATCTGATAGATTACATCCACTGAAAGAAGAAATATACTCGCTATTGGAGCAGTTAGCCAGCCATACAGAAGATTTTTCAATCATTTGAAAAGAAGTTAAGGTTATTGCTGTGCAGGTTGTTTTCAAGGAGACCTGTTGCAAAGTCTTACCTGTTAAGAATGGAGGTTGCTGATTAGTTCCAAAGAGAATGTGTGTTTTAAGTTGGGAAATTTCACACAAGTGATCTAATACATAGGATGATAAAATGCTTGATATTTACACCATGGCCAGATTTTTCTTTTTCTCTGGGTCAACATTTGCCAAAGCCGAACCAATGGGAATTACTCCTTGAGCATAATTTGTTTATCTCACTGTTATTCAATAATAATGGCTTAGAGTTAGTTTCTCAATTCTACTTTTTGTTTGTACAATTTGTGTTGTAAATTTTTGTCATTGTAGATTTAGCCAAGATTTCCATGATGCTTTTAAAGGGTCGCCAAGGGAGTGACTTTAGAAAGCTAACTCTTAGGCCGAGTTCAATTAACTCCTGTTTGATAATTATCTTTGTTTCTTGTTTCTTGAAAAAGAATAAAAAATTTAAATGTGGTTGATAA

mRNA sequence

CGAAACTTAAAAACAAAATACAAGGAGTTGACAAGCAAACAGAAATTGATTTTTTCCAAATTTTTTTTGTTCCCTCACCTTCTCCCTCGTGAACGCGTCGCACAACACCCCTCCCTCCTTTGTTAACCTTTCTCCAGCCGCCGGCTTTTCTTCCTCTCCGGCGACGTCCAACGACTGTGCGCCGACTGTCCGCGGCGTGTTTTCTTCTGTCTCTTGGTAATAACCCCAACGGACCAAAGTTAATTGACTTGCCAGAACCATGCGTTTGCGTTCCCCGATCTGAATACCTATGGCCAGTAAACCTCAGATTTGTGAATTTGAACCTTATTTCTCTTCGGAGGCTGCGATGAGCTGCTTCAACGCCGTCGCCGCCGCCACTAGCCCCCTATAGATGAATGGGTTCTCTATCTCACATGGTGCCCCTCTGCAATGCCCTCTTTGTCTTCCCAAGCCCTCAAGAAACCCACTTTGTTTCGCCCCAAATCCGAGCAACGCCCAGATTCGACTCGCCTCTGTATCGTTCAGTCCCTTCTCAACCTCTCTTCTCAAGGTCACCTCCCGGAAGCCCTCTCCTACCTCGACCCATTGGCCCAAAGAGGCATACGCTTACCCACTAGCACTTTCGTCAACCTCTTGCGACTCTGTGCCAAAGCCAAGTTCTTTAAAGGAGGTAGATGCGTTCATCTACATTTGAAACACACGGGTTTCAAACGCCCTACCACTATTGTAGCCAACCACTTGATTGGTATGTACTTTGAATGTGGTAATGACATAGAGGCACGTAAGGTGTTTGATAAAATGTCTTTAAGAAATTTGTACTCTTGGAACCATATGCTTGCTGGGTATGCTAAGTTGGGAAATGTACATCATGCTAGGAATTTGTTTGATAGAATGACGGAGAAGGATGTTGTTTCGTGGAATACTATGGTTCTTGCTTATGCTAAGAAGGGGTATTTCAATGAAGCTATTGGGTTACATAGAGACTTCAGGAGACTCGATATGGGGTTTAATGAGTTTAGTTTTGCTGGTGTTTTGATTCTTTGCGTGAAGCTTAAGGAATTGCAGCTCACGAAGCAAGTTCACGGGCAGGTACTGGTTGTTGGATTCTTGTCTAATGTAGTTCTTTCTAGTTCAATTGTTGATGCGTATGCAAAATGTGGAGAGATGGGTTGTGCAAGGAGATTGTTTGACGAAATGCTCGTCAAAGATATCCTTGCATGGACCACAATGGTCTCTGGATATGCTAAATGGGGTGATATGAATTCTGCTAGCGAATTGTTTCACCAAATGCCTGAAAAGAATCCTGTCTCCTGGACAGCTCTGATATCAGGCTATGCAAGAAATAGTTTAGGGCATGAAGCACTTGATTACTTCACAAAAATGATGAAGTTTCGAATTAATCCTGACCAATATACATTCAGTAGTTGTCTCTGCGCTTGTGCCAGCATTGCTGCACTGAAGCATGGTAAACAATTACATGCCTATTTGATCAGAACCAACTTCAGATGCAACACAATAGTCGTTAGCTCTCTCATTGACATGTATTCGAAGTGTGGCATGTTAGAAGCTAGCTGCCGCATTTTTTACCTTATGGGAAATAAGCAGGATGTTGTCTTGTGGAATACAATGATATCTGCCCTAGCTCAGCATGGTCATGGGGAAGAGGCAATGCAGATGTTCAATGACATGGTTGAATCAGGATTGAAGCCTGATAGGATCACCTTCATTGTGATCCTTAGTGCGTGTAGTCATTCAGGTCTTGTGCAAGAAGGACTTCGGTTTTTCAAGGCCATGACCTATGATCATGGTGTTCTCCCAGATCAAGAACATTATGCATGCTTAATTGACCTCCTGGGTCGAGCTGGGTGTTTTATTGAGTTGGTAAATGAGCTAGAGAAGATGTCCTGTAAACCTGATGATCGGGTATGGAATGCATTACTCGGAGTCTGTAGGATACATGGTAATATAGAGCTTGGAAGAAAAGTGGCGGAGCATGTAAT
CGAGCTGGAGCCTCAATCTTCTGCAGCTTATGTGTCTCTTGCAAGTTTATATGCTTTTCTTGGGAAATGGGAGTCAGTAGAAAAGGTCAGGGAACTAATGGAAGAGAGATTTGTTAGGAAAGAACGTGCAATAAGTTGGATTGACATTGGAAATAAGGTACATTCTTTCATTGCATCTGATAGATTACATCCACTGAAAGAAGAAATATACTCGCTATTGGAGCAGTTAGCCAGCCATACAGAAGATTTTTCAATCATTTGAAAAGAAGTTAAGGTTATTGCTGTGCAGGTTGTTTTCAAGGAGACCTGTTGCAAAGTCTTACCTGTTAAGAATGGAGGTTGCTGATTAGTTCCAAAGAGAATGTGTGTTTTAAGTTGGGAAATTTCACACAAGTGATCTAATACATAGGATGATAAAATGCTTGATATTTACACCATGGCCAGATTTTTCTTTTTCTCTGGGTCAACATTTGCCAAAGCCGAACCAATGGGAATTACTCCTTGAGCATAATTTGTTTATCTCACTGTTATTCAATAATAATGGCTTAGAGTTAGTTTCTCAATTCTACTTTTTGTTTGTACAATTTGTGTTGTAAATTTTTGTCATTGTAGATTTAGCCAAGATTTCCATGATGCTTTTAAAGGGTCGCCAAGGGAGTGACTTTAGAAAGCTAACTCTTAGGCCGAGTTCAATTAACTCCTGTTTGATAATTATCTTTGTTTCTTGTTTCTTGAAAAAGAATAAAAAATTTAAATGTGGTTGATAA

Coding sequence (CDS)

ATGCCCTCTTTGTCTTCCCAAGCCCTCAAGAAACCCACTTTGTTTCGCCCCAAATCCGAGCAACGCCCAGATTCGACTCGCCTCTGTATCGTTCAGTCCCTTCTCAACCTCTCTTCTCAAGGTCACCTCCCGGAAGCCCTCTCCTACCTCGACCCATTGGCCCAAAGAGGCATACGCTTACCCACTAGCACTTTCGTCAACCTCTTGCGACTCTGTGCCAAAGCCAAGTTCTTTAAAGGAGGTAGATGCGTTCATCTACATTTGAAACACACGGGTTTCAAACGCCCTACCACTATTGTAGCCAACCACTTGATTGGTATGTACTTTGAATGTGGTAATGACATAGAGGCACGTAAGGTGTTTGATAAAATGTCTTTAAGAAATTTGTACTCTTGGAACCATATGCTTGCTGGGTATGCTAAGTTGGGAAATGTACATCATGCTAGGAATTTGTTTGATAGAATGACGGAGAAGGATGTTGTTTCGTGGAATACTATGGTTCTTGCTTATGCTAAGAAGGGGTATTTCAATGAAGCTATTGGGTTACATAGAGACTTCAGGAGACTCGATATGGGGTTTAATGAGTTTAGTTTTGCTGGTGTTTTGATTCTTTGCGTGAAGCTTAAGGAATTGCAGCTCACGAAGCAAGTTCACGGGCAGGTACTGGTTGTTGGATTCTTGTCTAATGTAGTTCTTTCTAGTTCAATTGTTGATGCGTATGCAAAATGTGGAGAGATGGGTTGTGCAAGGAGATTGTTTGACGAAATGCTCGTCAAAGATATCCTTGCATGGACCACAATGGTCTCTGGATATGCTAAATGGGGTGATATGAATTCTGCTAGCGAATTGTTTCACCAAATGCCTGAAAAGAATCCTGTCTCCTGGACAGCTCTGATATCAGGCTATGCAAGAAATAGTTTAGGGCATGAAGCACTTGATTACTTCACAAAAATGATGAAGTTTCGAATTAATCCTGACCAATATACATTCAGTAGTTGTCTCTGCGCTTGTGCCAGCATTGCTGCACTGAAGCATGGTAAACAATTACATGCCTATTTGATCAGAACCAACTTCAGATGCAACACAATAGTCGTTAGCTCTCTCATTGACATGTATTCGAAGTGTGGCATGTTAGAAGCTAGCTGCCGCATTTTTTACCTTATGGGAAATAAGCAGGATGTTGTCTTGTGGAATACAATGATATCTGCCCTAGCTCAGCATGGTCATGGGGAAGAGGCAATGCAGATGTTCAATGACATGGTTGAATCAGGATTGAAGCCTGATAGGATCACCTTCATTGTGATCCTTAGTGCGTGTAGTCATTCAGGTCTTGTGCAAGAAGGACTTCGGTTTTTCAAGGCCATGACCTATGATCATGGTGTTCTCCCAGATCAAGAACATTATGCATGCTTAATTGACCTCCTGGGTCGAGCTGGGTGTTTTATTGAGTTGGTAAATGAGCTAGAGAAGATGTCCTGTAAACCTGATGATCGGGTATGGAATGCATTACTCGGAGTCTGTAGGATACATGGTAATATAGAGCTTGGAAGAAAAGTGGCGGAGCATGTAATCGAGCTGGAGCCTCAATCTTCTGCAGCTTATGTGTCTCTTGCAAGTTTATATGCTTTTCTTGGGAAATGGGAGTCAGTAGAAAAGGTCAGGGAACTAATGGAAGAGAGATTTGTTAGGAAAGAACGTGCAATAAGTTGGATTGACATTGGAAATAAGGTACATTCTTTCATTGCATCTGATAGATTACATCCACTGAAAGAAGAAATATACTCGCTATTGGAGCAGTTAGCCAGCCATACAGAAGATTTTTCAATCATTTGA

Protein sequence

MPSLSSQALKKPTLFRPKSEQRPDSTRLCIVQSLLNLSSQGHLPEALSYLDPLAQRGIRLPTSTFVNLLRLCAKAKFFKGGRCVHLHLKHTGFKRPTTIVANHLIGMYFECGNDIEARKVFDKMSLRNLYSWNHMLAGYAKLGNVHHARNLFDRMTEKDVVSWNTMVLAYAKKGYFNEAIGLHRDFRRLDMGFNEFSFAGVLILCVKLKELQLTKQVHGQVLVVGFLSNVVLSSSIVDAYAKCGEMGCARRLFDEMLVKDILAWTTMVSGYAKWGDMNSASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMKFRINPDQYTFSSCLCACASIAALKHGKQLHAYLIRTNFRCNTIVVSSLIDMYSKCGMLEASCRIFYLMGNKQDVVLWNTMISALAQHGHGEEAMQMFNDMVESGLKPDRITFIVILSACSHSGLVQEGLRFFKAMTYDHGVLPDQEHYACLIDLLGRAGCFIELVNELEKMSCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERAISWIDIGNKVHSFIASDRLHPLKEEIYSLLEQLASHTEDFSII
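
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS codon by codon. The sketch below is illustrative only: the names `CODON_TABLE` and `translate` are hypothetical, and it assumes the standard nuclear genetic code with the CDS starting in frame at its first base. Only the first and last codons of the CDS are used here to keep the example short.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)

# First ten codons of the CDS listed above.
print(translate("ATGCCCTCTTTGTCTTCCCAAGCCCTCAAG"))  # MPSLSSQALK
# Last seven codons of the CDS, ending in the TGA stop codon.
print(translate("GAAGATTTTTCAATCATTTGA"))           # EDFSII
```

Both fragments agree with the start (MPSLSSQALK) and end (EDFSII) of the protein sequence shown above.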
Homology
BLAST of Clc09G17420 vs. NCBI nr
Match: XP_038886822.1 (pentatricopeptide repeat-containing protein At2g21090 [Benincasa hispida])

HSP 1 Score: 1197.6 bits (3097), Expect = 0.0e+00
Identity = 583/611 (95.42%), Positives = 595/611 (97.38%), Query Frame = 0

Query: 1   MPSLSSQALKKPTLFRPKSEQRPDSTRLCIVQSLLNLSSQGHLPEALSYLDPLAQRGIRL 60
           MPS SSQ LKKP LFRPKS+QRPDSTRLCIVQSLLNLSSQGHLPEALSYLDPLAQRGIRL
Sbjct: 1   MPSFSSQVLKKPALFRPKSKQRPDSTRLCIVQSLLNLSSQGHLPEALSYLDPLAQRGIRL 60

Query: 61  PTSTFVNLLRLCAKAKFFKGGRCVHLHLKHTGFKRPTTIVANHLIGMYFECGNDIEARKV 120
           PTSTFVNLLRLC KAKF KGG+CVHLHLKHTGFKRPTTIVANHLIGMYFECGNDIEARKV
Sbjct: 61  PTSTFVNLLRLCGKAKFLKGGKCVHLHLKHTGFKRPTTIVANHLIGMYFECGNDIEARKV 120

Query: 121 FDKMSLRNLYSWNHMLAGYAKLGNVHHARNLFDRMTEKDVVSWNTMVLAYAKKGYFNEAI 180
           FDKMS+RNLYSWNHMLAGYAKLGNVHHAR LFDRM EKDVVSWNTMVLAYAKKGYFNEAI
Sbjct: 121 FDKMSVRNLYSWNHMLAGYAKLGNVHHARKLFDRMMEKDVVSWNTMVLAYAKKGYFNEAI 180

Query: 181 GLHRDFRRLDMGFNEFSFAGVLILCVKLKELQLTKQVHGQVLVVGFLSNVVLSSSIVDAY 240
           GL+RDFRR D+GFNEFSFAGVLILCVKLKELQL KQVHGQ+LVVGFLSNVVLSSSIVDAY
Sbjct: 181 GLYRDFRRFDIGFNEFSFAGVLILCVKLKELQLAKQVHGQILVVGFLSNVVLSSSIVDAY 240

Query: 241 AKCGEMGCARRLFDEMLVKDILAWTTMVSGYAKWGDMNSASELFHQMPEKNPVSWTALIS 300
           AKCGEMGCARRLFDEMLVKDILAWTTMVSGYAKWGDMN ASELFHQMPEKNPVSWTALIS
Sbjct: 241 AKCGEMGCARRLFDEMLVKDILAWTTMVSGYAKWGDMNLASELFHQMPEKNPVSWTALIS 300

Query: 301 GYARNSLGHEALDYFTKMMKFRINPDQYTFSSCLCACASIAALKHGKQLHAYLIRTNFRC 360
           GYARNSLGHEALDYFTKMMKFRINPDQYTFSSCLCACASIAALKHGKQ+HA+LIRTNFRC
Sbjct: 301 GYARNSLGHEALDYFTKMMKFRINPDQYTFSSCLCACASIAALKHGKQVHAHLIRTNFRC 360

Query: 361 NTIVVSSLIDMYSKCGMLEASCRIFYLMGNKQDVVLWNTMISALAQHGHGEEAMQMFNDM 420
           NTIVVSS+IDMYSKCGMLEASC +FYLMGNKQDVVLWNTMISALAQHGHGE+AMQMFNDM
Sbjct: 361 NTIVVSSMIDMYSKCGMLEASCHVFYLMGNKQDVVLWNTMISALAQHGHGEKAMQMFNDM 420

Query: 421 VESGLKPDRITFIVILSACSHSGLVQEGLRFFKAMTYDHGVLPDQEHYACLIDLLGRAGC 480
           VESGLKPDRITFIVILSACSHSGLVQEGLRFFKAMTYDHGVLPDQEHYACLIDLLGRAGC
Sbjct: 421 VESGLKPDRITFIVILSACSHSGLVQEGLRFFKAMTYDHGVLPDQEHYACLIDLLGRAGC 480

Query: 481 FIELVNELEKMSCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASL 540
           FIELVNELEKMSCKPDDRVWNALLGVCRIHGNIELGRKVAEHVI LEPQSSAAYVSLASL
Sbjct: 481 FIELVNELEKMSCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIGLEPQSSAAYVSLASL 540

Query: 541 YAFLGKWESVEKVRELMEERFVRKERAISWIDIGNKVHSFIASDRLHPLKEEIYSLLEQL 600
           YA LGKWESVEKVRELMEERFVRKERAISWI IGNKVHSFIASDRLHPLKEEIYS+LEQL
Sbjct: 541 YALLGKWESVEKVRELMEERFVRKERAISWIGIGNKVHSFIASDRLHPLKEEIYSILEQL 600

Query: 601 ASHT-EDFSII 611
           ASHT ED SII
Sbjct: 601 ASHTEEDLSII 611
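
The percentages in the HSP summary line are simply the match counts divided by the alignment length. As a hedged illustration (the helper `parse_hsp_stats` is hypothetical and not part of any BLAST tool), the figures reported for this first hit can be recomputed as follows:

```python
import re

def parse_hsp_stats(line):
    """Extract 'Label = matches/length' pairs from a BLAST HSP summary line.

    Returns {label: (matches, length, percent_string)} where the percentage
    is recomputed from the counts and formatted to two decimals.
    """
    stats = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        matches, length = int(matches), int(length)
        stats[label] = (matches, length, f"{100 * matches / length:.2f}")
    return stats

line = "Identity = 583/611 (95.42%), Positives = 595/611 (97.38%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(stats["Identity"])   # (583, 611, '95.42')
print(stats["Positives"])  # (595, 611, '97.38')
```

The recomputed values match the percentages printed in the summary line for the Benincasa hispida hit.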

BLAST of Clc09G17420 vs. NCBI nr
Match: XP_008455202.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g21090 [Cucumis melo] >KAA0031479.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK06932.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1140.6 bits (2949), Expect = 0.0e+00
Identity = 549/606 (90.59%), Positives = 577/606 (95.21%), Query Frame = 0

Query: 1   MPSLSSQALKKPTLFRPKSEQRPDSTRLCIVQSLLNLSSQGHLPEALSYLDPLAQRGIRL 60
           MPS SSQALKKP  FRPK EQ PDSTR+CI QSLL+LSSQG LPEALSYLD LAQRGIRL
Sbjct: 1   MPSFSSQALKKPASFRPKCEQSPDSTRICIAQSLLDLSSQGRLPEALSYLDRLAQRGIRL 60

Query: 61  PTSTFVNLLRLCAKAKFFKGGRCVHLHLKHTGFKRPTTIVANHLIGMYFECGNDIEARKV 120
           PTSTFV+LLRLCAKAK+FKGG+CVHLHLKHTGFKRPTTIVANHLIGMYFECG D+EARKV
Sbjct: 61  PTSTFVDLLRLCAKAKYFKGGKCVHLHLKHTGFKRPTTIVANHLIGMYFECGRDVEARKV 120

Query: 121 FDKMSLRNLYSWNHMLAGYAKLGNVHHARNLFDRMTEKDVVSWNTMVLAYAKKGYFNEAI 180
           FDKMS+RNLYSWNHMLAGYAKLG VHHAR LFDRM EKDVVSWNTMVLAYAKKG FNEAI
Sbjct: 121 FDKMSVRNLYSWNHMLAGYAKLGEVHHARKLFDRMMEKDVVSWNTMVLAYAKKGCFNEAI 180

Query: 181 GLHRDFRRLDMGFNEFSFAGVLILCVKLKELQLTKQVHGQVLVVGFLSNVVLSSSIVDAY 240
           GL+RDFRRLDMGFN FSF+GVLILCVKLKELQLTKQVHGQVLV GFLSN+VLS SIVDAY
Sbjct: 181 GLYRDFRRLDMGFNAFSFSGVLILCVKLKELQLTKQVHGQVLVAGFLSNLVLSCSIVDAY 240

Query: 241 AKCGEMGCARRLFDEMLVKDILAWTTMVSGYAKWGDMNSASELFHQMPEKNPVSWTALIS 300
           AKCG+MGCARRLFDEMLVKDI  WTTMVSGYAKWGDMNSASELFHQMPEKNPVSWTALIS
Sbjct: 241 AKCGKMGCARRLFDEMLVKDIHIWTTMVSGYAKWGDMNSASELFHQMPEKNPVSWTALIS 300

Query: 301 GYARNSLGHEALDYFTKMMKFRINPDQYTFSSCLCACASIAALKHGKQLHAYLIRTNFRC 360
           GYARNSLGHEALDYFTKMMK  INP+QYTFSSCLCACASIAALKHGKQ+H YLIRTNFRC
Sbjct: 301 GYARNSLGHEALDYFTKMMKLGINPEQYTFSSCLCACASIAALKHGKQVHGYLIRTNFRC 360

Query: 361 NTIVVSSLIDMYSKCGMLEASCRIFYLMGNKQDVVLWNTMISALAQHGHGEEAMQMFNDM 420
           NTIVVSSLIDMYSKCGMLEASC +F+LMGNKQDVV+WNTMIS LAQ+GHGE+AMQMFN M
Sbjct: 361 NTIVVSSLIDMYSKCGMLEASCYVFHLMGNKQDVVVWNTMISGLAQNGHGEKAMQMFNHM 420

Query: 421 VESGLKPDRITFIVILSACSHSGLVQEGLRFFKAMTYDHGVLPDQEHYACLIDLLGRAGC 480
           VESG+KPDRITFIVILSACSHSGLVQEGL+FFKAMTYDHGVLPDQEHYACLIDLLGRAGC
Sbjct: 421 VESGVKPDRITFIVILSACSHSGLVQEGLQFFKAMTYDHGVLPDQEHYACLIDLLGRAGC 480

Query: 481 FIELVNELEKMSCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASL 540
           FIELVNELE MSCKPDDRVW+ALLGVCRIH NIELGRKVAEHVIEL+PQSSAAYVSLA L
Sbjct: 481 FIELVNELENMSCKPDDRVWSALLGVCRIHNNIELGRKVAEHVIELKPQSSAAYVSLAGL 540

Query: 541 YAFLGKWESVEKVRELMEERFVRKERAISWIDIGNKVHSFIASDRLHPLKEEIYSLLEQL 600
           YAFLGKWESVEKVRELM+E+F+RKERAISWID+GNK+HSFIASDRLHPLKEEIY LLEQL
Sbjct: 541 YAFLGKWESVEKVRELMDEKFIRKERAISWIDVGNKIHSFIASDRLHPLKEEIYVLLEQL 600

Query: 601 ASHTED 607
           A HTE+
Sbjct: 601 ARHTEE 606

BLAST of Clc09G17420 vs. NCBI nr
Match: KAG7011402.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1131.3 bits (2925), Expect = 0.0e+00
Identity = 550/611 (90.02%), Positives = 579/611 (94.76%), Query Frame = 0

Query: 1   MPSLSSQALKKPTLFRPKSEQRPDSTRLCIVQSLLNLSSQGHLPEALSYLDPLAQRGIRL 60
           MPS SSQALKKP +FRPKS+Q PDSTR CIVQSLLN SSQGHLPEALSYLDPL QRGIRL
Sbjct: 1   MPSFSSQALKKPAVFRPKSKQLPDSTRPCIVQSLLNHSSQGHLPEALSYLDPLVQRGIRL 60

Query: 61  PTSTFVNLLRLCAKAKFFKGGRCVHLHLKHTGFKRPTTIVANHLIGMYFECGNDIEARKV 120
           PTS FV+LLRLCAKAK  KGG+ VHLHLK TGFKRPTTIVANHLIGMYF+CG+D EARKV
Sbjct: 61  PTSVFVHLLRLCAKAKSLKGGKSVHLHLKLTGFKRPTTIVANHLIGMYFQCGSDTEARKV 120

Query: 121 FDKMSLRNLYSWNHMLAGYAKLGNVHHARNLFDRMTEKDVVSWNTMVLAYAKKGYFNEAI 180
           FDKMS+RNLYSWNHMLAGYAKLGNV+ AR LFD M EKDV+SWNTMVLAYAKKG FNEAI
Sbjct: 121 FDKMSVRNLYSWNHMLAGYAKLGNVYQARKLFDTMIEKDVISWNTMVLAYAKKGCFNEAI 180

Query: 181 GLHRDFRRLDMGFNEFSFAGVLILCVKLKELQLTKQVHGQVLVVGFLSNVVLSSSIVDAY 240
           GL+RDFRR DMGFNEFSFAG+LILCVKLKELQL KQVH QVLVVGFLSN+VLSSSIVDAY
Sbjct: 181 GLYRDFRRQDMGFNEFSFAGLLILCVKLKELQLAKQVHTQVLVVGFLSNIVLSSSIVDAY 240

Query: 241 AKCGEMGCARRLFDEMLVKDILAWTTMVSGYAKWGDMNSASELFHQMPEKNPVSWTALIS 300
           AKCGEM CA+RLFDEM VKDILAWTTMVSGYAKWGDMN AS LFHQMPEKNPVSWTALIS
Sbjct: 241 AKCGEMECAKRLFDEMPVKDILAWTTMVSGYAKWGDMNLASGLFHQMPEKNPVSWTALIS 300

Query: 301 GYARNSLGHEALDYFTKMMKFRINPDQYTFSSCLCACASIAALKHGKQLHAYLIRTNFRC 360
           GYARNSLGHEALDYFT+MMKFR+NPDQ+TFSSCLCACASIAALKHGKQ+HAYLIRTNFRC
Sbjct: 301 GYARNSLGHEALDYFTQMMKFRVNPDQFTFSSCLCACASIAALKHGKQVHAYLIRTNFRC 360

Query: 361 NTIVVSSLIDMYSKCGMLEASCRIFYLMGNKQDVVLWNTMISALAQHGHGEEAMQMFNDM 420
           NTIVVSSLIDMYSKCGMLEA+C +FYL+GNKQDVVLWNTMISALAQHGHGE+AMQMFNDM
Sbjct: 361 NTIVVSSLIDMYSKCGMLEAACSVFYLLGNKQDVVLWNTMISALAQHGHGEKAMQMFNDM 420

Query: 421 VESGLKPDRITFIVILSACSHSGLVQEGLRFFKAMTYDHGVLPDQEHYACLIDLLGRAGC 480
           VESGL PDRITFIVILSACSHSGLVQEGL+FFKAM+YDHG+LPDQEHYACLIDLLGRAGC
Sbjct: 421 VESGLNPDRITFIVILSACSHSGLVQEGLQFFKAMSYDHGILPDQEHYACLIDLLGRAGC 480

Query: 481 FIELVNELEKMSCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASL 540
           F ELVNELEKM CKPDDR+WNALLGVCRIHGNIELGRKVAEHVIELEP+SSAAYVSLASL
Sbjct: 481 FTELVNELEKMPCKPDDRIWNALLGVCRIHGNIELGRKVAEHVIELEPRSSAAYVSLASL 540

Query: 541 YAFLGKWESVEKVRELMEERFVRKERAISWIDIGNKVHSFIASDRLHPLKEEIYSLLEQL 600
           YAFLGKWESVE+VRE+MEER VRKERAISWIDI NKVHSFIASDR HPLKEEIYSLLEQL
Sbjct: 541 YAFLGKWESVEQVREVMEERLVRKERAISWIDIENKVHSFIASDRFHPLKEEIYSLLEQL 600

Query: 601 ASHT-EDFSII 611
           ASHT EDFSI+
Sbjct: 601 ASHTEEDFSIV 611

BLAST of Clc09G17420 vs. NCBI nr
Match: XP_022967585.1 (pentatricopeptide repeat-containing protein At2g21090-like [Cucurbita maxima])

HSP 1 Score: 1128.2 bits (2917), Expect = 0.0e+00
Identity = 548/611 (89.69%), Positives = 578/611 (94.60%), Query Frame = 0

Query: 1   MPSLSSQALKKPTLFRPKSEQRPDSTRLCIVQSLLNLSSQGHLPEALSYLDPLAQRGIRL 60
           MPS SSQALKKP +FRPKS+  PDS+R CIVQSLLN SSQGHLPEALSYLDPL QRGIRL
Sbjct: 1   MPSFSSQALKKPAVFRPKSKHLPDSSRPCIVQSLLNHSSQGHLPEALSYLDPLVQRGIRL 60

Query: 61  PTSTFVNLLRLCAKAKFFKGGRCVHLHLKHTGFKRPTTIVANHLIGMYFECGNDIEARKV 120
           PTS FV+LLRLCAKAK  KGG+ VHLHLK TGFKRPTTI+ANHLIGMYF+CG+DIEARKV
Sbjct: 61  PTSVFVHLLRLCAKAKSLKGGKSVHLHLKLTGFKRPTTIIANHLIGMYFQCGSDIEARKV 120

Query: 121 FDKMSLRNLYSWNHMLAGYAKLGNVHHARNLFDRMTEKDVVSWNTMVLAYAKKGYFNEAI 180
           FDKMS+RNLYSWNHMLAGYAKLGNV+ AR +FD M EKDV+SWNTMVLAYAKKG FNEAI
Sbjct: 121 FDKMSVRNLYSWNHMLAGYAKLGNVYQARKVFDTMIEKDVISWNTMVLAYAKKGCFNEAI 180

Query: 181 GLHRDFRRLDMGFNEFSFAGVLILCVKLKELQLTKQVHGQVLVVGFLSNVVLSSSIVDAY 240
           G +RDFRR DMGFNEFSFAGVLILCVKLKELQL KQVH QVLVVGFLSN+VLSSSIVDAY
Sbjct: 181 GFYRDFRRQDMGFNEFSFAGVLILCVKLKELQLAKQVHTQVLVVGFLSNIVLSSSIVDAY 240

Query: 241 AKCGEMGCARRLFDEMLVKDILAWTTMVSGYAKWGDMNSASELFHQMPEKNPVSWTALIS 300
           AKCGEM CA+RLFDEM VKDILAWTTMVSGYAKWGDMN AS LFHQMPEKNPVSWTALIS
Sbjct: 241 AKCGEMECAKRLFDEMPVKDILAWTTMVSGYAKWGDMNLASGLFHQMPEKNPVSWTALIS 300

Query: 301 GYARNSLGHEALDYFTKMMKFRINPDQYTFSSCLCACASIAALKHGKQLHAYLIRTNFRC 360
           GYARNSLGHEALDYFT+MMKFR+NPDQ+TFSSCLCACASIAALKHGKQ+HAYLIRTNFRC
Sbjct: 301 GYARNSLGHEALDYFTQMMKFRVNPDQFTFSSCLCACASIAALKHGKQVHAYLIRTNFRC 360

Query: 361 NTIVVSSLIDMYSKCGMLEASCRIFYLMGNKQDVVLWNTMISALAQHGHGEEAMQMFNDM 420
           NTIVVSSLIDMYSKCGMLEA+CR+FYL+GNKQDVVLWNTMISALAQHGHGE+AMQMFNDM
Sbjct: 361 NTIVVSSLIDMYSKCGMLEAACRVFYLLGNKQDVVLWNTMISALAQHGHGEKAMQMFNDM 420

Query: 421 VESGLKPDRITFIVILSACSHSGLVQEGLRFFKAMTYDHGVLPDQEHYACLIDLLGRAGC 480
           VESGLKPDRITFIVILSACSHSGLV EGL+FFKAM+YDH VLPDQEHYACLIDLLGRAGC
Sbjct: 421 VESGLKPDRITFIVILSACSHSGLVHEGLQFFKAMSYDHSVLPDQEHYACLIDLLGRAGC 480

Query: 481 FIELVNELEKMSCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASL 540
           F ELVNELEKM CKPDDR+WNALLGVCRIHGNIELGRKVAEHVIELEP+SSAAYVSLASL
Sbjct: 481 FTELVNELEKMPCKPDDRIWNALLGVCRIHGNIELGRKVAEHVIELEPRSSAAYVSLASL 540

Query: 541 YAFLGKWESVEKVRELMEERFVRKERAISWIDIGNKVHSFIASDRLHPLKEEIYSLLEQL 600
           YAFLGKWESVE+VRE+MEER VRKERAISWIDI NKVHSFIASDR HPLKEEIYSLLEQL
Sbjct: 541 YAFLGKWESVEQVREVMEERLVRKERAISWIDIENKVHSFIASDRFHPLKEEIYSLLEQL 600

Query: 601 ASHT-EDFSII 611
           ASHT EDFSI+
Sbjct: 601 ASHTEEDFSIV 611

BLAST of Clc09G17420 vs. NCBI nr
Match: XP_022963954.1 (pentatricopeptide repeat-containing protein At2g21090 [Cucurbita moschata])

HSP 1 Score: 1125.5 bits (2910), Expect = 0.0e+00
Identity = 547/611 (89.53%), Positives = 578/611 (94.60%), Query Frame = 0

Query: 1   MPSLSSQALKKPTLFRPKSEQRPDSTRLCIVQSLLNLSSQGHLPEALSYLDPLAQRGIRL 60
           MPS SSQALKKP +FRPKS+Q PDSTR CIVQSLLN SSQG+LPEALS+LDPL QRGIRL
Sbjct: 1   MPSFSSQALKKPAVFRPKSKQLPDSTRPCIVQSLLNHSSQGNLPEALSFLDPLVQRGIRL 60

Query: 61  PTSTFVNLLRLCAKAKFFKGGRCVHLHLKHTGFKRPTTIVANHLIGMYFECGNDIEARKV 120
           PTS FV+LLRLCAKAK  KGG+ VHLHLK TGFKRPTTIVANHLIGMYF+CG+D EARKV
Sbjct: 61  PTSVFVHLLRLCAKAKSLKGGKSVHLHLKLTGFKRPTTIVANHLIGMYFQCGSDTEARKV 120

Query: 121 FDKMSLRNLYSWNHMLAGYAKLGNVHHARNLFDRMTEKDVVSWNTMVLAYAKKGYFNEAI 180
           FDKMS+RNLYSWNHMLAGYAKLGNV+ AR LFD M EKDV+SWNTMVLAYAKKG FNEAI
Sbjct: 121 FDKMSVRNLYSWNHMLAGYAKLGNVYQARKLFDTMIEKDVISWNTMVLAYAKKGCFNEAI 180

Query: 181 GLHRDFRRLDMGFNEFSFAGVLILCVKLKELQLTKQVHGQVLVVGFLSNVVLSSSIVDAY 240
           GL+RDFRR DMGFNEFSFAG+LILCVKLKELQL KQVH QVLVVGFLSN+VLSSSIVDAY
Sbjct: 181 GLYRDFRRQDMGFNEFSFAGLLILCVKLKELQLAKQVHTQVLVVGFLSNIVLSSSIVDAY 240

Query: 241 AKCGEMGCARRLFDEMLVKDILAWTTMVSGYAKWGDMNSASELFHQMPEKNPVSWTALIS 300
           AKCGEM CA+RLFDEM VKDILAWTTMVSGYAKWGDMN AS LFHQMPEKNPVSWTALIS
Sbjct: 241 AKCGEMECAKRLFDEMPVKDILAWTTMVSGYAKWGDMNLASGLFHQMPEKNPVSWTALIS 300

Query: 301 GYARNSLGHEALDYFTKMMKFRINPDQYTFSSCLCACASIAALKHGKQLHAYLIRTNFRC 360
           GYARNSLGHEALDYFT+MMKFR+NPDQ+TFSSCLCACASIAALKHGKQ+HAYLIRTNFRC
Sbjct: 301 GYARNSLGHEALDYFTQMMKFRVNPDQFTFSSCLCACASIAALKHGKQVHAYLIRTNFRC 360

Query: 361 NTIVVSSLIDMYSKCGMLEASCRIFYLMGNKQDVVLWNTMISALAQHGHGEEAMQMFNDM 420
           NTIVVSSLIDMYSKCGMLEA+C +FYL+GNKQDVVLWNTMISALAQHGHGE+AMQMFNDM
Sbjct: 361 NTIVVSSLIDMYSKCGMLEAACSVFYLLGNKQDVVLWNTMISALAQHGHGEKAMQMFNDM 420

Query: 421 VESGLKPDRITFIVILSACSHSGLVQEGLRFFKAMTYDHGVLPDQEHYACLIDLLGRAGC 480
           VESGL PDRITFIVILSACSHSGLVQEGL+FFKAM+YDHG+LPDQEHYACLIDLLGRAGC
Sbjct: 421 VESGLNPDRITFIVILSACSHSGLVQEGLQFFKAMSYDHGILPDQEHYACLIDLLGRAGC 480

Query: 481 FIELVNELEKMSCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASL 540
           F ELVNELEKM CKPDDR+WN LLGVCRIHGNIELGRKVAEHVIELEP+SSAAYVSLASL
Sbjct: 481 FTELVNELEKMPCKPDDRIWNTLLGVCRIHGNIELGRKVAEHVIELEPRSSAAYVSLASL 540

Query: 541 YAFLGKWESVEKVRELMEERFVRKERAISWIDIGNKVHSFIASDRLHPLKEEIYSLLEQL 600
           YAFLGKWESVE+VRE+MEER VRKERAISWIDI NKVHSFIASDR HPLKEEIYSLLEQL
Sbjct: 541 YAFLGKWESVEQVREVMEERLVRKERAISWIDIENKVHSFIASDRFHPLKEEIYSLLEQL 600

Query: 601 ASHT-EDFSII 611
           ASHT EDFSI+
Sbjct: 601 ASHTEEDFSIV 611

BLAST of Clc09G17420 vs. ExPASy Swiss-Prot
Match: Q9SKQ4 (Pentatricopeptide repeat-containing protein At2g21090 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E48 PE=2 SV=1)

HSP 1 Score: 729.6 bits (1882), Expect = 3.0e-209
Identity = 346/587 (58.94%), Positives = 451/587 (76.83%), Query Frame = 0

Query: 23  PDSTRLCIVQSLLNL-SSQGHLPEALSYLDPLAQRGIRLPTSTFVNLLRLCAKAKFFKGG 82
           P    +C+ QS L+  +++  L +A+S L+ L Q+GIRLP     +LL+ C   K  K G
Sbjct: 6   PRKRPICVAQSFLSKHATKAELSQAVSRLESLTQQGIRLPFDLLASLLQQCGDTKSLKQG 65

Query: 83  RCVHLHLKHTGFKRPTTIVANHLIGMYFECGNDIEARKVFDKMSLRNLYSWNHMLAGYAK 142
           + +H HLK TGFKRP T+++NHLIGMY +CG  I+A KVFD+M LRNLYSWN+M++GY K
Sbjct: 66  KWIHRHLKITGFKRPNTLLSNHLIGMYMKCGKPIDACKVFDQMHLRNLYSWNNMVSGYVK 125

Query: 143 LGNVHHARNLFDRMTEKDVVSWNTMVLAYAKKGYFNEAIGLHRDFRRLDMGFNEFSFAGV 202
            G +  AR +FD M E+DVVSWNTMV+ YA+ G  +EA+  +++FRR  + FNEFSFAG+
Sbjct: 126 SGMLVRARVVFDSMPERDVVSWNTMVIGYAQDGNLHEALWFYKEFRRSGIKFNEFSFAGL 185

Query: 203 LILCVKLKELQLTKQVHGQVLVVGFLSNVVLSSSIVDAYAKCGEMGCARRLFDEMLVKDI 262
           L  CVK ++LQL +Q HGQVLV GFLSNVVLS SI+DAYAKCG+M  A+R FDEM VKDI
Sbjct: 186 LTACVKSRQLQLNRQAHGQVLVAGFLSNVVLSCSIIDAYAKCGQMESAKRCFDEMTVKDI 245

Query: 263 LAWTTMVSGYAKWGDMNSASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMKF 322
             WTT++SGYAK GDM +A +LF +MPEKNPVSWTALI+GY R   G+ ALD F KM+  
Sbjct: 246 HIWTTLISGYAKLGDMEAAEKLFCEMPEKNPVSWTALIAGYVRQGSGNRALDLFRKMIAL 305

Query: 323 RINPDQYTFSSCLCACASIAALKHGKQLHAYLIRTNFRCNTIVVSSLIDMYSKCGMLEAS 382
            + P+Q+TFSSCLCA ASIA+L+HGK++H Y+IRTN R N IV+SSLIDMYSK G LEAS
Sbjct: 306 GVKPEQFTFSSCLCASASIASLRHGKEIHGYMIRTNVRPNAIVISSLIDMYSKSGSLEAS 365

Query: 383 CRIFYLMGNKQDVVLWNTMISALAQHGHGEEAMQMFNDMVESGLKPDRITFIVILSACSH 442
            R+F +  +K D V WNTMISALAQHG G +A++M +DM++  ++P+R T +VIL+ACSH
Sbjct: 366 ERVFRICDDKHDCVFWNTMISALAQHGLGHKALRMLDDMIKFRVQPNRTTLVVILNACSH 425

Query: 443 SGLVQEGLRFFKAMTYDHGVLPDQEHYACLIDLLGRAGCFIELVNELEKMSCKPDDRVWN 502
           SGLV+EGLR+F++MT  HG++PDQEHYACLIDLLGRAGCF EL+ ++E+M  +PD  +WN
Sbjct: 426 SGLVEEGLRWFESMTVQHGIVPDQEHYACLIDLLGRAGCFKELMRKIEEMPFEPDKHIWN 485

Query: 503 ALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERF 562
           A+LGVCRIHGN ELG+K A+ +I+L+P+SSA Y+ L+S+YA  GKWE VEK+R +M++R 
Sbjct: 486 AILGVCRIHGNEELGKKAADELIKLDPESSAPYILLSSIYADHGKWELVEKLRGVMKKRR 545

Query: 563 VRKERAISWIDIGNKVHSFIASD--RLHPLKEEIYSLLEQLASHTED 607
           V KE+A+SWI+I  KV +F  SD    H  KEEIY +L  LA+  E+
Sbjct: 546 VNKEKAVSWIEIEKKVEAFTVSDGSHAHARKEEIYFILHNLAAVIEE 592

BLAST of Clc09G17420 vs. ExPASy Swiss-Prot
Match: Q9SIT7 (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 402.9 bits (1034), Expect = 6.4e-111
Identity = 230/662 (34.74%), Positives = 337/662 (50.91%), Query Frame = 0

Query: 48  SYLDPLAQRGIRLPTSTFVNLLRLCAKAKFFK-GGRCVHLHLKHTGFKRPTTIVANHLIG 107
           S+L   A       +S F  LL  C K+K      R VH  +  +GF      + N LI 
Sbjct: 5   SFLKLAADLSSFTDSSPFAKLLDSCIKSKLSAIYVRYVHASVIKSGFSN-EIFIQNRLID 64

Query: 108 MYFECGNDIEARKVFDKMSLRNLYSWNHMLAGYAKLGNVHHARNLFDRMTEKDVVSWNTM 167
            Y +CG+  + R+VFDKM  RN+Y+WN ++ G  KLG +  A +LF  M E+D  +WN+M
Sbjct: 65  AYSKCGSLEDGRQVFDKMPQRNIYTWNSVVTGLTKLGFLDEADSLFRSMPERDQCTWNSM 124

Query: 168 VLAYAKKGYFNEAIGLHRDFRRLDMGFNEFSFAGVLILCVKLKELQLTKQVHGQVLVVGF 227
           V  +A+     EA+       +     NE+SFA VL  C  L ++    QVH  +    F
Sbjct: 125 VSGFAQHDRCEEALCYFAMMHKEGFVLNEYSFASVLSACSGLNDMNKGVQVHSLIAKSPF 184

Query: 228 LSNVVLSSSIVDAYAKCGEMGCARRLFDEMLVKDILAW---------------------- 287
           LS+V + S++VD Y+KCG +  A+R+FDEM  +++++W                      
Sbjct: 185 LSDVYIGSALVDMYSKCGNVNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQM 244

Query: 288 ------------------------------------------------------------ 347
                                                                       
Sbjct: 245 MLESRVEPDEVTLASVISACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKCS 304

Query: 348 --------------------TTMVSGYAKWGDMNSASELFHQMPEKNPVSWTALISGYAR 407
                               T+M+SGYA      +A  +F +M E+N VSW ALI+GY +
Sbjct: 305 RIKEARFIFDSMPIRNVIAETSMISGYAMAASTKAARLMFTKMAERNVVSWNALIAGYTQ 364

Query: 408 NSLGHEALDYFTKMMKFRINPDQYTFSSCLCACASIAALKHGKQLHAYLIRTNFRCNT-- 467
           N    EAL  F  + +  + P  Y+F++ L ACA +A L  G Q H ++++  F+  +  
Sbjct: 365 NGENEEALSLFCLLKRESVCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFKFQSGE 424

Query: 468 ----IVVSSLIDMYSKCGMLEASCRIFYLMGNKQDVVLWNTMISALAQHGHGEEAMQMFN 527
                V +SLIDMY KCG +E    +F  M  ++D V WN MI   AQ+G+G EA+++F 
Sbjct: 425 EDDIFVGNSLIDMYVKCGCVEEGYLVFRKM-MERDCVSWNAMIIGFAQNGYGNEALELFR 484

Query: 528 DMVESGLKPDRITFIVILSACSHSGLVQEGLRFFKAMTYDHGVLPDQEHYACLIDLLGRA 587
           +M+ESG KPD IT I +LSAC H+G V+EG  +F +MT D GV P ++HY C++DLLGRA
Sbjct: 485 EMLESGEKPDHITMIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRA 544

Query: 588 GCFIELVNELEKMSCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLA 601
           G   E  + +E+M  +PD  +W +LL  C++H NI LG+ VAE ++E+EP +S  YV L+
Sbjct: 545 GFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPYVLLS 604

BLAST of Clc09G17420 vs. ExPASy Swiss-Prot
Match: O23169 (Pentatricopeptide repeat-containing protein At4g37170 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H5 PE=3 SV=1)

HSP 1 Score: 396.0 bits (1016), Expect = 7.9e-109
Identity = 207/565 (36.64%), Positives = 332/565 (58.76%), Query Frame = 0

Query: 37  LSSQGHLPEALSYLDPLAQRGIRLPTSTFVNLLRLCAKAKFFKGGRCVHLHLKHTGFKRP 96
           L  Q  L EA+  L     R  + P ST+ NL+++C++ +  + G+ VH H++ +GF  P
Sbjct: 64  LCGQKLLREAVQLLG----RAKKPPASTYCNLIQVCSQTRALEEGKKVHEHIRTSGFV-P 123

Query: 97  TTIVANHLIGMYFECGNDIEARKVFDKMSLRNLYSWNHMLAGYAKLGNVHHARNLFDRMT 156
             ++ N L+ MY +CG+ ++ARKVFD+M  R+L SWN M+ GYA++G +  AR LFD MT
Sbjct: 124 GIVIWNRLLRMYAKCGSLVDARKVFDEMPNRDLCSWNVMVNGYAEVGLLEEARKLFDEMT 183

Query: 157 EKDVVSWNTMVLAYAKKGYFNEAIGLHRDFRRL-DMGFNEFSFAGVLILCVKLKELQLTK 216
           EKD  SW  MV  Y KK    EA+ L+   +R+ +   N F+ +  +     +K ++  K
Sbjct: 184 EKDSYSWTAMVTGYVKKDQPEEALVLYSLMQRVPNSRPNIFTVSIAVAAAAAVKCIRRGK 243

Query: 217 QVHGQVLVVGFLSNVVLSSSIVDAYAKCGEMGCARRLFDEMLVKDILAWTTMVSGYAKWG 276
           ++HG ++  G  S+ VL SS++D Y KCG +  AR +FD+++ KD+              
Sbjct: 244 EIHGHIVRAGLDSDEVLWSSLMDMYGKCGCIDEARNIFDKIVEKDV-------------- 303

Query: 277 DMNSASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMKFRINPDQYTFSSCLC 336
                            VSWT++I  Y ++S   E    F++++     P++YTF+  L 
Sbjct: 304 -----------------VSWTSMIDRYFKSSRWREGFSLFSELVGSCERPNEYTFAGVLN 363

Query: 337 ACASIAALKHGKQLHAYLIRTNFRCNTIVVSSLIDMYSKCGMLEASCRIFYLMGNKQDVV 396
           ACA +   + GKQ+H Y+ R  F   +   SSL+DMY+KCG +E++  +      K D+V
Sbjct: 364 ACADLTTEELGKQVHGYMTRVGFDPYSFASSSLVDMYTKCGNIESAKHVVDGC-PKPDLV 423

Query: 397 LWNTMISALAQHGHGEEAMQMFNDMVESGLKPDRITFIVILSACSHSGLVQEGLRFFKAM 456
            W ++I   AQ+G  +EA++ F+ +++SG KPD +TF+ +LSAC+H+GLV++GL FF ++
Sbjct: 424 SWTSLIGGCAQNGQPDEALKYFDLLLKSGTKPDHVTFVNVLSACTHAGLVEKGLEFFYSI 483

Query: 457 TYDHGVLPDQEHYACLIDLLGRAGCFIELVNELEKMSCKPDDRVWNALLGVCRIHGNIEL 516
           T  H +    +HY CL+DLL R+G F +L + + +M  KP   +W ++LG C  +GNI+L
Sbjct: 484 TEKHRLSHTSDHYTCLVDLLARSGRFEQLKSVISEMPMKPSKFLWASVLGGCSTYGNIDL 543

Query: 517 GRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERAISWIDIGN 576
             + A+ + ++EP++   YV++A++YA  GKWE   K+R+ M+E  V K    SW +I  
Sbjct: 544 AEEAAQELFKIEPENPVTYVTMANIYAAAGKWEEEGKMRKRMQEIGVTKRPGSSWTEIKR 591

Query: 577 KVHSFIASDRLHPLKEEIYSLLEQL 601
           K H FIA+D  HP+  +I   L +L
Sbjct: 604 KRHVFIAADTSHPMYNQIVEFLREL 591

BLAST of Clc09G17420 vs. ExPASy Swiss-Prot
Match: Q9SY02 (Pentatricopeptide repeat-containing protein At4g02750 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H24 PE=3 SV=1)

HSP 1 Score: 383.3 bits (983), Expect = 5.3e-105
Identity = 214/613 (34.91%), Positives = 323/613 (52.69%), Query Frame = 0

Query: 73  AKAKFFKGGRCVHLHLKHTGFKRPTTIVANHLIGMYFECGNDIEARKVFDKMSLRNLYSW 132
           A + + + GRC           R +++  N +I  Y   G    ARK+FD+M  R+L SW
Sbjct: 70  AISSYMRTGRCNEALRVFKRMPRWSSVSYNGMISGYLRNGEFELARKLFDEMPERDLVSW 129

Query: 133 NHMLAGYAKLGNVHHARNLFDRMTEKDVVSWNTMVLAYAKKGYFNEAIGL-HRDFRRLDM 192
           N M+ GY +  N+  AR LF+ M E+DV SWNTM+  YA+ G  ++A  +  R   + D+
Sbjct: 130 NVMIKGYVRNRNLGKARELFEIMPERDVCSWNTMLSGYAQNGCVDDARSVFDRMPEKNDV 189

Query: 193 GFNEFSFAGVL-----ILCVKLKELQLTKQVHGQVLVVGFLS-----------------N 252
            +N    A V        C+  K  +    V    L+ GF+                  +
Sbjct: 190 SWNALLSAYVQNSKMEEACMLFKSRENWALVSWNCLLGGFVKKKKIVEARQFFDSMNVRD 249

Query: 253 VVLSSSIVDAYAKCGEMGCARRLFDEMLVKDILAWT------------------------ 312
           VV  ++I+  YA+ G++  AR+LFDE  V+D+  WT                        
Sbjct: 250 VVSWNTIITGYAQSGKIDEARQLFDESPVQDVFTWTAMVSGYIQNRMVEEARELFDKMPE 309

Query: 313 --------------------------------------TMVSGYAKWGDMNSASELFHQM 372
                                                 TM++GYA+ G ++ A  LF +M
Sbjct: 310 RNEVSWNAMLAGYVQGERMEMAKELFDVMPCRNVSTWNTMITGYAQCGKISEAKNLFDKM 369

Query: 373 PEKNPVSWTALISGYARNSLGHEALDYFTKMMKFRINPDQYTFSSCLCACASIAALKHGK 432
           P+++PVSW A+I+GY+++    EAL  F +M +     ++ +FSS L  CA + AL+ GK
Sbjct: 370 PKRDPVSWAAMIAGYSQSGHSFEALRLFVQMEREGGRLNRSSFSSALSTCADVVALELGK 429

Query: 433 QLHAYLIRTNFRCNTIVVSSLIDMYSKCGMLEASCRIFYLMGNKQDVVLWNTMISALAQH 492
           QLH  L++  +     V ++L+ MY KCG +E +  +F  M  K D+V WNTMI+  ++H
Sbjct: 430 QLHGRLVKGGYETGCFVGNALLLMYCKCGSIEEANDLFKEMAGK-DIVSWNTMIAGYSRH 489

Query: 493 GHGEEAMQMFNDMVESGLKPDRITFIVILSACSHSGLVQEGLRFFKAMTYDHGVLPDQEH 552
           G GE A++ F  M   GLKPD  T + +LSACSH+GLV +G ++F  MT D+GV+P+ +H
Sbjct: 490 GFGEVALRFFESMKREGLKPDDATMVAVLSACSHTGLVDKGRQYFYTMTQDYGVMPNSQH 549

Query: 553 YACLIDLLGRAGCFIELVNELEKMSCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELE 601
           YAC++DLLGRAG   +  N ++ M  +PD  +W  LLG  R+HGN EL    A+ +  +E
Sbjct: 550 YACMVDLLGRAGLLEDAHNLMKNMPFEPDAAIWGTLLGASRVHGNTELAETAADKIFAME 609

BLAST of Clc09G17420 vs. ExPASy Swiss-Prot
Match: Q9FRI5 (Pentatricopeptide repeat-containing protein At1g25360 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H74 PE=2 SV=1)

HSP 1 Score: 381.3 bits (978), Expect = 2.0e-104
Identity = 192/508 (37.80%), Positives = 301/508 (59.25%), Query Frame = 0

Query: 98  TIVANHLIGMYFECGND----IEARKVFDKMSLRNLYSWNHMLAGYAKLGNVHHARNLFD 157
           T V+N L+ +Y +C +       ARKVFD++  ++  SW  M+ GY K G       L +
Sbjct: 184 TSVSNALVSVYSKCASSPSLLHSARKVFDEILEKDERSWTTMMTGYVKNGYFDLGEELLE 243

Query: 158 RMTEK-DVVSWNTMVLAYAKKGYFNEAIGLHRDFRRLDMGFNEFSFAGVLILCVKLKELQ 217
            M +   +V++N M+  Y  +G++ EA+ + R      +  +EF++  V+  C     LQ
Sbjct: 244 GMDDNMKLVAYNAMISGYVNRGFYQEALEMVRRMVSSGIELDEFTYPSVIRACATAGLLQ 303

Query: 218 LTKQVHGQVLVVGFLSNVVLSSSIVDAYAKCGEMGCARRLFDEMLVKDILAWTTMVSGYA 277
           L KQVH  VL     S     +S+V  Y KCG+   AR +F++M  KD+++W  ++SGY 
Sbjct: 304 LGKQVHAYVLRREDFS-FHFDNSLVSLYYKCGKFDEARAIFEKMPAKDLVSWNALLSGYV 363

Query: 278 KWGDMNSASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMKFRINPDQYTFSS 337
             G +  A  +F +M EKN +SW  +ISG A N  G E L  F+ M +    P  Y FS 
Sbjct: 364 SSGHIGEAKLIFKEMKEKNILSWMIMISGLAENGFGEEGLKLFSCMKREGFEPCDYAFSG 423

Query: 338 CLCACASIAALKHGKQLHAYLIRTNFRCNTIVVSSLIDMYSKCGMLEASCRIFYLMGNKQ 397
            + +CA + A  +G+Q HA L++  F  +    ++LI MY+KCG++E + ++F  M    
Sbjct: 424 AIKSCAVLGAYCNGQQYHAQLLKIGFDSSLSAGNALITMYAKCGVVEEARQVFRTM-PCL 483

Query: 398 DVVLWNTMISALAQHGHGEEAMQMFNDMVESGLKPDRITFIVILSACSHSGLVQEGLRFF 457
           D V WN +I+AL QHGHG EA+ ++ +M++ G++PDRIT + +L+ACSH+GLV +G ++F
Sbjct: 484 DSVSWNALIAALGQHGHGAEAVDVYEEMLKKGIRPDRITLLTVLTACSHAGLVDQGRKYF 543

Query: 458 KAMTYDHGVLPDQEHYACLIDLLGRAGCFIELVNELEKMSCKPDDRVWNALLGVCRIHGN 517
            +M   + + P  +HYA LIDLL R+G F +  + +E +  KP   +W ALL  CR+HGN
Sbjct: 544 DSMETVYRIPPGADHYARLIDLLCRSGKFSDAESVIESLPFKPTAEIWEALLSGCRVHGN 603

Query: 518 IELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERAISWID 577
           +ELG   A+ +  L P+    Y+ L++++A  G+WE V +VR+LM +R V+KE A SWI+
Sbjct: 604 MELGIIAADKLFGLIPEHDGTYMLLSNMHAATGQWEEVARVRKLMRDRGVKKEVACSWIE 663

Query: 578 IGNKVHSFIASDRLHPLKEEIYSLLEQL 601
           +  +VH+F+  D  HP  E +Y  L+ L
Sbjct: 664 METQVHTFLVDDTSHPEAEAVYIYLQDL 689
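
As a quick check on the HSP header for this hit, the identity and positives percentages are the matched counts divided by the alignment length (192/508 and 301/508). The sketch below parses such a header and recomputes both values. The function name `parse_hsp_header` and the regular expressions are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Header text copied from the Q9FRI5 hit above.
HSP_HEADER = (
    "HSP 1 Score: 381.3 bits (978), Expect = 2.0e-104\n"
    "Identity = 192/508 (37.80%), Positives = 301/508 (59.25%), Query Frame = 0"
)


def parse_hsp_header(text):
    """Extract scores and recompute percentages from an HSP header (hypothetical helper)."""
    score = re.search(r"Score:\s*([\d.]+) bits \((\d+)\)", text)
    expect = re.search(r"Expect = (\S+)", text)
    ident = re.search(r"Identity = (\d+)/(\d+)", text)
    # Accept "Positives" as well as the variant spelling "Postives".
    pos = re.search(r"Posi?tives = (\d+)/(\d+)", text)
    if not (score and expect and ident and pos):
        raise ValueError("not a recognisable HSP header")

    matches, length = int(ident.group(1)), int(ident.group(2))
    positives = int(pos.group(1))
    return {
        "bit_score": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(expect.group(1)),
        # Percentages are recomputed from the counts, not read from the text.
        "identity_pct": round(100 * matches / length, 2),
        "positives_pct": round(100 * positives / length, 2),
    }


hsp = parse_hsp_header(HSP_HEADER)
print(hsp)
```

Recomputing the percentages from the raw counts, rather than trusting the printed values, gives a simple consistency check when many hits are processed in bulk.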

BLAST of Clc09G17420 vs. ExPASy TrEMBL
Match: A0A5D3C6K7 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold13G002430 PE=4 SV=1)

HSP 1 Score: 1140.6 bits (2949), Expect = 0.0e+00
Identity = 549/606 (90.59%), Positives = 577/606 (95.21%), Query Frame = 0

Query: 1   MPSLSSQALKKPTLFRPKSEQRPDSTRLCIVQSLLNLSSQGHLPEALSYLDPLAQRGIRL 60
           MPS SSQALKKP  FRPK EQ PDSTR+CI QSLL+LSSQG LPEALSYLD LAQRGIRL
Sbjct: 1   MPSFSSQALKKPASFRPKCEQSPDSTRICIAQSLLDLSSQGRLPEALSYLDRLAQRGIRL 60

Query: 61  PTSTFVNLLRLCAKAKFFKGGRCVHLHLKHTGFKRPTTIVANHLIGMYFECGNDIEARKV 120
           PTSTFV+LLRLCAKAK+FKGG+CVHLHLKHTGFKRPTTIVANHLIGMYFECG D+EARKV
Sbjct: 61  PTSTFVDLLRLCAKAKYFKGGKCVHLHLKHTGFKRPTTIVANHLIGMYFECGRDVEARKV 120

Query: 121 FDKMSLRNLYSWNHMLAGYAKLGNVHHARNLFDRMTEKDVVSWNTMVLAYAKKGYFNEAI 180
           FDKMS+RNLYSWNHMLAGYAKLG VHHAR LFDRM EKDVVSWNTMVLAYAKKG FNEAI
Sbjct: 121 FDKMSVRNLYSWNHMLAGYAKLGEVHHARKLFDRMMEKDVVSWNTMVLAYAKKGCFNEAI 180

Query: 181 GLHRDFRRLDMGFNEFSFAGVLILCVKLKELQLTKQVHGQVLVVGFLSNVVLSSSIVDAY 240
           GL+RDFRRLDMGFN FSF+GVLILCVKLKELQLTKQVHGQVLV GFLSN+VLS SIVDAY
Sbjct: 181 GLYRDFRRLDMGFNAFSFSGVLILCVKLKELQLTKQVHGQVLVAGFLSNLVLSCSIVDAY 240

Query: 241 AKCGEMGCARRLFDEMLVKDILAWTTMVSGYAKWGDMNSASELFHQMPEKNPVSWTALIS 300
           AKCG+MGCARRLFDEMLVKDI  WTTMVSGYAKWGDMNSASELFHQMPEKNPVSWTALIS
Sbjct: 241 AKCGKMGCARRLFDEMLVKDIHIWTTMVSGYAKWGDMNSASELFHQMPEKNPVSWTALIS 300

Query: 301 GYARNSLGHEALDYFTKMMKFRINPDQYTFSSCLCACASIAALKHGKQLHAYLIRTNFRC 360
           GYARNSLGHEALDYFTKMMK  INP+QYTFSSCLCACASIAALKHGKQ+H YLIRTNFRC
Sbjct: 301 GYARNSLGHEALDYFTKMMKLGINPEQYTFSSCLCACASIAALKHGKQVHGYLIRTNFRC 360

Query: 361 NTIVVSSLIDMYSKCGMLEASCRIFYLMGNKQDVVLWNTMISALAQHGHGEEAMQMFNDM 420
           NTIVVSSLIDMYSKCGMLEASC +F+LMGNKQDVV+WNTMIS LAQ+GHGE+AMQMFN M
Sbjct: 361 NTIVVSSLIDMYSKCGMLEASCYVFHLMGNKQDVVVWNTMISGLAQNGHGEKAMQMFNHM 420

Query: 421 VESGLKPDRITFIVILSACSHSGLVQEGLRFFKAMTYDHGVLPDQEHYACLIDLLGRAGC 480
           VESG+KPDRITFIVILSACSHSGLVQEGL+FFKAMTYDHGVLPDQEHYACLIDLLGRAGC
Sbjct: 421 VESGVKPDRITFIVILSACSHSGLVQEGLQFFKAMTYDHGVLPDQEHYACLIDLLGRAGC 480

Query: 481 FIELVNELEKMSCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASL 540
           FIELVNELE MSCKPDDRVW+ALLGVCRIH NIELGRKVAEHVIEL+PQSSAAYVSLA L
Sbjct: 481 FIELVNELENMSCKPDDRVWSALLGVCRIHNNIELGRKVAEHVIELKPQSSAAYVSLAGL 540

Query: 541 YAFLGKWESVEKVRELMEERFVRKERAISWIDIGNKVHSFIASDRLHPLKEEIYSLLEQL 600
           YAFLGKWESVEKVRELM+E+F+RKERAISWID+GNK+HSFIASDRLHPLKEEIY LLEQL
Sbjct: 541 YAFLGKWESVEKVRELMDEKFIRKERAISWIDVGNKIHSFIASDRLHPLKEEIYVLLEQL 600

Query: 601 ASHTED 607
           A HTE+
Sbjct: 601 ARHTEE 606

BLAST of Clc09G17420 vs. ExPASy TrEMBL
Match: A0A1S3BZY1 (pentatricopeptide repeat-containing protein At2g21090 OS=Cucumis melo OX=3656 GN=LOC103495423 PE=4 SV=1)

HSP 1 Score: 1140.6 bits (2949), Expect = 0.0e+00
Identity = 549/606 (90.59%), Positives = 577/606 (95.21%), Query Frame = 0

Query: 1   MPSLSSQALKKPTLFRPKSEQRPDSTRLCIVQSLLNLSSQGHLPEALSYLDPLAQRGIRL 60
           MPS SSQALKKP  FRPK EQ PDSTR+CI QSLL+LSSQG LPEALSYLD LAQRGIRL
Sbjct: 1   MPSFSSQALKKPASFRPKCEQSPDSTRICIAQSLLDLSSQGRLPEALSYLDRLAQRGIRL 60

Query: 61  PTSTFVNLLRLCAKAKFFKGGRCVHLHLKHTGFKRPTTIVANHLIGMYFECGNDIEARKV 120
           PTSTFV+LLRLCAKAK+FKGG+CVHLHLKHTGFKRPTTIVANHLIGMYFECG D+EARKV
Sbjct: 61  PTSTFVDLLRLCAKAKYFKGGKCVHLHLKHTGFKRPTTIVANHLIGMYFECGRDVEARKV 120

Query: 121 FDKMSLRNLYSWNHMLAGYAKLGNVHHARNLFDRMTEKDVVSWNTMVLAYAKKGYFNEAI 180
           FDKMS+RNLYSWNHMLAGYAKLG VHHAR LFDRM EKDVVSWNTMVLAYAKKG FNEAI
Sbjct: 121 FDKMSVRNLYSWNHMLAGYAKLGEVHHARKLFDRMMEKDVVSWNTMVLAYAKKGCFNEAI 180

Query: 181 GLHRDFRRLDMGFNEFSFAGVLILCVKLKELQLTKQVHGQVLVVGFLSNVVLSSSIVDAY 240
           GL+RDFRRLDMGFN FSF+GVLILCVKLKELQLTKQVHGQVLV GFLSN+VLS SIVDAY
Sbjct: 181 GLYRDFRRLDMGFNAFSFSGVLILCVKLKELQLTKQVHGQVLVAGFLSNLVLSCSIVDAY 240

Query: 241 AKCGEMGCARRLFDEMLVKDILAWTTMVSGYAKWGDMNSASELFHQMPEKNPVSWTALIS 300
           AKCG+MGCARRLFDEMLVKDI  WTTMVSGYAKWGDMNSASELFHQMPEKNPVSWTALIS
Sbjct: 241 AKCGKMGCARRLFDEMLVKDIHIWTTMVSGYAKWGDMNSASELFHQMPEKNPVSWTALIS 300

Query: 301 GYARNSLGHEALDYFTKMMKFRINPDQYTFSSCLCACASIAALKHGKQLHAYLIRTNFRC 360
           GYARNSLGHEALDYFTKMMK  INP+QYTFSSCLCACASIAALKHGKQ+H YLIRTNFRC
Sbjct: 301 GYARNSLGHEALDYFTKMMKLGINPEQYTFSSCLCACASIAALKHGKQVHGYLIRTNFRC 360

Query: 361 NTIVVSSLIDMYSKCGMLEASCRIFYLMGNKQDVVLWNTMISALAQHGHGEEAMQMFNDM 420
           NTIVVSSLIDMYSKCGMLEASC +F+LMGNKQDVV+WNTMIS LAQ+GHGE+AMQMFN M
Sbjct: 361 NTIVVSSLIDMYSKCGMLEASCYVFHLMGNKQDVVVWNTMISGLAQNGHGEKAMQMFNHM 420

Query: 421 VESGLKPDRITFIVILSACSHSGLVQEGLRFFKAMTYDHGVLPDQEHYACLIDLLGRAGC 480
           VESG+KPDRITFIVILSACSHSGLVQEGL+FFKAMTYDHGVLPDQEHYACLIDLLGRAGC
Sbjct: 421 VESGVKPDRITFIVILSACSHSGLVQEGLQFFKAMTYDHGVLPDQEHYACLIDLLGRAGC 480

Query: 481 FIELVNELEKMSCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASL 540
           FIELVNELE MSCKPDDRVW+ALLGVCRIH NIELGRKVAEHVIEL+PQSSAAYVSLA L
Sbjct: 481 FIELVNELENMSCKPDDRVWSALLGVCRIHNNIELGRKVAEHVIELKPQSSAAYVSLAGL 540

Query: 541 YAFLGKWESVEKVRELMEERFVRKERAISWIDIGNKVHSFIASDRLHPLKEEIYSLLEQL 600
           YAFLGKWESVEKVRELM+E+F+RKERAISWID+GNK+HSFIASDRLHPLKEEIY LLEQL
Sbjct: 541 YAFLGKWESVEKVRELMDEKFIRKERAISWIDVGNKIHSFIASDRLHPLKEEIYVLLEQL 600

Query: 601 ASHTED 607
           A HTE+
Sbjct: 601 ARHTEE 606

BLAST of Clc09G17420 vs. ExPASy TrEMBL
Match: A0A6J1HUW8 (pentatricopeptide repeat-containing protein At2g21090-like OS=Cucurbita maxima OX=3661 GN=LOC111467037 PE=4 SV=1)

HSP 1 Score: 1128.2 bits (2917), Expect = 0.0e+00
Identity = 548/611 (89.69%), Positives = 578/611 (94.60%), Query Frame = 0

Query: 1   MPSLSSQALKKPTLFRPKSEQRPDSTRLCIVQSLLNLSSQGHLPEALSYLDPLAQRGIRL 60
           MPS SSQALKKP +FRPKS+  PDS+R CIVQSLLN SSQGHLPEALSYLDPL QRGIRL
Sbjct: 1   MPSFSSQALKKPAVFRPKSKHLPDSSRPCIVQSLLNHSSQGHLPEALSYLDPLVQRGIRL 60

Query: 61  PTSTFVNLLRLCAKAKFFKGGRCVHLHLKHTGFKRPTTIVANHLIGMYFECGNDIEARKV 120
           PTS FV+LLRLCAKAK  KGG+ VHLHLK TGFKRPTTI+ANHLIGMYF+CG+DIEARKV
Sbjct: 61  PTSVFVHLLRLCAKAKSLKGGKSVHLHLKLTGFKRPTTIIANHLIGMYFQCGSDIEARKV 120

Query: 121 FDKMSLRNLYSWNHMLAGYAKLGNVHHARNLFDRMTEKDVVSWNTMVLAYAKKGYFNEAI 180
           FDKMS+RNLYSWNHMLAGYAKLGNV+ AR +FD M EKDV+SWNTMVLAYAKKG FNEAI
Sbjct: 121 FDKMSVRNLYSWNHMLAGYAKLGNVYQARKVFDTMIEKDVISWNTMVLAYAKKGCFNEAI 180

Query: 181 GLHRDFRRLDMGFNEFSFAGVLILCVKLKELQLTKQVHGQVLVVGFLSNVVLSSSIVDAY 240
           G +RDFRR DMGFNEFSFAGVLILCVKLKELQL KQVH QVLVVGFLSN+VLSSSIVDAY
Sbjct: 181 GFYRDFRRQDMGFNEFSFAGVLILCVKLKELQLAKQVHTQVLVVGFLSNIVLSSSIVDAY 240

Query: 241 AKCGEMGCARRLFDEMLVKDILAWTTMVSGYAKWGDMNSASELFHQMPEKNPVSWTALIS 300
           AKCGEM CA+RLFDEM VKDILAWTTMVSGYAKWGDMN AS LFHQMPEKNPVSWTALIS
Sbjct: 241 AKCGEMECAKRLFDEMPVKDILAWTTMVSGYAKWGDMNLASGLFHQMPEKNPVSWTALIS 300

Query: 301 GYARNSLGHEALDYFTKMMKFRINPDQYTFSSCLCACASIAALKHGKQLHAYLIRTNFRC 360
           GYARNSLGHEALDYFT+MMKFR+NPDQ+TFSSCLCACASIAALKHGKQ+HAYLIRTNFRC
Sbjct: 301 GYARNSLGHEALDYFTQMMKFRVNPDQFTFSSCLCACASIAALKHGKQVHAYLIRTNFRC 360

Query: 361 NTIVVSSLIDMYSKCGMLEASCRIFYLMGNKQDVVLWNTMISALAQHGHGEEAMQMFNDM 420
           NTIVVSSLIDMYSKCGMLEA+CR+FYL+GNKQDVVLWNTMISALAQHGHGE+AMQMFNDM
Sbjct: 361 NTIVVSSLIDMYSKCGMLEAACRVFYLLGNKQDVVLWNTMISALAQHGHGEKAMQMFNDM 420

Query: 421 VESGLKPDRITFIVILSACSHSGLVQEGLRFFKAMTYDHGVLPDQEHYACLIDLLGRAGC 480
           VESGLKPDRITFIVILSACSHSGLV EGL+FFKAM+YDH VLPDQEHYACLIDLLGRAGC
Sbjct: 421 VESGLKPDRITFIVILSACSHSGLVHEGLQFFKAMSYDHSVLPDQEHYACLIDLLGRAGC 480

Query: 481 FIELVNELEKMSCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASL 540
           F ELVNELEKM CKPDDR+WNALLGVCRIHGNIELGRKVAEHVIELEP+SSAAYVSLASL
Sbjct: 481 FTELVNELEKMPCKPDDRIWNALLGVCRIHGNIELGRKVAEHVIELEPRSSAAYVSLASL 540

Query: 541 YAFLGKWESVEKVRELMEERFVRKERAISWIDIGNKVHSFIASDRLHPLKEEIYSLLEQL 600
           YAFLGKWESVE+VRE+MEER VRKERAISWIDI NKVHSFIASDR HPLKEEIYSLLEQL
Sbjct: 541 YAFLGKWESVEQVREVMEERLVRKERAISWIDIENKVHSFIASDRFHPLKEEIYSLLEQL 600

Query: 601 ASHT-EDFSII 611
           ASHT EDFSI+
Sbjct: 601 ASHTEEDFSIV 611

BLAST of Clc09G17420 vs. ExPASy TrEMBL
Match: A0A6J1HLP5 (pentatricopeptide repeat-containing protein At2g21090 OS=Cucurbita moschata OX=3662 GN=LOC111464099 PE=4 SV=1)

HSP 1 Score: 1125.5 bits (2910), Expect = 0.0e+00
Identity = 547/611 (89.53%), Positives = 578/611 (94.60%), Query Frame = 0

Query: 1   MPSLSSQALKKPTLFRPKSEQRPDSTRLCIVQSLLNLSSQGHLPEALSYLDPLAQRGIRL 60
           MPS SSQALKKP +FRPKS+Q PDSTR CIVQSLLN SSQG+LPEALS+LDPL QRGIRL
Sbjct: 1   MPSFSSQALKKPAVFRPKSKQLPDSTRPCIVQSLLNHSSQGNLPEALSFLDPLVQRGIRL 60

Query: 61  PTSTFVNLLRLCAKAKFFKGGRCVHLHLKHTGFKRPTTIVANHLIGMYFECGNDIEARKV 120
           PTS FV+LLRLCAKAK  KGG+ VHLHLK TGFKRPTTIVANHLIGMYF+CG+D EARKV
Sbjct: 61  PTSVFVHLLRLCAKAKSLKGGKSVHLHLKLTGFKRPTTIVANHLIGMYFQCGSDTEARKV 120

Query: 121 FDKMSLRNLYSWNHMLAGYAKLGNVHHARNLFDRMTEKDVVSWNTMVLAYAKKGYFNEAI 180
           FDKMS+RNLYSWNHMLAGYAKLGNV+ AR LFD M EKDV+SWNTMVLAYAKKG FNEAI
Sbjct: 121 FDKMSVRNLYSWNHMLAGYAKLGNVYQARKLFDTMIEKDVISWNTMVLAYAKKGCFNEAI 180

Query: 181 GLHRDFRRLDMGFNEFSFAGVLILCVKLKELQLTKQVHGQVLVVGFLSNVVLSSSIVDAY 240
           GL+RDFRR DMGFNEFSFAG+LILCVKLKELQL KQVH QVLVVGFLSN+VLSSSIVDAY
Sbjct: 181 GLYRDFRRQDMGFNEFSFAGLLILCVKLKELQLAKQVHTQVLVVGFLSNIVLSSSIVDAY 240

Query: 241 AKCGEMGCARRLFDEMLVKDILAWTTMVSGYAKWGDMNSASELFHQMPEKNPVSWTALIS 300
           AKCGEM CA+RLFDEM VKDILAWTTMVSGYAKWGDMN AS LFHQMPEKNPVSWTALIS
Sbjct: 241 AKCGEMECAKRLFDEMPVKDILAWTTMVSGYAKWGDMNLASGLFHQMPEKNPVSWTALIS 300

Query: 301 GYARNSLGHEALDYFTKMMKFRINPDQYTFSSCLCACASIAALKHGKQLHAYLIRTNFRC 360
           GYARNSLGHEALDYFT+MMKFR+NPDQ+TFSSCLCACASIAALKHGKQ+HAYLIRTNFRC
Sbjct: 301 GYARNSLGHEALDYFTQMMKFRVNPDQFTFSSCLCACASIAALKHGKQVHAYLIRTNFRC 360

Query: 361 NTIVVSSLIDMYSKCGMLEASCRIFYLMGNKQDVVLWNTMISALAQHGHGEEAMQMFNDM 420
           NTIVVSSLIDMYSKCGMLEA+C +FYL+GNKQDVVLWNTMISALAQHGHGE+AMQMFNDM
Sbjct: 361 NTIVVSSLIDMYSKCGMLEAACSVFYLLGNKQDVVLWNTMISALAQHGHGEKAMQMFNDM 420

Query: 421 VESGLKPDRITFIVILSACSHSGLVQEGLRFFKAMTYDHGVLPDQEHYACLIDLLGRAGC 480
           VESGL PDRITFIVILSACSHSGLVQEGL+FFKAM+YDHG+LPDQEHYACLIDLLGRAGC
Sbjct: 421 VESGLNPDRITFIVILSACSHSGLVQEGLQFFKAMSYDHGILPDQEHYACLIDLLGRAGC 480

Query: 481 FIELVNELEKMSCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASL 540
           F ELVNELEKM CKPDDR+WN LLGVCRIHGNIELGRKVAEHVIELEP+SSAAYVSLASL
Sbjct: 481 FTELVNELEKMPCKPDDRIWNTLLGVCRIHGNIELGRKVAEHVIELEPRSSAAYVSLASL 540

Query: 541 YAFLGKWESVEKVRELMEERFVRKERAISWIDIGNKVHSFIASDRLHPLKEEIYSLLEQL 600
           YAFLGKWESVE+VRE+MEER VRKERAISWIDI NKVHSFIASDR HPLKEEIYSLLEQL
Sbjct: 541 YAFLGKWESVEQVREVMEERLVRKERAISWIDIENKVHSFIASDRFHPLKEEIYSLLEQL 600

Query: 601 ASHT-EDFSII 611
           ASHT EDFSI+
Sbjct: 601 ASHTEEDFSIV 611

BLAST of Clc09G17420 vs. ExPASy TrEMBL
Match: A0A0A0K215 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G062860 PE=4 SV=1)

HSP 1 Score: 1117.4 bits (2889), Expect = 0.0e+00
Identity = 539/606 (88.94%), Positives = 572/606 (94.39%), Query Frame = 0

Query: 1   MPSLSSQALKKPTLFRPKSEQRPDSTRLCIVQSLLNLSSQGHLPEALSYLDPLAQRGIRL 60
           MPS SSQA K P  F PKS+QRPDST LCI QSLL+LSSQG LPEALSYLD LAQRG+RL
Sbjct: 1   MPSFSSQAFKTPASFGPKSKQRPDSTSLCIAQSLLDLSSQGRLPEALSYLDRLAQRGVRL 60

Query: 61  PTSTFVNLLRLCAKAKFFKGGRCVHLHLKHTGFKRPTTIVANHLIGMYFECGNDIEARKV 120
           PT  FV+LLRLCAKAK+FKGG+CVHLHLKHTGFKRPTTIVANHLIGMYFECG D+EARKV
Sbjct: 61  PTGIFVDLLRLCAKAKYFKGGKCVHLHLKHTGFKRPTTIVANHLIGMYFECGRDVEARKV 120

Query: 121 FDKMSLRNLYSWNHMLAGYAKLGNVHHARNLFDRMTEKDVVSWNTMVLAYAKKGYFNEAI 180
           FDKMS+RNLYSWNHMLAGYAKLG+V++AR LFDRM EKDVVSWNT+VLAYAK+G FNEAI
Sbjct: 121 FDKMSVRNLYSWNHMLAGYAKLGDVNNARKLFDRMMEKDVVSWNTIVLAYAKQGCFNEAI 180

Query: 181 GLHRDFRRLDMGFNEFSFAGVLILCVKLKELQLTKQVHGQVLVVGFLSNVVLSSSIVDAY 240
           GL+RDFRRLDMGFN FSFAGVLILCVKLKELQL KQVHGQVLV GFLSN+VLSSSIVDAY
Sbjct: 181 GLYRDFRRLDMGFNAFSFAGVLILCVKLKELQLAKQVHGQVLVAGFLSNLVLSSSIVDAY 240

Query: 241 AKCGEMGCARRLFDEMLVKDILAWTTMVSGYAKWGDMNSASELFHQMPEKNPVSWTALIS 300
           AKCGEM CAR LFDEMLVKDI AWTT+VSGYAKWGDMNSASELFHQMPEKNPVSW+ALIS
Sbjct: 241 AKCGEMRCARTLFDEMLVKDIHAWTTIVSGYAKWGDMNSASELFHQMPEKNPVSWSALIS 300

Query: 301 GYARNSLGHEALDYFTKMMKFRINPDQYTFSSCLCACASIAALKHGKQLHAYLIRTNFRC 360
           GYARNSLGHEALDYFTKMMKF INP+QYTFSSCLCACASIAALKHGKQ+H YLIRT FRC
Sbjct: 301 GYARNSLGHEALDYFTKMMKFGINPEQYTFSSCLCACASIAALKHGKQVHGYLIRTYFRC 360

Query: 361 NTIVVSSLIDMYSKCGMLEASCRIFYLMGNKQDVVLWNTMISALAQHGHGEEAMQMFNDM 420
           NTIVVSSLIDMYSKCGMLEASC +F+LMGNKQDVV+WNTMISALAQ+GHGE+AMQMFNDM
Sbjct: 361 NTIVVSSLIDMYSKCGMLEASCCVFHLMGNKQDVVVWNTMISALAQNGHGEKAMQMFNDM 420

Query: 421 VESGLKPDRITFIVILSACSHSGLVQEGLRFFKAMTYDHGVLPDQEHYACLIDLLGRAGC 480
           VESGLKPDRITFIVILSACSHSGLVQEGLRFFKAMTYDHGV PDQEHY+CLIDLLGRAGC
Sbjct: 421 VESGLKPDRITFIVILSACSHSGLVQEGLRFFKAMTYDHGVFPDQEHYSCLIDLLGRAGC 480

Query: 481 FIELVNELEKMSCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASL 540
           F+ELVNELE MSCKPDDRVW+ALLGVCRIH NIELGRKVAE VIEL+PQSSAAYVSLASL
Sbjct: 481 FVELVNELENMSCKPDDRVWSALLGVCRIHNNIELGRKVAERVIELKPQSSAAYVSLASL 540

Query: 541 YAFLGKWESVEKVRELMEERFVRKERAISWIDIGNKVHSFIASDRLHPLKEEIYSLLEQL 600
           YAFLGKWESVEKVRELM+E+F+RKER ISWID+GNK HSFIASDRLHPLKEEIY LLEQL
Sbjct: 541 YAFLGKWESVEKVRELMDEKFIRKERGISWIDVGNKTHSFIASDRLHPLKEEIYLLLEQL 600

Query: 601 ASHTED 607
           A HTE+
Sbjct: 601 ARHTEE 606

BLAST of Clc09G17420 vs. TAIR 10
Match: AT2G21090.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 729.6 bits (1882), Expect = 2.1e-210
Identity = 346/587 (58.94%), Positives = 451/587 (76.83%), Query Frame = 0

Query: 23  PDSTRLCIVQSLLNL-SSQGHLPEALSYLDPLAQRGIRLPTSTFVNLLRLCAKAKFFKGG 82
           P    +C+ QS L+  +++  L +A+S L+ L Q+GIRLP     +LL+ C   K  K G
Sbjct: 6   PRKRPICVAQSFLSKHATKAELSQAVSRLESLTQQGIRLPFDLLASLLQQCGDTKSLKQG 65

Query: 83  RCVHLHLKHTGFKRPTTIVANHLIGMYFECGNDIEARKVFDKMSLRNLYSWNHMLAGYAK 142
           + +H HLK TGFKRP T+++NHLIGMY +CG  I+A KVFD+M LRNLYSWN+M++GY K
Sbjct: 66  KWIHRHLKITGFKRPNTLLSNHLIGMYMKCGKPIDACKVFDQMHLRNLYSWNNMVSGYVK 125

Query: 143 LGNVHHARNLFDRMTEKDVVSWNTMVLAYAKKGYFNEAIGLHRDFRRLDMGFNEFSFAGV 202
            G +  AR +FD M E+DVVSWNTMV+ YA+ G  +EA+  +++FRR  + FNEFSFAG+
Sbjct: 126 SGMLVRARVVFDSMPERDVVSWNTMVIGYAQDGNLHEALWFYKEFRRSGIKFNEFSFAGL 185

Query: 203 LILCVKLKELQLTKQVHGQVLVVGFLSNVVLSSSIVDAYAKCGEMGCARRLFDEMLVKDI 262
           L  CVK ++LQL +Q HGQVLV GFLSNVVLS SI+DAYAKCG+M  A+R FDEM VKDI
Sbjct: 186 LTACVKSRQLQLNRQAHGQVLVAGFLSNVVLSCSIIDAYAKCGQMESAKRCFDEMTVKDI 245

Query: 263 LAWTTMVSGYAKWGDMNSASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMKF 322
             WTT++SGYAK GDM +A +LF +MPEKNPVSWTALI+GY R   G+ ALD F KM+  
Sbjct: 246 HIWTTLISGYAKLGDMEAAEKLFCEMPEKNPVSWTALIAGYVRQGSGNRALDLFRKMIAL 305

Query: 323 RINPDQYTFSSCLCACASIAALKHGKQLHAYLIRTNFRCNTIVVSSLIDMYSKCGMLEAS 382
            + P+Q+TFSSCLCA ASIA+L+HGK++H Y+IRTN R N IV+SSLIDMYSK G LEAS
Sbjct: 306 GVKPEQFTFSSCLCASASIASLRHGKEIHGYMIRTNVRPNAIVISSLIDMYSKSGSLEAS 365

Query: 383 CRIFYLMGNKQDVVLWNTMISALAQHGHGEEAMQMFNDMVESGLKPDRITFIVILSACSH 442
            R+F +  +K D V WNTMISALAQHG G +A++M +DM++  ++P+R T +VIL+ACSH
Sbjct: 366 ERVFRICDDKHDCVFWNTMISALAQHGLGHKALRMLDDMIKFRVQPNRTTLVVILNACSH 425

Query: 443 SGLVQEGLRFFKAMTYDHGVLPDQEHYACLIDLLGRAGCFIELVNELEKMSCKPDDRVWN 502
           SGLV+EGLR+F++MT  HG++PDQEHYACLIDLLGRAGCF EL+ ++E+M  +PD  +WN
Sbjct: 426 SGLVEEGLRWFESMTVQHGIVPDQEHYACLIDLLGRAGCFKELMRKIEEMPFEPDKHIWN 485

Query: 503 ALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERF 562
           A+LGVCRIHGN ELG+K A+ +I+L+P+SSA Y+ L+S+YA  GKWE VEK+R +M++R 
Sbjct: 486 AILGVCRIHGNEELGKKAADELIKLDPESSAPYILLSSIYADHGKWELVEKLRGVMKKRR 545

Query: 563 VRKERAISWIDIGNKVHSFIASD--RLHPLKEEIYSLLEQLASHTED 607
           V KE+A+SWI+I  KV +F  SD    H  KEEIY +L  LA+  E+
Sbjct: 546 VNKEKAVSWIEIEKKVEAFTVSDGSHAHARKEEIYFILHNLAAVIEE 592

BLAST of Clc09G17420 vs. TAIR 10
Match: AT2G13600.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 402.9 bits (1034), Expect = 4.6e-112
Identity = 230/662 (34.74%), Positives = 337/662 (50.91%), Query Frame = 0

Query: 48  SYLDPLAQRGIRLPTSTFVNLLRLCAKAKFFK-GGRCVHLHLKHTGFKRPTTIVANHLIG 107
           S+L   A       +S F  LL  C K+K      R VH  +  +GF      + N LI 
Sbjct: 5   SFLKLAADLSSFTDSSPFAKLLDSCIKSKLSAIYVRYVHASVIKSGFSN-EIFIQNRLID 64

Query: 108 MYFECGNDIEARKVFDKMSLRNLYSWNHMLAGYAKLGNVHHARNLFDRMTEKDVVSWNTM 167
            Y +CG+  + R+VFDKM  RN+Y+WN ++ G  KLG +  A +LF  M E+D  +WN+M
Sbjct: 65  AYSKCGSLEDGRQVFDKMPQRNIYTWNSVVTGLTKLGFLDEADSLFRSMPERDQCTWNSM 124

Query: 168 VLAYAKKGYFNEAIGLHRDFRRLDMGFNEFSFAGVLILCVKLKELQLTKQVHGQVLVVGF 227
           V  +A+     EA+       +     NE+SFA VL  C  L ++    QVH  +    F
Sbjct: 125 VSGFAQHDRCEEALCYFAMMHKEGFVLNEYSFASVLSACSGLNDMNKGVQVHSLIAKSPF 184

Query: 228 LSNVVLSSSIVDAYAKCGEMGCARRLFDEMLVKDILAW---------------------- 287
           LS+V + S++VD Y+KCG +  A+R+FDEM  +++++W                      
Sbjct: 185 LSDVYIGSALVDMYSKCGNVNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQM 244

Query: 288 ------------------------------------------------------------ 347
                                                                       
Sbjct: 245 MLESRVEPDEVTLASVISACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKCS 304

Query: 348 --------------------TTMVSGYAKWGDMNSASELFHQMPEKNPVSWTALISGYAR 407
                               T+M+SGYA      +A  +F +M E+N VSW ALI+GY +
Sbjct: 305 RIKEARFIFDSMPIRNVIAETSMISGYAMAASTKAARLMFTKMAERNVVSWNALIAGYTQ 364

Query: 408 NSLGHEALDYFTKMMKFRINPDQYTFSSCLCACASIAALKHGKQLHAYLIRTNFRCNT-- 467
           N    EAL  F  + +  + P  Y+F++ L ACA +A L  G Q H ++++  F+  +  
Sbjct: 365 NGENEEALSLFCLLKRESVCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFKFQSGE 424

Query: 468 ----IVVSSLIDMYSKCGMLEASCRIFYLMGNKQDVVLWNTMISALAQHGHGEEAMQMFN 527
                V +SLIDMY KCG +E    +F  M  ++D V WN MI   AQ+G+G EA+++F 
Sbjct: 425 EDDIFVGNSLIDMYVKCGCVEEGYLVFRKM-MERDCVSWNAMIIGFAQNGYGNEALELFR 484

Query: 528 DMVESGLKPDRITFIVILSACSHSGLVQEGLRFFKAMTYDHGVLPDQEHYACLIDLLGRA 587
           +M+ESG KPD IT I +LSAC H+G V+EG  +F +MT D GV P ++HY C++DLLGRA
Sbjct: 485 EMLESGEKPDHITMIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRA 544

Query: 588 GCFIELVNELEKMSCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLA 601
           G   E  + +E+M  +PD  +W +LL  C++H NI LG+ VAE ++E+EP +S  YV L+
Sbjct: 545 GFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPYVLLS 604

BLAST of Clc09G17420 vs. TAIR 10
Match: AT4G37170.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 396.0 bits (1016), Expect = 5.6e-110
Identity = 207/565 (36.64%), Positives = 332/565 (58.76%), Query Frame = 0

Query: 37  LSSQGHLPEALSYLDPLAQRGIRLPTSTFVNLLRLCAKAKFFKGGRCVHLHLKHTGFKRP 96
           L  Q  L EA+  L     R  + P ST+ NL+++C++ +  + G+ VH H++ +GF  P
Sbjct: 64  LCGQKLLREAVQLLG----RAKKPPASTYCNLIQVCSQTRALEEGKKVHEHIRTSGFV-P 123

Query: 97  TTIVANHLIGMYFECGNDIEARKVFDKMSLRNLYSWNHMLAGYAKLGNVHHARNLFDRMT 156
             ++ N L+ MY +CG+ ++ARKVFD+M  R+L SWN M+ GYA++G +  AR LFD MT
Sbjct: 124 GIVIWNRLLRMYAKCGSLVDARKVFDEMPNRDLCSWNVMVNGYAEVGLLEEARKLFDEMT 183

Query: 157 EKDVVSWNTMVLAYAKKGYFNEAIGLHRDFRRL-DMGFNEFSFAGVLILCVKLKELQLTK 216
           EKD  SW  MV  Y KK    EA+ L+   +R+ +   N F+ +  +     +K ++  K
Sbjct: 184 EKDSYSWTAMVTGYVKKDQPEEALVLYSLMQRVPNSRPNIFTVSIAVAAAAAVKCIRRGK 243

Query: 217 QVHGQVLVVGFLSNVVLSSSIVDAYAKCGEMGCARRLFDEMLVKDILAWTTMVSGYAKWG 276
           ++HG ++  G  S+ VL SS++D Y KCG +  AR +FD+++ KD+              
Sbjct: 244 EIHGHIVRAGLDSDEVLWSSLMDMYGKCGCIDEARNIFDKIVEKDV-------------- 303

Query: 277 DMNSASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMKFRINPDQYTFSSCLC 336
                            VSWT++I  Y ++S   E    F++++     P++YTF+  L 
Sbjct: 304 -----------------VSWTSMIDRYFKSSRWREGFSLFSELVGSCERPNEYTFAGVLN 363

Query: 337 ACASIAALKHGKQLHAYLIRTNFRCNTIVVSSLIDMYSKCGMLEASCRIFYLMGNKQDVV 396
           ACA +   + GKQ+H Y+ R  F   +   SSL+DMY+KCG +E++  +      K D+V
Sbjct: 364 ACADLTTEELGKQVHGYMTRVGFDPYSFASSSLVDMYTKCGNIESAKHVVDGC-PKPDLV 423

Query: 397 LWNTMISALAQHGHGEEAMQMFNDMVESGLKPDRITFIVILSACSHSGLVQEGLRFFKAM 456
            W ++I   AQ+G  +EA++ F+ +++SG KPD +TF+ +LSAC+H+GLV++GL FF ++
Sbjct: 424 SWTSLIGGCAQNGQPDEALKYFDLLLKSGTKPDHVTFVNVLSACTHAGLVEKGLEFFYSI 483

Query: 457 TYDHGVLPDQEHYACLIDLLGRAGCFIELVNELEKMSCKPDDRVWNALLGVCRIHGNIEL 516
           T  H +    +HY CL+DLL R+G F +L + + +M  KP   +W ++LG C  +GNI+L
Sbjct: 484 TEKHRLSHTSDHYTCLVDLLARSGRFEQLKSVISEMPMKPSKFLWASVLGGCSTYGNIDL 543

Query: 517 GRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERAISWIDIGN 576
             + A+ + ++EP++   YV++A++YA  GKWE   K+R+ M+E  V K    SW +I  
Sbjct: 544 AEEAAQELFKIEPENPVTYVTMANIYAAAGKWEEEGKMRKRMQEIGVTKRPGSSWTEIKR 591

Query: 577 KVHSFIASDRLHPLKEEIYSLLEQL 601
           K H FIA+D  HP+  +I   L +L
Sbjct: 604 KRHVFIAADTSHPMYNQIVEFLREL 591

BLAST of Clc09G17420 vs. TAIR 10
Match: AT4G02750.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 383.3 bits (983), Expect = 3.7e-106
Identity = 214/613 (34.91%), Positives = 323/613 (52.69%), Query Frame = 0

Query: 73  AKAKFFKGGRCVHLHLKHTGFKRPTTIVANHLIGMYFECGNDIEARKVFDKMSLRNLYSW 132
           A + + + GRC           R +++  N +I  Y   G    ARK+FD+M  R+L SW
Sbjct: 70  AISSYMRTGRCNEALRVFKRMPRWSSVSYNGMISGYLRNGEFELARKLFDEMPERDLVSW 129

Query: 133 NHMLAGYAKLGNVHHARNLFDRMTEKDVVSWNTMVLAYAKKGYFNEAIGL-HRDFRRLDM 192
           N M+ GY +  N+  AR LF+ M E+DV SWNTM+  YA+ G  ++A  +  R   + D+
Sbjct: 130 NVMIKGYVRNRNLGKARELFEIMPERDVCSWNTMLSGYAQNGCVDDARSVFDRMPEKNDV 189

Query: 193 GFNEFSFAGVL-----ILCVKLKELQLTKQVHGQVLVVGFLS-----------------N 252
            +N    A V        C+  K  +    V    L+ GF+                  +
Sbjct: 190 SWNALLSAYVQNSKMEEACMLFKSRENWALVSWNCLLGGFVKKKKIVEARQFFDSMNVRD 249

Query: 253 VVLSSSIVDAYAKCGEMGCARRLFDEMLVKDILAWT------------------------ 312
           VV  ++I+  YA+ G++  AR+LFDE  V+D+  WT                        
Sbjct: 250 VVSWNTIITGYAQSGKIDEARQLFDESPVQDVFTWTAMVSGYIQNRMVEEARELFDKMPE 309

Query: 313 --------------------------------------TMVSGYAKWGDMNSASELFHQM 372
                                                 TM++GYA+ G ++ A  LF +M
Sbjct: 310 RNEVSWNAMLAGYVQGERMEMAKELFDVMPCRNVSTWNTMITGYAQCGKISEAKNLFDKM 369

Query: 373 PEKNPVSWTALISGYARNSLGHEALDYFTKMMKFRINPDQYTFSSCLCACASIAALKHGK 432
           P+++PVSW A+I+GY+++    EAL  F +M +     ++ +FSS L  CA + AL+ GK
Sbjct: 370 PKRDPVSWAAMIAGYSQSGHSFEALRLFVQMEREGGRLNRSSFSSALSTCADVVALELGK 429

Query: 433 QLHAYLIRTNFRCNTIVVSSLIDMYSKCGMLEASCRIFYLMGNKQDVVLWNTMISALAQH 492
           QLH  L++  +     V ++L+ MY KCG +E +  +F  M  K D+V WNTMI+  ++H
Sbjct: 430 QLHGRLVKGGYETGCFVGNALLLMYCKCGSIEEANDLFKEMAGK-DIVSWNTMIAGYSRH 489

Query: 493 GHGEEAMQMFNDMVESGLKPDRITFIVILSACSHSGLVQEGLRFFKAMTYDHGVLPDQEH 552
           G GE A++ F  M   GLKPD  T + +LSACSH+GLV +G ++F  MT D+GV+P+ +H
Sbjct: 490 GFGEVALRFFESMKREGLKPDDATMVAVLSACSHTGLVDKGRQYFYTMTQDYGVMPNSQH 549

Query: 553 YACLIDLLGRAGCFIELVNELEKMSCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELE 601
           YAC++DLLGRAG   +  N ++ M  +PD  +W  LLG  R+HGN EL    A+ +  +E
Sbjct: 550 YACMVDLLGRAGLLEDAHNLMKNMPFEPDAAIWGTLLGASRVHGNTELAETAADKIFAME 609

BLAST of Clc09G17420 vs. TAIR 10
Match: AT1G25360.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 381.3 bits (978), Expect = 1.4e-105
Identity = 192/508 (37.80%), Positives = 301/508 (59.25%), Query Frame = 0

Query: 98  TIVANHLIGMYFECGND----IEARKVFDKMSLRNLYSWNHMLAGYAKLGNVHHARNLFD 157
           T V+N L+ +Y +C +       ARKVFD++  ++  SW  M+ GY K G       L +
Sbjct: 184 TSVSNALVSVYSKCASSPSLLHSARKVFDEILEKDERSWTTMMTGYVKNGYFDLGEELLE 243

Query: 158 RMTEK-DVVSWNTMVLAYAKKGYFNEAIGLHRDFRRLDMGFNEFSFAGVLILCVKLKELQ 217
            M +   +V++N M+  Y  +G++ EA+ + R      +  +EF++  V+  C     LQ
Sbjct: 244 GMDDNMKLVAYNAMISGYVNRGFYQEALEMVRRMVSSGIELDEFTYPSVIRACATAGLLQ 303

Query: 218 LTKQVHGQVLVVGFLSNVVLSSSIVDAYAKCGEMGCARRLFDEMLVKDILAWTTMVSGYA 277
           L KQVH  VL     S     +S+V  Y KCG+   AR +F++M  KD+++W  ++SGY 
Sbjct: 304 LGKQVHAYVLRREDFS-FHFDNSLVSLYYKCGKFDEARAIFEKMPAKDLVSWNALLSGYV 363

Query: 278 KWGDMNSASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMKFRINPDQYTFSS 337
             G +  A  +F +M EKN +SW  +ISG A N  G E L  F+ M +    P  Y FS 
Sbjct: 364 SSGHIGEAKLIFKEMKEKNILSWMIMISGLAENGFGEEGLKLFSCMKREGFEPCDYAFSG 423

Query: 338 CLCACASIAALKHGKQLHAYLIRTNFRCNTIVVSSLIDMYSKCGMLEASCRIFYLMGNKQ 397
            + +CA + A  +G+Q HA L++  F  +    ++LI MY+KCG++E + ++F  M    
Sbjct: 424 AIKSCAVLGAYCNGQQYHAQLLKIGFDSSLSAGNALITMYAKCGVVEEARQVFRTM-PCL 483

Query: 398 DVVLWNTMISALAQHGHGEEAMQMFNDMVESGLKPDRITFIVILSACSHSGLVQEGLRFF 457
           D V WN +I+AL QHGHG EA+ ++ +M++ G++PDRIT + +L+ACSH+GLV +G ++F
Sbjct: 484 DSVSWNALIAALGQHGHGAEAVDVYEEMLKKGIRPDRITLLTVLTACSHAGLVDQGRKYF 543

Query: 458 KAMTYDHGVLPDQEHYACLIDLLGRAGCFIELVNELEKMSCKPDDRVWNALLGVCRIHGN 517
            +M   + + P  +HYA LIDLL R+G F +  + +E +  KP   +W ALL  CR+HGN
Sbjct: 544 DSMETVYRIPPGADHYARLIDLLCRSGKFSDAESVIESLPFKPTAEIWEALLSGCRVHGN 603

Query: 518 IELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERAISWID 577
           +ELG   A+ +  L P+    Y+ L++++A  G+WE V +VR+LM +R V+KE A SWI+
Sbjct: 604 MELGIIAADKLFGLIPEHDGTYMLLSNMHAATGQWEEVARVRKLMRDRGVKKEVACSWIE 663

Query: 578 IGNKVHSFIASDRLHPLKEEIYSLLEQL 601
           +  +VH+F+  D  HP  E +Y  L+ L
Sbjct: 664 METQVHTFLVDDTSHPEAEAVYIYLQDL 689

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_038886822.1	0.0e+00	95.42	pentatricopeptide repeat-containing protein At2g21090 [Benincasa hispida]
XP_008455202.1	0.0e+00	90.59	PREDICTED: pentatricopeptide repeat-containing protein At2g21090 [Cucumis melo] ...
KAG7011402.1	0.0e+00	90.02	Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
XP_022967585.1	0.0e+00	89.69	pentatricopeptide repeat-containing protein At2g21090-like [Cucurbita maxima]
XP_022963954.1	0.0e+00	89.53	pentatricopeptide repeat-containing protein At2g21090 [Cucurbita moschata]

Match Name	E-value	Identity	Description
Q9SKQ4	3.0e-209	58.94	Pentatricopeptide repeat-containing protein At2g21090 OS=Arabidopsis thaliana OX...
Q9SIT7	6.4e-111	34.74	Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX...
O23169	7.9e-109	36.64	Pentatricopeptide repeat-containing protein At4g37170 OS=Arabidopsis thaliana OX...
Q9SY02	5.3e-105	34.91	Pentatricopeptide repeat-containing protein At4g02750 OS=Arabidopsis thaliana OX...
Q9FRI5	2.0e-104	37.80	Pentatricopeptide repeat-containing protein At1g25360 OS=Arabidopsis thaliana OX...

Match Name	E-value	Identity	Description
A0A5D3C6K7	0.0e+00	90.59	Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3BZY1	0.0e+00	90.59	pentatricopeptide repeat-containing protein At2g21090 OS=Cucumis melo OX=3656 GN...
A0A6J1HUW8	0.0e+00	89.69	pentatricopeptide repeat-containing protein At2g21090-like OS=Cucurbita maxima O...
A0A6J1HLP5	0.0e+00	89.53	pentatricopeptide repeat-containing protein At2g21090 OS=Cucurbita moschata OX=3...
A0A0A0K215	0.0e+00	88.94	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G062860 PE=4 SV=1

Match Name	E-value	Identity	Description
AT2G21090.1	2.1e-210	58.94	Pentatricopeptide repeat (PPR-like) superfamily protein
AT2G13600.1	4.6e-112	34.74	Pentatricopeptide repeat (PPR) superfamily protein
AT4G37170.1	5.6e-110	36.64	Pentatricopeptide repeat (PPR) superfamily protein
AT4G02750.1	3.7e-106	34.91	Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G25360.1	1.4e-105	37.80	Pentatricopeptide repeat (PPR) superfamily protein
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 344..458, e-value: 4.4E-28, score: 100.6; coord: 459..583, e-value: 1.7E-11, score: 46.1
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 214..343, e-value: 5.3E-29, score: 102.9
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 30..189, e-value: 5.9E-29, score: 103.4
IPR011990	Tetratricopeptide-like helical domain superfamily	SUPERFAMILY	48452	TPR-like	coord: 108..550
IPR011990	Tetratricopeptide-like helical domain superfamily	SUPERFAMILY	48452	TPR-like	coord: 240..533
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 161..188, e-value: 1.3E-4, score: 21.9; coord: 102..127, e-value: 0.1, score: 12.9; coord: 365..389, e-value: 0.031, score: 14.5; coord: 234..260, e-value: 0.0022, score: 18.1; coord: 533..560, e-value: 1.3, score: 9.4; coord: 130..159, e-value: 1.2E-6, score: 28.3
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 290..338, e-value: 1.1E-9, score: 38.4; coord: 393..440, e-value: 2.8E-15, score: 56.3
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 293..326, e-value: 5.3E-7, score: 27.4; coord: 263..290, e-value: 3.6E-6, score: 24.8; coord: 395..428, e-value: 1.6E-10, score: 38.5; coord: 235..261, e-value: 0.0015, score: 16.5; coord: 130..161, e-value: 1.3E-6, score: 26.2
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 393..427, score: 13.800334
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 291..325, score: 10.862706
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 260..290, score: 9.97484
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 128..162, score: 11.147699
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1..21
None	No IPR available	PANTHER	PTHR47924	PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN	coord: 42..287; coord: 189..584
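
The `coord: start..end` entries in the InterPro annotations can be turned into residue spans with a few lines of code. The sketch below does this for the four PROSITE PS51375 hits listed above, assuming the coordinates are 1-based and inclusive (an assumption about the export format, not something stated on this page). The helper names `parse_coord` and `span_length` are hypothetical.

```python
import re

# PROSITE PS51375 (PPR) hit coordinates copied from the InterPro annotations above.
PROSITE_HITS = [
    "coord: 393..427",
    "coord: 291..325",
    "coord: 260..290",
    "coord: 128..162",
]


def parse_coord(text):
    """Return (start, end) as integers from a 'coord: start..end' string."""
    match = re.search(r"(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"no coordinate pair in {text!r}")
    return int(match.group(1)), int(match.group(2))


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Sort hits by start position so they read N-terminus to C-terminus.
hits = sorted(parse_coord(entry) for entry in PROSITE_HITS)
for start, end in hits:
    print(f"{start}..{end}: {span_length(start, end)} residues")
```

Under the inclusive-coordinate assumption, three of the four hits span 35 residues and one spans 31.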

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Clc09G17420.1	Clc09G17420.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
molecular_function	GO:0005515	protein binding