Cla97C01G007410 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C01G007410
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Pentatricopeptide repeat-containing protein
Location: Cla97Chr01: 7615894 .. 7618433 (-)
RNA-Seq Expression: Cla97C01G007410
Synteny: Cla97C01G007410
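
The Location field gives a chromosome, a start and end coordinate, and a strand. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for this kind of record, but not stated on the page), the string can be parsed and the genomic span computed as follows. The function names `parse_location` and `span_length` are illustrative and not part of any database API.

```python
import re

def parse_location(text):
    """Split a location string like 'Chr: start .. end (strand)' into its parts."""
    match = re.fullmatch(
        r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("Cla97Chr01: 7615894 .. 7618433 (-)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the gene spans 2540 bases on the minus strand of Cla97Chr01.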
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTCAATCGAGACTCGGTCTCTTCCGCCTTTTGCATTTCGGTGCGATCGATTGCGGCTCTCGCCAACTCTCAACCGGTTCAATTCCCAGTGGCGCGTTTGGCCTATGCCCATAAACCCTAGAAACCTTACGACTTGTAATTAGTTTAACATGATCATCCGTTGCTTTAAAGAAGATGAAGCTTCCCATTGAGCTTCTTGCCTTTGCTTCCCTTCTTTCAGCGATGTTACTCTTCTTTCGCACTCTTTTCCACGTTAGTCGCAGAGCTTCTTACCAAGTAATCTCTCTATCTTCTAATTCTTCGCATCCCGATTGCCTTTCTTTCAATGTATTTAATCCCTCATCATCTCTAACATCAATAAATGCCTATTGCATTTCTCGTCATTTTTTCTGGTTCACTAGCTTTCTTCGTATATTTCGGCTCCCTTTTGTTAGTTACTCGGGTACAAATAATTCATTTGAATTTTTAGACATTGGTACCCTTCGTAAAATCATACAACAAGACCTCTGGAACGATCCTAAGATTGTTATTTTATTGGATTCAGCACTAGCGCCCATCTGGGTCTCTAAGGTTTTAGTTGAACTGAAAGAAGATCCGAATTTAGCTCTTAAGTTCTTCAAATGGGCTGGAAGCCGGATTGGCTTCCGTCATACCACCGAGTCTTACTGCATTGTAGCTCACATGCTGTTTCGTGCGAGAATGTATACAAATGCTCATGATATTATTAAAGAAGTGATTGTGAAGAGCCGAATTGATGTGGGTTTTCCAGTTTTTAATATATTTGATATGTTATGGTCCACTAGGAATATTTGTGCGTCAGGAACAGGAGTCTTTGACGTTTTATTTAGTGTTTTCGTAGAATTGGGTTTGCTCGACGAAGCTAACAAATGTTTCTCGAGAATGAGGAAGTTTAGGACTCTTCCGAAAGCACGTTCTTGCAATTTTCTTTTGCACAGATTATCAAAATCAGGTAATGGGCAGTTGGTGAGGAAGTTTTTCAATGACATGATTGGGGCTGGGATTGCACCTTCAGTTTTTACCTACAATGTAATGATAGATTACTTGTGCAAAGAAGGGGATTTGGAAAACGCTAGACGTTTGTTTGTGCAAATGAGGCAGATGGGCTTTACTCCAGATGTTGTCACATATAATTCTTTGATTGATGGCTATGGCAAGGTTGGTTTATTAGAAGAAGCTGTGTATTTATTTAATGAAATGAAGGATGTAGGTTGTGTTCCTGATGTAATTACCTATAATGGGTTAATCAATTGTTTTTGCAAGTTTGAGAAGATGCCTCAAGCTTTTGAGTATCTCTCTAAGATGAAGAACGATGGGTTAAAACCAAATGTTGTAACCTATAGCACATTGATTGATGCTTTTTGCAAGGAGGGAATGATGCAAGGTGCCATCAAACTTTTTGTTGATATGAGAAGGGTTGGTCTTTTACCTAATGAATTCACTTACACTTCTCTGATTGACGCCCATTGTAAGGCAGGTAATTTAACAGAAGCATGGAAGTTGTCCAACGATATGTTGCAAGCAGGAGTTAATTTAAATATAGTCACTTATACAGCTCTAATGGATGGCCTTTGTGAAGATGGAAAAATGATGGAAGCAGAAGAAGTGTTTAGGGCAATGCTGAAAAATGGAATATCTCCCAACCAGCAGGTTTACACTGCATTGGTTCATGGCTATATTAAGGCAGAGAGAATGGAGGATGCAATGGAAATATTGAAGCAAATGACAGAATGTAACATCAAACCAGATTTAATACTCTATGGCACCATTATTTGGGGTCTCTGTAGTCAAAGCAAACTTGAAGAAACTAAACTTATTATTAAAGAAATGGAAAGTCAGGGTATTAGTGCAAATCCTGTTATATACACGACAATTATAGATGCTTATTTTAAGGCTGGAAAAAGCTCTGATGCAATGAATCTTCTTCAGGAAATGCAGGATGCTGGTGTTGAGGCTACTGTTGTAACCTACTGTGTACTA
ATTGATGGTTTGTGCAAAGCAGGTATGGTTGAACTTGCAGTTGATTATTTTGGTAGAATGTCTGATCTTGGTTTACAACCTAATGTTGCAGTTTATACGGCGCTAATTGATGGTCTTTGTAAAACTAATTGTGTTGAATCTGCCAAAAAGTTGTTTGATGAAATGCAATGTAGGGGTATGACCCCGGATATAACTGCTTTCACTGCTCTAATTGATGGCAACTTGAAGCTTGGAAATCTTCAGGAAGCTTCGGATTTGATTAGCAGAATGACAGAATCAGCTACCGAGTTTGATTTGCATGCTTATACTTCCTTGGTTTCGGGATTTTCTGAATATGGCGAGCTGCACCAAGCAAGGAAGTTTTTTAGTGAGATGATTGAGAAGGGCATACTTCCTGAGGAGATCTTATGTATATCTCTACTGAGAGAGTATTATAAGCTTGGACAGTTGGATGAAGCCATTGAATTGAAGAATGAAATGCAAAGGAGGGGTTTAATTAGTGAAAATTGCAGCCTTGCAGTTCCCTGTCTAAAAACTTGA

mRNA sequence

ATGTTCAATCGAGACTCGAAGATGAAGCTTCCCATTGAGCTTCTTGCCTTTGCTTCCCTTCTTTCAGCGATGTTACTCTTCTTTCGCACTCTTTTCCACGTTAGTCGCAGAGCTTCTTACCAAGTAATCTCTCTATCTTCTAATTCTTCGCATCCCGATTGCCTTTCTTTCAATGTATTTAATCCCTCATCATCTCTAACATCAATAAATGCCTATTGCATTTCTCGTCATTTTTTCTGGTTCACTAGCTTTCTTCGTATATTTCGGCTCCCTTTTGTTAGTTACTCGGGTACAAATAATTCATTTGAATTTTTAGACATTGGTACCCTTCGTAAAATCATACAACAAGACCTCTGGAACGATCCTAAGATTGTTATTTTATTGGATTCAGCACTAGCGCCCATCTGGGTCTCTAAGGTTTTAGTTGAACTGAAAGAAGATCCGAATTTAGCTCTTAAGTTCTTCAAATGGGCTGGAAGCCGGATTGGCTTCCGTCATACCACCGAGTCTTACTGCATTGTAGCTCACATGCTGTTTCGTGCGAGAATGTATACAAATGCTCATGATATTATTAAAGAAGTGATTGTGAAGAGCCGAATTGATGTGGGTTTTCCAGTTTTTAATATATTTGATATGTTATGGTCCACTAGGAATATTTGTGCGTCAGGAACAGGAGTCTTTGACGTTTTATTTAGTGTTTTCGTAGAATTGGGTTTGCTCGACGAAGCTAACAAATGTTTCTCGAGAATGAGGAAGTTTAGGACTCTTCCGAAAGCACGTTCTTGCAATTTTCTTTTGCACAGATTATCAAAATCAGGTAATGGGCAGTTGGTGAGGAAGTTTTTCAATGACATGATTGGGGCTGGGATTGCACCTTCAGTTTTTACCTACAATGTAATGATAGATTACTTGTGCAAAGAAGGGGATTTGGAAAACGCTAGACGTTTGTTTGTGCAAATGAGGCAGATGGGCTTTACTCCAGATGTTGTCACATATAATTCTTTGATTGATGGCTATGGCAAGGTTGGTTTATTAGAAGAAGCTGTGTATTTATTTAATGAAATGAAGGATGTAGGTTGTGTTCCTGATGTAATTACCTATAATGGGTTAATCAATTGTTTTTGCAAGTTTGAGAAGATGCCTCAAGCTTTTGAGTATCTCTCTAAGATGAAGAACGATGGGTTAAAACCAAATGTTGTAACCTATAGCACATTGATTGATGCTTTTTGCAAGGAGGGAATGATGCAAGGTGCCATCAAACTTTTTGTTGATATGAGAAGGGTTGGTCTTTTACCTAATGAATTCACTTACACTTCTCTGATTGACGCCCATTGTAAGGCAGGTAATTTAACAGAAGCATGGAAGTTGTCCAACGATATGTTGCAAGCAGGAGTTAATTTAAATATAGTCACTTATACAGCTCTAATGGATGGCCTTTGTGAAGATGGAAAAATGATGGAAGCAGAAGAAGTGTTTAGGGCAATGCTGAAAAATGGAATATCTCCCAACCAGCAGGTTTACACTGCATTGGTTCATGGCTATATTAAGGCAGAGAGAATGGAGGATGCAATGGAAATATTGAAGCAAATGACAGAATGTAACATCAAACCAGATTTAATACTCTATGGCACCATTATTTGGGGTCTCTGTAGTCAAAGCAAACTTGAAGAAACTAAACTTATTATTAAAGAAATGGAAAGTCAGGGTATTAGTGCAAATCCTGTTATATACACGACAATTATAGATGCTTATTTTAAGGCTGGAAAAAGCTCTGATGCAATGAATCTTCTTCAGGAAATGCAGGATGCTGGTGTTGAGGCTACTGTTGTAACCTACTGTGTACTAATTGATGGTTTGTGCAAAGCAGGTATGGTTGAACTTGCAGTTGATTATTTTGGTAGAATGTCTGATCTTGGTTTACAACCTAATGTTGCAGTTTATACGGCGCTAATTGATGGTCTTTGTAAAACTAATTGTGTTGAATCTGCCAAAAAGTTGTT
TGATGAAATGCAATGTAGGGGTATGACCCCGGATATAACTGCTTTCACTGCTCTAATTGATGGCAACTTGAAGCTTGGAAATCTTCAGGAAGCTTCGGATTTGATTAGCAGAATGACAGAATCAGCTACCGAGTTTGATTTGCATGCTTATACTTCCTTGGTTTCGGGATTTTCTGAATATGGCGAGCTGCACCAAGCAAGGAAGTTTTTTAGTGAGATGATTGAGAAGGGCATACTTCCTGAGGAGATCTTATGTATATCTCTACTGAGAGAGTATTATAAGCTTGGACAGTTGGATGAAGCCATTGAATTGAAGAATGAAATGCAAAGGAGGGGTTTAATTAGTGAAAATTGCAGCCTTGCAGTTCCCTGTCTAAAAACTTGA

Coding sequence (CDS)

ATGTTCAATCGAGACTCGAAGATGAAGCTTCCCATTGAGCTTCTTGCCTTTGCTTCCCTTCTTTCAGCGATGTTACTCTTCTTTCGCACTCTTTTCCACGTTAGTCGCAGAGCTTCTTACCAAGTAATCTCTCTATCTTCTAATTCTTCGCATCCCGATTGCCTTTCTTTCAATGTATTTAATCCCTCATCATCTCTAACATCAATAAATGCCTATTGCATTTCTCGTCATTTTTTCTGGTTCACTAGCTTTCTTCGTATATTTCGGCTCCCTTTTGTTAGTTACTCGGGTACAAATAATTCATTTGAATTTTTAGACATTGGTACCCTTCGTAAAATCATACAACAAGACCTCTGGAACGATCCTAAGATTGTTATTTTATTGGATTCAGCACTAGCGCCCATCTGGGTCTCTAAGGTTTTAGTTGAACTGAAAGAAGATCCGAATTTAGCTCTTAAGTTCTTCAAATGGGCTGGAAGCCGGATTGGCTTCCGTCATACCACCGAGTCTTACTGCATTGTAGCTCACATGCTGTTTCGTGCGAGAATGTATACAAATGCTCATGATATTATTAAAGAAGTGATTGTGAAGAGCCGAATTGATGTGGGTTTTCCAGTTTTTAATATATTTGATATGTTATGGTCCACTAGGAATATTTGTGCGTCAGGAACAGGAGTCTTTGACGTTTTATTTAGTGTTTTCGTAGAATTGGGTTTGCTCGACGAAGCTAACAAATGTTTCTCGAGAATGAGGAAGTTTAGGACTCTTCCGAAAGCACGTTCTTGCAATTTTCTTTTGCACAGATTATCAAAATCAGGTAATGGGCAGTTGGTGAGGAAGTTTTTCAATGACATGATTGGGGCTGGGATTGCACCTTCAGTTTTTACCTACAATGTAATGATAGATTACTTGTGCAAAGAAGGGGATTTGGAAAACGCTAGACGTTTGTTTGTGCAAATGAGGCAGATGGGCTTTACTCCAGATGTTGTCACATATAATTCTTTGATTGATGGCTATGGCAAGGTTGGTTTATTAGAAGAAGCTGTGTATTTATTTAATGAAATGAAGGATGTAGGTTGTGTTCCTGATGTAATTACCTATAATGGGTTAATCAATTGTTTTTGCAAGTTTGAGAAGATGCCTCAAGCTTTTGAGTATCTCTCTAAGATGAAGAACGATGGGTTAAAACCAAATGTTGTAACCTATAGCACATTGATTGATGCTTTTTGCAAGGAGGGAATGATGCAAGGTGCCATCAAACTTTTTGTTGATATGAGAAGGGTTGGTCTTTTACCTAATGAATTCACTTACACTTCTCTGATTGACGCCCATTGTAAGGCAGGTAATTTAACAGAAGCATGGAAGTTGTCCAACGATATGTTGCAAGCAGGAGTTAATTTAAATATAGTCACTTATACAGCTCTAATGGATGGCCTTTGTGAAGATGGAAAAATGATGGAAGCAGAAGAAGTGTTTAGGGCAATGCTGAAAAATGGAATATCTCCCAACCAGCAGGTTTACACTGCATTGGTTCATGGCTATATTAAGGCAGAGAGAATGGAGGATGCAATGGAAATATTGAAGCAAATGACAGAATGTAACATCAAACCAGATTTAATACTCTATGGCACCATTATTTGGGGTCTCTGTAGTCAAAGCAAACTTGAAGAAACTAAACTTATTATTAAAGAAATGGAAAGTCAGGGTATTAGTGCAAATCCTGTTATATACACGACAATTATAGATGCTTATTTTAAGGCTGGAAAAAGCTCTGATGCAATGAATCTTCTTCAGGAAATGCAGGATGCTGGTGTTGAGGCTACTGTTGTAACCTACTGTGTACTAATTGATGGTTTGTGCAAAGCAGGTATGGTTGAACTTGCAGTTGATTATTTTGGTAGAATGTCTGATCTTGGTTTACAACCTAATGTTGCAGTTTATACGGCGCTAATTGATGGTCTTTGTAAAACTAATTGTGTTGAATCTGCCAAAAAGTTGTT
TGATGAAATGCAATGTAGGGGTATGACCCCGGATATAACTGCTTTCACTGCTCTAATTGATGGCAACTTGAAGCTTGGAAATCTTCAGGAAGCTTCGGATTTGATTAGCAGAATGACAGAATCAGCTACCGAGTTTGATTTGCATGCTTATACTTCCTTGGTTTCGGGATTTTCTGAATATGGCGAGCTGCACCAAGCAAGGAAGTTTTTTAGTGAGATGATTGAGAAGGGCATACTTCCTGAGGAGATCTTATGTATATCTCTACTGAGAGAGTATTATAAGCTTGGACAGTTGGATGAAGCCATTGAATTGAAGAATGAAATGCAAAGGAGGGGTTTAATTAGTGAAAATTGCAGCCTTGCAGTTCCCTGTCTAAAAACTTGA

Protein sequence

MFNRDSKMKLPIELLAFASLLSAMLLFFRTLFHVSRRASYQVISLSSNSSHPDCLSFNVFNPSSSLTSINAYCISRHFFWFTSFLRIFRLPFVSYSGTNNSFEFLDIGTLRKIIQQDLWNDPKIVILLDSALAPIWVSKVLVELKEDPNLALKFFKWAGSRIGFRHTTESYCIVAHMLFRARMYTNAHDIIKEVIVKSRIDVGFPVFNIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANKCFSRMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQMGFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVITYNGLINCFCKFEKMPQAFEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDAHCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKMMEAEEVFRAMLKNGISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQSKLEETKLIIKEMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGVEATVVTYCVLIDGLCKAGMVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMTPDITAFTALIDGNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFSEYGELHQARKFFSEMIEKGILPEEILCISLLREYYKLGQLDEAIELKNEMQRRGLISENCSLAVPCLKT
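
The protein sequence above corresponds to the coding sequence read codon by codon. The sketch below shows that relationship, assuming the standard genetic code (NCBI translation table 1) and a CDS that starts in frame 0. The helper `translate` is illustrative, and only the first ten and last five codons of the CDS are used rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and last 15 nt of the CDS listed above.
print(translate("ATGTTCAATCGAGACTCGAAGATGAAGCTT"))
print(translate("TGTCTAAAAACTTGA"))
```

The two fragments translate to the first ten residues (MFNRDSKMKL) and the last four residues (CLKT) of the protein sequence.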
Homology
BLAST of Cla97C01G007410 vs. NCBI nr
Match: XP_038906984.1 (putative pentatricopeptide repeat-containing protein At2g02150 [Benincasa hispida])

HSP 1 Score: 1413.7 bits (3658), Expect = 0.0e+00
Identity = 701/771 (90.92%), Positives = 735/771 (95.33%), Query Frame = 0
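
The percentages in an HSP summary line are the match counts divided by the alignment length. A minimal sketch of that arithmetic, using the counts reported for this hit (the function name `percent` is illustrative):

```python
def percent(matches, alignment_length):
    """Share of aligned positions, as a percentage rounded to two decimals."""
    return round(100.0 * matches / alignment_length, 2)

identity = percent(701, 771)   # identical residues
positives = percent(735, 771)  # identical or conservatively substituted residues
print(identity, positives)
```

Both values agree with the figures reported in the summary line for this hit.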

Query: 24  MLLFFRTLFHVSRRASYQVISLSSNSSHPDCLSFNVFNPSSSLTSINAYCISRHFFWFTS 83
           M+LFFRTLFH+SRRASY+VISLSSNSSHPDCLSFNVFN  SSLTSINA CISR FFWFTS
Sbjct: 1   MVLFFRTLFHISRRASYRVISLSSNSSHPDCLSFNVFNSLSSLTSINACCISRPFFWFTS 60

Query: 84  FLRIFRLPFVSYSGTNNSFEFLDIGTLRKIIQQDLWNDPKIVILLDSALAPIWVSKVLVE 143
           FL IFRLPFVS S   NSFEFLDIG+LR II+QDLWNDPKIVIL DSALAPIWVSKVLVE
Sbjct: 61  FLCIFRLPFVSCSNAKNSFEFLDIGSLRIIIRQDLWNDPKIVILFDSALAPIWVSKVLVE 120

Query: 144 LKEDPNLALKFFKWAGSRIGFRHTTESYCIVAHMLFRARMYTNAHDIIKEVIVKSRIDVG 203
           LKEDP LALKFFKWAGS IGF HTTESYCIV HMLFRARMYTNAHDI+KE+IVKSRIDVG
Sbjct: 121 LKEDPKLALKFFKWAGSHIGFHHTTESYCIVVHMLFRARMYTNAHDIVKEMIVKSRIDVG 180

Query: 204 FPVFNIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANKCFSRMRKFRTLPKARSCN 263
           FPV NIFD+LWSTRNIC SG GVFDVLFSV V+LG+L+EAN+CFSRMR FRT PKARSCN
Sbjct: 181 FPVCNIFDVLWSTRNICMSGPGVFDVLFSVLVDLGMLEEANECFSRMRNFRTFPKARSCN 240

Query: 264 FLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQM 323
           FLLHRLSKSGNGQLVRKFF DMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMR M
Sbjct: 241 FLLHRLSKSGNGQLVRKFFKDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRHM 300

Query: 324 GFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVITYNGLINCFCKFEKMPQA 383
           GF+PDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVITYNGLINCFCKFEKMP+A
Sbjct: 301 GFSPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVITYNGLINCFCKFEKMPRA 360

Query: 384 FEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDA 443
           F YLS+MKN+GLKPNVVTYSTLIDAFCKEGMMQGAIKLF DMRRVGLLPNEFTYTSLIDA
Sbjct: 361 FHYLSEMKNNGLKPNVVTYSTLIDAFCKEGMMQGAIKLFFDMRRVGLLPNEFTYTSLIDA 420

Query: 444 HCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKMMEAEEVFRAMLKNGISPN 503
           +CKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCE G+MMEAEEVFR+MLK+GISPN
Sbjct: 421 NCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEYGRMMEAEEVFRSMLKDGISPN 480

Query: 504 QQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQSKLEETKLIIK 563
           QQVYTALVHGYIKAERMEDA+EILKQMTE NIKPDLILYGTIIWGLCSQSKLEETKLIIK
Sbjct: 481 QQVYTALVHGYIKAERMEDAIEILKQMTEYNIKPDLILYGTIIWGLCSQSKLEETKLIIK 540

Query: 564 EMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGVEATVVTYCVLIDGLCKAG 623
           EM+S+GISANPVIYTTIIDAYFKAGKSSDA+NLLQEMQDAGVEATVVTYCVLIDGLCK G
Sbjct: 541 EMKSRGISANPVIYTTIIDAYFKAGKSSDAINLLQEMQDAGVEATVVTYCVLIDGLCKTG 600

Query: 624 MVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMTPDITAFT 683
           +VELAVDYFGRMS+LGLQPNVAVYTALIDGLCKTNC+ESAKKLFDEMQCRGMTPDITAFT
Sbjct: 601 LVELAVDYFGRMSNLGLQPNVAVYTALIDGLCKTNCIESAKKLFDEMQCRGMTPDITAFT 660

Query: 684 ALIDGNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFSEYGELHQARKFFSEMIEK 743
           AL+DGNLKLGNLQEA DLISRMTE ATEFDLHAYTSLVSGFS+ GELHQARK+F+EMIEK
Sbjct: 661 ALVDGNLKLGNLQEALDLISRMTELATEFDLHAYTSLVSGFSQCGELHQARKYFNEMIEK 720

Query: 744 GILPEEILCISLLREYYKLGQLDEAIELKNEMQRRGLISENCSLAVPCLKT 795
           GILPEEILCI LLREYYKLG+LDEAIE+KNEMQRRGLI+E CS AV  LKT
Sbjct: 721 GILPEEILCICLLREYYKLGKLDEAIEMKNEMQRRGLITEKCSHAVTSLKT 771

BLAST of Cla97C01G007410 vs. NCBI nr
Match: XP_022938692.1 (putative pentatricopeptide repeat-containing protein At2g02150, partial [Cucurbita moschata])

HSP 1 Score: 1396.3 bits (3613), Expect = 0.0e+00
Identity = 697/782 (89.13%), Positives = 736/782 (94.12%), Query Frame = 0

Query: 13  ELLAFASLLSAMLLFFRTLFHVSRRASYQVISLSSNSSHPDCLSFNVFNPSSSLTSINAY 72
           E   FASLLSAMLLFFR LF VSRRASY+VISLSSNSSHP CLSFN FN SSSLTSIN  
Sbjct: 1   EFFPFASLLSAMLLFFRGLFQVSRRASYRVISLSSNSSHPGCLSFNAFNASSSLTSINGC 60

Query: 73  CISRHFFWFTSFLRIFRLPFVSYSGTNNSFEFLDIGTLRKIIQQDLWNDPKIVILLDSAL 132
            IS    WF SFL IFRLPFVSYS TN+SFE LDIG+LRKIIQQDLWNDPKIVIL DSAL
Sbjct: 61  YIS--CLWFASFLCIFRLPFVSYSNTNSSFESLDIGSLRKIIQQDLWNDPKIVILFDSAL 120

Query: 133 APIWVSKVLVELKEDPNLALKFFKWAGSRIGFRHTTESYCIVAHMLFRARMYTNAHDIIK 192
           APIWVSK+LVELKEDP LALKFFKWAGS+IGF HTTESYCI+AHMLF ARMYTNAHDIIK
Sbjct: 121 APIWVSKILVELKEDPKLALKFFKWAGSQIGFCHTTESYCIIAHMLFCARMYTNAHDIIK 180

Query: 193 EVIVKSRIDVGFPVFNIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANKCFSRMRK 252
           EVI+K RID+ FPV NIFDMLWSTRN+C SGTGVFD+LFSV VELGLL+EAN+CFSRMRK
Sbjct: 181 EVILKCRIDMIFPVCNIFDMLWSTRNVCVSGTGVFDILFSVLVELGLLEEANECFSRMRK 240

Query: 253 FRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLCKEGDLEN 312
           FRTLPKARSCNFLLHRLSKSGNGQLV+ FFNDMIGAGIAPSVFTYNVMIDYLCKEGDLE+
Sbjct: 241 FRTLPKARSCNFLLHRLSKSGNGQLVKNFFNDMIGAGIAPSVFTYNVMIDYLCKEGDLES 300

Query: 313 ARRLFVQMRQMGFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVITYNGLIN 372
           ARRLFVQMRQMGF+PDVVTYNSLIDGYGKVGLLEE+VYLF EMKDVGCVPDVITYN LIN
Sbjct: 301 ARRLFVQMRQMGFSPDVVTYNSLIDGYGKVGLLEESVYLFKEMKDVGCVPDVITYNALIN 360

Query: 373 CFCKFEKMPQAFEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLP 432
           CFCKFEKMP+AFEYLS+MKN GLKPNVVTYSTLIDAFCKEGMMQ AIKLFVDMRRVGLLP
Sbjct: 361 CFCKFEKMPRAFEYLSEMKNSGLKPNVVTYSTLIDAFCKEGMMQYAIKLFVDMRRVGLLP 420

Query: 433 NEFTYTSLIDAHCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKMMEAEEVF 492
           NEFTYTSLIDA+CKAGNLTEAWKLSNDMLQAGVNLN+V+YTALMDGLCEDG+MMEAEEVF
Sbjct: 421 NEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNVVSYTALMDGLCEDGRMMEAEEVF 480

Query: 493 RAMLKNGISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQ 552
           +AMLK+G+SPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQ
Sbjct: 481 KAMLKDGLSPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQ 540

Query: 553 SKLEETKLIIKEMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGVEATVVTY 612
           +KLEETKLIIKEM+SQGISANPVIYTTI+DAYFKAGKSSDA+NLL +MQD GVEATVVTY
Sbjct: 541 NKLEETKLIIKEMKSQGISANPVIYTTIMDAYFKAGKSSDAINLLHKMQDMGVEATVVTY 600

Query: 613 CVLIDGLCKAGMVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQC 672
           CVLIDGLCK GMVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNC+ESAKKLFDEMQ 
Sbjct: 601 CVLIDGLCKTGMVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCIESAKKLFDEMQY 660

Query: 673 RGMTPDITAFTALIDGNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFSEYGELHQ 732
           RGMTPD TAFTALIDGNLKLGNLQEA DLISRMT+ A EFDLHAYTS+VSGFS+ G+LHQ
Sbjct: 661 RGMTPDKTAFTALIDGNLKLGNLQEALDLISRMTDLAIEFDLHAYTSMVSGFSQCGDLHQ 720

Query: 733 ARKFFSEMIEKGILPEEILCISLLREYYKLGQLDEAIELKNEMQRRGLISENCSLAVPCL 792
           ARKFF+EMIEKGILPEEILC  LLREYYKLGQLDEAIELKNEM+RRGLI+ENCSL VP L
Sbjct: 721 ARKFFNEMIEKGILPEEILCTCLLREYYKLGQLDEAIELKNEMRRRGLITENCSLEVPSL 780

Query: 793 KT 795
           +T
Sbjct: 781 RT 780

BLAST of Cla97C01G007410 vs. NCBI nr
Match: KAG6601913.1 (putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1393.3 bits (3605), Expect = 0.0e+00
Identity = 687/787 (87.29%), Positives = 739/787 (93.90%), Query Frame = 0

Query: 8   MKLPIELLAFASLLSAMLLFFRTLFHVSRRASYQVISLSSNSSHPDCLSFNVFNPSSSLT 67
           MKL +E+LAFASL SAMLLFFR+LFHVSRRASY+VISLS NSSHP CLSFNVFN  SSLT
Sbjct: 1   MKLSVEVLAFASLFSAMLLFFRSLFHVSRRASYRVISLSLNSSHPGCLSFNVFNGPSSLT 60

Query: 68  SINAYCISRHFFWFTSFLRIFRLPFVSYSGTNNSFEFLDIGTLRKIIQQDLWNDPKIVIL 127
           S+N Y IS  FFWFTSFL IFRLPFVSYS TN+SFE LDIG+LRKIIQQDLWNDPKIV+L
Sbjct: 61  SMNGYYISCPFFWFTSFLCIFRLPFVSYSITNDSFELLDIGSLRKIIQQDLWNDPKIVVL 120

Query: 128 LDSALAPIWVSKVLVELKEDPNLALKFFKWAGSRIGFRHTTESYCIVAHMLFRARMYTNA 187
            DSALAPIWVSK+LVELKEDP LALKFFKWAG+ IGFRHTTESYCI+ HMLFRARMYTNA
Sbjct: 121 FDSALAPIWVSKILVELKEDPKLALKFFKWAGTHIGFRHTTESYCIIVHMLFRARMYTNA 180

Query: 188 HDIIKEVIVKSRIDVGFPVFNIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANKCF 247
           HDI+KE+++KSR D+  PV N+FD+LWSTRN C SGTGVFDVLFSV VELGLL+EAN+CF
Sbjct: 181 HDIMKEMVLKSRTDLILPVCNVFDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECF 240

Query: 248 SRMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLCKE 307
           S+MRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFF+DMIGAGIAPSVFTYNVMID+LCKE
Sbjct: 241 SKMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFHDMIGAGIAPSVFTYNVMIDHLCKE 300

Query: 308 GDLENARRLFVQMRQMGFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVITY 367
           GD+ENAR LFVQMR MGF+PDVVTYNSLIDGYGKVGLL+E+VYLFNEMKDVGCVPDVITY
Sbjct: 301 GDVENARSLFVQMRTMGFSPDVVTYNSLIDGYGKVGLLKESVYLFNEMKDVGCVPDVITY 360

Query: 368 NGLINCFCKFEKMPQAFEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRR 427
           N LINCFCKFEKMPQAFEYLS+MKN+GLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRR
Sbjct: 361 NALINCFCKFEKMPQAFEYLSEMKNNGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRR 420

Query: 428 VGLLPNEFTYTSLIDAHCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKMME 487
           VGLLPNEFTYTSLIDA+CKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDG+MME
Sbjct: 421 VGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMME 480

Query: 488 AEEVFRAMLKNGISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIW 547
           AEEVFRAMLK+GISPNQQVYTALVHGYIKAE+MEDA+EILKQMTEC IKPDL+LYGTIIW
Sbjct: 481 AEEVFRAMLKDGISPNQQVYTALVHGYIKAEKMEDALEILKQMTECGIKPDLVLYGTIIW 540

Query: 548 GLCSQSKLEETKLIIKEMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGVEA 607
           GLC+Q+KLEETKLIIKEM+ +GI ANPVIYTTIIDAYFKAGKSSDA++LLQEMQ+ GVEA
Sbjct: 541 GLCNQNKLEETKLIIKEMKKRGIRANPVIYTTIIDAYFKAGKSSDALDLLQEMQEVGVEA 600

Query: 608 TVVTYCVLIDGLCKAGMVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCVESAKKLF 667
           TVVTYCVLIDGLCK GMVE+AVDYFGRMSD G+QPNVAVYTALIDGLCK NC+ESAKKLF
Sbjct: 601 TVVTYCVLIDGLCKTGMVEVAVDYFGRMSDFGVQPNVAVYTALIDGLCKINCIESAKKLF 660

Query: 668 DEMQCRGMTPDITAFTALIDGNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFSEY 727
           DEMQCRGMTPD TAFTALIDGNLKLGNLQEA +LIS+MTE   EFDLHAYT+LVSGFS+ 
Sbjct: 661 DEMQCRGMTPDKTAFTALIDGNLKLGNLQEALNLISKMTELVIEFDLHAYTTLVSGFSQC 720

Query: 728 GELHQARKFFSEMIEKGILPEEILCISLLREYYKLGQLDEAIELKNEMQRRGLISENCSL 787
           GELHQARKFF+EMIEKGILP+EILCI LLREY KLG LDEAIELKNEMQRRGLI+E CS 
Sbjct: 721 GELHQARKFFNEMIEKGILPDEILCICLLREYNKLGHLDEAIELKNEMQRRGLITEKCSH 780

Query: 788 AVPCLKT 795
            VP  KT
Sbjct: 781 EVPSPKT 787

BLAST of Cla97C01G007410 vs. NCBI nr
Match: XP_022959692.1 (putative pentatricopeptide repeat-containing protein At2g02150 isoform X1 [Cucurbita moschata] >XP_022959700.1 putative pentatricopeptide repeat-containing protein At2g02150 isoform X1 [Cucurbita moschata])

HSP 1 Score: 1389.8 bits (3596), Expect = 0.0e+00
Identity = 685/787 (87.04%), Positives = 739/787 (93.90%), Query Frame = 0

Query: 8   MKLPIELLAFASLLSAMLLFFRTLFHVSRRASYQVISLSSNSSHPDCLSFNVFNPSSSLT 67
           MKL +E+LAFASL SAMLLFFR+LFHVSRRASY+VISLS NSSHP CLSFNVFN  SSLT
Sbjct: 1   MKLSVEVLAFASLFSAMLLFFRSLFHVSRRASYRVISLSLNSSHPGCLSFNVFNGPSSLT 60

Query: 68  SINAYCISRHFFWFTSFLRIFRLPFVSYSGTNNSFEFLDIGTLRKIIQQDLWNDPKIVIL 127
           S+N Y IS  FFWFTSFL IFRLPFVSYS TN+SFE LDIG+LRKIIQQDLWNDPKIV+L
Sbjct: 61  SMNGYYISCPFFWFTSFLCIFRLPFVSYSITNDSFELLDIGSLRKIIQQDLWNDPKIVVL 120

Query: 128 LDSALAPIWVSKVLVELKEDPNLALKFFKWAGSRIGFRHTTESYCIVAHMLFRARMYTNA 187
            DSALAPIWVSK+LVELKEDP LALKFFKWAG+ IGFRHTTESYCI+ HMLFRARMYTNA
Sbjct: 121 FDSALAPIWVSKILVELKEDPKLALKFFKWAGTHIGFRHTTESYCIIVHMLFRARMYTNA 180

Query: 188 HDIIKEVIVKSRIDVGFPVFNIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANKCF 247
           HDI+KE+++KSR D+  PV N+FD+LWSTRN C SGTGVFDVLFSV VELGLL+EAN+CF
Sbjct: 181 HDIMKEMVLKSRTDLILPVCNVFDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECF 240

Query: 248 SRMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLCKE 307
           S+MRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFF+DMIGAGIAPSVFTYNVMID+LCKE
Sbjct: 241 SKMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFHDMIGAGIAPSVFTYNVMIDHLCKE 300

Query: 308 GDLENARRLFVQMRQMGFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVITY 367
           GDLENAR LFVQMR MGF+PDVVTYNSLIDGYGKVGLL+E+VYLFNEMKDVGCVPDVITY
Sbjct: 301 GDLENARSLFVQMRTMGFSPDVVTYNSLIDGYGKVGLLKESVYLFNEMKDVGCVPDVITY 360

Query: 368 NGLINCFCKFEKMPQAFEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRR 427
           N LINCFCKFEKMPQAFEYLS+MKN GLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRR
Sbjct: 361 NALINCFCKFEKMPQAFEYLSEMKNIGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRR 420

Query: 428 VGLLPNEFTYTSLIDAHCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKMME 487
           VGLLPNEFTYTSLIDA+CKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDG+MME
Sbjct: 421 VGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMME 480

Query: 488 AEEVFRAMLKNGISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIW 547
           AEEVFRAMLK+GISPNQQVYTALVHGYIKAE+MEDA+EILKQMTEC IKPDL+LYGTIIW
Sbjct: 481 AEEVFRAMLKDGISPNQQVYTALVHGYIKAEKMEDALEILKQMTECGIKPDLVLYGTIIW 540

Query: 548 GLCSQSKLEETKLIIKEMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGVEA 607
           GLC+Q+KLEETKLIIKEM+S+GI ANPVIYTTIIDAYFKAGKSSDA++LLQEMQ+ GVEA
Sbjct: 541 GLCNQNKLEETKLIIKEMKSRGIRANPVIYTTIIDAYFKAGKSSDALDLLQEMQEVGVEA 600

Query: 608 TVVTYCVLIDGLCKAGMVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCVESAKKLF 667
           TVVTYCVLIDGLCK GMVE+AVDYFGRMSD G+QPNVAVYTALIDGLCK NC+ESA+KLF
Sbjct: 601 TVVTYCVLIDGLCKTGMVEVAVDYFGRMSDFGVQPNVAVYTALIDGLCKINCIESAEKLF 660

Query: 668 DEMQCRGMTPDITAFTALIDGNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFSEY 727
           +EMQCRGMTPD TAFTALIDGNLKLGNLQE  +LIS+MTE   EFDLHAYT+LVSGFS+ 
Sbjct: 661 EEMQCRGMTPDKTAFTALIDGNLKLGNLQETLNLISKMTELVIEFDLHAYTTLVSGFSQC 720

Query: 728 GELHQARKFFSEMIEKGILPEEILCISLLREYYKLGQLDEAIELKNEMQRRGLISENCSL 787
           GELHQARKFF+EMIEKGILP+EILCI LL+EY KLG LDEAI+LKNEMQRRGLI+E CS 
Sbjct: 721 GELHQARKFFNEMIEKGILPDEILCICLLKEYNKLGHLDEAIKLKNEMQRRGLITEKCSH 780

Query: 788 AVPCLKT 795
            VP LKT
Sbjct: 781 EVPSLKT 787

BLAST of Cla97C01G007410 vs. NCBI nr
Match: XP_023534824.1 (putative pentatricopeptide repeat-containing protein At2g02150 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023534833.1 putative pentatricopeptide repeat-containing protein At2g02150 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1387.1 bits (3589), Expect = 0.0e+00
Identity = 683/787 (86.79%), Positives = 739/787 (93.90%), Query Frame = 0

Query: 8   MKLPIELLAFASLLSAMLLFFRTLFHVSRRASYQVISLSSNSSHPDCLSFNVFNPSSSLT 67
           MKL +E+LAFASL SAML FFR+LFHVSRRASY+VISLS NSSHP CLSF+VFN  SSLT
Sbjct: 1   MKLSVEVLAFASLFSAMLPFFRSLFHVSRRASYRVISLSLNSSHPGCLSFHVFNGPSSLT 60

Query: 68  SINAYCISRHFFWFTSFLRIFRLPFVSYSGTNNSFEFLDIGTLRKIIQQDLWNDPKIVIL 127
           SIN Y IS  FFWFTSFL IFRLPFVSYS TN+SFE LDI +LRKIIQQDLWNDPKIV+L
Sbjct: 61  SINGYHISCPFFWFTSFLCIFRLPFVSYSITNDSFELLDIDSLRKIIQQDLWNDPKIVVL 120

Query: 128 LDSALAPIWVSKVLVELKEDPNLALKFFKWAGSRIGFRHTTESYCIVAHMLFRARMYTNA 187
            DS+LAPIWVSK+LVELKEDPNLALKFFKWAG+ IGFRHTTESYCI+ HMLFRARMYTNA
Sbjct: 121 FDSSLAPIWVSKILVELKEDPNLALKFFKWAGTHIGFRHTTESYCIIVHMLFRARMYTNA 180

Query: 188 HDIIKEVIVKSRIDVGFPVFNIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANKCF 247
           HDI+KE+++KSR D+  PV N+FD+LWSTRN C SGTGVFDVLFSV VELGLL+EAN+CF
Sbjct: 181 HDIMKEMVLKSRTDLILPVCNVFDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECF 240

Query: 248 SRMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLCKE 307
           S+MRKFRTLPKARSCNFLLHRLSK+GNGQLVRKFF+DM+GAGIAPSVFTYNVMID+LCKE
Sbjct: 241 SKMRKFRTLPKARSCNFLLHRLSKAGNGQLVRKFFHDMVGAGIAPSVFTYNVMIDHLCKE 300

Query: 308 GDLENARRLFVQMRQMGFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVITY 367
           GDLENAR LFVQMR MGF+PDVVTYNSLIDGYGKVGLL+E+VYLFNEMKDVGCVPDVITY
Sbjct: 301 GDLENARSLFVQMRTMGFSPDVVTYNSLIDGYGKVGLLKESVYLFNEMKDVGCVPDVITY 360

Query: 368 NGLINCFCKFEKMPQAFEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRR 427
           N LINCFCKFEKMPQAFEYLS+MKN+GLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRR
Sbjct: 361 NALINCFCKFEKMPQAFEYLSEMKNNGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRR 420

Query: 428 VGLLPNEFTYTSLIDAHCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKMME 487
           VGLLPNEFTYTSLIDA+CKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDG+MME
Sbjct: 421 VGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMME 480

Query: 488 AEEVFRAMLKNGISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIW 547
           AEEVFRAMLK+GISPNQQVYTALVHGYIKAE+MEDA+EILKQ+T+C IKPDL+LYGTIIW
Sbjct: 481 AEEVFRAMLKDGISPNQQVYTALVHGYIKAEKMEDALEILKQITKCGIKPDLVLYGTIIW 540

Query: 548 GLCSQSKLEETKLIIKEMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGVEA 607
           GLC+Q+KLEETKLIIKEM+S+GI ANPVIYTTIIDAYFKAGK SDA++LLQEMQ+ GVEA
Sbjct: 541 GLCNQNKLEETKLIIKEMKSRGIRANPVIYTTIIDAYFKAGKGSDALDLLQEMQEVGVEA 600

Query: 608 TVVTYCVLIDGLCKAGMVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCVESAKKLF 667
           TVVTYCVLIDGLCK GMVE+AVDYFGRMSD G+QPNVAVYTALIDGLCK NC+ESAKKLF
Sbjct: 601 TVVTYCVLIDGLCKTGMVEVAVDYFGRMSDFGVQPNVAVYTALIDGLCKINCIESAKKLF 660

Query: 668 DEMQCRGMTPDITAFTALIDGNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFSEY 727
           DEMQCRGMTPD TAFTALIDGNLKLGNLQEA +LIS+MTE   EFDLHAYT+LVSGFS+ 
Sbjct: 661 DEMQCRGMTPDKTAFTALIDGNLKLGNLQEALNLISKMTELVIEFDLHAYTTLVSGFSQC 720

Query: 728 GELHQARKFFSEMIEKGILPEEILCISLLREYYKLGQLDEAIELKNEMQRRGLISENCSL 787
           GELHQARKFF+EMIEKGILP+EILCI LLREY KLG LDEAIELKNEMQRRGLI+E CS 
Sbjct: 721 GELHQARKFFNEMIEKGILPDEILCICLLREYNKLGHLDEAIELKNEMQRRGLITEKCSH 780

Query: 788 AVPCLKT 795
            VP LKT
Sbjct: 781 EVPSLKT 787

BLAST of Cla97C01G007410 vs. ExPASy Swiss-Prot
Match: P0C894 (Putative pentatricopeptide repeat-containing protein At2g02150 OS=Arabidopsis thaliana OX=3702 GN=At2g02150 PE=3 SV=1)

HSP 1 Score: 859.4 bits (2219), Expect = 3.3e-248
Identity = 427/769 (55.53%), Positives = 562/769 (73.08%), Query Frame = 0

Query: 24  MLLFFRTLFHVSRRASYQVI-SLSSNSSHPDCLSFNVFNPSSSLTSINAYCISRHFFWFT 83
           M    R   HV+RR    V  S SS S     L F + +PS S +S     IS  F WFT
Sbjct: 1   MFCSLRNFLHVNRRFPRHVSPSSSSLSQIQSPLCFPLSSPSPSQSSF----ISCPFVWFT 60

Query: 84  SFLRIFRLPFVSYSGTNNSFEFLDIGTLRKIIQQDLWNDPKIVILLDSALAPIWVSKVLV 143
           SFL I R PFV+ SGT+   E  D   +RK++  DLW+DP +  L D  LAPIWV +VLV
Sbjct: 61  SFLCIIRYPFVTKSGTSTYSEDFDRDWIRKVVHNDLWDDPGLEKLFDLTLAPIWVPRVLV 120

Query: 144 ELKEDPNLALKFFKWAGSRIGFRHTTESYCIVAHMLFRARMYTNAHDIIKEVIVKSRIDV 203
           ELKEDP LA KFFKW+ +R GF+H+ ESYCIVAH+LF ARMY +A+ ++KE+++ S+ D 
Sbjct: 121 ELKEDPKLAFKFFKWSMTRNGFKHSVESYCIVAHILFCARMYYDANSVLKEMVL-SKADC 180

Query: 204 GFPVFNIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANKCFSRMRKFRTLPKARSC 263
                ++FD+LWSTRN+C  G GVFD LFSV ++LG+L+EA +CFS+M++FR  PK RSC
Sbjct: 181 -----DVFDVLWSTRNVCVPGFGVFDALFSVLIDLGMLEEAIQCFSKMKRFRVFPKTRSC 240

Query: 264 NFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQ 323
           N LLHR +K G    V++FF DMIGAG  P+VFTYN+MID +CKEGD+E AR LF +M+ 
Sbjct: 241 NGLLHRFAKLGKTDDVKRFFKDMIGAGARPTVFTYNIMIDCMCKEGDVEAARGLFEEMKF 300

Query: 324 MGFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVITYNGLINCFCKFEKMPQ 383
            G  PD VTYNS+IDG+GKVG L++ V  F EMKD+ C PDVITYN LINCFCKF K+P 
Sbjct: 301 RGLVPDTVTYNSMIDGFGKVGRLDDTVCFFEEMKDMCCEPDVITYNALINCFCKFGKLPI 360

Query: 384 AFEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPNEFTYTSLID 443
             E+  +MK +GLKPNVV+YSTL+DAFCKEGMMQ AIK +VDMRRVGL+PNE+TYTSLID
Sbjct: 361 GLEFYREMKGNGLKPNVVSYSTLVDAFCKEGMMQQAIKFYVDMRRVGLVPNEYTYTSLID 420

Query: 444 AHCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKMMEAEEVFRAMLKNGISP 503
           A+CK GNL++A++L N+MLQ GV  N+VTYTAL+DGLC+  +M EAEE+F  M   G+ P
Sbjct: 421 ANCKIGNLSDAFRLGNEMLQVGVEWNVVTYTALIDGLCDAERMKEAEELFGKMDTAGVIP 480

Query: 504 NQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQSKLEETKLII 563
           N   Y AL+HG++KA+ M+ A+E+L ++    IKPDL+LYGT IWGLCS  K+E  K+++
Sbjct: 481 NLASYNALIHGFVKAKNMDRALELLNELKGRGIKPDLLLYGTFIWGLCSLEKIEAAKVVM 540

Query: 564 KEMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGVEATVVTYCVLIDGLCKA 623
            EM+  GI AN +IYTT++DAYFK+G  ++ ++LL EM++  +E TVVT+CVLIDGLCK 
Sbjct: 541 NEMKECGIKANSLIYTTLMDAYFKSGNPTEGLHLLDEMKELDIEVTVVTFCVLIDGLCKN 600

Query: 624 GMVELAVDYFGRMS-DLGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMTPDITA 683
            +V  AVDYF R+S D GLQ N A++TA+IDGLCK N VE+A  LF++M  +G+ PD TA
Sbjct: 601 KLVSKAVDYFNRISNDFGLQANAAIFTAMIDGLCKDNQVEAATTLFEQMVQKGLVPDRTA 660

Query: 684 FTALIDGNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFSEYGELHQARKFFSEMI 743
           +T+L+DGN K GN+ EA  L  +M E   + DL AYTSLV G S   +L +AR F  EMI
Sbjct: 661 YTSLMDGNFKQGNVLEALALRDKMAEIGMKLDLLAYTSLVWGLSHCNQLQKARSFLEEMI 720

Query: 744 EKGILPEEILCISLLREYYKLGQLDEAIELKNEMQRRGLISENCSLAVP 791
            +GI P+E+LCIS+L+++Y+LG +DEA+EL++ + +  L++ +   A+P
Sbjct: 721 GEGIHPDEVLCISVLKKHYELGCIDEAVELQSYLMKHQLLTSDNDNALP 759

BLAST of Cla97C01G007410 vs. ExPASy Swiss-Prot
Match: Q9ZUA2 (Pentatricopeptide repeat-containing protein At2g01740 OS=Arabidopsis thaliana OX=3702 GN=At2g01740 PE=3 SV=1)

HSP 1 Score: 377.9 bits (969), Expect = 2.9e-103
Identity = 207/564 (36.70%), Positives = 321/564 (56.91%), Query Frame = 0

Query: 239 LLDEANKCFSRMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYN 298
           ++ EA +  SR+RK   LP   +CN  +H+L  S  G L  KF   ++  G  P   ++N
Sbjct: 1   MVREALQFLSRLRKSSNLPDPFTCNKHIHQLINSNCGILSLKFLAYLVSRGYTPHRSSFN 60

Query: 299 VMIDYLCKEGDLENARRLFVQMRQMGFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMK-- 358
            ++ ++CK G ++ A  +   M + G  PDV++YNSLIDG+ + G +  A  +   ++  
Sbjct: 61  SVVSFVCKLGQVKFAEDIVHSMPRFGCEPDVISYNSLIDGHCRNGDIRSASLVLESLRAS 120

Query: 359 -DVGCVPDVITYNGLINCFCKFEKMPQAFEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMM 418
               C PD++++N L N F K + + + F Y+  M      PNVVTYST ID FCK G +
Sbjct: 121 HGFICKPDIVSFNSLFNGFSKMKMLDEVFVYMGVMLK-CCSPNVVTYSTWIDTFCKSGEL 180

Query: 419 QGAIKLFVDMRRVGLLPNEFTYTSLIDAHCKAGNLTEAWKLSNDMLQAGVNLNIVTYTAL 478
           Q A+K F  M+R  L PN  T+T LID +CKAG+L  A  L  +M +  ++LN+VTYTAL
Sbjct: 181 QLALKSFHSMKRDALSPNVVTFTCLIDGYCKAGDLEVAVSLYKEMRRVRMSLNVVTYTAL 240

Query: 479 MDGLCEDGKMMEAEEVFRAMLKNGISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNI 538
           +DG C+ G+M  AEE++  M+++ + PN  VYT ++ G+ +    ++AM+ L +M    +
Sbjct: 241 IDGFCKKGEMQRAEEMYSRMVEDRVEPNSLVYTTIIDGFFQRGDSDNAMKFLAKMLNQGM 300

Query: 539 KPDLILYGTIIWGLCSQSKLEETKLIIKEMESQGISANPVIYTTIIDAYFKAGKSSDAMN 598
           + D+  YG II GLC   KL+E   I+++ME   +  + VI+TT+++AYFK+G+   A+N
Sbjct: 301 RLDITAYGVIISGLCGNGKLKEATEIVEDMEKSDLVPDMVIFTTMMNAYFKSGRMKAAVN 360

Query: 599 LLQEMQDAGVEATVVTYCVLIDGLCKAGMVELAVDYFGRMSDLGLQPNVAVYTALIDGLC 658
           +  ++ + G E  VV    +IDG+ K G +  A+ YF        + N  +YT LID LC
Sbjct: 361 MYHKLIERGFEPDVVALSTMIDGIAKNGQLHEAIVYF-----CIEKANDVMYTVLIDALC 420

Query: 659 KTNCVESAKKLFDEMQCRGMTPDITAFTALIDGNLKLGNLQEASDLISRMTESATEFDLH 718
           K       ++LF ++   G+ PD   +T+ I G  K GNL +A  L +RM +     DL 
Sbjct: 421 KEGDFIEVERLFSKISEAGLVPDKFMYTSWIAGLCKQGNLVDAFKLKTRMVQEGLLLDLL 480

Query: 719 AYTSLVSGFSEYGELHQARKFFSEMIEKGILPEEILCISLLREYYKLGQLDEAIELKNEM 778
           AYT+L+ G +  G + +AR+ F EM+  GI P+  +   L+R Y K G +  A +L  +M
Sbjct: 481 AYTTLIYGLASKGLMVEARQVFDEMLNSGISPDSAVFDLLIRAYEKEGNMAAASDLLLDM 540

Query: 779 QRRGLI--------SENCSLAVPC 792
           QRRGL+        S+ C   V C
Sbjct: 541 QRRGLVTAVSDADCSKQCGNEVNC 558

BLAST of Cla97C01G007410 vs. ExPASy Swiss-Prot
Match: Q0WVK7 (Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g05670 PE=2 SV=1)

HSP 1 Score: 376.7 bits (966), Expect = 6.4e-103
Identity = 209/650 (32.15%), Positives = 346/650 (53.23%), Query Frame = 0

Query: 135 IWVSKVLVELKEDPNLALKFFKWAGSRIGFRHTTESYCIVAHMLFRARMYTNAHDIIKEV 194
           IW   VL+++K D  L L FF WA SR       ES CIV H+   ++    A  +I   
Sbjct: 91  IW---VLMKIKCDYRLVLDFFDWARSRRD--SNLESLCIVIHLAVASKDLKVAQSLISSF 150

Query: 195 IVKSRIDVGFPVFNIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANKCFSRMRKFR 254
             + +++V       FD+L  T     S   VFDV F V V+ GLL EA + F +M  + 
Sbjct: 151 WERPKLNVTDSFVQFFDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKMLNYG 210

Query: 255 TLPKARSCNFLLHRLSKSGNGQLVRKF-FNDMIGAGIAPSVFTYNVMIDYLCKEGDLENA 314
            +    SCN  L RLSK           F +    G+  +V +YN++I ++C+ G ++ A
Sbjct: 211 LVLSVDSCNVYLTRLSKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEA 270

Query: 315 RRLFVQMRQMGFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVITYNGLINC 374
             L + M   G+TPDV++Y+++++GY + G L++   L   MK  G  P+   Y  +I  
Sbjct: 271 HHLLLLMELKGYTPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGL 330

Query: 375 FCKFEKMPQAFEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPN 434
            C+  K+ +A E  S+M   G+ P+ V Y+TLID FCK G ++ A K F +M    + P+
Sbjct: 331 LCRICKLAEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPD 390

Query: 435 EFTYTSLIDAHCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKMMEAEEVFR 494
             TYT++I   C+ G++ EA KL ++M   G+  + VT+T L++G C+ G M +A  V  
Sbjct: 391 VLTYTAIISGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHN 450

Query: 495 AMLKNGISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQS 554
            M++ G SPN   YT L+ G  K   ++ A E+L +M +  ++P++  Y +I+ GLC   
Sbjct: 451 HMIQAGCSPNVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSG 510

Query: 555 KLEETKLIIKEMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGVEATVVTYC 614
            +EE   ++ E E+ G++A+ V YTT++DAY K+G+   A  +L+EM   G++ T+VT+ 
Sbjct: 511 NIEEAVKLVGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFN 570

Query: 615 VLIDGLCKAGMVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCR 674
           VL++G C  GM+E        M   G+ PN   + +L+   C  N +++A  ++ +M  R
Sbjct: 571 VLMNGFCLHGMLEDGEKLLNWMLAKGIAPNATTFNSLVKQYCIRNNLKAATAIYKDMCSR 630

Query: 675 GMTPDITAFTALIDGNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFSEYGELHQA 734
           G+ PD   +  L+ G+ K  N++EA  L   M        +  Y+ L+ GF +  +  +A
Sbjct: 631 GVGPDGKTYENLVKGHCKARNMKEAWFLFQEMKGKGFSVSVSTYSVLIKGFLKRKKFLEA 690

Query: 735 RKFFSEMIEKGILPEEILCISLLREYYKLGQLDEAIELKNEMQRRGLISE 784
           R+ F +M  +G+  ++ +        YK  + D  ++  +E+    L+ E
Sbjct: 691 REVFDQMRREGLAADKEIFDFFSDTKYKGKRPDTIVDPIDEIIENYLVDE 735

BLAST of Cla97C01G007410 vs. ExPASy Swiss-Prot
Match: Q9LVQ5 (Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana OX=3702 GN=At5g55840 PE=3 SV=2)

HSP 1 Score: 349.7 bits (896), Expect = 8.4e-95
Identity = 204/637 (32.03%), Positives = 327/637 (51.33%), Query Frame = 0

Query: 150 LALKFFKWAGSRIGFR--HTTESYCIVAHMLFRARMYTNAHDIIKEVIVKSRIDVGFPVF 209
           LALKF KW   + G    H  +  CI  H+L RARMY  A  I+KE+ + S    G   F
Sbjct: 52  LALKFLKWVVKQPGLETDHIVQLVCITTHILVRARMYDPARHILKELSLMS----GKSSF 111

Query: 210 NIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANKCFSRMRKFRTLPKARSCNFLLH 269
            +F  L +T  +C S   V+D+L  V++  G++ ++ + F  M  +   P   +CN +L 
Sbjct: 112 -VFGALMTTYRLCNSNPSVYDILIRVYLREGMIQDSLEIFRLMGLYGFNPSVYTCNAILG 171

Query: 270 RLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQMGFTP 329
            + KSG    V  F  +M+   I P V T+N++I+ LC EG  E +  L  +M + G+ P
Sbjct: 172 SVVKSGEDVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKSGYAP 231

Query: 330 DVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVITYNGLINCFCKFEKMPQAFEYL 389
            +VTYN+++  Y K G  + A+ L + MK  G   DV TYN LI+  C+  ++ + +  L
Sbjct: 232 TIVTYNTVLHWYCKKGRFKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKGYLLL 291

Query: 390 SKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDAHCKA 449
             M+   + PN VTY+TLI+ F  EG +  A +L  +M   GL PN  T+ +LID H   
Sbjct: 292 RDMRKRMIHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHISE 351

Query: 450 GNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKMMEAEEVFRAMLKNGISPNQQVY 509
           GN  EA K+   M   G+  + V+Y  L+DGLC++ +   A   +  M +NG+   +  Y
Sbjct: 352 GNFKEALKMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRITY 411

Query: 510 TALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQSKLEETKLIIKEMES 569
           T ++ G  K   +++A+ +L +M++  I PD++ Y  +I G C   + +  K I+  +  
Sbjct: 412 TGMIDGLCKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRIYR 471

Query: 570 QGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGVEATVVTYCVLIDGLCKAGMVEL 629
            G+S N +IY+T+I    + G   +A+ + + M   G      T+ VL+  LCKAG V  
Sbjct: 472 VGLSPNGIIYSTLIYNCCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKVAE 531

Query: 630 AVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMTPDITAFTALID 689
           A ++   M+  G+ PN   +  LI+G   +     A  +FDEM   G  P    + +L+ 
Sbjct: 532 AEEFMRCMTSDGILPNTVSFDCLINGYGNSGEGLKAFSVFDEMTKVGHHPTFFTYGSLLK 591

Query: 690 GNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFSEYGELHQARKFFSEMIEKGILP 749
           G  K G+L+EA   +  +       D   Y +L++   + G L +A   F EM+++ ILP
Sbjct: 592 GLCKGGHLREAEKFLKSLHAVPAAVDTVMYNTLLTAMCKSGNLAKAVSLFGEMVQRSILP 651

Query: 750 EEILCISLLREYYKLGQLDEAIELKNEMQRRGLISEN 785
           +     SL+    + G+   AI    E + RG +  N
Sbjct: 652 DSYTYTSLISGLCRKGKTVIAILFAKEAEARGNVLPN 683

BLAST of Cla97C01G007410 vs. ExPASy Swiss-Prot
Match: Q9FIX3 (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX=3702 GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 348.6 bits (893), Expect = 1.9e-94
Identity = 206/656 (31.40%), Positives = 338/656 (51.52%), Query Frame = 0

Query: 128 LDSALAPIWVSKVLVELKEDPNLALKFFKWAGSRIGFRHTTESYCIVAHMLFRARMYTNA 187
           L +   P   S +L++ + D  L LKF  WA     F  T    CI  H+L + ++Y  A
Sbjct: 42  LSANFTPEAASNLLLKSQNDQALILKFLNWANPHQFF--TLRCKCITLHILTKFKLYKTA 101

Query: 188 HDIIKEVIVKSRIDVGFPVFNIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANKCF 247
             + ++V  K+  D    +  +F  L  T ++C S + VFD++   +  L L+D+A    
Sbjct: 102 QILAEDVAAKTLDDEYASL--VFKSLQETYDLCYSTSSVFDLVVKSYSRLSLIDKALSIV 161

Query: 248 SRMRKFRTLPKARSCNFLLHRLSKS-GNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLCK 307
              +    +P   S N +L    +S  N       F +M+ + ++P+VFTYN++I   C 
Sbjct: 162 HLAQAHGFMPGVLSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCF 221

Query: 308 EGDLENARRLFVQMRQMGFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVIT 367
            G+++ A  LF +M   G  P+VVTYN+LIDGY K+  +++   L   M   G  P++I+
Sbjct: 222 AGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLIS 281

Query: 368 YNGLINCFCKFEKMPQAFEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMR 427
           YN +IN  C+  +M +    L++M   G   + VTY+TLI  +CKEG    A+ +  +M 
Sbjct: 282 YNVVINGLCREGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEML 341

Query: 428 RVGLLPNEFTYTSLIDAHCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKMM 487
           R GL P+  TYTSLI + CKAGN+  A +  + M   G+  N  TYT L+DG  + G M 
Sbjct: 342 RHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMN 401

Query: 488 EAEEVFRAMLKNGISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTII 547
           EA  V R M  NG SP+   Y AL++G+    +MEDA+ +L+ M E  + PD++ Y T++
Sbjct: 402 EAYRVLREMNDNGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVL 461

Query: 548 WGLCSQSKLEETKLIIKEMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGVE 607
            G C    ++E   + +EM  +GI  + + Y+++I  + +  ++ +A +L +EM   G+ 
Sbjct: 462 SGFCRSYDVDEALRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLP 521

Query: 608 ATVVTYCVLIDGLCKAGMVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCVESAKKL 667
               TY  LI+  C  G +E A+     M + G+ P+V  Y+ LI+GL K +    AK+L
Sbjct: 522 PDEFTYTALINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAKRL 581

Query: 668 FDEMQCRGMTP-DITAFT--------------ALIDGNLKLGNLQEASDLISRMTESATE 727
             ++      P D+T  T              +LI G    G + EA  +   M     +
Sbjct: 582 LLKLFYEESVPSDVTYHTLIENCSNIEFKSVVSLIKGFCMKGMMTEADQVFESMLGKNHK 641

Query: 728 FDLHAYTSLVSGFSEYGELHQARKFFSEMIEKGILPEEILCISLLREYYKLGQLDE 768
            D  AY  ++ G    G++ +A   + EM++ G L   +  I+L++  +K G+++E
Sbjct: 642 PDGTAYNIMIHGHCRAGDIRKAYTLYKEMVKSGFLLHTVTVIALVKALHKEGKVNE 693

BLAST of Cla97C01G007410 vs. ExPASy TrEMBL
Match: A0A6J1FET4 (putative pentatricopeptide repeat-containing protein At2g02150 OS=Cucurbita moschata OX=3662 GN=LOC111444847 PE=4 SV=1)

HSP 1 Score: 1396.3 bits (3613), Expect = 0.0e+00
Identity = 697/782 (89.13%), Positives = 736/782 (94.12%), Query Frame = 0

Query: 13  ELLAFASLLSAMLLFFRTLFHVSRRASYQVISLSSNSSHPDCLSFNVFNPSSSLTSINAY 72
           E   FASLLSAMLLFFR LF VSRRASY+VISLSSNSSHP CLSFN FN SSSLTSIN  
Sbjct: 1   EFFPFASLLSAMLLFFRGLFQVSRRASYRVISLSSNSSHPGCLSFNAFNASSSLTSINGC 60

Query: 73  CISRHFFWFTSFLRIFRLPFVSYSGTNNSFEFLDIGTLRKIIQQDLWNDPKIVILLDSAL 132
            IS    WF SFL IFRLPFVSYS TN+SFE LDIG+LRKIIQQDLWNDPKIVIL DSAL
Sbjct: 61  YIS--CLWFASFLCIFRLPFVSYSNTNSSFESLDIGSLRKIIQQDLWNDPKIVILFDSAL 120

Query: 133 APIWVSKVLVELKEDPNLALKFFKWAGSRIGFRHTTESYCIVAHMLFRARMYTNAHDIIK 192
           APIWVSK+LVELKEDP LALKFFKWAGS+IGF HTTESYCI+AHMLF ARMYTNAHDIIK
Sbjct: 121 APIWVSKILVELKEDPKLALKFFKWAGSQIGFCHTTESYCIIAHMLFCARMYTNAHDIIK 180

Query: 193 EVIVKSRIDVGFPVFNIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANKCFSRMRK 252
           EVI+K RID+ FPV NIFDMLWSTRN+C SGTGVFD+LFSV VELGLL+EAN+CFSRMRK
Sbjct: 181 EVILKCRIDMIFPVCNIFDMLWSTRNVCVSGTGVFDILFSVLVELGLLEEANECFSRMRK 240

Query: 253 FRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLCKEGDLEN 312
           FRTLPKARSCNFLLHRLSKSGNGQLV+ FFNDMIGAGIAPSVFTYNVMIDYLCKEGDLE+
Sbjct: 241 FRTLPKARSCNFLLHRLSKSGNGQLVKNFFNDMIGAGIAPSVFTYNVMIDYLCKEGDLES 300

Query: 313 ARRLFVQMRQMGFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVITYNGLIN 372
           ARRLFVQMRQMGF+PDVVTYNSLIDGYGKVGLLEE+VYLF EMKDVGCVPDVITYN LIN
Sbjct: 301 ARRLFVQMRQMGFSPDVVTYNSLIDGYGKVGLLEESVYLFKEMKDVGCVPDVITYNALIN 360

Query: 373 CFCKFEKMPQAFEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLP 432
           CFCKFEKMP+AFEYLS+MKN GLKPNVVTYSTLIDAFCKEGMMQ AIKLFVDMRRVGLLP
Sbjct: 361 CFCKFEKMPRAFEYLSEMKNSGLKPNVVTYSTLIDAFCKEGMMQYAIKLFVDMRRVGLLP 420

Query: 433 NEFTYTSLIDAHCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKMMEAEEVF 492
           NEFTYTSLIDA+CKAGNLTEAWKLSNDMLQAGVNLN+V+YTALMDGLCEDG+MMEAEEVF
Sbjct: 421 NEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNVVSYTALMDGLCEDGRMMEAEEVF 480

Query: 493 RAMLKNGISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQ 552
           +AMLK+G+SPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQ
Sbjct: 481 KAMLKDGLSPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQ 540

Query: 553 SKLEETKLIIKEMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGVEATVVTY 612
           +KLEETKLIIKEM+SQGISANPVIYTTI+DAYFKAGKSSDA+NLL +MQD GVEATVVTY
Sbjct: 541 NKLEETKLIIKEMKSQGISANPVIYTTIMDAYFKAGKSSDAINLLHKMQDMGVEATVVTY 600

Query: 613 CVLIDGLCKAGMVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQC 672
           CVLIDGLCK GMVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNC+ESAKKLFDEMQ 
Sbjct: 601 CVLIDGLCKTGMVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCIESAKKLFDEMQY 660

Query: 673 RGMTPDITAFTALIDGNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFSEYGELHQ 732
           RGMTPD TAFTALIDGNLKLGNLQEA DLISRMT+ A EFDLHAYTS+VSGFS+ G+LHQ
Sbjct: 661 RGMTPDKTAFTALIDGNLKLGNLQEALDLISRMTDLAIEFDLHAYTSMVSGFSQCGDLHQ 720

Query: 733 ARKFFSEMIEKGILPEEILCISLLREYYKLGQLDEAIELKNEMQRRGLISENCSLAVPCL 792
           ARKFF+EMIEKGILPEEILC  LLREYYKLGQLDEAIELKNEM+RRGLI+ENCSL VP L
Sbjct: 721 ARKFFNEMIEKGILPEEILCTCLLREYYKLGQLDEAIELKNEMRRRGLITENCSLEVPSL 780

Query: 793 KT 795
           +T
Sbjct: 781 RT 780

BLAST of Cla97C01G007410 vs. ExPASy TrEMBL
Match: A0A6J1H589 (putative pentatricopeptide repeat-containing protein At2g02150 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111460688 PE=4 SV=1)

HSP 1 Score: 1389.8 bits (3596), Expect = 0.0e+00
Identity = 685/787 (87.04%), Positives = 739/787 (93.90%), Query Frame = 0

Query: 8   MKLPIELLAFASLLSAMLLFFRTLFHVSRRASYQVISLSSNSSHPDCLSFNVFNPSSSLT 67
           MKL +E+LAFASL SAMLLFFR+LFHVSRRASY+VISLS NSSHP CLSFNVFN  SSLT
Sbjct: 1   MKLSVEVLAFASLFSAMLLFFRSLFHVSRRASYRVISLSLNSSHPGCLSFNVFNGPSSLT 60

Query: 68  SINAYCISRHFFWFTSFLRIFRLPFVSYSGTNNSFEFLDIGTLRKIIQQDLWNDPKIVIL 127
           S+N Y IS  FFWFTSFL IFRLPFVSYS TN+SFE LDIG+LRKIIQQDLWNDPKIV+L
Sbjct: 61  SMNGYYISCPFFWFTSFLCIFRLPFVSYSITNDSFELLDIGSLRKIIQQDLWNDPKIVVL 120

Query: 128 LDSALAPIWVSKVLVELKEDPNLALKFFKWAGSRIGFRHTTESYCIVAHMLFRARMYTNA 187
            DSALAPIWVSK+LVELKEDP LALKFFKWAG+ IGFRHTTESYCI+ HMLFRARMYTNA
Sbjct: 121 FDSALAPIWVSKILVELKEDPKLALKFFKWAGTHIGFRHTTESYCIIVHMLFRARMYTNA 180

Query: 188 HDIIKEVIVKSRIDVGFPVFNIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANKCF 247
           HDI+KE+++KSR D+  PV N+FD+LWSTRN C SGTGVFDVLFSV VELGLL+EAN+CF
Sbjct: 181 HDIMKEMVLKSRTDLILPVCNVFDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECF 240

Query: 248 SRMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLCKE 307
           S+MRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFF+DMIGAGIAPSVFTYNVMID+LCKE
Sbjct: 241 SKMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFHDMIGAGIAPSVFTYNVMIDHLCKE 300

Query: 308 GDLENARRLFVQMRQMGFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVITY 367
           GDLENAR LFVQMR MGF+PDVVTYNSLIDGYGKVGLL+E+VYLFNEMKDVGCVPDVITY
Sbjct: 301 GDLENARSLFVQMRTMGFSPDVVTYNSLIDGYGKVGLLKESVYLFNEMKDVGCVPDVITY 360

Query: 368 NGLINCFCKFEKMPQAFEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRR 427
           N LINCFCKFEKMPQAFEYLS+MKN GLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRR
Sbjct: 361 NALINCFCKFEKMPQAFEYLSEMKNIGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRR 420

Query: 428 VGLLPNEFTYTSLIDAHCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKMME 487
           VGLLPNEFTYTSLIDA+CKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDG+MME
Sbjct: 421 VGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMME 480

Query: 488 AEEVFRAMLKNGISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIW 547
           AEEVFRAMLK+GISPNQQVYTALVHGYIKAE+MEDA+EILKQMTEC IKPDL+LYGTIIW
Sbjct: 481 AEEVFRAMLKDGISPNQQVYTALVHGYIKAEKMEDALEILKQMTECGIKPDLVLYGTIIW 540

Query: 548 GLCSQSKLEETKLIIKEMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGVEA 607
           GLC+Q+KLEETKLIIKEM+S+GI ANPVIYTTIIDAYFKAGKSSDA++LLQEMQ+ GVEA
Sbjct: 541 GLCNQNKLEETKLIIKEMKSRGIRANPVIYTTIIDAYFKAGKSSDALDLLQEMQEVGVEA 600

Query: 608 TVVTYCVLIDGLCKAGMVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCVESAKKLF 667
           TVVTYCVLIDGLCK GMVE+AVDYFGRMSD G+QPNVAVYTALIDGLCK NC+ESA+KLF
Sbjct: 601 TVVTYCVLIDGLCKTGMVEVAVDYFGRMSDFGVQPNVAVYTALIDGLCKINCIESAEKLF 660

Query: 668 DEMQCRGMTPDITAFTALIDGNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFSEY 727
           +EMQCRGMTPD TAFTALIDGNLKLGNLQE  +LIS+MTE   EFDLHAYT+LVSGFS+ 
Sbjct: 661 EEMQCRGMTPDKTAFTALIDGNLKLGNLQETLNLISKMTELVIEFDLHAYTTLVSGFSQC 720

Query: 728 GELHQARKFFSEMIEKGILPEEILCISLLREYYKLGQLDEAIELKNEMQRRGLISENCSL 787
           GELHQARKFF+EMIEKGILP+EILCI LL+EY KLG LDEAI+LKNEMQRRGLI+E CS 
Sbjct: 721 GELHQARKFFNEMIEKGILPDEILCICLLKEYNKLGHLDEAIKLKNEMQRRGLITEKCSH 780

Query: 788 AVPCLKT 795
            VP LKT
Sbjct: 781 EVPSLKT 787

BLAST of Cla97C01G007410 vs. ExPASy TrEMBL
Match: A0A5D3BDW6 (Putative pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold174G00710 PE=4 SV=1)

HSP 1 Score: 1377.1 bits (3563), Expect = 0.0e+00
Identity = 680/785 (86.62%), Positives = 735/785 (93.63%), Query Frame = 0

Query: 8   MKLPIE--LLAFASLLSAMLLFFRTLFHVSRRASYQVISLSSNSSHPDCLSFNVFNPSSS 67
           MKL +E  LLAF SLLSAMLLFFRTLFHVSRRAS++VISLSSNSSHPD LSFNVFNPSSS
Sbjct: 1   MKLSVELLLLAFPSLLSAMLLFFRTLFHVSRRASFRVISLSSNSSHPDSLSFNVFNPSSS 60

Query: 68  LTSINAYCISRHFFWFTSFLRIFRLPFVSYSGTNNSFEFLDIGTLRKIIQQDLWNDPKIV 127
           LTSINAYCISR FFWFTSFL IFRLPFVSYS  NNSFEFLDIG+LRKIIQQDLWNDPKIV
Sbjct: 61  LTSINAYCISRPFFWFTSFLCIFRLPFVSYSNANNSFEFLDIGSLRKIIQQDLWNDPKIV 120

Query: 128 ILLDSALAPIWVSKVLVELKEDPNLALKFFKWAGSRIGFRHTTESYCIVAHMLFRARMYT 187
           +L DSALAPIWVS++LV LKEDP LALKFFKWAGS++GFRHTTESYCI+ H++FRARMYT
Sbjct: 121 VLFDSALAPIWVSRILVGLKEDPKLALKFFKWAGSQVGFRHTTESYCIIVHLVFRARMYT 180

Query: 188 NAHDIIKEVIVKSRIDVGFPVFNIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANK 247
           +AHD +KEVI+K+RID+GFPV NIFDMLWSTRNIC SG+GVFDVLFSVFVELGLL+EAN+
Sbjct: 181 DAHDTVKEVIMKNRIDMGFPVCNIFDMLWSTRNICVSGSGVFDVLFSVFVELGLLEEANE 240

Query: 248 CFSRMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLC 307
           CFSRMR FRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLC
Sbjct: 241 CFSRMRNFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLC 300

Query: 308 KEGDLENARRLFVQMRQMGFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVI 367
           KEGDLENARRLFVQMR+MG +PDVVTYNSLIDGYGKVG LEEAV  FNEMKDVGCVPD+I
Sbjct: 301 KEGDLENARRLFVQMREMGLSPDVVTYNSLIDGYGKVGSLEEAVSFFNEMKDVGCVPDII 360

Query: 368 TYNGLINCFCKFEKMPQAFEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDM 427
           TYNGLINC+CKFEKMP+AFEY S+MKN+GLKPNVVTYSTLIDAFCKEGMMQGA+KLFVDM
Sbjct: 361 TYNGLINCYCKFEKMPRAFEYFSEMKNNGLKPNVVTYSTLIDAFCKEGMMQGAVKLFVDM 420

Query: 428 RRVGLLPNEFTYTSLIDAHCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKM 487
           +R GLLPNEFTYTSLIDA+CKAGNLTEAWKL NDMLQAGV LNIVTYTAL+DGLCEDG+M
Sbjct: 421 KRAGLLPNEFTYTSLIDANCKAGNLTEAWKLLNDMLQAGVKLNIVTYTALVDGLCEDGRM 480

Query: 488 MEAEEVFRAMLKNGISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTI 547
           +EAEEVFR+MLK+GISPNQQVYTALVHGYIKAERMEDAM+ILKQM ECNIKPDLILYG++
Sbjct: 481 IEAEEVFRSMLKDGISPNQQVYTALVHGYIKAERMEDAMKILKQMKECNIKPDLILYGSV 540

Query: 548 IWGLCSQSKLEETKLIIKEMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGV 607
           IWGLCSQSKLEETKLI+KEM+S+GISANPVIYTTIIDAYFKAGKSSDA+NL QEMQD GV
Sbjct: 541 IWGLCSQSKLEETKLILKEMKSRGISANPVIYTTIIDAYFKAGKSSDAINLFQEMQDVGV 600

Query: 608 EATVVTYCVLIDGLCKAGMVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCVESAKK 667
           EATVVTYCVLIDGLCKAG+VELAVDYF RM  LGLQPNVAVYT+LIDGL KTNC++SA K
Sbjct: 601 EATVVTYCVLIDGLCKAGIVELAVDYFCRMFSLGLQPNVAVYTSLIDGLSKTNCIKSANK 660

Query: 668 LFDEMQCRGMTPDITAFTALIDGNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFS 727
           LFDEMQCRGMTPDITAFTALIDGNLK GNLQEA   ISRMTE A EFDLH YTSLV+GFS
Sbjct: 661 LFDEMQCRGMTPDITAFTALIDGNLKHGNLQEALVFISRMTELAIEFDLHFYTSLVAGFS 720

Query: 728 EYGELHQARKFFSEMIEKGILPEEILCISLLREYYKLGQLDEAIELKNEMQRRGLISENC 787
           + GEL QARKFF+EMI+KGILPEE+LCI LLREY K GQLDEAIELKNEMQ  GLI+E+ 
Sbjct: 721 KCGELRQARKFFNEMIKKGILPEEVLCICLLREYCKRGQLDEAIELKNEMQGMGLITESA 780

Query: 788 SLAVP 791
           ++  P
Sbjct: 781 AMQFP 785

BLAST of Cla97C01G007410 vs. ExPASy TrEMBL
Match: A0A6J1CX77 (putative pentatricopeptide repeat-containing protein At2g02150 OS=Momordica charantia OX=3673 GN=LOC111015641 PE=4 SV=1)

HSP 1 Score: 1371.3 bits (3548), Expect = 0.0e+00
Identity = 683/785 (87.01%), Positives = 729/785 (92.87%), Query Frame = 0

Query: 8   MKLPIELLAFASLLSAMLLFFRTLFHVSRRASYQVISLSSNSSHPDCLSFNVFNPSSSLT 67
           MKL +E+LAFAS LSAMLLFFR+LFHVSRRAS++VI+LSS SSHP CLSFN+FN  SS  
Sbjct: 1   MKLCVEVLAFASFLSAMLLFFRSLFHVSRRASHRVIALSSISSHPGCLSFNIFNAPSSK- 60

Query: 68  SINAYCISRHFFWFTSFLRIFRLPFVSYSGTNNSFEFLDIGTLRKIIQQDLWNDPKIVIL 127
             N YCIS   FWF SFL IFRLPFV+YS TNNSFEFLD G+LRKIIQQDLWNDP IV+L
Sbjct: 61  --NGYCISFPSFWFASFLCIFRLPFVTYSNTNNSFEFLDFGSLRKIIQQDLWNDPMIVVL 120

Query: 128 LDSALAPIWVSKVLVELKEDPNLALKFFKWAGSRIGFRHTTESYCIVAHMLFRARMYTNA 187
            DSALAPIWVSK+LVE KEDP LA KFFKWAGS++GFRHTTE+YCIV H+LFRARMY NA
Sbjct: 121 FDSALAPIWVSKILVEFKEDPKLAFKFFKWAGSQVGFRHTTEAYCIVVHILFRARMYANA 180

Query: 188 HDIIKEVIVKSRIDVGFPVFNIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANKCF 247
           HDIIKEVI+KS+ D+  PV  IFD+LWSTRNI   GTGVFDVLFSV VELGLL+EAN+CF
Sbjct: 181 HDIIKEVILKSQNDLVLPVCKIFDILWSTRNIFVLGTGVFDVLFSVLVELGLLEEANECF 240

Query: 248 SRMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLCKE 307
            RMRKFRTLPKARSCNFLLHRLSKSG GQLVRKFF+DMIGAGIAPSVFTYNVMIDYLCKE
Sbjct: 241 LRMRKFRTLPKARSCNFLLHRLSKSGKGQLVRKFFHDMIGAGIAPSVFTYNVMIDYLCKE 300

Query: 308 GDLENARRLFVQMRQMGFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVITY 367
           GDLENARRLFVQMRQMGF+PDVVTYNSLIDGYGKVGLLEE+VYLFNEMK  GCVPDVITY
Sbjct: 301 GDLENARRLFVQMRQMGFSPDVVTYNSLIDGYGKVGLLEESVYLFNEMKGAGCVPDVITY 360

Query: 368 NGLINCFCKFEKMPQAFEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRR 427
           N LINCFCKFEKMP+AFEYLS+MKN+GLKPNVVTYSTLIDAFCKEG+MQGAIKLFVDMRR
Sbjct: 361 NALINCFCKFEKMPRAFEYLSEMKNNGLKPNVVTYSTLIDAFCKEGIMQGAIKLFVDMRR 420

Query: 428 VGLLPNEFTYTSLIDAHCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKMME 487
           VGL+PNEFTYTSLIDA+CKAGNLTEAWKLS+DMLQAGVNLNIVTYTALMDGLCEDG+M E
Sbjct: 421 VGLVPNEFTYTSLIDANCKAGNLTEAWKLSHDMLQAGVNLNIVTYTALMDGLCEDGRMTE 480

Query: 488 AEEVFRAMLKNGISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIW 547
           AEEV+RAMLK+GISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIW
Sbjct: 481 AEEVYRAMLKDGISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIW 540

Query: 548 GLCSQSKLEETKLIIKEMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGVEA 607
           GLCSQ+KLEETKLIIKEM+S+GI+ANPVIYTTIIDAYFKAG+SSDA+NLLQEMQDAG+EA
Sbjct: 541 GLCSQNKLEETKLIIKEMKSRGINANPVIYTTIIDAYFKAGESSDAINLLQEMQDAGIEA 600

Query: 608 TVVTYCVLIDGLCKAGMVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCVESAKKLF 667
           TVVTYCVLIDGLCK G VELAVDYFGRMS  GLQPNVAVYTALIDGLCKTNCVESAKKLF
Sbjct: 601 TVVTYCVLIDGLCKTGKVELAVDYFGRMSAFGLQPNVAVYTALIDGLCKTNCVESAKKLF 660

Query: 668 DEMQCRGMTPDITAFTALIDGNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFSEY 727
           DEMQCRGM PD TAFTALIDGNLKLGNLQEA +L SRMTE A EFDLHAYTSLVSGFS+ 
Sbjct: 661 DEMQCRGMAPDKTAFTALIDGNLKLGNLQEALNLSSRMTELAIEFDLHAYTSLVSGFSQC 720

Query: 728 GELHQARKFFSEMIEKGILPEEILCISLLREYYKLGQLDEAIELKNEMQRRGLISENCSL 787
           GELHQARKFF EM+EKGILPEEILCI LLREYYKLGQLDEAIELK+EMQRRGLI+E CS 
Sbjct: 721 GELHQARKFFDEMVEKGILPEEILCICLLREYYKLGQLDEAIELKSEMQRRGLITEKCSH 780

Query: 788 AVPCL 793
           AVP L
Sbjct: 781 AVPSL 782

BLAST of Cla97C01G007410 vs. ExPASy TrEMBL
Match: A0A1S3CT40 (putative pentatricopeptide repeat-containing protein At2g02150 OS=Cucumis melo OX=3656 GN=LOC103503999 PE=4 SV=1)

HSP 1 Score: 1370.1 bits (3545), Expect = 0.0e+00
Identity = 678/785 (86.37%), Positives = 733/785 (93.38%), Query Frame = 0

Query: 8   MKLPIE--LLAFASLLSAMLLFFRTLFHVSRRASYQVISLSSNSSHPDCLSFNVFNPSSS 67
           MKL +E  LLAF SLLSAMLLFFRTLFHVSRRAS++VISLSSNSSHPD LSFNVFNPSSS
Sbjct: 2   MKLSVELLLLAFPSLLSAMLLFFRTLFHVSRRASFRVISLSSNSSHPDSLSFNVFNPSSS 61

Query: 68  LTSINAYCISRHFFWFTSFLRIFRLPFVSYSGTNNSFEFLDIGTLRKIIQQDLWNDPKIV 127
           LTSINAY ISR FFWFTSFL IFRLPFVSYS  NNS EFLDIG+LRKIIQQDLWNDPKIV
Sbjct: 62  LTSINAYRISRPFFWFTSFLCIFRLPFVSYSNANNSIEFLDIGSLRKIIQQDLWNDPKIV 121

Query: 128 ILLDSALAPIWVSKVLVELKEDPNLALKFFKWAGSRIGFRHTTESYCIVAHMLFRARMYT 187
           +L DSALAPIWVS++LV LKEDP LALKFFKWAGS++GFRHTTESYCI+ H++FRARMYT
Sbjct: 122 VLFDSALAPIWVSRILVGLKEDPKLALKFFKWAGSQVGFRHTTESYCIIVHLVFRARMYT 181

Query: 188 NAHDIIKEVIVKSRIDVGFPVFNIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANK 247
           +AHD +KEVI+K+RID+GFPV NIFDMLWSTRNIC SG+GVFDVLFSVFVELGLL+EAN+
Sbjct: 182 DAHDTVKEVIMKNRIDMGFPVCNIFDMLWSTRNICVSGSGVFDVLFSVFVELGLLEEANE 241

Query: 248 CFSRMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLC 307
           CFSRMR FRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLC
Sbjct: 242 CFSRMRNFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLC 301

Query: 308 KEGDLENARRLFVQMRQMGFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVI 367
           KEGDLENARRLFVQMR+MG +PDVVTYNSLIDGYGKVG LEEAV  FNEMKDVGCVPD+I
Sbjct: 302 KEGDLENARRLFVQMREMGLSPDVVTYNSLIDGYGKVGSLEEAVSFFNEMKDVGCVPDII 361

Query: 368 TYNGLINCFCKFEKMPQAFEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDM 427
           TYNGLINC+CKFEKMP+AFEY S+MKN+GLKPNVVTYSTLIDAFCKEGMMQGA+KLFVDM
Sbjct: 362 TYNGLINCYCKFEKMPRAFEYFSEMKNNGLKPNVVTYSTLIDAFCKEGMMQGAVKLFVDM 421

Query: 428 RRVGLLPNEFTYTSLIDAHCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKM 487
           +R GLLPNEFTYTSLIDA+CKAGNLTEAWKL NDMLQAGV LNIVTYTAL+DGLCEDG+M
Sbjct: 422 KRAGLLPNEFTYTSLIDANCKAGNLTEAWKLLNDMLQAGVKLNIVTYTALVDGLCEDGRM 481

Query: 488 MEAEEVFRAMLKNGISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTI 547
           +EAEEVFR+MLK+GISPNQQVYTALVHGYIKAERMEDAM+ILKQM ECNIKPDLILYG++
Sbjct: 482 IEAEEVFRSMLKDGISPNQQVYTALVHGYIKAERMEDAMKILKQMKECNIKPDLILYGSV 541

Query: 548 IWGLCSQSKLEETKLIIKEMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGV 607
           IWGLCSQSKLEETKLI+KEM+S+GISANPVIYTTIIDAYFKAGKSSDA+NL QEMQD GV
Sbjct: 542 IWGLCSQSKLEETKLILKEMKSRGISANPVIYTTIIDAYFKAGKSSDAINLFQEMQDVGV 601

Query: 608 EATVVTYCVLIDGLCKAGMVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCVESAKK 667
           EATVVTYCVLIDGLCKAG+VELAVDYF RM  LGLQPNVAVYT+LIDGL KTNC++SA K
Sbjct: 602 EATVVTYCVLIDGLCKAGIVELAVDYFCRMFSLGLQPNVAVYTSLIDGLSKTNCIKSANK 661

Query: 668 LFDEMQCRGMTPDITAFTALIDGNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFS 727
           LFDEMQCRGMTPDITAFTALIDGNLK GNLQEA   ISRMTE A EFDLH YTSLV+GFS
Sbjct: 662 LFDEMQCRGMTPDITAFTALIDGNLKHGNLQEALVFISRMTELAIEFDLHFYTSLVAGFS 721

Query: 728 EYGELHQARKFFSEMIEKGILPEEILCISLLREYYKLGQLDEAIELKNEMQRRGLISENC 787
           + GEL QARKFF+EMI+KGILPEE+LCI LLREY K GQLDEAIELKNEMQ  GLI+E+ 
Sbjct: 722 KCGELRQARKFFNEMIKKGILPEEVLCICLLREYCKRGQLDEAIELKNEMQGMGLITESA 781

Query: 788 SLAVP 791
           ++  P
Sbjct: 782 AMQFP 786

BLAST of Cla97C01G007410 vs. TAIR 10
Match: AT2G02150.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 859.4 bits (2219), Expect = 2.3e-249
Identity = 427/769 (55.53%), Positives = 562/769 (73.08%), Query Frame = 0

Query: 24  MLLFFRTLFHVSRRASYQVI-SLSSNSSHPDCLSFNVFNPSSSLTSINAYCISRHFFWFT 83
           M    R   HV+RR    V  S SS S     L F + +PS S +S     IS  F WFT
Sbjct: 1   MFCSLRNFLHVNRRFPRHVSPSSSSLSQIQSPLCFPLSSPSPSQSSF----ISCPFVWFT 60

Query: 84  SFLRIFRLPFVSYSGTNNSFEFLDIGTLRKIIQQDLWNDPKIVILLDSALAPIWVSKVLV 143
           SFL I R PFV+ SGT+   E  D   +RK++  DLW+DP +  L D  LAPIWV +VLV
Sbjct: 61  SFLCIIRYPFVTKSGTSTYSEDFDRDWIRKVVHNDLWDDPGLEKLFDLTLAPIWVPRVLV 120

Query: 144 ELKEDPNLALKFFKWAGSRIGFRHTTESYCIVAHMLFRARMYTNAHDIIKEVIVKSRIDV 203
           ELKEDP LA KFFKW+ +R GF+H+ ESYCIVAH+LF ARMY +A+ ++KE+++ S+ D 
Sbjct: 121 ELKEDPKLAFKFFKWSMTRNGFKHSVESYCIVAHILFCARMYYDANSVLKEMVL-SKADC 180

Query: 204 GFPVFNIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANKCFSRMRKFRTLPKARSC 263
                ++FD+LWSTRN+C  G GVFD LFSV ++LG+L+EA +CFS+M++FR  PK RSC
Sbjct: 181 -----DVFDVLWSTRNVCVPGFGVFDALFSVLIDLGMLEEAIQCFSKMKRFRVFPKTRSC 240

Query: 264 NFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQ 323
           N LLHR +K G    V++FF DMIGAG  P+VFTYN+MID +CKEGD+E AR LF +M+ 
Sbjct: 241 NGLLHRFAKLGKTDDVKRFFKDMIGAGARPTVFTYNIMIDCMCKEGDVEAARGLFEEMKF 300

Query: 324 MGFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVITYNGLINCFCKFEKMPQ 383
            G  PD VTYNS+IDG+GKVG L++ V  F EMKD+ C PDVITYN LINCFCKF K+P 
Sbjct: 301 RGLVPDTVTYNSMIDGFGKVGRLDDTVCFFEEMKDMCCEPDVITYNALINCFCKFGKLPI 360

Query: 384 AFEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPNEFTYTSLID 443
             E+  +MK +GLKPNVV+YSTL+DAFCKEGMMQ AIK +VDMRRVGL+PNE+TYTSLID
Sbjct: 361 GLEFYREMKGNGLKPNVVSYSTLVDAFCKEGMMQQAIKFYVDMRRVGLVPNEYTYTSLID 420

Query: 444 AHCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKMMEAEEVFRAMLKNGISP 503
           A+CK GNL++A++L N+MLQ GV  N+VTYTAL+DGLC+  +M EAEE+F  M   G+ P
Sbjct: 421 ANCKIGNLSDAFRLGNEMLQVGVEWNVVTYTALIDGLCDAERMKEAEELFGKMDTAGVIP 480

Query: 504 NQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQSKLEETKLII 563
           N   Y AL+HG++KA+ M+ A+E+L ++    IKPDL+LYGT IWGLCS  K+E  K+++
Sbjct: 481 NLASYNALIHGFVKAKNMDRALELLNELKGRGIKPDLLLYGTFIWGLCSLEKIEAAKVVM 540

Query: 564 KEMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGVEATVVTYCVLIDGLCKA 623
            EM+  GI AN +IYTT++DAYFK+G  ++ ++LL EM++  +E TVVT+CVLIDGLCK 
Sbjct: 541 NEMKECGIKANSLIYTTLMDAYFKSGNPTEGLHLLDEMKELDIEVTVVTFCVLIDGLCKN 600

Query: 624 GMVELAVDYFGRMS-DLGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMTPDITA 683
            +V  AVDYF R+S D GLQ N A++TA+IDGLCK N VE+A  LF++M  +G+ PD TA
Sbjct: 601 KLVSKAVDYFNRISNDFGLQANAAIFTAMIDGLCKDNQVEAATTLFEQMVQKGLVPDRTA 660

Query: 684 FTALIDGNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFSEYGELHQARKFFSEMI 743
           +T+L+DGN K GN+ EA  L  +M E   + DL AYTSLV G S   +L +AR F  EMI
Sbjct: 661 YTSLMDGNFKQGNVLEALALRDKMAEIGMKLDLLAYTSLVWGLSHCNQLQKARSFLEEMI 720

Query: 744 EKGILPEEILCISLLREYYKLGQLDEAIELKNEMQRRGLISENCSLAVP 791
            +GI P+E+LCIS+L+++Y+LG +DEA+EL++ + +  L++ +   A+P
Sbjct: 721 GEGIHPDEVLCISVLKKHYELGCIDEAVELQSYLMKHQLLTSDNDNALP 759

BLAST of Cla97C01G007410 vs. TAIR 10
Match: AT2G01740.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 377.9 bits (969), Expect = 2.0e-104
Identity = 207/564 (36.70%), Positives = 321/564 (56.91%), Query Frame = 0

Query: 239 LLDEANKCFSRMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYN 298
           ++ EA +  SR+RK   LP   +CN  +H+L  S  G L  KF   ++  G  P   ++N
Sbjct: 1   MVREALQFLSRLRKSSNLPDPFTCNKHIHQLINSNCGILSLKFLAYLVSRGYTPHRSSFN 60

Query: 299 VMIDYLCKEGDLENARRLFVQMRQMGFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMK-- 358
            ++ ++CK G ++ A  +   M + G  PDV++YNSLIDG+ + G +  A  +   ++  
Sbjct: 61  SVVSFVCKLGQVKFAEDIVHSMPRFGCEPDVISYNSLIDGHCRNGDIRSASLVLESLRAS 120

Query: 359 -DVGCVPDVITYNGLINCFCKFEKMPQAFEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMM 418
               C PD++++N L N F K + + + F Y+  M      PNVVTYST ID FCK G +
Sbjct: 121 HGFICKPDIVSFNSLFNGFSKMKMLDEVFVYMGVMLK-CCSPNVVTYSTWIDTFCKSGEL 180

Query: 419 QGAIKLFVDMRRVGLLPNEFTYTSLIDAHCKAGNLTEAWKLSNDMLQAGVNLNIVTYTAL 478
           Q A+K F  M+R  L PN  T+T LID +CKAG+L  A  L  +M +  ++LN+VTYTAL
Sbjct: 181 QLALKSFHSMKRDALSPNVVTFTCLIDGYCKAGDLEVAVSLYKEMRRVRMSLNVVTYTAL 240

Query: 479 MDGLCEDGKMMEAEEVFRAMLKNGISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNI 538
           +DG C+ G+M  AEE++  M+++ + PN  VYT ++ G+ +    ++AM+ L +M    +
Sbjct: 241 IDGFCKKGEMQRAEEMYSRMVEDRVEPNSLVYTTIIDGFFQRGDSDNAMKFLAKMLNQGM 300

Query: 539 KPDLILYGTIIWGLCSQSKLEETKLIIKEMESQGISANPVIYTTIIDAYFKAGKSSDAMN 598
           + D+  YG II GLC   KL+E   I+++ME   +  + VI+TT+++AYFK+G+   A+N
Sbjct: 301 RLDITAYGVIISGLCGNGKLKEATEIVEDMEKSDLVPDMVIFTTMMNAYFKSGRMKAAVN 360

Query: 599 LLQEMQDAGVEATVVTYCVLIDGLCKAGMVELAVDYFGRMSDLGLQPNVAVYTALIDGLC 658
           +  ++ + G E  VV    +IDG+ K G +  A+ YF        + N  +YT LID LC
Sbjct: 361 MYHKLIERGFEPDVVALSTMIDGIAKNGQLHEAIVYF-----CIEKANDVMYTVLIDALC 420

Query: 659 KTNCVESAKKLFDEMQCRGMTPDITAFTALIDGNLKLGNLQEASDLISRMTESATEFDLH 718
           K       ++LF ++   G+ PD   +T+ I G  K GNL +A  L +RM +     DL 
Sbjct: 421 KEGDFIEVERLFSKISEAGLVPDKFMYTSWIAGLCKQGNLVDAFKLKTRMVQEGLLLDLL 480

Query: 719 AYTSLVSGFSEYGELHQARKFFSEMIEKGILPEEILCISLLREYYKLGQLDEAIELKNEM 778
           AYT+L+ G +  G + +AR+ F EM+  GI P+  +   L+R Y K G +  A +L  +M
Sbjct: 481 AYTTLIYGLASKGLMVEARQVFDEMLNSGISPDSAVFDLLIRAYEKEGNMAAASDLLLDM 540

Query: 779 QRRGLI--------SENCSLAVPC 792
           QRRGL+        S+ C   V C
Sbjct: 541 QRRGLVTAVSDADCSKQCGNEVNC 558

BLAST of Cla97C01G007410 vs. TAIR 10
Match: AT1G05670.1 (Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 376.7 bits (966), Expect = 4.6e-104
Identity = 209/650 (32.15%), Positives = 346/650 (53.23%), Query Frame = 0

Query: 135 IWVSKVLVELKEDPNLALKFFKWAGSRIGFRHTTESYCIVAHMLFRARMYTNAHDIIKEV 194
           IW   VL+++K D  L L FF WA SR       ES CIV H+   ++    A  +I   
Sbjct: 91  IW---VLMKIKCDYRLVLDFFDWARSRRD--SNLESLCIVIHLAVASKDLKVAQSLISSF 150

Query: 195 IVKSRIDVGFPVFNIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANKCFSRMRKFR 254
             + +++V       FD+L  T     S   VFDV F V V+ GLL EA + F +M  + 
Sbjct: 151 WERPKLNVTDSFVQFFDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKMLNYG 210

Query: 255 TLPKARSCNFLLHRLSKSGNGQLVRKF-FNDMIGAGIAPSVFTYNVMIDYLCKEGDLENA 314
            +    SCN  L RLSK           F +    G+  +V +YN++I ++C+ G ++ A
Sbjct: 211 LVLSVDSCNVYLTRLSKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEA 270

Query: 315 RRLFVQMRQMGFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVITYNGLINC 374
             L + M   G+TPDV++Y+++++GY + G L++   L   MK  G  P+   Y  +I  
Sbjct: 271 HHLLLLMELKGYTPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGL 330

Query: 375 FCKFEKMPQAFEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPN 434
            C+  K+ +A E  S+M   G+ P+ V Y+TLID FCK G ++ A K F +M    + P+
Sbjct: 331 LCRICKLAEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPD 390

Query: 435 EFTYTSLIDAHCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKMMEAEEVFR 494
             TYT++I   C+ G++ EA KL ++M   G+  + VT+T L++G C+ G M +A  V  
Sbjct: 391 VLTYTAIISGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHN 450

Query: 495 AMLKNGISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQS 554
            M++ G SPN   YT L+ G  K   ++ A E+L +M +  ++P++  Y +I+ GLC   
Sbjct: 451 HMIQAGCSPNVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSG 510

Query: 555 KLEETKLIIKEMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGVEATVVTYC 614
            +EE   ++ E E+ G++A+ V YTT++DAY K+G+   A  +L+EM   G++ T+VT+ 
Sbjct: 511 NIEEAVKLVGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFN 570

Query: 615 VLIDGLCKAGMVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCR 674
           VL++G C  GM+E        M   G+ PN   + +L+   C  N +++A  ++ +M  R
Sbjct: 571 VLMNGFCLHGMLEDGEKLLNWMLAKGIAPNATTFNSLVKQYCIRNNLKAATAIYKDMCSR 630

Query: 675 GMTPDITAFTALIDGNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFSEYGELHQA 734
           G+ PD   +  L+ G+ K  N++EA  L   M        +  Y+ L+ GF +  +  +A
Sbjct: 631 GVGPDGKTYENLVKGHCKARNMKEAWFLFQEMKGKGFSVSVSTYSVLIKGFLKRKKFLEA 690

Query: 735 RKFFSEMIEKGILPEEILCISLLREYYKLGQLDEAIELKNEMQRRGLISE 784
           R+ F +M  +G+  ++ +        YK  + D  ++  +E+    L+ E
Sbjct: 691 REVFDQMRREGLAADKEIFDFFSDTKYKGKRPDTIVDPIDEIIENYLVDE 735

BLAST of Cla97C01G007410 vs. TAIR 10
Match: AT1G05670.2 (Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 376.7 bits (966), Expect = 4.6e-104
Identity = 209/650 (32.15%), Positives = 346/650 (53.23%), Query Frame = 0

Query: 135 IWVSKVLVELKEDPNLALKFFKWAGSRIGFRHTTESYCIVAHMLFRARMYTNAHDIIKEV 194
           IW   VL+++K D  L L FF WA SR       ES CIV H+   ++    A  +I   
Sbjct: 91  IW---VLMKIKCDYRLVLDFFDWARSRRD--SNLESLCIVIHLAVASKDLKVAQSLISSF 150

Query: 195 IVKSRIDVGFPVFNIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANKCFSRMRKFR 254
             + +++V       FD+L  T     S   VFDV F V V+ GLL EA + F +M  + 
Sbjct: 151 WERPKLNVTDSFVQFFDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKMLNYG 210

Query: 255 TLPKARSCNFLLHRLSKSGNGQLVRKF-FNDMIGAGIAPSVFTYNVMIDYLCKEGDLENA 314
            +    SCN  L RLSK           F +    G+  +V +YN++I ++C+ G ++ A
Sbjct: 211 LVLSVDSCNVYLTRLSKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEA 270

Query: 315 RRLFVQMRQMGFTPDVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVITYNGLINC 374
             L + M   G+TPDV++Y+++++GY + G L++   L   MK  G  P+   Y  +I  
Sbjct: 271 HHLLLLMELKGYTPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGL 330

Query: 375 FCKFEKMPQAFEYLSKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPN 434
            C+  K+ +A E  S+M   G+ P+ V Y+TLID FCK G ++ A K F +M    + P+
Sbjct: 331 LCRICKLAEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPD 390

Query: 435 EFTYTSLIDAHCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKMMEAEEVFR 494
             TYT++I   C+ G++ EA KL ++M   G+  + VT+T L++G C+ G M +A  V  
Sbjct: 391 VLTYTAIISGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHN 450

Query: 495 AMLKNGISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQS 554
            M++ G SPN   YT L+ G  K   ++ A E+L +M +  ++P++  Y +I+ GLC   
Sbjct: 451 HMIQAGCSPNVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSG 510

Query: 555 KLEETKLIIKEMESQGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGVEATVVTYC 614
            +EE   ++ E E+ G++A+ V YTT++DAY K+G+   A  +L+EM   G++ T+VT+ 
Sbjct: 511 NIEEAVKLVGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFN 570

Query: 615 VLIDGLCKAGMVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCR 674
           VL++G C  GM+E        M   G+ PN   + +L+   C  N +++A  ++ +M  R
Sbjct: 571 VLMNGFCLHGMLEDGEKLLNWMLAKGIAPNATTFNSLVKQYCIRNNLKAATAIYKDMCSR 630

Query: 675 GMTPDITAFTALIDGNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFSEYGELHQA 734
           G+ PD   +  L+ G+ K  N++EA  L   M        +  Y+ L+ GF +  +  +A
Sbjct: 631 GVGPDGKTYENLVKGHCKARNMKEAWFLFQEMKGKGFSVSVSTYSVLIKGFLKRKKFLEA 690

Query: 735 RKFFSEMIEKGILPEEILCISLLREYYKLGQLDEAIELKNEMQRRGLISE 784
           R+ F +M  +G+  ++ +        YK  + D  ++  +E+    L+ E
Sbjct: 691 REVFDQMRREGLAADKEIFDFFSDTKYKGKRPDTIVDPIDEIIENYLVDE 735

BLAST of Cla97C01G007410 vs. TAIR 10
Match: AT5G55840.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 349.7 bits (896), Expect = 6.0e-96
Identity = 204/637 (32.03%), Positives = 327/637 (51.33%), Query Frame = 0

Query: 150 LALKFFKWAGSRIGFR--HTTESYCIVAHMLFRARMYTNAHDIIKEVIVKSRIDVGFPVF 209
           LALKF KW   + G    H  +  CI  H+L RARMY  A  I+KE+ + S    G   F
Sbjct: 92  LALKFLKWVVKQPGLETDHIVQLVCITTHILVRARMYDPARHILKELSLMS----GKSSF 151

Query: 210 NIFDMLWSTRNICASGTGVFDVLFSVFVELGLLDEANKCFSRMRKFRTLPKARSCNFLLH 269
            +F  L +T  +C S   V+D+L  V++  G++ ++ + F  M  +   P   +CN +L 
Sbjct: 152 -VFGALMTTYRLCNSNPSVYDILIRVYLREGMIQDSLEIFRLMGLYGFNPSVYTCNAILG 211

Query: 270 RLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQMGFTP 329
            + KSG    V  F  +M+   I P V T+N++I+ LC EG  E +  L  +M + G+ P
Sbjct: 212 SVVKSGEDVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKSGYAP 271

Query: 330 DVVTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVITYNGLINCFCKFEKMPQAFEYL 389
            +VTYN+++  Y K G  + A+ L + MK  G   DV TYN LI+  C+  ++ + +  L
Sbjct: 272 TIVTYNTVLHWYCKKGRFKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKGYLLL 331

Query: 390 SKMKNDGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDAHCKA 449
             M+   + PN VTY+TLI+ F  EG +  A +L  +M   GL PN  T+ +LID H   
Sbjct: 332 RDMRKRMIHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHISE 391

Query: 450 GNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGKMMEAEEVFRAMLKNGISPNQQVY 509
           GN  EA K+   M   G+  + V+Y  L+DGLC++ +   A   +  M +NG+   +  Y
Sbjct: 392 GNFKEALKMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRITY 451

Query: 510 TALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQSKLEETKLIIKEMES 569
           T ++ G  K   +++A+ +L +M++  I PD++ Y  +I G C   + +  K I+  +  
Sbjct: 452 TGMIDGLCKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRIYR 511

Query: 570 QGISANPVIYTTIIDAYFKAGKSSDAMNLLQEMQDAGVEATVVTYCVLIDGLCKAGMVEL 629
            G+S N +IY+T+I    + G   +A+ + + M   G      T+ VL+  LCKAG V  
Sbjct: 512 VGLSPNGIIYSTLIYNCCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKVAE 571

Query: 630 AVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMTPDITAFTALID 689
           A ++   M+  G+ PN   +  LI+G   +     A  +FDEM   G  P    + +L+ 
Sbjct: 572 AEEFMRCMTSDGILPNTVSFDCLINGYGNSGEGLKAFSVFDEMTKVGHHPTFFTYGSLLK 631

Query: 690 GNLKLGNLQEASDLISRMTESATEFDLHAYTSLVSGFSEYGELHQARKFFSEMIEKGILP 749
           G  K G+L+EA   +  +       D   Y +L++   + G L +A   F EM+++ ILP
Sbjct: 632 GLCKGGHLREAEKFLKSLHAVPAAVDTVMYNTLLTAMCKSGNLAKAVSLFGEMVQRSILP 691

Query: 750 EEILCISLLREYYKLGQLDEAIELKNEMQRRGLISEN 785
           +     SL+    + G+   AI    E + RG +  N
Sbjct: 692 DSYTYTSLISGLCRKGKTVIAILFAKEAEARGNVLPN 723
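
The identity and positives percentages in the HSP headers above are the residue counts divided by the aligned length. As a minimal sketch (the dictionary layout and the helper name `percent` are illustrative assumptions, not part of the database output), the reported values can be rechecked like this:

```python
# Residue counts copied from the HSP headers above:
# (identical residues, positive residues, aligned length).
hsps = {
    "AT1G05670.2": (209, 346, 650),
    "AT5G55840.1": (204, 327, 637),
}


def percent(count: int, aligned_length: int) -> float:
    """Return count / aligned_length as a percentage rounded to two decimals."""
    return round(100.0 * count / aligned_length, 2)


for name, (identical, positives, length) in hsps.items():
    # Prints the match name followed by identity % and positives %.
    print(name, percent(identical, length), percent(positives, length))
```

The rounded results should agree with the percentages printed in the HSP headers.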

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_038906984.1  0.0e+00   90.92         putative pentatricopeptide repeat-containing protein At2g02150 [Benincasa hispid...
XP_022938692.1  0.0e+00   89.13         putative pentatricopeptide repeat-containing protein At2g02150, partial [Cucurbi...
KAG6601913.1    0.0e+00   87.29         putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyros...
XP_022959692.1  0.0e+00   87.04         putative pentatricopeptide repeat-containing protein At2g02150 isoform X1 [Cucur...
XP_023534824.1  0.0e+00   86.79         putative pentatricopeptide repeat-containing protein At2g02150 isoform X1 [Cucur...

Match Name      E-value   Identity (%)  Description
P0C894          3.3e-248  55.53         Putative pentatricopeptide repeat-containing protein At2g02150 OS=Arabidopsis th...
Q9ZUA2          2.9e-103  36.70         Pentatricopeptide repeat-containing protein At2g01740 OS=Arabidopsis thaliana OX...
Q0WVK7          6.4e-103  32.15         Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidop...
Q9LVQ5          8.4e-95   32.03         Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana OX...
Q9FIX3          1.9e-94   31.40         Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX...

Match Name      E-value   Identity (%)  Description
A0A6J1FET4      0.0e+00   89.13         putative pentatricopeptide repeat-containing protein At2g02150 OS=Cucurbita mosc...
A0A6J1H589      0.0e+00   87.04         putative pentatricopeptide repeat-containing protein At2g02150 isoform X1 OS=Cuc...
A0A5D3BDW6      0.0e+00   86.62         Putative pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa...
A0A6J1CX77      0.0e+00   87.01         putative pentatricopeptide repeat-containing protein At2g02150 OS=Momordica char...
A0A1S3CT40      0.0e+00   86.37         putative pentatricopeptide repeat-containing protein At2g02150 OS=Cucumis melo O...

Match Name      E-value   Identity (%)  Description
AT2G02150.1     2.3e-249  55.53         Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G01740.1     2.0e-104  36.70         Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G05670.1     4.6e-104  32.15         Pentatricopeptide repeat (PPR-like) superfamily protein
AT1G05670.2     4.6e-104  32.15         Pentatricopeptide repeat (PPR-like) superfamily protein
AT5G55840.1     6.0e-96   32.03         Pentatricopeptide repeat (PPR) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment

IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 754..780  e-value: 0.0017  score: 18.5
    coord: 716..745  e-value: 5.7E-5  score: 23.1
    coord: 228..253  e-value: 0.12  score: 12.6
    coord: 541..570  e-value: 0.076  score: 13.3

IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 397..446  e-value: 5.2E-19  score: 68.2
    coord: 327..376  e-value: 2.6E-19  score: 69.2
    coord: 642..688  e-value: 5.5E-14  score: 52.1
    coord: 468..508  e-value: 2.1E-11  score: 43.8
    coord: 573..621  e-value: 1.6E-15  score: 57.0

IPR002885  Pentatricopeptide repeat  PFAM  PF12854  PPR_1
    coord: 289..321  e-value: 2.9E-10  score: 39.7

IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 610..644  e-value: 7.0E-10  score: 36.5
    coord: 753..780  e-value: 0.0032  score: 15.5
    coord: 646..679  e-value: 7.3E-10  score: 36.4
    coord: 400..434  e-value: 1.4E-9  score: 35.5
    coord: 435..468  e-value: 7.7E-8  score: 30.1
    coord: 295..329  e-value: 3.8E-10  score: 37.3
    coord: 506..538  e-value: 8.7E-8  score: 29.9
    coord: 330..364  e-value: 1.1E-12  score: 45.3
    coord: 365..399  e-value: 6.8E-7  score: 27.1
    coord: 470..503  e-value: 4.4E-10  score: 37.1
    coord: 716..747  e-value: 3.3E-6  score: 24.9
    coord: 261..293  e-value: 0.0024  score: 15.9
    coord: 575..608  e-value: 1.4E-7  score: 29.3

IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 363..397  score: 12.189023
    coord: 538..572  score: 9.339086
    coord: 293..327  score: 13.67976
    coord: 503..537  score: 11.761533
    coord: 573..607  score: 11.728648
    coord: 608..642  score: 12.24383
    coord: 328..362  score: 14.425128
    coord: 258..292  score: 8.714292
    coord: 643..677  score: 12.452094
    coord: 748..782  score: 9.328124
    coord: 468..502  score: 13.712644
    coord: 398..432  score: 13.230347
    coord: 713..747  score: 12.189023
    coord: 433..467  score: 11.904029

IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 708..778  e-value: 2.3E-12  score: 48.8
    coord: 428..496  e-value: 1.5E-21  score: 78.8
    coord: 635..707  e-value: 3.4E-20  score: 74.3
    coord: 127..286  e-value: 4.9E-15  score: 57.5
    coord: 287..356  e-value: 5.1E-25  score: 90.1
    coord: 357..427  e-value: 8.6E-25  score: 89.4
    coord: 564..634  e-value: 4.7E-21  score: 77.2
    coord: 497..563  e-value: 1.4E-17  score: 65.8

None  No IPR available  PANTHER  PTHR45613  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 137..785

None  No IPR available  PANTHER  PTHR45613:SF358  OS06G0565000 PROTEIN
    coord: 137..785

None  No IPR available  SUPERFAMILY  81901  HCP-like
    coord: 296..543
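
The PROSITE PS51375 coordinates listed above appear out of positional order, which makes it hard to see how the PPR motifs are arranged along the protein. The following is a minimal sketch that sorts them and groups back-to-back hits into tandem runs. The function name `tandem_runs` and the adjacency rule (a hit starts exactly one residue after the previous hit ends) are assumptions made for illustration, not part of the InterPro analysis.

```python
# PROSITE PS51375 (PPR) hit coordinates copied from the InterPro table above.
ppr_hits = [
    (363, 397), (538, 572), (293, 327), (503, 537), (573, 607),
    (608, 642), (328, 362), (258, 292), (643, 677), (748, 782),
    (468, 502), (398, 432), (713, 747), (433, 467),
]


def tandem_runs(hits):
    """Group hits into runs in which each hit starts right after the previous one ends."""
    runs = []
    for start, end in sorted(hits):
        if runs and start == runs[-1][-1][1] + 1:
            runs[-1].append((start, end))  # extends the current run
        else:
            runs.append([(start, end)])  # gap found: start a new run
    return runs


for run in tandem_runs(ppr_hits):
    # Prints the overall span of each run and how many hits it contains.
    print(f"{run[0][0]}..{run[-1][1]}: {len(run)} repeats")
```

Under these assumptions, the 14 hits fall into one run of 12 consecutive 35-residue windows (258..677) and a second run of 2 (713..782).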

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C01G007410.2  Cla97C01G007410.2  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0005515      protein binding