Cla97C01G023510 (gene) Watermelon (97103) v2

Name: Cla97C01G023510
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: Pentatricopeptide repeat
Location: Cla97Chr01 : 34908856 .. 34910763 (-)
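
As a quick sanity check on the location field, the span length can be derived from the two coordinates. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual genome-browser convention, not stated on this page).

```python
import re

def parse_location(text):
    """Split 'Chrom : start .. end (strand)' into its parts.

    Assumption: coordinates are 1-based and inclusive.
    """
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Number of bases covered by an inclusive [start, end] interval."""
    return end - start + 1

chrom, start, end, strand = parse_location("Cla97Chr01 : 34908856 .. 34910763 (-)")
length = span_length(start, end)
# Print the span length, the number of whole codons, and the remainder.
print(chrom, strand, length, length // 3, length % 3)
```

Under that assumption the span is 1908 bases, a whole number of codons, and the (-) strand means the feature is read from the reverse complement of the reference.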
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTACTCTATAGGCCTAAGTTCTTCTTCTGGAATTCGAAACAGAGATTGAATTTCCACGTCTCATCCACTGTTCCTCGCTTACCTCCGCCTCTTTCTTCTCTTCCACCAACACCTAGGATCAGCCAAATTAAGCAAGCCCATGCCCGTACTGTCGTTTTCGGCCTTGCTAACGATGGACGCATCACGGCTCACCTCCTCGCTTTTCTTGCCATTTCTTCCTCTTCACTGCCCTGTGGGTACCCCTTGTCAATTTATCATTCTATTGCTCATCCAAGTGTTTTTGCAACCAATAACATGATACGGTGCTTCGCTAAAGGGGACTTACCTCTCGAGTCCATTTCTCTTTACTCGCACATGCGCCGAAGTTTTGTGGCGGCGCCTAATAAATATACTCTCACCTTTGTGTTGCAAGCTTGCAGTAACGCTTTGGCCATCGGGGAAGGGGTTCAAGTTCAAACCCATGTGATTAAACTTGGTTTTGTCAATGACGTTTTCGTCCGAAATGCGTTGATTCACTTGTATTGTACTTGTTGTAGAGTGGAATCTGCGAAACAGGTGTTTGATGAAATTCCTAGTAGTCGAGATGTAGTATCTTGGAATTCAATGATAGCTGGGTTTGTTAGAGATGGACAAATCAATGTTGCAGAGAAACTGTTTGTTGAAATGCCTGAGAAAGATGTGATCTCATGGGGCTCGATTATATCTGGGTGTGTTCAAAATGGGGAATTGGATAAGGCATTAGACTACTTTAAAGAGATGGGAGAGCAAAAAATGAGACCGAATGAGGCAATATTGGTGTCCTTGCTCGCAGCAGCAGCCCAGCTTGGTATGCTTGAGTATGGGAAAATGATCCATTCCATTGCAAACTCACTGAGATTCCCAATGACTGCTTCTCTTGGCACAGCACTAATTGACATGTATGCTAAGTGCGGATGCATTGATGAGTCCAAATTCTTATTTGATAGAATGCCTGAGAAGGATAAATGGTCTTGGAATGTTATGATTTGTGGGCTAGCATCTCATGGTCTTGGGCAAGAAGCGCTTGCATTATTTGAGAAGTTTCTAACTCAAGGTTTCTACCCAGTCAACGTGACATTCATTGGAGTCTTGAATGCTTGTAGCCGAGCTGGTTTAGTCAGTGAGGGAAGACGCTTCTTTAAGCTAATGACGGACACATATGGCATTGAACCAGAGATGGAACACTATGGTTGCATGGTTGATCTCTTAAGCCGTGCCGGGTTCGTTTATGATGCTGTAGAAATGATTAACAGGATGCCTGCTCCTCCGGACCCTGTCTTGTGGGCAACGGTGCTTGGTTCTTGCAAGGTTCACGGATTTATAGAACTGGGTGAAGAGATTGGGAACAAGTTAATTCAAATGGATCCGACTCACAATGGGCATTATGTCCAGTTGGCTAGTATCTTTGCCAGACTAAGAAAGTGGGAAGACGTAAGCAAGGTTAGAAGACTAATGGCTGAAAGAAACTCTAACAAAATTGCTGGCTGGAGCTTGATTGAAGCAGAAGGAAGAGTTCACCGATTTGTCGCCGGAGATAAGGAGCATGAGCGAACTACTGAGATCTACAAGATTTTGGAGATAATTGGAGTAAGAATAGCAGCAGCAGGATACACAGCAAATGTTTCATCAGTTCTGCATGACATTGAGGAAGAAGAAAAAGAAACTGCCATTAAAGAGCACAGTGAAAGGTTGGCAATTGCTTTTGGCTTACTGGTGACTAAAGTTGGGGATTGTATTCGTATTATCAAGAATTTAAGAGTTTGTGGCGATTGCCATGAGGTAAGCAAGATAATTTCTCAAGTCTTTGAAAGAGAGATCATTGTTAGAGATGGCAGTAGATTTCACCATTTTAAGAATGGTAGTTGTTCTTGTCAAGATTATTGGTGA

mRNA sequence

ATGCTACTCTATAGGCCTAAGTTCTTCTTCTGGAATTCGAAACAGAGATTGAATTTCCACGTCTCATCCACTGTTCCTCGCTTACCTCCGCCTCTTTCTTCTCTTCCACCAACACCTAGGATCAGCCAAATTAAGCAAGCCCATGCCCGTACTGTCGTTTTCGGCCTTGCTAACGATGGACGCATCACGGCTCACCTCCTCGCTTTTCTTGCCATTTCTTCCTCTTCACTGCCCTGTGGGTACCCCTTGTCAATTTATCATTCTATTGCTCATCCAAGTGTTTTTGCAACCAATAACATGATACGGTGCTTCGCTAAAGGGGACTTACCTCTCGAGTCCATTTCTCTTTACTCGCACATGCGCCGAAGTTTTGTGGCGGCGCCTAATAAATATACTCTCACCTTTGTGTTGCAAGCTTGCAGTAACGCTTTGGCCATCGGGGAAGGGGTTCAAGTTCAAACCCATGTGATTAAACTTGGTTTTGTCAATGACGTTTTCGTCCGAAATGCGTTGATTCACTTGTATTGTACTTGTTGTAGAGTGGAATCTGCGAAACAGGTGTTTGATGAAATTCCTAGTAGTCGAGATGTAGTATCTTGGAATTCAATGATAGCTGGGTTTGTTAGAGATGGACAAATCAATGTTGCAGAGAAACTGTTTGTTGAAATGCCTGAGAAAGATGTGATCTCATGGGGCTCGATTATATCTGGGTGTGTTCAAAATGGGGAATTGGATAAGGCATTAGACTACTTTAAAGAGATGGGAGAGCAAAAAATGAGACCGAATGAGGCAATATTGGTGTCCTTGCTCGCAGCAGCAGCCCAGCTTGGTATGCTTGAGTATGGGAAAATGATCCATTCCATTGCAAACTCACTGAGATTCCCAATGACTGCTTCTCTTGGCACAGCACTAATTGACATGTATGCTAAGTGCGGATGCATTGATGAGTCCAAATTCTTATTTGATAGAATGCCTGAGAAGGATAAATGGTCTTGGAATGTTATGATTTGTGGGCTAGCATCTCATGGTCTTGGGCAAGAAGCGCTTGCATTATTTGAGAAGTTTCTAACTCAAGGTTTCTACCCAGTCAACGTGACATTCATTGGAGTCTTGAATGCTTGTAGCCGAGCTGGTTTAGTCAGTGAGGGAAGACGCTTCTTTAAGCTAATGACGGACACATATGGCATTGAACCAGAGATGGAACACTATGGTTGCATGGTTGATCTCTTAAGCCGTGCCGGGTTCGTTTATGATGCTGTAGAAATGATTAACAGGATGCCTGCTCCTCCGGACCCTGTCTTGTGGGCAACGGTGCTTGGTTCTTGCAAGGTTCACGGATTTATAGAACTGGGTGAAGAGATTGGGAACAAGTTAATTCAAATGGATCCGACTCACAATGGGCATTATGTCCAGTTGGCTAGTATCTTTGCCAGACTAAGAAAGTGGGAAGACGTAAGCAAGGTTAGAAGACTAATGGCTGAAAGAAACTCTAACAAAATTGCTGGCTGGAGCTTGATTGAAGCAGAAGGAAGAGTTCACCGATTTGTCGCCGGAGATAAGGAGCATGAGCGAACTACTGAGATCTACAAGATTTTGGAGATAATTGGAGTAAGAATAGCAGCAGCAGGATACACAGCAAATGTTTCATCAGTTCTGCATGACATTGAGGAAGAAGAAAAAGAAACTGCCATTAAAGAGCACAGTGAAAGGTTGGCAATTGCTTTTGGCTTACTGGTGACTAAAGTTGGGGATTGTATTCGTATTATCAAGAATTTAAGAGTTTGTGGCGATTGCCATGAGGTAAGCAAGATAATTTCTCAAGTCTTTGAAAGAGAGATCATTGTTAGAGATGGCAGTAGATTTCACCATTTTAAGAATGGTAGTTGTTCTTGTCAAGATTATTGGTGA

Coding sequence (CDS)

ATGCTACTCTATAGGCCTAAGTTCTTCTTCTGGAATTCGAAACAGAGATTGAATTTCCACGTCTCATCCACTGTTCCTCGCTTACCTCCGCCTCTTTCTTCTCTTCCACCAACACCTAGGATCAGCCAAATTAAGCAAGCCCATGCCCGTACTGTCGTTTTCGGCCTTGCTAACGATGGACGCATCACGGCTCACCTCCTCGCTTTTCTTGCCATTTCTTCCTCTTCACTGCCCTGTGGGTACCCCTTGTCAATTTATCATTCTATTGCTCATCCAAGTGTTTTTGCAACCAATAACATGATACGGTGCTTCGCTAAAGGGGACTTACCTCTCGAGTCCATTTCTCTTTACTCGCACATGCGCCGAAGTTTTGTGGCGGCGCCTAATAAATATACTCTCACCTTTGTGTTGCAAGCTTGCAGTAACGCTTTGGCCATCGGGGAAGGGGTTCAAGTTCAAACCCATGTGATTAAACTTGGTTTTGTCAATGACGTTTTCGTCCGAAATGCGTTGATTCACTTGTATTGTACTTGTTGTAGAGTGGAATCTGCGAAACAGGTGTTTGATGAAATTCCTAGTAGTCGAGATGTAGTATCTTGGAATTCAATGATAGCTGGGTTTGTTAGAGATGGACAAATCAATGTTGCAGAGAAACTGTTTGTTGAAATGCCTGAGAAAGATGTGATCTCATGGGGCTCGATTATATCTGGGTGTGTTCAAAATGGGGAATTGGATAAGGCATTAGACTACTTTAAAGAGATGGGAGAGCAAAAAATGAGACCGAATGAGGCAATATTGGTGTCCTTGCTCGCAGCAGCAGCCCAGCTTGGTATGCTTGAGTATGGGAAAATGATCCATTCCATTGCAAACTCACTGAGATTCCCAATGACTGCTTCTCTTGGCACAGCACTAATTGACATGTATGCTAAGTGCGGATGCATTGATGAGTCCAAATTCTTATTTGATAGAATGCCTGAGAAGGATAAATGGTCTTGGAATGTTATGATTTGTGGGCTAGCATCTCATGGTCTTGGGCAAGAAGCGCTTGCATTATTTGAGAAGTTTCTAACTCAAGGTTTCTACCCAGTCAACGTGACATTCATTGGAGTCTTGAATGCTTGTAGCCGAGCTGGTTTAGTCAGTGAGGGAAGACGCTTCTTTAAGCTAATGACGGACACATATGGCATTGAACCAGAGATGGAACACTATGGTTGCATGGTTGATCTCTTAAGCCGTGCCGGGTTCGTTTATGATGCTGTAGAAATGATTAACAGGATGCCTGCTCCTCCGGACCCTGTCTTGTGGGCAACGGTGCTTGGTTCTTGCAAGGTTCACGGATTTATAGAACTGGGTGAAGAGATTGGGAACAAGTTAATTCAAATGGATCCGACTCACAATGGGCATTATGTCCAGTTGGCTAGTATCTTTGCCAGACTAAGAAAGTGGGAAGACGTAAGCAAGGTTAGAAGACTAATGGCTGAAAGAAACTCTAACAAAATTGCTGGCTGGAGCTTGATTGAAGCAGAAGGAAGAGTTCACCGATTTGTCGCCGGAGATAAGGAGCATGAGCGAACTACTGAGATCTACAAGATTTTGGAGATAATTGGAGTAAGAATAGCAGCAGCAGGATACACAGCAAATGTTTCATCAGTTCTGCATGACATTGAGGAAGAAGAAAAAGAAACTGCCATTAAAGAGCACAGTGAAAGGTTGGCAATTGCTTTTGGCTTACTGGTGACTAAAGTTGGGGATTGTATTCGTATTATCAAGAATTTAAGAGTTTGTGGCGATTGCCATGAGGTAAGCAAGATAATTTCTCAAGTCTTTGAAAGAGAGATCATTGTTAGAGATGGCAGTAGATTTCACCATTTTAAGAATGGTAGTTGTTCTTGTCAAGATTATTGGTGA

Protein sequence

MLLYRPKFFFWNSKQRLNFHVSSTVPRLPPPLSSLPPTPRISQIKQAHARTVVFGLANDGRITAHLLAFLAISSSSLPCGYPLSIYHSIAHPSVFATNNMIRCFAKGDLPLESISLYSHMRRSFVAAPNKYTLTFVLQACSNALAIGEGVQVQTHVIKLGFVNDVFVRNALIHLYCTCCRVESAKQVFDEIPSSRDVVSWNSMIAGFVRDGQINVAEKLFVEMPEKDVISWGSIISGCVQNGELDKALDYFKEMGEQKMRPNEAILVSLLAAAAQLGMLEYGKMIHSIANSLRFPMTASLGTALIDMYAKCGCIDESKFLFDRMPEKDKWSWNVMICGLASHGLGQEALALFEKFLTQGFYPVNVTFIGVLNACSRAGLVSEGRRFFKLMTDTYGIEPEMEHYGCMVDLLSRAGFVYDAVEMINRMPAPPDPVLWATVLGSCKVHGFIELGEEIGNKLIQMDPTHNGHYVQLASIFARLRKWEDVSKVRRLMAERNSNKIAGWSLIEAEGRVHRFVAGDKEHERTTEIYKILEIIGVRIAAAGYTANVSSVLHDIEEEEKETAIKEHSERLAIAFGLLVTKVGDCIRIIKNLRVCGDCHEVSKIISQVFEREIIVRDGSRFHHFKNGSCSCQDYW
BLAST of Cla97C01G023510 vs. NCBI nr
Match: XP_008439760.2 (PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Cucumis melo])

HSP 1 Score: 1111.3 bits (2873), Expect = 0.0e+00
Identity = 585/637 (91.84%), Positives = 606/637 (95.13%), Query Frame = 0

Query: 1   MLLYRPKFFFWNSKQRLNFHVSSTV--PRLPPPLSSLPPTPRISQIKQAHARTVVFGLAN 60
           MLL RP FFFW SKQRLNFH+ STV  PRLP PLSSLPPTP I+QIKQAHART+VFGLAN
Sbjct: 1   MLLCRPNFFFWTSKQRLNFHLFSTVPNPRLPSPLSSLPPTPGITQIKQAHARTIVFGLAN 60

Query: 61  DGRITAHLLAFLAISSSSLPCGYPLSIYHSIAHPSVFATNNMIRCFAKGDLPLESISLYS 120
           DGRIT HLLAFLAISSSSLP  Y LSIY+SI HPSVFATNNMIRCF KGDLP  SISLYS
Sbjct: 61  DGRITPHLLAFLAISSSSLPSDYALSIYNSIPHPSVFATNNMIRCFVKGDLPRHSISLYS 120

Query: 121 HMRRSFVAAPNKYTLTFVLQACSNALAIGEGVQVQTHVIKLGFVNDVFVRNALIHLYCTC 180
           HM RSF  APNK+TLTFVLQACSNALAI EG QVQTHVIKLGFV DVFVRNALIHLYCTC
Sbjct: 121 HMCRSFEVAPNKHTLTFVLQACSNALAIREGAQVQTHVIKLGFVKDVFVRNALIHLYCTC 180

Query: 181 CRVESAKQVFDEIPSSRDVVSWNSMIAGFVRDGQINVAEKLFVEMPXXXXXXXXXXXXXX 240
           CRVESAKQVFDE+PSSRDVVSWNSMIAGFVR GQI+ A+KLFVEMPXXXXXXXXXXXXXX
Sbjct: 181 CRVESAKQVFDEVPSSRDVVSWNSMIAGFVRHGQISDAQKLFVEMPXXXXXXXXXXXXXX 240

Query: 241 XXXXXXXXXXXXXXXXXXXXXRPNEAILVSLLAAAAQLGMLEYGKMIHSIANSLRFPMTA 300
           XXXXXXXXXXXXXXXXXXXX RPNEAILVSLLAAAAQLG LEYGKMIHSIA+SLRFPMTA
Sbjct: 241 XXXXXXXXXXXXXXXXXXXXLRPNEAILVSLLAAAAQLGTLEYGKMIHSIADSLRFPMTA 300

Query: 301 SLGTALIDMYAKCGCIDESKFLFDRMPEKDKWSWNVMICGLASHGLGQEALALFEKFLTQ 360
           SLGTAL+DMYAKCGCIDES+FLFDRMPEKDKWSWNVMICGLA+HGLGQEALALFEKFLTQ
Sbjct: 301 SLGTALVDMYAKCGCIDESRFLFDRMPEKDKWSWNVMICGLATHGLGQEALALFEKFLTQ 360

Query: 361 GFYPVNVTFIGVLNACSRAGLVSEGRRFFKLMTDTYGIEPEMEHYGCMVDLLSRAGFVYD 420
           GF+P+NVTFIGVL ACSRAGLVSEGR FFKLMTDTYGIEPEMEHYGCMVDLLSRAGFVYD
Sbjct: 361 GFHPINVTFIGVLTACSRAGLVSEGRHFFKLMTDTYGIEPEMEHYGCMVDLLSRAGFVYD 420

Query: 421 AVEMINRMPAPPDPVLWATVLGSCKVHGFIELGEEIGNKLIQMDPTHNGHYVQLASIFAR 480
           AVEMINRMPAPPDPVLWA+VLGSC+VHGF ELGEEIGNKLIQMDPTHNGHYVQLA IFAR
Sbjct: 421 AVEMINRMPAPPDPVLWASVLGSCQVHGFAELGEEIGNKLIQMDPTHNGHYVQLARIFAR 480

Query: 481 LRKWEDVSKVRRLMAERNSNKIAGWSLIEAEGRVHRFVAGDKEHERTTEIYKILEIIGVR 540
           LRKWEDVSKVRRLMAERNSNK+AGWSLIEAEGRVH+FVAGDKEHERTTEIYK+LEIIGVR
Sbjct: 481 LRKWEDVSKVRRLMAERNSNKVAGWSLIEAEGRVHQFVAGDKEHERTTEIYKMLEIIGVR 540

Query: 541 IAAAGYTANVSSVLHDIEEEEKETAIKEHSERLAIAFGLLVTKVGDCIRIIKNLRVCGDC 600
           IAAAGY+ANV+SVLHDIEEEEKE AIKEHSERLAIAFGLLVTKVGDCIRIIKNLRVCGDC
Sbjct: 541 IAAAGYSANVTSVLHDIEEEEKENAIKEHSERLAIAFGLLVTKVGDCIRIIKNLRVCGDC 600

Query: 601 HEVSKIISQVFEREIIVRDGSRFHHFKNGSCSCQDYW 636
           HEVSKIISQVFEREIIVRDGSRFHHFKNGSCSCQDYW
Sbjct: 601 HEVSKIISQVFEREIIVRDGSRFHHFKNGSCSCQDYW 637
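
The percentages in each HSP summary line are the reported counts divided by the number of alignment columns. A minimal sketch of that arithmetic follows, using the figures from the hit above; the function names `parse_hsp_stats` and `percent` are hypothetical helpers, not part of any BLAST tool.

```python
import re

def parse_hsp_stats(line):
    """Return a list of (count, columns, reported_pct) for each 'x/y (z%)' item."""
    found = re.findall(r"(\d+)/(\d+) \(([\d.]+)%\)", line)
    return [(int(num), int(den), float(pct)) for num, den, pct in found]

def percent(count, columns):
    """Percentage of alignment columns, rounded to two decimals."""
    return round(100.0 * count / columns, 2)

summary = "Identity = 585/637 (91.84%), Positives = 606/637 (95.13%)"
for count, columns, reported in parse_hsp_stats(summary):
    # Recompute each percentage and show it next to the reported value.
    print(count, columns, reported, percent(count, columns))
```

The denominator is the alignment length, gap columns included, which is why it can exceed the length of either sequence.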

BLAST of Cla97C01G023510 vs. NCBI nr
Match: XP_004134932.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Cucumis sativus] >KGN49291.1 hypothetical protein Csa_6G519440 [Cucumis sativus])

HSP 1 Score: 1095.1 bits (2831), Expect = 0.0e+00
Identity = 582/638 (91.22%), Positives = 603/638 (94.51%), Query Frame = 0

Query: 1   MLLYRPKF-FFWNSKQRLNFHVSSTV--PRLPPPLSSLPPTPRISQIKQAHARTVVFGLA 60
           MLL RP F FFW SKQRLNFH  ST+  PRLPPP SSLPPTP I+QIKQAHAR +V GLA
Sbjct: 1   MLLCRPNFLFFWISKQRLNFHFFSTLPNPRLPPPFSSLPPTPGITQIKQAHARILVLGLA 60

Query: 61  NDGRITAHLLAFLAISSSSLPCGYPLSIYHSIAHPSVFATNNMIRCFAKGDLPLESISLY 120
           NDGRIT+HLLAFLAISSSSLP  Y LSIY+SI+HP+VFATNNMIRCF KGDLP  SISLY
Sbjct: 61  NDGRITSHLLAFLAISSSSLPSDYALSIYNSISHPTVFATNNMIRCFVKGDLPRHSISLY 120

Query: 121 SHMRRSFVAAPNKYTLTFVLQACSNALAIGEGVQVQTHVIKLGFVNDVFVRNALIHLYCT 180
           SHM RSFVAAPNK+TLTFVLQACSNA AI EG QVQTHVIKLGFV DVFVRNALIHLYCT
Sbjct: 121 SHMCRSFVAAPNKHTLTFVLQACSNAFAIREGAQVQTHVIKLGFVKDVFVRNALIHLYCT 180

Query: 181 CCRVESAKQVFDEIPSSRDVVSWNSMIAGFVRDGQINVAEKLFVEMPXXXXXXXXXXXXX 240
           CCRVESAKQVFDE+PSSRDVVSWNSMI GFVR GQI+VA+KLFVEMPXXXXXXXXXXXXX
Sbjct: 181 CCRVESAKQVFDEVPSSRDVVSWNSMIVGFVRLGQISVAQKLFVEMPXXXXXXXXXXXXX 240

Query: 241 XXXXXXXXXXXXXXXXXXXXXXRPNEAILVSLLAAAAQLGMLEYGKMIHSIANSLRFPMT 300
           XXXXXXXXXXXXXXXXXXXXXXRPNEAILVSLLAAAAQLG LEYGK IHSIANSLRFPMT
Sbjct: 241 XXXXXXXXXXXXXXXXXXXXXXRPNEAILVSLLAAAAQLGTLEYGKRIHSIANSLRFPMT 300

Query: 301 ASLGTALIDMYAKCGCIDESKFLFDRMPEKDKWSWNVMICGLASHGLGQEALALFEKFLT 360
           ASLGTAL+DMYAKCGCIDES+FLFDRMPEKDKWSWNVMICGLA+HGLGQEALALFEKFLT
Sbjct: 301 ASLGTALVDMYAKCGCIDESRFLFDRMPEKDKWSWNVMICGLATHGLGQEALALFEKFLT 360

Query: 361 QGFYPVNVTFIGVLNACSRAGLVSEGRRFFKLMTDTYGIEPEMEHYGCMVDLLSRAGFVY 420
           QGF+PVNVTFIGVL ACSRAGLVSEG+ FFKLMTDTYGIEPEMEHYGCMVDLLSRAGFVY
Sbjct: 361 QGFHPVNVTFIGVLTACSRAGLVSEGKHFFKLMTDTYGIEPEMEHYGCMVDLLSRAGFVY 420

Query: 421 DAVEMINRMPAPPDPVLWATVLGSCKVHGFIELGEEIGNKLIQMDPTHNGHYVQLASIFA 480
           DAVEMINRMPAPPDPVLWA+VLGSC+VHGFIELGEEIGNKLIQMDPTHNGHYVQLA IFA
Sbjct: 421 DAVEMINRMPAPPDPVLWASVLGSCQVHGFIELGEEIGNKLIQMDPTHNGHYVQLARIFA 480

Query: 481 RLRKWEDVSKVRRLMAERNSNKIAGWSLIEAEGRVHRFVAGDKEHERTTEIYKILEIIGV 540
           RLRKWEDVSKVRRLMAERNSNKIAGWSLIEAEGRVHRFVAGDKEHERTTEIYK+LEI+GV
Sbjct: 481 RLRKWEDVSKVRRLMAERNSNKIAGWSLIEAEGRVHRFVAGDKEHERTTEIYKMLEIMGV 540

Query: 541 RIAAAGYTANVSSVLHDIEEEEKETAIKEHSERLAIAFGLLVTKVGDCIRIIKNLRVCGD 600
           RIAAAGY+ANVSSVLHDIEEEEKE AIKEHSERLAIAFGLLVTK GDCIRIIKNLRVCGD
Sbjct: 541 RIAAAGYSANVSSVLHDIEEEEKENAIKEHSERLAIAFGLLVTKDGDCIRIIKNLRVCGD 600

Query: 601 CHEVSKIISQVFEREIIVRDGSRFHHFKNGSCSCQDYW 636
           CHEVSKIIS VFEREIIVRDGSRFHHFK G CSCQDYW
Sbjct: 601 CHEVSKIISLVFEREIIVRDGSRFHHFKKGICSCQDYW 638

BLAST of Cla97C01G023510 vs. NCBI nr
Match: XP_022142302.1 (pentatricopeptide repeat-containing protein At3g62890-like [Momordica charantia])

HSP 1 Score: 1085.5 bits (2806), Expect = 0.0e+00
Identity = 567/635 (89.29%), Positives = 592/635 (93.23%), Query Frame = 0

Query: 1   MLLYRPKFFFWNSKQRLNFHVSSTVPRLPPPLSSLPPTPRISQIKQAHARTVVFGLANDG 60
           ML+++PKFFFW SKQRLNFHV STV  LPPPLSSLPPTP I+QIKQAHAR+VVFGLANDG
Sbjct: 1   MLVWKPKFFFWTSKQRLNFHVLSTVSHLPPPLSSLPPTPGINQIKQAHARSVVFGLANDG 60

Query: 61  RITAHLLAFLAISSSSLPCGYPLSIYHSIAHPSVFATNNMIRCFAKGDLPLESISLYSHM 120
           RI  HLLAFLAISSSSLP  Y  SIY SIA PSVFATNNMIRCFAKGDLP ESISLYSHM
Sbjct: 61  RIMGHLLAFLAISSSSLPYEYAWSIYQSIALPSVFATNNMIRCFAKGDLPRESISLYSHM 120

Query: 121 RRSFVAAPNKYTLTFVLQACSNALAIGEGVQVQTHVIKLGFVNDVFVRNALIHLYCTCCR 180
           RRSFV  PNK+TLTFVLQACSNALAI EG+QVQTHVIK GFV DVFVRNALIHLYCTCCR
Sbjct: 121 RRSFV-EPNKHTLTFVLQACSNALAICEGIQVQTHVIKFGFVKDVFVRNALIHLYCTCCR 180

Query: 181 VESAKQVFDEIPSSRDVVSWNSMIAGFVRDGQINVAEKLFVEMPXXXXXXXXXXXXXXXX 240
           VE A+ VFDEIP  RDVVSWNSMIAG VRDGQI VAEKLFVEMP XXXXXXXXXXXXXXX
Sbjct: 181 VECARLVFDEIPGGRDVVSWNSMIAGLVRDGQIYVAEKLFVEMPHXXXXXXXXXXXXXXX 240

Query: 241 XXXXXXXXXXXXXXXXXXXRPNEAILVSLLAAAAQLGMLEYGKMIHSIANSLRFPMTASL 300
           XXXXXXXXXXXXXX     RPNEAILVSLLAAAAQLGMLEYGKMIHSIA+SL+FPMTASL
Sbjct: 241 XXXXXXXXXXXXXXREQKMRPNEAILVSLLAAAAQLGMLEYGKMIHSIADSLKFPMTASL 300

Query: 301 GTALIDMYAKCGCIDESKFLFDRMPEKDKWSWNVMICGLASHGLGQEALALFEKFLTQGF 360
           GTAL+DMYAKCGCIDESKFLFDRMP+KDKWSWNVMICGLASHGLGQEALALFEKF T+GF
Sbjct: 301 GTALVDMYAKCGCIDESKFLFDRMPQKDKWSWNVMICGLASHGLGQEALALFEKFTTEGF 360

Query: 361 YPVNVTFIGVLNACSRAGLVSEGRRFFKLMTDTYGIEPEMEHYGCMVDLLSRAGFVYDAV 420
           YPVNVTFIGVLNACSRAGLVS GR FFKLMTDTY IEPEMEHYGCMVDL SRAG VYDAV
Sbjct: 361 YPVNVTFIGVLNACSRAGLVSAGRHFFKLMTDTYCIEPEMEHYGCMVDLFSRAGLVYDAV 420

Query: 421 EMINRMPAPPDPVLWATVLGSCKVHGFIELGEEIGNKLIQMDPTHNGHYVQLASIFARLR 480
           EMI+RMPA PDPVLWATVLGSCKVHGF+ELGEEIGNKL+QMDP+HNGHYVQL+SI+A LR
Sbjct: 421 EMIDRMPAAPDPVLWATVLGSCKVHGFVELGEEIGNKLVQMDPSHNGHYVQLSSIYATLR 480

Query: 481 KWEDVSKVRRLMAERNSNKIAGWSLIEAEGRVHRFVAGDKEHERTTEIYKILEIIGVRIA 540
           KWEDVSKVRRLMAERN+NKIAGWSLIEAEGRVHRF AGDKEHER TEIYK+LEIIGVRIA
Sbjct: 481 KWEDVSKVRRLMAERNTNKIAGWSLIEAEGRVHRFFAGDKEHERCTEIYKMLEIIGVRIA 540

Query: 541 AAGYTANVSSVLHDIEEEEKETAIKEHSERLAIAFGLLVTKVGDCIRIIKNLRVCGDCHE 600
           AAGY+AN+SSVLHDIEEEEKETAIKEHSERLAIAFGLLVT+ GDCIRIIKNLRVCGDCHE
Sbjct: 541 AAGYSANLSSVLHDIEEEEKETAIKEHSERLAIAFGLLVTRAGDCIRIIKNLRVCGDCHE 600

Query: 601 VSKIISQVFEREIIVRDGSRFHHFKNGSCSCQDYW 636
           VSKIISQVF+REIIVRDGSRFHHFKNGSCSC DYW
Sbjct: 601 VSKIISQVFQREIIVRDGSRFHHFKNGSCSCLDYW 634

BLAST of Cla97C01G023510 vs. NCBI nr
Match: XP_022926870.1 (pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita moschata])

HSP 1 Score: 1078.5 bits (2788), Expect = 0.0e+00
Identity = 570/635 (89.76%), Positives = 597/635 (94.02%), Query Frame = 0

Query: 1   MLLYRPKFFFWNSKQRLNFHVSSTVPRLPPPLSSLPPTPRISQIKQAHARTVVFGLANDG 60
           MLL +PKF FW SKQRLN HV STV  LPPPLSSLPP   I+QIKQAHAR+VVFGLANDG
Sbjct: 1   MLLCKPKFIFWTSKQRLNVHVFSTVSHLPPPLSSLPPISGITQIKQAHARSVVFGLANDG 60

Query: 61  RITAHLLAFLAISSSSLPCGYPLSIYHSIAHPSVFATNNMIRCFAKGDLPLESISLYSHM 120
           RI  HLLAFLA+SSSSLP  Y  SIY SI+HPSVFATNNMIRC AK +L  ESISLYSHM
Sbjct: 61  RIMGHLLAFLAVSSSSLPYEYAFSIYQSISHPSVFATNNMIRCCAKEELSCESISLYSHM 120

Query: 121 RRSFVAAPNKYTLTFVLQACSNALAIGEGVQVQTHVIKLGFVNDVFVRNALIHLYCTCCR 180
           RRS V APNK+TLTFVLQACSNALAI EG+QVQTHVIK GF  DVF+RNALIHLYCT CR
Sbjct: 121 RRSLV-APNKHTLTFVLQACSNALAICEGIQVQTHVIKFGFAKDVFIRNALIHLYCTHCR 180

Query: 181 VESAKQVFDEIPSSRDVVSWNSMIAGFVRDGQINVAEKLFVEMPXXXXXXXXXXXXXXXX 240
           VE AKQVFDE+PSSRD+VSWNSMIAGFVR GQINVA+KLFVEMPXXXXXXXXXXXXXXXX
Sbjct: 181 VECAKQVFDEVPSSRDIVSWNSMIAGFVRAGQINVADKLFVEMPXXXXXXXXXXXXXXXX 240

Query: 241 XXXXXXXXXXXXXXXXXXXRPNEAILVSLLAAAAQLGMLEYGKMIHSIANSLRFPMTASL 300
           XXXXXXXXXXXXXXXXXXXRPNEAILVS+LAAA+QLGMLEYGKMIHSIA+SL+FPMTASL
Sbjct: 241 XXXXXXXXXXXXXXXXXXXRPNEAILVSMLAAASQLGMLEYGKMIHSIADSLKFPMTASL 300

Query: 301 GTALIDMYAKCGCIDESKFLFDRMPEKDKWSWNVMICGLASHGLGQEALALFEKFLTQGF 360
           GTAL+DMYAKCGCIDESKFLFDRMP+KDKW+WNVMICGLASHGLGQEALALFEKFLTQGF
Sbjct: 301 GTALVDMYAKCGCIDESKFLFDRMPQKDKWTWNVMICGLASHGLGQEALALFEKFLTQGF 360

Query: 361 YPVNVTFIGVLNACSRAGLVSEGRRFFKLMTDTYGIEPEMEHYGCMVDLLSRAGFVYDAV 420
           YPVNVTFIGVLNACSRAGLVSEGRRFFKLMTDTY I PEMEHYGCMVDL SRAGFVYDAV
Sbjct: 361 YPVNVTFIGVLNACSRAGLVSEGRRFFKLMTDTYKIIPEMEHYGCMVDLFSRAGFVYDAV 420

Query: 421 EMINRMPAPPDPVLWATVLGSCKVHGFIELGEEIGNKLIQMDPTHNGHYVQLASIFARLR 480
           EMINRMPAPPDPVLWATVLGSCKVHGFIELGEEIGNKLIQMDPTHNGHYVQLA I+ARLR
Sbjct: 421 EMINRMPAPPDPVLWATVLGSCKVHGFIELGEEIGNKLIQMDPTHNGHYVQLAGIYARLR 480

Query: 481 KWEDVSKVRRLMAERNSNKIAGWSLIEAEGRVHRFVAGDKEHERTTEIYKILEIIGVRIA 540
           KWEDVSK+RRLMA+RNSNKIAGWSLIEA GRVHRFVAGDKEHE+ TEIYK+LE IGVRIA
Sbjct: 481 KWEDVSKIRRLMADRNSNKIAGWSLIEAGGRVHRFVAGDKEHEQCTEIYKMLETIGVRIA 540

Query: 541 AAGYTANVSSVLHDIEEEEKETAIKEHSERLAIAFGLLVTKVGDCIRIIKNLRVCGDCHE 600
           AAGY+ANVSSVLHDIEEEEKETAIKEHSERLAIAFGLLVT+VGDCIRIIKNLRVCGDCHE
Sbjct: 541 AAGYSANVSSVLHDIEEEEKETAIKEHSERLAIAFGLLVTQVGDCIRIIKNLRVCGDCHE 600

Query: 601 VSKIISQVFEREIIVRDGSRFHHFKNGSCSCQDYW 636
           VSKIIS+VFEREIIVRDGSRFHHFKNGSCSC DYW
Sbjct: 601 VSKIISRVFEREIIVRDGSRFHHFKNGSCSCLDYW 634

BLAST of Cla97C01G023510 vs. NCBI nr
Match: XP_023004098.1 (pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita maxima])

HSP 1 Score: 1071.2 bits (2769), Expect = 1.3e-309
Identity = 567/635 (89.29%), Positives = 595/635 (93.70%), Query Frame = 0

Query: 1   MLLYRPKFFFWNSKQRLNFHVSSTVPRLPPPLSSLPPTPRISQIKQAHARTVVFGLANDG 60
           MLL +PKF FW S QRLNFHV STV  LPPPLSSLPP   ISQIKQAHAR+VVFGLANDG
Sbjct: 1   MLLCKPKFIFWTSTQRLNFHVFSTVSHLPPPLSSLPPISGISQIKQAHARSVVFGLANDG 60

Query: 61  RITAHLLAFLAISSSSLPCGYPLSIYHSIAHPSVFATNNMIRCFAKGDLPLESISLYSHM 120
           RI  HLLAFLA+SSSSLP  Y  SI+ SI+HPSVFATNNMIRC AK +L  ESISLYSHM
Sbjct: 61  RIMGHLLAFLAVSSSSLPYEYAFSIHQSISHPSVFATNNMIRCCAKEELSCESISLYSHM 120

Query: 121 RRSFVAAPNKYTLTFVLQACSNALAIGEGVQVQTHVIKLGFVNDVFVRNALIHLYCTCCR 180
           RR+ V APNK+TLTFVLQACSNALAI EG+QVQTHVIKLGF  DVF+RNALIHLYCT CR
Sbjct: 121 RRNLV-APNKHTLTFVLQACSNALAICEGIQVQTHVIKLGFAKDVFIRNALIHLYCTHCR 180

Query: 181 VESAKQVFDEIPSSRDVVSWNSMIAGFVRDGQINVAEKLFVEMPXXXXXXXXXXXXXXXX 240
           VE AKQ+FDE+PSSRD+VSWNSMIAG VR GQINVA+KLFVEMPXXXXXXXXXXXXXXXX
Sbjct: 181 VECAKQLFDEVPSSRDIVSWNSMIAGCVRAGQINVADKLFVEMPXXXXXXXXXXXXXXXX 240

Query: 241 XXXXXXXXXXXXXXXXXXXRPNEAILVSLLAAAAQLGMLEYGKMIHSIANSLRFPMTASL 300
           XXXXXXXXXXXXXXXXXXXRPNEAILVS+LAAA+QLGMLEYGKMIHSIA+SL+F MTASL
Sbjct: 241 XXXXXXXXXXXXXXXXXXXRPNEAILVSMLAAASQLGMLEYGKMIHSIADSLKFSMTASL 300

Query: 301 GTALIDMYAKCGCIDESKFLFDRMPEKDKWSWNVMICGLASHGLGQEALALFEKFLTQGF 360
           GTAL+DMYAKCGCIDESKFLFDRMP+KDKW+WNVMICGLASHGLGQEALALFEKFLTQGF
Sbjct: 301 GTALVDMYAKCGCIDESKFLFDRMPQKDKWTWNVMICGLASHGLGQEALALFEKFLTQGF 360

Query: 361 YPVNVTFIGVLNACSRAGLVSEGRRFFKLMTDTYGIEPEMEHYGCMVDLLSRAGFVYDAV 420
           YPVNVTFIGVLNACSRAGLVSEGRRFFKLMTDTY I PEMEHYGCMVDL SRAGFVYDAV
Sbjct: 361 YPVNVTFIGVLNACSRAGLVSEGRRFFKLMTDTYKIIPEMEHYGCMVDLFSRAGFVYDAV 420

Query: 421 EMINRMPAPPDPVLWATVLGSCKVHGFIELGEEIGNKLIQMDPTHNGHYVQLASIFARLR 480
           EMINRMPAPPDPVLW TVLGSCKVHGFIELGEEIGNKLIQMDPTHNGHYVQLA I+ARLR
Sbjct: 421 EMINRMPAPPDPVLWTTVLGSCKVHGFIELGEEIGNKLIQMDPTHNGHYVQLAGIYARLR 480

Query: 481 KWEDVSKVRRLMAERNSNKIAGWSLIEAEGRVHRFVAGDKEHERTTEIYKILEIIGVRIA 540
           KWEDVSK+RRLMA+RNSNKIAGWSLIEA GRVHRFVAGDKEHER TEIYK+LE IGVRIA
Sbjct: 481 KWEDVSKIRRLMADRNSNKIAGWSLIEAGGRVHRFVAGDKEHERCTEIYKMLETIGVRIA 540

Query: 541 AAGYTANVSSVLHDIEEEEKETAIKEHSERLAIAFGLLVTKVGDCIRIIKNLRVCGDCHE 600
           AAGY+ANVSSVLHDIEEEEKETAIKEHSERLAIAFGLLVT+VGDCIRIIKNLRVCGDCHE
Sbjct: 541 AAGYSANVSSVLHDIEEEEKETAIKEHSERLAIAFGLLVTQVGDCIRIIKNLRVCGDCHE 600

Query: 601 VSKIISQVFEREIIVRDGSRFHHFKNGSCSCQDYW 636
           VSKIIS+VFEREIIVRDGSRFHHFKNGSCSC DYW
Sbjct: 601 VSKIISRVFEREIIVRDGSRFHHFKNGSCSCLDYW 634

BLAST of Cla97C01G023510 vs. TrEMBL
Match: tr|A0A1S3AZ38|A0A1S3AZ38_CUCME (pentatricopeptide repeat-containing protein At4g21065-like OS=Cucumis melo OX=3656 GN=LOC103484461 PE=4 SV=1)

HSP 1 Score: 1111.3 bits (2873), Expect = 0.0e+00
Identity = 585/637 (91.84%), Positives = 606/637 (95.13%), Query Frame = 0

Query: 1   MLLYRPKFFFWNSKQRLNFHVSSTV--PRLPPPLSSLPPTPRISQIKQAHARTVVFGLAN 60
           MLL RP FFFW SKQRLNFH+ STV  PRLP PLSSLPPTP I+QIKQAHART+VFGLAN
Sbjct: 1   MLLCRPNFFFWTSKQRLNFHLFSTVPNPRLPSPLSSLPPTPGITQIKQAHARTIVFGLAN 60

Query: 61  DGRITAHLLAFLAISSSSLPCGYPLSIYHSIAHPSVFATNNMIRCFAKGDLPLESISLYS 120
           DGRIT HLLAFLAISSSSLP  Y LSIY+SI HPSVFATNNMIRCF KGDLP  SISLYS
Sbjct: 61  DGRITPHLLAFLAISSSSLPSDYALSIYNSIPHPSVFATNNMIRCFVKGDLPRHSISLYS 120

Query: 121 HMRRSFVAAPNKYTLTFVLQACSNALAIGEGVQVQTHVIKLGFVNDVFVRNALIHLYCTC 180
           HM RSF  APNK+TLTFVLQACSNALAI EG QVQTHVIKLGFV DVFVRNALIHLYCTC
Sbjct: 121 HMCRSFEVAPNKHTLTFVLQACSNALAIREGAQVQTHVIKLGFVKDVFVRNALIHLYCTC 180

Query: 181 CRVESAKQVFDEIPSSRDVVSWNSMIAGFVRDGQINVAEKLFVEMPXXXXXXXXXXXXXX 240
           CRVESAKQVFDE+PSSRDVVSWNSMIAGFVR GQI+ A+KLFVEMPXXXXXXXXXXXXXX
Sbjct: 181 CRVESAKQVFDEVPSSRDVVSWNSMIAGFVRHGQISDAQKLFVEMPXXXXXXXXXXXXXX 240

Query: 241 XXXXXXXXXXXXXXXXXXXXXRPNEAILVSLLAAAAQLGMLEYGKMIHSIANSLRFPMTA 300
           XXXXXXXXXXXXXXXXXXXX RPNEAILVSLLAAAAQLG LEYGKMIHSIA+SLRFPMTA
Sbjct: 241 XXXXXXXXXXXXXXXXXXXXLRPNEAILVSLLAAAAQLGTLEYGKMIHSIADSLRFPMTA 300

Query: 301 SLGTALIDMYAKCGCIDESKFLFDRMPEKDKWSWNVMICGLASHGLGQEALALFEKFLTQ 360
           SLGTAL+DMYAKCGCIDES+FLFDRMPEKDKWSWNVMICGLA+HGLGQEALALFEKFLTQ
Sbjct: 301 SLGTALVDMYAKCGCIDESRFLFDRMPEKDKWSWNVMICGLATHGLGQEALALFEKFLTQ 360

Query: 361 GFYPVNVTFIGVLNACSRAGLVSEGRRFFKLMTDTYGIEPEMEHYGCMVDLLSRAGFVYD 420
           GF+P+NVTFIGVL ACSRAGLVSEGR FFKLMTDTYGIEPEMEHYGCMVDLLSRAGFVYD
Sbjct: 361 GFHPINVTFIGVLTACSRAGLVSEGRHFFKLMTDTYGIEPEMEHYGCMVDLLSRAGFVYD 420

Query: 421 AVEMINRMPAPPDPVLWATVLGSCKVHGFIELGEEIGNKLIQMDPTHNGHYVQLASIFAR 480
           AVEMINRMPAPPDPVLWA+VLGSC+VHGF ELGEEIGNKLIQMDPTHNGHYVQLA IFAR
Sbjct: 421 AVEMINRMPAPPDPVLWASVLGSCQVHGFAELGEEIGNKLIQMDPTHNGHYVQLARIFAR 480

Query: 481 LRKWEDVSKVRRLMAERNSNKIAGWSLIEAEGRVHRFVAGDKEHERTTEIYKILEIIGVR 540
           LRKWEDVSKVRRLMAERNSNK+AGWSLIEAEGRVH+FVAGDKEHERTTEIYK+LEIIGVR
Sbjct: 481 LRKWEDVSKVRRLMAERNSNKVAGWSLIEAEGRVHQFVAGDKEHERTTEIYKMLEIIGVR 540

Query: 541 IAAAGYTANVSSVLHDIEEEEKETAIKEHSERLAIAFGLLVTKVGDCIRIIKNLRVCGDC 600
           IAAAGY+ANV+SVLHDIEEEEKE AIKEHSERLAIAFGLLVTKVGDCIRIIKNLRVCGDC
Sbjct: 541 IAAAGYSANVTSVLHDIEEEEKENAIKEHSERLAIAFGLLVTKVGDCIRIIKNLRVCGDC 600

Query: 601 HEVSKIISQVFEREIIVRDGSRFHHFKNGSCSCQDYW 636
           HEVSKIISQVFEREIIVRDGSRFHHFKNGSCSCQDYW
Sbjct: 601 HEVSKIISQVFEREIIVRDGSRFHHFKNGSCSCQDYW 637

BLAST of Cla97C01G023510 vs. TrEMBL
Match: tr|A0A0A0KNI8|A0A0A0KNI8_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G519440 PE=4 SV=1)

HSP 1 Score: 1095.1 bits (2831), Expect = 0.0e+00
Identity = 582/638 (91.22%), Positives = 603/638 (94.51%), Query Frame = 0

Query: 1   MLLYRPKF-FFWNSKQRLNFHVSSTV--PRLPPPLSSLPPTPRISQIKQAHARTVVFGLA 60
           MLL RP F FFW SKQRLNFH  ST+  PRLPPP SSLPPTP I+QIKQAHAR +V GLA
Sbjct: 1   MLLCRPNFLFFWISKQRLNFHFFSTLPNPRLPPPFSSLPPTPGITQIKQAHARILVLGLA 60

Query: 61  NDGRITAHLLAFLAISSSSLPCGYPLSIYHSIAHPSVFATNNMIRCFAKGDLPLESISLY 120
           NDGRIT+HLLAFLAISSSSLP  Y LSIY+SI+HP+VFATNNMIRCF KGDLP  SISLY
Sbjct: 61  NDGRITSHLLAFLAISSSSLPSDYALSIYNSISHPTVFATNNMIRCFVKGDLPRHSISLY 120

Query: 121 SHMRRSFVAAPNKYTLTFVLQACSNALAIGEGVQVQTHVIKLGFVNDVFVRNALIHLYCT 180
           SHM RSFVAAPNK+TLTFVLQACSNA AI EG QVQTHVIKLGFV DVFVRNALIHLYCT
Sbjct: 121 SHMCRSFVAAPNKHTLTFVLQACSNAFAIREGAQVQTHVIKLGFVKDVFVRNALIHLYCT 180

Query: 181 CCRVESAKQVFDEIPSSRDVVSWNSMIAGFVRDGQINVAEKLFVEMPXXXXXXXXXXXXX 240
           CCRVESAKQVFDE+PSSRDVVSWNSMI GFVR GQI+VA+KLFVEMPXXXXXXXXXXXXX
Sbjct: 181 CCRVESAKQVFDEVPSSRDVVSWNSMIVGFVRLGQISVAQKLFVEMPXXXXXXXXXXXXX 240

Query: 241 XXXXXXXXXXXXXXXXXXXXXXRPNEAILVSLLAAAAQLGMLEYGKMIHSIANSLRFPMT 300
           XXXXXXXXXXXXXXXXXXXXXXRPNEAILVSLLAAAAQLG LEYGK IHSIANSLRFPMT
Sbjct: 241 XXXXXXXXXXXXXXXXXXXXXXRPNEAILVSLLAAAAQLGTLEYGKRIHSIANSLRFPMT 300

Query: 301 ASLGTALIDMYAKCGCIDESKFLFDRMPEKDKWSWNVMICGLASHGLGQEALALFEKFLT 360
           ASLGTAL+DMYAKCGCIDES+FLFDRMPEKDKWSWNVMICGLA+HGLGQEALALFEKFLT
Sbjct: 301 ASLGTALVDMYAKCGCIDESRFLFDRMPEKDKWSWNVMICGLATHGLGQEALALFEKFLT 360

Query: 361 QGFYPVNVTFIGVLNACSRAGLVSEGRRFFKLMTDTYGIEPEMEHYGCMVDLLSRAGFVY 420
           QGF+PVNVTFIGVL ACSRAGLVSEG+ FFKLMTDTYGIEPEMEHYGCMVDLLSRAGFVY
Sbjct: 361 QGFHPVNVTFIGVLTACSRAGLVSEGKHFFKLMTDTYGIEPEMEHYGCMVDLLSRAGFVY 420

Query: 421 DAVEMINRMPAPPDPVLWATVLGSCKVHGFIELGEEIGNKLIQMDPTHNGHYVQLASIFA 480
           DAVEMINRMPAPPDPVLWA+VLGSC+VHGFIELGEEIGNKLIQMDPTHNGHYVQLA IFA
Sbjct: 421 DAVEMINRMPAPPDPVLWASVLGSCQVHGFIELGEEIGNKLIQMDPTHNGHYVQLARIFA 480

Query: 481 RLRKWEDVSKVRRLMAERNSNKIAGWSLIEAEGRVHRFVAGDKEHERTTEIYKILEIIGV 540
           RLRKWEDVSKVRRLMAERNSNKIAGWSLIEAEGRVHRFVAGDKEHERTTEIYK+LEI+GV
Sbjct: 481 RLRKWEDVSKVRRLMAERNSNKIAGWSLIEAEGRVHRFVAGDKEHERTTEIYKMLEIMGV 540

Query: 541 RIAAAGYTANVSSVLHDIEEEEKETAIKEHSERLAIAFGLLVTKVGDCIRIIKNLRVCGD 600
           RIAAAGY+ANVSSVLHDIEEEEKE AIKEHSERLAIAFGLLVTK GDCIRIIKNLRVCGD
Sbjct: 541 RIAAAGYSANVSSVLHDIEEEEKENAIKEHSERLAIAFGLLVTKDGDCIRIIKNLRVCGD 600

Query: 601 CHEVSKIISQVFEREIIVRDGSRFHHFKNGSCSCQDYW 636
           CHEVSKIIS VFEREIIVRDGSRFHHFK G CSCQDYW
Sbjct: 601 CHEVSKIISLVFEREIIVRDGSRFHHFKKGICSCQDYW 638

BLAST of Cla97C01G023510 vs. TrEMBL
Match: tr|A0A251RKX3|A0A251RKX3_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_1G582700 PE=4 SV=1)

HSP 1 Score: 839.0 bits (2166), Expect = 7.3e-240
Identity = 443/638 (69.44%), Positives = 531/638 (83.23%), Query Frame = 0

Query: 1   MLLYRPKFFFWNSKQRLN---FHVSSTVPRLPPPLSSLPPTPRISQIKQAHARTVVFGLA 60
           M  Y+PK F  +  + ++     ++++  +L PPLSSLPP P I Q KQAHA+ +V GLA
Sbjct: 1   MFAYKPKSFLLSLPKHISNSQIFLANSNSQLSPPLSSLPPRPSIPQTKQAHAQIIVSGLA 60

Query: 61  NDGRITAHLLAFLAISSSSLPCGYPLSIYHSIAHPSVFATNNMIRCFAKGDLPLESISLY 120
            D  + +HLL FLA+S S+ P  Y LS+Y SI +PSVFATNNMIRCFAK D P +S+ L+
Sbjct: 61  ADSPLISHLLCFLALSPST-PFHYSLSLYQSIKYPSVFATNNMIRCFAKSDSPPQSLLLF 120

Query: 121 SHMRRSFVAAPNKYTLTFVLQACSNALAIGEGVQVQTHVIKLGFVNDVFVRNALIHLYCT 180
           S M R+ V  PN +T TF+LQACS ALA+ EG QV T  +KLGF   VFVRNALIHLYC+
Sbjct: 121 SSMLRTCV-KPNNHTFTFLLQACSRALALNEGAQVHTVAVKLGFGGYVFVRNALIHLYCS 180

Query: 181 CCRVESAKQVFDEIPSSRDVVSWNSMIAGFVRDGQINVAEKLFVEMPXXXXXXXXXXXXX 240
           C R+E +K++F+E  SSRDVV+WNSM+  FVRD QI  AEKLF    XXXXXXXXXXXXX
Sbjct: 181 CSRIECSKRLFEENASSRDVVTWNSMLTAFVRDEQIGAAEKLFXXXXXXXXXXXXXXXXX 240

Query: 241 XXXXXXXXXXXXXXXXXXXXXXRPNEAILVSLLAAAAQLGMLEYGKMIHSIANSLRFPMT 300
           XXXXXXXXXXXXXXXXXXXXX R NEA LVS+L+A+AQLG+LE+G+++HS+  SL FP+T
Sbjct: 241 XXXXXXXXXXXXXXXXXXXXXMRLNEATLVSVLSASAQLGLLEHGRLVHSLVESLNFPLT 300

Query: 301 ASLGTALIDMYAKCGCIDESKFLFDRMPEKDKWSWNVMICGLASHGLGQEALALFEKFLT 360
            SLGTALIDMYAKCGCI++SK LF  MP+KD W+WNVMICGLASHG+G+EALALF++F+ 
Sbjct: 301 VSLGTALIDMYAKCGCIEQSKLLFKNMPKKDIWTWNVMICGLASHGIGKEALALFQRFID 360

Query: 361 QGFYPVNVTFIGVLNACSRAGLVSEGRRFFKLMTDTYGIEPEMEHYGCMVDLLSRAGFVY 420
           +GF+PVNVTFIGVL ACSRAGLVSEGRR FKLMT+ Y I PEMEHYGCMVD+L RAGF+ 
Sbjct: 361 EGFHPVNVTFIGVLGACSRAGLVSEGRRHFKLMTEKYSILPEMEHYGCMVDMLGRAGFLD 420

Query: 421 DAVEMINRMPAPPDPVLWATVLGSCKVHGFIELGEEIGNKLIQMDPTHNGHYVQLASIFA 480
           +AV++I +M  PPDPVLWAT+LG+CK+HG IELGE+IG KL+++DPTH+GHYVQLASI+A
Sbjct: 421 EAVQLIEKMTVPPDPVLWATLLGACKIHGSIELGEKIGKKLLKLDPTHDGHYVQLASIYA 480

Query: 481 RLRKWEDVSKVRRLMAERNSNKIAGWSLIEAEGRVHRFVAGDKEHERTTEIYKILEIIGV 540
           + RKWEDV +VRRL+ E+N+NK AGWSLIEA+G VH+FVAGD+EHER+ EIYK+LE IG+
Sbjct: 481 KARKWEDVIRVRRLLVEQNTNKAAGWSLIEAQGTVHKFVAGDREHERSLEIYKMLEKIGI 540

Query: 541 RIAAAGYTANVSSVLHDIEEEEKETAIKEHSERLAIAFGLLVTKVGDCIRIIKNLRVCGD 600
           RIA +GY+ NVSSVLHDI EEEKE AIKEHSERLA+AFGLLVT  GDCIRI+KNLRVC D
Sbjct: 541 RIAESGYSPNVSSVLHDIGEEEKENAIKEHSERLAMAFGLLVTGAGDCIRIVKNLRVCED 600

Query: 601 CHEVSKIISQVFEREIIVRDGSRFHHFKNGSCSCQDYW 636
           CHEVSKIIS+VFEREIIVRDGSRFHHFK+G CSC DYW
Sbjct: 601 CHEVSKIISRVFEREIIVRDGSRFHHFKDGKCSCLDYW 636

BLAST of Cla97C01G023510 vs. TrEMBL
Match: tr|A0A2I4EEE8|A0A2I4EEE8_9ROSI (pentatricopeptide repeat-containing protein At5g66520-like OS=Juglans regia OX=51240 GN=LOC108988848 PE=4 SV=1)

HSP 1 Score: 833.6 bits (2152), Expect = 3.1e-238
Identity = 436/605 (72.07%), Positives = 514/605 (84.96%), Query Frame = 0

Query: 31  PLSSLPPTPRISQIKQAHARTVVFGLANDGRITAHLLAFLAISSSSLPCGYPLSIYHSIA 90
           PL SLPP P + Q KQAHAR +V GLA+D  +  HLL+ LA++ SS    Y LS+Y ++ 
Sbjct: 22  PLYSLPPRPSLFQTKQAHARIIVHGLASDAVLLGHLLSCLALAPSS-SLEYSLSVYRALD 81

Query: 91  HPSVFATNNMIRCFAKGDLPLESISLYSHMRRSFVAAPNKYTLTFVLQACSNALAIGEGV 150
           +P+VFA+NNMIRCFAK D P  S+ LYS MR+  VA PN +T TFVLQACS A AI EG 
Sbjct: 82  YPNVFASNNMIRCFAKSDSPRGSVVLYSSMRQRNVALPNSHTFTFVLQACSKAFAIHEGT 141

Query: 151 QVQTHVIKLGFVNDVFVRNALIHLYCTCCRVESAKQVFDEIPSSRDVVSWNSMIAGFVRD 210
           QV +HV+KLGF  DVF+ NALIHLY  CCR+E +K+VF E    RDVVSWNS++AG VRD
Sbjct: 142 QVHSHVVKLGFGVDVFITNALIHLYSACCRMECSKRVFQENIHHRDVVSWNSILAGLVRD 201

Query: 211 GQINVAEKLFVEMPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRPNEAILVSLL 270
           GQ+ VAE +F +M XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX PNEAILV++L
Sbjct: 202 GQVGVAETMFGKMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPNEAILVTVL 261

Query: 271 AAAAQLGMLEYGKMIHSIANSLRFPMTASLGTALIDMYAKCGCIDESKFLFDRMPEKDKW 330
           +A+AQ+G+LE+G+++HSI NSL F MT SLGTAL+DMYAKCGCI++SK LF+ M ++D  
Sbjct: 262 SASAQMGLLEHGRLVHSIINSLNFRMTTSLGTALVDMYAKCGCIEQSKLLFNSMHQRDIS 321

Query: 331 SWNVMICGLASHGLGQEALALFEKFLTQGFYPVNVTFIGVLNACSRAGLVSEGRRFFKLM 390
           SWNVMICGLASHGLG+EAL  FEKF+ +GF PVNVTFIGVLNACSRAGLVSEGR +FKLM
Sbjct: 322 SWNVMICGLASHGLGKEALEHFEKFVNEGFRPVNVTFIGVLNACSRAGLVSEGRHYFKLM 381

Query: 391 TDTYGIEPEMEHYGCMVDLLSRAGFVYDAVEMINRMPAPPDPVLWATVLGSCKVHGFIEL 450
           T+ Y IEPEMEHYGCMVDLL RAGF+ +AV++I +MPAPPDPVLWAT+LG+CK HG +EL
Sbjct: 382 TENYDIEPEMEHYGCMVDLLGRAGFINEAVDLIEKMPAPPDPVLWATLLGACKTHGLVEL 441

Query: 451 GEEIGNKLIQMDPTHNGHYVQLASIFARLRKWEDVSKVRRLMAERNSNKIAGWSLIEAEG 510
           GE IGNKLI +DP H+GHYVQLASI+A+ RKWEDV +VRRLM ++N+NKIAGWSLIEAEG
Sbjct: 442 GEMIGNKLIHLDPNHDGHYVQLASIYAKSRKWEDVVRVRRLMFKQNTNKIAGWSLIEAEG 501

Query: 511 RVHRFVAGDKEHERTTEIYKILEIIGVRIAAAGYTANVSSVLHDIEEEEKETAIKEHSER 570
           RVH+FVAGD+EHER++EIYK+LE IG RI+ AGYT N+SSVLHDI EEEKE  IKEHSER
Sbjct: 502 RVHQFVAGDREHERSSEIYKMLETIGTRISEAGYTPNISSVLHDIGEEEKENVIKEHSER 561

Query: 571 LAIAFGLLVTKVGDCIRIIKNLRVCGDCHEVSKIISQVFEREIIVRDGSRFHHFKNGSCS 630
           LAIA+G+LVT+VGDCIRI+KNLRVC DCHEVSK+ISQVFEREI+VRDGSRFHHFK G CS
Sbjct: 562 LAIAYGMLVTEVGDCIRIVKNLRVCEDCHEVSKMISQVFEREIVVRDGSRFHHFKEGKCS 621

Query: 631 CQDYW 636
           C D+W
Sbjct: 622 CHDFW 625

BLAST of Cla97C01G023510 vs. TrEMBL
Match: tr|A0A2P5CLE5|A0A2P5CLE5_PARAD (DYW domain containing protein OS=Parasponia andersonii OX=3476 GN=PanWU01x14_142740 PE=4 SV=1)

HSP 1 Score: 794.7 bits (2051), Expect = 1.6e-226
Identity = 410/606 (67.66%), Positives = 498/606 (82.18%), Query Frame = 0

Query: 30  PPLSSLPPTPRISQIKQAHARTVVFGLANDGRITAHLLAFLAISSSSLPCGYPLSIYHSI 89
           P L S P    ++Q KQ+HAR +V GLA D ++ AHLL  LA+S S  P  Y LSIY+  
Sbjct: 23  PLLPSTPTFRSLNQTKQSHARIIVSGLARDAKLMAHLLLSLALSPSP-PLHYSLSIYNHF 82

Query: 90  AHPSVFATNNMIRCFAKGDLPLESISLYSHMRRSFVAAPNKYTLTFVLQACSNALAIGEG 149
            +PSVFATNNMIRCFAK   P +S+ LYS M R + A PN +T TF+ QACS ALA+ EG
Sbjct: 83  NYPSVFATNNMIRCFAKSHSPRQSVVLYSSMLRRY-AKPNNHTFTFLFQACSEALAVEEG 142

Query: 150 VQVQTHVIKLGFVNDVFVRNALIHLYCTCCRVESAKQVFDEIPSSRDVVSWNSMIAGFVR 209
            Q+  HV+K GF  D+F+RNAL++ Y  C R+E +++VF+E   SRD+V+WN+M+A  VR
Sbjct: 143 AQIHAHVLKFGFGADLFIRNALLNFYSACFRLECSRKVFEESLGSRDLVTWNTMLACVVR 202

Query: 210 DGQINVAEKLFVEMPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRPNEAILVSL 269
           DGQ+ VAEKLF EMPXXXXXXXXXXXXXXXXXXXXXXXXXXXXX      R NEA+LVS+
Sbjct: 203 DGQMGVAEKLFDEMPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVREKGLRVNEAMLVSV 262

Query: 270 LAAAAQLGMLEYGKMIHSIANSLRFPMTASLGTALIDMYAKCGCIDESKFLFDRMPEKDK 329
           L+A+AQLG+LE+G+ +HS+A SL FPMT SLGT+LIDMYAKCGCI++SK LF+ M +KD 
Sbjct: 263 LSASAQLGLLEHGRFVHSLAESLNFPMTTSLGTSLIDMYAKCGCIEQSKILFNNMRQKDI 322

Query: 330 WSWNVMICGLASHGLGQEALALFEKFLTQGFYPVNVTFIGVLNACSRAGLVSEGRRFFKL 389
           W+WNVMICGLA+HGL +EALALFE+FL +GF P  VTFIGVL+ACSRAGLV EGR +FKL
Sbjct: 323 WTWNVMICGLATHGLAKEALALFERFLNKGFAPATVTFIGVLSACSRAGLVREGRHYFKL 382

Query: 390 MTDTYGIEPEMEHYGCMVDLLSRAGFVYDAVEMINRMPAPPDPVLWATVLGSCKVHGFIE 449
           M + YGI+PEMEHYGCMVDLL R GFV +AVE+I +M   PDPVLWAT+LG+CK+HGF E
Sbjct: 383 MKENYGIQPEMEHYGCMVDLLGRTGFVDEAVELIEKMQVSPDPVLWATLLGACKIHGFSE 442

Query: 450 LGEEIGNKLIQMDPTHNGHYVQLASIFARLRKWEDVSKVRRLMAERNSNKIAGWSLIEAE 509
           LGEEIG+KLIQ+DPTH+GHYVQL++++A+ +KWE+V +VRRLM+ERN+NK AGWSLIEA+
Sbjct: 443 LGEEIGSKLIQLDPTHSGHYVQLSTVYAKAKKWEEVIRVRRLMSERNTNKAAGWSLIEAQ 502

Query: 510 GRVHRFVAGDKEHERTTEIYKILEIIGVRIAAAGYTANVSSVLHDIEEEEKETAIKEHSE 569
           G+VH+FVAGD++HE   EI+K+LE IG RIA AGY+ N+SSVLHDI +EEKE  IKEHSE
Sbjct: 503 GKVHKFVAGDRDHESFKEIHKMLETIGTRIAEAGYSPNISSVLHDIGDEEKEIVIKEHSE 562

Query: 570 RLAIAFGLLVTKVGDCIRIIKNLRVCGDCHEVSKIISQVFEREIIVRDGSRFHHFKNGSC 629
           RLAIA GLLVT  GDCIR++KNLRVC DCHEVSKIIS+VFEREIIVRDGSRFHHFK G+C
Sbjct: 563 RLAIALGLLVTPYGDCIRVVKNLRVCEDCHEVSKIISKVFEREIIVRDGSRFHHFKEGTC 622

Query: 630 SCQDYW 636
           SC D+W
Sbjct: 623 SCHDFW 626
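
As a rough illustration of how the Identity and Positives figures in an HSP header relate to the raw counts, the Python sketch below parses a header line and recomputes each percentage as matches divided by alignment length. The helper names (parse_hsp_stats, percent) are hypothetical and not part of BLAST or any library; the sample header is based on the Parasponia andersonii hit above.

```python
import re

# Matches fields such as "Identity = 410/606 (67.66%)".
# The label is captured as written, so the pattern does not depend on its spelling.
FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+(?:\.\d+)?)%\)")


def parse_hsp_stats(line):
    """Return {label: (matches, alignment_length, reported_percent)}."""
    return {
        label: (int(n), int(total), float(pct))
        for label, n, total, pct in FIELD.findall(line)
    }


def percent(n, total):
    """Percentage rounded to two decimals, as displayed in the report."""
    return round(100.0 * n / total, 2)


# Sample header (assumption: same layout as the HSP headers on this page).
header = "Identity = 410/606 (67.66%), Positives = 498/606 (82.18%), Query Frame = 0"
stats = parse_hsp_stats(header)

for label, (n, total, reported) in stats.items():
    # Recomputed value next to the value printed in the report.
    print(label, percent(n, total), reported)
```

The "Query Frame = 0" field is skipped because it has no count/length pair. Both percentages use the same denominator, the alignment length (606 here), not the full query length.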

BLAST of Cla97C01G023510 vs. Swiss-Prot
Match: sp|Q9FI80|PP425_ARATH (Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H38 PE=2 SV=1)

HSP 1 Score: 421.4 bits (1082), Expect = 1.8e-116
Identity = 259/611 (42.39%), Positives = 363/611 (59.41%), Query Frame = 0

Query: 41  ISQIKQAHARTVVFGLANDGRITAHLLAFLAISS-SSLPCGYPLSIYHSIAHPSVFATNN 100
           I  + Q HA  +  G   D    A +L F A S        Y   I++ +   + F+ N 
Sbjct: 36  IRDLSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHHRDLDYAHKIFNQMPQRNCFSWNT 95

Query: 101 MIRCFAKG--DLPLESISLYSHMRRSFVAAPNKYTLTFVLQACSNALAIGEGVQVQTHVI 160
           +IR F++   D  L +I+L+  M       PN++T   VL+AC+    I EG Q+    +
Sbjct: 96  IIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLKACAKTGKIQEGKQIHGLAL 155

Query: 161 KLGFVNDVFVRNALIHLYCTCCRVESAKQVF-------------DEIPSSRDVVSWNSMI 220
           K GF  D FV + L+ +Y  C  ++ A+ +F             D      ++V WN MI
Sbjct: 156 KYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMVVMTDRRKRDGEIVLWNVMI 215

Query: 221 AGFVRDGQINVAEKLFVEMPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRPNEA 280
            G++R G    A  LF    XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX   RPN  
Sbjct: 216 DGYMRLGDCKAARMLFDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDIRPNYV 275

Query: 281 ILVSLLAAAAQLGMLEYGKMIHSIANSLRFPMTASLGTALIDMYAKCGCIDESKFLFDRM 340
            LVS+L A ++LG LE G+ +H  A      +   LG+ALIDMY+KCG I+++  +F+R+
Sbjct: 276 TLVSVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGSALIDMYSKCGIIEKAIHVFERL 335

Query: 341 PEKDKWSWNVMICGLASHGLGQEALALFEKFLTQGFYPVNVTFIGVLNACSRAGLVSEGR 400
           P ++  +W+ MI G A HG   +A+  F K    G  P +V +I +L ACS  GLV EGR
Sbjct: 336 PRENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRPSDVAYINLLTACSHGGLVEEGR 395

Query: 401 RFFKLMTDTYGIEPEMEHYGCMVDLLSRAGFVYDAVEMINRMPAPPDPVLWATVLGSCKV 460
           R+F  M    G+EP +EHYGCMVDLL R+G + +A E I  MP  PD V+W  +LG+C++
Sbjct: 396 RYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILNMPIKPDDVIWKALLGACRM 455

Query: 461 HGFIELGEEIGNKLIQMDPTHNGHYVQLASIFARLRKWEDVSKVRRLMAERNSNKIAGWS 520
            G +E+G+ + N L+ M P  +G YV L++++A    W +VS++R  M E++  K  G S
Sbjct: 456 QGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEVSEMRLRMKEKDIRKDPGCS 515

Query: 521 LIEAEGRVHRFVAGDKEHERTTEIYKILEIIGVRIAAAGYTANVSSVLHDIEEEEKETAI 580
           LI+ +G +H FV  D  H +  EI  +L  I  ++  AGY    + VL ++EEE+KE  +
Sbjct: 516 LIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLAGYRPITTQVLLNLEEEDKENVL 575

Query: 581 KEHSERLAIAFGLLVTKVGDCIRIIKNLRVCGDCHEVSKIISQVFEREIIVRDGSRFHHF 636
             HSE++A AFGL+ T  G  IRI+KNLR+C DCH   K+IS+V++R+I VRD  RFHHF
Sbjct: 576 HYHSEKIATAFGLISTSPGKPIRIVKNLRICEDCHSSIKLISKVYKRKITVRDRKRFHHF 635

BLAST of Cla97C01G023510 vs. Swiss-Prot
Match: sp|Q9FJY7|PP449_ARATH (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 419.5 bits (1077), Expect = 6.8e-116
Identity = 262/605 (43.31%), Positives = 353/605 (58.35%), Query Frame = 0

Query: 32  LSSLPPTPRISQIKQAHARTVVFGLANDGRITAHLLAFLAISSSSLPCGYPLSIYHSIAH 91
           +S L    +  ++KQ HAR +  GL  D       L+F   S+SS    Y   ++     
Sbjct: 18  MSCLQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFDR 77

Query: 92  PSVFATNNMIRCFAKGDLPLESISLYSHMRRSFVAAPNKYTLTFVLQACSNALAIGEGVQ 151
           P  F  N MIR F+  D P  S+ LY  M  S  A  N YT   +L+ACSN  A  E  Q
Sbjct: 78  PDTFLWNLMIRGFSCSDEPERSLLLYQRMLCS-SAPHNAYTFPSLLKACSNLSAFEETTQ 137

Query: 152 VQTHVIKLGFVNDVFVRNALIHLYCTCCRVESAKQVFDEIPSSRDVVSWNSMIAGFVRDG 211
           +   + KLG+ NDV+  N+LI+ Y      + A  +FD IP                   
Sbjct: 138 IHAQITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEP-XXXXXXXXXXXXXXXX 197

Query: 212 QINVAEKLFVEMPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRPNEAILVSLLA 271
                        XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX  P+   L + L+
Sbjct: 198 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVEPDNVSLANALS 257

Query: 272 AAAQLGMLEYGKMIHSIANSLRFPMTASLGTALIDMYAKCGCIDESKFLFDRMPEKDKWS 331
           A AQLG LE GK IHS  N  R  M + LG  LIDMYAKCG ++E+  +F  + +K   +
Sbjct: 258 ACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQA 317

Query: 332 WNVMICGLASHGLGQEALALFEKFLTQGFYPVNVTFIGVLNACSRAGLVSEGRRFFKLMT 391
           W  +I G A HG G+EA++ F +    G  P  +TF  VL ACS  GLV EG+  F  M 
Sbjct: 318 WTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSME 377

Query: 392 DTYGIEPEMEHYGCMVDLLSRAGFVYDAVEMINRMPAPPDPVLWATVLGSCKVHGFIELG 451
             Y ++P +EHYGC+VDLL RAG + +A   I  MP  P+ V+W  +L +C++H  IELG
Sbjct: 378 RDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIELG 437

Query: 452 EEIGNKLIQMDPTHNGHYVQLASIFARLRKWEDVSKVRRLMAERNSNKIAGWSLIEAEGR 511
           EEIG  LI +DP H G YV  A+I A  +KW+  ++ RRLM E+   K+ G S I  EG 
Sbjct: 438 EEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTISLEGT 497

Query: 512 VHRFVAGDKEHERTTEIYKILEIIGVRIAAAGYTANVSSVLHD-IEEEEKETAIKEHSER 571
            H F+AGD+ H    +I     I+  ++   GY   +  +L D ++++E+E  + +HSE+
Sbjct: 498 THEFLAGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIVHQHSEK 557

Query: 572 LAIAFGLLVTKVGDCIRIIKNLRVCGDCHEVSKIISQVFEREIIVRDGSRFHHFKNGSCS 631
           LAI +GL+ TK G  IRI+KNLRVC DCH+V+K+IS++++R+I++RD +RFHHF++G CS
Sbjct: 558 LAITYGLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRFHHFRDGKCS 617

Query: 632 CQDYW 636
           C DYW
Sbjct: 618 CGDYW 620

BLAST of Cla97C01G023510 vs. Swiss-Prot
Match: sp|Q0WQW5|PPR85_ARATH (Pentatricopeptide repeat-containing protein At1g59720, chloroplastic/mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H51 PE=1 SV=2)

HSP 1 Score: 415.2 bits (1066), Expect = 1.3e-114
Identity = 237/611 (38.79%), Positives = 342/611 (55.97%), Query Frame = 0

Query: 41  ISQIKQAHARTVVFGLANDGRITAHLLAFLAISSSSLPCGYPLSIYHSIAHPSVFATNNM 100
           +SQ+KQ HA T+      +          L +SSS     Y   ++ SI + S F  N +
Sbjct: 61  MSQLKQLHAFTLRTTYPEEPATLFLYGKILQLSSSFSDVNYAFRVFDSIENHSSFMWNTL 120

Query: 101 IR-CFAKGDLPLESISLYSHMRRSFVAAPNKYTLTFVLQACSNALAIGEGVQVQTHVIKL 160
           IR C        E+  LY  M     ++P+K+T  FVL+AC+      EG QV   ++K 
Sbjct: 121 IRACAHDVSRKEEAFMLYRKMLERGESSPDKHTFPFVLKACAYIFGFSEGKQVHCQIVKH 180

Query: 161 GFVNDVFVRNALIHLYCTCCRVESAKQVFDEIPSSRDVVSWNSMIAGFVRDGQINVAEKL 220
           GF  DV+V N LIHLY +C  ++ A++VFDE+P  R +VSWNSMI   VR G+ + A +L
Sbjct: 181 GFGGDVYVNNGLIHLYGSCGCLDLARKVFDEMP-ERSLVSWNSMIDALVRFGEYDSALQL 240

Query: 221 FVEMPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRPNEAILVSLLAAAAQLGML 280
           F EM                                     P+   + S+L+A A LG L
Sbjct: 241 FREM--------------------------------QRSFEPDGYTMQSVLSACAGLGSL 300

Query: 281 EYGKMIHSI---ANSLRFPMTASLGTALIDMYAKCGCIDESKFLFDRMPEKDKWSWNVMI 340
             G   H+       +   M   +  +LI+MY KCG +  ++ +F  M ++D  SWN MI
Sbjct: 301 SLGTWAHAFLLRKCDVDVAMDVLVKNSLIEMYCKCGSLRMAEQVFQGMQKRDLASWNAMI 360

Query: 341 CGLASHGLGQEALALFEKFL--TQGFYPVNVTFIGVLNACSRAGLVSEGRRFFKLMTDTY 400
            G A+HG  +EA+  F++ +   +   P +VTF+G+L AC+  G V++GR++F +M   Y
Sbjct: 361 LGFATHGRAEEAMNFFDRMVDKRENVRPNSVTFVGLLIACNHRGFVNKGRQYFDMMVRDY 420

Query: 401 GIEPEMEHYGCMVDLLSRAGFVYDAVEMINRMPAPPDPVLWATVLGS-CKVHGFIELGEE 460
            IEP +EHYGC+VDL++RAG++ +A++M+  MP  PD V+W ++L + CK    +EL EE
Sbjct: 421 CIEPALEHYGCIVDLIARAGYITEAIDMVMSMPMKPDAVIWRSLLDACCKKGASVELSEE 480

Query: 461 IGNKLI---QMDPTHNGH----YVQLASIFARLRKWEDVSKVRRLMAERNSNKIAGWSLI 520
           I   +I   + + + NG+    YV L+ ++A   +W DV  VR+LM+E    K  G S I
Sbjct: 481 IARNIIGTKEDNESSNGNCSGAYVLLSRVYASASRWNDVGIVRKLMSEHGIRKEPGCSSI 540

Query: 521 EAEGRVHRFVAGDKEHERTTEIYKILEIIGVRIAAAGYTANVSS--VLHDIEEEEKETAI 580
           E  G  H F AGD  H +T +IY+ L++I  R+ + GY  + S   ++    +  KE ++
Sbjct: 541 EINGISHEFFAGDTSHPQTKQIYQQLKVIDDRLRSIGYLPDRSQAPLVDATNDGSKEYSL 600

Query: 581 KEHSERLAIAFGLLVTKVGDCIRIIKNLRVCGDCHEVSKIISQVFEREIIVRDGSRFHHF 636
           + HSERLAIAFGL+       IRI KNLRVC DCHEV+K+IS+VF  EIIVRD  RFHHF
Sbjct: 601 RLHSERLAIAFGLINLPPQTPIRIFKNLRVCNDCHEVTKLISKVFNTEIIVRDRVRFHHF 638

BLAST of Cla97C01G023510 vs. Swiss-Prot
Match: sp|Q9SN85|PP267_ARATH (Pentatricopeptide repeat-containing protein At3g47530 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H76 PE=2 SV=1)

HSP 1 Score: 412.5 bits (1059), Expect = 8.3e-114
Identity = 217/594 (36.53%), Positives = 326/594 (54.88%), Query Frame = 0

Query: 44  IKQAHARTVVFGLANDGRITAHLLAFLAISSSSLPCGYPLSIYHSIAHPSVFATNNMIRC 103
           ++Q HA  +   L  +  +  H L+ LA+S       Y   ++    +P++   N MIR 
Sbjct: 27  LRQIHALLLRTSLIRNSDVFHHFLSRLALSLIPRDINYSCRVFSQRLNPTLSHCNTMIRA 86

Query: 104 FAKGDLPLESISLYSHMRRSFVAAPNKYTLTFVLQACSNALAIGEGVQVQTHVIKLGFVN 163
           F+    P E   L+  +RR+     N  + +F L+ C  +  +  G+Q+   +   GF++
Sbjct: 87  FSLSQTPCEGFRLFRSLRRNSSLPANPLSSSFALKCCIKSGDLLGGLQIHGKIFSDGFLS 146

Query: 164 DVFVRNALIHLYCTCCRVESAKQVFDEIPSSRDVVSWNSMIAGFVRDGQINVAEKLFVEM 223
           D  +   L+ LY TC     A +VFDEIP  RD VSWN + + ++R+ +      LF +M
Sbjct: 147 DSLLMTTLMDLYSTCENSTDACKVFDEIP-KRDTVSWNVLFSCYLRNKRTRDVLVLFDKM 206

Query: 224 PXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRPNEAILVSLLAAAAQLGMLEYGK 283
                                               +P+    +  L A A LG L++GK
Sbjct: 207 ----------------------------KNDVDGCVKPDGVTCLLALQACANLGALDFGK 266

Query: 284 MIHSIANSLRFPMTASLGTALIDMYAKCGCIDESKFLFDRMPEKDKWSWNVMICGLASHG 343
            +H   +        +L   L+ MY++CG +D++  +F  M E++  SW  +I GLA +G
Sbjct: 267 QVHDFIDENGLSGALNLSNTLVSMYSRCGSMDKAYQVFYGMRERNVVSWTALISGLAMNG 326

Query: 344 LGQEALALFEKFLTQGFYPVNVTFIGVLNACSRAGLVSEGRRFF-KLMTDTYGIEPEMEH 403
            G+EA+  F + L  G  P   T  G+L+ACS +GLV+EG  FF ++ +  + I+P + H
Sbjct: 327 FGKEAIEAFNEMLKFGISPEEQTLTGLLSACSHSGLVAEGMMFFDRMRSGEFKIKPNLHH 386

Query: 404 YGCMVDLLSRAGFVYDAVEMINRMPAPPDPVLWATVLGSCKVHGFIELGEEIGNKLIQMD 463
           YGC+VDLL RA  +  A  +I  M   PD  +W T+LG+C+VHG +ELGE + + LI++ 
Sbjct: 387 YGCVVDLLGRARLLDKAYSLIKSMEMKPDSTIWRTLLGACRVHGDVELGERVISHLIELK 446

Query: 464 PTHNGHYVQLASIFARLRKWEDVSKVRRLMAERNSNKIAGWSLIEAEGRVHRFVAGDKEH 523
               G YV L + ++ + KWE V+++R LM E+  +   G S IE +G VH F+  D  H
Sbjct: 447 AEEAGDYVLLLNTYSTVGKWEKVTELRSLMKEKRIHTKPGCSAIELQGTVHEFIVDDVSH 506

Query: 524 ERTTEIYKILEIIGVRIAAAGYTANVSSVLHDIE-EEEKETAIKEHSERLAIAFGLLVTK 583
            R  EIYK+L  I  ++  AGY A ++S LH++E EEEK  A++ HSE+LAIAFG+LVT 
Sbjct: 507 PRKEEIYKMLAEINQQLKIAGYVAEITSELHNLESEEEKGYALRYHSEKLAIAFGILVTP 566

Query: 584 VGDCIRIIKNLRVCGDCHEVSKIISQVFEREIIVRDGSRFHHFKNGSCSCQDYW 636
            G  IR+ KNLR C DCH  +K +S V++R +IVRD SRFHHFK GSCSC D+W
Sbjct: 567 PGTTIRVTKNLRTCVDCHNFAKFVSDVYDRIVIVRDRSRFHHFKGGSCSCNDFW 591

BLAST of Cla97C01G023510 vs. Swiss-Prot
Match: sp|Q9SZT8|PP354_ARATH (Pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=ELI1 PE=3 SV=1)

HSP 1 Score: 411.4 bits (1056), Expect = 1.8e-113
Identity = 259/613 (42.25%), Positives = 360/613 (58.73%), Query Frame = 0

Query: 27  RLPPP---LSSLPPTPRISQIKQAHARTVVFGLANDGRITAHLLAFLAISSSSLPCGYPL 86
           RLPPP      +  +  + ++ Q HA  +   L    R     L      +S     + L
Sbjct: 25  RLPPPEKLAVLIDKSQSVDEVLQIHAAILRHNLLLHPRYPVLNLKLHRAYASHGKIRHSL 84

Query: 87  SIYHSIAHPSVFATNNMIRCFAKGDLPLESISLYSHMRRSFVAAPNKYTLTFVLQACSNA 146
           +++H    P +F     I   +   L  ++  LY  +  S +  PN++T + +L++CS  
Sbjct: 85  ALFHQTIDPDLFLFTAAINTASINGLKDQAFLLYVQLLSSEI-NPNEFTFSSLLKSCSTK 144

Query: 147 LAIGEGVQVQTHVIKLGFVNDVFVRNALIHLYCTCCRVESAKQVFDEIPSSRDVVSWNSM 206
                G  + THV+K G   D +V   L+ +Y     V SA++VFD +P    V      
Sbjct: 145 ----SGKLIHTHVLKFGLGIDPYVATGLVDVYAKGGDVVSAQKVFDRMPERSLVSXXXXX 204

Query: 207 IAGFVRDGQINVAEKLFVEMPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRPNE 266
                                XXXXXXXXXXXXXXXXXXXXXXXXXXXXX      +P+E
Sbjct: 205 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLAEGKPKPDE 264

Query: 267 AILVSLLAAAAQLGMLEYGKMIHSIANSLRFPMTASLGTALIDMYAKCGCIDESKFLFDR 326
             +V+ L+A +Q+G LE G+ IH    S R  +   + T LIDMY+KCG ++E+  +F+ 
Sbjct: 265 ITVVAALSACSQIGALETGRWIHVFVKSSRIRLNVKVCTGLIDMYSKCGSLEEAVLVFND 324

Query: 327 MPEKDKWSWNVMICGLASHGLGQEALALFEKFL-TQGFYPVNVTFIGVLNACSRAGLVSE 386
            P KD  +WN MI G A HG  Q+AL LF +     G  P ++TFIG L AC+ AGLV+E
Sbjct: 325 TPRKDIVAWNAMIAGYAMHGYSQDALRLFNEMQGITGLQPTDITFIGTLQACAHAGLVNE 384

Query: 387 GRRFFKLMTDTYGIEPEMEHYGCMVDLLSRAGFVYDAVEMINRMPAPPDPVLWATVLGSC 446
           G R F+ M   YGI+P++EHYGC+V LL RAG +  A E I  M    D VLW++VLGSC
Sbjct: 385 GIRIFESMGQEYGIKPKIEHYGCLVSLLGRAGQLKRAYETIKNMNMDADSVLWSSVLGSC 444

Query: 447 KVHGFIELGEEIGNKLIQMDPTHNGHYVQLASIFARLRKWEDVSKVRRLMAERNSNKIAG 506
           K+HG   LG+EI   LI ++  ++G YV L++I+A +  +E V+KVR LM E+   K  G
Sbjct: 445 KLHGDFVLGKEIAEYLIGLNIKNSGIYVLLSNIYASVGDYEGVAKVRNLMKEKGIVKEPG 504

Query: 507 WSLIEAEGRVHRFVAGDKEHERTTEIYKILEIIGVRIAAAGYTANVSSVLHDIEEEEKET 566
            S IE E +VH F AGD+EH ++ EIY +L  I  RI + GY  N ++VL D+EE EKE 
Sbjct: 505 ISTIEIENKVHEFRAGDREHSKSKEIYTMLRKISERIKSHGYVPNTNTVLQDLEETEKEQ 564

Query: 567 AIKEHSERLAIAFGLLVTKVGDCIRIIKNLRVCGDCHEVSKIISQVFEREIIVRDGSRFH 626
           +++ HSERLAIA+GL+ TK G  ++I KNLRVC DCH V+K+IS++  R+I++RD +RFH
Sbjct: 565 SLQVHSERLAIAYGLISTKPGSPLKIFKNLRVCSDCHTVTKLISKITGRKIVMRDRNRFH 624

Query: 627 HFKNGSCSCQDYW 636
           HF +GSCSC D+W
Sbjct: 625 HFTDGSCSCGDFW 632

BLAST of Cla97C01G023510 vs. TAIR10
Match: AT5G48910.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 421.4 bits (1082), Expect = 9.9e-118
Identity = 259/611 (42.39%), Positives = 363/611 (59.41%), Query Frame = 0

Query: 41  ISQIKQAHARTVVFGLANDGRITAHLLAFLAISS-SSLPCGYPLSIYHSIAHPSVFATNN 100
           I  + Q HA  +  G   D    A +L F A S        Y   I++ +   + F+ N 
Sbjct: 36  IRDLSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHHRDLDYAHKIFNQMPQRNCFSWNT 95

Query: 101 MIRCFAKG--DLPLESISLYSHMRRSFVAAPNKYTLTFVLQACSNALAIGEGVQVQTHVI 160
           +IR F++   D  L +I+L+  M       PN++T   VL+AC+    I EG Q+    +
Sbjct: 96  IIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLKACAKTGKIQEGKQIHGLAL 155

Query: 161 KLGFVNDVFVRNALIHLYCTCCRVESAKQVF-------------DEIPSSRDVVSWNSMI 220
           K GF  D FV + L+ +Y  C  ++ A+ +F             D      ++V WN MI
Sbjct: 156 KYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMVVMTDRRKRDGEIVLWNVMI 215

Query: 221 AGFVRDGQINVAEKLFVEMPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRPNEA 280
            G++R G    A  LF    XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX   RPN  
Sbjct: 216 DGYMRLGDCKAARMLFDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDIRPNYV 275

Query: 281 ILVSLLAAAAQLGMLEYGKMIHSIANSLRFPMTASLGTALIDMYAKCGCIDESKFLFDRM 340
            LVS+L A ++LG LE G+ +H  A      +   LG+ALIDMY+KCG I+++  +F+R+
Sbjct: 276 TLVSVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGSALIDMYSKCGIIEKAIHVFERL 335

Query: 341 PEKDKWSWNVMICGLASHGLGQEALALFEKFLTQGFYPVNVTFIGVLNACSRAGLVSEGR 400
           P ++  +W+ MI G A HG   +A+  F K    G  P +V +I +L ACS  GLV EGR
Sbjct: 336 PRENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRPSDVAYINLLTACSHGGLVEEGR 395

Query: 401 RFFKLMTDTYGIEPEMEHYGCMVDLLSRAGFVYDAVEMINRMPAPPDPVLWATVLGSCKV 460
           R+F  M    G+EP +EHYGCMVDLL R+G + +A E I  MP  PD V+W  +LG+C++
Sbjct: 396 RYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILNMPIKPDDVIWKALLGACRM 455

Query: 461 HGFIELGEEIGNKLIQMDPTHNGHYVQLASIFARLRKWEDVSKVRRLMAERNSNKIAGWS 520
            G +E+G+ + N L+ M P  +G YV L++++A    W +VS++R  M E++  K  G S
Sbjct: 456 QGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEVSEMRLRMKEKDIRKDPGCS 515

Query: 521 LIEAEGRVHRFVAGDKEHERTTEIYKILEIIGVRIAAAGYTANVSSVLHDIEEEEKETAI 580
           LI+ +G +H FV  D  H +  EI  +L  I  ++  AGY    + VL ++EEE+KE  +
Sbjct: 516 LIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLAGYRPITTQVLLNLEEEDKENVL 575

Query: 581 KEHSERLAIAFGLLVTKVGDCIRIIKNLRVCGDCHEVSKIISQVFEREIIVRDGSRFHHF 636
             HSE++A AFGL+ T  G  IRI+KNLR+C DCH   K+IS+V++R+I VRD  RFHHF
Sbjct: 576 HYHSEKIATAFGLISTSPGKPIRIVKNLRICEDCHSSIKLISKVYKRKITVRDRKRFHHF 635

BLAST of Cla97C01G023510 vs. TAIR10
Match: AT5G66520.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 419.5 bits (1077), Expect = 3.8e-117
Identity = 262/605 (43.31%), Positives = 353/605 (58.35%), Query Frame = 0

Query: 32  LSSLPPTPRISQIKQAHARTVVFGLANDGRITAHLLAFLAISSSSLPCGYPLSIYHSIAH 91
           +S L    +  ++KQ HAR +  GL  D       L+F   S+SS    Y   ++     
Sbjct: 18  MSCLQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFDR 77

Query: 92  PSVFATNNMIRCFAKGDLPLESISLYSHMRRSFVAAPNKYTLTFVLQACSNALAIGEGVQ 151
           P  F  N MIR F+  D P  S+ LY  M  S  A  N YT   +L+ACSN  A  E  Q
Sbjct: 78  PDTFLWNLMIRGFSCSDEPERSLLLYQRMLCS-SAPHNAYTFPSLLKACSNLSAFEETTQ 137

Query: 152 VQTHVIKLGFVNDVFVRNALIHLYCTCCRVESAKQVFDEIPSSRDVVSWNSMIAGFVRDG 211
           +   + KLG+ NDV+  N+LI+ Y      + A  +FD IP                   
Sbjct: 138 IHAQITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEP-XXXXXXXXXXXXXXXX 197

Query: 212 QINVAEKLFVEMPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRPNEAILVSLLA 271
                        XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX  P+   L + L+
Sbjct: 198 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVEPDNVSLANALS 257

Query: 272 AAAQLGMLEYGKMIHSIANSLRFPMTASLGTALIDMYAKCGCIDESKFLFDRMPEKDKWS 331
           A AQLG LE GK IHS  N  R  M + LG  LIDMYAKCG ++E+  +F  + +K   +
Sbjct: 258 ACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQA 317

Query: 332 WNVMICGLASHGLGQEALALFEKFLTQGFYPVNVTFIGVLNACSRAGLVSEGRRFFKLMT 391
           W  +I G A HG G+EA++ F +    G  P  +TF  VL ACS  GLV EG+  F  M 
Sbjct: 318 WTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSME 377

Query: 392 DTYGIEPEMEHYGCMVDLLSRAGFVYDAVEMINRMPAPPDPVLWATVLGSCKVHGFIELG 451
             Y ++P +EHYGC+VDLL RAG + +A   I  MP  P+ V+W  +L +C++H  IELG
Sbjct: 378 RDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIELG 437

Query: 452 EEIGNKLIQMDPTHNGHYVQLASIFARLRKWEDVSKVRRLMAERNSNKIAGWSLIEAEGR 511
           EEIG  LI +DP H G YV  A+I A  +KW+  ++ RRLM E+   K+ G S I  EG 
Sbjct: 438 EEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTISLEGT 497

Query: 512 VHRFVAGDKEHERTTEIYKILEIIGVRIAAAGYTANVSSVLHD-IEEEEKETAIKEHSER 571
            H F+AGD+ H    +I     I+  ++   GY   +  +L D ++++E+E  + +HSE+
Sbjct: 498 THEFLAGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIVHQHSEK 557

Query: 572 LAIAFGLLVTKVGDCIRIIKNLRVCGDCHEVSKIISQVFEREIIVRDGSRFHHFKNGSCS 631
           LAI +GL+ TK G  IRI+KNLRVC DCH+V+K+IS++++R+I++RD +RFHHF++G CS
Sbjct: 558 LAITYGLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRFHHFRDGKCS 617

Query: 632 CQDYW 636
           C DYW
Sbjct: 618 CGDYW 620

BLAST of Cla97C01G023510 vs. TAIR10
Match: AT1G59720.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 415.2 bits (1066), Expect = 7.1e-116
Identity = 237/611 (38.79%), Positives = 342/611 (55.97%), Query Frame = 0

Query: 41  ISQIKQAHARTVVFGLANDGRITAHLLAFLAISSSSLPCGYPLSIYHSIAHPSVFATNNM 100
           +SQ+KQ HA T+      +          L +SSS     Y   ++ SI + S F  N +
Sbjct: 61  MSQLKQLHAFTLRTTYPEEPATLFLYGKILQLSSSFSDVNYAFRVFDSIENHSSFMWNTL 120

Query: 101 IR-CFAKGDLPLESISLYSHMRRSFVAAPNKYTLTFVLQACSNALAIGEGVQVQTHVIKL 160
           IR C        E+  LY  M     ++P+K+T  FVL+AC+      EG QV   ++K 
Sbjct: 121 IRACAHDVSRKEEAFMLYRKMLERGESSPDKHTFPFVLKACAYIFGFSEGKQVHCQIVKH 180

Query: 161 GFVNDVFVRNALIHLYCTCCRVESAKQVFDEIPSSRDVVSWNSMIAGFVRDGQINVAEKL 220
           GF  DV+V N LIHLY +C  ++ A++VFDE+P  R +VSWNSMI   VR G+ + A +L
Sbjct: 181 GFGGDVYVNNGLIHLYGSCGCLDLARKVFDEMP-ERSLVSWNSMIDALVRFGEYDSALQL 240

Query: 221 FVEMPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRPNEAILVSLLAAAAQLGML 280
           F EM                                     P+   + S+L+A A LG L
Sbjct: 241 FREM--------------------------------QRSFEPDGYTMQSVLSACAGLGSL 300

Query: 281 EYGKMIHSI---ANSLRFPMTASLGTALIDMYAKCGCIDESKFLFDRMPEKDKWSWNVMI 340
             G   H+       +   M   +  +LI+MY KCG +  ++ +F  M ++D  SWN MI
Sbjct: 301 SLGTWAHAFLLRKCDVDVAMDVLVKNSLIEMYCKCGSLRMAEQVFQGMQKRDLASWNAMI 360

Query: 341 CGLASHGLGQEALALFEKFL--TQGFYPVNVTFIGVLNACSRAGLVSEGRRFFKLMTDTY 400
            G A+HG  +EA+  F++ +   +   P +VTF+G+L AC+  G V++GR++F +M   Y
Sbjct: 361 LGFATHGRAEEAMNFFDRMVDKRENVRPNSVTFVGLLIACNHRGFVNKGRQYFDMMVRDY 420

Query: 401 GIEPEMEHYGCMVDLLSRAGFVYDAVEMINRMPAPPDPVLWATVLGS-CKVHGFIELGEE 460
            IEP +EHYGC+VDL++RAG++ +A++M+  MP  PD V+W ++L + CK    +EL EE
Sbjct: 421 CIEPALEHYGCIVDLIARAGYITEAIDMVMSMPMKPDAVIWRSLLDACCKKGASVELSEE 480

Query: 461 IGNKLI---QMDPTHNGH----YVQLASIFARLRKWEDVSKVRRLMAERNSNKIAGWSLI 520
           I   +I   + + + NG+    YV L+ ++A   +W DV  VR+LM+E    K  G S I
Sbjct: 481 IARNIIGTKEDNESSNGNCSGAYVLLSRVYASASRWNDVGIVRKLMSEHGIRKEPGCSSI 540

Query: 521 EAEGRVHRFVAGDKEHERTTEIYKILEIIGVRIAAAGYTANVSS--VLHDIEEEEKETAI 580
           E  G  H F AGD  H +T +IY+ L++I  R+ + GY  + S   ++    +  KE ++
Sbjct: 541 EINGISHEFFAGDTSHPQTKQIYQQLKVIDDRLRSIGYLPDRSQAPLVDATNDGSKEYSL 600

Query: 581 KEHSERLAIAFGLLVTKVGDCIRIIKNLRVCGDCHEVSKIISQVFEREIIVRDGSRFHHF 636
           + HSERLAIAFGL+       IRI KNLRVC DCHEV+K+IS+VF  EIIVRD  RFHHF
Sbjct: 601 RLHSERLAIAFGLINLPPQTPIRIFKNLRVCNDCHEVTKLISKVFNTEIIVRDRVRFHHF 638

BLAST of Cla97C01G023510 vs. TAIR10
Match: AT3G47530.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 412.5 bits (1059), Expect = 4.6e-115
Identity = 217/594 (36.53%), Positives = 326/594 (54.88%), Query Frame = 0

Query: 44  IKQAHARTVVFGLANDGRITAHLLAFLAISSSSLPCGYPLSIYHSIAHPSVFATNNMIRC 103
           ++Q HA  +   L  +  +  H L+ LA+S       Y   ++    +P++   N MIR 
Sbjct: 27  LRQIHALLLRTSLIRNSDVFHHFLSRLALSLIPRDINYSCRVFSQRLNPTLSHCNTMIRA 86

Query: 104 FAKGDLPLESISLYSHMRRSFVAAPNKYTLTFVLQACSNALAIGEGVQVQTHVIKLGFVN 163
           F+    P E   L+  +RR+     N  + +F L+ C  +  +  G+Q+   +   GF++
Sbjct: 87  FSLSQTPCEGFRLFRSLRRNSSLPANPLSSSFALKCCIKSGDLLGGLQIHGKIFSDGFLS 146

Query: 164 DVFVRNALIHLYCTCCRVESAKQVFDEIPSSRDVVSWNSMIAGFVRDGQINVAEKLFVEM 223
           D  +   L+ LY TC     A +VFDEIP  RD VSWN + + ++R+ +      LF +M
Sbjct: 147 DSLLMTTLMDLYSTCENSTDACKVFDEIP-KRDTVSWNVLFSCYLRNKRTRDVLVLFDKM 206

Query: 224 PXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRPNEAILVSLLAAAAQLGMLEYGK 283
                                               +P+    +  L A A LG L++GK
Sbjct: 207 ----------------------------KNDVDGCVKPDGVTCLLALQACANLGALDFGK 266

Query: 284 MIHSIANSLRFPMTASLGTALIDMYAKCGCIDESKFLFDRMPEKDKWSWNVMICGLASHG 343
            +H   +        +L   L+ MY++CG +D++  +F  M E++  SW  +I GLA +G
Sbjct: 267 QVHDFIDENGLSGALNLSNTLVSMYSRCGSMDKAYQVFYGMRERNVVSWTALISGLAMNG 326

Query: 344 LGQEALALFEKFLTQGFYPVNVTFIGVLNACSRAGLVSEGRRFF-KLMTDTYGIEPEMEH 403
            G+EA+  F + L  G  P   T  G+L+ACS +GLV+EG  FF ++ +  + I+P + H
Sbjct: 327 FGKEAIEAFNEMLKFGISPEEQTLTGLLSACSHSGLVAEGMMFFDRMRSGEFKIKPNLHH 386

Query: 404 YGCMVDLLSRAGFVYDAVEMINRMPAPPDPVLWATVLGSCKVHGFIELGEEIGNKLIQMD 463
           YGC+VDLL RA  +  A  +I  M   PD  +W T+LG+C+VHG +ELGE + + LI++ 
Sbjct: 387 YGCVVDLLGRARLLDKAYSLIKSMEMKPDSTIWRTLLGACRVHGDVELGERVISHLIELK 446

Query: 464 PTHNGHYVQLASIFARLRKWEDVSKVRRLMAERNSNKIAGWSLIEAEGRVHRFVAGDKEH 523
               G YV L + ++ + KWE V+++R LM E+  +   G S IE +G VH F+  D  H
Sbjct: 447 AEEAGDYVLLLNTYSTVGKWEKVTELRSLMKEKRIHTKPGCSAIELQGTVHEFIVDDVSH 506

Query: 524 ERTTEIYKILEIIGVRIAAAGYTANVSSVLHDIE-EEEKETAIKEHSERLAIAFGLLVTK 583
            R  EIYK+L  I  ++  AGY A ++S LH++E EEEK  A++ HSE+LAIAFG+LVT 
Sbjct: 507 PRKEEIYKMLAEINQQLKIAGYVAEITSELHNLESEEEKGYALRYHSEKLAIAFGILVTP 566

Query: 584 VGDCIRIIKNLRVCGDCHEVSKIISQVFEREIIVRDGSRFHHFKNGSCSCQDYW 636
            G  IR+ KNLR C DCH  +K +S V++R +IVRD SRFHHFK GSCSC D+W
Sbjct: 567 PGTTIRVTKNLRTCVDCHNFAKFVSDVYDRIVIVRDRSRFHHFKGGSCSCNDFW 591

BLAST of Cla97C01G023510 vs. TAIR10
Match: AT4G37380.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 411.4 bits (1056), Expect = 1.0e-114
Identity = 259/613 (42.25%), Positives = 360/613 (58.73%), Query Frame = 0

Query: 27  RLPPP---LSSLPPTPRISQIKQAHARTVVFGLANDGRITAHLLAFLAISSSSLPCGYPL 86
           RLPPP      +  +  + ++ Q HA  +   L    R     L      +S     + L
Sbjct: 25  RLPPPEKLAVLIDKSQSVDEVLQIHAAILRHNLLLHPRYPVLNLKLHRAYASHGKIRHSL 84

Query: 87  SIYHSIAHPSVFATNNMIRCFAKGDLPLESISLYSHMRRSFVAAPNKYTLTFVLQACSNA 146
           +++H    P +F     I   +   L  ++  LY  +  S +  PN++T + +L++CS  
Sbjct: 85  ALFHQTIDPDLFLFTAAINTASINGLKDQAFLLYVQLLSSEI-NPNEFTFSSLLKSCSTK 144

Query: 147 LAIGEGVQVQTHVIKLGFVNDVFVRNALIHLYCTCCRVESAKQVFDEIPSSRDVVSWNSM 206
                G  + THV+K G   D +V   L+ +Y     V SA++VFD +P    V      
Sbjct: 145 ----SGKLIHTHVLKFGLGIDPYVATGLVDVYAKGGDVVSAQKVFDRMPERSLVSXXXXX 204

Query: 207 IAGFVRDGQINVAEKLFVEMPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRPNE 266
                                XXXXXXXXXXXXXXXXXXXXXXXXXXXXX      +P+E
Sbjct: 205 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLAEGKPKPDE 264

Query: 267 AILVSLLAAAAQLGMLEYGKMIHSIANSLRFPMTASLGTALIDMYAKCGCIDESKFLFDR 326
             +V+ L+A +Q+G LE G+ IH    S R  +   + T LIDMY+KCG ++E+  +F+ 
Sbjct: 265 ITVVAALSACSQIGALETGRWIHVFVKSSRIRLNVKVCTGLIDMYSKCGSLEEAVLVFND 324

Query: 327 MPEKDKWSWNVMICGLASHGLGQEALALFEKFL-TQGFYPVNVTFIGVLNACSRAGLVSE 386
            P KD  +WN MI G A HG  Q+AL LF +     G  P ++TFIG L AC+ AGLV+E
Sbjct: 325 TPRKDIVAWNAMIAGYAMHGYSQDALRLFNEMQGITGLQPTDITFIGTLQACAHAGLVNE 384

Query: 387 GRRFFKLMTDTYGIEPEMEHYGCMVDLLSRAGFVYDAVEMINRMPAPPDPVLWATVLGSC 446
           G R F+ M   YGI+P++EHYGC+V LL RAG +  A E I  M    D VLW++VLGSC
Sbjct: 385 GIRIFESMGQEYGIKPKIEHYGCLVSLLGRAGQLKRAYETIKNMNMDADSVLWSSVLGSC 444

Query: 447 KVHGFIELGEEIGNKLIQMDPTHNGHYVQLASIFARLRKWEDVSKVRRLMAERNSNKIAG 506
           K+HG   LG+EI   LI ++  ++G YV L++I+A +  +E V+KVR LM E+   K  G
Sbjct: 445 KLHGDFVLGKEIAEYLIGLNIKNSGIYVLLSNIYASVGDYEGVAKVRNLMKEKGIVKEPG 504

Query: 507 WSLIEAEGRVHRFVAGDKEHERTTEIYKILEIIGVRIAAAGYTANVSSVLHDIEEEEKET 566
            S IE E +VH F AGD+EH ++ EIY +L  I  RI + GY  N ++VL D+EE EKE 
Sbjct: 505 ISTIEIENKVHEFRAGDREHSKSKEIYTMLRKISERIKSHGYVPNTNTVLQDLEETEKEQ 564

Query: 567 AIKEHSERLAIAFGLLVTKVGDCIRIIKNLRVCGDCHEVSKIISQVFEREIIVRDGSRFH 626
           +++ HSERLAIA+GL+ TK G  ++I KNLRVC DCH V+K+IS++  R+I++RD +RFH
Sbjct: 565 SLQVHSERLAIAYGLISTKPGSPLKIFKNLRVCSDCHTVTKLISKITGRKIVMRDRNRFH 624

Query: 627 HFKNGSCSCQDYW 636
           HF +GSCSC D+W
Sbjct: 625 HFTDGSCSCGDFW 632

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_008439760.2  0.0e+00   91.84         PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Cucumis m...
XP_004134932.1  0.0e+00   91.22         PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Cucumis s...
XP_022142302.1  0.0e+00   89.29         pentatricopeptide repeat-containing protein At3g62890-like [Momordica charantia]
XP_022926870.1  0.0e+00   89.76         pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita moschata]
XP_023004098.1  1.3e-309  89.29         pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita maxima]

Match Name                      E-value   Identity (%)  Description
tr|A0A1S3AZ38|A0A1S3AZ38_CUCME  0.0e+00   91.84         pentatricopeptide repeat-containing protein At4g21065-like OS=Cucumis melo OX=36...
tr|A0A0A0KNI8|A0A0A0KNI8_CUCSA  0.0e+00   91.22         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G519440 PE=4 SV=1
tr|A0A251RKX3|A0A251RKX3_PRUPE  7.3e-240  69.44         Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_1G582700 PE=4 SV=1
tr|A0A2I4EEE8|A0A2I4EEE8_9ROSI  3.1e-238  72.07         pentatricopeptide repeat-containing protein At5g66520-like OS=Juglans regia OX=5...
tr|A0A2P5CLE5|A0A2P5CLE5_PARAD  1.6e-226  67.66         DYW domain containing protein OS=Parasponia andersonii OX=3476 GN=PanWU01x14_142...

Match Name             E-value   Identity (%)  Description
sp|Q9FI80|PP425_ARATH  1.8e-116  42.39         Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX...
sp|Q9FJY7|PP449_ARATH  6.8e-116  43.31         Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX...
sp|Q0WQW5|PPR85_ARATH  1.3e-114  38.79         Pentatricopeptide repeat-containing protein At1g59720, chloroplastic/mitochondri...
sp|Q9SN85|PP267_ARATH  8.3e-114  36.53         Pentatricopeptide repeat-containing protein At3g47530 OS=Arabidopsis thaliana OX...
sp|Q9SZT8|PP354_ARATH  1.8e-113  42.25         Pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Arabidopsis t...

Match Name   E-value   Identity (%)  Description
AT5G48910.1  9.9e-118  42.39         Pentatricopeptide repeat (PPR) superfamily protein
AT5G66520.1  3.8e-117  43.31         Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G59720.1  7.1e-116  38.79         Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G47530.1  4.6e-115  36.53         Pentatricopeptide repeat (PPR) superfamily protein
AT4G37380.1  1.0e-114  42.25         Tetratricopeptide repeat (TPR)-like superfamily protein

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0008270  zinc ion binding
GO:0005515  protein binding
Vocabulary: INTERPRO
Term       Definition
IPR032867  DYW_dom
IPR011990  TPR-like_helical_dom_sf
IPR002885  Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0009451      RNA modification
biological_process  GO:0008150      biological_process
cellular_component  GO:0009536      plastid
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0008270      zinc ion binding
molecular_function  GO:0016788      hydrolase activity, acting on ester bonds
molecular_function  GO:0005515      protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C01G023510.1  Cla97C01G023510.1  mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term   IPR Description                                    Source   Source Term       Source Description   Alignment
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535           PPR                  coord: 402..426, e-value: 0.068, score: 13.3
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535           PPR                  coord: 229..256, e-value: 3.6E-7, score: 29.9
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535           PPR                  coord: 365..391, e-value: 0.11, score: 12.7
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535           PPR                  coord: 331..359, e-value: 2.2E-5, score: 24.3
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535           PPR                  coord: 302..327, e-value: 5.7E-4, score: 19.9
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535           PPR                  coord: 169..193, e-value: 0.061, score: 13.5
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535           PPR                  coord: 98..122, e-value: 0.076, score: 13.2
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 229..263, e-value: 1.0E-8, score: 32.8
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 331..362, e-value: 3.8E-5, score: 21.6
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 198..228, e-value: 3.9E-6, score: 24.7
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 365..398, e-value: 0.0014, score: 16.7
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041           PPR_2                coord: 196..227, e-value: 1.1E-7, score: 31.8
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 297..327, score: 7.585
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 93..127, score: 7.169
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 363..398, score: 6.939
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 465..499, score: 5.525
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 129..163, score: 5.163
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 227..261, score: 12.145
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 196..226, score: 10.534
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 328..362, score: 10.567
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 164..194, score: 8.21
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 431..461, score: 5.185
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 399..429, score: 6.303
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10                       coord: 180..293, e-value: 1.0E-24, score: 89.7
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10                       coord: 299..525, e-value: 3.4E-34, score: 120.6
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10                       coord: 61..178, e-value: 3.4E-8, score: 35.0
IPR032867  DYW domain                                         PFAM     PF14432           DYW_deaminase        coord: 502..625, e-value: 2.5E-38, score: 130.7
None       No IPR available                                   PANTHER  PTHR24015         FAMILY NOT NAMED     coord: 81..550
None       No IPR available                                   PANTHER  PTHR24015:SF509   SUBFAMILY NOT NAMED  coord: 81..550
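
The coord values in the InterPro table are residue ranges on the predicted protein. Assuming they are 1-based and inclusive (the usual InterProScan convention, which this page does not state), a small Python sketch can turn them into spans, for example to order the PROSITE PS51375 repeat hits along the sequence or to get the length of the DYW domain hit. parse_coord and span_length are hypothetical helper names, not part of any InterPro tool.

```python
import re

# Matches ranges written as "coord: 402..426".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Extract (start, end) from a 'coord: start..end' string."""
    m = COORD.search(text)
    if m is None:
        raise ValueError(f"no coord range in {text!r}")
    return int(m.group(1)), int(m.group(2))


def span_length(start, end):
    """Residue count of a range, assuming 1-based inclusive coordinates."""
    return end - start + 1


# PROSITE PS51375 hits, copied from the table in their listed order.
prosite_hits = [
    "coord: 297..327", "coord: 93..127", "coord: 363..398", "coord: 465..499",
    "coord: 129..163", "coord: 227..261", "coord: 196..226", "coord: 328..362",
    "coord: 164..194", "coord: 431..461", "coord: 399..429",
]

# Sort the repeats by start position to read them N- to C-terminus.
repeats = sorted(parse_coord(h) for h in prosite_hits)
for start, end in repeats:
    print(start, end, span_length(start, end))

# Length of the Pfam DYW_deaminase hit (coord: 502..625).
dyw_start, dyw_end = parse_coord("coord: 502..625")
print("DYW length:", span_length(dyw_start, dyw_end))
```

Sorting by start position shows the PPR repeats running from residue 93 to 499, directly upstream of the DYW domain hit at 502..625.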

The following gene(s) are paralogous to this gene:

None