Cla97C04G075190 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C04G075190
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Pentatricopeptide repeat-containing protein
Location: Cla97Chr04: 22757825 .. 22759375 (+)
RNA-Seq Expression: Cla97C04G075190
Synteny: Cla97C04G075190
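
The Location field above can be split into chromosome, start, end, and strand, and the span of the feature follows from the coordinates. The sketch below is illustrative only: `parse_location` is a hypothetical helper, not part of the database, and it assumes the coordinates are 1-based and inclusive.

```python
import re

def parse_location(text):
    """Split 'Chrom: start .. end (strand)' into its four parts."""
    match = re.fullmatch(
        r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text.strip()
    )
    if match is None:
        raise ValueError("unrecognized location: " + text)
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Cla97Chr04: 22757825 .. 22759375 (+)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
length = end - start + 1
print(chrom, strand, length)
```

Under that assumption the feature spans 1551 bases on the plus strand of Cla97Chr04.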
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAACCCATAACAATATGGAGAAAATCTTTCACTTCATCTTGTAGCCAATTGATCAAACTCAAAGCCAATGGCGATGGCGCCGTCGGTACTCCACCTCTTCAATCCAAGACAGCCCCATTTCCTCGCAGTGAATTGGATGCCCGCCAACTGTTCGACAAAATGCCTCACTTAAACGTGGTTTCAGCAACCGCCCAGATTGGGCGTTGCGCAAGGCAGCACCAACACGAAGAGGCATTATGTCTCTTTTCTGCTATGTTTGTGTTAAATTTAAGGCCCAACGAATTCACTTTTGGAACTGTGATTCACTCTGCAACTGCACTTGGAGACATTGATATTGGCAAGCAACTTCATGTTTGTGCCATAAAAAATGGCCTTCATTCTAATGTGTTTGTTGGTAGTGCGCTTTTGGATTTGTATGTCAAGTTTAGTGTCATTGAGGAAGCTCAAACAGCTTTTTATGATATCAAGATGCCAAATGTGGTTTCTTACACTAGTTTGATTTCTGGGTACTTGAAAATGGAGAGGATTCAAGATGCTCTCCGGGTGTTCGATGAAATGCCCGAACGAAATGTCGTGTCGTGGAATGCAATGATTGGTGGGTTTAGTCAAAAGGGACACAATGAGGATGCTGTGCATCTGTTCATTGACATGCTTAGAGAAGGTTTTTTGCCCACTCAATCTACTTTTCCTTGTGCTGTATGTGCTGCTGCTAATATAGCATCTATTGGAATAGGGAGGAGTTTTCATGCTTGTGCTGTCAAGTTCTTGGGCAAGCTCGACGTGTTTGTCAGCAATTCACTTATTAGTTTTTATGCAAAATGTGGAAGTATGGAAGATAGTTTGTTGGTTTTCAATAAATTACTTGATGAAAGAAACATAGTTTCATGGAATGCTGTTCTTTGTGGTTATGCTCAAAATGGTAGGGGAAAGGAAGCTATAGATTTCTATCAAAAGATGACTTTTGCAGGCTGCAAACCAAATGGTGTGACACTTCTTAGCTTGTTATGGGCTTGTAATCATGCTGGCTTGGTTGATGAAGGTTATTCATATTTCAATCAAGCGAGACTCGATAACCCCAGCATGCTAAAACCCGAGCATTATGCTTGTATGGTCGATTTGCTCTCACGTTCGGGTCAATTTAAACAAGCAGAGGAATTTATACATGATTTGCCGTTCGATCCAGGGATCGGGTTTTGGAAAGCGTTGCTTGGGGGCTGCCAAATTCACTCAAATGTGGAACTTGGGGAGTTTGCAGCTCAGAGAATCCTGGCCTTGGATCCGGGTGATGTATCGTCATATGTAATGATGTCTAATGCGCATTCTGCAGCCGGGAAATGGCAAAGCGTGTCGATATTAAGAAGGGAAATGAAGGAGAAGGGGATGAAGAGAATCCCAGGTTGCAGTTGGATTGAAATTAGAAGCAAAGTTCATGTTTTTGTAACGGGTGATAAGAATCACCACCAAAAGGATGAAATTTATTCTGCCTTGAGATTCTTTGTTGAGCACTTGAAAGAGAGAGAAGATTTCAACTTCCTTTCAAATTCATAA

mRNA sequence

ATGAAACCCATAACAATATGGAGAAAATCTTTCACTTCATCTTGTAGCCAATTGATCAAACTCAAAGCCAATGGCGATGGCGCCGTCGGTACTCCACCTCTTCAATCCAAGACAGCCCCATTTCCTCGCAGTGAATTGGATGCCCGCCAACTGTTCGACAAAATGCCTCACTTAAACGTGGTTTCAGCAACCGCCCAGATTGGGCGTTGCGCAAGGCAGCACCAACACGAAGAGGCATTATGTCTCTTTTCTGCTATGTTTGTGTTAAATTTAAGGCCCAACGAATTCACTTTTGGAACTGTGATTCACTCTGCAACTGCACTTGGAGACATTGATATTGGCAAGCAACTTCATGTTTGTGCCATAAAAAATGGCCTTCATTCTAATGTGTTTGTTGGTAGTGCGCTTTTGGATTTGTATGTCAAGTTTAGTGTCATTGAGGAAGCTCAAACAGCTTTTTATGATATCAAGATGCCAAATGTGGTTTCTTACACTAGTTTGATTTCTGGGTACTTGAAAATGGAGAGGATTCAAGATGCTCTCCGGGTGTTCGATGAAATGCCCGAACGAAATGTCGTGTCGTGGAATGCAATGATTGGTGGGTTTAGTCAAAAGGGACACAATGAGGATGCTGTGCATCTGTTCATTGACATGCTTAGAGAAGGTTTTTTGCCCACTCAATCTACTTTTCCTTGTGCTGTATGTGCTGCTGCTAATATAGCATCTATTGGAATAGGGAGGAGTTTTCATGCTTGTGCTGTCAAGTTCTTGGGCAAGCTCGACGTGTTTGTCAGCAATTCACTTATTAGTTTTTATGCAAAATGTGGAAGTATGGAAGATAGTTTGTTGGTTTTCAATAAATTACTTGATGAAAGAAACATAGTTTCATGGAATGCTGTTCTTTGTGGTTATGCTCAAAATGGTAGGGGAAAGGAAGCTATAGATTTCTATCAAAAGATGACTTTTGCAGGCTGCAAACCAAATGGTGTGACACTTCTTAGCTTGTTATGGGCTTGTAATCATGCTGGCTTGGTTGATGAAGGTTATTCATATTTCAATCAAGCGAGACTCGATAACCCCAGCATGCTAAAACCCGAGCATTATGCTTGTATGGTCGATTTGCTCTCACGTTCGGGTCAATTTAAACAAGCAGAGGAATTTATACATGATTTGCCGTTCGATCCAGGGATCGGGTTTTGGAAAGCGTTGCTTGGGGGCTGCCAAATTCACTCAAATGTGGAACTTGGGGAGTTTGCAGCTCAGAGAATCCTGGCCTTGGATCCGGGTGATGTATCGTCATATGTAATGATGTCTAATGCGCATTCTGCAGCCGGGAAATGGCAAAGCGTGTCGATATTAAGAAGGGAAATGAAGGAGAAGGGGATGAAGAGAATCCCAGGTTGCAGTTGGATTGAAATTAGAAGCAAAGTTCATGTTTTTGTAACGGGTGATAAGAATCACCACCAAAAGGATGAAATTTATTCTGCCTTGAGATTCTTTGTTGAGCACTTGAAAGAGAGAGAAGATTTCAACTTCCTTTCAAATTCATAA

Coding sequence (CDS)

ATGAAACCCATAACAATATGGAGAAAATCTTTCACTTCATCTTGTAGCCAATTGATCAAACTCAAAGCCAATGGCGATGGCGCCGTCGGTACTCCACCTCTTCAATCCAAGACAGCCCCATTTCCTCGCAGTGAATTGGATGCCCGCCAACTGTTCGACAAAATGCCTCACTTAAACGTGGTTTCAGCAACCGCCCAGATTGGGCGTTGCGCAAGGCAGCACCAACACGAAGAGGCATTATGTCTCTTTTCTGCTATGTTTGTGTTAAATTTAAGGCCCAACGAATTCACTTTTGGAACTGTGATTCACTCTGCAACTGCACTTGGAGACATTGATATTGGCAAGCAACTTCATGTTTGTGCCATAAAAAATGGCCTTCATTCTAATGTGTTTGTTGGTAGTGCGCTTTTGGATTTGTATGTCAAGTTTAGTGTCATTGAGGAAGCTCAAACAGCTTTTTATGATATCAAGATGCCAAATGTGGTTTCTTACACTAGTTTGATTTCTGGGTACTTGAAAATGGAGAGGATTCAAGATGCTCTCCGGGTGTTCGATGAAATGCCCGAACGAAATGTCGTGTCGTGGAATGCAATGATTGGTGGGTTTAGTCAAAAGGGACACAATGAGGATGCTGTGCATCTGTTCATTGACATGCTTAGAGAAGGTTTTTTGCCCACTCAATCTACTTTTCCTTGTGCTGTATGTGCTGCTGCTAATATAGCATCTATTGGAATAGGGAGGAGTTTTCATGCTTGTGCTGTCAAGTTCTTGGGCAAGCTCGACGTGTTTGTCAGCAATTCACTTATTAGTTTTTATGCAAAATGTGGAAGTATGGAAGATAGTTTGTTGGTTTTCAATAAATTACTTGATGAAAGAAACATAGTTTCATGGAATGCTGTTCTTTGTGGTTATGCTCAAAATGGTAGGGGAAAGGAAGCTATAGATTTCTATCAAAAGATGACTTTTGCAGGCTGCAAACCAAATGGTGTGACACTTCTTAGCTTGTTATGGGCTTGTAATCATGCTGGCTTGGTTGATGAAGGTTATTCATATTTCAATCAAGCGAGACTCGATAACCCCAGCATGCTAAAACCCGAGCATTATGCTTGTATGGTCGATTTGCTCTCACGTTCGGGTCAATTTAAACAAGCAGAGGAATTTATACATGATTTGCCGTTCGATCCAGGGATCGGGTTTTGGAAAGCGTTGCTTGGGGGCTGCCAAATTCACTCAAATGTGGAACTTGGGGAGTTTGCAGCTCAGAGAATCCTGGCCTTGGATCCGGGTGATGTATCGTCATATGTAATGATGTCTAATGCGCATTCTGCAGCCGGGAAATGGCAAAGCGTGTCGATATTAAGAAGGGAAATGAAGGAGAAGGGGATGAAGAGAATCCCAGGTTGCAGTTGGATTGAAATTAGAAGCAAAGTTCATGTTTTTGTAACGGGTGATAAGAATCACCACCAAAAGGATGAAATTTATTCTGCCTTGAGATTCTTTGTTGAGCACTTGAAAGAGAGAGAAGATTTCAACTTCCTTTCAAATTCATAA

Protein sequence

MKPITIWRKSFTSSCSQLIKLKANGDGAVGTPPLQSKTAPFPRSELDARQLFDKMPHLNVVSATAQIGRCARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSATALGDIDIGKQLHVCAIKNGLHSNVFVGSALLDLYVKFSVIEEAQTAFYDIKMPNVVSYTSLISGYLKMERIQDALRVFDEMPERNVVSWNAMIGGFSQKGHNEDAVHLFIDMLREGFLPTQSTFPCAVCAAANIASIGIGRSFHACAVKFLGKLDVFVSNSLISFYAKCGSMEDSLLVFNKLLDERNIVSWNAVLCGYAQNGRGKEAIDFYQKMTFAGCKPNGVTLLSLLWACNHAGLVDEGYSYFNQARLDNPSMLKPEHYACMVDLLSRSGQFKQAEEFIHDLPFDPGIGFWKALLGGCQIHSNVELGEFAAQRILALDPGDVSSYVMMSNAHSAAGKWQSVSILRREMKEKGMKRIPGCSWIEIRSKVHVFVTGDKNHHQKDEIYSALRFFVEHLKEREDFNFLSNS
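
The protein sequence above relates to the CDS through the standard genetic code. The following minimal sketch translates only the first 27 and the last 12 nucleotides of the CDS; the `translate` helper is hypothetical, and the use of the standard code (rather than an organelle-specific table) is an assumption.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGAAACCCATAACAATATGGAGAAAA"))  # first nine codons of the CDS
print(translate("TCAAATTCATAA"))                 # last four codons, ending in TAA
```

The two fragments translate to the first nine and the last three residues of the protein sequence listed above.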
Homology
BLAST of Cla97C04G075190 vs. NCBI nr
Match: XP_038882815.1 (pentatricopeptide repeat-containing protein At5g42450, mitochondrial-like [Benincasa hispida])

HSP 1 Score: 970.3 bits (2507), Expect = 6.5e-279
Identity = 465/516 (90.12%), Positives = 493/516 (95.54%), Query Frame = 0

Query: 1   MKPITIWRKSFTSSCSQLIKLKANGDGAVGTPPLQSKTAPFPRSELDARQLFDKMPHLNV 60
           MKPI++WRKSFTSSC + IKLK +G GAVGTPPLQS  APFPRSEL+A QLFDK PHLNV
Sbjct: 1   MKPISMWRKSFTSSCRKSIKLKPDGHGAVGTPPLQSNAAPFPRSELNAHQLFDKTPHLNV 60

Query: 61  VSATAQIGRCARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSATALGDIDIGKQLHVC 120
           +SAT +IGRCARQHQHEEAL LFSAM +LNLRPNEFTFGTVIHSATALGDIDIGKQLHVC
Sbjct: 61  ISATVEIGRCARQHQHEEALYLFSAMLMLNLRPNEFTFGTVIHSATALGDIDIGKQLHVC 120

Query: 121 AIKNGLHSNVFVGSALLDLYVKFSVIEEAQTAFYDIKMPNVVSYTSLISGYLKMERIQDA 180
           A+K GLHSNVFVGSALL LYVK SVIEEAQ AF DIKMPNVVSYTSLISGYLKMERI DA
Sbjct: 121 ALKTGLHSNVFVGSALLHLYVKLSVIEEAQKAFDDIKMPNVVSYTSLISGYLKMERIHDA 180

Query: 181 LRVFDEMPERNVVSWNAMIGGFSQKGHNEDAVHLFIDMLREGFLPTQSTFPCAVCAAANI 240
           LRVFD+MPERN+VSWNAMIGGFSQKGHNEDAVHLFIDMLREGFLPTQSTFPCA+CAAANI
Sbjct: 181 LRVFDKMPERNIVSWNAMIGGFSQKGHNEDAVHLFIDMLREGFLPTQSTFPCAICAAANI 240

Query: 241 ASIGIGRSFHACAVKFLGKLDVFVSNSLISFYAKCGSMEDSLLVFNKLLDERNIVSWNAV 300
           ASIGIGRSFHACAVKFLGKLDVFVSNSLISFYAKCGSMEDSLLVFNKLLDERNIVSWNAV
Sbjct: 241 ASIGIGRSFHACAVKFLGKLDVFVSNSLISFYAKCGSMEDSLLVFNKLLDERNIVSWNAV 300

Query: 301 LCGYAQNGRGKEAIDFYQKMTFAGCKPNGVTLLSLLWACNHAGLVDEGYSYFNQARLDNP 360
           LCGYAQNGRGKEAIDFYQ+M FAGCKPNGVTLLSLLWACNHAGLVDEGY+YFNQARL+NP
Sbjct: 301 LCGYAQNGRGKEAIDFYQRMIFAGCKPNGVTLLSLLWACNHAGLVDEGYAYFNQARLNNP 360

Query: 361 SMLKPEHYACMVDLLSRSGQFKQAEEFIHDLPFDPGIGFWKALLGGCQIHSNVELGEFAA 420
           ++LKPEHYACMVDLLSRSGQF++A+EFIHDLPFDPGIGFWKALLGGCQIHSNVELGE AA
Sbjct: 361 NLLKPEHYACMVDLLSRSGQFRRAKEFIHDLPFDPGIGFWKALLGGCQIHSNVELGELAA 420

Query: 421 QRILALDPGDVSSYVMMSNAHSAAGKWQSVSILRREMKEKGMKRIPGCSWIEIRSKVHVF 480
           Q+ILALDPGDVSSYVMMSNAHSAAGKWQ+VSILRR+MKEKG+KRIPGCSWI++RSKVH+F
Sbjct: 421 QKILALDPGDVSSYVMMSNAHSAAGKWQNVSILRRKMKEKGLKRIPGCSWIDVRSKVHIF 480

Query: 481 VTGDKNHHQKDEIYSALRFFVEHLKEREDFNFLSNS 517
           VT DKNH QKDEIY ALRFFVEHLKE EDFNFLS+S
Sbjct: 481 VTSDKNHRQKDEIYVALRFFVEHLKESEDFNFLSDS 516
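
The percentages in an HSP summary line can be recomputed from the raw match counts. The sketch below parses the summary line of the alignment above; `hsp_percentages` and the regular expression are illustrative assumptions about the line layout, not a parser provided by BLAST.

```python
import re

# Matches a summary line such as
# "Identity = 465/516 (90.12%), Positives = 493/516 (95.54%), Query Frame = 0".
# "Posi?tives" also tolerates the spelling "Postives".
HSP_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_LINE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: " + line)
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )

line = "Identity = 465/516 (90.12%), Positives = 493/516 (95.54%), Query Frame = 0"
print(hsp_percentages(line))
```

Identity counts exact residue matches, while positives also include the conservative substitutions marked with `+` in the middle line of each alignment block.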

BLAST of Cla97C04G075190 vs. NCBI nr
Match: XP_022949636.1 (pentatricopeptide repeat-containing protein At5g42450, mitochondrial-like [Cucurbita moschata])

HSP 1 Score: 959.1 bits (2478), Expect = 1.5e-275
Identity = 466/514 (90.66%), Positives = 489/514 (95.14%), Query Frame = 0

Query: 4   ITIWRKSF-TSSCSQLIKLKANGDGAVGTPPLQSKTAPFPRSELDARQLFDKMPHLNVVS 63
           I+IWR+S+ +SSC QLIKLKA+  GAV TP +QSK APFPRSELDARQLFDKMP LNVVS
Sbjct: 2   ISIWRRSYVSSSCRQLIKLKADAYGAVVTPAVQSKKAPFPRSELDARQLFDKMPQLNVVS 61

Query: 64  ATAQIGRCARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSATALGDIDIGKQLHVCAI 123
           AT+ IGRCARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSATALGDIDIGKQLH CAI
Sbjct: 62  ATSMIGRCARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSATALGDIDIGKQLHACAI 121

Query: 124 KNGLHSNVFVGSALLDLYVKFSVIEEAQTAFYDIKMPNVVSYTSLISGYLKMERIQDALR 183
           K GLHSNVFVGSA LDLYVK SVIEEAQ AF DIKMPNVVSYTSLISGYLK E+IQDALR
Sbjct: 122 KIGLHSNVFVGSAALDLYVKLSVIEEAQRAFDDIKMPNVVSYTSLISGYLKTEKIQDALR 181

Query: 184 VFDEMPERNVVSWNAMIGGFSQKGHNEDAVHLFIDMLREGFLPTQSTFPCAVCAAANIAS 243
           VFDEMPERN+VSWNAMIGGFSQKGHNEDAVHLFIDML EGFLPTQSTFPCA+CAAANIAS
Sbjct: 182 VFDEMPERNIVSWNAMIGGFSQKGHNEDAVHLFIDMLTEGFLPTQSTFPCALCAAANIAS 241

Query: 244 IGIGRSFHACAVKFLGKLDVFVSNSLISFYAKCGSMEDSLLVFNKLLDERNIVSWNAVLC 303
           IGIGRSFHACA+KFLG LDVFVSNSLISFYAKCGSMEDSLLVFNKLLD+RN VSWNAVLC
Sbjct: 242 IGIGRSFHACAIKFLGDLDVFVSNSLISFYAKCGSMEDSLLVFNKLLDKRNTVSWNAVLC 301

Query: 304 GYAQNGRGKEAIDFYQKMTFAGCKPNGVTLLSLLWACNHAGLVDEGYSYFNQARLDNPSM 363
           GYAQNGRGKEAIDFYQ+M+F+GCKPNGVTLLSLLWACNHAGLVDEGYSYFNQA+L+NPS+
Sbjct: 302 GYAQNGRGKEAIDFYQRMSFSGCKPNGVTLLSLLWACNHAGLVDEGYSYFNQAKLENPSL 361

Query: 364 LKPEHYACMVDLLSRSGQFKQAEEFIHDLPFDPGIGFWKALLGGCQIHSNVELGEFAAQR 423
           LKPEHYACMVDLLSRSGQF++AEEFIHDLPFDPGIGFWKALLGGCQIHSN+ELGE AAQR
Sbjct: 362 LKPEHYACMVDLLSRSGQFRRAEEFIHDLPFDPGIGFWKALLGGCQIHSNMELGELAAQR 421

Query: 424 ILALDPGDVSSYVMMSNAHSAAGKWQSVSILRREMKEKGMKRIPGCSWIEIRSKVHVFVT 483
           ILALDP DVSSYVMMSNAHSAAGKW SVSILRREMKEKGMKRIPGCSWIEIRSKVHVFVT
Sbjct: 422 ILALDPSDVSSYVMMSNAHSAAGKWHSVSILRREMKEKGMKRIPGCSWIEIRSKVHVFVT 481

Query: 484 GDKNHHQKDEIYSALRFFVEHLKEREDFNFLSNS 517
           GDKNHHQKDEI S L+FFVEHLKE EDFNFLS+S
Sbjct: 482 GDKNHHQKDEICSTLKFFVEHLKESEDFNFLSDS 515

BLAST of Cla97C04G075190 vs. NCBI nr
Match: KAG6603678.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 956.4 bits (2471), Expect = 9.7e-275
Identity = 464/514 (90.27%), Positives = 487/514 (94.75%), Query Frame = 0

Query: 4   ITIWRKSF-TSSCSQLIKLKANGDGAVGTPPLQSKTAPFPRSELDARQLFDKMPHLNVVS 63
           I IWR+S+ +SSC QL+KLKA+  GAV TP  QSK APFPR ELDARQLFDKMP LNVVS
Sbjct: 2   IPIWRRSYVSSSCRQLMKLKADAHGAVVTPAEQSKKAPFPRGELDARQLFDKMPQLNVVS 61

Query: 64  ATAQIGRCARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSATALGDIDIGKQLHVCAI 123
           ATA IGRCARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSATALGDID+GKQLH CAI
Sbjct: 62  ATAMIGRCARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSATALGDIDVGKQLHACAI 121

Query: 124 KNGLHSNVFVGSALLDLYVKFSVIEEAQTAFYDIKMPNVVSYTSLISGYLKMERIQDALR 183
           K GLHSNVFVGSA LDLYVK SVIEEAQ AF DIKMPNVVSYTSLISGYLK E+IQDALR
Sbjct: 122 KIGLHSNVFVGSAALDLYVKLSVIEEAQRAFDDIKMPNVVSYTSLISGYLKTEKIQDALR 181

Query: 184 VFDEMPERNVVSWNAMIGGFSQKGHNEDAVHLFIDMLREGFLPTQSTFPCAVCAAANIAS 243
           VFDEMPERN+VSWNA+IGGFSQKGHNEDAVHLFIDMLREGFLPTQSTFPCA+CAAANIAS
Sbjct: 182 VFDEMPERNIVSWNAVIGGFSQKGHNEDAVHLFIDMLREGFLPTQSTFPCALCAAANIAS 241

Query: 244 IGIGRSFHACAVKFLGKLDVFVSNSLISFYAKCGSMEDSLLVFNKLLDERNIVSWNAVLC 303
           IGIGRSFHACA+KFLG LDVFVSNSLISFYAKCGSMEDSLLVFNKLLD+RN VSWNAVLC
Sbjct: 242 IGIGRSFHACAIKFLGDLDVFVSNSLISFYAKCGSMEDSLLVFNKLLDKRNTVSWNAVLC 301

Query: 304 GYAQNGRGKEAIDFYQKMTFAGCKPNGVTLLSLLWACNHAGLVDEGYSYFNQARLDNPSM 363
           GYAQNGRGKEAIDFYQ+M+F+GCKPNGVTLLSLLWACNHAGLVDEGYSYFNQA+L+NPS+
Sbjct: 302 GYAQNGRGKEAIDFYQRMSFSGCKPNGVTLLSLLWACNHAGLVDEGYSYFNQAKLENPSL 361

Query: 364 LKPEHYACMVDLLSRSGQFKQAEEFIHDLPFDPGIGFWKALLGGCQIHSNVELGEFAAQR 423
           LKPEHYACMVDLLSRSGQF++AEEFIHDLPFDPGIGFWKALLGGCQIHSN+ELGE AAQR
Sbjct: 362 LKPEHYACMVDLLSRSGQFRRAEEFIHDLPFDPGIGFWKALLGGCQIHSNMELGELAAQR 421

Query: 424 ILALDPGDVSSYVMMSNAHSAAGKWQSVSILRREMKEKGMKRIPGCSWIEIRSKVHVFVT 483
           ILALDP DVSSYVMMSNAHSAAGKW SVSILRREMKEKGMKRIPGCSWIEIRSKVHVFVT
Sbjct: 422 ILALDPSDVSSYVMMSNAHSAAGKWHSVSILRREMKEKGMKRIPGCSWIEIRSKVHVFVT 481

Query: 484 GDKNHHQKDEIYSALRFFVEHLKEREDFNFLSNS 517
           GDKNHHQKDEI S L+FFVEHLKE EDFNFLS+S
Sbjct: 482 GDKNHHQKDEICSTLKFFVEHLKESEDFNFLSDS 515

BLAST of Cla97C04G075190 vs. NCBI nr
Match: XP_004142091.1 (pentatricopeptide repeat-containing protein At5g42450, mitochondrial [Cucumis sativus] >KGN54220.1 hypothetical protein Csa_017944 [Cucumis sativus])

HSP 1 Score: 949.5 bits (2453), Expect = 1.2e-272
Identity = 461/516 (89.34%), Positives = 480/516 (93.02%), Query Frame = 0

Query: 1   MKPITIWRKSFTSSCSQLIKLKANGDGAVGTPPLQSKTAPFPRSELDARQLFDKMPHLNV 60
           MKPITIWRK  TSSC+QL KLK +G+ AVGTPPL  +   F   +LDARQL+DKMPHLNV
Sbjct: 1   MKPITIWRKPLTSSCTQLTKLKPDGNAAVGTPPLPFQATSFSGGQLDARQLYDKMPHLNV 60

Query: 61  VSATAQIGRCARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSATALGDIDIGKQLHVC 120
           VSAT  IGRCARQHQHEEAL LFSAM VLNLRPNEFTFGTVI S  ALGDI IGKQLHVC
Sbjct: 61  VSATTAIGRCARQHQHEEALSLFSAMLVLNLRPNEFTFGTVIQSPKALGDIHIGKQLHVC 120

Query: 121 AIKNGLHSNVFVGSALLDLYVKFSVIEEAQTAFYDIKMPNVVSYTSLISGYLKMERIQDA 180
           AIK GLHSNVFVGSALLDLYVK  VIEEAQ AF DIKMPNVVSYTSLISGYLK+ERI+DA
Sbjct: 121 AIKTGLHSNVFVGSALLDLYVKVGVIEEAQRAFEDIKMPNVVSYTSLISGYLKIERIRDA 180

Query: 181 LRVFDEMPERNVVSWNAMIGGFSQKGHNEDAVHLFIDMLREGFLPTQSTFPCAVCAAANI 240
           LRVFDEMPERNVVSWN+MIGGFSQKGHNEDAVHLFIDMLREG LPTQSTFPCA+CAAANI
Sbjct: 181 LRVFDEMPERNVVSWNSMIGGFSQKGHNEDAVHLFIDMLREGILPTQSTFPCAICAAANI 240

Query: 241 ASIGIGRSFHACAVKFLGKLDVFVSNSLISFYAKCGSMEDSLLVFNKLLDERNIVSWNAV 300
           ASIGIGRSFHACAVKF GKLDVFVSNSLISFYAKCGSMEDSLLVFNKLLDERN+VSWNAV
Sbjct: 241 ASIGIGRSFHACAVKFFGKLDVFVSNSLISFYAKCGSMEDSLLVFNKLLDERNVVSWNAV 300

Query: 301 LCGYAQNGRGKEAIDFYQKMTFAGCKPNGVTLLSLLWACNHAGLVDEGYSYFNQARLDNP 360
           L G+AQNGRGKEAIDFYQ+M  AGCKPN VT LSLLWACNHAGLVDEGYSYFNQARLDNP
Sbjct: 301 LSGFAQNGRGKEAIDFYQRMILAGCKPNAVTFLSLLWACNHAGLVDEGYSYFNQARLDNP 360

Query: 361 SMLKPEHYACMVDLLSRSGQFKQAEEFIHDLPFDPGIGFWKALLGGCQIHSNVELGEFAA 420
           ++LK EHYACMVDLLSRSGQFK+AEEFIHDLPFDPGIGFWKALLGGCQIHSNVELGE AA
Sbjct: 361 NLLKAEHYACMVDLLSRSGQFKRAEEFIHDLPFDPGIGFWKALLGGCQIHSNVELGELAA 420

Query: 421 QRILALDPGDVSSYVMMSNAHSAAGKWQSVSILRREMKEKGMKRIPGCSWIEIRSKVHVF 480
           QRILALDPGDVSSYVMMSNAHSAAGKW SVSILRREMKEKG+KRIPGCSWIEIRSKVHVF
Sbjct: 421 QRILALDPGDVSSYVMMSNAHSAAGKWHSVSILRREMKEKGLKRIPGCSWIEIRSKVHVF 480

Query: 481 VTGDKNHHQKDEIYSALRFFVEHLKEREDFNFLSNS 517
           VTGDKNHHQKDEIYSAL+FFVEHLKEREDFNFLS+S
Sbjct: 481 VTGDKNHHQKDEIYSALKFFVEHLKEREDFNFLSDS 516

BLAST of Cla97C04G075190 vs. NCBI nr
Match: XP_022977247.1 (pentatricopeptide repeat-containing protein At5g42450, mitochondrial-like [Cucurbita maxima])

HSP 1 Score: 948.0 bits (2449), Expect = 3.5e-272
Identity = 462/514 (89.88%), Positives = 484/514 (94.16%), Query Frame = 0

Query: 4   ITIWRKSF-TSSCSQLIKLKANGDGAVGTPPLQSKTAPFPRSELDARQLFDKMPHLNVVS 63
           I IWR+S+ +SSC QLIKLKA+  GAV TP +QSK  PFPR ELDARQLFDKMP LNVVS
Sbjct: 2   IPIWRRSYVSSSCRQLIKLKADAHGAVVTPAVQSKKDPFPRGELDARQLFDKMPQLNVVS 61

Query: 64  ATAQIGRCARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSATALGDIDIGKQLHVCAI 123
           ATA IG CARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSATALGDIDIGKQLH CAI
Sbjct: 62  ATAMIGCCARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSATALGDIDIGKQLHACAI 121

Query: 124 KNGLHSNVFVGSALLDLYVKFSVIEEAQTAFYDIKMPNVVSYTSLISGYLKMERIQDALR 183
           K GLHSNVFVGSA LDLYVK SVIEEAQ AF DIKMPNVVSYTSLISGYLK E+IQ ALR
Sbjct: 122 KIGLHSNVFVGSAALDLYVKLSVIEEAQRAFDDIKMPNVVSYTSLISGYLKTEKIQAALR 181

Query: 184 VFDEMPERNVVSWNAMIGGFSQKGHNEDAVHLFIDMLREGFLPTQSTFPCAVCAAANIAS 243
           VFD+MPERN+VSWNAMIGGFSQKGHNEDAVHLFIDMLREGFLPTQSTFPCA+CAAANIAS
Sbjct: 182 VFDKMPERNIVSWNAMIGGFSQKGHNEDAVHLFIDMLREGFLPTQSTFPCALCAAANIAS 241

Query: 244 IGIGRSFHACAVKFLGKLDVFVSNSLISFYAKCGSMEDSLLVFNKLLDERNIVSWNAVLC 303
           IGIGRSFHACA+KFLG LDVFVSNSLISFYAKCGSMEDSLLVFNKLLD+RN VSWNAVLC
Sbjct: 242 IGIGRSFHACAIKFLGDLDVFVSNSLISFYAKCGSMEDSLLVFNKLLDKRNTVSWNAVLC 301

Query: 304 GYAQNGRGKEAIDFYQKMTFAGCKPNGVTLLSLLWACNHAGLVDEGYSYFNQARLDNPSM 363
           GYAQNGRGKEAIDFYQ+M+F+GCKPNGVTLLSLLWACNHAGLVDEGYSYFNQA+L+NPS+
Sbjct: 302 GYAQNGRGKEAIDFYQRMSFSGCKPNGVTLLSLLWACNHAGLVDEGYSYFNQAKLENPSL 361

Query: 364 LKPEHYACMVDLLSRSGQFKQAEEFIHDLPFDPGIGFWKALLGGCQIHSNVELGEFAAQR 423
           LKPEHYACMVDLLSRSGQFK+AEEFIHDLPFDPGIGFWKALLGGCQIHSN+ELGE AAQR
Sbjct: 362 LKPEHYACMVDLLSRSGQFKRAEEFIHDLPFDPGIGFWKALLGGCQIHSNMELGELAAQR 421

Query: 424 ILALDPGDVSSYVMMSNAHSAAGKWQSVSILRREMKEKGMKRIPGCSWIEIRSKVHVFVT 483
           ILALDP DVSSYVMMSNAHSAAGKW SVSILRREMKEKGMKRIPGCSWIE RSKVHVFVT
Sbjct: 422 ILALDPSDVSSYVMMSNAHSAAGKWHSVSILRREMKEKGMKRIPGCSWIESRSKVHVFVT 481

Query: 484 GDKNHHQKDEIYSALRFFVEHLKEREDFNFLSNS 517
           GDK+HHQKDEI S L+FFVEHLKE EDFNFLS+S
Sbjct: 482 GDKSHHQKDEICSTLKFFVEHLKESEDFNFLSDS 515

BLAST of Cla97C04G075190 vs. ExPASy Swiss-Prot
Match: Q9FIH2 (Pentatricopeptide repeat-containing protein At5g42450, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E102 PE=2 SV=1)

HSP 1 Score: 580.1 bits (1494), Expect = 2.5e-164
Identity = 282/468 (60.26%), Positives = 361/468 (77.14%), Query Frame = 0

Query: 47  DARQLFDKMPHLNVVSATAQIGRCARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSAT 106
           +A ++FD++P L+V+SATA IGR  ++ +H EA   F  +  L +RPNEFTFGTVI S+T
Sbjct: 45  NAHKVFDEIPELDVISATAVIGRFVKESRHVEASQAFKRLLCLGIRPNEFTFGTVIGSST 104

Query: 107 ALGDIDIGKQLHVCAIKNGLHSNVFVGSALLDLYVKFSVIEEAQTAFYDIKMPNVVSYTS 166
              D+ +GKQLH  A+K GL SNVFVGSA+L+ YVK S + +A+  F D + PNVVS T+
Sbjct: 105 TSRDVKLGKQLHCYALKMGLASNVFVGSAVLNCYVKLSTLTDARRCFDDTRDPNVVSITN 164

Query: 167 LISGYLKMERIQDALRVFDEMPERNVVSWNAMIGGFSQKGHNEDAVHLFIDMLREG-FLP 226
           LISGYLK    ++AL +F  MPER+VV+WNA+IGGFSQ G NE+AV+ F+DMLREG  +P
Sbjct: 165 LISGYLKKHEFEEALSLFRAMPERSVVTWNAVIGGFSQTGRNEEAVNTFVDMLREGVVIP 224

Query: 227 TQSTFPCAVCAAANIASIGIGRSFHACAVKFLGK-LDVFVSNSLISFYAKCGSMEDSLLV 286
            +STFPCA+ A +NIAS G G+S HACA+KFLGK  +VFV NSLISFY+KCG+MEDSLL 
Sbjct: 225 NESTFPCAITAISNIASHGAGKSIHACAIKFLGKRFNVFVWNSLISFYSKCGNMEDSLLA 284

Query: 287 FNKLLDE-RNIVSWNAVLCGYAQNGRGKEAIDFYQKMT-FAGCKPNGVTLLSLLWACNHA 346
           FNKL +E RNIVSWN+++ GYA NGRG+EA+  ++KM      +PN VT+L +L+ACNHA
Sbjct: 285 FNKLEEEQRNIVSWNSMIWGYAHNGRGEEAVAMFEKMVKDTNLRPNNVTILGVLFACNHA 344

Query: 347 GLVDEGYSYFNQA--RLDNPSMLKPEHYACMVDLLSRSGQFKQAEEFIHDLPFDPGIGFW 406
           GL+ EGY YFN+A    D+P++L+ EHYACMVD+LSRSG+FK+AEE I  +P DPGIGFW
Sbjct: 345 GLIQEGYMYFNKAVNDYDDPNLLELEHYACMVDMLSRSGRFKEAEELIKSMPLDPGIGFW 404

Query: 407 KALLGGCQIHSNVELGEFAAQRILALDPGDVSSYVMMSNAHSAAGKWQSVSILRREMKEK 466
           KALLGGCQIHSN  L + AA +IL LDP DVSSYVM+SNA+SA   WQ+VS++RR+MKE 
Sbjct: 405 KALLGGCQIHSNKRLAKLAASKILELDPRDVSSYVMLSNAYSAMENWQNVSLIRRKMKET 464

Query: 467 GMKRIPGCSWIEIRSKVHVFVTGDKNHHQKDEIYSALRFFVEHLKERE 509
           G+KR  GCSWIE+R ++ VFV  DKN+  KDE+Y  L    +HL+E E
Sbjct: 465 GLKRFTGCSWIEVRDQIRVFVNADKNNELKDEVYRMLALVSQHLEENE 512

BLAST of Cla97C04G075190 vs. ExPASy Swiss-Prot
Match: Q9SIT7 (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 365.2 bits (936), Expect = 1.3e-99
Identity = 179/470 (38.09%), Positives = 290/470 (61.70%), Query Frame = 0

Query: 47  DARQLFDKMPHLNVVSATAQIGRCARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSAT 106
           DA+++FD+M   NVVS  + I    +     EAL +F  M    + P+E T  +VI +  
Sbjct: 205 DAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQMMLESRVEPDEVTLASVISACA 264

Query: 107 ALGDIDIGKQLHVCAIKNG-LHSNVFVGSALLDLYVKFSVIEEAQTAFYDIKMPNVVSYT 166
           +L  I +G+++H   +KN  L +++ + +A +D+Y K S I+EA+  F  + + NV++ T
Sbjct: 265 SLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIRNVIAET 324

Query: 167 SLISGYLKMERIQDALRVFDEMPERNVVSWNAMIGGFSQKGHNEDAVHLFIDMLREGFLP 226
           S+ISGY      + A  +F +M ERNVVSWNA+I G++Q G NE+A+ LF  + RE   P
Sbjct: 325 SMISGYAMAASTKAARLMFTKMAERNVVSWNALIAGYTQNGENEEALSLFCLLKRESVCP 384

Query: 227 TQSTFPCAVCAAANIASIGIGRSFHACAVKFLGKL------DVFVSNSLISFYAKCGSME 286
           T  +F   + A A++A + +G   H   +K   K       D+FV NSLI  Y KCG +E
Sbjct: 385 THYSFANILKACADLAELHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLIDMYVKCGCVE 444

Query: 287 DSLLVFNKLLDERNIVSWNAVLCGYAQNGRGKEAIDFYQKMTFAGCKPNGVTLLSLLWAC 346
           +  LVF K++ ER+ VSWNA++ G+AQNG G EA++ +++M  +G KP+ +T++ +L AC
Sbjct: 445 EGYLVFRKMM-ERDCVSWNAMIIGFAQNGYGNEALELFREMLESGEKPDHITMIGVLSAC 504

Query: 347 NHAGLVDEGYSYFNQARLDNPSMLKPEHYACMVDLLSRSGQFKQAEEFIHDLPFDPGIGF 406
            HAG V+EG  YF+    D       +HY CMVDLL R+G  ++A+  I ++P  P    
Sbjct: 505 GHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIEEMPMQPDSVI 564

Query: 407 WKALLGGCQIHSNVELGEFAAQRILALDPGDVSSYVMMSNAHSAAGKWQSVSILRREMKE 466
           W +LL  C++H N+ LG++ A+++L ++P +   YV++SN ++  GKW+ V  +R+ M++
Sbjct: 565 WGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPYVLLSNMYAELGKWEDVMNVRKSMRK 624

Query: 467 KGMKRIPGCSWIEIRSKVHVFVTGDKNHHQKDEIYSALRFFVEHLKERED 510
           +G+ + PGCSWI+I+   HVF+  DK+H +K +I+S L   +  ++  +D
Sbjct: 625 EGVTKQPGCSWIKIQGHDHVFMVKDKSHPRKKQIHSLLDILIAEMRPEQD 673

BLAST of Cla97C04G075190 vs. ExPASy Swiss-Prot
Match: Q9FWA6 (Pentatricopeptide repeat-containing protein At3g02330, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E90 PE=2 SV=2)

HSP 1 Score: 343.2 bits (879), Expect = 5.1e-93
Identity = 177/468 (37.82%), Positives = 283/468 (60.47%), Query Frame = 0

Query: 47  DARQLFDKMPHLNVVSATAQIGRCARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSAT 106
           +A ++FD+M   + VS  A I    +  +  E L LF +M    + P+EFTFG+++ + T
Sbjct: 435 EAFRVFDEMRRRDAVSWNAIIAAHEQNGKGYETLFLFVSMLRSRIEPDEFTFGSILKACT 494

Query: 107 ALGDIDIGKQLHVCAIKNGLHSNVFVGSALLDLYVKFSVIEEAQTAFYDIKMPNVVSYTS 166
             G +  G ++H   +K+G+ SN  VG +L+D+Y K  +IEEA+      K+ +     +
Sbjct: 495 G-GSLGYGMEIHSSIVKSGMASNSSVGCSLIDMYSKCGMIEEAE------KIHSRFFQRA 554

Query: 167 LISGYLKMERIQDALRVFDEMPERNVVSWNAMIGGFSQKGHNEDAVHLFIDMLREGFLPT 226
            +SG   ME ++   ++ ++  +   VSWN++I G+  K  +EDA  LF  M+  G  P 
Sbjct: 555 NVSG--TMEELE---KMHNKRLQEMCVSWNSIISGYVMKEQSEDAQMLFTRMMEMGITPD 614

Query: 227 QSTFPCAVCAAANIASIGIGRSFHACAVKFLGKLDVFVSNSLISFYAKCGSMEDSLLVFN 286
           + T+   +   AN+AS G+G+  HA  +K   + DV++ ++L+  Y+KCG + DS L+F 
Sbjct: 615 KFTYATVLDTCANLASAGLGKQIHAQVIKKELQSDVYICSTLVDMYSKCGDLHDSRLMFE 674

Query: 287 KLLDERNIVSWNAVLCGYAQNGRGKEAIDFYQKMTFAGCKPNGVTLLSLLWACNHAGLVD 346
           K L  R+ V+WNA++CGYA +G+G+EAI  +++M     KPN VT +S+L AC H GL+D
Sbjct: 675 KSL-RRDFVTWNAMICGYAHHGKGEEAIQLFERMILENIKPNHVTFISILRACAHMGLID 734

Query: 347 EGYSYFNQARLDNPSMLKPEHYACMVDLLSRSGQFKQAEEFIHDLPFDPGIGFWKALLGG 406
           +G  YF   + D     +  HY+ MVD+L +SG+ K+A E I ++PF+     W+ LLG 
Sbjct: 735 KGLEYFYMMKRDYGLDPQLPHYSNMVDILGKSGKVKRALELIREMPFEADDVIWRTLLGV 794

Query: 407 CQIH-SNVELGEFAAQRILALDPGDVSSYVMMSNAHSAAGKWQSVSILRREMKEKGMKRI 466
           C IH +NVE+ E A   +L LDP D S+Y ++SN ++ AG W+ VS LRR M+   +K+ 
Sbjct: 795 CTIHRNNVEVAEEATAALLRLDPQDSSAYTLLSNVYADAGMWEKVSDLRRNMRGFKLKKE 854

Query: 467 PGCSWIEIRSKVHVFVTGDKNHHQKDEIYSALRFFVEHLKEREDFNFL 514
           PGCSW+E++ ++HVF+ GDK H + +EIY  L      +K  +D +F+
Sbjct: 855 PGCSWVELKDELHVFLVGDKAHPRWEEIYEELGLIYSEMKPFDDSSFV 889

BLAST of Cla97C04G075190 vs. ExPASy Swiss-Prot
Match: Q9SI53 (Pentatricopeptide repeat-containing protein At2g03880, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H44 PE=2 SV=1)

HSP 1 Score: 339.0 bits (868), Expect = 9.6e-92
Identity = 165/458 (36.03%), Positives = 269/458 (58.73%), Query Frame = 0

Query: 47  DARQLFDKMPHLNVVSATAQIGRCARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSAT 106
           DA QLFD+MP  NV+S T  I   ++   H++AL L   M   N+RPN +T+ +V+ S  
Sbjct: 114 DAHQLFDQMPQRNVISWTTMISAYSKCKIHQKALELLVLMLRDNVRPNVYTYSSVLRSCN 173

Query: 107 ALGDIDIGKQLHVCAIKNGLHSNVFVGSALLDLYVKFSVIEEAQTAFYDIKMPNVVSYTS 166
            + D+   + LH   IK GL S+VFV SAL+D++ K    E                   
Sbjct: 174 GMSDV---RMLHCGIIKEGLESDVFVRSALIDVFAKLGEPE------------------- 233

Query: 167 LISGYLKMERIQDALRVFDEMPERNVVSWNAMIGGFSQKGHNEDAVHLFIDMLREGFLPT 226
                       DAL VFDEM   + + WN++IGGF+Q   ++ A+ LF  M R GF+  
Sbjct: 234 ------------DALSVFDEMVTGDAIVWNSIIGGFAQNSRSDVALELFKRMKRAGFIAE 293

Query: 227 QSTFPCAVCAAANIASIGIGRSFHACAVKFLGKLDVFVSNSLISFYAKCGSMEDSLLVFN 286
           Q+T    + A   +A + +G   H   VK+    D+ ++N+L+  Y KCGS+ED+L VFN
Sbjct: 294 QATLTSVLRACTGLALLELGMQAHVHIVKY--DQDLILNNALVDMYCKCGSLEDALRVFN 353

Query: 287 KLLDERNIVSWNAVLCGYAQNGRGKEAIDFYQKMTFAGCKPNGVTLLSLLWACNHAGLVD 346
           + + ER++++W+ ++ G AQNG  +EA+  +++M  +G KPN +T++ +L+AC+HAGL++
Sbjct: 354 Q-MKERDVITWSTMISGLAQNGYSQEALKLFERMKSSGTKPNYITIVGVLFACSHAGLLE 413

Query: 347 EGYSYFNQARLDNPSMLKPEHYACMVDLLSRSGQFKQAEEFIHDLPFDPGIGFWKALLGG 406
           +G+ YF   +         EHY CM+DLL ++G+   A + ++++  +P    W+ LLG 
Sbjct: 414 DGWYYFRSMKKLYGIDPVREHYGCMIDLLGKAGKLDDAVKLLNEMECEPDAVTWRTLLGA 473

Query: 407 CQIHSNVELGEFAAQRILALDPGDVSSYVMMSNAHSAAGKWQSVSILRREMKEKGMKRIP 466
           C++  N+ L E+AA++++ALDP D  +Y ++SN ++ + KW SV  +R  M+++G+K+ P
Sbjct: 474 CRVQRNMVLAEYAAKKVIALDPEDAGTYTLLSNIYANSQKWDSVEEIRTRMRDRGIKKEP 533

Query: 467 GCSWIEIRSKVHVFVTGDKNHHQKDEIYSALRFFVEHL 505
           GCSWIE+  ++H F+ GD +H Q  E+   L   +  L
Sbjct: 534 GCSWIEVNKQIHAFIIGDNSHPQIVEVSKKLNQLIHRL 534

BLAST of Cla97C04G075190 vs. ExPASy Swiss-Prot
Match: Q9LIQ7 (Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H87 PE=3 SV=1)

HSP 1 Score: 334.7 bits (857), Expect = 1.8e-90
Identity = 171/462 (37.01%), Positives = 268/462 (58.01%), Query Frame = 0

Query: 47  DARQLFDKMPHLNVVSATAQIGRCARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSAT 106
           +AR++F+KMP  + V+ T  I   ++  +  +AL  F+ M      PNEFT  +VI +A 
Sbjct: 113 EARKVFEKMPQRDFVTWTTLISGYSQHDRPCDALLFFNQMLRFGYSPNEFTLSSVIKAAA 172

Query: 107 ALGDIDIGKQLHVCAIKNGLHSNVFVGSALLDLYVKFSVIEEAQTAFYDIKMPNVVSYTS 166
           A      G QLH   +K G  SNV VGSALLDLY ++ ++++AQ                
Sbjct: 173 AERRGCCGHQLHGFCVKCGFDSNVHVGSALLDLYTRYGLMDDAQL--------------- 232

Query: 167 LISGYLKMERIQDALRVFDEMPERNVVSWNAMIGGFSQKGHNEDAVHLFIDMLREGFLPT 226
                           VFD +  RN VSWNA+I G +++   E A+ LF  MLR+GF P+
Sbjct: 233 ----------------VFDALESRNDVSWNALIAGHARRSGTEKALELFQGMLRDGFRPS 292

Query: 227 QSTFPCAVCAAANIASIGIGRSFHACAVKFLGKLDVFVSNSLISFYAKCGSMEDSLLVFN 286
             ++     A ++   +  G+  HA  +K   KL  F  N+L+  YAK GS+ D+  +F+
Sbjct: 293 HFSYASLFGACSSTGFLEQGKWVHAYMIKSGEKLVAFAGNTLLDMYAKSGSIHDARKIFD 352

Query: 287 KLLDERNIVSWNAVLCGYAQNGRGKEAIDFYQKMTFAGCKPNGVTLLSLLWACNHAGLVD 346
           +L  +R++VSWN++L  YAQ+G GKEA+ ++++M   G +PN ++ LS+L AC+H+GL+D
Sbjct: 353 RLA-KRDVVSWNSLLTAYAQHGFGKEAVWWFEEMRRVGIRPNEISFLSVLTACSHSGLLD 412

Query: 347 EGYSYFNQARLDNPSMLKPE--HYACMVDLLSRSGQFKQAEEFIHDLPFDPGIGFWKALL 406
           EG+ Y+   + D    + PE  HY  +VDLL R+G   +A  FI ++P +P    WKALL
Sbjct: 413 EGWHYYELMKKDG---IVPEAWHYVTVVDLLGRAGDLNRALRFIEEMPIEPTAAIWKALL 472

Query: 407 GGCQIHSNVELGEFAAQRILALDPGDVSSYVMMSNAHSAAGKWQSVSILRREMKEKGMKR 466
             C++H N ELG +AA+ +  LDP D   +V++ N +++ G+W   + +R++MKE G+K+
Sbjct: 473 NACRMHKNTELGAYAAEHVFELDPDDPGPHVILYNIYASGGRWNDAARVRKKMKESGVKK 532

Query: 467 IPGCSWIEIRSKVHVFVTGDKNHHQKDEIYSALRFFVEHLKE 507
            P CSW+EI + +H+FV  D+ H Q++EI       +  +KE
Sbjct: 533 EPACSWVEIENAIHMFVANDERHPQREEIARKWEEVLAKIKE 539

BLAST of Cla97C04G075190 vs. ExPASy TrEMBL
Match: A0A6J1GCJ3 (pentatricopeptide repeat-containing protein At5g42450, mitochondrial-like OS=Cucurbita moschata OX=3662 GN=LOC111452968 PE=4 SV=1)

HSP 1 Score: 959.1 bits (2478), Expect = 7.3e-276
Identity = 466/514 (90.66%), Positives = 489/514 (95.14%), Query Frame = 0

Query: 4   ITIWRKSF-TSSCSQLIKLKANGDGAVGTPPLQSKTAPFPRSELDARQLFDKMPHLNVVS 63
           I+IWR+S+ +SSC QLIKLKA+  GAV TP +QSK APFPRSELDARQLFDKMP LNVVS
Sbjct: 2   ISIWRRSYVSSSCRQLIKLKADAYGAVVTPAVQSKKAPFPRSELDARQLFDKMPQLNVVS 61

Query: 64  ATAQIGRCARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSATALGDIDIGKQLHVCAI 123
           AT+ IGRCARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSATALGDIDIGKQLH CAI
Sbjct: 62  ATSMIGRCARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSATALGDIDIGKQLHACAI 121

Query: 124 KNGLHSNVFVGSALLDLYVKFSVIEEAQTAFYDIKMPNVVSYTSLISGYLKMERIQDALR 183
           K GLHSNVFVGSA LDLYVK SVIEEAQ AF DIKMPNVVSYTSLISGYLK E+IQDALR
Sbjct: 122 KIGLHSNVFVGSAALDLYVKLSVIEEAQRAFDDIKMPNVVSYTSLISGYLKTEKIQDALR 181

Query: 184 VFDEMPERNVVSWNAMIGGFSQKGHNEDAVHLFIDMLREGFLPTQSTFPCAVCAAANIAS 243
           VFDEMPERN+VSWNAMIGGFSQKGHNEDAVHLFIDML EGFLPTQSTFPCA+CAAANIAS
Sbjct: 182 VFDEMPERNIVSWNAMIGGFSQKGHNEDAVHLFIDMLTEGFLPTQSTFPCALCAAANIAS 241

Query: 244 IGIGRSFHACAVKFLGKLDVFVSNSLISFYAKCGSMEDSLLVFNKLLDERNIVSWNAVLC 303
           IGIGRSFHACA+KFLG LDVFVSNSLISFYAKCGSMEDSLLVFNKLLD+RN VSWNAVLC
Sbjct: 242 IGIGRSFHACAIKFLGDLDVFVSNSLISFYAKCGSMEDSLLVFNKLLDKRNTVSWNAVLC 301

Query: 304 GYAQNGRGKEAIDFYQKMTFAGCKPNGVTLLSLLWACNHAGLVDEGYSYFNQARLDNPSM 363
           GYAQNGRGKEAIDFYQ+M+F+GCKPNGVTLLSLLWACNHAGLVDEGYSYFNQA+L+NPS+
Sbjct: 302 GYAQNGRGKEAIDFYQRMSFSGCKPNGVTLLSLLWACNHAGLVDEGYSYFNQAKLENPSL 361

Query: 364 LKPEHYACMVDLLSRSGQFKQAEEFIHDLPFDPGIGFWKALLGGCQIHSNVELGEFAAQR 423
           LKPEHYACMVDLLSRSGQF++AEEFIHDLPFDPGIGFWKALLGGCQIHSN+ELGE AAQR
Sbjct: 362 LKPEHYACMVDLLSRSGQFRRAEEFIHDLPFDPGIGFWKALLGGCQIHSNMELGELAAQR 421

Query: 424 ILALDPGDVSSYVMMSNAHSAAGKWQSVSILRREMKEKGMKRIPGCSWIEIRSKVHVFVT 483
           ILALDP DVSSYVMMSNAHSAAGKW SVSILRREMKEKGMKRIPGCSWIEIRSKVHVFVT
Sbjct: 422 ILALDPSDVSSYVMMSNAHSAAGKWHSVSILRREMKEKGMKRIPGCSWIEIRSKVHVFVT 481

Query: 484 GDKNHHQKDEIYSALRFFVEHLKEREDFNFLSNS 517
           GDKNHHQKDEI S L+FFVEHLKE EDFNFLS+S
Sbjct: 482 GDKNHHQKDEICSTLKFFVEHLKESEDFNFLSDS 515

BLAST of Cla97C04G075190 vs. ExPASy TrEMBL
Match: A0A0A0KX53 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G293300 PE=4 SV=1)

HSP 1 Score: 949.5 bits (2453), Expect = 5.8e-273
Identity = 461/516 (89.34%), Positives = 480/516 (93.02%), Query Frame = 0

Query: 1   MKPITIWRKSFTSSCSQLIKLKANGDGAVGTPPLQSKTAPFPRSELDARQLFDKMPHLNV 60
           MKPITIWRK  TSSC+QL KLK +G+ AVGTPPL  +   F   +LDARQL+DKMPHLNV
Sbjct: 1   MKPITIWRKPLTSSCTQLTKLKPDGNAAVGTPPLPFQATSFSGGQLDARQLYDKMPHLNV 60

Query: 61  VSATAQIGRCARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSATALGDIDIGKQLHVC 120
           VSAT  IGRCARQHQHEEAL LFSAM VLNLRPNEFTFGTVI S  ALGDI IGKQLHVC
Sbjct: 61  VSATTAIGRCARQHQHEEALSLFSAMLVLNLRPNEFTFGTVIQSPKALGDIHIGKQLHVC 120

Query: 121 AIKNGLHSNVFVGSALLDLYVKFSVIEEAQTAFYDIKMPNVVSYTSLISGYLKMERIQDA 180
           AIK GLHSNVFVGSALLDLYVK  VIEEAQ AF DIKMPNVVSYTSLISGYLK+ERI+DA
Sbjct: 121 AIKTGLHSNVFVGSALLDLYVKVGVIEEAQRAFEDIKMPNVVSYTSLISGYLKIERIRDA 180

Query: 181 LRVFDEMPERNVVSWNAMIGGFSQKGHNEDAVHLFIDMLREGFLPTQSTFPCAVCAAANI 240
           LRVFDEMPERNVVSWN+MIGGFSQKGHNEDAVHLFIDMLREG LPTQSTFPCA+CAAANI
Sbjct: 181 LRVFDEMPERNVVSWNSMIGGFSQKGHNEDAVHLFIDMLREGILPTQSTFPCAICAAANI 240

Query: 241 ASIGIGRSFHACAVKFLGKLDVFVSNSLISFYAKCGSMEDSLLVFNKLLDERNIVSWNAV 300
           ASIGIGRSFHACAVKF GKLDVFVSNSLISFYAKCGSMEDSLLVFNKLLDERN+VSWNAV
Sbjct: 241 ASIGIGRSFHACAVKFFGKLDVFVSNSLISFYAKCGSMEDSLLVFNKLLDERNVVSWNAV 300

Query: 301 LCGYAQNGRGKEAIDFYQKMTFAGCKPNGVTLLSLLWACNHAGLVDEGYSYFNQARLDNP 360
           L G+AQNGRGKEAIDFYQ+M  AGCKPN VT LSLLWACNHAGLVDEGYSYFNQARLDNP
Sbjct: 301 LSGFAQNGRGKEAIDFYQRMILAGCKPNAVTFLSLLWACNHAGLVDEGYSYFNQARLDNP 360

Query: 361 SMLKPEHYACMVDLLSRSGQFKQAEEFIHDLPFDPGIGFWKALLGGCQIHSNVELGEFAA 420
           ++LK EHYACMVDLLSRSGQFK+AEEFIHDLPFDPGIGFWKALLGGCQIHSNVELGE AA
Sbjct: 361 NLLKAEHYACMVDLLSRSGQFKRAEEFIHDLPFDPGIGFWKALLGGCQIHSNVELGELAA 420

Query: 421 QRILALDPGDVSSYVMMSNAHSAAGKWQSVSILRREMKEKGMKRIPGCSWIEIRSKVHVF 480
           QRILALDPGDVSSYVMMSNAHSAAGKW SVSILRREMKEKG+KRIPGCSWIEIRSKVHVF
Sbjct: 421 QRILALDPGDVSSYVMMSNAHSAAGKWHSVSILRREMKEKGLKRIPGCSWIEIRSKVHVF 480

Query: 481 VTGDKNHHQKDEIYSALRFFVEHLKEREDFNFLSNS 517
           VTGDKNHHQKDEIYSAL+FFVEHLKEREDFNFLS+S
Sbjct: 481 VTGDKNHHQKDEIYSALKFFVEHLKEREDFNFLSDS 516

BLAST of Cla97C04G075190 vs. ExPASy TrEMBL
Match: A0A6J1IHX7 (pentatricopeptide repeat-containing protein At5g42450, mitochondrial-like OS=Cucurbita maxima OX=3661 GN=LOC111477616 PE=4 SV=1)

HSP 1 Score: 948.0 bits (2449), Expect = 1.7e-272
Identity = 462/514 (89.88%), Positives = 484/514 (94.16%), Query Frame = 0

Query: 4   ITIWRKSF-TSSCSQLIKLKANGDGAVGTPPLQSKTAPFPRSELDARQLFDKMPHLNVVS 63
           I IWR+S+ +SSC QLIKLKA+  GAV TP +QSK  PFPR ELDARQLFDKMP LNVVS
Sbjct: 2   IPIWRRSYVSSSCRQLIKLKADAHGAVVTPAVQSKKDPFPRGELDARQLFDKMPQLNVVS 61

Query: 64  ATAQIGRCARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSATALGDIDIGKQLHVCAI 123
           ATA IG CARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSATALGDIDIGKQLH CAI
Sbjct: 62  ATAMIGCCARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSATALGDIDIGKQLHACAI 121

Query: 124 KNGLHSNVFVGSALLDLYVKFSVIEEAQTAFYDIKMPNVVSYTSLISGYLKMERIQDALR 183
           K GLHSNVFVGSA LDLYVK SVIEEAQ AF DIKMPNVVSYTSLISGYLK E+IQ ALR
Sbjct: 122 KIGLHSNVFVGSAALDLYVKLSVIEEAQRAFDDIKMPNVVSYTSLISGYLKTEKIQAALR 181

Query: 184 VFDEMPERNVVSWNAMIGGFSQKGHNEDAVHLFIDMLREGFLPTQSTFPCAVCAAANIAS 243
           VFD+MPERN+VSWNAMIGGFSQKGHNEDAVHLFIDMLREGFLPTQSTFPCA+CAAANIAS
Sbjct: 182 VFDKMPERNIVSWNAMIGGFSQKGHNEDAVHLFIDMLREGFLPTQSTFPCALCAAANIAS 241

Query: 244 IGIGRSFHACAVKFLGKLDVFVSNSLISFYAKCGSMEDSLLVFNKLLDERNIVSWNAVLC 303
           IGIGRSFHACA+KFLG LDVFVSNSLISFYAKCGSMEDSLLVFNKLLD+RN VSWNAVLC
Sbjct: 242 IGIGRSFHACAIKFLGDLDVFVSNSLISFYAKCGSMEDSLLVFNKLLDKRNTVSWNAVLC 301

Query: 304 GYAQNGRGKEAIDFYQKMTFAGCKPNGVTLLSLLWACNHAGLVDEGYSYFNQARLDNPSM 363
           GYAQNGRGKEAIDFYQ+M+F+GCKPNGVTLLSLLWACNHAGLVDEGYSYFNQA+L+NPS+
Sbjct: 302 GYAQNGRGKEAIDFYQRMSFSGCKPNGVTLLSLLWACNHAGLVDEGYSYFNQAKLENPSL 361

Query: 364 LKPEHYACMVDLLSRSGQFKQAEEFIHDLPFDPGIGFWKALLGGCQIHSNVELGEFAAQR 423
           LKPEHYACMVDLLSRSGQFK+AEEFIHDLPFDPGIGFWKALLGGCQIHSN+ELGE AAQR
Sbjct: 362 LKPEHYACMVDLLSRSGQFKRAEEFIHDLPFDPGIGFWKALLGGCQIHSNMELGELAAQR 421

Query: 424 ILALDPGDVSSYVMMSNAHSAAGKWQSVSILRREMKEKGMKRIPGCSWIEIRSKVHVFVT 483
           ILALDP DVSSYVMMSNAHSAAGKW SVSILRREMKEKGMKRIPGCSWIE RSKVHVFVT
Sbjct: 422 ILALDPSDVSSYVMMSNAHSAAGKWHSVSILRREMKEKGMKRIPGCSWIESRSKVHVFVT 481

Query: 484 GDKNHHQKDEIYSALRFFVEHLKEREDFNFLSNS 517
           GDK+HHQKDEI S L+FFVEHLKE EDFNFLS+S
Sbjct: 482 GDKSHHQKDEICSTLKFFVEHLKESEDFNFLSDS 515
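
The percentages in an HSP summary line follow directly from the residue counts: in `Identity = 462/514 (89.88%)`, 462 identical positions over an aligned length of 514 gives 89.88%. The sketch below is a minimal illustration of that arithmetic and is not part of the database output. The function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, and the pattern assumes the line layout shown on this page (it accepts both the `Postives` and `Positives` spellings).

```python
import re

# Pattern for the HSP summary line as it appears on this page.
# "Posi?tives" matches both the "Postives" and "Positives" spellings.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract residue counts from an HSP summary line and recompute percentages.

    Hypothetical helper for illustration only.
    """
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, length, _, positives, pos_length, _ = match.groups()
    identical, length = int(identical), int(length)
    positives, pos_length = int(positives), int(pos_length)
    return {
        "identical": identical,
        "positives": positives,
        "aligned_length": length,
        # Percentages are recomputed from the counts, not copied from the text.
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / pos_length, 2),
    }


if __name__ == "__main__":
    line = "Identity = 462/514 (89.88%), Postives = 484/514 (94.16%), Query Frame = 0"
    print(parse_hsp_stats(line))
```

Recomputing the percentages from the counts, rather than reading them from the parenthesised text, gives a simple consistency check on each reported hit.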

BLAST of Cla97C04G075190 vs. ExPASy TrEMBL
Match: A0A6J1CRE4 (pentatricopeptide repeat-containing protein At5g42450, mitochondrial OS=Momordica charantia OX=3673 GN=LOC111013558 PE=4 SV=1)

HSP 1 Score: 912.5 bits (2357), Expect = 7.8e-262
Identity = 445/516 (86.24%), Positives = 477/516 (92.44%), Query Frame = 0

Query: 1   MKPITIWRKSFTSSCSQLIKLKANGDGAVGTPPLQSKTAPFPRSELDARQLFDKMPHLNV 60
           MKP TIWRKS++SSC QLI    NG GA+ T PL+S   PFPR +L A Q+FDKMP LNV
Sbjct: 1   MKPTTIWRKSYSSSCRQLI----NGHGAIETLPLKSSNVPFPRGKLYASQMFDKMPQLNV 60

Query: 61  VSATAQIGRCARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSATALGDIDIGKQLHVC 120
           VSATA IGR AR++QHEE LCLFSAMFVLNLRPNEFTFGTVIHSATA+ DIDIGK++H C
Sbjct: 61  VSATAVIGRHARKNQHEEVLCLFSAMFVLNLRPNEFTFGTVIHSATAVRDIDIGKEIHAC 120

Query: 121 AIKNGLHSNVFVGSALLDLYVKFSVIEEAQTAFYDIKMPNVVSYTSLISGYLKMERIQDA 180
           A K GLHSNVFVGSALLDLY K SVIEE Q AF DIK+PNVVSYTSLISGYLKMERIQDA
Sbjct: 121 AKKIGLHSNVFVGSALLDLYAKLSVIEEVQRAFDDIKIPNVVSYTSLISGYLKMERIQDA 180

Query: 181 LRVFDEMPERNVVSWNAMIGGFSQKGHNEDAVHLFIDMLREGFLPTQSTFPCAVCAAANI 240
           L+VFDEMPERN+VSWNAMIGG SQKGHNEDAVHLFIDMLREGFLPTQSTFP A+CAAANI
Sbjct: 181 LQVFDEMPERNIVSWNAMIGGLSQKGHNEDAVHLFIDMLREGFLPTQSTFPSAICAAANI 240

Query: 241 ASIGIGRSFHACAVKFLGKLDVFVSNSLISFYAKCGSMEDSLLVFNKLLDERNIVSWNAV 300
           AS+GIGRSFHACAVKFLGKLDVFV+NSLISFYAKCGSMEDSL++FNKL+ ERNIVSWNAV
Sbjct: 241 ASLGIGRSFHACAVKFLGKLDVFVNNSLISFYAKCGSMEDSLVIFNKLV-ERNIVSWNAV 300

Query: 301 LCGYAQNGRGKEAIDFYQKMTFAGCKPNGVTLLSLLWACNHAGLVDEGYSYFNQARLDNP 360
           LCGYAQNGRGKEAI FY++M+ AG KPNGVTLLSLLWACNHAGLVDEGYSYFNQARLDNP
Sbjct: 301 LCGYAQNGRGKEAIGFYERMSSAGVKPNGVTLLSLLWACNHAGLVDEGYSYFNQARLDNP 360

Query: 361 SMLKPEHYACMVDLLSRSGQFKQAEEFIHDLPFDPGIGFWKALLGGCQIHSNVELGEFAA 420
           +MLKPEHYAC+VDLLSRSGQF++AEEFI DLPFDPGIGFWKALLGGCQIHSNVELGE AA
Sbjct: 361 NMLKPEHYACLVDLLSRSGQFRRAEEFIRDLPFDPGIGFWKALLGGCQIHSNVELGELAA 420

Query: 421 QRILALDPGDVSSYVMMSNAHSAAGKWQSVSILRREMKEKGMKRIPGCSWIEIRSKVHVF 480
           QRILALDPGDVSSYVMMSNAHSAAGKWQSVS LRREMKEKGMKR+PGCSWIEIRSKV+VF
Sbjct: 421 QRILALDPGDVSSYVMMSNAHSAAGKWQSVSTLRREMKEKGMKRVPGCSWIEIRSKVNVF 480

Query: 481 VTGDKNHHQKDEIYSALRFFVEHLKEREDFNFLSNS 517
           VTGD+NHHQKDEIYSA+RFFVEHLKE EDFN LS+S
Sbjct: 481 VTGDRNHHQKDEIYSAMRFFVEHLKESEDFNLLSDS 511

BLAST of Cla97C04G075190 vs. ExPASy TrEMBL
Match: A0A5A7U174 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold908G001620 PE=4 SV=1)

HSP 1 Score: 886.3 bits (2289), Expect = 6.0e-254
Identity = 427/501 (85.23%), Positives = 453/501 (90.42%), Query Frame = 0

Query: 1   MKPITIWRKSFTSSCSQLIKLKANGDGAVGTPPLQSKTAPFPRSELDARQLFDKMPHLNV 60
           MKPI IW K FTSSC QL K+K NG+ AVGTPPL SKT PF   +L ARQLFDKMPHLNV
Sbjct: 1   MKPIRIWIKPFTSSCRQLTKIKPNGNAAVGTPPLGSKTTPFSGGQLAARQLFDKMPHLNV 60

Query: 61  VSATAQIGRCARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSATALGDIDIGKQLHVC 120
           VSATA IG CARQHQHEEAL LFSA+ VLNL+PNEFTFGTVIHSA ALGDI  GKQLHVC
Sbjct: 61  VSATAAIGHCARQHQHEEALYLFSAILVLNLKPNEFTFGTVIHSAKALGDIQSGKQLHVC 120

Query: 121 AIKNGLHSNVFVGSALLDLYVKFSVIEEAQTAFYDIKMPNVVSYTSLISGYLKMERIQDA 180
           AIK G HS+VFVGSALLDLY K   IEEA+ AF DIKMPNVVSYTSLISGYLK+ERI DA
Sbjct: 121 AIKTGFHSDVFVGSALLDLYFKLGFIEEAERAFDDIKMPNVVSYTSLISGYLKIERIHDA 180

Query: 181 LRVFDEMPERNVVSWNAMIGGFSQKGHNEDAVHLFIDMLREGFLPTQSTFPCAVCAAANI 240
           LRVFDEMPERNVVSWN+MI GFSQKGHNEDAVHLF+DMLREG LPTQSTF CA+CAAANI
Sbjct: 181 LRVFDEMPERNVVSWNSMISGFSQKGHNEDAVHLFVDMLREGILPTQSTFTCAICAAANI 240

Query: 241 ASIGIGRSFHACAVKFLGKLDVFVSNSLISFYAKCGSMEDSLLVFNKLLDERNIVSWNAV 300
           ASIGIGRSFHACAVKF GKLDVFVSNSLISFYAKCGSMEDSLLVFNKL +ERN+VSWNA+
Sbjct: 241 ASIGIGRSFHACAVKFFGKLDVFVSNSLISFYAKCGSMEDSLLVFNKLRNERNVVSWNAI 300

Query: 301 LCGYAQNGRGKEAIDFYQKMTFAGCKPNGVTLLSLLWACNHAGLVDEGYSYFNQARLDNP 360
           L G+AQNGRGKEAIDFYQ+M  AGCKPN VT LSLLWACNHAGLVDEGYSYFNQARL NP
Sbjct: 301 LGGFAQNGRGKEAIDFYQRMILAGCKPNDVTFLSLLWACNHAGLVDEGYSYFNQARLINP 360

Query: 361 SMLKPEHYACMVDLLSRSGQFKQAEEFIHDLPFDPGIGFWKALLGGCQIHSNVELGEFAA 420
           ++LK EHYACMVDLLSRSGQFK+AEEFIHDLPF+PGIGFWKALLGGCQIHSNVELGE AA
Sbjct: 361 NLLKAEHYACMVDLLSRSGQFKRAEEFIHDLPFNPGIGFWKALLGGCQIHSNVELGELAA 420

Query: 421 QRILALDPGDVSSYVMMSNAHSAAGKWQSVSILRREMKEKGMKRIPGCSWIEIRSKVHVF 480
           QRILALDPGD SSYVMMSNAHSAAGKWQ+VS+ RREMKEKG+KRIPGCSWIEIRSK+HVF
Sbjct: 421 QRILALDPGDFSSYVMMSNAHSAAGKWQNVSLARREMKEKGLKRIPGCSWIEIRSKIHVF 480

Query: 481 VTGDKNHHQKDEIYSALRFFV 502
           V GDKNHHQKDEIYS L+  +
Sbjct: 481 VNGDKNHHQKDEIYSTLKLLL 501

BLAST of Cla97C04G075190 vs. TAIR 10
Match: AT5G42450.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 580.1 bits (1494), Expect = 1.8e-165
Identity = 282/468 (60.26%), Positives = 361/468 (77.14%), Query Frame = 0

Query: 47  DARQLFDKMPHLNVVSATAQIGRCARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSAT 106
           +A ++FD++P L+V+SATA IGR  ++ +H EA   F  +  L +RPNEFTFGTVI S+T
Sbjct: 45  NAHKVFDEIPELDVISATAVIGRFVKESRHVEASQAFKRLLCLGIRPNEFTFGTVIGSST 104

Query: 107 ALGDIDIGKQLHVCAIKNGLHSNVFVGSALLDLYVKFSVIEEAQTAFYDIKMPNVVSYTS 166
              D+ +GKQLH  A+K GL SNVFVGSA+L+ YVK S + +A+  F D + PNVVS T+
Sbjct: 105 TSRDVKLGKQLHCYALKMGLASNVFVGSAVLNCYVKLSTLTDARRCFDDTRDPNVVSITN 164

Query: 167 LISGYLKMERIQDALRVFDEMPERNVVSWNAMIGGFSQKGHNEDAVHLFIDMLREG-FLP 226
           LISGYLK    ++AL +F  MPER+VV+WNA+IGGFSQ G NE+AV+ F+DMLREG  +P
Sbjct: 165 LISGYLKKHEFEEALSLFRAMPERSVVTWNAVIGGFSQTGRNEEAVNTFVDMLREGVVIP 224

Query: 227 TQSTFPCAVCAAANIASIGIGRSFHACAVKFLGK-LDVFVSNSLISFYAKCGSMEDSLLV 286
            +STFPCA+ A +NIAS G G+S HACA+KFLGK  +VFV NSLISFY+KCG+MEDSLL 
Sbjct: 225 NESTFPCAITAISNIASHGAGKSIHACAIKFLGKRFNVFVWNSLISFYSKCGNMEDSLLA 284

Query: 287 FNKLLDE-RNIVSWNAVLCGYAQNGRGKEAIDFYQKMT-FAGCKPNGVTLLSLLWACNHA 346
           FNKL +E RNIVSWN+++ GYA NGRG+EA+  ++KM      +PN VT+L +L+ACNHA
Sbjct: 285 FNKLEEEQRNIVSWNSMIWGYAHNGRGEEAVAMFEKMVKDTNLRPNNVTILGVLFACNHA 344

Query: 347 GLVDEGYSYFNQA--RLDNPSMLKPEHYACMVDLLSRSGQFKQAEEFIHDLPFDPGIGFW 406
           GL+ EGY YFN+A    D+P++L+ EHYACMVD+LSRSG+FK+AEE I  +P DPGIGFW
Sbjct: 345 GLIQEGYMYFNKAVNDYDDPNLLELEHYACMVDMLSRSGRFKEAEELIKSMPLDPGIGFW 404

Query: 407 KALLGGCQIHSNVELGEFAAQRILALDPGDVSSYVMMSNAHSAAGKWQSVSILRREMKEK 466
           KALLGGCQIHSN  L + AA +IL LDP DVSSYVM+SNA+SA   WQ+VS++RR+MKE 
Sbjct: 405 KALLGGCQIHSNKRLAKLAASKILELDPRDVSSYVMLSNAYSAMENWQNVSLIRRKMKET 464

Query: 467 GMKRIPGCSWIEIRSKVHVFVTGDKNHHQKDEIYSALRFFVEHLKERE 509
           G+KR  GCSWIE+R ++ VFV  DKN+  KDE+Y  L    +HL+E E
Sbjct: 465 GLKRFTGCSWIEVRDQIRVFVNADKNNELKDEVYRMLALVSQHLEENE 512

BLAST of Cla97C04G075190 vs. TAIR 10
Match: AT2G13600.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 365.2 bits (936), Expect = 8.9e-101
Identity = 179/470 (38.09%), Positives = 290/470 (61.70%), Query Frame = 0

Query: 47  DARQLFDKMPHLNVVSATAQIGRCARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSAT 106
           DA+++FD+M   NVVS  + I    +     EAL +F  M    + P+E T  +VI +  
Sbjct: 205 DAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQMMLESRVEPDEVTLASVISACA 264

Query: 107 ALGDIDIGKQLHVCAIKNG-LHSNVFVGSALLDLYVKFSVIEEAQTAFYDIKMPNVVSYT 166
           +L  I +G+++H   +KN  L +++ + +A +D+Y K S I+EA+  F  + + NV++ T
Sbjct: 265 SLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIRNVIAET 324

Query: 167 SLISGYLKMERIQDALRVFDEMPERNVVSWNAMIGGFSQKGHNEDAVHLFIDMLREGFLP 226
           S+ISGY      + A  +F +M ERNVVSWNA+I G++Q G NE+A+ LF  + RE   P
Sbjct: 325 SMISGYAMAASTKAARLMFTKMAERNVVSWNALIAGYTQNGENEEALSLFCLLKRESVCP 384

Query: 227 TQSTFPCAVCAAANIASIGIGRSFHACAVKFLGKL------DVFVSNSLISFYAKCGSME 286
           T  +F   + A A++A + +G   H   +K   K       D+FV NSLI  Y KCG +E
Sbjct: 385 THYSFANILKACADLAELHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLIDMYVKCGCVE 444

Query: 287 DSLLVFNKLLDERNIVSWNAVLCGYAQNGRGKEAIDFYQKMTFAGCKPNGVTLLSLLWAC 346
           +  LVF K++ ER+ VSWNA++ G+AQNG G EA++ +++M  +G KP+ +T++ +L AC
Sbjct: 445 EGYLVFRKMM-ERDCVSWNAMIIGFAQNGYGNEALELFREMLESGEKPDHITMIGVLSAC 504

Query: 347 NHAGLVDEGYSYFNQARLDNPSMLKPEHYACMVDLLSRSGQFKQAEEFIHDLPFDPGIGF 406
            HAG V+EG  YF+    D       +HY CMVDLL R+G  ++A+  I ++P  P    
Sbjct: 505 GHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIEEMPMQPDSVI 564

Query: 407 WKALLGGCQIHSNVELGEFAAQRILALDPGDVSSYVMMSNAHSAAGKWQSVSILRREMKE 466
           W +LL  C++H N+ LG++ A+++L ++P +   YV++SN ++  GKW+ V  +R+ M++
Sbjct: 565 WGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPYVLLSNMYAELGKWEDVMNVRKSMRK 624

Query: 467 KGMKRIPGCSWIEIRSKVHVFVTGDKNHHQKDEIYSALRFFVEHLKERED 510
           +G+ + PGCSWI+I+   HVF+  DK+H +K +I+S L   +  ++  +D
Sbjct: 625 EGVTKQPGCSWIKIQGHDHVFMVKDKSHPRKKQIHSLLDILIAEMRPEQD 673

BLAST of Cla97C04G075190 vs. TAIR 10
Match: AT3G02330.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 343.2 bits (879), Expect = 3.6e-94
Identity = 177/468 (37.82%), Positives = 283/468 (60.47%), Query Frame = 0

Query: 47  DARQLFDKMPHLNVVSATAQIGRCARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSAT 106
           +A ++FD+M   + VS  A I    +  +  E L LF +M    + P+EFTFG+++ + T
Sbjct: 435 EAFRVFDEMRRRDAVSWNAIIAAHEQNGKGYETLFLFVSMLRSRIEPDEFTFGSILKACT 494

Query: 107 ALGDIDIGKQLHVCAIKNGLHSNVFVGSALLDLYVKFSVIEEAQTAFYDIKMPNVVSYTS 166
             G +  G ++H   +K+G+ SN  VG +L+D+Y K  +IEEA+      K+ +     +
Sbjct: 495 G-GSLGYGMEIHSSIVKSGMASNSSVGCSLIDMYSKCGMIEEAE------KIHSRFFQRA 554

Query: 167 LISGYLKMERIQDALRVFDEMPERNVVSWNAMIGGFSQKGHNEDAVHLFIDMLREGFLPT 226
            +SG   ME ++   ++ ++  +   VSWN++I G+  K  +EDA  LF  M+  G  P 
Sbjct: 555 NVSG--TMEELE---KMHNKRLQEMCVSWNSIISGYVMKEQSEDAQMLFTRMMEMGITPD 614

Query: 227 QSTFPCAVCAAANIASIGIGRSFHACAVKFLGKLDVFVSNSLISFYAKCGSMEDSLLVFN 286
           + T+   +   AN+AS G+G+  HA  +K   + DV++ ++L+  Y+KCG + DS L+F 
Sbjct: 615 KFTYATVLDTCANLASAGLGKQIHAQVIKKELQSDVYICSTLVDMYSKCGDLHDSRLMFE 674

Query: 287 KLLDERNIVSWNAVLCGYAQNGRGKEAIDFYQKMTFAGCKPNGVTLLSLLWACNHAGLVD 346
           K L  R+ V+WNA++CGYA +G+G+EAI  +++M     KPN VT +S+L AC H GL+D
Sbjct: 675 KSL-RRDFVTWNAMICGYAHHGKGEEAIQLFERMILENIKPNHVTFISILRACAHMGLID 734

Query: 347 EGYSYFNQARLDNPSMLKPEHYACMVDLLSRSGQFKQAEEFIHDLPFDPGIGFWKALLGG 406
           +G  YF   + D     +  HY+ MVD+L +SG+ K+A E I ++PF+     W+ LLG 
Sbjct: 735 KGLEYFYMMKRDYGLDPQLPHYSNMVDILGKSGKVKRALELIREMPFEADDVIWRTLLGV 794

Query: 407 CQIH-SNVELGEFAAQRILALDPGDVSSYVMMSNAHSAAGKWQSVSILRREMKEKGMKRI 466
           C IH +NVE+ E A   +L LDP D S+Y ++SN ++ AG W+ VS LRR M+   +K+ 
Sbjct: 795 CTIHRNNVEVAEEATAALLRLDPQDSSAYTLLSNVYADAGMWEKVSDLRRNMRGFKLKKE 854

Query: 467 PGCSWIEIRSKVHVFVTGDKNHHQKDEIYSALRFFVEHLKEREDFNFL 514
           PGCSW+E++ ++HVF+ GDK H + +EIY  L      +K  +D +F+
Sbjct: 855 PGCSWVELKDELHVFLVGDKAHPRWEEIYEELGLIYSEMKPFDDSSFV 889

BLAST of Cla97C04G075190 vs. TAIR 10
Match: AT2G03880.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 339.0 bits (868), Expect = 6.9e-93
Identity = 165/458 (36.03%), Positives = 269/458 (58.73%), Query Frame = 0

Query: 47  DARQLFDKMPHLNVVSATAQIGRCARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSAT 106
           DA QLFD+MP  NV+S T  I   ++   H++AL L   M   N+RPN +T+ +V+ S  
Sbjct: 114 DAHQLFDQMPQRNVISWTTMISAYSKCKIHQKALELLVLMLRDNVRPNVYTYSSVLRSCN 173

Query: 107 ALGDIDIGKQLHVCAIKNGLHSNVFVGSALLDLYVKFSVIEEAQTAFYDIKMPNVVSYTS 166
            + D+   + LH   IK GL S+VFV SAL+D++ K    E                   
Sbjct: 174 GMSDV---RMLHCGIIKEGLESDVFVRSALIDVFAKLGEPE------------------- 233

Query: 167 LISGYLKMERIQDALRVFDEMPERNVVSWNAMIGGFSQKGHNEDAVHLFIDMLREGFLPT 226
                       DAL VFDEM   + + WN++IGGF+Q   ++ A+ LF  M R GF+  
Sbjct: 234 ------------DALSVFDEMVTGDAIVWNSIIGGFAQNSRSDVALELFKRMKRAGFIAE 293

Query: 227 QSTFPCAVCAAANIASIGIGRSFHACAVKFLGKLDVFVSNSLISFYAKCGSMEDSLLVFN 286
           Q+T    + A   +A + +G   H   VK+    D+ ++N+L+  Y KCGS+ED+L VFN
Sbjct: 294 QATLTSVLRACTGLALLELGMQAHVHIVKY--DQDLILNNALVDMYCKCGSLEDALRVFN 353

Query: 287 KLLDERNIVSWNAVLCGYAQNGRGKEAIDFYQKMTFAGCKPNGVTLLSLLWACNHAGLVD 346
           + + ER++++W+ ++ G AQNG  +EA+  +++M  +G KPN +T++ +L+AC+HAGL++
Sbjct: 354 Q-MKERDVITWSTMISGLAQNGYSQEALKLFERMKSSGTKPNYITIVGVLFACSHAGLLE 413

Query: 347 EGYSYFNQARLDNPSMLKPEHYACMVDLLSRSGQFKQAEEFIHDLPFDPGIGFWKALLGG 406
           +G+ YF   +         EHY CM+DLL ++G+   A + ++++  +P    W+ LLG 
Sbjct: 414 DGWYYFRSMKKLYGIDPVREHYGCMIDLLGKAGKLDDAVKLLNEMECEPDAVTWRTLLGA 473

Query: 407 CQIHSNVELGEFAAQRILALDPGDVSSYVMMSNAHSAAGKWQSVSILRREMKEKGMKRIP 466
           C++  N+ L E+AA++++ALDP D  +Y ++SN ++ + KW SV  +R  M+++G+K+ P
Sbjct: 474 CRVQRNMVLAEYAAKKVIALDPEDAGTYTLLSNIYANSQKWDSVEEIRTRMRDRGIKKEP 533

Query: 467 GCSWIEIRSKVHVFVTGDKNHHQKDEIYSALRFFVEHL 505
           GCSWIE+  ++H F+ GD +H Q  E+   L   +  L
Sbjct: 534 GCSWIEVNKQIHAFIIGDNSHPQIVEVSKKLNQLIHRL 534

BLAST of Cla97C04G075190 vs. TAIR 10
Match: AT3G24000.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 334.7 bits (857), Expect = 1.3e-91
Identity = 171/462 (37.01%), Positives = 268/462 (58.01%), Query Frame = 0

Query: 47  DARQLFDKMPHLNVVSATAQIGRCARQHQHEEALCLFSAMFVLNLRPNEFTFGTVIHSAT 106
           +AR++F+KMP  + V+ T  I   ++  +  +AL  F+ M      PNEFT  +VI +A 
Sbjct: 113 EARKVFEKMPQRDFVTWTTLISGYSQHDRPCDALLFFNQMLRFGYSPNEFTLSSVIKAAA 172

Query: 107 ALGDIDIGKQLHVCAIKNGLHSNVFVGSALLDLYVKFSVIEEAQTAFYDIKMPNVVSYTS 166
           A      G QLH   +K G  SNV VGSALLDLY ++ ++++AQ                
Sbjct: 173 AERRGCCGHQLHGFCVKCGFDSNVHVGSALLDLYTRYGLMDDAQL--------------- 232

Query: 167 LISGYLKMERIQDALRVFDEMPERNVVSWNAMIGGFSQKGHNEDAVHLFIDMLREGFLPT 226
                           VFD +  RN VSWNA+I G +++   E A+ LF  MLR+GF P+
Sbjct: 233 ----------------VFDALESRNDVSWNALIAGHARRSGTEKALELFQGMLRDGFRPS 292

Query: 227 QSTFPCAVCAAANIASIGIGRSFHACAVKFLGKLDVFVSNSLISFYAKCGSMEDSLLVFN 286
             ++     A ++   +  G+  HA  +K   KL  F  N+L+  YAK GS+ D+  +F+
Sbjct: 293 HFSYASLFGACSSTGFLEQGKWVHAYMIKSGEKLVAFAGNTLLDMYAKSGSIHDARKIFD 352

Query: 287 KLLDERNIVSWNAVLCGYAQNGRGKEAIDFYQKMTFAGCKPNGVTLLSLLWACNHAGLVD 346
           +L  +R++VSWN++L  YAQ+G GKEA+ ++++M   G +PN ++ LS+L AC+H+GL+D
Sbjct: 353 RLA-KRDVVSWNSLLTAYAQHGFGKEAVWWFEEMRRVGIRPNEISFLSVLTACSHSGLLD 412

Query: 347 EGYSYFNQARLDNPSMLKPE--HYACMVDLLSRSGQFKQAEEFIHDLPFDPGIGFWKALL 406
           EG+ Y+   + D    + PE  HY  +VDLL R+G   +A  FI ++P +P    WKALL
Sbjct: 413 EGWHYYELMKKDG---IVPEAWHYVTVVDLLGRAGDLNRALRFIEEMPIEPTAAIWKALL 472

Query: 407 GGCQIHSNVELGEFAAQRILALDPGDVSSYVMMSNAHSAAGKWQSVSILRREMKEKGMKR 466
             C++H N ELG +AA+ +  LDP D   +V++ N +++ G+W   + +R++MKE G+K+
Sbjct: 473 NACRMHKNTELGAYAAEHVFELDPDDPGPHVILYNIYASGGRWNDAARVRKKMKESGVKK 532

Query: 467 IPGCSWIEIRSKVHVFVTGDKNHHQKDEIYSALRFFVEHLKE 507
            P CSW+EI + +H+FV  D+ H Q++EI       +  +KE
Sbjct: 533 EPACSWVEIENAIHMFVANDERHPQREEIARKWEEVLAKIKE 539

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_038882815.1	6.5e-279	90.12	pentatricopeptide repeat-containing protein At5g42450, mitochondrial-like [Benin...
XP_022949636.1	1.5e-275	90.66	pentatricopeptide repeat-containing protein At5g42450, mitochondrial-like [Cucur...
KAG6603678.1	9.7e-275	90.27	Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a...
XP_004142091.1	1.2e-272	89.34	pentatricopeptide repeat-containing protein At5g42450, mitochondrial [Cucumis sa...
XP_022977247.1	3.5e-272	89.88	pentatricopeptide repeat-containing protein At5g42450, mitochondrial-like [Cucur...

Match Name	E-value	Identity	Description
Q9FIH2	2.5e-164	60.26	Pentatricopeptide repeat-containing protein At5g42450, mitochondrial OS=Arabidop...
Q9SIT7	1.3e-99	38.09	Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX...
Q9FWA6	5.1e-93	37.82	Pentatricopeptide repeat-containing protein At3g02330, mitochondrial OS=Arabidop...
Q9SI53	9.6e-92	36.03	Pentatricopeptide repeat-containing protein At2g03880, mitochondrial OS=Arabidop...
Q9LIQ7	1.8e-90	37.01	Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidop...

Match Name	E-value	Identity	Description
A0A6J1GCJ3	7.3e-276	90.66	pentatricopeptide repeat-containing protein At5g42450, mitochondrial-like OS=Cuc...
A0A0A0KX53	5.8e-273	89.34	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G293300 PE=4 SV=1
A0A6J1IHX7	1.7e-272	89.88	pentatricopeptide repeat-containing protein At5g42450, mitochondrial-like OS=Cuc...
A0A6J1CRE4	7.8e-262	86.24	pentatricopeptide repeat-containing protein At5g42450, mitochondrial OS=Momordic...
A0A5A7U174	6.0e-254	85.23	Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...

Match Name	E-value	Identity	Description
AT5G42450.1	1.8e-165	60.26	Pentatricopeptide repeat (PPR) superfamily protein
AT2G13600.1	8.9e-101	38.09	Pentatricopeptide repeat (PPR) superfamily protein
AT3G02330.1	3.6e-94	37.82	Pentatricopeptide repeat (PPR) superfamily protein
AT2G03880.1	6.9e-93	36.03	Pentatricopeptide repeat (PPR) superfamily protein
AT3G24000.1	1.3e-91	37.01	Tetratricopeptide repeat (TPR)-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 46..142, e-value: 7.5E-13, score: 50.2; coord: 143..243, e-value: 2.8E-24, score: 87.4
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 363..490, e-value: 2.1E-6, score: 29.4
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 246..362, e-value: 1.7E-22, score: 82.2
IPR011990	Tetratricopeptide-like helical domain superfamily	SUPERFAMILY	48452	TPR-like	coord: 159..455
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 292..338, e-value: 1.4E-10, score: 41.2; coord: 190..230, e-value: 1.2E-10, score: 41.5
IPR002885	Pentatricopeptide repeat	PFAM	PF12854	PPR_1	coord: 158..187, e-value: 1.0E-7, score: 31.5
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 433..462, e-value: 0.66, score: 10.4; coord: 266..290, e-value: 0.018, score: 15.2
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 193..226, e-value: 3.1E-5, score: 21.8; coord: 295..328, e-value: 7.3E-9, score: 33.3; coord: 162..193, e-value: 1.2E-7, score: 29.4
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 293..327, score: 12.189023
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 430..464, score: 8.648523
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 191..225, score: 12.265752
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 160..190, score: 10.928473
None	No IPR available	PANTHER	PTHR47926:SF288	OS01G0839400 PROTEIN	coord: 45..506
None	No IPR available	PANTHER	PTHR47926	PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN	coord: 45..506

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cla97C04G075190.1	Cla97C04G075190.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
molecular_function	GO:0005515	protein binding