Spg023840 (gene) Sponge gourd (cylindrica) v1

Overview
NameSpg023840
Typegene
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionPentatricopeptide repeat-containing protein
Locationscaffold13: 10490390 .. 10492237 (+)
RNA-Seq ExpressionSpg023840
SyntenySpg023840
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTGATTTGGCCCTTGACCCACTTCGGCCGTTCCCGCCTTGTGCATTCATTTTCCTTCAACGCCCTGAAAGCCTCCGCCGCCGTGGACTCAATTCCTCGAGATACCCAATTGCACAGCCTCGTCATAAAGTCGGGATTGGCTAATGAACTGTCTGTACAAAACAAGCTTTTGAAGATTTATTTTAAGTGCAGGAATTTGGAATGTGCACGGAATCTGTTTGATGAAATGCCTATGAGAAATGTTGTGTCGTGGAATACCGTGATATGTGGGCTTGTCGATTGCGGGTATGGAGGTGAGTTTAGGGTGAGGCAGAAATCGAGTGTTTTACTTTTTAAGAAGATGTTGATGGATATGGTAGACCCAGATGGTGTGACGTTTAATGCGTTGTTTCGATCTTGTGTAGTGCTGAATGATGTTGAAAGTGGTAGTCAATTGCATGGTTTTGTGATGAAAATTGGGTTTGATTTGGATTGTTTTGTGGGGAGTGCTGTGGTTGATTTTTATGCGAAATGTGGGTTGTATGAAGATGCGAGATTAGCTTTTAGCTGCGTTCTGTATAGGGATCTGGTTTTGTGGAATGTGATGTTGTACTGTTATGTTTTTAATTGTTTGGGCAGAGAGGCGATTGAAGTCTTTTGTTTGATGCAGTTGGAAGGATGTAAAGGTGATGATTTTACATTCAGCAGCCTGCTAAGTTCGTGCAATTATAAAGGATCAGGGGAATTGGGTAAGCAGCTTCATGCTCTTCTTATTAAACAGTCATTAGATTTAGATATTGTTGTGGCAAGTTCACTTGTCAATATGTATTCCAAAAGCAATAATTTATATGATGCTCGCAAGGTTTTTGGTGAAATGCCAAATAAAAATTCTGTGTCTTGGACCACTATGATTGTGGGGTACGGGCAGCAAGATGGGAAAGAGGCAGTGAAACTTTTCAGGAGCATGTTTCGGGAAGATTATCGTCCGGATGAGTTAACTTTTGCTAGTGTGCTGAGTTCGTGTGGCTTTACATCTGGGGCTTCTGAACTGATGCAAGTTCATTCCTGCTTGATAAAGTTTGGTTTTGAAGCATTTTTGTCGATTAAGAACGGTTTGGTAAATGCATATTCAAAGTGTGGTATCATTTCTGCAGCGTCACGATGCTTTAGATTAATTGCAAAACCAGACTTGGTAACATGGACATCAATTATATGTGGACTAGCATTTTGTGGCTTTGAGAAGGATGCTGTTGAGTTCTTTGAGAAGATGTTATCTTATGGCATTAGACCAGATAGAATTGCGTTTCTTGGAGTTCTCTCTGCCTGTAGTCATGGAGGATTAGTAAACACGGGGCTCCACTACTTCAACTTAATGACGAATCAGTACCAAATTGTTCCTGATTCAGAGCATTTAACATGCCTGATCGACCTTATCAGTAGAGCGGGTAGTCTAGACGAGGCTTTTTATCTTTTGAAATCAATGCCGAAGGAAGCTGGACCGGACGCTCTCGGGGCATTTATTCGGGCATGTAGGACTCATGGGAACTTGAGATTAGCAAAATGGGCAATGGAACTTGCATCAGAGATGAATAAACCTGTGAATTATTCTCTAATGTCGAATATTTTTGCTTTTGAAGGAAGATGGTCAGATGTGGCGAGAATGCGCAAATTGATGAAGGATAGTTGTGAACGGAAAGCCCCTGGCTGTAGTTGGATAGAAATTGCTGGTTATATTCATTTGTTTGTATCAAGTGATAGATCTCACCCCCAGTCTTTAGATCTCTACACAATGTTAGGATTATTACTAAACACAATGAAGAAAGATTACATTTCCACGATATCCGAGGTAGATATTGCACCTGAATGA

mRNA sequence

ATGCTGATTTGGCCCTTGACCCACTTCGGCCGTTCCCGCCTTGTGCATTCATTTTCCTTCAACGCCCTGAAAGCCTCCGCCGCCGTGGACTCAATTCCTCGAGATACCCAATTGCACAGCCTCGTCATAAAGTCGGGATTGGCTAATGAACTGTCTGTACAAAACAAGCTTTTGAAGATTTATTTTAAGTGCAGGAATTTGGAATGTGCACGGAATCTGTTTGATGAAATGCCTATGAGAAATGTTGTGTCGTGGAATACCGTGATATGTGGGCTTGTCGATTGCGGGTATGGAGGTGAGTTTAGGGTGAGGCAGAAATCGAGTGTTTTACTTTTTAAGAAGATGTTGATGGATATGGTAGACCCAGATGGTGTGACGTTTAATGCGTTGTTTCGATCTTGTGTAGTGCTGAATGATGTTGAAAGTGGTAGTCAATTGCATGGTTTTGTGATGAAAATTGGGTTTGATTTGGATTGTTTTGTGGGGAGTGCTGTGGTTGATTTTTATGCGAAATGTGGGTTGTATGAAGATGCGAGATTAGCTTTTAGCTGCGTTCTGTATAGGGATCTGGTTTTGTGGAATGTGATGTTGTACTGTTATGTTTTTAATTGTTTGGGCAGAGAGGCGATTGAAGTCTTTTGTTTGATGCAGTTGGAAGGATGTAAAGGTGATGATTTTACATTCAGCAGCCTGCTAAGTTCGTGCAATTATAAAGGATCAGGGGAATTGGGTAAGCAGCTTCATGCTCTTCTTATTAAACAGTCATTAGATTTAGATATTGTTGTGGCAAGTTCACTTGTCAATATGTATTCCAAAAGCAATAATTTATATGATGCTCGCAAGGTTTTTGGTGAAATGCCAAATAAAAATTCTGTGTCTTGGACCACTATGATTGTGGGGTACGGGCAGCAAGATGGGAAAGAGGCAGTGAAACTTTTCAGGAGCATGTTTCGGGAAGATTATCGTCCGGATGAGTTAACTTTTGCTAGTGTGCTGAGTTCGTGTGGCTTTACATCTGGGGCTTCTGAACTGATGCAAGTTCATTCCTGCTTGATAAAGTTTGGTTTTGAAGCATTTTTGTCGATTAAGAACGGTTTGGTAAATGCATATTCAAAGTGTGGTATCATTTCTGCAGCGTCACGATGCTTTAGATTAATTGCAAAACCAGACTTGGTAACATGGACATCAATTATATGTGGACTAGCATTTTGTGGCTTTGAGAAGGATGCTGTTGAGTTCTTTGAGAAGATGTTATCTTATGGCATTAGACCAGATAGAATTGCGTTTCTTGGAGTTCTCTCTGCCTGTAGTCATGGAGGATTAGTAAACACGGGGCTCCACTACTTCAACTTAATGACGAATCAGTACCAAATTGTTCCTGATTCAGAGCATTTAACATGCCTGATCGACCTTATCAGTAGAGCGGGTAGTCTAGACGAGGCTTTTTATCTTTTGAAATCAATGCCGAAGGAAGCTGGACCGGACGCTCTCGGGGCATTTATTCGGGCATGTAGGACTCATGGGAACTTGAGATTAGCAAAATGGGCAATGGAACTTGCATCAGAGATGAATAAACCTGTGAATTATTCTCTAATGTCGAATATTTTTGCTTTTGAAGGAAGATGGTCAGATGTGGCGAGAATGCGCAAATTGATGAAGGATAGTTGTGAACGGAAAGCCCCTGGCTGTAGTTGGATAGAAATTGCTGGTTATATTCATTTGTTTGTATCAAGTGATAGATCTCACCCCCAGTCTTTAGATCTCTACACAATGTTAGGATTATTACTAAACACAATGAAGAAAGATTACATTTCCACGATATCCGAGGTAGATATTGCACCTGAATGA

Coding sequence (CDS)

ATGCTGATTTGGCCCTTGACCCACTTCGGCCGTTCCCGCCTTGTGCATTCATTTTCCTTCAACGCCCTGAAAGCCTCCGCCGCCGTGGACTCAATTCCTCGAGATACCCAATTGCACAGCCTCGTCATAAAGTCGGGATTGGCTAATGAACTGTCTGTACAAAACAAGCTTTTGAAGATTTATTTTAAGTGCAGGAATTTGGAATGTGCACGGAATCTGTTTGATGAAATGCCTATGAGAAATGTTGTGTCGTGGAATACCGTGATATGTGGGCTTGTCGATTGCGGGTATGGAGGTGAGTTTAGGGTGAGGCAGAAATCGAGTGTTTTACTTTTTAAGAAGATGTTGATGGATATGGTAGACCCAGATGGTGTGACGTTTAATGCGTTGTTTCGATCTTGTGTAGTGCTGAATGATGTTGAAAGTGGTAGTCAATTGCATGGTTTTGTGATGAAAATTGGGTTTGATTTGGATTGTTTTGTGGGGAGTGCTGTGGTTGATTTTTATGCGAAATGTGGGTTGTATGAAGATGCGAGATTAGCTTTTAGCTGCGTTCTGTATAGGGATCTGGTTTTGTGGAATGTGATGTTGTACTGTTATGTTTTTAATTGTTTGGGCAGAGAGGCGATTGAAGTCTTTTGTTTGATGCAGTTGGAAGGATGTAAAGGTGATGATTTTACATTCAGCAGCCTGCTAAGTTCGTGCAATTATAAAGGATCAGGGGAATTGGGTAAGCAGCTTCATGCTCTTCTTATTAAACAGTCATTAGATTTAGATATTGTTGTGGCAAGTTCACTTGTCAATATGTATTCCAAAAGCAATAATTTATATGATGCTCGCAAGGTTTTTGGTGAAATGCCAAATAAAAATTCTGTGTCTTGGACCACTATGATTGTGGGGTACGGGCAGCAAGATGGGAAAGAGGCAGTGAAACTTTTCAGGAGCATGTTTCGGGAAGATTATCGTCCGGATGAGTTAACTTTTGCTAGTGTGCTGAGTTCGTGTGGCTTTACATCTGGGGCTTCTGAACTGATGCAAGTTCATTCCTGCTTGATAAAGTTTGGTTTTGAAGCATTTTTGTCGATTAAGAACGGTTTGGTAAATGCATATTCAAAGTGTGGTATCATTTCTGCAGCGTCACGATGCTTTAGATTAATTGCAAAACCAGACTTGGTAACATGGACATCAATTATATGTGGACTAGCATTTTGTGGCTTTGAGAAGGATGCTGTTGAGTTCTTTGAGAAGATGTTATCTTATGGCATTAGACCAGATAGAATTGCGTTTCTTGGAGTTCTCTCTGCCTGTAGTCATGGAGGATTAGTAAACACGGGGCTCCACTACTTCAACTTAATGACGAATCAGTACCAAATTGTTCCTGATTCAGAGCATTTAACATGCCTGATCGACCTTATCAGTAGAGCGGGTAGTCTAGACGAGGCTTTTTATCTTTTGAAATCAATGCCGAAGGAAGCTGGACCGGACGCTCTCGGGGCATTTATTCGGGCATGTAGGACTCATGGGAACTTGAGATTAGCAAAATGGGCAATGGAACTTGCATCAGAGATGAATAAACCTGTGAATTATTCTCTAATGTCGAATATTTTTGCTTTTGAAGGAAGATGGTCAGATGTGGCGAGAATGCGCAAATTGATGAAGGATAGTTGTGAACGGAAAGCCCCTGGCTGTAGTTGGATAGAAATTGCTGGTTATATTCATTTGTTTGTATCAAGTGATAGATCTCACCCCCAGTCTTTAGATCTCTACACAATGTTAGGATTATTACTAAACACAATGAAGAAAGATTACATTTCCACGATATCCGAGGTAGATATTGCACCTGAATGA

Protein sequence

MLIWPLTHFGRSRLVHSFSFNALKASAAVDSIPRDTQLHSLVIKSGLANELSVQNKLLKIYFKCRNLECARNLFDEMPMRNVVSWNTVICGLVDCGYGGEFRVRQKSSVLLFKKMLMDMVDPDGVTFNALFRSCVVLNDVESGSQLHGFVMKIGFDLDCFVGSAVVDFYAKCGLYEDARLAFSCVLYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGCKGDDFTFSSLLSSCNYKGSGELGKQLHALLIKQSLDLDIVVASSLVNMYSKSNNLYDARKVFGEMPNKNSVSWTTMIVGYGQQDGKEAVKLFRSMFREDYRPDELTFASVLSSCGFTSGASELMQVHSCLIKFGFEAFLSIKNGLVNAYSKCGIISAASRCFRLIAKPDLVTWTSIICGLAFCGFEKDAVEFFEKMLSYGIRPDRIAFLGVLSACSHGGLVNTGLHYFNLMTNQYQIVPDSEHLTCLIDLISRAGSLDEAFYLLKSMPKEAGPDALGAFIRACRTHGNLRLAKWAMELASEMNKPVNYSLMSNIFAFEGRWSDVARMRKLMKDSCERKAPGCSWIEIAGYIHLFVSSDRSHPQSLDLYTMLGLLLNTMKKDYISTISEVDIAPE
Homology
BLAST of Spg023840 vs. NCBI nr
Match: XP_038874466.1 (pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 [Benincasa hispida])

HSP 1 Score: 1051.2 bits (2717), Expect = 3.5e-303
Identity = 519/607 (85.50%), Postives = 549/607 (90.44%), Query Frame = 0

Query: 1   MLIWPLTHFGRSRLVHSFSFNALKASAAVDSIPRDTQLHSLVIKSGLANELSVQNKLLKI 60
           MLIWP THFGRSRLVHSFSFN LKA+AAV+SIPR TQLHS  IK GLANELSVQNKLLKI
Sbjct: 1   MLIWPSTHFGRSRLVHSFSFNVLKAAAAVNSIPRGTQLHSHFIKLGLANELSVQNKLLKI 60

Query: 61  YFKCRNLECARNLFDEMPMRNVVSWNTVICGLVDCGYGGEFRVRQKSSVLLFKKMLMDMV 120
           Y KCRNLE ARNLFDEMP RNVVSWNTVICGLV+CGYGGEF+VRQ S    FKKMLM +V
Sbjct: 61  YVKCRNLESARNLFDEMPRRNVVSWNTVICGLVNCGYGGEFKVRQHSIFSYFKKMLMGLV 120

Query: 121 DPDGVTFNALFRSCVVLNDVESGSQLHGFVMKIGFDLDCFVGSAVVDFYAKCGLYEDARL 180
           DPDGVTFN LFRSCVVLNDVESG QLHGFVMKIGFDLDCFVGSAVV FYAKCGLYEDARL
Sbjct: 121 DPDGVTFNGLFRSCVVLNDVESGRQLHGFVMKIGFDLDCFVGSAVVAFYAKCGLYEDARL 180

Query: 181 AFSCVLYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGCKGDDFTFSSLLSSCNYKGS 240
           AFSC+LYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEG KGDDFTFSSLLSSC YKGS
Sbjct: 181 AFSCILYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGS 240

Query: 241 GELGKQLHALLIKQSLDLDIVVASSLVNMYSKSNNLYDARKVFGEMPNKNSVSWTTMIVG 300
           GELGKQLH LLIKQS DLDI+VASSLVN+Y+K++NLYDARKVF EMP++NSVSWTTMIVG
Sbjct: 241 GELGKQLHGLLIKQSFDLDILVASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVG 300

Query: 301 YGQQ-DGKEAVKLFRSMFREDYRPDELTFASVLSSCGFTSGASELMQVHSCLIKFGFEAF 360
           +GQQ DGKEAVKLFR MF EDY PDELTFASVLSSCG TSGA EL QVHSCLIK GFEAF
Sbjct: 301 HGQQEDGKEAVKLFRRMFGEDYYPDELTFASVLSSCGLTSGACELKQVHSCLIKLGFEAF 360

Query: 361 LSIKNGLVNAYSKCGIISAASRCFRLIAKPDLVTWTSIICGLAFCGFEKDAVEFFEKMLS 420
            SI NGL+NAYSKCGIISAA +CFRLIA+PDLVTWTS ICGLA CG EK+A+E F+KMLS
Sbjct: 361 SSINNGLINAYSKCGIISAALQCFRLIAEPDLVTWTSTICGLALCGLEKNAIELFDKMLS 420

Query: 421 YGIRPDRIAFLGVLSACSHGGLVNTGLHYFNLMTNQYQIVPDSEHLTCLIDLISRAGSLD 480
           Y IRPD+IAFLGVLSACSHGG V+ GLHYFNLMTNQYQIVPDSEHLTCLIDL+ RAGSLD
Sbjct: 421 YAIRPDKIAFLGVLSACSHGGFVSMGLHYFNLMTNQYQIVPDSEHLTCLIDLLGRAGSLD 480

Query: 481 EAFYLLKSMPKEAGPDALGAFIRACRTHGNLRLAKWAMELASEMNKPVNYSLMSNIFAFE 540
           EAF LLKSMP  AGPDA  AFIRACRTHGNLRLAKWAME ASE N+ VNYSL+SN++A E
Sbjct: 481 EAFDLLKSMP--AGPDAFRAFIRACRTHGNLRLAKWAMEFASEPNEQVNYSLVSNMYASE 540

Query: 541 GRWSDVARMRKLMKDSCERKAPGCSWIEIAGYIHLFVSSDRSHPQSLDLYTMLGLLLNTM 600
           GRWSDVARMRKLMKDSC+RKAPG SW+EIAGY HLFVS DRSHP+SLDLY MLGLLLNTM
Sbjct: 541 GRWSDVARMRKLMKDSCDRKAPGFSWVEIAGYNHLFVSGDRSHPESLDLYAMLGLLLNTM 600

Query: 601 KKDYIST 607
           K D  ST
Sbjct: 601 KMDNKST 605

BLAST of Spg023840 vs. NCBI nr
Match: KAG6575187.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1025.8 bits (2651), Expect = 1.6e-295
Identity = 504/616 (81.82%), Postives = 547/616 (88.80%), Query Frame = 0

Query: 1   MLIWPLTHFGRSRLVHSFSFNALKASAAVDSIPRDTQLHSLVIKSGLANELSVQNKLLKI 60
           MLIWP THFG  RLVHSFSFN LKA+A ++SIPR T+LHSLVIK GLANELSVQNKLLKI
Sbjct: 1   MLIWPSTHFGCCRLVHSFSFNVLKAAADLNSIPRGTKLHSLVIKLGLANELSVQNKLLKI 60

Query: 61  YFKCRNLECARNLFDEMPMRNVVSWNTVICGLVDCGYGGEFRVRQKSSVLLFKKMLMDMV 120
           Y KCR+L  ARNLFDEM  RNVVSWNTVICG+V+CGYGGEF++R++S +  FK MLMDMV
Sbjct: 61  YVKCRDLGRARNLFDEMRRRNVVSWNTVICGVVNCGYGGEFKMRERSILSCFKNMLMDMV 120

Query: 121 DPDGVTFNALFRSCVVLNDVESGSQLHGFVMKIGFDLDCFVGSAVVDFYAKCGLYEDARL 180
           DPDGVTFN LFRSC V+NDV SG QLHGFV+KIGFDLDCFVGSAVVDFYAKCGLYEDARL
Sbjct: 121 DPDGVTFNGLFRSCDVMNDVGSGKQLHGFVIKIGFDLDCFVGSAVVDFYAKCGLYEDARL 180

Query: 181 AFSCVLYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGCKGDDFTFSSLLSSCNYKGS 240
           AF+ VLY+DLVLWNVMLYCYVFNCL +EAI++F LMQLEG  GDDFTFSSLLSSC YKGS
Sbjct: 181 AFTSVLYKDLVLWNVMLYCYVFNCLAKEAIDIFLLMQLEGFTGDDFTFSSLLSSCKYKGS 240

Query: 241 GELGKQLHALLIKQSLDLDIVVASSLVNMYSKSNNLYDARKVFGEMPNKNSVSWTTMIVG 300
           GELGKQLH  LIK S DLDI+VASSLVNMY+K+N+LYDARKVF EMP +NSVSWTTMIVG
Sbjct: 241 GELGKQLHVHLIKHSFDLDILVASSLVNMYAKNNHLYDARKVFDEMPIRNSVSWTTMIVG 300

Query: 301 YGQQD-GKEAVKLFRSMFREDYRPDELTFASVLSSCGFTSGASELMQVHSCLIKFGFEAF 360
           YGQQ+ GKEAVKL R MF EDY PDELTFASVLSSCGFTSGASEL+QVHSCLIK GFEAF
Sbjct: 301 YGQQEHGKEAVKLLRRMFEEDYYPDELTFASVLSSCGFTSGASELIQVHSCLIKLGFEAF 360

Query: 361 LSIKNGLVNAYSKCGIISAASRCFRLIAKPDLVTWTSIICGLAFCGFEKDAVEFFEKMLS 420
           LS+ NGL+NAYSKCG IS A +CFRLIA+PDLV+WTSIICGLAFCG EKDAVE F+KMLS
Sbjct: 361 LSVNNGLINAYSKCGAISPALQCFRLIAEPDLVSWTSIICGLAFCGVEKDAVELFDKMLS 420

Query: 421 YGIRPDRIAFLGVLSACSHGGLVNTGLHYFNLMTNQYQIVPDSEHLTCLIDLISRAGSLD 480
            GIRPD+IAFLGVLSAC+HGG VN GLHYFNLMTN+YQIVPDSEHLTCLIDLI RAGSLD
Sbjct: 421 QGIRPDKIAFLGVLSACNHGGFVNMGLHYFNLMTNEYQIVPDSEHLTCLIDLIGRAGSLD 480

Query: 481 EAFYLLKSMPKEAGPDALGAFIRACRTHGNLRLAKWAMELASEMNKPVNYSLMSNIFAFE 540
           EAF LLKS+ +EAGPDA  +FIRACRTHG LRLAKWAME AS+  KPVN SLMSN++A E
Sbjct: 481 EAFKLLKSVSEEAGPDAFRSFIRACRTHGRLRLAKWAMEFASDPCKPVNSSLMSNMYASE 540

Query: 541 GRWSDVARMRKLMKDSCERKAPGCSWIEIAGYIHLFVSSDRSHPQSLDLYTMLGLLLNTM 600
           GRWSDVARMRKL+KDSCE K PG SWIEIAGY HLFVSSDRSHPQS DLY MLGLLLNTM
Sbjct: 541 GRWSDVARMRKLLKDSCEPKVPGFSWIEIAGYNHLFVSSDRSHPQSSDLYEMLGLLLNTM 600

Query: 601 KKDYISTISEVDIAPE 616
           KKDY S  S +DI PE
Sbjct: 601 KKDYKSIASNIDIEPE 616

BLAST of Spg023840 vs. NCBI nr
Match: XP_022958961.1 (pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 [Cucurbita moschata] >XP_022958962.1 pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 [Cucurbita moschata] >XP_022958963.1 pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 [Cucurbita moschata])

HSP 1 Score: 1025.4 bits (2650), Expect = 2.0e-295
Identity = 505/616 (81.98%), Postives = 545/616 (88.47%), Query Frame = 0

Query: 1   MLIWPLTHFGRSRLVHSFSFNALKASAAVDSIPRDTQLHSLVIKSGLANELSVQNKLLKI 60
           MLIWP THFG  RLVHSFSFN LKA+A ++SIPR T+LHSLVIK GLANELSVQNKLLKI
Sbjct: 1   MLIWPSTHFGCCRLVHSFSFNVLKAAADLNSIPRGTKLHSLVIKLGLANELSVQNKLLKI 60

Query: 61  YFKCRNLECARNLFDEMPMRNVVSWNTVICGLVDCGYGGEFRVRQKSSVLLFKKMLMDMV 120
           Y KCR+L  ARNLFDEM  RNVVSWNTVICG+V+CGYGGEF++R++S +  FK MLMDMV
Sbjct: 61  YVKCRDLGRARNLFDEMRRRNVVSWNTVICGVVNCGYGGEFKMRERSILSCFKNMLMDMV 120

Query: 121 DPDGVTFNALFRSCVVLNDVESGSQLHGFVMKIGFDLDCFVGSAVVDFYAKCGLYEDARL 180
           DPDGVTFN LFRSC V+NDV SG QLHGFV+KIGFDLDCFVGSAVVDFYAKCGLYEDARL
Sbjct: 121 DPDGVTFNGLFRSCDVMNDVGSGKQLHGFVIKIGFDLDCFVGSAVVDFYAKCGLYEDARL 180

Query: 181 AFSCVLYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGCKGDDFTFSSLLSSCNYKGS 240
           AFS VLY+DLVLWNVMLYCYVFNCL +EAIE+F LMQLEG  GDDFTFSSLLSSC YKGS
Sbjct: 181 AFSSVLYKDLVLWNVMLYCYVFNCLAKEAIEIFFLMQLEGFTGDDFTFSSLLSSCKYKGS 240

Query: 241 GELGKQLHALLIKQSLDLDIVVASSLVNMYSKSNNLYDARKVFGEMPNKNSVSWTTMIVG 300
           GELGKQLH  LIK S DLDI+VASSLVNMY+K+N+LYDARK F EMP +NSVSWTTMIVG
Sbjct: 241 GELGKQLHVHLIKHSFDLDILVASSLVNMYAKNNHLYDARKAFDEMPIRNSVSWTTMIVG 300

Query: 301 YGQQD-GKEAVKLFRSMFREDYRPDELTFASVLSSCGFTSGASELMQVHSCLIKFGFEAF 360
           YGQQ+ GKEAVKL R MF EDY PDELTFASVLSSCGFTSGASEL+QVHSCLIK GFEAF
Sbjct: 301 YGQQEHGKEAVKLLRRMFEEDYYPDELTFASVLSSCGFTSGASELIQVHSCLIKLGFEAF 360

Query: 361 LSIKNGLVNAYSKCGIISAASRCFRLIAKPDLVTWTSIICGLAFCGFEKDAVEFFEKMLS 420
           LS+ NGL+NAYSKCG IS A RCFRLIA+PDLV+WTSIICG AFCG EK AVE F+KMLS
Sbjct: 361 LSVNNGLINAYSKCGTISPALRCFRLIAEPDLVSWTSIICGFAFCGLEKAAVELFDKMLS 420

Query: 421 YGIRPDRIAFLGVLSACSHGGLVNTGLHYFNLMTNQYQIVPDSEHLTCLIDLISRAGSLD 480
            GIRPD+IAFLGVLSACSHGG VN GLHYFNLMTN+YQIVPDSEHLTCLIDLI RAGSLD
Sbjct: 421 QGIRPDKIAFLGVLSACSHGGFVNMGLHYFNLMTNEYQIVPDSEHLTCLIDLIGRAGSLD 480

Query: 481 EAFYLLKSMPKEAGPDALGAFIRACRTHGNLRLAKWAMELASEMNKPVNYSLMSNIFAFE 540
           EAF LLKS+ +EAGPDA  +FIRACRTHG LRLAKWAME AS+  KPVN SLMSN++A E
Sbjct: 481 EAFKLLKSVSEEAGPDAFRSFIRACRTHGRLRLAKWAMEFASDPCKPVNSSLMSNMYASE 540

Query: 541 GRWSDVARMRKLMKDSCERKAPGCSWIEIAGYIHLFVSSDRSHPQSLDLYTMLGLLLNTM 600
           GRWSDVARMRKL+KDSCE K PG SWIEIAGY HLFVSSDRSHPQS DLY MLGLLLNT+
Sbjct: 541 GRWSDVARMRKLLKDSCEPKVPGFSWIEIAGYNHLFVSSDRSHPQSSDLYEMLGLLLNTV 600

Query: 601 KKDYISTISEVDIAPE 616
           KKDY ST S +DI PE
Sbjct: 601 KKDYKSTASNIDIEPE 616

BLAST of Spg023840 vs. NCBI nr
Match: KAG7013750.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1022.3 bits (2642), Expect = 1.7e-294
Identity = 505/616 (81.98%), Postives = 544/616 (88.31%), Query Frame = 0

Query: 1   MLIWPLTHFGRSRLVHSFSFNALKASAAVDSIPRDTQLHSLVIKSGLANELSVQNKLLKI 60
           MLIWP THFG  RLVHSFSFN LKA+A V+SIPR TQLHSLVIK GLANEL VQNKLLKI
Sbjct: 1   MLIWPSTHFGCCRLVHSFSFNVLKAAADVNSIPRGTQLHSLVIKLGLANELFVQNKLLKI 60

Query: 61  YFKCRNLECARNLFDEMPMRNVVSWNTVICGLVDCGYGGEFRVRQKSSVLLFKKMLMDMV 120
           Y KCR+L  ARNLFDEM  RNVVSWNTVICG+VDCGYG EF++R++S +  FK MLMDMV
Sbjct: 61  YVKCRDLGRARNLFDEMRRRNVVSWNTVICGVVDCGYGDEFKMRERSILSCFKNMLMDMV 120

Query: 121 DPDGVTFNALFRSCVVLNDVESGSQLHGFVMKIGFDLDCFVGSAVVDFYAKCGLYEDARL 180
           DPDGVTFN LFRSC V+NDV SG QLHGFV+KIGFDLDCFVGSAVVDFYAKCGLYEDARL
Sbjct: 121 DPDGVTFNGLFRSCDVMNDVGSGKQLHGFVIKIGFDLDCFVGSAVVDFYAKCGLYEDARL 180

Query: 181 AFSCVLYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGCKGDDFTFSSLLSSCNYKGS 240
           AF+ VLY+DLVLWNVMLYCYVFN L +EAI++F LMQLEG  GDDFTFSSLLSSC YKGS
Sbjct: 181 AFTSVLYKDLVLWNVMLYCYVFNFLAKEAIDIFLLMQLEGFTGDDFTFSSLLSSCKYKGS 240

Query: 241 GELGKQLHALLIKQSLDLDIVVASSLVNMYSKSNNLYDARKVFGEMPNKNSVSWTTMIVG 300
           GELGKQLH  LIK S DLDI+VASSLVNMY+K+N+LYDARKVF EMP +NSVSWTTMIVG
Sbjct: 241 GELGKQLHVHLIKHSFDLDILVASSLVNMYAKNNHLYDARKVFDEMPIRNSVSWTTMIVG 300

Query: 301 YGQQD-GKEAVKLFRSMFREDYRPDELTFASVLSSCGFTSGASELMQVHSCLIKFGFEAF 360
           YGQQ+ GKEAVKL R MF EDY PDELTFASVLSSCGFTSGASEL+QVHSCLIK GFEAF
Sbjct: 301 YGQQEHGKEAVKLLRRMFEEDYYPDELTFASVLSSCGFTSGASELIQVHSCLIKLGFEAF 360

Query: 361 LSIKNGLVNAYSKCGIISAASRCFRLIAKPDLVTWTSIICGLAFCGFEKDAVEFFEKMLS 420
           LS+ NGL+NAYSKCG IS A +CFRLIA+PDLV+WTSIICGLAFCG EKDAVE F+KMLS
Sbjct: 361 LSVNNGLINAYSKCGAISPALQCFRLIAEPDLVSWTSIICGLAFCGVEKDAVELFDKMLS 420

Query: 421 YGIRPDRIAFLGVLSACSHGGLVNTGLHYFNLMTNQYQIVPDSEHLTCLIDLISRAGSLD 480
            GIRPD+IAFLGVLSACSHGG VN GLHYFNLMTN+YQIVPDSEHLTCLIDLI RAGSLD
Sbjct: 421 QGIRPDKIAFLGVLSACSHGGFVNMGLHYFNLMTNEYQIVPDSEHLTCLIDLIGRAGSLD 480

Query: 481 EAFYLLKSMPKEAGPDALGAFIRACRTHGNLRLAKWAMELASEMNKPVNYSLMSNIFAFE 540
           EAF LLKS+ +EAGPDA  +FIRACRTHG LRLAKWAME AS+  KPVN SLMSN++A E
Sbjct: 481 EAFKLLKSVSEEAGPDAFRSFIRACRTHGRLRLAKWAMEFASDPCKPVNSSLMSNMYASE 540

Query: 541 GRWSDVARMRKLMKDSCERKAPGCSWIEIAGYIHLFVSSDRSHPQSLDLYTMLGLLLNTM 600
           GRWSDVARMRKL+KDSCE K PG SWIEIAGY HLFVSSDRSHPQS DLY MLGLLLNTM
Sbjct: 541 GRWSDVARMRKLLKDSCEPKVPGFSWIEIAGYNHLFVSSDRSHPQSSDLYEMLGLLLNTM 600

Query: 601 KKDYISTISEVDIAPE 616
           KKDY S  S +DI PE
Sbjct: 601 KKDYKSIASNIDIEPE 616

BLAST of Spg023840 vs. NCBI nr
Match: XP_023006538.1 (pentatricopeptide repeat-containing protein At2g46050, mitochondrial [Cucurbita maxima])

HSP 1 Score: 1021.1 bits (2639), Expect = 3.8e-294
Identity = 505/616 (81.98%), Postives = 544/616 (88.31%), Query Frame = 0

Query: 1   MLIWPLTHFGRSRLVHSFSFNALKASAAVDSIPRDTQLHSLVIKSGLANELSVQNKLLKI 60
           MLIWP THFG SRLVHSFSFN LKA+A V+SIPR TQLHSLVIK GLANELSVQNKLLKI
Sbjct: 1   MLIWPSTHFGCSRLVHSFSFNVLKAAADVNSIPRGTQLHSLVIKLGLANELSVQNKLLKI 60

Query: 61  YFKCRNLECARNLFDEMPMRNVVSWNTVICGLVDCGYGGEFRVRQKSSVLLFKKMLMDMV 120
           Y KCR+L  A NLFDEM  RNVVSWNTVICG+VDCGYGGEF++R++S++  FK MLM+MV
Sbjct: 61  YVKCRDLGRAWNLFDEMRRRNVVSWNTVICGVVDCGYGGEFKMRERSNLSCFKNMLMEMV 120

Query: 121 DPDGVTFNALFRSCVVLNDVESGSQLHGFVMKIGFDLDCFVGSAVVDFYAKCGLYEDARL 180
           DPDGVTFN LFRSC V+NDV SG QLHGFV+K GFDLDCFVGSAVVDFYAKCGLYEDARL
Sbjct: 121 DPDGVTFNGLFRSCDVMNDVGSGKQLHGFVIKFGFDLDCFVGSAVVDFYAKCGLYEDARL 180

Query: 181 AFSCVLYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGCKGDDFTFSSLLSSCNYKGS 240
           AFS VLY+DLVLWNVMLYCYVFNCL  EAIE+F LMQLEG  GDDFTFSSLLSSC YKGS
Sbjct: 181 AFSSVLYKDLVLWNVMLYCYVFNCLAEEAIEIFFLMQLEGFTGDDFTFSSLLSSCKYKGS 240

Query: 241 GELGKQLHALLIKQSLDLDIVVASSLVNMYSKSNNLYDARKVFGEMPNKNSVSWTTMIVG 300
           GELG QLH  LIK S DLDI+VASSLVNMY+K+N+LYDARKVF EMP +NSVSWTTMIVG
Sbjct: 241 GELGMQLHVHLIKHSFDLDILVASSLVNMYAKNNHLYDARKVFDEMPIRNSVSWTTMIVG 300

Query: 301 YGQQD-GKEAVKLFRSMFREDYRPDELTFASVLSSCGFTSGASELMQVHSCLIKFGFEAF 360
           YGQQ+ GKEAVKL R M  EDY PDELTFASVLSSCGFTSGASEL+QVHSCLIK GFEAF
Sbjct: 301 YGQQEHGKEAVKLLRRMLEEDYYPDELTFASVLSSCGFTSGASELIQVHSCLIKLGFEAF 360

Query: 361 LSIKNGLVNAYSKCGIISAASRCFRLIAKPDLVTWTSIICGLAFCGFEKDAVEFFEKMLS 420
           LS+ NGL+NAYSKCG IS+A RCFRLIA+PDLV+ TSIICGLAFCG EKDAVE F+KMLS
Sbjct: 361 LSVNNGLINAYSKCGAISSALRCFRLIAEPDLVSRTSIICGLAFCGVEKDAVELFDKMLS 420

Query: 421 YGIRPDRIAFLGVLSACSHGGLVNTGLHYFNLMTNQYQIVPDSEHLTCLIDLISRAGSLD 480
            GIRPD+IAFLGVLSACSHGG  N GLHYFNLMTN+YQIVPDSEHLTCLIDL+ RAGSLD
Sbjct: 421 QGIRPDKIAFLGVLSACSHGGYANMGLHYFNLMTNEYQIVPDSEHLTCLIDLLGRAGSLD 480

Query: 481 EAFYLLKSMPKEAGPDALGAFIRACRTHGNLRLAKWAMELASEMNKPVNYSLMSNIFAFE 540
           EAF LLKS+ ++AGPDA  +FIRACRTHG+LRLAKWAME AS+  KPVN SLMSNI+A E
Sbjct: 481 EAFKLLKSVSEKAGPDAFRSFIRACRTHGHLRLAKWAMEFASDPYKPVNCSLMSNIYASE 540

Query: 541 GRWSDVARMRKLMKDSCERKAPGCSWIEIAGYIHLFVSSDRSHPQSLDLYTMLGLLLNTM 600
           GRWSDVARMRKLMKDSCE K PG SWIEIAGY HLFVSSDRSHPQS DLY MLGLLLNTM
Sbjct: 541 GRWSDVARMRKLMKDSCEPKVPGFSWIEIAGYNHLFVSSDRSHPQSSDLYAMLGLLLNTM 600

Query: 601 KKDYISTISEVDIAPE 616
           KKDY S  S +DI PE
Sbjct: 601 KKDYKSIASNIDIEPE 616

BLAST of Spg023840 vs. ExPASy Swiss-Prot
Match: O82363 (Pentatricopeptide repeat-containing protein At2g46050, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E39 PE=3 SV=1)

HSP 1 Score: 454.9 bits (1169), Expect = 1.4e-126
Identity = 252/552 (45.65%), Postives = 339/552 (61.41%), Query Frame = 0

Query: 24  KASAAVDSIPRDTQLHSLVIKSGLANELSVQNKLLKIYFKCRNLECARNLFDEMPMRNVV 83
           K SA++D +    Q H  ++K G+ N L +QNKLL+ Y K R  + A  LFDEMP+RN+V
Sbjct: 44  KLSASLDHLSDVKQEHGFMVKQGIYNSLFLQNKLLQAYTKIREFDDADKLFDEMPLRNIV 103

Query: 84  SWNTVICGLVDCGYGGEFRVRQKSSVLLFKKMLMDMVDPDGVTFNALFRSCVVLNDVESG 143
           +WN +I G++     G+   R         ++L   V  D V+F  L R C    ++++G
Sbjct: 104 TWNILIHGVIQ--RDGDTNHRAHLGFCYLSRILFTDVSLDHVSFMGLIRLCTDSTNMKAG 163

Query: 144 SQLHGFVMKIGFDLDCFVGSAVVDFYAKCGLYEDARLAFSCVLYRDLVLWNVMLYCYVFN 203
            QLH  ++K G +  CF  +++V FY KCGL  +AR  F  VL RDLVLWN ++  YV N
Sbjct: 164 IQLHCLMVKQGLESSCFPSTSLVHFYGKCGLIVEARRVFEAVLDRDLVLWNALVSSYVLN 223

Query: 204 CLGREAIEVFCLMQLEGC-----KGDDFTFSSLLSSCNYKGSGELGKQLHALLIKQSLDL 263
            +  EA   F L++L G      +GD FTFSSLLS+C      E GKQ+HA+L K S   
Sbjct: 224 GMIDEA---FGLLKLMGSDKNRFRGDYFTFSSLLSACRI----EQGKQIHAILFKVSYQF 283

Query: 264 DIVVASSLVNMYSKSNNLYDARKVFGEMPNKNSVSWTTMIVGYGQQ-DGKEAVKLFRSMF 323
           DI VA++L+NMY+KSN+L DAR+ F  M  +N VSW  MIVG+ Q  +G+EA++LF  M 
Sbjct: 284 DIPVATALLNMYAKSNHLSDARECFESMVVRNVVSWNAMIVGFAQNGEGREAMRLFGQML 343

Query: 324 REDYRPDELTFASVLSSCGFTSGASELMQVHSCLIKFGFEAFLSIKNGLVNAYSKCGIIS 383
            E+ +PDELTFASVLSSC   S   E+ QV + + K G   FLS+ N L+++YS+ G +S
Sbjct: 344 LENLQPDELTFASVLSSCAKFSAIWEIKQVQAMVTKKGSADFLSVANSLISSYSRNGNLS 403

Query: 384 AASRCFRLIAKPDLVTWTSIICGLAFCGFEKDAVEFFEKMLSYGIRPDRIAFLGVLSACS 443
            A  CF  I +PDLV+WTS+I  LA  GF +++++ FE ML   ++PD+I FL VLSACS
Sbjct: 404 EALLCFHSIREPDLVSWTSVIGALASHGFAEESLQMFESMLQ-KLQPDKITFLEVLSACS 463

Query: 444 HGGLVNTGLHYFNLMTNQYQIVPDSEHLTCLIDLISRAGSLDEAFYLLKSMPKEAGPDAL 503
           HGGLV  GL  F  MT  Y+I  + EH TCLIDL+ RAG +DEA  +L SMP E    AL
Sbjct: 464 HGGLVQEGLRCFKRMTEFYKIEAEDEHYTCLIDLLGRAGFIDEASDVLNSMPTEPSTHAL 523

Query: 504 GAFIRACRTHGNLRLAKWAME--LASEMNKPVNYSLMSNIFAFEGRWSDVARMRKLMKDS 563
            AF   C  H      KW  +  L  E  KPVNYS++SN +  EG W+  A +RK  + +
Sbjct: 524 AAFTGGCNIHEKRESMKWGAKKLLEIEPTKPVNYSILSNAYVSEGHWNQAALLRKRERRN 583

Query: 564 C-ERKAPGCSWI 567
           C   K PGCSW+
Sbjct: 584 CYNPKTPGCSWL 585

BLAST of Spg023840 vs. ExPASy Swiss-Prot
Match: Q9SIT7 (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 380.9 bits (977), Expect = 2.6e-104
Identity = 208/631 (32.96%), Postives = 340/631 (53.88%), Query Frame = 0

Query: 38  LHSLVIKSGLANELSVQNKLLKIYFKCRNLECARNLFDEMPMRNVVSWNTVICGLVDCGY 97
           +H+ VIKSG +NE+ +QN+L+  Y KC +LE  R +FD+MP RN+ +WN+V+ GL   G+
Sbjct: 42  VHASVIKSGFSNEIFIQNRLIDAYSKCGSLEDGRQVFDKMPQRNIYTWNSVVTGLTKLGF 101

Query: 98  GGE----FRVRQKSSVLLFKKMLMDMVDPD--------------------GVTFNALFRS 157
             E    FR   +     +  M+      D                      +F ++  +
Sbjct: 102 LDEADSLFRSMPERDQCTWNSMVSGFAQHDRCEEALCYFAMMHKEGFVLNEYSFASVLSA 161

Query: 158 CVVLNDVESGSQLHGFVMKIGFDLDCFVGSAVVDFYAKCGLYEDARLAFSCVLYRDLVLW 217
           C  LND+  G Q+H  + K  F  D ++GSA+VD Y+KCG   DA+  F  +  R++V W
Sbjct: 162 CSGLNDMNKGVQVHSLIAKSPFLSDVYIGSALVDMYSKCGNVNDAQRVFDEMGDRNVVSW 221

Query: 218 NVMLYCYVFNCLGREAIEVFCLMQLEGCKGDDFTFSSLLSSCNYKGSGELGKQLHALLIK 277
           N ++ C+  N    EA++VF +M     + D+ T +S++S+C    + ++G+++H  ++K
Sbjct: 222 NSLITCFEQNGPAVEALDVFQMMLESRVEPDEVTLASVISACASLSAIKVGQEVHGRVVK 281

Query: 278 -QSLDLDIVVASSLVNMYSKSNNLYDARKVFGEMP------------------------- 337
              L  DI+++++ V+MY+K + + +AR +F  MP                         
Sbjct: 282 NDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIRNVIAETSMISGYAMAASTKAARL 341

Query: 338 ------NKNSVSWTTMIVGYGQQ-DGKEAVKLFRSMFREDYRPDELTFASVLSSCGFTSG 397
                  +N VSW  +I GY Q  + +EA+ LF  + RE   P   +FA++L +C   + 
Sbjct: 342 MFTKMAERNVVSWNALIAGYTQNGENEEALSLFCLLKRESVCPTHYSFANILKACADLAE 401

Query: 398 ASELMQVHSCLIKFGF------EAFLSIKNGLVNAYSKCGIISAASRCFRLIAKPDLVTW 457
               MQ H  ++K GF      E  + + N L++ Y KCG +      FR + + D V+W
Sbjct: 402 LHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLIDMYVKCGCVEEGYLVFRKMMERDCVSW 461

Query: 458 TSIICGLAFCGFEKDAVEFFEKMLSYGIRPDRIAFLGVLSACSHGGLVNTGLHYFNLMTN 517
            ++I G A  G+  +A+E F +ML  G +PD I  +GVLSAC H G V  G HYF+ MT 
Sbjct: 462 NAMIIGFAQNGYGNEALELFREMLESGEKPDHITMIGVLSACGHAGFVEEGRHYFSSMTR 521

Query: 518 QYQIVPDSEHLTCLIDLISRAGSLDEAFYLLKSMPKEAGPDALGAFIRACRTHGNLRLAK 577
            + + P  +H TC++DL+ RAG L+EA  +++ MP +      G+ + AC+ H N+ L K
Sbjct: 522 DFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHRNITLGK 581

Query: 578 WAMELASEMNKPVN---YSLMSNIFAFEGRWSDVARMRKLMKDSCERKAPGCSWIEIAGY 603
           +  E   E+ +P N   Y L+SN++A  G+W DV  +RK M+     K PGCSWI+I G+
Sbjct: 582 YVAEKLLEV-EPSNSGPYVLLSNMYAELGKWEDVMNVRKSMRKEGVTKQPGCSWIKIQGH 641

BLAST of Spg023840 vs. ExPASy Swiss-Prot
Match: P0C898 (Putative pentatricopeptide repeat-containing protein At3g15130 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H86 PE=3 SV=1)

HSP 1 Score: 347.4 bits (890), Expect = 3.2e-94
Identity = 187/554 (33.75%), Postives = 300/554 (54.15%), Query Frame = 0

Query: 37  QLHSLVIKSGLANELSVQNKLLKIYFKCRNLECARNLFDEMPMRNVVSWNTVICGLVDCG 96
           Q+H  ++KSG    L   N L+ +Y KCR    A  +FD MP RNVVSW+ ++ G V   
Sbjct: 27  QVHCYLLKSGSGLNLITSNYLIDMYCKCREPLMAYKVFDSMPERNVVSWSALMSGHV--- 86

Query: 97  YGGEFRVRQKSSVLLFKKMLMDMVDPDGVTFNALFRSCVVLNDVESGSQLHGFVMKIGFD 156
             G+     K S+ LF +M    + P+  TF+   ++C +LN +E G Q+HGF +KIGF+
Sbjct: 87  LNGDL----KGSLSLFSEMGRQGIYPNEFTFSTNLKACGLLNALEKGLQIHGFCLKIGFE 146

Query: 157 LDCFVGSAVVDFYAKCGLYEDARLAFSCVLYRDLVLWNVMLYCYVFNCLGREAIEVFCLM 216
           +   VG+++VD Y+KCG   +A   F  ++ R L+ WN M+  +V    G +A++ F +M
Sbjct: 147 MMVEVGNSLVDMYSKCGRINEAEKVFRRIVDRSLISWNAMIAGFVHAGYGSKALDTFGMM 206

Query: 217 QLEGCK--GDDFTFSSLLSSCNYKGSGELGKQLHALLIKQSLDL--DIVVASSLVNMYSK 276
           Q    K   D+FT +SLL +C+  G    GKQ+H  L++          +  SLV++Y K
Sbjct: 207 QEANIKERPDEFTLTSLLKACSSTGMIYAGKQIHGFLVRSGFHCPSSATITGSLVDLYVK 266

Query: 277 SNNLYDARKVFGEMPNKNSVSWTTMIVGYGQQ-DGKEAVKLFRSMFREDYRPDELTFASV 336
              L+ ARK F ++  K  +SW+++I+GY Q+ +  EA+ LF+ +   + + D    +S+
Sbjct: 267 CGYLFSARKAFDQIKEKTMISWSSLILGYAQEGEFVEAMGLFKRLQELNSQIDSFALSSI 326

Query: 337 LSSCGFTSGASELMQVHSCLIKFGFEAFLSIKNGLVNAYSKCGIISAASRCFRLIAKPDL 396
           +      +   +  Q+ +  +K       S+ N +V+ Y KCG++  A +CF  +   D+
Sbjct: 327 IGVFADFALLRQGKQMQALAVKLPSGLETSVLNSVVDMYLKCGLVDEAEKCFAEMQLKDV 386

Query: 397 VTWTSIICGLAFCGFEKDAVEFFEKMLSYGIRPDRIAFLGVLSACSHGGLVNTGLHYFNL 456
           ++WT +I G    G  K +V  F +ML + I PD + +L VLSACSH G++  G   F+ 
Sbjct: 387 ISWTVVITGYGKHGLGKKSVRIFYEMLRHNIEPDEVCYLAVLSACSHSGMIKEGEELFSK 446

Query: 457 MTNQYQIVPDSEHLTCLIDLISRAGSLDEAFYLLKSMPKEAGPDALGAFIRACRTHGNLR 516
           +   + I P  EH  C++DL+ RAG L EA +L+ +MP +         +  CR HG++ 
Sbjct: 447 LLETHGIKPRVEHYACVVDLLGRAGRLKEAKHLIDTMPIKPNVGIWQTLLSLCRVHGDIE 506

Query: 517 LAK--WAMELASEMNKPVNYSLMSNIFAFEGRWSDVARMRKLMKDSCERKAPGCSWIEIA 576
           L K    + L  +   P NY +MSN++   G W++    R+L      +K  G SW+EI 
Sbjct: 507 LGKEVGKILLRIDAKNPANYVMMSNLYGQAGYWNEQGNARELGNIKGLKKEAGMSWVEIE 566

Query: 577 GYIHLFVSSDRSHP 584
             +H F S + SHP
Sbjct: 567 REVHFFRSGEDSHP 573

BLAST of Spg023840 vs. ExPASy Swiss-Prot
Match: Q9CAA8 (Putative pentatricopeptide repeat-containing protein At1g68930 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H22 PE=3 SV=1)

HSP 1 Score: 347.1 bits (889), Expect = 4.2e-94
Identity = 205/579 (35.41%), Postives = 302/579 (52.16%), Query Frame = 0

Query: 55  NKLLKIYFKCRNLECARNLFDEMPMRNVVSWNTVICGLVDCGYGGEFRVRQKSSVLLFKK 114
           N LL  Y K   +    + F+++P R+ V+WN +I G    G  G       ++V  +  
Sbjct: 76  NNLLLAYSKAGLISEMESTFEKLPDRDGVTWNVLIEGYSLSGLVG-------AAVKAYNT 135

Query: 115 MLMDM-VDPDGVTFNALFRSCVVLNDVESGSQLHGFVMKIGFDLDCFVGSAVVDFYAKCG 174
           M+ D   +   VT   + +       V  G Q+HG V+K+GF+    VGS ++  YA  G
Sbjct: 136 MMRDFSANLTRVTLMTMLKLSSSNGHVSLGKQIHGQVIKLGFESYLLVGSPLLYMYANVG 195

Query: 175 LYEDARLAF------SCVLY------------------------RDLVLWNVMLYCYVFN 234
              DA+  F      + V+Y                        +D V W  M+     N
Sbjct: 196 CISDAKKVFYGLDDRNTVMYNSLMGGLLACGMIEDALQLFRGMEKDSVSWAAMIKGLAQN 255

Query: 235 CLGREAIEVFCLMQLEGCKGDDFTFSSLLSSCNYKGSGELGKQLHALLIKQSLDLDIVVA 294
            L +EAIE F  M+++G K D + F S+L +C   G+   GKQ+HA +I+ +    I V 
Sbjct: 256 GLAKEAIECFREMKVQGLKMDQYPFGSVLPACGGLGAINEGKQIHACIIRTNFQDHIYVG 315

Query: 295 SSLVNMYSKSNNLYDARKVFGEMPNKNSVSWTTMIVGYGQQD-GKEAVKLFRSMFREDYR 354
           S+L++MY K   L+ A+ VF  M  KN VSWT M+VGYGQ    +EAVK+F  M R    
Sbjct: 316 SALIDMYCKCKCLHYAKTVFDRMKQKNVVSWTAMVVGYGQTGRAEEAVKIFLDMQRSGID 375

Query: 355 PDELTFASVLSSCGFTSGASELMQVHSCLIKFGFEAFLSIKNGLVNAYSKCGIISAASRC 414
           PD  T    +S+C   S   E  Q H   I  G   ++++ N LV  Y KCG I  ++R 
Sbjct: 376 PDHYTLGQAISACANVSSLEEGSQFHGKAITSGLIHYVTVSNSLVTLYGKCGDIDDSTRL 435

Query: 415 FRLIAKPDLVTWTSIICGLAFCGFEKDAVEFFEKMLSYGIRPDRIAFLGVLSACSHGGLV 474
           F  +   D V+WT+++   A  G   + ++ F+KM+ +G++PD +   GV+SACS  GLV
Sbjct: 436 FNEMNVRDAVSWTAMVSAYAQFGRAVETIQLFDKMVQHGLKPDGVTLTGVISACSRAGLV 495

Query: 475 NTGLHYFNLMTNQYQIVPDSEHLTCLIDLISRAGSLDEAFYLLKSMPKEAGPDALG--AF 534
             G  YF LMT++Y IVP   H +C+IDL SR+G L+EA   +  MP    PDA+G    
Sbjct: 496 EKGQRYFKLMTSEYGIVPSIGHYSCMIDLFSRSGRLEEAMRFINGMP--FPPDAIGWTTL 555

Query: 535 IRACRTHGNLRLAKWAMELASEM--NKPVNYSLMSNIFAFEGRWSDVARMRKLMKDSCER 594
           + ACR  GNL + KWA E   E+  + P  Y+L+S+I+A +G+W  VA++R+ M++   +
Sbjct: 556 LSACRNKGNLEIGKWAAESLIELDPHHPAGYTLLSSIYASKGKWDSVAQLRRGMREKNVK 615

Query: 595 KAPGCSWIEIAGYIHLFVSSDRSHPQSLDLYTMLGLLLN 598
           K PG SWI+  G +H F + D S P    +Y  L  L N
Sbjct: 616 KEPGQSWIKWKGKLHSFSADDESSPYLDQIYAKLEELNN 645

BLAST of Spg023840 vs. ExPASy Swiss-Prot
Match: Q9SVA5 (Pentatricopeptide repeat-containing protein At4g39530 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E52 PE=3 SV=1)

HSP 1 Score: 346.3 bits (887), Expect = 7.2e-94
Identity = 203/582 (34.88%), Postives = 319/582 (54.81%), Query Frame = 0

Query: 23  LKASAAVDSIPRDTQLHSLVIKSGLANELSVQNKLLKIYFKCRNLECARNLFDEMPMRNV 82
           L A + +  +    Q+H+ +++ GL  + S+ N L+  Y KC  +  A  LF+ MP +N+
Sbjct: 256 LSACSILPFLEGGKQIHAHILRYGLEMDASLMNVLIDSYVKCGRVIAAHKLFNGMPNKNI 315

Query: 83  VSWNTVICGLVDCGYGGEFRVRQKSSVLLFKKMLMDMVDPDGVTFNALFRSCVVLNDVES 142
           +SW T++ G        +     K ++ LF  M    + PD    +++  SC  L+ +  
Sbjct: 316 ISWTTLLSGY-------KQNALHKEAMELFTSMSKFGLKPDMYACSSILTSCASLHALGF 375

Query: 143 GSQLHGFVMKIGFDLDCFVGSAVVDFYAKCGLYEDARLAFSCVLYRDLVLWNVMLYCYVF 202
           G+Q+H + +K     D +V ++++D YAKC    DAR  F      D+VL+N M+  Y  
Sbjct: 376 GTQVHAYTIKANLGNDSYVTNSLIDMYAKCDCLTDARKVFDIFAAADVVLFNAMIEGY-- 435

Query: 203 NCLG-----REAIEVFCLMQLEGCKGDDFTFSSLLSSCNYKGSGELGKQLHALLIKQSLD 262
           + LG      EA+ +F  M+    +    TF SLL +     S  L KQ+H L+ K  L+
Sbjct: 436 SRLGTQWELHEALNIFRDMRFRLIRPSLLTFVSLLRASASLTSLGLSKQIHGLMFKYGLN 495

Query: 263 LDIVVASSLVNMYSKSNNLYDARKVFGEMPNKNSVSWTTMIVGYGQQ-DGKEAVKLFRSM 322
           LDI   S+L+++YS    L D+R VF EM  K+ V W +M  GY QQ + +EA+ LF  +
Sbjct: 496 LDIFAGSALIDVYSNCYCLKDSRLVFDEMKVKDLVIWNSMFAGYVQQSENEEALNLFLEL 555

Query: 323 FREDYRPDELTFASVLSSCGFTSGASELMQVHSCLIKFGFEAFLSIKNGLVNAYSKCGII 382
                RPDE TFA+++++ G  +      + H  L+K G E    I N L++ Y+KCG  
Sbjct: 556 QLSRERPDEFTFANMVTAAGNLASVQLGQEFHCQLLKRGLECNPYITNALLDMYAKCGSP 615

Query: 383 SAASRCFRLIAKPDLVTWTSIICGLAFCGFEKDAVEFFEKMLSYGIRPDRIAFLGVLSAC 442
             A + F   A  D+V W S+I   A  G  K A++  EKM+S GI P+ I F+GVLSAC
Sbjct: 616 EDAHKAFDSAASRDVVCWNSVISSYANHGEGKKALQMLEKMMSEGIEPNYITFVGVLSAC 675

Query: 443 SHGGLVNTGLHYFNLMTNQYQIVPDSEHLTCLIDLISRAGSLDEAFYLLKSMPKEAGPDA 502
           SH GLV  GL  F LM  ++ I P++EH  C++ L+ RAG L++A  L++ MP +     
Sbjct: 676 SHAGLVEDGLKQFELML-RFGIEPETEHYVCMVSLLGRAGRLNKARELIEKMPTKPAAIV 735

Query: 503 LGAFIRACRTHGNLRLAKWAMELA--SEMNKPVNYSLMSNIFAFEGRWSDVARMRKLMKD 562
             + +  C   GN+ LA+ A E+A  S+     +++++SNI+A +G W++  ++R+ MK 
Sbjct: 736 WRSLLSGCAKAGNVELAEHAAEMAILSDPKDSGSFTMLSNIYASKGMWTEAKKVRERMKV 795

Query: 563 SCERKAPGCSWIEIAGYIHLFVSSDRSHPQSLDLYTMLGLLL 597
               K PG SWI I   +H+F+S D+SH ++  +Y +L  LL
Sbjct: 796 EGVVKEPGRSWIGINKEVHIFLSKDKSHCKANQIYEVLDDLL 827

BLAST of Spg023840 vs. ExPASy TrEMBL
Match: A0A0A0K863 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G005130 PE=4 SV=1)

HSP 1 Score: 1046.2 bits (2704), Expect = 5.4e-302
Identity = 513/616 (83.28%), Postives = 554/616 (89.94%), Query Frame = 0

Query: 1   MLIWPLTHFGRSRLVHSFSFNALKASAAVDSIPRDTQLHSLVIKSGLANELSVQNKLLKI 60
           MLIW  THFGRSRLVHSFSFN LKA+A V+SIP DT LHSLV+K GL NELSVQNKLL++
Sbjct: 1   MLIWTSTHFGRSRLVHSFSFNVLKAAAPVNSIPHDTLLHSLVVKLGLVNELSVQNKLLRV 60

Query: 61  YFKCRNLECARNLFDEMPMRNVVSWNTVICGLVDCGYGGEFRVRQKSSVLLFKKMLMDMV 120
           Y KCR+L+ ARNLFDEM  RNVVSWNTVICGLVD GYGGEF++RQ S  L FKKMLM +V
Sbjct: 61  YVKCRDLDSARNLFDEMARRNVVSWNTVICGLVDGGYGGEFKMRQHSIFLYFKKMLMGLV 120

Query: 121 DPDGVTFNALFRSCVVLNDVESGSQLHGFVMKIGFDLDCFVGSAVVDFYAKCGLYEDARL 180
           DPDG+TFN LFRSCVVLNDVESG QLH FVMKIGFDLDCFVGSAVVDFYAKCGLYEDARL
Sbjct: 121 DPDGITFNGLFRSCVVLNDVESGRQLHSFVMKIGFDLDCFVGSAVVDFYAKCGLYEDARL 180

Query: 181 AFSCVLYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGCKGDDFTFSSLLSSCNYKGS 240
           AFSC+LYRDLVLWNVMLYC VFN L REAIEVF LMQLEG KGDDFTFSSLLSSC YKGS
Sbjct: 181 AFSCILYRDLVLWNVMLYCCVFNSLSREAIEVFRLMQLEGFKGDDFTFSSLLSSCKYKGS 240

Query: 241 GELGKQLHALLIKQSLDLDIVVASSLVNMYSKSNNLYDARKVFGEMPNKNSVSWTTMIVG 300
           GELGKQLH LLIKQS DLDI+VASSLVN+Y+K++NLYDARKVF EMP +NSVSWTTMIVG
Sbjct: 241 GELGKQLHCLLIKQSFDLDILVASSLVNVYTKNDNLYDARKVFDEMPTRNSVSWTTMIVG 300

Query: 301 YGQQD-GKEAVKLFRSMFREDYRPDELTFASVLSSCGFTSGASELMQVHSCLIKFGFEAF 360
           YGQ + GKEAVKLFR MFR+DY PDELTFASVLSSCGFTSGASELMQVHSCLIK GFEAF
Sbjct: 301 YGQHEYGKEAVKLFRRMFRKDYCPDELTFASVLSSCGFTSGASELMQVHSCLIKLGFEAF 360

Query: 361 LSIKNGLVNAYSKCGIISAASRCFRLIAKPDLVTWTSIICGLAFCGFEKDAVEFFEKMLS 420
           LSI NGL+ AYSKCGII+AA +CFRLIA+PDLVTWTSIICGLA CG EKDAV+ F+KMLS
Sbjct: 361 LSINNGLIYAYSKCGIIAAALQCFRLIAEPDLVTWTSIICGLALCGLEKDAVKLFDKMLS 420

Query: 421 YGIRPDRIAFLGVLSACSHGGLVNTGLHYFNLMTNQYQIVPDSEHLTCLIDLISRAGSLD 480
           YGIRPD+IAFLGVLSACSHGG V+ GLHYFNLMTNQYQ+VPDSEHLTCLIDL+ RAGSLD
Sbjct: 421 YGIRPDKIAFLGVLSACSHGGFVSMGLHYFNLMTNQYQLVPDSEHLTCLIDLLGRAGSLD 480

Query: 481 EAFYLLKSMPKEAGPDALGAFIRACRTHGNLRLAKWAMELASEMNKPVNYSLMSNIFAFE 540
           +AF LLKSMPKEAGPDAL AFIRACRTHGNLRLAK AME ASE ++PVNYSL+SN++A E
Sbjct: 481 QAFDLLKSMPKEAGPDALRAFIRACRTHGNLRLAKRAMEFASEPDEPVNYSLVSNMYASE 540

Query: 541 GRWSDVARMRKLMKDSCERKAPGCSWIEIAGYIHLFVSSDRSHPQSLDLYTMLGLLLNTM 600
           GRWSDVARMRKL+ D CE+K PG SW+EIAGY HLF+S DRSHPQSLDLY MLGLLLNTM
Sbjct: 541 GRWSDVARMRKLINDRCEQKTPGLSWVEIAGYNHLFISGDRSHPQSLDLYAMLGLLLNTM 600

Query: 601 KKDYISTISEVDIAPE 616
           KKDY  T S+VDI PE
Sbjct: 601 KKDYKFTASQVDIVPE 616

BLAST of Spg023840 vs. ExPASy TrEMBL
Match: A0A6J1H3L2 (pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111460096 PE=4 SV=1)

HSP 1 Score: 1025.4 bits (2650), Expect = 9.8e-296
Identity = 505/616 (81.98%), Postives = 545/616 (88.47%), Query Frame = 0

Query: 1   MLIWPLTHFGRSRLVHSFSFNALKASAAVDSIPRDTQLHSLVIKSGLANELSVQNKLLKI 60
           MLIWP THFG  RLVHSFSFN LKA+A ++SIPR T+LHSLVIK GLANELSVQNKLLKI
Sbjct: 1   MLIWPSTHFGCCRLVHSFSFNVLKAAADLNSIPRGTKLHSLVIKLGLANELSVQNKLLKI 60

Query: 61  YFKCRNLECARNLFDEMPMRNVVSWNTVICGLVDCGYGGEFRVRQKSSVLLFKKMLMDMV 120
           Y KCR+L  ARNLFDEM  RNVVSWNTVICG+V+CGYGGEF++R++S +  FK MLMDMV
Sbjct: 61  YVKCRDLGRARNLFDEMRRRNVVSWNTVICGVVNCGYGGEFKMRERSILSCFKNMLMDMV 120

Query: 121 DPDGVTFNALFRSCVVLNDVESGSQLHGFVMKIGFDLDCFVGSAVVDFYAKCGLYEDARL 180
           DPDGVTFN LFRSC V+NDV SG QLHGFV+KIGFDLDCFVGSAVVDFYAKCGLYEDARL
Sbjct: 121 DPDGVTFNGLFRSCDVMNDVGSGKQLHGFVIKIGFDLDCFVGSAVVDFYAKCGLYEDARL 180

Query: 181 AFSCVLYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGCKGDDFTFSSLLSSCNYKGS 240
           AFS VLY+DLVLWNVMLYCYVFNCL +EAIE+F LMQLEG  GDDFTFSSLLSSC YKGS
Sbjct: 181 AFSSVLYKDLVLWNVMLYCYVFNCLAKEAIEIFFLMQLEGFTGDDFTFSSLLSSCKYKGS 240

Query: 241 GELGKQLHALLIKQSLDLDIVVASSLVNMYSKSNNLYDARKVFGEMPNKNSVSWTTMIVG 300
           GELGKQLH  LIK S DLDI+VASSLVNMY+K+N+LYDARK F EMP +NSVSWTTMIVG
Sbjct: 241 GELGKQLHVHLIKHSFDLDILVASSLVNMYAKNNHLYDARKAFDEMPIRNSVSWTTMIVG 300

Query: 301 YGQQD-GKEAVKLFRSMFREDYRPDELTFASVLSSCGFTSGASELMQVHSCLIKFGFEAF 360
           YGQQ+ GKEAVKL R MF EDY PDELTFASVLSSCGFTSGASEL+QVHSCLIK GFEAF
Sbjct: 301 YGQQEHGKEAVKLLRRMFEEDYYPDELTFASVLSSCGFTSGASELIQVHSCLIKLGFEAF 360

Query: 361 LSIKNGLVNAYSKCGIISAASRCFRLIAKPDLVTWTSIICGLAFCGFEKDAVEFFEKMLS 420
           LS+ NGL+NAYSKCG IS A RCFRLIA+PDLV+WTSIICG AFCG EK AVE F+KMLS
Sbjct: 361 LSVNNGLINAYSKCGTISPALRCFRLIAEPDLVSWTSIICGFAFCGLEKAAVELFDKMLS 420

Query: 421 YGIRPDRIAFLGVLSACSHGGLVNTGLHYFNLMTNQYQIVPDSEHLTCLIDLISRAGSLD 480
            GIRPD+IAFLGVLSACSHGG VN GLHYFNLMTN+YQIVPDSEHLTCLIDLI RAGSLD
Sbjct: 421 QGIRPDKIAFLGVLSACSHGGFVNMGLHYFNLMTNEYQIVPDSEHLTCLIDLIGRAGSLD 480

Query: 481 EAFYLLKSMPKEAGPDALGAFIRACRTHGNLRLAKWAMELASEMNKPVNYSLMSNIFAFE 540
           EAF LLKS+ +EAGPDA  +FIRACRTHG LRLAKWAME AS+  KPVN SLMSN++A E
Sbjct: 481 EAFKLLKSVSEEAGPDAFRSFIRACRTHGRLRLAKWAMEFASDPCKPVNSSLMSNMYASE 540

Query: 541 GRWSDVARMRKLMKDSCERKAPGCSWIEIAGYIHLFVSSDRSHPQSLDLYTMLGLLLNTM 600
           GRWSDVARMRKL+KDSCE K PG SWIEIAGY HLFVSSDRSHPQS DLY MLGLLLNT+
Sbjct: 541 GRWSDVARMRKLLKDSCEPKVPGFSWIEIAGYNHLFVSSDRSHPQSSDLYEMLGLLLNTV 600

Query: 601 KKDYISTISEVDIAPE 616
           KKDY ST S +DI PE
Sbjct: 601 KKDYKSTASNIDIEPE 616

BLAST of Spg023840 vs. ExPASy TrEMBL
Match: A0A6J1L572 (pentatricopeptide repeat-containing protein At2g46050, mitochondrial OS=Cucurbita maxima OX=3661 GN=LOC111499233 PE=4 SV=1)

HSP 1 Score: 1021.1 bits (2639), Expect = 1.9e-294
Identity = 505/616 (81.98%), Postives = 544/616 (88.31%), Query Frame = 0

Query: 1   MLIWPLTHFGRSRLVHSFSFNALKASAAVDSIPRDTQLHSLVIKSGLANELSVQNKLLKI 60
           MLIWP THFG SRLVHSFSFN LKA+A V+SIPR TQLHSLVIK GLANELSVQNKLLKI
Sbjct: 1   MLIWPSTHFGCSRLVHSFSFNVLKAAADVNSIPRGTQLHSLVIKLGLANELSVQNKLLKI 60

Query: 61  YFKCRNLECARNLFDEMPMRNVVSWNTVICGLVDCGYGGEFRVRQKSSVLLFKKMLMDMV 120
           Y KCR+L  A NLFDEM  RNVVSWNTVICG+VDCGYGGEF++R++S++  FK MLM+MV
Sbjct: 61  YVKCRDLGRAWNLFDEMRRRNVVSWNTVICGVVDCGYGGEFKMRERSNLSCFKNMLMEMV 120

Query: 121 DPDGVTFNALFRSCVVLNDVESGSQLHGFVMKIGFDLDCFVGSAVVDFYAKCGLYEDARL 180
           DPDGVTFN LFRSC V+NDV SG QLHGFV+K GFDLDCFVGSAVVDFYAKCGLYEDARL
Sbjct: 121 DPDGVTFNGLFRSCDVMNDVGSGKQLHGFVIKFGFDLDCFVGSAVVDFYAKCGLYEDARL 180

Query: 181 AFSCVLYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGCKGDDFTFSSLLSSCNYKGS 240
           AFS VLY+DLVLWNVMLYCYVFNCL  EAIE+F LMQLEG  GDDFTFSSLLSSC YKGS
Sbjct: 181 AFSSVLYKDLVLWNVMLYCYVFNCLAEEAIEIFFLMQLEGFTGDDFTFSSLLSSCKYKGS 240

Query: 241 GELGKQLHALLIKQSLDLDIVVASSLVNMYSKSNNLYDARKVFGEMPNKNSVSWTTMIVG 300
           GELG QLH  LIK S DLDI+VASSLVNMY+K+N+LYDARKVF EMP +NSVSWTTMIVG
Sbjct: 241 GELGMQLHVHLIKHSFDLDILVASSLVNMYAKNNHLYDARKVFDEMPIRNSVSWTTMIVG 300

Query: 301 YGQQD-GKEAVKLFRSMFREDYRPDELTFASVLSSCGFTSGASELMQVHSCLIKFGFEAF 360
           YGQQ+ GKEAVKL R M  EDY PDELTFASVLSSCGFTSGASEL+QVHSCLIK GFEAF
Sbjct: 301 YGQQEHGKEAVKLLRRMLEEDYYPDELTFASVLSSCGFTSGASELIQVHSCLIKLGFEAF 360

Query: 361 LSIKNGLVNAYSKCGIISAASRCFRLIAKPDLVTWTSIICGLAFCGFEKDAVEFFEKMLS 420
           LS+ NGL+NAYSKCG IS+A RCFRLIA+PDLV+ TSIICGLAFCG EKDAVE F+KMLS
Sbjct: 361 LSVNNGLINAYSKCGAISSALRCFRLIAEPDLVSRTSIICGLAFCGVEKDAVELFDKMLS 420

Query: 421 YGIRPDRIAFLGVLSACSHGGLVNTGLHYFNLMTNQYQIVPDSEHLTCLIDLISRAGSLD 480
            GIRPD+IAFLGVLSACSHGG  N GLHYFNLMTN+YQIVPDSEHLTCLIDL+ RAGSLD
Sbjct: 421 QGIRPDKIAFLGVLSACSHGGYANMGLHYFNLMTNEYQIVPDSEHLTCLIDLLGRAGSLD 480

Query: 481 EAFYLLKSMPKEAGPDALGAFIRACRTHGNLRLAKWAMELASEMNKPVNYSLMSNIFAFE 540
           EAF LLKS+ ++AGPDA  +FIRACRTHG+LRLAKWAME AS+  KPVN SLMSNI+A E
Sbjct: 481 EAFKLLKSVSEKAGPDAFRSFIRACRTHGHLRLAKWAMEFASDPYKPVNCSLMSNIYASE 540

Query: 541 GRWSDVARMRKLMKDSCERKAPGCSWIEIAGYIHLFVSSDRSHPQSLDLYTMLGLLLNTM 600
           GRWSDVARMRKLMKDSCE K PG SWIEIAGY HLFVSSDRSHPQS DLY MLGLLLNTM
Sbjct: 541 GRWSDVARMRKLMKDSCEPKVPGFSWIEIAGYNHLFVSSDRSHPQSSDLYAMLGLLLNTM 600

Query: 601 KKDYISTISEVDIAPE 616
           KKDY S  S +DI PE
Sbjct: 601 KKDYKSIASNIDIEPE 616

BLAST of Spg023840 vs. ExPASy TrEMBL
Match: A0A5D3BXR6 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold46G001210 PE=4 SV=1)

HSP 1 Score: 1017.7 bits (2630), Expect = 2.1e-293
Identity = 496/607 (81.71%), Postives = 543/607 (89.46%), Query Frame = 0

Query: 1   MLIWPLTHFGRSRLVHSFSFNALKASAAVDSIPRDTQLHSLVIKSGLANELSVQNKLLKI 60
           MLIW  THFGRSRLVHSFSFN LKA+A V+SIPRDT LHS+V+K GLANELSVQNKLLK+
Sbjct: 11  MLIWTSTHFGRSRLVHSFSFNVLKAAAPVNSIPRDTLLHSVVVKLGLANELSVQNKLLKV 70

Query: 61  YFKCRNLECARNLFDEMPMRNVVSWNTVICGLVDCGYGGEFRVRQKSSVLLFKKMLMDMV 120
           Y KCR+L+ AR+LFDEMP RN VSWNTVICGLVD GYGGEF+ RQ+   L FKKMLM +V
Sbjct: 71  YVKCRDLDSARSLFDEMPRRNAVSWNTVICGLVDGGYGGEFKTRQRLIFLYFKKMLMGLV 130

Query: 121 DPDGVTFNALFRSCVVLNDVESGSQLHGFVMKIGFDLDCFVGSAVVDFYAKCGLYEDARL 180
           DPDG+TFN LFRSCVVLNDVESG QLH FVMKIGFDLDCFVGSA+VDFYAKCGLYEDARL
Sbjct: 131 DPDGITFNGLFRSCVVLNDVESGRQLHSFVMKIGFDLDCFVGSALVDFYAKCGLYEDARL 190

Query: 181 AFSCVLYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGCKGDDFTFSSLLSSCNYKGS 240
           AFSC LY+DLVLWNVMLYCYVFN L REAIE F LMQLEG KGD+FTFSSLLSSC YKGS
Sbjct: 191 AFSCTLYKDLVLWNVMLYCYVFNSLSREAIEGFRLMQLEGFKGDEFTFSSLLSSCKYKGS 250

Query: 241 GELGKQLHALLIKQSLDLDIVVASSLVNMYSKSNNLYDARKVFGEMPNKNSVSWTTMIVG 300
           GELGKQLH LLIKQS DLDI+VASSL+++Y+K++NLYDARKVF EMP +NSVSWTTMIVG
Sbjct: 251 GELGKQLHGLLIKQSFDLDILVASSLIDVYAKNDNLYDARKVFDEMPTRNSVSWTTMIVG 310

Query: 301 YGQQD-GKEAVKLFRSMFREDYRPDELTFASVLSSCGFTSGASELMQVHSCLIKFGFEAF 360
           YGQQ+ GKEAVKLFR MF +DY  DELTFASVLSSCGFTSGASELMQVHSCLIK GFEAF
Sbjct: 311 YGQQEYGKEAVKLFRRMFGKDYCLDELTFASVLSSCGFTSGASELMQVHSCLIKLGFEAF 370

Query: 361 LSIKNGLVNAYSKCGIISAASRCFRLIAKPDLVTWTSIICGLAFCGFEKDAVEFFEKMLS 420
           LSI NGL+ AYSKCGI++AA +CFRLIA+PDLVTWTSIICGLAFCG EKDAV+ F+KMLS
Sbjct: 371 LSINNGLIYAYSKCGIVAAALQCFRLIAEPDLVTWTSIICGLAFCGLEKDAVKLFDKMLS 430

Query: 421 YGIRPDRIAFLGVLSACSHGGLVNTGLHYFNLMTNQYQIVPDSEHLTCLIDLISRAGSLD 480
           YGIRPD+IAFLGVLSACSHGG V+ GLHYFNLMTNQYQ+VPD EHLTCLIDL+ RAGSLD
Sbjct: 431 YGIRPDKIAFLGVLSACSHGGFVSMGLHYFNLMTNQYQLVPDPEHLTCLIDLLGRAGSLD 490

Query: 481 EAFYLLKSMPKEAGPDALGAFIRACRTHGNLRLAKWAMELASEMNKPVNYSLMSNIFAFE 540
           +AF LLKSM KEAGPDAL AFIRACRTHGNL+LAKWAME  SE ++PVNYSL+SN++A E
Sbjct: 491 QAFDLLKSMRKEAGPDALTAFIRACRTHGNLKLAKWAMEFISEPDEPVNYSLVSNMYASE 550

Query: 541 GRWSDVARMRKLMKDSCERKAPGCSWIEIAGYIHLFVSSDRSHPQSLDLYTMLGLLLNTM 600
           GRWSDVARM KL+ D CE+K PG SW+EIAGY HLF S DRSHPQS DLY MLGLLLNTM
Sbjct: 551 GRWSDVARMHKLINDRCEQKTPGLSWVEIAGYNHLFKSGDRSHPQSSDLYAMLGLLLNTM 610

Query: 601 KKDYIST 607
           K+DY ST
Sbjct: 611 KEDYKST 617

BLAST of Spg023840 vs. ExPASy TrEMBL
Match: A0A1S3C6T7 (pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 OS=Cucumis melo OX=3656 GN=LOC103497699 PE=4 SV=1)

HSP 1 Score: 1017.7 bits (2630), Expect = 2.1e-293
Identity = 496/607 (81.71%), Postives = 543/607 (89.46%), Query Frame = 0

Query: 1   MLIWPLTHFGRSRLVHSFSFNALKASAAVDSIPRDTQLHSLVIKSGLANELSVQNKLLKI 60
           MLIW  THFGRSRLVHSFSFN LKA+A V+SIPRDT LHS+V+K GLANELSVQNKLLK+
Sbjct: 11  MLIWTSTHFGRSRLVHSFSFNVLKAAAPVNSIPRDTLLHSVVVKLGLANELSVQNKLLKV 70

Query: 61  YFKCRNLECARNLFDEMPMRNVVSWNTVICGLVDCGYGGEFRVRQKSSVLLFKKMLMDMV 120
           Y KCR+L+ AR+LFDEMP RN VSWNTVICGLVD GYGGEF+ RQ+   L FKKMLM +V
Sbjct: 71  YVKCRDLDSARSLFDEMPRRNAVSWNTVICGLVDGGYGGEFKTRQRLIFLYFKKMLMGLV 130

Query: 121 DPDGVTFNALFRSCVVLNDVESGSQLHGFVMKIGFDLDCFVGSAVVDFYAKCGLYEDARL 180
           DPDG+TFN LFRSCVVLNDVESG QLH FVMKIGFDLDCFVGSA+VDFYAKCGLYEDARL
Sbjct: 131 DPDGITFNGLFRSCVVLNDVESGRQLHSFVMKIGFDLDCFVGSALVDFYAKCGLYEDARL 190

Query: 181 AFSCVLYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGCKGDDFTFSSLLSSCNYKGS 240
           AFSC LY+DLVLWNVMLYCYVFN L REAIE F LMQLEG KGD+FTFSSLLSSC YKGS
Sbjct: 191 AFSCTLYKDLVLWNVMLYCYVFNSLSREAIEGFRLMQLEGFKGDEFTFSSLLSSCKYKGS 250

Query: 241 GELGKQLHALLIKQSLDLDIVVASSLVNMYSKSNNLYDARKVFGEMPNKNSVSWTTMIVG 300
           GELGKQLH LLIKQS DLDI+VASSL+++Y+K++NLYDARKVF EMP +NSVSWTTMIVG
Sbjct: 251 GELGKQLHGLLIKQSFDLDILVASSLIDVYAKNDNLYDARKVFDEMPTRNSVSWTTMIVG 310

Query: 301 YGQQD-GKEAVKLFRSMFREDYRPDELTFASVLSSCGFTSGASELMQVHSCLIKFGFEAF 360
           YGQQ+ GKEAVKLFR MF +DY  DELTFASVLSSCGFTSGASELMQVHSCLIK GFEAF
Sbjct: 311 YGQQEYGKEAVKLFRRMFGKDYCLDELTFASVLSSCGFTSGASELMQVHSCLIKLGFEAF 370

Query: 361 LSIKNGLVNAYSKCGIISAASRCFRLIAKPDLVTWTSIICGLAFCGFEKDAVEFFEKMLS 420
           LSI NGL+ AYSKCGI++AA +CFRLIA+PDLVTWTSIICGLAFCG EKDAV+ F+KMLS
Sbjct: 371 LSINNGLIYAYSKCGIVAAALQCFRLIAEPDLVTWTSIICGLAFCGLEKDAVKLFDKMLS 430

Query: 421 YGIRPDRIAFLGVLSACSHGGLVNTGLHYFNLMTNQYQIVPDSEHLTCLIDLISRAGSLD 480
           YGIRPD+IAFLGVLSACSHGG V+ GLHYFNLMTNQYQ+VPD EHLTCLIDL+ RAGSLD
Sbjct: 431 YGIRPDKIAFLGVLSACSHGGFVSMGLHYFNLMTNQYQLVPDPEHLTCLIDLLGRAGSLD 490

Query: 481 EAFYLLKSMPKEAGPDALGAFIRACRTHGNLRLAKWAMELASEMNKPVNYSLMSNIFAFE 540
           +AF LLKSM KEAGPDAL AFIRACRTHGNL+LAKWAME  SE ++PVNYSL+SN++A E
Sbjct: 491 QAFDLLKSMRKEAGPDALTAFIRACRTHGNLKLAKWAMEFISEPDEPVNYSLVSNMYASE 550

Query: 541 GRWSDVARMRKLMKDSCERKAPGCSWIEIAGYIHLFVSSDRSHPQSLDLYTMLGLLLNTM 600
           GRWSDVARM KL+ D CE+K PG SW+EIAGY HLF S DRSHPQS DLY MLGLLLNTM
Sbjct: 551 GRWSDVARMHKLINDRCEQKTPGLSWVEIAGYNHLFKSGDRSHPQSSDLYAMLGLLLNTM 610

Query: 601 KKDYIST 607
           K+DY ST
Sbjct: 611 KEDYKST 617

BLAST of Spg023840 vs. TAIR 10
Match: AT2G46050.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 454.9 bits (1169), Expect = 1.0e-127
Identity = 252/552 (45.65%), Postives = 339/552 (61.41%), Query Frame = 0

Query: 24  KASAAVDSIPRDTQLHSLVIKSGLANELSVQNKLLKIYFKCRNLECARNLFDEMPMRNVV 83
           K SA++D +    Q H  ++K G+ N L +QNKLL+ Y K R  + A  LFDEMP+RN+V
Sbjct: 44  KLSASLDHLSDVKQEHGFMVKQGIYNSLFLQNKLLQAYTKIREFDDADKLFDEMPLRNIV 103

Query: 84  SWNTVICGLVDCGYGGEFRVRQKSSVLLFKKMLMDMVDPDGVTFNALFRSCVVLNDVESG 143
           +WN +I G++     G+   R         ++L   V  D V+F  L R C    ++++G
Sbjct: 104 TWNILIHGVIQ--RDGDTNHRAHLGFCYLSRILFTDVSLDHVSFMGLIRLCTDSTNMKAG 163

Query: 144 SQLHGFVMKIGFDLDCFVGSAVVDFYAKCGLYEDARLAFSCVLYRDLVLWNVMLYCYVFN 203
            QLH  ++K G +  CF  +++V FY KCGL  +AR  F  VL RDLVLWN ++  YV N
Sbjct: 164 IQLHCLMVKQGLESSCFPSTSLVHFYGKCGLIVEARRVFEAVLDRDLVLWNALVSSYVLN 223

Query: 204 CLGREAIEVFCLMQLEGC-----KGDDFTFSSLLSSCNYKGSGELGKQLHALLIKQSLDL 263
            +  EA   F L++L G      +GD FTFSSLLS+C      E GKQ+HA+L K S   
Sbjct: 224 GMIDEA---FGLLKLMGSDKNRFRGDYFTFSSLLSACRI----EQGKQIHAILFKVSYQF 283

Query: 264 DIVVASSLVNMYSKSNNLYDARKVFGEMPNKNSVSWTTMIVGYGQQ-DGKEAVKLFRSMF 323
           DI VA++L+NMY+KSN+L DAR+ F  M  +N VSW  MIVG+ Q  +G+EA++LF  M 
Sbjct: 284 DIPVATALLNMYAKSNHLSDARECFESMVVRNVVSWNAMIVGFAQNGEGREAMRLFGQML 343

Query: 324 REDYRPDELTFASVLSSCGFTSGASELMQVHSCLIKFGFEAFLSIKNGLVNAYSKCGIIS 383
            E+ +PDELTFASVLSSC   S   E+ QV + + K G   FLS+ N L+++YS+ G +S
Sbjct: 344 LENLQPDELTFASVLSSCAKFSAIWEIKQVQAMVTKKGSADFLSVANSLISSYSRNGNLS 403

Query: 384 AASRCFRLIAKPDLVTWTSIICGLAFCGFEKDAVEFFEKMLSYGIRPDRIAFLGVLSACS 443
            A  CF  I +PDLV+WTS+I  LA  GF +++++ FE ML   ++PD+I FL VLSACS
Sbjct: 404 EALLCFHSIREPDLVSWTSVIGALASHGFAEESLQMFESMLQ-KLQPDKITFLEVLSACS 463

Query: 444 HGGLVNTGLHYFNLMTNQYQIVPDSEHLTCLIDLISRAGSLDEAFYLLKSMPKEAGPDAL 503
           HGGLV  GL  F  MT  Y+I  + EH TCLIDL+ RAG +DEA  +L SMP E    AL
Sbjct: 464 HGGLVQEGLRCFKRMTEFYKIEAEDEHYTCLIDLLGRAGFIDEASDVLNSMPTEPSTHAL 523

Query: 504 GAFIRACRTHGNLRLAKWAME--LASEMNKPVNYSLMSNIFAFEGRWSDVARMRKLMKDS 563
            AF   C  H      KW  +  L  E  KPVNYS++SN +  EG W+  A +RK  + +
Sbjct: 524 AAFTGGCNIHEKRESMKWGAKKLLEIEPTKPVNYSILSNAYVSEGHWNQAALLRKRERRN 583

Query: 564 C-ERKAPGCSWI 567
           C   K PGCSW+
Sbjct: 584 CYNPKTPGCSWL 585

BLAST of Spg023840 vs. TAIR 10
Match: AT2G13600.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 380.9 bits (977), Expect = 1.9e-105
Identity = 208/631 (32.96%), Postives = 340/631 (53.88%), Query Frame = 0

Query: 38  LHSLVIKSGLANELSVQNKLLKIYFKCRNLECARNLFDEMPMRNVVSWNTVICGLVDCGY 97
           +H+ VIKSG +NE+ +QN+L+  Y KC +LE  R +FD+MP RN+ +WN+V+ GL   G+
Sbjct: 42  VHASVIKSGFSNEIFIQNRLIDAYSKCGSLEDGRQVFDKMPQRNIYTWNSVVTGLTKLGF 101

Query: 98  GGE----FRVRQKSSVLLFKKMLMDMVDPD--------------------GVTFNALFRS 157
             E    FR   +     +  M+      D                      +F ++  +
Sbjct: 102 LDEADSLFRSMPERDQCTWNSMVSGFAQHDRCEEALCYFAMMHKEGFVLNEYSFASVLSA 161

Query: 158 CVVLNDVESGSQLHGFVMKIGFDLDCFVGSAVVDFYAKCGLYEDARLAFSCVLYRDLVLW 217
           C  LND+  G Q+H  + K  F  D ++GSA+VD Y+KCG   DA+  F  +  R++V W
Sbjct: 162 CSGLNDMNKGVQVHSLIAKSPFLSDVYIGSALVDMYSKCGNVNDAQRVFDEMGDRNVVSW 221

Query: 218 NVMLYCYVFNCLGREAIEVFCLMQLEGCKGDDFTFSSLLSSCNYKGSGELGKQLHALLIK 277
           N ++ C+  N    EA++VF +M     + D+ T +S++S+C    + ++G+++H  ++K
Sbjct: 222 NSLITCFEQNGPAVEALDVFQMMLESRVEPDEVTLASVISACASLSAIKVGQEVHGRVVK 281

Query: 278 -QSLDLDIVVASSLVNMYSKSNNLYDARKVFGEMP------------------------- 337
              L  DI+++++ V+MY+K + + +AR +F  MP                         
Sbjct: 282 NDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIRNVIAETSMISGYAMAASTKAARL 341

Query: 338 ------NKNSVSWTTMIVGYGQQ-DGKEAVKLFRSMFREDYRPDELTFASVLSSCGFTSG 397
                  +N VSW  +I GY Q  + +EA+ LF  + RE   P   +FA++L +C   + 
Sbjct: 342 MFTKMAERNVVSWNALIAGYTQNGENEEALSLFCLLKRESVCPTHYSFANILKACADLAE 401

Query: 398 ASELMQVHSCLIKFGF------EAFLSIKNGLVNAYSKCGIISAASRCFRLIAKPDLVTW 457
               MQ H  ++K GF      E  + + N L++ Y KCG +      FR + + D V+W
Sbjct: 402 LHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLIDMYVKCGCVEEGYLVFRKMMERDCVSW 461

Query: 458 TSIICGLAFCGFEKDAVEFFEKMLSYGIRPDRIAFLGVLSACSHGGLVNTGLHYFNLMTN 517
            ++I G A  G+  +A+E F +ML  G +PD I  +GVLSAC H G V  G HYF+ MT 
Sbjct: 462 NAMIIGFAQNGYGNEALELFREMLESGEKPDHITMIGVLSACGHAGFVEEGRHYFSSMTR 521

Query: 518 QYQIVPDSEHLTCLIDLISRAGSLDEAFYLLKSMPKEAGPDALGAFIRACRTHGNLRLAK 577
            + + P  +H TC++DL+ RAG L+EA  +++ MP +      G+ + AC+ H N+ L K
Sbjct: 522 DFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHRNITLGK 581

Query: 578 WAMELASEMNKPVN---YSLMSNIFAFEGRWSDVARMRKLMKDSCERKAPGCSWIEIAGY 603
           +  E   E+ +P N   Y L+SN++A  G+W DV  +RK M+     K PGCSWI+I G+
Sbjct: 582 YVAEKLLEV-EPSNSGPYVLLSNMYAELGKWEDVMNVRKSMRKEGVTKQPGCSWIKIQGH 641

BLAST of Spg023840 vs. TAIR 10
Match: AT3G15130.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 347.4 bits (890), Expect = 2.3e-95
Identity = 187/554 (33.75%), Postives = 300/554 (54.15%), Query Frame = 0

Query: 37  QLHSLVIKSGLANELSVQNKLLKIYFKCRNLECARNLFDEMPMRNVVSWNTVICGLVDCG 96
           Q+H  ++KSG    L   N L+ +Y KCR    A  +FD MP RNVVSW+ ++ G V   
Sbjct: 27  QVHCYLLKSGSGLNLITSNYLIDMYCKCREPLMAYKVFDSMPERNVVSWSALMSGHV--- 86

Query: 97  YGGEFRVRQKSSVLLFKKMLMDMVDPDGVTFNALFRSCVVLNDVESGSQLHGFVMKIGFD 156
             G+     K S+ LF +M    + P+  TF+   ++C +LN +E G Q+HGF +KIGF+
Sbjct: 87  LNGDL----KGSLSLFSEMGRQGIYPNEFTFSTNLKACGLLNALEKGLQIHGFCLKIGFE 146

Query: 157 LDCFVGSAVVDFYAKCGLYEDARLAFSCVLYRDLVLWNVMLYCYVFNCLGREAIEVFCLM 216
           +   VG+++VD Y+KCG   +A   F  ++ R L+ WN M+  +V    G +A++ F +M
Sbjct: 147 MMVEVGNSLVDMYSKCGRINEAEKVFRRIVDRSLISWNAMIAGFVHAGYGSKALDTFGMM 206

Query: 217 QLEGCK--GDDFTFSSLLSSCNYKGSGELGKQLHALLIKQSLDL--DIVVASSLVNMYSK 276
           Q    K   D+FT +SLL +C+  G    GKQ+H  L++          +  SLV++Y K
Sbjct: 207 QEANIKERPDEFTLTSLLKACSSTGMIYAGKQIHGFLVRSGFHCPSSATITGSLVDLYVK 266

Query: 277 SNNLYDARKVFGEMPNKNSVSWTTMIVGYGQQ-DGKEAVKLFRSMFREDYRPDELTFASV 336
              L+ ARK F ++  K  +SW+++I+GY Q+ +  EA+ LF+ +   + + D    +S+
Sbjct: 267 CGYLFSARKAFDQIKEKTMISWSSLILGYAQEGEFVEAMGLFKRLQELNSQIDSFALSSI 326

Query: 337 LSSCGFTSGASELMQVHSCLIKFGFEAFLSIKNGLVNAYSKCGIISAASRCFRLIAKPDL 396
           +      +   +  Q+ +  +K       S+ N +V+ Y KCG++  A +CF  +   D+
Sbjct: 327 IGVFADFALLRQGKQMQALAVKLPSGLETSVLNSVVDMYLKCGLVDEAEKCFAEMQLKDV 386

Query: 397 VTWTSIICGLAFCGFEKDAVEFFEKMLSYGIRPDRIAFLGVLSACSHGGLVNTGLHYFNL 456
           ++WT +I G    G  K +V  F +ML + I PD + +L VLSACSH G++  G   F+ 
Sbjct: 387 ISWTVVITGYGKHGLGKKSVRIFYEMLRHNIEPDEVCYLAVLSACSHSGMIKEGEELFSK 446

Query: 457 MTNQYQIVPDSEHLTCLIDLISRAGSLDEAFYLLKSMPKEAGPDALGAFIRACRTHGNLR 516
           +   + I P  EH  C++DL+ RAG L EA +L+ +MP +         +  CR HG++ 
Sbjct: 447 LLETHGIKPRVEHYACVVDLLGRAGRLKEAKHLIDTMPIKPNVGIWQTLLSLCRVHGDIE 506

Query: 517 LAK--WAMELASEMNKPVNYSLMSNIFAFEGRWSDVARMRKLMKDSCERKAPGCSWIEIA 576
           L K    + L  +   P NY +MSN++   G W++    R+L      +K  G SW+EI 
Sbjct: 507 LGKEVGKILLRIDAKNPANYVMMSNLYGQAGYWNEQGNARELGNIKGLKKEAGMSWVEIE 566

Query: 577 GYIHLFVSSDRSHP 584
             +H F S + SHP
Sbjct: 567 REVHFFRSGEDSHP 573

BLAST of Spg023840 vs. TAIR 10
Match: AT1G68930.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 347.1 bits (889), Expect = 3.0e-95
Identity = 205/579 (35.41%), Postives = 302/579 (52.16%), Query Frame = 0

Query: 55  NKLLKIYFKCRNLECARNLFDEMPMRNVVSWNTVICGLVDCGYGGEFRVRQKSSVLLFKK 114
           N LL  Y K   +    + F+++P R+ V+WN +I G    G  G       ++V  +  
Sbjct: 76  NNLLLAYSKAGLISEMESTFEKLPDRDGVTWNVLIEGYSLSGLVG-------AAVKAYNT 135

Query: 115 MLMDM-VDPDGVTFNALFRSCVVLNDVESGSQLHGFVMKIGFDLDCFVGSAVVDFYAKCG 174
           M+ D   +   VT   + +       V  G Q+HG V+K+GF+    VGS ++  YA  G
Sbjct: 136 MMRDFSANLTRVTLMTMLKLSSSNGHVSLGKQIHGQVIKLGFESYLLVGSPLLYMYANVG 195

Query: 175 LYEDARLAF------SCVLY------------------------RDLVLWNVMLYCYVFN 234
              DA+  F      + V+Y                        +D V W  M+     N
Sbjct: 196 CISDAKKVFYGLDDRNTVMYNSLMGGLLACGMIEDALQLFRGMEKDSVSWAAMIKGLAQN 255

Query: 235 CLGREAIEVFCLMQLEGCKGDDFTFSSLLSSCNYKGSGELGKQLHALLIKQSLDLDIVVA 294
            L +EAIE F  M+++G K D + F S+L +C   G+   GKQ+HA +I+ +    I V 
Sbjct: 256 GLAKEAIECFREMKVQGLKMDQYPFGSVLPACGGLGAINEGKQIHACIIRTNFQDHIYVG 315

Query: 295 SSLVNMYSKSNNLYDARKVFGEMPNKNSVSWTTMIVGYGQQD-GKEAVKLFRSMFREDYR 354
           S+L++MY K   L+ A+ VF  M  KN VSWT M+VGYGQ    +EAVK+F  M R    
Sbjct: 316 SALIDMYCKCKCLHYAKTVFDRMKQKNVVSWTAMVVGYGQTGRAEEAVKIFLDMQRSGID 375

Query: 355 PDELTFASVLSSCGFTSGASELMQVHSCLIKFGFEAFLSIKNGLVNAYSKCGIISAASRC 414
           PD  T    +S+C   S   E  Q H   I  G   ++++ N LV  Y KCG I  ++R 
Sbjct: 376 PDHYTLGQAISACANVSSLEEGSQFHGKAITSGLIHYVTVSNSLVTLYGKCGDIDDSTRL 435

Query: 415 FRLIAKPDLVTWTSIICGLAFCGFEKDAVEFFEKMLSYGIRPDRIAFLGVLSACSHGGLV 474
           F  +   D V+WT+++   A  G   + ++ F+KM+ +G++PD +   GV+SACS  GLV
Sbjct: 436 FNEMNVRDAVSWTAMVSAYAQFGRAVETIQLFDKMVQHGLKPDGVTLTGVISACSRAGLV 495

Query: 475 NTGLHYFNLMTNQYQIVPDSEHLTCLIDLISRAGSLDEAFYLLKSMPKEAGPDALG--AF 534
             G  YF LMT++Y IVP   H +C+IDL SR+G L+EA   +  MP    PDA+G    
Sbjct: 496 EKGQRYFKLMTSEYGIVPSIGHYSCMIDLFSRSGRLEEAMRFINGMP--FPPDAIGWTTL 555

Query: 535 IRACRTHGNLRLAKWAMELASEM--NKPVNYSLMSNIFAFEGRWSDVARMRKLMKDSCER 594
           + ACR  GNL + KWA E   E+  + P  Y+L+S+I+A +G+W  VA++R+ M++   +
Sbjct: 556 LSACRNKGNLEIGKWAAESLIELDPHHPAGYTLLSSIYASKGKWDSVAQLRRGMREKNVK 615

Query: 595 KAPGCSWIEIAGYIHLFVSSDRSHPQSLDLYTMLGLLLN 598
           K PG SWI+  G +H F + D S P    +Y  L  L N
Sbjct: 616 KEPGQSWIKWKGKLHSFSADDESSPYLDQIYAKLEELNN 645

BLAST of Spg023840 vs. TAIR 10
Match: AT4G39530.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 346.3 bits (887), Expect = 5.1e-95
Identity = 203/582 (34.88%), Postives = 319/582 (54.81%), Query Frame = 0

Query: 23  LKASAAVDSIPRDTQLHSLVIKSGLANELSVQNKLLKIYFKCRNLECARNLFDEMPMRNV 82
           L A + +  +    Q+H+ +++ GL  + S+ N L+  Y KC  +  A  LF+ MP +N+
Sbjct: 256 LSACSILPFLEGGKQIHAHILRYGLEMDASLMNVLIDSYVKCGRVIAAHKLFNGMPNKNI 315

Query: 83  VSWNTVICGLVDCGYGGEFRVRQKSSVLLFKKMLMDMVDPDGVTFNALFRSCVVLNDVES 142
           +SW T++ G        +     K ++ LF  M    + PD    +++  SC  L+ +  
Sbjct: 316 ISWTTLLSGY-------KQNALHKEAMELFTSMSKFGLKPDMYACSSILTSCASLHALGF 375

Query: 143 GSQLHGFVMKIGFDLDCFVGSAVVDFYAKCGLYEDARLAFSCVLYRDLVLWNVMLYCYVF 202
           G+Q+H + +K     D +V ++++D YAKC    DAR  F      D+VL+N M+  Y  
Sbjct: 376 GTQVHAYTIKANLGNDSYVTNSLIDMYAKCDCLTDARKVFDIFAAADVVLFNAMIEGY-- 435

Query: 203 NCLG-----REAIEVFCLMQLEGCKGDDFTFSSLLSSCNYKGSGELGKQLHALLIKQSLD 262
           + LG      EA+ +F  M+    +    TF SLL +     S  L KQ+H L+ K  L+
Sbjct: 436 SRLGTQWELHEALNIFRDMRFRLIRPSLLTFVSLLRASASLTSLGLSKQIHGLMFKYGLN 495

Query: 263 LDIVVASSLVNMYSKSNNLYDARKVFGEMPNKNSVSWTTMIVGYGQQ-DGKEAVKLFRSM 322
           LDI   S+L+++YS    L D+R VF EM  K+ V W +M  GY QQ + +EA+ LF  +
Sbjct: 496 LDIFAGSALIDVYSNCYCLKDSRLVFDEMKVKDLVIWNSMFAGYVQQSENEEALNLFLEL 555

Query: 323 FREDYRPDELTFASVLSSCGFTSGASELMQVHSCLIKFGFEAFLSIKNGLVNAYSKCGII 382
                RPDE TFA+++++ G  +      + H  L+K G E    I N L++ Y+KCG  
Sbjct: 556 QLSRERPDEFTFANMVTAAGNLASVQLGQEFHCQLLKRGLECNPYITNALLDMYAKCGSP 615

Query: 383 SAASRCFRLIAKPDLVTWTSIICGLAFCGFEKDAVEFFEKMLSYGIRPDRIAFLGVLSAC 442
             A + F   A  D+V W S+I   A  G  K A++  EKM+S GI P+ I F+GVLSAC
Sbjct: 616 EDAHKAFDSAASRDVVCWNSVISSYANHGEGKKALQMLEKMMSEGIEPNYITFVGVLSAC 675

Query: 443 SHGGLVNTGLHYFNLMTNQYQIVPDSEHLTCLIDLISRAGSLDEAFYLLKSMPKEAGPDA 502
           SH GLV  GL  F LM  ++ I P++EH  C++ L+ RAG L++A  L++ MP +     
Sbjct: 676 SHAGLVEDGLKQFELML-RFGIEPETEHYVCMVSLLGRAGRLNKARELIEKMPTKPAAIV 735

Query: 503 LGAFIRACRTHGNLRLAKWAMELA--SEMNKPVNYSLMSNIFAFEGRWSDVARMRKLMKD 562
             + +  C   GN+ LA+ A E+A  S+     +++++SNI+A +G W++  ++R+ MK 
Sbjct: 736 WRSLLSGCAKAGNVELAEHAAEMAILSDPKDSGSFTMLSNIYASKGMWTEAKKVRERMKV 795

Query: 563 SCERKAPGCSWIEIAGYIHLFVSSDRSHPQSLDLYTMLGLLL 597
               K PG SWI I   +H+F+S D+SH ++  +Y +L  LL
Sbjct: 796 EGVVKEPGRSWIGINKEVHIFLSKDKSHCKANQIYEVLDDLL 827

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038874466.13.5e-30385.50pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 ... [more]
KAG6575187.11.6e-29581.82Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a... [more]
XP_022958961.12.0e-29581.98pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 ... [more]
KAG7013750.11.7e-29481.98Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a... [more]
XP_023006538.13.8e-29481.98pentatricopeptide repeat-containing protein At2g46050, mitochondrial [Cucurbita ... [more]
Match NameE-valueIdentityDescription
O823631.4e-12645.65Pentatricopeptide repeat-containing protein At2g46050, mitochondrial OS=Arabidop... [more]
Q9SIT72.6e-10432.96Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX... [more]
P0C8983.2e-9433.75Putative pentatricopeptide repeat-containing protein At3g15130 OS=Arabidopsis th... [more]
Q9CAA84.2e-9435.41Putative pentatricopeptide repeat-containing protein At1g68930 OS=Arabidopsis th... [more]
Q9SVA57.2e-9434.88Pentatricopeptide repeat-containing protein At4g39530 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A0A0K8635.4e-30283.28Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G005130 PE=4 SV=1[more]
A0A6J1H3L29.8e-29681.98pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 ... [more]
A0A6J1L5721.9e-29481.98pentatricopeptide repeat-containing protein At2g46050, mitochondrial OS=Cucurbit... [more]
A0A5D3BXR62.1e-29381.71Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3C6T72.1e-29381.71pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 ... [more]
Match NameE-valueIdentityDescription
AT2G46050.11.0e-12745.65Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT2G13600.11.9e-10532.96Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G15130.12.3e-9533.75Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G68930.13.0e-9535.41pentatricopeptide (PPR) repeat-containing protein [more]
AT4G39530.15.1e-9534.88Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (cylindrica) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 191..224
e-value: 2.5E-4
score: 19.0
coord: 392..425
e-value: 9.2E-5
score: 20.4
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 189..235
e-value: 5.4E-8
score: 32.9
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 466..490
e-value: 0.17
score: 12.2
coord: 264..290
e-value: 0.0071
score: 16.5
coord: 292..319
e-value: 0.28
score: 11.5
coord: 56..81
e-value: 0.13
score: 12.6
coord: 392..422
e-value: 2.8E-5
score: 24.1
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 259..293
score: 8.714292
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 390..424
score: 11.345003
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 241..342
e-value: 4.3E-19
score: 70.6
coord: 343..444
e-value: 1.3E-15
score: 59.2
coord: 12..140
e-value: 1.5E-15
score: 59.0
coord: 141..240
e-value: 4.0E-15
score: 57.6
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 445..570
e-value: 2.2E-10
score: 42.5
NoneNo IPR availablePANTHERPTHR24015:SF398OS07G0259400 PROTEINcoord: 7..587
NoneNo IPR availablePANTHERPTHR24015OS07G0578800 PROTEIN-RELATEDcoord: 7..587

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Spg023840.1Spg023840.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding