HG10018570 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10018570
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationChr04: 5377366 .. 5379819 (-)
RNA-Seq ExpressionHG10018570
SyntenyHG10018570
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTTCCTTTTCGTGCTGTACAACTACTTTTCGGGTCCTCCAATTCCCTCCACAAGCGCCGATTTCTTCTGCCGGCTTCCTTCCTCTTTTCATCGATTTCATGGCGCGAAGCGGAATCTGTTTTAAAACCCAGAAACTCTGAGTTTCTTGAGAAGTCACATGTTTTCAATAATCTTAGTTTTATCAGATCTTATTGTTCTGGAAAGAACAGTGGCAATGGGGCTAGTGAGTGGACTGAGGATATTGAGTATCTAGATGAGTCGGGGAGTGTTATTTTCTCTGGTAAAGGCGTTCGGTTGGTTGAACCAGGTCTTGATGGTCATGTAATGGTGGGCGGACTCAAAAAGCCCTTTCTGAATGCATCGGCTGTTGCTAAGATAGTTGAGGTTGTAACGAGGTGGAAATGGGGTCCAGAGTTGGAATCTCAGCTCGAAAAGCTCCAATTTGTTCCAAATATGACGCACATCACTCAGTCATTGAAGATTATCGATGATGCTGAGGCTTCTTTGAGCTTGTTTCGTTGGGCTAAGAGGCAGTCCTGGTATTCACCAAATGATGAGTGCTATGGGTTGTTGTTTGATGGGTTAAATCAGAGAAGAGATTTTGATGCAATTCAATTGTTGTTTGATGAGGTTGTTCGTGATTTGAGCAGTGATGGGACTGTCTCATTCAGTGCATATAATCGTGTGATTCAGTACTTGGCTAAAGCTGAGAAATTGGAAGTGTCTTTCTGTTGTTTTAAGAAGATTCATGATTCAGGTTTCAAGGTTGATACTCAAACATACAATTCTCTTATAACCTTGTTCTTAAACAAGGGTCTGCCTTACAAGGCTTTCGAGATATACGAGAGCATGGCAGGAGCAGGGTGTTCTTTAGATGCATCTACCTTTGAGCTGATGATACCAAGTTTGGCAAAATCGGGTCGTCTTGATGCAGCAATGAAGCTCTTTCAAGAGATGAAAGAGAGGAATTATCGTCCCGCCCTGAATGTTTATACATCCCTTGTGGATTCTATGGGGAAAGCTGGGAGGCTCGACACATCGATGAAGATTTACATGGAAATGCAGCTGCTTGAGCTCAGACCGCCTGCTTCAATGTTTGTTTCCTTAATTGAGTCACATGTGAAGGCTGGGAAATTGGATTCTGCTCTCAAGCTTTGGGATGAGATGAAAAGGGCAGGTTTTAGGCCTAACTTTGGTTTGTACTCTATGGTTGTTGAGTCACATGCCAAATCAGGGAAACTTGATGTTGCAATGTCTATCTTCACCGAAATGGAGAAAGCTGGATTTCTTCCCATCCCATCTACTTATTGCTGTCTCTTGGAAATGCACGCAGCGTCGGGACAAGTAGATGCTGCCATGAAACTCTACAACTCTATGACTAATGCAGGTTTGAGGCTCGGGTTAAGTACGTACACTGCTTTATTGACACTTCTGGCTAATAAGAAGCTTATCGATATTGCTGCAAAAGTTTTACTTGAAATGAAGGCCATGGGATTCTCTGTTGATGTGAGCGCTAGCGATGTCTTGATGGTGTATATCAAGGAAGGCTCTATCGATTCCGCTTTGAGGTGGCTTCAGTTCATGGGTTCATCTGGAATAAGAACTAATAGCTTTATTCTCAGGCAATTGTTTGAGTCATGCATGAAGAAAGGGATGTATGAGTCAGCTATGCCTCTCTTAGAAACTTATGTAGATTCTGCTGCTAAAGTTGATCTTATACTCTACACATCCATCCTGGCCCATCTTGTAAGGTGTCAAGATGAGCAGAAGGAAAGATATTTGATGTCCATCCTCAGTGCTACAAGACATAAGGCACACTCTTTTTTGTGTGGACTGTTCACTGGAACAGAACAAAGGAAACAACCAGTTTTGTCTTTTGTAAGGGAGTTTTTTCAGGGCATCGACTATGAGCTGGAAGAGAGCAGTGCAAGATACTTTGTCAATGTCCTCCTCAATTATCTCATTCTCATGGGACAAATAAATCGAGCTCGTTGTATTTGGAAAGTTGCTTACGAGAATAAGCTCTTCCCAAAAGCCATTGTCTTTGATCAACACATTGCCTGGTCCCTCGACATTCGGAACTTGTCGGTTGGAGCTGCTCTTATAGCAGTTGTGCACACTCTCCATCGGTTCAGGAAGCGAATGTTGTATTATGGAATAGTTCCGAGGCGCATAAAATTGGTTACGGGACCTACTCTGAAGCTTGTGGTTGCTCAAATGTTGAGCTCTGTGGAATCCCCATTTGAGGTCAGTAAAGTAGTTCTGAGAGCAACAGGAGACTCTGTGATGGAGTGGTTCAAAAAACCAATCGTCCAACAATTCCTTCTGAATGAGATTCCATCAAGATCAGATATCCTAATGCACAAGTTGAATACTCTCTTTCCCAGTTCAGCACCTGAAATTAGATCTCTTTCACCTCCCAAACCCCTCATTTCCCGGAATTCAGCATAA

mRNA sequence

ATGCTTCCTTTTCGTGCTGTACAACTACTTTTCGGGTCCTCCAATTCCCTCCACAAGCGCCGATTTCTTCTGCCGGCTTCCTTCCTCTTTTCATCGATTTCATGGCGCGAAGCGGAATCTGTTTTAAAACCCAGAAACTCTGAGTTTCTTGAGAAGTCACATGTTTTCAATAATCTTAGTTTTATCAGATCTTATTGTTCTGGAAAGAACAGTGGCAATGGGGCTAGTGAGTGGACTGAGGATATTGAGTATCTAGATGAGTCGGGGAGTGTTATTTTCTCTGGTAAAGGCGTTCGGTTGGTTGAACCAGGTCTTGATGGTCATGTAATGGTGGGCGGACTCAAAAAGCCCTTTCTGAATGCATCGGCTGTTGCTAAGATAGTTGAGGTTGTAACGAGGTGGAAATGGGGTCCAGAGTTGGAATCTCAGCTCGAAAAGCTCCAATTTGTTCCAAATATGACGCACATCACTCAGTCATTGAAGATTATCGATGATGCTGAGGCTTCTTTGAGCTTGTTTCGTTGGGCTAAGAGGCAGTCCTGGTATTCACCAAATGATGAGTGCTATGGGTTGTTGTTTGATGGGTTAAATCAGAGAAGAGATTTTGATGCAATTCAATTGTTGTTTGATGAGGTTGTTCGTGATTTGAGCAGTGATGGGACTGTCTCATTCAGTGCATATAATCGTGTGATTCAGTACTTGGCTAAAGCTGAGAAATTGGAAGTGTCTTTCTGTTGTTTTAAGAAGATTCATGATTCAGGTTTCAAGGTTGATACTCAAACATACAATTCTCTTATAACCTTGTTCTTAAACAAGGGTCTGCCTTACAAGGCTTTCGAGATATACGAGAGCATGGCAGGAGCAGGGTGTTCTTTAGATGCATCTACCTTTGAGCTGATGATACCAAGTTTGGCAAAATCGGGTCGTCTTGATGCAGCAATGAAGCTCTTTCAAGAGATGAAAGAGAGGAATTATCGTCCCGCCCTGAATGTTTATACATCCCTTGTGGATTCTATGGGGAAAGCTGGGAGGCTCGACACATCGATGAAGATTTACATGGAAATGCAGCTGCTTGAGCTCAGACCGCCTGCTTCAATGTTTGTTTCCTTAATTGAGTCACATGTGAAGGCTGGGAAATTGGATTCTGCTCTCAAGCTTTGGGATGAGATGAAAAGGGCAGGTTTTAGGCCTAACTTTGGTTTGTACTCTATGGTTGTTGAGTCACATGCCAAATCAGGGAAACTTGATGTTGCAATGTCTATCTTCACCGAAATGGAGAAAGCTGGATTTCTTCCCATCCCATCTACTTATTGCTGTCTCTTGGAAATGCACGCAGCGTCGGGACAAGTAGATGCTGCCATGAAACTCTACAACTCTATGACTAATGCAGGTTTGAGGCTCGGGTTAAGTACGTACACTGCTTTATTGACACTTCTGGCTAATAAGAAGCTTATCGATATTGCTGCAAAAGTTTTACTTGAAATGAAGGCCATGGGATTCTCTGTTGATGTGAGCGCTAGCGATGTCTTGATGGTGTATATCAAGGAAGGCTCTATCGATTCCGCTTTGAGGTGGCTTCAGTTCATGGGTTCATCTGGAATAAGAACTAATAGCTTTATTCTCAGGCAATTGTTTGAGTCATGCATGAAGAAAGGGATGTATGAGTCAGCTATGCCTCTCTTAGAAACTTATGTAGATTCTGCTGCTAAAGTTGATCTTATACTCTACACATCCATCCTGGCCCATCTTGTAAGGTGTCAAGATGAGCAGAAGGAAAGATATTTGATGTCCATCCTCAGTGCTACAAGACATAAGGCACACTCTTTTTTGTGTGGACTGTTCACTGGAACAGAACAAAGGAAACAACCAGTTTTGTCTTTTGTAAGGGAGTTTTTTCAGGGCATCGACTATGAGCTGGAAGAGAGCAGTGCAAGATACTTTGTCAATGTCCTCCTCAATTATCTCATTCTCATGGGACAAATAAATCGAGCTCGTTGTATTTGGAAAGTTGCTTACGAGAATAAGCTCTTCCCAAAAGCCATTGTCTTTGATCAACACATTGCCTGGTCCCTCGACATTCGGAACTTGTCGGTTGGAGCTGCTCTTATAGCAGTTGTGCACACTCTCCATCGGTTCAGGAAGCGAATGTTGTATTATGGAATAGTTCCGAGGCGCATAAAATTGGTTACGGGACCTACTCTGAAGCTTGTGGTTGCTCAAATGTTGAGCTCTGTGGAATCCCCATTTGAGGTCAGTAAAGTAGTTCTGAGAGCAACAGGAGACTCTGTGATGGAGTGGTTCAAAAAACCAATCGTCCAACAATTCCTTCTGAATGAGATTCCATCAAGATCAGATATCCTAATGCACAAGTTGAATACTCTCTTTCCCAGTTCAGCACCTGAAATTAGATCTCTTTCACCTCCCAAACCCCTCATTTCCCGGAATTCAGCATAA

Coding sequence (CDS)

ATGCTTCCTTTTCGTGCTGTACAACTACTTTTCGGGTCCTCCAATTCCCTCCACAAGCGCCGATTTCTTCTGCCGGCTTCCTTCCTCTTTTCATCGATTTCATGGCGCGAAGCGGAATCTGTTTTAAAACCCAGAAACTCTGAGTTTCTTGAGAAGTCACATGTTTTCAATAATCTTAGTTTTATCAGATCTTATTGTTCTGGAAAGAACAGTGGCAATGGGGCTAGTGAGTGGACTGAGGATATTGAGTATCTAGATGAGTCGGGGAGTGTTATTTTCTCTGGTAAAGGCGTTCGGTTGGTTGAACCAGGTCTTGATGGTCATGTAATGGTGGGCGGACTCAAAAAGCCCTTTCTGAATGCATCGGCTGTTGCTAAGATAGTTGAGGTTGTAACGAGGTGGAAATGGGGTCCAGAGTTGGAATCTCAGCTCGAAAAGCTCCAATTTGTTCCAAATATGACGCACATCACTCAGTCATTGAAGATTATCGATGATGCTGAGGCTTCTTTGAGCTTGTTTCGTTGGGCTAAGAGGCAGTCCTGGTATTCACCAAATGATGAGTGCTATGGGTTGTTGTTTGATGGGTTAAATCAGAGAAGAGATTTTGATGCAATTCAATTGTTGTTTGATGAGGTTGTTCGTGATTTGAGCAGTGATGGGACTGTCTCATTCAGTGCATATAATCGTGTGATTCAGTACTTGGCTAAAGCTGAGAAATTGGAAGTGTCTTTCTGTTGTTTTAAGAAGATTCATGATTCAGGTTTCAAGGTTGATACTCAAACATACAATTCTCTTATAACCTTGTTCTTAAACAAGGGTCTGCCTTACAAGGCTTTCGAGATATACGAGAGCATGGCAGGAGCAGGGTGTTCTTTAGATGCATCTACCTTTGAGCTGATGATACCAAGTTTGGCAAAATCGGGTCGTCTTGATGCAGCAATGAAGCTCTTTCAAGAGATGAAAGAGAGGAATTATCGTCCCGCCCTGAATGTTTATACATCCCTTGTGGATTCTATGGGGAAAGCTGGGAGGCTCGACACATCGATGAAGATTTACATGGAAATGCAGCTGCTTGAGCTCAGACCGCCTGCTTCAATGTTTGTTTCCTTAATTGAGTCACATGTGAAGGCTGGGAAATTGGATTCTGCTCTCAAGCTTTGGGATGAGATGAAAAGGGCAGGTTTTAGGCCTAACTTTGGTTTGTACTCTATGGTTGTTGAGTCACATGCCAAATCAGGGAAACTTGATGTTGCAATGTCTATCTTCACCGAAATGGAGAAAGCTGGATTTCTTCCCATCCCATCTACTTATTGCTGTCTCTTGGAAATGCACGCAGCGTCGGGACAAGTAGATGCTGCCATGAAACTCTACAACTCTATGACTAATGCAGGTTTGAGGCTCGGGTTAAGTACGTACACTGCTTTATTGACACTTCTGGCTAATAAGAAGCTTATCGATATTGCTGCAAAAGTTTTACTTGAAATGAAGGCCATGGGATTCTCTGTTGATGTGAGCGCTAGCGATGTCTTGATGGTGTATATCAAGGAAGGCTCTATCGATTCCGCTTTGAGGTGGCTTCAGTTCATGGGTTCATCTGGAATAAGAACTAATAGCTTTATTCTCAGGCAATTGTTTGAGTCATGCATGAAGAAAGGGATGTATGAGTCAGCTATGCCTCTCTTAGAAACTTATGTAGATTCTGCTGCTAAAGTTGATCTTATACTCTACACATCCATCCTGGCCCATCTTGTAAGGTGTCAAGATGAGCAGAAGGAAAGATATTTGATGTCCATCCTCAGTGCTACAAGACATAAGGCACACTCTTTTTTGTGTGGACTGTTCACTGGAACAGAACAAAGGAAACAACCAGTTTTGTCTTTTGTAAGGGAGTTTTTTCAGGGCATCGACTATGAGCTGGAAGAGAGCAGTGCAAGATACTTTGTCAATGTCCTCCTCAATTATCTCATTCTCATGGGACAAATAAATCGAGCTCGTTGTATTTGGAAAGTTGCTTACGAGAATAAGCTCTTCCCAAAAGCCATTGTCTTTGATCAACACATTGCCTGGTCCCTCGACATTCGGAACTTGTCGGTTGGAGCTGCTCTTATAGCAGTTGTGCACACTCTCCATCGGTTCAGGAAGCGAATGTTGTATTATGGAATAGTTCCGAGGCGCATAAAATTGGTTACGGGACCTACTCTGAAGCTTGTGGTTGCTCAAATGTTGAGCTCTGTGGAATCCCCATTTGAGGTCAGTAAAGTAGTTCTGAGAGCAACAGGAGACTCTGTGATGGAGTGGTTCAAAAAACCAATCGTCCAACAATTCCTTCTGAATGAGATTCCATCAAGATCAGATATCCTAATGCACAAGTTGAATACTCTCTTTCCCAGTTCAGCACCTGAAATTAGATCTCTTTCACCTCCCAAACCCCTCATTTCCCGGAATTCAGCATAA

Protein sequence

MLPFRAVQLLFGSSNSLHKRRFLLPASFLFSSISWREAESVLKPRNSEFLEKSHVFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFSGKGVRLVEPGLDGHVMVGGLKKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDASTFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAGFRPNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERYLMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIRNLSVGAALIAVVHTLHRFRKRMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSPPKPLISRNSA
Homology
BLAST of HG10018570 vs. NCBI nr
Match: XP_038884643.1 (pentatricopeptide repeat-containing protein At1g79490, mitochondrial [Benincasa hispida])

HSP 1 Score: 1517.7 bits (3928), Expect = 0.0e+00
Identity = 781/823 (94.90%), Postives = 794/823 (96.48%), Query Frame = 0

Query: 1   MLPFRAVQLLFGSSNSLHKRRFLLPASFLF------SSISWREAESVLKPRNSEFLEKSH 60
           M PFRAVQLLFGSSNSLHKRRFLL ASFLF      SSIS RE + VLK RNSEFLE   
Sbjct: 1   MRPFRAVQLLFGSSNSLHKRRFLLSASFLFETRWFNSSISCRETDFVLKHRNSEFLENPC 60

Query: 61  VFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFSGKGVRLVEPGLDGHVMVGGL 120
           VFNN SF RSYCSGK SGNG SEWTEDIEYLDESGSVIFSGKGVR VEPG+D HVMVGGL
Sbjct: 61  VFNNRSFTRSYCSGKESGNGCSEWTEDIEYLDESGSVIFSGKGVRSVEPGVDDHVMVGGL 120

Query: 121 KKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFR 180
           KKPFLNASAVAKIVEVV RW+WGPELESQLEKLQFVPNMTHITQ+LKII D EASL+LFR
Sbjct: 121 KKPFLNASAVAKIVEVVRRWRWGPELESQLEKLQFVPNMTHITQALKIIGDVEASLTLFR 180

Query: 181 WAKRQSWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYL 240
           WAKRQSWYS NDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYL
Sbjct: 181 WAKRQSWYSLNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYL 240

Query: 241 AKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA 300
           AKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA
Sbjct: 241 AKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA 300

Query: 301 STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYME 360
           STFELMIPSLAKSGRLDAAMKLFQEMKERNYRP LNVYTSLVDSMGKAGRLDTSMKIYME
Sbjct: 301 STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPPLNVYTSLVDSMGKAGRLDTSMKIYME 360

Query: 361 MQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAGFRPNFGLYSMVVESHAKSGK 420
           MQLLELRP AS+F SLIESHVKAGKLD+ALKLWDEMKRAGFRPNFGLYSMVVESHAKSGK
Sbjct: 361 MQLLELRPSASVFASLIESHVKAGKLDTALKLWDEMKRAGFRPNFGLYSMVVESHAKSGK 420

Query: 421 LDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTA 480
           LDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASG VDAAMKLYNSMTNAGLRLGLSTYTA
Sbjct: 421 LDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGHVDAAMKLYNSMTNAGLRLGLSTYTA 480

Query: 481 LLTLLANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGI 540
           LLTLLANKKLIDIAAKVLLEMKA+GFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGI
Sbjct: 481 LLTLLANKKLIDIAAKVLLEMKAIGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGI 540

Query: 541 RTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERY 600
           RTNSFILRQLFESCMKKGMYESAMPLLETYV+SAAKVDLILYTSILAHLVRCQ+EQKERY
Sbjct: 541 RTNSFILRQLFESCMKKGMYESAMPLLETYVNSAAKVDLILYTSILAHLVRCQEEQKERY 600

Query: 601 LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNY 660
           LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNY
Sbjct: 601 LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNY 660

Query: 661 LILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIRNLSVGAALIAVVHTLHRFRK 720
           LILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLD+RNLSVGAALIAV+HTLHRFRK
Sbjct: 661 LILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDVRNLSVGAALIAVMHTLHRFRK 720

Query: 721 RMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQ 780
           RMLYYGIVPRRIKLVTGPTLKLV+AQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQ
Sbjct: 721 RMLYYGIVPRRIKLVTGPTLKLVIAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQ 780

Query: 781 FLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSPPKPLISRNSA 818
           FLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSPPKPLISRNSA
Sbjct: 781 FLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSPPKPLISRNSA 823

BLAST of HG10018570 vs. NCBI nr
Match: XP_008441211.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g79490, mitochondrial [Cucumis melo] >KAA0037017.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK06633.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1509.2 bits (3906), Expect = 0.0e+00
Identity = 768/823 (93.32%), Postives = 790/823 (95.99%), Query Frame = 0

Query: 1   MLPFRAVQLLFGSSNSLHKRRFLLPASFLF------SSISWREAESVLKPRNSEFLEKSH 60
           MLPFRAVQLL GSSN LHKRR LL  SFLF      SS  WREA S+LKPRNS+FLE  H
Sbjct: 1   MLPFRAVQLLIGSSNPLHKRRILLSGSFLFQTRWFNSSFPWREAVSILKPRNSQFLENPH 60

Query: 61  VFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFSGKGVRLVEPGLDGHVMVGGL 120
           VFNN SF R YCSGK  GNG  EWTEDIEYLDESGSVIFSGKGVR VEPG+D HVMVGGL
Sbjct: 61  VFNNRSFTRPYCSGKEIGNGGREWTEDIEYLDESGSVIFSGKGVRSVEPGVDDHVMVGGL 120

Query: 121 KKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFR 180
           KKPFLNASAVAKIVEVV RWKWGPELESQLEKLQFVPNMTHITQ+LKIIDDAEASLSLFR
Sbjct: 121 KKPFLNASAVAKIVEVVRRWKWGPELESQLEKLQFVPNMTHITQALKIIDDAEASLSLFR 180

Query: 181 WAKRQSWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYL 240
           WAKRQSWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYL
Sbjct: 181 WAKRQSWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYL 240

Query: 241 AKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA 300
           AKAEKLEVSFCCFKKIHDSGF+VDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA
Sbjct: 241 AKAEKLEVSFCCFKKIHDSGFEVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA 300

Query: 301 STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYME 360
           STFELMIP LAKSGRLDAAMKLFQEMKE+NYRPA N+Y+SLVDSMGKAGRLDTSMKIYME
Sbjct: 301 STFELMIPCLAKSGRLDAAMKLFQEMKEKNYRPAQNIYSSLVDSMGKAGRLDTSMKIYME 360

Query: 361 MQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAGFRPNFGLYSMVVESHAKSGK 420
           MQLLELRP A MFVSLIESHVKAGKLD+ALKLWD+MK+AGF+PNFGLYSMVVESHAKSGK
Sbjct: 361 MQLLELRPSALMFVSLIESHVKAGKLDTALKLWDDMKKAGFKPNFGLYSMVVESHAKSGK 420

Query: 421 LDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTA 480
           LDVAMSIFTEMEKAGFLPIP TYCCLLEMHA+SG VDAAMKLYNSMTNAGLRLGLSTYTA
Sbjct: 421 LDVAMSIFTEMEKAGFLPIPPTYCCLLEMHASSGHVDAAMKLYNSMTNAGLRLGLSTYTA 480

Query: 481 LLTLLANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGI 540
           LLTLLANKKLIDIAAKVLLEMKAMGFSV VSASDVLMVYIKEGS+DSALRWLQFMGSSGI
Sbjct: 481 LLTLLANKKLIDIAAKVLLEMKAMGFSVSVSASDVLMVYIKEGSVDSALRWLQFMGSSGI 540

Query: 541 RTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERY 600
           RTNSFI+RQLFESCMKKGMYESAMPLLETYV+SAAKVDLILYTSILAHLVRCQ+EQKERY
Sbjct: 541 RTNSFIIRQLFESCMKKGMYESAMPLLETYVNSAAKVDLILYTSILAHLVRCQEEQKERY 600

Query: 601 LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNY 660
           LMSILSAT+H+AHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSA+YFVNVLLNY
Sbjct: 601 LMSILSATKHRAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSAKYFVNVLLNY 660

Query: 661 LILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIRNLSVGAALIAVVHTLHRFRK 720
           LILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLD+RNLSVGAALIAVVHTLHRFRK
Sbjct: 661 LILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDVRNLSVGAALIAVVHTLHRFRK 720

Query: 721 RMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQ 780
           RMLYYGIVPRRIKLVTGPTLKLV+AQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQ
Sbjct: 721 RMLYYGIVPRRIKLVTGPTLKLVIAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQ 780

Query: 781 FLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSPPKPLISRNSA 818
           FLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSPPKPLISRNSA
Sbjct: 781 FLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSPPKPLISRNSA 823

BLAST of HG10018570 vs. NCBI nr
Match: XP_004138818.1 (pentatricopeptide repeat-containing protein At1g79490, mitochondrial [Cucumis sativus] >KGN63095.1 hypothetical protein Csa_022257 [Cucumis sativus])

HSP 1 Score: 1503.4 bits (3891), Expect = 0.0e+00
Identity = 768/823 (93.32%), Postives = 787/823 (95.63%), Query Frame = 0

Query: 1   MLPFRAVQLLFGSSNSLHKRRFLLPASFLF------SSISWREAESVLKPRNSEFLEKSH 60
           MLPFRAVQLL GSSN LHKRR LL  SFLF      SS  WREA+SVL+PRNSEFLE  H
Sbjct: 1   MLPFRAVQLLLGSSNPLHKRRILLSGSFLFQTRWFDSSFPWREADSVLRPRNSEFLENPH 60

Query: 61  VFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFSGKGVRLVEPGLDGHVMVGGL 120
           VFNN SF RSYCSGK SGNG  EWTEDIEYLDESGSVIFSGKGVR VEPG+D HVMVGGL
Sbjct: 61  VFNNRSFTRSYCSGKESGNGGREWTEDIEYLDESGSVIFSGKGVRSVEPGVDDHVMVGGL 120

Query: 121 KKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFR 180
           KKPFLNASAVAKIVEVV RWKWGPELESQLEKLQFVPNMTHITQ LKIIDDAEASLSLFR
Sbjct: 121 KKPFLNASAVAKIVEVVRRWKWGPELESQLEKLQFVPNMTHITQVLKIIDDAEASLSLFR 180

Query: 181 WAKRQSWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYL 240
           WAKRQSWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSD TVSFSAYNRVIQYL
Sbjct: 181 WAKRQSWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDETVSFSAYNRVIQYL 240

Query: 241 AKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA 300
           AKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGA CSLDA
Sbjct: 241 AKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAECSLDA 300

Query: 301 STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYME 360
           STFELMIP LAKSGRLDAAMKLFQEMKE+ YRPA NVY+SLVDSMGKAGRLDTSMKIYME
Sbjct: 301 STFELMIPCLAKSGRLDAAMKLFQEMKEKKYRPAQNVYSSLVDSMGKAGRLDTSMKIYME 360

Query: 361 MQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAGFRPNFGLYSMVVESHAKSGK 420
           MQLLELRP A MFVSLIESHVKAGKLD+ALKLWD+MKRAGF+PNFGLYSMVVESHAKSGK
Sbjct: 361 MQLLELRPSALMFVSLIESHVKAGKLDTALKLWDDMKRAGFKPNFGLYSMVVESHAKSGK 420

Query: 421 LDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTA 480
           LDVAMS+FTEMEKAGFLPIPSTYCCLLEM AASG VDAAMKLYNSMTNAGLRLGL+TYT+
Sbjct: 421 LDVAMSVFTEMEKAGFLPIPSTYCCLLEMQAASGHVDAAMKLYNSMTNAGLRLGLNTYTS 480

Query: 481 LLTLLANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGI 540
           LLTLLANKKLIDIAAKVLLEMKAMGFSV VSASDVLMVYIKEGS+DSALRWLQFMGSSGI
Sbjct: 481 LLTLLANKKLIDIAAKVLLEMKAMGFSVSVSASDVLMVYIKEGSVDSALRWLQFMGSSGI 540

Query: 541 RTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERY 600
           RTNSFI+RQLFESCMKKGMYESAMPLLETYV+SAAKVDLILYTSILAHLVRCQ+EQKERY
Sbjct: 541 RTNSFIIRQLFESCMKKGMYESAMPLLETYVNSAAKVDLILYTSILAHLVRCQEEQKERY 600

Query: 601 LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNY 660
           LMSILS T+HKAHSFLCGLFTGTEQRKQPVLSFVREFFQ IDYELEESSA+YFVNVLLNY
Sbjct: 601 LMSILSTTKHKAHSFLCGLFTGTEQRKQPVLSFVREFFQSIDYELEESSAKYFVNVLLNY 660

Query: 661 LILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIRNLSVGAALIAVVHTLHRFRK 720
           LILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLD+RNLSVGAALIAVVHTLHRFRK
Sbjct: 661 LILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDVRNLSVGAALIAVVHTLHRFRK 720

Query: 721 RMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQ 780
           RMLYYGIVPRRIKLVTGPTLKLV+AQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQ
Sbjct: 721 RMLYYGIVPRRIKLVTGPTLKLVIAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQ 780

Query: 781 FLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSPPKPLISRNSA 818
           FLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSPPKPLISRNSA
Sbjct: 781 FLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSPPKPLISRNSA 823

BLAST of HG10018570 vs. NCBI nr
Match: KAG6602718.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1465.3 bits (3792), Expect = 0.0e+00
Identity = 755/823 (91.74%), Postives = 780/823 (94.78%), Query Frame = 0

Query: 1   MLPFRAVQLLFGSSNSLHKRRFLLPASFLF------SSISWREAESVLKPRNSEFLEKSH 60
           M  FRAVQLL G S SL KRRF+LP SFLF      + IS RE  SVL  RNSE  EK++
Sbjct: 1   MPSFRAVQLLLG-SYSLRKRRFILPTSFLFQGRWFKAPISCRETPSVLNSRNSELSEKAY 60

Query: 61  VFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFSGKGVRLVEPGLDGHVMVGGL 120
           VF+N SFIRSY S KNSGNG+SEWTE+IEYLDESGSVIFSGKGVR VEPGLD HVMVGGL
Sbjct: 61  VFHNRSFIRSYSSEKNSGNGSSEWTENIEYLDESGSVIFSGKGVRSVEPGLDDHVMVGGL 120

Query: 121 KKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFR 180
           KKPFLNASAVAKIVE+V RWKWGPELESQLEKLQFVPNMTHITQ+LK+I+DAEASLSLFR
Sbjct: 121 KKPFLNASAVAKIVEIVWRWKWGPELESQLEKLQFVPNMTHITQALKVINDAEASLSLFR 180

Query: 181 WAKRQSWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYL 240
           WAKRQSWYSPNDECYGLLFDGLNQ RDFDAIQLLFDE+VRDLSSDGTVSFSAYNRVIQYL
Sbjct: 181 WAKRQSWYSPNDECYGLLFDGLNQSRDFDAIQLLFDEIVRDLSSDGTVSFSAYNRVIQYL 240

Query: 241 AKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA 300
           AKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESM GAGCSLDA
Sbjct: 241 AKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMEGAGCSLDA 300

Query: 301 STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYME 360
           STFELMIPSLAKSGRLDAAMKLFQEMKERN+RP LNV+T+LVDSMGKAGRLDTSMKIYM+
Sbjct: 301 STFELMIPSLAKSGRLDAAMKLFQEMKERNFRPGLNVFTTLVDSMGKAGRLDTSMKIYMQ 360

Query: 361 MQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAGFRPNFGLYSMVVESHAKSGK 420
           MQLLELRPPASMFVSL+ESHVKAGKLD+ALKLWDEMKRAGFRPNFGLYS+VVESHAKSGK
Sbjct: 361 MQLLELRPPASMFVSLVESHVKAGKLDTALKLWDEMKRAGFRPNFGLYSIVVESHAKSGK 420

Query: 421 LDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTA 480
           LDVAMSIFTEMEKAGFLP PSTYCCLLEMHAAS QVD AMKLYNSMTNAGLRLGLSTYTA
Sbjct: 421 LDVAMSIFTEMEKAGFLPTPSTYCCLLEMHAASRQVDPAMKLYNSMTNAGLRLGLSTYTA 480

Query: 481 LLTLLANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGI 540
           LLTLLANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEG  D+ALRWLQFMGSSGI
Sbjct: 481 LLTLLANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGHTDAALRWLQFMGSSGI 540

Query: 541 RTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERY 600
           RTN+FILRQLFESCMKKGMYESA PLLE+YVDSAAKVDLILYTSILAHLVRCQ+E  ERY
Sbjct: 541 RTNNFILRQLFESCMKKGMYESAKPLLESYVDSAAKVDLILYTSILAHLVRCQEEHNERY 600

Query: 601 LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNY 660
           LMSILSATRHKAHSFL GLFTG EQRKQPVLSFVREFFQ IDYELEESSARYFVNVLLNY
Sbjct: 601 LMSILSATRHKAHSFLSGLFTGPEQRKQPVLSFVREFFQSIDYELEESSARYFVNVLLNY 660

Query: 661 LILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIRNLSVGAALIAVVHTLHRFRK 720
           LILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLD+RNLSVGAALIAVVHTLHRFRK
Sbjct: 661 LILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDVRNLSVGAALIAVVHTLHRFRK 720

Query: 721 RMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQ 780
           RMLYYGIVPRRIKLVTGPTLKLV+AQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQ
Sbjct: 721 RMLYYGIVPRRIKLVTGPTLKLVIAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQ 780

Query: 781 FLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSPPKPLISRNSA 818
           FLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSP KPLI RNSA
Sbjct: 781 FLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSPIKPLIPRNSA 822

BLAST of HG10018570 vs. NCBI nr
Match: KAG7033406.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1463.4 bits (3787), Expect = 0.0e+00
Identity = 754/823 (91.62%), Postives = 780/823 (94.78%), Query Frame = 0

Query: 1   MLPFRAVQLLFGSSNSLHKRRFLLPASFLF------SSISWREAESVLKPRNSEFLEKSH 60
           M  FRAVQLL G S SL KRRF+LP SFLF      + IS RE  SVL  RNSE  EK++
Sbjct: 1   MPSFRAVQLLLG-SYSLRKRRFILPTSFLFQGRWFKAPISCRETPSVLNSRNSELSEKAY 60

Query: 61  VFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFSGKGVRLVEPGLDGHVMVGGL 120
           VF+N SFIRSY S K+SGNG+SEWTE+IEYLDESGSVIFSGKGVR VEPGLD HVMVGGL
Sbjct: 61  VFHNRSFIRSYSSEKSSGNGSSEWTENIEYLDESGSVIFSGKGVRSVEPGLDDHVMVGGL 120

Query: 121 KKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFR 180
           KKPFLNASAVAKIVE+V RWKWGPELESQLEKLQFVPNMTHITQ+LK+I+DAEASLSLFR
Sbjct: 121 KKPFLNASAVAKIVEIVWRWKWGPELESQLEKLQFVPNMTHITQALKVINDAEASLSLFR 180

Query: 181 WAKRQSWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYL 240
           WAKRQSWYSPNDECYGLLFDGLNQ RDFDAIQLLFDE+VRDLSSDGTVSFSAYNRVIQYL
Sbjct: 181 WAKRQSWYSPNDECYGLLFDGLNQSRDFDAIQLLFDEIVRDLSSDGTVSFSAYNRVIQYL 240

Query: 241 AKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA 300
           AKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESM GAGCSLDA
Sbjct: 241 AKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMEGAGCSLDA 300

Query: 301 STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYME 360
           STFELMIPSLAKSGRLDAAMKLFQEMKERN+RP LNV+T+LVDSMGKAGRLDTSMKIYM+
Sbjct: 301 STFELMIPSLAKSGRLDAAMKLFQEMKERNFRPGLNVFTTLVDSMGKAGRLDTSMKIYMQ 360

Query: 361 MQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAGFRPNFGLYSMVVESHAKSGK 420
           MQLLELRPPASMFVSL+ESHVKAGKLD+ALKLWDEMKRAGFRPNFGLYS+VVESHAKSGK
Sbjct: 361 MQLLELRPPASMFVSLVESHVKAGKLDTALKLWDEMKRAGFRPNFGLYSIVVESHAKSGK 420

Query: 421 LDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTA 480
           LDVAMSIFTEMEKAGFLP PSTYCCLLEMHAAS QVD AMKLYNSMTNAGLRLGLSTYTA
Sbjct: 421 LDVAMSIFTEMEKAGFLPTPSTYCCLLEMHAASRQVDPAMKLYNSMTNAGLRLGLSTYTA 480

Query: 481 LLTLLANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGI 540
           LLTLLANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEG  D+ALRWLQFMGSSGI
Sbjct: 481 LLTLLANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGHTDAALRWLQFMGSSGI 540

Query: 541 RTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERY 600
           RTN+FILRQLFESCMKKGMYESA PLLE+YVDSAAKVDLILYTSILAHLVRCQ+E  ERY
Sbjct: 541 RTNNFILRQLFESCMKKGMYESAKPLLESYVDSAAKVDLILYTSILAHLVRCQEEHNERY 600

Query: 601 LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNY 660
           LMSILSATRHKAHSFL GLFTG EQRKQPVLSFVREFFQ IDYELEESSARYFVNVLLNY
Sbjct: 601 LMSILSATRHKAHSFLSGLFTGPEQRKQPVLSFVREFFQSIDYELEESSARYFVNVLLNY 660

Query: 661 LILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIRNLSVGAALIAVVHTLHRFRK 720
           LILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLD+RNLSVGAALIAVVHTLHRFRK
Sbjct: 661 LILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDVRNLSVGAALIAVVHTLHRFRK 720

Query: 721 RMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQ 780
           RMLYYGIVPRRIKLVTGPTLKLV+AQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQ
Sbjct: 721 RMLYYGIVPRRIKLVTGPTLKLVIAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQ 780

Query: 781 FLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSPPKPLISRNSA 818
           FLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSP KPLI RNSA
Sbjct: 781 FLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSPIKPLIPRNSA 822

BLAST of HG10018570 vs. ExPASy Swiss-Prot
Match: Q9SAK0 (Pentatricopeptide repeat-containing protein At1g79490, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=EMB2217 PE=2 SV=1)

HSP 1 Score: 1219.9 bits (3155), Expect = 0.0e+00
Identity = 595/754 (78.91%), Postives = 686/754 (90.98%), Query Frame = 0

Query: 60  SFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFSGKGVRLVEPGLDGHVMVGGLKKPFL 119
           S +R +CS K   + +S WTE++EYLDESGSV+ SGKG+R VEPGLD HVMVGGLKKP++
Sbjct: 79  SIVRRFCSEKIGSSESSGWTEEVEYLDESGSVLHSGKGIRSVEPGLDDHVMVGGLKKPYM 138

Query: 120 NASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQ 179
           NAS+VAKIVEVV RWKWGPELE+QL+KLQFVPNM HITQSLKI+ + +A+LSLFRWAK+Q
Sbjct: 139 NASSVAKIVEVVQRWKWGPELETQLDKLQFVPNMVHITQSLKIVKEVDAALSLFRWAKKQ 198

Query: 180 SWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEK 239
            WY P+DECY +LFDGLNQ RDF  IQ LF+E+V+D SS G +SF+AYN+VIQYLAKAEK
Sbjct: 199 PWYLPSDECYVVLFDGLNQGRDFVGIQSLFEEMVQDSSSHGDLSFNAYNQVIQYLAKAEK 258

Query: 240 LEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDASTFEL 299
           LEV+FCCFKK  +SG K+DTQTYN+L+ LFLNKGLPYKAFEIYESM      LD ST+EL
Sbjct: 259 LEVAFCCFKKAQESGCKIDTQTYNNLMMLFLNKGLPYKAFEIYESMEKTDSLLDGSTYEL 318

Query: 300 MIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLE 359
           +IPSLAKSGRLDAA KLFQ+MKER  RP+ +V++SLVDSMGKAGRLDTSMK+YMEMQ   
Sbjct: 319 IIPSLAKSGRLDAAFKLFQQMKERKLRPSFSVFSSLVDSMGKAGRLDTSMKVYMEMQGFG 378

Query: 360 LRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAGFRPNFGLYSMVVESHAKSGKLDVAM 419
            RP A+MFVSLI+S+ KAGKLD+AL+LWDEMK++GFRPNFGLY+M++ESHAKSGKL+VAM
Sbjct: 379 HRPSATMFVSLIDSYAKAGKLDTALRLWDEMKKSGFRPNFGLYTMIIESHAKSGKLEVAM 438

Query: 420 SIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLL 479
           ++F +MEKAGFLP PSTY CLLEMHA SGQVD+AMK+YNSMTNAGLR GLS+Y +LLTLL
Sbjct: 439 TVFKDMEKAGFLPTPSTYSCLLEMHAGSGQVDSAMKIYNSMTNAGLRPGLSSYISLLTLL 498

Query: 480 ANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSF 539
           ANK+L+D+A K+LLEMKAMG+SVDV ASDVLM+YIK+ S+D AL+WL+FMGSSGI+TN+F
Sbjct: 499 ANKRLVDVAGKILLEMKAMGYSVDVCASDVLMIYIKDASVDLALKWLRFMGSSGIKTNNF 558

Query: 540 ILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERYLMSIL 599
           I+RQLFESCMK G+Y+SA PLLET V SA KVDL+LYTSILAHLVRCQDE KER LMSIL
Sbjct: 559 IIRQLFESCMKNGLYDSARPLLETLVHSAGKVDLVLYTSILAHLVRCQDEDKERQLMSIL 618

Query: 600 SATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMG 659
           SAT+HKAH+F+CGLFTG EQRKQPVL+FVREF+QGIDYELEE +ARYFVNVLLNYL+LMG
Sbjct: 619 SATKHKAHAFMCGLFTGPEQRKQPVLTFVREFYQGIDYELEEGAARYFVNVLLNYLVLMG 678

Query: 660 QINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIRNLSVGAALIAVVHTLHRFRKRMLYY 719
           QINRARC+WKVAYENKLFPKAIVFDQHIAWSLD+RNLSVGAALIAVVHTLHRFRKRMLYY
Sbjct: 679 QINRARCVWKVAYENKLFPKAIVFDQHIAWSLDVRNLSVGAALIAVVHTLHRFRKRMLYY 738

Query: 720 GIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNE 779
           G+VPRRIKLVTGPTLK+V+AQMLSSVESPFEVSKVVLRA G+ VMEWFKKPIVQQFLLNE
Sbjct: 739 GVVPRRIKLVTGPTLKIVIAQMLSSVESPFEVSKVVLRAPGELVMEWFKKPIVQQFLLNE 798

Query: 780 IPSRSDILMHKLNTLFPSSAPEIRSLSPPKPLIS 814
           IPSRSDILMHK+N +FPSSAPE+RS+SPPKPL+S
Sbjct: 799 IPSRSDILMHKMNVMFPSSAPELRSMSPPKPLMS 832

BLAST of HG10018570 vs. ExPASy Swiss-Prot
Match: Q8GYP6 (Pentatricopeptide repeat-containing protein At1g18900 OS=Arabidopsis thaliana OX=3702 GN=At1g18900 PE=2 SV=1)

HSP 1 Score: 203.8 bits (517), Expect = 7.7e-51
Identity = 166/636 (26.10%), Postives = 269/636 (42.30%), Query Frame = 0

Query: 124 VAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYS 183
           V  +  V+ R++WGP  E  L+ L    +     Q LK ++D   +L  F W KRQ  + 
Sbjct: 302 VENVSSVLRRFRWGPAAEEALQNLGLRIDAYQANQVLKQMNDYGNALGFFYWLKRQPGFK 361

Query: 184 PNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVS 243
            +   Y  +   L + + F AI  L DE+VRD     TV+   YNR+I    +A  L  +
Sbjct: 362 HDGHTYTTMVGNLGRAKQFGAINKLLDEMVRDGCQPNTVT---YNRLIHSYGRANYLNEA 421

Query: 244 FCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDASTFELMIPS 303
              F ++ ++G K D  TY +LI +    G    A ++Y+ M   G S D  T+ ++I  
Sbjct: 422 MNVFNQMQEAGCKPDRVTYCTLIDIHAKAGFLDIAMDMYQRMQAGGLSPDTFTYSVIINC 481

Query: 304 LAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPP 363
           L K+G L AA KLF EM ++   P L  Y  ++D                          
Sbjct: 482 LGKAGHLPAAHKLFCEMVDQGCTPNLVTYNIMMD-------------------------- 541

Query: 364 ASMFVSLIESHVKAGKLDSALKLWDEMKRAGFRPNFGLYSMVVESHAKSGKLDVAMSIFT 423
                     H KA    +ALKL+ +M+ AGF P+   YS+V+E     G L+ A ++FT
Sbjct: 542 ---------LHAKARNYQNALKLYRDMQNAGFEPDKVTYSIVMEVLGHCGYLEEAEAVFT 601

Query: 424 EMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKK 483
           EM++  ++P    Y  L+++   +G V+ A + Y +M +AGLR  + T  +LL+      
Sbjct: 602 EMQQKNWIPDEPVYGLLVDLWGKAGNVEKAWQWYQAMLHAGLRPNVPTCNSLLS------ 661

Query: 484 LIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQ 543
                                        +++   I  A   LQ M + G+R +      
Sbjct: 662 ----------------------------TFLRVNKIAEAYELLQNMLALGLRPSLQTYTL 721

Query: 544 LFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERYLMSILSATR 603
           L   C                 D  +K+D+                    +   ++++T 
Sbjct: 722 LLSCC----------------TDGRSKLDM-------------------GFCGQLMASTG 781

Query: 604 HKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINR 663
           H AH FL  +        + V +    F   +  E +  S R  V+ ++++L   GQ   
Sbjct: 782 HPAHMFLLKM-PAAGPDGENVRNHANNFLDLMHSE-DRESKRGLVDAVVDFLHKSGQKEE 828

Query: 664 ARCIWKVAYENKLFPKAIVFDQHIAWSLDIRNLSVGAALIAVVHTLHRFRKRMLYYGIVP 723
           A  +W+VA +  +FP A+       W +++  +S G A+ A+  TL  FRK+ML  G  P
Sbjct: 842 AGSVWEVAAQKNVFPDALREKSCSYWLINLHVMSEGTAVTALSRTLAWFRKQMLASGTCP 828

Query: 724 RRIKLVTG----------PTLKLVVAQMLSSVESPF 750
            RI +VTG            ++  V ++L+   SPF
Sbjct: 902 SRIDIVTGWGRRSRVTGTSMVRQAVEELLNIFGSPF 828

BLAST of HG10018570 vs. ExPASy Swiss-Prot
Match: Q9SSF9 (Pentatricopeptide repeat-containing protein At1g74750 OS=Arabidopsis thaliana OX=3702 GN=At1g74750 PE=2 SV=1)

HSP 1 Score: 190.7 bits (483), Expect = 6.7e-47
Identity = 155/608 (25.49%), Postives = 249/608 (40.95%), Query Frame = 0

Query: 124 VAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYS 183
           V  +  ++ R+KWG   E  L    F  +     Q LK +D+   +L  F W KRQ  + 
Sbjct: 297 VENVSSILRRFKWGHAAEEALHNFGFRMDAYQANQVLKQMDNYANALGFFYWLKRQPGFK 356

Query: 184 PNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVS 243
            +   Y  +   L + + F  I  L DE+VRD                            
Sbjct: 357 HDGHTYTTMVGNLGRAKQFGEINKLLDEMVRD---------------------------- 416

Query: 244 FCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDASTFELMIPS 303
                     G K +T TYN LI  +       +A  ++  M  AGC  D  T+  +I  
Sbjct: 417 ----------GCKPNTVTYNRLIHSYGRANYLKEAMNVFNQMQEAGCEPDRVTYCTLIDI 476

Query: 304 LAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPP 363
            AK+G LD AM ++Q M+E    P    Y+ +++ +GKAG L  + +++ EM      P 
Sbjct: 477 HAKAGFLDIAMDMYQRMQEAGLSPDTFTYSVIINCLGKAGHLPAAHRLFCEMVGQGCTPN 536

Query: 364 ASMFVSLIESHVKAGKLDSALKLWDEMKRAGFRPNFGLYSMVVESHAKSGKLDVAMSIFT 423
              F  +I  H KA   ++ALKL+ +M+ AGF+P+   YS+V+E     G L+ A  +F 
Sbjct: 537 LVTFNIMIALHAKARNYETALKLYRDMQNAGFQPDKVTYSIVMEVLGHCGFLEEAEGVFA 596

Query: 424 EMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKK 483
           EM++  ++P    Y  L+++   +G VD A + Y +M  AGLR  + T  +LL+      
Sbjct: 597 EMQRKNWVPDEPVYGLLVDLWGKAGNVDKAWQWYQAMLQAGLRPNVPTCNSLLSTFLRVH 656

Query: 484 LIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQ 543
            +  A  +L  M A+G                                            
Sbjct: 657 RMSEAYNLLQSMLALGLH------------------------------------------ 716

Query: 544 LFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERYLMSILSATR 603
                          P L+T            YT +L+     +      +   +++ + 
Sbjct: 717 ---------------PSLQT------------YTLLLSCCTDARSNFDMGFCGQLMAVSG 776

Query: 604 HKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINR 663
           H AH FL  +        Q V   V  F   +  E +  S R  ++ ++++L   G    
Sbjct: 777 HPAHMFLLKM-PPAGPDGQKVRDHVSNFLDFMHSE-DRESKRGLMDAVVDFLHKSGLKEE 795

Query: 664 ARCIWKVAYENKLFPKAIVFDQHIAWSLDIRNLSVGAALIAVVHTLHRFRKRMLYYGIVP 723
           A  +W+VA    ++P A+    +  W +++  +S G A+IA+  TL  FRK+ML  G  P
Sbjct: 837 AGSVWEVAAGKNVYPDALREKSYSYWLINLHVMSEGTAVIALSRTLAWFRKQMLVSGDCP 795

Query: 724 RRIKLVTG 732
            RI +VTG
Sbjct: 897 SRIDIVTG 795

BLAST of HG10018570 vs. ExPASy Swiss-Prot
Match: P0C894 (Putative pentatricopeptide repeat-containing protein At2g02150 OS=Arabidopsis thaliana OX=3702 GN=At2g02150 PE=3 SV=1)

HSP 1 Score: 152.1 bits (383), Expect = 2.6e-35
Identity = 107/434 (24.65%), Postives = 200/434 (46.08%), Query Frame = 0

Query: 160 LKIIDDAEASLSLFRWAKRQSWYSPNDECYGLLFDGL-NQRRDFDAIQLLFDEVVRDLSS 219
           +++ +D + +   F+W+  ++ +  + E Y ++   L   R  +DA  +L + V+     
Sbjct: 116 VELKEDPKLAFKFFKWSMTRNGFKHSVESYCIVAHILFCARMYYDANSVLKEMVLSKADC 175

Query: 220 D-----------GTVSFSAYNRVIQYLAKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLIT 279
           D               F  ++ +   L     LE +  CF K+        T++ N L+ 
Sbjct: 176 DVFDVLWSTRNVCVPGFGVFDALFSVLIDLGMLEEAIQCFSKMKRFRVFPKTRSCNGLLH 235

Query: 280 LFLNKGLPYKAFEIYESMAGAGCSLDASTFELMIPSLAKSGRLDAAMKLFQEMKERNYRP 339
            F   G        ++ M GAG      T+ +MI  + K G ++AA  LF+EMK R   P
Sbjct: 236 RFAKLGKTDDVKRFFKDMIGAGARPTVFTYNIMIDCMCKEGDVEAARGLFEEMKFRGLVP 295

Query: 340 ALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPPASMFVSLIESHVKAGKLDSALKLW 399
               Y S++D  GK GRLD ++  + EM+ +   P    + +LI    K GKL   L+ +
Sbjct: 296 DTVTYNSMIDGFGKVGRLDDTVCFFEEMKDMCCEPDVITYNALINCFCKFGKLPIGLEFY 355

Query: 400 DEMKRAGFRPNFGLYSMVVESHAKSGKLDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAAS 459
            EMK  G +PN   YS +V++  K G +  A+  + +M + G +P   TY  L++ +   
Sbjct: 356 REMKGNGLKPNVVSYSTLVDAFCKEGMMQQAIKFYVDMRRVGLVPNEYTYTSLIDANCKI 415

Query: 460 GQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKKLIDIAAKVLLEMKAMGFSVDVSAS 519
           G +  A +L N M   G+   + TYTAL+  L + + +  A ++  +M   G   ++++ 
Sbjct: 416 GNLSDAFRLGNEMLQVGVEWNVVTYTALIDGLCDAERMKEAEELFGKMDTAGVIPNLASY 475

Query: 520 DVLM-VYIKEGSIDSALRWLQFMGSSGIRTNSFILRQLFESCMKKGMYESAMPLLETYVD 579
           + L+  ++K  ++D AL  L  +   GI+ +  +              E+A  ++    +
Sbjct: 476 NALIHGFVKAKNMDRALELLNELKGRGIKPDLLLYGTFIWGLCSLEKIEAAKVVMNEMKE 535

Query: 580 SAAKVDLILYTSIL 581
              K + ++YT+++
Sbjct: 536 CGIKANSLIYTTLM 549

BLAST of HG10018570 vs. ExPASy Swiss-Prot
Match: Q9LW84 (Pentatricopeptide repeat-containing protein At3g16010 OS=Arabidopsis thaliana OX=3702 GN=At3g16010 PE=2 SV=1)

HSP 1 Score: 151.0 bits (380), Expect = 5.9e-35
Identity = 119/528 (22.54%), Postives = 223/528 (42.23%), Query Frame = 0

Query: 126 KIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYSPN 185
           + + +V  +KWGP+ E  LE L+   +   +   L+I  +    +  F+WA ++  +  +
Sbjct: 66  RFIRIVKIFKWGPDAEKALEVLKLKVDHRLVRSILEIDVEINVKIQFFKWAGKRRNFQHD 125

Query: 186 DECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVSFC 245
              Y  L   L + R +  +     EVVR  ++  +VS +  + +++ L +A+ +  +  
Sbjct: 126 CSTYMTLIRCLEEARLYGEMYRTIQEVVR--NTYVSVSPAVLSELVKALGRAKMVSKALS 185

Query: 246 CFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAG-CSLDASTFELMIPSL 305
            F +      K  + TYNS+I + + +G   K  E+Y  M   G C  D  T+  +I S 
Sbjct: 186 VFYQAKGRKCKPTSSTYNSVILMLMQEGQHEKVHEVYTEMCNEGDCFPDTITYSALISSY 245

Query: 306 AKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPPA 365
            K GR D+A++LF EMK+   +P   +YT+L+    K G+++ ++ ++ EM+     P  
Sbjct: 246 EKLGRNDSAIRLFDEMKDNCMQPTEKIYTTLLGIYFKVGKVEKALDLFEEMKRAGCSPTV 305

Query: 366 SMFVSLIESHVKAGKLDSALKLWDEMKRAGFRPN-------------------------- 425
             +  LI+   KAG++D A   + +M R G  P+                          
Sbjct: 306 YTYTELIKGLGKAGRVDEAYGFYKDMLRDGLTPDVVFLNNLMNILGKVGRVEELTNVFSE 365

Query: 426 FGL---------------------------------------------YSMVVESHAKSG 485
            G+                                             YS++++ + K+ 
Sbjct: 366 MGMWRCTPTVVSYNTVIKALFESKAHVSEVSSWFDKMKADSVSPSEFTYSILIDGYCKTN 425

Query: 486 KLDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYT 545
           +++ A+ +  EM++ GF P P+ YC L+     + + +AA +L+  +      +    Y 
Sbjct: 426 RVEKALLLLEEMDEKGFPPCPAAYCSLINALGKAKRYEAANELFKELKENFGNVSSRVYA 485

Query: 546 ALLTLLANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMV-YIKEGSIDSALRWLQFMGSS 581
            ++        +  A  +  EMK  G   DV A + LM   +K G I+ A   L+ M  +
Sbjct: 486 VMIKHFGKCGKLSEAVDLFNEMKNQGSGPDVYAYNALMSGMVKAGMINEANSLLRKMEEN 545

BLAST of HG10018570 vs. ExPASy TrEMBL
Match: A0A5A7T633 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold453G001100 PE=3 SV=1)

HSP 1 Score: 1509.2 bits (3906), Expect = 0.0e+00
Identity = 768/823 (93.32%), Postives = 790/823 (95.99%), Query Frame = 0

Query: 1   MLPFRAVQLLFGSSNSLHKRRFLLPASFLF------SSISWREAESVLKPRNSEFLEKSH 60
           MLPFRAVQLL GSSN LHKRR LL  SFLF      SS  WREA S+LKPRNS+FLE  H
Sbjct: 1   MLPFRAVQLLIGSSNPLHKRRILLSGSFLFQTRWFNSSFPWREAVSILKPRNSQFLENPH 60

Query: 61  VFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFSGKGVRLVEPGLDGHVMVGGL 120
           VFNN SF R YCSGK  GNG  EWTEDIEYLDESGSVIFSGKGVR VEPG+D HVMVGGL
Sbjct: 61  VFNNRSFTRPYCSGKEIGNGGREWTEDIEYLDESGSVIFSGKGVRSVEPGVDDHVMVGGL 120

Query: 121 KKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFR 180
           KKPFLNASAVAKIVEVV RWKWGPELESQLEKLQFVPNMTHITQ+LKIIDDAEASLSLFR
Sbjct: 121 KKPFLNASAVAKIVEVVRRWKWGPELESQLEKLQFVPNMTHITQALKIIDDAEASLSLFR 180

Query: 181 WAKRQSWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYL 240
           WAKRQSWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYL
Sbjct: 181 WAKRQSWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYL 240

Query: 241 AKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA 300
           AKAEKLEVSFCCFKKIHDSGF+VDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA
Sbjct: 241 AKAEKLEVSFCCFKKIHDSGFEVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA 300

Query: 301 STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYME 360
           STFELMIP LAKSGRLDAAMKLFQEMKE+NYRPA N+Y+SLVDSMGKAGRLDTSMKIYME
Sbjct: 301 STFELMIPCLAKSGRLDAAMKLFQEMKEKNYRPAQNIYSSLVDSMGKAGRLDTSMKIYME 360

Query: 361 MQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAGFRPNFGLYSMVVESHAKSGK 420
           MQLLELRP A MFVSLIESHVKAGKLD+ALKLWD+MK+AGF+PNFGLYSMVVESHAKSGK
Sbjct: 361 MQLLELRPSALMFVSLIESHVKAGKLDTALKLWDDMKKAGFKPNFGLYSMVVESHAKSGK 420

Query: 421 LDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTA 480
           LDVAMSIFTEMEKAGFLPIP TYCCLLEMHA+SG VDAAMKLYNSMTNAGLRLGLSTYTA
Sbjct: 421 LDVAMSIFTEMEKAGFLPIPPTYCCLLEMHASSGHVDAAMKLYNSMTNAGLRLGLSTYTA 480

Query: 481 LLTLLANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGI 540
           LLTLLANKKLIDIAAKVLLEMKAMGFSV VSASDVLMVYIKEGS+DSALRWLQFMGSSGI
Sbjct: 481 LLTLLANKKLIDIAAKVLLEMKAMGFSVSVSASDVLMVYIKEGSVDSALRWLQFMGSSGI 540

Query: 541 RTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERY 600
           RTNSFI+RQLFESCMKKGMYESAMPLLETYV+SAAKVDLILYTSILAHLVRCQ+EQKERY
Sbjct: 541 RTNSFIIRQLFESCMKKGMYESAMPLLETYVNSAAKVDLILYTSILAHLVRCQEEQKERY 600

Query: 601 LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNY 660
           LMSILSAT+H+AHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSA+YFVNVLLNY
Sbjct: 601 LMSILSATKHRAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSAKYFVNVLLNY 660

Query: 661 LILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIRNLSVGAALIAVVHTLHRFRK 720
           LILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLD+RNLSVGAALIAVVHTLHRFRK
Sbjct: 661 LILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDVRNLSVGAALIAVVHTLHRFRK 720

Query: 721 RMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQ 780
           RMLYYGIVPRRIKLVTGPTLKLV+AQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQ
Sbjct: 721 RMLYYGIVPRRIKLVTGPTLKLVIAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQ 780

Query: 781 FLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSPPKPLISRNSA 818
           FLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSPPKPLISRNSA
Sbjct: 781 FLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSPPKPLISRNSA 823

BLAST of HG10018570 vs. ExPASy TrEMBL
Match: A0A1S3B2G1 (pentatricopeptide repeat-containing protein At1g79490, mitochondrial OS=Cucumis melo OX=3656 GN=LOC103485413 PE=3 SV=1)

HSP 1 Score: 1509.2 bits (3906), Expect = 0.0e+00
Identity = 768/823 (93.32%), Postives = 790/823 (95.99%), Query Frame = 0

Query: 1   MLPFRAVQLLFGSSNSLHKRRFLLPASFLF------SSISWREAESVLKPRNSEFLEKSH 60
           MLPFRAVQLL GSSN LHKRR LL  SFLF      SS  WREA S+LKPRNS+FLE  H
Sbjct: 1   MLPFRAVQLLIGSSNPLHKRRILLSGSFLFQTRWFNSSFPWREAVSILKPRNSQFLENPH 60

Query: 61  VFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFSGKGVRLVEPGLDGHVMVGGL 120
           VFNN SF R YCSGK  GNG  EWTEDIEYLDESGSVIFSGKGVR VEPG+D HVMVGGL
Sbjct: 61  VFNNRSFTRPYCSGKEIGNGGREWTEDIEYLDESGSVIFSGKGVRSVEPGVDDHVMVGGL 120

Query: 121 KKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFR 180
           KKPFLNASAVAKIVEVV RWKWGPELESQLEKLQFVPNMTHITQ+LKIIDDAEASLSLFR
Sbjct: 121 KKPFLNASAVAKIVEVVRRWKWGPELESQLEKLQFVPNMTHITQALKIIDDAEASLSLFR 180

Query: 181 WAKRQSWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYL 240
           WAKRQSWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYL
Sbjct: 181 WAKRQSWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYL 240

Query: 241 AKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA 300
           AKAEKLEVSFCCFKKIHDSGF+VDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA
Sbjct: 241 AKAEKLEVSFCCFKKIHDSGFEVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA 300

Query: 301 STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYME 360
           STFELMIP LAKSGRLDAAMKLFQEMKE+NYRPA N+Y+SLVDSMGKAGRLDTSMKIYME
Sbjct: 301 STFELMIPCLAKSGRLDAAMKLFQEMKEKNYRPAQNIYSSLVDSMGKAGRLDTSMKIYME 360

Query: 361 MQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAGFRPNFGLYSMVVESHAKSGK 420
           MQLLELRP A MFVSLIESHVKAGKLD+ALKLWD+MK+AGF+PNFGLYSMVVESHAKSGK
Sbjct: 361 MQLLELRPSALMFVSLIESHVKAGKLDTALKLWDDMKKAGFKPNFGLYSMVVESHAKSGK 420

Query: 421 LDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTA 480
           LDVAMSIFTEMEKAGFLPIP TYCCLLEMHA+SG VDAAMKLYNSMTNAGLRLGLSTYTA
Sbjct: 421 LDVAMSIFTEMEKAGFLPIPPTYCCLLEMHASSGHVDAAMKLYNSMTNAGLRLGLSTYTA 480

Query: 481 LLTLLANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGI 540
           LLTLLANKKLIDIAAKVLLEMKAMGFSV VSASDVLMVYIKEGS+DSALRWLQFMGSSGI
Sbjct: 481 LLTLLANKKLIDIAAKVLLEMKAMGFSVSVSASDVLMVYIKEGSVDSALRWLQFMGSSGI 540

Query: 541 RTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERY 600
           RTNSFI+RQLFESCMKKGMYESAMPLLETYV+SAAKVDLILYTSILAHLVRCQ+EQKERY
Sbjct: 541 RTNSFIIRQLFESCMKKGMYESAMPLLETYVNSAAKVDLILYTSILAHLVRCQEEQKERY 600

Query: 601 LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNY 660
           LMSILSAT+H+AHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSA+YFVNVLLNY
Sbjct: 601 LMSILSATKHRAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSAKYFVNVLLNY 660

Query: 661 LILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIRNLSVGAALIAVVHTLHRFRK 720
           LILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLD+RNLSVGAALIAVVHTLHRFRK
Sbjct: 661 LILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDVRNLSVGAALIAVVHTLHRFRK 720

Query: 721 RMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQ 780
           RMLYYGIVPRRIKLVTGPTLKLV+AQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQ
Sbjct: 721 RMLYYGIVPRRIKLVTGPTLKLVIAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQ 780

Query: 781 FLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSPPKPLISRNSA 818
           FLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSPPKPLISRNSA
Sbjct: 781 FLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSPPKPLISRNSA 823

BLAST of HG10018570 vs. ExPASy TrEMBL
Match: A0A0A0LMG2 (Smr domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G402030 PE=3 SV=1)

HSP 1 Score: 1503.4 bits (3891), Expect = 0.0e+00
Identity = 768/823 (93.32%), Postives = 787/823 (95.63%), Query Frame = 0

Query: 1   MLPFRAVQLLFGSSNSLHKRRFLLPASFLF------SSISWREAESVLKPRNSEFLEKSH 60
           MLPFRAVQLL GSSN LHKRR LL  SFLF      SS  WREA+SVL+PRNSEFLE  H
Sbjct: 1   MLPFRAVQLLLGSSNPLHKRRILLSGSFLFQTRWFDSSFPWREADSVLRPRNSEFLENPH 60

Query: 61  VFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFSGKGVRLVEPGLDGHVMVGGL 120
           VFNN SF RSYCSGK SGNG  EWTEDIEYLDESGSVIFSGKGVR VEPG+D HVMVGGL
Sbjct: 61  VFNNRSFTRSYCSGKESGNGGREWTEDIEYLDESGSVIFSGKGVRSVEPGVDDHVMVGGL 120

Query: 121 KKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFR 180
           KKPFLNASAVAKIVEVV RWKWGPELESQLEKLQFVPNMTHITQ LKIIDDAEASLSLFR
Sbjct: 121 KKPFLNASAVAKIVEVVRRWKWGPELESQLEKLQFVPNMTHITQVLKIIDDAEASLSLFR 180

Query: 181 WAKRQSWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYL 240
           WAKRQSWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSD TVSFSAYNRVIQYL
Sbjct: 181 WAKRQSWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDETVSFSAYNRVIQYL 240

Query: 241 AKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA 300
           AKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGA CSLDA
Sbjct: 241 AKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAECSLDA 300

Query: 301 STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYME 360
           STFELMIP LAKSGRLDAAMKLFQEMKE+ YRPA NVY+SLVDSMGKAGRLDTSMKIYME
Sbjct: 301 STFELMIPCLAKSGRLDAAMKLFQEMKEKKYRPAQNVYSSLVDSMGKAGRLDTSMKIYME 360

Query: 361 MQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAGFRPNFGLYSMVVESHAKSGK 420
           MQLLELRP A MFVSLIESHVKAGKLD+ALKLWD+MKRAGF+PNFGLYSMVVESHAKSGK
Sbjct: 361 MQLLELRPSALMFVSLIESHVKAGKLDTALKLWDDMKRAGFKPNFGLYSMVVESHAKSGK 420

Query: 421 LDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTA 480
           LDVAMS+FTEMEKAGFLPIPSTYCCLLEM AASG VDAAMKLYNSMTNAGLRLGL+TYT+
Sbjct: 421 LDVAMSVFTEMEKAGFLPIPSTYCCLLEMQAASGHVDAAMKLYNSMTNAGLRLGLNTYTS 480

Query: 481 LLTLLANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGI 540
           LLTLLANKKLIDIAAKVLLEMKAMGFSV VSASDVLMVYIKEGS+DSALRWLQFMGSSGI
Sbjct: 481 LLTLLANKKLIDIAAKVLLEMKAMGFSVSVSASDVLMVYIKEGSVDSALRWLQFMGSSGI 540

Query: 541 RTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERY 600
           RTNSFI+RQLFESCMKKGMYESAMPLLETYV+SAAKVDLILYTSILAHLVRCQ+EQKERY
Sbjct: 541 RTNSFIIRQLFESCMKKGMYESAMPLLETYVNSAAKVDLILYTSILAHLVRCQEEQKERY 600

Query: 601 LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNY 660
           LMSILS T+HKAHSFLCGLFTGTEQRKQPVLSFVREFFQ IDYELEESSA+YFVNVLLNY
Sbjct: 601 LMSILSTTKHKAHSFLCGLFTGTEQRKQPVLSFVREFFQSIDYELEESSAKYFVNVLLNY 660

Query: 661 LILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIRNLSVGAALIAVVHTLHRFRK 720
           LILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLD+RNLSVGAALIAVVHTLHRFRK
Sbjct: 661 LILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDVRNLSVGAALIAVVHTLHRFRK 720

Query: 721 RMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQ 780
           RMLYYGIVPRRIKLVTGPTLKLV+AQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQ
Sbjct: 721 RMLYYGIVPRRIKLVTGPTLKLVIAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQ 780

Query: 781 FLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSPPKPLISRNSA 818
           FLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSPPKPLISRNSA
Sbjct: 781 FLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSPPKPLISRNSA 823

BLAST of HG10018570 vs. ExPASy TrEMBL
Match: A0A6J1H9M8 (pentatricopeptide repeat-containing protein At1g79490, mitochondrial OS=Cucurbita moschata OX=3662 GN=LOC111461771 PE=3 SV=1)

HSP 1 Score: 1461.8 bits (3783), Expect = 0.0e+00
Identity = 752/823 (91.37%), Postives = 780/823 (94.78%), Query Frame = 0

Query: 1   MLPFRAVQLLFGSSNSLHKRRFLLPASFLF------SSISWREAESVLKPRNSEFLEKSH 60
           M  FRAVQLL G S SL KRRF+LP SFLF      + IS RE  SVL  RNSE  EK++
Sbjct: 1   MPSFRAVQLLLG-SYSLRKRRFILPTSFLFQGRWFKAPISCRETPSVLNSRNSELSEKAY 60

Query: 61  VFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFSGKGVRLVEPGLDGHVMVGGL 120
           VF+N SFIRSY S K+SGNG+SEWTE+IEYLDESGSVIFSGKGVR VEPGLD HVMVGGL
Sbjct: 61  VFHNRSFIRSYSSEKSSGNGSSEWTENIEYLDESGSVIFSGKGVRSVEPGLDDHVMVGGL 120

Query: 121 KKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFR 180
           KKPFLNASAVAKIVE+V RWKWGPELESQLEKLQFVPNMTHITQ+LK+I+DAEASLSLFR
Sbjct: 121 KKPFLNASAVAKIVEIVWRWKWGPELESQLEKLQFVPNMTHITQALKVINDAEASLSLFR 180

Query: 181 WAKRQSWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYL 240
           WAKRQSWYSPNDECYGLLFDGLN+ RDFDAIQLLFDE+VRDLSSDGTVSFSAYNRVIQYL
Sbjct: 181 WAKRQSWYSPNDECYGLLFDGLNRNRDFDAIQLLFDEIVRDLSSDGTVSFSAYNRVIQYL 240

Query: 241 AKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA 300
           AKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESM GAGCSLDA
Sbjct: 241 AKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMEGAGCSLDA 300

Query: 301 STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYME 360
           STFELMIPSLAKSGRLDAAMKLFQEMKERN+RP LNV+T+LVDSMGKAGRLDTSMKIYM+
Sbjct: 301 STFELMIPSLAKSGRLDAAMKLFQEMKERNFRPGLNVFTTLVDSMGKAGRLDTSMKIYMQ 360

Query: 361 MQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAGFRPNFGLYSMVVESHAKSGK 420
           MQLLELRPPASMFVSL+ESHVKAGKLD+ALKLWDEMKRAGFRPNFGLYS+VVESHAKSGK
Sbjct: 361 MQLLELRPPASMFVSLVESHVKAGKLDTALKLWDEMKRAGFRPNFGLYSIVVESHAKSGK 420

Query: 421 LDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTA 480
           LDVAMSIFTEMEKAGFLP PSTYCCLLEMHAAS QVD AMKLYNSMTNAGLRLGLSTYTA
Sbjct: 421 LDVAMSIFTEMEKAGFLPTPSTYCCLLEMHAASRQVDPAMKLYNSMTNAGLRLGLSTYTA 480

Query: 481 LLTLLANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGI 540
           LLTLLANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEG  D+ALRWLQFMGSSGI
Sbjct: 481 LLTLLANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGHTDAALRWLQFMGSSGI 540

Query: 541 RTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERY 600
           RTN+FILRQLFESCMKKGMYESA PLLE+YVDSAAKVDLILYTSILAHLVRCQ+E  ERY
Sbjct: 541 RTNNFILRQLFESCMKKGMYESAKPLLESYVDSAAKVDLILYTSILAHLVRCQEEHNERY 600

Query: 601 LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNY 660
           LMSILSATRHKAHSFL GLFTG EQRKQPVLSFVREFFQ IDYELEESSARYFVNVLLNY
Sbjct: 601 LMSILSATRHKAHSFLSGLFTGPEQRKQPVLSFVREFFQSIDYELEESSARYFVNVLLNY 660

Query: 661 LILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIRNLSVGAALIAVVHTLHRFRK 720
           LILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLD+RNLSVGAALIAVVHTLHRFRK
Sbjct: 661 LILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDVRNLSVGAALIAVVHTLHRFRK 720

Query: 721 RMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQ 780
           RMLYYGIVPRRIKLVTGPTLKLV+AQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQ
Sbjct: 721 RMLYYGIVPRRIKLVTGPTLKLVIAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQ 780

Query: 781 FLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSPPKPLISRNSA 818
           FLLNEIPSRSD+LMHKLNTLFPSSAPEIRSLSP KPLI RNSA
Sbjct: 781 FLLNEIPSRSDVLMHKLNTLFPSSAPEIRSLSPIKPLIPRNSA 822

BLAST of HG10018570 vs. ExPASy TrEMBL
Match: A0A6J1JM38 (pentatricopeptide repeat-containing protein At1g79490, mitochondrial OS=Cucurbita maxima OX=3661 GN=LOC111487119 PE=3 SV=1)

HSP 1 Score: 1459.9 bits (3778), Expect = 0.0e+00
Identity = 753/823 (91.49%), Postives = 780/823 (94.78%), Query Frame = 0

Query: 1   MLPFRAVQLLFGSSNSLHKRRFLLPASFLF------SSISWREAESVLKPRNSEFLEKSH 60
           M  FRAVQLL G S SL KRRF+LP S LF      S IS REA SVL  RNSE  EK++
Sbjct: 1   MPSFRAVQLLLG-SYSLRKRRFILPTSLLFQGRWFKSHISCREAPSVLNSRNSELSEKAY 60

Query: 61  VFNNLSFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFSGKGVRLVEPGLDGHVMVGGL 120
           VF+N SFIRSY S K+SGNG+SEWTE+IEYLDESGSVIFSGKGVR VEPGLD HVMVGGL
Sbjct: 61  VFHNRSFIRSYSSEKSSGNGSSEWTENIEYLDESGSVIFSGKGVRSVEPGLDDHVMVGGL 120

Query: 121 KKPFLNASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFR 180
           KKPFLNASAVAKIVE+V RWKWGPELESQLEKLQFVPNMTHITQ+LKII+DAE+SLSLFR
Sbjct: 121 KKPFLNASAVAKIVEIVWRWKWGPELESQLEKLQFVPNMTHITQALKIINDAESSLSLFR 180

Query: 181 WAKRQSWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYL 240
           WAKRQSWYS NDECYGLLFDGLNQ+RDFDAIQLLFDE+VRDLS+DGTVSFSAYNRVIQYL
Sbjct: 181 WAKRQSWYSANDECYGLLFDGLNQKRDFDAIQLLFDEIVRDLSNDGTVSFSAYNRVIQYL 240

Query: 241 AKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDA 300
           AKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESM GAGCSLDA
Sbjct: 241 AKAEKLEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMGGAGCSLDA 300

Query: 301 STFELMIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYME 360
           STFELMIPSLAKSGRLDAAMKLFQEMKERN+RP LNV+T+LVDSMGKAGRLDTSMKIYME
Sbjct: 301 STFELMIPSLAKSGRLDAAMKLFQEMKERNFRPGLNVFTTLVDSMGKAGRLDTSMKIYME 360

Query: 361 MQLLELRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAGFRPNFGLYSMVVESHAKSGK 420
           MQLLELRPPASMFVSL+ESHVKAGKLD+ALKLWDEMKRAGFRPNFGLYS+VVESHAKSGK
Sbjct: 361 MQLLELRPPASMFVSLVESHVKAGKLDTALKLWDEMKRAGFRPNFGLYSIVVESHAKSGK 420

Query: 421 LDVAMSIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTA 480
           L+VAMSIFTEMEKAGFLP PSTYCCLLEMHAAS QVD AMKLYNSMTNAGLRLGLSTYTA
Sbjct: 421 LEVAMSIFTEMEKAGFLPTPSTYCCLLEMHAASRQVDPAMKLYNSMTNAGLRLGLSTYTA 480

Query: 481 LLTLLANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGI 540
           LLTLLANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEG  D+ALRWLQFMGSSGI
Sbjct: 481 LLTLLANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGHTDAALRWLQFMGSSGI 540

Query: 541 RTNSFILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERY 600
           RTN+FILRQLFESCMKKGMYESA PLLE+YVDSAAKVDLILYTSILAHLVRCQ+E  ERY
Sbjct: 541 RTNNFILRQLFESCMKKGMYESAKPLLESYVDSAAKVDLILYTSILAHLVRCQEEHNERY 600

Query: 601 LMSILSATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNY 660
           LMSILSATRHKAHSFL GLFTG EQRKQPVLSFVREFFQ IDYELEESSARYFVNVLLNY
Sbjct: 601 LMSILSATRHKAHSFLSGLFTGPEQRKQPVLSFVREFFQSIDYELEESSARYFVNVLLNY 660

Query: 661 LILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIRNLSVGAALIAVVHTLHRFRK 720
           LILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLD+RNLSVGAALIAVVHTLHRFRK
Sbjct: 661 LILMGQINRARCIWKVAYENKLFPKAIVFDQHIAWSLDVRNLSVGAALIAVVHTLHRFRK 720

Query: 721 RMLYYGIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQ 780
           RMLYYGIVPRRIKLVTGPTLKLV+AQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQ
Sbjct: 721 RMLYYGIVPRRIKLVTGPTLKLVIAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQ 780

Query: 781 FLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSPPKPLISRNSA 818
           FLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSP KPLI RNSA
Sbjct: 781 FLLNEIPSRSDILMHKLNTLFPSSAPEIRSLSPIKPLIPRNSA 822

BLAST of HG10018570 vs. TAIR 10
Match: AT1G79490.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 1219.9 bits (3155), Expect = 0.0e+00
Identity = 595/754 (78.91%), Postives = 686/754 (90.98%), Query Frame = 0

Query: 60  SFIRSYCSGKNSGNGASEWTEDIEYLDESGSVIFSGKGVRLVEPGLDGHVMVGGLKKPFL 119
           S +R +CS K   + +S WTE++EYLDESGSV+ SGKG+R VEPGLD HVMVGGLKKP++
Sbjct: 79  SIVRRFCSEKIGSSESSGWTEEVEYLDESGSVLHSGKGIRSVEPGLDDHVMVGGLKKPYM 138

Query: 120 NASAVAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQ 179
           NAS+VAKIVEVV RWKWGPELE+QL+KLQFVPNM HITQSLKI+ + +A+LSLFRWAK+Q
Sbjct: 139 NASSVAKIVEVVQRWKWGPELETQLDKLQFVPNMVHITQSLKIVKEVDAALSLFRWAKKQ 198

Query: 180 SWYSPNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEK 239
            WY P+DECY +LFDGLNQ RDF  IQ LF+E+V+D SS G +SF+AYN+VIQYLAKAEK
Sbjct: 199 PWYLPSDECYVVLFDGLNQGRDFVGIQSLFEEMVQDSSSHGDLSFNAYNQVIQYLAKAEK 258

Query: 240 LEVSFCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDASTFEL 299
           LEV+FCCFKK  +SG K+DTQTYN+L+ LFLNKGLPYKAFEIYESM      LD ST+EL
Sbjct: 259 LEVAFCCFKKAQESGCKIDTQTYNNLMMLFLNKGLPYKAFEIYESMEKTDSLLDGSTYEL 318

Query: 300 MIPSLAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLE 359
           +IPSLAKSGRLDAA KLFQ+MKER  RP+ +V++SLVDSMGKAGRLDTSMK+YMEMQ   
Sbjct: 319 IIPSLAKSGRLDAAFKLFQQMKERKLRPSFSVFSSLVDSMGKAGRLDTSMKVYMEMQGFG 378

Query: 360 LRPPASMFVSLIESHVKAGKLDSALKLWDEMKRAGFRPNFGLYSMVVESHAKSGKLDVAM 419
            RP A+MFVSLI+S+ KAGKLD+AL+LWDEMK++GFRPNFGLY+M++ESHAKSGKL+VAM
Sbjct: 379 HRPSATMFVSLIDSYAKAGKLDTALRLWDEMKKSGFRPNFGLYTMIIESHAKSGKLEVAM 438

Query: 420 SIFTEMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLL 479
           ++F +MEKAGFLP PSTY CLLEMHA SGQVD+AMK+YNSMTNAGLR GLS+Y +LLTLL
Sbjct: 439 TVFKDMEKAGFLPTPSTYSCLLEMHAGSGQVDSAMKIYNSMTNAGLRPGLSSYISLLTLL 498

Query: 480 ANKKLIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSF 539
           ANK+L+D+A K+LLEMKAMG+SVDV ASDVLM+YIK+ S+D AL+WL+FMGSSGI+TN+F
Sbjct: 499 ANKRLVDVAGKILLEMKAMGYSVDVCASDVLMIYIKDASVDLALKWLRFMGSSGIKTNNF 558

Query: 540 ILRQLFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERYLMSIL 599
           I+RQLFESCMK G+Y+SA PLLET V SA KVDL+LYTSILAHLVRCQDE KER LMSIL
Sbjct: 559 IIRQLFESCMKNGLYDSARPLLETLVHSAGKVDLVLYTSILAHLVRCQDEDKERQLMSIL 618

Query: 600 SATRHKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMG 659
           SAT+HKAH+F+CGLFTG EQRKQPVL+FVREF+QGIDYELEE +ARYFVNVLLNYL+LMG
Sbjct: 619 SATKHKAHAFMCGLFTGPEQRKQPVLTFVREFYQGIDYELEEGAARYFVNVLLNYLVLMG 678

Query: 660 QINRARCIWKVAYENKLFPKAIVFDQHIAWSLDIRNLSVGAALIAVVHTLHRFRKRMLYY 719
           QINRARC+WKVAYENKLFPKAIVFDQHIAWSLD+RNLSVGAALIAVVHTLHRFRKRMLYY
Sbjct: 679 QINRARCVWKVAYENKLFPKAIVFDQHIAWSLDVRNLSVGAALIAVVHTLHRFRKRMLYY 738

Query: 720 GIVPRRIKLVTGPTLKLVVAQMLSSVESPFEVSKVVLRATGDSVMEWFKKPIVQQFLLNE 779
           G+VPRRIKLVTGPTLK+V+AQMLSSVESPFEVSKVVLRA G+ VMEWFKKPIVQQFLLNE
Sbjct: 739 GVVPRRIKLVTGPTLKIVIAQMLSSVESPFEVSKVVLRAPGELVMEWFKKPIVQQFLLNE 798

Query: 780 IPSRSDILMHKLNTLFPSSAPEIRSLSPPKPLIS 814
           IPSRSDILMHK+N +FPSSAPE+RS+SPPKPL+S
Sbjct: 799 IPSRSDILMHKMNVMFPSSAPELRSMSPPKPLMS 832

BLAST of HG10018570 vs. TAIR 10
Match: AT1G18900.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 203.8 bits (517), Expect = 5.4e-52
Identity = 166/636 (26.10%), Postives = 269/636 (42.30%), Query Frame = 0

Query: 124 VAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYS 183
           V  +  V+ R++WGP  E  L+ L    +     Q LK ++D   +L  F W KRQ  + 
Sbjct: 302 VENVSSVLRRFRWGPAAEEALQNLGLRIDAYQANQVLKQMNDYGNALGFFYWLKRQPGFK 361

Query: 184 PNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVS 243
            +   Y  +   L + + F AI  L DE+VRD     TV+   YNR+I    +A  L  +
Sbjct: 362 HDGHTYTTMVGNLGRAKQFGAINKLLDEMVRDGCQPNTVT---YNRLIHSYGRANYLNEA 421

Query: 244 FCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDASTFELMIPS 303
              F ++ ++G K D  TY +LI +    G    A ++Y+ M   G S D  T+ ++I  
Sbjct: 422 MNVFNQMQEAGCKPDRVTYCTLIDIHAKAGFLDIAMDMYQRMQAGGLSPDTFTYSVIINC 481

Query: 304 LAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPP 363
           L K+G L AA KLF EM ++   P L  Y  ++D                          
Sbjct: 482 LGKAGHLPAAHKLFCEMVDQGCTPNLVTYNIMMD-------------------------- 541

Query: 364 ASMFVSLIESHVKAGKLDSALKLWDEMKRAGFRPNFGLYSMVVESHAKSGKLDVAMSIFT 423
                     H KA    +ALKL+ +M+ AGF P+   YS+V+E     G L+ A ++FT
Sbjct: 542 ---------LHAKARNYQNALKLYRDMQNAGFEPDKVTYSIVMEVLGHCGYLEEAEAVFT 601

Query: 424 EMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKK 483
           EM++  ++P    Y  L+++   +G V+ A + Y +M +AGLR  + T  +LL+      
Sbjct: 602 EMQQKNWIPDEPVYGLLVDLWGKAGNVEKAWQWYQAMLHAGLRPNVPTCNSLLS------ 661

Query: 484 LIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQ 543
                                        +++   I  A   LQ M + G+R +      
Sbjct: 662 ----------------------------TFLRVNKIAEAYELLQNMLALGLRPSLQTYTL 721

Query: 544 LFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERYLMSILSATR 603
           L   C                 D  +K+D+                    +   ++++T 
Sbjct: 722 LLSCC----------------TDGRSKLDM-------------------GFCGQLMASTG 781

Query: 604 HKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINR 663
           H AH FL  +        + V +    F   +  E +  S R  V+ ++++L   GQ   
Sbjct: 782 HPAHMFLLKM-PAAGPDGENVRNHANNFLDLMHSE-DRESKRGLVDAVVDFLHKSGQKEE 828

Query: 664 ARCIWKVAYENKLFPKAIVFDQHIAWSLDIRNLSVGAALIAVVHTLHRFRKRMLYYGIVP 723
           A  +W+VA +  +FP A+       W +++  +S G A+ A+  TL  FRK+ML  G  P
Sbjct: 842 AGSVWEVAAQKNVFPDALREKSCSYWLINLHVMSEGTAVTALSRTLAWFRKQMLASGTCP 828

Query: 724 RRIKLVTG----------PTLKLVVAQMLSSVESPF 750
            RI +VTG            ++  V ++L+   SPF
Sbjct: 902 SRIDIVTGWGRRSRVTGTSMVRQAVEELLNIFGSPF 828

BLAST of HG10018570 vs. TAIR 10
Match: AT1G18900.2 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 203.8 bits (517), Expect = 5.4e-52
Identity = 166/636 (26.10%), Postives = 269/636 (42.30%), Query Frame = 0

Query: 124 VAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYS 183
           V  +  V+ R++WGP  E  L+ L    +     Q LK ++D   +L  F W KRQ  + 
Sbjct: 302 VENVSSVLRRFRWGPAAEEALQNLGLRIDAYQANQVLKQMNDYGNALGFFYWLKRQPGFK 361

Query: 184 PNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVS 243
            +   Y  +   L + + F AI  L DE+VRD     TV+   YNR+I    +A  L  +
Sbjct: 362 HDGHTYTTMVGNLGRAKQFGAINKLLDEMVRDGCQPNTVT---YNRLIHSYGRANYLNEA 421

Query: 244 FCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDASTFELMIPS 303
              F ++ ++G K D  TY +LI +    G    A ++Y+ M   G S D  T+ ++I  
Sbjct: 422 MNVFNQMQEAGCKPDRVTYCTLIDIHAKAGFLDIAMDMYQRMQAGGLSPDTFTYSVIINC 481

Query: 304 LAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPP 363
           L K+G L AA KLF EM ++   P L  Y  ++D                          
Sbjct: 482 LGKAGHLPAAHKLFCEMVDQGCTPNLVTYNIMMD-------------------------- 541

Query: 364 ASMFVSLIESHVKAGKLDSALKLWDEMKRAGFRPNFGLYSMVVESHAKSGKLDVAMSIFT 423
                     H KA    +ALKL+ +M+ AGF P+   YS+V+E     G L+ A ++FT
Sbjct: 542 ---------LHAKARNYQNALKLYRDMQNAGFEPDKVTYSIVMEVLGHCGYLEEAEAVFT 601

Query: 424 EMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKK 483
           EM++  ++P    Y  L+++   +G V+ A + Y +M +AGLR  + T  +LL+      
Sbjct: 602 EMQQKNWIPDEPVYGLLVDLWGKAGNVEKAWQWYQAMLHAGLRPNVPTCNSLLS------ 661

Query: 484 LIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQ 543
                                        +++   I  A   LQ M + G+R +      
Sbjct: 662 ----------------------------TFLRVNKIAEAYELLQNMLALGLRPSLQTYTL 721

Query: 544 LFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERYLMSILSATR 603
           L   C                 D  +K+D+                    +   ++++T 
Sbjct: 722 LLSCC----------------TDGRSKLDM-------------------GFCGQLMASTG 781

Query: 604 HKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINR 663
           H AH FL  +        + V +    F   +  E +  S R  V+ ++++L   GQ   
Sbjct: 782 HPAHMFLLKM-PAAGPDGENVRNHANNFLDLMHSE-DRESKRGLVDAVVDFLHKSGQKEE 828

Query: 664 ARCIWKVAYENKLFPKAIVFDQHIAWSLDIRNLSVGAALIAVVHTLHRFRKRMLYYGIVP 723
           A  +W+VA +  +FP A+       W +++  +S G A+ A+  TL  FRK+ML  G  P
Sbjct: 842 AGSVWEVAAQKNVFPDALREKSCSYWLINLHVMSEGTAVTALSRTLAWFRKQMLASGTCP 828

Query: 724 RRIKLVTG----------PTLKLVVAQMLSSVESPF 750
            RI +VTG            ++  V ++L+   SPF
Sbjct: 902 SRIDIVTGWGRRSRVTGTSMVRQAVEELLNIFGSPF 828

BLAST of HG10018570 vs. TAIR 10
Match: AT1G18900.3 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 203.8 bits (517), Expect = 5.4e-52
Identity = 166/636 (26.10%), Postives = 269/636 (42.30%), Query Frame = 0

Query: 124 VAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYS 183
           V  +  V+ R++WGP  E  L+ L    +     Q LK ++D   +L  F W KRQ  + 
Sbjct: 302 VENVSSVLRRFRWGPAAEEALQNLGLRIDAYQANQVLKQMNDYGNALGFFYWLKRQPGFK 361

Query: 184 PNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVS 243
            +   Y  +   L + + F AI  L DE+VRD     TV+   YNR+I    +A  L  +
Sbjct: 362 HDGHTYTTMVGNLGRAKQFGAINKLLDEMVRDGCQPNTVT---YNRLIHSYGRANYLNEA 421

Query: 244 FCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDASTFELMIPS 303
              F ++ ++G K D  TY +LI +    G    A ++Y+ M   G S D  T+ ++I  
Sbjct: 422 MNVFNQMQEAGCKPDRVTYCTLIDIHAKAGFLDIAMDMYQRMQAGGLSPDTFTYSVIINC 481

Query: 304 LAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPP 363
           L K+G L AA KLF EM ++   P L  Y  ++D                          
Sbjct: 482 LGKAGHLPAAHKLFCEMVDQGCTPNLVTYNIMMD-------------------------- 541

Query: 364 ASMFVSLIESHVKAGKLDSALKLWDEMKRAGFRPNFGLYSMVVESHAKSGKLDVAMSIFT 423
                     H KA    +ALKL+ +M+ AGF P+   YS+V+E     G L+ A ++FT
Sbjct: 542 ---------LHAKARNYQNALKLYRDMQNAGFEPDKVTYSIVMEVLGHCGYLEEAEAVFT 601

Query: 424 EMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKK 483
           EM++  ++P    Y  L+++   +G V+ A + Y +M +AGLR  + T  +LL+      
Sbjct: 602 EMQQKNWIPDEPVYGLLVDLWGKAGNVEKAWQWYQAMLHAGLRPNVPTCNSLLS------ 661

Query: 484 LIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQ 543
                                        +++   I  A   LQ M + G+R +      
Sbjct: 662 ----------------------------TFLRVNKIAEAYELLQNMLALGLRPSLQTYTL 721

Query: 544 LFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERYLMSILSATR 603
           L   C                 D  +K+D+                    +   ++++T 
Sbjct: 722 LLSCC----------------TDGRSKLDM-------------------GFCGQLMASTG 781

Query: 604 HKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINR 663
           H AH FL  +        + V +    F   +  E +  S R  V+ ++++L   GQ   
Sbjct: 782 HPAHMFLLKM-PAAGPDGENVRNHANNFLDLMHSE-DRESKRGLVDAVVDFLHKSGQKEE 828

Query: 664 ARCIWKVAYENKLFPKAIVFDQHIAWSLDIRNLSVGAALIAVVHTLHRFRKRMLYYGIVP 723
           A  +W+VA +  +FP A+       W +++  +S G A+ A+  TL  FRK+ML  G  P
Sbjct: 842 AGSVWEVAAQKNVFPDALREKSCSYWLINLHVMSEGTAVTALSRTLAWFRKQMLASGTCP 828

Query: 724 RRIKLVTG----------PTLKLVVAQMLSSVESPF 750
            RI +VTG            ++  V ++L+   SPF
Sbjct: 902 SRIDIVTGWGRRSRVTGTSMVRQAVEELLNIFGSPF 828

BLAST of HG10018570 vs. TAIR 10
Match: AT1G74750.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 190.7 bits (483), Expect = 4.8e-48
Identity = 155/608 (25.49%), Postives = 249/608 (40.95%), Query Frame = 0

Query: 124 VAKIVEVVTRWKWGPELESQLEKLQFVPNMTHITQSLKIIDDAEASLSLFRWAKRQSWYS 183
           V  +  ++ R+KWG   E  L    F  +     Q LK +D+   +L  F W KRQ  + 
Sbjct: 297 VENVSSILRRFKWGHAAEEALHNFGFRMDAYQANQVLKQMDNYANALGFFYWLKRQPGFK 356

Query: 184 PNDECYGLLFDGLNQRRDFDAIQLLFDEVVRDLSSDGTVSFSAYNRVIQYLAKAEKLEVS 243
            +   Y  +   L + + F  I  L DE+VRD                            
Sbjct: 357 HDGHTYTTMVGNLGRAKQFGEINKLLDEMVRD---------------------------- 416

Query: 244 FCCFKKIHDSGFKVDTQTYNSLITLFLNKGLPYKAFEIYESMAGAGCSLDASTFELMIPS 303
                     G K +T TYN LI  +       +A  ++  M  AGC  D  T+  +I  
Sbjct: 417 ----------GCKPNTVTYNRLIHSYGRANYLKEAMNVFNQMQEAGCEPDRVTYCTLIDI 476

Query: 304 LAKSGRLDAAMKLFQEMKERNYRPALNVYTSLVDSMGKAGRLDTSMKIYMEMQLLELRPP 363
            AK+G LD AM ++Q M+E    P    Y+ +++ +GKAG L  + +++ EM      P 
Sbjct: 477 HAKAGFLDIAMDMYQRMQEAGLSPDTFTYSVIINCLGKAGHLPAAHRLFCEMVGQGCTPN 536

Query: 364 ASMFVSLIESHVKAGKLDSALKLWDEMKRAGFRPNFGLYSMVVESHAKSGKLDVAMSIFT 423
              F  +I  H KA   ++ALKL+ +M+ AGF+P+   YS+V+E     G L+ A  +F 
Sbjct: 537 LVTFNIMIALHAKARNYETALKLYRDMQNAGFQPDKVTYSIVMEVLGHCGFLEEAEGVFA 596

Query: 424 EMEKAGFLPIPSTYCCLLEMHAASGQVDAAMKLYNSMTNAGLRLGLSTYTALLTLLANKK 483
           EM++  ++P    Y  L+++   +G VD A + Y +M  AGLR  + T  +LL+      
Sbjct: 597 EMQRKNWVPDEPVYGLLVDLWGKAGNVDKAWQWYQAMLQAGLRPNVPTCNSLLSTFLRVH 656

Query: 484 LIDIAAKVLLEMKAMGFSVDVSASDVLMVYIKEGSIDSALRWLQFMGSSGIRTNSFILRQ 543
            +  A  +L  M A+G                                            
Sbjct: 657 RMSEAYNLLQSMLALGLH------------------------------------------ 716

Query: 544 LFESCMKKGMYESAMPLLETYVDSAAKVDLILYTSILAHLVRCQDEQKERYLMSILSATR 603
                          P L+T            YT +L+     +      +   +++ + 
Sbjct: 717 ---------------PSLQT------------YTLLLSCCTDARSNFDMGFCGQLMAVSG 776

Query: 604 HKAHSFLCGLFTGTEQRKQPVLSFVREFFQGIDYELEESSARYFVNVLLNYLILMGQINR 663
           H AH FL  +        Q V   V  F   +  E +  S R  ++ ++++L   G    
Sbjct: 777 HPAHMFLLKM-PPAGPDGQKVRDHVSNFLDFMHSE-DRESKRGLMDAVVDFLHKSGLKEE 795

Query: 664 ARCIWKVAYENKLFPKAIVFDQHIAWSLDIRNLSVGAALIAVVHTLHRFRKRMLYYGIVP 723
           A  +W+VA    ++P A+    +  W +++  +S G A+IA+  TL  FRK+ML  G  P
Sbjct: 837 AGSVWEVAAGKNVYPDALREKSYSYWLINLHVMSEGTAVIALSRTLAWFRKQMLVSGDCP 795

Query: 724 RRIKLVTG 732
            RI +VTG
Sbjct: 897 SRIDIVTG 795

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038884643.10.0e+0094.90pentatricopeptide repeat-containing protein At1g79490, mitochondrial [Benincasa ... [more]
XP_008441211.10.0e+0093.32PREDICTED: pentatricopeptide repeat-containing protein At1g79490, mitochondrial ... [more]
XP_004138818.10.0e+0093.32pentatricopeptide repeat-containing protein At1g79490, mitochondrial [Cucumis sa... [more]
KAG6602718.10.0e+0091.74Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a... [more]
KAG7033406.10.0e+0091.62Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a... [more]
Match NameE-valueIdentityDescription
Q9SAK00.0e+0078.91Pentatricopeptide repeat-containing protein At1g79490, mitochondrial OS=Arabidop... [more]
Q8GYP67.7e-5126.10Pentatricopeptide repeat-containing protein At1g18900 OS=Arabidopsis thaliana OX... [more]
Q9SSF96.7e-4725.49Pentatricopeptide repeat-containing protein At1g74750 OS=Arabidopsis thaliana OX... [more]
P0C8942.6e-3524.65Putative pentatricopeptide repeat-containing protein At2g02150 OS=Arabidopsis th... [more]
Q9LW845.9e-3522.54Pentatricopeptide repeat-containing protein At3g16010 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A5A7T6330.0e+0093.32Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3B2G10.0e+0093.32pentatricopeptide repeat-containing protein At1g79490, mitochondrial OS=Cucumis ... [more]
A0A0A0LMG20.0e+0093.32Smr domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G402030 PE=3 SV... [more]
A0A6J1H9M80.0e+0091.37pentatricopeptide repeat-containing protein At1g79490, mitochondrial OS=Cucurbit... [more]
A0A6J1JM380.0e+0091.49pentatricopeptide repeat-containing protein At1g79490, mitochondrial OS=Cucurbit... [more]
Match NameE-valueIdentityDescription
AT1G79490.10.0e+0078.91Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G18900.15.4e-5226.10Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G18900.25.4e-5226.10Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G18900.35.4e-5226.10Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G74750.14.8e-4825.49Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002625Smr domainSMARTSM00463SMR_2coord: 688..767
e-value: 1.3E-9
score: 47.9
IPR002625Smr domainPROSITEPS50828SMRcoord: 691..767
score: 12.365887
IPR033443Pentacotripeptide-repeat region of PRORPPFAMPF17177PPR_longcoord: 226..380
e-value: 4.2E-9
score: 36.0
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 372..507
e-value: 1.4E-29
score: 105.5
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 286..361
e-value: 7.0E-16
score: 60.3
coord: 128..285
e-value: 4.7E-18
score: 67.4
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 508..598
e-value: 1.2E-7
score: 33.2
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 296..327
e-value: 4.8E-6
score: 24.4
coord: 402..432
e-value: 5.7E-5
score: 21.0
coord: 331..359
e-value: 7.0E-4
score: 17.6
coord: 261..294
e-value: 1.7E-4
score: 19.5
coord: 366..398
e-value: 6.4E-8
score: 30.3
coord: 436..466
e-value: 3.0E-4
score: 18.7
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 456..499
e-value: 2.4E-4
score: 21.1
coord: 386..443
e-value: 5.5E-6
score: 26.4
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 433..467
score: 9.086975
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 398..432
score: 10.807899
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 363..397
score: 11.619036
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 293..327
score: 12.726127
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 328..362
score: 9.339086
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 258..292
score: 10.237912
NoneNo IPR availablePANTHERPTHR45613PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 128..815
NoneNo IPR availablePANTHERPTHR45613:SF144OS04G0612800 PROTEINcoord: 128..815

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10018570.1HG10018570.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding