HG10022724 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10022724
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationChr05: 27622302 .. 27624391 (-)
RNA-Seq ExpressionHG10022724
SyntenyHG10022724
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAGGTACTCCGGCTCCCCTGTTACTGTCACCTCAAACCCACCACCACCGCTGTCGCACATCGTCACTTCGCCACCAAATACACTGCCAAAATCACTTCTTCTTCTCCCACAGGACGCTCCGTTTCCGTCGAGGTCACTCCGCCGGCTACTCTTCCGGTCGACTCTCGCGGCTACTCTCTTCCCCGCCGTGATCTCATCTGCAGAGCCGTTGACATACTCCTCCATCGCAAATCCCATTCCTCTTCAATCACCATTGATGACCGCTTCTCCGATTTATCCTCCTACTTTCAATCTCTCTCCGTCTCCCTAACCCCAGCCGAAGCCTCTGAAATTCTCAAATCCCTAAACTGCCCTGATCTCGCTCTGCAATTCTTCCAGCTTTGCCCCTCCCTTTGCCCTAAGTTTCGCCATGATGCCTTCACTTACAGCCGCATCCTTCTCATGCTCTCCCATTCATCTTCCTCGAAACGGTTCGATCAAGTTCGTGAGATTCTTTCGCAGATGGATAGAGATCAAATACGTGGTACAATTTCTACTGTTAATATCTTGATTAAAATTTTTGGTAGCAAGGACGACTTGGAATTATATACTGGTCTGATCAAGAAATGGGACTTGAGGCTTAATGCTTACACCTATAGGTGTTTGCTTCAAGCTCATGTAAGATCCCATGATTCTGATAGGGCTTTCAATGTGTATATGGAAATGCGGAGTAGAGGGTATCAGCTTGATATCTTTGCCTACAATATGCTGCTGGATGCTCTGGCAAAGGATGAAAAGGTTTAATGTGATAATGTTGAAAAAGTTTTGATGTTTGATTGATTCTGTATGAATTAGTTGTTCACTATTCTTCTTTTCCAATCTTTTTTGTTGTGTTTTAACAGCTTGATCGATCTTACAAAGTTTTTAAGGACATGAAACTGAAGCACTGTAATCCAGATGAGTATACATATACTATTATGATTAGGATGACTGGAAAAATGAGTAGAACTGAAGAGTCTTTGGCGCTCTTTGAAGAAATGCTAACAAAAGGCTGTACTCCAAATTTGATTGCATATAATACAATGATTCAGGCACTTTGTAAGAGCAGAATGGTTGACAAGGCAATTCTTCTATTTTCTAATATGGTTAAGAACAATTGTAGGCCAAATGAGTTTACATATAGTGTCATTTTGAATGTTTTGGTTGCAGAAGGGCAGTTGGGTAGATTGGATGAAGTTCTGGGAGTGTCCGATAAATTTATGAACAAATCACTATATGCATATCTTGTTAGGACTCTAAGCAAACTAGGCCATGCAAGTGAAGCTCATCGTCTTTTTTGCAACATGTGGAGCTTTCATGATAGAGGGGATAGAGATGCTTACATTTCCATGTTGGAGAGCTTATGCAGTGCAGGTAAAACTGTAGAAGCTATCGACCTGCTCGGTAAGGTTCATGAGAAGGGAATTAGTTCTGATACGATGATGTATAACACCGTGTTATCTACTTTGGGGAGGTTGAAGCAAGTATCTCATCTTCATGATCTTTATGAGAAAATGAAACAAGATGGACCTTTTCCGGACATATTCACGTATAATATTCTTATATCAAGCTTAGGACGTGTTGGGAAAGTTAAGGAGGCTGTTGAAGTTTTTGAAGAACTTGAGAATAGTAATTGCAAACCAGATATTATATCCTACAATTCTTTGATCAATTGCCTAGGGAAAAATGGGGATGTTGATGAAGCTCATATGAGATTTCTGGAGATGCAAGATAAAGGATTGAATCCTGATGTTGTAACGTACAGCACACTCATTGAATGTTTTGGAAAAACAGATAAAGTGGAGATGGCTCGCAGTTTGTTCGATAAAATGATAACTCAAGGATGCAGTCCAAATATTGTAACGTACAACATCCTTCTTGACTGTCTTGAAAGAGCTGGGAGAACTGCTGAAACAGTTGATCTGTATGCAAAGCTTAAACAGCAGGGATTAACACCAGATTCAATTACATATGCTATACTTGACAGATTACAAAGTGGCTCTAATAGAAAATTTAGAGTCCGGAGGCAAAATCCAATTACTGGTTGGGTCATTAGTCCTTTGAGGTAA

mRNA sequence

ATGAAGGTACTCCGGCTCCCCTGTTACTGTCACCTCAAACCCACCACCACCGCTGTCGCACATCGTCACTTCGCCACCAAATACACTGCCAAAATCACTTCTTCTTCTCCCACAGGACGCTCCGTTTCCGTCGAGGTCACTCCGCCGGCTACTCTTCCGGTCGACTCTCGCGGCTACTCTCTTCCCCGCCGTGATCTCATCTGCAGAGCCGTTGACATACTCCTCCATCGCAAATCCCATTCCTCTTCAATCACCATTGATGACCGCTTCTCCGATTTATCCTCCTACTTTCAATCTCTCTCCGTCTCCCTAACCCCAGCCGAAGCCTCTGAAATTCTCAAATCCCTAAACTGCCCTGATCTCGCTCTGCAATTCTTCCAGCTTTGCCCCTCCCTTTGCCCTAAGTTTCGCCATGATGCCTTCACTTACAGCCGCATCCTTCTCATGCTCTCCCATTCATCTTCCTCGAAACGGTTCGATCAAGTTCGTGAGATTCTTTCGCAGATGGATAGAGATCAAATACGTGGTACAATTTCTACTGTTAATATCTTGATTAAAATTTTTGGTAGCAAGGACGACTTGGAATTATATACTGGTCTGATCAAGAAATGGGACTTGAGGCTTAATGCTTACACCTATAGGTGTTTGCTTCAAGCTCATGTAAGATCCCATGATTCTGATAGGGCTTTCAATGTGTATATGGAAATGCGGAGTAGAGGGTATCAGCTTGATATCTTTGCCTACAATATGCTGCTGGATGCTCTGGCAAAGGATGAAAAGCTTGATCGATCTTACAAAGTTTTTAAGGACATGAAACTGAAGCACTGTAATCCAGATGAGTATACATATACTATTATGATTAGGATGACTGGAAAAATGAGTAGAACTGAAGAGTCTTTGGCGCTCTTTGAAGAAATGCTAACAAAAGGCTGTACTCCAAATTTGATTGCATATAATACAATGATTCAGGCACTTTGTAAGAGCAGAATGGTTGACAAGGCAATTCTTCTATTTTCTAATATGGTTAAGAACAATTGTAGGCCAAATGAGTTTACATATAGTGTCATTTTGAATGTTTTGGTTGCAGAAGGGCAGTTGGGTAGATTGGATGAAGTTCTGGGAGTGTCCGATAAATTTATGAACAAATCACTATATGCATATCTTGTTAGGACTCTAAGCAAACTAGGCCATGCAAGTGAAGCTCATCGTCTTTTTTGCAACATGTGGAGCTTTCATGATAGAGGGGATAGAGATGCTTACATTTCCATGTTGGAGAGCTTATGCAGTGCAGGTAAAACTGTAGAAGCTATCGACCTGCTCGGTAAGGTTCATGAGAAGGGAATTAGTTCTGATACGATGATGTATAACACCGTGTTATCTACTTTGGGGAGGTTGAAGCAAGTATCTCATCTTCATGATCTTTATGAGAAAATGAAACAAGATGGACCTTTTCCGGACATATTCACGTATAATATTCTTATATCAAGCTTAGGACGTGTTGGGAAAGTTAAGGAGGCTGTTGAAGTTTTTGAAGAACTTGAGAATAGTAATTGCAAACCAGATATTATATCCTACAATTCTTTGATCAATTGCCTAGGGAAAAATGGGGATGTTGATGAAGCTCATATGAGATTTCTGGAGATGCAAGATAAAGGATTGAATCCTGATGTTGTAACGTACAGCACACTCATTGAATGTTTTGGAAAAACAGATAAAGTGGAGATGGCTCGCAGTTTGTTCGATAAAATGATAACTCAAGGATGCAGTCCAAATATTGTAACGTACAACATCCTTCTTGACTGTCTTGAAAGAGCTGGGAGAACTGCTGAAACAGTTGATCTGTATGCAAAGCTTAAACAGCAGGGATTAACACCAGATTCAATTACATATGCTATACTTGACAGATTACAAAGTGGCTCTAATAGAAAATTTAGAGTCCGGAGGCAAAATCCAATTACTGGTTGGGTCATTAGTCCTTTGAGGTAA

Coding sequence (CDS)

ATGAAGGTACTCCGGCTCCCCTGTTACTGTCACCTCAAACCCACCACCACCGCTGTCGCACATCGTCACTTCGCCACCAAATACACTGCCAAAATCACTTCTTCTTCTCCCACAGGACGCTCCGTTTCCGTCGAGGTCACTCCGCCGGCTACTCTTCCGGTCGACTCTCGCGGCTACTCTCTTCCCCGCCGTGATCTCATCTGCAGAGCCGTTGACATACTCCTCCATCGCAAATCCCATTCCTCTTCAATCACCATTGATGACCGCTTCTCCGATTTATCCTCCTACTTTCAATCTCTCTCCGTCTCCCTAACCCCAGCCGAAGCCTCTGAAATTCTCAAATCCCTAAACTGCCCTGATCTCGCTCTGCAATTCTTCCAGCTTTGCCCCTCCCTTTGCCCTAAGTTTCGCCATGATGCCTTCACTTACAGCCGCATCCTTCTCATGCTCTCCCATTCATCTTCCTCGAAACGGTTCGATCAAGTTCGTGAGATTCTTTCGCAGATGGATAGAGATCAAATACGTGGTACAATTTCTACTGTTAATATCTTGATTAAAATTTTTGGTAGCAAGGACGACTTGGAATTATATACTGGTCTGATCAAGAAATGGGACTTGAGGCTTAATGCTTACACCTATAGGTGTTTGCTTCAAGCTCATGTAAGATCCCATGATTCTGATAGGGCTTTCAATGTGTATATGGAAATGCGGAGTAGAGGGTATCAGCTTGATATCTTTGCCTACAATATGCTGCTGGATGCTCTGGCAAAGGATGAAAAGCTTGATCGATCTTACAAAGTTTTTAAGGACATGAAACTGAAGCACTGTAATCCAGATGAGTATACATATACTATTATGATTAGGATGACTGGAAAAATGAGTAGAACTGAAGAGTCTTTGGCGCTCTTTGAAGAAATGCTAACAAAAGGCTGTACTCCAAATTTGATTGCATATAATACAATGATTCAGGCACTTTGTAAGAGCAGAATGGTTGACAAGGCAATTCTTCTATTTTCTAATATGGTTAAGAACAATTGTAGGCCAAATGAGTTTACATATAGTGTCATTTTGAATGTTTTGGTTGCAGAAGGGCAGTTGGGTAGATTGGATGAAGTTCTGGGAGTGTCCGATAAATTTATGAACAAATCACTATATGCATATCTTGTTAGGACTCTAAGCAAACTAGGCCATGCAAGTGAAGCTCATCGTCTTTTTTGCAACATGTGGAGCTTTCATGATAGAGGGGATAGAGATGCTTACATTTCCATGTTGGAGAGCTTATGCAGTGCAGGTAAAACTGTAGAAGCTATCGACCTGCTCGGTAAGGTTCATGAGAAGGGAATTAGTTCTGATACGATGATGTATAACACCGTGTTATCTACTTTGGGGAGGTTGAAGCAAGTATCTCATCTTCATGATCTTTATGAGAAAATGAAACAAGATGGACCTTTTCCGGACATATTCACGTATAATATTCTTATATCAAGCTTAGGACGTGTTGGGAAAGTTAAGGAGGCTGTTGAAGTTTTTGAAGAACTTGAGAATAGTAATTGCAAACCAGATATTATATCCTACAATTCTTTGATCAATTGCCTAGGGAAAAATGGGGATGTTGATGAAGCTCATATGAGATTTCTGGAGATGCAAGATAAAGGATTGAATCCTGATGTTGTAACGTACAGCACACTCATTGAATGTTTTGGAAAAACAGATAAAGTGGAGATGGCTCGCAGTTTGTTCGATAAAATGATAACTCAAGGATGCAGTCCAAATATTGTAACGTACAACATCCTTCTTGACTGTCTTGAAAGAGCTGGGAGAACTGCTGAAACAGTTGATCTGTATGCAAAGCTTAAACAGCAGGGATTAACACCAGATTCAATTACATATGCTATACTTGACAGATTACAAAGTGGCTCTAATAGAAAATTTAGAGTCCGGAGGCAAAATCCAATTACTGGTTGGGTCATTAGTCCTTTGAGGTAA

Protein sequence

MKVLRLPCYCHLKPTTTAVAHRHFATKYTAKITSSSPTGRSVSVEVTPPATLPVDSRGYSLPRRDLICRAVDILLHRKSHSSSITIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNCPDLALQFFQLCPSLCPKFRHDAFTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRGTISTVNILIKIFGSKDDLELYTGLIKKWDLRLNAYTYRCLLQAHVRSHDSDRAFNVYMEMRSRGYQLDIFAYNMLLDALAKDEKLDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKMSRTEESLALFEEMLTKGCTPNLIAYNTMIQALCKSRMVDKAILLFSNMVKNNCRPNEFTYSVILNVLVAEGQLGRLDEVLGVSDKFMNKSLYAYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRDAYISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYNTVLSTLGRLKQVSHLHDLYEKMKQDGPFPDIFTYNILISSLGRVGKVKEAVEVFEELENSNCKPDIISYNSLINCLGKNGDVDEAHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMITQGCSPNIVTYNILLDCLERAGRTAETVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVISPLR
Homology
BLAST of HG10022724 vs. NCBI nr
Match: XP_038898111.1 (pentatricopeptide repeat-containing protein At1g51965, mitochondrial [Benincasa hispida])

HSP 1 Score: 1265.8 bits (3274), Expect = 0.0e+00
Identity = 629/661 (95.16%), Postives = 647/661 (97.88%), Query Frame = 0

Query: 1   MKVLRLPCYCHLKPTTTAVAHRHFATKYTAKITSSSPTGRSVSVEVTPPATLPVDSRGYS 60
           MKVLRLPCY HL+PT TA  +RHFATKYTAKITSSSPTGRSVSVEVTPPATLPVDSRGYS
Sbjct: 4   MKVLRLPCYYHLQPTATAATYRHFATKYTAKITSSSPTGRSVSVEVTPPATLPVDSRGYS 63

Query: 61  LPRRDLICRAVDILLHRKSHSSSITIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNCPD 120
           LPRRDLICRAVDILLHRK HSSSITIDDRFSDL+SYFQSLSVSLTPAEASEILKSLNCPD
Sbjct: 64  LPRRDLICRAVDILLHRKPHSSSITIDDRFSDLASYFQSLSVSLTPAEASEILKSLNCPD 123

Query: 121 LALQFFQLCPSLCPKFRHDAFTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRGTIST 180
           LALQFFQLCPSLCPKFRHDAFTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRGTIST
Sbjct: 124 LALQFFQLCPSLCPKFRHDAFTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRGTIST 183

Query: 181 VNILIKIFGSKDDLELYTGLIKKWDLRLNAYTYRCLLQAHVRSHDSDRAFNVYMEMRSRG 240
           VNILIKIFGSK+DLE+ TGLIKKWDLR NAYTYRCLLQAH+RSHDSDRAFNVYMEMR +G
Sbjct: 184 VNILIKIFGSKEDLEVCTGLIKKWDLRFNAYTYRCLLQAHLRSHDSDRAFNVYMEMRGKG 243

Query: 241 YQLDIFAYNMLLDALAKDEKLDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKMSRTEESL 300
           YQLDIFAYNMLLDALAK+E+LDRSY+VFKDMKLKHCNPDEYTYTIMIRMTGKM RTEESL
Sbjct: 244 YQLDIFAYNMLLDALAKNEQLDRSYRVFKDMKLKHCNPDEYTYTIMIRMTGKMGRTEESL 303

Query: 301 ALFEEMLTKGCTPNLIAYNTMIQALCKSRMVDKAILLFSNMVKNNCRPNEFTYSVILNVL 360
            LFEEMLTKGCTPNLIAYNTMIQALCKSRMVDKAILLFSNM+KNNCRPNEFTYSVILNVL
Sbjct: 304 VLFEEMLTKGCTPNLIAYNTMIQALCKSRMVDKAILLFSNMIKNNCRPNEFTYSVILNVL 363

Query: 361 VAEGQLGRLDEVLGVSDKFMNKSLYAYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRDAY 420
           VAEGQLGRLDEVLGVS+KFMNKS+YAYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRDAY
Sbjct: 364 VAEGQLGRLDEVLGVSNKFMNKSIYAYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRDAY 423

Query: 421 ISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYNTVLSTLGRLKQVSHLHDLYEKMKQ 480
           ISMLESLCS GKTVEAIDLL KVHE+GISSDTMMYNTVLSTLG+LKQVSHLHDLYEKMK+
Sbjct: 424 ISMLESLCSTGKTVEAIDLLSKVHERGISSDTMMYNTVLSTLGKLKQVSHLHDLYEKMKR 483

Query: 481 DGPFPDIFTYNILISSLGRVGKVKEAVEVFEELENSNCKPDIISYNSLINCLGKNGDVDE 540
           DGPFPDIFTYNILISSLGRVGKVKEAVEVFEELENS+CKPDIISYNSLINCLGKNGDVDE
Sbjct: 484 DGPFPDIFTYNILISSLGRVGKVKEAVEVFEELENSDCKPDIISYNSLINCLGKNGDVDE 543

Query: 541 AHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMITQGCSPNIVTYNILLD 600
           AHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMA SLFDKMITQGC PNIVTYNILLD
Sbjct: 544 AHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMAHSLFDKMITQGCCPNIVTYNILLD 603

Query: 601 CLERAGRTAETVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVISPL 660
           CLERAGRTAETVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWV+SPL
Sbjct: 604 CLERAGRTAETVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVVSPL 663

Query: 661 R 662
           R
Sbjct: 664 R 664

BLAST of HG10022724 vs. NCBI nr
Match: XP_008447220.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g51965, mitochondrial [Cucumis melo])

HSP 1 Score: 1223.4 bits (3164), Expect = 0.0e+00
Identity = 610/661 (92.28%), Postives = 632/661 (95.61%), Query Frame = 0

Query: 1   MKVLRLPCYCHLKPTTTAVAHRHFATKYTAKITSSSPTGRSVSVEVTPPATLPVDSRGYS 60
           MKVLR PCY HLKPT TA AHRHFATKYTAKITSSSPTGRSV+V VTPPATL VDSRGYS
Sbjct: 1   MKVLRFPCYSHLKPTATAAAHRHFATKYTAKITSSSPTGRSVAVVVTPPATLSVDSRGYS 60

Query: 61  LPRRDLICRAVDILLHRKSHSSSITIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNCPD 120
           LPRRDLICR +DILLHR  HSS ITIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNCPD
Sbjct: 61  LPRRDLICRVIDILLHRNPHSSLITIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNCPD 120

Query: 121 LALQFFQLCPSLCPKFRHDAFTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRGTIST 180
           LALQFF  CPSLC KFRHD FTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRGTIST
Sbjct: 121 LALQFFHRCPSLCSKFRHDVFTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRGTIST 180

Query: 181 VNILIKIFGSKDDLELYTGLIKKWDLRLNAYTYRCLLQAHVRSHDSDRAFNVYMEMRSRG 240
           VNILIKIF S +DLEL TGLIKKWDLR NAYTYRCLLQAHVRS DSDRAF+VYMEM S+G
Sbjct: 181 VNILIKIFSSNEDLELCTGLIKKWDLRFNAYTYRCLLQAHVRSRDSDRAFHVYMEMWSKG 240

Query: 241 YQLDIFAYNMLLDALAKDEKLDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKMSRTEESL 300
           YQLDIFAYNMLLDALAKDEKLDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKM RTEESL
Sbjct: 241 YQLDIFAYNMLLDALAKDEKLDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKMGRTEESL 300

Query: 301 ALFEEMLTKGCTPNLIAYNTMIQALCKSRMVDKAILLFSNMVKNNCRPNEFTYSVILNVL 360
           ALFEEMLTKGCTPN+IAYNTMIQALCKSRMVDKAILLFSNM+KNNCRPNEFTYSVILNVL
Sbjct: 301 ALFEEMLTKGCTPNVIAYNTMIQALCKSRMVDKAILLFSNMIKNNCRPNEFTYSVILNVL 360

Query: 361 VAEGQLGRLDEVLGVSDKFMNKSLYAYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRDAY 420
           VAEGQLGRLDEVL VS+KF+NKS+YAYLVRTLSKLGH+SEAHRLFCNMWSFHD GDRDAY
Sbjct: 361 VAEGQLGRLDEVLEVSNKFINKSIYAYLVRTLSKLGHSSEAHRLFCNMWSFHDGGDRDAY 420

Query: 421 ISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYNTVLSTLGRLKQVSHLHDLYEKMKQ 480
           ISMLESLC  GKTVEAI+LL KVHEKGIS++TMMYNTVLSTLG+LKQVSHLHDLYEKMK+
Sbjct: 421 ISMLESLCRGGKTVEAIELLSKVHEKGISTNTMMYNTVLSTLGKLKQVSHLHDLYEKMKR 480

Query: 481 DGPFPDIFTYNILISSLGRVGKVKEAVEVFEELENSNCKPDIISYNSLINCLGKNGDVDE 540
           DGPFPDIFTYNILISSLGRVGKVKEAVEVFEELE+S+CKPDIISYNSLINCLGKNGDVDE
Sbjct: 481 DGPFPDIFTYNILISSLGRVGKVKEAVEVFEELESSDCKPDIISYNSLINCLGKNGDVDE 540

Query: 541 AHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMITQGCSPNIVTYNILLD 600
           AHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFD+MITQ C PNIVTYNILLD
Sbjct: 541 AHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFDRMITQRCCPNIVTYNILLD 600

Query: 601 CLERAGRTAETVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVISPL 660
           CLERAGRTAE VDLYAKL++QGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWV+SPL
Sbjct: 601 CLERAGRTAEAVDLYAKLREQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVVSPL 660

Query: 661 R 662
           R
Sbjct: 661 R 661

BLAST of HG10022724 vs. NCBI nr
Match: KAA0041996.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK17931.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1217.6 bits (3149), Expect = 0.0e+00
Identity = 608/661 (91.98%), Postives = 630/661 (95.31%), Query Frame = 0

Query: 1   MKVLRLPCYCHLKPTTTAVAHRHFATKYTAKITSSSPTGRSVSVEVTPPATLPVDSRGYS 60
           MKVLR  CY HLKPT TA AHRHFATKYTAKITSSSPTGRSV+V VTPPATL VDSRGYS
Sbjct: 1   MKVLRFACYSHLKPTATAAAHRHFATKYTAKITSSSPTGRSVAVVVTPPATLSVDSRGYS 60

Query: 61  LPRRDLICRAVDILLHRKSHSSSITIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNCPD 120
           LPRRDLICR +DILLHR  HSS ITIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNCPD
Sbjct: 61  LPRRDLICRVIDILLHRNPHSSLITIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNCPD 120

Query: 121 LALQFFQLCPSLCPKFRHDAFTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRGTIST 180
           LALQFF  CPSLC KFRHD FTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRGTIST
Sbjct: 121 LALQFFHRCPSLCSKFRHDVFTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRGTIST 180

Query: 181 VNILIKIFGSKDDLELYTGLIKKWDLRLNAYTYRCLLQAHVRSHDSDRAFNVYMEMRSRG 240
           VNILIKIF S +DLEL TGLIKKWDLR NAYTYRCLLQAHVRS DSDRAF+VYMEM S+G
Sbjct: 181 VNILIKIFSSNEDLELCTGLIKKWDLRFNAYTYRCLLQAHVRSRDSDRAFHVYMEMWSKG 240

Query: 241 YQLDIFAYNMLLDALAKDEKLDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKMSRTEESL 300
           YQLDIFAYNMLLDALAKDEKLDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKM RTEESL
Sbjct: 241 YQLDIFAYNMLLDALAKDEKLDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKMGRTEESL 300

Query: 301 ALFEEMLTKGCTPNLIAYNTMIQALCKSRMVDKAILLFSNMVKNNCRPNEFTYSVILNVL 360
           ALFEEMLTKGCTPN+IAYNTMIQALCKSRMVDKAILLFSNM+KNNCRPNEFTYSVILNVL
Sbjct: 301 ALFEEMLTKGCTPNVIAYNTMIQALCKSRMVDKAILLFSNMIKNNCRPNEFTYSVILNVL 360

Query: 361 VAEGQLGRLDEVLGVSDKFMNKSLYAYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRDAY 420
           VAEG LGRLDEVL VS+KF+NKS+YAYLVRTLSKLGH+SEAHRLFCNMWSFHD GDRDAY
Sbjct: 361 VAEGLLGRLDEVLEVSNKFINKSIYAYLVRTLSKLGHSSEAHRLFCNMWSFHDGGDRDAY 420

Query: 421 ISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYNTVLSTLGRLKQVSHLHDLYEKMKQ 480
           ISMLESLC  GKTVEAI+LL KVHEKGIS++TMMYNTVLSTLG+LKQVSHLHDLYEKMK+
Sbjct: 421 ISMLESLCRGGKTVEAIELLSKVHEKGISTNTMMYNTVLSTLGKLKQVSHLHDLYEKMKR 480

Query: 481 DGPFPDIFTYNILISSLGRVGKVKEAVEVFEELENSNCKPDIISYNSLINCLGKNGDVDE 540
           DGPFPDIFTYNILISSLGRVGKVKEAVEVFEELE+S+CKPDIISYNSLINCLGKNGDVDE
Sbjct: 481 DGPFPDIFTYNILISSLGRVGKVKEAVEVFEELESSDCKPDIISYNSLINCLGKNGDVDE 540

Query: 541 AHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMITQGCSPNIVTYNILLD 600
           AHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFD+MITQ C PNIVTYNILLD
Sbjct: 541 AHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFDRMITQRCCPNIVTYNILLD 600

Query: 601 CLERAGRTAETVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVISPL 660
           CLERAGRTAE VDLYAKL++QGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWV+SPL
Sbjct: 601 CLERAGRTAEAVDLYAKLREQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVVSPL 660

Query: 661 R 662
           R
Sbjct: 661 R 661

BLAST of HG10022724 vs. NCBI nr
Match: XP_031744831.1 (pentatricopeptide repeat-containing protein At1g51965, mitochondrial isoform X2 [Cucumis sativus])

HSP 1 Score: 1211.1 bits (3132), Expect = 0.0e+00
Identity = 606/661 (91.68%), Postives = 629/661 (95.16%), Query Frame = 0

Query: 1   MKVLRLPCYCHLKPTTTAVAHRHFATKYTAKITSSSPTGRSVSVEVTPPATLPVDSRGYS 60
           MKVLRLPCY HLKP     AHRHFATKYTAKITSSSPTGRSV+V VTPPATLPVDSRGY+
Sbjct: 1   MKVLRLPCYSHLKP---PAAHRHFATKYTAKITSSSPTGRSVAVVVTPPATLPVDSRGYA 60

Query: 61  LPRRDLICRAVDILLHRKSHSSSITIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNCPD 120
           LPRRDLICR +D+LLHR  HSS ITIDDRFSDLSSYFQSLSVSLTPAEASEILKSLN PD
Sbjct: 61  LPRRDLICRVIDMLLHRNPHSSLITIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNSPD 120

Query: 121 LALQFFQLCPSLCPKFRHDAFTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRGTIST 180
           LALQFF  C SLCPKFRHDAFTYSRILLMLSHSSSSKR DQVREILSQMDRDQIRGTIST
Sbjct: 121 LALQFFHRCSSLCPKFRHDAFTYSRILLMLSHSSSSKRIDQVREILSQMDRDQIRGTIST 180

Query: 181 VNILIKIFGSKDDLELYTGLIKKWDLRLNAYTYRCLLQAHVRSHDSDRAFNVYMEMRSRG 240
           VNILIKIF S +DLEL TGLIKKWDLRLNAYTYRCLLQAH+RS DSDRAFNVYMEM S+G
Sbjct: 181 VNILIKIFSSNEDLELCTGLIKKWDLRLNAYTYRCLLQAHIRSRDSDRAFNVYMEMWSKG 240

Query: 241 YQLDIFAYNMLLDALAKDEKLDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKMSRTEESL 300
           YQLDIFAYNMLLDALAKDE+LDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKM R EESL
Sbjct: 241 YQLDIFAYNMLLDALAKDEQLDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKMGRAEESL 300

Query: 301 ALFEEMLTKGCTPNLIAYNTMIQALCKSRMVDKAILLFSNMVKNNCRPNEFTYSVILNVL 360
           ALFEEMLTKGCTPNLIAYNTMIQAL KS MVDKAILLF NM+KNNCRPNEFTYS+ILNVL
Sbjct: 301 ALFEEMLTKGCTPNLIAYNTMIQALSKSGMVDKAILLFCNMIKNNCRPNEFTYSIILNVL 360

Query: 361 VAEGQLGRLDEVLGVSDKFMNKSLYAYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRDAY 420
           VAEGQLGRLDEVL VS+KF+NKS+YAYLVRTLSKLGH+SEAHRLFCNMWSFHD GDRDAY
Sbjct: 361 VAEGQLGRLDEVLEVSNKFINKSIYAYLVRTLSKLGHSSEAHRLFCNMWSFHDGGDRDAY 420

Query: 421 ISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYNTVLSTLGRLKQVSHLHDLYEKMKQ 480
           ISMLESLC  GKTVEAI+LL KVHEKGIS+DTMMYNTVLSTLG+LKQVSHLHDLYEKMKQ
Sbjct: 421 ISMLESLCRGGKTVEAIELLSKVHEKGISTDTMMYNTVLSTLGKLKQVSHLHDLYEKMKQ 480

Query: 481 DGPFPDIFTYNILISSLGRVGKVKEAVEVFEELENSNCKPDIISYNSLINCLGKNGDVDE 540
           DGPFPDIFTYNILISSLGRVGKVKEAVEVFEELE+S+CKPDIISYNSLINCLGKNGDVDE
Sbjct: 481 DGPFPDIFTYNILISSLGRVGKVKEAVEVFEELESSDCKPDIISYNSLINCLGKNGDVDE 540

Query: 541 AHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMITQGCSPNIVTYNILLD 600
           AHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFD+MITQGC PNIVTYNILLD
Sbjct: 541 AHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFDRMITQGCCPNIVTYNILLD 600

Query: 601 CLERAGRTAETVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVISPL 660
           CLERAGRTAETVDLYAKL++QGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWV+SPL
Sbjct: 601 CLERAGRTAETVDLYAKLREQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVVSPL 658

Query: 661 R 662
           R
Sbjct: 661 R 658

BLAST of HG10022724 vs. NCBI nr
Match: XP_004150337.1 (pentatricopeptide repeat-containing protein At1g51965, mitochondrial isoform X1 [Cucumis sativus])

HSP 1 Score: 1211.1 bits (3132), Expect = 0.0e+00
Identity = 606/661 (91.68%), Postives = 629/661 (95.16%), Query Frame = 0

Query: 1   MKVLRLPCYCHLKPTTTAVAHRHFATKYTAKITSSSPTGRSVSVEVTPPATLPVDSRGYS 60
           MKVLRLPCY HLKP     AHRHFATKYTAKITSSSPTGRSV+V VTPPATLPVDSRGY+
Sbjct: 33  MKVLRLPCYSHLKP---PAAHRHFATKYTAKITSSSPTGRSVAVVVTPPATLPVDSRGYA 92

Query: 61  LPRRDLICRAVDILLHRKSHSSSITIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNCPD 120
           LPRRDLICR +D+LLHR  HSS ITIDDRFSDLSSYFQSLSVSLTPAEASEILKSLN PD
Sbjct: 93  LPRRDLICRVIDMLLHRNPHSSLITIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNSPD 152

Query: 121 LALQFFQLCPSLCPKFRHDAFTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRGTIST 180
           LALQFF  C SLCPKFRHDAFTYSRILLMLSHSSSSKR DQVREILSQMDRDQIRGTIST
Sbjct: 153 LALQFFHRCSSLCPKFRHDAFTYSRILLMLSHSSSSKRIDQVREILSQMDRDQIRGTIST 212

Query: 181 VNILIKIFGSKDDLELYTGLIKKWDLRLNAYTYRCLLQAHVRSHDSDRAFNVYMEMRSRG 240
           VNILIKIF S +DLEL TGLIKKWDLRLNAYTYRCLLQAH+RS DSDRAFNVYMEM S+G
Sbjct: 213 VNILIKIFSSNEDLELCTGLIKKWDLRLNAYTYRCLLQAHIRSRDSDRAFNVYMEMWSKG 272

Query: 241 YQLDIFAYNMLLDALAKDEKLDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKMSRTEESL 300
           YQLDIFAYNMLLDALAKDE+LDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKM R EESL
Sbjct: 273 YQLDIFAYNMLLDALAKDEQLDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKMGRAEESL 332

Query: 301 ALFEEMLTKGCTPNLIAYNTMIQALCKSRMVDKAILLFSNMVKNNCRPNEFTYSVILNVL 360
           ALFEEMLTKGCTPNLIAYNTMIQAL KS MVDKAILLF NM+KNNCRPNEFTYS+ILNVL
Sbjct: 333 ALFEEMLTKGCTPNLIAYNTMIQALSKSGMVDKAILLFCNMIKNNCRPNEFTYSIILNVL 392

Query: 361 VAEGQLGRLDEVLGVSDKFMNKSLYAYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRDAY 420
           VAEGQLGRLDEVL VS+KF+NKS+YAYLVRTLSKLGH+SEAHRLFCNMWSFHD GDRDAY
Sbjct: 393 VAEGQLGRLDEVLEVSNKFINKSIYAYLVRTLSKLGHSSEAHRLFCNMWSFHDGGDRDAY 452

Query: 421 ISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYNTVLSTLGRLKQVSHLHDLYEKMKQ 480
           ISMLESLC  GKTVEAI+LL KVHEKGIS+DTMMYNTVLSTLG+LKQVSHLHDLYEKMKQ
Sbjct: 453 ISMLESLCRGGKTVEAIELLSKVHEKGISTDTMMYNTVLSTLGKLKQVSHLHDLYEKMKQ 512

Query: 481 DGPFPDIFTYNILISSLGRVGKVKEAVEVFEELENSNCKPDIISYNSLINCLGKNGDVDE 540
           DGPFPDIFTYNILISSLGRVGKVKEAVEVFEELE+S+CKPDIISYNSLINCLGKNGDVDE
Sbjct: 513 DGPFPDIFTYNILISSLGRVGKVKEAVEVFEELESSDCKPDIISYNSLINCLGKNGDVDE 572

Query: 541 AHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMITQGCSPNIVTYNILLD 600
           AHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFD+MITQGC PNIVTYNILLD
Sbjct: 573 AHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFDRMITQGCCPNIVTYNILLD 632

Query: 601 CLERAGRTAETVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVISPL 660
           CLERAGRTAETVDLYAKL++QGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWV+SPL
Sbjct: 633 CLERAGRTAETVDLYAKLREQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVVSPL 690

Query: 661 R 662
           R
Sbjct: 693 R 690

BLAST of HG10022724 vs. ExPASy Swiss-Prot
Match: Q9ZU27 (Pentatricopeptide repeat-containing protein At1g51965, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g51965 PE=2 SV=1)

HSP 1 Score: 876.7 bits (2264), Expect = 1.6e-253
Identity = 427/660 (64.70%), Postives = 540/660 (81.82%), Query Frame = 0

Query: 1   MKVLRLPCYCHLKPTTTAVAHRHFATKYTAKITSSSPTGRSVSVEVTPPATLPVDSRGYS 60
           MK+LR   +  +  T T    RH+ATKY AK+TSSSP+GRS+S EV+ P  LP D RGY 
Sbjct: 1   MKLLRRRFFNSVN-TITRPNRRHYATKYVAKVTSSSPSGRSLSAEVSLPNPLPADVRGYP 60

Query: 61  LPRRDLICRAVDILLHRKSHSSSITIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNCPD 120
           LPRR LICRA +++      + +  + D FSDLS Y  SLS+SLTP EASEILKSLN P 
Sbjct: 61  LPRRHLICRATNLI------TGASNLSDAFSDLSDYLSSLSLSLTPDEASEILKSLNSPL 120

Query: 121 LALQFFQLCPSLCPKFRHDAFTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRGTIST 180
           LA++FF+L PSLCP  ++D F Y+RI+L+LS S+   RFD+VR IL  M +  + G IST
Sbjct: 121 LAVEFFKLVPSLCPYSQNDPFLYNRIILILSRSNLPDRFDRVRSILDSMVKSNVHGNIST 180

Query: 181 VNILIKIFGSKDDLELYTGLIKKWDLRLNAYTYRCLLQAHVRSHDSDRAFNVYMEMRSRG 240
           VNILI  FG+ +DL++   L+KKWDL++N++TY+CLLQA++RS D  +AF+VY E+R  G
Sbjct: 181 VNILIGFFGNTEDLQMCLRLVKKWDLKMNSFTYKCLLQAYLRSRDYSKAFDVYCEIRRGG 240

Query: 241 YQLDIFAYNMLLDALAKDEKLDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKMSRTEESL 300
           ++LDIFAYNMLLDALAKDEK   + +VF+DMK +HC  DEYTYTIMIR  G++ + +E++
Sbjct: 241 HKLDIFAYNMLLDALAKDEK---ACQVFEDMKKRHCRRDEYTYTIMIRTMGRIGKCDEAV 300

Query: 301 ALFEEMLTKGCTPNLIAYNTMIQALCKSRMVDKAILLFSNMVKNNCRPNEFTYSVILNVL 360
            LF EM+T+G T N++ YNT++Q L K +MVDKAI +FS MV+  CRPNE+TYS++LN+L
Sbjct: 301 GLFNEMITEGLTLNVVGYNTLMQVLAKGKMVDKAIQVFSRMVETGCRPNEYTYSLLLNLL 360

Query: 361 VAEGQLGRLDEVLGVSDKFMNKSLYAYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRDAY 420
           VAEGQL RLD V+ +S ++M + +Y+YLVRTLSKLGH SEAHRLFC+MWSF  +G+RD+Y
Sbjct: 361 VAEGQLVRLDGVVEISKRYMTQGIYSYLVRTLSKLGHVSEAHRLFCDMWSFPVKGERDSY 420

Query: 421 ISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYNTVLSTLGRLKQVSHLHDLYEKMKQ 480
           +SMLESLC AGKT+EAI++L K+HEKG+ +DTMMYNTV S LG+LKQ+SH+HDL+EKMK+
Sbjct: 421 MSMLESLCGAGKTIEAIEMLSKIHEKGVVTDTMMYNTVFSALGKLKQISHIHDLFEKMKK 480

Query: 481 DGPFPDIFTYNILISSLGRVGKVKEAVEVFEELENSNCKPDIISYNSLINCLGKNGDVDE 540
           DGP PDIFTYNILI+S GRVG+V EA+ +FEELE S+CKPDIISYNSLINCLGKNGDVDE
Sbjct: 481 DGPSPDIFTYNILIASFGRVGEVDEAINIFEELERSDCKPDIISYNSLINCLGKNGDVDE 540

Query: 541 AHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMITQGCSPNIVTYNILLD 600
           AH+RF EMQ+KGLNPDVVTYSTL+ECFGKT++VEMA SLF++M+ +GC PNIVTYNILLD
Sbjct: 541 AHVRFKEMQEKGLNPDVVTYSTLMECFGKTERVEMAYSLFEEMLVKGCQPNIVTYNILLD 600

Query: 601 CLERAGRTAETVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVISPL 660
           CLE+ GRTAE VDLY+K+KQQGLTPDSITY +L+RLQS S+ K R+RR+NPITGWV+SPL
Sbjct: 601 CLEKNGRTAEAVDLYSKMKQQGLTPDSITYTVLERLQSVSHGKSRIRRKNPITGWVVSPL 650

BLAST of HG10022724 vs. ExPASy Swiss-Prot
Match: Q9SZ52 (Pentatricopeptide repeat-containing protein At4g31850, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PGR3 PE=1 SV=1)

HSP 1 Score: 225.3 bits (573), Expect = 2.0e-57
Identity = 141/514 (27.43%), Postives = 246/514 (47.86%), Query Frame = 0

Query: 158 RFDQVREILSQMDRDQIRGTISTVNILIKIFGSKDDLELYTGLIKK---WDLRLNAYTYR 217
           + +++  +   M +  I+   +T   + K    K  L+     ++K   +   LNAY+Y 
Sbjct: 133 KLEEMAYVFDLMQKRIIKRDTNTYLTIFKSLSVKGGLKQAPYALRKMREFGFVLNAYSYN 192

Query: 218 CLLQAHVRSHDSDRAFNVYMEMRSRGYQLDIFAYNMLLDALAKDEKLDRSYKVFKDMKLK 277
            L+   ++S     A  VY  M   G++  +  Y+ L+  L K   +D    + K+M+  
Sbjct: 193 GLIHLLLKSRFCTEAMEVYRRMILEGFRPSLQTYSSLMVGLGKRRDIDSVMGLLKEMETL 252

Query: 278 HCNPDEYTYTIMIRMTGKMSRTEESLALFEEMLTKGCTPNLIAYNTMIQALCKSRMVDKA 337
              P+ YT+TI IR+ G+  +  E+  + + M  +GC P+++ Y  +I ALC +R +D A
Sbjct: 253 GLKPNVYTFTICIRVLGRAGKINEAYEILKRMDDEGCGPDVVTYTVLIDALCTARKLDCA 312

Query: 338 ILLF-----------------------------------SNMVKNNCRPNEFTYSVILNV 397
             +F                                   S M K+   P+  T++++++ 
Sbjct: 313 KEVFEKMKTGRHKPDRVTYITLLDRFSDNRDLDSVKQFWSEMEKDGHVPDVVTFTILVDA 372

Query: 398 LVAEGQLGRLDEVLGV-SDKFMNKSLYAY--LVRTLSKLGHASEAHRLFCNMWSFHDRGD 457
           L   G  G   + L V  D+ +  +L+ Y  L+  L ++    +A  LF NM S   +  
Sbjct: 373 LCKAGNFGEAFDTLDVMRDQGILPNLHTYNTLICGLLRVHRLDDALELFGNMESLGVKPT 432

Query: 458 RDAYISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYNTVLSTLGRLKQVSHLHDLYE 517
              YI  ++    +G +V A++   K+  KGI+ + +  N  L +L +  +      ++ 
Sbjct: 433 AYTYIVFIDYYGKSGDSVSALETFEKMKTKGIAPNIVACNASLYSLAKAGRDREAKQIFY 492

Query: 518 KMKQDGPFPDIFTYNILISSLGRVGKVKEAVEVFEELENSNCKPDIISYNSLINCLGKNG 577
            +K  G  PD  TYN+++    +VG++ EA+++  E+  + C+PD+I  NSLIN L K  
Sbjct: 493 GLKDIGLVPDSVTYNMMMKCYSKVGEIDEAIKLLSEMMENGCEPDVIVVNSLINTLYKAD 552

Query: 578 DVDEAHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMITQGCSPNIVTYN 631
            VDEA   F+ M++  L P VVTY+TL+   GK  K++ A  LF+ M+ +GC PN +T+N
Sbjct: 553 RVDEAWKMFMRMKEMKLKPTVVTYNTLLAGLGKNGKIQEAIELFEGMVQKGCPPNTITFN 612

BLAST of HG10022724 vs. ExPASy Swiss-Prot
Match: Q9FIX3 (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX=3702 GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 222.2 bits (565), Expect = 1.7e-56
Identity = 151/577 (26.17%), Postives = 285/577 (49.39%), Query Frame = 0

Query: 79  SHSSSITIDDRFSDLSSY---FQSLSVSLTPAEASE-ILKSLNCPDLALQFFQLCPSLCP 138
           S S S+  D   + L  +      LS + TP  AS  +LKS N   L L+F         
Sbjct: 18  SPSDSLLADKALTFLKRHPYQLHHLSANFTPEAASNLLLKSQNDQALILKFLNWANP--- 77

Query: 139 KFRHDAFTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRGTISTVNILIKIFGSKDDL 198
              H  FT  R   +  H  +  +  +  +IL++   D    T+      +     ++  
Sbjct: 78  ---HQFFTL-RCKCITLHILTKFKLYKTAQILAE---DVAAKTLDDEYASLVFKSLQETY 137

Query: 199 ELYTGLIKKWDLRLNAYTYRCLLQAHVRSHDSDRAFNVYMEMRSRGYQLDIFAYNMLLDA 258
           +L       +DL + +Y+   L+         D+A ++    ++ G+   + +YN +LDA
Sbjct: 138 DLCYSTSSVFDLVVKSYSRLSLI---------DKALSIVHLAQAHGFMPGVLSYNAVLDA 197

Query: 259 LAKDEK-LDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKMSRTEESLALFEEMLTKGCTP 318
             + ++ +  +  VFK+M     +P+ +TY I+IR        + +L LF++M TKGC P
Sbjct: 198 TIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLP 257

Query: 319 NLIAYNTMIQALCKSRMVDKAILLFSNMVKNNCRPNEFTYSVILNVLVAEGQLGRLDEVL 378
           N++ YNT+I   CK R +D    L  +M      PN  +Y+V++N L  EG++  +  VL
Sbjct: 258 NVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVL 317

Query: 379 GVSDK---FMNKSLYAYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRDAYISMLESLCSA 438
              ++    +++  Y  L++   K G+  +A  +   M           Y S++ S+C A
Sbjct: 318 TEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKA 377

Query: 439 GKTVEAIDLLGKVHEKGISSDTMMYNTVLSTLGRLKQVSHLHDLYEKMKQDGPFPDIFTY 498
           G    A++ L ++  +G+  +   Y T++    +   ++  + +  +M  +G  P + TY
Sbjct: 378 GNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTY 437

Query: 499 NILISSLGRVGKVKEAVEVFEELENSNCKPDIISYNSLINCLGKNGDVDEAHMRFLEMQD 558
           N LI+     GK+++A+ V E+++     PD++SY+++++   ++ DVDEA     EM +
Sbjct: 438 NALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMVE 497

Query: 559 KGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMITQGCSPNIVTYNILLDCLERAGRTAE 618
           KG+ PD +TYS+LI+ F +  + + A  L+++M+  G  P+  TY  L++     G   +
Sbjct: 498 KGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLEK 557

Query: 619 TVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVR 648
            + L+ ++ ++G+ PD +TY++   L +G N++ R R
Sbjct: 558 ALQLHNEMVEKGVLPDVVTYSV---LINGLNKQSRTR 572

BLAST of HG10022724 vs. ExPASy Swiss-Prot
Match: Q9SR00 (Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At3g04760 PE=2 SV=1)

HSP 1 Score: 216.9 bits (551), Expect = 7.1e-55
Identity = 125/409 (30.56%), Postives = 217/409 (53.06%), Query Frame = 0

Query: 209 NAYTYRCLLQAHVRSHDSDRAFNVYMEMRSRGYQLDIFAYNMLLDALAKDEKLDRSYKVF 268
           + + Y  L+    + +  D A  V   MRS+ +  D   YN+++ +L    KLD + KV 
Sbjct: 157 DVFAYNALINGFCKMNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALKVL 216

Query: 269 KDMKLKHCNPDEYTYTIMIRMTGKMSRTEESLALFEEMLTKGCTPNLIAYNTMIQALCKS 328
             +   +C P   TYTI+I  T      +E+L L +EML++G  P++  YNT+I+ +CK 
Sbjct: 217 NQLLSDNCQPTVITYTILIEATMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGMCKE 276

Query: 329 RMVDKAILLFSNMVKNNCRPNEFTYSVILNVLVAEGQLGRLDEVLG--VSDKF-MNKSLY 388
            MVD+A  +  N+    C P+  +Y+++L  L+ +G+    ++++    S+K   N   Y
Sbjct: 277 GMVDRAFEMVRNLELKGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNVVTY 336

Query: 389 AYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRDAYISMLESLCSAGKTVEAIDLLGKVHE 448
           + L+ TL + G   EA  L   M       D  +Y  ++ + C  G+   AI+ L  +  
Sbjct: 337 SILITTLCRDGKIEEAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLETMIS 396

Query: 449 KGISSDTMMYNTVLSTLGRLKQVSHLHDLYEKMKQDGPFPDIFTYNILISSLGRVGKVKE 508
            G   D + YNTVL+TL +  +     +++ K+ + G  P+  +YN + S+L   G    
Sbjct: 397 DGCLPDIVNYNTVLATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGDKIR 456

Query: 509 AVEVFEELENSNCKPDIISYNSLINCLGKNGDVDEAHMRFLEMQDKGLNPDVVTYSTLIE 568
           A+ +  E+ ++   PD I+YNS+I+CL + G VDEA    ++M+    +P VVTY+ ++ 
Sbjct: 457 ALHMILEMMSNGIDPDEITYNSMISCLCREGMVDEAFELLVDMRSCEFHPSVVTYNIVLL 516

Query: 569 CFGKTDKVEMARSLFDKMITQGCSPNIVTYNILLDCLERAGRTAETVDL 615
            F K  ++E A ++ + M+  GC PN  TY +L++ +  AG  AE ++L
Sbjct: 517 GFCKAHRIEDAINVLESMVGNGCRPNETTYTVLIEGIGFAGYRAEAMEL 565

BLAST of HG10022724 vs. ExPASy Swiss-Prot
Match: Q9M907 (Pentatricopeptide repeat-containing protein At3g06920 OS=Arabidopsis thaliana OX=3702 GN=At3g06920 PE=2 SV=1)

HSP 1 Score: 209.9 bits (533), Expect = 8.7e-53
Identity = 146/540 (27.04%), Postives = 264/540 (48.89%), Query Frame = 0

Query: 112 ILKSLNCPDLALQFFQLCPSLCPKFRHDAFTYSRILLMLSHSSSSKRFDQVREILSQMDR 171
           +L   N  D A++ F+       K R    TY+   +++ + S+ K FD   E  S ++R
Sbjct: 282 VLCKANRLDEAVEMFE----HLEKNRRVPCTYAYNTMIMGYGSAGK-FD---EAYSLLER 341

Query: 172 DQIRGTISTV-------NILIKIFGSKDDLELYTGLIKKWDLRLNAYTYRCLLQAHVRSH 231
            + +G+I +V         L K+    + L+++  +  K D   N  TY  L+    R+ 
Sbjct: 342 QRAKGSIPSVIAYNCILTCLRKMGKVDEALKVFEEM--KKDAAPNLSTYNILIDMLCRAG 401

Query: 232 DSDRAFNVYMEMRSRGYQLDIFAYNMLLDALAKDEKLDRSYKVFKDMKLKHCNPDEYTYT 291
             D AF +   M+  G   ++   N+++D L K +KLD +  +F++M  K C PDE T+ 
Sbjct: 402 KLDTAFELRDSMQKAGLFPNVRTVNIMVDRLCKSQKLDEACAMFEEMDYKVCTPDEITFC 461

Query: 292 IMIRMTGKMSRTEESLALFEEMLTKGCTPNLIAYNTMIQALCKSRMVDKAILLFSNMVKN 351
            +I   GK+ R +++  ++E+ML   C  N I Y ++I+        +    ++ +M+  
Sbjct: 462 SLIDGLGKVGRVDDAYKVYEKMLDSDCRTNSIVYTSLIKNFFNHGRKEDGHKIYKDMINQ 521

Query: 352 NCRPNEFTYSVILNVL--VAEGQLGR-LDEVLGVSDKFMNKSLYAYLVRTLSKLGHASEA 411
           NC P+    +  ++ +    E + GR + E +       +   Y+ L+  L K G A+E 
Sbjct: 522 NCSPDLQLLNTYMDCMFKAGEPEKGRAMFEEIKARRFVPDARSYSILIHGLIKAGFANET 581

Query: 412 HRLFCNMWSFHDRGDRDAYISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYNTVLST 471
           + LF +M       D  AY  +++  C  GK  +A  LL ++  KG     + Y +V+  
Sbjct: 582 YELFYSMKEQGCVLDTRAYNIVIDGFCKCGKVNKAYQLLEEMKTKGFEPTVVTYGSVIDG 641

Query: 472 LGRLKQVSHLHDLYEKMKQDGPFPDIFTYNILISSLGRVGKVKEAVEVFEELENSNCKPD 531
           L ++ ++   + L+E+ K      ++  Y+ LI   G+VG++ EA  + EEL      P+
Sbjct: 642 LAKIDRLDEAYMLFEEAKSKRIELNVVIYSSLIDGFGKVGRIDEAYLILEELMQKGLTPN 701

Query: 532 IISYNSLINCLGKNGDVDEAHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFD 591
           + ++NSL++ L K  +++EA + F  M++    P+ VTY  LI    K  K   A   + 
Sbjct: 702 LYTWNSLLDALVKAEEINEALVCFQSMKELKCTPNQVTYGILINGLCKVRKFNKAFVFWQ 761

Query: 592 KMITQGCSPNIVTYNILLDCLERAGRTAETVDLYAKLKQQGLTPDSITY-AILDRLQSGS 641
           +M  QG  P+ ++Y  ++  L +AG  AE   L+ + K  G  PDS  Y A+++ L +G+
Sbjct: 762 EMQKQGMKPSTISYTTMISGLAKAGNIAEAGALFDRFKANGGVPDSACYNAMIEGLSNGN 811

BLAST of HG10022724 vs. ExPASy TrEMBL
Match: A0A1S3BGX8 (pentatricopeptide repeat-containing protein At1g51965, mitochondrial OS=Cucumis melo OX=3656 GN=LOC103489714 PE=3 SV=1)

HSP 1 Score: 1223.4 bits (3164), Expect = 0.0e+00
Identity = 610/661 (92.28%), Postives = 632/661 (95.61%), Query Frame = 0

Query: 1   MKVLRLPCYCHLKPTTTAVAHRHFATKYTAKITSSSPTGRSVSVEVTPPATLPVDSRGYS 60
           MKVLR PCY HLKPT TA AHRHFATKYTAKITSSSPTGRSV+V VTPPATL VDSRGYS
Sbjct: 1   MKVLRFPCYSHLKPTATAAAHRHFATKYTAKITSSSPTGRSVAVVVTPPATLSVDSRGYS 60

Query: 61  LPRRDLICRAVDILLHRKSHSSSITIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNCPD 120
           LPRRDLICR +DILLHR  HSS ITIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNCPD
Sbjct: 61  LPRRDLICRVIDILLHRNPHSSLITIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNCPD 120

Query: 121 LALQFFQLCPSLCPKFRHDAFTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRGTIST 180
           LALQFF  CPSLC KFRHD FTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRGTIST
Sbjct: 121 LALQFFHRCPSLCSKFRHDVFTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRGTIST 180

Query: 181 VNILIKIFGSKDDLELYTGLIKKWDLRLNAYTYRCLLQAHVRSHDSDRAFNVYMEMRSRG 240
           VNILIKIF S +DLEL TGLIKKWDLR NAYTYRCLLQAHVRS DSDRAF+VYMEM S+G
Sbjct: 181 VNILIKIFSSNEDLELCTGLIKKWDLRFNAYTYRCLLQAHVRSRDSDRAFHVYMEMWSKG 240

Query: 241 YQLDIFAYNMLLDALAKDEKLDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKMSRTEESL 300
           YQLDIFAYNMLLDALAKDEKLDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKM RTEESL
Sbjct: 241 YQLDIFAYNMLLDALAKDEKLDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKMGRTEESL 300

Query: 301 ALFEEMLTKGCTPNLIAYNTMIQALCKSRMVDKAILLFSNMVKNNCRPNEFTYSVILNVL 360
           ALFEEMLTKGCTPN+IAYNTMIQALCKSRMVDKAILLFSNM+KNNCRPNEFTYSVILNVL
Sbjct: 301 ALFEEMLTKGCTPNVIAYNTMIQALCKSRMVDKAILLFSNMIKNNCRPNEFTYSVILNVL 360

Query: 361 VAEGQLGRLDEVLGVSDKFMNKSLYAYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRDAY 420
           VAEGQLGRLDEVL VS+KF+NKS+YAYLVRTLSKLGH+SEAHRLFCNMWSFHD GDRDAY
Sbjct: 361 VAEGQLGRLDEVLEVSNKFINKSIYAYLVRTLSKLGHSSEAHRLFCNMWSFHDGGDRDAY 420

Query: 421 ISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYNTVLSTLGRLKQVSHLHDLYEKMKQ 480
           ISMLESLC  GKTVEAI+LL KVHEKGIS++TMMYNTVLSTLG+LKQVSHLHDLYEKMK+
Sbjct: 421 ISMLESLCRGGKTVEAIELLSKVHEKGISTNTMMYNTVLSTLGKLKQVSHLHDLYEKMKR 480

Query: 481 DGPFPDIFTYNILISSLGRVGKVKEAVEVFEELENSNCKPDIISYNSLINCLGKNGDVDE 540
           DGPFPDIFTYNILISSLGRVGKVKEAVEVFEELE+S+CKPDIISYNSLINCLGKNGDVDE
Sbjct: 481 DGPFPDIFTYNILISSLGRVGKVKEAVEVFEELESSDCKPDIISYNSLINCLGKNGDVDE 540

Query: 541 AHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMITQGCSPNIVTYNILLD 600
           AHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFD+MITQ C PNIVTYNILLD
Sbjct: 541 AHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFDRMITQRCCPNIVTYNILLD 600

Query: 601 CLERAGRTAETVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVISPL 660
           CLERAGRTAE VDLYAKL++QGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWV+SPL
Sbjct: 601 CLERAGRTAEAVDLYAKLREQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVVSPL 660

Query: 661 R 662
           R
Sbjct: 661 R 661

BLAST of HG10022724 vs. ExPASy TrEMBL
Match: A0A5A7TJ34 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold306G002280 PE=3 SV=1)

HSP 1 Score: 1217.6 bits (3149), Expect = 0.0e+00
Identity = 608/661 (91.98%), Postives = 630/661 (95.31%), Query Frame = 0

Query: 1   MKVLRLPCYCHLKPTTTAVAHRHFATKYTAKITSSSPTGRSVSVEVTPPATLPVDSRGYS 60
           MKVLR  CY HLKPT TA AHRHFATKYTAKITSSSPTGRSV+V VTPPATL VDSRGYS
Sbjct: 1   MKVLRFACYSHLKPTATAAAHRHFATKYTAKITSSSPTGRSVAVVVTPPATLSVDSRGYS 60

Query: 61  LPRRDLICRAVDILLHRKSHSSSITIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNCPD 120
           LPRRDLICR +DILLHR  HSS ITIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNCPD
Sbjct: 61  LPRRDLICRVIDILLHRNPHSSLITIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNCPD 120

Query: 121 LALQFFQLCPSLCPKFRHDAFTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRGTIST 180
           LALQFF  CPSLC KFRHD FTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRGTIST
Sbjct: 121 LALQFFHRCPSLCSKFRHDVFTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRGTIST 180

Query: 181 VNILIKIFGSKDDLELYTGLIKKWDLRLNAYTYRCLLQAHVRSHDSDRAFNVYMEMRSRG 240
           VNILIKIF S +DLEL TGLIKKWDLR NAYTYRCLLQAHVRS DSDRAF+VYMEM S+G
Sbjct: 181 VNILIKIFSSNEDLELCTGLIKKWDLRFNAYTYRCLLQAHVRSRDSDRAFHVYMEMWSKG 240

Query: 241 YQLDIFAYNMLLDALAKDEKLDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKMSRTEESL 300
           YQLDIFAYNMLLDALAKDEKLDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKM RTEESL
Sbjct: 241 YQLDIFAYNMLLDALAKDEKLDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKMGRTEESL 300

Query: 301 ALFEEMLTKGCTPNLIAYNTMIQALCKSRMVDKAILLFSNMVKNNCRPNEFTYSVILNVL 360
           ALFEEMLTKGCTPN+IAYNTMIQALCKSRMVDKAILLFSNM+KNNCRPNEFTYSVILNVL
Sbjct: 301 ALFEEMLTKGCTPNVIAYNTMIQALCKSRMVDKAILLFSNMIKNNCRPNEFTYSVILNVL 360

Query: 361 VAEGQLGRLDEVLGVSDKFMNKSLYAYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRDAY 420
           VAEG LGRLDEVL VS+KF+NKS+YAYLVRTLSKLGH+SEAHRLFCNMWSFHD GDRDAY
Sbjct: 361 VAEGLLGRLDEVLEVSNKFINKSIYAYLVRTLSKLGHSSEAHRLFCNMWSFHDGGDRDAY 420

Query: 421 ISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYNTVLSTLGRLKQVSHLHDLYEKMKQ 480
           ISMLESLC  GKTVEAI+LL KVHEKGIS++TMMYNTVLSTLG+LKQVSHLHDLYEKMK+
Sbjct: 421 ISMLESLCRGGKTVEAIELLSKVHEKGISTNTMMYNTVLSTLGKLKQVSHLHDLYEKMKR 480

Query: 481 DGPFPDIFTYNILISSLGRVGKVKEAVEVFEELENSNCKPDIISYNSLINCLGKNGDVDE 540
           DGPFPDIFTYNILISSLGRVGKVKEAVEVFEELE+S+CKPDIISYNSLINCLGKNGDVDE
Sbjct: 481 DGPFPDIFTYNILISSLGRVGKVKEAVEVFEELESSDCKPDIISYNSLINCLGKNGDVDE 540

Query: 541 AHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMITQGCSPNIVTYNILLD 600
           AHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFD+MITQ C PNIVTYNILLD
Sbjct: 541 AHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFDRMITQRCCPNIVTYNILLD 600

Query: 601 CLERAGRTAETVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVISPL 660
           CLERAGRTAE VDLYAKL++QGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWV+SPL
Sbjct: 601 CLERAGRTAEAVDLYAKLREQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVVSPL 660

Query: 661 R 662
           R
Sbjct: 661 R 661

BLAST of HG10022724 vs. ExPASy TrEMBL
Match: A0A0A0K6Z6 (PPR_long domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G372880 PE=3 SV=1)

HSP 1 Score: 1211.1 bits (3132), Expect = 0.0e+00
Identity = 606/661 (91.68%), Postives = 629/661 (95.16%), Query Frame = 0

Query: 1   MKVLRLPCYCHLKPTTTAVAHRHFATKYTAKITSSSPTGRSVSVEVTPPATLPVDSRGYS 60
           MKVLRLPCY HLKP     AHRHFATKYTAKITSSSPTGRSV+V VTPPATLPVDSRGY+
Sbjct: 33  MKVLRLPCYSHLKP---PAAHRHFATKYTAKITSSSPTGRSVAVVVTPPATLPVDSRGYA 92

Query: 61  LPRRDLICRAVDILLHRKSHSSSITIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNCPD 120
           LPRRDLICR +D+LLHR  HSS ITIDDRFSDLSSYFQSLSVSLTPAEASEILKSLN PD
Sbjct: 93  LPRRDLICRVIDMLLHRNPHSSLITIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNSPD 152

Query: 121 LALQFFQLCPSLCPKFRHDAFTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRGTIST 180
           LALQFF  C SLCPKFRHDAFTYSRILLMLSHSSSSKR DQVREILSQMDRDQIRGTIST
Sbjct: 153 LALQFFHRCSSLCPKFRHDAFTYSRILLMLSHSSSSKRIDQVREILSQMDRDQIRGTIST 212

Query: 181 VNILIKIFGSKDDLELYTGLIKKWDLRLNAYTYRCLLQAHVRSHDSDRAFNVYMEMRSRG 240
           VNILIKIF S +DLEL TGLIKKWDLRLNAYTYRCLLQAH+RS DSDRAFNVYMEM S+G
Sbjct: 213 VNILIKIFSSNEDLELCTGLIKKWDLRLNAYTYRCLLQAHIRSRDSDRAFNVYMEMWSKG 272

Query: 241 YQLDIFAYNMLLDALAKDEKLDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKMSRTEESL 300
           YQLDIFAYNMLLDALAKDE+LDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKM R EESL
Sbjct: 273 YQLDIFAYNMLLDALAKDEQLDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKMGRAEESL 332

Query: 301 ALFEEMLTKGCTPNLIAYNTMIQALCKSRMVDKAILLFSNMVKNNCRPNEFTYSVILNVL 360
           ALFEEMLTKGCTPNLIAYNTMIQAL KS MVDKAILLF NM+KNNCRPNEFTYS+ILNVL
Sbjct: 333 ALFEEMLTKGCTPNLIAYNTMIQALSKSGMVDKAILLFCNMIKNNCRPNEFTYSIILNVL 392

Query: 361 VAEGQLGRLDEVLGVSDKFMNKSLYAYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRDAY 420
           VAEGQLGRLDEVL VS+KF+NKS+YAYLVRTLSKLGH+SEAHRLFCNMWSFHD GDRDAY
Sbjct: 393 VAEGQLGRLDEVLEVSNKFINKSIYAYLVRTLSKLGHSSEAHRLFCNMWSFHDGGDRDAY 452

Query: 421 ISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYNTVLSTLGRLKQVSHLHDLYEKMKQ 480
           ISMLESLC  GKTVEAI+LL KVHEKGIS+DTMMYNTVLSTLG+LKQVSHLHDLYEKMKQ
Sbjct: 453 ISMLESLCRGGKTVEAIELLSKVHEKGISTDTMMYNTVLSTLGKLKQVSHLHDLYEKMKQ 512

Query: 481 DGPFPDIFTYNILISSLGRVGKVKEAVEVFEELENSNCKPDIISYNSLINCLGKNGDVDE 540
           DGPFPDIFTYNILISSLGRVGKVKEAVEVFEELE+S+CKPDIISYNSLINCLGKNGDVDE
Sbjct: 513 DGPFPDIFTYNILISSLGRVGKVKEAVEVFEELESSDCKPDIISYNSLINCLGKNGDVDE 572

Query: 541 AHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMITQGCSPNIVTYNILLD 600
           AHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFD+MITQGC PNIVTYNILLD
Sbjct: 573 AHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFDRMITQGCCPNIVTYNILLD 632

Query: 601 CLERAGRTAETVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVISPL 660
           CLERAGRTAETVDLYAKL++QGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWV+SPL
Sbjct: 633 CLERAGRTAETVDLYAKLREQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVVSPL 690

Query: 661 R 662
           R
Sbjct: 693 R 690

BLAST of HG10022724 vs. ExPASy TrEMBL
Match: A0A6J1KR93 (pentatricopeptide repeat-containing protein At1g51965, mitochondrial OS=Cucurbita maxima OX=3661 GN=LOC111497945 PE=3 SV=1)

HSP 1 Score: 1174.1 bits (3036), Expect = 0.0e+00
Identity = 585/661 (88.50%), Postives = 620/661 (93.80%), Query Frame = 0

Query: 1   MKVLRLPCYCHLKPTTTAVAHRHFATKYTAKITSSSPTGRSVSVEVTPPATLPVDSRGYS 60
           MKVLRL  Y  LKP+ TA +HRHFATKYTAKITSSSPTGRSVSVEVT PA LP+D RGYS
Sbjct: 1   MKVLRLYYYSLLKPSATAASHRHFATKYTAKITSSSPTGRSVSVEVTSPAPLPIDPRGYS 60

Query: 61  LPRRDLICRAVDILLHRKSHSSSITIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNCPD 120
           LPRRDLICRA+ ILL RK HSSS T+DDRF+DLSSYFQSLS+SLTPAEASEIL+SLN PD
Sbjct: 61  LPRRDLICRAIQILLDRKRHSSSSTVDDRFTDLSSYFQSLSISLTPAEASEILRSLN-PD 120

Query: 121 LALQFFQLCPSLCPKFRHDAFTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRGTIST 180
           LALQFFQLCPSLCPKFRHD FTYSRILL+LSHSSS KRFDQVREILSQM+RDQIRGTIST
Sbjct: 121 LALQFFQLCPSLCPKFRHDVFTYSRILLILSHSSSPKRFDQVREILSQMERDQIRGTIST 180

Query: 181 VNILIKIFGSKDDLELYTGLIKKWDLRLNAYTYRCLLQAHVRSHDSDRAFNVYMEMRSRG 240
           VNILI IFG K+DLEL  GLIKKWDLRLNAYTYRCLLQAHVRSH SD AFNVYMEMR+RG
Sbjct: 181 VNILIGIFGRKEDLELCLGLIKKWDLRLNAYTYRCLLQAHVRSHHSDGAFNVYMEMRNRG 240

Query: 241 YQLDIFAYNMLLDALAKDEKLDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKMSRTEESL 300
           ++LDIFAYNMLLDALAKDE+LDR+YK+FKDMKLKHCNPD YTYT+MIRMTGK  RTEESL
Sbjct: 241 FKLDIFAYNMLLDALAKDEQLDRAYKIFKDMKLKHCNPDVYTYTVMIRMTGKRGRTEESL 300

Query: 301 ALFEEMLTKGCTPNLIAYNTMIQALCKSRMVDKAILLFSNMVKNNCRPNEFTYSVILNVL 360
           A FEEML  GCTPNLI YNTMI+AL KSRMVDKAILLFSNM+KNNCRPNEFTYSVILNVL
Sbjct: 301 AFFEEMLKNGCTPNLIVYNTMIEALSKSRMVDKAILLFSNMIKNNCRPNEFTYSVILNVL 360

Query: 361 VAEGQLGRLDEVLGVSDKFMNKSLYAYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRDAY 420
           VAEGQ GRLDEVL +S+KFMNKS+YAYLVRTLSKLGHA+EAHRLFCNMWSFHDRGDRDAY
Sbjct: 361 VAEGQCGRLDEVLEMSNKFMNKSIYAYLVRTLSKLGHANEAHRLFCNMWSFHDRGDRDAY 420

Query: 421 ISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYNTVLSTLGRLKQVSHLHDLYEKMKQ 480
           ISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYN VLSTLG+LKQVSHLHDLYEKMKQ
Sbjct: 421 ISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYNMVLSTLGKLKQVSHLHDLYEKMKQ 480

Query: 481 DGPFPDIFTYNILISSLGRVGKVKEAVEVFEELENSNCKPDIISYNSLINCLGKNGDVDE 540
           DGP PD+FTYNILISSLGRVGKV+EAV+VFEELENS+CKPDIISYNSLINC GKNGDVDE
Sbjct: 481 DGPLPDVFTYNILISSLGRVGKVEEAVQVFEELENSSCKPDIISYNSLINCHGKNGDVDE 540

Query: 541 AHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMITQGCSPNIVTYNILLD 600
           AHMRFLEM++KGL PDVVTYSTLIECFGKTDKVEMARSLFDKMI QGC PNIVTYNILLD
Sbjct: 541 AHMRFLEMREKGLTPDVVTYSTLIECFGKTDKVEMARSLFDKMIAQGCCPNIVTYNILLD 600

Query: 601 CLERAGRTAETVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVISPL 660
           CLER GRTAE VDLYA+LKQQGLTPDSITYA+LDRLQSGS +KFRVRRQNPITGWV+SPL
Sbjct: 601 CLERTGRTAEAVDLYAELKQQGLTPDSITYAVLDRLQSGSTKKFRVRRQNPITGWVVSPL 660

Query: 661 R 662
           R
Sbjct: 661 R 660

BLAST of HG10022724 vs. ExPASy TrEMBL
Match: A0A6J1H6J4 (pentatricopeptide repeat-containing protein At1g51965, mitochondrial OS=Cucurbita moschata OX=3662 GN=LOC111460908 PE=3 SV=1)

HSP 1 Score: 1174.1 bits (3036), Expect = 0.0e+00
Identity = 587/661 (88.80%), Postives = 620/661 (93.80%), Query Frame = 0

Query: 1   MKVLRLPCYCHLKPTTTAVAHRHFATKYTAKITSSSPTGRSVSVEVTPPATLPVDSRGYS 60
           MKVLRL  Y  LKP+ TA +HRHFATKYTAKITSSSPTGRSV VEVTPPA LP+D RGYS
Sbjct: 1   MKVLRLYYYSLLKPSATAASHRHFATKYTAKITSSSPTGRSVYVEVTPPAPLPIDPRGYS 60

Query: 61  LPRRDLICRAVDILLHRKSHSSSITIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNCPD 120
           LPRRDLICRA+ ILL RK HSSS T+DDRFSDLSSYFQSLSVSLTPAEASEIL++LN PD
Sbjct: 61  LPRRDLICRAIQILLDRKPHSSSSTVDDRFSDLSSYFQSLSVSLTPAEASEILRALN-PD 120

Query: 121 LALQFFQLCPSLCPKFRHDAFTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRGTIST 180
           LALQFFQLCPSLCPKFRHD FTYSRILL+LSHSSS KRFDQVREILSQM+RDQIRGTIST
Sbjct: 121 LALQFFQLCPSLCPKFRHDVFTYSRILLILSHSSSPKRFDQVREILSQMERDQIRGTIST 180

Query: 181 VNILIKIFGSKDDLELYTGLIKKWDLRLNAYTYRCLLQAHVRSHDSDRAFNVYMEMRSRG 240
           VNILI IFG K+DLEL  GLIKKWDLRLNAYTYRCLLQAHVRSHDSD AFNVYMEMR+RG
Sbjct: 181 VNILIGIFGRKEDLELCLGLIKKWDLRLNAYTYRCLLQAHVRSHDSDGAFNVYMEMRNRG 240

Query: 241 YQLDIFAYNMLLDALAKDEKLDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKMSRTEESL 300
           ++LDIFAYNMLLDALAKDE+LDR+YKVFKDMKLKHCNPD YTYTIMIRMTGK  RTEESL
Sbjct: 241 FKLDIFAYNMLLDALAKDEQLDRAYKVFKDMKLKHCNPDVYTYTIMIRMTGKRGRTEESL 300

Query: 301 ALFEEMLTKGCTPNLIAYNTMIQALCKSRMVDKAILLFSNMVKNNCRPNEFTYSVILNVL 360
           A FEEML  G TPNLI YNTMI+AL KSRMVDKAILLFSNM+KNNCRPNEFTYSVILNVL
Sbjct: 301 AFFEEMLKNGFTPNLIVYNTMIEALSKSRMVDKAILLFSNMIKNNCRPNEFTYSVILNVL 360

Query: 361 VAEGQLGRLDEVLGVSDKFMNKSLYAYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRDAY 420
           VAEGQ GRLDEVL +S+KFMNKS+YAYLVRTLSKLGHA+EAHRLFCNMWSFHDRGDRDAY
Sbjct: 361 VAEGQCGRLDEVLEMSNKFMNKSIYAYLVRTLSKLGHANEAHRLFCNMWSFHDRGDRDAY 420

Query: 421 ISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYNTVLSTLGRLKQVSHLHDLYEKMKQ 480
           ISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYN VLSTLG+LKQVSHLHDLYEKMKQ
Sbjct: 421 ISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYNMVLSTLGKLKQVSHLHDLYEKMKQ 480

Query: 481 DGPFPDIFTYNILISSLGRVGKVKEAVEVFEELENSNCKPDIISYNSLINCLGKNGDVDE 540
           DGP PD+FTYNILISS GRVGKV+EAV+VFEELENS+CKPDIISYNSLINCLGKNGDVDE
Sbjct: 481 DGPLPDVFTYNILISSFGRVGKVEEAVQVFEELENSSCKPDIISYNSLINCLGKNGDVDE 540

Query: 541 AHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMITQGCSPNIVTYNILLD 600
           AHMRFLEM++KGL PDVVTYSTLIECFGKTDKVEMARSLFDKMI QGC PNIVTYNILLD
Sbjct: 541 AHMRFLEMREKGLTPDVVTYSTLIECFGKTDKVEMARSLFDKMIAQGCCPNIVTYNILLD 600

Query: 601 CLERAGRTAETVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVISPL 660
           CLER GRTAE VDLYA+LKQ+GLTPDSITYA+LDRLQSGS +KFRVRRQNPITGWV+SPL
Sbjct: 601 CLERTGRTAEAVDLYAELKQRGLTPDSITYAVLDRLQSGSTKKFRVRRQNPITGWVVSPL 660

Query: 661 R 662
           R
Sbjct: 661 R 660

BLAST of HG10022724 vs. TAIR 10
Match: AT1G51965.1 (ABA Overly-Sensitive 5 )

HSP 1 Score: 876.7 bits (2264), Expect = 1.2e-254
Identity = 427/660 (64.70%), Postives = 540/660 (81.82%), Query Frame = 0

Query: 1   MKVLRLPCYCHLKPTTTAVAHRHFATKYTAKITSSSPTGRSVSVEVTPPATLPVDSRGYS 60
           MK+LR   +  +  T T    RH+ATKY AK+TSSSP+GRS+S EV+ P  LP D RGY 
Sbjct: 1   MKLLRRRFFNSVN-TITRPNRRHYATKYVAKVTSSSPSGRSLSAEVSLPNPLPADVRGYP 60

Query: 61  LPRRDLICRAVDILLHRKSHSSSITIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNCPD 120
           LPRR LICRA +++      + +  + D FSDLS Y  SLS+SLTP EASEILKSLN P 
Sbjct: 61  LPRRHLICRATNLI------TGASNLSDAFSDLSDYLSSLSLSLTPDEASEILKSLNSPL 120

Query: 121 LALQFFQLCPSLCPKFRHDAFTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRGTIST 180
           LA++FF+L PSLCP  ++D F Y+RI+L+LS S+   RFD+VR IL  M +  + G IST
Sbjct: 121 LAVEFFKLVPSLCPYSQNDPFLYNRIILILSRSNLPDRFDRVRSILDSMVKSNVHGNIST 180

Query: 181 VNILIKIFGSKDDLELYTGLIKKWDLRLNAYTYRCLLQAHVRSHDSDRAFNVYMEMRSRG 240
           VNILI  FG+ +DL++   L+KKWDL++N++TY+CLLQA++RS D  +AF+VY E+R  G
Sbjct: 181 VNILIGFFGNTEDLQMCLRLVKKWDLKMNSFTYKCLLQAYLRSRDYSKAFDVYCEIRRGG 240

Query: 241 YQLDIFAYNMLLDALAKDEKLDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKMSRTEESL 300
           ++LDIFAYNMLLDALAKDEK   + +VF+DMK +HC  DEYTYTIMIR  G++ + +E++
Sbjct: 241 HKLDIFAYNMLLDALAKDEK---ACQVFEDMKKRHCRRDEYTYTIMIRTMGRIGKCDEAV 300

Query: 301 ALFEEMLTKGCTPNLIAYNTMIQALCKSRMVDKAILLFSNMVKNNCRPNEFTYSVILNVL 360
            LF EM+T+G T N++ YNT++Q L K +MVDKAI +FS MV+  CRPNE+TYS++LN+L
Sbjct: 301 GLFNEMITEGLTLNVVGYNTLMQVLAKGKMVDKAIQVFSRMVETGCRPNEYTYSLLLNLL 360

Query: 361 VAEGQLGRLDEVLGVSDKFMNKSLYAYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRDAY 420
           VAEGQL RLD V+ +S ++M + +Y+YLVRTLSKLGH SEAHRLFC+MWSF  +G+RD+Y
Sbjct: 361 VAEGQLVRLDGVVEISKRYMTQGIYSYLVRTLSKLGHVSEAHRLFCDMWSFPVKGERDSY 420

Query: 421 ISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYNTVLSTLGRLKQVSHLHDLYEKMKQ 480
           +SMLESLC AGKT+EAI++L K+HEKG+ +DTMMYNTV S LG+LKQ+SH+HDL+EKMK+
Sbjct: 421 MSMLESLCGAGKTIEAIEMLSKIHEKGVVTDTMMYNTVFSALGKLKQISHIHDLFEKMKK 480

Query: 481 DGPFPDIFTYNILISSLGRVGKVKEAVEVFEELENSNCKPDIISYNSLINCLGKNGDVDE 540
           DGP PDIFTYNILI+S GRVG+V EA+ +FEELE S+CKPDIISYNSLINCLGKNGDVDE
Sbjct: 481 DGPSPDIFTYNILIASFGRVGEVDEAINIFEELERSDCKPDIISYNSLINCLGKNGDVDE 540

Query: 541 AHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMITQGCSPNIVTYNILLD 600
           AH+RF EMQ+KGLNPDVVTYSTL+ECFGKT++VEMA SLF++M+ +GC PNIVTYNILLD
Sbjct: 541 AHVRFKEMQEKGLNPDVVTYSTLMECFGKTERVEMAYSLFEEMLVKGCQPNIVTYNILLD 600

Query: 601 CLERAGRTAETVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVISPL 660
           CLE+ GRTAE VDLY+K+KQQGLTPDSITY +L+RLQS S+ K R+RR+NPITGWV+SPL
Sbjct: 601 CLEKNGRTAEAVDLYSKMKQQGLTPDSITYTVLERLQSVSHGKSRIRRKNPITGWVVSPL 650

BLAST of HG10022724 vs. TAIR 10
Match: AT4G31850.1 (proton gradient regulation 3 )

HSP 1 Score: 225.3 bits (573), Expect = 1.4e-58
Identity = 141/514 (27.43%), Postives = 246/514 (47.86%), Query Frame = 0

Query: 158 RFDQVREILSQMDRDQIRGTISTVNILIKIFGSKDDLELYTGLIKK---WDLRLNAYTYR 217
           + +++  +   M +  I+   +T   + K    K  L+     ++K   +   LNAY+Y 
Sbjct: 133 KLEEMAYVFDLMQKRIIKRDTNTYLTIFKSLSVKGGLKQAPYALRKMREFGFVLNAYSYN 192

Query: 218 CLLQAHVRSHDSDRAFNVYMEMRSRGYQLDIFAYNMLLDALAKDEKLDRSYKVFKDMKLK 277
            L+   ++S     A  VY  M   G++  +  Y+ L+  L K   +D    + K+M+  
Sbjct: 193 GLIHLLLKSRFCTEAMEVYRRMILEGFRPSLQTYSSLMVGLGKRRDIDSVMGLLKEMETL 252

Query: 278 HCNPDEYTYTIMIRMTGKMSRTEESLALFEEMLTKGCTPNLIAYNTMIQALCKSRMVDKA 337
              P+ YT+TI IR+ G+  +  E+  + + M  +GC P+++ Y  +I ALC +R +D A
Sbjct: 253 GLKPNVYTFTICIRVLGRAGKINEAYEILKRMDDEGCGPDVVTYTVLIDALCTARKLDCA 312

Query: 338 ILLF-----------------------------------SNMVKNNCRPNEFTYSVILNV 397
             +F                                   S M K+   P+  T++++++ 
Sbjct: 313 KEVFEKMKTGRHKPDRVTYITLLDRFSDNRDLDSVKQFWSEMEKDGHVPDVVTFTILVDA 372

Query: 398 LVAEGQLGRLDEVLGV-SDKFMNKSLYAY--LVRTLSKLGHASEAHRLFCNMWSFHDRGD 457
           L   G  G   + L V  D+ +  +L+ Y  L+  L ++    +A  LF NM S   +  
Sbjct: 373 LCKAGNFGEAFDTLDVMRDQGILPNLHTYNTLICGLLRVHRLDDALELFGNMESLGVKPT 432

Query: 458 RDAYISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYNTVLSTLGRLKQVSHLHDLYE 517
              YI  ++    +G +V A++   K+  KGI+ + +  N  L +L +  +      ++ 
Sbjct: 433 AYTYIVFIDYYGKSGDSVSALETFEKMKTKGIAPNIVACNASLYSLAKAGRDREAKQIFY 492

Query: 518 KMKQDGPFPDIFTYNILISSLGRVGKVKEAVEVFEELENSNCKPDIISYNSLINCLGKNG 577
            +K  G  PD  TYN+++    +VG++ EA+++  E+  + C+PD+I  NSLIN L K  
Sbjct: 493 GLKDIGLVPDSVTYNMMMKCYSKVGEIDEAIKLLSEMMENGCEPDVIVVNSLINTLYKAD 552

Query: 578 DVDEAHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMITQGCSPNIVTYN 631
            VDEA   F+ M++  L P VVTY+TL+   GK  K++ A  LF+ M+ +GC PN +T+N
Sbjct: 553 RVDEAWKMFMRMKEMKLKPTVVTYNTLLAGLGKNGKIQEAIELFEGMVQKGCPPNTITFN 612

BLAST of HG10022724 vs. TAIR 10
Match: AT5G39710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 222.2 bits (565), Expect = 1.2e-57
Identity = 151/577 (26.17%), Postives = 285/577 (49.39%), Query Frame = 0

Query: 79  SHSSSITIDDRFSDLSSY---FQSLSVSLTPAEASE-ILKSLNCPDLALQFFQLCPSLCP 138
           S S S+  D   + L  +      LS + TP  AS  +LKS N   L L+F         
Sbjct: 18  SPSDSLLADKALTFLKRHPYQLHHLSANFTPEAASNLLLKSQNDQALILKFLNWANP--- 77

Query: 139 KFRHDAFTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRGTISTVNILIKIFGSKDDL 198
              H  FT  R   +  H  +  +  +  +IL++   D    T+      +     ++  
Sbjct: 78  ---HQFFTL-RCKCITLHILTKFKLYKTAQILAE---DVAAKTLDDEYASLVFKSLQETY 137

Query: 199 ELYTGLIKKWDLRLNAYTYRCLLQAHVRSHDSDRAFNVYMEMRSRGYQLDIFAYNMLLDA 258
           +L       +DL + +Y+   L+         D+A ++    ++ G+   + +YN +LDA
Sbjct: 138 DLCYSTSSVFDLVVKSYSRLSLI---------DKALSIVHLAQAHGFMPGVLSYNAVLDA 197

Query: 259 LAKDEK-LDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKMSRTEESLALFEEMLTKGCTP 318
             + ++ +  +  VFK+M     +P+ +TY I+IR        + +L LF++M TKGC P
Sbjct: 198 TIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLP 257

Query: 319 NLIAYNTMIQALCKSRMVDKAILLFSNMVKNNCRPNEFTYSVILNVLVAEGQLGRLDEVL 378
           N++ YNT+I   CK R +D    L  +M      PN  +Y+V++N L  EG++  +  VL
Sbjct: 258 NVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVL 317

Query: 379 GVSDK---FMNKSLYAYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRDAYISMLESLCSA 438
              ++    +++  Y  L++   K G+  +A  +   M           Y S++ S+C A
Sbjct: 318 TEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKA 377

Query: 439 GKTVEAIDLLGKVHEKGISSDTMMYNTVLSTLGRLKQVSHLHDLYEKMKQDGPFPDIFTY 498
           G    A++ L ++  +G+  +   Y T++    +   ++  + +  +M  +G  P + TY
Sbjct: 378 GNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTY 437

Query: 499 NILISSLGRVGKVKEAVEVFEELENSNCKPDIISYNSLINCLGKNGDVDEAHMRFLEMQD 558
           N LI+     GK+++A+ V E+++     PD++SY+++++   ++ DVDEA     EM +
Sbjct: 438 NALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMVE 497

Query: 559 KGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMITQGCSPNIVTYNILLDCLERAGRTAE 618
           KG+ PD +TYS+LI+ F +  + + A  L+++M+  G  P+  TY  L++     G   +
Sbjct: 498 KGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLEK 557

Query: 619 TVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVR 648
            + L+ ++ ++G+ PD +TY++   L +G N++ R R
Sbjct: 558 ALQLHNEMVEKGVLPDVVTYSV---LINGLNKQSRTR 572

BLAST of HG10022724 vs. TAIR 10
Match: AT3G04760.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 216.9 bits (551), Expect = 5.0e-56
Identity = 125/409 (30.56%), Postives = 217/409 (53.06%), Query Frame = 0

Query: 209 NAYTYRCLLQAHVRSHDSDRAFNVYMEMRSRGYQLDIFAYNMLLDALAKDEKLDRSYKVF 268
           + + Y  L+    + +  D A  V   MRS+ +  D   YN+++ +L    KLD + KV 
Sbjct: 157 DVFAYNALINGFCKMNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALKVL 216

Query: 269 KDMKLKHCNPDEYTYTIMIRMTGKMSRTEESLALFEEMLTKGCTPNLIAYNTMIQALCKS 328
             +   +C P   TYTI+I  T      +E+L L +EML++G  P++  YNT+I+ +CK 
Sbjct: 217 NQLLSDNCQPTVITYTILIEATMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGMCKE 276

Query: 329 RMVDKAILLFSNMVKNNCRPNEFTYSVILNVLVAEGQLGRLDEVLG--VSDKF-MNKSLY 388
            MVD+A  +  N+    C P+  +Y+++L  L+ +G+    ++++    S+K   N   Y
Sbjct: 277 GMVDRAFEMVRNLELKGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNVVTY 336

Query: 389 AYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRDAYISMLESLCSAGKTVEAIDLLGKVHE 448
           + L+ TL + G   EA  L   M       D  +Y  ++ + C  G+   AI+ L  +  
Sbjct: 337 SILITTLCRDGKIEEAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLETMIS 396

Query: 449 KGISSDTMMYNTVLSTLGRLKQVSHLHDLYEKMKQDGPFPDIFTYNILISSLGRVGKVKE 508
            G   D + YNTVL+TL +  +     +++ K+ + G  P+  +YN + S+L   G    
Sbjct: 397 DGCLPDIVNYNTVLATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGDKIR 456

Query: 509 AVEVFEELENSNCKPDIISYNSLINCLGKNGDVDEAHMRFLEMQDKGLNPDVVTYSTLIE 568
           A+ +  E+ ++   PD I+YNS+I+CL + G VDEA    ++M+    +P VVTY+ ++ 
Sbjct: 457 ALHMILEMMSNGIDPDEITYNSMISCLCREGMVDEAFELLVDMRSCEFHPSVVTYNIVLL 516

Query: 569 CFGKTDKVEMARSLFDKMITQGCSPNIVTYNILLDCLERAGRTAETVDL 615
            F K  ++E A ++ + M+  GC PN  TY +L++ +  AG  AE ++L
Sbjct: 517 GFCKAHRIEDAINVLESMVGNGCRPNETTYTVLIEGIGFAGYRAEAMEL 565

BLAST of HG10022724 vs. TAIR 10
Match: AT3G06920.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 209.9 bits (533), Expect = 6.2e-54
Identity = 146/540 (27.04%), Postives = 264/540 (48.89%), Query Frame = 0

Query: 112 ILKSLNCPDLALQFFQLCPSLCPKFRHDAFTYSRILLMLSHSSSSKRFDQVREILSQMDR 171
           +L   N  D A++ F+       K R    TY+   +++ + S+ K FD   E  S ++R
Sbjct: 282 VLCKANRLDEAVEMFE----HLEKNRRVPCTYAYNTMIMGYGSAGK-FD---EAYSLLER 341

Query: 172 DQIRGTISTV-------NILIKIFGSKDDLELYTGLIKKWDLRLNAYTYRCLLQAHVRSH 231
            + +G+I +V         L K+    + L+++  +  K D   N  TY  L+    R+ 
Sbjct: 342 QRAKGSIPSVIAYNCILTCLRKMGKVDEALKVFEEM--KKDAAPNLSTYNILIDMLCRAG 401

Query: 232 DSDRAFNVYMEMRSRGYQLDIFAYNMLLDALAKDEKLDRSYKVFKDMKLKHCNPDEYTYT 291
             D AF +   M+  G   ++   N+++D L K +KLD +  +F++M  K C PDE T+ 
Sbjct: 402 KLDTAFELRDSMQKAGLFPNVRTVNIMVDRLCKSQKLDEACAMFEEMDYKVCTPDEITFC 461

Query: 292 IMIRMTGKMSRTEESLALFEEMLTKGCTPNLIAYNTMIQALCKSRMVDKAILLFSNMVKN 351
            +I   GK+ R +++  ++E+ML   C  N I Y ++I+        +    ++ +M+  
Sbjct: 462 SLIDGLGKVGRVDDAYKVYEKMLDSDCRTNSIVYTSLIKNFFNHGRKEDGHKIYKDMINQ 521

Query: 352 NCRPNEFTYSVILNVL--VAEGQLGR-LDEVLGVSDKFMNKSLYAYLVRTLSKLGHASEA 411
           NC P+    +  ++ +    E + GR + E +       +   Y+ L+  L K G A+E 
Sbjct: 522 NCSPDLQLLNTYMDCMFKAGEPEKGRAMFEEIKARRFVPDARSYSILIHGLIKAGFANET 581

Query: 412 HRLFCNMWSFHDRGDRDAYISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYNTVLST 471
           + LF +M       D  AY  +++  C  GK  +A  LL ++  KG     + Y +V+  
Sbjct: 582 YELFYSMKEQGCVLDTRAYNIVIDGFCKCGKVNKAYQLLEEMKTKGFEPTVVTYGSVIDG 641

Query: 472 LGRLKQVSHLHDLYEKMKQDGPFPDIFTYNILISSLGRVGKVKEAVEVFEELENSNCKPD 531
           L ++ ++   + L+E+ K      ++  Y+ LI   G+VG++ EA  + EEL      P+
Sbjct: 642 LAKIDRLDEAYMLFEEAKSKRIELNVVIYSSLIDGFGKVGRIDEAYLILEELMQKGLTPN 701

Query: 532 IISYNSLINCLGKNGDVDEAHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFD 591
           + ++NSL++ L K  +++EA + F  M++    P+ VTY  LI    K  K   A   + 
Sbjct: 702 LYTWNSLLDALVKAEEINEALVCFQSMKELKCTPNQVTYGILINGLCKVRKFNKAFVFWQ 761

Query: 592 KMITQGCSPNIVTYNILLDCLERAGRTAETVDLYAKLKQQGLTPDSITY-AILDRLQSGS 641
           +M  QG  P+ ++Y  ++  L +AG  AE   L+ + K  G  PDS  Y A+++ L +G+
Sbjct: 762 EMQKQGMKPSTISYTTMISGLAKAGNIAEAGALFDRFKANGGVPDSACYNAMIEGLSNGN 811

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038898111.10.0e+0095.16pentatricopeptide repeat-containing protein At1g51965, mitochondrial [Benincasa ... [more]
XP_008447220.10.0e+0092.28PREDICTED: pentatricopeptide repeat-containing protein At1g51965, mitochondrial ... [more]
KAA0041996.10.0e+0091.98pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK17931... [more]
XP_031744831.10.0e+0091.68pentatricopeptide repeat-containing protein At1g51965, mitochondrial isoform X2 ... [more]
XP_004150337.10.0e+0091.68pentatricopeptide repeat-containing protein At1g51965, mitochondrial isoform X1 ... [more]
Match NameE-valueIdentityDescription
Q9ZU271.6e-25364.70Pentatricopeptide repeat-containing protein At1g51965, mitochondrial OS=Arabidop... [more]
Q9SZ522.0e-5727.43Pentatricopeptide repeat-containing protein At4g31850, chloroplastic OS=Arabidop... [more]
Q9FIX31.7e-5626.17Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX... [more]
Q9SR007.1e-5530.56Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidop... [more]
Q9M9078.7e-5327.04Pentatricopeptide repeat-containing protein At3g06920 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A1S3BGX80.0e+0092.28pentatricopeptide repeat-containing protein At1g51965, mitochondrial OS=Cucumis ... [more]
A0A5A7TJ340.0e+0091.98Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A0A0K6Z60.0e+0091.68PPR_long domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G372880 PE... [more]
A0A6J1KR930.0e+0088.50pentatricopeptide repeat-containing protein At1g51965, mitochondrial OS=Cucurbit... [more]
A0A6J1H6J40.0e+0088.80pentatricopeptide repeat-containing protein At1g51965, mitochondrial OS=Cucurbit... [more]
Match NameE-valueIdentityDescription
AT1G51965.11.2e-25464.70ABA Overly-Sensitive 5 [more]
AT4G31850.11.4e-5827.43proton gradient regulation 3 [more]
AT5G39710.11.2e-5726.17Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G04760.15.0e-5630.56Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT3G06920.16.2e-5427.04Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 440..514
e-value: 3.0E-19
score: 71.3
coord: 515..585
e-value: 3.1E-22
score: 81.0
coord: 586..636
e-value: 2.0E-10
score: 42.5
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 87..202
e-value: 3.2E-8
score: 35.1
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 228..384
e-value: 2.7E-42
score: 147.2
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 247..586
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 482..513
e-value: 1.1E-10
score: 41.1
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 419..448
e-value: 0.0069
score: 16.6
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 419..451
e-value: 3.8E-4
score: 18.5
coord: 593..627
e-value: 3.6E-6
score: 24.8
coord: 523..557
e-value: 3.2E-10
score: 37.5
coord: 488..522
e-value: 4.2E-10
score: 37.2
coord: 247..280
e-value: 4.5E-6
score: 24.5
coord: 211..244
e-value: 5.2E-5
score: 21.2
coord: 454..486
e-value: 1.0E-4
score: 20.2
coord: 558..592
e-value: 3.5E-10
score: 37.4
coord: 317..350
e-value: 2.2E-10
score: 38.1
coord: 281..314
e-value: 1.1E-7
score: 29.6
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 590..633
e-value: 5.9E-10
score: 39.2
coord: 520..567
e-value: 3.5E-15
score: 56.0
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 314..348
score: 12.824779
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 591..625
score: 11.509422
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 556..590
score: 12.682281
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 521..555
score: 12.802855
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 244..278
score: 11.147699
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 279..313
score: 12.057487
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 451..485
score: 9.744654
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 209..243
score: 10.402331
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 486..520
score: 13.493418
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 416..450
score: 9.470621
IPR033443Pentacotripeptide-repeat region of PRORPPFAMPF17177PPR_longcoord: 212..373
e-value: 9.9E-21
score: 74.0
NoneNo IPR availablePANTHERPTHR46128MITOCHONDRIAL GROUP I INTRON SPLICING FACTOR CCM1coord: 10..655
NoneNo IPR availablePANTHERPTHR46128:SF166BNAA05G15090D PROTEINcoord: 10..655

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10022724.1HG10022724.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding