HG10003654 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10003654
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationChr08: 4939837 .. 4941201 (-)
RNA-Seq ExpressionHG10003654
SyntenyHG10003654
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGTGTGATTCGCCGCCGGACCCTCTTTTTCTCTCGTCGTCTCTACCACTCGAGTCTCCAATTTCCAGTCCATCCTCGACCCATTGATTCTGAATCATGCACCTTCTTCATCAAAACTTGCAGTTCTATCAAATCTTTGAAATGTGTTCACGCTTCAATTCTCAGAGCCAACCTCCACCTCAACTTGTTCTTTTGCACCACTCTCATTTCCCACTACGCCTCGCTTGGCTCTGTTTCTTACGCCTATTCTCTCTTCTCTTTGTTGCAATCACTTGATGTTTTTCTGTGGAACGTCATGCTTCGCTGTTTTGTTGATGCTGGGTGTCATCGCAGGGCCATGCTTCTCTATGCCCAAATGCTGGATTTAGGTATTCGACCTAATAATTTTACGTTTCCATTTGTTTTTAAGGCCTGTGGGTGTGTGGAGGATTTGGATTTTGGGGTCAAGGTTCATCATGACGCTGTGTATTTTGGGTATGAGTTGGATGTTTTTGTTGCGAATTCACTCATTGCAATGTATGGTAGATGTGGGCGTTCAGAGCTTGCACGAGAGGTGTTTGATAAAATGCCTCAAAGAAGTGTGTTGTCCTGGAGTTCAATAATTGGTGCTTATGCACAAAATGGTCAATATGGCTTAGGAGTGTCGTTGTTCTCGCTGATGTTGAATGAAGGATTTCAACCTAACAGGTCTGTAATGTTGAATGTGATGGCTTGCATCCATTCAGAGACAGAAGCTGATGATGTTTATCGCATGGCTATGGATAACCAGCTTGGTTTAGATCAATCAGTCCAAAACGCAGCAGTTGGTATGTATGCACGATGTGGAAGAATTGACACTGCTGAAAAGATCTTTAATAGAATTCATAATAAGGACTTGGTTTCGTGGACATCAATGATTGAAGCTTACGTGCAGGCCGATCTTCCTCTGAAAGCTTTGGAGATTTTTAGAGAAATGATACTAAAAGGTATTATGCCTGATGGGATTACCCTTTTGGGTGTCATTCATGCTTGTTTAGCTTTAGGATCTTTTAGCCAAGCGTGTTGGGTACATGGTCTTGTTATCCGGAGGTTTTCTGAAAACCAAAAAATGGTTGAAACTGCCATTGTTGATCTCTATGTCAAATGTGGAAGTTTAATATATGCCAGAAAAGTTTTTGATAATATGCAGGAAAGAAATGTTATCTCATGGAGCACCATGATTTCAGGGTATGGATTACATGGCCATGGTAGAAAGGCAATCTGTCTCTTCAATGAGATGAAGAACTCAACTAAGCCTGACCACATAACATTTGTATCGTTATTGGCAGCATGGTCATGCGGAATTGGTTACAGAAGGATGGGATTTCTTCAATGCCATGTGTAG

mRNA sequence

ATGAGTGTGATTCGCCGCCGGACCCTCTTTTTCTCTCGTCGTCTCTACCACTCGAGTCTCCAATTTCCAGTCCATCCTCGACCCATTGATTCTGAATCATGCACCTTCTTCATCAAAACTTGCAGTTCTATCAAATCTTTGAAATGTGTTCACGCTTCAATTCTCAGAGCCAACCTCCACCTCAACTTGTTCTTTTGCACCACTCTCATTTCCCACTACGCCTCGCTTGGCTCTGTTTCTTACGCCTATTCTCTCTTCTCTTTGTTGCAATCACTTGATGTTTTTCTGTGGAACGTCATGCTTCGCTGTTTTGTTGATGCTGGGTGTCATCGCAGGGCCATGCTTCTCTATGCCCAAATGCTGGATTTAGGTATTCGACCTAATAATTTTACGTTTCCATTTGTTTTTAAGGCCTGTGGGTGTGTGGAGGATTTGGATTTTGGGGTCAAGGTTCATCATGACGCTGTGTATTTTGGGTATGAGTTGGATGTTTTTGTTGCGAATTCACTCATTGCAATGTATGGTAGATGTGGGCGTTCAGAGCTTGCACGAGAGGTGTTTGATAAAATGCCTCAAAGAAGTGTGTTGTCCTGGAGTTCAATAATTGGTGCTTATGCACAAAATGGTCAATATGGCTTAGGAGTGTCGTTGTTCTCGCTGATGTTGAATGAAGGATTTCAACCTAACAGGTCTGTAATGTTGAATGTGATGGCTTGCATCCATTCAGAGACAGAAGCTGATGATGTTTATCGCATGGCTATGGATAACCAGCTTGGTTTAGATCAATCAGTCCAAAACGCAGCAGTTGGTATGTATGCACGATGTGGAAGAATTGACACTGCTGAAAAGATCTTTAATAGAATTCATAATAAGGACTTGGTTTCGTGGACATCAATGATTGAAGCTTACGTGCAGGCCGATCTTCCTCTGAAAGCTTTGGAGATTTTTAGAGAAATGATACTAAAAGGTATTATGCCTGATGGGATTACCCTTTTGGGTGTCATTCATGCTTGTTTAGCTTTAGGATCTTTTAGCCAAGCGTGTTGGGTACATGGTCTTGTTATCCGGAGGTTTTCTGAAAACCAAAAAATGGTTGAAACTGCCATTGTTGATCTCTATGTCAAATGTGGAAGTTTAATATATGCCAGAAAAGTTTTTGATAATATGCAGGAAAGAAATGTTATCTCATGGAGCACCATGATTTCAGGGTATGGATTACATGGCCATGGTAGAAAGGCAATCTGTCTCTTCAATGAGATGAAGAACTCAACTAAGCCTGACCACATAACATTTGTATCGTTATTGGCAGCATGGTCATGCGGAATTGGTTACAGAAGGATGGGATTTCTTCAATGCCATGTGTAG

Coding sequence (CDS)

ATGAGTGTGATTCGCCGCCGGACCCTCTTTTTCTCTCGTCGTCTCTACCACTCGAGTCTCCAATTTCCAGTCCATCCTCGACCCATTGATTCTGAATCATGCACCTTCTTCATCAAAACTTGCAGTTCTATCAAATCTTTGAAATGTGTTCACGCTTCAATTCTCAGAGCCAACCTCCACCTCAACTTGTTCTTTTGCACCACTCTCATTTCCCACTACGCCTCGCTTGGCTCTGTTTCTTACGCCTATTCTCTCTTCTCTTTGTTGCAATCACTTGATGTTTTTCTGTGGAACGTCATGCTTCGCTGTTTTGTTGATGCTGGGTGTCATCGCAGGGCCATGCTTCTCTATGCCCAAATGCTGGATTTAGGTATTCGACCTAATAATTTTACGTTTCCATTTGTTTTTAAGGCCTGTGGGTGTGTGGAGGATTTGGATTTTGGGGTCAAGGTTCATCATGACGCTGTGTATTTTGGGTATGAGTTGGATGTTTTTGTTGCGAATTCACTCATTGCAATGTATGGTAGATGTGGGCGTTCAGAGCTTGCACGAGAGGTGTTTGATAAAATGCCTCAAAGAAGTGTGTTGTCCTGGAGTTCAATAATTGGTGCTTATGCACAAAATGGTCAATATGGCTTAGGAGTGTCGTTGTTCTCGCTGATGTTGAATGAAGGATTTCAACCTAACAGGTCTGTAATGTTGAATGTGATGGCTTGCATCCATTCAGAGACAGAAGCTGATGATGTTTATCGCATGGCTATGGATAACCAGCTTGGTTTAGATCAATCAGTCCAAAACGCAGCAGTTGGTATGTATGCACGATGTGGAAGAATTGACACTGCTGAAAAGATCTTTAATAGAATTCATAATAAGGACTTGGTTTCGTGGACATCAATGATTGAAGCTTACGTGCAGGCCGATCTTCCTCTGAAAGCTTTGGAGATTTTTAGAGAAATGATACTAAAAGGTATTATGCCTGATGGGATTACCCTTTTGGGTGTCATTCATGCTTGTTTAGCTTTAGGATCTTTTAGCCAAGCGTGTTGGGTACATGGTCTTGTTATCCGGAGGTTTTCTGAAAACCAAAAAATGGTTGAAACTGCCATTGTTGATCTCTATGTCAAATGTGGAAGTTTAATATATGCCAGAAAAGTTTTTGATAATATGCAGGAAAGAAATGTTATCTCATGGAGCACCATGATTTCAGGGTATGGATTACATGGCCATGGTAGAAAGGCAATCTGTCTCTTCAATGAGATGAAGAACTCAACTAAGCCTGACCACATAACATTTGTATCGTTATTGGCAGCATGGTCATGCGGAATTGGTTACAGAAGGATGGGATTTCTTCAATGCCATGTGTAG

Protein sequence

MSVIRRRTLFFSRRLYHSSLQFPVHPRPIDSESCTFFIKTCSSIKSLKCVHASILRANLHLNLFFCTTLISHYASLGSVSYAYSLFSLLQSLDVFLWNVMLRCFVDAGCHRRAMLLYAQMLDLGIRPNNFTFPFVFKACGCVEDLDFGVKVHHDAVYFGYELDVFVANSLIAMYGRCGRSELAREVFDKMPQRSVLSWSSIIGAYAQNGQYGLGVSLFSLMLNEGFQPNRSVMLNVMACIHSETEADDVYRMAMDNQLGLDQSVQNAAVGMYARCGRIDTAEKIFNRIHNKDLVSWTSMIEAYVQADLPLKALEIFREMILKGIMPDGITLLGVIHACLALGSFSQACWVHGLVIRRFSENQKMVETAIVDLYVKCGSLIYARKVFDNMQERNVISWSTMISGYGLHGHGRKAICLFNEMKNSTKPDHITFVSLLAAWSCGIGYRRMGFLQCHV
Homology
BLAST of HG10003654 vs. NCBI nr
Match: XP_038884398.1 (pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like [Benincasa hispida])

HSP 1 Score: 778.9 bits (2010), Expect = 2.4e-221
Identity = 385/439 (87.70%), Postives = 408/439 (92.94%), Query Frame = 0

Query: 1   MSVIRRRTLFFSRRLYHSSLQFPVHPRPIDSESCTFFIKTCSSIKSLKCVHASILRANLH 60
           M + RRR+L FSRRLYH SLQ    PRPIDSESCTF+IKTCS+IKSLKC+HASIL+ANLH
Sbjct: 1   MILNRRRSLLFSRRLYHWSLQLAYPPRPIDSESCTFYIKTCSTIKSLKCIHASILKANLH 60

Query: 61  LNLFFCTTLISHYASLGSVSYAYSLFSLLQSLDVFLWNVMLRCFVDAGCHRRAMLLYAQM 120
           LNLFFCTTLIS YASLGSVSYAYSLFSLLQSLDVFLWNVMLR FVDAG HRRAMLLY QM
Sbjct: 61  LNLFFCTTLISQYASLGSVSYAYSLFSLLQSLDVFLWNVMLRGFVDAGFHRRAMLLYTQM 120

Query: 121 LDLGIRPNNFTFPFVFKACGCVEDLDFGVKVHHDAVYFGYELDVFVANSLIAMYGRCGRS 180
           LDLGI P+NFTFPFVFKACGC+EDLDFGV+VH+DAVYFGYELDVFVANSLIAMYGRCGRS
Sbjct: 121 LDLGIGPDNFTFPFVFKACGCMEDLDFGVRVHYDAVYFGYELDVFVANSLIAMYGRCGRS 180

Query: 181 ELAREVFDKMPQRSVLSWSSIIGAYAQNGQYGLGVSLFSLMLNEGFQPNRSVMLNVMACI 240
           ELAREVFDKMP R+V+SWSSIIGAYAQN QYGLGVSLFSLML+EGFQPNRSVMLNVMACI
Sbjct: 181 ELAREVFDKMPGRNVVSWSSIIGAYAQNAQYGLGVSLFSLMLSEGFQPNRSVMLNVMACI 240

Query: 241 HSETEADDVYRMAMDNQLGLDQSVQNAAVGMYARCGRIDTAEKIFNRIHNKDLVSWTSMI 300
            SE EADDVYRMA+D +LGLDQSVQNAAVGMYARCG+IDTA+ IFN IHNKDLVSW SMI
Sbjct: 241 QSEKEADDVYRMAVDYKLGLDQSVQNAAVGMYARCGKIDTAQNIFNGIHNKDLVSWASMI 300

Query: 301 EAYVQADLPLKALEIFREMILKGIMPDGITLLGVIHACLALGSFSQACWVHGLVIRRFSE 360
           EAYVQADLPL AL+ FREMIL GI+PD ITLLGVIHACLALG FSQACW+HG VIRR  E
Sbjct: 301 EAYVQADLPLNALKTFREMILMGILPDSITLLGVIHACLALGCFSQACWLHGFVIRRSFE 360

Query: 361 NQKMVETAIVDLYVKCGSLIYARKVFDNMQERNVISWSTMISGYGLHGHGRKAICLFNEM 420
           NQ +VETAI+DLYVKCGSLIYARKVFDNM+ERNVISWSTMISGYGLHGHGRKAICLFNEM
Sbjct: 361 NQIVVETAIIDLYVKCGSLIYARKVFDNMRERNVISWSTMISGYGLHGHGRKAICLFNEM 420

Query: 421 KNSTKPDHITFVSLLAAWS 440
           KNSTKPDHITFVSLLAA S
Sbjct: 421 KNSTKPDHITFVSLLAACS 439

BLAST of HG10003654 vs. NCBI nr
Match: KAG7025143.1 (putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 773.1 bits (1995), Expect = 1.3e-219
Identity = 384/439 (87.47%), Postives = 412/439 (93.85%), Query Frame = 0

Query: 1   MSVIRRRTLFFSRRLYHSSLQFPVHPRPIDSESCTFFIKTCSSIKSLKCVHASILRANLH 60
           M VIR R+L F RRL HSSLQ PV PRPIDSESCT +IK CS+IKSLKCVHASIL+ANLH
Sbjct: 1   MIVIRCRSLLFFRRLCHSSLQHPVSPRPIDSESCTNYIKNCSTIKSLKCVHASILKANLH 60

Query: 61  LNLFFCTTLISHYASLGSVSYAYSLFSLLQSLDVFLWNVMLRCFVDAGCHRRAMLLYAQM 120
           LNLFFCTTLIS YASLGSVSYAYSLFSLLQSLDVFLWNVMLR FVDAG +R+AMLLYAQM
Sbjct: 61  LNLFFCTTLISQYASLGSVSYAYSLFSLLQSLDVFLWNVMLRGFVDAGFYRKAMLLYAQM 120

Query: 121 LDLGIRPNNFTFPFVFKACGCVEDLDFGVKVHHDAVYFGYELDVFVANSLIAMYGRCGRS 180
           LDLGIRP+NFTFPFVFKACGCV+DLDFGV+VH+DAV FGYELDVFVANSLIAMYGRC RS
Sbjct: 121 LDLGIRPDNFTFPFVFKACGCVQDLDFGVRVHYDAVNFGYELDVFVANSLIAMYGRCARS 180

Query: 181 ELAREVFDKMPQRSVLSWSSIIGAYAQNGQYGLGVSLFSLMLNEGFQPNRSVMLNVMACI 240
           ELAREVFDKMP+R+V+SWSSIIGAYAQNGQY LGVSLFSLML EGFQ NRSV+LNVMAC+
Sbjct: 181 ELAREVFDKMPERNVVSWSSIIGAYAQNGQYSLGVSLFSLMLIEGFQLNRSVLLNVMACV 240

Query: 241 HSETEADDVYRMAMDNQLGLDQSVQNAAVGMYARCGRIDTAEKIFNRIHNKDLVSWTSMI 300
           HSE EADDV+RMAMD++LGL+QSVQNAAVGMYARCGRIDTAE+IFN IHNKDLVSW SMI
Sbjct: 241 HSEKEADDVFRMAMDHELGLNQSVQNAAVGMYARCGRIDTAEEIFNGIHNKDLVSWASMI 300

Query: 301 EAYVQADLPLKALEIFREMILKGIMPDGITLLGVIHACLALGSFSQACWVHGLVIRRFSE 360
           EAYVQADLPLKA+EIFREMILKG++PD ITLLGVI ACLALGSFSQAC+VHG VIRRF  
Sbjct: 301 EAYVQADLPLKAMEIFREMILKGLLPDSITLLGVIRACLALGSFSQACFVHGFVIRRFFG 360

Query: 361 NQKMVETAIVDLYVKCGSLIYARKVFDNMQERNVISWSTMISGYGLHGHGRKAICLFNEM 420
           NQ +VETAIVDLYVKCGSLIYARKVFDNM+ERNVISWSTMISGYGLHGHGRKAICLFNEM
Sbjct: 361 NQVVVETAIVDLYVKCGSLIYARKVFDNMKERNVISWSTMISGYGLHGHGRKAICLFNEM 420

Query: 421 KNSTKPDHITFVSLLAAWS 440
           KN+TKPDHITFVS+LAA S
Sbjct: 421 KNTTKPDHITFVSILAACS 439

BLAST of HG10003654 vs. NCBI nr
Match: XP_022925426.1 (pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 769.2 bits (1985), Expect = 1.9e-218
Identity = 383/439 (87.24%), Postives = 411/439 (93.62%), Query Frame = 0

Query: 1   MSVIRRRTLFFSRRLYHSSLQFPVHPRPIDSESCTFFIKTCSSIKSLKCVHASILRANLH 60
           M VIR R+L F RRL HSSLQ PV PRPID ESCT +IK CS+IKSLKCVHASIL+ANLH
Sbjct: 1   MIVIRCRSLLFFRRLCHSSLQHPVSPRPIDCESCTNYIKNCSTIKSLKCVHASILKANLH 60

Query: 61  LNLFFCTTLISHYASLGSVSYAYSLFSLLQSLDVFLWNVMLRCFVDAGCHRRAMLLYAQM 120
           LNLFFCTTLIS YASLGSVSYAYSLFSLLQSLDVFLWNVMLR FVDAG +R+AMLLYAQM
Sbjct: 61  LNLFFCTTLISQYASLGSVSYAYSLFSLLQSLDVFLWNVMLRGFVDAGFYRKAMLLYAQM 120

Query: 121 LDLGIRPNNFTFPFVFKACGCVEDLDFGVKVHHDAVYFGYELDVFVANSLIAMYGRCGRS 180
           LDLGIRP+NFTFPFVFKACG V+DLDFGV+VH+DAV FGYELDVFVANSLIAMYGRCGRS
Sbjct: 121 LDLGIRPDNFTFPFVFKACGFVQDLDFGVRVHYDAVNFGYELDVFVANSLIAMYGRCGRS 180

Query: 181 ELAREVFDKMPQRSVLSWSSIIGAYAQNGQYGLGVSLFSLMLNEGFQPNRSVMLNVMACI 240
           ELAREVFDKMP+R+V+SWSSIIGAYAQNGQY LGVSLFSLML EGFQ NRSV+LNVMAC+
Sbjct: 181 ELAREVFDKMPERNVVSWSSIIGAYAQNGQYSLGVSLFSLMLIEGFQLNRSVLLNVMACV 240

Query: 241 HSETEADDVYRMAMDNQLGLDQSVQNAAVGMYARCGRIDTAEKIFNRIHNKDLVSWTSMI 300
           HSE EADDV+RMAMD++LGL+QSVQNAAVGMYARCGRIDTAE+IFN IHNKDLVSW SMI
Sbjct: 241 HSEKEADDVFRMAMDHELGLNQSVQNAAVGMYARCGRIDTAEEIFNGIHNKDLVSWASMI 300

Query: 301 EAYVQADLPLKALEIFREMILKGIMPDGITLLGVIHACLALGSFSQACWVHGLVIRRFSE 360
           EAYVQADLPLKA+EIFREMILKG++PD ITLLGVI ACLALGSFSQAC+VHG VIRRF  
Sbjct: 301 EAYVQADLPLKAMEIFREMILKGLLPDSITLLGVIRACLALGSFSQACFVHGFVIRRFFG 360

Query: 361 NQKMVETAIVDLYVKCGSLIYARKVFDNMQERNVISWSTMISGYGLHGHGRKAICLFNEM 420
           NQ +VETAIVDLYVKCGSLIYARKVFDNM+ERNVISWSTMISGYGLHGHGRKAICLFNEM
Sbjct: 361 NQVVVETAIVDLYVKCGSLIYARKVFDNMKERNVISWSTMISGYGLHGHGRKAICLFNEM 420

Query: 421 KNSTKPDHITFVSLLAAWS 440
           KN+TKPDHITFVS+LAA S
Sbjct: 421 KNTTKPDHITFVSILAACS 439

BLAST of HG10003654 vs. NCBI nr
Match: KAG6592321.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 768.5 bits (1983), Expect = 3.3e-218
Identity = 382/439 (87.02%), Postives = 412/439 (93.85%), Query Frame = 0

Query: 1   MSVIRRRTLFFSRRLYHSSLQFPVHPRPIDSESCTFFIKTCSSIKSLKCVHASILRANLH 60
           M VIR R+L F RRL HSSLQ PV PRPIDSESCT +IK CS+IKSLKCVHASIL+ANLH
Sbjct: 1   MIVIRCRSLLFFRRLCHSSLQHPVSPRPIDSESCTNYIKNCSTIKSLKCVHASILKANLH 60

Query: 61  LNLFFCTTLISHYASLGSVSYAYSLFSLLQSLDVFLWNVMLRCFVDAGCHRRAMLLYAQM 120
           LNLFFCTTLIS YASLGSVSYAYSLFSLLQSLDVFLWNVMLR FVDAG +R+AMLLYAQM
Sbjct: 61  LNLFFCTTLISQYASLGSVSYAYSLFSLLQSLDVFLWNVMLRGFVDAGFYRKAMLLYAQM 120

Query: 121 LDLGIRPNNFTFPFVFKACGCVEDLDFGVKVHHDAVYFGYELDVFVANSLIAMYGRCGRS 180
           LDLGIRP+NFTFPFVFKACG V+DLDFGV+VH+DAV FGYELDVFVANSLIAMYGRCGRS
Sbjct: 121 LDLGIRPDNFTFPFVFKACGFVQDLDFGVRVHYDAVNFGYELDVFVANSLIAMYGRCGRS 180

Query: 181 ELAREVFDKMPQRSVLSWSSIIGAYAQNGQYGLGVSLFSLMLNEGFQPNRSVMLNVMACI 240
           ELAREVFDK+P+R+V+SWSSIIGAYAQNGQY LGVSLFSLML EGFQ NRSV+LNVMAC+
Sbjct: 181 ELAREVFDKIPERNVVSWSSIIGAYAQNGQYSLGVSLFSLMLIEGFQLNRSVLLNVMACV 240

Query: 241 HSETEADDVYRMAMDNQLGLDQSVQNAAVGMYARCGRIDTAEKIFNRIHNKDLVSWTSMI 300
           HSE EADDV+RMAMD++LGL+QSVQNAAVGMYARCGRIDTA++IFN IHNKDLVSW SMI
Sbjct: 241 HSEKEADDVFRMAMDHELGLNQSVQNAAVGMYARCGRIDTAQEIFNGIHNKDLVSWASMI 300

Query: 301 EAYVQADLPLKALEIFREMILKGIMPDGITLLGVIHACLALGSFSQACWVHGLVIRRFSE 360
           EAYVQADLPLKA+EIFREMILKG++PD ITLLGVI ACLALGSFSQAC+VHG VIRRF  
Sbjct: 301 EAYVQADLPLKAMEIFREMILKGLLPDSITLLGVIRACLALGSFSQACFVHGFVIRRFFG 360

Query: 361 NQKMVETAIVDLYVKCGSLIYARKVFDNMQERNVISWSTMISGYGLHGHGRKAICLFNEM 420
           NQ +VETAIVDLYVKCGSLIYARKVFDNM+ERNVISWSTMISGYGLHGHGRKAICLFNEM
Sbjct: 361 NQVVVETAIVDLYVKCGSLIYARKVFDNMKERNVISWSTMISGYGLHGHGRKAICLFNEM 420

Query: 421 KNSTKPDHITFVSLLAAWS 440
           KN+TKPDHITFVS+LAA S
Sbjct: 421 KNTTKPDHITFVSILAACS 439

BLAST of HG10003654 vs. NCBI nr
Match: XP_023535836.1 (pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 762.3 bits (1967), Expect = 2.4e-216
Identity = 380/439 (86.56%), Postives = 408/439 (92.94%), Query Frame = 0

Query: 1   MSVIRRRTLFFSRRLYHSSLQFPVHPRPIDSESCTFFIKTCSSIKSLKCVHASILRANLH 60
           M VIR R+L F RRL HSSLQ PV PRPIDSESCT +IK CS+I+SLKCVHASIL+ANLH
Sbjct: 1   MIVIRCRSLLFFRRLCHSSLQLPVPPRPIDSESCTNYIKNCSTIESLKCVHASILKANLH 60

Query: 61  LNLFFCTTLISHYASLGSVSYAYSLFSLLQSLDVFLWNVMLRCFVDAGCHRRAMLLYAQM 120
           LNLFFCTTLIS YASLGSVSYAYSLFSLLQSLDVFLWNVMLR FVDAG +R+ M LYAQM
Sbjct: 61  LNLFFCTTLISQYASLGSVSYAYSLFSLLQSLDVFLWNVMLRGFVDAGFYRKVMFLYAQM 120

Query: 121 LDLGIRPNNFTFPFVFKACGCVEDLDFGVKVHHDAVYFGYELDVFVANSLIAMYGRCGRS 180
           LDLGIRP+NFTFPFVFKACGCV+DLDFGV+VH+DAV FGYELDVFVANSLIAMYGRCGRS
Sbjct: 121 LDLGIRPDNFTFPFVFKACGCVQDLDFGVRVHYDAVNFGYELDVFVANSLIAMYGRCGRS 180

Query: 181 ELAREVFDKMPQRSVLSWSSIIGAYAQNGQYGLGVSLFSLMLNEGFQPNRSVMLNVMACI 240
           ELAREVFDKMP+R+V+SWSSIIGAYAQNGQY LGVSLFSLML EGFQ NRSV+LNVMACI
Sbjct: 181 ELAREVFDKMPERNVVSWSSIIGAYAQNGQYSLGVSLFSLMLAEGFQLNRSVLLNVMACI 240

Query: 241 HSETEADDVYRMAMDNQLGLDQSVQNAAVGMYARCGRIDTAEKIFNRIHNKDLVSWTSMI 300
           HSE EADDV+RMAMD++LGL+QSVQNAAVGMYARCGRID A++IFN I NKDLVSW SMI
Sbjct: 241 HSEKEADDVFRMAMDHELGLNQSVQNAAVGMYARCGRIDKAQEIFNGIQNKDLVSWASMI 300

Query: 301 EAYVQADLPLKALEIFREMILKGIMPDGITLLGVIHACLALGSFSQACWVHGLVIRRFSE 360
           EAYVQA+LPLKALEIFRE+ILKGI+PD ITLLGVI ACLALGSFSQAC+VHG VIRR   
Sbjct: 301 EAYVQAELPLKALEIFRELILKGILPDSITLLGVIRACLALGSFSQACFVHGFVIRRLFG 360

Query: 361 NQKMVETAIVDLYVKCGSLIYARKVFDNMQERNVISWSTMISGYGLHGHGRKAICLFNEM 420
           NQ +VETAIVDLYVKCGSLIYARKVFDNM+ERNVISWSTMISGYGLHGHGRKAICLFNEM
Sbjct: 361 NQIVVETAIVDLYVKCGSLIYARKVFDNMKERNVISWSTMISGYGLHGHGRKAICLFNEM 420

Query: 421 KNSTKPDHITFVSLLAAWS 440
           KNSTKPDHITFVS+LAA S
Sbjct: 421 KNSTKPDHITFVSILAACS 439

BLAST of HG10003654 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 263.5 bits (672), Expect = 4.5e-69
Identity = 149/435 (34.25%), Postives = 243/435 (55.86%), Query Frame = 0

Query: 19  SLQFPVHPRPIDSE----SCTFFIKTCSSIKSL---KCVHASILRANLHLNLFFCTTLIS 78
           +LQF V  R  D E    + T+ +K C     L   K +H  ++++   L+LF  T L +
Sbjct: 119 ALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIHGLLVKSGFSLDLFAMTGLEN 178

Query: 79  HYASLGSVSYAYSLFSLLQSLDVFLWNVMLRCFVDAGCHRRAMLLYAQMLDLGIRPNNFT 138
            YA    V+ A  +F  +   D+  WN ++  +   G  R A+ +   M +  ++P+  T
Sbjct: 179 MYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMALEMVKSMCEENLKPSFIT 238

Query: 139 FPFVFKACGCVEDLDFGVKVHHDAVYFGYELDVFVANSLIAMYGRCGRSELAREVFDKMP 198
              V  A   +  +  G ++H  A+  G++  V ++ +L+ MY +CG  E AR++FD M 
Sbjct: 239 IVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGML 298

Query: 199 QRSVLSWSSIIGAYAQNGQYGLGVSLFSLMLNEGFQPNRSVMLNVMACIHSETEADDVYR 258
           +R+V+SW+S+I AY QN      + +F  ML+EG +P     ++VM  +H+  +  D+ R
Sbjct: 299 ERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTD---VSVMGALHACADLGDLER 358

Query: 259 ----MAMDNQLGLDQ--SVQNAAVGMYARCGRIDTAEKIFNRIHNKDLVSWTSMIEAYVQ 318
                 +  +LGLD+  SV N+ + MY +C  +DTA  +F ++ ++ LVSW +MI  + Q
Sbjct: 359 GRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQ 418

Query: 319 ADLPLKALEIFREMILKGIMPDGITLLGVIHACLALGSFSQACWVHGLVIRRFSENQKMV 378
              P+ AL  F +M  + + PD  T + VI A   L     A W+HG+V+R   +    V
Sbjct: 419 NGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFV 478

Query: 379 ETAIVDLYVKCGSLIYARKVFDNMQERNVISWSTMISGYGLHGHGRKAICLFNEMKNST- 438
            TA+VD+Y KCG+++ AR +FD M ER+V +W+ MI GYG HG G+ A+ LF EM+  T 
Sbjct: 479 TTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTI 538

Query: 439 KPDHITFVSLLAAWS 440
           KP+ +TF+S+++A S
Sbjct: 539 KPNGVTFLSVISACS 550

BLAST of HG10003654 vs. ExPASy Swiss-Prot
Match: Q9LTV8 (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 255.4 bits (651), Expect = 1.2e-66
Identity = 135/397 (34.01%), Postives = 224/397 (56.42%), Query Frame = 0

Query: 47  LKCVHASILRANLHLNLFFCTTLISHYASLGSVSYAYSLFSLLQSLDVFLWNVMLRCFVD 106
           LK +HA +L   L  + F  T LI   +S G +++A  +F  L    +F WN ++R +  
Sbjct: 37  LKQIHARLLVLGLQFSGFLITKLIHASSSFGDITFARQVFDDLPRPQIFPWNAIIRGYSR 96

Query: 107 AGCHRRAMLLYAQMLDLGIRPNNFTFPFVFKACGCVEDLDFGVKVHHDAVYFGYELDVFV 166
               + A+L+Y+ M    + P++FTFP + KAC  +  L  G  VH      G++ DVFV
Sbjct: 97  NNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVHAQVFRLGFDADVFV 156

Query: 167 ANSLIAMYGRCGRSELAREVFD--KMPQRSVLSWSSIIGAYAQNGQYGLGVSLFSLMLNE 226
            N LIA+Y +C R   AR VF+   +P+R+++SW++I+ AYAQNG+    + +FS M   
Sbjct: 157 QNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPMEALEIFSQMRKM 216

Query: 227 GFQPNRSVM---LNVMACIHSETEADDVYRMAMDNQLGLDQSVQNAAVGMYARCGRIDTA 286
             +P+   +   LN   C+    +   ++   +   L ++  +  +   MYA+CG++ TA
Sbjct: 217 DVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLISLNTMYAKCGQVATA 276

Query: 287 EKIFNRIHNKDLVSWTSMIEAYVQADLPLKALEIFREMILKGIMPDGITLLGVIHACLAL 346
           + +F+++ + +L+ W +MI  Y +     +A+++F EMI K + PD I++   I AC  +
Sbjct: 277 KILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMINKDVRPDTISITSAISACAQV 336

Query: 347 GSFSQACWVHGLVIRRFSENQKMVETAIVDLYVKCGSLIYARKVFDNMQERNVISWSTMI 406
           GS  QA  ++  V R    +   + +A++D++ KCGS+  AR VFD   +R+V+ WS MI
Sbjct: 337 GSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAMI 396

Query: 407 SGYGLHGHGRKAICLFNEM-KNSTKPDHITFVSLLAA 438
            GYGLHG  R+AI L+  M +    P+ +TF+ LL A
Sbjct: 397 VGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLMA 433

BLAST of HG10003654 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 247.7 bits (631), Expect = 2.6e-64
Identity = 140/445 (31.46%), Postives = 236/445 (53.03%), Query Frame = 0

Query: 38  IKTCSSIKSLKCVHASILRANLHLNLFFCTTLIS------HYASLGSVSYAYSLFSLLQS 97
           +  C +++SL+ +HA +++  LH   +  + LI       H+  L    YA S+F  +Q 
Sbjct: 40  LHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGL---PYAISVFKTIQE 99

Query: 98  LDVFLWNVMLRCFVDAGCHRRAMLLYAQMLDLGIRPNNFTFPFVFKACGCVEDLDFGVKV 157
            ++ +WN M R    +     A+ LY  M+ LG+ PN++TFPFV K+C   +    G ++
Sbjct: 100 PNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQI 159

Query: 158 HHDAVYFGYELDVFVANSLIAMYGRCGRSELAREVFDKMPQR------------------ 217
           H   +  G +LD++V  SLI+MY + GR E A +VFDK P R                  
Sbjct: 160 HGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGYI 219

Query: 218 -------------SVLSWSSIIGAYAQNGQYGLGVSLFSLMLNEGFQPNRSVMLNVM-AC 277
                         V+SW+++I  YA+ G Y   + LF  M+    +P+ S M+ V+ AC
Sbjct: 220 ENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSAC 279

Query: 278 IHSET--EADDVYRMAMDNQLGLDQSVQNAAVGMYARCGRIDTAEKIFNRIHNKDLVSWT 337
             S +      V+    D+  G +  + NA + +Y++CG ++TA  +F R+  KD++SW 
Sbjct: 280 AQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISWN 339

Query: 338 SMIEAYVQADLPLKALEIFREMILKGIMPDGITLLGVIHACLALGSFSQACWVHGLVIRR 397
           ++I  Y   +L  +AL +F+EM+  G  P+ +T+L ++ AC  LG+     W+H  + +R
Sbjct: 340 TLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKR 399

Query: 398 FS--ENQKMVETAIVDLYVKCGSLIYARKVFDNMQERNVISWSTMISGYGLHGHGRKAIC 440
                N   + T+++D+Y KCG +  A +VF+++  +++ SW+ MI G+ +HG    +  
Sbjct: 400 LKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFD 459

BLAST of HG10003654 vs. ExPASy Swiss-Prot
Match: Q9LFL5 (Pentatricopeptide repeat-containing protein At5g16860 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H92 PE=2 SV=1)

HSP 1 Score: 238.0 bits (606), Expect = 2.0e-61
Identity = 146/454 (32.16%), Postives = 235/454 (51.76%), Query Frame = 0

Query: 37  FIKTCSSIKSLKCVHASILRANLHLNLFFCTTLISHYASLGSVSYAYSLFSLLQSLD--V 96
           FI  C +I  +K +H  +L   + L L   + LIS Y S+G +S+A SL       D  V
Sbjct: 34  FIHKCKTISQVKLIHQKLLSFGI-LTLNLTSHLISTYISVGCLSHAVSLLRRFPPSDAGV 93

Query: 97  FLWNVMLRCFVDAGCHRRAMLLYAQMLDLGIRPNNFTFPFVFKACGCVEDLDFGVKVHHD 156
           + WN ++R + D GC  + + L+  M  L   P+N+TFPFVFKACG +  +  G   H  
Sbjct: 94  YHWNSLIRSYGDNGCANKCLYLFGLMHSLSWTPDNYTFPFVFKACGEISSVRCGESAHAL 153

Query: 157 AVYFGYELDVFVANSLIAMYGRCGRSELAREVFDKMPQRSVLSWSSIIGAYAQNGQYGLG 216
           ++  G+  +VFV N+L+AMY RC     AR+VFD+M    V+SW+SII +YA+ G+  + 
Sbjct: 154 SLVTGFISNVFVGNALVAMYSRCRSLSDARKVFDEMSVWDVVSWNSIIESYAKLGKPKVA 213

Query: 217 VSLFSLMLNE-GFQPNRSVMLNVM---ACIHSETEADDVYRMAMDNQLGLDQSVQNAAVG 276
           + +FS M NE G +P+   ++NV+   A + + +    ++  A+ +++  +  V N  V 
Sbjct: 214 LEMFSRMTNEFGCRPDNITLVNVLPPCASLGTHSLGKQLHCFAVTSEMIQNMFVGNCLVD 273

Query: 277 MYARCGRIDTAEKIFNRIHNKDLVSWTSM------------------------------- 336
           MYA+CG +D A  +F+ +  KD+VSW +M                               
Sbjct: 274 MYAKCGMMDEANTVFSNMSVKDVVSWNAMVAGYSQIGRFEDAVRLFEKMQEEKIKMDVVT 333

Query: 337 ----IEAYVQADLPLKALEIFREMILKGIMPDGITLLGVIHACLALGSFSQACWVHGLVI 396
               I  Y Q  L  +AL + R+M+  GI P+ +TL+ V+  C ++G+      +H   I
Sbjct: 334 WSAAISGYAQRGLGYEALGVCRQMLSSGIKPNEVTLISVLSGCASVGALMHGKEIHCYAI 393

Query: 397 -------RRFSENQKMVETAIVDLYVKCGSLIYARKVFDNM--QERNVISWSTMISGYGL 438
                  +    ++ MV   ++D+Y KC  +  AR +FD++  +ER+V++W+ MI GY  
Sbjct: 394 KYPIDLRKNGHGDENMVINQLIDMYAKCKKVDTARAMFDSLSPKERDVVTWTVMIGGYSQ 453

BLAST of HG10003654 vs. ExPASy Swiss-Prot
Match: O64705 (Pentatricopeptide repeat-containing protein At2g34400 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E23 PE=3 SV=2)

HSP 1 Score: 236.1 bits (601), Expect = 7.7e-61
Identity = 133/406 (32.76%), Postives = 223/406 (54.93%), Query Frame = 0

Query: 36  FFIKTCSSIKSLKCVHASILRANLHLNLFFCTTLISHYASLGSVSYAYSLFSLLQSLDVF 95
           F +K C S+  L+ + A +L  ++    F    LI     LG  +Y+  LFS+ +  + +
Sbjct: 42  FLLKKCISVNQLRQIQAQMLLHSVEKPNF----LIPKAVELGDFNYSSFLFSVTEEPNHY 101

Query: 96  LWNVMLRCFVDA-GCHRRAMLLYAQMLDLGIRPNNFTFPFVFKACGCVEDLDFGVKVHHD 155
            +N M+R   +    H  A+ LY +M   G++P+ FT+ FVF AC  +E++  G  VH  
Sbjct: 102 SFNYMIRGLTNTWNDHEAALSLYRRMKFSGLKPDKFTYNFVFIACAKLEEIGVGRSVHSS 161

Query: 156 AVYFGYELDVFVANSLIAMYGRCGRSELAREVFDKMPQRSVLSWSSIIGAYAQNGQYGLG 215
               G E DV + +SLI MY +CG+   AR++FD++ +R  +SW+S+I  Y++ G     
Sbjct: 162 LFKVGLERDVHINHSLIMMYAKCGQVGYARKLFDEITERDTVSWNSMISGYSEAGYAKDA 221

Query: 216 VSLFSLMLNEGFQPNRSVMLNVM-ACIH--SETEADDVYRMAMDNQLGLDQSVQNAAVGM 275
           + LF  M  EGF+P+   +++++ AC H         +  MA+  ++GL   + +  + M
Sbjct: 222 MDLFRKMEEEGFEPDERTLVSMLGACSHLGDLRTGRLLEEMAITKKIGLSTFLGSKLISM 281

Query: 276 YARCGRIDTAEKIFNRIHNKDLVSWTSMIEAYVQADLPLKALEIFREMILKGIMPDGITL 335
           Y +CG +D+A ++FN++  KD V+WT+MI  Y Q     +A ++F EM   G+ PD  TL
Sbjct: 282 YGKCGDLDSARRVFNQMIKKDRVAWTAMITVYSQNGKSSEAFKLFFEMEKTGVSPDAGTL 341

Query: 336 LGVIHACLALGSFSQACWVHGLVIRRFSENQKMVETAIVDLYVKCGSLIYARKVFDNMQE 395
             V+ AC ++G+      +         ++   V T +VD+Y KCG +  A +VF+ M  
Sbjct: 342 STVLSACGSVGALELGKQIETHASELSLQHNIYVATGLVDMYGKCGRVEEALRVFEAMPV 401

Query: 396 RNVISWSTMISGYGLHGHGRKAICLFNEMKNSTKPDHITFVSLLAA 438
           +N  +W+ MI+ Y   GH ++A+ LF+ M  S  P  ITF+ +L+A
Sbjct: 402 KNEATWNAMITAYAHQGHAKEALLLFDRM--SVPPSDITFIGVLSA 441

BLAST of HG10003654 vs. ExPASy TrEMBL
Match: A0A6J1EC65 (pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111432725 PE=3 SV=1)

HSP 1 Score: 769.2 bits (1985), Expect = 9.4e-219
Identity = 383/439 (87.24%), Postives = 411/439 (93.62%), Query Frame = 0

Query: 1   MSVIRRRTLFFSRRLYHSSLQFPVHPRPIDSESCTFFIKTCSSIKSLKCVHASILRANLH 60
           M VIR R+L F RRL HSSLQ PV PRPID ESCT +IK CS+IKSLKCVHASIL+ANLH
Sbjct: 1   MIVIRCRSLLFFRRLCHSSLQHPVSPRPIDCESCTNYIKNCSTIKSLKCVHASILKANLH 60

Query: 61  LNLFFCTTLISHYASLGSVSYAYSLFSLLQSLDVFLWNVMLRCFVDAGCHRRAMLLYAQM 120
           LNLFFCTTLIS YASLGSVSYAYSLFSLLQSLDVFLWNVMLR FVDAG +R+AMLLYAQM
Sbjct: 61  LNLFFCTTLISQYASLGSVSYAYSLFSLLQSLDVFLWNVMLRGFVDAGFYRKAMLLYAQM 120

Query: 121 LDLGIRPNNFTFPFVFKACGCVEDLDFGVKVHHDAVYFGYELDVFVANSLIAMYGRCGRS 180
           LDLGIRP+NFTFPFVFKACG V+DLDFGV+VH+DAV FGYELDVFVANSLIAMYGRCGRS
Sbjct: 121 LDLGIRPDNFTFPFVFKACGFVQDLDFGVRVHYDAVNFGYELDVFVANSLIAMYGRCGRS 180

Query: 181 ELAREVFDKMPQRSVLSWSSIIGAYAQNGQYGLGVSLFSLMLNEGFQPNRSVMLNVMACI 240
           ELAREVFDKMP+R+V+SWSSIIGAYAQNGQY LGVSLFSLML EGFQ NRSV+LNVMAC+
Sbjct: 181 ELAREVFDKMPERNVVSWSSIIGAYAQNGQYSLGVSLFSLMLIEGFQLNRSVLLNVMACV 240

Query: 241 HSETEADDVYRMAMDNQLGLDQSVQNAAVGMYARCGRIDTAEKIFNRIHNKDLVSWTSMI 300
           HSE EADDV+RMAMD++LGL+QSVQNAAVGMYARCGRIDTAE+IFN IHNKDLVSW SMI
Sbjct: 241 HSEKEADDVFRMAMDHELGLNQSVQNAAVGMYARCGRIDTAEEIFNGIHNKDLVSWASMI 300

Query: 301 EAYVQADLPLKALEIFREMILKGIMPDGITLLGVIHACLALGSFSQACWVHGLVIRRFSE 360
           EAYVQADLPLKA+EIFREMILKG++PD ITLLGVI ACLALGSFSQAC+VHG VIRRF  
Sbjct: 301 EAYVQADLPLKAMEIFREMILKGLLPDSITLLGVIRACLALGSFSQACFVHGFVIRRFFG 360

Query: 361 NQKMVETAIVDLYVKCGSLIYARKVFDNMQERNVISWSTMISGYGLHGHGRKAICLFNEM 420
           NQ +VETAIVDLYVKCGSLIYARKVFDNM+ERNVISWSTMISGYGLHGHGRKAICLFNEM
Sbjct: 361 NQVVVETAIVDLYVKCGSLIYARKVFDNMKERNVISWSTMISGYGLHGHGRKAICLFNEM 420

Query: 421 KNSTKPDHITFVSLLAAWS 440
           KN+TKPDHITFVS+LAA S
Sbjct: 421 KNTTKPDHITFVSILAACS 439

BLAST of HG10003654 vs. ExPASy TrEMBL
Match: A0A6J1ID68 (pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111472031 PE=3 SV=1)

HSP 1 Score: 761.5 bits (1965), Expect = 2.0e-216
Identity = 380/439 (86.56%), Postives = 408/439 (92.94%), Query Frame = 0

Query: 1   MSVIRRRTLFFSRRLYHSSLQFPVHPRPIDSESCTFFIKTCSSIKSLKCVHASILRANLH 60
           M VIR R+L F RRL HSSLQ PV PRPIDSESCT  IK CS+IKSLKCVH SIL+ANLH
Sbjct: 1   MIVIRCRSLLFFRRLCHSSLQLPVPPRPIDSESCTNCIKNCSTIKSLKCVHTSILKANLH 60

Query: 61  LNLFFCTTLISHYASLGSVSYAYSLFSLLQSLDVFLWNVMLRCFVDAGCHRRAMLLYAQM 120
           LNLFFCTTLIS YASLGSVSYAYSLFSLLQSLDVFLWNVMLR FVDAG +R+AMLLYAQM
Sbjct: 61  LNLFFCTTLISQYASLGSVSYAYSLFSLLQSLDVFLWNVMLRGFVDAGFYRKAMLLYAQM 120

Query: 121 LDLGIRPNNFTFPFVFKACGCVEDLDFGVKVHHDAVYFGYELDVFVANSLIAMYGRCGRS 180
           LDLGIRP+NFTFPFVFKACGCV+DLDFGV+VH+D+V FGYELDVFVANSLIAMYGRCGRS
Sbjct: 121 LDLGIRPDNFTFPFVFKACGCVQDLDFGVRVHYDSVNFGYELDVFVANSLIAMYGRCGRS 180

Query: 181 ELAREVFDKMPQRSVLSWSSIIGAYAQNGQYGLGVSLFSLMLNEGFQPNRSVMLNVMACI 240
           ELAREVFDKMP+R+V+SWSSIIGAYAQNGQY LGVSLFSLML EGFQ NRSV+LNVMACI
Sbjct: 181 ELAREVFDKMPERNVVSWSSIIGAYAQNGQYSLGVSLFSLMLAEGFQLNRSVLLNVMACI 240

Query: 241 HSETEADDVYRMAMDNQLGLDQSVQNAAVGMYARCGRIDTAEKIFNRIHNKDLVSWTSMI 300
           HSE EADDV RMAMD++LGL+QSVQNAAVGMYARCGRIDTA++IFN IH+KDLVSW SMI
Sbjct: 241 HSEKEADDVCRMAMDHELGLNQSVQNAAVGMYARCGRIDTAQEIFNGIHSKDLVSWASMI 300

Query: 301 EAYVQADLPLKALEIFREMILKGIMPDGITLLGVIHACLALGSFSQACWVHGLVIRRFSE 360
           EAYVQADLPLKA+EIFREM LKG++PD ITLLGVI ACLALGSFSQAC+VHG VIRR   
Sbjct: 301 EAYVQADLPLKAMEIFREMTLKGLLPDSITLLGVIRACLALGSFSQACFVHGFVIRRLFG 360

Query: 361 NQKMVETAIVDLYVKCGSLIYARKVFDNMQERNVISWSTMISGYGLHGHGRKAICLFNEM 420
           NQ +VETAIVDLYVKCGSLIYARKVFDN++ERNVISWSTMISGYGLHGHGRKAICLFNEM
Sbjct: 361 NQIVVETAIVDLYVKCGSLIYARKVFDNIKERNVISWSTMISGYGLHGHGRKAICLFNEM 420

Query: 421 KNSTKPDHITFVSLLAAWS 440
           KNSTKPDHITFVS+LAA S
Sbjct: 421 KNSTKPDHITFVSILAACS 439

BLAST of HG10003654 vs. ExPASy TrEMBL
Match: A0A6J1EBP0 (pentatricopeptide repeat-containing protein At2g01510, mitochondrial-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111432725 PE=3 SV=1)

HSP 1 Score: 596.3 bits (1536), Expect = 1.1e-166
Identity = 314/439 (71.53%), Postives = 337/439 (76.77%), Query Frame = 0

Query: 1   MSVIRRRTLFFSRRLYHSSLQFPVHPRPIDSESCTFFIKTCSSIKSLKCVHASILRANLH 60
           M VIR R+L F RRL HSSLQ PV PRPID ESCT +IK CS+IKSLKCVHASIL+ANLH
Sbjct: 1   MIVIRCRSLLFFRRLCHSSLQHPVSPRPIDCESCTNYIKNCSTIKSLKCVHASILKANLH 60

Query: 61  LNLFFCTTLISHYASLGSVSYAYSLFSLLQSLDVFLWNVMLRCFVDAGCHRRAMLLYAQM 120
           LNLFFCTTLIS YASLGSVSYAYSLFSLLQSLDVFLWNVMLR FVDAG +R+AMLLYAQM
Sbjct: 61  LNLFFCTTLISQYASLGSVSYAYSLFSLLQSLDVFLWNVMLRGFVDAGFYRKAMLLYAQM 120

Query: 121 LDLGIRPNNFTFPFVFKACGCVEDLDFGVKVHHDAVYFGYELDVFVANSLIAMYGRCGRS 180
           LDLGIRP+NFTFPFVFKACG V+DLDFGV+VH+DAV FGYELDVFVANSLIAMYGRCGRS
Sbjct: 121 LDLGIRPDNFTFPFVFKACGFVQDLDFGVRVHYDAVNFGYELDVFVANSLIAMYGRCGRS 180

Query: 181 ELAREVFDKMPQRSVLSWSSIIGAYAQNGQYGLGVSLFSLMLNEGFQPNRSVMLNVMACI 240
           ELAREVFDKMP+R+V+SWSSIIGAYAQNGQY LGVSLFSLML EGFQ NRSV+LNVMAC+
Sbjct: 181 ELAREVFDKMPERNVVSWSSIIGAYAQNGQYSLGVSLFSLMLIEGFQLNRSVLLNVMACV 240

Query: 241 HSETEADDVYRMAMDNQLGLDQSVQNAAVGMYARCGRIDTAEKIFNRIHNKDLVSWTSMI 300
           HSE EADDV+RMAMD++LGL+QSVQNAAVGMYARCGRIDTAE+IFN IHNKDLVSW SMI
Sbjct: 241 HSEKEADDVFRMAMDHELGLNQSVQNAAVGMYARCGRIDTAEEIFNGIHNKDLVSWASMI 300

Query: 301 EAYVQADLPLKALEIFREMILKGIMPDGITLLGVIHACLALGSFSQACWVHGLVIRRFSE 360
           EAYVQADLPLKA+EIFREMILK                                      
Sbjct: 301 EAYVQADLPLKAMEIFREMILK-------------------------------------- 359

Query: 361 NQKMVETAIVDLYVKCGSLIYARKVFDNMQERNVISWSTMISGYGLHGHGRKAICLFNEM 420
                                                     GYGLHGHGRKAICLFNEM
Sbjct: 361 ------------------------------------------GYGLHGHGRKAICLFNEM 359

Query: 421 KNSTKPDHITFVSLLAAWS 440
           KN+TKPDHITFVS+LAA S
Sbjct: 421 KNTTKPDHITFVSILAACS 359

BLAST of HG10003654 vs. ExPASy TrEMBL
Match: A0A6J1IEQ3 (pentatricopeptide repeat-containing protein At2g01510, mitochondrial-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111472031 PE=3 SV=1)

HSP 1 Score: 592.8 bits (1527), Expect = 1.2e-165
Identity = 313/439 (71.30%), Postives = 335/439 (76.31%), Query Frame = 0

Query: 1   MSVIRRRTLFFSRRLYHSSLQFPVHPRPIDSESCTFFIKTCSSIKSLKCVHASILRANLH 60
           M VIR R+L F RRL HSSLQ PV PRPIDSESCT  IK CS+IKSLKCVH SIL+ANLH
Sbjct: 1   MIVIRCRSLLFFRRLCHSSLQLPVPPRPIDSESCTNCIKNCSTIKSLKCVHTSILKANLH 60

Query: 61  LNLFFCTTLISHYASLGSVSYAYSLFSLLQSLDVFLWNVMLRCFVDAGCHRRAMLLYAQM 120
           LNLFFCTTLIS YASLGSVSYAYSLFSLLQSLDVFLWNVMLR FVDAG +R+AMLLYAQM
Sbjct: 61  LNLFFCTTLISQYASLGSVSYAYSLFSLLQSLDVFLWNVMLRGFVDAGFYRKAMLLYAQM 120

Query: 121 LDLGIRPNNFTFPFVFKACGCVEDLDFGVKVHHDAVYFGYELDVFVANSLIAMYGRCGRS 180
           LDLGIRP+NFTFPFVFKACGCV+DLDFGV+VH+D+V FGYELDVFVANSLIAMYGRCGRS
Sbjct: 121 LDLGIRPDNFTFPFVFKACGCVQDLDFGVRVHYDSVNFGYELDVFVANSLIAMYGRCGRS 180

Query: 181 ELAREVFDKMPQRSVLSWSSIIGAYAQNGQYGLGVSLFSLMLNEGFQPNRSVMLNVMACI 240
           ELAREVFDKMP+R+V+SWSSIIGAYAQNGQY LGVSLFSLML EGFQ NRSV+LNVMACI
Sbjct: 181 ELAREVFDKMPERNVVSWSSIIGAYAQNGQYSLGVSLFSLMLAEGFQLNRSVLLNVMACI 240

Query: 241 HSETEADDVYRMAMDNQLGLDQSVQNAAVGMYARCGRIDTAEKIFNRIHNKDLVSWTSMI 300
           HSE EADDV RMAMD++LGL+QSVQNAAVGMYARCGRIDTA++IFN IH+KDLVSW SMI
Sbjct: 241 HSEKEADDVCRMAMDHELGLNQSVQNAAVGMYARCGRIDTAQEIFNGIHSKDLVSWASMI 300

Query: 301 EAYVQADLPLKALEIFREMILKGIMPDGITLLGVIHACLALGSFSQACWVHGLVIRRFSE 360
           EAYVQADLPLKA+EIFREM LK                                      
Sbjct: 301 EAYVQADLPLKAMEIFREMTLK-------------------------------------- 359

Query: 361 NQKMVETAIVDLYVKCGSLIYARKVFDNMQERNVISWSTMISGYGLHGHGRKAICLFNEM 420
                                                     GYGLHGHGRKAICLFNEM
Sbjct: 361 ------------------------------------------GYGLHGHGRKAICLFNEM 359

Query: 421 KNSTKPDHITFVSLLAAWS 440
           KNSTKPDHITFVS+LAA S
Sbjct: 421 KNSTKPDHITFVSILAACS 359

BLAST of HG10003654 vs. ExPASy TrEMBL
Match: A0A2I4EAK9 (pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like OS=Juglans regia OX=51240 GN=LOC108987857 PE=3 SV=1)

HSP 1 Score: 546.6 bits (1407), Expect = 9.9e-152
Identity = 262/427 (61.36%), Postives = 340/427 (79.63%), Query Frame = 0

Query: 15  LYHSSLQFPVHPRPIDSESCTFFIKTCSSIKSLKCVHASILRANLHLNLFFCTTLISHYA 74
           LYHS  Q      PI+ ++C   IK C +I+SLK VHAS+LR++LH NLFF T LIS YA
Sbjct: 19  LYHSEAQIKFLLGPIEPDTCVSLIKQCRTIQSLKSVHASMLRSHLHSNLFFSTNLISQYA 78

Query: 75  SLGSVSYAYSLFSLLQSLDVFLWNVMLRCFVDAGCHRRAMLLYAQMLDLGIRPNNFTFPF 134
           SLGS+S+AYSLFS  QS DVFLWNVMLR FVD G + R+MLLY +ML  GI+P+NFT+PF
Sbjct: 79  SLGSMSHAYSLFSTTQSSDVFLWNVMLRGFVDNGLYNRSMLLYRKMLLRGIQPDNFTYPF 138

Query: 135 VFKACGCVEDLDFGVKVHHDAVYFGYELDVFVANSLIAMYGRCGRSELAREVFDKMPQRS 194
           + KACGC  DL+FGV VH + +  GY+ DV V NSL+ MYG+C R +++R VFDK+ +RS
Sbjct: 139 ILKACGCFRDLEFGVIVHGNLIESGYDSDVVVGNSLVTMYGKCERLDISRLVFDKIAERS 198

Query: 195 VLSWSSIIGAYAQNGQYGLGVSLFSLMLNEGFQPNRSVMLNVMACIHSETEADDVYRMAM 254
           ++SWSS+IGA AQNGQY  G+SLFS ML+EG +PNR+++LNVM+C+H E +ADDV R+ +
Sbjct: 199 IVSWSSMIGACAQNGQYEEGLSLFSRMLDEGIRPNRALILNVMSCVHRENDADDVCRIVI 258

Query: 255 DNQLGLDQSVQNAAVGMYARCGRIDTAEKIFNRIHNKDLVSWTSMIEAYVQADLPLKALE 314
           ++ + LD+ V+NAA+GMYARCGRID A + F+ I  KDL+SW +MIEAYVQ DLPL ALE
Sbjct: 259 NHGVDLDRPVRNAAMGMYARCGRIDIARRFFDGILEKDLMSWAAMIEAYVQTDLPLTALE 318

Query: 315 IFREMILKGIMPDGITLLGVIHACLALGSFSQACWVHGLVIRRFSENQKMVETAIVDLYV 374
           +F++M+L+ I  D ++LL VIHAC  L SF QA ++HG + R F ENQ  VETA+VDLYV
Sbjct: 319 LFKQMVLERIPLDSVSLLSVIHACSNLASFQQARFIHGFITRGFLENQISVETALVDLYV 378

Query: 375 KCGSLIYARKVFDNMQERNVISWSTMISGYGLHGHGRKAICLFNEMKNSTKPDHITFVSL 434
           KCG+L+YARK+FDNM+ERN+ISWST+ISGYG+HGHGR+A+ LF++MK+S KPDHI F+S+
Sbjct: 379 KCGNLLYARKIFDNMRERNIISWSTLISGYGVHGHGREALYLFDQMKDSIKPDHIAFLSV 438

Query: 435 LAAWSCG 442
           L+A S G
Sbjct: 439 LSACSHG 445

BLAST of HG10003654 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 263.5 bits (672), Expect = 3.2e-70
Identity = 149/435 (34.25%), Postives = 243/435 (55.86%), Query Frame = 0

Query: 19  SLQFPVHPRPIDSE----SCTFFIKTCSSIKSL---KCVHASILRANLHLNLFFCTTLIS 78
           +LQF V  R  D E    + T+ +K C     L   K +H  ++++   L+LF  T L +
Sbjct: 119 ALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIHGLLVKSGFSLDLFAMTGLEN 178

Query: 79  HYASLGSVSYAYSLFSLLQSLDVFLWNVMLRCFVDAGCHRRAMLLYAQMLDLGIRPNNFT 138
            YA    V+ A  +F  +   D+  WN ++  +   G  R A+ +   M +  ++P+  T
Sbjct: 179 MYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMALEMVKSMCEENLKPSFIT 238

Query: 139 FPFVFKACGCVEDLDFGVKVHHDAVYFGYELDVFVANSLIAMYGRCGRSELAREVFDKMP 198
              V  A   +  +  G ++H  A+  G++  V ++ +L+ MY +CG  E AR++FD M 
Sbjct: 239 IVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGML 298

Query: 199 QRSVLSWSSIIGAYAQNGQYGLGVSLFSLMLNEGFQPNRSVMLNVMACIHSETEADDVYR 258
           +R+V+SW+S+I AY QN      + +F  ML+EG +P     ++VM  +H+  +  D+ R
Sbjct: 299 ERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTD---VSVMGALHACADLGDLER 358

Query: 259 ----MAMDNQLGLDQ--SVQNAAVGMYARCGRIDTAEKIFNRIHNKDLVSWTSMIEAYVQ 318
                 +  +LGLD+  SV N+ + MY +C  +DTA  +F ++ ++ LVSW +MI  + Q
Sbjct: 359 GRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQ 418

Query: 319 ADLPLKALEIFREMILKGIMPDGITLLGVIHACLALGSFSQACWVHGLVIRRFSENQKMV 378
              P+ AL  F +M  + + PD  T + VI A   L     A W+HG+V+R   +    V
Sbjct: 419 NGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFV 478

Query: 379 ETAIVDLYVKCGSLIYARKVFDNMQERNVISWSTMISGYGLHGHGRKAICLFNEMKNST- 438
            TA+VD+Y KCG+++ AR +FD M ER+V +W+ MI GYG HG G+ A+ LF EM+  T 
Sbjct: 479 TTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTI 538

Query: 439 KPDHITFVSLLAAWS 440
           KP+ +TF+S+++A S
Sbjct: 539 KPNGVTFLSVISACS 550

BLAST of HG10003654 vs. TAIR 10
Match: AT3G12770.1 (mitochondrial editing factor 22 )

HSP 1 Score: 255.4 bits (651), Expect = 8.8e-68
Identity = 135/397 (34.01%), Postives = 224/397 (56.42%), Query Frame = 0

Query: 47  LKCVHASILRANLHLNLFFCTTLISHYASLGSVSYAYSLFSLLQSLDVFLWNVMLRCFVD 106
           LK +HA +L   L  + F  T LI   +S G +++A  +F  L    +F WN ++R +  
Sbjct: 37  LKQIHARLLVLGLQFSGFLITKLIHASSSFGDITFARQVFDDLPRPQIFPWNAIIRGYSR 96

Query: 107 AGCHRRAMLLYAQMLDLGIRPNNFTFPFVFKACGCVEDLDFGVKVHHDAVYFGYELDVFV 166
               + A+L+Y+ M    + P++FTFP + KAC  +  L  G  VH      G++ DVFV
Sbjct: 97  NNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVHAQVFRLGFDADVFV 156

Query: 167 ANSLIAMYGRCGRSELAREVFD--KMPQRSVLSWSSIIGAYAQNGQYGLGVSLFSLMLNE 226
            N LIA+Y +C R   AR VF+   +P+R+++SW++I+ AYAQNG+    + +FS M   
Sbjct: 157 QNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPMEALEIFSQMRKM 216

Query: 227 GFQPNRSVM---LNVMACIHSETEADDVYRMAMDNQLGLDQSVQNAAVGMYARCGRIDTA 286
             +P+   +   LN   C+    +   ++   +   L ++  +  +   MYA+CG++ TA
Sbjct: 217 DVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLISLNTMYAKCGQVATA 276

Query: 287 EKIFNRIHNKDLVSWTSMIEAYVQADLPLKALEIFREMILKGIMPDGITLLGVIHACLAL 346
           + +F+++ + +L+ W +MI  Y +     +A+++F EMI K + PD I++   I AC  +
Sbjct: 277 KILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMINKDVRPDTISITSAISACAQV 336

Query: 347 GSFSQACWVHGLVIRRFSENQKMVETAIVDLYVKCGSLIYARKVFDNMQERNVISWSTMI 406
           GS  QA  ++  V R    +   + +A++D++ KCGS+  AR VFD   +R+V+ WS MI
Sbjct: 337 GSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAMI 396

Query: 407 SGYGLHGHGRKAICLFNEM-KNSTKPDHITFVSLLAA 438
            GYGLHG  R+AI L+  M +    P+ +TF+ LL A
Sbjct: 397 VGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLMA 433

BLAST of HG10003654 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 247.7 bits (631), Expect = 1.8e-65
Identity = 140/445 (31.46%), Postives = 236/445 (53.03%), Query Frame = 0

Query: 38  IKTCSSIKSLKCVHASILRANLHLNLFFCTTLIS------HYASLGSVSYAYSLFSLLQS 97
           +  C +++SL+ +HA +++  LH   +  + LI       H+  L    YA S+F  +Q 
Sbjct: 40  LHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGL---PYAISVFKTIQE 99

Query: 98  LDVFLWNVMLRCFVDAGCHRRAMLLYAQMLDLGIRPNNFTFPFVFKACGCVEDLDFGVKV 157
            ++ +WN M R    +     A+ LY  M+ LG+ PN++TFPFV K+C   +    G ++
Sbjct: 100 PNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQI 159

Query: 158 HHDAVYFGYELDVFVANSLIAMYGRCGRSELAREVFDKMPQR------------------ 217
           H   +  G +LD++V  SLI+MY + GR E A +VFDK P R                  
Sbjct: 160 HGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGYI 219

Query: 218 -------------SVLSWSSIIGAYAQNGQYGLGVSLFSLMLNEGFQPNRSVMLNVM-AC 277
                         V+SW+++I  YA+ G Y   + LF  M+    +P+ S M+ V+ AC
Sbjct: 220 ENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSAC 279

Query: 278 IHSET--EADDVYRMAMDNQLGLDQSVQNAAVGMYARCGRIDTAEKIFNRIHNKDLVSWT 337
             S +      V+    D+  G +  + NA + +Y++CG ++TA  +F R+  KD++SW 
Sbjct: 280 AQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISWN 339

Query: 338 SMIEAYVQADLPLKALEIFREMILKGIMPDGITLLGVIHACLALGSFSQACWVHGLVIRR 397
           ++I  Y   +L  +AL +F+EM+  G  P+ +T+L ++ AC  LG+     W+H  + +R
Sbjct: 340 TLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKR 399

Query: 398 FS--ENQKMVETAIVDLYVKCGSLIYARKVFDNMQERNVISWSTMISGYGLHGHGRKAIC 440
                N   + T+++D+Y KCG +  A +VF+++  +++ SW+ MI G+ +HG    +  
Sbjct: 400 LKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFD 459

BLAST of HG10003654 vs. TAIR 10
Match: AT5G16860.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 238.0 bits (606), Expect = 1.4e-62
Identity = 146/454 (32.16%), Postives = 235/454 (51.76%), Query Frame = 0

Query: 37  FIKTCSSIKSLKCVHASILRANLHLNLFFCTTLISHYASLGSVSYAYSLFSLLQSLD--V 96
           FI  C +I  +K +H  +L   + L L   + LIS Y S+G +S+A SL       D  V
Sbjct: 34  FIHKCKTISQVKLIHQKLLSFGI-LTLNLTSHLISTYISVGCLSHAVSLLRRFPPSDAGV 93

Query: 97  FLWNVMLRCFVDAGCHRRAMLLYAQMLDLGIRPNNFTFPFVFKACGCVEDLDFGVKVHHD 156
           + WN ++R + D GC  + + L+  M  L   P+N+TFPFVFKACG +  +  G   H  
Sbjct: 94  YHWNSLIRSYGDNGCANKCLYLFGLMHSLSWTPDNYTFPFVFKACGEISSVRCGESAHAL 153

Query: 157 AVYFGYELDVFVANSLIAMYGRCGRSELAREVFDKMPQRSVLSWSSIIGAYAQNGQYGLG 216
           ++  G+  +VFV N+L+AMY RC     AR+VFD+M    V+SW+SII +YA+ G+  + 
Sbjct: 154 SLVTGFISNVFVGNALVAMYSRCRSLSDARKVFDEMSVWDVVSWNSIIESYAKLGKPKVA 213

Query: 217 VSLFSLMLNE-GFQPNRSVMLNVM---ACIHSETEADDVYRMAMDNQLGLDQSVQNAAVG 276
           + +FS M NE G +P+   ++NV+   A + + +    ++  A+ +++  +  V N  V 
Sbjct: 214 LEMFSRMTNEFGCRPDNITLVNVLPPCASLGTHSLGKQLHCFAVTSEMIQNMFVGNCLVD 273

Query: 277 MYARCGRIDTAEKIFNRIHNKDLVSWTSM------------------------------- 336
           MYA+CG +D A  +F+ +  KD+VSW +M                               
Sbjct: 274 MYAKCGMMDEANTVFSNMSVKDVVSWNAMVAGYSQIGRFEDAVRLFEKMQEEKIKMDVVT 333

Query: 337 ----IEAYVQADLPLKALEIFREMILKGIMPDGITLLGVIHACLALGSFSQACWVHGLVI 396
               I  Y Q  L  +AL + R+M+  GI P+ +TL+ V+  C ++G+      +H   I
Sbjct: 334 WSAAISGYAQRGLGYEALGVCRQMLSSGIKPNEVTLISVLSGCASVGALMHGKEIHCYAI 393

Query: 397 -------RRFSENQKMVETAIVDLYVKCGSLIYARKVFDNM--QERNVISWSTMISGYGL 438
                  +    ++ MV   ++D+Y KC  +  AR +FD++  +ER+V++W+ MI GY  
Sbjct: 394 KYPIDLRKNGHGDENMVINQLIDMYAKCKKVDTARAMFDSLSPKERDVVTWTVMIGGYSQ 453

BLAST of HG10003654 vs. TAIR 10
Match: AT2G34400.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 236.1 bits (601), Expect = 5.5e-62
Identity = 133/406 (32.76%), Postives = 223/406 (54.93%), Query Frame = 0

Query: 36  FFIKTCSSIKSLKCVHASILRANLHLNLFFCTTLISHYASLGSVSYAYSLFSLLQSLDVF 95
           F +K C S+  L+ + A +L  ++    F    LI     LG  +Y+  LFS+ +  + +
Sbjct: 42  FLLKKCISVNQLRQIQAQMLLHSVEKPNF----LIPKAVELGDFNYSSFLFSVTEEPNHY 101

Query: 96  LWNVMLRCFVDA-GCHRRAMLLYAQMLDLGIRPNNFTFPFVFKACGCVEDLDFGVKVHHD 155
            +N M+R   +    H  A+ LY +M   G++P+ FT+ FVF AC  +E++  G  VH  
Sbjct: 102 SFNYMIRGLTNTWNDHEAALSLYRRMKFSGLKPDKFTYNFVFIACAKLEEIGVGRSVHSS 161

Query: 156 AVYFGYELDVFVANSLIAMYGRCGRSELAREVFDKMPQRSVLSWSSIIGAYAQNGQYGLG 215
               G E DV + +SLI MY +CG+   AR++FD++ +R  +SW+S+I  Y++ G     
Sbjct: 162 LFKVGLERDVHINHSLIMMYAKCGQVGYARKLFDEITERDTVSWNSMISGYSEAGYAKDA 221

Query: 216 VSLFSLMLNEGFQPNRSVMLNVM-ACIH--SETEADDVYRMAMDNQLGLDQSVQNAAVGM 275
           + LF  M  EGF+P+   +++++ AC H         +  MA+  ++GL   + +  + M
Sbjct: 222 MDLFRKMEEEGFEPDERTLVSMLGACSHLGDLRTGRLLEEMAITKKIGLSTFLGSKLISM 281

Query: 276 YARCGRIDTAEKIFNRIHNKDLVSWTSMIEAYVQADLPLKALEIFREMILKGIMPDGITL 335
           Y +CG +D+A ++FN++  KD V+WT+MI  Y Q     +A ++F EM   G+ PD  TL
Sbjct: 282 YGKCGDLDSARRVFNQMIKKDRVAWTAMITVYSQNGKSSEAFKLFFEMEKTGVSPDAGTL 341

Query: 336 LGVIHACLALGSFSQACWVHGLVIRRFSENQKMVETAIVDLYVKCGSLIYARKVFDNMQE 395
             V+ AC ++G+      +         ++   V T +VD+Y KCG +  A +VF+ M  
Sbjct: 342 STVLSACGSVGALELGKQIETHASELSLQHNIYVATGLVDMYGKCGRVEEALRVFEAMPV 401

Query: 396 RNVISWSTMISGYGLHGHGRKAICLFNEMKNSTKPDHITFVSLLAA 438
           +N  +W+ MI+ Y   GH ++A+ LF+ M  S  P  ITF+ +L+A
Sbjct: 402 KNEATWNAMITAYAHQGHAKEALLLFDRM--SVPPSDITFIGVLSA 441

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038884398.12.4e-22187.70pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like [Benin... [more]
KAG7025143.11.3e-21987.47putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyros... [more]
XP_022925426.11.9e-21887.24pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like isofor... [more]
KAG6592321.13.3e-21887.02Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a... [more]
XP_023535836.12.4e-21686.56pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like isofor... [more]
Match NameE-valueIdentityDescription
Q3E6Q14.5e-6934.25Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop... [more]
Q9LTV81.2e-6634.01Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX... [more]
Q9LN012.6e-6431.46Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
Q9LFL52.0e-6132.16Pentatricopeptide repeat-containing protein At5g16860 OS=Arabidopsis thaliana OX... [more]
O647057.7e-6132.76Pentatricopeptide repeat-containing protein At2g34400 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A6J1EC659.4e-21987.24pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like isofor... [more]
A0A6J1ID682.0e-21686.56pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like isofor... [more]
A0A6J1EBP01.1e-16671.53pentatricopeptide repeat-containing protein At2g01510, mitochondrial-like isofor... [more]
A0A6J1IEQ31.2e-16571.30pentatricopeptide repeat-containing protein At2g01510, mitochondrial-like isofor... [more]
A0A2I4EAK99.9e-15261.36pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like OS=Jug... [more]
Match NameE-valueIdentityDescription
AT1G11290.13.2e-7034.25Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G12770.18.8e-6834.01mitochondrial editing factor 22 [more]
AT1G08070.11.8e-6531.46Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G16860.11.4e-6232.16Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G34400.15.5e-6232.76Pentatricopeptide repeat (PPR-like) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 93..139
e-value: 1.8E-8
score: 34.4
coord: 392..436
e-value: 6.3E-9
score: 35.9
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 66..92
e-value: 0.45
score: 10.9
coord: 266..291
e-value: 0.16
score: 12.3
coord: 168..194
e-value: 6.9E-5
score: 22.8
coord: 294..324
e-value: 5.5E-8
score: 32.6
coord: 197..226
e-value: 1.1E-4
score: 22.1
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 294..327
e-value: 3.8E-7
score: 27.9
coord: 395..422
e-value: 6.6E-6
score: 24.0
coord: 168..194
e-value: 2.2E-4
score: 19.2
coord: 96..128
e-value: 2.1E-6
score: 25.5
coord: 197..229
e-value: 4.4E-6
score: 24.5
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 292..326
score: 11.849223
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 93..127
score: 11.169622
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 393..423
score: 9.339086
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 194..228
score: 10.654441
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 163..193
score: 9.580234
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 248..349
e-value: 9.0E-21
score: 76.0
coord: 354..440
e-value: 2.4E-17
score: 64.8
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 4..156
e-value: 4.3E-17
score: 64.5
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 158..247
e-value: 1.3E-8
score: 36.6
NoneNo IPR availablePANTHERPTHR47924:SF34PPR CONTAINING PLANT-LIKE PROTEINcoord: 26..438
NoneNo IPR availablePANTHERPTHR47924PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 26..438

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10003654.1HG10003654.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0019915 lipid storage
cellular_component GO:0005783 endoplasmic reticulum
cellular_component GO:0016020 membrane
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding