HG10007827 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10007827
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationChr10: 14561540 .. 14563285 (-)
RNA-Seq ExpressionHG10007827
SyntenyHG10007827
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTTTCATCGGAATTTCTCCCTCAGAACCTCCATTTCACCAACCCATTGTCGAAGCCAACAATTCCCCAATCACATTCCGACTCCCTCGTCACTCGCAAATTTCCAAACAAAACCCATCTCAGAAATGGCGCATCTTCTGCTGAATCCAGAGAACCCCATTTCCCCAATCTCCATAACAGAGATGCCCATTTGATGAAACTCCTCAACAGATCCTGCAGAGCTGGGAAGCACAACGAATCGCTCTATTTTCTCGAAAGCGTGGTGAGTAAAGGCTTCAAACCTGATGTTGTGCTCTGTACGAAGCTCATTAAAGGGTTCTTTAATTCGAGGAATTTGAAGAAAGCTGTGAGGGTTATGGAAATTTTGGAAACTTATGGTGACCCTGATGTTTATTCTTACAATGCTATGATTAGTGGGTTTAGTAAAGCCAACCAAATTGAGTCTGCAAACCAGGTGTTTGATAGAATGCGCATCAGGGGATTTTCCCCTGATGTTGTTACTTACAATATAATGATTGGGAGTTTGTGTAGCAGGGGGAAGCTTGAGCTTGCTTTTGAAGTTATGGATGAGCTTTTGAAGGATGGGTGTAAGCCATCTGTGATTACTTACACAATTCTTATAGAAGCAACCATTCTTGAAGGTAGAATCAATGAAGCTCTTGAACTATTCGACGAATTGCTGTCGAGGGGCCTGCGTCCTGACTTGTATACATACAATGCCATCATTAGAGGTATTTGCAAGGAAGGAATGGAGGATCGAGCCGTGGAATTTGTTCGGGAGTTATCAGCTAGAGGGTGTAATCCAGATGCGATTTCATACAATATTCTGCTGCGTTCTTTTCTAAACAAAAGCAGGTGGGAAGATGGGGAGAAGCTTATGAAAGACATGGTTTTAAGTGGCTGTGAGCCGAATGTCGTTACTCACAGCATTTTAATTAGTTCGTTGTGTCGCGAAGGGAGAGTCAGGGAAGCCGTGAATGTGTTGAAGGTGATGAAGGAGAAGGGCTTAACCCCAGATGCATATAGCTATGATCCACTGATTTCTGCCTTCTGCAAAGAAGGGAGATTAGATTTAGCAATTGAGTATTTGCACAAAATGGTCTCTGATGGTTGTTTGCCTGATATTGTAAACTACAATACAATTTTAGCTACTCTTTGTAAATTTGGTAGTGCTGATCTGGCTTTAGACATCTTTGAGAAGCTAGATGAAGTGGGATGCCCTCCAAATGTGAGCTCCTACAACACAATGTTCAGTGCACTTTGGAGCTCTGGGAACAAGATCAAGGCTCTGGAAATGATATCAGAAATGATAAGAAAAGGAATCGATCCCGATGAGATAACGTACAATTCTCTGATCTCATGCTTGTGTCGGGACGGGTTGGTCGATGAGGCTATTGGATTGTTGGTAGACATGGAAGCTACCAGCTTCCAGCCAACAGTGATCAGCTTCAACATTGTGCTTCTGGGAATGTGTAAAGTACACAGGGTTTTTGAAGGCATTGAGTTGCTAATAACAATGGTTGAAAAAGGTTGCCTACCGAACGAAACTAGTTATGTCTTGTTAATCGAGGGGATCGCTTATGCCGGGTGGCGACCAGAGGCTATGGAGTTGGCTAACTCTCTGTACAGATTGGGAGTTATTTGTGAGGATTCTTCCAAGCGTTTGAATAAGACATTTCCAATGCTTGACGTTTATAAAGGGCTAAGCTTATCGGAAAGCAAGAACCAACTCTTGCAAAGCTGA

mRNA sequence

ATGTTTTCATCGGAATTTCTCCCTCAGAACCTCCATTTCACCAACCCATTGTCGAAGCCAACAATTCCCCAATCACATTCCGACTCCCTCGTCACTCGCAAATTTCCAAACAAAACCCATCTCAGAAATGGCGCATCTTCTGCTGAATCCAGAGAACCCCATTTCCCCAATCTCCATAACAGAGATGCCCATTTGATGAAACTCCTCAACAGATCCTGCAGAGCTGGGAAGCACAACGAATCGCTCTATTTTCTCGAAAGCGTGGTGAGTAAAGGCTTCAAACCTGATGTTGTGCTCTGTACGAAGCTCATTAAAGGGTTCTTTAATTCGAGGAATTTGAAGAAAGCTGTGAGGGTTATGGAAATTTTGGAAACTTATGGTGACCCTGATGTTTATTCTTACAATGCTATGATTAGTGGGTTTAGTAAAGCCAACCAAATTGAGTCTGCAAACCAGGTGTTTGATAGAATGCGCATCAGGGGATTTTCCCCTGATGTTGTTACTTACAATATAATGATTGGGAGTTTGTGTAGCAGGGGGAAGCTTGAGCTTGCTTTTGAAGTTATGGATGAGCTTTTGAAGGATGGGTGTAAGCCATCTGTGATTACTTACACAATTCTTATAGAAGCAACCATTCTTGAAGGTAGAATCAATGAAGCTCTTGAACTATTCGACGAATTGCTGTCGAGGGGCCTGCGTCCTGACTTGTATACATACAATGCCATCATTAGAGGTATTTGCAAGGAAGGAATGGAGGATCGAGCCGTGGAATTTGTTCGGGAGTTATCAGCTAGAGGGTGTAATCCAGATGCGATTTCATACAATATTCTGCTGCGTTCTTTTCTAAACAAAAGCAGGTGGGAAGATGGGGAGAAGCTTATGAAAGACATGGTTTTAAGTGGCTGTGAGCCGAATGTCGTTACTCACAGCATTTTAATTAGTTCGTTGTGTCGCGAAGGGAGAGTCAGGGAAGCCGTGAATGTGTTGAAGGTGATGAAGGAGAAGGGCTTAACCCCAGATGCATATAGCTATGATCCACTGATTTCTGCCTTCTGCAAAGAAGGGAGATTAGATTTAGCAATTGAGTATTTGCACAAAATGGTCTCTGATGGTTGTTTGCCTGATATTGTAAACTACAATACAATTTTAGCTACTCTTTGTAAATTTGGTAGTGCTGATCTGGCTTTAGACATCTTTGAGAAGCTAGATGAAGTGGGATGCCCTCCAAATGTGAGCTCCTACAACACAATGTTCAGTGCACTTTGGAGCTCTGGGAACAAGATCAAGGCTCTGGAAATGATATCAGAAATGATAAGAAAAGGAATCGATCCCGATGAGATAACGTACAATTCTCTGATCTCATGCTTGTGTCGGGACGGGTTGGTCGATGAGGCTATTGGATTGTTGGTAGACATGGAAGCTACCAGCTTCCAGCCAACAGTGATCAGCTTCAACATTGTGCTTCTGGGAATGTGTAAAGTACACAGGGTTTTTGAAGGCATTGAGTTGCTAATAACAATGGTTGAAAAAGGTTGCCTACCGAACGAAACTAGTTATGTCTTGTTAATCGAGGGGATCGCTTATGCCGGGTGGCGACCAGAGGCTATGGAGTTGGCTAACTCTCTGTACAGATTGGGAGTTATTTGTGAGGATTCTTCCAAGCGTTTGAATAAGACATTTCCAATGCTTGACGTTTATAAAGGGCTAAGCTTATCGGAAAGCAAGAACCAACTCTTGCAAAGCTGA

Coding sequence (CDS)

ATGTTTTCATCGGAATTTCTCCCTCAGAACCTCCATTTCACCAACCCATTGTCGAAGCCAACAATTCCCCAATCACATTCCGACTCCCTCGTCACTCGCAAATTTCCAAACAAAACCCATCTCAGAAATGGCGCATCTTCTGCTGAATCCAGAGAACCCCATTTCCCCAATCTCCATAACAGAGATGCCCATTTGATGAAACTCCTCAACAGATCCTGCAGAGCTGGGAAGCACAACGAATCGCTCTATTTTCTCGAAAGCGTGGTGAGTAAAGGCTTCAAACCTGATGTTGTGCTCTGTACGAAGCTCATTAAAGGGTTCTTTAATTCGAGGAATTTGAAGAAAGCTGTGAGGGTTATGGAAATTTTGGAAACTTATGGTGACCCTGATGTTTATTCTTACAATGCTATGATTAGTGGGTTTAGTAAAGCCAACCAAATTGAGTCTGCAAACCAGGTGTTTGATAGAATGCGCATCAGGGGATTTTCCCCTGATGTTGTTACTTACAATATAATGATTGGGAGTTTGTGTAGCAGGGGGAAGCTTGAGCTTGCTTTTGAAGTTATGGATGAGCTTTTGAAGGATGGGTGTAAGCCATCTGTGATTACTTACACAATTCTTATAGAAGCAACCATTCTTGAAGGTAGAATCAATGAAGCTCTTGAACTATTCGACGAATTGCTGTCGAGGGGCCTGCGTCCTGACTTGTATACATACAATGCCATCATTAGAGGTATTTGCAAGGAAGGAATGGAGGATCGAGCCGTGGAATTTGTTCGGGAGTTATCAGCTAGAGGGTGTAATCCAGATGCGATTTCATACAATATTCTGCTGCGTTCTTTTCTAAACAAAAGCAGGTGGGAAGATGGGGAGAAGCTTATGAAAGACATGGTTTTAAGTGGCTGTGAGCCGAATGTCGTTACTCACAGCATTTTAATTAGTTCGTTGTGTCGCGAAGGGAGAGTCAGGGAAGCCGTGAATGTGTTGAAGGTGATGAAGGAGAAGGGCTTAACCCCAGATGCATATAGCTATGATCCACTGATTTCTGCCTTCTGCAAAGAAGGGAGATTAGATTTAGCAATTGAGTATTTGCACAAAATGGTCTCTGATGGTTGTTTGCCTGATATTGTAAACTACAATACAATTTTAGCTACTCTTTGTAAATTTGGTAGTGCTGATCTGGCTTTAGACATCTTTGAGAAGCTAGATGAAGTGGGATGCCCTCCAAATGTGAGCTCCTACAACACAATGTTCAGTGCACTTTGGAGCTCTGGGAACAAGATCAAGGCTCTGGAAATGATATCAGAAATGATAAGAAAAGGAATCGATCCCGATGAGATAACGTACAATTCTCTGATCTCATGCTTGTGTCGGGACGGGTTGGTCGATGAGGCTATTGGATTGTTGGTAGACATGGAAGCTACCAGCTTCCAGCCAACAGTGATCAGCTTCAACATTGTGCTTCTGGGAATGTGTAAAGTACACAGGGTTTTTGAAGGCATTGAGTTGCTAATAACAATGGTTGAAAAAGGTTGCCTACCGAACGAAACTAGTTATGTCTTGTTAATCGAGGGGATCGCTTATGCCGGGTGGCGACCAGAGGCTATGGAGTTGGCTAACTCTCTGTACAGATTGGGAGTTATTTGTGAGGATTCTTCCAAGCGTTTGAATAAGACATTTCCAATGCTTGACGTTTATAAAGGGCTAAGCTTATCGGAAAGCAAGAACCAACTCTTGCAAAGCTGA

Protein sequence

MFSSEFLPQNLHFTNPLSKPTIPQSHSDSLVTRKFPNKTHLRNGASSAESREPHFPNLHNRDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVMEILETYGDPDVYSYNAMISGFSKANQIESANQVFDRMRIRGFSPDVVTYNIMIGSLCSRGKLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYNAIIRGICKEGMEDRAVEFVRELSARGCNPDAISYNILLRSFLNKSRWEDGEKLMKDMVLSGCEPNVVTHSILISSLCREGRVREAVNVLKVMKEKGLTPDAYSYDPLISAFCKEGRLDLAIEYLHKMVSDGCLPDIVNYNTILATLCKFGSADLALDIFEKLDEVGCPPNVSSYNTMFSALWSSGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPTVISFNIVLLGMCKVHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRPEAMELANSLYRLGVICEDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS
Homology
BLAST of HG10007827 vs. NCBI nr
Match: XP_038880759.1 (pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Benincasa hispida])

HSP 1 Score: 1121.7 bits (2900), Expect = 0.0e+00
Identity = 558/581 (96.04%), Postives = 568/581 (97.76%), Query Frame = 0

Query: 1   MFSSEFLPQNLHFTNPLSKPTIPQSHSDSLVTRKFPNKTHLRNGASSAESREPHFPNLHN 60
           MFSSEFLPQ+LHFTNPLSKPTIP+SHSDSLVTRKF NKTHLRNGASSAESREPHF NLHN
Sbjct: 1   MFSSEFLPQSLHFTNPLSKPTIPRSHSDSLVTRKFSNKTHLRNGASSAESREPHFSNLHN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM 120
           RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKA+RVM
Sbjct: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIESANQVFDRMRIRGFSPDVVTYNIMIGSLCSRG 180
           EILETYGDPDVYSYNAMISGFSKANQIESAN+VFDRMR RGFSPDVVTYNIMIG LCSRG
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIESANKVFDRMRSRGFSPDVVTYNIMIGCLCSRG 180

Query: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN 240
           KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN
Sbjct: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN 240

Query: 241 AIIRGICKEGMEDRAVEFVRELSARGCNPDAISYNILLRSFLNKSRWEDGEKLMKDMVLS 300
           AIIRGICKEGMEDRAVEFV+ LSARGCNPD ISYNILLRSFLNKSRW DGEKLMKDMVL 
Sbjct: 241 AIIRGICKEGMEDRAVEFVQGLSARGCNPDVISYNILLRSFLNKSRWADGEKLMKDMVLI 300

Query: 301 GCEPNVVTHSILISSLCREGRVREAVNVLKVMKEKGLTPDAYSYDPLISAFCKEGRLDLA 360
           GCEPNVVTHSILISSLCREGRV EAVNVLKVMKEKGLTPDAYSYDPLISAFCKEGRLDLA
Sbjct: 301 GCEPNVVTHSILISSLCREGRVGEAVNVLKVMKEKGLTPDAYSYDPLISAFCKEGRLDLA 360

Query: 361 IEYLHKMVSDGCLPDIVNYNTILATLCKFGSADLALDIFEKLDEVGCPPNVSSYNTMFSA 420
           IEYLHKMVSDGCLPDIVNYNTILATLCKFGSADLALDIFEKLDEVGCPPNVSSYNTMFSA
Sbjct: 361 IEYLHKMVSDGCLPDIVNYNTILATLCKFGSADLALDIFEKLDEVGCPPNVSSYNTMFSA 420

Query: 421 LWSSGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT 480
           LWS G KIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEAT+FQPT
Sbjct: 421 LWSCGKKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATNFQPT 480

Query: 481 VISFNIVLLGMCKVHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRPEAMELAN 540
           VISFNIVLLGMCK HRVFEGIELLITMVEKGC+PN+TSYVLLIEGIAYAGWR EAMELAN
Sbjct: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCVPNKTSYVLLIEGIAYAGWRAEAMELAN 540

Query: 541 SLYRLGVICEDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 582
           +LYRLGVICEDSSKRLNKTFPMLDVYKGLSLSESKNQLLQ+
Sbjct: 541 ALYRLGVICEDSSKRLNKTFPMLDVYKGLSLSESKNQLLQT 581

BLAST of HG10007827 vs. NCBI nr
Match: KAA0038402.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1114.4 bits (2881), Expect = 0.0e+00
Identity = 554/581 (95.35%), Postives = 566/581 (97.42%), Query Frame = 0

Query: 1   MFSSEFLPQNLHFTNPLSKPTIPQSHSDSLVTRKFPNKTHLRNGASSAESREPHFPNLHN 60
           MFSSEFLPQ+LHFTNPLSKPTIPQSHSDS+ TR+F NKT+LRN  SSAESR+PHFPNL N
Sbjct: 1   MFSSEFLPQSLHFTNPLSKPTIPQSHSDSIPTRRFSNKTYLRNVTSSAESRQPHFPNLDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM 120
           RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM
Sbjct: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIESANQVFDRMRIRGFSPDVVTYNIMIGSLCSRG 180
           EILETYGDPDVYSYNAMISGFSKANQI+SANQVFDRMR RGFSPD+VTYNIMIGSLCSRG
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDIVTYNIMIGSLCSRG 180

Query: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN 240
           KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN
Sbjct: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN 240

Query: 241 AIIRGICKEGMEDRAVEFVRELSARGCNPDAISYNILLRSFLNKSRWEDGEKLMKDMVLS 300
           AIIRGICKEGMEDRAV+FVR+LSARGCNPD +SYNILLRSFLNKSRWEDGEKLMKDMVLS
Sbjct: 241 AIIRGICKEGMEDRAVDFVRDLSARGCNPDVVSYNILLRSFLNKSRWEDGEKLMKDMVLS 300

Query: 301 GCEPNVVTHSILISSLCREGRVREAVNVLKVMKEKGLTPDAYSYDPLISAFCKEGRLDLA 360
           GCEPNVVTHSILISS CREGRVREAVNVL+VMKEKGLTPDAYSYDPLISAFCKEGRLDLA
Sbjct: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDAYSYDPLISAFCKEGRLDLA 360

Query: 361 IEYLHKMVSDGCLPDIVNYNTILATLCKFGSADLALDIFEKLDEVGCPPNVSSYNTMFSA 420
           IEYL KMVSDGCLPDIVNYNTILATLCKFG ADLALDIFEKLD+VGCPPNVSSYNTMFSA
Sbjct: 361 IEYLDKMVSDGCLPDIVNYNTILATLCKFGCADLALDIFEKLDQVGCPPNVSSYNTMFSA 420

Query: 421 LWSSGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT 480
           LWS GNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT
Sbjct: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT 480

Query: 481 VISFNIVLLGMCKVHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRPEAMELAN 540
           VISFNIVLLGMCK HRVFEGIELLITMVEKGC PNETSYVLLIEGIAYAGWR EAMELAN
Sbjct: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCPPNETSYVLLIEGIAYAGWRAEAMELAN 540

Query: 541 SLYRLGVICEDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 582
           SLYRLGVI EDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS
Sbjct: 541 SLYRLGVISEDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 581

BLAST of HG10007827 vs. NCBI nr
Match: TYJ96990.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1114.0 bits (2880), Expect = 0.0e+00
Identity = 554/581 (95.35%), Postives = 565/581 (97.25%), Query Frame = 0

Query: 1   MFSSEFLPQNLHFTNPLSKPTIPQSHSDSLVTRKFPNKTHLRNGASSAESREPHFPNLHN 60
           MFSSEFLPQ+ HFTNPLSKPTIPQSHSDS+ TR+F NKT+LRN  SSAESR+PHFPNL N
Sbjct: 1   MFSSEFLPQSFHFTNPLSKPTIPQSHSDSIPTRRFSNKTYLRNVTSSAESRQPHFPNLDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM 120
           RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM
Sbjct: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIESANQVFDRMRIRGFSPDVVTYNIMIGSLCSRG 180
           EILETYGDPDVYSYNAMISGFSKANQI+SANQVFDRMR RGFSPD+VTYNIMIGSLCSRG
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDIVTYNIMIGSLCSRG 180

Query: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN 240
           KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN
Sbjct: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN 240

Query: 241 AIIRGICKEGMEDRAVEFVRELSARGCNPDAISYNILLRSFLNKSRWEDGEKLMKDMVLS 300
           AIIRGICKEGMEDRAV+FVR+LSARGCNPD +SYNILLRSFLNKSRWEDGEKLMKDMVLS
Sbjct: 241 AIIRGICKEGMEDRAVDFVRDLSARGCNPDVVSYNILLRSFLNKSRWEDGEKLMKDMVLS 300

Query: 301 GCEPNVVTHSILISSLCREGRVREAVNVLKVMKEKGLTPDAYSYDPLISAFCKEGRLDLA 360
           GCEPNVVTHSILISS CREGRVREAVNVL+VMKEKGLTPDAYSYDPLISAFCKEGRLDLA
Sbjct: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDAYSYDPLISAFCKEGRLDLA 360

Query: 361 IEYLHKMVSDGCLPDIVNYNTILATLCKFGSADLALDIFEKLDEVGCPPNVSSYNTMFSA 420
           IEYL KMVSDGCLPDIVNYNTILATLCKFG ADLALDIFEKLDEVGCPPNVSSYNTMFSA
Sbjct: 361 IEYLDKMVSDGCLPDIVNYNTILATLCKFGCADLALDIFEKLDEVGCPPNVSSYNTMFSA 420

Query: 421 LWSSGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT 480
           LWS GNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT
Sbjct: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT 480

Query: 481 VISFNIVLLGMCKVHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRPEAMELAN 540
           VISFNIVLLGMCK HRVFEGIELLITMVEKGC PNETSYVLLIEGIAYAGWR EAMELAN
Sbjct: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCPPNETSYVLLIEGIAYAGWRAEAMELAN 540

Query: 541 SLYRLGVICEDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 582
           SLYRLGVI EDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS
Sbjct: 541 SLYRLGVISEDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 581

BLAST of HG10007827 vs. NCBI nr
Match: XP_008443759.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Cucumis melo])

HSP 1 Score: 1113.2 bits (2878), Expect = 0.0e+00
Identity = 554/581 (95.35%), Postives = 565/581 (97.25%), Query Frame = 0

Query: 1   MFSSEFLPQNLHFTNPLSKPTIPQSHSDSLVTRKFPNKTHLRNGASSAESREPHFPNLHN 60
           MFSSEFLPQ+LHFTNPLSKPTIPQSHSDS+ TR+F NKT+LRN  SSAESR+PHFPNL N
Sbjct: 1   MFSSEFLPQSLHFTNPLSKPTIPQSHSDSIPTRRFSNKTYLRNVTSSAESRQPHFPNLDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM 120
           RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM
Sbjct: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIESANQVFDRMRIRGFSPDVVTYNIMIGSLCSRG 180
           EILETYGDPDVYSYNAMISGFSKANQI+SANQVFDRMR RGFSPD+VTYNIMIGSLCSRG
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDIVTYNIMIGSLCSRG 180

Query: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN 240
           KL LAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN
Sbjct: 181 KLALAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN 240

Query: 241 AIIRGICKEGMEDRAVEFVRELSARGCNPDAISYNILLRSFLNKSRWEDGEKLMKDMVLS 300
           AIIRGICKEGMEDRAV+FVR+LSARGCNPD +SYNILLRSFLNKSRWEDGEKLMKDMVLS
Sbjct: 241 AIIRGICKEGMEDRAVDFVRDLSARGCNPDVVSYNILLRSFLNKSRWEDGEKLMKDMVLS 300

Query: 301 GCEPNVVTHSILISSLCREGRVREAVNVLKVMKEKGLTPDAYSYDPLISAFCKEGRLDLA 360
           GCEPNVVTHSILISS CREGRVREAVNVL+VMKEKGLTPDAYSYDPLISAFCKEGRLDLA
Sbjct: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDAYSYDPLISAFCKEGRLDLA 360

Query: 361 IEYLHKMVSDGCLPDIVNYNTILATLCKFGSADLALDIFEKLDEVGCPPNVSSYNTMFSA 420
           IEYL KMVSDGCLPDIVNYNTILATLCKFG ADLALDIFEKLDEVGCPPNVSSYNTMFSA
Sbjct: 361 IEYLDKMVSDGCLPDIVNYNTILATLCKFGCADLALDIFEKLDEVGCPPNVSSYNTMFSA 420

Query: 421 LWSSGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT 480
           LWS GNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT
Sbjct: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT 480

Query: 481 VISFNIVLLGMCKVHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRPEAMELAN 540
           VISFNIVLLGMCK HRVFEGIELLITMVEKGC PNETSYVLLIEGIAYAGWR EAMELAN
Sbjct: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCPPNETSYVLLIEGIAYAGWRAEAMELAN 540

Query: 541 SLYRLGVICEDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 582
           SLYRLGVI EDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS
Sbjct: 541 SLYRLGVISEDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 581

BLAST of HG10007827 vs. NCBI nr
Match: XP_004142590.1 (pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Cucumis sativus] >KGN66736.1 hypothetical protein Csa_007448 [Cucumis sativus])

HSP 1 Score: 1092.8 bits (2825), Expect = 0.0e+00
Identity = 542/581 (93.29%), Postives = 558/581 (96.04%), Query Frame = 0

Query: 1   MFSSEFLPQNLHFTNPLSKPTIPQSHSDSLVTRKFPNKTHLRNGASSAESREPHFPNLHN 60
           MFSSEFLPQ+LHFTNPL+KPTIPQS SDS+   +F NKTHLRN  SSAE R+PHFPNL N
Sbjct: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM 120
           RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKA+RVM
Sbjct: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIESANQVFDRMRIRGFSPDVVTYNIMIGSLCSRG 180
           EILETYGDPDVYSYNAMISGFSKANQI+SANQVFDRMR RGFSPDVVTYNIMIGSLCSRG
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRG 180

Query: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN 240
           KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDEL+SRGLRPDLYTYN
Sbjct: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYN 240

Query: 241 AIIRGICKEGMEDRAVEFVRELSARGCNPDAISYNILLRSFLNKSRWEDGEKLMKDMVLS 300
           AIIRGICKEGMEDRA++FVR LSARGCNPD +SYNILLRSFLNKSRWEDGE+LMKDMVLS
Sbjct: 241 AIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLS 300

Query: 301 GCEPNVVTHSILISSLCREGRVREAVNVLKVMKEKGLTPDAYSYDPLISAFCKEGRLDLA 360
           GCEPNVVTHSILISS CREGRVREAVNVL+VMKEKGLTPD+YSYDPLISAFCKEGRLDLA
Sbjct: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLA 360

Query: 361 IEYLHKMVSDGCLPDIVNYNTILATLCKFGSADLALDIFEKLDEVGCPPNVSSYNTMFSA 420
           IEYL KMVSDGCLPDIVNYNTILATLCKFG ADLALD+FEKLDEVGCPP V +YNTMFSA
Sbjct: 361 IEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSA 420

Query: 421 LWSSGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT 480
           LWS GNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEAT FQPT
Sbjct: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPT 480

Query: 481 VISFNIVLLGMCKVHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRPEAMELAN 540
           VISFNIVLLGMCK HRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWR EAMELAN
Sbjct: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELAN 540

Query: 541 SLYRLGVICEDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 582
           SLYRLGVI  DSSKRLNKTFPMLDVYKGLSLSESKNQLLQS
Sbjct: 541 SLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 581

BLAST of HG10007827 vs. ExPASy Swiss-Prot
Match: Q9SR00 (Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At3g04760 PE=2 SV=1)

HSP 1 Score: 731.5 bits (1887), Expect = 7.5e-210
Identity = 352/555 (63.42%), Postives = 440/555 (79.28%), Query Frame = 0

Query: 11  LHFTNPLSKPTIPQSHSDSLVTRKFPNKTHLRNGASSAESREPHFPNLHNRDAHLMKLLN 70
           L F+N  S P      S S    +    T   +     E R+ H  +L  RD  ++K+ +
Sbjct: 40  LTFSN--SNPNNDNGRSFSSSGARNLQTTTTTDATLPTERRQQHSQSLGFRDTQMLKIFH 99

Query: 71  RSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVMEILETYGDPD 130
           RSCR+G + ESL+ LE++V KG+ PDV+LCTKLIKGFF  RN+ KAVRVMEILE +G PD
Sbjct: 100 RSCRSGNYIESLHLLETMVRKGYNPDVILCTKLIKGFFTLRNIPKAVRVMEILEKFGQPD 159

Query: 131 VYSYNAMISGFSKANQIESANQVFDRMRIRGFSPDVVTYNIMIGSLCSRGKLELAFEVMD 190
           V++YNA+I+GF K N+I+ A +V DRMR + FSPD VTYNIMIGSLCSRGKL+LA +V++
Sbjct: 160 VFAYNALINGFCKMNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALKVLN 219

Query: 191 ELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYNAIIRGICKEG 250
           +LL D C+P+VITYTILIEAT+LEG ++EAL+L DE+LSRGL+PD++TYN IIRG+CKEG
Sbjct: 220 QLLSDNCQPTVITYTILIEATMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGMCKEG 279

Query: 251 MEDRAVEFVRELSARGCNPDAISYNILLRSFLNKSRWEDGEKLMKDMVLSGCEPNVVTHS 310
           M DRA E VR L  +GC PD ISYNILLR+ LN+ +WE+GEKLM  M    C+PNVVT+S
Sbjct: 280 MVDRAFEMVRNLELKGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNVVTYS 339

Query: 311 ILISSLCREGRVREAVNVLKVMKEKGLTPDAYSYDPLISAFCKEGRLDLAIEYLHKMVSD 370
           ILI++LCR+G++ EA+N+LK+MKEKGLTPDAYSYDPLI+AFC+EGRLD+AIE+L  M+SD
Sbjct: 340 ILITTLCRDGKIEEAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLETMISD 399

Query: 371 GCLPDIVNYNTILATLCKFGSADLALDIFEKLDEVGCPPNVSSYNTMFSALWSSGNKIKA 430
           GCLPDIVNYNT+LATLCK G AD AL+IF KL EVGC PN SSYNTMFSALWSSG+KI+A
Sbjct: 400 GCLPDIVNYNTVLATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGDKIRA 459

Query: 431 LEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPTVISFNIVLLG 490
           L MI EM+  GIDPDEITYNS+ISCLCR+G+VDEA  LLVDM +  F P+V+++NIVLLG
Sbjct: 460 LHMILEMMSNGIDPDEITYNSMISCLCREGMVDEAFELLVDMRSCEFHPSVVTYNIVLLG 519

Query: 491 MCKVHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRPEAMELANSLYRLGVICE 550
            CK HR+ + I +L +MV  GC PNET+Y +LIEGI +AG+R EAMELAN L R+  I E
Sbjct: 520 FCKAHRIEDAINVLESMVGNGCRPNETTYTVLIEGIGFAGYRAEAMELANDLVRIDAISE 579

Query: 551 DSSKRLNKTFPMLDV 566
            S KRL++TFP+L+V
Sbjct: 580 YSFKRLHRTFPLLNV 592

BLAST of HG10007827 vs. ExPASy Swiss-Prot
Match: Q3EDF8 (Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana OX=3702 GN=At1g09900 PE=2 SV=1)

HSP 1 Score: 464.9 bits (1195), Expect = 1.3e-129
Identity = 223/489 (45.60%), Postives = 322/489 (65.85%), Query Frame = 0

Query: 69  LNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVMEILETYGD 128
           L +  R G+  E   FLE++V  G  PD++ CT LI+GF      +KA +++EILE  G 
Sbjct: 109 LRQMVRTGELEEGFKFLENMVYHGNVPDIIPCTTLIRGFCRLGKTRKAAKILEILEGSGA 168

Query: 129 -PDVYSYNAMISGFSKANQIESANQVFDRMRIRGFSPDVVTYNIMIGSLCSRGKLELAFE 188
            PDV +YN MISG+ KA +I +A  V DRM +   SPDVVTYN ++ SLC  GKL+ A E
Sbjct: 169 VPDVITYNVMISGYCKAGEINNALSVLDRMSV---SPDVVTYNTILRSLCDSGKLKQAME 228

Query: 189 VMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYNAIIRGIC 248
           V+D +L+  C P VITYTILIEAT  +  +  A++L DE+  RG  PD+ TYN ++ GIC
Sbjct: 229 VLDRMLQRDCYPDVITYTILIEATCRDSGVGHAMKLLDEMRDRGCTPDVVTYNVLVNGIC 288

Query: 249 KEGMEDRAVEFVRELSARGCNPDAISYNILLRSFLNKSRWEDGEKLMKDMVLSGCEPNVV 308
           KEG  D A++F+ ++ + GC P+ I++NI+LRS  +  RW D EKL+ DM+  G  P+VV
Sbjct: 289 KEGRLDEAIKFLNDMPSSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGFSPSVV 348

Query: 309 THSILISSLCREGRVREAVNVLKVMKEKGLTPDAYSYDPLISAFCKEGRLDLAIEYLHKM 368
           T +ILI+ LCR+G +  A+++L+ M + G  P++ SY+PL+  FCKE ++D AIEYL +M
Sbjct: 349 TFNILINFLCRKGLLGRAIDILEKMPQHGCQPNSLSYNPLLHGFCKEKKMDRAIEYLERM 408

Query: 369 VSDGCLPDIVNYNTILATLCKFGSADLALDIFEKLDEVGCPPNVSSYNTMFSALWSSGNK 428
           VS GC PDIV YNT+L  LCK G  + A++I  +L   GC P + +YNT+   L  +G  
Sbjct: 409 VSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLAKAGKT 468

Query: 429 IKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPTVISFNIV 488
            KA++++ EM  K + PD ITY+SL+  L R+G VDEAI    + E    +P  ++FN +
Sbjct: 469 GKAIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAIKFFHEFERMGIRPNAVTFNSI 528

Query: 489 LLGMCKVHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRPEAMELANSLYRLGV 548
           +LG+CK  +    I+ L+ M+ +GC PNETSY +LIEG+AY G   EA+EL N L   G+
Sbjct: 529 MLGLCKSRQTDRAIDFLVFMINRGCKPNETSYTILIEGLAYEGMAKEALELLNELCNKGL 588

Query: 549 ICEDSSKRL 557
           + + S++++
Sbjct: 589 MKKSSAEQV 594

BLAST of HG10007827 vs. ExPASy Swiss-Prot
Match: A3KPF8 (Pentatricopeptide repeat-containing protein At1g79080, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At1g79080 PE=2 SV=1)

HSP 1 Score: 310.5 bits (794), Expect = 4.1e-83
Identity = 173/487 (35.52%), Postives = 278/487 (57.08%), Query Frame = 0

Query: 79  NESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVMEILETYG-DPDVYSYNAM 138
           ++S   LES+V+ G KP+V   T+L+     +  LKKA+RV+E++ + G  PD  +Y  +
Sbjct: 88  SDSFSHLESLVTGGHKPNVAHSTQLLYDLCKANRLKKAIRVIELMVSSGIIPDASAYTYL 147

Query: 139 ISGFSKANQIESANQVFDRMRIRGFSPDVVTYNIMIGSLCSRGKLELAFEVMDELLKDGC 198
           ++   K   +  A Q+ ++M   G+  + VTYN ++  LC  G L  + + ++ L++ G 
Sbjct: 148 VNQLCKRGNVGYAMQLVEKMEDHGYPSNTVTYNALVRGLCMLGSLNQSLQFVERLMQKGL 207

Query: 199 KPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYNAIIRGICKEGMEDRAVE 258
            P+  TY+ L+EA   E   +EA++L DE++ +G  P+L +YN ++ G CKEG  D A+ 
Sbjct: 208 APNAFTYSFLLEAAYKERGTDEAVKLLDEIIVKGGEPNLVSYNVLLTGFCKEGRTDDAMA 267

Query: 259 FVRELSARGCNPDAISYNILLRSFLNKSRWEDGEKLMKDMVLSGCEPNVVTHSILISSLC 318
             REL A+G   + +SYNILLR      RWE+   L+ +M      P+VVT++ILI+SL 
Sbjct: 268 LFRELPAKGFKANVVSYNILLRCLCCDGRWEEANSLLAEMDGGDRAPSVVTYNILINSLA 327

Query: 319 REGRVREAVNVLKVMKEKG--LTPDAYSYDPLISAFCKEGRLDLAIEYLHKMVSDGCLPD 378
             GR  +A+ VLK M +        A SY+P+I+  CKEG++DL ++ L +M+   C P+
Sbjct: 328 FHGRTEQALQVLKEMSKGNHQFRVTATSYNPVIARLCKEGKVDLVVKCLDEMIYRRCKPN 387

Query: 379 IVNYNTILATLCKFGS-ADLALDIFEKLDEVGCPPNVSSYNTMFSALWSSGNKIKALEMI 438
              YN I  +LC+  S    A  I + L           Y ++ ++L   GN   A +++
Sbjct: 388 EGTYNAI-GSLCEHNSKVQEAFYIIQSLSNKQKCCTHDFYKSVITSLCRKGNTFAAFQLL 447

Query: 439 SEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDM-EATSFQPTVISFNIVLLGMCK 498
            EM R G DPD  TY++LI  LC +G+   A+ +L  M E+ + +PTV +FN ++LG+CK
Sbjct: 448 YEMTRCGFDPDAHTYSALIRGLCLEGMFTGAMEVLSIMEESENCKPTVDNFNAMILGLCK 507

Query: 499 VHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRPEAMELANSLYRLGVICEDSS 558
           + R    +E+   MVEK  +PNET+Y +L+EGIA+      A E+ + L    VI +++ 
Sbjct: 508 IRRTDLAMEVFEMMVEKKRMPNETTYAILVEGIAHEDELELAKEVLDELRLRKVIGQNAV 567

Query: 559 KRLNKTF 561
            R+   F
Sbjct: 568 DRIVMQF 573

BLAST of HG10007827 vs. ExPASy Swiss-Prot
Match: Q9LFF1 (Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=MEE40 PE=2 SV=1)

HSP 1 Score: 285.8 bits (730), Expect = 1.1e-75
Identity = 152/523 (29.06%), Postives = 265/523 (50.67%), Query Frame = 0

Query: 68  LLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVMEILETYG 127
           L+   CRA +   ++  LE + S G  PD    T +++G+    +L  A+R+ E +  +G
Sbjct: 195 LIKALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYIEEGDLDGALRIREQMVEFG 254

Query: 128 -------------------------------------DPDVYSYNAMISGFSKANQIESA 187
                                                 PD Y++N +++G  KA  ++ A
Sbjct: 255 CSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPDQYTFNTLVNGLCKAGHVKHA 314

Query: 188 NQVFDRMRIRGFSPDVVTYNIMIGSLCSRGKLELAFEVMDELLKDGCKPSVITYTILIEA 247
            ++ D M   G+ PDV TYN +I  LC  G+++ A EV+D+++   C P+ +TY  LI  
Sbjct: 315 IEIMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQMITRDCSPNTVTYNTLIST 374

Query: 248 TILEGRINEALELFDELLSRGLRPDLYTYNAIIRGICKEGMEDRAVEFVRELSARGCNPD 307
              E ++ EA EL   L S+G+ PD+ T+N++I+G+C       A+E   E+ ++GC PD
Sbjct: 375 LCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAMELFEEMRSKGCEPD 434

Query: 308 AISYNILLRSFLNKSRWEDGEKLMKDMVLSGCEPNVVTHSILISSLCREGRVREAVNVLK 367
             +YN+L+ S  +K + ++   ++K M LSGC  +V+T++ LI   C+  + REA  +  
Sbjct: 435 EFTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDGFCKANKTREAEEIFD 494

Query: 368 VMKEKGLTPDAYSYDPLISAFCKEGRLDLAIEYLHKMVSDGCLPDIVNYNTILATLCKFG 427
            M+  G++ ++ +Y+ LI   CK  R++ A + + +M+ +G  PD   YN++L   C+ G
Sbjct: 495 EMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDKYTYNSLLTHFCRGG 554

Query: 428 SADLALDIFEKLDEVGCPPNVSSYNTMFSALWSSGNKIKALEMISEMIRKGIDPDEITYN 487
               A DI + +   GC P++ +Y T+ S L  +G    A +++  +  KGI+     YN
Sbjct: 555 DIKKAADIVQAMTSNGCEPDIVTYGTLISGLCKAGRVEVASKLLRSIQMKGINLTPHAYN 614

Query: 488 SLISCLCRDGLVDEAIGLLVDM-EATSFQPTVISFNIVLLGMCK-VHRVFEGIELLITMV 547
            +I  L R     EAI L  +M E     P  +S+ IV  G+C     + E ++ L+ ++
Sbjct: 615 PVIQGLFRKRKTTEAINLFREMLEQNEAPPDAVSYRIVFRGLCNGGGPIREAVDFLVELL 674

Query: 548 EKGCLPNETSYVLLIEGIAYAGWRPEAMELANSLYRLGVICED 552
           EKG +P  +S  +L EG+         ++L N + +     E+
Sbjct: 675 EKGFVPEFSSLYMLAEGLLTLSMEETLVKLVNMVMQKARFSEE 717

BLAST of HG10007827 vs. ExPASy Swiss-Prot
Match: Q9SXD1 (Pentatricopeptide repeat-containing protein At1g62670, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g62670 PE=3 SV=2)

HSP 1 Score: 279.6 bits (714), Expect = 7.8e-74
Identity = 141/480 (29.38%), Postives = 258/480 (53.75%), Query Frame = 0

Query: 68  LLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVMEILETYG 127
           L+N  CR  +   +L  L  ++  G++P++V  + L+ G+ +S+ + +AV +++ +   G
Sbjct: 122 LINCFCRRSQLPLALAVLGKMMKLGYEPNIVTLSSLLNGYCHSKRISEAVALVDQMFVTG 181

Query: 128 -DPDVYSYNAMISGFSKANQIESANQVFDRMRIRGFSPDVVTYNIMIGSLCSRGKLELAF 187
             P+  ++N +I G    N+   A  + DRM  +G  PD+VTY +++  LC RG  +LAF
Sbjct: 182 YQPNTVTFNTLIHGLFLHNKASEAMALIDRMVAKGCQPDLVTYGVVVNGLCKRGDTDLAF 241

Query: 188 EVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYNAIIRGI 247
            +++++ +   +P V+ Y  +I+       +++AL LF E+ ++G+RP++ TY+++I  +
Sbjct: 242 NLLNKMEQGKLEPGVLIYNTIIDGLCKYKHMDDALNLFKEMETKGIRPNVVTYSSLISCL 301

Query: 248 CKEGMEDRAVEFVRELSARGCNPDAISYNILLRSFLNKSRWEDGEKLMKDMVLSGCEPNV 307
           C  G    A   + ++  R  NPD  +++ L+ +F+ + +  + EKL  +MV    +P++
Sbjct: 302 CNYGRWSDASRLLSDMIERKINPDVFTFSALIDAFVKEGKLVEAEKLYDEMVKRSIDPSI 361

Query: 308 VTHSILISSLCREGRVREAVNVLKVMKEKGLTPDAYSYDPLISAFCKEGRLDLAIEYLHK 367
           VT+S LI+  C   R+ EA  + + M  K   PD  +Y+ LI  FCK  R++  +E   +
Sbjct: 362 VTYSSLINGFCMHDRLDEAKQMFEFMVSKHCFPDVVTYNTLIKGFCKYKRVEEGMEVFRE 421

Query: 368 MVSDGCLPDIVNYNTILATLCKFGSADLALDIFEKLDEVGCPPNVSSYNTMFSALWSSGN 427
           M   G + + V YN ++  L + G  D+A +IF+++   G PPN+ +YNT+   L  +G 
Sbjct: 422 MSQRGLVGNTVTYNILIQGLFQAGDCDMAQEIFKEMVSDGVPPNIMTYNTLLDGLCKNGK 481

Query: 428 KIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPTVISFNI 487
             KA+ +   + R  ++P   TYN +I  +C+ G V++   L  ++     +P V+++N 
Sbjct: 482 LEKAMVVFEYLQRSKMEPTIYTYNIMIEGMCKAGKVEDGWDLFCNLSLKGVKPDVVAYNT 541

Query: 488 VLLGMCKVHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRPEAMELANSLYRLG 547
           ++ G C+     E   L   M E G LPN   Y  LI      G R  + EL   +   G
Sbjct: 542 MISGFCRKGSKEEADALFKEMKEDGTLPNSGCYNTLIRARLRDGDREASAELIKEMRSCG 601

BLAST of HG10007827 vs. ExPASy TrEMBL
Match: A0A5A7T4J1 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold270G002930 PE=4 SV=1)

HSP 1 Score: 1114.4 bits (2881), Expect = 0.0e+00
Identity = 554/581 (95.35%), Postives = 566/581 (97.42%), Query Frame = 0

Query: 1   MFSSEFLPQNLHFTNPLSKPTIPQSHSDSLVTRKFPNKTHLRNGASSAESREPHFPNLHN 60
           MFSSEFLPQ+LHFTNPLSKPTIPQSHSDS+ TR+F NKT+LRN  SSAESR+PHFPNL N
Sbjct: 1   MFSSEFLPQSLHFTNPLSKPTIPQSHSDSIPTRRFSNKTYLRNVTSSAESRQPHFPNLDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM 120
           RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM
Sbjct: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIESANQVFDRMRIRGFSPDVVTYNIMIGSLCSRG 180
           EILETYGDPDVYSYNAMISGFSKANQI+SANQVFDRMR RGFSPD+VTYNIMIGSLCSRG
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDIVTYNIMIGSLCSRG 180

Query: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN 240
           KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN
Sbjct: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN 240

Query: 241 AIIRGICKEGMEDRAVEFVRELSARGCNPDAISYNILLRSFLNKSRWEDGEKLMKDMVLS 300
           AIIRGICKEGMEDRAV+FVR+LSARGCNPD +SYNILLRSFLNKSRWEDGEKLMKDMVLS
Sbjct: 241 AIIRGICKEGMEDRAVDFVRDLSARGCNPDVVSYNILLRSFLNKSRWEDGEKLMKDMVLS 300

Query: 301 GCEPNVVTHSILISSLCREGRVREAVNVLKVMKEKGLTPDAYSYDPLISAFCKEGRLDLA 360
           GCEPNVVTHSILISS CREGRVREAVNVL+VMKEKGLTPDAYSYDPLISAFCKEGRLDLA
Sbjct: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDAYSYDPLISAFCKEGRLDLA 360

Query: 361 IEYLHKMVSDGCLPDIVNYNTILATLCKFGSADLALDIFEKLDEVGCPPNVSSYNTMFSA 420
           IEYL KMVSDGCLPDIVNYNTILATLCKFG ADLALDIFEKLD+VGCPPNVSSYNTMFSA
Sbjct: 361 IEYLDKMVSDGCLPDIVNYNTILATLCKFGCADLALDIFEKLDQVGCPPNVSSYNTMFSA 420

Query: 421 LWSSGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT 480
           LWS GNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT
Sbjct: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT 480

Query: 481 VISFNIVLLGMCKVHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRPEAMELAN 540
           VISFNIVLLGMCK HRVFEGIELLITMVEKGC PNETSYVLLIEGIAYAGWR EAMELAN
Sbjct: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCPPNETSYVLLIEGIAYAGWRAEAMELAN 540

Query: 541 SLYRLGVICEDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 582
           SLYRLGVI EDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS
Sbjct: 541 SLYRLGVISEDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 581

BLAST of HG10007827 vs. ExPASy TrEMBL
Match: A0A5D3BAX6 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold506G00380 PE=4 SV=1)

HSP 1 Score: 1114.0 bits (2880), Expect = 0.0e+00
Identity = 554/581 (95.35%), Postives = 565/581 (97.25%), Query Frame = 0

Query: 1   MFSSEFLPQNLHFTNPLSKPTIPQSHSDSLVTRKFPNKTHLRNGASSAESREPHFPNLHN 60
           MFSSEFLPQ+ HFTNPLSKPTIPQSHSDS+ TR+F NKT+LRN  SSAESR+PHFPNL N
Sbjct: 1   MFSSEFLPQSFHFTNPLSKPTIPQSHSDSIPTRRFSNKTYLRNVTSSAESRQPHFPNLDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM 120
           RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM
Sbjct: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIESANQVFDRMRIRGFSPDVVTYNIMIGSLCSRG 180
           EILETYGDPDVYSYNAMISGFSKANQI+SANQVFDRMR RGFSPD+VTYNIMIGSLCSRG
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDIVTYNIMIGSLCSRG 180

Query: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN 240
           KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN
Sbjct: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN 240

Query: 241 AIIRGICKEGMEDRAVEFVRELSARGCNPDAISYNILLRSFLNKSRWEDGEKLMKDMVLS 300
           AIIRGICKEGMEDRAV+FVR+LSARGCNPD +SYNILLRSFLNKSRWEDGEKLMKDMVLS
Sbjct: 241 AIIRGICKEGMEDRAVDFVRDLSARGCNPDVVSYNILLRSFLNKSRWEDGEKLMKDMVLS 300

Query: 301 GCEPNVVTHSILISSLCREGRVREAVNVLKVMKEKGLTPDAYSYDPLISAFCKEGRLDLA 360
           GCEPNVVTHSILISS CREGRVREAVNVL+VMKEKGLTPDAYSYDPLISAFCKEGRLDLA
Sbjct: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDAYSYDPLISAFCKEGRLDLA 360

Query: 361 IEYLHKMVSDGCLPDIVNYNTILATLCKFGSADLALDIFEKLDEVGCPPNVSSYNTMFSA 420
           IEYL KMVSDGCLPDIVNYNTILATLCKFG ADLALDIFEKLDEVGCPPNVSSYNTMFSA
Sbjct: 361 IEYLDKMVSDGCLPDIVNYNTILATLCKFGCADLALDIFEKLDEVGCPPNVSSYNTMFSA 420

Query: 421 LWSSGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT 480
           LWS GNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT
Sbjct: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT 480

Query: 481 VISFNIVLLGMCKVHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRPEAMELAN 540
           VISFNIVLLGMCK HRVFEGIELLITMVEKGC PNETSYVLLIEGIAYAGWR EAMELAN
Sbjct: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCPPNETSYVLLIEGIAYAGWRAEAMELAN 540

Query: 541 SLYRLGVICEDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 582
           SLYRLGVI EDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS
Sbjct: 541 SLYRLGVISEDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 581

BLAST of HG10007827 vs. ExPASy TrEMBL
Match: A0A1S3B9K3 (pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103487275 PE=4 SV=1)

HSP 1 Score: 1113.2 bits (2878), Expect = 0.0e+00
Identity = 554/581 (95.35%), Postives = 565/581 (97.25%), Query Frame = 0

Query: 1   MFSSEFLPQNLHFTNPLSKPTIPQSHSDSLVTRKFPNKTHLRNGASSAESREPHFPNLHN 60
           MFSSEFLPQ+LHFTNPLSKPTIPQSHSDS+ TR+F NKT+LRN  SSAESR+PHFPNL N
Sbjct: 1   MFSSEFLPQSLHFTNPLSKPTIPQSHSDSIPTRRFSNKTYLRNVTSSAESRQPHFPNLDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM 120
           RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM
Sbjct: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIESANQVFDRMRIRGFSPDVVTYNIMIGSLCSRG 180
           EILETYGDPDVYSYNAMISGFSKANQI+SANQVFDRMR RGFSPD+VTYNIMIGSLCSRG
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDIVTYNIMIGSLCSRG 180

Query: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN 240
           KL LAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN
Sbjct: 181 KLALAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN 240

Query: 241 AIIRGICKEGMEDRAVEFVRELSARGCNPDAISYNILLRSFLNKSRWEDGEKLMKDMVLS 300
           AIIRGICKEGMEDRAV+FVR+LSARGCNPD +SYNILLRSFLNKSRWEDGEKLMKDMVLS
Sbjct: 241 AIIRGICKEGMEDRAVDFVRDLSARGCNPDVVSYNILLRSFLNKSRWEDGEKLMKDMVLS 300

Query: 301 GCEPNVVTHSILISSLCREGRVREAVNVLKVMKEKGLTPDAYSYDPLISAFCKEGRLDLA 360
           GCEPNVVTHSILISS CREGRVREAVNVL+VMKEKGLTPDAYSYDPLISAFCKEGRLDLA
Sbjct: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDAYSYDPLISAFCKEGRLDLA 360

Query: 361 IEYLHKMVSDGCLPDIVNYNTILATLCKFGSADLALDIFEKLDEVGCPPNVSSYNTMFSA 420
           IEYL KMVSDGCLPDIVNYNTILATLCKFG ADLALDIFEKLDEVGCPPNVSSYNTMFSA
Sbjct: 361 IEYLDKMVSDGCLPDIVNYNTILATLCKFGCADLALDIFEKLDEVGCPPNVSSYNTMFSA 420

Query: 421 LWSSGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT 480
           LWS GNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT
Sbjct: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT 480

Query: 481 VISFNIVLLGMCKVHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRPEAMELAN 540
           VISFNIVLLGMCK HRVFEGIELLITMVEKGC PNETSYVLLIEGIAYAGWR EAMELAN
Sbjct: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCPPNETSYVLLIEGIAYAGWRAEAMELAN 540

Query: 541 SLYRLGVICEDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 582
           SLYRLGVI EDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS
Sbjct: 541 SLYRLGVISEDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 581

BLAST of HG10007827 vs. ExPASy TrEMBL
Match: A0A0A0M3C6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G666460 PE=4 SV=1)

HSP 1 Score: 1092.8 bits (2825), Expect = 0.0e+00
Identity = 542/581 (93.29%), Postives = 558/581 (96.04%), Query Frame = 0

Query: 1   MFSSEFLPQNLHFTNPLSKPTIPQSHSDSLVTRKFPNKTHLRNGASSAESREPHFPNLHN 60
           MFSSEFLPQ+LHFTNPL+KPTIPQS SDS+   +F NKTHLRN  SSAE R+PHFPNL N
Sbjct: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM 120
           RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKA+RVM
Sbjct: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIESANQVFDRMRIRGFSPDVVTYNIMIGSLCSRG 180
           EILETYGDPDVYSYNAMISGFSKANQI+SANQVFDRMR RGFSPDVVTYNIMIGSLCSRG
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRG 180

Query: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN 240
           KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDEL+SRGLRPDLYTYN
Sbjct: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYN 240

Query: 241 AIIRGICKEGMEDRAVEFVRELSARGCNPDAISYNILLRSFLNKSRWEDGEKLMKDMVLS 300
           AIIRGICKEGMEDRA++FVR LSARGCNPD +SYNILLRSFLNKSRWEDGE+LMKDMVLS
Sbjct: 241 AIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLS 300

Query: 301 GCEPNVVTHSILISSLCREGRVREAVNVLKVMKEKGLTPDAYSYDPLISAFCKEGRLDLA 360
           GCEPNVVTHSILISS CREGRVREAVNVL+VMKEKGLTPD+YSYDPLISAFCKEGRLDLA
Sbjct: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLA 360

Query: 361 IEYLHKMVSDGCLPDIVNYNTILATLCKFGSADLALDIFEKLDEVGCPPNVSSYNTMFSA 420
           IEYL KMVSDGCLPDIVNYNTILATLCKFG ADLALD+FEKLDEVGCPP V +YNTMFSA
Sbjct: 361 IEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSA 420

Query: 421 LWSSGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT 480
           LWS GNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEAT FQPT
Sbjct: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPT 480

Query: 481 VISFNIVLLGMCKVHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRPEAMELAN 540
           VISFNIVLLGMCK HRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWR EAMELAN
Sbjct: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELAN 540

Query: 541 SLYRLGVICEDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 582
           SLYRLGVI  DSSKRLNKTFPMLDVYKGLSLSESKNQLLQS
Sbjct: 541 SLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 581

BLAST of HG10007827 vs. ExPASy TrEMBL
Match: A0A6J1H8M7 (pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111461503 PE=4 SV=1)

HSP 1 Score: 1047.0 bits (2706), Expect = 3.0e-302
Identity = 520/571 (91.07%), Postives = 542/571 (94.92%), Query Frame = 0

Query: 1   MFSSEFLPQNLHFTNPLSKPTIPQSHSDSLVTRKFPNKTHLRNGASSAESREPHFPNLHN 60
           MFSSE L Q+LHF NPLS PTIPQSHS S  TR+FPNKTHLRNGASSAE+REPH P L N
Sbjct: 1   MFSSELLSQSLHFINPLSNPTIPQSHSSSF-TRRFPNKTHLRNGASSAETREPHDPILDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM 120
           R+ HLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKA+RVM
Sbjct: 61  RETHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIESANQVFDRMRIRGFSPDVVTYNIMIGSLCSRG 180
           EILETYGDPDVYSYNAMISGFSKANQIESAN+VFDRMR RGFSPDVVTYNI+IGSLCSRG
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIESANKVFDRMRRRGFSPDVVTYNILIGSLCSRG 180

Query: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN 240
           KLELA+EV+DELLKDGC+PSVITYTILIEATIL+GRI EAL+L DELLSRGLRPD YTYN
Sbjct: 181 KLELAYEVLDELLKDGCEPSVITYTILIEATILDGRIREALKLLDELLSRGLRPDRYTYN 240

Query: 241 AIIRGICKEGMEDRAVEFVRELSARGCNPDAISYNILLRSFLNKSRWEDGEKLMKDMVLS 300
           AIIRGICKEGMED+AVEFVR+L ARGCNPD ISYNILLRS LNKSRW DGE+LMKDMV S
Sbjct: 241 AIIRGICKEGMEDQAVEFVRDLLARGCNPDVISYNILLRSLLNKSRWGDGERLMKDMVSS 300

Query: 301 GCEPNVVTHSILISSLCREGRVREAVNVLKVMKEKGLTPDAYSYDPLISAFCKEGRLDLA 360
           GCEPNVVTHSILISSLCREGRV EAVNVLKVMK+KGLTPDAYSYDPLISAFCKEGRLDLA
Sbjct: 301 GCEPNVVTHSILISSLCREGRVEEAVNVLKVMKQKGLTPDAYSYDPLISAFCKEGRLDLA 360

Query: 361 IEYLHKMVSDGCLPDIVNYNTILATLCKFGSADLALDIFEKLDEVGCPPNVSSYNTMFSA 420
           IEYLHKMVSDGCLPDIVNYN+ILATLCKFGSADLALDIFEKLDEVGCPPNVSSYNTMFSA
Sbjct: 361 IEYLHKMVSDGCLPDIVNYNSILATLCKFGSADLALDIFEKLDEVGCPPNVSSYNTMFSA 420

Query: 421 LWSSGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT 480
           LWS GNKIKALEMISEMI KGID DEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT
Sbjct: 421 LWSCGNKIKALEMISEMIGKGIDADEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT 480

Query: 481 VISFNIVLLGMCKVHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRPEAMELAN 540
           VISFNIVLLG+CK HRVFEGIELL TMVEKGC PNETSYVLLIEGIAYAGWR EAMELAN
Sbjct: 481 VISFNIVLLGLCKAHRVFEGIELLTTMVEKGCQPNETSYVLLIEGIAYAGWRAEAMELAN 540

Query: 541 SLYRLGVICEDSSKRLNKTFPMLDVYKGLSL 572
           +LYR+GVICE+SSKRLNK FPML+VYKGLSL
Sbjct: 541 ALYRMGVICEESSKRLNKIFPMLEVYKGLSL 570

BLAST of HG10007827 vs. TAIR 10
Match: AT3G04760.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 731.5 bits (1887), Expect = 5.3e-211
Identity = 352/555 (63.42%), Postives = 440/555 (79.28%), Query Frame = 0

Query: 11  LHFTNPLSKPTIPQSHSDSLVTRKFPNKTHLRNGASSAESREPHFPNLHNRDAHLMKLLN 70
           L F+N  S P      S S    +    T   +     E R+ H  +L  RD  ++K+ +
Sbjct: 40  LTFSN--SNPNNDNGRSFSSSGARNLQTTTTTDATLPTERRQQHSQSLGFRDTQMLKIFH 99

Query: 71  RSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVMEILETYGDPD 130
           RSCR+G + ESL+ LE++V KG+ PDV+LCTKLIKGFF  RN+ KAVRVMEILE +G PD
Sbjct: 100 RSCRSGNYIESLHLLETMVRKGYNPDVILCTKLIKGFFTLRNIPKAVRVMEILEKFGQPD 159

Query: 131 VYSYNAMISGFSKANQIESANQVFDRMRIRGFSPDVVTYNIMIGSLCSRGKLELAFEVMD 190
           V++YNA+I+GF K N+I+ A +V DRMR + FSPD VTYNIMIGSLCSRGKL+LA +V++
Sbjct: 160 VFAYNALINGFCKMNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALKVLN 219

Query: 191 ELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYNAIIRGICKEG 250
           +LL D C+P+VITYTILIEAT+LEG ++EAL+L DE+LSRGL+PD++TYN IIRG+CKEG
Sbjct: 220 QLLSDNCQPTVITYTILIEATMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGMCKEG 279

Query: 251 MEDRAVEFVRELSARGCNPDAISYNILLRSFLNKSRWEDGEKLMKDMVLSGCEPNVVTHS 310
           M DRA E VR L  +GC PD ISYNILLR+ LN+ +WE+GEKLM  M    C+PNVVT+S
Sbjct: 280 MVDRAFEMVRNLELKGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNVVTYS 339

Query: 311 ILISSLCREGRVREAVNVLKVMKEKGLTPDAYSYDPLISAFCKEGRLDLAIEYLHKMVSD 370
           ILI++LCR+G++ EA+N+LK+MKEKGLTPDAYSYDPLI+AFC+EGRLD+AIE+L  M+SD
Sbjct: 340 ILITTLCRDGKIEEAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLETMISD 399

Query: 371 GCLPDIVNYNTILATLCKFGSADLALDIFEKLDEVGCPPNVSSYNTMFSALWSSGNKIKA 430
           GCLPDIVNYNT+LATLCK G AD AL+IF KL EVGC PN SSYNTMFSALWSSG+KI+A
Sbjct: 400 GCLPDIVNYNTVLATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGDKIRA 459

Query: 431 LEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPTVISFNIVLLG 490
           L MI EM+  GIDPDEITYNS+ISCLCR+G+VDEA  LLVDM +  F P+V+++NIVLLG
Sbjct: 460 LHMILEMMSNGIDPDEITYNSMISCLCREGMVDEAFELLVDMRSCEFHPSVVTYNIVLLG 519

Query: 491 MCKVHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRPEAMELANSLYRLGVICE 550
            CK HR+ + I +L +MV  GC PNET+Y +LIEGI +AG+R EAMELAN L R+  I E
Sbjct: 520 FCKAHRIEDAINVLESMVGNGCRPNETTYTVLIEGIGFAGYRAEAMELANDLVRIDAISE 579

Query: 551 DSSKRLNKTFPMLDV 566
            S KRL++TFP+L+V
Sbjct: 580 YSFKRLHRTFPLLNV 592

BLAST of HG10007827 vs. TAIR 10
Match: AT1G09900.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 464.9 bits (1195), Expect = 9.3e-131
Identity = 223/489 (45.60%), Postives = 322/489 (65.85%), Query Frame = 0

Query: 69  LNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVMEILETYGD 128
           L +  R G+  E   FLE++V  G  PD++ CT LI+GF      +KA +++EILE  G 
Sbjct: 109 LRQMVRTGELEEGFKFLENMVYHGNVPDIIPCTTLIRGFCRLGKTRKAAKILEILEGSGA 168

Query: 129 -PDVYSYNAMISGFSKANQIESANQVFDRMRIRGFSPDVVTYNIMIGSLCSRGKLELAFE 188
            PDV +YN MISG+ KA +I +A  V DRM +   SPDVVTYN ++ SLC  GKL+ A E
Sbjct: 169 VPDVITYNVMISGYCKAGEINNALSVLDRMSV---SPDVVTYNTILRSLCDSGKLKQAME 228

Query: 189 VMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYNAIIRGIC 248
           V+D +L+  C P VITYTILIEAT  +  +  A++L DE+  RG  PD+ TYN ++ GIC
Sbjct: 229 VLDRMLQRDCYPDVITYTILIEATCRDSGVGHAMKLLDEMRDRGCTPDVVTYNVLVNGIC 288

Query: 249 KEGMEDRAVEFVRELSARGCNPDAISYNILLRSFLNKSRWEDGEKLMKDMVLSGCEPNVV 308
           KEG  D A++F+ ++ + GC P+ I++NI+LRS  +  RW D EKL+ DM+  G  P+VV
Sbjct: 289 KEGRLDEAIKFLNDMPSSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGFSPSVV 348

Query: 309 THSILISSLCREGRVREAVNVLKVMKEKGLTPDAYSYDPLISAFCKEGRLDLAIEYLHKM 368
           T +ILI+ LCR+G +  A+++L+ M + G  P++ SY+PL+  FCKE ++D AIEYL +M
Sbjct: 349 TFNILINFLCRKGLLGRAIDILEKMPQHGCQPNSLSYNPLLHGFCKEKKMDRAIEYLERM 408

Query: 369 VSDGCLPDIVNYNTILATLCKFGSADLALDIFEKLDEVGCPPNVSSYNTMFSALWSSGNK 428
           VS GC PDIV YNT+L  LCK G  + A++I  +L   GC P + +YNT+   L  +G  
Sbjct: 409 VSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLAKAGKT 468

Query: 429 IKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPTVISFNIV 488
            KA++++ EM  K + PD ITY+SL+  L R+G VDEAI    + E    +P  ++FN +
Sbjct: 469 GKAIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAIKFFHEFERMGIRPNAVTFNSI 528

Query: 489 LLGMCKVHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRPEAMELANSLYRLGV 548
           +LG+CK  +    I+ L+ M+ +GC PNETSY +LIEG+AY G   EA+EL N L   G+
Sbjct: 529 MLGLCKSRQTDRAIDFLVFMINRGCKPNETSYTILIEGLAYEGMAKEALELLNELCNKGL 588

Query: 549 ICEDSSKRL 557
           + + S++++
Sbjct: 589 MKKSSAEQV 594

BLAST of HG10007827 vs. TAIR 10
Match: AT1G79080.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 310.5 bits (794), Expect = 2.9e-84
Identity = 173/487 (35.52%), Postives = 278/487 (57.08%), Query Frame = 0

Query: 79  NESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVMEILETYG-DPDVYSYNAM 138
           ++S   LES+V+ G KP+V   T+L+     +  LKKA+RV+E++ + G  PD  +Y  +
Sbjct: 88  SDSFSHLESLVTGGHKPNVAHSTQLLYDLCKANRLKKAIRVIELMVSSGIIPDASAYTYL 147

Query: 139 ISGFSKANQIESANQVFDRMRIRGFSPDVVTYNIMIGSLCSRGKLELAFEVMDELLKDGC 198
           ++   K   +  A Q+ ++M   G+  + VTYN ++  LC  G L  + + ++ L++ G 
Sbjct: 148 VNQLCKRGNVGYAMQLVEKMEDHGYPSNTVTYNALVRGLCMLGSLNQSLQFVERLMQKGL 207

Query: 199 KPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYNAIIRGICKEGMEDRAVE 258
            P+  TY+ L+EA   E   +EA++L DE++ +G  P+L +YN ++ G CKEG  D A+ 
Sbjct: 208 APNAFTYSFLLEAAYKERGTDEAVKLLDEIIVKGGEPNLVSYNVLLTGFCKEGRTDDAMA 267

Query: 259 FVRELSARGCNPDAISYNILLRSFLNKSRWEDGEKLMKDMVLSGCEPNVVTHSILISSLC 318
             REL A+G   + +SYNILLR      RWE+   L+ +M      P+VVT++ILI+SL 
Sbjct: 268 LFRELPAKGFKANVVSYNILLRCLCCDGRWEEANSLLAEMDGGDRAPSVVTYNILINSLA 327

Query: 319 REGRVREAVNVLKVMKEKG--LTPDAYSYDPLISAFCKEGRLDLAIEYLHKMVSDGCLPD 378
             GR  +A+ VLK M +        A SY+P+I+  CKEG++DL ++ L +M+   C P+
Sbjct: 328 FHGRTEQALQVLKEMSKGNHQFRVTATSYNPVIARLCKEGKVDLVVKCLDEMIYRRCKPN 387

Query: 379 IVNYNTILATLCKFGS-ADLALDIFEKLDEVGCPPNVSSYNTMFSALWSSGNKIKALEMI 438
              YN I  +LC+  S    A  I + L           Y ++ ++L   GN   A +++
Sbjct: 388 EGTYNAI-GSLCEHNSKVQEAFYIIQSLSNKQKCCTHDFYKSVITSLCRKGNTFAAFQLL 447

Query: 439 SEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDM-EATSFQPTVISFNIVLLGMCK 498
            EM R G DPD  TY++LI  LC +G+   A+ +L  M E+ + +PTV +FN ++LG+CK
Sbjct: 448 YEMTRCGFDPDAHTYSALIRGLCLEGMFTGAMEVLSIMEESENCKPTVDNFNAMILGLCK 507

Query: 499 VHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRPEAMELANSLYRLGVICEDSS 558
           + R    +E+   MVEK  +PNET+Y +L+EGIA+      A E+ + L    VI +++ 
Sbjct: 508 IRRTDLAMEVFEMMVEKKRMPNETTYAILVEGIAHEDELELAKEVLDELRLRKVIGQNAV 567

Query: 559 KRLNKTF 561
            R+   F
Sbjct: 568 DRIVMQF 573

BLAST of HG10007827 vs. TAIR 10
Match: AT3G53700.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 285.8 bits (730), Expect = 7.8e-77
Identity = 152/523 (29.06%), Postives = 265/523 (50.67%), Query Frame = 0

Query: 68  LLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVMEILETYG 127
           L+   CRA +   ++  LE + S G  PD    T +++G+    +L  A+R+ E +  +G
Sbjct: 195 LIKALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYIEEGDLDGALRIREQMVEFG 254

Query: 128 -------------------------------------DPDVYSYNAMISGFSKANQIESA 187
                                                 PD Y++N +++G  KA  ++ A
Sbjct: 255 CSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPDQYTFNTLVNGLCKAGHVKHA 314

Query: 188 NQVFDRMRIRGFSPDVVTYNIMIGSLCSRGKLELAFEVMDELLKDGCKPSVITYTILIEA 247
            ++ D M   G+ PDV TYN +I  LC  G+++ A EV+D+++   C P+ +TY  LI  
Sbjct: 315 IEIMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQMITRDCSPNTVTYNTLIST 374

Query: 248 TILEGRINEALELFDELLSRGLRPDLYTYNAIIRGICKEGMEDRAVEFVRELSARGCNPD 307
              E ++ EA EL   L S+G+ PD+ T+N++I+G+C       A+E   E+ ++GC PD
Sbjct: 375 LCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAMELFEEMRSKGCEPD 434

Query: 308 AISYNILLRSFLNKSRWEDGEKLMKDMVLSGCEPNVVTHSILISSLCREGRVREAVNVLK 367
             +YN+L+ S  +K + ++   ++K M LSGC  +V+T++ LI   C+  + REA  +  
Sbjct: 435 EFTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDGFCKANKTREAEEIFD 494

Query: 368 VMKEKGLTPDAYSYDPLISAFCKEGRLDLAIEYLHKMVSDGCLPDIVNYNTILATLCKFG 427
            M+  G++ ++ +Y+ LI   CK  R++ A + + +M+ +G  PD   YN++L   C+ G
Sbjct: 495 EMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDKYTYNSLLTHFCRGG 554

Query: 428 SADLALDIFEKLDEVGCPPNVSSYNTMFSALWSSGNKIKALEMISEMIRKGIDPDEITYN 487
               A DI + +   GC P++ +Y T+ S L  +G    A +++  +  KGI+     YN
Sbjct: 555 DIKKAADIVQAMTSNGCEPDIVTYGTLISGLCKAGRVEVASKLLRSIQMKGINLTPHAYN 614

Query: 488 SLISCLCRDGLVDEAIGLLVDM-EATSFQPTVISFNIVLLGMCK-VHRVFEGIELLITMV 547
            +I  L R     EAI L  +M E     P  +S+ IV  G+C     + E ++ L+ ++
Sbjct: 615 PVIQGLFRKRKTTEAINLFREMLEQNEAPPDAVSYRIVFRGLCNGGGPIREAVDFLVELL 674

Query: 548 EKGCLPNETSYVLLIEGIAYAGWRPEAMELANSLYRLGVICED 552
           EKG +P  +S  +L EG+         ++L N + +     E+
Sbjct: 675 EKGFVPEFSSLYMLAEGLLTLSMEETLVKLVNMVMQKARFSEE 717

BLAST of HG10007827 vs. TAIR 10
Match: AT1G62670.1 (rna processing factor 2 )

HSP 1 Score: 279.6 bits (714), Expect = 5.6e-75
Identity = 141/480 (29.38%), Postives = 258/480 (53.75%), Query Frame = 0

Query: 68  LLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVMEILETYG 127
           L+N  CR  +   +L  L  ++  G++P++V  + L+ G+ +S+ + +AV +++ +   G
Sbjct: 122 LINCFCRRSQLPLALAVLGKMMKLGYEPNIVTLSSLLNGYCHSKRISEAVALVDQMFVTG 181

Query: 128 -DPDVYSYNAMISGFSKANQIESANQVFDRMRIRGFSPDVVTYNIMIGSLCSRGKLELAF 187
             P+  ++N +I G    N+   A  + DRM  +G  PD+VTY +++  LC RG  +LAF
Sbjct: 182 YQPNTVTFNTLIHGLFLHNKASEAMALIDRMVAKGCQPDLVTYGVVVNGLCKRGDTDLAF 241

Query: 188 EVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYNAIIRGI 247
            +++++ +   +P V+ Y  +I+       +++AL LF E+ ++G+RP++ TY+++I  +
Sbjct: 242 NLLNKMEQGKLEPGVLIYNTIIDGLCKYKHMDDALNLFKEMETKGIRPNVVTYSSLISCL 301

Query: 248 CKEGMEDRAVEFVRELSARGCNPDAISYNILLRSFLNKSRWEDGEKLMKDMVLSGCEPNV 307
           C  G    A   + ++  R  NPD  +++ L+ +F+ + +  + EKL  +MV    +P++
Sbjct: 302 CNYGRWSDASRLLSDMIERKINPDVFTFSALIDAFVKEGKLVEAEKLYDEMVKRSIDPSI 361

Query: 308 VTHSILISSLCREGRVREAVNVLKVMKEKGLTPDAYSYDPLISAFCKEGRLDLAIEYLHK 367
           VT+S LI+  C   R+ EA  + + M  K   PD  +Y+ LI  FCK  R++  +E   +
Sbjct: 362 VTYSSLINGFCMHDRLDEAKQMFEFMVSKHCFPDVVTYNTLIKGFCKYKRVEEGMEVFRE 421

Query: 368 MVSDGCLPDIVNYNTILATLCKFGSADLALDIFEKLDEVGCPPNVSSYNTMFSALWSSGN 427
           M   G + + V YN ++  L + G  D+A +IF+++   G PPN+ +YNT+   L  +G 
Sbjct: 422 MSQRGLVGNTVTYNILIQGLFQAGDCDMAQEIFKEMVSDGVPPNIMTYNTLLDGLCKNGK 481

Query: 428 KIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPTVISFNI 487
             KA+ +   + R  ++P   TYN +I  +C+ G V++   L  ++     +P V+++N 
Sbjct: 482 LEKAMVVFEYLQRSKMEPTIYTYNIMIEGMCKAGKVEDGWDLFCNLSLKGVKPDVVAYNT 541

Query: 488 VLLGMCKVHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRPEAMELANSLYRLG 547
           ++ G C+     E   L   M E G LPN   Y  LI      G R  + EL   +   G
Sbjct: 542 MISGFCRKGSKEEADALFKEMKEDGTLPNSGCYNTLIRARLRDGDREASAELIKEMRSCG 601

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038880759.10.0e+0096.04pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Benincasa ... [more]
KAA0038402.10.0e+0095.35pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa][more]
TYJ96990.10.0e+0095.35pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa][more]
XP_008443759.10.0e+0095.35PREDICTED: pentatricopeptide repeat-containing protein At3g04760, chloroplastic ... [more]
XP_004142590.10.0e+0093.29pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Cucumis sa... [more]
Match NameE-valueIdentityDescription
Q9SR007.5e-21063.42Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidop... [more]
Q3EDF81.3e-12945.60Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana OX... [more]
A3KPF84.1e-8335.52Pentatricopeptide repeat-containing protein At1g79080, chloroplastic OS=Arabidop... [more]
Q9LFF11.1e-7529.06Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidop... [more]
Q9SXD17.8e-7429.38Pentatricopeptide repeat-containing protein At1g62670, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A5A7T4J10.0e+0095.35Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A5D3BAX60.0e+0095.35Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3B9K30.0e+0095.35pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Cucumis ... [more]
A0A0A0M3C60.0e+0093.29Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G666460 PE=4 SV=1[more]
A0A6J1H8M73.0e-30291.07pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Cucurbit... [more]
Match NameE-valueIdentityDescription
AT3G04760.15.3e-21163.42Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT1G09900.19.3e-13145.60Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT1G79080.12.9e-8435.52Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G53700.17.8e-7729.06Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G62670.15.6e-7529.38rna processing factor 2 [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 195..226
e-value: 8.3E-8
score: 31.8
coord: 230..261
e-value: 4.0E-9
score: 36.1
coord: 440..473
e-value: 1.0E-13
score: 50.7
coord: 335..368
e-value: 2.7E-9
score: 36.6
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 129..178
e-value: 5.9E-18
score: 64.8
coord: 269..318
e-value: 7.4E-14
score: 51.7
coord: 374..421
e-value: 6.1E-11
score: 42.4
coord: 479..526
e-value: 1.8E-7
score: 31.3
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 85..122
e-value: 1.9E-5
score: 24.6
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 237..271
e-value: 4.6E-8
score: 30.8
coord: 167..201
e-value: 1.0E-9
score: 36.0
coord: 307..341
e-value: 4.6E-9
score: 33.9
coord: 132..166
e-value: 5.7E-10
score: 36.8
coord: 482..516
e-value: 3.0E-4
score: 18.8
coord: 342..376
e-value: 2.0E-9
score: 35.0
coord: 272..306
e-value: 5.0E-8
score: 30.7
coord: 379..411
e-value: 2.3E-5
score: 22.2
coord: 202..235
e-value: 1.2E-6
score: 26.3
coord: 413..446
e-value: 1.7E-4
score: 19.5
coord: 447..480
e-value: 2.0E-8
score: 31.9
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 165..199
score: 13.361882
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 340..374
score: 12.715165
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 375..409
score: 11.345003
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 130..164
score: 13.328999
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 410..444
score: 11.180584
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 305..339
score: 13.033043
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 235..269
score: 12.539784
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 200..234
score: 11.640958
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 445..479
score: 12.561707
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 270..304
score: 11.695765
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 480..514
score: 9.656963
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 52..180
e-value: 2.1E-30
score: 107.4
coord: 181..283
e-value: 4.6E-31
score: 109.6
coord: 284..388
e-value: 6.5E-31
score: 109.1
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 389..564
e-value: 2.6E-38
score: 134.1
NoneNo IPR availablePANTHERPTHR47932:SF16PENTATRICOPEPTIDE (PPR) REPEAT PROTEINcoord: 1..566
NoneNo IPR availablePANTHERPTHR47932ATPASE EXPRESSION PROTEIN 3coord: 1..566
NoneNo IPR availableSUPERFAMILY81901HCP-likecoord: 107..449

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10007827.1HG10007827.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding