HG10022069 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10022069
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationChr05: 20472738 .. 20474861 (+)
RNA-Seq ExpressionHG10022069
SyntenyHG10022069
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACCATGGAACTCCCTCTCTCCCGATATCAAAACTATGTTTATGATCGCCTTCAATGTAACTCCACTTCTAGCTCTACTTCCTACTTCTCCCTGCGTTTCTCAGATTCCGAGCTTTTTAGGAAGAGATCTTTGCTTTCTAATAGAAGAAAATGCCGTAATTCACTTTGTTGGATCAAGTGCTCTTCGTTTGAACAAGGGCTACGCCCACGGCCCCAACCTAAACCTTCCAAAGTTGATCCGGACGTTCGTAAAGAAACCCCTTTGAAGGAGACCCGTATAAGGAAATCCAGTGTAGGGATATGTAGCCAGATAGAGAAGCTGGTTTTGTGTAAAAAGTATCGAGATGCACTTGAGATGTTTGAAATTTTTGAACTGGAAGGCGGTTTTCAGGTTGGTAACAGCACGTTTGATGCGCTGATTAATGCGTGTATTGGGTTGAAGTCTATAAGAGGGGTGAAGAGGTTGTGTAATTACATGGTTGATAATGGATTTGAACCCGATCAATACATGAGGAACAGGATTCTACTTATGCATGTGAAATGTGGGATGATGATTGATGCTTGTAGATTGTTCGATAAAATGCCTGAAAGGAATGCGGTTTCGTGGAATACTATAATTTCTGGGTATGTAGACTCTGGAAATTATGTTGAAGCGTTTAGATTGTTCATTTTGATGTGGGAAGAGTATTATGATTGTGGGCCTCGCACCTTTGCCACAATGATTCGGGCATCGGCTGGTTTGGAAGTTATTTTTCCTGGTAGGCAATTGCATTCATGTGCGATAAAGGCAGGTCTGGGACAGGACATTTTTGTTTCCTGTGCGCTGATTGACATGTACAGCAAGTGTGGAAGCCTTGAAGATGCTCATTGTGTTTTTGATGAGATGCCCGATAAGACAATAGTTGGATGGAACTCAATTATAGCTGGTTACGCACTCCATGGCTACAGTGAAGAAGCTCTGGATCTATACTATGAGATGCGTGACTCCGGAGTTAAAATGGACCATTTCACCTTTTCTATAATTATAAGAATATGCTCGAGATTGGCCTCGGTAGCTCGTGCTAAGCAAGCGCATGCGAGTTTAGTTCGTAATGGCTTTGGGTTAGATGTAGTAGCTAATACAGCCCTTGTGGATTTCTATAGCAAATGGGGAAAAGTAGATGATGCTAGGCATGTTTTTGACAGGATGTCCTTTAGAAACGTAATATCATGGAATGCTTTGATTGCTGGATATGGGAATCATGGTCGTGGGGAGGAGGCCATTGAGATGTTTGAGAAGATGCTTGGGGAAGGCATGATGCCCAACCATGTGACATATCTTGCAGTTTTATCTGCTTGTAGTATTTCAGGTTTGTTTGAACGTGGATGGGAAATTTTTCAATCGATGACTAGAGATCACAAGATTAAACCGCGCGCTATGCATTATGCGTGCATGATTGAATTGCTAGGTCGAGAAGGGCTCCTAGATGAAGCCTATGCCCTTATAAGGAAAGCTCCATTTCAACCTACAGCAAATATGTGGGCTGCCTTGCTTAGAGCTTGTAGAGTTGATGGAAATCTAGAACTTGGGAAGTTTGCTGCTGAGAAACTTTATGGGATGGAACCTGAGAAGCTTAGTAATTATATTGTGCTTTTAAACATATACAACAGTTCTGGTAAGTTAAAGGAAGCAGCTGATGTTGTTCAGACATTGAAAAGAAAGGGCTTAAGAATGCTTCCAGCATGCAGTTGGATTGAAGTTAATAACCAGCCCCATGCATTCCGGTCTGGGGATAACCACCATGCTCAAATAGAAAAAGTAGTGGGAAAAGTGGATGAATTAATGTTGAAGATCTCAAAGCTTGGTTATGTGCCTGAACAGAACTTCATGCTTCCAGATGTTGATGAACATGAAGAGAAGATACAGATGTACCACAGTGAGAAGTTGGCAATAGCTTATGGAGTATTAAATACTTTAGAACAAACGCCATTGCAGATTGTGCAGAGCCATCGCATTTGTGGTGACTGCCATTCTGTGATTAAGCTGATTGCTATGATAACCAAACGTGAAATTGTGGTCAGAGATGCTAGCAGATTCCATCATTTCAGAGATGGGAGTTGCTCTTGTGGAGACTATTGGTGA

mRNA sequence

ATGACCATGGAACTCCCTCTCTCCCGATATCAAAACTATGTTTATGATCGCCTTCAATGTAACTCCACTTCTAGCTCTACTTCCTACTTCTCCCTGCGTTTCTCAGATTCCGAGCTTTTTAGGAAGAGATCTTTGCTTTCTAATAGAAGAAAATGCCGTAATTCACTTTGTTGGATCAAGTGCTCTTCGTTTGAACAAGGGCTACGCCCACGGCCCCAACCTAAACCTTCCAAAGTTGATCCGGACGTTCGTAAAGAAACCCCTTTGAAGGAGACCCGTATAAGGAAATCCAGTGTAGGGATATGTAGCCAGATAGAGAAGCTGGTTTTGTGTAAAAAGTATCGAGATGCACTTGAGATGTTTGAAATTTTTGAACTGGAAGGCGGTTTTCAGGTTGGTAACAGCACGTTTGATGCGCTGATTAATGCGTGTATTGGGTTGAAGTCTATAAGAGGGGTGAAGAGGTTGTGTAATTACATGGTTGATAATGGATTTGAACCCGATCAATACATGAGGAACAGGATTCTACTTATGCATGTGAAATGTGGGATGATGATTGATGCTTGTAGATTGTTCGATAAAATGCCTGAAAGGAATGCGGTTTCGTGGAATACTATAATTTCTGGGTATGTAGACTCTGGAAATTATGTTGAAGCGTTTAGATTGTTCATTTTGATGTGGGAAGAGTATTATGATTGTGGGCCTCGCACCTTTGCCACAATGATTCGGGCATCGGCTGGTTTGGAAGTTATTTTTCCTGGTAGGCAATTGCATTCATGTGCGATAAAGGCAGGTCTGGGACAGGACATTTTTGTTTCCTGTGCGCTGATTGACATGTACAGCAAGTGTGGAAGCCTTGAAGATGCTCATTGTGTTTTTGATGAGATGCCCGATAAGACAATAGTTGGATGGAACTCAATTATAGCTGGTTACGCACTCCATGGCTACAGTGAAGAAGCTCTGGATCTATACTATGAGATGCGTGACTCCGGAGTTAAAATGGACCATTTCACCTTTTCTATAATTATAAGAATATGCTCGAGATTGGCCTCGGTAGCTCGTGCTAAGCAAGCGCATGCGAGTTTAGTTCGTAATGGCTTTGGGTTAGATGTAGTAGCTAATACAGCCCTTGTGGATTTCTATAGCAAATGGGGAAAAGTAGATGATGCTAGGCATGTTTTTGACAGGATGTCCTTTAGAAACGTAATATCATGGAATGCTTTGATTGCTGGATATGGGAATCATGGTCGTGGGGAGGAGGCCATTGAGATGTTTGAGAAGATGCTTGGGGAAGGCATGATGCCCAACCATGTGACATATCTTGCAGTTTTATCTGCTTGTAGTATTTCAGGTTTGTTTGAACGTGGATGGGAAATTTTTCAATCGATGACTAGAGATCACAAGATTAAACCGCGCGCTATGCATTATGCGTGCATGATTGAATTGCTAGGTCGAGAAGGGCTCCTAGATGAAGCCTATGCCCTTATAAGGAAAGCTCCATTTCAACCTACAGCAAATATGTGGGCTGCCTTGCTTAGAGCTTGTAGAGTTGATGGAAATCTAGAACTTGGGAAGTTTGCTGCTGAGAAACTTTATGGGATGGAACCTGAGAAGCTTAGTAATTATATTGTGCTTTTAAACATATACAACAGTTCTGGTAAGTTAAAGGAAGCAGCTGATGTTGTTCAGACATTGAAAAGAAAGGGCTTAAGAATGCTTCCAGCATGCAGTTGGATTGAAGTTAATAACCAGCCCCATGCATTCCGGTCTGGGGATAACCACCATGCTCAAATAGAAAAAGTAGTGGGAAAAGTGGATGAATTAATGTTGAAGATCTCAAAGCTTGGTTATGTGCCTGAACAGAACTTCATGCTTCCAGATGTTGATGAACATGAAGAGAAGATACAGATGTACCACAGTGAGAAGTTGGCAATAGCTTATGGAGTATTAAATACTTTAGAACAAACGCCATTGCAGATTGTGCAGAGCCATCGCATTTGTGGTGACTGCCATTCTGTGATTAAGCTGATTGCTATGATAACCAAACGTGAAATTGTGGTCAGAGATGCTAGCAGATTCCATCATTTCAGAGATGGGAGTTGCTCTTGTGGAGACTATTGGTGA

Coding sequence (CDS)

ATGACCATGGAACTCCCTCTCTCCCGATATCAAAACTATGTTTATGATCGCCTTCAATGTAACTCCACTTCTAGCTCTACTTCCTACTTCTCCCTGCGTTTCTCAGATTCCGAGCTTTTTAGGAAGAGATCTTTGCTTTCTAATAGAAGAAAATGCCGTAATTCACTTTGTTGGATCAAGTGCTCTTCGTTTGAACAAGGGCTACGCCCACGGCCCCAACCTAAACCTTCCAAAGTTGATCCGGACGTTCGTAAAGAAACCCCTTTGAAGGAGACCCGTATAAGGAAATCCAGTGTAGGGATATGTAGCCAGATAGAGAAGCTGGTTTTGTGTAAAAAGTATCGAGATGCACTTGAGATGTTTGAAATTTTTGAACTGGAAGGCGGTTTTCAGGTTGGTAACAGCACGTTTGATGCGCTGATTAATGCGTGTATTGGGTTGAAGTCTATAAGAGGGGTGAAGAGGTTGTGTAATTACATGGTTGATAATGGATTTGAACCCGATCAATACATGAGGAACAGGATTCTACTTATGCATGTGAAATGTGGGATGATGATTGATGCTTGTAGATTGTTCGATAAAATGCCTGAAAGGAATGCGGTTTCGTGGAATACTATAATTTCTGGGTATGTAGACTCTGGAAATTATGTTGAAGCGTTTAGATTGTTCATTTTGATGTGGGAAGAGTATTATGATTGTGGGCCTCGCACCTTTGCCACAATGATTCGGGCATCGGCTGGTTTGGAAGTTATTTTTCCTGGTAGGCAATTGCATTCATGTGCGATAAAGGCAGGTCTGGGACAGGACATTTTTGTTTCCTGTGCGCTGATTGACATGTACAGCAAGTGTGGAAGCCTTGAAGATGCTCATTGTGTTTTTGATGAGATGCCCGATAAGACAATAGTTGGATGGAACTCAATTATAGCTGGTTACGCACTCCATGGCTACAGTGAAGAAGCTCTGGATCTATACTATGAGATGCGTGACTCCGGAGTTAAAATGGACCATTTCACCTTTTCTATAATTATAAGAATATGCTCGAGATTGGCCTCGGTAGCTCGTGCTAAGCAAGCGCATGCGAGTTTAGTTCGTAATGGCTTTGGGTTAGATGTAGTAGCTAATACAGCCCTTGTGGATTTCTATAGCAAATGGGGAAAAGTAGATGATGCTAGGCATGTTTTTGACAGGATGTCCTTTAGAAACGTAATATCATGGAATGCTTTGATTGCTGGATATGGGAATCATGGTCGTGGGGAGGAGGCCATTGAGATGTTTGAGAAGATGCTTGGGGAAGGCATGATGCCCAACCATGTGACATATCTTGCAGTTTTATCTGCTTGTAGTATTTCAGGTTTGTTTGAACGTGGATGGGAAATTTTTCAATCGATGACTAGAGATCACAAGATTAAACCGCGCGCTATGCATTATGCGTGCATGATTGAATTGCTAGGTCGAGAAGGGCTCCTAGATGAAGCCTATGCCCTTATAAGGAAAGCTCCATTTCAACCTACAGCAAATATGTGGGCTGCCTTGCTTAGAGCTTGTAGAGTTGATGGAAATCTAGAACTTGGGAAGTTTGCTGCTGAGAAACTTTATGGGATGGAACCTGAGAAGCTTAGTAATTATATTGTGCTTTTAAACATATACAACAGTTCTGGTAAGTTAAAGGAAGCAGCTGATGTTGTTCAGACATTGAAAAGAAAGGGCTTAAGAATGCTTCCAGCATGCAGTTGGATTGAAGTTAATAACCAGCCCCATGCATTCCGGTCTGGGGATAACCACCATGCTCAAATAGAAAAAGTAGTGGGAAAAGTGGATGAATTAATGTTGAAGATCTCAAAGCTTGGTTATGTGCCTGAACAGAACTTCATGCTTCCAGATGTTGATGAACATGAAGAGAAGATACAGATGTACCACAGTGAGAAGTTGGCAATAGCTTATGGAGTATTAAATACTTTAGAACAAACGCCATTGCAGATTGTGCAGAGCCATCGCATTTGTGGTGACTGCCATTCTGTGATTAAGCTGATTGCTATGATAACCAAACGTGAAATTGTGGTCAGAGATGCTAGCAGATTCCATCATTTCAGAGATGGGAGTTGCTCTTGTGGAGACTATTGGTGA

Protein sequence

MTMELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLLSNRRKCRNSLCWIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRIRKSSVGICSQIEKLVLCKKYRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPERNAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEKVVGKVDELMLKISKLGYVPEQNFMLPDVDEHEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW
Homology
BLAST of HG10022069 vs. NCBI nr
Match: XP_008459324.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g50390, chloroplastic [Cucumis melo] >XP_008459325.1 PREDICTED: pentatricopeptide repeat-containing protein At5g50390, chloroplastic [Cucumis melo])

HSP 1 Score: 1340.1 bits (3467), Expect = 0.0e+00
Identity = 652/708 (92.09%), Postives = 675/708 (95.34%), Query Frame = 0

Query: 1   MTMELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLLSNRRKCRNSLCWIK 60
           M MELPLSRYQNYVYDRLQC     ST YFSLR+SDS LF K S LSNRRKCRNS CW+K
Sbjct: 1   MNMELPLSRYQNYVYDRLQC----YSTPYFSLRYSDSHLFMKTSFLSNRRKCRNSFCWVK 60

Query: 61  CSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRIRKSSVGICSQIEKLVLCKKYRDALEM 120
           CSSFEQGLRPRPQPKPSK+D  VRKE PLKET +RKSSVGICSQIEKLVLCK+YRDALEM
Sbjct: 61  CSSFEQGLRPRPQPKPSKLDVGVRKEAPLKETPVRKSSVGICSQIEKLVLCKQYRDALEM 120

Query: 121 FEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHV 180
           FEIFELE GF VGNST+DALINACIGLKSIRGVKRL NYMVDNGFEPDQYMRNR+LLMHV
Sbjct: 121 FEIFELEDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHV 180

Query: 181 KCGMMIDACRLFDKMPERNAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFAT 240
           KCGMMIDACRLFD+MPERNAVSW+TIISGYVDSGNYVEAFRLFILMWEE Y CGPRT AT
Sbjct: 181 KCGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLAT 240

Query: 241 MIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKT 300
           MIRASAGLE+IF GRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKT
Sbjct: 241 MIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKT 300

Query: 301 IVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHA 360
           IVGWNSIIAGYALHGYSEEALDLY+EM  SGVKMDHFTFSIIIRICSRLASVARAKQAHA
Sbjct: 301 IVGWNSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHA 360

Query: 361 SLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYGNHGRGEE 420
           SLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMS RNVISWNALIAGYGNHGRGEE
Sbjct: 361 SLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEE 420

Query: 421 AIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMI 480
           AI+MFEKML EGMMPNHVT+LAVLSACSISGLFERGWEIFQSMTRDHK++PRAMH+ACMI
Sbjct: 421 AIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMI 480

Query: 481 ELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLS 540
           ELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRV GNLELGKFAAEKLYGMEPEKLS
Sbjct: 481 ELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHGNLELGKFAAEKLYGMEPEKLS 540

Query: 541 NYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEK 600
           NYIVLLNIYN+SGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAF SGD HH Q+EK
Sbjct: 541 NYIVLLNIYNTSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEK 600

Query: 601 VVGKVDELMLKISKLGYVP-EQNFMLPDVDEHEEKIQMYHSEKLAIAYGVLNTLEQTPLQ 660
           VVGKVDELMLKISKLGYVP EQNFMLPDVDEHEEKI+MYHSEKLAIAYG+LNTLE+TPLQ
Sbjct: 601 VVGKVDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYHSEKLAIAYGLLNTLERTPLQ 660

Query: 661 IVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW 708
           IVQSHRIC DCHSVIKLIAMITKREIV+RDASRFHHFRDG+CSCGDYW
Sbjct: 661 IVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW 704

BLAST of HG10022069 vs. NCBI nr
Match: XP_038890388.1 (pentatricopeptide repeat-containing protein At5g50390, chloroplastic isoform X1 [Benincasa hispida] >XP_038890389.1 pentatricopeptide repeat-containing protein At5g50390, chloroplastic isoform X1 [Benincasa hispida] >XP_038890390.1 pentatricopeptide repeat-containing protein At5g50390, chloroplastic isoform X1 [Benincasa hispida])

HSP 1 Score: 1338.9 bits (3464), Expect = 0.0e+00
Identity = 651/708 (91.95%), Postives = 673/708 (95.06%), Query Frame = 0

Query: 1   MTMELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLLSNRRKCRNSLCWIK 60
           M ME+PLS YQNY+YDR+QCN    STSY SLRFS  +LFR+R  L NRRKCRNSL WIK
Sbjct: 1   MNMEIPLSCYQNYLYDRVQCN----STSYVSLRFSYFDLFRERFFLCNRRKCRNSLRWIK 60

Query: 61  CSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRIRKSSVGICSQIEKLVLCKKYRDALEM 120
           CSSFEQGLRPRPQPKPSK+DP V K TPLKET + +SSVGICSQIEKLVLCKKYRDALEM
Sbjct: 61  CSSFEQGLRPRPQPKPSKLDPGVHKITPLKETHVMQSSVGICSQIEKLVLCKKYRDALEM 120

Query: 121 FEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHV 180
           FEIFELEGGF  GN+T DALINAC+ LKSIRGVK+LCNYMVDNGFEPDQYMRNR+LLMHV
Sbjct: 121 FEIFELEGGFHAGNTTLDALINACVELKSIRGVKKLCNYMVDNGFEPDQYMRNRVLLMHV 180

Query: 181 KCGMMIDACRLFDKMPERNAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFAT 240
           KCGMMIDACRLFD+MPERNAVSWNTIISG+VDSGNYVEAFRLFILMWEEYYDCGPRTFAT
Sbjct: 181 KCGMMIDACRLFDQMPERNAVSWNTIISGHVDSGNYVEAFRLFILMWEEYYDCGPRTFAT 240

Query: 241 MIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKT 300
           MIRASAGLE+IFPGRQLHSCAIKA LGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKT
Sbjct: 241 MIRASAGLELIFPGRQLHSCAIKADLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKT 300

Query: 301 IVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHA 360
           IVGWNSIIAGYALHGYSEEALDLYYEMRDSG+KMDHFTFSIIIRICSRLASVA AKQAHA
Sbjct: 301 IVGWNSIIAGYALHGYSEEALDLYYEMRDSGIKMDHFTFSIIIRICSRLASVACAKQAHA 360

Query: 361 SLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYGNHGRGEE 420
           SLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMS RN+ISWNALIAGYGNHGRG E
Sbjct: 361 SLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNIISWNALIAGYGNHGRGVE 420

Query: 421 AIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMI 480
           AI+MFEKML EG +PNHVT+LAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMI
Sbjct: 421 AIDMFEKMLREGKIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMI 480

Query: 481 ELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLS 540
           ELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRV GNLELGKFAAEKLYGMEPEKLS
Sbjct: 481 ELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHGNLELGKFAAEKLYGMEPEKLS 540

Query: 541 NYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEK 600
           NYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAF SGD HH QIEK
Sbjct: 541 NYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQIEK 600

Query: 601 VVGKVDELMLKISKLGYVP-EQNFMLPDVDEHEEKIQMYHSEKLAIAYGVLNTLEQTPLQ 660
           VVGKVDELMLKISKLGYVP EQNFMLPDVDEHEEKIQMYHSEKLAIAYG+LNTLEQTPLQ
Sbjct: 601 VVGKVDELMLKISKLGYVPEEQNFMLPDVDEHEEKIQMYHSEKLAIAYGLLNTLEQTPLQ 660

Query: 661 IVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW 708
           IVQSHRIC DCH VIKLIAMITKREIV+RDASRFHHFRDGSCSCGDYW
Sbjct: 661 IVQSHRICSDCHFVIKLIAMITKREIVIRDASRFHHFRDGSCSCGDYW 704

BLAST of HG10022069 vs. NCBI nr
Match: XP_004148701.1 (pentatricopeptide repeat-containing protein At5g50390, chloroplastic isoform X1 [Cucumis sativus] >XP_031740897.1 pentatricopeptide repeat-containing protein At5g50390, chloroplastic isoform X1 [Cucumis sativus])

HSP 1 Score: 1336.6 bits (3458), Expect = 0.0e+00
Identity = 653/710 (91.97%), Postives = 675/710 (95.07%), Query Frame = 0

Query: 1   MTMELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLLSNRRKCRNSLCWIK 60
           M MELPLSRYQNYVYDRLQCN    STS+FSLR+SDS+LF K S LSN RK RNS CWIK
Sbjct: 1   MNMELPLSRYQNYVYDRLQCN----STSFFSLRYSDSDLFTKTSFLSNPRKYRNSFCWIK 60

Query: 61  CSSFEQGL--RPRPQPKPSKVDPDVRKETPLKETRIRKSSVGICSQIEKLVLCKKYRDAL 120
           CSSFEQGL  RPRPQPKPSK+D   RKETPLKET ++KSSVGICSQIEKLVLCKKYRDAL
Sbjct: 61  CSSFEQGLRPRPRPQPKPSKLDVGDRKETPLKETHVKKSSVGICSQIEKLVLCKKYRDAL 120

Query: 121 EMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLM 180
           EMFEIFELE GF VG ST+DALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNR+LLM
Sbjct: 121 EMFEIFELEDGFHVGYSTYDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRVLLM 180

Query: 181 HVKCGMMIDACRLFDKMPERNAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTF 240
           HVKCGMMIDACRLFD+MP RNAVSW TIISGYVDSGNYVEAFRLFILM EE+YDCGPRTF
Sbjct: 181 HVKCGMMIDACRLFDEMPARNAVSWGTIISGYVDSGNYVEAFRLFILMREEFYDCGPRTF 240

Query: 241 ATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPD 300
           ATMIRASAGLE+IFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPD
Sbjct: 241 ATMIRASAGLEIIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPD 300

Query: 301 KTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQA 360
           KTIVGWNSIIAGYALHGYSEEALDLY+EMRDSGVKMDHFTFSIIIRICSRLASVARAKQ 
Sbjct: 301 KTIVGWNSIIAGYALHGYSEEALDLYHEMRDSGVKMDHFTFSIIIRICSRLASVARAKQV 360

Query: 361 HASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYGNHGRG 420
           HASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMS RN+ISWNALIAGYGNHG G
Sbjct: 361 HASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNIISWNALIAGYGNHGHG 420

Query: 421 EEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYAC 480
           EEAI+MFEKML EGMMPNHVT+LAVLSACSISGLFERGWEIFQSMTRDHK+KPRAMH+AC
Sbjct: 421 EEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVKPRAMHFAC 480

Query: 481 MIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEK 540
           MIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRV GNLELGKFAAEKLYGMEPEK
Sbjct: 481 MIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHGNLELGKFAAEKLYGMEPEK 540

Query: 541 LSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQI 600
           LSNYIVLLNIYNSSGKLKEAADV QTLKRKGLRMLPACSWIEVNNQPHAF SGD HH QI
Sbjct: 541 LSNYIVLLNIYNSSGKLKEAADVFQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQI 600

Query: 601 EKVVGKVDELMLKISKLGYVP-EQNFMLPDVDEHEEKIQMYHSEKLAIAYGVLNTLEQTP 660
           EKVVGKVDELML ISKLGYVP EQNFMLPDVDE+EEKI+MYHSEKLAIAYG+LNTLE+TP
Sbjct: 601 EKVVGKVDELMLNISKLGYVPEEQNFMLPDVDENEEKIRMYHSEKLAIAYGLLNTLEKTP 660

Query: 661 LQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW 708
           LQIVQSHRIC DCHSVIKLIAMITKREIV+RDASRFHHFRDGSCSCGDYW
Sbjct: 661 LQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGSCSCGDYW 706

BLAST of HG10022069 vs. NCBI nr
Match: XP_022133879.1 (pentatricopeptide repeat-containing protein At5g50390, chloroplastic [Momordica charantia] >XP_022133880.1 pentatricopeptide repeat-containing protein At5g50390, chloroplastic [Momordica charantia])

HSP 1 Score: 1301.2 bits (3366), Expect = 0.0e+00
Identity = 627/714 (87.82%), Postives = 670/714 (93.84%), Query Frame = 0

Query: 1   MTMELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLL------SNRRKCRN 60
           MTME+PL RYQNYVYDRLQC+STSSS+SY  +RF+DS+LFRKRSLL      SNRRK RN
Sbjct: 1   MTMEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRN 60

Query: 61  SLCWIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKE-TRIRKSSVGICSQIEKLVLCKK 120
           S CWIKCSS EQGLRPRP+P+PSK+D DVRK T   E TRIRKS VGICSQIEKLVLCKK
Sbjct: 61  SFCWIKCSSLEQGLRPRPEPRPSKIDHDVRKGTSSNETTRIRKSGVGICSQIEKLVLCKK 120

Query: 121 YRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRN 180
           YRDALEMFEIFELEGG+ +GNST+DALINACIGLKSIRGVKRLCNYM+DNGFEPDQYM+N
Sbjct: 121 YRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKN 180

Query: 181 RILLMHVKCGMMIDACRLFDKMPERNAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDC 240
           RILLMHVKCGMMIDACRLFD+MPERNAVSW+TIISGYVDSGNY+EAFRLFI+MWEE  D 
Sbjct: 181 RILLMHVKCGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDS 240

Query: 241 GPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVF 300
           GPRTFA MIRASAGLE+IFPGRQLHSCAIKAG+GQDIFVSCALIDMYSKCGSLEDAHCVF
Sbjct: 241 GPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVF 300

Query: 301 DEMPDKTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVA 360
           DEMPDKTIVGWNSIIAGYALHGYSEEALDL YEMRDSG+KMDHFTFSIIIRICSRLASVA
Sbjct: 301 DEMPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVA 360

Query: 361 RAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYG 420
           RAKQ HA LVRNGFGLDVVANTALVDFYSKWGK+DDARH+FDRMS +N+ISWNALIAGYG
Sbjct: 361 RAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHIFDRMSHKNIISWNALIAGYG 420

Query: 421 NHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRA 480
           NHGRGEEAI+MFE+ML EGM PNHVT+LAVLSACSISGLFERGWEIFQS+T DHKIKPRA
Sbjct: 421 NHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRA 480

Query: 481 MHYACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYG 540
           MH+ACMIELLGREGLLDEAYALIR APF+PTANMWAALLRACRV  NLELGK AAE LYG
Sbjct: 481 MHFACMIELLGREGLLDEAYALIRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYG 540

Query: 541 MEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDN 600
           MEP+KLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRM+PACSWIEV NQPH+F SGD 
Sbjct: 541 MEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDK 600

Query: 601 HHAQIEKVVGKVDELMLKISKLGYVPEQNFMLPDVDEHEEKIQMYHSEKLAIAYGVLNTL 660
           HHA+IEKVV KVDE+MLKISKLGYV EQNF+LPDVDE EEKI MYHSEKLAIAYG+L+TL
Sbjct: 601 HHAEIEKVVEKVDEIMLKISKLGYVAEQNFLLPDVDEKEEKIHMYHSEKLAIAYGLLSTL 660

Query: 661 EQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW 708
           ++TPLQIVQSHRICGDCHS IKLIA+IT+REIVVRDASRFHHFRDGSCSCGDYW
Sbjct: 661 KKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRFHHFRDGSCSCGDYW 714

BLAST of HG10022069 vs. NCBI nr
Match: KAA0039490.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK15245.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1286.9 bits (3329), Expect = 0.0e+00
Identity = 622/667 (93.25%), Postives = 643/667 (96.40%), Query Frame = 0

Query: 42  KRSLLSNRRKCRNSLCWIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRIRKSSVGI 101
           K S LSNRRKCRNS CW+KCSSFEQGLRPRPQPKPSK+D  VRKE PLKET +RKSSVGI
Sbjct: 2   KTSFLSNRRKCRNSFCWVKCSSFEQGLRPRPQPKPSKLDVGVRKEAPLKETPVRKSSVGI 61

Query: 102 CSQIEKLVLCKKYRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMV 161
           CSQIEKLVLCK+YRDALEMFEIFELE GF VGNST+DALINACIGLKSIRGVKRL NYMV
Sbjct: 62  CSQIEKLVLCKQYRDALEMFEIFELEDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMV 121

Query: 162 DNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPERNAVSWNTIISGYVDSGNYVEAFR 221
           DNGFEPDQYMRNR+LLMHVKCGMMIDACRLFD+MPERNAVSW+TIISGYVDSGNYVEAFR
Sbjct: 122 DNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYVEAFR 181

Query: 222 LFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYS 281
           LFILMWEE Y CGPRT ATMIRASAGLE+IF GRQLHSCAIKAGLGQDIFVSCALIDMYS
Sbjct: 182 LFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYS 241

Query: 282 KCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSI 341
           KCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLY+EM  SGVKMDHFTFSI
Sbjct: 242 KCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSI 301

Query: 342 IIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRN 401
           IIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMS RN
Sbjct: 302 IIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRN 361

Query: 402 VISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQ 461
           VISWNALIAGYGNHGRGEEAI+MFEKML EGMMPNHVT+LAVLSACSISGLFERGWEIFQ
Sbjct: 362 VISWNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQ 421

Query: 462 SMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVDGNL 521
           SMTRDHK++PRAMH+ACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRV GNL
Sbjct: 422 SMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHGNL 481

Query: 522 ELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEV 581
           ELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEV
Sbjct: 482 ELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEV 541

Query: 582 NNQPHAFRSGDNHHAQIEKVVGKVDELMLKISKLGYVP-EQNFMLPDVDEHEEKIQMYHS 641
           NNQPHAF SGD HH Q+EKVVGKVDELMLKISKLGYVP EQNFMLPDVDEHEEKI+MYHS
Sbjct: 542 NNQPHAFLSGDKHHVQLEKVVGKVDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYHS 601

Query: 642 EKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGS 701
           EKLAIAYG+LNTLE+TPLQIVQSHRIC DCHSVIKLIAMITKREIV+RDASRFHHFRDG+
Sbjct: 602 EKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGN 661

Query: 702 CSCGDYW 708
           CSCGDYW
Sbjct: 662 CSCGDYW 668

BLAST of HG10022069 vs. ExPASy Swiss-Prot
Match: Q9FK33 (Pentatricopeptide repeat-containing protein At5g50390, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H58 PE=2 SV=1)

HSP 1 Score: 883.6 bits (2282), Expect = 1.4e-255
Identity = 427/712 (59.97%), Postives = 542/712 (76.12%), Query Frame = 0

Query: 3   MELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLLSNRRKCRNSLCWIKCS 62
           ME+PLSRYQ+   D ++ +S++     F  +FS              R+ +N    + CS
Sbjct: 1   MEIPLSRYQSIRLDEIRDSSSNPKVLTFPRKFS-----------LRGRRWKNPFGRLSCS 60

Query: 63  SFEQGLRPRP--QPKPSKVDPDVRKETPLKETRIRKSSVGICSQIEKLVLCKKYRDALEM 122
           S  QGL+P+P  +P+P +++    K+  L +T+I KS V ICSQIEKLVLC ++R+A E+
Sbjct: 61  SVVQGLKPKPKLKPEPIRIEVKESKDQILDDTQISKSGVTICSQIEKLVLCNRFREAFEL 120

Query: 123 FEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHV 182
           FEI E+   F+VG ST+DAL+ ACI LKSIR VKR+  +M+ NGFEP+QYM NRILLMHV
Sbjct: 121 FEILEIRCSFKVGVSTYDALVEACIRLKSIRCVKRVYGFMMSNGFEPEQYMMNRILLMHV 180

Query: 183 KCGMMIDACRLFDKMPERNAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFAT 242
           KCGM+IDA RLFD++PERN  S+ +IISG+V+ GNYVEAF LF +MWEE  DC   TFA 
Sbjct: 181 KCGMIIDARRLFDEIPERNLYSYYSIISGFVNFGNYVEAFELFKMMWEELSDCETHTFAV 240

Query: 243 MIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKT 302
           M+RASAGL  I+ G+QLH CA+K G+  + FVSC LIDMYSKCG +EDA C F+ MP+KT
Sbjct: 241 MLRASAGLGSIYVGKQLHVCALKLGVVDNTFVSCGLIDMYSKCGDIEDARCAFECMPEKT 300

Query: 303 IVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHA 362
            V WN++IAGYALHGYSEEAL L Y+MRDSGV +D FT SI+IRI ++LA +   KQAHA
Sbjct: 301 TVAWNNVIAGYALHGYSEEALCLLYDMRDSGVSIDQFTLSIMIRISTKLAKLELTKQAHA 360

Query: 363 SLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYGNHGRGEE 422
           SL+RNGF  ++VANTALVDFYSKWG+VD AR+VFD++  +N+ISWNAL+ GY NHGRG +
Sbjct: 361 SLIRNGFESEIVANTALVDFYSKWGRVDTARYVFDKLPRKNIISWNALMGGYANHGRGTD 420

Query: 423 AIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMI 482
           A+++FEKM+   + PNHVT+LAVLSAC+ SGL E+GWEIF SM+  H IKPRAMHYACMI
Sbjct: 421 AVKLFEKMIAANVAPNHVTFLAVLSACAYSGLSEQGWEIFLSMSEVHGIKPRAMHYACMI 480

Query: 483 ELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLS 542
           ELLGR+GLLDEA A IR+AP + T NMWAALL ACR+  NLELG+  AEKLYGM PEKL 
Sbjct: 481 ELLGRDGLLDEAIAFIRRAPLKTTVNMWAALLNACRMQENLELGRVVAEKLYGMGPEKLG 540

Query: 543 NYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIE- 602
           NY+V+ N+YNS GK  EAA V++TL+ KGL M+PAC+W+EV +Q H+F SGD   +  E 
Sbjct: 541 NYVVMYNMYNSMGKTAEAAGVLETLESKGLSMMPACTWVEVGDQTHSFLSGDRFDSYNET 600

Query: 603 ---KVVGKVDELMLKISKLGYVPEQNFMLPDVDE-HEEKIQMYHSEKLAIAYGVLNTLEQ 662
              ++  KVDELM +IS+ GY  E+  +LPDVDE  EE++  YHSEKLAIAYG++NT E 
Sbjct: 601 VKRQIYQKVDELMEEISEYGYSEEEQHLLPDVDEKEEERVGRYHSEKLAIAYGLVNTPEW 660

Query: 663 TPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW 708
            PLQI Q+HRIC +CH V++ I+++T RE+VVRDASRFHHF++G CSCG YW
Sbjct: 661 NPLQITQNHRICKNCHKVVEFISLVTGREMVVRDASRFHHFKEGKCSCGGYW 701

BLAST of HG10022069 vs. ExPASy Swiss-Prot
Match: Q9LIQ7 (Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H87 PE=3 SV=1)

HSP 1 Score: 482.3 bits (1240), Expect = 9.7e-135
Identity = 226/585 (38.63%), Postives = 363/585 (62.05%), Query Frame = 0

Query: 125 ELEGGFQVGNSTF-DALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCG 184
           +LEG +   +  F + L+  C   K +   + +  +++ + F  D  M N +L M+ KCG
Sbjct: 50  DLEGSYIPADRRFYNTLLKKCTVFKLLIQGRIVHAHILQSIFRHDIVMGNTLLNMYAKCG 109

Query: 185 MMIDACRLFDKMPERNAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATMIR 244
            + +A ++F+KMP+R+ V+W T+ISGY       +A   F  M    Y     T +++I+
Sbjct: 110 SLEEARKVFEKMPQRDFVTWTTLISGYSQHDRPCDALLFFNQMLRFGYSPNEFTLSSVIK 169

Query: 245 ASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVG 304
           A+A       G QLH   +K G   ++ V  AL+D+Y++ G ++DA  VFD +  +  V 
Sbjct: 170 AAAAERRGCCGHQLHGFCVKCGFDSNVHVGSALLDLYTRYGLMDDAQLVFDALESRNDVS 229

Query: 305 WNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLV 364
           WN++IAG+A    +E+AL+L+  M   G +  HF+++ +   CS    + + K  HA ++
Sbjct: 230 WNALIAGHARRSGTEKALELFQGMLRDGFRPSHFSYASLFGACSSTGFLEQGKWVHAYMI 289

Query: 365 RNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYGNHGRGEEAIE 424
           ++G  L   A   L+D Y+K G + DAR +FDR++ R+V+SWN+L+  Y  HG G+EA+ 
Sbjct: 290 KSGEKLVAFAGNTLLDMYAKSGSIHDARKIFDRLAKRDVVSWNSLLTAYAQHGFGKEAVW 349

Query: 425 MFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELL 484
            FE+M   G+ PN +++L+VL+ACS SGL + GW  ++ M +D  I P A HY  +++LL
Sbjct: 350 WFEEMRRVGIRPNEISFLSVLTACSHSGLLDEGWHYYELMKKD-GIVPEAWHYVTVVDLL 409

Query: 485 GREGLLDEAYALIRKAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSNYI 544
           GR G L+ A   I + P +PTA +W ALL ACR+  N ELG +AAE ++ ++P+    ++
Sbjct: 410 GRAGDLNRALRFIEEMPIEPTAAIWKALLNACRMHKNTELGAYAAEHVFELDPDDPGPHV 469

Query: 545 VLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEKVVG 604
           +L NIY S G+  +AA V + +K  G++  PACSW+E+ N  H F + D  H Q E++  
Sbjct: 470 ILYNIYASGGRWNDAARVRKKMKESGVKKEPACSWVEIENAIHMFVANDERHPQREEIAR 529

Query: 605 KVDELMLKISKLGYVPEQNFMLPDVDEHEEKIQM-YHSEKLAIAYGVLNTLEQTPLQIVQ 664
           K +E++ KI +LGYVP+ + ++  VD+ E ++ + YHSEK+A+A+ +LNT   + + I +
Sbjct: 530 KWEEVLAKIKELGYVPDTSHVIVHVDQQEREVNLQYHSEKIALAFALLNTPPGSTIHIKK 589

Query: 665 SHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW 708
           + R+CGDCH+ IKL + +  REI+VRD +RFHHF+DG+CSC DYW
Sbjct: 590 NIRVCGDCHTAIKLASKVVGREIIVRDTNRFHHFKDGNCSCKDYW 633

BLAST of HG10022069 vs. ExPASy Swiss-Prot
Match: Q9SI53 (Pentatricopeptide repeat-containing protein At2g03880, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H44 PE=2 SV=1)

HSP 1 Score: 477.2 bits (1227), Expect = 3.1e-133
Identity = 232/580 (40.00%), Postives = 359/580 (61.90%), Query Frame = 0

Query: 129 GFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDA 188
           G    ++T+  LI  CI  +++     +C ++  NG  P  ++ N ++ M+VK  ++ DA
Sbjct: 56  GLWADSATYSELIKCCISNRAVHEGNLICRHLYFNGHRPMMFLVNVLINMYVKFNLLNDA 115

Query: 189 CRLFDKMPERNAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGL 248
            +LFD+MP+RN +SW T+IS Y     + +A  L +LM  +       T+++++R+  G+
Sbjct: 116 HQLFDQMPQRNVISWTTMISAYSKCKIHQKALELLVLMLRDNVRPNVYTYSSVLRSCNGM 175

Query: 249 EVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSII 308
             +   R LH   IK GL  D+FV  ALID+++K G  EDA  VFDEM     + WNSII
Sbjct: 176 SDV---RMLHCGIIKEGLESDVFVRSALIDVFAKLGEPEDALSVFDEMVTGDAIVWNSII 235

Query: 309 AGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFG 368
            G+A +  S+ AL+L+  M+ +G   +  T + ++R C+ LA +    QAH  +V+  + 
Sbjct: 236 GGFAQNSRSDVALELFKRMKRAGFIAEQATLTSVLRACTGLALLELGMQAHVHIVK--YD 295

Query: 369 LDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYGNHGRGEEAIEMFEKM 428
            D++ N ALVD Y K G ++DA  VF++M  R+VI+W+ +I+G   +G  +EA+++FE+M
Sbjct: 296 QDLILNNALVDMYCKCGSLEDALRVFNQMKERDVITWSTMISGLAQNGYSQEALKLFERM 355

Query: 429 LGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGL 488
              G  PN++T + VL ACS +GL E GW  F+SM + + I P   HY CMI+LLG+ G 
Sbjct: 356 KSSGTKPNYITIVGVLFACSHAGLLEDGWYYFRSMKKLYGIDPVREHYGCMIDLLGKAGK 415

Query: 489 LDEAYALIRKAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSNYIVLLNI 548
           LD+A  L+ +   +P A  W  LL ACRV  N+ L ++AA+K+  ++PE    Y +L NI
Sbjct: 416 LDDAVKLLNEMECEPDAVTWRTLLGACRVQRNMVLAEYAAKKVIALDPEDAGTYTLLSNI 475

Query: 549 YNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEKVVGKVDEL 608
           Y +S K     ++   ++ +G++  P CSWIEVN Q HAF  GDN H QI +V  K+++L
Sbjct: 476 YANSQKWDSVEEIRTRMRDRGIKKEPGCSWIEVNKQIHAFIIGDNSHPQIVEVSKKLNQL 535

Query: 609 MLKISKLGYVPEQNFMLPDVD-EHEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRIC 668
           + +++ +GYVPE NF+L D++ E  E    +HSEKLA+A+G++    +  ++I ++ RIC
Sbjct: 536 IHRLTGIGYVPETNFVLQDLEGEQMEDSLRHHSEKLALAFGLMTLPIEKVIRIRKNLRIC 595

Query: 669 GDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW 708
           GDCH   KL + +  R IV+RD  R+HHF+DG CSCGDYW
Sbjct: 596 GDCHVFCKLASKLEIRSIVIRDPIRYHHFQDGKCSCGDYW 630

BLAST of HG10022069 vs. ExPASy Swiss-Prot
Match: Q9S7F4 (Putative pentatricopeptide repeat-containing protein At2g01510 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H36 PE=3 SV=1)

HSP 1 Score: 464.9 bits (1195), Expect = 1.6e-129
Identity = 233/597 (39.03%), Postives = 363/597 (60.80%), Query Frame = 0

Query: 114 YRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRN 173
           Y +++ +F +   + G Q  + TF  ++ A +GL      ++L    V  GF  D  + N
Sbjct: 231 YTESIHLF-LKMRQSGHQPSDFTFSGVLKAVVGLHDFALGQQLHALSVTTGFSRDASVGN 290

Query: 174 RILLMHVKCGMMIDACRLFDKMPERNAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDC 233
           +IL  + K   +++   LFD+MPE + VS+N +IS Y  +  Y  +   F  M    +D 
Sbjct: 291 QILDFYSKHDRVLETRMLFDEMPELDFVSYNVVISSYSQADQYEASLHFFREMQCMGFDR 350

Query: 234 GPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVF 293
               FATM+  +A L  +  GRQLH  A+ A     + V  +L+DMY+KC   E+A  +F
Sbjct: 351 RNFPFATMLSIAANLSSLQMGRQLHCQALLATADSILHVGNSLVDMYAKCEMFEEAELIF 410

Query: 294 DEMPDKTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVA 353
             +P +T V W ++I+GY   G     L L+ +MR S ++ D  TF+ +++  +  AS+ 
Sbjct: 411 KSLPQRTTVSWTALISGYVQKGLHGAGLKLFTKMRGSNLRADQSTFATVLKASASFASLL 470

Query: 354 RAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYG 413
             KQ HA ++R+G   +V + + LVD Y+K G + DA  VF+ M  RN +SWNALI+ + 
Sbjct: 471 LGKQLHAFIIRSGNLENVFSGSGLVDMYAKCGSIKDAVQVFEEMPDRNAVSWNALISAHA 530

Query: 414 NHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRA 473
           ++G GE AI  F KM+  G+ P+ V+ L VL+ACS  G  E+G E FQ+M+  + I P+ 
Sbjct: 531 DNGDGEAAIGAFAKMIESGLQPDSVSILGVLTACSHCGFVEQGTEYFQAMSPIYGITPKK 590

Query: 474 MHYACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYG 533
            HYACM++LLGR G   EA  L+ + PF+P   MW+++L ACR+  N  L + AAEKL+ 
Sbjct: 591 KHYACMLDLLGRNGRFAEAEKLMDEMPFEPDEIMWSSVLNACRIHKNQSLAERAAEKLFS 650

Query: 534 MEP-EKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGD 593
           ME     + Y+ + NIY ++G+ ++  DV + ++ +G++ +PA SW+EVN++ H F S D
Sbjct: 651 MEKLRDAAAYVSMSNIYAAAGEWEKVRDVKKAMRERGIKKVPAYSWVEVNHKIHVFSSND 710

Query: 594 NHHAQIEKVVGKVDELMLKISKLGYVPEQNFMLPDVDEHEEKIQ--MYHSEKLAIAYGVL 653
             H   +++V K++EL  +I + GY P+ + ++ DVDE + KI+   YHSE+LA+A+ ++
Sbjct: 711 QTHPNGDEIVRKINELTAEIEREGYKPDTSSVVQDVDE-QMKIESLKYHSERLAVAFALI 770

Query: 654 NTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW 708
           +T E  P+ ++++ R C DCH+ IKLI+ I KREI VRD SRFHHF +G CSCGDYW
Sbjct: 771 STPEGCPIVVMKNLRACRDCHAAIKLISKIVKREITVRDTSRFHHFSEGVCSCGDYW 825

BLAST of HG10022069 vs. ExPASy Swiss-Prot
Match: Q9LW63 (Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H32 PE=3 SV=1)

HSP 1 Score: 456.4 bits (1173), Expect = 5.7e-127
Identity = 225/611 (36.82%), Postives = 358/611 (58.59%), Query Frame = 0

Query: 134 NSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVK---CGMMIDACR 193
           ++ F +++ +C  +  +R  + +  ++V  G + D Y  N ++ M+ K    G  I    
Sbjct: 105 HNVFPSVLKSCTMMMDLRFGESVHGFIVRLGMDCDLYTGNALMNMYAKLLGMGSKISVGN 164

Query: 194 LFDKMPER---------------------------------NAVSWNTIISGYVDSGNYV 253
           +FD+MP+R                                 + VS+NTII+GY  SG Y 
Sbjct: 165 VFDEMPQRTSNSGDEDVKAETCIMPFGIDSVRRVFEVMPRKDVVSYNTIIAGYAQSGMYE 224

Query: 254 EAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALI 313
           +A R+   M          T ++++   +    +  G+++H   I+ G+  D+++  +L+
Sbjct: 225 DALRMVREMGTTDLKPDSFTLSSVLPIFSEYVDVIKGKEIHGYVIRKGIDSDVYIGSSLV 284

Query: 314 DMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHF 373
           DMY+K   +ED+  VF  +  +  + WNS++AGY  +G   EAL L+ +M  + VK    
Sbjct: 285 DMYAKSARIEDSERVFSRLYCRDGISWNSLVAGYVQNGRYNEALRLFRQMVTAKVKPGAV 344

Query: 374 TFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRM 433
            FS +I  C+ LA++   KQ H  ++R GFG ++   +ALVD YSK G +  AR +FDRM
Sbjct: 345 AFSSVIPACAHLATLHLGKQLHGYVLRGGFGSNIFIASALVDMYSKCGNIKAARKIFDRM 404

Query: 434 SFRNVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGW 493
           +  + +SW A+I G+  HG G EA+ +FE+M  +G+ PN V ++AVL+ACS  GL +  W
Sbjct: 405 NVLDEVSWTAIIMGHALHGHGHEAVSLFEEMKRQGVKPNQVAFVAVLTACSHVGLVDEAW 464

Query: 494 EIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRV 553
             F SMT+ + +     HYA + +LLGR G L+EAY  I K   +PT ++W+ LL +C V
Sbjct: 465 GYFNSMTKVYGLNQELEHYAAVADLLGRAGKLEEAYNFISKMCVEPTGSVWSTLLSSCSV 524

Query: 554 DGNLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACS 613
             NLEL +  AEK++ ++ E +  Y+++ N+Y S+G+ KE A +   +++KGLR  PACS
Sbjct: 525 HKNLELAEKVAEKIFTVDSENMGAYVLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPACS 584

Query: 614 WIEVNNQPHAFRSGDNHHAQIEKVVGKVDELMLKISKLGYVPEQNFMLPDVD-EHEEKIQ 673
           WIE+ N+ H F SGD  H  ++K+   +  +M ++ K GYV + + +L DVD EH+ ++ 
Sbjct: 585 WIEMKNKTHGFVSGDRSHPSMDKINEFLKAVMEQMEKEGYVADTSGVLHDVDEEHKRELL 644

Query: 674 MYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHF 708
             HSE+LA+A+G++NT   T +++ ++ RIC DCH  IK I+ IT+REI+VRD SRFHHF
Sbjct: 645 FGHSERLAVAFGIINTEPGTTIRVTKNIRICTDCHVAIKFISKITEREIIVRDNSRFHHF 704

BLAST of HG10022069 vs. ExPASy TrEMBL
Match: A0A1S3C9W7 (pentatricopeptide repeat-containing protein At5g50390, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103498490 PE=3 SV=1)

HSP 1 Score: 1340.1 bits (3467), Expect = 0.0e+00
Identity = 652/708 (92.09%), Postives = 675/708 (95.34%), Query Frame = 0

Query: 1   MTMELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLLSNRRKCRNSLCWIK 60
           M MELPLSRYQNYVYDRLQC     ST YFSLR+SDS LF K S LSNRRKCRNS CW+K
Sbjct: 1   MNMELPLSRYQNYVYDRLQC----YSTPYFSLRYSDSHLFMKTSFLSNRRKCRNSFCWVK 60

Query: 61  CSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRIRKSSVGICSQIEKLVLCKKYRDALEM 120
           CSSFEQGLRPRPQPKPSK+D  VRKE PLKET +RKSSVGICSQIEKLVLCK+YRDALEM
Sbjct: 61  CSSFEQGLRPRPQPKPSKLDVGVRKEAPLKETPVRKSSVGICSQIEKLVLCKQYRDALEM 120

Query: 121 FEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHV 180
           FEIFELE GF VGNST+DALINACIGLKSIRGVKRL NYMVDNGFEPDQYMRNR+LLMHV
Sbjct: 121 FEIFELEDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHV 180

Query: 181 KCGMMIDACRLFDKMPERNAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFAT 240
           KCGMMIDACRLFD+MPERNAVSW+TIISGYVDSGNYVEAFRLFILMWEE Y CGPRT AT
Sbjct: 181 KCGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLAT 240

Query: 241 MIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKT 300
           MIRASAGLE+IF GRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKT
Sbjct: 241 MIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKT 300

Query: 301 IVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHA 360
           IVGWNSIIAGYALHGYSEEALDLY+EM  SGVKMDHFTFSIIIRICSRLASVARAKQAHA
Sbjct: 301 IVGWNSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHA 360

Query: 361 SLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYGNHGRGEE 420
           SLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMS RNVISWNALIAGYGNHGRGEE
Sbjct: 361 SLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEE 420

Query: 421 AIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMI 480
           AI+MFEKML EGMMPNHVT+LAVLSACSISGLFERGWEIFQSMTRDHK++PRAMH+ACMI
Sbjct: 421 AIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMI 480

Query: 481 ELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLS 540
           ELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRV GNLELGKFAAEKLYGMEPEKLS
Sbjct: 481 ELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHGNLELGKFAAEKLYGMEPEKLS 540

Query: 541 NYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEK 600
           NYIVLLNIYN+SGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAF SGD HH Q+EK
Sbjct: 541 NYIVLLNIYNTSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEK 600

Query: 601 VVGKVDELMLKISKLGYVP-EQNFMLPDVDEHEEKIQMYHSEKLAIAYGVLNTLEQTPLQ 660
           VVGKVDELMLKISKLGYVP EQNFMLPDVDEHEEKI+MYHSEKLAIAYG+LNTLE+TPLQ
Sbjct: 601 VVGKVDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYHSEKLAIAYGLLNTLERTPLQ 660

Query: 661 IVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW 708
           IVQSHRIC DCHSVIKLIAMITKREIV+RDASRFHHFRDG+CSCGDYW
Sbjct: 661 IVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW 704

BLAST of HG10022069 vs. ExPASy TrEMBL
Match: A0A0A0KXD9 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G636580 PE=3 SV=1)

HSP 1 Score: 1336.6 bits (3458), Expect = 0.0e+00
Identity = 653/710 (91.97%), Postives = 675/710 (95.07%), Query Frame = 0

Query: 1   MTMELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLLSNRRKCRNSLCWIK 60
           M MELPLSRYQNYVYDRLQCN    STS+FSLR+SDS+LF K S LSN RK RNS CWIK
Sbjct: 1   MNMELPLSRYQNYVYDRLQCN----STSFFSLRYSDSDLFTKTSFLSNPRKYRNSFCWIK 60

Query: 61  CSSFEQGL--RPRPQPKPSKVDPDVRKETPLKETRIRKSSVGICSQIEKLVLCKKYRDAL 120
           CSSFEQGL  RPRPQPKPSK+D   RKETPLKET ++KSSVGICSQIEKLVLCKKYRDAL
Sbjct: 61  CSSFEQGLRPRPRPQPKPSKLDVGDRKETPLKETHVKKSSVGICSQIEKLVLCKKYRDAL 120

Query: 121 EMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLM 180
           EMFEIFELE GF VG ST+DALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNR+LLM
Sbjct: 121 EMFEIFELEDGFHVGYSTYDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRVLLM 180

Query: 181 HVKCGMMIDACRLFDKMPERNAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTF 240
           HVKCGMMIDACRLFD+MP RNAVSW TIISGYVDSGNYVEAFRLFILM EE+YDCGPRTF
Sbjct: 181 HVKCGMMIDACRLFDEMPARNAVSWGTIISGYVDSGNYVEAFRLFILMREEFYDCGPRTF 240

Query: 241 ATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPD 300
           ATMIRASAGLE+IFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPD
Sbjct: 241 ATMIRASAGLEIIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPD 300

Query: 301 KTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQA 360
           KTIVGWNSIIAGYALHGYSEEALDLY+EMRDSGVKMDHFTFSIIIRICSRLASVARAKQ 
Sbjct: 301 KTIVGWNSIIAGYALHGYSEEALDLYHEMRDSGVKMDHFTFSIIIRICSRLASVARAKQV 360

Query: 361 HASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYGNHGRG 420
           HASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMS RN+ISWNALIAGYGNHG G
Sbjct: 361 HASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNIISWNALIAGYGNHGHG 420

Query: 421 EEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYAC 480
           EEAI+MFEKML EGMMPNHVT+LAVLSACSISGLFERGWEIFQSMTRDHK+KPRAMH+AC
Sbjct: 421 EEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVKPRAMHFAC 480

Query: 481 MIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEK 540
           MIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRV GNLELGKFAAEKLYGMEPEK
Sbjct: 481 MIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHGNLELGKFAAEKLYGMEPEK 540

Query: 541 LSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQI 600
           LSNYIVLLNIYNSSGKLKEAADV QTLKRKGLRMLPACSWIEVNNQPHAF SGD HH QI
Sbjct: 541 LSNYIVLLNIYNSSGKLKEAADVFQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQI 600

Query: 601 EKVVGKVDELMLKISKLGYVP-EQNFMLPDVDEHEEKIQMYHSEKLAIAYGVLNTLEQTP 660
           EKVVGKVDELML ISKLGYVP EQNFMLPDVDE+EEKI+MYHSEKLAIAYG+LNTLE+TP
Sbjct: 601 EKVVGKVDELMLNISKLGYVPEEQNFMLPDVDENEEKIRMYHSEKLAIAYGLLNTLEKTP 660

Query: 661 LQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW 708
           LQIVQSHRIC DCHSVIKLIAMITKREIV+RDASRFHHFRDGSCSCGDYW
Sbjct: 661 LQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGSCSCGDYW 706

BLAST of HG10022069 vs. ExPASy TrEMBL
Match: A0A6J1BWH3 (pentatricopeptide repeat-containing protein At5g50390, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111006324 PE=3 SV=1)

HSP 1 Score: 1301.2 bits (3366), Expect = 0.0e+00
Identity = 627/714 (87.82%), Postives = 670/714 (93.84%), Query Frame = 0

Query: 1   MTMELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLL------SNRRKCRN 60
           MTME+PL RYQNYVYDRLQC+STSSS+SY  +RF+DS+LFRKRSLL      SNRRK RN
Sbjct: 1   MTMEVPLPRYQNYVYDRLQCSSTSSSSSYLPVRFTDSKLFRKRSLLSEYTLWSNRRKLRN 60

Query: 61  SLCWIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKE-TRIRKSSVGICSQIEKLVLCKK 120
           S CWIKCSS EQGLRPRP+P+PSK+D DVRK T   E TRIRKS VGICSQIEKLVLCKK
Sbjct: 61  SFCWIKCSSLEQGLRPRPEPRPSKIDHDVRKGTSSNETTRIRKSGVGICSQIEKLVLCKK 120

Query: 121 YRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRN 180
           YRDALEMFEIFELEGG+ +GNST+DALINACIGLKSIRGVKRLCNYM+DNGFEPDQYM+N
Sbjct: 121 YRDALEMFEIFELEGGYDIGNSTYDALINACIGLKSIRGVKRLCNYMIDNGFEPDQYMKN 180

Query: 181 RILLMHVKCGMMIDACRLFDKMPERNAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDC 240
           RILLMHVKCGMMIDACRLFD+MPERNAVSW+TIISGYVDSGNY+EAFRLFI+MWEE  D 
Sbjct: 181 RILLMHVKCGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYIEAFRLFIMMWEECSDS 240

Query: 241 GPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVF 300
           GPRTFA MIRASAGLE+IFPGRQLHSCAIKAG+GQDIFVSCALIDMYSKCGSLEDAHCVF
Sbjct: 241 GPRTFAIMIRASAGLELIFPGRQLHSCAIKAGVGQDIFVSCALIDMYSKCGSLEDAHCVF 300

Query: 301 DEMPDKTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVA 360
           DEMPDKTIVGWNSIIAGYALHGYSEEALDL YEMRDSG+KMDHFTFSIIIRICSRLASVA
Sbjct: 301 DEMPDKTIVGWNSIIAGYALHGYSEEALDLCYEMRDSGIKMDHFTFSIIIRICSRLASVA 360

Query: 361 RAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYG 420
           RAKQ HA LVRNGFGLDVVANTALVDFYSKWGK+DDARH+FDRMS +N+ISWNALIAGYG
Sbjct: 361 RAKQVHAGLVRNGFGLDVVANTALVDFYSKWGKIDDARHIFDRMSHKNIISWNALIAGYG 420

Query: 421 NHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRA 480
           NHGRGEEAI+MFE+ML EGM PNHVT+LAVLSACSISGLFERGWEIFQS+T DHKIKPRA
Sbjct: 421 NHGRGEEAIQMFERMLREGMTPNHVTFLAVLSACSISGLFERGWEIFQSITTDHKIKPRA 480

Query: 481 MHYACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYG 540
           MH+ACMIELLGREGLLDEAYALIR APF+PTANMWAALLRACRV  NLELGK AAE LYG
Sbjct: 481 MHFACMIELLGREGLLDEAYALIRNAPFKPTANMWAALLRACRVHENLELGKLAAENLYG 540

Query: 541 MEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDN 600
           MEP+KLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRM+PACSWIEV NQPH+F SGD 
Sbjct: 541 MEPDKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMVPACSWIEVKNQPHSFLSGDK 600

Query: 601 HHAQIEKVVGKVDELMLKISKLGYVPEQNFMLPDVDEHEEKIQMYHSEKLAIAYGVLNTL 660
           HHA+IEKVV KVDE+MLKISKLGYV EQNF+LPDVDE EEKI MYHSEKLAIAYG+L+TL
Sbjct: 601 HHAEIEKVVEKVDEIMLKISKLGYVAEQNFLLPDVDEKEEKIHMYHSEKLAIAYGLLSTL 660

Query: 661 EQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW 708
           ++TPLQIVQSHRICGDCHS IKLIA+IT+REIVVRDASRFHHFRDGSCSCGDYW
Sbjct: 661 KKTPLQIVQSHRICGDCHSTIKLIALITRREIVVRDASRFHHFRDGSCSCGDYW 714

BLAST of HG10022069 vs. ExPASy TrEMBL
Match: A0A5A7T8C6 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold892G00190 PE=3 SV=1)

HSP 1 Score: 1286.9 bits (3329), Expect = 0.0e+00
Identity = 622/667 (93.25%), Postives = 643/667 (96.40%), Query Frame = 0

Query: 42  KRSLLSNRRKCRNSLCWIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRIRKSSVGI 101
           K S LSNRRKCRNS CW+KCSSFEQGLRPRPQPKPSK+D  VRKE PLKET +RKSSVGI
Sbjct: 2   KTSFLSNRRKCRNSFCWVKCSSFEQGLRPRPQPKPSKLDVGVRKEAPLKETPVRKSSVGI 61

Query: 102 CSQIEKLVLCKKYRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMV 161
           CSQIEKLVLCK+YRDALEMFEIFELE GF VGNST+DALINACIGLKSIRGVKRL NYMV
Sbjct: 62  CSQIEKLVLCKQYRDALEMFEIFELEDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMV 121

Query: 162 DNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPERNAVSWNTIISGYVDSGNYVEAFR 221
           DNGFEPDQYMRNR+LLMHVKCGMMIDACRLFD+MPERNAVSW+TIISGYVDSGNYVEAFR
Sbjct: 122 DNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYVEAFR 181

Query: 222 LFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYS 281
           LFILMWEE Y CGPRT ATMIRASAGLE+IF GRQLHSCAIKAGLGQDIFVSCALIDMYS
Sbjct: 182 LFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYS 241

Query: 282 KCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSI 341
           KCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLY+EM  SGVKMDHFTFSI
Sbjct: 242 KCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSI 301

Query: 342 IIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRN 401
           IIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMS RN
Sbjct: 302 IIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRN 361

Query: 402 VISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQ 461
           VISWNALIAGYGNHGRGEEAI+MFEKML EGMMPNHVT+LAVLSACSISGLFERGWEIFQ
Sbjct: 362 VISWNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQ 421

Query: 462 SMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVDGNL 521
           SMTRDHK++PRAMH+ACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRV GNL
Sbjct: 422 SMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHGNL 481

Query: 522 ELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEV 581
           ELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEV
Sbjct: 482 ELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEV 541

Query: 582 NNQPHAFRSGDNHHAQIEKVVGKVDELMLKISKLGYVP-EQNFMLPDVDEHEEKIQMYHS 641
           NNQPHAF SGD HH Q+EKVVGKVDELMLKISKLGYVP EQNFMLPDVDEHEEKI+MYHS
Sbjct: 542 NNQPHAFLSGDKHHVQLEKVVGKVDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYHS 601

Query: 642 EKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGS 701
           EKLAIAYG+LNTLE+TPLQIVQSHRIC DCHSVIKLIAMITKREIV+RDASRFHHFRDG+
Sbjct: 602 EKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGN 661

Query: 702 CSCGDYW 708
           CSCGDYW
Sbjct: 662 CSCGDYW 668

BLAST of HG10022069 vs. ExPASy TrEMBL
Match: A0A6J1JGW0 (pentatricopeptide repeat-containing protein At5g50390, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111486894 PE=3 SV=1)

HSP 1 Score: 1248.4 bits (3229), Expect = 0.0e+00
Identity = 610/711 (85.79%), Postives = 651/711 (91.56%), Query Frame = 0

Query: 3   MELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLL------SNRRKCRNSL 62
           ME+PL  YQNYV+D L+  S SSSTSYFS  FS SELFR RSLL      SNRRK RNS 
Sbjct: 1   MEVPL--YQNYVHDHLRRTSLSSSTSYFSHHFSGSELFRDRSLLSAYSLWSNRRKLRNSF 60

Query: 63  CWIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRIRKSSVGICSQIEKLVLCKKYRD 122
           CW+KCSS EQGLRPR +PKPSKVD DVRK TP KETRI KSSV IC  IEKLVLC K+RD
Sbjct: 61  CWVKCSSLEQGLRPRLKPKPSKVDRDVRKGTPSKETRITKSSVRICCHIEKLVLCNKFRD 120

Query: 123 ALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRIL 182
           ALEMFEI ELEGG+ VGNSTFDALI ACIGLKSIRG KRLC YM+DNG EPDQY+ NRIL
Sbjct: 121 ALEMFEILELEGGYDVGNSTFDALIIACIGLKSIRGAKRLCAYMIDNGIEPDQYIMNRIL 180

Query: 183 LMHVKCGMMIDACRLFDKMPERNAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPR 242
           LMHV+CGMMIDA +LFD+MPERNAVSWNTIISGYVDSGNY EAFRLFI+MWEEY  C PR
Sbjct: 181 LMHVRCGMMIDASKLFDEMPERNAVSWNTIISGYVDSGNYKEAFRLFIMMWEEYPGCSPR 240

Query: 243 TFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEM 302
           TFAT+IRASAGLE+IFPG+QLHSCA+KAG+GQDIFVSCALIDMYSKCG LEDAHCVFDEM
Sbjct: 241 TFATVIRASAGLELIFPGKQLHSCAVKAGVGQDIFVSCALIDMYSKCGGLEDAHCVFDEM 300

Query: 303 PDKTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAK 362
           PDKTIVGWNSIIAGYALHG+SEEAL+LY++MRDSGVK+DHFTFSIIIRICSRLASV RAK
Sbjct: 301 PDKTIVGWNSIIAGYALHGHSEEALNLYFQMRDSGVKIDHFTFSIIIRICSRLASVTRAK 360

Query: 363 QAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYGNHG 422
           QAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARH+FDRMS +N+ISWNALIAGYGNHG
Sbjct: 361 QAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHIFDRMSCKNLISWNALIAGYGNHG 420

Query: 423 RGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHY 482
           RGEEAIE+FE+ML EGM+PNHVT+LAVLSACSISGLFERGWEIFQSMTRDHKIK RAMHY
Sbjct: 421 RGEEAIEIFERMLREGMVPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKLRAMHY 480

Query: 483 ACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEP 542
            CMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRV  NLELGK+AAEKLYGMEP
Sbjct: 481 TCMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHENLELGKYAAEKLYGMEP 540

Query: 543 EKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHA 602
           EKL NYIVLLNIY SSGKLKEAADVV+TLKRKGL MLPACSWIEV +QPHAF SGD HH 
Sbjct: 541 EKLRNYIVLLNIYKSSGKLKEAADVVRTLKRKGLSMLPACSWIEVKHQPHAFLSGDKHHP 600

Query: 603 QIEKVVGKVDELMLKISKLGYVPEQNFMLPDVDEHEEKIQMYHSEKLAIAYGVLNTLEQT 662
           +IEKVV KVDELML+ISKLGYVPEQN +LPDVD HEEKIQ+YHSEKLAIAYG++NTL+QT
Sbjct: 601 EIEKVVEKVDELMLEISKLGYVPEQNILLPDVD-HEEKIQIYHSEKLAIAYGLINTLKQT 660

Query: 663 PLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW 708
           PLQIVQ HR+CGDCHSVIKLIAMITKREIVVRDASRFHHFRDG CSCGDYW
Sbjct: 661 PLQIVQGHRVCGDCHSVIKLIAMITKREIVVRDASRFHHFRDGRCSCGDYW 708

BLAST of HG10022069 vs. TAIR 10
Match: AT5G50390.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 883.6 bits (2282), Expect = 1.0e-256
Identity = 427/712 (59.97%), Postives = 542/712 (76.12%), Query Frame = 0

Query: 3   MELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLLSNRRKCRNSLCWIKCS 62
           ME+PLSRYQ+   D ++ +S++     F  +FS              R+ +N    + CS
Sbjct: 1   MEIPLSRYQSIRLDEIRDSSSNPKVLTFPRKFS-----------LRGRRWKNPFGRLSCS 60

Query: 63  SFEQGLRPRP--QPKPSKVDPDVRKETPLKETRIRKSSVGICSQIEKLVLCKKYRDALEM 122
           S  QGL+P+P  +P+P +++    K+  L +T+I KS V ICSQIEKLVLC ++R+A E+
Sbjct: 61  SVVQGLKPKPKLKPEPIRIEVKESKDQILDDTQISKSGVTICSQIEKLVLCNRFREAFEL 120

Query: 123 FEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHV 182
           FEI E+   F+VG ST+DAL+ ACI LKSIR VKR+  +M+ NGFEP+QYM NRILLMHV
Sbjct: 121 FEILEIRCSFKVGVSTYDALVEACIRLKSIRCVKRVYGFMMSNGFEPEQYMMNRILLMHV 180

Query: 183 KCGMMIDACRLFDKMPERNAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFAT 242
           KCGM+IDA RLFD++PERN  S+ +IISG+V+ GNYVEAF LF +MWEE  DC   TFA 
Sbjct: 181 KCGMIIDARRLFDEIPERNLYSYYSIISGFVNFGNYVEAFELFKMMWEELSDCETHTFAV 240

Query: 243 MIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKT 302
           M+RASAGL  I+ G+QLH CA+K G+  + FVSC LIDMYSKCG +EDA C F+ MP+KT
Sbjct: 241 MLRASAGLGSIYVGKQLHVCALKLGVVDNTFVSCGLIDMYSKCGDIEDARCAFECMPEKT 300

Query: 303 IVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHA 362
            V WN++IAGYALHGYSEEAL L Y+MRDSGV +D FT SI+IRI ++LA +   KQAHA
Sbjct: 301 TVAWNNVIAGYALHGYSEEALCLLYDMRDSGVSIDQFTLSIMIRISTKLAKLELTKQAHA 360

Query: 363 SLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYGNHGRGEE 422
           SL+RNGF  ++VANTALVDFYSKWG+VD AR+VFD++  +N+ISWNAL+ GY NHGRG +
Sbjct: 361 SLIRNGFESEIVANTALVDFYSKWGRVDTARYVFDKLPRKNIISWNALMGGYANHGRGTD 420

Query: 423 AIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMI 482
           A+++FEKM+   + PNHVT+LAVLSAC+ SGL E+GWEIF SM+  H IKPRAMHYACMI
Sbjct: 421 AVKLFEKMIAANVAPNHVTFLAVLSACAYSGLSEQGWEIFLSMSEVHGIKPRAMHYACMI 480

Query: 483 ELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLS 542
           ELLGR+GLLDEA A IR+AP + T NMWAALL ACR+  NLELG+  AEKLYGM PEKL 
Sbjct: 481 ELLGRDGLLDEAIAFIRRAPLKTTVNMWAALLNACRMQENLELGRVVAEKLYGMGPEKLG 540

Query: 543 NYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIE- 602
           NY+V+ N+YNS GK  EAA V++TL+ KGL M+PAC+W+EV +Q H+F SGD   +  E 
Sbjct: 541 NYVVMYNMYNSMGKTAEAAGVLETLESKGLSMMPACTWVEVGDQTHSFLSGDRFDSYNET 600

Query: 603 ---KVVGKVDELMLKISKLGYVPEQNFMLPDVDE-HEEKIQMYHSEKLAIAYGVLNTLEQ 662
              ++  KVDELM +IS+ GY  E+  +LPDVDE  EE++  YHSEKLAIAYG++NT E 
Sbjct: 601 VKRQIYQKVDELMEEISEYGYSEEEQHLLPDVDEKEEERVGRYHSEKLAIAYGLVNTPEW 660

Query: 663 TPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW 708
            PLQI Q+HRIC +CH V++ I+++T RE+VVRDASRFHHF++G CSCG YW
Sbjct: 661 NPLQITQNHRICKNCHKVVEFISLVTGREMVVRDASRFHHFKEGKCSCGGYW 701

BLAST of HG10022069 vs. TAIR 10
Match: AT2G03880.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 477.2 bits (1227), Expect = 2.2e-134
Identity = 232/580 (40.00%), Postives = 359/580 (61.90%), Query Frame = 0

Query: 129 GFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDA 188
           G    ++T+  LI  CI  +++     +C ++  NG  P  ++ N ++ M+VK  ++ DA
Sbjct: 56  GLWADSATYSELIKCCISNRAVHEGNLICRHLYFNGHRPMMFLVNVLINMYVKFNLLNDA 115

Query: 189 CRLFDKMPERNAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGL 248
            +LFD+MP+RN +SW T+IS Y     + +A  L +LM  +       T+++++R+  G+
Sbjct: 116 HQLFDQMPQRNVISWTTMISAYSKCKIHQKALELLVLMLRDNVRPNVYTYSSVLRSCNGM 175

Query: 249 EVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSII 308
             +   R LH   IK GL  D+FV  ALID+++K G  EDA  VFDEM     + WNSII
Sbjct: 176 SDV---RMLHCGIIKEGLESDVFVRSALIDVFAKLGEPEDALSVFDEMVTGDAIVWNSII 235

Query: 309 AGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFG 368
            G+A +  S+ AL+L+  M+ +G   +  T + ++R C+ LA +    QAH  +V+  + 
Sbjct: 236 GGFAQNSRSDVALELFKRMKRAGFIAEQATLTSVLRACTGLALLELGMQAHVHIVK--YD 295

Query: 369 LDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYGNHGRGEEAIEMFEKM 428
            D++ N ALVD Y K G ++DA  VF++M  R+VI+W+ +I+G   +G  +EA+++FE+M
Sbjct: 296 QDLILNNALVDMYCKCGSLEDALRVFNQMKERDVITWSTMISGLAQNGYSQEALKLFERM 355

Query: 429 LGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGL 488
              G  PN++T + VL ACS +GL E GW  F+SM + + I P   HY CMI+LLG+ G 
Sbjct: 356 KSSGTKPNYITIVGVLFACSHAGLLEDGWYYFRSMKKLYGIDPVREHYGCMIDLLGKAGK 415

Query: 489 LDEAYALIRKAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSNYIVLLNI 548
           LD+A  L+ +   +P A  W  LL ACRV  N+ L ++AA+K+  ++PE    Y +L NI
Sbjct: 416 LDDAVKLLNEMECEPDAVTWRTLLGACRVQRNMVLAEYAAKKVIALDPEDAGTYTLLSNI 475

Query: 549 YNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEKVVGKVDEL 608
           Y +S K     ++   ++ +G++  P CSWIEVN Q HAF  GDN H QI +V  K+++L
Sbjct: 476 YANSQKWDSVEEIRTRMRDRGIKKEPGCSWIEVNKQIHAFIIGDNSHPQIVEVSKKLNQL 535

Query: 609 MLKISKLGYVPEQNFMLPDVD-EHEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRIC 668
           + +++ +GYVPE NF+L D++ E  E    +HSEKLA+A+G++    +  ++I ++ RIC
Sbjct: 536 IHRLTGIGYVPETNFVLQDLEGEQMEDSLRHHSEKLALAFGLMTLPIEKVIRIRKNLRIC 595

Query: 669 GDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW 708
           GDCH   KL + +  R IV+RD  R+HHF+DG CSCGDYW
Sbjct: 596 GDCHVFCKLASKLEIRSIVIRDPIRYHHFQDGKCSCGDYW 630

BLAST of HG10022069 vs. TAIR 10
Match: AT3G02010.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 464.9 bits (1195), Expect = 1.1e-130
Identity = 233/597 (39.03%), Postives = 363/597 (60.80%), Query Frame = 0

Query: 114 YRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRN 173
           Y +++ +F +   + G Q  + TF  ++ A +GL      ++L    V  GF  D  + N
Sbjct: 231 YTESIHLF-LKMRQSGHQPSDFTFSGVLKAVVGLHDFALGQQLHALSVTTGFSRDASVGN 290

Query: 174 RILLMHVKCGMMIDACRLFDKMPERNAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDC 233
           +IL  + K   +++   LFD+MPE + VS+N +IS Y  +  Y  +   F  M    +D 
Sbjct: 291 QILDFYSKHDRVLETRMLFDEMPELDFVSYNVVISSYSQADQYEASLHFFREMQCMGFDR 350

Query: 234 GPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVF 293
               FATM+  +A L  +  GRQLH  A+ A     + V  +L+DMY+KC   E+A  +F
Sbjct: 351 RNFPFATMLSIAANLSSLQMGRQLHCQALLATADSILHVGNSLVDMYAKCEMFEEAELIF 410

Query: 294 DEMPDKTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVA 353
             +P +T V W ++I+GY   G     L L+ +MR S ++ D  TF+ +++  +  AS+ 
Sbjct: 411 KSLPQRTTVSWTALISGYVQKGLHGAGLKLFTKMRGSNLRADQSTFATVLKASASFASLL 470

Query: 354 RAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYG 413
             KQ HA ++R+G   +V + + LVD Y+K G + DA  VF+ M  RN +SWNALI+ + 
Sbjct: 471 LGKQLHAFIIRSGNLENVFSGSGLVDMYAKCGSIKDAVQVFEEMPDRNAVSWNALISAHA 530

Query: 414 NHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRA 473
           ++G GE AI  F KM+  G+ P+ V+ L VL+ACS  G  E+G E FQ+M+  + I P+ 
Sbjct: 531 DNGDGEAAIGAFAKMIESGLQPDSVSILGVLTACSHCGFVEQGTEYFQAMSPIYGITPKK 590

Query: 474 MHYACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYG 533
            HYACM++LLGR G   EA  L+ + PF+P   MW+++L ACR+  N  L + AAEKL+ 
Sbjct: 591 KHYACMLDLLGRNGRFAEAEKLMDEMPFEPDEIMWSSVLNACRIHKNQSLAERAAEKLFS 650

Query: 534 MEP-EKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGD 593
           ME     + Y+ + NIY ++G+ ++  DV + ++ +G++ +PA SW+EVN++ H F S D
Sbjct: 651 MEKLRDAAAYVSMSNIYAAAGEWEKVRDVKKAMRERGIKKVPAYSWVEVNHKIHVFSSND 710

Query: 594 NHHAQIEKVVGKVDELMLKISKLGYVPEQNFMLPDVDEHEEKIQ--MYHSEKLAIAYGVL 653
             H   +++V K++EL  +I + GY P+ + ++ DVDE + KI+   YHSE+LA+A+ ++
Sbjct: 711 QTHPNGDEIVRKINELTAEIEREGYKPDTSSVVQDVDE-QMKIESLKYHSERLAVAFALI 770

Query: 654 NTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW 708
           +T E  P+ ++++ R C DCH+ IKLI+ I KREI VRD SRFHHF +G CSCGDYW
Sbjct: 771 STPEGCPIVVMKNLRACRDCHAAIKLISKIVKREITVRDTSRFHHFSEGVCSCGDYW 825

BLAST of HG10022069 vs. TAIR 10
Match: AT3G24000.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 464.2 bits (1193), Expect = 1.9e-130
Identity = 220/578 (38.06%), Postives = 356/578 (61.59%), Query Frame = 0

Query: 125 ELEGGFQVGNSTF-DALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCG 184
           +LEG +   +  F + L+  C   K +   + +  +++ + F  D  M N +L M+ KCG
Sbjct: 50  DLEGSYIPADRRFYNTLLKKCTVFKLLIQGRIVHAHILQSIFRHDIVMGNTLLNMYAKCG 109

Query: 185 MMIDACRLFDKMPERNAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATMIR 244
            + +A ++F+KMP+R+ V+W T+ISGY       +A   F  M    Y     T +++I+
Sbjct: 110 SLEEARKVFEKMPQRDFVTWTTLISGYSQHDRPCDALLFFNQMLRFGYSPNEFTLSSVIK 169

Query: 245 ASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVG 304
           A+A       G QLH   +K G   ++ V  AL+D+Y++ G ++DA  VFD +  +  V 
Sbjct: 170 AAAAERRGCCGHQLHGFCVKCGFDSNVHVGSALLDLYTRYGLMDDAQLVFDALESRNDVS 229

Query: 305 WNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLV 364
           WN++IAG+A    +E+AL+L+  M   G +  HF+++ +   CS    + + K  HA ++
Sbjct: 230 WNALIAGHARRSGTEKALELFQGMLRDGFRPSHFSYASLFGACSSTGFLEQGKWVHAYMI 289

Query: 365 RNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYGNHGRGEEAIE 424
           ++G  L   A   L+D Y+K G + DAR +FDR++ R+V+SWN+L+  Y  HG G+EA+ 
Sbjct: 290 KSGEKLVAFAGNTLLDMYAKSGSIHDARKIFDRLAKRDVVSWNSLLTAYAQHGFGKEAVW 349

Query: 425 MFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELL 484
            FE+M   G+ PN +++L+VL+ACS SGL + GW  ++ M +D  I P A HY  +++LL
Sbjct: 350 WFEEMRRVGIRPNEISFLSVLTACSHSGLLDEGWHYYELMKKD-GIVPEAWHYVTVVDLL 409

Query: 485 GREGLLDEAYALIRKAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSNYI 544
           GR G L+ A   I + P +PTA +W ALL ACR+  N ELG +AAE ++ ++P+    ++
Sbjct: 410 GRAGDLNRALRFIEEMPIEPTAAIWKALLNACRMHKNTELGAYAAEHVFELDPDDPGPHV 469

Query: 545 VLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEKVVG 604
           +L NIY S G+  +AA V + +K  G++  PACSW+E+ N  H F + D  H Q E++  
Sbjct: 470 ILYNIYASGGRWNDAARVRKKMKESGVKKEPACSWVEIENAIHMFVANDERHPQREEIAR 529

Query: 605 KVDELMLKISKLGYVPEQNFMLPDVDEHEEKIQM-YHSEKLAIAYGVLNTLEQTPLQIVQ 664
           K +E++ KI +LGYVP+ + ++  VD+ E ++ + YHSEK+A+A+ +LNT   + + I +
Sbjct: 530 KWEEVLAKIKELGYVPDTSHVIVHVDQQEREVNLQYHSEKIALAFALLNTPPGSTIHIKK 589

Query: 665 SHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGS 701
           + R+CGDCH+ IKL + +  REI+VRD +RFHHF+D S
Sbjct: 590 NIRVCGDCHTAIKLASKVVGREIIVRDTNRFHHFKDAS 626

BLAST of HG10022069 vs. TAIR 10
Match: AT3G23330.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 456.4 bits (1173), Expect = 4.0e-128
Identity = 225/611 (36.82%), Postives = 358/611 (58.59%), Query Frame = 0

Query: 134 NSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVK---CGMMIDACR 193
           ++ F +++ +C  +  +R  + +  ++V  G + D Y  N ++ M+ K    G  I    
Sbjct: 105 HNVFPSVLKSCTMMMDLRFGESVHGFIVRLGMDCDLYTGNALMNMYAKLLGMGSKISVGN 164

Query: 194 LFDKMPER---------------------------------NAVSWNTIISGYVDSGNYV 253
           +FD+MP+R                                 + VS+NTII+GY  SG Y 
Sbjct: 165 VFDEMPQRTSNSGDEDVKAETCIMPFGIDSVRRVFEVMPRKDVVSYNTIIAGYAQSGMYE 224

Query: 254 EAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALI 313
           +A R+   M          T ++++   +    +  G+++H   I+ G+  D+++  +L+
Sbjct: 225 DALRMVREMGTTDLKPDSFTLSSVLPIFSEYVDVIKGKEIHGYVIRKGIDSDVYIGSSLV 284

Query: 314 DMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHF 373
           DMY+K   +ED+  VF  +  +  + WNS++AGY  +G   EAL L+ +M  + VK    
Sbjct: 285 DMYAKSARIEDSERVFSRLYCRDGISWNSLVAGYVQNGRYNEALRLFRQMVTAKVKPGAV 344

Query: 374 TFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRM 433
            FS +I  C+ LA++   KQ H  ++R GFG ++   +ALVD YSK G +  AR +FDRM
Sbjct: 345 AFSSVIPACAHLATLHLGKQLHGYVLRGGFGSNIFIASALVDMYSKCGNIKAARKIFDRM 404

Query: 434 SFRNVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGW 493
           +  + +SW A+I G+  HG G EA+ +FE+M  +G+ PN V ++AVL+ACS  GL +  W
Sbjct: 405 NVLDEVSWTAIIMGHALHGHGHEAVSLFEEMKRQGVKPNQVAFVAVLTACSHVGLVDEAW 464

Query: 494 EIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRV 553
             F SMT+ + +     HYA + +LLGR G L+EAY  I K   +PT ++W+ LL +C V
Sbjct: 465 GYFNSMTKVYGLNQELEHYAAVADLLGRAGKLEEAYNFISKMCVEPTGSVWSTLLSSCSV 524

Query: 554 DGNLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACS 613
             NLEL +  AEK++ ++ E +  Y+++ N+Y S+G+ KE A +   +++KGLR  PACS
Sbjct: 525 HKNLELAEKVAEKIFTVDSENMGAYVLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPACS 584

Query: 614 WIEVNNQPHAFRSGDNHHAQIEKVVGKVDELMLKISKLGYVPEQNFMLPDVD-EHEEKIQ 673
           WIE+ N+ H F SGD  H  ++K+   +  +M ++ K GYV + + +L DVD EH+ ++ 
Sbjct: 585 WIEMKNKTHGFVSGDRSHPSMDKINEFLKAVMEQMEKEGYVADTSGVLHDVDEEHKRELL 644

Query: 674 MYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHF 708
             HSE+LA+A+G++NT   T +++ ++ RIC DCH  IK I+ IT+REI+VRD SRFHHF
Sbjct: 645 FGHSERLAVAFGIINTEPGTTIRVTKNIRICTDCHVAIKFISKITEREIIVRDNSRFHHF 704

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_008459324.10.0e+0092.09PREDICTED: pentatricopeptide repeat-containing protein At5g50390, chloroplastic ... [more]
XP_038890388.10.0e+0091.95pentatricopeptide repeat-containing protein At5g50390, chloroplastic isoform X1 ... [more]
XP_004148701.10.0e+0091.97pentatricopeptide repeat-containing protein At5g50390, chloroplastic isoform X1 ... [more]
XP_022133879.10.0e+0087.82pentatricopeptide repeat-containing protein At5g50390, chloroplastic [Momordica ... [more]
KAA0039490.10.0e+0093.25pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK15245... [more]
Match NameE-valueIdentityDescription
Q9FK331.4e-25559.97Pentatricopeptide repeat-containing protein At5g50390, chloroplastic OS=Arabidop... [more]
Q9LIQ79.7e-13538.63Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidop... [more]
Q9SI533.1e-13340.00Pentatricopeptide repeat-containing protein At2g03880, mitochondrial OS=Arabidop... [more]
Q9S7F41.6e-12939.03Putative pentatricopeptide repeat-containing protein At2g01510 OS=Arabidopsis th... [more]
Q9LW635.7e-12736.82Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis th... [more]
Match NameE-valueIdentityDescription
A0A1S3C9W70.0e+0092.09pentatricopeptide repeat-containing protein At5g50390, chloroplastic OS=Cucumis ... [more]
A0A0A0KXD90.0e+0091.97DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G6365... [more]
A0A6J1BWH30.0e+0087.82pentatricopeptide repeat-containing protein At5g50390, chloroplastic OS=Momordic... [more]
A0A5A7T8C60.0e+0093.25Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A6J1JGW00.0e+0085.79pentatricopeptide repeat-containing protein At5g50390, chloroplastic OS=Cucurbit... [more]
Match NameE-valueIdentityDescription
AT5G50390.11.0e-25659.97Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT2G03880.12.2e-13440.00Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G02010.11.1e-13039.03Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G24000.11.9e-13038.06Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G23330.14.0e-12836.82Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 360..429
e-value: 5.5E-10
score: 41.1
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 100..252
e-value: 3.5E-25
score: 91.0
coord: 253..359
e-value: 3.5E-23
score: 84.5
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 441..580
e-value: 2.9E-15
score: 58.0
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 542..570
e-value: 1.2
score: 9.6
coord: 275..299
e-value: 0.0012
score: 19.0
coord: 476..497
e-value: 0.88
score: 10.0
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 400..447
e-value: 1.7E-12
score: 47.4
coord: 199..244
e-value: 2.0E-7
score: 31.1
coord: 300..346
e-value: 2.5E-12
score: 46.8
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 275..298
e-value: 6.7E-4
score: 17.7
coord: 403..436
e-value: 2.3E-8
score: 31.7
coord: 304..335
e-value: 8.1E-7
score: 26.8
coord: 375..401
e-value: 2.8E-4
score: 18.8
coord: 438..471
e-value: 7.7E-4
score: 17.5
coord: 201..228
e-value: 2.5E-5
score: 22.1
coord: 136..168
e-value: 0.0018
score: 16.3
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 436..471
score: 8.670445
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 300..334
score: 10.884628
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 269..299
score: 8.769097
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 199..229
score: 9.569272
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 401..435
score: 13.109773
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 575..697
e-value: 8.3E-36
score: 122.7
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 67..87
NoneNo IPR availablePANTHERPTHR47924PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 3..701
NoneNo IPR availablePANTHERPTHR47924:SF36SUBFAMILY NOT NAMEDcoord: 3..701

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10022069.1HG10022069.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding