HG10013839 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10013839
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationChr02: 5251638 .. 5253341 (+)
RNA-Seq ExpressionHG10013839
SyntenyHG10013839
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGACTCTGCTCAATACAGTGTCACCAATTACAAACCCATCACCAGAAACCACAAGAAGAGGGTGTGGGTTCTTTTCCCATATCCCAAATCTCCACAAGCTCTCACTTAACAAGGGATTTTCTAAAGTTTTAGCATCAACCCAGATTACCATTTCTCCAAAGGACACCATTTTCACACTGCCCAATTGGAGGACCGGGAAAGTTGATCAAAAGAGTAGAGAACTTAGACTTAATGATGCTTTTTTTCATTTAGAATTCATGGTTGAGAAGGGCCAAAAGCCTGATGTATTTCAAGCAACTCAGTTATTGTATGATCTCTGCAAGGCATGTAAGTTGAGAAAAGCTATTAAGGTAATGGAGATGATGATTGGCTCTGGAATCATTCCAGATGCATCATCCTATACATTTTTAGTAAGTTCTTTGTGTAAAAAAGGGAATGTTGGTTATGCAATGCAACTAGTGGACAAAATGGAGGAATATGGTTATCCTACCAACACTGTTACTTATAATTCACTTGTGAGAGGGCTTTGTATGCATGGAAACTTGACTCAGAGCTTGCAACTTTTAGACAGATTAATCCAGAAGGGGCTGGTCCCAAATGCTTACACATACTCTTTTTTGCTTGAAGCTGCATACAAGGAAAGAGGAGCTGATGAAGCAATTGAGCTTTTGGATGAGATAATTGCAAAGGGTGGGAAACCTAATTTGGTTAGCTATAATGTTTTGTTGACTGGGTTGTGCAAAGAAGGTAGGACAGAGGATGCCATCCAGTTATTTAGGGAATTGCCTTCAAAGGGATTCAGTCCAAATGTTGTTAGTTACAATATCTTGCTAAGGAGTCTGTGCTATGAAGGGAGGTGGGAAGAGGCAAATGTACTTCTAGCTGAAATGGACGGTGACGATCGCTCCCCTTCGATCGTCACTTACAATATATTGATTGGTTCACTTACCCTCCATGGCAGAACAGAACATGCTCTTGAGGTTTTGGAAGAGATGATTAGAGCGCGATTCAAGCCGACAGCTTCTAGCTACAATCCGATAATTGCTCGTCTTTGCAAAGATAGGAAAGTAGATCTTGTTGTAAAGTGTCTAGACCAAATGATCTATAGGCATTGCAACCCGAATGAAGGAACATACAATGCCATTGCTACACTTTGTGAAGAGGGTATGGTTCAAGAGGCATTCTCCATTATACAGAGTTTAGGCAACAAGCAACATTCCTCTACTCAAGAATTCTATAAAATTGTTATTACCAGCTTGTGTCGAAAAGGAAACACATATCCAGCATTCCAGCTTCTCTATGAAATGACAAAGTATGGGTTTACGCCTGATTCTTTTACCTATTCGTCTTTGATCCGGGGGTTATGCATGGAAGGTATGCTGGATGAGGCAATTGAAATATTCAGTGTAATGGAGGAAAATAACTACAGGCCTGATACTGAGAATTACAATTCACTCATTCTTGGTTGCTGCAAATCTCGAAGAACCGATTTGGCCTTGGAGGTTTTCGAAATAATGGTTGATAAAGGTTATCTTCCCAATGAAACGACATACACCATTCTTGTGGAAGGGATCGTCCATGAAAAAGAGATAGATCTAGCAACCAAAGTACTGAGGGAGTTGCAACTAAGAGATGTTATAAGTCAAAGCACAGTGGAAAGACTTGTTATGCAGTATGACCTAAATGAATTACCATTGTGA

mRNA sequence

ATGGCGACTCTGCTCAATACAGTGTCACCAATTACAAACCCATCACCAGAAACCACAAGAAGAGGGTGTGGGTTCTTTTCCCATATCCCAAATCTCCACAAGCTCTCACTTAACAAGGGATTTTCTAAAGTTTTAGCATCAACCCAGATTACCATTTCTCCAAAGGACACCATTTTCACACTGCCCAATTGGAGGACCGGGAAAGTTGATCAAAAGAGTAGAGAACTTAGACTTAATGATGCTTTTTTTCATTTAGAATTCATGGTTGAGAAGGGCCAAAAGCCTGATGTATTTCAAGCAACTCAGTTATTGTATGATCTCTGCAAGGCATGTAAGTTGAGAAAAGCTATTAAGGTAATGGAGATGATGATTGGCTCTGGAATCATTCCAGATGCATCATCCTATACATTTTTAGTAAGTTCTTTGTGTAAAAAAGGGAATGTTGGTTATGCAATGCAACTAGTGGACAAAATGGAGGAATATGGTTATCCTACCAACACTGTTACTTATAATTCACTTGTGAGAGGGCTTTGTATGCATGGAAACTTGACTCAGAGCTTGCAACTTTTAGACAGATTAATCCAGAAGGGGCTGGTCCCAAATGCTTACACATACTCTTTTTTGCTTGAAGCTGCATACAAGGAAAGAGGAGCTGATGAAGCAATTGAGCTTTTGGATGAGATAATTGCAAAGGGTGGGAAACCTAATTTGGTTAGCTATAATGTTTTGTTGACTGGGTTGTGCAAAGAAGGTAGGACAGAGGATGCCATCCAGTTATTTAGGGAATTGCCTTCAAAGGGATTCAGTCCAAATGTTGTTAGTTACAATATCTTGCTAAGGAGTCTGTGCTATGAAGGGAGGTGGGAAGAGGCAAATGTACTTCTAGCTGAAATGGACGGTGACGATCGCTCCCCTTCGATCGTCACTTACAATATATTGATTGGTTCACTTACCCTCCATGGCAGAACAGAACATGCTCTTGAGGTTTTGGAAGAGATGATTAGAGCGCGATTCAAGCCGACAGCTTCTAGCTACAATCCGATAATTGCTCGTCTTTGCAAAGATAGGAAAGTAGATCTTGTTGTAAAGTGTCTAGACCAAATGATCTATAGGCATTGCAACCCGAATGAAGGAACATACAATGCCATTGCTACACTTTGTGAAGAGGGTATGGTTCAAGAGGCATTCTCCATTATACAGAGTTTAGGCAACAAGCAACATTCCTCTACTCAAGAATTCTATAAAATTGTTATTACCAGCTTGTGTCGAAAAGGAAACACATATCCAGCATTCCAGCTTCTCTATGAAATGACAAAGTATGGGTTTACGCCTGATTCTTTTACCTATTCGTCTTTGATCCGGGGGTTATGCATGGAAGGTATGCTGGATGAGGCAATTGAAATATTCAGTGTAATGGAGGAAAATAACTACAGGCCTGATACTGAGAATTACAATTCACTCATTCTTGGTTGCTGCAAATCTCGAAGAACCGATTTGGCCTTGGAGGTTTTCGAAATAATGGTTGATAAAGGTTATCTTCCCAATGAAACGACATACACCATTCTTGTGGAAGGGATCGTCCATGAAAAAGAGATAGATCTAGCAACCAAAGTACTGAGGGAGTTGCAACTAAGAGATGTTATAAGTCAAAGCACAGTGGAAAGACTTGTTATGCAGTATGACCTAAATGAATTACCATTGTGA

Coding sequence (CDS)

ATGGCGACTCTGCTCAATACAGTGTCACCAATTACAAACCCATCACCAGAAACCACAAGAAGAGGGTGTGGGTTCTTTTCCCATATCCCAAATCTCCACAAGCTCTCACTTAACAAGGGATTTTCTAAAGTTTTAGCATCAACCCAGATTACCATTTCTCCAAAGGACACCATTTTCACACTGCCCAATTGGAGGACCGGGAAAGTTGATCAAAAGAGTAGAGAACTTAGACTTAATGATGCTTTTTTTCATTTAGAATTCATGGTTGAGAAGGGCCAAAAGCCTGATGTATTTCAAGCAACTCAGTTATTGTATGATCTCTGCAAGGCATGTAAGTTGAGAAAAGCTATTAAGGTAATGGAGATGATGATTGGCTCTGGAATCATTCCAGATGCATCATCCTATACATTTTTAGTAAGTTCTTTGTGTAAAAAAGGGAATGTTGGTTATGCAATGCAACTAGTGGACAAAATGGAGGAATATGGTTATCCTACCAACACTGTTACTTATAATTCACTTGTGAGAGGGCTTTGTATGCATGGAAACTTGACTCAGAGCTTGCAACTTTTAGACAGATTAATCCAGAAGGGGCTGGTCCCAAATGCTTACACATACTCTTTTTTGCTTGAAGCTGCATACAAGGAAAGAGGAGCTGATGAAGCAATTGAGCTTTTGGATGAGATAATTGCAAAGGGTGGGAAACCTAATTTGGTTAGCTATAATGTTTTGTTGACTGGGTTGTGCAAAGAAGGTAGGACAGAGGATGCCATCCAGTTATTTAGGGAATTGCCTTCAAAGGGATTCAGTCCAAATGTTGTTAGTTACAATATCTTGCTAAGGAGTCTGTGCTATGAAGGGAGGTGGGAAGAGGCAAATGTACTTCTAGCTGAAATGGACGGTGACGATCGCTCCCCTTCGATCGTCACTTACAATATATTGATTGGTTCACTTACCCTCCATGGCAGAACAGAACATGCTCTTGAGGTTTTGGAAGAGATGATTAGAGCGCGATTCAAGCCGACAGCTTCTAGCTACAATCCGATAATTGCTCGTCTTTGCAAAGATAGGAAAGTAGATCTTGTTGTAAAGTGTCTAGACCAAATGATCTATAGGCATTGCAACCCGAATGAAGGAACATACAATGCCATTGCTACACTTTGTGAAGAGGGTATGGTTCAAGAGGCATTCTCCATTATACAGAGTTTAGGCAACAAGCAACATTCCTCTACTCAAGAATTCTATAAAATTGTTATTACCAGCTTGTGTCGAAAAGGAAACACATATCCAGCATTCCAGCTTCTCTATGAAATGACAAAGTATGGGTTTACGCCTGATTCTTTTACCTATTCGTCTTTGATCCGGGGGTTATGCATGGAAGGTATGCTGGATGAGGCAATTGAAATATTCAGTGTAATGGAGGAAAATAACTACAGGCCTGATACTGAGAATTACAATTCACTCATTCTTGGTTGCTGCAAATCTCGAAGAACCGATTTGGCCTTGGAGGTTTTCGAAATAATGGTTGATAAAGGTTATCTTCCCAATGAAACGACATACACCATTCTTGTGGAAGGGATCGTCCATGAAAAAGAGATAGATCTAGCAACCAAAGTACTGAGGGAGTTGCAACTAAGAGATGTTATAAGTCAAAGCACAGTGGAAAGACTTGTTATGCAGTATGACCTAAATGAATTACCATTGTGA

Protein sequence

MATLLNTVSPITNPSPETTRRGCGFFSHIPNLHKLSLNKGFSKVLASTQITISPKDTIFTLPNWRTGKVDQKSRELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKACKLRKAIKVMEMMIGSGIIPDASSYTFLVSSLCKKGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMHGNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEAIELLDEIIAKGGKPNLVSYNVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSYNILLRSLCYEGRWEEANVLLAEMDGDDRSPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDLVVKCLDQMIYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITSLCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENNYRPDTENYNSLILGCCKSRRTDLALEVFEIMVDKGYLPNETTYTILVEGIVHEKEIDLATKVLRELQLRDVISQSTVERLVMQYDLNELPL
Homology
BLAST of HG10013839 vs. NCBI nr
Match: XP_038899825.1 (pentatricopeptide repeat-containing protein At1g79080, chloroplastic [Benincasa hispida])

HSP 1 Score: 1119.4 bits (2894), Expect = 0.0e+00
Identity = 556/567 (98.06%), Postives = 563/567 (99.29%), Query Frame = 0

Query: 1   MATLLNTVSPITNPSPETTRRGCGFFSHIPNLHKLSLNKGFSKVLASTQITISPKDTIFT 60
           MATLLNTVSPITNP PETTRRGCGFFSHIPNLHKLSL+KGFSKVLASTQITISPKDTIFT
Sbjct: 1   MATLLNTVSPITNPLPETTRRGCGFFSHIPNLHKLSLSKGFSKVLASTQITISPKDTIFT 60

Query: 61  LPNWRTGKVDQKSRELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKACKLRKAIKVM 120
           LPNWRT K DQKSRELRLNDAFFHLEF+VEKGQKPDVFQATQLLYDLCKACK+RKAIKVM
Sbjct: 61  LPNWRTAKGDQKSRELRLNDAFFHLEFLVEKGQKPDVFQATQLLYDLCKACKMRKAIKVM 120

Query: 121 EMMIGSGIIPDASSYTFLVSSLCKKGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180
           EMMIGSGIIPDASSYTFLVSSLCKKGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH
Sbjct: 121 EMMIGSGIIPDASSYTFLVSSLCKKGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180

Query: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEAIELLDEIIAKGGKPNLVSY 240
           GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEAI+LLDEIIAKGGKPNLVSY
Sbjct: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEAIKLLDEIIAKGGKPNLVSY 240

Query: 241 NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSYNILLRSLCYEGRWEEANVLLAEMDG 300
           NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSYNILLRSLCYEGRWEEANVLLAEMDG
Sbjct: 241 NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSYNILLRSLCYEGRWEEANVLLAEMDG 300

Query: 301 DDRSPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360
           D+RSPSIVTYNILIGSLTL GRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL
Sbjct: 301 DERSPSIVTYNILIGSLTLSGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360

Query: 361 VVKCLDQMIYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITS 420
           VVKCLDQM+YRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITS
Sbjct: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITS 420

Query: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENNYRPD 480
           LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENNY+PD
Sbjct: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENNYKPD 480

Query: 481 TENYNSLILGCCKSRRTDLALEVFEIMVDKGYLPNETTYTILVEGIVHEKEIDLATKVLR 540
           TENYNSLILGCCKSRRTDLALEVFEIMVDKGYLPNETTYTILVEGIVHEKEIDLATKVLR
Sbjct: 481 TENYNSLILGCCKSRRTDLALEVFEIMVDKGYLPNETTYTILVEGIVHEKEIDLATKVLR 540

Query: 541 ELQLRDVISQSTVERLVMQYDLNELPL 568
           ELQLRDVISQSTVERLVMQYDLNELPL
Sbjct: 541 ELQLRDVISQSTVERLVMQYDLNELPL 567

BLAST of HG10013839 vs. NCBI nr
Match: XP_022141778.1 (pentatricopeptide repeat-containing protein At1g79080, chloroplastic [Momordica charantia])

HSP 1 Score: 1088.6 bits (2814), Expect = 0.0e+00
Identity = 536/567 (94.53%), Postives = 555/567 (97.88%), Query Frame = 0

Query: 1   MATLLNTVSPITNPSPETTRRGCGFFSHIPNLHKLSLNKGFSKVLASTQITISPKDTIFT 60
           MATLLNTVSPI NPSPET+RRGCGFFSHIPNLHKLSLNKGFSKVLAST ITISPKDTIFT
Sbjct: 1   MATLLNTVSPIANPSPETSRRGCGFFSHIPNLHKLSLNKGFSKVLASTHITISPKDTIFT 60

Query: 61  LPNWRTGKVDQKSRELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKACKLRKAIKVM 120
           LPNWRTGKVDQKSR+LRLNDAF HLEFMV KGQKPDVFQATQLLYDLCKA K+RKAIKVM
Sbjct: 61  LPNWRTGKVDQKSRDLRLNDAFLHLEFMVRKGQKPDVFQATQLLYDLCKASKMRKAIKVM 120

Query: 121 EMMIGSGIIPDASSYTFLVSSLCKKGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180
           EMMIGSGIIPDASSYTFLVS LCK+GNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH
Sbjct: 121 EMMIGSGIIPDASSYTFLVSCLCKRGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180

Query: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEAIELLDEIIAKGGKPNLVSY 240
           GNL+QSLQLLDRLI KGLVPNAYTYSFLLEAAYKERGADEAI+LLDEIIAKGGKPNLVSY
Sbjct: 181 GNLSQSLQLLDRLIHKGLVPNAYTYSFLLEAAYKERGADEAIKLLDEIIAKGGKPNLVSY 240

Query: 241 NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSYNILLRSLCYEGRWEEANVLLAEMDG 300
           NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVS NILLRSLCYEGRWEEAN LLAEMDG
Sbjct: 241 NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSCNILLRSLCYEGRWEEANELLAEMDG 300

Query: 301 DDRSPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360
           DDR+PSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL
Sbjct: 301 DDRAPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360

Query: 361 VVKCLDQMIYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITS 420
           VVKCLDQM+YRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGN+Q SSTQEFYK+V+TS
Sbjct: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNRQQSSTQEFYKMVVTS 420

Query: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENNYRPD 480
           LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEEN Y+PD
Sbjct: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENCYKPD 480

Query: 481 TENYNSLILGCCKSRRTDLALEVFEIMVDKGYLPNETTYTILVEGIVHEKEIDLATKVLR 540
           TENYN+LILGCCKS+RTDLALEVFE+MVDKGYLPNETTYTILVEGI+HEKEIDLA KVL+
Sbjct: 481 TENYNALILGCCKSQRTDLALEVFEVMVDKGYLPNETTYTILVEGIIHEKEIDLAAKVLK 540

Query: 541 ELQLRDVISQSTVERLVMQYDLNELPL 568
           ELQLRDVISQSTV+RLVMQYDLN+LPL
Sbjct: 541 ELQLRDVISQSTVDRLVMQYDLNDLPL 567

BLAST of HG10013839 vs. NCBI nr
Match: XP_008444287.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g79080, chloroplastic [Cucumis melo] >ADN33755.1 pentatricopeptide repeat-containing protein [Cucumis melo subsp. melo] >KAA0052323.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK01883.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1083.6 bits (2801), Expect = 0.0e+00
Identity = 538/567 (94.89%), Postives = 556/567 (98.06%), Query Frame = 0

Query: 1   MATLLNTVSPITNPSPETTRRGCGFFSHIPNLHKLSLNKGFSKVLASTQITISPKDTIFT 60
           MATLLNTVSPITN SPETTRRGCGFFSHIPNL KLSLNKGFSKVLASTQITISPKDTIFT
Sbjct: 1   MATLLNTVSPITNTSPETTRRGCGFFSHIPNLQKLSLNKGFSKVLASTQITISPKDTIFT 60

Query: 61  LPNWRTGKVDQKSRELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKACKLRKAIKVM 120
           LPNW+TGKV+QKS+ELRL DAFFHLEFMVEKGQKPDVFQATQLLYDLCKACK+RKAIKVM
Sbjct: 61  LPNWKTGKVEQKSKELRLTDAFFHLEFMVEKGQKPDVFQATQLLYDLCKACKMRKAIKVM 120

Query: 121 EMMIGSGIIPDASSYTFLVSSLCKKGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180
           EMMIGSGIIPDASSYTFLVSSLC+KGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH
Sbjct: 121 EMMIGSGIIPDASSYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180

Query: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEAIELLDEIIAKGGKPNLVSY 240
           GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEA +LLDEIIAKGG+PNLVSY
Sbjct: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGGEPNLVSY 240

Query: 241 NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSYNILLRSLCYEGRWEEANVLLAEMDG 300
           NVLLTGLCKEGRTEDAI+LFRELPSKGFSPNVVSYNILLRSLC EGRWEEANVLLAEM+G
Sbjct: 241 NVLLTGLCKEGRTEDAIRLFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMNG 300

Query: 301 DDRSPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360
           D+RSPS VTYNILIGSL LHGRTEHALEVLEEMIRARFKPTASSYNPIIA LCKD K+DL
Sbjct: 301 DERSPSTVTYNILIGSLALHGRTEHALEVLEEMIRARFKPTASSYNPIIAHLCKDGKLDL 360

Query: 361 VVKCLDQMIYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITS 420
           VVKCLDQM+YRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITS
Sbjct: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITS 420

Query: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENNYRPD 480
           LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGML+EAIEIFSVMEENN +PD
Sbjct: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEENN-KPD 480

Query: 481 TENYNSLILGCCKSRRTDLALEVFEIMVDKGYLPNETTYTILVEGIVHEKEIDLATKVLR 540
           TENYNSLILGCCKSRRTDLAL+VFEIMV KGYLPNETTYTILVEGI+HEKE+DLATKVLR
Sbjct: 481 TENYNSLILGCCKSRRTDLALDVFEIMVGKGYLPNETTYTILVEGIIHEKEMDLATKVLR 540

Query: 541 ELQLRDVISQSTVERLVMQYDLNELPL 568
           ELQLRDVISQST+ERLVMQYDLNELPL
Sbjct: 541 ELQLRDVISQSTLERLVMQYDLNELPL 566

BLAST of HG10013839 vs. NCBI nr
Match: XP_011653982.1 (pentatricopeptide repeat-containing protein At1g79080, chloroplastic [Cucumis sativus] >KGN54942.1 hypothetical protein Csa_012601 [Cucumis sativus])

HSP 1 Score: 1080.1 bits (2792), Expect = 0.0e+00
Identity = 536/567 (94.53%), Postives = 554/567 (97.71%), Query Frame = 0

Query: 1   MATLLNTVSPITNPSPETTRRGCGFFSHIPNLHKLSLNKGFSKVLASTQITISPKDTIFT 60
           MATLLNTVSPITNPSPETTRRGCGFFSHIPN+ KLSLNKGFSKVLASTQITISPKDTIFT
Sbjct: 1   MATLLNTVSPITNPSPETTRRGCGFFSHIPNIQKLSLNKGFSKVLASTQITISPKDTIFT 60

Query: 61  LPNWRTGKVDQKSRELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKACKLRKAIKVM 120
           LPNW+ GK+DQKS+ELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCK CK+RKAIKVM
Sbjct: 61  LPNWKIGKLDQKSKELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKTCKMRKAIKVM 120

Query: 121 EMMIGSGIIPDASSYTFLVSSLCKKGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180
           EMMIGSGIIPDA+SYTFLVSSLC+KGNVGYAMQLVDKMEEYGYPTNT TYNSLVRGLCMH
Sbjct: 121 EMMIGSGIIPDAASYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTATYNSLVRGLCMH 180

Query: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEAIELLDEIIAKGGKPNLVSY 240
           GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEA +LLDEIIAKGGKPNLVSY
Sbjct: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGGKPNLVSY 240

Query: 241 NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSYNILLRSLCYEGRWEEANVLLAEMDG 300
           NVLLTGLCKEGRTEDA+QLFRELPSKGFSPNVVSYNILLRSLC EGRWEEANVLLAEMDG
Sbjct: 241 NVLLTGLCKEGRTEDAMQLFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMDG 300

Query: 301 DDRSPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360
           D+RSPS VTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL
Sbjct: 301 DERSPSTVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360

Query: 361 VVKCLDQMIYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITS 420
           VVKCLDQM+YRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQH STQEFYKIVITS
Sbjct: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHFSTQEFYKIVITS 420

Query: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENNYRPD 480
           LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGML+EAIEIFSVMEE N + D
Sbjct: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEE-NIKLD 480

Query: 481 TENYNSLILGCCKSRRTDLALEVFEIMVDKGYLPNETTYTILVEGIVHEKEIDLATKVLR 540
           TENYNSLILGCCKSRRTDLAL+VFEIMV KGYL NETTYTILVEGI+HEKE+DLAT+VLR
Sbjct: 481 TENYNSLILGCCKSRRTDLALDVFEIMVGKGYLANETTYTILVEGIIHEKEMDLATEVLR 540

Query: 541 ELQLRDVISQSTVERLVMQYDLNELPL 568
           ELQLRDVI+QSTVERLVMQYDLNELPL
Sbjct: 541 ELQLRDVINQSTVERLVMQYDLNELPL 566

BLAST of HG10013839 vs. NCBI nr
Match: KAG6607968.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] >KAG7037481.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1067.0 bits (2758), Expect = 5.6e-308
Identity = 527/567 (92.95%), Postives = 547/567 (96.47%), Query Frame = 0

Query: 1   MATLLNTVSPITNPSPETTRRGCGFFSHIPNLHKLSLNKGFSKVLASTQITISPKDTIFT 60
           MATLLNTVSPITNPSPETTRRG GFFSHIPNLHKLSL+KGFSKVLASTQ+TISPKDTIFT
Sbjct: 1   MATLLNTVSPITNPSPETTRRGYGFFSHIPNLHKLSLSKGFSKVLASTQVTISPKDTIFT 60

Query: 61  LPNWRTGKVDQKSRELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKACKLRKAIKVM 120
           LPNWR GK DQK+RE RLNDAF +LE++V KGQKPDVFQATQLLYDLCKA KLR+AIKVM
Sbjct: 61  LPNWRIGKGDQKNREHRLNDAFLNLEYLVGKGQKPDVFQATQLLYDLCKASKLRRAIKVM 120

Query: 121 EMMIGSGIIPDASSYTFLVSSLCKKGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180
           EMMIGSGIIPDASSYTFLVSSLCK+GNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH
Sbjct: 121 EMMIGSGIIPDASSYTFLVSSLCKRGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180

Query: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEAIELLDEIIAKGGKPNLVSY 240
           GNLT SLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEA++LLDEIIAKGGKPNLVSY
Sbjct: 181 GNLTHSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEAVKLLDEIIAKGGKPNLVSY 240

Query: 241 NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSYNILLRSLCYEGRWEEANVLLAEMDG 300
           NVLLTGLCKEGRT+DAIQLFRELPSKGF+PNVVSYNILLRSLCYEGRWEEANVLLAEMDG
Sbjct: 241 NVLLTGLCKEGRTDDAIQLFRELPSKGFNPNVVSYNILLRSLCYEGRWEEANVLLAEMDG 300

Query: 301 DDRSPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360
           DDR+PS VTYN LIGSL  HGRTEHALEVLEEMIRARFKPTASSYNPIIARLC+D+KVDL
Sbjct: 301 DDRAPSAVTYNTLIGSLAFHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCRDKKVDL 360

Query: 361 VVKCLDQMIYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITS 420
           VVKCLDQMIYRHCNPNEGTYNAIATLCE GMVQEAFSI+QSLGNKQH STQEFYK VITS
Sbjct: 361 VVKCLDQMIYRHCNPNEGTYNAIATLCEVGMVQEAFSILQSLGNKQHYSTQEFYKSVITS 420

Query: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENNYRPD 480
           LCRKGNTY AFQLLYEMTKYGFTPDSF YSSLIRGLCMEGMLDEAIEIFSVMEENNY+PD
Sbjct: 421 LCRKGNTYSAFQLLYEMTKYGFTPDSFAYSSLIRGLCMEGMLDEAIEIFSVMEENNYKPD 480

Query: 481 TENYNSLILGCCKSRRTDLALEVFEIMVDKGYLPNETTYTILVEGIVHEKEIDLATKVLR 540
           TENYNSLI GCCKSRRTDLALEVFE MV+KGYLPNETTYT LVEGIVHEK+IDLATKVLR
Sbjct: 481 TENYNSLIFGCCKSRRTDLALEVFETMVNKGYLPNETTYTTLVEGIVHEKQIDLATKVLR 540

Query: 541 ELQLRDVISQSTVERLVMQYDLNELPL 568
           ELQL+DVISQSTVERLVMQYDLNELPL
Sbjct: 541 ELQLKDVISQSTVERLVMQYDLNELPL 567

BLAST of HG10013839 vs. ExPASy Swiss-Prot
Match: A3KPF8 (Pentatricopeptide repeat-containing protein At1g79080, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At1g79080 PE=2 SV=1)

HSP 1 Score: 762.3 bits (1967), Expect = 3.9e-219
Identity = 383/577 (66.38%), Postives = 473/577 (81.98%), Query Frame = 0

Query: 1   MATLLNTVSPITNPSPETTRRGCGFFSHIPN--LHKLSLNKGFSKVLASTQITISPKDTI 60
           M+TLLN+V  + +P   + R+  GF SHIP+  LH  S++KG ++VLASTQIT+SPKD+ 
Sbjct: 1   MSTLLNSVLSMASPE-SSPRKAVGFVSHIPSGFLHFSSVSKGVARVLASTQITLSPKDSA 60

Query: 61  FTL------PNWRTGKV--DQKSRELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKA 120
           FT+      P+  +G    D +S E  L+D+F HLE +V  G KP+V  +TQLLYDLCKA
Sbjct: 61  FTITGSSWKPDLDSGSFSDDPRSDEPNLSDSFSHLESLVTGGHKPNVAHSTQLLYDLCKA 120

Query: 121 CKLRKAIKVMEMMIGSGIIPDASSYTFLVSSLCKKGNVGYAMQLVDKMEEYGYPTNTVTY 180
            +L+KAI+V+E+M+ SGIIPDAS+YT+LV+ LCK+GNVGYAMQLV+KME++GYP+NTVTY
Sbjct: 121 NRLKKAIRVIELMVSSGIIPDASAYTYLVNQLCKRGNVGYAMQLVEKMEDHGYPSNTVTY 180

Query: 181 NSLVRGLCMHGNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEAIELLDEIIA 240
           N+LVRGLCM G+L QSLQ ++RL+QKGL PNA+TYSFLLEAAYKERG DEA++LLDEII 
Sbjct: 181 NALVRGLCMLGSLNQSLQFVERLMQKGLAPNAFTYSFLLEAAYKERGTDEAVKLLDEIIV 240

Query: 241 KGGKPNLVSYNVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSYNILLRSLCYEGRWEE 300
           KGG+PNLVSYNVLLTG CKEGRT+DA+ LFRELP+KGF  NVVSYNILLR LC +GRWEE
Sbjct: 241 KGGEPNLVSYNVLLTGFCKEGRTDDAMALFRELPAKGFKANVVSYNILLRCLCCDGRWEE 300

Query: 301 ANVLLAEMDGDDRSPSIVTYNILIGSLTLHGRTEHALEVLEEMIRA--RFKPTASSYNPI 360
           AN LLAEMDG DR+PS+VTYNILI SL  HGRTE AL+VL+EM +   +F+ TA+SYNP+
Sbjct: 301 ANSLLAEMDGGDRAPSVVTYNILINSLAFHGRTEQALQVLKEMSKGNHQFRVTATSYNPV 360

Query: 361 IARLCKDRKVDLVVKCLDQMIYRHCNPNEGTYNAIATLCE-EGMVQEAFSIIQSLGNKQH 420
           IARLCK+ KVDLVVKCLD+MIYR C PNEGTYNAI +LCE    VQEAF IIQSL NKQ 
Sbjct: 361 IARLCKEGKVDLVVKCLDEMIYRRCKPNEGTYNAIGSLCEHNSKVQEAFYIIQSLSNKQK 420

Query: 421 SSTQEFYKIVITSLCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIE 480
             T +FYK VITSLCRKGNT+ AFQLLYEMT+ GF PD+ TYS+LIRGLC+EGM   A+E
Sbjct: 421 CCTHDFYKSVITSLCRKGNTFAAFQLLYEMTRCGFDPDAHTYSALIRGLCLEGMFTGAME 480

Query: 481 IFSVMEEN-NYRPDTENYNSLILGCCKSRRTDLALEVFEIMVDKGYLPNETTYTILVEGI 540
           + S+MEE+ N +P  +N+N++ILG CK RRTDLA+EVFE+MV+K  +PNETTY ILVEGI
Sbjct: 481 VLSIMEESENCKPTVDNFNAMILGLCKIRRTDLAMEVFEMMVEKKRMPNETTYAILVEGI 540

Query: 541 VHEKEIDLATKVLRELQLRDVISQSTVERLVMQYDLN 564
            HE E++LA +VL EL+LR VI Q+ V+R+VMQ++L+
Sbjct: 541 AHEDELELAKEVLDELRLRKVIGQNAVDRIVMQFNLD 576

BLAST of HG10013839 vs. ExPASy Swiss-Prot
Match: Q3EDF8 (Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana OX=3702 GN=At1g09900 PE=2 SV=1)

HSP 1 Score: 392.9 bits (1008), Expect = 6.2e-108
Identity = 196/487 (40.25%), Postives = 306/487 (62.83%), Query Frame = 0

Query: 71  QKSRELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKACKLRKAIKVMEMMIGSGIIP 130
           Q  R   L + F  LE MV  G  PD+   T L+   C+  K RKA K++E++ GSG +P
Sbjct: 111 QMVRTGELEEGFKFLENMVYHGNVPDIIPCTTLIRGFCRLGKTRKAAKILEILEGSGAVP 170

Query: 131 DASSYTFLVSSLCKKGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMHGNLTQSLQLL 190
           D  +Y  ++S  CK G +  A+ ++D+M       + VTYN+++R LC  G L Q++++L
Sbjct: 171 DVITYNVMISGYCKAGEINNALSVLDRM---SVSPDVVTYNTILRSLCDSGKLKQAMEVL 230

Query: 191 DRLIQKGLVPNAYTYSFLLEAAYKERGADEAIELLDEIIAKGGKPNLVSYNVLLTGLCKE 250
           DR++Q+   P+  TY+ L+EA  ++ G   A++LLDE+  +G  P++V+YNVL+ G+CKE
Sbjct: 231 DRMLQRDCYPDVITYTILIEATCRDSGVGHAMKLLDEMRDRGCTPDVVTYNVLVNGICKE 290

Query: 251 GRTEDAIQLFRELPSKGFSPNVVSYNILLRSLCYEGRWEEANVLLAEMDGDDRSPSIVTY 310
           GR ++AI+   ++PS G  PNV+++NI+LRS+C  GRW +A  LLA+M     SPS+VT+
Sbjct: 291 GRLDEAIKFLNDMPSSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGFSPSVVTF 350

Query: 311 NILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDLVVKCLDQMIY 370
           NILI  L   G    A+++LE+M +   +P + SYNP++   CK++K+D  ++ L++M+ 
Sbjct: 351 NILINFLCRKGLLGRAIDILEKMPQHGCQPNSLSYNPLLHGFCKEKKMDRAIEYLERMVS 410

Query: 371 RHCNPNEGTYNAIAT-LCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITSLCRKGNTYP 430
           R C P+  TYN + T LC++G V++A  I+  L +K  S     Y  VI  L + G T  
Sbjct: 411 RGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLAKAGKTGK 470

Query: 431 AFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENNYRPDTENYNSLIL 490
           A +LL EM      PD+ TYSSL+ GL  EG +DEAI+ F   E    RP+   +NS++L
Sbjct: 471 AIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAIKFFHEFERMGIRPNAVTFNSIML 530

Query: 491 GCCKSRRTDLALEVFEIMVDKGYLPNETTYTILVEGIVHEKEIDLATKVLRELQLRDVIS 550
           G CKSR+TD A++    M+++G  PNET+YTIL+EG+ +E     A ++L EL  + ++ 
Sbjct: 531 GLCKSRQTDRAIDFLVFMINRGCKPNETSYTILIEGLAYEGMAKEALELLNELCNKGLMK 590

Query: 551 QSTVERL 557
           +S+ E++
Sbjct: 591 KSSAEQV 594

BLAST of HG10013839 vs. ExPASy Swiss-Prot
Match: Q9SR00 (Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At3g04760 PE=2 SV=1)

HSP 1 Score: 342.4 bits (877), Expect = 9.6e-93
Identity = 178/479 (37.16%), Postives = 276/479 (57.62%), Query Frame = 0

Query: 85  LEFMVEKGQKPDVFQATQLLYDLCKACKLRKAIKVMEMMIGSGIIPDASSYTFLVSSLCK 144
           LE MV KG  PDV   T+L+        + KA++VME++   G  PD  +Y  L++  CK
Sbjct: 112 LETMVRKGYNPDVILCTKLIKGFFTLRNIPKAVRVMEILEKFG-QPDVFAYNALINGFCK 171

Query: 145 KGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMHGNLTQSLQLLDRLIQKGLVPNAYT 204
              +  A +++D+M    +  +TVTYN ++  LC  G L  +L++L++L+     P   T
Sbjct: 172 MNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALKVLNQLLSDNCQPTVIT 231

Query: 205 YSFLLEAAYKERGADEAIELLDEIIAKGGKPNLVSYNVLLTGLCKEGRTEDAIQLFRELP 264
           Y+ L+EA   E G DEA++L+DE++++G KP++ +YN ++ G+CKEG  + A ++ R L 
Sbjct: 232 YTILIEATMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGMCKEGMVDRAFEMVRNLE 291

Query: 265 SKGFSPNVVSYNILLRSLCYEGRWEEANVLLAEMDGDDRSPSIVTYNILIGSLTLHGRTE 324
            KG  P+V+SYNILLR+L  +G+WEE   L+ +M  +   P++VTY+ILI +L   G+ E
Sbjct: 292 LKGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNVVTYSILITTLCRDGKIE 351

Query: 325 HALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDLVVKCLDQMIYRHCNPNEGTYNAI- 384
            A+ +L+ M      P A SY+P+IA  C++ ++D+ ++ L+ MI   C P+   YN + 
Sbjct: 352 EAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLETMISDGCLPDIVNYNTVL 411

Query: 385 ATLCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITSLCRKGNTYPAFQLLYEMTKYGFT 444
           ATLC+ G   +A  I   LG    S     Y  + ++L   G+   A  ++ EM   G  
Sbjct: 412 ATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGDKIRALHMILEMMSNGID 471

Query: 445 PDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENNYRPDTENYNSLILGCCKSRRTDLALEV 504
           PD  TY+S+I  LC EGM+DEA E+   M    + P    YN ++LG CK+ R + A+ V
Sbjct: 472 PDEITYNSMISCLCREGMVDEAFELLVDMRSCEFHPSVVTYNIVLLGFCKAHRIEDAINV 531

Query: 505 FEIMVDKGYLPNETTYTILVEGIVHEKEIDLATKVLRELQLRDVISQSTVERLVMQYDL 563
            E MV  G  PNETTYT+L+EGI        A ++  +L   D IS+ + +RL   + L
Sbjct: 532 LESMVGNGCRPNETTYTVLIEGIGFAGYRAEAMELANDLVRIDAISEYSFKRLHRTFPL 589

BLAST of HG10013839 vs. ExPASy Swiss-Prot
Match: Q9ASZ8 (Pentatricopeptide repeat-containing protein At1g12620 OS=Arabidopsis thaliana OX=3702 GN=At1g12620 PE=2 SV=1)

HSP 1 Score: 260.4 bits (664), Expect = 4.8e-68
Identity = 135/454 (29.74%), Postives = 239/454 (52.64%), Query Frame = 0

Query: 91  KGQKPDVFQATQLLYDLCKACKLRKAIKVMEMMIGSGIIPDASSYTFLVSSLCKKGNVGY 150
           KG   +++  + ++   C+  KL  A   M  +I  G  PD  +++ L++ LC +G V  
Sbjct: 101 KGIAHNLYTLSIMINCCCRCRKLSLAFSAMGKIIKLGYEPDTVTFSTLINGLCLEGRVSE 160

Query: 151 AMQLVDKMEEYGYPTNTVTYNSLVRGLCMHGNLTQSLQLLDRLIQKGLVPNAYTYSFLLE 210
           A++LVD+M E G+    +T N+LV GLC++G ++ ++ L+DR+++ G  PN  TY  +L+
Sbjct: 161 ALELVDRMVEMGHKPTLITLNALVNGLCLNGKVSDAVLLIDRMVETGFQPNEVTYGPVLK 220

Query: 211 AAYKERGADEAIELLDEIIAKGGKPNLVSYNVLLTGLCKEGRTEDAIQLFRELPSKGFSP 270
              K      A+ELL ++  +  K + V Y++++ GLCK+G  ++A  LF E+  KGF  
Sbjct: 221 VMCKSGQTALAMELLRKMEERKIKLDAVKYSIIIDGLCKDGSLDNAFNLFNEMEIKGFKA 280

Query: 271 NVVSYNILLRSLCYEGRWEEANVLLAEMDGDDRSPSIVTYNILIGSLTLHGRTEHALEVL 330
           +++ Y  L+R  CY GRW++   LL +M     +P +V ++ LI      G+   A E+ 
Sbjct: 281 DIIIYTTLIRGFCYAGRWDDGAKLLRDMIKRKITPDVVAFSALIDCFVKEGKLREAEELH 340

Query: 331 EEMIRARFKPTASSYNPIIARLCKDRKVDLVVKCLDQMIYRHCNPNEGTYNAIAT-LCEE 390
           +EMI+    P   +Y  +I   CK+ ++D     LD M+ + C PN  T+N +    C+ 
Sbjct: 341 KEMIQRGISPDTVTYTSLIDGFCKENQLDKANHMLDLMVSKGCGPNIRTFNILINGYCKA 400

Query: 391 GMVQEAFSIIQSLGNKQHSSTQEFYKIVITSLCRKGNTYPAFQLLYEMTKYGFTPDSFTY 450
            ++ +   + + +  +   +    Y  +I   C  G    A +L  EM      PD  +Y
Sbjct: 401 NLIDDGLELFRKMSLRGVVADTVTYNTLIQGFCELGKLEVAKELFQEMVSRRVRPDIVSY 460

Query: 451 SSLIRGLCMEGMLDEAIEIFSVMEENNYRPDTENYNSLILGCCKSRRTDLALEVFEIMVD 510
             L+ GLC  G  ++A+EIF  +E++    D   YN +I G C + + D A ++F  +  
Sbjct: 461 KILLDGLCDNGEPEKALEIFEKIEKSKMELDIGIYNIIIHGMCNASKVDDAWDLFCSLPL 520

Query: 511 KGYLPNETTYTILVEGIVHEKEIDLATKVLRELQ 544
           KG  P+  TY I++ G+  +  +  A  + R+++
Sbjct: 521 KGVKPDVKTYNIMIGGLCKKGSLSEADLLFRKME 554

BLAST of HG10013839 vs. ExPASy Swiss-Prot
Match: Q0WKV3 (Pentatricopeptide repeat-containing protein At1g12300, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g12300 PE=2 SV=1)

HSP 1 Score: 258.5 bits (659), Expect = 1.8e-67
Identity = 135/454 (29.74%), Postives = 240/454 (52.86%), Query Frame = 0

Query: 91  KGQKPDVFQATQLLYDLCKACKLRKAIKVMEMMIGSGIIPDASSYTFLVSSLCKKGNVGY 150
           KG   +++  + ++   C+  KL  A   M  +I  G  P+  +++ L++ LC +G V  
Sbjct: 117 KGIAHNLYTLSIMINCFCRCRKLCLAFSAMGKIIKLGYEPNTITFSTLINGLCLEGRVSE 176

Query: 151 AMQLVDKMEEYGYPTNTVTYNSLVRGLCMHGNLTQSLQLLDRLIQKGLVPNAYTYSFLLE 210
           A++LVD+M E G+  + +T N+LV GLC+ G   +++ L+D++++ G  PNA TY  +L 
Sbjct: 177 ALELVDRMVEMGHKPDLITINTLVNGLCLSGKEAEAMLLIDKMVEYGCQPNAVTYGPVLN 236

Query: 211 AAYKERGADEAIELLDEIIAKGGKPNLVSYNVLLTGLCKEGRTEDAIQLFRELPSKGFSP 270
              K      A+ELL ++  +  K + V Y++++ GLCK G  ++A  LF E+  KG + 
Sbjct: 237 VMCKSGQTALAMELLRKMEERNIKLDAVKYSIIIDGLCKHGSLDNAFNLFNEMEMKGITT 296

Query: 271 NVVSYNILLRSLCYEGRWEEANVLLAEMDGDDRSPSIVTYNILIGSLTLHGRTEHALEVL 330
           N+++YNIL+   C  GRW++   LL +M     +P++VT+++LI S    G+   A E+ 
Sbjct: 297 NIITYNILIGGFCNAGRWDDGAKLLRDMIKRKINPNVVTFSVLIDSFVKEGKLREAEELH 356

Query: 331 EEMIRARFKPTASSYNPIIARLCKDRKVDLVVKCLDQMIYRHCNPNEGTYNAIAT-LCEE 390
           +EMI     P   +Y  +I   CK+  +D   + +D M+ + C+PN  T+N +    C+ 
Sbjct: 357 KEMIHRGIAPDTITYTSLIDGFCKENHLDKANQMVDLMVSKGCDPNIRTFNILINGYCKA 416

Query: 391 GMVQEAFSIIQSLGNKQHSSTQEFYKIVITSLCRKGNTYPAFQLLYEMTKYGFTPDSFTY 450
             + +   + + +  +   +    Y  +I   C  G    A +L  EM      P+  TY
Sbjct: 417 NRIDDGLELFRKMSLRGVVADTVTYNTLIQGFCELGKLNVAKELFQEMVSRKVPPNIVTY 476

Query: 451 SSLIRGLCMEGMLDEAIEIFSVMEENNYRPDTENYNSLILGCCKSRRTDLALEVFEIMVD 510
             L+ GLC  G  ++A+EIF  +E++    D   YN +I G C + + D A ++F  +  
Sbjct: 477 KILLDGLCDNGESEKALEIFEKIEKSKMELDIGIYNIIIHGMCNASKVDDAWDLFCSLPL 536

Query: 511 KGYLPNETTYTILVEGIVHEKEIDLATKVLRELQ 544
           KG  P   TY I++ G+  +  +  A  + R+++
Sbjct: 537 KGVKPGVKTYNIMIGGLCKKGPLSEAELLFRKME 570

BLAST of HG10013839 vs. ExPASy TrEMBL
Match: A0A6J1CLJ3 (pentatricopeptide repeat-containing protein At1g79080, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111012056 PE=4 SV=1)

HSP 1 Score: 1088.6 bits (2814), Expect = 0.0e+00
Identity = 536/567 (94.53%), Postives = 555/567 (97.88%), Query Frame = 0

Query: 1   MATLLNTVSPITNPSPETTRRGCGFFSHIPNLHKLSLNKGFSKVLASTQITISPKDTIFT 60
           MATLLNTVSPI NPSPET+RRGCGFFSHIPNLHKLSLNKGFSKVLAST ITISPKDTIFT
Sbjct: 1   MATLLNTVSPIANPSPETSRRGCGFFSHIPNLHKLSLNKGFSKVLASTHITISPKDTIFT 60

Query: 61  LPNWRTGKVDQKSRELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKACKLRKAIKVM 120
           LPNWRTGKVDQKSR+LRLNDAF HLEFMV KGQKPDVFQATQLLYDLCKA K+RKAIKVM
Sbjct: 61  LPNWRTGKVDQKSRDLRLNDAFLHLEFMVRKGQKPDVFQATQLLYDLCKASKMRKAIKVM 120

Query: 121 EMMIGSGIIPDASSYTFLVSSLCKKGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180
           EMMIGSGIIPDASSYTFLVS LCK+GNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH
Sbjct: 121 EMMIGSGIIPDASSYTFLVSCLCKRGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180

Query: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEAIELLDEIIAKGGKPNLVSY 240
           GNL+QSLQLLDRLI KGLVPNAYTYSFLLEAAYKERGADEAI+LLDEIIAKGGKPNLVSY
Sbjct: 181 GNLSQSLQLLDRLIHKGLVPNAYTYSFLLEAAYKERGADEAIKLLDEIIAKGGKPNLVSY 240

Query: 241 NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSYNILLRSLCYEGRWEEANVLLAEMDG 300
           NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVS NILLRSLCYEGRWEEAN LLAEMDG
Sbjct: 241 NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSCNILLRSLCYEGRWEEANELLAEMDG 300

Query: 301 DDRSPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360
           DDR+PSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL
Sbjct: 301 DDRAPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360

Query: 361 VVKCLDQMIYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITS 420
           VVKCLDQM+YRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGN+Q SSTQEFYK+V+TS
Sbjct: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNRQQSSTQEFYKMVVTS 420

Query: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENNYRPD 480
           LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEEN Y+PD
Sbjct: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENCYKPD 480

Query: 481 TENYNSLILGCCKSRRTDLALEVFEIMVDKGYLPNETTYTILVEGIVHEKEIDLATKVLR 540
           TENYN+LILGCCKS+RTDLALEVFE+MVDKGYLPNETTYTILVEGI+HEKEIDLA KVL+
Sbjct: 481 TENYNALILGCCKSQRTDLALEVFEVMVDKGYLPNETTYTILVEGIIHEKEIDLAAKVLK 540

Query: 541 ELQLRDVISQSTVERLVMQYDLNELPL 568
           ELQLRDVISQSTV+RLVMQYDLN+LPL
Sbjct: 541 ELQLRDVISQSTVDRLVMQYDLNDLPL 567

BLAST of HG10013839 vs. ExPASy TrEMBL
Match: A0A5A7U8T1 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold113G001460 PE=4 SV=1)

HSP 1 Score: 1083.6 bits (2801), Expect = 0.0e+00
Identity = 538/567 (94.89%), Postives = 556/567 (98.06%), Query Frame = 0

Query: 1   MATLLNTVSPITNPSPETTRRGCGFFSHIPNLHKLSLNKGFSKVLASTQITISPKDTIFT 60
           MATLLNTVSPITN SPETTRRGCGFFSHIPNL KLSLNKGFSKVLASTQITISPKDTIFT
Sbjct: 1   MATLLNTVSPITNTSPETTRRGCGFFSHIPNLQKLSLNKGFSKVLASTQITISPKDTIFT 60

Query: 61  LPNWRTGKVDQKSRELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKACKLRKAIKVM 120
           LPNW+TGKV+QKS+ELRL DAFFHLEFMVEKGQKPDVFQATQLLYDLCKACK+RKAIKVM
Sbjct: 61  LPNWKTGKVEQKSKELRLTDAFFHLEFMVEKGQKPDVFQATQLLYDLCKACKMRKAIKVM 120

Query: 121 EMMIGSGIIPDASSYTFLVSSLCKKGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180
           EMMIGSGIIPDASSYTFLVSSLC+KGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH
Sbjct: 121 EMMIGSGIIPDASSYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180

Query: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEAIELLDEIIAKGGKPNLVSY 240
           GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEA +LLDEIIAKGG+PNLVSY
Sbjct: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGGEPNLVSY 240

Query: 241 NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSYNILLRSLCYEGRWEEANVLLAEMDG 300
           NVLLTGLCKEGRTEDAI+LFRELPSKGFSPNVVSYNILLRSLC EGRWEEANVLLAEM+G
Sbjct: 241 NVLLTGLCKEGRTEDAIRLFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMNG 300

Query: 301 DDRSPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360
           D+RSPS VTYNILIGSL LHGRTEHALEVLEEMIRARFKPTASSYNPIIA LCKD K+DL
Sbjct: 301 DERSPSTVTYNILIGSLALHGRTEHALEVLEEMIRARFKPTASSYNPIIAHLCKDGKLDL 360

Query: 361 VVKCLDQMIYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITS 420
           VVKCLDQM+YRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITS
Sbjct: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITS 420

Query: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENNYRPD 480
           LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGML+EAIEIFSVMEENN +PD
Sbjct: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEENN-KPD 480

Query: 481 TENYNSLILGCCKSRRTDLALEVFEIMVDKGYLPNETTYTILVEGIVHEKEIDLATKVLR 540
           TENYNSLILGCCKSRRTDLAL+VFEIMV KGYLPNETTYTILVEGI+HEKE+DLATKVLR
Sbjct: 481 TENYNSLILGCCKSRRTDLALDVFEIMVGKGYLPNETTYTILVEGIIHEKEMDLATKVLR 540

Query: 541 ELQLRDVISQSTVERLVMQYDLNELPL 568
           ELQLRDVISQST+ERLVMQYDLNELPL
Sbjct: 541 ELQLRDVISQSTLERLVMQYDLNELPL 566

BLAST of HG10013839 vs. ExPASy TrEMBL
Match: E5GBB3 (Pentatricopeptide repeat-containing protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1)

HSP 1 Score: 1083.6 bits (2801), Expect = 0.0e+00
Identity = 538/567 (94.89%), Postives = 556/567 (98.06%), Query Frame = 0

Query: 1   MATLLNTVSPITNPSPETTRRGCGFFSHIPNLHKLSLNKGFSKVLASTQITISPKDTIFT 60
           MATLLNTVSPITN SPETTRRGCGFFSHIPNL KLSLNKGFSKVLASTQITISPKDTIFT
Sbjct: 1   MATLLNTVSPITNTSPETTRRGCGFFSHIPNLQKLSLNKGFSKVLASTQITISPKDTIFT 60

Query: 61  LPNWRTGKVDQKSRELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKACKLRKAIKVM 120
           LPNW+TGKV+QKS+ELRL DAFFHLEFMVEKGQKPDVFQATQLLYDLCKACK+RKAIKVM
Sbjct: 61  LPNWKTGKVEQKSKELRLTDAFFHLEFMVEKGQKPDVFQATQLLYDLCKACKMRKAIKVM 120

Query: 121 EMMIGSGIIPDASSYTFLVSSLCKKGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180
           EMMIGSGIIPDASSYTFLVSSLC+KGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH
Sbjct: 121 EMMIGSGIIPDASSYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180

Query: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEAIELLDEIIAKGGKPNLVSY 240
           GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEA +LLDEIIAKGG+PNLVSY
Sbjct: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGGEPNLVSY 240

Query: 241 NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSYNILLRSLCYEGRWEEANVLLAEMDG 300
           NVLLTGLCKEGRTEDAI+LFRELPSKGFSPNVVSYNILLRSLC EGRWEEANVLLAEM+G
Sbjct: 241 NVLLTGLCKEGRTEDAIRLFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMNG 300

Query: 301 DDRSPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360
           D+RSPS VTYNILIGSL LHGRTEHALEVLEEMIRARFKPTASSYNPIIA LCKD K+DL
Sbjct: 301 DERSPSTVTYNILIGSLALHGRTEHALEVLEEMIRARFKPTASSYNPIIAHLCKDGKLDL 360

Query: 361 VVKCLDQMIYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITS 420
           VVKCLDQM+YRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITS
Sbjct: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITS 420

Query: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENNYRPD 480
           LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGML+EAIEIFSVMEENN +PD
Sbjct: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEENN-KPD 480

Query: 481 TENYNSLILGCCKSRRTDLALEVFEIMVDKGYLPNETTYTILVEGIVHEKEIDLATKVLR 540
           TENYNSLILGCCKSRRTDLAL+VFEIMV KGYLPNETTYTILVEGI+HEKE+DLATKVLR
Sbjct: 481 TENYNSLILGCCKSRRTDLALDVFEIMVGKGYLPNETTYTILVEGIIHEKEMDLATKVLR 540

Query: 541 ELQLRDVISQSTVERLVMQYDLNELPL 568
           ELQLRDVISQST+ERLVMQYDLNELPL
Sbjct: 541 ELQLRDVISQSTLERLVMQYDLNELPL 566

BLAST of HG10013839 vs. ExPASy TrEMBL
Match: A0A1S3BA04 (pentatricopeptide repeat-containing protein At1g79080, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103487653 PE=4 SV=1)

HSP 1 Score: 1083.6 bits (2801), Expect = 0.0e+00
Identity = 538/567 (94.89%), Postives = 556/567 (98.06%), Query Frame = 0

Query: 1   MATLLNTVSPITNPSPETTRRGCGFFSHIPNLHKLSLNKGFSKVLASTQITISPKDTIFT 60
           MATLLNTVSPITN SPETTRRGCGFFSHIPNL KLSLNKGFSKVLASTQITISPKDTIFT
Sbjct: 1   MATLLNTVSPITNTSPETTRRGCGFFSHIPNLQKLSLNKGFSKVLASTQITISPKDTIFT 60

Query: 61  LPNWRTGKVDQKSRELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKACKLRKAIKVM 120
           LPNW+TGKV+QKS+ELRL DAFFHLEFMVEKGQKPDVFQATQLLYDLCKACK+RKAIKVM
Sbjct: 61  LPNWKTGKVEQKSKELRLTDAFFHLEFMVEKGQKPDVFQATQLLYDLCKACKMRKAIKVM 120

Query: 121 EMMIGSGIIPDASSYTFLVSSLCKKGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180
           EMMIGSGIIPDASSYTFLVSSLC+KGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH
Sbjct: 121 EMMIGSGIIPDASSYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180

Query: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEAIELLDEIIAKGGKPNLVSY 240
           GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEA +LLDEIIAKGG+PNLVSY
Sbjct: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGGEPNLVSY 240

Query: 241 NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSYNILLRSLCYEGRWEEANVLLAEMDG 300
           NVLLTGLCKEGRTEDAI+LFRELPSKGFSPNVVSYNILLRSLC EGRWEEANVLLAEM+G
Sbjct: 241 NVLLTGLCKEGRTEDAIRLFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMNG 300

Query: 301 DDRSPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360
           D+RSPS VTYNILIGSL LHGRTEHALEVLEEMIRARFKPTASSYNPIIA LCKD K+DL
Sbjct: 301 DERSPSTVTYNILIGSLALHGRTEHALEVLEEMIRARFKPTASSYNPIIAHLCKDGKLDL 360

Query: 361 VVKCLDQMIYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITS 420
           VVKCLDQM+YRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITS
Sbjct: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITS 420

Query: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENNYRPD 480
           LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGML+EAIEIFSVMEENN +PD
Sbjct: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEENN-KPD 480

Query: 481 TENYNSLILGCCKSRRTDLALEVFEIMVDKGYLPNETTYTILVEGIVHEKEIDLATKVLR 540
           TENYNSLILGCCKSRRTDLAL+VFEIMV KGYLPNETTYTILVEGI+HEKE+DLATKVLR
Sbjct: 481 TENYNSLILGCCKSRRTDLALDVFEIMVGKGYLPNETTYTILVEGIIHEKEMDLATKVLR 540

Query: 541 ELQLRDVISQSTVERLVMQYDLNELPL 568
           ELQLRDVISQST+ERLVMQYDLNELPL
Sbjct: 541 ELQLRDVISQSTLERLVMQYDLNELPL 566

BLAST of HG10013839 vs. ExPASy TrEMBL
Match: A0A0A0L2W8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G613170 PE=4 SV=1)

HSP 1 Score: 1080.1 bits (2792), Expect = 0.0e+00
Identity = 536/567 (94.53%), Postives = 554/567 (97.71%), Query Frame = 0

Query: 1   MATLLNTVSPITNPSPETTRRGCGFFSHIPNLHKLSLNKGFSKVLASTQITISPKDTIFT 60
           MATLLNTVSPITNPSPETTRRGCGFFSHIPN+ KLSLNKGFSKVLASTQITISPKDTIFT
Sbjct: 1   MATLLNTVSPITNPSPETTRRGCGFFSHIPNIQKLSLNKGFSKVLASTQITISPKDTIFT 60

Query: 61  LPNWRTGKVDQKSRELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKACKLRKAIKVM 120
           LPNW+ GK+DQKS+ELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCK CK+RKAIKVM
Sbjct: 61  LPNWKIGKLDQKSKELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKTCKMRKAIKVM 120

Query: 121 EMMIGSGIIPDASSYTFLVSSLCKKGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180
           EMMIGSGIIPDA+SYTFLVSSLC+KGNVGYAMQLVDKMEEYGYPTNT TYNSLVRGLCMH
Sbjct: 121 EMMIGSGIIPDAASYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTATYNSLVRGLCMH 180

Query: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEAIELLDEIIAKGGKPNLVSY 240
           GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEA +LLDEIIAKGGKPNLVSY
Sbjct: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGGKPNLVSY 240

Query: 241 NVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSYNILLRSLCYEGRWEEANVLLAEMDG 300
           NVLLTGLCKEGRTEDA+QLFRELPSKGFSPNVVSYNILLRSLC EGRWEEANVLLAEMDG
Sbjct: 241 NVLLTGLCKEGRTEDAMQLFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMDG 300

Query: 301 DDRSPSIVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360
           D+RSPS VTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL
Sbjct: 301 DERSPSTVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360

Query: 361 VVKCLDQMIYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITS 420
           VVKCLDQM+YRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQH STQEFYKIVITS
Sbjct: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHFSTQEFYKIVITS 420

Query: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENNYRPD 480
           LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGML+EAIEIFSVMEE N + D
Sbjct: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEE-NIKLD 480

Query: 481 TENYNSLILGCCKSRRTDLALEVFEIMVDKGYLPNETTYTILVEGIVHEKEIDLATKVLR 540
           TENYNSLILGCCKSRRTDLAL+VFEIMV KGYL NETTYTILVEGI+HEKE+DLAT+VLR
Sbjct: 481 TENYNSLILGCCKSRRTDLALDVFEIMVGKGYLANETTYTILVEGIIHEKEMDLATEVLR 540

Query: 541 ELQLRDVISQSTVERLVMQYDLNELPL 568
           ELQLRDVI+QSTVERLVMQYDLNELPL
Sbjct: 541 ELQLRDVINQSTVERLVMQYDLNELPL 566

BLAST of HG10013839 vs. TAIR 10
Match: AT1G79080.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 762.3 bits (1967), Expect = 2.8e-220
Identity = 383/577 (66.38%), Postives = 473/577 (81.98%), Query Frame = 0

Query: 1   MATLLNTVSPITNPSPETTRRGCGFFSHIPN--LHKLSLNKGFSKVLASTQITISPKDTI 60
           M+TLLN+V  + +P   + R+  GF SHIP+  LH  S++KG ++VLASTQIT+SPKD+ 
Sbjct: 1   MSTLLNSVLSMASPE-SSPRKAVGFVSHIPSGFLHFSSVSKGVARVLASTQITLSPKDSA 60

Query: 61  FTL------PNWRTGKV--DQKSRELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKA 120
           FT+      P+  +G    D +S E  L+D+F HLE +V  G KP+V  +TQLLYDLCKA
Sbjct: 61  FTITGSSWKPDLDSGSFSDDPRSDEPNLSDSFSHLESLVTGGHKPNVAHSTQLLYDLCKA 120

Query: 121 CKLRKAIKVMEMMIGSGIIPDASSYTFLVSSLCKKGNVGYAMQLVDKMEEYGYPTNTVTY 180
            +L+KAI+V+E+M+ SGIIPDAS+YT+LV+ LCK+GNVGYAMQLV+KME++GYP+NTVTY
Sbjct: 121 NRLKKAIRVIELMVSSGIIPDASAYTYLVNQLCKRGNVGYAMQLVEKMEDHGYPSNTVTY 180

Query: 181 NSLVRGLCMHGNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEAIELLDEIIA 240
           N+LVRGLCM G+L QSLQ ++RL+QKGL PNA+TYSFLLEAAYKERG DEA++LLDEII 
Sbjct: 181 NALVRGLCMLGSLNQSLQFVERLMQKGLAPNAFTYSFLLEAAYKERGTDEAVKLLDEIIV 240

Query: 241 KGGKPNLVSYNVLLTGLCKEGRTEDAIQLFRELPSKGFSPNVVSYNILLRSLCYEGRWEE 300
           KGG+PNLVSYNVLLTG CKEGRT+DA+ LFRELP+KGF  NVVSYNILLR LC +GRWEE
Sbjct: 241 KGGEPNLVSYNVLLTGFCKEGRTDDAMALFRELPAKGFKANVVSYNILLRCLCCDGRWEE 300

Query: 301 ANVLLAEMDGDDRSPSIVTYNILIGSLTLHGRTEHALEVLEEMIRA--RFKPTASSYNPI 360
           AN LLAEMDG DR+PS+VTYNILI SL  HGRTE AL+VL+EM +   +F+ TA+SYNP+
Sbjct: 301 ANSLLAEMDGGDRAPSVVTYNILINSLAFHGRTEQALQVLKEMSKGNHQFRVTATSYNPV 360

Query: 361 IARLCKDRKVDLVVKCLDQMIYRHCNPNEGTYNAIATLCE-EGMVQEAFSIIQSLGNKQH 420
           IARLCK+ KVDLVVKCLD+MIYR C PNEGTYNAI +LCE    VQEAF IIQSL NKQ 
Sbjct: 361 IARLCKEGKVDLVVKCLDEMIYRRCKPNEGTYNAIGSLCEHNSKVQEAFYIIQSLSNKQK 420

Query: 421 SSTQEFYKIVITSLCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIE 480
             T +FYK VITSLCRKGNT+ AFQLLYEMT+ GF PD+ TYS+LIRGLC+EGM   A+E
Sbjct: 421 CCTHDFYKSVITSLCRKGNTFAAFQLLYEMTRCGFDPDAHTYSALIRGLCLEGMFTGAME 480

Query: 481 IFSVMEEN-NYRPDTENYNSLILGCCKSRRTDLALEVFEIMVDKGYLPNETTYTILVEGI 540
           + S+MEE+ N +P  +N+N++ILG CK RRTDLA+EVFE+MV+K  +PNETTY ILVEGI
Sbjct: 481 VLSIMEESENCKPTVDNFNAMILGLCKIRRTDLAMEVFEMMVEKKRMPNETTYAILVEGI 540

Query: 541 VHEKEIDLATKVLRELQLRDVISQSTVERLVMQYDLN 564
            HE E++LA +VL EL+LR VI Q+ V+R+VMQ++L+
Sbjct: 541 AHEDELELAKEVLDELRLRKVIGQNAVDRIVMQFNLD 576

BLAST of HG10013839 vs. TAIR 10
Match: AT1G09900.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 392.9 bits (1008), Expect = 4.4e-109
Identity = 196/487 (40.25%), Postives = 306/487 (62.83%), Query Frame = 0

Query: 71  QKSRELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKACKLRKAIKVMEMMIGSGIIP 130
           Q  R   L + F  LE MV  G  PD+   T L+   C+  K RKA K++E++ GSG +P
Sbjct: 111 QMVRTGELEEGFKFLENMVYHGNVPDIIPCTTLIRGFCRLGKTRKAAKILEILEGSGAVP 170

Query: 131 DASSYTFLVSSLCKKGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMHGNLTQSLQLL 190
           D  +Y  ++S  CK G +  A+ ++D+M       + VTYN+++R LC  G L Q++++L
Sbjct: 171 DVITYNVMISGYCKAGEINNALSVLDRM---SVSPDVVTYNTILRSLCDSGKLKQAMEVL 230

Query: 191 DRLIQKGLVPNAYTYSFLLEAAYKERGADEAIELLDEIIAKGGKPNLVSYNVLLTGLCKE 250
           DR++Q+   P+  TY+ L+EA  ++ G   A++LLDE+  +G  P++V+YNVL+ G+CKE
Sbjct: 231 DRMLQRDCYPDVITYTILIEATCRDSGVGHAMKLLDEMRDRGCTPDVVTYNVLVNGICKE 290

Query: 251 GRTEDAIQLFRELPSKGFSPNVVSYNILLRSLCYEGRWEEANVLLAEMDGDDRSPSIVTY 310
           GR ++AI+   ++PS G  PNV+++NI+LRS+C  GRW +A  LLA+M     SPS+VT+
Sbjct: 291 GRLDEAIKFLNDMPSSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGFSPSVVTF 350

Query: 311 NILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDLVVKCLDQMIY 370
           NILI  L   G    A+++LE+M +   +P + SYNP++   CK++K+D  ++ L++M+ 
Sbjct: 351 NILINFLCRKGLLGRAIDILEKMPQHGCQPNSLSYNPLLHGFCKEKKMDRAIEYLERMVS 410

Query: 371 RHCNPNEGTYNAIAT-LCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITSLCRKGNTYP 430
           R C P+  TYN + T LC++G V++A  I+  L +K  S     Y  VI  L + G T  
Sbjct: 411 RGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLAKAGKTGK 470

Query: 431 AFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENNYRPDTENYNSLIL 490
           A +LL EM      PD+ TYSSL+ GL  EG +DEAI+ F   E    RP+   +NS++L
Sbjct: 471 AIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAIKFFHEFERMGIRPNAVTFNSIML 530

Query: 491 GCCKSRRTDLALEVFEIMVDKGYLPNETTYTILVEGIVHEKEIDLATKVLRELQLRDVIS 550
           G CKSR+TD A++    M+++G  PNET+YTIL+EG+ +E     A ++L EL  + ++ 
Sbjct: 531 GLCKSRQTDRAIDFLVFMINRGCKPNETSYTILIEGLAYEGMAKEALELLNELCNKGLMK 590

Query: 551 QSTVERL 557
           +S+ E++
Sbjct: 591 KSSAEQV 594

BLAST of HG10013839 vs. TAIR 10
Match: AT3G04760.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 342.4 bits (877), Expect = 6.8e-94
Identity = 178/479 (37.16%), Postives = 276/479 (57.62%), Query Frame = 0

Query: 85  LEFMVEKGQKPDVFQATQLLYDLCKACKLRKAIKVMEMMIGSGIIPDASSYTFLVSSLCK 144
           LE MV KG  PDV   T+L+        + KA++VME++   G  PD  +Y  L++  CK
Sbjct: 112 LETMVRKGYNPDVILCTKLIKGFFTLRNIPKAVRVMEILEKFG-QPDVFAYNALINGFCK 171

Query: 145 KGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMHGNLTQSLQLLDRLIQKGLVPNAYT 204
              +  A +++D+M    +  +TVTYN ++  LC  G L  +L++L++L+     P   T
Sbjct: 172 MNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALKVLNQLLSDNCQPTVIT 231

Query: 205 YSFLLEAAYKERGADEAIELLDEIIAKGGKPNLVSYNVLLTGLCKEGRTEDAIQLFRELP 264
           Y+ L+EA   E G DEA++L+DE++++G KP++ +YN ++ G+CKEG  + A ++ R L 
Sbjct: 232 YTILIEATMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGMCKEGMVDRAFEMVRNLE 291

Query: 265 SKGFSPNVVSYNILLRSLCYEGRWEEANVLLAEMDGDDRSPSIVTYNILIGSLTLHGRTE 324
            KG  P+V+SYNILLR+L  +G+WEE   L+ +M  +   P++VTY+ILI +L   G+ E
Sbjct: 292 LKGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNVVTYSILITTLCRDGKIE 351

Query: 325 HALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDLVVKCLDQMIYRHCNPNEGTYNAI- 384
            A+ +L+ M      P A SY+P+IA  C++ ++D+ ++ L+ MI   C P+   YN + 
Sbjct: 352 EAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLETMISDGCLPDIVNYNTVL 411

Query: 385 ATLCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITSLCRKGNTYPAFQLLYEMTKYGFT 444
           ATLC+ G   +A  I   LG    S     Y  + ++L   G+   A  ++ EM   G  
Sbjct: 412 ATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGDKIRALHMILEMMSNGID 471

Query: 445 PDSFTYSSLIRGLCMEGMLDEAIEIFSVMEENNYRPDTENYNSLILGCCKSRRTDLALEV 504
           PD  TY+S+I  LC EGM+DEA E+   M    + P    YN ++LG CK+ R + A+ V
Sbjct: 472 PDEITYNSMISCLCREGMVDEAFELLVDMRSCEFHPSVVTYNIVLLGFCKAHRIEDAINV 531

Query: 505 FEIMVDKGYLPNETTYTILVEGIVHEKEIDLATKVLRELQLRDVISQSTVERLVMQYDL 563
            E MV  G  PNETTYT+L+EGI        A ++  +L   D IS+ + +RL   + L
Sbjct: 532 LESMVGNGCRPNETTYTVLIEGIGFAGYRAEAMELANDLVRIDAISEYSFKRLHRTFPL 589

BLAST of HG10013839 vs. TAIR 10
Match: AT1G12620.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 260.4 bits (664), Expect = 3.4e-69
Identity = 135/454 (29.74%), Postives = 239/454 (52.64%), Query Frame = 0

Query: 91  KGQKPDVFQATQLLYDLCKACKLRKAIKVMEMMIGSGIIPDASSYTFLVSSLCKKGNVGY 150
           KG   +++  + ++   C+  KL  A   M  +I  G  PD  +++ L++ LC +G V  
Sbjct: 101 KGIAHNLYTLSIMINCCCRCRKLSLAFSAMGKIIKLGYEPDTVTFSTLINGLCLEGRVSE 160

Query: 151 AMQLVDKMEEYGYPTNTVTYNSLVRGLCMHGNLTQSLQLLDRLIQKGLVPNAYTYSFLLE 210
           A++LVD+M E G+    +T N+LV GLC++G ++ ++ L+DR+++ G  PN  TY  +L+
Sbjct: 161 ALELVDRMVEMGHKPTLITLNALVNGLCLNGKVSDAVLLIDRMVETGFQPNEVTYGPVLK 220

Query: 211 AAYKERGADEAIELLDEIIAKGGKPNLVSYNVLLTGLCKEGRTEDAIQLFRELPSKGFSP 270
              K      A+ELL ++  +  K + V Y++++ GLCK+G  ++A  LF E+  KGF  
Sbjct: 221 VMCKSGQTALAMELLRKMEERKIKLDAVKYSIIIDGLCKDGSLDNAFNLFNEMEIKGFKA 280

Query: 271 NVVSYNILLRSLCYEGRWEEANVLLAEMDGDDRSPSIVTYNILIGSLTLHGRTEHALEVL 330
           +++ Y  L+R  CY GRW++   LL +M     +P +V ++ LI      G+   A E+ 
Sbjct: 281 DIIIYTTLIRGFCYAGRWDDGAKLLRDMIKRKITPDVVAFSALIDCFVKEGKLREAEELH 340

Query: 331 EEMIRARFKPTASSYNPIIARLCKDRKVDLVVKCLDQMIYRHCNPNEGTYNAIAT-LCEE 390
           +EMI+    P   +Y  +I   CK+ ++D     LD M+ + C PN  T+N +    C+ 
Sbjct: 341 KEMIQRGISPDTVTYTSLIDGFCKENQLDKANHMLDLMVSKGCGPNIRTFNILINGYCKA 400

Query: 391 GMVQEAFSIIQSLGNKQHSSTQEFYKIVITSLCRKGNTYPAFQLLYEMTKYGFTPDSFTY 450
            ++ +   + + +  +   +    Y  +I   C  G    A +L  EM      PD  +Y
Sbjct: 401 NLIDDGLELFRKMSLRGVVADTVTYNTLIQGFCELGKLEVAKELFQEMVSRRVRPDIVSY 460

Query: 451 SSLIRGLCMEGMLDEAIEIFSVMEENNYRPDTENYNSLILGCCKSRRTDLALEVFEIMVD 510
             L+ GLC  G  ++A+EIF  +E++    D   YN +I G C + + D A ++F  +  
Sbjct: 461 KILLDGLCDNGEPEKALEIFEKIEKSKMELDIGIYNIIIHGMCNASKVDDAWDLFCSLPL 520

Query: 511 KGYLPNETTYTILVEGIVHEKEIDLATKVLRELQ 544
           KG  P+  TY I++ G+  +  +  A  + R+++
Sbjct: 521 KGVKPDVKTYNIMIGGLCKKGSLSEADLLFRKME 554

BLAST of HG10013839 vs. TAIR 10
Match: AT1G12300.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 258.5 bits (659), Expect = 1.3e-68
Identity = 135/454 (29.74%), Postives = 240/454 (52.86%), Query Frame = 0

Query: 91  KGQKPDVFQATQLLYDLCKACKLRKAIKVMEMMIGSGIIPDASSYTFLVSSLCKKGNVGY 150
           KG   +++  + ++   C+  KL  A   M  +I  G  P+  +++ L++ LC +G V  
Sbjct: 117 KGIAHNLYTLSIMINCFCRCRKLCLAFSAMGKIIKLGYEPNTITFSTLINGLCLEGRVSE 176

Query: 151 AMQLVDKMEEYGYPTNTVTYNSLVRGLCMHGNLTQSLQLLDRLIQKGLVPNAYTYSFLLE 210
           A++LVD+M E G+  + +T N+LV GLC+ G   +++ L+D++++ G  PNA TY  +L 
Sbjct: 177 ALELVDRMVEMGHKPDLITINTLVNGLCLSGKEAEAMLLIDKMVEYGCQPNAVTYGPVLN 236

Query: 211 AAYKERGADEAIELLDEIIAKGGKPNLVSYNVLLTGLCKEGRTEDAIQLFRELPSKGFSP 270
              K      A+ELL ++  +  K + V Y++++ GLCK G  ++A  LF E+  KG + 
Sbjct: 237 VMCKSGQTALAMELLRKMEERNIKLDAVKYSIIIDGLCKHGSLDNAFNLFNEMEMKGITT 296

Query: 271 NVVSYNILLRSLCYEGRWEEANVLLAEMDGDDRSPSIVTYNILIGSLTLHGRTEHALEVL 330
           N+++YNIL+   C  GRW++   LL +M     +P++VT+++LI S    G+   A E+ 
Sbjct: 297 NIITYNILIGGFCNAGRWDDGAKLLRDMIKRKINPNVVTFSVLIDSFVKEGKLREAEELH 356

Query: 331 EEMIRARFKPTASSYNPIIARLCKDRKVDLVVKCLDQMIYRHCNPNEGTYNAIAT-LCEE 390
           +EMI     P   +Y  +I   CK+  +D   + +D M+ + C+PN  T+N +    C+ 
Sbjct: 357 KEMIHRGIAPDTITYTSLIDGFCKENHLDKANQMVDLMVSKGCDPNIRTFNILINGYCKA 416

Query: 391 GMVQEAFSIIQSLGNKQHSSTQEFYKIVITSLCRKGNTYPAFQLLYEMTKYGFTPDSFTY 450
             + +   + + +  +   +    Y  +I   C  G    A +L  EM      P+  TY
Sbjct: 417 NRIDDGLELFRKMSLRGVVADTVTYNTLIQGFCELGKLNVAKELFQEMVSRKVPPNIVTY 476

Query: 451 SSLIRGLCMEGMLDEAIEIFSVMEENNYRPDTENYNSLILGCCKSRRTDLALEVFEIMVD 510
             L+ GLC  G  ++A+EIF  +E++    D   YN +I G C + + D A ++F  +  
Sbjct: 477 KILLDGLCDNGESEKALEIFEKIEKSKMELDIGIYNIIIHGMCNASKVDDAWDLFCSLPL 536

Query: 511 KGYLPNETTYTILVEGIVHEKEIDLATKVLRELQ 544
           KG  P   TY I++ G+  +  +  A  + R+++
Sbjct: 537 KGVKPGVKTYNIMIGGLCKKGPLSEAELLFRKME 570

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038899825.10.0e+0098.06pentatricopeptide repeat-containing protein At1g79080, chloroplastic [Benincasa ... [more]
XP_022141778.10.0e+0094.53pentatricopeptide repeat-containing protein At1g79080, chloroplastic [Momordica ... [more]
XP_008444287.10.0e+0094.89PREDICTED: pentatricopeptide repeat-containing protein At1g79080, chloroplastic ... [more]
XP_011653982.10.0e+0094.53pentatricopeptide repeat-containing protein At1g79080, chloroplastic [Cucumis sa... [more]
KAG6607968.15.6e-30892.95Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a... [more]
Match NameE-valueIdentityDescription
A3KPF83.9e-21966.38Pentatricopeptide repeat-containing protein At1g79080, chloroplastic OS=Arabidop... [more]
Q3EDF86.2e-10840.25Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana OX... [more]
Q9SR009.6e-9337.16Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidop... [more]
Q9ASZ84.8e-6829.74Pentatricopeptide repeat-containing protein At1g12620 OS=Arabidopsis thaliana OX... [more]
Q0WKV31.8e-6729.74Pentatricopeptide repeat-containing protein At1g12300, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A6J1CLJ30.0e+0094.53pentatricopeptide repeat-containing protein At1g79080, chloroplastic OS=Momordic... [more]
A0A5A7U8T10.0e+0094.89Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
E5GBB30.0e+0094.89Pentatricopeptide repeat-containing protein OS=Cucumis melo subsp. melo OX=41267... [more]
A0A1S3BA040.0e+0094.89pentatricopeptide repeat-containing protein At1g79080, chloroplastic OS=Cucumis ... [more]
A0A0A0L2W80.0e+0094.53Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G613170 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G79080.12.8e-22066.38Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G09900.14.4e-10940.25Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT3G04760.16.8e-9437.16Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT1G12620.13.4e-6929.74Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G12300.11.3e-6829.74Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 273..306
e-value: 8.4E-6
score: 23.6
coord: 447..481
e-value: 3.4E-9
score: 34.3
coord: 238..272
e-value: 2.9E-10
score: 37.7
coord: 344..377
e-value: 3.0E-4
score: 18.8
coord: 168..202
e-value: 2.8E-7
score: 28.3
coord: 308..341
e-value: 1.8E-8
score: 32.1
coord: 107..132
e-value: 7.9E-4
score: 17.4
coord: 484..516
e-value: 1.2E-7
score: 29.4
coord: 134..164
e-value: 7.1E-5
score: 20.7
coord: 414..446
e-value: 0.0013
score: 16.8
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 479..526
e-value: 2.9E-12
score: 46.6
coord: 340..383
e-value: 5.2E-8
score: 33.0
coord: 235..283
e-value: 2.6E-16
score: 59.6
coord: 166..211
e-value: 2.1E-10
score: 40.7
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 304..333
e-value: 5.2E-7
score: 29.3
coord: 127..159
e-value: 4.9E-9
score: 35.8
coord: 441..473
e-value: 7.4E-12
score: 44.8
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 341..375
score: 9.076014
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 166..200
score: 12.506901
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 236..270
score: 13.888024
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 445..479
score: 13.570147
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 96..130
score: 8.714292
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 306..340
score: 11.684803
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 201..235
score: 9.624079
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 131..165
score: 10.731171
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 410..444
score: 9.492543
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 271..305
score: 11.081932
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 480..514
score: 12.704205
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 64..216
e-value: 5.6E-33
score: 116.6
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 217..318
e-value: 1.2E-29
score: 105.0
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 474..546
e-value: 4.6E-19
score: 70.7
coord: 334..405
e-value: 2.5E-11
score: 45.4
coord: 406..473
e-value: 5.5E-16
score: 60.6
NoneNo IPR availablePANTHERPTHR45613PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 1..565
NoneNo IPR availablePANTHERPTHR45613:SF297PPR CONTAINING PLANT-LIKE PROTEINcoord: 1..565
NoneNo IPR availableSUPERFAMILY81901HCP-likecoord: 75..269

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10013839.1HG10013839.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding