HG10004783 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10004783
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pentatricopeptide repeat-containing protein
Location: Chr08: 20368501 .. 20370894 (-)
RNA-Seq Expression: HG10004783
Synteny: HG10004783
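
The Location field packs chromosome, start, end, and strand into one string. A minimal Python sketch of splitting it apart is shown here. It assumes the coordinates are 1-based and inclusive, which the page does not state explicitly; `Location` and `parse_location` are hypothetical names used only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Genomic interval as shown in the Overview (hypothetical helper type)."""
    chrom: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings such as "Chr08: 20368501 .. 20370894 (-)".
_LOC_RE = re.compile(r"^\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text: str) -> Location:
    """Parse a 'Chrom: start .. end (strand)' string into a Location."""
    match = _LOC_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return Location(chrom, int(start), int(end), strand)


loc = parse_location("Chr08: 20368501 .. 20370894 (-)")
print(loc.chrom, loc.start, loc.end, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 2394 bp on the minus strand of Chr08.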
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGAAATGCCTTGGACGTCCGTATTTTGGCGAACAGGTATGCCGCACAATTGCAACTCTGTGCCCCACAAAATCCCTCTTCATTTTCACTTGCTCGGACTGTTCATGCCCACATGATTGCTTCGGGATTCAAGCCTCGTGGGCACCTTGTCAATCGACTACTTGATATATACTGGAAATCGTCGAATTTTATTTATGCCCGCCAACTGTTCGACGAAATTCCCCACCCAGATGCTGTAGCGAGAACTACATTGATTACGGCGTACTCTGCGTTGGGGAATTTGAATATGGCTAGAGAAATATTCAATGGAACTCCATTGAATATGAGGGACACTATTTTCTACAATGCAATGATTACTGGGTATTCGCACAAGGATGATGGGTATTCTGCTATTGAATTGTTTCATGCTATGAGACGGGCCAATTTTCAGCCTGATGACTTTACATTTACTAGTGTGCTCAGTGCTTTAGCGTTGATTGTTGATGGTGAGCACCAGTGTGGCCAAATGCATGGTGCAGTGGTGAAATTTGGAGTTGGACTTATTTCTTCAGTGTTGAACGCTCTTCTATCTGTTTATGTTAAGTGTGCTTCTTCACCCTTGGTGTCATCGTCGTCATTGATGGCATCAGCTAGGAAACTGTTTGATGAAATGCCAGAGAGGGATGAGTTGACATGGACGACTTTGATTACTGGGTATGTGCGGAATGATGATCTAACTGGGGCACGTGAACTTCTTGACACAATGACTGAACCTCTAGGTGTAGCATGGAATGCCATGATCTCTGGATATGTGCATCATAGTCTTTTCGAGGATGCCTTGACATTGTTTAGGAAAATGCGTTTGCTTGGTGTCCAGCACGATGAGTTCACCTATACGAGCGTGATCAGTGCTTGCGCTAATGGTGGTTTTTTTCAACTGGGAAAACAGGTACATGCTTACATTTTGAAAAATGAGATGAACCCAAATCATGATTTTTTATTGTCTGTGAGTAATGCATTGATTACTTTGTACTGGAAATATGGTAAAGTTGATGGGGCACGGAAGATTTTTTATGAGATGCCAGTTAAAGATATCGTTACTTGGAATGCAATCCTATCGGGCTATGTGAATGCAGGGCGTATGGAAGACGCAAAATATTTTTTTGCACAAATGCCGGAGAAAAACCTTCTTACATGGACTGTGATGATTTCAGGACTAGCCCAAAATGGATTTGGGGAAGAGGGTTTGAAGCTGTTTAACCAAATGAGGTTAGATGGCTATGAACCTTGTGATTATGCATTTGCAGGTGTGGTCACAGCTTGTTCTGTGCTTGGAGCATTGGAGAATGGTCGTCAACTCCATGCTCAGATTGTTCATCTCGGCCACGATTCAAGCCTCTCAGTTGGCAATGCAATGATCTCAATGTATGCAAGATGTGGAGTGGTTGAAGCTGCCAAATCTGTGTTTCTAACCATGCCTTTTGTGGACTCTGTTTCATGGAATGCGATGATTGCAGCACTGGGACAGCACGGACATGGCGTAAAAGCAATTGAACTATATGAACAAATGTTGAAAGAGGGTATACTCCCTGATAGAATAACGTTTCTTACTGTTCTCTCTGCGTGTAGTCATGCAGGTCTGATTGAAGAAGGACGCCACTATTTTAATTCAATGCTCGAAACTTATGGTATCACCCCGGGCGAGGATCATTATGCTCGGATGATCGATTTGTTTTGTCGAGCTGGGAAGTTTTCAGATGCAAAGAATGTCATTGATTCCATGCCTTGTGAAGCTGGGGCACCAATTTGGGAGGCTCTTCTTGCTGGTTGTCGGATTCATGGAAACATGGACTTGGGAGTAGAAGCTGCTGAAAAGCTTTTCAAGCTAATACCGCAACACGATGGAACCTATATACTTTTATCAAACATGTACGCCAATGTTGGGTGGTGGAACGATGTTGCTAGGACGCGAAAACTAATGCGGGATCGAGGGGTTAAAAAGGAGCCTGCTTGTAGTTG
GACCGAGGTTGAGAACAAGGTTCATGTGTTCTTGGTAGATGATACAGTGCACCCTGAGATACAATCTGTGTACAACTATCTAGAGAAGTTGAACCTTGAAATGAAGAAATTAGGATATATTCCAGACACAAAGTATGTGCTACATGATATGGAATCTGAACATAAAGAATATGCCTTATCTACTCACAGTGAGAAGCTTGCAGTTGGGTTTGGGCTAATGAAGCTCCCTGAAGGTGCCACTGTCAGGGTTTTCAAGAACCTTAGGATATGTGGAGATTGTCACAATGCATTCAAGTTCATGTCCAAAGTGGTGAAGAGGGAGATAGTAGTGAGAGATGGAAAGAGGTTTCATCATTTCAAAAATGGCGAATGCTCATGTGGTAATTATTGGTAA

mRNA sequence

ATGAGAAATGCCTTGGACGTCCGTATTTTGGCGAACAGGTATGCCGCACAATTGCAACTCTGTGCCCCACAAAATCCCTCTTCATTTTCACTTGCTCGGACTGTTCATGCCCACATGATTGCTTCGGGATTCAAGCCTCGTGGGCACCTTGTCAATCGACTACTTGATATATACTGGAAATCGTCGAATTTTATTTATGCCCGCCAACTGTTCGACGAAATTCCCCACCCAGATGCTGTAGCGAGAACTACATTGATTACGGCGTACTCTGCGTTGGGGAATTTGAATATGGCTAGAGAAATATTCAATGGAACTCCATTGAATATGAGGGACACTATTTTCTACAATGCAATGATTACTGGGTATTCGCACAAGGATGATGGGTATTCTGCTATTGAATTGTTTCATGCTATGAGACGGGCCAATTTTCAGCCTGATGACTTTACATTTACTAGTGTGCTCAGTGCTTTAGCGTTGATTGTTGATGGTGAGCACCAGTGTGGCCAAATGCATGGTGCAGTGGTGAAATTTGGAGTTGGACTTATTTCTTCAGTGTTGAACGCTCTTCTATCTGTTTATGTTAAGTGTGCTTCTTCACCCTTGGTGTCATCGTCGTCATTGATGGCATCAGCTAGGAAACTGTTTGATGAAATGCCAGAGAGGGATGAGTTGACATGGACGACTTTGATTACTGGGTATGTGCGGAATGATGATCTAACTGGGGCACGTGAACTTCTTGACACAATGACTGAACCTCTAGGTGTAGCATGGAATGCCATGATCTCTGGATATGTGCATCATAGTCTTTTCGAGGATGCCTTGACATTGTTTAGGAAAATGCGTTTGCTTGGTGTCCAGCACGATGAGTTCACCTATACGAGCGTGATCAGTGCTTGCGCTAATGGACTAGCCCAAAATGGATTTGGGGAAGAGGGTTTGAAGCTGTTTAACCAAATGAGGTTAGATGGCTATGAACCTTGTGATTATGCATTTGCAGGTGTGGTCACAGCTTGTTCTGTGCTTGGAGCATTGGAGAATGGTCGTCAACTCCATGCTCAGATTGTTCATCTCGGCCACGATTCAAGCCTCTCAGTTGGCAATGCAATGATCTCAATTGAGAAGCTTGCAGTTGGGTTTGGGCTAATGAAGCTCCCTGAAGGTGCCACTGTCAGGGTTTTCAAGAACCTTAGGATATGTGGAGATTGTCACAATGCATTCAAGTTCATGTCCAAAGTGGTGAAGAGGGAGATAGTAGTGAGAGATGGAAAGAGGTTTCATCATTTCAAAAATGGCGAATGCTCATGTGGTAATTATTGGTAA

Coding sequence (CDS)

ATGAGAAATGCCTTGGACGTCCGTATTTTGGCGAACAGGTATGCCGCACAATTGCAACTCTGTGCCCCACAAAATCCCTCTTCATTTTCACTTGCTCGGACTGTTCATGCCCACATGATTGCTTCGGGATTCAAGCCTCGTGGGCACCTTGTCAATCGACTACTTGATATATACTGGAAATCGTCGAATTTTATTTATGCCCGCCAACTGTTCGACGAAATTCCCCACCCAGATGCTGTAGCGAGAACTACATTGATTACGGCGTACTCTGCGTTGGGGAATTTGAATATGGCTAGAGAAATATTCAATGGAACTCCATTGAATATGAGGGACACTATTTTCTACAATGCAATGATTACTGGGTATTCGCACAAGGATGATGGGTATTCTGCTATTGAATTGTTTCATGCTATGAGACGGGCCAATTTTCAGCCTGATGACTTTACATTTACTAGTGTGCTCAGTGCTTTAGCGTTGATTGTTGATGGTGAGCACCAGTGTGGCCAAATGCATGGTGCAGTGGTGAAATTTGGAGTTGGACTTATTTCTTCAGTGTTGAACGCTCTTCTATCTGTTTATGTTAAGTGTGCTTCTTCACCCTTGGTGTCATCGTCGTCATTGATGGCATCAGCTAGGAAACTGTTTGATGAAATGCCAGAGAGGGATGAGTTGACATGGACGACTTTGATTACTGGGTATGTGCGGAATGATGATCTAACTGGGGCACGTGAACTTCTTGACACAATGACTGAACCTCTAGGTGTAGCATGGAATGCCATGATCTCTGGATATGTGCATCATAGTCTTTTCGAGGATGCCTTGACATTGTTTAGGAAAATGCGTTTGCTTGGTGTCCAGCACGATGAGTTCACCTATACGAGCGTGATCAGTGCTTGCGCTAATGGACTAGCCCAAAATGGATTTGGGGAAGAGGGTTTGAAGCTGTTTAACCAAATGAGGTTAGATGGCTATGAACCTTGTGATTATGCATTTGCAGGTGTGGTCACAGCTTGTTCTGTGCTTGGAGCATTGGAGAATGGTCGTCAACTCCATGCTCAGATTGTTCATCTCGGCCACGATTCAAGCCTCTCAGTTGGCAATGCAATGATCTCAATTGAGAAGCTTGCAGTTGGGTTTGGGCTAATGAAGCTCCCTGAAGGTGCCACTGTCAGGGTTTTCAAGAACCTTAGGATATGTGGAGATTGTCACAATGCATTCAAGTTCATGTCCAAAGTGGTGAAGAGGGAGATAGTAGTGAGAGATGGAAAGAGGTTTCATCATTTCAAAAATGGCGAATGCTCATGTGGTAATTATTGGTAA

Protein sequence

MRNALDVRILANRYAAQLQLCAPQNPSSFSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLFDEIPHPDAVARTTLITAYSALGNLNMAREIFNGTPLNMRDTIFYNAMITGYSHKDDGYSAIELFHAMRRANFQPDDFTFTSVLSALALIVDGEHQCGQMHGAVVKFGVGLISSVLNALLSVYVKCASSPLVSSSSLMASARKLFDEMPERDELTWTTLITGYVRNDDLTGARELLDTMTEPLGVAWNAMISGYVHHSLFEDALTLFRKMRLLGVQHDEFTYTSVISACANGLAQNGFGEEGLKLFNQMRLDGYEPCDYAFAGVVTACSVLGALENGRQLHAQIVHLGHDSSLSVGNAMISIEKLAVGFGLMKLPEGATVRVFKNLRICGDCHNAFKFMSKVVKREIVVRDGKRFHHFKNGECSCGNYW
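
The protein sequence above corresponds to the CDS read codon by codon. The following Python sketch shows one way to check that relationship. It assumes the standard nuclear genetic code (NCBI translation table 1), which the page does not name; `translate` is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        # Codons containing ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First seven codons and last four codons of the CDS listed above.
print(translate("ATGAGAAATGCCTTGGACGTC"))
print(translate("AATTATTGGTAA"))
```

The two fragments translate to MRNALDV and NYW, matching the start and end of the protein sequence. Passing the full CDS string to `translate` gives the complete comparison.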
Homology
BLAST of HG10004783 vs. NCBI nr
Match: XP_038886633.1 (pentatricopeptide repeat-containing protein At1g25360-like [Benincasa hispida])

HSP 1 Score: 711.1 bits (1834), Expect = 6.1e-201
Identity = 415/797 (52.07%), Positives = 430/797 (53.95%), Query Frame = 0

Query: 1   MRNALDVRILANRYAAQLQLCAPQNPSSFSLARTVHAHMIASGFKPRGHLVNRLLDIYWK 60
           MRNALDVR+LANRYAAQLQLC PQNPSS+SLARTVHAHMIASGFKPRGHLVNRLLDIYWK
Sbjct: 1   MRNALDVRVLANRYAAQLQLCCPQNPSSYSLARTVHAHMIASGFKPRGHLVNRLLDIYWK 60

Query: 61  SSNFIYARQLFDEIPHPDAVARTTLITAYSALGNLNMAREIFNGTPLNMRDTIFYNAMIT 120
           SS+F+ ARQLFDEIPHPDAVARTTLI+AYSALGNLNMAR+IFNGTPLNMRDTIFYNAMIT
Sbjct: 61  SSDFVCARQLFDEIPHPDAVARTTLISAYSALGNLNMARDIFNGTPLNMRDTIFYNAMIT 120

Query: 121 GYSHKDDGYSAIELFHAMRRANFQPDDFTFTSVLSALALIVDGEHQCGQMHGAVVKFGVG 180
            YSHKDDG+SAIELFHAMRRANFQPDDFTFTSVLSALALIVD EHQCGQMHGAVVKFG+G
Sbjct: 121 AYSHKDDGHSAIELFHAMRRANFQPDDFTFTSVLSALALIVDDEHQCGQMHGAVVKFGIG 180

Query: 181 LISSVLNALLSVYVKCASSPLVSSSSLMASARKLFDEMPERDELTWTTLITGYVRNDDLT 240
           L+SSVLNALLSVYVKCASSPLVSSSSLMASARKLFDEMPERDELTWTTLITGYVRNDDLT
Sbjct: 181 LVSSVLNALLSVYVKCASSPLVSSSSLMASARKLFDEMPERDELTWTTLITGYVRNDDLT 240

Query: 241 GARELLDTMTEPLGVAWNAMISGYVHHSLFEDALTLFRKMRLLGVQHDEFTYTSVISACA 300
           GARELLDTMTE L VAWNAMISGYVHH LFEDALTLFRKMRLLGVQHDEFTYTSVISACA
Sbjct: 241 GARELLDTMTEHLSVAWNAMISGYVHHGLFEDALTLFRKMRLLGVQHDEFTYTSVISACA 300

Query: 301 N----------------------------------------------------------- 360
           N                                                           
Sbjct: 301 NGGFFQLGKEVHAYILKNELNPNHDFLLSVSNALITLYWKYGKVDGARKIFYEMPVKDIV 360

Query: 361 --------------------------------------GLAQNGFGEEGLKLFNQMRLDG 420
                                                 GLAQNGFGEE LKLFNQMRLDG
Sbjct: 361 SWNAILSGYVNAGRMEDAKSFFAQMPEKCLLTWTVIISGLAQNGFGEESLKLFNQMRLDG 420

Query: 421 YEPCDYAFAGVVTACSVLGALENGRQLHAQIVHLGHDSSLSVGNAMISI----------- 440
           YEPCDYAFAG +TACSVLGALENGRQLHAQIVHLGH+SSLSVGNAMIS+           
Sbjct: 421 YEPCDYAFAGAITACSVLGALENGRQLHAQIVHLGHNSSLSVGNAMISMYARCGVVEAAR 480
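
The percentages in each HSP summary line are simple ratios over the alignment length, which includes gap columns. That is why a hit whose aligned blocks look nearly identical can still report roughly 50% identity when the alignment contains long gaps. A small Python sketch that recomputes the two percentages from a summary line follows; `hsp_percentages` is a hypothetical helper, and the pattern assumes the line layout used on this page.

```python
import re

# Accepts "Positives" as well as the "Postives" spelling that some
# BLAST report renderers emit.
_HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def hsp_percentages(line: str) -> tuple[float, float]:
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = _HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = (int(g) for g in match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


line = "Identity = 415/797 (52.07%), Positives = 430/797 (53.95%), Query Frame = 0"
print(hsp_percentages(line))
```

For the first hit above this reproduces the reported 52.07% identity and 53.95% positives.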

BLAST of HG10004783 vs. NCBI nr
Match: XP_023537093.1 (pentatricopeptide repeat-containing protein At1g25360-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 695.3 bits (1793), Expect = 3.4e-196
Identity = 400/766 (52.22%), Positives = 420/766 (54.83%), Query Frame = 0

Query: 1   MRNALDVRILANRYAAQLQLCAPQNPSSFSLARTVHAHMIASGFKPRGHLVNRLLDIYWK 60
           MRNA+DVR+LANRYAAQLQLC PQNPSSFSLARTVHAHMI SGFKPRGHLVNRLLDIYWK
Sbjct: 1   MRNAIDVRVLANRYAAQLQLCCPQNPSSFSLARTVHAHMIVSGFKPRGHLVNRLLDIYWK 60

Query: 61  SSNFIYARQLFDEIPHPDAVARTTLITAYSALGNLNMAREIFNGTPLNMRDTIFYNAMIT 120
           SSN +YARQLFDEIP+PDAVARTTLITAYS LGNLNMAREIFNGTPLNMRDTIFYNAMIT
Sbjct: 61  SSNLVYARQLFDEIPNPDAVARTTLITAYSNLGNLNMAREIFNGTPLNMRDTIFYNAMIT 120

Query: 121 GYSHKDDGYSAIELFHAMRRANFQPDDFTFTSVLSALALIVDGEHQCGQMHGAVVKFGVG 180
           G+SH  DG+SAI LFHAMRR+NF+PDDFTFTSVLSALALIVD E QCGQMHGAVVK G G
Sbjct: 121 GFSHNVDGHSAIGLFHAMRRSNFRPDDFTFTSVLSALALIVDNEQQCGQMHGAVVKSGTG 180

Query: 181 LISSVLNALLSVYVKCASSPLVSSSSLMASARKLFDEMPERDELTWTTLITGYVRNDDLT 240
           L+SSVLNALLSVYVKCASSPLVSSSSLMASARKLFDEMP+RDELTWTTLITGYVRNDDL 
Sbjct: 181 LVSSVLNALLSVYVKCASSPLVSSSSLMASARKLFDEMPQRDELTWTTLITGYVRNDDLN 240

Query: 241 GARELLDTMTEPLGVAWNAMISGYVHHSLFEDALTLFRKMRLLGVQHDEFTYTSVISACA 300
           GARELLDTMTE LGVAWNAMISGYVHH LFEDALTLFRKMR LGV+ DEFTYTSVISACA
Sbjct: 241 GARELLDTMTEKLGVAWNAMISGYVHHGLFEDALTLFRKMRFLGVELDEFTYTSVISACA 300

Query: 301 N----------------------------------------------------------- 360
           N                                                           
Sbjct: 301 NGGFFQLGKELHAYILKNELNPNHDFLLSVSNSLITLYWKYGRMEEAKSFFAQMPEKSLL 360

Query: 361 -------GLAQNGFGEEGLKLFNQMRLDGYEPCDYAFAGVVTACSVLGALENGRQLHAQI 420
                  GLAQNGFGEEGL LFN+MRLDGYEPCDYAFAG +TACSVLG+LENGRQLHAQ+
Sbjct: 361 TWTVMISGLAQNGFGEEGLNLFNRMRLDGYEPCDYAFAGAITACSVLGSLENGRQLHAQL 420

Query: 421 VHLGHDSSLSVGNAMISI------------------------------------------ 440
           +HLGHDSSLS+GNAMIS+                                          
Sbjct: 421 IHLGHDSSLSIGNAMISMYARCGVVEAARSVFLTMPFVDSVSWNAMIAALGQHGHGVKAT 480

BLAST of HG10004783 vs. NCBI nr
Match: XP_022951057.1 (pentatricopeptide repeat-containing protein At1g25360-like [Cucurbita moschata])

HSP 1 Score: 683.3 bits (1762), Expect = 1.4e-192
Identity = 400/797 (50.19%), Positives = 420/797 (52.70%), Query Frame = 0

Query: 1   MRNALDVRILANRYAAQLQLCAPQNPSSFSLARTVHAHMIASGFKPRGHLVNRLLDIYWK 60
           MRNA+DVR+LANRYAAQLQLC PQNPSSFSLARTVHAHMI SGFKPRGHLVNRLLDIYWK
Sbjct: 1   MRNAIDVRVLANRYAAQLQLCCPQNPSSFSLARTVHAHMIVSGFKPRGHLVNRLLDIYWK 60

Query: 61  SSNFIYARQLFDEIPHPDAVARTTLITAYSALGNLNMAREIFNGTPLNMRDTIFYNAMIT 120
           SSN +YARQLFDEIP+PDAVARTTLITAYS LGNLNMAREIFNGTPLNMRDTIFYNAMIT
Sbjct: 61  SSNLVYARQLFDEIPNPDAVARTTLITAYSNLGNLNMAREIFNGTPLNMRDTIFYNAMIT 120

Query: 121 GYSHKDDGYSAIELFHAMRRANFQPDDFTFTSVLSALALIVDGEHQCGQMHGAVVKFGVG 180
           G+SH  DG+SAI LFHAMRR+NF+PDDFTFTSVLSALALIVD E QCGQMHGAVVK G G
Sbjct: 121 GFSHNVDGHSAIGLFHAMRRSNFRPDDFTFTSVLSALALIVDNEQQCGQMHGAVVKSGTG 180

Query: 181 LISSVLNALLSVYVKCASSPLVSSSSLMASARKLFDEMPERDELTWTTLITGYVRNDDLT 240
           L+SSVLNALLSVYVKCASSPLVSSSSLMASARKLFDEMP+RDELTWTTLITGYVRNDDL 
Sbjct: 181 LVSSVLNALLSVYVKCASSPLVSSSSLMASARKLFDEMPQRDELTWTTLITGYVRNDDLN 240

Query: 241 GARELLDTMTEPLGVAWNAMISGYVHHSLFEDALTLFRKMRLLGVQHDEFTYTSVISACA 300
           GARELLDTMTE LGVAWNAMISGYVHH LFEDALTLFRKMR LGV+ DEFTYTSVISACA
Sbjct: 241 GARELLDTMTEKLGVAWNAMISGYVHHGLFEDALTLFRKMRFLGVELDEFTYTSVISACA 300

Query: 301 N----------------------------------------------------------- 360
           N                                                           
Sbjct: 301 NGGFFQLGKELHAYILKNELNPNHDFLLSVSNSLITLYWKYGKVDGARHIFYEMPVKDIV 360

Query: 361 --------------------------------------GLAQNGFGEEGLKLFNQMRLDG 420
                                                 GLAQNGFGEEGL LFN+MRLDG
Sbjct: 361 SWNAILSGYVNAGRMEEAKSFFAQMPEKNLLTWTVMISGLAQNGFGEEGLNLFNRMRLDG 420

Query: 421 YEPCDYAFAGVVTACSVLGALENGRQLHAQIVHLGHDSSLSVGNAMISI----------- 440
           YEPCDYAFAG +TACSVLG+LENGRQLHAQ++HLGHDSSLS+GNAMIS+           
Sbjct: 421 YEPCDYAFAGAITACSVLGSLENGRQLHAQLIHLGHDSSLSIGNAMISMYARCGVVEAAR 480

BLAST of HG10004783 vs. NCBI nr
Match: KAG7020472.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 682.2 bits (1759), Expect = 3.0e-192
Identity = 399/797 (50.06%), Positives = 421/797 (52.82%), Query Frame = 0

Query: 1   MRNALDVRILANRYAAQLQLCAPQNPSSFSLARTVHAHMIASGFKPRGHLVNRLLDIYWK 60
           MRNA+DVR+LANRYAAQLQLC PQNPSSFSLARTVHAHMI SGFKPRGHLVNRLLDIYWK
Sbjct: 1   MRNAIDVRVLANRYAAQLQLCCPQNPSSFSLARTVHAHMIVSGFKPRGHLVNRLLDIYWK 60

Query: 61  SSNFIYARQLFDEIPHPDAVARTTLITAYSALGNLNMAREIFNGTPLNMRDTIFYNAMIT 120
           SSN ++ARQLFDEIP+PDAVARTTLITAYS LGNLNMAREIFNGTPLNMRDTIFYNAMIT
Sbjct: 61  SSNLVHARQLFDEIPNPDAVARTTLITAYSNLGNLNMAREIFNGTPLNMRDTIFYNAMIT 120

Query: 121 GYSHKDDGYSAIELFHAMRRANFQPDDFTFTSVLSALALIVDGEHQCGQMHGAVVKFGVG 180
           G+SH  DG+SAI LFHAMRR+NF+PDDFTFTSVLSALALIVD E QCGQMHGAVVK G G
Sbjct: 121 GFSHNVDGHSAIGLFHAMRRSNFRPDDFTFTSVLSALALIVDNEQQCGQMHGAVVKSGTG 180

Query: 181 LISSVLNALLSVYVKCASSPLVSSSSLMASARKLFDEMPERDELTWTTLITGYVRNDDLT 240
           L+SSVLNALLSVYVKCASSPLVSSSSLMASARKLFDEMP+RDELTWTTLITGYVRNDDL 
Sbjct: 181 LVSSVLNALLSVYVKCASSPLVSSSSLMASARKLFDEMPQRDELTWTTLITGYVRNDDLN 240

Query: 241 GARELLDTMTEPLGVAWNAMISGYVHHSLFEDALTLFRKMRLLGVQHDEFTYTSVISACA 300
           GARELLDTMTE LGVAWNAMISGYVHH LFEDALTLFRKMRL+GV+ DEFTYTSVISACA
Sbjct: 241 GARELLDTMTEKLGVAWNAMISGYVHHGLFEDALTLFRKMRLIGVELDEFTYTSVISACA 300

Query: 301 N----------------------------------------------------------- 360
           N                                                           
Sbjct: 301 NGGFFQLGKELHAYILKNELNPNHDFLLSVSNSLITLYWKYGKVDGARHIFYEMPVKDIV 360

Query: 361 --------------------------------------GLAQNGFGEEGLKLFNQMRLDG 420
                                                 GLAQNGFGEEGL LFN+MRLDG
Sbjct: 361 SWNAILSGYVNAGRMEEAKSFFAQMPEKNLLTWTVMISGLAQNGFGEEGLNLFNRMRLDG 420

Query: 421 YEPCDYAFAGVVTACSVLGALENGRQLHAQIVHLGHDSSLSVGNAMISI----------- 440
           YEPCDYAFAG +TACSVLG+LENGRQLHAQ++HLGHDSSLS+GNAMIS+           
Sbjct: 421 YEPCDYAFAGAITACSVLGSLENGRQLHAQLIHLGHDSSLSIGNAMISMYARCGVVEAAR 480

BLAST of HG10004783 vs. NCBI nr
Match: KAG6585558.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 682.2 bits (1759), Expect = 3.0e-192
Identity = 399/797 (50.06%), Positives = 421/797 (52.82%), Query Frame = 0

Query: 1   MRNALDVRILANRYAAQLQLCAPQNPSSFSLARTVHAHMIASGFKPRGHLVNRLLDIYWK 60
           MRNA+DVR+LANRYAAQLQLC PQNPSSFSLARTVHAHMI SGFKPRGHLVNRLLDIYWK
Sbjct: 1   MRNAIDVRVLANRYAAQLQLCCPQNPSSFSLARTVHAHMIVSGFKPRGHLVNRLLDIYWK 60

Query: 61  SSNFIYARQLFDEIPHPDAVARTTLITAYSALGNLNMAREIFNGTPLNMRDTIFYNAMIT 120
           SSN ++ARQLFDEIP+PDAVARTTLITAYS LGNLNMAREIFNGTPLNMRDTIFYNAMIT
Sbjct: 61  SSNLVHARQLFDEIPNPDAVARTTLITAYSNLGNLNMAREIFNGTPLNMRDTIFYNAMIT 120

Query: 121 GYSHKDDGYSAIELFHAMRRANFQPDDFTFTSVLSALALIVDGEHQCGQMHGAVVKFGVG 180
           G+SH  DG+SAI LFHAMRR+NF+PDDFTFTSVLSALALIVD E QCGQMHGAVVK G G
Sbjct: 121 GFSHNVDGHSAIGLFHAMRRSNFRPDDFTFTSVLSALALIVDNEQQCGQMHGAVVKSGTG 180

Query: 181 LISSVLNALLSVYVKCASSPLVSSSSLMASARKLFDEMPERDELTWTTLITGYVRNDDLT 240
           L+SSVLNALLSVYVKCASSPLVSSSSLMASARKLFDEMP+RDELTWTTLITGYVRNDDL 
Sbjct: 181 LVSSVLNALLSVYVKCASSPLVSSSSLMASARKLFDEMPQRDELTWTTLITGYVRNDDLN 240

Query: 241 GARELLDTMTEPLGVAWNAMISGYVHHSLFEDALTLFRKMRLLGVQHDEFTYTSVISACA 300
           GARELLDTMTE LGVAWNAMISGYVHH LFEDALTLFRKMRL+GV+ DEFTYTSVISACA
Sbjct: 241 GARELLDTMTEKLGVAWNAMISGYVHHGLFEDALTLFRKMRLIGVELDEFTYTSVISACA 300

Query: 301 N----------------------------------------------------------- 360
           N                                                           
Sbjct: 301 NGGFFQLGKELHAYILKNELNPNHDFLLSVSNSLITLYWKYGKVDGARHIFYEMPVKDIV 360

Query: 361 --------------------------------------GLAQNGFGEEGLKLFNQMRLDG 420
                                                 GLAQNGFGEEGL LFN+MRLDG
Sbjct: 361 SWNAILSGYVNAGRMEEAKSFFAQMPEKNLLTWTVMISGLAQNGFGEEGLNLFNRMRLDG 420

Query: 421 YEPCDYAFAGVVTACSVLGALENGRQLHAQIVHLGHDSSLSVGNAMISI----------- 440
           YEPCDYAFAG +TACSVLG+LENGRQLHAQ++HLGHDSSLS+GNAMIS+           
Sbjct: 421 YEPCDYAFAGAITACSVLGSLENGRQLHAQLIHLGHDSSLSIGNAMISMYARCGVVEAAR 480

BLAST of HG10004783 vs. ExPASy Swiss-Prot
Match: Q9FRI5 (Pentatricopeptide repeat-containing protein At1g25360 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H74 PE=2 SV=1)

HSP 1 Score: 402.1 bits (1032), Expect = 7.9e-111
Identity = 257/788 (32.61%), Positives = 335/788 (42.51%), Query Frame = 0

Query: 7   VRILANRYAAQLQLCAPQNPSSFSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIY 66
           VR +ANRYAA L+LC P   +S  LAR VH ++I  GF+PR H++NRL+D+Y KSS   Y
Sbjct: 8   VRAIANRYAANLRLCLPLRRTSLQLARAVHGNIITFGFQPRAHILNRLIDVYCKSSELNY 67

Query: 67  ARQLFDEIPHPDAVARTTLITAYSALGNLNMAREIFNGTPLNMRDTIFYNAMITGYSHKD 126
           ARQLFDEI  PD +ARTT+++ Y A G++ +AR +F   P+ MRDT+ YNAMITG+SH +
Sbjct: 68  ARQLFDEISEPDKIARTTMVSGYCASGDITLARGVFEKAPVCMRDTVMYNAMITGFSHNN 127

Query: 127 DGYSAIELFHAMRRANFQPDDFTFTSVLSALALIVDGEHQCGQMHGAVVKFGVGLISSVL 186
           DGYSAI LF  M+   F+PD+FTF SVL+ LAL+ D E QC Q H A +K G G I+SV 
Sbjct: 128 DGYSAINLFCKMKHEGFKPDNFTFASVLAGLALVADDEKQCVQFHAAALKSGAGYITSVS 187

Query: 187 NALLSVYVKCASSPLVSSSSLMASARKLFDEMPERDELTWTTLITGYVRNDDLTGARELL 246
           NAL+SVY KCASSP     SL+ SARK+FDE+ E+DE +WTT++TGYV+N       ELL
Sbjct: 188 NALVSVYSKCASSP-----SLLHSARKVFDEILEKDERSWTTMMTGYVKNGYFDLGEELL 247

Query: 247 DTMTEPLG-VAWNAMISGYVHHSLFEDALTLFRKMRLLGVQHDEFTYTSVISACA----- 306
           + M + +  VA+NAMISGYV+   +++AL + R+M   G++ DEFTY SVI ACA     
Sbjct: 248 EGMDDNMKLVAYNAMISGYVNRGFYQEALEMVRRMVSSGIELDEFTYPSVIRACATAGLL 307

Query: 307 ------------------------------------------------------------ 366
                                                                       
Sbjct: 308 QLGKQVHAYVLRREDFSFHFDNSLVSLYYKCGKFDEARAIFEKMPAKDLVSWNALLSGYV 367

Query: 367 ---------------------------NGLAQNGFGEEGLKLFNQMRLDGYEPCDYAFAG 426
                                      +GLA+NGFGEEGLKLF+ M+ +G+EPCDYAF+G
Sbjct: 368 SSGHIGEAKLIFKEMKEKNILSWMIMISGLAENGFGEEGLKLFSCMKREGFEPCDYAFSG 427

Query: 427 VVTACSVLGALENGRQLHAQIVHLGHDSSLSVGNAMISI--------------------- 440
            + +C+VLGA  NG+Q HAQ++ +G DSSLS GNA+I++                     
Sbjct: 428 AIKSCAVLGAYCNGQQYHAQLLKIGFDSSLSAGNALITMYAKCGVVEEARQVFRTMPCLD 487

BLAST of HG10004783 vs. ExPASy Swiss-Prot
Match: Q9SHZ8 (Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H41 PE=3 SV=1)

HSP 1 Score: 187.2 bits (474), Expect = 4.0e-46
Identity = 146/528 (27.65%), Positives = 230/528 (43.56%), Query Frame = 0

Query: 6   DVRILANRYAAQLQLCAPQNPSSFSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFI 65
           D  +  +R+     L A  N     + + +H+H++ +GF   G ++N L+ +Y +     
Sbjct: 272 DSLLSPDRFTLASVLSACANLEKLCIGKQIHSHIVTTGFDISGIVLNALISMYSRCGGVE 331

Query: 66  YARQLFDEIPHPDAVAR--TTLITAYSALGNLNMAREIFNGTPLNMRDTIFYNAMITGYS 125
            AR+L ++    D      T L+  Y  LG++N A+ IF    L  RD + + AMI GY 
Sbjct: 332 TARRLIEQRGTKDLKIEGFTALLDGYIKLGDMNQAKNIF--VSLKDRDVVAWTAMIVGYE 391

Query: 126 HKDDGYSAIELFHAMRRANFQPDDFTFTSVLSALALIVDGEHQCGQMHGAVVKFGVGLIS 185
                  AI LF +M     +P+ +T  ++LS  + +    H   Q+HG+ VK G     
Sbjct: 392 QHGSYGEAINLFRSMVGGGQRPNSYTLAAMLSVASSLASLSHG-KQIHGSAVKSGEIYSV 451

Query: 186 SVLNALLSVYVKCASSPLVSSSSLMASARKLFDEMP-ERDELTWTTLITGYVRNDDLTGA 245
           SV NAL+++Y K  +         + SA + FD +  ERD ++WT++I    ++     A
Sbjct: 452 SVSNALITMYAKAGN---------ITSASRAFDLIRCERDTVSWTSMIIALAQHGHAEEA 511

Query: 246 RELLDTM----TEPLGVAWNAMISGYVHHSLFEDALTLFRKMR----------------- 305
            EL +TM      P  + +  + S   H  L       F  M+                 
Sbjct: 512 LELFETMLMEGLRPDHITYVGVFSACTHAGLVNQGRQYFDMMKDVDKIIPTLSHYACMVD 571

Query: 306 LLG----------------VQHDEFTYTSVISAC-------------------------- 365
           L G                ++ D  T+ S++SAC                          
Sbjct: 572 LFGRAGLLQEAQEFIEKMPIEPDVVTWGSLLSACRVHKNIDLGKVAAERLLLLEPENSGA 631

Query: 366 ----ANGLAQNGFGEEGLKLFNQMRLDGYEPCDYAFAGVVTACSV-LGALENG------- 425
               AN  +  G  EE  K+   M+ DG    +  F+ +     V +  +E+G       
Sbjct: 632 YSALANLYSACGKWEEAAKIRKSMK-DGRVKKEQGFSWIEVKHKVHVFGVEDGTHPEKNE 691

Query: 426 -----RQLHAQIVHLG---------HDSSLSVGNAMI--SIEKLAVGFGLMKLPEGATVR 440
                +++  +I  +G         HD    V   ++    EKLA+ FGL+  P+  T+R
Sbjct: 692 IYMTMKKIWDEIKKMGYVPDTASVLHDLEEEVKEQILRHHSEKLAIAFGLISTPDKTTLR 751

BLAST of HG10004783 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 181.8 bits (460), Expect = 1.7e-44
Identity = 140/632 (22.15%), Positives = 242/632 (38.29%), Query Frame = 0

Query: 5   LDVRILANRYAAQLQLCAPQNPSSFSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNF 64
           + + +L N Y     L +     +F   + +H H++  G     ++   L+ +Y ++   
Sbjct: 126 ISLGLLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRL 185

Query: 65  IYARQLFDEIPHPDAVARTTLITAYSALGNLNMAREIFNGTPLNMRDTIFYNAMITGYSH 124
             A ++FD+ PH D V+ T LI  Y++ G +  A+++F+  P  ++D + +NAMI+GY+ 
Sbjct: 186 EDAHKVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIP--VKDVVSWNAMISGYAE 245

Query: 125 KDDGYSAIELFHAMRRANFQPDDFTFTSVLSALALIVDGEHQCG-QMHGAVVKFGVGLIS 184
             +   A+ELF  M + N +PD+ T  +V+SA A    G  + G Q+H  +   G G   
Sbjct: 246 TGNYKEALELFKDMMKTNVRPDESTMVTVVSACA--QSGSIELGRQVHLWIDDHGFGSNL 305

Query: 185 SVLNALLSVYVKCASSPLVSSSSLMASARKLFDEMPERDELTWTTLITGYVR-------- 244
            ++NAL+ +Y KC           + +A  LF+ +P +D ++W TLI GY          
Sbjct: 306 KIVNALIDLYSKCGE---------LETACGLFERLPYKDVISWNTLIGGYTHMNLYKEAL 365

Query: 245 -------------ND--------------------------------------------- 304
                        ND                                             
Sbjct: 366 LLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLID 425

Query: 305 ------DLTGARELLDTMTEPLGVAWNAMISGYVHHSLFEDALTLFRKMRLLGVQHDEFT 364
                 D+  A ++ +++      +WNAMI G+  H   + +  LF +MR +G+Q D+ T
Sbjct: 426 MYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDIT 485

Query: 365 YTSVISACANG--------------------------------LAQNGFGEEGLKLFNQM 424
           +  ++SAC++                                 L  +G  +E  ++ N M
Sbjct: 486 FVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMM 545

Query: 425 RLDGYEPCDYAFAGVVTACSVLGALENGRQLHAQIVHLGHD------------------- 440
            +   EP    +  ++ AC + G +E G      ++ +  +                   
Sbjct: 546 EM---EPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRWN 605

BLAST of HG10004783 vs. ExPASy Swiss-Prot
Match: O23169 (Pentatricopeptide repeat-containing protein At4g37170 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H5 PE=3 SV=1)

HSP 1 Score: 175.3 bits (443), Expect = 1.6e-42
Identity = 147/625 (23.52%), Positives = 246/625 (39.36%), Query Frame = 0

Query: 11  ANRYAAQLQLCAPQNPSSFSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQL 70
           A+ Y   +Q+C+     +    + VH H+  SGF P   + NRLL +Y K  + + AR++
Sbjct: 85  ASTYCNLIQVCS--QTRALEEGKKVHEHIRTSGFVPGIVIWNRLLRMYAKCGSLVDARKV 144

Query: 71  FDEIPHPDAVARTTLITAYSALGNLNMAREIFNGTPLNMRDTIFYNAMITGYSHKDDGYS 130
           FDE+P+ D  +   ++  Y+ +G L  AR++F+   +  +D+  + AM+TGY  KD    
Sbjct: 145 FDEMPNRDLCSWNVMVNGYAEVGLLEEARKLFD--EMTEKDSYSWTAMVTGYVKKDQPEE 204

Query: 131 AIELFHAMRRA-NFQPDDFTFTSVLSALALIVDGEHQCGQMHGAVVKFGVGLISSVLNAL 190
           A+ L+  M+R  N +P+ FT  S+  A A  V    +  ++HG +V+ G+     + ++L
Sbjct: 205 ALVLYSLMQRVPNSRPNIFT-VSIAVAAAAAVKCIRRGKEIHGHIVRAGLDSDEVLWSSL 264

Query: 191 LSVYVKCASSPLVSSSSLMASARKLFDEMPERDELTWTTLITGYVRND------------ 250
           + +Y KC           +  AR +FD++ E+D ++WT++I  Y ++             
Sbjct: 265 MDMYGKC---------GCIDEARNIFDKIVEKDVVSWTSMIDRYFKSSRWREGFSLFSEL 324

Query: 251 --------------------DLT------------------------------------- 310
                               DLT                                     
Sbjct: 325 VGSCERPNEYTFAGVLNACADLTTEELGKQVHGYMTRVGFDPYSFASSSLVDMYTKCGNI 384

Query: 311 -GARELLDTMTEPLGVAWNAMISGYVHHSLFEDALTLFRKMRLLGVQHDEFTYTSVISAC 370
             A+ ++D   +P  V+W ++I G   +   ++AL  F  +   G + D  T+ +V+SAC
Sbjct: 385 ESAKHVVDGCPKPDLVSWTSLIGGCAQNGQPDEALKYFDLLLKSGTKPDHVTFVNVLSAC 444

Query: 371 ANGLAQNGFGEEGLKLF----NQMRL----DGY-------------------------EP 430
            +     G  E+GL+ F     + RL    D Y                         +P
Sbjct: 445 THA----GLVEKGLEFFYSITEKHRLSHTSDHYTCLVDLLARSGRFEQLKSVISEMPMKP 504

Query: 431 CDYAFAGVVTACSVLGALENGRQLHAQI-------------------------------- 440
             + +A V+  CS  G ++   +   ++                                
Sbjct: 505 SKFLWASVLGGCSTYGNIDLAEEAAQELFKIEPENPVTYVTMANIYAAAGKWEEEGKMRK 564

BLAST of HG10004783 vs. ExPASy Swiss-Prot
Match: Q9FJY7 (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 167.9 bits (424), Expect = 2.5e-40
Identity = 134/555 (24.14%), Positives = 217/555 (39.10%), Query Frame = 0

Query: 12  NRYAAQLQLCAPQNPSSFSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLF 71
           N Y     L A  N S+F     +HA +   G++   + VN L++ Y  + NF  A  LF
Sbjct: 114 NAYTFPSLLKACSNLSAFEETTQIHAQITKLGYENDVYAVNSLINSYAVTGNFKLAHLLF 173

Query: 72  DEIPHPDAVARTTLITAYSALGNLNMAREIFNGTPLNMRDTIFYNAMITGYSHKDDGYSA 131
           D IP PD V+  ++I  Y   G +++A  +F    +  ++ I +  MI+GY   D    A
Sbjct: 174 DRIPEPDDVSWNSVIKGYVKAGKMDIALTLFR--KMAEKNAISWTTMISGYVQADMNKEA 233

Query: 132 IELFHAMRRANFQPDDFTFTSVLSALALIVDGEHQCGQ-MHGAVVKFGVGLISSVLNALL 191
           ++LFH M+ ++ +PD+ +  + LSA A +  G  + G+ +H  + K  + + S +   L+
Sbjct: 234 LQLFHEMQNSDVEPDNVSLANALSACAQL--GALEQGKWIHSYLNKTRIRMDSVLGCVLI 293

Query: 192 SVYVKCASSPLVSSSSLMASARKLFDEMPERDELTWTTLITGYVRNDDLTGARELLDTMT 251
            +Y KC                    EM E                     A E+   + 
Sbjct: 294 DMYAKCG-------------------EMEE---------------------ALEVFKNIK 353

Query: 252 EPLGVAWNAMISGYVHHSLFEDALTLFRKMRLLGVQHDEFTYTSVISACANGLAQNGFGE 311
           +    AW A+ISGY +H    +A++ F +M+ +G++ +  T+T+V++AC    +  G  E
Sbjct: 354 KKSVQAWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTAC----SYTGLVE 413

Query: 312 EGLKLFNQMRLD-GYEPCDYAFAGVVTACSVLGALEN-----------------GRQLHA 371
           EG  +F  M  D   +P    +  +V      G L+                  G  L A
Sbjct: 414 EGKLIFYSMERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKA 473

Query: 372 QIVHLGHDSSLSVGNAMISI---------------------------------------- 431
             +H   +    +G  +I+I                                        
Sbjct: 474 CRIHKNIELGEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVP 533

Query: 432 ------------------------------------------------------------ 440
                                                                       
Sbjct: 534 GCSTISLEGTTHEFLAGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDER 593

BLAST of HG10004783 vs. ExPASy TrEMBL
Match: A0A6J1GGL5 (pentatricopeptide repeat-containing protein At1g25360-like OS=Cucurbita moschata OX=3662 GN=LOC111454019 PE=3 SV=1)

HSP 1 Score: 683.3 bits (1762), Expect = 6.6e-193
Identity = 400/797 (50.19%), Positives = 420/797 (52.70%), Query Frame = 0

Query: 1   MRNALDVRILANRYAAQLQLCAPQNPSSFSLARTVHAHMIASGFKPRGHLVNRLLDIYWK 60
           MRNA+DVR+LANRYAAQLQLC PQNPSSFSLARTVHAHMI SGFKPRGHLVNRLLDIYWK
Sbjct: 1   MRNAIDVRVLANRYAAQLQLCCPQNPSSFSLARTVHAHMIVSGFKPRGHLVNRLLDIYWK 60

Query: 61  SSNFIYARQLFDEIPHPDAVARTTLITAYSALGNLNMAREIFNGTPLNMRDTIFYNAMIT 120
           SSN +YARQLFDEIP+PDAVARTTLITAYS LGNLNMAREIFNGTPLNMRDTIFYNAMIT
Sbjct: 61  SSNLVYARQLFDEIPNPDAVARTTLITAYSNLGNLNMAREIFNGTPLNMRDTIFYNAMIT 120

Query: 121 GYSHKDDGYSAIELFHAMRRANFQPDDFTFTSVLSALALIVDGEHQCGQMHGAVVKFGVG 180
           G+SH  DG+SAI LFHAMRR+NF+PDDFTFTSVLSALALIVD E QCGQMHGAVVK G G
Sbjct: 121 GFSHNVDGHSAIGLFHAMRRSNFRPDDFTFTSVLSALALIVDNEQQCGQMHGAVVKSGTG 180

Query: 181 LISSVLNALLSVYVKCASSPLVSSSSLMASARKLFDEMPERDELTWTTLITGYVRNDDLT 240
           L+SSVLNALLSVYVKCASSPLVSSSSLMASARKLFDEMP+RDELTWTTLITGYVRNDDL 
Sbjct: 181 LVSSVLNALLSVYVKCASSPLVSSSSLMASARKLFDEMPQRDELTWTTLITGYVRNDDLN 240

Query: 241 GARELLDTMTEPLGVAWNAMISGYVHHSLFEDALTLFRKMRLLGVQHDEFTYTSVISACA 300
           GARELLDTMTE LGVAWNAMISGYVHH LFEDALTLFRKMR LGV+ DEFTYTSVISACA
Sbjct: 241 GARELLDTMTEKLGVAWNAMISGYVHHGLFEDALTLFRKMRFLGVELDEFTYTSVISACA 300

Query: 301 N----------------------------------------------------------- 360
           N                                                           
Sbjct: 301 NGGFFQLGKELHAYILKNELNPNHDFLLSVSNSLITLYWKYGKVDGARHIFYEMPVKDIV 360

Query: 361 --------------------------------------GLAQNGFGEEGLKLFNQMRLDG 420
                                                 GLAQNGFGEEGL LFN+MRLDG
Sbjct: 361 SWNAILSGYVNAGRMEEAKSFFAQMPEKNLLTWTVMISGLAQNGFGEEGLNLFNRMRLDG 420

Query: 421 YEPCDYAFAGVVTACSVLGALENGRQLHAQIVHLGHDSSLSVGNAMISI----------- 440
           YEPCDYAFAG +TACSVLG+LENGRQLHAQ++HLGHDSSLS+GNAMIS+           
Sbjct: 421 YEPCDYAFAGAITACSVLGSLENGRQLHAQLIHLGHDSSLSIGNAMISMYARCGVVEAAR 480

BLAST of HG10004783 vs. ExPASy TrEMBL
Match: A0A6J1CRF6 (pentatricopeptide repeat-containing protein At1g25360-like OS=Momordica charantia OX=3673 GN=LOC111014069 PE=3 SV=1)

HSP 1 Score: 679.9 bits (1753), Expect = 7.2e-192
Identity = 396/797 (49.69%), Positives = 420/797 (52.70%), Query Frame = 0

Query: 1   MRNALDVRILANRYAAQLQLCAPQNPSSFSLARTVHAHMIASGFKPRGHLVNRLLDIYWK 60
           MRN+L +R++ANRYAAQLQLC PQNPSSFSLARTVHAHMIASGFKPRGHLVNRLLD+YWK
Sbjct: 1   MRNSLGIRVVANRYAAQLQLCCPQNPSSFSLARTVHAHMIASGFKPRGHLVNRLLDVYWK 60

Query: 61  SSNFIYARQLFDEIPHPDAVARTTLITAYSALGNLNMAREIFNGTPLNMRDTIFYNAMIT 120
           SSN +YARQLFDEIP+PDAVARTTLITAYSA+GNL++AR IFNGTPL MRDTIFYNAMIT
Sbjct: 61  SSNLVYARQLFDEIPNPDAVARTTLITAYSAVGNLSLARAIFNGTPLVMRDTIFYNAMIT 120

Query: 121 GYSHKDDGYSAIELFHAMRRANFQPDDFTFTSVLSALALIVDGEHQCGQMHGAVVKFGVG 180
           GYSH DDG+SA+ELFHAMRR NFQPDDFTFTSVLSALALIVD E QC Q+HGAVVK G G
Sbjct: 121 GYSHNDDGHSALELFHAMRRGNFQPDDFTFTSVLSALALIVDNEQQCSQIHGAVVKSGTG 180

Query: 181 LISSVLNALLSVYVKCASSPLVSSSSLMASARKLFDEMPERDELTWTTLITGYVRNDDLT 240
           L+SSVLNALLSVYVKCASSPLV SSSLMASARKLFDEMPERDELTWTT+ITGYVRNDDL 
Sbjct: 181 LVSSVLNALLSVYVKCASSPLVLSSSLMASARKLFDEMPERDELTWTTMITGYVRNDDLN 240

Query: 241 GARELLDTMTEPLGVAWNAMISGYVHHSLFEDALTLFRKMRLLGVQHDEFTYTSVISACA 300
           GA  LLDTMTE LGVAWN+MISGYVHH LF++ALTLFRKMRLLGVQHDEFTYTSVISACA
Sbjct: 241 GAHGLLDTMTEKLGVAWNSMISGYVHHGLFQEALTLFRKMRLLGVQHDEFTYTSVISACA 300

Query: 301 N----------------------------------------------------------- 360
           N                                                           
Sbjct: 301 NGGFFQLGKQVHAYILKNELNPNHDFILSVSNALITLYWKFGKVDGARKIFYEMPIKDIV 360

Query: 361 --------------------------------------GLAQNGFGEEGLKLFNQMRLDG 420
                                                 GLAQNGFGEEGLKLFNQMRLDG
Sbjct: 361 TWNAILSGYVNSGRMEEAKSFFVEMPEKNLLTWTVMISGLAQNGFGEEGLKLFNQMRLDG 420

Query: 421 YEPCDYAFAGVVTACSVLGALENGRQLHAQIVHLGHDSSLSVGNAMISI----------- 440
           YEPCDYAFAG +TACSVLGALENGRQLHAQ++HLGHDSSLSVGNAMIS+           
Sbjct: 421 YEPCDYAFAGAITACSVLGALENGRQLHAQLIHLGHDSSLSVGNAMISMYARCGVVEAAK 480

BLAST of HG10004783 vs. ExPASy TrEMBL
Match: A0A6J1KSZ4 (pentatricopeptide repeat-containing protein At1g25360-like OS=Cucurbita maxima OX=3661 GN=LOC111496149 PE=3 SV=1)

HSP 1 Score: 670.6 bits (1729), Expect = 4.4e-189
Identity = 396/797 (49.69%), Positives = 416/797 (52.20%), Query Frame = 0

Query: 1   MRNALDVRILANRYAAQLQLCAPQNPSSFSLARTVHAHMIASGFKPRGHLVNRLLDIYWK 60
           MRN +DVR+LANRYAAQLQLC PQN SSFSLARTVHAHMI SGFK RGHLVNRLLDIYWK
Sbjct: 1   MRNVIDVRVLANRYAAQLQLCCPQNSSSFSLARTVHAHMIVSGFKLRGHLVNRLLDIYWK 60

Query: 61  SSNFIYARQLFDEIPHPDAVARTTLITAYSALGNLNMAREIFNGTPLNMRDTIFYNAMIT 120
           SSN +YARQLFDEIP+PDAVARTTLITAYS LGNLNMAREIFN TPLNMRDTIFYNAMIT
Sbjct: 61  SSNLVYARQLFDEIPNPDAVARTTLITAYSNLGNLNMAREIFNRTPLNMRDTIFYNAMIT 120

Query: 121 GYSHKDDGYSAIELFHAMRRANFQPDDFTFTSVLSALALIVDGEHQCGQMHGAVVKFGVG 180
           G+SH  DG+SAI LFHAMRR+NF+PDDFTFTSVLSALALIVD E QCGQMHGAVVK G G
Sbjct: 121 GFSHNVDGHSAIGLFHAMRRSNFRPDDFTFTSVLSALALIVDNEQQCGQMHGAVVKSGTG 180

Query: 181 LISSVLNALLSVYVKCASSPLVSSSSLMASARKLFDEMPERDELTWTTLITGYVRNDDLT 240
           L+SSVLNALLSVYVKCASSPLVSSSSLMASARKLFDEMP+RDELTWTTLITGYVRNDDL 
Sbjct: 181 LVSSVLNALLSVYVKCASSPLVSSSSLMASARKLFDEMPQRDELTWTTLITGYVRNDDLN 240

Query: 241 GARELLDTMTEPLGVAWNAMISGYVHHSLFEDALTLFRKMRLLGVQHDEFTYTSVISACA 300
           GARELLDTMTE LGVAWNAMISGYVHH LFEDALTLFRKMR LGV+ DEFTYTSVISACA
Sbjct: 241 GARELLDTMTEKLGVAWNAMISGYVHHGLFEDALTLFRKMRFLGVELDEFTYTSVISACA 300

Query: 301 N----------------------------------------------------------- 360
           N                                                           
Sbjct: 301 NGGFFQLGKELHAYILKNELNPNHDFLLSVSNSLITLYWKYGKVDGARNIFYEMPVKDIV 360

Query: 361 --------------------------------------GLAQNGFGEEGLKLFNQMRLDG 420
                                                 GLAQNGFGEEGL LFN+MRLDG
Sbjct: 361 SWNAILSGYVNAGRMEEAKSFFAQMPEKNLLTWTVMISGLAQNGFGEEGLNLFNRMRLDG 420

Query: 421 YEPCDYAFAGVVTACSVLGALENGRQLHAQIVHLGHDSSLSVGNAMISI----------- 440
           YEPCDYAFAG +TACSVLG+LENGRQLHAQ+VHLGHDSSLS+GNAMIS+           
Sbjct: 421 YEPCDYAFAGAITACSVLGSLENGRQLHAQLVHLGHDSSLSIGNAMISMYARCGVVEAAR 480
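
The percentages in an HSP header are plain ratios over the alignment length: for the Cucurbita maxima hit above, 396 identical columns out of 797 gives 49.69%, and 416 positive columns out of 797 gives 52.20%. A minimal Python sketch that re-derives them from the header text follows. The function name `parse_hsp_stats` and the regular expression are illustrative assumptions, not part of the database's tooling; the pattern accepts either spelling of the positives label.

```python
import re

# Matches e.g. "Identity = 396/797 (49.69%), Postives = 416/797 (52.20%)".
# "Pos\w*" accepts both "Positives" and the "Postives" spelling.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Pos\w*\s*=\s*(\d+)/(\d+)"
)


def parse_hsp_stats(line):
    """Return identity and positives percentages recomputed from raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return {
        "identity_pct": round(100.0 * ident / ident_len, 2),
        "positives_pct": round(100.0 * pos / pos_len, 2),
        "alignment_length": ident_len,
    }


# Header line copied from the A0A6J1KSZ4 hit (hypothetical usage).
stats = parse_hsp_stats(
    "Identity = 396/797 (49.69%), Postives = 416/797 (52.20%), Query Frame = 0"
)
print(stats)
```

Recomputing from the counts rather than trusting the printed percentage is a cheap consistency check when these headers are scraped in bulk.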

BLAST of HG10004783 vs. ExPASy TrEMBL
Match: A0A1S4DVG9 (pentatricopeptide repeat-containing protein At1g25360-like OS=Cucumis melo OX=3656 GN=LOC103488043 PE=3 SV=1)

HSP 1 Score: 647.1 bits (1668), Expect = 5.2e-182
Identity = 373/742 (50.27%), Positives = 401/742 (54.04%), Query Frame = 0

Query: 1   MRNALDVRILANRYAAQLQLCAPQNPSSFSLARTVHAHMIASGFKPRGHLVNRLLDIYWK 60
           MRN LDVR+LANRY AQL LC PQNPSS+SLARTVHAH+IASGFK RGH+VNRL+D+YWK
Sbjct: 1   MRNVLDVRVLANRYVAQLNLCCPQNPSSYSLARTVHAHVIASGFKLRGHIVNRLIDVYWK 60

Query: 61  SSNFIYARQLFDEIPHPDAVARTTLITAYSALGNLNMAREIFNGTPLNMRDTIFYNAMIT 120
           SS+F+YARQLFDEIP PD +ARTTLITAYSALGNL MAREIFN TPL+MRDT+FYNAMIT
Sbjct: 61  SSDFVYARQLFDEIPQPDVIARTTLITAYSALGNLKMAREIFNETPLDMRDTVFYNAMIT 120

Query: 121 GYSHKDDGYSAIELFHAMRRANFQPDDFTFTSVLSALALIVDGEHQCGQMHGAVVKFGVG 180
           GYSH +DG+SAIELF AMR ANFQPDDFTF SVLSA  LI D E QCGQMHGAVVK G+G
Sbjct: 121 GYSHMNDGHSAIELFRAMRWANFQPDDFTFASVLSASTLIFDDERQCGQMHGAVVKSGIG 180

Query: 181 LISSVLNALLSVYVKCASSPLVSSSSLMASARKLFDEMPERDELTWTTLITGYVRNDDLT 240
           L  +VLN+LLSVYVKCASSPLVSSSSLMASARKLFDEMP+R+E  WTTLITGYVRNDDL 
Sbjct: 181 LFPAVLNSLLSVYVKCASSPLVSSSSLMASARKLFDEMPKRNEFIWTTLITGYVRNDDLA 240

Query: 241 GARELLDTMTEPLGVAWNAMISGYVHHSLFEDALTLFRKMRLLGVQHDEFTYTSVISACA 300
            ARE+LDTMTE  G+AWNAMISGY+HH LFEDALTLFRKMRLLGVQ DE TYTSVISACA
Sbjct: 241 AAREILDTMTEQPGIAWNAMISGYLHHGLFEDALTLFRKMRLLGVQLDESTYTSVISACA 300

Query: 301 ------------------------------------------------------------ 360
                                                                       
Sbjct: 301 DGGFFLLGKQVHAYILKNELNPDRNFLLSVGNTLITLYWKYGKVDGARKIFYEMPVKDVI 360

Query: 361 -------------------------------------NGLAQNGFGEEGLKLFNQMRLDG 420
                                                +GLAQNGFGE+ LKLFNQMRLDG
Sbjct: 361 SWNTLLSGYVNAGRMEEAKSFFAQMPEKNLLTWTVMISGLAQNGFGEQALKLFNQMRLDG 420

Query: 421 YEPCDYAFAGVVTACSVLGALENGRQLHAQIVHLGHDSSLSVGNAMISI----------- 440
           YEP DYAFAG +TACSVLGALENGRQLHAQIVHLGHDS+LSVGNAMI++           
Sbjct: 421 YEPNDYAFAGAITACSVLGALENGRQLHAQIVHLGHDSTLSVGNAMITMYARCGVVEAAR 480

BLAST of HG10004783 vs. ExPASy TrEMBL
Match: A0A5A7VDN1 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G005030 PE=3 SV=1)

HSP 1 Score: 625.9 bits (1613), Expect = 1.2e-175
Identity = 373/797 (46.80%), Positives = 401/797 (50.31%), Query Frame = 0

Query: 1   MRNALDVRILANRYAAQLQLCAPQNPSSFSLARTVHAHMIASGFKPRGHLVNRLLDIYWK 60
           MRN LDVR+LANRY AQL LC PQNPSS+SLARTVHAH+IASGFK RGH+VNRL+D+YWK
Sbjct: 1   MRNVLDVRVLANRYVAQLNLCCPQNPSSYSLARTVHAHVIASGFKLRGHIVNRLIDVYWK 60

Query: 61  SSNFIYARQLFDEIPHPDAVARTTLITAYSALGNLNMAREIFNGTPLNMRDTIFYNAMIT 120
           SS+F+YARQLFDEIP PD +ARTTLITAYSALGNL MAREIFN TPL+MRDT+FYNAMIT
Sbjct: 61  SSDFVYARQLFDEIPQPDVIARTTLITAYSALGNLKMAREIFNETPLDMRDTVFYNAMIT 120

Query: 121 GYSHKDDGYSAIELFHAMRRANFQPDDFTFTSVLSALALIVDGEHQCGQMHGAVVKFGVG 180
           GYSH +DG+SAIELF AMR ANFQPDDFTF SVLSA  LI D E QCGQMHGAVVK G+G
Sbjct: 121 GYSHMNDGHSAIELFRAMRWANFQPDDFTFASVLSASTLIFDDERQCGQMHGAVVKSGIG 180

Query: 181 LISSVLNALLSVYVKCASSPLVSSSSLMASARKLFDEMPERDELTWTTLITGYVRNDDLT 240
           L  +VLN+LLSVYVKCASSPLVSSSSLMASARKLFDEMP+R+E  WTTLITGYVRNDDL 
Sbjct: 181 LFPAVLNSLLSVYVKCASSPLVSSSSLMASARKLFDEMPKRNEFIWTTLITGYVRNDDLA 240

Query: 241 GARELLDTMTEPLGVAWNAMISGYVHHSLFEDALTLFRKMRLLGVQHDEFTYTSVISACA 300
            ARE+LDTMTE  G+AWNAMISGY+HH LFEDALTLFRKMRLLGVQ DE TYTSVISACA
Sbjct: 241 AAREILDTMTEQPGIAWNAMISGYLHHGLFEDALTLFRKMRLLGVQLDESTYTSVISACA 300

Query: 301 ------------------------------------------------------------ 360
                                                                       
Sbjct: 301 DGGFFLLGKQVHAYILKNELNPDRNFLLSVGNTLITLYWKYGKVDGARKIFYEMPVKDVI 360

Query: 361 -------------------------------------NGLAQNGFGEEGLKLFNQMRLDG 420
                                                +GLAQNGFGE+ LKLFNQMRLDG
Sbjct: 361 SWNTLLSGYVNAGRMEEAKSFFAQMPEKNLLTWTVMISGLAQNGFGEQALKLFNQMRLDG 420

Query: 421 YEPCDYAFAGVVTACSVLGALENGRQLHAQIVHLGHDSSLSVGNAMISI----------- 440
           YEP DYAFAG +TACSVLGALENGRQLHAQIVHLGHDS+LSVGNAMI++           
Sbjct: 421 YEPNDYAFAGAITACSVLGALENGRQLHAQIVHLGHDSTLSVGNAMITMYARCGVVEAAR 480

BLAST of HG10004783 vs. TAIR 10
Match: AT1G25360.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 402.1 bits (1032), Expect = 5.6e-112
Identity = 257/788 (32.61%), Positives = 335/788 (42.51%), Query Frame = 0

Query: 7   VRILANRYAAQLQLCAPQNPSSFSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIY 66
           VR +ANRYAA L+LC P   +S  LAR VH ++I  GF+PR H++NRL+D+Y KSS   Y
Sbjct: 8   VRAIANRYAANLRLCLPLRRTSLQLARAVHGNIITFGFQPRAHILNRLIDVYCKSSELNY 67

Query: 67  ARQLFDEIPHPDAVARTTLITAYSALGNLNMAREIFNGTPLNMRDTIFYNAMITGYSHKD 126
           ARQLFDEI  PD +ARTT+++ Y A G++ +AR +F   P+ MRDT+ YNAMITG+SH +
Sbjct: 68  ARQLFDEISEPDKIARTTMVSGYCASGDITLARGVFEKAPVCMRDTVMYNAMITGFSHNN 127

Query: 127 DGYSAIELFHAMRRANFQPDDFTFTSVLSALALIVDGEHQCGQMHGAVVKFGVGLISSVL 186
           DGYSAI LF  M+   F+PD+FTF SVL+ LAL+ D E QC Q H A +K G G I+SV 
Sbjct: 128 DGYSAINLFCKMKHEGFKPDNFTFASVLAGLALVADDEKQCVQFHAAALKSGAGYITSVS 187

Query: 187 NALLSVYVKCASSPLVSSSSLMASARKLFDEMPERDELTWTTLITGYVRNDDLTGARELL 246
           NAL+SVY KCASSP     SL+ SARK+FDE+ E+DE +WTT++TGYV+N       ELL
Sbjct: 188 NALVSVYSKCASSP-----SLLHSARKVFDEILEKDERSWTTMMTGYVKNGYFDLGEELL 247

Query: 247 DTMTEPLG-VAWNAMISGYVHHSLFEDALTLFRKMRLLGVQHDEFTYTSVISACA----- 306
           + M + +  VA+NAMISGYV+   +++AL + R+M   G++ DEFTY SVI ACA     
Sbjct: 248 EGMDDNMKLVAYNAMISGYVNRGFYQEALEMVRRMVSSGIELDEFTYPSVIRACATAGLL 307

Query: 307 ------------------------------------------------------------ 366
                                                                       
Sbjct: 308 QLGKQVHAYVLRREDFSFHFDNSLVSLYYKCGKFDEARAIFEKMPAKDLVSWNALLSGYV 367

Query: 367 ---------------------------NGLAQNGFGEEGLKLFNQMRLDGYEPCDYAFAG 426
                                      +GLA+NGFGEEGLKLF+ M+ +G+EPCDYAF+G
Sbjct: 368 SSGHIGEAKLIFKEMKEKNILSWMIMISGLAENGFGEEGLKLFSCMKREGFEPCDYAFSG 427

Query: 427 VVTACSVLGALENGRQLHAQIVHLGHDSSLSVGNAMISI--------------------- 440
            + +C+VLGA  NG+Q HAQ++ +G DSSLS GNA+I++                     
Sbjct: 428 AIKSCAVLGAYCNGQQYHAQLLKIGFDSSLSAGNALITMYAKCGVVEEARQVFRTMPCLD 487

BLAST of HG10004783 vs. TAIR 10
Match: AT2G22070.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 187.2 bits (474), Expect = 2.8e-47
Identity = 146/528 (27.65%), Positives = 230/528 (43.56%), Query Frame = 0

Query: 6   DVRILANRYAAQLQLCAPQNPSSFSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFI 65
           D  +  +R+     L A  N     + + +H+H++ +GF   G ++N L+ +Y +     
Sbjct: 272 DSLLSPDRFTLASVLSACANLEKLCIGKQIHSHIVTTGFDISGIVLNALISMYSRCGGVE 331

Query: 66  YARQLFDEIPHPDAVAR--TTLITAYSALGNLNMAREIFNGTPLNMRDTIFYNAMITGYS 125
            AR+L ++    D      T L+  Y  LG++N A+ IF    L  RD + + AMI GY 
Sbjct: 332 TARRLIEQRGTKDLKIEGFTALLDGYIKLGDMNQAKNIF--VSLKDRDVVAWTAMIVGYE 391

Query: 126 HKDDGYSAIELFHAMRRANFQPDDFTFTSVLSALALIVDGEHQCGQMHGAVVKFGVGLIS 185
                  AI LF +M     +P+ +T  ++LS  + +    H   Q+HG+ VK G     
Sbjct: 392 QHGSYGEAINLFRSMVGGGQRPNSYTLAAMLSVASSLASLSHG-KQIHGSAVKSGEIYSV 451

Query: 186 SVLNALLSVYVKCASSPLVSSSSLMASARKLFDEMP-ERDELTWTTLITGYVRNDDLTGA 245
           SV NAL+++Y K  +         + SA + FD +  ERD ++WT++I    ++     A
Sbjct: 452 SVSNALITMYAKAGN---------ITSASRAFDLIRCERDTVSWTSMIIALAQHGHAEEA 511

Query: 246 RELLDTM----TEPLGVAWNAMISGYVHHSLFEDALTLFRKMR----------------- 305
            EL +TM      P  + +  + S   H  L       F  M+                 
Sbjct: 512 LELFETMLMEGLRPDHITYVGVFSACTHAGLVNQGRQYFDMMKDVDKIIPTLSHYACMVD 571

Query: 306 LLG----------------VQHDEFTYTSVISAC-------------------------- 365
           L G                ++ D  T+ S++SAC                          
Sbjct: 572 LFGRAGLLQEAQEFIEKMPIEPDVVTWGSLLSACRVHKNIDLGKVAAERLLLLEPENSGA 631

Query: 366 ----ANGLAQNGFGEEGLKLFNQMRLDGYEPCDYAFAGVVTACSV-LGALENG------- 425
               AN  +  G  EE  K+   M+ DG    +  F+ +     V +  +E+G       
Sbjct: 632 YSALANLYSACGKWEEAAKIRKSMK-DGRVKKEQGFSWIEVKHKVHVFGVEDGTHPEKNE 691

Query: 426 -----RQLHAQIVHLG---------HDSSLSVGNAMI--SIEKLAVGFGLMKLPEGATVR 440
                +++  +I  +G         HD    V   ++    EKLA+ FGL+  P+  T+R
Sbjct: 692 IYMTMKKIWDEIKKMGYVPDTASVLHDLEEEVKEQILRHHSEKLAIAFGLISTPDKTTLR 751

BLAST of HG10004783 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 181.8 bits (460), Expect = 1.2e-45
Identity = 140/632 (22.15%), Positives = 242/632 (38.29%), Query Frame = 0

Query: 5   LDVRILANRYAAQLQLCAPQNPSSFSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNF 64
           + + +L N Y     L +     +F   + +H H++  G     ++   L+ +Y ++   
Sbjct: 126 ISLGLLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRL 185

Query: 65  IYARQLFDEIPHPDAVARTTLITAYSALGNLNMAREIFNGTPLNMRDTIFYNAMITGYSH 124
             A ++FD+ PH D V+ T LI  Y++ G +  A+++F+  P  ++D + +NAMI+GY+ 
Sbjct: 186 EDAHKVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIP--VKDVVSWNAMISGYAE 245

Query: 125 KDDGYSAIELFHAMRRANFQPDDFTFTSVLSALALIVDGEHQCG-QMHGAVVKFGVGLIS 184
             +   A+ELF  M + N +PD+ T  +V+SA A    G  + G Q+H  +   G G   
Sbjct: 246 TGNYKEALELFKDMMKTNVRPDESTMVTVVSACA--QSGSIELGRQVHLWIDDHGFGSNL 305

Query: 185 SVLNALLSVYVKCASSPLVSSSSLMASARKLFDEMPERDELTWTTLITGYVR-------- 244
            ++NAL+ +Y KC           + +A  LF+ +P +D ++W TLI GY          
Sbjct: 306 KIVNALIDLYSKCGE---------LETACGLFERLPYKDVISWNTLIGGYTHMNLYKEAL 365

Query: 245 -------------ND--------------------------------------------- 304
                        ND                                             
Sbjct: 366 LLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLID 425

Query: 305 ------DLTGARELLDTMTEPLGVAWNAMISGYVHHSLFEDALTLFRKMRLLGVQHDEFT 364
                 D+  A ++ +++      +WNAMI G+  H   + +  LF +MR +G+Q D+ T
Sbjct: 426 MYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDIT 485

Query: 365 YTSVISACANG--------------------------------LAQNGFGEEGLKLFNQM 424
           +  ++SAC++                                 L  +G  +E  ++ N M
Sbjct: 486 FVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMM 545

Query: 425 RLDGYEPCDYAFAGVVTACSVLGALENGRQLHAQIVHLGHD------------------- 440
            +   EP    +  ++ AC + G +E G      ++ +  +                   
Sbjct: 546 EM---EPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRWN 605

BLAST of HG10004783 vs. TAIR 10
Match: AT4G37170.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 175.3 bits (443), Expect = 1.1e-43
Identity = 147/625 (23.52%), Positives = 246/625 (39.36%), Query Frame = 0

Query: 11  ANRYAAQLQLCAPQNPSSFSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQL 70
           A+ Y   +Q+C+     +    + VH H+  SGF P   + NRLL +Y K  + + AR++
Sbjct: 85  ASTYCNLIQVCS--QTRALEEGKKVHEHIRTSGFVPGIVIWNRLLRMYAKCGSLVDARKV 144

Query: 71  FDEIPHPDAVARTTLITAYSALGNLNMAREIFNGTPLNMRDTIFYNAMITGYSHKDDGYS 130
           FDE+P+ D  +   ++  Y+ +G L  AR++F+   +  +D+  + AM+TGY  KD    
Sbjct: 145 FDEMPNRDLCSWNVMVNGYAEVGLLEEARKLFD--EMTEKDSYSWTAMVTGYVKKDQPEE 204

Query: 131 AIELFHAMRRA-NFQPDDFTFTSVLSALALIVDGEHQCGQMHGAVVKFGVGLISSVLNAL 190
           A+ L+  M+R  N +P+ FT  S+  A A  V    +  ++HG +V+ G+     + ++L
Sbjct: 205 ALVLYSLMQRVPNSRPNIFT-VSIAVAAAAAVKCIRRGKEIHGHIVRAGLDSDEVLWSSL 264

Query: 191 LSVYVKCASSPLVSSSSLMASARKLFDEMPERDELTWTTLITGYVRND------------ 250
           + +Y KC           +  AR +FD++ E+D ++WT++I  Y ++             
Sbjct: 265 MDMYGKC---------GCIDEARNIFDKIVEKDVVSWTSMIDRYFKSSRWREGFSLFSEL 324

Query: 251 --------------------DLT------------------------------------- 310
                               DLT                                     
Sbjct: 325 VGSCERPNEYTFAGVLNACADLTTEELGKQVHGYMTRVGFDPYSFASSSLVDMYTKCGNI 384

Query: 311 -GARELLDTMTEPLGVAWNAMISGYVHHSLFEDALTLFRKMRLLGVQHDEFTYTSVISAC 370
             A+ ++D   +P  V+W ++I G   +   ++AL  F  +   G + D  T+ +V+SAC
Sbjct: 385 ESAKHVVDGCPKPDLVSWTSLIGGCAQNGQPDEALKYFDLLLKSGTKPDHVTFVNVLSAC 444

Query: 371 ANGLAQNGFGEEGLKLF----NQMRL----DGY-------------------------EP 430
            +     G  E+GL+ F     + RL    D Y                         +P
Sbjct: 445 THA----GLVEKGLEFFYSITEKHRLSHTSDHYTCLVDLLARSGRFEQLKSVISEMPMKP 504

Query: 431 CDYAFAGVVTACSVLGALENGRQLHAQI-------------------------------- 440
             + +A V+  CS  G ++   +   ++                                
Sbjct: 505 SKFLWASVLGGCSTYGNIDLAEEAAQELFKIEPENPVTYVTMANIYAAAGKWEEEGKMRK 564

BLAST of HG10004783 vs. TAIR 10
Match: AT5G66520.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 167.9 bits (424), Expect = 1.8e-41
Identity = 134/555 (24.14%), Positives = 217/555 (39.10%), Query Frame = 0

Query: 12  NRYAAQLQLCAPQNPSSFSLARTVHAHMIASGFKPRGHLVNRLLDIYWKSSNFIYARQLF 71
           N Y     L A  N S+F     +HA +   G++   + VN L++ Y  + NF  A  LF
Sbjct: 114 NAYTFPSLLKACSNLSAFEETTQIHAQITKLGYENDVYAVNSLINSYAVTGNFKLAHLLF 173

Query: 72  DEIPHPDAVARTTLITAYSALGNLNMAREIFNGTPLNMRDTIFYNAMITGYSHKDDGYSA 131
           D IP PD V+  ++I  Y   G +++A  +F    +  ++ I +  MI+GY   D    A
Sbjct: 174 DRIPEPDDVSWNSVIKGYVKAGKMDIALTLFR--KMAEKNAISWTTMISGYVQADMNKEA 233

Query: 132 IELFHAMRRANFQPDDFTFTSVLSALALIVDGEHQCGQ-MHGAVVKFGVGLISSVLNALL 191
           ++LFH M+ ++ +PD+ +  + LSA A +  G  + G+ +H  + K  + + S +   L+
Sbjct: 234 LQLFHEMQNSDVEPDNVSLANALSACAQL--GALEQGKWIHSYLNKTRIRMDSVLGCVLI 293

Query: 192 SVYVKCASSPLVSSSSLMASARKLFDEMPERDELTWTTLITGYVRNDDLTGARELLDTMT 251
            +Y KC                    EM E                     A E+   + 
Sbjct: 294 DMYAKCG-------------------EMEE---------------------ALEVFKNIK 353

Query: 252 EPLGVAWNAMISGYVHHSLFEDALTLFRKMRLLGVQHDEFTYTSVISACANGLAQNGFGE 311
           +    AW A+ISGY +H    +A++ F +M+ +G++ +  T+T+V++AC    +  G  E
Sbjct: 354 KKSVQAWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTAC----SYTGLVE 413

Query: 312 EGLKLFNQMRLD-GYEPCDYAFAGVVTACSVLGALEN-----------------GRQLHA 371
           EG  +F  M  D   +P    +  +V      G L+                  G  L A
Sbjct: 414 EGKLIFYSMERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKA 473

Query: 372 QIVHLGHDSSLSVGNAMISI---------------------------------------- 431
             +H   +    +G  +I+I                                        
Sbjct: 474 CRIHKNIELGEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVP 533

Query: 432 ------------------------------------------------------------ 440
                                                                       
Sbjct: 534 GCSTISLEGTTHEFLAGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDER 593

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038886633.1  6.1e-201  52.07     pentatricopeptide repeat-containing protein At1g25360-like [Benincasa hispida]
XP_023537093.1  3.4e-196  52.22     pentatricopeptide repeat-containing protein At1g25360-like [Cucurbita pepo subsp...
XP_022951057.1  1.4e-192  50.19     pentatricopeptide repeat-containing protein At1g25360-like [Cucurbita moschata]
KAG7020472.1    3.0e-192  50.06     Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
KAG6585558.1    3.0e-192  50.06     Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
Match Name  E-value   Identity  Description
Q9FRI5      7.9e-111  32.61     Pentatricopeptide repeat-containing protein At1g25360 OS=Arabidopsis thaliana OX...
Q9SHZ8      4.0e-46   27.65     Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX...
Q9LN01      1.7e-44   22.15     Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
O23169      1.6e-42   23.52     Pentatricopeptide repeat-containing protein At4g37170 OS=Arabidopsis thaliana OX...
Q9FJY7      2.5e-40   24.14     Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX...
Match Name  E-value   Identity  Description
A0A6J1GGL5  6.6e-193  50.19     pentatricopeptide repeat-containing protein At1g25360-like OS=Cucurbita moschata...
A0A6J1CRF6  7.2e-192  49.69     pentatricopeptide repeat-containing protein At1g25360-like OS=Momordica charanti...
A0A6J1KSZ4  4.4e-189  49.69     pentatricopeptide repeat-containing protein At1g25360-like OS=Cucurbita maxima O...
A0A1S4DVG9  5.2e-182  50.27     pentatricopeptide repeat-containing protein At1g25360-like OS=Cucumis melo OX=36...
A0A5A7VDN1  1.2e-175  46.80     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
Match Name   E-value   Identity  Description
AT1G25360.1  5.6e-112  32.61     Pentatricopeptide repeat (PPR) superfamily protein
AT2G22070.1  2.8e-47   27.65     pentatricopeptide (PPR) repeat-containing protein
AT1G08070.1  1.2e-45   22.15     Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G37170.1  1.1e-43   23.52     Pentatricopeptide repeat (PPR) superfamily protein
AT5G66520.1  1.8e-41   24.14     Tetratricopeptide repeat (TPR)-like superfamily protein
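
The E-values in the TAIR 10 summary rows span many orders of magnitude, so comparing them on a -log10 scale can make the separation between the best hit and the rest easier to see. A small Python sketch follows, with the values copied from the rows above. The variable and function names are illustrative assumptions, not anything the database provides.

```python
import math

# (match name, E-value, percent identity) copied from the TAIR 10 summary rows.
tair_hits = [
    ("AT1G25360.1", 5.6e-112, 32.61),
    ("AT2G22070.1", 2.8e-47, 27.65),
    ("AT1G08070.1", 1.2e-45, 22.15),
    ("AT4G37170.1", 1.1e-43, 23.52),
    ("AT5G66520.1", 1.8e-41, 24.14),
]


def neg_log10(evalue):
    """Express an E-value as orders of magnitude below 1."""
    return -math.log10(evalue)


# Rank hits from most to least significant (smallest E-value first).
ranked = sorted(tair_hits, key=lambda hit: hit[1])
best, runner_up = ranked[0], ranked[1]

# Gap between the two best hits, in orders of magnitude.
gap = neg_log10(best[1]) - neg_log10(runner_up[1])
print(best[0], runner_up[0], round(gap, 1))
```

On these values the best Arabidopsis hit, AT1G25360.1, sits roughly 65 orders of magnitude ahead of the next one, which is in line with the "At1g25360-like" wording in the cucurbit match descriptions.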
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                                    Source   Source Term     Source Description  Alignment
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   1.25.40.10      Tetratricopeptide repeat domain
    coord: 12..174, e-value: 4.7E-24, score: 87.3
    coord: 302..420, e-value: 5.7E-7, score: 31.2
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   1.25.40.10      Tetratricopeptide repeat domain
    coord: 175..301, e-value: 3.8E-25, score: 90.3
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756       TIGR00756
    coord: 115..146, e-value: 2.5E-5, score: 22.2
    coord: 225..250, e-value: 5.4E-4, score: 18.0
    coord: 299..326, e-value: 0.0018, score: 16.3
    coord: 255..288, e-value: 1.9E-7, score: 28.8
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041         PPR_2
    coord: 111..158, e-value: 2.6E-12, score: 46.7
    coord: 255..301, e-value: 1.3E-12, score: 47.7
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535         PPR
    coord: 83..103, e-value: 0.17, score: 12.2
    coord: 225..250, e-value: 1.6E-4, score: 21.7
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375         PPR
    coord: 253..287, score: 9.613118
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375         PPR
    coord: 222..252, score: 8.692369
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375         PPR
    coord: 111..145, score: 10.511944
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375         PPR
    coord: 288..326, score: 9.788499
IPR032867  DYW domain                                         PFAM     PF14432         DYW_deaminase
    coord: 373..429, e-value: 1.3E-20, score: 73.6
None       No IPR available                                   PANTHER  PTHR47929:SF13  BNACNNG04730D PROTEIN
    coord: 6..302
    coord: 307..372
None       No IPR available                                   PANTHER  PTHR47929       FAMILY NOT NAMED
    coord: 6..302
    coord: 307..372
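
Several of the domain hits above abut or overlap, so a per-residue view can be easier to read than the raw coordinate list. The sketch below merges the four PROSITE PS51375 (PPR) ranges into contiguous blocks and counts the residues they cover. The helper name `merge_intervals` is an illustrative assumption, and treating directly adjacent ranges (for example 222..252 and 253..287) as one block is a design choice of this sketch, not something InterPro reports.

```python
def merge_intervals(intervals):
    """Merge 1-based inclusive ranges that overlap or directly abut."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Extend the current block when the next range touches it.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(block) for block in merged]


# PROSITE PS51375 (PPR) coordinates copied from the InterPro rows.
prosite_ppr = [(253, 287), (222, 252), (111, 145), (288, 326)]

blocks = merge_intervals(prosite_ppr)
covered = sum(end - start + 1 for start, end in blocks)
print(blocks, covered)
```

For these inputs the merge yields two blocks, 111..145 and 222..326, covering 140 residues in total.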

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10004783.1  HG10004783.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0005515      protein binding
molecular_function  GO:0008270      zinc ion binding