HG10023238 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10023238
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationChr05: 32439003 .. 32440361 (+)
RNA-Seq ExpressionHG10023238
SyntenyHG10023238
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTTTTTTCCTTCGCTTCCCGCCGCTTCGTTCCGTCTATTCTCTCCTTCTCCGACGTGTTCCTTCCGCCGGCAACCGTATTCTTTTCCACCAAAACAATCAATCCCATTCCGGAGTCTCGTCCAACACAAACTAATTTCGACCCTCCTACTGTTCGCGAGGCGCTTGATTCTTACTGCAATGACTGGAGGCGATCGTACGAGTTTTTCAACTGGGTTGAATCAGAATGTAAGTTTGATCACACCACCGAGACCTACAATCGCATGCTGGATATTCTCGGTAAGTTCTTCGAGTTCGACCTTTCGTGGGTCTTGATTCACCGTATGCAACAATCCTCGTTTGCTTTACCGGATCACGCGACGTTTCGAATTTTGTTTAAGCGTTATGCGTTGGCGCATTTGGTTAGTGAAGCGATTGCTGCTTATGAGAGATTGCGGGAGTTTAAATTGAGGGATGAGACTTCGTTTTGTAATCTTATTGATGCACTCTGTGAGTCTAGACATGTTGTTGAAGCTCAGGATTTGTGTTTCGGGAAGAACAGGAAGTTGGATTGTGATGCGAGTACGAAGATTCATAACTTAATTCTTCGTGGTTGGTTTAAGATGGGGTGGTGGAGTAAGTGTAGAGAGTTTTGGGAAGAGATGGATAAGAAGGGTGTCCGTAAGGATTTGCATTCGTATTCGATATACATGGATATACAATGCAAGAGTGGGAAGCCTTGGAAGGCTGTTAAATTGTACAAGGAGATGAAAAAGAAGGGACTGAAATTGGATGTAGTGGCCTATAATACGGTGATTCATGCCATTGGGATTTCGGAAGGTGTTGATTTTGCCAGCCGGGTGTTTCATGAGATGAAGGAAATGGGATGTAAGCCTAACGTTGTGACTTGCAATACTATTATTAAGCTATTTTGTGAGAATGGAAGATTCAAGGATGCCCATGTGATGCTCGACCAAATGCTTAAGAAGGACTGTCCACCGAATGTTATCACCTATCATTGTTTTTTCAGGTCTCTTGAAAAGCCGAAAGAGATTCTTATGTTATTTGATAGGATGATTAAATTTGGGGTTCATCCAAAAGTGGATACCTATGTTATGCTCATGAGGAAATTTGGAAGATGGGGGTTTCTGAGACCAGTGTTTTTAGTGTGGAATAAGATGGAGGAACTTGGGTGTAGCCCAAATGAGTCTGCTTACAATGCTTTGATTGATGCTCTTGTGGAGAAGGGCATGATAGATATGGCTAGGAAGTATGACGAAGAGATGGTAGCGAAAGGTCTTTCGCCTAAGCTGAGAGAGGAATTGGGGACAAAGATGGTGAATGGTGGCTATCATGCCAATGTGAACTGCAACAAGTAG

mRNA sequence

ATGCTTTTTTCCTTCGCTTCCCGCCGCTTCGTTCCGTCTATTCTCTCCTTCTCCGACGTGTTCCTTCCGCCGGCAACCGTATTCTTTTCCACCAAAACAATCAATCCCATTCCGGAGTCTCGTCCAACACAAACTAATTTCGACCCTCCTACTGTTCGCGAGGCGCTTGATTCTTACTGCAATGACTGGAGGCGATCGTACGAGTTTTTCAACTGGGTTGAATCAGAATGTAAGTTTGATCACACCACCGAGACCTACAATCGCATGCTGGATATTCTCGGTAAGTTCTTCGAGTTCGACCTTTCGTGGGTCTTGATTCACCGTATGCAACAATCCTCGTTTGCTTTACCGGATCACGCGACGTTTCGAATTTTGTTTAAGCGTTATGCGTTGGCGCATTTGGTTAGTGAAGCGATTGCTGCTTATGAGAGATTGCGGGAGTTTAAATTGAGGGATGAGACTTCGTTTTGTAATCTTATTGATGCACTCTGTGAGTCTAGACATGTTGTTGAAGCTCAGGATTTGTGTTTCGGGAAGAACAGGAAGTTGGATTGTGATGCGAGTACGAAGATTCATAACTTAATTCTTCGTGGTTGGTTTAAGATGGGGTGGTGGAGTAAGTGTAGAGAGTTTTGGGAAGAGATGGATAAGAAGGGTGTCCGTAAGGATTTGCATTCGTATTCGATATACATGGATATACAATGCAAGAGTGGGAAGCCTTGGAAGGCTGTTAAATTGTACAAGGAGATGAAAAAGAAGGGACTGAAATTGGATGTAGTGGCCTATAATACGGTGATTCATGCCATTGGGATTTCGGAAGGTGTTGATTTTGCCAGCCGGGTGTTTCATGAGATGAAGGAAATGGGATGTAAGCCTAACGTTGTGACTTGCAATACTATTATTAAGCTATTTTGTGAGAATGGAAGATTCAAGGATGCCCATGTGATGCTCGACCAAATGCTTAAGAAGGACTGTCCACCGAATGTTATCACCTATCATTGTTTTTTCAGGTCTCTTGAAAAGCCGAAAGAGATTCTTATGTTATTTGATAGGATGATTAAATTTGGGGTTCATCCAAAAGTGGATACCTATGTTATGCTCATGAGGAAATTTGGAAGATGGGGGTTTCTGAGACCAGTGTTTTTAGTGTGGAATAAGATGGAGGAACTTGGGTGTAGCCCAAATGAGTCTGCTTACAATGCTTTGATTGATGCTCTTGTGGAGAAGGGCATGATAGATATGGCTAGGAAGTATGACGAAGAGATGGTAGCGAAAGGTCTTTCGCCTAAGCTGAGAGAGGAATTGGGGACAAAGATGGTGAATGGTGGCTATCATGCCAATGTGAACTGCAACAAGTAG

Coding sequence (CDS)

ATGCTTTTTTCCTTCGCTTCCCGCCGCTTCGTTCCGTCTATTCTCTCCTTCTCCGACGTGTTCCTTCCGCCGGCAACCGTATTCTTTTCCACCAAAACAATCAATCCCATTCCGGAGTCTCGTCCAACACAAACTAATTTCGACCCTCCTACTGTTCGCGAGGCGCTTGATTCTTACTGCAATGACTGGAGGCGATCGTACGAGTTTTTCAACTGGGTTGAATCAGAATGTAAGTTTGATCACACCACCGAGACCTACAATCGCATGCTGGATATTCTCGGTAAGTTCTTCGAGTTCGACCTTTCGTGGGTCTTGATTCACCGTATGCAACAATCCTCGTTTGCTTTACCGGATCACGCGACGTTTCGAATTTTGTTTAAGCGTTATGCGTTGGCGCATTTGGTTAGTGAAGCGATTGCTGCTTATGAGAGATTGCGGGAGTTTAAATTGAGGGATGAGACTTCGTTTTGTAATCTTATTGATGCACTCTGTGAGTCTAGACATGTTGTTGAAGCTCAGGATTTGTGTTTCGGGAAGAACAGGAAGTTGGATTGTGATGCGAGTACGAAGATTCATAACTTAATTCTTCGTGGTTGGTTTAAGATGGGGTGGTGGAGTAAGTGTAGAGAGTTTTGGGAAGAGATGGATAAGAAGGGTGTCCGTAAGGATTTGCATTCGTATTCGATATACATGGATATACAATGCAAGAGTGGGAAGCCTTGGAAGGCTGTTAAATTGTACAAGGAGATGAAAAAGAAGGGACTGAAATTGGATGTAGTGGCCTATAATACGGTGATTCATGCCATTGGGATTTCGGAAGGTGTTGATTTTGCCAGCCGGGTGTTTCATGAGATGAAGGAAATGGGATGTAAGCCTAACGTTGTGACTTGCAATACTATTATTAAGCTATTTTGTGAGAATGGAAGATTCAAGGATGCCCATGTGATGCTCGACCAAATGCTTAAGAAGGACTGTCCACCGAATGTTATCACCTATCATTGTTTTTTCAGGTCTCTTGAAAAGCCGAAAGAGATTCTTATGTTATTTGATAGGATGATTAAATTTGGGGTTCATCCAAAAGTGGATACCTATGTTATGCTCATGAGGAAATTTGGAAGATGGGGGTTTCTGAGACCAGTGTTTTTAGTGTGGAATAAGATGGAGGAACTTGGGTGTAGCCCAAATGAGTCTGCTTACAATGCTTTGATTGATGCTCTTGTGGAGAAGGGCATGATAGATATGGCTAGGAAGTATGACGAAGAGATGGTAGCGAAAGGTCTTTCGCCTAAGCTGAGAGAGGAATTGGGGACAAAGATGGTGAATGGTGGCTATCATGCCAATGTGAACTGCAACAAGTAG

Protein sequence

MLFSFASRRFVPSILSFSDVFLPPATVFFSTKTINPIPESRPTQTNFDPPTVREALDSYCNDWRRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWVLIHRMQQSSFALPDHATFRILFKRYALAHLVSEAIAAYERLREFKLRDETSFCNLIDALCESRHVVEAQDLCFGKNRKLDCDASTKIHNLILRGWFKMGWWSKCREFWEEMDKKGVRKDLHSYSIYMDIQCKSGKPWKAVKLYKEMKKKGLKLDVVAYNTVIHAIGISEGVDFASRVFHEMKEMGCKPNVVTCNTIIKLFCENGRFKDAHVMLDQMLKKDCPPNVITYHCFFRSLEKPKEILMLFDRMIKFGVHPKVDTYVMLMRKFGRWGFLRPVFLVWNKMEELGCSPNESAYNALIDALVEKGMIDMARKYDEEMVAKGLSPKLREELGTKMVNGGYHANVNCNK
Homology
BLAST of HG10023238 vs. NCBI nr
Match: XP_038896131.1 (pentatricopeptide repeat-containing protein At1g80550, mitochondrial [Benincasa hispida] >XP_038896132.1 pentatricopeptide repeat-containing protein At1g80550, mitochondrial [Benincasa hispida])

HSP 1 Score: 891.3 bits (2302), Expect = 3.4e-255
Identity = 428/461 (92.84%), Postives = 436/461 (94.58%), Query Frame = 0

Query: 1   MLFSFASRRFVPSILSFSDVFLPPATVFFSTKTINPI---------PESRPTQTNFDPPT 60
           ML SFASRRF PSI+SFSDVFLPPATVFFSTKTINPI         PESR T TNFDPPT
Sbjct: 1   MLLSFASRRFSPSIISFSDVFLPPATVFFSTKTINPIPEFGFKFDSPESRSTHTNFDPPT 60

Query: 61  VREALDSYCNDWRRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWVLIHRMQQ 120
           VREALDSYCNDWRRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWVLIHRMQQ
Sbjct: 61  VREALDSYCNDWRRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWVLIHRMQQ 120

Query: 121 SSFALPDHATFRILFKRYALAHLVSEAIAAYERLREFKLRDETSFCNLIDALCESRHVVE 180
           S FA PDHATFRILFKRYA  HLVSEAI  YERLREFKLRDETSFCNLIDALC+SRHVVE
Sbjct: 121 SPFASPDHATFRILFKRYASGHLVSEAIDVYERLREFKLRDETSFCNLIDALCDSRHVVE 180

Query: 181 AQDLCFGKNRKLDCDASTKIHNLILRGWFKMGWWSKCREFWEEMDKKGVRKDLHSYSIYM 240
           AQDLCFGKNRKLDCD+STKIHNL L GW KMGWWSKCREFWEEMDKKGVRKDLHSYSIYM
Sbjct: 181 AQDLCFGKNRKLDCDSSTKIHNLFLHGWLKMGWWSKCREFWEEMDKKGVRKDLHSYSIYM 240

Query: 241 DIQCKSGKPWKAVKLYKEMKKKGLKLDVVAYNTVIHAIGISEGVDFASRVFHEMKEMGCK 300
           DI CKSGKPWKAVKLYKEMKKKG+KLDVVAYNTVIHAIGISEGVDFASRVFHEMKE+GCK
Sbjct: 241 DILCKSGKPWKAVKLYKEMKKKGMKLDVVAYNTVIHAIGISEGVDFASRVFHEMKEIGCK 300

Query: 301 PNVVTCNTIIKLFCENGRFKDAHVMLDQMLKKDCPPNVITYHCFFRSLEKPKEILMLFDR 360
           PNVVTCN IIKLFCE+GRFKDAHVMLDQMLKKDCPPNVITYHCFFRSLEKPKEILMLFDR
Sbjct: 301 PNVVTCNIIIKLFCESGRFKDAHVMLDQMLKKDCPPNVITYHCFFRSLEKPKEILMLFDR 360

Query: 361 MIKFGVHPKVDTYVMLMRKFGRWGFLRPVFLVWNKMEELGCSPNESAYNALIDALVEKGM 420
           MIKFGVHPK+DTYVMLMRKFG+WGFLRPVFLVWNKMEELGCSPNESAYNALIDALVEKGM
Sbjct: 361 MIKFGVHPKMDTYVMLMRKFGKWGFLRPVFLVWNKMEELGCSPNESAYNALIDALVEKGM 420

Query: 421 IDMARKYDEEMVAKGLSPKLREELGTKMVNGGYHANVNCNK 453
           IDMARKYDEEMVAKGLSPKLREELGTKMVNGGYHANVN NK
Sbjct: 421 IDMARKYDEEMVAKGLSPKLREELGTKMVNGGYHANVNSNK 461

BLAST of HG10023238 vs. NCBI nr
Match: XP_022953157.1 (pentatricopeptide repeat-containing protein At1g80550, mitochondrial [Cucurbita moschata] >XP_022953158.1 pentatricopeptide repeat-containing protein At1g80550, mitochondrial [Cucurbita moschata] >XP_022953159.1 pentatricopeptide repeat-containing protein At1g80550, mitochondrial [Cucurbita moschata])

HSP 1 Score: 878.2 bits (2268), Expect = 3.0e-251
Identity = 422/461 (91.54%), Postives = 433/461 (93.93%), Query Frame = 0

Query: 1   MLFSFASRRFVPSILSFSDVFLPPATVFFSTKTINPI---------PESRPTQTNFDPPT 60
           ML SFASRRF PSI SFSDVFLP A + FSTKT NPI         PE RPTQTNFDPPT
Sbjct: 18  MLSSFASRRFPPSIFSFSDVFLPAAAILFSTKTTNPISEFGFNFNTPEDRPTQTNFDPPT 77

Query: 61  VREALDSYCNDWRRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWVLIHRMQQ 120
           VREALDSYCNDWRRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSW LI RM Q
Sbjct: 78  VREALDSYCNDWRRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWELIQRMLQ 137

Query: 121 SSFALPDHATFRILFKRYALAHLVSEAIAAYERLREFKLRDETSFCNLIDALCESRHVVE 180
           S FA PDHATFRI+FKRYA AHLVSEAIAAYERLREFKLRDETSFCNLID+LCE RHVVE
Sbjct: 138 SPFASPDHATFRIMFKRYASAHLVSEAIAAYERLREFKLRDETSFCNLIDSLCEYRHVVE 197

Query: 181 AQDLCFGKNRKLDCDASTKIHNLILRGWFKMGWWSKCREFWEEMDKKGVRKDLHSYSIYM 240
           AQDLCFGKNRKL+CDASTKIHNLILRGW KMGWWSKCREFWEEMDKKGVRKDLHSYSIYM
Sbjct: 198 AQDLCFGKNRKLNCDASTKIHNLILRGWLKMGWWSKCREFWEEMDKKGVRKDLHSYSIYM 257

Query: 241 DIQCKSGKPWKAVKLYKEMKKKGLKLDVVAYNTVIHAIGISEGVDFASRVFHEMKEMGCK 300
           DIQCKSGKPWKAVKLYKEMKKKG+KLDVVAYNTVIHAIGISEGVDFASRVFHEMKEMGCK
Sbjct: 258 DIQCKSGKPWKAVKLYKEMKKKGMKLDVVAYNTVIHAIGISEGVDFASRVFHEMKEMGCK 317

Query: 301 PNVVTCNTIIKLFCENGRFKDAHVMLDQMLKKDCPPNVITYHCFFRSLEKPKEILMLFDR 360
           PNVVTCNTIIKLFCENGRFKDAH+MLDQMLKKDC PNVITYHCFFRSLEKPKEILMLFDR
Sbjct: 318 PNVVTCNTIIKLFCENGRFKDAHMMLDQMLKKDCLPNVITYHCFFRSLEKPKEILMLFDR 377

Query: 361 MIKFGVHPKVDTYVMLMRKFGRWGFLRPVFLVWNKMEELGCSPNESAYNALIDALVEKGM 420
           MIK+GV PK+DTYVMLMRKFGRWGFLRPVF+VWNKMEELGCSP+ESAYN+LIDALVEKGM
Sbjct: 378 MIKYGVQPKMDTYVMLMRKFGRWGFLRPVFVVWNKMEELGCSPDESAYNSLIDALVEKGM 437

Query: 421 IDMARKYDEEMVAKGLSPKLREELGTKMVNGGYHANVNCNK 453
           IDMARKYDEEMVAKGLSPKLR ELGTKMVNGGYHANVNCNK
Sbjct: 438 IDMARKYDEEMVAKGLSPKLRAELGTKMVNGGYHANVNCNK 478

BLAST of HG10023238 vs. NCBI nr
Match: KAG7014158.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 877.5 bits (2266), Expect = 5.0e-251
Identity = 422/461 (91.54%), Postives = 432/461 (93.71%), Query Frame = 0

Query: 1   MLFSFASRRFVPSILSFSDVFLPPATVFFSTKTINPIPE---------SRPTQTNFDPPT 60
           ML SFASRRF PSI SFSDVFLP A + FSTKT NPI E          RPTQTNFDPPT
Sbjct: 1   MLSSFASRRFPPSIFSFSDVFLPAAAILFSTKTTNPISEFGFNFNSRDDRPTQTNFDPPT 60

Query: 61  VREALDSYCNDWRRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWVLIHRMQQ 120
           VREALDSYCNDWRRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSW LI RM Q
Sbjct: 61  VREALDSYCNDWRRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWELIQRMLQ 120

Query: 121 SSFALPDHATFRILFKRYALAHLVSEAIAAYERLREFKLRDETSFCNLIDALCESRHVVE 180
           S FA PDHATFRI+FKRYA AHLVSEAIAAYERLREFKLRDETSFCNLIDALCE RHVVE
Sbjct: 121 SPFASPDHATFRIMFKRYASAHLVSEAIAAYERLREFKLRDETSFCNLIDALCEYRHVVE 180

Query: 181 AQDLCFGKNRKLDCDASTKIHNLILRGWFKMGWWSKCREFWEEMDKKGVRKDLHSYSIYM 240
           AQDLCFGKNR L+C ASTKIHNLILRGW KMGWWSKCREFWEEMDKKGVRKDLHSYSIYM
Sbjct: 181 AQDLCFGKNRMLNCGASTKIHNLILRGWLKMGWWSKCREFWEEMDKKGVRKDLHSYSIYM 240

Query: 241 DIQCKSGKPWKAVKLYKEMKKKGLKLDVVAYNTVIHAIGISEGVDFASRVFHEMKEMGCK 300
           DIQCKSGKPWKAVKLYKEMKKKG+KLDVVAYNTVIHAIGISEGVDFASRVFHEMKEMGCK
Sbjct: 241 DIQCKSGKPWKAVKLYKEMKKKGMKLDVVAYNTVIHAIGISEGVDFASRVFHEMKEMGCK 300

Query: 301 PNVVTCNTIIKLFCENGRFKDAHVMLDQMLKKDCPPNVITYHCFFRSLEKPKEILMLFDR 360
           PNVVTCNTIIKLFCENGRFKDAH+MLDQMLKKDCPPNVITYHCFFRSLEKPKEILMLFDR
Sbjct: 301 PNVVTCNTIIKLFCENGRFKDAHMMLDQMLKKDCPPNVITYHCFFRSLEKPKEILMLFDR 360

Query: 361 MIKFGVHPKVDTYVMLMRKFGRWGFLRPVFLVWNKMEELGCSPNESAYNALIDALVEKGM 420
           MIK+GV PK+DTYVMLMRKFGRWGFLRPVF+VWNKMEELGCSP+ESAYN+LIDALVEKGM
Sbjct: 361 MIKYGVQPKMDTYVMLMRKFGRWGFLRPVFVVWNKMEELGCSPDESAYNSLIDALVEKGM 420

Query: 421 IDMARKYDEEMVAKGLSPKLREELGTKMVNGGYHANVNCNK 453
           IDMARKYDEEMVAKGLSPKLREELGTKMVNGGYHANVNCNK
Sbjct: 421 IDMARKYDEEMVAKGLSPKLREELGTKMVNGGYHANVNCNK 461

BLAST of HG10023238 vs. NCBI nr
Match: KAG6575615.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 877.5 bits (2266), Expect = 5.0e-251
Identity = 422/461 (91.54%), Postives = 432/461 (93.71%), Query Frame = 0

Query: 1   MLFSFASRRFVPSILSFSDVFLPPATVFFSTKTINPIPE---------SRPTQTNFDPPT 60
           ML SFASRRF PSI SFSDVFLP A + FSTKT NPI E          RPTQTNFDPPT
Sbjct: 1   MLSSFASRRFPPSIFSFSDVFLPAAAILFSTKTTNPISEFGFNFNSRDDRPTQTNFDPPT 60

Query: 61  VREALDSYCNDWRRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWVLIHRMQQ 120
           VREALDSYCNDWRRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSW LI RM Q
Sbjct: 61  VREALDSYCNDWRRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWELIQRMLQ 120

Query: 121 SSFALPDHATFRILFKRYALAHLVSEAIAAYERLREFKLRDETSFCNLIDALCESRHVVE 180
           S FA PDHATFRI+FKRYA AHLVSEAIAAYERLREFKLRDETSFCNLIDALCE RHVVE
Sbjct: 121 SPFASPDHATFRIMFKRYASAHLVSEAIAAYERLREFKLRDETSFCNLIDALCEYRHVVE 180

Query: 181 AQDLCFGKNRKLDCDASTKIHNLILRGWFKMGWWSKCREFWEEMDKKGVRKDLHSYSIYM 240
           AQDLCFGKNR L+C ASTKIHNLILRGW KMGWWSKCREFWEEMDKKGVRKDLHSYSIYM
Sbjct: 181 AQDLCFGKNRMLNCGASTKIHNLILRGWLKMGWWSKCREFWEEMDKKGVRKDLHSYSIYM 240

Query: 241 DIQCKSGKPWKAVKLYKEMKKKGLKLDVVAYNTVIHAIGISEGVDFASRVFHEMKEMGCK 300
           DIQCKSGKPWKAVKLYKEMKKKG+KLDVVAYNTVIHAIGISEGVDFASRVFHEMKEMGCK
Sbjct: 241 DIQCKSGKPWKAVKLYKEMKKKGMKLDVVAYNTVIHAIGISEGVDFASRVFHEMKEMGCK 300

Query: 301 PNVVTCNTIIKLFCENGRFKDAHVMLDQMLKKDCPPNVITYHCFFRSLEKPKEILMLFDR 360
           PNVVTCNTIIKLFCENGRFKDAH+MLDQMLKKDCPPNVITYHCFFRSLEKPKEILMLFDR
Sbjct: 301 PNVVTCNTIIKLFCENGRFKDAHMMLDQMLKKDCPPNVITYHCFFRSLEKPKEILMLFDR 360

Query: 361 MIKFGVHPKVDTYVMLMRKFGRWGFLRPVFLVWNKMEELGCSPNESAYNALIDALVEKGM 420
           MIK+GV PK+DTYVMLMRKFGRWGFLRPVF+VWNKMEELGCSP+ESAYN+LIDALVEKGM
Sbjct: 361 MIKYGVQPKMDTYVMLMRKFGRWGFLRPVFVVWNKMEELGCSPDESAYNSLIDALVEKGM 420

Query: 421 IDMARKYDEEMVAKGLSPKLREELGTKMVNGGYHANVNCNK 453
           IDMARKYDEEMVAKGLSPKLREELGTKMVNGGYHANVNCNK
Sbjct: 421 IDMARKYDEEMVAKGLSPKLREELGTKMVNGGYHANVNCNK 461

BLAST of HG10023238 vs. NCBI nr
Match: XP_023547614.1 (pentatricopeptide repeat-containing protein At1g80550, mitochondrial [Cucurbita pepo subsp. pepo] >XP_023547616.1 pentatricopeptide repeat-containing protein At1g80550, mitochondrial [Cucurbita pepo subsp. pepo])

HSP 1 Score: 873.2 bits (2255), Expect = 9.5e-250
Identity = 420/461 (91.11%), Postives = 430/461 (93.28%), Query Frame = 0

Query: 1   MLFSFASRRFVPSILSFSDVFLPPATVFFSTKTINPI---------PESRPTQTNFDPPT 60
           ML SFASRRF PSI SFSDVFLP A + FSTKT NPI         PE RPTQTNFDPPT
Sbjct: 18  MLSSFASRRFPPSIFSFSDVFLPAAAILFSTKTTNPISEFGFNFNTPEDRPTQTNFDPPT 77

Query: 61  VREALDSYCNDWRRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWVLIHRMQQ 120
           VREALDSYCNDWR SYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSW LI RM Q
Sbjct: 78  VREALDSYCNDWRCSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWELIQRMLQ 137

Query: 121 SSFALPDHATFRILFKRYALAHLVSEAIAAYERLREFKLRDETSFCNLIDALCESRHVVE 180
           S FA PDHATFRI+FKRYA AHLVSEAIAAYERLREFKLRDETSFCNLIDALCE RHVVE
Sbjct: 138 SPFASPDHATFRIMFKRYASAHLVSEAIAAYERLREFKLRDETSFCNLIDALCEYRHVVE 197

Query: 181 AQDLCFGKNRKLDCDASTKIHNLILRGWFKMGWWSKCREFWEEMDKKGVRKDLHSYSIYM 240
           AQDLCFGKNRKL+C ASTKIHNLILRGW KMGWWSKCREFWEEMDKKGVRKDLHSYSIYM
Sbjct: 198 AQDLCFGKNRKLNCGASTKIHNLILRGWLKMGWWSKCREFWEEMDKKGVRKDLHSYSIYM 257

Query: 241 DIQCKSGKPWKAVKLYKEMKKKGLKLDVVAYNTVIHAIGISEGVDFASRVFHEMKEMGCK 300
           DIQCKSGKPWKAVKLYKEMKKKG+KLDVVAYNTVIHAIGISEGVDFASRVF EMKEMGCK
Sbjct: 258 DIQCKSGKPWKAVKLYKEMKKKGMKLDVVAYNTVIHAIGISEGVDFASRVFQEMKEMGCK 317

Query: 301 PNVVTCNTIIKLFCENGRFKDAHVMLDQMLKKDCPPNVITYHCFFRSLEKPKEILMLFDR 360
           PNVVTCNTIIKLFCENGRFKDAH+MLDQMLKKDCPPNVITYHCFFRSLEKP EILMLFDR
Sbjct: 318 PNVVTCNTIIKLFCENGRFKDAHMMLDQMLKKDCPPNVITYHCFFRSLEKPNEILMLFDR 377

Query: 361 MIKFGVHPKVDTYVMLMRKFGRWGFLRPVFLVWNKMEELGCSPNESAYNALIDALVEKGM 420
           MIK+GV PK+DTYVMLMRKFGRWGFLRPVF+VWNKMEELGCSP+ESAYN+LIDALVEKGM
Sbjct: 378 MIKYGVQPKMDTYVMLMRKFGRWGFLRPVFVVWNKMEELGCSPDESAYNSLIDALVEKGM 437

Query: 421 IDMARKYDEEMVAKGLSPKLREELGTKMVNGGYHANVNCNK 453
           IDMARKYDEEMV KGLSPKLREELGTKMVNGGYHANVNCNK
Sbjct: 438 IDMARKYDEEMVVKGLSPKLREELGTKMVNGGYHANVNCNK 478

BLAST of HG10023238 vs. ExPASy Swiss-Prot
Match: Q9M8M3 (Pentatricopeptide repeat-containing protein At1g80550, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g80550 PE=1 SV=1)

HSP 1 Score: 550.4 bits (1417), Expect = 1.8e-155
Identity = 262/427 (61.36%), Postives = 332/427 (77.75%), Query Frame = 0

Query: 23  PPATVFFSTKTINPIPESR------PTQTNFDPPTVREALDSYCNDWRRSYEFFNWVESE 82
           P +    S K I+ + +++        Q+++D  TV EAL  Y NDW+++ EFFNWVE E
Sbjct: 15  PYSVRLLSVKPISNVDDAKFRSQEEEDQSSYDQKTVCEALTCYSNDWQKALEFFNWVERE 74

Query: 83  CKFDHTTETYNRMLDILGKFFEFDLSWVLIHRMQQSSFALPDHATFRILFKRYALAHLVS 142
             F HTTET+NR++DILGK+FEF++SW LI+RM  ++ ++P+H TFRI+FKRY  AHLV 
Sbjct: 75  SGFRHTTETFNRVIDILGKYFEFEISWALINRMIGNTESVPNHVTFRIVFKRYVTAHLVQ 134

Query: 143 EAIAAYERLREFKLRDETSFCNLIDALCESRHVVEAQDLCFGKN--RKLDCDASTKIHNL 202
           EAI AY++L +F LRDETSF NL+DALCE +HVVEA++LCFGKN        ++TKIHNL
Sbjct: 135 EAIDAYDKLDDFNLRDETSFYNLVDALCEHKHVVEAEELCFGKNVIGNGFSVSNTKIHNL 194

Query: 203 ILRGWFKMGWWSKCREFWEEMDKKGVRKDLHSYSIYMDIQCKSGKPWKAVKLYKEMKKKG 262
           ILRGW K+GWW KC+E+W++MD +GV KDL SYSIYMDI CKSGKPWKAVKLYKEMK + 
Sbjct: 195 ILRGWSKLGWWGKCKEYWKKMDTEGVTKDLFSYSIYMDIMCKSGKPWKAVKLYKEMKSRR 254

Query: 263 LKLDVVAYNTVIHAIGISEGVDFASRVFHEMKEMGCKPNVVTCNTIIKLFCENGRFKDAH 322
           +KLDVVAYNTVI AIG S+GV+F  RVF EM+E GC+PNV T NTIIKL CE+GR +DA+
Sbjct: 255 MKLDVVAYNTVIRAIGASQGVEFGIRVFREMRERGCEPNVATHNTIIKLLCEDGRMRDAY 314

Query: 323 VMLDQMLKKDCPPNVITYHCFFRSLEKPKEILMLFDRMIKFGVHPKVDTYVMLMRKFGRW 382
            MLD+M K+ C P+ ITY C F  LEKP EIL LF RMI+ GV PK+DTYVMLMRKF RW
Sbjct: 315 RMLDEMPKRGCQPDSITYMCLFSRLEKPSEILSLFGRMIRSGVRPKMDTYVMLMRKFERW 374

Query: 383 GFLRPVFLVWNKMEELGCSPNESAYNALIDALVEKGMIDMARKYDEEMVAKGLSPKLREE 442
           GFL+PV  VW  M+E G +P+ +AYNA+IDAL++KGM+DMAR+Y+EEM+ +GLSP+ R E
Sbjct: 375 GFLQPVLYVWKTMKESGDTPDSAAYNAVIDALIQKGMLDMAREYEEEMIERGLSPRRRPE 434

BLAST of HG10023238 vs. ExPASy Swiss-Prot
Match: Q9LFQ4 (Pentatricopeptide repeat-containing protein At5g15010, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g15010 PE=2 SV=2)

HSP 1 Score: 230.3 bits (586), Expect = 4.2e-59
Identity = 134/381 (35.17%), Postives = 202/381 (53.02%), Query Frame = 0

Query: 52  VREALDSYCNDWRRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWVLIHRMQQ 111
           V E L    NDW  ++ FF W   +  +  +   Y+ M+ ILGK  +FD +W LI  M++
Sbjct: 130 VVEILSRVRNDWETAFTFFVWAGKQQGYVRSVREYHSMISILGKMRKFDTAWTLIDEMRK 189

Query: 112 SSFALPDHATFRILFKRYALAHLVSEAIAAYERLREFKLR-DETSFCNLIDALCESRHVV 171
            S +L +  T  I+ ++Y   H V +AI  +   + FKL      F +L+ ALC  ++V 
Sbjct: 190 FSPSLVNSQTLLIMIRKYCAVHDVGKAINTFHAYKRFKLEMGIDDFQSLLSALCRYKNVS 249

Query: 172 EAQDLCFGKNRKLDCDASTKIHNLILRGWFK-MGWWSKCREFWEEMDKKGVRKDLHSYSI 231
           +A  L F    K   DA  K  N++L GW   +G   +    W EM   GV+ D+ SYS 
Sbjct: 250 DAGHLIFCNKDKYPFDA--KSFNIVLNGWCNVIGSPREAERVWMEMGNVGVKHDVVSYSS 309

Query: 232 YMDIQCKSGKPWKAVKLYKEMKKKGLKLDVVAYNTVIHAIGISEGVDFASRVFHEM-KEM 291
            +    K G   K +KL+  MKK+ ++ D   YN V+HA+  +  V  A  +   M +E 
Sbjct: 310 MISCYSKGGSLNKVLKLFDRMKKECIEPDRKVYNAVVHALAKASFVSEARNLMKTMEEEK 369

Query: 292 GCKPNVVTCNTIIKLFCENGRFKDAHVMLDQMLKKDCPPNVITYHCFFRSLEKPKEILML 351
           G +PNVVT N++IK  C+  + ++A  + D+ML+K   P + TYH F R L   +E+  L
Sbjct: 370 GIEPNVVTYNSLIKPLCKARKTEEAKQVFDEMLEKGLFPTIRTYHAFMRILRTGEEVFEL 429

Query: 352 FDRMIKFGVHPKVDTYVMLMRKFGRWGFLRPVFLVWNKMEELGCSPNESAYNALIDALVE 411
             +M K G  P V+TY+ML+RK  RW     V L+W++M+E    P+ S+Y  +I  L  
Sbjct: 430 LAKMRKMGCEPTVETYIMLIRKLCRWRDFDNVLLLWDEMKEKTVGPDLSSYIVMIHGLFL 489

Query: 412 KGMIDMARKYDEEMVAKGLSP 430
            G I+ A  Y +EM  KG+ P
Sbjct: 490 NGKIEEAYGYYKEMKDKGMRP 508

BLAST of HG10023238 vs. ExPASy Swiss-Prot
Match: Q9LIL5 (Putative pentatricopeptide repeat-containing protein At3g15200 OS=Arabidopsis thaliana OX=3702 GN=At3g15200 PE=3 SV=1)

HSP 1 Score: 197.2 bits (500), Expect = 4.0e-49
Identity = 118/387 (30.49%), Postives = 196/387 (50.65%), Query Frame = 0

Query: 52  VREALDSYCNDWRRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWVLIHRMQQ 111
           V E ++   +DW+ +Y     V  +     ++  YN +LD+LGK   F+    +   M +
Sbjct: 112 VLEVVNRNRSDWKPAYILSQLVVKQSVHLSSSMLYNEILDVLGKMRRFEEFHQVFDEMSK 171

Query: 112 SSFALPDHATFRILFKRYALAHLVSEAIAAYERLREFKLRDE-TSFCNLIDALCESRHVV 171
                 +  T+ +L  RYA AH V EA+  +ER +EF + D+  +F  L+  LC  +HV 
Sbjct: 172 RD-GFVNEKTYEVLLNRYAAAHKVDEAVGVFERRKEFGIDDDLVAFHGLLMWLCRYKHVE 231

Query: 172 EAQDLCFGKNRKLDCDASTKIHNLILRGWFKMGWWSKCREFWEEMDKKGVRKDLHSYSIY 231
            A+ L   + R+  CD   K  N+IL GW  +G   + + FW+++     R D+ SY   
Sbjct: 232 FAETLFCSRRREFGCD--IKAMNMILNGWCVLGNVHEAKRFWKDIIASKCRPDVVSYGTM 291

Query: 232 MDIQCKSGKPWKAVKLYKEMKKKGLKLDVVAYNTVIHAIGISEGVDFASRVFHEMKEMGC 291
           ++   K GK  KA++LY+ M       DV   N VI A+   + +  A  VF E+ E G 
Sbjct: 292 INALTKKGKLGKAMELYRAMWDTRRNPDVKICNNVIDALCFKKRIPEALEVFREISEKGP 351

Query: 292 KPNVVTCNTIIKLFCENGRFKDAHVMLDQMLKK--DCPPNVITYHCFFRSLEKPKEILML 351
            PNVVT N+++K  C+  R +    ++++M  K   C PN +T+    +  ++ K++ ++
Sbjct: 352 DPNVVTYNSLLKHLCKIRRTEKVWELVEEMELKGGSCSPNDVTFSYLLKYSQRSKDVDIV 411

Query: 352 FDRMIKFGVHPKVDTYVMLMRKFGRWGFLRPVFLVWNKMEELGCSPNESAYNALIDALVE 411
            +RM K       D Y ++ R + +W     V  +W++ME  G  P++  Y   I  L  
Sbjct: 412 LERMAKNKCEMTSDLYNLMFRLYVQWDKEEKVREIWSEMERSGLGPDQRTYTIRIHGLHT 471

Query: 412 KGMIDMARKYDEEMVAKGLSPKLREEL 436
           KG I  A  Y +EM++KG+ P+ R E+
Sbjct: 472 KGKIGEALSYFQEMMSKGMVPEPRTEM 495

BLAST of HG10023238 vs. ExPASy Swiss-Prot
Match: Q9SSR6 (Pentatricopeptide repeat-containing protein At1g52640, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g52640 PE=2 SV=1)

HSP 1 Score: 182.6 bits (462), Expect = 1.0e-44
Identity = 113/372 (30.38%), Postives = 192/372 (51.61%), Query Frame = 0

Query: 66  SYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWVLIHRMQQSSFALPDHATFRIL 125
           ++ FF W      F H+ E+Y+ +++ILG   +F L W  +   ++ ++       F I+
Sbjct: 85  AHRFFLWARRIPDFAHSLESYHILVEILGSSKQFALLWDFLIEAREYNYFEISSKVFWIV 144

Query: 126 FKRYALAHLVSEAIAAYERLREFKLRD-ETSFCNLIDALCESRHVVEAQDLCFGKNRKLD 185
           F+ Y+ A+L SEA  A+ R+ EF ++        L+ +LC+ +HV  AQ+  FGK +   
Sbjct: 145 FRAYSRANLPSEACRAFNRMVEFGIKPCVDDLDQLLHSLCDKKHVNHAQEF-FGKAKGFG 204

Query: 186 CDASTKIHNLILRGWFKMGWWSKCREFWEEMDKKGVRKDLHSYSIYMDIQCKSGKPWKAV 245
              S K +++++RGW ++   S  R+ ++EM ++    DL +Y+  +D  CKSG      
Sbjct: 205 IVPSAKTYSILVRGWARIRDASGARKVFDEMLERNCVVDLLAYNALLDALCKSGDVDGGY 264

Query: 246 KLYKEMKKKGLKLDVVAYNTVIHAIGISEGVDFASRVFHEMKEMGCKPNVVTCNTIIKLF 305
           K+++EM   GLK D  ++   IHA   +  V  A +V   MK     PNV T N IIK  
Sbjct: 265 KMFQEMGNLGLKPDAYSFAIFIHAYCDAGDVHSAYKVLDRMKRYDLVPNVYTFNHIIKTL 324

Query: 306 CENGRFKDAHVMLDQMLKKDCPP------NVITYHCFFRSLEKPKEILMLFDRMIKFGVH 365
           C+N +  DA+++LD+M++K   P      +++ YHC    + +  ++L    RM +    
Sbjct: 325 CKNEKVDDAYLLLDEMIQKGANPDTWTYNSIMAYHCDHCEVNRATKLL---SRMDRTKCL 384

Query: 366 PKVDTYVMLMRKFGRWGFLRPVFLVWNKMEELGCSPNESAYNALIDALV-EKGMIDMARK 425
           P   TY M+++   R G       +W  M E    P  + Y  +I  LV +KG ++ A +
Sbjct: 385 PDRHTYNMVLKLLIRIGRFDRATEIWEGMSERKFYPTVATYTVMIHGLVRKKGKLEEACR 444

Query: 426 YDEEMVAKGLSP 430
           Y E M+ +G+ P
Sbjct: 445 YFEMMIDEGIPP 452

BLAST of HG10023238 vs. ExPASy Swiss-Prot
Match: Q9C9A2 (Pentatricopeptide repeat-containing protein At1g71060, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g71060 PE=2 SV=1)

HSP 1 Score: 179.1 bits (453), Expect = 1.1e-43
Identity = 111/385 (28.83%), Postives = 190/385 (49.35%), Query Frame = 0

Query: 49  PPTVREALDSYCNDWRRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWVLIHR 108
           P  + E L    N    +   F W E++  F HTT  YN +++ LGK  +F L W L+  
Sbjct: 94  PALIEEVLKKLSNAGVLALSVFKWAENQKGFKHTTSNYNALIESLGKIKQFKLIWSLVDD 153

Query: 109 MQQSSFALPDHATFRILFKRYALAHLVSEAIAAYERLREFKLRDETS-FCNLIDALCESR 168
           M+     L    TF ++ +RYA A  V EAI A+ ++ EF  + E+S F  ++D L +SR
Sbjct: 154 MKAKK--LLSKETFALISRRYARARKVKEAIGAFHKMEEFGFKMESSDFNRMLDTLSKSR 213

Query: 169 HVVEAQDLCFGKNRKLDCDASTKIHNLILRGWFKMGWWSKCREFWEEMDKKGVRKDLHSY 228
           +V +AQ + F K +K   +   K + ++L GW +     +  E   EM  +G   D+ +Y
Sbjct: 214 NVGDAQKV-FDKMKKKRFEPDIKSYTILLEGWGQELNLLRVDEVNREMKDEGFEPDVVAY 273

Query: 229 SIYMDIQCKSGKPWKAVKLYKEMKKKGLKLDVVAYNTVIHAIGISEGVDFASRVFHEMKE 288
            I ++  CK+ K  +A++ + EM+++  K     + ++I+ +G  + ++ A   F   K 
Sbjct: 274 GIIINAHCKAKKYEEAIRFFNEMEQRNCKPSPHIFCSLINGLGSEKKLNDALEFFERSKS 333

Query: 289 MGCKPNVVTCNTIIKLFCENGRFKDAHVMLDQMLKKDCPPNVITYHCFFRSL---EKPKE 348
            G      T N ++  +C + R +DA+  +D+M  K   PN  TY      L   ++ KE
Sbjct: 334 SGFPLEAPTYNALVGAYCWSQRMEDAYKTVDEMRLKGVGPNARTYDIILHHLIRMQRSKE 393

Query: 349 ILMLFDRMIKFGVHPKVDTYVMLMRKFGRWGFLRPVFLVWNKMEELGCSPNESAYNALID 408
              ++  M      P V TY +++R F     L     +W++M+  G  P    +++LI 
Sbjct: 394 AYEVYQTM---SCEPTVSTYEIMVRMFCNKERLDMAIKIWDEMKGKGVLPGMHMFSSLIT 453

Query: 409 ALVEKGMIDMARKYDEEMVAKGLSP 430
           AL  +  +D A +Y  EM+  G+ P
Sbjct: 454 ALCHENKLDEACEYFNEMLDVGIRP 472

BLAST of HG10023238 vs. ExPASy TrEMBL
Match: A0A6J1GNU3 (pentatricopeptide repeat-containing protein At1g80550, mitochondrial OS=Cucurbita moschata OX=3662 GN=LOC111455781 PE=4 SV=1)

HSP 1 Score: 878.2 bits (2268), Expect = 1.4e-251
Identity = 422/461 (91.54%), Postives = 433/461 (93.93%), Query Frame = 0

Query: 1   MLFSFASRRFVPSILSFSDVFLPPATVFFSTKTINPI---------PESRPTQTNFDPPT 60
           ML SFASRRF PSI SFSDVFLP A + FSTKT NPI         PE RPTQTNFDPPT
Sbjct: 18  MLSSFASRRFPPSIFSFSDVFLPAAAILFSTKTTNPISEFGFNFNTPEDRPTQTNFDPPT 77

Query: 61  VREALDSYCNDWRRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWVLIHRMQQ 120
           VREALDSYCNDWRRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSW LI RM Q
Sbjct: 78  VREALDSYCNDWRRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWELIQRMLQ 137

Query: 121 SSFALPDHATFRILFKRYALAHLVSEAIAAYERLREFKLRDETSFCNLIDALCESRHVVE 180
           S FA PDHATFRI+FKRYA AHLVSEAIAAYERLREFKLRDETSFCNLID+LCE RHVVE
Sbjct: 138 SPFASPDHATFRIMFKRYASAHLVSEAIAAYERLREFKLRDETSFCNLIDSLCEYRHVVE 197

Query: 181 AQDLCFGKNRKLDCDASTKIHNLILRGWFKMGWWSKCREFWEEMDKKGVRKDLHSYSIYM 240
           AQDLCFGKNRKL+CDASTKIHNLILRGW KMGWWSKCREFWEEMDKKGVRKDLHSYSIYM
Sbjct: 198 AQDLCFGKNRKLNCDASTKIHNLILRGWLKMGWWSKCREFWEEMDKKGVRKDLHSYSIYM 257

Query: 241 DIQCKSGKPWKAVKLYKEMKKKGLKLDVVAYNTVIHAIGISEGVDFASRVFHEMKEMGCK 300
           DIQCKSGKPWKAVKLYKEMKKKG+KLDVVAYNTVIHAIGISEGVDFASRVFHEMKEMGCK
Sbjct: 258 DIQCKSGKPWKAVKLYKEMKKKGMKLDVVAYNTVIHAIGISEGVDFASRVFHEMKEMGCK 317

Query: 301 PNVVTCNTIIKLFCENGRFKDAHVMLDQMLKKDCPPNVITYHCFFRSLEKPKEILMLFDR 360
           PNVVTCNTIIKLFCENGRFKDAH+MLDQMLKKDC PNVITYHCFFRSLEKPKEILMLFDR
Sbjct: 318 PNVVTCNTIIKLFCENGRFKDAHMMLDQMLKKDCLPNVITYHCFFRSLEKPKEILMLFDR 377

Query: 361 MIKFGVHPKVDTYVMLMRKFGRWGFLRPVFLVWNKMEELGCSPNESAYNALIDALVEKGM 420
           MIK+GV PK+DTYVMLMRKFGRWGFLRPVF+VWNKMEELGCSP+ESAYN+LIDALVEKGM
Sbjct: 378 MIKYGVQPKMDTYVMLMRKFGRWGFLRPVFVVWNKMEELGCSPDESAYNSLIDALVEKGM 437

Query: 421 IDMARKYDEEMVAKGLSPKLREELGTKMVNGGYHANVNCNK 453
           IDMARKYDEEMVAKGLSPKLR ELGTKMVNGGYHANVNCNK
Sbjct: 438 IDMARKYDEEMVAKGLSPKLRAELGTKMVNGGYHANVNCNK 478

BLAST of HG10023238 vs. ExPASy TrEMBL
Match: A0A6J1JMZ1 (pentatricopeptide repeat-containing protein At1g80550, mitochondrial OS=Cucurbita maxima OX=3661 GN=LOC111488358 PE=4 SV=1)

HSP 1 Score: 872.5 bits (2253), Expect = 7.8e-250
Identity = 420/461 (91.11%), Postives = 430/461 (93.28%), Query Frame = 0

Query: 1   MLFSFASRRFVPSILSFSDVFLPPATVFFSTKTINPI---------PESRPTQTNFDPPT 60
           ML SFASRRF PSI SFSDVFLP A + FSTKT NPI         PE RPTQTNFDPPT
Sbjct: 18  MLSSFASRRFSPSIFSFSDVFLPAAAILFSTKTTNPISVFGFNFNTPEDRPTQTNFDPPT 77

Query: 61  VREALDSYCNDWRRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWVLIHRMQQ 120
           VREALDSYCNDWRRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSW LI RM Q
Sbjct: 78  VREALDSYCNDWRRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWELIQRMLQ 137

Query: 121 SSFALPDHATFRILFKRYALAHLVSEAIAAYERLREFKLRDETSFCNLIDALCESRHVVE 180
           S FA PDHATFRI+FKRYA AHLVSEAIAAYERLREFKLRDETSFCNLIDALCE RHVVE
Sbjct: 138 SPFASPDHATFRIMFKRYASAHLVSEAIAAYERLREFKLRDETSFCNLIDALCEYRHVVE 197

Query: 181 AQDLCFGKNRKLDCDASTKIHNLILRGWFKMGWWSKCREFWEEMDKKGVRKDLHSYSIYM 240
           AQDLCFGKNRKL+CDASTKIHNLILRGW KMGWWSKCREFWEEMDKKGVRKDLHSYSIYM
Sbjct: 198 AQDLCFGKNRKLNCDASTKIHNLILRGWLKMGWWSKCREFWEEMDKKGVRKDLHSYSIYM 257

Query: 241 DIQCKSGKPWKAVKLYKEMKKKGLKLDVVAYNTVIHAIGISEGVDFASRVFHEMKEMGCK 300
           DIQCKSGKPWKAVKLYKEMKKKG+KLDVVAYNTVIHAIGISEGVDFASRVF EMKEMGCK
Sbjct: 258 DIQCKSGKPWKAVKLYKEMKKKGMKLDVVAYNTVIHAIGISEGVDFASRVFQEMKEMGCK 317

Query: 301 PNVVTCNTIIKLFCENGRFKDAHVMLDQMLKKDCPPNVITYHCFFRSLEKPKEILMLFDR 360
           PNVVTCNTIIKLFCENGRFKDAH ML QMLKKDCPPNVITYHCFFRSLEKPKEILMLFDR
Sbjct: 318 PNVVTCNTIIKLFCENGRFKDAHKMLHQMLKKDCPPNVITYHCFFRSLEKPKEILMLFDR 377

Query: 361 MIKFGVHPKVDTYVMLMRKFGRWGFLRPVFLVWNKMEELGCSPNESAYNALIDALVEKGM 420
           MIK+GV PK+DTYVMLMRKFGRWGFLRPVF+VWNKMEELGCSP+ESAYN+LIDALVEKGM
Sbjct: 378 MIKYGVQPKMDTYVMLMRKFGRWGFLRPVFVVWNKMEELGCSPDESAYNSLIDALVEKGM 437

Query: 421 IDMARKYDEEMVAKGLSPKLREELGTKMVNGGYHANVNCNK 453
           ID ARKYDEEMVAKGLSPKLR ELGT+MVNGGYHANVNCNK
Sbjct: 438 IDRARKYDEEMVAKGLSPKLRAELGTEMVNGGYHANVNCNK 478

BLAST of HG10023238 vs. ExPASy TrEMBL
Match: A0A6J1D599 (pentatricopeptide repeat-containing protein At1g80550, mitochondrial OS=Momordica charantia OX=3673 GN=LOC111017413 PE=4 SV=1)

HSP 1 Score: 868.2 bits (2242), Expect = 1.5e-248
Identity = 416/461 (90.24%), Postives = 429/461 (93.06%), Query Frame = 0

Query: 1   MLFSFASRRFVPSILSFSDVFLPPATVFFSTKTINPI---------PESRPTQTNFDPPT 60
           ML S ASRRF P ILSFS+V LPP  + FSTKT  PI         PE RP QTNFDP T
Sbjct: 1   MLSSLASRRFPPFILSFSEVLLPPGNLLFSTKTAAPISELDCKFNSPERRPVQTNFDPST 60

Query: 61  VREALDSYCNDWRRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWVLIHRMQQ 120
           VREALDSYCNDWRRS+EFFNWVESECKFDHTTETYNRMLDILGKFFEFD+SWVL+ RMQQ
Sbjct: 61  VREALDSYCNDWRRSFEFFNWVESECKFDHTTETYNRMLDILGKFFEFDISWVLVQRMQQ 120

Query: 121 SSFALPDHATFRILFKRYALAHLVSEAIAAYERLREFKLRDETSFCNLIDALCESRHVVE 180
           S+FA PDHATFRILFKRYA AHLV+EAIAAYER REFKLRDETSFCNLIDALCE RHVVE
Sbjct: 121 SAFASPDHATFRILFKRYASAHLVTEAIAAYERSREFKLRDETSFCNLIDALCEYRHVVE 180

Query: 181 AQDLCFGKNRKLDCDASTKIHNLILRGWFKMGWWSKCREFWEEMDKKGVRKDLHSYSIYM 240
           AQDLCFGKN+K+DC+ASTKIHNLILRGW KMGWWSKCREFWEEMD KGVRKDLHSYSIYM
Sbjct: 181 AQDLCFGKNKKVDCNASTKIHNLILRGWLKMGWWSKCREFWEEMDNKGVRKDLHSYSIYM 240

Query: 241 DIQCKSGKPWKAVKLYKEMKKKGLKLDVVAYNTVIHAIGISEGVDFASRVFHEMKEMGCK 300
           DIQCKSGKPWKAVKLYKEMKKKG+KLDVVAYNTVIHAIGISEGVDFASRVFHEMKEMGCK
Sbjct: 241 DIQCKSGKPWKAVKLYKEMKKKGMKLDVVAYNTVIHAIGISEGVDFASRVFHEMKEMGCK 300

Query: 301 PNVVTCNTIIKLFCENGRFKDAHVMLDQMLKKDCPPNVITYHCFFRSLEKPKEILMLFDR 360
           PNVVTCNTIIKLFCENGRFKDAHVMLDQMLKKDCPPNVITYHCFFRS EKP EILMLFDR
Sbjct: 301 PNVVTCNTIIKLFCENGRFKDAHVMLDQMLKKDCPPNVITYHCFFRSFEKPNEILMLFDR 360

Query: 361 MIKFGVHPKVDTYVMLMRKFGRWGFLRPVFLVWNKMEELGCSPNESAYNALIDALVEKGM 420
           MIKFGVHPK+DTYVMLMRKFGRWGFLRPVFLVWNKMEELGCSPNESAYNALIDALVEKGM
Sbjct: 361 MIKFGVHPKMDTYVMLMRKFGRWGFLRPVFLVWNKMEELGCSPNESAYNALIDALVEKGM 420

Query: 421 IDMARKYDEEMVAKGLSPKLREELGTKMVNGGYHANVNCNK 453
           IDMARKYDEEMVAKGLSPKLREELGTKMVNGGYHANV CN+
Sbjct: 421 IDMARKYDEEMVAKGLSPKLREELGTKMVNGGYHANVICNE 461

BLAST of HG10023238 vs. ExPASy TrEMBL
Match: A0A1S3CFA1 (pentatricopeptide repeat-containing protein At1g80550, mitochondrial OS=Cucumis melo OX=3656 GN=LOC103499823 PE=4 SV=1)

HSP 1 Score: 856.7 bits (2212), Expect = 4.5e-245
Identity = 409/461 (88.72%), Postives = 428/461 (92.84%), Query Frame = 0

Query: 1   MLFSFASRRFVPSILSFSDVFLPPATVFFSTKTINPIP---------ESRPTQTNFDPPT 60
           MLFSFASRR           FLPPAT+FFSTKTINPIP         E RPT  NFDP T
Sbjct: 1   MLFSFASRR-----------FLPPATIFFSTKTINPIPEFDFKFNTSERRPTHANFDPST 60

Query: 61  VREALDSYCNDWRRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWVLIHRMQQ 120
           VREALDSYCNDW+RSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWVLI+RM+Q
Sbjct: 61  VREALDSYCNDWKRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWVLINRMRQ 120

Query: 121 SSFALPDHATFRILFKRYALAHLVSEAIAAYERLREFKLRDETSFCNLIDALCESRHVVE 180
           S FA PDH TFRILFKRYA AHLV+EAIAAYERLREFKLRDETSFCNLIDALCESRHV E
Sbjct: 121 SPFAPPDHTTFRILFKRYAAAHLVTEAIAAYERLREFKLRDETSFCNLIDALCESRHVDE 180

Query: 181 AQDLCFGKNRKLDCDASTKIHNLILRGWFKMGWWSKCREFWEEMDKKGVRKDLHSYSIYM 240
           AQ+LCFGKNR+LDCD+STKIHNLILRGW KMGWWSKCR+FWEEMDKKGVRKDLHSYSIYM
Sbjct: 181 AQELCFGKNRRLDCDSSTKIHNLILRGWLKMGWWSKCRDFWEEMDKKGVRKDLHSYSIYM 240

Query: 241 DIQCKSGKPWKAVKLYKEMKKKGLKLDVVAYNTVIHAIGISEGVDFASRVFHEMKEMGCK 300
           DIQCKSGKPWKAVKLYKEMK+KG+KLDVVAYNTVIHA+GISEGVDFASRVFHEMKEMGCK
Sbjct: 241 DIQCKSGKPWKAVKLYKEMKQKGMKLDVVAYNTVIHAVGISEGVDFASRVFHEMKEMGCK 300

Query: 301 PNVVTCNTIIKLFCENGRFKDAHVMLDQMLKKDCPPNVITYHCFFRSLEKPKEILMLFDR 360
           PNVVTCNT+IKLFCENGRFKDAH+MLDQMLK+DC PNVITYHCFFRSLEKPKEIL+LFDR
Sbjct: 301 PNVVTCNTVIKLFCENGRFKDAHMMLDQMLKRDCQPNVITYHCFFRSLEKPKEILILFDR 360

Query: 361 MIKFGVHPKVDTYVMLMRKFGRWGFLRPVFLVWNKMEELGCSPNESAYNALIDALVEKGM 420
           MIK+GVHPK+DTYVML+RKFGRWGFLRPVFLVWNKMEELGCSPNE AYNALIDALVEKGM
Sbjct: 361 MIKYGVHPKMDTYVMLLRKFGRWGFLRPVFLVWNKMEELGCSPNECAYNALIDALVEKGM 420

Query: 421 IDMARKYDEEMVAKGLSPKLREELGTKMVNGGYHANVNCNK 453
           IDMARKYDEEMVAKGLSPKLR ELGTKMVNGGYHANVNCNK
Sbjct: 421 IDMARKYDEEMVAKGLSPKLRVELGTKMVNGGYHANVNCNK 450

BLAST of HG10023238 vs. ExPASy TrEMBL
Match: A0A5A7UVK9 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold339G001970 PE=4 SV=1)

HSP 1 Score: 854.7 bits (2207), Expect = 1.7e-244
Identity = 408/461 (88.50%), Postives = 428/461 (92.84%), Query Frame = 0

Query: 1   MLFSFASRRFVPSILSFSDVFLPPATVFFSTKTINPIP---------ESRPTQTNFDPPT 60
           MLFSFASRR           FLPPAT+FFSTKTINPIP         E RPT  NFDP T
Sbjct: 1   MLFSFASRR-----------FLPPATIFFSTKTINPIPEFDFKFNTSERRPTHANFDPST 60

Query: 61  VREALDSYCNDWRRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWVLIHRMQQ 120
           VREALDSYCNDW+RSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWVLI+RM+Q
Sbjct: 61  VREALDSYCNDWKRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWVLINRMRQ 120

Query: 121 SSFALPDHATFRILFKRYALAHLVSEAIAAYERLREFKLRDETSFCNLIDALCESRHVVE 180
           S FA PDH TFRILFKRYA AHLV+EAIAAYERLREFKLRDETSFCNLIDALCESRHV E
Sbjct: 121 SPFAPPDHTTFRILFKRYASAHLVTEAIAAYERLREFKLRDETSFCNLIDALCESRHVDE 180

Query: 181 AQDLCFGKNRKLDCDASTKIHNLILRGWFKMGWWSKCREFWEEMDKKGVRKDLHSYSIYM 240
           AQ+LCFGKNR+LDCD+STKIHNLILRGW KMGWWSKCR+FWEEMDKKGVRKDLHSYSIYM
Sbjct: 181 AQELCFGKNRRLDCDSSTKIHNLILRGWLKMGWWSKCRDFWEEMDKKGVRKDLHSYSIYM 240

Query: 241 DIQCKSGKPWKAVKLYKEMKKKGLKLDVVAYNTVIHAIGISEGVDFASRVFHEMKEMGCK 300
           DI+CKSGKPWKAVKLYKEMK+KG+KLDVVAYNTVIHA+GISEGVDFASRVFHEMKEMGCK
Sbjct: 241 DIRCKSGKPWKAVKLYKEMKQKGMKLDVVAYNTVIHAVGISEGVDFASRVFHEMKEMGCK 300

Query: 301 PNVVTCNTIIKLFCENGRFKDAHVMLDQMLKKDCPPNVITYHCFFRSLEKPKEILMLFDR 360
           PNVVTCNT+IKLFCENGRFKDAH+MLDQMLK+DC PNVITYHCFFRSLEKPKEIL+LFDR
Sbjct: 301 PNVVTCNTVIKLFCENGRFKDAHMMLDQMLKRDCQPNVITYHCFFRSLEKPKEILILFDR 360

Query: 361 MIKFGVHPKVDTYVMLMRKFGRWGFLRPVFLVWNKMEELGCSPNESAYNALIDALVEKGM 420
           MIK+GVHPK+DTYVML+RKFGRWGFLRPVFLVWNKMEELGCSPNE AYNALIDALVEKGM
Sbjct: 361 MIKYGVHPKMDTYVMLLRKFGRWGFLRPVFLVWNKMEELGCSPNECAYNALIDALVEKGM 420

Query: 421 IDMARKYDEEMVAKGLSPKLREELGTKMVNGGYHANVNCNK 453
           IDMARKYDEEMVAKGLSPKLR ELGTKMVNGGYHANVNCNK
Sbjct: 421 IDMARKYDEEMVAKGLSPKLRVELGTKMVNGGYHANVNCNK 450

BLAST of HG10023238 vs. TAIR 10
Match: AT1G80550.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 550.4 bits (1417), Expect = 1.3e-156
Identity = 262/427 (61.36%), Postives = 332/427 (77.75%), Query Frame = 0

Query: 23  PPATVFFSTKTINPIPESR------PTQTNFDPPTVREALDSYCNDWRRSYEFFNWVESE 82
           P +    S K I+ + +++        Q+++D  TV EAL  Y NDW+++ EFFNWVE E
Sbjct: 15  PYSVRLLSVKPISNVDDAKFRSQEEEDQSSYDQKTVCEALTCYSNDWQKALEFFNWVERE 74

Query: 83  CKFDHTTETYNRMLDILGKFFEFDLSWVLIHRMQQSSFALPDHATFRILFKRYALAHLVS 142
             F HTTET+NR++DILGK+FEF++SW LI+RM  ++ ++P+H TFRI+FKRY  AHLV 
Sbjct: 75  SGFRHTTETFNRVIDILGKYFEFEISWALINRMIGNTESVPNHVTFRIVFKRYVTAHLVQ 134

Query: 143 EAIAAYERLREFKLRDETSFCNLIDALCESRHVVEAQDLCFGKN--RKLDCDASTKIHNL 202
           EAI AY++L +F LRDETSF NL+DALCE +HVVEA++LCFGKN        ++TKIHNL
Sbjct: 135 EAIDAYDKLDDFNLRDETSFYNLVDALCEHKHVVEAEELCFGKNVIGNGFSVSNTKIHNL 194

Query: 203 ILRGWFKMGWWSKCREFWEEMDKKGVRKDLHSYSIYMDIQCKSGKPWKAVKLYKEMKKKG 262
           ILRGW K+GWW KC+E+W++MD +GV KDL SYSIYMDI CKSGKPWKAVKLYKEMK + 
Sbjct: 195 ILRGWSKLGWWGKCKEYWKKMDTEGVTKDLFSYSIYMDIMCKSGKPWKAVKLYKEMKSRR 254

Query: 263 LKLDVVAYNTVIHAIGISEGVDFASRVFHEMKEMGCKPNVVTCNTIIKLFCENGRFKDAH 322
           +KLDVVAYNTVI AIG S+GV+F  RVF EM+E GC+PNV T NTIIKL CE+GR +DA+
Sbjct: 255 MKLDVVAYNTVIRAIGASQGVEFGIRVFREMRERGCEPNVATHNTIIKLLCEDGRMRDAY 314

Query: 323 VMLDQMLKKDCPPNVITYHCFFRSLEKPKEILMLFDRMIKFGVHPKVDTYVMLMRKFGRW 382
            MLD+M K+ C P+ ITY C F  LEKP EIL LF RMI+ GV PK+DTYVMLMRKF RW
Sbjct: 315 RMLDEMPKRGCQPDSITYMCLFSRLEKPSEILSLFGRMIRSGVRPKMDTYVMLMRKFERW 374

Query: 383 GFLRPVFLVWNKMEELGCSPNESAYNALIDALVEKGMIDMARKYDEEMVAKGLSPKLREE 442
           GFL+PV  VW  M+E G +P+ +AYNA+IDAL++KGM+DMAR+Y+EEM+ +GLSP+ R E
Sbjct: 375 GFLQPVLYVWKTMKESGDTPDSAAYNAVIDALIQKGMLDMAREYEEEMIERGLSPRRRPE 434

BLAST of HG10023238 vs. TAIR 10
Match: AT5G15010.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 230.3 bits (586), Expect = 3.0e-60
Identity = 134/381 (35.17%), Postives = 202/381 (53.02%), Query Frame = 0

Query: 52  VREALDSYCNDWRRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWVLIHRMQQ 111
           V E L    NDW  ++ FF W   +  +  +   Y+ M+ ILGK  +FD +W LI  M++
Sbjct: 130 VVEILSRVRNDWETAFTFFVWAGKQQGYVRSVREYHSMISILGKMRKFDTAWTLIDEMRK 189

Query: 112 SSFALPDHATFRILFKRYALAHLVSEAIAAYERLREFKLR-DETSFCNLIDALCESRHVV 171
            S +L +  T  I+ ++Y   H V +AI  +   + FKL      F +L+ ALC  ++V 
Sbjct: 190 FSPSLVNSQTLLIMIRKYCAVHDVGKAINTFHAYKRFKLEMGIDDFQSLLSALCRYKNVS 249

Query: 172 EAQDLCFGKNRKLDCDASTKIHNLILRGWFK-MGWWSKCREFWEEMDKKGVRKDLHSYSI 231
           +A  L F    K   DA  K  N++L GW   +G   +    W EM   GV+ D+ SYS 
Sbjct: 250 DAGHLIFCNKDKYPFDA--KSFNIVLNGWCNVIGSPREAERVWMEMGNVGVKHDVVSYSS 309

Query: 232 YMDIQCKSGKPWKAVKLYKEMKKKGLKLDVVAYNTVIHAIGISEGVDFASRVFHEM-KEM 291
            +    K G   K +KL+  MKK+ ++ D   YN V+HA+  +  V  A  +   M +E 
Sbjct: 310 MISCYSKGGSLNKVLKLFDRMKKECIEPDRKVYNAVVHALAKASFVSEARNLMKTMEEEK 369

Query: 292 GCKPNVVTCNTIIKLFCENGRFKDAHVMLDQMLKKDCPPNVITYHCFFRSLEKPKEILML 351
           G +PNVVT N++IK  C+  + ++A  + D+ML+K   P + TYH F R L   +E+  L
Sbjct: 370 GIEPNVVTYNSLIKPLCKARKTEEAKQVFDEMLEKGLFPTIRTYHAFMRILRTGEEVFEL 429

Query: 352 FDRMIKFGVHPKVDTYVMLMRKFGRWGFLRPVFLVWNKMEELGCSPNESAYNALIDALVE 411
             +M K G  P V+TY+ML+RK  RW     V L+W++M+E    P+ S+Y  +I  L  
Sbjct: 430 LAKMRKMGCEPTVETYIMLIRKLCRWRDFDNVLLLWDEMKEKTVGPDLSSYIVMIHGLFL 489

Query: 412 KGMIDMARKYDEEMVAKGLSP 430
            G I+ A  Y +EM  KG+ P
Sbjct: 490 NGKIEEAYGYYKEMKDKGMRP 508

BLAST of HG10023238 vs. TAIR 10
Match: AT3G15200.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 197.2 bits (500), Expect = 2.8e-50
Identity = 118/387 (30.49%), Postives = 196/387 (50.65%), Query Frame = 0

Query: 52  VREALDSYCNDWRRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWVLIHRMQQ 111
           V E ++   +DW+ +Y     V  +     ++  YN +LD+LGK   F+    +   M +
Sbjct: 112 VLEVVNRNRSDWKPAYILSQLVVKQSVHLSSSMLYNEILDVLGKMRRFEEFHQVFDEMSK 171

Query: 112 SSFALPDHATFRILFKRYALAHLVSEAIAAYERLREFKLRDE-TSFCNLIDALCESRHVV 171
                 +  T+ +L  RYA AH V EA+  +ER +EF + D+  +F  L+  LC  +HV 
Sbjct: 172 RD-GFVNEKTYEVLLNRYAAAHKVDEAVGVFERRKEFGIDDDLVAFHGLLMWLCRYKHVE 231

Query: 172 EAQDLCFGKNRKLDCDASTKIHNLILRGWFKMGWWSKCREFWEEMDKKGVRKDLHSYSIY 231
            A+ L   + R+  CD   K  N+IL GW  +G   + + FW+++     R D+ SY   
Sbjct: 232 FAETLFCSRRREFGCD--IKAMNMILNGWCVLGNVHEAKRFWKDIIASKCRPDVVSYGTM 291

Query: 232 MDIQCKSGKPWKAVKLYKEMKKKGLKLDVVAYNTVIHAIGISEGVDFASRVFHEMKEMGC 291
           ++   K GK  KA++LY+ M       DV   N VI A+   + +  A  VF E+ E G 
Sbjct: 292 INALTKKGKLGKAMELYRAMWDTRRNPDVKICNNVIDALCFKKRIPEALEVFREISEKGP 351

Query: 292 KPNVVTCNTIIKLFCENGRFKDAHVMLDQMLKK--DCPPNVITYHCFFRSLEKPKEILML 351
            PNVVT N+++K  C+  R +    ++++M  K   C PN +T+    +  ++ K++ ++
Sbjct: 352 DPNVVTYNSLLKHLCKIRRTEKVWELVEEMELKGGSCSPNDVTFSYLLKYSQRSKDVDIV 411

Query: 352 FDRMIKFGVHPKVDTYVMLMRKFGRWGFLRPVFLVWNKMEELGCSPNESAYNALIDALVE 411
            +RM K       D Y ++ R + +W     V  +W++ME  G  P++  Y   I  L  
Sbjct: 412 LERMAKNKCEMTSDLYNLMFRLYVQWDKEEKVREIWSEMERSGLGPDQRTYTIRIHGLHT 471

Query: 412 KGMIDMARKYDEEMVAKGLSPKLREEL 436
           KG I  A  Y +EM++KG+ P+ R E+
Sbjct: 472 KGKIGEALSYFQEMMSKGMVPEPRTEM 495

BLAST of HG10023238 vs. TAIR 10
Match: AT1G52640.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 182.6 bits (462), Expect = 7.2e-46
Identity = 113/372 (30.38%), Postives = 192/372 (51.61%), Query Frame = 0

Query: 66  SYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWVLIHRMQQSSFALPDHATFRIL 125
           ++ FF W      F H+ E+Y+ +++ILG   +F L W  +   ++ ++       F I+
Sbjct: 85  AHRFFLWARRIPDFAHSLESYHILVEILGSSKQFALLWDFLIEAREYNYFEISSKVFWIV 144

Query: 126 FKRYALAHLVSEAIAAYERLREFKLRD-ETSFCNLIDALCESRHVVEAQDLCFGKNRKLD 185
           F+ Y+ A+L SEA  A+ R+ EF ++        L+ +LC+ +HV  AQ+  FGK +   
Sbjct: 145 FRAYSRANLPSEACRAFNRMVEFGIKPCVDDLDQLLHSLCDKKHVNHAQEF-FGKAKGFG 204

Query: 186 CDASTKIHNLILRGWFKMGWWSKCREFWEEMDKKGVRKDLHSYSIYMDIQCKSGKPWKAV 245
              S K +++++RGW ++   S  R+ ++EM ++    DL +Y+  +D  CKSG      
Sbjct: 205 IVPSAKTYSILVRGWARIRDASGARKVFDEMLERNCVVDLLAYNALLDALCKSGDVDGGY 264

Query: 246 KLYKEMKKKGLKLDVVAYNTVIHAIGISEGVDFASRVFHEMKEMGCKPNVVTCNTIIKLF 305
           K+++EM   GLK D  ++   IHA   +  V  A +V   MK     PNV T N IIK  
Sbjct: 265 KMFQEMGNLGLKPDAYSFAIFIHAYCDAGDVHSAYKVLDRMKRYDLVPNVYTFNHIIKTL 324

Query: 306 CENGRFKDAHVMLDQMLKKDCPP------NVITYHCFFRSLEKPKEILMLFDRMIKFGVH 365
           C+N +  DA+++LD+M++K   P      +++ YHC    + +  ++L    RM +    
Sbjct: 325 CKNEKVDDAYLLLDEMIQKGANPDTWTYNSIMAYHCDHCEVNRATKLL---SRMDRTKCL 384

Query: 366 PKVDTYVMLMRKFGRWGFLRPVFLVWNKMEELGCSPNESAYNALIDALV-EKGMIDMARK 425
           P   TY M+++   R G       +W  M E    P  + Y  +I  LV +KG ++ A +
Sbjct: 385 PDRHTYNMVLKLLIRIGRFDRATEIWEGMSERKFYPTVATYTVMIHGLVRKKGKLEEACR 444

Query: 426 YDEEMVAKGLSP 430
           Y E M+ +G+ P
Sbjct: 445 YFEMMIDEGIPP 452

BLAST of HG10023238 vs. TAIR 10
Match: AT1G71060.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 179.1 bits (453), Expect = 8.0e-45
Identity = 111/385 (28.83%), Postives = 190/385 (49.35%), Query Frame = 0

Query: 49  PPTVREALDSYCNDWRRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWVLIHR 108
           P  + E L    N    +   F W E++  F HTT  YN +++ LGK  +F L W L+  
Sbjct: 94  PALIEEVLKKLSNAGVLALSVFKWAENQKGFKHTTSNYNALIESLGKIKQFKLIWSLVDD 153

Query: 109 MQQSSFALPDHATFRILFKRYALAHLVSEAIAAYERLREFKLRDETS-FCNLIDALCESR 168
           M+     L    TF ++ +RYA A  V EAI A+ ++ EF  + E+S F  ++D L +SR
Sbjct: 154 MKAKK--LLSKETFALISRRYARARKVKEAIGAFHKMEEFGFKMESSDFNRMLDTLSKSR 213

Query: 169 HVVEAQDLCFGKNRKLDCDASTKIHNLILRGWFKMGWWSKCREFWEEMDKKGVRKDLHSY 228
           +V +AQ + F K +K   +   K + ++L GW +     +  E   EM  +G   D+ +Y
Sbjct: 214 NVGDAQKV-FDKMKKKRFEPDIKSYTILLEGWGQELNLLRVDEVNREMKDEGFEPDVVAY 273

Query: 229 SIYMDIQCKSGKPWKAVKLYKEMKKKGLKLDVVAYNTVIHAIGISEGVDFASRVFHEMKE 288
            I ++  CK+ K  +A++ + EM+++  K     + ++I+ +G  + ++ A   F   K 
Sbjct: 274 GIIINAHCKAKKYEEAIRFFNEMEQRNCKPSPHIFCSLINGLGSEKKLNDALEFFERSKS 333

Query: 289 MGCKPNVVTCNTIIKLFCENGRFKDAHVMLDQMLKKDCPPNVITYHCFFRSL---EKPKE 348
            G      T N ++  +C + R +DA+  +D+M  K   PN  TY      L   ++ KE
Sbjct: 334 SGFPLEAPTYNALVGAYCWSQRMEDAYKTVDEMRLKGVGPNARTYDIILHHLIRMQRSKE 393

Query: 349 ILMLFDRMIKFGVHPKVDTYVMLMRKFGRWGFLRPVFLVWNKMEELGCSPNESAYNALID 408
              ++  M      P V TY +++R F     L     +W++M+  G  P    +++LI 
Sbjct: 394 AYEVYQTM---SCEPTVSTYEIMVRMFCNKERLDMAIKIWDEMKGKGVLPGMHMFSSLIT 453

Query: 409 ALVEKGMIDMARKYDEEMVAKGLSP 430
           AL  +  +D A +Y  EM+  G+ P
Sbjct: 454 ALCHENKLDEACEYFNEMLDVGIRP 472

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038896131.13.4e-25592.84pentatricopeptide repeat-containing protein At1g80550, mitochondrial [Benincasa ... [more]
XP_022953157.13.0e-25191.54pentatricopeptide repeat-containing protein At1g80550, mitochondrial [Cucurbita ... [more]
KAG7014158.15.0e-25191.54Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a... [more]
KAG6575615.15.0e-25191.54Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a... [more]
XP_023547614.19.5e-25091.11pentatricopeptide repeat-containing protein At1g80550, mitochondrial [Cucurbita ... [more]
Match NameE-valueIdentityDescription
Q9M8M31.8e-15561.36Pentatricopeptide repeat-containing protein At1g80550, mitochondrial OS=Arabidop... [more]
Q9LFQ44.2e-5935.17Pentatricopeptide repeat-containing protein At5g15010, mitochondrial OS=Arabidop... [more]
Q9LIL54.0e-4930.49Putative pentatricopeptide repeat-containing protein At3g15200 OS=Arabidopsis th... [more]
Q9SSR61.0e-4430.38Pentatricopeptide repeat-containing protein At1g52640, mitochondrial OS=Arabidop... [more]
Q9C9A21.1e-4328.83Pentatricopeptide repeat-containing protein At1g71060, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A6J1GNU31.4e-25191.54pentatricopeptide repeat-containing protein At1g80550, mitochondrial OS=Cucurbit... [more]
A0A6J1JMZ17.8e-25091.11pentatricopeptide repeat-containing protein At1g80550, mitochondrial OS=Cucurbit... [more]
A0A6J1D5991.5e-24890.24pentatricopeptide repeat-containing protein At1g80550, mitochondrial OS=Momordic... [more]
A0A1S3CFA14.5e-24588.72pentatricopeptide repeat-containing protein At1g80550, mitochondrial OS=Cucumis ... [more]
A0A5A7UVK91.7e-24488.50Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
Match NameE-valueIdentityDescription
AT1G80550.11.3e-15661.36Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G15010.13.0e-6035.17Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G15200.12.8e-5030.49Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G52640.17.2e-4630.38Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G71060.18.0e-4528.83Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 193..220
e-value: 0.0075
score: 16.4
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 348..407
e-value: 1.7E-5
score: 24.8
coord: 246..306
e-value: 1.3E-15
score: 57.2
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 260..294
e-value: 2.5E-5
score: 22.1
coord: 363..396
e-value: 3.1E-4
score: 18.7
coord: 398..429
e-value: 5.3E-5
score: 21.1
coord: 295..329
e-value: 5.4E-10
score: 36.8
coord: 226..259
e-value: 9.7E-5
score: 20.3
coord: 191..222
e-value: 1.7E-4
score: 19.6
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 258..292
score: 11.202506
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 360..394
score: 8.780059
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 293..327
score: 12.463056
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 188..222
score: 9.251395
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 395..429
score: 11.432693
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 223..257
score: 10.98328
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 178..272
e-value: 8.0E-18
score: 66.9
coord: 273..339
e-value: 2.0E-18
score: 68.9
coord: 340..434
e-value: 3.7E-18
score: 68.0
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 47..177
e-value: 5.6E-14
score: 53.9
NoneNo IPR availablePANTHERPTHR47942:SF22OSJNBA0033G05.8 PROTEINcoord: 25..444
NoneNo IPR availablePANTHERPTHR47942TETRATRICOPEPTIDE REPEAT (TPR)-LIKE SUPERFAMILY PROTEIN-RELATEDcoord: 25..444

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10023238.1HG10023238.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding