HG10009455 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10009455
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pentatricopeptide repeat-containing protein
Location: Chr06: 6145091 .. 6146779 (+)
RNA-Seq Expression: HG10009455
Synteny: HG10009455
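
The location field above can be turned into a feature length with a few lines of Python. This is only a sketch: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive (the GFF3 convention), which this page does not state explicitly.

```python
import re

# Matches strings such as "Chr06: 6145091 .. 6146779 (+)".
LOCATION_RE = re.compile(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Parse 'SEQID: START .. END (STRAND)' into a dict (hypothetical helper)."""
    match = LOCATION_RE.search(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: 1-based inclusive coordinates, so both ends are counted.
        "length": end - start + 1,
    }


loc = parse_location("Chr06: 6145091 .. 6146779 (+)")
print(loc["seqid"], loc["strand"], loc["length"])  # Chr06 + 1689
print(loc["length"] % 3 == 0)                       # True: a whole number of codons
```

Under that assumption the feature spans 1,689 bp, which is divisible by three.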
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTGAAAATAATTCGTGCAAAACTCTCTCCATCTTCCCACCAAAACTCAATCCATTCTAAAGCTTTCTCCATCTTGAATCAGTACTTGTTCAACAGCGCCATCGCTGTGTTCTCTATCAATTCAAGGAGGGTGAAGAGGCTAAAGATGGCGATGCTCAAGAGGCTAACTCAAATCCCTCAAATACCCATTTCTTCCCATTCGTTTCTGATTCTTGCAGCTCCTTTTTGTACTCATCTTCATCTTCATCCGTCTTCACAACCCACAACTGAAATCGAAAGAATCGCCAAAATTATCAACGACCACCCATTTCCCGACCACCCCCTTCAACCCATTCTTCTTCACCACATTCCATCTCCTCTTCCTTCAAACACTTTCCTCAATGACGTTCTTGGCCGCCTCTTTGCAGCCCATTCCAATGGCCTCAAAGCCTTAGAGTTTTTCAAATTCTGCCTTCACCATTCTCAAGCTTCTTCCCCAACTCCAGACGCTTTCGAGAAGACGCTTCACATTCTCGCCAGGATGAGGTATTTTGATCAATCATGGGAATTGATGCGTGAGATTGGACGAACTCACCCTTTTTTGCTTACTCTCAAGTCCATGAGCATCTTGCTTTCGAAGATCGCGAAGTTTCAATCATTCGAAGAAACTATCGAAGCGTTTCAGAGAATGGAGAATGAAGTGTTTGTTGGAAGGAAATTTGGTACTGAGGAATTTAATGTACTCCTTCGAGCTTTTTGCACTCAGAGACAGATGAAAGAGGCGCGATCAGTGTTCCAAAAGATGTATTCTCGATTTCCCCCAACTACTAAAACCATGAATCTCTTGCTTTTAGGTTTTAAGGAATCGACCGATGTTACTGCTGTTGAGCTTTTTTATCATGAGATGATTAAGAGAGGTTTCAAGCCAAATGCTGTGACTTATAGCATTAGAATTGATGCATATTGCAAAAGAGGTTGTTTTGTGGATGGTTTGAGAGTTCTTAAAGAAATGGAGAGGGCAAAATTTGAGCCAACATTGGAAACAATCACTACTCTGATCCATGGAGGAGGGTTAACAAAGGATAAAACCAAAGCACGCCAACTGTTTGATGAAATTACTCTGAGAAACTTATGTCCTGACATTGGGGCTTATAATGCTTTGATAAGTTCCTTGATTAGGTCTGATGATGTAAAGTGTGCAGCAGCTTTAATGGAAGATATGGAAGCTAAACACATTGGACATGACAGTCTGACCTATCATATGATGTTCTCAGGCCTGATGCGATTGGAGGATGTTGGTGGGTTTTATGAGTTGTATAGCAAGATGGTACGAAGAAATTTCGTGCCTAAAACACGAACGGTAATCATGATAATGAAGTTTTTCTGCGAAAATCGCCGAGTTGATTTGGGTTTGGAATTGTGGGGTTATCTGGTAGAGAAGGGTTACTGTCCTCATAGTCATGTTTTGGATCTGTTGGTGACAGGATTATGTGCCCGTGGAATGGTCCTTCAAGCATTTGAGTGCTCCAAACAAATGTTGGAGAGAGGGAGACAAATGAGTGAAGCAGCATTTCTGATCATGAAGAGATCTCTTTTGCAGGCACATGCAACAGACAATTATGGGGAGCTTGAACAGTTGAGGAAGAAGCTTAAAACTGTTTTGCCCCCACCAAACCAGCTTTCATCTGAGATTTCAGCATCCTCATAG

mRNA sequence

ATGTTGAAAATAATTCGTGCAAAACTCTCTCCATCTTCCCACCAAAACTCAATCCATTCTAAAGCTTTCTCCATCTTGAATCAGTACTTGTTCAACAGCGCCATCGCTGTGTTCTCTATCAATTCAAGGAGGGTGAAGAGGCTAAAGATGGCGATGCTCAAGAGGCTAACTCAAATCCCTCAAATACCCATTTCTTCCCATTCGTTTCTGATTCTTGCAGCTCCTTTTTGTACTCATCTTCATCTTCATCCGTCTTCACAACCCACAACTGAAATCGAAAGAATCGCCAAAATTATCAACGACCACCCATTTCCCGACCACCCCCTTCAACCCATTCTTCTTCACCACATTCCATCTCCTCTTCCTTCAAACACTTTCCTCAATGACGTTCTTGGCCGCCTCTTTGCAGCCCATTCCAATGGCCTCAAAGCCTTAGAGTTTTTCAAATTCTGCCTTCACCATTCTCAAGCTTCTTCCCCAACTCCAGACGCTTTCGAGAAGACGCTTCACATTCTCGCCAGGATGAGGTATTTTGATCAATCATGGGAATTGATGCGTGAGATTGGACGAACTCACCCTTTTTTGCTTACTCTCAAGTCCATGAGCATCTTGCTTTCGAAGATCGCGAAGTTTCAATCATTCGAAGAAACTATCGAAGCGTTTCAGAGAATGGAGAATGAAGTGTTTGTTGGAAGGAAATTTGGTACTGAGGAATTTAATGTACTCCTTCGAGCTTTTTGCACTCAGAGACAGATGAAAGAGGCGCGATCAGTGTTCCAAAAGATGTATTCTCGATTTCCCCCAACTACTAAAACCATGAATCTCTTGCTTTTAGGTTTTAAGGAATCGACCGATGTTACTGCTGTTGAGCTTTTTTATCATGAGATGATTAAGAGAGGTTTCAAGCCAAATGCTGTGACTTATAGCATTAGAATTGATGCATATTGCAAAAGAGGTTGTTTTGTGGATGGTTTGAGAGTTCTTAAAGAAATGGAGAGGGCAAAATTTGAGCCAACATTGGAAACAATCACTACTCTGATCCATGGAGGAGGGTTAACAAAGGATAAAACCAAAGCACGCCAACTGTTTGATGAAATTACTCTGAGAAACTTATGTCCTGACATTGGGGCTTATAATGCTTTGATAAGTTCCTTGATTAGGTCTGATGATGTAAAGTGTGCAGCAGCTTTAATGGAAGATATGGAAGCTAAACACATTGGACATGACAGTCTGACCTATCATATGATGTTCTCAGGCCTGATGCGATTGGAGGATGTTGGTGGGTTTTATGAGTTGTATAGCAAGATGGTACGAAGAAATTTCGTGCCTAAAACACGAACGGTAATCATGATAATGAAGTTTTTCTGCGAAAATCGCCGAGTTGATTTGGGTTTGGAATTGTGGGGTTATCTGGTAGAGAAGGGTTACTGTCCTCATAGTCATGTTTTGGATCTGTTGGTGACAGGATTATGTGCCCGTGGAATGGTCCTTCAAGCATTTGAGTGCTCCAAACAAATGTTGGAGAGAGGGAGACAAATGAGTGAAGCAGCATTTCTGATCATGAAGAGATCTCTTTTGCAGGCACATGCAACAGACAATTATGGGGAGCTTGAACAGTTGAGGAAGAAGCTTAAAACTGTTTTGCCCCCACCAAACCAGCTTTCATCTGAGATTTCAGCATCCTCATAG

Coding sequence (CDS)

ATGTTGAAAATAATTCGTGCAAAACTCTCTCCATCTTCCCACCAAAACTCAATCCATTCTAAAGCTTTCTCCATCTTGAATCAGTACTTGTTCAACAGCGCCATCGCTGTGTTCTCTATCAATTCAAGGAGGGTGAAGAGGCTAAAGATGGCGATGCTCAAGAGGCTAACTCAAATCCCTCAAATACCCATTTCTTCCCATTCGTTTCTGATTCTTGCAGCTCCTTTTTGTACTCATCTTCATCTTCATCCGTCTTCACAACCCACAACTGAAATCGAAAGAATCGCCAAAATTATCAACGACCACCCATTTCCCGACCACCCCCTTCAACCCATTCTTCTTCACCACATTCCATCTCCTCTTCCTTCAAACACTTTCCTCAATGACGTTCTTGGCCGCCTCTTTGCAGCCCATTCCAATGGCCTCAAAGCCTTAGAGTTTTTCAAATTCTGCCTTCACCATTCTCAAGCTTCTTCCCCAACTCCAGACGCTTTCGAGAAGACGCTTCACATTCTCGCCAGGATGAGGTATTTTGATCAATCATGGGAATTGATGCGTGAGATTGGACGAACTCACCCTTTTTTGCTTACTCTCAAGTCCATGAGCATCTTGCTTTCGAAGATCGCGAAGTTTCAATCATTCGAAGAAACTATCGAAGCGTTTCAGAGAATGGAGAATGAAGTGTTTGTTGGAAGGAAATTTGGTACTGAGGAATTTAATGTACTCCTTCGAGCTTTTTGCACTCAGAGACAGATGAAAGAGGCGCGATCAGTGTTCCAAAAGATGTATTCTCGATTTCCCCCAACTACTAAAACCATGAATCTCTTGCTTTTAGGTTTTAAGGAATCGACCGATGTTACTGCTGTTGAGCTTTTTTATCATGAGATGATTAAGAGAGGTTTCAAGCCAAATGCTGTGACTTATAGCATTAGAATTGATGCATATTGCAAAAGAGGTTGTTTTGTGGATGGTTTGAGAGTTCTTAAAGAAATGGAGAGGGCAAAATTTGAGCCAACATTGGAAACAATCACTACTCTGATCCATGGAGGAGGGTTAACAAAGGATAAAACCAAAGCACGCCAACTGTTTGATGAAATTACTCTGAGAAACTTATGTCCTGACATTGGGGCTTATAATGCTTTGATAAGTTCCTTGATTAGGTCTGATGATGTAAAGTGTGCAGCAGCTTTAATGGAAGATATGGAAGCTAAACACATTGGACATGACAGTCTGACCTATCATATGATGTTCTCAGGCCTGATGCGATTGGAGGATGTTGGTGGGTTTTATGAGTTGTATAGCAAGATGGTACGAAGAAATTTCGTGCCTAAAACACGAACGGTAATCATGATAATGAAGTTTTTCTGCGAAAATCGCCGAGTTGATTTGGGTTTGGAATTGTGGGGTTATCTGGTAGAGAAGGGTTACTGTCCTCATAGTCATGTTTTGGATCTGTTGGTGACAGGATTATGTGCCCGTGGAATGGTCCTTCAAGCATTTGAGTGCTCCAAACAAATGTTGGAGAGAGGGAGACAAATGAGTGAAGCAGCATTTCTGATCATGAAGAGATCTCTTTTGCAGGCACATGCAACAGACAATTATGGGGAGCTTGAACAGTTGAGGAAGAAGCTTAAAACTGTTTTGCCCCCACCAAACCAGCTTTCATCTGAGATTTCAGCATCCTCATAG

Protein sequence

MLKIIRAKLSPSSHQNSIHSKAFSILNQYLFNSAIAVFSINSRRVKRLKMAMLKRLTQIPQIPISSHSFLILAAPFCTHLHLHPSSQPTTEIERIAKIINDHPFPDHPLQPILLHHIPSPLPSNTFLNDVLGRLFAAHSNGLKALEFFKFCLHHSQASSPTPDAFEKTLHILARMRYFDQSWELMREIGRTHPFLLTLKSMSILLSKIAKFQSFEETIEAFQRMENEVFVGRKFGTEEFNVLLRAFCTQRQMKEARSVFQKMYSRFPPTTKTMNLLLLGFKESTDVTAVELFYHEMIKRGFKPNAVTYSIRIDAYCKRGCFVDGLRVLKEMERAKFEPTLETITTLIHGGGLTKDKTKARQLFDEITLRNLCPDIGAYNALISSLIRSDDVKCAAALMEDMEAKHIGHDSLTYHMMFSGLMRLEDVGGFYELYSKMVRRNFVPKTRTVIMIMKFFCENRRVDLGLELWGYLVEKGYCPHSHVLDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIMKRSLLQAHATDNYGELEQLRKKLKTVLPPPNQLSSEISASS
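
The relationship between the coding sequence and the protein sequence above can be checked with a small translation routine. The sketch below assumes the standard genetic code (NCBI translation table 1) and uses only the first 30 nucleotides of the CDS as input; the function name `translate` is illustrative and not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered over the bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS listed above.
print(translate("ATGTTGAAAATAATTCGTGCAAAACTCTCT"))  # MLKIIRAKLS
```

The output matches the first ten residues of the protein sequence shown on this page.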
Homology
BLAST of HG10009455 vs. NCBI nr
Match: XP_038876095.1 (pentatricopeptide repeat-containing protein At3g61360 [Benincasa hispida] >XP_038876096.1 pentatricopeptide repeat-containing protein At3g61360 [Benincasa hispida])

HSP 1 Score: 944.9 bits (2441), Expect = 3.2e-271
Identity = 468/513 (91.23%), Positives = 487/513 (94.93%), Query Frame = 0

Query: 50  MAMLKRLTQIPQIPISSHSFLILAAPFCTHLHLHPSSQPTTEIERIAKIINDHPFPDHPL 109
           MA+L+RLTQIPQ+PIS HSFLI AAPFCTH+HLHPSSQP TE+ERIAKIINDHPFPDHPL
Sbjct: 1   MAILQRLTQIPQLPISFHSFLIYAAPFCTHIHLHPSSQPKTEVERIAKIINDHPFPDHPL 60

Query: 110 QPILLHHIPSPLPSNTFLNDVLGRLFAAHSNGLKALEFFKFCLHHSQASSPTPDAFEKTL 169
            P LLHHIPSPLPSNTFLNDVLGRLFAAHSNGLKALE FKFCLHHSQ  SPTPDAFEKTL
Sbjct: 61  HPTLLHHIPSPLPSNTFLNDVLGRLFAAHSNGLKALELFKFCLHHSQ-PSPTPDAFEKTL 120

Query: 170 HILARMRYFDQSWELMREIGRTHPFLLTLKSMSILLSKIAKFQSFEETIEAFQRMENEVF 229
           HILARMRYFDQSWELMREI RTHPFLLTLKSMSIL+SKIAKFQSFEETIEAF RMENEVF
Sbjct: 121 HILARMRYFDQSWELMREIQRTHPFLLTLKSMSILISKIAKFQSFEETIEAFHRMENEVF 180

Query: 230 VGRKFGTEEFNVLLRAFCTQRQMKEARSVFQKMYSRFPPTTKTMNLLLLGFKESTDVTAV 289
           VGRKFG EEFNVLLRAFCTQRQMKEARSVFQKMYSRFPP TKTMNLLLLGFKES+DVTAV
Sbjct: 181 VGRKFGIEEFNVLLRAFCTQRQMKEARSVFQKMYSRFPPNTKTMNLLLLGFKESSDVTAV 240

Query: 290 ELFYHEMIKRGFKPNAVTYSIRIDAYCKRGCFVDGLRVLKEMERAKFEPTLETITTLIHG 349
           ELFYHEMI+RGFKPNAVTYSIRIDAYCKRG FVDGLRV +EMERAK EPTLETITTLIHG
Sbjct: 241 ELFYHEMIRRGFKPNAVTYSIRIDAYCKRGYFVDGLRVFEEMERAKLEPTLETITTLIHG 300

Query: 350 GGLTKDKTKARQLFDEITLRNLCPDIGAYNALISSLIRSDDVKCAAALMEDMEAKHIGHD 409
            G+ KDKTKARQLFDEITLRNLCPDIGAYNALISSLIRS+DVK AAALMEDMEAKHIGHD
Sbjct: 301 AGVAKDKTKARQLFDEITLRNLCPDIGAYNALISSLIRSNDVKSAAALMEDMEAKHIGHD 360

Query: 410 SLTYHMMFSGLMRLEDVGGFYELYSKMVRRNFVPKTRTVIMIMKFFCENRRVDLGLELWG 469
           S+TYHM+FSGLMRLEDVGGFYELYSKM+R+NFVPKTRTV+MIMKF CENRRVDLGL+ WG
Sbjct: 361 SMTYHMLFSGLMRLEDVGGFYELYSKMIRQNFVPKTRTVVMIMKFLCENRRVDLGLDFWG 420

Query: 470 YLVEKGYCPHSHVLDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIMKRSLLQAH 529
           YLVEKGYCPHSHVLDLLVTGLCARGMVLQA ECSKQMLERGRQMSEAAFLIMKR+LLQAH
Sbjct: 421 YLVEKGYCPHSHVLDLLVTGLCARGMVLQASECSKQMLERGRQMSEAAFLIMKRTLLQAH 480

Query: 530 ATDNYGELEQLRKKLKTVLPPPNQLSSEISASS 563
           ATD YGELEQLR KL+TVLPPP QLS EI ASS
Sbjct: 481 ATDKYGELEQLRNKLQTVLPPPKQLSFEIPASS 512
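
The percentages in an HSP summary line follow directly from the match counts and the alignment length. As a minimal sketch, the hypothetical helper `hsp_percentages` below recomputes them from the summary line of the hit above; the regular expression is an assumption about the line layout seen on this page, not a general BLAST parser.

```python
import re

# Captures "Label = matches/alignment_length" pairs from an HSP summary line.
STAT_RE = re.compile(r"(\w+) = (\d+)/(\d+)")


def hsp_percentages(line):
    """Return {label: percentage} recomputed from the raw counts."""
    return {
        label: round(100 * int(matches) / int(length), 2)
        for label, matches, length in STAT_RE.findall(line)
    }


line = "Identity = 468/513 (91.23%), Positives = 487/513 (94.93%), Query Frame = 0"
print(hsp_percentages(line))  # {'Identity': 91.23, 'Positives': 94.93}
```

The "Query Frame" field is skipped because it carries no fraction.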

BLAST of HG10009455 vs. NCBI nr
Match: XP_016898809.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g61360 [Cucumis melo])

HSP 1 Score: 906.0 bits (2340), Expect = 1.6e-259
Identity = 456/515 (88.54%), Positives = 476/515 (92.43%), Query Frame = 0

Query: 47  RLKMAMLKRLTQIPQIPISSHSFLILAAPFCTHLHLHPSSQPTTEIERIAKIINDHPFPD 106
           +LKMA+L +LTQIPQ PISSHSFLI AA +CTHLH     + TTEIERIAKIINDHPFPD
Sbjct: 2   KLKMAILMKLTQIPQKPISSHSFLIYAASYCTHLH-----KSTTEIERIAKIINDHPFPD 61

Query: 107 HPLQPILLHHIPSPLPSNTFLNDVLGRLFAAHSNGLKALEFFKFCLHHSQASSPTPDAFE 166
            PL P LLH IPSPLPSNTFLNDVLG LFAAHSNGLKALEFFKFCLHHSQA SPTPDAFE
Sbjct: 62  QPLHPTLLHLIPSPLPSNTFLNDVLGHLFAAHSNGLKALEFFKFCLHHSQA-SPTPDAFE 121

Query: 167 KTLHILARMRYFDQSWELMREIGRTHPFLLTLKSMSILLSKIAKFQSFEETIEAFQRMEN 226
           KTLHILARMRYFDQSWELMREI +THPFLLTLKSMSILLS+IAKF+SFEETIEAFQRMEN
Sbjct: 122 KTLHILARMRYFDQSWELMREIRQTHPFLLTLKSMSILLSRIAKFRSFEETIEAFQRMEN 181

Query: 227 EVFVGRKFGTEEFNVLLRAFCTQRQMKEARSVFQKMYSRFPPTTKTMNLLLLGFKESTDV 286
           EVFVGRKFGTEEFNVLLRAFCTQRQMKEARSVF KMYSRFPP  KT+NLLLLGFKES+D+
Sbjct: 182 EVFVGRKFGTEEFNVLLRAFCTQRQMKEARSVFHKMYSRFPPNIKTINLLLLGFKESSDI 241

Query: 287 TAVELFYHEMIKRGFKPNAVTYSIRIDAYCKRGCFVDGLRVLKEMERAKFEPTLETITTL 346
           T+VELFYHEMIKRGFKPNAVTYSIRIDAYCK+GCFVDGLRV KEMERAK EPTLETITTL
Sbjct: 242 TSVELFYHEMIKRGFKPNAVTYSIRIDAYCKKGCFVDGLRVFKEMERAKLEPTLETITTL 301

Query: 347 IHGGGLTKDKTKARQLFDEITLRNLCPDIGAYNALISSLIRSDDVKCAAALMEDMEAKHI 406
           IHG G+ KDKTKARQLFDEI LRNLCPDIGAYNALISSLIRS DVK AAALMEDMEAKHI
Sbjct: 302 IHGAGIVKDKTKARQLFDEIPLRNLCPDIGAYNALISSLIRSGDVKSAAALMEDMEAKHI 361

Query: 407 GHDSLTYHMMFSGLMRLEDVGGFYELYSKMVRRNFVPKTRTVIMIMKFFCENRRVDLGLE 466
           GHDS+TYHMMF GL+RLEDVGGF ELYSKMVR+NFVPKTRT + IMKFFCENRRVDL L 
Sbjct: 362 GHDSVTYHMMFLGLIRLEDVGGFRELYSKMVRQNFVPKTRTTVTIMKFFCENRRVDLALG 421

Query: 467 LWGYLVEKGYCPHSHVLDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIMKRSLL 526
            W YLVEKGYCPHSHVLDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIM+RSLL
Sbjct: 422 FWAYLVEKGYCPHSHVLDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIMERSLL 481

Query: 527 QAHATDNYGELEQLRKKLKTVLPPPNQLSSEISAS 562
           +A ATD YGELEQLRKKLKTVLPPPNQL SEI AS
Sbjct: 482 KARATDKYGELEQLRKKLKTVLPPPNQLLSEIPAS 510

BLAST of HG10009455 vs. NCBI nr
Match: KAA0055168.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 902.5 bits (2331), Expect = 1.8e-258
Identity = 454/512 (88.67%), Positives = 473/512 (92.38%), Query Frame = 0

Query: 50  MAMLKRLTQIPQIPISSHSFLILAAPFCTHLHLHPSSQPTTEIERIAKIINDHPFPDHPL 109
           MA+L +LTQIPQ PISSHSFLI AA +CTHLH     + TTEIERIAKIINDHPFPD PL
Sbjct: 1   MAILMKLTQIPQKPISSHSFLIYAASYCTHLH-----KSTTEIERIAKIINDHPFPDQPL 60

Query: 110 QPILLHHIPSPLPSNTFLNDVLGRLFAAHSNGLKALEFFKFCLHHSQASSPTPDAFEKTL 169
            P  LH IPSPLPSNTFLNDVLG LFAAHSNGLKALEFFKFCLHHSQA SPTPDAFEKTL
Sbjct: 61  HPTFLHLIPSPLPSNTFLNDVLGHLFAAHSNGLKALEFFKFCLHHSQA-SPTPDAFEKTL 120

Query: 170 HILARMRYFDQSWELMREIGRTHPFLLTLKSMSILLSKIAKFQSFEETIEAFQRMENEVF 229
           HILARMRYFDQSWELMREI +THPFLLTLKSMSILLS+IAKF+SFEETIEAFQRMENEVF
Sbjct: 121 HILARMRYFDQSWELMREIRQTHPFLLTLKSMSILLSRIAKFRSFEETIEAFQRMENEVF 180

Query: 230 VGRKFGTEEFNVLLRAFCTQRQMKEARSVFQKMYSRFPPTTKTMNLLLLGFKESTDVTAV 289
           VGRKFGTEEFNVLLRAFCTQRQMKEARSVF KMYSRFPP  KT+NLLLLGFKES+D+T+V
Sbjct: 181 VGRKFGTEEFNVLLRAFCTQRQMKEARSVFHKMYSRFPPNIKTINLLLLGFKESSDITSV 240

Query: 290 ELFYHEMIKRGFKPNAVTYSIRIDAYCKRGCFVDGLRVLKEMERAKFEPTLETITTLIHG 349
           ELFYHEMIKRGFKPNAVTYSIRIDAYCK+GCFVDGLRV KEMERAK EPTLETITTLIHG
Sbjct: 241 ELFYHEMIKRGFKPNAVTYSIRIDAYCKKGCFVDGLRVFKEMERAKLEPTLETITTLIHG 300

Query: 350 GGLTKDKTKARQLFDEITLRNLCPDIGAYNALISSLIRSDDVKCAAALMEDMEAKHIGHD 409
            G+ KDKTKARQLFDEI LRNLCPDIGAYNALISSLIRS DVK AAALMEDMEAKHIGHD
Sbjct: 301 AGIVKDKTKARQLFDEIPLRNLCPDIGAYNALISSLIRSGDVKSAAALMEDMEAKHIGHD 360

Query: 410 SLTYHMMFSGLMRLEDVGGFYELYSKMVRRNFVPKTRTVIMIMKFFCENRRVDLGLELWG 469
           S+TYHMMF GL+RLEDVGGF ELYSKMVR+NFVPKTRT +MIMKFFCENRRVDL L  W 
Sbjct: 361 SVTYHMMFLGLIRLEDVGGFRELYSKMVRQNFVPKTRTTVMIMKFFCENRRVDLALGFWA 420

Query: 470 YLVEKGYCPHSHVLDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIMKRSLLQAH 529
           YLVEKGYCPHSHVLDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIM+RSLL+A 
Sbjct: 421 YLVEKGYCPHSHVLDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIMERSLLKAR 480

Query: 530 ATDNYGELEQLRKKLKTVLPPPNQLSSEISAS 562
           ATD YGELEQLRKKLKTVLPPPNQL SEI AS
Sbjct: 481 ATDKYGELEQLRKKLKTVLPPPNQLLSEIPAS 506

BLAST of HG10009455 vs. NCBI nr
Match: TYK00295.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 901.7 bits (2329), Expect = 3.1e-258
Identity = 454/512 (88.67%), Positives = 473/512 (92.38%), Query Frame = 0

Query: 50  MAMLKRLTQIPQIPISSHSFLILAAPFCTHLHLHPSSQPTTEIERIAKIINDHPFPDHPL 109
           MA+L +LTQIPQ PISSHSFLI AA +CTHLH     + TTEIERIAKIINDHPFPD PL
Sbjct: 1   MAILMKLTQIPQKPISSHSFLIYAASYCTHLH-----KSTTEIERIAKIINDHPFPDQPL 60

Query: 110 QPILLHHIPSPLPSNTFLNDVLGRLFAAHSNGLKALEFFKFCLHHSQASSPTPDAFEKTL 169
            P LLH IPSPLPSNTFLNDVLG LFAAHSNGLKALEFFKFCLHHSQA SPTPDAFEKTL
Sbjct: 61  HPTLLHLIPSPLPSNTFLNDVLGHLFAAHSNGLKALEFFKFCLHHSQA-SPTPDAFEKTL 120

Query: 170 HILARMRYFDQSWELMREIGRTHPFLLTLKSMSILLSKIAKFQSFEETIEAFQRMENEVF 229
           HILARMRYFDQSWELMREI +THPFLLTLKSMSILLS+IAKF+SFEETIEAFQRMENEVF
Sbjct: 121 HILARMRYFDQSWELMREIRQTHPFLLTLKSMSILLSRIAKFRSFEETIEAFQRMENEVF 180

Query: 230 VGRKFGTEEFNVLLRAFCTQRQMKEARSVFQKMYSRFPPTTKTMNLLLLGFKESTDVTAV 289
           VGRKFGTEEFNVLLRAFCTQRQMKEARSVF KMYSRFPP  KT+NLLLLGFKES+D+T+V
Sbjct: 181 VGRKFGTEEFNVLLRAFCTQRQMKEARSVFHKMYSRFPPNIKTINLLLLGFKESSDITSV 240

Query: 290 ELFYHEMIKRGFKPNAVTYSIRIDAYCKRGCFVDGLRVLKEMERAKFEPTLETITTLIHG 349
           ELFYHEMIKRGFKPNAVTYSIRIDAYCK+GCFVDGLRV KEMERAK EPTLETITTLIHG
Sbjct: 241 ELFYHEMIKRGFKPNAVTYSIRIDAYCKKGCFVDGLRVFKEMERAKLEPTLETITTLIHG 300

Query: 350 GGLTKDKTKARQLFDEITLRNLCPDIGAYNALISSLIRSDDVKCAAALMEDMEAKHIGHD 409
            G+ KDKTKARQLFDEI LRNLCPDIGAYNALISSLIRS DVK AAALMEDMEAKHIGHD
Sbjct: 301 AGIVKDKTKARQLFDEIPLRNLCPDIGAYNALISSLIRSGDVKSAAALMEDMEAKHIGHD 360

Query: 410 SLTYHMMFSGLMRLEDVGGFYELYSKMVRRNFVPKTRTVIMIMKFFCENRRVDLGLELWG 469
           S+TYHMMF GL+RLEDVGGF ELYSKMVR+NFVPKTRT + IMKFFCENRRVDL L  W 
Sbjct: 361 SVTYHMMFLGLIRLEDVGGFRELYSKMVRQNFVPKTRTTVTIMKFFCENRRVDLALGFWA 420

Query: 470 YLVEKGYCPHSHVLDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIMKRSLLQAH 529
           YLVEKGYCPHSHVLDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIM+RSLL+A 
Sbjct: 421 YLVEKGYCPHSHVLDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIMERSLLKAR 480

Query: 530 ATDNYGELEQLRKKLKTVLPPPNQLSSEISAS 562
           ATD YGELEQLRKKLKTVLPPPNQL SEI AS
Sbjct: 481 ATDKYGELEQLRKKLKTVLPPPNQLLSEIPAS 506

BLAST of HG10009455 vs. NCBI nr
Match: XP_022929103.1 (pentatricopeptide repeat-containing protein At3g61360 [Cucurbita moschata] >XP_022929104.1 pentatricopeptide repeat-containing protein At3g61360 [Cucurbita moschata] >XP_022929105.1 pentatricopeptide repeat-containing protein At3g61360 [Cucurbita moschata] >XP_022929106.1 pentatricopeptide repeat-containing protein At3g61360 [Cucurbita moschata])

HSP 1 Score: 889.0 bits (2296), Expect = 2.1e-254
Identity = 449/513 (87.52%), Positives = 472/513 (92.01%), Query Frame = 0

Query: 50  MAMLKRLTQIPQIPISSHSFLILAAPFCTHLHLHPSSQPTTEIERIAKIINDHPFPDHPL 109
           M +  RLTQI QI I  HSFLI AAPF T  HLH SS+P  EIERI KIINDHPFPD PL
Sbjct: 1   MVIPTRLTQISQIRIFFHSFLIPAAPFST--HLHQSSEPGGEIERITKIINDHPFPDQPL 60

Query: 110 QPILLHHIPSPLPSNTFLNDVLGRLFAAHSNGLKALEFFKFCLHHSQASSPTPDAFEKTL 169
           +P LLHHIPSPLPSNTFLNDVLGRLFAAHSNGLKALEFFKFCLHHSQA SPTPDAFEKTL
Sbjct: 61  RPTLLHHIPSPLPSNTFLNDVLGRLFAAHSNGLKALEFFKFCLHHSQA-SPTPDAFEKTL 120

Query: 170 HILARMRYFDQSWELMREIGRTHPFLLTLKSMSILLSKIAKFQSFEETIEAFQRMENEVF 229
           HILARMRYFDQSWELMREI RTHPFLLTLKSMSILL+KIAKFQSFEETIEAF+RMENEVF
Sbjct: 121 HILARMRYFDQSWELMREIQRTHPFLLTLKSMSILLTKIAKFQSFEETIEAFRRMENEVF 180

Query: 230 VGRKFGTEEFNVLLRAFCTQRQMKEARSVFQKMYSRFPPTTKTMNLLLLGFKESTDVTAV 289
           VGRKFGTEEFNVLLRAFCTQRQMKEARSVF KMYSRFPP TKTMNLLLLGFKES++VTAV
Sbjct: 181 VGRKFGTEEFNVLLRAFCTQRQMKEARSVFLKMYSRFPPNTKTMNLLLLGFKESSNVTAV 240

Query: 290 ELFYHEMIKRGFKPNAVTYSIRIDAYCKRGCFVDGLRVLKEMERAKFEPTLETITTLIHG 349
           ELFYHEMI+RGFKP+AVTYSIR+DAYCKRGCFVDGLRV +EM+RAKFEPTLETITTLIHG
Sbjct: 241 ELFYHEMIRRGFKPDAVTYSIRMDAYCKRGCFVDGLRVFEEMKRAKFEPTLETITTLIHG 300

Query: 350 GGLTKDKTKARQLFDEITLRNLCPDIGAYNALISSLIRSDDVKCAAALMEDMEAKHIGHD 409
            G+ KD  KARQLFDE+ LRNLCPDIGAYNALISSLIRSDDV  AAALMEDMEAKHIGHD
Sbjct: 301 AGVAKDIAKARQLFDEMPLRNLCPDIGAYNALISSLIRSDDVNSAAALMEDMEAKHIGHD 360

Query: 410 SLTYHMMFSGLMRLEDVGGFYELYSKMVRRNFVPKTRTVIMIMKFFCENRRVDLGLELWG 469
           S+TYHMMF GLM+LEDV GFYELY KM+RRNFVPKTRTV+MIMKFFCENRRVDLGL+LWG
Sbjct: 361 SMTYHMMFVGLMKLEDVHGFYELYRKMIRRNFVPKTRTVVMIMKFFCENRRVDLGLDLWG 420

Query: 470 YLVEKGYCPHSHVLDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIMKRSLLQAH 529
           YLVEKGYCPHSHVLD+LVTGLCARGMV +AFECSKQMLERGRQMS+AAFLIM+RSLLQA 
Sbjct: 421 YLVEKGYCPHSHVLDVLVTGLCARGMVHEAFECSKQMLERGRQMSDAAFLIMERSLLQAD 480

Query: 530 ATDNYGELEQLRKKLKTVLPPPNQLSSEISASS 563
           A D  GELEQLRKKLKTVLPPP QL  EI ASS
Sbjct: 481 AKDILGELEQLRKKLKTVLPPPKQLPYEIPASS 510

BLAST of HG10009455 vs. ExPASy Swiss-Prot
Match: Q9M2C8 (Pentatricopeptide repeat-containing protein At3g61360 OS=Arabidopsis thaliana OX=3702 GN=At3g61360 PE=2 SV=1)

HSP 1 Score: 560.5 bits (1443), Expect = 2.2e-158
Identity = 286/491 (58.25%), Positives = 366/491 (74.54%), Query Frame = 0

Query: 64  ISSHSFLILAAP-FCTHLHLHPSSQPTTEIERIAKIINDHPFPDHPLQPILLHHIPSPLP 123
           ISS S+  L AP   +   L  +S   TEIERI  IIN HPFP+HP+QPIL  HIP    
Sbjct: 5   ISSDSYRRLLAPVLSSSSSLSSTSINRTEIERITIIINGHPFPNHPIQPILAKHIPLSSL 64

Query: 124 SNTFLNDVLGRLFAAHSNGLKALEFFKFCLHHSQASSPTPDAFEKTLHILARMRYFDQSW 183
           S  F+++VLGRLFAAHSNGLKALEFFK+ L  S+ SSPT D+FEKTLHILARMRYFDQ+W
Sbjct: 65  SPEFVSEVLGRLFAAHSNGLKALEFFKYSLKSSK-SSPTSDSFEKTLHILARMRYFDQAW 124

Query: 184 ELMREIGRTHPFLLTLKSMSILLSKIAKFQSFEETIEAFQRMENEVFVGRKFGTEEFNVL 243
            LM E+ + +P LL+ KSMSILL KIAKF S+EET+EAF +ME E+F  +KFG +EFN+L
Sbjct: 125 ALMAEVRKDYPNLLSFKSMSILLCKIAKFGSYEETLEAFVKMEKEIF-RKKFGVDEFNIL 184

Query: 244 LRAFCTQRQMKEARSVFQKMYSRFPPTTKTMNLLLLGFKESTDVTAVELFYHEMIKRGFK 303
           LRAFCT+R+MKEARS+F+K++SRF P  KTMN+LLLGFKE+ DVTA ELFYHEM+KRGFK
Sbjct: 185 LRAFCTEREMKEARSIFEKLHSRFNPDVKTMNILLLGFKEAGDVTATELFYHEMVKRGFK 244

Query: 304 PNAVTYSIRIDAYCKRGCFVDGLRVLKEMERAKFEPTLETITTLIHGGGLTKDKTKARQL 363
           PN+VTY IRID +CK+  F + LR+ ++M+R  F+ T++ +TTLIHG G+ ++K KARQL
Sbjct: 245 PNSVTYGIRIDGFCKKRNFGEALRLFEDMDRLDFDITVQILTTLIHGSGVARNKIKARQL 304

Query: 364 FDEITLRNLCPDIGAYNALISSLIRSDDVKCAAALMEDMEAKHIGHDSLTYHMMFSGLMR 423
           FDEI+ R L PD GAYNAL+SSL++  DV  A  +M++ME K I  DS+T+H MF G+M+
Sbjct: 305 FDEISKRGLTPDCGAYNALMSSLMKCGDVSGAIKVMKEMEEKGIEPDSVTFHSMFIGMMK 364

Query: 424 LEDVG--GFYELYSKMVRRNFVPKTRTVIMIMKFFCENRRVDLGLELWGYLVEKGYCPHS 483
            ++ G  G  E Y KM  R+ VPKT T++M+MK FC N  V+LGL+LW Y++EKGYCPH 
Sbjct: 365 SKEFGFNGVCEYYQKMKERSLVPKTPTIVMLMKLFCHNGEVNLGLDLWKYMLEKGYCPHG 424

Query: 484 HVLDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIMKRSLLQAHATDNYGELEQL 543
           H L+LL T LCAR     AFECS Q +ERGR +SE  + +++ SL   +      EL++ 
Sbjct: 425 HALELLTTALCARRRANDAFECSWQTVERGRCVSEPVYRMLETSLSSNNELKKLEELKEE 484

Query: 544 RKKLKTVLPPP 552
            +KL + LPPP
Sbjct: 485 IQKLHSFLPPP 493

BLAST of HG10009455 vs. ExPASy Swiss-Prot
Match: Q9FZ19 (Putative pentatricopeptide repeat-containing protein At1g02420 OS=Arabidopsis thaliana OX=3702 GN=At1g02420 PE=3 SV=2)

HSP 1 Score: 228.4 bits (581), Expect = 2.0e-58
Identity = 138/432 (31.94%), Positives = 230/432 (53.24%), Query Frame = 0

Query: 123 SNTFLNDVLGRLFAAHSNGLKALEFFKFC-----LHHSQASSPTPDAFEKTLHILARMRY 182
           S   ++ VL R+  +H N ++ LEF+++       +HS  S  T       L+IL R R 
Sbjct: 70  SKDLIDRVLKRVRFSHGNPIQTLEFYRYASAIRGFYHSSFSLDT------MLYILGRNRK 129

Query: 183 FDQSWELMREIGRTHPFLLTLKSMSILLSKIAKFQSFEETIEAFQRMENEVFVGRKFGTE 242
           FDQ WEL+ E  R    L++ ++M ++L ++AK  S  +T+E+F + +    V   F T 
Sbjct: 130 FDQIWELLIETKRKDRSLISPRTMQVVLGRVAKLCSVRQTVESFWKFKR--LVPDFFDTA 189

Query: 243 EFNVLLRAFCTQRQMKEARSVFQKMYSRFPPTTKTMNLLLLGFKESTDVTAVELFYHEMI 302
            FN LLR  C ++ M +AR+V+  +  +F P  +T N+LL G+K S +    E F+ EM 
Sbjct: 190 CFNALLRTLCQEKSMTDARNVYHSLKHQFQPDLQTFNILLSGWKSSEE---AEAFFEEMK 249

Query: 303 KRGFKPNAVTYSIRIDAYCKRGCFVDGLRVLKEMERAKFEPTLETITTLIHGGGLTKDKT 362
            +G KP+ VTY+  ID YCK        +++ +M   +  P + T TT+I G GL     
Sbjct: 250 GKGLKPDVVTYNSLIDVYCKDREIEKAYKLIDKMREEEETPDVITYTTVIGGLGLIGQPD 309

Query: 363 KARQLFDEITLRNLCPDIGAYNALISSLIRSDDVKCAAALMEDMEAKHIGHDSLTYHMMF 422
           KAR++  E+      PD+ AYNA I +   +  +  A  L+++M  K +  ++ TY++ F
Sbjct: 310 KAREVLKEMKEYGCYPDVAAYNAAIRNFCIARRLGDADKLVDEMVKKGLSPNATTYNLFF 369

Query: 423 SGLMRLEDVGGFYELYSKMVRRNFVPKTRTVIMIMKFFCENRRVDLGLELWGYLVEKGYC 482
             L    D+G  +ELY +M+    +P T++ + ++K F  + +VD+ + LW  +V KG+ 
Sbjct: 370 RVLSLANDLGRSWELYVRMLGNECLPNTQSCMFLIKMFKRHEKVDMAMRLWEDMVVKGFG 429

Query: 483 PHSHVLDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIMKRSLLQAHATDNYGEL 542
            +S V D+L+  LC    V +A +C  +M+E+G + S  +F  +K  +  A+  D    L
Sbjct: 430 SYSLVSDVLLDLLCDLAKVEEAEKCLLEMVEKGHRPSNVSFKRIKLLMELANKHDEVNNL 489

Query: 543 EQLRKKLKTVLP 550
            Q      T +P
Sbjct: 490 IQKMAIFSTEIP 490

BLAST of HG10009455 vs. ExPASy Swiss-Prot
Match: Q9C9A2 (Pentatricopeptide repeat-containing protein At1g71060, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g71060 PE=2 SV=1)

HSP 1 Score: 170.2 bits (430), Expect = 6.5e-41
Identity = 140/523 (26.77%), Positives = 253/523 (48.37%), Query Frame = 0

Query: 30  LFNSAIAVFSIN-SRRV-KRLKMAMLKRLTQIPQIPISSHSFL---ILAAPFCTHLHLHP 89
           +F+    V  +N SRRV  R+  +    L  IP I  +S+  L     A+   T +  + 
Sbjct: 2   VFSRFFRVTGVNLSRRVYSRISSSSSPSLESIPWIHKASNFTLYGSFHASSVETQVSAND 61

Query: 90  SSQPTTEIERIAKIINDHPFPDHPLQPILLHHIPSPLPSNTFLNDVLGRLFAAHSNGLKA 149
           +SQ   + ERI KI+    F D  ++ +L  +  S   S   + +VL +L  A   G+ A
Sbjct: 62  ASQ---DAERICKILT--KFTDSKVETLL--NEASVKLSPALIEEVLKKLSNA---GVLA 121

Query: 150 LEFFKFCLHHSQASSPTPDAFEKTLHILARMRYFDQSWELMREIGRTHPFLLTLKSMSIL 209
           L  FK+   + +    T   +   +  L +++ F   W L+ ++      LL+ ++ +++
Sbjct: 122 LSVFKWA-ENQKGFKHTTSNYNALIESLGKIKQFKLIWSLVDDMKAKK--LLSKETFALI 181

Query: 210 LSKIAKFQSFEETIEAFQRMENEVFVGRKFGTEEFNVLLRAFCTQRQMKEARSVFQKM-Y 269
             + A+ +  +E I AF +ME     G K  + +FN +L      R + +A+ VF KM  
Sbjct: 182 SRRYARARKVKEAIGAFHKMEE---FGFKMESSDFNRMLDTLSKSRNVGDAQKVFDKMKK 241

Query: 270 SRFPPTTKTMNLLLLGFKESTDVTAVELFYHEMIKRGFKPNAVTYSIRIDAYCKRGCFVD 329
            RF P  K+  +LL G+ +  ++  V+    EM   GF+P+ V Y I I+A+CK   + +
Sbjct: 242 KRFEPDIKSYTILLEGWGQELNLLRVDEVNREMKDEGFEPDVVAYGIIINAHCKAKKYEE 301

Query: 330 GLRVLKEMERAKFEPTLETITTLIHGGGLTKDKTKARQLFDEITLRNLCPDIGAYNALIS 389
            +R   EME+   +P+     +LI+G G  K    A + F+         +   YNAL+ 
Sbjct: 302 AIRFFNEMEQRNCKPSPHIFCSLINGLGSEKKLNDALEFFERSKSSGFPLEAPTYNALVG 361

Query: 390 SLIRSDDVKCAAALMEDMEAKHIGHDSLTYHMMFSGLMRLEDVGGFYELYSKMVRRNFVP 449
           +   S  ++ A   +++M  K +G ++ TY ++   L+R++     YE+Y  M   +  P
Sbjct: 362 AYCWSQRMEDAYKTVDEMRLKGVGPNARTYDIILHHLIRMQRSKEAYEVYQTM---SCEP 421

Query: 450 KTRTVIMIMKFFCENRRVDLGLELWGYLVEKGYCPHSHVLDLLVTGLCARGMVLQAFECS 509
              T  ++++ FC   R+D+ +++W  +  KG  P  H+   L+T LC    + +A E  
Sbjct: 422 TVSTYEIMVRMFCNKERLDMAIKIWDEMKGKGVLPGMHMFSSLITALCHENKLDEACEYF 481

Query: 510 KQMLERGRQMSEAAFLIMKRSLL----QAHATDNYGELEQLRK 543
            +ML+ G +     F  +K++LL    +   TD   ++++LRK
Sbjct: 482 NEMLDVGIRPPGHMFSRLKQTLLDEGRKDKVTDLVVKMDRLRK 505

BLAST of HG10009455 vs. ExPASy Swiss-Prot
Match: Q9FVX2 (Pentatricopeptide repeat-containing protein At1g77360, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g77360 PE=2 SV=2)

HSP 1 Score: 165.6 bits (418), Expect = 1.6e-39
Identity = 107/429 (24.94%), Positives = 212/429 (49.42%), Query Frame = 0

Query: 123 SNTFLNDVLGRLFAAHSNGLKALEFFKFCLHHSQASSPTPDAFEKTLHILARMRYFDQSW 182
           S   + DVL R     + GL    FF++     +    +  A+   +   A++R +   W
Sbjct: 99  SQEVVEDVLNRF---RNAGLLTYRFFQWS-EKQRHYEHSVRAYHMMIESTAKIRQYKLMW 158

Query: 183 ELMREIGRTHPFLLTLKSMSILLSKIAKFQSFEETIEAFQRMENEVFVGRKFGTEEFNVL 242
           +L+  + +    +L +++  I++ K A+ Q  +E I AF  ME             FN L
Sbjct: 159 DLINAMRKKK--MLNVETFCIVMRKYARAQKVDEAIYAFNVMEKYDLPPNLVA---FNGL 218

Query: 243 LRAFCTQRQMKEARSVFQKMYSRFPPTTKTMNLLLLGFKESTDVTAVELFYHEMIKRGFK 302
           L A C  + +++A+ VF+ M  RF P +KT ++LL G+ +  ++      + EMI  G  
Sbjct: 219 LSALCKSKNVRKAQEVFENMRDRFTPDSKTYSILLEGWGKEPNLPKAREVFREMIDAGCH 278

Query: 303 PNAVTYSIRIDAYCKRGCFVDGLRVLKEMERAKFEPTLETITTLIHGGGLTKDKTKARQL 362
           P+ VTYSI +D  CK G   + L +++ M+ +  +PT    + L+H  G      +A   
Sbjct: 279 PDIVTYSIMVDILCKAGRVDEALGIVRSMDPSICKPTTFIYSVLVHTYGTENRLEEAVDT 338

Query: 363 FDEITLRNLCPDIGAYNALISSLIRSDDVKCAAALMEDMEAKHIGHDSLTYHMMFSGLMR 422
           F E+    +  D+  +N+LI +  +++ +K    ++++M++K +  +S + +++   L+ 
Sbjct: 339 FLEMERSGMKADVAVFNSLIGAFCKANRMKNVYRVLKEMKSKGVTPNSKSCNIILRHLIE 398

Query: 423 LEDVGGFYELYSKMVRRNFVPKTRTVIMIMKFFCENRRVDLGLELWGYLVEKGYCPHSHV 482
             +    ++++ KM+ +   P   T  M++K FCE + ++   ++W Y+ +KG  P  H 
Sbjct: 399 RGEKDEAFDVFRKMI-KVCEPDADTYTMVIKMFCEKKEMETADKVWKYMRKKGVFPSMHT 458

Query: 483 LDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIMKRSLLQAHATDNYGELEQLRK 542
             +L+ GLC      +A    ++M+E G + S   F  +++ L++    D    L+ L +
Sbjct: 459 FSVLINGLCEERTTQKACVLLEEMIEMGIRPSGVTFGRLRQLLIKEERED---VLKFLNE 514

Query: 543 KLKTVLPPP 552
           K+  ++  P
Sbjct: 519 KMNVLVNEP 514

BLAST of HG10009455 vs. ExPASy Swiss-Prot
Match: Q9S7R4 (Pentatricopeptide repeat-containing protein At1g74900, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=OTP43 PE=2 SV=1)

HSP 1 Score: 148.3 bits (373), Expect = 2.6e-34
Identity = 107/426 (25.12%), Positives = 196/426 (46.01%), Query Frame = 0

Query: 88  PTTEIERIAKIINDHPFPDHPLQPILLHHIPSPLPSNTFLNDVLGRLFAAHSNGLKALEF 147
           P  +   IAK+I   P   H     LL    +P   N  +N VL RL+   ++G KAL+F
Sbjct: 21  PPADSAAIAKLILSSPNTTHQDDQFLLSTKTTPWTPN-LVNSVLKRLW---NHGPKALQF 80

Query: 148 FKFCLHHSQASSPTPDAFEKTLHILARMRYFDQSWELMREIGRTHPFLLTLKSMSILLSK 207
           F F  +H +       +F+  + I AR+      W L+  + R+     + K+ +I+  +
Sbjct: 81  FHFLDNHHREYVHDASSFDLAIDIAARLHLHPTVWSLIHRM-RSLRIGPSPKTFAIVAER 140

Query: 208 IAKFQSFEETIEAFQRMENEVFVGRKFGTEEFNVLLRAFCTQRQMKEARSVFQKMYSRFP 267
            A     ++ ++ F  M      G       FN +L   C  +++++A  +F+ +  RF 
Sbjct: 141 YASAGKPDKAVKLFLNMHEH---GCFQDLASFNTILDVLCKSKRVEKAYELFRALRGRFS 200

Query: 268 PTTKTMNLLLLGFKESTDVTAVELFYHEMIKRGFKPNAVTYSIRIDAYCKRGCFVDGLRV 327
             T T N++L G+              EM++RG  PN  TY+  +  + + G        
Sbjct: 201 VDTVTYNVILNGWCLIKRTPKALEVLKEMVERGINPNLTTYNTMLKGFFRAGQIRHAWEF 260

Query: 328 LKEMERAKFEPTLETITTLIHGGGLTKDKTKARQLFDEITLRNLCPDIGAYNALISSLIR 387
             EM++   E  + T TT++HG G+  +  +AR +FDE+    + P +  YNA+I  L +
Sbjct: 261 FLEMKKRDCEIDVVTYTTVVHGFGVAGEIKRARNVFDEMIREGVLPSVATYNAMIQVLCK 320

Query: 388 SDDVKCAAALMEDMEAKHIGHDSLTYHMMFSGLMRLEDVGGFYELYSKMVRRNFVPKTRT 447
            D+V+ A  + E+M  +    +  TY+++  GL    +     EL  +M      P  +T
Sbjct: 321 KDNVENAVVMFEEMVRRGYEPNVTTYNVLIRGLFHAGEFSRGEELMQRMENEGCEPNFQT 380

Query: 448 VIMIMKFFCENRRVDLGLELWGYLVEKGYCPHSHVLDLLVTGLCARGM---VLQAFECSK 507
             M+++++ E   V+  L L+  +      P+    ++L++G+  R     ++ A +   
Sbjct: 381 YNMMIRYYSECSEVEKALGLFEKMGSGDCLPNLDTYNILISGMFVRKRSEDMVVAGKLLL 438

Query: 508 QMLERG 511
           +M+ERG
Sbjct: 441 EMVERG 438

BLAST of HG10009455 vs. ExPASy TrEMBL
Match: A0A1S4DS46 (pentatricopeptide repeat-containing protein At3g61360 OS=Cucumis melo OX=3656 GN=LOC103504627 PE=4 SV=1)

HSP 1 Score: 906.0 bits (2340), Expect = 8.0e-260
Identity = 456/515 (88.54%), Positives = 476/515 (92.43%), Query Frame = 0

Query: 47  RLKMAMLKRLTQIPQIPISSHSFLILAAPFCTHLHLHPSSQPTTEIERIAKIINDHPFPD 106
           +LKMA+L +LTQIPQ PISSHSFLI AA +CTHLH     + TTEIERIAKIINDHPFPD
Sbjct: 2   KLKMAILMKLTQIPQKPISSHSFLIYAASYCTHLH-----KSTTEIERIAKIINDHPFPD 61

Query: 107 HPLQPILLHHIPSPLPSNTFLNDVLGRLFAAHSNGLKALEFFKFCLHHSQASSPTPDAFE 166
            PL P LLH IPSPLPSNTFLNDVLG LFAAHSNGLKALEFFKFCLHHSQA SPTPDAFE
Sbjct: 62  QPLHPTLLHLIPSPLPSNTFLNDVLGHLFAAHSNGLKALEFFKFCLHHSQA-SPTPDAFE 121

Query: 167 KTLHILARMRYFDQSWELMREIGRTHPFLLTLKSMSILLSKIAKFQSFEETIEAFQRMEN 226
           KTLHILARMRYFDQSWELMREI +THPFLLTLKSMSILLS+IAKF+SFEETIEAFQRMEN
Sbjct: 122 KTLHILARMRYFDQSWELMREIRQTHPFLLTLKSMSILLSRIAKFRSFEETIEAFQRMEN 181

Query: 227 EVFVGRKFGTEEFNVLLRAFCTQRQMKEARSVFQKMYSRFPPTTKTMNLLLLGFKESTDV 286
           EVFVGRKFGTEEFNVLLRAFCTQRQMKEARSVF KMYSRFPP  KT+NLLLLGFKES+D+
Sbjct: 182 EVFVGRKFGTEEFNVLLRAFCTQRQMKEARSVFHKMYSRFPPNIKTINLLLLGFKESSDI 241

Query: 287 TAVELFYHEMIKRGFKPNAVTYSIRIDAYCKRGCFVDGLRVLKEMERAKFEPTLETITTL 346
           T+VELFYHEMIKRGFKPNAVTYSIRIDAYCK+GCFVDGLRV KEMERAK EPTLETITTL
Sbjct: 242 TSVELFYHEMIKRGFKPNAVTYSIRIDAYCKKGCFVDGLRVFKEMERAKLEPTLETITTL 301

Query: 347 IHGGGLTKDKTKARQLFDEITLRNLCPDIGAYNALISSLIRSDDVKCAAALMEDMEAKHI 406
           IHG G+ KDKTKARQLFDEI LRNLCPDIGAYNALISSLIRS DVK AAALMEDMEAKHI
Sbjct: 302 IHGAGIVKDKTKARQLFDEIPLRNLCPDIGAYNALISSLIRSGDVKSAAALMEDMEAKHI 361

Query: 407 GHDSLTYHMMFSGLMRLEDVGGFYELYSKMVRRNFVPKTRTVIMIMKFFCENRRVDLGLE 466
           GHDS+TYHMMF GL+RLEDVGGF ELYSKMVR+NFVPKTRT + IMKFFCENRRVDL L 
Sbjct: 362 GHDSVTYHMMFLGLIRLEDVGGFRELYSKMVRQNFVPKTRTTVTIMKFFCENRRVDLALG 421

Query: 467 LWGYLVEKGYCPHSHVLDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIMKRSLL 526
            W YLVEKGYCPHSHVLDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIM+RSLL
Sbjct: 422 FWAYLVEKGYCPHSHVLDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIMERSLL 481

Query: 527 QAHATDNYGELEQLRKKLKTVLPPPNQLSSEISAS 562
           +A ATD YGELEQLRKKLKTVLPPPNQL SEI AS
Sbjct: 482 KARATDKYGELEQLRKKLKTVLPPPNQLLSEIPAS 510

BLAST of HG10009455 vs. ExPASy TrEMBL
Match: A0A5A7UJI2 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold231G001000 PE=4 SV=1)

HSP 1 Score: 902.5 bits (2331), Expect = 8.8e-259
Identity = 454/512 (88.67%), Positives = 473/512 (92.38%), Query Frame = 0

Query: 50  MAMLKRLTQIPQIPISSHSFLILAAPFCTHLHLHPSSQPTTEIERIAKIINDHPFPDHPL 109
           MA+L +LTQIPQ PISSHSFLI AA +CTHLH     + TTEIERIAKIINDHPFPD PL
Sbjct: 1   MAILMKLTQIPQKPISSHSFLIYAASYCTHLH-----KSTTEIERIAKIINDHPFPDQPL 60

Query: 110 QPILLHHIPSPLPSNTFLNDVLGRLFAAHSNGLKALEFFKFCLHHSQASSPTPDAFEKTL 169
            P  LH IPSPLPSNTFLNDVLG LFAAHSNGLKALEFFKFCLHHSQA SPTPDAFEKTL
Sbjct: 61  HPTFLHLIPSPLPSNTFLNDVLGHLFAAHSNGLKALEFFKFCLHHSQA-SPTPDAFEKTL 120

Query: 170 HILARMRYFDQSWELMREIGRTHPFLLTLKSMSILLSKIAKFQSFEETIEAFQRMENEVF 229
           HILARMRYFDQSWELMREI +THPFLLTLKSMSILLS+IAKF+SFEETIEAFQRMENEVF
Sbjct: 121 HILARMRYFDQSWELMREIRQTHPFLLTLKSMSILLSRIAKFRSFEETIEAFQRMENEVF 180

Query: 230 VGRKFGTEEFNVLLRAFCTQRQMKEARSVFQKMYSRFPPTTKTMNLLLLGFKESTDVTAV 289
           VGRKFGTEEFNVLLRAFCTQRQMKEARSVF KMYSRFPP  KT+NLLLLGFKES+D+T+V
Sbjct: 181 VGRKFGTEEFNVLLRAFCTQRQMKEARSVFHKMYSRFPPNIKTINLLLLGFKESSDITSV 240

Query: 290 ELFYHEMIKRGFKPNAVTYSIRIDAYCKRGCFVDGLRVLKEMERAKFEPTLETITTLIHG 349
           ELFYHEMIKRGFKPNAVTYSIRIDAYCK+GCFVDGLRV KEMERAK EPTLETITTLIHG
Sbjct: 241 ELFYHEMIKRGFKPNAVTYSIRIDAYCKKGCFVDGLRVFKEMERAKLEPTLETITTLIHG 300

Query: 350 GGLTKDKTKARQLFDEITLRNLCPDIGAYNALISSLIRSDDVKCAAALMEDMEAKHIGHD 409
            G+ KDKTKARQLFDEI LRNLCPDIGAYNALISSLIRS DVK AAALMEDMEAKHIGHD
Sbjct: 301 AGIVKDKTKARQLFDEIPLRNLCPDIGAYNALISSLIRSGDVKSAAALMEDMEAKHIGHD 360

Query: 410 SLTYHMMFSGLMRLEDVGGFYELYSKMVRRNFVPKTRTVIMIMKFFCENRRVDLGLELWG 469
           S+TYHMMF GL+RLEDVGGF ELYSKMVR+NFVPKTRT +MIMKFFCENRRVDL L  W 
Sbjct: 361 SVTYHMMFLGLIRLEDVGGFRELYSKMVRQNFVPKTRTTVMIMKFFCENRRVDLALGFWA 420

Query: 470 YLVEKGYCPHSHVLDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIMKRSLLQAH 529
           YLVEKGYCPHSHVLDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIM+RSLL+A 
Sbjct: 421 YLVEKGYCPHSHVLDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIMERSLLKAR 480

Query: 530 ATDNYGELEQLRKKLKTVLPPPNQLSSEISAS 562
           ATD YGELEQLRKKLKTVLPPPNQL SEI AS
Sbjct: 481 ATDKYGELEQLRKKLKTVLPPPNQLLSEIPAS 506
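
The percentages on each HSP summary line follow directly from the residue counts beside them (for example, 454 identical positions over an alignment length of 512). As a minimal sketch, assuming the summary line keeps the layout shown above, the figures can be re-derived and cross-checked as follows. The names `HSP_STATS` and `check_hsp_stats` are hypothetical helpers introduced only for this illustration and are not part of any BLAST tool.

```python
import re

# Hypothetical pattern for the HSP summary line shown in this record.
# "Posi?tives" accepts both the standard spelling and the "Postives"
# variant that appears in some exports of this page.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_stats(line):
    """Recompute percent identity and percent positives from the counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    identity = round(100 * int(ident) / int(ident_len), 2)
    positives = round(100 * int(pos) / int(pos_len), 2)
    # Allow for rounding in the reported two-decimal percentages.
    consistent = (
        abs(identity - float(ident_pct)) < 0.006
        and abs(positives - float(pos_pct)) < 0.006
    )
    return {
        "identity_pct": identity,
        "positives_pct": positives,
        "consistent": consistent,
    }


if __name__ == "__main__":
    line = "Identity = 454/512 (88.67%), Positives = 473/512 (92.38%), Query Frame = 0"
    print(check_hsp_stats(line))
```

The same check applies to every HSP in this record, since each one reports its counts and percentages in the same layout.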

BLAST of HG10009455 vs. ExPASy TrEMBL
Match: A0A5D3BMK0 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1728G00390 PE=4 SV=1)

HSP 1 Score: 901.7 bits (2329), Expect = 1.5e-258
Identity = 454/512 (88.67%), Positives = 473/512 (92.38%), Query Frame = 0

Query: 50  MAMLKRLTQIPQIPISSHSFLILAAPFCTHLHLHPSSQPTTEIERIAKIINDHPFPDHPL 109
           MA+L +LTQIPQ PISSHSFLI AA +CTHLH     + TTEIERIAKIINDHPFPD PL
Sbjct: 1   MAILMKLTQIPQKPISSHSFLIYAASYCTHLH-----KSTTEIERIAKIINDHPFPDQPL 60

Query: 110 QPILLHHIPSPLPSNTFLNDVLGRLFAAHSNGLKALEFFKFCLHHSQASSPTPDAFEKTL 169
            P LLH IPSPLPSNTFLNDVLG LFAAHSNGLKALEFFKFCLHHSQA SPTPDAFEKTL
Sbjct: 61  HPTLLHLIPSPLPSNTFLNDVLGHLFAAHSNGLKALEFFKFCLHHSQA-SPTPDAFEKTL 120

Query: 170 HILARMRYFDQSWELMREIGRTHPFLLTLKSMSILLSKIAKFQSFEETIEAFQRMENEVF 229
           HILARMRYFDQSWELMREI +THPFLLTLKSMSILLS+IAKF+SFEETIEAFQRMENEVF
Sbjct: 121 HILARMRYFDQSWELMREIRQTHPFLLTLKSMSILLSRIAKFRSFEETIEAFQRMENEVF 180

Query: 230 VGRKFGTEEFNVLLRAFCTQRQMKEARSVFQKMYSRFPPTTKTMNLLLLGFKESTDVTAV 289
           VGRKFGTEEFNVLLRAFCTQRQMKEARSVF KMYSRFPP  KT+NLLLLGFKES+D+T+V
Sbjct: 181 VGRKFGTEEFNVLLRAFCTQRQMKEARSVFHKMYSRFPPNIKTINLLLLGFKESSDITSV 240

Query: 290 ELFYHEMIKRGFKPNAVTYSIRIDAYCKRGCFVDGLRVLKEMERAKFEPTLETITTLIHG 349
           ELFYHEMIKRGFKPNAVTYSIRIDAYCK+GCFVDGLRV KEMERAK EPTLETITTLIHG
Sbjct: 241 ELFYHEMIKRGFKPNAVTYSIRIDAYCKKGCFVDGLRVFKEMERAKLEPTLETITTLIHG 300

Query: 350 GGLTKDKTKARQLFDEITLRNLCPDIGAYNALISSLIRSDDVKCAAALMEDMEAKHIGHD 409
            G+ KDKTKARQLFDEI LRNLCPDIGAYNALISSLIRS DVK AAALMEDMEAKHIGHD
Sbjct: 301 AGIVKDKTKARQLFDEIPLRNLCPDIGAYNALISSLIRSGDVKSAAALMEDMEAKHIGHD 360

Query: 410 SLTYHMMFSGLMRLEDVGGFYELYSKMVRRNFVPKTRTVIMIMKFFCENRRVDLGLELWG 469
           S+TYHMMF GL+RLEDVGGF ELYSKMVR+NFVPKTRT + IMKFFCENRRVDL L  W 
Sbjct: 361 SVTYHMMFLGLIRLEDVGGFRELYSKMVRQNFVPKTRTTVTIMKFFCENRRVDLALGFWA 420

Query: 470 YLVEKGYCPHSHVLDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIMKRSLLQAH 529
           YLVEKGYCPHSHVLDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIM+RSLL+A 
Sbjct: 421 YLVEKGYCPHSHVLDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIMERSLLKAR 480

Query: 530 ATDNYGELEQLRKKLKTVLPPPNQLSSEISAS 562
           ATD YGELEQLRKKLKTVLPPPNQL SEI AS
Sbjct: 481 ATDKYGELEQLRKKLKTVLPPPNQLLSEIPAS 506

BLAST of HG10009455 vs. ExPASy TrEMBL
Match: A0A6J1EM60 (pentatricopeptide repeat-containing protein At3g61360 OS=Cucurbita moschata OX=3662 GN=LOC111435788 PE=4 SV=1)

HSP 1 Score: 889.0 bits (2296), Expect = 1.0e-254
Identity = 449/513 (87.52%), Positives = 472/513 (92.01%), Query Frame = 0

Query: 50  MAMLKRLTQIPQIPISSHSFLILAAPFCTHLHLHPSSQPTTEIERIAKIINDHPFPDHPL 109
           M +  RLTQI QI I  HSFLI AAPF T  HLH SS+P  EIERI KIINDHPFPD PL
Sbjct: 1   MVIPTRLTQISQIRIFFHSFLIPAAPFST--HLHQSSEPGGEIERITKIINDHPFPDQPL 60

Query: 110 QPILLHHIPSPLPSNTFLNDVLGRLFAAHSNGLKALEFFKFCLHHSQASSPTPDAFEKTL 169
           +P LLHHIPSPLPSNTFLNDVLGRLFAAHSNGLKALEFFKFCLHHSQA SPTPDAFEKTL
Sbjct: 61  RPTLLHHIPSPLPSNTFLNDVLGRLFAAHSNGLKALEFFKFCLHHSQA-SPTPDAFEKTL 120

Query: 170 HILARMRYFDQSWELMREIGRTHPFLLTLKSMSILLSKIAKFQSFEETIEAFQRMENEVF 229
           HILARMRYFDQSWELMREI RTHPFLLTLKSMSILL+KIAKFQSFEETIEAF+RMENEVF
Sbjct: 121 HILARMRYFDQSWELMREIQRTHPFLLTLKSMSILLTKIAKFQSFEETIEAFRRMENEVF 180

Query: 230 VGRKFGTEEFNVLLRAFCTQRQMKEARSVFQKMYSRFPPTTKTMNLLLLGFKESTDVTAV 289
           VGRKFGTEEFNVLLRAFCTQRQMKEARSVF KMYSRFPP TKTMNLLLLGFKES++VTAV
Sbjct: 181 VGRKFGTEEFNVLLRAFCTQRQMKEARSVFLKMYSRFPPNTKTMNLLLLGFKESSNVTAV 240

Query: 290 ELFYHEMIKRGFKPNAVTYSIRIDAYCKRGCFVDGLRVLKEMERAKFEPTLETITTLIHG 349
           ELFYHEMI+RGFKP+AVTYSIR+DAYCKRGCFVDGLRV +EM+RAKFEPTLETITTLIHG
Sbjct: 241 ELFYHEMIRRGFKPDAVTYSIRMDAYCKRGCFVDGLRVFEEMKRAKFEPTLETITTLIHG 300

Query: 350 GGLTKDKTKARQLFDEITLRNLCPDIGAYNALISSLIRSDDVKCAAALMEDMEAKHIGHD 409
            G+ KD  KARQLFDE+ LRNLCPDIGAYNALISSLIRSDDV  AAALMEDMEAKHIGHD
Sbjct: 301 AGVAKDIAKARQLFDEMPLRNLCPDIGAYNALISSLIRSDDVNSAAALMEDMEAKHIGHD 360

Query: 410 SLTYHMMFSGLMRLEDVGGFYELYSKMVRRNFVPKTRTVIMIMKFFCENRRVDLGLELWG 469
           S+TYHMMF GLM+LEDV GFYELY KM+RRNFVPKTRTV+MIMKFFCENRRVDLGL+LWG
Sbjct: 361 SMTYHMMFVGLMKLEDVHGFYELYRKMIRRNFVPKTRTVVMIMKFFCENRRVDLGLDLWG 420

Query: 470 YLVEKGYCPHSHVLDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIMKRSLLQAH 529
           YLVEKGYCPHSHVLD+LVTGLCARGMV +AFECSKQMLERGRQMS+AAFLIM+RSLLQA 
Sbjct: 421 YLVEKGYCPHSHVLDVLVTGLCARGMVHEAFECSKQMLERGRQMSDAAFLIMERSLLQAD 480

Query: 530 ATDNYGELEQLRKKLKTVLPPPNQLSSEISASS 563
           A D  GELEQLRKKLKTVLPPP QL  EI ASS
Sbjct: 481 AKDILGELEQLRKKLKTVLPPPKQLPYEIPASS 510

BLAST of HG10009455 vs. ExPASy TrEMBL
Match: A0A0A0KRS8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G173440 PE=4 SV=1)

HSP 1 Score: 880.9 bits (2275), Expect = 2.7e-252
Identity = 442/501 (88.22%), Positives = 465/501 (92.81%), Query Frame = 0

Query: 50  MAMLKRLTQIPQIPISSHSFLILAAPFCTHLHLHPSSQPTTEIERIAKIINDHPFPDHPL 109
           MA+L +LT IPQ PISSHSFLILAA +CT  +LHPSS+ TTEIERIAKIINDHPFPD PL
Sbjct: 1   MAVLMKLTLIPQKPISSHSFLILAASYCT--NLHPSSKSTTEIERIAKIINDHPFPDQPL 60

Query: 110 QPILLHHIPSPLPSNTFLNDVLGRLFAAHSNGLKALEFFKFCLHHSQASSPTPDAFEKTL 169
            P LLH IPSPLPSNTFLNDVLG+LFAAHSNGLKALEFFKFCLHHSQA  PT DAFEKTL
Sbjct: 61  HPTLLHLIPSPLPSNTFLNDVLGQLFAAHSNGLKALEFFKFCLHHSQA-PPTSDAFEKTL 120

Query: 170 HILARMRYFDQSWELMREIGRTHPFLLTLKSMSILLSKIAKFQSFEETIEAFQRMENEVF 229
           HIL+RMRYFDQSWELMREI +THPFLLTLKSMSILLS+IAKF SFEETIEAFQRMENEVF
Sbjct: 121 HILSRMRYFDQSWELMREIRQTHPFLLTLKSMSILLSRIAKFLSFEETIEAFQRMENEVF 180

Query: 230 VGRKFGTEEFNVLLRAFCTQRQMKEARSVFQKMYSRFPPTTKTMNLLLLGFKESTDVTAV 289
           VGRKFGTEEFNVLLRAFCTQRQMKEARSVF KMYSRFPP  KT+NLLLLGFKES+D+T+V
Sbjct: 181 VGRKFGTEEFNVLLRAFCTQRQMKEARSVFHKMYSRFPPNIKTINLLLLGFKESSDITSV 240

Query: 290 ELFYHEMIKRGFKPNAVTYSIRIDAYCKRGCFVDGLRVLKEMERAKFEPTLETITTLIHG 349
           ELFYHEMIKRGFKPNAVTYSIRIDAYCK+GCFVDGLRV KEMERAK EPTLETITTLIHG
Sbjct: 241 ELFYHEMIKRGFKPNAVTYSIRIDAYCKKGCFVDGLRVFKEMERAKCEPTLETITTLIHG 300

Query: 350 GGLTKDKTKARQLFDEITLRNLCPDIGAYNALISSLIRSDDVKCAAALMEDMEAKHIGHD 409
            G+ KDKTKARQLFDEI LRNLCPDIGAYNALISSLIRS DVK AA++MEDMEAKHI HD
Sbjct: 301 AGIVKDKTKARQLFDEIPLRNLCPDIGAYNALISSLIRSGDVKSAASVMEDMEAKHIEHD 360

Query: 410 SLTYHMMFSGLMRLEDVGGFYELYSKMVRRNFVPKTRTVIMIMKFFCENRRVDLGLELWG 469
           S+TYHMMFSGL+RLEDVGGFYELY KMV RNFVPKTRT +MIMKFFCENRRVDLGL  W 
Sbjct: 361 SVTYHMMFSGLIRLEDVGGFYELYIKMVGRNFVPKTRTAVMIMKFFCENRRVDLGLGFWA 420

Query: 470 YLVEKGYCPHSHVLDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIMKRSLLQAH 529
           YLVEKGYCPHSHVLDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIM+R LL+AH
Sbjct: 421 YLVEKGYCPHSHVLDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIMERCLLKAH 480

Query: 530 ATDNYGELEQLRKKLKTVLPP 551
           ATD Y ELE+LRKKLKTVLPP
Sbjct: 481 ATDKYEELERLRKKLKTVLPP 498

BLAST of HG10009455 vs. TAIR 10
Match: AT3G61360.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 560.5 bits (1443), Expect = 1.6e-159
Identity = 286/491 (58.25%), Positives = 366/491 (74.54%), Query Frame = 0

Query: 64  ISSHSFLILAAP-FCTHLHLHPSSQPTTEIERIAKIINDHPFPDHPLQPILLHHIPSPLP 123
           ISS S+  L AP   +   L  +S   TEIERI  IIN HPFP+HP+QPIL  HIP    
Sbjct: 5   ISSDSYRRLLAPVLSSSSSLSSTSINRTEIERITIIINGHPFPNHPIQPILAKHIPLSSL 64

Query: 124 SNTFLNDVLGRLFAAHSNGLKALEFFKFCLHHSQASSPTPDAFEKTLHILARMRYFDQSW 183
           S  F+++VLGRLFAAHSNGLKALEFFK+ L  S+ SSPT D+FEKTLHILARMRYFDQ+W
Sbjct: 65  SPEFVSEVLGRLFAAHSNGLKALEFFKYSLKSSK-SSPTSDSFEKTLHILARMRYFDQAW 124

Query: 184 ELMREIGRTHPFLLTLKSMSILLSKIAKFQSFEETIEAFQRMENEVFVGRKFGTEEFNVL 243
            LM E+ + +P LL+ KSMSILL KIAKF S+EET+EAF +ME E+F  +KFG +EFN+L
Sbjct: 125 ALMAEVRKDYPNLLSFKSMSILLCKIAKFGSYEETLEAFVKMEKEIF-RKKFGVDEFNIL 184

Query: 244 LRAFCTQRQMKEARSVFQKMYSRFPPTTKTMNLLLLGFKESTDVTAVELFYHEMIKRGFK 303
           LRAFCT+R+MKEARS+F+K++SRF P  KTMN+LLLGFKE+ DVTA ELFYHEM+KRGFK
Sbjct: 185 LRAFCTEREMKEARSIFEKLHSRFNPDVKTMNILLLGFKEAGDVTATELFYHEMVKRGFK 244

Query: 304 PNAVTYSIRIDAYCKRGCFVDGLRVLKEMERAKFEPTLETITTLIHGGGLTKDKTKARQL 363
           PN+VTY IRID +CK+  F + LR+ ++M+R  F+ T++ +TTLIHG G+ ++K KARQL
Sbjct: 245 PNSVTYGIRIDGFCKKRNFGEALRLFEDMDRLDFDITVQILTTLIHGSGVARNKIKARQL 304

Query: 364 FDEITLRNLCPDIGAYNALISSLIRSDDVKCAAALMEDMEAKHIGHDSLTYHMMFSGLMR 423
           FDEI+ R L PD GAYNAL+SSL++  DV  A  +M++ME K I  DS+T+H MF G+M+
Sbjct: 305 FDEISKRGLTPDCGAYNALMSSLMKCGDVSGAIKVMKEMEEKGIEPDSVTFHSMFIGMMK 364

Query: 424 LEDVG--GFYELYSKMVRRNFVPKTRTVIMIMKFFCENRRVDLGLELWGYLVEKGYCPHS 483
            ++ G  G  E Y KM  R+ VPKT T++M+MK FC N  V+LGL+LW Y++EKGYCPH 
Sbjct: 365 SKEFGFNGVCEYYQKMKERSLVPKTPTIVMLMKLFCHNGEVNLGLDLWKYMLEKGYCPHG 424

Query: 484 HVLDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIMKRSLLQAHATDNYGELEQL 543
           H L+LL T LCAR     AFECS Q +ERGR +SE  + +++ SL   +      EL++ 
Sbjct: 425 HALELLTTALCARRRANDAFECSWQTVERGRCVSEPVYRMLETSLSSNNELKKLEELKEE 484

Query: 544 RKKLKTVLPPP 552
            +KL + LPPP
Sbjct: 485 IQKLHSFLPPP 493

BLAST of HG10009455 vs. TAIR 10
Match: AT1G02420.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 228.4 bits (581), Expect = 1.4e-59
Identity = 138/432 (31.94%), Positives = 230/432 (53.24%), Query Frame = 0

Query: 123 SNTFLNDVLGRLFAAHSNGLKALEFFKFC-----LHHSQASSPTPDAFEKTLHILARMRY 182
           S   ++ VL R+  +H N ++ LEF+++       +HS  S  T       L+IL R R 
Sbjct: 70  SKDLIDRVLKRVRFSHGNPIQTLEFYRYASAIRGFYHSSFSLDT------MLYILGRNRK 129

Query: 183 FDQSWELMREIGRTHPFLLTLKSMSILLSKIAKFQSFEETIEAFQRMENEVFVGRKFGTE 242
           FDQ WEL+ E  R    L++ ++M ++L ++AK  S  +T+E+F + +    V   F T 
Sbjct: 130 FDQIWELLIETKRKDRSLISPRTMQVVLGRVAKLCSVRQTVESFWKFKR--LVPDFFDTA 189

Query: 243 EFNVLLRAFCTQRQMKEARSVFQKMYSRFPPTTKTMNLLLLGFKESTDVTAVELFYHEMI 302
            FN LLR  C ++ M +AR+V+  +  +F P  +T N+LL G+K S +    E F+ EM 
Sbjct: 190 CFNALLRTLCQEKSMTDARNVYHSLKHQFQPDLQTFNILLSGWKSSEE---AEAFFEEMK 249

Query: 303 KRGFKPNAVTYSIRIDAYCKRGCFVDGLRVLKEMERAKFEPTLETITTLIHGGGLTKDKT 362
            +G KP+ VTY+  ID YCK        +++ +M   +  P + T TT+I G GL     
Sbjct: 250 GKGLKPDVVTYNSLIDVYCKDREIEKAYKLIDKMREEEETPDVITYTTVIGGLGLIGQPD 309

Query: 363 KARQLFDEITLRNLCPDIGAYNALISSLIRSDDVKCAAALMEDMEAKHIGHDSLTYHMMF 422
           KAR++  E+      PD+ AYNA I +   +  +  A  L+++M  K +  ++ TY++ F
Sbjct: 310 KAREVLKEMKEYGCYPDVAAYNAAIRNFCIARRLGDADKLVDEMVKKGLSPNATTYNLFF 369

Query: 423 SGLMRLEDVGGFYELYSKMVRRNFVPKTRTVIMIMKFFCENRRVDLGLELWGYLVEKGYC 482
             L    D+G  +ELY +M+    +P T++ + ++K F  + +VD+ + LW  +V KG+ 
Sbjct: 370 RVLSLANDLGRSWELYVRMLGNECLPNTQSCMFLIKMFKRHEKVDMAMRLWEDMVVKGFG 429

Query: 483 PHSHVLDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIMKRSLLQAHATDNYGEL 542
            +S V D+L+  LC    V +A +C  +M+E+G + S  +F  +K  +  A+  D    L
Sbjct: 430 SYSLVSDVLLDLLCDLAKVEEAEKCLLEMVEKGHRPSNVSFKRIKLLMELANKHDEVNNL 489

Query: 543 EQLRKKLKTVLP 550
            Q      T +P
Sbjct: 490 IQKMAIFSTEIP 490

BLAST of HG10009455 vs. TAIR 10
Match: AT1G71060.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 170.2 bits (430), Expect = 4.6e-42
Identity = 140/523 (26.77%), Positives = 253/523 (48.37%), Query Frame = 0

Query: 30  LFNSAIAVFSIN-SRRV-KRLKMAMLKRLTQIPQIPISSHSFL---ILAAPFCTHLHLHP 89
           +F+    V  +N SRRV  R+  +    L  IP I  +S+  L     A+   T +  + 
Sbjct: 2   VFSRFFRVTGVNLSRRVYSRISSSSSPSLESIPWIHKASNFTLYGSFHASSVETQVSAND 61

Query: 90  SSQPTTEIERIAKIINDHPFPDHPLQPILLHHIPSPLPSNTFLNDVLGRLFAAHSNGLKA 149
           +SQ   + ERI KI+    F D  ++ +L  +  S   S   + +VL +L  A   G+ A
Sbjct: 62  ASQ---DAERICKILT--KFTDSKVETLL--NEASVKLSPALIEEVLKKLSNA---GVLA 121

Query: 150 LEFFKFCLHHSQASSPTPDAFEKTLHILARMRYFDQSWELMREIGRTHPFLLTLKSMSIL 209
           L  FK+   + +    T   +   +  L +++ F   W L+ ++      LL+ ++ +++
Sbjct: 122 LSVFKWA-ENQKGFKHTTSNYNALIESLGKIKQFKLIWSLVDDMKAKK--LLSKETFALI 181

Query: 210 LSKIAKFQSFEETIEAFQRMENEVFVGRKFGTEEFNVLLRAFCTQRQMKEARSVFQKM-Y 269
             + A+ +  +E I AF +ME     G K  + +FN +L      R + +A+ VF KM  
Sbjct: 182 SRRYARARKVKEAIGAFHKMEE---FGFKMESSDFNRMLDTLSKSRNVGDAQKVFDKMKK 241

Query: 270 SRFPPTTKTMNLLLLGFKESTDVTAVELFYHEMIKRGFKPNAVTYSIRIDAYCKRGCFVD 329
            RF P  K+  +LL G+ +  ++  V+    EM   GF+P+ V Y I I+A+CK   + +
Sbjct: 242 KRFEPDIKSYTILLEGWGQELNLLRVDEVNREMKDEGFEPDVVAYGIIINAHCKAKKYEE 301

Query: 330 GLRVLKEMERAKFEPTLETITTLIHGGGLTKDKTKARQLFDEITLRNLCPDIGAYNALIS 389
            +R   EME+   +P+     +LI+G G  K    A + F+         +   YNAL+ 
Sbjct: 302 AIRFFNEMEQRNCKPSPHIFCSLINGLGSEKKLNDALEFFERSKSSGFPLEAPTYNALVG 361

Query: 390 SLIRSDDVKCAAALMEDMEAKHIGHDSLTYHMMFSGLMRLEDVGGFYELYSKMVRRNFVP 449
           +   S  ++ A   +++M  K +G ++ TY ++   L+R++     YE+Y  M   +  P
Sbjct: 362 AYCWSQRMEDAYKTVDEMRLKGVGPNARTYDIILHHLIRMQRSKEAYEVYQTM---SCEP 421

Query: 450 KTRTVIMIMKFFCENRRVDLGLELWGYLVEKGYCPHSHVLDLLVTGLCARGMVLQAFECS 509
              T  ++++ FC   R+D+ +++W  +  KG  P  H+   L+T LC    + +A E  
Sbjct: 422 TVSTYEIMVRMFCNKERLDMAIKIWDEMKGKGVLPGMHMFSSLITALCHENKLDEACEYF 481

Query: 510 KQMLERGRQMSEAAFLIMKRSLL----QAHATDNYGELEQLRK 543
            +ML+ G +     F  +K++LL    +   TD   ++++LRK
Sbjct: 482 NEMLDVGIRPPGHMFSRLKQTLLDEGRKDKVTDLVVKMDRLRK 505

BLAST of HG10009455 vs. TAIR 10
Match: AT1G77360.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 165.6 bits (418), Expect = 1.1e-40
Identity = 107/429 (24.94%), Positives = 212/429 (49.42%), Query Frame = 0

Query: 123 SNTFLNDVLGRLFAAHSNGLKALEFFKFCLHHSQASSPTPDAFEKTLHILARMRYFDQSW 182
           S   + DVL R     + GL    FF++     +    +  A+   +   A++R +   W
Sbjct: 99  SQEVVEDVLNRF---RNAGLLTYRFFQWS-EKQRHYEHSVRAYHMMIESTAKIRQYKLMW 158

Query: 183 ELMREIGRTHPFLLTLKSMSILLSKIAKFQSFEETIEAFQRMENEVFVGRKFGTEEFNVL 242
           +L+  + +    +L +++  I++ K A+ Q  +E I AF  ME             FN L
Sbjct: 159 DLINAMRKKK--MLNVETFCIVMRKYARAQKVDEAIYAFNVMEKYDLPPNLVA---FNGL 218

Query: 243 LRAFCTQRQMKEARSVFQKMYSRFPPTTKTMNLLLLGFKESTDVTAVELFYHEMIKRGFK 302
           L A C  + +++A+ VF+ M  RF P +KT ++LL G+ +  ++      + EMI  G  
Sbjct: 219 LSALCKSKNVRKAQEVFENMRDRFTPDSKTYSILLEGWGKEPNLPKAREVFREMIDAGCH 278

Query: 303 PNAVTYSIRIDAYCKRGCFVDGLRVLKEMERAKFEPTLETITTLIHGGGLTKDKTKARQL 362
           P+ VTYSI +D  CK G   + L +++ M+ +  +PT    + L+H  G      +A   
Sbjct: 279 PDIVTYSIMVDILCKAGRVDEALGIVRSMDPSICKPTTFIYSVLVHTYGTENRLEEAVDT 338

Query: 363 FDEITLRNLCPDIGAYNALISSLIRSDDVKCAAALMEDMEAKHIGHDSLTYHMMFSGLMR 422
           F E+    +  D+  +N+LI +  +++ +K    ++++M++K +  +S + +++   L+ 
Sbjct: 339 FLEMERSGMKADVAVFNSLIGAFCKANRMKNVYRVLKEMKSKGVTPNSKSCNIILRHLIE 398

Query: 423 LEDVGGFYELYSKMVRRNFVPKTRTVIMIMKFFCENRRVDLGLELWGYLVEKGYCPHSHV 482
             +    ++++ KM+ +   P   T  M++K FCE + ++   ++W Y+ +KG  P  H 
Sbjct: 399 RGEKDEAFDVFRKMI-KVCEPDADTYTMVIKMFCEKKEMETADKVWKYMRKKGVFPSMHT 458

Query: 483 LDLLVTGLCARGMVLQAFECSKQMLERGRQMSEAAFLIMKRSLLQAHATDNYGELEQLRK 542
             +L+ GLC      +A    ++M+E G + S   F  +++ L++    D    L+ L +
Sbjct: 459 FSVLINGLCEERTTQKACVLLEEMIEMGIRPSGVTFGRLRQLLIKEERED---VLKFLNE 514

Query: 543 KLKTVLPPP 552
           K+  ++  P
Sbjct: 519 KMNVLVNEP 514

BLAST of HG10009455 vs. TAIR 10
Match: AT1G74900.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 147.1 bits (370), Expect = 4.2e-35
Identity = 102/406 (25.12%), Positives = 186/406 (45.81%), Query Frame = 0

Query: 88  PTTEIERIAKIINDHPFPDHPLQPILLHHIPSPLPSNTFLNDVLGRLFAAHSNGLKALEF 147
           P  +   IAK+I   P   H     LL    +P   N  +N VL RL+   ++G KAL+F
Sbjct: 21  PPADSAAIAKLILSSPNTTHQDDQFLLSTKTTPWTPN-LVNSVLKRLW---NHGPKALQF 80

Query: 148 FKFCLHHSQASSPTPDAFEKTLHILARMRYFDQSWELMREIGRTHPFLLTLKSMSILLSK 207
           F F  +H +       +F+  + I AR+      W L+  + R+     + K+ +I+  +
Sbjct: 81  FHFLDNHHREYVHDASSFDLAIDIAARLHLHPTVWSLIHRM-RSLRIGPSPKTFAIVAER 140

Query: 208 IAKFQSFEETIEAFQRMENEVFVGRKFGTEEFNVLLRAFCTQRQMKEARSVFQKMYSRFP 267
            A     ++ ++ F  M      G       FN +L   C  +++++A  +F+ +  RF 
Sbjct: 141 YASAGKPDKAVKLFLNMHEH---GCFQDLASFNTILDVLCKSKRVEKAYELFRALRGRFS 200

Query: 268 PTTKTMNLLLLGFKESTDVTAVELFYHEMIKRGFKPNAVTYSIRIDAYCKRGCFVDGLRV 327
             T T N++L G+              EM++RG  PN  TY+  +  + + G        
Sbjct: 201 VDTVTYNVILNGWCLIKRTPKALEVLKEMVERGINPNLTTYNTMLKGFFRAGQIRHAWEF 260

Query: 328 LKEMERAKFEPTLETITTLIHGGGLTKDKTKARQLFDEITLRNLCPDIGAYNALISSLIR 387
             EM++   E  + T TT++HG G+  +  +AR +FDE+    + P +  YNA+I  L +
Sbjct: 261 FLEMKKRDCEIDVVTYTTVVHGFGVAGEIKRARNVFDEMIREGVLPSVATYNAMIQVLCK 320

Query: 388 SDDVKCAAALMEDMEAKHIGHDSLTYHMMFSGLMRLEDVGGFYELYSKMVRRNFVPKTRT 447
            D+V+ A  + E+M  +    +  TY+++  GL    +     EL  +M      P  +T
Sbjct: 321 KDNVENAVVMFEEMVRRGYEPNVTTYNVLIRGLFHAGEFSRGEELMQRMENEGCEPNFQT 380

Query: 448 VIMIMKFFCENRRVDLGLELWGYLVEKGYCPHSHVLDLLVTGLCAR 494
             M+++++ E   V+  L L+  +      P+    ++L++G+  R
Sbjct: 381 YNMMIRYYSECSEVEKALGLFEKMGSGDCLPNLDTYNILISGMFVR 418

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038876095.1  3.2e-271  91.23     pentatricopeptide repeat-containing protein At3g61360 [Benincasa hispida] >XP_03...
XP_016898809.1  1.6e-259  88.54     PREDICTED: pentatricopeptide repeat-containing protein At3g61360 [Cucumis melo]
KAA0055168.1    1.8e-258  88.67     pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]
TYK00295.1      3.1e-258  88.67     pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]
XP_022929103.1  2.1e-254  87.52     pentatricopeptide repeat-containing protein At3g61360 [Cucurbita moschata] >XP_0...

Match Name  E-value   Identity  Description
Q9M2C8      2.2e-158  58.25     Pentatricopeptide repeat-containing protein At3g61360 OS=Arabidopsis thaliana OX...
Q9FZ19      2.0e-58   31.94     Putative pentatricopeptide repeat-containing protein At1g02420 OS=Arabidopsis th...
Q9C9A2      6.5e-41   26.77     Pentatricopeptide repeat-containing protein At1g71060, mitochondrial OS=Arabidop...
Q9FVX2      1.6e-39   24.94     Pentatricopeptide repeat-containing protein At1g77360, mitochondrial OS=Arabidop...
Q9S7R4      2.6e-34   25.12     Pentatricopeptide repeat-containing protein At1g74900, mitochondrial OS=Arabidop...

Match Name  E-value   Identity  Description
A0A1S4DS46  8.0e-260  88.54     pentatricopeptide repeat-containing protein At3g61360 OS=Cucumis melo OX=3656 GN...
A0A5A7UJI2  8.8e-259  88.67     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A5D3BMK0  1.5e-258  88.67     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A6J1EM60  1.0e-254  87.52     pentatricopeptide repeat-containing protein At3g61360 OS=Cucurbita moschata OX=3...
A0A0A0KRS8  2.7e-252  88.22     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G173440 PE=4 SV=1

Match Name   E-value   Identity  Description
AT3G61360.1  1.6e-159  58.25     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G02420.1  1.4e-59   31.94     Pentatricopeptide repeat (PPR) superfamily protein
AT1G71060.1  4.6e-42   26.77     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G77360.1  1.1e-40   24.94     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G74900.1  4.2e-35   25.12     Pentatricopeptide repeat (PPR) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                                    Source   Source Term      Source Description                           Alignment
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041          PPR_2                                        coord: 373..421; e-value: 8.8E-8; score: 32.2
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041          PPR_2                                        coord: 268..317; e-value: 1.6E-9; score: 37.8
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535          PPR                                          coord: 239..263; e-value: 0.0032; score: 17.6
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756        TIGR00756                                    coord: 412..443; e-value: 6.8E-4; score: 17.6
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756        TIGR00756                                    coord: 306..339; e-value: 1.3E-8; score: 32.5
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756        TIGR00756                                    coord: 239..265; e-value: 1.5E-4; score: 19.7
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375          PPR                                          coord: 235..265; score: 8.527949
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375          PPR                                          coord: 409..443; score: 9.45966
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375          PPR                                          coord: 269..303; score: 8.648523
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375          PPR                                          coord: 304..338; score: 11.673842
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   1.25.40.10       Tetratricopeptide repeat domain              coord: 163..334; e-value: 2.1E-26; score: 95.1
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   1.25.40.10       Tetratricopeptide repeat domain              coord: 352..560; e-value: 4.5E-29; score: 103.8
None       No IPR available                                   PANTHER  PTHR45613        PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN  coord: 78..549
None       No IPR available                                   PANTHER  PTHR45613:SF143  OS06G0694600 PROTEIN                         coord: 78..549

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10009455.1  HG10009455.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0005739      mitochondrion
molecular_function  GO:0005515      protein binding