HG10007490 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10007490
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pentatricopeptide repeat-containing protein
Location: Chr10: 5932720 .. 5934114 (-)
RNA-Seq Expression: HG10007490
Synteny: HG10007490
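The Location field can be turned into a feature length with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GFF-style convention, not stated on this page); `parse_location` and `span_length` are hypothetical helper names used only for illustration.

```python
import re

def parse_location(text):
    """Split a location string such as 'Chr10: 5932720 .. 5934114 (-)'
    into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Chr10: 5932720 .. 5934114 (-)")
length = span_length(start, end)
# Print the span length and whether it is a whole number of codons.
print(seqid, strand, length, length % 3 == 0)
```

Under that assumption the span is 1395 nt, a whole number of codons, which fits a gene whose genomic, mRNA and coding sequences are the same uninterrupted reading frame.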
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTTTGCCACCCATCTTCCCACAACCACTTCACCTTCACTTACGCTCTCAAAGCTTGTTGCGTCCTTCATCAAACCCATAAGGGCCTCGAAATCCATGCCCATGTCATCAAATCAGGTCATCTTTCTGACATCTTCATCCAAAATTCTCTTCTCCATTTCTACATTCTCAATGGCGATGTTCCTTCTGCTTCTCAAATCTTTGATTCCATCCCTGACCCAGATGTTGTTTCGTGGACTTCAATCATTTCGGGCCTTTCCAAGTTGGGTTTTGAAGAAGAAGCTCTAGGTAAGTTCTTGTCCATGAATGTGAGGCCTAATTCTACTACTCTTGTTACTGCTTTATCAGCTTGTTCTAGTCTTAGATGTCTCAAGCTTGGGAAAGCTATACATGGGCTTAGATTGAGGAGTTTGAATGAGGAAAGTGTTAGTTTGGACAATGCCCTTCTTGATTTTTATGTTAGATGTGGCTATTTGAGGAGTGCAAAGAACCTATTTGATGCAATGCCTAAAAGAGATGTAGTGACTTGGACTACAATGATTGGGGGTTATGCACAGAGAGGATTGTGCGAAGAGGCTGTGAGGGTATTTCAAAACATGGTTCATGTGGGAGAGGCCATGCCCAATGAGGCCACTCTAGTTAATGTATTATCTGCATGTTCTTCCATTTCTGCTCTGCATTTAGGTCAATGGGTGCATTCCTATATCAACTCTAGGCATGATGTGATAATTGATGGAAATGTTGGAAATGCTTTGATTAACATGTATGTCAAATGTGGCAAGATGGAAATGGCAATTTTGATCTTCAAAGCTATTGAGCACAAGGATATCATATCATGGAGTACAATCATTAGTGGGTTAGCCATGAATGGCCTAGGAAAGCAAGCTTTTTGTCTCTTCTCACTCATGCTAGTTCATGGCATTTCTCCTGATGACATAACATTTCTTGGCTTGTTATCTGCTTGCAGCCATGGTGGGTTGATCAACCAAGGTTTAATGGTGTTTGAAGCCATGAAAAATGTTTATAATCTTCCACCTGAGATGAGGCATTATGCTTGCATGGTAGACATGTATGGAAAGGCTGGGCTATTAGATGAAGCAGAGGCATTCATAAAGGAGATGCCTGTGGAAGCAGAAGGCTCAGTTTGGGGAGCTTTGCTTCATGCTTGTCAAATTCATGGGAATGAGAAGAAATATGAGAAAGTTAGGGAATGGCTGCTTGGAAGCAAGGGAGTTACAGTAGGAACTTTTGCTTTGTTATCAAATACTTATGCTAGTTGTGATAAATGGAATTATGCCAATGAAGTTCGAGATACCATGAGAAGTAGAGGGTTGAAGAAAATGGCTGGATGTAGTTGGATTGAATTGGTTGATCCCTTGAATCCTTTAGGTTAA

mRNA sequence

ATGCTTTGCCACCCATCTTCCCACAACCACTTCACCTTCACTTACGCTCTCAAAGCTTGTTGCGTCCTTCATCAAACCCATAAGGGCCTCGAAATCCATGCCCATGTCATCAAATCAGGTCATCTTTCTGACATCTTCATCCAAAATTCTCTTCTCCATTTCTACATTCTCAATGGCGATGTTCCTTCTGCTTCTCAAATCTTTGATTCCATCCCTGACCCAGATGTTGTTTCGTGGACTTCAATCATTTCGGGCCTTTCCAAGTTGGGTTTTGAAGAAGAAGCTCTAGGTAAGTTCTTGTCCATGAATGTGAGGCCTAATTCTACTACTCTTGTTACTGCTTTATCAGCTTGTTCTAGTCTTAGATGTCTCAAGCTTGGGAAAGCTATACATGGGCTTAGATTGAGGAGTTTGAATGAGGAAAGTGTTAGTTTGGACAATGCCCTTCTTGATTTTTATGTTAGATGTGGCTATTTGAGGAGTGCAAAGAACCTATTTGATGCAATGCCTAAAAGAGATGTAGTGACTTGGACTACAATGATTGGGGGTTATGCACAGAGAGGATTGTGCGAAGAGGCTGTGAGGGTATTTCAAAACATGGTTCATGTGGGAGAGGCCATGCCCAATGAGGCCACTCTAGTTAATGTATTATCTGCATGTTCTTCCATTTCTGCTCTGCATTTAGGTCAATGGGTGCATTCCTATATCAACTCTAGGCATGATGTGATAATTGATGGAAATGTTGGAAATGCTTTGATTAACATGTATGTCAAATGTGGCAAGATGGAAATGGCAATTTTGATCTTCAAAGCTATTGAGCACAAGGATATCATATCATGGAGTACAATCATTAGTGGGTTAGCCATGAATGGCCTAGGAAAGCAAGCTTTTTGTCTCTTCTCACTCATGCTAGTTCATGGCATTTCTCCTGATGACATAACATTTCTTGGCTTGTTATCTGCTTGCAGCCATGGTGGGTTGATCAACCAAGGTTTAATGGTGTTTGAAGCCATGAAAAATGTTTATAATCTTCCACCTGAGATGAGGCATTATGCTTGCATGGTAGACATGTATGGAAAGGCTGGGCTATTAGATGAAGCAGAGGCATTCATAAAGGAGATGCCTGTGGAAGCAGAAGGCTCAGTTTGGGGAGCTTTGCTTCATGCTTGTCAAATTCATGGGAATGAGAAGAAATATGAGAAAGTTAGGGAATGGCTGCTTGGAAGCAAGGGAGTTACAGTAGGAACTTTTGCTTTGTTATCAAATACTTATGCTAGTTGTGATAAATGGAATTATGCCAATGAAGTTCGAGATACCATGAGAAGTAGAGGGTTGAAGAAAATGGCTGGATGTAGTTGGATTGAATTGGTTGATCCCTTGAATCCTTTAGGTTAA

Coding sequence (CDS)

ATGCTTTGCCACCCATCTTCCCACAACCACTTCACCTTCACTTACGCTCTCAAAGCTTGTTGCGTCCTTCATCAAACCCATAAGGGCCTCGAAATCCATGCCCATGTCATCAAATCAGGTCATCTTTCTGACATCTTCATCCAAAATTCTCTTCTCCATTTCTACATTCTCAATGGCGATGTTCCTTCTGCTTCTCAAATCTTTGATTCCATCCCTGACCCAGATGTTGTTTCGTGGACTTCAATCATTTCGGGCCTTTCCAAGTTGGGTTTTGAAGAAGAAGCTCTAGGTAAGTTCTTGTCCATGAATGTGAGGCCTAATTCTACTACTCTTGTTACTGCTTTATCAGCTTGTTCTAGTCTTAGATGTCTCAAGCTTGGGAAAGCTATACATGGGCTTAGATTGAGGAGTTTGAATGAGGAAAGTGTTAGTTTGGACAATGCCCTTCTTGATTTTTATGTTAGATGTGGCTATTTGAGGAGTGCAAAGAACCTATTTGATGCAATGCCTAAAAGAGATGTAGTGACTTGGACTACAATGATTGGGGGTTATGCACAGAGAGGATTGTGCGAAGAGGCTGTGAGGGTATTTCAAAACATGGTTCATGTGGGAGAGGCCATGCCCAATGAGGCCACTCTAGTTAATGTATTATCTGCATGTTCTTCCATTTCTGCTCTGCATTTAGGTCAATGGGTGCATTCCTATATCAACTCTAGGCATGATGTGATAATTGATGGAAATGTTGGAAATGCTTTGATTAACATGTATGTCAAATGTGGCAAGATGGAAATGGCAATTTTGATCTTCAAAGCTATTGAGCACAAGGATATCATATCATGGAGTACAATCATTAGTGGGTTAGCCATGAATGGCCTAGGAAAGCAAGCTTTTTGTCTCTTCTCACTCATGCTAGTTCATGGCATTTCTCCTGATGACATAACATTTCTTGGCTTGTTATCTGCTTGCAGCCATGGTGGGTTGATCAACCAAGGTTTAATGGTGTTTGAAGCCATGAAAAATGTTTATAATCTTCCACCTGAGATGAGGCATTATGCTTGCATGGTAGACATGTATGGAAAGGCTGGGCTATTAGATGAAGCAGAGGCATTCATAAAGGAGATGCCTGTGGAAGCAGAAGGCTCAGTTTGGGGAGCTTTGCTTCATGCTTGTCAAATTCATGGGAATGAGAAGAAATATGAGAAAGTTAGGGAATGGCTGCTTGGAAGCAAGGGAGTTACAGTAGGAACTTTTGCTTTGTTATCAAATACTTATGCTAGTTGTGATAAATGGAATTATGCCAATGAAGTTCGAGATACCATGAGAAGTAGAGGGTTGAAGAAAATGGCTGGATGTAGTTGGATTGAATTGGTTGATCCCTTGAATCCTTTAGGTTAA

Protein sequence

MLCHPSSHNHFTFTYALKACCVLHQTHKGLEIHAHVIKSGHLSDIFIQNSLLHFYILNGDVPSASQIFDSIPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSTTLVTALSACSSLRCLKLGKAIHGLRLRSLNEESVSLDNALLDFYVRCGYLRSAKNLFDAMPKRDVVTWTTMIGGYAQRGLCEEAVRVFQNMVHVGEAMPNEATLVNVLSACSSISALHLGQWVHSYINSRHDVIIDGNVGNALINMYVKCGKMEMAILIFKAIEHKDIISWSTIISGLAMNGLGKQAFCLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLMVFEAMKNVYNLPPEMRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGSVWGALLHACQIHGNEKKYEKVREWLLGSKGVTVGTFALLSNTYASCDKWNYANEVRDTMRSRGLKKMAGCSWIELVDPLNPLG
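The relationship between the coding sequence and the protein sequence above can be checked with a small translation routine. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); `translate`, `cds_start` and `cds_end` are illustrative names, and only the first 30 and last 12 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, written in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate an in-frame DNA string codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

cds_start = "ATGCTTTGCCACCCATCTTCCCACAACCAC"  # first 30 nt of the CDS above
cds_end = "CCTTTAGGTTAA"                      # last 12 nt of the CDS above

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # last residues followed by the stop codon
```

The two fragments translate to `MLCHPSSHNH` and `PLG*`, matching the first and last residues of the listed protein; the trailing stop codon is not part of the protein sequence.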
Homology
BLAST of HG10007490 vs. NCBI nr
Match: XP_038878297.1 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Benincasa hispida])

HSP 1 Score: 883.2 bits (2281), Expect = 9.4e-253
Identity = 429/464 (92.46%), Positives = 450/464 (96.98%), Query Frame = 0

Query: 1   MLCHPSSHNHFTFTYALKACCVLHQTHKGLEIHAHVIKSGHLSDIFIQNSLLHFYILNGD 60
           ML +PSSHNHFTFTYALKACC LH+T KGLEIHAH+IKSGHLSDIF+QNSLLHFYIL+GD
Sbjct: 54  MLRYPSSHNHFTFTYALKACCFLHETQKGLEIHAHLIKSGHLSDIFLQNSLLHFYILDGD 113

Query: 61  VPSASQIFDSIPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSTTLVTALSACSS 120
           VPSAS+IFDSIPDPDV+SWTSIISGLSKLGFE+EALGKFLSMNVRPNSTTLVTALSACSS
Sbjct: 114 VPSASRIFDSIPDPDVISWTSIISGLSKLGFEKEALGKFLSMNVRPNSTTLVTALSACSS 173

Query: 121 LRCLKLGKAIHGLRLRSLNEESVSLDNALLDFYVRCGYLRSAKNLFDAMPKRDVVTWTTM 180
           LRCLKLGKAIHGLRLRSLNEE+VSLDNALLDFYVRCGYLRSA+ LFD MPKRDVV+WTTM
Sbjct: 174 LRCLKLGKAIHGLRLRSLNEENVSLDNALLDFYVRCGYLRSAEYLFDEMPKRDVVSWTTM 233

Query: 181 IGGYAQRGLCEEAVRVFQNMVHVGEAMPNEATLVNVLSACSSISALHLGQWVHSYINSRH 240
           IGGYAQRGLCEEAVRVFQNMVHVGEA+PNEATL+NVLSACSSISALHLGQWVHSYINSRH
Sbjct: 234 IGGYAQRGLCEEAVRVFQNMVHVGEAIPNEATLINVLSACSSISALHLGQWVHSYINSRH 293

Query: 241 DVIIDGNVGNALINMYVKCGKMEMAILIFKAIEHKDIISWSTIISGLAMNGLGKQAFCLF 300
           DVIIDGNVGNALINMYVKCG MEMAILIFKAIEHKDIISWSTIISGLAMNGLG QAF LF
Sbjct: 294 DVIIDGNVGNALINMYVKCGNMEMAILIFKAIEHKDIISWSTIISGLAMNGLGNQAFGLF 353

Query: 301 SLMLVHGISPDDITFLGLLSACSHGGLINQGLMVFEAMKNVYNLPPEMRHYACMVDMYGK 360
           SLMLVHGISPDDITFL LLSACSHGGLINQGLMVFEAMK+VYN+ P+MRHYACMVD+YGK
Sbjct: 354 SLMLVHGISPDDITFLSLLSACSHGGLINQGLMVFEAMKDVYNISPQMRHYACMVDLYGK 413

Query: 361 AGLLDEAEAFIKEMPVEAEGSVWGALLHACQIHGNEKKYEKVREWLLGSKGVTVGTFALL 420
           AGLLDEAEAFIKEMP+EAEGSVWGALLHACQIHGNEKKYEKV+EWLLGSKGVTVGTFALL
Sbjct: 414 AGLLDEAEAFIKEMPMEAEGSVWGALLHACQIHGNEKKYEKVKEWLLGSKGVTVGTFALL 473

Query: 421 SNTYASCDKWNYANEVRDTMRSRGLKKMAGCSWIELVDPLNPLG 465
           SNTYASCD+WN ANEVRDTMRS+GLKKMAGCSWIELVD  N LG
Sbjct: 474 SNTYASCDRWNDANEVRDTMRSKGLKKMAGCSWIELVDSSNSLG 517
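The Identity figure in an HSP header is the number of alignment columns in which query and subject carry the same residue, divided by the alignment length. The sketch below recomputes that count for the first row of the alignment above and re-derives the reported percentages; `identity` and `percent` are hypothetical helper names. Positives additionally depend on the scoring matrix, which this page does not name, so they are not recomputed here.

```python
def identity(query_row, sbjct_row):
    """Return (identical columns, total columns) for two aligned rows."""
    if len(query_row) != len(sbjct_row):
        raise ValueError("aligned rows must have the same length")
    same = sum(q == s and q != "-" for q, s in zip(query_row, sbjct_row))
    return same, len(query_row)

def percent(count, total):
    """Percentage rounded to two decimals, as printed in the HSP header."""
    return round(100.0 * count / total, 2)

# First row (query 1-60) of the XP_038878297.1 alignment.
query_row = "MLCHPSSHNHFTFTYALKACCVLHQTHKGLEIHAHVIKSGHLSDIFIQNSLLHFYILNGD"
sbjct_row = "MLRYPSSHNHFTFTYALKACCFLHETQKGLEIHAHLIKSGHLSDIFLQNSLLHFYILDGD"

same, total = identity(query_row, sbjct_row)
print(same, total, percent(same, total))

# Re-derive the header percentages from the reported counts.
print(percent(429, 464), percent(450, 464))
```

Summing the per-row counts over all rows of an HSP should reproduce the numerator reported in its header.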

BLAST of HG10007490 vs. NCBI nr
Match: TYK18848.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 861.7 bits (2225), Expect = 2.9e-246
Identity = 421/463 (90.93%), Positives = 443/463 (95.68%), Query Frame = 0

Query: 1   MLCHPSSHNHFTFTYALKACCVLHQTHKGLEIHAHVIKSGHLSDIFIQNSLLHFYILNGD 60
           ML +PSSHNHFTFTYALKACC LHQT KGLEIHAH+IKSGHLSDIFIQNSLLHFYILNGD
Sbjct: 159 MLHYPSSHNHFTFTYALKACCFLHQTQKGLEIHAHLIKSGHLSDIFIQNSLLHFYILNGD 218

Query: 61  VPSASQIFDSIPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSTTLVTALSACSS 120
           V SAS IFDSIP+PDVVSWTSIISGLSKLGFE+EALGKFLSMNVRPNSTTLVTALSACSS
Sbjct: 219 VSSASLIFDSIPNPDVVSWTSIISGLSKLGFEKEALGKFLSMNVRPNSTTLVTALSACSS 278

Query: 121 LRCLKLGKAIHGLRLRSLNEESVSLDNALLDFYVRCGYLRSAKNLFDAMPKRDVVTWTTM 180
           LRCLKLGKAIHGLRLR+LNEE+VSL+NALLDFYVRC YLRSA+NLF+ M KRDVV+WTTM
Sbjct: 279 LRCLKLGKAIHGLRLRTLNEENVSLENALLDFYVRCAYLRSAENLFEKMHKRDVVSWTTM 338

Query: 181 IGGYAQRGLCEEAVRVFQNMVHVGEAMPNEATLVNVLSACSSISALHLGQWVHSYINSRH 240
           IGGYAQ GLCEEAVRVFQNMVHVGEA+PNEATLVNVLSACSSISALHLGQWVHSYINSRH
Sbjct: 339 IGGYAQSGLCEEAVRVFQNMVHVGEAIPNEATLVNVLSACSSISALHLGQWVHSYINSRH 398

Query: 241 DVIIDGNVGNALINMYVKCGKMEMAILIFKAIEHKDIISWSTIISGLAMNGLGKQAFCLF 300
           DVIIDGNVGNALINMYVKCG MEMAILIF AIEHKDIISWST+ISGLAMNGLGKQAF LF
Sbjct: 399 DVIIDGNVGNALINMYVKCGNMEMAILIFNAIEHKDIISWSTVISGLAMNGLGKQAFVLF 458

Query: 301 SLMLVHGISPDDITFLGLLSACSHGGLINQGLMVFEAMKNVYNLPPEMRHYACMVDMYGK 360
           SLMLVHGISPDDITFLGLLSACSHGGLINQG+MVFEAMK+VYN+ P++RHYACMVDMYGK
Sbjct: 459 SLMLVHGISPDDITFLGLLSACSHGGLINQGMMVFEAMKDVYNISPQIRHYACMVDMYGK 518

Query: 361 AGLLDEAEAFIKEMPVEAEGSVWGALLHACQIHGNEKKYEKVREWLLGSKGVTVGTFALL 420
           AGLLDEAEAFIKEMP+EAEG VWGALLHACQIHGNEKKYEKVRE LLGSKGVTVG FALL
Sbjct: 519 AGLLDEAEAFIKEMPMEAEGPVWGALLHACQIHGNEKKYEKVRECLLGSKGVTVGAFALL 578

Query: 421 SNTYASCDKWNYANEVRDTMRSRGLKKMAGCSWIELVDPLNPL 464
           SNTYASCD+WN AN+VR  MRSRGLKKMAGCSWIELV+P NP+
Sbjct: 579 SNTYASCDRWNDANDVRVAMRSRGLKKMAGCSWIELVNPSNPV 621

BLAST of HG10007490 vs. NCBI nr
Match: XP_011660133.1 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic isoform X1 [Cucumis sativus] >XP_031745752.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic isoform X1 [Cucumis sativus] >XP_031745757.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic isoform X1 [Cucumis sativus] >XP_031745768.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic isoform X1 [Cucumis sativus] >KGN66483.1 hypothetical protein Csa_006958 [Cucumis sativus])

HSP 1 Score: 859.0 bits (2218), Expect = 1.9e-245
Identity = 418/463 (90.28%), Positives = 441/463 (95.25%), Query Frame = 0

Query: 1   MLCHPSSHNHFTFTYALKACCVLHQTHKGLEIHAHVIKSGHLSDIFIQNSLLHFYILNGD 60
           ML +PSSHNHFTFTYALKACC LHQT KGLEIHAH+IKSGHLSDIFIQNSLLHFYIL+GD
Sbjct: 54  MLRYPSSHNHFTFTYALKACCFLHQTQKGLEIHAHLIKSGHLSDIFIQNSLLHFYILDGD 113

Query: 61  VPSASQIFDSIPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSTTLVTALSACSS 120
           V SAS IFDSIPDPDVVSWTSIISGLSKLGFE+EAL KFLSMNVRPNSTTLVTALSACSS
Sbjct: 114 VSSASLIFDSIPDPDVVSWTSIISGLSKLGFEKEALSKFLSMNVRPNSTTLVTALSACSS 173

Query: 121 LRCLKLGKAIHGLRLRSLNEESVSLDNALLDFYVRCGYLRSAKNLFDAMPKRDVVTWTTM 180
           LRCLKLGKAIHGLR+R+LNEE+V L+NALLDFYVRC YLRSA+NLF+ MPKRDVV+WTTM
Sbjct: 174 LRCLKLGKAIHGLRMRTLNEENVILENALLDFYVRCAYLRSAENLFEKMPKRDVVSWTTM 233

Query: 181 IGGYAQRGLCEEAVRVFQNMVHVGEAMPNEATLVNVLSACSSISALHLGQWVHSYINSRH 240
           IGGYAQ GLCEEAVRVFQNMVHVGEA+PNEATLVNVLSACSSISALHLGQWVHSYINSRH
Sbjct: 234 IGGYAQSGLCEEAVRVFQNMVHVGEAIPNEATLVNVLSACSSISALHLGQWVHSYINSRH 293

Query: 241 DVIIDGNVGNALINMYVKCGKMEMAILIFKAIEHKDIISWSTIISGLAMNGLGKQAFCLF 300
           DVIIDGNVGNALINMYVKCG MEMAILIFKAIEHKDI+SWSTIISGLAMNGLGKQAF LF
Sbjct: 294 DVIIDGNVGNALINMYVKCGNMEMAILIFKAIEHKDIVSWSTIISGLAMNGLGKQAFVLF 353

Query: 301 SLMLVHGISPDDITFLGLLSACSHGGLINQGLMVFEAMKNVYNLPPEMRHYACMVDMYGK 360
           SLMLVHG+SPDDITFLGLLSACSHGGLINQG+MVFEAMK+VYN+ P+MRHYACMVDMYGK
Sbjct: 354 SLMLVHGVSPDDITFLGLLSACSHGGLINQGMMVFEAMKDVYNISPQMRHYACMVDMYGK 413

Query: 361 AGLLDEAEAFIKEMPVEAEGSVWGALLHACQIHGNEKKYEKVREWLLGSKGVTVGTFALL 420
           AGLLDEAEAFIKEMP+EAEG VWGALLHACQ+HGNEKKYEKVREWLLGSKGVTVGTFALL
Sbjct: 414 AGLLDEAEAFIKEMPMEAEGPVWGALLHACQLHGNEKKYEKVREWLLGSKGVTVGTFALL 473

Query: 421 SNTYASCDKWNYANEVRDTMRSRGLKKMAGCSWIELVDPLNPL 464
           SNTYA CD+WN AN+VR  MRSRGLKKMAG SWIE+VD   PL
Sbjct: 474 SNTYACCDRWNDANDVRVAMRSRGLKKMAGRSWIEMVDSTYPL 516

BLAST of HG10007490 vs. NCBI nr
Match: XP_008450427.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X1 [Cucumis melo] >XP_016900885.1 PREDICTED: pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X1 [Cucumis melo] >XP_016900886.1 PREDICTED: pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X1 [Cucumis melo])

HSP 1 Score: 856.3 bits (2211), Expect = 1.2e-244
Identity = 419/464 (90.30%), Positives = 442/464 (95.26%), Query Frame = 0

Query: 1   MLCHPSSHNHFTFTYALKACCVLHQTHKGLEIHAHVIKSGHLSDIFIQNSLLHFYILNGD 60
           ML +PSSHNHFTFTYALKACC LHQT KGLEIHAH+IKSGHLSDIFIQNSLLHFYIL+GD
Sbjct: 54  MLHYPSSHNHFTFTYALKACCFLHQTQKGLEIHAHLIKSGHLSDIFIQNSLLHFYILHGD 113

Query: 61  VPSASQIFDSIPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSTTLVTALSACSS 120
           V SAS IFDSIP+PDVVSWTSIISG SKLGFE+EALGKFLSMNVRPNSTTLVTALSACSS
Sbjct: 114 VSSASLIFDSIPNPDVVSWTSIISGFSKLGFEKEALGKFLSMNVRPNSTTLVTALSACSS 173

Query: 121 LRCLKLGKAIHGLRLRSLNEESVSLDNALLDFYVRCGYLRSAKNLFDAMPKRDVVTWTTM 180
           LR LKLGKAIHGLRLR+LNEE+VSL+NALLDFYVRC YLRSA+NLF+ M KRDVV+WTTM
Sbjct: 174 LRRLKLGKAIHGLRLRTLNEENVSLENALLDFYVRCAYLRSAENLFEKMHKRDVVSWTTM 233

Query: 181 IGGYAQRGLCEEAVRVFQNMVHVGEAMPNEATLVNVLSACSSISALHLGQWVHSYINSRH 240
           IGGYAQ GLCEEAVRVFQNMVH GEA+PNEATLVNVLSACSSISALHLGQWVHSYINSRH
Sbjct: 234 IGGYAQSGLCEEAVRVFQNMVHAGEAIPNEATLVNVLSACSSISALHLGQWVHSYINSRH 293

Query: 241 DVIIDGNVGNALINMYVKCGKMEMAILIFKAIEHKDIISWSTIISGLAMNGLGKQAFCLF 300
           DVIIDGNVGNALINMYVKCG MEMAILIFKAIEHKDIISWST+ISGLAMNGLGKQAF LF
Sbjct: 294 DVIIDGNVGNALINMYVKCGNMEMAILIFKAIEHKDIISWSTVISGLAMNGLGKQAFVLF 353

Query: 301 SLMLVHGISPDDITFLGLLSACSHGGLINQGLMVFEAMKNVYNLPPEMRHYACMVDMYGK 360
           SLMLVHGISPDDITFLGLLSACSHGGLINQG+MVFEAMK+VYN+ P++RHYACMVDMYGK
Sbjct: 354 SLMLVHGISPDDITFLGLLSACSHGGLINQGMMVFEAMKDVYNISPQIRHYACMVDMYGK 413

Query: 361 AGLLDEAEAFIKEMPVEAEGSVWGALLHACQIHGNEKKYEKVREWLLGSKGVTVGTFALL 420
           AGLLDEAEAFIKEMP+EAEG VWGALLHACQIHGNEKKYEKVRE LLGSKGVTVG FALL
Sbjct: 414 AGLLDEAEAFIKEMPMEAEGPVWGALLHACQIHGNEKKYEKVRECLLGSKGVTVGAFALL 473

Query: 421 SNTYASCDKWNYANEVRDTMRSRGLKKMAGCSWIELVDPLNPLG 465
           SNTYASCD+WN AN+VR  MRSRGLKKMAGCSWIELV+P NP+G
Sbjct: 474 SNTYASCDRWNDANDVRVAMRSRGLKKMAGCSWIELVNPSNPVG 517

BLAST of HG10007490 vs. NCBI nr
Match: KAA0059096.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 856.3 bits (2211), Expect = 1.2e-244
Identity = 419/464 (90.30%), Positives = 442/464 (95.26%), Query Frame = 0

Query: 1   MLCHPSSHNHFTFTYALKACCVLHQTHKGLEIHAHVIKSGHLSDIFIQNSLLHFYILNGD 60
           ML +PSSHNHFTFTYALKACC LHQT KGLEIHAH+IKSGHLSDIFIQNSLLHFYIL+GD
Sbjct: 1   MLHYPSSHNHFTFTYALKACCFLHQTQKGLEIHAHLIKSGHLSDIFIQNSLLHFYILHGD 60

Query: 61  VPSASQIFDSIPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSTTLVTALSACSS 120
           V SAS IFDSIP+PDVVSWTSIISG SKLGFE+EALGKFLSMNVRPNSTTLVTALSACSS
Sbjct: 61  VSSASLIFDSIPNPDVVSWTSIISGFSKLGFEKEALGKFLSMNVRPNSTTLVTALSACSS 120

Query: 121 LRCLKLGKAIHGLRLRSLNEESVSLDNALLDFYVRCGYLRSAKNLFDAMPKRDVVTWTTM 180
           LR LKLGKAIHGLRLR+LNEE+VSL+NALLDFYVRC YLRSA+NLF+ M KRDVV+WTTM
Sbjct: 121 LRRLKLGKAIHGLRLRTLNEENVSLENALLDFYVRCAYLRSAENLFEKMHKRDVVSWTTM 180

Query: 181 IGGYAQRGLCEEAVRVFQNMVHVGEAMPNEATLVNVLSACSSISALHLGQWVHSYINSRH 240
           IGGYAQ GLCEEAVRVFQNMVH GEA+PNEATLVNVLSACSSISALHLGQWVHSYINSRH
Sbjct: 181 IGGYAQSGLCEEAVRVFQNMVHAGEAIPNEATLVNVLSACSSISALHLGQWVHSYINSRH 240

Query: 241 DVIIDGNVGNALINMYVKCGKMEMAILIFKAIEHKDIISWSTIISGLAMNGLGKQAFCLF 300
           DVIIDGNVGNALINMYVKCG MEMAILIFKAIEHKDIISWST+ISGLAMNGLGKQAF LF
Sbjct: 241 DVIIDGNVGNALINMYVKCGNMEMAILIFKAIEHKDIISWSTVISGLAMNGLGKQAFVLF 300

Query: 301 SLMLVHGISPDDITFLGLLSACSHGGLINQGLMVFEAMKNVYNLPPEMRHYACMVDMYGK 360
           SLMLVHGISPDDITFLGLLSACSHGGLINQG+MVFEAMK+VYN+ P++RHYACMVDMYGK
Sbjct: 301 SLMLVHGISPDDITFLGLLSACSHGGLINQGMMVFEAMKDVYNISPQIRHYACMVDMYGK 360

Query: 361 AGLLDEAEAFIKEMPVEAEGSVWGALLHACQIHGNEKKYEKVREWLLGSKGVTVGTFALL 420
           AGLLDEAEAFIKEMP+EAEG VWGALLHACQIHGNEKKYEKVRE LLGSKGVTVG FALL
Sbjct: 361 AGLLDEAEAFIKEMPMEAEGPVWGALLHACQIHGNEKKYEKVRECLLGSKGVTVGAFALL 420

Query: 421 SNTYASCDKWNYANEVRDTMRSRGLKKMAGCSWIELVDPLNPLG 465
           SNTYASCD+WN AN+VR  MRSRGLKKMAGCSWIELV+P NP+G
Sbjct: 421 SNTYASCDRWNDANDVRVAMRSRGLKKMAGCSWIELVNPSNPVG 464

BLAST of HG10007490 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 360.9 bits (925), Expect = 2.1e-98
Identity = 185/483 (38.30%), Positives = 283/483 (58.59%), Query Frame = 0

Query: 9   NHFTFTYALKACCVLHQTHKGLEIHAHVIKSGHLSDIFIQNSLLHFYILNGD-------- 68
           N +TF + LK+C       +G +IH HV+K G   D+++  SL+  Y+ NG         
Sbjct: 133 NSYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVF 192

Query: 69  -----------------------VPSASQIFDSIPDPDVVSWTSIISGLSKLGFEEEALG 128
                                  + +A ++FD IP  DVVSW ++ISG ++ G  +EAL 
Sbjct: 193 DKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALE 252

Query: 129 KFLSM---NVRPNSTTLVTALSACSSLRCLKLGKAIHGLRLRSLNEESVSLDNALLDFYV 188
            F  M   NVRP+ +T+VT +SAC+    ++LG+ +H          ++ + NAL+D Y 
Sbjct: 253 LFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYS 312

Query: 189 RCGYLRSAKNLFDAMPKRDVVTWTTMIGGYAQRGLCEEAVRVFQNMVHVGEAMPNEATLV 248
           +CG L +A  LF+ +P +DV++W T+IGGY    L +EA+ +FQ M+  GE  PN+ T++
Sbjct: 313 KCGELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGET-PNDVTML 372

Query: 249 NVLSACSSISALHLGQWVHSYINSR-HDVIIDGNVGNALINMYVKCGKMEMAILIFKAIE 308
           ++L AC+ + A+ +G+W+H YI+ R   V    ++  +LI+MY KCG +E A  +F +I 
Sbjct: 373 SILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSIL 432

Query: 309 HKDIISWSTIISGLAMNGLGKQAFCLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLM 368
           HK + SW+ +I G AM+G    +F LFS M   GI PDDITF+GLLSACSH G+++ G  
Sbjct: 433 HKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRH 492

Query: 369 VFEAMKNVYNLPPEMRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGSVWGALLHACQIH 428
           +F  M   Y + P++ HY CM+D+ G +GL  EAE  I  M +E +G +W +LL AC++H
Sbjct: 493 IFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMH 552

Query: 429 GNEKKYEKVREWLLGSKGVTVGTFALLSNTYASCDKWNYANEVRDTMRSRGLKKMAGCSW 457
           GN +  E   E L+  +    G++ LLSN YAS  +WN   + R  +  +G+KK+ GCS 
Sbjct: 553 GNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSS 612

BLAST of HG10007490 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 346.7 bits (888), Expect = 4.2e-94
Identity = 187/488 (38.32%), Positives = 277/488 (56.76%), Query Frame = 0

Query: 3   CHPSSHNHFTFTYALKACCVLHQTHKGLEIHAHVIKSGHLSDIFIQNSLLHFYILNGDVP 62
           C+P   N +TF + +KA   +     G  +H   +KS   SD+F+ NSL+H Y   GD+ 
Sbjct: 127 CYP---NKYTFPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLD 186

Query: 63  SASQIFDSIPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVRPNSTTLVTALSACS 122
           SA ++F +I + DVVSW S+I+G  + G  ++AL  F  M   +V+ +  T+V  LSAC+
Sbjct: 187 SACKVFTTIKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACA 246

Query: 123 SLRCLKLGKAIHGLRLRSLNEESVSLDNALLDFYVRCGYLRSAKNLFDAMPKRDVVTWTT 182
            +R L+ G+ +      +    +++L NA+LD Y +CG +  AK LFDAM ++D VTWTT
Sbjct: 247 KIRNLEFGRQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTT 306

Query: 183 MIGGYA-------------------------------QRGLCEEAVRVFQNMVHVGEAMP 242
           M+ GYA                               Q G   EA+ VF  +        
Sbjct: 307 MLDGYAISEDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKL 366

Query: 243 NEATLVNVLSACSSISALHLGQWVHSYINSRHDVIIDGNVGNALINMYVKCGKMEMAILI 302
           N+ TLV+ LSAC+ + AL LG+W+HSYI  +H + ++ +V +ALI+MY KCG +E +  +
Sbjct: 367 NQITLVSTLSACAQVGALELGRWIHSYI-KKHGIRMNFHVTSALIHMYSKCGDLEKSREV 426

Query: 303 FKAIEHKDIISWSTIISGLAMNGLGKQAFCLFSLMLVHGISPDDITFLGLLSACSHGGLI 362
           F ++E +D+  WS +I GLAM+G G +A  +F  M    + P+ +TF  +  ACSH GL+
Sbjct: 427 FNSVEKRDVFVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLV 486

Query: 363 NQGLMVFEAMKNVYNLPPEMRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGSVWGALLH 422
           ++   +F  M++ Y + PE +HYAC+VD+ G++G L++A  FI+ MP+    SVWGALL 
Sbjct: 487 DEAESLFHQMESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLG 546

Query: 423 ACQIHGNEKKYEKVREWLLGSKGVTVGTFALLSNTYASCDKWNYANEVRDTMRSRGLKKM 457
           AC+IH N    E     LL  +    G   LLSN YA   KW   +E+R  MR  GLKK 
Sbjct: 547 ACKIHANLNLAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKE 606

BLAST of HG10007490 vs. ExPASy Swiss-Prot
Match: Q9SX45 (Pentatricopeptide repeat-containing protein At1g50270 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E42 PE=2 SV=1)

HSP 1 Score: 335.1 bits (858), Expect = 1.3e-90
Identity = 174/456 (38.16%), Positives = 276/456 (60.53%), Query Frame = 0

Query: 5   PSSHNHFTFTYALKACCVLHQTHKGLEIHAHVIKSGHLSDIFIQNSLLHFYILNGDVPSA 64
           PS H   TF   LKA   L  ++   + HAH++K G  SD F++NSL+  Y  +G    A
Sbjct: 102 PSRH---TFPPLLKAVFKLRDSNP-FQFHAHIVKFGLDSDPFVRNSLISGYSSSGLFDFA 161

Query: 65  SQIFDSIPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVRPNSTTLVTALSACSSL 124
           S++FD   D DVV+WT++I G  + G   EA+  F+ M    V  N  T+V+ L A   +
Sbjct: 162 SRLFDGAEDKDVVTWTAMIDGFVRNGSASEAMVYFVEMKKTGVAANEMTVVSVLKAAGKV 221

Query: 125 RCLKLGKAIHGLRLRSLNEE-SVSLDNALLDFYVRCGYLRSAKNLFDAMPKRDVVTWTTM 184
             ++ G+++HGL L +   +  V + ++L+D Y +C     A+ +FD MP R+VVTWT +
Sbjct: 222 EDVRFGRSVHGLYLETGRVKCDVFIGSSLVDMYGKCSCYDDAQKVFDEMPSRNVVTWTAL 281

Query: 185 IGGYAQRGLCEEAVRVFQNMVHVGEAMPNEATLVNVLSACSSISALHLGQWVHSYINSRH 244
           I GY Q    ++ + VF+ M+   +  PNE TL +VLSAC+ + ALH G+ VH Y+  ++
Sbjct: 282 IAGYVQSRCFDKGMLVFEEMLK-SDVAPNEKTLSSVLSACAHVGALHRGRRVHCYM-IKN 341

Query: 245 DVIIDGNVGNALINMYVKCGKMEMAILIFKAIEHKDIISWSTIISGLAMNGLGKQAFCLF 304
            + I+   G  LI++YVKCG +E AIL+F+ +  K++ +W+ +I+G A +G  + AF LF
Sbjct: 342 SIEINTTAGTTLIDLYVKCGCLEEAILVFERLHEKNVYTWTAMINGFAAHGYARDAFDLF 401

Query: 305 SLMLVHGISPDDITFLGLLSACSHGGLINQGLMVFEAMKNVYNLPPEMRHYACMVDMYGK 364
             ML   +SP+++TF+ +LSAC+HGGL+ +G  +F +MK  +N+ P+  HYACMVD++G+
Sbjct: 402 YTMLSSHVSPNEVTFMAVLSACAHGGLVEEGRRLFLSMKGRFNMEPKADHYACMVDLFGR 461

Query: 365 AGLLDEAEAFIKEMPVEAEGSVWGALLHACQIHGNEKKYEKVREWLLGSKGVTVGTFALL 424
            GLL+EA+A I+ MP+E    VWGAL  +C +H + +  +     ++  +    G + LL
Sbjct: 462 KGLLEEAKALIERMPMEPTNVVWGALFGSCLLHKDYELGKYAASRVIKLQPSHSGRYTLL 521

Query: 425 SNTYASCDKWNYANEVRDTMRSRGLKKMAGCSWIEL 457
           +N Y+    W+    VR  M+ + + K  G SWIE+
Sbjct: 522 ANLYSESQNWDEVARVRKQMKDQQVVKSPGFSWIEV 551

BLAST of HG10007490 vs. ExPASy Swiss-Prot
Match: Q9SIT7 (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 331.3 bits (848), Expect = 1.8e-89
Identity = 175/488 (35.86%), Positives = 269/488 (55.12%), Query Frame = 0

Query: 9   NHFTFTYALKACCVLHQTHKGLEIHAHVIKSGHLSDIFIQNSLLHFYILNGDVPSASQIF 68
           N ++F   L AC  L+  +KG+++H+ + KS  LSD++I ++L+  Y   G+V  A ++F
Sbjct: 151 NEYSFASVLSACSGLNDMNKGVQVHSLIAKSPFLSDVYIGSALVDMYSKCGNVNDAQRVF 210

Query: 69  DSIPDPDVVSWTSIISGLSKLGFEEEALGKF---LSMNVRPNSTTLVTALSACSSLRCLK 128
           D + D +VVSW S+I+   + G   EAL  F   L   V P+  TL + +SAC+SL  +K
Sbjct: 211 DEMGDRNVVSWNSLITCFEQNGPAVEALDVFQMMLESRVEPDEVTLASVISACASLSAIK 270

Query: 129 LGKAIHGLRLRSLN-EESVSLDNALLDFYVRCGYLRSAKNLFDAMP-------------- 188
           +G+ +HG  +++      + L NA +D Y +C  ++ A+ +FD+MP              
Sbjct: 271 VGQEVHGRVVKNDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIRNVIAETSMISGY 330

Query: 189 -----------------KRDVVTWTTMIGGYAQRGLCEEAVRVFQNMVHVGEAMPNEATL 248
                            +R+VV+W  +I GY Q G  EEA+ +F  ++      P   + 
Sbjct: 331 AMAASTKAARLMFTKMAERNVVSWNALIAGYTQNGENEEALSLF-CLLKRESVCPTHYSF 390

Query: 249 VNVLSACSSISALHLGQWVHSYINSRHDVIIDGN-----VGNALINMYVKCGKMEMAILI 308
            N+L AC+ ++ LHLG   H ++         G      VGN+LI+MYVKCG +E   L+
Sbjct: 391 ANILKACADLAELHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLIDMYVKCGCVEEGYLV 450

Query: 309 FKAIEHKDIISWSTIISGLAMNGLGKQAFCLFSLMLVHGISPDDITFLGLLSACSHGGLI 368
           F+ +  +D +SW+ +I G A NG G +A  LF  ML  G  PD IT +G+LSAC H G +
Sbjct: 451 FRKMMERDCVSWNAMIIGFAQNGYGNEALELFREMLESGEKPDHITMIGVLSACGHAGFV 510

Query: 369 NQGLMVFEAMKNVYNLPPEMRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGSVWGALLH 428
            +G   F +M   + + P   HY CMVD+ G+AG L+EA++ I+EMP++ +  +WG+LL 
Sbjct: 511 EEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIEEMPMQPDSVIWGSLLA 570

Query: 429 ACQIHGNEKKYEKVREWLLGSKGVTVGTFALLSNTYASCDKWNYANEVRDTMRSRGLKKM 457
           AC++H N    + V E LL  +    G + LLSN YA   KW     VR +MR  G+ K 
Sbjct: 571 ACKVHRNITLGKYVAEKLLEVEPSNSGPYVLLSNMYAELGKWEDVMNVRKSMRKEGVTKQ 630

BLAST of HG10007490 vs. ExPASy Swiss-Prot
Match: Q9SZK1 (Pentatricopeptide repeat-containing protein At4g38010 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E45 PE=3 SV=1)

HSP 1 Score: 330.1 bits (845), Expect = 4.0e-89
Identity = 174/451 (38.58%), Positives = 265/451 (58.76%), Query Frame = 0

Query: 7   SHNHFTFTYALKACCVLHQTHKGLEIHAHVIKSGHLSDIFIQNSLLHFYILNGDVPSASQ 66
           S + FTF    KAC       +G +IH  V K G   DI++QNSL+HFY + G+  +A +
Sbjct: 103 SPDMFTFPPVFKACGKFSGIREGKQIHGIVTKMGFYDDIYVQNSLVHFYGVCGESRNACK 162

Query: 67  IFDSIPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSTTLVTALSACSSLRCLKL 126
           +F  +P  DVVSWT II+G ++ G  +EAL  F  M+V PN  T V  L +   + CL L
Sbjct: 163 VFGEMPVRDVVSWTGIITGFTRTGLYKEALDTFSKMDVEPNLATYVCVLVSSGRVGCLSL 222

Query: 127 GKAIHGLRLRSLNEESVSLDNALLDFYVRCGYLRSAKNLFDAMPKRDVVTWTTMIGGYAQ 186
           GK IHGL L+  +  S+   NAL+D YV+C  L  A  +F  + K+D V+W +MI G   
Sbjct: 223 GKGIHGLILKRASLISLETGNALIDMYVKCEQLSDAMRVFGELEKKDKVSWNSMISGLVH 282

Query: 187 RGLCEEAVRVFQNMVHVGEAMPNEATLVNVLSACSSISALHLGQWVHSYINSRHDVIIDG 246
               +EA+ +F  M       P+   L +VLSAC+S+ A+  G+WVH YI +   +  D 
Sbjct: 283 CERSKEAIDLFSLMQTSSGIKPDGHILTSVLSACASLGAVDHGRWVHEYILTA-GIKWDT 342

Query: 247 NVGNALINMYVKCGKMEMAILIFKAIEHKDIISWSTIISGLAMNGLGKQAFCLFSLMLVH 306
           ++G A+++MY KCG +E A+ IF  I  K++ +W+ ++ GLA++G G ++   F  M+  
Sbjct: 343 HIGTAIVDMYAKCGYIETALEIFNGIRSKNVFTWNALLGGLAIHGHGLESLRYFEEMVKL 402

Query: 307 GISPDDITFLGLLSACSHGGLINQGLMVFEAMKN-VYNLPPEMRHYACMVDMYGKAGLLD 366
           G  P+ +TFL  L+AC H GL+++G   F  MK+  YNL P++ HY CM+D+  +AGLLD
Sbjct: 403 GFKPNLVTFLAALNACCHTGLVDEGRRYFHKMKSREYNLFPKLEHYGCMIDLLCRAGLLD 462

Query: 367 EAEAFIKEMPVEAEGSVWGALLHACQIHGNEKKYEK-VREWLLGSKGVTVGTFALLSNTY 426
           EA   +K MPV+ +  + GA+L AC+  G   +  K + +  L  +    G + LLSN +
Sbjct: 463 EALELVKAMPVKPDVRICGAILSACKNRGTLMELPKEILDSFLDIEFEDSGVYVLLSNIF 522

Query: 427 ASCDKWNYANEVRDTMRSRGLKKMAGCSWIE 456
           A+  +W+    +R  M+ +G+ K+ G S+IE
Sbjct: 523 AANRRWDDVARIRRLMKVKGISKVPGSSYIE 552

BLAST of HG10007490 vs. ExPASy TrEMBL
Match: A0A5D3D5L6 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold204G00410 PE=4 SV=1)

HSP 1 Score: 861.7 bits (2225), Expect = 1.4e-246
Identity = 421/463 (90.93%), Positives = 443/463 (95.68%), Query Frame = 0

Query: 1   MLCHPSSHNHFTFTYALKACCVLHQTHKGLEIHAHVIKSGHLSDIFIQNSLLHFYILNGD 60
           ML +PSSHNHFTFTYALKACC LHQT KGLEIHAH+IKSGHLSDIFIQNSLLHFYILNGD
Sbjct: 159 MLHYPSSHNHFTFTYALKACCFLHQTQKGLEIHAHLIKSGHLSDIFIQNSLLHFYILNGD 218

Query: 61  VPSASQIFDSIPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSTTLVTALSACSS 120
           V SAS IFDSIP+PDVVSWTSIISGLSKLGFE+EALGKFLSMNVRPNSTTLVTALSACSS
Sbjct: 219 VSSASLIFDSIPNPDVVSWTSIISGLSKLGFEKEALGKFLSMNVRPNSTTLVTALSACSS 278

Query: 121 LRCLKLGKAIHGLRLRSLNEESVSLDNALLDFYVRCGYLRSAKNLFDAMPKRDVVTWTTM 180
           LRCLKLGKAIHGLRLR+LNEE+VSL+NALLDFYVRC YLRSA+NLF+ M KRDVV+WTTM
Sbjct: 279 LRCLKLGKAIHGLRLRTLNEENVSLENALLDFYVRCAYLRSAENLFEKMHKRDVVSWTTM 338

Query: 181 IGGYAQRGLCEEAVRVFQNMVHVGEAMPNEATLVNVLSACSSISALHLGQWVHSYINSRH 240
           IGGYAQ GLCEEAVRVFQNMVHVGEA+PNEATLVNVLSACSSISALHLGQWVHSYINSRH
Sbjct: 339 IGGYAQSGLCEEAVRVFQNMVHVGEAIPNEATLVNVLSACSSISALHLGQWVHSYINSRH 398

Query: 241 DVIIDGNVGNALINMYVKCGKMEMAILIFKAIEHKDIISWSTIISGLAMNGLGKQAFCLF 300
           DVIIDGNVGNALINMYVKCG MEMAILIF AIEHKDIISWST+ISGLAMNGLGKQAF LF
Sbjct: 399 DVIIDGNVGNALINMYVKCGNMEMAILIFNAIEHKDIISWSTVISGLAMNGLGKQAFVLF 458

Query: 301 SLMLVHGISPDDITFLGLLSACSHGGLINQGLMVFEAMKNVYNLPPEMRHYACMVDMYGK 360
           SLMLVHGISPDDITFLGLLSACSHGGLINQG+MVFEAMK+VYN+ P++RHYACMVDMYGK
Sbjct: 459 SLMLVHGISPDDITFLGLLSACSHGGLINQGMMVFEAMKDVYNISPQIRHYACMVDMYGK 518

Query: 361 AGLLDEAEAFIKEMPVEAEGSVWGALLHACQIHGNEKKYEKVREWLLGSKGVTVGTFALL 420
           AGLLDEAEAFIKEMP+EAEG VWGALLHACQIHGNEKKYEKVRE LLGSKGVTVG FALL
Sbjct: 519 AGLLDEAEAFIKEMPMEAEGPVWGALLHACQIHGNEKKYEKVRECLLGSKGVTVGAFALL 578

Query: 421 SNTYASCDKWNYANEVRDTMRSRGLKKMAGCSWIELVDPLNPL 464
           SNTYASCD+WN AN+VR  MRSRGLKKMAGCSWIELV+P NP+
Sbjct: 579 SNTYASCDRWNDANDVRVAMRSRGLKKMAGCSWIELVNPSNPV 621

BLAST of HG10007490 vs. ExPASy TrEMBL
Match: A0A0A0LXJ1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G613550 PE=4 SV=1)

HSP 1 Score: 859.0 bits (2218), Expect = 9.2e-246
Identity = 418/463 (90.28%), Positives = 441/463 (95.25%), Query Frame = 0

Query: 1   MLCHPSSHNHFTFTYALKACCVLHQTHKGLEIHAHVIKSGHLSDIFIQNSLLHFYILNGD 60
           ML +PSSHNHFTFTYALKACC LHQT KGLEIHAH+IKSGHLSDIFIQNSLLHFYIL+GD
Sbjct: 54  MLRYPSSHNHFTFTYALKACCFLHQTQKGLEIHAHLIKSGHLSDIFIQNSLLHFYILDGD 113

Query: 61  VPSASQIFDSIPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSTTLVTALSACSS 120
           V SAS IFDSIPDPDVVSWTSIISGLSKLGFE+EAL KFLSMNVRPNSTTLVTALSACSS
Sbjct: 114 VSSASLIFDSIPDPDVVSWTSIISGLSKLGFEKEALSKFLSMNVRPNSTTLVTALSACSS 173

Query: 121 LRCLKLGKAIHGLRLRSLNEESVSLDNALLDFYVRCGYLRSAKNLFDAMPKRDVVTWTTM 180
           LRCLKLGKAIHGLR+R+LNEE+V L+NALLDFYVRC YLRSA+NLF+ MPKRDVV+WTTM
Sbjct: 174 LRCLKLGKAIHGLRMRTLNEENVILENALLDFYVRCAYLRSAENLFEKMPKRDVVSWTTM 233

Query: 181 IGGYAQRGLCEEAVRVFQNMVHVGEAMPNEATLVNVLSACSSISALHLGQWVHSYINSRH 240
           IGGYAQ GLCEEAVRVFQNMVHVGEA+PNEATLVNVLSACSSISALHLGQWVHSYINSRH
Sbjct: 234 IGGYAQSGLCEEAVRVFQNMVHVGEAIPNEATLVNVLSACSSISALHLGQWVHSYINSRH 293

Query: 241 DVIIDGNVGNALINMYVKCGKMEMAILIFKAIEHKDIISWSTIISGLAMNGLGKQAFCLF 300
           DVIIDGNVGNALINMYVKCG MEMAILIFKAIEHKDI+SWSTIISGLAMNGLGKQAF LF
Sbjct: 294 DVIIDGNVGNALINMYVKCGNMEMAILIFKAIEHKDIVSWSTIISGLAMNGLGKQAFVLF 353

Query: 301 SLMLVHGISPDDITFLGLLSACSHGGLINQGLMVFEAMKNVYNLPPEMRHYACMVDMYGK 360
           SLMLVHG+SPDDITFLGLLSACSHGGLINQG+MVFEAMK+VYN+ P+MRHYACMVDMYGK
Sbjct: 354 SLMLVHGVSPDDITFLGLLSACSHGGLINQGMMVFEAMKDVYNISPQMRHYACMVDMYGK 413

Query: 361 AGLLDEAEAFIKEMPVEAEGSVWGALLHACQIHGNEKKYEKVREWLLGSKGVTVGTFALL 420
           AGLLDEAEAFIKEMP+EAEG VWGALLHACQ+HGNEKKYEKVREWLLGSKGVTVGTFALL
Sbjct: 414 AGLLDEAEAFIKEMPMEAEGPVWGALLHACQLHGNEKKYEKVREWLLGSKGVTVGTFALL 473

Query: 421 SNTYASCDKWNYANEVRDTMRSRGLKKMAGCSWIELVDPLNPL 464
           SNTYA CD+WN AN+VR  MRSRGLKKMAG SWIE+VD   PL
Sbjct: 474 SNTYACCDRWNDANDVRVAMRSRGLKKMAGRSWIEMVDSTYPL 516

BLAST of HG10007490 vs. ExPASy TrEMBL
Match: A0A1S4DY27 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC103492035 PE=4 SV=1)

HSP 1 Score: 856.3 bits (2211), Expect = 6.0e-245
Identity = 419/464 (90.30%), Positives = 442/464 (95.26%), Query Frame = 0

Query: 1   MLCHPSSHNHFTFTYALKACCVLHQTHKGLEIHAHVIKSGHLSDIFIQNSLLHFYILNGD 60
           ML +PSSHNHFTFTYALKACC LHQT KGLEIHAH+IKSGHLSDIFIQNSLLHFYIL+GD
Sbjct: 54  MLHYPSSHNHFTFTYALKACCFLHQTQKGLEIHAHLIKSGHLSDIFIQNSLLHFYILHGD 113

Query: 61  VPSASQIFDSIPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSTTLVTALSACSS 120
           V SAS IFDSIP+PDVVSWTSIISG SKLGFE+EALGKFLSMNVRPNSTTLVTALSACSS
Sbjct: 114 VSSASLIFDSIPNPDVVSWTSIISGFSKLGFEKEALGKFLSMNVRPNSTTLVTALSACSS 173

Query: 121 LRCLKLGKAIHGLRLRSLNEESVSLDNALLDFYVRCGYLRSAKNLFDAMPKRDVVTWTTM 180
           LR LKLGKAIHGLRLR+LNEE+VSL+NALLDFYVRC YLRSA+NLF+ M KRDVV+WTTM
Sbjct: 174 LRRLKLGKAIHGLRLRTLNEENVSLENALLDFYVRCAYLRSAENLFEKMHKRDVVSWTTM 233

Query: 181 IGGYAQRGLCEEAVRVFQNMVHVGEAMPNEATLVNVLSACSSISALHLGQWVHSYINSRH 240
           IGGYAQ GLCEEAVRVFQNMVH GEA+PNEATLVNVLSACSSISALHLGQWVHSYINSRH
Sbjct: 234 IGGYAQSGLCEEAVRVFQNMVHAGEAIPNEATLVNVLSACSSISALHLGQWVHSYINSRH 293

Query: 241 DVIIDGNVGNALINMYVKCGKMEMAILIFKAIEHKDIISWSTIISGLAMNGLGKQAFCLF 300
           DVIIDGNVGNALINMYVKCG MEMAILIFKAIEHKDIISWST+ISGLAMNGLGKQAF LF
Sbjct: 294 DVIIDGNVGNALINMYVKCGNMEMAILIFKAIEHKDIISWSTVISGLAMNGLGKQAFVLF 353

Query: 301 SLMLVHGISPDDITFLGLLSACSHGGLINQGLMVFEAMKNVYNLPPEMRHYACMVDMYGK 360
           SLMLVHGISPDDITFLGLLSACSHGGLINQG+MVFEAMK+VYN+ P++RHYACMVDMYGK
Sbjct: 354 SLMLVHGISPDDITFLGLLSACSHGGLINQGMMVFEAMKDVYNISPQIRHYACMVDMYGK 413

Query: 361 AGLLDEAEAFIKEMPVEAEGSVWGALLHACQIHGNEKKYEKVREWLLGSKGVTVGTFALL 420
           AGLLDEAEAFIKEMP+EAEG VWGALLHACQIHGNEKKYEKVRE LLGSKGVTVG FALL
Sbjct: 414 AGLLDEAEAFIKEMPMEAEGPVWGALLHACQIHGNEKKYEKVRECLLGSKGVTVGAFALL 473

Query: 421 SNTYASCDKWNYANEVRDTMRSRGLKKMAGCSWIELVDPLNPLG 465
           SNTYASCD+WN AN+VR  MRSRGLKKMAGCSWIELV+P NP+G
Sbjct: 474 SNTYASCDRWNDANDVRVAMRSRGLKKMAGCSWIELVNPSNPVG 517

BLAST of HG10007490 vs. ExPASy TrEMBL
Match: A0A5A7UY28 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold601G00080 PE=4 SV=1)

HSP 1 Score: 856.3 bits (2211), Expect = 6.0e-245
Identity = 419/464 (90.30%), Positives = 442/464 (95.26%), Query Frame = 0

Query: 1   MLCHPSSHNHFTFTYALKACCVLHQTHKGLEIHAHVIKSGHLSDIFIQNSLLHFYILNGD 60
           ML +PSSHNHFTFTYALKACC LHQT KGLEIHAH+IKSGHLSDIFIQNSLLHFYIL+GD
Sbjct: 1   MLHYPSSHNHFTFTYALKACCFLHQTQKGLEIHAHLIKSGHLSDIFIQNSLLHFYILHGD 60

Query: 61  VPSASQIFDSIPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSTTLVTALSACSS 120
           V SAS IFDSIP+PDVVSWTSIISG SKLGFE+EALGKFLSMNVRPNSTTLVTALSACSS
Sbjct: 61  VSSASLIFDSIPNPDVVSWTSIISGFSKLGFEKEALGKFLSMNVRPNSTTLVTALSACSS 120

Query: 121 LRCLKLGKAIHGLRLRSLNEESVSLDNALLDFYVRCGYLRSAKNLFDAMPKRDVVTWTTM 180
           LR LKLGKAIHGLRLR+LNEE+VSL+NALLDFYVRC YLRSA+NLF+ M KRDVV+WTTM
Sbjct: 121 LRRLKLGKAIHGLRLRTLNEENVSLENALLDFYVRCAYLRSAENLFEKMHKRDVVSWTTM 180

Query: 181 IGGYAQRGLCEEAVRVFQNMVHVGEAMPNEATLVNVLSACSSISALHLGQWVHSYINSRH 240
           IGGYAQ GLCEEAVRVFQNMVH GEA+PNEATLVNVLSACSSISALHLGQWVHSYINSRH
Sbjct: 181 IGGYAQSGLCEEAVRVFQNMVHAGEAIPNEATLVNVLSACSSISALHLGQWVHSYINSRH 240

Query: 241 DVIIDGNVGNALINMYVKCGKMEMAILIFKAIEHKDIISWSTIISGLAMNGLGKQAFCLF 300
           DVIIDGNVGNALINMYVKCG MEMAILIFKAIEHKDIISWST+ISGLAMNGLGKQAF LF
Sbjct: 241 DVIIDGNVGNALINMYVKCGNMEMAILIFKAIEHKDIISWSTVISGLAMNGLGKQAFVLF 300

Query: 301 SLMLVHGISPDDITFLGLLSACSHGGLINQGLMVFEAMKNVYNLPPEMRHYACMVDMYGK 360
           SLMLVHGISPDDITFLGLLSACSHGGLINQG+MVFEAMK+VYN+ P++RHYACMVDMYGK
Sbjct: 301 SLMLVHGISPDDITFLGLLSACSHGGLINQGMMVFEAMKDVYNISPQIRHYACMVDMYGK 360

Query: 361 AGLLDEAEAFIKEMPVEAEGSVWGALLHACQIHGNEKKYEKVREWLLGSKGVTVGTFALL 420
           AGLLDEAEAFIKEMP+EAEG VWGALLHACQIHGNEKKYEKVRE LLGSKGVTVG FALL
Sbjct: 361 AGLLDEAEAFIKEMPMEAEGPVWGALLHACQIHGNEKKYEKVRECLLGSKGVTVGAFALL 420

Query: 421 SNTYASCDKWNYANEVRDTMRSRGLKKMAGCSWIELVDPLNPLG 465
           SNTYASCD+WN AN+VR  MRSRGLKKMAGCSWIELV+P NP+G
Sbjct: 421 SNTYASCDRWNDANDVRVAMRSRGLKKMAGCSWIELVNPSNPVG 464

BLAST of HG10007490 vs. ExPASy TrEMBL
Match: A0A6J1H7Z9 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111461372 PE=4 SV=1)

HSP 1 Score: 836.3 bits (2159), Expect = 6.4e-239
Identity = 403/464 (86.85%), Positives = 436/464 (93.97%), Query Frame = 0

Query: 1   MLCHPSSHNHFTFTYALKACCVLHQTHKGLEIHAHVIKSGHLSDIFIQNSLLHFYILNGD 60
           ML HPSSHNH+TFTYALKAC +LH+THKGLEIHA +IKSGHLSDIFIQNSLLHFYI++GD
Sbjct: 54  MLRHPSSHNHYTFTYALKACFLLHETHKGLEIHARLIKSGHLSDIFIQNSLLHFYIVDGD 113

Query: 61  VPSASQIFDSIPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSTTLVTALSACSS 120
           VPSAS++FDSIPDPDVVSWTSIISGLSKLGF+EEALGKFLSMNV PNS TLV+ALSACSS
Sbjct: 114 VPSASRVFDSIPDPDVVSWTSIISGLSKLGFKEEALGKFLSMNVSPNSATLVSALSACSS 173

Query: 121 LRCLKLGKAIHGLRLRSLNEESVSLDNALLDFYVRCGYLRSAKNLFDAMPKRDVVTWTTM 180
           LRC+K+GKAIHGL+LRSLNEESV+LDNALLDFYVRCG LR A+NLFD MP+RDVV+WTT+
Sbjct: 174 LRCVKIGKAIHGLKLRSLNEESVNLDNALLDFYVRCGSLRGAQNLFDEMPQRDVVSWTTL 233

Query: 181 IGGYAQRGLCEEAVRVFQNMVHVGEAMPNEATLVNVLSACSSISALHLGQWVHSYINSRH 240
           IGGYA  GLCEEAVRVFQNMVH  EA+PNEATL+NVLSACSS+SALHLGQWVHSYINSRH
Sbjct: 234 IGGYALTGLCEEAVRVFQNMVHAREAIPNEATLINVLSACSSMSALHLGQWVHSYINSRH 293

Query: 241 DVIIDGNVGNALINMYVKCGKMEMAILIFKAIEHKDIISWSTIISGLAMNGLGKQAFCLF 300
           DVIIDGN+GNALINMYVKCG M+ AI IFK +EHKDIISWSTIISGLAMNG GKQAF LF
Sbjct: 294 DVIIDGNIGNALINMYVKCGSMDKAISIFKTVEHKDIISWSTIISGLAMNGQGKQAFGLF 353

Query: 301 SLMLVHGISPDDITFLGLLSACSHGGLINQGLMVFEAMKNVYNLPPEMRHYACMVDMYGK 360
           SLMLVHGI+PD ITFL LLSACSHGGLINQGLMVFEAMK+VYN+ PEMRHYACMVDMYGK
Sbjct: 354 SLMLVHGITPDAITFLSLLSACSHGGLINQGLMVFEAMKDVYNVAPEMRHYACMVDMYGK 413

Query: 361 AGLLDEAEAFIKEMPVEAEGSVWGALLHACQIHGNEKKYEKVREWLLGSKGVTVGTFALL 420
           AGLLDEAEAFIKEMPVEAEG VWGALLHACQ+HGNE +YEKVR+WLL SK +TVGT+ALL
Sbjct: 414 AGLLDEAEAFIKEMPVEAEGPVWGALLHACQLHGNE-RYEKVRQWLLSSKSITVGTYALL 473

Query: 421 SNTYASCDKWNYANEVRDTMRSRGLKKMAGCSWIELVDPLNPLG 465
           SNTYASCD+WN ANEVRD MRSRGLKKMAGCSWIEL DPLNPLG
Sbjct: 474 SNTYASCDRWNDANEVRDAMRSRGLKKMAGCSWIELADPLNPLG 516

BLAST of HG10007490 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 360.9 bits (925), Expect = 1.5e-99
Identity = 185/483 (38.30%), Positives = 283/483 (58.59%), Query Frame = 0

Query: 9   NHFTFTYALKACCVLHQTHKGLEIHAHVIKSGHLSDIFIQNSLLHFYILNGD-------- 68
           N +TF + LK+C       +G +IH HV+K G   D+++  SL+  Y+ NG         
Sbjct: 133 NSYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVF 192

Query: 69  -----------------------VPSASQIFDSIPDPDVVSWTSIISGLSKLGFEEEALG 128
                                  + +A ++FD IP  DVVSW ++ISG ++ G  +EAL 
Sbjct: 193 DKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALE 252

Query: 129 KFLSM---NVRPNSTTLVTALSACSSLRCLKLGKAIHGLRLRSLNEESVSLDNALLDFYV 188
            F  M   NVRP+ +T+VT +SAC+    ++LG+ +H          ++ + NAL+D Y 
Sbjct: 253 LFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYS 312

Query: 189 RCGYLRSAKNLFDAMPKRDVVTWTTMIGGYAQRGLCEEAVRVFQNMVHVGEAMPNEATLV 248
           +CG L +A  LF+ +P +DV++W T+IGGY    L +EA+ +FQ M+  GE  PN+ T++
Sbjct: 313 KCGELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGET-PNDVTML 372

Query: 249 NVLSACSSISALHLGQWVHSYINSR-HDVIIDGNVGNALINMYVKCGKMEMAILIFKAIE 308
           ++L AC+ + A+ +G+W+H YI+ R   V    ++  +LI+MY KCG +E A  +F +I 
Sbjct: 373 SILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSIL 432

Query: 309 HKDIISWSTIISGLAMNGLGKQAFCLFSLMLVHGISPDDITFLGLLSACSHGGLINQGLM 368
           HK + SW+ +I G AM+G    +F LFS M   GI PDDITF+GLLSACSH G+++ G  
Sbjct: 433 HKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRH 492

Query: 369 VFEAMKNVYNLPPEMRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGSVWGALLHACQIH 428
           +F  M   Y + P++ HY CM+D+ G +GL  EAE  I  M +E +G +W +LL AC++H
Sbjct: 493 IFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMH 552

Query: 429 GNEKKYEKVREWLLGSKGVTVGTFALLSNTYASCDKWNYANEVRDTMRSRGLKKMAGCSW 457
           GN +  E   E L+  +    G++ LLSN YAS  +WN   + R  +  +G+KK+ GCS 
Sbjct: 553 GNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSS 612

BLAST of HG10007490 vs. TAIR 10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 346.7 bits (888), Expect = 3.0e-95
Identity = 187/488 (38.32%), Positives = 277/488 (56.76%), Query Frame = 0

Query: 3   CHPSSHNHFTFTYALKACCVLHQTHKGLEIHAHVIKSGHLSDIFIQNSLLHFYILNGDVP 62
           C+P   N +TF + +KA   +     G  +H   +KS   SD+F+ NSL+H Y   GD+ 
Sbjct: 127 CYP---NKYTFPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLD 186

Query: 63  SASQIFDSIPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVRPNSTTLVTALSACS 122
           SA ++F +I + DVVSW S+I+G  + G  ++AL  F  M   +V+ +  T+V  LSAC+
Sbjct: 187 SACKVFTTIKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACA 246

Query: 123 SLRCLKLGKAIHGLRLRSLNEESVSLDNALLDFYVRCGYLRSAKNLFDAMPKRDVVTWTT 182
            +R L+ G+ +      +    +++L NA+LD Y +CG +  AK LFDAM ++D VTWTT
Sbjct: 247 KIRNLEFGRQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTT 306

Query: 183 MIGGYA-------------------------------QRGLCEEAVRVFQNMVHVGEAMP 242
           M+ GYA                               Q G   EA+ VF  +        
Sbjct: 307 MLDGYAISEDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKL 366

Query: 243 NEATLVNVLSACSSISALHLGQWVHSYINSRHDVIIDGNVGNALINMYVKCGKMEMAILI 302
           N+ TLV+ LSAC+ + AL LG+W+HSYI  +H + ++ +V +ALI+MY KCG +E +  +
Sbjct: 367 NQITLVSTLSACAQVGALELGRWIHSYI-KKHGIRMNFHVTSALIHMYSKCGDLEKSREV 426

Query: 303 FKAIEHKDIISWSTIISGLAMNGLGKQAFCLFSLMLVHGISPDDITFLGLLSACSHGGLI 362
           F ++E +D+  WS +I GLAM+G G +A  +F  M    + P+ +TF  +  ACSH GL+
Sbjct: 427 FNSVEKRDVFVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLV 486

Query: 363 NQGLMVFEAMKNVYNLPPEMRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGSVWGALLH 422
           ++   +F  M++ Y + PE +HYAC+VD+ G++G L++A  FI+ MP+    SVWGALL 
Sbjct: 487 DEAESLFHQMESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLG 546

Query: 423 ACQIHGNEKKYEKVREWLLGSKGVTVGTFALLSNTYASCDKWNYANEVRDTMRSRGLKKM 457
           AC+IH N    E     LL  +    G   LLSN YA   KW   +E+R  MR  GLKK 
Sbjct: 547 ACKIHANLNLAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKE 606

BLAST of HG10007490 vs. TAIR 10
Match: AT1G50270.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 335.1 bits (858), Expect = 8.9e-92
Identity = 174/456 (38.16%), Positives = 276/456 (60.53%), Query Frame = 0

Query: 5   PSSHNHFTFTYALKACCVLHQTHKGLEIHAHVIKSGHLSDIFIQNSLLHFYILNGDVPSA 64
           PS H   TF   LKA   L  ++   + HAH++K G  SD F++NSL+  Y  +G    A
Sbjct: 102 PSRH---TFPPLLKAVFKLRDSNP-FQFHAHIVKFGLDSDPFVRNSLISGYSSSGLFDFA 161

Query: 65  SQIFDSIPDPDVVSWTSIISGLSKLGFEEEALGKFLSM---NVRPNSTTLVTALSACSSL 124
           S++FD   D DVV+WT++I G  + G   EA+  F+ M    V  N  T+V+ L A   +
Sbjct: 162 SRLFDGAEDKDVVTWTAMIDGFVRNGSASEAMVYFVEMKKTGVAANEMTVVSVLKAAGKV 221

Query: 125 RCLKLGKAIHGLRLRSLNEE-SVSLDNALLDFYVRCGYLRSAKNLFDAMPKRDVVTWTTM 184
             ++ G+++HGL L +   +  V + ++L+D Y +C     A+ +FD MP R+VVTWT +
Sbjct: 222 EDVRFGRSVHGLYLETGRVKCDVFIGSSLVDMYGKCSCYDDAQKVFDEMPSRNVVTWTAL 281

Query: 185 IGGYAQRGLCEEAVRVFQNMVHVGEAMPNEATLVNVLSACSSISALHLGQWVHSYINSRH 244
           I GY Q    ++ + VF+ M+   +  PNE TL +VLSAC+ + ALH G+ VH Y+  ++
Sbjct: 282 IAGYVQSRCFDKGMLVFEEMLK-SDVAPNEKTLSSVLSACAHVGALHRGRRVHCYM-IKN 341

Query: 245 DVIIDGNVGNALINMYVKCGKMEMAILIFKAIEHKDIISWSTIISGLAMNGLGKQAFCLF 304
            + I+   G  LI++YVKCG +E AIL+F+ +  K++ +W+ +I+G A +G  + AF LF
Sbjct: 342 SIEINTTAGTTLIDLYVKCGCLEEAILVFERLHEKNVYTWTAMINGFAAHGYARDAFDLF 401

Query: 305 SLMLVHGISPDDITFLGLLSACSHGGLINQGLMVFEAMKNVYNLPPEMRHYACMVDMYGK 364
             ML   +SP+++TF+ +LSAC+HGGL+ +G  +F +MK  +N+ P+  HYACMVD++G+
Sbjct: 402 YTMLSSHVSPNEVTFMAVLSACAHGGLVEEGRRLFLSMKGRFNMEPKADHYACMVDLFGR 461

Query: 365 AGLLDEAEAFIKEMPVEAEGSVWGALLHACQIHGNEKKYEKVREWLLGSKGVTVGTFALL 424
            GLL+EA+A I+ MP+E    VWGAL  +C +H + +  +     ++  +    G + LL
Sbjct: 462 KGLLEEAKALIERMPMEPTNVVWGALFGSCLLHKDYELGKYAASRVIKLQPSHSGRYTLL 521

Query: 425 SNTYASCDKWNYANEVRDTMRSRGLKKMAGCSWIEL 457
           +N Y+    W+    VR  M+ + + K  G SWIE+
Sbjct: 522 ANLYSESQNWDEVARVRKQMKDQQVVKSPGFSWIEV 551

BLAST of HG10007490 vs. TAIR 10
Match: AT2G13600.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 331.3 bits (848), Expect = 1.3e-90
Identity = 175/488 (35.86%), Positives = 269/488 (55.12%), Query Frame = 0

Query: 9   NHFTFTYALKACCVLHQTHKGLEIHAHVIKSGHLSDIFIQNSLLHFYILNGDVPSASQIF 68
           N ++F   L AC  L+  +KG+++H+ + KS  LSD++I ++L+  Y   G+V  A ++F
Sbjct: 151 NEYSFASVLSACSGLNDMNKGVQVHSLIAKSPFLSDVYIGSALVDMYSKCGNVNDAQRVF 210

Query: 69  DSIPDPDVVSWTSIISGLSKLGFEEEALGKF---LSMNVRPNSTTLVTALSACSSLRCLK 128
           D + D +VVSW S+I+   + G   EAL  F   L   V P+  TL + +SAC+SL  +K
Sbjct: 211 DEMGDRNVVSWNSLITCFEQNGPAVEALDVFQMMLESRVEPDEVTLASVISACASLSAIK 270

Query: 129 LGKAIHGLRLRSLN-EESVSLDNALLDFYVRCGYLRSAKNLFDAMP-------------- 188
           +G+ +HG  +++      + L NA +D Y +C  ++ A+ +FD+MP              
Sbjct: 271 VGQEVHGRVVKNDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIRNVIAETSMISGY 330

Query: 189 -----------------KRDVVTWTTMIGGYAQRGLCEEAVRVFQNMVHVGEAMPNEATL 248
                            +R+VV+W  +I GY Q G  EEA+ +F  ++      P   + 
Sbjct: 331 AMAASTKAARLMFTKMAERNVVSWNALIAGYTQNGENEEALSLF-CLLKRESVCPTHYSF 390

Query: 249 VNVLSACSSISALHLGQWVHSYINSRHDVIIDGN-----VGNALINMYVKCGKMEMAILI 308
            N+L AC+ ++ LHLG   H ++         G      VGN+LI+MYVKCG +E   L+
Sbjct: 391 ANILKACADLAELHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLIDMYVKCGCVEEGYLV 450

Query: 309 FKAIEHKDIISWSTIISGLAMNGLGKQAFCLFSLMLVHGISPDDITFLGLLSACSHGGLI 368
           F+ +  +D +SW+ +I G A NG G +A  LF  ML  G  PD IT +G+LSAC H G +
Sbjct: 451 FRKMMERDCVSWNAMIIGFAQNGYGNEALELFREMLESGEKPDHITMIGVLSACGHAGFV 510

Query: 369 NQGLMVFEAMKNVYNLPPEMRHYACMVDMYGKAGLLDEAEAFIKEMPVEAEGSVWGALLH 428
            +G   F +M   + + P   HY CMVD+ G+AG L+EA++ I+EMP++ +  +WG+LL 
Sbjct: 511 EEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIEEMPMQPDSVIWGSLLA 570

Query: 429 ACQIHGNEKKYEKVREWLLGSKGVTVGTFALLSNTYASCDKWNYANEVRDTMRSRGLKKM 457
           AC++H N    + V E LL  +    G + LLSN YA   KW     VR +MR  G+ K 
Sbjct: 571 ACKVHRNITLGKYVAEKLLEVEPSNSGPYVLLSNMYAELGKWEDVMNVRKSMRKEGVTKQ 630

BLAST of HG10007490 vs. TAIR 10
Match: AT4G38010.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 330.1 bits (845), Expect = 2.9e-90
Identity = 174/451 (38.58%), Positives = 265/451 (58.76%), Query Frame = 0

Query: 7   SHNHFTFTYALKACCVLHQTHKGLEIHAHVIKSGHLSDIFIQNSLLHFYILNGDVPSASQ 66
           S + FTF    KAC       +G +IH  V K G   DI++QNSL+HFY + G+  +A +
Sbjct: 103 SPDMFTFPPVFKACGKFSGIREGKQIHGIVTKMGFYDDIYVQNSLVHFYGVCGESRNACK 162

Query: 67  IFDSIPDPDVVSWTSIISGLSKLGFEEEALGKFLSMNVRPNSTTLVTALSACSSLRCLKL 126
           +F  +P  DVVSWT II+G ++ G  +EAL  F  M+V PN  T V  L +   + CL L
Sbjct: 163 VFGEMPVRDVVSWTGIITGFTRTGLYKEALDTFSKMDVEPNLATYVCVLVSSGRVGCLSL 222

Query: 127 GKAIHGLRLRSLNEESVSLDNALLDFYVRCGYLRSAKNLFDAMPKRDVVTWTTMIGGYAQ 186
           GK IHGL L+  +  S+   NAL+D YV+C  L  A  +F  + K+D V+W +MI G   
Sbjct: 223 GKGIHGLILKRASLISLETGNALIDMYVKCEQLSDAMRVFGELEKKDKVSWNSMISGLVH 282

Query: 187 RGLCEEAVRVFQNMVHVGEAMPNEATLVNVLSACSSISALHLGQWVHSYINSRHDVIIDG 246
               +EA+ +F  M       P+   L +VLSAC+S+ A+  G+WVH YI +   +  D 
Sbjct: 283 CERSKEAIDLFSLMQTSSGIKPDGHILTSVLSACASLGAVDHGRWVHEYILTA-GIKWDT 342

Query: 247 NVGNALINMYVKCGKMEMAILIFKAIEHKDIISWSTIISGLAMNGLGKQAFCLFSLMLVH 306
           ++G A+++MY KCG +E A+ IF  I  K++ +W+ ++ GLA++G G ++   F  M+  
Sbjct: 343 HIGTAIVDMYAKCGYIETALEIFNGIRSKNVFTWNALLGGLAIHGHGLESLRYFEEMVKL 402

Query: 307 GISPDDITFLGLLSACSHGGLINQGLMVFEAMKN-VYNLPPEMRHYACMVDMYGKAGLLD 366
           G  P+ +TFL  L+AC H GL+++G   F  MK+  YNL P++ HY CM+D+  +AGLLD
Sbjct: 403 GFKPNLVTFLAALNACCHTGLVDEGRRYFHKMKSREYNLFPKLEHYGCMIDLLCRAGLLD 462

Query: 367 EAEAFIKEMPVEAEGSVWGALLHACQIHGNEKKYEK-VREWLLGSKGVTVGTFALLSNTY 426
           EA   +K MPV+ +  + GA+L AC+  G   +  K + +  L  +    G + LLSN +
Sbjct: 463 EALELVKAMPVKPDVRICGAILSACKNRGTLMELPKEILDSFLDIEFEDSGVYVLLSNIF 522

Query: 427 ASCDKWNYANEVRDTMRSRGLKKMAGCSWIE 456
           A+  +W+    +R  M+ +G+ K+ G S+IE
Sbjct: 523 AANRRWDDVARIRRLMKVKGISKVPGSSYIE 552
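
Each HSP header above reports identity and positives as a count over the alignment length, followed by a percentage. The following is a minimal sketch, not part of the database output, of how such a header line could be parsed and the percentages recomputed as a sanity check. The names `parse_hsp_stats` and `percent` are hypothetical helpers introduced only for this illustration, and the regular expression assumes the exact line layout shown on this page.

```python
import re

# Assumed layout: "Identity = a/b (x%), Positives = c/d (y%), ..."
# "Posi?tives" also accepts the misspelling "Postives" found in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return (count, alignment length, reported percent) for identity and positives."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }


def percent(count, length):
    """Percentage of aligned columns, rounded to two decimals as on this page."""
    return round(100.0 * count / length, 2)


if __name__ == "__main__":
    line = "Identity = 419/464 (90.30%), Positives = 442/464 (95.26%), Query Frame = 0"
    stats = parse_hsp_stats(line)
    for key, (count, length, reported) in stats.items():
        # Compare the recomputed value with the percentage printed in the header.
        print(key, count, length, reported, percent(count, length))
```

Note that the denominator is the alignment length including gap columns, which is why the Arabidopsis hits report lengths such as 483 or 488 even though the query protein is shorter.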

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038878297.1  9.4e-253  92.46     pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Benin...
TYK18848.1      2.9e-246  90.93     pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]
XP_011660133.1  1.9e-245  90.28     pentatricopeptide repeat-containing protein At1g08070, chloroplastic isoform X1 ...
XP_008450427.1  1.2e-244  90.30     PREDICTED: pentatricopeptide repeat-containing protein At1g08070, chloroplastic-...
KAA0059096.1    1.2e-244  90.30     pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]

Match Name      E-value   Identity  Description
Q9LN01          2.1e-98   38.30     Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
O82380          4.2e-94   38.32     Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...
Q9SX45          1.3e-90   38.16     Pentatricopeptide repeat-containing protein At1g50270 OS=Arabidopsis thaliana OX...
Q9SIT7          1.8e-89   35.86     Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX...
Q9SZK1          4.0e-89   38.58     Pentatricopeptide repeat-containing protein At4g38010 OS=Arabidopsis thaliana OX...

Match Name      E-value   Identity  Description
A0A5D3D5L6      1.4e-246  90.93     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A0A0LXJ1      9.2e-246  90.28     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G613550 PE=4 SV=1
A0A1S4DY27      6.0e-245  90.30     pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isofor...
A0A5A7UY28      6.0e-245  90.30     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A6J1H7Z9      6.4e-239  86.85     pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isofor...

Match Name      E-value   Identity  Description
AT1G08070.1     1.5e-99   38.30     Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G29760.1     3.0e-95   38.32     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G50270.1     8.9e-92   38.16     Pentatricopeptide repeat (PPR) superfamily protein
AT2G13600.1     1.3e-90   35.86     Pentatricopeptide repeat (PPR) superfamily protein
AT4G38010.1     2.9e-90   38.58     Pentatricopeptide repeat (PPR-like) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description  Source  Source Term  Source Description  Alignment
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
           coord: 172..220  e-value: 8.0E-9  score: 35.6
           coord: 275..323  e-value: 4.8E-9  score: 36.3
IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
           coord: 351..374  e-value: 6.2E-4  score: 17.8
           coord: 278..311  e-value: 3.8E-6  score: 24.7
           coord: 175..204  e-value: 3.2E-6  score: 25.0
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
           coord: 351..375  e-value: 5.8E-4  score: 19.9
           coord: 77..102   e-value: 0.0012  score: 19.0
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
           coord: 276..310  score: 10.248873
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
           coord: 173..207  score: 10.994242
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
           coord: 292..458  e-value: 2.0E-28  score: 101.7
           coord: 141..238  e-value: 1.3E-17  score: 66.2
           coord: 1..140    e-value: 7.3E-20  score: 73.6
None       No IPR available  PANTHER  PTHR47928:SF138  OS03G0216400 PROTEIN
           coord: 1..455
None       No IPR available  PANTHER  PTHR47928  REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED
           coord: 1..455
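
Several sources above report pentatricopeptide repeat (IPR002885) hits at overlapping coordinates. As a rough illustration, the sketch below merges the listed coordinate ranges into non-overlapping regions. This is an assumption of this example and not a step performed by InterPro or by the database; `merge_intervals` is a hypothetical helper, and the coordinates are copied from the PFAM, TIGRFAM and PROSITE rows of the table.

```python
def merge_intervals(intervals):
    """Merge overlapping inclusive (start, end) ranges into sorted, disjoint ranges."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend its end if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# IPR002885 hit coordinates copied from the InterPro table on this page.
ppr_hits = [
    (172, 220), (275, 323),              # PFAM PF13041 (PPR_2)
    (351, 374), (278, 311), (175, 204),  # TIGRFAM TIGR00756
    (351, 375), (77, 102),               # PFAM PF01535 (PPR)
    (276, 310), (173, 207),              # PROSITE PS51375
]

if __name__ == "__main__":
    for start, end in merge_intervals(ppr_hits):
        print("%d..%d" % (start, end))
```

Under this simple overlap rule the nine hits collapse into four regions, which gives a quick view of where the individual signatures agree. Merging says nothing about the true number of repeats, since a single signature may span more than one motif.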

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10007490.1  HG10007490.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:1900865      chloroplast RNA modification
cellular_component  GO:0009507      chloroplast
molecular_function  GO:0005515      protein binding