HG10007566 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10007566
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationChr10: 7350194 .. 7353936 (+)
RNA-Seq ExpressionHG10007566
SyntenyHG10007566
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTTCGTCTTGCTCCGAATCTTTTTCGCAGAATCTCAAACAGGGCCAATTCCACAACCCATTTTCATCGCTTTTCCCTTCCCACATTCCTCAACCACGATGTGTTACAGCAACAATTTCTGTTACGCTTCATCACTGGCTCTGCTTCCAGCCCTAGCCTCTCAATATGGAGGAGGAAGAAGGAGATGGGCAAGGAGGGTCTGATTGTGGTCAAAGAGCTCAAGAGGCTTCAGTCCAATTTCATTCGCCTCGACCGCTTCATTTCCACCCATGTCTCTCGCTTGCTCAAGTCCGACCTTGTCGCTGTTCTCGTCGAGCTTCAGAGGCAGAATCAGATCTTTCTGTGCATGAAGGTCTGCCCTCTTTTGCCCTCTCCATTATTATATCTCTTCCCCTTTGTGACATATGTTTATTTTTCTTGGGAGATAATGCTGACGGTTAATTCTAAAGCTCTGTTAAGCGAAATGAGAATTTAATTAGTCTCTTGCTATGATATGTTGGTTAATTCGGAGTTGTTATAGTGTGGATGAATTACGTATGTGGGAATAACATAGGTATGGTTTTCGTCGATTTTCATCAACCTTTGTGTTTTAAGTCGAATTTCCTGTTCAAGTATTTGATGATTGCATTTTGAAGTGGTCGAAATTTATGGTGAACTCTGGAAGAGTAGTTGGAGAAAATAGATAGCTTACTATGGCTTGAAAACCTTCATAGCTCAATTGAAATTAGGAGCACTTGCTTGTATTAGCCTTTAGCAACCACATGGTCTTGTGAAGTGGATAGAAGTCCCATAAAGTGAGATCAGGGCTGAAAGATTTTCTTGAATAGGCTGGGTCTATTGGGGATATAATATAAAAGAACGGCTTAAAAAAAGTAGGCTGAATCGTCAAATATCAACATGTTAGTTGAGTTGAGACTGTGGGTGTGAATGGGAATTTTTGTTGGTGCTATTGTTTCATGTTATCTGTACTAATAGGGGAAGAGTAGTGATGCAAAGATCATCAAGCCTCAATGGAATGAGGTTATCTTGGGATAAGCCTTTAAGACAGCTTAGAATATGTCTCAGAGGTCGGATGAATTAGGGGCGGGCTGTCTTAAACAACCAAACACTGCCCTCTTGTTAAAATGGCGTTGGCTGTTCTCCCTTTAATAATATAGTCCGGGGAGGAGAGTTATTGTTGCTATATATGGAATATAGTCAAATGGTTGGCTTATTAAACTTGCTAAAAGGGTTGCCAATGGAAGGCCTTGGATAGACCTTGACAAAAGTAGGTAGTTGTTCCTTGATCTAGTGAATTTTAAAGTGTGCAGTGGCACAAAAGTCTGGTTTTGAGAAGACACATGGACAACATATGTCCCTCTCCACGTGAAATACCATATTTTTTATGCCTTATCTTCCAAAAAAGAGGCTTCAATCTCCCTGCTAATGCCTCTATCCTCCTCGGCATGGAAGGTAAAAATCTCCAAGAAGGTGAAGTTCTTTGTGTGGTAGGTTTTGCATGGATGAGTTAACACCTTAGCTCGGATCTAGACGTGAAAGTCCTCCTTAGTTGGGCCTCACTGCTTTATTCTTTACAAAAGAGGCTGCTGTATTTTATATGCCTTATCTTCCAAAAAAGAGGCTTCAATCTCCCTGCTAATGCCTCTATCCTCCTCGGCATGGAAGATAAAAATTTCCAAAAAGGTGACATTCTTTGTCTGGTAGGTTTTGCATGGATGAGTTAACACCTTAGCTCGGATCTCGGCCAAAGAGTCCTTGGTTGGGCCTCACTGCTGTATTCTTTACAAGCAGGCGGCTGAGGACCTTGATTATATCCTTTGGAGCCGCTCGGTCTGTTTGGTTTGGTTTGATTTATTCGAGGCTTTCAGCTGTAGCTTTGTTAGCTTCCAGGGTTATAGGGAGTCCATCGAGGAATGATTCTTCCATCAGCCTTTTTGTGAGAAAGGATTGTTTTTATGGTAGGCTGGGTGTGCGCTATTTTGTGGGGTGAGAACAATAGAATCTTTGGGGGGGGGGGGGGGGGGGGGGGGGGGGAATTCCAACAATGATATTTGGTCTTTTCTGTTTCTCTCTGGGCCTTGATGTTGAGGTCTTTTTCTAACTTCTCGTCAATACTTAATTTACTTGATTGGAGCCTGTTGTTGTTATGAAAGTCTGGTACTCCATAAAAGAATAATTACTTTAACTACATTAACCCATAACAAACGAGGTGGCGGACAAAGGCGGGCCAATGAGTAATTGGAGAGAATTGCTCTGGGAGTCATTAGAGAAAACCCAAAGAATTTGAAGCATTCCAAATTTCCAAATAACTGCTGCCAGCAAGATAAAGAGCATTGAAGGAGTCTTTCTGAAGCCCATTGACACAAAGAACAAGAAACATCTCTTTCGTTTTAAACGGATGTGGAGATTTTCAAACAAAATATCAGCATGAGAGAGGAAGATTATTGCTAGCTTATATGGTTCAGTTCAGATATAGGAGATTGTTGGACAAGCTCCTCTAAATCTAAATCTTCAAAACTCAAGATCATCAATGACCAAACTGCATAGGGAACTTCACACTTTTTCAGTTTATCAGATTGCAAATTTGCTAACTATTGGATACCATTGGGTCAGGAGTACTACTTCTGGTAGATAAAAAATTTAAAGTATGCTAGCATTTTTCCACCGTACATTAGTATAGGGCTCGTTTGTTTTCCTGGAGGCGAGGTGCATATAAAATGTGTCCATGAGTGATACATCTATCATAAACAGAAAGTATAGCATTATATTATCAATAGGGGATGTAGTCCACTATGACATAAGATGCATTGCATAACTAGGCGTGTATGTCTCCTTTAGTGACAATAAAGTAATTTAACAAATAACAATCTTCTTATGTAACTTTAAGTAAAATAAACAAAATAATCTACTTATGCTTAATCTTGTGTTTCGTAGTGGGCTATAGATTTATCTACATTACTATATTGCACTTTTATGTTCAAGAGATATTGAATGACAAGTTTGGAAATATTATATGTCGGAAACGATTTGTAGACAGTTTTGTTTGTTTGTCTGGTTAGCTCTTTGAAGGGCCTATTCTCTTTCCAATTATACTTCTTTTAAAATGATGCAAGCCTAACCTTATATTTCACTCTATACTTGTCTGGATAATTTAATCTATTGTCTTAGTCTCATGGAGAAAAATATTCTCCAATTCCTGCAGTTTTATTATTTAACAAAGCTACCATTTGTATTTATATTTTAAAAAAAATTCCTTTTAAGTTGATAGAGACCTTTGTTTTTATCCCATTGATTGCATTATTGACACCTGCCTGACCCTTTCAGTTGTATAACGTGGTTCGTAAAGAAGTATGGTACCGGCCCGACATGTTCTTTTATAGAGATATGCTTATGATGCTTGCAAAGAACAAAAGGGTGGAAGAAACAAAACAAGTTTGGCAGGATCTGAAGAAAGAGGGAGTATTATTTGATCAGCATACTTTTGGAGACATTATTCGGGCATACTCAGATAATGCAATGCCCTCTGAGGCCATGGATATATACCGTGAAATGAGAGAATCTCCTGATAGGCCATTATCTTTGCCTTTTCGTGTAATTTTGAAGGGACTTATTCCATACCCAGAATTGAGAGAACAAGTTAAAGATGACTTCTTGGAACTCTTCCCTGGTATGATCGTCTATGACCCACCAGAAGACTTGTTTGAAGAAGATGAAAATAGGAGGAAGAGTGAAGATGATTAA

mRNA sequence

ATGCTTCGTCTTGCTCCGAATCTTTTTCGCAGAATCTCAAACAGGGCCAATTCCACAACCCATTTTCATCGCTTTTCCCTTCCCACATTCCTCAACCACGATGTGTTACAGCAACAATTTCTGTTACGCTTCATCACTGGCTCTGCTTCCAGCCCTAGCCTCTCAATATGGAGGAGGAAGAAGGAGATGGGCAAGGAGGGTCTGATTGTGGTCAAAGAGCTCAAGAGGCTTCAGTCCAATTTCATTCGCCTCGACCGCTTCATTTCCACCCATGTCTCTCGCTTGCTCAAGTCCGACCTTGTCGCTGTTCTCGTCGAGCTTCAGAGGCAGAATCAGATCTTTCTGTGCATGAAGTTGTATAACGTGGTTCGTAAAGAAGTATGGTACCGGCCCGACATGTTCTTTTATAGAGATATGCTTATGATGCTTGCAAAGAACAAAAGGGTGGAAGAAACAAAACAAGTTTGGCAGGATCTGAAGAAAGAGGGAGTATTATTTGATCAGCATACTTTTGGAGACATTATTCGGGCATACTCAGATAATGCAATGCCCTCTGAGGCCATGGATATATACCGTGAAATGAGAGAATCTCCTGATAGGCCATTATCTTTGCCTTTTCGTGTAATTTTGAAGGGACTTATTCCATACCCAGAATTGAGAGAACAAGTTAAAGATGACTTCTTGGAACTCTTCCCTGGTATGATCGTCTATGACCCACCAGAAGACTTGTTTGAAGAAGATGAAAATAGGAGGAAGAGTGAAGATGATTAA

Coding sequence (CDS)

ATGCTTCGTCTTGCTCCGAATCTTTTTCGCAGAATCTCAAACAGGGCCAATTCCACAACCCATTTTCATCGCTTTTCCCTTCCCACATTCCTCAACCACGATGTGTTACAGCAACAATTTCTGTTACGCTTCATCACTGGCTCTGCTTCCAGCCCTAGCCTCTCAATATGGAGGAGGAAGAAGGAGATGGGCAAGGAGGGTCTGATTGTGGTCAAAGAGCTCAAGAGGCTTCAGTCCAATTTCATTCGCCTCGACCGCTTCATTTCCACCCATGTCTCTCGCTTGCTCAAGTCCGACCTTGTCGCTGTTCTCGTCGAGCTTCAGAGGCAGAATCAGATCTTTCTGTGCATGAAGTTGTATAACGTGGTTCGTAAAGAAGTATGGTACCGGCCCGACATGTTCTTTTATAGAGATATGCTTATGATGCTTGCAAAGAACAAAAGGGTGGAAGAAACAAAACAAGTTTGGCAGGATCTGAAGAAAGAGGGAGTATTATTTGATCAGCATACTTTTGGAGACATTATTCGGGCATACTCAGATAATGCAATGCCCTCTGAGGCCATGGATATATACCGTGAAATGAGAGAATCTCCTGATAGGCCATTATCTTTGCCTTTTCGTGTAATTTTGAAGGGACTTATTCCATACCCAGAATTGAGAGAACAAGTTAAAGATGACTTCTTGGAACTCTTCCCTGGTATGATCGTCTATGACCCACCAGAAGACTTGTTTGAAGAAGATGAAAATAGGAGGAAGAGTGAAGATGATTAA

Protein sequence

MLRLAPNLFRRISNRANSTTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDLVAVLVELQRQNQIFLCMKLYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPPEDLFEEDENRRKSEDD
Homology
BLAST of HG10007566 vs. NCBI nr
Match: XP_004139064.1 (pentatricopeptide repeat-containing protein At1g62350 isoform X1 [Cucumis sativus] >KAE8653533.1 hypothetical protein Csa_007495 [Cucumis sativus])

HSP 1 Score: 458.8 bits (1179), Expect = 3.2e-125
Identity = 234/256 (91.41%), Postives = 242/256 (94.53%), Query Frame = 0

Query: 1   MLRLAPNLFRRISNRANSTTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRK 60
           MLRLAPNL R+IS+   S+T FHRFSL TFLN D+LQQQ LLRFITGSASSPSLSIWRRK
Sbjct: 1   MLRLAPNLLRKISSSPISSTPFHRFSLSTFLNLDLLQQQLLLRFITGSASSPSLSIWRRK 60

Query: 61  KEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDLVAVLVELQRQNQIFLCMKLY 120
           KEMGKEGLIVVKELKRLQSNFIRLDRFIS+HVSRLLKSDLVAVLVELQRQN +FLCMKLY
Sbjct: 61  KEMGKEGLIVVKELKRLQSNFIRLDRFISSHVSRLLKSDLVAVLVELQRQNHVFLCMKLY 120

Query: 121 NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSD 180
           NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVW+DLKKEGVLFDQHTFGDIIRAY D
Sbjct: 121 NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLKKEGVLFDQHTFGDIIRAYLD 180

Query: 181 NAMPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPP 240
           N M SEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFP MIVYDPP
Sbjct: 181 NTMLSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPDMIVYDPP 240

Query: 241 EDLFEEDENRRKSEDD 257
           EDLFEEDE+R KSEDD
Sbjct: 241 EDLFEEDEDRNKSEDD 256

BLAST of HG10007566 vs. NCBI nr
Match: XP_008450338.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g62350 [Cucumis melo])

HSP 1 Score: 458.4 bits (1178), Expect = 4.1e-125
Identity = 234/256 (91.41%), Postives = 242/256 (94.53%), Query Frame = 0

Query: 1   MLRLAPNLFRRISNRANSTTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRK 60
           MLRLAPNL R+IS+   S+T FHRFSL TFLN D+LQQQ LLRFI GSASSPSLSIWRRK
Sbjct: 1   MLRLAPNLLRKISSSPVSSTPFHRFSLSTFLNLDLLQQQLLLRFIAGSASSPSLSIWRRK 60

Query: 61  KEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDLVAVLVELQRQNQIFLCMKLY 120
           KEMGKEGLIVVKELKRLQSNFIRLDRFIS+HVSRLLKSDLVAVLVELQRQN +FLCMKLY
Sbjct: 61  KEMGKEGLIVVKELKRLQSNFIRLDRFISSHVSRLLKSDLVAVLVELQRQNHVFLCMKLY 120

Query: 121 NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSD 180
           NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVW+DLKKEGVLFDQHTFGDIIRAY D
Sbjct: 121 NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLKKEGVLFDQHTFGDIIRAYLD 180

Query: 181 NAMPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPP 240
           NAM SEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFP MIVYDPP
Sbjct: 181 NAMLSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPDMIVYDPP 240

Query: 241 EDLFEEDENRRKSEDD 257
           EDLFEEDE+R KSEDD
Sbjct: 241 EDLFEEDEDRNKSEDD 256

BLAST of HG10007566 vs. NCBI nr
Match: XP_038878451.1 (pentatricopeptide repeat-containing protein At1g62350 [Benincasa hispida])

HSP 1 Score: 458.0 bits (1177), Expect = 5.4e-125
Identity = 231/247 (93.52%), Postives = 236/247 (95.55%), Query Frame = 0

Query: 1   MLRLAPNLFRRISNRANSTTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRK 60
           MLRL  NL RRISN A STT FHRFSLPTFLNHD+ QQQ LLRFITGSASSPSLSIWRRK
Sbjct: 1   MLRLTLNLLRRISNSATSTTPFHRFSLPTFLNHDLFQQQLLLRFITGSASSPSLSIWRRK 60

Query: 61  KEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDLVAVLVELQRQNQIFLCMKLY 120
           KEMGKEGLIVVKELKRLQSNFIRLDRFIS+HVSRLLKSDLVAVL ELQRQNQ+FLCMKLY
Sbjct: 61  KEMGKEGLIVVKELKRLQSNFIRLDRFISSHVSRLLKSDLVAVLFELQRQNQVFLCMKLY 120

Query: 121 NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSD 180
           NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAY D
Sbjct: 121 NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYLD 180

Query: 181 NAMPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPP 240
           NAMPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFP MIVYDPP
Sbjct: 181 NAMPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPDMIVYDPP 240

Query: 241 EDLFEED 248
           E+LFEE+
Sbjct: 241 EELFEEE 247

BLAST of HG10007566 vs. NCBI nr
Match: XP_023531705.1 (pentatricopeptide repeat-containing protein At1g62350 [Cucurbita pepo subsp. pepo] >XP_023531706.1 pentatricopeptide repeat-containing protein At1g62350 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 441.8 bits (1135), Expect = 4.0e-120
Identity = 222/253 (87.75%), Postives = 233/253 (92.09%), Query Frame = 0

Query: 1   MLRLAPNLFRRISNRANSTTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRK 60
           MLR APNL RR SNRA ST H +RF+  TF  H   QQQ LLRFITGSASSPSLS+WRRK
Sbjct: 1   MLRFAPNLLRRFSNRATSTIHLYRFTHCTFFEHHRPQQQLLLRFITGSASSPSLSVWRRK 60

Query: 61  KEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDLVAVLVELQRQNQIFLCMKLY 120
           KEMGKEGLIVVKELKR+QSN IRLDRFIS+HVSRLLKSDLVAVLVELQRQNQ+FLCMKLY
Sbjct: 61  KEMGKEGLIVVKELKRIQSNLIRLDRFISSHVSRLLKSDLVAVLVELQRQNQVFLCMKLY 120

Query: 121 NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSD 180
           NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVW+DLK EGVLFDQHTFGDI+RAY D
Sbjct: 121 NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLKGEGVLFDQHTFGDIMRAYLD 180

Query: 181 NAMPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPP 240
           NAMPSEAMDIYREMR+SPDRPLSLPFRVILKGL+PYPELREQVKDDFLELFP MIVYDPP
Sbjct: 181 NAMPSEAMDIYREMRQSPDRPLSLPFRVILKGLVPYPELREQVKDDFLELFPDMIVYDPP 240

Query: 241 EDLFEEDENRRKS 254
           EDLFEEDE+  KS
Sbjct: 241 EDLFEEDEDMCKS 253

BLAST of HG10007566 vs. NCBI nr
Match: XP_022966084.1 (pentatricopeptide repeat-containing protein At1g62350 [Cucurbita maxima] >XP_022966085.1 pentatricopeptide repeat-containing protein At1g62350 [Cucurbita maxima])

HSP 1 Score: 441.0 bits (1133), Expect = 6.8e-120
Identity = 223/253 (88.14%), Postives = 233/253 (92.09%), Query Frame = 0

Query: 1   MLRLAPNLFRRISNRANSTTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRK 60
           MLR APNL RRISN A ST H +RF+  TF  H   QQQ LLRFITGSASSPSLSIWRRK
Sbjct: 1   MLRFAPNLLRRISNSATSTIHLYRFTHCTFFEHHRPQQQLLLRFITGSASSPSLSIWRRK 60

Query: 61  KEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDLVAVLVELQRQNQIFLCMKLY 120
           KEMGKEGLIVVKELKRLQSN IRLDRFIS+HVSRLLKSDLVAVLVELQRQNQ+FLCMKLY
Sbjct: 61  KEMGKEGLIVVKELKRLQSNLIRLDRFISSHVSRLLKSDLVAVLVELQRQNQVFLCMKLY 120

Query: 121 NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSD 180
           NVVRKEVWYRPDMFFYRDML MLAKNKRVEETKQVW+DLK EGVLFDQHTFGDI+RAY D
Sbjct: 121 NVVRKEVWYRPDMFFYRDMLTMLAKNKRVEETKQVWEDLKGEGVLFDQHTFGDIMRAYLD 180

Query: 181 NAMPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPP 240
           NAMPSEAMDIYREMR+SPDRPLSLPFRVILKGL+PYPELREQV+DDFLELFP MIVYDPP
Sbjct: 181 NAMPSEAMDIYREMRQSPDRPLSLPFRVILKGLVPYPELREQVRDDFLELFPDMIVYDPP 240

Query: 241 EDLFEEDENRRKS 254
           EDLFEEDE+R KS
Sbjct: 241 EDLFEEDEDRFKS 253

BLAST of HG10007566 vs. ExPASy Swiss-Prot
Match: Q1PFH7 (Pentatricopeptide repeat-containing protein At1g62350 OS=Arabidopsis thaliana OX=3702 GN=At1g62350 PE=2 SV=1)

HSP 1 Score: 311.2 bits (796), Expect = 1.1e-83
Identity = 147/194 (75.77%), Postives = 172/194 (88.66%), Query Frame = 0

Query: 63  MGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDLVAVLVELQRQNQIFLCMKLYNV 122
           M KEGLI  KELKRLQ+  +RLDRFI +HVSRLLKSDLV+VL E QRQNQ+FLCMKLY V
Sbjct: 1   MSKEGLIAAKELKRLQTQSVRLDRFIGSHVSRLLKSDLVSVLAEFQRQNQVFLCMKLYEV 60

Query: 123 VRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNA 182
           VR+E+WYRPDMFFYRDMLMMLA+NK+V+ETK+VW+DLKKE VLFDQHTFGD++R + DN 
Sbjct: 61  VRREIWYRPDMFFYRDMLMMLARNKKVDETKKVWEDLKKEEVLFDQHTFGDLVRGFLDNE 120

Query: 183 MPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPPED 242
           +P EAM +Y EMRESPDRPLSLPFRVILKGL+PYPELRE+VKDDFLELFPGMIVYDPPED
Sbjct: 121 LPLEAMRLYGEMRESPDRPLSLPFRVILKGLVPYPELREKVKDDFLELFPGMIVYDPPED 180

Query: 243 LFEEDENRRKSEDD 257
           + E+ +   +++ D
Sbjct: 181 ICEDSDEEARTDSD 194

BLAST of HG10007566 vs. ExPASy Swiss-Prot
Match: Q9STF9 (Protein THYLAKOID ASSEMBLY 8-like, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=THA8L PE=1 SV=1)

HSP 1 Score: 205.7 bits (522), Expect = 6.3e-52
Identity = 97/205 (47.32%), Postives = 146/205 (71.22%), Query Frame = 0

Query: 40  FLLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSD 99
           F+ RF  G    P   +WR KK +GKE L V+  LKRL+ +  +LD+FI THV RLLK D
Sbjct: 53  FVSRFHDGRPRGP---LWRGKKLIGKEALFVILGLKRLKEDDEKLDKFIKTHVFRLLKLD 112

Query: 100 LVAVLVELQRQNQIFLCMKLYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDL 159
           ++AV+ EL+RQ +  L +K++ V++K+ WY+PD+F Y+D+++ LAK+KR++E   +W+ +
Sbjct: 113 MLAVIGELERQEETALAIKMFEVIQKQEWYQPDVFMYKDLIVSLAKSKRMDEAMALWEKM 172

Query: 160 KKEGVLFDQHTFGDIIRAYSDNAMPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPEL 219
           KKE +  D  T+ ++IR +  +  P++AM++Y +M +SPD P  LPFRV+LKGL+P+P L
Sbjct: 173 KKENLFPDSQTYTEVIRGFLRDGCPADAMNVYEDMLKSPDPPEELPFRVLLKGLLPHPLL 232

Query: 220 REQVKDDFLELFPGMIVYDPPEDLF 245
           R +VK DF ELFP    YDPPE++F
Sbjct: 233 RNKVKKDFEELFPEKHAYDPPEEIF 254

BLAST of HG10007566 vs. ExPASy Swiss-Prot
Match: Q9FKC3 (Pentatricopeptide repeat-containing protein At5g48730, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At5g48730 PE=2 SV=2)

HSP 1 Score: 48.5 bits (114), Expect = 1.3e-04
Identity = 44/230 (19.13%), Postives = 105/230 (45.65%), Query Frame = 0

Query: 18  STTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRK---------KEMGKEGL 77
           ST+  H   LPT  N    ++ F +R I+ S   P+ +I   K         +    + L
Sbjct: 5   STSTSHAPPLPT--NRRTAERTFTVRCISISPREPNYAITSDKSNNTSLSLRETRQSKWL 64

Query: 78  IVVKELKRLQSNFIRLDRFIS----THVSRLLKSDLVAVLVELQRQNQIFL--------- 137
           I  +++    S  I+ D+         +S +L+ +    ++E ++ ++  L         
Sbjct: 65  INAEDVNERDSKEIKEDKNTKIASRKAISIILRREATKSIIEKKKGSKKLLPRTVLESLH 124

Query: 138 ----------CMKLYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVL 197
                      ++++ ++R+++WY+P++  Y  +++ML K K+ E+  +++Q++  EG +
Sbjct: 125 ERITALRWESAIQVFELLREQLWYKPNVGIYVKLIVMLGKCKQPEKAHELFQEMINEGCV 184

Query: 198 FDQHTFGDIIRAYSDNAMPSEAMDIYREMRESPD-RPLSLPFRVILKGLI 215
            +   +  ++ AYS +     A  +   M+ S + +P    + +++K  +
Sbjct: 185 VNHEVYTALVSAYSRSGRFDAAFTLLERMKSSHNCQPDVHTYSILIKSFL 232

BLAST of HG10007566 vs. ExPASy Swiss-Prot
Match: Q9LVW6 (Protein THYLAKOID ASSEMBLY 8, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=THA8 PE=2 SV=1)

HSP 1 Score: 48.1 bits (113), Expect = 1.7e-04
Identity = 38/136 (27.94%), Postives = 65/136 (47.79%), Query Frame = 0

Query: 63  MGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDLVAVLVELQRQNQIFLCMKLYNV 122
           +  E +  ++ LKR     + L   +   + RL+KSDL++VL EL RQ+   L + + + 
Sbjct: 48  LSTEAIQSIQSLKRAHRTGVSLSLTLRP-LRRLIKSDLISVLRELLRQDYCTLAVHVLST 107

Query: 123 VRKEVWYRP-DMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDN 182
           +R E  Y P D+  Y D++  L +NK  +E  ++  ++       D      +IRA    
Sbjct: 108 LRTE--YPPLDLVLYADIVNALTRNKEFDEIDRLIGEIDGIDQRSDDKALAKLIRAVVGA 167

Query: 183 AMPSEAMDIYREMRES 198
                 + +Y  MRES
Sbjct: 168 ERRESVVRVYTLMRES 180

BLAST of HG10007566 vs. ExPASy Swiss-Prot
Match: Q5G1S8 (Pentatricopeptide repeat-containing protein At3g18110, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=EMB1270 PE=2 SV=2)

HSP 1 Score: 45.8 bits (107), Expect = 8.4e-04
Identity = 23/84 (27.38%), Postives = 43/84 (51.19%), Query Frame = 0

Query: 131 PDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPSEAMDI 190
           PD   Y  +L   A+ +  E+ K+V+Q ++K G   D+ T+  II  Y        A+ +
Sbjct: 365 PDAVTYNSLLYAFARERNTEKVKEVYQQMQKMGFGKDEMTYNTIIHMYGKQGQLDLALQL 424

Query: 191 YREMRESPDR-PLSLPFRVILKGL 214
           Y++M+    R P ++ + V++  L
Sbjct: 425 YKDMKGLSGRNPDAITYTVLIDSL 448

BLAST of HG10007566 vs. ExPASy TrEMBL
Match: A0A1S3BQ11 (pentatricopeptide repeat-containing protein At1g62350 OS=Cucumis melo OX=3656 GN=LOC103491974 PE=4 SV=1)

HSP 1 Score: 458.4 bits (1178), Expect = 2.0e-125
Identity = 234/256 (91.41%), Postives = 242/256 (94.53%), Query Frame = 0

Query: 1   MLRLAPNLFRRISNRANSTTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRK 60
           MLRLAPNL R+IS+   S+T FHRFSL TFLN D+LQQQ LLRFI GSASSPSLSIWRRK
Sbjct: 1   MLRLAPNLLRKISSSPVSSTPFHRFSLSTFLNLDLLQQQLLLRFIAGSASSPSLSIWRRK 60

Query: 61  KEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDLVAVLVELQRQNQIFLCMKLY 120
           KEMGKEGLIVVKELKRLQSNFIRLDRFIS+HVSRLLKSDLVAVLVELQRQN +FLCMKLY
Sbjct: 61  KEMGKEGLIVVKELKRLQSNFIRLDRFISSHVSRLLKSDLVAVLVELQRQNHVFLCMKLY 120

Query: 121 NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSD 180
           NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVW+DLKKEGVLFDQHTFGDIIRAY D
Sbjct: 121 NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLKKEGVLFDQHTFGDIIRAYLD 180

Query: 181 NAMPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPP 240
           NAM SEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFP MIVYDPP
Sbjct: 181 NAMLSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPDMIVYDPP 240

Query: 241 EDLFEEDENRRKSEDD 257
           EDLFEEDE+R KSEDD
Sbjct: 241 EDLFEEDEDRNKSEDD 256

BLAST of HG10007566 vs. ExPASy TrEMBL
Match: A0A6J1HQL7 (pentatricopeptide repeat-containing protein At1g62350 OS=Cucurbita maxima OX=3661 GN=LOC111465833 PE=4 SV=1)

HSP 1 Score: 441.0 bits (1133), Expect = 3.3e-120
Identity = 223/253 (88.14%), Postives = 233/253 (92.09%), Query Frame = 0

Query: 1   MLRLAPNLFRRISNRANSTTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRK 60
           MLR APNL RRISN A ST H +RF+  TF  H   QQQ LLRFITGSASSPSLSIWRRK
Sbjct: 1   MLRFAPNLLRRISNSATSTIHLYRFTHCTFFEHHRPQQQLLLRFITGSASSPSLSIWRRK 60

Query: 61  KEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDLVAVLVELQRQNQIFLCMKLY 120
           KEMGKEGLIVVKELKRLQSN IRLDRFIS+HVSRLLKSDLVAVLVELQRQNQ+FLCMKLY
Sbjct: 61  KEMGKEGLIVVKELKRLQSNLIRLDRFISSHVSRLLKSDLVAVLVELQRQNQVFLCMKLY 120

Query: 121 NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSD 180
           NVVRKEVWYRPDMFFYRDML MLAKNKRVEETKQVW+DLK EGVLFDQHTFGDI+RAY D
Sbjct: 121 NVVRKEVWYRPDMFFYRDMLTMLAKNKRVEETKQVWEDLKGEGVLFDQHTFGDIMRAYLD 180

Query: 181 NAMPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPP 240
           NAMPSEAMDIYREMR+SPDRPLSLPFRVILKGL+PYPELREQV+DDFLELFP MIVYDPP
Sbjct: 181 NAMPSEAMDIYREMRQSPDRPLSLPFRVILKGLVPYPELREQVRDDFLELFPDMIVYDPP 240

Query: 241 EDLFEEDENRRKS 254
           EDLFEEDE+R KS
Sbjct: 241 EDLFEEDEDRFKS 253

BLAST of HG10007566 vs. ExPASy TrEMBL
Match: A0A6J1EQI1 (pentatricopeptide repeat-containing protein At1g62350 OS=Cucurbita moschata OX=3662 GN=LOC111436516 PE=4 SV=1)

HSP 1 Score: 436.8 bits (1122), Expect = 6.2e-119
Identity = 221/253 (87.35%), Postives = 230/253 (90.91%), Query Frame = 0

Query: 1   MLRLAPNLFRRISNRANSTTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRK 60
           MLR APNL RR SN A ST H +R +  TF  H   QQQ LLRFITGSASSPSLSIWRRK
Sbjct: 1   MLRFAPNLLRRFSNSATSTIHVYRLTHCTFFEHHRPQQQLLLRFITGSASSPSLSIWRRK 60

Query: 61  KEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDLVAVLVELQRQNQIFLCMKLY 120
           KEMGKEGLIVVKELKR+QSN IRLDRFIS+HVSRLLKSDLVAVLVELQRQNQ+FLCMKLY
Sbjct: 61  KEMGKEGLIVVKELKRIQSNLIRLDRFISSHVSRLLKSDLVAVLVELQRQNQVFLCMKLY 120

Query: 121 NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSD 180
           NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVW+DLK EGVLFDQHTFGDI+RAY D
Sbjct: 121 NVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWEDLKGEGVLFDQHTFGDIMRAYLD 180

Query: 181 NAMPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPP 240
           NAMPSEAMDIYREMR+SPDRPLSLPFRVILKGL+PYPELREQVKDDFLELFP MIVYDPP
Sbjct: 181 NAMPSEAMDIYREMRQSPDRPLSLPFRVILKGLVPYPELREQVKDDFLELFPDMIVYDPP 240

Query: 241 EDLFEEDENRRKS 254
           EDLFEEDE   KS
Sbjct: 241 EDLFEEDEGMCKS 253

BLAST of HG10007566 vs. ExPASy TrEMBL
Match: A0A6J1DGX4 (pentatricopeptide repeat-containing protein At1g62350 OS=Momordica charantia OX=3673 GN=LOC111020768 PE=4 SV=1)

HSP 1 Score: 426.4 bits (1095), Expect = 8.4e-116
Identity = 218/257 (84.82%), Postives = 230/257 (89.49%), Query Frame = 0

Query: 1   MLRLAPNLFRRISNRA--NSTTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWR 60
           MLRL PNL RR  NR    +T  FH  S  TF + D LQQQ L RFITGSASSPSLSIWR
Sbjct: 1   MLRLVPNLLRRAPNRVTRTATIPFHLSSPITFFDRDQLQQQSLFRFITGSASSPSLSIWR 60

Query: 61  RKKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDLVAVLVELQRQNQIFLCMK 120
           RKKEMGKEGLIVVKELKRLQSN IRLDRFIS+HVSRLLKSDLVAVLVELQRQ Q+FLCMK
Sbjct: 61  RKKEMGKEGLIVVKELKRLQSNLIRLDRFISSHVSRLLKSDLVAVLVELQRQKQVFLCMK 120

Query: 121 LYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAY 180
           LYNVVRKEVWYRPDMFFYRDMLMML+KNK+VEETKQVWQDLK+E VLFDQHTFGDIIRAY
Sbjct: 121 LYNVVRKEVWYRPDMFFYRDMLMMLSKNKKVEETKQVWQDLKREEVLFDQHTFGDIIRAY 180

Query: 181 SDNAMPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYD 240
            DN MPSEAMDIYREMR+SPDRPLSLPFRVILKGLIPYPELREQ+KDDFLELFP MIVYD
Sbjct: 181 LDNGMPSEAMDIYREMRQSPDRPLSLPFRVILKGLIPYPELREQIKDDFLELFPDMIVYD 240

Query: 241 PPEDLFEEDENRRKSED 256
           PPEDLFEEDE+R++  D
Sbjct: 241 PPEDLFEEDEDRKREYD 257

BLAST of HG10007566 vs. ExPASy TrEMBL
Match: A0A7N2KMM5 (Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1)

HSP 1 Score: 367.5 bits (942), Expect = 4.6e-98
Identity = 188/261 (72.03%), Postives = 223/261 (85.44%), Query Frame = 0

Query: 1   MLRLAPNLFRRISNRANSTTHFHR-----FSLPTFLNHDVLQQQFLLRFITGSASSPSLS 60
           MLR A  L R+ S+ ++S+ +F +      S  TF N   +QQQ+L R ++G ASSPSLS
Sbjct: 1   MLRHARLLLRKPSS-SSSSPNFPKQNPFLLSENTFSNSKHIQQQWLFRHVSGLASSPSLS 60

Query: 61  IWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDLVAVLVELQRQNQIFL 120
           IWRRKKEMGKEGLIV KELKRL+SN +RLDRFI +HVSRLLKSDL AVL E QRQ+Q+FL
Sbjct: 61  IWRRKKEMGKEGLIVAKELKRLRSNSVRLDRFIRSHVSRLLKSDLFAVLAEFQRQDQVFL 120

Query: 121 CMKLYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDII 180
           CMKLY+VVRKE+WYRPDMFFYRDMLMMLA+NK V+E K+VW+DLK E VLFDQHTFGD+I
Sbjct: 121 CMKLYDVVRKEIWYRPDMFFYRDMLMMLARNKSVDEVKRVWEDLKGEEVLFDQHTFGDLI 180

Query: 181 RAYSDNAMPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMI 240
           RA+SD+ +PSEAM+IY EMR+SPD P+SLPFRVILKGLIPYPELRE+VKDDFLELFPGM+
Sbjct: 181 RAFSDSGLPSEAMEIYDEMRQSPDPPISLPFRVILKGLIPYPELREKVKDDFLELFPGMV 240

Query: 241 VYDPPEDLFEEDENRRKSEDD 257
           VYDPPEDLFE+ + +R+SEDD
Sbjct: 241 VYDPPEDLFEDQDWQRESEDD 260

BLAST of HG10007566 vs. TAIR 10
Match: AT1G62350.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 311.2 bits (796), Expect = 7.6e-85
Identity = 147/194 (75.77%), Postives = 172/194 (88.66%), Query Frame = 0

Query: 63  MGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDLVAVLVELQRQNQIFLCMKLYNV 122
           M KEGLI  KELKRLQ+  +RLDRFI +HVSRLLKSDLV+VL E QRQNQ+FLCMKLY V
Sbjct: 1   MSKEGLIAAKELKRLQTQSVRLDRFIGSHVSRLLKSDLVSVLAEFQRQNQVFLCMKLYEV 60

Query: 123 VRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNA 182
           VR+E+WYRPDMFFYRDMLMMLA+NK+V+ETK+VW+DLKKE VLFDQHTFGD++R + DN 
Sbjct: 61  VRREIWYRPDMFFYRDMLMMLARNKKVDETKKVWEDLKKEEVLFDQHTFGDLVRGFLDNE 120

Query: 183 MPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPPED 242
           +P EAM +Y EMRESPDRPLSLPFRVILKGL+PYPELRE+VKDDFLELFPGMIVYDPPED
Sbjct: 121 LPLEAMRLYGEMRESPDRPLSLPFRVILKGLVPYPELREKVKDDFLELFPGMIVYDPPED 180

Query: 243 LFEEDENRRKSEDD 257
           + E+ +   +++ D
Sbjct: 181 ICEDSDEEARTDSD 194

BLAST of HG10007566 vs. TAIR 10
Match: AT3G46870.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 205.7 bits (522), Expect = 4.5e-53
Identity = 97/205 (47.32%), Postives = 146/205 (71.22%), Query Frame = 0

Query: 40  FLLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSD 99
           F+ RF  G    P   +WR KK +GKE L V+  LKRL+ +  +LD+FI THV RLLK D
Sbjct: 53  FVSRFHDGRPRGP---LWRGKKLIGKEALFVILGLKRLKEDDEKLDKFIKTHVFRLLKLD 112

Query: 100 LVAVLVELQRQNQIFLCMKLYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDL 159
           ++AV+ EL+RQ +  L +K++ V++K+ WY+PD+F Y+D+++ LAK+KR++E   +W+ +
Sbjct: 113 MLAVIGELERQEETALAIKMFEVIQKQEWYQPDVFMYKDLIVSLAKSKRMDEAMALWEKM 172

Query: 160 KKEGVLFDQHTFGDIIRAYSDNAMPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPEL 219
           KKE +  D  T+ ++IR +  +  P++AM++Y +M +SPD P  LPFRV+LKGL+P+P L
Sbjct: 173 KKENLFPDSQTYTEVIRGFLRDGCPADAMNVYEDMLKSPDPPEELPFRVLLKGLLPHPLL 232

Query: 220 REQVKDDFLELFPGMIVYDPPEDLF 245
           R +VK DF ELFP    YDPPE++F
Sbjct: 233 RNKVKKDFEELFPEKHAYDPPEEIF 254

BLAST of HG10007566 vs. TAIR 10
Match: AT5G09320.1 (Vacuolar sorting protein 9 (VPS9) domain )

HSP 1 Score: 81.3 bits (199), Expect = 1.3e-15
Identity = 57/168 (33.93%), Postives = 86/168 (51.19%), Query Frame = 0

Query: 84  LDRFISTHVSRLLKSDLVAVLVELQRQNQIFLCMKLYNVVRKEVWYRPDMFFYRDMLMML 143
           LDR I +   RLLK D+VAVL EL RQN+  L +K++  +RKE WY+P +  Y DM+ ++
Sbjct: 541 LDRVIISKFRRLLKFDMVAVLRELLRQNECSLALKVFEEIRKEYWYKPQVRMYTDMITVM 600

Query: 144 AKNKRVEETKQVWQDLKKE-GVLFDQHTFGDIIRAYSDNAMPSEAMDIYREMRESPDRPL 203
           A N  +EE   ++  +K E G++ +   F  ++    ++ +    MD Y  M+     P 
Sbjct: 601 ADNSLMEEVNYLYSAMKSEKGLMAEIEWFNTLLTILLNHKLFDLVMDCYAFMQSIGYEPD 660

Query: 204 SLPFRVILKGLIPYPE--LREQVKDDFLELFPGMIVYDPPEDLFEEDE 249
              FRV++ GL    E  L   V+ D  E       Y    +  EEDE
Sbjct: 661 RASFRVLVLGLESNGEMGLSAIVRQDAHE------YYGESLEFIEEDE 702

BLAST of HG10007566 vs. TAIR 10
Match: AT3G42570.1 (peroxidase family protein )

HSP 1 Score: 50.8 bits (120), Expect = 1.8e-06
Identity = 24/34 (70.59%), Postives = 27/34 (79.41%), Query Frame = 0

Query: 60 KKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVS 94
          KKE  KEGLI  KELKRLQ+N +RLDRFI +H S
Sbjct: 4  KKEKSKEGLIAAKELKRLQTNLVRLDRFIDSHPS 37

BLAST of HG10007566 vs. TAIR 10
Match: AT5G48730.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 48.5 bits (114), Expect = 9.2e-06
Identity = 44/230 (19.13%), Postives = 105/230 (45.65%), Query Frame = 0

Query: 18  STTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRK---------KEMGKEGL 77
           ST+  H   LPT  N    ++ F +R I+ S   P+ +I   K         +    + L
Sbjct: 5   STSTSHAPPLPT--NRRTAERTFTVRCISISPREPNYAITSDKSNNTSLSLRETRQSKWL 64

Query: 78  IVVKELKRLQSNFIRLDRFIS----THVSRLLKSDLVAVLVELQRQNQIFL--------- 137
           I  +++    S  I+ D+         +S +L+ +    ++E ++ ++  L         
Sbjct: 65  INAEDVNERDSKEIKEDKNTKIASRKAISIILRREATKSIIEKKKGSKKLLPRTVLESLH 124

Query: 138 ----------CMKLYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVL 197
                      ++++ ++R+++WY+P++  Y  +++ML K K+ E+  +++Q++  EG +
Sbjct: 125 ERITALRWESAIQVFELLREQLWYKPNVGIYVKLIVMLGKCKQPEKAHELFQEMINEGCV 184

Query: 198 FDQHTFGDIIRAYSDNAMPSEAMDIYREMRESPD-RPLSLPFRVILKGLI 215
            +   +  ++ AYS +     A  +   M+ S + +P    + +++K  +
Sbjct: 185 VNHEVYTALVSAYSRSGRFDAAFTLLERMKSSHNCQPDVHTYSILIKSFL 232

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004139064.13.2e-12591.41pentatricopeptide repeat-containing protein At1g62350 isoform X1 [Cucumis sativu... [more]
XP_008450338.14.1e-12591.41PREDICTED: pentatricopeptide repeat-containing protein At1g62350 [Cucumis melo][more]
XP_038878451.15.4e-12593.52pentatricopeptide repeat-containing protein At1g62350 [Benincasa hispida][more]
XP_023531705.14.0e-12087.75pentatricopeptide repeat-containing protein At1g62350 [Cucurbita pepo subsp. pep... [more]
XP_022966084.16.8e-12088.14pentatricopeptide repeat-containing protein At1g62350 [Cucurbita maxima] >XP_022... [more]
Match NameE-valueIdentityDescription
Q1PFH71.1e-8375.77Pentatricopeptide repeat-containing protein At1g62350 OS=Arabidopsis thaliana OX... [more]
Q9STF96.3e-5247.32Protein THYLAKOID ASSEMBLY 8-like, chloroplastic OS=Arabidopsis thaliana OX=3702... [more]
Q9FKC31.3e-0419.13Pentatricopeptide repeat-containing protein At5g48730, chloroplastic OS=Arabidop... [more]
Q9LVW61.7e-0427.94Protein THYLAKOID ASSEMBLY 8, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=T... [more]
Q5G1S88.4e-0427.38Pentatricopeptide repeat-containing protein At3g18110, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A1S3BQ112.0e-12591.41pentatricopeptide repeat-containing protein At1g62350 OS=Cucumis melo OX=3656 GN... [more]
A0A6J1HQL73.3e-12088.14pentatricopeptide repeat-containing protein At1g62350 OS=Cucurbita maxima OX=366... [more]
A0A6J1EQI16.2e-11987.35pentatricopeptide repeat-containing protein At1g62350 OS=Cucurbita moschata OX=3... [more]
A0A6J1DGX48.4e-11684.82pentatricopeptide repeat-containing protein At1g62350 OS=Momordica charantia OX=... [more]
A0A7N2KMM54.6e-9872.03Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G62350.17.6e-8575.77Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G46870.14.5e-5347.32Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G09320.11.3e-1533.93Vacuolar sorting protein 9 (VPS9) domain [more]
AT3G42570.11.8e-0670.59peroxidase family protein [more]
AT5G48730.19.2e-0619.13Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 136..164
e-value: 0.29
score: 11.5
coord: 170..197
e-value: 0.002
score: 18.2
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 170..197
e-value: 0.0011
score: 17.0
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 167..201
score: 10.150222
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 132..166
score: 8.6266
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 62..247
e-value: 4.3E-39
score: 135.8
IPR044795Pentatricopeptide repeat-containing protein THA8L-likePANTHERPTHR46870PROTEIN THYLAKOID ASSEMBLY 8-LIKE, CHLOROPLASTICcoord: 2..254
NoneNo IPR availablePANTHERPTHR46870:SF1BNAC09G13590D PROTEINcoord: 2..254

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10007566.1HG10007566.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding