HG10007887 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10007887
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationChr10: 16529030 .. 16530972 (-)
RNA-Seq ExpressionHG10007887
SyntenyHG10007887
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAACTGTATGGTGTTCGTGTCCTTTTGTACTCTAATGGCTGAGAACATCCTGGGAGTTCTACTCTTAAAGCCACAACACCATGACTTTAGAGTCCAATACTTCCAATTTGTAGGCGTTAGTTCTTACTGTTCCTTTTATGGTTTGATGCAGATCATTGATGTTCTTCACCCAGGAAGGCCAAATGTCTCTAAGGTAAGTCGATTTTACTAACAGAAGTTATTGGGCTTCTGCTTTAGCATAATTGCAGTGCAGCTTGTGGATTGCTTTCATAATGAATGAATCCAGCAATTATGTCTTACAACCAATTTATCACAGTTCATATTTTAGAATTTTTTGCTTTCTTTAATGTTTTATGCCCAGCCACAGATACATTGTTGAGTTAAATGGTTTTGAGTTTCAAGTCTGGTTGCATGTTAAATTGTCTGCTCATTTGATTTCACTTTTGTCTTTTTGATTTGAAGGCAGAGCTGAAGGAAAGGTTAGCAAGAATATATGACGTGAAGGATGCAAATGCAATGTTTGTCTTTAAGCTCCGAACTCACTTTGGAGGTGGAAAATCAACTAGGTTTGGTTTGATTTATGATTTTGTTGAAAATGCCAAAAAATATGAGCCCAAGTACATTTGTCTCATCCGTCATTTCAGTACCTCAAATTCTTCTGGGAATGGCTCTGTTGCGCCTTCCAAAGACGACTATTTCGCCGCAATCCACCATATCTCCCACATTGTCCGTCGAGACTTCTACATGGAGCGCACTCTCAACAAGCTCCAAATCTCCTACCTCAATTCTGAGCTCGTTTTCAGGGTCCTTCGCGCTTGCTCCAACTCTGGTACTGAGTCCTTTCGTTTCTTCAACTGGGCTTGCACTCACAACCCCTCTTACCAACCCACTACCCTTGAATTTGAAGAGCTCGTCAAAACCCTAGCTCGGACCAAAAAGTATACGACGATGTGGAAAGTTCTTCTTCAGATGAAGACGCAGAATCTTAAAATTTCACCAGAAACGATATCGTTCATAATTCAAGAGTATGGTAAGCAAGGCCTTGTTGATAATGCGGTTACCATTTTCAATCAATGTTCTAAATCTATCGACTGTCCACAAACAGTTGAGGTCTATAACGCATTACTTTTTGCGCTTTGTGAGGTTAAGATGTTTCATGGAGCTTATGCATTGATTAGGAGGATGATTAGAAAAGGGGTAACTCCCGATAAAAAGACTTATGGAACTCTTGTAACTGGATGGTGCTCAGCGGGGAAGATGAGGGAAGCTCAGGAGTTCTTGGAGGAAATGAGCCAGAAGGGGTTCAATCCTCCTTTGCGAGGTCGTGATCTTTTGGTAGAAGGATTGCTGAATGCAGGATATTTAGAATCTGCAAAGGTTATGGTTAGAAAAATGATAAAAGAAGGATCTGTGCCTGATATAGGGACTTTTAATTCTCTGATTGATGTTATATGCAACTCTGGAGAAGTTGATTTTTGCATTAATATTTTTCATGAGGTGTGCAAGTTGGGGCTTTGTCCTGATATAAATACTTACAAGATTTTGATTCCAGCAACTTCGAAAGTAGGTAGGATTGATGAAGCATTCAGGCTTTTGCATTGTTGTATTGAGGATGGACATATACCGTTTCCAAGTCTTTATGGACCAATTCTTAAAGGAATGTGTAAAAGGGGTCAGTTTGATGATGCATTTTGCTTTTTCAGTGATATGAAACATAAGGGGCATCCACCAAATCGGCCGGTGTACACAATGTTGATAACAATGTGTGGACGTGGTGGGAGATTTGTTGATGCTGCTAATTACTTGATGGAGATGGCTGAATTTGGTTTACCTCCAATTTCAAGGTGCTTTGATATGGTTACCGATGGATTGAAGAACTGTGGAAAGCATGATTTAGCTAAGAAGATCGAGCAGCTTGAAGTTTCTATTCGAGGCATTTGA

mRNA sequence

ATGAACTGTATGGTGTTCGTGTCCTTTTGTACTCTAATGGCTGAGAACATCCTGGGAGTTCTACTCTTAAAGCCACAACACCATGACTTTAGAGTCCAATACTTCCAATTTGTAGGCGTTAGTTCTTACTGTTCCTTTTATGGTTTGATGCAGATCATTGATGTTCTTCACCCAGGAAGGCCAAATGTCTCTAAGGCAGAGCTGAAGGAAAGGTTAGCAAGAATATATGACGTGAAGGATGCAAATGCAATGTTTGTCTTTAAGCTCCGAACTCACTTTGGAGGTGGAAAATCAACTAGGTTTGGTTTGATTTATGATTTTGTTGAAAATGCCAAAAAATATGAGCCCAAGTACATTTGTCTCATCCGTCATTTCAGTACCTCAAATTCTTCTGGGAATGGCTCTGTTGCGCCTTCCAAAGACGACTATTTCGCCGCAATCCACCATATCTCCCACATTGTCCGTCGAGACTTCTACATGGAGCGCACTCTCAACAAGCTCCAAATCTCCTACCTCAATTCTGAGCTCGTTTTCAGGGTCCTTCGCGCTTGCTCCAACTCTGGTACTGAGTCCTTTCGTTTCTTCAACTGGGCTTGCACTCACAACCCCTCTTACCAACCCACTACCCTTGAATTTGAAGAGCTCGTCAAAACCCTAGCTCGGACCAAAAAGTATACGACGATGTGGAAAGTTCTTCTTCAGATGAAGACGCAGAATCTTAAAATTTCACCAGAAACGATATCGTTCATAATTCAAGAGTATGGTAAGCAAGGCCTTGTTGATAATGCGGTTACCATTTTCAATCAATGTTCTAAATCTATCGACTGTCCACAAACAGTTGAGGTCTATAACGCATTACTTTTTGCGCTTTGTGAGGTTAAGATGTTTCATGGAGCTTATGCATTGATTAGGAGGATGATTAGAAAAGGGGTAACTCCCGATAAAAAGACTTATGGAACTCTTGTAACTGGATGGTGCTCAGCGGGGAAGATGAGGGAAGCTCAGGAGTTCTTGGAGGAAATGAGCCAGAAGGGGTTCAATCCTCCTTTGCGAGGTCGTGATCTTTTGGTAGAAGGATTGCTGAATGCAGGATATTTAGAATCTGCAAAGGTTATGGTTAGAAAAATGATAAAAGAAGGATCTGTGCCTGATATAGGGACTTTTAATTCTCTGATTGATGTTATATGCAACTCTGGAGAAGTTGATTTTTGCATTAATATTTTTCATGAGGTGTGCAAGTTGGGGCTTTGTCCTGATATAAATACTTACAAGATTTTGATTCCAGCAACTTCGAAAGTAGGTAGGATTGATGAAGCATTCAGGCTTTTGCATTGTTGTATTGAGGATGGACATATACCGTTTCCAAGTCTTTATGGACCAATTCTTAAAGGAATGTGTAAAAGGGGTCAGTTTGATGATGCATTTTGCTTTTTCAGTGATATGAAACATAAGGGGCATCCACCAAATCGGCCGGTGTACACAATGTTGATAACAATGTGTGGACGTGGTGGGAGATTTGTTGATGCTGCTAATTACTTGATGGAGATGGCTGAATTTGGTTTACCTCCAATTTCAAGGTGCTTTGATATGGTTACCGATGGATTGAAGAACTGTGGAAAGCATGATTTAGCTAAGAAGATCGAGCAGCTTGAAGTTTCTATTCGAGGCATTTGA

Coding sequence (CDS)

ATGAACTGTATGGTGTTCGTGTCCTTTTGTACTCTAATGGCTGAGAACATCCTGGGAGTTCTACTCTTAAAGCCACAACACCATGACTTTAGAGTCCAATACTTCCAATTTGTAGGCGTTAGTTCTTACTGTTCCTTTTATGGTTTGATGCAGATCATTGATGTTCTTCACCCAGGAAGGCCAAATGTCTCTAAGGCAGAGCTGAAGGAAAGGTTAGCAAGAATATATGACGTGAAGGATGCAAATGCAATGTTTGTCTTTAAGCTCCGAACTCACTTTGGAGGTGGAAAATCAACTAGGTTTGGTTTGATTTATGATTTTGTTGAAAATGCCAAAAAATATGAGCCCAAGTACATTTGTCTCATCCGTCATTTCAGTACCTCAAATTCTTCTGGGAATGGCTCTGTTGCGCCTTCCAAAGACGACTATTTCGCCGCAATCCACCATATCTCCCACATTGTCCGTCGAGACTTCTACATGGAGCGCACTCTCAACAAGCTCCAAATCTCCTACCTCAATTCTGAGCTCGTTTTCAGGGTCCTTCGCGCTTGCTCCAACTCTGGTACTGAGTCCTTTCGTTTCTTCAACTGGGCTTGCACTCACAACCCCTCTTACCAACCCACTACCCTTGAATTTGAAGAGCTCGTCAAAACCCTAGCTCGGACCAAAAAGTATACGACGATGTGGAAAGTTCTTCTTCAGATGAAGACGCAGAATCTTAAAATTTCACCAGAAACGATATCGTTCATAATTCAAGAGTATGGTAAGCAAGGCCTTGTTGATAATGCGGTTACCATTTTCAATCAATGTTCTAAATCTATCGACTGTCCACAAACAGTTGAGGTCTATAACGCATTACTTTTTGCGCTTTGTGAGGTTAAGATGTTTCATGGAGCTTATGCATTGATTAGGAGGATGATTAGAAAAGGGGTAACTCCCGATAAAAAGACTTATGGAACTCTTGTAACTGGATGGTGCTCAGCGGGGAAGATGAGGGAAGCTCAGGAGTTCTTGGAGGAAATGAGCCAGAAGGGGTTCAATCCTCCTTTGCGAGGTCGTGATCTTTTGGTAGAAGGATTGCTGAATGCAGGATATTTAGAATCTGCAAAGGTTATGGTTAGAAAAATGATAAAAGAAGGATCTGTGCCTGATATAGGGACTTTTAATTCTCTGATTGATGTTATATGCAACTCTGGAGAAGTTGATTTTTGCATTAATATTTTTCATGAGGTGTGCAAGTTGGGGCTTTGTCCTGATATAAATACTTACAAGATTTTGATTCCAGCAACTTCGAAAGTAGGTAGGATTGATGAAGCATTCAGGCTTTTGCATTGTTGTATTGAGGATGGACATATACCGTTTCCAAGTCTTTATGGACCAATTCTTAAAGGAATGTGTAAAAGGGGTCAGTTTGATGATGCATTTTGCTTTTTCAGTGATATGAAACATAAGGGGCATCCACCAAATCGGCCGGTGTACACAATGTTGATAACAATGTGTGGACGTGGTGGGAGATTTGTTGATGCTGCTAATTACTTGATGGAGATGGCTGAATTTGGTTTACCTCCAATTTCAAGGTGCTTTGATATGGTTACCGATGGATTGAAGAACTGTGGAAAGCATGATTTAGCTAAGAAGATCGAGCAGCTTGAAGTTTCTATTCGAGGCATTTGA

Protein sequence

MNCMVFVSFCTLMAENILGVLLLKPQHHDFRVQYFQFVGVSSYCSFYGLMQIIDVLHPGRPNVSKAELKERLARIYDVKDANAMFVFKLRTHFGGGKSTRFGLIYDFVENAKKYEPKYICLIRHFSTSNSSGNGSVAPSKDDYFAAIHHISHIVRRDFYMERTLNKLQISYLNSELVFRVLRACSNSGTESFRFFNWACTHNPSYQPTTLEFEELVKTLARTKKYTTMWKVLLQMKTQNLKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFHGAYALIRRMIRKGVTPDKKTYGTLVTGWCSAGKMREAQEFLEEMSQKGFNPPLRGRDLLVEGLLNAGYLESAKVMVRKMIKEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDINTYKILIPATSKVGRIDEAFRLLHCCIEDGHIPFPSLYGPILKGMCKRGQFDDAFCFFSDMKHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAEFGLPPISRCFDMVTDGLKNCGKHDLAKKIEQLEVSIRGI
Homology
BLAST of HG10007887 vs. NCBI nr
Match: XP_038879743.1 (pentatricopeptide repeat-containing protein At5g18390, mitochondrial [Benincasa hispida] >XP_038879744.1 pentatricopeptide repeat-containing protein At5g18390, mitochondrial [Benincasa hispida] >XP_038879746.1 pentatricopeptide repeat-containing protein At5g18390, mitochondrial [Benincasa hispida] >XP_038879747.1 pentatricopeptide repeat-containing protein At5g18390, mitochondrial [Benincasa hispida])

HSP 1 Score: 898.3 bits (2320), Expect = 3.4e-257
Identity = 444/506 (87.75%), Postives = 445/506 (87.94%), Query Frame = 0

Query: 52  IIDVLHPGRPNVSKAELKERLARIYDVKDANAMFVFKLRTHFGGGKSTRFGLIYDFVENA 111
           IIDVLHPGRPNVS                                               
Sbjct: 12  IIDVLHPGRPNVS----------------------------------------------- 71

Query: 112 KKYEPKYICLIRHFSTSNSSGNGSVAPSKDDYFAAIHHISHIVRRDFYMERTLNKLQISY 171
                KYICLIRHFSTSNSS NGSVAPSKDDYFAAIHHISHIVRRDFYMERTLNKL+ISY
Sbjct: 72  -----KYICLIRHFSTSNSSWNGSVAPSKDDYFAAIHHISHIVRRDFYMERTLNKLRISY 131

Query: 172 LNSELVFRVLRACSNSGTESFRFFNWACTHNPSYQPTTLEFEELVKTLARTKKYTTMWKV 231
           LNSELVFRVLRACSNSGTESFRFFNWACTHNPSYQPTTLEFEELVKTLARTKKY TMWKV
Sbjct: 132 LNSELVFRVLRACSNSGTESFRFFNWACTHNPSYQPTTLEFEELVKTLARTKKYATMWKV 191

Query: 232 LLQMKTQNLKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALC 291
           L QMK QNLKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALC
Sbjct: 192 LQQMKMQNLKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALC 251

Query: 292 EVKMFHGAYALIRRMIRKGVTPDKKTYGTLVTGWCSAGKMREAQEFLEEMSQKGFNPPLR 351
           EVKMFHGAYALIRRMIRKGVTPDKKTYGTLVTGWCSAGKMREAQEFLEEMSQKGFNPPLR
Sbjct: 252 EVKMFHGAYALIRRMIRKGVTPDKKTYGTLVTGWCSAGKMREAQEFLEEMSQKGFNPPLR 311

Query: 352 GRDLLVEGLLNAGYLESAKVMVRKMIKEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEV 411
           GRDLLVEGLLNAGY ESAK MVRKM KEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEV
Sbjct: 312 GRDLLVEGLLNAGYFESAKDMVRKMTKEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEV 371

Query: 412 CKLGLCPDINTYKILIPATSKVGRIDEAFRLLHCCIEDGHIPFPSLYGPILKGMCKRGQF 471
           CKLGLCPDINTYKILIPATSKVGRIDEAFRLLHCCIEDGHIPFPSLYGPILKGMCKR QF
Sbjct: 372 CKLGLCPDINTYKILIPATSKVGRIDEAFRLLHCCIEDGHIPFPSLYGPILKGMCKRSQF 431

Query: 472 DDAFCFFSDMKHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAEFGLPPISRCFDMV 531
           DDAFCFFSDMK KGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAEFGLPPISRCFDMV
Sbjct: 432 DDAFCFFSDMKRKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAEFGLPPISRCFDMV 465

Query: 532 TDGLKNCGKHDLAKKIEQLEVSIRGI 558
           TDGLKNCGKHDLAKKIEQLEVSIRGI
Sbjct: 492 TDGLKNCGKHDLAKKIEQLEVSIRGI 465

BLAST of HG10007887 vs. NCBI nr
Match: XP_004142520.1 (pentatricopeptide repeat-containing protein At5g18390, mitochondrial [Cucumis sativus] >KGN66807.1 hypothetical protein Csa_007547 [Cucumis sativus])

HSP 1 Score: 882.9 bits (2280), Expect = 1.5e-252
Identity = 424/441 (96.15%), Postives = 430/441 (97.51%), Query Frame = 0

Query: 117 KYICLIRHFSTSNSSGNGSVAPSKDDYFAAIHHISHIVRRDFYMERTLNKLQISYLNSEL 176
           +YI L RHFS SNS  NGS APSKDDYFAAIHHISHIVRRDFYMERTLNKL+IS LNSEL
Sbjct: 15  RYIFLNRHFSNSNSLVNGSTAPSKDDYFAAIHHISHIVRRDFYMERTLNKLRISNLNSEL 74

Query: 177 VFRVLRACSNSGTESFRFFNWACTHNPSYQPTTLEFEELVKTLARTKKYTTMWKVLLQMK 236
           VFRVLRACSNSGTESFRFFNWAC+HNPSYQPTTLEFEELVKTLART+KYTTMWKVLLQMK
Sbjct: 75  VFRVLRACSNSGTESFRFFNWACSHNPSYQPTTLEFEELVKTLARTRKYTTMWKVLLQMK 134

Query: 237 TQNLKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMF 296
           TQNLKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMF
Sbjct: 135 TQNLKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMF 194

Query: 297 HGAYALIRRMIRKGVTPDKKTYGTLVTGWCSAGKMREAQEFLEEMSQKGFNPPLRGRDLL 356
           HGAYALIRRMIRKGVTPDKKTYGTLVTGWCSAGKM+EAQEFLEEMSQKGFNPPLRGRDLL
Sbjct: 195 HGAYALIRRMIRKGVTPDKKTYGTLVTGWCSAGKMKEAQEFLEEMSQKGFNPPLRGRDLL 254

Query: 357 VEGLLNAGYLESAKVMVRKMIKEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGL 416
           VEGLLNAGYLESAK MVRKM KEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGL
Sbjct: 255 VEGLLNAGYLESAKDMVRKMTKEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGL 314

Query: 417 CPDINTYKILIPATSKVGRIDEAFRLLHCCIEDGHIPFPSLYGPILKGMCKRGQFDDAFC 476
           CPDINTYKILIPATSKVGRIDEAFRLLHCCIEDGH+PFPSLYGPILKGMCKRGQFDDAFC
Sbjct: 315 CPDINTYKILIPATSKVGRIDEAFRLLHCCIEDGHVPFPSLYGPILKGMCKRGQFDDAFC 374

Query: 477 FFSDMKHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAEFGLPPISRCFDMVTDGLK 536
           FF DMKHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAE GLPPISRCFDMVTDGLK
Sbjct: 375 FFGDMKHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAELGLPPISRCFDMVTDGLK 434

Query: 537 NCGKHDLAKKIEQLEVSIRGI 558
           NCGKHDLAKKIEQLEVSIRGI
Sbjct: 435 NCGKHDLAKKIEQLEVSIRGI 455

BLAST of HG10007887 vs. NCBI nr
Match: XP_008462724.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g18390, mitochondrial [Cucumis melo] >KAA0062598.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK29718.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 872.5 bits (2253), Expect = 2.0e-249
Identity = 418/440 (95.00%), Postives = 428/440 (97.27%), Query Frame = 0

Query: 118 YICLIRHFSTSNSSGNGSVAPSKDDYFAAIHHISHIVRRDFYMERTLNKLQISYLNSELV 177
           YI LIR FS SNSS NGS APSKDDYFAAIHHISHIVRRDFYMERTLNKL+ISYLNSELV
Sbjct: 16  YISLIRCFSNSNSSVNGSTAPSKDDYFAAIHHISHIVRRDFYMERTLNKLRISYLNSELV 75

Query: 178 FRVLRACSNSGTESFRFFNWACTHNPSYQPTTLEFEELVKTLARTKKYTTMWKVLLQMKT 237
           FRVLRACSN GTESFRFFNWAC+HNPSYQPTTLE EELVKTLART+KYTTMWKVLLQMKT
Sbjct: 76  FRVLRACSNCGTESFRFFNWACSHNPSYQPTTLELEELVKTLARTRKYTTMWKVLLQMKT 135

Query: 238 QNLKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFH 297
           QNLKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFH
Sbjct: 136 QNLKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFH 195

Query: 298 GAYALIRRMIRKGVTPDKKTYGTLVTGWCSAGKMREAQEFLEEMSQKGFNPPLRGRDLLV 357
           GAYALIRRMI+KGVTPDKKTYGTLVTGWCSAGKM+EAQEFLEEMSQKGFNPPLRGRDLLV
Sbjct: 196 GAYALIRRMIKKGVTPDKKTYGTLVTGWCSAGKMKEAQEFLEEMSQKGFNPPLRGRDLLV 255

Query: 358 EGLLNAGYLESAKVMVRKMIKEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLC 417
           EGLLNAGYLESAK MVRKM KEG VPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLC
Sbjct: 256 EGLLNAGYLESAKDMVRKMTKEGCVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLC 315

Query: 418 PDINTYKILIPATSKVGRIDEAFRLLHCCIEDGHIPFPSLYGPILKGMCKRGQFDDAFCF 477
           PDINTYKILIPATSKVGRIDEAFRLL+CCIEDGH+PFPSLYGPI+KGMCKRGQFDDAFCF
Sbjct: 316 PDINTYKILIPATSKVGRIDEAFRLLNCCIEDGHVPFPSLYGPIIKGMCKRGQFDDAFCF 375

Query: 478 FSDMKHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAEFGLPPISRCFDMVTDGLKN 537
           F DMKHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAE GLPPISRCFDMVTDGLK+
Sbjct: 376 FGDMKHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAELGLPPISRCFDMVTDGLKS 435

Query: 538 CGKHDLAKKIEQLEVSIRGI 558
           CGKHDLAKKIE+LEVSIRGI
Sbjct: 436 CGKHDLAKKIEKLEVSIRGI 455

BLAST of HG10007887 vs. NCBI nr
Match: XP_022988097.1 (pentatricopeptide repeat-containing protein At5g18390, mitochondrial [Cucurbita maxima])

HSP 1 Score: 859.0 bits (2218), Expect = 2.3e-245
Identity = 418/473 (88.37%), Postives = 438/473 (92.60%), Query Frame = 0

Query: 84  MFVFKLRTHFGGGKSTRFGLIYDFVENAKKYEPKYICLIRHFSTSNSSGNGSVAPSKDDY 143
           MF FK+ T       T+F +I              I LIRHFSTSNSS NGS  PSKDDY
Sbjct: 1   MFHFKIFT-------TQFSIIPKLHRITDAIRFHNIFLIRHFSTSNSSSNGSGTPSKDDY 60

Query: 144 FAAIHHISHIVRRDFYMERTLNKLQISYLNSELVFRVLRACSNSGTESFRFFNWACTHNP 203
           FAAIHHIS+IVRRD YMERTLNKL+ISYLNSELVFRVLRACSNSGTESFRFFNWAC++NP
Sbjct: 61  FAAIHHISNIVRRDIYMERTLNKLRISYLNSELVFRVLRACSNSGTESFRFFNWACSNNP 120

Query: 204 SYQPTTLEFEELVKTLARTKKYTTMWKVLLQMKTQNLKISPETISFIIQEYGKQGLVDNA 263
           SYQPTTLEFEELVKTLARTKKYTTMWKVL QMKTQNLKISPETISF+I+EYGKQGLVD A
Sbjct: 121 SYQPTTLEFEELVKTLARTKKYTTMWKVLHQMKTQNLKISPETISFVIEEYGKQGLVDGA 180

Query: 264 VTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFHGAYALIRRMIRKGVTPDKKTYGTLVT 323
           VTIFNQCSKSIDCPQTVEVYNALLFALCE+KMFHGAYALIRRMIRKGVTPDKKTYG LVT
Sbjct: 181 VTIFNQCSKSIDCPQTVEVYNALLFALCEIKMFHGAYALIRRMIRKGVTPDKKTYGILVT 240

Query: 324 GWCSAGKMREAQEFLEEMSQKGFNPPLRGRDLLVEGLLNAGYLESAKVMVRKMIKEGSVP 383
           GWCS+GKM+EAQ+FLEEMSQKGFNPPLRGRDLLVEGLLNAGYLESAK MVRKM KEGSVP
Sbjct: 241 GWCSSGKMKEAQQFLEEMSQKGFNPPLRGRDLLVEGLLNAGYLESAKNMVRKMTKEGSVP 300

Query: 384 DIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDINTYKILIPATSKVGRIDEAFRLL 443
           D+GTFNSLI+VIC+SGEVDFCINI+HEVCKLGLCPDINTYKILIPATSKVGRIDEAFRLL
Sbjct: 301 DLGTFNSLINVICDSGEVDFCINIYHEVCKLGLCPDINTYKILIPATSKVGRIDEAFRLL 360

Query: 444 HCCIEDGHIPFPSLYGPILKGMCKRGQFDDAFCFFSDMKHKGHPPNRPVYTMLITMCGRG 503
           + CIEDGHIPFPSLYGPI+KGMCKRGQFDDAFCFFSDMKHKGHPPNRPVYTMLITMCGRG
Sbjct: 361 NYCIEDGHIPFPSLYGPIIKGMCKRGQFDDAFCFFSDMKHKGHPPNRPVYTMLITMCGRG 420

Query: 504 GRFVDAANYLMEMAEFGLPPISRCFDMVTDGLKNCGKHDLAKKIEQLEVSIRG 557
           GRFVDAANYLMEMAEFGLPPISRCFDMVTDGLK+CGKHDLAKKIEQLEVSIRG
Sbjct: 421 GRFVDAANYLMEMAEFGLPPISRCFDMVTDGLKSCGKHDLAKKIEQLEVSIRG 466

BLAST of HG10007887 vs. NCBI nr
Match: KAG6590295.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 857.8 bits (2215), Expect = 5.1e-245
Identity = 419/473 (88.58%), Postives = 438/473 (92.60%), Query Frame = 0

Query: 84  MFVFKLRTHFGGGKSTRFGLIYDFVENAKKYEPKYICLIRHFSTSNSSGNGSVAPSKDDY 143
           MF FK+ T       T+F +I      A       I LIRHFSTSNSS NGS  PSKDDY
Sbjct: 1   MFHFKIFT-------TQFSIIPKLHRIAAAKRFHNIFLIRHFSTSNSSSNGSGTPSKDDY 60

Query: 144 FAAIHHISHIVRRDFYMERTLNKLQISYLNSELVFRVLRACSNSGTESFRFFNWACTHNP 203
           FAAIHHIS+IVRRD YMERTLNKL+ISYLNSELVFRVLRACSNSGTESFRFFNWAC++NP
Sbjct: 61  FAAIHHISNIVRRDIYMERTLNKLRISYLNSELVFRVLRACSNSGTESFRFFNWACSNNP 120

Query: 204 SYQPTTLEFEELVKTLARTKKYTTMWKVLLQMKTQNLKISPETISFIIQEYGKQGLVDNA 263
           SYQPTTLEFEELVKTLARTKKYTTMWKVL QMK QNLKISPETISF+I+EYGKQGLVD A
Sbjct: 121 SYQPTTLEFEELVKTLARTKKYTTMWKVLHQMKNQNLKISPETISFVIEEYGKQGLVDGA 180

Query: 264 VTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFHGAYALIRRMIRKGVTPDKKTYGTLVT 323
           VTIFNQCSKSIDCPQTVEVYNALLFALCE+KMFHGAYALIRRMIRKGVTPDKKTYG LVT
Sbjct: 181 VTIFNQCSKSIDCPQTVEVYNALLFALCEIKMFHGAYALIRRMIRKGVTPDKKTYGILVT 240

Query: 324 GWCSAGKMREAQEFLEEMSQKGFNPPLRGRDLLVEGLLNAGYLESAKVMVRKMIKEGSVP 383
           GWCS+GKM+EAQ+FLEEMSQKGFNPPLRGRDLLVEGLLNAGYLESAK MVRKMIKEGSVP
Sbjct: 241 GWCSSGKMKEAQQFLEEMSQKGFNPPLRGRDLLVEGLLNAGYLESAKNMVRKMIKEGSVP 300

Query: 384 DIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDINTYKILIPATSKVGRIDEAFRLL 443
           D+GTFNSLI+VICNSGEVDFCINI+HEVCKLGLCPDINTYKILIPATSKVGRIDEAFRLL
Sbjct: 301 DLGTFNSLINVICNSGEVDFCINIYHEVCKLGLCPDINTYKILIPATSKVGRIDEAFRLL 360

Query: 444 HCCIEDGHIPFPSLYGPILKGMCKRGQFDDAFCFFSDMKHKGHPPNRPVYTMLITMCGRG 503
           + CIEDGHIPFPSLYGPI+KGMCKRGQFDDAFCFFSDMK KGHPPNRPVYTMLITMCGRG
Sbjct: 361 NYCIEDGHIPFPSLYGPIIKGMCKRGQFDDAFCFFSDMKLKGHPPNRPVYTMLITMCGRG 420

Query: 504 GRFVDAANYLMEMAEFGLPPISRCFDMVTDGLKNCGKHDLAKKIEQLEVSIRG 557
           GRFVDAANYLMEMAEFGLPPISRCFDMVTDGLK+CGKHDLAKKIEQLEVSIRG
Sbjct: 421 GRFVDAANYLMEMAEFGLPPISRCFDMVTDGLKSCGKHDLAKKIEQLEVSIRG 466

BLAST of HG10007887 vs. ExPASy Swiss-Prot
Match: Q94JX6 (Pentatricopeptide repeat-containing protein At5g18390, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g18390 PE=2 SV=2)

HSP 1 Score: 625.9 bits (1613), Expect = 4.3e-178
Identity = 300/436 (68.81%), Postives = 354/436 (81.19%), Query Frame = 0

Query: 122 IRHFSTSNSSGNGSVAPSKDDYFAAIHHISHIVRRDFYMERTLNKLQISYLNSELVFRVL 181
           IRHF++     +    P+K DYFAAI+H+ +IVRR+ + ER+LN L++  + SE VFRVL
Sbjct: 26  IRHFNSLEPLQSSDSTPTKGDYFAAINHVVNIVRREIHPERSLNSLRLP-VTSEFVFRVL 85

Query: 182 RACSNSGTESFRFFNWACTHNPSYQPTTLEFEELVKTLARTKKYTTMWKVLLQMKTQNLK 241
           RA S S  +S RFFNWA   NPSY PT++E+EEL K+LA  KKY +MWK+L QMK  +L 
Sbjct: 86  RATSRSSNDSLRFFNWA-RSNPSYTPTSMEYEELAKSLASHKKYESMWKILKQMKDLSLD 145

Query: 242 ISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFHGAYA 301
           IS ET+ FII++YGK G VD AV +FN   K++ C QTV+VYN+LL ALC+VKMFHGAYA
Sbjct: 146 ISGETLCFIIEQYGKNGHVDQAVELFNGVPKTLGCQQTVDVYNSLLHALCDVKMFHGAYA 205

Query: 302 LIRRMIRKGVTPDKKTYGTLVTGWCSAGKMREAQEFLEEMSQKGFNPPLRGRDLLVEGLL 361
           LIRRMIRKG+ PDK+TY  LV GWCSAGKM+EAQEFL+EMS++GFNPP RGRDLL+EGLL
Sbjct: 206 LIRRMIRKGLKPDKRTYAILVNGWCSAGKMKEAQEFLDEMSRRGFNPPARGRDLLIEGLL 265

Query: 362 NAGYLESAKVMVRKMIKEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDIN 421
           NAGYLESAK MV KM K G VPDI TFN LI+ I  SGEV+FCI +++  CKLGLC DI+
Sbjct: 266 NAGYLESAKEMVSKMTKGGFVPDIQTFNILIEAISKSGEVEFCIEMYYTACKLGLCVDID 325

Query: 422 TYKILIPATSKVGRIDEAFRLLHCCIEDGHIPFPSLYGPILKGMCKRGQFDDAFCFFSDM 481
           TYK LIPA SK+G+IDEAFRLL+ C+EDGH PFPSLY PI+KGMC+ G FDDAF FFSDM
Sbjct: 326 TYKTLIPAVSKIGKIDEAFRLLNNCVEDGHKPFPSLYAPIIKGMCRNGMFDDAFSFFSDM 385

Query: 482 KHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAEFGLPPISRCFDMVTDGLKNCGKH 541
           K K HPPNRPVYTMLITMCGRGG+FVDAANYL+EM E GL PISRCFDMVTDGLKN GKH
Sbjct: 386 KVKAHPPNRPVYTMLITMCGRGGKFVDAANYLVEMTEMGLVPISRCFDMVTDGLKNGGKH 445

Query: 542 DLAKKIEQLEVSIRGI 558
           DLA +IEQLEV +RG+
Sbjct: 446 DLAMRIEQLEVQLRGV 459

BLAST of HG10007887 vs. ExPASy Swiss-Prot
Match: Q9FH87 (Putative pentatricopeptide repeat-containing protein At5g65820 OS=Arabidopsis thaliana OX=3702 GN=At5g65820 PE=3 SV=1)

HSP 1 Score: 164.5 bits (415), Expect = 3.5e-39
Identity = 122/462 (26.41%), Postives = 214/462 (46.32%), Query Frame = 0

Query: 92  HFGGGKSTRFGLIYDFVENAKKYEPKY--------ICLIRHFSTSNSSGNGSVAPSKDDY 151
           H+   KS RF LI+     ++  E  +        +CL         S N     SK D 
Sbjct: 27  HYSQFKS-RFDLIHRSFHVSRALEDNFRRSNGIGLVCL-------EKSHNDRTKNSKYDE 86

Query: 152 FAAIHHISHIVRRDFY-----MERTLNKLQISYLNSELVFRVLRACSNSGTESFRFFNWA 211
           FA+    S+ + R F+     +E  LN+  +  L   L+ RVL  C ++G   +RFF WA
Sbjct: 87  FASDVEKSYRILRKFHSRVPKLELALNESGVE-LRPGLIERVLNRCGDAGNLGYRFFVWA 146

Query: 212 CTHNPSYQPTTLEFEELVKTLARTKKYTTMWKVLLQMKTQNLK-ISPETISFIIQEYGKQ 271
               P Y  +   ++ +VK L++ +++  +W ++ +M+ +N + I PE    ++Q +   
Sbjct: 147 -AKQPRYCHSIEVYKSMVKILSKMRQFGAVWGLIEEMRKENPQLIEPELFVVLVQRFASA 206

Query: 272 GLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFHGAYALIRRMIRKGVTPDKKT 331
            +V  A+ + ++  K    P    V+  LL ALC+      A  L   M R     + + 
Sbjct: 207 DMVKKAIEVLDEMPKFGFEPDEY-VFGCLLDALCKHGSVKDAAKLFEDM-RMRFPVNLRY 266

Query: 332 YGTLVTGWCSAGKMREAQEFLEEMSQKGFNPPLRGRDLLVEGLLNAGYLESAKVMVRKMI 391
           + +L+ GWC  GKM EA+  L +M++ GF P +     L+ G  NAG +  A  ++R M 
Sbjct: 267 FTSLLYGWCRVGKMMEAKYVLVQMNEAGFEPDIVDYTNLLSGYANAGKMADAYDLLRDMR 326

Query: 392 KEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDINTYKILIPATSKVGRID 451
           + G  P+   +  LI  +C    ++  + +F E+ +     D+ TY  L+    K G+ID
Sbjct: 327 RRGFEPNANCYTVLIQALCKVDRMEEAMKVFVEMERYECEADVVTYTALVSGFCKWGKID 386

Query: 452 EAFRLLHCCIEDGHIPFPSLYGPILKGMCKRGQFDDAFCFFSDMKHKGHPPNRPVYTMLI 511
           + + +L   I+ G +P    Y  I+    K+  F++       M+   + P+  +Y ++I
Sbjct: 387 KCYIVLDDMIKKGLMPSELTYMHIMVAHEKKESFEECLELMEKMRQIEYHPDIGIYNVVI 446

Query: 512 TMCGRGGRFVDAANYLMEMAEFGLPPISRCFDMVTDGLKNCG 540
            +  + G   +A     EM E GL P    F ++ +GL + G
Sbjct: 447 RLACKLGEVKEAVRLWNEMEENGLSPGVDTFVIMINGLASQG 476

BLAST of HG10007887 vs. ExPASy Swiss-Prot
Match: Q9S7R4 (Pentatricopeptide repeat-containing protein At1g74900, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=OTP43 PE=2 SV=1)

HSP 1 Score: 161.0 bits (406), Expect = 3.9e-38
Identity = 106/413 (25.67%), Postives = 183/413 (44.31%), Query Frame = 0

Query: 176 LVFRVLRACSNSGTESFRFFNWACTHNPSYQPTTLEFEELVKTLARTKKYTTMWKVLLQM 235
           LV  VL+   N G ++ +FF++   H+  Y      F+  +   AR   + T+W ++ +M
Sbjct: 58  LVNSVLKRLWNHGPKALQFFHFLDNHHREYVHDASSFDLAIDIAARLHLHPTVWSLIHRM 117

Query: 236 KTQNLKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKM 295
           ++  +  SP+T + + + Y   G  D AV +F    +   C Q +  +N +L  LC+ K 
Sbjct: 118 RSLRIGPSPKTFAIVAERYASAGKPDKAVKLFLNMHEH-GCFQDLASFNTILDVLCKSKR 177

Query: 296 FHGAYALIR----------------------------------RMIRKGVTPDKKTYGTL 355
              AY L R                                   M+ +G+ P+  TY T+
Sbjct: 178 VEKAYELFRALRGRFSVDTVTYNVILNGWCLIKRTPKALEVLKEMVERGINPNLTTYNTM 237

Query: 356 VTGWCSAGKMREAQEFLEEMSQKGFNPPLRGRDLLVEGLLNAGYLESAKVMVRKMIKEGS 415
           + G+  AG++R A EF  EM ++     +     +V G   AG ++ A+ +  +MI+EG 
Sbjct: 238 LKGFFRAGQIRHAWEFFLEMKKRDCEIDVVTYTTVVHGFGVAGEIKRARNVFDEMIREGV 297

Query: 416 VPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDINTYKILIPATSKVGRIDEAFR 475
           +P + T+N++I V+C    V+  + +F E+ + G  P++ TY +LI      G       
Sbjct: 298 LPSVATYNAMIQVLCKKDNVENAVVMFEEMVRRGYEPNVTTYNVLIRGLFHAGEFSRGEE 357

Query: 476 LLHCCIEDGHIPFPSLYGPILKGMCKRGQFDDAFCFFSDMKHKGHPPNRPVYTMLIT--- 535
           L+     +G  P    Y  +++   +  + + A   F  M      PN   Y +LI+   
Sbjct: 358 LMQRMENEGCEPNFQTYNMMIRYYSECSEVEKALGLFEKMGSGDCLPNLDTYNILISGMF 417

Query: 536 MCGRGGRFVDAANYLMEMAEFGLPPISRCFDMVTDGLKNCGKHDLAKKIEQLE 552
           +  R    V A   L+EM E G  P    F+ V +GL   G    AK+I +L+
Sbjct: 418 VRKRSEDMVVAGKLLLEMVERGFIPRKFTFNRVLNGLLLTGNQAFAKEILRLQ 469

BLAST of HG10007887 vs. ExPASy Swiss-Prot
Match: Q9LQ14 (Pentatricopeptide repeat-containing protein At1g62930, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At1g62930 PE=2 SV=2)

HSP 1 Score: 154.8 bits (390), Expect = 2.8e-36
Identity = 94/338 (27.81%), Postives = 164/338 (48.52%), Query Frame = 0

Query: 202 NPSYQPTTLEFEELVKTLARTKKYTTMWKVLLQMKTQNLKISPETISFIIQEYGKQGLVD 261
           N   +P  + +  L++ L    +++   ++L  M  + +  +  T S +I  + K+G + 
Sbjct: 283 NKGIRPNVVTYNSLIRCLCNYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLV 342

Query: 262 NAVTIFNQCSK-SIDCPQTVEVYNALLFALCEVKMFHGAYALIRRMIRKGVTPDKKTYGT 321
            A  ++++  K SID    +  Y++L+   C       A  +   MI K   P+  TY T
Sbjct: 343 EAEKLYDEMIKRSID--PDIFTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYNT 402

Query: 322 LVTGWCSAGKMREAQEFLEEMSQKGFNPPLRGRDLLVEGLLNAGYLESAKVMVRKMIKEG 381
           L+ G+C A ++ E  E   EMSQ+G        + L++GL  AG  + A+ + +KM+ +G
Sbjct: 403 LIKGFCKAKRVEEGMELFREMSQRGLVGNTVTYNTLIQGLFQAGDCDMAQKIFKKMVSDG 462

Query: 382 SVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDINTYKILIPATSKVGRIDEAF 441
             PDI T++ L+D +C  G+++  + +F  + K  + PDI TY I+I    K G++++ +
Sbjct: 463 VPPDIITYSILLDGLCKYGKLEKALVVFEYLQKSKMEPDIYTYNIMIEGMCKAGKVEDGW 522

Query: 442 RLLHCCIEDGHIPFPSLYGPILKGMCKRGQFDDAFCFFSDMKHKGHPPNRPVYTMLITMC 501
            L       G  P   +Y  ++ G C++G  ++A   F +MK  G  PN   Y  LI   
Sbjct: 523 DLFCSLSLKGVKPNVIIYTTMISGFCRKGLKEEADALFREMKEDGTLPNSGTYNTLIRAR 582

Query: 502 GRGGRFVDAANYLMEMAEFGL----PPISRCFDMVTDG 535
            R G    +A  + EM   G       IS   +M+ DG
Sbjct: 583 LRDGDKAASAELIKEMRSCGFVGDASTISMVINMLHDG 618

BLAST of HG10007887 vs. ExPASy Swiss-Prot
Match: Q9SR00 (Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At3g04760 PE=2 SV=1)

HSP 1 Score: 153.3 bits (386), Expect = 8.1e-36
Identity = 88/318 (27.67%), Postives = 149/318 (46.86%), Query Frame = 0

Query: 206 QPTTLEFEELVKTLARTKKYTTMWKVLLQMKTQNLKISPETISFIIQEYGKQGLVDNAVT 265
           QP    +  L+    +  +     +VL +M++++      T + +I     +G +D A+ 
Sbjct: 155 QPDVFAYNALINGFCKMNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALK 214

Query: 266 IFNQCSKSIDCPQTVEVYNALLFALCEVKMFHGAYALIRRMIRKGVTPDKKTYGTLVTGW 325
           + NQ   S +C  TV  Y  L+ A         A  L+  M+ +G+ PD  TY T++ G 
Sbjct: 215 VLNQL-LSDNCQPTVITYTILIEATMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGM 274

Query: 326 CSAGKMREAQEFLEEMSQKGFNPPLRGRDLLVEGLLNAGYLESAKVMVRKMIKEGSVPDI 385
           C  G +  A E +  +  KG  P +   ++L+  LLN G  E  + ++ KM  E   P++
Sbjct: 275 CKEGMVDRAFEMVRNLELKGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNV 334

Query: 386 GTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDINTYKILIPATSKVGRIDEAFRLLHC 445
            T++ LI  +C  G+++  +N+   + + GL PD  +Y  LI A  + GR+D A   L  
Sbjct: 335 VTYSILITTLCRDGKIEEAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLET 394

Query: 446 CIEDGHIPFPSLYGPILKGMCKRGQFDDAFCFFSDMKHKGHPPNRPVYTMLITMCGRGGR 505
            I DG +P    Y  +L  +CK G+ D A   F  +   G  PN   Y  + +     G 
Sbjct: 395 MISDGCLPDIVNYNTVLATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGD 454

Query: 506 FVDAANYLMEMAEFGLPP 524
            + A + ++EM   G+ P
Sbjct: 455 KIRALHMILEMMSNGIDP 471

BLAST of HG10007887 vs. ExPASy TrEMBL
Match: A0A0A0M105 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G695420 PE=4 SV=1)

HSP 1 Score: 882.9 bits (2280), Expect = 7.1e-253
Identity = 424/441 (96.15%), Postives = 430/441 (97.51%), Query Frame = 0

Query: 117 KYICLIRHFSTSNSSGNGSVAPSKDDYFAAIHHISHIVRRDFYMERTLNKLQISYLNSEL 176
           +YI L RHFS SNS  NGS APSKDDYFAAIHHISHIVRRDFYMERTLNKL+IS LNSEL
Sbjct: 15  RYIFLNRHFSNSNSLVNGSTAPSKDDYFAAIHHISHIVRRDFYMERTLNKLRISNLNSEL 74

Query: 177 VFRVLRACSNSGTESFRFFNWACTHNPSYQPTTLEFEELVKTLARTKKYTTMWKVLLQMK 236
           VFRVLRACSNSGTESFRFFNWAC+HNPSYQPTTLEFEELVKTLART+KYTTMWKVLLQMK
Sbjct: 75  VFRVLRACSNSGTESFRFFNWACSHNPSYQPTTLEFEELVKTLARTRKYTTMWKVLLQMK 134

Query: 237 TQNLKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMF 296
           TQNLKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMF
Sbjct: 135 TQNLKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMF 194

Query: 297 HGAYALIRRMIRKGVTPDKKTYGTLVTGWCSAGKMREAQEFLEEMSQKGFNPPLRGRDLL 356
           HGAYALIRRMIRKGVTPDKKTYGTLVTGWCSAGKM+EAQEFLEEMSQKGFNPPLRGRDLL
Sbjct: 195 HGAYALIRRMIRKGVTPDKKTYGTLVTGWCSAGKMKEAQEFLEEMSQKGFNPPLRGRDLL 254

Query: 357 VEGLLNAGYLESAKVMVRKMIKEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGL 416
           VEGLLNAGYLESAK MVRKM KEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGL
Sbjct: 255 VEGLLNAGYLESAKDMVRKMTKEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGL 314

Query: 417 CPDINTYKILIPATSKVGRIDEAFRLLHCCIEDGHIPFPSLYGPILKGMCKRGQFDDAFC 476
           CPDINTYKILIPATSKVGRIDEAFRLLHCCIEDGH+PFPSLYGPILKGMCKRGQFDDAFC
Sbjct: 315 CPDINTYKILIPATSKVGRIDEAFRLLHCCIEDGHVPFPSLYGPILKGMCKRGQFDDAFC 374

Query: 477 FFSDMKHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAEFGLPPISRCFDMVTDGLK 536
           FF DMKHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAE GLPPISRCFDMVTDGLK
Sbjct: 375 FFGDMKHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAELGLPPISRCFDMVTDGLK 434

Query: 537 NCGKHDLAKKIEQLEVSIRGI 558
           NCGKHDLAKKIEQLEVSIRGI
Sbjct: 435 NCGKHDLAKKIEQLEVSIRGI 455

BLAST of HG10007887 vs. ExPASy TrEMBL
Match: A0A5A7V4T7 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold3607G00300 PE=4 SV=1)

HSP 1 Score: 872.5 bits (2253), Expect = 9.7e-250
Identity = 418/440 (95.00%), Postives = 428/440 (97.27%), Query Frame = 0

Query: 118 YICLIRHFSTSNSSGNGSVAPSKDDYFAAIHHISHIVRRDFYMERTLNKLQISYLNSELV 177
           YI LIR FS SNSS NGS APSKDDYFAAIHHISHIVRRDFYMERTLNKL+ISYLNSELV
Sbjct: 16  YISLIRCFSNSNSSVNGSTAPSKDDYFAAIHHISHIVRRDFYMERTLNKLRISYLNSELV 75

Query: 178 FRVLRACSNSGTESFRFFNWACTHNPSYQPTTLEFEELVKTLARTKKYTTMWKVLLQMKT 237
           FRVLRACSN GTESFRFFNWAC+HNPSYQPTTLE EELVKTLART+KYTTMWKVLLQMKT
Sbjct: 76  FRVLRACSNCGTESFRFFNWACSHNPSYQPTTLELEELVKTLARTRKYTTMWKVLLQMKT 135

Query: 238 QNLKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFH 297
           QNLKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFH
Sbjct: 136 QNLKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFH 195

Query: 298 GAYALIRRMIRKGVTPDKKTYGTLVTGWCSAGKMREAQEFLEEMSQKGFNPPLRGRDLLV 357
           GAYALIRRMI+KGVTPDKKTYGTLVTGWCSAGKM+EAQEFLEEMSQKGFNPPLRGRDLLV
Sbjct: 196 GAYALIRRMIKKGVTPDKKTYGTLVTGWCSAGKMKEAQEFLEEMSQKGFNPPLRGRDLLV 255

Query: 358 EGLLNAGYLESAKVMVRKMIKEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLC 417
           EGLLNAGYLESAK MVRKM KEG VPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLC
Sbjct: 256 EGLLNAGYLESAKDMVRKMTKEGCVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLC 315

Query: 418 PDINTYKILIPATSKVGRIDEAFRLLHCCIEDGHIPFPSLYGPILKGMCKRGQFDDAFCF 477
           PDINTYKILIPATSKVGRIDEAFRLL+CCIEDGH+PFPSLYGPI+KGMCKRGQFDDAFCF
Sbjct: 316 PDINTYKILIPATSKVGRIDEAFRLLNCCIEDGHVPFPSLYGPIIKGMCKRGQFDDAFCF 375

Query: 478 FSDMKHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAEFGLPPISRCFDMVTDGLKN 537
           F DMKHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAE GLPPISRCFDMVTDGLK+
Sbjct: 376 FGDMKHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAELGLPPISRCFDMVTDGLKS 435

Query: 538 CGKHDLAKKIEQLEVSIRGI 558
           CGKHDLAKKIE+LEVSIRGI
Sbjct: 436 CGKHDLAKKIEKLEVSIRGI 455

BLAST of HG10007887 vs. ExPASy TrEMBL
Match: A0A1S3CJ74 (pentatricopeptide repeat-containing protein At5g18390, mitochondrial OS=Cucumis melo OX=3656 GN=LOC103501022 PE=4 SV=1)

HSP 1 Score: 872.5 bits (2253), Expect = 9.7e-250
Identity = 418/440 (95.00%), Postives = 428/440 (97.27%), Query Frame = 0

Query: 118 YICLIRHFSTSNSSGNGSVAPSKDDYFAAIHHISHIVRRDFYMERTLNKLQISYLNSELV 177
           YI LIR FS SNSS NGS APSKDDYFAAIHHISHIVRRDFYMERTLNKL+ISYLNSELV
Sbjct: 16  YISLIRCFSNSNSSVNGSTAPSKDDYFAAIHHISHIVRRDFYMERTLNKLRISYLNSELV 75

Query: 178 FRVLRACSNSGTESFRFFNWACTHNPSYQPTTLEFEELVKTLARTKKYTTMWKVLLQMKT 237
           FRVLRACSN GTESFRFFNWAC+HNPSYQPTTLE EELVKTLART+KYTTMWKVLLQMKT
Sbjct: 76  FRVLRACSNCGTESFRFFNWACSHNPSYQPTTLELEELVKTLARTRKYTTMWKVLLQMKT 135

Query: 238 QNLKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFH 297
           QNLKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFH
Sbjct: 136 QNLKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFH 195

Query: 298 GAYALIRRMIRKGVTPDKKTYGTLVTGWCSAGKMREAQEFLEEMSQKGFNPPLRGRDLLV 357
           GAYALIRRMI+KGVTPDKKTYGTLVTGWCSAGKM+EAQEFLEEMSQKGFNPPLRGRDLLV
Sbjct: 196 GAYALIRRMIKKGVTPDKKTYGTLVTGWCSAGKMKEAQEFLEEMSQKGFNPPLRGRDLLV 255

Query: 358 EGLLNAGYLESAKVMVRKMIKEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLC 417
           EGLLNAGYLESAK MVRKM KEG VPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLC
Sbjct: 256 EGLLNAGYLESAKDMVRKMTKEGCVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLC 315

Query: 418 PDINTYKILIPATSKVGRIDEAFRLLHCCIEDGHIPFPSLYGPILKGMCKRGQFDDAFCF 477
           PDINTYKILIPATSKVGRIDEAFRLL+CCIEDGH+PFPSLYGPI+KGMCKRGQFDDAFCF
Sbjct: 316 PDINTYKILIPATSKVGRIDEAFRLLNCCIEDGHVPFPSLYGPIIKGMCKRGQFDDAFCF 375

Query: 478 FSDMKHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAEFGLPPISRCFDMVTDGLKN 537
           F DMKHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAE GLPPISRCFDMVTDGLK+
Sbjct: 376 FGDMKHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAELGLPPISRCFDMVTDGLKS 435

Query: 538 CGKHDLAKKIEQLEVSIRGI 558
           CGKHDLAKKIE+LEVSIRGI
Sbjct: 436 CGKHDLAKKIEKLEVSIRGI 455

BLAST of HG10007887 vs. ExPASy TrEMBL
Match: A0A6J1JC50 (pentatricopeptide repeat-containing protein At5g18390, mitochondrial OS=Cucurbita maxima OX=3661 GN=LOC111485446 PE=4 SV=1)

HSP 1 Score: 859.0 bits (2218), Expect = 1.1e-245
Identity = 418/473 (88.37%), Postives = 438/473 (92.60%), Query Frame = 0

Query: 84  MFVFKLRTHFGGGKSTRFGLIYDFVENAKKYEPKYICLIRHFSTSNSSGNGSVAPSKDDY 143
           MF FK+ T       T+F +I              I LIRHFSTSNSS NGS  PSKDDY
Sbjct: 1   MFHFKIFT-------TQFSIIPKLHRITDAIRFHNIFLIRHFSTSNSSSNGSGTPSKDDY 60

Query: 144 FAAIHHISHIVRRDFYMERTLNKLQISYLNSELVFRVLRACSNSGTESFRFFNWACTHNP 203
           FAAIHHIS+IVRRD YMERTLNKL+ISYLNSELVFRVLRACSNSGTESFRFFNWAC++NP
Sbjct: 61  FAAIHHISNIVRRDIYMERTLNKLRISYLNSELVFRVLRACSNSGTESFRFFNWACSNNP 120

Query: 204 SYQPTTLEFEELVKTLARTKKYTTMWKVLLQMKTQNLKISPETISFIIQEYGKQGLVDNA 263
           SYQPTTLEFEELVKTLARTKKYTTMWKVL QMKTQNLKISPETISF+I+EYGKQGLVD A
Sbjct: 121 SYQPTTLEFEELVKTLARTKKYTTMWKVLHQMKTQNLKISPETISFVIEEYGKQGLVDGA 180

Query: 264 VTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFHGAYALIRRMIRKGVTPDKKTYGTLVT 323
           VTIFNQCSKSIDCPQTVEVYNALLFALCE+KMFHGAYALIRRMIRKGVTPDKKTYG LVT
Sbjct: 181 VTIFNQCSKSIDCPQTVEVYNALLFALCEIKMFHGAYALIRRMIRKGVTPDKKTYGILVT 240

Query: 324 GWCSAGKMREAQEFLEEMSQKGFNPPLRGRDLLVEGLLNAGYLESAKVMVRKMIKEGSVP 383
           GWCS+GKM+EAQ+FLEEMSQKGFNPPLRGRDLLVEGLLNAGYLESAK MVRKM KEGSVP
Sbjct: 241 GWCSSGKMKEAQQFLEEMSQKGFNPPLRGRDLLVEGLLNAGYLESAKNMVRKMTKEGSVP 300

Query: 384 DIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDINTYKILIPATSKVGRIDEAFRLL 443
           D+GTFNSLI+VIC+SGEVDFCINI+HEVCKLGLCPDINTYKILIPATSKVGRIDEAFRLL
Sbjct: 301 DLGTFNSLINVICDSGEVDFCINIYHEVCKLGLCPDINTYKILIPATSKVGRIDEAFRLL 360

Query: 444 HCCIEDGHIPFPSLYGPILKGMCKRGQFDDAFCFFSDMKHKGHPPNRPVYTMLITMCGRG 503
           + CIEDGHIPFPSLYGPI+KGMCKRGQFDDAFCFFSDMKHKGHPPNRPVYTMLITMCGRG
Sbjct: 361 NYCIEDGHIPFPSLYGPIIKGMCKRGQFDDAFCFFSDMKHKGHPPNRPVYTMLITMCGRG 420

Query: 504 GRFVDAANYLMEMAEFGLPPISRCFDMVTDGLKNCGKHDLAKKIEQLEVSIRG 557
           GRFVDAANYLMEMAEFGLPPISRCFDMVTDGLK+CGKHDLAKKIEQLEVSIRG
Sbjct: 421 GRFVDAANYLMEMAEFGLPPISRCFDMVTDGLKSCGKHDLAKKIEQLEVSIRG 466

BLAST of HG10007887 vs. ExPASy TrEMBL
Match: A0A6J1H8G8 (pentatricopeptide repeat-containing protein At5g18390, mitochondrial OS=Cucurbita moschata OX=3662 GN=LOC111461504 PE=4 SV=1)

HSP 1 Score: 857.1 bits (2213), Expect = 4.2e-245
Identity = 411/438 (93.84%), Postives = 427/438 (97.49%), Query Frame = 0

Query: 119 ICLIRHFSTSNSSGNGSVAPSKDDYFAAIHHISHIVRRDFYMERTLNKLQISYLNSELVF 178
           I LIRHFSTSNSS NGS  PSKDDYFAAIHHIS+IVRRD YMERTLNKL+ISYLNSELVF
Sbjct: 29  IFLIRHFSTSNSSSNGSGTPSKDDYFAAIHHISNIVRRDIYMERTLNKLRISYLNSELVF 88

Query: 179 RVLRACSNSGTESFRFFNWACTHNPSYQPTTLEFEELVKTLARTKKYTTMWKVLLQMKTQ 238
           RVLRACSNSGTESFRFFNWAC++NPSYQPTTLEFEELVKTLARTKKYTTMWKVL QMKTQ
Sbjct: 89  RVLRACSNSGTESFRFFNWACSNNPSYQPTTLEFEELVKTLARTKKYTTMWKVLHQMKTQ 148

Query: 239 NLKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFHG 298
           NLKISPETISF+I+EYGKQGLVD AVTIFNQCSKSIDCPQTVEVYNALLFALCE+KMFHG
Sbjct: 149 NLKISPETISFVIEEYGKQGLVDGAVTIFNQCSKSIDCPQTVEVYNALLFALCEIKMFHG 208

Query: 299 AYALIRRMIRKGVTPDKKTYGTLVTGWCSAGKMREAQEFLEEMSQKGFNPPLRGRDLLVE 358
           AYALIRRMIRKGVTPDKKTYG LVTGWCS+GKM+EAQ+FLEEMSQKGFNPPLRGRDLLVE
Sbjct: 209 AYALIRRMIRKGVTPDKKTYGILVTGWCSSGKMKEAQQFLEEMSQKGFNPPLRGRDLLVE 268

Query: 359 GLLNAGYLESAKVMVRKMIKEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCP 418
           GLLNAGYLESAK MVRKMIKEGSVPD+GTFNSLI+VICNSGEVDFCINI+HEVCKLGLCP
Sbjct: 269 GLLNAGYLESAKNMVRKMIKEGSVPDLGTFNSLINVICNSGEVDFCINIYHEVCKLGLCP 328

Query: 419 DINTYKILIPATSKVGRIDEAFRLLHCCIEDGHIPFPSLYGPILKGMCKRGQFDDAFCFF 478
           DINTYKILIPATSKVGRIDEAFRLL+ CIEDGHIPFPSLYGPI+KGMCKRGQFDDAFCFF
Sbjct: 329 DINTYKILIPATSKVGRIDEAFRLLNYCIEDGHIPFPSLYGPIIKGMCKRGQFDDAFCFF 388

Query: 479 SDMKHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAEFGLPPISRCFDMVTDGLKNC 538
           SDMK KGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAEFGLPPISRCFDMVTDGLK+C
Sbjct: 389 SDMKLKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAEFGLPPISRCFDMVTDGLKSC 448

Query: 539 GKHDLAKKIEQLEVSIRG 557
           GKHDLAKKIEQLEVSIRG
Sbjct: 449 GKHDLAKKIEQLEVSIRG 466

BLAST of HG10007887 vs. TAIR 10
Match: AT5G18390.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 625.9 bits (1613), Expect = 3.0e-179
Identity = 300/436 (68.81%), Postives = 354/436 (81.19%), Query Frame = 0

Query: 122 IRHFSTSNSSGNGSVAPSKDDYFAAIHHISHIVRRDFYMERTLNKLQISYLNSELVFRVL 181
           IRHF++     +    P+K DYFAAI+H+ +IVRR+ + ER+LN L++  + SE VFRVL
Sbjct: 26  IRHFNSLEPLQSSDSTPTKGDYFAAINHVVNIVRREIHPERSLNSLRLP-VTSEFVFRVL 85

Query: 182 RACSNSGTESFRFFNWACTHNPSYQPTTLEFEELVKTLARTKKYTTMWKVLLQMKTQNLK 241
           RA S S  +S RFFNWA   NPSY PT++E+EEL K+LA  KKY +MWK+L QMK  +L 
Sbjct: 86  RATSRSSNDSLRFFNWA-RSNPSYTPTSMEYEELAKSLASHKKYESMWKILKQMKDLSLD 145

Query: 242 ISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFHGAYA 301
           IS ET+ FII++YGK G VD AV +FN   K++ C QTV+VYN+LL ALC+VKMFHGAYA
Sbjct: 146 ISGETLCFIIEQYGKNGHVDQAVELFNGVPKTLGCQQTVDVYNSLLHALCDVKMFHGAYA 205

Query: 302 LIRRMIRKGVTPDKKTYGTLVTGWCSAGKMREAQEFLEEMSQKGFNPPLRGRDLLVEGLL 361
           LIRRMIRKG+ PDK+TY  LV GWCSAGKM+EAQEFL+EMS++GFNPP RGRDLL+EGLL
Sbjct: 206 LIRRMIRKGLKPDKRTYAILVNGWCSAGKMKEAQEFLDEMSRRGFNPPARGRDLLIEGLL 265

Query: 362 NAGYLESAKVMVRKMIKEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDIN 421
           NAGYLESAK MV KM K G VPDI TFN LI+ I  SGEV+FCI +++  CKLGLC DI+
Sbjct: 266 NAGYLESAKEMVSKMTKGGFVPDIQTFNILIEAISKSGEVEFCIEMYYTACKLGLCVDID 325

Query: 422 TYKILIPATSKVGRIDEAFRLLHCCIEDGHIPFPSLYGPILKGMCKRGQFDDAFCFFSDM 481
           TYK LIPA SK+G+IDEAFRLL+ C+EDGH PFPSLY PI+KGMC+ G FDDAF FFSDM
Sbjct: 326 TYKTLIPAVSKIGKIDEAFRLLNNCVEDGHKPFPSLYAPIIKGMCRNGMFDDAFSFFSDM 385

Query: 482 KHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAEFGLPPISRCFDMVTDGLKNCGKH 541
           K K HPPNRPVYTMLITMCGRGG+FVDAANYL+EM E GL PISRCFDMVTDGLKN GKH
Sbjct: 386 KVKAHPPNRPVYTMLITMCGRGGKFVDAANYLVEMTEMGLVPISRCFDMVTDGLKNGGKH 445

Query: 542 DLAKKIEQLEVSIRGI 558
           DLA +IEQLEV +RG+
Sbjct: 446 DLAMRIEQLEVQLRGV 459

BLAST of HG10007887 vs. TAIR 10
Match: AT5G65820.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 164.5 bits (415), Expect = 2.5e-40
Identity = 122/462 (26.41%), Postives = 214/462 (46.32%), Query Frame = 0

Query: 92  HFGGGKSTRFGLIYDFVENAKKYEPKY--------ICLIRHFSTSNSSGNGSVAPSKDDY 151
           H+   KS RF LI+     ++  E  +        +CL         S N     SK D 
Sbjct: 27  HYSQFKS-RFDLIHRSFHVSRALEDNFRRSNGIGLVCL-------EKSHNDRTKNSKYDE 86

Query: 152 FAAIHHISHIVRRDFY-----MERTLNKLQISYLNSELVFRVLRACSNSGTESFRFFNWA 211
           FA+    S+ + R F+     +E  LN+  +  L   L+ RVL  C ++G   +RFF WA
Sbjct: 87  FASDVEKSYRILRKFHSRVPKLELALNESGVE-LRPGLIERVLNRCGDAGNLGYRFFVWA 146

Query: 212 CTHNPSYQPTTLEFEELVKTLARTKKYTTMWKVLLQMKTQNLK-ISPETISFIIQEYGKQ 271
               P Y  +   ++ +VK L++ +++  +W ++ +M+ +N + I PE    ++Q +   
Sbjct: 147 -AKQPRYCHSIEVYKSMVKILSKMRQFGAVWGLIEEMRKENPQLIEPELFVVLVQRFASA 206

Query: 272 GLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFHGAYALIRRMIRKGVTPDKKT 331
            +V  A+ + ++  K    P    V+  LL ALC+      A  L   M R     + + 
Sbjct: 207 DMVKKAIEVLDEMPKFGFEPDEY-VFGCLLDALCKHGSVKDAAKLFEDM-RMRFPVNLRY 266

Query: 332 YGTLVTGWCSAGKMREAQEFLEEMSQKGFNPPLRGRDLLVEGLLNAGYLESAKVMVRKMI 391
           + +L+ GWC  GKM EA+  L +M++ GF P +     L+ G  NAG +  A  ++R M 
Sbjct: 267 FTSLLYGWCRVGKMMEAKYVLVQMNEAGFEPDIVDYTNLLSGYANAGKMADAYDLLRDMR 326

Query: 392 KEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDINTYKILIPATSKVGRID 451
           + G  P+   +  LI  +C    ++  + +F E+ +     D+ TY  L+    K G+ID
Sbjct: 327 RRGFEPNANCYTVLIQALCKVDRMEEAMKVFVEMERYECEADVVTYTALVSGFCKWGKID 386

Query: 452 EAFRLLHCCIEDGHIPFPSLYGPILKGMCKRGQFDDAFCFFSDMKHKGHPPNRPVYTMLI 511
           + + +L   I+ G +P    Y  I+    K+  F++       M+   + P+  +Y ++I
Sbjct: 387 KCYIVLDDMIKKGLMPSELTYMHIMVAHEKKESFEECLELMEKMRQIEYHPDIGIYNVVI 446

Query: 512 TMCGRGGRFVDAANYLMEMAEFGLPPISRCFDMVTDGLKNCG 540
            +  + G   +A     EM E GL P    F ++ +GL + G
Sbjct: 447 RLACKLGEVKEAVRLWNEMEENGLSPGVDTFVIMINGLASQG 476

BLAST of HG10007887 vs. TAIR 10
Match: AT1G62930.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 154.8 bits (390), Expect = 2.0e-37
Identity = 94/338 (27.81%), Postives = 164/338 (48.52%), Query Frame = 0

Query: 202 NPSYQPTTLEFEELVKTLARTKKYTTMWKVLLQMKTQNLKISPETISFIIQEYGKQGLVD 261
           N   +P  + +  L++ L    +++   ++L  M  + +  +  T S +I  + K+G + 
Sbjct: 283 NKGIRPNVVTYNSLIRCLCNYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLV 342

Query: 262 NAVTIFNQCSK-SIDCPQTVEVYNALLFALCEVKMFHGAYALIRRMIRKGVTPDKKTYGT 321
            A  ++++  K SID    +  Y++L+   C       A  +   MI K   P+  TY T
Sbjct: 343 EAEKLYDEMIKRSID--PDIFTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYNT 402

Query: 322 LVTGWCSAGKMREAQEFLEEMSQKGFNPPLRGRDLLVEGLLNAGYLESAKVMVRKMIKEG 381
           L+ G+C A ++ E  E   EMSQ+G        + L++GL  AG  + A+ + +KM+ +G
Sbjct: 403 LIKGFCKAKRVEEGMELFREMSQRGLVGNTVTYNTLIQGLFQAGDCDMAQKIFKKMVSDG 462

Query: 382 SVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDINTYKILIPATSKVGRIDEAF 441
             PDI T++ L+D +C  G+++  + +F  + K  + PDI TY I+I    K G++++ +
Sbjct: 463 VPPDIITYSILLDGLCKYGKLEKALVVFEYLQKSKMEPDIYTYNIMIEGMCKAGKVEDGW 522

Query: 442 RLLHCCIEDGHIPFPSLYGPILKGMCKRGQFDDAFCFFSDMKHKGHPPNRPVYTMLITMC 501
            L       G  P   +Y  ++ G C++G  ++A   F +MK  G  PN   Y  LI   
Sbjct: 523 DLFCSLSLKGVKPNVIIYTTMISGFCRKGLKEEADALFREMKEDGTLPNSGTYNTLIRAR 582

Query: 502 GRGGRFVDAANYLMEMAEFGL----PPISRCFDMVTDG 535
            R G    +A  + EM   G       IS   +M+ DG
Sbjct: 583 LRDGDKAASAELIKEMRSCGFVGDASTISMVINMLHDG 618

BLAST of HG10007887 vs. TAIR 10
Match: AT3G04760.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 153.3 bits (386), Expect = 5.8e-37
Identity = 88/318 (27.67%), Postives = 149/318 (46.86%), Query Frame = 0

Query: 206 QPTTLEFEELVKTLARTKKYTTMWKVLLQMKTQNLKISPETISFIIQEYGKQGLVDNAVT 265
           QP    +  L+    +  +     +VL +M++++      T + +I     +G +D A+ 
Sbjct: 155 QPDVFAYNALINGFCKMNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALK 214

Query: 266 IFNQCSKSIDCPQTVEVYNALLFALCEVKMFHGAYALIRRMIRKGVTPDKKTYGTLVTGW 325
           + NQ   S +C  TV  Y  L+ A         A  L+  M+ +G+ PD  TY T++ G 
Sbjct: 215 VLNQL-LSDNCQPTVITYTILIEATMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGM 274

Query: 326 CSAGKMREAQEFLEEMSQKGFNPPLRGRDLLVEGLLNAGYLESAKVMVRKMIKEGSVPDI 385
           C  G +  A E +  +  KG  P +   ++L+  LLN G  E  + ++ KM  E   P++
Sbjct: 275 CKEGMVDRAFEMVRNLELKGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNV 334

Query: 386 GTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDINTYKILIPATSKVGRIDEAFRLLHC 445
            T++ LI  +C  G+++  +N+   + + GL PD  +Y  LI A  + GR+D A   L  
Sbjct: 335 VTYSILITTLCRDGKIEEAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLET 394

Query: 446 CIEDGHIPFPSLYGPILKGMCKRGQFDDAFCFFSDMKHKGHPPNRPVYTMLITMCGRGGR 505
            I DG +P    Y  +L  +CK G+ D A   F  +   G  PN   Y  + +     G 
Sbjct: 395 MISDGCLPDIVNYNTVLATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGD 454

Query: 506 FVDAANYLMEMAEFGLPP 524
            + A + ++EM   G+ P
Sbjct: 455 KIRALHMILEMMSNGIDP 471

BLAST of HG10007887 vs. TAIR 10
Match: AT5G39710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 151.4 bits (381), Expect = 2.2e-36
Identity = 95/346 (27.46%), Postives = 163/346 (47.11%), Query Frame = 0

Query: 207 PTTLEFEELVKTLARTKKYTTMWKVLLQMKTQNLKISPETISFIIQ-----EYGKQGLVD 266
           P  + +  L+    + +K    +K+L  M  + L+  P  IS+ +        G+   V 
Sbjct: 238 PNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLE--PNLISYNVVINGLCREGRMKEVS 297

Query: 267 NAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFHGAYALIRRMIRKGVTPDKKTYGTL 326
             +T  N+   S+D       YN L+   C+   FH A  +   M+R G+TP   TY +L
Sbjct: 298 FVLTEMNRRGYSLD----EVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSL 357

Query: 327 VTGWCSAGKMREAQEFLEEMSQKGFNPPLRGRDLLVEGLLNAGYLESAKVMVRKMIKEGS 386
           +   C AG M  A EFL++M  +G  P  R    LV+G    GY+  A  ++R+M   G 
Sbjct: 358 IHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGF 417

Query: 387 VPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDINTYKILIPATSKVGRIDEAFR 446
            P + T+N+LI+  C +G+++  I +  ++ + GL PD+ +Y  ++    +   +DEA R
Sbjct: 418 SPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALR 477

Query: 447 LLHCCIEDGHIPFPSLYGPILKGMCKRGQFDDAFCFFSDMKHKGHPPNRPVYTMLITMCG 506
           +    +E G  P    Y  +++G C++ +  +A   + +M   G PP+   YT LI    
Sbjct: 478 VKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYC 537

Query: 507 RGGRFVDAANYLMEMAEFGLPPISRCFDMVTDGLKNCGKHDLAKKI 548
             G    A     EM E G+ P    + ++ +GL    +   AK++
Sbjct: 538 MEGDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAKRL 577

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038879743.13.4e-25787.75pentatricopeptide repeat-containing protein At5g18390, mitochondrial [Benincasa ... [more]
XP_004142520.11.5e-25296.15pentatricopeptide repeat-containing protein At5g18390, mitochondrial [Cucumis sa... [more]
XP_008462724.12.0e-24995.00PREDICTED: pentatricopeptide repeat-containing protein At5g18390, mitochondrial ... [more]
XP_022988097.12.3e-24588.37pentatricopeptide repeat-containing protein At5g18390, mitochondrial [Cucurbita ... [more]
KAG6590295.15.1e-24588.58Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a... [more]
Match NameE-valueIdentityDescription
Q94JX64.3e-17868.81Pentatricopeptide repeat-containing protein At5g18390, mitochondrial OS=Arabidop... [more]
Q9FH873.5e-3926.41Putative pentatricopeptide repeat-containing protein At5g65820 OS=Arabidopsis th... [more]
Q9S7R43.9e-3825.67Pentatricopeptide repeat-containing protein At1g74900, mitochondrial OS=Arabidop... [more]
Q9LQ142.8e-3627.81Pentatricopeptide repeat-containing protein At1g62930, chloroplastic OS=Arabidop... [more]
Q9SR008.1e-3627.67Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0M1057.1e-25396.15Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G695420 PE=4 SV=1[more]
A0A5A7V4T79.7e-25095.00Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3CJ749.7e-25095.00pentatricopeptide repeat-containing protein At5g18390, mitochondrial OS=Cucumis ... [more]
A0A6J1JC501.1e-24588.37pentatricopeptide repeat-containing protein At5g18390, mitochondrial OS=Cucurbit... [more]
A0A6J1H8G84.2e-24593.84pentatricopeptide repeat-containing protein At5g18390, mitochondrial OS=Cucurbit... [more]
Match NameE-valueIdentityDescription
AT5G18390.13.0e-17968.81Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G65820.12.5e-4026.41Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G62930.12.0e-3727.81Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G04760.15.8e-3727.67Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT5G39710.12.2e-3627.46Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 540..557
NoneNo IPR availableGENE3D3.30.70.3370coord: 50..137
e-value: 5.6E-25
score: 89.4
NoneNo IPR availablePANTHERPTHR45613:SF30PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN MITOCHONDRIALcoord: 118..556
NoneNo IPR availablePANTHERPTHR45613PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 118..556
IPR001976Ribosomal protein S24ePFAMPF01282Ribosomal_S24ecoord: 53..120
e-value: 9.4E-26
score: 89.6
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 431..552
e-value: 2.5E-19
score: 71.3
coord: 354..430
e-value: 5.6E-15
score: 57.1
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 177..353
e-value: 3.3E-28
score: 101.0
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 458..489
e-value: 3.2E-7
score: 28.1
coord: 492..523
e-value: 4.5E-4
score: 18.2
coord: 317..348
e-value: 1.6E-4
score: 19.6
coord: 282..314
e-value: 4.6E-6
score: 24.5
coord: 387..420
e-value: 9.1E-5
score: 20.4
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 309..342
e-value: 2.5E-10
score: 39.9
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 460..500
e-value: 1.5E-9
score: 37.9
coord: 383..429
e-value: 5.5E-8
score: 32.9
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 314..348
score: 12.693243
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 489..523
score: 10.073492
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 419..453
score: 9.54735
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 454..488
score: 10.588674
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 279..313
score: 10.840783
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 384..418
score: 10.161182
IPR012678Ribosomal protein L23/L15e core domain superfamilySUPERFAMILY54189Ribosomal proteins S24e, L23 and L15ecoord: 52..120

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10007887.1HG10007887.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006412 translation
cellular_component GO:0005840 ribosome
molecular_function GO:0005515 protein binding
molecular_function GO:0003735 structural constituent of ribosome