HG10018885 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10018885
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pentatricopeptide repeat-containing protein
Location: Chr04: 10410131 .. 10411966 (-)
RNA-Seq Expression: HG10018885
Synteny: HG10018885
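
The Location field can be turned into a feature length with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, that the whole feature is a single coding exon, and that the final codon is a stop codon. The variable names are hypothetical.

```python
# Coordinates copied from the Location field (assumed 1-based, inclusive).
start, end = 10410131, 10411966

# Length of the feature in base pairs.
gene_length = end - start + 1

# Assuming a single coding exon whose last codon is a stop codon,
# the encoded protein is one residue shorter than the codon count.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, codons, protein_length)
```

Under these assumptions the feature spans 1836 bp, which corresponds to 612 codons and a 611-residue protein, matching the alignment length of 611 reported in the Homology section.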
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCCTCTTTCTCTTCCCAAGCCCTAAAAAAACCAGCTTTTTTCCACCCCAAATCCAAGCAACGCCCAGATTCAACTCGCCTCTGTATCGTTCAGTCCCTTCTCAACCTCACTTCTCAAGGTCGCCTCCCGGAAGCTCTCTCCTACCTCGACCCATTGGCCCAAAGAGGCATACGCTTACCCACTAGCACTTTCGTCAACCTCTTGCGACTCTGTGCCAAAGCCAAGTTCTTCAAAGGAGGTAAATGCGTTCATCTACATTTGAAACACACGGGGTTCAAACGCCCTACCACTATTATAGCCAACCATTTGATTGGTATGTACTTTGAATGTGGCAATGACATAGAGGCACGTAAGGTGTTTGATAAAATGTCTGTAAGGAATTTGTACTCTTGGAACCATATGCTTGCTGGGTATGCTAAGTTGGGAAATGTACATCATGCTAGGAAGTTGTTTGAGAGAATGACGGAGAAGGATGTTGTTTCGTGGAATACTATGGTTCTTGCTTATGCTAAGAAGGGGTGTTTTAGTGAAGCTATTGGGTTATATAGAGACTTCAGGAGACTCGATATGGGGTTTAATGAGTTTAGTTTTGCTGGTGTTTTGATTCTTTGTGTGAAGCTTAAAGAATTGCAGCTCGCAAAGCAGGTTCACGGGCAGGTATTGGTTGTTGGATTCTTGTCTAATGTAGTGCTTTCTAGTTCAATTGTTGATGCATACGCAAAATGTGGAGAGATGGCATGTGCAAGGAGATTGTTTGACGAAATGCTCGTGAAAGATATCCTTGCATGGACCACAATGGTCTCTGGATATGCTAAATGGGGTGATATGAATTTAGCTAGCGAATTGTTTCACCAAATGCCTGAAAAGAATCCTGTCTCCTGGACAGCTCTGATATCCGGCTATACAAGAAATAGTTTGGGGCATGAAGCACTTAATTACTTTACAAAAATGATGAAGTTTCGAATTAATCCTGACCAATATACATTCAGTAGTTGTCTCTGCGCTTGTGCCAGCATTGCTGCACTAAAGCATGGTAAACAAGTACATGCCTATTTGGTCAGAACCAACTTCAGATGCAACACAATAGTTGTCAGCTCTCTCATTGACATGTATTCAAAGTGTGGCATGTTAGAAGCTAGCTGTCGCGTTTTTTACCTTATGGGAAATAAGCAGGATGTTGTATTGTGGAATACAATGATATCTGCCCTAGCTCAGCATGGTCATGGGGAAGAGGCAATGCAGATGTTCAATGACATGGTTGAATCAGGATTGAAGCCTGATAGGATCACCTTTATTGTGATCCTTAGTGCATGTAGTCATTCAGGTCTTGTGCAAGAAGGACTTCGGTTTTTCAAGACCATGACCTATGATCACGGTGTTCTCCCTGATCAAGAACATTATGCATGCTTAATTGACCTCTTGGGTCGAGCTGGATGTTTTATCGAGTTGGTAAACGAGCTAGAGAAGATGTCTTGTAAACCTGATGATCAGGTATGGAATGCATTGCTCGGAGTCTGTAGGATACATGGTAATATAGAGCTTGGAAGAAAGGTGGCGGAGCATGTAATCGAGCTGGAGCCTCAATCTTCTGCAGCTTATGTGTCTCTTGCAAGTTTGTATGCTTTTCTTGGGAAATGGGAGTCAGTAGAAAAGGTCAGGGAACTAATGGAAGAGAGATTTGTGAGGAAAGAACGTGCAATTAGTTGGATTGACATTGGAAATAAGGTACATTCTTTCATTGCATCTGATAGATTACATCCATTGAAGGAAGAAATATACTCGCTACTGGAGCAGTTAGCCAGCCACACAGAAGAAGATTTTTCAATCATTTGA

mRNA sequence

ATGCCCTCTTTCTCTTCCCAAGCCCTAAAAAAACCAGCTTTTTTCCACCCCAAATCCAAGCAACGCCCAGATTCAACTCGCCTCTGTATCGTTCAGTCCCTTCTCAACCTCACTTCTCAAGGTCGCCTCCCGGAAGCTCTCTCCTACCTCGACCCATTGGCCCAAAGAGGCATACGCTTACCCACTAGCACTTTCGTCAACCTCTTGCGACTCTGTGCCAAAGCCAAGTTCTTCAAAGGAGGTAAATGCGTTCATCTACATTTGAAACACACGGGGTTCAAACGCCCTACCACTATTATAGCCAACCATTTGATTGGTATGTACTTTGAATGTGGCAATGACATAGAGGCACGTAAGGTGTTTGATAAAATGTCTGTAAGGAATTTGTACTCTTGGAACCATATGCTTGCTGGGTATGCTAAGTTGGGAAATGTACATCATGCTAGGAAGTTGTTTGAGAGAATGACGGAGAAGGATGTTGTTTCGTGGAATACTATGGTTCTTGCTTATGCTAAGAAGGGGTGTTTTAGTGAAGCTATTGGGTTATATAGAGACTTCAGGAGACTCGATATGGGGTTTAATGAGTTTAGTTTTGCTGGTGTTTTGATTCTTTGTGTGAAGCTTAAAGAATTGCAGCTCGCAAAGCAGGTTCACGGGCAGGTATTGGTTGTTGGATTCTTGTCTAATGTAGTGCTTTCTAGTTCAATTGTTGATGCATACGCAAAATGTGGAGAGATGGCATGTGCAAGGAGATTGTTTGACGAAATGCTCGTGAAAGATATCCTTGCATGGACCACAATGGTCTCTGGATATGCTAAATGGGGTGATATGAATTTAGCTAGCGAATTGTTTCACCAAATGCCTGAAAAGAATCCTGTCTCCTGGACAGCTCTGATATCCGGCTATACAAGAAATAGTTTGGGGCATGAAGCACTTAATTACTTTACAAAAATGATGAAGTTTCGAATTAATCCTGACCAATATACATTCAGTAGTTGTCTCTGCGCTTGTGCCAGCATTGCTGCACTAAAGCATGGTAAACAAGTACATGCCTATTTGGTCAGAACCAACTTCAGATGCAACACAATAGTTGTCAGCTCTCTCATTGACATGTATTCAAAGTGTGGCATGTTAGAAGCTAGCTGTCGCGTTTTTTACCTTATGGGAAATAAGCAGGATGTTGTATTGTGGAATACAATGATATCTGCCCTAGCTCAGCATGGTCATGGGGAAGAGGCAATGCAGATGTTCAATGACATGGTTGAATCAGGATTGAAGCCTGATAGGATCACCTTTATTGTGATCCTTAGTGCATGTAGTCATTCAGGTCTTGTGCAAGAAGGACTTCGGTTTTTCAAGACCATGACCTATGATCACGGTGTTCTCCCTGATCAAGAACATTATGCATGCTTAATTGACCTCTTGGGTCGAGCTGGATGTTTTATCGAGTTGGTAAACGAGCTAGAGAAGATGTCTTGTAAACCTGATGATCAGGTATGGAATGCATTGCTCGGAGTCTGTAGGATACATGGTAATATAGAGCTTGGAAGAAAGGTGGCGGAGCATGTAATCGAGCTGGAGCCTCAATCTTCTGCAGCTTATGTGTCTCTTGCAAGTTTGTATGCTTTTCTTGGGAAATGGGAGTCAGTAGAAAAGGTCAGGGAACTAATGGAAGAGAGATTTGTGAGGAAAGAACGTGCAATTAGTTGGATTGACATTGGAAATAAGGTACATTCTTTCATTGCATCTGATAGATTACATCCATTGAAGGAAGAAATATACTCGCTACTGGAGCAGTTAGCCAGCCACACAGAAGAAGATTTTTCAATCATTTGA

Coding sequence (CDS)

ATGCCCTCTTTCTCTTCCCAAGCCCTAAAAAAACCAGCTTTTTTCCACCCCAAATCCAAGCAACGCCCAGATTCAACTCGCCTCTGTATCGTTCAGTCCCTTCTCAACCTCACTTCTCAAGGTCGCCTCCCGGAAGCTCTCTCCTACCTCGACCCATTGGCCCAAAGAGGCATACGCTTACCCACTAGCACTTTCGTCAACCTCTTGCGACTCTGTGCCAAAGCCAAGTTCTTCAAAGGAGGTAAATGCGTTCATCTACATTTGAAACACACGGGGTTCAAACGCCCTACCACTATTATAGCCAACCATTTGATTGGTATGTACTTTGAATGTGGCAATGACATAGAGGCACGTAAGGTGTTTGATAAAATGTCTGTAAGGAATTTGTACTCTTGGAACCATATGCTTGCTGGGTATGCTAAGTTGGGAAATGTACATCATGCTAGGAAGTTGTTTGAGAGAATGACGGAGAAGGATGTTGTTTCGTGGAATACTATGGTTCTTGCTTATGCTAAGAAGGGGTGTTTTAGTGAAGCTATTGGGTTATATAGAGACTTCAGGAGACTCGATATGGGGTTTAATGAGTTTAGTTTTGCTGGTGTTTTGATTCTTTGTGTGAAGCTTAAAGAATTGCAGCTCGCAAAGCAGGTTCACGGGCAGGTATTGGTTGTTGGATTCTTGTCTAATGTAGTGCTTTCTAGTTCAATTGTTGATGCATACGCAAAATGTGGAGAGATGGCATGTGCAAGGAGATTGTTTGACGAAATGCTCGTGAAAGATATCCTTGCATGGACCACAATGGTCTCTGGATATGCTAAATGGGGTGATATGAATTTAGCTAGCGAATTGTTTCACCAAATGCCTGAAAAGAATCCTGTCTCCTGGACAGCTCTGATATCCGGCTATACAAGAAATAGTTTGGGGCATGAAGCACTTAATTACTTTACAAAAATGATGAAGTTTCGAATTAATCCTGACCAATATACATTCAGTAGTTGTCTCTGCGCTTGTGCCAGCATTGCTGCACTAAAGCATGGTAAACAAGTACATGCCTATTTGGTCAGAACCAACTTCAGATGCAACACAATAGTTGTCAGCTCTCTCATTGACATGTATTCAAAGTGTGGCATGTTAGAAGCTAGCTGTCGCGTTTTTTACCTTATGGGAAATAAGCAGGATGTTGTATTGTGGAATACAATGATATCTGCCCTAGCTCAGCATGGTCATGGGGAAGAGGCAATGCAGATGTTCAATGACATGGTTGAATCAGGATTGAAGCCTGATAGGATCACCTTTATTGTGATCCTTAGTGCATGTAGTCATTCAGGTCTTGTGCAAGAAGGACTTCGGTTTTTCAAGACCATGACCTATGATCACGGTGTTCTCCCTGATCAAGAACATTATGCATGCTTAATTGACCTCTTGGGTCGAGCTGGATGTTTTATCGAGTTGGTAAACGAGCTAGAGAAGATGTCTTGTAAACCTGATGATCAGGTATGGAATGCATTGCTCGGAGTCTGTAGGATACATGGTAATATAGAGCTTGGAAGAAAGGTGGCGGAGCATGTAATCGAGCTGGAGCCTCAATCTTCTGCAGCTTATGTGTCTCTTGCAAGTTTGTATGCTTTTCTTGGGAAATGGGAGTCAGTAGAAAAGGTCAGGGAACTAATGGAAGAGAGATTTGTGAGGAAAGAACGTGCAATTAGTTGGATTGACATTGGAAATAAGGTACATTCTTTCATTGCATCTGATAGATTACATCCATTGAAGGAAGAAATATACTCGCTACTGGAGCAGTTAGCCAGCCACACAGAAGAAGATTTTTCAATCATTTGA

Protein sequence

MPSFSSQALKKPAFFHPKSKQRPDSTRLCIVQSLLNLTSQGRLPEALSYLDPLAQRGIRLPTSTFVNLLRLCAKAKFFKGGKCVHLHLKHTGFKRPTTIIANHLIGMYFECGNDIEARKVFDKMSVRNLYSWNHMLAGYAKLGNVHHARKLFERMTEKDVVSWNTMVLAYAKKGCFSEAIGLYRDFRRLDMGFNEFSFAGVLILCVKLKELQLAKQVHGQVLVVGFLSNVVLSSSIVDAYAKCGEMACARRLFDEMLVKDILAWTTMVSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYTRNSLGHEALNYFTKMMKFRINPDQYTFSSCLCACASIAALKHGKQVHAYLVRTNFRCNTIVVSSLIDMYSKCGMLEASCRVFYLMGNKQDVVLWNTMISALAQHGHGEEAMQMFNDMVESGLKPDRITFIVILSACSHSGLVQEGLRFFKTMTYDHGVLPDQEHYACLIDLLGRAGCFIELVNELEKMSCKPDDQVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERAISWIDIGNKVHSFIASDRLHPLKEEIYSLLEQLASHTEEDFSII
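
The protein sequence above can be checked against the coding sequence by translating it in frame. The following is a minimal sketch, assuming the standard genetic code and reading frame 0. The `translate` helper and the two short excerpts (the first 30 and last 21 nucleotides of the CDS) are chosen here for illustration and are not part of the database record.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a nucleotide string in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# Excerpts copied from the start and end of the CDS listed above.
cds_start = "ATGCCCTCTTTCTCTTCCCAAGCCCTAAAA"
cds_end = "GAAGATTTTTCAATCATTTGA"

print(translate(cds_start))  # MPSFSSQALK
print(translate(cds_end))    # EDFSII*
```

Both translated excerpts agree with the first and last residues of the protein sequence, with the trailing `*` corresponding to the TGA stop codon.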
Homology
BLAST of HG10018885 vs. NCBI nr
Match: XP_038886822.1 (pentatricopeptide repeat-containing protein At2g21090 [Benincasa hispida])

HSP 1 Score: 1195.6 bits (3092), Expect = 0.0e+00
Identity = 581/611 (95.09%), Positives = 594/611 (97.22%), Query Frame = 0

Query: 1   MPSFSSQALKKPAFFHPKSKQRPDSTRLCIVQSLLNLTSQGRLPEALSYLDPLAQRGIRL 60
           MPSFSSQ LKKPA F PKSKQRPDSTRLCIVQSLLNL+SQG LPEALSYLDPLAQRGIRL
Sbjct: 1   MPSFSSQVLKKPALFRPKSKQRPDSTRLCIVQSLLNLSSQGHLPEALSYLDPLAQRGIRL 60

Query: 61  PTSTFVNLLRLCAKAKFFKGGKCVHLHLKHTGFKRPTTIIANHLIGMYFECGNDIEARKV 120
           PTSTFVNLLRLC KAKF KGGKCVHLHLKHTGFKRPTTI+ANHLIGMYFECGNDIEARKV
Sbjct: 61  PTSTFVNLLRLCGKAKFLKGGKCVHLHLKHTGFKRPTTIVANHLIGMYFECGNDIEARKV 120

Query: 121 FDKMSVRNLYSWNHMLAGYAKLGNVHHARKLFERMTEKDVVSWNTMVLAYAKKGCFSEAI 180
           FDKMSVRNLYSWNHMLAGYAKLGNVHHARKLF+RM EKDVVSWNTMVLAYAKKG F+EAI
Sbjct: 121 FDKMSVRNLYSWNHMLAGYAKLGNVHHARKLFDRMMEKDVVSWNTMVLAYAKKGYFNEAI 180

Query: 181 GLYRDFRRLDMGFNEFSFAGVLILCVKLKELQLAKQVHGQVLVVGFLSNVVLSSSIVDAY 240
           GLYRDFRR D+GFNEFSFAGVLILCVKLKELQLAKQVHGQ+LVVGFLSNVVLSSSIVDAY
Sbjct: 181 GLYRDFRRFDIGFNEFSFAGVLILCVKLKELQLAKQVHGQILVVGFLSNVVLSSSIVDAY 240

Query: 241 AKCGEMACARRLFDEMLVKDILAWTTMVSGYAKWGDMNLASELFHQMPEKNPVSWTALIS 300
           AKCGEM CARRLFDEMLVKDILAWTTMVSGYAKWGDMNLASELFHQMPEKNPVSWTALIS
Sbjct: 241 AKCGEMGCARRLFDEMLVKDILAWTTMVSGYAKWGDMNLASELFHQMPEKNPVSWTALIS 300

Query: 301 GYTRNSLGHEALNYFTKMMKFRINPDQYTFSSCLCACASIAALKHGKQVHAYLVRTNFRC 360
           GY RNSLGHEAL+YFTKMMKFRINPDQYTFSSCLCACASIAALKHGKQVHA+L+RTNFRC
Sbjct: 301 GYARNSLGHEALDYFTKMMKFRINPDQYTFSSCLCACASIAALKHGKQVHAHLIRTNFRC 360

Query: 361 NTIVVSSLIDMYSKCGMLEASCRVFYLMGNKQDVVLWNTMISALAQHGHGEEAMQMFNDM 420
           NTIVVSS+IDMYSKCGMLEASC VFYLMGNKQDVVLWNTMISALAQHGHGE+AMQMFNDM
Sbjct: 361 NTIVVSSMIDMYSKCGMLEASCHVFYLMGNKQDVVLWNTMISALAQHGHGEKAMQMFNDM 420

Query: 421 VESGLKPDRITFIVILSACSHSGLVQEGLRFFKTMTYDHGVLPDQEHYACLIDLLGRAGC 480
           VESGLKPDRITFIVILSACSHSGLVQEGLRFFK MTYDHGVLPDQEHYACLIDLLGRAGC
Sbjct: 421 VESGLKPDRITFIVILSACSHSGLVQEGLRFFKAMTYDHGVLPDQEHYACLIDLLGRAGC 480

Query: 481 FIELVNELEKMSCKPDDQVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASL 540
           FIELVNELEKMSCKPDD+VWNALLGVCRIHGNIELGRKVAEHVI LEPQSSAAYVSLASL
Sbjct: 481 FIELVNELEKMSCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIGLEPQSSAAYVSLASL 540

Query: 541 YAFLGKWESVEKVRELMEERFVRKERAISWIDIGNKVHSFIASDRLHPLKEEIYSLLEQL 600
           YA LGKWESVEKVRELMEERFVRKERAISWI IGNKVHSFIASDRLHPLKEEIYS+LEQL
Sbjct: 541 YALLGKWESVEKVRELMEERFVRKERAISWIGIGNKVHSFIASDRLHPLKEEIYSILEQL 600

Query: 601 ASHTEEDFSII 612
           ASHTEED SII
Sbjct: 601 ASHTEEDLSII 611
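
The Identity and Positives percentages in each HSP summary are the reported counts divided by the alignment length. A small sketch using the figures from the hit above (581 identical and 594 positive positions out of 611); the `percent` helper is a hypothetical name introduced for illustration.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100 * matches / aligned_length, 2)


# Counts taken from the HSP summary for XP_038886822.1.
identity = percent(581, 611)
positives = percent(594, 611)

print(identity, positives)  # 95.09 97.22
```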

BLAST of HG10018885 vs. NCBI nr
Match: KAG7011402.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1140.6 bits (2949), Expect = 0.0e+00
Identity = 552/611 (90.34%), Positives = 581/611 (95.09%), Query Frame = 0

Query: 1   MPSFSSQALKKPAFFHPKSKQRPDSTRLCIVQSLLNLTSQGRLPEALSYLDPLAQRGIRL 60
           MPSFSSQALKKPA F PKSKQ PDSTR CIVQSLLN +SQG LPEALSYLDPL QRGIRL
Sbjct: 1   MPSFSSQALKKPAVFRPKSKQLPDSTRPCIVQSLLNHSSQGHLPEALSYLDPLVQRGIRL 60

Query: 61  PTSTFVNLLRLCAKAKFFKGGKCVHLHLKHTGFKRPTTIIANHLIGMYFECGNDIEARKV 120
           PTS FV+LLRLCAKAK  KGGK VHLHLK TGFKRPTTI+ANHLIGMYF+CG+D EARKV
Sbjct: 61  PTSVFVHLLRLCAKAKSLKGGKSVHLHLKLTGFKRPTTIVANHLIGMYFQCGSDTEARKV 120

Query: 121 FDKMSVRNLYSWNHMLAGYAKLGNVHHARKLFERMTEKDVVSWNTMVLAYAKKGCFSEAI 180
           FDKMSVRNLYSWNHMLAGYAKLGNV+ ARKLF+ M EKDV+SWNTMVLAYAKKGCF+EAI
Sbjct: 121 FDKMSVRNLYSWNHMLAGYAKLGNVYQARKLFDTMIEKDVISWNTMVLAYAKKGCFNEAI 180

Query: 181 GLYRDFRRLDMGFNEFSFAGVLILCVKLKELQLAKQVHGQVLVVGFLSNVVLSSSIVDAY 240
           GLYRDFRR DMGFNEFSFAG+LILCVKLKELQLAKQVH QVLVVGFLSN+VLSSSIVDAY
Sbjct: 181 GLYRDFRRQDMGFNEFSFAGLLILCVKLKELQLAKQVHTQVLVVGFLSNIVLSSSIVDAY 240

Query: 241 AKCGEMACARRLFDEMLVKDILAWTTMVSGYAKWGDMNLASELFHQMPEKNPVSWTALIS 300
           AKCGEM CA+RLFDEM VKDILAWTTMVSGYAKWGDMNLAS LFHQMPEKNPVSWTALIS
Sbjct: 241 AKCGEMECAKRLFDEMPVKDILAWTTMVSGYAKWGDMNLASGLFHQMPEKNPVSWTALIS 300

Query: 301 GYTRNSLGHEALNYFTKMMKFRINPDQYTFSSCLCACASIAALKHGKQVHAYLVRTNFRC 360
           GY RNSLGHEAL+YFT+MMKFR+NPDQ+TFSSCLCACASIAALKHGKQVHAYL+RTNFRC
Sbjct: 301 GYARNSLGHEALDYFTQMMKFRVNPDQFTFSSCLCACASIAALKHGKQVHAYLIRTNFRC 360

Query: 361 NTIVVSSLIDMYSKCGMLEASCRVFYLMGNKQDVVLWNTMISALAQHGHGEEAMQMFNDM 420
           NTIVVSSLIDMYSKCGMLEA+C VFYL+GNKQDVVLWNTMISALAQHGHGE+AMQMFNDM
Sbjct: 361 NTIVVSSLIDMYSKCGMLEAACSVFYLLGNKQDVVLWNTMISALAQHGHGEKAMQMFNDM 420

Query: 421 VESGLKPDRITFIVILSACSHSGLVQEGLRFFKTMTYDHGVLPDQEHYACLIDLLGRAGC 480
           VESGL PDRITFIVILSACSHSGLVQEGL+FFK M+YDHG+LPDQEHYACLIDLLGRAGC
Sbjct: 421 VESGLNPDRITFIVILSACSHSGLVQEGLQFFKAMSYDHGILPDQEHYACLIDLLGRAGC 480

Query: 481 FIELVNELEKMSCKPDDQVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASL 540
           F ELVNELEKM CKPDD++WNALLGVCRIHGNIELGRKVAEHVIELEP+SSAAYVSLASL
Sbjct: 481 FTELVNELEKMPCKPDDRIWNALLGVCRIHGNIELGRKVAEHVIELEPRSSAAYVSLASL 540

Query: 541 YAFLGKWESVEKVRELMEERFVRKERAISWIDIGNKVHSFIASDRLHPLKEEIYSLLEQL 600
           YAFLGKWESVE+VRE+MEER VRKERAISWIDI NKVHSFIASDR HPLKEEIYSLLEQL
Sbjct: 541 YAFLGKWESVEQVREVMEERLVRKERAISWIDIENKVHSFIASDRFHPLKEEIYSLLEQL 600

Query: 601 ASHTEEDFSII 612
           ASHTEEDFSI+
Sbjct: 601 ASHTEEDFSIV 611

BLAST of HG10018885 vs. NCBI nr
Match: XP_022967585.1 (pentatricopeptide repeat-containing protein At2g21090-like [Cucurbita maxima])

HSP 1 Score: 1138.3 bits (2943), Expect = 0.0e+00
Identity = 552/611 (90.34%), Positives = 580/611 (94.93%), Query Frame = 0

Query: 1   MPSFSSQALKKPAFFHPKSKQRPDSTRLCIVQSLLNLTSQGRLPEALSYLDPLAQRGIRL 60
           MPSFSSQALKKPA F PKSK  PDS+R CIVQSLLN +SQG LPEALSYLDPL QRGIRL
Sbjct: 1   MPSFSSQALKKPAVFRPKSKHLPDSSRPCIVQSLLNHSSQGHLPEALSYLDPLVQRGIRL 60

Query: 61  PTSTFVNLLRLCAKAKFFKGGKCVHLHLKHTGFKRPTTIIANHLIGMYFECGNDIEARKV 120
           PTS FV+LLRLCAKAK  KGGK VHLHLK TGFKRPTTIIANHLIGMYF+CG+DIEARKV
Sbjct: 61  PTSVFVHLLRLCAKAKSLKGGKSVHLHLKLTGFKRPTTIIANHLIGMYFQCGSDIEARKV 120

Query: 121 FDKMSVRNLYSWNHMLAGYAKLGNVHHARKLFERMTEKDVVSWNTMVLAYAKKGCFSEAI 180
           FDKMSVRNLYSWNHMLAGYAKLGNV+ ARK+F+ M EKDV+SWNTMVLAYAKKGCF+EAI
Sbjct: 121 FDKMSVRNLYSWNHMLAGYAKLGNVYQARKVFDTMIEKDVISWNTMVLAYAKKGCFNEAI 180

Query: 181 GLYRDFRRLDMGFNEFSFAGVLILCVKLKELQLAKQVHGQVLVVGFLSNVVLSSSIVDAY 240
           G YRDFRR DMGFNEFSFAGVLILCVKLKELQLAKQVH QVLVVGFLSN+VLSSSIVDAY
Sbjct: 181 GFYRDFRRQDMGFNEFSFAGVLILCVKLKELQLAKQVHTQVLVVGFLSNIVLSSSIVDAY 240

Query: 241 AKCGEMACARRLFDEMLVKDILAWTTMVSGYAKWGDMNLASELFHQMPEKNPVSWTALIS 300
           AKCGEM CA+RLFDEM VKDILAWTTMVSGYAKWGDMNLAS LFHQMPEKNPVSWTALIS
Sbjct: 241 AKCGEMECAKRLFDEMPVKDILAWTTMVSGYAKWGDMNLASGLFHQMPEKNPVSWTALIS 300

Query: 301 GYTRNSLGHEALNYFTKMMKFRINPDQYTFSSCLCACASIAALKHGKQVHAYLVRTNFRC 360
           GY RNSLGHEAL+YFT+MMKFR+NPDQ+TFSSCLCACASIAALKHGKQVHAYL+RTNFRC
Sbjct: 301 GYARNSLGHEALDYFTQMMKFRVNPDQFTFSSCLCACASIAALKHGKQVHAYLIRTNFRC 360

Query: 361 NTIVVSSLIDMYSKCGMLEASCRVFYLMGNKQDVVLWNTMISALAQHGHGEEAMQMFNDM 420
           NTIVVSSLIDMYSKCGMLEA+CRVFYL+GNKQDVVLWNTMISALAQHGHGE+AMQMFNDM
Sbjct: 361 NTIVVSSLIDMYSKCGMLEAACRVFYLLGNKQDVVLWNTMISALAQHGHGEKAMQMFNDM 420

Query: 421 VESGLKPDRITFIVILSACSHSGLVQEGLRFFKTMTYDHGVLPDQEHYACLIDLLGRAGC 480
           VESGLKPDRITFIVILSACSHSGLV EGL+FFK M+YDH VLPDQEHYACLIDLLGRAGC
Sbjct: 421 VESGLKPDRITFIVILSACSHSGLVHEGLQFFKAMSYDHSVLPDQEHYACLIDLLGRAGC 480

Query: 481 FIELVNELEKMSCKPDDQVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASL 540
           F ELVNELEKM CKPDD++WNALLGVCRIHGNIELGRKVAEHVIELEP+SSAAYVSLASL
Sbjct: 481 FTELVNELEKMPCKPDDRIWNALLGVCRIHGNIELGRKVAEHVIELEPRSSAAYVSLASL 540

Query: 541 YAFLGKWESVEKVRELMEERFVRKERAISWIDIGNKVHSFIASDRLHPLKEEIYSLLEQL 600
           YAFLGKWESVE+VRE+MEER VRKERAISWIDI NKVHSFIASDR HPLKEEIYSLLEQL
Sbjct: 541 YAFLGKWESVEQVREVMEERLVRKERAISWIDIENKVHSFIASDRFHPLKEEIYSLLEQL 600

Query: 601 ASHTEEDFSII 612
           ASHTEEDFSI+
Sbjct: 601 ASHTEEDFSIV 611

BLAST of HG10018885 vs. NCBI nr
Match: XP_022963954.1 (pentatricopeptide repeat-containing protein At2g21090 [Cucurbita moschata])

HSP 1 Score: 1137.5 bits (2941), Expect = 0.0e+00
Identity = 550/611 (90.02%), Positives = 580/611 (94.93%), Query Frame = 0

Query: 1   MPSFSSQALKKPAFFHPKSKQRPDSTRLCIVQSLLNLTSQGRLPEALSYLDPLAQRGIRL 60
           MPSFSSQALKKPA F PKSKQ PDSTR CIVQSLLN +SQG LPEALS+LDPL QRGIRL
Sbjct: 1   MPSFSSQALKKPAVFRPKSKQLPDSTRPCIVQSLLNHSSQGNLPEALSFLDPLVQRGIRL 60

Query: 61  PTSTFVNLLRLCAKAKFFKGGKCVHLHLKHTGFKRPTTIIANHLIGMYFECGNDIEARKV 120
           PTS FV+LLRLCAKAK  KGGK VHLHLK TGFKRPTTI+ANHLIGMYF+CG+D EARKV
Sbjct: 61  PTSVFVHLLRLCAKAKSLKGGKSVHLHLKLTGFKRPTTIVANHLIGMYFQCGSDTEARKV 120

Query: 121 FDKMSVRNLYSWNHMLAGYAKLGNVHHARKLFERMTEKDVVSWNTMVLAYAKKGCFSEAI 180
           FDKMSVRNLYSWNHMLAGYAKLGNV+ ARKLF+ M EKDV+SWNTMVLAYAKKGCF+EAI
Sbjct: 121 FDKMSVRNLYSWNHMLAGYAKLGNVYQARKLFDTMIEKDVISWNTMVLAYAKKGCFNEAI 180

Query: 181 GLYRDFRRLDMGFNEFSFAGVLILCVKLKELQLAKQVHGQVLVVGFLSNVVLSSSIVDAY 240
           GLYRDFRR DMGFNEFSFAG+LILCVKLKELQLAKQVH QVLVVGFLSN+VLSSSIVDAY
Sbjct: 181 GLYRDFRRQDMGFNEFSFAGLLILCVKLKELQLAKQVHTQVLVVGFLSNIVLSSSIVDAY 240

Query: 241 AKCGEMACARRLFDEMLVKDILAWTTMVSGYAKWGDMNLASELFHQMPEKNPVSWTALIS 300
           AKCGEM CA+RLFDEM VKDILAWTTMVSGYAKWGDMNLAS LFHQMPEKNPVSWTALIS
Sbjct: 241 AKCGEMECAKRLFDEMPVKDILAWTTMVSGYAKWGDMNLASGLFHQMPEKNPVSWTALIS 300

Query: 301 GYTRNSLGHEALNYFTKMMKFRINPDQYTFSSCLCACASIAALKHGKQVHAYLVRTNFRC 360
           GY RNSLGHEAL+YFT+MMKFR+NPDQ+TFSSCLCACASIAALKHGKQVHAYL+RTNFRC
Sbjct: 301 GYARNSLGHEALDYFTQMMKFRVNPDQFTFSSCLCACASIAALKHGKQVHAYLIRTNFRC 360

Query: 361 NTIVVSSLIDMYSKCGMLEASCRVFYLMGNKQDVVLWNTMISALAQHGHGEEAMQMFNDM 420
           NTIVVSSLIDMYSKCGMLEA+C VFYL+GNKQDVVLWNTMISALAQHGHGE+AMQMFNDM
Sbjct: 361 NTIVVSSLIDMYSKCGMLEAACSVFYLLGNKQDVVLWNTMISALAQHGHGEKAMQMFNDM 420

Query: 421 VESGLKPDRITFIVILSACSHSGLVQEGLRFFKTMTYDHGVLPDQEHYACLIDLLGRAGC 480
           VESGL PDRITFIVILSACSHSGLVQEGL+FFK M+YDHG+LPDQEHYACLIDLLGRAGC
Sbjct: 421 VESGLNPDRITFIVILSACSHSGLVQEGLQFFKAMSYDHGILPDQEHYACLIDLLGRAGC 480

Query: 481 FIELVNELEKMSCKPDDQVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASL 540
           F ELVNELEKM CKPDD++WN LLGVCRIHGNIELGRKVAEHVIELEP+SSAAYVSLASL
Sbjct: 481 FTELVNELEKMPCKPDDRIWNTLLGVCRIHGNIELGRKVAEHVIELEPRSSAAYVSLASL 540

Query: 541 YAFLGKWESVEKVRELMEERFVRKERAISWIDIGNKVHSFIASDRLHPLKEEIYSLLEQL 600
           YAFLGKWESVE+VRE+MEER VRKERAISWIDI NKVHSFIASDR HPLKEEIYSLLEQL
Sbjct: 541 YAFLGKWESVEQVREVMEERLVRKERAISWIDIENKVHSFIASDRFHPLKEEIYSLLEQL 600

Query: 601 ASHTEEDFSII 612
           ASHTEEDFSI+
Sbjct: 601 ASHTEEDFSIV 611

BLAST of HG10018885 vs. NCBI nr
Match: XP_008455202.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g21090 [Cucumis melo] >KAA0031479.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK06932.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1137.1 bits (2940), Expect = 0.0e+00
Identity = 547/611 (89.53%), Positives = 578/611 (94.60%), Query Frame = 0

Query: 1   MPSFSSQALKKPAFFHPKSKQRPDSTRLCIVQSLLNLTSQGRLPEALSYLDPLAQRGIRL 60
           MPSFSSQALKKPA F PK +Q PDSTR+CI QSLL+L+SQGRLPEALSYLD LAQRGIRL
Sbjct: 1   MPSFSSQALKKPASFRPKCEQSPDSTRICIAQSLLDLSSQGRLPEALSYLDRLAQRGIRL 60

Query: 61  PTSTFVNLLRLCAKAKFFKGGKCVHLHLKHTGFKRPTTIIANHLIGMYFECGNDIEARKV 120
           PTSTFV+LLRLCAKAK+FKGGKCVHLHLKHTGFKRPTTI+ANHLIGMYFECG D+EARKV
Sbjct: 61  PTSTFVDLLRLCAKAKYFKGGKCVHLHLKHTGFKRPTTIVANHLIGMYFECGRDVEARKV 120

Query: 121 FDKMSVRNLYSWNHMLAGYAKLGNVHHARKLFERMTEKDVVSWNTMVLAYAKKGCFSEAI 180
           FDKMSVRNLYSWNHMLAGYAKLG VHHARKLF+RM EKDVVSWNTMVLAYAKKGCF+EAI
Sbjct: 121 FDKMSVRNLYSWNHMLAGYAKLGEVHHARKLFDRMMEKDVVSWNTMVLAYAKKGCFNEAI 180

Query: 181 GLYRDFRRLDMGFNEFSFAGVLILCVKLKELQLAKQVHGQVLVVGFLSNVVLSSSIVDAY 240
           GLYRDFRRLDMGFN FSF+GVLILCVKLKELQL KQVHGQVLV GFLSN+VLS SIVDAY
Sbjct: 181 GLYRDFRRLDMGFNAFSFSGVLILCVKLKELQLTKQVHGQVLVAGFLSNLVLSCSIVDAY 240

Query: 241 AKCGEMACARRLFDEMLVKDILAWTTMVSGYAKWGDMNLASELFHQMPEKNPVSWTALIS 300
           AKCG+M CARRLFDEMLVKDI  WTTMVSGYAKWGDMN ASELFHQMPEKNPVSWTALIS
Sbjct: 241 AKCGKMGCARRLFDEMLVKDIHIWTTMVSGYAKWGDMNSASELFHQMPEKNPVSWTALIS 300

Query: 301 GYTRNSLGHEALNYFTKMMKFRINPDQYTFSSCLCACASIAALKHGKQVHAYLVRTNFRC 360
           GY RNSLGHEAL+YFTKMMK  INP+QYTFSSCLCACASIAALKHGKQVH YL+RTNFRC
Sbjct: 301 GYARNSLGHEALDYFTKMMKLGINPEQYTFSSCLCACASIAALKHGKQVHGYLIRTNFRC 360

Query: 361 NTIVVSSLIDMYSKCGMLEASCRVFYLMGNKQDVVLWNTMISALAQHGHGEEAMQMFNDM 420
           NTIVVSSLIDMYSKCGMLEASC VF+LMGNKQDVV+WNTMIS LAQ+GHGE+AMQMFN M
Sbjct: 361 NTIVVSSLIDMYSKCGMLEASCYVFHLMGNKQDVVVWNTMISGLAQNGHGEKAMQMFNHM 420

Query: 421 VESGLKPDRITFIVILSACSHSGLVQEGLRFFKTMTYDHGVLPDQEHYACLIDLLGRAGC 480
           VESG+KPDRITFIVILSACSHSGLVQEGL+FFK MTYDHGVLPDQEHYACLIDLLGRAGC
Sbjct: 421 VESGVKPDRITFIVILSACSHSGLVQEGLQFFKAMTYDHGVLPDQEHYACLIDLLGRAGC 480

Query: 481 FIELVNELEKMSCKPDDQVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASL 540
           FIELVNELE MSCKPDD+VW+ALLGVCRIH NIELGRKVAEHVIEL+PQSSAAYVSLA L
Sbjct: 481 FIELVNELENMSCKPDDRVWSALLGVCRIHNNIELGRKVAEHVIELKPQSSAAYVSLAGL 540

Query: 541 YAFLGKWESVEKVRELMEERFVRKERAISWIDIGNKVHSFIASDRLHPLKEEIYSLLEQL 600
           YAFLGKWESVEKVRELM+E+F+RKERAISWID+GNK+HSFIASDRLHPLKEEIY LLEQL
Sbjct: 541 YAFLGKWESVEKVRELMDEKFIRKERAISWIDVGNKIHSFIASDRLHPLKEEIYVLLEQL 600

Query: 601 ASHTEEDFSII 612
           A HTEE+   I
Sbjct: 601 ARHTEEELLTI 611

BLAST of HG10018885 vs. ExPASy Swiss-Prot
Match: Q9SKQ4 (Pentatricopeptide repeat-containing protein At2g21090 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E48 PE=2 SV=1)

HSP 1 Score: 727.6 bits (1877), Expect = 1.1e-208
Identity = 347/590 (58.81%), Positives = 450/590 (76.27%), Query Frame = 0

Query: 23  PDSTRLCIVQSLLNL-TSQGRLPEALSYLDPLAQRGIRLPTSTFVNLLRLCAKAKFFKGG 82
           P    +C+ QS L+   ++  L +A+S L+ L Q+GIRLP     +LL+ C   K  K G
Sbjct: 6   PRKRPICVAQSFLSKHATKAELSQAVSRLESLTQQGIRLPFDLLASLLQQCGDTKSLKQG 65

Query: 83  KCVHLHLKHTGFKRPTTIIANHLIGMYFECGNDIEARKVFDKMSVRNLYSWNHMLAGYAK 142
           K +H HLK TGFKRP T+++NHLIGMY +CG  I+A KVFD+M +RNLYSWN+M++GY K
Sbjct: 66  KWIHRHLKITGFKRPNTLLSNHLIGMYMKCGKPIDACKVFDQMHLRNLYSWNNMVSGYVK 125

Query: 143 LGNVHHARKLFERMTEKDVVSWNTMVLAYAKKGCFSEAIGLYRDFRRLDMGFNEFSFAGV 202
            G +  AR +F+ M E+DVVSWNTMV+ YA+ G   EA+  Y++FRR  + FNEFSFAG+
Sbjct: 126 SGMLVRARVVFDSMPERDVVSWNTMVIGYAQDGNLHEALWFYKEFRRSGIKFNEFSFAGL 185

Query: 203 LILCVKLKELQLAKQVHGQVLVVGFLSNVVLSSSIVDAYAKCGEMACARRLFDEMLVKDI 262
           L  CVK ++LQL +Q HGQVLV GFLSNVVLS SI+DAYAKCG+M  A+R FDEM VKDI
Sbjct: 186 LTACVKSRQLQLNRQAHGQVLVAGFLSNVVLSCSIIDAYAKCGQMESAKRCFDEMTVKDI 245

Query: 263 LAWTTMVSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYTRNSLGHEALNYFTKMMKF 322
             WTT++SGYAK GDM  A +LF +MPEKNPVSWTALI+GY R   G+ AL+ F KM+  
Sbjct: 246 HIWTTLISGYAKLGDMEAAEKLFCEMPEKNPVSWTALIAGYVRQGSGNRALDLFRKMIAL 305

Query: 323 RINPDQYTFSSCLCACASIAALKHGKQVHAYLVRTNFRCNTIVVSSLIDMYSKCGMLEAS 382
            + P+Q+TFSSCLCA ASIA+L+HGK++H Y++RTN R N IV+SSLIDMYSK G LEAS
Sbjct: 306 GVKPEQFTFSSCLCASASIASLRHGKEIHGYMIRTNVRPNAIVISSLIDMYSKSGSLEAS 365

Query: 383 CRVFYLMGNKQDVVLWNTMISALAQHGHGEEAMQMFNDMVESGLKPDRITFIVILSACSH 442
            RVF +  +K D V WNTMISALAQHG G +A++M +DM++  ++P+R T +VIL+ACSH
Sbjct: 366 ERVFRICDDKHDCVFWNTMISALAQHGLGHKALRMLDDMIKFRVQPNRTTLVVILNACSH 425

Query: 443 SGLVQEGLRFFKTMTYDHGVLPDQEHYACLIDLLGRAGCFIELVNELEKMSCKPDDQVWN 502
           SGLV+EGLR+F++MT  HG++PDQEHYACLIDLLGRAGCF EL+ ++E+M  +PD  +WN
Sbjct: 426 SGLVEEGLRWFESMTVQHGIVPDQEHYACLIDLLGRAGCFKELMRKIEEMPFEPDKHIWN 485

Query: 503 ALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERF 562
           A+LGVCRIHGN ELG+K A+ +I+L+P+SSA Y+ L+S+YA  GKWE VEK+R +M++R 
Sbjct: 486 AILGVCRIHGNEELGKKAADELIKLDPESSAPYILLSSIYADHGKWELVEKLRGVMKKRR 545

Query: 563 VRKERAISWIDIGNKVHSFIASD--RLHPLKEEIYSLLEQLASHTEEDFS 610
           V KE+A+SWI+I  KV +F  SD    H  KEEIY +L  LA+  EE+ S
Sbjct: 546 VNKEKAVSWIEIEKKVEAFTVSDGSHAHARKEEIYFILHNLAAVIEEEAS 595

BLAST of HG10018885 vs. ExPASy Swiss-Prot
Match: Q9SIT7 (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 404.4 bits (1038), Expect = 2.2e-111
Identity = 232/662 (35.05%), Positives = 338/662 (51.06%), Query Frame = 0

Query: 48  SYLDPLAQRGIRLPTSTFVNLLRLCAKAKFFK-GGKCVHLHLKHTGFKRPTTIIANHLIG 107
           S+L   A       +S F  LL  C K+K      + VH  +  +GF      I N LI 
Sbjct: 5   SFLKLAADLSSFTDSSPFAKLLDSCIKSKLSAIYVRYVHASVIKSGFSN-EIFIQNRLID 64

Query: 108 MYFECGNDIEARKVFDKMSVRNLYSWNHMLAGYAKLGNVHHARKLFERMTEKDVVSWNTM 167
            Y +CG+  + R+VFDKM  RN+Y+WN ++ G  KLG +  A  LF  M E+D  +WN+M
Sbjct: 65  AYSKCGSLEDGRQVFDKMPQRNIYTWNSVVTGLTKLGFLDEADSLFRSMPERDQCTWNSM 124

Query: 168 VLAYAKKGCFSEAIGLYRDFRRLDMGFNEFSFAGVLILCVKLKELQLAKQVHGQVLVVGF 227
           V  +A+     EA+  +    +     NE+SFA VL  C  L ++    QVH  +    F
Sbjct: 125 VSGFAQHDRCEEALCYFAMMHKEGFVLNEYSFASVLSACSGLNDMNKGVQVHSLIAKSPF 184

Query: 228 LSNVVLSSSIVDAYAKCGEMACARRLFDEMLVKDILAW---------------------- 287
           LS+V + S++VD Y+KCG +  A+R+FDEM  +++++W                      
Sbjct: 185 LSDVYIGSALVDMYSKCGNVNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQM 244

Query: 288 ------------------------------------------------------------ 347
                                                                       
Sbjct: 245 MLESRVEPDEVTLASVISACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKCS 304

Query: 348 --------------------TTMVSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYTR 407
                               T+M+SGYA       A  +F +M E+N VSW ALI+GYT+
Sbjct: 305 RIKEARFIFDSMPIRNVIAETSMISGYAMAASTKAARLMFTKMAERNVVSWNALIAGYTQ 364

Query: 408 NSLGHEALNYFTKMMKFRINPDQYTFSSCLCACASIAALKHGKQVHAYLVRTNFRCNT-- 467
           N    EAL+ F  + +  + P  Y+F++ L ACA +A L  G Q H ++++  F+  +  
Sbjct: 365 NGENEEALSLFCLLKRESVCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFKFQSGE 424

Query: 468 ----IVVSSLIDMYSKCGMLEASCRVFYLMGNKQDVVLWNTMISALAQHGHGEEAMQMFN 527
                V +SLIDMY KCG +E    VF  M  ++D V WN MI   AQ+G+G EA+++F 
Sbjct: 425 EDDIFVGNSLIDMYVKCGCVEEGYLVFRKM-MERDCVSWNAMIIGFAQNGYGNEALELFR 484

Query: 528 DMVESGLKPDRITFIVILSACSHSGLVQEGLRFFKTMTYDHGVLPDQEHYACLIDLLGRA 587
           +M+ESG KPD IT I +LSAC H+G V+EG  +F +MT D GV P ++HY C++DLLGRA
Sbjct: 485 EMLESGEKPDHITMIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRA 544

Query: 588 GCFIELVNELEKMSCKPDDQVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLA 601
           G   E  + +E+M  +PD  +W +LL  C++H NI LG+ VAE ++E+EP +S  YV L+
Sbjct: 545 GFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPYVLLS 604

BLAST of HG10018885 vs. ExPASy Swiss-Prot
Match: O23169 (Pentatricopeptide repeat-containing protein At4g37170 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H5 PE=3 SV=1)

HSP 1 Score: 402.5 bits (1033), Expect = 8.4e-111
Identity = 213/572 (37.24%), Positives = 337/572 (58.92%), Query Frame = 0

Query: 37  LTSQGRLPEALSYLDPLAQRGIRLPTSTFVNLLRLCAKAKFFKGGKCVHLHLKHTGFKRP 96
           L  Q  L EA+  L     R  + P ST+ NL+++C++ +  + GK VH H++ +GF  P
Sbjct: 64  LCGQKLLREAVQLLG----RAKKPPASTYCNLIQVCSQTRALEEGKKVHEHIRTSGFV-P 123

Query: 97  TTIIANHLIGMYFECGNDIEARKVFDKMSVRNLYSWNHMLAGYAKLGNVHHARKLFERMT 156
             +I N L+ MY +CG+ ++ARKVFD+M  R+L SWN M+ GYA++G +  ARKLF+ MT
Sbjct: 124 GIVIWNRLLRMYAKCGSLVDARKVFDEMPNRDLCSWNVMVNGYAEVGLLEEARKLFDEMT 183

Query: 157 EKDVVSWNTMVLAYAKKGCFSEAIGLYRDFRRL-DMGFNEFSFAGVLILCVKLKELQLAK 216
           EKD  SW  MV  Y KK    EA+ LY   +R+ +   N F+ +  +     +K ++  K
Sbjct: 184 EKDSYSWTAMVTGYVKKDQPEEALVLYSLMQRVPNSRPNIFTVSIAVAAAAAVKCIRRGK 243

Query: 217 QVHGQVLVVGFLSNVVLSSSIVDAYAKCGEMACARRLFDEMLVKDILAWTTMVSGYAKWG 276
           ++HG ++  G  S+ VL SS++D Y KCG +  AR +FD+++ KD+              
Sbjct: 244 EIHGHIVRAGLDSDEVLWSSLMDMYGKCGCIDEARNIFDKIVEKDV-------------- 303

Query: 277 DMNLASELFHQMPEKNPVSWTALISGYTRNSLGHEALNYFTKMMKFRINPDQYTFSSCLC 336
                            VSWT++I  Y ++S   E  + F++++     P++YTF+  L 
Sbjct: 304 -----------------VSWTSMIDRYFKSSRWREGFSLFSELVGSCERPNEYTFAGVLN 363

Query: 337 ACASIAALKHGKQVHAYLVRTNFRCNTIVVSSLIDMYSKCGMLEASCRVFYLMGNKQDVV 396
           ACA +   + GKQVH Y+ R  F   +   SSL+DMY+KCG +E++  V      K D+V
Sbjct: 364 ACADLTTEELGKQVHGYMTRVGFDPYSFASSSLVDMYTKCGNIESAKHVVDGC-PKPDLV 423

Query: 397 LWNTMISALAQHGHGEEAMQMFNDMVESGLKPDRITFIVILSACSHSGLVQEGLRFFKTM 456
            W ++I   AQ+G  +EA++ F+ +++SG KPD +TF+ +LSAC+H+GLV++GL FF ++
Sbjct: 424 SWTSLIGGCAQNGQPDEALKYFDLLLKSGTKPDHVTFVNVLSACTHAGLVEKGLEFFYSI 483

Query: 457 TYDHGVLPDQEHYACLIDLLGRAGCFIELVNELEKMSCKPDDQVWNALLGVCRIHGNIEL 516
           T  H +    +HY CL+DLL R+G F +L + + +M  KP   +W ++LG C  +GNI+L
Sbjct: 484 TEKHRLSHTSDHYTCLVDLLARSGRFEQLKSVISEMPMKPSKFLWASVLGGCSTYGNIDL 543

Query: 517 GRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERAISWIDIGN 576
             + A+ + ++EP++   YV++A++YA  GKWE   K+R+ M+E  V K    SW +I  
Sbjct: 544 AEEAAQELFKIEPENPVTYVTMANIYAAAGKWEEEGKMRKRMQEIGVTKRPGSSWTEIKR 598

Query: 577 KVHSFIASDRLHPLKEEIYSLLEQLASHTEED 608
           K H FIA+D  HP+  +I   L +L    +E+
Sbjct: 604 KRHVFIAADTSHPMYNQIVEFLRELRKKMKEE 598

BLAST of HG10018885 vs. ExPASy Swiss-Prot
Match: Q9SY02 (Pentatricopeptide repeat-containing protein At4g02750 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H24 PE=3 SV=1)

HSP 1 Score: 387.5 bits (994), Expect = 2.8e-106
Identity = 216/613 (35.24%), Positives = 326/613 (53.18%), Query Frame = 0

Query: 73  AKAKFFKGGKCVHLHLKHTGFKRPTTIIANHLIGMYFECGNDIEARKVFDKMSVRNLYSW 132
           A + + + G+C           R +++  N +I  Y   G    ARK+FD+M  R+L SW
Sbjct: 70  AISSYMRTGRCNEALRVFKRMPRWSSVSYNGMISGYLRNGEFELARKLFDEMPERDLVSW 129

Query: 133 NHMLAGYAKLGNVHHARKLFERMTEKDVVSWNTMVLAYAKKGCFSEAIGLY-RDFRRLDM 192
           N M+ GY +  N+  AR+LFE M E+DV SWNTM+  YA+ GC  +A  ++ R   + D+
Sbjct: 130 NVMIKGYVRNRNLGKARELFEIMPERDVCSWNTMLSGYAQNGCVDDARSVFDRMPEKNDV 189

Query: 193 GFNEFSFAGVL-----ILCVKLKELQLAKQVHGQVLVVGFLS-----------------N 252
            +N    A V        C+  K  +    V    L+ GF+                  +
Sbjct: 190 SWNALLSAYVQNSKMEEACMLFKSRENWALVSWNCLLGGFVKKKKIVEARQFFDSMNVRD 249

Query: 253 VVLSSSIVDAYAKCGEMACARRLFDEMLVKDILAWT------------------------ 312
           VV  ++I+  YA+ G++  AR+LFDE  V+D+  WT                        
Sbjct: 250 VVSWNTIITGYAQSGKIDEARQLFDESPVQDVFTWTAMVSGYIQNRMVEEARELFDKMPE 309

Query: 313 --------------------------------------TMVSGYAKWGDMNLASELFHQM 372
                                                 TM++GYA+ G ++ A  LF +M
Sbjct: 310 RNEVSWNAMLAGYVQGERMEMAKELFDVMPCRNVSTWNTMITGYAQCGKISEAKNLFDKM 369

Query: 373 PEKNPVSWTALISGYTRNSLGHEALNYFTKMMKFRINPDQYTFSSCLCACASIAALKHGK 432
           P+++PVSW A+I+GY+++    EAL  F +M +     ++ +FSS L  CA + AL+ GK
Sbjct: 370 PKRDPVSWAAMIAGYSQSGHSFEALRLFVQMEREGGRLNRSSFSSALSTCADVVALELGK 429

Query: 433 QVHAYLVRTNFRCNTIVVSSLIDMYSKCGMLEASCRVFYLMGNKQDVVLWNTMISALAQH 492
           Q+H  LV+  +     V ++L+ MY KCG +E +  +F  M  K D+V WNTMI+  ++H
Sbjct: 430 QLHGRLVKGGYETGCFVGNALLLMYCKCGSIEEANDLFKEMAGK-DIVSWNTMIAGYSRH 489

Query: 493 GHGEEAMQMFNDMVESGLKPDRITFIVILSACSHSGLVQEGLRFFKTMTYDHGVLPDQEH 552
           G GE A++ F  M   GLKPD  T + +LSACSH+GLV +G ++F TMT D+GV+P+ +H
Sbjct: 490 GFGEVALRFFESMKREGLKPDDATMVAVLSACSHTGLVDKGRQYFYTMTQDYGVMPNSQH 549

Query: 553 YACLIDLLGRAGCFIELVNELEKMSCKPDDQVWNALLGVCRIHGNIELGRKVAEHVIELE 601
           YAC++DLLGRAG   +  N ++ M  +PD  +W  LLG  R+HGN EL    A+ +  +E
Sbjct: 550 YACMVDLLGRAGLLEDAHNLMKNMPFEPDAAIWGTLLGASRVHGNTELAETAADKIFAME 609

BLAST of HG10018885 vs. ExPASy Swiss-Prot
Match: Q9FRI5 (Pentatricopeptide repeat-containing protein At1g25360 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H74 PE=2 SV=1)

HSP 1 Score: 379.0 bits (972), Expect = 1.0e-103
Identity = 192/508 (37.80%), Positives = 301/508 (59.25%), Query Frame = 0

Query: 98  TIIANHLIGMYFECGND----IEARKVFDKMSVRNLYSWNHMLAGYAKLGNVHHARKLFE 157
           T ++N L+ +Y +C +       ARKVFD++  ++  SW  M+ GY K G      +L E
Sbjct: 184 TSVSNALVSVYSKCASSPSLLHSARKVFDEILEKDERSWTTMMTGYVKNGYFDLGEELLE 243

Query: 158 RMTEK-DVVSWNTMVLAYAKKGCFSEAIGLYRDFRRLDMGFNEFSFAGVLILCVKLKELQ 217
            M +   +V++N M+  Y  +G + EA+ + R      +  +EF++  V+  C     LQ
Sbjct: 244 GMDDNMKLVAYNAMISGYVNRGFYQEALEMVRRMVSSGIELDEFTYPSVIRACATAGLLQ 303

Query: 218 LAKQVHGQVLVVGFLSNVVLSSSIVDAYAKCGEMACARRLFDEMLVKDILAWTTMVSGYA 277
           L KQVH  VL     S     +S+V  Y KCG+   AR +F++M  KD+++W  ++SGY 
Sbjct: 304 LGKQVHAYVLRREDFS-FHFDNSLVSLYYKCGKFDEARAIFEKMPAKDLVSWNALLSGYV 363

Query: 278 KWGDMNLASELFHQMPEKNPVSWTALISGYTRNSLGHEALNYFTKMMKFRINPDQYTFSS 337
             G +  A  +F +M EKN +SW  +ISG   N  G E L  F+ M +    P  Y FS 
Sbjct: 364 SSGHIGEAKLIFKEMKEKNILSWMIMISGLAENGFGEEGLKLFSCMKREGFEPCDYAFSG 423

Query: 338 CLCACASIAALKHGKQVHAYLVRTNFRCNTIVVSSLIDMYSKCGMLEASCRVFYLMGNKQ 397
            + +CA + A  +G+Q HA L++  F  +    ++LI MY+KCG++E + +VF  M    
Sbjct: 424 AIKSCAVLGAYCNGQQYHAQLLKIGFDSSLSAGNALITMYAKCGVVEEARQVFRTM-PCL 483

Query: 398 DVVLWNTMISALAQHGHGEEAMQMFNDMVESGLKPDRITFIVILSACSHSGLVQEGLRFF 457
           D V WN +I+AL QHGHG EA+ ++ +M++ G++PDRIT + +L+ACSH+GLV +G ++F
Sbjct: 484 DSVSWNALIAALGQHGHGAEAVDVYEEMLKKGIRPDRITLLTVLTACSHAGLVDQGRKYF 543

Query: 458 KTMTYDHGVLPDQEHYACLIDLLGRAGCFIELVNELEKMSCKPDDQVWNALLGVCRIHGN 517
            +M   + + P  +HYA LIDLL R+G F +  + +E +  KP  ++W ALL  CR+HGN
Sbjct: 544 DSMETVYRIPPGADHYARLIDLLCRSGKFSDAESVIESLPFKPTAEIWEALLSGCRVHGN 603

Query: 518 IELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERAISWID 577
           +ELG   A+ +  L P+    Y+ L++++A  G+WE V +VR+LM +R V+KE A SWI+
Sbjct: 604 MELGIIAADKLFGLIPEHDGTYMLLSNMHAATGQWEEVARVRKLMRDRGVKKEVACSWIE 663

Query: 578 IGNKVHSFIASDRLHPLKEEIYSLLEQL 601
           +  +VH+F+  D  HP  E +Y  L+ L
Sbjct: 664 METQVHTFLVDDTSHPEAEAVYIYLQDL 689

BLAST of HG10018885 vs. ExPASy TrEMBL
Match: A0A6J1HUW8 (pentatricopeptide repeat-containing protein At2g21090-like OS=Cucurbita maxima OX=3661 GN=LOC111467037 PE=4 SV=1)

HSP 1 Score: 1138.3 bits (2943), Expect = 0.0e+00
Identity = 552/611 (90.34%), Positives = 580/611 (94.93%), Query Frame = 0

Query: 1   MPSFSSQALKKPAFFHPKSKQRPDSTRLCIVQSLLNLTSQGRLPEALSYLDPLAQRGIRL 60
           MPSFSSQALKKPA F PKSK  PDS+R CIVQSLLN +SQG LPEALSYLDPL QRGIRL
Sbjct: 1   MPSFSSQALKKPAVFRPKSKHLPDSSRPCIVQSLLNHSSQGHLPEALSYLDPLVQRGIRL 60

Query: 61  PTSTFVNLLRLCAKAKFFKGGKCVHLHLKHTGFKRPTTIIANHLIGMYFECGNDIEARKV 120
           PTS FV+LLRLCAKAK  KGGK VHLHLK TGFKRPTTIIANHLIGMYF+CG+DIEARKV
Sbjct: 61  PTSVFVHLLRLCAKAKSLKGGKSVHLHLKLTGFKRPTTIIANHLIGMYFQCGSDIEARKV 120

Query: 121 FDKMSVRNLYSWNHMLAGYAKLGNVHHARKLFERMTEKDVVSWNTMVLAYAKKGCFSEAI 180
           FDKMSVRNLYSWNHMLAGYAKLGNV+ ARK+F+ M EKDV+SWNTMVLAYAKKGCF+EAI
Sbjct: 121 FDKMSVRNLYSWNHMLAGYAKLGNVYQARKVFDTMIEKDVISWNTMVLAYAKKGCFNEAI 180

Query: 181 GLYRDFRRLDMGFNEFSFAGVLILCVKLKELQLAKQVHGQVLVVGFLSNVVLSSSIVDAY 240
           G YRDFRR DMGFNEFSFAGVLILCVKLKELQLAKQVH QVLVVGFLSN+VLSSSIVDAY
Sbjct: 181 GFYRDFRRQDMGFNEFSFAGVLILCVKLKELQLAKQVHTQVLVVGFLSNIVLSSSIVDAY 240

Query: 241 AKCGEMACARRLFDEMLVKDILAWTTMVSGYAKWGDMNLASELFHQMPEKNPVSWTALIS 300
           AKCGEM CA+RLFDEM VKDILAWTTMVSGYAKWGDMNLAS LFHQMPEKNPVSWTALIS
Sbjct: 241 AKCGEMECAKRLFDEMPVKDILAWTTMVSGYAKWGDMNLASGLFHQMPEKNPVSWTALIS 300

Query: 301 GYTRNSLGHEALNYFTKMMKFRINPDQYTFSSCLCACASIAALKHGKQVHAYLVRTNFRC 360
           GY RNSLGHEAL+YFT+MMKFR+NPDQ+TFSSCLCACASIAALKHGKQVHAYL+RTNFRC
Sbjct: 301 GYARNSLGHEALDYFTQMMKFRVNPDQFTFSSCLCACASIAALKHGKQVHAYLIRTNFRC 360

Query: 361 NTIVVSSLIDMYSKCGMLEASCRVFYLMGNKQDVVLWNTMISALAQHGHGEEAMQMFNDM 420
           NTIVVSSLIDMYSKCGMLEA+CRVFYL+GNKQDVVLWNTMISALAQHGHGE+AMQMFNDM
Sbjct: 361 NTIVVSSLIDMYSKCGMLEAACRVFYLLGNKQDVVLWNTMISALAQHGHGEKAMQMFNDM 420

Query: 421 VESGLKPDRITFIVILSACSHSGLVQEGLRFFKTMTYDHGVLPDQEHYACLIDLLGRAGC 480
           VESGLKPDRITFIVILSACSHSGLV EGL+FFK M+YDH VLPDQEHYACLIDLLGRAGC
Sbjct: 421 VESGLKPDRITFIVILSACSHSGLVHEGLQFFKAMSYDHSVLPDQEHYACLIDLLGRAGC 480

Query: 481 FIELVNELEKMSCKPDDQVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASL 540
           F ELVNELEKM CKPDD++WNALLGVCRIHGNIELGRKVAEHVIELEP+SSAAYVSLASL
Sbjct: 481 FTELVNELEKMPCKPDDRIWNALLGVCRIHGNIELGRKVAEHVIELEPRSSAAYVSLASL 540

Query: 541 YAFLGKWESVEKVRELMEERFVRKERAISWIDIGNKVHSFIASDRLHPLKEEIYSLLEQL 600
           YAFLGKWESVE+VRE+MEER VRKERAISWIDI NKVHSFIASDR HPLKEEIYSLLEQL
Sbjct: 541 YAFLGKWESVEQVREVMEERLVRKERAISWIDIENKVHSFIASDRFHPLKEEIYSLLEQL 600

Query: 601 ASHTEEDFSII 612
           ASHTEEDFSI+
Sbjct: 601 ASHTEEDFSIV 611

BLAST of HG10018885 vs. ExPASy TrEMBL
Match: A0A6J1HLP5 (pentatricopeptide repeat-containing protein At2g21090 OS=Cucurbita moschata OX=3662 GN=LOC111464099 PE=4 SV=1)

HSP 1 Score: 1137.5 bits (2941), Expect = 0.0e+00
Identity = 550/611 (90.02%), Positives = 580/611 (94.93%), Query Frame = 0

Query: 1   MPSFSSQALKKPAFFHPKSKQRPDSTRLCIVQSLLNLTSQGRLPEALSYLDPLAQRGIRL 60
           MPSFSSQALKKPA F PKSKQ PDSTR CIVQSLLN +SQG LPEALS+LDPL QRGIRL
Sbjct: 1   MPSFSSQALKKPAVFRPKSKQLPDSTRPCIVQSLLNHSSQGNLPEALSFLDPLVQRGIRL 60

Query: 61  PTSTFVNLLRLCAKAKFFKGGKCVHLHLKHTGFKRPTTIIANHLIGMYFECGNDIEARKV 120
           PTS FV+LLRLCAKAK  KGGK VHLHLK TGFKRPTTI+ANHLIGMYF+CG+D EARKV
Sbjct: 61  PTSVFVHLLRLCAKAKSLKGGKSVHLHLKLTGFKRPTTIVANHLIGMYFQCGSDTEARKV 120

Query: 121 FDKMSVRNLYSWNHMLAGYAKLGNVHHARKLFERMTEKDVVSWNTMVLAYAKKGCFSEAI 180
           FDKMSVRNLYSWNHMLAGYAKLGNV+ ARKLF+ M EKDV+SWNTMVLAYAKKGCF+EAI
Sbjct: 121 FDKMSVRNLYSWNHMLAGYAKLGNVYQARKLFDTMIEKDVISWNTMVLAYAKKGCFNEAI 180

Query: 181 GLYRDFRRLDMGFNEFSFAGVLILCVKLKELQLAKQVHGQVLVVGFLSNVVLSSSIVDAY 240
           GLYRDFRR DMGFNEFSFAG+LILCVKLKELQLAKQVH QVLVVGFLSN+VLSSSIVDAY
Sbjct: 181 GLYRDFRRQDMGFNEFSFAGLLILCVKLKELQLAKQVHTQVLVVGFLSNIVLSSSIVDAY 240

Query: 241 AKCGEMACARRLFDEMLVKDILAWTTMVSGYAKWGDMNLASELFHQMPEKNPVSWTALIS 300
           AKCGEM CA+RLFDEM VKDILAWTTMVSGYAKWGDMNLAS LFHQMPEKNPVSWTALIS
Sbjct: 241 AKCGEMECAKRLFDEMPVKDILAWTTMVSGYAKWGDMNLASGLFHQMPEKNPVSWTALIS 300

Query: 301 GYTRNSLGHEALNYFTKMMKFRINPDQYTFSSCLCACASIAALKHGKQVHAYLVRTNFRC 360
           GY RNSLGHEAL+YFT+MMKFR+NPDQ+TFSSCLCACASIAALKHGKQVHAYL+RTNFRC
Sbjct: 301 GYARNSLGHEALDYFTQMMKFRVNPDQFTFSSCLCACASIAALKHGKQVHAYLIRTNFRC 360

Query: 361 NTIVVSSLIDMYSKCGMLEASCRVFYLMGNKQDVVLWNTMISALAQHGHGEEAMQMFNDM 420
           NTIVVSSLIDMYSKCGMLEA+C VFYL+GNKQDVVLWNTMISALAQHGHGE+AMQMFNDM
Sbjct: 361 NTIVVSSLIDMYSKCGMLEAACSVFYLLGNKQDVVLWNTMISALAQHGHGEKAMQMFNDM 420

Query: 421 VESGLKPDRITFIVILSACSHSGLVQEGLRFFKTMTYDHGVLPDQEHYACLIDLLGRAGC 480
           VESGL PDRITFIVILSACSHSGLVQEGL+FFK M+YDHG+LPDQEHYACLIDLLGRAGC
Sbjct: 421 VESGLNPDRITFIVILSACSHSGLVQEGLQFFKAMSYDHGILPDQEHYACLIDLLGRAGC 480

Query: 481 FIELVNELEKMSCKPDDQVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASL 540
           F ELVNELEKM CKPDD++WN LLGVCRIHGNIELGRKVAEHVIELEP+SSAAYVSLASL
Sbjct: 481 FTELVNELEKMPCKPDDRIWNTLLGVCRIHGNIELGRKVAEHVIELEPRSSAAYVSLASL 540

Query: 541 YAFLGKWESVEKVRELMEERFVRKERAISWIDIGNKVHSFIASDRLHPLKEEIYSLLEQL 600
           YAFLGKWESVE+VRE+MEER VRKERAISWIDI NKVHSFIASDR HPLKEEIYSLLEQL
Sbjct: 541 YAFLGKWESVEQVREVMEERLVRKERAISWIDIENKVHSFIASDRFHPLKEEIYSLLEQL 600

Query: 601 ASHTEEDFSII 612
           ASHTEEDFSI+
Sbjct: 601 ASHTEEDFSIV 611

BLAST of HG10018885 vs. ExPASy TrEMBL
Match: A0A5D3C6K7 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold13G002430 PE=4 SV=1)

HSP 1 Score: 1137.1 bits (2940), Expect = 0.0e+00
Identity = 547/611 (89.53%), Positives = 578/611 (94.60%), Query Frame = 0

Query: 1   MPSFSSQALKKPAFFHPKSKQRPDSTRLCIVQSLLNLTSQGRLPEALSYLDPLAQRGIRL 60
           MPSFSSQALKKPA F PK +Q PDSTR+CI QSLL+L+SQGRLPEALSYLD LAQRGIRL
Sbjct: 1   MPSFSSQALKKPASFRPKCEQSPDSTRICIAQSLLDLSSQGRLPEALSYLDRLAQRGIRL 60

Query: 61  PTSTFVNLLRLCAKAKFFKGGKCVHLHLKHTGFKRPTTIIANHLIGMYFECGNDIEARKV 120
           PTSTFV+LLRLCAKAK+FKGGKCVHLHLKHTGFKRPTTI+ANHLIGMYFECG D+EARKV
Sbjct: 61  PTSTFVDLLRLCAKAKYFKGGKCVHLHLKHTGFKRPTTIVANHLIGMYFECGRDVEARKV 120

Query: 121 FDKMSVRNLYSWNHMLAGYAKLGNVHHARKLFERMTEKDVVSWNTMVLAYAKKGCFSEAI 180
           FDKMSVRNLYSWNHMLAGYAKLG VHHARKLF+RM EKDVVSWNTMVLAYAKKGCF+EAI
Sbjct: 121 FDKMSVRNLYSWNHMLAGYAKLGEVHHARKLFDRMMEKDVVSWNTMVLAYAKKGCFNEAI 180

Query: 181 GLYRDFRRLDMGFNEFSFAGVLILCVKLKELQLAKQVHGQVLVVGFLSNVVLSSSIVDAY 240
           GLYRDFRRLDMGFN FSF+GVLILCVKLKELQL KQVHGQVLV GFLSN+VLS SIVDAY
Sbjct: 181 GLYRDFRRLDMGFNAFSFSGVLILCVKLKELQLTKQVHGQVLVAGFLSNLVLSCSIVDAY 240

Query: 241 AKCGEMACARRLFDEMLVKDILAWTTMVSGYAKWGDMNLASELFHQMPEKNPVSWTALIS 300
           AKCG+M CARRLFDEMLVKDI  WTTMVSGYAKWGDMN ASELFHQMPEKNPVSWTALIS
Sbjct: 241 AKCGKMGCARRLFDEMLVKDIHIWTTMVSGYAKWGDMNSASELFHQMPEKNPVSWTALIS 300

Query: 301 GYTRNSLGHEALNYFTKMMKFRINPDQYTFSSCLCACASIAALKHGKQVHAYLVRTNFRC 360
           GY RNSLGHEAL+YFTKMMK  INP+QYTFSSCLCACASIAALKHGKQVH YL+RTNFRC
Sbjct: 301 GYARNSLGHEALDYFTKMMKLGINPEQYTFSSCLCACASIAALKHGKQVHGYLIRTNFRC 360

Query: 361 NTIVVSSLIDMYSKCGMLEASCRVFYLMGNKQDVVLWNTMISALAQHGHGEEAMQMFNDM 420
           NTIVVSSLIDMYSKCGMLEASC VF+LMGNKQDVV+WNTMIS LAQ+GHGE+AMQMFN M
Sbjct: 361 NTIVVSSLIDMYSKCGMLEASCYVFHLMGNKQDVVVWNTMISGLAQNGHGEKAMQMFNHM 420

Query: 421 VESGLKPDRITFIVILSACSHSGLVQEGLRFFKTMTYDHGVLPDQEHYACLIDLLGRAGC 480
           VESG+KPDRITFIVILSACSHSGLVQEGL+FFK MTYDHGVLPDQEHYACLIDLLGRAGC
Sbjct: 421 VESGVKPDRITFIVILSACSHSGLVQEGLQFFKAMTYDHGVLPDQEHYACLIDLLGRAGC 480

Query: 481 FIELVNELEKMSCKPDDQVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASL 540
           FIELVNELE MSCKPDD+VW+ALLGVCRIH NIELGRKVAEHVIEL+PQSSAAYVSLA L
Sbjct: 481 FIELVNELENMSCKPDDRVWSALLGVCRIHNNIELGRKVAEHVIELKPQSSAAYVSLAGL 540

Query: 541 YAFLGKWESVEKVRELMEERFVRKERAISWIDIGNKVHSFIASDRLHPLKEEIYSLLEQL 600
           YAFLGKWESVEKVRELM+E+F+RKERAISWID+GNK+HSFIASDRLHPLKEEIY LLEQL
Sbjct: 541 YAFLGKWESVEKVRELMDEKFIRKERAISWIDVGNKIHSFIASDRLHPLKEEIYVLLEQL 600

Query: 601 ASHTEEDFSII 612
           A HTEE+   I
Sbjct: 601 ARHTEEELLTI 611

BLAST of HG10018885 vs. ExPASy TrEMBL
Match: A0A1S3BZY1 (pentatricopeptide repeat-containing protein At2g21090 OS=Cucumis melo OX=3656 GN=LOC103495423 PE=4 SV=1)

HSP 1 Score: 1137.1 bits (2940), Expect = 0.0e+00
Identity = 547/611 (89.53%), Positives = 578/611 (94.60%), Query Frame = 0

Query: 1   MPSFSSQALKKPAFFHPKSKQRPDSTRLCIVQSLLNLTSQGRLPEALSYLDPLAQRGIRL 60
           MPSFSSQALKKPA F PK +Q PDSTR+CI QSLL+L+SQGRLPEALSYLD LAQRGIRL
Sbjct: 1   MPSFSSQALKKPASFRPKCEQSPDSTRICIAQSLLDLSSQGRLPEALSYLDRLAQRGIRL 60

Query: 61  PTSTFVNLLRLCAKAKFFKGGKCVHLHLKHTGFKRPTTIIANHLIGMYFECGNDIEARKV 120
           PTSTFV+LLRLCAKAK+FKGGKCVHLHLKHTGFKRPTTI+ANHLIGMYFECG D+EARKV
Sbjct: 61  PTSTFVDLLRLCAKAKYFKGGKCVHLHLKHTGFKRPTTIVANHLIGMYFECGRDVEARKV 120

Query: 121 FDKMSVRNLYSWNHMLAGYAKLGNVHHARKLFERMTEKDVVSWNTMVLAYAKKGCFSEAI 180
           FDKMSVRNLYSWNHMLAGYAKLG VHHARKLF+RM EKDVVSWNTMVLAYAKKGCF+EAI
Sbjct: 121 FDKMSVRNLYSWNHMLAGYAKLGEVHHARKLFDRMMEKDVVSWNTMVLAYAKKGCFNEAI 180

Query: 181 GLYRDFRRLDMGFNEFSFAGVLILCVKLKELQLAKQVHGQVLVVGFLSNVVLSSSIVDAY 240
           GLYRDFRRLDMGFN FSF+GVLILCVKLKELQL KQVHGQVLV GFLSN+VLS SIVDAY
Sbjct: 181 GLYRDFRRLDMGFNAFSFSGVLILCVKLKELQLTKQVHGQVLVAGFLSNLVLSCSIVDAY 240

Query: 241 AKCGEMACARRLFDEMLVKDILAWTTMVSGYAKWGDMNLASELFHQMPEKNPVSWTALIS 300
           AKCG+M CARRLFDEMLVKDI  WTTMVSGYAKWGDMN ASELFHQMPEKNPVSWTALIS
Sbjct: 241 AKCGKMGCARRLFDEMLVKDIHIWTTMVSGYAKWGDMNSASELFHQMPEKNPVSWTALIS 300

Query: 301 GYTRNSLGHEALNYFTKMMKFRINPDQYTFSSCLCACASIAALKHGKQVHAYLVRTNFRC 360
           GY RNSLGHEAL+YFTKMMK  INP+QYTFSSCLCACASIAALKHGKQVH YL+RTNFRC
Sbjct: 301 GYARNSLGHEALDYFTKMMKLGINPEQYTFSSCLCACASIAALKHGKQVHGYLIRTNFRC 360

Query: 361 NTIVVSSLIDMYSKCGMLEASCRVFYLMGNKQDVVLWNTMISALAQHGHGEEAMQMFNDM 420
           NTIVVSSLIDMYSKCGMLEASC VF+LMGNKQDVV+WNTMIS LAQ+GHGE+AMQMFN M
Sbjct: 361 NTIVVSSLIDMYSKCGMLEASCYVFHLMGNKQDVVVWNTMISGLAQNGHGEKAMQMFNHM 420

Query: 421 VESGLKPDRITFIVILSACSHSGLVQEGLRFFKTMTYDHGVLPDQEHYACLIDLLGRAGC 480
           VESG+KPDRITFIVILSACSHSGLVQEGL+FFK MTYDHGVLPDQEHYACLIDLLGRAGC
Sbjct: 421 VESGVKPDRITFIVILSACSHSGLVQEGLQFFKAMTYDHGVLPDQEHYACLIDLLGRAGC 480

Query: 481 FIELVNELEKMSCKPDDQVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASL 540
           FIELVNELE MSCKPDD+VW+ALLGVCRIH NIELGRKVAEHVIEL+PQSSAAYVSLA L
Sbjct: 481 FIELVNELENMSCKPDDRVWSALLGVCRIHNNIELGRKVAEHVIELKPQSSAAYVSLAGL 540

Query: 541 YAFLGKWESVEKVRELMEERFVRKERAISWIDIGNKVHSFIASDRLHPLKEEIYSLLEQL 600
           YAFLGKWESVEKVRELM+E+F+RKERAISWID+GNK+HSFIASDRLHPLKEEIY LLEQL
Sbjct: 541 YAFLGKWESVEKVRELMDEKFIRKERAISWIDVGNKIHSFIASDRLHPLKEEIYVLLEQL 600

Query: 601 ASHTEEDFSII 612
           A HTEE+   I
Sbjct: 601 ARHTEEELLTI 611

BLAST of HG10018885 vs. ExPASy TrEMBL
Match: A0A0A0K215 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G062860 PE=4 SV=1)

HSP 1 Score: 1129.0 bits (2919), Expect = 0.0e+00
Identity = 545/611 (89.20%), Positives = 578/611 (94.60%), Query Frame = 0

Query: 1   MPSFSSQALKKPAFFHPKSKQRPDSTRLCIVQSLLNLTSQGRLPEALSYLDPLAQRGIRL 60
           MPSFSSQA K PA F PKSKQRPDST LCI QSLL+L+SQGRLPEALSYLD LAQRG+RL
Sbjct: 1   MPSFSSQAFKTPASFGPKSKQRPDSTSLCIAQSLLDLSSQGRLPEALSYLDRLAQRGVRL 60

Query: 61  PTSTFVNLLRLCAKAKFFKGGKCVHLHLKHTGFKRPTTIIANHLIGMYFECGNDIEARKV 120
           PT  FV+LLRLCAKAK+FKGGKCVHLHLKHTGFKRPTTI+ANHLIGMYFECG D+EARKV
Sbjct: 61  PTGIFVDLLRLCAKAKYFKGGKCVHLHLKHTGFKRPTTIVANHLIGMYFECGRDVEARKV 120

Query: 121 FDKMSVRNLYSWNHMLAGYAKLGNVHHARKLFERMTEKDVVSWNTMVLAYAKKGCFSEAI 180
           FDKMSVRNLYSWNHMLAGYAKLG+V++ARKLF+RM EKDVVSWNT+VLAYAK+GCF+EAI
Sbjct: 121 FDKMSVRNLYSWNHMLAGYAKLGDVNNARKLFDRMMEKDVVSWNTIVLAYAKQGCFNEAI 180

Query: 181 GLYRDFRRLDMGFNEFSFAGVLILCVKLKELQLAKQVHGQVLVVGFLSNVVLSSSIVDAY 240
           GLYRDFRRLDMGFN FSFAGVLILCVKLKELQLAKQVHGQVLV GFLSN+VLSSSIVDAY
Sbjct: 181 GLYRDFRRLDMGFNAFSFAGVLILCVKLKELQLAKQVHGQVLVAGFLSNLVLSSSIVDAY 240

Query: 241 AKCGEMACARRLFDEMLVKDILAWTTMVSGYAKWGDMNLASELFHQMPEKNPVSWTALIS 300
           AKCGEM CAR LFDEMLVKDI AWTT+VSGYAKWGDMN ASELFHQMPEKNPVSW+ALIS
Sbjct: 241 AKCGEMRCARTLFDEMLVKDIHAWTTIVSGYAKWGDMNSASELFHQMPEKNPVSWSALIS 300

Query: 301 GYTRNSLGHEALNYFTKMMKFRINPDQYTFSSCLCACASIAALKHGKQVHAYLVRTNFRC 360
           GY RNSLGHEAL+YFTKMMKF INP+QYTFSSCLCACASIAALKHGKQVH YL+RT FRC
Sbjct: 301 GYARNSLGHEALDYFTKMMKFGINPEQYTFSSCLCACASIAALKHGKQVHGYLIRTYFRC 360

Query: 361 NTIVVSSLIDMYSKCGMLEASCRVFYLMGNKQDVVLWNTMISALAQHGHGEEAMQMFNDM 420
           NTIVVSSLIDMYSKCGMLEASC VF+LMGNKQDVV+WNTMISALAQ+GHGE+AMQMFNDM
Sbjct: 361 NTIVVSSLIDMYSKCGMLEASCCVFHLMGNKQDVVVWNTMISALAQNGHGEKAMQMFNDM 420

Query: 421 VESGLKPDRITFIVILSACSHSGLVQEGLRFFKTMTYDHGVLPDQEHYACLIDLLGRAGC 480
           VESGLKPDRITFIVILSACSHSGLVQEGLRFFK MTYDHGV PDQEHY+CLIDLLGRAGC
Sbjct: 421 VESGLKPDRITFIVILSACSHSGLVQEGLRFFKAMTYDHGVFPDQEHYSCLIDLLGRAGC 480

Query: 481 FIELVNELEKMSCKPDDQVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASL 540
           F+ELVNELE MSCKPDD+VW+ALLGVCRIH NIELGRKVAE VIEL+PQSSAAYVSLASL
Sbjct: 481 FVELVNELENMSCKPDDRVWSALLGVCRIHNNIELGRKVAERVIELKPQSSAAYVSLASL 540

Query: 541 YAFLGKWESVEKVRELMEERFVRKERAISWIDIGNKVHSFIASDRLHPLKEEIYSLLEQL 600
           YAFLGKWESVEKVRELM+E+F+RKER ISWID+GNK HSFIASDRLHPLKEEIY LLEQL
Sbjct: 541 YAFLGKWESVEKVRELMDEKFIRKERGISWIDVGNKTHSFIASDRLHPLKEEIYLLLEQL 600

Query: 601 ASHTEEDFSII 612
           A HTEEDF  I
Sbjct: 601 ARHTEEDFLTI 611

BLAST of HG10018885 vs. TAIR 10
Match: AT2G21090.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 727.6 bits (1877), Expect = 8.1e-210
Identity = 347/590 (58.81%), Positives = 450/590 (76.27%), Query Frame = 0

Query: 23  PDSTRLCIVQSLLNL-TSQGRLPEALSYLDPLAQRGIRLPTSTFVNLLRLCAKAKFFKGG 82
           P    +C+ QS L+   ++  L +A+S L+ L Q+GIRLP     +LL+ C   K  K G
Sbjct: 6   PRKRPICVAQSFLSKHATKAELSQAVSRLESLTQQGIRLPFDLLASLLQQCGDTKSLKQG 65

Query: 83  KCVHLHLKHTGFKRPTTIIANHLIGMYFECGNDIEARKVFDKMSVRNLYSWNHMLAGYAK 142
           K +H HLK TGFKRP T+++NHLIGMY +CG  I+A KVFD+M +RNLYSWN+M++GY K
Sbjct: 66  KWIHRHLKITGFKRPNTLLSNHLIGMYMKCGKPIDACKVFDQMHLRNLYSWNNMVSGYVK 125

Query: 143 LGNVHHARKLFERMTEKDVVSWNTMVLAYAKKGCFSEAIGLYRDFRRLDMGFNEFSFAGV 202
            G +  AR +F+ M E+DVVSWNTMV+ YA+ G   EA+  Y++FRR  + FNEFSFAG+
Sbjct: 126 SGMLVRARVVFDSMPERDVVSWNTMVIGYAQDGNLHEALWFYKEFRRSGIKFNEFSFAGL 185

Query: 203 LILCVKLKELQLAKQVHGQVLVVGFLSNVVLSSSIVDAYAKCGEMACARRLFDEMLVKDI 262
           L  CVK ++LQL +Q HGQVLV GFLSNVVLS SI+DAYAKCG+M  A+R FDEM VKDI
Sbjct: 186 LTACVKSRQLQLNRQAHGQVLVAGFLSNVVLSCSIIDAYAKCGQMESAKRCFDEMTVKDI 245

Query: 263 LAWTTMVSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYTRNSLGHEALNYFTKMMKF 322
             WTT++SGYAK GDM  A +LF +MPEKNPVSWTALI+GY R   G+ AL+ F KM+  
Sbjct: 246 HIWTTLISGYAKLGDMEAAEKLFCEMPEKNPVSWTALIAGYVRQGSGNRALDLFRKMIAL 305

Query: 323 RINPDQYTFSSCLCACASIAALKHGKQVHAYLVRTNFRCNTIVVSSLIDMYSKCGMLEAS 382
            + P+Q+TFSSCLCA ASIA+L+HGK++H Y++RTN R N IV+SSLIDMYSK G LEAS
Sbjct: 306 GVKPEQFTFSSCLCASASIASLRHGKEIHGYMIRTNVRPNAIVISSLIDMYSKSGSLEAS 365

Query: 383 CRVFYLMGNKQDVVLWNTMISALAQHGHGEEAMQMFNDMVESGLKPDRITFIVILSACSH 442
            RVF +  +K D V WNTMISALAQHG G +A++M +DM++  ++P+R T +VIL+ACSH
Sbjct: 366 ERVFRICDDKHDCVFWNTMISALAQHGLGHKALRMLDDMIKFRVQPNRTTLVVILNACSH 425

Query: 443 SGLVQEGLRFFKTMTYDHGVLPDQEHYACLIDLLGRAGCFIELVNELEKMSCKPDDQVWN 502
           SGLV+EGLR+F++MT  HG++PDQEHYACLIDLLGRAGCF EL+ ++E+M  +PD  +WN
Sbjct: 426 SGLVEEGLRWFESMTVQHGIVPDQEHYACLIDLLGRAGCFKELMRKIEEMPFEPDKHIWN 485

Query: 503 ALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERF 562
           A+LGVCRIHGN ELG+K A+ +I+L+P+SSA Y+ L+S+YA  GKWE VEK+R +M++R 
Sbjct: 486 AILGVCRIHGNEELGKKAADELIKLDPESSAPYILLSSIYADHGKWELVEKLRGVMKKRR 545

Query: 563 VRKERAISWIDIGNKVHSFIASD--RLHPLKEEIYSLLEQLASHTEEDFS 610
           V KE+A+SWI+I  KV +F  SD    H  KEEIY +L  LA+  EE+ S
Sbjct: 546 VNKEKAVSWIEIEKKVEAFTVSDGSHAHARKEEIYFILHNLAAVIEEEAS 595

BLAST of HG10018885 vs. TAIR 10
Match: AT2G13600.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 404.4 bits (1038), Expect = 1.6e-112
Identity = 232/662 (35.05%), Positives = 338/662 (51.06%), Query Frame = 0

Query: 48  SYLDPLAQRGIRLPTSTFVNLLRLCAKAKFFK-GGKCVHLHLKHTGFKRPTTIIANHLIG 107
           S+L   A       +S F  LL  C K+K      + VH  +  +GF      I N LI 
Sbjct: 5   SFLKLAADLSSFTDSSPFAKLLDSCIKSKLSAIYVRYVHASVIKSGFSN-EIFIQNRLID 64

Query: 108 MYFECGNDIEARKVFDKMSVRNLYSWNHMLAGYAKLGNVHHARKLFERMTEKDVVSWNTM 167
            Y +CG+  + R+VFDKM  RN+Y+WN ++ G  KLG +  A  LF  M E+D  +WN+M
Sbjct: 65  AYSKCGSLEDGRQVFDKMPQRNIYTWNSVVTGLTKLGFLDEADSLFRSMPERDQCTWNSM 124

Query: 168 VLAYAKKGCFSEAIGLYRDFRRLDMGFNEFSFAGVLILCVKLKELQLAKQVHGQVLVVGF 227
           V  +A+     EA+  +    +     NE+SFA VL  C  L ++    QVH  +    F
Sbjct: 125 VSGFAQHDRCEEALCYFAMMHKEGFVLNEYSFASVLSACSGLNDMNKGVQVHSLIAKSPF 184

Query: 228 LSNVVLSSSIVDAYAKCGEMACARRLFDEMLVKDILAW---------------------- 287
           LS+V + S++VD Y+KCG +  A+R+FDEM  +++++W                      
Sbjct: 185 LSDVYIGSALVDMYSKCGNVNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQM 244

Query: 288 ------------------------------------------------------------ 347
                                                                       
Sbjct: 245 MLESRVEPDEVTLASVISACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKCS 304

Query: 348 --------------------TTMVSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYTR 407
                               T+M+SGYA       A  +F +M E+N VSW ALI+GYT+
Sbjct: 305 RIKEARFIFDSMPIRNVIAETSMISGYAMAASTKAARLMFTKMAERNVVSWNALIAGYTQ 364

Query: 408 NSLGHEALNYFTKMMKFRINPDQYTFSSCLCACASIAALKHGKQVHAYLVRTNFRCNT-- 467
           N    EAL+ F  + +  + P  Y+F++ L ACA +A L  G Q H ++++  F+  +  
Sbjct: 365 NGENEEALSLFCLLKRESVCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFKFQSGE 424

Query: 468 ----IVVSSLIDMYSKCGMLEASCRVFYLMGNKQDVVLWNTMISALAQHGHGEEAMQMFN 527
                V +SLIDMY KCG +E    VF  M  ++D V WN MI   AQ+G+G EA+++F 
Sbjct: 425 EDDIFVGNSLIDMYVKCGCVEEGYLVFRKM-MERDCVSWNAMIIGFAQNGYGNEALELFR 484

Query: 528 DMVESGLKPDRITFIVILSACSHSGLVQEGLRFFKTMTYDHGVLPDQEHYACLIDLLGRA 587
           +M+ESG KPD IT I +LSAC H+G V+EG  +F +MT D GV P ++HY C++DLLGRA
Sbjct: 485 EMLESGEKPDHITMIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRA 544

Query: 588 GCFIELVNELEKMSCKPDDQVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLA 601
           G   E  + +E+M  +PD  +W +LL  C++H NI LG+ VAE ++E+EP +S  YV L+
Sbjct: 545 GFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPYVLLS 604

BLAST of HG10018885 vs. TAIR 10
Match: AT4G37170.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 402.5 bits (1033), Expect = 6.0e-112
Identity = 213/572 (37.24%), Positives = 337/572 (58.92%), Query Frame = 0

Query: 37  LTSQGRLPEALSYLDPLAQRGIRLPTSTFVNLLRLCAKAKFFKGGKCVHLHLKHTGFKRP 96
           L  Q  L EA+  L     R  + P ST+ NL+++C++ +  + GK VH H++ +GF  P
Sbjct: 64  LCGQKLLREAVQLLG----RAKKPPASTYCNLIQVCSQTRALEEGKKVHEHIRTSGFV-P 123

Query: 97  TTIIANHLIGMYFECGNDIEARKVFDKMSVRNLYSWNHMLAGYAKLGNVHHARKLFERMT 156
             +I N L+ MY +CG+ ++ARKVFD+M  R+L SWN M+ GYA++G +  ARKLF+ MT
Sbjct: 124 GIVIWNRLLRMYAKCGSLVDARKVFDEMPNRDLCSWNVMVNGYAEVGLLEEARKLFDEMT 183

Query: 157 EKDVVSWNTMVLAYAKKGCFSEAIGLYRDFRRL-DMGFNEFSFAGVLILCVKLKELQLAK 216
           EKD  SW  MV  Y KK    EA+ LY   +R+ +   N F+ +  +     +K ++  K
Sbjct: 184 EKDSYSWTAMVTGYVKKDQPEEALVLYSLMQRVPNSRPNIFTVSIAVAAAAAVKCIRRGK 243

Query: 217 QVHGQVLVVGFLSNVVLSSSIVDAYAKCGEMACARRLFDEMLVKDILAWTTMVSGYAKWG 276
           ++HG ++  G  S+ VL SS++D Y KCG +  AR +FD+++ KD+              
Sbjct: 244 EIHGHIVRAGLDSDEVLWSSLMDMYGKCGCIDEARNIFDKIVEKDV-------------- 303

Query: 277 DMNLASELFHQMPEKNPVSWTALISGYTRNSLGHEALNYFTKMMKFRINPDQYTFSSCLC 336
                            VSWT++I  Y ++S   E  + F++++     P++YTF+  L 
Sbjct: 304 -----------------VSWTSMIDRYFKSSRWREGFSLFSELVGSCERPNEYTFAGVLN 363

Query: 337 ACASIAALKHGKQVHAYLVRTNFRCNTIVVSSLIDMYSKCGMLEASCRVFYLMGNKQDVV 396
           ACA +   + GKQVH Y+ R  F   +   SSL+DMY+KCG +E++  V      K D+V
Sbjct: 364 ACADLTTEELGKQVHGYMTRVGFDPYSFASSSLVDMYTKCGNIESAKHVVDGC-PKPDLV 423

Query: 397 LWNTMISALAQHGHGEEAMQMFNDMVESGLKPDRITFIVILSACSHSGLVQEGLRFFKTM 456
            W ++I   AQ+G  +EA++ F+ +++SG KPD +TF+ +LSAC+H+GLV++GL FF ++
Sbjct: 424 SWTSLIGGCAQNGQPDEALKYFDLLLKSGTKPDHVTFVNVLSACTHAGLVEKGLEFFYSI 483

Query: 457 TYDHGVLPDQEHYACLIDLLGRAGCFIELVNELEKMSCKPDDQVWNALLGVCRIHGNIEL 516
           T  H +    +HY CL+DLL R+G F +L + + +M  KP   +W ++LG C  +GNI+L
Sbjct: 484 TEKHRLSHTSDHYTCLVDLLARSGRFEQLKSVISEMPMKPSKFLWASVLGGCSTYGNIDL 543

Query: 517 GRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERAISWIDIGN 576
             + A+ + ++EP++   YV++A++YA  GKWE   K+R+ M+E  V K    SW +I  
Sbjct: 544 AEEAAQELFKIEPENPVTYVTMANIYAAAGKWEEEGKMRKRMQEIGVTKRPGSSWTEIKR 598

Query: 577 KVHSFIASDRLHPLKEEIYSLLEQLASHTEED 608
           K H FIA+D  HP+  +I   L +L    +E+
Sbjct: 604 KRHVFIAADTSHPMYNQIVEFLRELRKKMKEE 598

BLAST of HG10018885 vs. TAIR 10
Match: AT4G02750.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 387.5 bits (994), Expect = 2.0e-107
Identity = 216/613 (35.24%), Positives = 326/613 (53.18%), Query Frame = 0

Query: 73  AKAKFFKGGKCVHLHLKHTGFKRPTTIIANHLIGMYFECGNDIEARKVFDKMSVRNLYSW 132
           A + + + G+C           R +++  N +I  Y   G    ARK+FD+M  R+L SW
Sbjct: 70  AISSYMRTGRCNEALRVFKRMPRWSSVSYNGMISGYLRNGEFELARKLFDEMPERDLVSW 129

Query: 133 NHMLAGYAKLGNVHHARKLFERMTEKDVVSWNTMVLAYAKKGCFSEAIGLY-RDFRRLDM 192
           N M+ GY +  N+  AR+LFE M E+DV SWNTM+  YA+ GC  +A  ++ R   + D+
Sbjct: 130 NVMIKGYVRNRNLGKARELFEIMPERDVCSWNTMLSGYAQNGCVDDARSVFDRMPEKNDV 189

Query: 193 GFNEFSFAGVL-----ILCVKLKELQLAKQVHGQVLVVGFLS-----------------N 252
            +N    A V        C+  K  +    V    L+ GF+                  +
Sbjct: 190 SWNALLSAYVQNSKMEEACMLFKSRENWALVSWNCLLGGFVKKKKIVEARQFFDSMNVRD 249

Query: 253 VVLSSSIVDAYAKCGEMACARRLFDEMLVKDILAWT------------------------ 312
           VV  ++I+  YA+ G++  AR+LFDE  V+D+  WT                        
Sbjct: 250 VVSWNTIITGYAQSGKIDEARQLFDESPVQDVFTWTAMVSGYIQNRMVEEARELFDKMPE 309

Query: 313 --------------------------------------TMVSGYAKWGDMNLASELFHQM 372
                                                 TM++GYA+ G ++ A  LF +M
Sbjct: 310 RNEVSWNAMLAGYVQGERMEMAKELFDVMPCRNVSTWNTMITGYAQCGKISEAKNLFDKM 369

Query: 373 PEKNPVSWTALISGYTRNSLGHEALNYFTKMMKFRINPDQYTFSSCLCACASIAALKHGK 432
           P+++PVSW A+I+GY+++    EAL  F +M +     ++ +FSS L  CA + AL+ GK
Sbjct: 370 PKRDPVSWAAMIAGYSQSGHSFEALRLFVQMEREGGRLNRSSFSSALSTCADVVALELGK 429

Query: 433 QVHAYLVRTNFRCNTIVVSSLIDMYSKCGMLEASCRVFYLMGNKQDVVLWNTMISALAQH 492
           Q+H  LV+  +     V ++L+ MY KCG +E +  +F  M  K D+V WNTMI+  ++H
Sbjct: 430 QLHGRLVKGGYETGCFVGNALLLMYCKCGSIEEANDLFKEMAGK-DIVSWNTMIAGYSRH 489

Query: 493 GHGEEAMQMFNDMVESGLKPDRITFIVILSACSHSGLVQEGLRFFKTMTYDHGVLPDQEH 552
           G GE A++ F  M   GLKPD  T + +LSACSH+GLV +G ++F TMT D+GV+P+ +H
Sbjct: 490 GFGEVALRFFESMKREGLKPDDATMVAVLSACSHTGLVDKGRQYFYTMTQDYGVMPNSQH 549

Query: 553 YACLIDLLGRAGCFIELVNELEKMSCKPDDQVWNALLGVCRIHGNIELGRKVAEHVIELE 601
           YAC++DLLGRAG   +  N ++ M  +PD  +W  LLG  R+HGN EL    A+ +  +E
Sbjct: 550 YACMVDLLGRAGLLEDAHNLMKNMPFEPDAAIWGTLLGASRVHGNTELAETAADKIFAME 609

BLAST of HG10018885 vs. TAIR 10
Match: AT1G25360.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 379.0 bits (972), Expect = 7.1e-105
Identity = 192/508 (37.80%), Positives = 301/508 (59.25%), Query Frame = 0

Query: 98  TIIANHLIGMYFECGND----IEARKVFDKMSVRNLYSWNHMLAGYAKLGNVHHARKLFE 157
           T ++N L+ +Y +C +       ARKVFD++  ++  SW  M+ GY K G      +L E
Sbjct: 184 TSVSNALVSVYSKCASSPSLLHSARKVFDEILEKDERSWTTMMTGYVKNGYFDLGEELLE 243

Query: 158 RMTEK-DVVSWNTMVLAYAKKGCFSEAIGLYRDFRRLDMGFNEFSFAGVLILCVKLKELQ 217
            M +   +V++N M+  Y  +G + EA+ + R      +  +EF++  V+  C     LQ
Sbjct: 244 GMDDNMKLVAYNAMISGYVNRGFYQEALEMVRRMVSSGIELDEFTYPSVIRACATAGLLQ 303

Query: 218 LAKQVHGQVLVVGFLSNVVLSSSIVDAYAKCGEMACARRLFDEMLVKDILAWTTMVSGYA 277
           L KQVH  VL     S     +S+V  Y KCG+   AR +F++M  KD+++W  ++SGY 
Sbjct: 304 LGKQVHAYVLRREDFS-FHFDNSLVSLYYKCGKFDEARAIFEKMPAKDLVSWNALLSGYV 363

Query: 278 KWGDMNLASELFHQMPEKNPVSWTALISGYTRNSLGHEALNYFTKMMKFRINPDQYTFSS 337
             G +  A  +F +M EKN +SW  +ISG   N  G E L  F+ M +    P  Y FS 
Sbjct: 364 SSGHIGEAKLIFKEMKEKNILSWMIMISGLAENGFGEEGLKLFSCMKREGFEPCDYAFSG 423

Query: 338 CLCACASIAALKHGKQVHAYLVRTNFRCNTIVVSSLIDMYSKCGMLEASCRVFYLMGNKQ 397
            + +CA + A  +G+Q HA L++  F  +    ++LI MY+KCG++E + +VF  M    
Sbjct: 424 AIKSCAVLGAYCNGQQYHAQLLKIGFDSSLSAGNALITMYAKCGVVEEARQVFRTM-PCL 483

Query: 398 DVVLWNTMISALAQHGHGEEAMQMFNDMVESGLKPDRITFIVILSACSHSGLVQEGLRFF 457
           D V WN +I+AL QHGHG EA+ ++ +M++ G++PDRIT + +L+ACSH+GLV +G ++F
Sbjct: 484 DSVSWNALIAALGQHGHGAEAVDVYEEMLKKGIRPDRITLLTVLTACSHAGLVDQGRKYF 543

Query: 458 KTMTYDHGVLPDQEHYACLIDLLGRAGCFIELVNELEKMSCKPDDQVWNALLGVCRIHGN 517
            +M   + + P  +HYA LIDLL R+G F +  + +E +  KP  ++W ALL  CR+HGN
Sbjct: 544 DSMETVYRIPPGADHYARLIDLLCRSGKFSDAESVIESLPFKPTAEIWEALLSGCRVHGN 603

Query: 518 IELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERAISWID 577
           +ELG   A+ +  L P+    Y+ L++++A  G+WE V +VR+LM +R V+KE A SWI+
Sbjct: 604 MELGIIAADKLFGLIPEHDGTYMLLSNMHAATGQWEEVARVRKLMRDRGVKKEVACSWIE 663

Query: 578 IGNKVHSFIASDRLHPLKEEIYSLLEQL 601
           +  +VH+F+  D  HP  E +Y  L+ L
Sbjct: 664 METQVHTFLVDDTSHPEAEAVYIYLQDL 689
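
The percentages in the HSP header lines above follow directly from the fractions printed beside them (matching positions divided by alignment length). The following is a minimal sketch, not part of the database output, of how one such header line could be parsed and its percentages recomputed. The function names `parse_hsp_fractions` and `percent` are hypothetical, and the sample line is the AT2G21090.1 header from the TAIR 10 results.

```python
import re

# Matches "<label> = <hits>/<length> (<percent>%)" for any label, so the
# pattern does not depend on how the label is spelled in the report.
FRACTION = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_fractions(header):
    """Return {label: (hits, length, reported_percent)} for one HSP header line."""
    return {
        label: (int(hits), int(length), float(pct))
        for label, hits, length, pct in FRACTION.findall(header)
    }


def percent(hits, length):
    """Recompute the percentage from the raw counts, rounded to two decimals."""
    return round(100.0 * hits / length, 2)


header = "Identity = 347/590 (58.81%), Positives = 450/590 (76.27%), Query Frame = 0"
for label, (hits, length, reported) in parse_hsp_fractions(header).items():
    # Print the label, the recomputed value, and the value reported in the header.
    print(label, percent(hits, length), reported)
```

The "Query Frame = 0" field is skipped because it carries no fraction. The same functions can be applied to any of the other header lines to cross-check the Identity values against the summary tables.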

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_038886822.1  0.0e+00   95.09         pentatricopeptide repeat-containing protein At2g21090 [Benincasa hispida]
KAG7011402.1    0.0e+00   90.34         Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
XP_022967585.1  0.0e+00   90.34         pentatricopeptide repeat-containing protein At2g21090-like [Cucurbita maxima]
XP_022963954.1  0.0e+00   90.02         pentatricopeptide repeat-containing protein At2g21090 [Cucurbita moschata]
XP_008455202.1  0.0e+00   89.53         PREDICTED: pentatricopeptide repeat-containing protein At2g21090 [Cucumis melo] ...

Match Name      E-value   Identity (%)  Description
Q9SKQ4          1.1e-208  58.81         Pentatricopeptide repeat-containing protein At2g21090 OS=Arabidopsis thaliana OX...
Q9SIT7          2.2e-111  35.05         Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX...
O23169          8.4e-111  37.24         Pentatricopeptide repeat-containing protein At4g37170 OS=Arabidopsis thaliana OX...
Q9SY02          2.8e-106  35.24         Pentatricopeptide repeat-containing protein At4g02750 OS=Arabidopsis thaliana OX...
Q9FRI5          1.0e-103  37.80         Pentatricopeptide repeat-containing protein At1g25360 OS=Arabidopsis thaliana OX...

Match Name      E-value   Identity (%)  Description
A0A6J1HUW8      0.0e+00   90.34         pentatricopeptide repeat-containing protein At2g21090-like OS=Cucurbita maxima O...
A0A6J1HLP5      0.0e+00   90.02         pentatricopeptide repeat-containing protein At2g21090 OS=Cucurbita moschata OX=3...
A0A5D3C6K7      0.0e+00   89.53         Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3BZY1      0.0e+00   89.53         pentatricopeptide repeat-containing protein At2g21090 OS=Cucumis melo OX=3656 GN...
A0A0A0K215      0.0e+00   89.20         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G062860 PE=4 SV=1

Match Name      E-value   Identity (%)  Description
AT2G21090.1     8.1e-210  58.81         Pentatricopeptide repeat (PPR-like) superfamily protein
AT2G13600.1     1.6e-112  35.05         Pentatricopeptide repeat (PPR) superfamily protein
AT4G37170.1     6.0e-112  37.24         Pentatricopeptide repeat (PPR) superfamily protein
AT4G02750.1     2.0e-107  35.24         Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G25360.1     7.1e-105  37.80         Pentatricopeptide repeat (PPR) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description   Source   Source Term   Source Description   Alignment

IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 393..440  e-value: 2.8E-15  score: 56.3
    coord: 290..338  e-value: 3.4E-9   score: 36.8

IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 263..290  e-value: 4.1E-6   score: 24.6
    coord: 293..326  e-value: 7.2E-6   score: 23.9
    coord: 395..428  e-value: 1.6E-10  score: 38.5
    coord: 161..190  e-value: 2.2E-4   score: 19.2
    coord: 235..261  e-value: 0.0019   score: 16.3
    coord: 130..161  e-value: 5.1E-7   score: 27.5

IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 130..159  e-value: 4.0E-7   score: 29.8
    coord: 533..560  e-value: 1.3      score: 9.4
    coord: 365..389  e-value: 0.025    score: 14.8
    coord: 102..128  e-value: 0.08     score: 13.2
    coord: 234..260  e-value: 0.0034   score: 17.5
    coord: 161..189  e-value: 3.0E-5   score: 23.9

IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 128..162  score: 11.432693

IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 291..325  score: 10.468099

IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 393..427  score: 13.800334

IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 260..290  score: 9.799459

IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 31..184   e-value: 4.1E-28  score: 100.6
    coord: 301..438  e-value: 2.9E-33  score: 117.6

IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 195..291  e-value: 9.4E-17  score: 62.9
    coord: 439..570  e-value: 2.9E-15  score: 58.1

IPR011990  Tetratricopeptide-like helical domain superfamily  SUPERFAMILY  48452  TPR-like
    coord: 108..551

None  No IPR available  PANTHER  PTHR47924  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 182..584
    coord: 42..287
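
The coordinate ranges in the InterPro table can be turned into match lengths, which is a quick sanity check for repeat annotations: the PPR motif is nominally 35 residues long. The sketch below is illustrative only. It assumes the `coord` values are 1-based, inclusive amino-acid positions on the predicted protein (the table itself does not state this), and `span_length` is a hypothetical helper name.

```python
def span_length(coord):
    """Inclusive length in residues of a 'start..end' coordinate string."""
    start, end = (int(x) for x in coord.split(".."))
    return end - start + 1


# PROSITE PS51375 (PPR) hits copied from the InterPro table.
prosite_ppr = ["128..162", "291..325", "393..427", "260..290"]

for coord in prosite_ppr:
    # Print each range alongside its inclusive length.
    print(coord, span_length(coord))
```

Under that assumption, three of the four PROSITE hits span exactly 35 residues and the lowest-scoring hit (260..290) is shorter.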

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10018885.1  HG10018885.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0005515      protein binding