HG10018237 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10018237
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pentatricopeptide repeat-containing protein
Location: Chr04: 2132483 .. 2134570 (-)
RNA-Seq Expression: HG10018237
Synteny: HG10018237
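The Location field above can be split into its parts programmatically. The following is a minimal sketch only: the `Location` type and `parse_location` helper are hypothetical names introduced for illustration, and the length calculation assumes the coordinates are 1-based and inclusive (the usual GFF-style convention), which this record does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'Chr: start .. end (strand)' string."""
    chrom: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings such as "Chr04: 2132483 .. 2134570 (-)".
_LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text: str) -> Location:
    """Parse a location string into chromosome, start, end and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return Location(chrom, int(start), int(end), strand)


loc = parse_location("Chr04: 2132483 .. 2134570 (-)")
print(loc.chrom, loc.start, loc.end, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption, the feature spans 2088 bases on the minus strand of Chr04.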
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAAGCAAAATCCACGCTGCGGCAAGCAATAGACTTGCTCTGCTCTCGGAGCATTGCCACCTCTGAGGCATACACCCAATTGGTCCTTGAATGTGTTCGTACAAATGAAATCAACCAAGCTAAGAGACTTCAGTCCCACATGGAGCACCATTTTTTCCGACCCACCGATCCCTTTCTCCACAATCAGCTACTTCATTTGTATGCAAAATTTGGCAAGTTTCGAGATGCCCAAAACTTGTTTGATAAAATGCTTGAAAGGGATGTTTTCTCTTGGAATGCTCTGCTCTCTGCGTATGCTAAATCAGGTTCTATCCAGAATTTGCAAGCGACATTTGATCGAATGCCTTTACGGGATTCAGTTTCATACAATACAACCATTGCAGGTTTTGCTGGAAATAGTTGTCCAAAAGAGTCACTTGAGCATTTTAAAAGAATGCAAAGGGAAGGTTTTGAGCCTACGGAGTATACAATTGTAAGCATATTGAATGCATCTGCACAATTGTTGGACTTGAGGCGTGGGAAACAGATTCATGGGAGCGTTATTGTGCATAACTTTTTAGGGAATGTGTTTATTTGTAATGCTTTAACAGATATCTATGCCAAATGTGGTGAGATTGAACAGGCAAGGTGGTTGTTTGATTGTCTTACTAACAAGAATTTGGTTTCTTGGAACTTAATGATATCTGGATATGCAAAGAATGGACAGCCTGAGAAGTGTAGTGGTTTGTTACATGAAATGCAGTTGTCCGGACATATGCCCGATCAATTTACCATGTCAACTATAATTGCAGCTTACTGTCAATCCGGACGTGTAGATGAAGCAAGAAGGGTGTTTAGTGAGTTTAAAGAGAAGGATATTGTTTGTTGGACAGCCATGTTGGTGGGTTATGCAAAAAATGGCAGAGAAGAGGATGCACTATTGTTGTTTAATGAAATGCTATTAGAACAAATTAAACCTGACAGCTACACTTTATCAAGTGTTGTCAGTTCTTGTGCCAAATTGGCATCTCTACATCTTGGTCAGGTAGTCCACGGAAGATCAATTCTTGGTGGACTTAATAATAATTTGCTCGTCTCTAGTGCACTAATTGATATGTATTCTAAATGTGGTTTCATTGATGATGCAAGGTCAGTCTTCAACCTGATGCCAACTAGGAATGTGGTTTCATGGAATGCTATGATTGTTGGTTGTGCACAAAATGGACATGATAAGGATGCCCTTGAACTCTTTGAAAATATGTTACAACAGAAATTTAAACCTGATAATGTAACTTTTATAGGCGTTTTATCTGCTTGTCTCCATTGTAATTGGATTGAGCAAGGGCAGGGGTACTTTGATTCTATAAGCAACCAACATGGAATGACACCTACTTTGGATCATTATGCATGTATGGCCAATCTCCTAGGACGTACGGGCTGCATACATCAAGTAGTTAGTCTAATAAAAAATATGGCCCATGAACCAGATTTCCTGATCTGGTCCACACTTCTATCCATTAGCTCAACAAAGGGTGATATTGTAAATGCAGAAATGGCTGCCAGGCATCTCTTTGAATTGGATCCTACGAGTGCTGTACCATATATTATGCTCTCAAATATGTATGCCTCTATGGGTAGATGGAAGGATGTAGCATCAGTTAGGAATCTCATGAATAGCAAGAATGTGAAGAAGTTTGCTGGGTACAGTTGGATTGAAATTGATAATGAGGTGCACAGATTCACATCTGAAGACCGGACTCATCCAGAAACAGAAAAAATATATGAGGAATTAAACACGTTGATAGGGAAACTTCAAGAAGAAGGATTTACCCCTAATACAAATCTGGTTTTGCATGATGTTGGAGAGGATGAAAAATTCAAATCCATATGTTTCCACAGCGAGAAACTTGCCCTTGCCTTTGGTTTGATTAAGAAACCTAATGGAATTAGTCCAATAAGGATCATAAAGAATATTCGAATTTGCAACGATTGCCATGAATTTATGAAGTTTGCATCTAG
GATTATTAGAAGGCAAATAATATTGAGAGATTCAAATAGGTTTCATCATTTTTCAACTGGGAAGTGCTCCTGCAAGGACAATTGGTAA

mRNA sequence

ATGAAAGCAAAATCCACGCTGCGGCAAGCAATAGACTTGCTCTGCTCTCGGAGCATTGCCACCTCTGAGGCATACACCCAATTGGTCCTTGAATGTGTTCGTACAAATGAAATCAACCAAGCTAAGAGACTTCAGTCCCACATGGAGCACCATTTTTTCCGACCCACCGATCCCTTTCTCCACAATCAGCTACTTCATTTGTATGCAAAATTTGGCAAGTTTCGAGATGCCCAAAACTTGTTTGATAAAATGCTTGAAAGGGATGTTTTCTCTTGGAATGCTCTGCTCTCTGCGTATGCTAAATCAGGTTCTATCCAGAATTTGCAAGCGACATTTGATCGAATGCCTTTACGGGATTCAGTTTCATACAATACAACCATTGCAGGTTTTGCTGGAAATAGTTGTCCAAAAGAGTCACTTGAGCATTTTAAAAGAATGCAAAGGGAAGGTTTTGAGCCTACGGAGTATACAATTGTAAGCATATTGAATGCATCTGCACAATTGTTGGACTTGAGGCGTGGGAAACAGATTCATGGGAGCGTTATTGTGCATAACTTTTTAGGGAATGTGTTTATTTGTAATGCTTTAACAGATATCTATGCCAAATGTGGTGAGATTGAACAGGCAAGGTGGTTGTTTGATTGTCTTACTAACAAGAATTTGGTTTCTTGGAACTTAATGATATCTGGATATGCAAAGAATGGACAGCCTGAGAAGTGTAGTGGTTTGTTACATGAAATGCAGTTGTCCGGACATATGCCCGATCAATTTACCATGTCAACTATAATTGCAGCTTACTGTCAATCCGGACGTGTAGATGAAGCAAGAAGGGTGTTTAGTGAGTTTAAAGAGAAGGATATTGTTTGTTGGACAGCCATGTTGGTGGGTTATGCAAAAAATGGCAGAGAAGAGGATGCACTATTGTTGTTTAATGAAATGCTATTAGAACAAATTAAACCTGACAGCTACACTTTATCAAGTGTTGTCAGTTCTTGTGCCAAATTGGCATCTCTACATCTTGGTCAGGTAGTCCACGGAAGATCAATTCTTGGTGGACTTAATAATAATTTGCTCGTCTCTAGTGCACTAATTGATATGTATTCTAAATGTGGTTTCATTGATGATGCAAGCTCAACAAAGGGTGATATTGTAAATGCAGAAATGGCTGCCAGGCATCTCTTTGAATTGGATCCTACGAGTGCTGTACCATATATTATGCTCTCAAATATGTATGCCTCTATGGGTAGATGGAAGGATGTAGCATCAGTTAGGAATCTCATGAATAGCAAGAATGTGAAGAAGTTTGCTGGGTACAGTTGGATTGAAATTGATAATGAGGTGCACAGATTCACATCTGAAGACCGGACTCATCCAGAAACAGAAAAAATATATGAGGAATTAAACACGTTGATAGGGAAACTTCAAGAAGAAGGATTTACCCCTAATACAAATCTGGTTTTGCATGATGTTGGAGAGGATGAAAAATTCAAATCCATATGTTTCCACAGCGAGAAACTTGCCCTTGCCTTTGGTTTGATTAAGAAACCTAATGGAATTAGTCCAATAAGGATCATAAAGAATATTCGAATTTGCAACGATTGCCATGAATTTATGAAGTTTGCATCTAGGATTATTAGAAGGCAAATAATATTGAGAGATTCAAATAGGTTTCATCATTTTTCAACTGGGAAGTGCTCCTGCAAGGACAATTGGTAA

Coding sequence (CDS)

ATGAAAGCAAAATCCACGCTGCGGCAAGCAATAGACTTGCTCTGCTCTCGGAGCATTGCCACCTCTGAGGCATACACCCAATTGGTCCTTGAATGTGTTCGTACAAATGAAATCAACCAAGCTAAGAGACTTCAGTCCCACATGGAGCACCATTTTTTCCGACCCACCGATCCCTTTCTCCACAATCAGCTACTTCATTTGTATGCAAAATTTGGCAAGTTTCGAGATGCCCAAAACTTGTTTGATAAAATGCTTGAAAGGGATGTTTTCTCTTGGAATGCTCTGCTCTCTGCGTATGCTAAATCAGGTTCTATCCAGAATTTGCAAGCGACATTTGATCGAATGCCTTTACGGGATTCAGTTTCATACAATACAACCATTGCAGGTTTTGCTGGAAATAGTTGTCCAAAAGAGTCACTTGAGCATTTTAAAAGAATGCAAAGGGAAGGTTTTGAGCCTACGGAGTATACAATTGTAAGCATATTGAATGCATCTGCACAATTGTTGGACTTGAGGCGTGGGAAACAGATTCATGGGAGCGTTATTGTGCATAACTTTTTAGGGAATGTGTTTATTTGTAATGCTTTAACAGATATCTATGCCAAATGTGGTGAGATTGAACAGGCAAGGTGGTTGTTTGATTGTCTTACTAACAAGAATTTGGTTTCTTGGAACTTAATGATATCTGGATATGCAAAGAATGGACAGCCTGAGAAGTGTAGTGGTTTGTTACATGAAATGCAGTTGTCCGGACATATGCCCGATCAATTTACCATGTCAACTATAATTGCAGCTTACTGTCAATCCGGACGTGTAGATGAAGCAAGAAGGGTGTTTAGTGAGTTTAAAGAGAAGGATATTGTTTGTTGGACAGCCATGTTGGTGGGTTATGCAAAAAATGGCAGAGAAGAGGATGCACTATTGTTGTTTAATGAAATGCTATTAGAACAAATTAAACCTGACAGCTACACTTTATCAAGTGTTGTCAGTTCTTGTGCCAAATTGGCATCTCTACATCTTGGTCAGGTAGTCCACGGAAGATCAATTCTTGGTGGACTTAATAATAATTTGCTCGTCTCTAGTGCACTAATTGATATGTATTCTAAATGTGGTTTCATTGATGATGCAAGCTCAACAAAGGGTGATATTGTAAATGCAGAAATGGCTGCCAGGCATCTCTTTGAATTGGATCCTACGAGTGCTGTACCATATATTATGCTCTCAAATATGTATGCCTCTATGGGTAGATGGAAGGATGTAGCATCAGTTAGGAATCTCATGAATAGCAAGAATGTGAAGAAGTTTGCTGGGTACAGTTGGATTGAAATTGATAATGAGGTGCACAGATTCACATCTGAAGACCGGACTCATCCAGAAACAGAAAAAATATATGAGGAATTAAACACGTTGATAGGGAAACTTCAAGAAGAAGGATTTACCCCTAATACAAATCTGGTTTTGCATGATGTTGGAGAGGATGAAAAATTCAAATCCATATGTTTCCACAGCGAGAAACTTGCCCTTGCCTTTGGTTTGATTAAGAAACCTAATGGAATTAGTCCAATAAGGATCATAAAGAATATTCGAATTTGCAACGATTGCCATGAATTTATGAAGTTTGCATCTAGGATTATTAGAAGGCAAATAATATTGAGAGATTCAAATAGGTTTCATCATTTTTCAACTGGGAAGTGCTCCTGCAAGGACAATTGGTAA

Protein sequence

MKAKSTLRQAIDLLCSRSIATSEAYTQLVLECVRTNEINQAKRLQSHMEHHFFRPTDPFLHNQLLHLYAKFGKFRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQNLQATFDRMPLRDSVSYNTTIAGFAGNSCPKESLEHFKRMQREGFEPTEYTIVSILNASAQLLDLRRGKQIHGSVIVHNFLGNVFICNALTDIYAKCGEIEQARWLFDCLTNKNLVSWNLMISGYAKNGQPEKCSGLLHEMQLSGHMPDQFTMSTIIAAYCQSGRVDEARRVFSEFKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEQIKPDSYTLSSVVSSCAKLASLHLGQVVHGRSILGGLNNNLLVSSALIDMYSKCGFIDDASSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMNSKNVKKFAGYSWIEIDNEVHRFTSEDRTHPETEKIYEELNTLIGKLQEEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIRRQIILRDSNRFHHFSTGKCSCKDNW
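The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1); `translate` is a hypothetical helper written for illustration, and only the first 30 bases and the last 12 bases of the CDS are used so that the example stays short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS listed above.
print(translate("ATGAAAGCAAAATCCACGCTGCGGCAAGCA"))
# Last 12 bases of the CDS, including the TAA stop codon.
print(translate("GACAATTGGTAA"))
```

The two fragments translate to `MKAKSTLRQA` and `DNW`, which match the start and the end of the protein sequence listed above.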
Homology
BLAST of HG10018237 vs. NCBI nr
Match: XP_038895252.1 (pentatricopeptide repeat-containing protein At2g22070-like [Benincasa hispida] >XP_038895253.1 pentatricopeptide repeat-containing protein At2g22070-like [Benincasa hispida] >XP_038895254.1 pentatricopeptide repeat-containing protein At2g22070-like [Benincasa hispida])

HSP 1 Score: 1041.2 bits (2691), Expect = 3.3e-300
Identity = 535/695 (76.98%), Positives = 550/695 (79.14%), Query Frame = 0

Query: 1   MKAKSTLRQAIDLLCSRSIATSEAYTQLVLECVRTNEINQAKRLQSHMEHHFFRPTDPFL 60
           MKAKS LRQAIDLLCS+S ATSEAYTQLVLECVR N+INQAKRLQSHMEHH F+PTDPFL
Sbjct: 1   MKAKSKLRQAIDLLCSQSTATSEAYTQLVLECVRKNDINQAKRLQSHMEHHLFQPTDPFL 60

Query: 61  HNQLLHLYAKFGKFRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQNLQATFDRMPLRDS 120
           HNQLLHLYAKFGK RDAQNLFDKMLERDVFSWNA+LSA+AKSGSIQNL+ATFD+MP RDS
Sbjct: 61  HNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNAMLSAHAKSGSIQNLRATFDQMPFRDS 120

Query: 121 VSYNTTIAGFAGNSCPKESLEHFKRMQREGFEPTEYTIVSILNASAQLLDLRRGKQIHGS 180
           VSYNTTIAGFAGNSCPKESLE FKRMQREGFEPTEYTIVSILNAS QLLDLRRGKQIHGS
Sbjct: 121 VSYNTTIAGFAGNSCPKESLELFKRMQREGFEPTEYTIVSILNASTQLLDLRRGKQIHGS 180

Query: 181 VIVHNFLGNVFICNALTDIYAKCGEIEQARWLFDCLTNKNLVSWNLMISGYAKNGQPEKC 240
           VIVHNFLGNVFICNALTD+YAKCGEIEQARWLFDC TNKNLVSWNLMISGYAKNG+PEKC
Sbjct: 181 VIVHNFLGNVFICNALTDMYAKCGEIEQARWLFDCHTNKNLVSWNLMISGYAKNGKPEKC 240

Query: 241 SGLLHEMQLSGHMPDQFTMSTIIAAYCQSGRVDEARRVFSEFKEKDIVCWTAMLVGYAKN 300
            GLLHEM+LSGHMPDQ TMSTIIAAYCQ GRVD AR+VFSEFKEKDIVCWTAMLVGYAKN
Sbjct: 241 IGLLHEMRLSGHMPDQVTMSTIIAAYCQCGRVDAARKVFSEFKEKDIVCWTAMLVGYAKN 300

Query: 301 GREEDALLLFNEMLLEQIKPDSYTLSSVVSSCAKLASLHLGQVVHGRSILGGLNNNLLVS 360
           GREEDAL LFNEMLLE I+PDSYTLSSVVSSCAKLA LH GQ VHG+SIL GLNNNLLVS
Sbjct: 301 GREEDALSLFNEMLLEHIEPDSYTLSSVVSSCAKLAFLHHGQAVHGKSILAGLNNNLLVS 360

Query: 361 SALIDMYSKCGFIDDA-------------------------------------------- 420
           SALIDMYSKCGFIDDA                                            
Sbjct: 361 SALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQNGHDKDALELFENMLQQKFK 420

Query: 421 ------------------------------------------------------------ 480
                                                                       
Sbjct: 421 PDNVTFIGVLSACLHCNWIEQGQGYFDSISNQHGLTPTLDHYACMVNLLGRTGRIGQAVS 480

Query: 481 --------------------SSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGR 540
                               SSTKGD+VNAEMAA+HLFELDPTSAVPYIMLSNMYASMGR
Sbjct: 481 LIKNMAHEPDFLIWSTLLSISSTKGDVVNAEMAAKHLFELDPTSAVPYIMLSNMYASMGR 540

Query: 541 WKDVASVRNLMNSKNVKKFAGYSWIEIDNEVHRFTSEDRTHPETEKIYEELNTLIGKLQE 572
           WKDVASVRNLMNSKNVKKFAGYSWIEIDNEV RFTSEDRTHPETEKIYEELN LIGKLQE
Sbjct: 541 WKDVASVRNLMNSKNVKKFAGYSWIEIDNEVRRFTSEDRTHPETEKIYEELNMLIGKLQE 600
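
The percentages in an HSP summary line are the ratio of matching (or positively scoring) positions to the alignment length. A minimal sketch of that arithmetic follows; `hsp_percentages` is a hypothetical helper, and the regular expression is an assumption about the line layout shown on this page rather than a general BLAST parser.

```python
import re

# Accepts both "Positives" and the "Postives" spelling seen in some exports.
_HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def hsp_percentages(line: str) -> tuple[float, float]:
    """Return (percent identity, percent positives) for one HSP summary line."""
    match = _HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


line = "Identity = 535/695 (76.98%), Positives = 550/695 (79.14%), Query Frame = 0"
print(hsp_percentages(line))
```

For the first match, 535 of 695 aligned positions are identical and 550 of 695 are positive, which reproduces the 76.98% and 79.14% figures reported in its summary line.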

BLAST of HG10018237 vs. NCBI nr
Match: KAA0051836.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK21410.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1023.5 bits (2645), Expect = 7.2e-295
Identity = 527/695 (75.83%), Positives = 544/695 (78.27%), Query Frame = 0

Query: 1   MKAKSTLRQAIDLLCSRSIATSEAYTQLVLECVRTNEINQAKRLQSHMEHHFFRPTDPFL 60
           MKAKSTLRQ++DLLCSRS ATSEAYTQLVLECVRTNEINQAKRLQSHMEHH F+PTDPFL
Sbjct: 1   MKAKSTLRQSVDLLCSRSTATSEAYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDPFL 60

Query: 61  HNQLLHLYAKFGKFRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQNLQATFDRMPLRDS 120
           HNQLLHLYAKFGK RDAQNLFDKML+RD FSWNALLSAYAKSGSIQNL+ATFDRMP RDS
Sbjct: 61  HNQLLHLYAKFGKLRDAQNLFDKMLKRDTFSWNALLSAYAKSGSIQNLKATFDRMPFRDS 120

Query: 121 VSYNTTIAGFAGNSCPKESLEHFKRMQREGFEPTEYTIVSILNASAQLLDLRRGKQIHGS 180
           VSYNTTIAGF+GNSCP+ESL+ FKRMQREGFEPTEYTIVSILNASAQLLDLR GKQIHGS
Sbjct: 121 VSYNTTIAGFSGNSCPQESLQLFKRMQREGFEPTEYTIVSILNASAQLLDLRYGKQIHGS 180

Query: 181 VIVHNFLGNVFICNALTDIYAKCGEIEQARWLFDCLTNKNLVSWNLMISGYAKNGQPEKC 240
           +IV NFLGNVFI N LTD+YAKCGEIEQARWLFDCLT KNLVSWNLMISGYAKNGQPEKC
Sbjct: 181 IIVRNFLGNVFIWNTLTDMYAKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKC 240

Query: 241 SGLLHEMQLSGHMPDQFTMSTIIAAYCQSGRVDEARRVFSEFKEKDIVCWTAMLVGYAKN 300
            GLLH+M+LSGHMP+Q TMSTIIAAYCQ GRVDEARRVFSEFKEKDIVCWTAMLVGYAKN
Sbjct: 241 IGLLHQMRLSGHMPNQVTMSTIIAAYCQCGRVDEARRVFSEFKEKDIVCWTAMLVGYAKN 300

Query: 301 GREEDALLLFNEMLLEQIKPDSYTLSSVVSSCAKLASLHLGQVVHGRSILGGLNNNLLVS 360
           GREEDALLLFNEMLLE I+PDSYTLSSVVSSCAKLASLH GQ VHG+SIL GLNNNLLVS
Sbjct: 301 GREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVS 360

Query: 361 SALIDMYSKCGFIDDA-------------------------------------------- 420
           SALIDMYSKCGFIDDA                                            
Sbjct: 361 SALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQNGHDKDALELFENMLQQKFK 420

Query: 421 ------------------------------------------------------------ 480
                                                                       
Sbjct: 421 PDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRTGRIEQAVS 480

Query: 481 --------------------SSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGR 540
                                STKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGR
Sbjct: 481 LIKNMAHEPDFLIWSTLLSICSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGR 540

Query: 541 WKDVASVRNLMNSKNVKKFAGYSWIEIDNEVHRFTSEDRTHPETEKIYEELNTLIGKLQE 572
           WK VASVRNLM SKNVKKFAG+SWIEID EVHRFTSEDRTHPE+E IYEELN LIGKLQE
Sbjct: 541 WKYVASVRNLMKSKNVKKFAGFSWIEIDKEVHRFTSEDRTHPESENIYEELNILIGKLQE 600

BLAST of HG10018237 vs. NCBI nr
Match: XP_004147314.1 (putative pentatricopeptide repeat-containing protein At1g68930 [Cucumis sativus] >KGN64799.1 hypothetical protein Csa_014093 [Cucumis sativus])

HSP 1 Score: 1023.1 bits (2644), Expect = 9.4e-295
Identity = 526/695 (75.68%), Positives = 545/695 (78.42%), Query Frame = 0

Query: 1   MKAKSTLRQAIDLLCSRSIATSEAYTQLVLECVRTNEINQAKRLQSHMEHHFFRPTDPFL 60
           MKAKS LRQ++DLLCSRS ATSEAYTQLVLECVRTNEINQAKRLQSHMEHH F+PTD FL
Sbjct: 1   MKAKSMLRQSVDLLCSRSTATSEAYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDSFL 60

Query: 61  HNQLLHLYAKFGKFRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQNLQATFDRMPLRDS 120
           HNQLLHLYAKFGK RDAQNLFDKML+RD+FSWNALLSAYAKSGSIQNL+ATFDRMP RDS
Sbjct: 61  HNQLLHLYAKFGKLRDAQNLFDKMLKRDIFSWNALLSAYAKSGSIQNLKATFDRMPFRDS 120

Query: 121 VSYNTTIAGFAGNSCPKESLEHFKRMQREGFEPTEYTIVSILNASAQLLDLRRGKQIHGS 180
           VSYNTTIAGF+GNSCP+ESLE FKRMQREGFEPTEYTIVSILNASAQL DLR GKQIHGS
Sbjct: 121 VSYNTTIAGFSGNSCPQESLELFKRMQREGFEPTEYTIVSILNASAQLSDLRYGKQIHGS 180

Query: 181 VIVHNFLGNVFICNALTDIYAKCGEIEQARWLFDCLTNKNLVSWNLMISGYAKNGQPEKC 240
           +IV NFLGNVFI NALTD+YAKCGEIEQARWLFDCLT KNLVSWNLMISGYAKNGQPEKC
Sbjct: 181 IIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKC 240

Query: 241 SGLLHEMQLSGHMPDQFTMSTIIAAYCQSGRVDEARRVFSEFKEKDIVCWTAMLVGYAKN 300
            GLLH+M+LSGHMPDQ TMSTIIAAYCQ GRVDEARRVFSEFKEKDIVCWTAM+VGYAKN
Sbjct: 241 IGLLHQMRLSGHMPDQVTMSTIIAAYCQCGRVDEARRVFSEFKEKDIVCWTAMMVGYAKN 300

Query: 301 GREEDALLLFNEMLLEQIKPDSYTLSSVVSSCAKLASLHLGQVVHGRSILGGLNNNLLVS 360
           GREEDALLLFNEMLLE I+PDSYTLSSVVSSCAKLASLH GQ VHG+SIL GLNNNLLVS
Sbjct: 301 GREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVS 360

Query: 361 SALIDMYSKCGFIDDA-------------------------------------------- 420
           SALIDMYSKCGFIDDA                                            
Sbjct: 361 SALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQNGHDKDALELFENMLQQKFK 420

Query: 421 ------------------------------------------------------------ 480
                                                                       
Sbjct: 421 PDNVTFIGILSACLHCNWIEQGQEYFDSITNQHGMTPTLDHYACMVNLLGRTGRIEQAVA 480

Query: 481 --------------------SSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGR 540
                                STKGDIVNAE+AARHLFELDPT AVPYIMLSNMYASMGR
Sbjct: 481 LIKNMAHDPDFLIWSTLLSICSTKGDIVNAEVAARHLFELDPTIAVPYIMLSNMYASMGR 540

Query: 541 WKDVASVRNLMNSKNVKKFAGYSWIEIDNEVHRFTSEDRTHPETEKIYEELNTLIGKLQE 572
           WKDVASVRNLM SKNVKKFAG+SWIEIDNEVHRFTSEDRTHPE+E IYE+LN LIGKLQE
Sbjct: 541 WKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESEDIYEKLNMLIGKLQE 600

BLAST of HG10018237 vs. NCBI nr
Match: XP_022145099.1 (pentatricopeptide repeat-containing protein At4g02750-like [Momordica charantia] >XP_022145100.1 pentatricopeptide repeat-containing protein At4g02750-like [Momordica charantia])

HSP 1 Score: 979.5 bits (2531), Expect = 1.2e-281
Identity = 500/695 (71.94%), Positives = 531/695 (76.40%), Query Frame = 0

Query: 1   MKAKSTLRQAIDLLCSRSIATSEAYTQLVLECVRTNEINQAKRLQSHMEHHFFRPTDPFL 60
           M+AK  LRQAIDLLCSR  A+SEAYT L+LECVRTNE++QAKRLQSHMEHH F+P DPFL
Sbjct: 1   MQAKPKLRQAIDLLCSRGSASSEAYTHLILECVRTNEVDQAKRLQSHMEHHLFQPPDPFL 60

Query: 61  HNQLLHLYAKFGKFRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQNLQATFDRMPLRDS 120
            NQLLHLYAKFGK RDAQNLFDKMLERDVFSWNALLSAYAKSGSIQNLQATFDRMP RDS
Sbjct: 61  QNQLLHLYAKFGKVRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQNLQATFDRMPFRDS 120

Query: 121 VSYNTTIAGFAGNSCPKESLEHFKRMQREGFEPTEYTIVSILNASAQLLDLRRGKQIHGS 180
           VSYNTTIAGFAGN CPKESLE F+RMQ EGF PTEYT VS LNA+AQLLDLRRGK+IHGS
Sbjct: 121 VSYNTTIAGFAGNGCPKESLELFRRMQSEGFVPTEYTNVSALNAAAQLLDLRRGKEIHGS 180

Query: 181 VIVHNFLGNVFICNALTDIYAKCGEIEQARWLFDCLTNKNLVSWNLMISGYAKNGQPEKC 240
           VIVH FLGN FI NALTD+YAKCGEIEQARWLFD L NKNL+SWNLMISGY KNGQPEKC
Sbjct: 181 VIVHKFLGNTFIWNALTDMYAKCGEIEQARWLFDHLANKNLISWNLMISGYVKNGQPEKC 240

Query: 241 SGLLHEMQLSGHMPDQFTMSTIIAAYCQSGRVDEARRVFSEFKEKDIVCWTAMLVGYAKN 300
            GLLHEMQ+SGHMPDQ TMSTIIAAYCQ   VDEAR+VFSEFKEKDIVCWTAMLVGYAKN
Sbjct: 241 IGLLHEMQMSGHMPDQVTMSTIIAAYCQCRCVDEARKVFSEFKEKDIVCWTAMLVGYAKN 300

Query: 301 GREEDALLLFNEMLLEQIKPDSYTLSSVVSSCAKLASLHLGQVVHGRSILGGLNNNLLVS 360
           GREEDALLLFNEMLLE +KPDSYTLSSVVSSCAKLASL+ GQ VHG+SIL GL+NNLLVS
Sbjct: 301 GREEDALLLFNEMLLEHVKPDSYTLSSVVSSCAKLASLYHGQAVHGKSILAGLDNNLLVS 360

Query: 361 SALIDMYSKCGFIDDA-------------------------------------------- 420
           SALIDMYSKCGF+D+A                                            
Sbjct: 361 SALIDMYSKCGFVDNARSVFNMMPTRNVISWNAMIVGYAQNGHDKDALAFFENMLQQKFK 420

Query: 421 ------------------------------------------------------------ 480
                                                                       
Sbjct: 421 PDNVTFIGVLSACLHSNWIEKGQGYFDSISNQHGLIPTVDHYACMVNLLGRLGRIDQAVD 480

Query: 481 --------------------SSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGR 540
                               S+ KGDI NAEMAAR+LFELDP +AVPY+MLSNMYA MGR
Sbjct: 481 LIKSMPHEPDCLIWSTLLSVSAVKGDIANAEMAARYLFELDPLNAVPYVMLSNMYACMGR 540

Query: 541 WKDVASVRNLMNSKNVKKFAGYSWIEIDNEVHRFTSEDRTHPETEKIYEELNTLIGKLQE 572
           WKDVASVR LM SKNVKKFAGYSWIEIDN+VH+FTSEDRTHPETEKIYEELN LI K QE
Sbjct: 541 WKDVASVRTLMKSKNVKKFAGYSWIEIDNQVHKFTSEDRTHPETEKIYEELNMLIRKFQE 600

BLAST of HG10018237 vs. NCBI nr
Match: XP_022960689.1 (pentatricopeptide repeat-containing protein At4g02750-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 966.5 bits (2497), Expect = 1.0e-277
Identity = 493/695 (70.94%), Positives = 530/695 (76.26%), Query Frame = 0

Query: 1   MKAKSTLRQAIDLLCSRSIATSEAYTQLVLECVRTNEINQAKRLQSHMEHHFFRPTDPFL 60
           MKAKS LRQA+DLLCSRS ATSEAYTQLVLECVR NEI+QAKRLQSHMEHH F+P DPFL
Sbjct: 1   MKAKSKLRQAVDLLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFL 60

Query: 61  HNQLLHLYAKFGKFRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQNLQATFDRMPLRDS 120
           HNQLLHLYAKFGK RDAQNLFDKMLERDVFSWNALLSAYAKSGSIQ+L+ATFDRMP RDS
Sbjct: 61  HNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQDLRATFDRMPYRDS 120

Query: 121 VSYNTTIAGFAGNSCPKESLEHFKRMQREGFEPTEYTIVSILNASAQLLDLRRGKQIHGS 180
           VSYNT IAG +GNS PKESLE F+RMQREG  PTEYT VS LNASAQLLDLRRGKQIHGS
Sbjct: 121 VSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGS 180

Query: 181 VIVHNFLGNVFICNALTDIYAKCGEIEQARWLFDCLTNKNLVSWNLMISGYAKNGQPEKC 240
           VIVHN+LGNVFICNALTD+YAKCGEIEQARWLFD LTNKNLVSWNLMISGY KNGQPEKC
Sbjct: 181 VIVHNYLGNVFICNALTDMYAKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKC 240

Query: 241 SGLLHEMQLSGHMPDQFTMSTIIAAYCQSGRVDEARRVFSEFKEKDIVCWTAMLVGYAKN 300
            GLLH+M+LSGHMPDQ T+ST+IAAYCQ GR DEARRVF+EFK+KDIVCWTAMLVGYAK+
Sbjct: 241 IGLLHDMRLSGHMPDQVTLSTVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKS 300

Query: 301 GREEDALLLFNEMLLEQIKPDSYTLSSVVSSCAKLASLHLGQVVHGRSILGGLNNNLLVS 360
           GREEDALLLFNEMLLE  +PDSYTLSSVVSSCAKLASL+ GQ +HG+SIL GL+NNLLVS
Sbjct: 301 GREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVS 360

Query: 361 SALIDMYSKCGFIDDA-------------------------------------------- 420
           SALIDMYSKCG I+DA                                            
Sbjct: 361 SALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQNGRDKDTLELFENMLQEKFK 420

Query: 421 ------------------------------------------------------------ 480
                                                                       
Sbjct: 421 PDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVD 480

Query: 481 --------------------SSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGR 540
                               S+TKGD+ +AEM  RHLFELDPT+AVPYIMLSNMYASMGR
Sbjct: 481 LIKSMPHEPDFLIWSTLLSVSATKGDVASAEMGGRHLFELDPTNAVPYIMLSNMYASMGR 540

Query: 541 WKDVASVRNLMNSKNVKKFAGYSWIEIDNEVHRFTSEDRTHPETEKIYEELNTLIGKLQE 572
           WKDVA+VR++M +KNVKKFAGYSWIEIDNEVH+FTSEDRTHPETE+IYEEL  LI KL+E
Sbjct: 541 WKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEE 600

BLAST of HG10018237 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 358.2 bits (918), Expect = 1.7e-97
Identity = 206/632 (32.59%), Positives = 323/632 (51.11%), Query Frame = 0

Query: 5   STLRQAIDLLCSRSIATSEAYTQLVLECVRTNEINQAKRLQSHMEHHFFRPTDPFLHNQL 64
           S L+  + ++    +  S  +  ++  C ++    + +++  H+        D ++H  L
Sbjct: 117 SALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQIHGHV-LKLGCDLDLYVHTSL 176

Query: 65  LHLYAKFGKFRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQNLQATFDRMPLRDSVSYN 124
           + +Y + G+  DA  +FDK   RDV S+ AL+  YA  G I+N Q  FD +P++D VS+N
Sbjct: 177 ISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWN 236

Query: 125 TTIAGFAGNSCPKESLEHFKRMQREGFEPTEYTIVSILNASAQLLDLRRGKQIHGSVIVH 184
             I+G+A     KE+LE FK M +    P E T+V++++A AQ   +  G+Q+H  +  H
Sbjct: 237 AMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDH 296

Query: 185 NFLGNVFICNALTDIYAKCGEIEQARWLFDCLTNKNLVSWNLMISGYAKNGQPEKCSGLL 244
            F  N+ I NAL D+Y+KCGE+E A  LF+ L  K+++SWN +I GY      ++   L 
Sbjct: 297 GFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLF 356

Query: 245 HEMQLSGHMPDQFTMSTIIAA-------------------------------------YC 304
            EM  SG  P+  TM +I+ A                                     Y 
Sbjct: 357 QEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYA 416

Query: 305 QSGRVDEARRVFSEFKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEQIKPDSYTLSS 364
           + G ++ A +VF+    K +  W AM+ G+A +GR + +  LF+ M    I+PD  T   
Sbjct: 417 KCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVG 476

Query: 365 VVSSCAKLASLHLGQVVHGRSILGGLNNNLLVS------SALIDMYSKCGFIDDAS---- 424
           ++S+C+     H G +  GR I   +  +  ++        +ID+    G   +A     
Sbjct: 477 LLSACS-----HSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMIN 536

Query: 425 ------------------STKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKD 484
                                G++   E  A +L +++P +   Y++LSN+YAS GRW +
Sbjct: 537 MMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNE 596

Query: 485 VASVRNLMNSKNVKKFAGYSWIEIDNEVHRFTSEDRTHPETEKIYEELNTLIGKLQEEGF 544
           VA  R L+N K +KK  G S IEID+ VH F   D+ HP   +IY  L  +   L++ GF
Sbjct: 597 VAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGF 656

Query: 545 TPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMK 572
            P+T+ VL ++ E+ K  ++  HSEKLA+AFGLI    G + + I+KN+R+C +CHE  K
Sbjct: 657 VPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPG-TKLTIVKNLRVCRNCHEATK 716

BLAST of HG10018237 vs. ExPASy Swiss-Prot
Match: O23169 (Pentatricopeptide repeat-containing protein At4g37170 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H5 PE=3 SV=1)

HSP 1 Score: 349.7 bits (896), Expect = 6.0e-95
Identity = 202/629 (32.11%), Positives = 328/629 (52.15%), Query Frame = 0

Query: 7   LRQAIDLLCSRSIATSEAYTQLVLECVRTNEINQAKRLQSHMEHHFFRPTDPFLHNQLLH 66
           LR+A+ LL       +  Y  L+  C +T  + + K++  H+    F P    + N+LL 
Sbjct: 70  LREAVQLLGRAKKPPASTYCNLIQVCSQTRALEEGKKVHEHIRTSGFVP-GIVIWNRLLR 129

Query: 67  LYAKFGKFRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQNLQATFDRMPLRDSVSYNTT 126
           +YAK G   DA+ +FD+M  RD+ SWN +++ YA+ G ++  +  FD M  +DS S+   
Sbjct: 130 MYAKCGSLVDARKVFDEMPNRDLCSWNVMVNGYAEVGLLEEARKLFDEMTEKDSYSWTAM 189

Query: 127 IAGFAGNSCPKESLEHFKRMQR-EGFEPTEYTIVSILNASAQLLDLRRGKQIHGSVIVHN 186
           + G+     P+E+L  +  MQR     P  +T+   + A+A +  +RRGK+IHG ++   
Sbjct: 190 VTGYVKKDQPEEALVLYSLMQRVPNSRPNIFTVSIAVAAAAAVKCIRRGKEIHGHIVRAG 249

Query: 187 FLGNVFICNALTDIYAKCGEIEQARWLFDCLTNKNLVSWNLMISGYAKN----------- 246
              +  + ++L D+Y KCG I++AR +FD +  K++VSW  MI  Y K+           
Sbjct: 250 LDSDEVLWSSLMDMYGKCGCIDEARNIFDKIVEKDVVSWTSMIDRYFKSSRWREGFSLFS 309

Query: 247 ---------------GQPEKCSGLLHE---MQLSGHM------PDQFTMSTIIAAYCQSG 306
                          G    C+ L  E    Q+ G+M      P  F  S+++  Y + G
Sbjct: 310 ELVGSCERPNEYTFAGVLNACADLTTEELGKQVHGYMTRVGFDPYSFASSSLVDMYTKCG 369

Query: 307 RVDEARRVFSEFKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEQIKPDSYTLSSVVS 366
            ++ A+ V     + D+V WT+++ G A+NG+ ++AL  F+ +L    KPD  T  +V+S
Sbjct: 370 NIESAKHVVDGCPKPDLVSWTSLIGGCAQNGQPDEALKYFDLLLKSGTKPDHVTFVNVLS 429

Query: 367 SCAKLASLHLGQVVHGRSILGGLNNNLLVS------SALIDMYSKCGFIDD--------- 426
           +C      H G V  G      +     +S      + L+D+ ++ G  +          
Sbjct: 430 ACT-----HAGLVEKGLEFFYSITEKHRLSHTSDHYTCLVDLLARSGRFEQLKSVISEMP 489

Query: 427 -------------ASSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVAS 486
                          ST G+I  AE AA+ LF+++P + V Y+ ++N+YA+ G+W++   
Sbjct: 490 MKPSKFLWASVLGGCSTYGNIDLAEEAAQELFKIEPENPVTYVTMANIYAAAGKWEEEGK 549

Query: 487 VRNLMNSKNVKKFAGYSWIEIDNEVHRFTSEDRTHPETEKIYEELNTLIGKLQEEGFTPN 546
           +R  M    V K  G SW EI  + H F + D +HP   +I E L  L  K++EEG+ P 
Sbjct: 550 MRKRMQEIGVTKRPGSSWTEIKRKRHVFIAADTSHPMYNQIVEFLRELRKKMKEEGYVPA 609

Query: 547 TNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFAS 572
           T+LVLHDV +++K +++ +HSEKLA+AF ++    G + I++ KN+R C DCH  +KF S
Sbjct: 610 TSLVLHDVEDEQKEENLVYHSEKLAVAFAILSTEEG-TAIKVFKNLRSCVDCHGAIKFIS 669

BLAST of HG10018237 vs. ExPASy Swiss-Prot
Match: Q9CAA8 (Putative pentatricopeptide repeat-containing protein At1g68930 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H22 PE=3 SV=1)

HSP 1 Score: 346.7 bits (888), Expect = 5.1e-94
Identity = 193/566 (34.10%), Positives = 305/566 (53.89%), Query Frame = 0

Query: 64  LLHLYAKFGKFRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQNLQATFDRMPLRDSVSY 123
           LL++YA  G   DA+ +F  + +R+   +N+L+      G I++    F  M  +DSVS+
Sbjct: 180 LLYMYANVGCISDAKKVFYGLDDRNTVMYNSLMGGLLACGMIEDALQLFRGME-KDSVSW 239

Query: 124 NTTIAGFAGNSCPKESLEHFKRMQREGFEPTEYTIVSILNASAQLLDLRRGKQIHGSVIV 183
              I G A N   KE++E F+ M+ +G +  +Y   S+L A   L  +  GKQIH  +I 
Sbjct: 240 AAMIKGLAQNGLAKEAIECFREMKVQGLKMDQYPFGSVLPACGGLGAINEGKQIHACIIR 299

Query: 184 HNFLGNVFICNALTDIYAKCGEIEQARWLFDCLTNKNLVSWNLMISGYAKNGQPEKCSGL 243
            NF  ++++ +AL D+Y KC  +  A+ +FD +  KN+VSW  M+ GY + G+ E+   +
Sbjct: 300 TNFQDHIYVGSALIDMYCKCKCLHYAKTVFDRMKQKNVVSWTAMVVGYGQTGRAEEAVKI 359

Query: 244 LHEMQLSGHMPDQFTMSTIIAA-----------------------------------YCQ 303
             +MQ SG  PD +T+   I+A                                   Y +
Sbjct: 360 FLDMQRSGIDPDHYTLGQAISACANVSSLEEGSQFHGKAITSGLIHYVTVSNSLVTLYGK 419

Query: 304 SGRVDEARRVFSEFKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEQIKPDSYTLSSV 363
            G +D++ R+F+E   +D V WTAM+  YA+ GR  + + LF++M+   +KPD  TL+ V
Sbjct: 420 CGDIDDSTRLFNEMNVRDAVSWTAMVSAYAQFGRAVETIQLFDKMVQHGLKPDGVTLTGV 479

Query: 364 VSSCAKLASLHLGQ-VVHGRSILGGLNNNLLVSSALIDMYSKCGFIDD------------ 423
           +S+C++   +  GQ      +   G+  ++   S +ID++S+ G +++            
Sbjct: 480 ISACSRAGLVEKGQRYFKLMTSEYGIVPSIGHYSCMIDLFSRSGRLEEAMRFINGMPFPP 539

Query: 424 ----------ASSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRN 483
                     A   KG++   + AA  L ELDP     Y +LS++YAS G+W  VA +R 
Sbjct: 540 DAIGWTTLLSACRNKGNLEIGKWAAESLIELDPHHPAGYTLLSSIYASKGKWDSVAQLRR 599

Query: 484 LMNSKNVKKFAGYSWIEIDNEVHRFTSEDRTHPETEKIYEELNTLIGKLQEEGFTPNTNL 543
            M  KNVKK  G SWI+   ++H F+++D + P  ++IY +L  L  K+ + G+ P+T+ 
Sbjct: 600 GMREKNVKKEPGQSWIKWKGKLHSFSADDESSPYLDQIYAKLEELNNKIIDNGYKPDTSF 659

Query: 544 VLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRII 572
           V HDV E  K K + +HSE+LA+AFGLI  P+G  PIR+ KN+R+C DCH   K  S + 
Sbjct: 660 VHHDVEEAVKVKMLNYHSERLAIAFGLIFVPSG-QPIRVGKNLRVCVDCHNATKHISSVT 719

BLAST of HG10018237 vs. ExPASy Swiss-Prot
Match: Q9M4P3 (Pentatricopeptide repeat-containing protein At4g16835, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=DYW10 PE=2 SV=3)

HSP 1 Score: 343.2 bits (879), Expect = 5.7e-93
Identity = 207/610 (33.93%), Positives = 315/610 (51.64%), Query Frame = 0

Query: 27  QLVLECVRTNEINQAKRLQSHMEHHFFRPTDPFLHNQLLHLYAKF-GKFRDAQNLFDKML 86
           +++  CVR+ +I+ A R+      H  R  +    N LL   +K   +  +A  LFD++ 
Sbjct: 66  KIIARCVRSGDIDGALRV-----FHGMRAKNTITWNSLLIGISKDPSRMMEAHQLFDEIP 125

Query: 87  ERDVFSWNALLSAYAKSGSIQNLQATFDRMPLRDSVSYNTTIAGFAGNSCPKESLEHFKR 146
           E D FS+N +LS Y ++ + +  Q+ FDRMP +D+ S+NT I G+A     +++ E F  
Sbjct: 126 EPDTFSYNIMLSCYVRNVNFEKAQSFFDRMPFKDAASWNTMITGYARRGEMEKARELFYS 185

Query: 147 MQREGFEPTEYTIVSILNASAQLLDLRRGKQIHGSVIVHNFLGNVFICNALTDIYAKCGE 206
           M     E  E +  ++++   +  DL +         V      V    A+   Y K  +
Sbjct: 186 M----MEKNEVSWNAMISGYIECGDLEKASHFFKVAPVR----GVVAWTAMITGYMKAKK 245

Query: 207 IEQARWLF-DCLTNKNLVSWNLMISGYAKNGQPEKCSGLLHEMQLSGHMP---------- 266
           +E A  +F D   NKNLV+WN MISGY +N +PE    L   M   G  P          
Sbjct: 246 VELAEAMFKDMTVNKNLVTWNAMISGYVENSRPEDGLKLFRAMLEEGIRPNSSGLSSALL 305

Query: 267 -------------------------DQFTMSTIIAAYCQSGRVDEARRVFSEFKEKDIVC 326
                                    D   ++++I+ YC+ G + +A ++F   K+KD+V 
Sbjct: 306 GCSELSALQLGRQIHQIVSKSTLCNDVTALTSLISMYCKCGELGDAWKLFEVMKKKDVVA 365

Query: 327 WTAMLVGYAKNGREEDALLLFNEMLLEQIKPDSYTLSSVVSSCAKLASLHLGQVVHGRSI 386
           W AM+ GYA++G  + AL LF EM+  +I+PD  T  +V+ +C      H G V  G + 
Sbjct: 366 WNAMISGYAQHGNADKALCLFREMIDNKIRPDWITFVAVLLACN-----HAGLVNIGMAY 425

Query: 387 LGGLNNNLLVS------SALIDMYSKCGFIDDA------------SSTKGDIVN------ 446
              +  +  V       + ++D+  + G +++A            ++  G ++       
Sbjct: 426 FESMVRDYKVEPQPDHYTCMVDLLGRAGKLEEALKLIRSMPFRPHAAVFGTLLGACRVHK 485

Query: 447 ----AEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMNSKNVKKFAGYSWI 506
               AE AA  L +L+  +A  Y+ L+N+YAS  RW+DVA VR  M   NV K  GYSWI
Sbjct: 486 NVELAEFAAEKLLQLNSQNAAGYVQLANIYASKNRWEDVARVRKRMKESNVVKVPGYSWI 545

Query: 507 EIDNEVHRFTSEDRTHPETEKIYEELNTLIGKLQEEGFTPNTNLVLHDVGEDEKFKSICF 566
           EI N+VH F S DR HPE + I+++L  L  K++  G+ P     LH+V E++K K + +
Sbjct: 546 EIRNKVHHFRSSDRIHPELDSIHKKLKELEKKMKLAGYKPELEFALHNVEEEQKEKLLLW 605

Query: 567 HSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIRRQIILRDSNRFHHFS 572
           HSEKLA+AFG IK P G S I++ KN+RIC DCH+ +KF S I +R+II+RD+ RFHHF 
Sbjct: 606 HSEKLAVAFGCIKLPQG-SQIQVFKNLRICGDCHKAIKFISEIEKREIIVRDTTRFHHFK 656

BLAST of HG10018237 vs. ExPASy Swiss-Prot
Match: Q9LIQ7 (Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H87 PE=3 SV=1)

HSP 1 Score: 340.1 bits (871), Expect = 4.8e-92
Identity = 189/579 (32.64%), Positives = 305/579 (52.68%), Query Frame = 0

Query: 55  PTDPFLHNQLLHLYAKF-----GKFRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQNLQ 114
           P D   +N LL     F     G+   A ++   +   D+   N LL+ YAK GS++  +
Sbjct: 57  PADRRFYNTLLKKCTVFKLLIQGRIVHA-HILQSIFRHDIVMGNTLLNMYAKCGSLEEAR 116

Query: 115 ATFDRMPLRDSVSYNTTIAGFAGNSCPKESLEHFKRMQREGFEPTEYTIVSILNASAQLL 174
             F++MP RD V++ T I+G++ +  P ++L  F +M R G+ P E+T+ S++ A+A   
Sbjct: 117 KVFEKMPQRDFVTWTTLISGYSQHDRPCDALLFFNQMLRFGYSPNEFTLSSVIKAAAAER 176

Query: 175 DLRRGKQIHGSVIVHNFLGNVFICNALTDIYAKCGEIEQARWLFDCLTNKNLVSWNLMIS 234
               G Q+HG  +   F  NV + +AL D+Y + G ++ A+ +FD L ++N VSWN +I+
Sbjct: 177 RGCCGHQLHGFCVKCGFDSNVHVGSALLDLYTRYGLMDDAQLVFDALESRNDVSWNALIA 236

Query: 235 GYAKNGQPEKCSGLLHEMQLSGHMPDQFTMSTIIAA------------------------ 294
           G+A+    EK   L   M   G  P  F+ +++  A                        
Sbjct: 237 GHARRSGTEKALELFQGMLRDGFRPSHFSYASLFGACSSTGFLEQGKWVHAYMIKSGEKL 296

Query: 295 -----------YCQSGRVDEARRVFSEFKEKDIVCWTAMLVGYAKNGREEDALLLFNEML 354
                      Y +SG + +AR++F    ++D+V W ++L  YA++G  ++A+  F EM 
Sbjct: 297 VAFAGNTLLDMYAKSGSIHDARKIFDRLAKRDVVSWNSLLTAYAQHGFGKEAVWWFEEMR 356

Query: 355 LEQIKPDSYTLSSVVSSCAKLASLHLGQVVHGRSILGGLNNNLLVSSALIDMYSKCG--- 414
              I+P+  +  SV+++C+    L  G   +      G+         ++D+  + G   
Sbjct: 357 RVGIRPNEISFLSVLTACSHSGLLDEGWHYYELMKKDGIVPEAWHYVTVVDLLGRAGDLN 416

Query: 415 ----FIDD-----ASSTKGDIVNA----------EMAARHLFELDPTSAVPYIMLSNMYA 474
               FI++      ++    ++NA            AA H+FELDP    P+++L N+YA
Sbjct: 417 RALRFIEEMPIEPTAAIWKALLNACRMHKNTELGAYAAEHVFELDPDDPGPHVILYNIYA 476

Query: 475 SMGRWKDVASVRNLMNSKNVKKFAGYSWIEIDNEVHRFTSEDRTHPETEKIYEELNTLIG 534
           S GRW D A VR  M    VKK    SW+EI+N +H F + D  HP+ E+I  +   ++ 
Sbjct: 477 SGGRWNDAARVRKKMKESGVKKEPACSWVEIENAIHMFVANDERHPQREEIARKWEEVLA 536

Query: 535 KLQEEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICN 572
           K++E G+ P+T+ V+  V + E+  ++ +HSEK+ALAF L+  P G S I I KNIR+C 
Sbjct: 537 KIKELGYVPDTSHVIVHVDQQEREVNLQYHSEKIALAFALLNTPPG-STIHIKKNIRVCG 596

BLAST of HG10018237 vs. ExPASy TrEMBL
Match: A0A5A7UC76 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold609G00590 PE=3 SV=1)

HSP 1 Score: 1023.5 bits (2645), Expect = 3.5e-295
Identity = 527/695 (75.83%), Positives = 544/695 (78.27%), Query Frame = 0

Query: 1   MKAKSTLRQAIDLLCSRSIATSEAYTQLVLECVRTNEINQAKRLQSHMEHHFFRPTDPFL 60
           MKAKSTLRQ++DLLCSRS ATSEAYTQLVLECVRTNEINQAKRLQSHMEHH F+PTDPFL
Sbjct: 1   MKAKSTLRQSVDLLCSRSTATSEAYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDPFL 60

Query: 61  HNQLLHLYAKFGKFRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQNLQATFDRMPLRDS 120
           HNQLLHLYAKFGK RDAQNLFDKML+RD FSWNALLSAYAKSGSIQNL+ATFDRMP RDS
Sbjct: 61  HNQLLHLYAKFGKLRDAQNLFDKMLKRDTFSWNALLSAYAKSGSIQNLKATFDRMPFRDS 120

Query: 121 VSYNTTIAGFAGNSCPKESLEHFKRMQREGFEPTEYTIVSILNASAQLLDLRRGKQIHGS 180
           VSYNTTIAGF+GNSCP+ESL+ FKRMQREGFEPTEYTIVSILNASAQLLDLR GKQIHGS
Sbjct: 121 VSYNTTIAGFSGNSCPQESLQLFKRMQREGFEPTEYTIVSILNASAQLLDLRYGKQIHGS 180

Query: 181 VIVHNFLGNVFICNALTDIYAKCGEIEQARWLFDCLTNKNLVSWNLMISGYAKNGQPEKC 240
           +IV NFLGNVFI N LTD+YAKCGEIEQARWLFDCLT KNLVSWNLMISGYAKNGQPEKC
Sbjct: 181 IIVRNFLGNVFIWNTLTDMYAKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKC 240

Query: 241 SGLLHEMQLSGHMPDQFTMSTIIAAYCQSGRVDEARRVFSEFKEKDIVCWTAMLVGYAKN 300
            GLLH+M+LSGHMP+Q TMSTIIAAYCQ GRVDEARRVFSEFKEKDIVCWTAMLVGYAKN
Sbjct: 241 IGLLHQMRLSGHMPNQVTMSTIIAAYCQCGRVDEARRVFSEFKEKDIVCWTAMLVGYAKN 300

Query: 301 GREEDALLLFNEMLLEQIKPDSYTLSSVVSSCAKLASLHLGQVVHGRSILGGLNNNLLVS 360
           GREEDALLLFNEMLLE I+PDSYTLSSVVSSCAKLASLH GQ VHG+SIL GLNNNLLVS
Sbjct: 301 GREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVS 360

Query: 361 SALIDMYSKCGFIDDA-------------------------------------------- 420
           SALIDMYSKCGFIDDA                                            
Sbjct: 361 SALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQNGHDKDALELFENMLQQKFK 420

Query: 421 ------------------------------------------------------------ 480
                                                                       
Sbjct: 421 PDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRTGRIEQAVS 480

Query: 481 --------------------SSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGR 540
                                STKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGR
Sbjct: 481 LIKNMAHEPDFLIWSTLLSICSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGR 540

Query: 541 WKDVASVRNLMNSKNVKKFAGYSWIEIDNEVHRFTSEDRTHPETEKIYEELNTLIGKLQE 572
           WK VASVRNLM SKNVKKFAG+SWIEID EVHRFTSEDRTHPE+E IYEELN LIGKLQE
Sbjct: 541 WKYVASVRNLMKSKNVKKFAGFSWIEIDKEVHRFTSEDRTHPESENIYEELNILIGKLQE 600

BLAST of HG10018237 vs. ExPASy TrEMBL
Match: A0A0A0LUY3 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G103260 PE=3 SV=1)

HSP 1 Score: 1023.1 bits (2644), Expect = 4.5e-295
Identity = 526/695 (75.68%), Positives = 545/695 (78.42%), Query Frame = 0

Query: 1   MKAKSTLRQAIDLLCSRSIATSEAYTQLVLECVRTNEINQAKRLQSHMEHHFFRPTDPFL 60
           MKAKS LRQ++DLLCSRS ATSEAYTQLVLECVRTNEINQAKRLQSHMEHH F+PTD FL
Sbjct: 1   MKAKSMLRQSVDLLCSRSTATSEAYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDSFL 60

Query: 61  HNQLLHLYAKFGKFRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQNLQATFDRMPLRDS 120
           HNQLLHLYAKFGK RDAQNLFDKML+RD+FSWNALLSAYAKSGSIQNL+ATFDRMP RDS
Sbjct: 61  HNQLLHLYAKFGKLRDAQNLFDKMLKRDIFSWNALLSAYAKSGSIQNLKATFDRMPFRDS 120

Query: 121 VSYNTTIAGFAGNSCPKESLEHFKRMQREGFEPTEYTIVSILNASAQLLDLRRGKQIHGS 180
           VSYNTTIAGF+GNSCP+ESLE FKRMQREGFEPTEYTIVSILNASAQL DLR GKQIHGS
Sbjct: 121 VSYNTTIAGFSGNSCPQESLELFKRMQREGFEPTEYTIVSILNASAQLSDLRYGKQIHGS 180

Query: 181 VIVHNFLGNVFICNALTDIYAKCGEIEQARWLFDCLTNKNLVSWNLMISGYAKNGQPEKC 240
           +IV NFLGNVFI NALTD+YAKCGEIEQARWLFDCLT KNLVSWNLMISGYAKNGQPEKC
Sbjct: 181 IIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKC 240

Query: 241 SGLLHEMQLSGHMPDQFTMSTIIAAYCQSGRVDEARRVFSEFKEKDIVCWTAMLVGYAKN 300
            GLLH+M+LSGHMPDQ TMSTIIAAYCQ GRVDEARRVFSEFKEKDIVCWTAM+VGYAKN
Sbjct: 241 IGLLHQMRLSGHMPDQVTMSTIIAAYCQCGRVDEARRVFSEFKEKDIVCWTAMMVGYAKN 300

Query: 301 GREEDALLLFNEMLLEQIKPDSYTLSSVVSSCAKLASLHLGQVVHGRSILGGLNNNLLVS 360
           GREEDALLLFNEMLLE I+PDSYTLSSVVSSCAKLASLH GQ VHG+SIL GLNNNLLVS
Sbjct: 301 GREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVS 360

Query: 361 SALIDMYSKCGFIDDA-------------------------------------------- 420
           SALIDMYSKCGFIDDA                                            
Sbjct: 361 SALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQNGHDKDALELFENMLQQKFK 420

Query: 421 ------------------------------------------------------------ 480
                                                                       
Sbjct: 421 PDNVTFIGILSACLHCNWIEQGQEYFDSITNQHGMTPTLDHYACMVNLLGRTGRIEQAVA 480

Query: 481 --------------------SSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGR 540
                                STKGDIVNAE+AARHLFELDPT AVPYIMLSNMYASMGR
Sbjct: 481 LIKNMAHDPDFLIWSTLLSICSTKGDIVNAEVAARHLFELDPTIAVPYIMLSNMYASMGR 540

Query: 541 WKDVASVRNLMNSKNVKKFAGYSWIEIDNEVHRFTSEDRTHPETEKIYEELNTLIGKLQE 572
           WKDVASVRNLM SKNVKKFAG+SWIEIDNEVHRFTSEDRTHPE+E IYE+LN LIGKLQE
Sbjct: 541 WKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESEDIYEKLNMLIGKLQE 600

BLAST of HG10018237 vs. ExPASy TrEMBL
Match: A0A6J1CU81 (pentatricopeptide repeat-containing protein At4g02750-like OS=Momordica charantia OX=3673 GN=LOC111014606 PE=3 SV=1)

HSP 1 Score: 979.5 bits (2531), Expect = 5.8e-282
Identity = 500/695 (71.94%), Positives = 531/695 (76.40%), Query Frame = 0

Query: 1   MKAKSTLRQAIDLLCSRSIATSEAYTQLVLECVRTNEINQAKRLQSHMEHHFFRPTDPFL 60
           M+AK  LRQAIDLLCSR  A+SEAYT L+LECVRTNE++QAKRLQSHMEHH F+P DPFL
Sbjct: 1   MQAKPKLRQAIDLLCSRGSASSEAYTHLILECVRTNEVDQAKRLQSHMEHHLFQPPDPFL 60

Query: 61  HNQLLHLYAKFGKFRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQNLQATFDRMPLRDS 120
            NQLLHLYAKFGK RDAQNLFDKMLERDVFSWNALLSAYAKSGSIQNLQATFDRMP RDS
Sbjct: 61  QNQLLHLYAKFGKVRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQNLQATFDRMPFRDS 120

Query: 121 VSYNTTIAGFAGNSCPKESLEHFKRMQREGFEPTEYTIVSILNASAQLLDLRRGKQIHGS 180
           VSYNTTIAGFAGN CPKESLE F+RMQ EGF PTEYT VS LNA+AQLLDLRRGK+IHGS
Sbjct: 121 VSYNTTIAGFAGNGCPKESLELFRRMQSEGFVPTEYTNVSALNAAAQLLDLRRGKEIHGS 180

Query: 181 VIVHNFLGNVFICNALTDIYAKCGEIEQARWLFDCLTNKNLVSWNLMISGYAKNGQPEKC 240
           VIVH FLGN FI NALTD+YAKCGEIEQARWLFD L NKNL+SWNLMISGY KNGQPEKC
Sbjct: 181 VIVHKFLGNTFIWNALTDMYAKCGEIEQARWLFDHLANKNLISWNLMISGYVKNGQPEKC 240

Query: 241 SGLLHEMQLSGHMPDQFTMSTIIAAYCQSGRVDEARRVFSEFKEKDIVCWTAMLVGYAKN 300
            GLLHEMQ+SGHMPDQ TMSTIIAAYCQ   VDEAR+VFSEFKEKDIVCWTAMLVGYAKN
Sbjct: 241 IGLLHEMQMSGHMPDQVTMSTIIAAYCQCRCVDEARKVFSEFKEKDIVCWTAMLVGYAKN 300

Query: 301 GREEDALLLFNEMLLEQIKPDSYTLSSVVSSCAKLASLHLGQVVHGRSILGGLNNNLLVS 360
           GREEDALLLFNEMLLE +KPDSYTLSSVVSSCAKLASL+ GQ VHG+SIL GL+NNLLVS
Sbjct: 301 GREEDALLLFNEMLLEHVKPDSYTLSSVVSSCAKLASLYHGQAVHGKSILAGLDNNLLVS 360

Query: 361 SALIDMYSKCGFIDDA-------------------------------------------- 420
           SALIDMYSKCGF+D+A                                            
Sbjct: 361 SALIDMYSKCGFVDNARSVFNMMPTRNVISWNAMIVGYAQNGHDKDALAFFENMLQQKFK 420

Query: 421 ------------------------------------------------------------ 480
                                                                       
Sbjct: 421 PDNVTFIGVLSACLHSNWIEKGQGYFDSISNQHGLIPTVDHYACMVNLLGRLGRIDQAVD 480

Query: 481 --------------------SSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGR 540
                               S+ KGDI NAEMAAR+LFELDP +AVPY+MLSNMYA MGR
Sbjct: 481 LIKSMPHEPDCLIWSTLLSVSAVKGDIANAEMAARYLFELDPLNAVPYVMLSNMYACMGR 540

Query: 541 WKDVASVRNLMNSKNVKKFAGYSWIEIDNEVHRFTSEDRTHPETEKIYEELNTLIGKLQE 572
           WKDVASVR LM SKNVKKFAGYSWIEIDN+VH+FTSEDRTHPETEKIYEELN LI K QE
Sbjct: 541 WKDVASVRTLMKSKNVKKFAGYSWIEIDNQVHKFTSEDRTHPETEKIYEELNMLIRKFQE 600

BLAST of HG10018237 vs. ExPASy TrEMBL
Match: A0A6J1HBT0 (pentatricopeptide repeat-containing protein At4g02750-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111461409 PE=3 SV=1)

HSP 1 Score: 966.5 bits (2497), Expect = 5.0e-278
Identity = 493/695 (70.94%), Positives = 530/695 (76.26%), Query Frame = 0

Query: 1   MKAKSTLRQAIDLLCSRSIATSEAYTQLVLECVRTNEINQAKRLQSHMEHHFFRPTDPFL 60
           MKAKS LRQA+DLLCSRS ATSEAYTQLVLECVR NEI+QAKRLQSHMEHH F+P DPFL
Sbjct: 1   MKAKSKLRQAVDLLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFL 60

Query: 61  HNQLLHLYAKFGKFRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQNLQATFDRMPLRDS 120
           HNQLLHLYAKFGK RDAQNLFDKMLERDVFSWNALLSAYAKSGSIQ+L+ATFDRMP RDS
Sbjct: 61  HNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQDLRATFDRMPYRDS 120

Query: 121 VSYNTTIAGFAGNSCPKESLEHFKRMQREGFEPTEYTIVSILNASAQLLDLRRGKQIHGS 180
           VSYNT IAG +GNS PKESLE F+RMQREG  PTEYT VS LNASAQLLDLRRGKQIHGS
Sbjct: 121 VSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGS 180

Query: 181 VIVHNFLGNVFICNALTDIYAKCGEIEQARWLFDCLTNKNLVSWNLMISGYAKNGQPEKC 240
           VIVHN+LGNVFICNALTD+YAKCGEIEQARWLFD LTNKNLVSWNLMISGY KNGQPEKC
Sbjct: 181 VIVHNYLGNVFICNALTDMYAKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKC 240

Query: 241 SGLLHEMQLSGHMPDQFTMSTIIAAYCQSGRVDEARRVFSEFKEKDIVCWTAMLVGYAKN 300
            GLLH+M+LSGHMPDQ T+ST+IAAYCQ GR DEARRVF+EFK+KDIVCWTAMLVGYAK+
Sbjct: 241 IGLLHDMRLSGHMPDQVTLSTVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKS 300

Query: 301 GREEDALLLFNEMLLEQIKPDSYTLSSVVSSCAKLASLHLGQVVHGRSILGGLNNNLLVS 360
           GREEDALLLFNEMLLE  +PDSYTLSSVVSSCAKLASL+ GQ +HG+SIL GL+NNLLVS
Sbjct: 301 GREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVS 360

Query: 361 SALIDMYSKCGFIDDA-------------------------------------------- 420
           SALIDMYSKCG I+DA                                            
Sbjct: 361 SALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQNGRDKDTLELFENMLQEKFK 420

Query: 421 ------------------------------------------------------------ 480
                                                                       
Sbjct: 421 PDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVD 480

Query: 481 --------------------SSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGR 540
                               S+TKGD+ +AEM  RHLFELDPT+AVPYIMLSNMYASMGR
Sbjct: 481 LIKSMPHEPDFLIWSTLLSVSATKGDVASAEMGGRHLFELDPTNAVPYIMLSNMYASMGR 540

Query: 541 WKDVASVRNLMNSKNVKKFAGYSWIEIDNEVHRFTSEDRTHPETEKIYEELNTLIGKLQE 572
           WKDVA+VR++M +KNVKKFAGYSWIEIDNEVH+FTSEDRTHPETE+IYEEL  LI KL+E
Sbjct: 541 WKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEE 600

BLAST of HG10018237 vs. ExPASy TrEMBL
Match: A0A6J1JAW4 (pentatricopeptide repeat-containing protein At4g02750-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111485131 PE=3 SV=1)

HSP 1 Score: 963.0 bits (2488), Expect = 5.6e-277
Identity = 493/695 (70.94%), Positives = 527/695 (75.83%), Query Frame = 0

Query: 1   MKAKSTLRQAIDLLCSRSIATSEAYTQLVLECVRTNEINQAKRLQSHMEHHFFRPTDPFL 60
           MKAKS LRQA+ LLCSRS ATSEAYTQLVLECVR NEI+QAKRLQSHMEHH F+P DPFL
Sbjct: 1   MKAKSKLRQAVALLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFL 60

Query: 61  HNQLLHLYAKFGKFRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQNLQATFDRMPLRDS 120
           HNQLLHLYAKFGK RDAQNLFDKMLERDVFSWNALLSAYAKSGSIQ+L+ATFDRMP RDS
Sbjct: 61  HNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQDLRATFDRMPYRDS 120

Query: 121 VSYNTTIAGFAGNSCPKESLEHFKRMQREGFEPTEYTIVSILNASAQLLDLRRGKQIHGS 180
           VSYNT IAG +GNS PKESLE F+RMQREG EPTEYT VS LNASAQLLDLRRGKQIHGS
Sbjct: 121 VSYNTIIAGLSGNSFPKESLELFRRMQREGLEPTEYTNVSALNASAQLLDLRRGKQIHGS 180

Query: 181 VIVHNFLGNVFICNALTDIYAKCGEIEQARWLFDCLTNKNLVSWNLMISGYAKNGQPEKC 240
           VIVHN+LGNVFICNALTD+YAKCGEIE ARWLFD LTNKNLVSWNLMISGY KNGQPEKC
Sbjct: 181 VIVHNYLGNVFICNALTDMYAKCGEIEHARWLFDRLTNKNLVSWNLMISGYVKNGQPEKC 240

Query: 241 SGLLHEMQLSGHMPDQFTMSTIIAAYCQSGRVDEARRVFSEFKEKDIVCWTAMLVGYAKN 300
            GLLHEM+LSGHMPDQ T+ST+IAAYCQ GR DEARRVF+EFK+KDIVCWTAMLVGYAK+
Sbjct: 241 IGLLHEMRLSGHMPDQVTLSTVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKS 300

Query: 301 GREEDALLLFNEMLLEQIKPDSYTLSSVVSSCAKLASLHLGQVVHGRSILGGLNNNLLVS 360
           GREEDALLLFNEMLLE  +PDSYT SSVVSSCAKLASL+ GQ +HG+SIL GL+NNLLVS
Sbjct: 301 GREEDALLLFNEMLLEHFEPDSYTFSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVS 360

Query: 361 SALIDMYSKCGFIDDA-------------------------------------------- 420
           SALIDMYSKCG IDDA                                            
Sbjct: 361 SALIDMYSKCGLIDDARSVFDVMPTRNVITWNAMIVGYAQNGRDKDMLELFENMLQEKFK 420

Query: 421 ------------------------------------------------------------ 480
                                                                       
Sbjct: 421 PDNVTFVGVLSACLHSNLIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVN 480

Query: 481 --------------------SSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGR 540
                               S+TKGD+  AEMA RHLFELD T+AVPYIMLSNMYASMGR
Sbjct: 481 LIKSMPHEPDFLIWSTLLSVSATKGDVARAEMAGRHLFELDSTNAVPYIMLSNMYASMGR 540

Query: 541 WKDVASVRNLMNSKNVKKFAGYSWIEIDNEVHRFTSEDRTHPETEKIYEELNTLIGKLQE 572
           WKDVA+VR++M +KNVKKFAGYSWIEIDNEVH+FTSEDRTHPETE+IYEEL  LI KL+E
Sbjct: 541 WKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEE 600

BLAST of HG10018237 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 358.2 bits (918), Expect = 1.2e-98
Identity = 206/632 (32.59%), Positives = 323/632 (51.11%), Query Frame = 0

Query: 5   STLRQAIDLLCSRSIATSEAYTQLVLECVRTNEINQAKRLQSHMEHHFFRPTDPFLHNQL 64
           S L+  + ++    +  S  +  ++  C ++    + +++  H+        D ++H  L
Sbjct: 117 SALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQIHGHV-LKLGCDLDLYVHTSL 176

Query: 65  LHLYAKFGKFRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQNLQATFDRMPLRDSVSYN 124
           + +Y + G+  DA  +FDK   RDV S+ AL+  YA  G I+N Q  FD +P++D VS+N
Sbjct: 177 ISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWN 236

Query: 125 TTIAGFAGNSCPKESLEHFKRMQREGFEPTEYTIVSILNASAQLLDLRRGKQIHGSVIVH 184
             I+G+A     KE+LE FK M +    P E T+V++++A AQ   +  G+Q+H  +  H
Sbjct: 237 AMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDH 296

Query: 185 NFLGNVFICNALTDIYAKCGEIEQARWLFDCLTNKNLVSWNLMISGYAKNGQPEKCSGLL 244
            F  N+ I NAL D+Y+KCGE+E A  LF+ L  K+++SWN +I GY      ++   L 
Sbjct: 297 GFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLF 356

Query: 245 HEMQLSGHMPDQFTMSTIIAA-------------------------------------YC 304
            EM  SG  P+  TM +I+ A                                     Y 
Sbjct: 357 QEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYA 416

Query: 305 QSGRVDEARRVFSEFKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEQIKPDSYTLSS 364
           + G ++ A +VF+    K +  W AM+ G+A +GR + +  LF+ M    I+PD  T   
Sbjct: 417 KCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVG 476

Query: 365 VVSSCAKLASLHLGQVVHGRSILGGLNNNLLVS------SALIDMYSKCGFIDDAS---- 424
           ++S+C+     H G +  GR I   +  +  ++        +ID+    G   +A     
Sbjct: 477 LLSACS-----HSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMIN 536

Query: 425 ------------------STKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKD 484
                                G++   E  A +L +++P +   Y++LSN+YAS GRW +
Sbjct: 537 MMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNE 596

Query: 485 VASVRNLMNSKNVKKFAGYSWIEIDNEVHRFTSEDRTHPETEKIYEELNTLIGKLQEEGF 544
           VA  R L+N K +KK  G S IEID+ VH F   D+ HP   +IY  L  +   L++ GF
Sbjct: 597 VAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGF 656

Query: 545 TPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMK 572
            P+T+ VL ++ E+ K  ++  HSEKLA+AFGLI    G + + I+KN+R+C +CHE  K
Sbjct: 657 VPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPG-TKLTIVKNLRVCRNCHEATK 716

BLAST of HG10018237 vs. TAIR 10
Match: AT4G37170.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 349.7 bits (896), Expect = 4.3e-96
Identity = 202/629 (32.11%), Positives = 328/629 (52.15%), Query Frame = 0

Query: 7   LRQAIDLLCSRSIATSEAYTQLVLECVRTNEINQAKRLQSHMEHHFFRPTDPFLHNQLLH 66
           LR+A+ LL       +  Y  L+  C +T  + + K++  H+    F P    + N+LL 
Sbjct: 70  LREAVQLLGRAKKPPASTYCNLIQVCSQTRALEEGKKVHEHIRTSGFVP-GIVIWNRLLR 129

Query: 67  LYAKFGKFRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQNLQATFDRMPLRDSVSYNTT 126
           +YAK G   DA+ +FD+M  RD+ SWN +++ YA+ G ++  +  FD M  +DS S+   
Sbjct: 130 MYAKCGSLVDARKVFDEMPNRDLCSWNVMVNGYAEVGLLEEARKLFDEMTEKDSYSWTAM 189

Query: 127 IAGFAGNSCPKESLEHFKRMQR-EGFEPTEYTIVSILNASAQLLDLRRGKQIHGSVIVHN 186
           + G+     P+E+L  +  MQR     P  +T+   + A+A +  +RRGK+IHG ++   
Sbjct: 190 VTGYVKKDQPEEALVLYSLMQRVPNSRPNIFTVSIAVAAAAAVKCIRRGKEIHGHIVRAG 249

Query: 187 FLGNVFICNALTDIYAKCGEIEQARWLFDCLTNKNLVSWNLMISGYAKN----------- 246
              +  + ++L D+Y KCG I++AR +FD +  K++VSW  MI  Y K+           
Sbjct: 250 LDSDEVLWSSLMDMYGKCGCIDEARNIFDKIVEKDVVSWTSMIDRYFKSSRWREGFSLFS 309

Query: 247 ---------------GQPEKCSGLLHE---MQLSGHM------PDQFTMSTIIAAYCQSG 306
                          G    C+ L  E    Q+ G+M      P  F  S+++  Y + G
Sbjct: 310 ELVGSCERPNEYTFAGVLNACADLTTEELGKQVHGYMTRVGFDPYSFASSSLVDMYTKCG 369

Query: 307 RVDEARRVFSEFKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEQIKPDSYTLSSVVS 366
            ++ A+ V     + D+V WT+++ G A+NG+ ++AL  F+ +L    KPD  T  +V+S
Sbjct: 370 NIESAKHVVDGCPKPDLVSWTSLIGGCAQNGQPDEALKYFDLLLKSGTKPDHVTFVNVLS 429

Query: 367 SCAKLASLHLGQVVHGRSILGGLNNNLLVS------SALIDMYSKCGFIDD--------- 426
           +C      H G V  G      +     +S      + L+D+ ++ G  +          
Sbjct: 430 ACT-----HAGLVEKGLEFFYSITEKHRLSHTSDHYTCLVDLLARSGRFEQLKSVISEMP 489

Query: 427 -------------ASSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVAS 486
                          ST G+I  AE AA+ LF+++P + V Y+ ++N+YA+ G+W++   
Sbjct: 490 MKPSKFLWASVLGGCSTYGNIDLAEEAAQELFKIEPENPVTYVTMANIYAAAGKWEEEGK 549

Query: 487 VRNLMNSKNVKKFAGYSWIEIDNEVHRFTSEDRTHPETEKIYEELNTLIGKLQEEGFTPN 546
           +R  M    V K  G SW EI  + H F + D +HP   +I E L  L  K++EEG+ P 
Sbjct: 550 MRKRMQEIGVTKRPGSSWTEIKRKRHVFIAADTSHPMYNQIVEFLRELRKKMKEEGYVPA 609

Query: 547 TNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFAS 572
           T+LVLHDV +++K +++ +HSEKLA+AF ++    G + I++ KN+R C DCH  +KF S
Sbjct: 610 TSLVLHDVEDEQKEENLVYHSEKLAVAFAILSTEEG-TAIKVFKNLRSCVDCHGAIKFIS 669

BLAST of HG10018237 vs. TAIR 10
Match: AT1G68930.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 346.7 bits (888), Expect = 3.6e-95
Identity = 193/566 (34.10%), Positives = 305/566 (53.89%), Query Frame = 0

Query: 64  LLHLYAKFGKFRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQNLQATFDRMPLRDSVSY 123
           LL++YA  G   DA+ +F  + +R+   +N+L+      G I++    F  M  +DSVS+
Sbjct: 180 LLYMYANVGCISDAKKVFYGLDDRNTVMYNSLMGGLLACGMIEDALQLFRGME-KDSVSW 239

Query: 124 NTTIAGFAGNSCPKESLEHFKRMQREGFEPTEYTIVSILNASAQLLDLRRGKQIHGSVIV 183
              I G A N   KE++E F+ M+ +G +  +Y   S+L A   L  +  GKQIH  +I 
Sbjct: 240 AAMIKGLAQNGLAKEAIECFREMKVQGLKMDQYPFGSVLPACGGLGAINEGKQIHACIIR 299

Query: 184 HNFLGNVFICNALTDIYAKCGEIEQARWLFDCLTNKNLVSWNLMISGYAKNGQPEKCSGL 243
            NF  ++++ +AL D+Y KC  +  A+ +FD +  KN+VSW  M+ GY + G+ E+   +
Sbjct: 300 TNFQDHIYVGSALIDMYCKCKCLHYAKTVFDRMKQKNVVSWTAMVVGYGQTGRAEEAVKI 359

Query: 244 LHEMQLSGHMPDQFTMSTIIAA-----------------------------------YCQ 303
             +MQ SG  PD +T+   I+A                                   Y +
Sbjct: 360 FLDMQRSGIDPDHYTLGQAISACANVSSLEEGSQFHGKAITSGLIHYVTVSNSLVTLYGK 419

Query: 304 SGRVDEARRVFSEFKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEQIKPDSYTLSSV 363
            G +D++ R+F+E   +D V WTAM+  YA+ GR  + + LF++M+   +KPD  TL+ V
Sbjct: 420 CGDIDDSTRLFNEMNVRDAVSWTAMVSAYAQFGRAVETIQLFDKMVQHGLKPDGVTLTGV 479

Query: 364 VSSCAKLASLHLGQ-VVHGRSILGGLNNNLLVSSALIDMYSKCGFIDD------------ 423
           +S+C++   +  GQ      +   G+  ++   S +ID++S+ G +++            
Sbjct: 480 ISACSRAGLVEKGQRYFKLMTSEYGIVPSIGHYSCMIDLFSRSGRLEEAMRFINGMPFPP 539

Query: 424 ----------ASSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRN 483
                     A   KG++   + AA  L ELDP     Y +LS++YAS G+W  VA +R 
Sbjct: 540 DAIGWTTLLSACRNKGNLEIGKWAAESLIELDPHHPAGYTLLSSIYASKGKWDSVAQLRR 599

Query: 484 LMNSKNVKKFAGYSWIEIDNEVHRFTSEDRTHPETEKIYEELNTLIGKLQEEGFTPNTNL 543
            M  KNVKK  G SWI+   ++H F+++D + P  ++IY +L  L  K+ + G+ P+T+ 
Sbjct: 600 GMREKNVKKEPGQSWIKWKGKLHSFSADDESSPYLDQIYAKLEELNNKIIDNGYKPDTSF 659

Query: 544 VLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRII 572
           V HDV E  K K + +HSE+LA+AFGLI  P+G  PIR+ KN+R+C DCH   K  S + 
Sbjct: 660 VHHDVEEAVKVKMLNYHSERLAIAFGLIFVPSG-QPIRVGKNLRVCVDCHNATKHISSVT 719

BLAST of HG10018237 vs. TAIR 10
Match: AT4G16835.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 343.2 bits (879), Expect = 4.0e-94
Identity = 207/610 (33.93%), Positives = 315/610 (51.64%), Query Frame = 0

Query: 27  QLVLECVRTNEINQAKRLQSHMEHHFFRPTDPFLHNQLLHLYAKF-GKFRDAQNLFDKML 86
           +++  CVR+ +I+ A R+      H  R  +    N LL   +K   +  +A  LFD++ 
Sbjct: 66  KIIARCVRSGDIDGALRV-----FHGMRAKNTITWNSLLIGISKDPSRMMEAHQLFDEIP 125

Query: 87  ERDVFSWNALLSAYAKSGSIQNLQATFDRMPLRDSVSYNTTIAGFAGNSCPKESLEHFKR 146
           E D FS+N +LS Y ++ + +  Q+ FDRMP +D+ S+NT I G+A     +++ E F  
Sbjct: 126 EPDTFSYNIMLSCYVRNVNFEKAQSFFDRMPFKDAASWNTMITGYARRGEMEKARELFYS 185

Query: 147 MQREGFEPTEYTIVSILNASAQLLDLRRGKQIHGSVIVHNFLGNVFICNALTDIYAKCGE 206
           M     E  E +  ++++   +  DL +         V      V    A+   Y K  +
Sbjct: 186 M----MEKNEVSWNAMISGYIECGDLEKASHFFKVAPVR----GVVAWTAMITGYMKAKK 245

Query: 207 IEQARWLF-DCLTNKNLVSWNLMISGYAKNGQPEKCSGLLHEMQLSGHMP---------- 266
           +E A  +F D   NKNLV+WN MISGY +N +PE    L   M   G  P          
Sbjct: 246 VELAEAMFKDMTVNKNLVTWNAMISGYVENSRPEDGLKLFRAMLEEGIRPNSSGLSSALL 305

Query: 267 -------------------------DQFTMSTIIAAYCQSGRVDEARRVFSEFKEKDIVC 326
                                    D   ++++I+ YC+ G + +A ++F   K+KD+V 
Sbjct: 306 GCSELSALQLGRQIHQIVSKSTLCNDVTALTSLISMYCKCGELGDAWKLFEVMKKKDVVA 365

Query: 327 WTAMLVGYAKNGREEDALLLFNEMLLEQIKPDSYTLSSVVSSCAKLASLHLGQVVHGRSI 386
           W AM+ GYA++G  + AL LF EM+  +I+PD  T  +V+ +C      H G V  G + 
Sbjct: 366 WNAMISGYAQHGNADKALCLFREMIDNKIRPDWITFVAVLLACN-----HAGLVNIGMAY 425

Query: 387 LGGLNNNLLVS------SALIDMYSKCGFIDDA------------SSTKGDIVN------ 446
              +  +  V       + ++D+  + G +++A            ++  G ++       
Sbjct: 426 FESMVRDYKVEPQPDHYTCMVDLLGRAGKLEEALKLIRSMPFRPHAAVFGTLLGACRVHK 485

Query: 447 ----AEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMNSKNVKKFAGYSWI 506
               AE AA  L +L+  +A  Y+ L+N+YAS  RW+DVA VR  M   NV K  GYSWI
Sbjct: 486 NVELAEFAAEKLLQLNSQNAAGYVQLANIYASKNRWEDVARVRKRMKESNVVKVPGYSWI 545

Query: 507 EIDNEVHRFTSEDRTHPETEKIYEELNTLIGKLQEEGFTPNTNLVLHDVGEDEKFKSICF 566
           EI N+VH F S DR HPE + I+++L  L  K++  G+ P     LH+V E++K K + +
Sbjct: 546 EIRNKVHHFRSSDRIHPELDSIHKKLKELEKKMKLAGYKPELEFALHNVEEEQKEKLLLW 605

Query: 567 HSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIRRQIILRDSNRFHHFS 572
           HSEKLA+AFG IK P G S I++ KN+RIC DCH+ +KF S I +R+II+RD+ RFHHF 
Sbjct: 606 HSEKLAVAFGCIKLPQG-SQIQVFKNLRICGDCHKAIKFISEIEKREIIVRDTTRFHHFK 656

BLAST of HG10018237 vs. TAIR 10
Match: AT4G02750.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 338.6 bits (867), Expect = 9.9e-93
Identity = 195/573 (34.03%), Positives = 300/573 (52.36%), Query Frame = 0

Query: 62  NQLLHLYAKFGKFRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQNLQATFDRMPLRDSV 121
           N LL  + K  K  +A+  FD M  RDV SWN +++ YA+SG I   +  FD  P++D  
Sbjct: 223 NCLLGGFVKKKKIVEARQFFDSMNVRDVVSWNTIITGYAQSGKIDEARQLFDESPVQDVF 282

Query: 122 SYNTTIAGFAGNSCPKESLEHFKRMQREGFEPTEYTIVSILNASAQLLDLRRGKQIHGSV 181
           ++   ++G+  N   +E+ E F +M     E  E +  ++L    Q   +   K++   +
Sbjct: 283 TWTAMVSGYIQNRMVEEARELFDKMP----ERNEVSWNAMLAGYVQGERMEMAKELFDVM 342

Query: 182 IVHNFLGNVFICNALTDIYAKCGEIEQARWLFDCLTNKNLVSWNLMISGYAKNG------ 241
                  NV   N +   YA+CG+I +A+ LFD +  ++ VSW  MI+GY+++G      
Sbjct: 343 PCR----NVSTWNTMITGYAQCGKISEAKNLFDKMPKRDPVSWAAMIAGYSQSGHSFEAL 402

Query: 242 ----QPEKCSGLLHEMQLS-------------------------GHMPDQFTMSTIIAAY 301
               Q E+  G L+    S                         G+    F  + ++  Y
Sbjct: 403 RLFVQMEREGGRLNRSSFSSALSTCADVVALELGKQLHGRLVKGGYETGCFVGNALLLMY 462

Query: 302 CQSGRVDEARRVFSEFKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEQIKPDSYTLS 361
           C+ G ++EA  +F E   KDIV W  M+ GY+++G  E AL  F  M  E +KPD  T+ 
Sbjct: 463 CKCGSIEEANDLFKEMAGKDIVSWNTMIAGYSRHGFGEVALRFFESMKREGLKPDDATMV 522

Query: 362 SVVSSCAKLASLHLGQVVHGRSIL------GGLNNNLLVSSALIDMYSKCGFIDD----- 421
           +V+S+C+     H G V  GR          G+  N    + ++D+  + G ++D     
Sbjct: 523 AVLSACS-----HTGLVDKGRQYFYTMTQDYGVMPNSQHYACMVDLLGRAGLLEDAHNLM 582

Query: 422 -----------------ASSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWK 481
                            AS   G+   AE AA  +F ++P ++  Y++LSN+YAS GRW 
Sbjct: 583 KNMPFEPDAAIWGTLLGASRVHGNTELAETAADKIFAMEPENSGMYVLLSNLYASSGRWG 642

Query: 482 DVASVRNLMNSKNVKKFAGYSWIEIDNEVHRFTSEDRTHPETEKIYEELNTLIGKLQEEG 541
           DV  +R  M  K VKK  GYSWIEI N+ H F+  D  HPE ++I+  L  L  ++++ G
Sbjct: 643 DVGKLRVRMRDKGVKKVPGYSWIEIQNKTHTFSVGDEFHPEKDEIFAFLEELDLRMKKAG 702

Query: 542 FTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFM 572
           +   T++VLHDV E+EK + + +HSE+LA+A+G+++  +G  PIR+IKN+R+C DCH  +
Sbjct: 703 YVSKTSVVLHDVEEEEKERMVRYHSERLAVAYGIMRVSSG-RPIRVIKNLRVCEDCHNAI 762
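
The Identity and Positives percentages in the HSP headers above are match counts divided by the alignment length (for example, 527/695 gives 75.83%). The following is a minimal sketch of how such a header could be parsed and the percentages re-derived. The names `HSP_RE` and `parse_hsp` are hypothetical helpers introduced for illustration, not part of any BLAST tool, and the pattern assumes the two-line header layout shown on this page.

```python
import re

# Pattern for the two-line HSP header used in the alignments on this page.
# "Posi?tives" also accepts the "Postives" spelling that appears in some
# renderings of this report (an assumption about input variability).
HSP_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+) bits \((?P<raw>\d+)\), Expect = (?P<evalue>\S+)\s+"
    r"Identity = (?P<ident>\d+)/(?P<length>\d+) \([\d.]+%\), "
    r"Posi?tives = (?P<pos>\d+)/(?P=length) \([\d.]+%\)"
)


def parse_hsp(header: str) -> dict:
    """Extract scores from an HSP header and recompute the percentages."""
    m = HSP_RE.search(header)
    if m is None:
        raise ValueError("not an HSP header in the expected layout")
    length = int(m["length"])
    return {
        "bits": float(m["bits"]),
        "raw_score": int(m["raw"]),
        "evalue": float(m["evalue"]),
        "length": length,
        # Percentages are counts over the alignment length, gaps included.
        "identity_pct": round(100 * int(m["ident"]) / length, 2),
        "positives_pct": round(100 * int(m["pos"]) / length, 2),
    }


header = (
    "HSP 1 Score: 1023.5 bits (2645), Expect = 3.5e-295\n"
    "Identity = 527/695 (75.83%), Positives = 544/695 (78.27%), Query Frame = 0"
)
hsp = parse_hsp(header)
print(hsp["identity_pct"], hsp["positives_pct"])
```

Recomputing the percentages from the counts, rather than trusting the printed values, is a cheap consistency check when many headers are processed in bulk.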

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_038895252.1 | 3.3e-300 | 76.98 | pentatricopeptide repeat-containing protein At2g22070-like [Benincasa hispida] >...
KAA0051836.1 | 7.2e-295 | 75.83 | pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK21410...
XP_004147314.1 | 9.4e-295 | 75.68 | putative pentatricopeptide repeat-containing protein At1g68930 [Cucumis sativus]...
XP_022145099.1 | 1.2e-281 | 71.94 | pentatricopeptide repeat-containing protein At4g02750-like [Momordica charantia]...
XP_022960689.1 | 1.0e-277 | 70.94 | pentatricopeptide repeat-containing protein At4g02750-like isoform X1 [Cucurbita...

Match Name | E-value | Identity | Description
Q9LN01 | 1.7e-97 | 32.59 | Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
O23169 | 6.0e-95 | 32.11 | Pentatricopeptide repeat-containing protein At4g37170 OS=Arabidopsis thaliana OX...
Q9CAA8 | 5.1e-94 | 34.10 | Putative pentatricopeptide repeat-containing protein At1g68930 OS=Arabidopsis th...
Q9M4P3 | 5.7e-93 | 33.93 | Pentatricopeptide repeat-containing protein At4g16835, mitochondrial OS=Arabidop...
Q9LIQ7 | 4.8e-92 | 32.64 | Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidop...

Match Name | E-value | Identity | Description
A0A5A7UC76 | 3.5e-295 | 75.83 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A0A0LUY3 | 4.5e-295 | 75.68 | DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G1032...
A0A6J1CU81 | 5.8e-282 | 71.94 | pentatricopeptide repeat-containing protein At4g02750-like OS=Momordica charanti...
A0A6J1HBT0 | 5.0e-278 | 70.94 | pentatricopeptide repeat-containing protein At4g02750-like isoform X1 OS=Cucurbi...
A0A6J1JAW4 | 5.6e-277 | 70.94 | pentatricopeptide repeat-containing protein At4g02750-like isoform X1 OS=Cucurbi...

Match Name | E-value | Identity | Description
AT1G08070.1 | 1.2e-98 | 32.59 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G37170.1 | 4.3e-96 | 32.11 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G68930.1 | 3.6e-95 | 34.10 | pentatricopeptide (PPR) repeat-containing protein
AT4G16835.1 | 4.0e-94 | 33.93 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G02750.1 | 9.9e-93 | 34.03 | Tetratricopeptide repeat (TPR)-like superfamily protein
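
The summary rows are listed by increasing E-value, which is not the same as ranking by percent identity. A small sketch using the five TAIR 10 rows illustrates the difference. The `Hit` record and the two helper functions are hypothetical names chosen for this example; the values are copied from the TAIR 10 rows of the summary.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    name: str
    evalue: float    # BLAST expect value (smaller is more significant)
    identity: float  # percent identity over the aligned region


# Values copied from the TAIR 10 rows of the summary.
tair_hits = [
    Hit("AT1G08070.1", 1.2e-98, 32.59),
    Hit("AT4G37170.1", 4.3e-96, 32.11),
    Hit("AT1G68930.1", 3.6e-95, 34.10),
    Hit("AT4G16835.1", 4.0e-94, 33.93),
    Hit("AT4G02750.1", 9.9e-93, 34.03),
]


def most_significant(hits: list[Hit]) -> Hit:
    """Hit with the smallest E-value."""
    return min(hits, key=lambda h: h.evalue)


def most_identical(hits: list[Hit]) -> Hit:
    """Hit with the highest percent identity."""
    return max(hits, key=lambda h: h.identity)


print(most_significant(tair_hits).name)
print(most_identical(tair_hits).name)
```

The two rankings disagree here because the E-value also reflects alignment length and score, so a longer alignment at slightly lower identity can be more significant.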
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 22..168, e-value: 8.4E-27, score: 96.3; coord: 169..285, e-value: 2.8E-27, score: 97.9
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 286..395, e-value: 2.8E-14, score: 55.3
IPR011990 | Tetratricopeptide-like helical domain superfamily | SUPERFAMILY | 48452 | TPR-like | coord: 27..418
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 121..155, e-value: 0.0015, score: 16.6; coord: 222..255, e-value: 2.0E-8, score: 31.9; coord: 258..289, e-value: 7.3E-7, score: 27.0; coord: 288..321, e-value: 1.4E-7, score: 29.3
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 90..116, e-value: 0.019, score: 15.2; coord: 62..88, e-value: 4.4E-4, score: 20.3
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 285..334, e-value: 4.9E-11, score: 42.7; coord: 119..164, e-value: 2.9E-7, score: 30.6; coord: 219..268, e-value: 1.6E-12, score: 47.4
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 57..91, score: 10.281757
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 286..320, score: 11.564229
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 255..285, score: 10.764054
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 220..254, score: 11.980759
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 119..153, score: 10.994242
IPR032867 | DYW domain | PFAM | PF14432 | DYW_deaminase | coord: 437..560, e-value: 2.8E-39, score: 133.9
None | No IPR available | PANTHER | PTHR47926:SF216 | SUBFAMILY NOT NAMED | coord: 218..265; coord: 377..556
None | No IPR available | PANTHER | PTHR47926 | PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN | coord: 2..56; coord: 263..377
None | No IPR available | PANTHER | PTHR47926 | PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN | coord: 218..265; coord: 377..556
None | No IPR available | PANTHER | PTHR47926:SF216 | SUBFAMILY NOT NAMED | coord: 2..56; coord: 263..377
None | No IPR available | PANTHER | PTHR47926 | PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN | coord: 112..218
None | No IPR available | PANTHER | PTHR47926 | PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN | coord: 63..117
None | No IPR available | PANTHER | PTHR47926:SF216 | SUBFAMILY NOT NAMED | coord: 63..117
None | No IPR available | PANTHER | PTHR47926:SF216 | SUBFAMILY NOT NAMED | coord: 112..218
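
The `coord: start..end` entries in the InterPro annotations can be turned into sorted spans with lengths. The sketch below does this for the five PROSITE PS51375 (PPR) matches. It assumes the coordinates are 1-based, inclusive residue positions on the protein, which this page does not state explicitly; `COORD_RE`, `parse_coords`, and `span_length` are hypothetical names introduced for illustration.

```python
import re

# Matches entries such as "coord: 57..91".
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text: str) -> list[tuple[int, int]]:
    """Return (start, end) pairs found in the text, sorted by start position."""
    return sorted((int(start), int(end)) for start, end in COORD_RE.findall(text))


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


# The five PROSITE PS51375 (PPR) matches, in the order they are listed.
prosite_ppr = (
    "coord: 57..91 coord: 286..320 coord: 255..285 "
    "coord: 220..254 coord: 119..153"
)

repeats = parse_coords(prosite_ppr)
lengths = [span_length(start, end) for start, end in repeats]
print(repeats)
print(lengths)
```

Sorting by start position makes it easier to see that three of the PROSITE matches (220..254, 255..285, 286..320) are directly adjacent to one another.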

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
HG10018237.1 | HG10018237.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding
molecular_function | GO:0008270 | zinc ion binding