HG10019845 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10019845
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pentatricopeptide repeat-containing protein
Location: Chr04: 26076802 .. 26079396 (-)
RNA-Seq Expression: HG10019845
Synteny: HG10019845
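The location field packs chromosome, start, end, and strand into one string. As a rough sketch (the function name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption, not something this record states), it could be unpacked like this:

```python
import re

# Hypothetical helper: splits a location string such as
# "Chr04: 26076802 .. 26079396 (-)" into its components.
LOCATION_RE = re.compile(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")

def parse_location(text):
    match = LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "chrom": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        # Assumes 1-based, inclusive coordinates.
        "length": end - start + 1,
    }

loc = parse_location("Chr04: 26076802 .. 26079396 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the gene spans 2595 bp on the minus strand of Chr04.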
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAATTTTCTCCTGGTAGTGTCCTGAACATGAATCATCCAAAGGACTGTCTTTGTATGTATGTACATCTGTATACATATACTACATTGCATGCACGTGTATATTGTGGTGGGGAAGGTATAATACCTCCAACTTTTCAGACTCCTCTCTGTACAGTCTTTTTGTGTTTTATTTCAGGAACCAAGGGTATAGAGTAAGAACTTCATATGTTTTTGGCAAACTAGAGGTATCATATTCTTCCGAAGGAAATATAGCTGGTTTTGGAACCACCGCTGCTTTATCTGATAGATGCATTTCTAATGAGAGAAATAACCTTGCAACATGGCCGAGCACTGGGATTTATATTAGTAGTCATGGTCTATCTTCACAAGCTGGTACTGAGAACAGTGGAGTGGAAGATAACGTGGAAGATGGATTTTCTGAACTTGATGAAAAACTTCCAAGCACTAGTCCACTTGAAAATAATAAGGCAGCTGATGATAATGAAGGGGAACTAACCTCTGAATCAGAAATTGATGATGATGATGTCGATGATGGGACCCAAAATGAACTGGATTTACCTGAGGTAGATACTGAACTTGCCGAAAAGATATCAACAAAAAGGGCTCCTTCAGAGCTGTTCAAGGCTATTTGGAGTGCTCCAGGTTTATCTGTTCCTAGTGTACTTGATAAGTGGGTCAGTGAAGGAAAAGAAATAAACCGGGCTGAAATCTCTCTTGCCATGCTCAATCTTCGTAGACGTCGAATGTTTGGGAAGGCTTTACAGGTAAATTTATTTGATGTCACTTTTCAAGCAATGAAATTCATGTGTGGTTTGAGGTTTGTAATTAATCTCATTTCTCTTGGACTTTGTGGAACTCTATATAAGTCTATGTGATTAGTGGACACAACAGATTTCAGGATGGAAATGTTGATCAATTTTTCAAAAGTACAACAATCAATTGTAATATGGTCTAACAAGCGCAAAATGATCTCAAGTTGTGTAACGGCTTGTAATTAGTAAGCCTGAGTATGTCAATCAATTCTTAAGATTTACCATTATCAAATACTTGTATAAAAAATGGCTAGCTGGCTTATAGCATTTAAGAAATAGTGCTAGAGCCTTCTTCAAATGTTTAAGTTTGATGTTTGGTTGAGCAGCATCGGAACTCCTGGTAATATATTTGGATCGGCAGTTATATATTGTTGGGCTTATAAAGTTGTTACTGATATGAAAATGCTTGTGCAATTTTTGACACTGAAGAACTTTTTAGCTATTGTATACGGTTAATTTTTTCCCCTATTTGTTAGGCATTCATTCGTTAATTCTTGAACTCATTGCTGGCATTTCTGTATGGGTTTCACAGTTTTCAGAGTGGTTGGAAGCAAGCGGGCAACTCGAATTTATTGAGAGAGATTATGCTTCTCGCCTTGACTTGATTGCAAAGGTACGGGGTCTCCATAAGGCAGAGAGTTACATTGCTAAAATCCCAAAGTCCTTCCAGGGGGAGACGATATACCGAACTCTTTTGGCTAACTGTGTGGTTGCCAACAATGTAAAAAAAGCAGAGGAAGTATTTAACAAAATGAAGGACCTTGGATTCCCAATCACAACATTTGCTTGCAACCAGTTGCTTCTTCTTTACAAGAGGCTAGACAAGAGGAAAATAGCCGACGTTTTGTTGTTGATGGAGAAAGAAAATGTCAAGCCGTCTCTGTTTACTTACAAAATCTTAATAGATGCTAAAGGCCTATCAAATGACATTATAGGGATGGAACAAGTTGTTGATACAATGAAGGCCGAAGGAATTGAACTTGATGTTACTATACTTTCCATATTAGCTAAGCACTATGCTTCGGGTGGGCTTAAAGACAAAGCCATGGCCATTTTAAAGGAGATGGAAGATGTTAACTCCAAAGGTTCTCAATGGCCTTGCAGAATTTTACTTCCCCTTTATGGAGAACTCCAAATGGAAGATGAAGTGAGGAGGCTCTGGAAGATTTGCGAGTCAAATCCTCG
TATCGAAGAATGCATGGCTGCCATTGTTGCTTGGGGAAAGCTGAAGAACGTCCCTGAGGCAGAGAAAATTTTTGATAGAGTTGTAAAAACATGGAAGAAGCTGTCCTCAAAACAATATTCTACCATGGTGAAGGTTTATGCAGACAATAAGATGCTGACGAAGGGCAAGGAACTAGTCAAGCAGATGGAAGACAGTGGTTGCCGCATTGATCCATTGACATGGGATGCAGTTGTGAAGCTCTATGTGGAAGCTGGGGAGGTAGAAAAAGCAGACTCTTTCTTGTTCAAGGTTCTTCAAAAAAACCAGAAGAAGCCATTGTTTACCTCATACATGGTTATCATGGATCAGTATGCAAGGAGGGGGGATGTCCACAATACAGAGAAAATCTTTCATAAGATGAGACTATCGGGTTACGTTGCTCGATTAAGCCAATTTCAAACTCTAATACAGGCATACCTTAATGCCAAGGCTCCGGCCTATGGTATGAAAGAGAGAATGAAGGCAGATAATGTATTTCCAAACAAAGCTTTGGCAGGAAAATTAGTCCAAGTTGATGCTTTCAGGAAGACAGCAGTGTCAGATTTGCTTGATTGA

mRNA sequence

ATGAAATTTTCTCCTGGTAGTGTCCTGAACATGAATCATCCAAAGGACTGTCTTTGTATGAACCAAGGGTATAGAGTAAGAACTTCATATGTTTTTGGCAAACTAGAGGTATCATATTCTTCCGAAGGAAATATAGCTGGTTTTGGAACCACCGCTGCTTTATCTGATAGATGCATTTCTAATGAGAGAAATAACCTTGCAACATGGCCGAGCACTGGGATTTATATTAGTAGTCATGGTCTATCTTCACAAGCTGGTACTGAGAACAGTGGAGTGGAAGATAACGTGGAAGATGGATTTTCTGAACTTGATGAAAAACTTCCAAGCACTAGTCCACTTGAAAATAATAAGGCAGCTGATGATAATGAAGGGGAACTAACCTCTGAATCAGAAATTGATGATGATGATGTCGATGATGGGACCCAAAATGAACTGGATTTACCTGAGGTAGATACTGAACTTGCCGAAAAGATATCAACAAAAAGGGCTCCTTCAGAGCTGTTCAAGGCTATTTGGAGTGCTCCAGGTTTATCTGTTCCTAGTGTACTTGATAAGTGGGTCAGTGAAGGAAAAGAAATAAACCGGGCTGAAATCTCTCTTGCCATGCTCAATCTTCGTAGACGTCGAATGTTTGGGAAGGCTTTACAGTTTTCAGAGTGGTTGGAAGCAAGCGGGCAACTCGAATTTATTGAGAGAGATTATGCTTCTCGCCTTGACTTGATTGCAAAGGTACGGGGTCTCCATAAGGCAGAGAGTTACATTGCTAAAATCCCAAAGTCCTTCCAGGGGGAGACGATATACCGAACTCTTTTGGCTAACTGTGTGGTTGCCAACAATGTAAAAAAAGCAGAGGAAGTATTTAACAAAATGAAGGACCTTGGATTCCCAATCACAACATTTGCTTGCAACCAGTTGCTTCTTCTTTACAAGAGGCTAGACAAGAGGAAAATAGCCGACGTTTTGTTGTTGATGGAGAAAGAAAATGTCAAGCCGTCTCTGTTTACTTACAAAATCTTAATAGATGCTAAAGGCCTATCAAATGACATTATAGGGATGGAACAAGTTGTTGATACAATGAAGGCCGAAGGAATTGAACTTGATGTTACTATACTTTCCATATTAGCTAAGCACTATGCTTCGGGTGGGCTTAAAGACAAAGCCATGGCCATTTTAAAGGAGATGGAAGATGTTAACTCCAAAGGTTCTCAATGGCCTTGCAGAATTTTACTTCCCCTTTATGGAGAACTCCAAATGGAAGATGAAGTGAGGAGGCTCTGGAAGATTTGCGAGTCAAATCCTCGTATCGAAGAATGCATGGCTGCCATTGTTGCTTGGGGAAAGCTGAAGAACGTCCCTGAGGCAGAGAAAATTTTTGATAGAGTTGTAAAAACATGGAAGAAGCTGTCCTCAAAACAATATTCTACCATGGTGAAGGTTTATGCAGACAATAAGATGCTGACGAAGGGCAAGGAACTAGTCAAGCAGATGGAAGACAGTGGTTGCCGCATTGATCCATTGACATGGGATGCAGTTGTGAAGCTCTATGTGGAAGCTGGGGAGGTAGAAAAAGCAGACTCTTTCTTGTTCAAGGTTCTTCAAAAAAACCAGAAGAAGCCATTGTTTACCTCATACATGGTTATCATGGATCAGTATGCAAGGAGGGGGGATGTCCACAATACAGAGAAAATCTTTCATAAGATGAGACTATCGGGTTACGTTGCTCGATTAAGCCAATTTCAAACTCTAATACAGGCATACCTTAATGCCAAGGCTCCGGCCTATGGTATGAAAGAGAGAATGAAGGCAGATAATGTATTTCCAAACAAAGCTTTGGCAGGAAAATTAGTCCAAGTTGATGCTTTCAGGAAGACAGCAGTGTCAGATTTGCTTGATTGA

Coding sequence (CDS)

ATGAAATTTTCTCCTGGTAGTGTCCTGAACATGAATCATCCAAAGGACTGTCTTTGTATGAACCAAGGGTATAGAGTAAGAACTTCATATGTTTTTGGCAAACTAGAGGTATCATATTCTTCCGAAGGAAATATAGCTGGTTTTGGAACCACCGCTGCTTTATCTGATAGATGCATTTCTAATGAGAGAAATAACCTTGCAACATGGCCGAGCACTGGGATTTATATTAGTAGTCATGGTCTATCTTCACAAGCTGGTACTGAGAACAGTGGAGTGGAAGATAACGTGGAAGATGGATTTTCTGAACTTGATGAAAAACTTCCAAGCACTAGTCCACTTGAAAATAATAAGGCAGCTGATGATAATGAAGGGGAACTAACCTCTGAATCAGAAATTGATGATGATGATGTCGATGATGGGACCCAAAATGAACTGGATTTACCTGAGGTAGATACTGAACTTGCCGAAAAGATATCAACAAAAAGGGCTCCTTCAGAGCTGTTCAAGGCTATTTGGAGTGCTCCAGGTTTATCTGTTCCTAGTGTACTTGATAAGTGGGTCAGTGAAGGAAAAGAAATAAACCGGGCTGAAATCTCTCTTGCCATGCTCAATCTTCGTAGACGTCGAATGTTTGGGAAGGCTTTACAGTTTTCAGAGTGGTTGGAAGCAAGCGGGCAACTCGAATTTATTGAGAGAGATTATGCTTCTCGCCTTGACTTGATTGCAAAGGTACGGGGTCTCCATAAGGCAGAGAGTTACATTGCTAAAATCCCAAAGTCCTTCCAGGGGGAGACGATATACCGAACTCTTTTGGCTAACTGTGTGGTTGCCAACAATGTAAAAAAAGCAGAGGAAGTATTTAACAAAATGAAGGACCTTGGATTCCCAATCACAACATTTGCTTGCAACCAGTTGCTTCTTCTTTACAAGAGGCTAGACAAGAGGAAAATAGCCGACGTTTTGTTGTTGATGGAGAAAGAAAATGTCAAGCCGTCTCTGTTTACTTACAAAATCTTAATAGATGCTAAAGGCCTATCAAATGACATTATAGGGATGGAACAAGTTGTTGATACAATGAAGGCCGAAGGAATTGAACTTGATGTTACTATACTTTCCATATTAGCTAAGCACTATGCTTCGGGTGGGCTTAAAGACAAAGCCATGGCCATTTTAAAGGAGATGGAAGATGTTAACTCCAAAGGTTCTCAATGGCCTTGCAGAATTTTACTTCCCCTTTATGGAGAACTCCAAATGGAAGATGAAGTGAGGAGGCTCTGGAAGATTTGCGAGTCAAATCCTCGTATCGAAGAATGCATGGCTGCCATTGTTGCTTGGGGAAAGCTGAAGAACGTCCCTGAGGCAGAGAAAATTTTTGATAGAGTTGTAAAAACATGGAAGAAGCTGTCCTCAAAACAATATTCTACCATGGTGAAGGTTTATGCAGACAATAAGATGCTGACGAAGGGCAAGGAACTAGTCAAGCAGATGGAAGACAGTGGTTGCCGCATTGATCCATTGACATGGGATGCAGTTGTGAAGCTCTATGTGGAAGCTGGGGAGGTAGAAAAAGCAGACTCTTTCTTGTTCAAGGTTCTTCAAAAAAACCAGAAGAAGCCATTGTTTACCTCATACATGGTTATCATGGATCAGTATGCAAGGAGGGGGGATGTCCACAATACAGAGAAAATCTTTCATAAGATGAGACTATCGGGTTACGTTGCTCGATTAAGCCAATTTCAAACTCTAATACAGGCATACCTTAATGCCAAGGCTCCGGCCTATGGTATGAAAGAGAGAATGAAGGCAGATAATGTATTTCCAAACAAAGCTTTGGCAGGAAAATTAGTCCAAGTTGATGCTTTCAGGAAGACAGCAGTGTCAGATTTGCTTGATTGA

Protein sequence

MKFSPGSVLNMNHPKDCLCMNQGYRVRTSYVFGKLEVSYSSEGNIAGFGTTAALSDRCISNERNNLATWPSTGIYISSHGLSSQAGTENSGVEDNVEDGFSELDEKLPSTSPLENNKAADDNEGELTSESEIDDDDVDDGTQNELDLPEVDTELAEKISTKRAPSELFKAIWSAPGLSVPSVLDKWVSEGKEINRAEISLAMLNLRRRRMFGKALQFSEWLEASGQLEFIERDYASRLDLIAKVRGLHKAESYIAKIPKSFQGETIYRTLLANCVVANNVKKAEEVFNKMKDLGFPITTFACNQLLLLYKRLDKRKIADVLLLMEKENVKPSLFTYKILIDAKGLSNDIIGMEQVVDTMKAEGIELDVTILSILAKHYASGGLKDKAMAILKEMEDVNSKGSQWPCRILLPLYGELQMEDEVRRLWKICESNPRIEECMAAIVAWGKLKNVPEAEKIFDRVVKTWKKLSSKQYSTMVKVYADNKMLTKGKELVKQMEDSGCRIDPLTWDAVVKLYVEAGEVEKADSFLFKVLQKNQKKPLFTSYMVIMDQYARRGDVHNTEKIFHKMRLSGYVARLSQFQTLIQAYLNAKAPAYGMKERMKADNVFPNKALAGKLVQVDAFRKTAVSDLLD
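
The protein sequence above corresponds to the CDS read codon by codon. A minimal sketch of that translation, assuming the standard genetic code (the `translate` helper is illustrative and not part of the database), checked against the first and last 18 nucleotides of the CDS:

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGAAATTTTCTCCTGGT"))   # first 18 nt of the CDS
print(translate("TCAGATTTGCTTGATTGA"))   # last 18 nt, ending in the TGA stop
```

The two fragments translate to MKFSPG and SDLLD, which match the start and end of the protein sequence listed above.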
Homology
BLAST of HG10019845 vs. NCBI nr
Match: XP_022933474.1 (pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like [Cucurbita moschata])

HSP 1 Score: 1043.9 bits (2698), Expect = 5.7e-301
Identity = 531/611 (86.91%), Positives = 568/611 (92.96%), Query Frame = 0

Query: 21  NQGYRVRTSYVFGKLEVSYSSEGNIAGFGTTAALSDRCISNERNNLATWPSTGIYISSHG 80
           NQGYR+RTSYVFGKLE  YS +GNI G     A+SDRCIS ERNNLATW S+G+ ISSHG
Sbjct: 13  NQGYRIRTSYVFGKLEAPYSCDGNIVGSAIIPAISDRCISFERNNLATWRSSGLSISSHG 72

Query: 81  LSSQAGTENSGVEDNVEDGFSELDEKLPSTSPLENNKAADDNEGELTSESEIDDDDVDDG 140
           LSSQAG ENSG ED++EDGFSEL E LPST+ LE+NKAAD+NEGELTSESE+D    DDG
Sbjct: 73  LSSQAGAENSGEEDDLEDGFSEL-ETLPSTNALEDNKAADENEGELTSESELD----DDG 132

Query: 141 TQNELDLPEVDTELAEKISTKRAPSELFKAIWSAPGLSVPSVLDKWVSEGKEINRAEISL 200
           TQNELDLPEV+TEL EKIS KRAPSELFKAIWSAPGLSVPS LDKWVSEGKE++RA+ISL
Sbjct: 133 TQNELDLPEVETELGEKISAKRAPSELFKAIWSAPGLSVPSALDKWVSEGKELSRADISL 192

Query: 201 AMLNLRRRRMFGKALQFSEWLEASGQLEFIERDYASRLDLIAKVRGLHKAESYIAKIPKS 260
           AMLNLRRRRMFGKALQFSEWLEASGQLEF++RDYASRLDLIAKV GLH+AE YIAKIPKS
Sbjct: 193 AMLNLRRRRMFGKALQFSEWLEASGQLEFVDRDYASRLDLIAKVHGLHRAEGYIAKIPKS 252

Query: 261 FQGETIYRTLLANCVVANNVKKAEEVFNKMKDLGFPITTFACNQLLLLYKRLDKRKIADV 320
           FQGE IYRTLLANCVVANNVKKAEEVFNKMKDL FPIT FACNQLLLLYKRLDKRKIADV
Sbjct: 253 FQGEVIYRTLLANCVVANNVKKAEEVFNKMKDLEFPITAFACNQLLLLYKRLDKRKIADV 312

Query: 321 LLLMEKENVKPSLFTYKILIDAKGLSNDIIGMEQVVDTMKAEGIELDVTILSILAKHYAS 380
           LLLMEKENVKPSLFTYKILIDAKGLSND++GMEQVVDTMKAEGIELDV  LSILAKHYAS
Sbjct: 313 LLLMEKENVKPSLFTYKILIDAKGLSNDMVGMEQVVDTMKAEGIELDVNTLSILAKHYAS 372

Query: 381 GGLKDKAMAILKEMEDVNSKGSQWPCRILLPLYGELQMEDEVRRLWKICESNPRIEECMA 440
           GGLKDKA AILKEMEDV+SK S+WPCR+LLPLYGELQMEDEVRR+WKICE+NPRIEECMA
Sbjct: 373 GGLKDKAKAILKEMEDVSSKESRWPCRLLLPLYGELQMEDEVRRVWKICEANPRIEECMA 432

Query: 441 AIVAWGKLKNVPEAEKIFDRVVKTWKKLSSKQYSTMVKVYADNKMLTKGKELVKQMEDSG 500
           AIVAWGKLKNV EAE+IFDRV+KTWKKLSSKQYSTM+KVYADNKMLTKGK+LVKQM DSG
Sbjct: 433 AIVAWGKLKNVQEAEEIFDRVLKTWKKLSSKQYSTMLKVYADNKMLTKGKDLVKQMADSG 492

Query: 501 CRIDPLTWDAVVKLYVEAGEVEKADSFLFKVLQKNQKKPLFTSYMVIMDQYARRGDVHNT 560
           CRI PLTW+AVVKLYVEAGEVEKADSFL K +QKNQ KPLFTSYM+I+DQYARRGDVHN 
Sbjct: 493 CRIGPLTWNAVVKLYVEAGEVEKADSFLQKAVQKNQMKPLFTSYMIILDQYARRGDVHNA 552

Query: 561 EKIFHKMRLSGYVARLSQFQTLIQAYLNAKAPAYGMKERMKADNVFPNKALAGKLVQVDA 620
           EK+FH+MRLSGYVAR SQFQ LIQAY+NAKAPAYGMKERMKADNVFPNKALAGKL Q+DA
Sbjct: 553 EKMFHRMRLSGYVARFSQFQALIQAYINAKAPAYGMKERMKADNVFPNKALAGKLAQIDA 612

Query: 621 FRKTAVSDLLD 632
           FRKTAVSDLLD
Sbjct: 613 FRKTAVSDLLD 618
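
The percentages in each HSP header are the identical and positive counts divided by the alignment length. A small sketch of how such a statistics line could be parsed and the percentages recomputed (the function name and returned keys are hypothetical; the pattern also tolerates the "Postives" spelling that some exports of this page use):

```python
import re

# Matches the statistics line of an HSP header, e.g.
# "Identity = 531/611 (86.91%), Positives = 568/611 (92.96%), Query Frame = 0"
STATS_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def parse_hsp_stats(line):
    match = STATS_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, aligned, positive, aligned_again = map(int, match.groups())
    if aligned != aligned_again:
        raise ValueError("identity and positives use different alignment lengths")
    return {
        "identical": identical,
        "positive": positive,
        "aligned": aligned,
        # Percentages recomputed from the counts rather than read from the text.
        "identity_pct": round(100 * identical / aligned, 2),
        "positive_pct": round(100 * positive / aligned, 2),
    }

stats = parse_hsp_stats(
    "Identity = 531/611 (86.91%), Positives = 568/611 (92.96%), Query Frame = 0"
)
print(stats)
```

For the first hit this reproduces the reported values: 531 of 611 aligned positions are identical (86.91%) and 568 of 611 are positive (92.96%).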

BLAST of HG10019845 vs. NCBI nr
Match: XP_022933485.1 (pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like [Cucurbita moschata])

HSP 1 Score: 1043.1 bits (2696), Expect = 9.7e-301
Identity = 531/611 (86.91%), Positives = 567/611 (92.80%), Query Frame = 0

Query: 21  NQGYRVRTSYVFGKLEVSYSSEGNIAGFGTTAALSDRCISNERNNLATWPSTGIYISSHG 80
           NQGYR+RTSYVFGKLE  YS +GNI G     A+SDRCIS ERNNLATW S+G+ ISSHG
Sbjct: 13  NQGYRIRTSYVFGKLEAPYSCDGNIVGSAIIPAISDRCISFERNNLATWRSSGLSISSHG 72

Query: 81  LSSQAGTENSGVEDNVEDGFSELDEKLPSTSPLENNKAADDNEGELTSESEIDDDDVDDG 140
           LSSQAG ENSG ED++EDGFSEL E LPST+ LE+NKAAD+NEGELTSESE+D    DDG
Sbjct: 73  LSSQAGAENSGEEDDLEDGFSEL-ETLPSTNALEDNKAADENEGELTSESELD----DDG 132

Query: 141 TQNELDLPEVDTELAEKISTKRAPSELFKAIWSAPGLSVPSVLDKWVSEGKEINRAEISL 200
           TQNELDLPEV+TEL EKIS KRAPSELFKAIWSAPGLSVPS LDKWVSEGKE++RA+ISL
Sbjct: 133 TQNELDLPEVETELGEKISAKRAPSELFKAIWSAPGLSVPSALDKWVSEGKELSRADISL 192

Query: 201 AMLNLRRRRMFGKALQFSEWLEASGQLEFIERDYASRLDLIAKVRGLHKAESYIAKIPKS 260
           AMLNLRRRRMFGKALQFSEWLEASGQLEF++RDYASRLDLIAKV GLH+AE YIAKIPKS
Sbjct: 193 AMLNLRRRRMFGKALQFSEWLEASGQLEFVDRDYASRLDLIAKVHGLHRAEGYIAKIPKS 252

Query: 261 FQGETIYRTLLANCVVANNVKKAEEVFNKMKDLGFPITTFACNQLLLLYKRLDKRKIADV 320
           FQGE IYRTLLANCVVANNVKKAEEVFNKMKDL FPIT FACNQLLLLYKRLDKRKIADV
Sbjct: 253 FQGEVIYRTLLANCVVANNVKKAEEVFNKMKDLEFPITAFACNQLLLLYKRLDKRKIADV 312

Query: 321 LLLMEKENVKPSLFTYKILIDAKGLSNDIIGMEQVVDTMKAEGIELDVTILSILAKHYAS 380
           LLLMEKENVKPSLFTYKILIDAKGLSND++GMEQVVDTMKAEGIELDV  LSILAKHYAS
Sbjct: 313 LLLMEKENVKPSLFTYKILIDAKGLSNDMVGMEQVVDTMKAEGIELDVNTLSILAKHYAS 372

Query: 381 GGLKDKAMAILKEMEDVNSKGSQWPCRILLPLYGELQMEDEVRRLWKICESNPRIEECMA 440
           GGLKDKA AILKEMEDV+SK S+WPCR+LLPLYGELQMEDEVRR+WK+CE+NPRIEECMA
Sbjct: 373 GGLKDKAKAILKEMEDVSSKESRWPCRLLLPLYGELQMEDEVRRVWKLCEANPRIEECMA 432

Query: 441 AIVAWGKLKNVPEAEKIFDRVVKTWKKLSSKQYSTMVKVYADNKMLTKGKELVKQMEDSG 500
           AIVAWGKLKNV EAE+IFDRV+KTWKKLSSKQYSTM+KVYADNKMLTKGK+LVKQM DSG
Sbjct: 433 AIVAWGKLKNVQEAEEIFDRVLKTWKKLSSKQYSTMLKVYADNKMLTKGKDLVKQMADSG 492

Query: 501 CRIDPLTWDAVVKLYVEAGEVEKADSFLFKVLQKNQKKPLFTSYMVIMDQYARRGDVHNT 560
           CRI PLTWDAVVKLYVEAGEVEKADSFL K +QKNQ KPLFTSYMVI+DQYARRGDVHN 
Sbjct: 493 CRIGPLTWDAVVKLYVEAGEVEKADSFLQKAVQKNQMKPLFTSYMVILDQYARRGDVHNA 552

Query: 561 EKIFHKMRLSGYVARLSQFQTLIQAYLNAKAPAYGMKERMKADNVFPNKALAGKLVQVDA 620
           EK+FH+MRLSGYVAR S FQ LIQAY+NAKAPAYGMKERMKADNVFPNKALAGKL Q+DA
Sbjct: 553 EKMFHRMRLSGYVARFSPFQALIQAYINAKAPAYGMKERMKADNVFPNKALAGKLAQIDA 612

Query: 621 FRKTAVSDLLD 632
           FRKTAVSDLLD
Sbjct: 613 FRKTAVSDLLD 618

BLAST of HG10019845 vs. NCBI nr
Match: XP_023539395.1 (uncharacterized protein LOC111800051 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1042.3 bits (2694), Expect = 1.6e-300
Identity = 527/613 (85.97%), Positives = 567/613 (92.50%), Query Frame = 0

Query: 19   CMNQGYRVRTSYVFGKLEVSYSSEGNIAGFGTTAALSDRCISNERNNLATWPSTGIYISS 78
            C NQGYR+ TSYVF KL+  YS EGN+       A+SDRCIS ERNNLATW S+G+ +SS
Sbjct: 596  CGNQGYRITTSYVFAKLQAPYSWEGNVVASAILPAISDRCISFERNNLATWRSSGLSLSS 655

Query: 79   HGLSSQAGTENSGVEDNVEDGFSELDEKLPSTSPLENNKAADDNEGELTSESEIDDDDVD 138
            HGLSSQAG ENSG ED++EDGFSEL E LPST+ LE+NKAAD+NEGELTSESE+DDD VD
Sbjct: 656  HGLSSQAGAENSGEEDDLEDGFSEL-ETLPSTNALEHNKAADENEGELTSESELDDDTVD 715

Query: 139  DGTQNELDLPEVDTELAEKISTKRAPSELFKAIWSAPGLSVPSVLDKWVSEGKEINRAEI 198
            DGTQNELDLPEV+TEL EKIS KRAPSELFKAIWSAPGLSVPS LDKWVSEGKE++RA++
Sbjct: 716  DGTQNELDLPEVETELGEKISAKRAPSELFKAIWSAPGLSVPSALDKWVSEGKELSRADV 775

Query: 199  SLAMLNLRRRRMFGKALQFSEWLEASGQLEFIERDYASRLDLIAKVRGLHKAESYIAKIP 258
            SLAMLNLRRRRMFGKALQFSEWLEASGQLEF++RDYASRLDLIAKV GLH+AE YIAKIP
Sbjct: 776  SLAMLNLRRRRMFGKALQFSEWLEASGQLEFVDRDYASRLDLIAKVHGLHRAEGYIAKIP 835

Query: 259  KSFQGETIYRTLLANCVVANNVKKAEEVFNKMKDLGFPITTFACNQLLLLYKRLDKRKIA 318
            KSFQGE IYRTLLANCVVANNVKKAEEVFNKMKDL FPIT FACNQLLLLYKRLDKRKIA
Sbjct: 836  KSFQGEVIYRTLLANCVVANNVKKAEEVFNKMKDLEFPITAFACNQLLLLYKRLDKRKIA 895

Query: 319  DVLLLMEKENVKPSLFTYKILIDAKGLSNDIIGMEQVVDTMKAEGIELDVTILSILAKHY 378
            DVLLLMEKENVKPSLFTYKILIDAKGLSND++GMEQVVDTMKAEGIELDV  LSILAKHY
Sbjct: 896  DVLLLMEKENVKPSLFTYKILIDAKGLSNDMVGMEQVVDTMKAEGIELDVNTLSILAKHY 955

Query: 379  ASGGLKDKAMAILKEMEDVNSKGSQWPCRILLPLYGELQMEDEVRRLWKICESNPRIEEC 438
            ASGGLKDKA AILKEMEDV+SK S+WPCR+LLPLYGELQMEDEVRR+WKICE+NPRIEEC
Sbjct: 956  ASGGLKDKAKAILKEMEDVSSKESRWPCRLLLPLYGELQMEDEVRRVWKICEANPRIEEC 1015

Query: 439  MAAIVAWGKLKNVPEAEKIFDRVVKTWKKLSSKQYSTMVKVYADNKMLTKGKELVKQMED 498
            MAAIVAWGKLKNV EAE+IFDRV KTWK LSSKQYST++KVYADNKMLTKGK+LVKQM D
Sbjct: 1016 MAAIVAWGKLKNVQEAEEIFDRVSKTWKNLSSKQYSTLLKVYADNKMLTKGKDLVKQMAD 1075

Query: 499  SGCRIDPLTWDAVVKLYVEAGEVEKADSFLFKVLQKNQKKPLFTSYMVIMDQYARRGDVH 558
            SGCRI PLTW+AVVKLYVEAGEVEKADSFL K +QKNQ KPLFTSYM+I+DQYARRGDVH
Sbjct: 1076 SGCRIGPLTWNAVVKLYVEAGEVEKADSFLQKAVQKNQMKPLFTSYMIILDQYARRGDVH 1135

Query: 559  NTEKIFHKMRLSGYVARLSQFQTLIQAYLNAKAPAYGMKERMKADNVFPNKALAGKLVQV 618
            N EK+FH+MRLSGYVAR SQFQ LIQAY+NAKAPAYGMKERMKADNVFPNKALAGKL Q+
Sbjct: 1136 NAEKMFHRMRLSGYVARFSQFQALIQAYINAKAPAYGMKERMKADNVFPNKALAGKLAQI 1195

Query: 619  DAFRKTAVSDLLD 632
            DAFRKTAVSDLLD
Sbjct: 1196 DAFRKTAVSDLLD 1207

BLAST of HG10019845 vs. NCBI nr
Match: XP_023539396.1 (pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like isoform X2 [Cucurbita pepo subsp. pepo] >XP_023540094.1 pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1040.0 bits (2688), Expect = 8.2e-300
Identity = 526/611 (86.09%), Positives = 566/611 (92.64%), Query Frame = 0

Query: 21  NQGYRVRTSYVFGKLEVSYSSEGNIAGFGTTAALSDRCISNERNNLATWPSTGIYISSHG 80
           NQGYR+ TSYVF KL+  YS EGN+       A+SDRCIS ERNNLATW S+G+ +SSHG
Sbjct: 13  NQGYRITTSYVFAKLQAPYSWEGNVVASAILPAISDRCISFERNNLATWRSSGLSLSSHG 72

Query: 81  LSSQAGTENSGVEDNVEDGFSELDEKLPSTSPLENNKAADDNEGELTSESEIDDDDVDDG 140
           LSSQAG ENSG ED++EDGFSEL E LPST+ LE+NKAAD+NEGELTSESE+DDD VDDG
Sbjct: 73  LSSQAGAENSGEEDDLEDGFSEL-ETLPSTNALEHNKAADENEGELTSESELDDDTVDDG 132

Query: 141 TQNELDLPEVDTELAEKISTKRAPSELFKAIWSAPGLSVPSVLDKWVSEGKEINRAEISL 200
           TQNELDLPEV+TEL EKIS KRAPSELFKAIWSAPGLSVPS LDKWVSEGKE++RA++SL
Sbjct: 133 TQNELDLPEVETELGEKISAKRAPSELFKAIWSAPGLSVPSALDKWVSEGKELSRADVSL 192

Query: 201 AMLNLRRRRMFGKALQFSEWLEASGQLEFIERDYASRLDLIAKVRGLHKAESYIAKIPKS 260
           AMLNLRRRRMFGKALQFSEWLEASGQLEF++RDYASRLDLIAKV GLH+AE YIAKIPKS
Sbjct: 193 AMLNLRRRRMFGKALQFSEWLEASGQLEFVDRDYASRLDLIAKVHGLHRAEGYIAKIPKS 252

Query: 261 FQGETIYRTLLANCVVANNVKKAEEVFNKMKDLGFPITTFACNQLLLLYKRLDKRKIADV 320
           FQGE IYRTLLANCVVANNVKKAEEVFNKMKDL FPIT FACNQLLLLYKRLDKRKIADV
Sbjct: 253 FQGEVIYRTLLANCVVANNVKKAEEVFNKMKDLEFPITAFACNQLLLLYKRLDKRKIADV 312

Query: 321 LLLMEKENVKPSLFTYKILIDAKGLSNDIIGMEQVVDTMKAEGIELDVTILSILAKHYAS 380
           LLLMEKENVKPSLFTYKILIDAKGLSND++GMEQVVDTMKAEGIELDV  LSILAKHYAS
Sbjct: 313 LLLMEKENVKPSLFTYKILIDAKGLSNDMVGMEQVVDTMKAEGIELDVNTLSILAKHYAS 372

Query: 381 GGLKDKAMAILKEMEDVNSKGSQWPCRILLPLYGELQMEDEVRRLWKICESNPRIEECMA 440
           GGLKDKA AILKEMEDV+SK S+WPCR+LLPLYGELQMEDEVRR+WKICE+NPRIEECMA
Sbjct: 373 GGLKDKAKAILKEMEDVSSKESRWPCRLLLPLYGELQMEDEVRRVWKICEANPRIEECMA 432

Query: 441 AIVAWGKLKNVPEAEKIFDRVVKTWKKLSSKQYSTMVKVYADNKMLTKGKELVKQMEDSG 500
           AIVAWGKLKNV EAE+IFDRV KTWK LSSKQYST++KVYADNKMLTKGK+LVKQM DSG
Sbjct: 433 AIVAWGKLKNVQEAEEIFDRVSKTWKNLSSKQYSTLLKVYADNKMLTKGKDLVKQMADSG 492

Query: 501 CRIDPLTWDAVVKLYVEAGEVEKADSFLFKVLQKNQKKPLFTSYMVIMDQYARRGDVHNT 560
           CRI PLTW+AVVKLYVEAGEVEKADSFL K +QKNQ KPLFTSYM+I+DQYARRGDVHN 
Sbjct: 493 CRIGPLTWNAVVKLYVEAGEVEKADSFLQKAVQKNQMKPLFTSYMIILDQYARRGDVHNA 552

Query: 561 EKIFHKMRLSGYVARLSQFQTLIQAYLNAKAPAYGMKERMKADNVFPNKALAGKLVQVDA 620
           EK+FH+MRLSGYVAR SQFQ LIQAY+NAKAPAYGMKERMKADNVFPNKALAGKL Q+DA
Sbjct: 553 EKMFHRMRLSGYVARFSQFQALIQAYINAKAPAYGMKERMKADNVFPNKALAGKLAQIDA 612

Query: 621 FRKTAVSDLLD 632
           FRKTAVSDLLD
Sbjct: 613 FRKTAVSDLLD 622

BLAST of HG10019845 vs. NCBI nr
Match: XP_022971190.1 (pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like [Cucurbita maxima])

HSP 1 Score: 1039.6 bits (2687), Expect = 1.1e-299
Identity = 531/611 (86.91%), Positives = 566/611 (92.64%), Query Frame = 0

Query: 21  NQGYRVRTSYVFGKLEVSYSSEGNIAGFGTTAALSDRCISNERNNLATWPSTGIYISSHG 80
           NQGYR+RTSYVFGKLE  YS EGNI       A+SD CIS ERN+LATW  +G+ ISSHG
Sbjct: 13  NQGYRIRTSYVFGKLEAPYSWEGNIVASAIIPAISDGCISFERNSLATWRPSGLSISSHG 72

Query: 81  LSSQAGTENSGVEDNVEDGFSELDEKLPSTSPLENNKAADDNEGELTSESEIDDDDVDDG 140
           LSSQAG ENSG ED++EDGFSEL E LPST+ LE+NKAAD+NEGELTSESE+DDD VD G
Sbjct: 73  LSSQAGAENSGEEDDLEDGFSEL-ETLPSTNALEDNKAADENEGELTSESELDDDTVDAG 132

Query: 141 TQNELDLPEVDTELAEKISTKRAPSELFKAIWSAPGLSVPSVLDKWVSEGKEINRAEISL 200
           TQNELDLPE++TELAEKI  KRAPSELFKAIWSAPG SVPS LDKWVSEGKE++RA+ISL
Sbjct: 133 TQNELDLPELETELAEKIPAKRAPSELFKAIWSAPGSSVPSALDKWVSEGKELSRADISL 192

Query: 201 AMLNLRRRRMFGKALQFSEWLEASGQLEFIERDYASRLDLIAKVRGLHKAESYIAKIPKS 260
           AMLNLRRRRMFGKALQFSEWLEASGQLEF++RDYASRLDLIAKV GLH+AE YIAKIPKS
Sbjct: 193 AMLNLRRRRMFGKALQFSEWLEASGQLEFVDRDYASRLDLIAKVHGLHRAEGYIAKIPKS 252

Query: 261 FQGETIYRTLLANCVVANNVKKAEEVFNKMKDLGFPITTFACNQLLLLYKRLDKRKIADV 320
           FQGE IYRTLLANCVVANNVKKAEEVFNKMKDL FPIT FACNQLLLLYKRLDKRKIADV
Sbjct: 253 FQGEVIYRTLLANCVVANNVKKAEEVFNKMKDLEFPITAFACNQLLLLYKRLDKRKIADV 312

Query: 321 LLLMEKENVKPSLFTYKILIDAKGLSNDIIGMEQVVDTMKAEGIELDVTILSILAKHYAS 380
           LLLMEKENVKPSLFTYKILIDAKGLSND++GMEQVVDTMKAEGIELDV  LSILAKHYAS
Sbjct: 313 LLLMEKENVKPSLFTYKILIDAKGLSNDMMGMEQVVDTMKAEGIELDVHTLSILAKHYAS 372

Query: 381 GGLKDKAMAILKEMEDVNSKGSQWPCRILLPLYGELQMEDEVRRLWKICESNPRIEECMA 440
           GGLKDKA AILKEMEDV+SK S+WPCRILLPLYGELQMEDEVRR+WKICE+NPRIEECMA
Sbjct: 373 GGLKDKAKAILKEMEDVSSKESRWPCRILLPLYGELQMEDEVRRVWKICEANPRIEECMA 432

Query: 441 AIVAWGKLKNVPEAEKIFDRVVKTWKKLSSKQYSTMVKVYADNKMLTKGKELVKQMEDSG 500
           AIVAWGKLKNV EAE+IFDRV+KTWKKLSSKQYSTM+KVYADNKMLTKGK+LVKQM DSG
Sbjct: 433 AIVAWGKLKNVQEAEEIFDRVLKTWKKLSSKQYSTMLKVYADNKMLTKGKDLVKQMADSG 492

Query: 501 CRIDPLTWDAVVKLYVEAGEVEKADSFLFKVLQKNQKKPLFTSYMVIMDQYARRGDVHNT 560
           CRI PLTWDAVVKLYVEAGEVEKADSFL K +QKNQ KPLFTSYMVI+DQYARRGDVHN 
Sbjct: 493 CRIGPLTWDAVVKLYVEAGEVEKADSFLQKAVQKNQMKPLFTSYMVILDQYARRGDVHNA 552

Query: 561 EKIFHKMRLSGYVARLSQFQTLIQAYLNAKAPAYGMKERMKADNVFPNKALAGKLVQVDA 620
           EK+FH+MRLSGYVAR SQFQ LIQAY+NAKAPAYGMKERMKADNVFPNKALAGKL Q+DA
Sbjct: 553 EKMFHRMRLSGYVARFSQFQALIQAYINAKAPAYGMKERMKADNVFPNKALAGKLAQIDA 612

Query: 621 FRKTAVSDLLD 632
           FRKTAVSDLLD
Sbjct: 613 FRKTAVSDLLD 622

BLAST of HG10019845 vs. ExPASy Swiss-Prot
Match: Q9C977 (Pentatricopeptide repeat-containing protein At1g80270, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g80270 PE=2 SV=1)

HSP 1 Score: 635.6 bits (1638), Expect = 6.1e-181
Identity = 322/556 (57.91%), Positives = 430/556 (77.34%), Query Frame = 0

Query: 76  ISSHGLSSQAGTENSGVEDNVEDGFSELDEKLPSTSPLENNKAADDNEGELTSESEIDDD 135
           +S+  LSS AGT++   ED++EDGFSEL+     +   + + ++D++EG+L+++ E    
Sbjct: 57  LSNRALSSSAGTKSDQEEDDLEDGFSELE----GSKSGQGSTSSDEDEGKLSADEE---- 116

Query: 136 DVDDGTQNELDLPEVDTELAEKISTKRAPSELFKAIWSAPGLSVPSVLDKWVSEGKEINR 195
                 + ELDL  ++T+++ K + ++  SELFK I SAPGLS+ S LDKWV EG EI R
Sbjct: 117 -----EEEELDL--IETDVSRK-TVEKKQSELFKTIVSAPGLSIGSALDKWVEEGNEITR 176

Query: 196 AEISLAMLNLRRRRMFGKALQFSEWLEASGQLEFIERDYASRLDLIAKVRGLHKAESYIA 255
            EI+ AML LRRRRM+G+ALQ SEWLEA+ ++E  ERDYASRLDL  K+RGL K E+ + 
Sbjct: 177 VEIAKAMLQLRRRRMYGRALQMSEWLEANKKIEMTERDYASRLDLTVKIRGLEKGEACMQ 236

Query: 256 KIPKSFQGETIYRTLLANCVVANNVKKAEEVFNKMKDLGFPITTFACNQLLLLYKRLDKR 315
           KIPKSF+GE +YRTLLANCV A NVKK+E VFNKMKDLGFP++ F C+Q+LLL+KR+D++
Sbjct: 237 KIPKSFKGEVLYRTLLANCVAAGNVKKSELVFNKMKDLGFPLSGFTCDQMLLLHKRIDRK 296

Query: 316 KIADVLLLMEKENVKPSLFTYKILIDAKGLSNDIIGMEQVVDTMKAEGIELDVTILSILA 375
           KIADVLLLMEKEN+KPSL TYKILID KG +NDI GMEQ+++TMK EG+ELD    ++ A
Sbjct: 297 KIADVLLLMEKENIKPSLLTYKILIDVKGATNDISGMEQILETMKDEGVELDFQTQALTA 356

Query: 376 KHYASGGLKDKAMAILKEMEDVNSKGSQWPCRILLPLYGELQMEDEVRRLWKICESNPRI 435
           +HY+  GLKDKA  +LKEME  + + ++   + LL +Y  L  EDEV+R+WKICES P  
Sbjct: 357 RHYSGAGLKDKAEKVLKEMEGESLEANRRAFKDLLSIYASLGREDEVKRIWKICESKPYF 416

Query: 436 EECMAAIVAWGKLKNVPEAEKIFDRVVKTWKKLSSKQYSTMVKVYADNKMLTKGKELVKQ 495
           EE +AAI A+GKL  V EAE IF+++VK  ++ SS  YS +++VY D+KML+KGK+LVK+
Sbjct: 417 EESLAAIQAFGKLNKVQEAEAIFEKIVKMDRRASSSTYSVLLRVYVDHKMLSKGKDLVKR 476

Query: 496 MEDSGCRIDPLTWDAVVKLYVEAGEVEKADSFLFKVLQKNQKKPLFTSYMVIMDQYARRG 555
           M +SGCRI+  TWDA++KLYVEAGEVEKADS L K  +++  K +  S+M IMD+Y++RG
Sbjct: 477 MAESGCRIEATTWDALIKLYVEAGEVEKADSLLDKASKQSHTKLMMNSFMYIMDEYSKRG 536

Query: 556 DVHNTEKIFHKMRLSGYVARLSQFQTLIQAYLNAKAPAYGMKERMKADNVFPNKALAGKL 615
           DVHNTEKIF KMR +GY +RL QFQ L+QAY+NAK+PAYGM++R+KADN+FPNK++A +L
Sbjct: 537 DVHNTEKIFLKMREAGYTSRLRQFQALMQAYINAKSPAYGMRDRLKADNIFPNKSMAAQL 596

Query: 616 VQVDAFRKTAVSDLLD 632
            Q D F+KTA+SD+LD
Sbjct: 597 AQGDPFKKTAISDILD 596

BLAST of HG10019845 vs. ExPASy Swiss-Prot
Match: Q9XI21 (Pentatricopeptide repeat-containing protein At1g15480, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g15480 PE=2 SV=2)

HSP 1 Score: 619.8 bits (1597), Expect = 3.5e-176
Identity = 330/601 (54.91%), Positives = 436/601 (72.55%), Query Frame = 0

Query: 31  VFGKLEVSYSSEGNIAGFGTTAALSDRCISNERNNLATWPSTGIYISSHGLSSQAGTENS 90
           V+ KL++    E NIA   + A + D+  +  R    +W S+        LSS AG + +
Sbjct: 22  VYSKLDIPL-GERNIA-IESNALIHDKHEALPRFYELSWSSS---TGRRSLSSDAGAKTT 81

Query: 91  GVEDNVEDGFSELDEKLPSTSPLENNKAADDNEGELTSESEIDDDDVDDGTQNELDLPEV 150
           G +D++E      D+ +   +P E +  ++D E     E   D+ D+ +G + EL +PE 
Sbjct: 82  GDDDDLE------DKNVDLATPDETSSDSEDGE-----EFSGDEGDI-EGAELELHVPE- 141

Query: 151 DTELAEKISTKRAPSELFKAIWSAPGLSVPSVLDKWVSEGKEINRAEISLAMLNLRRRRM 210
                      + PSE+FKAI S  GLSV S LDKWV +GK+ NR E   AML LR+RRM
Sbjct: 142 ----------SKRPSEMFKAIVSVSGLSVGSALDKWVEQGKDTNRKEFESAMLQLRKRRM 201

Query: 211 FGKALQFSEWLEASGQLEFIERDYASRLDLIAKVRGLHKAESYIAKIPKSFQGETIYRTL 270
           FG+ALQ +EWL+ + Q E  ERDYA RLDLI+KVRG +K E+YI  IP+SF+GE +YRTL
Sbjct: 202 FGRALQMTEWLDENKQFEMEERDYACRLDLISKVRGWYKGEAYIKTIPESFRGELVYRTL 261

Query: 271 LANCVVANNVKKAEEVFNKMKDLGFPITTFACNQLLLLYKRLDKRKIADVLLLMEKENVK 330
           LAN V  +NV+ AE VFNKMKDLGFP++TF CNQ+L+LYKR+DK+KIADVLLL+EKEN+K
Sbjct: 262 LANHVATSNVRTAEAVFNKMKDLGFPLSTFTCNQMLILYKRVDKKKIADVLLLLEKENLK 321

Query: 331 PSLFTYKILIDAKGLSNDIIGMEQVVDTMKAEGIELDVTILSILAKHYASGGLKDKAMAI 390
           P+L TYKILID KG SNDI GMEQ+V+TMK+EG+ELD+   +++A+HYAS GLK+KA  +
Sbjct: 322 PNLNTYKILIDTKGSSNDITGMEQIVETMKSEGVELDLRARALIARHYASAGLKEKAEKV 381

Query: 391 LKEMEDVNSKGSQWPCRILLPLYGELQMEDEVRRLWKICESNPRIEECMAAIVAWGKLKN 450
           LKEME  + + ++  C+ LL +YG LQ EDEVRR+WKICE NPR  E +AAI+A+GK+  
Sbjct: 382 LKEMEGESLEENRHMCKDLLSVYGYLQREDEVRRVWKICEENPRYNEVLAAILAFGKIDK 441

Query: 451 VPEAEKIFDRVVKTWKKLSSKQYSTMVKVYADNKMLTKGKELVKQMEDSGCRIDPLTWDA 510
           V +AE +F++V+K   ++SS  YS +++VY D+KM+++GK+LVKQM DSGC I  LTWDA
Sbjct: 442 VKDAEAVFEKVLKMSHRVSSNVYSVLLRVYVDHKMVSEGKDLVKQMSDSGCNIGALTWDA 501

Query: 511 VVKLYVEAGEVEKADSFLFKVLQKNQKKPLFTSYMVIMDQYARRGDVHNTEKIFHKMRLS 570
           V+KLYVEAGEVEKA+S L K +Q  Q KPL +S+M +M +Y RRGDVHNTEKIF +M+ +
Sbjct: 502 VIKLYVEAGEVEKAESSLSKAIQSKQIKPLMSSFMYLMHEYVRRGDVHNTEKIFQRMKQA 561

Query: 571 GYVARLSQFQTLIQAYLNAKAPAYGMKERMKADNVFPNKALAGKLVQVDAFRKTAVSDLL 630
           GY +R   +QTLIQAY+NAKAPAYGMKERMKADN+FPNK LA +L + D F+KT +SDLL
Sbjct: 562 GYQSRFWAYQTLIQAYVNAKAPAYGMKERMKADNIFPNKRLAAQLAKADPFKKTPLSDLL 594

Query: 631 D 632
           D
Sbjct: 622 D 594

BLAST of HG10019845 vs. ExPASy Swiss-Prot
Match: Q9LRP6 (Pentatricopeptide repeat-containing protein At3g15590, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g15590 PE=1 SV=1)

HSP 1 Score: 530.8 bits (1366), Expect = 2.1e-149
Identity = 279/559 (49.91%), Positives = 394/559 (70.48%), Query Frame = 0

Query: 75  YISSHGLSSQAGTENSGVEDNVEDGFSELDEKLPSTSPLENNKAADDN--EGELTSESEI 134
           +   H LSS A  ++ G E   E+  SE +E +P +  +      DD+  E EL S    
Sbjct: 65  FFGIHKLSSIADAKDKGDEVVREEELSESEEAVPVSGDVPEGVVDDDSLFEPELGS---- 124

Query: 135 DDDDVDDGTQNELDLPEVDTELAEKISTKRAPSELFKAIWSAPGLSVPSVLDKWVSEGKE 194
           D+DD        L++ E  ++   K + KR  SEL+++I +    SV  VL+KWV EGK+
Sbjct: 125 DNDD--------LEIEEKHSKDGGKPTKKRGQSELYESIVAYK--SVKHVLEKWVKEGKD 184

Query: 195 INRAEISLAMLNLRRRRMFGKALQFSEWLEASGQLEFIERDYASRLDLIAKVRGLHKAES 254
           +++AE++LA+ NLR+R+ +   LQ  EWL A+ Q EF E +YAS+LDL+AKV  L KAE 
Sbjct: 185 LSQAEVTLAIHNLRKRKSYAMCLQLWEWLGANTQFEFTEANYASQLDLVAKVHSLQKAEI 244

Query: 255 YIAKIPKSFQGETIYRTLLANCVVANNVKKAEEVFNKMKDLGFPITTFACNQLLLLYKRL 314
           ++  IP+S +GE +YRTLLANCV+ ++V KAE++FNKMK+L FP + FACNQLLLLY   
Sbjct: 245 FLKDIPESSRGEVVYRTLLANCVLKHHVNKAEDIFNKMKELKFPTSVFACNQLLLLYSMH 304

Query: 315 DKRKIADVLLLMEKENVKPSLFTYKILIDAKGLSNDIIGMEQVVDTMKAEGIELDVTILS 374
           D++KI+DVLLLME+EN+KPS  TY  LI++KGL+ DI GME++V+T+K EGIELD  + S
Sbjct: 305 DRKKISDVLLLMERENIKPSRATYHFLINSKGLAGDITGMEKIVETIKEEGIELDPELQS 364

Query: 375 ILAKHYASGGLKDKAMAILKEMEDVNSKGSQWPCRILLPLYGELQMEDEVRRLWKICESN 434
           ILAK+Y   GLK++A  ++KE+E    + + W CR LLPLY ++   D VRRL +  + N
Sbjct: 365 ILAKYYIRAGLKERAQDLMKEIEGKGLQQTPWVCRSLLPLYADIGDSDNVRRLSRFVDQN 424

Query: 435 PRIEECMAAIVAWGKLKNVPEAEKIFDRVVKTWKKLSSKQYSTMVKVYADNKMLTKGKEL 494
           PR + C++AI AWGKLK V EAE +F+R+V+ +K      Y  ++++Y +NKML KG++L
Sbjct: 425 PRYDNCISAIKAWGKLKEVEEAEAVFERLVEKYKIFPMMPYFALMEIYTENKMLAKGRDL 484

Query: 495 VKQMEDSGCRIDPLTWDAVVKLYVEAGEVEKADSFLFKVLQKNQKKPLFTSYMVIMDQYA 554
           VK+M ++G  I P TW A+VKLY++AGEV KA+  L +  + N+ +P+FT+YM I+++YA
Sbjct: 485 VKRMGNAGIAIGPSTWHALVKLYIKAGEVGKAELILNRATKDNKMRPMFTTYMAILEEYA 544

Query: 555 RRGDVHNTEKIFHKMRLSGYVARLSQFQTLIQAYLNAKAPAYGMKERMKADNVFPNKALA 614
           +RGDVHNTEK+F KM+ + Y A+L Q++T++ AY+NAK PAYGM ERMKADNVFPNK+LA
Sbjct: 545 KRGDVHNTEKVFMKMKRASYAAQLMQYETVLLAYINAKTPAYGMIERMKADNVFPNKSLA 604

Query: 615 GKLVQVDAFRKTAVSDLLD 632
            KL QV+ F+K  VS LLD
Sbjct: 605 AKLAQVNPFKKCPVSVLLD 609

BLAST of HG10019845 vs. ExPASy Swiss-Prot
Match: O22714 (Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana OX=3702 GN=At1g60770 PE=1 SV=1)

HSP 1 Score: 163.7 bits (413), Expect = 6.8e-39
Identity = 129/479 (26.93%), Positives = 231/479 (48.23%), Query Frame = 0

Query: 157 KISTKRAPSE-LFKAIWSAPG--LSVPSVLDKWVSEGKEINRAEISLAMLNLRRRRMFGK 216
           K STK+   E L+  ++   G  + V   L++++   K + + E+   +  LR R ++  
Sbjct: 14  KRSTKKYIEEPLYNRLFKDGGTEVKVRQQLNQFLKGTKHVFKWEVGDTIKKLRNRGLYYP 73

Query: 217 ALQFSEWLEASGQLEFIERDYASRLDLIAKVRGLHKAESYIAKIPKSFQGETIYRTLLAN 276
           AL+ SE +E  G  + +  D A  LDL+AK R +   E+Y   +P++ + E  Y +LL N
Sbjct: 74  ALKLSEVMEERGMNKTVS-DQAIHLDLVAKAREITAGENYFVDLPETSKTELTYGSLL-N 133

Query: 277 CVVANNV-KKAEEVFNKMKDLGFPITTFACNQLLLLY-KRLDKRKIADVLLLMEKENVKP 336
           C     + +KAE + NKMK+L    ++ + N L+ LY K  +  K+  ++  ++ ENV P
Sbjct: 134 CYCKELLTEKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGETEKVPAMIQELKAENVMP 193

Query: 337 SLFTYKILIDAKGLSNDIIGMEQVVDTMKAEG-IELDVTILSILAKHYASGGLKDKAMAI 396
             +TY + + A   +NDI G+E+V++ M  +G +  D T  S +A  Y   GL  KA   
Sbjct: 194 DSYTYNVWMRALAATNDISGVERVIEEMNRDGRVAPDWTTYSNMASIYVDAGLSQKAEKA 253

Query: 397 LKEMEDVNSKGSQWPCRILLPLYGELQMEDEVRRLWKICE------SNPRIEECMAAIVA 456
           L+E+E  N++      + L+ LYG L    EV R+W+         SN      +  +V 
Sbjct: 254 LQELEMKNTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTSNVAYLNMIQVLV- 313

Query: 457 WGKLKNVPEAEKIFDRVVKTWKKLSSKQYSTMVKVYADNKMLTKGKELVKQMEDSGCRID 516
             KL ++P AE +F            +  + ++  YA   ++ K  EL ++    G +++
Sbjct: 314 --KLNDLPGAETLFKEWQANCSTYDIRIVNVLIGAYAQEGLIQKANELKEKAPRRGGKLN 373

Query: 517 PLTWDAVVKLYVEAGEVEKADSFLFKVLQKNQKK-----PLFTSYMVIMDQYARRGDVHN 576
             TW+  +  YV++G++ +A   + K +   +       P   +   +M  + ++ DV+ 
Sbjct: 374 AKTWEIFMDYYVKSGDMARALECMSKAVSIGKGDGGKWLPSPETVRALMSYFEQKKDVNG 433

Query: 577 TEKIFHKMRLSGYVARLSQFQTLIQAYLNAKAPAYGMKERMKADNVFPNKALAGKLVQV 619
            E +   ++          F+ LI+ Y  A      M+ R+K +NV  N+A    L +V
Sbjct: 434 AENLLEILKNGTDNIGAEIFEPLIRTYAAAGKSHPAMRRRLKMENVEVNEATKKLLDEV 487

BLAST of HG10019845 vs. ExPASy Swiss-Prot
Match: Q9SKU6 (Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At2g20710 PE=2 SV=1)

HSP 1 Score: 162.9 bits (411), Expect = 1.2e-38
Identity = 121/408 (29.66%), Positives = 197/408 (48.28%), Query Frame = 0

Query: 175 PGLSVPSVLDKWVSEGKEINRAEISLAMLNLRRRRMFGKALQFSEWLEASGQLEFIERDY 234
           P  S+  VLD W+ +G  +  +E+   +  LR+   F  ALQ S+W+      E  E D 
Sbjct: 50  PSASIIKVLDGWLDQGNLVKTSELHSIIKMLRKFSRFSHALQISDWMSEHRVHEISEGDV 109

Query: 235 ASRLDLIAKVRGLHKAESYIAKIPKSFQGETIYRTLLANCVVANNV-KKAEEVFNKMKDL 294
           A RLDLIAKV GL +AE +   IP   +   +Y  LL NC  +  V  KAE+VF +MK+L
Sbjct: 110 AIRLDLIAKVGGLGEAEKFFETIPMERRNYHLYGALL-NCYASKKVLHKAEQVFQEMKEL 169

Query: 295 GFPITTFACNQLLLLYKRLDKRKIADVLLL-MEKENVKPSLFTYKILIDAKGLSNDIIGM 354
           GF       N +L LY R  K  + + LL  ME E VKP +FT    + A  + +D+ GM
Sbjct: 170 GFLKGCLPYNVMLNLYVRTGKYTMVEKLLREMEDETVKPDIFTVNTRLHAYSVVSDVEGM 229

Query: 355 EQVVDTMKA-EGIELDVTILSILAKHYASGGLKDKAMAILKEMED-VNSKGSQWPCRILL 414
           E+ +   +A +G+ LD    +  A  Y   GL +KA+ +L++ E  VN++  +    +L+
Sbjct: 230 EKFLMRCEADQGLHLDWRTYADTANGYIKAGLTEKALEMLRKSEQMVNAQKRKHAYEVLM 289

Query: 415 PLYGELQMEDEVRRLWKICESNPRIEEC--MAAIVAWGKLKNVPEAEKIFDRVVKTWKKL 474
             YG    ++EV RLW + +          ++ I A  K+ ++ E EKI +         
Sbjct: 290 SFYGAAGKKEEVYRLWSLYKELDGFYNTGYISVISALLKMDDIEEVEKIMEEWEAGHSLF 349

Query: 475 SSKQYSTMVKVYADNKMLTKGKELVKQMEDSGCRIDPLTWDAVVKLYVEAGEVEKADSFL 534
             +    ++  Y    M+ K +E+V  +       D  TW+ +   Y  AG++EKA    
Sbjct: 350 DIRIPHLLITGYCKKGMMEKAEEVVNILVQKWRVEDTSTWERLALGYKMAGKMEKAVEKW 409

Query: 535 FKVLQKNQK--KPLFTSYMVIMDQYARRGDVHNTEKIFHKMRLSGYVA 575
            + ++ ++   +P     M  +D    + D+    KI   +   G+++
Sbjct: 410 KRAIEVSKPGWRPHQVVLMSCVDYLEGQRDMEGLRKILRLLSERGHIS 456
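
The percentages in an HSP header follow from the reported counts: each is the number of matching (or positively scoring) positions divided by the alignment length. A minimal Python sketch that re-derives the figures reported for the Q9SKU6 hit above (121/408 and 197/408). The helper name `percent` is hypothetical and is not part of any BLAST tool.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    return round(100.0 * matches / alignment_length, 2)


# Counts copied from the Q9SKU6 HSP header (identical and positive positions out of 408).
identity = percent(121, 408)
positives = percent(197, 408)
print(f"Identity = {identity:.2f}%, Positives = {positives:.2f}%")
```

The same check applies to any other HSP header in this record by substituting its counts.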

BLAST of HG10019845 vs. ExPASy TrEMBL
Match: A0A6J1EZV3 (pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like OS=Cucurbita moschata OX=3662 GN=LOC111440876 PE=3 SV=1)

HSP 1 Score: 1043.9 bits (2698), Expect = 2.7e-301
Identity = 531/611 (86.91%), Positives = 568/611 (92.96%), Query Frame = 0

Query: 21  NQGYRVRTSYVFGKLEVSYSSEGNIAGFGTTAALSDRCISNERNNLATWPSTGIYISSHG 80
           NQGYR+RTSYVFGKLE  YS +GNI G     A+SDRCIS ERNNLATW S+G+ ISSHG
Sbjct: 13  NQGYRIRTSYVFGKLEAPYSCDGNIVGSAIIPAISDRCISFERNNLATWRSSGLSISSHG 72

Query: 81  LSSQAGTENSGVEDNVEDGFSELDEKLPSTSPLENNKAADDNEGELTSESEIDDDDVDDG 140
           LSSQAG ENSG ED++EDGFSEL E LPST+ LE+NKAAD+NEGELTSESE+D    DDG
Sbjct: 73  LSSQAGAENSGEEDDLEDGFSEL-ETLPSTNALEDNKAADENEGELTSESELD----DDG 132

Query: 141 TQNELDLPEVDTELAEKISTKRAPSELFKAIWSAPGLSVPSVLDKWVSEGKEINRAEISL 200
           TQNELDLPEV+TEL EKIS KRAPSELFKAIWSAPGLSVPS LDKWVSEGKE++RA+ISL
Sbjct: 133 TQNELDLPEVETELGEKISAKRAPSELFKAIWSAPGLSVPSALDKWVSEGKELSRADISL 192

Query: 201 AMLNLRRRRMFGKALQFSEWLEASGQLEFIERDYASRLDLIAKVRGLHKAESYIAKIPKS 260
           AMLNLRRRRMFGKALQFSEWLEASGQLEF++RDYASRLDLIAKV GLH+AE YIAKIPKS
Sbjct: 193 AMLNLRRRRMFGKALQFSEWLEASGQLEFVDRDYASRLDLIAKVHGLHRAEGYIAKIPKS 252

Query: 261 FQGETIYRTLLANCVVANNVKKAEEVFNKMKDLGFPITTFACNQLLLLYKRLDKRKIADV 320
           FQGE IYRTLLANCVVANNVKKAEEVFNKMKDL FPIT FACNQLLLLYKRLDKRKIADV
Sbjct: 253 FQGEVIYRTLLANCVVANNVKKAEEVFNKMKDLEFPITAFACNQLLLLYKRLDKRKIADV 312

Query: 321 LLLMEKENVKPSLFTYKILIDAKGLSNDIIGMEQVVDTMKAEGIELDVTILSILAKHYAS 380
           LLLMEKENVKPSLFTYKILIDAKGLSND++GMEQVVDTMKAEGIELDV  LSILAKHYAS
Sbjct: 313 LLLMEKENVKPSLFTYKILIDAKGLSNDMVGMEQVVDTMKAEGIELDVNTLSILAKHYAS 372

Query: 381 GGLKDKAMAILKEMEDVNSKGSQWPCRILLPLYGELQMEDEVRRLWKICESNPRIEECMA 440
           GGLKDKA AILKEMEDV+SK S+WPCR+LLPLYGELQMEDEVRR+WKICE+NPRIEECMA
Sbjct: 373 GGLKDKAKAILKEMEDVSSKESRWPCRLLLPLYGELQMEDEVRRVWKICEANPRIEECMA 432

Query: 441 AIVAWGKLKNVPEAEKIFDRVVKTWKKLSSKQYSTMVKVYADNKMLTKGKELVKQMEDSG 500
           AIVAWGKLKNV EAE+IFDRV+KTWKKLSSKQYSTM+KVYADNKMLTKGK+LVKQM DSG
Sbjct: 433 AIVAWGKLKNVQEAEEIFDRVLKTWKKLSSKQYSTMLKVYADNKMLTKGKDLVKQMADSG 492

Query: 501 CRIDPLTWDAVVKLYVEAGEVEKADSFLFKVLQKNQKKPLFTSYMVIMDQYARRGDVHNT 560
           CRI PLTW+AVVKLYVEAGEVEKADSFL K +QKNQ KPLFTSYM+I+DQYARRGDVHN 
Sbjct: 493 CRIGPLTWNAVVKLYVEAGEVEKADSFLQKAVQKNQMKPLFTSYMIILDQYARRGDVHNA 552

Query: 561 EKIFHKMRLSGYVARLSQFQTLIQAYLNAKAPAYGMKERMKADNVFPNKALAGKLVQVDA 620
           EK+FH+MRLSGYVAR SQFQ LIQAY+NAKAPAYGMKERMKADNVFPNKALAGKL Q+DA
Sbjct: 553 EKMFHRMRLSGYVARFSQFQALIQAYINAKAPAYGMKERMKADNVFPNKALAGKLAQIDA 612

Query: 621 FRKTAVSDLLD 632
           FRKTAVSDLLD
Sbjct: 613 FRKTAVSDLLD 618

BLAST of HG10019845 vs. ExPASy TrEMBL
Match: A0A6J1F506 (pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like OS=Cucurbita moschata OX=3662 GN=LOC111440885 PE=3 SV=1)

HSP 1 Score: 1043.1 bits (2696), Expect = 4.7e-301
Identity = 531/611 (86.91%), Positives = 567/611 (92.80%), Query Frame = 0

Query: 21  NQGYRVRTSYVFGKLEVSYSSEGNIAGFGTTAALSDRCISNERNNLATWPSTGIYISSHG 80
           NQGYR+RTSYVFGKLE  YS +GNI G     A+SDRCIS ERNNLATW S+G+ ISSHG
Sbjct: 13  NQGYRIRTSYVFGKLEAPYSCDGNIVGSAIIPAISDRCISFERNNLATWRSSGLSISSHG 72

Query: 81  LSSQAGTENSGVEDNVEDGFSELDEKLPSTSPLENNKAADDNEGELTSESEIDDDDVDDG 140
           LSSQAG ENSG ED++EDGFSEL E LPST+ LE+NKAAD+NEGELTSESE+D    DDG
Sbjct: 73  LSSQAGAENSGEEDDLEDGFSEL-ETLPSTNALEDNKAADENEGELTSESELD----DDG 132

Query: 141 TQNELDLPEVDTELAEKISTKRAPSELFKAIWSAPGLSVPSVLDKWVSEGKEINRAEISL 200
           TQNELDLPEV+TEL EKIS KRAPSELFKAIWSAPGLSVPS LDKWVSEGKE++RA+ISL
Sbjct: 133 TQNELDLPEVETELGEKISAKRAPSELFKAIWSAPGLSVPSALDKWVSEGKELSRADISL 192

Query: 201 AMLNLRRRRMFGKALQFSEWLEASGQLEFIERDYASRLDLIAKVRGLHKAESYIAKIPKS 260
           AMLNLRRRRMFGKALQFSEWLEASGQLEF++RDYASRLDLIAKV GLH+AE YIAKIPKS
Sbjct: 193 AMLNLRRRRMFGKALQFSEWLEASGQLEFVDRDYASRLDLIAKVHGLHRAEGYIAKIPKS 252

Query: 261 FQGETIYRTLLANCVVANNVKKAEEVFNKMKDLGFPITTFACNQLLLLYKRLDKRKIADV 320
           FQGE IYRTLLANCVVANNVKKAEEVFNKMKDL FPIT FACNQLLLLYKRLDKRKIADV
Sbjct: 253 FQGEVIYRTLLANCVVANNVKKAEEVFNKMKDLEFPITAFACNQLLLLYKRLDKRKIADV 312

Query: 321 LLLMEKENVKPSLFTYKILIDAKGLSNDIIGMEQVVDTMKAEGIELDVTILSILAKHYAS 380
           LLLMEKENVKPSLFTYKILIDAKGLSND++GMEQVVDTMKAEGIELDV  LSILAKHYAS
Sbjct: 313 LLLMEKENVKPSLFTYKILIDAKGLSNDMVGMEQVVDTMKAEGIELDVNTLSILAKHYAS 372

Query: 381 GGLKDKAMAILKEMEDVNSKGSQWPCRILLPLYGELQMEDEVRRLWKICESNPRIEECMA 440
           GGLKDKA AILKEMEDV+SK S+WPCR+LLPLYGELQMEDEVRR+WK+CE+NPRIEECMA
Sbjct: 373 GGLKDKAKAILKEMEDVSSKESRWPCRLLLPLYGELQMEDEVRRVWKLCEANPRIEECMA 432

Query: 441 AIVAWGKLKNVPEAEKIFDRVVKTWKKLSSKQYSTMVKVYADNKMLTKGKELVKQMEDSG 500
           AIVAWGKLKNV EAE+IFDRV+KTWKKLSSKQYSTM+KVYADNKMLTKGK+LVKQM DSG
Sbjct: 433 AIVAWGKLKNVQEAEEIFDRVLKTWKKLSSKQYSTMLKVYADNKMLTKGKDLVKQMADSG 492

Query: 501 CRIDPLTWDAVVKLYVEAGEVEKADSFLFKVLQKNQKKPLFTSYMVIMDQYARRGDVHNT 560
           CRI PLTWDAVVKLYVEAGEVEKADSFL K +QKNQ KPLFTSYMVI+DQYARRGDVHN 
Sbjct: 493 CRIGPLTWDAVVKLYVEAGEVEKADSFLQKAVQKNQMKPLFTSYMVILDQYARRGDVHNA 552

Query: 561 EKIFHKMRLSGYVARLSQFQTLIQAYLNAKAPAYGMKERMKADNVFPNKALAGKLVQVDA 620
           EK+FH+MRLSGYVAR S FQ LIQAY+NAKAPAYGMKERMKADNVFPNKALAGKL Q+DA
Sbjct: 553 EKMFHRMRLSGYVARFSPFQALIQAYINAKAPAYGMKERMKADNVFPNKALAGKLAQIDA 612

Query: 621 FRKTAVSDLLD 632
           FRKTAVSDLLD
Sbjct: 613 FRKTAVSDLLD 618

BLAST of HG10019845 vs. ExPASy TrEMBL
Match: A0A6J1I524 (pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like OS=Cucurbita maxima OX=3661 GN=LOC111469986 PE=3 SV=1)

HSP 1 Score: 1039.6 bits (2687), Expect = 5.2e-300
Identity = 531/611 (86.91%), Positives = 566/611 (92.64%), Query Frame = 0

Query: 21  NQGYRVRTSYVFGKLEVSYSSEGNIAGFGTTAALSDRCISNERNNLATWPSTGIYISSHG 80
           NQGYR+RTSYVFGKLE  YS EGNI       A+SD CIS ERN+LATW  +G+ ISSHG
Sbjct: 13  NQGYRIRTSYVFGKLEAPYSWEGNIVASAIIPAISDGCISFERNSLATWRPSGLSISSHG 72

Query: 81  LSSQAGTENSGVEDNVEDGFSELDEKLPSTSPLENNKAADDNEGELTSESEIDDDDVDDG 140
           LSSQAG ENSG ED++EDGFSEL E LPST+ LE+NKAAD+NEGELTSESE+DDD VD G
Sbjct: 73  LSSQAGAENSGEEDDLEDGFSEL-ETLPSTNALEDNKAADENEGELTSESELDDDTVDAG 132

Query: 141 TQNELDLPEVDTELAEKISTKRAPSELFKAIWSAPGLSVPSVLDKWVSEGKEINRAEISL 200
           TQNELDLPE++TELAEKI  KRAPSELFKAIWSAPG SVPS LDKWVSEGKE++RA+ISL
Sbjct: 133 TQNELDLPELETELAEKIPAKRAPSELFKAIWSAPGSSVPSALDKWVSEGKELSRADISL 192

Query: 201 AMLNLRRRRMFGKALQFSEWLEASGQLEFIERDYASRLDLIAKVRGLHKAESYIAKIPKS 260
           AMLNLRRRRMFGKALQFSEWLEASGQLEF++RDYASRLDLIAKV GLH+AE YIAKIPKS
Sbjct: 193 AMLNLRRRRMFGKALQFSEWLEASGQLEFVDRDYASRLDLIAKVHGLHRAEGYIAKIPKS 252

Query: 261 FQGETIYRTLLANCVVANNVKKAEEVFNKMKDLGFPITTFACNQLLLLYKRLDKRKIADV 320
           FQGE IYRTLLANCVVANNVKKAEEVFNKMKDL FPIT FACNQLLLLYKRLDKRKIADV
Sbjct: 253 FQGEVIYRTLLANCVVANNVKKAEEVFNKMKDLEFPITAFACNQLLLLYKRLDKRKIADV 312

Query: 321 LLLMEKENVKPSLFTYKILIDAKGLSNDIIGMEQVVDTMKAEGIELDVTILSILAKHYAS 380
           LLLMEKENVKPSLFTYKILIDAKGLSND++GMEQVVDTMKAEGIELDV  LSILAKHYAS
Sbjct: 313 LLLMEKENVKPSLFTYKILIDAKGLSNDMMGMEQVVDTMKAEGIELDVHTLSILAKHYAS 372

Query: 381 GGLKDKAMAILKEMEDVNSKGSQWPCRILLPLYGELQMEDEVRRLWKICESNPRIEECMA 440
           GGLKDKA AILKEMEDV+SK S+WPCRILLPLYGELQMEDEVRR+WKICE+NPRIEECMA
Sbjct: 373 GGLKDKAKAILKEMEDVSSKESRWPCRILLPLYGELQMEDEVRRVWKICEANPRIEECMA 432

Query: 441 AIVAWGKLKNVPEAEKIFDRVVKTWKKLSSKQYSTMVKVYADNKMLTKGKELVKQMEDSG 500
           AIVAWGKLKNV EAE+IFDRV+KTWKKLSSKQYSTM+KVYADNKMLTKGK+LVKQM DSG
Sbjct: 433 AIVAWGKLKNVQEAEEIFDRVLKTWKKLSSKQYSTMLKVYADNKMLTKGKDLVKQMADSG 492

Query: 501 CRIDPLTWDAVVKLYVEAGEVEKADSFLFKVLQKNQKKPLFTSYMVIMDQYARRGDVHNT 560
           CRI PLTWDAVVKLYVEAGEVEKADSFL K +QKNQ KPLFTSYMVI+DQYARRGDVHN 
Sbjct: 493 CRIGPLTWDAVVKLYVEAGEVEKADSFLQKAVQKNQMKPLFTSYMVILDQYARRGDVHNA 552

Query: 561 EKIFHKMRLSGYVARLSQFQTLIQAYLNAKAPAYGMKERMKADNVFPNKALAGKLVQVDA 620
           EK+FH+MRLSGYVAR SQFQ LIQAY+NAKAPAYGMKERMKADNVFPNKALAGKL Q+DA
Sbjct: 553 EKMFHRMRLSGYVARFSQFQALIQAYINAKAPAYGMKERMKADNVFPNKALAGKLAQIDA 612

Query: 621 FRKTAVSDLLD 632
           FRKTAVSDLLD
Sbjct: 613 FRKTAVSDLLD 622

BLAST of HG10019845 vs. ExPASy TrEMBL
Match: A0A6J1I7V1 (pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111469985 PE=3 SV=1)

HSP 1 Score: 1035.0 bits (2675), Expect = 1.3e-298
Identity = 528/612 (86.27%), Positives = 565/612 (92.32%), Query Frame = 0

Query: 20  MNQGYRVRTSYVFGKLEVSYSSEGNIAGFGTTAALSDRCISNERNNLATWPSTGIYISSH 79
           MNQGYR+RTSYVFG LE  YS EGNI       A+SD CIS ERN+LATW  +G+ ISSH
Sbjct: 1   MNQGYRIRTSYVFGTLEAPYSWEGNIVASAIIPAISDGCISFERNSLATWRPSGLSISSH 60

Query: 80  GLSSQAGTENSGVEDNVEDGFSELDEKLPSTSPLENNKAADDNEGELTSESEIDDDDVDD 139
           GLSSQAG ENSG ED++EDGFSEL E LPST+ LE+NKAAD+NEGELTSESE+DDD VD 
Sbjct: 61  GLSSQAGAENSGEEDDLEDGFSEL-ETLPSTNALEDNKAADENEGELTSESELDDDTVDG 120

Query: 140 GTQNELDLPEVDTELAEKISTKRAPSELFKAIWSAPGLSVPSVLDKWVSEGKEINRAEIS 199
           GTQNELDLPE++TELAEKI  KRAPSELFKAIWSAPG SVPS LDKWVSEGKE++RA+IS
Sbjct: 121 GTQNELDLPELETELAEKIPAKRAPSELFKAIWSAPGSSVPSALDKWVSEGKELSRADIS 180

Query: 200 LAMLNLRRRRMFGKALQFSEWLEASGQLEFIERDYASRLDLIAKVRGLHKAESYIAKIPK 259
           LAMLNLRRRRMFGKALQFSEWLEASGQLEF++RDYASRLDLIAKV GLH+AE YIAKIPK
Sbjct: 181 LAMLNLRRRRMFGKALQFSEWLEASGQLEFVDRDYASRLDLIAKVHGLHRAEGYIAKIPK 240

Query: 260 SFQGETIYRTLLANCVVANNVKKAEEVFNKMKDLGFPITTFACNQLLLLYKRLDKRKIAD 319
           SFQGE IYRTLLANCVVANNVKKAEEVFNKMKDL FPIT FACNQLLLLYKRLDKRKIAD
Sbjct: 241 SFQGEVIYRTLLANCVVANNVKKAEEVFNKMKDLEFPITAFACNQLLLLYKRLDKRKIAD 300

Query: 320 VLLLMEKENVKPSLFTYKILIDAKGLSNDIIGMEQVVDTMKAEGIELDVTILSILAKHYA 379
           VLLLMEKENVKPSLFTYKILIDAKGLSND++GMEQVVDTMKAEGIELDV  LSILAKHYA
Sbjct: 301 VLLLMEKENVKPSLFTYKILIDAKGLSNDMMGMEQVVDTMKAEGIELDVNTLSILAKHYA 360

Query: 380 SGGLKDKAMAILKEMEDVNSKGSQWPCRILLPLYGELQMEDEVRRLWKICESNPRIEECM 439
           SGGL DKA AILKEMEDV+SK S+WPCRILLPLYGELQMEDEVRR+WKICE+NPR++ECM
Sbjct: 361 SGGLIDKAKAILKEMEDVSSKESRWPCRILLPLYGELQMEDEVRRVWKICEANPRMDECM 420

Query: 440 AAIVAWGKLKNVPEAEKIFDRVVKTWKKLSSKQYSTMVKVYADNKMLTKGKELVKQMEDS 499
           AAIVAWGKLKNV EAE+IFDRV+KTWKKLSSKQYSTM+KVYADNKMLTKGK+LVKQM DS
Sbjct: 421 AAIVAWGKLKNVQEAEEIFDRVLKTWKKLSSKQYSTMLKVYADNKMLTKGKDLVKQMADS 480

Query: 500 GCRIDPLTWDAVVKLYVEAGEVEKADSFLFKVLQKNQKKPLFTSYMVIMDQYARRGDVHN 559
           GCRI PLTWDAVVKLYVEAGEVEKADSFL K +QKNQ KPLFTSYMVI+DQYARRGDVHN
Sbjct: 481 GCRIGPLTWDAVVKLYVEAGEVEKADSFLQKAVQKNQMKPLFTSYMVILDQYARRGDVHN 540

Query: 560 TEKIFHKMRLSGYVARLSQFQTLIQAYLNAKAPAYGMKERMKADNVFPNKALAGKLVQVD 619
            EK+FH+MRLSGYVAR SQFQ LIQAY+NAKAPAYGMKERMKADNVFPNKALAGKL Q+D
Sbjct: 541 AEKMFHRMRLSGYVARFSQFQALIQAYINAKAPAYGMKERMKADNVFPNKALAGKLAQID 600

Query: 620 AFRKTAVSDLLD 632
           AFRKTAVSDLLD
Sbjct: 601 AFRKTAVSDLLD 611

BLAST of HG10019845 vs. ExPASy TrEMBL
Match: A0A6J1I643 (pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111469985 PE=3 SV=1)

HSP 1 Score: 1033.9 bits (2672), Expect = 2.8e-298
Identity = 528/614 (85.99%), Positives = 565/614 (92.02%), Query Frame = 0

Query: 18  LCMNQGYRVRTSYVFGKLEVSYSSEGNIAGFGTTAALSDRCISNERNNLATWPSTGIYIS 77
           L  NQGYR+RTSYVFG LE  YS EGNI       A+SD CIS ERN+LATW  +G+ IS
Sbjct: 58  LSRNQGYRIRTSYVFGTLEAPYSWEGNIVASAIIPAISDGCISFERNSLATWRPSGLSIS 117

Query: 78  SHGLSSQAGTENSGVEDNVEDGFSELDEKLPSTSPLENNKAADDNEGELTSESEIDDDDV 137
           SHGLSSQAG ENSG ED++EDGFSEL E LPST+ LE+NKAAD+NEGELTSESE+DDD V
Sbjct: 118 SHGLSSQAGAENSGEEDDLEDGFSEL-ETLPSTNALEDNKAADENEGELTSESELDDDTV 177

Query: 138 DDGTQNELDLPEVDTELAEKISTKRAPSELFKAIWSAPGLSVPSVLDKWVSEGKEINRAE 197
           D GTQNELDLPE++TELAEKI  KRAPSELFKAIWSAPG SVPS LDKWVSEGKE++RA+
Sbjct: 178 DGGTQNELDLPELETELAEKIPAKRAPSELFKAIWSAPGSSVPSALDKWVSEGKELSRAD 237

Query: 198 ISLAMLNLRRRRMFGKALQFSEWLEASGQLEFIERDYASRLDLIAKVRGLHKAESYIAKI 257
           ISLAMLNLRRRRMFGKALQFSEWLEASGQLEF++RDYASRLDLIAKV GLH+AE YIAKI
Sbjct: 238 ISLAMLNLRRRRMFGKALQFSEWLEASGQLEFVDRDYASRLDLIAKVHGLHRAEGYIAKI 297

Query: 258 PKSFQGETIYRTLLANCVVANNVKKAEEVFNKMKDLGFPITTFACNQLLLLYKRLDKRKI 317
           PKSFQGE IYRTLLANCVVANNVKKAEEVFNKMKDL FPIT FACNQLLLLYKRLDKRKI
Sbjct: 298 PKSFQGEVIYRTLLANCVVANNVKKAEEVFNKMKDLEFPITAFACNQLLLLYKRLDKRKI 357

Query: 318 ADVLLLMEKENVKPSLFTYKILIDAKGLSNDIIGMEQVVDTMKAEGIELDVTILSILAKH 377
           ADVLLLMEKENVKPSLFTYKILIDAKGLSND++GMEQVVDTMKAEGIELDV  LSILAKH
Sbjct: 358 ADVLLLMEKENVKPSLFTYKILIDAKGLSNDMMGMEQVVDTMKAEGIELDVNTLSILAKH 417

Query: 378 YASGGLKDKAMAILKEMEDVNSKGSQWPCRILLPLYGELQMEDEVRRLWKICESNPRIEE 437
           YASGGL DKA AILKEMEDV+SK S+WPCRILLPLYGELQMEDEVRR+WKICE+NPR++E
Sbjct: 418 YASGGLIDKAKAILKEMEDVSSKESRWPCRILLPLYGELQMEDEVRRVWKICEANPRMDE 477

Query: 438 CMAAIVAWGKLKNVPEAEKIFDRVVKTWKKLSSKQYSTMVKVYADNKMLTKGKELVKQME 497
           CMAAIVAWGKLKNV EAE+IFDRV+KTWKKLSSKQYSTM+KVYADNKMLTKGK+LVKQM 
Sbjct: 478 CMAAIVAWGKLKNVQEAEEIFDRVLKTWKKLSSKQYSTMLKVYADNKMLTKGKDLVKQMA 537

Query: 498 DSGCRIDPLTWDAVVKLYVEAGEVEKADSFLFKVLQKNQKKPLFTSYMVIMDQYARRGDV 557
           DSGCRI PLTWDAVVKLYVEAGEVEKADSFL K +QKNQ KPLFTSYMVI+DQYARRGDV
Sbjct: 538 DSGCRIGPLTWDAVVKLYVEAGEVEKADSFLQKAVQKNQMKPLFTSYMVILDQYARRGDV 597

Query: 558 HNTEKIFHKMRLSGYVARLSQFQTLIQAYLNAKAPAYGMKERMKADNVFPNKALAGKLVQ 617
           HN EK+FH+MRLSGYVAR SQFQ LIQAY+NAKAPAYGMKERMKADNVFPNKALAGKL Q
Sbjct: 598 HNAEKMFHRMRLSGYVARFSQFQALIQAYINAKAPAYGMKERMKADNVFPNKALAGKLAQ 657

Query: 618 VDAFRKTAVSDLLD 632
           +DAFRKTAVSDLLD
Sbjct: 658 IDAFRKTAVSDLLD 670

BLAST of HG10019845 vs. TAIR 10
Match: AT1G80270.1 (PENTATRICOPEPTIDE REPEAT 596)

HSP 1 Score: 635.6 bits (1638), Expect = 4.3e-182
Identity = 322/556 (57.91%), Positives = 430/556 (77.34%), Query Frame = 0

Query: 76  ISSHGLSSQAGTENSGVEDNVEDGFSELDEKLPSTSPLENNKAADDNEGELTSESEIDDD 135
           +S+  LSS AGT++   ED++EDGFSEL+     +   + + ++D++EG+L+++ E    
Sbjct: 57  LSNRALSSSAGTKSDQEEDDLEDGFSELE----GSKSGQGSTSSDEDEGKLSADEE---- 116

Query: 136 DVDDGTQNELDLPEVDTELAEKISTKRAPSELFKAIWSAPGLSVPSVLDKWVSEGKEINR 195
                 + ELDL  ++T+++ K + ++  SELFK I SAPGLS+ S LDKWV EG EI R
Sbjct: 117 -----EEEELDL--IETDVSRK-TVEKKQSELFKTIVSAPGLSIGSALDKWVEEGNEITR 176

Query: 196 AEISLAMLNLRRRRMFGKALQFSEWLEASGQLEFIERDYASRLDLIAKVRGLHKAESYIA 255
            EI+ AML LRRRRM+G+ALQ SEWLEA+ ++E  ERDYASRLDL  K+RGL K E+ + 
Sbjct: 177 VEIAKAMLQLRRRRMYGRALQMSEWLEANKKIEMTERDYASRLDLTVKIRGLEKGEACMQ 236

Query: 256 KIPKSFQGETIYRTLLANCVVANNVKKAEEVFNKMKDLGFPITTFACNQLLLLYKRLDKR 315
           KIPKSF+GE +YRTLLANCV A NVKK+E VFNKMKDLGFP++ F C+Q+LLL+KR+D++
Sbjct: 237 KIPKSFKGEVLYRTLLANCVAAGNVKKSELVFNKMKDLGFPLSGFTCDQMLLLHKRIDRK 296

Query: 316 KIADVLLLMEKENVKPSLFTYKILIDAKGLSNDIIGMEQVVDTMKAEGIELDVTILSILA 375
           KIADVLLLMEKEN+KPSL TYKILID KG +NDI GMEQ+++TMK EG+ELD    ++ A
Sbjct: 297 KIADVLLLMEKENIKPSLLTYKILIDVKGATNDISGMEQILETMKDEGVELDFQTQALTA 356

Query: 376 KHYASGGLKDKAMAILKEMEDVNSKGSQWPCRILLPLYGELQMEDEVRRLWKICESNPRI 435
           +HY+  GLKDKA  +LKEME  + + ++   + LL +Y  L  EDEV+R+WKICES P  
Sbjct: 357 RHYSGAGLKDKAEKVLKEMEGESLEANRRAFKDLLSIYASLGREDEVKRIWKICESKPYF 416

Query: 436 EECMAAIVAWGKLKNVPEAEKIFDRVVKTWKKLSSKQYSTMVKVYADNKMLTKGKELVKQ 495
           EE +AAI A+GKL  V EAE IF+++VK  ++ SS  YS +++VY D+KML+KGK+LVK+
Sbjct: 417 EESLAAIQAFGKLNKVQEAEAIFEKIVKMDRRASSSTYSVLLRVYVDHKMLSKGKDLVKR 476

Query: 496 MEDSGCRIDPLTWDAVVKLYVEAGEVEKADSFLFKVLQKNQKKPLFTSYMVIMDQYARRG 555
           M +SGCRI+  TWDA++KLYVEAGEVEKADS L K  +++  K +  S+M IMD+Y++RG
Sbjct: 477 MAESGCRIEATTWDALIKLYVEAGEVEKADSLLDKASKQSHTKLMMNSFMYIMDEYSKRG 536

Query: 556 DVHNTEKIFHKMRLSGYVARLSQFQTLIQAYLNAKAPAYGMKERMKADNVFPNKALAGKL 615
           DVHNTEKIF KMR +GY +RL QFQ L+QAY+NAK+PAYGM++R+KADN+FPNK++A +L
Sbjct: 537 DVHNTEKIFLKMREAGYTSRLRQFQALMQAYINAKSPAYGMRDRLKADNIFPNKSMAAQL 596

Query: 616 VQVDAFRKTAVSDLLD 632
            Q D F+KTA+SD+LD
Sbjct: 597 AQGDPFKKTAISDILD 596

BLAST of HG10019845 vs. TAIR 10
Match: AT1G80270.2 (PENTATRICOPEPTIDE REPEAT 596)

HSP 1 Score: 635.6 bits (1638), Expect = 4.3e-182
Identity = 322/556 (57.91%), Positives = 430/556 (77.34%), Query Frame = 0

Query: 76  ISSHGLSSQAGTENSGVEDNVEDGFSELDEKLPSTSPLENNKAADDNEGELTSESEIDDD 135
           +S+  LSS AGT++   ED++EDGFSEL+     +   + + ++D++EG+L+++ E    
Sbjct: 57  LSNRALSSSAGTKSDQEEDDLEDGFSELE----GSKSGQGSTSSDEDEGKLSADEE---- 116

Query: 136 DVDDGTQNELDLPEVDTELAEKISTKRAPSELFKAIWSAPGLSVPSVLDKWVSEGKEINR 195
                 + ELDL  ++T+++ K + ++  SELFK I SAPGLS+ S LDKWV EG EI R
Sbjct: 117 -----EEEELDL--IETDVSRK-TVEKKQSELFKTIVSAPGLSIGSALDKWVEEGNEITR 176

Query: 196 AEISLAMLNLRRRRMFGKALQFSEWLEASGQLEFIERDYASRLDLIAKVRGLHKAESYIA 255
            EI+ AML LRRRRM+G+ALQ SEWLEA+ ++E  ERDYASRLDL  K+RGL K E+ + 
Sbjct: 177 VEIAKAMLQLRRRRMYGRALQMSEWLEANKKIEMTERDYASRLDLTVKIRGLEKGEACMQ 236

Query: 256 KIPKSFQGETIYRTLLANCVVANNVKKAEEVFNKMKDLGFPITTFACNQLLLLYKRLDKR 315
           KIPKSF+GE +YRTLLANCV A NVKK+E VFNKMKDLGFP++ F C+Q+LLL+KR+D++
Sbjct: 237 KIPKSFKGEVLYRTLLANCVAAGNVKKSELVFNKMKDLGFPLSGFTCDQMLLLHKRIDRK 296

Query: 316 KIADVLLLMEKENVKPSLFTYKILIDAKGLSNDIIGMEQVVDTMKAEGIELDVTILSILA 375
           KIADVLLLMEKEN+KPSL TYKILID KG +NDI GMEQ+++TMK EG+ELD    ++ A
Sbjct: 297 KIADVLLLMEKENIKPSLLTYKILIDVKGATNDISGMEQILETMKDEGVELDFQTQALTA 356

Query: 376 KHYASGGLKDKAMAILKEMEDVNSKGSQWPCRILLPLYGELQMEDEVRRLWKICESNPRI 435
           +HY+  GLKDKA  +LKEME  + + ++   + LL +Y  L  EDEV+R+WKICES P  
Sbjct: 357 RHYSGAGLKDKAEKVLKEMEGESLEANRRAFKDLLSIYASLGREDEVKRIWKICESKPYF 416

Query: 436 EECMAAIVAWGKLKNVPEAEKIFDRVVKTWKKLSSKQYSTMVKVYADNKMLTKGKELVKQ 495
           EE +AAI A+GKL  V EAE IF+++VK  ++ SS  YS +++VY D+KML+KGK+LVK+
Sbjct: 417 EESLAAIQAFGKLNKVQEAEAIFEKIVKMDRRASSSTYSVLLRVYVDHKMLSKGKDLVKR 476

Query: 496 MEDSGCRIDPLTWDAVVKLYVEAGEVEKADSFLFKVLQKNQKKPLFTSYMVIMDQYARRG 555
           M +SGCRI+  TWDA++KLYVEAGEVEKADS L K  +++  K +  S+M IMD+Y++RG
Sbjct: 477 MAESGCRIEATTWDALIKLYVEAGEVEKADSLLDKASKQSHTKLMMNSFMYIMDEYSKRG 536

Query: 556 DVHNTEKIFHKMRLSGYVARLSQFQTLIQAYLNAKAPAYGMKERMKADNVFPNKALAGKL 615
           DVHNTEKIF KMR +GY +RL QFQ L+QAY+NAK+PAYGM++R+KADN+FPNK++A +L
Sbjct: 537 DVHNTEKIFLKMREAGYTSRLRQFQALMQAYINAKSPAYGMRDRLKADNIFPNKSMAAQL 596

Query: 616 VQVDAFRKTAVSDLLD 632
            Q D F+KTA+SD+LD
Sbjct: 597 AQGDPFKKTAISDILD 596

BLAST of HG10019845 vs. TAIR 10
Match: AT1G80270.3 (PENTATRICOPEPTIDE REPEAT 596)

HSP 1 Score: 635.6 bits (1638), Expect = 4.3e-182
Identity = 322/556 (57.91%), Positives = 430/556 (77.34%), Query Frame = 0

Query: 76  ISSHGLSSQAGTENSGVEDNVEDGFSELDEKLPSTSPLENNKAADDNEGELTSESEIDDD 135
           +S+  LSS AGT++   ED++EDGFSEL+     +   + + ++D++EG+L+++ E    
Sbjct: 57  LSNRALSSSAGTKSDQEEDDLEDGFSELE----GSKSGQGSTSSDEDEGKLSADEE---- 116

Query: 136 DVDDGTQNELDLPEVDTELAEKISTKRAPSELFKAIWSAPGLSVPSVLDKWVSEGKEINR 195
                 + ELDL  ++T+++ K + ++  SELFK I SAPGLS+ S LDKWV EG EI R
Sbjct: 117 -----EEEELDL--IETDVSRK-TVEKKQSELFKTIVSAPGLSIGSALDKWVEEGNEITR 176

Query: 196 AEISLAMLNLRRRRMFGKALQFSEWLEASGQLEFIERDYASRLDLIAKVRGLHKAESYIA 255
            EI+ AML LRRRRM+G+ALQ SEWLEA+ ++E  ERDYASRLDL  K+RGL K E+ + 
Sbjct: 177 VEIAKAMLQLRRRRMYGRALQMSEWLEANKKIEMTERDYASRLDLTVKIRGLEKGEACMQ 236

Query: 256 KIPKSFQGETIYRTLLANCVVANNVKKAEEVFNKMKDLGFPITTFACNQLLLLYKRLDKR 315
           KIPKSF+GE +YRTLLANCV A NVKK+E VFNKMKDLGFP++ F C+Q+LLL+KR+D++
Sbjct: 237 KIPKSFKGEVLYRTLLANCVAAGNVKKSELVFNKMKDLGFPLSGFTCDQMLLLHKRIDRK 296

Query: 316 KIADVLLLMEKENVKPSLFTYKILIDAKGLSNDIIGMEQVVDTMKAEGIELDVTILSILA 375
           KIADVLLLMEKEN+KPSL TYKILID KG +NDI GMEQ+++TMK EG+ELD    ++ A
Sbjct: 297 KIADVLLLMEKENIKPSLLTYKILIDVKGATNDISGMEQILETMKDEGVELDFQTQALTA 356

Query: 376 KHYASGGLKDKAMAILKEMEDVNSKGSQWPCRILLPLYGELQMEDEVRRLWKICESNPRI 435
           +HY+  GLKDKA  +LKEME  + + ++   + LL +Y  L  EDEV+R+WKICES P  
Sbjct: 357 RHYSGAGLKDKAEKVLKEMEGESLEANRRAFKDLLSIYASLGREDEVKRIWKICESKPYF 416

Query: 436 EECMAAIVAWGKLKNVPEAEKIFDRVVKTWKKLSSKQYSTMVKVYADNKMLTKGKELVKQ 495
           EE +AAI A+GKL  V EAE IF+++VK  ++ SS  YS +++VY D+KML+KGK+LVK+
Sbjct: 417 EESLAAIQAFGKLNKVQEAEAIFEKIVKMDRRASSSTYSVLLRVYVDHKMLSKGKDLVKR 476

Query: 496 MEDSGCRIDPLTWDAVVKLYVEAGEVEKADSFLFKVLQKNQKKPLFTSYMVIMDQYARRG 555
           M +SGCRI+  TWDA++KLYVEAGEVEKADS L K  +++  K +  S+M IMD+Y++RG
Sbjct: 477 MAESGCRIEATTWDALIKLYVEAGEVEKADSLLDKASKQSHTKLMMNSFMYIMDEYSKRG 536

Query: 556 DVHNTEKIFHKMRLSGYVARLSQFQTLIQAYLNAKAPAYGMKERMKADNVFPNKALAGKL 615
           DVHNTEKIF KMR +GY +RL QFQ L+QAY+NAK+PAYGM++R+KADN+FPNK++A +L
Sbjct: 537 DVHNTEKIFLKMREAGYTSRLRQFQALMQAYINAKSPAYGMRDRLKADNIFPNKSMAAQL 596

Query: 616 VQVDAFRKTAVSDLLD 632
            Q D F+KTA+SD+LD
Sbjct: 597 AQGDPFKKTAISDILD 596

BLAST of HG10019845 vs. TAIR 10
Match: AT1G15480.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 619.8 bits (1597), Expect = 2.5e-177
Identity = 330/601 (54.91%), Positives = 436/601 (72.55%), Query Frame = 0

Query: 31  VFGKLEVSYSSEGNIAGFGTTAALSDRCISNERNNLATWPSTGIYISSHGLSSQAGTENS 90
           V+ KL++    E NIA   + A + D+  +  R    +W S+        LSS AG + +
Sbjct: 22  VYSKLDIPL-GERNIA-IESNALIHDKHEALPRFYELSWSSS---TGRRSLSSDAGAKTT 81

Query: 91  GVEDNVEDGFSELDEKLPSTSPLENNKAADDNEGELTSESEIDDDDVDDGTQNELDLPEV 150
           G +D++E      D+ +   +P E +  ++D E     E   D+ D+ +G + EL +PE 
Sbjct: 82  GDDDDLE------DKNVDLATPDETSSDSEDGE-----EFSGDEGDI-EGAELELHVPE- 141

Query: 151 DTELAEKISTKRAPSELFKAIWSAPGLSVPSVLDKWVSEGKEINRAEISLAMLNLRRRRM 210
                      + PSE+FKAI S  GLSV S LDKWV +GK+ NR E   AML LR+RRM
Sbjct: 142 ----------SKRPSEMFKAIVSVSGLSVGSALDKWVEQGKDTNRKEFESAMLQLRKRRM 201

Query: 211 FGKALQFSEWLEASGQLEFIERDYASRLDLIAKVRGLHKAESYIAKIPKSFQGETIYRTL 270
           FG+ALQ +EWL+ + Q E  ERDYA RLDLI+KVRG +K E+YI  IP+SF+GE +YRTL
Sbjct: 202 FGRALQMTEWLDENKQFEMEERDYACRLDLISKVRGWYKGEAYIKTIPESFRGELVYRTL 261

Query: 271 LANCVVANNVKKAEEVFNKMKDLGFPITTFACNQLLLLYKRLDKRKIADVLLLMEKENVK 330
           LAN V  +NV+ AE VFNKMKDLGFP++TF CNQ+L+LYKR+DK+KIADVLLL+EKEN+K
Sbjct: 262 LANHVATSNVRTAEAVFNKMKDLGFPLSTFTCNQMLILYKRVDKKKIADVLLLLEKENLK 321

Query: 331 PSLFTYKILIDAKGLSNDIIGMEQVVDTMKAEGIELDVTILSILAKHYASGGLKDKAMAI 390
           P+L TYKILID KG SNDI GMEQ+V+TMK+EG+ELD+   +++A+HYAS GLK+KA  +
Sbjct: 322 PNLNTYKILIDTKGSSNDITGMEQIVETMKSEGVELDLRARALIARHYASAGLKEKAEKV 381

Query: 391 LKEMEDVNSKGSQWPCRILLPLYGELQMEDEVRRLWKICESNPRIEECMAAIVAWGKLKN 450
           LKEME  + + ++  C+ LL +YG LQ EDEVRR+WKICE NPR  E +AAI+A+GK+  
Sbjct: 382 LKEMEGESLEENRHMCKDLLSVYGYLQREDEVRRVWKICEENPRYNEVLAAILAFGKIDK 441

Query: 451 VPEAEKIFDRVVKTWKKLSSKQYSTMVKVYADNKMLTKGKELVKQMEDSGCRIDPLTWDA 510
           V +AE +F++V+K   ++SS  YS +++VY D+KM+++GK+LVKQM DSGC I  LTWDA
Sbjct: 442 VKDAEAVFEKVLKMSHRVSSNVYSVLLRVYVDHKMVSEGKDLVKQMSDSGCNIGALTWDA 501

Query: 511 VVKLYVEAGEVEKADSFLFKVLQKNQKKPLFTSYMVIMDQYARRGDVHNTEKIFHKMRLS 570
           V+KLYVEAGEVEKA+S L K +Q  Q KPL +S+M +M +Y RRGDVHNTEKIF +M+ +
Sbjct: 502 VIKLYVEAGEVEKAESSLSKAIQSKQIKPLMSSFMYLMHEYVRRGDVHNTEKIFQRMKQA 561

Query: 571 GYVARLSQFQTLIQAYLNAKAPAYGMKERMKADNVFPNKALAGKLVQVDAFRKTAVSDLL 630
           GY +R   +QTLIQAY+NAKAPAYGMKERMKADN+FPNK LA +L + D F+KT +SDLL
Sbjct: 562 GYQSRFWAYQTLIQAYVNAKAPAYGMKERMKADNIFPNKRLAAQLAKADPFKKTPLSDLL 594

Query: 631 D 632
           D
Sbjct: 622 D 594

BLAST of HG10019845 vs. TAIR 10
Match: AT3G15590.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 530.8 bits (1366), Expect = 1.5e-150
Identity = 279/559 (49.91%), Positives = 394/559 (70.48%), Query Frame = 0

Query: 75  YISSHGLSSQAGTENSGVEDNVEDGFSELDEKLPSTSPLENNKAADDN--EGELTSESEI 134
           +   H LSS A  ++ G E   E+  SE +E +P +  +      DD+  E EL S    
Sbjct: 65  FFGIHKLSSIADAKDKGDEVVREEELSESEEAVPVSGDVPEGVVDDDSLFEPELGS---- 124

Query: 135 DDDDVDDGTQNELDLPEVDTELAEKISTKRAPSELFKAIWSAPGLSVPSVLDKWVSEGKE 194
           D+DD        L++ E  ++   K + KR  SEL+++I +    SV  VL+KWV EGK+
Sbjct: 125 DNDD--------LEIEEKHSKDGGKPTKKRGQSELYESIVAYK--SVKHVLEKWVKEGKD 184

Query: 195 INRAEISLAMLNLRRRRMFGKALQFSEWLEASGQLEFIERDYASRLDLIAKVRGLHKAES 254
           +++AE++LA+ NLR+R+ +   LQ  EWL A+ Q EF E +YAS+LDL+AKV  L KAE 
Sbjct: 185 LSQAEVTLAIHNLRKRKSYAMCLQLWEWLGANTQFEFTEANYASQLDLVAKVHSLQKAEI 244

Query: 255 YIAKIPKSFQGETIYRTLLANCVVANNVKKAEEVFNKMKDLGFPITTFACNQLLLLYKRL 314
           ++  IP+S +GE +YRTLLANCV+ ++V KAE++FNKMK+L FP + FACNQLLLLY   
Sbjct: 245 FLKDIPESSRGEVVYRTLLANCVLKHHVNKAEDIFNKMKELKFPTSVFACNQLLLLYSMH 304

Query: 315 DKRKIADVLLLMEKENVKPSLFTYKILIDAKGLSNDIIGMEQVVDTMKAEGIELDVTILS 374
           D++KI+DVLLLME+EN+KPS  TY  LI++KGL+ DI GME++V+T+K EGIELD  + S
Sbjct: 305 DRKKISDVLLLMERENIKPSRATYHFLINSKGLAGDITGMEKIVETIKEEGIELDPELQS 364

Query: 375 ILAKHYASGGLKDKAMAILKEMEDVNSKGSQWPCRILLPLYGELQMEDEVRRLWKICESN 434
           ILAK+Y   GLK++A  ++KE+E    + + W CR LLPLY ++   D VRRL +  + N
Sbjct: 365 ILAKYYIRAGLKERAQDLMKEIEGKGLQQTPWVCRSLLPLYADIGDSDNVRRLSRFVDQN 424

Query: 435 PRIEECMAAIVAWGKLKNVPEAEKIFDRVVKTWKKLSSKQYSTMVKVYADNKMLTKGKEL 494
           PR + C++AI AWGKLK V EAE +F+R+V+ +K      Y  ++++Y +NKML KG++L
Sbjct: 425 PRYDNCISAIKAWGKLKEVEEAEAVFERLVEKYKIFPMMPYFALMEIYTENKMLAKGRDL 484

Query: 495 VKQMEDSGCRIDPLTWDAVVKLYVEAGEVEKADSFLFKVLQKNQKKPLFTSYMVIMDQYA 554
           VK+M ++G  I P TW A+VKLY++AGEV KA+  L +  + N+ +P+FT+YM I+++YA
Sbjct: 485 VKRMGNAGIAIGPSTWHALVKLYIKAGEVGKAELILNRATKDNKMRPMFTTYMAILEEYA 544

Query: 555 RRGDVHNTEKIFHKMRLSGYVARLSQFQTLIQAYLNAKAPAYGMKERMKADNVFPNKALA 614
           +RGDVHNTEK+F KM+ + Y A+L Q++T++ AY+NAK PAYGM ERMKADNVFPNK+LA
Sbjct: 545 KRGDVHNTEKVFMKMKRASYAAQLMQYETVLLAYINAKTPAYGMIERMKADNVFPNKSLA 604

Query: 615 GKLVQVDAFRKTAVSDLLD 632
            KL QV+ F+K  VS LLD
Sbjct: 605 AKLAQVNPFKKCPVSVLLD 609

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_022933474.1  5.7e-301  86.91         pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like [Cucur...
XP_022933485.1  9.7e-301  86.91         pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like [Cucur...
XP_023539395.1  1.6e-300  85.97         uncharacterized protein LOC111800051 isoform X1 [Cucurbita pepo subsp. pepo]
XP_023539396.1  8.2e-300  86.09         pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like isofor...
XP_022971190.1  1.1e-299  86.91         pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like [Cucur...

Match Name      E-value   Identity (%)  Description
Q9C977          6.1e-181  57.91         Pentatricopeptide repeat-containing protein At1g80270, mitochondrial OS=Arabidop...
Q9XI21          3.5e-176  54.91         Pentatricopeptide repeat-containing protein At1g15480, mitochondrial OS=Arabidop...
Q9LRP6          2.1e-149  49.91         Pentatricopeptide repeat-containing protein At3g15590, mitochondrial OS=Arabidop...
O22714          6.8e-39   26.93         Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana OX...
Q9SKU6          1.2e-38   29.66         Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidop...

Match Name      E-value   Identity (%)  Description
A0A6J1EZV3      2.7e-301  86.91         pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like OS=Cuc...
A0A6J1F506      4.7e-301  86.91         pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like OS=Cuc...
A0A6J1I524      5.2e-300  86.91         pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like OS=Cuc...
A0A6J1I7V1      1.3e-298  86.27         pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like isofor...
A0A6J1I643      2.8e-298  85.99         pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like isofor...

Match Name      E-value   Identity (%)  Description
AT1G80270.1     4.3e-182  57.91         PENTATRICOPEPTIDE REPEAT 596
AT1G80270.2     4.3e-182  57.91         PENTATRICOPEPTIDE REPEAT 596
AT1G80270.3     4.3e-182  57.91         PENTATRICOPEPTIDE REPEAT 596
AT1G15480.1     2.5e-177  54.91         Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G15590.1     1.5e-150  49.91         Tetratricopeptide repeat (TPR)-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                                    Source       Source Term     Source Description               Alignment
IPR002885  Pentatricopeptide repeat                           PFAM         PF01535         PPR                              coord: 266..295; e-value: 0.0015; score: 18.7
IPR002885  Pentatricopeptide repeat                           PFAM         PF01535         PPR                              coord: 543..571; e-value: 0.039; score: 14.2
IPR002885  Pentatricopeptide repeat                           PFAM         PF01535         PPR                              coord: 473..501; e-value: 0.0042; score: 17.3
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756       TIGR00756                        coord: 473..504; e-value: 3.2E-5; score: 21.8
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756       TIGR00756                        coord: 266..298; e-value: 3.1E-5; score: 21.9
IPR002885  Pentatricopeptide repeat                           PFAM         PF13812         PPR_3                            coord: 300..342; e-value: 4.3E-4; score: 20.3
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375         PPR                              coord: 263..297; score: 8.900633
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375         PPR                              coord: 469..503; score: 9.262356
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375         PPR                              coord: 504..539; score: 8.977363
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10      Tetratricopeptide repeat domain  coord: 439..627; e-value: 2.4E-23; score: 85.0
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10      Tetratricopeptide repeat domain  coord: 263..438; e-value: 5.7E-24; score: 87.0
None       No IPR available                                   MOBIDB_LITE  mobidb-lite     disorder_prediction              coord: 125..146
None       No IPR available                                   MOBIDB_LITE  mobidb-lite     disorder_prediction              coord: 102..149
None       No IPR available                                   PANTHER      PTHR45717:SF15  OS01G0280400 PROTEIN             coord: 30..631
None       No IPR available                                   PANTHER      PTHR45717       OS12G0527900 PROTEIN             coord: 30..631

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10019845.1  HG10019845.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0005515 protein binding