HG10008034 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10008034
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationChr10: 18888700 .. 18891063 (-)
RNA-Seq ExpressionHG10008034
SyntenyHG10008034
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTTACCCGCCATCTGAATTTCAAATCCTTTCATGTCAAATCCAATCAACGATTACCAAGCTTCAATTTTTTCCGCTCTTTCCAACATGACTACAACCTGTTTGATCAAAGTCCTCCTCCAAATGCAGCCTCTACTAACCGCGTTTTGCTCAATTACTTGCACAGAGACGAAGCAATTCAATCCCTTCGCTTTTTCAAGAAGCAGATTCGGTGGGGTCTTGATGGGAATGCCGATGAGTTGACTTTGGCCCTTGCTCTGAAGGCCTGTTGTGGGCTTCCTAAACTAGGTAGACAAATTCATGGATTTGTTATTTCTTCTGGGCTTGTTTCCCATATTACAGTCTCTAACTCTTTGATGAATATGTACTGTAAATCGGGGCAGCTTGAGAGGGCTTTTAGTGTCTTCGAGAATTTACATGACCCAGATATTGTTTCGTGGAATACTATTCTTTCCGGGTTTCAGAAGAGTGAGAACGCTTTGAGTTTTGCTGTTAGGATGAATTTAAATGGGGTTAAGTTTGATCCTGTGACTTATACTACCACACTTGCCTTCTGCTTAGATGGAGAAGATTTTCTTTTTGGTTGGCAATTGCATACTCTTGCTCTGAAATGTGGATTCAAAGGTGATGTTTTTGTTGGAAATGCTCTGATTACAATGTACTCGAGATGGGAACATCTTGTGGATGCTAGACAAGTGTTCGATGAAATGTGGAGCCGGGATCGGGTGTCCTGGAGCGCGATGATTACTGGTTATGCACAAGAGGGAGGTCATGGGTTAGAAGCAATTTCAGTGTTCGTTCAAATGGTGAGAGAAGGAGTGAAGTTTGACAATGTAGCAATTACTGGAGCACTTTCTGTTTGTGGTCATGAAAGAAACCTGGAGCTTGGAAAACAGATCCATTGTTTGACCGTGAAAACAGGATATGAAACTCATACTTCTGTTGGTAATGTTCTGATCTCGATGTACTCCAAGTGTGAGATCATTGAAGATGCAAAGGCAGCCTTCAAGTTAATCAAGGACCGCAATGTGATCTCGTGGACAACTATGCTCTCATTGTATGAAGAAGATGCGGTTTCTTTGTTCAATAAGATGAGATTAGATGGAGTATATCCAAATGACGTTACGTTTATTGGATTACTTCATGCCATCACAATAAGGGGTATGGTGGAACAGGGACTAATAGTCCATGGATTATGTATCAAAGCTGACTTTGTATCAGAATTAAGTGTAGGCAATAGTCTAATAACCATGTATGCTAAATTTGAGTCAATACAAGATGCCTCAAGAGTGTTTATGGAACTTCCATATAGAGAGATAATATCATGGAATGCCTTAATTTCTGGATATGCTCAAAATGCACTATGTCAAGAAGCTTTAGAGACATTTCTTTATGCAATAATGGAATCTAAGCCAAACGAATACACCTTTGGAAGTGTTCTAAATGCAATCAGTGCTGGTGAAGGCATATCAATAAAGCATGGCCAACGATGTCATTCTCATTTGATCAAAGTTGGATTAAACTTCGACCCTATAATTTCAGGTGCCCTCCTAGACATGTATGCAAAACGTGGGAGCATTCAAGAATCCCAAAGAGTTTTCAATGAAACATCCGAAAGAAGTCAATTCGCTTGGACGGCGCTGATCTCTGCCTACGCACAACACGGAGACTACGATTTGGTGATGAAACTGTTTGAAGAGATGGAGAAGGAAAAGATAAAGCCTGATGCAGTTATCTTCCTGTCTGTCTTAACAGCATGTAGCAGGAACAGGATGGTCGACATGGGTCGTCGATTTTTCGATATGATGATCAAAGATCATATGATTGAACCAGCAGGAGAGCACTACTCTTGTATGGTTGATATGCTAGGTCGTGCAGGGCGATTGGAAGATGCGGAGGAAATGTTAGCGCGCATACCAGGAGGGCCGGGGATATCTGCATTACAGAGCTTGCTTGGAGCTTGTAGAATACATGGGAATTTGGAGATGGCAGAGAGAATGGCGAATGCTTTGATGAAGGAGGAGCCAATGGAATCAGGATCATATGTGTTGATGTCAAACCTGTATGCTCAGAAGGGAGATTGGGAAAAAGTTGCTGAAATGAGGAAGGGAATGAGAGAAAGAGGAGTGAAGAAAGAGATTGGATTCAGTTGGGTTGATGTTGGTAATTTTGGTGCTTCTAATTTATACTTACATGGCTTTTCATCAGGGGATGTATCTCATCCACAATCAGGGGAGATATGTAGAATGGCAGAATATATGGGAGCAGAAATGAAGTTTCAAAAAGACAGAGAAAGGGAGTCCCAGACCCAAGTGATTGATGAACTAACTGTAACAGATTTATTTGTACTTGATGGATGGTAA

mRNA sequence

ATGCTTACCCGCCATCTGAATTTCAAATCCTTTCATGTCAAATCCAATCAACGATTACCAAGCTTCAATTTTTTCCGCTCTTTCCAACATGACTACAACCTGTTTGATCAAAGTCCTCCTCCAAATGCAGCCTCTACTAACCGCGTTTTGCTCAATTACTTGCACAGAGACGAAGCAATTCAATCCCTTCGCTTTTTCAAGAAGCAGATTCGGTGGGGTCTTGATGGGAATGCCGATGAGTTGACTTTGGCCCTTGCTCTGAAGGCCTGTTGTGGGCTTCCTAAACTAGGTAGACAAATTCATGGATTTGTTATTTCTTCTGGGCTTGTTTCCCATATTACAGTCTCTAACTCTTTGATGAATATGTACTGTAAATCGGGGCAGCTTGAGAGGGCTTTTAGTGTCTTCGAGAATTTACATGACCCAGATATTGTTTCGTGGAATACTATTCTTTCCGGGTTTCAGAAGAGTGAGAACGCTTTGAGTTTTGCTGTTAGGATGAATTTAAATGGGGTTAAGTTTGATCCTGTGACTTATACTACCACACTTGCCTTCTGCTTAGATGGAGAAGATTTTCTTTTTGGTTGGCAATTGCATACTCTTGCTCTGAAATGTGGATTCAAAGGTGATGTTTTTGTTGGAAATGCTCTGATTACAATGTACTCGAGATGGGAACATCTTGTGGATGCTAGACAAGTGTTCGATGAAATGTGGAGCCGGGATCGGGTGTCCTGGAGCGCGATGATTACTGGTTATGCACAAGAGGGAGGTCATGGGTTAGAAGCAATTTCAGTGTTCGTTCAAATGGTGAGAGAAGGAGTGAAGTTTGACAATGTAGCAATTACTGGAGCACTTTCTGTTTGTGGTCATGAAAGAAACCTGGAGCTTGGAAAACAGATCCATTGTTTGACCGTGAAAACAGGATATGAAACTCATACTTCTGTTGGTAATGTTCTGATCTCGATGTACTCCAAGTGTGAGATCATTGAAGATGCAAAGGCAGCCTTCAAGTTAATCAAGGACCGCAATGTGATCTCGTGGACAACTATGCTCTCATTGTATGAAGAAGATGCGGTTTCTTTGTTCAATAAGATGAGATTAGATGGAGTATATCCAAATGACGTTACGTTTATTGGATTACTTCATGCCATCACAATAAGGGGTATGGTGGAACAGGGACTAATAGTCCATGGATTATGTATCAAAGCTGACTTTGTATCAGAATTAAGTGTAGGCAATAGTCTAATAACCATGTATGCTAAATTTGAGTCAATACAAGATGCCTCAAGAGTGTTTATGGAACTTCCATATAGAGAGATAATATCATGGAATGCCTTAATTTCTGGATATGCTCAAAATGCACTATGTCAAGAAGCTTTAGAGACATTTCTTTATGCAATAATGGAATCTAAGCCAAACGAATACACCTTTGGAAGTGTTCTAAATGCAATCAGTGCTGGTGAAGGCATATCAATAAAGCATGGCCAACGATGTCATTCTCATTTGATCAAAGTTGGATTAAACTTCGACCCTATAATTTCAGGTGCCCTCCTAGACATGTATGCAAAACGTGGGAGCATTCAAGAATCCCAAAGAGTTTTCAATGAAACATCCGAAAGAAGTCAATTCGCTTGGACGGCGCTGATCTCTGCCTACGCACAACACGGAGACTACGATTTGGTGATGAAACTGTTTGAAGAGATGGAGAAGGAAAAGATAAAGCCTGATGCAGTTATCTTCCTGTCTGTCTTAACAGCATGTAGCAGGAACAGGATGGTCGACATGGGTCGTCGATTTTTCGATATGATGATCAAAGATCATATGATTGAACCAGCAGGAGAGCACTACTCTTGTATGGTTGATATGCTAGGTCGTGCAGGGCGATTGGAAGATGCGGAGGAAATGTTAGCGCGCATACCAGGAGGGCCGGGGATATCTGCATTACAGAGCTTGCTTGGAGCTTGTAGAATACATGGGAATTTGGAGATGGCAGAGAGAATGGCGAATGCTTTGATGAAGGAGGAGCCAATGGAATCAGGATCATATGTGTTGATGTCAAACCTGTATGCTCAGAAGGGAGATTGGGAAAAAGTTGCTGAAATGAGGAAGGGAATGAGAGAAAGAGGAGTGAAGAAAGAGATTGGATTCAGTTGGGTTGATGTTGGTAATTTTGGTGCTTCTAATTTATACTTACATGGCTTTTCATCAGGGGATGTATCTCATCCACAATCAGGGGAGATATGTAGAATGGCAGAATATATGGGAGCAGAAATGAAGTTTCAAAAAGACAGAGAAAGGGAGTCCCAGACCCAAGTGATTGATGAACTAACTGTAACAGATTTATTTGTACTTGATGGATGGTAA

Coding sequence (CDS)

ATGCTTACCCGCCATCTGAATTTCAAATCCTTTCATGTCAAATCCAATCAACGATTACCAAGCTTCAATTTTTTCCGCTCTTTCCAACATGACTACAACCTGTTTGATCAAAGTCCTCCTCCAAATGCAGCCTCTACTAACCGCGTTTTGCTCAATTACTTGCACAGAGACGAAGCAATTCAATCCCTTCGCTTTTTCAAGAAGCAGATTCGGTGGGGTCTTGATGGGAATGCCGATGAGTTGACTTTGGCCCTTGCTCTGAAGGCCTGTTGTGGGCTTCCTAAACTAGGTAGACAAATTCATGGATTTGTTATTTCTTCTGGGCTTGTTTCCCATATTACAGTCTCTAACTCTTTGATGAATATGTACTGTAAATCGGGGCAGCTTGAGAGGGCTTTTAGTGTCTTCGAGAATTTACATGACCCAGATATTGTTTCGTGGAATACTATTCTTTCCGGGTTTCAGAAGAGTGAGAACGCTTTGAGTTTTGCTGTTAGGATGAATTTAAATGGGGTTAAGTTTGATCCTGTGACTTATACTACCACACTTGCCTTCTGCTTAGATGGAGAAGATTTTCTTTTTGGTTGGCAATTGCATACTCTTGCTCTGAAATGTGGATTCAAAGGTGATGTTTTTGTTGGAAATGCTCTGATTACAATGTACTCGAGATGGGAACATCTTGTGGATGCTAGACAAGTGTTCGATGAAATGTGGAGCCGGGATCGGGTGTCCTGGAGCGCGATGATTACTGGTTATGCACAAGAGGGAGGTCATGGGTTAGAAGCAATTTCAGTGTTCGTTCAAATGGTGAGAGAAGGAGTGAAGTTTGACAATGTAGCAATTACTGGAGCACTTTCTGTTTGTGGTCATGAAAGAAACCTGGAGCTTGGAAAACAGATCCATTGTTTGACCGTGAAAACAGGATATGAAACTCATACTTCTGTTGGTAATGTTCTGATCTCGATGTACTCCAAGTGTGAGATCATTGAAGATGCAAAGGCAGCCTTCAAGTTAATCAAGGACCGCAATGTGATCTCGTGGACAACTATGCTCTCATTGTATGAAGAAGATGCGGTTTCTTTGTTCAATAAGATGAGATTAGATGGAGTATATCCAAATGACGTTACGTTTATTGGATTACTTCATGCCATCACAATAAGGGGTATGGTGGAACAGGGACTAATAGTCCATGGATTATGTATCAAAGCTGACTTTGTATCAGAATTAAGTGTAGGCAATAGTCTAATAACCATGTATGCTAAATTTGAGTCAATACAAGATGCCTCAAGAGTGTTTATGGAACTTCCATATAGAGAGATAATATCATGGAATGCCTTAATTTCTGGATATGCTCAAAATGCACTATGTCAAGAAGCTTTAGAGACATTTCTTTATGCAATAATGGAATCTAAGCCAAACGAATACACCTTTGGAAGTGTTCTAAATGCAATCAGTGCTGGTGAAGGCATATCAATAAAGCATGGCCAACGATGTCATTCTCATTTGATCAAAGTTGGATTAAACTTCGACCCTATAATTTCAGGTGCCCTCCTAGACATGTATGCAAAACGTGGGAGCATTCAAGAATCCCAAAGAGTTTTCAATGAAACATCCGAAAGAAGTCAATTCGCTTGGACGGCGCTGATCTCTGCCTACGCACAACACGGAGACTACGATTTGGTGATGAAACTGTTTGAAGAGATGGAGAAGGAAAAGATAAAGCCTGATGCAGTTATCTTCCTGTCTGTCTTAACAGCATGTAGCAGGAACAGGATGGTCGACATGGGTCGTCGATTTTTCGATATGATGATCAAAGATCATATGATTGAACCAGCAGGAGAGCACTACTCTTGTATGGTTGATATGCTAGGTCGTGCAGGGCGATTGGAAGATGCGGAGGAAATGTTAGCGCGCATACCAGGAGGGCCGGGGATATCTGCATTACAGAGCTTGCTTGGAGCTTGTAGAATACATGGGAATTTGGAGATGGCAGAGAGAATGGCGAATGCTTTGATGAAGGAGGAGCCAATGGAATCAGGATCATATGTGTTGATGTCAAACCTGTATGCTCAGAAGGGAGATTGGGAAAAAGTTGCTGAAATGAGGAAGGGAATGAGAGAAAGAGGAGTGAAGAAAGAGATTGGATTCAGTTGGGTTGATGTTGGTAATTTTGGTGCTTCTAATTTATACTTACATGGCTTTTCATCAGGGGATGTATCTCATCCACAATCAGGGGAGATATGTAGAATGGCAGAATATATGGGAGCAGAAATGAAGTTTCAAAAAGACAGAGAAAGGGAGTCCCAGACCCAAGTGATTGATGAACTAACTGTAACAGATTTATTTGTACTTGATGGATGGTAA

Protein sequence

MLTRHLNFKSFHVKSNQRLPSFNFFRSFQHDYNLFDQSPPPNAASTNRVLLNYLHRDEAIQSLRFFKKQIRWGLDGNADELTLALALKACCGLPKLGRQIHGFVISSGLVSHITVSNSLMNMYCKSGQLERAFSVFENLHDPDIVSWNTILSGFQKSENALSFAVRMNLNGVKFDPVTYTTTLAFCLDGEDFLFGWQLHTLALKCGFKGDVFVGNALITMYSRWEHLVDARQVFDEMWSRDRVSWSAMITGYAQEGGHGLEAISVFVQMVREGVKFDNVAITGALSVCGHERNLELGKQIHCLTVKTGYETHTSVGNVLISMYSKCEIIEDAKAAFKLIKDRNVISWTTMLSLYEEDAVSLFNKMRLDGVYPNDVTFIGLLHAITIRGMVEQGLIVHGLCIKADFVSELSVGNSLITMYAKFESIQDASRVFMELPYREIISWNALISGYAQNALCQEALETFLYAIMESKPNEYTFGSVLNAISAGEGISIKHGQRCHSHLIKVGLNFDPIISGALLDMYAKRGSIQESQRVFNETSERSQFAWTALISAYAQHGDYDLVMKLFEEMEKEKIKPDAVIFLSVLTACSRNRMVDMGRRFFDMMIKDHMIEPAGEHYSCMVDMLGRAGRLEDAEEMLARIPGGPGISALQSLLGACRIHGNLEMAERMANALMKEEPMESGSYVLMSNLYAQKGDWEKVAEMRKGMRERGVKKEIGFSWVDVGNFGASNLYLHGFSSGDVSHPQSGEICRMAEYMGAEMKFQKDRERESQTQVIDELTVTDLFVLDGW
Homology
BLAST of HG10008034 vs. NCBI nr
Match: XP_038878735.1 (pentatricopeptide repeat-containing protein At4g32430, mitochondrial isoform X1 [Benincasa hispida])

HSP 1 Score: 1494.6 bits (3868), Expect = 0.0e+00
Identity = 736/786 (93.64%), Postives = 756/786 (96.18%), Query Frame = 0

Query: 1   MLTRHLNFKSFHVKSNQRLPSFNFFRSFQHDYNLFDQSPPPNAASTNRVLLNYLHRDEAI 60
           MLTRHLNFKSFHVKSNQR PSF FFRSFQHDYNLFDQSPPPNAASTNRVLLNYLHRD A 
Sbjct: 1   MLTRHLNFKSFHVKSNQRFPSFKFFRSFQHDYNLFDQSPPPNAASTNRVLLNYLHRDGAF 60

Query: 61  QSLRFFKKQIRWGLDGNADELTLALALKACCGLPKLGRQIHGFVISSGLVSHITVSNSLM 120
           QSLRFFKKQIRW LDG+ DE TLALALKACCGLPKLGRQIHGFVISSG VSHITVSNSLM
Sbjct: 61  QSLRFFKKQIRWSLDGSTDEFTLALALKACCGLPKLGRQIHGFVISSGFVSHITVSNSLM 120

Query: 121 NMYCKSGQLERAFSVFENLHDPDIVSWNTILSGFQKSENALSFAVRMNLNGVKFDPVTYT 180
           NMYCKSGQLE+AFSVFENLHDPDIVSWNTILSGFQKSENALSFAVRMNLNGVKFDPVTYT
Sbjct: 121 NMYCKSGQLEKAFSVFENLHDPDIVSWNTILSGFQKSENALSFAVRMNLNGVKFDPVTYT 180

Query: 181 TTLAFCLDGEDFLFGWQLHTLALKCGFKGDVFVGNALITMYSRWEHLVDARQVFDEMWSR 240
           T LAFCLDGEDFLFGWQLHTLALKCG KGDVFVGNALITMYSRWEHLVDARQVFDEM SR
Sbjct: 181 TALAFCLDGEDFLFGWQLHTLALKCGLKGDVFVGNALITMYSRWEHLVDARQVFDEMQSR 240

Query: 241 DRVSWSAMITGYAQEGGHGLEAISVFVQMVREGVKFDNVAITGALSVCGHERNLELGKQI 300
           DRVSWSAM+TGYAQEG HGLEAI +F+QMVREGVKFDNVAITGALSVCGHERNLELGKQI
Sbjct: 241 DRVSWSAMVTGYAQEGDHGLEAILLFIQMVREGVKFDNVAITGALSVCGHERNLELGKQI 300

Query: 301 HCLTVKTGYETHTSVGNVLISMYSKCEIIEDAKAAFKLIKDRNVISWTTMLSLYEEDAVS 360
           HCLTVKTGYETHTSVGNVLIS YSKCEIIEDAKA F+LI DRNVISWTTM+SLYEE AVS
Sbjct: 301 HCLTVKTGYETHTSVGNVLISTYSKCEIIEDAKAVFELINDRNVISWTTMISLYEEGAVS 360

Query: 361 LFNKMRLDGVYPNDVTFIGLLHAITIRGMVEQGLIVHGLCIKADFVSELSVGNSLITMYA 420
           LFNKMRLDGVYPNDVTFIGLLHAITIR MVEQGL+VHGLCIKADFVSEL +GNSLITMYA
Sbjct: 361 LFNKMRLDGVYPNDVTFIGLLHAITIRNMVEQGLMVHGLCIKADFVSELGIGNSLITMYA 420

Query: 421 KFESIQDASRVFMELPYREIISWNALISGYAQNALCQEALETFLYAIMESKPNEYTFGSV 480
           KFES+QDASRVFMELPYREIISWNALISGYAQN LCQEALETFL AIMESKPNEYTFGSV
Sbjct: 421 KFESMQDASRVFMELPYREIISWNALISGYAQNTLCQEALETFLCAIMESKPNEYTFGSV 480

Query: 481 LNAISAGEGISIKHGQRCHSHLIKVGLNFDPIISGALLDMYAKRGSIQESQRVFNETSER 540
           LNAISAGEGISIKHGQRCHSHLIKVGLNFDPIISGALLDMYAKRGSIQESQRVFNETS+R
Sbjct: 481 LNAISAGEGISIKHGQRCHSHLIKVGLNFDPIISGALLDMYAKRGSIQESQRVFNETSKR 540

Query: 541 SQFAWTALISAYAQHGDYDLVMKLFEEMEKEKIKPDAVIFLSVLTACSRNRMVDMGRRFF 600
           SQFAWTALIS YAQHGDYD VMKLFEEMEKEKIKPDAVIFLSVL ACSRNRMVDMGRRFF
Sbjct: 541 SQFAWTALISGYAQHGDYDSVMKLFEEMEKEKIKPDAVIFLSVLVACSRNRMVDMGRRFF 600

Query: 601 DMMIKDHMIEPAGEHYSCMVDMLGRAGRLEDAEEMLARIPGGPGISALQSLLGACRIHGN 660
           DMMIKDHMIEPAGEHYSCMVD+LGRAGRLE+AEEMLARIPG PGISALQSLLGACRIHGN
Sbjct: 601 DMMIKDHMIEPAGEHYSCMVDLLGRAGRLEEAEEMLARIPGEPGISALQSLLGACRIHGN 660

Query: 661 LEMAERMANALMKEEPMESGSYVLMSNLYAQKGDWEKVAEMRKGMRERGVKKEIGFSWVD 720
           +EMAERMANALMK++P+ESGSYVLMSNLYAQKGDWEKVAEMRKGMRERGVKKEIGFSWVD
Sbjct: 661 VEMAERMANALMKKKPLESGSYVLMSNLYAQKGDWEKVAEMRKGMRERGVKKEIGFSWVD 720

Query: 721 VGNFGASNLYLHGFSSGDVSHPQSGEICRMAEYMGAEMKFQKDRERESQTQVIDELTVTD 780
           VGNFGASNLYLHGFSSGDVSHPQS EICRMAEYMGAEMKFQKDRERE QT VID+L V+D
Sbjct: 721 VGNFGASNLYLHGFSSGDVSHPQSEEICRMAEYMGAEMKFQKDRERERQTHVIDQLIVSD 780

Query: 781 LFVLDG 787
           L +LDG
Sbjct: 781 LVLLDG 786

BLAST of HG10008034 vs. NCBI nr
Match: XP_022927123.1 (pentatricopeptide repeat-containing protein At4g32430, mitochondrial [Cucurbita moschata] >KAG6587926.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1426.8 bits (3692), Expect = 0.0e+00
Identity = 699/786 (88.93%), Postives = 743/786 (94.53%), Query Frame = 0

Query: 1   MLTRHLNFKSFHVKSNQRLPSFNFFRSFQHDYNLFDQSPPPNAASTNRVLLNYLHRDEAI 60
           MLTRHLNFKSFHVKS QRLPSFNFFRSFQHD+NLFDQSPPPNAASTNRVLLNYLHR+EA 
Sbjct: 1   MLTRHLNFKSFHVKSKQRLPSFNFFRSFQHDHNLFDQSPPPNAASTNRVLLNYLHRNEAF 60

Query: 61  QSLRFFKKQIRWGLDGNADELTLALALKACCGLPKLGRQIHGFVISSGLVSHITVSNSLM 120
           Q+LR FKK IRWGLDGNAD  TLALALKACCG+PKLGRQIHGFVISSGLVS+I+VSNSLM
Sbjct: 61  QALRLFKKHIRWGLDGNADGFTLALALKACCGVPKLGRQIHGFVISSGLVSNISVSNSLM 120

Query: 121 NMYCKSGQLERAFSVFENLHDPDIVSWNTILSGFQKSENALSFAVRMNLNGVKFDPVTYT 180
           NMYCKSGQLERAFSVFENLHDPDIVSWNTILSGFQKSE+ALSFA  MNLNGV+FDPVTYT
Sbjct: 121 NMYCKSGQLERAFSVFENLHDPDIVSWNTILSGFQKSESALSFAAWMNLNGVRFDPVTYT 180

Query: 181 TTLAFCLDGEDFLFGWQLHTLALKCGFKGDVFVGNALITMYSRWEHLVDARQVFDEMWSR 240
           T LAFCLDGEDF+FGWQLHTL LKCGF+ D+FVGNALITMYSRWEHLVDARQVFDEM SR
Sbjct: 181 TALAFCLDGEDFIFGWQLHTLVLKCGFQCDIFVGNALITMYSRWEHLVDARQVFDEMRSR 240

Query: 241 DRVSWSAMITGYAQEGGHGLEAISVFVQMVREGVKFDNVAITGALSVCGHERNLELGKQI 300
           DRVSWSAMITGYAQEG HGLEAI VF+QMVREGVKFDNVAITGA+SVCGHERNLELGKQI
Sbjct: 241 DRVSWSAMITGYAQEGDHGLEAILVFIQMVREGVKFDNVAITGAVSVCGHERNLELGKQI 300

Query: 301 HCLTVKTGYETHTSVGNVLISMYSKCEIIEDAKAAFKLIKDRNVISWTTMLSLYEEDAVS 360
           HCLTVK G+ETHTSVGNVLIS YSKCE+I+DAK+ F++I DRNVISWTTM+SLYEEDAVS
Sbjct: 301 HCLTVKIGFETHTSVGNVLISTYSKCEVIDDAKSVFEIIDDRNVISWTTMISLYEEDAVS 360

Query: 361 LFNKMRLDGVYPNDVTFIGLLHAITIRGMVEQGLIVHGLCIKADFVSELSVGNSLITMYA 420
           LFN+MRLDGVYPNDVTFIGLLHAIT+R MVEQGL+VHGLCIKADFV+EL VGNSLITMYA
Sbjct: 361 LFNEMRLDGVYPNDVTFIGLLHAITMRNMVEQGLMVHGLCIKADFVTELGVGNSLITMYA 420

Query: 421 KFESIQDASRVFMELPYREIISWNALISGYAQNALCQEALETFLYAIMESKPNEYTFGSV 480
           KFES+QDASRVFMELPYREIISWNALISGYAQN LCQEALETFL AIMESKPNEYTFGSV
Sbjct: 421 KFESMQDASRVFMELPYREIISWNALISGYAQNGLCQEALETFLCAIMESKPNEYTFGSV 480

Query: 481 LNAISAGEGISIKHGQRCHSHLIKVGLNFDPIISGALLDMYAKRGSIQESQRVFNETSER 540
           LNAIS GE IS+KHGQRCHSHLIKVGLN  PIISGALLDMYAKRGSIQESQRVFNE S+R
Sbjct: 481 LNAISGGEDISLKHGQRCHSHLIKVGLNSGPIISGALLDMYAKRGSIQESQRVFNEASKR 540

Query: 541 SQFAWTALISAYAQHGDYDLVMKLFEEMEKEKIKPDAVIFLSVLTACSRNRMVDMGRRFF 600
           SQFAWTALIS YAQHGDYD VMKLFEEM+KEKI PDAVIFLSVL ACSRNRMVDMGR+FF
Sbjct: 541 SQFAWTALISGYAQHGDYDTVMKLFEEMKKEKINPDAVIFLSVLAACSRNRMVDMGRQFF 600

Query: 601 DMMIKDHMIEPAGEHYSCMVDMLGRAGRLEDAEEMLARIPGGPGISALQSLLGACRIHGN 660
           +MMI DHMIEP  EHYSCMVDMLGRAG+LE+A+EMLARIPGGPGISALQSLLGACRIHGN
Sbjct: 601 NMMINDHMIEPEAEHYSCMVDMLGRAGQLEEAQEMLARIPGGPGISALQSLLGACRIHGN 660

Query: 661 LEMAERMANALMKEEPMESGSYVLMSNLYAQKGDWEKVAEMRKGMRERGVKKEIGFSWVD 720
           ++MAERMA+ALMK EP+ESGSYVLMSNLYAQKGDWEKVAE+RKGM+E+GVKKEIGFSWVD
Sbjct: 661 VDMAERMADALMKSEPLESGSYVLMSNLYAQKGDWEKVAEVRKGMKEKGVKKEIGFSWVD 720

Query: 721 VGNFGASNLYLHGFSSGDVSHPQSGEICRMAEYMGAEMKFQKDRERESQTQVIDELTVTD 780
           VGNFGAS LYLHGFSSGDVSHPQS EICRMA+YMGAEMKF KDRER+ QT +ID L +TD
Sbjct: 721 VGNFGAS-LYLHGFSSGDVSHPQSEEICRMAQYMGAEMKFLKDRERQRQTHMIDGLPLTD 780

Query: 781 LFVLDG 787
           LFV DG
Sbjct: 781 LFVFDG 785

BLAST of HG10008034 vs. NCBI nr
Match: XP_038878737.1 (pentatricopeptide repeat-containing protein At4g32430, mitochondrial isoform X2 [Benincasa hispida])

HSP 1 Score: 1425.2 bits (3688), Expect = 0.0e+00
Identity = 710/786 (90.33%), Postives = 730/786 (92.88%), Query Frame = 0

Query: 1   MLTRHLNFKSFHVKSNQRLPSFNFFRSFQHDYNLFDQSPPPNAASTNRVLLNYLHRDEAI 60
           MLTRHLNFKSFHVKSNQR PSF FFRSFQHDYNLFDQSPPPNAASTNRVLLNYLHRD A 
Sbjct: 1   MLTRHLNFKSFHVKSNQRFPSFKFFRSFQHDYNLFDQSPPPNAASTNRVLLNYLHRDGAF 60

Query: 61  QSLRFFKKQIRWGLDGNADELTLALALKACCGLPKLGRQIHGFVISSGLVSHITVSNSLM 120
           QSLRFFKKQIRW LDG+ DE TLALALKACCGLPKLGRQIHGFVISSG VSHITVSNSLM
Sbjct: 61  QSLRFFKKQIRWSLDGSTDEFTLALALKACCGLPKLGRQIHGFVISSGFVSHITVSNSLM 120

Query: 121 NMYCKSGQLERAFSVFENLHDPDIVSWNTILSGFQKSENALSFAVRMNLNGVKFDPVTYT 180
           NMYCKSGQLE+AFSVFENLHDPDIVSWNTILSGFQKSENALSFAVRMNLNGVKFDPVTYT
Sbjct: 121 NMYCKSGQLEKAFSVFENLHDPDIVSWNTILSGFQKSENALSFAVRMNLNGVKFDPVTYT 180

Query: 181 TTLAFCLDGEDFLFGWQLHTLALKCGFKGDVFVGNALITMYSRWEHLVDARQVFDEMWSR 240
           T LAFCLDGEDFLFGWQLHTLALKCG KGDVFVGNALITMYSRWEHLVDARQVFDEM SR
Sbjct: 181 TALAFCLDGEDFLFGWQLHTLALKCGLKGDVFVGNALITMYSRWEHLVDARQVFDEMQSR 240

Query: 241 DRVSWSAMITGYAQEGGHGLEAISVFVQMVREGVKFDNVAITGALSVCGHERNLELGKQI 300
           DRVSWSAM+TGYAQEG HGLEAI +F+QMVREGVKFDNVAITGALSVCGHERNLELGKQI
Sbjct: 241 DRVSWSAMVTGYAQEGDHGLEAILLFIQMVREGVKFDNVAITGALSVCGHERNLELGKQI 300

Query: 301 HCLTVKTGYETHTSVGNVLISMYSKCEIIEDAKAAFKLIKDRNVISWTTMLSLYEEDAVS 360
           HCLTVKTGYETHTSVGNVLIS YSKCEIIEDAKA F+LI DRNVISWTTM+SLYEE AVS
Sbjct: 301 HCLTVKTGYETHTSVGNVLISTYSKCEIIEDAKAVFELINDRNVISWTTMISLYEEGAVS 360

Query: 361 LFNKMRLDGVYPNDVTFIGLLHAITIRGMVEQGLIVHGLCIKADFVSELSVGNSLITMYA 420
           LFNKMRLDGVYPNDVTFIGLLHAITIR MVEQGL+VHGLCIKADFVSEL +GNSLITMYA
Sbjct: 361 LFNKMRLDGVYPNDVTFIGLLHAITIRNMVEQGLMVHGLCIKADFVSELGIGNSLITMYA 420

Query: 421 KFESIQDASRVFMELPYREIISWNALISGYAQNALCQEALETFLYAIMESKPNEYTFGSV 480
           KFES+QDASRVFMELPYREIISWNALISGYAQN LCQEALETFL AIMESKPNEYTFGSV
Sbjct: 421 KFESMQDASRVFMELPYREIISWNALISGYAQNTLCQEALETFLCAIMESKPNEYTFGSV 480

Query: 481 LNAISAGEGISIKHGQRCHSHLIKVGLNFDPIISGALLDMYAKRGSIQESQRVFNETSER 540
           LNAISAGE                          GALLDMYAKRGSIQESQRVFNETS+R
Sbjct: 481 LNAISAGE--------------------------GALLDMYAKRGSIQESQRVFNETSKR 540

Query: 541 SQFAWTALISAYAQHGDYDLVMKLFEEMEKEKIKPDAVIFLSVLTACSRNRMVDMGRRFF 600
           SQFAWTALIS YAQHGDYD VMKLFEEMEKEKIKPDAVIFLSVL ACSRNRMVDMGRRFF
Sbjct: 541 SQFAWTALISGYAQHGDYDSVMKLFEEMEKEKIKPDAVIFLSVLVACSRNRMVDMGRRFF 600

Query: 601 DMMIKDHMIEPAGEHYSCMVDMLGRAGRLEDAEEMLARIPGGPGISALQSLLGACRIHGN 660
           DMMIKDHMIEPAGEHYSCMVD+LGRAGRLE+AEEMLARIPG PGISALQSLLGACRIHGN
Sbjct: 601 DMMIKDHMIEPAGEHYSCMVDLLGRAGRLEEAEEMLARIPGEPGISALQSLLGACRIHGN 660

Query: 661 LEMAERMANALMKEEPMESGSYVLMSNLYAQKGDWEKVAEMRKGMRERGVKKEIGFSWVD 720
           +EMAERMANALMK++P+ESGSYVLMSNLYAQKGDWEKVAEMRKGMRERGVKKEIGFSWVD
Sbjct: 661 VEMAERMANALMKKKPLESGSYVLMSNLYAQKGDWEKVAEMRKGMRERGVKKEIGFSWVD 720

Query: 721 VGNFGASNLYLHGFSSGDVSHPQSGEICRMAEYMGAEMKFQKDRERESQTQVIDELTVTD 780
           VGNFGASNLYLHGFSSGDVSHPQS EICRMAEYMGAEMKFQKDRERE QT VID+L V+D
Sbjct: 721 VGNFGASNLYLHGFSSGDVSHPQSEEICRMAEYMGAEMKFQKDRERERQTHVIDQLIVSD 760

Query: 781 LFVLDG 787
           L +LDG
Sbjct: 781 LVLLDG 760

BLAST of HG10008034 vs. NCBI nr
Match: KAG7021814.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1425.2 bits (3688), Expect = 0.0e+00
Identity = 699/786 (88.93%), Postives = 742/786 (94.40%), Query Frame = 0

Query: 1   MLTRHLNFKSFHVKSNQRLPSFNFFRSFQHDYNLFDQSPPPNAASTNRVLLNYLHRDEAI 60
           MLTRHLNFKSFHVKS QRLPSFNFFRSFQHD+NLFDQSPPPNAASTNRVLLNYLHR+EA 
Sbjct: 1   MLTRHLNFKSFHVKSKQRLPSFNFFRSFQHDHNLFDQSPPPNAASTNRVLLNYLHRNEAF 60

Query: 61  QSLRFFKKQIRWGLDGNADELTLALALKACCGLPKLGRQIHGFVISSGLVSHITVSNSLM 120
           Q+LR FKK IRWGLDGN DE TLALALKACCG+PKLGRQIHGFVISSGLVS+I+VSNSLM
Sbjct: 61  QALRLFKKHIRWGLDGNVDEFTLALALKACCGVPKLGRQIHGFVISSGLVSNISVSNSLM 120

Query: 121 NMYCKSGQLERAFSVFENLHDPDIVSWNTILSGFQKSENALSFAVRMNLNGVKFDPVTYT 180
           NMYCKSGQLERAFSVFENLHDPDIVSWNTILSGFQKSE+ALSFA  MNLNGV+FDPVTYT
Sbjct: 121 NMYCKSGQLERAFSVFENLHDPDIVSWNTILSGFQKSESALSFAAWMNLNGVQFDPVTYT 180

Query: 181 TTLAFCLDGEDFLFGWQLHTLALKCGFKGDVFVGNALITMYSRWEHLVDARQVFDEMWSR 240
           T LAFCLDGEDF+FGWQLHTL LKCGF+ DVFVGNALITMYSRWEHLV ARQVFDEM SR
Sbjct: 181 TALAFCLDGEDFIFGWQLHTLVLKCGFQFDVFVGNALITMYSRWEHLVGARQVFDEMRSR 240

Query: 241 DRVSWSAMITGYAQEGGHGLEAISVFVQMVREGVKFDNVAITGALSVCGHERNLELGKQI 300
           DRVSWSAMITGYAQEG HGLEAI VF+QMVREGVKFDNVAITGA+SVCGHERNLELGKQI
Sbjct: 241 DRVSWSAMITGYAQEGDHGLEAILVFIQMVREGVKFDNVAITGAVSVCGHERNLELGKQI 300

Query: 301 HCLTVKTGYETHTSVGNVLISMYSKCEIIEDAKAAFKLIKDRNVISWTTMLSLYEEDAVS 360
           HCLTVK G+ETHTSVGNVLIS YSKCE+I+DAK+ F++I DRNVISWTTM+SLYEEDAVS
Sbjct: 301 HCLTVKIGFETHTSVGNVLISTYSKCEVIDDAKSVFEIIDDRNVISWTTMISLYEEDAVS 360

Query: 361 LFNKMRLDGVYPNDVTFIGLLHAITIRGMVEQGLIVHGLCIKADFVSELSVGNSLITMYA 420
           LFN+MRLDGVYPNDVTFIGLLHAIT+R MVEQGL+VHGLCIKADFV+EL VGNSLITMYA
Sbjct: 361 LFNEMRLDGVYPNDVTFIGLLHAITMRNMVEQGLMVHGLCIKADFVTELGVGNSLITMYA 420

Query: 421 KFESIQDASRVFMELPYREIISWNALISGYAQNALCQEALETFLYAIMESKPNEYTFGSV 480
           KFES+QDASRVFMELPYREIISWNALISGYAQN LCQEALETFL AIMESKPNEYTFGSV
Sbjct: 421 KFESMQDASRVFMELPYREIISWNALISGYAQNGLCQEALETFLCAIMESKPNEYTFGSV 480

Query: 481 LNAISAGEGISIKHGQRCHSHLIKVGLNFDPIISGALLDMYAKRGSIQESQRVFNETSER 540
           LNAIS GE IS+KHGQRCHSHLIKVGLN  PIISGALLDMYAKRGSIQESQRVFNE S+R
Sbjct: 481 LNAISGGEDISLKHGQRCHSHLIKVGLNSGPIISGALLDMYAKRGSIQESQRVFNEASKR 540

Query: 541 SQFAWTALISAYAQHGDYDLVMKLFEEMEKEKIKPDAVIFLSVLTACSRNRMVDMGRRFF 600
           SQFAWTALIS YAQHGDYD VMKLFEEM+KEKI PDAVIFLSVL ACSRNRMVDMGR+FF
Sbjct: 541 SQFAWTALISGYAQHGDYDTVMKLFEEMKKEKINPDAVIFLSVLAACSRNRMVDMGRQFF 600

Query: 601 DMMIKDHMIEPAGEHYSCMVDMLGRAGRLEDAEEMLARIPGGPGISALQSLLGACRIHGN 660
           +MMI DHMIEP  EHYSCMVDMLGRAG+LE+A+EMLARIPGGPGISALQSLLGACRIHGN
Sbjct: 601 NMMINDHMIEPEAEHYSCMVDMLGRAGQLEEAQEMLARIPGGPGISALQSLLGACRIHGN 660

Query: 661 LEMAERMANALMKEEPMESGSYVLMSNLYAQKGDWEKVAEMRKGMRERGVKKEIGFSWVD 720
           ++MAERMA+ALMK EP+ESGSYVLMSNLYAQKGDWEKVAE+RKGM+E+GVKKEIGFSWVD
Sbjct: 661 VDMAERMADALMKSEPLESGSYVLMSNLYAQKGDWEKVAEVRKGMKEKGVKKEIGFSWVD 720

Query: 721 VGNFGASNLYLHGFSSGDVSHPQSGEICRMAEYMGAEMKFQKDRERESQTQVIDELTVTD 780
           VGNFGAS LYLHGFSSGDVSHPQS EICRMA+YMGAEMKF KDRER+ QT +ID L +TD
Sbjct: 721 VGNFGAS-LYLHGFSSGDVSHPQSEEICRMAQYMGAEMKFLKDRERQRQTHMIDGLPLTD 780

Query: 781 LFVLDG 787
           LFV DG
Sbjct: 781 LFVFDG 785

BLAST of HG10008034 vs. NCBI nr
Match: XP_023531439.1 (pentatricopeptide repeat-containing protein At4g32430, mitochondrial [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1416.7 bits (3666), Expect = 0.0e+00
Identity = 695/786 (88.42%), Postives = 741/786 (94.27%), Query Frame = 0

Query: 1   MLTRHLNFKSFHVKSNQRLPSFNFFRSFQHDYNLFDQSPPPNAASTNRVLLNYLHRDEAI 60
           MLTRHLNFKSFHVKS QRL SFNFFRSF+HD+NLFDQSPPPNAASTNRVLLNYLHR+EA 
Sbjct: 1   MLTRHLNFKSFHVKSKQRLTSFNFFRSFRHDHNLFDQSPPPNAASTNRVLLNYLHRNEAF 60

Query: 61  QSLRFFKKQIRWGLDGNADELTLALALKACCGLPKLGRQIHGFVISSGLVSHITVSNSLM 120
           Q+LR FKK IRWGLDGNADE TLALALKACCG+PKLGRQIHGFVISSGLVS+++VSNSLM
Sbjct: 61  QALRLFKKHIRWGLDGNADEFTLALALKACCGVPKLGRQIHGFVISSGLVSNVSVSNSLM 120

Query: 121 NMYCKSGQLERAFSVFENLHDPDIVSWNTILSGFQKSENALSFAVRMNLNGVKFDPVTYT 180
           NMYCKSGQLERAFSVFENL DPDIVSWNTILSGFQKSE+ALSFA  MNLNGV+FDPVTYT
Sbjct: 121 NMYCKSGQLERAFSVFENLLDPDIVSWNTILSGFQKSESALSFAAWMNLNGVQFDPVTYT 180

Query: 181 TTLAFCLDGEDFLFGWQLHTLALKCGFKGDVFVGNALITMYSRWEHLVDARQVFDEMWSR 240
           T L+FCLDGEDF+FGWQLHTL LKCGF+ DVFVGNALITMYSRWEHLVDARQVFDEM SR
Sbjct: 181 TALSFCLDGEDFMFGWQLHTLVLKCGFQCDVFVGNALITMYSRWEHLVDARQVFDEMRSR 240

Query: 241 DRVSWSAMITGYAQEGGHGLEAISVFVQMVREGVKFDNVAITGALSVCGHERNLELGKQI 300
           DRVSWSAMITGYAQEG HGLEAI VF+QMVREGVKFDNVAITGA+SVCGHERNLELGKQI
Sbjct: 241 DRVSWSAMITGYAQEGDHGLEAILVFIQMVREGVKFDNVAITGAVSVCGHERNLELGKQI 300

Query: 301 HCLTVKTGYETHTSVGNVLISMYSKCEIIEDAKAAFKLIKDRNVISWTTMLSLYEEDAVS 360
           HCLTVK G+ETHTSVGNVLIS YSKCE+I+DAK+ F++I DRNVISWTTM+SLYEEDAVS
Sbjct: 301 HCLTVKIGFETHTSVGNVLISTYSKCEVIDDAKSVFEIIDDRNVISWTTMISLYEEDAVS 360

Query: 361 LFNKMRLDGVYPNDVTFIGLLHAITIRGMVEQGLIVHGLCIKADFVSELSVGNSLITMYA 420
           LFN+MRLDGVYPNDVTFIGLLHAIT+R MVEQGL+VHGLCIKADFV+EL VGNSLITMYA
Sbjct: 361 LFNEMRLDGVYPNDVTFIGLLHAITMRNMVEQGLMVHGLCIKADFVTELGVGNSLITMYA 420

Query: 421 KFESIQDASRVFMELPYREIISWNALISGYAQNALCQEALETFLYAIMESKPNEYTFGSV 480
           KFES+QDASRVFMELPYREIISWNALISGYAQN LCQEALETFL AIMESKPNEYTFGSV
Sbjct: 421 KFESMQDASRVFMELPYREIISWNALISGYAQNGLCQEALETFLCAIMESKPNEYTFGSV 480

Query: 481 LNAISAGEGISIKHGQRCHSHLIKVGLNFDPIISGALLDMYAKRGSIQESQRVFNETSER 540
           LNAIS GE IS+KHGQRCHSHLIKVGLN  PIISGALLDMYAKRGSIQESQRVF E S+R
Sbjct: 481 LNAISGGEDISLKHGQRCHSHLIKVGLNSGPIISGALLDMYAKRGSIQESQRVFKEASKR 540

Query: 541 SQFAWTALISAYAQHGDYDLVMKLFEEMEKEKIKPDAVIFLSVLTACSRNRMVDMGRRFF 600
           SQFAWTALIS YAQHGDYD VMKLFEEM+KEKI PDAVIFLSVL ACSRNRMVDMGR+FF
Sbjct: 541 SQFAWTALISGYAQHGDYDTVMKLFEEMKKEKINPDAVIFLSVLAACSRNRMVDMGRQFF 600

Query: 601 DMMIKDHMIEPAGEHYSCMVDMLGRAGRLEDAEEMLARIPGGPGISALQSLLGACRIHGN 660
           +MMI DHMIEP  EHYSCMVDMLGRAG+LE+A+EMLARIPGGPGISALQSLLGACRIHGN
Sbjct: 601 NMMINDHMIEPEAEHYSCMVDMLGRAGQLEEAQEMLARIPGGPGISALQSLLGACRIHGN 660

Query: 661 LEMAERMANALMKEEPMESGSYVLMSNLYAQKGDWEKVAEMRKGMRERGVKKEIGFSWVD 720
           ++MAERMA+ALMK EP+ESGSYVLMSNLYAQKGDWEKVAE+RKGM+E+GVKKEIGFSWVD
Sbjct: 661 VDMAERMADALMKSEPLESGSYVLMSNLYAQKGDWEKVAEVRKGMKEKGVKKEIGFSWVD 720

Query: 721 VGNFGASNLYLHGFSSGDVSHPQSGEICRMAEYMGAEMKFQKDRERESQTQVIDELTVTD 780
           VGNFGAS LYLHGFSSGDVSHPQS EICRMA+YMGAEMKF KDRER+ QT +ID L +TD
Sbjct: 721 VGNFGAS-LYLHGFSSGDVSHPQSEEICRMAQYMGAEMKFLKDRERQRQTHMIDGLPLTD 780

Query: 781 LFVLDG 787
           LFV DG
Sbjct: 781 LFVFDG 785

BLAST of HG10008034 vs. ExPASy Swiss-Prot
Match: Q84MA3 (Pentatricopeptide repeat-containing protein At4g32430, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E40 PE=2 SV=1)

HSP 1 Score: 823.5 bits (2126), Expect = 2.0e-237
Identity = 399/740 (53.92%), Postives = 541/740 (73.11%), Query Frame = 0

Query: 24  FFRSFQHDYNLFDQSPPPNA-ASTNRVLLNYLHRDEAIQSLRFFKKQIRWGLDG-NADEL 83
           F+  ++  + LFD S   NA  S N  +   L R+   ++L  FK+ ++ G  G + DE+
Sbjct: 20  FYSPYRIAHKLFDGSSQRNATTSINHSISESLRRNSPARALSIFKENLQLGYFGRHMDEV 79

Query: 84  TLALALKACCGLPKLGRQIHGFVISSGLVSHITVSNSLMNMYCKSGQLERAFSVFENLHD 143
           TL LALKAC G  K G QIHGF  +SG  S + VSN++M MY K+G+ + A  +FENL D
Sbjct: 80  TLCLALKACRGDLKRGCQIHGFSTTSGFTSFVCVSNAVMGMYRKAGRFDNALCIFENLVD 139

Query: 144 PDIVSWNTILSGFQKSENALSFAVRMNLNGVKFDPVTYTTTLAFCLDGEDFLFGWQLHTL 203
           PD+VSWNTILSGF  ++ AL+F VRM   GV FD  TY+T L+FC+  E FL G QL + 
Sbjct: 140 PDVVSWNTILSGFDDNQIALNFVVRMKSAGVVFDAFTYSTALSFCVGSEGFLLGLQLQST 199

Query: 204 ALKCGFKGDVFVGNALITMYSRWEHLVDARQVFDEMWSRDRVSWSAMITGYAQEGGHGLE 263
            +K G + D+ VGN+ ITMYSR      AR+VFDEM  +D +SW+++++G +QEG  G E
Sbjct: 200 VVKTGLESDLVVGNSFITMYSRSGSFRGARRVFDEMSFKDMISWNSLLSGLSQEGTFGFE 259

Query: 264 AISVFVQMVREGVKFDNVAITGALSVCGHERNLELGKQIHCLTVKTGYETHTSVGNVLIS 323
           A+ +F  M+REGV+ D+V+ T  ++ C HE +L+L +QIH L +K GYE+   VGN+L+S
Sbjct: 260 AVVIFRDMMREGVELDHVSFTSVITTCCHETDLKLARQIHGLCIKRGYESLLEVGNILMS 319

Query: 324 MYSKCEIIEDAKAAFKLIKDRNVISWTTMLSLYEEDAVSLFNKMRLDGVYPNDVTFIGLL 383
            YSKC ++E  K+ F  + +RNV+SWTTM+S  ++DAVS+F  MR DGVYPN+VTF+GL+
Sbjct: 320 RYSKCGVLEAVKSVFHQMSERNVVSWTTMISSNKDDAVSIFLNMRFDGVYPNEVTFVGLI 379

Query: 384 HAITIRGMVEQGLIVHGLCIKADFVSELSVGNSLITMYAKFESIQDASRVFMELPYREII 443
           +A+     +++GL +HGLCIK  FVSE SVGNS IT+YAKFE+++DA + F ++ +REII
Sbjct: 380 NAVKCNEQIKEGLKIHGLCIKTGFVSEPSVGNSFITLYAKFEALEDAKKAFEDITFREII 439

Query: 444 SWNALISGYAQNALCQEALETFLYAIMESKPNEYTFGSVLNAISAGEGISIKHGQRCHSH 503
           SWNA+ISG+AQN    EAL+ FL A  E+ PNEYTFGSVLNAI+  E IS+K GQRCH+H
Sbjct: 440 SWNAMISGFAQNGFSHEALKMFLSAAAETMPNEYTFGSVLNAIAFAEDISVKQGQRCHAH 499

Query: 504 LIKVGLNFDPIISGALLDMYAKRGSIQESQRVFNETSERSQFAWTALISAYAQHGDYDLV 563
           L+K+GLN  P++S ALLDMYAKRG+I ES++VFNE S+++QF WT++ISAY+ HGD++ V
Sbjct: 500 LLKLGLNSCPVVSSALLDMYAKRGNIDESEKVFNEMSQKNQFVWTSIISAYSSHGDFETV 559

Query: 564 MKLFEEMEKEKIKPDAVIFLSVLTACSRNRMVDMGRRFFDMMIKDHMIEPAGEHYSCMVD 623
           M LF +M KE + PD V FLSVLTAC+R  MVD G   F+MMI+ + +EP+ EHYSCMVD
Sbjct: 560 MNLFHKMIKENVAPDLVTFLSVLTACNRKGMVDKGYEIFNMMIEVYNLEPSHEHYSCMVD 619

Query: 624 MLGRAGRLEDAEEMLARIPGGPGISALQSLLGACRIHGNLEMAERMANALMKEEPMESGS 683
           MLGRAGRL++AEE+++ +PGGPG S LQS+LG+CR+HGN++M  ++A   M+ +P  SGS
Sbjct: 620 MLGRAGRLKEAEELMSEVPGGPGESMLQSMLGSCRLHGNVKMGAKVAELAMEMKPELSGS 679

Query: 684 YVLMSNLYAQKGDWEKVAEMRKGMRERGVKKEIGFSWVDVGNFGASNLYLHGFSSGDVSH 743
           YV M N+YA+K +W+K AE+RK MR++ V KE GFSW+DVG+   S L + GFSSGD SH
Sbjct: 680 YVQMYNIYAEKEEWDKAAEIRKAMRKKNVSKEAGFSWIDVGDTEGS-LTMQGFSSGDKSH 739

Query: 744 PQSGEICRMAEYMGAEMKFQ 762
           P+S EI RM E +G EM  +
Sbjct: 740 PKSDEIYRMVEIIGLEMNLE 758

BLAST of HG10008034 vs. ExPASy Swiss-Prot
Match: Q9ZUW3 (Pentatricopeptide repeat-containing protein At2g27610 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H60 PE=2 SV=1)

HSP 1 Score: 437.6 bits (1124), Expect = 3.0e-121
Identity = 241/738 (32.66%), Postives = 408/738 (55.28%), Query Frame = 0

Query: 32  YNLFDQSPPPNAASTNRVLLNYLHRDEAIQSLRFFKKQIRWGLDGNADELTLALALKACC 91
           +NLFD+SP  +  S   +L  +       ++ R F    R G++ +    +  L + A  
Sbjct: 47  HNLFDKSPGRDRESYISLLFGFSRDGRTQEAKRLFLNIHRLGMEMDCSIFSSVLKVSATL 106

Query: 92  GLPKLGRQIHGFVISSGLVSHITVSNSLMNMYCKSGQLERAFSVFENLHDPDIVSWNTIL 151
                GRQ+H   I  G +  ++V  SL++ Y K    +    VF+ + + ++V+W T++
Sbjct: 107 CDELFGRQLHCQCIKFGFLDDVSVGTSLVDTYMKGSNFKDGRKVFDEMKERNVVTWTTLI 166

Query: 152 SGFQK---SENALSFAVRMNLNGVKFDPVTYTTTLAFCLDGEDFLFGWQLHTLALKCGFK 211
           SG+ +   ++  L+  +RM   G + +  T+   L    +      G Q+HT+ +K G  
Sbjct: 167 SGYARNSMNDEVLTLFMRMQNEGTQPNSFTFAAALGVLAEEGVGGRGLQVHTVVVKNGLD 226

Query: 212 GDVFVGNALITMYSRWEHLVDARQVFDEMWSRDRVSWSAMITGYAQEGGHGLEAISVFVQ 271
             + V N+LI +Y +  ++  AR +FD+   +  V+W++MI+GYA   G  LEA+ +F  
Sbjct: 227 KTIPVSNSLINLYLKCGNVRKARILFDKTEVKSVVTWNSMISGYA-ANGLDLEALGMFYS 286

Query: 272 MVREGVKFDNVAITGALSVCGHERNLELGKQIHCLTVKTGYETHTSVGNVLISMYSKCEI 331
           M    V+    +    + +C + + L   +Q+HC  VK G+    ++   L+  YSKC  
Sbjct: 287 MRLNYVRLSESSFASVIKLCANLKELRFTEQLHCSVVKYGFLFDQNIRTALMVAYSKCTA 346

Query: 332 IEDAKAAFKLIK-DRNVISWTTMLSLY-----EEDAVSLFNKMRLDGVYPNDVTFIGLLH 391
           + DA   FK I    NV+SWT M+S +     +E+AV LF++M+  GV PN+ T+  +L 
Sbjct: 347 MLDALRLFKEIGCVGNVVSWTAMISGFLQNDGKEEAVDLFSEMKRKGVRPNEFTYSVILT 406

Query: 392 AITIRGMVEQGLIVHGLCIKADFVSELSVGNSLITMYAKFESIQDASRVFMELPYREIIS 451
           A+ +    E    VH   +K ++    +VG +L+  Y K   +++A++VF  +  ++I++
Sbjct: 407 ALPVISPSE----VHAQVVKTNYERSSTVGTALLDAYVKLGKVEEAAKVFSGIDDKDIVA 466

Query: 452 WNALISGYAQNALCQEALETFLYAIMES-KPNEYTFGSVLNAISAGEGISIKHGQRCHSH 511
           W+A+++GYAQ    + A++ F        KPNE+TF S+LN + A    S+  G++ H  
Sbjct: 467 WSAMLAGYAQTGETEAAIKMFGELTKGGIKPNEFTFSSILN-VCAATNASMGQGKQFHGF 526

Query: 512 LIKVGLNFDPIISGALLDMYAKRGSIQESQRVFNETSERSQFAWTALISAYAQHGDYDLV 571
            IK  L+    +S ALL MYAK+G+I+ ++ VF    E+   +W ++IS YAQHG     
Sbjct: 527 AIKSRLDSSLCVSSALLTMYAKKGNIESAEEVFKRQREKDLVSWNSMISGYAQHGQAMKA 586

Query: 572 MKLFEEMEKEKIKPDAVIFLSVLTACSRNRMVDMGRRFFDMMIKDHMIEPAGEHYSCMVD 631
           + +F+EM+K K+K D V F+ V  AC+   +V+ G ++FD+M++D  I P  EH SCMVD
Sbjct: 587 LDVFKEMKKRKVKMDGVTFIGVFAACTHAGLVEEGEKYFDIMVRDCKIAPTKEHNSCMVD 646

Query: 632 MLGRAGRLEDAEEMLARIPGGPGISALQSLLGACRIHGNLEMAERMANALMKEEPMESGS 691
           +  RAG+LE A +++  +P   G +  +++L ACR+H   E+    A  ++  +P +S +
Sbjct: 647 LYSRAGQLEKAMKVIENMPNPAGSTIWRTILAACRVHKKTELGRLAAEKIIAMKPEDSAA 706

Query: 692 YVLMSNLYAQKGDWEKVAEMRKGMRERGVKKEIGFSWVDVGNFGASNLYLHGFSSGDVSH 751
           YVL+SN+YA+ GDW++ A++RK M ER VKKE G+SW++V N        + F +GD SH
Sbjct: 707 YVLLSNMYAESGDWQERAKVRKLMNERNVKKEPGYSWIEVKN------KTYSFLAGDRSH 766

Query: 752 PQSGEICRMAEYMGAEMK 760
           P   +I    E +   +K
Sbjct: 767 PLKDQIYMKLEDLSTRLK 772

BLAST of HG10008034 vs. ExPASy Swiss-Prot
Match: Q9SS83 (Pentatricopeptide repeat-containing protein At3g09040, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E88 PE=2 SV=1)

HSP 1 Score: 432.2 bits (1110), Expect = 1.3e-119
Identity = 254/725 (35.03%), Postives = 391/725 (53.93%), Query Frame = 0

Query: 34  LFDQSPPPNAASTNRVLLNYLHRDEAIQSLRFFKKQIRWGLDGNADELTLALALKACCGL 93
           LF +   P+  + N ++  +  R     ++ +F    +  +      L   L+       
Sbjct: 283 LFGEMSSPDVVAWNVMISGHGKRGCETVAIEYFFNMRKSSVKSTRSTLGSVLSAIGIVAN 342

Query: 94  PKLGRQIHGFVISSGLVSHITVSNSLMNMYCKSGQLERAFSVFENLHDPDIVSWNTILSG 153
             LG  +H   I  GL S+I V +SL++MY K  ++E A  VFE L + + V WN ++ G
Sbjct: 343 LDLGLVVHAEAIKLGLASNIYVGSSLVSMYSKCEKMEAAAKVFEALEEKNDVFWNAMIRG 402

Query: 154 FQ---KSENALSFAVRMNLNGVKFDPVTYTTTLAFCLDGEDFLFGWQLHTLALKCGFKGD 213
           +    +S   +   + M  +G   D  T+T+ L+ C    D   G Q H++ +K     +
Sbjct: 403 YAHNGESHKVMELFMDMKSSGYNIDDFTFTSLLSTCAASHDLEMGSQFHSIIIKKKLAKN 462

Query: 214 VFVGNALITMYSRWEHLVDARQVFDEMWSRDRVSWSAMITGYAQEGGHGLEAISVFVQMV 273
           +FVGNAL+ MY++   L DARQ+F+ M  RD V+W+ +I  Y Q+     EA  +F +M 
Sbjct: 463 LFVGNALVDMYAKCGALEDARQIFERMCDRDNVTWNTIIGSYVQDENES-EAFDLFKRMN 522

Query: 274 REGVKFDNVAITGALSVCGHERNLELGKQIHCLTVKTGYETHTSVGNVLISMYSKCEIIE 333
             G+  D   +   L  C H   L  GKQ+HCL+VK G +     G+ LI MYSKC II+
Sbjct: 523 LCGIVSDGACLASTLKACTHVHGLYQGKQVHCLSVKCGLDRDLHTGSSLIDMYSKCGIIK 582

Query: 334 DAKAAFKLIKDRNVISWTTMLSLYE----EDAVSLFNKMRLDGVYPNDVTFIGLLHAITI 393
           DA+  F  + + +V+S   +++ Y     E+AV LF +M   GV P+++TF  ++ A   
Sbjct: 583 DARKVFSSLPEWSVVSMNALIAGYSQNNLEEAVVLFQEMLTRGVNPSEITFATIVEACHK 642

Query: 394 RGMVEQGLIVHGLCIKADFVSELS-VGNSLITMYAKFESIQDASRVFMELPY-REIISWN 453
              +  G   HG   K  F SE   +G SL+ MY     + +A  +F EL   + I+ W 
Sbjct: 643 PESLTLGTQFHGQITKRGFSSEGEYLGISLLGMYMNSRGMTEACALFSELSSPKSIVLWT 702

Query: 454 ALISGYAQNALCQEALETFLYAIMES-KPNEYTFGSVLNAISAGEGISIKHGQRCHSHLI 513
            ++SG++QN   +EAL+ +     +   P++ TF +VL   S     S++ G+  HS + 
Sbjct: 703 GMMSGHSQNGFYEEALKFYKEMRHDGVLPDQATFVTVLRVCSVLS--SLREGRAIHSLIF 762

Query: 514 KVGLNFDPIISGALLDMYAKRGSIQESQRVFNETSERSQ-FAWTALISAYAQHGDYDLVM 573
            +  + D + S  L+DMYAK G ++ S +VF+E   RS   +W +LI+ YA++G  +  +
Sbjct: 763 HLAHDLDELTSNTLIDMYAKCGDMKGSSQVFDEMRRRSNVVSWNSLINGYAKNGYAEDAL 822

Query: 574 KLFEEMEKEKIKPDAVIFLSVLTACSRNRMVDMGRRFFDMMIKDHMIEPAGEHYSCMVDM 633
           K+F+ M +  I PD + FL VLTACS    V  GR+ F+MMI  + IE   +H +CMVD+
Sbjct: 823 KIFDSMRQSHIMPDEITFLGVLTACSHAGKVSDGRKIFEMMIGQYGIEARVDHVACMVDL 882

Query: 634 LGRAGRLEDAEEMLARIPGGPGISALQSLLGACRIHGNLEMAERMANALMKEEPMESGSY 693
           LGR G L++A++ +      P      SLLGACRIHG+    E  A  L++ EP  S +Y
Sbjct: 883 LGRWGYLQEADDFIEAQNLKPDARLWSSLLGACRIHGDDIRGEISAEKLIELEPQNSSAY 942

Query: 694 VLMSNLYAQKGDWEKVAEMRKGMRERGVKKEIGFSWVDVGNFGASNLYLHGFSSGDVSHP 748
           VL+SN+YA +G WEK   +RK MR+RGVKK  G+SW+DV          H F++GD SH 
Sbjct: 943 VLLSNIYASQGCWEKANALRKVMRDRGVKKVPGYSWIDV------EQRTHIFAAGDKSHS 998

BLAST of HG10008034 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 425.2 bits (1092), Expect = 1.6e-117
Identity = 246/698 (35.24%), Postives = 395/698 (56.59%), Query Frame = 0

Query: 74  LDGNADELTLALALKACCGLPKLGRQIHGFVISSGLVSHITVSNSLMNMYCKSGQLERAF 133
           +  N  E   AL L+ C  L +L RQI   V  +GL         L++++C+ G ++ A 
Sbjct: 31  IPANVYEHPAALLLERCSSLKEL-RQILPLVFKNGLYQEHFFQTKLVSLFCRYGSVDEAA 90

Query: 134 SVFENLHDPDIVSWNTILSGFQK---SENALSFAVRMNLNGVKFDPVTYTTT--LAFCLD 193
            VFE +     V ++T+L GF K    + AL F VRM  + V  +PV Y  T  L  C D
Sbjct: 91  RVFEPIDSKLNVLYHTMLKGFAKVSDLDKALQFFVRMRYDDV--EPVVYNFTYLLKVCGD 150

Query: 194 GEDFLFGWQLHTLALKCGFKGDVFVGNALITMYSRWEHLVDARQVFDEMWSRDRVSWSAM 253
             +   G ++H L +K GF  D+F    L  MY++   + +AR+VFD M  RD VSW+ +
Sbjct: 151 EAELRVGKEIHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTI 210

Query: 254 ITGYAQEGGHGLEAISVFVQMVREGVKFDNVAITGALSVCGHERNLELGKQIHCLTVKTG 313
           + GY+Q G   + A+ +   M  E +K   + I   L      R + +GK+IH   +++G
Sbjct: 211 VAGYSQNGMARM-ALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSG 270

Query: 314 YETHTSVGNVLISMYSKCEIIEDAKAAFKLIKDRNVISWTTMLSLY-----EEDAVSLFN 373
           +++  ++   L+ MY+KC  +E A+  F  + +RNV+SW +M+  Y      ++A+ +F 
Sbjct: 271 FDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQ 330

Query: 374 KMRLDGVYPNDVTFIGLLHAITIRGMVEQGLIVHGLCIKADFVSELSVGNSLITMYAKFE 433
           KM  +GV P DV+ +G LHA    G +E+G  +H L ++      +SV NSLI+MY K +
Sbjct: 331 KMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCK 390

Query: 434 SIQDASRVFMELPYREIISWNALISGYAQNALCQEALETFLYAIMES-KPNEYTFGSVLN 493
            +  A+ +F +L  R ++SWNA+I G+AQN    +AL  F      + KP+ +T+ SV+ 
Sbjct: 391 EVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVIT 450

Query: 494 AISAGEGISIKHGQR-CHSHLIKVGLNFDPIISGALLDMYAKRGSIQESQRVFNETSERS 553
           AI+    +SI H  +  H  +++  L+ +  ++ AL+DMYAK G+I  ++ +F+  SER 
Sbjct: 451 AIAE---LSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERH 510

Query: 554 QFAWTALISAYAQHGDYDLVMKLFEEMEKEKIKPDAVIFLSVLTACSRNRMVDMGRRFFD 613
              W A+I  Y  HG     ++LFEEM+K  IKP+ V FLSV++ACS + +V+ G + F 
Sbjct: 511 VTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFY 570

Query: 614 MMIKDHMIEPAGEHYSCMVDMLGRAGRLEDAEEMLARIPGGPGISALQSLLGACRIHGNL 673
           MM +++ IE + +HY  MVD+LGRAGRL +A + + ++P  P ++   ++LGAC+IH N+
Sbjct: 571 MMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNV 630

Query: 674 EMAERMANALMKEEPMESGSYVLMSNLYAQKGDWEKVAEMRKGMRERGVKKEIGFSWVDV 733
             AE+ A  L +  P + G +VL++N+Y     WEKV ++R  M  +G++K  G S V++
Sbjct: 631 NFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEI 690

Query: 734 GNFGASNLYLHGFSSGDVSHPQSGEICRMAEYMGAEMK 760
            N       +H F SG  +HP S +I    E +   +K
Sbjct: 691 KN------EVHSFFSGSTAHPDSKKIYAFLEKLICHIK 715

BLAST of HG10008034 vs. ExPASy Swiss-Prot
Match: Q9FWA6 (Pentatricopeptide repeat-containing protein At3g02330, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E90 PE=2 SV=2)

HSP 1 Score: 409.1 bits (1050), Expect = 1.2e-112
Identity = 245/761 (32.19%), Postives = 406/761 (53.35%), Query Frame = 0

Query: 33  NLFDQSPPPNAASTNRVLLNYLHRDEAIQSLRFFKKQIRWGLDGNADELTLALALKACCG 92
           + F+  P  +  S N +L  YL   E+++S+  F    R G++   D  T A+ LK C  
Sbjct: 135 SFFNMMPVRDVVSWNSMLSGYLQNGESLKSIEVFVDMGREGIE--FDGRTFAIILKVCSF 194

Query: 93  L--PKLGRQIHGFVISSGLVSHITVSNSLMNMYCKSGQLERAFSVFENLHDPDIVSWNTI 152
           L    LG QIHG V+  G  + +  +++L++MY K  +   +  VF+ + + + VSW+ I
Sbjct: 195 LEDTSLGMQIHGIVVRVGCDTDVVAASALLDMYAKGKRFVESLRVFQGIPEKNSVSWSAI 254

Query: 153 LSGFQKS---ENALSFAVRMNLNGVKFDPVTYTTTLAFCLDGEDFLFGWQLHTLALKCGF 212
           ++G  ++     AL F   M           Y + L  C    +   G QLH  ALK  F
Sbjct: 255 IAGCVQNNLLSLALKFFKEMQKVNAGVSQSIYASVLRSCAALSELRLGGQLHAHALKSDF 314

Query: 213 KGDVFVGNALITMYSRWEHLVDARQVFDEMWSRDRVSWSAMITGYAQEGGHGLEAISVFV 272
             D  V  A + MY++ +++ DA+ +FD   + +R S++AMITGY+QE  HG +A+ +F 
Sbjct: 315 AADGIVRTATLDMYAKCDNMQDAQILFDNSENLNRQSYNAMITGYSQE-EHGFKALLLFH 374

Query: 273 QMVREGVKFDNVAITGALSVCGHERNLELGKQIHCLTVKTGYETHTSVGNVLISMYSKCE 332
           +++  G+ FD ++++G    C   + L  G QI+ L +K+       V N  I MY KC+
Sbjct: 375 RLMSSGLGFDEISLSGVFRACALVKGLSEGLQIYGLAIKSSLSLDVCVANAAIDMYGKCQ 434

Query: 333 IIEDAKAAFKLIKDRNVISWTTMLSLYEE-----DAVSLFNKMRLDGVYPNDVTFIGLLH 392
            + +A   F  ++ R+ +SW  +++ +E+     + + LF  M    + P++ TF  +L 
Sbjct: 435 ALAEAFRVFDEMRRRDAVSWNAIIAAHEQNGKGYETLFLFVSMLRSRIEPDEFTFGSILK 494

Query: 393 AITIRGMVEQGLIVHGLCIKADFVSELSVGNSLITMYAKFESIQDASRVFMELPYRE--- 452
           A T  G +  G+ +H   +K+   S  SVG SLI MY+K   I++A ++      R    
Sbjct: 495 ACT-GGSLGYGMEIHSSIVKSGMASNSSVGCSLIDMYSKCGMIEEAEKIHSRFFQRANVS 554

Query: 453 -----------------IISWNALISGYAQNALCQEALETFLYAI-MESKPNEYTFGSVL 512
                             +SWN++ISGY      ++A   F   + M   P+++T+ +VL
Sbjct: 555 GTMEELEKMHNKRLQEMCVSWNSIISGYVMKEQSEDAQMLFTRMMEMGITPDKFTYATVL 614

Query: 513 NAIS--AGEGISIKHGQRCHSHLIKVGLNFDPIISGALLDMYAKRGSIQESQRVFNETSE 572
           +  +  A  G+    G++ H+ +IK  L  D  I   L+DMY+K G + +S+ +F ++  
Sbjct: 615 DTCANLASAGL----GKQIHAQVIKKELQSDVYICSTLVDMYSKCGDLHDSRLMFEKSLR 674

Query: 573 RSQFAWTALISAYAQHGDYDLVMKLFEEMEKEKIKPDAVIFLSVLTACSRNRMVDMGRRF 632
           R    W A+I  YA HG  +  ++LFE M  E IKP+ V F+S+L AC+   ++D G  +
Sbjct: 675 RDFVTWNAMICGYAHHGKGEEAIQLFERMILENIKPNHVTFISILRACAHMGLIDKGLEY 734

Query: 633 FDMMIKDHMIEPAGEHYSCMVDMLGRAGRLEDAEEMLARIPGGPGISALQSLLGACRIH- 692
           F MM +D+ ++P   HYS MVD+LG++G+++ A E++  +P        ++LLG C IH 
Sbjct: 735 FYMMKRDYGLDPQLPHYSNMVDILGKSGKVKRALELIREMPFEADDVIWRTLLGVCTIHR 794

Query: 693 GNLEMAERMANALMKEEPMESGSYVLMSNLYAQKGDWEKVAEMRKGMRERGVKKEIGFSW 752
            N+E+AE    AL++ +P +S +Y L+SN+YA  G WEKV+++R+ MR   +KKE G SW
Sbjct: 795 NNVEVAEEATAALLRLDPQDSSAYTLLSNVYADAGMWEKVSDLRRNMRGFKLKKEPGCSW 854

Query: 753 VDVGNFGASNLYLHGFSSGDVSHPQSGEICRMAEYMGAEMK 760
           V++ +       LH F  GD +HP+  EI      + +EMK
Sbjct: 855 VELKD------ELHVFLVGDKAHPRWEEIYEELGLIYSEMK 881

BLAST of HG10008034 vs. ExPASy TrEMBL
Match: A0A6J1EK48 (pentatricopeptide repeat-containing protein At4g32430, mitochondrial OS=Cucurbita moschata OX=3662 GN=LOC111434059 PE=4 SV=1)

HSP 1 Score: 1426.8 bits (3692), Expect = 0.0e+00
Identity = 699/786 (88.93%), Postives = 743/786 (94.53%), Query Frame = 0

Query: 1   MLTRHLNFKSFHVKSNQRLPSFNFFRSFQHDYNLFDQSPPPNAASTNRVLLNYLHRDEAI 60
           MLTRHLNFKSFHVKS QRLPSFNFFRSFQHD+NLFDQSPPPNAASTNRVLLNYLHR+EA 
Sbjct: 1   MLTRHLNFKSFHVKSKQRLPSFNFFRSFQHDHNLFDQSPPPNAASTNRVLLNYLHRNEAF 60

Query: 61  QSLRFFKKQIRWGLDGNADELTLALALKACCGLPKLGRQIHGFVISSGLVSHITVSNSLM 120
           Q+LR FKK IRWGLDGNAD  TLALALKACCG+PKLGRQIHGFVISSGLVS+I+VSNSLM
Sbjct: 61  QALRLFKKHIRWGLDGNADGFTLALALKACCGVPKLGRQIHGFVISSGLVSNISVSNSLM 120

Query: 121 NMYCKSGQLERAFSVFENLHDPDIVSWNTILSGFQKSENALSFAVRMNLNGVKFDPVTYT 180
           NMYCKSGQLERAFSVFENLHDPDIVSWNTILSGFQKSE+ALSFA  MNLNGV+FDPVTYT
Sbjct: 121 NMYCKSGQLERAFSVFENLHDPDIVSWNTILSGFQKSESALSFAAWMNLNGVRFDPVTYT 180

Query: 181 TTLAFCLDGEDFLFGWQLHTLALKCGFKGDVFVGNALITMYSRWEHLVDARQVFDEMWSR 240
           T LAFCLDGEDF+FGWQLHTL LKCGF+ D+FVGNALITMYSRWEHLVDARQVFDEM SR
Sbjct: 181 TALAFCLDGEDFIFGWQLHTLVLKCGFQCDIFVGNALITMYSRWEHLVDARQVFDEMRSR 240

Query: 241 DRVSWSAMITGYAQEGGHGLEAISVFVQMVREGVKFDNVAITGALSVCGHERNLELGKQI 300
           DRVSWSAMITGYAQEG HGLEAI VF+QMVREGVKFDNVAITGA+SVCGHERNLELGKQI
Sbjct: 241 DRVSWSAMITGYAQEGDHGLEAILVFIQMVREGVKFDNVAITGAVSVCGHERNLELGKQI 300

Query: 301 HCLTVKTGYETHTSVGNVLISMYSKCEIIEDAKAAFKLIKDRNVISWTTMLSLYEEDAVS 360
           HCLTVK G+ETHTSVGNVLIS YSKCE+I+DAK+ F++I DRNVISWTTM+SLYEEDAVS
Sbjct: 301 HCLTVKIGFETHTSVGNVLISTYSKCEVIDDAKSVFEIIDDRNVISWTTMISLYEEDAVS 360

Query: 361 LFNKMRLDGVYPNDVTFIGLLHAITIRGMVEQGLIVHGLCIKADFVSELSVGNSLITMYA 420
           LFN+MRLDGVYPNDVTFIGLLHAIT+R MVEQGL+VHGLCIKADFV+EL VGNSLITMYA
Sbjct: 361 LFNEMRLDGVYPNDVTFIGLLHAITMRNMVEQGLMVHGLCIKADFVTELGVGNSLITMYA 420

Query: 421 KFESIQDASRVFMELPYREIISWNALISGYAQNALCQEALETFLYAIMESKPNEYTFGSV 480
           KFES+QDASRVFMELPYREIISWNALISGYAQN LCQEALETFL AIMESKPNEYTFGSV
Sbjct: 421 KFESMQDASRVFMELPYREIISWNALISGYAQNGLCQEALETFLCAIMESKPNEYTFGSV 480

Query: 481 LNAISAGEGISIKHGQRCHSHLIKVGLNFDPIISGALLDMYAKRGSIQESQRVFNETSER 540
           LNAIS GE IS+KHGQRCHSHLIKVGLN  PIISGALLDMYAKRGSIQESQRVFNE S+R
Sbjct: 481 LNAISGGEDISLKHGQRCHSHLIKVGLNSGPIISGALLDMYAKRGSIQESQRVFNEASKR 540

Query: 541 SQFAWTALISAYAQHGDYDLVMKLFEEMEKEKIKPDAVIFLSVLTACSRNRMVDMGRRFF 600
           SQFAWTALIS YAQHGDYD VMKLFEEM+KEKI PDAVIFLSVL ACSRNRMVDMGR+FF
Sbjct: 541 SQFAWTALISGYAQHGDYDTVMKLFEEMKKEKINPDAVIFLSVLAACSRNRMVDMGRQFF 600

Query: 601 DMMIKDHMIEPAGEHYSCMVDMLGRAGRLEDAEEMLARIPGGPGISALQSLLGACRIHGN 660
           +MMI DHMIEP  EHYSCMVDMLGRAG+LE+A+EMLARIPGGPGISALQSLLGACRIHGN
Sbjct: 601 NMMINDHMIEPEAEHYSCMVDMLGRAGQLEEAQEMLARIPGGPGISALQSLLGACRIHGN 660

Query: 661 LEMAERMANALMKEEPMESGSYVLMSNLYAQKGDWEKVAEMRKGMRERGVKKEIGFSWVD 720
           ++MAERMA+ALMK EP+ESGSYVLMSNLYAQKGDWEKVAE+RKGM+E+GVKKEIGFSWVD
Sbjct: 661 VDMAERMADALMKSEPLESGSYVLMSNLYAQKGDWEKVAEVRKGMKEKGVKKEIGFSWVD 720

Query: 721 VGNFGASNLYLHGFSSGDVSHPQSGEICRMAEYMGAEMKFQKDRERESQTQVIDELTVTD 780
           VGNFGAS LYLHGFSSGDVSHPQS EICRMA+YMGAEMKF KDRER+ QT +ID L +TD
Sbjct: 721 VGNFGAS-LYLHGFSSGDVSHPQSEEICRMAQYMGAEMKFLKDRERQRQTHMIDGLPLTD 780

Query: 781 LFVLDG 787
           LFV DG
Sbjct: 781 LFVFDG 785

BLAST of HG10008034 vs. ExPASy TrEMBL
Match: A0A5D3DTC6 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold876G00080 PE=4 SV=1)

HSP 1 Score: 1414.4 bits (3660), Expect = 0.0e+00
Identity = 703/786 (89.44%), Postives = 731/786 (93.00%), Query Frame = 0

Query: 1   MLTRHLNFKSFHVKSNQRLPSFNFFRSFQHDYNLFDQSPPPNAASTNRVLLNYLHRDEAI 60
           MLTRHLNF SFHVKS QR PSF  FRSF HDYNLFDQSP  NAAS NRVLLNYL RD A 
Sbjct: 1   MLTRHLNFNSFHVKSKQRFPSFKIFRSFHHDYNLFDQSPTSNAASFNRVLLNYLSRDGAF 60

Query: 61  QSLRFFKKQIRWGLDGNADELTLALALKACCGLPKLGRQIHGFVISSGLVSHITVSNSLM 120
           QSLRFFKK  RWGLDGN DE TLALALKACCGLPKLGRQIHGFVISSG VSHITVSNSLM
Sbjct: 61  QSLRFFKKNFRWGLDGNTDEFTLALALKACCGLPKLGRQIHGFVISSGFVSHITVSNSLM 120

Query: 121 NMYCKSGQLERAFSVFENLHDPDIVSWNTILSGFQKSENALSFAVRMNLNGVKFDPVTYT 180
           NMYCKSGQLERAFSVFENLHDPDIVSWNTILSGF+KSENALSFA+RMNLNGVKFD VTYT
Sbjct: 121 NMYCKSGQLERAFSVFENLHDPDIVSWNTILSGFEKSENALSFALRMNLNGVKFDSVTYT 180

Query: 181 TTLAFCLDGEDFLFGWQLHTLALKCGFKGDVFVGNALITMYSRWEHLVDARQVFDEMWSR 240
           T L+FCLD EDFLFGWQLHTLALKCG K DVFVGNAL+TMYSR EHLVDAR+VFDEM SR
Sbjct: 181 TALSFCLDIEDFLFGWQLHTLALKCGLKSDVFVGNALVTMYSRCEHLVDARKVFDEMPSR 240

Query: 241 DRVSWSAMITGYAQEGGHGLEAISVFVQMVREGVKFDNVAITGALSVCGHERNLELGKQI 300
           DRVSWSAMITGYAQEG +GL+AI VFVQMVREGVKFDNV ITGALSVCGHERNLELGKQI
Sbjct: 241 DRVSWSAMITGYAQEGDNGLQAILVFVQMVREGVKFDNVPITGALSVCGHERNLELGKQI 300

Query: 301 HCLTVKTGYETHTSVGNVLISMYSKCEIIEDAKAAFKLIKDRNVISWTTMLSLYEEDAVS 360
           HCL VKTG+ETHTSVGNVLIS YSKCEIIEDAKA F+LI DRNVISWTTM+SLYEE AVS
Sbjct: 301 HCLAVKTGHETHTSVGNVLISTYSKCEIIEDAKAVFELINDRNVISWTTMISLYEEGAVS 360

Query: 361 LFNKMRLDGVYPNDVTFIGLLHAITIRGMVEQGLIVHGLCIKADFVSELSVGNSLITMYA 420
           LFNKMRLDGVYPNDVTFIGLLHAITIR MVEQGL+VHGLCIKADFVSEL+VGNSLITMYA
Sbjct: 361 LFNKMRLDGVYPNDVTFIGLLHAITIRNMVEQGLMVHGLCIKADFVSELTVGNSLITMYA 420

Query: 421 KFESIQDASRVFMELPYREIISWNALISGYAQNALCQEALETFLYAIMESKPNEYTFGSV 480
           KFE +QDASRVFMELPYREIISWNALISGYAQNALCQEALE F YAIME KPNEYTFGSV
Sbjct: 421 KFEFMQDASRVFMELPYREIISWNALISGYAQNALCQEALEAFFYAIMEYKPNEYTFGSV 480

Query: 481 LNAISAGEGISIKHGQRCHSHLIKVGLNFDPIISGALLDMYAKRGSIQESQRVFNETSER 540
           LNAISAGE IS+KHGQRCHSHLIKVGLNFDPIISGALLDMYAKRGSIQESQRVFNETS++
Sbjct: 481 LNAISAGEDISLKHGQRCHSHLIKVGLNFDPIISGALLDMYAKRGSIQESQRVFNETSKQ 540

Query: 541 SQFAWTALISAYAQHGDYDLVMKLFEEMEKEKIKPDAVIFLSVLTACSRNRMVDMGRRFF 600
           SQFAWTALIS YAQHGDY+ V+KLFEEMEKEKIKPDAVIFLSVLTACSRNRMV+MGR+ F
Sbjct: 541 SQFAWTALISGYAQHGDYESVIKLFEEMEKEKIKPDAVIFLSVLTACSRNRMVNMGRQLF 600

Query: 601 DMMIKDHMIEPAGEHYSCMVDMLGRAGRLEDAEEMLARIPGGPGISALQSLLGACRIHGN 660
           DMMIKDHMIEP GEHYSCMVDMLGRAGRLE+AEE+LA IPGGPGISALQSLLGACR HGN
Sbjct: 601 DMMIKDHMIEPEGEHYSCMVDMLGRAGRLEEAEEILASIPGGPGISALQSLLGACRTHGN 660

Query: 661 LEMAERMANALMKEEPMESGSYVLMSNLYAQKGDWEKVAEMRKGMRERGVKKEIGFSWVD 720
           +EMAER+AN LMK+EP+ESG YVLMSNLYAQKGDWEKVAEMRK MRERGVKKEIGFSWVD
Sbjct: 661 VEMAERIANDLMKKEPLESGPYVLMSNLYAQKGDWEKVAEMRKEMRERGVKKEIGFSWVD 720

Query: 721 VGNFGASNLYLHGFSSGDVSHPQSGEICRMAEYMGAEMKFQKDRERESQTQVIDELTVTD 780
           VGNFGASNLYLHGFSSGDVSHPQS EI RMA+YMGAEMKF KDR RES   VI ELT+TD
Sbjct: 721 VGNFGASNLYLHGFSSGDVSHPQSEEIFRMAKYMGAEMKFLKDRARESHISVIGELTLTD 780

Query: 781 LFVLDG 787
           LFVLDG
Sbjct: 781 LFVLDG 786

BLAST of HG10008034 vs. ExPASy TrEMBL
Match: A0A6J1KH73 (pentatricopeptide repeat-containing protein At4g32430, mitochondrial OS=Cucurbita maxima OX=3661 GN=LOC111495220 PE=4 SV=1)

HSP 1 Score: 1414.1 bits (3659), Expect = 0.0e+00
Identity = 694/786 (88.30%), Postives = 739/786 (94.02%), Query Frame = 0

Query: 1   MLTRHLNFKSFHVKSNQRLPSFNFFRSFQHDYNLFDQSPPPNAASTNRVLLNYLHRDEAI 60
           MLTRHLNFKSFHVKS QRLPSFNFFRSF+HD+NLFDQSPPPNAASTNRVLL+YLHR+EA 
Sbjct: 1   MLTRHLNFKSFHVKSKQRLPSFNFFRSFRHDHNLFDQSPPPNAASTNRVLLDYLHRNEAF 60

Query: 61  QSLRFFKKQIRWGLDGNADELTLALALKACCGLPKLGRQIHGFVISSGLVSHITVSNSLM 120
           Q+LR FKK IRW LDGNADE TLALALKACCG+PKLGRQIHGF ISSGLV +I+VSNSLM
Sbjct: 61  QALRLFKKHIRWDLDGNADEFTLALALKACCGVPKLGRQIHGFAISSGLVLNISVSNSLM 120

Query: 121 NMYCKSGQLERAFSVFENLHDPDIVSWNTILSGFQKSENALSFAVRMNLNGVKFDPVTYT 180
           NMYCKSGQLERAFSVFENLHDPDIVSWNTILSGFQKSE+ALSFA  MNLNGV+FDPVTYT
Sbjct: 121 NMYCKSGQLERAFSVFENLHDPDIVSWNTILSGFQKSESALSFAAWMNLNGVQFDPVTYT 180

Query: 181 TTLAFCLDGEDFLFGWQLHTLALKCGFKGDVFVGNALITMYSRWEHLVDARQVFDEMWSR 240
           T LAFCLDGEDF+FGWQLHTL LKCGF+ DVFVGNALITMYSRWEHLVDARQVFDEM SR
Sbjct: 181 TALAFCLDGEDFIFGWQLHTLVLKCGFQCDVFVGNALITMYSRWEHLVDARQVFDEMRSR 240

Query: 241 DRVSWSAMITGYAQEGGHGLEAISVFVQMVREGVKFDNVAITGALSVCGHERNLELGKQI 300
           DRVSWSAMITGYAQEG HGLEAI VF+QMVREGVKFDNVAITGA+SVCGHERNLELGKQI
Sbjct: 241 DRVSWSAMITGYAQEGDHGLEAILVFIQMVREGVKFDNVAITGAVSVCGHERNLELGKQI 300

Query: 301 HCLTVKTGYETHTSVGNVLISMYSKCEIIEDAKAAFKLIKDRNVISWTTMLSLYEEDAVS 360
           HCLTVK G+ETHTSVGNVLIS YSKCE+IEDAK+ F++I DRNVISWTTM+SLYEEDAVS
Sbjct: 301 HCLTVKIGFETHTSVGNVLISTYSKCEVIEDAKSVFEIIDDRNVISWTTMISLYEEDAVS 360

Query: 361 LFNKMRLDGVYPNDVTFIGLLHAITIRGMVEQGLIVHGLCIKADFVSELSVGNSLITMYA 420
           LFN+MRLDGVYPNDVTFIGLLHAIT+R MVEQGL+VHGLCIKADFV+EL VGNSLITMYA
Sbjct: 361 LFNEMRLDGVYPNDVTFIGLLHAITMRNMVEQGLMVHGLCIKADFVTELCVGNSLITMYA 420

Query: 421 KFESIQDASRVFMELPYREIISWNALISGYAQNALCQEALETFLYAIMESKPNEYTFGSV 480
           KFES+QDASRVFMELPYREIISWNALISGYAQN LCQEALETFL AIMESKPNEYTFGSV
Sbjct: 421 KFESMQDASRVFMELPYREIISWNALISGYAQNGLCQEALETFLCAIMESKPNEYTFGSV 480

Query: 481 LNAISAGEGISIKHGQRCHSHLIKVGLNFDPIISGALLDMYAKRGSIQESQRVFNETSER 540
           LNAIS GE ISIKHGQRCHSHLIKVGLN  PIISGALLDMYAKRGSIQESQRVFNE S+R
Sbjct: 481 LNAISGGEDISIKHGQRCHSHLIKVGLNSGPIISGALLDMYAKRGSIQESQRVFNEASKR 540

Query: 541 SQFAWTALISAYAQHGDYDLVMKLFEEMEKEKIKPDAVIFLSVLTACSRNRMVDMGRRFF 600
           S FAWTALIS YAQHGDYD VMKLFEEM+KEKI PDAVIFLS+L ACSRNRMVDMGR+FF
Sbjct: 541 SPFAWTALISGYAQHGDYDSVMKLFEEMKKEKINPDAVIFLSILAACSRNRMVDMGRQFF 600

Query: 601 DMMIKDHMIEPAGEHYSCMVDMLGRAGRLEDAEEMLARIPGGPGISALQSLLGACRIHGN 660
           +MMI DHMIEP  EHYSCMVDMLGRAG+LE+A+EMLARIPGGPGISALQSLLGACRIHGN
Sbjct: 601 NMMINDHMIEPEAEHYSCMVDMLGRAGQLEEAQEMLARIPGGPGISALQSLLGACRIHGN 660

Query: 661 LEMAERMANALMKEEPMESGSYVLMSNLYAQKGDWEKVAEMRKGMRERGVKKEIGFSWVD 720
           ++MAERMA+ALMK EP+ESGSYVLMSNLYAQKGDWEKVAE+RKGM+E+GVKKEIGFSWVD
Sbjct: 661 VDMAERMADALMKTEPLESGSYVLMSNLYAQKGDWEKVAEVRKGMKEKGVKKEIGFSWVD 720

Query: 721 VGNFGASNLYLHGFSSGDVSHPQSGEICRMAEYMGAEMKFQKDRERESQTQVIDELTVTD 780
           VGNFGAS LYLHGFSSGDVSHPQS EIC+MA+YMGAEMKF KDRER+ QT +ID L +TD
Sbjct: 721 VGNFGAS-LYLHGFSSGDVSHPQSEEICKMAQYMGAEMKFLKDRERQRQTHMIDGLPLTD 780

Query: 781 LFVLDG 787
           L V DG
Sbjct: 781 LLVFDG 785

BLAST of HG10008034 vs. ExPASy TrEMBL
Match: A0A5A7TTA5 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold216G00830 PE=4 SV=1)

HSP 1 Score: 1409.4 bits (3647), Expect = 0.0e+00
Identity = 700/786 (89.06%), Postives = 730/786 (92.88%), Query Frame = 0

Query: 1   MLTRHLNFKSFHVKSNQRLPSFNFFRSFQHDYNLFDQSPPPNAASTNRVLLNYLHRDEAI 60
           MLTRHLNF SFHVKS QR PSF  FRSF HDYNLFDQSP  NAAS NRVLLNYL RD A 
Sbjct: 1   MLTRHLNFNSFHVKSKQRFPSFKIFRSFHHDYNLFDQSPTSNAASFNRVLLNYLSRDGAF 60

Query: 61  QSLRFFKKQIRWGLDGNADELTLALALKACCGLPKLGRQIHGFVISSGLVSHITVSNSLM 120
           QSLRFFKK  RWGLDGN DE TLALALKACCGLPKLGRQIHGFVISSG  SHITVSNSLM
Sbjct: 61  QSLRFFKKNFRWGLDGNTDEFTLALALKACCGLPKLGRQIHGFVISSGFFSHITVSNSLM 120

Query: 121 NMYCKSGQLERAFSVFENLHDPDIVSWNTILSGFQKSENALSFAVRMNLNGVKFDPVTYT 180
           NMYCKSGQLERAFSVFENLHDPDIVSWNTILSGF+KSENALSFA+RMNLNGVKFD VTYT
Sbjct: 121 NMYCKSGQLERAFSVFENLHDPDIVSWNTILSGFEKSENALSFALRMNLNGVKFDSVTYT 180

Query: 181 TTLAFCLDGEDFLFGWQLHTLALKCGFKGDVFVGNALITMYSRWEHLVDARQVFDEMWSR 240
           T L+FCLD EDFLFGWQLH+LALKCG + DVFVGNAL+TMYSR EHLVDAR+VFDEM SR
Sbjct: 181 TALSFCLDIEDFLFGWQLHSLALKCGLESDVFVGNALVTMYSRCEHLVDARKVFDEMPSR 240

Query: 241 DRVSWSAMITGYAQEGGHGLEAISVFVQMVREGVKFDNVAITGALSVCGHERNLELGKQI 300
           DRVSWSAMITGYAQEG +GL+AI VFVQMVREGVKFDNV ITGALSVCGHERNLELGKQI
Sbjct: 241 DRVSWSAMITGYAQEGDNGLQAILVFVQMVREGVKFDNVPITGALSVCGHERNLELGKQI 300

Query: 301 HCLTVKTGYETHTSVGNVLISMYSKCEIIEDAKAAFKLIKDRNVISWTTMLSLYEEDAVS 360
           HCL VKTG+ETHTSVGNVLIS YSKCEIIEDAKA F+LI DRNVISWTTM+SLYEE AVS
Sbjct: 301 HCLAVKTGHETHTSVGNVLISTYSKCEIIEDAKAVFELINDRNVISWTTMISLYEEGAVS 360

Query: 361 LFNKMRLDGVYPNDVTFIGLLHAITIRGMVEQGLIVHGLCIKADFVSELSVGNSLITMYA 420
           LFNKMRLDGVYPNDVTFIGLLHAITIR MVEQGL+VHGLCIKADFVSEL+VGNSLITMYA
Sbjct: 361 LFNKMRLDGVYPNDVTFIGLLHAITIRNMVEQGLMVHGLCIKADFVSELTVGNSLITMYA 420

Query: 421 KFESIQDASRVFMELPYREIISWNALISGYAQNALCQEALETFLYAIMESKPNEYTFGSV 480
           KFE +QDASRVFMELPYREIISWNALISGYAQNALCQEALE F YAIME KPNEYTFGSV
Sbjct: 421 KFEFMQDASRVFMELPYREIISWNALISGYAQNALCQEALEAFFYAIMEYKPNEYTFGSV 480

Query: 481 LNAISAGEGISIKHGQRCHSHLIKVGLNFDPIISGALLDMYAKRGSIQESQRVFNETSER 540
           LNAISAGE IS+KHGQRCHSHLIKVGLNFDPIISGALLDMYAKRGSIQESQRVFNETS++
Sbjct: 481 LNAISAGEDISLKHGQRCHSHLIKVGLNFDPIISGALLDMYAKRGSIQESQRVFNETSKQ 540

Query: 541 SQFAWTALISAYAQHGDYDLVMKLFEEMEKEKIKPDAVIFLSVLTACSRNRMVDMGRRFF 600
           SQFAWTALIS YAQHGDY+ V+KLFEEMEKEKIKPDAVIFLSVLTACSRNRMV+MGR+ F
Sbjct: 541 SQFAWTALISGYAQHGDYESVIKLFEEMEKEKIKPDAVIFLSVLTACSRNRMVNMGRQLF 600

Query: 601 DMMIKDHMIEPAGEHYSCMVDMLGRAGRLEDAEEMLARIPGGPGISALQSLLGACRIHGN 660
           DMMIKDHMIEP GEHYSCMVDMLGRAGRLE+AEE+LA IPGGPGISALQSLLGACR HGN
Sbjct: 601 DMMIKDHMIEPEGEHYSCMVDMLGRAGRLEEAEEILASIPGGPGISALQSLLGACRTHGN 660

Query: 661 LEMAERMANALMKEEPMESGSYVLMSNLYAQKGDWEKVAEMRKGMRERGVKKEIGFSWVD 720
           +EMAER+AN LMK+EP+ESG YVLMSNLYAQKGDWEKVAEMRK MRERGVKKEIGFSWVD
Sbjct: 661 VEMAERIANDLMKKEPLESGPYVLMSNLYAQKGDWEKVAEMRKEMRERGVKKEIGFSWVD 720

Query: 721 VGNFGASNLYLHGFSSGDVSHPQSGEICRMAEYMGAEMKFQKDRERESQTQVIDELTVTD 780
           VGNFGASNLYLHGFSSGDVSHPQS EI RMA+YMGAEMKF KDR RES   VI ELT+TD
Sbjct: 721 VGNFGASNLYLHGFSSGDVSHPQSEEIFRMAKYMGAEMKFLKDRARESHISVIGELTLTD 780

Query: 781 LFVLDG 787
           LFVLDG
Sbjct: 781 LFVLDG 786

BLAST of HG10008034 vs. ExPASy TrEMBL
Match: A0A1S3BA52 (pentatricopeptide repeat-containing protein At4g32430, mitochondrial OS=Cucumis melo OX=3656 GN=LOC103487459 PE=4 SV=1)

HSP 1 Score: 1409.4 bits (3647), Expect = 0.0e+00
Identity = 700/786 (89.06%), Postives = 730/786 (92.88%), Query Frame = 0

Query: 1   MLTRHLNFKSFHVKSNQRLPSFNFFRSFQHDYNLFDQSPPPNAASTNRVLLNYLHRDEAI 60
           MLTRHLNF SFHVKS QR PSF  FRSF HDYNLFDQSP  NAAS NRVLLNYL RD A 
Sbjct: 1   MLTRHLNFNSFHVKSKQRFPSFKIFRSFHHDYNLFDQSPTSNAASFNRVLLNYLSRDGAF 60

Query: 61  QSLRFFKKQIRWGLDGNADELTLALALKACCGLPKLGRQIHGFVISSGLVSHITVSNSLM 120
           QSLRFFKK  RWGLDGN DE TLALALKACCGLPKLGRQIHGFVISSG  SHITVSNSLM
Sbjct: 61  QSLRFFKKNFRWGLDGNTDEFTLALALKACCGLPKLGRQIHGFVISSGFFSHITVSNSLM 120

Query: 121 NMYCKSGQLERAFSVFENLHDPDIVSWNTILSGFQKSENALSFAVRMNLNGVKFDPVTYT 180
           NMYCKSGQLERAFSVFENLHDPDIVSWNTILSGF+KSENALSFA+RMNLNGVKFD VTYT
Sbjct: 121 NMYCKSGQLERAFSVFENLHDPDIVSWNTILSGFEKSENALSFALRMNLNGVKFDSVTYT 180

Query: 181 TTLAFCLDGEDFLFGWQLHTLALKCGFKGDVFVGNALITMYSRWEHLVDARQVFDEMWSR 240
           T L+FCLD EDFLFGWQLH+LALKCG + DVFVGNAL+TMYSR EHLVDAR+VFDEM SR
Sbjct: 181 TALSFCLDIEDFLFGWQLHSLALKCGLESDVFVGNALVTMYSRCEHLVDARKVFDEMPSR 240

Query: 241 DRVSWSAMITGYAQEGGHGLEAISVFVQMVREGVKFDNVAITGALSVCGHERNLELGKQI 300
           DRVSWSAMITGYAQEG +GL+AI VFVQMVREGVKFDNV ITGALSVCGHERNLELGKQI
Sbjct: 241 DRVSWSAMITGYAQEGDNGLQAILVFVQMVREGVKFDNVPITGALSVCGHERNLELGKQI 300

Query: 301 HCLTVKTGYETHTSVGNVLISMYSKCEIIEDAKAAFKLIKDRNVISWTTMLSLYEEDAVS 360
           HCL VKTG+ETHTSVGNVLIS YSKCEIIEDAKA F+LI DRNVISWTTM+SLYEE AVS
Sbjct: 301 HCLAVKTGHETHTSVGNVLISTYSKCEIIEDAKAVFELINDRNVISWTTMISLYEEGAVS 360

Query: 361 LFNKMRLDGVYPNDVTFIGLLHAITIRGMVEQGLIVHGLCIKADFVSELSVGNSLITMYA 420
           LFNKMRLDGVYPNDVTFIGLLHAITIR MVEQGL+VHGLCIKADFVSEL+VGNSLITMYA
Sbjct: 361 LFNKMRLDGVYPNDVTFIGLLHAITIRNMVEQGLMVHGLCIKADFVSELTVGNSLITMYA 420

Query: 421 KFESIQDASRVFMELPYREIISWNALISGYAQNALCQEALETFLYAIMESKPNEYTFGSV 480
           KFE +QDASRVFMELPYREIISWNALISGYAQNALCQEALE F YAIME KPNEYTFGSV
Sbjct: 421 KFEFMQDASRVFMELPYREIISWNALISGYAQNALCQEALEAFFYAIMEYKPNEYTFGSV 480

Query: 481 LNAISAGEGISIKHGQRCHSHLIKVGLNFDPIISGALLDMYAKRGSIQESQRVFNETSER 540
           LNAISAGE IS+KHGQRCHSHLIKVGLNFDPIISGALLDMYAKRGSIQESQRVFNETS++
Sbjct: 481 LNAISAGEDISLKHGQRCHSHLIKVGLNFDPIISGALLDMYAKRGSIQESQRVFNETSKQ 540

Query: 541 SQFAWTALISAYAQHGDYDLVMKLFEEMEKEKIKPDAVIFLSVLTACSRNRMVDMGRRFF 600
           SQFAWTALIS YAQHGDY+ V+KLFEEMEKEKIKPDAVIFLSVLTACSRNRMV+MGR+ F
Sbjct: 541 SQFAWTALISGYAQHGDYESVIKLFEEMEKEKIKPDAVIFLSVLTACSRNRMVNMGRQLF 600

Query: 601 DMMIKDHMIEPAGEHYSCMVDMLGRAGRLEDAEEMLARIPGGPGISALQSLLGACRIHGN 660
           DMMIKDHMIEP GEHYSCMVDMLGRAGRLE+AEE+LA IPGGPGISALQSLLGACR HGN
Sbjct: 601 DMMIKDHMIEPEGEHYSCMVDMLGRAGRLEEAEEILASIPGGPGISALQSLLGACRTHGN 660

Query: 661 LEMAERMANALMKEEPMESGSYVLMSNLYAQKGDWEKVAEMRKGMRERGVKKEIGFSWVD 720
           +EMAER+AN LMK+EP+ESG YVLMSNLYAQKGDWEKVAEMRK MRERGVKKEIGFSWVD
Sbjct: 661 VEMAERIANDLMKKEPLESGPYVLMSNLYAQKGDWEKVAEMRKEMRERGVKKEIGFSWVD 720

Query: 721 VGNFGASNLYLHGFSSGDVSHPQSGEICRMAEYMGAEMKFQKDRERESQTQVIDELTVTD 780
           VGNFGASNLYLHGFSSGDVSHPQS EI RMA+YMGAEMKF KDR RES   VI ELT+TD
Sbjct: 721 VGNFGASNLYLHGFSSGDVSHPQSEEIFRMAKYMGAEMKFLKDRARESHISVIGELTLTD 780

Query: 781 LFVLDG 787
           LFVLDG
Sbjct: 781 LFVLDG 786

BLAST of HG10008034 vs. TAIR 10
Match: AT4G32430.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 823.5 bits (2126), Expect = 1.4e-238
Identity = 399/740 (53.92%), Postives = 541/740 (73.11%), Query Frame = 0

Query: 24  FFRSFQHDYNLFDQSPPPNA-ASTNRVLLNYLHRDEAIQSLRFFKKQIRWGLDG-NADEL 83
           F+  ++  + LFD S   NA  S N  +   L R+   ++L  FK+ ++ G  G + DE+
Sbjct: 20  FYSPYRIAHKLFDGSSQRNATTSINHSISESLRRNSPARALSIFKENLQLGYFGRHMDEV 79

Query: 84  TLALALKACCGLPKLGRQIHGFVISSGLVSHITVSNSLMNMYCKSGQLERAFSVFENLHD 143
           TL LALKAC G  K G QIHGF  +SG  S + VSN++M MY K+G+ + A  +FENL D
Sbjct: 80  TLCLALKACRGDLKRGCQIHGFSTTSGFTSFVCVSNAVMGMYRKAGRFDNALCIFENLVD 139

Query: 144 PDIVSWNTILSGFQKSENALSFAVRMNLNGVKFDPVTYTTTLAFCLDGEDFLFGWQLHTL 203
           PD+VSWNTILSGF  ++ AL+F VRM   GV FD  TY+T L+FC+  E FL G QL + 
Sbjct: 140 PDVVSWNTILSGFDDNQIALNFVVRMKSAGVVFDAFTYSTALSFCVGSEGFLLGLQLQST 199

Query: 204 ALKCGFKGDVFVGNALITMYSRWEHLVDARQVFDEMWSRDRVSWSAMITGYAQEGGHGLE 263
            +K G + D+ VGN+ ITMYSR      AR+VFDEM  +D +SW+++++G +QEG  G E
Sbjct: 200 VVKTGLESDLVVGNSFITMYSRSGSFRGARRVFDEMSFKDMISWNSLLSGLSQEGTFGFE 259

Query: 264 AISVFVQMVREGVKFDNVAITGALSVCGHERNLELGKQIHCLTVKTGYETHTSVGNVLIS 323
           A+ +F  M+REGV+ D+V+ T  ++ C HE +L+L +QIH L +K GYE+   VGN+L+S
Sbjct: 260 AVVIFRDMMREGVELDHVSFTSVITTCCHETDLKLARQIHGLCIKRGYESLLEVGNILMS 319

Query: 324 MYSKCEIIEDAKAAFKLIKDRNVISWTTMLSLYEEDAVSLFNKMRLDGVYPNDVTFIGLL 383
            YSKC ++E  K+ F  + +RNV+SWTTM+S  ++DAVS+F  MR DGVYPN+VTF+GL+
Sbjct: 320 RYSKCGVLEAVKSVFHQMSERNVVSWTTMISSNKDDAVSIFLNMRFDGVYPNEVTFVGLI 379

Query: 384 HAITIRGMVEQGLIVHGLCIKADFVSELSVGNSLITMYAKFESIQDASRVFMELPYREII 443
           +A+     +++GL +HGLCIK  FVSE SVGNS IT+YAKFE+++DA + F ++ +REII
Sbjct: 380 NAVKCNEQIKEGLKIHGLCIKTGFVSEPSVGNSFITLYAKFEALEDAKKAFEDITFREII 439

Query: 444 SWNALISGYAQNALCQEALETFLYAIMESKPNEYTFGSVLNAISAGEGISIKHGQRCHSH 503
           SWNA+ISG+AQN    EAL+ FL A  E+ PNEYTFGSVLNAI+  E IS+K GQRCH+H
Sbjct: 440 SWNAMISGFAQNGFSHEALKMFLSAAAETMPNEYTFGSVLNAIAFAEDISVKQGQRCHAH 499

Query: 504 LIKVGLNFDPIISGALLDMYAKRGSIQESQRVFNETSERSQFAWTALISAYAQHGDYDLV 563
           L+K+GLN  P++S ALLDMYAKRG+I ES++VFNE S+++QF WT++ISAY+ HGD++ V
Sbjct: 500 LLKLGLNSCPVVSSALLDMYAKRGNIDESEKVFNEMSQKNQFVWTSIISAYSSHGDFETV 559

Query: 564 MKLFEEMEKEKIKPDAVIFLSVLTACSRNRMVDMGRRFFDMMIKDHMIEPAGEHYSCMVD 623
           M LF +M KE + PD V FLSVLTAC+R  MVD G   F+MMI+ + +EP+ EHYSCMVD
Sbjct: 560 MNLFHKMIKENVAPDLVTFLSVLTACNRKGMVDKGYEIFNMMIEVYNLEPSHEHYSCMVD 619

Query: 624 MLGRAGRLEDAEEMLARIPGGPGISALQSLLGACRIHGNLEMAERMANALMKEEPMESGS 683
           MLGRAGRL++AEE+++ +PGGPG S LQS+LG+CR+HGN++M  ++A   M+ +P  SGS
Sbjct: 620 MLGRAGRLKEAEELMSEVPGGPGESMLQSMLGSCRLHGNVKMGAKVAELAMEMKPELSGS 679

Query: 684 YVLMSNLYAQKGDWEKVAEMRKGMRERGVKKEIGFSWVDVGNFGASNLYLHGFSSGDVSH 743
           YV M N+YA+K +W+K AE+RK MR++ V KE GFSW+DVG+   S L + GFSSGD SH
Sbjct: 680 YVQMYNIYAEKEEWDKAAEIRKAMRKKNVSKEAGFSWIDVGDTEGS-LTMQGFSSGDKSH 739

Query: 744 PQSGEICRMAEYMGAEMKFQ 762
           P+S EI RM E +G EM  +
Sbjct: 740 PKSDEIYRMVEIIGLEMNLE 758

BLAST of HG10008034 vs. TAIR 10
Match: AT2G27610.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 437.6 bits (1124), Expect = 2.2e-122
Identity = 241/738 (32.66%), Postives = 408/738 (55.28%), Query Frame = 0

Query: 32  YNLFDQSPPPNAASTNRVLLNYLHRDEAIQSLRFFKKQIRWGLDGNADELTLALALKACC 91
           +NLFD+SP  +  S   +L  +       ++ R F    R G++ +    +  L + A  
Sbjct: 47  HNLFDKSPGRDRESYISLLFGFSRDGRTQEAKRLFLNIHRLGMEMDCSIFSSVLKVSATL 106

Query: 92  GLPKLGRQIHGFVISSGLVSHITVSNSLMNMYCKSGQLERAFSVFENLHDPDIVSWNTIL 151
                GRQ+H   I  G +  ++V  SL++ Y K    +    VF+ + + ++V+W T++
Sbjct: 107 CDELFGRQLHCQCIKFGFLDDVSVGTSLVDTYMKGSNFKDGRKVFDEMKERNVVTWTTLI 166

Query: 152 SGFQK---SENALSFAVRMNLNGVKFDPVTYTTTLAFCLDGEDFLFGWQLHTLALKCGFK 211
           SG+ +   ++  L+  +RM   G + +  T+   L    +      G Q+HT+ +K G  
Sbjct: 167 SGYARNSMNDEVLTLFMRMQNEGTQPNSFTFAAALGVLAEEGVGGRGLQVHTVVVKNGLD 226

Query: 212 GDVFVGNALITMYSRWEHLVDARQVFDEMWSRDRVSWSAMITGYAQEGGHGLEAISVFVQ 271
             + V N+LI +Y +  ++  AR +FD+   +  V+W++MI+GYA   G  LEA+ +F  
Sbjct: 227 KTIPVSNSLINLYLKCGNVRKARILFDKTEVKSVVTWNSMISGYA-ANGLDLEALGMFYS 286

Query: 272 MVREGVKFDNVAITGALSVCGHERNLELGKQIHCLTVKTGYETHTSVGNVLISMYSKCEI 331
           M    V+    +    + +C + + L   +Q+HC  VK G+    ++   L+  YSKC  
Sbjct: 287 MRLNYVRLSESSFASVIKLCANLKELRFTEQLHCSVVKYGFLFDQNIRTALMVAYSKCTA 346

Query: 332 IEDAKAAFKLIK-DRNVISWTTMLSLY-----EEDAVSLFNKMRLDGVYPNDVTFIGLLH 391
           + DA   FK I    NV+SWT M+S +     +E+AV LF++M+  GV PN+ T+  +L 
Sbjct: 347 MLDALRLFKEIGCVGNVVSWTAMISGFLQNDGKEEAVDLFSEMKRKGVRPNEFTYSVILT 406

Query: 392 AITIRGMVEQGLIVHGLCIKADFVSELSVGNSLITMYAKFESIQDASRVFMELPYREIIS 451
           A+ +    E    VH   +K ++    +VG +L+  Y K   +++A++VF  +  ++I++
Sbjct: 407 ALPVISPSE----VHAQVVKTNYERSSTVGTALLDAYVKLGKVEEAAKVFSGIDDKDIVA 466

Query: 452 WNALISGYAQNALCQEALETFLYAIMES-KPNEYTFGSVLNAISAGEGISIKHGQRCHSH 511
           W+A+++GYAQ    + A++ F        KPNE+TF S+LN + A    S+  G++ H  
Sbjct: 467 WSAMLAGYAQTGETEAAIKMFGELTKGGIKPNEFTFSSILN-VCAATNASMGQGKQFHGF 526

Query: 512 LIKVGLNFDPIISGALLDMYAKRGSIQESQRVFNETSERSQFAWTALISAYAQHGDYDLV 571
            IK  L+    +S ALL MYAK+G+I+ ++ VF    E+   +W ++IS YAQHG     
Sbjct: 527 AIKSRLDSSLCVSSALLTMYAKKGNIESAEEVFKRQREKDLVSWNSMISGYAQHGQAMKA 586

Query: 572 MKLFEEMEKEKIKPDAVIFLSVLTACSRNRMVDMGRRFFDMMIKDHMIEPAGEHYSCMVD 631
           + +F+EM+K K+K D V F+ V  AC+   +V+ G ++FD+M++D  I P  EH SCMVD
Sbjct: 587 LDVFKEMKKRKVKMDGVTFIGVFAACTHAGLVEEGEKYFDIMVRDCKIAPTKEHNSCMVD 646

Query: 632 MLGRAGRLEDAEEMLARIPGGPGISALQSLLGACRIHGNLEMAERMANALMKEEPMESGS 691
           +  RAG+LE A +++  +P   G +  +++L ACR+H   E+    A  ++  +P +S +
Sbjct: 647 LYSRAGQLEKAMKVIENMPNPAGSTIWRTILAACRVHKKTELGRLAAEKIIAMKPEDSAA 706

Query: 692 YVLMSNLYAQKGDWEKVAEMRKGMRERGVKKEIGFSWVDVGNFGASNLYLHGFSSGDVSH 751
           YVL+SN+YA+ GDW++ A++RK M ER VKKE G+SW++V N        + F +GD SH
Sbjct: 707 YVLLSNMYAESGDWQERAKVRKLMNERNVKKEPGYSWIEVKN------KTYSFLAGDRSH 766

Query: 752 PQSGEICRMAEYMGAEMK 760
           P   +I    E +   +K
Sbjct: 767 PLKDQIYMKLEDLSTRLK 772

BLAST of HG10008034 vs. TAIR 10
Match: AT3G09040.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 432.2 bits (1110), Expect = 9.1e-121
Identity = 254/725 (35.03%), Postives = 391/725 (53.93%), Query Frame = 0

Query: 34  LFDQSPPPNAASTNRVLLNYLHRDEAIQSLRFFKKQIRWGLDGNADELTLALALKACCGL 93
           LF +   P+  + N ++  +  R     ++ +F    +  +      L   L+       
Sbjct: 283 LFGEMSSPDVVAWNVMISGHGKRGCETVAIEYFFNMRKSSVKSTRSTLGSVLSAIGIVAN 342

Query: 94  PKLGRQIHGFVISSGLVSHITVSNSLMNMYCKSGQLERAFSVFENLHDPDIVSWNTILSG 153
             LG  +H   I  GL S+I V +SL++MY K  ++E A  VFE L + + V WN ++ G
Sbjct: 343 LDLGLVVHAEAIKLGLASNIYVGSSLVSMYSKCEKMEAAAKVFEALEEKNDVFWNAMIRG 402

Query: 154 FQ---KSENALSFAVRMNLNGVKFDPVTYTTTLAFCLDGEDFLFGWQLHTLALKCGFKGD 213
           +    +S   +   + M  +G   D  T+T+ L+ C    D   G Q H++ +K     +
Sbjct: 403 YAHNGESHKVMELFMDMKSSGYNIDDFTFTSLLSTCAASHDLEMGSQFHSIIIKKKLAKN 462

Query: 214 VFVGNALITMYSRWEHLVDARQVFDEMWSRDRVSWSAMITGYAQEGGHGLEAISVFVQMV 273
           +FVGNAL+ MY++   L DARQ+F+ M  RD V+W+ +I  Y Q+     EA  +F +M 
Sbjct: 463 LFVGNALVDMYAKCGALEDARQIFERMCDRDNVTWNTIIGSYVQDENES-EAFDLFKRMN 522

Query: 274 REGVKFDNVAITGALSVCGHERNLELGKQIHCLTVKTGYETHTSVGNVLISMYSKCEIIE 333
             G+  D   +   L  C H   L  GKQ+HCL+VK G +     G+ LI MYSKC II+
Sbjct: 523 LCGIVSDGACLASTLKACTHVHGLYQGKQVHCLSVKCGLDRDLHTGSSLIDMYSKCGIIK 582

Query: 334 DAKAAFKLIKDRNVISWTTMLSLYE----EDAVSLFNKMRLDGVYPNDVTFIGLLHAITI 393
           DA+  F  + + +V+S   +++ Y     E+AV LF +M   GV P+++TF  ++ A   
Sbjct: 583 DARKVFSSLPEWSVVSMNALIAGYSQNNLEEAVVLFQEMLTRGVNPSEITFATIVEACHK 642

Query: 394 RGMVEQGLIVHGLCIKADFVSELS-VGNSLITMYAKFESIQDASRVFMELPY-REIISWN 453
              +  G   HG   K  F SE   +G SL+ MY     + +A  +F EL   + I+ W 
Sbjct: 643 PESLTLGTQFHGQITKRGFSSEGEYLGISLLGMYMNSRGMTEACALFSELSSPKSIVLWT 702

Query: 454 ALISGYAQNALCQEALETFLYAIMES-KPNEYTFGSVLNAISAGEGISIKHGQRCHSHLI 513
            ++SG++QN   +EAL+ +     +   P++ TF +VL   S     S++ G+  HS + 
Sbjct: 703 GMMSGHSQNGFYEEALKFYKEMRHDGVLPDQATFVTVLRVCSVLS--SLREGRAIHSLIF 762

Query: 514 KVGLNFDPIISGALLDMYAKRGSIQESQRVFNETSERSQ-FAWTALISAYAQHGDYDLVM 573
            +  + D + S  L+DMYAK G ++ S +VF+E   RS   +W +LI+ YA++G  +  +
Sbjct: 763 HLAHDLDELTSNTLIDMYAKCGDMKGSSQVFDEMRRRSNVVSWNSLINGYAKNGYAEDAL 822

Query: 574 KLFEEMEKEKIKPDAVIFLSVLTACSRNRMVDMGRRFFDMMIKDHMIEPAGEHYSCMVDM 633
           K+F+ M +  I PD + FL VLTACS    V  GR+ F+MMI  + IE   +H +CMVD+
Sbjct: 823 KIFDSMRQSHIMPDEITFLGVLTACSHAGKVSDGRKIFEMMIGQYGIEARVDHVACMVDL 882

Query: 634 LGRAGRLEDAEEMLARIPGGPGISALQSLLGACRIHGNLEMAERMANALMKEEPMESGSY 693
           LGR G L++A++ +      P      SLLGACRIHG+    E  A  L++ EP  S +Y
Sbjct: 883 LGRWGYLQEADDFIEAQNLKPDARLWSSLLGACRIHGDDIRGEISAEKLIELEPQNSSAY 942

Query: 694 VLMSNLYAQKGDWEKVAEMRKGMRERGVKKEIGFSWVDVGNFGASNLYLHGFSSGDVSHP 748
           VL+SN+YA +G WEK   +RK MR+RGVKK  G+SW+DV          H F++GD SH 
Sbjct: 943 VLLSNIYASQGCWEKANALRKVMRDRGVKKVPGYSWIDV------EQRTHIFAAGDKSHS 998

BLAST of HG10008034 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 425.2 bits (1092), Expect = 1.1e-118
Identity = 246/698 (35.24%), Postives = 395/698 (56.59%), Query Frame = 0

Query: 74  LDGNADELTLALALKACCGLPKLGRQIHGFVISSGLVSHITVSNSLMNMYCKSGQLERAF 133
           +  N  E   AL L+ C  L +L RQI   V  +GL         L++++C+ G ++ A 
Sbjct: 31  IPANVYEHPAALLLERCSSLKEL-RQILPLVFKNGLYQEHFFQTKLVSLFCRYGSVDEAA 90

Query: 134 SVFENLHDPDIVSWNTILSGFQK---SENALSFAVRMNLNGVKFDPVTYTTT--LAFCLD 193
            VFE +     V ++T+L GF K    + AL F VRM  + V  +PV Y  T  L  C D
Sbjct: 91  RVFEPIDSKLNVLYHTMLKGFAKVSDLDKALQFFVRMRYDDV--EPVVYNFTYLLKVCGD 150

Query: 194 GEDFLFGWQLHTLALKCGFKGDVFVGNALITMYSRWEHLVDARQVFDEMWSRDRVSWSAM 253
             +   G ++H L +K GF  D+F    L  MY++   + +AR+VFD M  RD VSW+ +
Sbjct: 151 EAELRVGKEIHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTI 210

Query: 254 ITGYAQEGGHGLEAISVFVQMVREGVKFDNVAITGALSVCGHERNLELGKQIHCLTVKTG 313
           + GY+Q G   + A+ +   M  E +K   + I   L      R + +GK+IH   +++G
Sbjct: 211 VAGYSQNGMARM-ALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSG 270

Query: 314 YETHTSVGNVLISMYSKCEIIEDAKAAFKLIKDRNVISWTTMLSLY-----EEDAVSLFN 373
           +++  ++   L+ MY+KC  +E A+  F  + +RNV+SW +M+  Y      ++A+ +F 
Sbjct: 271 FDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQ 330

Query: 374 KMRLDGVYPNDVTFIGLLHAITIRGMVEQGLIVHGLCIKADFVSELSVGNSLITMYAKFE 433
           KM  +GV P DV+ +G LHA    G +E+G  +H L ++      +SV NSLI+MY K +
Sbjct: 331 KMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCK 390

Query: 434 SIQDASRVFMELPYREIISWNALISGYAQNALCQEALETFLYAIMES-KPNEYTFGSVLN 493
            +  A+ +F +L  R ++SWNA+I G+AQN    +AL  F      + KP+ +T+ SV+ 
Sbjct: 391 EVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVIT 450

Query: 494 AISAGEGISIKHGQR-CHSHLIKVGLNFDPIISGALLDMYAKRGSIQESQRVFNETSERS 553
           AI+    +SI H  +  H  +++  L+ +  ++ AL+DMYAK G+I  ++ +F+  SER 
Sbjct: 451 AIAE---LSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERH 510

Query: 554 QFAWTALISAYAQHGDYDLVMKLFEEMEKEKIKPDAVIFLSVLTACSRNRMVDMGRRFFD 613
              W A+I  Y  HG     ++LFEEM+K  IKP+ V FLSV++ACS + +V+ G + F 
Sbjct: 511 VTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFY 570

Query: 614 MMIKDHMIEPAGEHYSCMVDMLGRAGRLEDAEEMLARIPGGPGISALQSLLGACRIHGNL 673
           MM +++ IE + +HY  MVD+LGRAGRL +A + + ++P  P ++   ++LGAC+IH N+
Sbjct: 571 MMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNV 630

Query: 674 EMAERMANALMKEEPMESGSYVLMSNLYAQKGDWEKVAEMRKGMRERGVKKEIGFSWVDV 733
             AE+ A  L +  P + G +VL++N+Y     WEKV ++R  M  +G++K  G S V++
Sbjct: 631 NFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEI 690

Query: 734 GNFGASNLYLHGFSSGDVSHPQSGEICRMAEYMGAEMK 760
            N       +H F SG  +HP S +I    E +   +K
Sbjct: 691 KN------EVHSFFSGSTAHPDSKKIYAFLEKLICHIK 715

BLAST of HG10008034 vs. TAIR 10
Match: AT3G02330.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 409.1 bits (1050), Expect = 8.2e-114
Identity = 245/761 (32.19%), Postives = 406/761 (53.35%), Query Frame = 0

Query: 33  NLFDQSPPPNAASTNRVLLNYLHRDEAIQSLRFFKKQIRWGLDGNADELTLALALKACCG 92
           + F+  P  +  S N +L  YL   E+++S+  F    R G++   D  T A+ LK C  
Sbjct: 135 SFFNMMPVRDVVSWNSMLSGYLQNGESLKSIEVFVDMGREGIE--FDGRTFAIILKVCSF 194

Query: 93  L--PKLGRQIHGFVISSGLVSHITVSNSLMNMYCKSGQLERAFSVFENLHDPDIVSWNTI 152
           L    LG QIHG V+  G  + +  +++L++MY K  +   +  VF+ + + + VSW+ I
Sbjct: 195 LEDTSLGMQIHGIVVRVGCDTDVVAASALLDMYAKGKRFVESLRVFQGIPEKNSVSWSAI 254

Query: 153 LSGFQKS---ENALSFAVRMNLNGVKFDPVTYTTTLAFCLDGEDFLFGWQLHTLALKCGF 212
           ++G  ++     AL F   M           Y + L  C    +   G QLH  ALK  F
Sbjct: 255 IAGCVQNNLLSLALKFFKEMQKVNAGVSQSIYASVLRSCAALSELRLGGQLHAHALKSDF 314

Query: 213 KGDVFVGNALITMYSRWEHLVDARQVFDEMWSRDRVSWSAMITGYAQEGGHGLEAISVFV 272
             D  V  A + MY++ +++ DA+ +FD   + +R S++AMITGY+QE  HG +A+ +F 
Sbjct: 315 AADGIVRTATLDMYAKCDNMQDAQILFDNSENLNRQSYNAMITGYSQE-EHGFKALLLFH 374

Query: 273 QMVREGVKFDNVAITGALSVCGHERNLELGKQIHCLTVKTGYETHTSVGNVLISMYSKCE 332
           +++  G+ FD ++++G    C   + L  G QI+ L +K+       V N  I MY KC+
Sbjct: 375 RLMSSGLGFDEISLSGVFRACALVKGLSEGLQIYGLAIKSSLSLDVCVANAAIDMYGKCQ 434

Query: 333 IIEDAKAAFKLIKDRNVISWTTMLSLYEE-----DAVSLFNKMRLDGVYPNDVTFIGLLH 392
            + +A   F  ++ R+ +SW  +++ +E+     + + LF  M    + P++ TF  +L 
Sbjct: 435 ALAEAFRVFDEMRRRDAVSWNAIIAAHEQNGKGYETLFLFVSMLRSRIEPDEFTFGSILK 494

Query: 393 AITIRGMVEQGLIVHGLCIKADFVSELSVGNSLITMYAKFESIQDASRVFMELPYRE--- 452
           A T  G +  G+ +H   +K+   S  SVG SLI MY+K   I++A ++      R    
Sbjct: 495 ACT-GGSLGYGMEIHSSIVKSGMASNSSVGCSLIDMYSKCGMIEEAEKIHSRFFQRANVS 554

Query: 453 -----------------IISWNALISGYAQNALCQEALETFLYAI-MESKPNEYTFGSVL 512
                             +SWN++ISGY      ++A   F   + M   P+++T+ +VL
Sbjct: 555 GTMEELEKMHNKRLQEMCVSWNSIISGYVMKEQSEDAQMLFTRMMEMGITPDKFTYATVL 614

Query: 513 NAIS--AGEGISIKHGQRCHSHLIKVGLNFDPIISGALLDMYAKRGSIQESQRVFNETSE 572
           +  +  A  G+    G++ H+ +IK  L  D  I   L+DMY+K G + +S+ +F ++  
Sbjct: 615 DTCANLASAGL----GKQIHAQVIKKELQSDVYICSTLVDMYSKCGDLHDSRLMFEKSLR 674

Query: 573 RSQFAWTALISAYAQHGDYDLVMKLFEEMEKEKIKPDAVIFLSVLTACSRNRMVDMGRRF 632
           R    W A+I  YA HG  +  ++LFE M  E IKP+ V F+S+L AC+   ++D G  +
Sbjct: 675 RDFVTWNAMICGYAHHGKGEEAIQLFERMILENIKPNHVTFISILRACAHMGLIDKGLEY 734

Query: 633 FDMMIKDHMIEPAGEHYSCMVDMLGRAGRLEDAEEMLARIPGGPGISALQSLLGACRIH- 692
           F MM +D+ ++P   HYS MVD+LG++G+++ A E++  +P        ++LLG C IH 
Sbjct: 735 FYMMKRDYGLDPQLPHYSNMVDILGKSGKVKRALELIREMPFEADDVIWRTLLGVCTIHR 794

Query: 693 GNLEMAERMANALMKEEPMESGSYVLMSNLYAQKGDWEKVAEMRKGMRERGVKKEIGFSW 752
            N+E+AE    AL++ +P +S +Y L+SN+YA  G WEKV+++R+ MR   +KKE G SW
Sbjct: 795 NNVEVAEEATAALLRLDPQDSSAYTLLSNVYADAGMWEKVSDLRRNMRGFKLKKEPGCSW 854

Query: 753 VDVGNFGASNLYLHGFSSGDVSHPQSGEICRMAEYMGAEMK 760
           V++ +       LH F  GD +HP+  EI      + +EMK
Sbjct: 855 VELKD------ELHVFLVGDKAHPRWEEIYEELGLIYSEMK 881

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038878735.10.0e+0093.64pentatricopeptide repeat-containing protein At4g32430, mitochondrial isoform X1 ... [more]
XP_022927123.10.0e+0088.93pentatricopeptide repeat-containing protein At4g32430, mitochondrial [Cucurbita ... [more]
XP_038878737.10.0e+0090.33pentatricopeptide repeat-containing protein At4g32430, mitochondrial isoform X2 ... [more]
KAG7021814.10.0e+0088.93Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a... [more]
XP_023531439.10.0e+0088.42pentatricopeptide repeat-containing protein At4g32430, mitochondrial [Cucurbita ... [more]
Match NameE-valueIdentityDescription
Q84MA32.0e-23753.92Pentatricopeptide repeat-containing protein At4g32430, mitochondrial OS=Arabidop... [more]
Q9ZUW33.0e-12132.66Pentatricopeptide repeat-containing protein At2g27610 OS=Arabidopsis thaliana OX... [more]
Q9SS831.3e-11935.03Pentatricopeptide repeat-containing protein At3g09040, mitochondrial OS=Arabidop... [more]
Q3E6Q11.6e-11735.24Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop... [more]
Q9FWA61.2e-11232.19Pentatricopeptide repeat-containing protein At3g02330, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A6J1EK480.0e+0088.93pentatricopeptide repeat-containing protein At4g32430, mitochondrial OS=Cucurbit... [more]
A0A5D3DTC60.0e+0089.44Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A6J1KH730.0e+0088.30pentatricopeptide repeat-containing protein At4g32430, mitochondrial OS=Cucurbit... [more]
A0A5A7TTA50.0e+0089.06Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3BA520.0e+0089.06pentatricopeptide repeat-containing protein At4g32430, mitochondrial OS=Cucumis ... [more]
Match NameE-valueIdentityDescription
AT4G32430.11.4e-23853.92Pentatricopeptide repeat (PPR) superfamily protein [more]
AT2G27610.12.2e-12232.66Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G09040.19.1e-12135.03Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G11290.11.1e-11835.24Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G02330.18.2e-11432.19Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 441..463
e-value: 0.002
score: 18.2
coord: 681..710
e-value: 0.016
score: 15.4
coord: 615..638
e-value: 0.019
score: 15.2
coord: 117..138
e-value: 2.0E-4
score: 21.4
coord: 243..274
e-value: 0.0059
score: 16.8
coord: 215..240
e-value: 0.034
score: 14.4
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 542..588
e-value: 4.2E-9
score: 36.5
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 117..140
e-value: 5.8E-4
score: 17.9
coord: 215..241
e-value: 0.0026
score: 15.8
coord: 544..577
e-value: 3.2E-9
score: 34.4
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 678..712
score: 8.725252
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 112..146
score: 9.119859
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 541..575
score: 12.484979
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 241..276
score: 10.47906
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 294..398
e-value: 6.8E-12
score: 47.4
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 513..730
e-value: 1.9E-38
score: 134.6
coord: 43..187
e-value: 3.1E-17
score: 64.9
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 191..293
e-value: 2.2E-17
score: 65.0
coord: 399..499
e-value: 2.2E-12
score: 48.7
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 503..679
NoneNo IPR availablePANTHERPTHR24015OS07G0578800 PROTEIN-RELATEDcoord: 554..767
coord: 25..467
coord: 469..565
NoneNo IPR availablePANTHERPTHR24015:SF524OS07G0670000 PROTEINcoord: 469..565
NoneNo IPR availablePANTHERPTHR24015:SF524OS07G0670000 PROTEINcoord: 554..767
coord: 25..467

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10008034.1HG10008034.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding