HG10005842 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10005842
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pentatricopeptide repeat-containing protein
Location: Chr07: 6923069 .. 6924949 (+)
RNA-Seq Expression: HG10005842
Synteny: HG10005842
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATTCGTTACAAAATTCTGAAGCTTCCTTCTCTGAAGTTCAAACCAATCACTAATTCCCAATCCTTCGCACCCTTCTCTTCGATTTCTCTCCAAAAAACTCCTCCGGAATCTCCCGTTTCATCTCCAAATTCAACCATAAACGCCGCTTTGCCTCTAACCCAGAATTTTCTCGAACAATCGGCCCGATCCTCTCAGTGGCATTTCATCAAACAGGTAGAATCCACTTTGACTCCCTCGCTTATCTCTGAAACCCTTCTAAATCTTCACCAGTCTCCCCAGATTGTTCTCGAATTGTTAAACCATTTACAGCATCGATTACTTGATGCCGAAACTATTTGCCTTGCCATTGTCATTGTTGCTCGTCTTCCATCTCCCAAGCCCACTTTACAGCTTCTCAAACAGGCTGTTGGGTGTGGAACTACTAATTCAATTAGGGAGATTTTTGAATTGTTAGCTGCTTCTCGTGATAGTTTGGGTTTTAAAACTAGTATTGTTTTTGATGTCTTGATTAAGTCATGTTGTGAAATGAATAGGGTAGATGAGGCTTTTGAGTGTTTTTACATGATGAAGGAAAAGGGTATTTTACCTAAGATTGAGACCTGCAATGATTTGTTGAGTTTGTGTCTGAAGTTGAATAGGATTGAGGCAGCTTGGGTTTTGTATGCTGAGATGTTTAGATTGAGGATTAAATCTAGTGTTTATACATTTAACATTATGATTAATGTTCTATGCAAGGAGGGGAAGTTAAAGAAGGCTAAGGATTTTATTGGGCATATGGAGAGTTTTGGGGTTGAACCGAATGTCGTTACGTATAATACGATTGTTCATGGATATTGTTCGAGAGGGAGAGTTGAAGGGGCTGATGCTATTTTGAATACTATGAAAAGAAAAAAGATTCAACCTGATTCTTACACATATGGCTCTCTTATCAGTGGGATGTCCAAGCAAGGAAGACTTGAAGAAGCATCAAAGATTTTTGAAGAAATGGTACAAATTGGGTTGCTTCCTAGTGCTGTAACTTATAATACTTTGATTGATGGCTTTTGCAATAAGGGTAATTTGGATATGGCCTTTGCTTATAAAGATGAGATGTTGAAGAAGGGCATAATGCCAACTGTGTCAACTTATAACTTGTTGATTCATGCATTGTTTATGGAGCAAAGAATAGATGAAGCTGAAGGGATGATCAAAGAAATTCAGGAGAAAGGAATGTCTCCTGATGCTATTACATATAATATCTTGATAAATGGGTATTGTAGATGTGGAAATGTAAAGAAAGCATTTCGTCTACATGATGAGATGTTGACGAGCGGTATTCAGCCGACGAAAGTGACATACACATCACTTATTCATGTTTTGAGCAAAAAGAGCAGAATGAAGGAGGCAGATGATTTGTTTAAGAAGATCACAAGTAAAGGTGTGTTGCCTGATGTCATTATGTTTAATGCTTTGATTGATGGTCATTGCTCAAACGGTAATGTCGAGCATGCATTTGAGCTTCTAAAAGATATGGATAGGATGAAGGTTCCTCCTGATGAAGTGACTTTCAATACCATAATGCAAGGGCATTGCAGGGAAGGAAAAGTTGAAGAAGCTCGTGAACTTTTCGATGAGATGAAGAGAAGAGGGATTAAGCCCGACCATGTTAGTTTCAATACACTAATAAGTGGTTATAGTAGACGAGGTGACATAAAGGATGCTTTCAGAGTACGAGATGAGATGCTCGATACAGGATTCAATCCGACTCTTCTAACTTATAATGCCCTTATACAAGGGTTATGCAAAAACCAAGAAGGTGATCATGCTGAAGAGCTTCTTAAAGAAATGGTCAGTAAAGGAATTACACCTGATGATAGCACTTATTTCTCAATCAGTTGA

mRNA sequence

ATGATTCGTTACAAAATTCTGAAGCTTCCTTCTCTGAAGTTCAAACCAATCACTAATTCCCAATCCTTCGCACCCTTCTCTTCGATTTCTCTCCAAAAAACTCCTCCGGAATCTCCCGTTTCATCTCCAAATTCAACCATAAACGCCGCTTTGCCTCTAACCCAGAATTTTCTCGAACAATCGGCCCGATCCTCTCAGTGGCATTTCATCAAACAGGTAGAATCCACTTTGACTCCCTCGCTTATCTCTGAAACCCTTCTAAATCTTCACCAGTCTCCCCAGATTGTTCTCGAATTGTTAAACCATTTACAGCATCGATTACTTGATGCCGAAACTATTTGCCTTGCCATTGTCATTGTTGCTCGTCTTCCATCTCCCAAGCCCACTTTACAGCTTCTCAAACAGGCTGTTGGGTGTGGAACTACTAATTCAATTAGGGAGATTTTTGAATTGTTAGCTGCTTCTCGTGATAGTTTGGGTTTTAAAACTAGTATTGTTTTTGATGTCTTGATTAAGTCATGTTGTGAAATGAATAGGGTAGATGAGGCTTTTGAGTGTTTTTACATGATGAAGGAAAAGGGTATTTTACCTAAGATTGAGACCTGCAATGATTTGTTGAGTTTGTGTCTGAAGTTGAATAGGATTGAGGCAGCTTGGGTTTTGTATGCTGAGATGTTTAGATTGAGGATTAAATCTAGTGTTTATACATTTAACATTATGATTAATGTTCTATGCAAGGAGGGGAAGTTAAAGAAGGCTAAGGATTTTATTGGGCATATGGAGAGTTTTGGGGTTGAACCGAATGTCGTTACGTATAATACGATTGTTCATGGATATTGTTCGAGAGGGAGAGTTGAAGGGGCTGATGCTATTTTGAATACTATGAAAAGAAAAAAGATTCAACCTGATTCTTACACATATGGCTCTCTTATCAGTGGGATGTCCAAGCAAGGAAGACTTGAAGAAGCATCAAAGATTTTTGAAGAAATGGTACAAATTGGGTTGCTTCCTAGTGCTGTAACTTATAATACTTTGATTGATGGCTTTTGCAATAAGGGTAATTTGGATATGGCCTTTGCTTATAAAGATGAGATGTTGAAGAAGGGCATAATGCCAACTGTGTCAACTTATAACTTGTTGATTCATGCATTGTTTATGGAGCAAAGAATAGATGAAGCTGAAGGGATGATCAAAGAAATTCAGGAGAAAGGAATGTCTCCTGATGCTATTACATATAATATCTTGATAAATGGGTATTGTAGATGTGGAAATGTAAAGAAAGCATTTCGTCTACATGATGAGATGTTGACGAGCGGTATTCAGCCGACGAAAGTGACATACACATCACTTATTCATGTTTTGAGCAAAAAGAGCAGAATGAAGGAGGCAGATGATTTGTTTAAGAAGATCACAAGTAAAGGTGTGTTGCCTGATGTCATTATGTTTAATGCTTTGATTGATGGTCATTGCTCAAACGGTAATGTCGAGCATGCATTTGAGCTTCTAAAAGATATGGATAGGATGAAGGTTCCTCCTGATGAAGTGACTTTCAATACCATAATGCAAGGGCATTGCAGGGAAGGAAAAGTTGAAGAAGCTCGTGAACTTTTCGATGAGATGAAGAGAAGAGGGATTAAGCCCGACCATGTTAGTTTCAATACACTAATAAGTGGTTATAGTAGACGAGGTGACATAAAGGATGCTTTCAGAGTACGAGATGAGATGCTCGATACAGGATTCAATCCGACTCTTCTAACTTATAATGCCCTTATACAAGGGTTATGCAAAAACCAAGAAGGTGATCATGCTGAAGAGCTTCTTAAAGAAATGGTCAGTAAAGGAATTACACCTGATGATAGCACTTATTTCTCAATCAGTTGA

Coding sequence (CDS)

ATGATTCGTTACAAAATTCTGAAGCTTCCTTCTCTGAAGTTCAAACCAATCACTAATTCCCAATCCTTCGCACCCTTCTCTTCGATTTCTCTCCAAAAAACTCCTCCGGAATCTCCCGTTTCATCTCCAAATTCAACCATAAACGCCGCTTTGCCTCTAACCCAGAATTTTCTCGAACAATCGGCCCGATCCTCTCAGTGGCATTTCATCAAACAGGTAGAATCCACTTTGACTCCCTCGCTTATCTCTGAAACCCTTCTAAATCTTCACCAGTCTCCCCAGATTGTTCTCGAATTGTTAAACCATTTACAGCATCGATTACTTGATGCCGAAACTATTTGCCTTGCCATTGTCATTGTTGCTCGTCTTCCATCTCCCAAGCCCACTTTACAGCTTCTCAAACAGGCTGTTGGGTGTGGAACTACTAATTCAATTAGGGAGATTTTTGAATTGTTAGCTGCTTCTCGTGATAGTTTGGGTTTTAAAACTAGTATTGTTTTTGATGTCTTGATTAAGTCATGTTGTGAAATGAATAGGGTAGATGAGGCTTTTGAGTGTTTTTACATGATGAAGGAAAAGGGTATTTTACCTAAGATTGAGACCTGCAATGATTTGTTGAGTTTGTGTCTGAAGTTGAATAGGATTGAGGCAGCTTGGGTTTTGTATGCTGAGATGTTTAGATTGAGGATTAAATCTAGTGTTTATACATTTAACATTATGATTAATGTTCTATGCAAGGAGGGGAAGTTAAAGAAGGCTAAGGATTTTATTGGGCATATGGAGAGTTTTGGGGTTGAACCGAATGTCGTTACGTATAATACGATTGTTCATGGATATTGTTCGAGAGGGAGAGTTGAAGGGGCTGATGCTATTTTGAATACTATGAAAAGAAAAAAGATTCAACCTGATTCTTACACATATGGCTCTCTTATCAGTGGGATGTCCAAGCAAGGAAGACTTGAAGAAGCATCAAAGATTTTTGAAGAAATGGTACAAATTGGGTTGCTTCCTAGTGCTGTAACTTATAATACTTTGATTGATGGCTTTTGCAATAAGGGTAATTTGGATATGGCCTTTGCTTATAAAGATGAGATGTTGAAGAAGGGCATAATGCCAACTGTGTCAACTTATAACTTGTTGATTCATGCATTGTTTATGGAGCAAAGAATAGATGAAGCTGAAGGGATGATCAAAGAAATTCAGGAGAAAGGAATGTCTCCTGATGCTATTACATATAATATCTTGATAAATGGGTATTGTAGATGTGGAAATGTAAAGAAAGCATTTCGTCTACATGATGAGATGTTGACGAGCGGTATTCAGCCGACGAAAGTGACATACACATCACTTATTCATGTTTTGAGCAAAAAGAGCAGAATGAAGGAGGCAGATGATTTGTTTAAGAAGATCACAAGTAAAGGTGTGTTGCCTGATGTCATTATGTTTAATGCTTTGATTGATGGTCATTGCTCAAACGGTAATGTCGAGCATGCATTTGAGCTTCTAAAAGATATGGATAGGATGAAGGTTCCTCCTGATGAAGTGACTTTCAATACCATAATGCAAGGGCATTGCAGGGAAGGAAAAGTTGAAGAAGCTCGTGAACTTTTCGATGAGATGAAGAGAAGAGGGATTAAGCCCGACCATGTTAGTTTCAATACACTAATAAGTGGTTATAGTAGACGAGGTGACATAAAGGATGCTTTCAGAGTACGAGATGAGATGCTCGATACAGGATTCAATCCGACTCTTCTAACTTATAATGCCCTTATACAAGGGTTATGCAAAAACCAAGAAGGTGATCATGCTGAAGAGCTTCTTAAAGAAATGGTCAGTAAAGGAATTACACCTGATGATAGCACTTATTTCTCAATCAGTTGA

Protein sequence

MIRYKILKLPSLKFKPITNSQSFAPFSSISLQKTPPESPVSSPNSTINAALPLTQNFLEQSARSSQWHFIKQVESTLTPSLISETLLNLHQSPQIVLELLNHLQHRLLDAETICLAIVIVARLPSPKPTLQLLKQAVGCGTTNSIREIFELLAASRDSLGFKTSIVFDVLIKSCCEMNRVDEAFECFYMMKEKGILPKIETCNDLLSLCLKLNRIEAAWVLYAEMFRLRIKSSVYTFNIMINVLCKEGKLKKAKDFIGHMESFGVEPNVVTYNTIVHGYCSRGRVEGADAILNTMKRKKIQPDSYTYGSLISGMSKQGRLEEASKIFEEMVQIGLLPSAVTYNTLIDGFCNKGNLDMAFAYKDEMLKKGIMPTVSTYNLLIHALFMEQRIDEAEGMIKEIQEKGMSPDAITYNILINGYCRCGNVKKAFRLHDEMLTSGIQPTKVTYTSLIHVLSKKSRMKEADDLFKKITSKGVLPDVIMFNALIDGHCSNGNVEHAFELLKDMDRMKVPPDEVTFNTIMQGHCREGKVEEARELFDEMKRRGIKPDHVSFNTLISGYSRRGDIKDAFRVRDEMLDTGFNPTLLTYNALIQGLCKNQEGDHAEELLKEMVSKGITPDDSTYFSIS
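
The coding sequence and the protein sequence listed above can be cross-checked against each other. The following is a minimal Python sketch, assuming the standard nuclear genetic code (NCBI translation table 1) and assuming the Location coordinates are 1-based and inclusive. The names `translate` and `span_length` and the short test fragments are illustrative only and are not part of the database record.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS read in frame 0; stop codons are emitted as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (an assumption here)."""
    return end - start + 1

# First 30 and last 15 nucleotides of the CDS shown above.
cds_start = "ATGATTCGTTACAAAATTCTGAAGCTTCCT"
cds_end = "TTCTCAATCAGTTGA"

print(translate(cds_start))            # MIRYKILKLP
print(translate(cds_end))              # FSIS*
print(span_length(6923069, 6924949))   # 1881
```

Under these assumptions the feature spans 1,881 bp, i.e. 627 codons, which is what a single uninterrupted reading frame ending in a stop codon would look like. That fits the gene, mRNA and CDS sequences being listed identically.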
Homology
BLAST of HG10005842 vs. NCBI nr
Match: XP_038887485.1 (pentatricopeptide repeat-containing protein At2g15630, mitochondrial [Benincasa hispida])

HSP 1 Score: 1122.1 bits (2901), Expect = 0.0e+00
Identity = 567/625 (90.72%), Positives = 589/625 (94.24%), Query Frame = 0

Query: 1   MIRYKILKLPSLKFKPITNSQSFAPFSSISLQKTPPESPVSSPNSTINAALPLTQNFLEQ 60
           M R+KILKL SLKFK  TNSQSFAPFSSISLQ+TP ESP  S NS+ +   P TQN LEQ
Sbjct: 1   MNRFKILKLSSLKFKLTTNSQSFAPFSSISLQETPLESPFPSTNSSTDTTSPQTQNSLEQ 60

Query: 61  SARSSQWHFIKQVESTLTPSLISETLLNLHQSPQIVLELLNHLQHRLLDAETICLAIVIV 120
           SARSSQWHFIKQVESTLTPSLISETL NLH+SPQIVLELLNHLQ +LLDAET+CLAIVIV
Sbjct: 61  SARSSQWHFIKQVESTLTPSLISETLQNLHESPQIVLELLNHLQPQLLDAETLCLAIVIV 120

Query: 121 ARLPSPKPTLQLLKQAVGCGTTNSIREIFELLAASRDSLGFKTSIVFDVLIKSCCEMNRV 180
           A LPSPKPTLQLLKQAVGCGTTNSIREIFE LAASRD LGFK+SIVFD LIKSCCEMNRV
Sbjct: 121 ACLPSPKPTLQLLKQAVGCGTTNSIREIFECLAASRDRLGFKSSIVFDYLIKSCCEMNRV 180

Query: 181 DEAFECFYMMKEKGILPKIETCNDLLSLCLKLNRIEAAWVLYAEMFRLRIKSSVYTFNIM 240
           DEAFECFYMMKEKGILPKIETCNDLLSLCLKLNRIEAAWV YAEMFRLRIKSSV TFNIM
Sbjct: 181 DEAFECFYMMKEKGILPKIETCNDLLSLCLKLNRIEAAWVFYAEMFRLRIKSSVCTFNIM 240

Query: 241 INVLCKEGKLKKAKDFIGHMESFGVEPNVVTYNTIVHGYCSRGRVEGADAILNTMKRKKI 300
           INVLCKEGKLKKAKDFIGHMES GV+PNVVTYNTIVHGYCSRGRVE ADAIL+TMKRKKI
Sbjct: 241 INVLCKEGKLKKAKDFIGHMESLGVKPNVVTYNTIVHGYCSRGRVEEADAILDTMKRKKI 300

Query: 301 QPDSYTYGSLISGMSKQGRLEEASKIFEEMVQIGLLPSAVTYNTLIDGFCNKGNLDMAFA 360
           QPDSYTYGSLI+GM K G+LEEASKIFEEMVQ GLLPSAVTYNTLIDGFCNKGNLD+AF+
Sbjct: 301 QPDSYTYGSLINGMCKLGKLEEASKIFEEMVQNGLLPSAVTYNTLIDGFCNKGNLDIAFS 360

Query: 361 YKDEMLKKGIMPTVSTYNLLIHALFMEQRIDEAEGMIKEIQEKGMSPDAITYNILINGYC 420
           YKDEMLKKGIMPTVSTYNLLIHALFMEQRIDEAEGMIKEIQEKGMSPDAITYNILINGYC
Sbjct: 361 YKDEMLKKGIMPTVSTYNLLIHALFMEQRIDEAEGMIKEIQEKGMSPDAITYNILINGYC 420

Query: 421 RCGNVKKAFRLHDEMLTSGIQPTKVTYTSLIHVLSKKSRMKEADDLFKKITSKGVLPDVI 480
           RCGN KKAF L D+MLTSGI+PTKVTYTSLIHVLSKK+RMKEA+DLFKKITSKGVLPDVI
Sbjct: 421 RCGNAKKAFSLRDQMLTSGIRPTKVTYTSLIHVLSKKNRMKEANDLFKKITSKGVLPDVI 480

Query: 481 MFNALIDGHCSNGNVEHAFELLKDMDRMKVPPDEVTFNTIMQGHCREGKVEEARELFDEM 540
           MFNALIDGHCSN NVE A ELLKDMDRMKVPPDEVTFNTIMQG CREGKVEEAR+LFDEM
Sbjct: 481 MFNALIDGHCSNSNVERALELLKDMDRMKVPPDEVTFNTIMQGRCREGKVEEARKLFDEM 540

Query: 541 KRRGIKPDHVSFNTLISGYSRRGDIKDAFRVRDEMLDTGFNPTLLTYNALIQGLCKNQEG 600
           KRRGIKPDHVSFNTLISGYSR+GDIKDAFRVRDEML+TGFNPTLLTYNALIQGLCKNQE 
Sbjct: 541 KRRGIKPDHVSFNTLISGYSRQGDIKDAFRVRDEMLNTGFNPTLLTYNALIQGLCKNQEA 600

Query: 601 DHAEELLKEMVSKGITPDDSTYFSI 626
           DHAEELLKEMVSKGITPDD TYFS+
Sbjct: 601 DHAEELLKEMVSKGITPDDGTYFSL 625
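
The summary lines of each HSP (bit score, E-value, identity and positives) are regular enough to be read mechanically. A minimal Python sketch follows. The regular expressions and the `parse_hsp` helper are assumptions based only on the layout shown on this page, not an official parser for this report format. The pattern accepts any spelling of the positives label that starts with "Pos".

```python
import re

# Patterns modelled on the two HSP summary lines as they appear on this page.
SCORE_RE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
IDENT_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp(score_line: str, identity_line: str) -> dict:
    """Extract scores and recompute the percentages from the raw counts."""
    s = SCORE_RE.search(score_line)
    m = IDENT_RE.search(identity_line)
    if s is None or m is None:
        raise ValueError("unrecognised HSP summary lines")
    identical, aligned = int(m.group(1)), int(m.group(2))
    positives = int(m.group(4))
    return {
        "bit_score": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(s.group(3)),
        "identity_pct": round(100 * identical / aligned, 2),
        "positives_pct": round(100 * positives / aligned, 2),
    }

hsp = parse_hsp(
    "HSP 1 Score: 1122.1 bits (2901), Expect = 0.0e+00",
    "Identity = 567/625 (90.72%), Positives = 589/625 (94.24%), Query Frame = 0",
)
print(hsp["identity_pct"], hsp["positives_pct"])
```

Recomputing the percentages from the counts (567/625 and 589/625) is a quick consistency check on the reported 90.72% and 94.24%.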

BLAST of HG10005842 vs. NCBI nr
Match: XP_022972339.1 (pentatricopeptide repeat-containing protein At2g15630, mitochondrial [Cucurbita maxima])

HSP 1 Score: 1099.3 bits (2842), Expect = 0.0e+00
Identity = 557/625 (89.12%), Positives = 579/625 (92.64%), Query Frame = 0

Query: 1   MIRYKILKLPSLKFKPITNSQSFAPFSSISLQKTPPESPVSSPNSTINAALPLTQNFLEQ 60
           M RYKILKL SLKF+  TNSQSFA FSSIS QKTPPES   S NST  A   LTQN LE+
Sbjct: 1   MNRYKILKLSSLKFQATTNSQSFALFSSISPQKTPPESQFPSSNSTKKANSTLTQNSLEK 60

Query: 61  SARSSQWHFIKQVESTLTPSLISETLLNLHQSPQIVLELLNHLQHRLLDAETICLAIVIV 120
            ARSSQWHFIKQVESTLTPSLISETL NLH SPQIVLELLNHLQH LLD++T CLAIVIV
Sbjct: 61  FARSSQWHFIKQVESTLTPSLISETLQNLHDSPQIVLELLNHLQHGLLDSQTHCLAIVIV 120

Query: 121 ARLPSPKPTLQLLKQAVGCGTTNSIREIFELLAASRDSLGFKTSIVFDVLIKSCCEMNRV 180
           ARLPSPKPTLQLLKQAVGCG TNS++EIFELLAASRD LG K+SIVFD LIKSCCE+NR 
Sbjct: 121 ARLPSPKPTLQLLKQAVGCG-TNSVKEIFELLAASRDQLGVKSSIVFDYLIKSCCELNRA 180

Query: 181 DEAFECFYMMKEKGILPKIETCNDLLSLCLKLNRIEAAWVLYAEMFRLRIKSSVYTFNIM 240
           DEAFECFYMMKEKG+ PKIETCNDLLSL LKLNR E AWVLYAEMFRLRIKSSVYTFNIM
Sbjct: 181 DEAFECFYMMKEKGVAPKIETCNDLLSLFLKLNRTETAWVLYAEMFRLRIKSSVYTFNIM 240

Query: 241 INVLCKEGKLKKAKDFIGHMESFGVEPNVVTYNTIVHGYCSRGRVEGADAILNTMKRKKI 300
           INVLCKEGKLKKAKDFIGHME  GV+PNVVTYNTIVHGYCSRGRVEGADAIL+TMKRK I
Sbjct: 241 INVLCKEGKLKKAKDFIGHMECLGVKPNVVTYNTIVHGYCSRGRVEGADAILSTMKRKNI 300

Query: 301 QPDSYTYGSLISGMSKQGRLEEASKIFEEMVQIGLLPSAVTYNTLIDGFCNKGNLDMAFA 360
           +PDSYTYGSLISGM KQGRLEEASKIFEEMVQ GLLPSAVTYNTLIDGFCNKGNLDMAF 
Sbjct: 301 RPDSYTYGSLISGMCKQGRLEEASKIFEEMVQNGLLPSAVTYNTLIDGFCNKGNLDMAFG 360

Query: 361 YKDEMLKKGIMPTVSTYNLLIHALFMEQRIDEAEGMIKEIQEKGMSPDAITYNILINGYC 420
           YKDEM+KKGIMPTVSTYNLLIHALFMEQ+ DEAEGMIKEI EKG++PDAITYNILINGYC
Sbjct: 361 YKDEMMKKGIMPTVSTYNLLIHALFMEQKYDEAEGMIKEIHEKGIAPDAITYNILINGYC 420

Query: 421 RCGNVKKAFRLHDEMLTSGIQPTKVTYTSLIHVLSKKSRMKEADDLFKKITSKGVLPDVI 480
           RCGN KKAFRLHDEML SGI+PTKVTYTSLIHVLSKK+RMK+ADDLFKKITSKG+LPDVI
Sbjct: 421 RCGNAKKAFRLHDEMLASGIRPTKVTYTSLIHVLSKKNRMKDADDLFKKITSKGMLPDVI 480

Query: 481 MFNALIDGHCSNGNVEHAFELLKDMDRMKVPPDEVTFNTIMQGHCREGKVEEARELFDEM 540
           MFNALIDGHCSNGNVE AFELLKDMDRMKV PDEVTFNTIMQG CREGKVEEARELFDEM
Sbjct: 481 MFNALIDGHCSNGNVERAFELLKDMDRMKVCPDEVTFNTIMQGRCREGKVEEARELFDEM 540

Query: 541 KRRGIKPDHVSFNTLISGYSRRGDIKDAFRVRDEMLDTGFNPTLLTYNALIQGLCKNQEG 600
           KRRGIKPDHVSFNTLISGYSRRGD+KDAFRVRDEMLD GFNPTLLTYNALIQGL KNQEG
Sbjct: 541 KRRGIKPDHVSFNTLISGYSRRGDVKDAFRVRDEMLDKGFNPTLLTYNALIQGLFKNQEG 600

Query: 601 DHAEELLKEMVSKGITPDDSTYFSI 626
            HAEELLKEMVSKGITPDDSTYFS+
Sbjct: 601 HHAEELLKEMVSKGITPDDSTYFSL 624

BLAST of HG10005842 vs. NCBI nr
Match: XP_023554102.1 (pentatricopeptide repeat-containing protein At2g15630, mitochondrial isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1093.2 bits (2826), Expect = 0.0e+00
Identity = 553/625 (88.48%), Positives = 576/625 (92.16%), Query Frame = 0

Query: 1   MIRYKILKLPSLKFKPITNSQSFAPFSSISLQKTPPESPVSSPNSTINAALPLTQNFLEQ 60
           M RYKILKL SL F+  TNSQSFA FSSIS  KTPP+S   SPNST  A   LTQN LE+
Sbjct: 1   MNRYKILKLSSLNFQATTNSQSFALFSSISPHKTPPDSQFPSPNSTTKANSTLTQNSLEK 60

Query: 61  SARSSQWHFIKQVESTLTPSLISETLLNLHQSPQIVLELLNHLQHRLLDAETICLAIVIV 120
            ARSSQWHFIKQVESTLTPSLIS+TL NLH SPQIVLELLNHLQH LLD+ T CLAIVIV
Sbjct: 61  FARSSQWHFIKQVESTLTPSLISDTLQNLHDSPQIVLELLNHLQHGLLDSRTHCLAIVIV 120

Query: 121 ARLPSPKPTLQLLKQAVGCGTTNSIREIFELLAASRDSLGFKTSIVFDVLIKSCCEMNRV 180
           ARLPSPKPTLQLLKQAVGCG TNS++EIFELLAASRD LG K+SIVFD LIKSCCE+NR 
Sbjct: 121 ARLPSPKPTLQLLKQAVGCG-TNSVKEIFELLAASRDQLGVKSSIVFDYLIKSCCELNRA 180

Query: 181 DEAFECFYMMKEKGILPKIETCNDLLSLCLKLNRIEAAWVLYAEMFRLRIKSSVYTFNIM 240
           DEAFECFYMMKEKG+ PKIETCNDLLSL LKLNR E AWVLYAEMFRLRIKSSVYTFNIM
Sbjct: 181 DEAFECFYMMKEKGVAPKIETCNDLLSLFLKLNRTETAWVLYAEMFRLRIKSSVYTFNIM 240

Query: 241 INVLCKEGKLKKAKDFIGHMESFGVEPNVVTYNTIVHGYCSRGRVEGADAILNTMKRKKI 300
           INVLCKEGKLKKAKDFI HME  GV+PNVVTYNTIVHGYCSRGRVEGADAIL+TMKRK I
Sbjct: 241 INVLCKEGKLKKAKDFIEHMECLGVKPNVVTYNTIVHGYCSRGRVEGADAILSTMKRKNI 300

Query: 301 QPDSYTYGSLISGMSKQGRLEEASKIFEEMVQIGLLPSAVTYNTLIDGFCNKGNLDMAFA 360
           +PDSYTYGSLISGM KQGRLEEASKIFEEMVQ GLLPSAVTYNTLIDGFCNKGNLDMAF 
Sbjct: 301 RPDSYTYGSLISGMCKQGRLEEASKIFEEMVQNGLLPSAVTYNTLIDGFCNKGNLDMAFG 360

Query: 361 YKDEMLKKGIMPTVSTYNLLIHALFMEQRIDEAEGMIKEIQEKGMSPDAITYNILINGYC 420
           YKDEM+KKGIMPTVSTYNLLIHALFMEQ+ DEAEGMIKEI EKG++PDAITYNILINGYC
Sbjct: 361 YKDEMMKKGIMPTVSTYNLLIHALFMEQKYDEAEGMIKEIHEKGIAPDAITYNILINGYC 420

Query: 421 RCGNVKKAFRLHDEMLTSGIQPTKVTYTSLIHVLSKKSRMKEADDLFKKITSKGVLPDVI 480
           RCGN KKAFRLHDEML SGI+PTKVTYTSLIHVLSKK+R+KEADDLFKKITSKG+LPDVI
Sbjct: 421 RCGNAKKAFRLHDEMLASGIRPTKVTYTSLIHVLSKKNRIKEADDLFKKITSKGMLPDVI 480

Query: 481 MFNALIDGHCSNGNVEHAFELLKDMDRMKVPPDEVTFNTIMQGHCREGKVEEARELFDEM 540
           MFNALIDGHCSNGNVE AFELLKDMDRMKV PDEVTFNTIMQG CREGKVEEARELFDEM
Sbjct: 481 MFNALIDGHCSNGNVERAFELLKDMDRMKVRPDEVTFNTIMQGRCREGKVEEARELFDEM 540

Query: 541 KRRGIKPDHVSFNTLISGYSRRGDIKDAFRVRDEMLDTGFNPTLLTYNALIQGLCKNQEG 600
           KRRGIKPDHVSFNTLISGYSRRGD+KDAFRVRDEMLD GFNPTLLTYNALIQGL KNQEG
Sbjct: 541 KRRGIKPDHVSFNTLISGYSRRGDVKDAFRVRDEMLDKGFNPTLLTYNALIQGLFKNQEG 600

Query: 601 DHAEELLKEMVSKGITPDDSTYFSI 626
            HAEELLKEMVSKGITPDDSTYFS+
Sbjct: 601 HHAEELLKEMVSKGITPDDSTYFSL 624

BLAST of HG10005842 vs. NCBI nr
Match: XP_022952975.1 (pentatricopeptide repeat-containing protein At2g15630, mitochondrial [Cucurbita moschata] >KAG7011681.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1088.6 bits (2814), Expect = 0.0e+00
Identity = 551/625 (88.16%), Positives = 574/625 (91.84%), Query Frame = 0

Query: 1   MIRYKILKLPSLKFKPITNSQSFAPFSSISLQKTPPESPVSSPNSTINAALPLTQNFLEQ 60
           M RYKILKL SL F+  TNSQSFA FSSIS  KTPP+S   SPNST  A   LTQN LE+
Sbjct: 1   MNRYKILKLSSLNFQATTNSQSFALFSSISPHKTPPDSQFPSPNSTTKANSTLTQNSLEK 60

Query: 61  SARSSQWHFIKQVESTLTPSLISETLLNLHQSPQIVLELLNHLQHRLLDAETICLAIVIV 120
            ARSSQWHFIKQVESTLTPSLISETL NLH SPQIVLELLNHLQH LLD++T CLAIVIV
Sbjct: 61  FARSSQWHFIKQVESTLTPSLISETLQNLHDSPQIVLELLNHLQHGLLDSQTHCLAIVIV 120

Query: 121 ARLPSPKPTLQLLKQAVGCGTTNSIREIFELLAASRDSLGFKTSIVFDVLIKSCCEMNRV 180
           ARLPSPKPTLQLLKQAVGCG TNS++EIFELLAASRD LG K+SIVFD LIKSCCE+NR 
Sbjct: 121 ARLPSPKPTLQLLKQAVGCG-TNSVKEIFELLAASRDRLGVKSSIVFDYLIKSCCELNRA 180

Query: 181 DEAFECFYMMKEKGILPKIETCNDLLSLCLKLNRIEAAWVLYAEMFRLRIKSSVYTFNIM 240
           DEAFECFYMMKE G+ PKIETCNDLLSL L+LNR E AWVLYAEMFRLRIKSSVYTFNIM
Sbjct: 181 DEAFECFYMMKENGVAPKIETCNDLLSLFLRLNRTETAWVLYAEMFRLRIKSSVYTFNIM 240

Query: 241 INVLCKEGKLKKAKDFIGHMESFGVEPNVVTYNTIVHGYCSRGRVEGADAILNTMKRKKI 300
           INVLCKEGKLKKAKDFI HME  GV+PNVVTYNTIVHGYCSRGRVEGADAIL+ MKRK I
Sbjct: 241 INVLCKEGKLKKAKDFIEHMECLGVKPNVVTYNTIVHGYCSRGRVEGADAILSIMKRKNI 300

Query: 301 QPDSYTYGSLISGMSKQGRLEEASKIFEEMVQIGLLPSAVTYNTLIDGFCNKGNLDMAFA 360
           +PDSYTYGSLISGM KQGRLEEASKIFEEMVQ GLLPSAVTYNTLIDGFCNKGNLDMAF 
Sbjct: 301 RPDSYTYGSLISGMCKQGRLEEASKIFEEMVQNGLLPSAVTYNTLIDGFCNKGNLDMAFG 360

Query: 361 YKDEMLKKGIMPTVSTYNLLIHALFMEQRIDEAEGMIKEIQEKGMSPDAITYNILINGYC 420
           YKDEM+KKGIMPTVSTYNLLIHALFMEQ+ DEAEGMIKEI EKG++PDAITYNILINGYC
Sbjct: 361 YKDEMMKKGIMPTVSTYNLLIHALFMEQKYDEAEGMIKEIHEKGIAPDAITYNILINGYC 420

Query: 421 RCGNVKKAFRLHDEMLTSGIQPTKVTYTSLIHVLSKKSRMKEADDLFKKITSKGVLPDVI 480
           RCGN KKAFRLHDEML SGI+PTKVTYTSLIHVLSKK+RMKEADDLFKKITSKG+LPDVI
Sbjct: 421 RCGNAKKAFRLHDEMLASGIRPTKVTYTSLIHVLSKKNRMKEADDLFKKITSKGMLPDVI 480

Query: 481 MFNALIDGHCSNGNVEHAFELLKDMDRMKVPPDEVTFNTIMQGHCREGKVEEARELFDEM 540
           MFNALIDGHCSNGNVE AFELLKDMDR KV PDEVTFNTIMQG CREGKVEEARELFDEM
Sbjct: 481 MFNALIDGHCSNGNVERAFELLKDMDRTKVRPDEVTFNTIMQGRCREGKVEEARELFDEM 540

Query: 541 KRRGIKPDHVSFNTLISGYSRRGDIKDAFRVRDEMLDTGFNPTLLTYNALIQGLCKNQEG 600
           KRRGIKPDHVSFNTLISGYSRRGD+KDAFRVRDEMLD GFNPTLLTYNALIQGL KNQEG
Sbjct: 541 KRRGIKPDHVSFNTLISGYSRRGDVKDAFRVRDEMLDKGFNPTLLTYNALIQGLFKNQEG 600

Query: 601 DHAEELLKEMVSKGITPDDSTYFSI 626
            HAEELLKEMVSKGITPDDSTYFS+
Sbjct: 601 HHAEELLKEMVSKGITPDDSTYFSL 624

BLAST of HG10005842 vs. NCBI nr
Match: KAG6572005.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1086.6 bits (2809), Expect = 0.0e+00
Identity = 550/625 (88.00%), Positives = 573/625 (91.68%), Query Frame = 0

Query: 1   MIRYKILKLPSLKFKPITNSQSFAPFSSISLQKTPPESPVSSPNSTINAALPLTQNFLEQ 60
           M RYKILKL SL F+  TNSQSFA FSSIS  KTPP+S   SPNST  A   LTQN LE+
Sbjct: 1   MNRYKILKLSSLNFQATTNSQSFALFSSISPHKTPPDSQFPSPNSTTKANSTLTQNSLEK 60

Query: 61  SARSSQWHFIKQVESTLTPSLISETLLNLHQSPQIVLELLNHLQHRLLDAETICLAIVIV 120
            ARSSQWHFIKQVESTLTPSLISETL NLH SPQIVLELLNHLQH LLD++T CLAIVIV
Sbjct: 61  FARSSQWHFIKQVESTLTPSLISETLQNLHDSPQIVLELLNHLQHGLLDSQTHCLAIVIV 120

Query: 121 ARLPSPKPTLQLLKQAVGCGTTNSIREIFELLAASRDSLGFKTSIVFDVLIKSCCEMNRV 180
           ARLPSPKPTLQLLKQAVGCG TNS++EIFELLAASRD LG K+SIVFD LIKSCCE+NR 
Sbjct: 121 ARLPSPKPTLQLLKQAVGCG-TNSVKEIFELLAASRDRLGVKSSIVFDYLIKSCCELNRA 180

Query: 181 DEAFECFYMMKEKGILPKIETCNDLLSLCLKLNRIEAAWVLYAEMFRLRIKSSVYTFNIM 240
           DEAFECFYMMKE G+ PKIETCNDLLSL L+LNR E AWVLYAEMFRLRIKSSVYTFNIM
Sbjct: 181 DEAFECFYMMKENGVAPKIETCNDLLSLFLRLNRTETAWVLYAEMFRLRIKSSVYTFNIM 240

Query: 241 INVLCKEGKLKKAKDFIGHMESFGVEPNVVTYNTIVHGYCSRGRVEGADAILNTMKRKKI 300
           INVLCKEGKLKKAKDFI HME  GV+PNVVTYNTIVHGYCSRGRVEGADAIL+ MKRK I
Sbjct: 241 INVLCKEGKLKKAKDFIEHMECLGVKPNVVTYNTIVHGYCSRGRVEGADAILSIMKRKNI 300

Query: 301 QPDSYTYGSLISGMSKQGRLEEASKIFEEMVQIGLLPSAVTYNTLIDGFCNKGNLDMAFA 360
           +PDSYTYGSLISGM KQGRLEEASKIFEEMVQ GLLPSAVTYNTLIDGFCNKGNLDMAF 
Sbjct: 301 RPDSYTYGSLISGMCKQGRLEEASKIFEEMVQNGLLPSAVTYNTLIDGFCNKGNLDMAFG 360

Query: 361 YKDEMLKKGIMPTVSTYNLLIHALFMEQRIDEAEGMIKEIQEKGMSPDAITYNILINGYC 420
           YKDEM+KKGIMPTVSTYNLLIHALFMEQ+ DEAEGMIKEI EKG++PDAITYNILINGYC
Sbjct: 361 YKDEMMKKGIMPTVSTYNLLIHALFMEQKYDEAEGMIKEIHEKGIAPDAITYNILINGYC 420

Query: 421 RCGNVKKAFRLHDEMLTSGIQPTKVTYTSLIHVLSKKSRMKEADDLFKKITSKGVLPDVI 480
           RCGN KKAFRLHDEML SGI+PTKVTYTSLIHVLSKK+RMKEADDLFKKITSKG+LPDVI
Sbjct: 421 RCGNAKKAFRLHDEMLASGIRPTKVTYTSLIHVLSKKNRMKEADDLFKKITSKGMLPDVI 480

Query: 481 MFNALIDGHCSNGNVEHAFELLKDMDRMKVPPDEVTFNTIMQGHCREGKVEEARELFDEM 540
           MFNALIDGHCSNGNVE AFELLKDMDR KV PDEVTFNTIMQG CREGKVEEARELFDEM
Sbjct: 481 MFNALIDGHCSNGNVERAFELLKDMDRTKVRPDEVTFNTIMQGRCREGKVEEARELFDEM 540

Query: 541 KRRGIKPDHVSFNTLISGYSRRGDIKDAFRVRDEMLDTGFNPTLLTYNALIQGLCKNQEG 600
           KRRGIKPDHVSFNTLISGYSRRGD+KDAFRVRDEMLD GFNPTLLTYNALIQGL KNQEG
Sbjct: 541 KRRGIKPDHVSFNTLISGYSRRGDVKDAFRVRDEMLDKGFNPTLLTYNALIQGLFKNQEG 600

Query: 601 DHAEELLKEMVSKGITPDDSTYFSI 626
            HAEELLKEMVS GITPDDSTYFS+
Sbjct: 601 HHAEELLKEMVSNGITPDDSTYFSL 624

BLAST of HG10005842 vs. ExPASy Swiss-Prot
Match: Q9ZQF1 (Pentatricopeptide repeat-containing protein At2g15630, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At2g15630 PE=3 SV=1)

HSP 1 Score: 709.1 bits (1829), Expect = 4.3e-203
Identity = 341/585 (58.29%), Positives = 450/585 (76.92%), Query Frame = 0

Query: 41  SSPNSTINAALPLTQNFLEQSARSSQWHFIKQVESTLTPSLISETLLNLHQSPQIVLELL 100
           S+P S +    P+T   L +S RSSQWH ++ V   LTPSL+S TLL+L ++P +    +
Sbjct: 36  STPESVLP---PITSEILLESIRSSQWHIVEHVADKLTPSLVSTTLLSLVKTPNLAFNFV 95

Query: 101 NHLQHRLLDAETICLAIVIVARLPSPKPTLQLLKQAVGCGTTNSIREIFELLAASRDSLG 160
           NH+    LD +T CLAI ++++L SPKP  QLLK+ V     NSIR +F+ L  + D L 
Sbjct: 96  NHIDLYRLDFQTQCLAIAVISKLSSPKPVTQLLKEVV-TSRKNSIRNLFDELVLAHDRLE 155

Query: 161 FKTSIVFDVLIKSCCEMNRVDEAFECFYMMKEKGILPKIETCNDLLSLCLKLNRIEAAWV 220
            K++I+FD+L++ CC++  VDEA ECFY+MKEKG  PK ETCN +L+L  +LNRIE AWV
Sbjct: 156 TKSTILFDLLVRCCCQLRMVDEAIECFYLMKEKGFYPKTETCNHILTLLSRLNRIENAWV 215

Query: 221 LYAEMFRLRIKSSVYTFNIMINVLCKEGKLKKAKDFIGHMESFGVEPNVVTYNTIVHGYC 280
            YA+M+R+ IKS+VYTFNIMINVLCKEGKLKKAK F+G ME FG++P +VTYNT+V G+ 
Sbjct: 216 FYADMYRMEIKSNVYTFNIMINVLCKEGKLKKAKGFLGIMEVFGIKPTIVTYNTLVQGFS 275

Query: 281 SRGRVEGADAILNTMKRKKIQPDSYTYGSLISGMSKQGRLEEASKIFEEMVQIGLLPSAV 340
            RGR+EGA  I++ MK K  QPD  TY  ++S M  +GR   AS++  EM +IGL+P +V
Sbjct: 276 LRGRIEGARLIISEMKSKGFQPDMQTYNPILSWMCNEGR---ASEVLREMKEIGLVPDSV 335

Query: 341 TYNTLIDGFCNKGNLDMAFAYKDEMLKKGIMPTVSTYNLLIHALFMEQRIDEAEGMIKEI 400
           +YN LI G  N G+L+MAFAY+DEM+K+G++PT  TYN LIH LFME +I+ AE +I+EI
Sbjct: 336 SYNILIRGCSNNGDLEMAFAYRDEMVKQGMVPTFYTYNTLIHGLFMENKIEAAEILIREI 395

Query: 401 QEKGMSPDAITYNILINGYCRCGNVKKAFRLHDEMLTSGIQPTKVTYTSLIHVLSKKSRM 460
           +EKG+  D++TYNILINGYC+ G+ KKAF LHDEM+T GIQPT+ TYTSLI+VL +K++ 
Sbjct: 396 REKGIVLDSVTYNILINGYCQHGDAKKAFALHDEMMTDGIQPTQFTYTSLIYVLCRKNKT 455

Query: 461 KEADDLFKKITSKGVLPDVIMFNALIDGHCSNGNVEHAFELLKDMDRMKVPPDEVTFNTI 520
           +EAD+LF+K+  KG+ PD++M N L+DGHC+ GN++ AF LLK+MD M + PD+VT+N +
Sbjct: 456 READELFEKVVGKGMKPDLVMMNTLMDGHCAIGNMDRAFSLLKEMDMMSINPDDVTYNCL 515

Query: 521 MQGHCREGKVEEARELFDEMKRRGIKPDHVSFNTLISGYSRRGDIKDAFRVRDEMLDTGF 580
           M+G C EGK EEAREL  EMKRRGIKPDH+S+NTLISGYS++GD K AF VRDEML  GF
Sbjct: 516 MRGLCGEGKFEEARELMGEMKRRGIKPDHISYNTLISGYSKKGDTKHAFMVRDEMLSLGF 575

Query: 581 NPTLLTYNALIQGLCKNQEGDHAEELLKEMVSKGITPDDSTYFSI 626
           NPTLLTYNAL++GL KNQEG+ AEELL+EM S+GI P+DS++ S+
Sbjct: 576 NPTLLTYNALLKGLSKNQEGELAEELLREMKSEGIVPNDSSFCSV 613

BLAST of HG10005842 vs. ExPASy Swiss-Prot
Match: Q9LFC5 (Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana OX=3702 GN=At5g01110 PE=2 SV=1)

HSP 1 Score: 343.6 bits (880), Expect = 4.8e-93
Identity = 196/638 (30.72%), Positives = 340/638 (53.29%), Query Frame = 0

Query: 1   MIRYKILKLPSLKFKPITNSQSFAPFSSISLQKTPPESP-VSSPNSTINAALPLTQNF-- 60
           MI ++I  +PS    P+T    F P  +++   +P   P  SS +S+ +A+  ++ +F  
Sbjct: 1   MIVHRI--IPSRVKDPLTR---FKPLKNLTTSSSPVFEPSSSSSSSSSSASFSVSDSFLV 60

Query: 61  ------LEQSARSSQWHFIKQVESTLTPSLISETLLNLHQSPQIVLELLNHL-------Q 120
                 L+Q   + + H I+     L P  + E L        +    ++ L       +
Sbjct: 61  EKICFSLKQGNNNVRNHLIR-----LNPLAVVEVLYRCRNDLTLGQRFVDQLGFHFPNFK 120

Query: 121 HRLLDAETICLAIVIVARLPSPKPTLQLLKQAVGCGTTNSIREIFELLAASRDSLGFKTS 180
           H  L    +   +V   RL   +  L  + +  G     S  EI   L ++  + G   S
Sbjct: 121 HTSLSLSAMIHILVRSGRLSDAQSCLLRMIRRSGV----SRLEIVNSLDSTFSNCGSNDS 180

Query: 181 IVFDVLIKSCCEMNRVDEAFECFYMMKEKGILPKIETCNDLLSLCLKLNRIEAAWVLYAE 240
            VFD+LI++  +  ++ EA E F +++ KG    I+ CN L+   +++  +E AW +Y E
Sbjct: 181 -VFDLLIRTYVQARKLREAHEAFTLLRSKGFTVSIDACNALIGSLVRIGWVELAWGVYQE 240

Query: 241 MFRLRIKSSVYTFNIMINVLCKEGKLKKAKDFIGHMESFGVEPNVVTYNTIVHGYCSRGR 300
           + R  +  +VYT NIM+N LCK+GK++K   F+  ++  GV P++VTYNT++  Y S+G 
Sbjct: 241 ISRSGVGINVYTLNIMVNALCKDGKMEKVGTFLSQVQEKGVYPDIVTYNTLISAYSSKGL 300

Query: 301 VEGADAILNTMKRKKIQPDSYTYGSLISGMSKQGRLEEASKIFEEMVQIGLLPSAVTYNT 360
           +E A  ++N M  K   P  YTY ++I+G+ K G+ E A ++F EM++ GL P + TY +
Sbjct: 301 MEEAFELMNAMPGKGFSPGVYTYNTVINGLCKHGKYERAKEVFAEMLRSGLSPDSTTYRS 360

Query: 361 LIDGFCNKGNLDMAFAYKDEMLKKGIMPTVSTYNLLIHALFMEQRIDEAEGMIKEIQEKG 420
           L+   C KG++        +M  + ++P +  ++ ++        +D+A      ++E G
Sbjct: 361 LLMEACKKGDVVETEKVFSDMRSRDVVPDLVCFSSMMSLFTRSGNLDKALMYFNSVKEAG 420

Query: 421 MSPDAITYNILINGYCRCGNVKKAFRLHDEMLTSGIQPTKVTYTSLIHVLSKKSRMKEAD 480
           + PD + Y ILI GYCR G +  A  L +EML  G     VTY +++H L K+  + EAD
Sbjct: 421 LIPDNVIYTILIQGYCRKGMISVAMNLRNEMLQQGCAMDVVTYNTILHGLCKRKMLGEAD 480

Query: 481 DLFKKITSKGVLPDVIMFNALIDGHCSNGNVEHAFELLKDMDRMKVPPDEVTFNTIMQGH 540
            LF ++T + + PD      LIDGHC  GN+++A EL + M   ++  D VT+NT++ G 
Sbjct: 481 KLFNEMTERALFPDSYTLTILIDGHCKLGNLQNAMELFQKMKEKRIRLDVVTYNTLLDGF 540

Query: 541 CREGKVEEARELFDEMKRRGIKPDHVSFNTLISGYSRRGDIKDAFRVRDEMLDTGFNPTL 600
            + G ++ A+E++ +M  + I P  +S++ L++    +G + +AFRV DEM+     PT+
Sbjct: 541 GKVGDIDTAKEIWADMVSKEILPTPISYSILVNALCSKGHLAEAFRVWDEMISKNIKPTV 600

Query: 601 LTYNALIQGLCKNQEGDHAEELLKEMVSKGITPDDSTY 623
           +  N++I+G C++      E  L++M+S+G  PD  +Y
Sbjct: 601 MICNSMIKGYCRSGNASDGESFLEKMISEGFVPDCISY 623

BLAST of HG10005842 vs. ExPASy Swiss-Prot
Match: Q9FIX3 (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX=3702 GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 318.9 bits (816), Expect = 1.3e-85
Identity = 176/589 (29.88%), Positives = 325/589 (55.18%), Query Frame = 0

Query: 41  SSPNSTINAALPLTQNFLEQSARSSQWHFIKQVESTLTPSLISETLLNLHQSPQIVLELL 100
           SSP+ ++ A   LT  FL++       + +  + +  TP   S  LL       ++L+ L
Sbjct: 17  SSPSDSLLADKALT--FLKRHP-----YQLHHLSANFTPEAASNLLLKSQNDQALILKFL 76

Query: 101 NHLQ-HRLLDAETICLAIVIVARLPSPKPTLQLLKQAVGCGTTNS--IREIFELLAASRD 160
           N    H+       C+ + I+ +    K T Q+L + V   T +      +F+ L  + D
Sbjct: 77  NWANPHQFFTLRCKCITLHILTKFKLYK-TAQILAEDVAAKTLDDEYASLVFKSLQETYD 136

Query: 161 SLGFKTSIVFDVLIKSCCEMNRVDEAFECFYMMKEKGILPKIETCNDLLSLCLKLNR-IE 220
            L + TS VFD+++KS   ++ +D+A    ++ +  G +P + + N +L   ++  R I 
Sbjct: 137 -LCYSTSSVFDLVVKSYSRLSLIDKALSIVHLAQAHGFMPGVLSYNAVLDATIRSKRNIS 196

Query: 221 AAWVLYAEMFRLRIKSSVYTFNIMINVLCKEGKLKKAKDFIGHMESFGVEPNVVTYNTIV 280
            A  ++ EM   ++  +V+T+NI+I   C  G +  A      ME+ G  PNVVTYNT++
Sbjct: 197 FAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLI 256

Query: 281 HGYCSRGRVEGADAILNTMKRKKIQPDSYTYGSLISGMSKQGRLEEASKIFEEMVQIGLL 340
            GYC   +++    +L +M  K ++P+  +Y  +I+G+ ++GR++E S +  EM + G  
Sbjct: 257 DGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYS 316

Query: 341 PSAVTYNTLIDGFCNKGNLDMAFAYKDEMLKKGIMPTVSTYNLLIHALFMEQRIDEAEGM 400
              VTYNTLI G+C +GN   A     EML+ G+ P+V TY  LIH++     ++ A   
Sbjct: 317 LDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEF 376

Query: 401 IKEIQEKGMSPDAITYNILINGYCRCGNVKKAFRLHDEMLTSGIQPTKVTYTSLIHVLSK 460
           + +++ +G+ P+  TY  L++G+ + G + +A+R+  EM  +G  P+ VTY +LI+    
Sbjct: 377 LDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALINGHCV 436

Query: 461 KSRMKEADDLFKKITSKGVLPDVIMFNALIDGHCSNGNVEHAFELLKDMDRMKVPPDEVT 520
             +M++A  + + +  KG+ PDV+ ++ ++ G C + +V+ A  + ++M    + PD +T
Sbjct: 437 TGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMVEKGIKPDTIT 496

Query: 521 FNTIMQGHCREGKVEEARELFDEMKRRGIKPDHVSFNTLISGYSRRGDIKDAFRVRDEML 580
           +++++QG C + + +EA +L++EM R G+ PD  ++  LI+ Y   GD++ A ++ +EM+
Sbjct: 497 YSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLEKALQLHNEMV 556

Query: 581 DTGFNPTLLTYNALIQGLCKNQEGDHAEELLKEMVSKGITPDDSTYFSI 626
           + G  P ++TY+ LI GL K      A+ LL ++  +   P D TY ++
Sbjct: 557 EKGVLPDVVTYSVLINGLNKQSRTREAKRLLLKLFYEESVPSDVTYHTL 596

BLAST of HG10005842 vs. ExPASy Swiss-Prot
Match: Q0WVK7 (Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g05670 PE=2 SV=1)

HSP 1 Score: 317.8 bits (813), Expect = 2.8e-85
Identity = 169/542 (31.18%), Positives = 306/542 (56.46%), Query Frame = 0

Query: 86  LLNLHQSPQIVLELLNHLQHRL-LDAETICLAIVIVARLPSPKPTLQLLK---QAVGCGT 145
           L+ +    ++VL+  +  + R   + E++C+ I +       K    L+    +      
Sbjct: 94  LMKIKCDYRLVLDFFDWARSRRDSNLESLCIVIHLAVASKDLKVAQSLISSFWERPKLNV 153

Query: 146 TNSIREIFELLAASRDSLGFKTSIVFDVLIKSCCEMNRVDEAFECFYMMKEKGILPKIET 205
           T+S  + F+LL  +    G     VFDV  +   +   + EA   F  M   G++  +++
Sbjct: 154 TDSFVQFFDLLVYTYKDWG-SDPRVFDVFFQVLVDFGLLREARRVFEKMLNYGLVLSVDS 213

Query: 206 CNDLLS-LCLKLNRIEAAWVLYAEMFRLRIKSSVYTFNIMINVLCKEGKLKKAKDFIGHM 265
           CN  L+ L     +   A +++ E   + +  +V ++NI+I+ +C+ G++K+A   +  M
Sbjct: 214 CNVYLTRLSKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLLLLM 273

Query: 266 ESFGVEPNVVTYNTIVHGYCSRGRVEGADAILNTMKRKKIQPDSYTYGSLISGMSKQGRL 325
           E  G  P+V++Y+T+V+GYC  G ++    ++  MKRK ++P+SY YGS+I  + +  +L
Sbjct: 274 ELKGYTPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGLLCRICKL 333

Query: 326 EEASKIFEEMVQIGLLPSAVTYNTLIDGFCNKGNLDMAFAYKDEMLKKGIMPTVSTYNLL 385
            EA + F EM++ G+LP  V Y TLIDGFC +G++  A  +  EM  + I P V TY  +
Sbjct: 334 AEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVLTYTAI 393

Query: 386 IHALFMEQRIDEAEGMIKEIQEKGMSPDAITYNILINGYCRCGNVKKAFRLHDEMLTSGI 445
           I        + EA  +  E+  KG+ PD++T+  LINGYC+ G++K AFR+H+ M+ +G 
Sbjct: 394 ISGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHNHMIQAGC 453

Query: 446 QPTKVTYTSLIHVLSKKSRMKEADDLFKKITSKGVLPDVIMFNALIDGHCSNGNVEHAFE 505
            P  VTYT+LI  L K+  +  A++L  ++   G+ P++  +N++++G C +GN+E A +
Sbjct: 454 SPNVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNIEEAVK 513

Query: 506 LLKDMDRMKVPPDEVTFNTIMQGHCREGKVEEARELFDEMKRRGIKPDHVSFNTLISGYS 565
           L+ + +   +  D VT+ T+M  +C+ G++++A+E+  EM  +G++P  V+FN L++G+ 
Sbjct: 514 LVGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVLMNGFC 573

Query: 566 RRGDIKDAFRVRDEMLDTGFNPTLLTYNALIQGLCKNQEGDHAEELLKEMVSKGITPDDS 623
             G ++D  ++ + ML  G  P   T+N+L++  C       A  + K+M S+G+ PD  
Sbjct: 574 LHGMLEDGEKLLNWMLAKGIAPNATTFNSLVKQYCIRNNLKAATAIYKDMCSRGVGPDGK 633

BLAST of HG10005842 vs. ExPASy Swiss-Prot
Match: O04504 (Pentatricopeptide repeat-containing protein At1g09820 OS=Arabidopsis thaliana OX=3702 GN=At1g09820 PE=2 SV=1)

HSP 1 Score: 312.8 bits (800), Expect = 9.0e-84
Identity = 167/492 (33.94%), Positives = 272/492 (55.28%), Query Frame = 0

Query: 140 GTTNSIREIFELLAASRDSLGFKTSIVFDVLIKSCCEMNRVDEAFECFYMMKEKGILPKI 199
           G+ + +  IF  ++   +      SI+ D+L+ +    +R +  FE F      G     
Sbjct: 131 GSDHQVHSIFHAISMCDNVC--VNSIIADMLVLAYANNSRFELGFEAFKRSGYYGYKLSA 190

Query: 200 ETCNDLLSLCLKLNRIEAAWVLYAEMFRLRIKSSVYTFNIMINVLCKEGKLKKAKDFIGH 259
            +C  L+   LK NR      +Y EM R +I+ +V+TFN++IN LCK GK+ KA+D +  
Sbjct: 191 LSCKPLMIALLKENRSADVEYVYKEMIRRKIQPNVFTFNVVINALCKTGKMNKARDVMED 250

Query: 260 MESFGVEPNVVTYNTIVHGYC---SRGRVEGADAILNTMKRKKIQPDSYTYGSLISGMSK 319
           M+ +G  PNVV+YNT++ GYC     G++  ADA+L  M    + P+  T+  LI G  K
Sbjct: 251 MKVYGCSPNVVSYNTLIDGYCKLGGNGKMYKADAVLKEMVENDVSPNLTTFNILIDGFWK 310

Query: 320 QGRLEEASKIFEEMVQIGLLPSAVTYNTLIDGFCNKGNLDMAFAYKDEMLKKGIMPTVST 379
              L  + K+F+EM+   + P+ ++YN+LI+G CN G +  A + +D+M+  G+ P + T
Sbjct: 311 DDNLPGSMKVFKEMLDQDVKPNVISYNSLINGLCNGGKISEAISMRDKMVSAGVQPNLIT 370

Query: 380 YNLLIHALFMEQRIDEAEGMIKEIQEKGMSPDAITYNILINGYCRCGNVKKAFRLHDEML 439
           YN LI+       + EA  M   ++ +G  P    YN+LI+ YC+ G +   F L +EM 
Sbjct: 371 YNALINGFCKNDMLKEALDMFGSVKGQGAVPTTRMYNMLIDAYCKLGKIDDGFALKEEME 430

Query: 440 TSGIQPTKVTYTSLIHVLSKKSRMKEADDLFKKITSKGVLPDVIMFNALIDGHCSNGNVE 499
             GI P   TY  LI  L +   ++ A  LF ++TSKG LPD++ F+ L++G+C  G   
Sbjct: 431 REGIVPDVGTYNCLIAGLCRNGNIEAAKKLFDQLTSKG-LPDLVTFHILMEGYCRKGESR 490

Query: 500 HAFELLKDMDRMKVPPDEVTFNTIMQGHCREGKVEEARELFDEM-KRRGIKPDHVSFNTL 559
            A  LLK+M +M + P  +T+N +M+G+C+EG ++ A  +  +M K R ++ +  S+N L
Sbjct: 491 KAAMLLKEMSKMGLKPRHLTYNIVMKGYCKEGNLKAATNMRTQMEKERRLRMNVASYNVL 550

Query: 560 ISGYSRRGDIKDAFRVRDEMLDTGFNPTLLTYNALIQGLCKNQEGDHAEELLKEMVSKGI 619
           + GYS++G ++DA  + +EML+ G  P  +TY                E + +EMV +G 
Sbjct: 551 LQGYSQKGKLEDANMLLNEMLEKGLVPNRITY----------------EIVKEEMVDQGF 603

Query: 620 TPD-DSTYFSIS 627
            PD +   F++S
Sbjct: 611 VPDIEGHLFNVS 603
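The percentages printed in each HSP header follow directly from the residue counts and the alignment length. As a minimal sketch, assuming the header layout used on this page (the names `HSP_STATS`, `parse_hsp_stats` and `percent` are illustrative and not part of any BLAST tooling), one header line can be parsed and its percentages recomputed like this:

```python
import re

# Pattern for the statistics line printed under each HSP on this page.
# "Posi?tives" also accepts the misspelling "Postives" seen in some
# exports of this report.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts, alignment length and reported percentages from one line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, length, ident_pct, pos, pos_length, pos_pct = match.groups()
    if length != pos_length:
        raise ValueError("identity and positives use different alignment lengths")
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": int(length),
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


def percent(count, length):
    """Percentage rounded to two decimals, as displayed on the page."""
    return round(100.0 * count / length, 2)


# Values copied from the O04504 (At1g09820) hit above.
stats = parse_hsp_stats(
    "Identity = 167/492 (33.94%), Positives = 272/492 (55.28%), Query Frame = 0"
)
print(percent(stats["identities"], stats["length"]))
print(percent(stats["positives"], stats["length"]))
```

The denominator is the alignment length including gap columns, not the length of the query or the subject protein, which is why the same query gives different denominators for different hits.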

BLAST of HG10005842 vs. ExPASy TrEMBL
Match: A0A6J1IB79 (pentatricopeptide repeat-containing protein At2g15630, mitochondrial OS=Cucurbita maxima OX=3661 GN=LOC111470920 PE=4 SV=1)

HSP 1 Score: 1099.3 bits (2842), Expect = 0.0e+00
Identity = 557/625 (89.12%), Positives = 579/625 (92.64%), Query Frame = 0

Query: 1   MIRYKILKLPSLKFKPITNSQSFAPFSSISLQKTPPESPVSSPNSTINAALPLTQNFLEQ 60
           M RYKILKL SLKF+  TNSQSFA FSSIS QKTPPES   S NST  A   LTQN LE+
Sbjct: 1   MNRYKILKLSSLKFQATTNSQSFALFSSISPQKTPPESQFPSSNSTKKANSTLTQNSLEK 60

Query: 61  SARSSQWHFIKQVESTLTPSLISETLLNLHQSPQIVLELLNHLQHRLLDAETICLAIVIV 120
            ARSSQWHFIKQVESTLTPSLISETL NLH SPQIVLELLNHLQH LLD++T CLAIVIV
Sbjct: 61  FARSSQWHFIKQVESTLTPSLISETLQNLHDSPQIVLELLNHLQHGLLDSQTHCLAIVIV 120

Query: 121 ARLPSPKPTLQLLKQAVGCGTTNSIREIFELLAASRDSLGFKTSIVFDVLIKSCCEMNRV 180
           ARLPSPKPTLQLLKQAVGCG TNS++EIFELLAASRD LG K+SIVFD LIKSCCE+NR 
Sbjct: 121 ARLPSPKPTLQLLKQAVGCG-TNSVKEIFELLAASRDQLGVKSSIVFDYLIKSCCELNRA 180

Query: 181 DEAFECFYMMKEKGILPKIETCNDLLSLCLKLNRIEAAWVLYAEMFRLRIKSSVYTFNIM 240
           DEAFECFYMMKEKG+ PKIETCNDLLSL LKLNR E AWVLYAEMFRLRIKSSVYTFNIM
Sbjct: 181 DEAFECFYMMKEKGVAPKIETCNDLLSLFLKLNRTETAWVLYAEMFRLRIKSSVYTFNIM 240

Query: 241 INVLCKEGKLKKAKDFIGHMESFGVEPNVVTYNTIVHGYCSRGRVEGADAILNTMKRKKI 300
           INVLCKEGKLKKAKDFIGHME  GV+PNVVTYNTIVHGYCSRGRVEGADAIL+TMKRK I
Sbjct: 241 INVLCKEGKLKKAKDFIGHMECLGVKPNVVTYNTIVHGYCSRGRVEGADAILSTMKRKNI 300

Query: 301 QPDSYTYGSLISGMSKQGRLEEASKIFEEMVQIGLLPSAVTYNTLIDGFCNKGNLDMAFA 360
           +PDSYTYGSLISGM KQGRLEEASKIFEEMVQ GLLPSAVTYNTLIDGFCNKGNLDMAF 
Sbjct: 301 RPDSYTYGSLISGMCKQGRLEEASKIFEEMVQNGLLPSAVTYNTLIDGFCNKGNLDMAFG 360

Query: 361 YKDEMLKKGIMPTVSTYNLLIHALFMEQRIDEAEGMIKEIQEKGMSPDAITYNILINGYC 420
           YKDEM+KKGIMPTVSTYNLLIHALFMEQ+ DEAEGMIKEI EKG++PDAITYNILINGYC
Sbjct: 361 YKDEMMKKGIMPTVSTYNLLIHALFMEQKYDEAEGMIKEIHEKGIAPDAITYNILINGYC 420

Query: 421 RCGNVKKAFRLHDEMLTSGIQPTKVTYTSLIHVLSKKSRMKEADDLFKKITSKGVLPDVI 480
           RCGN KKAFRLHDEML SGI+PTKVTYTSLIHVLSKK+RMK+ADDLFKKITSKG+LPDVI
Sbjct: 421 RCGNAKKAFRLHDEMLASGIRPTKVTYTSLIHVLSKKNRMKDADDLFKKITSKGMLPDVI 480

Query: 481 MFNALIDGHCSNGNVEHAFELLKDMDRMKVPPDEVTFNTIMQGHCREGKVEEARELFDEM 540
           MFNALIDGHCSNGNVE AFELLKDMDRMKV PDEVTFNTIMQG CREGKVEEARELFDEM
Sbjct: 481 MFNALIDGHCSNGNVERAFELLKDMDRMKVCPDEVTFNTIMQGRCREGKVEEARELFDEM 540

Query: 541 KRRGIKPDHVSFNTLISGYSRRGDIKDAFRVRDEMLDTGFNPTLLTYNALIQGLCKNQEG 600
           KRRGIKPDHVSFNTLISGYSRRGD+KDAFRVRDEMLD GFNPTLLTYNALIQGL KNQEG
Sbjct: 541 KRRGIKPDHVSFNTLISGYSRRGDVKDAFRVRDEMLDKGFNPTLLTYNALIQGLFKNQEG 600

Query: 601 DHAEELLKEMVSKGITPDDSTYFSI 626
            HAEELLKEMVSKGITPDDSTYFS+
Sbjct: 601 HHAEELLKEMVSKGITPDDSTYFSL 624

BLAST of HG10005842 vs. ExPASy TrEMBL
Match: A0A6J1GND9 (pentatricopeptide repeat-containing protein At2g15630, mitochondrial OS=Cucurbita moschata OX=3662 GN=LOC111455489 PE=4 SV=1)

HSP 1 Score: 1088.6 bits (2814), Expect = 0.0e+00
Identity = 551/625 (88.16%), Positives = 574/625 (91.84%), Query Frame = 0

Query: 1   MIRYKILKLPSLKFKPITNSQSFAPFSSISLQKTPPESPVSSPNSTINAALPLTQNFLEQ 60
           M RYKILKL SL F+  TNSQSFA FSSIS  KTPP+S   SPNST  A   LTQN LE+
Sbjct: 1   MNRYKILKLSSLNFQATTNSQSFALFSSISPHKTPPDSQFPSPNSTTKANSTLTQNSLEK 60

Query: 61  SARSSQWHFIKQVESTLTPSLISETLLNLHQSPQIVLELLNHLQHRLLDAETICLAIVIV 120
            ARSSQWHFIKQVESTLTPSLISETL NLH SPQIVLELLNHLQH LLD++T CLAIVIV
Sbjct: 61  FARSSQWHFIKQVESTLTPSLISETLQNLHDSPQIVLELLNHLQHGLLDSQTHCLAIVIV 120

Query: 121 ARLPSPKPTLQLLKQAVGCGTTNSIREIFELLAASRDSLGFKTSIVFDVLIKSCCEMNRV 180
           ARLPSPKPTLQLLKQAVGCG TNS++EIFELLAASRD LG K+SIVFD LIKSCCE+NR 
Sbjct: 121 ARLPSPKPTLQLLKQAVGCG-TNSVKEIFELLAASRDRLGVKSSIVFDYLIKSCCELNRA 180

Query: 181 DEAFECFYMMKEKGILPKIETCNDLLSLCLKLNRIEAAWVLYAEMFRLRIKSSVYTFNIM 240
           DEAFECFYMMKE G+ PKIETCNDLLSL L+LNR E AWVLYAEMFRLRIKSSVYTFNIM
Sbjct: 181 DEAFECFYMMKENGVAPKIETCNDLLSLFLRLNRTETAWVLYAEMFRLRIKSSVYTFNIM 240

Query: 241 INVLCKEGKLKKAKDFIGHMESFGVEPNVVTYNTIVHGYCSRGRVEGADAILNTMKRKKI 300
           INVLCKEGKLKKAKDFI HME  GV+PNVVTYNTIVHGYCSRGRVEGADAIL+ MKRK I
Sbjct: 241 INVLCKEGKLKKAKDFIEHMECLGVKPNVVTYNTIVHGYCSRGRVEGADAILSIMKRKNI 300

Query: 301 QPDSYTYGSLISGMSKQGRLEEASKIFEEMVQIGLLPSAVTYNTLIDGFCNKGNLDMAFA 360
           +PDSYTYGSLISGM KQGRLEEASKIFEEMVQ GLLPSAVTYNTLIDGFCNKGNLDMAF 
Sbjct: 301 RPDSYTYGSLISGMCKQGRLEEASKIFEEMVQNGLLPSAVTYNTLIDGFCNKGNLDMAFG 360

Query: 361 YKDEMLKKGIMPTVSTYNLLIHALFMEQRIDEAEGMIKEIQEKGMSPDAITYNILINGYC 420
           YKDEM+KKGIMPTVSTYNLLIHALFMEQ+ DEAEGMIKEI EKG++PDAITYNILINGYC
Sbjct: 361 YKDEMMKKGIMPTVSTYNLLIHALFMEQKYDEAEGMIKEIHEKGIAPDAITYNILINGYC 420

Query: 421 RCGNVKKAFRLHDEMLTSGIQPTKVTYTSLIHVLSKKSRMKEADDLFKKITSKGVLPDVI 480
           RCGN KKAFRLHDEML SGI+PTKVTYTSLIHVLSKK+RMKEADDLFKKITSKG+LPDVI
Sbjct: 421 RCGNAKKAFRLHDEMLASGIRPTKVTYTSLIHVLSKKNRMKEADDLFKKITSKGMLPDVI 480

Query: 481 MFNALIDGHCSNGNVEHAFELLKDMDRMKVPPDEVTFNTIMQGHCREGKVEEARELFDEM 540
           MFNALIDGHCSNGNVE AFELLKDMDR KV PDEVTFNTIMQG CREGKVEEARELFDEM
Sbjct: 481 MFNALIDGHCSNGNVERAFELLKDMDRTKVRPDEVTFNTIMQGRCREGKVEEARELFDEM 540

Query: 541 KRRGIKPDHVSFNTLISGYSRRGDIKDAFRVRDEMLDTGFNPTLLTYNALIQGLCKNQEG 600
           KRRGIKPDHVSFNTLISGYSRRGD+KDAFRVRDEMLD GFNPTLLTYNALIQGL KNQEG
Sbjct: 541 KRRGIKPDHVSFNTLISGYSRRGDVKDAFRVRDEMLDKGFNPTLLTYNALIQGLFKNQEG 600

Query: 601 DHAEELLKEMVSKGITPDDSTYFSI 626
            HAEELLKEMVSKGITPDDSTYFS+
Sbjct: 601 HHAEELLKEMVSKGITPDDSTYFSL 624

BLAST of HG10005842 vs. ExPASy TrEMBL
Match: A0A6J1C2M0 (pentatricopeptide repeat-containing protein At2g15630, mitochondrial OS=Momordica charantia OX=3673 GN=LOC111007887 PE=4 SV=1)

HSP 1 Score: 1060.4 bits (2741), Expect = 2.8e-306
Identity = 529/622 (85.05%), Positives = 568/622 (91.32%), Query Frame = 0

Query: 4   YKILKLPSLKFKPITNSQSFAPFSSISLQKTPPESPVSSPNSTINAALPLTQNFLEQSAR 63
           YK L   SL FK  T+  SFAPFSSISLQKTP E+   SPN   N  LPLT NFLE+SAR
Sbjct: 4   YKFLNRSSLNFKTTTHCHSFAPFSSISLQKTPQETLSQSPNIPTNPILPLTHNFLEESAR 63

Query: 64  SSQWHFIKQVESTLTPSLISETLLNLHQSPQIVLELLNHLQHRLLDAETICLAIVIVARL 123
           SSQWH IKQ+   LTPSLIS+TL NLH++PQIVL+LLNHL H ++DAE+ CLAIVIVARL
Sbjct: 64  SSQWHLIKQIVPNLTPSLISQTLQNLHETPQIVLDLLNHLHHGVIDAESRCLAIVIVARL 123

Query: 124 PSPKPTLQLLKQAVGCGTTNSIREIFELLAASRDSLGFKTSIVFDVLIKSCCEMNRVDEA 183
           PSP+P+LQLLK AVG GTT S+REIFE+LA SRD LG K+SIVFD L+KSCCEMNR DE 
Sbjct: 124 PSPRPSLQLLKLAVGSGTT-SVREIFEMLAISRDRLGVKSSIVFDYLVKSCCEMNRADEG 183

Query: 184 FECFYMMKEKGILPKIETCNDLLSLCLKLNRIEAAWVLYAEMFRLRIKSSVYTFNIMINV 243
           FECFYMMKEKG+ PKIETCN+LLSL LKLNR EAAWVLYAEMFRLRIKSSVYTFNIMINV
Sbjct: 184 FECFYMMKEKGVAPKIETCNELLSLFLKLNRTEAAWVLYAEMFRLRIKSSVYTFNIMINV 243

Query: 244 LCKEGKLKKAKDFIGHMESFGVEPNVVTYNTIVHGYCSRGRVEGADAILNTMKRKKIQPD 303
           LCKEGKLKKAKDF+GHMES GV+PNVVTYNTI+HGYCSRGRVEGAD ILNTMKRKKIQPD
Sbjct: 244 LCKEGKLKKAKDFVGHMESLGVKPNVVTYNTIIHGYCSRGRVEGADGILNTMKRKKIQPD 303

Query: 304 SYTYGSLISGMSKQGRLEEASKIFEEMVQIGLLPSAVTYNTLIDGFCNKGNLDMAFAYKD 363
           +YTYGSLI GM KQGRLE+ASKIFEEMVQ GLLPSAVTYNTLIDGFC+KGNL+M+FAYKD
Sbjct: 304 AYTYGSLIGGMCKQGRLEKASKIFEEMVQNGLLPSAVTYNTLIDGFCSKGNLEMSFAYKD 363

Query: 364 EMLKKGIMPTVSTYNLLIHALFMEQRIDEAEGMIKEIQEKGMSPDAITYNILINGYCRCG 423
           EMLKKGI PTVSTYNLLIHALFMEQRIDEAEGM+KEIQE GM+PDAITYNILINGYCRCG
Sbjct: 364 EMLKKGIKPTVSTYNLLIHALFMEQRIDEAEGMMKEIQENGMAPDAITYNILINGYCRCG 423

Query: 424 NVKKAFRLHDEMLTSGIQPTKVTYTSLIHVLSKKSRMKEADDLFKKITSKGVLPDVIMFN 483
           NVKKAF LHDEMLTSGI PTKVTYTSLIHVLSKK+R+KEA++LF KITSKGVLPDVIMFN
Sbjct: 424 NVKKAFSLHDEMLTSGIWPTKVTYTSLIHVLSKKNRIKEANELFNKITSKGVLPDVIMFN 483

Query: 484 ALIDGHCSNGNVEHAFELLKDMDRMKVPPDEVTFNTIMQGHCREGKVEEARELFDEMKRR 543
           ALIDG+CSNGNVE AFELLKDMDRMKV PDEVTFNTIMQG CREGKVEEARELFD+MKRR
Sbjct: 484 ALIDGYCSNGNVERAFELLKDMDRMKVLPDEVTFNTIMQGRCREGKVEEARELFDDMKRR 543

Query: 544 GIKPDHVSFNTLISGYSRRGDIKDAFRVRDEMLDTGFNPTLLTYNALIQGLCKNQEGDHA 603
           GIKPDHVSFNTLISGYSRRGDIKDAFRVRDEML+TGFNPTLLTYNALIQGLCKNQ+GDHA
Sbjct: 544 GIKPDHVSFNTLISGYSRRGDIKDAFRVRDEMLNTGFNPTLLTYNALIQGLCKNQDGDHA 603

Query: 604 EELLKEMVSKGITPDDSTYFSI 626
           E+LLKEMVSKGI PDDSTY S+
Sbjct: 604 EQLLKEMVSKGIRPDDSTYLSL 624

BLAST of HG10005842 vs. ExPASy TrEMBL
Match: A0A5D3C5R0 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold13G00160 PE=4 SV=1)

HSP 1 Score: 1048.9 bits (2711), Expect = 8.5e-303
Identity = 532/625 (85.12%), Positives = 566/625 (90.56%), Query Frame = 0

Query: 1   MIRYKILKLPSLKFKPITNSQSFAPFSSISLQKTPPESPVSSPNSTINAALPLTQNFLEQ 60
           MI Y+I KL SLK           PFSSISL  TP E    SPNS+ NAA PLT +FLEQ
Sbjct: 1   MIPYRIPKLSSLKLN---------PFSSISLHNTPLE----SPNSSTNAASPLTPHFLEQ 60

Query: 61  SARSSQWHFIKQVESTLTPSLISETLLNLHQSPQIVLELLNHLQHRLLDAETICLAIVIV 120
           SARSSQWHFIKQVESTLTPSLIS+TLLNLHQSPQIVL+ LNHL H+L DA T+CLAIVIV
Sbjct: 61  SARSSQWHFIKQVESTLTPSLISQTLLNLHQSPQIVLDFLNHLHHKLPDAHTLCLAIVIV 120

Query: 121 ARLPSPKPTLQLLKQAVGCGTTNSIREIFELLAASRDSLGFKTSIVFDVLIKSCCEMNRV 180
           ARLPSPKP L LLKQA+G GTTNSIREIFELLAASRD LGFK+SIVFD LIKSCC+MNR 
Sbjct: 121 ARLPSPKPALHLLKQALGGGTTNSIREIFELLAASRDRLGFKSSIVFDHLIKSCCDMNRA 180

Query: 181 DEAFECFYMMKEKGILPKIETCNDLLSLCLKLNRIEAAWVLYAEMFRLRIKSSVYTFNIM 240
           DEA ECFY MKEKGILPKIETCN+LLSL LKLNR EAAWVLYAEMFRLRIKSSVYTFNIM
Sbjct: 181 DEALECFYTMKEKGILPKIETCNNLLSLFLKLNRTEAAWVLYAEMFRLRIKSSVYTFNIM 240

Query: 241 INVLCKEGKLKKAKDFIGHMESFGVEPNVVTYNTIVHGYCSRGRVEGADAILNTMKRKKI 300
           INVLCKEGKLKKAKDFIGHME+ GV+PNVVTYNTIVHGYC RGRVEGA AIL TMKR+KI
Sbjct: 241 INVLCKEGKLKKAKDFIGHMETLGVKPNVVTYNTIVHGYCLRGRVEGAAAILTTMKRQKI 300

Query: 301 QPDSYTYGSLISGMSKQGRLEEASKIFEEMVQIGLLPSAVTYNTLIDGFCNKGNLDMAFA 360
            PDS+TYGSLI GM KQGRLEEASKIFEEMVQ GL P+AV YNTLIDGFCNKGNLDMA A
Sbjct: 301 DPDSFTYGSLICGMCKQGRLEEASKIFEEMVQKGLQPNAVIYNTLIDGFCNKGNLDMASA 360

Query: 361 YKDEMLKKGIMPTVSTYNLLIHALFMEQRIDEAEGMIKEIQEKGMSPDAITYNILINGYC 420
           YKDEMLKKGI PTVSTYN LIHALFMEQRIDEAEGMI+EIQEKG+SPDAITYNILINGYC
Sbjct: 361 YKDEMLKKGINPTVSTYNSLIHALFMEQRIDEAEGMIEEIQEKGISPDAITYNILINGYC 420

Query: 421 RCGNVKKAFRLHDEMLTSGIQPTKVTYTSLIHVLSKKSRMKEADDLFKKITSKGVLPDVI 480
           RC N KKAFRLH+EML SGI+PTKVTYTSLIHVLSKK+RMKEADDLFKKITS+GVLPD+I
Sbjct: 421 RCANAKKAFRLHNEMLASGIKPTKVTYTSLIHVLSKKNRMKEADDLFKKITSEGVLPDLI 480

Query: 481 MFNALIDGHCSNGNVEHAFELLKDMDRMKVPPDEVTFNTIMQGHCREGKVEEARELFDEM 540
           MFNALIDGHCSN +V+ AFELLKDMDRMKVPPDEVTFNTIMQGHCREGKVEEAR+LFDEM
Sbjct: 481 MFNALIDGHCSNSDVKRAFELLKDMDRMKVPPDEVTFNTIMQGHCREGKVEEARQLFDEM 540

Query: 541 KRRGIKPDHVSFNTLISGYSRRGDIKDAFRVRDEMLDTGFNPTLLTYNALIQGLCKNQEG 600
           KRRGIKPDH+SFNTLISGYSRRGDIKDAFRV++EML+TGFNPT+LTYNALIQGLCKNQEG
Sbjct: 541 KRRGIKPDHISFNTLISGYSRRGDIKDAFRVQNEMLNTGFNPTVLTYNALIQGLCKNQEG 600

Query: 601 DHAEELLKEMVSKGITPDDSTYFSI 626
           D AEELLKEMVS GI PDD+TYF++
Sbjct: 601 DRAEELLKEMVSNGIKPDDTTYFTL 612

BLAST of HG10005842 vs. ExPASy TrEMBL
Match: A0A1S3C0D3 (pentatricopeptide repeat-containing protein At2g15630, mitochondrial OS=Cucumis melo OX=3656 GN=LOC103495200 PE=4 SV=1)

HSP 1 Score: 1048.9 bits (2711), Expect = 8.5e-303
Identity = 532/625 (85.12%), Positives = 566/625 (90.56%), Query Frame = 0

Query: 1   MIRYKILKLPSLKFKPITNSQSFAPFSSISLQKTPPESPVSSPNSTINAALPLTQNFLEQ 60
           MI Y+I KL SLK           PFSSISL  TP E    SPNS+ NAA PLT +FLEQ
Sbjct: 1   MIPYRIPKLSSLKLN---------PFSSISLHNTPLE----SPNSSTNAASPLTPHFLEQ 60

Query: 61  SARSSQWHFIKQVESTLTPSLISETLLNLHQSPQIVLELLNHLQHRLLDAETICLAIVIV 120
           SARSSQWHFIKQVESTLTPSLIS+TLLNLHQSPQIVL+ LNHL H+L DA T+CLAIVIV
Sbjct: 61  SARSSQWHFIKQVESTLTPSLISQTLLNLHQSPQIVLDFLNHLHHKLPDAHTLCLAIVIV 120

Query: 121 ARLPSPKPTLQLLKQAVGCGTTNSIREIFELLAASRDSLGFKTSIVFDVLIKSCCEMNRV 180
           ARLPSPKP L LLKQA+G GTTNSIREIFELLAASRD LGFK+SIVFD LIKSCC+MNR 
Sbjct: 121 ARLPSPKPALHLLKQALGGGTTNSIREIFELLAASRDRLGFKSSIVFDHLIKSCCDMNRA 180

Query: 181 DEAFECFYMMKEKGILPKIETCNDLLSLCLKLNRIEAAWVLYAEMFRLRIKSSVYTFNIM 240
           DEA ECFY MKEKGILPKIETCN+LLSL LKLNR EAAWVLYAEMFRLRIKSSVYTFNIM
Sbjct: 181 DEALECFYTMKEKGILPKIETCNNLLSLFLKLNRTEAAWVLYAEMFRLRIKSSVYTFNIM 240

Query: 241 INVLCKEGKLKKAKDFIGHMESFGVEPNVVTYNTIVHGYCSRGRVEGADAILNTMKRKKI 300
           INVLCKEGKLKKAKDFIGHME+ GV+PNVVTYNTIVHGYC RGRVEGA AIL TMKR+KI
Sbjct: 241 INVLCKEGKLKKAKDFIGHMETLGVKPNVVTYNTIVHGYCLRGRVEGAAAILTTMKRQKI 300

Query: 301 QPDSYTYGSLISGMSKQGRLEEASKIFEEMVQIGLLPSAVTYNTLIDGFCNKGNLDMAFA 360
            PDS+TYGSLI GM KQGRLEEASKIFEEMVQ GL P+AV YNTLIDGFCNKGNLDMA A
Sbjct: 301 DPDSFTYGSLICGMCKQGRLEEASKIFEEMVQKGLQPNAVIYNTLIDGFCNKGNLDMASA 360

Query: 361 YKDEMLKKGIMPTVSTYNLLIHALFMEQRIDEAEGMIKEIQEKGMSPDAITYNILINGYC 420
           YKDEMLKKGI PTVSTYN LIHALFMEQRIDEAEGMI+EIQEKG+SPDAITYNILINGYC
Sbjct: 361 YKDEMLKKGINPTVSTYNSLIHALFMEQRIDEAEGMIEEIQEKGISPDAITYNILINGYC 420

Query: 421 RCGNVKKAFRLHDEMLTSGIQPTKVTYTSLIHVLSKKSRMKEADDLFKKITSKGVLPDVI 480
           RC N KKAFRLH+EML SGI+PTKVTYTSLIHVLSKK+RMKEADDLFKKITS+GVLPD+I
Sbjct: 421 RCANAKKAFRLHNEMLASGIKPTKVTYTSLIHVLSKKNRMKEADDLFKKITSEGVLPDLI 480

Query: 481 MFNALIDGHCSNGNVEHAFELLKDMDRMKVPPDEVTFNTIMQGHCREGKVEEARELFDEM 540
           MFNALIDGHCSN +V+ AFELLKDMDRMKVPPDEVTFNTIMQGHCREGKVEEAR+LFDEM
Sbjct: 481 MFNALIDGHCSNSDVKRAFELLKDMDRMKVPPDEVTFNTIMQGHCREGKVEEARQLFDEM 540

Query: 541 KRRGIKPDHVSFNTLISGYSRRGDIKDAFRVRDEMLDTGFNPTLLTYNALIQGLCKNQEG 600
           KRRGIKPDH+SFNTLISGYSRRGDIKDAFRV++EML+TGFNPT+LTYNALIQGLCKNQEG
Sbjct: 541 KRRGIKPDHISFNTLISGYSRRGDIKDAFRVQNEMLNTGFNPTVLTYNALIQGLCKNQEG 600

Query: 601 DHAEELLKEMVSKGITPDDSTYFSI 626
           D AEELLKEMVS GI PDD+TYF++
Sbjct: 601 DRAEELLKEMVSNGIKPDDTTYFTL 612

BLAST of HG10005842 vs. TAIR 10
Match: AT2G15630.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 709.1 bits (1829), Expect = 3.1e-204
Identity = 341/585 (58.29%), Positives = 450/585 (76.92%), Query Frame = 0

Query: 41  SSPNSTINAALPLTQNFLEQSARSSQWHFIKQVESTLTPSLISETLLNLHQSPQIVLELL 100
           S+P S +    P+T   L +S RSSQWH ++ V   LTPSL+S TLL+L ++P +    +
Sbjct: 36  STPESVLP---PITSEILLESIRSSQWHIVEHVADKLTPSLVSTTLLSLVKTPNLAFNFV 95

Query: 101 NHLQHRLLDAETICLAIVIVARLPSPKPTLQLLKQAVGCGTTNSIREIFELLAASRDSLG 160
           NH+    LD +T CLAI ++++L SPKP  QLLK+ V     NSIR +F+ L  + D L 
Sbjct: 96  NHIDLYRLDFQTQCLAIAVISKLSSPKPVTQLLKEVV-TSRKNSIRNLFDELVLAHDRLE 155

Query: 161 FKTSIVFDVLIKSCCEMNRVDEAFECFYMMKEKGILPKIETCNDLLSLCLKLNRIEAAWV 220
            K++I+FD+L++ CC++  VDEA ECFY+MKEKG  PK ETCN +L+L  +LNRIE AWV
Sbjct: 156 TKSTILFDLLVRCCCQLRMVDEAIECFYLMKEKGFYPKTETCNHILTLLSRLNRIENAWV 215

Query: 221 LYAEMFRLRIKSSVYTFNIMINVLCKEGKLKKAKDFIGHMESFGVEPNVVTYNTIVHGYC 280
            YA+M+R+ IKS+VYTFNIMINVLCKEGKLKKAK F+G ME FG++P +VTYNT+V G+ 
Sbjct: 216 FYADMYRMEIKSNVYTFNIMINVLCKEGKLKKAKGFLGIMEVFGIKPTIVTYNTLVQGFS 275

Query: 281 SRGRVEGADAILNTMKRKKIQPDSYTYGSLISGMSKQGRLEEASKIFEEMVQIGLLPSAV 340
            RGR+EGA  I++ MK K  QPD  TY  ++S M  +GR   AS++  EM +IGL+P +V
Sbjct: 276 LRGRIEGARLIISEMKSKGFQPDMQTYNPILSWMCNEGR---ASEVLREMKEIGLVPDSV 335

Query: 341 TYNTLIDGFCNKGNLDMAFAYKDEMLKKGIMPTVSTYNLLIHALFMEQRIDEAEGMIKEI 400
           +YN LI G  N G+L+MAFAY+DEM+K+G++PT  TYN LIH LFME +I+ AE +I+EI
Sbjct: 336 SYNILIRGCSNNGDLEMAFAYRDEMVKQGMVPTFYTYNTLIHGLFMENKIEAAEILIREI 395

Query: 401 QEKGMSPDAITYNILINGYCRCGNVKKAFRLHDEMLTSGIQPTKVTYTSLIHVLSKKSRM 460
           +EKG+  D++TYNILINGYC+ G+ KKAF LHDEM+T GIQPT+ TYTSLI+VL +K++ 
Sbjct: 396 REKGIVLDSVTYNILINGYCQHGDAKKAFALHDEMMTDGIQPTQFTYTSLIYVLCRKNKT 455

Query: 461 KEADDLFKKITSKGVLPDVIMFNALIDGHCSNGNVEHAFELLKDMDRMKVPPDEVTFNTI 520
           +EAD+LF+K+  KG+ PD++M N L+DGHC+ GN++ AF LLK+MD M + PD+VT+N +
Sbjct: 456 READELFEKVVGKGMKPDLVMMNTLMDGHCAIGNMDRAFSLLKEMDMMSINPDDVTYNCL 515

Query: 521 MQGHCREGKVEEARELFDEMKRRGIKPDHVSFNTLISGYSRRGDIKDAFRVRDEMLDTGF 580
           M+G C EGK EEAREL  EMKRRGIKPDH+S+NTLISGYS++GD K AF VRDEML  GF
Sbjct: 516 MRGLCGEGKFEEARELMGEMKRRGIKPDHISYNTLISGYSKKGDTKHAFMVRDEMLSLGF 575

Query: 581 NPTLLTYNALIQGLCKNQEGDHAEELLKEMVSKGITPDDSTYFSI 626
           NPTLLTYNAL++GL KNQEG+ AEELL+EM S+GI P+DS++ S+
Sbjct: 576 NPTLLTYNALLKGLSKNQEGELAEELLREMKSEGIVPNDSSFCSV 613

BLAST of HG10005842 vs. TAIR 10
Match: AT5G01110.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 343.6 bits (880), Expect = 3.4e-94
Identity = 196/638 (30.72%), Positives = 340/638 (53.29%), Query Frame = 0

Query: 1   MIRYKILKLPSLKFKPITNSQSFAPFSSISLQKTPPESP-VSSPNSTINAALPLTQNF-- 60
           MI ++I  +PS    P+T    F P  +++   +P   P  SS +S+ +A+  ++ +F  
Sbjct: 1   MIVHRI--IPSRVKDPLTR---FKPLKNLTTSSSPVFEPSSSSSSSSSSASFSVSDSFLV 60

Query: 61  ------LEQSARSSQWHFIKQVESTLTPSLISETLLNLHQSPQIVLELLNHL-------Q 120
                 L+Q   + + H I+     L P  + E L        +    ++ L       +
Sbjct: 61  EKICFSLKQGNNNVRNHLIR-----LNPLAVVEVLYRCRNDLTLGQRFVDQLGFHFPNFK 120

Query: 121 HRLLDAETICLAIVIVARLPSPKPTLQLLKQAVGCGTTNSIREIFELLAASRDSLGFKTS 180
           H  L    +   +V   RL   +  L  + +  G     S  EI   L ++  + G   S
Sbjct: 121 HTSLSLSAMIHILVRSGRLSDAQSCLLRMIRRSGV----SRLEIVNSLDSTFSNCGSNDS 180

Query: 181 IVFDVLIKSCCEMNRVDEAFECFYMMKEKGILPKIETCNDLLSLCLKLNRIEAAWVLYAE 240
            VFD+LI++  +  ++ EA E F +++ KG    I+ CN L+   +++  +E AW +Y E
Sbjct: 181 -VFDLLIRTYVQARKLREAHEAFTLLRSKGFTVSIDACNALIGSLVRIGWVELAWGVYQE 240

Query: 241 MFRLRIKSSVYTFNIMINVLCKEGKLKKAKDFIGHMESFGVEPNVVTYNTIVHGYCSRGR 300
           + R  +  +VYT NIM+N LCK+GK++K   F+  ++  GV P++VTYNT++  Y S+G 
Sbjct: 241 ISRSGVGINVYTLNIMVNALCKDGKMEKVGTFLSQVQEKGVYPDIVTYNTLISAYSSKGL 300

Query: 301 VEGADAILNTMKRKKIQPDSYTYGSLISGMSKQGRLEEASKIFEEMVQIGLLPSAVTYNT 360
           +E A  ++N M  K   P  YTY ++I+G+ K G+ E A ++F EM++ GL P + TY +
Sbjct: 301 MEEAFELMNAMPGKGFSPGVYTYNTVINGLCKHGKYERAKEVFAEMLRSGLSPDSTTYRS 360

Query: 361 LIDGFCNKGNLDMAFAYKDEMLKKGIMPTVSTYNLLIHALFMEQRIDEAEGMIKEIQEKG 420
           L+   C KG++        +M  + ++P +  ++ ++        +D+A      ++E G
Sbjct: 361 LLMEACKKGDVVETEKVFSDMRSRDVVPDLVCFSSMMSLFTRSGNLDKALMYFNSVKEAG 420

Query: 421 MSPDAITYNILINGYCRCGNVKKAFRLHDEMLTSGIQPTKVTYTSLIHVLSKKSRMKEAD 480
           + PD + Y ILI GYCR G +  A  L +EML  G     VTY +++H L K+  + EAD
Sbjct: 421 LIPDNVIYTILIQGYCRKGMISVAMNLRNEMLQQGCAMDVVTYNTILHGLCKRKMLGEAD 480

Query: 481 DLFKKITSKGVLPDVIMFNALIDGHCSNGNVEHAFELLKDMDRMKVPPDEVTFNTIMQGH 540
            LF ++T + + PD      LIDGHC  GN+++A EL + M   ++  D VT+NT++ G 
Sbjct: 481 KLFNEMTERALFPDSYTLTILIDGHCKLGNLQNAMELFQKMKEKRIRLDVVTYNTLLDGF 540

Query: 541 CREGKVEEARELFDEMKRRGIKPDHVSFNTLISGYSRRGDIKDAFRVRDEMLDTGFNPTL 600
            + G ++ A+E++ +M  + I P  +S++ L++    +G + +AFRV DEM+     PT+
Sbjct: 541 GKVGDIDTAKEIWADMVSKEILPTPISYSILVNALCSKGHLAEAFRVWDEMISKNIKPTV 600

Query: 601 LTYNALIQGLCKNQEGDHAEELLKEMVSKGITPDDSTY 623
           +  N++I+G C++      E  L++M+S+G  PD  +Y
Sbjct: 601 MICNSMIKGYCRSGNASDGESFLEKMISEGFVPDCISY 623

BLAST of HG10005842 vs. TAIR 10
Match: AT5G39710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 318.9 bits (816), Expect = 8.9e-87
Identity = 176/589 (29.88%), Positives = 325/589 (55.18%), Query Frame = 0

Query: 41  SSPNSTINAALPLTQNFLEQSARSSQWHFIKQVESTLTPSLISETLLNLHQSPQIVLELL 100
           SSP+ ++ A   LT  FL++       + +  + +  TP   S  LL       ++L+ L
Sbjct: 17  SSPSDSLLADKALT--FLKRHP-----YQLHHLSANFTPEAASNLLLKSQNDQALILKFL 76

Query: 101 NHLQ-HRLLDAETICLAIVIVARLPSPKPTLQLLKQAVGCGTTNS--IREIFELLAASRD 160
           N    H+       C+ + I+ +    K T Q+L + V   T +      +F+ L  + D
Sbjct: 77  NWANPHQFFTLRCKCITLHILTKFKLYK-TAQILAEDVAAKTLDDEYASLVFKSLQETYD 136

Query: 161 SLGFKTSIVFDVLIKSCCEMNRVDEAFECFYMMKEKGILPKIETCNDLLSLCLKLNR-IE 220
            L + TS VFD+++KS   ++ +D+A    ++ +  G +P + + N +L   ++  R I 
Sbjct: 137 -LCYSTSSVFDLVVKSYSRLSLIDKALSIVHLAQAHGFMPGVLSYNAVLDATIRSKRNIS 196

Query: 221 AAWVLYAEMFRLRIKSSVYTFNIMINVLCKEGKLKKAKDFIGHMESFGVEPNVVTYNTIV 280
            A  ++ EM   ++  +V+T+NI+I   C  G +  A      ME+ G  PNVVTYNT++
Sbjct: 197 FAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLI 256

Query: 281 HGYCSRGRVEGADAILNTMKRKKIQPDSYTYGSLISGMSKQGRLEEASKIFEEMVQIGLL 340
            GYC   +++    +L +M  K ++P+  +Y  +I+G+ ++GR++E S +  EM + G  
Sbjct: 257 DGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYS 316

Query: 341 PSAVTYNTLIDGFCNKGNLDMAFAYKDEMLKKGIMPTVSTYNLLIHALFMEQRIDEAEGM 400
              VTYNTLI G+C +GN   A     EML+ G+ P+V TY  LIH++     ++ A   
Sbjct: 317 LDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEF 376

Query: 401 IKEIQEKGMSPDAITYNILINGYCRCGNVKKAFRLHDEMLTSGIQPTKVTYTSLIHVLSK 460
           + +++ +G+ P+  TY  L++G+ + G + +A+R+  EM  +G  P+ VTY +LI+    
Sbjct: 377 LDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALINGHCV 436

Query: 461 KSRMKEADDLFKKITSKGVLPDVIMFNALIDGHCSNGNVEHAFELLKDMDRMKVPPDEVT 520
             +M++A  + + +  KG+ PDV+ ++ ++ G C + +V+ A  + ++M    + PD +T
Sbjct: 437 TGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMVEKGIKPDTIT 496

Query: 521 FNTIMQGHCREGKVEEARELFDEMKRRGIKPDHVSFNTLISGYSRRGDIKDAFRVRDEML 580
           +++++QG C + + +EA +L++EM R G+ PD  ++  LI+ Y   GD++ A ++ +EM+
Sbjct: 497 YSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLEKALQLHNEMV 556

Query: 581 DTGFNPTLLTYNALIQGLCKNQEGDHAEELLKEMVSKGITPDDSTYFSI 626
           + G  P ++TY+ LI GL K      A+ LL ++  +   P D TY ++
Sbjct: 557 EKGVLPDVVTYSVLINGLNKQSRTREAKRLLLKLFYEESVPSDVTYHTL 596

BLAST of HG10005842 vs. TAIR 10
Match: AT1G05670.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 317.8 bits (813), Expect = 2.0e-86
Identity = 169/542 (31.18%), Positives = 306/542 (56.46%), Query Frame = 0

Query: 86  LLNLHQSPQIVLELLNHLQHRL-LDAETICLAIVIVARLPSPKPTLQLLK---QAVGCGT 145
           L+ +    ++VL+  +  + R   + E++C+ I +       K    L+    +      
Sbjct: 94  LMKIKCDYRLVLDFFDWARSRRDSNLESLCIVIHLAVASKDLKVAQSLISSFWERPKLNV 153

Query: 146 TNSIREIFELLAASRDSLGFKTSIVFDVLIKSCCEMNRVDEAFECFYMMKEKGILPKIET 205
           T+S  + F+LL  +    G     VFDV  +   +   + EA   F  M   G++  +++
Sbjct: 154 TDSFVQFFDLLVYTYKDWG-SDPRVFDVFFQVLVDFGLLREARRVFEKMLNYGLVLSVDS 213

Query: 206 CNDLLS-LCLKLNRIEAAWVLYAEMFRLRIKSSVYTFNIMINVLCKEGKLKKAKDFIGHM 265
           CN  L+ L     +   A +++ E   + +  +V ++NI+I+ +C+ G++K+A   +  M
Sbjct: 214 CNVYLTRLSKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLLLLM 273

Query: 266 ESFGVEPNVVTYNTIVHGYCSRGRVEGADAILNTMKRKKIQPDSYTYGSLISGMSKQGRL 325
           E  G  P+V++Y+T+V+GYC  G ++    ++  MKRK ++P+SY YGS+I  + +  +L
Sbjct: 274 ELKGYTPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGLLCRICKL 333

Query: 326 EEASKIFEEMVQIGLLPSAVTYNTLIDGFCNKGNLDMAFAYKDEMLKKGIMPTVSTYNLL 385
            EA + F EM++ G+LP  V Y TLIDGFC +G++  A  +  EM  + I P V TY  +
Sbjct: 334 AEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVLTYTAI 393

Query: 386 IHALFMEQRIDEAEGMIKEIQEKGMSPDAITYNILINGYCRCGNVKKAFRLHDEMLTSGI 445
           I        + EA  +  E+  KG+ PD++T+  LINGYC+ G++K AFR+H+ M+ +G 
Sbjct: 394 ISGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHNHMIQAGC 453

Query: 446 QPTKVTYTSLIHVLSKKSRMKEADDLFKKITSKGVLPDVIMFNALIDGHCSNGNVEHAFE 505
            P  VTYT+LI  L K+  +  A++L  ++   G+ P++  +N++++G C +GN+E A +
Sbjct: 454 SPNVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNIEEAVK 513

Query: 506 LLKDMDRMKVPPDEVTFNTIMQGHCREGKVEEARELFDEMKRRGIKPDHVSFNTLISGYS 565
           L+ + +   +  D VT+ T+M  +C+ G++++A+E+  EM  +G++P  V+FN L++G+ 
Sbjct: 514 LVGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVLMNGFC 573

Query: 566 RRGDIKDAFRVRDEMLDTGFNPTLLTYNALIQGLCKNQEGDHAEELLKEMVSKGITPDDS 623
             G ++D  ++ + ML  G  P   T+N+L++  C       A  + K+M S+G+ PD  
Sbjct: 574 LHGMLEDGEKLLNWMLAKGIAPNATTFNSLVKQYCIRNNLKAATAIYKDMCSRGVGPDGK 633

BLAST of HG10005842 vs. TAIR 10
Match: AT1G05670.2 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 317.8 bits (813), Expect = 2.0e-86
Identity = 169/542 (31.18%), Positives = 306/542 (56.46%), Query Frame = 0

Query: 86  LLNLHQSPQIVLELLNHLQHRL-LDAETICLAIVIVARLPSPKPTLQLLK---QAVGCGT 145
           L+ +    ++VL+  +  + R   + E++C+ I +       K    L+    +      
Sbjct: 94  LMKIKCDYRLVLDFFDWARSRRDSNLESLCIVIHLAVASKDLKVAQSLISSFWERPKLNV 153

Query: 146 TNSIREIFELLAASRDSLGFKTSIVFDVLIKSCCEMNRVDEAFECFYMMKEKGILPKIET 205
           T+S  + F+LL  +    G     VFDV  +   +   + EA   F  M   G++  +++
Sbjct: 154 TDSFVQFFDLLVYTYKDWG-SDPRVFDVFFQVLVDFGLLREARRVFEKMLNYGLVLSVDS 213

Query: 206 CNDLLS-LCLKLNRIEAAWVLYAEMFRLRIKSSVYTFNIMINVLCKEGKLKKAKDFIGHM 265
           CN  L+ L     +   A +++ E   + +  +V ++NI+I+ +C+ G++K+A   +  M
Sbjct: 214 CNVYLTRLSKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLLLLM 273

Query: 266 ESFGVEPNVVTYNTIVHGYCSRGRVEGADAILNTMKRKKIQPDSYTYGSLISGMSKQGRL 325
           E  G  P+V++Y+T+V+GYC  G ++    ++  MKRK ++P+SY YGS+I  + +  +L
Sbjct: 274 ELKGYTPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGLLCRICKL 333

Query: 326 EEASKIFEEMVQIGLLPSAVTYNTLIDGFCNKGNLDMAFAYKDEMLKKGIMPTVSTYNLL 385
            EA + F EM++ G+LP  V Y TLIDGFC +G++  A  +  EM  + I P V TY  +
Sbjct: 334 AEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVLTYTAI 393

Query: 386 IHALFMEQRIDEAEGMIKEIQEKGMSPDAITYNILINGYCRCGNVKKAFRLHDEMLTSGI 445
           I        + EA  +  E+  KG+ PD++T+  LINGYC+ G++K AFR+H+ M+ +G 
Sbjct: 394 ISGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHNHMIQAGC 453

Query: 446 QPTKVTYTSLIHVLSKKSRMKEADDLFKKITSKGVLPDVIMFNALIDGHCSNGNVEHAFE 505
            P  VTYT+LI  L K+  +  A++L  ++   G+ P++  +N++++G C +GN+E A +
Sbjct: 454 SPNVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNIEEAVK 513

Query: 506 LLKDMDRMKVPPDEVTFNTIMQGHCREGKVEEARELFDEMKRRGIKPDHVSFNTLISGYS 565
           L+ + +   +  D VT+ T+M  +C+ G++++A+E+  EM  +G++P  V+FN L++G+ 
Sbjct: 514 LVGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVLMNGFC 573

Query: 566 RRGDIKDAFRVRDEMLDTGFNPTLLTYNALIQGLCKNQEGDHAEELLKEMVSKGITPDDS 623
             G ++D  ++ + ML  G  P   T+N+L++  C       A  + K+M S+G+ PD  
Sbjct: 574 LHGMLEDGEKLLNWMLAKGIAPNATTFNSLVKQYCIRNNLKAATAIYKDMCSRGVGPDGK 633

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038887485.1  0.0e+00   90.72     pentatricopeptide repeat-containing protein At2g15630, mitochondrial [Benincasa ...
XP_022972339.1  0.0e+00   89.12     pentatricopeptide repeat-containing protein At2g15630, mitochondrial [Cucurbita ...
XP_023554102.1  0.0e+00   88.48     pentatricopeptide repeat-containing protein At2g15630, mitochondrial isoform X1 ...
XP_022952975.1  0.0e+00   88.16     pentatricopeptide repeat-containing protein At2g15630, mitochondrial [Cucurbita ...
KAG6572005.1    0.0e+00   88.00     Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a...

Match Name      E-value   Identity  Description
Q9ZQF1          4.3e-203  58.29     Pentatricopeptide repeat-containing protein At2g15630, mitochondrial OS=Arabidop...
Q9LFC5          4.8e-93   30.72     Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana OX...
Q9FIX3          1.3e-85   29.88     Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX...
Q0WVK7          2.8e-85   31.18     Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidop...
O04504          9.0e-84   33.94     Pentatricopeptide repeat-containing protein At1g09820 OS=Arabidopsis thaliana OX...

Match Name      E-value   Identity  Description
A0A6J1IB79      0.0e+00   89.12     pentatricopeptide repeat-containing protein At2g15630, mitochondrial OS=Cucurbit...
A0A6J1GND9      0.0e+00   88.16     pentatricopeptide repeat-containing protein At2g15630, mitochondrial OS=Cucurbit...
A0A6J1C2M0      2.8e-306  85.05     pentatricopeptide repeat-containing protein At2g15630, mitochondrial OS=Momordic...
A0A5D3C5R0      8.5e-303  85.12     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3C0D3      8.5e-303  85.12     pentatricopeptide repeat-containing protein At2g15630, mitochondrial OS=Cucumis ...

Match Name      E-value   Identity  Description
AT2G15630.1     3.1e-204  58.29     Pentatricopeptide repeat (PPR) superfamily protein
AT5G01110.1     3.4e-94   30.72     Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G39710.1     8.9e-87   29.88     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G05670.1     2.0e-86   31.18     Pentatricopeptide repeat (PPR-like) superfamily protein
AT1G05670.2     2.0e-86   31.18     Pentatricopeptide repeat (PPR-like) superfamily protein
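Summary rows like these are convenient to load into records for filtering. The sketch below is illustrative only: the `Hit` type, the `filter_hits` helper and both cut-off values are assumptions made for this example and are not thresholds used by the database. The rows are the TAIR 10 hits listed above.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    name: str
    evalue: float
    identity: float  # percent identity as reported in the summary table
    description: str


# TAIR 10 summary rows copied from the table above.
TAIR_HITS = [
    Hit("AT2G15630.1", 3.1e-204, 58.29, "Pentatricopeptide repeat (PPR) superfamily protein"),
    Hit("AT5G01110.1", 3.4e-94, 30.72, "Tetratricopeptide repeat (TPR)-like superfamily protein"),
    Hit("AT5G39710.1", 8.9e-87, 29.88, "Tetratricopeptide repeat (TPR)-like superfamily protein"),
    Hit("AT1G05670.1", 2.0e-86, 31.18, "Pentatricopeptide repeat (PPR-like) superfamily protein"),
    Hit("AT1G05670.2", 2.0e-86, 31.18, "Pentatricopeptide repeat (PPR-like) superfamily protein"),
]


def filter_hits(hits, max_evalue=1e-100, min_identity=50.0):
    """Keep hits at or below max_evalue and at or above min_identity.

    Both defaults are arbitrary example cut-offs, not recommended values.
    """
    return [h for h in hits if h.evalue <= max_evalue and h.identity >= min_identity]


strong = filter_hits(TAIR_HITS)
print([h.name for h in strong])
```

With these example cut-offs only AT2G15630.1 is retained, which agrees with the "At2g15630" naming of the top hits in the other databases.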
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                                    Source       Source Term     Source Description               Alignment
None       No IPR available                                   COILS        Coil            Coil                             coord: 383..403
None       No IPR available                                   PANTHER      PTHR47932       ATPASE EXPRESSION PROTEIN 3      coord: 48..625
None       No IPR available                                   PANTHER      PTHR47932:SF11  OS04G0477200 PROTEIN             coord: 48..625
None       No IPR available                                   SUPERFAMILY  81901           HCP-like                         coord: 310..504
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10      Tetratricopeptide repeat domain  coord: 528..625; e-value: 1.2E-32; score: 115.6
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10      Tetratricopeptide repeat domain  coord: 318..440; e-value: 1.2E-40; score: 141.8
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10      Tetratricopeptide repeat domain  coord: 59..227; e-value: 1.5E-12; score: 49.4
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10      Tetratricopeptide repeat domain  coord: 441..527; e-value: 3.2E-28; score: 100.3
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10      Tetratricopeptide repeat domain  coord: 228..317; e-value: 4.7E-29; score: 103.0
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756       TIGR00756                        coord: 410..443; e-value: 1.1E-11; score: 42.2
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756       TIGR00756                        coord: 340..373; e-value: 2.1E-9; score: 35.0
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756       TIGR00756                        coord: 445..479; e-value: 3.9E-7; score: 27.8
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756       TIGR00756                        coord: 305..337; e-value: 5.3E-7; score: 27.4
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756       TIGR00756                        coord: 270..304; e-value: 1.3E-9; score: 35.6
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756       TIGR00756                        coord: 550..583; e-value: 1.0E-8; score: 32.8
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756       TIGR00756                        coord: 515..548; e-value: 2.6E-13; score: 47.3
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756       TIGR00756                        coord: 166..197; e-value: 1.8E-7; score: 28.9
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756       TIGR00756                        coord: 376..409; e-value: 2.7E-7; score: 28.3
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756       TIGR00756                        coord: 235..269; e-value: 2.4E-9; score: 34.8
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756       TIGR00756                        coord: 586..618; e-value: 3.9E-10; score: 37.3
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756       TIGR00756                        coord: 481..514; e-value: 3.5E-9; score: 34.3
IPR002885  Pentatricopeptide repeat                           PFAM         PF01535         PPR                              coord: 166..195; e-value: 1.0E-5; score: 25.4
IPR002885  Pentatricopeptide repeat                           PFAM         PF13041         PPR_2                            coord: 267..316; e-value: 3.6E-17; score: 62.3
IPR002885  Pentatricopeptide repeat                           PFAM         PF13041         PPR_2                            coord: 583..622; e-value: 2.3E-11; score: 43.7
IPR002885  Pentatricopeptide repeat                           PFAM         PF13041         PPR_2                            coord: 337..384; e-value: 8.7E-16; score: 57.9
IPR002885  Pentatricopeptide repeat                           PFAM         PF13041         PPR_2                            coord: 197..246; e-value: 8.3E-12; score: 45.1
IPR002885  Pentatricopeptide repeat                           PFAM         PF13041         PPR_2                            coord: 407..455; e-value: 4.1E-17; score: 62.1
IPR002885  Pentatricopeptide repeat                           PFAM         PF13041         PPR_2                            coord: 512..555; e-value: 2.1E-17; score: 63.1
IPR002885  Pentatricopeptide repeat                           PFAM         PF12854         PPR_1                            coord: 473..505; e-value: 3.7E-12; score: 45.8
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375         PPR                              coord: 303..337; score: 13.515341
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375         PPR                              coord: 233..267; score: 12.309597
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375         PPR                              coord: 548..582; score: 12.978237
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375         PPR                              coord: 478..512; score: 12.276713
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375         PPR                              coord: 408..442; score: 15.104729
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375         PPR                              coord: 583..617; score: 12.83574
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375         PPR                              coord: 163..197; score: 11.640958
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375         PPR                              coord: 373..407; score: 11.114816
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375         PPR                              coord: 443..477; score: 11.060009
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375         PPR                              coord: 268..302; score: 12.857662
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375         PPR                              coord: 513..547; score: 15.784329
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375         PPR                              coord: 338..372; score: 12.989198
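The PROSITE PS51375 coordinates are listed out of order, which makes it hard to see how the PPR motifs tile the protein. As a rough sketch (the `PPR_HITS` list is copied from the PROSITE rows above; the `find_gaps` helper is a hypothetical name introduced here), sorting the hits and looking for uncovered intervals gives a quick picture of the repeat array:

```python
# PROSITE PS51375 (PPR) hit coordinates copied from the rows above,
# in the order they appear on the page.
PPR_HITS = [
    (303, 337), (233, 267), (548, 582), (478, 512), (408, 442), (583, 617),
    (163, 197), (373, 407), (443, 477), (268, 302), (513, 547), (338, 372),
]


def find_gaps(hits):
    """Return (start, end) intervals left uncovered between sorted hits."""
    if not hits:
        return []
    ordered = sorted(hits)
    gaps = []
    covered_end = ordered[0][1]
    for start, end in ordered[1:]:
        if start > covered_end + 1:
            gaps.append((covered_end + 1, start - 1))
        covered_end = max(covered_end, end)
    return gaps


# Length of each motif (inclusive coordinates).
motif_lengths = {end - start + 1 for start, end in PPR_HITS}
print(len(PPR_HITS), motif_lengths, find_gaps(PPR_HITS))
# prints: 12 {35} [(198, 232)]
```

All twelve PROSITE motifs are 35 residues long and run back to back from residue 163 to 617, except for one uncovered 35-residue interval at 198..232. The Pfam PPR_2 hit at 197..246 overlaps that interval, so it may hold a more divergent repeat that PROSITE did not report.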

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10005842.1  HG10005842.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0005515      protein binding