HG10021209 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10021209
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pentatricopeptide repeat-containing protein
Location: Chr05: 6594269 .. 6596237 (-)
RNA-Seq Expression: HG10021209
Synteny: HG10021209
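The Location field packs chromosome, start, end, and strand into one string. The sketch below shows one way to split it and compute the length of the genomic span. It assumes the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state explicitly); `parse_location` and `span_length` are illustrative names, not part of any database API.

```python
import re


def parse_location(text):
    """Split a string such as 'Chr05: 6594269 .. 6596237 (-)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both endpoints count.
    return end - start + 1


chrom, start, end, strand = parse_location("Chr05: 6594269 .. 6596237 (-)")
print(chrom, start, end, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the span is 1969 bp; the minus sign indicates that the gene is annotated on the reverse strand of Chr05.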
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATCAGAAGCTCTTCTCCAAAGCCTTAACACGCTATGCTATAGCTAGCCGATCTTACCACACGAATCGGTTGAAAAAGGCGACTCTATATGCCAAAATTAGTCCACTGGGCGATCCTAGCATCAGCGTGGAGCCGGAGCTCGACGGTTGGGTTCAAGAGGGGAAGAAGGTACGAGTCGCTGAGCTTCAGAGAATCATTCACGACCTTCGCAAGCGCAAGCGTTTTACCCAAGCTCTTGAGGTTTGAATTCTTCTTTCTTGAAATGTGCTTTGTTTTGGTTTGAGTACTTAGATGTAGAGGAAGAAGTGAAATGTAAAAAAATGATGCATCGTGTGAGAACTGTGTGCCTTTTAGACGTTAAATCTGTGTGGCCTAATCGCTAACTTCAAATTATCAATAGATTTAGCTGGAGAAATGGAAGGCACGTAGGTTTTTTATGCTAAGAAGTCAGAAGCTCCTTCCTTACGTTCTTCGATTAAGTGAATTTGTTCGAGTAACTTCATTAGTTCACATAATATGGATTTTTCAGAGATGGAATTATGTCATTTCTCCCCAATATAATCTGGAACATGATGTCACTTGCAAGGAATTTAACTCTTTGTAGATAATGTCTGAATGGCTGAAACATTGTATGAGCTAGAAATATGTGGATTTGAACAGGTTTTGGTTGGAGTTTTTTTGTAATTGAAGATAGGTATAAGTGCAGGTGTCCGAATGGATGAAGAAAAGCGGTGTCTGCATATTTTCACCAACCGAGCATGCGGTGCAATTGGATCTGATTGGCCGAGTACGTGGATATCTTTCTGCTGAAAACTATTTCAATCAGTTGAAGGAGCAAGACCAGACTGGTAAGACATATGGTGCTCTCCTGAATTGCTACGTTCGGCAGCGGCAAGTGGAAAAAGCCCTCTCCCATTTGCAAAAAATGAAAGAGATGGGTTTTGCAACTTCACAGCTCACTTACAATGACATTATGTGTTTGTATACAAATGTTGGCCAGCATGACAAGGTCCCTGAGGTGCTAGCAGAGATGAAGGAGAAGAATGTTTCTCCTGACAACTTCAGCTACAGAATCTGCATAAATTCGTATGGTGCAAGACATGATCTTGAGGGGATGGAGAATGTATTGAAGGAGATGGAATCTCAACCTGATATTGTCATGGACTGGAACACTTATGCAGTAGTTGCAAACTTCTTTATAAAAGCGGGTCTTACTGATAAGGCGGTTAATGCCTTGAGAAAATCAGAAGAGAGACTGAAGAGTAAGGATAGAATTGGCCATAACCATCTGATCTCGCTTTATGCGACCTTAGGGAACAAGGAAAAGGTGTTGAGATTGTGGAATCTGGATAAAACTGCTACTACGAGATTCATCAATAGGGACTACATCACAATGCTTGAATCTCTGGTGAGACTAGGTGAACTTGAAGAAGCTGAAAAAGTGCTGAAAGAGTGGGAATCATCTGAGAATTGCTATGATTTTCGAGTTCCTAACACTGTCATTATTGAATATATTGACAAGGGAATGTGTGAGAGAGCCGAAACCCTGCTTGAAGACTTGATGGAGAAAGGAAAGGCTACCACACCAAACAGTTGGGGTGCTGTGGCTGTTCAATATCTGGACCGGGGTGAGACCGAAAAAGCTGTAGAGTGCATGAAGGCAGCCCTTTCTCTAAACATGGATAAAGGATGGAAGCCTAATTTTCGGGTGATCACAGGTGTATTGAATTGGCTTGGTGATAAGGGCATTATAGAAGAAGTAGAAGCTTTTGTAGGCGCATTGAGGTCTGTCATTCCAGTGAACAGAGAGATGTATCATGCCTTGATAAAGGTTCATATAAGAGGTGGTAAAGAAGTAAATGAACTGTTAAATCAAATGAAGTCTGATAAAATAGATGAAGATGAAGAAACAAAGAAAATTCTTGGCACTTGGGAAGAAACAACTAAAGGTAAGAGCATTGACTGA

mRNA sequence

ATGGATCAGAAGCTCTTCTCCAAAGCCTTAACACGCTATGCTATAGCTAGCCGATCTTACCACACGAATCGGTTGAAAAAGGCGACTCTATATGCCAAAATTAGTCCACTGGGCGATCCTAGCATCAGCGTGGAGCCGGAGCTCGACGGTTGGGTTCAAGAGGGGAAGAAGGTACGAGTCGCTGAGCTTCAGAGAATCATTCACGACCTTCGCAAGCGCAAGCGTTTTACCCAAGCTCTTGAGGTGTCCGAATGGATGAAGAAAAGCGGTGTCTGCATATTTTCACCAACCGAGCATGCGGTGCAATTGGATCTGATTGGCCGAGTACGTGGATATCTTTCTGCTGAAAACTATTTCAATCAGTTGAAGGAGCAAGACCAGACTGGTAAGACATATGGTGCTCTCCTGAATTGCTACGTTCGGCAGCGGCAAGTGGAAAAAGCCCTCTCCCATTTGCAAAAAATGAAAGAGATGGGTTTTGCAACTTCACAGCTCACTTACAATGACATTATGTGTTTGTATACAAATGTTGGCCAGCATGACAAGGTCCCTGAGGTGCTAGCAGAGATGAAGGAGAAGAATGTTTCTCCTGACAACTTCAGCTACAGAATCTGCATAAATTCGTATGGTGCAAGACATGATCTTGAGGGGATGGAGAATGTATTGAAGGAGATGGAATCTCAACCTGATATTGTCATGGACTGGAACACTTATGCAGTAGTTGCAAACTTCTTTATAAAAGCGGGTCTTACTGATAAGGCGGTTAATGCCTTGAGAAAATCAGAAGAGAGACTGAAGAGTAAGGATAGAATTGGCCATAACCATCTGATCTCGCTTTATGCGACCTTAGGGAACAAGGAAAAGGTGTTGAGATTGTGGAATCTGGATAAAACTGCTACTACGAGATTCATCAATAGGGACTACATCACAATGCTTGAATCTCTGGTGAGACTAGGTGAACTTGAAGAAGCTGAAAAAGTGCTGAAAGAGTGGGAATCATCTGAGAATTGCTATGATTTTCGAGTTCCTAACACTGTCATTATTGAATATATTGACAAGGGAATGTGTGAGAGAGCCGAAACCCTGCTTGAAGACTTGATGGAGAAAGGAAAGGCTACCACACCAAACAGTTGGGGTGCTGTGGCTGTTCAATATCTGGACCGGGGTGAGACCGAAAAAGCTGTAGAGTGCATGAAGGCAGCCCTTTCTCTAAACATGGATAAAGGATGGAAGCCTAATTTTCGGGTGATCACAGGTGTATTGAATTGGCTTGGTGATAAGGGCATTATAGAAGAAGTAGAAGCTTTTGTAGGCGCATTGAGGTCTGTCATTCCAGTGAACAGAGAGATGTATCATGCCTTGATAAAGGTTCATATAAGAGGTGGTAAAGAAGTAAATGAACTGTTAAATCAAATGAAGTCTGATAAAATAGATGAAGATGAAGAAACAAAGAAAATTCTTGGCACTTGGGAAGAAACAACTAAAGGTAAGAGCATTGACTGA

Coding sequence (CDS)

ATGGATCAGAAGCTCTTCTCCAAAGCCTTAACACGCTATGCTATAGCTAGCCGATCTTACCACACGAATCGGTTGAAAAAGGCGACTCTATATGCCAAAATTAGTCCACTGGGCGATCCTAGCATCAGCGTGGAGCCGGAGCTCGACGGTTGGGTTCAAGAGGGGAAGAAGGTACGAGTCGCTGAGCTTCAGAGAATCATTCACGACCTTCGCAAGCGCAAGCGTTTTACCCAAGCTCTTGAGGTGTCCGAATGGATGAAGAAAAGCGGTGTCTGCATATTTTCACCAACCGAGCATGCGGTGCAATTGGATCTGATTGGCCGAGTACGTGGATATCTTTCTGCTGAAAACTATTTCAATCAGTTGAAGGAGCAAGACCAGACTGGTAAGACATATGGTGCTCTCCTGAATTGCTACGTTCGGCAGCGGCAAGTGGAAAAAGCCCTCTCCCATTTGCAAAAAATGAAAGAGATGGGTTTTGCAACTTCACAGCTCACTTACAATGACATTATGTGTTTGTATACAAATGTTGGCCAGCATGACAAGGTCCCTGAGGTGCTAGCAGAGATGAAGGAGAAGAATGTTTCTCCTGACAACTTCAGCTACAGAATCTGCATAAATTCGTATGGTGCAAGACATGATCTTGAGGGGATGGAGAATGTATTGAAGGAGATGGAATCTCAACCTGATATTGTCATGGACTGGAACACTTATGCAGTAGTTGCAAACTTCTTTATAAAAGCGGGTCTTACTGATAAGGCGGTTAATGCCTTGAGAAAATCAGAAGAGAGACTGAAGAGTAAGGATAGAATTGGCCATAACCATCTGATCTCGCTTTATGCGACCTTAGGGAACAAGGAAAAGGTGTTGAGATTGTGGAATCTGGATAAAACTGCTACTACGAGATTCATCAATAGGGACTACATCACAATGCTTGAATCTCTGGTGAGACTAGGTGAACTTGAAGAAGCTGAAAAAGTGCTGAAAGAGTGGGAATCATCTGAGAATTGCTATGATTTTCGAGTTCCTAACACTGTCATTATTGAATATATTGACAAGGGAATGTGTGAGAGAGCCGAAACCCTGCTTGAAGACTTGATGGAGAAAGGAAAGGCTACCACACCAAACAGTTGGGGTGCTGTGGCTGTTCAATATCTGGACCGGGGTGAGACCGAAAAAGCTGTAGAGTGCATGAAGGCAGCCCTTTCTCTAAACATGGATAAAGGATGGAAGCCTAATTTTCGGGTGATCACAGGTGTATTGAATTGGCTTGGTGATAAGGGCATTATAGAAGAAGTAGAAGCTTTTGTAGGCGCATTGAGGTCTGTCATTCCAGTGAACAGAGAGATGTATCATGCCTTGATAAAGGTTCATATAAGAGGTGGTAAAGAAGTAAATGAACTGTTAAATCAAATGAAGTCTGATAAAATAGATGAAGATGAAGAAACAAAGAAAATTCTTGGCACTTGGGAAGAAACAACTAAAGGTAAGAGCATTGACTGA

Protein sequence

MDQKLFSKALTRYAIASRSYHTNRLKKATLYAKISPLGDPSISVEPELDGWVQEGKKVRVAELQRIIHDLRKRKRFTQALEVSEWMKKSGVCIFSPTEHAVQLDLIGRVRGYLSAENYFNQLKEQDQTGKTYGALLNCYVRQRQVEKALSHLQKMKEMGFATSQLTYNDIMCLYTNVGQHDKVPEVLAEMKEKNVSPDNFSYRICINSYGARHDLEGMENVLKEMESQPDIVMDWNTYAVVANFFIKAGLTDKAVNALRKSEERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTATTRFINRDYITMLESLVRLGELEEAEKVLKEWESSENCYDFRVPNTVIIEYIDKGMCERAETLLEDLMEKGKATTPNSWGAVAVQYLDRGETEKAVECMKAALSLNMDKGWKPNFRVITGVLNWLGDKGIIEEVEAFVGALRSVIPVNREMYHALIKVHIRGGKEVNELLNQMKSDKIDEDEETKKILGTWEETTKGKSID
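The protein sequence above should follow from the CDS by ordinary codon translation. The sketch below illustrates this on the first and last few codons of the CDS. It assumes the standard nuclear genetic code (the page does not say which translation table was used), and `translate` is an illustrative helper rather than a function of any particular library.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon: end of the polypeptide
            break
        protein.append(residue)
    return "".join(protein)


# First seven codons and last four codons of the CDS shown above.
print(translate("ATGGATCAGAAGCTCTTCTCC"))
print(translate("AGCATTGACTGA"))
```

The two fragments translate to MDQKLFS and SID, which agree with the start and end of the listed protein sequence.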
Homology
BLAST of HG10021209 vs. NCBI nr
Match: XP_038893646.1 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial [Benincasa hispida] >XP_038893647.1 pentatricopeptide repeat-containing protein At4g21705, mitochondrial [Benincasa hispida])

HSP 1 Score: 946.0 bits (2444), Expect = 1.3e-271
Identity = 471/496 (94.96%), Positives = 482/496 (97.18%), Query Frame = 0
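Each HSP summary line reports identities and positives as counts over the alignment length, followed by a percentage. The sketch below parses such a line and recomputes the percentages from the counts as a consistency check. The regular expression and the `parse_hsp_stats` name are assumptions made for illustration; they target the line layout used on this page only.

```python
import re

# Accepts "Positives" as well as the "Postives" spelling seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP summary line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, aligned = int(m.group(1)), int(m.group(2))
    positive = int(m.group(4))
    return {
        "identical": identical,
        "positive": positive,
        "aligned": aligned,
        "pct_identity": round(100 * identical / aligned, 2),
        "pct_positive": round(100 * positive / aligned, 2),
    }


stats = parse_hsp_stats(
    "Identity = 471/496 (94.96%), Positives = 482/496 (97.18%), Query Frame = 0"
)
print(stats)
```

For this HSP the recomputed values, 94.96% identity and 97.18% positives, match the figures printed in the report.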

Query: 1   MDQKLFSKALTRYAIASRSYHTNRLKKATLYAKISPLGDPSISVEPELDGWVQEGKKVRV 60
           MDQKL SK LTRYAIASRSYHTNRLKKATLYAKISPLGDPSISVEPELD WVQEGKKVRV
Sbjct: 1   MDQKLLSKVLTRYAIASRSYHTNRLKKATLYAKISPLGDPSISVEPELDCWVQEGKKVRV 60

Query: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKSGVCIFSPTEHAVQLDLIGRVRGYLSAENYFN 120
           AELQRIIHDLRKRKRFTQALEVSEWMKK+GVCIFSPTEHAVQLDLIGRVRGYLSAE+YF+
Sbjct: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKNGVCIFSPTEHAVQLDLIGRVRGYLSAESYFS 120

Query: 121 QLKEQDQTGKTYGALLNCYVRQRQVEKALSHLQKMKEMGFATSQLTYNDIMCLYTNVGQH 180
           QL EQDQTGKTYGALLNCYVRQRQVEK+LSHLQKMKEMGFATSQLTYNDIMCLYTNVGQH
Sbjct: 121 QLNEQDQTGKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSQLTYNDIMCLYTNVGQH 180

Query: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGARHDLEGMENVLKEMESQPDIVMDWNTYAV 240
           DKVPEVLAEMKEKNVSPDNFSYRICINSYGARHDLEGMENVLKEMESQP IVMDWNTYAV
Sbjct: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGARHDLEGMENVLKEMESQPHIVMDWNTYAV 240

Query: 241 VANFFIKAGLTDKAVNALRKSEERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTAT 300
           VANFFIKAGLTDKAVNALRKSEERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKT T
Sbjct: 241 VANFFIKAGLTDKAVNALRKSEERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTGT 300

Query: 301 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSENCYDFRVPNTVIIEYIDKGMCERAE 360
           TRFINRDYITMLESLVRLGELEEAEKVLKEWESS NCYDFRVPNTVI+ YIDKGMCERAE
Sbjct: 301 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360

Query: 361 TLLEDLMEKGKATTPNSWGAVAVQYLDRGETEKAVECMKAALSLNMDKGWKPNFRVITGV 420
           TLLEDLMEK KATTPNSWGAVAV+YLD+GE +KAVECMKAALSLNMDKGWKPN RVIT V
Sbjct: 361 TLLEDLMEKEKATTPNSWGAVAVKYLDQGENKKAVECMKAALSLNMDKGWKPNLRVITSV 420

Query: 421 LNWLGDKGIIEEVEAFVGALRSVIPVNREMYHALIKVHIRGGKEVNELLNQMKSDKIDED 480
           LNWLGDKGIIEEVEAFV ALRSVIPVNREMYHALIKV+IR GKEVNELLNQMKSDKIDED
Sbjct: 421 LNWLGDKGIIEEVEAFVSALRSVIPVNREMYHALIKVYIRAGKEVNELLNQMKSDKIDED 480

Query: 481 EETKKILGTWEETTKG 497
           EET+KILGTWEETT+G
Sbjct: 481 EETQKILGTWEETTEG 496
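The identity count in the HSP summary is the number of alignment columns in which query and subject carry the same residue. A minimal sketch of that count for a single 60-residue block follows, using the first Query/Sbjct pair of the alignment above; `count_identities` is an illustrative helper, and treating `-` as the gap character is an assumption about the report format.

```python
def count_identities(query, sbjct):
    """Count identical, non-gap columns in two equal-length aligned strings."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have the same length")
    identical = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    return identical, len(query)


# First alignment block (query positions 1-60) against XP_038893646.1.
query = "MDQKLFSKALTRYAIASRSYHTNRLKKATLYAKISPLGDPSISVEPELDGWVQEGKKVRV"
sbjct = "MDQKLLSKVLTRYAIASRSYHTNRLKKATLYAKISPLGDPSISVEPELDCWVQEGKKVRV"

identical, aligned = count_identities(query, sbjct)
print(f"{identical}/{aligned} identical")
```

This block has 57 identical columns out of 60, with the three differences at query positions 6, 9, and 50. Summing the per-block counts over the whole alignment gives the numerator of the reported identity.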

BLAST of HG10021209 vs. NCBI nr
Match: XP_022994385.1 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 [Cucurbita maxima])

HSP 1 Score: 915.6 bits (2365), Expect = 1.8e-262
Identity = 451/498 (90.56%), Positives = 477/498 (95.78%), Query Frame = 0

Query: 1   MDQKLFSKALTRYAIASRSYHTNRLKKATLYAKISPLGDPSISVEPELDGWVQEGKKVRV 60
           MDQ  FSKALTRYA+A R YHTNRLKKATLYAKISPLGDP++SVEPELDGWV+EGKKVR+
Sbjct: 1   MDQ-FFSKALTRYALADRFYHTNRLKKATLYAKISPLGDPNVSVEPELDGWVKEGKKVRI 60

Query: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKSGVCIFSPTEHAVQLDLIGRVRGYLSAENYFN 120
           AELQRIIHDLRKRKRFTQALEVSEWMKK+GVCIFSP+EHAVQLDLIGRVRGYLSAE+YFN
Sbjct: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFN 120

Query: 121 QLKEQDQTGKTYGALLNCYVRQRQVEKALSHLQKMKEMGFATSQLTYNDIMCLYTNVGQH 180
           QLKEQDQT KTYGALLNCYVRQRQVEK+LSHLQKMKEMGFATS+LTYND+MCLYTNVGQH
Sbjct: 121 QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQH 180

Query: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGARHDLEGMENVLKEMESQPDIVMDWNTYAV 240
           DKVPEVLAEMKEKNVSPDNFSYRICINSYGAR DLEGMENVLKEMESQP IVMDWNTYAV
Sbjct: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAV 240

Query: 241 VANFFIKAGLTDKAVNALRKSEERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTAT 300
           VANFFIKA L DKAV+AL+K+EERLKSKDRIGHNHLISLY TLGNKEKVLRLWNLDKT T
Sbjct: 241 VANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLYTTLGNKEKVLRLWNLDKTDT 300

Query: 301 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSENCYDFRVPNTVIIEYIDKGMCERAE 360
           TRFINRDYITMLESLVRLGELEEAEKVLKEWESS NCYDFRVPNTVI+ YIDKGMCERAE
Sbjct: 301 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360

Query: 361 TLLEDLMEKGKATTPNSWGAVAVQYLDRGETEKAVECMKAALSLNMDKGWKPNFRVITGV 420
            LLEDLMEKGK TTPNSWGAVAVQY+DRGETEK+VECMKAAL+LNMDKGWKPN RVITG+
Sbjct: 361 ALLEDLMEKGKTTTPNSWGAVAVQYMDRGETEKSVECMKAALTLNMDKGWKPNLRVITGI 420

Query: 421 LNWLGDKGIIEEVEAFVGALRSVIPVNREMYHALIKVHIRGGKEVNELLNQMKSDKIDED 480
           LNWLG+   IEEVEAFVG+LRS IPVNREMYHAL+KVHIRGGKEV+ELLNQMKSDKIDED
Sbjct: 421 LNWLGENASIEEVEAFVGSLRSAIPVNREMYHALMKVHIRGGKEVHELLNQMKSDKIDED 480

Query: 481 EETKKILGTWEETTKGKS 499
           EETKKILGT +ETT+G+S
Sbjct: 481 EETKKILGTGQETTEGRS 497

BLAST of HG10021209 vs. NCBI nr
Match: XP_023542644.1 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 911.0 bits (2353), Expect = 4.5e-261
Identity = 448/495 (90.51%), Positives = 475/495 (95.96%), Query Frame = 0

Query: 1   MDQKLFSKALTRYAIASRSYHTNRLKKATLYAKISPLGDPSISVEPELDGWVQEGKKVRV 60
           MDQ  FSKALTRYA+A R YHTNRLKKATLYAKISPLGDPS+SVEPELDGWV+EGKKVR+
Sbjct: 1   MDQ-FFSKALTRYALAGRFYHTNRLKKATLYAKISPLGDPSVSVEPELDGWVKEGKKVRI 60

Query: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKSGVCIFSPTEHAVQLDLIGRVRGYLSAENYFN 120
           AELQRIIHDLRKRKRFTQALEVSEWMKK+GVCIFSP+EHAVQLDLIGRVRGYLSAE+YFN
Sbjct: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFN 120

Query: 121 QLKEQDQTGKTYGALLNCYVRQRQVEKALSHLQKMKEMGFATSQLTYNDIMCLYTNVGQH 180
           QLKEQDQT KTYGALLNCYVRQRQVEK+LSHLQKMKEMGFATS+LT+ND+MCLYTNVGQH
Sbjct: 121 QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTFNDMMCLYTNVGQH 180

Query: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGARHDLEGMENVLKEMESQPDIVMDWNTYAV 240
           DKVPEVLAEMKEKN+SPDNFSYRICINSYGAR DLEGMENVLKEMESQP IVMDWNTYAV
Sbjct: 181 DKVPEVLAEMKEKNISPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAV 240

Query: 241 VANFFIKAGLTDKAVNALRKSEERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTAT 300
           VANFFIKA LT+KAV+ALRK+EERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKT  
Sbjct: 241 VANFFIKADLTEKAVDALRKAEERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTDA 300

Query: 301 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSENCYDFRVPNTVIIEYIDKGMCERAE 360
           TRFINRDYITMLESLVRLGELEEAEKVLKEWESS NCYDFRVPNTVI+ YIDKGMCERAE
Sbjct: 301 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360

Query: 361 TLLEDLMEKGKATTPNSWGAVAVQYLDRGETEKAVECMKAALSLNMDKGWKPNFRVITGV 420
            LLEDLMEKGK TTPNSWGAVAVQY+DRGETEK+VECMKAAL+LNMDKGWKPN RVITG+
Sbjct: 361 ALLEDLMEKGKTTTPNSWGAVAVQYMDRGETEKSVECMKAALTLNMDKGWKPNLRVITGI 420

Query: 421 LNWLGDKGIIEEVEAFVGALRSVIPVNREMYHALIKVHIRGGKEVNELLNQMKSDKIDED 480
           LNWLG+   IEEVEAFVG+LRSVIPVNREMYHAL+K HIRGGKEV+ELLNQMKSDK+DED
Sbjct: 421 LNWLGENASIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVHELLNQMKSDKLDED 480

Query: 481 EETKKILGTWEETTK 496
           EETKKILGT +ETT+
Sbjct: 481 EETKKILGTGQETTE 494

BLAST of HG10021209 vs. NCBI nr
Match: KAG7012430.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 908.3 bits (2346), Expect = 2.9e-260
Identity = 448/495 (90.51%), Positives = 473/495 (95.56%), Query Frame = 0

Query: 1   MDQKLFSKALTRYAIASRSYHTNRLKKATLYAKISPLGDPSISVEPELDGWVQEGKKVRV 60
           MDQ  FSKALTRYA+A R YHTNRLKKATLYAKISPLGDPS+SVEP LDGWV+EGKKVR+
Sbjct: 1   MDQ-FFSKALTRYALAGRFYHTNRLKKATLYAKISPLGDPSVSVEPVLDGWVKEGKKVRI 60

Query: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKSGVCIFSPTEHAVQLDLIGRVRGYLSAENYFN 120
           AELQRIIHDLRKRKRFTQALEVSEWMKK+GVCIFSP+EHAVQLDLIGRVRGYLSAE+YFN
Sbjct: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFN 120

Query: 121 QLKEQDQTGKTYGALLNCYVRQRQVEKALSHLQKMKEMGFATSQLTYNDIMCLYTNVGQH 180
           QLKEQDQT KTYGALLNCYVRQRQVEK+LSHLQKMKEMGFATS+LTYND+MCLYTNVGQH
Sbjct: 121 QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQH 180

Query: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGARHDLEGMENVLKEMESQPDIVMDWNTYAV 240
           DKVPEVLAEMKEKNVSPDNFSYRICINSYGAR DLEGMENVLKEMESQP IVMDWNTYAV
Sbjct: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAV 240

Query: 241 VANFFIKAGLTDKAVNALRKSEERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTAT 300
           VANFFIKA L DKAV+AL+K+EERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKT T
Sbjct: 241 VANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTDT 300

Query: 301 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSENCYDFRVPNTVIIEYIDKGMCERAE 360
           TR INRDYITMLESLVRLGELEEAEKVLKEWESS NCYDFRVPNTVI+ YIDKGMCERAE
Sbjct: 301 TRLINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360

Query: 361 TLLEDLMEKGKATTPNSWGAVAVQYLDRGETEKAVECMKAALSLNMDKGWKPNFRVITGV 420
            LLEDLMEKGK TTPNSWGAVAVQY+DR ETEK+VECMKAAL+LNMDKGWKPN RVITG+
Sbjct: 361 ALLEDLMEKGKTTTPNSWGAVAVQYMDRSETEKSVECMKAALTLNMDKGWKPNLRVITGI 420

Query: 421 LNWLGDKGIIEEVEAFVGALRSVIPVNREMYHALIKVHIRGGKEVNELLNQMKSDKIDED 480
           LNWLG+ G IEEVEAFVG+LRSVIPVNREMYHAL+K HIRGGKEV+ELLNQMKSDK+DED
Sbjct: 421 LNWLGENGSIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVHELLNQMKSDKLDED 480

Query: 481 EETKKILGTWEETTK 496
           EETKKILGT +ETT+
Sbjct: 481 EETKKILGTGQETTE 494

BLAST of HG10021209 vs. NCBI nr
Match: XP_022954890.1 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 [Cucurbita moschata])

HSP 1 Score: 907.5 bits (2344), Expect = 5.0e-260
Identity = 449/499 (89.98%), Positives = 473/499 (94.79%), Query Frame = 0

Query: 1   MDQKLFSKALTRYAIASRSYHTNRLKKATLYAKISPLGDPSISVEPELDGWVQEGKKVRV 60
           MDQ  FSKALTRYA+A R YHTNRLKKATLYAKISPLGDPS+SVEP LDGWV+EGKKVR+
Sbjct: 1   MDQ-FFSKALTRYALAGRFYHTNRLKKATLYAKISPLGDPSVSVEPVLDGWVKEGKKVRI 60

Query: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKSGVCIFSPTEHAVQLDLIGRVRGYLSAENYFN 120
           AELQRIIHDLRKRKRFTQALEVSEWMKK+GVCIFSP+EHAVQLDLIGRVRGYLSAE+YFN
Sbjct: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFN 120

Query: 121 QLKEQDQTGKTYGALLNCYVRQRQVEKALSHLQKMKEMGFATSQLTYNDIMCLYTNVGQH 180
           QLKEQDQT KTYGALLNCYVRQRQVEK+LSHLQKMKEMGFATS+LTYND+MCLYTNVGQH
Sbjct: 121 QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQH 180

Query: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGARHDLEGMENVLKEMESQPDIVMDWNTYAV 240
           DKVPEVLAEMKEKNVSPDNFSYRICINSYGAR DLEGMENVLKEMESQP IVMDWNTYAV
Sbjct: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAV 240

Query: 241 VANFFIKAGLTDKAVNALRKSEERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTAT 300
           VANFFIKA L DKAV+AL+K+EERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKT T
Sbjct: 241 VANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTDT 300

Query: 301 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSENCYDFRVPNTVIIEYIDKGMCERAE 360
           TR INRDYITMLESLVRLGELEEAEKVLKEWESS NCYDFRVPNTVI+ YIDKGMCERAE
Sbjct: 301 TRLINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360

Query: 361 TLLEDLMEKGKATTPNSWGAVAVQYLDRGETEKAVECMKAALSLNMDKGWKPNFRVITGV 420
            LLEDLMEKGK TTPN WGAVAVQY+DR ETEK+VECMKAAL+LNMDKGWKPN RVITG+
Sbjct: 361 ALLEDLMEKGKTTTPNCWGAVAVQYMDRSETEKSVECMKAALTLNMDKGWKPNLRVITGI 420

Query: 421 LNWLGDKGIIEEVEAFVGALRSVIPVNREMYHALIKVHIRGGKEVNELLNQMKSDKIDED 480
           LNWLG+   IEEVEAFVG+LRSVIPVNREMYHAL+K HIRGGKEV+ELLNQMKSDKIDED
Sbjct: 421 LNWLGENASIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVHELLNQMKSDKIDED 480

Query: 481 EETKKILGTWEETTKGKSI 500
           EETKKILGT +ETT+G  I
Sbjct: 481 EETKKILGTGQETTEGDLI 498

BLAST of HG10021209 vs. ExPASy Swiss-Prot
Match: Q84JR3 (Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At4g21705 PE=2 SV=1)

HSP 1 Score: 611.7 bits (1576), Expect = 7.5e-174
Identity = 298/477 (62.47%), Positives = 373/477 (78.20%), Query Frame = 0

Query: 15  IASRSYHTNRLKKATLYAKISPLGDPSISVEPELDGWVQEGKKVRVAELQRIIHDLRKRK 74
           IASR Y+TNR+KK TLY+KISPLGDP  SV PEL  WVQ GKKV VAEL RI+HDLR+RK
Sbjct: 12  IASRYYYTNRVKKTTLYSKISPLGDPKSSVYPELQNWVQCGKKVSVAELIRIVHDLRRRK 71

Query: 75  RFTQALEVSEWMKKSGVCIFSPTEHAVQLDLIGRVRGYLSAENYFNQLKEQDQTGKTYGA 134
           RF  ALEVS+WM ++GVC+FSPTEHAV LDLIGRV G+++AE YF  LKEQ +  KTYGA
Sbjct: 72  RFLHALEVSKWMNETGVCVFSPTEHAVHLDLIGRVYGFVTAEEYFENLKEQYKNDKTYGA 131

Query: 135 LLNCYVRQRQVEKALSHLQKMKEMGFATSQLTYNDIMCLYTNVGQHDKVPEVLAEMKEKN 194
           LLNCYVRQ+ VEK+L H +KMKEMGF TS LTYN+IMCLYTN+GQH+KVP+VL EMKE+N
Sbjct: 132 LLNCYVRQQNVEKSLLHFEKMKEMGFVTSSLTYNNIMCLYTNIGQHEKVPKVLEEMKEEN 191

Query: 195 VSPDNFSYRICINSYGARHDLEGMENVLKEMESQPDIVMDWNTYAVVANFFIKAGLTDKA 254
           V+PDN+SYRICIN++GA +DLE +   L++ME + DI MDWNTYAV A F+I  G  D+A
Sbjct: 192 VAPDNYSYRICINAFGAMYDLERIGGTLRDMERRQDITMDWNTYAVAAKFYIDGGDCDRA 251

Query: 255 VNALRKSEERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTATTRFINRDYITMLES 314
           V  L+ SE RL+ KD  G+NHLI+LYA LG K +VLRLW+L+K    R IN+DY+T+L+S
Sbjct: 252 VELLKMSENRLEKKDGEGYNHLITLYARLGKKIEVLRLWDLEKDVCKRRINQDYLTVLQS 311

Query: 315 LVRLGELEEAEKVLKEWESSENCYDFRVPNTVIIEYIDKGMCERAETLLEDLMEKGKATT 374
           LV++  L EAE+VL EW+SS NCYDFRVPNTVI  YI K M E+AE +LEDL  +GKATT
Sbjct: 312 LVKIDALVEAEEVLTEWKSSGNCYDFRVPNTVIRGYIGKSMEEKAEAMLEDLARRGKATT 371

Query: 375 PNSWGAVAVQYLDRGETEKAVECMKAALSLNM-DKGWKPNFRVITGVLNWLGDKGIIEEV 434
           P SW  VA  Y ++G  E A +CMK AL + +  + W+P   ++T VL+W+GD+G ++EV
Sbjct: 372 PESWELVATAYAEKGTLENAFKCMKTALGVEVGSRKWRPGLTLVTSVLSWVGDEGSLKEV 431

Query: 435 EAFVGALRSVIPVNREMYHALIKVHIR-GGKEVNELLNQMKSDKIDEDEETKKILGT 490
           E+FV +LR+ I VN++MYHAL+K  IR GG+ ++ LL +MK DKI+ DEET  IL T
Sbjct: 432 ESFVASLRNCIGVNKQMYHALVKADIREGGRNIDTLLQRMKDDKIEIDEETTVILST 488

BLAST of HG10021209 vs. ExPASy Swiss-Prot
Match: Q8LPS6 (Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana OX=3702 GN=At1g02150 PE=2 SV=2)

HSP 1 Score: 293.9 bits (751), Expect = 3.4e-78
Identity = 160/452 (35.40%), Positives = 263/452 (58.19%), Query Frame = 0

Query: 30  LYAKISPLGDPSISVEPELDGWVQEGKKVRVAELQRIIHDLRKRKRFTQALEVSEWMKKS 89
           +Y KIS +  P +     L+ W + G+K+   EL R++ +LRK KR  QALEV +WM   
Sbjct: 69  IYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDWMNNR 128

Query: 90  GVCI-FSPTEHAVQLDLIGRVRGYLSAENYFNQLKEQDQTGKTYGALLNCYVRQRQVEKA 149
           G     S ++ A+QLDLIG+VRG   AE +F QL E  +  + YG+LLN YVR +  EKA
Sbjct: 129 GERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKSREKA 188

Query: 150 LSHLQKMKEMGFATSQLTYNDIMCLYTNVGQHDKVPEVLAEMKEKNVSPDNFSYRICINS 209
            + L  M++ G+A   L +N +M LY N+ ++DKV  ++ EMK+K++  D +SY I ++S
Sbjct: 189 EALLNTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKDIRLDIYSYNIWLSS 248

Query: 210 YGARHDLEGMENVLKEMESQPDIVMDWNTYAVVANFFIKAGLTDKAVNALRKSEERLKSK 269
            G+   +E ME V ++M+S   I  +W T++ +A  +IK G T+KA +ALRK E R+  +
Sbjct: 249 CGSLGSVEKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEARITGR 308

Query: 270 DRIGHNHLISLYATLGNKEKVLRLWNLDKTATTRFINRDYITMLESLVRLGELEEAEKVL 329
           +RI +++L+SLY +LGNK+++ R+W++ K+      N  Y  ++ SLVR+G++E AEKV 
Sbjct: 309 NRIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDIEGAEKVY 368

Query: 330 KEWESSENCYDFRVPNTVIIEYIDKGMCERAETLLEDLMEKGKATTPNSWGAVAVQYLDR 389
           +EW   ++ YD R+PN ++  Y+     E AE L + ++E G   + ++W  +AV +  +
Sbjct: 369 EEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEILAVGHTRK 428

Query: 390 GETEKAVECMKAALSLNMDKGWKPNFRVITGVLNWLGDKGIIEEVEAFVGALRSVIPVNR 449
               +A+ C++ A S      W+P   +++G      ++  +   EA +  LR    +  
Sbjct: 429 RCISEALTCLRNAFSAEGSSNWRPKVLMLSGFFKLCEEESDVTSKEAVLELLRQSGDLED 488

Query: 450 EMYHALIKVHIRGGKEVNELLNQMKSDKIDED 481
           + Y ALI V      + N  +N  + D  + D
Sbjct: 489 KSYLALIDV------DENRTVNNSEIDAHETD 514

BLAST of HG10021209 vs. ExPASy Swiss-Prot
Match: Q9SKU6 (Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At2g20710 PE=2 SV=1)

HSP 1 Score: 292.0 bits (746), Expect = 1.3e-77
Identity = 151/397 (38.04%), Positives = 243/397 (61.21%), Query Frame = 0

Query: 29  TLYAKISPLGDPSISVEPELDGWVQEGKKVRVAELQRIIHDLRKRKRFTQALEVSEWMKK 88
           TL  +++  GDPS S+   LDGW+ +G  V+ +EL  II  LRK  RF+ AL++S+WM +
Sbjct: 39  TLQRRVARSGDPSASIIKVLDGWLDQGNLVKTSELHSIIKMLRKFSRFSHALQISDWMSE 98

Query: 89  SGVCIFSPTEHAVQLDLIGRVRGYLSAENYFNQLKEQDQTGKTYGALLNCYVRQRQVEKA 148
             V   S  + A++LDLI +V G   AE +F  +  + +    YGALLNCY  ++ + KA
Sbjct: 99  HRVHEISEGDVAIRLDLIAKVGGLGEAEKFFETIPMERRNYHLYGALLNCYASKKVLHKA 158

Query: 149 LSHLQKMKEMGFATSQLTYNDIMCLYTNVGQHDKVPEVLAEMKEKNVSPDNFSYRICINS 208
               Q+MKE+GF    L YN ++ LY   G++  V ++L EM+++ V PD F+    +++
Sbjct: 159 EQVFQEMKELGFLKGCLPYNVMLNLYVRTGKYTMVEKLLREMEDETVKPDIFTVNTRLHA 218

Query: 209 YGARHDLEGMENVLKEMESQPDIVMDWNTYAVVANFFIKAGLTDKAVNALRKSEERLKS- 268
           Y    D+EGME  L   E+   + +DW TYA  AN +IKAGLT+KA+  LRKSE+ + + 
Sbjct: 219 YSVVSDVEGMEKFLMRCEADQGLHLDWRTYADTANGYIKAGLTEKALEMLRKSEQMVNAQ 278

Query: 269 KDRIGHNHLISLYATLGNKEKVLRLWNLDKTATTRFINRDYITMLESLVRLGELEEAEKV 328
           K +  +  L+S Y   G KE+V RLW+L K     F N  YI+++ +L+++ ++EE EK+
Sbjct: 279 KRKHAYEVLMSFYGAAGKKEEVYRLWSLYK-ELDGFYNTGYISVISALLKMDDIEEVEKI 338

Query: 329 LKEWESSENCYDFRVPNTVIIEYIDKGMCERAETLLEDLMEKGKATTPNSWGAVAVQYLD 388
           ++EWE+  + +D R+P+ +I  Y  KGM E+AE ++  L++K +    ++W  +A+ Y  
Sbjct: 339 MEEWEAGHSLFDIRIPHLLITGYCKKGMMEKAEEVVNILVQKWRVEDTSTWERLALGYKM 398

Query: 389 RGETEKAVECMKAALSLNMDKGWKPNFRVITGVLNWL 425
            G+ EKAVE  K A+ ++   GW+P+  V+   +++L
Sbjct: 399 AGKMEKAVEKWKRAIEVS-KPGWRPHQVVLMSCVDYL 433

BLAST of HG10021209 vs. ExPASy Swiss-Prot
Match: Q93WC5 (Pentatricopeptide repeat-containing protein At4g01990, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At4g01990 PE=2 SV=1)

HSP 1 Score: 273.1 bits (697), Expect = 6.3e-72
Identity = 153/473 (32.35%), Positives = 265/473 (56.03%), Query Frame = 0

Query: 16  ASRSYHTNRLKKATLYAKISPLGD-PSISVEPELDGWVQEGKKVRVAELQRIIHDLRKRK 75
           A+ S  T   K  ++Y K+S LG      +E  L+ +V EG  V+  +L R   DLRK +
Sbjct: 27  AAASVPTKAKKHRSIYKKLSSLGTRGGGKMEETLNQFVMEGVPVKKHDLIRYAKDLRKFR 86

Query: 76  RFTQALEVSEWMKKSGVCIFSPTEHAVQLDLIGRVRGYLSAENYFNQLKEQDQTGKTYGA 135
           +  +ALE+ EWM++  +  F+ ++HA++L+LI + +G  +AE YFN L +  +   TYG+
Sbjct: 87  QPQRALEIFEWMERKEIA-FTGSDHAIRLNLIAKSKGLEAAETYFNSLDDSIKNQSTYGS 146

Query: 136 LLNCYVRQRQVEKALSHLQKMKEMGFATSQLTYNDIMCLYTNVGQHDKVPEVLAEMKEKN 195
           LLNCY  +++  KA +H + M ++   ++ L +N++M +Y  +GQ +KVP ++  MKEK+
Sbjct: 147 LLNCYCVEKEEVKAKAHFENMVDLNHVSNSLPFNNLMAMYMGLGQPEKVPALVVAMKEKS 206

Query: 196 VSPDNFSYRICINSYGARHDLEGMENVLKEMESQPDIVMDWNTYAVVANFFIKAGLTDKA 255
           ++P + +Y + I S G+  DL+G+E VL EM+++ + +  WNT+A +A  +IK GL  KA
Sbjct: 207 ITPCDITYSMWIQSCGSLKDLDGVEKVLDEMKAEGEGIFSWNTFANLAAIYIKVGLYGKA 266

Query: 256 VNALRKSEERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTATTRFINRDYITMLES 315
             AL+  E  +    R  ++ LI+LY  + N  +V R+W+L K       N  Y+TML +
Sbjct: 267 EEALKSLENNMNPDVRDCYHFLINLYTGIANASEVYRVWDLLKKRYPNVNNSSYLTMLRA 326

Query: 316 LVRLGELEEAEKVLKEWESSENCYDFRVPNTVIIEYIDKGMCERAETLLEDLMEKGKATT 375
           L +L +++  +KV  EWES+   YD R+ N  I  Y+ + M E AE +    M+K K   
Sbjct: 327 LSKLDDIDGVKKVFAEWESTCWTYDMRMANVAISSYLKQNMYEEAEAVFNGAMKKCKGQF 386

Query: 376 PNSWGAVAVQYLDRGETEKAVECMKAALSLNMDKGWKPNFRVITGVLNWLGDKGIIEEVE 435
             +   + +  L   + + A++  +AA+ L+ DK W  +  +I+       +   ++  E
Sbjct: 387 SKARQLLMMHLLKNDQADLALKHFEAAV-LDQDKNWTWSSELISSFFLHFEEAKDVDGAE 446

Query: 436 AFVGALRSVIPVNREMYHALIKVHIRGGKEVNELLNQMKSDKIDEDEETKKIL 488
            F   L    P++ E Y  L+K ++  GK   ++  +++   I  DEE + +L
Sbjct: 447 EFCKTLTKWSPLSSETYTLLMKTYLAAGKACPDMKKRLEEQGILVDEEQECLL 497

BLAST of HG10021209 vs. ExPASy Swiss-Prot
Match: O22714 (Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana OX=3702 GN=At1g60770 PE=1 SV=1)

HSP 1 Score: 270.4 bits (690), Expect = 4.1e-71
Identity = 145/469 (30.92%), Positives = 258/469 (55.01%), Query Frame = 0

Query: 22  TNRLKKATLYAKISPLGDPSISVEPELDGWVQEGKKVRVAELQRIIHDLRKRKRFTQALE 81
           T +  +  LY ++   G   + V  +L+ +++  K V   E+   I  LR R  +  AL+
Sbjct: 17  TKKYIEEPLYNRLFKDGGTEVKVRQQLNQFLKGTKHVFKWEVGDTIKKLRNRGLYYPALK 76

Query: 82  VSEWMKKSGVCIFSPTEHAVQLDLIGRVRGYLSAENYFNQLKEQDQTGKTYGALLNCYVR 141
           +SE M++ G+   + ++ A+ LDL+ + R   + ENYF  L E  +T  TYG+LLNCY +
Sbjct: 77  LSEVMEERGM-NKTVSDQAIHLDLVAKAREITAGENYFVDLPETSKTELTYGSLLNCYCK 136

Query: 142 QRQVEKALSHLQKMKEMGFATSQLTYNDIMCLYTNVGQHDKVPEVLAEMKEKNVSPDNFS 201
           +   EKA   L KMKE+    S ++YN +M LYT  G+ +KVP ++ E+K +NV PD+++
Sbjct: 137 ELLTEKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGETEKVPAMIQELKAENVMPDSYT 196

Query: 202 YRICINSYGARHDLEGMENVLKEMESQPDIVMDWNTYAVVANFFIKAGLTDKAVNALRKS 261
           Y + + +  A +D+ G+E V++EM     +  DW TY+ +A+ ++ AGL+ KA  AL++ 
Sbjct: 197 YNVWMRALAATNDISGVERVIEEMNRDGRVAPDWTTYSNMASIYVDAGLSQKAEKALQEL 256

Query: 262 EERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTATTRFINRDYITMLESLVRLGEL 321
           E +   +D   +  LI+LY  LG   +V R+W   + A  +  N  Y+ M++ LV+L +L
Sbjct: 257 EMKNTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTSNVAYLNMIQVLVKLNDL 316

Query: 322 EEAEKVLKEWESSENCYDFRVPNTVIIEYIDKGMCERAETLLEDLMEKGKATTPNSWGAV 381
             AE + KEW+++ + YD R+ N +I  Y  +G+ ++A  L E    +G      +W   
Sbjct: 317 PGAETLFKEWQANCSTYDIRIVNVLIGAYAQEGLIQKANELKEKAPRRGGKLNAKTWEIF 376

Query: 382 AVQYLDRGETEKAVECMKAALSLNMDKG--WKPNFRVITGVLNWLGDKGIIEEVEAFVGA 441
              Y+  G+  +A+ECM  A+S+    G  W P+   +  ++++   K  +   E  +  
Sbjct: 377 MDYYVKSGDMARALECMSKAVSIGKGDGGKWLPSPETVRALMSYFEQKKDVNGAENLLEI 436

Query: 442 LRS-VIPVNREMYHALIKVHIRGGKEVNELLNQMKSDKIDEDEETKKIL 488
           L++    +  E++  LI+ +   GK    +  ++K + ++ +E TKK+L
Sbjct: 437 LKNGTDNIGAEIFEPLIRTYAAAGKSHPAMRRRLKMENVEVNEATKKLL 484

BLAST of HG10021209 vs. ExPASy TrEMBL
Match: A0A6J1K124 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111490120 PE=4 SV=1)

HSP 1 Score: 915.6 bits (2365), Expect = 8.9e-263
Identity = 451/498 (90.56%), Positives = 477/498 (95.78%), Query Frame = 0

Query: 1   MDQKLFSKALTRYAIASRSYHTNRLKKATLYAKISPLGDPSISVEPELDGWVQEGKKVRV 60
           MDQ  FSKALTRYA+A R YHTNRLKKATLYAKISPLGDP++SVEPELDGWV+EGKKVR+
Sbjct: 1   MDQ-FFSKALTRYALADRFYHTNRLKKATLYAKISPLGDPNVSVEPELDGWVKEGKKVRI 60

Query: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKSGVCIFSPTEHAVQLDLIGRVRGYLSAENYFN 120
           AELQRIIHDLRKRKRFTQALEVSEWMKK+GVCIFSP+EHAVQLDLIGRVRGYLSAE+YFN
Sbjct: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFN 120

Query: 121 QLKEQDQTGKTYGALLNCYVRQRQVEKALSHLQKMKEMGFATSQLTYNDIMCLYTNVGQH 180
           QLKEQDQT KTYGALLNCYVRQRQVEK+LSHLQKMKEMGFATS+LTYND+MCLYTNVGQH
Sbjct: 121 QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQH 180

Query: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGARHDLEGMENVLKEMESQPDIVMDWNTYAV 240
           DKVPEVLAEMKEKNVSPDNFSYRICINSYGAR DLEGMENVLKEMESQP IVMDWNTYAV
Sbjct: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAV 240

Query: 241 VANFFIKAGLTDKAVNALRKSEERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTAT 300
           VANFFIKA L DKAV+AL+K+EERLKSKDRIGHNHLISLY TLGNKEKVLRLWNLDKT T
Sbjct: 241 VANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLYTTLGNKEKVLRLWNLDKTDT 300

Query: 301 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSENCYDFRVPNTVIIEYIDKGMCERAE 360
           TRFINRDYITMLESLVRLGELEEAEKVLKEWESS NCYDFRVPNTVI+ YIDKGMCERAE
Sbjct: 301 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360

Query: 361 TLLEDLMEKGKATTPNSWGAVAVQYLDRGETEKAVECMKAALSLNMDKGWKPNFRVITGV 420
            LLEDLMEKGK TTPNSWGAVAVQY+DRGETEK+VECMKAAL+LNMDKGWKPN RVITG+
Sbjct: 361 ALLEDLMEKGKTTTPNSWGAVAVQYMDRGETEKSVECMKAALTLNMDKGWKPNLRVITGI 420

Query: 421 LNWLGDKGIIEEVEAFVGALRSVIPVNREMYHALIKVHIRGGKEVNELLNQMKSDKIDED 480
           LNWLG+   IEEVEAFVG+LRS IPVNREMYHAL+KVHIRGGKEV+ELLNQMKSDKIDED
Sbjct: 421 LNWLGENASIEEVEAFVGSLRSAIPVNREMYHALMKVHIRGGKEVHELLNQMKSDKIDED 480

Query: 481 EETKKILGTWEETTKGKS 499
           EETKKILGT +ETT+G+S
Sbjct: 481 EETKKILGTGQETTEGRS 497

BLAST of HG10021209 vs. ExPASy TrEMBL
Match: A0A6J1GTN5 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111457015 PE=4 SV=1)

HSP 1 Score: 907.5 bits (2344), Expect = 2.4e-260
Identity = 449/499 (89.98%), Positives = 473/499 (94.79%), Query Frame = 0

Query: 1   MDQKLFSKALTRYAIASRSYHTNRLKKATLYAKISPLGDPSISVEPELDGWVQEGKKVRV 60
           MDQ  FSKALTRYA+A R YHTNRLKKATLYAKISPLGDPS+SVEP LDGWV+EGKKVR+
Sbjct: 1   MDQ-FFSKALTRYALAGRFYHTNRLKKATLYAKISPLGDPSVSVEPVLDGWVKEGKKVRI 60

Query: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKSGVCIFSPTEHAVQLDLIGRVRGYLSAENYFN 120
           AELQRIIHDLRKRKRFTQALEVSEWMKK+GVCIFSP+EHAVQLDLIGRVRGYLSAE+YFN
Sbjct: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFN 120

Query: 121 QLKEQDQTGKTYGALLNCYVRQRQVEKALSHLQKMKEMGFATSQLTYNDIMCLYTNVGQH 180
           QLKEQDQT KTYGALLNCYVRQRQVEK+LSHLQKMKEMGFATS+LTYND+MCLYTNVGQH
Sbjct: 121 QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQH 180

Query: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGARHDLEGMENVLKEMESQPDIVMDWNTYAV 240
           DKVPEVLAEMKEKNVSPDNFSYRICINSYGAR DLEGMENVLKEMESQP IVMDWNTYAV
Sbjct: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAV 240

Query: 241 VANFFIKAGLTDKAVNALRKSEERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTAT 300
           VANFFIKA L DKAV+AL+K+EERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKT T
Sbjct: 241 VANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTDT 300

Query: 301 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSENCYDFRVPNTVIIEYIDKGMCERAE 360
           TR INRDYITMLESLVRLGELEEAEKVLKEWESS NCYDFRVPNTVI+ YIDKGMCERAE
Sbjct: 301 TRLINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360

Query: 361 TLLEDLMEKGKATTPNSWGAVAVQYLDRGETEKAVECMKAALSLNMDKGWKPNFRVITGV 420
            LLEDLMEKGK TTPN WGAVAVQY+DR ETEK+VECMKAAL+LNMDKGWKPN RVITG+
Sbjct: 361 ALLEDLMEKGKTTTPNCWGAVAVQYMDRSETEKSVECMKAALTLNMDKGWKPNLRVITGI 420

Query: 421 LNWLGDKGIIEEVEAFVGALRSVIPVNREMYHALIKVHIRGGKEVNELLNQMKSDKIDED 480
           LNWLG+   IEEVEAFVG+LRSVIPVNREMYHAL+K HIRGGKEV+ELLNQMKSDKIDED
Sbjct: 421 LNWLGENASIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVHELLNQMKSDKIDED 480

Query: 481 EETKKILGTWEETTKGKSI 500
           EETKKILGT +ETT+G  I
Sbjct: 481 EETKKILGTGQETTEGDLI 498

BLAST of HG10021209 vs. ExPASy TrEMBL
Match: A0A6J1CGU2 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Momordica charantia OX=3673 GN=LOC111010721 PE=4 SV=1)

HSP 1 Score: 903.3 bits (2333), Expect = 4.6e-259
Identity = 442/499 (88.58%), Positives = 475/499 (95.19%), Query Frame = 0

Query: 1   MDQKLFSKALTRYAIASRSYHTNRLKKATLYAKISPLGDPSISVEPELDGWVQEGKKVRV 60
           MDQ LFSKALTRYA+A RSYHTNR+KKATLYAKISPLGDPSISV PELDGWVQEGKK+RV
Sbjct: 10  MDQNLFSKALTRYAMAGRSYHTNRMKKATLYAKISPLGDPSISVGPELDGWVQEGKKIRV 69

Query: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKSGVCIFSPTEHAVQLDLIGRVRGYLSAENYFN 120
           AELQRIIHDLRKRKRFTQALEVSEWMK+SGVCIFSP+EHAVQLDLIGRVRGYLSAE+YF+
Sbjct: 70  AELQRIIHDLRKRKRFTQALEVSEWMKQSGVCIFSPSEHAVQLDLIGRVRGYLSAESYFD 129

Query: 121 QLKEQDQTGKTYGALLNCYVRQRQVEKALSHLQKMKEMGFATSQLTYNDIMCLYTNVGQH 180
           QLK+QD+TGKTYGALLNCYVRQRQV+K+LSHLQKMKEMGFATS+LTYND+MCLYTNVGQH
Sbjct: 130 QLKDQDKTGKTYGALLNCYVRQRQVDKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQH 189

Query: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGARHDLEGMENVLKEMESQPDIVMDWNTYAV 240
           DKVP+VLAEMKE  VSPDNFSYRICINSYG R DLEGME+VLKEMESQP IVMDWNTYAV
Sbjct: 190 DKVPQVLAEMKENKVSPDNFSYRICINSYGTRCDLEGMESVLKEMESQPHIVMDWNTYAV 249

Query: 241 VANFFIKAGLTDKAVNALRKSEERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTAT 300
           VANFFIK GLTDKAV+ALRKSEERL SKDRIGHNHLISLYATLGNKE+VLRLW LDK+ +
Sbjct: 250 VANFFIKGGLTDKAVDALRKSEERLNSKDRIGHNHLISLYATLGNKEEVLRLWKLDKSDS 309

Query: 301 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSENCYDFRVPNTVIIEYIDKGMCERAE 360
           TRFINRDYITMLESLVRLGELEEAEKVLKEWESS NCYDFRVPNTVI+ YIDKGMCERAE
Sbjct: 310 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 369

Query: 361 TLLEDLMEKGKATTPNSWGAVAVQYLDRGETEKAVECMKAALSLNMDKGWKPNFRVITGV 420
            LLEDLM++GKATTPNSWGAVAVQYLDRGETEKAVECMK ALSL++DKGWKPN RVITG+
Sbjct: 370 ALLEDLMKEGKATTPNSWGAVAVQYLDRGETEKAVECMKTALSLHIDKGWKPNLRVITGI 429

Query: 421 LNWLGDKGIIEEVEAFVGALRSVIPVNREMYHALIKVHIRGGKEVNELLNQMKSDKIDED 480
           LNW+GD    EEVEAFVG+LRSVIPVNREMYHAL+K HIRGGKEV+ LL+QMKSD+IDED
Sbjct: 430 LNWIGDNSSTEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVHGLLSQMKSDQIDED 489

Query: 481 EETKKILGTWEETTKGKSI 500
           EETKKILGTW+E T+GKSI
Sbjct: 490 EETKKILGTWQEATEGKSI 508
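
The Identity and Positives figures in each HSP header can be recomputed from the fractions and from the alignment midline. The following is a minimal Python sketch, not part of the database output. It assumes the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch). The helper names `parse_hsp_header`, `percent` and `count_midline` are hypothetical.

```python
import re

# HSP header text as printed for the A0A6J1CGU2 match.
# The regex also tolerates the "Postives" spelling.
HEADER = "Identity = 442/499 (88.58%), Positives = 475/499 (95.19%), Query Frame = 0"


def parse_hsp_header(line):
    """Return {'identity': (matches, length), 'positives': (matches, length)}."""
    pattern = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")
    stats = {}
    for label, matches, length in pattern.findall(line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = (int(matches), int(length))
    return stats


def percent(matches, length):
    """Percentage rounded to two decimals, as shown in the header."""
    return round(100.0 * matches / length, 2)


def count_midline(midline):
    """Count (identities, positives) in one midline segment.

    Assumption: letters are identities and '+' marks a conservative
    substitution; positives include both.
    """
    identities = sum(ch.isalpha() for ch in midline)
    conservative = midline.count("+")
    return identities, identities + conservative


stats = parse_hsp_header(HEADER)
identity_pct = percent(*stats["identity"])
positives_pct = percent(*stats["positives"])

# Midline of the last segment (query residues 481 onward) of the same match.
segment_counts = count_midline("EETKKILGTW+E T+GKSI")

print(identity_pct, positives_pct, segment_counts)
```

Summing `count_midline` over every segment of an alignment should give the numerators reported in the header, provided the midlines are copied with their spacing intact.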

BLAST of HG10021209 vs. ExPASy TrEMBL
Match: A0A5A7UM45 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold861G00740 PE=4 SV=1)

HSP 1 Score: 901.7 bits (2329), Expect = 1.3e-258
Identity = 447/499 (89.58%), Positives = 473/499 (94.79%), Query Frame = 0

Query: 1   MDQKLFSKALTRYAIASRSYHTNRLKKATLYAKISPLGDPSISVEPELDGWVQEGKKVRV 60
           MDQKLFSKALT YA+ASRSYHT RLKKATLYAKISPLGDPSISVE ELDGWVQEGKKVRV
Sbjct: 1   MDQKLFSKALTHYALASRSYHTTRLKKATLYAKISPLGDPSISVESELDGWVQEGKKVRV 60

Query: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKSGVCIFSPTEHAVQLDLIGRVRGYLSAENYFN 120
           AELQRII D RKR RF+QAL+VSEWMKKSG CIFSPTEHAVQLDLIGRVRGYLSAE YFN
Sbjct: 61  AELQRIIRDFRKRSRFSQALQVSEWMKKSGACIFSPTEHAVQLDLIGRVRGYLSAEKYFN 120

Query: 121 QLKEQDQTGKTYGALLNCYVRQRQVEKALSHLQKMKEMGFATSQLTYNDIMCLYTNVGQH 180
           QLKEQDQ  KTYGALLNCYVRQ+QV+K+LSHLQKMKE+GFATS+LTYNDIMCLYT VGQH
Sbjct: 121 QLKEQDQNIKTYGALLNCYVRQQQVDKSLSHLQKMKELGFATSELTYNDIMCLYTRVGQH 180

Query: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGARHDLEGMENVLKEMESQPDIVMDWNTYAV 240
           +KVPEVLAEMK  NVSPDNFSYRICINSYGAR DLEGMENVLKEMESQP IVMDWNTYAV
Sbjct: 181 EKVPEVLAEMKGNNVSPDNFSYRICINSYGARKDLEGMENVLKEMESQPHIVMDWNTYAV 240

Query: 241 VANFFIKAGLTDKAVNALRKSEERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTAT 300
           VANFFIKAGLTDKAV+ALRKSEE+LKSKDRIGHNHLISLYATLGNKEKVLR+WNLDKTAT
Sbjct: 241 VANFFIKAGLTDKAVDALRKSEEKLKSKDRIGHNHLISLYATLGNKEKVLRVWNLDKTAT 300

Query: 301 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSENCYDFRVPNTVIIEYIDKGMCERAE 360
           TR INRDYITMLESLVRLGELEEAEKVLKEWESS NCYDFRVPNTVI+ YIDKGMCERAE
Sbjct: 301 TRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360

Query: 361 TLLEDLMEKGKATTPNSWGAVAVQYLDRGETEKAVECMKAALSLNMDKGWKPNFRVITGV 420
           TLLE+L +  KATTPNSWGAVAV+YLDRGETEKA+ECMKAALS+N DKGWKPN RVITGV
Sbjct: 361 TLLENLNQNEKATTPNSWGAVAVKYLDRGETEKALECMKAALSVNTDKGWKPNPRVITGV 420

Query: 421 LNWLGDKGIIEEVEAFVGALRSVIPVNREMYHALIKVHIRGGKEVNELLNQMKSDKIDED 480
           LNWLGDKGI+EEVEAFV ALRSVIPVNREMYHAL+KV+IR  KEVNE+LN+MK+DKI+ED
Sbjct: 421 LNWLGDKGIVEEVEAFVSALRSVIPVNREMYHALLKVYIRADKEVNEVLNKMKADKINED 480

Query: 481 EETKKILGTWEETTKGKSI 500
           EETKKILGTWEETT+GKSI
Sbjct: 481 EETKKILGTWEETTEGKSI 499

BLAST of HG10021209 vs. ExPASy TrEMBL
Match: A0A1S3B5N2 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Cucumis melo OX=3656 GN=LOC103486296 PE=4 SV=1)

HSP 1 Score: 901.7 bits (2329), Expect = 1.3e-258
Identity = 447/499 (89.58%), Positives = 473/499 (94.79%), Query Frame = 0

Query: 1   MDQKLFSKALTRYAIASRSYHTNRLKKATLYAKISPLGDPSISVEPELDGWVQEGKKVRV 60
           MDQKLFSKALT YA+ASRSYHT RLKKATLYAKISPLGDPSISVE ELDGWVQEGKKVRV
Sbjct: 1   MDQKLFSKALTHYALASRSYHTTRLKKATLYAKISPLGDPSISVESELDGWVQEGKKVRV 60

Query: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKSGVCIFSPTEHAVQLDLIGRVRGYLSAENYFN 120
           AELQRII D RKR RF+QAL+VSEWMKKSG CIFSPTEHAVQLDLIGRVRGYLSAE YFN
Sbjct: 61  AELQRIIRDFRKRSRFSQALQVSEWMKKSGACIFSPTEHAVQLDLIGRVRGYLSAEKYFN 120

Query: 121 QLKEQDQTGKTYGALLNCYVRQRQVEKALSHLQKMKEMGFATSQLTYNDIMCLYTNVGQH 180
           QLKEQDQ  KTYGALLNCYVRQ+QV+K+LSHLQKMKE+GFATS+LTYNDIMCLYT VGQH
Sbjct: 121 QLKEQDQNIKTYGALLNCYVRQQQVDKSLSHLQKMKELGFATSELTYNDIMCLYTRVGQH 180

Query: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGARHDLEGMENVLKEMESQPDIVMDWNTYAV 240
           +KVPEVLAEMK  NVSPDNFSYRICINSYGAR DLEGMENVLKEMESQP IVMDWNTYAV
Sbjct: 181 EKVPEVLAEMKGNNVSPDNFSYRICINSYGARKDLEGMENVLKEMESQPHIVMDWNTYAV 240

Query: 241 VANFFIKAGLTDKAVNALRKSEERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTAT 300
           VANFFIKAGLTDKAV+ALRKSEE+LKSKDRIGHNHLISLYATLGNKEKVLR+WNLDKTAT
Sbjct: 241 VANFFIKAGLTDKAVDALRKSEEKLKSKDRIGHNHLISLYATLGNKEKVLRVWNLDKTAT 300

Query: 301 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSENCYDFRVPNTVIIEYIDKGMCERAE 360
           TR INRDYITMLESLVRLGELEEAEKVLKEWESS NCYDFRVPNTVI+ YIDKGMCERAE
Sbjct: 301 TRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360

Query: 361 TLLEDLMEKGKATTPNSWGAVAVQYLDRGETEKAVECMKAALSLNMDKGWKPNFRVITGV 420
           TLLE+L +  KATTPNSWGAVAV+YLDRGETEKA+ECMKAALS+N DKGWKPN RVITGV
Sbjct: 361 TLLENLNQNEKATTPNSWGAVAVKYLDRGETEKALECMKAALSVNTDKGWKPNPRVITGV 420

Query: 421 LNWLGDKGIIEEVEAFVGALRSVIPVNREMYHALIKVHIRGGKEVNELLNQMKSDKIDED 480
           LNWLGDKGI+EEVEAFV ALRSVIPVNREMYHAL+KV+IR  KEVNE+LN+MK+DKI+ED
Sbjct: 421 LNWLGDKGIVEEVEAFVSALRSVIPVNREMYHALLKVYIRADKEVNEVLNKMKADKINED 480

Query: 481 EETKKILGTWEETTKGKSI 500
           EETKKILGTWEETT+GKSI
Sbjct: 481 EETKKILGTWEETTEGKSI 499

BLAST of HG10021209 vs. TAIR 10
Match: AT4G21705.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 611.7 bits (1576), Expect = 5.3e-175
Identity = 298/477 (62.47%), Positives = 373/477 (78.20%), Query Frame = 0

Query: 15  IASRSYHTNRLKKATLYAKISPLGDPSISVEPELDGWVQEGKKVRVAELQRIIHDLRKRK 74
           IASR Y+TNR+KK TLY+KISPLGDP  SV PEL  WVQ GKKV VAEL RI+HDLR+RK
Sbjct: 12  IASRYYYTNRVKKTTLYSKISPLGDPKSSVYPELQNWVQCGKKVSVAELIRIVHDLRRRK 71

Query: 75  RFTQALEVSEWMKKSGVCIFSPTEHAVQLDLIGRVRGYLSAENYFNQLKEQDQTGKTYGA 134
           RF  ALEVS+WM ++GVC+FSPTEHAV LDLIGRV G+++AE YF  LKEQ +  KTYGA
Sbjct: 72  RFLHALEVSKWMNETGVCVFSPTEHAVHLDLIGRVYGFVTAEEYFENLKEQYKNDKTYGA 131

Query: 135 LLNCYVRQRQVEKALSHLQKMKEMGFATSQLTYNDIMCLYTNVGQHDKVPEVLAEMKEKN 194
           LLNCYVRQ+ VEK+L H +KMKEMGF TS LTYN+IMCLYTN+GQH+KVP+VL EMKE+N
Sbjct: 132 LLNCYVRQQNVEKSLLHFEKMKEMGFVTSSLTYNNIMCLYTNIGQHEKVPKVLEEMKEEN 191

Query: 195 VSPDNFSYRICINSYGARHDLEGMENVLKEMESQPDIVMDWNTYAVVANFFIKAGLTDKA 254
           V+PDN+SYRICIN++GA +DLE +   L++ME + DI MDWNTYAV A F+I  G  D+A
Sbjct: 192 VAPDNYSYRICINAFGAMYDLERIGGTLRDMERRQDITMDWNTYAVAAKFYIDGGDCDRA 251

Query: 255 VNALRKSEERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTATTRFINRDYITMLES 314
           V  L+ SE RL+ KD  G+NHLI+LYA LG K +VLRLW+L+K    R IN+DY+T+L+S
Sbjct: 252 VELLKMSENRLEKKDGEGYNHLITLYARLGKKIEVLRLWDLEKDVCKRRINQDYLTVLQS 311

Query: 315 LVRLGELEEAEKVLKEWESSENCYDFRVPNTVIIEYIDKGMCERAETLLEDLMEKGKATT 374
           LV++  L EAE+VL EW+SS NCYDFRVPNTVI  YI K M E+AE +LEDL  +GKATT
Sbjct: 312 LVKIDALVEAEEVLTEWKSSGNCYDFRVPNTVIRGYIGKSMEEKAEAMLEDLARRGKATT 371

Query: 375 PNSWGAVAVQYLDRGETEKAVECMKAALSLNM-DKGWKPNFRVITGVLNWLGDKGIIEEV 434
           P SW  VA  Y ++G  E A +CMK AL + +  + W+P   ++T VL+W+GD+G ++EV
Sbjct: 372 PESWELVATAYAEKGTLENAFKCMKTALGVEVGSRKWRPGLTLVTSVLSWVGDEGSLKEV 431

Query: 435 EAFVGALRSVIPVNREMYHALIKVHIR-GGKEVNELLNQMKSDKIDEDEETKKILGT 490
           E+FV +LR+ I VN++MYHAL+K  IR GG+ ++ LL +MK DKI+ DEET  IL T
Sbjct: 432 ESFVASLRNCIGVNKQMYHALVKADIREGGRNIDTLLQRMKDDKIEIDEETTVILST 488

BLAST of HG10021209 vs. TAIR 10
Match: AT1G02150.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 293.9 bits (751), Expect = 2.5e-79
Identity = 160/452 (35.40%), Positives = 263/452 (58.19%), Query Frame = 0

Query: 30  LYAKISPLGDPSISVEPELDGWVQEGKKVRVAELQRIIHDLRKRKRFTQALEVSEWMKKS 89
           +Y KIS +  P +     L+ W + G+K+   EL R++ +LRK KR  QALEV +WM   
Sbjct: 69  IYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDWMNNR 128

Query: 90  GVCI-FSPTEHAVQLDLIGRVRGYLSAENYFNQLKEQDQTGKTYGALLNCYVRQRQVEKA 149
           G     S ++ A+QLDLIG+VRG   AE +F QL E  +  + YG+LLN YVR +  EKA
Sbjct: 129 GERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKSREKA 188

Query: 150 LSHLQKMKEMGFATSQLTYNDIMCLYTNVGQHDKVPEVLAEMKEKNVSPDNFSYRICINS 209
            + L  M++ G+A   L +N +M LY N+ ++DKV  ++ EMK+K++  D +SY I ++S
Sbjct: 189 EALLNTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKDIRLDIYSYNIWLSS 248

Query: 210 YGARHDLEGMENVLKEMESQPDIVMDWNTYAVVANFFIKAGLTDKAVNALRKSEERLKSK 269
            G+   +E ME V ++M+S   I  +W T++ +A  +IK G T+KA +ALRK E R+  +
Sbjct: 249 CGSLGSVEKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEARITGR 308

Query: 270 DRIGHNHLISLYATLGNKEKVLRLWNLDKTATTRFINRDYITMLESLVRLGELEEAEKVL 329
           +RI +++L+SLY +LGNK+++ R+W++ K+      N  Y  ++ SLVR+G++E AEKV 
Sbjct: 309 NRIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDIEGAEKVY 368

Query: 330 KEWESSENCYDFRVPNTVIIEYIDKGMCERAETLLEDLMEKGKATTPNSWGAVAVQYLDR 389
           +EW   ++ YD R+PN ++  Y+     E AE L + ++E G   + ++W  +AV +  +
Sbjct: 369 EEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEILAVGHTRK 428

Query: 390 GETEKAVECMKAALSLNMDKGWKPNFRVITGVLNWLGDKGIIEEVEAFVGALRSVIPVNR 449
               +A+ C++ A S      W+P   +++G      ++  +   EA +  LR    +  
Sbjct: 429 RCISEALTCLRNAFSAEGSSNWRPKVLMLSGFFKLCEEESDVTSKEAVLELLRQSGDLED 488

Query: 450 EMYHALIKVHIRGGKEVNELLNQMKSDKIDED 481
           + Y ALI V      + N  +N  + D  + D
Sbjct: 489 KSYLALIDV------DENRTVNNSEIDAHETD 514

BLAST of HG10021209 vs. TAIR 10
Match: AT2G20710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 292.0 bits (746), Expect = 9.3e-79
Identity = 151/397 (38.04%), Positives = 243/397 (61.21%), Query Frame = 0

Query: 29  TLYAKISPLGDPSISVEPELDGWVQEGKKVRVAELQRIIHDLRKRKRFTQALEVSEWMKK 88
           TL  +++  GDPS S+   LDGW+ +G  V+ +EL  II  LRK  RF+ AL++S+WM +
Sbjct: 39  TLQRRVARSGDPSASIIKVLDGWLDQGNLVKTSELHSIIKMLRKFSRFSHALQISDWMSE 98

Query: 89  SGVCIFSPTEHAVQLDLIGRVRGYLSAENYFNQLKEQDQTGKTYGALLNCYVRQRQVEKA 148
             V   S  + A++LDLI +V G   AE +F  +  + +    YGALLNCY  ++ + KA
Sbjct: 99  HRVHEISEGDVAIRLDLIAKVGGLGEAEKFFETIPMERRNYHLYGALLNCYASKKVLHKA 158

Query: 149 LSHLQKMKEMGFATSQLTYNDIMCLYTNVGQHDKVPEVLAEMKEKNVSPDNFSYRICINS 208
               Q+MKE+GF    L YN ++ LY   G++  V ++L EM+++ V PD F+    +++
Sbjct: 159 EQVFQEMKELGFLKGCLPYNVMLNLYVRTGKYTMVEKLLREMEDETVKPDIFTVNTRLHA 218

Query: 209 YGARHDLEGMENVLKEMESQPDIVMDWNTYAVVANFFIKAGLTDKAVNALRKSEERLKS- 268
           Y    D+EGME  L   E+   + +DW TYA  AN +IKAGLT+KA+  LRKSE+ + + 
Sbjct: 219 YSVVSDVEGMEKFLMRCEADQGLHLDWRTYADTANGYIKAGLTEKALEMLRKSEQMVNAQ 278

Query: 269 KDRIGHNHLISLYATLGNKEKVLRLWNLDKTATTRFINRDYITMLESLVRLGELEEAEKV 328
           K +  +  L+S Y   G KE+V RLW+L K     F N  YI+++ +L+++ ++EE EK+
Sbjct: 279 KRKHAYEVLMSFYGAAGKKEEVYRLWSLYK-ELDGFYNTGYISVISALLKMDDIEEVEKI 338

Query: 329 LKEWESSENCYDFRVPNTVIIEYIDKGMCERAETLLEDLMEKGKATTPNSWGAVAVQYLD 388
           ++EWE+  + +D R+P+ +I  Y  KGM E+AE ++  L++K +    ++W  +A+ Y  
Sbjct: 339 MEEWEAGHSLFDIRIPHLLITGYCKKGMMEKAEEVVNILVQKWRVEDTSTWERLALGYKM 398

Query: 389 RGETEKAVECMKAALSLNMDKGWKPNFRVITGVLNWL 425
            G+ EKAVE  K A+ ++   GW+P+  V+   +++L
Sbjct: 399 AGKMEKAVEKWKRAIEVS-KPGWRPHQVVLMSCVDYL 433

BLAST of HG10021209 vs. TAIR 10
Match: AT4G01990.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 273.1 bits (697), Expect = 4.5e-73
Identity = 153/473 (32.35%), Positives = 265/473 (56.03%), Query Frame = 0

Query: 16  ASRSYHTNRLKKATLYAKISPLGD-PSISVEPELDGWVQEGKKVRVAELQRIIHDLRKRK 75
           A+ S  T   K  ++Y K+S LG      +E  L+ +V EG  V+  +L R   DLRK +
Sbjct: 27  AAASVPTKAKKHRSIYKKLSSLGTRGGGKMEETLNQFVMEGVPVKKHDLIRYAKDLRKFR 86

Query: 76  RFTQALEVSEWMKKSGVCIFSPTEHAVQLDLIGRVRGYLSAENYFNQLKEQDQTGKTYGA 135
           +  +ALE+ EWM++  +  F+ ++HA++L+LI + +G  +AE YFN L +  +   TYG+
Sbjct: 87  QPQRALEIFEWMERKEIA-FTGSDHAIRLNLIAKSKGLEAAETYFNSLDDSIKNQSTYGS 146

Query: 136 LLNCYVRQRQVEKALSHLQKMKEMGFATSQLTYNDIMCLYTNVGQHDKVPEVLAEMKEKN 195
           LLNCY  +++  KA +H + M ++   ++ L +N++M +Y  +GQ +KVP ++  MKEK+
Sbjct: 147 LLNCYCVEKEEVKAKAHFENMVDLNHVSNSLPFNNLMAMYMGLGQPEKVPALVVAMKEKS 206

Query: 196 VSPDNFSYRICINSYGARHDLEGMENVLKEMESQPDIVMDWNTYAVVANFFIKAGLTDKA 255
           ++P + +Y + I S G+  DL+G+E VL EM+++ + +  WNT+A +A  +IK GL  KA
Sbjct: 207 ITPCDITYSMWIQSCGSLKDLDGVEKVLDEMKAEGEGIFSWNTFANLAAIYIKVGLYGKA 266

Query: 256 VNALRKSEERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTATTRFINRDYITMLES 315
             AL+  E  +    R  ++ LI+LY  + N  +V R+W+L K       N  Y+TML +
Sbjct: 267 EEALKSLENNMNPDVRDCYHFLINLYTGIANASEVYRVWDLLKKRYPNVNNSSYLTMLRA 326

Query: 316 LVRLGELEEAEKVLKEWESSENCYDFRVPNTVIIEYIDKGMCERAETLLEDLMEKGKATT 375
           L +L +++  +KV  EWES+   YD R+ N  I  Y+ + M E AE +    M+K K   
Sbjct: 327 LSKLDDIDGVKKVFAEWESTCWTYDMRMANVAISSYLKQNMYEEAEAVFNGAMKKCKGQF 386

Query: 376 PNSWGAVAVQYLDRGETEKAVECMKAALSLNMDKGWKPNFRVITGVLNWLGDKGIIEEVE 435
             +   + +  L   + + A++  +AA+ L+ DK W  +  +I+       +   ++  E
Sbjct: 387 SKARQLLMMHLLKNDQADLALKHFEAAV-LDQDKNWTWSSELISSFFLHFEEAKDVDGAE 446

Query: 436 AFVGALRSVIPVNREMYHALIKVHIRGGKEVNELLNQMKSDKIDEDEETKKIL 488
            F   L    P++ E Y  L+K ++  GK   ++  +++   I  DEE + +L
Sbjct: 447 EFCKTLTKWSPLSSETYTLLMKTYLAAGKACPDMKKRLEEQGILVDEEQECLL 497

BLAST of HG10021209 vs. TAIR 10
Match: AT1G60770.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 270.4 bits (690), Expect = 2.9e-72
Identity = 145/469 (30.92%), Positives = 258/469 (55.01%), Query Frame = 0

Query: 22  TNRLKKATLYAKISPLGDPSISVEPELDGWVQEGKKVRVAELQRIIHDLRKRKRFTQALE 81
           T +  +  LY ++   G   + V  +L+ +++  K V   E+   I  LR R  +  AL+
Sbjct: 17  TKKYIEEPLYNRLFKDGGTEVKVRQQLNQFLKGTKHVFKWEVGDTIKKLRNRGLYYPALK 76

Query: 82  VSEWMKKSGVCIFSPTEHAVQLDLIGRVRGYLSAENYFNQLKEQDQTGKTYGALLNCYVR 141
           +SE M++ G+   + ++ A+ LDL+ + R   + ENYF  L E  +T  TYG+LLNCY +
Sbjct: 77  LSEVMEERGM-NKTVSDQAIHLDLVAKAREITAGENYFVDLPETSKTELTYGSLLNCYCK 136

Query: 142 QRQVEKALSHLQKMKEMGFATSQLTYNDIMCLYTNVGQHDKVPEVLAEMKEKNVSPDNFS 201
           +   EKA   L KMKE+    S ++YN +M LYT  G+ +KVP ++ E+K +NV PD+++
Sbjct: 137 ELLTEKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGETEKVPAMIQELKAENVMPDSYT 196

Query: 202 YRICINSYGARHDLEGMENVLKEMESQPDIVMDWNTYAVVANFFIKAGLTDKAVNALRKS 261
           Y + + +  A +D+ G+E V++EM     +  DW TY+ +A+ ++ AGL+ KA  AL++ 
Sbjct: 197 YNVWMRALAATNDISGVERVIEEMNRDGRVAPDWTTYSNMASIYVDAGLSQKAEKALQEL 256

Query: 262 EERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTATTRFINRDYITMLESLVRLGEL 321
           E +   +D   +  LI+LY  LG   +V R+W   + A  +  N  Y+ M++ LV+L +L
Sbjct: 257 EMKNTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTSNVAYLNMIQVLVKLNDL 316

Query: 322 EEAEKVLKEWESSENCYDFRVPNTVIIEYIDKGMCERAETLLEDLMEKGKATTPNSWGAV 381
             AE + KEW+++ + YD R+ N +I  Y  +G+ ++A  L E    +G      +W   
Sbjct: 317 PGAETLFKEWQANCSTYDIRIVNVLIGAYAQEGLIQKANELKEKAPRRGGKLNAKTWEIF 376

Query: 382 AVQYLDRGETEKAVECMKAALSLNMDKG--WKPNFRVITGVLNWLGDKGIIEEVEAFVGA 441
              Y+  G+  +A+ECM  A+S+    G  W P+   +  ++++   K  +   E  +  
Sbjct: 377 MDYYVKSGDMARALECMSKAVSIGKGDGGKWLPSPETVRALMSYFEQKKDVNGAENLLEI 436

Query: 442 LRS-VIPVNREMYHALIKVHIRGGKEVNELLNQMKSDKIDEDEETKKIL 488
           L++    +  E++  LI+ +   GK    +  ++K + ++ +E TKK+L
Sbjct: 437 LKNGTDNIGAEIFEPLIRTYAAAGKSHPAMRRRLKMENVEVNEATKKLL 484

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038893646.1 | 1.3e-271 | 94.96 | pentatricopeptide repeat-containing protein At4g21705, mitochondrial [Benincasa ...
XP_022994385.1 | 1.8e-262 | 90.56 | pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 ...
XP_023542644.1 | 4.5e-261 | 90.51 | pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 ...
KAG7012430.1 | 2.9e-260 | 90.51 | Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a...
XP_022954890.1 | 5.0e-260 | 89.98 | pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 ...

Match Name | E-value | Identity (%) | Description
Q84JR3 | 7.5e-174 | 62.47 | Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidop...
Q8LPS6 | 3.4e-78 | 35.40 | Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana OX...
Q9SKU6 | 1.3e-77 | 38.04 | Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidop...
Q93WC5 | 6.3e-72 | 32.35 | Pentatricopeptide repeat-containing protein At4g01990, mitochondrial OS=Arabidop...
O22714 | 4.1e-71 | 30.92 | Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana OX...

Match Name | E-value | Identity (%) | Description
A0A6J1K124 | 8.9e-263 | 90.56 | pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 ...
A0A6J1GTN5 | 2.4e-260 | 89.98 | pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 ...
A0A6J1CGU2 | 4.6e-259 | 88.58 | pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Momordic...
A0A5A7UM45 | 1.3e-258 | 89.58 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3B5N2 | 1.3e-258 | 89.58 | pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Cucumis ...

Match Name | E-value | Identity (%) | Description
AT4G21705.1 | 5.3e-175 | 62.47 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G02150.1 | 2.5e-79 | 35.40 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G20710.1 | 9.3e-79 | 38.04 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G01990.1 | 4.5e-73 | 32.35 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G60770.1 | 2.9e-72 | 30.92 | Tetratricopeptide repeat (TPR)-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 165..209; e-value: 1.3E-8; score: 34.9
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 131..160; e-value: 1.0E-5; score: 25.5
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 308..333; e-value: 0.011; score: 15.9
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 308..334; e-value: 0.0016; score: 16.5
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 200..234; e-value: 0.0024; score: 15.9
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 131..160; e-value: 3.8E-6; score: 24.7
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 166..198; e-value: 1.8E-5; score: 22.6
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 163..197; score: 9.952918
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 128..162; score: 10.073492
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 189..301; e-value: 1.0E-14; score: 56.7
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 302..407; e-value: 3.7E-8; score: 35.2
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 51..188; e-value: 8.3E-16; score: 60.0
IPR011990 | Tetratricopeptide-like helical domain superfamily | SUPERFAMILY | 48452 | TPR-like | coord: 112..405
None | No IPR available | PANTHER | PTHR45717:SF20 | OS07G0598500 PROTEIN | coord: 15..489
None | No IPR available | PANTHER | PTHR45717 | OS12G0527900 PROTEIN | coord: 15..489
IPR019734 | Tetratricopeptide repeat | PROSITE | PS50005 | TPR | coord: 375..408; score: 8.4079
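
Each `coord` value is a `start..end` pair of residue positions on the predicted protein. As an illustration only, the short Python sketch below computes the span of the two PROSITE PS51375 hits listed above (128..162 and 163..197). It assumes the coordinates are 1-based and inclusive, and the helper name `span_length` is hypothetical.

```python
def span_length(coord):
    """Length in residues of a 'start..end' string, assuming inclusive ends."""
    start, end = (int(part) for part in coord.split(".."))
    return end - start + 1


# PROSITE PS51375 (PPR) hits copied from the InterPro annotations above.
prosite_ppr = ["128..162", "163..197"]
lengths = [span_length(c) for c in prosite_ppr]

# True when the second hit starts on the residue right after the first ends.
first_end = int(prosite_ppr[0].split("..")[1])
second_start = int(prosite_ppr[1].split("..")[0])
adjacent = second_start == first_end + 1

print(lengths, adjacent)
```

Under that assumption both hits span 35 residues and sit back to back, which matches the 35-residue motif that gives pentatricopeptide repeats their name.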

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
HG10021209.1 | HG10021209.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
cellular_component | GO:0005739 | mitochondrion
molecular_function | GO:0005515 | protein binding