HG10022849 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10022849
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pentatricopeptide repeat-containing protein
Location: Chr05: 28954153 .. 28961204 (+)
RNA-Seq Expression: HG10022849
Synteny: HG10022849
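
The Location field above can be turned into a genomic span with a few lines of Python. This is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re

def parse_location(text):
    """Split a location such as 'Chr05: 28954153 .. 28961204 (+)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Chr05: 28954153 .. 28961204 (+)")

# Assumption: 1-based inclusive coordinates, so both end points are counted.
span = end - start + 1
print(chrom, strand, span)
```

If the coordinates were instead 0-based half-open, the span would be `end - start`, so the convention is worth confirming before relying on the number.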
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTCCGCTCGTTGCGGCCTTCTCTAGCGACGGCCGCCGCCCGCAGATTTTCCGGGGAAGCCCACAAGGCGGCGGCGGAGAACACAGCACTAGAAGGCGGTGCCGACGTCGTCTCCGTCACAGGCGGTGGTCGAGACACGCTTGGACGAAGGCTCATGAGCCTCACTTTCCCCAAACGCAGCGCCGTGATTTCCATTCGAAAATGGCAAGAAGAGGGCCACACTCTTCGCAAGTACGAGCTCAATCGCATCGTTCGGGAGCTTCGCAAGCTCAAGCGCTACAAGCACGCACTTGAGGTATTCAATTCCCTTTTTCCCTTCTTATGTGAAAACTCATGTAATGTAATGTAACCAATGCGCGAGATCATGTATTATACTGATTTTTTTATCTGAAGGAGCTCATATTCTACTTGTGCTGTGAATGTGTTCTATTCACTCCATCTCCGCTTGAATGAGTTAGTATCAATTCAACTGATGTTCGTGTTGTTCATCATTTCTAGCATCTGTTTAATTTAGTCTCAGTCAAGTGGACTAGCTTTCGAAAGGGTATCCTGATTATATTGAATTGTACGATATCCAGAAGCTGATCATTTAAGGCAACTTTAATCATACGTTCTTTTAACACTTTGAGAGCTTGTATACTTGGAGTCCCTTTTTCGTAAGGGAGTATTTTTTTTTGCAGGCTTGGTTTTTTGTGTTTCCGTAGTTTTTTGTATGTCCGTGTATTCTTTCTTTTTTTATCCCTATGGGTGTTGTTCTTAAAAACAAAAAAGAAAATAAATAAAAAATAAAAACTTAACTTGGAGAGCTTGGAAATTAGGTCAAGATACTGTACATATGAAACCAAAGATATGACTGCAAAAAGGATGTTGCAGTGATTTGCTGGCCTTTCTTTAGTACTAGATTAAATGAAGACTATCAAAAAGCTTTGGCCCATTTGATAACTATTTCGCTATTGGTTTTGGGTTTTTGAAAATTAAGTCTATTTCCTCTCATTTTCTTACCATGATAAAAGAGTTGAATTCTTAGCTAAATTCCAAAAACAAAAACTACTTTTTTTAGTTTTAAAAAATTAGCTTGGTTTTTGTAAAAATTGGTAGAAAGTAGATAACAAAATAATAAATTTAGAGGTGGAAGAGGTGTTTGTAGGCTTAATTTTCAAAAACTAAAAACTAAAAAAACCAAATGGTTACCAAACGGGGCCTTAGATAGATAGGTTAGGGTGAACCCAGAGATACTGAAATATAATTTCAATGAATATTTTGCTTTATTTCCTATAAAAAGTAAATAGATGAATAAAATCCTCTGACATGACCTTGAACCTTGTAGATTGTATGATCTGAAGATGGCCTTGAGCCTTGAGTAATCTTTCTTGAGCATTTCCACCGTGCAGGTGTGTGAATGGATGACATTACAGAAAGATATGAAGTTGCTACCTGGTGACTATGCAGTTCATCTGGATTTGATTGCAAAAATCCGAGGCCTGAATAGCGCAGAAAAGTTTTTTGAAGATCTCCCTGATAAAATGAGAGATCAATCAGCCTGCACAGCTCTTCTTCACGTGTACGTTCAAAATAATCTATCTGAAAAGGCTGAGGCTTTAATGGAGAAAATGTCTGAATGTGGTTTCTTAAAAAGTCCTCTTTCTTTCAACCACATGCTATCTCTTCACATCTCAAACAAGCAACTAGAGAAGGTTCCTGATCTGATTCAAGTATTAAAGAAGAACACCAAACCAGATGTGGTAACATATAATCTTTTGTTGAATGTTTGCACTTTGCAAAATGACGTTGAAGCTGCAGAAAGCATTTTCCTTGAGATGAAGAAGACGAAAACCGAACCAGATTGGGTATCATTTAGCACATTAGCTAACTTGTATTCCAAAAAACAACTTACCGAAAAAGCAGCGTCTACGTTGAAGGAGATGGAGAAAATGGCATCTCAAAGAAACAGAATCTCATTTTCGTCTCTTCTTAGCTTGTATACCAATTTGGGGGATA
AGAATGGAGTTTACAGAATATGGAAAAAGATGAAGTCATTGTTTCGCAAGATGAGTGATAGTGAGTACACTTGCATGATATCCTCTCTTGTGAAACTTAATGAGCTTGGGGAAGCTGAGAAACTATATACCGAATGGGAGTCAGTATCCGGGACGGGTGATACTCGGGTTCCAAATATATTGCTTGCAGCGTATATCAACAAAAACCAAATGGAACAAGCCGAGAGTTTCTACAATCGGATGTCACTAAAAGGAATAGTTCCATCTTACACTACTTGGGAGCTCCTTACATGGGGTTATTTGAAAGAGAACCAGATGGAGAAAGTGCTGCATTTCTTCAAGAATGCAGTTGGCAGCGTGAAGAAATGGAATGCGGACGAGAGGTTGGTTAAAGGAGTCTGTAAGAAACTCGAGGAGCAGGGTAACATTGAAGGGGCAGAGCAGTTGTTGATTATTCTTAGGAATGCTGGTCATGTGGATACTGAGATATACAATTCTCTCTTGCGGACCTATGCAAAAGCTGGTAAAATGCCACTTGTTGTTGCTGAAAGAATGGAAATGGACAACGTTCAGTTGAACGACGAGACTCGAGAGTTTCTAAGGTTGACCAGCAAGATGTGTGTGAGTGAAGTTTCAAGCACTTTATTCTACAAAACTGATCAAACCAACTCAATTCAATCCGTTTGAAGTATACTCTTTTGGTTACTCAGCTTTCTGAAGTATCAAATCCGAAAGTTGCATATCAGATTGTCTTGGATAGTTTTTAGCTGGTTCATTTCGGTTTAAATGGTTGGATTTGTCTGTTAAGAGTCAGGTCATCTTTCTGATATGTTGGTTTGACCTTGGGCAAAACCTCGACCCATACCGGACTGCACACGCCCTTGCTTTTTACTAGGAACTTTCGGAATTTTCTCGGTCAGGTTGCAGAATCCTGAGATACATTTTCCCTGATGCAAGTAGCTGTTTTAGCTGTGTTTGCAAATGAAATCATTCATAATCAGCTAAATTCAGAAGCTGCCATGGCAAAAAGTTTTGATGTGCACCGGTCGGTACAACATGTTAAATCATTTCAAGTTCCCTAAATGTTATATTTAGATCTGAATTACTTGCTAGCATTGAATGTAGAATATAACATAACTTTGTTATTAATTAGCTATTTCAATGAAGTTGGCTGGAAAAGTGTGGGATTTCATGTTTTTGACAGCTTGATCATTGTTATTGGAGCACTTTTTCTGTTCTTTTATGGTAAAAAACTTTTGAGATTATAAATCAATGGATTAGATACTGAACTTATTAAAAGGTGAAGGCATTATGTGAGGAGTTTTTACTCTGATTTTTACATCCACAAGTTTGTCAGGTGGGATTGATTTTTTGGAGGCTTTAAAAGTGATTATCTGAAAATTTTAACTGTTTTTAGTTTTTGTTAAACCAATTATTAATTAATGGGAAGCAATTTACTTTTTAGATTTGGTTGGTTTTAATCTAGTTAAATAGCTATTGGGATTGATTTCTTAAAAAAATTGATATTCTAACCTACCTTTTAAATTTTCATTTTATTCTACATTTTCACTTCTAAAATTGAAATTTACCAAACTTTTTTTAATTTAATTATCAATGTATTTTGTTACAGCATATTTGACAACAATTATCCTTTACGTTACAACCTTCTCAAATTGAGTCTTAGAAGATTGGCATGCTAATTAAATAAGTGATTGAACTAGTAATGACTATGAGTCAAAGTGAGCATAATTCAACTAGTATAAAATGTGTGTAAATGATCAAAATATTTGAAATTCAAATCTCATCACTTGTTGAACTGAAAAATTTTAAAATATAGATAATAACTTGTAAGTGAATTTGAATAATTTGAATAGACTTTGGTTTCGAAGGTGTTTTTAGTCCATTCAAGTAATTTAGACCAAGTTTGTTGTGAAAAATGAGTTTTAAAAATTATGTAATTAACCAAAGTGCTTTTAGAATGAGCTTGGCATGTTTAGTGAT
TTTGATATTGTGAAAATCATCTTTGTCATTTTTAAATCATTTTGTGTATTTATTGATTCAAAATCTTTTTAAATAAAATTGCATGATATTGAAATTGATTTTAAAACGATAAAAGTGACACCATTTTGCGAATTCATGGATCATTTAGCTATGGCGAAATTATATAAAATACTCCTAAATTTTATATTTTGTGTAAAAAAAATAGTTATGAACTTTAACAAATTTCAATAATATTCTTGAACTTACAAAAAAAAAAAAAAAAAAAAAAAAAACTTTTGCAATTCAAATTGAAAAAACACTCTTGAACTCTCAAAATTTAAAGACTGAAGTATCCATCTTAAAATCTTTGGATCAAATATATATCAAATTCAAACTTCAAATATCAAAAGTGTATTTTACACCCTTTAACATCGACCTTCATTATTTTAAAGGATAATGAACAATATCATTATATTCAAAATTCATTGTAATTTAAAATAGATTTGTTAATTTCAATAAATATAAAGAGATTCGAACTACAGATTTTCTGGTTGTTAACACATGTGCATGTCAATTGAGTTATGTCCAATTTGACAAGAAATTACAATTTTGGAGAAACAACGGACAAACAAAAATATTTAAATACACTATATCCATATCGTATTATCTTTAAAAATGAGTTTAGTTTTACAATTTTGAAGTTAGAACACGTCTTACAATTTTGATTCTTCAATCTTCCATGATGTAATTGTTAAACTAAATTTTTGGGGTCAAACTATGAGTTTAGTTTTTCAACTATTAGGTTAACTTATTATTTTTATTCAACGTTCAAATATCAAGTTTGTTATATATATATCTTGTTGGTATTTTTTTTAGGTCTTATATTTAATATTTTTCAAAATCAATAAACATTTTTAAATATAAAATTAAATTTTGTGTTTTATATCTTATAGGACGATAAACTTTCTAATTTTCCTTCTAATAGATTTACGAATTTAAATATGCATCAATAGGTCAATATTGTATAGGTTATAAATTTGATTTCTATAATTCGGAGAAAATTAATTTTTTTTTTTGAAAAGAGAAAATTAAACTTTTATTTCAATCGTTTGATGAAATCTTCTAAATCATAAAACTAAATTCTGATTTTTAAAACCACGGTAAGTAAATTTTCAATTTTGTAATTTAACCTTTTTCTTTATGTACAAAAAAATTGATTGGCTAATAGATATATTCAGAATAAGCTAGCGATTTAATTCTTAAAAAATATGGCGTCTAGGGGCAATTACGGAAACTAAAAAAATGAAGAGGGTATAATTGAAATATCACAAATCAAACAGGTTGAAAACGAGAAATTTCCGAATATAAATATAGGGCACTTATCACTCTTCGAAAACCCTAGAGTTTGAGAAAGGCGGAAACCCTAGTTCAGCAATCGGAGCTTTTGCCGGCCTTTGCTCACTGAATTCAACAGCCATGGCCACTGCTAGGACCGTCAAGGACGTCTCTCCTCACGAGTTTGTCAAGGCCTATGCCGCTCATCTCAAGCGATCCGGAAAGGTATACATTTTCTTTTCTCAATTCGGCTTTTTCTGTTTTGATGGCTGAGAGAGCTAGATTTTCGATTCGACGGAGTATTATAGGATGTAGATTTCGAAAACAGAGTGAATGCTAAACTTGAAGTTCCAATGTTTCCGGCTCTGTCAAGGGGCGTGATCGATTCTGCATTGTTTCTTCTCTTTATTTCTTTATTTTGGCATCTATTGTAGTTTTTTTTTAATCAAACAAGGAGTTTGCAAGTCTTAGGCTGTAGACTTTTTGATTGATGTGCTACTAATTTTTGTTTTTTTTAATCTTAATTGGTTATCAAATTATCTGCTCGTGCTTTGAATATGCCTCATCTGTTGATTAAGTTTGTGGGTTGGGTACAATTTGTCAATTTGTTCTTGTTTTTCCTGCTTGTTCCGCATTATCTGAGAAATCGTGATTTTTTTTACTGGGTTTATACTTCTTATTGACTG
TTTCTGGGATCTTGTACATTTTGTAGATACGTTCTCAGTTAAGGAATTCTTTAACGTTTCACAGAGCTCCTATATTACTGGGATTTTAATGGCTTTCATGGTTTTTTTGGCCATATGGAAGAATTTCATGTAATTGTCCATTGGTCGTGATTTGGCTTTTCGCCCGCTACTGTAATATTTCACTCTGTCAATGAATGTTTCCTATTAAAGGAAATGAACAACTTAGAAGGCCTTTTCGATTCTCCCTGGTTTGGATTATCTGATTTTTTAATCCCATTTTTAGTAGTTATGCTGTTCTTACATTTTGATTGGTTAGAACTTTCTACCTTATATGTTCCTTGTTCACTGATAACTGATAAGTTTAACATCTGACTATATGATTTTTACTAATGAATCTATTATGCAAATGTGATTTTAACATATTTTTTCTGTAAATCATTGCTGAGCAGGTTGAACTTCCACCATGGGCGGACATTGTGAAAACTGCAAGGTTCAAAGAGCTCGCTCCATATGATGCGGATTGGTATTACGTGAGAGCTGGTTAGTTTCCCTTTTACTTCTTGTTGACTTCCTCGTATTCCATATAAAATGAGCTTAATTTTATTTCCATTTTGTATCTTCTGTTTGTGGGATTCTTATGATCGCCATTTCAAACTTTTCAGCATCCATGGCAAGGAAGATCTACTTGAGAGGAGGTCTTGGTGTTGGGGCATTTAAGCGGATTTATGGTGGAAGCAAGAGGAATGGAAGTCGCCCTCCACACTTTTGTGAAAGCAGTGGAGCCATTGCCCGTCACATTCTACAACAGTTGCAGGAGATGAACATTGTTGATGTGGACCCAAAGGGGTGAGTACAAGCTTCTTAAGTTTAGCTTTTTCATTCAATTATGACCATTGCTATTCAATACAACCCTGTGTTTGCTTGTTTTCAATGCCGGTTTCTAATCATTTGTAACAACAATGTTATGTCGGGTTTGCAGTGGAAGGAGAATTACTTCAAGTGGTCGACGAGACCTTGATCAAGTTGCTGGCCGGATTGTTGTTGCCCCTTGA

mRNA sequence

ATGTTCCGCTCGTTGCGGCCTTCTCTAGCGACGGCCGCCGCCCGCAGATTTTCCGGGGAAGCCCACAAGGCGGCGGCGGAGAACACAGCACTAGAAGGCGGTGCCGACGTCGTCTCCGTCACAGGCGGTGGTCGAGACACGCTTGGACGAAGGCTCATGAGCCTCACTTTCCCCAAACGCAGCGCCGTGATTTCCATTCGAAAATGGCAAGAAGAGGGCCACACTCTTCGCAAGTACGAGCTCAATCGCATCGTTCGGGAGCTTCGCAAGCTCAAGCGCTACAAGCACGCACTTGAGGTGTGTGAATGGATGACATTACAGAAAGATATGAAGTTGCTACCTGGTGACTATGCAGTTCATCTGGATTTGATTGCAAAAATCCGAGGCCTGAATAGCGCAGAAAAGTTTTTTGAAGATCTCCCTGATAAAATGAGAGATCAATCAGCCTGCACAGCTCTTCTTCACGTGTACGTTCAAAATAATCTATCTGAAAAGGCTGAGGCTTTAATGGAGAAAATGTCTGAATGTGGTTTCTTAAAAAGTCCTCTTTCTTTCAACCACATGCTATCTCTTCACATCTCAAACAAGCAACTAGAGAAGGTTCCTGATCTGATTCAAGTATTAAAGAAGAACACCAAACCAGATGTGGTAACATATAATCTTTTGTTGAATGTTTGCACTTTGCAAAATGACGTTGAAGCTGCAGAAAGCATTTTCCTTGAGATGAAGAAGACGAAAACCGAACCAGATTGGGTATCATTTAGCACATTAGCTAACTTGTATTCCAAAAAACAACTTACCGAAAAAGCAGCGTCTACGTTGAAGGAGATGGAGAAAATGGCATCTCAAAGAAACAGAATCTCATTTTCGTCTCTTCTTAGCTTGTATACCAATTTGGGGGATAAGAATGGAGTTTACAGAATATGGAAAAAGATGAAGTCATTGTTTCGCAAGATGAGTGATAGTGAGTACACTTGCATGATATCCTCTCTTGTGAAACTTAATGAGCTTGGGGAAGCTGAGAAACTATATACCGAATGGGAGTCAGTATCCGGGACGGGTGATACTCGGGTTCCAAATATATTGCTTGCAGCGTATATCAACAAAAACCAAATGGAACAAGCCGAGAGTTTCTACAATCGGATGTCACTAAAAGGAATAGTTCCATCTTACACTACTTGGGAGCTCCTTACATGGGGTTATTTGAAAGAGAACCAGATGGAGAAAGTGCTGCATTTCTTCAAGAATGCAGTTGGCAGCGTGAAGAAATGGAATGCGGACGAGAGGTTGGTTAAAGGAGTCTGTAAGAAACTCGAGGAGCAGGGTAACATTGAAGGGGCAGAGCAGTTGTTGATTATTCTTAGGAATGCTGGTCATGTGGATACTGAGATATACAATTCTCTCTTGCGGACCTATGCAAAAGCTGGTAAAATGCCACTTGTTGTTGCTGAAAGAATGGAAATGGACAACGTTCAGTTGAACGACGAGACTCGAGAGTTTCTAAGGTTGACCAGCAAGATGTGTGGCACTTATCACTCTTCGAAAACCCTAGAGTTTGAGAAAGGCGGAAACCCTAGTTCAGCAATCGGAGCTTTTGCCGGCCTTTGCTCACTGAATTCAACAGCCATGGCCACTGCTAGGACCGTCAAGGACGTCTCTCCTCACGAGTTTGTCAAGGCCTATGCCGCTCATCTCAAGCGATCCGGAAAGGTTGAACTTCCACCATGGGCGGACATTGTGAAAACTGCAAGGTTCAAAGAGCTCGCTCCATATGATGCGGATTGGTATTACGTGAGAGCTGCATCCATGGCAAGGAAGATCTACTTGAGAGGAGGTCTTGGTGTTGGGGCATTTAAGCGGATTTATGGTGGAAGCAAGAGGAATGGAAGTCGCCCTCCACACTTTTGTGAAAGCAGTGGAGCCATTGCCCGTCACATTCTACAACAGTTGCAGGAGATGAACATTGTTGATGTGGACCCAAAGGGTGGAAGGAGAATTAC
TTCAAGTGGTCGACGAGACCTTGATCAAGTTGCTGGCCGGATTGTTGTTGCCCCTTGA

Coding sequence (CDS)

ATGTTCCGCTCGTTGCGGCCTTCTCTAGCGACGGCCGCCGCCCGCAGATTTTCCGGGGAAGCCCACAAGGCGGCGGCGGAGAACACAGCACTAGAAGGCGGTGCCGACGTCGTCTCCGTCACAGGCGGTGGTCGAGACACGCTTGGACGAAGGCTCATGAGCCTCACTTTCCCCAAACGCAGCGCCGTGATTTCCATTCGAAAATGGCAAGAAGAGGGCCACACTCTTCGCAAGTACGAGCTCAATCGCATCGTTCGGGAGCTTCGCAAGCTCAAGCGCTACAAGCACGCACTTGAGGTGTGTGAATGGATGACATTACAGAAAGATATGAAGTTGCTACCTGGTGACTATGCAGTTCATCTGGATTTGATTGCAAAAATCCGAGGCCTGAATAGCGCAGAAAAGTTTTTTGAAGATCTCCCTGATAAAATGAGAGATCAATCAGCCTGCACAGCTCTTCTTCACGTGTACGTTCAAAATAATCTATCTGAAAAGGCTGAGGCTTTAATGGAGAAAATGTCTGAATGTGGTTTCTTAAAAAGTCCTCTTTCTTTCAACCACATGCTATCTCTTCACATCTCAAACAAGCAACTAGAGAAGGTTCCTGATCTGATTCAAGTATTAAAGAAGAACACCAAACCAGATGTGGTAACATATAATCTTTTGTTGAATGTTTGCACTTTGCAAAATGACGTTGAAGCTGCAGAAAGCATTTTCCTTGAGATGAAGAAGACGAAAACCGAACCAGATTGGGTATCATTTAGCACATTAGCTAACTTGTATTCCAAAAAACAACTTACCGAAAAAGCAGCGTCTACGTTGAAGGAGATGGAGAAAATGGCATCTCAAAGAAACAGAATCTCATTTTCGTCTCTTCTTAGCTTGTATACCAATTTGGGGGATAAGAATGGAGTTTACAGAATATGGAAAAAGATGAAGTCATTGTTTCGCAAGATGAGTGATAGTGAGTACACTTGCATGATATCCTCTCTTGTGAAACTTAATGAGCTTGGGGAAGCTGAGAAACTATATACCGAATGGGAGTCAGTATCCGGGACGGGTGATACTCGGGTTCCAAATATATTGCTTGCAGCGTATATCAACAAAAACCAAATGGAACAAGCCGAGAGTTTCTACAATCGGATGTCACTAAAAGGAATAGTTCCATCTTACACTACTTGGGAGCTCCTTACATGGGGTTATTTGAAAGAGAACCAGATGGAGAAAGTGCTGCATTTCTTCAAGAATGCAGTTGGCAGCGTGAAGAAATGGAATGCGGACGAGAGGTTGGTTAAAGGAGTCTGTAAGAAACTCGAGGAGCAGGGTAACATTGAAGGGGCAGAGCAGTTGTTGATTATTCTTAGGAATGCTGGTCATGTGGATACTGAGATATACAATTCTCTCTTGCGGACCTATGCAAAAGCTGGTAAAATGCCACTTGTTGTTGCTGAAAGAATGGAAATGGACAACGTTCAGTTGAACGACGAGACTCGAGAGTTTCTAAGGTTGACCAGCAAGATGTGTGGCACTTATCACTCTTCGAAAACCCTAGAGTTTGAGAAAGGCGGAAACCCTAGTTCAGCAATCGGAGCTTTTGCCGGCCTTTGCTCACTGAATTCAACAGCCATGGCCACTGCTAGGACCGTCAAGGACGTCTCTCCTCACGAGTTTGTCAAGGCCTATGCCGCTCATCTCAAGCGATCCGGAAAGGTTGAACTTCCACCATGGGCGGACATTGTGAAAACTGCAAGGTTCAAAGAGCTCGCTCCATATGATGCGGATTGGTATTACGTGAGAGCTGCATCCATGGCAAGGAAGATCTACTTGAGAGGAGGTCTTGGTGTTGGGGCATTTAAGCGGATTTATGGTGGAAGCAAGAGGAATGGAAGTCGCCCTCCACACTTTTGTGAAAGCAGTGGAGCCATTGCCCGTCACATTCTACAACAGTTGCAGGAGATGAACATTGTTGATGTGGACCCAAAGGGTGGAAGGAGAATTAC
TTCAAGTGGTCGACGAGACCTTGATCAAGTTGCTGGCCGGATTGTTGTTGCCCCTTGA

Protein sequence

MFRSLRPSLATAAARRFSGEAHKAAAENTALEGGADVVSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALEVCEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLEKVPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTNLGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPLVVAERMEMDNVQLNDETREFLRLTSKMCGTYHSSKTLEFEKGGNPSSAIGAFAGLCSLNSTAMATARTVKDVSPHEFVKAYAAHLKRSGKVELPPWADIVKTARFKELAPYDADWYYVRAASMARKIYLRGGLGVGAFKRIYGGSKRNGSRPPHFCESSGAIARHILQQLQEMNIVDVDPKGGRRITSSGRRDLDQVAGRIVVAP
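
The protein sequence can be cross-checked against the CDS by translating it. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and not taken from the record. Only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' marks ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGTTCCGCTCGTTGCGGCCTTCTCTAGCGACG"  # first 11 codons of the CDS
cds_end = "GTTGCCCCTTGA"                         # last 4 codons, including the stop
print(translate(cds_start))  # MFRSLRPSLAT
print(translate(cds_end))    # VAP
```

Both fragments agree with the start (`MFRSLRPSLAT...`) and end (`...VVAP`) of the protein sequence listed above.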
Homology
BLAST of HG10022849 vs. NCBI nr
Match: KAG6593129.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1193.7 bits (3087), Expect = 0.0e+00
Identity = 610/685 (89.05%), Positives = 640/685 (93.43%), Query Frame = 0

Query: 1   MFRSLRPSLATAAARRFSGEAHKAAAENTALEGGADVVSVTGGGRDTLGRRLMSLTFPKR 60
           M RS R  LAT AARRFSGEA   A ENT LE  +      GGGRDTLGRRLMSL FPKR
Sbjct: 1   MLRSPRLYLAT-AARRFSGEACAVAVENTTLEAASGSSGTGGGGRDTLGRRLMSLAFPKR 60

Query: 61  SAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALEVCEWMTLQKDMKLLPGDYAVH 120
           SAVI+IRKWQEEGHT+RKYELNRIVRELRKLKRYKHALE+CEWMTLQKDMKLLPGDYAVH
Sbjct: 61  SAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWMTLQKDMKLLPGDYAVH 120

Query: 121 LDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFLK 180
           LDLI+KIRGL+SAEKFF DLPDKMR QSA T+LLHV+VQNNLSEKAEALM KMSE GFLK
Sbjct: 121 LDLISKIRGLSSAEKFFMDLPDKMRGQSAYTSLLHVFVQNNLSEKAEALMAKMSEFGFLK 180

Query: 181 SPLSFNHMLSLHISNKQLEKVPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFL 240
           SPLSFNHMLSL+I+NK LEKVP L+Q LKKNTKPDV+TYNLLLNVCTLQNDVEAAE+IFL
Sbjct: 181 SPLSFNHMLSLYITNKGLEKVPALVQELKKNTKPDVLTYNLLLNVCTLQNDVEAAENIFL 240

Query: 241 EMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTNLG 300
           EMK  K EPDWVSFSTLANLYSK+QLTEKAASTLK+MEKMAS+RNRISFSSLLSLYTN G
Sbjct: 241 EMKNAKIEPDWVSFSTLANLYSKQQLTEKAASTLKKMEKMASKRNRISFSSLLSLYTNFG 300

Query: 301 DKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPN 360
           DK+GV RIWKKM S FRKM+DSEYTCMISSLVKL++L EAEKLYTEWESVSGTGDTRVPN
Sbjct: 301 DKDGVCRIWKKMNSSFRKMNDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPN 360

Query: 361 ILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGS 420
           ILLAAYINKNQ +QAESFY+RM+LKGIVPSYTTWELLTWGYLKENQMEKVL FFKNAVGS
Sbjct: 361 ILLAAYINKNQTKQAESFYDRMTLKGIVPSYTTWELLTWGYLKENQMEKVLQFFKNAVGS 420

Query: 421 VKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPL 480
           VKKWNADERLV+GVCK+LEEQGN EGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPL
Sbjct: 421 VKKWNADERLVEGVCKRLEEQGNFEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPL 480

Query: 481 VVAERMEMDNVQLNDETREFLRLTSKMCGTYHSSKTLEFEKGGNPSSAIGAFAGLCSLNS 540
           +VAERME DNVQLN+E+RE L+LTSKMC  Y SSKTLEFEKGG+PSSAIGAFAGLCSLNS
Sbjct: 481 IVAERMEQDNVQLNEESRELLKLTSKMCAAYQSSKTLEFEKGGSPSSAIGAFAGLCSLNS 540

Query: 541 TAMATARTVKDVSPHEFVKAYAAHLKRSGKVELPPWADIVKTARFKELAPYDADWYYVRA 600
           TAMATARTVKDVSPHEFVKAYAAHLKRSGKVELPPW DIVKTARFKELAPYD DWYYVRA
Sbjct: 541 TAMATARTVKDVSPHEFVKAYAAHLKRSGKVELPPWTDIVKTARFKELAPYDPDWYYVRA 600

Query: 601 ASMARKIYLRGGLGVGAFKRIYGGSKRNGSRPPHFCESSGAIARHILQQLQEMNIVDVDP 660
           ASMARKIYLRGGLGVGAFKRIYGGSKRNGSRPPHFCESSGAIARHILQQLQEMNIVDVDP
Sbjct: 601 ASMARKIYLRGGLGVGAFKRIYGGSKRNGSRPPHFCESSGAIARHILQQLQEMNIVDVDP 660

Query: 661 KGGRRITSSGRRDLDQVAGRIVVAP 686
           KGGRRITSSGRRDLDQVAGRIVVAP
Sbjct: 661 KGGRRITSSGRRDLDQVAGRIVVAP 684
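
The percentages in each HSP summary line are simple ratios over the alignment length: identities count exact residue matches, and positives additionally count substitutions that score above zero in the scoring matrix (shown as `+` in the midline). A minimal sketch for recomputing them follows; the helper name `parse_hsp_stats` is hypothetical, and the pattern deliberately tolerates the "Postives" spelling that some reports use.

```python
import re

def parse_hsp_stats(line):
    """Extract (matches, alignment_length, percent) for identity and positives."""
    stats = {}
    for label, num, den in re.findall(r"(Identity|Pos\w*)\s*=\s*(\d+)/(\d+)", line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = (int(num), int(den), round(100 * int(num) / int(den), 2))
    return stats

line = "Identity = 610/685 (89.05%), Positives = 640/685 (93.43%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(stats["identity"])   # (610, 685, 89.05)
print(stats["positives"])  # (640, 685, 93.43)
```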

BLAST of HG10022849 vs. NCBI nr
Match: XP_038900168.1 (pentatricopeptide repeat-containing protein At4g02820, mitochondrial [Benincasa hispida])

HSP 1 Score: 922.2 bits (2382), Expect = 2.7e-264
Identity = 475/518 (91.70%), Positives = 488/518 (94.21%), Query Frame = 0

Query: 1   MFRSLRPSLATAAARRFSGEAHKAAAENTALEGGADV----------VSVTGGGRDTLGR 60
           MFRSLRPSLATAAARRFSGEA  AAAEN  LEGGA V          VS TGGGRDTLGR
Sbjct: 1   MFRSLRPSLATAAARRFSGEAFMAAAENKTLEGGASVGASVSAGASIVSGTGGGRDTLGR 60

Query: 61  RLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALEVCEWMTLQKDM 120
           RLMSLTFPKRSAVI+IRKWQEEGHT+RKYELNRIVRELRKLKRYKHALEVCEWMTLQKDM
Sbjct: 61  RLMSLTFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEVCEWMTLQKDM 120

Query: 121 KLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALM 180
           KLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQS CTALLH YVQNNL EKAEALM
Sbjct: 121 KLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSVCTALLHTYVQNNLCEKAEALM 180

Query: 181 EKMSECGFLKSPLSFNHMLSLHISNKQLEKVPDLIQVLKKNTKPDVVTYNLLLNVCTLQN 240
           EKMSE GFLK PLSFNHMLSL+ISNKQLEKVPD+IQVLKKNTKPDVVTYNLLLNVCTLQN
Sbjct: 181 EKMSESGFLKCPLSFNHMLSLYISNKQLEKVPDVIQVLKKNTKPDVVTYNLLLNVCTLQN 240

Query: 241 DVEAAESIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFS 300
           DVEAAE+IFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLK+MEKMAS+RNRISFS
Sbjct: 241 DVEAAENIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKQMEKMASKRNRISFS 300

Query: 301 SLLSLYTNLGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESV 360
           SLLSLYTNLGDKNGV+RIWKKMKS FRKMSDSEYTCMISSLVKLNEL EAEKLY EWESV
Sbjct: 301 SLLSLYTNLGDKNGVFRIWKKMKSSFRKMSDSEYTCMISSLVKLNELEEAEKLYPEWESV 360

Query: 361 SGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKV 420
           SGTGDTRV NILLAAYINKNQMEQAE+FYNRMS+KG+VPSYTTWELLTWGYLKENQMEKV
Sbjct: 361 SGTGDTRVSNILLAAYINKNQMEQAENFYNRMSVKGMVPSYTTWELLTWGYLKENQMEKV 420

Query: 421 LHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLR 480
           LHF KNAVGSVKKWN DERLVK VCKKLEEQGNIEGAEQLL+ILRN GHVDTEIYNSLLR
Sbjct: 421 LHFLKNAVGSVKKWNGDERLVKEVCKKLEEQGNIEGAEQLLVILRNVGHVDTEIYNSLLR 480

Query: 481 TYAKAGKMPLVVAERMEMDNVQLNDETREFLRLTSKMC 509
           TYAKAGKMPL+VAERMEMDNVQLNDETRE LRLTSKMC
Sbjct: 481 TYAKAGKMPLIVAERMEMDNVQLNDETRELLRLTSKMC 518

BLAST of HG10022849 vs. NCBI nr
Match: KAA0064089.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK18492.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 901.7 bits (2329), Expect = 3.8e-258
Identity = 463/516 (89.73%), Positives = 484/516 (93.80%), Query Frame = 0

Query: 1   MFRSLRPSLATAAARRFSGEAHKAAAENTALEGGA--DVVSVTGGGRDTLGRRLMSLTFP 60
           MFRS R SLATAAARRFSGEA  AAAENT++EGGA   VVS  GGGRDTLGRRLMSLTFP
Sbjct: 1   MFRSFRSSLATAAARRFSGEACVAAAENTSVEGGAGTGVVSRKGGGRDTLGRRLMSLTFP 60

Query: 61  KRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALEVCEWMTLQKDMKLLPGDYA 120
           KRSAVI+IRKWQEEGHT+RKYELN IVRELRKLKRYKHALEVCEWMTLQKDMKLLPGDYA
Sbjct: 61  KRSAVIAIRKWQEEGHTIRKYELNHIVRELRKLKRYKHALEVCEWMTLQKDMKLLPGDYA 120

Query: 121 VHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGF 180
           V LDLIAKIRGLNSAEKFFEDLPDK+R+QS CTALLH YVQ NLSEKAEALMEKMSECGF
Sbjct: 121 VQLDLIAKIRGLNSAEKFFEDLPDKIREQSVCTALLHAYVQKNLSEKAEALMEKMSECGF 180

Query: 181 LKSPLSFNHMLSLHISNKQLEKVPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESI 240
           LKSPLSFNHMLSLHISNKQLEKVP LI+VLKKNTKPDVVTYNLLLNVCTLQND EAAE+I
Sbjct: 181 LKSPLSFNHMLSLHISNKQLEKVPALIEVLKKNTKPDVVTYNLLLNVCTLQNDAEAAENI 240

Query: 241 FLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTN 300
           FLEMKKTK +PDW+SFSTLANLY KKQLTEKAA+TLKEMEKMA +RNR+SFSSLLSLYTN
Sbjct: 241 FLEMKKTKVQPDWLSFSTLANLYCKKQLTEKAAATLKEMEKMAFKRNRLSFSSLLSLYTN 300

Query: 301 LGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRV 360
           LGDKN V RIWKK+KS FRKMSDSEY CM+SSLVKLNEL EAEKLYTEWESVSGT DTR+
Sbjct: 301 LGDKNEVRRIWKKLKSSFRKMSDSEYMCMVSSLVKLNELEEAEKLYTEWESVSGTRDTRI 360

Query: 361 PNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAV 420
            N++LAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAV
Sbjct: 361 SNVMLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAV 420

Query: 421 GSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKM 480
           GSVKKWNADERLVKGVCKKLEEQGNIEG EQLL+ILRNAGHVDTEIYNSLLRTYAKAGKM
Sbjct: 421 GSVKKWNADERLVKGVCKKLEEQGNIEGVEQLLVILRNAGHVDTEIYNSLLRTYAKAGKM 480

Query: 481 PLVVAERMEMDNVQLNDETREFLRLTSKMCGTYHSS 515
           PL+VAERME DNVQLNDETRE LRLTSKMC +  SS
Sbjct: 481 PLIVAERMEKDNVQLNDETRELLRLTSKMCVSEVSS 516

BLAST of HG10022849 vs. NCBI nr
Match: XP_008451368.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g02820, mitochondrial [Cucumis melo])

HSP 1 Score: 897.5 bits (2318), Expect = 7.1e-257
Identity = 460/516 (89.15%), Positives = 482/516 (93.41%), Query Frame = 0

Query: 1   MFRSLRPSLATAAARRFSGEAHKAAAENTALEG--GADVVSVTGGGRDTLGRRLMSLTFP 60
           MFRS R SLATAAARRFSGEA  AAAENT++EG  G  VVS  GGGRDTLGRRLMSL FP
Sbjct: 1   MFRSFRSSLATAAARRFSGEACVAAAENTSVEGAAGTGVVSRKGGGRDTLGRRLMSLIFP 60

Query: 61  KRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALEVCEWMTLQKDMKLLPGDYA 120
           KRSAVI+IRKWQEEGHT+RKYELN IVRELRKLKRYKHALEVCEWMTLQKDMKLLPGDYA
Sbjct: 61  KRSAVIAIRKWQEEGHTIRKYELNHIVRELRKLKRYKHALEVCEWMTLQKDMKLLPGDYA 120

Query: 121 VHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGF 180
           V LDLIAKIRGLNSAEKFFEDLPDK+R+QS CTALLH YVQ NLSEKAEALMEKMSECGF
Sbjct: 121 VQLDLIAKIRGLNSAEKFFEDLPDKIREQSVCTALLHAYVQKNLSEKAEALMEKMSECGF 180

Query: 181 LKSPLSFNHMLSLHISNKQLEKVPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESI 240
           LKSPLSFNHMLSLHISNKQLEKVP LI+VLKKNTKPDVVTYNLLLNVCTLQND EAAE+I
Sbjct: 181 LKSPLSFNHMLSLHISNKQLEKVPALIEVLKKNTKPDVVTYNLLLNVCTLQNDAEAAENI 240

Query: 241 FLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTN 300
           FLEMKKTK +PDW+SFSTLANLY KKQLTEKAA+TLKEMEKMA +RNR+SFSSLLSLY N
Sbjct: 241 FLEMKKTKVQPDWLSFSTLANLYCKKQLTEKAAATLKEMEKMAFKRNRLSFSSLLSLYAN 300

Query: 301 LGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRV 360
           LGDKN V+RIWKK+KS FRKMSDSEY CM+SSLVKLNEL EAEKLYTEWESVSGT DTR+
Sbjct: 301 LGDKNEVHRIWKKLKSSFRKMSDSEYMCMVSSLVKLNELEEAEKLYTEWESVSGTRDTRI 360

Query: 361 PNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAV 420
            N++LAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAV
Sbjct: 361 SNVMLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAV 420

Query: 421 GSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKM 480
           GSVKKWNADERLVKGVCKKLEEQGNIEG EQLL+ILRNAGHVDTEIYNSLLRTYAKAGKM
Sbjct: 421 GSVKKWNADERLVKGVCKKLEEQGNIEGVEQLLVILRNAGHVDTEIYNSLLRTYAKAGKM 480

Query: 481 PLVVAERMEMDNVQLNDETREFLRLTSKMCGTYHSS 515
           PL+VAERME DNVQLNDETRE LRLTSKMC +  SS
Sbjct: 481 PLIVAERMEKDNVQLNDETRELLRLTSKMCVSEVSS 516

BLAST of HG10022849 vs. NCBI nr
Match: XP_022150266.1 (pentatricopeptide repeat-containing protein At4g02820, mitochondrial isoform X2 [Momordica charantia])

HSP 1 Score: 875.9 bits (2262), Expect = 2.2e-250
Identity = 452/514 (87.94%), Positives = 481/514 (93.58%), Query Frame = 0

Query: 1   MFRSLRPSLATAAARRFSGEAHKAAAENTALEGGADVVSVTGGGRDTLGRRLMSLTFPKR 60
           M RSLR SLAT AARRFSGEA  AA ENTA+EGG+   S  GGGRDTLGRRLMSL FPKR
Sbjct: 1   MLRSLRTSLAT-AARRFSGEAFMAAVENTAIEGGSG-SSGGGGGRDTLGRRLMSLAFPKR 60

Query: 61  SAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALEVCEWMTLQKDMKLLPGDYAVH 120
           SAVI+IRKWQEEGHT+RKYELNRIVRELRKLKRYKHALE+CEW T QKDMKLLPGDYAVH
Sbjct: 61  SAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWTTSQKDMKLLPGDYAVH 120

Query: 121 LDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFLK 180
           LDLIAKIRGLNSAEKFFEDLPDKMR QSACTALLHVYVQNNLS+KAEALMEKMSECGFLK
Sbjct: 121 LDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLK 180

Query: 181 SPLSFNHMLSLHISNKQLEKVPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFL 240
           SPLSFNHMLSLHISNKQL+KVP LIQ L+KNTKPDVVTYNLLLNVCTLQNDVEAAE+I L
Sbjct: 181 SPLSFNHMLSLHISNKQLDKVPALIQDLQKNTKPDVVTYNLLLNVCTLQNDVEAAENILL 240

Query: 241 EMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTNLG 300
           EMKK K E DWV+ STL NLYSKKQLTEKAASTLKEMEKMAS+RNRI+FSSLLSLYTNLG
Sbjct: 241 EMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLG 300

Query: 301 DKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPN 360
           DK+G +RIWKKMK+ FRKMSDSEYTCMISS+VKL+EL EAEKLYTEWESVSGTGDTRVPN
Sbjct: 301 DKDGAWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPN 360

Query: 361 ILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGS 420
           ILLAAYIN NQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGS
Sbjct: 361 ILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGS 420

Query: 421 VKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPL 480
           VKKWNADERLVK VCKKLEE+GNIEGAE+LLI+LRNAGHV+TEIYNSLLRTYAKAGKMPL
Sbjct: 421 VKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPL 480

Query: 481 VVAERMEMDNVQLNDETREFLRLTSKMCGTYHSS 515
           +VAERME D+V+L++ETRE ++LTSKMC +  SS
Sbjct: 481 IVAERMEKDDVKLDEETRELIKLTSKMCVSEVSS 512

BLAST of HG10022849 vs. ExPASy Swiss-Prot
Match: Q9SY07 (Pentatricopeptide repeat-containing protein At4g02820, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At4g02820 PE=2 SV=1)

HSP 1 Score: 616.7 bits (1589), Expect = 3.2e-175
Identity = 318/527 (60.34%), Positives = 404/527 (76.66%), Query Frame = 0

Query: 1   MFRSLRPSLATAAARRFSGEAHKAAAENTAL-----------EGG--ADVVSVTGGGRDT 60
           + RS RP+LA +  R FS  A  AA  +TA            +GG  A+      GGRDT
Sbjct: 6   LVRSARPTLA-SIHRLFSAAA--AATVDTATAPVVKPRSGGGKGGESANKKETVVGGRDT 65

Query: 61  LGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALEVCEWMTLQ 120
           LG RL+SL + KRSAV++IRKW+EEGH++RKYELNRIVRELRK+KRYKHALE+CEWM +Q
Sbjct: 66  LGGRLLSLVYTKRSAVVTIRKWKEEGHSVRKYELNRIVRELRKIKRYKHALEICEWMVVQ 125

Query: 121 KDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAE 180
           +D+KL  GDYAVHLDLI+KIRGLNSAEKFFED+PD+MR  +ACT+LLH YVQN LS+KAE
Sbjct: 126 EDIKLQAGDYAVHLDLISKIRGLNSAEKFFEDMPDQMRGHAACTSLLHSYVQNKLSDKAE 185

Query: 181 ALMEKMSECGFLKSPLSFNHMLSLHISNKQLEKVPDLIQVLKKNTKPDVVTYNLLLNVCT 240
           AL EKM ECGFLKS L +NHMLS++IS  Q EKVP LI+ LK  T PD+VTYNL L    
Sbjct: 186 ALFEKMGECGFLKSCLPYNHMLSMYISRGQFEKVPVLIKELKIRTSPDIVTYNLWLTAFA 245

Query: 241 LQNDVEAAESIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRI 300
             NDVE AE ++L+ K+ K  PDWV++S L NLY+K    EKA   LKEMEK+ S++NR+
Sbjct: 246 SGNDVEGAEKVYLKAKEEKLNPDWVTYSVLTNLYAKTDNVEKARLALKEMEKLVSKKNRV 305

Query: 301 SFSSLLSLYTNLGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEW 360
           +++SL+SL+ NLGDK+GV   WKK+KS F+KM+D+EY  MIS++VKL E  +A+ LY EW
Sbjct: 306 AYASLISLHANLGDKDGVNLTWKKVKSSFKKMNDAEYLSMISAVVKLGEFEQAKGLYDEW 365

Query: 361 ESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQM 420
           ESVSGTGD R+PN++LA Y+N++++   E FY R+  KGI PSY+TWE+LTW YLK   M
Sbjct: 366 ESVSGTGDARIPNLILAEYMNRDEVLLGEKFYERIVEKGINPSYSTWEILTWAYLKRKDM 425

Query: 421 EKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNS 480
           EKVL  F  A+ SVKKW  + RLVKG CK+LEEQGN++GAE+L+ +L+ AG+V+T++YNS
Sbjct: 426 EKVLDCFGKAIDSVKKWTVNVRLVKGACKELEEQGNVKGAEKLMTLLQKAGYVNTQLYNS 485

Query: 481 LLRTYAKAGKMPLVVAERMEMDNVQLNDETREFLRLTSKMCGTYHSS 515
           LLRTYAKAG+M L+V ERM  DNV+L++ET+E +RLTS+M  T  SS
Sbjct: 486 LLRTYAKAGEMALIVEERMAKDNVELDEETKELIRLTSQMRVTEISS 529

BLAST of HG10022849 vs. ExPASy Swiss-Prot
Match: Q8LPS6 (Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana OX=3702 GN=At1g02150 PE=2 SV=2)

HSP 1 Score: 273.9 bits (699), Expect = 5.1e-72
Identity = 148/425 (34.82%), Positives = 253/425 (59.53%), Query Frame = 0

Query: 50  RRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALEVCEWMTLQKD 109
           +++  +  P+  A   + +W++ G  L K+EL R+V+ELRK KR   ALEV +WM  + +
Sbjct: 71  KKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDWMNNRGE 130

Query: 110 -MKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEA 169
             +L   D A+ LDLI K+RG+  AE+FF  LP+  +D+    +LL+ YV+    EKAEA
Sbjct: 131 RFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKSREKAEA 190

Query: 170 LMEKMSECGFLKSPLSFNHMLSLHISNKQLEKVPDLI-QVLKKNTKPDVVTYNLLLNVCT 229
           L+  M + G+   PL FN M++L+++ ++ +KV  ++ ++ +K+ + D+ +YN+ L+ C 
Sbjct: 191 LLNTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKDIRLDIYSYNIWLSSCG 250

Query: 230 LQNDVEAAESIFLEMKK-TKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNR 289
               VE  E ++ +MK      P+W +FST+A +Y K   TEKA   L+++E   + RNR
Sbjct: 251 SLGSVEKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEARITGRNR 310

Query: 290 ISFSSLLSLYTNLGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTE 349
           I +  LLSLY +LG+K  +YR+W   KS+   + +  Y  ++SSLV++ ++  AEK+Y E
Sbjct: 311 IPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDIEGAEKVYEE 370

Query: 350 WESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQ 409
           W  V  + D R+PN+L+ AY+  +Q+E AE  ++ M   G  PS +TWE+L  G+ ++  
Sbjct: 371 WLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEILAVGHTRKRC 430

Query: 410 MEKVLHFFKNAVGS--VKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEI 469
           + + L   +NA  +     W     ++ G  K  EE+ ++   E +L +LR +G ++ + 
Sbjct: 431 ISEALTCLRNAFSAEGSSNWRPKVLMLSGFFKLCEEESDVTSKEAVLELLRQSGDLEDKS 490

BLAST of HG10022849 vs. ExPASy Swiss-Prot
Match: Q3E911 (Pentatricopeptide repeat-containing protein At5g27460 OS=Arabidopsis thaliana OX=3702 GN=At5g27460 PE=2 SV=1)

HSP 1 Score: 270.0 bits (689), Expect = 7.3e-71
Identity = 166/471 (35.24%), Positives = 267/471 (56.69%), Query Frame = 0

Query: 29  TALEGGADVVSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVREL 88
           ++L  G+D  SV    R++L + ++    P+RS    +++  + GH +   EL  I + L
Sbjct: 24  SSLADGSDTSSV--ANRNSL-KEILRKNGPRRSVTSLLQERIDSGHAVSLSELRLISKRL 83

Query: 89  RKLKRYKHALEVCEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDL---PDKMR 148
            +  RY  AL++ EWM  QKD++    D A+ LDLI K  GL   E++FE L      MR
Sbjct: 84  IRSNRYDLALQMMEWMENQKDIEFSVYDIALRLDLIIKTHGLKQGEEYFEKLLHSSVSMR 143

Query: 149 -DQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLEKVPDL 208
             +SA   LL  YV+N + ++AEALMEK++  GFL +P  FN M+ L+ ++ Q EKV  +
Sbjct: 144 VAKSAYLPLLRAYVKNKMVKEAEALMEKLNGLGFLVTPHPFNEMMKLYEASGQYEKVVMV 203

Query: 209 IQVLKKNTKP-DVVTYNLLLNVCTLQNDVEAAESIFLEMKKTKT-EPDWVSFSTLANLYS 268
           + ++K N  P +V++YNL +N C   + V A E+++ EM   K+ E  W S  TLAN+Y 
Sbjct: 204 VSMMKGNKIPRNVLSYNLWMNACCEVSGVAAVETVYKEMVGDKSVEVGWSSLCTLANVYI 263

Query: 269 KKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTNLGDKNGVYRIWKKMKSLFRKMSDS 328
           K    EKA   L++ EKM ++ NR+ +  L++LY +LG+K GV R+W+  KS+  ++S  
Sbjct: 264 KSGFDEKARLVLEDAEKMLNRSNRLGYFFLITLYASLGNKEGVVRLWEVSKSVCGRISCV 323

Query: 329 EYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRM 388
            Y C++SSLVK  +L EAE++++EWE+     D RV N+LL AY+   ++ +AES +  +
Sbjct: 324 NYICVLSSLVKTGDLEEAERVFSEWEAQCFNYDVRVSNVLLGAYVRNGEIRKAESLHGCV 383

Query: 389 SLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGSVKK--WNADERLVKGVCKKLEE 448
             +G  P+Y TWE+L  G++K   MEK +         +++  W     +V  + +  E+
Sbjct: 384 LERGGTPNYKTWEILMEGWVKCENMEKAIDAMHQVFVLMRRCHWRPSHNIVMAIAEYFEK 443

Query: 449 QGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPLVVAERMEMDNV 492
           +  IE A   +  L   G     +Y  LLR +  A +    + E M++D +
Sbjct: 444 EEKIEEATAYVRDLHRLGLASLPLYRLLLRMHEHAKRPAYDIYEMMKLDKL 491

BLAST of HG10022849 vs. ExPASy Swiss-Prot
Match: O22714 (Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana OX=3702 GN=At1g60770 PE=1 SV=1)

HSP 1 Score: 257.3 bits (656), Expect = 4.9e-67
Identity = 147/431 (34.11%), Positives = 240/431 (55.68%), Query Frame = 0

Query: 78  KYELNRIVRELRKLKRYKHALEVCEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFF 137
           K+E+   +++LR    Y  AL++ E M  ++ M     D A+HLDL+AK R + + E +F
Sbjct: 55  KWEVGDTIKKLRNRGLYYPALKLSEVME-ERGMNKTVSDQAIHLDLVAKAREITAGENYF 114

Query: 138 EDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQ 197
            DLP+  + +    +LL+ Y +  L+EKAE L+ KM E     S +S+N +++L+    +
Sbjct: 115 VDLPETSKTELTYGSLLNCYCKELLTEKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGE 174

Query: 198 LEKVPDLIQVLK-KNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKT-KTEPDWVSFS 257
            EKVP +IQ LK +N  PD  TYN+ +      ND+   E +  EM +  +  PDW ++S
Sbjct: 175 TEKVPAMIQELKAENVMPDSYTYNVWMRALAATNDISGVERVIEEMNRDGRVAPDWTTYS 234

Query: 258 TLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTNLGDKNGVYRIWKKMKSL 317
            +A++Y    L++KA   L+E+E   +QR+  ++  L++LY  LG    VYRIW+ ++  
Sbjct: 235 NMASIYVDAGLSQKAEKALQELEMKNTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLA 294

Query: 318 FRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQA 377
             K S+  Y  MI  LVKLN+L  AE L+ EW++   T D R+ N+L+ AY  +  +++A
Sbjct: 295 IPKTSNVAYLNMIQVLVKLNDLPGAETLFKEWQANCSTYDIRIVNVLIGAYAQEGLIQKA 354

Query: 378 ESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAV----GSVKKWNADERLV 437
                +   +G   +  TWE+    Y+K   M + L     AV    G   KW      V
Sbjct: 355 NELKEKAPRRGGKLNAKTWEIFMDYYVKSGDMARALECMSKAVSIGKGDGGKWLPSPETV 414

Query: 438 KGVCKKLEEQGNIEGAEQLLIILRN-AGHVDTEIYNSLLRTYAKAGKMPLVVAERMEMDN 497
           + +    E++ ++ GAE LL IL+N   ++  EI+  L+RTYA AGK    +  R++M+N
Sbjct: 415 RALMSYFEQKKDVNGAENLLEILKNGTDNIGAEIFEPLIRTYAAAGKSHPAMRRRLKMEN 474

Query: 498 VQLNDETREFL 502
           V++N+ T++ L
Sbjct: 475 VEVNEATKKLL 484
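
The identity and positives percentages in each HSP summary are ratios over the aligned length, so they can be recomputed from the counts. The following is a minimal Python sketch and not part of the database output. The function names `parse_hsp_stats` and `recomputed_pct` are hypothetical, and the regular expression assumes the summary line has the layout used by the entries on this page (it also tolerates the variant spelling `Postives`).

```python
import re

# Assumed layout of the HSP summary line, e.g.
# "Identity = 147/431 (34.11%), Positives = 240/431 (55.68%), Query Frame = 0"
# "Posi?tives" accepts both "Positives" and the variant "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract counts and reported percentages from one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, length, ident_pct, pos, _pos_length, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(length),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


def recomputed_pct(count, length):
    """Percentage of aligned columns, rounded to two decimals."""
    return round(100.0 * count / length, 2)


stats = parse_hsp_stats(
    "Identity = 147/431 (34.11%), Positives = 240/431 (55.68%), Query Frame = 0"
)
# Compare the reported values with the ratios recomputed from the counts.
identity_ok = recomputed_pct(stats["identical"], stats["aligned_length"]) == stats["identity_pct"]
positives_ok = recomputed_pct(stats["positives"], stats["aligned_length"]) == stats["positives_pct"]
```

For the O22714 hit above, both checks agree with the reported 34.11% and 55.68%.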

BLAST of HG10022849 vs. ExPASy Swiss-Prot
Match: Q9SGA6 (40S ribosomal protein S19-1 OS=Arabidopsis thaliana OX=3702 GN=RPS19A PE=2 SV=1)

HSP 1 Score: 256.1 bits (653), Expect = 1.1e-66
Identity = 119/143 (83.22%), Positives = 132/143 (92.31%), Query Frame = 0

Query: 543 MATARTVKDVSPHEFVKAYAAHLKRSGKVELPPWADIVKTARFKELAPYDADWYYVRAAS 602
           MAT +TVKDVSPH+FVKAYA+HLKRSGK+ELP W DIVKT + KELAPYD DWYY+RAAS
Sbjct: 1   MATGKTVKDVSPHDFVKAYASHLKRSGKIELPTWTDIVKTGKLKELAPYDPDWYYIRAAS 60

Query: 603 MARKIYLRGGLGVGAFKRIYGGSKRNGSRPPHFCESSGAIARHILQQLQEMNIVDVDPKG 662
           MARK+YLRGGLGVGAF+RIYGGSKRNGSRPPHFC+SSG IARHILQQL+ MNIV++D KG
Sbjct: 61  MARKVYLRGGLGVGAFRRIYGGSKRNGSRPPHFCKSSGGIARHILQQLETMNIVELDTKG 120

Query: 663 GRRITSSGRRDLDQVAGRIVVAP 686
           GRRITSSG+RDLDQVAGRI V P
Sbjct: 121 GRRITSSGQRDLDQVAGRIAVEP 143

BLAST of HG10022849 vs. ExPASy TrEMBL
Match: A0A5D3D4M6 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold2032G00090 PE=4 SV=1)

HSP 1 Score: 901.7 bits (2329), Expect = 1.8e-258
Identity = 463/516 (89.73%), Positives = 484/516 (93.80%), Query Frame = 0

Query: 1   MFRSLRPSLATAAARRFSGEAHKAAAENTALEGGA--DVVSVTGGGRDTLGRRLMSLTFP 60
           MFRS R SLATAAARRFSGEA  AAAENT++EGGA   VVS  GGGRDTLGRRLMSLTFP
Sbjct: 1   MFRSFRSSLATAAARRFSGEACVAAAENTSVEGGAGTGVVSRKGGGRDTLGRRLMSLTFP 60

Query: 61  KRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALEVCEWMTLQKDMKLLPGDYA 120
           KRSAVI+IRKWQEEGHT+RKYELN IVRELRKLKRYKHALEVCEWMTLQKDMKLLPGDYA
Sbjct: 61  KRSAVIAIRKWQEEGHTIRKYELNHIVRELRKLKRYKHALEVCEWMTLQKDMKLLPGDYA 120

Query: 121 VHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGF 180
           V LDLIAKIRGLNSAEKFFEDLPDK+R+QS CTALLH YVQ NLSEKAEALMEKMSECGF
Sbjct: 121 VQLDLIAKIRGLNSAEKFFEDLPDKIREQSVCTALLHAYVQKNLSEKAEALMEKMSECGF 180

Query: 181 LKSPLSFNHMLSLHISNKQLEKVPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESI 240
           LKSPLSFNHMLSLHISNKQLEKVP LI+VLKKNTKPDVVTYNLLLNVCTLQND EAAE+I
Sbjct: 181 LKSPLSFNHMLSLHISNKQLEKVPALIEVLKKNTKPDVVTYNLLLNVCTLQNDAEAAENI 240

Query: 241 FLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTN 300
           FLEMKKTK +PDW+SFSTLANLY KKQLTEKAA+TLKEMEKMA +RNR+SFSSLLSLYTN
Sbjct: 241 FLEMKKTKVQPDWLSFSTLANLYCKKQLTEKAAATLKEMEKMAFKRNRLSFSSLLSLYTN 300

Query: 301 LGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRV 360
           LGDKN V RIWKK+KS FRKMSDSEY CM+SSLVKLNEL EAEKLYTEWESVSGT DTR+
Sbjct: 301 LGDKNEVRRIWKKLKSSFRKMSDSEYMCMVSSLVKLNELEEAEKLYTEWESVSGTRDTRI 360

Query: 361 PNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAV 420
            N++LAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAV
Sbjct: 361 SNVMLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAV 420

Query: 421 GSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKM 480
           GSVKKWNADERLVKGVCKKLEEQGNIEG EQLL+ILRNAGHVDTEIYNSLLRTYAKAGKM
Sbjct: 421 GSVKKWNADERLVKGVCKKLEEQGNIEGVEQLLVILRNAGHVDTEIYNSLLRTYAKAGKM 480

Query: 481 PLVVAERMEMDNVQLNDETREFLRLTSKMCGTYHSS 515
           PL+VAERME DNVQLNDETRE LRLTSKMC +  SS
Sbjct: 481 PLIVAERMEKDNVQLNDETRELLRLTSKMCVSEVSS 516

BLAST of HG10022849 vs. ExPASy TrEMBL
Match: A0A1S3BRD9 (pentatricopeptide repeat-containing protein At4g02820, mitochondrial OS=Cucumis melo OX=3656 GN=LOC103492678 PE=4 SV=1)

HSP 1 Score: 897.5 bits (2318), Expect = 3.4e-257
Identity = 460/516 (89.15%), Positives = 482/516 (93.41%), Query Frame = 0

Query: 1   MFRSLRPSLATAAARRFSGEAHKAAAENTALEG--GADVVSVTGGGRDTLGRRLMSLTFP 60
           MFRS R SLATAAARRFSGEA  AAAENT++EG  G  VVS  GGGRDTLGRRLMSL FP
Sbjct: 1   MFRSFRSSLATAAARRFSGEACVAAAENTSVEGAAGTGVVSRKGGGRDTLGRRLMSLIFP 60

Query: 61  KRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALEVCEWMTLQKDMKLLPGDYA 120
           KRSAVI+IRKWQEEGHT+RKYELN IVRELRKLKRYKHALEVCEWMTLQKDMKLLPGDYA
Sbjct: 61  KRSAVIAIRKWQEEGHTIRKYELNHIVRELRKLKRYKHALEVCEWMTLQKDMKLLPGDYA 120

Query: 121 VHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGF 180
           V LDLIAKIRGLNSAEKFFEDLPDK+R+QS CTALLH YVQ NLSEKAEALMEKMSECGF
Sbjct: 121 VQLDLIAKIRGLNSAEKFFEDLPDKIREQSVCTALLHAYVQKNLSEKAEALMEKMSECGF 180

Query: 181 LKSPLSFNHMLSLHISNKQLEKVPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESI 240
           LKSPLSFNHMLSLHISNKQLEKVP LI+VLKKNTKPDVVTYNLLLNVCTLQND EAAE+I
Sbjct: 181 LKSPLSFNHMLSLHISNKQLEKVPALIEVLKKNTKPDVVTYNLLLNVCTLQNDAEAAENI 240

Query: 241 FLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTN 300
           FLEMKKTK +PDW+SFSTLANLY KKQLTEKAA+TLKEMEKMA +RNR+SFSSLLSLY N
Sbjct: 241 FLEMKKTKVQPDWLSFSTLANLYCKKQLTEKAAATLKEMEKMAFKRNRLSFSSLLSLYAN 300

Query: 301 LGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRV 360
           LGDKN V+RIWKK+KS FRKMSDSEY CM+SSLVKLNEL EAEKLYTEWESVSGT DTR+
Sbjct: 301 LGDKNEVHRIWKKLKSSFRKMSDSEYMCMVSSLVKLNELEEAEKLYTEWESVSGTRDTRI 360

Query: 361 PNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAV 420
            N++LAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAV
Sbjct: 361 SNVMLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAV 420

Query: 421 GSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKM 480
           GSVKKWNADERLVKGVCKKLEEQGNIEG EQLL+ILRNAGHVDTEIYNSLLRTYAKAGKM
Sbjct: 421 GSVKKWNADERLVKGVCKKLEEQGNIEGVEQLLVILRNAGHVDTEIYNSLLRTYAKAGKM 480

Query: 481 PLVVAERMEMDNVQLNDETREFLRLTSKMCGTYHSS 515
           PL+VAERME DNVQLNDETRE LRLTSKMC +  SS
Sbjct: 481 PLIVAERMEKDNVQLNDETRELLRLTSKMCVSEVSS 516

BLAST of HG10022849 vs. ExPASy TrEMBL
Match: A0A6J1DB09 (pentatricopeptide repeat-containing protein At4g02820, mitochondrial isoform X2 OS=Momordica charantia OX=3673 GN=LOC111018472 PE=4 SV=1)

HSP 1 Score: 875.9 bits (2262), Expect = 1.1e-250
Identity = 452/514 (87.94%), Positives = 481/514 (93.58%), Query Frame = 0

Query: 1   MFRSLRPSLATAAARRFSGEAHKAAAENTALEGGADVVSVTGGGRDTLGRRLMSLTFPKR 60
           M RSLR SLAT AARRFSGEA  AA ENTA+EGG+   S  GGGRDTLGRRLMSL FPKR
Sbjct: 1   MLRSLRTSLAT-AARRFSGEAFMAAVENTAIEGGSG-SSGGGGGRDTLGRRLMSLAFPKR 60

Query: 61  SAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALEVCEWMTLQKDMKLLPGDYAVH 120
           SAVI+IRKWQEEGHT+RKYELNRIVRELRKLKRYKHALE+CEW T QKDMKLLPGDYAVH
Sbjct: 61  SAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWTTSQKDMKLLPGDYAVH 120

Query: 121 LDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFLK 180
           LDLIAKIRGLNSAEKFFEDLPDKMR QSACTALLHVYVQNNLS+KAEALMEKMSECGFLK
Sbjct: 121 LDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLK 180

Query: 181 SPLSFNHMLSLHISNKQLEKVPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFL 240
           SPLSFNHMLSLHISNKQL+KVP LIQ L+KNTKPDVVTYNLLLNVCTLQNDVEAAE+I L
Sbjct: 181 SPLSFNHMLSLHISNKQLDKVPALIQDLQKNTKPDVVTYNLLLNVCTLQNDVEAAENILL 240

Query: 241 EMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTNLG 300
           EMKK K E DWV+ STL NLYSKKQLTEKAASTLKEMEKMAS+RNRI+FSSLLSLYTNLG
Sbjct: 241 EMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLG 300

Query: 301 DKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPN 360
           DK+G +RIWKKMK+ FRKMSDSEYTCMISS+VKL+EL EAEKLYTEWESVSGTGDTRVPN
Sbjct: 301 DKDGAWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPN 360

Query: 361 ILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGS 420
           ILLAAYIN NQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGS
Sbjct: 361 ILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGS 420

Query: 421 VKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPL 480
           VKKWNADERLVK VCKKLEE+GNIEGAE+LLI+LRNAGHV+TEIYNSLLRTYAKAGKMPL
Sbjct: 421 VKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPL 480

Query: 481 VVAERMEMDNVQLNDETREFLRLTSKMCGTYHSS 515
           +VAERME D+V+L++ETRE ++LTSKMC +  SS
Sbjct: 481 IVAERMEKDDVKLDEETRELIKLTSKMCVSEVSS 512

BLAST of HG10022849 vs. ExPASy TrEMBL
Match: A0A0A0K7E2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G390000 PE=4 SV=1)

HSP 1 Score: 867.1 bits (2239), Expect = 5.0e-248
Identity = 446/516 (86.43%), Positives = 468/516 (90.70%), Query Frame = 0

Query: 1   MFRSLRPSLATAAARRFSGEAHKAAAENTALEG--GADVVSVTGGGRDTLGRRLMSLTFP 60
           MFRS RPSLATAAARRFSGEA  AA+ENTALEG  G  VVS  GGGRDTLGRRLMSL FP
Sbjct: 1   MFRSFRPSLATAAARRFSGEASMAASENTALEGAAGTRVVSGKGGGRDTLGRRLMSLIFP 60

Query: 61  KRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALEVCEWMTLQKDMKLLPGDYA 120
           KRSAV +IRKWQEEG T+RKYELNR VRELRKLKRYKHALEVCEWMTLQKDM+L+PGDYA
Sbjct: 61  KRSAVTAIRKWQEEGRTVRKYELNRNVRELRKLKRYKHALEVCEWMTLQKDMRLVPGDYA 120

Query: 121 VHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGF 180
           VHLDLI KIRGLN AEKFFEDLPDK+R+QS CT+LLH YVQNNLSEKAEALMEKMSECGF
Sbjct: 121 VHLDLICKIRGLNRAEKFFEDLPDKIREQSVCTSLLHAYVQNNLSEKAEALMEKMSECGF 180

Query: 181 LKSPLSFNHMLSLHISNKQLEKVPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESI 240
           LKSPLSFNHMLSLHISNKQLEKVP LI+ LKKNTKPDVVTYNLLLNVCTLQND EAAE+I
Sbjct: 181 LKSPLSFNHMLSLHISNKQLEKVPALIEGLKKNTKPDVVTYNLLLNVCTLQNDTEAAENI 240

Query: 241 FLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTN 300
           FLEMKKTK +PDWVSFSTLANLY K QLTEKAA+TLKEMEKMA + NR+S SSLLSLYTN
Sbjct: 241 FLEMKKTKIQPDWVSFSTLANLYCKNQLTEKAAATLKEMEKMAFKSNRLSLSSLLSLYTN 300

Query: 301 LGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRV 360
           LGDKN VYRIWKK+KS FRKMSD EY CMISSLVKLNEL EAEKLYTEWESVSGT DTRV
Sbjct: 301 LGDKNEVYRIWKKLKSSFRKMSDREYMCMISSLVKLNELEEAEKLYTEWESVSGTRDTRV 360

Query: 361 PNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAV 420
            N++L AYI KNQ+EQAESFYNRM  KG VPSYTTWELLTWGYLKENQMEKVLHFF+ AV
Sbjct: 361 SNVMLGAYIKKNQIEQAESFYNRMLQKGTVPSYTTWELLTWGYLKENQMEKVLHFFRKAV 420

Query: 421 GSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKM 480
             VKKWNADERLVKGVCKKLEEQGNI G EQLL+ILRNAGHVDTEIYNSLLRTYAKAGKM
Sbjct: 421 NRVKKWNADERLVKGVCKKLEEQGNINGVEQLLLILRNAGHVDTEIYNSLLRTYAKAGKM 480

Query: 481 PLVVAERMEMDNVQLNDETREFLRLTSKMCGTYHSS 515
           PL+VAERME DNVQLNDETRE LRLTSKMC +  SS
Sbjct: 481 PLIVAERMERDNVQLNDETRELLRLTSKMCVSEVSS 516

BLAST of HG10022849 vs. ExPASy TrEMBL
Match: A0A6J1D809 (pentatricopeptide repeat-containing protein At4g02820, mitochondrial isoform X1 OS=Momordica charantia OX=3673 GN=LOC111018472 PE=4 SV=1)

HSP 1 Score: 863.6 bits (2230), Expect = 5.5e-247
Identity = 452/535 (84.49%), Positives = 481/535 (89.91%), Query Frame = 0

Query: 1   MFRSLRPSLATAAARRFSGEAHKAAAENTALEGGADVVSVTGGGRDTLGRRLMSLTFPKR 60
           M RSLR SLAT AARRFSGEA  AA ENTA+EGG+   S  GGGRDTLGRRLMSL FPKR
Sbjct: 1   MLRSLRTSLAT-AARRFSGEAFMAAVENTAIEGGSG-SSGGGGGRDTLGRRLMSLAFPKR 60

Query: 61  SAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALE--------------------- 120
           SAVI+IRKWQEEGHT+RKYELNRIVRELRKLKRYKHALE                     
Sbjct: 61  SAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEDYLVARPHNDPTSRKLQRNKY 120

Query: 121 VCEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQ 180
           +CEW T QKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMR QSACTALLHVYVQ
Sbjct: 121 ICEWTTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQ 180

Query: 181 NNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLEKVPDLIQVLKKNTKPDVVTY 240
           NNLS+KAEALMEKMSECGFLKSPLSFNHMLSLHISNKQL+KVP LIQ L+KNTKPDVVTY
Sbjct: 181 NNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLDKVPALIQDLQKNTKPDVVTY 240

Query: 241 NLLLNVCTLQNDVEAAESIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEK 300
           NLLLNVCTLQNDVEAAE+I LEMKK K E DWV+ STL NLYSKKQLTEKAASTLKEMEK
Sbjct: 241 NLLLNVCTLQNDVEAAENILLEMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEK 300

Query: 301 MASQRNRISFSSLLSLYTNLGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGE 360
           MAS+RNRI+FSSLLSLYTNLGDK+G +RIWKKMK+ FRKMSDSEYTCMISS+VKL+EL E
Sbjct: 301 MASKRNRITFSSLLSLYTNLGDKDGAWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEE 360

Query: 361 AEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTW 420
           AEKLYTEWESVSGTGDTRVPNILLAAYIN NQMEQAESFYNRMSLKGIVPSYTTWELLTW
Sbjct: 361 AEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTW 420

Query: 421 GYLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGH 480
           GYLKENQMEKVLHFFKNAVGSVKKWNADERLVK VCKKLEE+GNIEGAE+LLI+LRNAGH
Sbjct: 421 GYLKENQMEKVLHFFKNAVGSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGH 480

Query: 481 VDTEIYNSLLRTYAKAGKMPLVVAERMEMDNVQLNDETREFLRLTSKMCGTYHSS 515
           V+TEIYNSLLRTYAKAGKMPL+VAERME D+V+L++ETRE ++LTSKMC +  SS
Sbjct: 481 VNTEIYNSLLRTYAKAGKMPLIVAERMEKDDVKLDEETRELIKLTSKMCVSEVSS 533

BLAST of HG10022849 vs. TAIR 10
Match: AT4G02820.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 616.7 bits (1589), Expect = 2.3e-176
Identity = 318/527 (60.34%), Positives = 404/527 (76.66%), Query Frame = 0

Query: 1   MFRSLRPSLATAAARRFSGEAHKAAAENTAL-----------EGG--ADVVSVTGGGRDT 60
           + RS RP+LA +  R FS  A  AA  +TA            +GG  A+      GGRDT
Sbjct: 6   LVRSARPTLA-SIHRLFSAAA--AATVDTATAPVVKPRSGGGKGGESANKKETVVGGRDT 65

Query: 61  LGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALEVCEWMTLQ 120
           LG RL+SL + KRSAV++IRKW+EEGH++RKYELNRIVRELRK+KRYKHALE+CEWM +Q
Sbjct: 66  LGGRLLSLVYTKRSAVVTIRKWKEEGHSVRKYELNRIVRELRKIKRYKHALEICEWMVVQ 125

Query: 121 KDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAE 180
           +D+KL  GDYAVHLDLI+KIRGLNSAEKFFED+PD+MR  +ACT+LLH YVQN LS+KAE
Sbjct: 126 EDIKLQAGDYAVHLDLISKIRGLNSAEKFFEDMPDQMRGHAACTSLLHSYVQNKLSDKAE 185

Query: 181 ALMEKMSECGFLKSPLSFNHMLSLHISNKQLEKVPDLIQVLKKNTKPDVVTYNLLLNVCT 240
           AL EKM ECGFLKS L +NHMLS++IS  Q EKVP LI+ LK  T PD+VTYNL L    
Sbjct: 186 ALFEKMGECGFLKSCLPYNHMLSMYISRGQFEKVPVLIKELKIRTSPDIVTYNLWLTAFA 245

Query: 241 LQNDVEAAESIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRI 300
             NDVE AE ++L+ K+ K  PDWV++S L NLY+K    EKA   LKEMEK+ S++NR+
Sbjct: 246 SGNDVEGAEKVYLKAKEEKLNPDWVTYSVLTNLYAKTDNVEKARLALKEMEKLVSKKNRV 305

Query: 301 SFSSLLSLYTNLGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEW 360
           +++SL+SL+ NLGDK+GV   WKK+KS F+KM+D+EY  MIS++VKL E  +A+ LY EW
Sbjct: 306 AYASLISLHANLGDKDGVNLTWKKVKSSFKKMNDAEYLSMISAVVKLGEFEQAKGLYDEW 365

Query: 361 ESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQM 420
           ESVSGTGD R+PN++LA Y+N++++   E FY R+  KGI PSY+TWE+LTW YLK   M
Sbjct: 366 ESVSGTGDARIPNLILAEYMNRDEVLLGEKFYERIVEKGINPSYSTWEILTWAYLKRKDM 425

Query: 421 EKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNS 480
           EKVL  F  A+ SVKKW  + RLVKG CK+LEEQGN++GAE+L+ +L+ AG+V+T++YNS
Sbjct: 426 EKVLDCFGKAIDSVKKWTVNVRLVKGACKELEEQGNVKGAEKLMTLLQKAGYVNTQLYNS 485

Query: 481 LLRTYAKAGKMPLVVAERMEMDNVQLNDETREFLRLTSKMCGTYHSS 515
           LLRTYAKAG+M L+V ERM  DNV+L++ET+E +RLTS+M  T  SS
Sbjct: 486 LLRTYAKAGEMALIVEERMAKDNVELDEETKELIRLTSQMRVTEISS 529

BLAST of HG10022849 vs. TAIR 10
Match: AT1G02150.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 273.9 bits (699), Expect = 3.6e-73
Identity = 148/425 (34.82%), Positives = 253/425 (59.53%), Query Frame = 0

Query: 50  RRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALEVCEWMTLQKD 109
           +++  +  P+  A   + +W++ G  L K+EL R+V+ELRK KR   ALEV +WM  + +
Sbjct: 71  KKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDWMNNRGE 130

Query: 110 -MKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEA 169
             +L   D A+ LDLI K+RG+  AE+FF  LP+  +D+    +LL+ YV+    EKAEA
Sbjct: 131 RFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKSREKAEA 190

Query: 170 LMEKMSECGFLKSPLSFNHMLSLHISNKQLEKVPDLI-QVLKKNTKPDVVTYNLLLNVCT 229
           L+  M + G+   PL FN M++L+++ ++ +KV  ++ ++ +K+ + D+ +YN+ L+ C 
Sbjct: 191 LLNTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKDIRLDIYSYNIWLSSCG 250

Query: 230 LQNDVEAAESIFLEMKK-TKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNR 289
               VE  E ++ +MK      P+W +FST+A +Y K   TEKA   L+++E   + RNR
Sbjct: 251 SLGSVEKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEARITGRNR 310

Query: 290 ISFSSLLSLYTNLGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTE 349
           I +  LLSLY +LG+K  +YR+W   KS+   + +  Y  ++SSLV++ ++  AEK+Y E
Sbjct: 311 IPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDIEGAEKVYEE 370

Query: 350 WESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQ 409
           W  V  + D R+PN+L+ AY+  +Q+E AE  ++ M   G  PS +TWE+L  G+ ++  
Sbjct: 371 WLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEILAVGHTRKRC 430

Query: 410 MEKVLHFFKNAVGS--VKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEI 469
           + + L   +NA  +     W     ++ G  K  EE+ ++   E +L +LR +G ++ + 
Sbjct: 431 ISEALTCLRNAFSAEGSSNWRPKVLMLSGFFKLCEEESDVTSKEAVLELLRQSGDLEDKS 490

BLAST of HG10022849 vs. TAIR 10
Match: AT5G27460.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 270.0 bits (689), Expect = 5.2e-72
Identity = 166/471 (35.24%), Positives = 267/471 (56.69%), Query Frame = 0

Query: 29  TALEGGADVVSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVREL 88
           ++L  G+D  SV    R++L + ++    P+RS    +++  + GH +   EL  I + L
Sbjct: 24  SSLADGSDTSSV--ANRNSL-KEILRKNGPRRSVTSLLQERIDSGHAVSLSELRLISKRL 83

Query: 89  RKLKRYKHALEVCEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDL---PDKMR 148
            +  RY  AL++ EWM  QKD++    D A+ LDLI K  GL   E++FE L      MR
Sbjct: 84  IRSNRYDLALQMMEWMENQKDIEFSVYDIALRLDLIIKTHGLKQGEEYFEKLLHSSVSMR 143

Query: 149 -DQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLEKVPDL 208
             +SA   LL  YV+N + ++AEALMEK++  GFL +P  FN M+ L+ ++ Q EKV  +
Sbjct: 144 VAKSAYLPLLRAYVKNKMVKEAEALMEKLNGLGFLVTPHPFNEMMKLYEASGQYEKVVMV 203

Query: 209 IQVLKKNTKP-DVVTYNLLLNVCTLQNDVEAAESIFLEMKKTKT-EPDWVSFSTLANLYS 268
           + ++K N  P +V++YNL +N C   + V A E+++ EM   K+ E  W S  TLAN+Y 
Sbjct: 204 VSMMKGNKIPRNVLSYNLWMNACCEVSGVAAVETVYKEMVGDKSVEVGWSSLCTLANVYI 263

Query: 269 KKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTNLGDKNGVYRIWKKMKSLFRKMSDS 328
           K    EKA   L++ EKM ++ NR+ +  L++LY +LG+K GV R+W+  KS+  ++S  
Sbjct: 264 KSGFDEKARLVLEDAEKMLNRSNRLGYFFLITLYASLGNKEGVVRLWEVSKSVCGRISCV 323

Query: 329 EYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRM 388
            Y C++SSLVK  +L EAE++++EWE+     D RV N+LL AY+   ++ +AES +  +
Sbjct: 324 NYICVLSSLVKTGDLEEAERVFSEWEAQCFNYDVRVSNVLLGAYVRNGEIRKAESLHGCV 383

Query: 389 SLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGSVKK--WNADERLVKGVCKKLEE 448
             +G  P+Y TWE+L  G++K   MEK +         +++  W     +V  + +  E+
Sbjct: 384 LERGGTPNYKTWEILMEGWVKCENMEKAIDAMHQVFVLMRRCHWRPSHNIVMAIAEYFEK 443

Query: 449 QGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPLVVAERMEMDNV 492
           +  IE A   +  L   G     +Y  LLR +  A +    + E M++D +
Sbjct: 444 EEKIEEATAYVRDLHRLGLASLPLYRLLLRMHEHAKRPAYDIYEMMKLDKL 491

BLAST of HG10022849 vs. TAIR 10
Match: AT1G60770.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 257.3 bits (656), Expect = 3.5e-68
Identity = 147/431 (34.11%), Positives = 240/431 (55.68%), Query Frame = 0

Query: 78  KYELNRIVRELRKLKRYKHALEVCEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFF 137
           K+E+   +++LR    Y  AL++ E M  ++ M     D A+HLDL+AK R + + E +F
Sbjct: 55  KWEVGDTIKKLRNRGLYYPALKLSEVME-ERGMNKTVSDQAIHLDLVAKAREITAGENYF 114

Query: 138 EDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQ 197
            DLP+  + +    +LL+ Y +  L+EKAE L+ KM E     S +S+N +++L+    +
Sbjct: 115 VDLPETSKTELTYGSLLNCYCKELLTEKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGE 174

Query: 198 LEKVPDLIQVLK-KNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKT-KTEPDWVSFS 257
            EKVP +IQ LK +N  PD  TYN+ +      ND+   E +  EM +  +  PDW ++S
Sbjct: 175 TEKVPAMIQELKAENVMPDSYTYNVWMRALAATNDISGVERVIEEMNRDGRVAPDWTTYS 234

Query: 258 TLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTNLGDKNGVYRIWKKMKSL 317
            +A++Y    L++KA   L+E+E   +QR+  ++  L++LY  LG    VYRIW+ ++  
Sbjct: 235 NMASIYVDAGLSQKAEKALQELEMKNTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLA 294

Query: 318 FRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQA 377
             K S+  Y  MI  LVKLN+L  AE L+ EW++   T D R+ N+L+ AY  +  +++A
Sbjct: 295 IPKTSNVAYLNMIQVLVKLNDLPGAETLFKEWQANCSTYDIRIVNVLIGAYAQEGLIQKA 354

Query: 378 ESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAV----GSVKKWNADERLV 437
                +   +G   +  TWE+    Y+K   M + L     AV    G   KW      V
Sbjct: 355 NELKEKAPRRGGKLNAKTWEIFMDYYVKSGDMARALECMSKAVSIGKGDGGKWLPSPETV 414

Query: 438 KGVCKKLEEQGNIEGAEQLLIILRN-AGHVDTEIYNSLLRTYAKAGKMPLVVAERMEMDN 497
           + +    E++ ++ GAE LL IL+N   ++  EI+  L+RTYA AGK    +  R++M+N
Sbjct: 415 RALMSYFEQKKDVNGAENLLEILKNGTDNIGAEIFEPLIRTYAAAGKSHPAMRRRLKMEN 474

Query: 498 VQLNDETREFL 502
           V++N+ T++ L
Sbjct: 475 VEVNEATKKLL 484

BLAST of HG10022849 vs. TAIR 10
Match: AT3G02080.1 (Ribosomal protein S19e family protein)

HSP 1 Score: 256.1 bits (653), Expect = 7.8e-68
Identity = 119/143 (83.22%), Positives = 132/143 (92.31%), Query Frame = 0

Query: 543 MATARTVKDVSPHEFVKAYAAHLKRSGKVELPPWADIVKTARFKELAPYDADWYYVRAAS 602
           MAT +TVKDVSPH+FVKAYA+HLKRSGK+ELP W DIVKT + KELAPYD DWYY+RAAS
Sbjct: 1   MATGKTVKDVSPHDFVKAYASHLKRSGKIELPTWTDIVKTGKLKELAPYDPDWYYIRAAS 60

Query: 603 MARKIYLRGGLGVGAFKRIYGGSKRNGSRPPHFCESSGAIARHILQQLQEMNIVDVDPKG 662
           MARK+YLRGGLGVGAF+RIYGGSKRNGSRPPHFC+SSG IARHILQQL+ MNIV++D KG
Sbjct: 61  MARKVYLRGGLGVGAFRRIYGGSKRNGSRPPHFCKSSGGIARHILQQLETMNIVELDTKG 120

Query: 663 GRRITSSGRRDLDQVAGRIVVAP 686
           GRRITSSG+RDLDQVAGRI V P
Sbjct: 121 GRRITSSGQRDLDQVAGRIAVEP 143

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
KAG6593129.1	0.0e+00	89.05	Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a...
XP_038900168.1	2.7e-264	91.70	pentatricopeptide repeat-containing protein At4g02820, mitochondrial [Benincasa ...
KAA0064089.1	3.8e-258	89.73	pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK18492...
XP_008451368.1	7.1e-257	89.15	PREDICTED: pentatricopeptide repeat-containing protein At4g02820, mitochondrial ...
XP_022150266.1	2.2e-250	87.94	pentatricopeptide repeat-containing protein At4g02820, mitochondrial isoform X2 ...

Match Name	E-value	Identity	Description
Q9SY07	3.2e-175	60.34	Pentatricopeptide repeat-containing protein At4g02820, mitochondrial OS=Arabidop...
Q8LPS6	5.1e-72	34.82	Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana OX...
Q3E911	7.3e-71	35.24	Pentatricopeptide repeat-containing protein At5g27460 OS=Arabidopsis thaliana OX...
O22714	4.9e-67	34.11	Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana OX...
Q9SGA6	1.1e-66	83.22	40S ribosomal protein S19-1 OS=Arabidopsis thaliana OX=3702 GN=RPS19A PE=2 SV=1

Match Name	E-value	Identity	Description
A0A5D3D4M6	1.8e-258	89.73	Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3BRD9	3.4e-257	89.15	pentatricopeptide repeat-containing protein At4g02820, mitochondrial OS=Cucumis ...
A0A6J1DB09	1.1e-250	87.94	pentatricopeptide repeat-containing protein At4g02820, mitochondrial isoform X2 ...
A0A0A0K7E2	5.0e-248	86.43	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G390000 PE=4 SV=1
A0A6J1D809	5.5e-247	84.49	pentatricopeptide repeat-containing protein At4g02820, mitochondrial isoform X1 ...

Match Name	E-value	Identity	Description
AT4G02820.1	2.3e-176	60.34	Pentatricopeptide repeat (PPR) superfamily protein
AT1G02150.1	3.6e-73	34.82	Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G27460.1	5.2e-72	35.24	Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G60770.1	3.5e-68	34.11	Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G02080.1	7.8e-68	83.22	Ribosomal protein S19e family protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	COILS	Coil	Coil	coord: 263..283
None	No IPR available	PANTHER	PTHR45717:SF45	OS12G0527900 PROTEIN	coord: 45..502
None	No IPR available	PANTHER	PTHR45717	OS12G0527900 PROTEIN	coord: 45..502
IPR001266	Ribosomal protein S19e	SMART	SM01413	Ribosomal_S19e_2	coord: 548..684; e-value: 1.3E-84; score: 297.1
IPR001266	Ribosomal protein S19e	PFAM	PF01090	Ribosomal_S19e	coord: 548..682; e-value: 9.2E-55; score: 184.1
IPR036388	Winged helix-like DNA-binding domain superfamily	GENE3D	1.10.10.10		coord: 526..684; e-value: 8.1E-70; score: 235.7
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 214..259; e-value: 2.2E-10; score: 40.6
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 360..387; e-value: 0.037; score: 14.3
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 287..314; e-value: 0.63; score: 10.4
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 149..178; e-value: 4.9E-4; score: 20.2
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 324..347; e-value: 0.031; score: 14.5
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 360..390; e-value: 1.6E-4; score: 19.6
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 217..251; e-value: 1.2E-6; score: 26.3
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 149..178; e-value: 4.0E-4; score: 18.4
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 215..249; score: 10.807899
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 146..180; score: 8.911594
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 355..389; score: 9.755614
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 65..205; e-value: 2.8E-15; score: 58.1
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 319..437; e-value: 3.4E-16; score: 61.6
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 206..318; e-value: 3.0E-22; score: 81.4
IPR018277	Ribosomal protein S19e, conserved site	PROSITE	PS00628	RIBOSOMAL_S19E	coord: 632..651
IPR036390	Winged helix DNA-binding domain superfamily	SUPERFAMILY	46785	"Winged helix" DNA-binding domain	coord: 548..683
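
Each alignment entry above gives a residue range in the form `coord: start..end`. The following minimal Python sketch pulls those ranges out of the text and computes their lengths. It assumes the coordinates are 1-based and inclusive, which this page does not state, and `parse_coords` and `span_length` are hypothetical helper names.

```python
import re

# Matches ranges written as "coord: 548..684".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Return every (start, end) pair found in the text, in order of appearance."""
    return [(int(start), int(end)) for start, end in COORD.findall(text)]


def span_length(start, end):
    """Number of residues in the range, assuming 1-based inclusive coordinates."""
    return end - start + 1


# The four PF01535 (PPR) hits listed for this gene.
ppr_hits = parse_coords(
    "coord: 360..387 coord: 287..314 coord: 149..178 coord: 324..347"
)
ppr_lengths = [span_length(start, end) for start, end in ppr_hits]
```

Sorting the parsed pairs by start position gives the order of the hits along the protein.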

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
HG10022849.1	HG10022849.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0006412	translation
cellular_component	GO:0005739	mitochondrion
cellular_component	GO:0005840	ribosome
molecular_function	GO:0005515	protein binding
molecular_function	GO:0003735	structural constituent of ribosome