HG10012104 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10012104
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: LAGLIDADG_2 domain-containing protein
Location: Chr01: 17734644 .. 17746033 (+)
RNA-Seq Expression: HG10012104
Synteny: HG10012104
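The location field packs the chromosome, coordinates, and strand into one string. As a minimal sketch, the helper below (a hypothetical `parse_location`, not part of any database API) splits such a string and derives the span length, assuming the coordinates are 1-based and inclusive, which the page does not state explicitly.

```python
import re

def parse_location(text):
    """Split a 'Chr01: 17734644 .. 17746033 (+)' style string into its parts.

    Assumption: start and end are 1-based, inclusive coordinates.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,  # inclusive span in base pairs
    }

loc = parse_location("Chr01: 17734644 .. 17746033 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the gene spans 11,390 bp on the plus strand of Chr01.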
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCAGGTAAAGCCTTTCAAAACAGTTTGCTTTATGAAAAGTTCCTTAAAGAAATGTGTGTTTTTTTGTTCTATTAAAAGCTTAATGGGTTTCTGTTGTTTTGATGAATTGTAGCAAGTTCTTCAATGGGTTTTCAAGGAAACAAATGAAAAACAAGGACAGAGGACTTCACAATCCACCAAGAAGGAGACAAGTTGGGGTAAGAGAAACATGCACAGTACCTGTTTGATAAAATGTCAAAGTGGCAACGATTTGTTTTAAGTGTCATGAAACTTTCTTTTTTTTGCCAGATAAAGGAACTGCTGCTAGATCAAAGGAGATAGTGGTGTCTAGTTCATGTGGGAGTTCTCAAAGATCAAAGGGATTTGAAGGAAGGAAATTTGATCTAAAGAAGTTGAAATCATTTGCTTTGTTGTGCAGAAAGGATATTTCAAAGTCTTGCTTTTATAGTACCATTAATATAAAGAGACCAAGAACACAGGGCTTTGATAATATGCACCATTATTGGATCCAAAAGATGAAAAAAGAAGATATTGCTAAGGGAATTAGAGAGATTGGTCAAAAGGCAGATTCTTCTTCTTCTTCTGTTCATGCTGGCACCAAAGTCTTGCCAATAACTGATGCCACTTTAGTAGAGCAATGCAGCAGTAAGACTGAGAAGAAAGTTTTGGCCAATGGAGAGAGTAAAACCAAAACCAAATCAAGAATGAAGGAACTTTTGAGATGGGCTATGACTTCAAGATCAGAAAAAGGAGGGAAGTTCATCACTGGCAAGGTGATTTTCTTAACTTCTTAATAACTTTCCCCTTGTTTGGACATCTGGAAATCCTTTATTTGTACAAGTAATCAACTATGTTTCTATTGGTTTGAATCATTTACTCCTTAAACCACCACATAACTCAATAGCTTAAGCTTATGAGTTCAGGTATATATATACTCTTAATACCATCTTTAAGTATGGGCTTGAGGATTAATTTGACTTGTGGAACCTGAAACATGAATTGTATCAACATCAATCGAGTTAGGACCTCTTGTTTTGATACAATATTAAACTACTTCTAAATCCAAAAAAAAGCTCTTAGGTTGAAATAAATTTTATTTTACCAAATCCATATTGAATGGTGACTGTGATCTGTATCCATATTGAATATGAAAGAAGCCAAAGCATAGGCCCTCGGTGTCAAATCACCATTATAGCTATCAGCATCCTTATTGATGAATAGCTAGCTTATTTTAAGGGTCTATATGGAATGACTTTTTGAGTGCTTAAACTTCTAAACACTTGAAAAGTAATTCCAAACAGTCGAGATATATGTTTCAATGATGCAATAATTTTCAAACTGAATGGCTTAAAATTGTGCAACTTCTTTCAGGTTTCACGAATGAGAAACCGAGGAACTCTTAAAGCAGGTTTGGACGATGATCGAGGGAGTAACGACTCCCCCAAGATTAGTTTCAGATGGGAAGCTGAAAGCTGCTCCTCCATTTCCTCAGCCTACTCGTCGGTGTCGGCGGTGTCCCCATTCAAAAACTGTTCCATCACACTGAATTCCACTGCCATTCATGAAATCAATCAATATTATCCAAGAAGAGGAAGCTGGATCACTACAGATTCTGAATGTAAGATTGTAATCCCCATTTTCACTAGACTGATCACTTATGAAAAATTATTATAGAACAAAAGACATGAGGAACTGACTTAACTTTCCTTCTCTGCCTTGAATGGACAGTTGTTGTGCTAGAGCTTTGAAGGAGAAGGGAAGACAGATGATCTGCAATGGCTTCATCAATGTGGTATCTGCTTGCACTATGTGAATTGTAGATTCTATGCAAAACCAGAGGATTTGTTTGTAAAAATGCTTCAAATATCTATGCAAAACATGTATCTGAAATAACATTTTGATTGTAACTCAAATAATCATCTAACAAGTCTTTGGTGTAAGAGGTTAACTGAGCTTCACATCTGCATTCCCTTTTTGCAATGCGATTCTTTTTTTTAC
CCGAGTTTTTGGAGGTCGAGATTGAGATAAGAAAAGGCATCTTAGGATATGAAAGAGTTAGAATTGGTTGCTAATGTTTCTGTTTCTTTTTCACTGTCTCTCGTTTCCATTTTTTCAGAACCAAATGTATTTGGTAAATGTTCTTGTCTTTTGGTTTCTCATTTTTTTTAGAAATGTTTTTTAAAAAACCAGTTTCTTGGTTCTAATCTTCCTTTCTTTATTTAGAAAGGAAAATTGTCCAGAATAACCTCTAAGTTTTCTTTTTTCAAAACTAGCACAATTTTGCAAAACTCGTTATAATAGTGTTTCTTGTGCCATCTCGACCGAGATCGCACTCCAAGATTTTAGTGGAACGTTGAGTTTCATCTTTTTGATCCATAGGAAAAGACGATGATACCCTATTACCTTCAAACCTCAAACGTTTTCTTTATCAGCAAACTTTAGCATTTTTTTCCTTACATTTTGTTCATCAATCATTTTGAATCTCTGAAACTCTCTCAGTCTACACGTTTTTTTTGTGCCATTTCAATTGATTTCAACTCGCTCAAGTAGAGGTATTTCACATTTTTCTGTCTTTTTTTTAGTTAAAGGTTGTATTTTTAAGTTTATGAGGGTGAGTTTTTGAAGTTTAGGGTTTAGAAATTTTTGAAATATGACACCATCTATTGCATTTTCTTGGGTCTTTACCATGTTTTTTGGTAAAGATATTTGAATGCGTACTGTTTTCATGGCTTTTTCTTTTTGGGGGGGGGGGGGGGGGGGGGGGGGGAATAAATGTGGATTTGCTCGGGAACAGGTTGAATCTCCCCCAAGATTCATTCAATCTCTCTCGAGATTGTTCCAAAATTTGAGCTTTAAGACATTATGTGTTTGGTCGGTGCCTTTATCTCCCAAGACCAAATGCATAATGACTTAAAGATTGGAGCTCGGGGCTTGCCTTGGTCCTGGATGCCCGAGATTCATGCTTAATTCTACCCACTGTGTTCAATCTTGGAGCTCGAGCCCAGGATTATCTCCAAGATTCAAAACGATTTCTCAAATTTAGGTCAATAATTGTTACGTCTAATCTATTGGTTAGGTATTGAGTTTAGTTTTTTTATTTCTATTTGTTTTATAGATGCCATTAGTTTCCGAAATCCATAGTTCTAATTATTTTCTGGCCAACCTAACTTGTTGTTCCCATCTAGGAAAAAAAAATTTCCAACATTAAGAATAAGTTAACACCTAGACAATTATATATGTGTAGGCAAACCTATATTTAGATATTTTTTGGATCTTAAGCTATTTTTTAATGGTCCTCTGTGCCATTATATTCTTCTAAGGGAGGTGGAGGAGACTAGAGAGGACACTATTAGCTTTAAGTTATTGGAGAGAAAGGTGTCTTTTGGTCAAGAGCAGTTCGATCTAATGATGGAGTTGGGATATAGCTCTAAGCATGTGGTTGTGGCTGATCTCGGTCATAATTCAACATGTATTAATTTTTAATCTATTTCAAACTTTTTTTATTGAGTTTTCACACCATTCCACCATTCAATTTTTCATTTCCAATAAAAACAACTTGAACAAATAACATGTATTAGTTTTTAGCCCAATTCAAACTTTTTTCATTGAGTTTTCACATCAATTCATCATTCAATTCTTCATTTCAATAAAAATAATTTGAAAATTTCAGTTCATAATCAATTTTTTGGTCTTCACTATTATCATTATTCCACTACCAATCATAGTTGGCTTCTAATTGTTGATTTATATGCTTAAATTTATGATTTATTACTTAAATAAGTTTGTCATAAAATTTCACTAAAAAATTTACAATATTTTTACAAATTTATGTGATATCCGTGATTTTTTCTGAAATATTCGTCGATATCTAAATATCCGTCGATAAATCCGTAAATCTGAGTTACCGATATTTACACCAATAATGATATTTTCGTCCTTGGTCATGGTTGAGTTCAAATCGATCAAGAACATGCTTATGACTTTTGAAGGTAAGTTAC
AATCTTTCTCTGTGCTGGTTCATATTTTTAGGGTTTACGTTCTAAATTTTCTGTAATTTAGGTGTTTAAGATCTATATTTTTTGTTATCTTATAGGGTAAGTAAGTTGATCCTCAGCAGTATTCTGGGGTTGGGAAGGTCTGGTATTCATACCGAATGCCTGATTCCAACGGCTTGGTAGCCGAGGATCCACCTGTTGAGGATCTACCAGTTGATCGCACATTATTCAATGGACTTAATGATCGCACATCATTTGAACTCATGTGGTATTCAGGAATAAAATCAAGGTGAGAGGGAAATGAGGGTACATTTTAGGCTATCAAGATTGAAAATGACGGTATTATACTCCTAGTGTTATGAACTACTGTTGGTAACTTAGACATCAAAAGAGCCATTTGAGACATATCGTCACTAGTTAAGAAAAAATGAAGATCTTCTTGATTACTAATCACAAAAGCAGAGACACTTGGCCTAACATTATCAAACACCTTATAACCAACTCAAACTCAACAGAATTTATCTCACTTATTATATATATTTGCTGTAAAAATTTCTCATACATTATCCCGACGTCTACAAACAAACACTTCAATTGGTCCGATCCCCTACATATTCTTTATCCTTCTCCTTCCATTCGCCCTTAAAACAAACAAATATATAAGGAATTCTGAAAAAAGACAAACAACCATACAAAAAGTTACTAAAATTCATCAAAATCTCAGGGGGAATACTGAATCTCAGTAACAAAATGTTAATATCTCATCCAAGATTTGGGTGTTGTTGAATGCATCTCGGGGCAAGTCACATTGATCTTGGGCGAGATTATTGTATAGTTGTAGTAAGAAAGCGACATCAAAGTCGATGGGAAGCAGGATACTGTGTTCGACGGCTTGACATCTTCATTACCACCCATTTTTTTCATCACCTTTACTACTTTCATTATCACCCTGAAAATCTTCTGTATTGCTACAATCTTATTTTTAATATACATATCATGAACGTTGTCATCTTCATTACTTCCACTTTCATTTTAACATCCATCATGAGTAGCTAAATCTCTCAAGGGTTGGGTTAAATAAATCATCATTCTGATGAACGTCACTTAGATTCATTTTGTTTAGTGTGTCCTGATTCTTTTTGCTTGTTTTCATCTAACTAATCGAAACAAGTAACAATAGTTAACTGCTTGAGAGAGTAAGAAATTGTGGAGCTCACTAAACAAAGTAAGGAGTATCTTTAAAGATAAAGACAATCTTACGTTCAAATAACCTTCAAGCATGCTTCGTAGAAATATGACTTATATGACTCACTCAACGAGAGGTTGAGATTCTAGTTGTGGTTAAGAACAATGAACCGAAACCTAAGTGAAGTTCAAAAGAATGATGATTGTTTTTCCCCAACCTTGTTTCTATCAATTGCATTTTTACATCGTTTCAATCTTGCTTCACTACCATGCACCTTGCAGCTCATTCACTCTCATCCTACTGTTTATACATCTTGAATATCTTGAATACATGATCGTTTAAATACTTGACATTGCTAAGAATCATCCATAAACCATTCTCTAGTTAGTTGAATCAACGAAACCCCTGTGTTCGACCTTGATCCTCACCAAGAAATCCCTTTTTGAGTTTATACTTGGGTCCATTATAGAAAACTTGTAGGAAATTATTATAAATCATCAACACCACGGAAAGACAAGATGGAGGAATAAAATTTTTTATTTTCTAACTCAATCTCAGTCTCGATCTCAGCTGGGATGCCTCGAAAAAAACTATTCTAACAAGTTTTGAAAAATCGTGTTAGTTTTGAAATATGGAAACTTGAGGGAGTTATTTTGGACAATTTTTTATTTAAAAAGCTACTATATTTAGTTTATACATTTTTTTATATAAAATTTCATAGGAAAGAAGAAAAAAAAAAGAATGATTATCGAACATGTTTTCATTTTTTTTTCTATTAAAAAAAGAAAGAAAAAGAAATAATAAACTGCTAACACAT
GCAATTATATTTTTCTTCCACTAAAAAGTAAAAATGGAAAATTATAAATGAGAAACAAGAAACACAAATAATTTAAATAACATGAGAAATTGGAATAAGAATTGAGAAACAATTAGATTCAGATGTGAGTATCTCACCATCTTCAAGGGAGAATGAATTCTAAAAATTAAAAAAAAAAAGCTCTTGATTCTTAATTATCATAGGTTATATATATATATATATATATATATATATATATATATATATATATATATATATATATATTTATATTATTTTAAAATTTAAGTGTATTATTTAAAATTTTGAAAGTTGAGCAGTTTTGAGGAATGTGGGAAAAAGAATAAGGGTTTATTTGACATGCAATGTGAAAATTGAAAATTGAAAACAATGGATTCAACGGAAACATTGTTGTATTTCATATTTTCATATATGTATTCGGTAGCAGGTTTATAAATTGAATATAAATTTAAACAATTATTCAAAATCTTTTAGATAATATATTTCTAAACCATAGAACTAATGGTAGTTACCCACGAACTATTAGATTGAATATAAACAATTATTAATTTAGTTATGAACTTGTTTTAGATAAAAACTTTATATAATTATTATTTTGTAATATCATATAATTTATAATACATATTTTAATTTAACAAAATAATTGAGTTTATAACATAATATACTATACTTATTAACATTTTTTAAATATAATGCTGCTTTTGAAAATTTTAAATTCAAAGGTTGAAAACAATGAAAACATTAAAAAAAAAAACTAAATTATAAAAAATGCCCCTAAATTTTGCATTTTGGGTAAAAAAAAATACTCTCGAACTTTTAAAAGTTTAAAAAGTACCTTAAATTTTTAAAACGAATTCAAAAATACCCATACAATTAGTTTTGGATGAAAACGGTTAGAGTTTTGTTTAAAAAATACCCTAACTTTCAAAAGTTTCAAAAATACCCTGAAACTTTAAAAAAATTCAAACATACTCCATCGTTAGTATATGAACAAAAACCATTAATACCTCATTCGAAAAATAACTCTAATCTTTTTATAAAAAAAATTTAAATGCTCCTATCATTAATATGGTGATCAATTTGTTTGAAGTTTTTCTTAATTCGAAAAAGCCAATTATAAAACAAATTATATAAATATATTTTTTTATATAGACACCCTCCCTTTTCTCTTATTTTTCTAATTTTTATACTAAAATTTTTAGCAACCAAAATCAAAATCAAAATTTTAAATTAAAACAAAAAATCTCACAAAATTCTAAAATTAAAATCTCCCTTTAAAACATACCAAAACAATAGTCAGATAAAAAACGTATTTGGTTATTAACGCATACTCAACTACTAAAATACAACTTTGTTAAAACATTTAAGTCCTAAAATCTCTTGGTGCTTCCATTATGTTTTTTTTTTTTTAATTATAGTTCACATACCTTAACTAAAGTATACAATTTTATTTATTTATTTATTTTAGATATACAAATGTCAAACATTTAAAAAAAAAAGAGAAGCACGAGAGTATTAAAGAAGAGAGAGAAGAAAAAAGAGAGAAAATAAAGAAGTAAAGTGGTATTAACAATTTTGTTCATATAATAGTAAGAGTATTTTTTAAATTTTTTTAGTTTAACGGTATTTTTTTAACTTTTAAAAGTTTAAGGTTATTTTTTTACATAAAATACTAATACGTTTCATCTAAAACTAACGGTAACGGTATTTTTGAAAAATTCAAAGTTATTACTGATGTCAAGTACAAAGTTTAAGAATATTTTATATAATTTAATGTTGTTTTCCTAATTTATGTTGTCTGGATTTTCAAAAACAATTTTAAAAAAACAGAGTATAAATAAGTTTAAGAAGAATTTTTAAAATTAGAAAAATAAGAAAAACTATTTACACAAAATAATAAAATTTTTAGATAGTTGTCATAGACTAAAAATTTATCGGTGATAAAAATAATAGAAGTCTATCATTAAGTCTATCAAATATTT
TTTTTTTTTAATTTTCAGTAAATAATTTGACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAATTTTCGTAAGTTTAAGGTTTTAGGCTTTATCCTCCTTTTTGCATTTTTCTGTTTTTTCTCAGCTCAAAACTACGCTAAAAAAACACTCACTGCCAAGCTTTTTTTCCTCTTGCCCGTTAGGGTTTCTGTTTTTCTTCTTCATTTTCCCTCTCTCGCTTCCCGCCTTGCCAGAGAACCGCCATGGATAGGTACCAAAAGCTCGACAAACCTAAACCTAAATCTCCTCTGAATGAAAATTAAATCCGCATTACCAGTCAAGGCGCTATTCGAAACTAAATCACTTATGCTTCCACCCTTCTCCAGGTATCATTTCACTCTTTCTTTTATCGTTTTCAACAGCTATTGTATCTCATTTCATTTTTTTTCTTTTTGGTGTATTCGTCTCATTGTTTCTGTTTATGTGTTGCGTTTGTTGATTTTTTTTTTCTTTAATGCTGAATCTGTAATAGTTGCTCGCTGGCTGATGATTTGAAGTCCTCCCAAATTCGGCTCTGTCACTCCCAACTCTGCATTTTCTATCCAACCGTAATCCACCTTCCGTTTTCTCCATGTCCATTCATACCTCTGCATTTTCAACTGTGACCCTTCTCCGTTCTCTCACTCTTCCCTTCTCTCCATACCATCACTACTTTCGTTGTCCCGATTACATAGTCCGTACTCTCTTTATCCCAGCGTATTCTGTAAAAGGACGACGACAACTTCCGCGGATTCCTGCCTTTGCTTCCAGTTCTTTTGTTGAACAGCTAGTGTATGACCGAGATTCCCCGTTCGAGTCTGAAGAGCACTTATCATCTCCATACAGTAATGGGGCTGATGGTTTTCAATCTGAAAATGGTTTTGCGTCAGCGGATTTGAAACATTTAGGAACGCCTGCGCTAGAAGTCAAGGAGCTGGATGAGTTGCCAGAGCAATGGCGAAGATCCAAATTGGCTTGGCTTTGTAAGGAATTGCCAGCACAAAAGCCGGGAACATTGATACGGCTGCTTAATGCTCAGAAGAAATGGCTGGGGCAAGATGACGCAACCTATCTCACCGTGCATTGTTTGCGTATTCGCGAAAACGAGACTGCATTTAGGGTTAGTGTCTTTTCTCTAGACTGCATTTAGGGTTAGTGTCTTTTCTCTTTCTTCTTTATTCTATTATGTTCCAACACCATAGGTATCACAACTTGTAATGCAATTATTGGTTTGTTGGGTTGAATAATAGGGAATTATTTGGAAGTGTTCAATTTTGGCTATGCTGTGCTTTATATATTTGGATTATATAATACTTGTAAAAAAAGTTTGGAATGTAGTAGTAGGGCTCTGAAGATAACTGTAGCATGAACACCATTTGGAAAGAACGAAATCTCTGGAATAAAAGTAAATTTTGATAGTTCTCTTTATTTGCTATTTTTTGTAACCTATGTTTGTGATATAATTTTCTTTCTACTTTTTCATTTTTTTCATTTTGGAGATTAGGTGTACAAGTGGATGTTGCAACAACGTTGGTACCGATTTGATTATGCTTTAGCTACTAAGCTTGCTGATTACATGGGCAAGGAACGAAAATTCTCAAAGTGTCGGGAAGTATTTGATGATATAATTAATCAGGGCTGCGTGCCAAGTGAATCCACATTTCATATATTGATTGTTGCATACCTTAGTGCACCTGTTCAAGGATGCATAGAGGAAGCAAGTACCATTTACAATCGTATGATTCAGTTAGGAGGTTACCAACCACGTCTTAGCTTGCACAATTCTCTCTTTAGAGCTCTTATGAGCAAACCAGGGGATTTGTCAAAGCATCATCTTAAACAGGCTGAGTTTATATATCACAATTTGGTAACAAGTGGACTTGAGTTACATAAAGATATATATGGTGGTCTAATTTGGCTACATAGTTATCAGGATACTATAGACAAAGAAAGGATAGTGTTTCTAAGGAA
AGAAATGCAACAAGCAGGAATCAAGGAGGAAAGAGAAGTCCTTTTGTCCATATTGAGAGCGAGCTCAAAAATGGGGGATGTAGTGGAAGCAGAAAGATCGTGGCAAAAACTTACGTATTTTGATGGCAGCATGCCATCTCAAGCTTTTGTTTACAAAATGGAAGTCTATTCAAAGATGGGCAGACCAATGAAAGCTCTGGAGATCTTTAGGGAGATGGAGCAGTTGAACTCTACAAGCGCTGCAGCATATCAGACAGTTATTGGTATTTTATGTAAATTTCAAGAGATAGAACTTGCAGAATCAATCATGTCAGGCTTCATAAAGAGTAATTTAAAACCCCTCATGCCAGCTTATGTTGATTTGATGAATATGTTTTTCAATTTAAGCTTACACGATAAGTTAGAGTTAACCTTCTCTCAGTGCCTTGAGAAGTGTAAACCCAATCGTACTATCTATAGCATATATTTGGACTCTTTGGTAAAAGTTGGTAACATCTACAAGGCCGAAGAAATATTTAATCAGATGGAAACAAATGGAGAAATTGGTATAAATGCTCGTTCATGCAACATCATTTTGAGTGGGTATCTTTTATTTGGAAATTATTTGAAGGCTGAAAAAATATATGATTTGATGTGTCAGAAGAAGTATGACATTGATCCTCCATTAATGGAGAAACTTGATTATGTCCTAAGCTTGAGTAGGAAGGAGGTTAAGAAGCCAGTAAGCTTGAAGTTGAGTAAAGAACAGAGGGAGATTTTAATAGGGTTGTTGTTGGGTGGCCTGGAGATAGAGTCTGATGAAGAGAGGAAAAATCATAGAATCCATTTTGAATTCCTCAAAAACAGTAACTCCCACTCTCTTTTGAGGAGACACATATATGAGCAATATCATGAATGGTTACATCCTGCTTCGAAGTTGAGCGATGGTGATATAGATATACCATATAAATTCTGCACTGTTTCACATTCATATTTTGGTTTCTATGCAGATCAGTTTTGGCCACGAGGCCATCCTGCAATCCCTAATCTAATTCACCGGTGGCTTTCACCTCGTGTTCTTGCATACTGGTATATGTATGGAGGCTGCAGGACATCATCAGGGGATATTTTACTGAAGCTAAAGGGAAGTCATGAGGGTGTTGAGAAGATTGTTAAATCTTTGAGAGAGAAGTCCATGCATTGCAAAGTGAAAAGGAAGGGCAACATGTATTGGATAGGTTTACTTGGAAGCAACGCCACATGGTTCTGGAAACTAATTGAACCTTTCATTCTGGATTACTTGAAAGATAGTCTACATGCAGACAGTCTTAACTTGGAGAGGGTTTTAAATGAAACTGAAAATATCAACTTTGATAGTCAATCTGATTCCGTTGGGGAGGCTTCTAATTAA

mRNA sequence

ATGCAGCAAGTTCTTCAATGGGTTTTCAAGGAAACAAATGAAAAACAAGGACAGAGGACTTCACAATCCACCAAGAAGGAGACAAGTTGGGATAAAGGAACTGCTGCTAGATCAAAGGAGATAGTGGTGTCTAGTTCATGTGGGAGTTCTCAAAGATCAAAGGGATTTGAAGGAAGGAAATTTGATCTAAAGAAGTTGAAATCATTTGCTTTGTTGTGCAGAAAGGATATTTCAAAGTCTTGCTTTTATAGTACCATTAATATAAAGAGACCAAGAACACAGGGCTTTGATAATATGCACCATTATTGGATCCAAAAGATGAAAAAAGAAGATATTGCTAAGGGAATTAGAGAGATTGGTCAAAAGGCAGATTCTTCTTCTTCTTCTGTTCATGCTGGCACCAAAGTCTTGCCAATAACTGATGCCACTTTAGTAGAGCAATGCAGCAGTAAGACTGAGAAGAAAGTTTTGGCCAATGGAGAGAGTAAAACCAAAACCAAATCAAGAATGAAGGAACTTTTGAGATGGGCTATGACTTCAAGATCAGAAAAAGGAGGGAAGTTCATCACTGGCAAGGTTTCACGAATGAGAAACCGAGGAACTCTTAAAGCAGGTTTGGACGATGATCGAGGGAGTAACGACTCCCCCAAGATTAGTTTCAGATGGGAAGCTGAAAGCTGCTCCTCCATTTCCTCAGCCTACTCGTCGGTGTCGGCGGTGTCCCCATTCAAAAACTGTTCCATCACACTGAATTCCACTGCCATTCATGAAATCAATCAATATTATCCAAGAAGAGGAAGCTGGATCACTACAGATTCTGAATTCCTCCCAAATTCGGCTCTGTCACTCCCAACTCTGCATTTTCTATCCAACCGTAATCCACCTTCCGTTTTCTCCATGTCCATTCATACCTCTGCATTTTCAACTGTGACCCTTCTCCGTTCTCTCACTCTTCCCTTCTCTCCATACCATCACTACTTTCGTTGTCCCGATTACATAGTCCGTACTCTCTTTATCCCAGCGTATTCTGTAAAAGGACGACGACAACTTCCGCGGATTCCTGCCTTTGCTTCCAGTTCTTTTGTTGAACAGCTAGTGTATGACCGAGATTCCCCGTTCGAGTCTGAAGAGCACTTATCATCTCCATACAGTAATGGGGCTGATGGTTTTCAATCTGAAAATGGTTTTGCGTCAGCGGATTTGAAACATTTAGGAACGCCTGCGCTAGAAGTCAAGGAGCTGGATGAGTTGCCAGAGCAATGGCGAAGATCCAAATTGGCTTGGCTTTGTAAGGAATTGCCAGCACAAAAGCCGGGAACATTGATACGGCTGCTTAATGCTCAGAAGAAATGGCTGGGGCAAGATGACGCAACCTATCTCACCGTGCATTGTTTGCGTATTCGCGAAAACGAGACTGCATTTAGGGTGTACAAGTGGATGTTGCAACAACGTTGGTACCGATTTGATTATGCTTTAGCTACTAAGCTTGCTGATTACATGGGCAAGGAACGAAAATTCTCAAAGTGTCGGGAAGTATTTGATGATATAATTAATCAGGGCTGCGTGCCAAGTGAATCCACATTTCATATATTGATTGTTGCATACCTTAGTGCACCTGTTCAAGGATGCATAGAGGAAGCAAGTACCATTTACAATCGTATGATTCAGTTAGGAGGTTACCAACCACGTCTTAGCTTGCACAATTCTCTCTTTAGAGCTCTTATGAGCAAACCAGGGGATTTGTCAAAGCATCATCTTAAACAGGCTGAGTTTATATATCACAATTTGGTAACAAGTGGACTTGAGTTACATAAAGATATATATGGTGGTCTAATTTGGCTACATAGTTATCAGGATACTATAGACAAAGAAAGGATAGTGTTTCTAAGGAAAGAAATGCAACAAGCAGGAATCAAGGAGGAAAGAGAAGTCCTTTTGTCCATATTGAGAGCGAGCTCAAAAATGGGGGATGTAGTGGAAGCAGAAAGATCGTGGCAAAA
ACTTACGTATTTTGATGGCAGCATGCCATCTCAAGCTTTTGTTTACAAAATGGAAGTCTATTCAAAGATGGGCAGACCAATGAAAGCTCTGGAGATCTTTAGGGAGATGGAGCAGTTGAACTCTACAAGCGCTGCAGCATATCAGACAGTTATTGGTATTTTATGTAAATTTCAAGAGATAGAACTTGCAGAATCAATCATGTCAGGCTTCATAAAGAGTAATTTAAAACCCCTCATGCCAGCTTATGTTGATTTGATGAATATGTTTTTCAATTTAAGCTTACACGATAAGTTAGAGTTAACCTTCTCTCAGTGCCTTGAGAAGTGTAAACCCAATCGTACTATCTATAGCATATATTTGGACTCTTTGGTAAAAGTTGGTAACATCTACAAGGCCGAAGAAATATTTAATCAGATGGAAACAAATGGAGAAATTGGTATAAATGCTCGTTCATGCAACATCATTTTGAGTGGGTATCTTTTATTTGGAAATTATTTGAAGGCTGAAAAAATATATGATTTGATGTGTCAGAAGAAGTATGACATTGATCCTCCATTAATGGAGAAACTTGATTATGTCCTAAGCTTGAGTAGGAAGGAGGTTAAGAAGCCAGTAAGCTTGAAGTTGAGTAAAGAACAGAGGGAGATTTTAATAGGGTTGTTGTTGGGTGGCCTGGAGATAGAGTCTGATGAAGAGAGGAAAAATCATAGAATCCATTTTGAATTCCTCAAAAACAGTAACTCCCACTCTCTTTTGAGGAGACACATATATGAGCAATATCATGAATGGTTACATCCTGCTTCGAAGTTGAGCGATGGTGATATAGATATACCATATAAATTCTGCACTGTTTCACATTCATATTTTGGTTTCTATGCAGATCAGTTTTGGCCACGAGGCCATCCTGCAATCCCTAATCTAATTCACCGGTGGCTTTCACCTCGTGTTCTTGCATACTGGTATATGTATGGAGGCTGCAGGACATCATCAGGGGATATTTTACTGAAGCTAAAGGGAAGTCATGAGGGTGTTGAGAAGATTGTTAAATCTTTGAGAGAGAAGTCCATGCATTGCAAAGTGAAAAGGAAGGGCAACATGTATTGGATAGGTTTACTTGGAAGCAACGCCACATGGTTCTGGAAACTAATTGAACCTTTCATTCTGGATTACTTGAAAGATAGTCTACATGCAGACAGTCTTAACTTGGAGAGGGTTTTAAATGAAACTGAAAATATCAACTTTGATAGTCAATCTGATTCCGTTGGGGAGGCTTCTAATTAA

Coding sequence (CDS)

ATGCAGCAAGTTCTTCAATGGGTTTTCAAGGAAACAAATGAAAAACAAGGACAGAGGACTTCACAATCCACCAAGAAGGAGACAAGTTGGGATAAAGGAACTGCTGCTAGATCAAAGGAGATAGTGGTGTCTAGTTCATGTGGGAGTTCTCAAAGATCAAAGGGATTTGAAGGAAGGAAATTTGATCTAAAGAAGTTGAAATCATTTGCTTTGTTGTGCAGAAAGGATATTTCAAAGTCTTGCTTTTATAGTACCATTAATATAAAGAGACCAAGAACACAGGGCTTTGATAATATGCACCATTATTGGATCCAAAAGATGAAAAAAGAAGATATTGCTAAGGGAATTAGAGAGATTGGTCAAAAGGCAGATTCTTCTTCTTCTTCTGTTCATGCTGGCACCAAAGTCTTGCCAATAACTGATGCCACTTTAGTAGAGCAATGCAGCAGTAAGACTGAGAAGAAAGTTTTGGCCAATGGAGAGAGTAAAACCAAAACCAAATCAAGAATGAAGGAACTTTTGAGATGGGCTATGACTTCAAGATCAGAAAAAGGAGGGAAGTTCATCACTGGCAAGGTTTCACGAATGAGAAACCGAGGAACTCTTAAAGCAGGTTTGGACGATGATCGAGGGAGTAACGACTCCCCCAAGATTAGTTTCAGATGGGAAGCTGAAAGCTGCTCCTCCATTTCCTCAGCCTACTCGTCGGTGTCGGCGGTGTCCCCATTCAAAAACTGTTCCATCACACTGAATTCCACTGCCATTCATGAAATCAATCAATATTATCCAAGAAGAGGAAGCTGGATCACTACAGATTCTGAATTCCTCCCAAATTCGGCTCTGTCACTCCCAACTCTGCATTTTCTATCCAACCGTAATCCACCTTCCGTTTTCTCCATGTCCATTCATACCTCTGCATTTTCAACTGTGACCCTTCTCCGTTCTCTCACTCTTCCCTTCTCTCCATACCATCACTACTTTCGTTGTCCCGATTACATAGTCCGTACTCTCTTTATCCCAGCGTATTCTGTAAAAGGACGACGACAACTTCCGCGGATTCCTGCCTTTGCTTCCAGTTCTTTTGTTGAACAGCTAGTGTATGACCGAGATTCCCCGTTCGAGTCTGAAGAGCACTTATCATCTCCATACAGTAATGGGGCTGATGGTTTTCAATCTGAAAATGGTTTTGCGTCAGCGGATTTGAAACATTTAGGAACGCCTGCGCTAGAAGTCAAGGAGCTGGATGAGTTGCCAGAGCAATGGCGAAGATCCAAATTGGCTTGGCTTTGTAAGGAATTGCCAGCACAAAAGCCGGGAACATTGATACGGCTGCTTAATGCTCAGAAGAAATGGCTGGGGCAAGATGACGCAACCTATCTCACCGTGCATTGTTTGCGTATTCGCGAAAACGAGACTGCATTTAGGGTGTACAAGTGGATGTTGCAACAACGTTGGTACCGATTTGATTATGCTTTAGCTACTAAGCTTGCTGATTACATGGGCAAGGAACGAAAATTCTCAAAGTGTCGGGAAGTATTTGATGATATAATTAATCAGGGCTGCGTGCCAAGTGAATCCACATTTCATATATTGATTGTTGCATACCTTAGTGCACCTGTTCAAGGATGCATAGAGGAAGCAAGTACCATTTACAATCGTATGATTCAGTTAGGAGGTTACCAACCACGTCTTAGCTTGCACAATTCTCTCTTTAGAGCTCTTATGAGCAAACCAGGGGATTTGTCAAAGCATCATCTTAAACAGGCTGAGTTTATATATCACAATTTGGTAACAAGTGGACTTGAGTTACATAAAGATATATATGGTGGTCTAATTTGGCTACATAGTTATCAGGATACTATAGACAAAGAAAGGATAGTGTTTCTAAGGAAAGAAATGCAACAAGCAGGAATCAAGGAGGAAAGAGAAGTCCTTTTGTCCATATTGAGAGCGAGCTCAAAAATGGGGGATGTAGTGGAAGCAGAAAGATCGTGGCAAAA
ACTTACGTATTTTGATGGCAGCATGCCATCTCAAGCTTTTGTTTACAAAATGGAAGTCTATTCAAAGATGGGCAGACCAATGAAAGCTCTGGAGATCTTTAGGGAGATGGAGCAGTTGAACTCTACAAGCGCTGCAGCATATCAGACAGTTATTGGTATTTTATGTAAATTTCAAGAGATAGAACTTGCAGAATCAATCATGTCAGGCTTCATAAAGAGTAATTTAAAACCCCTCATGCCAGCTTATGTTGATTTGATGAATATGTTTTTCAATTTAAGCTTACACGATAAGTTAGAGTTAACCTTCTCTCAGTGCCTTGAGAAGTGTAAACCCAATCGTACTATCTATAGCATATATTTGGACTCTTTGGTAAAAGTTGGTAACATCTACAAGGCCGAAGAAATATTTAATCAGATGGAAACAAATGGAGAAATTGGTATAAATGCTCGTTCATGCAACATCATTTTGAGTGGGTATCTTTTATTTGGAAATTATTTGAAGGCTGAAAAAATATATGATTTGATGTGTCAGAAGAAGTATGACATTGATCCTCCATTAATGGAGAAACTTGATTATGTCCTAAGCTTGAGTAGGAAGGAGGTTAAGAAGCCAGTAAGCTTGAAGTTGAGTAAAGAACAGAGGGAGATTTTAATAGGGTTGTTGTTGGGTGGCCTGGAGATAGAGTCTGATGAAGAGAGGAAAAATCATAGAATCCATTTTGAATTCCTCAAAAACAGTAACTCCCACTCTCTTTTGAGGAGACACATATATGAGCAATATCATGAATGGTTACATCCTGCTTCGAAGTTGAGCGATGGTGATATAGATATACCATATAAATTCTGCACTGTTTCACATTCATATTTTGGTTTCTATGCAGATCAGTTTTGGCCACGAGGCCATCCTGCAATCCCTAATCTAATTCACCGGTGGCTTTCACCTCGTGTTCTTGCATACTGGTATATGTATGGAGGCTGCAGGACATCATCAGGGGATATTTTACTGAAGCTAAAGGGAAGTCATGAGGGTGTTGAGAAGATTGTTAAATCTTTGAGAGAGAAGTCCATGCATTGCAAAGTGAAAAGGAAGGGCAACATGTATTGGATAGGTTTACTTGGAAGCAACGCCACATGGTTCTGGAAACTAATTGAACCTTTCATTCTGGATTACTTGAAAGATAGTCTACATGCAGACAGTCTTAACTTGGAGAGGGTTTTAAATGAAACTGAAAATATCAACTTTGATAGTCAATCTGATTCCGTTGGGGAGGCTTCTAATTAA

Protein sequence

MQQVLQWVFKETNEKQGQRTSQSTKKETSWDKGTAARSKEIVVSSSCGSSQRSKGFEGRKFDLKKLKSFALLCRKDISKSCFYSTINIKRPRTQGFDNMHHYWIQKMKKEDIAKGIREIGQKADSSSSSVHAGTKVLPITDATLVEQCSSKTEKKVLANGESKTKTKSRMKELLRWAMTSRSEKGGKFITGKVSRMRNRGTLKAGLDDDRGSNDSPKISFRWEAESCSSISSAYSSVSAVSPFKNCSITLNSTAIHEINQYYPRRGSWITTDSEFLPNSALSLPTLHFLSNRNPPSVFSMSIHTSAFSTVTLLRSLTLPFSPYHHYFRCPDYIVRTLFIPAYSVKGRRQLPRIPAFASSSFVEQLVYDRDSPFESEEHLSSPYSNGADGFQSENGFASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDDATYLTVHCLRIRENETAFRVYKWMLQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLTYFDGSMPSQAFVYKMEVYSKMGRPMKALEIFREMEQLNSTSAAAYQTVIGILCKFQEIELAESIMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILIGLLLGGLEIESDEERKNHRIHFEFLKNSNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGNMYWIGLLGSNATWFWKLIEPFILDYLKDSLHADSLNLERVLNETENINFDSQSDSVGEASN
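The protein sequence is the conceptual translation of the CDS. The sketch below illustrates that relationship with the standard genetic code; the `translate` function and the compact codon-table construction are illustrative choices, not the pipeline used to annotate this genome. It is applied only to the first 30 bases of the CDS listed above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Codons containing unexpected characters are reported as 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the CDS shown above.
print(translate("ATGCAGCAAGTTCTTCAATGGGTTTTCAAG"))  # MQQVLQWVFK
```

The result matches the first ten residues of the protein sequence. To translate the whole CDS, the wrapped lines would need to be joined into a single string first.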
Homology
BLAST of HG10012104 vs. NCBI nr
Match: XP_038887990.1 (pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Benincasa hispida])

HSP 1 Score: 1465.7 bits (3793), Expect = 0.0e+00
Identity = 724/794 (91.18%), Positives = 760/794 (95.72%), Query Frame = 0

Query: 300  MSIHTSAFSTVTLLRSLTLPFSPYHHYFRCPDYIVRTLFIPAYSVKGRRQLPRIPAFASS 359
            MSIHTSAFS+VTLLRS +L  SPYHHYFRCP++IVRT+FIP YSVKG++QLPRIP+FASS
Sbjct: 1    MSIHTSAFSSVTLLRSPSLSLSPYHHYFRCPNHIVRTIFIPIYSVKGQQQLPRIPSFASS 60

Query: 360  SFVEQLVYDRDSPFESEEHLSSPYSNGADGFQSENGFASADLKHLGTPALEVKELDELPE 419
            S VEQLVYDRDS FESEEHLSSPYSNGAD      GFASADLKHL  PALEVKELDELP+
Sbjct: 61   SSVEQLVYDRDSLFESEEHLSSPYSNGAD------GFASADLKHLEMPALEVKELDELPD 120

Query: 420  QWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDDATYLTVHCLRIRENETAFRVYKW 479
            QWRRSKLAWLCKELPAQKPGTLIRLLNAQ+KW+ QDDATYLTVHCLRIRENETAFRVYKW
Sbjct: 121  QWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMRQDDATYLTVHCLRIRENETAFRVYKW 180

Query: 480  MLQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP 539
            M+QQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP
Sbjct: 181  MMQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP 240

Query: 540  VQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTS 599
            VQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRAL SKPGDLSKHHLKQAEFIYHNLVTS
Sbjct: 241  VQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALTSKPGDLSKHHLKQAEFIYHNLVTS 300

Query: 600  GLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVV 659
            GLE+HKDI GGLIWLHSYQDTIDKERIV LRKEMQQAGIKEEREVLLSILRASSKMG+V+
Sbjct: 301  GLEVHKDICGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEREVLLSILRASSKMGNVM 360

Query: 660  EAERSWQKLTYFDGSMPSQAFVYKMEVYSKMGRPMKALEIFREMEQLNSTSAAAYQTVIG 719
            EAERSWQKL  FDG+MPSQAFVYKMEVY+KMG+PMKALEIFREMEQLNS +AAAY+T+IG
Sbjct: 361  EAERSWQKLKDFDGNMPSQAFVYKMEVYAKMGKPMKALEIFREMEQLNSANAAAYRTIIG 420

Query: 720  ILCKFQEIELAESIMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPN 779
            ILCKFQ+IELAESIM GFIKSNLKPLMPAYVDLMNMFFNLSLH+KLEL FSQCLEKCKP+
Sbjct: 421  ILCKFQDIELAESIMKGFIKSNLKPLMPAYVDLMNMFFNLSLHNKLELIFSQCLEKCKPD 480

Query: 780  RTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIY 839
            RTIYSIYLDSLVKVGN+ +AEEIF+QMETNGEIGINARSCNIILSGYLLFGNYLKAEKIY
Sbjct: 481  RTIYSIYLDSLVKVGNLDRAEEIFSQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIY 540

Query: 840  DLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILIGLLLGGLEIESDEE 899
            DLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEQREIL+GLLLGG+EIESDEE
Sbjct: 541  DLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILVGLLLGGVEIESDEE 600

Query: 900  RKNHRIHFEFLKNSNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFY 959
            RKNHRI FEF +N N+HSLLRRHIYEQYHEWLH ASKL+DGDIDIPYKFCTVSHSYFGFY
Sbjct: 601  RKNHRIQFEFQQNCNTHSLLRRHIYEQYHEWLHSASKLNDGDIDIPYKFCTVSHSYFGFY 660

Query: 960  ADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVKSLR 1019
            ADQFWP+GHPAIPNLIHRWLSPRVLAYWYMYGGCRTSSGDILLKLKGS EGVEKIVKSLR
Sbjct: 661  ADQFWPQGHPAIPNLIHRWLSPRVLAYWYMYGGCRTSSGDILLKLKGSREGVEKIVKSLR 720

Query: 1020 EKSMHCKVKRKGNMYWIGLLGSNATWFWKLIEPFILDYLKDSLHADSLNLERVLNETENI 1079
            EKSM CKVKRKG+MYWIGLLG+NATWFWKL+EPFILDYLKDSL ADS NL RVLNETENI
Sbjct: 721  EKSMQCKVKRKGSMYWIGLLGNNATWFWKLVEPFILDYLKDSLEADSPNLGRVLNETENI 780

Query: 1080 NFDSQSDSVGEASN 1094
            NFDSQSDSV EASN
Sbjct: 781  NFDSQSDSVEEASN 788

BLAST of HG10012104 vs. NCBI nr
Match: XP_008465080.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucumis melo])

HSP 1 Score: 1459.5 bits (3777), Expect = 0.0e+00
Identity = 722/797 (90.59%), Positives = 756/797 (94.86%), Query Frame = 0

Query: 297  VFSMSIHTSAFSTVTLLRSLTLPFSPYHHYFRCPDYIVRTLFIPAYSVKGRRQLPRIPAF 356
            VFSMSI TSAFSTVTLLRSLTL  SPYHHYF  P++I+ TLFI +YSVK  RQLPRI AF
Sbjct: 2    VFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVK-VRQLPRIRAF 61

Query: 357  ASSSFVEQLVYDRDSPFESEEHLSSPYSNGADGFQSENGFASADLKHLGTPALEVKELDE 416
            AS SFV+QLVYDRDSP ESEEHLSSPYSNG DGF  ENGFAS DLKHLGTPALEVKELDE
Sbjct: 62   ASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGTPALEVKELDE 121

Query: 417  LPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDDATYLTVHCLRIRENETAFRV 476
            LPEQWRRSKLAWLCKELPAQKPGT+IRLLNAQ+KW+GQDDATYLTVHCLRIRENETAFRV
Sbjct: 122  LPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV 181

Query: 477  YKWMLQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYL 536
            YKWM+QQ WYRFDYAL+TKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYL
Sbjct: 182  YKWMMQQHWYRFDYALSTKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYL 241

Query: 537  SAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNL 596
            SAPVQGCIEEASTIYNRMIQLGGYQPRLSLH+SLFRALMSKPGDLSKHHLKQAEFIYHNL
Sbjct: 242  SAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFIYHNL 301

Query: 597  VTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMG 656
            VTSGLELHKDIYGGLIWLHSYQDTIDKERIV LRKEMQQAGIKEE+EVLLSILRASSKMG
Sbjct: 302  VTSGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMG 361

Query: 657  DVVEAERSWQKLTYFDGSMPSQAFVYKMEVYSKMGRPMKALEIFREMEQLNSTSAAAYQT 716
            DVVEAER WQKL Y DG+MP QAFVYKMEVY+KMG+PMKALEIFREMEQLNST+AAAYQT
Sbjct: 362  DVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMKALEIFREMEQLNSTNAAAYQT 421

Query: 717  VIGILCKFQEIELAESIMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKC 776
            +IGILCKFQEIELAESIM+GFI+SNLKPL PAYVD+MNMFFNLSLHDKLELTFSQCLEKC
Sbjct: 422  IIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKC 481

Query: 777  KPNRTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAE 836
            KPNRTIYSIYLDSLVKVGN+ +AEEIF+QMETNGEIG+NARSCN+IL GYLLFGNY+KAE
Sbjct: 482  KPNRTIYSIYLDSLVKVGNLDRAEEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAE 541

Query: 837  KIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILIGLLLGGLEIES 896
            KIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKP+SLKLSKEQREIL+GLLLGGLEIES
Sbjct: 542  KIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIES 601

Query: 897  DEERKNHRIHFEFLKNSNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYF 956
            DEERKNHRI FEF KN  +HS+LRRHIYEQYH+WLH ASKL+DGDIDIPYKFCTVSHSYF
Sbjct: 602  DEERKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYF 661

Query: 957  GFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVK 1016
            GFYADQFWPRG   IPNLIHRWLSPR LAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVK
Sbjct: 662  GFYADQFWPRGRQTIPNLIHRWLSPRALAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVK 721

Query: 1017 SLREKSMHCKVKRKGNMYWIGLLGSNATWFWKLIEPFILDYLKDSLHADSLNLERVLNET 1076
            SLREKSMHCKVKRKG+MYWIGLLGSNATWFWKLIEPFILD LK+S  ADSLNL  VLNET
Sbjct: 722  SLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNL-GVLNET 781

Query: 1077 ENINFDSQSDSVGEASN 1094
            ENINFDSQSDSV E SN
Sbjct: 782  ENINFDSQSDSVEETSN 796

BLAST of HG10012104 vs. NCBI nr
Match: XP_004152074.2 (pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucumis sativus] >KGN58344.1 hypothetical protein Csa_017589 [Cucumis sativus])

HSP 1 Score: 1448.3 bits (3748), Expect = 0.0e+00
Identity = 708/797 (88.83%), Positives = 754/797 (94.60%), Query Frame = 0

Query: 297  VFSMSIHTSAFSTVTLLRSLTLPFSPYHHYFRCPDYIVRTLFIPAYSVKGRRQLPRIPAF 356
            VFSMSI TSAFSTVT LRSLTL  SPYHHYF CP++I+ TLF+PAYSVK RRQLPRI AF
Sbjct: 2    VFSMSIPTSAFSTVTRLRSLTLSLSPYHHYFHCPNHIIPTLFLPAYSVKVRRQLPRIRAF 61

Query: 357  ASSSFVEQLVYDRDSPFESEEHLSSPYSNGADGFQSENGFASADLKHLGTPALEVKELDE 416
            AS SFV+QLVYD DSP ESEEHLSS +SNG DGF  ENGFAS DLKHLGTP LEVKELDE
Sbjct: 62   ASGSFVKQLVYDHDSPSESEEHLSSSFSNGGDGFHFENGFASVDLKHLGTPVLEVKELDE 121

Query: 417  LPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDDATYLTVHCLRIRENETAFRV 476
            LPEQWRRSK+AWLCKELPAQKPGT+IRLLNAQKKW+GQDDATYL VHCLRIRENETAFRV
Sbjct: 122  LPEQWRRSKVAWLCKELPAQKPGTVIRLLNAQKKWMGQDDATYLIVHCLRIRENETAFRV 181

Query: 477  YKWMLQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYL 536
            YKWM+QQ WYRFDYAL+TKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYL
Sbjct: 182  YKWMMQQHWYRFDYALSTKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYL 241

Query: 537  SAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNL 596
            SAPVQGCIEEASTIYNRMIQLGGYQPRLSLH+SLFRAL+SKPGDLSKHHLKQAEFIYHNL
Sbjct: 242  SAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALVSKPGDLSKHHLKQAEFIYHNL 301

Query: 597  VTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMG 656
            VTSGLELHKD+YGGLIWLHSYQDTID+ERIV LRKEMQQAGIKEEREVLLSILRASSKMG
Sbjct: 302  VTSGLELHKDMYGGLIWLHSYQDTIDRERIVSLRKEMQQAGIKEEREVLLSILRASSKMG 361

Query: 657  DVVEAERSWQKLTYFDGSMPSQAFVYKMEVYSKMGRPMKALEIFREMEQLNSTSAAAYQT 716
            DV+EAE+ WQ+L Y DG+MPSQAFVYKMEVY+KMG+PMKALEIFREMEQLNST+AAAYQT
Sbjct: 362  DVMEAEKLWQELKYLDGNMPSQAFVYKMEVYAKMGKPMKALEIFREMEQLNSTNAAAYQT 421

Query: 717  VIGILCKFQEIELAESIMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKC 776
            +IGILCKFQ IELAESIM+GFI+SNLKPL PAYVDLMNMFFNL+L DKLELTFSQCLEKC
Sbjct: 422  IIGILCKFQVIELAESIMAGFIESNLKPLTPAYVDLMNMFFNLNLDDKLELTFSQCLEKC 481

Query: 777  KPNRTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAE 836
            KPNRTIYSIYLDSLVKVGN+ +AEEIF+QMETNGEIGINARSCNIIL GYLL GNY+KAE
Sbjct: 482  KPNRTIYSIYLDSLVKVGNLDRAEEIFSQMETNGEIGINARSCNIILRGYLLCGNYMKAE 541

Query: 837  KIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILIGLLLGGLEIES 896
            KIYDLMCQK+YDIDPPLMEKL+Y+LSLSRKEVKKP+SLKLSKEQREIL+GLLLGGLEIES
Sbjct: 542  KIYDLMCQKRYDIDPPLMEKLEYILSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIES 601

Query: 897  DEERKNHRIHFEFLKNSNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYF 956
            D+ERKNHRI FEF +N  +HS+LRRHIYEQYH+WLH ASKL+DGD+DIPYKFCTVSHSYF
Sbjct: 602  DDERKNHRIQFEFHRNCKTHSVLRRHIYEQYHKWLHSASKLTDGDVDIPYKFCTVSHSYF 661

Query: 957  GFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVK 1016
            GFYADQFWPRG  AIPNLIHRWLSPRVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVK
Sbjct: 662  GFYADQFWPRGRRAIPNLIHRWLSPRVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVK 721

Query: 1017 SLREKSMHCKVKRKGNMYWIGLLGSNATWFWKLIEPFILDYLKDSLHADSLNLERVLNET 1076
            SLREKS+HCKVKRKGNMYWIGLLGSNATWFWKLIEPFILDYLK+S  ADSLNL  VLN +
Sbjct: 722  SLREKSIHCKVKRKGNMYWIGLLGSNATWFWKLIEPFILDYLKESTQADSLNLVGVLNGS 781

Query: 1077 ENINFDSQSDSVGEASN 1094
            ENINFDS+SDSV E SN
Sbjct: 782  ENINFDSESDSVEETSN 798

BLAST of HG10012104 vs. NCBI nr
Match: XP_022998786.1 (pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita maxima])

HSP 1 Score: 1401.0 bits (3625), Expect = 0.0e+00
Identity = 689/794 (86.78%), Positives = 739/794 (93.07%), Query Frame = 0

Query: 300  MSIHTSAFSTVTLLRSLTLPFSPYHHYFRCPDYIVRTLFIPAYSVKGRRQLPRIPAFASS 359
            MSI TSAF+TVTLLRSLTLPFS  H++FRC +Y++R+L IP YS KGRRQLPRIPAFASS
Sbjct: 1    MSIRTSAFATVTLLRSLTLPFSQCHNHFRCWNYVIRSLSIPTYSAKGRRQLPRIPAFASS 60

Query: 360  SFVEQLVYDRDSPFESEEHLSSPYSNGADGFQSENGFASADLKHLGTPALEVKELDELPE 419
            S VE LVYDRDSP ESEE L SPYSNGA+       FASADLKHLG PALEVKELDELPE
Sbjct: 61   SSVEALVYDRDSPAESEEPLCSPYSNGAE------EFASADLKHLGAPALEVKELDELPE 120

Query: 420  QWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDDATYLTVHCLRIRENETAFRVYKW 479
            QWRRSKLAWLCKELPA KPGTLIRLLNAQ+KW+ QDDA YL VHCLRIRENETAFRVYKW
Sbjct: 121  QWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKW 180

Query: 480  MLQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP 539
            M+QQ WYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP
Sbjct: 181  MMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP 240

Query: 540  VQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTS 599
            VQGCIEEASTIYNRMIQLGGY PRLSLHNSLF+AL+SKPGDLSKHHLKQAEFIYHNLVT+
Sbjct: 241  VQGCIEEASTIYNRMIQLGGYPPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLVTT 300

Query: 600  GLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVV 659
            GLELHKDIYGGLIWLHSYQDT+DKERI+ LRKEMQQAGI+EEREVL+SILRASSK+GDV+
Sbjct: 301  GLELHKDIYGGLIWLHSYQDTVDKERIMSLRKEMQQAGIEEEREVLVSILRASSKLGDVM 360

Query: 660  EAERSWQKLTYFDGSMPSQAFVYKMEVYSKMGRPMKALEIFREMEQLNSTSAAAYQTVIG 719
            EAERSW K+  FDGSMPSQAFVYKMEVY+K+G PMKALEIFREMEQLNS S+AAYQT+IG
Sbjct: 361  EAERSWLKIKSFDGSMPSQAFVYKMEVYAKVGNPMKALEIFREMEQLNSISSAAYQTIIG 420

Query: 720  ILCKFQEIELAESIMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPN 779
            ILCKF+E+ LAES+M+GFIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPN
Sbjct: 421  ILCKFEEVTLAESVMAGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPN 480

Query: 780  RTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIY 839
            RTIYSIYL+SLVKVGN+ +AEEIF+QM+TNGEIG++ARSCNIILSGYLL G+YLKAEKIY
Sbjct: 481  RTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIY 540

Query: 840  DLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILIGLLLGGLEIESDEE 899
            DLMCQKKYDIDPPLMEKLDYVLSLSRKE+KKPVSLKLSKEQREIL+GLLLGGLEIESDE 
Sbjct: 541  DLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEG 600

Query: 900  RKNHRIHFEFLKNSNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFY 959
            RKNHRI FEF ++ ++HS LRRH+YEQYHEWLHPASKLSD D DIPYKFCTVSHSYFGFY
Sbjct: 601  RKNHRIQFEFHEDCSTHSCLRRHVYEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFY 660

Query: 960  ADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVKSLR 1019
            ADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCR SSGD +LKLKGS EGV KIVKSLR
Sbjct: 661  ADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVVKIVKSLR 720

Query: 1020 EKSMHCKVKRKGNMYWIGLLGSNATWFWKLIEPFILDYLKDSLHADSLNLERVLNETENI 1079
            EKSM CKVKRKG +YWIGLLGSNATWFWKLIEPFILD LKDSL AD+LNLE+ +NET NI
Sbjct: 721  EKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADNLNLEKAVNETYNI 780

Query: 1080 NFDSQSDSVGEASN 1094
            NFDSQSDS  EAS+
Sbjct: 781  NFDSQSDSDEEASS 788
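
The middle row of each alignment block encodes the match type per column: a letter marks an identical residue, `+` marks a conservative (positive) substitution, and a space marks a mismatch or gap. A minimal sketch of how the Identity and Positives counts could be tallied from one block follows, assuming the three rows have already had their `Query:`/`Sbjct:` labels and coordinates stripped. The function name `count_midline` is hypothetical and not part of any BLAST tool.

```python
def count_midline(query: str, midline: str, sbjct: str) -> tuple[int, int, int]:
    """Tally identities and positives for one alignment block.

    All three strings are assumed to be the bare aligned segments
    (no 'Query:' label, no coordinates), so they share one column layout.
    """
    # Trailing spaces on the midline are often trimmed; pad it back out.
    midline = midline.ljust(len(query))
    if not (len(query) == len(midline) == len(sbjct)):
        raise ValueError("alignment rows must have equal length")

    identities = 0
    positives = 0
    for mark in midline:
        if mark == "+":
            positives += 1          # conservative substitution
        elif mark != " ":
            identities += 1         # identical residue
            positives += 1          # identities also count as positives
    return identities, positives, len(query)


# Final block of the alignment above (query positions 1080-1094).
result = count_midline("NFDSQSDSVGEASN", "NFDSQSDS  EAS+", "NFDSQSDSDEEASS")
print(result)
```

Summing these per-block tallies over every block of an HSP should reproduce the totals in that HSP's `Identity` and `Positives` fields.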

BLAST of HG10012104 vs. NCBI nr
Match: XP_022949171.1 (pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita moschata])

HSP 1 Score: 1401.0 bits (3625), Expect = 0.0e+00
Identity = 688/794 (86.65%), Positives = 735/794 (92.57%), Query Frame = 0

Query: 300  MSIHTSAFSTVTLLRSLTLPFSPYHHYFRCPDYIVRTLFIPAYSVKGRRQLPRIPAFASS 359
            MSI TSAF+TVTLLRSLTLPFS  HH+FRC +Y++R+L IP YS KGRRQLPRIPAFASS
Sbjct: 1    MSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPTYSAKGRRQLPRIPAFASS 60

Query: 360  SFVEQLVYDRDSPFESEEHLSSPYSNGADGFQSENGFASADLKHLGTPALEVKELDELPE 419
            S VE LVYDRDSP ESEE L SPYS GA+      GFASADLKHLG PALEVKELDELPE
Sbjct: 61   SSVEALVYDRDSPAESEEPLCSPYSTGAE------GFASADLKHLGAPALEVKELDELPE 120

Query: 420  QWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDDATYLTVHCLRIRENETAFRVYKW 479
            QWRRSKLAWLCKELPAQKPGTLIRLLNAQ+KW+ QDDA YL VHCLRIRENETAFRVYKW
Sbjct: 121  QWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKW 180

Query: 480  MLQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP 539
            M+QQ WYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP
Sbjct: 181  MMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP 240

Query: 540  VQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTS 599
            +QGCIEE+STIYNRMIQLGGYQPRLSLHNSLF+AL+SKPGDLSKHHLKQAEFIYHNL T+
Sbjct: 241  IQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATT 300

Query: 600  GLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVV 659
            GLELHKDIYGGLIWLHSYQDT+DKERI+ LRKEM QAGI+EEREVL+SILRASSK+GDV+
Sbjct: 301  GLELHKDIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVM 360

Query: 660  EAERSWQKLTYFDGSMPSQAFVYKMEVYSKMGRPMKALEIFREMEQLNSTSAAAYQTVIG 719
            EAERSW KL  FDGSMPSQAFVYKMEVY+K+G PMKA EIFREMEQLNS SAAAYQT+IG
Sbjct: 361  EAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLNSISAAAYQTIIG 420

Query: 720  ILCKFQEIELAESIMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPN 779
            ILCKF+E+ LAES+M GFIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPN
Sbjct: 421  ILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPN 480

Query: 780  RTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIY 839
            RTIYSIYL+SLVKVGN+ +AEEIF+QM+TNGEIG++ARSCNIILSGYLL G+YLKAEKIY
Sbjct: 481  RTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIY 540

Query: 840  DLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILIGLLLGGLEIESDEE 899
            DLMCQKKYDIDPPLMEKLDYVLSLSRKE+KKPVSLKLSKEQREIL+GLLLGGLEIESDE 
Sbjct: 541  DLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEG 600

Query: 900  RKNHRIHFEFLKNSNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFY 959
            RKNHRI FEF ++ ++HS LRRHI+EQYHEWLHPASKLSD D DIPYKFCTVSHSYFGFY
Sbjct: 601  RKNHRIQFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFY 660

Query: 960  ADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVKSLR 1019
            ADQFWPRGHP IPNLIHRWLSPRVLAYWYMYGGCR SSGD +LKLKGS EGV KIVKSLR
Sbjct: 661  ADQFWPRGHPVIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSLR 720

Query: 1020 EKSMHCKVKRKGNMYWIGLLGSNATWFWKLIEPFILDYLKDSLHADSLNLERVLNETENI 1079
            EKSM CKVKRKG +YWIGLLGSNATWFWKLIEPFILD LKDSL ADSLN+E+  NET NI
Sbjct: 721  EKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADSLNMEKAANETYNI 780

Query: 1080 NFDSQSDSVGEASN 1094
            NFDSQSDS  EAS+
Sbjct: 781  NFDSQSDSDEEASS 788
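
The percentages in each HSP summary line are the reported counts divided by the alignment length. A short sketch for pulling those fields out of a summary line and rechecking the arithmetic is given below. The helper names `parse_hsp_stats` and `percent` are illustrative assumptions, and the pattern only targets the line layout used on this page.

```python
import re

# Matches the summary line layout used on this page. "Posi?tives" accepts
# the label with or without its second "i".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract identity/positive counts and reported percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: " + line)
    ident, length, ident_pct, pos, _pos_length, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": int(length),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }


def percent(count: int, length: int) -> float:
    """Recompute a percentage to two decimals, as shown in the report."""
    return round(100.0 * count / length, 2)


stats = parse_hsp_stats(
    "Identity = 688/794 (86.65%), Postives = 735/794 (92.57%), Query Frame = 0"
)
# Recomputed values should agree with the reported ones.
print(percent(stats["identities"], stats["length"]), stats["identity_pct"])
print(percent(stats["positives"], stats["length"]), stats["positive_pct"])
```

Comparing the recomputed and reported values is a cheap sanity check when these lines are scraped from the page rather than read from the original BLAST output.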

BLAST of HG10012104 vs. ExPASy Swiss-Prot
Match: Q9XIL5 (Pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=OTP51 PE=2 SV=3)

HSP 1 Score: 895.2 bits (2312), Expect = 7.4e-259
Identity = 458/833 (54.98%), Positives = 616/833 (73.95%), Query Frame = 0

Query: 278  NSALSLPTLHFLSNRNPPSVFSMSIHTSAFSTVTLLRSLTLPFSPYHHYFRCPDYIVRTL 337
            +S +S+ T +  S  + P++ +        S+ TL RSL+  FS   H        +R L
Sbjct: 30   SSTVSVTTFNISSLSSNPNIIN--------SSSTLFRSLS--FSLIRHRSSYSRRSLRRL 89

Query: 338  FIPAYSVKGRRQLPRIPAFASSSFVEQLVYDRDSPFESE----EHLSSPYSNGADGFQSE 397
             I  ++V G     +   F+ SS     ++  +S  +      EHL+   +   +G    
Sbjct: 90   SI--HTVHGN----KTQFFSHSSTRTPPLFTANSTAQRSGTFVEHLTG-ITESEEGISEA 149

Query: 398  NGF-----ASADLKHLGTPAL----EVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRL 457
            NGF     A  D++++ T  +    EV+EL+ELPE+WRRSKLAWLCKE+P  K  TL+RL
Sbjct: 150  NGFGDVESARNDIRNVATRRIETEFEVRELEELPEEWRRSKLAWLCKEVPTHKAVTLVRL 209

Query: 458  LNAQKKWLGQDDATYLTVHCLRIRENETAFRVYKWMLQQRWYRFDYALATKLADYMGKER 517
            LNAQKKW+ Q+DATY++VHC+RIRENET FRVY+WM QQ WYRFD+ L TKLA+Y+GKER
Sbjct: 210  LNAQKKWVRQEDATYISVHCMRIRENETGFRVYRWMTQQNWYRFDFGLTTKLAEYLGKER 269

Query: 518  KFSKCREVFDDIINQGCVPSESTFHILIVAYLSA-PVQGCIEEASTIYNRMIQLGGYQPR 577
            KF+KCREVFDD++NQG VPSESTFHIL+VAYLS+  V+GC+EEA ++YNRMIQLGGY+PR
Sbjct: 270  KFTKCREVFDDVLNQGRVPSESTFHILVVAYLSSLSVEGCLEEACSVYNRMIQLGGYKPR 329

Query: 578  LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLIWLHSYQDTIDK 637
            LSLHNSLFRAL+SK G +    LKQAEFI+HN+VT+GLE+ KDIY GLIWLHS QD +D 
Sbjct: 330  LSLHNSLFRALVSKQGGILNDQLKQAEFIFHNVVTTGLEVQKDIYSGLIWLHSCQDEVDI 389

Query: 638  ERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLTYFDGSMPSQAFVYK 697
             RI  LR+EM++AG +E +EV++S+LRA +K G V E ER+W +L   D  +PSQAFVYK
Sbjct: 390  GRINSLREEMKKAGFQESKEVVVSLLRAYAKEGGVEEVERTWLELLDLDCGIPSQAFVYK 449

Query: 698  MEVYSKMGRPMKALEIFREMEQ-LNSTSAAAYQTVIGILCKFQEIELAESIMSGFIKSNL 757
            +E YSK+G   KA+EIFREME+ +   + + Y  +I +LCK Q++EL E++M  F +S  
Sbjct: 450  IEAYSKVGDFAKAMEIFREMEKHIGGATMSGYHKIIEVLCKVQQVELVETLMKEFEESGK 509

Query: 758  KPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNIYKAEEI 817
            KPL+P+++++  M+F+L LH+KLE+ F QCLEKC+P++ IY+IYLDSL K+GN+ KA ++
Sbjct: 510  KPLLPSFIEIAKMYFDLGLHEKLEMAFVQCLEKCQPSQPIYNIYLDSLTKIGNLEKAGDV 569

Query: 818  FNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLS 877
            FN+M+ NG I ++ARSCN +L GYL  G  ++AE+IYDLM  KKY+I+PPLMEKLDY+LS
Sbjct: 570  FNEMKNNGTINVSARSCNSLLKGYLDCGKQVQAERIYDLMRMKKYEIEPPLMEKLDYILS 629

Query: 878  LSRKEVKK-PVSLKLSKEQREILIGLLLGGLEIESDEERKNHRIHFEFLKNSNSHSLLRR 937
            L +KEVKK P S+KLSK+QRE+L+GLLLGGL+IESD+E+K+H I FEF +NS +H +L++
Sbjct: 630  LKKKEVKKRPFSMKLSKDQREVLVGLLLGGLQIESDKEKKSHMIKFEFRENSQAHLVLKQ 689

Query: 938  HIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSP 997
            +I++Q+ EWLHP S   + DI IP++F +V HSYFGFYA+ +WP+G P IP LIHRWLSP
Sbjct: 690  NIHDQFREWLHPLSNFQE-DI-IPFEFYSVPHSYFGFYAEHYWPKGQPEIPKLIHRWLSP 749

Query: 998  RVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGNMYWIGLLGS 1057
              LAYWYMY G +TSSGDI+L+LKGS EGVEK+VK+L+ KSM C+VK+KG ++WIGL G+
Sbjct: 750  HSLAYWYMYSGVKTSSGDIILRLKGSLEGVEKVVKALQAKSMECRVKKKGKVFWIGLQGT 809

Query: 1058 NATWFWKLIEPFILDYLKDSLHADSLNLERVLN-ETENINFDSQSDSVGEASN 1094
            N+  FWKLIEP +L+ LK+ L   S +L+ V   E ++INF S SD   +  N
Sbjct: 810  NSALFWKLIEPHVLENLKEHLKPASESLDNVKEAEEQSINFKSNSDHSDDCVN 843

BLAST of HG10012104 vs. ExPASy Swiss-Prot
Match: Q6ZHJ5 (Pentatricopeptide repeat-containing protein OTP51, chloroplastic OS=Oryza sativa subsp. japonica OX=39947 GN=OTP51 PE=3 SV=1)

HSP 1 Score: 808.1 bits (2086), Expect = 1.2e-232
Identity = 397/739 (53.72%), Positives = 543/739 (73.48%), Query Frame = 0

Query: 351  PRIPAFASSSFVEQLVYDRDSPFESEEHLSSPYSNGADGFQSENGFASADLKHLGTPALE 410
            P IPA AS+  +E L+ D D   E E+            FQ E   A+ + + + +P L 
Sbjct: 51   PGIPAVASA--LESLILDLDDDEEDEDE-----ETEFGLFQGEAWAAADEREAVRSPELV 110

Query: 411  VKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDDATYLTVHCLRIREN 470
            V EL+ELPEQWRRS++AWLCKELPA K  T  R+LNAQ+KW+ QDDATY+ VHCLRIR N
Sbjct: 111  VPELEELPEQWRRSRIAWLCKELPAYKHSTFTRILNAQRKWITQDDATYVAVHCLRIRNN 170

Query: 471  ETAFRVYKWMLQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHI 530
            + AFRVY WM++Q W+RF++ALAT++AD +G++ K  KCREVF+ ++ QG VP+ESTFHI
Sbjct: 171  DAAFRVYSWMVRQHWFRFNFALATRVADCLGRDGKVEKCREVFEAMVKQGRVPAESTFHI 230

Query: 531  LIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAE 590
            LIVAYLS P   C+EEA TIYN+MIQ+GGY+PRLSLHNSLFRAL+SK G  +K++LKQAE
Sbjct: 231  LIVAYLSVPKGRCLEEACTIYNQMIQMGGYKPRLSLHNSLFRALVSKTGGTAKYNLKQAE 290

Query: 591  FIYHNLVTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILR 650
            F+YHN+VT+ L++HKD+Y GLIWLHSYQD ID+ERI+ LRKEM+QAG  E  +VL+S++R
Sbjct: 291  FVYHNVVTTNLDVHKDVYAGLIWLHSYQDVIDRERIIALRKEMKQAGFDEGIDVLVSVMR 350

Query: 651  ASSKMGDVVEAERSWQKLTYFDGSMPSQAFVYKMEVYSKMGRPMKALEIFREMEQLN-ST 710
            A SK G+V E E +W  +      +P QA+V +ME Y++ G PMK+L++F+EM+  N   
Sbjct: 351  AFSKEGNVAETEATWHNILQSGSDLPVQAYVCRMEAYARTGEPMKSLDMFKEMKDKNIPP 410

Query: 711  SAAAYQTVIGILCKFQEIELAESIMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTF 770
            + A+Y  +I I+ K  E+++ E +M+ FI+S++K LMPA++DLM M+ +L +H+KLELTF
Sbjct: 411  NVASYHKIIEIMTKALEVDIVEQLMNEFIESDMKHLMPAFLDLMYMYMDLDMHEKLELTF 470

Query: 771  SQCLEKCKPNRTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLF 830
             +C+ +C+PNR +Y+IYL+SLVKVGNI KAEE+F +M  NG IG N +SCNI+L GYL  
Sbjct: 471  LKCIARCRPNRILYTIYLESLVKVGNIEKAEEVFGEMHNNGMIGTNTKSCNIMLRGYLSA 530

Query: 831  GNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVK-KPVSLKLSKEQREILIGLL 890
             +Y KAEK+YD+M +KKYD+    +EKL   L L++K +K K VS+KL +EQREILIGLL
Sbjct: 531  EDYQKAEKVYDMMSKKKYDVQADSLEKLQSGLLLNKKVIKPKTVSMKLDQEQREILIGLL 590

Query: 891  LGGLEIESDEERKNHRIHFEFLKNSNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKF 950
            LGG  +ES  +R  H +HF+F ++SN+HS+LR HI+E++ EWL  AS+  D    IPY+F
Sbjct: 591  LGGTRMESYAQRGVHIVHFQFQEDSNAHSVLRVHIHERFFEWLSSASRSFDDGSKIPYQF 650

Query: 951  CTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRTSSGDILLKLKGSH 1010
             T+ H +F F+ DQF+ +G P +P LIHRWL+PRVLAYW+M+GG +  SGDI+LKL G +
Sbjct: 651  STIPHQHFSFFVDQFFLKGQPVLPKLIHRWLTPRVLAYWFMFGGSKLPSGDIVLKLSGGN 710

Query: 1011 -EGVEKIVKSLREKSMHCKVKRKGNMYWIGLLGSNATWFWKLIEPFILDYLKDSLHADSL 1070
             EGVE+IV SL  +S+  KVKRKG  +WIG  GSNA  FW++IEP +L+     +  +  
Sbjct: 711  SEGVERIVNSLHTQSLTSKVKRKGRFFWIGFQGSNAESFWRIIEPHVLNNFASLVTQEGS 770

Query: 1071 NLERVLNETENINFDSQSD 1087
            ++    + T++ + DS  D
Sbjct: 771  SIGS--DGTQDTDTDSDDD 780

BLAST of HG10012104 vs. ExPASy Swiss-Prot
Match: Q76C99 (Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica OX=39946 GN=Rf1 PE=2 SV=1)

HSP 1 Score: 72.4 bits (176), Expect = 3.6e-11
Identity = 81/341 (23.75%), Positives = 147/341 (43.11%), Query Frame = 0

Query: 502 KERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQ 561
           KE    K    + +++++G +P   T++ +I A   A     +++A  + N M++  G  
Sbjct: 208 KEGDSDKAYSTYHEMLDRGILPDVVTYNSIIAALCKAQ---AMDKAMEVLNTMVK-NGVM 267

Query: 562 PRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLIWLHSYQDTI 621
           P    +NS+        G  S    K+A      + + G+E     Y  L+         
Sbjct: 268 PDCMTYNSILH------GYCSSGQPKEAIGFLKKMRSDGVEPDVVTYSLLMDYLCKNGRC 327

Query: 622 DKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLTYFDGSMPSQ-AF 681
            + R +F    M + G+K E     ++L+  +  G +VE       L   +G  P    F
Sbjct: 328 MEARKIF--DSMTKRGLKPEITTYGTLLQGYATKGALVEM-HGLLDLMVRNGIHPDHYVF 387

Query: 682 VYKMEVYSKMGRPMKALEIFREMEQLN-STSAAAYQTVIGILCKFQEIELAESIMSGFIK 741
              +  Y+K G+  +A+ +F +M Q   + +A  Y  VIGILCK   +E A       I 
Sbjct: 388 SILICAYAKQGKVDQAMLVFSKMRQQGLNPNAVTYGAVIGILCKSGRVEDAMLYFEQMID 447

Query: 742 SNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK--CKPNRTIYSIYLDSLVKVGNIY 801
             L P    Y  L++     +  ++ E    + L++  C  N   ++  +DS  K G + 
Sbjct: 448 EGLSPGNIVYNSLIHGLCTCNKWERAEELILEMLDRGICL-NTIFFNSIIDSHCKEGRVI 507

Query: 802 KAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKI 839
           ++E++F  M   G +  N  + N +++GY L G   +A K+
Sbjct: 508 ESEKLFELMVRIG-VKPNVITYNTLINGYCLAGKMDEAMKL 533

BLAST of HG10012104 vs. ExPASy Swiss-Prot
Match: Q5G1S8 (Pentatricopeptide repeat-containing protein At3g18110, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=EMB1270 PE=2 SV=2)

HSP 1 Score: 70.1 bits (170), Expect = 1.8e-10
Identity = 68/309 (22.01%), Positives = 135/309 (43.69%), Query Frame = 0

Query: 505 KFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRL 564
           KFSK +E+ D +  +GCVP   +F+ LI A L +   G     +     M++  G +P  
Sbjct: 240 KFSKAQELVDAMRQRGCVPDLISFNTLINARLKS--GGLTPNLAVELLDMVRNSGLRPDA 299

Query: 565 SLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLIWLHSYQDTIDKE 624
             +N+L  A           +L  A  ++ ++     +     Y  +I ++       + 
Sbjct: 300 ITYNTLLSACS------RDSNLDGAVKVFEDMEAHRCQPDLWTYNAMISVYGRCGLAAEA 359

Query: 625 RIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLTYFDGSMPSQAFVYKM 684
             +F+  E++  G   +     S+L A ++  +  + +  +Q++           +   +
Sbjct: 360 ERLFM--ELELKGFFPDAVTYNSLLYAFARERNTEKVKEVYQQMQKMGFGKDEMTYNTII 419

Query: 685 EVYSKMGRPMKALEIFREMEQLN--STSAAAYQTVIGILCKFQEIELAESIMSGFIKSNL 744
            +Y K G+   AL+++++M+ L+  +  A  Y  +I  L K      A ++MS  +   +
Sbjct: 420 HMYGKQGQLDLALQLYKDMKGLSGRNPDAITYTVLIDSLGKANRTVEAAALMSEMLDVGI 479

Query: 745 KPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLDSLVKVGNIYKAEE 804
           KP +  Y  L+  +      ++ E TFS  L    KP+   YS+ LD L++     KA  
Sbjct: 480 KPTLQTYSALICGYAKAGKREEAEDTFSCMLRSGTKPDNLAYSVMLDVLLRGNETRKAWG 538

Query: 805 IFNQMETNG 811
           ++  M ++G
Sbjct: 540 LYRDMISDG 538

BLAST of HG10012104 vs. ExPASy Swiss-Prot
Match: P0C8Q6 (Putative pentatricopeptide repeat-containing protein At5g08310, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g08310 PE=3 SV=1)

HSP 1 Score: 68.6 bits (166), Expect = 5.2e-10
Identity = 63/274 (22.99%), Positives = 120/274 (43.80%), Query Frame = 0

Query: 473 AFRVYKWMLQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILI 532
           A+  + W  +Q  YR D      +A  + + R+ +  + +  D++N  C  S   F   I
Sbjct: 89  AYLFFNWASKQEGYRNDMYAYNAMASILSRARQNASLKALVVDVLNSRCFMSPGAFGFFI 148

Query: 533 VAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFI 592
               +A   G ++EAS++++R+ ++G   P    +N L  A       +SK +    E +
Sbjct: 149 RCLGNA---GLVDEASSVFDRVREMGLCVPNAYTYNCLLEA-------ISKSNSSSVELV 208

Query: 593 YHNL-VTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRA 652
              L        H D +     L  Y +T   ER + +  E+   G  +E  +   ++ +
Sbjct: 209 EARLKEMRDCGFHFDKFTLTPVLQVYCNTGKSERALSVFNEILSRGWLDE-HISTILVVS 268

Query: 653 SSKMGDVVEAERSWQKLTYFDGSMPSQAFVYKMEVYSKMGRPMKALEIFREMEQLN-STS 712
             K G V +A    + L   D  +  + +   +  + K  R  KA ++F +M ++  +  
Sbjct: 269 FCKWGQVDKAFELIEMLEERDIRLNYKTYCVLIHGFVKESRIDKAFQLFEKMRRMGMNAD 328

Query: 713 AAAYQTVIGILCKFQEIELAESIMSGFIKSNLKP 745
            A Y  +IG LCK +++E+A S+     +S + P
Sbjct: 329 IALYDVLIGGLCKHKDLEMALSLYLEIKRSGIPP 351

BLAST of HG10012104 vs. ExPASy TrEMBL
Match: A0A1S3CPK0 (pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103502781 PE=4 SV=1)

HSP 1 Score: 1459.5 bits (3777), Expect = 0.0e+00
Identity = 722/797 (90.59%), Positives = 756/797 (94.86%), Query Frame = 0

Query: 297  VFSMSIHTSAFSTVTLLRSLTLPFSPYHHYFRCPDYIVRTLFIPAYSVKGRRQLPRIPAF 356
            VFSMSI TSAFSTVTLLRSLTL  SPYHHYF  P++I+ TLFI +YSVK  RQLPRI AF
Sbjct: 2    VFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVK-VRQLPRIRAF 61

Query: 357  ASSSFVEQLVYDRDSPFESEEHLSSPYSNGADGFQSENGFASADLKHLGTPALEVKELDE 416
            AS SFV+QLVYDRDSP ESEEHLSSPYSNG DGF  ENGFAS DLKHLGTPALEVKELDE
Sbjct: 62   ASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGTPALEVKELDE 121

Query: 417  LPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDDATYLTVHCLRIRENETAFRV 476
            LPEQWRRSKLAWLCKELPAQKPGT+IRLLNAQ+KW+GQDDATYLTVHCLRIRENETAFRV
Sbjct: 122  LPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV 181

Query: 477  YKWMLQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYL 536
            YKWM+QQ WYRFDYAL+TKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYL
Sbjct: 182  YKWMMQQHWYRFDYALSTKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYL 241

Query: 537  SAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNL 596
            SAPVQGCIEEASTIYNRMIQLGGYQPRLSLH+SLFRALMSKPGDLSKHHLKQAEFIYHNL
Sbjct: 242  SAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFIYHNL 301

Query: 597  VTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMG 656
            VTSGLELHKDIYGGLIWLHSYQDTIDKERIV LRKEMQQAGIKEE+EVLLSILRASSKMG
Sbjct: 302  VTSGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMG 361

Query: 657  DVVEAERSWQKLTYFDGSMPSQAFVYKMEVYSKMGRPMKALEIFREMEQLNSTSAAAYQT 716
            DVVEAER WQKL Y DG+MP QAFVYKMEVY+KMG+PMKALEIFREMEQLNST+AAAYQT
Sbjct: 362  DVVEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMKALEIFREMEQLNSTNAAAYQT 421

Query: 717  VIGILCKFQEIELAESIMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKC 776
            +IGILCKFQEIELAESIM+GFI+SNLKPL PAYVD+MNMFFNLSLHDKLELTFSQCLEKC
Sbjct: 422  IIGILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKC 481

Query: 777  KPNRTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAE 836
            KPNRTIYSIYLDSLVKVGN+ +AEEIF+QMETNGEIG+NARSCN+IL GYLLFGNY+KAE
Sbjct: 482  KPNRTIYSIYLDSLVKVGNLDRAEEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAE 541

Query: 837  KIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILIGLLLGGLEIES 896
            KIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKP+SLKLSKEQREIL+GLLLGGLEIES
Sbjct: 542  KIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIES 601

Query: 897  DEERKNHRIHFEFLKNSNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYF 956
            DEERKNHRI FEF KN  +HS+LRRHIYEQYH+WLH ASKL+DGDIDIPYKFCTVSHSYF
Sbjct: 602  DEERKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYF 661

Query: 957  GFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVK 1016
            GFYADQFWPRG   IPNLIHRWLSPR LAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVK
Sbjct: 662  GFYADQFWPRGRQTIPNLIHRWLSPRALAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVK 721

Query: 1017 SLREKSMHCKVKRKGNMYWIGLLGSNATWFWKLIEPFILDYLKDSLHADSLNLERVLNET 1076
            SLREKSMHCKVKRKG+MYWIGLLGSNATWFWKLIEPFILD LK+S  ADSLNL  VLNET
Sbjct: 722  SLREKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNL-GVLNET 781

Query: 1077 ENINFDSQSDSVGEASN 1094
            ENINFDSQSDSV E SN
Sbjct: 782  ENINFDSQSDSVEETSN 796

BLAST of HG10012104 vs. ExPASy TrEMBL
Match: A0A0A0LBL0 (LAGLIDADG_2 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G625100 PE=4 SV=1)

HSP 1 Score: 1448.3 bits (3748), Expect = 0.0e+00
Identity = 708/797 (88.83%), Positives = 754/797 (94.60%), Query Frame = 0

Query: 297  VFSMSIHTSAFSTVTLLRSLTLPFSPYHHYFRCPDYIVRTLFIPAYSVKGRRQLPRIPAF 356
            VFSMSI TSAFSTVT LRSLTL  SPYHHYF CP++I+ TLF+PAYSVK RRQLPRI AF
Sbjct: 2    VFSMSIPTSAFSTVTRLRSLTLSLSPYHHYFHCPNHIIPTLFLPAYSVKVRRQLPRIRAF 61

Query: 357  ASSSFVEQLVYDRDSPFESEEHLSSPYSNGADGFQSENGFASADLKHLGTPALEVKELDE 416
            AS SFV+QLVYD DSP ESEEHLSS +SNG DGF  ENGFAS DLKHLGTP LEVKELDE
Sbjct: 62   ASGSFVKQLVYDHDSPSESEEHLSSSFSNGGDGFHFENGFASVDLKHLGTPVLEVKELDE 121

Query: 417  LPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDDATYLTVHCLRIRENETAFRV 476
            LPEQWRRSK+AWLCKELPAQKPGT+IRLLNAQKKW+GQDDATYL VHCLRIRENETAFRV
Sbjct: 122  LPEQWRRSKVAWLCKELPAQKPGTVIRLLNAQKKWMGQDDATYLIVHCLRIRENETAFRV 181

Query: 477  YKWMLQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYL 536
            YKWM+QQ WYRFDYAL+TKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYL
Sbjct: 182  YKWMMQQHWYRFDYALSTKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYL 241

Query: 537  SAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNL 596
            SAPVQGCIEEASTIYNRMIQLGGYQPRLSLH+SLFRAL+SKPGDLSKHHLKQAEFIYHNL
Sbjct: 242  SAPVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALVSKPGDLSKHHLKQAEFIYHNL 301

Query: 597  VTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMG 656
            VTSGLELHKD+YGGLIWLHSYQDTID+ERIV LRKEMQQAGIKEEREVLLSILRASSKMG
Sbjct: 302  VTSGLELHKDMYGGLIWLHSYQDTIDRERIVSLRKEMQQAGIKEEREVLLSILRASSKMG 361

Query: 657  DVVEAERSWQKLTYFDGSMPSQAFVYKMEVYSKMGRPMKALEIFREMEQLNSTSAAAYQT 716
            DV+EAE+ WQ+L Y DG+MPSQAFVYKMEVY+KMG+PMKALEIFREMEQLNST+AAAYQT
Sbjct: 362  DVMEAEKLWQELKYLDGNMPSQAFVYKMEVYAKMGKPMKALEIFREMEQLNSTNAAAYQT 421

Query: 717  VIGILCKFQEIELAESIMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKC 776
            +IGILCKFQ IELAESIM+GFI+SNLKPL PAYVDLMNMFFNL+L DKLELTFSQCLEKC
Sbjct: 422  IIGILCKFQVIELAESIMAGFIESNLKPLTPAYVDLMNMFFNLNLDDKLELTFSQCLEKC 481

Query: 777  KPNRTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAE 836
            KPNRTIYSIYLDSLVKVGN+ +AEEIF+QMETNGEIGINARSCNIIL GYLL GNY+KAE
Sbjct: 482  KPNRTIYSIYLDSLVKVGNLDRAEEIFSQMETNGEIGINARSCNIILRGYLLCGNYMKAE 541

Query: 837  KIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILIGLLLGGLEIES 896
            KIYDLMCQK+YDIDPPLMEKL+Y+LSLSRKEVKKP+SLKLSKEQREIL+GLLLGGLEIES
Sbjct: 542  KIYDLMCQKRYDIDPPLMEKLEYILSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIES 601

Query: 897  DEERKNHRIHFEFLKNSNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYF 956
            D+ERKNHRI FEF +N  +HS+LRRHIYEQYH+WLH ASKL+DGD+DIPYKFCTVSHSYF
Sbjct: 602  DDERKNHRIQFEFHRNCKTHSVLRRHIYEQYHKWLHSASKLTDGDVDIPYKFCTVSHSYF 661

Query: 957  GFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVK 1016
            GFYADQFWPRG  AIPNLIHRWLSPRVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVK
Sbjct: 662  GFYADQFWPRGRRAIPNLIHRWLSPRVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVK 721

Query: 1017 SLREKSMHCKVKRKGNMYWIGLLGSNATWFWKLIEPFILDYLKDSLHADSLNLERVLNET 1076
            SLREKS+HCKVKRKGNMYWIGLLGSNATWFWKLIEPFILDYLK+S  ADSLNL  VLN +
Sbjct: 722  SLREKSIHCKVKRKGNMYWIGLLGSNATWFWKLIEPFILDYLKESTQADSLNLVGVLNGS 781

Query: 1077 ENINFDSQSDSVGEASN 1094
            ENINFDS+SDSV E SN
Sbjct: 782  ENINFDSESDSVEETSN 798

BLAST of HG10012104 vs. ExPASy TrEMBL
Match: A0A6J1KB64 (pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111493350 PE=4 SV=1)

HSP 1 Score: 1401.0 bits (3625), Expect = 0.0e+00
Identity = 689/794 (86.78%), Positives = 739/794 (93.07%), Query Frame = 0

Query: 300  MSIHTSAFSTVTLLRSLTLPFSPYHHYFRCPDYIVRTLFIPAYSVKGRRQLPRIPAFASS 359
            MSI TSAF+TVTLLRSLTLPFS  H++FRC +Y++R+L IP YS KGRRQLPRIPAFASS
Sbjct: 1    MSIRTSAFATVTLLRSLTLPFSQCHNHFRCWNYVIRSLSIPTYSAKGRRQLPRIPAFASS 60

Query: 360  SFVEQLVYDRDSPFESEEHLSSPYSNGADGFQSENGFASADLKHLGTPALEVKELDELPE 419
            S VE LVYDRDSP ESEE L SPYSNGA+       FASADLKHLG PALEVKELDELPE
Sbjct: 61   SSVEALVYDRDSPAESEEPLCSPYSNGAE------EFASADLKHLGAPALEVKELDELPE 120

Query: 420  QWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDDATYLTVHCLRIRENETAFRVYKW 479
            QWRRSKLAWLCKELPA KPGTLIRLLNAQ+KW+ QDDA YL VHCLRIRENETAFRVYKW
Sbjct: 121  QWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKW 180

Query: 480  MLQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP 539
            M+QQ WYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP
Sbjct: 181  MMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP 240

Query: 540  VQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTS 599
            VQGCIEEASTIYNRMIQLGGY PRLSLHNSLF+AL+SKPGDLSKHHLKQAEFIYHNLVT+
Sbjct: 241  VQGCIEEASTIYNRMIQLGGYPPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLVTT 300

Query: 600  GLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVV 659
            GLELHKDIYGGLIWLHSYQDT+DKERI+ LRKEMQQAGI+EEREVL+SILRASSK+GDV+
Sbjct: 301  GLELHKDIYGGLIWLHSYQDTVDKERIMSLRKEMQQAGIEEEREVLVSILRASSKLGDVM 360

Query: 660  EAERSWQKLTYFDGSMPSQAFVYKMEVYSKMGRPMKALEIFREMEQLNSTSAAAYQTVIG 719
            EAERSW K+  FDGSMPSQAFVYKMEVY+K+G PMKALEIFREMEQLNS S+AAYQT+IG
Sbjct: 361  EAERSWLKIKSFDGSMPSQAFVYKMEVYAKVGNPMKALEIFREMEQLNSISSAAYQTIIG 420

Query: 720  ILCKFQEIELAESIMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPN 779
            ILCKF+E+ LAES+M+GFIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPN
Sbjct: 421  ILCKFEEVTLAESVMAGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPN 480

Query: 780  RTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIY 839
            RTIYSIYL+SLVKVGN+ +AEEIF+QM+TNGEIG++ARSCNIILSGYLL G+YLKAEKIY
Sbjct: 481  RTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIY 540

Query: 840  DLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILIGLLLGGLEIESDEE 899
            DLMCQKKYDIDPPLMEKLDYVLSLSRKE+KKPVSLKLSKEQREIL+GLLLGGLEIESDE 
Sbjct: 541  DLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEG 600

Query: 900  RKNHRIHFEFLKNSNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFY 959
            RKNHRI FEF ++ ++HS LRRH+YEQYHEWLHPASKLSD D DIPYKFCTVSHSYFGFY
Sbjct: 601  RKNHRIQFEFHEDCSTHSCLRRHVYEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFY 660

Query: 960  ADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVKSLR 1019
            ADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCR SSGD +LKLKGS EGV KIVKSLR
Sbjct: 661  ADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVVKIVKSLR 720

Query: 1020 EKSMHCKVKRKGNMYWIGLLGSNATWFWKLIEPFILDYLKDSLHADSLNLERVLNETENI 1079
            EKSM CKVKRKG +YWIGLLGSNATWFWKLIEPFILD LKDSL AD+LNLE+ +NET NI
Sbjct: 721  EKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADNLNLEKAVNETYNI 780

Query: 1080 NFDSQSDSVGEASN 1094
            NFDSQSDS  EAS+
Sbjct: 781  NFDSQSDSDEEASS 788

BLAST of HG10012104 vs. ExPASy TrEMBL
Match: A0A6J1GB98 (pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111452602 PE=4 SV=1)

HSP 1 Score: 1401.0 bits (3625), Expect = 0.0e+00
Identity = 688/794 (86.65%), Positives = 735/794 (92.57%), Query Frame = 0

Query: 300  MSIHTSAFSTVTLLRSLTLPFSPYHHYFRCPDYIVRTLFIPAYSVKGRRQLPRIPAFASS 359
            MSI TSAF+TVTLLRSLTLPFS  HH+FRC +Y++R+L IP YS KGRRQLPRIPAFASS
Sbjct: 1    MSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPTYSAKGRRQLPRIPAFASS 60

Query: 360  SFVEQLVYDRDSPFESEEHLSSPYSNGADGFQSENGFASADLKHLGTPALEVKELDELPE 419
            S VE LVYDRDSP ESEE L SPYS GA+      GFASADLKHLG PALEVKELDELPE
Sbjct: 61   SSVEALVYDRDSPAESEEPLCSPYSTGAE------GFASADLKHLGAPALEVKELDELPE 120

Query: 420  QWRRSKLAWLCKELPAQKPGTLIRLLNAQKKWLGQDDATYLTVHCLRIRENETAFRVYKW 479
            QWRRSKLAWLCKELPAQKPGTLIRLLNAQ+KW+ QDDA YL VHCLRIRENETAFRVYKW
Sbjct: 121  QWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKW 180

Query: 480  MLQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP 539
            M+QQ WYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP
Sbjct: 181  MMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP 240

Query: 540  VQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTS 599
            +QGCIEE+STIYNRMIQLGGYQPRLSLHNSLF+AL+SKPGDLSKHHLKQAEFIYHNL T+
Sbjct: 241  IQGCIEESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATT 300

Query: 600  GLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVV 659
            GLELHKDIYGGLIWLHSYQDT+DKERI+ LRKEM QAGI+EEREVL+SILRASSK+GDV+
Sbjct: 301  GLELHKDIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVM 360

Query: 660  EAERSWQKLTYFDGSMPSQAFVYKMEVYSKMGRPMKALEIFREMEQLNSTSAAAYQTVIG 719
            EAERSW KL  FDGSMPSQAFVYKMEVY+K+G PMKA EIFREMEQLNS SAAAYQT+IG
Sbjct: 361  EAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLNSISAAAYQTIIG 420

Query: 720  ILCKFQEIELAESIMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPN 779
            ILCKF+E+ LAES+M GFIKSNLKPL PAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPN
Sbjct: 421  ILCKFEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPN 480

Query: 780  RTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIY 839
            RTIYSIYL+SLVKVGN+ +AEEIF+QM+TNGEIG++ARSCNIILSGYLL G+YLKAEKIY
Sbjct: 481  RTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIY 540

Query: 840  DLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPVSLKLSKEQREILIGLLLGGLEIESDEE 899
            DLMCQKKYDIDPPLMEKLDYVLSLSRKE+KKPVSLKLSKEQREIL+GLLLGGLEIESDE 
Sbjct: 541  DLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEG 600

Query: 900  RKNHRIHFEFLKNSNSHSLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFY 959
            RKNHRI FEF ++ ++HS LRRHI+EQYHEWLHPASKLSD D DIPYKFCTVSHSYFGFY
Sbjct: 601  RKNHRIQFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFY 660

Query: 960  ADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVKSLR 1019
            ADQFWPRGHP IPNLIHRWLSPRVLAYWYMYGGCR SSGD +LKLKGS EGV KIVKSLR
Sbjct: 661  ADQFWPRGHPVIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSLR 720

Query: 1020 EKSMHCKVKRKGNMYWIGLLGSNATWFWKLIEPFILDYLKDSLHADSLNLERVLNETENI 1079
            EKSM CKVKRKG +YWIGLLGSNATWFWKLIEPFILD LKDSL ADSLN+E+  NET NI
Sbjct: 721  EKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADSLNMEKAANETYNI 780

Query: 1080 NFDSQSDSVGEASN 1094
            NFDSQSDS  EAS+
Sbjct: 781  NFDSQSDSDEEASS 788

BLAST of HG10012104 vs. ExPASy TrEMBL
Match: A0A5N6QQ61 (LAGLIDADG_2 domain-containing protein OS=Carpinus fangiana OX=176857 GN=FH972_005424 PE=4 SV=1)

HSP 1 Score: 1089.3 bits (2816), Expect = 0.0e+00
Identity = 549/831 (66.06%), Positives = 660/831 (79.42%), Query Frame = 0

Query: 261  YYPRRGSWITTDSEFLPNSALSLPTLHFLSNRNPPSVFSMSIHTSAFSTVTLLRSLTLPF 320
            Y P     +T+ S FL  SALS       SN  P + F   +     S+++ LRSL+L  
Sbjct: 5    YPPSATLALTSSSVFL--SALS------SSNAYPTNPF---LSLPMRSSLSSLRSLSLSL 64

Query: 321  SPYH--HYFRCPDYIVRTLFIPAY-SVKGRRQLPRIPAFASSSFVEQLVYDRDSPFESEE 380
            S +H  HYF       R++F PA+ S    R+   + A + S+ VE L  +   P   E 
Sbjct: 65   SHHHRSHYF-------RSIFAPAFCSFPKPRKFLSLRALSRSTSVEHLACEVSRPETEEL 124

Query: 381  HLSSPYSNGADGFQSENGFASADLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQK 440
               S  S+    F  +    S DLK L  PAL+VKEL +LPEQWRRSKLAWLCKELPA K
Sbjct: 125  WNFSNNSDSEAAFDFDKNVGSLDLKRLEVPALDVKELGDLPEQWRRSKLAWLCKELPAHK 184

Query: 441  PGTLIRLLNAQKKWLGQDDATYLTVHCLRIRENETAFRVYKWMLQQRWYRFDYALATKLA 500
             GTLIR+LNAQ+KW+ Q DATY+ VHC+RIRENET F+VYKWM+QQ WY+FD+ALATKLA
Sbjct: 185  GGTLIRVLNAQRKWVRQQDATYVAVHCMRIRENETGFKVYKWMMQQHWYQFDFALATKLA 244

Query: 501  DYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQL 560
            DYMGKERKFSKCRE+FDDIINQG VP ESTFHILI+AYLSAP+Q C+EEA +IYNRMIQL
Sbjct: 245  DYMGKERKFSKCREIFDDIINQGRVPCESTFHILIIAYLSAPIQVCLEEACSIYNRMIQL 304

Query: 561  GGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLIWLHSY 620
            GGYQP+LSLHN LFRAL+SKPG  SK +LKQAEFI+HNL+TSGLE+HKDIYGGLIWLHSY
Sbjct: 305  GGYQPQLSLHNCLFRALVSKPGASSKQYLKQAEFIFHNLLTSGLEIHKDIYGGLIWLHSY 364

Query: 621  QDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLTYFDGSMPS 680
            QDTID+ERI  L+KEM+ AGI+E REVLLSILRA SK  +V EAER+W KL   DG +P 
Sbjct: 365  QDTIDRERIASLKKEMEDAGIEEGREVLLSILRACSKESNVEEAERTWLKLLQLDGGIPH 424

Query: 681  QAFVYKMEVYSKMGRPMKALEIFREM-EQLNSTSAAAYQTVIGILCKFQEIELAESIMSG 740
             AFVYKMEVY+K+G PMK+LEIFREM E+L+STS AAY  +I +LCK QE+ELAES+M  
Sbjct: 425  LAFVYKMEVYAKVGEPMKSLEIFREMQEKLSSTSIAAYHEIIQVLCKAQEVELAESLMIE 484

Query: 741  FIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNI 800
            FIKSNLKPL P+Y+D+MN++FNL+LHDKLEL FSQ L+KC+PN T+YSIYLDSLV +GN+
Sbjct: 485  FIKSNLKPLTPSYIDMMNLYFNLNLHDKLELAFSQSLDKCRPNCTMYSIYLDSLVTIGNL 544

Query: 801  YKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEK 860
             KAEEIFNQM +NG IG+N+RSCN IL GYL  G+Y+KAEKIYDLMCQKKY ID PLMEK
Sbjct: 545  DKAEEIFNQMRSNGAIGVNSRSCNTILRGYLSSGDYVKAEKIYDLMCQKKYQIDSPLMEK 604

Query: 861  LDYVLSLSRKEVKKPVSLKLSKEQREILIGLLLGGLEIESDEERKNHRIHFEFLKNSNSH 920
            +DYVLSLSR+ VKKPVS+KLSKEQREIL+GLLLGGL+IESDEERK+H + FEF +NS++H
Sbjct: 605  IDYVLSLSRQHVKKPVSMKLSKEQREILVGLLLGGLQIESDEERKSHMLRFEFRENSSTH 664

Query: 921  SLLRRHIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIH 980
             +L+RHI++QYHEWLHP+ K  +G  DIP +FCT+SHSYFGFYADQFWP+G P IP LIH
Sbjct: 665  YVLKRHIHDQYHEWLHPSCKPGEGADDIPCRFCTISHSYFGFYADQFWPKGRPVIPKLIH 724

Query: 981  RWLSPRVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGNMYWI 1040
            RWLSPRVLAYWYMYGG RTSSGDILL+LKG+HEGVEK+  +L EKS+ C++KRKG+++WI
Sbjct: 725  RWLSPRVLAYWYMYGGYRTSSGDILLRLKGNHEGVEKVANALMEKSLDCRMKRKGSVFWI 784

Query: 1041 GLLGSNATWFWKLIEPFILDYLKDSLHADSLNLERVLNETENINFDSQSDS 1088
            G LGSN+ WFWKLIEP++LD +KD L A    LE    ET++I++DS S+S
Sbjct: 785  GFLGSNSLWFWKLIEPYVLDDMKDFLKAGVATLENSSVETQDIDYDSGSES 817

BLAST of HG10012104 vs. TAIR 10
Match: AT2G15820.1 (endonucleases )

HSP 1 Score: 895.2 bits (2312), Expect = 5.3e-260
Identity = 458/833 (54.98%), Positives = 616/833 (73.95%), Query Frame = 0

Query: 278  NSALSLPTLHFLSNRNPPSVFSMSIHTSAFSTVTLLRSLTLPFSPYHHYFRCPDYIVRTL 337
            +S +S+ T +  S  + P++ +        S+ TL RSL+  FS   H        +R L
Sbjct: 30   SSTVSVTTFNISSLSSNPNIIN--------SSSTLFRSLS--FSLIRHRSSYSRRSLRRL 89

Query: 338  FIPAYSVKGRRQLPRIPAFASSSFVEQLVYDRDSPFESE----EHLSSPYSNGADGFQSE 397
             I  ++V G     +   F+ SS     ++  +S  +      EHL+   +   +G    
Sbjct: 90   SI--HTVHGN----KTQFFSHSSTRTPPLFTANSTAQRSGTFVEHLTG-ITESEEGISEA 149

Query: 398  NGF-----ASADLKHLGTPAL----EVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRL 457
            NGF     A  D++++ T  +    EV+EL+ELPE+WRRSKLAWLCKE+P  K  TL+RL
Sbjct: 150  NGFGDVESARNDIRNVATRRIETEFEVRELEELPEEWRRSKLAWLCKEVPTHKAVTLVRL 209

Query: 458  LNAQKKWLGQDDATYLTVHCLRIRENETAFRVYKWMLQQRWYRFDYALATKLADYMGKER 517
            LNAQKKW+ Q+DATY++VHC+RIRENET FRVY+WM QQ WYRFD+ L TKLA+Y+GKER
Sbjct: 210  LNAQKKWVRQEDATYISVHCMRIRENETGFRVYRWMTQQNWYRFDFGLTTKLAEYLGKER 269

Query: 518  KFSKCREVFDDIINQGCVPSESTFHILIVAYLSA-PVQGCIEEASTIYNRMIQLGGYQPR 577
            KF+KCREVFDD++NQG VPSESTFHIL+VAYLS+  V+GC+EEA ++YNRMIQLGGY+PR
Sbjct: 270  KFTKCREVFDDVLNQGRVPSESTFHILVVAYLSSLSVEGCLEEACSVYNRMIQLGGYKPR 329

Query: 578  LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLIWLHSYQDTIDK 637
            LSLHNSLFRAL+SK G +    LKQAEFI+HN+VT+GLE+ KDIY GLIWLHS QD +D 
Sbjct: 330  LSLHNSLFRALVSKQGGILNDQLKQAEFIFHNVVTTGLEVQKDIYSGLIWLHSCQDEVDI 389

Query: 638  ERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLTYFDGSMPSQAFVYK 697
             RI  LR+EM++AG +E +EV++S+LRA +K G V E ER+W +L   D  +PSQAFVYK
Sbjct: 390  GRINSLREEMKKAGFQESKEVVVSLLRAYAKEGGVEEVERTWLELLDLDCGIPSQAFVYK 449

Query: 698  MEVYSKMGRPMKALEIFREMEQ-LNSTSAAAYQTVIGILCKFQEIELAESIMSGFIKSNL 757
            +E YSK+G   KA+EIFREME+ +   + + Y  +I +LCK Q++EL E++M  F +S  
Sbjct: 450  IEAYSKVGDFAKAMEIFREMEKHIGGATMSGYHKIIEVLCKVQQVELVETLMKEFEESGK 509

Query: 758  KPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLDSLVKVGNIYKAEEI 817
            KPL+P+++++  M+F+L LH+KLE+ F QCLEKC+P++ IY+IYLDSL K+GN+ KA ++
Sbjct: 510  KPLLPSFIEIAKMYFDLGLHEKLEMAFVQCLEKCQPSQPIYNIYLDSLTKIGNLEKAGDV 569

Query: 818  FNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLS 877
            FN+M+ NG I ++ARSCN +L GYL  G  ++AE+IYDLM  KKY+I+PPLMEKLDY+LS
Sbjct: 570  FNEMKNNGTINVSARSCNSLLKGYLDCGKQVQAERIYDLMRMKKYEIEPPLMEKLDYILS 629

Query: 878  LSRKEVKK-PVSLKLSKEQREILIGLLLGGLEIESDEERKNHRIHFEFLKNSNSHSLLRR 937
            L +KEVKK P S+KLSK+QRE+L+GLLLGGL+IESD+E+K+H I FEF +NS +H +L++
Sbjct: 630  LKKKEVKKRPFSMKLSKDQREVLVGLLLGGLQIESDKEKKSHMIKFEFRENSQAHLVLKQ 689

Query: 938  HIYEQYHEWLHPASKLSDGDIDIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSP 997
            +I++Q+ EWLHP S   + DI IP++F +V HSYFGFYA+ +WP+G P IP LIHRWLSP
Sbjct: 690  NIHDQFREWLHPLSNFQE-DI-IPFEFYSVPHSYFGFYAEHYWPKGQPEIPKLIHRWLSP 749

Query: 998  RVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVKSLREKSMHCKVKRKGNMYWIGLLGS 1057
              LAYWYMY G +TSSGDI+L+LKGS EGVEK+VK+L+ KSM C+VK+KG ++WIGL G+
Sbjct: 750  HSLAYWYMYSGVKTSSGDIILRLKGSLEGVEKVVKALQAKSMECRVKKKGKVFWIGLQGT 809

Query: 1058 NATWFWKLIEPFILDYLKDSLHADSLNLERVLN-ETENINFDSQSDSVGEASN 1094
            N+  FWKLIEP +L+ LK+ L   S +L+ V   E ++INF S SD   +  N
Sbjct: 810  NSALFWKLIEPHVLENLKEHLKPASESLDNVKEAEEQSINFKSNSDHSDDCVN 843
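
The per-HSP statistics line in these reports gives identity and positives as a count over the aligned length, followed by a percentage. The following is a minimal sketch, not part of the original report, of how such a line could be parsed and the percentages recomputed as a sanity check. The function name `parse_hsp_stats` and the dictionary keys are hypothetical, chosen only for illustration. The pattern accepts both "Positives" and the misspelling "Postives".

```python
import re

# Pattern for the per-HSP statistics line. "Posi?tives" also matches the
# misspelled label "Postives" (assumption: both spellings may occur).
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and percentages from one HSP statistics line, or None."""
    m = HSP_STATS.search(line)
    if m is None:
        return None
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        # Percentages as printed in the report.
        "identity_pct_reported": float(ident_pct),
        "positives_pct_reported": float(pos_pct),
        # Percentages recomputed from the raw counts.
        "identity_pct_computed": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct_computed": round(100 * int(pos) / int(pos_len), 2),
    }


stats = parse_hsp_stats(
    "Identity = 458/833 (54.98%), Positives = 616/833 (73.95%), Query Frame = 0"
)
```

For the AT2G15820.1 hit above, the recomputed values (458/833 and 616/833) agree with the reported 54.98% and 73.95%.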

BLAST of HG10012104 vs. TAIR 10
Match: AT2G35130.2 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 73.2 bits (178), Expect = 1.5e-12
Identity = 91/433 (21.02%), Positives = 190/433 (43.88%), Query Frame = 0

Query: 426 LAWLCKELPAQKPGTLIRLL-NAQKKWLGQDDATYLTVHCLRIRENETAFRVYKWMLQQR 485
           L+++ KE    K   ++  L +    W   DD   ++V     ++ ++   V +W+L++ 
Sbjct: 115 LSFIQKETDPDKVADVLGALPSTHASW---DDLINVSVQLRLNKKWDSIILVCEWILRKS 174

Query: 486 WYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCI 545
            ++ D      L D  G++ ++ +   ++  ++    VP+E T+ +LI AY  A   G I
Sbjct: 175 SFQPDVICFNLLIDAYGQKFQYKEAESLYVQLLESRYVPTEDTYALLIKAYCMA---GLI 234

Query: 546 EEASTIYNRMIQLGGYQPR---LSLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGL 605
           E A  +   M Q     P+   ++++N+    LM + G     + ++A  ++  +     
Sbjct: 235 ERAEVVLVEM-QNHHVSPKTIGVTVYNAYIEGLMKRKG-----NTEEAIDVFQRMKRDRC 294

Query: 606 ELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEA 665
           +   + Y   + ++ Y           L  EM+    K       +++ A ++ G   +A
Sbjct: 295 KPTTETYN--LMINLYGKASKSYMSWKLYCEMRSHQCKPNICTYTALVNAFAREGLCEKA 354

Query: 666 ERSWQKLTYFDGSMPSQAFVYK--MEVYSKMGRPMKALEIFREMEQLN-STSAAAYQTVI 725
           E  +++L   DG  P   +VY   ME YS+ G P  A EIF  M+ +      A+Y  ++
Sbjct: 355 EEIFEQLQE-DGLEP-DVYVYNALMESYSRAGYPYGAAEIFSLMQHMGCEPDRASYNIMV 414

Query: 726 GILCKFQEIELAESIMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CK 785
               +      AE++     +  + P M +++ L++ +       K E    +  E   +
Sbjct: 415 DAYGRAGLHSDAEAVFEEMKRLGIAPTMKSHMLLLSAYSKARDVTKCEAIVKEMSENGVE 474

Query: 786 PNRTIYSIYLDSLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEK 845
           P+  + +  L+   ++G   K E+I  +ME NG    +  + NI+++ Y   G   + E+
Sbjct: 475 PDTFVLNSMLNLYGRLGQFTKMEKILAEME-NGPCTADISTYNILINIYGKAGFLERIEE 530

Query: 846 IYDLMCQKKYDID 851
           ++  + +K +  D
Sbjct: 535 LFVELKEKNFRPD 530

BLAST of HG10012104 vs. TAIR 10
Match: AT3G18110.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 70.1 bits (170), Expect = 1.3e-11
Identity = 68/309 (22.01%), Positives = 135/309 (43.69%), Query Frame = 0

Query: 505 KFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRMIQLGGYQPRL 564
           KFSK +E+ D +  +GCVP   +F+ LI A L +   G     +     M++  G +P  
Sbjct: 240 KFSKAQELVDAMRQRGCVPDLISFNTLINARLKS--GGLTPNLAVELLDMVRNSGLRPDA 299

Query: 565 SLHNSLFRALMSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLIWLHSYQDTIDKE 624
             +N+L  A           +L  A  ++ ++     +     Y  +I ++       + 
Sbjct: 300 ITYNTLLSACS------RDSNLDGAVKVFEDMEAHRCQPDLWTYNAMISVYGRCGLAAEA 359

Query: 625 RIVFLRKEMQQAGIKEEREVLLSILRASSKMGDVVEAERSWQKLTYFDGSMPSQAFVYKM 684
             +F+  E++  G   +     S+L A ++  +  + +  +Q++           +   +
Sbjct: 360 ERLFM--ELELKGFFPDAVTYNSLLYAFARERNTEKVKEVYQQMQKMGFGKDEMTYNTII 419

Query: 685 EVYSKMGRPMKALEIFREMEQLN--STSAAAYQTVIGILCKFQEIELAESIMSGFIKSNL 744
            +Y K G+   AL+++++M+ L+  +  A  Y  +I  L K      A ++MS  +   +
Sbjct: 420 HMYGKQGQLDLALQLYKDMKGLSGRNPDAITYTVLIDSLGKANRTVEAAALMSEMLDVGI 479

Query: 745 KPLMPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CKPNRTIYSIYLDSLVKVGNIYKAEE 804
           KP +  Y  L+  +      ++ E TFS  L    KP+   YS+ LD L++     KA  
Sbjct: 480 KPTLQTYSALICGYAKAGKREEAEDTFSCMLRSGTKPDNLAYSVMLDVLLRGNETRKAWG 538

Query: 805 IFNQMETNG 811
           ++  M ++G
Sbjct: 540 LYRDMISDG 538

BLAST of HG10012104 vs. TAIR 10
Match: AT5G08310.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 68.6 bits (166), Expect = 3.7e-11
Identity = 63/274 (22.99%), Positives = 120/274 (43.80%), Query Frame = 0

Query: 473 AFRVYKWMLQQRWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILI 532
           A+  + W  +Q  YR D      +A  + + R+ +  + +  D++N  C  S   F   I
Sbjct: 89  AYLFFNWASKQEGYRNDMYAYNAMASILSRARQNASLKALVVDVLNSRCFMSPGAFGFFI 148

Query: 533 VAYLSAPVQGCIEEASTIYNRMIQLGGYQPRLSLHNSLFRALMSKPGDLSKHHLKQAEFI 592
               +A   G ++EAS++++R+ ++G   P    +N L  A       +SK +    E +
Sbjct: 149 RCLGNA---GLVDEASSVFDRVREMGLCVPNAYTYNCLLEA-------ISKSNSSSVELV 208

Query: 593 YHNL-VTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVLLSILRA 652
              L        H D +     L  Y +T   ER + +  E+   G  +E  +   ++ +
Sbjct: 209 EARLKEMRDCGFHFDKFTLTPVLQVYCNTGKSERALSVFNEILSRGWLDE-HISTILVVS 268

Query: 653 SSKMGDVVEAERSWQKLTYFDGSMPSQAFVYKMEVYSKMGRPMKALEIFREMEQLN-STS 712
             K G V +A    + L   D  +  + +   +  + K  R  KA ++F +M ++  +  
Sbjct: 269 FCKWGQVDKAFELIEMLEERDIRLNYKTYCVLIHGFVKESRIDKAFQLFEKMRRMGMNAD 328

Query: 713 AAAYQTVIGILCKFQEIELAESIMSGFIKSNLKP 745
            A Y  +IG LCK +++E+A S+     +S + P
Sbjct: 329 IALYDVLIGGLCKHKDLEMALSLYLEIKRSGIPP 351

BLAST of HG10012104 vs. TAIR 10
Match: AT1G74850.1 (plastid transcriptionally active 2 )

HSP 1 Score: 68.2 bits (165), Expect = 4.8e-11
Identity = 102/497 (20.52%), Positives = 195/497 (39.24%), Query Frame = 0

Query: 439 GTLIRLLNAQKKWLGQDDATYLTVHCLRIRENETAFRVYKWMLQQRWYRFDYALATKLAD 498
           G++ R L+  K  L  +D   +        + + + R++K+M +Q W + +  + T +  
Sbjct: 90  GSIARCLDIFKNKLSLNDFALVFKEFAGRGDWQRSLRLFKYMQRQIWCKPNEHIYTIMIS 149

Query: 499 YMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAY------------LSAPVQGCIEE 558
            +G+E    KC EVFD++ +QG   S  ++  LI AY            L       I  
Sbjct: 150 LLGREGLLDKCLEVFDEMPSQGVSRSVFSYTALINAYGRNGRYETSLELLDRMKNEKISP 209

Query: 559 ASTIYNRMIQ------------LG--------GYQPRLSLHNSLFRALMSKP-GDLSKHH 618
           +   YN +I             LG        G QP +  +N+L  A   +  GD     
Sbjct: 210 SILTYNTVINACARGGLDWEGLLGLFAEMRHEGIQPDIVTYNTLLSACAIRGLGD----- 269

Query: 619 LKQAEFIYHNLVTSGLELHKDIYGGLIWLHSYQDTIDKERIVFLRKEMQQAGIKEEREVL 678
             +AE ++  +   G+      Y  L+   ++      E++  L  EM   G   +    
Sbjct: 270 --EAEMVFRTMNDGGIVPDLTTYSHLV--ETFGKLRRLEKVCDLLGEMASGGSLPDITSY 329

Query: 679 LSILRASSKMGDVVEAERSWQKLTYFDGSMPSQAFVYKMEVYSKMGRPMKALEIFREMEQ 738
             +L A +K G + EA   + ++     +  +  +   + ++ + GR     ++F EM+ 
Sbjct: 330 NVLLEAYAKSGSIKEAMGVFHQMQAAGCTPNANTYSVLLNLFGQSGRYDDVRQLFLEMKS 389

Query: 739 LNS-TSAAAYQTVIGILCK---FQEI--------------------------------EL 798
            N+   AA Y  +I +  +   F+E+                                E 
Sbjct: 390 SNTDPDAATYNILIEVFGEGGYFKEVVTLFHDMVEENIEPDMETYEGIIFACGKGGLHED 449

Query: 799 AESIMSGFIKSNLKPLMPAYVDLMNMFFNLSLHDKLELTFSQCLE-KCKPNRTIYSIYLD 858
           A  I+     +++ P   AY  ++  F   +L+++  + F+   E    P+   +   L 
Sbjct: 450 ARKILQYMTANDIVPSSKAYTGVIEAFGQAALYEEALVAFNTMHEVGSNPSIETFHSLLY 509

Query: 859 SLVKVGNIYKAEEIFNQMETNGEIGINARSCNIILSGYLLFGNYLKAEKIYDLMCQKKYD 866
           S  + G + ++E I +++  +G I  N  + N  +  Y   G + +A K Y  M + + D
Sbjct: 510 SFARGGLVKESEAILSRLVDSG-IPRNRDTFNAQIEAYKQGGKFEEAVKTYVDMEKSRCD 569

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038887990.1 | 0.0e+00 | 91.18 | pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Benincasa ...
XP_008465080.1 | 0.0e+00 | 90.59 | PREDICTED: pentatricopeptide repeat-containing protein At2g15820, chloroplastic ...
XP_004152074.2 | 0.0e+00 | 88.83 | pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucumis sa...
XP_022998786.1 | 0.0e+00 | 86.78 | pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita ...
XP_022949171.1 | 0.0e+00 | 86.65 | pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita ...
Match Name | E-value | Identity (%) | Description
Q9XIL5 | 7.4e-259 | 54.98 | Pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Arabidop...
Q6ZHJ5 | 1.2e-232 | 53.72 | Pentatricopeptide repeat-containing protein OTP51, chloroplastic OS=Oryza sativa...
Q76C99 | 3.6e-11 | 23.75 | Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica OX=39946 GN=Rf1 PE=2 SV...
Q5G1S8 | 1.8e-10 | 22.01 | Pentatricopeptide repeat-containing protein At3g18110, chloroplastic OS=Arabidop...
P0C8Q6 | 5.2e-10 | 22.99 | Putative pentatricopeptide repeat-containing protein At5g08310, mitochondrial OS...
Match Name | E-value | Identity (%) | Description
A0A1S3CPK0 | 0.0e+00 | 90.59 | pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Cucumis ...
A0A0A0LBL0 | 0.0e+00 | 88.83 | LAGLIDADG_2 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G625100...
A0A6J1KB64 | 0.0e+00 | 86.78 | pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Cucurbit...
A0A6J1GB98 | 0.0e+00 | 86.65 | pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Cucurbit...
A0A5N6QQ61 | 0.0e+00 | 66.06 | LAGLIDADG_2 domain-containing protein OS=Carpinus fangiana OX=176857 GN=FH972_00...
Match Name | E-value | Identity (%) | Description
AT2G15820.1 | 5.3e-260 | 54.98 | endonucleases
AT2G35130.2 | 1.5e-12 | 21.02 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G18110.1 | 1.3e-11 | 22.01 | Pentatricopeptide repeat (PPR) superfamily protein
AT5G08310.1 | 3.7e-11 | 22.99 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G74850.1 | 4.8e-11 | 20.52 | plastid transcriptionally active 2
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 430..583, e-value: 2.0E-16, score: 61.9
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 772..857, e-value: 3.4E-12, score: 48.4; coord: 624..771, e-value: 1.6E-9, score: 39.6
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 782..810, e-value: 0.006, score: 16.7; coord: 818..845, e-value: 0.032, score: 14.5; coord: 685..706, e-value: 0.0066, score: 16.6
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 782..810, e-value: 5.1E-4, score: 18.0; coord: 818..849, e-value: 4.9E-4, score: 18.1; coord: 497..525, e-value: 6.6E-4, score: 17.7
IPR004860 | Homing endonuclease, LAGLIDADG | PFAM | PF03161 | LAGLIDADG_2 | coord: 881..1046, e-value: 9.7E-45, score: 152.5
IPR027434 | Homing endonuclease | GENE3D | 3.10.28.10 | Homing endonucleases | coord: 970..1066, e-value: 3.3E-11, score: 45.5
IPR027434 | Homing endonuclease | GENE3D | 3.10.28.10 | Homing endonucleases | coord: 858..968, e-value: 1.4E-17, score: 65.7
IPR027434 | Homing endonuclease | SUPERFAMILY | 55608 | Homing endonucleases | coord: 872..1058
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 9..39
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 9..29
None | No IPR available | PANTHER | PTHR47539 | PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN OTP51, CHLOROPLASTIC | coord: 310..1088
None | No IPR available | SUPERFAMILY | 81901 | HCP-like | coord: 654..848

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
HG10012104.1 | HG10012104.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0010239 | chloroplast mRNA processing
biological_process | GO:0000373 | Group II intron splicing
biological_process | GO:0045292 | mRNA cis splicing, via spliceosome
biological_process | GO:0090305 | nucleic acid phosphodiester bond hydrolysis
biological_process | GO:0048564 | photosystem I assembly
biological_process | GO:0006388 | tRNA splicing, via endonucleolytic cleavage and ligation
cellular_component | GO:0009507 | chloroplast
molecular_function | GO:0004519 | endonuclease activity
molecular_function | GO:0005515 | protein binding