HG10001152 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10001152
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionouter envelope protein 80, chloroplastic
LocationChr09: 14455725 .. 14465257 (-)
RNA-Seq ExpressionHG10001152
SyntenyHG10001152
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCCACCGAACGATGACATTGTTTTTACTTCGCGTGCTACGCTTAGAATTCCACACTTTCCTCCGTCTAGTTCGCATTCGAGTTTCAGATTTGGCTTCCGAAATTTAGCTTCACATTTCGATCAATCCTGTAACTCAATATTCCAGTTCATTGATTCGGTTAAGAGAGGTTCGAAATTGACTCATTTCAACCACAGTATTTCCCACTTCTGGCCACCAGCACTCCCCCTTTGTGCTTCAAAGAAGGTAACTCAGCAAGAGAATTCGGTCAATCGACGAGCAAGTTGGACTTGGGGGGCAATGTTTGTTGATAAATCTCCTCTAATTTGTTCGGCGTCAATGTCGCTAATTCAATCAGATGTGTCGAATAAGTCTGATTCTGGTACTTGTACGGAGCATCAAGGAAAGCGACGGGGGATGGAGGACAAAACTACGGGGTTGGTTGGAAAATCAACTCTTCTTTGTTCAGCTTCATTGGCTTTGACGCGTCCTGACGAGTTGAGTCAGTCTGTTGGTTCGGAGAGTAAGGAATTGCCTCAAAAGGGGTATTCAGCTGGTCGGCTGGATGAGGAGAGGGTTCTAATCAGTGAGGTTCTCGTGAGAAATAAGGACGGTGAGGAGCTAGAGAGGAAGGACCTCGAGCTGGAGGCCTTTATCGCCTTGAAGGCGAGCCGACCCAACTCTGCGCTCACTGTACGCGAGGTTCAAGAAGATGTGCACAGGATCATCAACAGTGGATATTTCTACTCGTGCATACCCGTTGCAGTTGATACCAGAGATGGAATAAGATTAATATTTCAGGTAAAATTGTTGAATCAGTTGTATAATATATGTGTGTGTATATATATTATAAATATATTTTATTTTTTTTTCCCATTGGGAAGGAAGGATTTTACGTCTGGAATGGTTTATGTTAAGTGAAAGGGGAGCTCGGCTGAAATTTCAAACAACTTATAAGTTTTAATGCAAGTTGCGATACCAAATTATGCACACTTGAGCTTGAAAAATAAAAAATCTCTTGAGAACCCTCTTTTGAAGCTACGGAGCGAGGGAATTTAAGCCCCATTTAGTAACAATTTTGTTTTTCGTTTTCTACTTTTGAAATTTATGTTTGTTTTCTCATCAATTCTATACTATGATTCTCACATTTGTTGAAGAAATACTTGAATTCTTAGTCAAATTCTAAAAACAAAAACAAGTTTTTAAAAACTACTTTATTTTCAAAAACTAAATGGTTAACAAACGGGACCTAAGTGATTAGTGCACTTTCCATGAGGAAAAATCATAATTATGAACAAGTTCCCTTTTACATTTTAGAAGAATATTTATTAATATTATTTTCTAATGGCACAAGGTAGAACCAAATCAAGAGTTTCAGGGGTTGGTCTGTGAAGGTGCTAATGTCCTTCCAGCAAAGTTTCTGGAGGAAGCTTTTCGTCATGGCTATGGTAGGATTTCCTACATAATTGTAGATTATCGCAATTCTATGAACCAATATAAAATATGCTCACAGACGACCATTATTTTGGATTCTTTGGTGTCAAGCTGTGCAGTAAAAGTTGTCAATCTCAGGCATTTAGATGAGGTAATATCATCCATTAATAGCTGGTATGGTGAACGTGGCCTTTTTGGCGGGGTTAGTTCTTTCTCTATATTCTTTTACAATTTCGTTAGTTAGTTTAAGTTGAAATTTAGATTGTTTTTGCCTCGTAGGGATTAACACATTGCAGACTTTAGGTGTGCCTGTTTTGTGATCACTATTGTTCTCATATATTGATACTTTGACTATATAACACAGGTATCAGCTGTAGATATTCTATCCGGGGGCATTCTTAGTTTACAAGTTTCTGAAGCTGAAGTAAATAACACTTCCATCCGATTTCTTGACAAAAAGACGTAAGAATATGTCTTTTGCATGATCTAAATTTGCACGCTAACAGCCATGATTCACTAATGCATACTGCAGCGGAACAACCTTACACGTATATGCCAATGTGGCGTGGACTTTATATCTTTTGAAAGTTTTGATTTCATAGTTTTTTGACTGGTGGTTATATTCAGTGGCGAGCCAATTTCAGGAAACACAAGACCTGAGACAATACTTCGGCAACTTACTACCAAGAAAGGACAGGTATTTGGTGCATGCTTGACGTGTCTAACTTTTATTTATAAAATATGATCATCGATCAATTTCTTTGTTGAGTTATGAGTTTTTCTTTATCACATTTAGTTTCTAAATTTAATGTTTGAAGTTCTTTATTTTATTGTTAGATTAAAATTTTAGTTTTTAGTTTATGTTTTAATGGTGTTTTTAATTTTGTAGGTTAGATTTATTAATTTAGATTCACCTTTGTACCTATATAAGGGTGATCTCCCCTGACATTTTTGAGAATCAAAATTTTAGTATCACATTTTTTATACAACATTTTGTTTATCACCATTTTTTAGTCTCATAATTTTAGTACAACATTTTTTCATCACATTTTGTGCATCACCAACATTTTTTACTTCAGTAAGTAGTTTGAATTTCACTCTCTCATTTTCTCTTTTTATTTAACAAACATTGAGATATTGATTTTCTTATGCAAATATTTATTTTTTCCATTGATTTCATTTTTTCAACAGTGGTATCAAAGCTACTTTATTTTTTCAACAATGGTATCAAAGCTGCTTTGATTTCTCAACAATTCTCCTTTTAGATTGGTGAAAGCCACTTCATGACGGTTGTTGTTAGAAAATAGTCGGGTTTTTGTGGGGGAAAATTGAATATTCTCTTCGTTTAGCATTTTGGTTAGCTTGGTGGGTCTGTTATCTGCCCCCTTCGTTCTTCTTTAGTCTTACCACCACCACCCAACCCCCCAAAAAAATAATGGTGTTACATGTCTTTATATTAGCATAATGACAATGGATGATTCATTTAACTGATCTCCAAGCTTGAATGTGCTGGTTTCTCAATCATATGTAGTCTTCAGTTTAAATATATTTAATATGGATACACTCTGCTTCAAGCTTGGAGTTATTTATTGCAGGTCTACAGTATGCTCCAAGGAAAAAGAGATGCAGAGACTGTCTTAACCATGGGGATCATGGAAGACGTCAGCATTATTCCCCAACCTGCAGCTGGTGACTAAAAATATTTTTCACTTTTACATGAACGATTAATATGTATTGTTTCTTTATTTCATAACTTAGGCCTCAGAGTTCCTGGCTGATTGCTAACTTGTTCTTGGTTTTGGATTATACTGGCCCAACTTAGACTTCTGATTTCATGGGAAGTCTTTTATGTATAAGATCCAGTGCATAAAATCAACTTAAACTTTTATGGGAAGACTCTTTTGTATTCTAGCTATATGCTCGATCTTATTTCTGTTGAGTGGAGCCTCTTTTTGCTTTCTATGTATTCTTTTTTGTACACCCTTATTTTTTTTCAATGAAAGCCGGATTCTTTTACCAAAAAAGGATGAAATTGAAGTTGACAATTGCTGGTTCATGTGTAATTTTCATTTTTATTGGACATGAATATTTAGTTGTGGTTAGGGGGTGTGAGGTGGCATCAGGTTTGAAGTTGATTATGGAGAAAATATATACTTGGGAAATTAATATCAACATAAAGGGAGACAGAGTAGGTGCCCTGGTTACTACTTGGGATTGTTCCTCAAGTGGTTTAACCAGTATGTATTTGGGCATGCTGTTGGAGGCAATCTGATTTGAGTGGCATTTTGGATAAAGTTGACCAAAAATTATTGAGGAAATTGGAGAAATGAAAGAACTAAACTCTTTCAAAACTCAAAGATCGGATGCGTGGCACTGGCACGTCGGCAAGAGGGAGAGAGAGAGAGAGAGAGAGAATCTAAGGATGTGGAGTCCAATAGTTATCTTCATAACTCAGAGAGCATAGTAATGTCAGATTCAAAGTTGTCTTTATGATCTATTTTTATTGTGCGGACTTATACATATTTTCAAAACATATATCCATTTCTCTTAATTCTTCTTTTATTTGGAATTGTCTAGATGCTGGAAAGGTTGACATACTGATGAACGTAGTAGAGCGTCCAAGTGGAGGTTTCTCTGCTGGTGGTGGTCTATCATGTGGGTAAGAATCAGATGAACAATTTTTCATCTTTGCTGATGCTTCCTCACAGTTGACTATATTTAGTATTTTCTGTTATCTTTATTTAATGTTGACTTTTGTCATGTTTTAAAATACAGGACAACCGGTGGTGGTGGCCCTTTAGCTGCCCTTATTGGAAGGTAAGTTTAAAGCTAAATATCCTGTTTTGGAACCCCCAACATCAATCCATGCATATAAACTTTCTTCTGGAGAGACATATTATCTACATGTTTCTTTTTTCTTCATGTACCACTGTTTTATCTCTCTATCATAGATTAATTTGCAACTACTTAGTTTTCTCGTGTCACACTTGCAGTAAGTTCCGCTGCCACCAAGTGCTGAGCATTTTAGCTTAGTAACAATATTATTCTTGTTGCAGCTTTGCATATTCCCATAGAAATCTTTTTGGGAGGAATCAGAAGCTCCATGTCTCCTTGGAGAGGGGACAAGTTGATTTCACTTTTCGCATAAATTACACAGACCCATGGATTGAAGGAGATGATAAGAGGACATCACGAACAATAATGGTTCAGGTAATCTTTGATTTGGTGTGTATTTCTATTTACGATATTTTTCAGTGAGAACCTGATCAAGATCCTTGGGTTCTTTTCTTCAGAACTCAAGGACTCCAGGTACACTTGTCCATGGTGGTAGCAGCATCTTGACTATTGGTCGGGTTACAGCTGGTTTGGAATTCAACCGCCCTATCAGGCCTTCGTGGAGTGGAACGGGAGGACTTTATTTCCAGGTACTTTAGTAGAACTGAATACATGCTTCTGGATATCTTTGCATATGTACTGCAGGAAAGTGATTCATCCCCCCCGCCCCCCCCCCCCCCCCCCCCCCCAAAAAAAAAAAAAAAATAATAATATAAATAAAGCACTCTCTCTCTCTCTACCGTCTTTCCTTTAGTGTTTTCTTTGCATAACTAACTTCAACTCTCTATGCTATTCGGTTGCTAACTTTGTAATAGACCCAAATATATTGAGCTATAATCGGCGACCAAACTATAAGATAAACCAAGTAATTCGAGGTACCTTGACCATCCCAATCTTAAAATCTCTCTCAAGCTCTAGTGCACTCGCCACAATATTGCCTATGTTCTTCATTCTCCTCCCCCTCTATTTATAACCAACAAGTATAACAAACTCCCTAACTAACTACTAATATACCCTTAATACTCTAATGACTCCCTAATGCCATTCCTATCACTTTCTAGCTAAGAAGTACTAGACTTCTTATGCTCAATGTTTTAAGGCCCTCCCAGGTGCTAGGCACAGGTCTTATGTGAAGTCTTAAGGCCTAACCTAAAGCTTTGAGTCTTAAGGCTTTTTCATTATTTAAAAAATATTAATAATTAGGTTAAAAATCACTTTTAGTCCCTAAACTTTTATAAAAGTAACAATTTAGTCCCTGACATTTGGTTTGTAACGATTTAGTCCCTAGACTTTCAATTTTGTAACAATTTAGTCCCTAAACTTTACTATGTAACAATTTAGTCCCTATCGTAAAAATTAGTGTTAAGATTTAATGAGATTTTTTGCATAAATAAACCGATAAACAAATTAAAAACTCAATATTTTTATATAATACAAAGTCTACATCATAAATTAATAAAAATTGGCATCTAATTTTGATGAATTTTTTTACGTTAGGAACTACATTGTTACAAATTTGAAAGTTCAAGGACCAAATTGTTACATACAAAATTGAAAGTATAGGACTAAATTGTTACAAACCAAAGTTCAGGGACTAAATTGTTACTTTCATGAGAGTTTAGGGACCAAAAGTATTTTTACTCGTTTATTTTTACTGATCCATAGTACTATGGTTTTTATGATTATTACTCGGTATTATTGTATTTTGAAGCTTTGTATTTGGAGCATTAGTCTCTTTTCATTAAATCAATGAAAAGTTCGTTTCTGTTTCAAAAAAATAATCTAATAATTATCAAGGGTTTTCCTTATATATTATAGAAATAAAATTTACTAAGCTTATATGCAAAATGCAAAATTCCTTGTGTTTAAGTTTTTTTTTTTTTTTTTCCTTCCATATTCCCACTTCAACCATATTTTCTCTTTTTTATGTAGTATTTTTATATATAGTGCGGCTCACAAAAAGATGCCCGCGTTTCTTTTGCATGTTGCGCTTAAGCCTCATAAGACTATTGTGTTTTTTTGTGTCCTTGAGCTTTAAAAACACTACTTATGGCTGAAGCATGTGATTTGAGGTGAAAAACATTTCGGCAAATTCTAGGTTGTGCATGATATGGATGGAATTATTGAACCTCCTGAAAGTTATTTATAAAATTTTGTCAACAGTCGGATCAAAAGAATTATTTGGCTTACTATGCAAATTAGCCAAGATATCCGTAGTTTCTGCTTTAATGTGCCATTGTTCAGTTCTTTTCCTCTATTGAAGTATTTTAGGAAGATTTCATAATATTTTTGCCTAATCTTTGTTTACAGCGTGCTGGTGCTCGAGATGAACAAGGAAATCCTATCATAATGGATACCATCAAATGTCCTCTCACTGCAAGGTCTGTTCCTTTGCAATTTTCATTTATGCATCACTTTTAAGGAATTTGATGCTAGGTGCAAGTTAATTGTGTTAAACATTGTCTCTTATTGATGTTAGTGGCAATGCTGACGACAATATGTTACTGGCTAAACTTGAGGGTGTCTACACTGGTTCTGGGGACCATGGATCTTCCATGGTTAGTTACTTCCATCATCTCAACTTATGCTCCAACTATGATAATCCCTCATAACTTTTTTCCGTTCAGTTTGTTCTTAGCATGGAACAGGGGCTTCCCTTTTTGCCTGAATGGTTATGTTTTAATAGAGTGAATGCTCGTGCCAGAGCTGGCATGAAAGTTGGTCCCTCTCAGCTTCTTTTGAGGTATGGTGCCTAGTTATGAAGTGCAGGGATTGTTGTAAGTTGAAAAATCATACTGGGAGGATTACTTTGAATGCTGCCCTTAATTCCTTTCTTCTTTTTGGTTTGAAATGAATCATGGGATATGATCGTGCCTGTAAATATTTTGAATTTGCTCTCTTTGATTTTTGTGGTACATCCCTTCAAAGGTGACAGGAAAACCCTATGGTGTGCCTTTAATCGATCCTTCTTTTGGTCACTTTGGTGTGAAAGGAATGGATGAATCTTCACCATAGGCACTTAATATTTGATGACTTTAGGGATTAGTTTATCTTTAGTCGCTCCTTTTGCTCACTTTGGTGTGAAAGGCATGGATGAATCTTGACGATACGCACTTAATATTTCATGACTTTAGGGATTAGGTTATTTTTAATGCTTTGTCTTGTATAAATGTTCACACCTCTTTGAGGATTATAGTCTTCTTTAATTTATAATTGAAATTGACAATCCATTCTTTGCAATCCATCATTTAGTGGTATGTGGTTATTTCATTTATCAATGAAATTCTCAATTTCTTAAGAAAAAGAAAGAAAGAATAATCATCAAATTGATTTTACTTGGCATGGCTGATATTTTTTTGGTCCTCGTTTCTAGTTTGTCGGAAGGTCATGTAGTTGGCAAATTCTGCCCACATGAAGCATTTGCCATTGGGGGAACTAACAGTGTAAGAGGATACGAGGAAGGTGCTGTAGGCTCTGGTCGATCTTATACTGTTGGATGTGGAGAGATCTCTTTTCCCTTGGTATGCAGGGAAATCGTACTTGATTGCATTATGTTATATGCTTCATATTTATTTTAGTCTTTCCATGAGTTCATTATAAAGGTTTGTTTGATGCTATAAATGGTCTTACATGACAGCTGCATTGCTTTGTGAAGGATTCTGGCATGTGTTATGGAGCTAGTTGAGCTATTTAAATTAGATAATGTCGAAGCTGAATGGTGACTACAAATCATAGAAACTTTTAATCTATCTCGGCTTCATGCTCTCTTTCAGAGGAGCGAAATTGGATGGATTTTGAATTAGCAATTTACATGAACAAATTAGTATTTGAGGGGGGGATTTGAAGTGGACGATTGATTTGAATAGTCCAAGTTGGGGTTAATGGTAATTAATTACCAATTCCATTTTTCAACTTTACCTTAAGATCAAAGTTATGCATTTAGTTGTCTAGAAAACCATGCTGCTTGTAAGATAGTCGAGCTATGGAGAGCCATGGTTTAAAAGAAATGTATCTGATTGGGCAGAAAGTGAAATTACACCTTGCTATAATTAACCCTCGTTGATTGGTTCAGCCTTTAGGCTTTAAGTTCATGGGTTTAATATTGAGTCTTTCAATCCTCATTGAATCAAGGAACGTTGAATTGAAGAGGCATAAAACGATTGACTTGAGATTTGAGAATGGTTGAAAGGGTAAAAAAATTAATGAAATAAGGTTCTTTTTTCTTTTCTTTTTTTTTTTTCAGTAAGAAATTAATGAAAGAAGGCTTGAAAAGTTAACTCAGTTTCTTGGTATTCTTGCTGATCAAATACCTAGAATTTTCTTGCTAATCTCTGCTTCTGAATTATGATTATGAATTTTAGAATTTATGTCTTGAGCATACTTGTCGTGGCCGGTGGGCATGGATAATTTTTTTTAATCCTTTTATTTATCCTCGTTTTCTTTTCCTCATTTGTAGTTTGCCCCTGTAGAAGGAGTCATTTTTGCTGACTATGGAACTGATCTTGGATCAGGCTCATCTGTTCCAGGTTGGTCTGCTACACTTCTTCCGTTGTGTTGGACCTTTTGTATGTTTTATTCATTTTCTTGCTTTCTTTATTTAACTGGTATGTGGTAACATGATAGGCCCCAAGTCACCATATTTTTATACATGTAGGAAAGAAATGAACGGGGAGGTGGGTGATGTTTTACATGAGGAGTGAGCATGTAATTGCTATAATATTTTGGGCGTATGCTGCTTTGATTTTGTGGGCTCGATTACTATCCTATTATAGTTTTTCTCGAATCTCCCTTGGCTGTTGCCATCATTCATAAATAAATCGAGATTTATGATACTGGAATTTTTTGCCTCGAACTAAAATTCGTGCACAGGCACAACAGCAAACATGCATATGGGGTATTCATGTTGTCCTAGAAAAGATTACCTTGAATCGAATATTTAGCATTCTAGACTTGATATTCTTGATGTGATGGCTTATTAGTTAGACTGGTATTGATGAAGCAATGTATGTGGTTTTTTATGCTTTGGTGCTGTCAACAGCCTATTATCTTGACGGCAGTAGAATGTCTCTGCAGTAGTTTGAATGAAAAGCTTGTGTTTTCCTTGATCAGGTGATCCTGCGGGGGCTAGGATGAAACCTGGAAGTGGATATGGGTATGGGTTTGGCATCCGTCTGGACTCCCCTTTGGGGCCTCTTCGTCTAGAATATGCATTCAACGACAAAGGTGCAAAAAGGTTTCACTTTGGCGTTGGCCACCGGAATTAA

mRNA sequence

ATGCCACCGAACGATGACATTGTTTTTACTTCGCGTGCTACGCTTAGAATTCCACACTTTCCTCCGTCTAGTTCGCATTCGAGTTTCAGATTTGGCTTCCGAAATTTAGCTTCACATTTCGATCAATCCTGTAACTCAATATTCCAGTTCATTGATTCGGTTAAGAGAGGTTCGAAATTGACTCATTTCAACCACAGTATTTCCCACTTCTGGCCACCAGCACTCCCCCTTTGTGCTTCAAAGAAGGTAACTCAGCAAGAGAATTCGGTCAATCGACGAGCAAGTTGGACTTGGGGGGCAATGTTTGTTGATAAATCTCCTCTAATTTGTTCGGCGTCAATGTCGCTAATTCAATCAGATGTGTCGAATAAGTCTGATTCTGGTACTTGTACGGAGCATCAAGGAAAGCGACGGGGGATGGAGGACAAAACTACGGGGTTGGTTGGAAAATCAACTCTTCTTTGTTCAGCTTCATTGGCTTTGACGCGTCCTGACGAGTTGAGTCAGTCTGTTGGTTCGGAGAGTAAGGAATTGCCTCAAAAGGGGTATTCAGCTGGTCGGCTGGATGAGGAGAGGGTTCTAATCAGTGAGGTTCTCGTGAGAAATAAGGACGGTGAGGAGCTAGAGAGGAAGGACCTCGAGCTGGAGGCCTTTATCGCCTTGAAGGCGAGCCGACCCAACTCTGCGCTCACTGTACGCGAGGTTCAAGAAGATGTGCACAGGATCATCAACAGTGGATATTTCTACTCGTGCATACCCGTTGCAGTTGATACCAGAGATGGAATAAGATTAATATTTCAGGTAGAACCAAATCAAGAGTTTCAGGGGTTGGTCTGTGAAGGTGCTAATGTCCTTCCAGCAAAGTTTCTGGAGGAAGCTTTTCGTCATGGCTATGTAAAAGTTGTCAATCTCAGGCATTTAGATGAGGTAATATCATCCATTAATAGCTGGTATGGTGAACGTGGCCTTTTTGGCGGGGTATCAGCTGTAGATATTCTATCCGGGGGCATTCTTAGTTTACAAGTTTCTGAAGCTGAAGTAAATAACACTTCCATCCGATTTCTTGACAAAAAGACTGGCGAGCCAATTTCAGGAAACACAAGACCTGAGACAATACTTCGGCAACTTACTACCAAGAAAGGACAGGTCTACAGTATGCTCCAAGGAAAAAGAGATGCAGAGACTGTCTTAACCATGGGGATCATGGAAGACGTCAGCATTATTCCCCAACCTGCAGCTGATGCTGGAAAGGTTGACATACTGATGAACGTAGTAGAGCGTCCAAGTGGAGGTTTCTCTGCTGGTGGTGGTCTATCATGTGGGACAACCGGTGGTGGTGGCCCTTTAGCTGCCCTTATTGGAAGCTTTGCATATTCCCATAGAAATCTTTTTGGGAGGAATCAGAAGCTCCATGTCTCCTTGGAGAGGGGACAAGTTGATTTCACTTTTCGCATAAATTACACAGACCCATGGATTGAAGGAGATGATAAGAGGACATCACGAACAATAATGGTTCAGAACTCAAGGACTCCAGGTACACTTGTCCATGGTGGTAGCAGCATCTTGACTATTGGTCGGGTTACAGCTGGTTTGGAATTCAACCGCCCTATCAGGCCTTCGTGGAGTGGAACGGGAGGACTTTATTTCCAGCGTGCTGGTGCTCGAGATGAACAAGGAAATCCTATCATAATGGATACCATCAAATGTCCTCTCACTGCAAGTGGCAATGCTGACGACAATATGTTACTGGCTAAACTTGAGGGTGTCTACACTGGTTCTGGGGACCATGGATCTTCCATGTTTGTTCTTAGCATGGAACAGGGGCTTCCCTTTTTGCCTGAATGGTTATGTTTTAATAGAGTGAATGCTCGTGCCAGAGCTGGCATGAAAGTTGGTCCCTCTCAGCTTCTTTTGAGTTTGTCGGAAGGTCATGTAGTTGGCAAATTCTGCCCACATGAAGCATTTGCCATTGGGGGAACTAACAGTGTAAGAGGATACGAGGAAGGTGCTGTAGGCTCTGGTCGATCTTATACTGTTGGATGTGGAGAGATCTCTTTTCCCTTGTTTGCCCCTGTAGAAGGAGTCATTTTTGCTGACTATGGAACTGATCTTGGATCAGGCTCATCTGTTCCAGGTGATCCTGCGGGGGCTAGGATGAAACCTGGAAGTGGATATGGGTATGGGTTTGGCATCCGTCTGGACTCCCCTTTGGGGCCTCTTCGTCTAGAATATGCATTCAACGACAAAGGTGCAAAAAGGTTTCACTTTGGCGTTGGCCACCGGAATTAA

Coding sequence (CDS)

ATGCCACCGAACGATGACATTGTTTTTACTTCGCGTGCTACGCTTAGAATTCCACACTTTCCTCCGTCTAGTTCGCATTCGAGTTTCAGATTTGGCTTCCGAAATTTAGCTTCACATTTCGATCAATCCTGTAACTCAATATTCCAGTTCATTGATTCGGTTAAGAGAGGTTCGAAATTGACTCATTTCAACCACAGTATTTCCCACTTCTGGCCACCAGCACTCCCCCTTTGTGCTTCAAAGAAGGTAACTCAGCAAGAGAATTCGGTCAATCGACGAGCAAGTTGGACTTGGGGGGCAATGTTTGTTGATAAATCTCCTCTAATTTGTTCGGCGTCAATGTCGCTAATTCAATCAGATGTGTCGAATAAGTCTGATTCTGGTACTTGTACGGAGCATCAAGGAAAGCGACGGGGGATGGAGGACAAAACTACGGGGTTGGTTGGAAAATCAACTCTTCTTTGTTCAGCTTCATTGGCTTTGACGCGTCCTGACGAGTTGAGTCAGTCTGTTGGTTCGGAGAGTAAGGAATTGCCTCAAAAGGGGTATTCAGCTGGTCGGCTGGATGAGGAGAGGGTTCTAATCAGTGAGGTTCTCGTGAGAAATAAGGACGGTGAGGAGCTAGAGAGGAAGGACCTCGAGCTGGAGGCCTTTATCGCCTTGAAGGCGAGCCGACCCAACTCTGCGCTCACTGTACGCGAGGTTCAAGAAGATGTGCACAGGATCATCAACAGTGGATATTTCTACTCGTGCATACCCGTTGCAGTTGATACCAGAGATGGAATAAGATTAATATTTCAGGTAGAACCAAATCAAGAGTTTCAGGGGTTGGTCTGTGAAGGTGCTAATGTCCTTCCAGCAAAGTTTCTGGAGGAAGCTTTTCGTCATGGCTATGTAAAAGTTGTCAATCTCAGGCATTTAGATGAGGTAATATCATCCATTAATAGCTGGTATGGTGAACGTGGCCTTTTTGGCGGGGTATCAGCTGTAGATATTCTATCCGGGGGCATTCTTAGTTTACAAGTTTCTGAAGCTGAAGTAAATAACACTTCCATCCGATTTCTTGACAAAAAGACTGGCGAGCCAATTTCAGGAAACACAAGACCTGAGACAATACTTCGGCAACTTACTACCAAGAAAGGACAGGTCTACAGTATGCTCCAAGGAAAAAGAGATGCAGAGACTGTCTTAACCATGGGGATCATGGAAGACGTCAGCATTATTCCCCAACCTGCAGCTGATGCTGGAAAGGTTGACATACTGATGAACGTAGTAGAGCGTCCAAGTGGAGGTTTCTCTGCTGGTGGTGGTCTATCATGTGGGACAACCGGTGGTGGTGGCCCTTTAGCTGCCCTTATTGGAAGCTTTGCATATTCCCATAGAAATCTTTTTGGGAGGAATCAGAAGCTCCATGTCTCCTTGGAGAGGGGACAAGTTGATTTCACTTTTCGCATAAATTACACAGACCCATGGATTGAAGGAGATGATAAGAGGACATCACGAACAATAATGGTTCAGAACTCAAGGACTCCAGGTACACTTGTCCATGGTGGTAGCAGCATCTTGACTATTGGTCGGGTTACAGCTGGTTTGGAATTCAACCGCCCTATCAGGCCTTCGTGGAGTGGAACGGGAGGACTTTATTTCCAGCGTGCTGGTGCTCGAGATGAACAAGGAAATCCTATCATAATGGATACCATCAAATGTCCTCTCACTGCAAGTGGCAATGCTGACGACAATATGTTACTGGCTAAACTTGAGGGTGTCTACACTGGTTCTGGGGACCATGGATCTTCCATGTTTGTTCTTAGCATGGAACAGGGGCTTCCCTTTTTGCCTGAATGGTTATGTTTTAATAGAGTGAATGCTCGTGCCAGAGCTGGCATGAAAGTTGGTCCCTCTCAGCTTCTTTTGAGTTTGTCGGAAGGTCATGTAGTTGGCAAATTCTGCCCACATGAAGCATTTGCCATTGGGGGAACTAACAGTGTAAGAGGATACGAGGAAGGTGCTGTAGGCTCTGGTCGATCTTATACTGTTGGATGTGGAGAGATCTCTTTTCCCTTGTTTGCCCCTGTAGAAGGAGTCATTTTTGCTGACTATGGAACTGATCTTGGATCAGGCTCATCTGTTCCAGGTGATCCTGCGGGGGCTAGGATGAAACCTGGAAGTGGATATGGGTATGGGTTTGGCATCCGTCTGGACTCCCCTTTGGGGCCTCTTCGTCTAGAATATGCATTCAACGACAAAGGTGCAAAAAGGTTTCACTTTGGCGTTGGCCACCGGAATTAA

Protein sequence

MPPNDDIVFTSRATLRIPHFPPSSSHSSFRFGFRNLASHFDQSCNSIFQFIDSVKRGSKLTHFNHSISHFWPPALPLCASKKVTQQENSVNRRASWTWGAMFVDKSPLICSASMSLIQSDVSNKSDSGTCTEHQGKRRGMEDKTTGLVGKSTLLCSASLALTRPDELSQSVGSESKELPQKGYSAGRLDEERVLISEVLVRNKDGEELERKDLELEAFIALKASRPNSALTVREVQEDVHRIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFRHGYVKVVNLRHLDEVISSINSWYGERGLFGGVSAVDILSGGILSLQVSEAEVNNTSIRFLDKKTGEPISGNTRPETILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVDILMNVVERPSGGFSAGGGLSCGTTGGGGPLAALIGSFAYSHRNLFGRNQKLHVSLERGQVDFTFRINYTDPWIEGDDKRTSRTIMVQNSRTPGTLVHGGSSILTIGRVTAGLEFNRPIRPSWSGTGGLYFQRAGARDEQGNPIIMDTIKCPLTASGNADDNMLLAKLEGVYTGSGDHGSSMFVLSMEQGLPFLPEWLCFNRVNARARAGMKVGPSQLLLSLSEGHVVGKFCPHEAFAIGGTNSVRGYEEGAVGSGRSYTVGCGEISFPLFAPVEGVIFADYGTDLGSGSSVPGDPAGARMKPGSGYGYGFGIRLDSPLGPLRLEYAFNDKGAKRFHFGVGHRN
Homology
BLAST of HG10001152 vs. NCBI nr
Match: XP_038901490.1 (outer envelope protein 80, chloroplastic [Benincasa hispida])

HSP 1 Score: 1448.7 bits (3749), Expect = 0.0e+00
Identity = 723/762 (94.88%), Postives = 736/762 (96.59%), Query Frame = 0

Query: 1   MPPNDDIVFTSRATLRIPHFPPSSSHSSFRFGFRNLASHFDQSCNSIFQFIDSVKRGSKL 60
           MPPNDDIVFTSRATLRIPHFPPSSSHSSFRF FRNLAS FDQSCNSIFQFIDSVK+GSKL
Sbjct: 1   MPPNDDIVFTSRATLRIPHFPPSSSHSSFRFCFRNLASQFDQSCNSIFQFIDSVKKGSKL 60

Query: 61  THFNHSISHFWPPALPLCASKKVTQQENSVNRRASWTWGAMFVDKSPLICSASMSLIQSD 120
           THFNHSI H WPPALP  ASKK+TQQENSVNRRASWTWGAMFV KSPLICSASMSLIQS 
Sbjct: 61  THFNHSIPHLWPPALPFGASKKLTQQENSVNRRASWTWGAMFVSKSPLICSASMSLIQSG 120

Query: 121 VSNKSDSGTCTEHQGKRRGMEDKTTGLVGKSTLLCSASLALTRPDELSQSVGSESKELPQ 180
           VSNKSDSG CTEH G+RRGMEDK+TG VGKS+LLCSASLALTR DE SQS GSESKELPQ
Sbjct: 121 VSNKSDSGHCTEHSGRRRGMEDKSTGFVGKSSLLCSASLALTRSDESSQSGGSESKELPQ 180

Query: 181 KGYSAGRLDEERVLISEVLVRNKDGEELERKDLELEAFIALKASRPNSALTVREVQEDVH 240
           KGYSAGRLDEERVLISEVLVRNKDGEELERKDLE+E   ALKASRPNSALTVREVQEDVH
Sbjct: 181 KGYSAGRLDEERVLISEVLVRNKDGEELERKDLEMEVLTALKASRPNSALTVREVQEDVH 240

Query: 241 RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFRHGYVK 300
           RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEF+GLVCEGANVLPAKFLEEAFR GY K
Sbjct: 241 RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFRGLVCEGANVLPAKFLEEAFREGYGK 300

Query: 301 VVNLRHLDEVISSINSWYGERGLFGGVSAVDILSGGILSLQVSEAEVNNTSIRFLDKKTG 360
           VVNLRHLD+VISSINSWYGERGLFG VSAVDILSGGILSLQVSEAEVNN SIRFLDKKTG
Sbjct: 301 VVNLRHLDDVISSINSWYGERGLFGRVSAVDILSGGILSLQVSEAEVNNISIRFLDKKTG 360

Query: 361 EPISGNTRPETILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVDI 420
           EPISGNTRPETILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVDI
Sbjct: 361 EPISGNTRPETILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVDI 420

Query: 421 LMNVVERPSGGFSAGGGLSCGTTGGGGPLAALIGSFAYSHRNLFGRNQKLHVSLERGQVD 480
           LMNVVERPSGGFSAGGGLSCGTTGGGGPLAALIGSFAYSHRNLFGRNQKLHVSLERGQVD
Sbjct: 421 LMNVVERPSGGFSAGGGLSCGTTGGGGPLAALIGSFAYSHRNLFGRNQKLHVSLERGQVD 480

Query: 481 FTFRINYTDPWIEGDDKRTSRTIMVQNSRTPGTLVHGGSSILTIGRVTAGLEFNRPIRPS 540
           FTFRINYTDPWIEGDDKRTSRTIMVQNSRTPGTLVHGGS+ LTIGRVTAGLEFNRPIRP+
Sbjct: 481 FTFRINYTDPWIEGDDKRTSRTIMVQNSRTPGTLVHGGSN-LTIGRVTAGLEFNRPIRPT 540

Query: 541 WSGTGGLYFQRAGARDEQGNPIIMDTIKCPLTASGNADDNMLLAKLEGVYTGSGDHGSSM 600
           WSGTGGLYFQRAGARDEQGNPIIMD IKCPLTASGNADDNMLLAKLEGVYTGSGDHGSSM
Sbjct: 541 WSGTGGLYFQRAGARDEQGNPIIMDNIKCPLTASGNADDNMLLAKLEGVYTGSGDHGSSM 600

Query: 601 FVLSMEQGLPFLPEWLCFNRVNARARAGMKVGPSQLLLSLSEGHVVGKFCPHEAFAIGGT 660
           FVLSMEQGLPFLPEWLCFNRVNARAR  M+VGPSQLLLSLS GHVVGKFCPHEAFAIGGT
Sbjct: 601 FVLSMEQGLPFLPEWLCFNRVNARARTSMEVGPSQLLLSLSGGHVVGKFCPHEAFAIGGT 660

Query: 661 NSVRGYEEGAVGSGRSYTVGCGEISFPLFAPVEGVIFADYGTDLGSGSSVPGDPAGARMK 720
           NSVRGYEEGAVGSGRSYTVGCGEISFPLFAPVEGV+FADYGTDLGSGS+VPGDPAGARMK
Sbjct: 661 NSVRGYEEGAVGSGRSYTVGCGEISFPLFAPVEGVLFADYGTDLGSGSTVPGDPAGARMK 720

Query: 721 PGSGYGYGFGIRLDSPLGPLRLEYAFNDKGAKRFHFGVGHRN 763
           PGSGYGYGFGIRLDSPLGPLRLEYAFNDKGAKRFHFGVGHRN
Sbjct: 721 PGSGYGYGFGIRLDSPLGPLRLEYAFNDKGAKRFHFGVGHRN 761

BLAST of HG10001152 vs. NCBI nr
Match: XP_023532612.1 (outer envelope protein 80, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1397.5 bits (3616), Expect = 0.0e+00
Identity = 698/762 (91.60%), Postives = 724/762 (95.01%), Query Frame = 0

Query: 1   MPPNDDIVFTSRATLRIPHFPPSSSHSSFRFGFRNLASHFDQSCNSIFQFIDSVKRGSKL 60
           MPPNDDIVFTSRATLR+PH PPS+SHSSFRF FRN+AS FDQSC+SIFQFIDSVKRGSKL
Sbjct: 1   MPPNDDIVFTSRATLRVPHLPPSNSHSSFRFCFRNMASQFDQSCHSIFQFIDSVKRGSKL 60

Query: 61  THFNHSISHFWPPALPLCASKKVTQQENSVNRRASWTWGAMFVDKSPLICSASMSLIQSD 120
           T+FNHSI HFWPP+LP   SKK+TQ+ NSV+R AS + GAMFVDKS LICSAS+SLIQSD
Sbjct: 61  TNFNHSIPHFWPPSLPFWGSKKITQRGNSVSRWASGSCGAMFVDKSSLICSASLSLIQSD 120

Query: 121 VSNKSDSGTCTEHQGKRRGMEDKTTGLVGKSTLLCSASLALTRPDELSQSVGSESKELPQ 180
            SNK++SGT  EH G++RGM+DK+TGLVGKS+LLCSASLAL R DE SQ  GSESKE PQ
Sbjct: 121 GSNKAESGTRAEHSGRQRGMDDKSTGLVGKSSLLCSASLALARSDEASQPSGSESKESPQ 180

Query: 181 KGYSAGRLDEERVLISEVLVRNKDGEELERKDLELEAFIALKASRPNSALTVREVQEDVH 240
           KGYSAGR DEERVLISEVLVRNKDGEELERKDLE+E   ALKASRPNSALTVREVQEDVH
Sbjct: 181 KGYSAGRPDEERVLISEVLVRNKDGEELERKDLEMEVLTALKASRPNSALTVREVQEDVH 240

Query: 241 RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFRHGYVK 300
           RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFR GY K
Sbjct: 241 RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFRDGYGK 300

Query: 301 VVNLRHLDEVISSINSWYGERGLFGGVSAVDILSGGILSLQVSEAEVNNTSIRFLDKKTG 360
           VVNLRHLDEVISSINSWYGERGLFG VSAVDILSGGILSLQVSEAEVNN SIRFLDKKTG
Sbjct: 301 VVNLRHLDEVISSINSWYGERGLFGRVSAVDILSGGILSLQVSEAEVNNISIRFLDKKTG 360

Query: 361 EPISGNTRPETILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVDI 420
           EPISGNTRP+TILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVD+
Sbjct: 361 EPISGNTRPDTILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVDL 420

Query: 421 LMNVVERPSGGFSAGGGLSCGTTGGGGPLAALIGSFAYSHRNLFGRNQKLHVSLERGQVD 480
           LMNVVERPSGGFSAGGGLSCGTTGG GPLAALIGSFAYSHRNLFGRNQKLHVSLERGQVD
Sbjct: 421 LMNVVERPSGGFSAGGGLSCGTTGGAGPLAALIGSFAYSHRNLFGRNQKLHVSLERGQVD 480

Query: 481 FTFRINYTDPWIEGDDKRTSRTIMVQNSRTPGTLVHGGSSILTIGRVTAGLEFNRPIRPS 540
           FTFRINYTDPWIEGDDKRTSRTIMVQNSRTPGTLVHGGS+ LTIGRVTAGLEFNRPIRP 
Sbjct: 481 FTFRINYTDPWIEGDDKRTSRTIMVQNSRTPGTLVHGGSN-LTIGRVTAGLEFNRPIRPK 540

Query: 541 WSGTGGLYFQRAGARDEQGNPIIMDTIKCPLTASGNADDNMLLAKLEGVYTGSGDHGSSM 600
           WSGTGGLYFQRAGARDEQGNPIIMDTIKCPLTASGNADDNMLLAKLE VYTGSGDHGSSM
Sbjct: 541 WSGTGGLYFQRAGARDEQGNPIIMDTIKCPLTASGNADDNMLLAKLESVYTGSGDHGSSM 600

Query: 601 FVLSMEQGLPFLPEWLCFNRVNARARAGMKVGPSQLLLSLSEGHVVGKFCPHEAFAIGGT 660
           FVLSMEQGLPFLPEWLCFNRVNARAR G++VGPSQ LLSLS GHVVGKFCPHEAFAIGGT
Sbjct: 601 FVLSMEQGLPFLPEWLCFNRVNARARTGVEVGPSQFLLSLSGGHVVGKFCPHEAFAIGGT 660

Query: 661 NSVRGYEEGAVGSGRSYTVGCGEISFPLFAPVEGVIFADYGTDLGSGSSVPGDPAGARMK 720
           NSVRGYEEGAVGSGRSY VGCGEISFPLFAPVEGV+FADYGTDLGSGS+VPGDPAGARMK
Sbjct: 661 NSVRGYEEGAVGSGRSYAVGCGEISFPLFAPVEGVLFADYGTDLGSGSTVPGDPAGARMK 720

Query: 721 PGSGYGYGFGIRLDSPLGPLRLEYAFNDKGAKRFHFGVGHRN 763
           PGSGYGYGFGIRLDSPLGPLRLEYAFNDKGAKRFHFGVGHRN
Sbjct: 721 PGSGYGYGFGIRLDSPLGPLRLEYAFNDKGAKRFHFGVGHRN 761

BLAST of HG10001152 vs. NCBI nr
Match: XP_022957959.1 (outer envelope protein 80, chloroplastic [Cucurbita moschata])

HSP 1 Score: 1390.2 bits (3597), Expect = 0.0e+00
Identity = 694/762 (91.08%), Postives = 720/762 (94.49%), Query Frame = 0

Query: 1   MPPNDDIVFTSRATLRIPHFPPSSSHSSFRFGFRNLASHFDQSCNSIFQFIDSVKRGSKL 60
           MPPNDDIVFTSRATLR+PH PP +SHSSFRF FRN+AS FDQSC+SIFQFIDSVKRGSK 
Sbjct: 1   MPPNDDIVFTSRATLRVPHLPPPNSHSSFRFCFRNMASQFDQSCHSIFQFIDSVKRGSKF 60

Query: 61  THFNHSISHFWPPALPLCASKKVTQQENSVNRRASWTWGAMFVDKSPLICSASMSLIQSD 120
           T+FNHSI HFWPP+LP   SKK+TQ+ NSV+R AS + GAMFVDKS LICSAS+SLIQSD
Sbjct: 61  TNFNHSIPHFWPPSLPFWGSKKITQRGNSVSRWASGSCGAMFVDKSSLICSASLSLIQSD 120

Query: 121 VSNKSDSGTCTEHQGKRRGMEDKTTGLVGKSTLLCSASLALTRPDELSQSVGSESKELPQ 180
            SNK++SGT  EH G++RGM+DK+TGLVGKS+LLCSASLAL R DE SQ  GSESKE PQ
Sbjct: 121 GSNKAESGTRAEHSGRQRGMDDKSTGLVGKSSLLCSASLALARSDEASQPSGSESKESPQ 180

Query: 181 KGYSAGRLDEERVLISEVLVRNKDGEELERKDLELEAFIALKASRPNSALTVREVQEDVH 240
           KGYSAGR DEERVLISEVLVRNKDGEELERKDLE+E   ALKASRPNSALTVREVQEDVH
Sbjct: 181 KGYSAGRPDEERVLISEVLVRNKDGEELERKDLEMEVLTALKASRPNSALTVREVQEDVH 240

Query: 241 RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFRHGYVK 300
           RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFR GY K
Sbjct: 241 RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFRDGYGK 300

Query: 301 VVNLRHLDEVISSINSWYGERGLFGGVSAVDILSGGILSLQVSEAEVNNTSIRFLDKKTG 360
           VVNLRHLDEVISSINSWYGERGLFG VSAVDILSGGILSLQVSEAEVNN SIRFLDKKTG
Sbjct: 301 VVNLRHLDEVISSINSWYGERGLFGRVSAVDILSGGILSLQVSEAEVNNISIRFLDKKTG 360

Query: 361 EPISGNTRPETILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVDI 420
           EPISGNTRP+TILRQLTTKKGQVYSM QGKRDAETVLTMGIMEDVSIIPQPAADAGKVD+
Sbjct: 361 EPISGNTRPDTILRQLTTKKGQVYSMFQGKRDAETVLTMGIMEDVSIIPQPAADAGKVDL 420

Query: 421 LMNVVERPSGGFSAGGGLSCGTTGGGGPLAALIGSFAYSHRNLFGRNQKLHVSLERGQVD 480
           LMNVVERPSGGFSAGGGLSCGTTGG GPLAALIGSFAYSHRNLFGRNQKLHVSLERGQVD
Sbjct: 421 LMNVVERPSGGFSAGGGLSCGTTGGAGPLAALIGSFAYSHRNLFGRNQKLHVSLERGQVD 480

Query: 481 FTFRINYTDPWIEGDDKRTSRTIMVQNSRTPGTLVHGGSSILTIGRVTAGLEFNRPIRPS 540
           FTFRINYTDPWIEGDDKRTSRTIMVQNSRTPGTLVHGGS+ LTIGRVTAGLEFNRPIRP 
Sbjct: 481 FTFRINYTDPWIEGDDKRTSRTIMVQNSRTPGTLVHGGSN-LTIGRVTAGLEFNRPIRPK 540

Query: 541 WSGTGGLYFQRAGARDEQGNPIIMDTIKCPLTASGNADDNMLLAKLEGVYTGSGDHGSSM 600
           WSGTGGLYFQRAGARDEQGNPIIMDTIKCPLTASGNADDNMLLAKLE VYTGSGDH SSM
Sbjct: 541 WSGTGGLYFQRAGARDEQGNPIIMDTIKCPLTASGNADDNMLLAKLESVYTGSGDHASSM 600

Query: 601 FVLSMEQGLPFLPEWLCFNRVNARARAGMKVGPSQLLLSLSEGHVVGKFCPHEAFAIGGT 660
           FVLSMEQGLPFLPEWLCFNRVNARAR G++VGPSQ LLSLS GHVVGKFCPHEAFAIGGT
Sbjct: 601 FVLSMEQGLPFLPEWLCFNRVNARARTGVEVGPSQFLLSLSGGHVVGKFCPHEAFAIGGT 660

Query: 661 NSVRGYEEGAVGSGRSYTVGCGEISFPLFAPVEGVIFADYGTDLGSGSSVPGDPAGARMK 720
           NSVRGYEEGAVGSGRSY VGCGEISFPLFAPVEGV+FADYGTDLGSGS+VPGDPAGARMK
Sbjct: 661 NSVRGYEEGAVGSGRSYAVGCGEISFPLFAPVEGVLFADYGTDLGSGSTVPGDPAGARMK 720

Query: 721 PGSGYGYGFGIRLDSPLGPLRLEYAFNDKGAKRFHFGVGHRN 763
           PGSGYGYGFGIRLDSPLGPLRLEYAFNDKGAKRFHFGVGHRN
Sbjct: 721 PGSGYGYGFGIRLDSPLGPLRLEYAFNDKGAKRFHFGVGHRN 761

BLAST of HG10001152 vs. NCBI nr
Match: XP_022995029.1 (outer envelope protein 80, chloroplastic [Cucurbita maxima])

HSP 1 Score: 1389.8 bits (3596), Expect = 0.0e+00
Identity = 695/762 (91.21%), Postives = 721/762 (94.62%), Query Frame = 0

Query: 1   MPPNDDIVFTSRATLRIPHFPPSSSHSSFRFGFRNLASHFDQSCNSIFQFIDSVKRGSKL 60
           MPPNDDIVFTSRATLR+PH PP +SHSSF F FRN+AS FDQSC+SIFQFIDSVKRGSKL
Sbjct: 1   MPPNDDIVFTSRATLRVPHLPPPNSHSSFSFCFRNMASQFDQSCHSIFQFIDSVKRGSKL 60

Query: 61  THFNHSISHFWPPALPLCASKKVTQQENSVNRRASWTWGAMFVDKSPLICSASMSLIQSD 120
           T+FNHSI HFWPP+LP   SKK+TQ+ NSV+R AS + GAMFVDKS LICSAS+SLIQSD
Sbjct: 61  TNFNHSIPHFWPPSLPFWGSKKITQRGNSVSRWASGSCGAMFVDKSSLICSASLSLIQSD 120

Query: 121 VSNKSDSGTCTEHQGKRRGMEDKTTGLVGKSTLLCSASLALTRPDELSQSVGSESKELPQ 180
            SNK++SGT  EH G++RGM+DK+TGLVGKS+LLCSASLAL R DE SQ  GSESKE PQ
Sbjct: 121 GSNKAESGTRAEHSGRQRGMDDKSTGLVGKSSLLCSASLALARSDEGSQPSGSESKESPQ 180

Query: 181 KGYSAGRLDEERVLISEVLVRNKDGEELERKDLELEAFIALKASRPNSALTVREVQEDVH 240
           KGYSAGR DEERVLISEVLVRNKDGEELERKDLE+E   ALKASRPNSALTVREVQEDVH
Sbjct: 181 KGYSAGRPDEERVLISEVLVRNKDGEELERKDLEMEVLTALKASRPNSALTVREVQEDVH 240

Query: 241 RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFRHGYVK 300
           RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFR GY K
Sbjct: 241 RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFRDGYGK 300

Query: 301 VVNLRHLDEVISSINSWYGERGLFGGVSAVDILSGGILSLQVSEAEVNNTSIRFLDKKTG 360
           VVNLRHLDEVISSINSWYGERGLFG VSAVDILSGGILSLQVSEAEVNN SIRFLDKKTG
Sbjct: 301 VVNLRHLDEVISSINSWYGERGLFGRVSAVDILSGGILSLQVSEAEVNNISIRFLDKKTG 360

Query: 361 EPISGNTRPETILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVDI 420
           EPI GNTRP+TILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVD+
Sbjct: 361 EPILGNTRPDTILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVDL 420

Query: 421 LMNVVERPSGGFSAGGGLSCGTTGGGGPLAALIGSFAYSHRNLFGRNQKLHVSLERGQVD 480
           LMNVVERPSGGFSAGGGLSCGTTGG GPLAALIGSFAYSHRNLFGRNQKLHVSLERGQVD
Sbjct: 421 LMNVVERPSGGFSAGGGLSCGTTGGAGPLAALIGSFAYSHRNLFGRNQKLHVSLERGQVD 480

Query: 481 FTFRINYTDPWIEGDDKRTSRTIMVQNSRTPGTLVHGGSSILTIGRVTAGLEFNRPIRPS 540
           FTFRINYTDPWIEGDDKRTSRTIMVQNSRTPGTLVHGGS+ LTIGRVTAGLEFNRPIRP 
Sbjct: 481 FTFRINYTDPWIEGDDKRTSRTIMVQNSRTPGTLVHGGSN-LTIGRVTAGLEFNRPIRPK 540

Query: 541 WSGTGGLYFQRAGARDEQGNPIIMDTIKCPLTASGNADDNMLLAKLEGVYTGSGDHGSSM 600
           WSGTGGLYFQRAGARDEQGNPIIMDTIKCPLTASGNADDNMLLAKLE VYTGSGDHGSSM
Sbjct: 541 WSGTGGLYFQRAGARDEQGNPIIMDTIKCPLTASGNADDNMLLAKLESVYTGSGDHGSSM 600

Query: 601 FVLSMEQGLPFLPEWLCFNRVNARARAGMKVGPSQLLLSLSEGHVVGKFCPHEAFAIGGT 660
           FVLSMEQGLPFLPEWLCFNRVNARAR G++VGPSQ LLSLS GHVVGKFCPHEAFAIGGT
Sbjct: 601 FVLSMEQGLPFLPEWLCFNRVNARARTGVEVGPSQFLLSLSGGHVVGKFCPHEAFAIGGT 660

Query: 661 NSVRGYEEGAVGSGRSYTVGCGEISFPLFAPVEGVIFADYGTDLGSGSSVPGDPAGARMK 720
           NSVRGYEEGAVGSGRSY VGCGEISFPLFAPVEGV+FADYGTDLGSGS+VPGDPAGARMK
Sbjct: 661 NSVRGYEEGAVGSGRSYAVGCGEISFPLFAPVEGVLFADYGTDLGSGSTVPGDPAGARMK 720

Query: 721 PGSGYGYGFGIRLDSPLGPLRLEYAFNDKGAKRFHFGVGHRN 763
           PGSGYGYGFGIRLDSPLGPLRLEYAFNDKGAKRFHFGVGHRN
Sbjct: 721 PGSGYGYGFGIRLDSPLGPLRLEYAFNDKGAKRFHFGVGHRN 761

BLAST of HG10001152 vs. NCBI nr
Match: TYJ96147.1 (outer envelope protein 80 [Cucumis melo var. makuwa])

HSP 1 Score: 1349.3 bits (3491), Expect = 0.0e+00
Identity = 678/762 (88.98%), Postives = 708/762 (92.91%), Query Frame = 0

Query: 1   MPPNDDIVFTSRATLRIPHFPPSSSHSSFRFGFRNLASHFDQSCNSIFQFIDSVKRGSKL 60
           MPPNDDIVFTSR+TLRIPHFPPS+SHSSFRF FRNLAS FDQSCNSI QFIDSVKRGSKL
Sbjct: 1   MPPNDDIVFTSRSTLRIPHFPPSTSHSSFRFCFRNLASQFDQSCNSISQFIDSVKRGSKL 60

Query: 61  THFNHSISHFWPPALPLCASKKVTQQENSVNRRASWTWGAMFVDKSPLICSASMSLIQSD 120
           +HFNHS    WPP LP C+SKKVTQ+ENS +RRASW WG++FV+KSPLIC ASMSLIQSD
Sbjct: 61  SHFNHSFPQLWPPTLPFCSSKKVTQRENSSSRRASWNWGSVFVEKSPLICLASMSLIQSD 120

Query: 121 VSNKSDSGTCTEHQGKRRGMEDKTTGLVGKSTLLCSASLALTRPDELSQSVGSESKELPQ 180
           +S+KS+S    E  G+RRGMEDK+TGLVGKS+LLCSASLAL R DE SQS GSE+KELPQ
Sbjct: 121 MSSKSES----EDSGRRRGMEDKSTGLVGKSSLLCSASLALARSDESSQSGGSENKELPQ 180

Query: 181 KGYSAGRLDEERVLISEVLVRNKDGEELERKDLELEAFIALKASRPNSALTVREVQEDVH 240
           KGYSA R+DEERVLISEVLVRNKDGEELERKDLELE F ALKASRPNSALTVREVQEDVH
Sbjct: 181 KGYSAARVDEERVLISEVLVRNKDGEELERKDLELEVFTALKASRPNSALTVREVQEDVH 240

Query: 241 RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFRHGYVK 300
           RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFR GY K
Sbjct: 241 RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFRDGYGK 300

Query: 301 VVNLRHLDEVISSINSWYGERGLFGGVSAVDILSGGILSLQVSEAEVNNTSIRFLDKKTG 360
           VVNLRHLDEVISSINSWYGERGLFG VSAVDILSGGILSLQVSEAEVNN SIRFLDKKTG
Sbjct: 301 VVNLRHLDEVISSINSWYGERGLFGRVSAVDILSGGILSLQVSEAEVNNISIRFLDKKTG 360

Query: 361 EPISGNTRPETILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVDI 420
           EPI GNTRPETILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVDI
Sbjct: 361 EPIPGNTRPETILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVDI 420

Query: 421 LMNVVERPSGGFSAGGGLSCGTTGGGGPLAALIGSFAYSHRNLFGRNQKLHVSLERGQVD 480
           LMNVVERP GGFSAGGGLSCG+TGG G L+ LIGS AYSHRNLFGRNQKLHVSLE+GQVD
Sbjct: 421 LMNVVERPGGGFSAGGGLSCGSTGGAGLLSTLIGSLAYSHRNLFGRNQKLHVSLEKGQVD 480

Query: 481 FTFRINYTDPWIEGDDKRTSRTIMVQNSRTPGTLVHGGSSILTIGRVTAGLEFNRPIRPS 540
            TFRINYTDPWIEGDDKRTS TIMVQNSRTPGTLVHGGS+ LTI RVTAGLEFNRPIRP+
Sbjct: 481 STFRINYTDPWIEGDDKRTSGTIMVQNSRTPGTLVHGGSN-LTIVRVTAGLEFNRPIRPT 540

Query: 541 WSGTGGLYFQRAGARDEQGNPIIMDTIKCPLTASGNADDNMLLAKLEGVYTGSGDHGSSM 600
           WSGT GLYFQRAGA++E+G PI+ D IKCPLTASGNA DNMLLAKLEGVYTGSGDHGSSM
Sbjct: 541 WSGTAGLYFQRAGAQNEKGEPILKDNIKCPLTASGNAVDNMLLAKLEGVYTGSGDHGSSM 600

Query: 601 FVLSMEQGLPFLPEWLCFNRVNARARAGMKVGPSQLLLSLSEGHVVGKFCPHEAFAIGGT 660
           FVLSMEQGLPFLPEWLCFNRVNARAR GM+VG +QLLLSLS GHVVG FCPHEAFAIGGT
Sbjct: 601 FVLSMEQGLPFLPEWLCFNRVNARARTGMEVGFAQLLLSLSGGHVVGNFCPHEAFAIGGT 660

Query: 661 NSVRGYEEGAVGSGRSYTVGCGEISFPLFAPVEGVIFADYGTDLGSGSSVPGDPAGARMK 720
           NSVRGYEEGAVGSGRSY VGCGEISFPLF PVEGV FADYGTDLGSG+SV GDPAGARMK
Sbjct: 661 NSVRGYEEGAVGSGRSYAVGCGEISFPLFGPVEGVFFADYGTDLGSGASVLGDPAGARMK 720

Query: 721 PGSGYGYGFGIRLDSPLGPLRLEYAFNDKGAKRFHFGVGHRN 763
            GSGYGYGFGIRL+SPLGPLRLEYAFNDK AKRFHFGVGHRN
Sbjct: 721 TGSGYGYGFGIRLESPLGPLRLEYAFNDKSAKRFHFGVGHRN 757

BLAST of HG10001152 vs. ExPASy Swiss-Prot
Match: Q9C5J8 (Outer envelope protein 80, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=OEP80 PE=2 SV=1)

HSP 1 Score: 960.7 bits (2482), Expect = 1.0e-278
Identity = 508/765 (66.41%), Postives = 580/765 (75.82%), Query Frame = 0

Query: 4   NDDIVFTSRATLRIPHFPPSSSHS---SFRFGFRNLASHFDQSCNSIFQFIDSVKRGSKL 63
           NDD+ F+S +++RI    P   HS   + +   +   SH   + NS+ Q + S+K     
Sbjct: 5   NDDVRFSS-SSIRIHSPSPKEQHSLLTNLQSCSKTFVSHLSNTRNSLNQMLQSLK----- 64

Query: 64  THFNHSISHFWPPALPLCASKKVTQQENSVNRRASWTWGAMFVDKSPLICSASMSLIQSD 123
                  +   PP   +      TQ  NSV +        + + KS  I   S+SLIQS 
Sbjct: 65  -------NRHTPPPRSVRRPNLPTQMLNSVTQ--------LMIGKSSPI---SLSLIQST 124

Query: 124 VSNKSDSGTCTEHQGKRRGMEDKTTGLVGKSTLLCSASLALTRPDELSQSVGSESKELPQ 183
             N S+S    E+    RG+          S LLC ASL+LTRP+E +QSV  +     Q
Sbjct: 125 QFNWSESR--DENVETIRGL---------SSPLLCCASLSLTRPNESTQSVEGKDTVQQQ 184

Query: 184 KGYSAGRLDEERVLISEVLVRNKDGEELERKDLELEAFIALKASRPNSALTVREVQEDVH 243
           KG+S  R  EERVLISEVLVR KDGEELERKDLE+EA  ALKA R NSALT+REVQEDVH
Sbjct: 185 KGHSVSRNAEERVLISEVLVRTKDGEELERKDLEMEALAALKACRANSALTIREVQEDVH 244

Query: 244 RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFRHGYVK 303
           RII SGYF SC PVAVDTRDGIRL+FQVEPNQEF+GLVCE ANVLP+KF+ EAFR G+ K
Sbjct: 245 RIIESGYFCSCTPVAVDTRDGIRLMFQVEPNQEFRGLVCENANVLPSKFIHEAFRDGFGK 304

Query: 304 VVNLRHLDEVISSINSWYGERGLFGGVSAVDILSGGILSLQVSEAEVNNTSIRFLDKKTG 363
           V+N++ L+E I+SIN WY ERGLFG VS +D LSGGI+ LQV+EAEVNN SIRFLD+KTG
Sbjct: 305 VINIKRLEEAITSINGWYMERGLFGIVSDIDTLSGGIVRLQVAEAEVNNISIRFLDRKTG 364

Query: 364 EPISGNTRPETILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVDI 423
           EP  G T PETILRQLTTKKGQVYSMLQGKRD +TVL MGIMEDVSIIPQPA D+GKVD+
Sbjct: 365 EPTKGKTSPETILRQLTTKKGQVYSMLQGKRDVDTVLAMGIMEDVSIIPQPAGDSGKVDL 424

Query: 424 LMNVVERPSGGFSAGGGLSCGTTGGGGPLAALIGSFAYSHRNLFGRNQKLHVSLERGQVD 483
           +MN VERPSGGFSAGGG+S G T   GPL+ LIGSFAYSHRNLFGRNQKL+VSLERGQ+D
Sbjct: 425 IMNCVERPSGGFSAGGGISSGIT--SGPLSGLIGSFAYSHRNLFGRNQKLNVSLERGQID 484

Query: 484 FTFRINYTDPWIEGDDKRTSRTIMVQNSRTPGTLVHGG---SSILTIGRVTAGLEFNRPI 543
             FRINYTDPWIEGDDKRTSR+IMVQNSRTPG LVHG    +S LTIGRVTAG+E++RP 
Sbjct: 485 SIFRINYTDPWIEGDDKRTSRSIMVQNSRTPGNLVHGNQPDNSSLTIGRVTAGVEYSRPF 544

Query: 544 RPSWSGTGGLYFQRAGARDEQGNPIIMDTIKCPLTASGNADDNMLLAKLEGVYTGSGDHG 603
           RP W+GT GL FQ AGARDEQGNPII D    PLTASG   D  +LAKLE +YTGSGD G
Sbjct: 545 RPKWNGTAGLIFQHAGARDEQGNPIIKDFYSSPLTASGKPHDETMLAKLESIYTGSGDQG 604

Query: 604 SSMFVLSMEQGLPFLPEWLCFNRVNARARAGMKVGPSQLLLSLSEGHVVGKFCPHEAFAI 663
           S+MF  +MEQGLP LPEWLCFNRV  RAR G+ +GP++ L SLS GHVVGKF PHEAF I
Sbjct: 605 STMFAFNMEQGLPVLPEWLCFNRVTGRARKGIHIGPARFLFSLSGGHVVGKFSPHEAFVI 664

Query: 664 GGTNSVRGYEEGAVGSGRSYTVGCGEISFPLFAPVEGVIFADYGTDLGSGSSVPGDPAGA 723
           GGTNSVRGYEEGAVGSGRSY VG GE+SFP+  PVEGVIF DYGTD+GSGS+VPGDPAGA
Sbjct: 665 GGTNSVRGYEEGAVGSGRSYVVGSGELSFPVRGPVEGVIFTDYGTDMGSGSTVPGDPAGA 724

Query: 724 RMKPGSGYGYGFGIRLDSPLGPLRLEYAFNDKGAKRFHFGVGHRN 763
           R+KPGSGYGYG G+R+DSPLGPLRLEYAFND+ A RFHFGVG RN
Sbjct: 725 RLKPGSGYGYGLGVRVDSPLGPLRLEYAFNDQHAGRFHFGVGLRN 732

BLAST of HG10001152 vs. ExPASy Swiss-Prot
Match: A2X208 (Outer envelope protein 80, chloroplastic OS=Oryza sativa subsp. indica OX=39946 GN=OEP80 PE=3 SV=1)

HSP 1 Score: 858.6 bits (2217), Expect = 5.3e-248
Identity = 423/587 (72.06%), Postives = 486/587 (82.79%), Query Frame = 0

Query: 180 QKGYSAGRLDEERVLISEVLVRNKDGEELERKDLELEAFIALKASRPNSALTVREVQEDV 239
           QK    G   EERVLISEV VR KDGE LER +LE  A  AL+A RPN+ALTVREVQEDV
Sbjct: 81  QKDGGGGGGGEERVLISEVAVRGKDGEPLERPELEAAAAAALRACRPNAALTVREVQEDV 140

Query: 240 HRIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFRHGYV 299
           HR++ SG F SC+PVAVDTRDGIRL+F+VEPNQ+F GLVCEGAN+LP+KFLE+AF   + 
Sbjct: 141 HRVVESGLFRSCMPVAVDTRDGIRLVFEVEPNQDFHGLVCEGANMLPSKFLEDAFHDRHG 200

Query: 300 KVVNLRHLDEVISSINSWYGERGLFGGVSAVDILSGGILSLQVSEAEVNNTSIRFLDKKT 359
           K++N+RHLD+VI S+N WY ERGL G VS  +ILSGGIL LQVSEAEVNN +IRFLD++T
Sbjct: 201 KIINIRHLDQVIKSVNGWYQERGLTGLVSYAEILSGGILRLQVSEAEVNNINIRFLDRRT 260

Query: 360 GEPISGNTRPETILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVD 419
           GEP  G T+PETILR LTTKKGQ Y+  Q KRD ET+LTMGIMEDV+IIPQP  D+ KVD
Sbjct: 261 GEPTVGKTQPETILRHLTTKKGQAYNRAQVKRDVETILTMGIMEDVTIIPQPVGDSNKVD 320

Query: 420 ILMNVVERPSGGFSAGGGLSCGTTGGGGPLAALIGSFAYSHRNLFGRNQKLHVSLERGQV 479
           ++MN+VERPSGGFSAGGG+S G T   GPL+ LIGSFAYSHRN+FGRN+KL++SLERGQ+
Sbjct: 321 LVMNLVERPSGGFSAGGGISSGIT--NGPLSGLIGSFAYSHRNVFGRNKKLNLSLERGQI 380

Query: 480 DFTFRINYTDPWIEGDDKRTSRTIMVQNSRTPGTLVHGGS----SILTIGRVTAGLEFNR 539
           D  FR+NYTDPWI+GD+KRTSRTIMVQNSRTPGTL+HGG       +TIGRVTAG+E++R
Sbjct: 381 DSIFRLNYTDPWIDGDNKRTSRTIMVQNSRTPGTLIHGGDHPDHGPITIGRVTAGIEYSR 440

Query: 540 PIRPSWSGTGGLYFQRAGARDEQGNPIIMDTIKCPLTASGNADDNMLLAKLEGVYTGSGD 599
           P RP WSGT GL FQ AGARD++GNPII D     LTASGNA D+ LLAKLE VYT SGD
Sbjct: 441 PFRPKWSGTLGLIFQHAGARDDKGNPIIRDFYNSQLTASGNAYDDTLLAKLESVYTDSGD 500

Query: 600 HGSSMFVLSMEQGLPFLPEWLCFNRVNARARAGMKVGPSQLLLSLSEGHVVGKFCPHEAF 659
             S+MFV ++EQGLP LPEWL FNRV AR R G ++GP++LLLS S GHV G F PHEAF
Sbjct: 501 RSSTMFVFNIEQGLPILPEWLSFNRVTARLRQGYEIGPARLLLSASGGHVEGNFSPHEAF 560

Query: 660 AIGGTNSVRGYEEGAVGSGRSYTVGCGEISFPLFAPVEGVIFADYGTDLGSGSSVPGDPA 719
           AIGGTNSVRGYEEGAVGSGRSY VG GE+S  +F P+EGV+F DYG+DL SG  VPGDPA
Sbjct: 561 AIGGTNSVRGYEEGAVGSGRSYAVGSGEVSCRMFGPLEGVVFGDYGSDLSSGPKVPGDPA 620

Query: 720 GARMKPGSGYGYGFGIRLDSPLGPLRLEYAFNDKGAKRFHFGVGHRN 763
           GAR KPGSGYGYG GIR+DSPLGPLRLEYAFNDK A+RFHFGVG+RN
Sbjct: 621 GARGKPGSGYGYGVGIRVDSPLGPLRLEYAFNDKQARRFHFGVGYRN 665

BLAST of HG10001152 vs. ExPASy Swiss-Prot
Match: Q6H7M7 (Outer envelope protein 80, chloroplastic OS=Oryza sativa subsp. japonica OX=39947 GN=OEP80 PE=3 SV=2)

HSP 1 Score: 858.2 bits (2216), Expect = 7.0e-248
Identity = 422/587 (71.89%), Postives = 486/587 (82.79%), Query Frame = 0

Query: 180 QKGYSAGRLDEERVLISEVLVRNKDGEELERKDLELEAFIALKASRPNSALTVREVQEDV 239
           QK    G   EERVLISEV VR KDGE LER +LE  A  AL+A RPN+ALTVREVQEDV
Sbjct: 81  QKDGGGGGGGEERVLISEVAVRGKDGEPLERPELEAAAAAALRACRPNAALTVREVQEDV 140

Query: 240 HRIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFRHGYV 299
           HR++ SG F SC+PVAVDTRDGIRL+F+VEPNQ+F GLVCEGAN+LP+KFLE+AF   + 
Sbjct: 141 HRVVESGLFRSCMPVAVDTRDGIRLVFEVEPNQDFHGLVCEGANMLPSKFLEDAFHDRHG 200

Query: 300 KVVNLRHLDEVISSINSWYGERGLFGGVSAVDILSGGILSLQVSEAEVNNTSIRFLDKKT 359
           K++N+RHLD+VI S+N WY ERGL G VS  +ILSGGIL LQVSEAEVNN +IRFLD++T
Sbjct: 201 KIINIRHLDQVIKSVNGWYQERGLTGLVSYAEILSGGILRLQVSEAEVNNINIRFLDRRT 260

Query: 360 GEPISGNTRPETILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVD 419
           GEP  G T+PETILR LTTKKGQ Y+  Q KRD ET+LTMGIMEDV+IIPQP  D+ KVD
Sbjct: 261 GEPTVGKTQPETILRHLTTKKGQAYNRAQVKRDVETILTMGIMEDVTIIPQPVGDSNKVD 320

Query: 420 ILMNVVERPSGGFSAGGGLSCGTTGGGGPLAALIGSFAYSHRNLFGRNQKLHVSLERGQV 479
           ++MN+VERPSGGFSAGGG+S G T   GPL+ LIGSFAYSHRN+FGRN+KL++SLERGQ+
Sbjct: 321 LVMNLVERPSGGFSAGGGISSGIT--NGPLSGLIGSFAYSHRNVFGRNKKLNLSLERGQI 380

Query: 480 DFTFRINYTDPWIEGDDKRTSRTIMVQNSRTPGTLVHGGS----SILTIGRVTAGLEFNR 539
           D  FR+NYTDPWI+GD+KRTSRTIMVQNSRTPGTL+HGG       +TIGRVTAG+E++R
Sbjct: 381 DSIFRLNYTDPWIDGDNKRTSRTIMVQNSRTPGTLIHGGDHPDHGPITIGRVTAGIEYSR 440

Query: 540 PIRPSWSGTGGLYFQRAGARDEQGNPIIMDTIKCPLTASGNADDNMLLAKLEGVYTGSGD 599
           P RP WSGT GL FQ AGARD++GNPII D     LTASGNA D+ LLAKLE VYT SGD
Sbjct: 441 PFRPKWSGTLGLIFQHAGARDDKGNPIIRDFYNSQLTASGNAYDDTLLAKLESVYTDSGD 500

Query: 600 HGSSMFVLSMEQGLPFLPEWLCFNRVNARARAGMKVGPSQLLLSLSEGHVVGKFCPHEAF 659
             S+MFV ++EQGLP LPEWL FNRV AR R G ++GP++LLLS S GHV G F PHEAF
Sbjct: 501 RSSTMFVFNIEQGLPILPEWLSFNRVTARLRQGYEIGPARLLLSASGGHVEGNFSPHEAF 560

Query: 660 AIGGTNSVRGYEEGAVGSGRSYTVGCGEISFPLFAPVEGVIFADYGTDLGSGSSVPGDPA 719
           AIGGTNSVRGYEEGAVGSGRSY VG GE+S  +F P+EGV+F DYG+DL SG  VPGDPA
Sbjct: 561 AIGGTNSVRGYEEGAVGSGRSYAVGSGEVSCRMFGPLEGVVFGDYGSDLSSGPKVPGDPA 620

Query: 720 GARMKPGSGYGYGFGIRLDSPLGPLRLEYAFNDKGAKRFHFGVGHRN 763
           GAR KPGSGYGYG G+R+DSPLGPLRLEYAFNDK A+RFHFGVG+RN
Sbjct: 621 GARGKPGSGYGYGVGVRVDSPLGPLRLEYAFNDKQARRFHFGVGYRN 665

BLAST of HG10001152 vs. ExPASy Swiss-Prot
Match: Q5PP51 (Outer envelope protein 39, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=P39 PE=2 SV=1)

HSP 1 Score: 245.7 bits (626), Expect = 1.6e-63
Identity = 131/320 (40.94%), Postives = 183/320 (57.19%), Query Frame = 0

Query: 448 PLAALIGSFAYSHRNLFGRNQKLHVSLERGQVDFTFRINYTDPWIEGDDKRTSRTIMVQN 507
           PL+ +IGS    H NLFG ++KL VS ++G  D    + +  P  E    R  +   +Q+
Sbjct: 39  PLSLVIGSLCIKHPNLFGGSEKLDVSWDKGLYDSNVLVAFRRPRPEW---RPQQCFFIQH 98

Query: 508 SRTPGTLVHG---------GSSILTIGRVTAGLEFNRPIRPSWSGTGGLYFQRAGARDEQ 567
           S +P   VHG         GS  + + ++  GL+ + P    WS T  + F+     ++ 
Sbjct: 99  SLSPEIGVHGTPVDNFSRSGSGGVNLSKLALGLDLSEPASSKWSSTTSIKFEHVRPINDD 158

Query: 568 GNPIIMDTIKCPLTASGNADDNMLLAKLEGVYTGSGDHGSSMFVLSMEQGLPFLPEWLCF 627
           G  I  D    P+T SGN  D+M++ K E  +  + D G S F + +EQG+P + +WL F
Sbjct: 159 GRAITRDLDGFPITCSGNTHDSMVVLKQESRFAKATDQGLSHFSMQIEQGIPVVSKWLIF 218

Query: 628 NRVNARARAGMKVGPSQLLLSLSEGHVVGKFCPHEAFAIGGTNSVRGYEEGAVGSGRSYT 687
           NR    A  G++ GP+ LL SL+ G +VG   P++AFAIGG  SVRGY EGAVGSGRS  
Sbjct: 219 NRFKFVASKGVRFGPAFLLASLTGGSIVGDMAPYQAFAIGGLGSVRGYGEGAVGSGRSCL 278

Query: 688 VGCGEISFPLFAPVEGVIFADYGTDLGSGSSVPGDPAGARMKPGSGYGYGFGIRLDSPLG 747
           V   E++ PL    EG IF D GTDLGS   VPG+P+  + KPG GYG+G+G+R  SPLG
Sbjct: 279 VANTELALPLNKMTEGTIFLDCGTDLGSSRLVPGNPSMRQGKPGFGYGFGYGLRFKSPLG 338

Query: 748 PLRLEYAFNDKGAKRFHFGV 759
            L+++YA N    K  +FGV
Sbjct: 339 HLQVDYAINAFNQKTLYFGV 355

BLAST of HG10001152 vs. ExPASy Swiss-Prot
Match: F4JF35 (Outer envelope protein 36, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=P36 PE=2 SV=1)

HSP 1 Score: 161.8 bits (408), Expect = 3.1e-38
Identity = 104/314 (33.12%), Postives = 151/314 (48.09%), Query Frame = 0

Query: 491 WIEGDDKRTSRTIMVQNSRTPGTLVHGGSSILTIGRVTAGLEFNRPIRPSWSGTGGLYFQ 550
           W+ GD       + +   R   +   GG   + + ++  GL+ + P    WS T  + F+
Sbjct: 15  WLLGD-------LDLNGVRNSASSYSGG---VNLSKLAVGLDLSEPASSKWSSTTSVKFE 74

Query: 551 RAG-----------------ARDEQGNPIIMDTIKCPLTASGNADDNMLLAKLEGVYTGS 610
             G                  R ++   I  + ++  L  SGN  D+M++ K E  +  +
Sbjct: 75  VPGHYLLQTLYMCVRLTMTDGRYQRSGRISYN-MQNSLLCSGNTHDSMVVLKQESRFAKA 134

Query: 611 GDHGSSMFVLSMEQGLPFLPEWLCFNRVNARARAGMKVGPSQLLLSLSEGHVVGKFCPHE 670
            D G S F + +EQG+P +  WL FNR    A  G++ GP+  L SL+ G +VG   P++
Sbjct: 135 TDQGLSHFSMQIEQGIPVVSNWLIFNRFKFVASKGVRFGPAFPLASLTGGSIVGDMTPYQ 194

Query: 671 AFAIGGTNSVRGYEEGAVGSGRSYTVGCGEISFPLFAPVEGVIFADYGTDLGSGSSVP-- 730
           AFAIGG  SVRGY E AVGSGRS  V   E++  +    EG IF D GTDLGS   VP  
Sbjct: 195 AFAIGGLGSVRGYGEVAVGSGRSCLVANTELANKM---TEGTIFLDCGTDLGSSRLVPVS 254

Query: 731 ---------------------------GDPAGARMKPGSGYGYGFGIRLDSPLGPLRLEY 759
                                      G+P+  + KPG GYG+G+G+R  SPLG L+++Y
Sbjct: 255 SLYLLRTRTIKKLLRHENDKAVAELVSGNPSLRQGKPGFGYGFGYGLRFKSPLGHLQVDY 314

BLAST of HG10001152 vs. ExPASy TrEMBL
Match: A0A6J1H3L6 (outer envelope protein 80, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111459337 PE=4 SV=1)

HSP 1 Score: 1390.2 bits (3597), Expect = 0.0e+00
Identity = 694/762 (91.08%), Postives = 720/762 (94.49%), Query Frame = 0

Query: 1   MPPNDDIVFTSRATLRIPHFPPSSSHSSFRFGFRNLASHFDQSCNSIFQFIDSVKRGSKL 60
           MPPNDDIVFTSRATLR+PH PP +SHSSFRF FRN+AS FDQSC+SIFQFIDSVKRGSK 
Sbjct: 1   MPPNDDIVFTSRATLRVPHLPPPNSHSSFRFCFRNMASQFDQSCHSIFQFIDSVKRGSKF 60

Query: 61  THFNHSISHFWPPALPLCASKKVTQQENSVNRRASWTWGAMFVDKSPLICSASMSLIQSD 120
           T+FNHSI HFWPP+LP   SKK+TQ+ NSV+R AS + GAMFVDKS LICSAS+SLIQSD
Sbjct: 61  TNFNHSIPHFWPPSLPFWGSKKITQRGNSVSRWASGSCGAMFVDKSSLICSASLSLIQSD 120

Query: 121 VSNKSDSGTCTEHQGKRRGMEDKTTGLVGKSTLLCSASLALTRPDELSQSVGSESKELPQ 180
            SNK++SGT  EH G++RGM+DK+TGLVGKS+LLCSASLAL R DE SQ  GSESKE PQ
Sbjct: 121 GSNKAESGTRAEHSGRQRGMDDKSTGLVGKSSLLCSASLALARSDEASQPSGSESKESPQ 180

Query: 181 KGYSAGRLDEERVLISEVLVRNKDGEELERKDLELEAFIALKASRPNSALTVREVQEDVH 240
           KGYSAGR DEERVLISEVLVRNKDGEELERKDLE+E   ALKASRPNSALTVREVQEDVH
Sbjct: 181 KGYSAGRPDEERVLISEVLVRNKDGEELERKDLEMEVLTALKASRPNSALTVREVQEDVH 240

Query: 241 RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFRHGYVK 300
           RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFR GY K
Sbjct: 241 RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFRDGYGK 300

Query: 301 VVNLRHLDEVISSINSWYGERGLFGGVSAVDILSGGILSLQVSEAEVNNTSIRFLDKKTG 360
           VVNLRHLDEVISSINSWYGERGLFG VSAVDILSGGILSLQVSEAEVNN SIRFLDKKTG
Sbjct: 301 VVNLRHLDEVISSINSWYGERGLFGRVSAVDILSGGILSLQVSEAEVNNISIRFLDKKTG 360

Query: 361 EPISGNTRPETILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVDI 420
           EPISGNTRP+TILRQLTTKKGQVYSM QGKRDAETVLTMGIMEDVSIIPQPAADAGKVD+
Sbjct: 361 EPISGNTRPDTILRQLTTKKGQVYSMFQGKRDAETVLTMGIMEDVSIIPQPAADAGKVDL 420

Query: 421 LMNVVERPSGGFSAGGGLSCGTTGGGGPLAALIGSFAYSHRNLFGRNQKLHVSLERGQVD 480
           LMNVVERPSGGFSAGGGLSCGTTGG GPLAALIGSFAYSHRNLFGRNQKLHVSLERGQVD
Sbjct: 421 LMNVVERPSGGFSAGGGLSCGTTGGAGPLAALIGSFAYSHRNLFGRNQKLHVSLERGQVD 480

Query: 481 FTFRINYTDPWIEGDDKRTSRTIMVQNSRTPGTLVHGGSSILTIGRVTAGLEFNRPIRPS 540
           FTFRINYTDPWIEGDDKRTSRTIMVQNSRTPGTLVHGGS+ LTIGRVTAGLEFNRPIRP 
Sbjct: 481 FTFRINYTDPWIEGDDKRTSRTIMVQNSRTPGTLVHGGSN-LTIGRVTAGLEFNRPIRPK 540

Query: 541 WSGTGGLYFQRAGARDEQGNPIIMDTIKCPLTASGNADDNMLLAKLEGVYTGSGDHGSSM 600
           WSGTGGLYFQRAGARDEQGNPIIMDTIKCPLTASGNADDNMLLAKLE VYTGSGDH SSM
Sbjct: 541 WSGTGGLYFQRAGARDEQGNPIIMDTIKCPLTASGNADDNMLLAKLESVYTGSGDHASSM 600

Query: 601 FVLSMEQGLPFLPEWLCFNRVNARARAGMKVGPSQLLLSLSEGHVVGKFCPHEAFAIGGT 660
           FVLSMEQGLPFLPEWLCFNRVNARAR G++VGPSQ LLSLS GHVVGKFCPHEAFAIGGT
Sbjct: 601 FVLSMEQGLPFLPEWLCFNRVNARARTGVEVGPSQFLLSLSGGHVVGKFCPHEAFAIGGT 660

Query: 661 NSVRGYEEGAVGSGRSYTVGCGEISFPLFAPVEGVIFADYGTDLGSGSSVPGDPAGARMK 720
           NSVRGYEEGAVGSGRSY VGCGEISFPLFAPVEGV+FADYGTDLGSGS+VPGDPAGARMK
Sbjct: 661 NSVRGYEEGAVGSGRSYAVGCGEISFPLFAPVEGVLFADYGTDLGSGSTVPGDPAGARMK 720

Query: 721 PGSGYGYGFGIRLDSPLGPLRLEYAFNDKGAKRFHFGVGHRN 763
           PGSGYGYGFGIRLDSPLGPLRLEYAFNDKGAKRFHFGVGHRN
Sbjct: 721 PGSGYGYGFGIRLDSPLGPLRLEYAFNDKGAKRFHFGVGHRN 761

BLAST of HG10001152 vs. ExPASy TrEMBL
Match: A0A6J1K4I6 (outer envelope protein 80, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111490707 PE=4 SV=1)

HSP 1 Score: 1389.8 bits (3596), Expect = 0.0e+00
Identity = 695/762 (91.21%), Postives = 721/762 (94.62%), Query Frame = 0

Query: 1   MPPNDDIVFTSRATLRIPHFPPSSSHSSFRFGFRNLASHFDQSCNSIFQFIDSVKRGSKL 60
           MPPNDDIVFTSRATLR+PH PP +SHSSF F FRN+AS FDQSC+SIFQFIDSVKRGSKL
Sbjct: 1   MPPNDDIVFTSRATLRVPHLPPPNSHSSFSFCFRNMASQFDQSCHSIFQFIDSVKRGSKL 60

Query: 61  THFNHSISHFWPPALPLCASKKVTQQENSVNRRASWTWGAMFVDKSPLICSASMSLIQSD 120
           T+FNHSI HFWPP+LP   SKK+TQ+ NSV+R AS + GAMFVDKS LICSAS+SLIQSD
Sbjct: 61  TNFNHSIPHFWPPSLPFWGSKKITQRGNSVSRWASGSCGAMFVDKSSLICSASLSLIQSD 120

Query: 121 VSNKSDSGTCTEHQGKRRGMEDKTTGLVGKSTLLCSASLALTRPDELSQSVGSESKELPQ 180
            SNK++SGT  EH G++RGM+DK+TGLVGKS+LLCSASLAL R DE SQ  GSESKE PQ
Sbjct: 121 GSNKAESGTRAEHSGRQRGMDDKSTGLVGKSSLLCSASLALARSDEGSQPSGSESKESPQ 180

Query: 181 KGYSAGRLDEERVLISEVLVRNKDGEELERKDLELEAFIALKASRPNSALTVREVQEDVH 240
           KGYSAGR DEERVLISEVLVRNKDGEELERKDLE+E   ALKASRPNSALTVREVQEDVH
Sbjct: 181 KGYSAGRPDEERVLISEVLVRNKDGEELERKDLEMEVLTALKASRPNSALTVREVQEDVH 240

Query: 241 RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFRHGYVK 300
           RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFR GY K
Sbjct: 241 RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFRDGYGK 300

Query: 301 VVNLRHLDEVISSINSWYGERGLFGGVSAVDILSGGILSLQVSEAEVNNTSIRFLDKKTG 360
           VVNLRHLDEVISSINSWYGERGLFG VSAVDILSGGILSLQVSEAEVNN SIRFLDKKTG
Sbjct: 301 VVNLRHLDEVISSINSWYGERGLFGRVSAVDILSGGILSLQVSEAEVNNISIRFLDKKTG 360

Query: 361 EPISGNTRPETILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVDI 420
           EPI GNTRP+TILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVD+
Sbjct: 361 EPILGNTRPDTILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVDL 420

Query: 421 LMNVVERPSGGFSAGGGLSCGTTGGGGPLAALIGSFAYSHRNLFGRNQKLHVSLERGQVD 480
           LMNVVERPSGGFSAGGGLSCGTTGG GPLAALIGSFAYSHRNLFGRNQKLHVSLERGQVD
Sbjct: 421 LMNVVERPSGGFSAGGGLSCGTTGGAGPLAALIGSFAYSHRNLFGRNQKLHVSLERGQVD 480

Query: 481 FTFRINYTDPWIEGDDKRTSRTIMVQNSRTPGTLVHGGSSILTIGRVTAGLEFNRPIRPS 540
           FTFRINYTDPWIEGDDKRTSRTIMVQNSRTPGTLVHGGS+ LTIGRVTAGLEFNRPIRP 
Sbjct: 481 FTFRINYTDPWIEGDDKRTSRTIMVQNSRTPGTLVHGGSN-LTIGRVTAGLEFNRPIRPK 540

Query: 541 WSGTGGLYFQRAGARDEQGNPIIMDTIKCPLTASGNADDNMLLAKLEGVYTGSGDHGSSM 600
           WSGTGGLYFQRAGARDEQGNPIIMDTIKCPLTASGNADDNMLLAKLE VYTGSGDHGSSM
Sbjct: 541 WSGTGGLYFQRAGARDEQGNPIIMDTIKCPLTASGNADDNMLLAKLESVYTGSGDHGSSM 600

Query: 601 FVLSMEQGLPFLPEWLCFNRVNARARAGMKVGPSQLLLSLSEGHVVGKFCPHEAFAIGGT 660
           FVLSMEQGLPFLPEWLCFNRVNARAR G++VGPSQ LLSLS GHVVGKFCPHEAFAIGGT
Sbjct: 601 FVLSMEQGLPFLPEWLCFNRVNARARTGVEVGPSQFLLSLSGGHVVGKFCPHEAFAIGGT 660

Query: 661 NSVRGYEEGAVGSGRSYTVGCGEISFPLFAPVEGVIFADYGTDLGSGSSVPGDPAGARMK 720
           NSVRGYEEGAVGSGRSY VGCGEISFPLFAPVEGV+FADYGTDLGSGS+VPGDPAGARMK
Sbjct: 661 NSVRGYEEGAVGSGRSYAVGCGEISFPLFAPVEGVLFADYGTDLGSGSTVPGDPAGARMK 720

Query: 721 PGSGYGYGFGIRLDSPLGPLRLEYAFNDKGAKRFHFGVGHRN 763
           PGSGYGYGFGIRLDSPLGPLRLEYAFNDKGAKRFHFGVGHRN
Sbjct: 721 PGSGYGYGFGIRLDSPLGPLRLEYAFNDKGAKRFHFGVGHRN 761

BLAST of HG10001152 vs. ExPASy TrEMBL
Match: A0A5D3BAJ7 (Outer envelope protein 80 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold182G001160 PE=4 SV=1)

HSP 1 Score: 1349.3 bits (3491), Expect = 0.0e+00
Identity = 678/762 (88.98%), Postives = 708/762 (92.91%), Query Frame = 0

Query: 1   MPPNDDIVFTSRATLRIPHFPPSSSHSSFRFGFRNLASHFDQSCNSIFQFIDSVKRGSKL 60
           MPPNDDIVFTSR+TLRIPHFPPS+SHSSFRF FRNLAS FDQSCNSI QFIDSVKRGSKL
Sbjct: 1   MPPNDDIVFTSRSTLRIPHFPPSTSHSSFRFCFRNLASQFDQSCNSISQFIDSVKRGSKL 60

Query: 61  THFNHSISHFWPPALPLCASKKVTQQENSVNRRASWTWGAMFVDKSPLICSASMSLIQSD 120
           +HFNHS    WPP LP C+SKKVTQ+ENS +RRASW WG++FV+KSPLIC ASMSLIQSD
Sbjct: 61  SHFNHSFPQLWPPTLPFCSSKKVTQRENSSSRRASWNWGSVFVEKSPLICLASMSLIQSD 120

Query: 121 VSNKSDSGTCTEHQGKRRGMEDKTTGLVGKSTLLCSASLALTRPDELSQSVGSESKELPQ 180
           +S+KS+S    E  G+RRGMEDK+TGLVGKS+LLCSASLAL R DE SQS GSE+KELPQ
Sbjct: 121 MSSKSES----EDSGRRRGMEDKSTGLVGKSSLLCSASLALARSDESSQSGGSENKELPQ 180

Query: 181 KGYSAGRLDEERVLISEVLVRNKDGEELERKDLELEAFIALKASRPNSALTVREVQEDVH 240
           KGYSA R+DEERVLISEVLVRNKDGEELERKDLELE F ALKASRPNSALTVREVQEDVH
Sbjct: 181 KGYSAARVDEERVLISEVLVRNKDGEELERKDLELEVFTALKASRPNSALTVREVQEDVH 240

Query: 241 RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFRHGYVK 300
           RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFR GY K
Sbjct: 241 RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFRDGYGK 300

Query: 301 VVNLRHLDEVISSINSWYGERGLFGGVSAVDILSGGILSLQVSEAEVNNTSIRFLDKKTG 360
           VVNLRHLDEVISSINSWYGERGLFG VSAVDILSGGILSLQVSEAEVNN SIRFLDKKTG
Sbjct: 301 VVNLRHLDEVISSINSWYGERGLFGRVSAVDILSGGILSLQVSEAEVNNISIRFLDKKTG 360

Query: 361 EPISGNTRPETILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVDI 420
           EPI GNTRPETILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVDI
Sbjct: 361 EPIPGNTRPETILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVDI 420

Query: 421 LMNVVERPSGGFSAGGGLSCGTTGGGGPLAALIGSFAYSHRNLFGRNQKLHVSLERGQVD 480
           LMNVVERP GGFSAGGGLSCG+TGG G L+ LIGS AYSHRNLFGRNQKLHVSLE+GQVD
Sbjct: 421 LMNVVERPGGGFSAGGGLSCGSTGGAGLLSTLIGSLAYSHRNLFGRNQKLHVSLEKGQVD 480

Query: 481 FTFRINYTDPWIEGDDKRTSRTIMVQNSRTPGTLVHGGSSILTIGRVTAGLEFNRPIRPS 540
            TFRINYTDPWIEGDDKRTS TIMVQNSRTPGTLVHGGS+ LTI RVTAGLEFNRPIRP+
Sbjct: 481 STFRINYTDPWIEGDDKRTSGTIMVQNSRTPGTLVHGGSN-LTIVRVTAGLEFNRPIRPT 540

Query: 541 WSGTGGLYFQRAGARDEQGNPIIMDTIKCPLTASGNADDNMLLAKLEGVYTGSGDHGSSM 600
           WSGT GLYFQRAGA++E+G PI+ D IKCPLTASGNA DNMLLAKLEGVYTGSGDHGSSM
Sbjct: 541 WSGTAGLYFQRAGAQNEKGEPILKDNIKCPLTASGNAVDNMLLAKLEGVYTGSGDHGSSM 600

Query: 601 FVLSMEQGLPFLPEWLCFNRVNARARAGMKVGPSQLLLSLSEGHVVGKFCPHEAFAIGGT 660
           FVLSMEQGLPFLPEWLCFNRVNARAR GM+VG +QLLLSLS GHVVG FCPHEAFAIGGT
Sbjct: 601 FVLSMEQGLPFLPEWLCFNRVNARARTGMEVGFAQLLLSLSGGHVVGNFCPHEAFAIGGT 660

Query: 661 NSVRGYEEGAVGSGRSYTVGCGEISFPLFAPVEGVIFADYGTDLGSGSSVPGDPAGARMK 720
           NSVRGYEEGAVGSGRSY VGCGEISFPLF PVEGV FADYGTDLGSG+SV GDPAGARMK
Sbjct: 661 NSVRGYEEGAVGSGRSYAVGCGEISFPLFGPVEGVFFADYGTDLGSGASVLGDPAGARMK 720

Query: 721 PGSGYGYGFGIRLDSPLGPLRLEYAFNDKGAKRFHFGVGHRN 763
            GSGYGYGFGIRL+SPLGPLRLEYAFNDK AKRFHFGVGHRN
Sbjct: 721 TGSGYGYGFGIRLESPLGPLRLEYAFNDKSAKRFHFGVGHRN 757

BLAST of HG10001152 vs. ExPASy TrEMBL
Match: A0A0A0KZC1 (Omp85 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G290840 PE=4 SV=1)

HSP 1 Score: 1348.6 bits (3489), Expect = 0.0e+00
Identity = 675/762 (88.58%), Postives = 707/762 (92.78%), Query Frame = 0

Query: 1   MPPNDDIVFTSRATLRIPHFPPSSSHSSFRFGFRNLASHFDQSCNSIFQFIDSVKRGSKL 60
           MPPNDDIVFTSR+TLRIPHFPPSSSHSSFRF FRNLAS FDQSC SI  FIDSVKRGSKL
Sbjct: 1   MPPNDDIVFTSRSTLRIPHFPPSSSHSSFRFCFRNLASQFDQSCKSISHFIDSVKRGSKL 60

Query: 61  THFNHSISHFWPPALPLCASKKVTQQENSVNRRASWTWGAMFVDKSPLICSASMSLIQSD 120
           +HFNHS  H WPP LP C+SKKVTQQE+S++RRASW WG++FV+K PLICSASMSLIQSD
Sbjct: 61  SHFNHSFPHLWPPTLPFCSSKKVTQQESSISRRASWNWGSVFVEKYPLICSASMSLIQSD 120

Query: 121 VSNKSDSGTCTEHQGKRRGMEDKTTGLVGKSTLLCSASLALTRPDELSQSVGSESKELPQ 180
           +S+KS+S    E  GKR+GMED +TGLVGKS+LLCSASLALTR DE +QS GSESKELPQ
Sbjct: 121 MSSKSES----EDSGKRQGMEDMSTGLVGKSSLLCSASLALTRSDESNQSGGSESKELPQ 180

Query: 181 KGYSAGRLDEERVLISEVLVRNKDGEELERKDLELEAFIALKASRPNSALTVREVQEDVH 240
           KGYSA R+DEERVLISEVLVRNKDGEELERKDLELE F ALKASRPNSALTVREVQEDVH
Sbjct: 181 KGYSAARVDEERVLISEVLVRNKDGEELERKDLELEVFTALKASRPNSALTVREVQEDVH 240

Query: 241 RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFRHGYVK 300
           RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFR GY K
Sbjct: 241 RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFRDGYGK 300

Query: 301 VVNLRHLDEVISSINSWYGERGLFGGVSAVDILSGGILSLQVSEAEVNNTSIRFLDKKTG 360
           VVNLRHLDEVISSIN WYGERGLFG VSAVDILSGGILSLQVSEAEVNN SIRFLDKKTG
Sbjct: 301 VVNLRHLDEVISSINGWYGERGLFGRVSAVDILSGGILSLQVSEAEVNNISIRFLDKKTG 360

Query: 361 EPISGNTRPETILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVDI 420
           EPI GNTRPETILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVDI
Sbjct: 361 EPIPGNTRPETILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVDI 420

Query: 421 LMNVVERPSGGFSAGGGLSCGTTGGGGPLAALIGSFAYSHRNLFGRNQKLHVSLERGQVD 480
           LMNVVERP GGFSAGGGLSCG+TGG G L+ LIGS AYSHRNLFGRNQKLHVSLE+GQVD
Sbjct: 421 LMNVVERPGGGFSAGGGLSCGSTGGAGLLSTLIGSLAYSHRNLFGRNQKLHVSLEKGQVD 480

Query: 481 FTFRINYTDPWIEGDDKRTSRTIMVQNSRTPGTLVHGGSSILTIGRVTAGLEFNRPIRPS 540
            TFRINYTDPWIEGDDKRTSRT+MVQNSRTPGTLVHGGS+ LTI RVTAGLEFNRPIRP+
Sbjct: 481 STFRINYTDPWIEGDDKRTSRTMMVQNSRTPGTLVHGGSN-LTIVRVTAGLEFNRPIRPT 540

Query: 541 WSGTGGLYFQRAGARDEQGNPIIMDTIKCPLTASGNADDNMLLAKLEGVYTGSGDHGSSM 600
           WSGT GLYFQRAGA+DE+G PI+ D IKCPLTASGNA DNMLLAKLEGVYTGSGDHGSSM
Sbjct: 541 WSGTAGLYFQRAGAQDEKGEPILKDNIKCPLTASGNAVDNMLLAKLEGVYTGSGDHGSSM 600

Query: 601 FVLSMEQGLPFLPEWLCFNRVNARARAGMKVGPSQLLLSLSEGHVVGKFCPHEAFAIGGT 660
           FVLSMEQGLPFLPEWLCFNRVNARAR GM++G SQLLLSLS GHVVG FCPHEAFAIGGT
Sbjct: 601 FVLSMEQGLPFLPEWLCFNRVNARARTGMEIGFSQLLLSLSGGHVVGNFCPHEAFAIGGT 660

Query: 661 NSVRGYEEGAVGSGRSYTVGCGEISFPLFAPVEGVIFADYGTDLGSGSSVPGDPAGARMK 720
           NSVRGYEEGAVGSGRSY VGCGE+SFPLF PVEGV FADYGTDLGSG+SV GDPAGARMK
Sbjct: 661 NSVRGYEEGAVGSGRSYAVGCGELSFPLFGPVEGVFFADYGTDLGSGASVLGDPAGARMK 720

Query: 721 PGSGYGYGFGIRLDSPLGPLRLEYAFNDKGAKRFHFGVGHRN 763
            GSG+GYGFGIRL+SPLGPLRLEYAFNDK  KRFHFGVGHRN
Sbjct: 721 TGSGFGYGFGIRLESPLGPLRLEYAFNDKSEKRFHFGVGHRN 757

BLAST of HG10001152 vs. ExPASy TrEMBL
Match: A0A5A7TD95 (Outer envelope protein 80 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold206G00490 PE=4 SV=1)

HSP 1 Score: 1347.4 bits (3486), Expect = 0.0e+00
Identity = 677/762 (88.85%), Postives = 707/762 (92.78%), Query Frame = 0

Query: 1   MPPNDDIVFTSRATLRIPHFPPSSSHSSFRFGFRNLASHFDQSCNSIFQFIDSVKRGSKL 60
           MPPNDDIVFTSR+TLRIPHFPPS+SHSSFRF FRNLAS FDQSCNSI QFIDSVKRGSK 
Sbjct: 1   MPPNDDIVFTSRSTLRIPHFPPSTSHSSFRFCFRNLASQFDQSCNSISQFIDSVKRGSKS 60

Query: 61  THFNHSISHFWPPALPLCASKKVTQQENSVNRRASWTWGAMFVDKSPLICSASMSLIQSD 120
           +HFNHS    WPP LP C+SKKVTQ+ENS +RRASW WG++FV+KSPLIC ASMSLIQSD
Sbjct: 61  SHFNHSFPQLWPPTLPFCSSKKVTQRENSSSRRASWNWGSVFVEKSPLICLASMSLIQSD 120

Query: 121 VSNKSDSGTCTEHQGKRRGMEDKTTGLVGKSTLLCSASLALTRPDELSQSVGSESKELPQ 180
           +S+KS+S    E  G+RRGMEDK+TGLVGKS+LLCSASLAL R DE SQS GSE+KELPQ
Sbjct: 121 MSSKSES----EDSGRRRGMEDKSTGLVGKSSLLCSASLALARSDESSQSGGSENKELPQ 180

Query: 181 KGYSAGRLDEERVLISEVLVRNKDGEELERKDLELEAFIALKASRPNSALTVREVQEDVH 240
           KGYSA R+DEERVLISEVLVRNKDGEELERKDLELE F ALKASRPNSALTVREVQEDVH
Sbjct: 181 KGYSAARVDEERVLISEVLVRNKDGEELERKDLELEVFTALKASRPNSALTVREVQEDVH 240

Query: 241 RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFRHGYVK 300
           RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFR GY K
Sbjct: 241 RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFRDGYGK 300

Query: 301 VVNLRHLDEVISSINSWYGERGLFGGVSAVDILSGGILSLQVSEAEVNNTSIRFLDKKTG 360
           VVNLRHLDEVISSINSWYGERGLFG VSAVDILSGGILSLQVSEAEVNN SIRFLDKKTG
Sbjct: 301 VVNLRHLDEVISSINSWYGERGLFGRVSAVDILSGGILSLQVSEAEVNNISIRFLDKKTG 360

Query: 361 EPISGNTRPETILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVDI 420
           EPI GNTRPETILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVDI
Sbjct: 361 EPIPGNTRPETILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVDI 420

Query: 421 LMNVVERPSGGFSAGGGLSCGTTGGGGPLAALIGSFAYSHRNLFGRNQKLHVSLERGQVD 480
           LMNVVERP GGFSAGGGLSCG+TGG G L+ LIGS AYSHRNLFGRNQKLHVSLE+GQVD
Sbjct: 421 LMNVVERPGGGFSAGGGLSCGSTGGAGLLSTLIGSLAYSHRNLFGRNQKLHVSLEKGQVD 480

Query: 481 FTFRINYTDPWIEGDDKRTSRTIMVQNSRTPGTLVHGGSSILTIGRVTAGLEFNRPIRPS 540
            TFRINYTDPWIEGDDKRTSRTIMVQNSRTPGTLVHGGS+ LTI RVTAGLEFNRPIRP+
Sbjct: 481 STFRINYTDPWIEGDDKRTSRTIMVQNSRTPGTLVHGGSN-LTIVRVTAGLEFNRPIRPT 540

Query: 541 WSGTGGLYFQRAGARDEQGNPIIMDTIKCPLTASGNADDNMLLAKLEGVYTGSGDHGSSM 600
           WSGT GLYFQRAGA++E+G PI+ D IKCPLTASGNA DNMLLAKLEGVYTGSGDHGSSM
Sbjct: 541 WSGTAGLYFQRAGAQNEKGEPILKDNIKCPLTASGNAVDNMLLAKLEGVYTGSGDHGSSM 600

Query: 601 FVLSMEQGLPFLPEWLCFNRVNARARAGMKVGPSQLLLSLSEGHVVGKFCPHEAFAIGGT 660
           FVLSMEQGLPFLPEWLCFNRVNARAR  M+VG +QLLLSLS GHVVG FCPHEAFAIGGT
Sbjct: 601 FVLSMEQGLPFLPEWLCFNRVNARARTSMEVGFAQLLLSLSGGHVVGNFCPHEAFAIGGT 660

Query: 661 NSVRGYEEGAVGSGRSYTVGCGEISFPLFAPVEGVIFADYGTDLGSGSSVPGDPAGARMK 720
           NSVRGYEEGAVGSGRSY VGCGEISFPLF PVEGV FADYGTDLGSG+SV GDPAGARMK
Sbjct: 661 NSVRGYEEGAVGSGRSYAVGCGEISFPLFGPVEGVFFADYGTDLGSGASVLGDPAGARMK 720

Query: 721 PGSGYGYGFGIRLDSPLGPLRLEYAFNDKGAKRFHFGVGHRN 763
            GSGYGYGFGIRL+SPLGPLRLEYAFNDK AKRFHFGVGHRN
Sbjct: 721 TGSGYGYGFGIRLESPLGPLRLEYAFNDKSAKRFHFGVGHRN 757

BLAST of HG10001152 vs. TAIR 10
Match: AT5G19620.1 (outer envelope protein of 80 kDa )

HSP 1 Score: 960.7 bits (2482), Expect = 7.1e-280
Identity = 508/765 (66.41%), Postives = 580/765 (75.82%), Query Frame = 0

Query: 4   NDDIVFTSRATLRIPHFPPSSSHS---SFRFGFRNLASHFDQSCNSIFQFIDSVKRGSKL 63
           NDD+ F+S +++RI    P   HS   + +   +   SH   + NS+ Q + S+K     
Sbjct: 5   NDDVRFSS-SSIRIHSPSPKEQHSLLTNLQSCSKTFVSHLSNTRNSLNQMLQSLK----- 64

Query: 64  THFNHSISHFWPPALPLCASKKVTQQENSVNRRASWTWGAMFVDKSPLICSASMSLIQSD 123
                  +   PP   +      TQ  NSV +        + + KS  I   S+SLIQS 
Sbjct: 65  -------NRHTPPPRSVRRPNLPTQMLNSVTQ--------LMIGKSSPI---SLSLIQST 124

Query: 124 VSNKSDSGTCTEHQGKRRGMEDKTTGLVGKSTLLCSASLALTRPDELSQSVGSESKELPQ 183
             N S+S    E+    RG+          S LLC ASL+LTRP+E +QSV  +     Q
Sbjct: 125 QFNWSESR--DENVETIRGL---------SSPLLCCASLSLTRPNESTQSVEGKDTVQQQ 184

Query: 184 KGYSAGRLDEERVLISEVLVRNKDGEELERKDLELEAFIALKASRPNSALTVREVQEDVH 243
           KG+S  R  EERVLISEVLVR KDGEELERKDLE+EA  ALKA R NSALT+REVQEDVH
Sbjct: 185 KGHSVSRNAEERVLISEVLVRTKDGEELERKDLEMEALAALKACRANSALTIREVQEDVH 244

Query: 244 RIINSGYFYSCIPVAVDTRDGIRLIFQVEPNQEFQGLVCEGANVLPAKFLEEAFRHGYVK 303
           RII SGYF SC PVAVDTRDGIRL+FQVEPNQEF+GLVCE ANVLP+KF+ EAFR G+ K
Sbjct: 245 RIIESGYFCSCTPVAVDTRDGIRLMFQVEPNQEFRGLVCENANVLPSKFIHEAFRDGFGK 304

Query: 304 VVNLRHLDEVISSINSWYGERGLFGGVSAVDILSGGILSLQVSEAEVNNTSIRFLDKKTG 363
           V+N++ L+E I+SIN WY ERGLFG VS +D LSGGI+ LQV+EAEVNN SIRFLD+KTG
Sbjct: 305 VINIKRLEEAITSINGWYMERGLFGIVSDIDTLSGGIVRLQVAEAEVNNISIRFLDRKTG 364

Query: 364 EPISGNTRPETILRQLTTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQPAADAGKVDI 423
           EP  G T PETILRQLTTKKGQVYSMLQGKRD +TVL MGIMEDVSIIPQPA D+GKVD+
Sbjct: 365 EPTKGKTSPETILRQLTTKKGQVYSMLQGKRDVDTVLAMGIMEDVSIIPQPAGDSGKVDL 424

Query: 424 LMNVVERPSGGFSAGGGLSCGTTGGGGPLAALIGSFAYSHRNLFGRNQKLHVSLERGQVD 483
           +MN VERPSGGFSAGGG+S G T   GPL+ LIGSFAYSHRNLFGRNQKL+VSLERGQ+D
Sbjct: 425 IMNCVERPSGGFSAGGGISSGIT--SGPLSGLIGSFAYSHRNLFGRNQKLNVSLERGQID 484

Query: 484 FTFRINYTDPWIEGDDKRTSRTIMVQNSRTPGTLVHGG---SSILTIGRVTAGLEFNRPI 543
             FRINYTDPWIEGDDKRTSR+IMVQNSRTPG LVHG    +S LTIGRVTAG+E++RP 
Sbjct: 485 SIFRINYTDPWIEGDDKRTSRSIMVQNSRTPGNLVHGNQPDNSSLTIGRVTAGVEYSRPF 544

Query: 544 RPSWSGTGGLYFQRAGARDEQGNPIIMDTIKCPLTASGNADDNMLLAKLEGVYTGSGDHG 603
           RP W+GT GL FQ AGARDEQGNPII D    PLTASG   D  +LAKLE +YTGSGD G
Sbjct: 545 RPKWNGTAGLIFQHAGARDEQGNPIIKDFYSSPLTASGKPHDETMLAKLESIYTGSGDQG 604

Query: 604 SSMFVLSMEQGLPFLPEWLCFNRVNARARAGMKVGPSQLLLSLSEGHVVGKFCPHEAFAI 663
           S+MF  +MEQGLP LPEWLCFNRV  RAR G+ +GP++ L SLS GHVVGKF PHEAF I
Sbjct: 605 STMFAFNMEQGLPVLPEWLCFNRVTGRARKGIHIGPARFLFSLSGGHVVGKFSPHEAFVI 664

Query: 664 GGTNSVRGYEEGAVGSGRSYTVGCGEISFPLFAPVEGVIFADYGTDLGSGSSVPGDPAGA 723
           GGTNSVRGYEEGAVGSGRSY VG GE+SFP+  PVEGVIF DYGTD+GSGS+VPGDPAGA
Sbjct: 665 GGTNSVRGYEEGAVGSGRSYVVGSGELSFPVRGPVEGVIFTDYGTDMGSGSTVPGDPAGA 724

Query: 724 RMKPGSGYGYGFGIRLDSPLGPLRLEYAFNDKGAKRFHFGVGHRN 763
           R+KPGSGYGYG G+R+DSPLGPLRLEYAFND+ A RFHFGVG RN
Sbjct: 725 RLKPGSGYGYGLGVRVDSPLGPLRLEYAFNDQHAGRFHFGVGLRN 732

BLAST of HG10001152 vs. TAIR 10
Match: AT3G44160.1 (Outer membrane OMP85 family protein )

HSP 1 Score: 245.7 bits (626), Expect = 1.2e-64
Identity = 131/320 (40.94%), Postives = 183/320 (57.19%), Query Frame = 0

Query: 448 PLAALIGSFAYSHRNLFGRNQKLHVSLERGQVDFTFRINYTDPWIEGDDKRTSRTIMVQN 507
           PL+ +IGS    H NLFG ++KL VS ++G  D    + +  P  E    R  +   +Q+
Sbjct: 39  PLSLVIGSLCIKHPNLFGGSEKLDVSWDKGLYDSNVLVAFRRPRPEW---RPQQCFFIQH 98

Query: 508 SRTPGTLVHG---------GSSILTIGRVTAGLEFNRPIRPSWSGTGGLYFQRAGARDEQ 567
           S +P   VHG         GS  + + ++  GL+ + P    WS T  + F+     ++ 
Sbjct: 99  SLSPEIGVHGTPVDNFSRSGSGGVNLSKLALGLDLSEPASSKWSSTTSIKFEHVRPINDD 158

Query: 568 GNPIIMDTIKCPLTASGNADDNMLLAKLEGVYTGSGDHGSSMFVLSMEQGLPFLPEWLCF 627
           G  I  D    P+T SGN  D+M++ K E  +  + D G S F + +EQG+P + +WL F
Sbjct: 159 GRAITRDLDGFPITCSGNTHDSMVVLKQESRFAKATDQGLSHFSMQIEQGIPVVSKWLIF 218

Query: 628 NRVNARARAGMKVGPSQLLLSLSEGHVVGKFCPHEAFAIGGTNSVRGYEEGAVGSGRSYT 687
           NR    A  G++ GP+ LL SL+ G +VG   P++AFAIGG  SVRGY EGAVGSGRS  
Sbjct: 219 NRFKFVASKGVRFGPAFLLASLTGGSIVGDMAPYQAFAIGGLGSVRGYGEGAVGSGRSCL 278

Query: 688 VGCGEISFPLFAPVEGVIFADYGTDLGSGSSVPGDPAGARMKPGSGYGYGFGIRLDSPLG 747
           V   E++ PL    EG IF D GTDLGS   VPG+P+  + KPG GYG+G+G+R  SPLG
Sbjct: 279 VANTELALPLNKMTEGTIFLDCGTDLGSSRLVPGNPSMRQGKPGFGYGFGYGLRFKSPLG 338

Query: 748 PLRLEYAFNDKGAKRFHFGV 759
            L+++YA N    K  +FGV
Sbjct: 339 HLQVDYAINAFNQKTLYFGV 355

BLAST of HG10001152 vs. TAIR 10
Match: AT3G48620.1 (Outer membrane OMP85 family protein )

HSP 1 Score: 161.8 bits (408), Expect = 2.2e-39
Identity = 104/314 (33.12%), Postives = 151/314 (48.09%), Query Frame = 0

Query: 491 WIEGDDKRTSRTIMVQNSRTPGTLVHGGSSILTIGRVTAGLEFNRPIRPSWSGTGGLYFQ 550
           W+ GD       + +   R   +   GG   + + ++  GL+ + P    WS T  + F+
Sbjct: 15  WLLGD-------LDLNGVRNSASSYSGG---VNLSKLAVGLDLSEPASSKWSSTTSVKFE 74

Query: 551 RAG-----------------ARDEQGNPIIMDTIKCPLTASGNADDNMLLAKLEGVYTGS 610
             G                  R ++   I  + ++  L  SGN  D+M++ K E  +  +
Sbjct: 75  VPGHYLLQTLYMCVRLTMTDGRYQRSGRISYN-MQNSLLCSGNTHDSMVVLKQESRFAKA 134

Query: 611 GDHGSSMFVLSMEQGLPFLPEWLCFNRVNARARAGMKVGPSQLLLSLSEGHVVGKFCPHE 670
            D G S F + +EQG+P +  WL FNR    A  G++ GP+  L SL+ G +VG   P++
Sbjct: 135 TDQGLSHFSMQIEQGIPVVSNWLIFNRFKFVASKGVRFGPAFPLASLTGGSIVGDMTPYQ 194

Query: 671 AFAIGGTNSVRGYEEGAVGSGRSYTVGCGEISFPLFAPVEGVIFADYGTDLGSGSSVP-- 730
           AFAIGG  SVRGY E AVGSGRS  V   E++  +    EG IF D GTDLGS   VP  
Sbjct: 195 AFAIGGLGSVRGYGEVAVGSGRSCLVANTELANKM---TEGTIFLDCGTDLGSSRLVPVS 254

Query: 731 ---------------------------GDPAGARMKPGSGYGYGFGIRLDSPLGPLRLEY 759
                                      G+P+  + KPG GYG+G+G+R  SPLG L+++Y
Sbjct: 255 SLYLLRTRTIKKLLRHENDKAVAELVSGNPSLRQGKPGFGYGFGYGLRFKSPLGHLQVDY 314

BLAST of HG10001152 vs. TAIR 10
Match: AT3G46740.1 (translocon at the outer envelope membrane of chloroplasts 75-III )

HSP 1 Score: 129.8 bits (325), Expect = 9.3e-30
Identity = 131/510 (25.69%), Postives = 210/510 (41.18%), Query Frame = 0

Query: 302 VNLRHLDEVISSINSWYGERGL-------FGGVSAVDILSGGILSLQVSEAEVNNTSIRF 361
           V+ R L  +   +  WY + G        FG ++  +++       +V E ++    I+F
Sbjct: 320 VSARLLQRIRDRVQKWYHDEGYACAQVVNFGNLNTKEVV------CEVVEGDITQLVIQF 379

Query: 362 LDKKTGEPISGNTRPETILRQL--TTKKGQVYSMLQGKRDAETVLTMGIMEDVSIIPQP- 421
            D K G  + GNT+   + R+L    ++G V+++  GK+    + ++G+  ++ + P+P 
Sbjct: 380 QD-KLGNVVEGNTQVPVVRRELPKQLRQGYVFNIEAGKKALSNINSLGLFSNIEVNPRPD 439

Query: 422 -AADAG-KVDILMNVVERPSGGFSAGGGLSCGTTGGGGPLAALI---GSFAYSHRNLFGR 481
              + G  V+I +  +E+ S   S    +  G   GG P  A     GS  + HRNL G 
Sbjct: 440 EKNEGGIIVEIKLKELEQKSAEVSTEWSIVPGR--GGAPTLASFQPGGSVTFEHRNLQGL 499

Query: 482 NQKLHVSLERG-----QVDFTFRINYTDPWIEGDDKRTSRTIMVQ--NSRTPGTLVHGGS 541
           N+ L  S+        Q D +F++ Y  P+++G     +RT      NSR    +  GG 
Sbjct: 500 NRSLMGSVTTSNFLNPQDDLSFKLEYVHPYLDGVYNPRNRTFKTSCFNSRKLSPVFTGGP 559

Query: 542 SI-----LTIGRVTAGLEFNRPIRPSWSGTGGLYFQRAGARDE------QGNPII----M 601
            +     + + R                 T GL  +    RDE       G  ++    +
Sbjct: 560 GVEEVPPIWVDRAGVKANITENFTRQSKFTYGLVMEEITTRDESSHIAANGQRLLPSGGI 619

Query: 602 DTIKCPLTASGNADDNMLLAKL----EGVYTGSGDHGSSMFVLSMEQGLPFLPEWLCFNR 661
                P T SG   D M   +     +     +G       V  ++QGL    ++  FNR
Sbjct: 620 SADGPPTTLSGTGVDRMAFLQANITRDNTKFVNGAVVGQRTVFQVDQGLGIGSKFPFFNR 679

Query: 662 ----------VNARARAGMKVGPSQLLLSLSEGHVVGKFCPHEAFAIGGTNSVRGYEEGA 721
                     +    +   K  P  L+L    G  VG    ++AF +GG  SVRGY  G 
Sbjct: 680 HQLTMTKFIQLREVEQGAGKSPPPVLVLHGHYGGCVGDLPSYDAFVLGGPYSVRGYNMGE 739

Query: 722 VGSGRSYTVGCGEISFPLFAPVEGVIFADYGTDLGSGSSVPGDPAGARMKPGSGYGYGFG 758
           +G+ R+      EI  P+        F ++G DLGS   V G+P     + G G  YG G
Sbjct: 740 LGAARNIAEVGAEIRIPV-KNTHVYAFVEHGNDLGSSKDVKGNPTAVYRRTGQGSSYGAG 799

BLAST of HG10001152 vs. TAIR 10
Match: AT4G09080.1 (Outer membrane OMP85 family protein )

HSP 1 Score: 72.4 bits (176), Expect = 1.8e-12
Identity = 47/131 (35.88%), Postives = 63/131 (48.09%), Query Frame = 0

Query: 630 KVGPSQLLLSLSEGHVVGKFCPHEAFAIGGTNSVRGYEEGAVGSGRSYTVGCGEISFPLF 689
           K  P  L+L    G  +G    ++ FA+GG NSVRGY  G +G+ ++      EI  P+ 
Sbjct: 268 KPQPPVLVLHGRYGGCIGDLPSYDVFALGGPNSVRGYSMGELGAAKNILELGAEIRIPV- 327

Query: 690 APVEGVIFADYGTDLGSGSSVPGDPAGARMKPGSGYGYGFGIRLDSPLGPLRLEYAFNDK 749
                  FA++G DLGS   V G+P G   K G G  YG G++    LG +R EY     
Sbjct: 328 KNTHVYAFAEHGNDLGSSKDVKGNPTGLYRKMGHGSSYGLGVK----LGMVRAEYTVRHN 387

Query: 750 ---GAKRFHFG 758
              GA    FG
Sbjct: 388 RGTGALFLRFG 393

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038901490.10.0e+0094.88outer envelope protein 80, chloroplastic [Benincasa hispida][more]
XP_023532612.10.0e+0091.60outer envelope protein 80, chloroplastic [Cucurbita pepo subsp. pepo][more]
XP_022957959.10.0e+0091.08outer envelope protein 80, chloroplastic [Cucurbita moschata][more]
XP_022995029.10.0e+0091.21outer envelope protein 80, chloroplastic [Cucurbita maxima][more]
TYJ96147.10.0e+0088.98outer envelope protein 80 [Cucumis melo var. makuwa][more]
Match NameE-valueIdentityDescription
Q9C5J81.0e-27866.41Outer envelope protein 80, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=OEP8... [more]
A2X2085.3e-24872.06Outer envelope protein 80, chloroplastic OS=Oryza sativa subsp. indica OX=39946 ... [more]
Q6H7M77.0e-24871.89Outer envelope protein 80, chloroplastic OS=Oryza sativa subsp. japonica OX=3994... [more]
Q5PP511.6e-6340.94Outer envelope protein 39, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=P39 ... [more]
F4JF353.1e-3833.12Outer envelope protein 36, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=P36 ... [more]
Match NameE-valueIdentityDescription
A0A6J1H3L60.0e+0091.08outer envelope protein 80, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111... [more]
A0A6J1K4I60.0e+0091.21outer envelope protein 80, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC11149... [more]
A0A5D3BAJ70.0e+0088.98Outer envelope protein 80 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffo... [more]
A0A0A0KZC10.0e+0088.58Omp85 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G290840 PE=4 ... [more]
A0A5A7TD950.0e+0088.85Outer envelope protein 80 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffo... [more]
Match NameE-valueIdentityDescription
AT5G19620.17.1e-28066.41outer envelope protein of 80 kDa [more]
AT3G44160.11.2e-6440.94Outer membrane OMP85 family protein [more]
AT3G48620.12.2e-3933.12Outer membrane OMP85 family protein [more]
AT3G46740.19.3e-3025.69translocon at the outer envelope membrane of chloroplasts 75-III [more]
AT4G09080.11.8e-1235.88Outer membrane OMP85 family protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3D2.40.160.50membrane protein fhac: a member of the omp85/tpsb transporter familycoord: 430..761
e-value: 3.1E-45
score: 157.1
NoneNo IPR availableGENE3D3.10.20.310membrane protein fhaccoord: 269..347
e-value: 2.2E-5
score: 26.2
NoneNo IPR availableGENE3D3.10.20.310membrane protein fhaccoord: 189..268
e-value: 1.5E-13
score: 52.6
NoneNo IPR availableGENE3D3.10.20.310membrane protein fhaccoord: 348..428
e-value: 1.7E-14
score: 55.6
NoneNo IPR availablePANTHERPTHR12815:SF32OUTER ENVELOPE PROTEIN 80, CHLOROPLASTICcoord: 118..760
IPR000184Bacterial surface antigen (D15)PFAMPF01103Omp85coord: 462..761
e-value: 5.4E-43
score: 147.7
IPR039910Surface antigen D15-likePANTHERPTHR12815SORTING AND ASSEMBLY MACHINERY SAMM50 PROTEIN FAMILY MEMBERcoord: 118..760

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10001152.1HG10001152.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009658 chloroplast organization
biological_process GO:0009793 embryo development ending in seed dormancy
biological_process GO:0045040 protein insertion into mitochondrial outer membrane
cellular_component GO:0009941 chloroplast envelope
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0009527 plastid outer membrane
cellular_component GO:0001401 SAM complex
cellular_component GO:0019867 outer membrane