HG10017408 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10017408
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionDNA-directed RNA polymerase subunit
LocationChr03: 14026452 .. 14034077 (-)
RNA-Seq ExpressionHG10017408
SyntenyHG10017408
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTGTGATTTGATCAAAATTTTGATTTTACAGATTCAGAATAAGAAACTGTCATCCCATTCAATCGAATTGGGATGCCCCGGATCTGACATGTGGCTTGCTTGGTAGGAGTAACATGAAGCTCAGAATTTTGGGTGTAGTCAATACTCCCATCCCAAATAAAAGGGGAATTGGTCTATGGTCGATTCAGTAATAAATAGGAATTTTTAGTTCTACCTCGTGAAAAAAAAACTTATTTCTTTTGTGTAATTAACCATTCCTTTTTCTTTCAGAATGAAATTATGTTCAAATAAGCAAATATGTCATGGTTAAAGGAGTCTATCCATCGCATATAGGCTTTAAGGACATCGTGGCATAACCGTCGAGGTGAAGTTGGGACCTAAAAGATCGAATGGAACGATACAAAGACAAGTAAATCCCTTATGAATTCCAAGGTACTCCTTATTAATTTTAATTAAGGGGATTCGTCATTCCAAGGGAAATAGACTACTCAAAAATTTCACATTTATTTATTTTTTATGTTGTAATTGATATATTGAATAAAGAACTCAGAAAATTGAAATAAAAAAGCAAAAGAATCAATGAAATGTTGTTTGGTCTTAATTGAAACTTGAGTAAAGAGTAGTTGGGGGGTTTTATAGAATTTGAAAGTGGGAACTCCTTTATTTGATGTAACTACTTGAGCCGGATGAAAAGAAACTTTCACGTCCGATTTTGAAAGGGGGAGATCCTATAGTATCCTATCCAAATTTTTCTTTTGCTAGGCCCATAGCTAAAAAACCTACTTTCTTACGATTACGGGGTTCATTCGAATATGAAATCCAATCCTGGAAATACAGCATCCCACTTTTTTTTACTACCCAAGGCTTCGATACATTTCGAAATCGAGAAATTTCTACCGGAGCAGGTGCTATACGAGAACAATTAGCCGATCTGGATTTGCGACTTATTATAGATTATTCGTTGGTAGAATGGAAAGAATTAGGCGAAGAAGGACCCGCGGGTAATGAATGGGAAGATCGGAAGGTTGGAAGAAGAAAGGATTTTTTGGTTAGACGTATGGAATTAGCTAAGCATTTTATTCGAACAAATATAGAACCCGAATGGATGGTTTTATGTCTATTACCTGTTCTGCCTCCCGAGCTGAGACCGATCATTCAGATAGATGGGGGTAAACTAATGAGCTCGGATATTAATGAACTCTATAGAAGAGTTATCTATCGGAACAATACTCTTATTGATCTATTAACAACAAGTAGATCTACGCCAGGAGAATTAGTAATGTGTCAGGAGAAATTGGTACAAGAAGCCGTGGATACACTTCTTGATAATGGAATCCGCGGACAACCAATGAGGGATGGCCATAATAAGGTTTACAAGTCGTTTTCCGATGTAATTGAAGGCAAAGAGGGAAGATTTCGGGAGACTCTGCTTGGCAAACGGGTCGATTATTCAGGACGTTCTGTGATTGTTGTAGGTCCCTCACTTTCATTACATCGATGTGGATTGCCTCGCGAAATCGCAATAGAGCTTTTCCAGACATTTTTAATTCGTGGGCTAATTCGACAACATTTTGCTTCGAACATAGGAGTTGCTAAGAGTAAAATTCGGGAAAAAGAACCCATTGTATGGGAAATACTTCAGGAAGTTATGCAGGGGCATCCCGTATTGCTAAATAGAGCGCCTACTCTGCATAGATTAGGCATCCAGGCCTTCCAACCTATTTTAGTCGAAGGACGTGCTATTTGTTTACATCCATTAGTTTGTAAGGGATTCAATGCAGACTTTGATGGGGATCAAATGGCTGTTCATGTACCTTTATCTTTGGAGGCTCAAGCAGAGGCGCGTTTACTTATGTTTTCTCATATGAATCTCTTGTCTTCAGCTATTGGGGATCCCATTTCTGTACCAACTCAAGATATGCTTATTGGACTCTATGTATTAACGAGCGGGAATCGACGAGGTATTTGTGCAAATCGGTATAATCCATCTAATCGCAAAAATCATAAAAATGAAAAAATTTACAATAATAACTATAAATATACGAAAGAACCCTTTTTTTGTAATTCCTATGATGCAATTGGAGCTTATCGCCAGAAAAGGATCAATTTAGATAGTCCTTTGTGGCTCCGATGGCGACTAGATCAACGCGTTATTGCTTCAAGAGAAGCCCCTATCGAAGTTCACTATGAATCTTTGGGTACCCATCATGAGATTTATGGGTCCTATCTAATAGTAAAAAGTATAAAAAAAGAAATTCTTTGTATATACATTCGAACCACTGTTGGTCATATTTCTCTTTATCGAGAAATCGAAGAGGCTATACAAGGATTTTGCCGGGCCTGCTCATATGGTACCTAATCTTCTGGTATCTCACTCAGCTAAGTAATTATATGACACCGGTATGAATTTTGTTCTCTCCGGTTCAACTAGGACCATAGGTCCTGAATTTCTACGCGAATCAAGATCGAGAAAAGGAAGTTTTCTAATCATTGACTCAAACCTATTGTCGAACCCCACTCAGCGTAATATGGAGGTACTTATGGCGGAACGGGCCGATCTGGTCTTTCACAATAAAGTGATAGATGGAACTGCCATTAAACGACTTATTAGCAGATTAATAGATCACTTCGGAATGGCATATACATCACACATCCTGGATCAACTAAAGACTCTGGGGTTCCAGCAGGCCACTGCTACATCCATTTCATTAGGAATTGATGATCTTTTAACAATCCCTTCTAAGGGATGGCTAGTCCAAGATGCTGAACAACAAAGTTTTATTTTGGAAAAACACCATCATTATGGAAACGTACACGCAGTAGAAAAATTACGCCAGTCCATTGAGGTATGGTATGCTACAAGTGAATATTTGCGACAAGAAATGAATCCTAATTTTAGGATGACTGACCCTTTTAATCCAGTCCATATAATGTCTTTTTCGGGAGCTCGAGGAAATGCATCTCAAGTACACCAATTAGTAGGTATGAGAGGATTAATGTCAGATCCACAAGGACAAATGATTGATTTACCCATTCAAAGCAATTTACGTGAGGGGCTGTCCTTAACGGAATATATAATTTCTTGCTACGGAGCGCGGAAAGGAGTTGTGGATACTGCTGTACGAACATCGGATGCTGGATATCTTACACGCAGACTTGTTGAAGTAGTTCAACACATTGTTGTACGTAGAACCGATTGTGGAACCATCCGAGGGATTTCAGTGAGTCCCGGAAATAGGATGATTCCGGAAAGAATTTTTATCCAAACATTAATTGGTCGTGTATTAGCGGACGATATATATATGGGCCCGCGATGCATTGGCATTCGAAATCAAGATATTGGTATTGGACTTATCAATCGATTCATAACCTTTCAAACACAACCAATATCTATTCGGACTCCCTTTACTTGTAGGAGTACATCTTGGATCTGCCGATTATGTTATGGACGAAGTCCTACTCACGGGGACCTGGTCGAATTGGGAGAAGCCGTAGGTATTATTGCGGGTCAATCTATTGGAGAACCGGGTACTCAACTAACATTAAGAACTTTTCATACCGGCGGAGTATTCACAGGGGGGACTGCAGAACATATACGAGCTCCTTCTAATGGCAAAATAAAATTCAATGAGGATTTGGTTCATCCCACACGTACACGCCATGGGCATCCTGCTTTTTTATGTTATATAGACTTGTATGTAACTATTGAGAGTGAGGATATTATACATAACGTCACTATTCCACCAAAAAGTTTACTTTTAGTTCAAAACGATCAATATGTAGAATCAGAACAAGTGATTGCTGAGATTCGCGCGGGAACATACACTCTTAATTTAAAAGAGAGGGTTCGAAAACATATTTATTCTGACTCAGAGGGAGAAATGCATTGGAGTACCGACGTGTATCATGCACCCGAATTTACATATAGTAATGTCCATCTCTTACCAAAAACAAGTCATTTATGGATATTATCAGGAGGTTCGTGCGGATGCAGTGTAGTCCCTTTTTCACTCTACAAGGATCAAGATCAAATTAACGTTCATTCTCTTTCTGTCGAAAGAAGATATATTTCTAGCCTTTCAGTAAATAATGATAAAGTGAGACAAAAATTTTATGGTCTGGATCTCTCTGGTAAAAACGAAAGCGGAATTCCTTATTATTCAGAACTTAATCCAATCCTATGTACGGGTCAATCTAATCTCACATATCCTGCTATTTTCCACGGTAATTCTGATTTATTGGCAAAGAGGCGAAGAAATGGATTCATCATTCAATTTGAATCACTTCAAGAACGAGAAAAAGAACTACCACCCCCTTCCGGTATTTCGATTGAAATACCCATAAATGGTATTTTTCGTAGAAATAGTATTCTTGCTTATTTCGATGATCCTCAATACAGAAGAAATAGTTCAGGAATTACTAAATATGGGACTATAGGGGTGCATTCAATCCTCAAAAAAGAGGATTTGATTGAGTATCGAGGAGTCAAAGACTTTAAGCCAAAATACCAAATGCAAATGAAAGTGGATCGATTTTTTTTCATTCCCGAGGAAGTGCATATTTTACCTGAATCTTCTTCCATAATGGTACGGAACAATAGTATCATTGGAGTAGCTACACGACTAACTTTAAGTATAAGAAGCCGAGTAGGCGGATTGGTCCGAGTGGAGAAAAAAAAAAAAAGGATTGAACTCAAAATATTTTCTGGAGATATACATTTTCCTGGAGAGATGGATAAGATATCCCGACACAATGGCATCTTGATACCACCTGAAAGAGTAAAAAAAAATTCTAAGAAATCAAAAAAATTGAAAAATTGGATTTATGTCCAGTGGATCACACCTACCAAGAAAAAGTATTTTGTTTTTGTTCGACCTGTAATCATATATGAACTAGCCGACGGTATAAATTTAGTAACACTTTTCCCACAGGATCTATTGCAAGAAAGGGATAATCTGGAGCTTAGAGTTGTCAATTATATCCTTTATGGAAATGGCAAACCAATTCGGGGAATTTCTGGCACAAGCATTCAATTAGTTCGGACTTGTTTATTGTTGAATTGGGACCGAGACAAAAAAAACTCTTCTATCGAAGAAGCGCGCGCTTCCTTTGTTGAAGTAACTACAAATGGTCTGGTTCGAAATTTCCTACGAATAGACTTAGTGAAATCCGATACTTCGTATATCAGAAAAAGGAAAGATCCGTCAGGTTCAGGATTGATCTTTAATAATGAGTCAGATCGCACCAATATCAATCCATTTTTTTCTATTTATTCCAAGACAAGGGTTCCACAATCTCCTAGTCAAAATCAAGGAACTATTCGTACGTTGTTGAATAGAAATAAAGAACGCCAATCTTTGATAATTTTGTCAGCATCGAATTGTTTGCAACTGGATCTATTCAACGATGTAAAAGATTATAATGTCATAAAAGAATCAAGTAAAAAGGATCCTCTAATTGAAATTAGGAATTCGTTAGGACCTTTAGGGGCAGCCCCTCAAATTGTGAATTTTTATTCATTTTATGACTTAATAACTCATAATCCGATCTCCCTAACTAAATATTTGCAACTTGACAATTTAAAACAGACTTTTCAAGTACTTAAATATTATTTAATGGATGAAAACGGGGGGATTTTTAATTCCGATCCATGCAGTAACATCGTTTTCAATACATTTAATTTGAATTGGCATTTTCTGCATCATAATTATCACCATAATTATTGTGAAGAAACACCTAAAAGAATTAGCCTTGGACAGTTTTTTTTTGAAAATGTATGTATAGCCAAAAATAGACCACACCTAAAATCGGGTCAAATTATAATTGTTCAAGTTGATTCTGTAGTAATAAGATCAGCTAAGCCTTATTTGGCCACTTCAGGAGCAACTGTTCATCGCCATTATGGCGAAATCCTTTACGAAGGAGATACCTTAGTTACATTTATATATGAAAAATCGCGATCTGGTGATATAACGCAGGGTCTTCCAAAAGTAGAACAAGTATTAGAAGTGCGTTCGATTGATTCAATATCGATGAGCCTAGAAAAGAGAATTGAGGGTTGGAACGAGCGTATAACAAGAATTCTTGGAATTCCTTGGGGATTTTTGATTGGTGCTGAACTAACGATAGTGCAAAGTCGTATCTCTTTGGTTAATAAGATCCAAAAGGTTTATCGATCCCAGGGGGTGGAGATCCATAATAGACATATCGAAATTATTGTACGTCAAATAACATCAAAAGTGTTGGTTTCAGAAGATGGAATGTCTAATGTTTTTTCACCTGGAGAACTCATTGGATTGTTGCGAGCGGAACGAACAGGGCGTGCTTTGGAAGAAGCGATCTGTTACCGAGCCGTATTATTGGGAATAACGAAAGCATCTCTAAATACTCAAAGTTTCATATCGGAAGCAAGTTTTCAAGAAACTGCTCGAGTTTTAGCAAAAGCCGCCCTCCGAGGTCGTATCGATTGGTTGAGAGGTTTGAAAGAGAACGTTGTTCTAGGAGGAATGATACCAGTTGGTACCGGATTCAGAGAATTAGCGCACCGTTCGAGGCAACATAACAATATTCCTTTAGAAACCCCCCAAAAAAATTTTTTCGAGGGGGAAATGAGAGATATTTTGTTCCACCACAAAGAATTATTTGATTTTTTCATTTCAACGAATTTACATGATACATCAGAACAAGCATTTCTAGGATTTAATGATTCATAAACGTGGAGTCATCCTGTTAACTAGCCGCGTTGATTTGGTAATAAGATAATAACGAATAGAAAAAAATTAATGGCTTGGATCGTGTATCAACAGTCAATCCCCGGTCCAGGTAGAGGATAAGGTTCCATGGGAACAATTATTTATTTCTATTTCAGCTCGTCTCTTTTTTTTAAGAAAAAAGAGAGGAAGTGGGTGGAGAAATGACAAGAAGATATTGGAACATCCATTTGGAAGAGATGATGGAAGCAGGAGTTCATTTTGGTCATGGTACTAGGAAATGGAATCCTAGAATGGCACCTTATATCTCTGCAAAACGTAAAGGTATTCATATTATAAATCTTACTAGAACTGCTCGTTTTTTATCAGAAGCTTGTGATTTAGTTTTTGATGCAGCAAGTAGGGGCAAACAATTCTTAATTGTTGGTACCAAAAATAAAGCAGCGGATTCAGTAGCCCGGGCTGCAACAAGGGCTCGGTGTCATTATGTTAATAAAAAATGGCTCGGGGGGATGTTAACAAATTGGTCTACTACAGAAACGAGACTTCATAAGTTCAGGGACTTGAGAACGGAACAAAAGACGGGGGGACTCAACCGTCTTCCGAAAAGGGATGCCGCTATGTTGAAGAGACAATTATCTCACTTGCAAACATATCTGGGCGGGATTAAATATATGACGGGGTTACCCGATATTGTAATAATCGTTGATCAGCAAGAAGAATACCGGGCTCTTCAAGAATGTATCACGTTGGGAATTCCAACTATTTGTTTAATTGATACAAATTGTGACCCCGATCTCGCAGATATTTCGATTCCAGCGAATGATGATGCTATAGCTTCAATCCGATTAATTCTTAACAAATTAGTATTTGCAATTTCTGAGGGTCGTTCTAGCTCTATACGAAATTCTTGA

mRNA sequence

ATGTGTGATTTGATCAAAATTTTGATTTTACAGATTCAGAATAAGAAACTGTCATCCCATTCAATCGAATTGGGATGCCCCGGATCTGACATGTGGCTTGCTTGGCCCATAGCTAAAAAACCTACTTTCTTACGATTACGGGGTTCATTCGAATATGAAATCCAATCCTGGAAATACAGCATCCCACTTTTTTTTACTACCCAAGGCTTCGATACATTTCGAAATCGAGAAATTTCTACCGGAGCAGGTGCTATACGAGAACAATTAGCCGATCTGGATTTGCGACTTATTATAGATTATTCGTTGGTAGAATGGAAAGAATTAGGCGAAGAAGGACCCGCGGGTAATGAATGGGAAGATCGGAAGGTTGGAAGAAGAAAGGATTTTTTGGTTAGACGTATGGAATTAGCTAAGCATTTTATTCGAACAAATATAGAACCCGAATGGATGGTTTTATGTCTATTACCTGTTCTGCCTCCCGAGCTGAGACCGATCATTCAGATAGATGGGGGTAAACTAATGAGCTCGGATATTAATGAACTCTATAGAAGAGTTATCTATCGGAACAATACTCTTATTGATCTATTAACAACAAGTAGATCTACGCCAGGAGAATTAGTAATGTGTCAGGAGAAATTGGTACAAGAAGCCGTGGATACACTTCTTGATAATGGAATCCGCGGACAACCAATGAGGGATGGCCATAATAAGGTTTACAAGTCGTTTTCCGATGTAATTGAAGGCAAAGAGGGAAGATTTCGGGAGACTCTGCTTGGCAAACGGGTCGATTATTCAGGACGTTCTGTGATTGTTGTAGGTCCCTCACTTTCATTACATCGATGTGGATTGCCTCGCGAAATCGCAATAGAGCTTTTCCAGACATTTTTAATTCGTGGGCTAATTCGACAACATTTTGCTTCGAACATAGGAGTTGCTAAGAGTAAAATTCGGGAAAAAGAACCCATTGTATGGGAAATACTTCAGGAAGTTATGCAGGGGCATCCCGTATTGCTAAATAGAGCGCCTACTCTGCATAGATTAGGCATCCAGGCCTTCCAACCTATTTTAGTCGAAGGACGTGCTATTTGTTTACATCCATTAGTTTGTAAGGGATTCAATGCAGACTTTGATGGGGATCAAATGGCTGTTCATGTACCTTTATCTTTGGAGGCTCAAGCAGAGGCGCGTTTACTTATGTTTTCTCATATGAATCTCTTGTCTTCAGCTATTGGGGATCCCATTTCTGTACCAACTCAAGATATGCTTATTGGACTCTATGTATTAACGAGCGGGAATCGACGAGGTATTTGTGCAAATCGGACCATAGGTCCTGAATTTCTACGCGAATCAAGATCGAGAAAAGGAAGTTTTCTAATCATTGACTCAAACCTATTGTCGAACCCCACTCAGCGTAATATGGAGGTACTTATGGCGGAACGGGCCGATCTGGTCTTTCACAATAAAGTGATAGATGGAACTGCCATTAAACGACTTATTAGCAGATTAATAGATCACTTCGGAATGGCATATACATCACACATCCTGGATCAACTAAAGACTCTGGGGTTCCAGCAGGCCACTGCTACATCCATTTCATTAGGAATTGATGATCTTTTAACAATCCCTTCTAAGGGATGGCTAGTCCAAGATGCTGAACAACAAAGTTTTATTTTGGAAAAACACCATCATTATGGAAACGTACACGCAGTAGAAAAATTACGCCAGTCCATTGAGGTATGGTATGCTACAAGTGAATATTTGCGACAAGAAATGAATCCTAATTTTAGGATGACTGACCCTTTTAATCCAGTCCATATAATGTCTTTTTCGGGAGCTCGAGGAAATGCATCTCAAGTACACCAATTAGTAGGTATGAGAGGATTAATGTCAGATCCACAAGGACAAATGATTGATTTACCCATTCAAAGCAATTTACGTGAGGGGCTGTCCTTAACGGAATATATAATTTCTTGCTACGGAGCGCGGAAAGGAGTTGTGGATACTGCTGTACGAACATCGGATGCTGGATATCTTACACGCAGACTTGTTGAAGTAGTTCAACACATTGTTGTACGTAGAACCGATTGTGGAACCATCCGAGGGATTTCAGTGAGTCCCGGAAATAGGATGATTCCGGAAAGAATTTTTATCCAAACATTAATTGGTCGTGTATTAGCGGACGATATATATATGGGCCCGCGATGCATTGGCATTCGAAATCAAGATATTGGTATTGGACTTATCAATCGATTCATAACCTTTCAAACACAACCAATATCTATTCGGACTCCCTTTACTTGTAGGAGTACATCTTGGATCTGCCGATTATGTTATGGACGAAGTCCTACTCACGGGGACCTGGTCGAATTGGGAGAAGCCGTAGGTATTATTGCGGGTCAATCTATTGGAGAACCGGAAAAAAGAGAGGAAGTGGGTGGAGAAATGACAAGAAGATATTGGAACATCCATTTGGAAGAGATGATGGAAGCAGGAGTTCATTTTGGTCATGGTACTAGGAAATGGAATCCTAGAATGGCACCTTATATCTCTGCAAAACGTAAAGGTATTCATATTATAAATCTTACTAGAACTGCTCGTTTTTTATCAGAAGCTTGTGATTTAGTTTTTGATGCAGCAAGTAGGGGCAAACAATTCTTAATTGTTGGTACCAAAAATAAAGCAGCGGATTCAGTAGCCCGGGCTGCAACAAGGGCTCGGTGTCATTATGTTAATAAAAAATGGCTCGGGGGGATGTTAACAAATTGGTCTACTACAGAAACGAGACTTCATAAGTTCAGGGACTTGAGAACGGAACAAAAGACGGGGGGACTCAACCGTCTTCCGAAAAGGGATGCCGCTATGTTGAAGAGACAATTATCTCACTTGCAAACATATCTGGGCGGGATTAAATATATGACGGGGTTACCCGATATTGTAATAATCGTTGATCAGCAAGAAGAATACCGGGCTCTTCAAGAATGTATCACGTTGGGAATTCCAACTATTTGTTTAATTGATACAAATTGTGACCCCGATCTCGCAGATATTTCGATTCCAGCGAATGATGATGCTATAGCTTCAATCCGATTAATTCTTAACAAATTAGTATTTGCAATTTCTGAGGGTCGTTCTAGCTCTATACGAAATTCTTGA

Coding sequence (CDS)

ATGTGTGATTTGATCAAAATTTTGATTTTACAGATTCAGAATAAGAAACTGTCATCCCATTCAATCGAATTGGGATGCCCCGGATCTGACATGTGGCTTGCTTGGCCCATAGCTAAAAAACCTACTTTCTTACGATTACGGGGTTCATTCGAATATGAAATCCAATCCTGGAAATACAGCATCCCACTTTTTTTTACTACCCAAGGCTTCGATACATTTCGAAATCGAGAAATTTCTACCGGAGCAGGTGCTATACGAGAACAATTAGCCGATCTGGATTTGCGACTTATTATAGATTATTCGTTGGTAGAATGGAAAGAATTAGGCGAAGAAGGACCCGCGGGTAATGAATGGGAAGATCGGAAGGTTGGAAGAAGAAAGGATTTTTTGGTTAGACGTATGGAATTAGCTAAGCATTTTATTCGAACAAATATAGAACCCGAATGGATGGTTTTATGTCTATTACCTGTTCTGCCTCCCGAGCTGAGACCGATCATTCAGATAGATGGGGGTAAACTAATGAGCTCGGATATTAATGAACTCTATAGAAGAGTTATCTATCGGAACAATACTCTTATTGATCTATTAACAACAAGTAGATCTACGCCAGGAGAATTAGTAATGTGTCAGGAGAAATTGGTACAAGAAGCCGTGGATACACTTCTTGATAATGGAATCCGCGGACAACCAATGAGGGATGGCCATAATAAGGTTTACAAGTCGTTTTCCGATGTAATTGAAGGCAAAGAGGGAAGATTTCGGGAGACTCTGCTTGGCAAACGGGTCGATTATTCAGGACGTTCTGTGATTGTTGTAGGTCCCTCACTTTCATTACATCGATGTGGATTGCCTCGCGAAATCGCAATAGAGCTTTTCCAGACATTTTTAATTCGTGGGCTAATTCGACAACATTTTGCTTCGAACATAGGAGTTGCTAAGAGTAAAATTCGGGAAAAAGAACCCATTGTATGGGAAATACTTCAGGAAGTTATGCAGGGGCATCCCGTATTGCTAAATAGAGCGCCTACTCTGCATAGATTAGGCATCCAGGCCTTCCAACCTATTTTAGTCGAAGGACGTGCTATTTGTTTACATCCATTAGTTTGTAAGGGATTCAATGCAGACTTTGATGGGGATCAAATGGCTGTTCATGTACCTTTATCTTTGGAGGCTCAAGCAGAGGCGCGTTTACTTATGTTTTCTCATATGAATCTCTTGTCTTCAGCTATTGGGGATCCCATTTCTGTACCAACTCAAGATATGCTTATTGGACTCTATGTATTAACGAGCGGGAATCGACGAGGTATTTGTGCAAATCGGACCATAGGTCCTGAATTTCTACGCGAATCAAGATCGAGAAAAGGAAGTTTTCTAATCATTGACTCAAACCTATTGTCGAACCCCACTCAGCGTAATATGGAGGTACTTATGGCGGAACGGGCCGATCTGGTCTTTCACAATAAAGTGATAGATGGAACTGCCATTAAACGACTTATTAGCAGATTAATAGATCACTTCGGAATGGCATATACATCACACATCCTGGATCAACTAAAGACTCTGGGGTTCCAGCAGGCCACTGCTACATCCATTTCATTAGGAATTGATGATCTTTTAACAATCCCTTCTAAGGGATGGCTAGTCCAAGATGCTGAACAACAAAGTTTTATTTTGGAAAAACACCATCATTATGGAAACGTACACGCAGTAGAAAAATTACGCCAGTCCATTGAGGTATGGTATGCTACAAGTGAATATTTGCGACAAGAAATGAATCCTAATTTTAGGATGACTGACCCTTTTAATCCAGTCCATATAATGTCTTTTTCGGGAGCTCGAGGAAATGCATCTCAAGTACACCAATTAGTAGGTATGAGAGGATTAATGTCAGATCCACAAGGACAAATGATTGATTTACCCATTCAAAGCAATTTACGTGAGGGGCTGTCCTTAACGGAATATATAATTTCTTGCTACGGAGCGCGGAAAGGAGTTGTGGATACTGCTGTACGAACATCGGATGCTGGATATCTTACACGCAGACTTGTTGAAGTAGTTCAACACATTGTTGTACGTAGAACCGATTGTGGAACCATCCGAGGGATTTCAGTGAGTCCCGGAAATAGGATGATTCCGGAAAGAATTTTTATCCAAACATTAATTGGTCGTGTATTAGCGGACGATATATATATGGGCCCGCGATGCATTGGCATTCGAAATCAAGATATTGGTATTGGACTTATCAATCGATTCATAACCTTTCAAACACAACCAATATCTATTCGGACTCCCTTTACTTGTAGGAGTACATCTTGGATCTGCCGATTATGTTATGGACGAAGTCCTACTCACGGGGACCTGGTCGAATTGGGAGAAGCCGTAGGTATTATTGCGGGTCAATCTATTGGAGAACCGGAAAAAAGAGAGGAAGTGGGTGGAGAAATGACAAGAAGATATTGGAACATCCATTTGGAAGAGATGATGGAAGCAGGAGTTCATTTTGGTCATGGTACTAGGAAATGGAATCCTAGAATGGCACCTTATATCTCTGCAAAACGTAAAGGTATTCATATTATAAATCTTACTAGAACTGCTCGTTTTTTATCAGAAGCTTGTGATTTAGTTTTTGATGCAGCAAGTAGGGGCAAACAATTCTTAATTGTTGGTACCAAAAATAAAGCAGCGGATTCAGTAGCCCGGGCTGCAACAAGGGCTCGGTGTCATTATGTTAATAAAAAATGGCTCGGGGGGATGTTAACAAATTGGTCTACTACAGAAACGAGACTTCATAAGTTCAGGGACTTGAGAACGGAACAAAAGACGGGGGGACTCAACCGTCTTCCGAAAAGGGATGCCGCTATGTTGAAGAGACAATTATCTCACTTGCAAACATATCTGGGCGGGATTAAATATATGACGGGGTTACCCGATATTGTAATAATCGTTGATCAGCAAGAAGAATACCGGGCTCTTCAAGAATGTATCACGTTGGGAATTCCAACTATTTGTTTAATTGATACAAATTGTGACCCCGATCTCGCAGATATTTCGATTCCAGCGAATGATGATGCTATAGCTTCAATCCGATTAATTCTTAACAAATTAGTATTTGCAATTTCTGAGGGTCGTTCTAGCTCTATACGAAATTCTTGA

Protein sequence

MCDLIKILILQIQNKKLSSHSIELGCPGSDMWLAWPIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTTQGFDTFRNREISTGAGAIREQLADLDLRLIIDYSLVEWKELGEEGPAGNEWEDRKVGRRKDFLVRRMELAKHFIRTNIEPEWMVLCLLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTLIDLLTTSRSTPGELVMCQEKLVQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSVIVVGPSLSLHRCGLPREIAIELFQTFLIRGLIRQHFASNIGVAKSKIREKEPIVWEILQEVMQGHPVLLNRAPTLHRLGIQAFQPILVEGRAICLHPLVCKGFNADFDGDQMAVHVPLSLEAQAEARLLMFSHMNLLSSAIGDPISVPTQDMLIGLYVLTSGNRRGICANRTIGPEFLRESRSRKGSFLIIDSNLLSNPTQRNMEVLMAERADLVFHNKVIDGTAIKRLISRLIDHFGMAYTSHILDQLKTLGFQQATATSISLGIDDLLTIPSKGWLVQDAEQQSFILEKHHHYGNVHAVEKLRQSIEVWYATSEYLRQEMNPNFRMTDPFNPVHIMSFSGARGNASQVHQLVGMRGLMSDPQGQMIDLPIQSNLREGLSLTEYIISCYGARKGVVDTAVRTSDAGYLTRRLVEVVQHIVVRRTDCGTIRGISVSPGNRMIPERIFIQTLIGRVLADDIYMGPRCIGIRNQDIGIGLINRFITFQTQPISIRTPFTCRSTSWICRLCYGRSPTHGDLVELGEAVGIIAGQSIGEPEKREEVGGEMTRRYWNIHLEEMMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTETRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIASIRLILNKLVFAISEGRSSSIRNS
Homology
BLAST of HG10017408 vs. NCBI nr
Match: KAB5511205.1 (hypothetical protein DKX38_030130 [Salix brachista])

HSP 1 Score: 1650.2 bits (4272), Expect = 0.0e+00
Identity = 942/1625 (57.97%), Postives = 970/1625 (59.69%), Query Frame = 0

Query: 34   AWPIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTTQGFDTFRNREISTGAGAIREQLADLD 93
            A PIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTTQGFDTFRNREISTGAGAIRE LADLD
Sbjct: 20   ARPIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTTQGFDTFRNREISTGAGAIRELLADLD 79

Query: 94   LRLIIDYSLVEWKELGEEGPAGNEWEDRKVGRRKDFLVRRMELAKHFIRTNIEPEWMVLC 153
            LR+I+DYS +EWKELGEEGP GNEWEDRKVGRRKDFLVRR+ELAKHFIRTNIEPEWMVLC
Sbjct: 80   LRIILDYSSLEWKELGEEGPTGNEWEDRKVGRRKDFLVRRVELAKHFIRTNIEPEWMVLC 139

Query: 154  LLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTLIDLLTTSRSTPGELVMCQEKL 213
            LLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTL DLLTTSRSTPGELVMCQEKL
Sbjct: 140  LLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTLTDLLTTSRSTPGELVMCQEKL 199

Query: 214  VQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSVIVVG 273
            VQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRET+LGKRVDYSGRSVIVVG
Sbjct: 200  VQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETMLGKRVDYSGRSVIVVG 259

Query: 274  PSLSLHRCGLPREIAIELFQTFLIRGLIRQHFASNIGVAKSKIREKEPIVWEILQEVMQG 333
            PSLSLHRCGLPREIAIELFQTF+IRGLIRQH ASNIGVAKSKIREKEPIVW ILQEVM+G
Sbjct: 260  PSLSLHRCGLPREIAIELFQTFVIRGLIRQHLASNIGVAKSKIREKEPIVWGILQEVMRG 319

Query: 334  HPVLLNRAPTLHRLGIQAFQPILVEGRAICLHPLVCKGFNADFDGDQMAVHVPLSLEAQA 393
            HP+LLNRAPTLHRLGIQAFQPILVEGRAICLHPLV KGFNADFDGDQMAVHVPLSLEAQA
Sbjct: 320  HPILLNRAPTLHRLGIQAFQPILVEGRAICLHPLVRKGFNADFDGDQMAVHVPLSLEAQA 379

Query: 394  EARLLMFSHMNLLSSAIGDPISVPTQDMLIGLYVLTSGNRRGICAN-------------- 453
            EARLLMFSHMNLLS AIGDPISVPTQDML+GLYVLTSGNRRGI AN              
Sbjct: 380  EARLLMFSHMNLLSPAIGDPISVPTQDMLMGLYVLTSGNRRGISANRYNPFNCRNFQNEK 439

Query: 454  ------------------------------------------------------------ 513
                                                                        
Sbjct: 440  IDANANKDKYIKEPFFCNSYDAIGAYRQKRINLASPLWLRWQLDQGLIASREAPIEGHFG 499

Query: 514  --------RTIGPEFLRESRSRKGSFLIIDSNLLSNPTQRNMEVLMAERADLVFHNKVID 573
                    RT+ PEFLRESR RKGSF  I+SNLL +P  RNMEV MAERA+L FHNKVID
Sbjct: 500  SLDLFGATRTLVPEFLRESRLRKGSFPSINSNLLPSPIHRNMEVFMAERANLFFHNKVID 559

Query: 574  GTAIKRLISRLIDHFGMAYTSHILDQLKTLGFQQATATSISLGIDDLLTIPSKGWLVQDA 633
            GTAIKR+ISR IDHFGMAYTSHILDQ+KTLGFQQATATSISLGIDDLLTIPSKGWLVQDA
Sbjct: 560  GTAIKRIISRFIDHFGMAYTSHILDQVKTLGFQQATATSISLGIDDLLTIPSKGWLVQDA 619

Query: 634  EQQSFILEKHHHYGNVHAVEKLRQSIEVWYATSEYLRQEMNPNFRMTDPFNPVHIMSFSG 693
            EQQSFILEKHHHYGNVHA+EKLRQSIE+WYATSEYLRQEMNPNFRMT+PFNPVHIMSFSG
Sbjct: 620  EQQSFILEKHHHYGNVHAIEKLRQSIEIWYATSEYLRQEMNPNFRMTEPFNPVHIMSFSG 679

Query: 694  ARGNASQVHQLVGMRGLMSDPQGQMIDLPIQSNLREGLSLTEYIISCYGARKGVVDTAVR 753
            ARGNASQVHQLVGMRGLMSDPQGQMIDLPIQSNLREGLSLTEYIISCYGARKGVVDTAVR
Sbjct: 680  ARGNASQVHQLVGMRGLMSDPQGQMIDLPIQSNLREGLSLTEYIISCYGARKGVVDTAVR 739

Query: 754  TSDAGYLTRRLVEVVQHIVVRRTDCGTIRGISVSPGNRMIPERIFIQTLIGRVLADDIYM 813
            TSDAGYLTRRLVEVVQHIVVRRTDCGT RGISVS  N MIPERIFIQTLIGRVLAD+IYM
Sbjct: 740  TSDAGYLTRRLVEVVQHIVVRRTDCGTTRGISVSSRNGMIPERIFIQTLIGRVLADNIYM 799

Query: 814  GPRCIGIRNQDIGIGLINRFITFQTQPISIRTPFTCRSTSWICRLCYGRSPTHGDLVELG 873
            G RCI  RNQDIGIGL+NRFITF+TQPISIRTPFTCRSTSWICRLCYGRSPTHGDLVELG
Sbjct: 800  GLRCIATRNQDIGIGLVNRFITFRTQPISIRTPFTCRSTSWICRLCYGRSPTHGDLVELG 859

Query: 874  EAVGIIAGQSIGEP---------------------------------------------- 933
            EAVGIIAGQSIGEP                                              
Sbjct: 860  EAVGIIAGQSIGEPGTQLTLRTFHTGGVFTGGTAEHVRAPSNGKIKFNKGLVHPTRTRHG 919

Query: 934  ------------------------------------------------------------ 993
                                                                        
Sbjct: 920  HPAFLCSMDLYVTIESQDIIHNVTIPPKSFLLVQNDQYVESEQVIAEIRSGTYTLNFTER 979

Query: 994  ------------------------------------------------------------ 1050
                                                                        
Sbjct: 980  VRKHIYSDSEGEMHWSTDVYHASEFTYSNVHLLPKTSHLWILSGGSCRSSIVPFSLHKDQ 1039

BLAST of HG10017408 vs. NCBI nr
Match: KAG8363203.1 (hypothetical protein BUALT_BualtPtG0001100 [Buddleja alternifolia])

HSP 1 Score: 1572.0 bits (4069), Expect = 0.0e+00
Identity = 935/1766 (52.94%), Postives = 972/1766 (55.04%), Query Frame = 0

Query: 8    LILQIQNKKLSSHSIELGCPGSDMWLAWPIAKKPTFLRLRG------------------- 67
            ++ +I+NKKLSS  I+LGCPGSDM     + K    LR+RG                   
Sbjct: 33   IVGEIRNKKLSSQPIQLGCPGSDM----SLGKSNMKLRIRGVVNTPQSKGELIYGVYPSR 92

Query: 68   -------SFEYEIQSWKYSIPLFFTTQGFDTFRNREISTGAGAIREQLADLDLRLIIDYS 127
                       E++S           QGFDTFRNREISTGAGAIREQLADLDLR+I+D S
Sbjct: 93   IGFQGHRGITVEVKSGPNRSNGTIHRQGFDTFRNREISTGAGAIREQLADLDLRIILDNS 152

Query: 128  LVEWKELGEEGPAGNEWEDRKVGRRKDFLVRRMELAKHFIRTNIEPEWMVLCLLPVLPPE 187
            LVEWKELGEEGP GNEWEDRKVGRRKDFLVRRMELAKHF+RTNIEPEWMVLCLLPVLPPE
Sbjct: 153  LVEWKELGEEGPTGNEWEDRKVGRRKDFLVRRMELAKHFLRTNIEPEWMVLCLLPVLPPE 212

Query: 188  LRPIIQIDGGKLMSSDINELYRRVIYRNNTLIDLLTTSRSTPGELVMCQEKLVQEAVDTL 247
            LRPIIQIDGGKLMSSDINELYRRVIYRNNTL DLLTTSRSTPGELVMCQEKLVQEAVDTL
Sbjct: 213  LRPIIQIDGGKLMSSDINELYRRVIYRNNTLTDLLTTSRSTPGELVMCQEKLVQEAVDTL 272

Query: 248  LDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSVIVVGPSLSLHRC 307
            LDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSVIVVGPSLSLHRC
Sbjct: 273  LDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSVIVVGPSLSLHRC 332

Query: 308  GLPREIAIELFQTFLIRGLIRQHFASNIGVAKSKIREKEPIVWEILQEVMQGHPVLLNRA 367
            GLPREIAIELFQTF+IRGLIRQH ASNIG+AKSKIREKEPIVWEILQEVMQGHPVLLNRA
Sbjct: 333  GLPREIAIELFQTFVIRGLIRQHLASNIGIAKSKIREKEPIVWEILQEVMQGHPVLLNRA 392

Query: 368  PTLHRLGIQAFQPILVEGRAICLHPLVCKGFNADFDGDQMAVHVPLSLEAQAEARLLMFS 427
            PTLH+LGIQAFQPILVEGRAICLHPLVCKGFNADFDGDQMAVHVPLSLEAQAEARLLMFS
Sbjct: 393  PTLHKLGIQAFQPILVEGRAICLHPLVCKGFNADFDGDQMAVHVPLSLEAQAEARLLMFS 452

Query: 428  HMNLLSSAIGDPISVPTQDMLIGLYVLTSGNRR------GICANRTIGPEFLRESRSRKG 487
            HMNLLS AIGDPISVPTQDMLIGLYVLTSGNRR       +     I PEFLRESRSRKG
Sbjct: 453  HMNLLSPAIGDPISVPTQDMLIGLYVLTSGNRRVGIKISSLQLGPYIVPEFLRESRSRKG 512

Query: 488  SFLIIDSNLLSNPTQRNMEVLMAERADLVFHNKVIDGTAIKRLISRLIDHFGMAYTSHIL 547
            SF I DSN LSN TQ+NMEVLMAERA+LVFHNKVIDGTA+KRLISRLIDHFGMAYTSHIL
Sbjct: 513  SFPITDSNPLSNRTQQNMEVLMAERANLVFHNKVIDGTAMKRLISRLIDHFGMAYTSHIL 572

Query: 548  DQLKTLGFQQATATSISLGIDDLLTIPSKGWLVQDAEQQSFILEKHHHYGNVHAVEKLRQ 607
            DQ+KTLGFQQATATSISLGIDDLLTIPSK WLVQDAEQQS ILEKHHHYGNVHAVEKLRQ
Sbjct: 573  DQVKTLGFQQATATSISLGIDDLLTIPSKRWLVQDAEQQSLILEKHHHYGNVHAVEKLRQ 632

Query: 608  SIEVWYATSEYLRQEMNPNFRMTDPFNPVHIMSFSGARGNASQVHQLVGMRGLMSDPQGQ 667
            SIE+WYATSEYLRQEMNPNFRMTDPFNPVHIMSFSGARGNASQVHQLVGMRGLMSDPQGQ
Sbjct: 633  SIEIWYATSEYLRQEMNPNFRMTDPFNPVHIMSFSGARGNASQVHQLVGMRGLMSDPQGQ 692

Query: 668  MIDLPIQSNLREGLSLTEYIISCYGARKGVVDTAVRTSDAGYLTRRLVEVVQHIVVRRTD 727
            MIDLPIQSNLREGLSLTEYIISCYGARKGVVDTAVRTSDAGYLTRRLVEVVQHIVVRRTD
Sbjct: 693  MIDLPIQSNLREGLSLTEYIISCYGARKGVVDTAVRTSDAGYLTRRLVEVVQHIVVRRTD 752

Query: 728  CGTIRGISVSPGNRMIPERIFIQTLIGRVLADDIYMGPRCIGIRNQDIGIGLINRFITFQ 787
            CGT+RGISVSP N M+PERIFIQTLIGRVLADDIYMG RCI  RNQDIGIGL+NRFITF+
Sbjct: 753  CGTVRGISVSPRNGMMPERIFIQTLIGRVLADDIYMGTRCIATRNQDIGIGLVNRFITFR 812

Query: 788  TQPISIRTPFTCRSTSWICRLCYGRSPTHGDLVELGEAVGIIAGQSIGEP---------- 847
             QPI+IRTPFTCRS SWICRLCYGRSPTHGDLVELGEAVGIIAGQSIGEP          
Sbjct: 813  AQPIAIRTPFTCRSASWICRLCYGRSPTHGDLVELGEAVGIIAGQSIGEPGTQLTLRTFH 872

Query: 848  ------------------------------------------------------------ 907
                                                                        
Sbjct: 873  TGGVFTGGTAEHVRAPSNGKIKFNEDLVHPTRTRHGHPAFLCSIDLYVTIESEDIQHNVN 932

Query: 908  ------------------------------------------------------------ 967
                                                                        
Sbjct: 933  IPPQSFLLVQNDQYVESEQVIAEIRAGTSTLNFKEKVRKHIYSDSDGEMHWSTDVYHAPE 992

Query: 968  ------------------------------------------------------------ 1027
                                                                        
Sbjct: 993  FTYGNVHLLPKTSHLWILLGGPCRSSLVSLSLHKDQDQMSAHSRSVKRRSLSNLSGTTDQ 1052

Query: 1028 ------------------------------------------------------------ 1050
                                                                        
Sbjct: 1053 SRQKFFTSDFSGKKEDRIPDYSDLSRIICTGRCNLIDPTILYQNSDLFSKRRRNRFIIPL 1112

BLAST of HG10017408 vs. NCBI nr
Match: THG00578.1 (hypothetical protein TEA_001457 [Camellia sinensis var. sinensis])

HSP 1 Score: 1566.6 bits (4055), Expect = 0.0e+00
Identity = 886/1470 (60.27%), Postives = 916/1470 (62.31%), Query Frame = 0

Query: 34   AWPIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTTQGFDTFRNREISTGAGAIREQLADLD 93
            A PIAKKPTFLRLRG FEYEIQSWKYSIPLFFT QGFDTFRNREISTGAGAIREQLADLD
Sbjct: 26   ARPIAKKPTFLRLRGLFEYEIQSWKYSIPLFFTNQGFDTFRNREISTGAGAIREQLADLD 85

Query: 94   LRLIIDYSLVEWKELGEEGPAGNEWEDRKVGRRKDFLVRRMELAKHFIRTNIEPEWMVLC 153
            LR+IID SLVEWKELG+E  AGNEWEDRK+ RRKDFLVRRMELAKHF+RTN+EPEWMVLC
Sbjct: 86   LRIIIDNSLVEWKELGDEESAGNEWEDRKIRRRKDFLVRRMELAKHFLRTNVEPEWMVLC 145

Query: 154  LLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTLIDLLTTSRSTPGELVMCQEKL 213
            LLPVLPPELRPIIQIDGGKLMSSDINELYRRV+YRNNTL DLL TSRSTPGELVMCQEKL
Sbjct: 146  LLPVLPPELRPIIQIDGGKLMSSDINELYRRVLYRNNTLTDLLATSRSTPGELVMCQEKL 205

Query: 214  VQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSVIVVG 273
            VQEAVDTLLDNGIRGQP +DGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSVIVVG
Sbjct: 206  VQEAVDTLLDNGIRGQPTKDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSVIVVG 265

Query: 274  PSLSLHRCGLPREIAIELFQTFLIRGLIRQHFASNIGVAKSKIREKEPIVWEILQEVMQG 333
            PSLSLH+CGLPREIAIELFQTF+IRGLIRQH ASNIG+AK KIREKEPIVWEILQ+VMQG
Sbjct: 266  PSLSLHQCGLPREIAIELFQTFVIRGLIRQHIASNIGIAKRKIREKEPIVWEILQKVMQG 325

Query: 334  HPVLLNRAPTLHRLGIQAFQPILVEGRAICLHPLVCKGFNADFDGDQMAVHVPLSLEAQA 393
            HPVLLNRAPTLHRLGIQAFQPILVEGRAICLHPLVCKGFNADFDGDQMAVHVPLS EAQA
Sbjct: 326  HPVLLNRAPTLHRLGIQAFQPILVEGRAICLHPLVCKGFNADFDGDQMAVHVPLSFEAQA 385

Query: 394  EARLLMFSHMNLLSSAIGDPISVPTQDMLIGLYVLTSGNRRGICANRTIGPEFLRESRSR 453
            EARLLMFSHMNLLS AIGDPISVPTQDMLIGLYVLT GNRRGIC NR            R
Sbjct: 386  EARLLMFSHMNLLSPAIGDPISVPTQDMLIGLYVLTMGNRRGICKNRL-----------R 445

Query: 454  KGSFLIIDSNLLSNPTQRNMEVLMAERADLVFHNKVIDGTAIKRLISRLIDHFGMAYTSH 513
            KGSF I DS  LSNPT RN  VLMAERADLVFHNKVIDGTA+KRLISRLIDHFGMAYTSH
Sbjct: 446  KGSFRITDSGPLSNPTHRNTGVLMAERADLVFHNKVIDGTAMKRLISRLIDHFGMAYTSH 505

Query: 514  ILDQLKTLGFQQATATSISLGIDDLLTIPSKGWLVQDAEQQSFILEKHHHYGNVHAVEKL 573
            ILDQ+KTLGFQQATATSISLGIDDLLTIPSKGWLVQDAEQQS ILEKHHHYGNVHAVEKL
Sbjct: 506  ILDQVKTLGFQQATATSISLGIDDLLTIPSKGWLVQDAEQQSCILEKHHHYGNVHAVEKL 565

Query: 574  RQSIEVWYATSEYLRQEMNPNFRMTDPFNPVHIMSFSGARGNASQVHQLVGMRGLMSDPQ 633
            RQSIE+WYATSEYLRQEMNPNFRMTDP NPVH+MSFSGARGNASQVHQLVGMRGLMSDPQ
Sbjct: 566  RQSIEIWYATSEYLRQEMNPNFRMTDPSNPVHLMSFSGARGNASQVHQLVGMRGLMSDPQ 625

Query: 634  GQMIDLPIQSNLREGLSLTEYIISCYGARKGVVDTAVRTSDAGYLTRRLVEVVQHIVVRR 693
            GQMIDLPIQSNLREGLSLTEYIISCYGARKGVVDTAVRTSDAGYLTRRLVEVVQHI+VRR
Sbjct: 626  GQMIDLPIQSNLREGLSLTEYIISCYGARKGVVDTAVRTSDAGYLTRRLVEVVQHIIVRR 685

Query: 694  TDCGTIRGISVSPGNRMIPERIFIQTLIGRVLADDIYMGPRCIGIRNQDIGIGLINRFIT 753
            TDCGTIRGISVSP N M  E+I +QTL+GRVLADDIYMG RCI  RNQDIGIGL NRFIT
Sbjct: 686  TDCGTIRGISVSPRNGM-TEKILVQTLMGRVLADDIYMGIRCIASRNQDIGIGLANRFIT 745

Query: 754  FQTQPISIRTPFTCRSTSWICRLCYGRSPTHGDLVELGEAVGIIAGQSIGEP-------- 813
            F+ QPI IRTPFTCR+TSWIC+LCYGRSPTHGDLVELGEAVGIIAGQSIGEP        
Sbjct: 746  FRAQPIYIRTPFTCRNTSWICQLCYGRSPTHGDLVELGEAVGIIAGQSIGEPGTQLTLRT 805

Query: 814  ----------------------------------------------------EKRE---- 873
                                                                E R+    
Sbjct: 806  FHTGGVFTGGTAEHVRAPSNGKIKFNEYLVHPTRTRHGHPAFLCSIDLYVTIESRDILHS 865

Query: 874  ---------------------------------------------EVGGEM--------- 933
                                                         E  GEM         
Sbjct: 866  VNIPPKSLILVQNDQYVESEQVIAEIRAGTSTFHLKERVRKHIYSESEGEMHWSTDVYHA 925

Query: 934  ------------------------------------------------------------ 993
                                                                        
Sbjct: 926  PEYTYSNVHLLPKTSHLWILAGGPCRSSIVSFSLHKDQDQMNAHSFSVDERYISDRLITN 985

Query: 994  ----------------------------TRRYWNI------------------------- 1040
                                        +  YWN                          
Sbjct: 986  DRVRHKLLDPYGKKDKEILDYSRLDRIISNGYWNFIYPSIPQENSDFLAKRRRNRFLIPL 1045

BLAST of HG10017408 vs. NCBI nr
Match: QCE13735.1 (DNA-directed RNA polymerase subunit beta' [Vigna unguiculata])

HSP 1 Score: 1481.8 bits (3835), Expect = 0.0e+00
Identity = 762/932 (81.76%), Postives = 778/932 (83.48%), Query Frame = 0

Query: 9    ILQIQNKKLSSHSIELGCPGSDMWLA---------------WPIAKKPTFLRLRGSFEYE 68
            +  I N+KLSSHSIELGCPGS M L                 PIAKKPTFLRLRGSFEYE
Sbjct: 1146 VTHIPNEKLSSHSIELGCPGSGMSLGRSNMKLRIMGVFSTPKPIAKKPTFLRLRGSFEYE 1205

Query: 69   IQSWKYSIPLFFTTQGFDTFRNREISTGAGAIREQLADLDLRLIIDYSLVEWKELGEEGP 128
            IQSWKYSIPLFFTTQGFDTFRNREISTGAGAIREQLADLDLR+IIDYSLVEWKELGEEGP
Sbjct: 1206 IQSWKYSIPLFFTTQGFDTFRNREISTGAGAIREQLADLDLRIIIDYSLVEWKELGEEGP 1265

Query: 129  AGNEWEDRKVGRRKDFLVRRMELAKHFIRTNIEPEWMVLCLLPVLPPELRPIIQIDGGKL 188
             GNEWEDRKVGRR+DFLVRRMELAKHFIRTNIEPEWMVLCLLPVLPPELRPIIQIDGGKL
Sbjct: 1266 TGNEWEDRKVGRRRDFLVRRMELAKHFIRTNIEPEWMVLCLLPVLPPELRPIIQIDGGKL 1325

Query: 189  MSSDINELYRRVIYRNNTLIDLLTTSRSTPGELVMCQEKLVQEAVDTLLDNGIRGQPMRD 248
            MSSDINELYRRVIYRNNTLIDLLTTSRSTPGELVMCQEKLVQEAVDTLLDNGIRGQPMRD
Sbjct: 1326 MSSDINELYRRVIYRNNTLIDLLTTSRSTPGELVMCQEKLVQEAVDTLLDNGIRGQPMRD 1385

Query: 249  GHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSVIVVGPSLSLHRCGLPREIAIELFQ 308
            GHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSVIVVGPSLSLHRCGLPREIAIELFQ
Sbjct: 1386 GHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSVIVVGPSLSLHRCGLPREIAIELFQ 1445

Query: 309  TFLIRGLIRQHFASNIGVAKSKIREKEPIVWEILQEVMQGHPVLLNRAPTLHRLGIQAFQ 368
            TF+IRGLIRQHFASNIGVAKSKIREKEP+VWEILQEVMQGHPVLLNRAPTLHRLGIQAFQ
Sbjct: 1446 TFVIRGLIRQHFASNIGVAKSKIREKEPVVWEILQEVMQGHPVLLNRAPTLHRLGIQAFQ 1505

Query: 369  PILVEGRAICLHPLVCKGFNADFDGDQMAVHVPLSLEAQAEARLLMFSHMNLLSSAIGDP 428
            PILVEG AICLHPLVCKGFNADFDGDQMAVHVPLSLEAQAEARLLMFSHMNLLS AIGDP
Sbjct: 1506 PILVEGHAICLHPLVCKGFNADFDGDQMAVHVPLSLEAQAEARLLMFSHMNLLSPAIGDP 1565

Query: 429  ISVPTQDMLIGLYVLTSGNRRGICAN---------------------------------- 488
            ISVPTQDMLIGLYVLTSGNRRGICAN                                  
Sbjct: 1566 ISVPTQDMLIGLYVLTSGNRRGICANRYNPCNRRNYQNKRIDDNNYKYTKEPFFCNSYDA 1625

Query: 489  ------------------------------------------------------------ 548
                                                                        
Sbjct: 1626 IGAYRQKRINLDSPLWLRWRLDQRVITSRETPIEVHYESLGTYHEIYGHYLIVRSIKKEI 1685

Query: 549  --------------------------RTIGPEFLRESRSRKGSFLIIDSNLLSNPTQRNM 608
                                      RTI PEFL ESRSRKGSF IIDSN LSNPTQRN 
Sbjct: 1686 ICIYVRTTVGHISLYREIEEAIQGSPRTIVPEFLCESRSRKGSFPIIDSNPLSNPTQRNR 1745

Query: 609  EVLMAERADLVFHNKVIDGTAIKRLISRLIDHFGMAYTSHILDQLKTLGFQQATATSISL 668
            EVLMAERA+LVFHNKVIDGTAIKRLISRLIDHFGMAYTSHILDQ+KTLGF+QATATSISL
Sbjct: 1746 EVLMAERANLVFHNKVIDGTAIKRLISRLIDHFGMAYTSHILDQVKTLGFRQATATSISL 1805

Query: 669  GIDDLLTIPSKGWLVQDAEQQSFILEKHHHYGNVHAVEKLRQSIEVWYATSEYLRQEMNP 728
            GIDDLLTIPSKGWLVQDAEQQSFILEKHHHYGNVHAVEKLRQSIE+WYATSEYLRQEMNP
Sbjct: 1806 GIDDLLTIPSKGWLVQDAEQQSFILEKHHHYGNVHAVEKLRQSIEIWYATSEYLRQEMNP 1865

Query: 729  NFRMTDPFNPVHIMSFSGARGNASQVHQLVGMRGLMSDPQGQMIDLPIQSNLREGLSLTE 788
            NFRMTDPFNPVH+MSFSGARGNASQVHQLVGMRGLMSDPQGQMIDLPIQSNLREGLSLTE
Sbjct: 1866 NFRMTDPFNPVHMMSFSGARGNASQVHQLVGMRGLMSDPQGQMIDLPIQSNLREGLSLTE 1925

Query: 789  YIISCYGARKGVVDTAVRTSDAGYLTRRLVEVVQHIVVRRTDCGTIRGISVSPGNRMIPE 806
            YIISCYGARKGVVDTAVRTSDAGYLTRRLVEVVQHIVVRRTDCGTIRGISVSP N M+PE
Sbjct: 1926 YIISCYGARKGVVDTAVRTSDAGYLTRRLVEVVQHIVVRRTDCGTIRGISVSPQNGMMPE 1985

BLAST of HG10017408 vs. NCBI nr
Match: QCE13735.1 (DNA-directed RNA polymerase subunit beta' [Vigna unguiculata])

HSP 1 Score: 449.5 bits (1155), Expect = 7.9e-122
Identity = 221/234 (94.44%), Postives = 228/234 (97.44%), Query Frame = 0

Query: 806  EKREEVGGEMTRRYWNIHLEEMMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTA 865
            ++R+EV GEMTRRYWNI+LEEMMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTA
Sbjct: 3080 KRRKEVWGEMTRRYWNINLEEMMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTA 3139

Query: 866  RFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTT 925
            RFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAA +ARCHYVNKKWLGGMLTNW TT
Sbjct: 3140 RFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAAIKARCHYVNKKWLGGMLTNWYTT 3199

Query: 926  ETRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQ 985
            ETRLHKFRDLRTEQKTG LNRLPKRDAA+LKRQL HLQTYLGGIKYMTGLPDIVIIVDQQ
Sbjct: 3200 ETRLHKFRDLRTEQKTGRLNRLPKRDAAVLKRQLFHLQTYLGGIKYMTGLPDIVIIVDQQ 3259

Query: 986  EEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIASIRLILNKLVFAI 1040
            EEY AL+ECITLGIPTICLIDTNCDPDLADISIPANDDAIASIRLILNKLVFAI
Sbjct: 3260 EEYTALRECITLGIPTICLIDTNCDPDLADISIPANDDAIASIRLILNKLVFAI 3313


HSP 2 Score: 1445.6 bits (3741), Expect = 0.0e+00
Identity = 747/920 (81.20%), Postives = 766/920 (83.26%), Query Frame = 0

Query: 15   KKLSSHSIELGCP-GS--DMWLAWPIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTTQGFD 74
            +KLS   ++ G P GS  +   A PIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTT GFD
Sbjct: 155  RKLSRPILKGGDPIGSYPNFSFARPIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTTPGFD 214

Query: 75   TFRNREISTGAGAIREQLADLDLRLIIDYSLVEWKELGEEGPAGNEWEDRKVGRRKDFLV 134
            TFRNREISTGAGAIREQLADLDLR+IIDYSL+EWKELGEEG  GNEWEDRKVGRRKDFLV
Sbjct: 215  TFRNREISTGAGAIREQLADLDLRIIIDYSLLEWKELGEEGSTGNEWEDRKVGRRKDFLV 274

Query: 135  RRMELAKHFIRTNIEPEWMVLCLLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNT 194
            RRMELAKHFIRTNIEPEWMVLCLLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNT
Sbjct: 275  RRMELAKHFIRTNIEPEWMVLCLLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNT 334

Query: 195  LIDLLTTSRSTPGELVMCQEKLVQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEG 254
            LIDLLTTSRSTPGELVMCQEKLVQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEG
Sbjct: 335  LIDLLTTSRSTPGELVMCQEKLVQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEG 394

Query: 255  RFRETLLGKRVDYSGRSVIVVGPSLSLHRCGLPREIAIELFQTFLIRGLIRQHFASNIGV 314
            RFRETLLGKRVDYSGRSVIVVGPSLSLHRCGLPREIAIELFQTF+IRGLIRQHFASNIGV
Sbjct: 395  RFRETLLGKRVDYSGRSVIVVGPSLSLHRCGLPREIAIELFQTFVIRGLIRQHFASNIGV 454

Query: 315  AKSKIREKEPIVWEILQEVMQGHPVLLNRAPTLHRLGIQAFQPILVEGRAICLHPLVCKG 374
            AKSKIREKEP+VWEILQEVMQGHPVLLNRAPTLHRLGIQAFQPILVEG AICLHPLVCKG
Sbjct: 455  AKSKIREKEPVVWEILQEVMQGHPVLLNRAPTLHRLGIQAFQPILVEGHAICLHPLVCKG 514

Query: 375  FNADFDGDQMAVHVPLSLEAQAEARLLMFSHMNLLSSAIGDPISVPTQDMLIGLYVLTSG 434
            FNADFDGDQMAVHVPLSLEAQAEARLLMFSHMNLLS AIGDPISVPTQDMLIGLYVLTSG
Sbjct: 515  FNADFDGDQMAVHVPLSLEAQAEARLLMFSHMNLLSPAIGDPISVPTQDMLIGLYVLTSG 574

Query: 435  NRRGICANR--------------------------------------------------- 494
            NRRGICANR                                                   
Sbjct: 575  NRRGICANRYNPYNRTNSKNERIADNNYKYTKEPFFCNSYDAIGAYRQKRVNLDSPLWLR 634

Query: 495  -------------------------------------------------TIG-------- 554
                                                             T+G        
Sbjct: 635  WRLDQRVITSRETPIEVHYESLGTSHEIYGHYVIVRSIKKEVLCIYVRTTVGHISLYREI 694

Query: 555  ------------------PEFLRESRSRKGSFLIIDSNLLSNPTQRNMEVLMAERADLVF 614
                              PEFL ESRSRKGSF I DSN LSNPTQRN EVLMAERA LVF
Sbjct: 695  EEAIQGFCRAYSYARTTVPEFLCESRSRKGSFPITDSNPLSNPTQRNREVLMAERASLVF 754

Query: 615  HNKVIDGTAIKRLISRLIDHFGMAYTSHILDQLKTLGFQQATATSISLGIDDLLTIPSKG 674
            HNKVIDGTAIKRLISRLIDHFGMAYTSHILDQ+KTLGF+QATATSISLGIDDLLTIPSKG
Sbjct: 755  HNKVIDGTAIKRLISRLIDHFGMAYTSHILDQVKTLGFRQATATSISLGIDDLLTIPSKG 814

Query: 675  WLVQDAEQQSFILEKHHHYGNVHAVEKLRQSIEVWYATSEYLRQEMNPNFRMTDPFNPVH 734
            WLVQDAEQQS ILEKHHHYGNVHAVEKLRQSIE+WYATSEYLRQEMNPNFRMTDPFNPVH
Sbjct: 815  WLVQDAEQQSLILEKHHHYGNVHAVEKLRQSIEIWYATSEYLRQEMNPNFRMTDPFNPVH 874

Query: 735  IMSFSGARGNASQVHQLVGMRGLMSDPQGQMIDLPIQSNLREGLSLTEYIISCYGARKGV 794
            +MSFSGARGNASQVHQLVGMRGLMSDPQGQMIDLPIQSNLREGLSLTEYIISCYGARKGV
Sbjct: 875  MMSFSGARGNASQVHQLVGMRGLMSDPQGQMIDLPIQSNLREGLSLTEYIISCYGARKGV 934

Query: 795  VDTAVRTSDAGYLTRRLVEVVQHIVVRRTDCGTIRGISVSPGNRMIPERIFIQTLIGRVL 806
            VDTAVRTSDAGYLTRRLVEVVQHIVVRRTDCGT+RGISVSP N M+PERIFIQTLIGRVL
Sbjct: 935  VDTAVRTSDAGYLTRRLVEVVQHIVVRRTDCGTVRGISVSPRNGMMPERIFIQTLIGRVL 994

BLAST of HG10017408 vs. ExPASy Swiss-Prot
Match: Q4VZP2 (DNA-directed RNA polymerase subunit beta' OS=Cucumis sativus OX=3659 GN=rpoC1 PE=3 SV=3)

HSP 1 Score: 821.6 bits (2121), Expect = 1.0e-236
Identity = 407/411 (99.03%), Postives = 407/411 (99.03%), Query Frame = 0

Query: 30  DMWLAWPIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTTQGFDTFRNREISTGAGAIREQL 89
           D   A PIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTTQGFDTFRNREISTGAGAIREQL
Sbjct: 144 DFSFARPIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTTQGFDTFRNREISTGAGAIREQL 203

Query: 90  ADLDLRLIIDYSLVEWKELGEEGPAGNEWEDRKVGRRKDFLVRRMELAKHFIRTNIEPEW 149
           ADLDLRLIIDYSLVEWKELGEEGPAGNEWEDRKVGRRKDFLVRRMELAKHFIRTNIEPEW
Sbjct: 204 ADLDLRLIIDYSLVEWKELGEEGPAGNEWEDRKVGRRKDFLVRRMELAKHFIRTNIEPEW 263

Query: 150 MVLCLLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTLIDLLTTSRSTPGELVMC 209
           MVLCLLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTLIDLLTTSRSTPGELVMC
Sbjct: 264 MVLCLLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTLIDLLTTSRSTPGELVMC 323

Query: 210 QEKLVQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSV 269
           QEKLVQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSV
Sbjct: 324 QEKLVQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSV 383

Query: 270 IVVGPSLSLHRCGLPREIAIELFQTFLIRGLIRQHFASNIGVAKSKIREKEPIVWEILQE 329
           IVVGPSLSLHRCGLPREIAIELFQTFLIRGLIRQHFASNIGVAKSKIREKEPIVWEILQE
Sbjct: 384 IVVGPSLSLHRCGLPREIAIELFQTFLIRGLIRQHFASNIGVAKSKIREKEPIVWEILQE 443

Query: 330 VMQGHPVLLNRAPTLHRLGIQAFQPILVEGRAICLHPLVCKGFNADFDGDQMAVHVPLSL 389
           VMQGHPVLLNRAPTLHRLGIQAFQPILVEGRAICLHPLVCKGFNADFDGDQMAVHVPLSL
Sbjct: 444 VMQGHPVLLNRAPTLHRLGIQAFQPILVEGRAICLHPLVCKGFNADFDGDQMAVHVPLSL 503

Query: 390 EAQAEARLLMFSHMNLLSSAIGDPISVPTQDMLIGLYVLTSGNRRGICANR 441
           EAQAEARLLMFSHMNLLSSAIGDPISVPTQDMLIGLYVLTSGNRRGICANR
Sbjct: 504 EAQAEARLLMFSHMNLLSSAIGDPISVPTQDMLIGLYVLTSGNRRGICANR 554

BLAST of HG10017408 vs. ExPASy Swiss-Prot
Match: B1NWE1 (DNA-directed RNA polymerase subunit beta' OS=Manihot esculenta OX=3983 GN=rpoC1 PE=3 SV=1)

HSP 1 Score: 801.6 bits (2069), Expect = 1.1e-230
Identity = 396/411 (96.35%), Postives = 400/411 (97.32%), Query Frame = 0

Query: 30  DMWLAWPIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTTQGFDTFRNREISTGAGAIREQL 89
           D   A PIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTTQ FDTFRNREISTGAGAIREQL
Sbjct: 144 DFSFARPIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTTQCFDTFRNREISTGAGAIREQL 203

Query: 90  ADLDLRLIIDYSLVEWKELGEEGPAGNEWEDRKVGRRKDFLVRRMELAKHFIRTNIEPEW 149
           ADLDLR+IIDYS VEWKELGEEGP GNEWEDRKVGRRKDFLVRR+ELAKHFIRTNIEPEW
Sbjct: 204 ADLDLRIIIDYSSVEWKELGEEGPTGNEWEDRKVGRRKDFLVRRVELAKHFIRTNIEPEW 263

Query: 150 MVLCLLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTLIDLLTTSRSTPGELVMC 209
           MVLCLLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTLIDLLTTSRSTPGELVMC
Sbjct: 264 MVLCLLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTLIDLLTTSRSTPGELVMC 323

Query: 210 QEKLVQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSV 269
           QEKLVQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRET+LGKRVDYSGRSV
Sbjct: 324 QEKLVQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETMLGKRVDYSGRSV 383

Query: 270 IVVGPSLSLHRCGLPREIAIELFQTFLIRGLIRQHFASNIGVAKSKIREKEPIVWEILQE 329
           IVVGPSLSLHRCGLPREIAIELFQ F+IRGLIRQH ASNIGVAKSKIREKEPIVWEIL E
Sbjct: 384 IVVGPSLSLHRCGLPREIAIELFQIFVIRGLIRQHLASNIGVAKSKIREKEPIVWEILHE 443

Query: 330 VMQGHPVLLNRAPTLHRLGIQAFQPILVEGRAICLHPLVCKGFNADFDGDQMAVHVPLSL 389
           VMQGHPVLLNRAPTLHRLGIQAFQPILVEGRAICLHPLVCKGFNADFDGDQMAVHVPLSL
Sbjct: 444 VMQGHPVLLNRAPTLHRLGIQAFQPILVEGRAICLHPLVCKGFNADFDGDQMAVHVPLSL 503

Query: 390 EAQAEARLLMFSHMNLLSSAIGDPISVPTQDMLIGLYVLTSGNRRGICANR 441
           EAQAEARLLMFSHMNLLS AIGDPISVPTQDMLIGLYVLTSGNRRGICANR
Sbjct: 504 EAQAEARLLMFSHMNLLSPAIGDPISVPTQDMLIGLYVLTSGNRRGICANR 554

BLAST of HG10017408 vs. ExPASy Swiss-Prot
Match: Q68S15 (DNA-directed RNA polymerase subunit beta' OS=Panax ginseng OX=4054 GN=rpoC1 PE=3 SV=2)

HSP 1 Score: 800.8 bits (2067), Expect = 1.8e-230
Identity = 396/411 (96.35%), Postives = 400/411 (97.32%), Query Frame = 0

Query: 30  DMWLAWPIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTTQGFDTFRNREISTGAGAIREQL 89
           D   A PIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTTQGFDTFRNREISTGAGAIREQL
Sbjct: 144 DFSFARPIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTTQGFDTFRNREISTGAGAIREQL 203

Query: 90  ADLDLRLIIDYSLVEWKELGEEGPAGNEWEDRKVGRRKDFLVRRMELAKHFIRTNIEPEW 149
           ADLDLR+IID SLVEWKELGE+GP GNEWEDRKVGRRKDFLVRRMELAKHFIRTNIEPEW
Sbjct: 204 ADLDLRIIIDSSLVEWKELGEDGPTGNEWEDRKVGRRKDFLVRRMELAKHFIRTNIEPEW 263

Query: 150 MVLCLLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTLIDLLTTSRSTPGELVMC 209
           MVLCLLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTL DLLTTSRSTPGELVMC
Sbjct: 264 MVLCLLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTLTDLLTTSRSTPGELVMC 323

Query: 210 QEKLVQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSV 269
           QEKLVQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSV
Sbjct: 324 QEKLVQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSV 383

Query: 270 IVVGPSLSLHRCGLPREIAIELFQTFLIRGLIRQHFASNIGVAKSKIREKEPIVWEILQE 329
           IVVGPSLSLHRCGLPREIAIELFQTF+IRGLIRQH ASNIGVAKSKIREKEPIVWEILQE
Sbjct: 384 IVVGPSLSLHRCGLPREIAIELFQTFVIRGLIRQHLASNIGVAKSKIREKEPIVWEILQE 443

Query: 330 VMQGHPVLLNRAPTLHRLGIQAFQPILVEGRAICLHPLVCKGFNADFDGDQMAVHVPLSL 389
           VMQGHPVLLNRAPTLHRLGIQAFQP+LVEGRAICLHPLV KGFNADFDGDQMAVHVPLSL
Sbjct: 444 VMQGHPVLLNRAPTLHRLGIQAFQPVLVEGRAICLHPLVRKGFNADFDGDQMAVHVPLSL 503

Query: 390 EAQAEARLLMFSHMNLLSSAIGDPISVPTQDMLIGLYVLTSGNRRGICANR 441
           EAQAEARLLMFSHMNLLS AIGDPISVPTQDMLIGLYVLTSGNRRGIC NR
Sbjct: 504 EAQAEARLLMFSHMNLLSPAIGDPISVPTQDMLIGLYVLTSGNRRGICVNR 554

BLAST of HG10017408 vs. ExPASy Swiss-Prot
Match: Q0ZJ29 (DNA-directed RNA polymerase subunit beta' OS=Vitis vinifera OX=29760 GN=rpoC1 PE=3 SV=1)

HSP 1 Score: 798.1 bits (2060), Expect = 1.2e-229
Identity = 394/411 (95.86%), Postives = 400/411 (97.32%), Query Frame = 0

Query: 30  DMWLAWPIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTTQGFDTFRNREISTGAGAIREQL 89
           D   A PI KKPTFLRLRGSFEYEIQSWKYSIPLFFTTQGFDTFRNREISTGAGAIREQL
Sbjct: 144 DFSFARPIEKKPTFLRLRGSFEYEIQSWKYSIPLFFTTQGFDTFRNREISTGAGAIREQL 203

Query: 90  ADLDLRLIIDYSLVEWKELGEEGPAGNEWEDRKVGRRKDFLVRRMELAKHFIRTNIEPEW 149
            DLDLR+IIDYSLVEWKELGEEGP GNEWEDRK+GRRKDFLVRRMELAKHFIRTNIEPEW
Sbjct: 204 DDLDLRIIIDYSLVEWKELGEEGPTGNEWEDRKIGRRKDFLVRRMELAKHFIRTNIEPEW 263

Query: 150 MVLCLLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTLIDLLTTSRSTPGELVMC 209
           MVLCLLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTL DLLTTSRSTPGELVMC
Sbjct: 264 MVLCLLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTLTDLLTTSRSTPGELVMC 323

Query: 210 QEKLVQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSV 269
           QEKLVQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSV
Sbjct: 324 QEKLVQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSV 383

Query: 270 IVVGPSLSLHRCGLPREIAIELFQTFLIRGLIRQHFASNIGVAKSKIREKEPIVWEILQE 329
           IVVGPSLSLH+CGLPREIAIELFQTFLIRGLIRQH ASNIGVAKS+IREKEPIVWEILQE
Sbjct: 384 IVVGPSLSLHQCGLPREIAIELFQTFLIRGLIRQHLASNIGVAKSQIREKEPIVWEILQE 443

Query: 330 VMQGHPVLLNRAPTLHRLGIQAFQPILVEGRAICLHPLVCKGFNADFDGDQMAVHVPLSL 389
           VM+GHPVLLNRAPTLHRLGIQAFQPILVEGRAICLHPLV KGFNADFDGDQMAVHVPLSL
Sbjct: 444 VMRGHPVLLNRAPTLHRLGIQAFQPILVEGRAICLHPLVRKGFNADFDGDQMAVHVPLSL 503

Query: 390 EAQAEARLLMFSHMNLLSSAIGDPISVPTQDMLIGLYVLTSGNRRGICANR 441
           EAQ+EARLLMFSHMNLLS AIGDPISVPTQDMLIGLYVLTSGNRRGICANR
Sbjct: 504 EAQSEARLLMFSHMNLLSPAIGDPISVPTQDMLIGLYVLTSGNRRGICANR 554

BLAST of HG10017408 vs. ExPASy Swiss-Prot
Match: Q2L8Z4 (DNA-directed RNA polymerase subunit beta' OS=Gossypium hirsutum OX=3635 GN=rpoC1 PE=3 SV=1)

HSP 1 Score: 797.7 bits (2059), Expect = 1.5e-229
Identity = 393/411 (95.62%), Postives = 400/411 (97.32%), Query Frame = 0

Query: 30  DMWLAWPIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTTQGFDTFRNREISTGAGAIREQL 89
           D   A PIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTTQGFDTFR+REISTGAGAIREQL
Sbjct: 144 DFSFARPIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTTQGFDTFRSREISTGAGAIREQL 203

Query: 90  ADLDLRLIIDYSLVEWKELGEEGPAGNEWEDRKVGRRKDFLVRRMELAKHFIRTNIEPEW 149
           ADLDLR++IDYS+VEWKELGEEG  GNEWEDRK+GRRKDFLVRRMELAKHFIRTNIEPEW
Sbjct: 204 ADLDLRILIDYSVVEWKELGEEGLTGNEWEDRKIGRRKDFLVRRMELAKHFIRTNIEPEW 263

Query: 150 MVLCLLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTLIDLLTTSRSTPGELVMC 209
           MVLCLLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTL DLLTTSRSTPGELVMC
Sbjct: 264 MVLCLLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTLTDLLTTSRSTPGELVMC 323

Query: 210 QEKLVQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSV 269
           QEKLVQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSV
Sbjct: 324 QEKLVQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSV 383

Query: 270 IVVGPSLSLHRCGLPREIAIELFQTFLIRGLIRQHFASNIGVAKSKIREKEPIVWEILQE 329
           IVVGPSLSLHRCGLPREIAIELFQTF+IRGLIRQH A NIGVAKSKIREK PIVWEILQE
Sbjct: 384 IVVGPSLSLHRCGLPREIAIELFQTFVIRGLIRQHLAPNIGVAKSKIREKGPIVWEILQE 443

Query: 330 VMQGHPVLLNRAPTLHRLGIQAFQPILVEGRAICLHPLVCKGFNADFDGDQMAVHVPLSL 389
           VM+GHPVLLNRAPTLHRLGIQAFQPILVEGRAICLHPLVCKGFNADFDGDQMAVHVPLSL
Sbjct: 444 VMRGHPVLLNRAPTLHRLGIQAFQPILVEGRAICLHPLVCKGFNADFDGDQMAVHVPLSL 503

Query: 390 EAQAEARLLMFSHMNLLSSAIGDPISVPTQDMLIGLYVLTSGNRRGICANR 441
           EAQAEARLLMFSHMNLLS AIGDPISVPTQDMLIGLYVLTSGNRRGICANR
Sbjct: 504 EAQAEARLLMFSHMNLLSPAIGDPISVPTQDMLIGLYVLTSGNRRGICANR 554

BLAST of HG10017408 vs. ExPASy TrEMBL
Match: A0A5N5J004 (Multifunctional fusion protein OS=Salix brachista OX=2182728 GN=rps2 PE=3 SV=1)

HSP 1 Score: 1650.2 bits (4272), Expect = 0.0e+00
Identity = 942/1625 (57.97%), Postives = 970/1625 (59.69%), Query Frame = 0

Query: 34   AWPIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTTQGFDTFRNREISTGAGAIREQLADLD 93
            A PIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTTQGFDTFRNREISTGAGAIRE LADLD
Sbjct: 20   ARPIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTTQGFDTFRNREISTGAGAIRELLADLD 79

Query: 94   LRLIIDYSLVEWKELGEEGPAGNEWEDRKVGRRKDFLVRRMELAKHFIRTNIEPEWMVLC 153
            LR+I+DYS +EWKELGEEGP GNEWEDRKVGRRKDFLVRR+ELAKHFIRTNIEPEWMVLC
Sbjct: 80   LRIILDYSSLEWKELGEEGPTGNEWEDRKVGRRKDFLVRRVELAKHFIRTNIEPEWMVLC 139

Query: 154  LLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTLIDLLTTSRSTPGELVMCQEKL 213
            LLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTL DLLTTSRSTPGELVMCQEKL
Sbjct: 140  LLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTLTDLLTTSRSTPGELVMCQEKL 199

Query: 214  VQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSVIVVG 273
            VQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRET+LGKRVDYSGRSVIVVG
Sbjct: 200  VQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETMLGKRVDYSGRSVIVVG 259

Query: 274  PSLSLHRCGLPREIAIELFQTFLIRGLIRQHFASNIGVAKSKIREKEPIVWEILQEVMQG 333
            PSLSLHRCGLPREIAIELFQTF+IRGLIRQH ASNIGVAKSKIREKEPIVW ILQEVM+G
Sbjct: 260  PSLSLHRCGLPREIAIELFQTFVIRGLIRQHLASNIGVAKSKIREKEPIVWGILQEVMRG 319

Query: 334  HPVLLNRAPTLHRLGIQAFQPILVEGRAICLHPLVCKGFNADFDGDQMAVHVPLSLEAQA 393
            HP+LLNRAPTLHRLGIQAFQPILVEGRAICLHPLV KGFNADFDGDQMAVHVPLSLEAQA
Sbjct: 320  HPILLNRAPTLHRLGIQAFQPILVEGRAICLHPLVRKGFNADFDGDQMAVHVPLSLEAQA 379

Query: 394  EARLLMFSHMNLLSSAIGDPISVPTQDMLIGLYVLTSGNRRGICAN-------------- 453
            EARLLMFSHMNLLS AIGDPISVPTQDML+GLYVLTSGNRRGI AN              
Sbjct: 380  EARLLMFSHMNLLSPAIGDPISVPTQDMLMGLYVLTSGNRRGISANRYNPFNCRNFQNEK 439

Query: 454  ------------------------------------------------------------ 513
                                                                        
Sbjct: 440  IDANANKDKYIKEPFFCNSYDAIGAYRQKRINLASPLWLRWQLDQGLIASREAPIEGHFG 499

Query: 514  --------RTIGPEFLRESRSRKGSFLIIDSNLLSNPTQRNMEVLMAERADLVFHNKVID 573
                    RT+ PEFLRESR RKGSF  I+SNLL +P  RNMEV MAERA+L FHNKVID
Sbjct: 500  SLDLFGATRTLVPEFLRESRLRKGSFPSINSNLLPSPIHRNMEVFMAERANLFFHNKVID 559

Query: 574  GTAIKRLISRLIDHFGMAYTSHILDQLKTLGFQQATATSISLGIDDLLTIPSKGWLVQDA 633
            GTAIKR+ISR IDHFGMAYTSHILDQ+KTLGFQQATATSISLGIDDLLTIPSKGWLVQDA
Sbjct: 560  GTAIKRIISRFIDHFGMAYTSHILDQVKTLGFQQATATSISLGIDDLLTIPSKGWLVQDA 619

Query: 634  EQQSFILEKHHHYGNVHAVEKLRQSIEVWYATSEYLRQEMNPNFRMTDPFNPVHIMSFSG 693
            EQQSFILEKHHHYGNVHA+EKLRQSIE+WYATSEYLRQEMNPNFRMT+PFNPVHIMSFSG
Sbjct: 620  EQQSFILEKHHHYGNVHAIEKLRQSIEIWYATSEYLRQEMNPNFRMTEPFNPVHIMSFSG 679

Query: 694  ARGNASQVHQLVGMRGLMSDPQGQMIDLPIQSNLREGLSLTEYIISCYGARKGVVDTAVR 753
            ARGNASQVHQLVGMRGLMSDPQGQMIDLPIQSNLREGLSLTEYIISCYGARKGVVDTAVR
Sbjct: 680  ARGNASQVHQLVGMRGLMSDPQGQMIDLPIQSNLREGLSLTEYIISCYGARKGVVDTAVR 739

Query: 754  TSDAGYLTRRLVEVVQHIVVRRTDCGTIRGISVSPGNRMIPERIFIQTLIGRVLADDIYM 813
            TSDAGYLTRRLVEVVQHIVVRRTDCGT RGISVS  N MIPERIFIQTLIGRVLAD+IYM
Sbjct: 740  TSDAGYLTRRLVEVVQHIVVRRTDCGTTRGISVSSRNGMIPERIFIQTLIGRVLADNIYM 799

Query: 814  GPRCIGIRNQDIGIGLINRFITFQTQPISIRTPFTCRSTSWICRLCYGRSPTHGDLVELG 873
            G RCI  RNQDIGIGL+NRFITF+TQPISIRTPFTCRSTSWICRLCYGRSPTHGDLVELG
Sbjct: 800  GLRCIATRNQDIGIGLVNRFITFRTQPISIRTPFTCRSTSWICRLCYGRSPTHGDLVELG 859

Query: 874  EAVGIIAGQSIGEP---------------------------------------------- 933
            EAVGIIAGQSIGEP                                              
Sbjct: 860  EAVGIIAGQSIGEPGTQLTLRTFHTGGVFTGGTAEHVRAPSNGKIKFNKGLVHPTRTRHG 919

Query: 934  ------------------------------------------------------------ 993
                                                                        
Sbjct: 920  HPAFLCSMDLYVTIESQDIIHNVTIPPKSFLLVQNDQYVESEQVIAEIRSGTYTLNFTER 979

Query: 994  ------------------------------------------------------------ 1050
                                                                        
Sbjct: 980  VRKHIYSDSEGEMHWSTDVYHASEFTYSNVHLLPKTSHLWILSGGSCRSSIVPFSLHKDQ 1039

BLAST of HG10017408 vs. ExPASy TrEMBL
Match: A0A4S4DD75 (DNA-directed RNA polymerase subunit OS=Camellia sinensis var. sinensis OX=542762 GN=TEA_001457 PE=3 SV=1)

HSP 1 Score: 1566.6 bits (4055), Expect = 0.0e+00
Identity = 886/1470 (60.27%), Postives = 916/1470 (62.31%), Query Frame = 0

Query: 34   AWPIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTTQGFDTFRNREISTGAGAIREQLADLD 93
            A PIAKKPTFLRLRG FEYEIQSWKYSIPLFFT QGFDTFRNREISTGAGAIREQLADLD
Sbjct: 26   ARPIAKKPTFLRLRGLFEYEIQSWKYSIPLFFTNQGFDTFRNREISTGAGAIREQLADLD 85

Query: 94   LRLIIDYSLVEWKELGEEGPAGNEWEDRKVGRRKDFLVRRMELAKHFIRTNIEPEWMVLC 153
            LR+IID SLVEWKELG+E  AGNEWEDRK+ RRKDFLVRRMELAKHF+RTN+EPEWMVLC
Sbjct: 86   LRIIIDNSLVEWKELGDEESAGNEWEDRKIRRRKDFLVRRMELAKHFLRTNVEPEWMVLC 145

Query: 154  LLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTLIDLLTTSRSTPGELVMCQEKL 213
            LLPVLPPELRPIIQIDGGKLMSSDINELYRRV+YRNNTL DLL TSRSTPGELVMCQEKL
Sbjct: 146  LLPVLPPELRPIIQIDGGKLMSSDINELYRRVLYRNNTLTDLLATSRSTPGELVMCQEKL 205

Query: 214  VQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSVIVVG 273
            VQEAVDTLLDNGIRGQP +DGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSVIVVG
Sbjct: 206  VQEAVDTLLDNGIRGQPTKDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSVIVVG 265

Query: 274  PSLSLHRCGLPREIAIELFQTFLIRGLIRQHFASNIGVAKSKIREKEPIVWEILQEVMQG 333
            PSLSLH+CGLPREIAIELFQTF+IRGLIRQH ASNIG+AK KIREKEPIVWEILQ+VMQG
Sbjct: 266  PSLSLHQCGLPREIAIELFQTFVIRGLIRQHIASNIGIAKRKIREKEPIVWEILQKVMQG 325

Query: 334  HPVLLNRAPTLHRLGIQAFQPILVEGRAICLHPLVCKGFNADFDGDQMAVHVPLSLEAQA 393
            HPVLLNRAPTLHRLGIQAFQPILVEGRAICLHPLVCKGFNADFDGDQMAVHVPLS EAQA
Sbjct: 326  HPVLLNRAPTLHRLGIQAFQPILVEGRAICLHPLVCKGFNADFDGDQMAVHVPLSFEAQA 385

Query: 394  EARLLMFSHMNLLSSAIGDPISVPTQDMLIGLYVLTSGNRRGICANRTIGPEFLRESRSR 453
            EARLLMFSHMNLLS AIGDPISVPTQDMLIGLYVLT GNRRGIC NR            R
Sbjct: 386  EARLLMFSHMNLLSPAIGDPISVPTQDMLIGLYVLTMGNRRGICKNRL-----------R 445

Query: 454  KGSFLIIDSNLLSNPTQRNMEVLMAERADLVFHNKVIDGTAIKRLISRLIDHFGMAYTSH 513
            KGSF I DS  LSNPT RN  VLMAERADLVFHNKVIDGTA+KRLISRLIDHFGMAYTSH
Sbjct: 446  KGSFRITDSGPLSNPTHRNTGVLMAERADLVFHNKVIDGTAMKRLISRLIDHFGMAYTSH 505

Query: 514  ILDQLKTLGFQQATATSISLGIDDLLTIPSKGWLVQDAEQQSFILEKHHHYGNVHAVEKL 573
            ILDQ+KTLGFQQATATSISLGIDDLLTIPSKGWLVQDAEQQS ILEKHHHYGNVHAVEKL
Sbjct: 506  ILDQVKTLGFQQATATSISLGIDDLLTIPSKGWLVQDAEQQSCILEKHHHYGNVHAVEKL 565

Query: 574  RQSIEVWYATSEYLRQEMNPNFRMTDPFNPVHIMSFSGARGNASQVHQLVGMRGLMSDPQ 633
            RQSIE+WYATSEYLRQEMNPNFRMTDP NPVH+MSFSGARGNASQVHQLVGMRGLMSDPQ
Sbjct: 566  RQSIEIWYATSEYLRQEMNPNFRMTDPSNPVHLMSFSGARGNASQVHQLVGMRGLMSDPQ 625

Query: 634  GQMIDLPIQSNLREGLSLTEYIISCYGARKGVVDTAVRTSDAGYLTRRLVEVVQHIVVRR 693
            GQMIDLPIQSNLREGLSLTEYIISCYGARKGVVDTAVRTSDAGYLTRRLVEVVQHI+VRR
Sbjct: 626  GQMIDLPIQSNLREGLSLTEYIISCYGARKGVVDTAVRTSDAGYLTRRLVEVVQHIIVRR 685

Query: 694  TDCGTIRGISVSPGNRMIPERIFIQTLIGRVLADDIYMGPRCIGIRNQDIGIGLINRFIT 753
            TDCGTIRGISVSP N M  E+I +QTL+GRVLADDIYMG RCI  RNQDIGIGL NRFIT
Sbjct: 686  TDCGTIRGISVSPRNGM-TEKILVQTLMGRVLADDIYMGIRCIASRNQDIGIGLANRFIT 745

Query: 754  FQTQPISIRTPFTCRSTSWICRLCYGRSPTHGDLVELGEAVGIIAGQSIGEP-------- 813
            F+ QPI IRTPFTCR+TSWIC+LCYGRSPTHGDLVELGEAVGIIAGQSIGEP        
Sbjct: 746  FRAQPIYIRTPFTCRNTSWICQLCYGRSPTHGDLVELGEAVGIIAGQSIGEPGTQLTLRT 805

Query: 814  ----------------------------------------------------EKRE---- 873
                                                                E R+    
Sbjct: 806  FHTGGVFTGGTAEHVRAPSNGKIKFNEYLVHPTRTRHGHPAFLCSIDLYVTIESRDILHS 865

Query: 874  ---------------------------------------------EVGGEM--------- 933
                                                         E  GEM         
Sbjct: 866  VNIPPKSLILVQNDQYVESEQVIAEIRAGTSTFHLKERVRKHIYSESEGEMHWSTDVYHA 925

Query: 934  ------------------------------------------------------------ 993
                                                                        
Sbjct: 926  PEYTYSNVHLLPKTSHLWILAGGPCRSSIVSFSLHKDQDQMNAHSFSVDERYISDRLITN 985

Query: 994  ----------------------------TRRYWNI------------------------- 1040
                                        +  YWN                          
Sbjct: 986  DRVRHKLLDPYGKKDKEILDYSRLDRIISNGYWNFIYPSIPQENSDFLAKRRRNRFLIPL 1045

BLAST of HG10017408 vs. ExPASy TrEMBL
Match: A0A4D6NNC8 (Multifunctional fusion protein OS=Vigna unguiculata OX=3917 GN=DEO72_LG11g731 PE=3 SV=1)

HSP 1 Score: 1481.8 bits (3835), Expect = 0.0e+00
Identity = 762/932 (81.76%), Postives = 778/932 (83.48%), Query Frame = 0

Query: 9    ILQIQNKKLSSHSIELGCPGSDMWLA---------------WPIAKKPTFLRLRGSFEYE 68
            +  I N+KLSSHSIELGCPGS M L                 PIAKKPTFLRLRGSFEYE
Sbjct: 1146 VTHIPNEKLSSHSIELGCPGSGMSLGRSNMKLRIMGVFSTPKPIAKKPTFLRLRGSFEYE 1205

Query: 69   IQSWKYSIPLFFTTQGFDTFRNREISTGAGAIREQLADLDLRLIIDYSLVEWKELGEEGP 128
            IQSWKYSIPLFFTTQGFDTFRNREISTGAGAIREQLADLDLR+IIDYSLVEWKELGEEGP
Sbjct: 1206 IQSWKYSIPLFFTTQGFDTFRNREISTGAGAIREQLADLDLRIIIDYSLVEWKELGEEGP 1265

Query: 129  AGNEWEDRKVGRRKDFLVRRMELAKHFIRTNIEPEWMVLCLLPVLPPELRPIIQIDGGKL 188
             GNEWEDRKVGRR+DFLVRRMELAKHFIRTNIEPEWMVLCLLPVLPPELRPIIQIDGGKL
Sbjct: 1266 TGNEWEDRKVGRRRDFLVRRMELAKHFIRTNIEPEWMVLCLLPVLPPELRPIIQIDGGKL 1325

Query: 189  MSSDINELYRRVIYRNNTLIDLLTTSRSTPGELVMCQEKLVQEAVDTLLDNGIRGQPMRD 248
            MSSDINELYRRVIYRNNTLIDLLTTSRSTPGELVMCQEKLVQEAVDTLLDNGIRGQPMRD
Sbjct: 1326 MSSDINELYRRVIYRNNTLIDLLTTSRSTPGELVMCQEKLVQEAVDTLLDNGIRGQPMRD 1385

Query: 249  GHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSVIVVGPSLSLHRCGLPREIAIELFQ 308
            GHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSVIVVGPSLSLHRCGLPREIAIELFQ
Sbjct: 1386 GHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSVIVVGPSLSLHRCGLPREIAIELFQ 1445

Query: 309  TFLIRGLIRQHFASNIGVAKSKIREKEPIVWEILQEVMQGHPVLLNRAPTLHRLGIQAFQ 368
            TF+IRGLIRQHFASNIGVAKSKIREKEP+VWEILQEVMQGHPVLLNRAPTLHRLGIQAFQ
Sbjct: 1446 TFVIRGLIRQHFASNIGVAKSKIREKEPVVWEILQEVMQGHPVLLNRAPTLHRLGIQAFQ 1505

Query: 369  PILVEGRAICLHPLVCKGFNADFDGDQMAVHVPLSLEAQAEARLLMFSHMNLLSSAIGDP 428
            PILVEG AICLHPLVCKGFNADFDGDQMAVHVPLSLEAQAEARLLMFSHMNLLS AIGDP
Sbjct: 1506 PILVEGHAICLHPLVCKGFNADFDGDQMAVHVPLSLEAQAEARLLMFSHMNLLSPAIGDP 1565

Query: 429  ISVPTQDMLIGLYVLTSGNRRGICAN---------------------------------- 488
            ISVPTQDMLIGLYVLTSGNRRGICAN                                  
Sbjct: 1566 ISVPTQDMLIGLYVLTSGNRRGICANRYNPCNRRNYQNKRIDDNNYKYTKEPFFCNSYDA 1625

Query: 489  ------------------------------------------------------------ 548
                                                                        
Sbjct: 1626 IGAYRQKRINLDSPLWLRWRLDQRVITSRETPIEVHYESLGTYHEIYGHYLIVRSIKKEI 1685

Query: 549  --------------------------RTIGPEFLRESRSRKGSFLIIDSNLLSNPTQRNM 608
                                      RTI PEFL ESRSRKGSF IIDSN LSNPTQRN 
Sbjct: 1686 ICIYVRTTVGHISLYREIEEAIQGSPRTIVPEFLCESRSRKGSFPIIDSNPLSNPTQRNR 1745

Query: 609  EVLMAERADLVFHNKVIDGTAIKRLISRLIDHFGMAYTSHILDQLKTLGFQQATATSISL 668
            EVLMAERA+LVFHNKVIDGTAIKRLISRLIDHFGMAYTSHILDQ+KTLGF+QATATSISL
Sbjct: 1746 EVLMAERANLVFHNKVIDGTAIKRLISRLIDHFGMAYTSHILDQVKTLGFRQATATSISL 1805

Query: 669  GIDDLLTIPSKGWLVQDAEQQSFILEKHHHYGNVHAVEKLRQSIEVWYATSEYLRQEMNP 728
            GIDDLLTIPSKGWLVQDAEQQSFILEKHHHYGNVHAVEKLRQSIE+WYATSEYLRQEMNP
Sbjct: 1806 GIDDLLTIPSKGWLVQDAEQQSFILEKHHHYGNVHAVEKLRQSIEIWYATSEYLRQEMNP 1865

Query: 729  NFRMTDPFNPVHIMSFSGARGNASQVHQLVGMRGLMSDPQGQMIDLPIQSNLREGLSLTE 788
            NFRMTDPFNPVH+MSFSGARGNASQVHQLVGMRGLMSDPQGQMIDLPIQSNLREGLSLTE
Sbjct: 1866 NFRMTDPFNPVHMMSFSGARGNASQVHQLVGMRGLMSDPQGQMIDLPIQSNLREGLSLTE 1925

Query: 789  YIISCYGARKGVVDTAVRTSDAGYLTRRLVEVVQHIVVRRTDCGTIRGISVSPGNRMIPE 806
            YIISCYGARKGVVDTAVRTSDAGYLTRRLVEVVQHIVVRRTDCGTIRGISVSP N M+PE
Sbjct: 1926 YIISCYGARKGVVDTAVRTSDAGYLTRRLVEVVQHIVVRRTDCGTIRGISVSPQNGMMPE 1985

BLAST of HG10017408 vs. ExPASy TrEMBL
Match: A0A4D6NNC8 (Multifunctional fusion protein OS=Vigna unguiculata OX=3917 GN=DEO72_LG11g731 PE=3 SV=1)

HSP 1 Score: 449.5 bits (1155), Expect = 3.8e-122
Identity = 221/234 (94.44%), Postives = 228/234 (97.44%), Query Frame = 0

Query: 806  EKREEVGGEMTRRYWNIHLEEMMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTA 865
            ++R+EV GEMTRRYWNI+LEEMMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTA
Sbjct: 3080 KRRKEVWGEMTRRYWNINLEEMMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTA 3139

Query: 866  RFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTT 925
            RFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAA +ARCHYVNKKWLGGMLTNW TT
Sbjct: 3140 RFLSEACDLVFDAASRGKQFLIVGTKNKAADSVARAAIKARCHYVNKKWLGGMLTNWYTT 3199

Query: 926  ETRLHKFRDLRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQ 985
            ETRLHKFRDLRTEQKTG LNRLPKRDAA+LKRQL HLQTYLGGIKYMTGLPDIVIIVDQQ
Sbjct: 3200 ETRLHKFRDLRTEQKTGRLNRLPKRDAAVLKRQLFHLQTYLGGIKYMTGLPDIVIIVDQQ 3259

Query: 986  EEYRALQECITLGIPTICLIDTNCDPDLADISIPANDDAIASIRLILNKLVFAI 1040
            EEY AL+ECITLGIPTICLIDTNCDPDLADISIPANDDAIASIRLILNKLVFAI
Sbjct: 3260 EEYTALRECITLGIPTICLIDTNCDPDLADISIPANDDAIASIRLILNKLVFAI 3313


HSP 2 Score: 1445.6 bits (3741), Expect = 0.0e+00
Identity = 747/920 (81.20%), Postives = 766/920 (83.26%), Query Frame = 0

Query: 15   KKLSSHSIELGCP-GS--DMWLAWPIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTTQGFD 74
            +KLS   ++ G P GS  +   A PIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTT GFD
Sbjct: 155  RKLSRPILKGGDPIGSYPNFSFARPIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTTPGFD 214

Query: 75   TFRNREISTGAGAIREQLADLDLRLIIDYSLVEWKELGEEGPAGNEWEDRKVGRRKDFLV 134
            TFRNREISTGAGAIREQLADLDLR+IIDYSL+EWKELGEEG  GNEWEDRKVGRRKDFLV
Sbjct: 215  TFRNREISTGAGAIREQLADLDLRIIIDYSLLEWKELGEEGSTGNEWEDRKVGRRKDFLV 274

Query: 135  RRMELAKHFIRTNIEPEWMVLCLLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNT 194
            RRMELAKHFIRTNIEPEWMVLCLLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNT
Sbjct: 275  RRMELAKHFIRTNIEPEWMVLCLLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNT 334

Query: 195  LIDLLTTSRSTPGELVMCQEKLVQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEG 254
            LIDLLTTSRSTPGELVMCQEKLVQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEG
Sbjct: 335  LIDLLTTSRSTPGELVMCQEKLVQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEG 394

Query: 255  RFRETLLGKRVDYSGRSVIVVGPSLSLHRCGLPREIAIELFQTFLIRGLIRQHFASNIGV 314
            RFRETLLGKRVDYSGRSVIVVGPSLSLHRCGLPREIAIELFQTF+IRGLIRQHFASNIGV
Sbjct: 395  RFRETLLGKRVDYSGRSVIVVGPSLSLHRCGLPREIAIELFQTFVIRGLIRQHFASNIGV 454

Query: 315  AKSKIREKEPIVWEILQEVMQGHPVLLNRAPTLHRLGIQAFQPILVEGRAICLHPLVCKG 374
            AKSKIREKEP+VWEILQEVMQGHPVLLNRAPTLHRLGIQAFQPILVEG AICLHPLVCKG
Sbjct: 455  AKSKIREKEPVVWEILQEVMQGHPVLLNRAPTLHRLGIQAFQPILVEGHAICLHPLVCKG 514

Query: 375  FNADFDGDQMAVHVPLSLEAQAEARLLMFSHMNLLSSAIGDPISVPTQDMLIGLYVLTSG 434
            FNADFDGDQMAVHVPLSLEAQAEARLLMFSHMNLLS AIGDPISVPTQDMLIGLYVLTSG
Sbjct: 515  FNADFDGDQMAVHVPLSLEAQAEARLLMFSHMNLLSPAIGDPISVPTQDMLIGLYVLTSG 574

Query: 435  NRRGICANR--------------------------------------------------- 494
            NRRGICANR                                                   
Sbjct: 575  NRRGICANRYNPYNRTNSKNERIADNNYKYTKEPFFCNSYDAIGAYRQKRVNLDSPLWLR 634

Query: 495  -------------------------------------------------TIG-------- 554
                                                             T+G        
Sbjct: 635  WRLDQRVITSRETPIEVHYESLGTSHEIYGHYVIVRSIKKEVLCIYVRTTVGHISLYREI 694

Query: 555  ------------------PEFLRESRSRKGSFLIIDSNLLSNPTQRNMEVLMAERADLVF 614
                              PEFL ESRSRKGSF I DSN LSNPTQRN EVLMAERA LVF
Sbjct: 695  EEAIQGFCRAYSYARTTVPEFLCESRSRKGSFPITDSNPLSNPTQRNREVLMAERASLVF 754

Query: 615  HNKVIDGTAIKRLISRLIDHFGMAYTSHILDQLKTLGFQQATATSISLGIDDLLTIPSKG 674
            HNKVIDGTAIKRLISRLIDHFGMAYTSHILDQ+KTLGF+QATATSISLGIDDLLTIPSKG
Sbjct: 755  HNKVIDGTAIKRLISRLIDHFGMAYTSHILDQVKTLGFRQATATSISLGIDDLLTIPSKG 814

Query: 675  WLVQDAEQQSFILEKHHHYGNVHAVEKLRQSIEVWYATSEYLRQEMNPNFRMTDPFNPVH 734
            WLVQDAEQQS ILEKHHHYGNVHAVEKLRQSIE+WYATSEYLRQEMNPNFRMTDPFNPVH
Sbjct: 815  WLVQDAEQQSLILEKHHHYGNVHAVEKLRQSIEIWYATSEYLRQEMNPNFRMTDPFNPVH 874

Query: 735  IMSFSGARGNASQVHQLVGMRGLMSDPQGQMIDLPIQSNLREGLSLTEYIISCYGARKGV 794
            +MSFSGARGNASQVHQLVGMRGLMSDPQGQMIDLPIQSNLREGLSLTEYIISCYGARKGV
Sbjct: 875  MMSFSGARGNASQVHQLVGMRGLMSDPQGQMIDLPIQSNLREGLSLTEYIISCYGARKGV 934

Query: 795  VDTAVRTSDAGYLTRRLVEVVQHIVVRRTDCGTIRGISVSPGNRMIPERIFIQTLIGRVL 806
            VDTAVRTSDAGYLTRRLVEVVQHIVVRRTDCGT+RGISVSP N M+PERIFIQTLIGRVL
Sbjct: 935  VDTAVRTSDAGYLTRRLVEVVQHIVVRRTDCGTVRGISVSPRNGMMPERIFIQTLIGRVL 994

BLAST of HG10017408 vs. ExPASy TrEMBL
Match: A0A5H2Y107 (Multifunctional fusion protein (Fragment) OS=Prunus dulcis OX=3755 GN=Prudu_904S000200 PE=3 SV=1)

HSP 1 Score: 1440.6 bits (3728), Expect = 0.0e+00
Identity = 738/885 (83.39%), Postives = 754/885 (85.20%), Query Frame = 0

Query: 36   PIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTTQGFDTFRNREISTGAGAIREQLADLDLR 95
            PIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTTQGFDTFRNREISTGAGAIREQLADLDLR
Sbjct: 1228 PIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTTQGFDTFRNREISTGAGAIREQLADLDLR 1287

Query: 96   LIIDYSLVEWKELGEEGPAGNEWEDRKVGRRKDFLVRRMELAKHFIRTNIEPEWMVLCLL 155
            +IIDYSLVEWKELGEEGP GNEWEDRKVGRRKDFLVRRMELAKHFIRTNIEPEWMVLCLL
Sbjct: 1288 IIIDYSLVEWKELGEEGPTGNEWEDRKVGRRKDFLVRRMELAKHFIRTNIEPEWMVLCLL 1347

Query: 156  PVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTLIDLLTTSRSTPGELVMCQEKLVQ 215
            PVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTLIDLLTTSRSTPGELVMCQEKLVQ
Sbjct: 1348 PVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTLIDLLTTSRSTPGELVMCQEKLVQ 1407

Query: 216  EAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSVIVVGPS 275
            EAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSVIVVGPS
Sbjct: 1408 EAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSVIVVGPS 1467

Query: 276  LSLHRCGLPREIAIELFQTFLIRGLIRQHFASNIGVAKSKIREKEPIVWEILQEVMQGHP 335
            LSLHRCGLPREIAIELFQTF+IRGLIRQHFASNIGVAKSKIREKEP+VWEIL EVMQGHP
Sbjct: 1468 LSLHRCGLPREIAIELFQTFVIRGLIRQHFASNIGVAKSKIREKEPVVWEILHEVMQGHP 1527

Query: 336  VLLNRAPTLHRLGIQAFQPILVEGRAICLHPLVCKGFNADFDGDQMAVHVPLSLEAQAEA 395
            VLLNRAPTLHRLGIQAFQPILVEG AICLHPLVCKGFNADFDGDQMAVHVPLSLEAQAEA
Sbjct: 1528 VLLNRAPTLHRLGIQAFQPILVEGHAICLHPLVCKGFNADFDGDQMAVHVPLSLEAQAEA 1587

Query: 396  RLLMFSHMNLLSSAIGDPISVPTQDMLIGLYVLTSGNRRGICANR--------------- 455
            RLLMFSHMNLLS AIGDPISVPTQDMLIGLYVLTSGNRRGICANR               
Sbjct: 1588 RLLMFSHMNLLSPAIGDPISVPTQDMLIGLYVLTSGNRRGICANRYNPCNRRNYQNKRID 1647

Query: 456  -------------------TIGPE------------------------------------ 515
                                IG                                      
Sbjct: 1648 DNSYKYTKEKEPFFCNSYDAIGAYRQKRINLDSPLWLRWRLDQRVITSRETPIEVHYESL 1707

Query: 516  -----------------FLRESR----------------------------SRKGSFLII 575
                             F+ +S+                            SRKGSF II
Sbjct: 1708 GTYHEIYGHYLILVIFLFIEKSKRPSKGFAEPTHMVPNHMVSKRSNSMTPVSRKGSFPII 1767

Query: 576  DSNLLSNPTQRNMEVLMAERADLVFHNKVIDGTAIKRLISRLIDHFGMAYTSHILDQLKT 635
            DSN LSNPTQRN EVLMAERA+LVFHNKVIDGTAIKRLI RLIDHFGMAYTSHILDQ+KT
Sbjct: 1768 DSNPLSNPTQRNREVLMAERANLVFHNKVIDGTAIKRLIRRLIDHFGMAYTSHILDQVKT 1827

Query: 636  LGFQQATATSISLGIDDLLTIPSKGWLVQDAEQQSFILEKHHHYGNVHAVEKLRQSIEVW 695
            LGF+QATATSISLGIDDLLTIPSKGWLVQDAEQQSFILEKHHHYGNVHAVEKLRQSIE+W
Sbjct: 1828 LGFRQATATSISLGIDDLLTIPSKGWLVQDAEQQSFILEKHHHYGNVHAVEKLRQSIEIW 1887

Query: 696  YATSEYLRQEMNPNFRMTDPFNPVHIMSFSGARGNASQVHQLVGMRGLMSDPQGQMIDLP 755
            YATSEYLRQEMNPNFRMTDPFNPVH+MSFSGARGNASQVHQLVGMRGLMSDPQGQMIDLP
Sbjct: 1888 YATSEYLRQEMNPNFRMTDPFNPVHMMSFSGARGNASQVHQLVGMRGLMSDPQGQMIDLP 1947

Query: 756  IQSNLREGLSLTEYIISCYGARKGVVDTAVRTSDAGYLTRRLVEVVQHIVVRRTDCGTIR 806
            IQSNLREGLSLTEYIISCYGARKGVVDTAVRTSDAGYLTRRLVEVVQHIVVRRTDCGTIR
Sbjct: 1948 IQSNLREGLSLTEYIISCYGARKGVVDTAVRTSDAGYLTRRLVEVVQHIVVRRTDCGTIR 2007

BLAST of HG10017408 vs. TAIR 10
Match: ATCG00180.1 (DNA-directed RNA polymerase family protein )

HSP 1 Score: 788.5 bits (2035), Expect = 6.7e-228
Identity = 388/411 (94.40%), Postives = 397/411 (96.59%), Query Frame = 0

Query: 30  DMWLAWPIAKKPTFLRLRGSFEYEIQSWKYSIPLFFTTQGFDTFRNREISTGAGAIREQL 89
           D   A PI KKPTFLRLRGSFEYEIQSWKYSIPLFFTTQGFD FRNREISTGAGAIREQL
Sbjct: 144 DFSFARPITKKPTFLRLRGSFEYEIQSWKYSIPLFFTTQGFDIFRNREISTGAGAIREQL 203

Query: 90  ADLDLRLIIDYSLVEWKELGEEGPAGNEWEDRKVGRRKDFLVRRMELAKHFIRTNIEPEW 149
           ADLDLR+II+ SLVEWK+LGEEGP GNEWEDRK+ RRKDFLVRRMELAKHFIRTNIEPEW
Sbjct: 204 ADLDLRIIIENSLVEWKQLGEEGPTGNEWEDRKIVRRKDFLVRRMELAKHFIRTNIEPEW 263

Query: 150 MVLCLLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTLIDLLTTSRSTPGELVMC 209
           MVLCLLPVLPPELRPIIQI+GGKLMSSDINELYRRVIYRNNTL DLLTTSRSTPGELVMC
Sbjct: 264 MVLCLLPVLPPELRPIIQIEGGKLMSSDINELYRRVIYRNNTLTDLLTTSRSTPGELVMC 323

Query: 210 QEKLVQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSV 269
           QEKLVQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSV
Sbjct: 324 QEKLVQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSV 383

Query: 270 IVVGPSLSLHRCGLPREIAIELFQTFLIRGLIRQHFASNIGVAKSKIREKEPIVWEILQE 329
           IVVGPSLSLHRCGLPREIAIELFQTF+IRGLIRQH ASNIGVAKS+IREK+PIVWEILQE
Sbjct: 384 IVVGPSLSLHRCGLPREIAIELFQTFVIRGLIRQHLASNIGVAKSQIREKKPIVWEILQE 443

Query: 330 VMQGHPVLLNRAPTLHRLGIQAFQPILVEGRAICLHPLVCKGFNADFDGDQMAVHVPLSL 389
           VMQGHPVLLNRAPTLHRLGIQ+FQPILVEGR ICLHPLVCKGFNADFDGDQMAVHVPLSL
Sbjct: 444 VMQGHPVLLNRAPTLHRLGIQSFQPILVEGRTICLHPLVCKGFNADFDGDQMAVHVPLSL 503

Query: 390 EAQAEARLLMFSHMNLLSSAIGDPISVPTQDMLIGLYVLTSGNRRGICANR 441
           EAQAEARLLMFSHMNLLS AIGDPISVPTQDMLIGLYVLTSG RRGICANR
Sbjct: 504 EAQAEARLLMFSHMNLLSPAIGDPISVPTQDMLIGLYVLTSGTRRGICANR 554

BLAST of HG10017408 vs. TAIR 10
Match: ATCG00170.1 (DNA-directed RNA polymerase family protein )

HSP 1 Score: 630.6 bits (1625), Expect = 2.3e-180
Identity = 311/331 (93.96%), Postives = 321/331 (96.98%), Query Frame = 0

Query: 477 MAERADLVFHNKVIDGTAIKRLISRLIDHFGMAYTSHILDQLKTLGFQQATATSISLGID 536
           MAERA+LVFHNKVIDGTAIKRLISRLIDHFGMAYTSHILDQ+KTLGFQQATATSISLGID
Sbjct: 1   MAERANLVFHNKVIDGTAIKRLISRLIDHFGMAYTSHILDQVKTLGFQQATATSISLGID 60

Query: 537 DLLTIPSKGWLVQDAEQQSFILEKHHHYGNVHAVEKLRQSIEVWYATSEYLRQEMNPNFR 596
           DLLTIPSKGWLVQDAEQQS+ILEKHHHYGNVHAVEKLRQSIE+WYATSEYLRQEMNPNFR
Sbjct: 61  DLLTIPSKGWLVQDAEQQSWILEKHHHYGNVHAVEKLRQSIEIWYATSEYLRQEMNPNFR 120

Query: 597 MTDPFNPVHIMSFSGARGNASQVHQLVGMRGLMSDPQGQMIDLPIQSNLREGLSLTEYII 656
           MTDPFNPVH+MSFSGARGNASQVHQLVGMRGLMSDPQGQMIDLPIQSNLREGLSLTEYII
Sbjct: 121 MTDPFNPVHMMSFSGARGNASQVHQLVGMRGLMSDPQGQMIDLPIQSNLREGLSLTEYII 180

Query: 657 SCYGARKGVVDTAVRTSDAGYLTRRLVEVVQHIVVRRTDCGTIRGISVSP--GNRMIPER 716
           SCYGARKGVVDTAVRTSDAGYLTRRLVEVVQHIVVRRTDCGTIRGISVSP   NRM+ ER
Sbjct: 181 SCYGARKGVVDTAVRTSDAGYLTRRLVEVVQHIVVRRTDCGTIRGISVSPRNKNRMMSER 240

Query: 717 IFIQTLIGRVLADDIYMGPRCIGIRNQDIGIGLINRFITFQTQPISIRTPFTCRSTSWIC 776
           IFIQTLIGRVLADDIY+G RC+  RNQD+GIGL+NR ITF TQ ISIRTPFTCRSTSWIC
Sbjct: 241 IFIQTLIGRVLADDIYIGSRCVAFRNQDLGIGLVNRLITFGTQSISIRTPFTCRSTSWIC 300

Query: 777 RLCYGRSPTHGDLVELGEAVGIIAGQSIGEP 806
           RLCYGRSPTHGDLVELGEAVGIIAGQSIGEP
Sbjct: 301 RLCYGRSPTHGDLVELGEAVGIIAGQSIGEP 331

BLAST of HG10017408 vs. TAIR 10
Match: ATCG00160.1 (ribosomal protein S2 )

HSP 1 Score: 435.3 bits (1118), Expect = 1.4e-121
Identity = 215/236 (91.10%), Postives = 223/236 (94.49%), Query Frame = 0

Query: 815  MTRRYWNIHLEEMMEAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDL 874
            MT+RYWNI LEEMM AGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDL
Sbjct: 1    MTKRYWNIDLEEMMRAGVHFGHGTRKWNPRMAPYISAKRKGIHIINLTRTARFLSEACDL 60

Query: 875  VFDAASRGKQFLIVGTKNKAADSVARAATRARCHYVNKKWLGGMLTNWSTTETRLHKFRD 934
            VFDAASRGKQFLIVGTKNKAAD V+RAA RARCHYVNKKWLGGMLTNWSTTE RLHKFRD
Sbjct: 61   VFDAASRGKQFLIVGTKNKAADLVSRAAIRARCHYVNKKWLGGMLTNWSTTEKRLHKFRD 120

Query: 935  LRTEQKTGGLNRLPKRDAAMLKRQLSHLQTYLGGIKYMTGLPDIVIIVDQQEEYRALQEC 994
            LRTEQKT G NRLPKRDAA+LKRQLS L+TYLGGIKYMTGLPDIVII+DQQEEY AL+EC
Sbjct: 121  LRTEQKTEGFNRLPKRDAAVLKRQLSRLETYLGGIKYMTGLPDIVIILDQQEEYTALREC 180

Query: 995  ITLGIPTICLIDTNCDPDLADISIPANDDAIASIRLILNKLVFAISEGRSSSIRNS 1051
            ITLGIPTI LIDTNC+PDLADISIPANDDAIASIR ILNKLVFAI EGRSS I+NS
Sbjct: 181  ITLGIPTISLIDTNCNPDLADISIPANDDAIASIRFILNKLVFAICEGRSSYIQNS 236

BLAST of HG10017408 vs. TAIR 10
Match: AT4G35800.1 (RNA polymerase II large subunit )

HSP 1 Score: 142.1 bits (357), Expect = 2.5e-33
Identity = 150/643 (23.33%), Postives = 255/643 (39.66%), Query Frame = 0

Query: 147 PEWMVLCLLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTLIDLLTTSRSTPGEL 206
           P+WM+L +LP+ PP +RP + +D       D+      +I  N  L          P  +
Sbjct: 242 PDWMILEVLPIPPPPVRPSVMMDATSRSEDDLTHQLAMIIRHNENL--KRQEKNGAPAHI 301

Query: 207 VMCQEKLVQEAVDTLLDNGIRGQP-MRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYS 266
           +    +L+Q  + T  DN + GQP       +  KS    ++ KEGR R  L+GKRVD+S
Sbjct: 302 ISEFTQLLQFHIATYFDNELPGQPRATQKSGRPIKSICSRLKAKEGRIRGNLMGKRVDFS 361

Query: 267 GRSVIVVGPSLSLHRCGLPREIAIEL--------FQTFLIRGLI--RQHFASNIGVAKSK 326
            R+VI   P++++   G+P  IA+ L        +    ++ L+    H       AK  
Sbjct: 362 ARTVITPDPTINIDELGVPWSIALNLTYPETVTPYNIERLKELVDYGPHPPPGKTGAKYI 421

Query: 327 IREKE----------------PIVWEILQEVMQGHPVLLNRAPTLHRLGIQAFQPILVEG 386
           IR+                   + +++ + +  G  VL NR P+LH++ I   +  ++  
Sbjct: 422 IRDDGQRLDLRYLKKSSDQHLELGYKVERHLQDGDFVLFNRQPSLHKMSIMGHRIRIMPY 481

Query: 387 RAICLHPLVCKGFNADFDGDQMAVHVPLSLEAQAEARLLMFSHMNLLSSAIGDPISVPTQ 446
               L+  V   +NADFDGD+M +HVP S E +AE   LM     ++S     P+    Q
Sbjct: 482 STFRLNLSVTSPYNADFDGDEMNMHVPQSFETRAEVLELMMVPKCIVSPQANRPVMGIVQ 541

Query: 447 DMLIGLYVLTSGN---RRGICAN----------RTIGPEFLRESRSRKGSFLIIDSNLLS 506
           D L+G   +T  +    + +  N          +   P  L+      G  +    NL+ 
Sbjct: 542 DTLLGCRKITKRDTFIEKDVFMNTLMWWEDFDGKVPAPAILKPRPLWTGKQVF---NLII 601

Query: 507 NPTQRNMEVLMAERADL-------------VFHNKVIDGTAIKR--------LISRLIDH 566
            P Q N+    A  AD              +   +++ GT  K+        L+  + + 
Sbjct: 602 -PKQINLLRYSAWHADTETGFITPGDTQVRIERGELLAGTLCKKTLGTSNGSLVHVIWEE 661

Query: 567 FGMAYTSHILDQLKTLGFQQATATSISLGIDDLL-----------TIPSKGWLVQDAEQQ 626
            G       L   + L          ++GI D +           TI +    V+D  +Q
Sbjct: 662 VGPDAARKFLGHTQWLVNYWLLQNGFTIGIGDTIADSSTMEKINETISNAKTAVKDLIRQ 721

Query: 627 SFILEKHHHYGNVHAVEKLRQSIEVWYATSEYLRQEMNPNFRMTDPFNPVHIMSFSGARG 686
               E     G         +  +V     +        +   T+    +      G+  
Sbjct: 722 FQGKELDPEPGRTMRDTFENRVNQVLNKARDDAGSSAQKSLAETNNLKAMVTAGSKGSFI 781

Query: 687 NASQVHQLVGMRGL-----------------MSDPQGQMIDLPIQSNLREGLSLTEYIIS 701
           N SQ+   VG + +                   D  G      ++++   GL+  E+   
Sbjct: 782 NISQMTACVGQQNVEGKRIPFGFDGRTLPHFTKDDYGPESRGFVENSYLRGLTPQEFFFH 841

BLAST of HG10017408 vs. TAIR 10
Match: AT5G60040.1 (nuclear RNA polymerase C1 )

HSP 1 Score: 140.2 bits (352), Expect = 9.5e-33
Identity = 155/659 (23.52%), Postives = 278/659 (42.19%), Query Frame = 0

Query: 147 PEWMVLCLLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTLIDLLTTSRSTPGEL 206
           PE +++  + V P  +RP + I G +   +D+    +++I  N +L  +L+   S+P  +
Sbjct: 250 PENLIITCMLVPPLSIRPSVMIGGIQSNENDLTARLKQIILGNASLHKILSQPTSSPKNM 309

Query: 207 VMCQEKLVQEAVDTLLDNGIRG-QPMRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYS 266
            +     VQ  V   +++ +RG Q   + H          ++GK GRFR  L GKRV+++
Sbjct: 310 QVWD--TVQIEVARYINSEVRGCQNQPEEH--PLSGILQRLKGKGGRFRANLSGKRVEFT 369

Query: 267 GRSVIVVGPSLSLHRCGLPREIA-------------IELFQTFLIRGLIRQHFASN---- 326
           GR+VI   P+L +   G+P  +A             IE  +  +  G  +   A N    
Sbjct: 370 GRTVISPDPNLKITEVGIPILMAQILTFPECVSRHNIEKLRQCVRNGPNKYPGARNVRYP 429

Query: 327 -------IGVAKSKIREKEPIVWEILQEVMQGHPVLLNRAPTLHRLGIQAFQPILVEGRA 386
                  +G  + +I ++  I   + + + +G  VL NR P+LHR+ I   +  ++  R 
Sbjct: 430 DGSSRTLVGDYRKRIADELAIGCIVDRHLQEGDVVLFNRQPSLHRMSIMCHRARIMPWRT 489

Query: 387 ICLHPLVCKGFNADFDGDQMAVHVPLSLEAQAEARLLMFSHMNLLSSAIGDPISVPTQDM 446
           +  +  VC  +NADFDGD+M +HVP + EA+ EA  LM    NL +   G+ +   TQD 
Sbjct: 490 LRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAITLMGVQNNLCTPKNGEILVASTQDF 549

Query: 447 LIGLYVLTSGNR-------RGICANRTIGPEFL--------------------------- 506
           L   +++T  +          IC+    G + +                           
Sbjct: 550 LTSSFLITRKDTFYDRAAFSLICSYMGDGMDSIDLPTPTILKPIELWTGKQIFSVLLRPN 609

Query: 507 -----------RESRSRKGSFLIIDSNLLSNPTQ--RNMEVLMAERADLVFHNKVIDGTA 566
                      +E   +KG     ++  +++     RN E++  +       N   DG  
Sbjct: 610 ASIRVYVTLNVKEKNFKKGEHGFDETMCINDGWVYFRNSELISGQLGKATLGNGNKDG-- 669

Query: 567 IKRLISRLIDHFGMAYTSHILDQLKTLGFQQATATSISLGIDDLLTIPSKGWLVQDAEQQ 626
              L S L+  +     +  +++L  L  +       S+GIDD+   P +    +  +  
Sbjct: 670 ---LYSILLRDYNSHAAAVCMNRLAKLSARWIGIHGFSIGIDDVQ--PGEELSKERKDSI 729

Query: 627 SFILEKHH------HYGNVHAVEKL--RQSIEVWYATSEYLRQEMNPNFRMTDPF--NPV 686
            F  ++ H      + GN+     L   +S+E          +E      M+     N  
Sbjct: 730 QFGYDQCHRKIEEFNRGNLQLKAGLDGAKSLEAEITGILNTIREATGKACMSGLHWRNSP 789

Query: 687 HIMSFSGARGNASQVHQLVGMRGLMS-----DPQG----------QMIDLP-----IQSN 704
            IMS  G++G+   + Q+V   G  +      P G          +M   P     + ++
Sbjct: 790 LIMSQCGSKGSPINISQMVACVGQQTVNGHRAPDGFIDRSLPHFPRMSKSPAAKGFVANS 849

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAB5511205.10.0e+0057.97hypothetical protein DKX38_030130 [Salix brachista][more]
KAG8363203.10.0e+0052.94hypothetical protein BUALT_BualtPtG0001100 [Buddleja alternifolia][more]
THG00578.10.0e+0060.27hypothetical protein TEA_001457 [Camellia sinensis var. sinensis][more]
QCE13735.10.0e+0081.76DNA-directed RNA polymerase subunit beta' [Vigna unguiculata][more]
QCE13735.17.9e-12294.44DNA-directed RNA polymerase subunit beta' [Vigna unguiculata][more]
Match NameE-valueIdentityDescription
Q4VZP21.0e-23699.03DNA-directed RNA polymerase subunit beta' OS=Cucumis sativus OX=3659 GN=rpoC1 PE... [more]
B1NWE11.1e-23096.35DNA-directed RNA polymerase subunit beta' OS=Manihot esculenta OX=3983 GN=rpoC1 ... [more]
Q68S151.8e-23096.35DNA-directed RNA polymerase subunit beta' OS=Panax ginseng OX=4054 GN=rpoC1 PE=3... [more]
Q0ZJ291.2e-22995.86DNA-directed RNA polymerase subunit beta' OS=Vitis vinifera OX=29760 GN=rpoC1 PE... [more]
Q2L8Z41.5e-22995.62DNA-directed RNA polymerase subunit beta' OS=Gossypium hirsutum OX=3635 GN=rpoC1... [more]
Match NameE-valueIdentityDescription
A0A5N5J0040.0e+0057.97Multifunctional fusion protein OS=Salix brachista OX=2182728 GN=rps2 PE=3 SV=1[more]
A0A4S4DD750.0e+0060.27DNA-directed RNA polymerase subunit OS=Camellia sinensis var. sinensis OX=542762... [more]
A0A4D6NNC80.0e+0081.76Multifunctional fusion protein OS=Vigna unguiculata OX=3917 GN=DEO72_LG11g731 PE... [more]
A0A4D6NNC83.8e-12294.44Multifunctional fusion protein OS=Vigna unguiculata OX=3917 GN=DEO72_LG11g731 PE... [more]
A0A5H2Y1070.0e+0083.39Multifunctional fusion protein (Fragment) OS=Prunus dulcis OX=3755 GN=Prudu_904S... [more]
Match NameE-valueIdentityDescription
ATCG00180.16.7e-22894.40DNA-directed RNA polymerase family protein [more]
ATCG00170.12.3e-18093.96DNA-directed RNA polymerase family protein [more]
ATCG00160.11.4e-12191.10ribosomal protein S2 [more]
AT4G35800.12.5e-3323.33RNA polymerase II large subunit [more]
AT5G60040.19.5e-3323.52nuclear RNA polymerase C1 [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001865Ribosomal protein S2PRINTSPR00395RIBOSOMALS2coord: 824..842
score: 46.72
coord: 905..922
score: 46.92
coord: 855..864
score: 61.69
coord: 993..1004
score: 50.51
coord: 1014..1028
score: 55.28
coord: 976..993
score: 39.83
IPR001865Ribosomal protein S2PFAMPF00318Ribosomal_S2coord: 827..1042
e-value: 2.0E-78
score: 262.5
IPR001865Ribosomal protein S2CDDcd01425RPS2coord: 827..1041
e-value: 4.72298E-85
score: 270.992
IPR006592RNA polymerase, N-terminalSMARTSM00663rpolaneu7coord: 148..429
e-value: 2.9E-156
score: 535.1
IPR007083RNA polymerase Rpb1, domain 4PFAMPF05000RNA_pol_Rpb1_4coord: 575..633
e-value: 7.1E-10
score: 38.8
NoneNo IPR availableGENE3D2.40.40.20coord: 247..400
e-value: 5.6E-46
score: 157.7
NoneNo IPR availableGENE3D1.10.40.90coord: 288..332
e-value: 6.8E-16
score: 60.1
NoneNo IPR availableGENE3D1.10.287.610Helix hairpin bincoord: 920..970
e-value: 1.3E-24
score: 88.2
NoneNo IPR availableGENE3D3.40.50.10490coord: 823..1050
e-value: 2.0E-76
score: 257.6
NoneNo IPR availablePANTHERPTHR19376:SF47DNA-DIRECTED RNA POLYMERASE SUBUNIT BETA''coord: 477..949
NoneNo IPR availablePANTHERPTHR19376:SF47DNA-DIRECTED RNA POLYMERASE SUBUNIT BETA''coord: 174..440
NoneNo IPR availablePANTHERPTHR19376DNA-DIRECTED RNA POLYMERASEcoord: 477..949
coord: 174..440
NoneNo IPR availableSUPERFAMILY64484beta and beta-prime subunits of DNA dependent RNA-polymerasecoord: 57..805
IPR042102RNA polymerase Rpb1, domain 3 superfamilyGENE3D1.10.274.100RNA polymerase Rpb1, domain 3coord: 458..534
e-value: 3.8E-11
score: 45.3
coord: 404..457
e-value: 2.4E-8
score: 36.2
IPR007080RNA polymerase Rpb1, domain 1PFAMPF04997RNA_pol_Rpb1_1coord: 138..257
e-value: 1.7E-31
score: 109.7
IPR038120RNA polymerase Rpb1, funnel domain superfamilyGENE3D1.10.132.30coord: 535..672
e-value: 1.4E-46
score: 160.0
IPR007081RNA polymerase Rpb1, domain 5PFAMPF04998RNA_pol_Rpb1_5coord: 648..805
e-value: 7.6E-28
score: 97.7
IPR005706Ribosomal protein S2, bacteria/mitochondria/plastidTIGRFAMTIGR01011TIGR01011coord: 822..1044
e-value: 4.8E-79
score: 262.7
IPR005706Ribosomal protein S2, bacteria/mitochondria/plastidHAMAPMF_00291_BRibosomal_S2_Bcoord: 820..1043
score: 37.793209
IPR000722RNA polymerase, alpha subunitPFAMPF00623RNA_pol_Rpb1_2coord: 259..306
e-value: 7.4E-7
score: 29.5
coord: 314..399
e-value: 2.8E-26
score: 92.7
IPR018130Ribosomal protein S2, conserved sitePROSITEPS00962RIBOSOMAL_S2_1coord: 824..835
IPR018130Ribosomal protein S2, conserved sitePROSITEPS00963RIBOSOMAL_S2_2coord: 976..1000
IPR023591Ribosomal protein S2, flavodoxin-like domain superfamilySUPERFAMILY52313Ribosomal protein S2coord: 824..1045

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10017408.1HG10017408.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015986 ATP synthesis coupled proton transport
biological_process GO:0015031 protein transport
biological_process GO:0006351 transcription, DNA-templated
biological_process GO:0006412 translation
biological_process GO:0048193 Golgi vesicle transport
cellular_component GO:0045261 proton-transporting ATP synthase complex, catalytic core F(1)
cellular_component GO:0015935 small ribosomal subunit
cellular_component GO:0045263 proton-transporting ATP synthase complex, coupling factor F(o)
cellular_component GO:0005743 mitochondrial inner membrane
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0009507 chloroplast
cellular_component GO:0005840 ribosome
molecular_function GO:0005524 ATP binding
molecular_function GO:0003677 DNA binding
molecular_function GO:0003899 DNA-directed 5'-3' RNA polymerase activity
molecular_function GO:0046872 metal ion binding
molecular_function GO:0046933 proton-transporting ATP synthase activity, rotational mechanism
molecular_function GO:0032549 ribonucleoside binding
molecular_function GO:0003735 structural constituent of ribosome