Cp4.1LG04g07860 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG04g07860
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionDNA-directed RNA polymerase II subunit, putative
LocationCp4.1LG04 : 3771645 .. 3780826 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATATTTTTGGGTCGAGTCTATTCACTGGCAGACCGGACCGGGTCGGTTTTTCACCAACAGCCTTTTAGAAGGTCTCCACATCTCAGAAACCCTCGGAAGTCGGAAGAGAGAGTAGGGAGCCATGGAAGGGGCATCGTACCAGCGCTTTCCCAAGGTGAAGATCCGAGAGCTCAGAGATGACTATGCCAAATTCGAGCTCCATGACACTGATGTTAGCATGGCCAACGCCCTCCGTCGCGTCATGATCGCTGAGGTCCCCACCATCGCCATCGACCTGGTAGAAATTGAAGTGAACTCCTCAGTTCTTAACGATGAGTTCATCGCCCATCGCCTTGGTCTCATTCCCCTCACTAGCGAGCGTGCCATGTCTATGCGCTTCTCTCGCGACTGTGATGCTTGTGATGGTGATGGCCAGTGCGAGTTTTGCTCTGTCGAGTTTCACCTTCGTGCCAAGTGCCACTCTGACCAAACCCTAGACGTCTCCAGCAAGGATCTCTATAGTTCTGATCACACTGTTGTCCCTGTTGATTTCTCTGATTCTGCAGCAGCTGCTGGTGAAGCGCTTGATACCAAGTATTTCCTCTTACTCTCATGTGTCTCGACTCGTAGACATGTTTGTTGATTATATTTTCCCAGCGAGATTCATTTGAGCTTTTAGCTGATTATTGCATGTAGTTGGGTAGGCGTATTCGTGGTCTGTACTGTTAAGTATGCAACGCCATCTGTTATTCGGTTTACAAATGGAGGGTAAATTTTTGGGTGAAAATACATCTTAGAGATAGTCTATCACTTCTACTTTGGAGCTTGACCGCCAAAGTTATAATGCGTTCTTTTCTTGCTATGCAACATTCCTGCATTATTGTAAGATAACATTCTTTTCATTTTCTATTGCGGTCATTTTCCTCTGCTTGTTGGTTCACTCACATGGCAGCAATTTCATGTTAGGTGGACAGTTGTGATAAAATGCTTGTGTTAGAACAGGTTTATGAGTTCATCTTCATGGTTCTGGATTTCCCTTGTATTTTTTTGTTTTTTGGTTTTTATGCTTCATGTTCTATGGTGAGCAAACATAAATTTTGGGACACCAATAATCTGTTTGACTGAATGCCAACGCACCTCACTACCTTAATGATGAATTTTCTTGCATTCTACTAAATTGCCTGTCTGAAATTTACATAAAAAGAAAAAAAAAAAATTGTAATGCGGAAAATATCATTTTAATCTAGTATCATTGTGGTTAGTGAAATAATTGCTCCTCAATTTGAGCGGGTTTTGTGATAATTGTAGAGGAATAATCATTGTGAAGCTACGTCGGGGTCAAGAATTGAGGTTGAGAGCCATAGCCAGGAAGGGCATTGGTAAAGATCATGCAAAATGGTCGCCTGCAGCAACTGTGACATTCATGTATGAACCAAGCATTCACATCAATGAAGACCTAATGGAAACATTAACACTCGAGGAAAAGACTTCCTGGGTTGAAAGTAGCCCAACCAAAGTCTTTGAATTGGATCCTGTTACCCACCAGGTTCTTCTCGCATAACCTCCACTTCTCTTTCTTTTTATTTGTTGCAATGGCTTTCTGTAATTTTCAAGCTGCTCTCTATAATATATGCAATTTCCCAGAGGGTTGTGTAGAAGAGCTGATCTGAACTGTATATTTCCTCTAATATCACTTTCTGTTGTTTATGGTTTGTCATGCGGGGTTGGCATATTTGAAACTTTTTAGAATAAACTATAGTACTGTGCTGTTTCTATCGCTTCTTGTAAGTGGGAGAGCTAATATAAGAAGCAATAAAGGTATTAGCTTTGTGGTGGATTTGTGTAACGACCTACTTTTCGAGGGACTCGAGGCATGAATTGCCACTAACTGGCGCAGAAATTGAAAACAACATAATAATTTGATTCAATTTGAAAACATGCATGCATTAACTTTAAATCAAGATAAAACAGTACAAGTCCATAAAAAAGACATCAGGGACAGTTTACAAAATACAACATCAAAACATGTACATGAAAATCTTCAAGAGTTTGAAATCAAGACTCTATTCAACTGGACACGCCCTCACCTCGATCCCACAACTTAGTTTCCATTTGACAAAAAAAAGTAGAATGCGTGAGCATAAAAATACTCGGGAAGTAGCTTGCTTGTAGGCTCTTAGCTTATTCTAAGTCTTATGTCATACACTTTCTTTAAGTTCATGTGTCTTATTACATACCTAGACGTCACATTTCTAAGTCATAGGCTCATCTAGGTTATGCTAGCTCACCCAAGCTCAACTTAGAGTTCTTGTTTGGTAATTTCGTGGCTTCTCATCATTCAGTACTATTAGAGCTCTTATTGATTCATTGGTTCTTGTAGTTTGGTTCCTCGAAGCTCAATCAATTCATCGCATCATAAGGTTCTAGGATAACCCAAATCTTTACTAGTTCCTTCATTCATAATCTACTTGCATATCATGCACACTTGAGGCATGACCTCATTATGGTTCTAACTCATAATCTCATCATTCCGTTACAGACACTATCAACCAAGACAGATTCTCTATTAGGGGTTAGTTGGCGTCTCTATAGACATACACTTAACCCCTTGGCCCGGTGGGAAATCCTAGTTTGAAAGGAGAACGAGGGAGGGAGTAGGGAGAGATTTTTTTCCTATTAGATAGTTTACATTCTGTAATTCTGTGACTTCAAAAAGAAATGCCTAGATCTTCTACATGTTCGAAAGCTTTTTGTTTGTGTTCTATGCAAGTCTTGAGATGGCAAGTCTGGTCCTTTATGATGTATACCTTCTTTTCATAACACAATTCAAGTGTTTTTCCCCTTCTGTTTCTTTTAATAGGAAACGACATTTCAATGCGATGAAATATACAATAAGTAGCAGTGCTGAAAAACTCATTGATTAAGAAAAACCTTCAGGTTAGCCAATAGATCCTCTAAACTGTAGTTAGAAAAAAGGTGCATTTACACCAAGTAAAGCAAGAACAAGAATTAAATTATAGAAGTATTTAAAAGAGACTTTTTTTTTTTGTTAGAAAGGCTGCTTATTCCTTTCTAACCAATAAATAAAATAAATAAAATAAAAGAACCCAGAAAAACAACCCATGTAGATGAAGAAAAACTTGTCTAAATGAAGTGTGGAAGGAGACAGATCCTTGATTTGTGCCTTATTGGAATAGATTGAAGTGGAAAGGGGCTATGATTAAGCTGGACATCAAATAAGCGTTTGATAAAGTGGATTGGGACATTCTTGATCTCCTCCTACAAGTTGAAGGATTTATCTCCCAACGGAGAGCATAGATCAGTGCCTGCATTTTTCACCAAATTACTCTATCATCTTTAAAGGGTGCCCGTGTGCTAAGATTTTGGTATCGAGGGGTCTTAGGCTTGCCTTATGAACCATGCTGCAGAAAATGGGTCCTTAAAGGGTCTCTTAGTGGTTTGAGAAGAGCGTAGTATCCATCGTTTACAAATTTTGGATGATAGTGTTCTCTTCTCCATCCTTGATGACAGTCCTATCCACAACACTTTCTCCATTAGTGAAATCCTTTGAGATGGCTTCAAGCCTAAACATTAATGCCGAAAAGTTCAAGATTTTGGGCATCAACTTAGAAGAGGAGGAGCTGGAAAATTTGGCCTCTTTGTTCGGTCGGAAAAGGGAAGCTGGCCGAATGTCTACGCCACAAAAATAGAGAGGAGGTTGCTAGCATGGGAGTCTGTCCATATCTCAAAGTGAGGTTGTCTGACATTATTAGAAGTGTCCTTTCAAATCTTCCCACTTACTACTATCACTGTTCTCATTATCAAAAAAGGTTGCTGCCATAATAGAAAGCGCATGAAGCTTACATTAAAAATAAAGGTCCGATGGGAGAAAATTATTCTTCCCACAGTCAGTTGTGGTCTTGGCATCCAAATGTCTACCAGAAGAACCATTCTCTCCTTGCCAAATGGGCTTGGCGGTTTTGGTTTGAAAAATCAGCCCGTTGGAGACATGAATATTGCTAAATATGGGTCAACCCATTTCAATCTTAAACTGAGATCCTTAATGGTCCGCAAATAAAATGGTTCCTGGAAGGTCATCCTCTGTATGCAAGACCTCATCTTCGACCGATCTTTGACCCATCTTCGTCCATATTCATCATAGAGTTGGGAATGGAGAAACTACTTTTTGGCATGATGACATGTTGCCCTGTGGTCCACTTGCCCACTGGTTCCCTCTATTATTCTCCATCTCAAACATTAAGGCAGCTTCCATTCATGATATCTGGAGTGTTGAAAGGAGATTTGGGGATTTATCTTTAATAAGGAATCCCAAGGAGGAGGAAATTTCAGAATGGGCTGATTTATCCACAAATCTCCTTCCTATCCAACTCTCTCCTACTTGCGTTTCATGAGTTTGGAAATTGGAAAATGATGGCATTTTTTCTACAAAGTCTCTCCTAATGGACTTTGTCACCAACAGGAAAATCTTGCCTGATGGCTTTACAAATATATTTGGGAAGCCACCTGCCTGGAAAAAATGCCATCAACATGCATGAGAAACTCCAGCAATGCATATGGGAAAATCTTGCCTGATGGCTTTACAAATAAATTCTTCATATGGGAAGTTGCTCGCCTTGCCATCAACATGCATGAGAAACTCCAGCAATGCATTCCCCACATGATTCTCTCGCCCCAATGGTGCTCTATGTGTAAAAGGGATGCTGGATCTCAAGACCACCTTCTCATTAATTTTTTTTCTTGCCTCCCTCTGGACATACCTTATTTCTGCTTTTAATTGGTTGATTCCATACCCTGGCAATCGCATCTTCTTGCATCCATGTTATCTGGGTATCCACACAAAAAAGGGAAGTACCTTTGGGAAAATCTCATCAAAGCTTTCTTTTGGGTTACTTGGACTGAGCGCAACGAATGAATCTTTCAAAACAAGGAGAGGCCTGTTGAACACCTTTTGTATAGTATCATTTATCTTACTCTATCTTGGTGTAAAGTGATACCTCTAATTTAATCCTTACTCTTTTGTCTCTCTTGTGACCAATTGGAAAAGTCTTTTGTAACTCCATTGCTTGAAATTTCTTTTCCCTCGTTTGCTTATCCAAACCAAAATAATAATAAATAAATAAAAATAAAACTTGTATATGTTGGAGCATGAGTAGAAAACATGGTTTTGAGTTGAGCTTTTCATGCACATAATACACCACTCATTGATAAACCCATAGAGGGTTGATATGAATTCCCTTGGGACCAAGTAACTCGATACCAAAAGAGTTCAAATCTCTGAGGCAAATAGAGCTCAGATACTAAATAGTGTGAACTCCCTTGGAAATTGAAGTAAGAACCATTTTTTTGGTATCAGAGCACTTTGGTATCATTTCAAGGGCACTGCTTTTGTAAACTGTTGTGTGAGAGTTAATATTCCAATGACTAAGCAAATTGGAATTGCATTATACTATGTGGGAGGACGCTTTTCAATCTTTCAGGGAGCACTCTAGCTATGATCTTATAGATGCATGGGACAAGGCTGATAGGGCAGTAATTGATTGCTGTATTTGCTTCAGCTATTTTGGAATTGGACCTATATAAATTTCATTAAGGCTAACATTTATAATGCCATTCTTTAAAGTTGGGGAATTCTGTCGCAAGGTCTTCTTTAACGATGTCATGATTATTTGGATTTGTCACTTTTTTATAAGCACTGGAAGCCATTGTATCTTGCGGTATCTTCACTTCGAGAAAAGAATCAGCTTAAAGGAGGTACTGATTCAGGTGTAATTGGTGTATTAGGAGAAAATTTTCATTCTCGCTGCCGCTCATGTATGACGTAAAATTTAAGATGGGGATTCATCTTTTTTATTTAATTGTAGTAATACCCAAATTCTGAGATACTTCGCTTACTTATGGCTAAAGAAAGTATGAAAAATGAGTGATCCAAACTTAGCCAATAAATTTTTTATTCTTTATTTTTCATTTGACTTGTTTAACTTTGCTGCTGCTTATTCTGTTTGTAAAGGTTCTGGTGGTTGATCCAGAGGCATACACTTATGATGATGAAGTGATTAAGAAAGCAGAAGCAATGGGTAAAGCTGGACTAGTGGATATCACTGCAAAAGAAGATAGTTTTATCTTCACAGTTGAATCTACCGGTGCAATAAAAGCTTCCCAGTTGATACTCAATGCCATAGATATTCTGAAACAGAAACTGGATGCAGTCCGCCTTTCAGACGATACTGTAGAAGCAGATGATCAGTTTGGTGAGTTAGGTGCACATATGCGAGGTGGATGATCCAGATACCAGGCCTTCCCATATGTTTCTAGGAGGTTAGGTCAACTGTAATTTTGTTTTCCTGAACAATCTAGTTTTTCTGTACTGATGAGCCTGGTTTAGTGCATTCTTTTTCACATATTGGATGCACATGTAGCTTCCGGAGATAATTATAATGTAGTATCTTTGGCATTGGTTGGCTCATCCTCATGTTTGACCTGGTTCAATGTATTATGGATGATCGTTAATTCAGCTTCTTCCTTGGGGGTTTTTTTCCCAGGCAATGCCTTGATTATTAATTTTCCAGAAGCTGCTTCCTTAGGGTGGTGACTTGCCCCAAGGGGGTGTAATAAAAACAGCCCCTCCTTAGCCAAATGACATGGTGACCTGGTTCTATGTGTAATATAATGAAATTTTGAATTGCTTCTAGGTTCAGTGATCTACTTAACGGGGCGGTCATTTGCATGAACAACCCAATTTGGATTGGCCAAGGCTAATAAGAACTTGACCAACCTTGAAGATTTAGAATTAAAGTTAGTGATCCTAGATGTCTAGAGACTTGGGAAGTAATAGAGAGAGAGAGAGACCCTTTCCCTTTTCCCTTTTTCCTTTTTCCATCATATGTTGGCATGTCCTCCAAGGTCCTCAAAGAACACTGGTCAGTAAAGCCTTGGTTCAAATGACCCCAGATGCTTTTCACCAAAAGTATCGGGATCTTGTTGGCCAGGGTTTGTCCTCTCTGTGGCTCCCGATCTTAGATTATAATGGGCATCCTATTTCCTTAAATGACATGAACATTCTTCTTTCCTAGAAATATAGGATCAAATTTTTGTGATGGACTTTTATAGGCCTATTAGATGTTTCCAGACTTGGGCCAATTTTGAGACCCAAGCTGCCTTTGTTTCTAACTCATCAGTGATATATTACTTTCTTATTTTGGTTGACTGCCCAGAAGTTTGAGTTTCTGTTTTCAGGTTTTAAGAATAATTGTTTAGGAGCTTTTGCGATTCGAAGTGCTTTTTTTAGATACTCTTTCTAAATTACAATGTCCCTTAGATTTTGTTTATTTTTACTTCTTGGACTATTTAGCCTTCGAACATCTGCTCTTTCATTGTTCATTCGCTTCGTATGAAATCAGACTGGTTTATCTATATGTTGATTAGATGTGAGGATTATTTGTTGGTTGGTGGAAGCTCTTACAGGTTGGAGCTTTAGAAGAAAGGCTAATAATCACAGAATTCTTTTGTAATAAGTCTTCTTTATGTTCTTTTCCTATATATCTTTCTTTATTTCTCATAAAAACAACAAAAGAAATACAAGAGATAGTAGGTTGACACTTCTTGGTTTCACTCTTGAGTGTTGTAGGGTTGGGAATATTCGACTCAGGCATTGCATGTGAGATTCCACATCCTGTGAGATTCCACATCGGTTGGAGAGGGGAACGAAACCATTCTTTATAAGGGTGTGGAAACATCTGGCAAATACATTTTAAAACTTTGAAGGGAAGCCCGAAAGGGAAAGTCCAAAGAAGACAATATCAGTTAGTGGTGGGCTTGGACCCTTACAAATGGTTTCAGAGCTAAACGTCGGGTGATGTGCCAGTGAAAAGGCTGAGCCTCGAAGGGGGTGGACACAAGGCAGTGTGCCAGCAAGGACGTTGGGCCCCAAAGGGGGTGGGGGTGGGGGTGGGGGTGGATTGGGGGGTCCGTGTGAAAACTTCTTTCTAGCAGACACATTTAAAAAACCTTGATGGGAAGCCCGAAAGGGAAAGGCGAAAGAGGACAATATTTGTTAGCGGTGGGCTTGGGCCATTACAAATGGTATAAGAGTCAAACATCAGACGATGTGCCAGCGAAAGGGCTGAGTCCCAAAGGGGGTGGACCCGAGGCGGTGTGCCAGCAAACACGCTGGGCCCAAAGGAAGGGGGTGGATCGGTTGGAGAAAGGAACGAAACATTCTTTACAAGGGTGTGAAAATCTCTCCTTAGTAAATGCATTTTAGATCTACTAGAAGTGGGCTTGAGGTGTATATAATAAACCTCTAACATTTTGGATCAACCTGACAAATGACAACAACCCTGGATAACAATATCAAAGTTGAAAGCAGAAAGTTACTTAAATGAGCAATGAGTTGCCCTCATGATGTTGTAATCATCCAGGCCCCGAAATAAATGAATGTTGGCATCTTAGTTGCATCCACAGTCTTTTAATTTATGAGCAGTAAGTTTGTACCTTCCCTTATGTTTAACCATTGAAGCTCTTCTTTATTATGCCATTGCAGGTCCTGATTCCGACTTTGTACTTTTACTGTCCAAGCAGTTCCCATTATTTCTCTGCACAAAATCTTTATTACCTCTTGTACTCGTGCAATTGCCTTTTACTTCTCTTTCTTTACCATCAGTTTCTTAGGCAATCAATGGATTTTTAATCTTTCTTAGCACAAAAATGGCCTATGCTTTGTCTGGGAAATTAATTGTTTTGGGTCTCATTATTGTAGGCTGTCAAAAAGATGCTCAAAATTTTGTCTGAGAAAGTCAGACATGGCTGCCTGCACACACCTTTGCTGCTTCCTCTGAGAACAGTTGACTCTTCAAAGGCTGGATTCTTTCGGGGCCCCTAAGGAAGAAAACAACAACCAGAACCTTCTCTGAGTTATTTTGGAATTCAACTACTCTTATTATCTCAACCACTTTAAAATGGACTTTACTCATGTTTCTCTTCATTTCTTCTTATTAACCTGTTTGGGAATGGTGTTACAAGCCAAATTTTATTAACCTGTTTTTGTTTTGAGTTGTTCTTAAAAGAGTATTTGTTCTTTGGACTTGGAGGACAAATCTTTTCAAGGTTTCTGGCTTTTGGTAACAATTTGTATGCTCTAGAGTTGGTGGAATCACATTTAGATTTGGTAAAATGAATGTTTATTT

mRNA sequence

ATATTTTTGGGTCGAGTCTATTCACTGGCAGACCGGACCGGGTCGGGAGCCATGGAAGGGGCATCGTACCAGCGCTTTCCCAAGGTGAAGATCCGAGAGCTCAGAGATGACTATGCCAAATTCGAGCTCCATGACACTGATGTTAGCATGGCCAACGCCCTCCGTCGCGTCATGATCGCTGAGGTCCCCACCATCGCCATCGACCTGGTAGAAATTGAAGTGAACTCCTCAGTTCTTAACGATGAGTTCATCGCCCATCGCCTTGGTCTCATTCCCCTCACTAGCGAGCGTGCCATGTCTATGCGCTTCTCTCGCGACTGTGATGCTTGTGATGGTGATGGCCAGTGCGAGTTTTGCTCTGTCGAGTTTCACCTTCGTGCCAAGTGCCACTCTGACCAAACCCTAGACGTCTCCAGCAAGGATCTCTATAGTTCTGATCACACTGTTGTCCCTGTTGATTTCTCTGATTCTGCAGCAGCTGCTGGTGAAGCGCTTGATACCAAAGGAATAATCATTGTGAAGCTACGTCGGGGTCAAGAATTGAGGTTGAGAGCCATAGCCAGGAAGGGCATTGGTAAAGATCATGCAAAATGGTCGCCTGCAGCAACTGTGACATTCATGTATGAACCAAGCATTCACATCAATGAAGACCTAATGGAAACATTAACACTCGAGGAAAAGACTTCCTGGGTTGAAAGTAGCCCAACCAAAGTCTTTGAATTGGATCCTGTTACCCACCAGGTTCTGGTGGTTGATCCAGAGGCATACACTTATGATGATGAAGTGATTAAGAAAGCAGAAGCAATGGGTAAAGCTGGACTAGTGGATATCACTGCAAAAGAAGATAGTTTTATCTTCACAGTTGAATCTACCGGTGCAATAAAAGCTTCCCAGTTGATACTCAATGCCATAGATATTCTGAAACAGAAACTGGATGCAGTCCGCCTTTCAGACGATACTGTAGAAGCAGATGATCAGTTTGGTGAGTTAGGTGCACATATGCGAGGCTGTCAAAAAGATGCTCAAAATTTTGTCTGAGAAAGTCAGACATGGCTGCCTGCACACACCTTTGCTGCTTCCTCTGAGAACAGTTGACTCTTCAAAGGCTGGATTCTTTCGGGGCCCCTAAGGAAGAAAACAACAACCAGAACCTTCTCTGAGTTATTTTGGAATTCAACTACTCTTATTATCTCAACCACTTTAAAATGGACTTTACTCATGTTTCTCTTCATTTCTTCTTATTAACCTGTTTGGGAATGGTGTTACAAGCCAAATTTTATTAACCTGTTTTTGTTTTGAGTTGTTCTTAAAAGAGTATTTGTTCTTTGGACTTGGAGGACAAATCTTTTCAAGGTTTCTGGCTTTTGGTAACAATTTGTATGCTCTAGAGTTGGTGGAATCACATTTAGATTTGGTAAAATGAATGTTTATTT

Coding sequence (CDS)

ATATTTTTGGGTCGAGTCTATTCACTGGCAGACCGGACCGGGTCGGGAGCCATGGAAGGGGCATCGTACCAGCGCTTTCCCAAGGTGAAGATCCGAGAGCTCAGAGATGACTATGCCAAATTCGAGCTCCATGACACTGATGTTAGCATGGCCAACGCCCTCCGTCGCGTCATGATCGCTGAGGTCCCCACCATCGCCATCGACCTGGTAGAAATTGAAGTGAACTCCTCAGTTCTTAACGATGAGTTCATCGCCCATCGCCTTGGTCTCATTCCCCTCACTAGCGAGCGTGCCATGTCTATGCGCTTCTCTCGCGACTGTGATGCTTGTGATGGTGATGGCCAGTGCGAGTTTTGCTCTGTCGAGTTTCACCTTCGTGCCAAGTGCCACTCTGACCAAACCCTAGACGTCTCCAGCAAGGATCTCTATAGTTCTGATCACACTGTTGTCCCTGTTGATTTCTCTGATTCTGCAGCAGCTGCTGGTGAAGCGCTTGATACCAAAGGAATAATCATTGTGAAGCTACGTCGGGGTCAAGAATTGAGGTTGAGAGCCATAGCCAGGAAGGGCATTGGTAAAGATCATGCAAAATGGTCGCCTGCAGCAACTGTGACATTCATGTATGAACCAAGCATTCACATCAATGAAGACCTAATGGAAACATTAACACTCGAGGAAAAGACTTCCTGGGTTGAAAGTAGCCCAACCAAAGTCTTTGAATTGGATCCTGTTACCCACCAGGTTCTGGTGGTTGATCCAGAGGCATACACTTATGATGATGAAGTGATTAAGAAAGCAGAAGCAATGGGTAAAGCTGGACTAGTGGATATCACTGCAAAAGAAGATAGTTTTATCTTCACAGTTGAATCTACCGGTGCAATAAAAGCTTCCCAGTTGATACTCAATGCCATAGATATTCTGAAACAGAAACTGGATGCAGTCCGCCTTTCAGACGATACTGTAGAAGCAGATGATCAGTTTGGTGAGTTAGGTGCACATATGCGAGGCTGTCAAAAAGATGCTCAAAATTTTGTCTGA

Protein sequence

IFLGRVYSLADRTGSGAMEGASYQRFPKVKIRELRDDYAKFELHDTDVSMANALRRVMIAEVPTIAIDLVEIEVNSSVLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDVSSKDLYSSDHTVVPVDFSDSAAAAGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAKWSPAATVTFMYEPSIHINEDLMETLTLEEKTSWVESSPTKVFELDPVTHQVLVVDPEAYTYDDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLSDDTVEADDQFGELGAHMRGCQKDAQNFV
BLAST of Cp4.1LG04g07860 vs. Swiss-Prot
Match: NRPB3_ARATH (DNA-directed RNA polymerases II, IV and V subunit 3 OS=Arabidopsis thaliana GN=NRPB3 PE=1 SV=1)

HSP 1 Score: 523.1 bits (1346), Expect = 2.3e-147
Identity = 263/319 (82.45%), Postives = 297/319 (93.10%), Query Frame = 1

Query: 18  MEGASYQRFPKVKIRELRDDYAKFELHDTDVSMANALRRVMIAEVPTIAIDLVEIEVNSS 77
           M+GA+YQRFPK+KIREL+DDYAKFEL +TDVSMANALRRVMI+EVPT+AIDLVEIEVNSS
Sbjct: 1   MDGATYQRFPKIKIRELKDDYAKFELRETDVSMANALRRVMISEVPTVAIDLVEIEVNSS 60

Query: 78  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 137
           VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEF L +KC +DQTLDV
Sbjct: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFRLSSKCVTDQTLDV 120

Query: 138 SSKDLYSSDHTVVPVDFSDSAAAAGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 197
           +S+DLYS+D TV PVDF+  ++ + ++ + KGIIIVKLRRGQEL+LRAIARKGIGKDHAK
Sbjct: 121 TSRDLYSADPTVTPVDFTIDSSVS-DSSEHKGIIIVKLRRGQELKLRAIARKGIGKDHAK 180

Query: 198 WSPAATVTFMYEPSIHINEDLMETLTLEEKTSWVESSPTKVFELDPVTHQVLVVDPEAYT 257
           WSPAATVTFMYEP I INED+M+TL+ EEK   +ESSPTKVF +DPVT QV+VVDPEAYT
Sbjct: 181 WSPAATVTFMYEPDIIINEDMMDTLSDEEKIDLIESSPTKVFGMDPVTRQVVVVDPEAYT 240

Query: 258 YDDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 317
           YD+EVIKKAEAMGK GL++I+ K+DSFIFTVESTGA+KASQL+LNAID+LKQKLDAVRLS
Sbjct: 241 YDEEVIKKAEAMGKPGLIEISPKDDSFIFTVESTGAVKASQLVLNAIDLLKQKLDAVRLS 300

Query: 318 DDTVEADDQFGELGAHMRG 337
           DDTVEADDQFGELGAHMRG
Sbjct: 301 DDTVEADDQFGELGAHMRG 318

BLAST of Cp4.1LG04g07860 vs. Swiss-Prot
Match: RPD3B_ARATH (DNA-directed RNA polymerases IV and V subunit 3B OS=Arabidopsis thaliana GN=NRPD3B PE=1 SV=2)

HSP 1 Score: 484.2 bits (1245), Expect = 1.2e-135
Identity = 245/318 (77.04%), Postives = 284/318 (89.31%), Query Frame = 1

Query: 18  MEGASYQRFPKVKIRELRDDYAKFELHDTDVSMANALRRVMIAEVPTIAIDLVEIEVNSS 77
           M+G +YQRFP VKIREL+DDYAKFEL +TDVSMANALRRVMI+EVPT+AI LV+IEVNSS
Sbjct: 1   MDGVTYQRFPTVKIRELKDDYAKFELRETDVSMANALRRVMISEVPTMAIHLVKIEVNSS 60

Query: 78  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 137
           VLNDEFIA RL LIPLTSERAMSMRF +DC+ C+GD  CEFCSVEF L AKC +DQTLDV
Sbjct: 61  VLNDEFIAQRLSLIPLTSERAMSMRFCQDCEDCNGDEHCEFCSVEFPLSAKCVTDQTLDV 120

Query: 138 SSKDLYSSDHTVVPVDFSDSAAAAGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 197
           +S+DLYS+D TV PVDF+ +++ + ++ + KGIII KLRRGQEL+L+A+ARKGIGKDHAK
Sbjct: 121 TSRDLYSADPTVTPVDFTSNSSTS-DSSEHKGIIIAKLRRGQELKLKALARKGIGKDHAK 180

Query: 198 WSPAATVTFMYEPSIHINEDLMETLTLEEKTSWVESSPTKVFELDPVTHQVLVVDPEAYT 257
           WSPAATVT+MYEP I INE++M TLT EEK   +ESSPTKVF +DPVT QV+VVDPEAYT
Sbjct: 181 WSPAATVTYMYEPDIIINEEMMNTLTDEEKIDLIESSPTKVFGIDPVTGQVVVVDPEAYT 240

Query: 258 YDDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 317
           YD+EVIKKAEAMGK GL++I  K DSF+FTVESTGA+KASQL+LNAIDILKQKLDA+RLS
Sbjct: 241 YDEEVIKKAEAMGKPGLIEIHPKHDSFVFTVESTGALKASQLVLNAIDILKQKLDAIRLS 300

Query: 318 DDTVEADDQFGELGAHMR 336
           D+TVEADDQFGELGAHMR
Sbjct: 301 DNTVEADDQFGELGAHMR 317

BLAST of Cp4.1LG04g07860 vs. Swiss-Prot
Match: RPB3_DICDI (DNA-directed RNA polymerase II subunit rpb3 OS=Dictyostelium discoideum GN=polr2c PE=3 SV=1)

HSP 1 Score: 262.3 bits (669), Expect = 7.4e-69
Identity = 143/293 (48.81%), Postives = 198/293 (67.58%), Query Frame = 1

Query: 25  RFPKVKIRELRDDYAKFELHDTDVSMANALRRVMIAEVPTIAIDLVEIEVNSSVLNDEFI 84
           R P+++I E+++D   F L +TD+S+ANALRRVMIAEVPT+ IDLVE E N+SVL DEFI
Sbjct: 10  RQPELEILEIKNDSIIFILSNTDISVANALRRVMIAEVPTMCIDLVEFESNNSVLCDEFI 69

Query: 85  AHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDVSSKDLYS 144
           AHRLGLIPL S+      ++RDC   D   +C+ CSVE  L  KC  ++  DV+S DL S
Sbjct: 70  AHRLGLIPLVSDNIDKFCYTRDCSCSD---RCDQCSVELRLNVKCTENRPRDVTSSDLLS 129

Query: 145 SDHTVVPVDFSDSAAAAGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAKWSPAATV 204
            +  V+PV    +++ + +      I IVKLRRGQE++LRAIA+KG+GK+HAKWSP+   
Sbjct: 130 QNSAVIPVSSQVTSSNSEQE-----IPIVKLRRGQEIKLRAIAKKGVGKEHAKWSPSCVA 189

Query: 205 TFMYEPSIHINEDLMETLTLEEKTSWVESSPTKVFELDP--VTHQVLVVDPEAYTYDDEV 264
           T+ ++P I +N++ ++ LT ++K  WV S PTKV+   P   T QV + DP    Y  E 
Sbjct: 190 TYQFQPIIVLNQNRIDELTDQQKEEWVGSCPTKVYSYSPHQSTQQVTIEDPLRCVYCLEC 249

Query: 265 IKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKLDAVR 316
            KKAE+ GK  LV +  K+D FIFTVES+GA+K   ++L AI I+K+KL  ++
Sbjct: 250 KKKAESFGKPDLVHLEQKQDKFIFTVESSGALKPEDIVLYAIQIIKRKLTDIQ 294

BLAST of Cp4.1LG04g07860 vs. Swiss-Prot
Match: RPB3_BOVIN (DNA-directed RNA polymerase II subunit RPB3 OS=Bos taurus GN=POLR2C PE=1 SV=1)

HSP 1 Score: 194.9 bits (494), Expect = 1.5e-48
Identity = 117/290 (40.34%), Postives = 169/290 (58.28%), Query Frame = 1

Query: 23  YQRFPKVKIRELRDDYAKFELHDTDVSMANALRRVMIAEVPTIAIDLVEIEVNSSVLNDE 82
           Y   P V+I EL D+  KF + +TD+++AN++RRV IAEVP IAID V+I+ NSSVL+DE
Sbjct: 3   YANQPTVRITELTDENVKFIIENTDLAVANSIRRVFIAEVPIIAIDWVQIDANSSVLHDE 62

Query: 83  FIAHRLGLIPLTSERAMS-MRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDVSSKD 142
           FIAHRLGLIPLTS+  +  +++SRDC     +  C  CSVEF L  +C+ DQT  V+S+D
Sbjct: 63  FIAHRLGLIPLTSDDIVDKLQYSRDCTC---EEFCPECSVEFTLDVRCNEDQTRHVTSRD 122

Query: 143 LYSSDHTVVPVDFSDSAAAAGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAKWSPA 202
           L S+   V+PV   +      + ++   I+IVKLR+GQELRLRA A+KG GK+HAKW+P 
Sbjct: 123 LISNSPRVIPVTSRNRDNDPNDYVEQDDILIVKLRKGQELRLRAYAKKGFGKEHAKWNPT 182

Query: 203 ATVTFMYEPSIHINEDLMETLTLEEKTSWVESSPTKVFELDPVTHQVLVVDPEAYTYDDE 262
           A V F Y+P     ++ +      +   W +S  +++ E           D     YD  
Sbjct: 183 AGVAFEYDP-----DNALRHTVYPKPEEWPKSEYSELDE-----------DESQAPYDP- 242

Query: 263 VIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKL 312
                             K + F + VES G+++   ++L+A+  LK+KL
Sbjct: 243 ----------------NGKPERFYYNVESCGSLRPETIVLSALSGLKKKL 256

BLAST of Cp4.1LG04g07860 vs. Swiss-Prot
Match: RPB3_HUMAN (DNA-directed RNA polymerase II subunit RPB3 OS=Homo sapiens GN=POLR2C PE=1 SV=2)

HSP 1 Score: 192.6 bits (488), Expect = 7.2e-48
Identity = 116/290 (40.00%), Postives = 168/290 (57.93%), Query Frame = 1

Query: 23  YQRFPKVKIRELRDDYAKFELHDTDVSMANALRRVMIAEVPTIAIDLVEIEVNSSVLNDE 82
           Y   P V+I EL D+  KF + +TD+++AN++RRV IAEVP IAID V+I+ NSSVL+DE
Sbjct: 3   YANQPTVRITELTDENVKFIIENTDLAVANSIRRVFIAEVPIIAIDWVQIDANSSVLHDE 62

Query: 83  FIAHRLGLIPLTSERAMS-MRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDVSSKD 142
           FIAHRLGLIPL S+  +  +++SRDC     +  C  CSVEF L  +C+ DQT  V+S+D
Sbjct: 63  FIAHRLGLIPLISDDIVDKLQYSRDCTC---EEFCPECSVEFTLDVRCNEDQTRHVTSRD 122

Query: 143 LYSSDHTVVPVDFSDSAAAAGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAKWSPA 202
           L S+   V+PV   +      + ++   I+IVKLR+GQELRLRA A+KG GK+HAKW+P 
Sbjct: 123 LISNSPRVIPVTSRNRDNDPNDYVEQDDILIVKLRKGQELRLRAYAKKGFGKEHAKWNPT 182

Query: 203 ATVTFMYEPSIHINEDLMETLTLEEKTSWVESSPTKVFELDPVTHQVLVVDPEAYTYDDE 262
           A V F Y+P     ++ +      +   W +S  +++ E           D     YD  
Sbjct: 183 AGVAFEYDP-----DNALRHTVYPKPEEWPKSEYSELDE-----------DESQAPYDP- 242

Query: 263 VIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKL 312
                             K + F + VES G+++   ++L+A+  LK+KL
Sbjct: 243 ----------------NGKPERFYYNVESCGSLRPETIVLSALSGLKKKL 256

BLAST of Cp4.1LG04g07860 vs. TrEMBL
Match: A0A0A0LHS1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G093850 PE=4 SV=1)

HSP 1 Score: 602.8 bits (1553), Expect = 2.6e-169
Identity = 307/319 (96.24%), Postives = 313/319 (98.12%), Query Frame = 1

Query: 18  MEGASYQRFPKVKIRELRDDYAKFELHDTDVSMANALRRVMIAEVPTIAIDLVEIEVNSS 77
           MEG+SYQRFPKVKIRELRDDYAKFEL DTD SMANALRRVMIAEVPTIAIDLVEIEVNSS
Sbjct: 1   MEGSSYQRFPKVKIRELRDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS 60

Query: 78  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 137
           VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV
Sbjct: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 120

Query: 138 SSKDLYSSDHTVVPVDFSDSAAAAGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 197
           +SKDLYSSDHTVVPVDFSDSAAA GEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK
Sbjct: 121 TSKDLYSSDHTVVPVDFSDSAAATGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 180

Query: 198 WSPAATVTFMYEPSIHINEDLMETLTLEEKTSWVESSPTKVFELDPVTHQVLVVDPEAYT 257
           WSPAATVTFMYEPSIHINEDLMETLTLEEK +WVES PT+VFELD VTHQV+VVDPEAYT
Sbjct: 181 WSPAATVTFMYEPSIHINEDLMETLTLEEKRTWVESCPTRVFELDTVTHQVMVVDPEAYT 240

Query: 258 YDDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 317
           YDDEVIKKAEAMGKAGLVDITA+EDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS
Sbjct: 241 YDDEVIKKAEAMGKAGLVDITAREDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 300

Query: 318 DDTVEADDQFGELGAHMRG 337
           DDTVEADDQFGELGAHMRG
Sbjct: 301 DDTVEADDQFGELGAHMRG 319

BLAST of Cp4.1LG04g07860 vs. TrEMBL
Match: A0A061E7U0_THECC (DNA-directed RNA polymerase family protein OS=Theobroma cacao GN=TCM_007121 PE=4 SV=1)

HSP 1 Score: 556.6 bits (1433), Expect = 2.1e-155
Identity = 283/319 (88.71%), Postives = 303/319 (94.98%), Query Frame = 1

Query: 18  MEGASYQRFPKVKIRELRDDYAKFELHDTDVSMANALRRVMIAEVPTIAIDLVEIEVNSS 77
           MEG SYQRFPKVKIREL+DDYAKFEL DTD SMANALRRVMIAEVPTIAIDLVEIEVNSS
Sbjct: 1   MEGVSYQRFPKVKIRELKDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS 60

Query: 78  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 137
           VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKC +DQTLDV
Sbjct: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCMTDQTLDV 120

Query: 138 SSKDLYSSDHTVVPVDFSDSAAAAGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 197
           +SKDLYSSDHTVVPVDF+DSA    ++ + +GIIIVKLRRGQELRLRAIARKGIGKDHAK
Sbjct: 121 TSKDLYSSDHTVVPVDFTDSAGY--DSSEQRGIIIVKLRRGQELRLRAIARKGIGKDHAK 180

Query: 198 WSPAATVTFMYEPSIHINEDLMETLTLEEKTSWVESSPTKVFELDPVTHQVLVVDPEAYT 257
           WSPAATVTFMYEP IHINED+METLTLEEK S+VESSPT+VF++DP T QV+VVDPEAYT
Sbjct: 181 WSPAATVTFMYEPEIHINEDMMETLTLEEKQSFVESSPTRVFDIDPNTQQVVVVDPEAYT 240

Query: 258 YDDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 317
           YDDEV+KKAEAMGK GLV+I AKEDSFIFTVESTGAIKASQL+LNAI++LKQKLDAVRLS
Sbjct: 241 YDDEVLKKAEAMGKPGLVEIYAKEDSFIFTVESTGAIKASQLVLNAIEVLKQKLDAVRLS 300

Query: 318 DDTVEADDQFGELGAHMRG 337
           +DTVEADDQFGELGAHMRG
Sbjct: 301 EDTVEADDQFGELGAHMRG 317

BLAST of Cp4.1LG04g07860 vs. TrEMBL
Match: A0A0B0MXX2_GOSAR (DNA-directed RNA polymerase II subunit RPB3-A-like protein OS=Gossypium arboreum GN=F383_29872 PE=4 SV=1)

HSP 1 Score: 555.8 bits (1431), Expect = 3.6e-155
Identity = 284/319 (89.03%), Postives = 303/319 (94.98%), Query Frame = 1

Query: 18  MEGASYQRFPKVKIRELRDDYAKFELHDTDVSMANALRRVMIAEVPTIAIDLVEIEVNSS 77
           MEG SYQRFPKVKIREL+DDYAKFEL DTD SMANALRRVMI+EVPTIAIDLVEIEVNSS
Sbjct: 1   MEGISYQRFPKVKIRELKDDYAKFELRDTDASMANALRRVMISEVPTIAIDLVEIEVNSS 60

Query: 78  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 137
           VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKC SDQTLDV
Sbjct: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCISDQTLDV 120

Query: 138 SSKDLYSSDHTVVPVDFSDSAAAAGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 197
           +SKDLYSSDHTVVPVDF+D+A    ++ + +GIIIVKLRRGQELRLRAIARKGIGKDHAK
Sbjct: 121 TSKDLYSSDHTVVPVDFTDNAGY--DSTEPRGIIIVKLRRGQELRLRAIARKGIGKDHAK 180

Query: 198 WSPAATVTFMYEPSIHINEDLMETLTLEEKTSWVESSPTKVFELDPVTHQVLVVDPEAYT 257
           WSPAATVTFMYEP IHINEDLME+LTLEEK S+VESSPTKVF++DP T QV+VVDPEAYT
Sbjct: 181 WSPAATVTFMYEPEIHINEDLMESLTLEEKLSFVESSPTKVFDIDPNTQQVVVVDPEAYT 240

Query: 258 YDDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 317
           YDDEV+KKAEAMGK GLV+I AKEDSFIFTVESTGAIKASQL+LNAI+ILKQKLDAVRLS
Sbjct: 241 YDDEVLKKAEAMGKPGLVEIYAKEDSFIFTVESTGAIKASQLVLNAIEILKQKLDAVRLS 300

Query: 318 DDTVEADDQFGELGAHMRG 337
           +DTVEADDQFGELGAHMRG
Sbjct: 301 EDTVEADDQFGELGAHMRG 317

BLAST of Cp4.1LG04g07860 vs. TrEMBL
Match: A0A0B0P1L0_GOSAR (DNA-directed RNA polymerase II subunit RPB3-A-like protein OS=Gossypium arboreum GN=F383_05805 PE=4 SV=1)

HSP 1 Score: 555.8 bits (1431), Expect = 3.6e-155
Identity = 281/319 (88.09%), Postives = 304/319 (95.30%), Query Frame = 1

Query: 18  MEGASYQRFPKVKIRELRDDYAKFELHDTDVSMANALRRVMIAEVPTIAIDLVEIEVNSS 77
           MEGASYQRFP+VKIREL+DDYAKFELHDTD SMANALRRVMIAEVPTIAIDLVEIEVNSS
Sbjct: 1   MEGASYQRFPRVKIRELKDDYAKFELHDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS 60

Query: 78  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 137
           VLNDEFIAHRLGLIPLTS+RAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKC SDQTLDV
Sbjct: 61  VLNDEFIAHRLGLIPLTSQRAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCISDQTLDV 120

Query: 138 SSKDLYSSDHTVVPVDFSDSAAAAGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 197
           +SKDLYSSDHTVVPVDF+D A    E+ + +GIIIVKLRRGQELRLRAIARKGIGKDHAK
Sbjct: 121 TSKDLYSSDHTVVPVDFTDYAGY--ESSEQRGIIIVKLRRGQELRLRAIARKGIGKDHAK 180

Query: 198 WSPAATVTFMYEPSIHINEDLMETLTLEEKTSWVESSPTKVFELDPVTHQVLVVDPEAYT 257
           WSPAATVTFMYEP IHINED+M+TLTLEEK S+V+SSPT+VF++DP T QV+VVDPEAYT
Sbjct: 181 WSPAATVTFMYEPEIHINEDMMDTLTLEEKQSFVDSSPTRVFDIDPNTQQVVVVDPEAYT 240

Query: 258 YDDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 317
           YDDEV+KKAEAMGK GLV+I AKEDSFIFT+ESTGA+KASQL+LNAI+ILKQKLDAVRLS
Sbjct: 241 YDDEVLKKAEAMGKPGLVEIYAKEDSFIFTIESTGAVKASQLVLNAIEILKQKLDAVRLS 300

Query: 318 DDTVEADDQFGELGAHMRG 337
           +DTVEADDQFGELGAHMRG
Sbjct: 301 EDTVEADDQFGELGAHMRG 317

BLAST of Cp4.1LG04g07860 vs. TrEMBL
Match: A0A0D2SZ13_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G023500 PE=4 SV=1)

HSP 1 Score: 555.8 bits (1431), Expect = 3.6e-155
Identity = 284/319 (89.03%), Postives = 303/319 (94.98%), Query Frame = 1

Query: 18  MEGASYQRFPKVKIRELRDDYAKFELHDTDVSMANALRRVMIAEVPTIAIDLVEIEVNSS 77
           MEG SYQRFPKVKIREL+DDYAKFEL DTD SMANALRRVMI+EVPTIAIDLVEIEVNSS
Sbjct: 1   MEGISYQRFPKVKIRELKDDYAKFELRDTDASMANALRRVMISEVPTIAIDLVEIEVNSS 60

Query: 78  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 137
           VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKC SDQTLDV
Sbjct: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCISDQTLDV 120

Query: 138 SSKDLYSSDHTVVPVDFSDSAAAAGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 197
           +SKDLYSSDHTVVPVDF+D+A    ++ + +GIIIVKLRRGQELRLRAIARKGIGKDHAK
Sbjct: 121 TSKDLYSSDHTVVPVDFTDNAGY--DSTEPRGIIIVKLRRGQELRLRAIARKGIGKDHAK 180

Query: 198 WSPAATVTFMYEPSIHINEDLMETLTLEEKTSWVESSPTKVFELDPVTHQVLVVDPEAYT 257
           WSPAATVTFMYEP IHINEDLME+LTLEEK S+VESSPTKVF++DP T QV+VVDPEAYT
Sbjct: 181 WSPAATVTFMYEPEIHINEDLMESLTLEEKLSFVESSPTKVFDIDPNTQQVVVVDPEAYT 240

Query: 258 YDDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 317
           YDDEV+KKAEAMGK GLV+I AKEDSFIFTVESTGAIKASQL+LNAI+ILKQKLDAVRLS
Sbjct: 241 YDDEVLKKAEAMGKPGLVEIYAKEDSFIFTVESTGAIKASQLVLNAIEILKQKLDAVRLS 300

Query: 318 DDTVEADDQFGELGAHMRG 337
           +DTVEADDQFGELGAHMRG
Sbjct: 301 EDTVEADDQFGELGAHMRG 317

BLAST of Cp4.1LG04g07860 vs. TAIR10
Match: AT2G15430.1 (AT2G15430.1 DNA-directed RNA polymerase family protein)

HSP 1 Score: 523.1 bits (1346), Expect = 1.3e-148
Identity = 263/319 (82.45%), Postives = 297/319 (93.10%), Query Frame = 1

Query: 18  MEGASYQRFPKVKIRELRDDYAKFELHDTDVSMANALRRVMIAEVPTIAIDLVEIEVNSS 77
           M+GA+YQRFPK+KIREL+DDYAKFEL +TDVSMANALRRVMI+EVPT+AIDLVEIEVNSS
Sbjct: 1   MDGATYQRFPKIKIRELKDDYAKFELRETDVSMANALRRVMISEVPTVAIDLVEIEVNSS 60

Query: 78  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 137
           VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEF L +KC +DQTLDV
Sbjct: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFRLSSKCVTDQTLDV 120

Query: 138 SSKDLYSSDHTVVPVDFSDSAAAAGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 197
           +S+DLYS+D TV PVDF+  ++ + ++ + KGIIIVKLRRGQEL+LRAIARKGIGKDHAK
Sbjct: 121 TSRDLYSADPTVTPVDFTIDSSVS-DSSEHKGIIIVKLRRGQELKLRAIARKGIGKDHAK 180

Query: 198 WSPAATVTFMYEPSIHINEDLMETLTLEEKTSWVESSPTKVFELDPVTHQVLVVDPEAYT 257
           WSPAATVTFMYEP I INED+M+TL+ EEK   +ESSPTKVF +DPVT QV+VVDPEAYT
Sbjct: 181 WSPAATVTFMYEPDIIINEDMMDTLSDEEKIDLIESSPTKVFGMDPVTRQVVVVDPEAYT 240

Query: 258 YDDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 317
           YD+EVIKKAEAMGK GL++I+ K+DSFIFTVESTGA+KASQL+LNAID+LKQKLDAVRLS
Sbjct: 241 YDEEVIKKAEAMGKPGLIEISPKDDSFIFTVESTGAVKASQLVLNAIDLLKQKLDAVRLS 300

Query: 318 DDTVEADDQFGELGAHMRG 337
           DDTVEADDQFGELGAHMRG
Sbjct: 301 DDTVEADDQFGELGAHMRG 318

BLAST of Cp4.1LG04g07860 vs. TAIR10
Match: AT2G15400.1 (AT2G15400.1 DNA-directed RNA polymerase family protein)

HSP 1 Score: 484.2 bits (1245), Expect = 6.8e-137
Identity = 245/318 (77.04%), Postives = 284/318 (89.31%), Query Frame = 1

Query: 18  MEGASYQRFPKVKIRELRDDYAKFELHDTDVSMANALRRVMIAEVPTIAIDLVEIEVNSS 77
           M+G +YQRFP VKIREL+DDYAKFEL +TDVSMANALRRVMI+EVPT+AI LV+IEVNSS
Sbjct: 1   MDGVTYQRFPTVKIRELKDDYAKFELRETDVSMANALRRVMISEVPTMAIHLVKIEVNSS 60

Query: 78  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 137
           VLNDEFIA RL LIPLTSERAMSMRF +DC+ C+GD  CEFCSVEF L AKC +DQTLDV
Sbjct: 61  VLNDEFIAQRLSLIPLTSERAMSMRFCQDCEDCNGDEHCEFCSVEFPLSAKCVTDQTLDV 120

Query: 138 SSKDLYSSDHTVVPVDFSDSAAAAGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 197
           +S+DLYS+D TV PVDF+ +++ + ++ + KGIII KLRRGQEL+L+A+ARKGIGKDHAK
Sbjct: 121 TSRDLYSADPTVTPVDFTSNSSTS-DSSEHKGIIIAKLRRGQELKLKALARKGIGKDHAK 180

Query: 198 WSPAATVTFMYEPSIHINEDLMETLTLEEKTSWVESSPTKVFELDPVTHQVLVVDPEAYT 257
           WSPAATVT+MYEP I INE++M TLT EEK   +ESSPTKVF +DPVT QV+VVDPEAYT
Sbjct: 181 WSPAATVTYMYEPDIIINEEMMNTLTDEEKIDLIESSPTKVFGIDPVTGQVVVVDPEAYT 240

Query: 258 YDDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 317
           YD+EVIKKAEAMGK GL++I  K DSF+FTVESTGA+KASQL+LNAIDILKQKLDA+RLS
Sbjct: 241 YDEEVIKKAEAMGKPGLIEIHPKHDSFVFTVESTGALKASQLVLNAIDILKQKLDAIRLS 300

Query: 318 DDTVEADDQFGELGAHMR 336
           D+TVEADDQFGELGAHMR
Sbjct: 301 DNTVEADDQFGELGAHMR 317

BLAST of Cp4.1LG04g07860 vs. TAIR10
Match: AT1G60850.2 (AT1G60850.2 DNA-directed RNA polymerase family protein)

HSP 1 Score: 121.3 bits (303), Expect = 1.2e-27
Identity = 99/319 (31.03%), Postives = 154/319 (48.28%), Query Frame = 1

Query: 20  GASYQRFPKVKIRELRDDYAKFELHDTDVSMANALRRVMIAEVPTIAIDLVEIEVNSSVL 79
           G  Y  F KV +  L     +F++   D + ANA RR++IAEVP++AI+ V I  N+SV+
Sbjct: 68  GNFYDNF-KVDVVSLTKTDMEFDMIGIDAAFANAFRRILIAEVPSMAIEKVLIAYNTSVI 127

Query: 80  NDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQ-TLDVS 139
            DE +AHR+GLIP+ ++  +    S      + D   E  ++ F L  KC  ++  L V 
Sbjct: 128 IDEVLAHRMGLIPIAADPRLFEYLS------EHDQANEKNTIVFKLHVKCPKNRPRLKVL 187

Query: 140 SKDL-----------YSSDHTVVP---VDFSDSAAAAGEALDTK------GIIIVKLRRG 199
           + DL            S + T  P     FS S  +  E  +         I+I KL  G
Sbjct: 188 TSDLKWLPNGSELLRESENKTSKPKTYTSFSCSQDSLPEFANNPITPCDLDILIAKLAPG 247

Query: 200 QELRLRAIARKGIGKDHAKWSPAATVTFMYEPSIHINEDLMETLTLEEKTSWVESSPTKV 259
           QE+ L A A KGIGK HAKWSP  T  +   P + +  ++ + L        V   P  V
Sbjct: 248 QEIELEAHAVKGIGKTHAKWSPVGTAWYRMHPEVVLRGEVEDELA----ERLVNVCPQNV 307

Query: 260 FELDPV---THQVLVVDPEAYTYDDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIK 315
           F+++ +     +  V  P   T   E ++  + +     VD+ + ++ FIF +ESTG++ 
Sbjct: 308 FDIEDMGKGKKRATVAQPRKCTLCKECVRDDDLVDH---VDLGSVKNHFIFNIESTGSLP 367

BLAST of Cp4.1LG04g07860 vs. TAIR10
Match: AT1G60620.1 (AT1G60620.1 RNA polymerase I subunit 43)

HSP 1 Score: 113.2 bits (282), Expect = 3.1e-25
Identity = 95/311 (30.55%), Postives = 148/311 (47.59%), Query Frame = 1

Query: 28  KVKIRELRDDYAKFELHDTDVSMANALRRVMIAEVPTIAIDLVEIEVNSSVLNDEFIAHR 87
           KV +  L +    F++      +ANA RR+++AE+P++AI+ V +  N+SV+ DE +AHR
Sbjct: 82  KVDVISLTETDMVFDMIGVHAGIANAFRRILLAELPSMAIEKVYVANNTSVIQDEVLAHR 141

Query: 88  LGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKC-HSDQTLDVSSKDL---- 147
           LGLIP+ ++  +    S      + D   E  ++ F L  KC   D    V + +L    
Sbjct: 142 LGLIPIAADPRLFEYLS------ENDQPNEKNTIVFKLHVKCLKGDPRRKVLTSELKWLP 201

Query: 148 -------YSSDHTVVP---VDFSDSAAAAGEALDT------KGIIIVKLRRGQELRLRAI 207
                   S   T  P     F+ S  +  E  +       K I+I KL  GQE+ L A 
Sbjct: 202 NGSELIKESGGSTTTPKTYTSFNHSQDSFPEFAENPIRPTLKDILIAKLGPGQEIELEAH 261

Query: 208 ARKGIGKDHAKWSPAATVTFMYEPSIHINEDLMETLTLEEKTSWVESSPTKVFELDPV-- 267
           A KGIGK HAKWSP AT  +   P +     L++    +     V+  P KVF+++ +  
Sbjct: 262 AVKGIGKTHAKWSPVATAWYRMLPEV----VLLKEFEGKHAEELVKVCPKKVFDIEDMGQ 321

Query: 268 -THQVLVVDPEAYTYDDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNA 315
              +  V  P   +   E I+  + +     VD+   ++ FIFT+ESTG+     L   A
Sbjct: 322 GRKRATVARPRDCSLCRECIR--DGVEWEDQVDLRRVKNHFIFTIESTGSQPPEVLFNEA 380

BLAST of Cp4.1LG04g07860 vs. NCBI nr
Match: gi|659082285|ref|XP_008441761.1| (PREDICTED: LOW QUALITY PROTEIN: DNA-directed RNA polymerases II, IV and V subunit 3-like [Cucumis melo])

HSP 1 Score: 604.4 bits (1557), Expect = 1.3e-169
Identity = 307/319 (96.24%), Postives = 314/319 (98.43%), Query Frame = 1

Query: 18  MEGASYQRFPKVKIRELRDDYAKFELHDTDVSMANALRRVMIAEVPTIAIDLVEIEVNSS 77
           MEG+SYQRFPKVKIRELRDDYAKFEL DTD SMANALRRVMIAEVPTIAIDLVEIEVNSS
Sbjct: 1   MEGSSYQRFPKVKIRELRDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS 60

Query: 78  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 137
           VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV
Sbjct: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 120

Query: 138 SSKDLYSSDHTVVPVDFSDSAAAAGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 197
           +SKDLYSSDHTVVPVDFSDSAAA GEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK
Sbjct: 121 TSKDLYSSDHTVVPVDFSDSAAATGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 180

Query: 198 WSPAATVTFMYEPSIHINEDLMETLTLEEKTSWVESSPTKVFELDPVTHQVLVVDPEAYT 257
           WSPAATVTFMYEPSIHINEDLMETLTLEEK +WVES PT+VFELDPVTHQV+VV+PEAYT
Sbjct: 181 WSPAATVTFMYEPSIHINEDLMETLTLEEKXTWVESCPTRVFELDPVTHQVMVVEPEAYT 240

Query: 258 YDDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 317
           YDDEVIKKAEAMGKAGLVDITA+EDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS
Sbjct: 241 YDDEVIKKAEAMGKAGLVDITAREDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 300

Query: 318 DDTVEADDQFGELGAHMRG 337
           DDTVEADDQFGELGAHMRG
Sbjct: 301 DDTVEADDQFGELGAHMRG 319

BLAST of Cp4.1LG04g07860 vs. NCBI nr
Match: gi|449442251|ref|XP_004138895.1| (PREDICTED: DNA-directed RNA polymerases II, IV and V subunit 3 [Cucumis sativus])

HSP 1 Score: 602.8 bits (1553), Expect = 3.7e-169
Identity = 307/319 (96.24%), Postives = 313/319 (98.12%), Query Frame = 1

Query: 18  MEGASYQRFPKVKIRELRDDYAKFELHDTDVSMANALRRVMIAEVPTIAIDLVEIEVNSS 77
           MEG+SYQRFPKVKIRELRDDYAKFEL DTD SMANALRRVMIAEVPTIAIDLVEIEVNSS
Sbjct: 1   MEGSSYQRFPKVKIRELRDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS 60

Query: 78  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 137
           VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV
Sbjct: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 120

Query: 138 SSKDLYSSDHTVVPVDFSDSAAAAGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 197
           +SKDLYSSDHTVVPVDFSDSAAA GEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK
Sbjct: 121 TSKDLYSSDHTVVPVDFSDSAAATGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 180

Query: 198 WSPAATVTFMYEPSIHINEDLMETLTLEEKTSWVESSPTKVFELDPVTHQVLVVDPEAYT 257
           WSPAATVTFMYEPSIHINEDLMETLTLEEK +WVES PT+VFELD VTHQV+VVDPEAYT
Sbjct: 181 WSPAATVTFMYEPSIHINEDLMETLTLEEKRTWVESCPTRVFELDTVTHQVMVVDPEAYT 240

Query: 258 YDDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 317
           YDDEVIKKAEAMGKAGLVDITA+EDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS
Sbjct: 241 YDDEVIKKAEAMGKAGLVDITAREDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 300

Query: 318 DDTVEADDQFGELGAHMRG 337
           DDTVEADDQFGELGAHMRG
Sbjct: 301 DDTVEADDQFGELGAHMRG 319

BLAST of Cp4.1LG04g07860 vs. NCBI nr
Match: gi|1009149980|ref|XP_015892771.1| (PREDICTED: DNA-directed RNA polymerases II, IV and V subunit 3 [Ziziphus jujuba])

HSP 1 Score: 560.5 bits (1443), Expect = 2.1e-156
Identity = 282/319 (88.40%), Postives = 303/319 (94.98%), Query Frame = 1

Query: 18  MEGASYQRFPKVKIRELRDDYAKFELHDTDVSMANALRRVMIAEVPTIAIDLVEIEVNSS 77
           MEG SYQRFPKVKIRE++DDY KFEL +TD SMANALRRVMIAEVPTIAIDLVEIEVNSS
Sbjct: 1   MEGKSYQRFPKVKIREMKDDYLKFELRETDASMANALRRVMIAEVPTIAIDLVEIEVNSS 60

Query: 78  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 137
           VLNDEFIAHRLGL+PLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV
Sbjct: 61  VLNDEFIAHRLGLVPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 120

Query: 138 SSKDLYSSDHTVVPVDFSDSAAAAGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 197
           +SKDLYSSDHTVVPVDFSDSA    ++ + +GIIIVKLRRGQELRLRAIARKGIGKDHAK
Sbjct: 121 TSKDLYSSDHTVVPVDFSDSAGY--DSSENRGIIIVKLRRGQELRLRAIARKGIGKDHAK 180

Query: 198 WSPAATVTFMYEPSIHINEDLMETLTLEEKTSWVESSPTKVFELDPVTHQVLVVDPEAYT 257
           WSPAATVTFMYEP I INE+LM+TLT EEK +WV+SSPTKVF++DP THQV+VVDPEAYT
Sbjct: 181 WSPAATVTFMYEPDIRINEELMDTLTFEEKRNWVDSSPTKVFDIDPNTHQVVVVDPEAYT 240

Query: 258 YDDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 317
           YD+EVIKKA+AMGK GLVDITAKEDSFIFTVESTGAIKASQL+LNAI++LKQKLDAVRLS
Sbjct: 241 YDEEVIKKADAMGKHGLVDITAKEDSFIFTVESTGAIKASQLLLNAIEVLKQKLDAVRLS 300

Query: 318 DDTVEADDQFGELGAHMRG 337
           DDTVEADDQFGELGAHMRG
Sbjct: 301 DDTVEADDQFGELGAHMRG 317

BLAST of Cp4.1LG04g07860 vs. NCBI nr
Match: gi|590686859|ref|XP_007042502.1| (DNA-directed RNA polymerase family protein [Theobroma cacao])

HSP 1 Score: 556.6 bits (1433), Expect = 3.0e-155
Identity = 283/319 (88.71%), Postives = 303/319 (94.98%), Query Frame = 1

Query: 18  MEGASYQRFPKVKIRELRDDYAKFELHDTDVSMANALRRVMIAEVPTIAIDLVEIEVNSS 77
           MEG SYQRFPKVKIREL+DDYAKFEL DTD SMANALRRVMIAEVPTIAIDLVEIEVNSS
Sbjct: 1   MEGVSYQRFPKVKIRELKDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS 60

Query: 78  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 137
           VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKC +DQTLDV
Sbjct: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCMTDQTLDV 120

Query: 138 SSKDLYSSDHTVVPVDFSDSAAAAGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 197
           +SKDLYSSDHTVVPVDF+DSA    ++ + +GIIIVKLRRGQELRLRAIARKGIGKDHAK
Sbjct: 121 TSKDLYSSDHTVVPVDFTDSAGY--DSSEQRGIIIVKLRRGQELRLRAIARKGIGKDHAK 180

Query: 198 WSPAATVTFMYEPSIHINEDLMETLTLEEKTSWVESSPTKVFELDPVTHQVLVVDPEAYT 257
           WSPAATVTFMYEP IHINED+METLTLEEK S+VESSPT+VF++DP T QV+VVDPEAYT
Sbjct: 181 WSPAATVTFMYEPEIHINEDMMETLTLEEKQSFVESSPTRVFDIDPNTQQVVVVDPEAYT 240

Query: 258 YDDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 317
           YDDEV+KKAEAMGK GLV+I AKEDSFIFTVESTGAIKASQL+LNAI++LKQKLDAVRLS
Sbjct: 241 YDDEVLKKAEAMGKPGLVEIYAKEDSFIFTVESTGAIKASQLVLNAIEVLKQKLDAVRLS 300

Query: 318 DDTVEADDQFGELGAHMRG 337
           +DTVEADDQFGELGAHMRG
Sbjct: 301 EDTVEADDQFGELGAHMRG 317

BLAST of Cp4.1LG04g07860 vs. NCBI nr
Match: gi|823253251|ref|XP_012459245.1| (PREDICTED: DNA-directed RNA polymerases II, IV and V subunit 3-like [Gossypium raimondii])

HSP 1 Score: 555.8 bits (1431), Expect = 5.2e-155
Identity = 281/319 (88.09%), Postives = 304/319 (95.30%), Query Frame = 1

Query: 18  MEGASYQRFPKVKIRELRDDYAKFELHDTDVSMANALRRVMIAEVPTIAIDLVEIEVNSS 77
           MEGASYQRFP+VKIREL+DDYAKFELHDTD SMANALRRVMIAEVPTIAIDLVEIEVNSS
Sbjct: 1   MEGASYQRFPRVKIRELKDDYAKFELHDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS 60

Query: 78  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 137
           VLNDEFIAHRLGLIPLTS+RAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKC SDQTLDV
Sbjct: 61  VLNDEFIAHRLGLIPLTSQRAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCISDQTLDV 120

Query: 138 SSKDLYSSDHTVVPVDFSDSAAAAGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 197
           +SKDLYSSDHTVVPVDF+DSA    E+ + +GIIIVKLRRGQELRLRAIARKGIGKDHAK
Sbjct: 121 TSKDLYSSDHTVVPVDFTDSAGY--ESSEQRGIIIVKLRRGQELRLRAIARKGIGKDHAK 180

Query: 198 WSPAATVTFMYEPSIHINEDLMETLTLEEKTSWVESSPTKVFELDPVTHQVLVVDPEAYT 257
           WSPAATVTFMYEP IHINED+M+TLTLEEK S+V+SSPT+VF++DP T QV+VVDPEAYT
Sbjct: 181 WSPAATVTFMYEPEIHINEDMMDTLTLEEKQSFVDSSPTRVFDIDPNTQQVVVVDPEAYT 240

Query: 258 YDDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 317
           YDDEV+KKAEAMGK GLV+I AKEDSFIFT+ESTGA+KA QL+LNAI+ILKQKLDAVRLS
Sbjct: 241 YDDEVLKKAEAMGKPGLVEIYAKEDSFIFTIESTGAVKAFQLVLNAIEILKQKLDAVRLS 300

Query: 318 DDTVEADDQFGELGAHMRG 337
           +DTVEADDQFGELGAHMRG
Sbjct: 301 EDTVEADDQFGELGAHMRG 317

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
NRPB3_ARATH2.3e-14782.45DNA-directed RNA polymerases II, IV and V subunit 3 OS=Arabidopsis thaliana GN=N... [more]
RPD3B_ARATH1.2e-13577.04DNA-directed RNA polymerases IV and V subunit 3B OS=Arabidopsis thaliana GN=NRPD... [more]
RPB3_DICDI7.4e-6948.81DNA-directed RNA polymerase II subunit rpb3 OS=Dictyostelium discoideum GN=polr2... [more]
RPB3_BOVIN1.5e-4840.34DNA-directed RNA polymerase II subunit RPB3 OS=Bos taurus GN=POLR2C PE=1 SV=1[more]
RPB3_HUMAN7.2e-4840.00DNA-directed RNA polymerase II subunit RPB3 OS=Homo sapiens GN=POLR2C PE=1 SV=2[more]
Match NameE-valueIdentityDescription
A0A0A0LHS1_CUCSA2.6e-16996.24Uncharacterized protein OS=Cucumis sativus GN=Csa_2G093850 PE=4 SV=1[more]
A0A061E7U0_THECC2.1e-15588.71DNA-directed RNA polymerase family protein OS=Theobroma cacao GN=TCM_007121 PE=4... [more]
A0A0B0MXX2_GOSAR3.6e-15589.03DNA-directed RNA polymerase II subunit RPB3-A-like protein OS=Gossypium arboreum... [more]
A0A0B0P1L0_GOSAR3.6e-15588.09DNA-directed RNA polymerase II subunit RPB3-A-like protein OS=Gossypium arboreum... [more]
A0A0D2SZ13_GOSRA3.6e-15589.03Uncharacterized protein OS=Gossypium raimondii GN=B456_008G023500 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G15430.11.3e-14882.45 DNA-directed RNA polymerase family protein[more]
AT2G15400.16.8e-13777.04 DNA-directed RNA polymerase family protein[more]
AT1G60850.21.2e-2731.03 DNA-directed RNA polymerase family protein[more]
AT1G60620.13.1e-2530.55 RNA polymerase I subunit 43[more]
Match NameE-valueIdentityDescription
gi|659082285|ref|XP_008441761.1|1.3e-16996.24PREDICTED: LOW QUALITY PROTEIN: DNA-directed RNA polymerases II, IV and V subuni... [more]
gi|449442251|ref|XP_004138895.1|3.7e-16996.24PREDICTED: DNA-directed RNA polymerases II, IV and V subunit 3 [Cucumis sativus][more]
gi|1009149980|ref|XP_015892771.1|2.1e-15688.40PREDICTED: DNA-directed RNA polymerases II, IV and V subunit 3 [Ziziphus jujuba][more]
gi|590686859|ref|XP_007042502.1|3.0e-15588.71DNA-directed RNA polymerase family protein [Theobroma cacao][more]
gi|823253251|ref|XP_012459245.1|5.2e-15588.09PREDICTED: DNA-directed RNA polymerases II, IV and V subunit 3-like [Gossypium r... [more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0046983protein dimerization activity
GO:0003899DNA-directed RNA polymerase activity
GO:0003677DNA binding
Vocabulary: Biological Process
TermDefinition
GO:0006351transcription, DNA-templated
Vocabulary: INTERPRO
TermDefinition
IPR011263DNA-dir_RNA_pol_RpoA/D/Rpb3
IPR011262DNA-dir_RNA_pol_insert
IPR009025RBP11-like_dimer
IPR001514DNA-dir_RNA_pol_30-40kDasu_CS
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006144 purine nucleobase metabolic process
biological_process GO:0006206 pyrimidine nucleobase metabolic process
biological_process GO:0006351 transcription, DNA-templated
biological_process GO:0046487 glyoxylate metabolic process
biological_process GO:0019441 tryptophan catabolic process to kynurenine
cellular_component GO:0005730 nucleolus
molecular_function GO:0003677 DNA binding
molecular_function GO:0003899 DNA-directed RNA polymerase activity
molecular_function GO:0046983 protein dimerization activity
molecular_function GO:0004061 arylformamidase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG04g07860.1Cp4.1LG04g07860.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001514DNA-directed RNA polymerase, 30-40kDa subunit, conserved sitePROSITEPS00446RNA_POL_D_30KDcoord: 52..92
scor
IPR009025DNA-directed RNA polymerase, RBP11-like dimerisation domainPFAMPF01193RNA_pol_Lcoord: 40..311
score: 1.0
IPR009025DNA-directed RNA polymerase, RBP11-like dimerisation domainunknownSSF55257RBP11-like subunits of RNA polymerasecoord: 231..319
score: 3.73E-34coord: 27..68
score: 3.73
IPR011262DNA-directed RNA polymerase, insert domainGENE3DG3DSA:2.170.120.12coord: 64..196
score: 1.7
IPR011262DNA-directed RNA polymerase, insert domainPFAMPF01000RNA_pol_A_baccoord: 70..199
score: 2.2
IPR011262DNA-directed RNA polymerase, insert domainunknownSSF56553Insert subdomain of RNA polymerase alpha subunitcoord: 63..200
score: 5.06
IPR011263DNA-directed RNA polymerase, RpoA/D/Rpb3-typeSMARTSM00662rpoldneu2coord: 38..317
score: 1.3
NoneNo IPR availableGENE3DG3DSA:3.30.1360.10coord: 23..63
score: 5.2E-43coord: 197..315
score: 5.2
NoneNo IPR availablePANTHERPTHR11800DNA-DIRECTED RNA POLYMERASEcoord: 16..335
score: 2.2E
NoneNo IPR availablePANTHERPTHR11800:SF2DNA-DIRECTED RNA POLYMERASE II SUBUNIT RPB3coord: 16..335
score: 2.2E

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
Cp4.1LG04g07860CmaCh11G017270Cucurbita maxima (Rimu)cmacpeB154
Cp4.1LG04g07860CmoCh11G013870Cucurbita moschata (Rifu)cmocpeB126
Cp4.1LG04g07860Carg26852Silver-seed gourdcarcpeB0017
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cp4.1LG04g07860Cucurbita pepo (Zucchini)cpecpeB133
Cp4.1LG04g07860Cucurbita pepo (Zucchini)cpecpeB269
Cp4.1LG04g07860Cucurbita pepo (Zucchini)cpecpeB400
Cp4.1LG04g07860Cucurbita pepo (Zucchini)cpecpeB417
Cp4.1LG04g07860Cucurbita pepo (Zucchini)cpecpeB497
Cp4.1LG04g07860Cucurbita pepo (Zucchini)cpecpeB500
Cp4.1LG04g07860Cucumber (Gy14) v1cgycpeB0602
Cp4.1LG04g07860Cucurbita maxima (Rimu)cmacpeB539
Cp4.1LG04g07860Cucurbita maxima (Rimu)cmacpeB749
Cp4.1LG04g07860Cucurbita moschata (Rifu)cmocpeB321
Cp4.1LG04g07860Cucurbita moschata (Rifu)cmocpeB494
Cp4.1LG04g07860Cucurbita moschata (Rifu)cmocpeB703
Cp4.1LG04g07860Wild cucumber (PI 183967)cpecpiB680
Cp4.1LG04g07860Wild cucumber (PI 183967)cpecpiB697
Cp4.1LG04g07860Cucumber (Chinese Long) v2cpecuB677
Cp4.1LG04g07860Cucumber (Chinese Long) v2cpecuB695
Cp4.1LG04g07860Bottle gourd (USVL1VR-Ls)cpelsiB543
Cp4.1LG04g07860Bottle gourd (USVL1VR-Ls)cpelsiB552
Cp4.1LG04g07860Bottle gourd (USVL1VR-Ls)cpelsiB554
Cp4.1LG04g07860Watermelon (Charleston Gray)cpewcgB590
Cp4.1LG04g07860Watermelon (Charleston Gray)cpewcgB611
Cp4.1LG04g07860Watermelon (Charleston Gray)cpewcgB624
Cp4.1LG04g07860Watermelon (97103) v1cpewmB665
Cp4.1LG04g07860Melon (DHL92) v3.5.1cpemeB604
Cp4.1LG04g07860Cucumber (Gy14) v2cgybcpeB237
Cp4.1LG04g07860Cucumber (Gy14) v2cgybcpeB423
Cp4.1LG04g07860Cucumber (Gy14) v2cgybcpeB969
Cp4.1LG04g07860Melon (DHL92) v3.6.1cpemedB715
Cp4.1LG04g07860Melon (DHL92) v3.6.1cpemedB746
Cp4.1LG04g07860Melon (DHL92) v3.6.1cpemedB764
Cp4.1LG04g07860Melon (DHL92) v3.6.1cpemedB771
Cp4.1LG04g07860Melon (DHL92) v3.6.1cpemedB772
Cp4.1LG04g07860Silver-seed gourdcarcpeB0266
Cp4.1LG04g07860Silver-seed gourdcarcpeB0409
Cp4.1LG04g07860Silver-seed gourdcarcpeB0529
Cp4.1LG04g07860Silver-seed gourdcarcpeB0639
Cp4.1LG04g07860Cucumber (Chinese Long) v3cpecucB0834
Cp4.1LG04g07860Cucumber (Chinese Long) v3cpecucB0846
Cp4.1LG04g07860Cucumber (Chinese Long) v3cpecucB0847
Cp4.1LG04g07860Cucumber (Chinese Long) v3cpecucB0868
Cp4.1LG04g07860Wax gourdcpewgoB0841
Cp4.1LG04g07860Wax gourdcpewgoB0845
Cp4.1LG04g07860Wax gourdcpewgoB0860
Cp4.1LG04g07860Wax gourdcpewgoB0879