Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
TGTAGAGACGTTTGTTTGTTTGTTTGTTTGATTTTCAGAGTGTCAAAGAGAAACCAAAGACGAAACTTATTTTTATTTTTATTTTTATTTTTCACATTCGTTCGCCGTCTCCCCTTCAAGACTTCTTCTTCTATACGCTCGCTTCATCTTCTCTCTTTCTCTCTATCTCTCTCTCAGCCGTTCCGTTTCTCTCTGCCGTTTTACGTTCTCCCCGCCACCGTCACGGCGGCTGAAGCTGACCGAACCCAACCCAACCCAACAAACAGACAAAAACCCTATTCGCTATTCCGAACAACCCAAGCCTCATTTTCTCTCATTCCCAATCCCTTTCCCTCTCTCTCTCTCTCTCTCTCTCTCTCTCCGCCGCCGCTTTCTTTCTTCACACTCTGAAACTCGCTCTGTCTATCTATTTTGTTGTTGCCCTCTTCAGACGCATTTCTTTTTCTTTTGGGGATATACAGCAACCACTGGGAGGCAGTTATTGAAACAGTTGCGCCTTCTGGGGTTCGTCAATCAGTCTTGAGGGTAGGGTTAGTTCTTCGTCTTCATCGGATATCTCACAGGTTATATTATCTATTTCTTGTGTATTTTTCTTGCGGGTTTGGACCTCTGCGGAAGTTTTTTTTTTCGGAGGGTGCATATTTGGTTTAGGGTTTCTGATATTTTTCTTTTGGGCACATTCAATCGGAGTTCGGATTCTGGTTATTCCAATTTCTTATTTCGTTAGGGTTTCTTTTTGTTCTCTGATCGAACTGTTTTGACGGGGACAATCTTCGGTTTTGGAATTGCTTCATCTCTTTATTCTTGAGGTATACCTTCTTTATATTTCTGCATCTCAACCGATTCCCTCAAAGATGAGGGTTTTCCGAGCGTGTTTTGCTGACCGAATCTACACTTAATCGAACACGGAGGCCGGGAGACGATAGTGAGGGCCTTTCCTCTTTTACCATTCTTTTCAATTGAACCGCTTCATCTTATCCCTGCTATTTCCTCTGTTTTCGTCATCTGATGTTCCACTCCCGTATGAGAATTAGGTTATATTGATTTTGCACATAAGATAAGATAAATCCCGTTCAGAATGCGTTCAGCTTGTAGCCAAATTGCTAGCTACTTCCTTGAATTCCGGCGGTCGTGAAGTAGGATAGTTGTAGAAGATAGACTTCTTTAGTACCATTCAGGACATAAAGTTTGAGAACCATCCTCTGGGTTCACTACCGTGCGCGAGTTTGCAACTCTGAAGCAATTCACTGCGCATGTTTATCTTTGCTGGACCCCTGGATATGCCATAAAGTTGATGACACATATGGTTTGTTGCCGTGACTTGCATAGTCAGGGATGCTCGTGAAAGTTTATCCAGCTTAAAATAGTGGTTAGGCATTTGGAATAATAAAGTACAGGCTAGGGTTTTATAAATTCCTCTGCACAAGGAGTACAGTTTTTGGAAATTTGAGGTGGTAAAGAAGGCAAGAAGTGCTTTTTCTTCTGATATTTTGATCATGTTCAATATTCCTTATGGAAAGAAGTGAACCCACATTAGTTCCAGAATGGTTGAGAAGTACTGGAAGCGTTACTGGTGGCAGCAATTCAAACCACCATTTTCCGTCGTCTTCTTCCCACTCAGGTACTGCTTTATTTGTTTCTTTCTTTTCTTTTTTTATTGTTGCATGCTTCCAATAAAGATGCGGTCACTTACTACTGTCTTGCCTCTGTTTTGATATTGTGAACGCTGTTAATGTGTATCCTGTGCTGTTGTGTAGATGTGCCCTCTCTATCTCAATCGAGAAATAGAATTTCCAAGACCACTGGCGATTTTGACACTTCGCGTTCTGCTTTTCTGGATCGGACATCTTCATCAAATTCAAGGAGAAGTTCAAGCAATGGTTCTTCCAAACATGCATATAGTAGCTTTAACAGGGGTCATCGTGATAAGGATCGTGAGAAAGAAAAGGATAGGTTAAACTTCGGGGATAATTGGGACCGTGACTCTCATGACCCTCTGGGGAAGATTCTTTCCAACAGGATTGATAAGGATGCTTTGCGGCGGTCCCATTCAATGGTATCCAGGAAGCAAGGTGAGCGCGAGTTGTTTCATAGAAGAGGTGCAACAGAATTAAAAAGTCACAACAATAGCAATGGTATTCTTTCTGGAACTAGTGTCGGCAGTAGCATTCAGAAAGCTGTATTTGAAAAGGATTTCCCATCGCTGGGATCTGAAGAAAAGCAGGGAGCATCAGAAATTGGAAGAGTTTCATCTCCTGGTTTGAGCTCGCCAGTTCAAAGCTTGCCTATTGGCAATTCAGCCTTAATTGTCGGTGGAGAGGGATGGACCTCTGCTCTTGCTGAGGTGCCCAGTATGATTGGAAACACCACAGGGTCATCGTCATTTCAACAAACTGTTCCTGCTACATCAGGGGCAGGGCCTCTGAGCATGACAGCTGGACTTAATATGGCTGAAGCGTTGGTGCAGGCTCCATCTCGAGCTCGTGCTGCTCCCCAGGTATCTGAGGTACCATACTAACCTTCTTGTGCTGTATATTATGTTCTCCTTTTGATCTAACAGTCTCATCCTTTACCCCCACTTTGGTAGTTATCTGTCAAGACCCAGAGGCTTGAGGAGTTGGCTATTAAACAGTCCAGGCAATTAATACCAGTGACGCCTTCTATGCCAAAAGCTATGGTAATTGGAGCTAATGCAATTGTTGAATCTTTGTTGTTAGATCTTCTGCTACTTCCGTATATTTAAATTATTTGGTGTTTGTTGATTTGAAAATTTAAAGGTCCGTAATTAAAATATATAAGAGAACTGAGAACCATCTGCACGGTACCTAGACAACTTGTGCCAATCTCTGATTTCAGCTGGGACTAATACTCTCTTCGAAGCTACTTTTGTGTTGTAGTATTAGGCACTTAAGATGTATAGATAAAAAAAAGTCATTTTTGGGTAAATTATACTGAAAACTATGTGACAAAGATACTATATGTTTTGACATTTTTCTGCATGTGCTGTTAGTGGTTGGTGACTGGTAATATAAATTTTTGGATTTAGGTTGTTAAATAAACTCGGCTACTACATTTTAGGAACTGTATATTTCTTGTGGACGTTTGTTATTCAAGTGTTCTATCTCTTCAACTTACTTGTTGACTAAAGAACAGTGTTTATGTGTTAGATAGCGGAAGTCAAAACATTTGTTAAAAGTTCCAAAGTCATTTGCAATGTGGCTAGATAACTAGACCACTTTCCCCCCTTCTATTTGATGAGAAACTCACTTTAAGGACGATTGGAAGAACAAAATACAAGAGCCCCAGAACAAGAGAGGGAGGGTTAGAGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGTGGTGGCGTGCAGAGGGCTTCTAAACCATCTAAATCTAATGCGTTCTTTGATTTAGGAATTGGTTATTTTAAATTCTTCAAAGGGTTTTGTTTTTGTTTTCTTGCTTTGACCCTTTTGGAAATTTGTAAGAACTAGATGATAGGAAGGAGATAACCAGATTTGATTTTGGATCCCATTTAATAGTGTGCATATATGTGCTTTGCTACTTTTGACACTTTGCAGACTGAATTATCTATGGTTAACCATTTTTTATTCTCTCTCAAGGATGGAAAACGCCAGTGTTAGATTAAATATGCTTTAAAGGAAGTGTGAAGTTTCTATTAGTAACGTTCTTCATTTGAGATTGTTGTGCTGTAGGCAATATGTCACTGATGGCTAGTATTCTTTGTTTGTTAACTTGTTATATTTGGTAGGAAAGTTTCTGTTAACTGATTGCCAACCCCTCTTCAGTTTTAATAGTTGTAACTTGACTGAGCATTCCTCTTCTGAAAGGTTCTCATTTGTGTTCTGTTTTGTTTTTTGAAGCTGATGTGTAAATTGTTGTTGAGTTTCAGTATGGTGATGTACTAAATTTGCTTAGCAGTATAGTTCTTAAAATAGTCTTAGAACAAAAAAAACGTATATTATTCTGTTGTGTTTTATTTAATGATTTTAAAAATAATTTATTCGCATCTGATCAATAATCTTTTAAAAGTTTAAATAAATGGATTTAAATATGTTATTCTTATTTACATTCTAAAGAAATAAATGAAAATTAAAAAGCGGACCATATCTTTGCTGATAATCTTTAGTAGACTTGTCTGTAAAAACAATTTTGATAAAATCCATCCAGGAAGTTCTCCACCCTTGTAATATGAATATAGTTAGTTTTTACATTTTGAAAAATTCCCTCTATACCATGGTAAAATTCTAATACGGAATATTGATTTTGTCTAGGAATACACATCTCTCATTATTATATCTAACTGCAAATCTTCTGTTTTCCCCCTTAGATAATCATTAGCGTTTTTGTGTTTTTTGTTTGACAGGTGCTTAATTCTTCGGATAAATCAAAGCCCAAACTAGCATCAAGAACTGGAGAACTTAATGTAGCCATCAAGGGTGGACAGCCACCGCCCTTGTCAGTTCATGCCAACCAATCTCGTGGAGGACATGTCAAGTCCGATGCTCAAAAGATTTCTCATGGGAAGTTTCTTGTTTTGAAACCTGTACGAGAAAATGGTCTCTCCCTTGCAGCAAAGGATGTTTCAAGTCCGACTAGTAATGCAAACAGCATGGCAGCAAACAGCCAGTTCGCTCTTGCACCTTCAGTTCCACATGCTCCTTTGAGAAGCCCAAATAATACAAATGTTTCTTCTGTGGAGCGCAAAATTGCTAGCTTAGATCTCAAATCCGGAACAACTTTGGAAAAAAGACCGTCCTTATCTCAAGTCCAGAGCCGGAATGATTTCTTTAACCTCATTAAGAAGAAAACTTCACTGAGTTCTTCTGCTGTTCTCTCGGATTCATGCTCTTCTGTGAAATCTCCTTCAATTGGCCAATCTAGTGAACTAACAAGGGAAGAAATCGACATGCCTGCAAGTCCTCGTGTTATTGAAAATGGTACTGTGGAGAATAGAAATGGAGATAGTTCTGAAGAGGTTCGAGCATCTTGTGACAGTGGTGAAAAAACTGAGAGCCACGTTACTGCAGAATCTCTAGATGAAGAGGAGGCTGCTTTTCTTCGTTCTCTTGGCTGGGATGAAAACTGTGGTGAGGACGAAGGCCTTACTGAAGAAGAAATCAATTCTTTCTATCGGGAGGTAAACAGATACTTCTATTGCTTTCATTAGTTCCCCACTACAATGCTTTTCAAATACTGTTCTTTTCTCTGTTAGCTGATGAAGTTGTAAATATAATTCTTAATTGCAGTACATGAACTTGAAGCCATCTCTAAAAATGGGCCGATGCATTCAGCCAAAGATATTTGTGCCATCTGAATCTCATGAGGACAGCAAGGATGGAGGAGCCGGTTCTGAATTGAGCTCATCTGACTCGGAAGCCTGA
mRNA sequence
TGTAGAGACGTTTGTTTGTTTGTTTGTTTGATTTTCAGAGTGTCAAAGAGAAACCAAAGACGAAACTTATTTTTATTTTTATTTTTATTTTTCACATTCGTTCGCCGTCTCCCCTTCAAGACTTCTTCTTCTATACGCTCGCTTCATCTTCTCTCTTTCTCTCTATCTCTCTCTCAGCCGTTCCGTTTCTCTCTGCCGTTTTACGTTCTCCCCGCCACCGTCACGGCGGCTGAAGCTGACCGAACCCAACCCAACCCAACAAACAGACAAAAACCCTATTCGCTATTCCGAACAACCCAAGCCTCATTTTCTCTCATTCCCAATCCCTTTCCCTCTCTCTCTCTCTCTCTCTCTCTCTCTCCGCCGCCGCTTTCTTTCTTCACACTCTGAAACTCGCTCTGTCTATCTATTTTGTTGTTGCCCTCTTCAGACGCATTTCTTTTTCTTTTGGGGATATACAGCAACCACTGGGAGGCAGTTATTGAAACAGTTGCGCCTTCTGGGGTTCGTCAATCAGTCTTGAGGGTAGGGTTAGTTCTTCGTCTTCATCGGATATCTCACAGGTTATATTATCTATTTCTTGTGTATTTTTCTTGCGGGTTTGGACCTCTGCGGAAGTTTTTTTTTTCGGAGGGTGCATATTTGGTTTAGGGTTTCTGATATTTTTCTTTTGGGCACATTCAATCGGAGTTCGGATTCTGGTTATTCCAATTTCTTATTTCGTTAGGGTTTCTTTTTGTTCTCTGATCGAACTGTTTTGACGGGGACAATCTTCGGTTTTGGAATTGCTTCATCTCTTTATTCTTGAGGTATACCTTCTTTATATTTCTGCATCTCAACCGATTCCCTCAAAGATGAGGGTTTTCCGAGCGTGTTTTGCTGACCGAATCTACACTTAATCGAACACGGAGGCCGGGAGACGATAGTGAGGGCCTTTCCTCTTTTACCATTCTTTTCAATTGAACCGCTTCATCTTATCCCTGCTATTTCCTCTGTTTTCGTCATCTGATGTTCCACTCCCGTATGAGAATTAGGTTATATTGATTTTGCACATAAGATAAGATAAATCCCGTTCAGAATGCGTTCAGCTTGTAGCCAAATTGCTAGCTACTTCCTTGAATTCCGGCGGTCGTGAAGTAGGATAGTTGTAGAAGATAGACTTCTTTAGTACCATTCAGGACATAAAGTTTGAGAACCATCCTCTGGGTTCACTACCGTGCGCGAGTTTGCAACTCTGAAGCAATTCACTGCGCATGTTTATCTTTGCTGGACCCCTGGATATGCCATAAAGTTGATGACACATATGGTTTGTTGCCGTGACTTGCATAGTCAGGGATGCTCGTGAAAGTTTATCCAGCTTAAAATAGTGGTTAGGCATTTGGAATAATAAAGTACAGGCTAGGGTTTTATAAATTCCTCTGCACAAGGAGTACAGTTTTTGGAAATTTGAGGTGGTAAAGAAGGCAAGAAGTGCTTTTTCTTCTGATATTTTGATCATGTTCAATATTCCTTATGGAAAGAAGTGAACCCACATTAGTTCCAGAATGGTTGAGAAGTACTGGAAGCGTTACTGGTGGCAGCAATTCAAACCACCATTTTCCGTCGTCTTCTTCCCACTCAGATGTGCCCTCTCTATCTCAATCGAGAAATAGAATTTCCAAGACCACTGGCGATTTTGACACTTCGCGTTCTGCTTTTCTGGATCGGACATCTTCATCAAATTCAAGGAGAAGTTCAAGCAATGGTTCTTCCAAACATGCATATAGTAGCTTTAACAGGGGTCATCGTGATAAGGATCGTGAGAAAGAAAAGGATAGGTTAAACTTCGGGGATAATTGGGACCGTGACTCTCATGACCCTCTGGGGAAGATTCTTTCCAACAGGATTGATAAGGATGCTTTGCGGCGGTCCCATTCAATGGTATCCAGGAAGCAAGGTGAGCGCGAGTTGTTTCATAGAAGAGGTGCAACAGAATTAAAAAGTCACAACAATAGCAATGGTATTCTTTCTGGAACTAGTGTCGGCAGTAGCATTCAGAAAGCTGTATTTGAAAAGGATTTCCCATCGCTGGGATCTGAAGAAAAGCAGGGAGCATCAGAAATTGGAAGAGTTTCATCTCCTGGTTTGAGCTCGCCAGTTCAAAGCTTGCCTATTGGCAATTCAGCCTTAATTGTCGGTGGAGAGGGATGGACCTCTGCTCTTGCTGAGGTGCCCAGTATGATTGGAAACACCACAGGGTCATCGTCATTTCAACAAACTGTTCCTGCTACATCAGGGGCAGGGCCTCTGAGCATGACAGCTGGACTTAATATGGCTGAAGCGTTGGTGCAGGCTCCATCTCGAGCTCGTGCTGCTCCCCAGGTATCTGAGTTATCTGTCAAGACCCAGAGGCTTGAGGAGTTGGCTATTAAACAGTCCAGGCAATTAATACCAGTGACGCCTTCTATGCCAAAAGCTATGGTAATTGGAGCTAATGCAATTGTTGAATCTTTGTTGTTAGATCTTCTGCTACTTCCGTTGTTAAATAAACTCGGCTACTACATTTTAGGAACTGTATATTTCTTGTGGACGTTTGTTATTCAAGTGCTTAATTCTTCGGATAAATCAAAGCCCAAACTAGCATCAAGAACTGGAGAACTTAATGTAGCCATCAAGGGTGGACAGCCACCGCCCTTGTCAGTTCATGCCAACCAATCTCGTGGAGGACATGTCAAGTCCGATGCTCAAAAGATTTCTCATGGGAAGTTTCTTGTTTTGAAACCTGTACGAGAAAATGGTCTCTCCCTTGCAGCAAAGGATGTTTCAAGTCCGACTAGTAATGCAAACAGCATGGCAGCAAACAGCCAGTTCGCTCTTGCACCTTCAGTTCCACATGCTCCTTTGAGAAGCCCAAATAATACAAATGTTTCTTCTGTGGAGCGCAAAATTGCTAGCTTAGATCTCAAATCCGGAACAACTTTGGAAAAAAGACCGTCCTTATCTCAAGTCCAGAGCCGGAATGATTTCTTTAACCTCATTAAGAAGAAAACTTCACTGAGTTCTTCTGCTGTTCTCTCGGATTCATGCTCTTCTGTGAAATCTCCTTCAATTGGCCAATCTAGTGAACTAACAAGGGAAGAAATCGACATGCCTGCAAGTCCTCGTGTTATTGAAAATGGTACTGTGGAGAATAGAAATGGAGATAGTTCTGAAGAGGTTCGAGCATCTTGTGACAGTGGTGAAAAAACTGAGAGCCACGTTACTGCAGAATCTCTAGATGAAGAGGAGGCTGCTTTTCTTCGTTCTCTTGGCTGGGATGAAAACTGTGGTGAGGACGAAGGCCTTACTGAAGAAGAAATCAATTCTTTCTATCGGGAGTACATGAACTTGAAGCCATCTCTAAAAATGGGCCGATGCATTCAGCCAAAGATATTTGTGCCATCTGAATCTCATGAGGACAGCAAGGATGGAGGAGCCGGTTCTGAATTGAGCTCATCTGACTCGGAAGCCTGA
Coding sequence (CDS)
ATGGAAAGAAGTGAACCCACATTAGTTCCAGAATGGTTGAGAAGTACTGGAAGCGTTACTGGTGGCAGCAATTCAAACCACCATTTTCCGTCGTCTTCTTCCCACTCAGATGTGCCCTCTCTATCTCAATCGAGAAATAGAATTTCCAAGACCACTGGCGATTTTGACACTTCGCGTTCTGCTTTTCTGGATCGGACATCTTCATCAAATTCAAGGAGAAGTTCAAGCAATGGTTCTTCCAAACATGCATATAGTAGCTTTAACAGGGGTCATCGTGATAAGGATCGTGAGAAAGAAAAGGATAGGTTAAACTTCGGGGATAATTGGGACCGTGACTCTCATGACCCTCTGGGGAAGATTCTTTCCAACAGGATTGATAAGGATGCTTTGCGGCGGTCCCATTCAATGGTATCCAGGAAGCAAGGTGAGCGCGAGTTGTTTCATAGAAGAGGTGCAACAGAATTAAAAAGTCACAACAATAGCAATGGTATTCTTTCTGGAACTAGTGTCGGCAGTAGCATTCAGAAAGCTGTATTTGAAAAGGATTTCCCATCGCTGGGATCTGAAGAAAAGCAGGGAGCATCAGAAATTGGAAGAGTTTCATCTCCTGGTTTGAGCTCGCCAGTTCAAAGCTTGCCTATTGGCAATTCAGCCTTAATTGTCGGTGGAGAGGGATGGACCTCTGCTCTTGCTGAGGTGCCCAGTATGATTGGAAACACCACAGGGTCATCGTCATTTCAACAAACTGTTCCTGCTACATCAGGGGCAGGGCCTCTGAGCATGACAGCTGGACTTAATATGGCTGAAGCGTTGGTGCAGGCTCCATCTCGAGCTCGTGCTGCTCCCCAGGTATCTGAGTTATCTGTCAAGACCCAGAGGCTTGAGGAGTTGGCTATTAAACAGTCCAGGCAATTAATACCAGTGACGCCTTCTATGCCAAAAGCTATGGTAATTGGAGCTAATGCAATTGTTGAATCTTTGTTGTTAGATCTTCTGCTACTTCCGTTGTTAAATAAACTCGGCTACTACATTTTAGGAACTGTATATTTCTTGTGGACGTTTGTTATTCAAGTGCTTAATTCTTCGGATAAATCAAAGCCCAAACTAGCATCAAGAACTGGAGAACTTAATGTAGCCATCAAGGGTGGACAGCCACCGCCCTTGTCAGTTCATGCCAACCAATCTCGTGGAGGACATGTCAAGTCCGATGCTCAAAAGATTTCTCATGGGAAGTTTCTTGTTTTGAAACCTGTACGAGAAAATGGTCTCTCCCTTGCAGCAAAGGATGTTTCAAGTCCGACTAGTAATGCAAACAGCATGGCAGCAAACAGCCAGTTCGCTCTTGCACCTTCAGTTCCACATGCTCCTTTGAGAAGCCCAAATAATACAAATGTTTCTTCTGTGGAGCGCAAAATTGCTAGCTTAGATCTCAAATCCGGAACAACTTTGGAAAAAAGACCGTCCTTATCTCAAGTCCAGAGCCGGAATGATTTCTTTAACCTCATTAAGAAGAAAACTTCACTGAGTTCTTCTGCTGTTCTCTCGGATTCATGCTCTTCTGTGAAATCTCCTTCAATTGGCCAATCTAGTGAACTAACAAGGGAAGAAATCGACATGCCTGCAAGTCCTCGTGTTATTGAAAATGGTACTGTGGAGAATAGAAATGGAGATAGTTCTGAAGAGGTTCGAGCATCTTGTGACAGTGGTGAAAAAACTGAGAGCCACGTTACTGCAGAATCTCTAGATGAAGAGGAGGCTGCTTTTCTTCGTTCTCTTGGCTGGGATGAAAACTGTGGTGAGGACGAAGGCCTTACTGAAGAAGAAATCAATTCTTTCTATCGGGAGTACATGAACTTGAAGCCATCTCTAAAAATGGGCCGATGCATTCAGCCAAAGATATTTGTGCCATCTGAATCTCATGAGGACAGCAAGGATGGAGGAGCCGGTTCTGAATTGAGCTCATCTGACTCGGAAGCCTGA
Protein sequence
MERSEPTLVPEWLRSTGSVTGGSNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRSAFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDSHDPLGKILSNRIDKDALRRSHSMVSRKQGERELFHRRGATELKSHNNSNGILSGTSVGSSIQKAVFEKDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGNTTGSSSFQQTVPATSGAGPLSMTAGLNMAEALVQAPSRARAAPQVSELSVKTQRLEELAIKQSRQLIPVTPSMPKAMVIGANAIVESLLLDLLLLPLLNKLGYYILGTVYFLWTFVIQVLNSSDKSKPKLASRTGELNVAIKGGQPPPLSVHANQSRGGHVKSDAQKISHGKFLVLKPVRENGLSLAAKDVSSPTSNANSMAANSQFALAPSVPHAPLRSPNNTNVSSVERKIASLDLKSGTTLEKRPSLSQVQSRNDFFNLIKKKTSLSSSAVLSDSCSSVKSPSIGQSSELTREEIDMPASPRVIENGTVENRNGDSSEEVRASCDSGEKTESHVTAESLDEEEAAFLRSLGWDENCGEDEGLTEEEINSFYREYMNLKPSLKMGRCIQPKIFVPSESHEDSKDGGAGSELSSSDSEA
Homology
BLAST of Lsi09G009760 vs. ExPASy TrEMBL
Match:
A0A5D3DT29 (Mediator of RNA polymerase II transcription subunit 1 isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold313G001070 PE=4 SV=1)
HSP 1 Score: 1045.8 bits (2703), Expect = 7.5e-302
Identity = 578/659 (87.71%), Postives = 595/659 (90.29%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSTGSVTGGSNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS 60
MERSEPTLVPEWLRSTGSVTGG NSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS
Sbjct: 1 MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS 60
Query: 61 AFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDSHDPLGKI 120
+FLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRD+HDPLGKI
Sbjct: 61 SFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDAHDPLGKI 120
Query: 121 LSNRIDKDALRRSHSMVSRKQGERELFHRRGATELKSHNNSNGILSGTSVGSSIQKAVFE 180
LSNRIDKDALRRSHSMVSRKQG ELFHRR TELKSHN+SNGILSGTSVGSSIQKAVFE
Sbjct: 121 LSNRIDKDALRRSHSMVSRKQG--ELFHRRVGTELKSHNSSNGILSGTSVGSSIQKAVFE 180
Query: 181 KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGNT 240
KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIG+T
Sbjct: 181 KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST 240
Query: 241 TGSSSFQQTVPATSGAGPLSMTAGLNMAEALVQAPSRARAAPQVSELSVKTQRLEELAIK 300
GSSSFQQTVPATSGAGPLS+TAGLNMAEALVQ+PSR R APQVSELSVKTQRLEELAIK
Sbjct: 241 PGSSSFQQTVPATSGAGPLSVTAGLNMAEALVQSPSRTRTAPQVSELSVKTQRLEELAIK 300
Query: 301 QSRQLIPVTPSMPKAMVIGANAIVESLLLDLLLLPLLNKLGYYILGTVYFLWTFVIQVLN 360
QSRQLIPVTPSMPKAM VL+
Sbjct: 301 QSRQLIPVTPSMPKAM-----------------------------------------VLS 360
Query: 361 SSDKSKPKLASRTGELNVAIKGGQPPPLSVHANQSRGGHVKSDAQKISHGKFLVLKPVRE 420
SSDKSKPKLASRTGELN IKGGQP P SVHANQSR GHVK DAQK SHGKFLVLKPVRE
Sbjct: 361 SSDKSKPKLASRTGELNATIKGGQPQPSSVHANQSRVGHVKPDAQKSSHGKFLVLKPVRE 420
Query: 421 NGLSLAAKDVSSPTSNANSMAANSQFALAPSVPHAPLRSPNNTNVSSVERKIASLDLKSG 480
NG+SLAAKDVSSPTSNANSMAANSQFALAPSVPHAPLRSPNNTNVSSVERKIASLDLK+G
Sbjct: 421 NGVSLAAKDVSSPTSNANSMAANSQFALAPSVPHAPLRSPNNTNVSSVERKIASLDLKTG 480
Query: 481 TTLEKRPSLSQVQSRNDFFNLIKKKTSLSSSAVLSDSCSSVKSPSIGQSSELTREEIDMP 540
TTLEKRPSLSQVQSRNDFFNLIKKKTS+SSSAVLSDSCSSVKSPSIGQS+ELT EE+ +P
Sbjct: 481 TTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVLSDSCSSVKSPSIGQSNELTSEEMGIP 540
Query: 541 ASPRVIENGTVENRNGDSSEEVRASCDSGEKTESHVTAESLDEEEAAFLRSLGWDENCGE 600
ASPRVIENG VENRNG+SSEEV+ S DSGEKTESHV AESLDEEEAAFLRSLGWDE+CGE
Sbjct: 541 ASPRVIENGAVENRNGNSSEEVQISRDSGEKTESHVAAESLDEEEAAFLRSLGWDESCGE 600
Query: 601 DEGLTEEEINSFYREYMNLKPSLKMGRCIQPKIFVPSESHEDSKDGGAGSELSSSDSEA 660
DEGLTEEEINSFYREYMNLKPSLK+GRCIQPKIFVPSES EDSKD GAGSELSSSDSEA
Sbjct: 601 DEGLTEEEINSFYREYMNLKPSLKIGRCIQPKIFVPSESREDSKDDGAGSELSSSDSEA 616
BLAST of Lsi09G009760 vs. ExPASy TrEMBL
Match:
A0A1S3CDT9 (mediator of RNA polymerase II transcription subunit 1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103499274 PE=4 SV=1)
HSP 1 Score: 1045.8 bits (2703), Expect = 7.5e-302
Identity = 578/659 (87.71%), Postives = 595/659 (90.29%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSTGSVTGGSNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS 60
MERSEPTLVPEWLRSTGSVTGG NSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS
Sbjct: 1 MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS 60
Query: 61 AFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDSHDPLGKI 120
+FLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRD+HDPLGKI
Sbjct: 61 SFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDAHDPLGKI 120
Query: 121 LSNRIDKDALRRSHSMVSRKQGERELFHRRGATELKSHNNSNGILSGTSVGSSIQKAVFE 180
LSNRIDKDALRRSHSMVSRKQG ELFHRR TELKSHN+SNGILSGTSVGSSIQKAVFE
Sbjct: 121 LSNRIDKDALRRSHSMVSRKQG--ELFHRRVGTELKSHNSSNGILSGTSVGSSIQKAVFE 180
Query: 181 KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGNT 240
KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIG+T
Sbjct: 181 KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST 240
Query: 241 TGSSSFQQTVPATSGAGPLSMTAGLNMAEALVQAPSRARAAPQVSELSVKTQRLEELAIK 300
GSSSFQQTVPATSGAGPLS+TAGLNMAEALVQ+PSR R APQVSELSVKTQRLEELAIK
Sbjct: 241 PGSSSFQQTVPATSGAGPLSVTAGLNMAEALVQSPSRTRTAPQVSELSVKTQRLEELAIK 300
Query: 301 QSRQLIPVTPSMPKAMVIGANAIVESLLLDLLLLPLLNKLGYYILGTVYFLWTFVIQVLN 360
QSRQLIPVTPSMPKAM VL+
Sbjct: 301 QSRQLIPVTPSMPKAM-----------------------------------------VLS 360
Query: 361 SSDKSKPKLASRTGELNVAIKGGQPPPLSVHANQSRGGHVKSDAQKISHGKFLVLKPVRE 420
SSDKSKPKLASRTGELN IKGGQP P SVHANQSR GHVK DAQK SHGKFLVLKPVRE
Sbjct: 361 SSDKSKPKLASRTGELNATIKGGQPQPSSVHANQSRVGHVKPDAQKSSHGKFLVLKPVRE 420
Query: 421 NGLSLAAKDVSSPTSNANSMAANSQFALAPSVPHAPLRSPNNTNVSSVERKIASLDLKSG 480
NG+SLAAKDVSSPTSNANSMAANSQFALAPSVPHAPLRSPNNTNVSSVERKIASLDLK+G
Sbjct: 421 NGVSLAAKDVSSPTSNANSMAANSQFALAPSVPHAPLRSPNNTNVSSVERKIASLDLKTG 480
Query: 481 TTLEKRPSLSQVQSRNDFFNLIKKKTSLSSSAVLSDSCSSVKSPSIGQSSELTREEIDMP 540
TTLEKRPSLSQVQSRNDFFNLIKKKTS+SSSAVLSDSCSSVKSPSIGQS+ELT EE+ +P
Sbjct: 481 TTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVLSDSCSSVKSPSIGQSNELTSEEMGIP 540
Query: 541 ASPRVIENGTVENRNGDSSEEVRASCDSGEKTESHVTAESLDEEEAAFLRSLGWDENCGE 600
ASPRVIENG VENRNG+SSEEV+ S DSGEKTESHV AESLDEEEAAFLRSLGWDE+CGE
Sbjct: 541 ASPRVIENGAVENRNGNSSEEVQISRDSGEKTESHVAAESLDEEEAAFLRSLGWDESCGE 600
Query: 601 DEGLTEEEINSFYREYMNLKPSLKMGRCIQPKIFVPSESHEDSKDGGAGSELSSSDSEA 660
DEGLTEEEINSFYREYMNLKPSLK+GRCIQPKIFVPSES EDSKD GAGSELSSSDSEA
Sbjct: 601 DEGLTEEEINSFYREYMNLKPSLKIGRCIQPKIFVPSESREDSKDDGAGSELSSSDSEA 616
BLAST of Lsi09G009760 vs. ExPASy TrEMBL
Match:
A0A1S3CC42 (mediator of RNA polymerase II transcription subunit 1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103499274 PE=4 SV=1)
HSP 1 Score: 1035.8 bits (2677), Expect = 7.8e-299
Identity = 575/659 (87.25%), Postives = 592/659 (89.83%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSTGSVTGGSNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS 60
MERSEPTLVPEWLRSTGSVTGG NSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS
Sbjct: 1 MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS 60
Query: 61 AFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDSHDPLGKI 120
+FLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRD+HDPLGKI
Sbjct: 61 SFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDAHDPLGKI 120
Query: 121 LSNRIDKDALRRSHSMVSRKQGERELFHRRGATELKSHNNSNGILSGTSVGSSIQKAVFE 180
LSNRIDKDALRRSHSMVSRKQG ELFHRR TELKSHN+SNGILSGTSVGSSIQKAVFE
Sbjct: 121 LSNRIDKDALRRSHSMVSRKQG--ELFHRRVGTELKSHNSSNGILSGTSVGSSIQKAVFE 180
Query: 181 KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGNT 240
KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIG+T
Sbjct: 181 KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST 240
Query: 241 TGSSSFQQTVPATSGAGPLSMTAGLNMAEALVQAPSRARAAPQVSELSVKTQRLEELAIK 300
GSSSFQQTVPATSGAGPLS+TAGLNMAEALVQ+PSR R APQ LSVKTQRLEELAIK
Sbjct: 241 PGSSSFQQTVPATSGAGPLSVTAGLNMAEALVQSPSRTRTAPQ---LSVKTQRLEELAIK 300
Query: 301 QSRQLIPVTPSMPKAMVIGANAIVESLLLDLLLLPLLNKLGYYILGTVYFLWTFVIQVLN 360
QSRQLIPVTPSMPKAM VL+
Sbjct: 301 QSRQLIPVTPSMPKAM-----------------------------------------VLS 360
Query: 361 SSDKSKPKLASRTGELNVAIKGGQPPPLSVHANQSRGGHVKSDAQKISHGKFLVLKPVRE 420
SSDKSKPKLASRTGELN IKGGQP P SVHANQSR GHVK DAQK SHGKFLVLKPVRE
Sbjct: 361 SSDKSKPKLASRTGELNATIKGGQPQPSSVHANQSRVGHVKPDAQKSSHGKFLVLKPVRE 420
Query: 421 NGLSLAAKDVSSPTSNANSMAANSQFALAPSVPHAPLRSPNNTNVSSVERKIASLDLKSG 480
NG+SLAAKDVSSPTSNANSMAANSQFALAPSVPHAPLRSPNNTNVSSVERKIASLDLK+G
Sbjct: 421 NGVSLAAKDVSSPTSNANSMAANSQFALAPSVPHAPLRSPNNTNVSSVERKIASLDLKTG 480
Query: 481 TTLEKRPSLSQVQSRNDFFNLIKKKTSLSSSAVLSDSCSSVKSPSIGQSSELTREEIDMP 540
TTLEKRPSLSQVQSRNDFFNLIKKKTS+SSSAVLSDSCSSVKSPSIGQS+ELT EE+ +P
Sbjct: 481 TTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVLSDSCSSVKSPSIGQSNELTSEEMGIP 540
Query: 541 ASPRVIENGTVENRNGDSSEEVRASCDSGEKTESHVTAESLDEEEAAFLRSLGWDENCGE 600
ASPRVIENG VENRNG+SSEEV+ S DSGEKTESHV AESLDEEEAAFLRSLGWDE+CGE
Sbjct: 541 ASPRVIENGAVENRNGNSSEEVQISRDSGEKTESHVAAESLDEEEAAFLRSLGWDESCGE 600
Query: 601 DEGLTEEEINSFYREYMNLKPSLKMGRCIQPKIFVPSESHEDSKDGGAGSELSSSDSEA 660
DEGLTEEEINSFYREYMNLKPSLK+GRCIQPKIFVPSES EDSKD GAGSELSSSDSEA
Sbjct: 601 DEGLTEEEINSFYREYMNLKPSLKIGRCIQPKIFVPSESREDSKDDGAGSELSSSDSEA 613
BLAST of Lsi09G009760 vs. ExPASy TrEMBL
Match:
A0A0A0KN63 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G387430 PE=4 SV=1)
HSP 1 Score: 1028.1 bits (2657), Expect = 1.6e-296
Identity = 572/659 (86.80%), Postives = 591/659 (89.68%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSTGSVTGGSNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS 60
MERSEPTLVPEWLRSTGSV GG N NHHFPSSSSHSDVPSLSQSRNRISKTTGDFD+SRS
Sbjct: 1 MERSEPTLVPEWLRSTGSVAGGGNPNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDSSRS 60
Query: 61 AFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDSHDPLGKI 120
+FLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRD+HDPLGKI
Sbjct: 61 SFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDAHDPLGKI 120
Query: 121 LSNRIDKDALRRSHSMVSRKQGERELFHRRGATELKSHNNSNGILSGTSVGSSIQKAVFE 180
LSNRIDKDALRRSHSMVSRKQG ELFHRR TELKSHN+SNGILSGTSVGSSIQKAVFE
Sbjct: 121 LSNRIDKDALRRSHSMVSRKQG--ELFHRRVGTELKSHNSSNGILSGTSVGSSIQKAVFE 180
Query: 181 KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGNT 240
KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIG+T
Sbjct: 181 KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST 240
Query: 241 TGSSSFQQTVPATSGAGPLSMTAGLNMAEALVQAPSRARAAPQVSELSVKTQRLEELAIK 300
TGSSSFQQTVPATSGAGPLS+TAGLNMAEALVQAPSRARAAPQVSELSVKTQRLEELAIK
Sbjct: 241 TGSSSFQQTVPATSGAGPLSVTAGLNMAEALVQAPSRARAAPQVSELSVKTQRLEELAIK 300
Query: 301 QSRQLIPVTPSMPKAMVIGANAIVESLLLDLLLLPLLNKLGYYILGTVYFLWTFVIQVLN 360
QSRQLIPVTPSMPKAM VL+
Sbjct: 301 QSRQLIPVTPSMPKAM-----------------------------------------VLS 360
Query: 361 SSDKSKPKLASRTGELNVAIKGGQPPPLSVHANQSRGGHVKSDAQKISHGKFLVLKPVRE 420
SSDKSKPKLASRTGELN IKGGQP PL VHANQSR GHVK DAQK SHGKFLVLKPVRE
Sbjct: 361 SSDKSKPKLASRTGELNATIKGGQPQPLLVHANQSRVGHVKPDAQKSSHGKFLVLKPVRE 420
Query: 421 NGLSLAAKDVSSPTSNANSMAANSQFALAPSVPHAPLRSPNNTNVSSVERKIASLDLKSG 480
NG+SLAAKDVSSPTSNANSMAANSQFALAPSVPHAPLRSPNN NVSS+ERKIASLDLK+G
Sbjct: 421 NGVSLAAKDVSSPTSNANSMAANSQFALAPSVPHAPLRSPNNINVSSMERKIASLDLKTG 480
Query: 481 TTLEKRPSLSQVQSRNDFFNLIKKKTSLSSSAVLSDSCSSVKSPSIGQSSELTREEIDMP 540
TTLEKRPSLSQVQSRNDFF LIKKKTS++SSAVLSDSCSSVKSPSIGQS+ELT EE+
Sbjct: 481 TTLEKRPSLSQVQSRNDFFKLIKKKTSMNSSAVLSDSCSSVKSPSIGQSNELTSEEMG-T 540
Query: 541 ASPRVIENGTVENRNGDSSEEVRASCDSGEKTESHVTAESLDEEEAAFLRSLGWDENCGE 600
ASPRVIENG VENRNG+SSEEV+ S DSGEKTESHV AESLDEEEAAFLRSLGWDE+CGE
Sbjct: 541 ASPRVIENGAVENRNGNSSEEVQVSRDSGEKTESHVAAESLDEEEAAFLRSLGWDESCGE 600
Query: 601 DEGLTEEEINSFYREYMNLKPSLKMGRCIQPKIFVPSESHEDSKDGGAGSELSSSDSEA 660
DEGLTEEEINSFYREY+NLKPSLK+GRCIQPKIFVPSES DSKD GAGSELSSSDSEA
Sbjct: 601 DEGLTEEEINSFYREYVNLKPSLKIGRCIQPKIFVPSESRVDSKDDGAGSELSSSDSEA 615
BLAST of Lsi09G009760 vs. ExPASy TrEMBL
Match:
A0A6J1FM76 (uncharacterized protein LOC111445235 OS=Cucurbita moschata OX=3662 GN=LOC111445235 PE=4 SV=1)
HSP 1 Score: 990.7 bits (2560), Expect = 2.9e-285
Identity = 554/660 (83.94%), Postives = 576/660 (87.27%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSTGSVTGGSNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS 60
MERSEPTLVPEWLR+TGSVTGG NSNHHF S S HSDVPS SQ RNR SKTTGDFDTSR
Sbjct: 1 MERSEPTLVPEWLRNTGSVTGGGNSNHHFQSPSPHSDVPSQSQPRNRTSKTTGDFDTSRP 60
Query: 61 AFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDSHDPLGKI 120
AFLDRT+SSNSRRSSSNGS+KHAYSSFNRGHRDKDREKEKDR NFGDNWDRDSHDPLGK+
Sbjct: 61 AFLDRTASSNSRRSSSNGSAKHAYSSFNRGHRDKDREKEKDRFNFGDNWDRDSHDPLGKL 120
Query: 121 LSNRIDKDALRRSHSMVSRKQGERELFHRRGATELKSH-NNSNGILSGTSVGSSIQKAVF 180
L NR+DKDALRRSHSMVSRKQ ELFHRR AT+LK+ N+SNG+ G SVGSSIQKAVF
Sbjct: 121 LPNRVDKDALRRSHSMVSRKQD--ELFHRRVATDLKAGVNSSNGMPPGISVGSSIQKAVF 180
Query: 181 EKDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGN 240
EKDFPSLGSEEKQGASEIGRVSSPGLSS VQSLPIGNSALIVGGEGWTSALAEVPSMIG+
Sbjct: 181 EKDFPSLGSEEKQGASEIGRVSSPGLSSSVQSLPIGNSALIVGGEGWTSALAEVPSMIGS 240
Query: 241 TTGSSSFQQTVPATSGAGPLSMTAGLNMAEALVQAPSRARAAPQVSELSVKTQRLEELAI 300
TTGSSSFQQTVPATSGAGPLS+TAGLNMAEALVQAPSRARAAPQ SELSVKTQRLEELAI
Sbjct: 241 TTGSSSFQQTVPATSGAGPLSVTAGLNMAEALVQAPSRARAAPQASELSVKTQRLEELAI 300
Query: 301 KQSRQLIPVTPSMPKAMVIGANAIVESLLLDLLLLPLLNKLGYYILGTVYFLWTFVIQVL 360
KQSRQLIPVTPSMPKA L
Sbjct: 301 KQSRQLIPVTPSMPKA-----------------------------------------SAL 360
Query: 361 NSSDKSKPKLASRTGELNVAIKGGQPPPLSVHANQSRGGHVKSDAQKISHGKFLVLKPVR 420
NSSDKSKPKLASRTGELNV +KGGQP P SVHANQSRGGHVKSDAQK SHGKFLVLKP R
Sbjct: 361 NSSDKSKPKLASRTGELNVTVKGGQPLPSSVHANQSRGGHVKSDAQKSSHGKFLVLKPAR 420
Query: 421 ENGLSLAAKDVSSPTSNANSMAANSQFALAPSVPHAPLRSPNNTNVSSVERKIASLDLKS 480
ENG+S AKDVSSPTSN ANSQFALAPSVPHAPLRSPNN+NV+SVERK+ASLDLKS
Sbjct: 421 ENGVSPTAKDVSSPTSN-----ANSQFALAPSVPHAPLRSPNNSNVASVERKMASLDLKS 480
Query: 481 GTTLEKRPSLSQVQSRNDFFNLIKKKTSLSSSAVLSDSCSSVKSPSIGQSSELTREEIDM 540
GTTLEKRPSLSQVQSRNDFFNLIKKKTS+SSSAVLSDSCSSVKSPSIGQS+ELTREEID
Sbjct: 481 GTTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVLSDSCSSVKSPSIGQSNELTREEIDT 540
Query: 541 PASPRVIENGTVENRNGDSSEEVRASCDSGEKTESHVTAESLDEEEAAFLRSLGWDENCG 600
PASP V+ENG VEN NGDSSEEVR+SCDSGEKTE+HV AESLDEEEAAFLRSLGWDENCG
Sbjct: 541 PASPHVLENGVVENTNGDSSEEVRSSCDSGEKTETHVAAESLDEEEAAFLRSLGWDENCG 600
Query: 601 EDEGLTEEEINSFYREYMNLKPSLKMGRCIQPKIFVPSESHEDSKDGGAGSELSSSDSEA 660
EDEGLTEEEINSFYREYM+LKPSLKMGR IQPKI VPSESHEDSKD GAGSELSSSDSEA
Sbjct: 601 EDEGLTEEEINSFYREYMSLKPSLKMGRSIQPKISVPSESHEDSKD-GAGSELSSSDSEA 611
BLAST of Lsi09G009760 vs. NCBI nr
Match:
XP_038907227.1 (mediator of RNA polymerase II transcription subunit 1 isoform X1 [Benincasa hispida])
HSP 1 Score: 1049.3 bits (2712), Expect = 1.4e-302
Identity = 583/659 (88.47%), Postives = 595/659 (90.29%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSTGSVTGGSNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS 60
MERSEPTLVPEWLRSTGSVTGGSNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS
Sbjct: 1 MERSEPTLVPEWLRSTGSVTGGSNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS 60
Query: 61 AFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDSHDPLGKI 120
AFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGD+WDRDS DPLGKI
Sbjct: 61 AFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDSWDRDSPDPLGKI 120
Query: 121 LSNRIDKDALRRSHSMVSRKQGERELFHRRGATELKSHNNSNGILSGTSVGSSIQKAVFE 180
LSN+IDKDALRRSHSMVSRK GERELFHRR ATELK+HNNSNGILSGTSV SSIQKAVFE
Sbjct: 121 LSNKIDKDALRRSHSMVSRKLGERELFHRRAATELKNHNNSNGILSGTSVSSSIQKAVFE 180
Query: 181 KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGNT 240
KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIG+T
Sbjct: 181 KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST 240
Query: 241 TGSSSFQQTVPATSGAGPLSMTAGLNMAEALVQAPSRARAAPQVSELSVKTQRLEELAIK 300
TGSSSFQQTVPATSGAGPLS+TAGLNMAEALVQAPSRARA PQVSELSVKTQRLEELAIK
Sbjct: 241 TGSSSFQQTVPATSGAGPLSVTAGLNMAEALVQAPSRARATPQVSELSVKTQRLEELAIK 300
Query: 301 QSRQLIPVTPSMPKAMVIGANAIVESLLLDLLLLPLLNKLGYYILGTVYFLWTFVIQVLN 360
QSRQLIPVTPSM KAM VLN
Sbjct: 301 QSRQLIPVTPSMTKAM-----------------------------------------VLN 360
Query: 361 SSDKSKPKLASRTGELNVAIKGGQPPPLSVHANQSRGGHVKSDAQKISHGKFLVLKPVRE 420
SDKSKPKLASRTGELNV IKGGQ PLSVHANQSRGG VKSDAQK +HGKFLVLKPVRE
Sbjct: 361 PSDKSKPKLASRTGELNVTIKGGQQQPLSVHANQSRGGLVKSDAQKSAHGKFLVLKPVRE 420
Query: 421 NGLSLAAKDVSSPTSNANSMAANSQFALAPSVPHAPLRSPNNTNVSSVERKIASLDLKSG 480
NG+SLAAKDVSSPTSNANSMA NSQFALA +V HAPLRSPNNTNVSSVERKIASLDLKSG
Sbjct: 421 NGISLAAKDVSSPTSNANSMAVNSQFALASAVAHAPLRSPNNTNVSSVERKIASLDLKSG 480
Query: 481 TTLEKRPSLSQVQSRNDFFNLIKKKTSLSSSAVLSDSCSSVKSPSIGQSSELTREEIDMP 540
TTLEKRPSLSQVQSRNDFFNLIKKKTS+ SSAVLSDSCSSVKSPSI S+ELTREE DMP
Sbjct: 481 TTLEKRPSLSQVQSRNDFFNLIKKKTSM-SSAVLSDSCSSVKSPSISHSNELTREETDMP 540
Query: 541 ASPRVIENGTVENRNGDSSEEVRASCDSGEKTESHVTAESLDEEEAAFLRSLGWDENCGE 600
ASPRVIENG VENRNGD SEEVRASCD+GEKTESHV AESLDEEEAAFLRSLGWDENCGE
Sbjct: 541 ASPRVIENGAVENRNGDGSEEVRASCDTGEKTESHVAAESLDEEEAAFLRSLGWDENCGE 600
Query: 601 DEGLTEEEINSFYREYMNLKPSLKMGRCIQPKIFVPSESHEDSKDGGAGSELSSSDSEA 660
DEGLTEEEINSFYREYMNLKPSLKMGRCI+PKIFVPSESHEDSKD GAGSELSSSDSEA
Sbjct: 601 DEGLTEEEINSFYREYMNLKPSLKMGRCIKPKIFVPSESHEDSKD-GAGSELSSSDSEA 616
BLAST of Lsi09G009760 vs. NCBI nr
Match:
XP_008460469.1 (PREDICTED: mediator of RNA polymerase II transcription subunit 1 isoform X1 [Cucumis melo] >KAA0067384.1 mediator of RNA polymerase II transcription subunit 1 isoform X1 [Cucumis melo var. makuwa] >TYK26525.1 mediator of RNA polymerase II transcription subunit 1 isoform X1 [Cucumis melo var. makuwa])
HSP 1 Score: 1045.8 bits (2703), Expect = 1.6e-301
Identity = 578/659 (87.71%), Postives = 595/659 (90.29%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSTGSVTGGSNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS 60
MERSEPTLVPEWLRSTGSVTGG NSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS
Sbjct: 1 MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS 60
Query: 61 AFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDSHDPLGKI 120
+FLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRD+HDPLGKI
Sbjct: 61 SFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDAHDPLGKI 120
Query: 121 LSNRIDKDALRRSHSMVSRKQGERELFHRRGATELKSHNNSNGILSGTSVGSSIQKAVFE 180
LSNRIDKDALRRSHSMVSRKQG ELFHRR TELKSHN+SNGILSGTSVGSSIQKAVFE
Sbjct: 121 LSNRIDKDALRRSHSMVSRKQG--ELFHRRVGTELKSHNSSNGILSGTSVGSSIQKAVFE 180
Query: 181 KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGNT 240
KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIG+T
Sbjct: 181 KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST 240
Query: 241 TGSSSFQQTVPATSGAGPLSMTAGLNMAEALVQAPSRARAAPQVSELSVKTQRLEELAIK 300
GSSSFQQTVPATSGAGPLS+TAGLNMAEALVQ+PSR R APQVSELSVKTQRLEELAIK
Sbjct: 241 PGSSSFQQTVPATSGAGPLSVTAGLNMAEALVQSPSRTRTAPQVSELSVKTQRLEELAIK 300
Query: 301 QSRQLIPVTPSMPKAMVIGANAIVESLLLDLLLLPLLNKLGYYILGTVYFLWTFVIQVLN 360
QSRQLIPVTPSMPKAM VL+
Sbjct: 301 QSRQLIPVTPSMPKAM-----------------------------------------VLS 360
Query: 361 SSDKSKPKLASRTGELNVAIKGGQPPPLSVHANQSRGGHVKSDAQKISHGKFLVLKPVRE 420
SSDKSKPKLASRTGELN IKGGQP P SVHANQSR GHVK DAQK SHGKFLVLKPVRE
Sbjct: 361 SSDKSKPKLASRTGELNATIKGGQPQPSSVHANQSRVGHVKPDAQKSSHGKFLVLKPVRE 420
Query: 421 NGLSLAAKDVSSPTSNANSMAANSQFALAPSVPHAPLRSPNNTNVSSVERKIASLDLKSG 480
NG+SLAAKDVSSPTSNANSMAANSQFALAPSVPHAPLRSPNNTNVSSVERKIASLDLK+G
Sbjct: 421 NGVSLAAKDVSSPTSNANSMAANSQFALAPSVPHAPLRSPNNTNVSSVERKIASLDLKTG 480
Query: 481 TTLEKRPSLSQVQSRNDFFNLIKKKTSLSSSAVLSDSCSSVKSPSIGQSSELTREEIDMP 540
TTLEKRPSLSQVQSRNDFFNLIKKKTS+SSSAVLSDSCSSVKSPSIGQS+ELT EE+ +P
Sbjct: 481 TTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVLSDSCSSVKSPSIGQSNELTSEEMGIP 540
Query: 541 ASPRVIENGTVENRNGDSSEEVRASCDSGEKTESHVTAESLDEEEAAFLRSLGWDENCGE 600
ASPRVIENG VENRNG+SSEEV+ S DSGEKTESHV AESLDEEEAAFLRSLGWDE+CGE
Sbjct: 541 ASPRVIENGAVENRNGNSSEEVQISRDSGEKTESHVAAESLDEEEAAFLRSLGWDESCGE 600
Query: 601 DEGLTEEEINSFYREYMNLKPSLKMGRCIQPKIFVPSESHEDSKDGGAGSELSSSDSEA 660
DEGLTEEEINSFYREYMNLKPSLK+GRCIQPKIFVPSES EDSKD GAGSELSSSDSEA
Sbjct: 601 DEGLTEEEINSFYREYMNLKPSLKIGRCIQPKIFVPSESREDSKDDGAGSELSSSDSEA 616
BLAST of Lsi09G009760 vs. NCBI nr
Match:
XP_038907228.1 (mediator of RNA polymerase II transcription subunit 1 isoform X2 [Benincasa hispida])
HSP 1 Score: 1038.9 bits (2685), Expect = 1.9e-299
Identity = 580/659 (88.01%), Postives = 592/659 (89.83%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSTGSVTGGSNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS 60
MERSEPTLVPEWLRSTGSVTGGSNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS
Sbjct: 1 MERSEPTLVPEWLRSTGSVTGGSNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS 60
Query: 61 AFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDSHDPLGKI 120
AFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGD+WDRDS DPLGKI
Sbjct: 61 AFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDSWDRDSPDPLGKI 120
Query: 121 LSNRIDKDALRRSHSMVSRKQGERELFHRRGATELKSHNNSNGILSGTSVGSSIQKAVFE 180
LSN+IDKDALRRSHSMVSRK GERELFHRR ATELK+HNNSNGILSGTSV SSIQKAVFE
Sbjct: 121 LSNKIDKDALRRSHSMVSRKLGERELFHRRAATELKNHNNSNGILSGTSVSSSIQKAVFE 180
Query: 181 KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGNT 240
KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIG+T
Sbjct: 181 KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST 240
Query: 241 TGSSSFQQTVPATSGAGPLSMTAGLNMAEALVQAPSRARAAPQVSELSVKTQRLEELAIK 300
TGSSSFQQTVPATSGAGPLS+TAGLNMAEALVQAPSRARA PQ LSVKTQRLEELAIK
Sbjct: 241 TGSSSFQQTVPATSGAGPLSVTAGLNMAEALVQAPSRARATPQ---LSVKTQRLEELAIK 300
Query: 301 QSRQLIPVTPSMPKAMVIGANAIVESLLLDLLLLPLLNKLGYYILGTVYFLWTFVIQVLN 360
QSRQLIPVTPSM KAM VLN
Sbjct: 301 QSRQLIPVTPSMTKAM-----------------------------------------VLN 360
Query: 361 SSDKSKPKLASRTGELNVAIKGGQPPPLSVHANQSRGGHVKSDAQKISHGKFLVLKPVRE 420
SDKSKPKLASRTGELNV IKGGQ PLSVHANQSRGG VKSDAQK +HGKFLVLKPVRE
Sbjct: 361 PSDKSKPKLASRTGELNVTIKGGQQQPLSVHANQSRGGLVKSDAQKSAHGKFLVLKPVRE 420
Query: 421 NGLSLAAKDVSSPTSNANSMAANSQFALAPSVPHAPLRSPNNTNVSSVERKIASLDLKSG 480
NG+SLAAKDVSSPTSNANSMA NSQFALA +V HAPLRSPNNTNVSSVERKIASLDLKSG
Sbjct: 421 NGISLAAKDVSSPTSNANSMAVNSQFALASAVAHAPLRSPNNTNVSSVERKIASLDLKSG 480
Query: 481 TTLEKRPSLSQVQSRNDFFNLIKKKTSLSSSAVLSDSCSSVKSPSIGQSSELTREEIDMP 540
TTLEKRPSLSQVQSRNDFFNLIKKKTS+ SSAVLSDSCSSVKSPSI S+ELTREE DMP
Sbjct: 481 TTLEKRPSLSQVQSRNDFFNLIKKKTSM-SSAVLSDSCSSVKSPSISHSNELTREETDMP 540
Query: 541 ASPRVIENGTVENRNGDSSEEVRASCDSGEKTESHVTAESLDEEEAAFLRSLGWDENCGE 600
ASPRVIENG VENRNGD SEEVRASCD+GEKTESHV AESLDEEEAAFLRSLGWDENCGE
Sbjct: 541 ASPRVIENGAVENRNGDGSEEVRASCDTGEKTESHVAAESLDEEEAAFLRSLGWDENCGE 600
Query: 601 DEGLTEEEINSFYREYMNLKPSLKMGRCIQPKIFVPSESHEDSKDGGAGSELSSSDSEA 660
DEGLTEEEINSFYREYMNLKPSLKMGRCI+PKIFVPSESHEDSKD GAGSELSSSDSEA
Sbjct: 601 DEGLTEEEINSFYREYMNLKPSLKMGRCIKPKIFVPSESHEDSKD-GAGSELSSSDSEA 613
BLAST of Lsi09G009760 vs. NCBI nr
Match:
XP_008460470.1 (PREDICTED: mediator of RNA polymerase II transcription subunit 1 isoform X2 [Cucumis melo])
HSP 1 Score: 1035.8 bits (2677), Expect = 1.6e-298
Identity = 575/659 (87.25%), Postives = 592/659 (89.83%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSTGSVTGGSNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS 60
MERSEPTLVPEWLRSTGSVTGG NSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS
Sbjct: 1 MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS 60
Query: 61 AFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDSHDPLGKI 120
+FLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRD+HDPLGKI
Sbjct: 61 SFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDAHDPLGKI 120
Query: 121 LSNRIDKDALRRSHSMVSRKQGERELFHRRGATELKSHNNSNGILSGTSVGSSIQKAVFE 180
LSNRIDKDALRRSHSMVSRKQG ELFHRR TELKSHN+SNGILSGTSVGSSIQKAVFE
Sbjct: 121 LSNRIDKDALRRSHSMVSRKQG--ELFHRRVGTELKSHNSSNGILSGTSVGSSIQKAVFE 180
Query: 181 KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGNT 240
KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIG+T
Sbjct: 181 KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST 240
Query: 241 TGSSSFQQTVPATSGAGPLSMTAGLNMAEALVQAPSRARAAPQVSELSVKTQRLEELAIK 300
GSSSFQQTVPATSGAGPLS+TAGLNMAEALVQ+PSR R APQ LSVKTQRLEELAIK
Sbjct: 241 PGSSSFQQTVPATSGAGPLSVTAGLNMAEALVQSPSRTRTAPQ---LSVKTQRLEELAIK 300
Query: 301 QSRQLIPVTPSMPKAMVIGANAIVESLLLDLLLLPLLNKLGYYILGTVYFLWTFVIQVLN 360
QSRQLIPVTPSMPKAM VL+
Sbjct: 301 QSRQLIPVTPSMPKAM-----------------------------------------VLS 360
Query: 361 SSDKSKPKLASRTGELNVAIKGGQPPPLSVHANQSRGGHVKSDAQKISHGKFLVLKPVRE 420
SSDKSKPKLASRTGELN IKGGQP P SVHANQSR GHVK DAQK SHGKFLVLKPVRE
Sbjct: 361 SSDKSKPKLASRTGELNATIKGGQPQPSSVHANQSRVGHVKPDAQKSSHGKFLVLKPVRE 420
Query: 421 NGLSLAAKDVSSPTSNANSMAANSQFALAPSVPHAPLRSPNNTNVSSVERKIASLDLKSG 480
NG+SLAAKDVSSPTSNANSMAANSQFALAPSVPHAPLRSPNNTNVSSVERKIASLDLK+G
Sbjct: 421 NGVSLAAKDVSSPTSNANSMAANSQFALAPSVPHAPLRSPNNTNVSSVERKIASLDLKTG 480
Query: 481 TTLEKRPSLSQVQSRNDFFNLIKKKTSLSSSAVLSDSCSSVKSPSIGQSSELTREEIDMP 540
TTLEKRPSLSQVQSRNDFFNLIKKKTS+SSSAVLSDSCSSVKSPSIGQS+ELT EE+ +P
Sbjct: 481 TTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVLSDSCSSVKSPSIGQSNELTSEEMGIP 540
Query: 541 ASPRVIENGTVENRNGDSSEEVRASCDSGEKTESHVTAESLDEEEAAFLRSLGWDENCGE 600
ASPRVIENG VENRNG+SSEEV+ S DSGEKTESHV AESLDEEEAAFLRSLGWDE+CGE
Sbjct: 541 ASPRVIENGAVENRNGNSSEEVQISRDSGEKTESHVAAESLDEEEAAFLRSLGWDESCGE 600
Query: 601 DEGLTEEEINSFYREYMNLKPSLKMGRCIQPKIFVPSESHEDSKDGGAGSELSSSDSEA 660
DEGLTEEEINSFYREYMNLKPSLK+GRCIQPKIFVPSES EDSKD GAGSELSSSDSEA
Sbjct: 601 DEGLTEEEINSFYREYMNLKPSLKIGRCIQPKIFVPSESREDSKDDGAGSELSSSDSEA 613
BLAST of Lsi09G009760 vs. NCBI nr
Match:
XP_004140377.1 (mediator of RNA polymerase II transcription subunit 1 isoform X1 [Cucumis sativus])
HSP 1 Score: 1028.1 bits (2657), Expect = 3.4e-296
Identity = 572/659 (86.80%), Postives = 591/659 (89.68%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSTGSVTGGSNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS 60
MERSEPTLVPEWLRSTGSV GG N NHHFPSSSSHSDVPSLSQSRNRISKTTGDFD+SRS
Sbjct: 1 MERSEPTLVPEWLRSTGSVAGGGNPNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDSSRS 60
Query: 61 AFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDSHDPLGKI 120
+FLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRD+HDPLGKI
Sbjct: 61 SFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDAHDPLGKI 120
Query: 121 LSNRIDKDALRRSHSMVSRKQGERELFHRRGATELKSHNNSNGILSGTSVGSSIQKAVFE 180
LSNRIDKDALRRSHSMVSRKQG ELFHRR TELKSHN+SNGILSGTSVGSSIQKAVFE
Sbjct: 121 LSNRIDKDALRRSHSMVSRKQG--ELFHRRVGTELKSHNSSNGILSGTSVGSSIQKAVFE 180
Query: 181 KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGNT 240
KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIG+T
Sbjct: 181 KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST 240
Query: 241 TGSSSFQQTVPATSGAGPLSMTAGLNMAEALVQAPSRARAAPQVSELSVKTQRLEELAIK 300
TGSSSFQQTVPATSGAGPLS+TAGLNMAEALVQAPSRARAAPQVSELSVKTQRLEELAIK
Sbjct: 241 TGSSSFQQTVPATSGAGPLSVTAGLNMAEALVQAPSRARAAPQVSELSVKTQRLEELAIK 300
Query: 301 QSRQLIPVTPSMPKAMVIGANAIVESLLLDLLLLPLLNKLGYYILGTVYFLWTFVIQVLN 360
QSRQLIPVTPSMPKAM VL+
Sbjct: 301 QSRQLIPVTPSMPKAM-----------------------------------------VLS 360
Query: 361 SSDKSKPKLASRTGELNVAIKGGQPPPLSVHANQSRGGHVKSDAQKISHGKFLVLKPVRE 420
SSDKSKPKLASRTGELN IKGGQP PL VHANQSR GHVK DAQK SHGKFLVLKPVRE
Sbjct: 361 SSDKSKPKLASRTGELNATIKGGQPQPLLVHANQSRVGHVKPDAQKSSHGKFLVLKPVRE 420
Query: 421 NGLSLAAKDVSSPTSNANSMAANSQFALAPSVPHAPLRSPNNTNVSSVERKIASLDLKSG 480
NG+SLAAKDVSSPTSNANSMAANSQFALAPSVPHAPLRSPNN NVSS+ERKIASLDLK+G
Sbjct: 421 NGVSLAAKDVSSPTSNANSMAANSQFALAPSVPHAPLRSPNNINVSSMERKIASLDLKTG 480
Query: 481 TTLEKRPSLSQVQSRNDFFNLIKKKTSLSSSAVLSDSCSSVKSPSIGQSSELTREEIDMP 540
TTLEKRPSLSQVQSRNDFF LIKKKTS++SSAVLSDSCSSVKSPSIGQS+ELT EE+
Sbjct: 481 TTLEKRPSLSQVQSRNDFFKLIKKKTSMNSSAVLSDSCSSVKSPSIGQSNELTSEEMG-T 540
Query: 541 ASPRVIENGTVENRNGDSSEEVRASCDSGEKTESHVTAESLDEEEAAFLRSLGWDENCGE 600
ASPRVIENG VENRNG+SSEEV+ S DSGEKTESHV AESLDEEEAAFLRSLGWDE+CGE
Sbjct: 541 ASPRVIENGAVENRNGNSSEEVQVSRDSGEKTESHVAAESLDEEEAAFLRSLGWDESCGE 600
Query: 601 DEGLTEEEINSFYREYMNLKPSLKMGRCIQPKIFVPSESHEDSKDGGAGSELSSSDSEA 660
DEGLTEEEINSFYREY+NLKPSLK+GRCIQPKIFVPSES DSKD GAGSELSSSDSEA
Sbjct: 601 DEGLTEEEINSFYREYVNLKPSLKIGRCIQPKIFVPSESRVDSKDDGAGSELSSSDSEA 615
BLAST of Lsi09G009760 vs. TAIR 10
Match:
AT1G36990.1 (unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 14 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G08510.1); Has 5029 Blast hits to 1779 proteins in 339 species: Archae - 2; Bacteria - 1372; Metazoa - 990; Fungi - 933; Plants - 111; Viruses - 28; Other Eukaryotes - 1593 (source: NCBI BLink). )
HSP 1 Score: 392.1 bits (1006), Expect = 8.7e-109
Identity = 296/632 (46.84%), Postives = 370/632 (58.54%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSTGSVTGGSNSNHHFPSSSSHSDVPSLS-QSRNRISKTTGDFDTSR 60
M++ E +L PEWLRS+G +GG +SNH SSSSHSD SL SRNR S++ D D+
Sbjct: 1 MDKGEHSLAPEWLRSSGHASGGGSSNHLLVSSSSHSDSASLQYNSRNRNSRSKSDVDSIH 60
Query: 61 SAFLDRTSSSNSRRSSSNGSSKHAYSS--FNRGHRDKDREKEKDRLNFGDNWDRDSHDPL 120
S FLDR+SS+NSRR SSNGS+KHAYSS FNR RDKDR ++KDR+++ D WD D+ PL
Sbjct: 61 SPFLDRSSSTNSRRGSSNGSAKHAYSSFNFNRSQRDKDRSRDKDRVSYVDPWDLDTSIPL 120
Query: 121 GKILSNRIDKDALRRSHSMVSRKQGE---RELFHRRGATELKSHNNSNGILSGTSVGSSI 180
IL+ R D D LRRSHSMV+RKQGE R L + N NG+LSG S+G+S
Sbjct: 121 RTILTGR-DPDPLRRSHSMVTRKQGEHLSRGLTVGLNNGGSSNSYNGNGLLSGPSIGNSF 180
Query: 181 QKAVFEKDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVP 240
Q+ F+KDFPSLG+EEKQ ++ RVSSPG+SS VQ+LP+GNSALI GGEGWTSALAEVP
Sbjct: 181 QRTGFDKDFPSLGAEEKQNGQDVVRVSSPGISSVVQNLPVGNSALI-GGEGWTSALAEVP 240
Query: 241 SMIGNTTGSSSFQQTVPATSGAGPLSMTAGLNMAEALVQAPSRARAAPQVSELSVKTQRL 300
++I S A S AG L+ +GLNMAEALVQAP+R PQ SVKTQRL
Sbjct: 241 NVIEKACTGSLTSPKANAVS-AGTLTGPSGLNMAEALVQAPARTHTPPQG---SVKTQRL 300
Query: 301 EELAIKQSRQLIPVTPSMPKAMVIGANAIVESLLLDLLLLPLLNKLGYYILGTVYFLWTF 360
E+LAIKQSRQLIPV PS PK +
Sbjct: 301 EDLAIKQSRQLIPVVPSAPKGL-------------------------------------- 360
Query: 361 VIQVLNSSDKSKPKLASRTGELNVAIKGG---QPPPLSVHANQSRGGHVKSDAQKISHGK 420
LNSSDKSK K RTGE +A QP L + G +K + K
Sbjct: 361 ---SLNSSDKSKTKQVVRTGETCLAPSRNALQQPAVLLGSFQSNPSGQIKPEK------K 420
Query: 421 FLVLKPVRENGLSLAAKDVSSPTSNANSMAANSQ-FALAPSVPHAPLRSPNNTNVSSVER 480
LVLKP RENG+S A K+ SP++N N+ AA+SQ + S AP+RS N S E
Sbjct: 421 LLVLKPARENGVS-AVKESGSPSANTNTRAASSQLMSNTQSTQSAPVRSTN----SPKEL 480
Query: 481 KIAS-LDLKSGTTLEKRPSLSQVQSRNDFFNLIKKKTSLSSSAVLSDSCSSVKSPSIGQS 540
K AS + SG T+EK+PS +Q QSR+ F++ +K+K + S+S + +D SS S S
Sbjct: 481 KGASAFSMISGQTIEKKPSAAQAQSRSAFYSALKQKQTASTS-ITTDPVSSSTSASSSVE 540
Query: 541 SELTREEIDMPASPRVIENGTVENRNGDSSEEVRASCDSGEKTESHVTAESLDEEEAAFL 600
+L + + + P + S EV S T ++ DEEEA FL
Sbjct: 541 VKLNSSKDLIASDP--------SSSQATSGVEVTDSVQVASHTSGFEATDTPDEEEAQFL 564
Query: 601 RSLGWDENCGEDEGLTEEEINSFYREYMNLKP 622
RSLGW EN GE E LTEEEI+SF +Y L+P
Sbjct: 601 RSLGWVENNGE-EYLTEEEIDSFLEQYKELRP 564
BLAST of Lsi09G009760 vs. TAIR 10
Match:
AT4G08510.1 (unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G36990.1); Has 888 Blast hits to 321 proteins in 121 species: Archae - 0; Bacteria - 120; Metazoa - 86; Fungi - 24; Plants - 79; Viruses - 0; Other Eukaryotes - 579 (source: NCBI BLink). )
HSP 1 Score: 289.7 bits (740), Expect = 6.1e-78
Identity = 248/634 (39.12%), Postives = 336/634 (53.00%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSTGSVTGGSNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS 60
ME+ EP+LVPEWLRS+G +G +SN S SD SL S+NR +++ D D+ S
Sbjct: 1 MEKREPSLVPEWLRSSGHGSGVGSSN-------SLSD--SLRNSKNRNARSRSDADSVGS 60
Query: 61 AFLDRTSSSNSRRSSSNGSSKHAYSS--FNRGHRDKDREKEKDRLNFGDNWDRDSHDPLG 120
FLDR+SS+N+RR SSNGS+KHAYSS FNR +RDKDR +EKDR+++ D WD DS P G
Sbjct: 61 PFLDRSSSTNTRRGSSNGSTKHAYSSFNFNRSNRDKDRSREKDRMSYMDPWDNDSSMPFG 120
Query: 121 KILSNRIDKDALRRSHSMVSRKQGER-----ELFHRRGATELKSHNNSNGILSGTSVGSS 180
L R ++ LRRSHSM +RKQG + ++ G + N +GIL GTS S
Sbjct: 121 TFLIGR-GEEPLRRSHSMTTRKQGNHLAQGFTVGYKNGGN--INTFNGHGILPGTSPVKS 180
Query: 181 IQKAVFEKDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEV 240
++ F KDFP L EE+ G ++ R+SSPG S QSL + N ALI+ GEGWTSALAEV
Sbjct: 181 SKRMGFNKDFPLLRGEERNGGPDVVRISSPGRSPTAQSLSVANPALII-GEGWTSALAEV 240
Query: 241 PSMIGNTTGSSSFQQTVPATSGAGPLSMTAGLNMAEALVQAPSRARAAPQVSELSVKTQR 300
P++I + G+ S + + +GP A NMAEALVQAP R PQ Q
Sbjct: 241 PNVIEKSGGAESHANVGNSATLSGP----ACRNMAEALVQAPGRTGTPPQ-------AQT 300
Query: 301 LEELAIKQSRQLIPVTPSMPKAMVIGANAIVESLLLDLLLLPLLNKLGYYILGTVYFLWT 360
LE+ AI+QSRQLIPV PS PK G+V+
Sbjct: 301 LEDRAIRQSRQLIPVVPSAPK-------------------------------GSVH---- 360
Query: 361 FVIQVLNSSDKSKPKLASRTGELNVAIKGGQPPPLSV---HANQSRGGHVKSDAQKISHG 420
NSSDKSK K R+GE +A SV + + G +K D K
Sbjct: 361 ------NSSDKSKTKPMFRSGETGLASSRNTQQQSSVMLGNMQSNPGSQIKPDTTK---- 420
Query: 421 KFLVLKPVRENGLSLAAKDVSSPTSNANSMAANSQFALAPSVPH-APLRSPNNTNVSSVE 480
K ++LKP RENG V + S NS A SQ APS A +RS N +
Sbjct: 421 KLVILKPARENG-------VVAGGSPPNSRVAASQPTTAPSTQFTASVRSTNGPR----D 480
Query: 481 RKIASLDLKSGTTLEKRPSLSQVQSRNDFFNLIKKKTSLSSSAVLSDSCSSVKSPSIGQS 540
+ AS+++ +G EK+ SL+Q QSR+ F++ +K+KT + S S + S + S Q+
Sbjct: 481 LRGASVNMLAGKAAEKKLSLAQTQSRHAFYSALKQKTCTNISTDPSKTSSCILSSVEEQA 540
Query: 541 SELTREEIDMPASPRVIENGTVENRNGDSSEEVRASCDSGEKTESHVTAESLDEEEAAFL 600
+ P+SP+ E + E V + E+ +A D +EAAFL
Sbjct: 541 NSSKELVASDPSSPQAAERDEI-------MESVEKVSNVAERISRFESAVRPDPKEAAFL 544
Query: 601 RSLGWDENCGEDEGLTEEEINSFYREYMNLKPSL 624
+SLGWDEN ++ T EE+ + +++ KPSL
Sbjct: 601 KSLGWDENDSDEYTHTMEEMREWCKKF---KPSL 544
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A5D3DT29 | 7.5e-302 | 87.71 | Mediator of RNA polymerase II transcription subunit 1 isoform X1 OS=Cucumis melo... | [more] |
A0A1S3CDT9 | 7.5e-302 | 87.71 | mediator of RNA polymerase II transcription subunit 1 isoform X1 OS=Cucumis melo... | [more] |
A0A1S3CC42 | 7.8e-299 | 87.25 | mediator of RNA polymerase II transcription subunit 1 isoform X2 OS=Cucumis melo... | [more] |
A0A0A0KN63 | 1.6e-296 | 86.80 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G387430 PE=4 SV=1 | [more] |
A0A6J1FM76 | 2.9e-285 | 83.94 | uncharacterized protein LOC111445235 OS=Cucurbita moschata OX=3662 GN=LOC1114452... | [more] |
Match Name | E-value | Identity | Description | |
XP_038907227.1 | 1.4e-302 | 88.47 | mediator of RNA polymerase II transcription subunit 1 isoform X1 [Benincasa hisp... | [more] |
XP_008460469.1 | 1.6e-301 | 87.71 | PREDICTED: mediator of RNA polymerase II transcription subunit 1 isoform X1 [Cuc... | [more] |
XP_038907228.1 | 1.9e-299 | 88.01 | mediator of RNA polymerase II transcription subunit 1 isoform X2 [Benincasa hisp... | [more] |
XP_008460470.1 | 1.6e-298 | 87.25 | PREDICTED: mediator of RNA polymerase II transcription subunit 1 isoform X2 [Cuc... | [more] |
XP_004140377.1 | 3.4e-296 | 86.80 | mediator of RNA polymerase II transcription subunit 1 isoform X1 [Cucumis sativu... | [more] |
Match Name | E-value | Identity | Description | |
AT1G36990.1 | 8.7e-109 | 46.84 | unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 24 plant structures; EXP... | [more] |
AT4G08510.1 | 6.1e-78 | 39.12 | unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein matc... | [more] |