Cp4.1LG17g02870 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG17g02870
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: RNA polymerase sigma factor
Location: Cp4.1LG17: 1896860 .. 1901113 (-)
RNA-Seq Expression: Cp4.1LG17g02870
Synteny: Cp4.1LG17g02870
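
The Location field packs the linkage group, the start and end coordinates, and the strand into a single string. A minimal sketch of how it could be parsed is shown here; `parse_location` is a hypothetical helper, and the length calculation assumes 1-based inclusive coordinates (the usual GFF convention, which this page does not state explicitly).

```python
import re

def parse_location(text):
    """Split a location string such as 'Cp4.1LG17: 1896860 .. 1901113 (-)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, strand = m.group(1), m.group(4)
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
    return {"seqid": seqid, "start": start, "end": end,
            "strand": strand, "length": end - start + 1}

loc = parse_location("Cp4.1LG17: 1896860 .. 1901113 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 4254 bp on the minus strand of Cp4.1LG17.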
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GAAACAAATAGTGAGAAAATTCCATTTAATTCTTTTTCTTTTCCAATGATTATGTAAAATTCCTTCTTTTTTTTTTTTAATTTTAATTTATTCTCTCTCTCTCTCTCTCTCTCTCTTTCTCTCTCATATATGAGTTGGGGGAGGCAAAATGGGAACCTCTTATCCATTGTTTTATCAACAAAGCCTCCAAAAGCCAATTTACCCACACAAGTTTTCCTCTCTCAGCCTTCTTCTTCTTCTTCTTCTTCTTCACCAATCACCTCTTTCATTCCCTTCATCAATCAATGAAACCCTTTTCATATGCTTCACATTTCCCCTTCTGGGTCGCCGCCCCAGTTCAATTTTGAATGATTCCTTCATTTATGTCCGCCATAGTTGCAGATTAGCGGAACCCAGTTGGAGAACGAAGAACAAGAGCAAGAACAACAAGGCGATGTCGTGTTTACTTCCCCAATTCAAGTGCCATCCCGAAACATTCTCAATCCAATTTAAGACCCCCGCCATCACAATTACGCCTACGCCTACGCCTACTTCTTCTCACCATCATACCTTTCTTCCCACTGCTAATTCTACTTACTGTGAGTGTTTTACTCTCTAGTTCTCTCTTTTCCCATTTTAAATGGTTTTTGATGTTTTGGGTTGCTTATTTGCCTAGCTGGGCCAAAATGGCTTGTGTTTAGTTGGATTGTTTGAAATGTTGTCCTGAGATTCTTTGTTTTAGGTGGCCATTATGCGTGCTAGGATTTACAAATGGCGCTGGTTTTGATGTTGTTATTGTATCTGATATAGAATGAGACTTGAATTTCCCTTTTCAATTACTGTTGTGCTTGTTTAAGAGGTAGAATGTTCCTCGTTCTGTTGAAATACTGCGTTGTGTTATCTGCTATCGGCCTGTCGTTTGGTAATCTCTGCCTAGCCATATGGTTCTGCTATCGATGTGTCGTGATAACTTCCATAAGCTCTCATGACTTTGTTTTTGGTTTCCTCAAAAGTCCTCGTACCAATGGAGATAGCATTCCTTACTTATAAACCCATGATCAACGCCTTAGTTAGCCAACGTGGGACTCCCTCCTAATAATCCTCAACATGATTGGCAATTATGTCTAATCGGTTTCAGAGAAAAATTCTAATAGAAATAGCGAAAGAGCCTCTAAGAAAAATAAGGATGGAAGTAGTTTTTTAAACCATTCATTCAACTAGTCCTTACTTTGACCAAATGATCTTTTGAATGTAACCATAGTTGGATAGAATGTTAGCTGGGTGTAACGCTGTCGGTTAGACGTCTAATGTAGTTGCACACAACGTATCAACAATGAGAAAAATGGGGATTCATACCGACTTTGGAACTCCAGAAGGCCGATCAAAGGCCTCCGTTCCCATCTTTAGGCTCCTTTGATGACCATTGATTGTTTTTGTTTTGAAAATTAAGTGTATAAACACTCTTTTCACTTGGGCAAAAAAGAAAGATGAAAACCGTTCTAAAGAAATTGTGAGAAAGCACACACAGAGCGGAAAATCAAATCGTTATCAAGCAGGACCTTAGTTACTTTTTCCACGTTAGTTTTGAGTTGGCATTTCTTATCTACAGGTGTAAATGATTTCCCATTGAGTAATGGTGAAGAAAAGTCCCAAGCATATGAAATTTGGTGCTCCTTTTGTAGCAGATAAGGCTATAACTGAGTAGATCATTTGGTTGAATGTTAGTGCGAATAGTGTCCATAGATTTTATTGAAAACTTTTCTTATAAAATTTTGAATCATGCAGTTAAAGTCAGAGATCCTCATGGTTTACGGACACAATGTATCTTGTCTGCGGCGTCACCACCGACATCGACTGGAACAGCTACAACGTTCAATGTGGATCGACTTAAATTGCCTCCTTTTGATACGAACACCGACTCAGTCTCTGCAGAAAGACTAAGATCATATTTAGGTGCAGTCGAATCGAGCCTAGCCTCAACGCTTCTAACGAGTGAAGAGGCTACAATTGCAGCAGCTGCA
GCTGAAGCAGTTGCTCTTGCTAAAGCAGCCGTCAAGGCTGCAAATGATGCAGCTCTTCTGGTTAATAATAGCAACTCTGCCAGATCAGTAACAAAATCTCAATCGTCTTCGAGATCCAATGCTCTACATTTCAAATGGGATCAGTTTATGGAATCCGAAAGGGCTGATATAATCGGAGAGCCGGTAGGAGTTAATAATCGTCCTGTGGAAGGTGATGCTTTGGAACCCAGCACAGCAGAATCTGATGATATGGAGCCAACATCTGAAGAACTTGAACTCTTAGAGGATGAATTCTCTAAGAGTATAGCCGTAAGATCGATTCGTCAAACAGAACGAAAAGCTAGAAGAACTAGAGCCTCAGAGAAGGCTGCTACTAGTGTAGTGTCACTGAAGTCGGGTTCTAGCAGCCGGAAGAAGCGTAATTCTGGACAAGAAGTGGACTACAATGACCCATTGCGTTATTTGAGAGCAACTACAAGCGCTTCTAAACTTCTTACTGCAAGTGATGAGCTTGAACTATCAGAAGGAATTCAGGTTTGGATCGAAAACCACATCGCACTTGAAAACGTTAACATCGATCTGCAAGTAAAAGTAAAAATAAATTCAGCTTTTCAGCACTCTACTTTACAGTCACTACTAAATCTGGATTAGAAACATTATGTCAAATAACTTTTGATCTGATCCTGCTTTTAAACAGAATGTAGCTGTTGTTTCTTTCCTCAGATTTGATTCATTTATCAACTGCAATTGCTCTTTAGATAGTATTTTGGATTAATATCTGGATTTGTTGTTATTTTCCTTGTGTTGTGTTCGATTACCGACACTTGTTCTGACTATTTCTAGGACTTGCTGAAACTGGAAAGACTCAAGGAGGAGCTTGCAGGACGTTACGGGAACGAACCGACCTTTGCACAATGGGCTGCTGCTGCTGGAGTAAACCAGAGGACGCTGAGAAAGCGTCTAAACTACGGTACTTTTTGTAAAGATAAAATGATAAAAAGCAACATTCGTCTTGTCATATCGATAGCGAAAAATTATCAGGGAGGTGGAATGAATCTCCAAGATCTAGTTCAGGTACTTTTTCATCGATGCACACGAACCAAAATGCTTATAAATGGAAACTGAAAATTCGAATGCCTTGTTTATTGCTCGTGTTGTGATGTAGGAAGGATGTCGAGGCCTTGTACGAGGTGCGGAGAAGTTCGATGCTTCAAAAGGCTTCAAGTTTTCAACGTATGCTCATTGGTGGATTAAGCAGGCAGTCCGAAAGTCTCTTTCGGATCAGTCGAGGACTATTCGCTTGCCGGTATGTTTCTTTCCACCTGAATTATGACAGGAGAGAAAAAAAATTCCCAATGAACTAAAGATTTCATTGATGAATGGACAGTTTCACATGGTGGAAGCAACTTATAAGGTAAAGGAAGCTAGAAAGCAATTATATACTGCAAATGGAAGACTTCCTAACGATGAAGAAATCGCCGTAGCAGCCGGTCTGTCGATGAAGAGGCTTGCTACAGTACTGATGACTCCGAAAGCACCTCGATCCCTGGAACAGAAGTTCGGGATCAACCAAAATCTCAAACCTTCGGTCTGTCGTCTTGCTACTACTTGGATCTTCCTTCCTCTCCCATTGCAGTCTCAGCATGTTTTAACTAGTTGAATCTCATGCTCTTTTCCAGGAAGTCATTTCCGACCCGGAAGCGGTAACCGCAGAAGATCTGTTGATAAAACAGTTCATGAAGCAGGAGCTCGAAAAGGTACTTGACTCGCTTAATCCGAGAGAGAGACAAATAATTAGATGGAGGTATGGCATGGAAGATGGAAGGATGAAAACATTGCAAGAAATAGGAGAAATGATGGGTGTAAGTAGGGAAAGAATTAGACAAATTGAGCTATGTGCGTTCAGAAAACTCAAAAACAAGAAGAGAACTAAACATTTGCAGCAGTATCTGATGCCATGATGATGAGTTCTTCATCAGATTCCTTCAAAGAGTATCTAT
TATGATCTGGTATGAGAGGCTTAAACAAGGCTTGGTCTTTCTTTTTTCTGTAACATTTTTATTGTTCTTTCCCACTTTGAGGATAGATTAGTATCATGTTGTTTACAGACTGATAGCTACTTTCTGTTCATACATATTTAATTTAAATCAAGAGATTTGTCTTGATCCATGATTGTGGCCTCTGGATTAATGTCTTTACGATATAGTGTAACGAAAAGATGAACTTTTTTCGGGTTCTTGACGACAGTTGTGCT

mRNA sequence

GAAACAAATAGTGAGAAAATTCCATTTAATTCTTTTTCTTTTCCAATGATTATGTAAAATTCCTTCTTTTTTTTTTTTAATTTTAATTTATTCTCTCTCTCTCTCTCTCTCTCTCTTTCTCTCTCATATATGAGTTGGGGGAGGCAAAATGGGAACCTCTTATCCATTGTTTTATCAACAAAGCCTCCAAAAGCCAATTTACCCACACAAGTTTTCCTCTCTCAGCCTTCTTCTTCTTCTTCTTCTTCTTCACCAATCACCTCTTTCATTCCCTTCATCAATCAATGAAACCCTTTTCATATGCTTCACATTTCCCCTTCTGGGTCGCCGCCCCAGTTCAATTTTGAATGATTCCTTCATTTATGTCCGCCATAGTTGCAGATTAGCGGAACCCAGTTGGAGAACGAAGAACAAGAGCAAGAACAACAAGGCGATGTCGTGTTTACTTCCCCAATTCAAGTGCCATCCCGAAACATTCTCAATCCAATTTAAGACCCCCGCCATCACAATTACGCCTACGCCTACGCCTACTTCTTCTCACCATCATACCTTTCTTCCCACTGCTAATTCTACTTACTTTAAAGTCAGAGATCCTCATGGTTTACGGACACAATGTATCTTGTCTGCGGCGTCACCACCGACATCGACTGGAACAGCTACAACGTTCAATGTGGATCGACTTAAATTGCCTCCTTTTGATACGAACACCGACTCAGTCTCTGCAGAAAGACTAAGATCATATTTAGGTGCAGTCGAATCGAGCCTAGCCTCAACGCTTCTAACGAGTGAAGAGGCTACAATTGCAGCAGCTGCAGCTGAAGCAGTTGCTCTTGCTAAAGCAGCCGTCAAGGCTGCAAATGATGCAGCTCTTCTGGTTAATAATAGCAACTCTGCCAGATCAGTAACAAAATCTCAATCGTCTTCGAGATCCAATGCTCTACATTTCAAATGGGATCAGTTTATGGAATCCGAAAGGGCTGATATAATCGGAGAGCCGGTAGGAGTTAATAATCGTCCTGTGGAAGGTGATGCTTTGGAACCCAGCACAGCAGAATCTGATGATATGGAGCCAACATCTGAAGAACTTGAACTCTTAGAGGATGAATTCTCTAAGAGTATAGCCGTAAGATCGATTCGTCAAACAGAACGAAAAGCTAGAAGAACTAGAGCCTCAGAGAAGGCTGCTACTAGTGTAGTGTCACTGAAGTCGGGTTCTAGCAGCCGGAAGAAGCGTAATTCTGGACAAGAAGTGGACTACAATGACCCATTGCGTTATTTGAGAGCAACTACAAGCGCTTCTAAACTTCTTACTGCAAGTGATGAGCTTGAACTATCAGAAGGAATTCAGGACTTGCTGAAACTGGAAAGACTCAAGGAGGAGCTTGCAGGACGTTACGGGAACGAACCGACCTTTGCACAATGGGCTGCTGCTGCTGGAGTAAACCAGAGGACGCTGAGAAAGCGTCTAAACTACGGTACTTTTTGTAAAGATAAAATGATAAAAAGCAACATTCGTCTTGTCATATCGATAGCGAAAAATTATCAGGGAGGTGGAATGAATCTCCAAGATCTAGTTCAGGAAGGATGTCGAGGCCTTGTACGAGGTGCGGAGAAGTTCGATGCTTCAAAAGGCTTCAAGTTTTCAACGTATGCTCATTGGTGGATTAAGCAGGCAGTCCGAAAGTCTCTTTCGGATCAGTCGAGGACTATTCGCTTGCCGTTTCACATGGTGGAAGCAACTTATAAGGTAAAGGAAGCTAGAAAGCAATTATATACTGCAAATGGAAGACTTCCTAACGATGAAGAAATCGCCGTAGCAGCCGGTCTGTCGATGAAGAGGCTTGCTACAGTACTGATGACTCCGAAAGCACCTCGATCCCTGGAACAGAAGTTCGGGATCAACCAAAATCTCAAACCTTCGGAAGTCATTTCCGACCCGGAAGCGGTAACCGCAGAAGATCTGTTGATAAAACAGTTCATGAAGCAGGAGCTCGAAAAGG
TACTTGACTCGCTTAATCCGAGAGAGAGACAAATAATTAGATGGAGGTATGGCATGGAAGATGGAAGGATGAAAACATTGCAAGAAATAGGAGAAATGATGGGTGTAAGTAGGGAAAGAATTAGACAAATTGAGCTATGTGCGTTCAGAAAACTCAAAAACAAGAAGAGAACTAAACATTTGCAGCAGTATCTGATGCCATGATGATGAGTTCTTCATCAGATTCCTTCAAAGAGTATCTATTATGATCTGGTATGAGAGGCTTAAACAAGGCTTGGTCTTTCTTTTTTCTGTAACATTTTTATTGTTCTTTCCCACTTTGAGGATAGATTAGTATCATGTTGTTTACAGACTGATAGCTACTTTCTGTTCATACATATTTAATTTAAATCAAGAGATTTGTCTTGATCCATGATTGTGGCCTCTGGATTAATGTCTTTACGATATAGTGTAACGAAAAGATGAACTTTTTTCGGGTTCTTGACGACAGTTGTGCT

Coding sequence (CDS)

ATGGGAACCTCTTATCCATTGTTTTATCAACAAAGCCTCCAAAAGCCAATTTACCCACACAAGTTTTCCTCTCTCAGCCTTCTTCTTCTTCTTCTTCTTCTTCACCAATCACCTCTTTCATTCCCTTCATCAATCAATGAAACCCTTTTCATATGCTTCACATTTCCCCTTCTGGGTCGCCGCCCCAGTTCAATTTTGAATGATTCCTTCATTTATGTCCGCCATAGTTGCAGATTAGCGGAACCCAGTTGGAGAACGAAGAACAAGAGCAAGAACAACAAGGCGATGTCGTGTTTACTTCCCCAATTCAAGTGCCATCCCGAAACATTCTCAATCCAATTTAAGACCCCCGCCATCACAATTACGCCTACGCCTACGCCTACTTCTTCTCACCATCATACCTTTCTTCCCACTGCTAATTCTACTTACTTTAAAGTCAGAGATCCTCATGGTTTACGGACACAATGTATCTTGTCTGCGGCGTCACCACCGACATCGACTGGAACAGCTACAACGTTCAATGTGGATCGACTTAAATTGCCTCCTTTTGATACGAACACCGACTCAGTCTCTGCAGAAAGACTAAGATCATATTTAGGTGCAGTCGAATCGAGCCTAGCCTCAACGCTTCTAACGAGTGAAGAGGCTACAATTGCAGCAGCTGCAGCTGAAGCAGTTGCTCTTGCTAAAGCAGCCGTCAAGGCTGCAAATGATGCAGCTCTTCTGGTTAATAATAGCAACTCTGCCAGATCAGTAACAAAATCTCAATCGTCTTCGAGATCCAATGCTCTACATTTCAAATGGGATCAGTTTATGGAATCCGAAAGGGCTGATATAATCGGAGAGCCGGTAGGAGTTAATAATCGTCCTGTGGAAGGTGATGCTTTGGAACCCAGCACAGCAGAATCTGATGATATGGAGCCAACATCTGAAGAACTTGAACTCTTAGAGGATGAATTCTCTAAGAGTATAGCCGTAAGATCGATTCGTCAAACAGAACGAAAAGCTAGAAGAACTAGAGCCTCAGAGAAGGCTGCTACTAGTGTAGTGTCACTGAAGTCGGGTTCTAGCAGCCGGAAGAAGCGTAATTCTGGACAAGAAGTGGACTACAATGACCCATTGCGTTATTTGAGAGCAACTACAAGCGCTTCTAAACTTCTTACTGCAAGTGATGAGCTTGAACTATCAGAAGGAATTCAGGACTTGCTGAAACTGGAAAGACTCAAGGAGGAGCTTGCAGGACGTTACGGGAACGAACCGACCTTTGCACAATGGGCTGCTGCTGCTGGAGTAAACCAGAGGACGCTGAGAAAGCGTCTAAACTACGGTACTTTTTGTAAAGATAAAATGATAAAAAGCAACATTCGTCTTGTCATATCGATAGCGAAAAATTATCAGGGAGGTGGAATGAATCTCCAAGATCTAGTTCAGGAAGGATGTCGAGGCCTTGTACGAGGTGCGGAGAAGTTCGATGCTTCAAAAGGCTTCAAGTTTTCAACGTATGCTCATTGGTGGATTAAGCAGGCAGTCCGAAAGTCTCTTTCGGATCAGTCGAGGACTATTCGCTTGCCGTTTCACATGGTGGAAGCAACTTATAAGGTAAAGGAAGCTAGAAAGCAATTATATACTGCAAATGGAAGACTTCCTAACGATGAAGAAATCGCCGTAGCAGCCGGTCTGTCGATGAAGAGGCTTGCTACAGTACTGATGACTCCGAAAGCACCTCGATCCCTGGAACAGAAGTTCGGGATCAACCAAAATCTCAAACCTTCGGAAGTCATTTCCGACCCGGAAGCGGTAACCGCAGAAGATCTGTTGATAAAACAGTTCATGAAGCAGGAGCTCGAAAAGGTACTTGACTCGCTTAATCCGAGAGAGAGACAAATAATTAGATGGAGGTATGGCATGGAAGATGGAAGGATGAAAACATTGCAAGAAATAGGAGAAATGATGGGTGTAAGTAGGGAAAGAATTAGACAAATTGAGCTATGTGCGTTCAG
AAAACTCAAAAACAAGAAGAGAACTAAACATTTGCAGCAGTATCTGATGCCATGA

Protein sequence

MGTSYPLFYQQSLQKPIYPHKFSSLSLLLLLLLLHQSPLSFPSSINETLFICFTFPLLGRRPSSILNDSFIYVRHSCRLAEPSWRTKNKSKNNKAMSCLLPQFKCHPETFSIQFKTPAITITPTPTPTSSHHHTFLPTANSTYFKVRDPHGLRTQCILSAASPPTSTGTATTFNVDRLKLPPFDTNTDSVSAERLRSYLGAVESSLASTLLTSEEATIAAAAAEAVALAKAAVKAANDAALLVNNSNSARSVTKSQSSSRSNALHFKWDQFMESERADIIGEPVGVNNRPVEGDALEPSTAESDDMEPTSEELELLEDEFSKSIAVRSIRQTERKARRTRASEKAATSVVSLKSGSSSRKKRNSGQEVDYNDPLRYLRATTSASKLLTASDELELSEGIQDLLKLERLKEELAGRYGNEPTFAQWAAAAGVNQRTLRKRLNYGTFCKDKMIKSNIRLVISIAKNYQGGGMNLQDLVQEGCRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLSDQSRTIRLPFHMVEATYKVKEARKQLYTANGRLPNDEEIAVAAGLSMKRLATVLMTPKAPRSLEQKFGINQNLKPSEVISDPEAVTAEDLLIKQFMKQELEKVLDSLNPRERQIIRWRYGMEDGRMKTLQEIGEMMGVSRERIRQIELCAFRKLKNKKRTKHLQQYLMP
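
The protein sequence above should follow from the coding sequence by translation in frame 0. The sketch below is one way to check this, assuming the standard nuclear genetic code (NCBI translation table 1); `translate` is a hypothetical helper, not part of the database, and only short excerpts of the CDS are used for illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' marks codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGGAACCTCTTATCCATTGTTTTATCAA"))  # first 30 nt of the CDS
print(translate("CAGCAGTATCTGATGCCATGA"))           # final 21 nt, ending in the TGA stop
```

The two excerpts translate to MGTSYPLFYQ and QQYLMP, which agree with the start and end of the protein sequence listed above. Passing the full CDS (with line breaks removed) would give the complete comparison.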
Homology
BLAST of Cp4.1LG17g02870 vs. ExPASy Swiss-Prot
Match: O22056 (RNA polymerase sigma factor sigB OS=Arabidopsis thaliana OX=3702 GN=SIGB PE=2 SV=2)

HSP 1 Score: 641.3 bits (1653), Expect = 1.2e-182
Identity = 371/600 (61.83%), Positives = 452/600 (75.33%), Query Frame = 0

Query: 97  SCLLPQFKCHPETFSIQFKTPAITITPTPTPTSSHHHTFLPTANSTYFKVRDPHGLRTQC 156
           SCLLPQFKC P++FSI F+T         +  +  H+       S +F        + QC
Sbjct: 3   SCLLPQFKCPPDSFSIHFRT---------SFCAPKHN-----KGSVFF--------QPQC 62

Query: 157 ILSAASPPTSTGTATTFNVDRLKLPPFDTNTDSVSAERLRSYLGA---------VESSLA 216
            +S  SP   T   +  +V +L+LP FDT++DS+ ++R  +Y            +E+  +
Sbjct: 63  AVS-TSPALLT---SMLDVAKLRLPSFDTDSDSLISDRQWTYTRPDGPSTEAKYLEALAS 122

Query: 217 STLLTSEEATIAAAAAEAVALAKAAVKAANDAALLVNNSNSARSVTKSQSSSRSNALHFK 276
            TLLTS+EA + AAAAEAVALA+AAVK A DA L   NSN+   +T S +  RS     K
Sbjct: 123 ETLLTSDEAVVVAAAAEAVALARAAVKVAKDATLF-KNSNNTNLLTSSTADKRS-----K 182

Query: 277 WDQFMESERADIIGEPVGVNNRPVEGDALEPSTAESD---DME-PTSEELELLEDEFSKS 336
           WDQF E ERA I+G  + V++  +  D +  S +  +   D+E    EE+ELLE++ S S
Sbjct: 183 WDQFTEKERAGILGH-LAVSDNGIVSDKITASASNKESIGDLESEKQEEVELLEEQPSVS 242

Query: 337 IAVRSIRQTERKARRTRASEKAATSVVSLKSGSSSRKKRNSGQEVDYNDPLRYLRATTSA 396
           +AVRS RQTERKARR +  EK A+ + S+K+GSS +KKR   QEVD+NDPLRYLR TTS+
Sbjct: 243 LAVRSTRQTERKARRAKGLEKTASGIPSVKTGSSPKKKRLVAQEVDHNDPLRYLRMTTSS 302

Query: 397 SKLLTASDELELSEGIQDLLKLERLKEELAGRYGNEPTFAQWAAAAGVNQRTLRKRLNYG 456
           SKLLT  +E ELS GIQDLLKLERL+ EL  R G +PTFAQWA+AAGV+Q++LR+R+++G
Sbjct: 303 SKLLTVREEHELSAGIQDLLKLERLQTELTERSGRQPTFAQWASAAGVDQKSLRQRIHHG 362

Query: 457 TFCKDKMIKSNIRLVISIAKNYQGGGMNLQDLVQEGCRGLVRGAEKFDASKGFKFSTYAH 516
           T CKDKMIKSNIRLVISIAKNYQG GMNLQDLVQEGCRGLVRGAEKFDA+KGFKFSTYAH
Sbjct: 363 TLCKDKMIKSNIRLVISIAKNYQGAGMNLQDLVQEGCRGLVRGAEKFDATKGFKFSTYAH 422

Query: 517 WWIKQAVRKSLSDQSRTIRLPFHMVEATYKVKEARKQLYTANGRLPNDEEIAVAAGLSMK 576
           WWIKQAVRKSLSDQSR IRLPFHMVEATY+VKEARKQLY+  G+ P +EEIA A GLSMK
Sbjct: 423 WWIKQAVRKSLSDQSRMIRLPFHMVEATYRVKEARKQLYSETGKHPKNEEIAEATGLSMK 482

Query: 577 RLATVLMTPKAPRSLEQKFGINQNLKPSEVISDPEAVTAEDLLIKQFMKQELEKVLDSLN 636
           RL  VL++PK PRSL+QK G+NQNLKPSEVI+DPEAVT+ED+LIK+FM+Q+L+KVLDSL 
Sbjct: 483 RLMAVLLSPKPPRSLDQKIGMNQNLKPSEVIADPEAVTSEDILIKEFMRQDLDKVLDSLG 542

Query: 637 PRERQIIRWRYGMEDGRMKTLQEIGEMMGVSRERIRQIELCAFRKLKNKKRTKHLQQYLM 684
            RE+Q+IRWR+GMEDGRMKTLQEIGEMMGVSRER+RQIE  AFRKLKNKKR  HLQQYL+
Sbjct: 543 TREKQVIRWRFGMEDGRMKTLQEIGEMMGVSRERVRQIESSAFRKLKNKKRNNHLQQYLV 569
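
Each HSP summary line reports identical and positive (conservatively substituted) positions as counts over the alignment length, which includes gap columns. As a rough sketch, the percentages can be recomputed from those counts; `hsp_fractions` is a hypothetical helper, and the input is the summary line of the O22056 hit above.

```python
import re

def hsp_fractions(stats_line):
    """Return {label: (matches, aligned_length, percent)} for each 'Label = a/b' field."""
    out = {}
    for label, a, b in re.findall(r"(\w+) = (\d+)/(\d+)", stats_line):
        # Percent is taken over the aligned length, gaps included.
        out[label] = (int(a), int(b), round(100 * int(a) / int(b), 2))
    return out

line = "Identity = 371/600 (61.83%), Positives = 452/600 (75.33%), Query Frame = 0"
for label, (a, b, pct) in hsp_fractions(line).items():
    print(f"{label}: {a}/{b} = {pct}%")
```

The recomputed values (61.83% and 75.33%) match the figures printed in the summary line; the "Query Frame" field is skipped because it carries no a/b fraction.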

BLAST of Cp4.1LG17g02870 vs. ExPASy Swiss-Prot
Match: P26683 (RNA polymerase sigma factor SigA OS=Nostoc sp. (strain PCC 7120 / SAG 25.82 / UTEX 2576) OX=103690 GN=sigA PE=3 SV=1)

HSP 1 Score: 277.3 bits (708), Expect = 4.6e-73
Identity = 145/332 (43.67%), Positives = 220/332 (66.27%), Query Frame = 0

Query: 353 KSGSSSRKKRNSGQEVDY--NDPLRYLRATTSASKLLTASDELELSEGIQDLLKLERLKE 412
           KSG +++ +R +  +  +   D +R         +LL A +E+EL+  I DLL+LER++E
Sbjct: 59  KSGKAAKSRRRTQSKKKHYTEDSIRLYLQEIGRIRLLRADEEIELARKIADLLELERVRE 118

Query: 413 ELAGRYGNEPTFAQWAAAAGVNQRTLRKRLNYGTFCKDKMIKSNIRLVISIAKNYQGGGM 472
            L+ +   +P  ++WA A  +     R RL+ G   KDKM++SN+RLV+SIAK Y   G+
Sbjct: 119 RLSEKLERDPRDSEWAEAVQLPLPAFRYRLHIGRRAKDKMVQSNLRLVVSIAKKYMNRGL 178

Query: 473 NLQDLVQEGCRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLSDQSRTIRLPFHMVEA 532
           + QDL+QEG  GL+R AEKFD  KG+KFSTYA WWI+QA+ ++++DQSRTIRLP H+ E 
Sbjct: 179 SFQDLIQEGSLGLIRAAEKFDHEKGYKFSTYATWWIRQAITRAIADQSRTIRLPVHLYET 238

Query: 533 TYKVKEARKQLYTANGRLPNDEEIAVAAGLSMKRLATVLMTPKAPRSLEQKFGINQNLKP 592
             ++K+  K L    GR P +EEIA    +++++L  +  + + P SLE   G  ++ + 
Sbjct: 239 ISRIKKTTKLLSQEMGRKPTEEEIATRMEMTIEKLRFIAKSAQLPISLETPIGKEEDSRL 298

Query: 593 SEVISDPEAVTAEDLLIKQFMKQELEKVLDSLNPRERQIIRWRYGMEDGRMKTLQEIGEM 652
            + I + +  T ED + K  ++++LEKVLDSL+PRER ++R RYG++DGRMKTL+EIG++
Sbjct: 299 GDFI-ESDGETPEDQVSKNLLREDLEKVLDSLSPRERDVLRLRYGLDDGRMKTLEEIGQI 358

Query: 653 MGVSRERIRQIELCAFRKLKNKKRTKHLQQYL 683
             V+RERIRQIE  A RKL++  R   L++Y+
Sbjct: 359 FNVTRERIRQIEAKALRKLRHPNRNSVLKEYI 389

BLAST of Cp4.1LG17g02870 vs. ExPASy Swiss-Prot
Match: P38023 (RNA polymerase sigma factor SigA1 OS=Synechococcus elongatus (strain PCC 7942 / FACHB-805) OX=1140 GN=sigA1 PE=3 SV=2)

HSP 1 Score: 262.3 bits (669), Expect = 1.5e-68
Identity = 154/386 (39.90%), Positives = 234/386 (60.62%), Query Frame = 0

Query: 310 SEELELLEDEFSKSIAVRSIRQTERKARRTRASEKAATSVVSLKSGSSSRKKRNSGQEVD 369
           ++  ELL D   K    ++ R + +KA  T A  + AT++       +   + + G++ D
Sbjct: 17  TQATELL-DPALKPAETKAKRSSRKKA--TTAVVEPATTIAPTADVDAIDDEDSVGEDED 76

Query: 370 -------------YNDPLRYLRATTSASKLLTASDELELSEGIQDLLKLERLKEELAGRY 429
                          D +R         +LL A +E+EL+  I DLL LER+++EL  + 
Sbjct: 77  AAAKAKAKVRKTYTEDSIRLYLQEIGRIRLLRADEEIELARQIADLLALERIRDELLEQL 136

Query: 430 GNEPTFAQWAAAAGVNQRTLRKRLNYGTFCKDKMIKSNIRLVISIAKNYQGGGMNLQDLV 489
              P+ A+WAAA        R+RL  G   KDKM++SN+RLV+SIAK Y   G++ QDL+
Sbjct: 137 DRLPSDAEWAAAVDSPLDEFRRRLFRGRRAKDKMVQSNLRLVVSIAKKYMNRGLSFQDLI 196

Query: 490 QEGCRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLSDQSRTIRLPFHMVEATYKVKE 549
           QEG  GL+R AEKFD  KG+KFSTYA WWI+QA+ ++++DQSRTIRLP H+ E   ++K+
Sbjct: 197 QEGSLGLIRAAEKFDHEKGYKFSTYATWWIRQAITRAIADQSRTIRLPVHLYETISRIKK 256

Query: 550 ARKQLYTANGRLPNDEEIAVAAGLSMKRLATVLMTPKAPRSLEQKFGINQNLKPSEVISD 609
             K L    GR P +EEIA    +++++L  +  + + P SLE   G  ++ +  + I +
Sbjct: 257 TTKLLSQEMGRKPTEEEIATRMEMTIEKLRFIAKSAQLPISLETPIGKEEDSRLGDFI-E 316

Query: 610 PEAVTAEDLLIKQFMKQELEKVLDSLNPRERQIIRWRYGMEDGRMKTLQEIGEMMGVSRE 669
            +  T ED + K  ++++LE VL +L+PRER ++R RYG++DGRMKTL+EIG++  V+RE
Sbjct: 317 ADGETPEDEVAKNLLREDLEGVLSTLSPRERDVLRLRYGLDDGRMKTLEEIGQLFNVTRE 376

Query: 670 RIRQIELCAFRKLKNKKRTKHLQQYL 683
           RIRQIE  A RKL++  R   L++Y+
Sbjct: 377 RIRQIEAKALRKLRHPNRNSILKEYI 398

BLAST of Cp4.1LG17g02870 vs. ExPASy Swiss-Prot
Match: P52322 (RNA polymerase sigma factor SigA OS=Microcystis aeruginosa OX=1126 GN=sigA PE=3 SV=1)

HSP 1 Score: 248.8 bits (634), Expect = 1.7e-64
Identity = 139/366 (37.98%), Positives = 216/366 (59.02%), Query Frame = 0

Query: 357 SSRKKRNSGQEVDY-NDPLRYLRATTSASKLLTASDELELSEGIQDLLKLERLKEELA-- 416
           +S ++R++ ++  Y  D +R         +LL A +E+EL+  I DLLKLER++E+    
Sbjct: 51  ASARRRDAARKKPYTEDSIRIYLQEIGRIRLLRAEEEIELARKIADLLKLERIREDFCLY 110

Query: 417 --GRYGNE-----------------------------------PTFAQWAAAAGVNQRTL 476
               +G +                                   P   +W A +       
Sbjct: 111 SDAEWGKQVFLFERIEKIIVEKSEKEPKLSDIKAYLGKTELTAPLLEEWLAKSKEYLSAF 170

Query: 477 RKRLNYGTFCKDKMIKSNIRLVISIAKNYQGGGMNLQDLVQEGCRGLVRGAEKFDASKGF 536
           ++RL +G   K+KM++SN+RLV+SIAK Y   G++ QDL+QEG  GL+R AEKFD  KG+
Sbjct: 171 KRRLYHGRRAKEKMVQSNLRLVVSIAKKYMNRGLSFQDLIQEGSLGLIRAAEKFDHEKGY 230

Query: 537 KFSTYAHWWIKQAVRKSLSDQSRTIRLPFHMVEATYKVKEARKQLYTANGRLPNDEEIAV 596
           KFSTYA WWI+QA+ ++++DQSRTIRLP H+ E   ++K+  K L     R P +EEIA 
Sbjct: 231 KFSTYATWWIRQAITRAIADQSRTIRLPVHLYETISRIKKTTKILSQEMRRKPTEEEIAT 290

Query: 597 AAGLSMKRLATVLMTPKAPRSLEQKFGINQNLKPSEVISDPEAVTAEDLLIKQFMKQELE 656
              +++++L  +  + + P SLE   G  ++ +  + I + +  T ED + K  ++++LE
Sbjct: 291 KMEMTIEKLRFIAKSAQLPISLETPIGKEEDSRLGDFI-EADGETPEDEVSKNLLREDLE 350

Query: 657 KVLDSLNPRERQIIRWRYGMEDGRMKTLQEIGEMMGVSRERIRQIELCAFRKLKNKKRTK 683
            VLD+L+PRER ++R RYG++DGRMKTL+EIG++  V+RERIRQIE  A RKL++  R  
Sbjct: 351 NVLDTLSPRERDVLRLRYGLDDGRMKTLEEIGQIFNVTRERIRQIEAKALRKLRHPNRNS 410

BLAST of Cp4.1LG17g02870 vs. ExPASy Swiss-Prot
Match: Q9LD95 (RNA polymerase sigma factor sigF, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=SIGF PE=1 SV=1)

HSP 1 Score: 236.1 bits (601), Expect = 1.2e-60
Identity = 146/391 (37.34%), Positives = 221/391 (56.52%), Query Frame = 0

Query: 291 VEGDALEPSTAESDDMEPTSEELELLEDEFSKSIAVRSIRQTERKARRTRASEKAATSVV 350
           V+     PS    D +  TS  + L E    K   VRS RQ ER+A+  RA +       
Sbjct: 158 VDDTEANPSDNIKDSLS-TSSSMSLPE----KGNIVRSKRQLERRAKNRRAPKSNDVDDE 217

Query: 351 SLKSGSSSRKKRNSGQEVDYNDPLRYLRATTSASKLLTASDELELSEGIQDLLKLERLKE 410
                 +S KK+   Q  D +D L+         +LLTA +E EL   IQ LLKLE++K 
Sbjct: 218 GYVPQKTSAKKKYK-QGADNDDALQLFLWGPETKQLLTAKEEAELISHIQHLLKLEKVKT 277

Query: 411 ELAGRYGNEPTFAQWAAAAGVNQRTLRKRLNYGTFCKDKMIKSNIRLVISIAKNYQGGGM 470
           +L  + G EPT  +WA A G++   L+  ++ G   ++K+I +N+RLV+ IAK YQ  G+
Sbjct: 278 KLESQNGCEPTIGEWAEAMGISSPVLKSDIHRGRSSREKLITANLRLVVHIAKQYQNRGL 337

Query: 471 NLQDLVQEGCRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLSDQSRTIRLPFHMVEA 530
           N QDL+QEG  GL++  EKF    G +F+TYA+WWI+Q++RKS+   SRTIRLP ++   
Sbjct: 338 NFQDLLQEGSMGLMKSVEKFKPQSGCRFATYAYWWIRQSIRKSIFQNSRTIRLPENVYML 397

Query: 531 TYKVKEARKQLYTANGRLPNDEEIAVAAGLSMKRLATVLMTPKAPRSLEQKFGINQNLKP 590
             KV EARK         P+ EE+A   G+S ++L  +L   + P S++Q    +Q+   
Sbjct: 398 LGKVSEARKTCVQEGNYRPSKEELAGHVGVSTEKLDKLLYNTRTPLSMQQPIWSDQDTTF 457

Query: 591 SEVISDPEAVTAEDLLIKQFMKQELEKVLDSLNPRERQIIRWRYGMEDGRMKTLQEIGEM 650
            E+  D    T    + KQ M+  +  +L+ L+P+ER+II+ R+G++ G+ ++L EIGE+
Sbjct: 458 QEITPDSGIETPTMSVGKQLMRNHVRNLLNVLSPKERRIIKLRFGIDGGKQRSLSEIGEI 517

Query: 651 MGVSRERIRQIELCAFRKLKNKKRTKHLQQY 682
            G+S+ER+RQ+E  A  +LK    +  L  Y
Sbjct: 518 YGLSKERVRQLESRALYRLKQNMNSHGLHAY 542

BLAST of Cp4.1LG17g02870 vs. NCBI nr
Match: XP_023515004.1 (RNA polymerase sigma factor sigB [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1293 bits (3345), Expect = 0.0
Identity = 684/684 (100.00%), Positives = 684/684 (100.00%), Query Frame = 0

Query: 1   MGTSYPLFYQQSLQKPIYPHKFSSLSLLLLLLLLHQSPLSFPSSINETLFICFTFPLLGR 60
           MGTSYPLFYQQSLQKPIYPHKFSSLSLLLLLLLLHQSPLSFPSSINETLFICFTFPLLGR
Sbjct: 1   MGTSYPLFYQQSLQKPIYPHKFSSLSLLLLLLLLHQSPLSFPSSINETLFICFTFPLLGR 60

Query: 61  RPSSILNDSFIYVRHSCRLAEPSWRTKNKSKNNKAMSCLLPQFKCHPETFSIQFKTPAIT 120
           RPSSILNDSFIYVRHSCRLAEPSWRTKNKSKNNKAMSCLLPQFKCHPETFSIQFKTPAIT
Sbjct: 61  RPSSILNDSFIYVRHSCRLAEPSWRTKNKSKNNKAMSCLLPQFKCHPETFSIQFKTPAIT 120

Query: 121 ITPTPTPTSSHHHTFLPTANSTYFKVRDPHGLRTQCILSAASPPTSTGTATTFNVDRLKL 180
           ITPTPTPTSSHHHTFLPTANSTYFKVRDPHGLRTQCILSAASPPTSTGTATTFNVDRLKL
Sbjct: 121 ITPTPTPTSSHHHTFLPTANSTYFKVRDPHGLRTQCILSAASPPTSTGTATTFNVDRLKL 180

Query: 181 PPFDTNTDSVSAERLRSYLGAVESSLASTLLTSEEATIAAAAAEAVALAKAAVKAANDAA 240
           PPFDTNTDSVSAERLRSYLGAVESSLASTLLTSEEATIAAAAAEAVALAKAAVKAANDAA
Sbjct: 181 PPFDTNTDSVSAERLRSYLGAVESSLASTLLTSEEATIAAAAAEAVALAKAAVKAANDAA 240

Query: 241 LLVNNSNSARSVTKSQSSSRSNALHFKWDQFMESERADIIGEPVGVNNRPVEGDALEPST 300
           LLVNNSNSARSVTKSQSSSRSNALHFKWDQFMESERADIIGEPVGVNNRPVEGDALEPST
Sbjct: 241 LLVNNSNSARSVTKSQSSSRSNALHFKWDQFMESERADIIGEPVGVNNRPVEGDALEPST 300

Query: 301 AESDDMEPTSEELELLEDEFSKSIAVRSIRQTERKARRTRASEKAATSVVSLKSGSSSRK 360
           AESDDMEPTSEELELLEDEFSKSIAVRSIRQTERKARRTRASEKAATSVVSLKSGSSSRK
Sbjct: 301 AESDDMEPTSEELELLEDEFSKSIAVRSIRQTERKARRTRASEKAATSVVSLKSGSSSRK 360

Query: 361 KRNSGQEVDYNDPLRYLRATTSASKLLTASDELELSEGIQDLLKLERLKEELAGRYGNEP 420
           KRNSGQEVDYNDPLRYLRATTSASKLLTASDELELSEGIQDLLKLERLKEELAGRYGNEP
Sbjct: 361 KRNSGQEVDYNDPLRYLRATTSASKLLTASDELELSEGIQDLLKLERLKEELAGRYGNEP 420

Query: 421 TFAQWAAAAGVNQRTLRKRLNYGTFCKDKMIKSNIRLVISIAKNYQGGGMNLQDLVQEGC 480
           TFAQWAAAAGVNQRTLRKRLNYGTFCKDKMIKSNIRLVISIAKNYQGGGMNLQDLVQEGC
Sbjct: 421 TFAQWAAAAGVNQRTLRKRLNYGTFCKDKMIKSNIRLVISIAKNYQGGGMNLQDLVQEGC 480

Query: 481 RGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLSDQSRTIRLPFHMVEATYKVKEARKQ 540
           RGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLSDQSRTIRLPFHMVEATYKVKEARKQ
Sbjct: 481 RGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLSDQSRTIRLPFHMVEATYKVKEARKQ 540

Query: 541 LYTANGRLPNDEEIAVAAGLSMKRLATVLMTPKAPRSLEQKFGINQNLKPSEVISDPEAV 600
           LYTANGRLPNDEEIAVAAGLSMKRLATVLMTPKAPRSLEQKFGINQNLKPSEVISDPEAV
Sbjct: 541 LYTANGRLPNDEEIAVAAGLSMKRLATVLMTPKAPRSLEQKFGINQNLKPSEVISDPEAV 600

Query: 601 TAEDLLIKQFMKQELEKVLDSLNPRERQIIRWRYGMEDGRMKTLQEIGEMMGVSRERIRQ 660
           TAEDLLIKQFMKQELEKVLDSLNPRERQIIRWRYGMEDGRMKTLQEIGEMMGVSRERIRQ
Sbjct: 601 TAEDLLIKQFMKQELEKVLDSLNPRERQIIRWRYGMEDGRMKTLQEIGEMMGVSRERIRQ 660

Query: 661 IELCAFRKLKNKKRTKHLQQYLMP 684
           IELCAFRKLKNKKRTKHLQQYLMP
Sbjct: 661 IELCAFRKLKNKKRTKHLQQYLMP 684

BLAST of Cp4.1LG17g02870 vs. NCBI nr
Match: KAG6593746.1 (RNA polymerase sigma factor sigB, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1281 bits (3316), Expect = 0.0
Identity = 681/685 (99.42%), Positives = 682/685 (99.56%), Query Frame = 0

Query: 1   MGTSYPLFYQQSLQKPIYPHKFSSLSLLLLLLLL-HQSPLSFPSSINETLFICFTFPLLG 60
           MGTSYPLFYQQSLQKPIYPHKFSSLSLLLLLLLL HQSPLSFPSSINETLFICFTFPLLG
Sbjct: 1   MGTSYPLFYQQSLQKPIYPHKFSSLSLLLLLLLLLHQSPLSFPSSINETLFICFTFPLLG 60

Query: 61  RRPSSILNDSFIYVRHSCRLAEPSWRTKNKSKNNKAMSCLLPQFKCHPETFSIQFKTPAI 120
           RRPSSILNDSFIYVRHSCRLAEPSWRTKNK+KNNKAMSCLLPQFKCHPETFSIQFKTPAI
Sbjct: 61  RRPSSILNDSFIYVRHSCRLAEPSWRTKNKNKNNKAMSCLLPQFKCHPETFSIQFKTPAI 120

Query: 121 TITPTPTPTSSHHHTFLPTANSTYFKVRDPHGLRTQCILSAASPPTSTGTATTFNVDRLK 180
           TITPTPTPTSSHHHTFLPTANSTYFKVRDPH LRTQCILSAASPPTSTGTATTFNVDRLK
Sbjct: 121 TITPTPTPTSSHHHTFLPTANSTYFKVRDPHSLRTQCILSAASPPTSTGTATTFNVDRLK 180

Query: 181 LPPFDTNTDSVSAERLRSYLGAVESSLASTLLTSEEATIAAAAAEAVALAKAAVKAANDA 240
           LPPFDTNTDSVSAERLRSYLGAVESSLASTLLTSEEATIAAAAAEAVALAKAAVKAANDA
Sbjct: 181 LPPFDTNTDSVSAERLRSYLGAVESSLASTLLTSEEATIAAAAAEAVALAKAAVKAANDA 240

Query: 241 ALLVNNSNSARSVTKSQSSSRSNALHFKWDQFMESERADIIGEPVGVNNRPVEGDALEPS 300
           ALLVNNSNSARSVTKSQSSSRSNALHFKWDQFMESERADIIGEPVGVNNRPVEGDALEPS
Sbjct: 241 ALLVNNSNSARSVTKSQSSSRSNALHFKWDQFMESERADIIGEPVGVNNRPVEGDALEPS 300

Query: 301 TAESDDMEPTSEELELLEDEFSKSIAVRSIRQTERKARRTRASEKAATSVVSLKSGSSSR 360
           TAESDDMEPTSEELELLEDEFSKSIAVRSIRQTERKARRTRASEKAATSVVSLKSGSSSR
Sbjct: 301 TAESDDMEPTSEELELLEDEFSKSIAVRSIRQTERKARRTRASEKAATSVVSLKSGSSSR 360

Query: 361 KKRNSGQEVDYNDPLRYLRATTSASKLLTASDELELSEGIQDLLKLERLKEELAGRYGNE 420
           KKRNSGQEVDYNDPLRYLRATTSASKLLTASDELELSEGIQDLLKLERLKEELAGRYGNE
Sbjct: 361 KKRNSGQEVDYNDPLRYLRATTSASKLLTASDELELSEGIQDLLKLERLKEELAGRYGNE 420

Query: 421 PTFAQWAAAAGVNQRTLRKRLNYGTFCKDKMIKSNIRLVISIAKNYQGGGMNLQDLVQEG 480
           PTFAQWAAAAGVNQRTLRKRLNYGT CKDKMIKSNIRLVISIAKNYQGGGMNLQDLVQEG
Sbjct: 421 PTFAQWAAAAGVNQRTLRKRLNYGTCCKDKMIKSNIRLVISIAKNYQGGGMNLQDLVQEG 480

Query: 481 CRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLSDQSRTIRLPFHMVEATYKVKEARK 540
           CRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLSDQSRTIRLPFHMVEATYKVKEARK
Sbjct: 481 CRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLSDQSRTIRLPFHMVEATYKVKEARK 540

Query: 541 QLYTANGRLPNDEEIAVAAGLSMKRLATVLMTPKAPRSLEQKFGINQNLKPSEVISDPEA 600
           QLYTANGRLPNDEEIAVAAGLSMKRLATVLMTPKAPRSLEQKFGINQNLKPSEVISDPEA
Sbjct: 541 QLYTANGRLPNDEEIAVAAGLSMKRLATVLMTPKAPRSLEQKFGINQNLKPSEVISDPEA 600

Query: 601 VTAEDLLIKQFMKQELEKVLDSLNPRERQIIRWRYGMEDGRMKTLQEIGEMMGVSRERIR 660
           VTAEDLLIKQFMKQELEKVLDSLNPRERQIIRWRYGMEDGRMKTLQEIGEMMGVSRERIR
Sbjct: 601 VTAEDLLIKQFMKQELEKVLDSLNPRERQIIRWRYGMEDGRMKTLQEIGEMMGVSRERIR 660

Query: 661 QIELCAFRKLKNKKRTKHLQQYLMP 684
           QIELCAFRKLKNKKRTKHLQQYLMP
Sbjct: 661 QIELCAFRKLKNKKRTKHLQQYLMP 685

BLAST of Cp4.1LG17g02870 vs. NCBI nr
Match: XP_022964550.1 (RNA polymerase sigma factor sigB [Cucurbita moschata])

HSP 1 Score: 1263 bits (3269), Expect = 0.0
Identity = 674/686 (98.25%), Positives = 676/686 (98.54%), Query Frame = 0

Query: 1   MGTSYPLFYQQSLQKPIYPHKFSSLSLLLLLLLL--HQSPLSFPSSINETLFICFTFPLL 60
           MGTSYPLFYQQSLQKPIYPHKFSSLSLLLLLLLL  HQSPL FPSSINETLFICFTFPLL
Sbjct: 1   MGTSYPLFYQQSLQKPIYPHKFSSLSLLLLLLLLLLHQSPLPFPSSINETLFICFTFPLL 60

Query: 61  GRRPSSILNDSFIYVRHSCRLAEPSWRTKNKSKNNKAMSCLLPQFKCHPETFSIQFKTPA 120
           GRRPSSILND FIYVRHSCRLAEPSWRTKNK+KNNKAMSCLLPQFKCHPETFSIQFKTPA
Sbjct: 61  GRRPSSILNDYFIYVRHSCRLAEPSWRTKNKNKNNKAMSCLLPQFKCHPETFSIQFKTPA 120

Query: 121 ITITPTPTPTSSHHHTFLPTANSTYFKVRDPHGLRTQCILSAASPPTSTGTATTFNVDRL 180
            TITPTPT  SSHHHTFLPTANSTYFK RDPH LRTQCILSAASPPT TGTATTFNVDRL
Sbjct: 121 STITPTPT--SSHHHTFLPTANSTYFKFRDPHSLRTQCILSAASPPTLTGTATTFNVDRL 180

Query: 181 KLPPFDTNTDSVSAERLRSYLGAVESSLASTLLTSEEATIAAAAAEAVALAKAAVKAAND 240
           KLPPFDTNTDSVSAERLRSYLGAVESSLASTLLTSEEATIAAAAAEAVALAKAAVKAAND
Sbjct: 181 KLPPFDTNTDSVSAERLRSYLGAVESSLASTLLTSEEATIAAAAAEAVALAKAAVKAAND 240

Query: 241 AALLVNNSNSARSVTKSQSSSRSNALHFKWDQFMESERADIIGEPVGVNNRPVEGDALEP 300
           AALLVNNSNS+RSVTKSQSSSRSNALHFKWDQFMESERADIIGEPVGVNNRPVEGDALEP
Sbjct: 241 AALLVNNSNSSRSVTKSQSSSRSNALHFKWDQFMESERADIIGEPVGVNNRPVEGDALEP 300

Query: 301 STAESDDMEPTSEELELLEDEFSKSIAVRSIRQTERKARRTRASEKAATSVVSLKSGSSS 360
           STAESDDMEPTSEELELLEDEFSKSIAVRSIRQTERKARRTRASEKAATSVVSLKSGSSS
Sbjct: 301 STAESDDMEPTSEELELLEDEFSKSIAVRSIRQTERKARRTRASEKAATSVVSLKSGSSS 360

Query: 361 RKKRNSGQEVDYNDPLRYLRATTSASKLLTASDELELSEGIQDLLKLERLKEELAGRYGN 420
           RKKRNSGQEVDYNDPLRYLRATTSASKLLTASDELELSEGIQDLLKLERLKEELAGRYGN
Sbjct: 361 RKKRNSGQEVDYNDPLRYLRATTSASKLLTASDELELSEGIQDLLKLERLKEELAGRYGN 420

Query: 421 EPTFAQWAAAAGVNQRTLRKRLNYGTFCKDKMIKSNIRLVISIAKNYQGGGMNLQDLVQE 480
           EPTFAQWAAAAGVNQRTLRKRLNYGTFCKDKMIKSNIRLVISIAKNYQGGGMNLQDLVQE
Sbjct: 421 EPTFAQWAAAAGVNQRTLRKRLNYGTFCKDKMIKSNIRLVISIAKNYQGGGMNLQDLVQE 480

Query: 481 GCRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLSDQSRTIRLPFHMVEATYKVKEAR 540
           GCRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLSDQSRTIRLPFHMVEATYKVKEAR
Sbjct: 481 GCRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLSDQSRTIRLPFHMVEATYKVKEAR 540

Query: 541 KQLYTANGRLPNDEEIAVAAGLSMKRLATVLMTPKAPRSLEQKFGINQNLKPSEVISDPE 600
           KQLYTANGRLPNDEEIAVAAGLSMKRLATVLMTPKAPRSLEQKFGINQNLKPSEVISDPE
Sbjct: 541 KQLYTANGRLPNDEEIAVAAGLSMKRLATVLMTPKAPRSLEQKFGINQNLKPSEVISDPE 600

Query: 601 AVTAEDLLIKQFMKQELEKVLDSLNPRERQIIRWRYGMEDGRMKTLQEIGEMMGVSRERI 660
           AVTAEDLLIKQFMKQELEKVLDSLNPRERQIIRWRYGMEDGRMKTLQEIGEMMGVSRERI
Sbjct: 601 AVTAEDLLIKQFMKQELEKVLDSLNPRERQIIRWRYGMEDGRMKTLQEIGEMMGVSRERI 660

Query: 661 RQIELCAFRKLKNKKRTKHLQQYLMP 684
           RQIELCAFRKLKNKKRTKHLQQYLMP
Sbjct: 661 RQIELCAFRKLKNKKRTKHLQQYLMP 684

BLAST of Cp4.1LG17g02870 vs. NCBI nr
Match: XP_023000238.1 (RNA polymerase sigma factor sigB [Cucurbita maxima])

HSP 1 Score: 1075 bits (2780), Expect = 0.0
Identity = 579/589 (98.30%), Positives = 581/589 (98.64%), Query Frame = 0

Query: 96  MSCLLPQFKCHPETFSIQFKTPAITITPTPTPTSSHHHTFLPTANSTYFKVRDPHGLRTQ 155
           MSCLLPQFKCHPETFSIQFKTPAITITPTPT  SSHHHTFLPTANSTYFKVRDPH LRTQ
Sbjct: 1   MSCLLPQFKCHPETFSIQFKTPAITITPTPT--SSHHHTFLPTANSTYFKVRDPHSLRTQ 60

Query: 156 CILSAASPPTSTGTATTFNVDRLKLPPFDTNTDSVSAERLRSYLGAVESSLASTLLTSEE 215
           CILSAA   TSTGTATTFNVDRLKLPPFDTNTDSVSAERLRSYLGAVESSLASTLLTSEE
Sbjct: 61  CILSAA---TSTGTATTFNVDRLKLPPFDTNTDSVSAERLRSYLGAVESSLASTLLTSEE 120

Query: 216 ATIAAAAAEAVALAKAAVKAANDAALLVNNSNSARSVTKSQSSSRSNALHFKWDQFMESE 275
           ATIAAAAAEAVALAKAAVKAANDAALLVNNSNSARSVTKSQSSS+SNALHFKWDQFMESE
Sbjct: 121 ATIAAAAAEAVALAKAAVKAANDAALLVNNSNSARSVTKSQSSSKSNALHFKWDQFMESE 180

Query: 276 RADIIGEPVGVNNRPVEGDALEPSTAESDDMEPTSEELELLEDEFSKSIAVRSIRQTERK 335
           RADIIGEPVGVNNRPVEGDALEPSTAESDDMEPTSEELELL+DEFSKSIAVRS RQTERK
Sbjct: 181 RADIIGEPVGVNNRPVEGDALEPSTAESDDMEPTSEELELLQDEFSKSIAVRSNRQTERK 240

Query: 336 ARRTRASEKAATSVVSLKSGSSSRKKRNSGQEVDYNDPLRYLRATTSASKLLTASDELEL 395
           ARRTRASEKAATSVVSLKSGSSSRKKRNSGQEVDYNDPLRYLRATTSASKLLTASDELEL
Sbjct: 241 ARRTRASEKAATSVVSLKSGSSSRKKRNSGQEVDYNDPLRYLRATTSASKLLTASDELEL 300

Query: 396 SEGIQDLLKLERLKEELAGRYGNEPTFAQWAAAAGVNQRTLRKRLNYGTFCKDKMIKSNI 455
           SEGIQDLLKLERLKEELAGRYGNEPTFAQWAAAAGVNQRTLRKRLNYGTFCKDKMIKSNI
Sbjct: 301 SEGIQDLLKLERLKEELAGRYGNEPTFAQWAAAAGVNQRTLRKRLNYGTFCKDKMIKSNI 360

Query: 456 RLVISIAKNYQGGGMNLQDLVQEGCRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLS 515
           RLVISIAKNYQGGGMNLQDLVQEGCRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLS
Sbjct: 361 RLVISIAKNYQGGGMNLQDLVQEGCRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLS 420

Query: 516 DQSRTIRLPFHMVEATYKVKEARKQLYTANGRLPNDEEIAVAAGLSMKRLATVLMTPKAP 575
           DQSRTIRLPFHMVEATYKVKEARKQLYTANGRLP DEEIAVAAGLSMKRLATVLMTPKAP
Sbjct: 421 DQSRTIRLPFHMVEATYKVKEARKQLYTANGRLPADEEIAVAAGLSMKRLATVLMTPKAP 480

Query: 576 RSLEQKFGINQNLKPSEVISDPEAVTAEDLLIKQFMKQELEKVLDSLNPRERQIIRWRYG 635
           RSLEQKFGINQNLKPSEVISDPEAVTAEDLLIKQFMKQELEKVLDSLNPRERQIIRWRYG
Sbjct: 481 RSLEQKFGINQNLKPSEVISDPEAVTAEDLLIKQFMKQELEKVLDSLNPRERQIIRWRYG 540

Query: 636 MEDGRMKTLQEIGEMMGVSRERIRQIELCAFRKLKNKKRTKHLQQYLMP 684
           MEDGRMKTLQEIGEMMGVSRERIRQIELCAFRKLKNKKRTKHLQQYLMP
Sbjct: 541 MEDGRMKTLQEIGEMMGVSRERIRQIELCAFRKLKNKKRTKHLQQYLMP 584

BLAST of Cp4.1LG17g02870 vs. NCBI nr
Match: XP_038875580.1 (RNA polymerase sigma factor sigB isoform X1 [Benincasa hispida])

HSP 1 Score: 966 bits (2498), Expect = 0.0
Identity = 519/588 (88.27%), Positives = 545/588 (92.69%), Query Frame = 0

Query: 96  MSCLLPQFKCHPETFSIQFKTPAITITPTPTPTSSHHHTFLPTANSTYFKVRDPHGLRTQ 155
           MSCLLPQFKCHPETFSIQFKT A          + HHHTFLPTANS+Y KVRDPH LRTQ
Sbjct: 1   MSCLLPQFKCHPETFSIQFKTAANY--------APHHHTFLPTANSSYTKVRDPHSLRTQ 60

Query: 156 CILSAASPPTSTGTATTFNVDRLKLPPFDTNTDSVSAERLRSYLGAVESSLASTLLTSEE 215
           CILSAASPPTSTGTATT NVDRL LPPFDTNTDSVS ERLRSYLGAVESSLASTLLTSEE
Sbjct: 61  CILSAASPPTSTGTATTLNVDRLMLPPFDTNTDSVSVERLRSYLGAVESSLASTLLTSEE 120

Query: 216 ATIAAAAAEAVALAKAAVKAANDAALLVNNSNSARSVTKSQSSSRSNALHFKWDQFMESE 275
           ATIAAAAAEAVALAKAAVK A DAALL NNSNS+R+  KS+SS R +ALHFKW QFMESE
Sbjct: 121 ATIAAAAAEAVALAKAAVKVARDAALLANNSNSSRAGEKSRSSPRPDALHFKWAQFMESE 180

Query: 276 RADIIGEPVGVNNRPVEGDALEPSTAESDDMEPTSEELELLEDEFSKSIAVRSIRQTERK 335
           RADIIGEPVGVNNRP+E DAL+PST +SDDMEPTSEELELL+DE S+SI VRS RQTERK
Sbjct: 181 RADIIGEPVGVNNRPMETDALQPSTTKSDDMEPTSEELELLQDELSESITVRSKRQTERK 240

Query: 336 ARRTRASEKAATSVVSLKSGSSSRKKRNSGQEVDYNDPLRYLRATTSASKLLTASDELEL 395
           ARRTRA+EK ATSVVSLKSGSSSRKKRNS QEVDY+DPLRYLRATTS S+LLTA++ELEL
Sbjct: 241 ARRTRAAEKTATSVVSLKSGSSSRKKRNSLQEVDYSDPLRYLRATTSTSRLLTAAEELEL 300

Query: 396 SEGIQDLLKLERLKEELAGRYGNEPTFAQWAAAAGVNQRTLRKRLNYGTFCKDKMIKSNI 455
           SEGIQDLLKLERL+EELA RYGNEPTFAQWAAAAGVNQRTLRKRLNYGT CKDKMIKSNI
Sbjct: 301 SEGIQDLLKLERLQEELADRYGNEPTFAQWAAAAGVNQRTLRKRLNYGTLCKDKMIKSNI 360

Query: 456 RLVISIAKNYQGGGMNLQDLVQEGCRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLS 515
           RLVISIAKNYQG GMNLQDLVQEGCRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLS
Sbjct: 361 RLVISIAKNYQGAGMNLQDLVQEGCRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLS 420

Query: 516 DQSRTIRLPFHMVEATYKVKEARKQLYTANGRLPNDEEIAVAAGLSMKRLATVLMTPKAP 575
           DQSRTIRLPFHMVEATY+VKEARKQLY+ NGR P+DEEIA AAGLSMKRLA VLMTPKAP
Sbjct: 421 DQSRTIRLPFHMVEATYRVKEARKQLYSENGRHPDDEEIAEAAGLSMKRLAAVLMTPKAP 480

Query: 576 RSLEQKFGINQNLKPSEVISDPEAVTAEDLLIKQFMKQELEKVLDSLNPRERQIIRWRYG 635
           RSLEQK GINQNLKPSEVISDPEA TAED+LIKQFMKQ+LEKVLDSLNPRERQ+IRWR+G
Sbjct: 481 RSLEQKIGINQNLKPSEVISDPEAETAEDMLIKQFMKQDLEKVLDSLNPRERQVIRWRFG 540

Query: 636 MEDGRMKTLQEIGEMMGVSRERIRQIELCAFRKLKNKKRTKHLQQYLM 683
           MEDGRMKTLQEIGE+MGVSRERIRQIE CAFRKLKNKKRTKHLQQYL+
Sbjct: 541 MEDGRMKTLQEIGEIMGVSRERIRQIESCAFRKLKNKKRTKHLQQYLI 580
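
The identity and positives percentages in each HSP summary are the match counts divided by the alignment length. A minimal Python sketch of that arithmetic, assuming the summary line format shown in these reports; `parse_hsp_stats` and `percent` are hypothetical helper names and not part of any BLAST tool:

```python
import re

# Pattern for the HSP summary line as printed in this report.
# "Posi?tives" also tolerates the "Postives" spelling seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return (matches, alignment length, reported percent) for both fields."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }


def percent(matches, length):
    """Recompute a percentage from raw counts, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


# Values copied from the Benincasa hispida HSP above.
stats = parse_hsp_stats(
    "Identity = 519/588 (88.27%), Positives = 545/588 (92.69%), Query Frame = 0"
)
matches, length, reported = stats["identity"]
recomputed = percent(matches, length)
```

Comparing `recomputed` with `reported` is a quick consistency check when these summaries are copied by hand into other tables.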

BLAST of Cp4.1LG17g02870 vs. ExPASy TrEMBL
Match: A0A6J1HL45 (RNA polymerase sigma factor sigB OS=Cucurbita moschata OX=3662 GN=LOC111464542 PE=3 SV=1)

HSP 1 Score: 1263 bits (3269), Expect = 0.0
Identity = 674/686 (98.25%), Positives = 676/686 (98.54%), Query Frame = 0

Query: 1   MGTSYPLFYQQSLQKPIYPHKFSSLSLLLLLLLL--HQSPLSFPSSINETLFICFTFPLL 60
           MGTSYPLFYQQSLQKPIYPHKFSSLSLLLLLLLL  HQSPL FPSSINETLFICFTFPLL
Sbjct: 1   MGTSYPLFYQQSLQKPIYPHKFSSLSLLLLLLLLLLHQSPLPFPSSINETLFICFTFPLL 60

Query: 61  GRRPSSILNDSFIYVRHSCRLAEPSWRTKNKSKNNKAMSCLLPQFKCHPETFSIQFKTPA 120
           GRRPSSILND FIYVRHSCRLAEPSWRTKNK+KNNKAMSCLLPQFKCHPETFSIQFKTPA
Sbjct: 61  GRRPSSILNDYFIYVRHSCRLAEPSWRTKNKNKNNKAMSCLLPQFKCHPETFSIQFKTPA 120

Query: 121 ITITPTPTPTSSHHHTFLPTANSTYFKVRDPHGLRTQCILSAASPPTSTGTATTFNVDRL 180
            TITPTPT  SSHHHTFLPTANSTYFK RDPH LRTQCILSAASPPT TGTATTFNVDRL
Sbjct: 121 STITPTPT--SSHHHTFLPTANSTYFKFRDPHSLRTQCILSAASPPTLTGTATTFNVDRL 180

Query: 181 KLPPFDTNTDSVSAERLRSYLGAVESSLASTLLTSEEATIAAAAAEAVALAKAAVKAAND 240
           KLPPFDTNTDSVSAERLRSYLGAVESSLASTLLTSEEATIAAAAAEAVALAKAAVKAAND
Sbjct: 181 KLPPFDTNTDSVSAERLRSYLGAVESSLASTLLTSEEATIAAAAAEAVALAKAAVKAAND 240

Query: 241 AALLVNNSNSARSVTKSQSSSRSNALHFKWDQFMESERADIIGEPVGVNNRPVEGDALEP 300
           AALLVNNSNS+RSVTKSQSSSRSNALHFKWDQFMESERADIIGEPVGVNNRPVEGDALEP
Sbjct: 241 AALLVNNSNSSRSVTKSQSSSRSNALHFKWDQFMESERADIIGEPVGVNNRPVEGDALEP 300

Query: 301 STAESDDMEPTSEELELLEDEFSKSIAVRSIRQTERKARRTRASEKAATSVVSLKSGSSS 360
           STAESDDMEPTSEELELLEDEFSKSIAVRSIRQTERKARRTRASEKAATSVVSLKSGSSS
Sbjct: 301 STAESDDMEPTSEELELLEDEFSKSIAVRSIRQTERKARRTRASEKAATSVVSLKSGSSS 360

Query: 361 RKKRNSGQEVDYNDPLRYLRATTSASKLLTASDELELSEGIQDLLKLERLKEELAGRYGN 420
           RKKRNSGQEVDYNDPLRYLRATTSASKLLTASDELELSEGIQDLLKLERLKEELAGRYGN
Sbjct: 361 RKKRNSGQEVDYNDPLRYLRATTSASKLLTASDELELSEGIQDLLKLERLKEELAGRYGN 420

Query: 421 EPTFAQWAAAAGVNQRTLRKRLNYGTFCKDKMIKSNIRLVISIAKNYQGGGMNLQDLVQE 480
           EPTFAQWAAAAGVNQRTLRKRLNYGTFCKDKMIKSNIRLVISIAKNYQGGGMNLQDLVQE
Sbjct: 421 EPTFAQWAAAAGVNQRTLRKRLNYGTFCKDKMIKSNIRLVISIAKNYQGGGMNLQDLVQE 480

Query: 481 GCRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLSDQSRTIRLPFHMVEATYKVKEAR 540
           GCRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLSDQSRTIRLPFHMVEATYKVKEAR
Sbjct: 481 GCRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLSDQSRTIRLPFHMVEATYKVKEAR 540

Query: 541 KQLYTANGRLPNDEEIAVAAGLSMKRLATVLMTPKAPRSLEQKFGINQNLKPSEVISDPE 600
           KQLYTANGRLPNDEEIAVAAGLSMKRLATVLMTPKAPRSLEQKFGINQNLKPSEVISDPE
Sbjct: 541 KQLYTANGRLPNDEEIAVAAGLSMKRLATVLMTPKAPRSLEQKFGINQNLKPSEVISDPE 600

Query: 601 AVTAEDLLIKQFMKQELEKVLDSLNPRERQIIRWRYGMEDGRMKTLQEIGEMMGVSRERI 660
           AVTAEDLLIKQFMKQELEKVLDSLNPRERQIIRWRYGMEDGRMKTLQEIGEMMGVSRERI
Sbjct: 601 AVTAEDLLIKQFMKQELEKVLDSLNPRERQIIRWRYGMEDGRMKTLQEIGEMMGVSRERI 660

Query: 661 RQIELCAFRKLKNKKRTKHLQQYLMP 684
           RQIELCAFRKLKNKKRTKHLQQYLMP
Sbjct: 661 RQIELCAFRKLKNKKRTKHLQQYLMP 684

BLAST of Cp4.1LG17g02870 vs. ExPASy TrEMBL
Match: A0A6J1KM29 (RNA polymerase sigma factor OS=Cucurbita maxima OX=3661 GN=LOC111494519 PE=3 SV=1)

HSP 1 Score: 1075 bits (2780), Expect = 0.0
Identity = 579/589 (98.30%), Positives = 581/589 (98.64%), Query Frame = 0

Query: 96  MSCLLPQFKCHPETFSIQFKTPAITITPTPTPTSSHHHTFLPTANSTYFKVRDPHGLRTQ 155
           MSCLLPQFKCHPETFSIQFKTPAITITPTPT  SSHHHTFLPTANSTYFKVRDPH LRTQ
Sbjct: 1   MSCLLPQFKCHPETFSIQFKTPAITITPTPT--SSHHHTFLPTANSTYFKVRDPHSLRTQ 60

Query: 156 CILSAASPPTSTGTATTFNVDRLKLPPFDTNTDSVSAERLRSYLGAVESSLASTLLTSEE 215
           CILSAA   TSTGTATTFNVDRLKLPPFDTNTDSVSAERLRSYLGAVESSLASTLLTSEE
Sbjct: 61  CILSAA---TSTGTATTFNVDRLKLPPFDTNTDSVSAERLRSYLGAVESSLASTLLTSEE 120

Query: 216 ATIAAAAAEAVALAKAAVKAANDAALLVNNSNSARSVTKSQSSSRSNALHFKWDQFMESE 275
           ATIAAAAAEAVALAKAAVKAANDAALLVNNSNSARSVTKSQSSS+SNALHFKWDQFMESE
Sbjct: 121 ATIAAAAAEAVALAKAAVKAANDAALLVNNSNSARSVTKSQSSSKSNALHFKWDQFMESE 180

Query: 276 RADIIGEPVGVNNRPVEGDALEPSTAESDDMEPTSEELELLEDEFSKSIAVRSIRQTERK 335
           RADIIGEPVGVNNRPVEGDALEPSTAESDDMEPTSEELELL+DEFSKSIAVRS RQTERK
Sbjct: 181 RADIIGEPVGVNNRPVEGDALEPSTAESDDMEPTSEELELLQDEFSKSIAVRSNRQTERK 240

Query: 336 ARRTRASEKAATSVVSLKSGSSSRKKRNSGQEVDYNDPLRYLRATTSASKLLTASDELEL 395
           ARRTRASEKAATSVVSLKSGSSSRKKRNSGQEVDYNDPLRYLRATTSASKLLTASDELEL
Sbjct: 241 ARRTRASEKAATSVVSLKSGSSSRKKRNSGQEVDYNDPLRYLRATTSASKLLTASDELEL 300

Query: 396 SEGIQDLLKLERLKEELAGRYGNEPTFAQWAAAAGVNQRTLRKRLNYGTFCKDKMIKSNI 455
           SEGIQDLLKLERLKEELAGRYGNEPTFAQWAAAAGVNQRTLRKRLNYGTFCKDKMIKSNI
Sbjct: 301 SEGIQDLLKLERLKEELAGRYGNEPTFAQWAAAAGVNQRTLRKRLNYGTFCKDKMIKSNI 360

Query: 456 RLVISIAKNYQGGGMNLQDLVQEGCRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLS 515
           RLVISIAKNYQGGGMNLQDLVQEGCRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLS
Sbjct: 361 RLVISIAKNYQGGGMNLQDLVQEGCRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLS 420

Query: 516 DQSRTIRLPFHMVEATYKVKEARKQLYTANGRLPNDEEIAVAAGLSMKRLATVLMTPKAP 575
           DQSRTIRLPFHMVEATYKVKEARKQLYTANGRLP DEEIAVAAGLSMKRLATVLMTPKAP
Sbjct: 421 DQSRTIRLPFHMVEATYKVKEARKQLYTANGRLPADEEIAVAAGLSMKRLATVLMTPKAP 480

Query: 576 RSLEQKFGINQNLKPSEVISDPEAVTAEDLLIKQFMKQELEKVLDSLNPRERQIIRWRYG 635
           RSLEQKFGINQNLKPSEVISDPEAVTAEDLLIKQFMKQELEKVLDSLNPRERQIIRWRYG
Sbjct: 481 RSLEQKFGINQNLKPSEVISDPEAVTAEDLLIKQFMKQELEKVLDSLNPRERQIIRWRYG 540

Query: 636 MEDGRMKTLQEIGEMMGVSRERIRQIELCAFRKLKNKKRTKHLQQYLMP 684
           MEDGRMKTLQEIGEMMGVSRERIRQIELCAFRKLKNKKRTKHLQQYLMP
Sbjct: 541 MEDGRMKTLQEIGEMMGVSRERIRQIELCAFRKLKNKKRTKHLQQYLMP 584

BLAST of Cp4.1LG17g02870 vs. ExPASy TrEMBL
Match: A0A6J1DSR7 (RNA polymerase sigma factor OS=Momordica charantia OX=3673 GN=LOC111022820 PE=3 SV=1)

HSP 1 Score: 951 bits (2459), Expect = 0.0
Identity = 513/588 (87.24%), Positives = 544/588 (92.52%), Query Frame = 0

Query: 96  MSCLLPQFKCHPETFSIQFKTPAITITPTPTPTSSHHHTFLPTANSTYFKVRDPHGLRTQ 155
           MSCLLPQFKCHPETFSIQFKT AIT+T T     SHHH FLPTANS+  KVRDPH LRTQ
Sbjct: 1   MSCLLPQFKCHPETFSIQFKTTAITVTTTAN--YSHHHNFLPTANSSCTKVRDPHSLRTQ 60

Query: 156 CILSAASPPTSTGTATTFNVDRLKLPPFDTNTDSVSAERLRSYLGAVESSLASTLLTSEE 215
           CILSAASP TSTGTATTFNV+RLKLPP D NTDS+SAERLRSYL AVESS ASTLLTSEE
Sbjct: 61  CILSAASPSTSTGTATTFNVERLKLPPLDANTDSISAERLRSYLSAVESSFASTLLTSEE 120

Query: 216 ATIAAAAAEAVALAKAAVKAANDAALLVNNSNSARSVTKSQSSSRSNALHFKWDQFMESE 275
           ATIAAAAAEAV LAKAAVK A DAALLVNNSNS+++  KSQSSS+S+ALHFKWDQFMESE
Sbjct: 121 ATIAAAAAEAVTLAKAAVKVAKDAALLVNNSNSSKTGKKSQSSSKSDALHFKWDQFMESE 180

Query: 276 RADIIGEPVGVNNRPVEGDALEPSTAESDDMEPTSEELELLEDEFSKSIAVRSIRQTERK 335
           RADIIG PVGVNNRP EGDAL+  TAESD  +PT+EELELL+DE S +IAVRS RQTER+
Sbjct: 181 RADIIGVPVGVNNRPAEGDALQHDTAESDYTDPTTEELELLQDELS-NIAVRSRRQTERR 240

Query: 336 ARRTRASEKAATSVVSLKSGSSSRKKRNSGQEVDYNDPLRYLRATTSASKLLTASDELEL 395
           A+RTRA+EK ATSVVS+KSGSSSRKKRNS QEVDY+DPLRYLRATTS S+LLTA++ELEL
Sbjct: 241 AKRTRAAEKIATSVVSVKSGSSSRKKRNSVQEVDYSDPLRYLRATTSNSRLLTATEELEL 300

Query: 396 SEGIQDLLKLERLKEELAGRYGNEPTFAQWAAAAGVNQRTLRKRLNYGTFCKDKMIKSNI 455
           SEGIQDLLKLERL+EEL  RYG EPTFAQWAAAAGVNQRTLRKRLNYGTFCKDKMIKSNI
Sbjct: 301 SEGIQDLLKLERLQEELRERYGKEPTFAQWAAAAGVNQRTLRKRLNYGTFCKDKMIKSNI 360

Query: 456 RLVISIAKNYQGGGMNLQDLVQEGCRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLS 515
           RLVISIAKNYQG GMNLQDLVQEGCRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLS
Sbjct: 361 RLVISIAKNYQGAGMNLQDLVQEGCRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLS 420

Query: 516 DQSRTIRLPFHMVEATYKVKEARKQLYTANGRLPNDEEIAVAAGLSMKRLATVLMTPKAP 575
           DQSRTIRLPFHMVEATY+VKEARKQLY+ NGR P+DEEIA AAGLSMKRLA VLMTPKAP
Sbjct: 421 DQSRTIRLPFHMVEATYRVKEARKQLYSENGRHPDDEEIAEAAGLSMKRLAAVLMTPKAP 480

Query: 576 RSLEQKFGINQNLKPSEVISDPEAVTAEDLLIKQFMKQELEKVLDSLNPRERQIIRWRYG 635
           RSLEQK GINQNLKPSEVISDPEA T+EDLLIKQFMKQ+LEKVLDSLNPRERQ+IRWR+G
Sbjct: 481 RSLEQKIGINQNLKPSEVISDPEAETSEDLLIKQFMKQDLEKVLDSLNPRERQVIRWRFG 540

Query: 636 MEDGRMKTLQEIGEMMGVSRERIRQIELCAFRKLKNKKRTKHLQQYLM 683
           MEDGRMKTLQEIGE+MGVSRERIRQIE CAFRKLKNKKRTKHLQQYLM
Sbjct: 541 MEDGRMKTLQEIGEIMGVSRERIRQIESCAFRKLKNKKRTKHLQQYLM 585

BLAST of Cp4.1LG17g02870 vs. ExPASy TrEMBL
Match: A0A1S3CD00 (RNA polymerase sigma factor OS=Cucumis melo OX=3656 GN=LOC103499006 PE=3 SV=1)

HSP 1 Score: 950 bits (2455), Expect = 0.0
Identity = 508/588 (86.39%), Positives = 539/588 (91.67%), Query Frame = 0

Query: 96  MSCLLPQFKCHPETFSIQFKTPAITITPTPTPTSSHHHTFLPTANSTYFKVRDPHGLRTQ 155
           MSCLLPQFKCHPETFSIQFKT A          +SHHH+F+PTA S+Y KVRDPH LRTQ
Sbjct: 1   MSCLLPQFKCHPETFSIQFKTAA---------NNSHHHSFIPTAYSSYTKVRDPHSLRTQ 60

Query: 156 CILSAASPPTSTGTATTFNVDRLKLPPFDTNTDSVSAERLRSYLGAVESSLASTLLTSEE 215
           CILSAASPPTSTGTATT NVDRLKLPPFDTNTDSVS ERLRSYLGAVESSLASTLLTSEE
Sbjct: 61  CILSAASPPTSTGTATTLNVDRLKLPPFDTNTDSVSVERLRSYLGAVESSLASTLLTSEE 120

Query: 216 ATIAAAAAEAVALAKAAVKAANDAALLVNNSNSARSVTKSQSSSRSNALHFKWDQFMESE 275
           ATIAAAAAEAV LAKAAVK A DAALL NN NS+R+ TKSQ S + +ALHFKW QFMESE
Sbjct: 121 ATIAAAAAEAVTLAKAAVKVARDAALLANNINSSRAGTKSQPSPKPDALHFKWAQFMESE 180

Query: 276 RADIIGEPVGVNNRPVEGDALEPSTAESDDMEPTSEELELLEDEFSKSIAVRSIRQTERK 335
           RADIIGEPVGVN RP+EGDALEPST ESDD+EPTS+ELELL+DE S+SI V+S RQTERK
Sbjct: 181 RADIIGEPVGVNKRPIEGDALEPSTTESDDVEPTSKELELLQDELSESITVKSKRQTERK 240

Query: 336 ARRTRASEKAATSVVSLKSGSSSRKKRNSGQEVDYNDPLRYLRATTSASKLLTASDELEL 395
           ARRTRA+EK  TSVV  KSGSSSRKKR+S QEVDY+DPLRYLRATTS S+LLTA++ELEL
Sbjct: 241 ARRTRAAEKTVTSVVPFKSGSSSRKKRSSLQEVDYSDPLRYLRATTSTSRLLTATEELEL 300

Query: 396 SEGIQDLLKLERLKEELAGRYGNEPTFAQWAAAAGVNQRTLRKRLNYGTFCKDKMIKSNI 455
           SEGIQDLLKLERL+EELA RYGNEPTFAQWAAAAGVNQRTLRKRLNYGT CKDKMIKSNI
Sbjct: 301 SEGIQDLLKLERLQEELAERYGNEPTFAQWAAAAGVNQRTLRKRLNYGTLCKDKMIKSNI 360

Query: 456 RLVISIAKNYQGGGMNLQDLVQEGCRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLS 515
           RLVISIAKNYQG GMNLQDLVQEGCRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRK LS
Sbjct: 361 RLVISIAKNYQGAGMNLQDLVQEGCRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKFLS 420

Query: 516 DQSRTIRLPFHMVEATYKVKEARKQLYTANGRLPNDEEIAVAAGLSMKRLATVLMTPKAP 575
           DQSRTIRLPFHMVEATY+VKEARKQL + NGR P+D+EIA AAGLSMKRLA VLMTPKAP
Sbjct: 421 DQSRTIRLPFHMVEATYRVKEARKQLLSENGRHPDDKEIAEAAGLSMKRLAAVLMTPKAP 480

Query: 576 RSLEQKFGINQNLKPSEVISDPEAVTAEDLLIKQFMKQELEKVLDSLNPRERQIIRWRYG 635
           RSLEQK GINQNLKPSEVISDPEA TAED+LIKQFMKQ+LEKVLDSLNPRE+Q+IRWR+G
Sbjct: 481 RSLEQKIGINQNLKPSEVISDPEAETAEDMLIKQFMKQDLEKVLDSLNPREKQVIRWRFG 540

Query: 636 MEDGRMKTLQEIGEMMGVSRERIRQIELCAFRKLKNKKRTKHLQQYLM 683
           MEDGRMKTLQEIGE+MGVSRERIRQIE CAFRKLKNKKRTKHLQQYLM
Sbjct: 541 MEDGRMKTLQEIGEIMGVSRERIRQIESCAFRKLKNKKRTKHLQQYLM 579

BLAST of Cp4.1LG17g02870 vs. ExPASy TrEMBL
Match: A0A0A0K917 (RNA polymerase sigma factor OS=Cucumis sativus OX=3659 GN=Csa_6G078550 PE=3 SV=1)

HSP 1 Score: 947 bits (2447), Expect = 0.0
Identity = 507/588 (86.22%), Positives = 537/588 (91.33%), Query Frame = 0

Query: 96  MSCLLPQFKCHPETFSIQFKTPAITITPTPTPTSSHHHTFLPTANSTYFKVRDPHGLRTQ 155
           MSCLLPQFKCHPETFSIQFKT A           SHHH+FLPTA S+Y KVRDPH LRTQ
Sbjct: 1   MSCLLPQFKCHPETFSIQFKTAA---------NYSHHHSFLPTAYSSYTKVRDPHSLRTQ 60

Query: 156 CILSAASPPTSTGTATTFNVDRLKLPPFDTNTDSVSAERLRSYLGAVESSLASTLLTSEE 215
           CILSAASPPTSTGTATT +VDRLKLPPFDTNTDSVS ERLRSYLGAVESSLASTLLTSEE
Sbjct: 61  CILSAASPPTSTGTATTLDVDRLKLPPFDTNTDSVSVERLRSYLGAVESSLASTLLTSEE 120

Query: 216 ATIAAAAAEAVALAKAAVKAANDAALLVNNSNSARSVTKSQSSSRSNALHFKWDQFMESE 275
           A+IAAAAAEAV LAKAAVK A DAALL NN NS+R+ TKSQSS + +ALHFKW QFMESE
Sbjct: 121 ASIAAAAAEAVTLAKAAVKVARDAALLANNINSSRAGTKSQSSPKPDALHFKWAQFMESE 180

Query: 276 RADIIGEPVGVNNRPVEGDALEPSTAESDDMEPTSEELELLEDEFSKSIAVRSIRQTERK 335
           RADIIGEPVGVN RP+EGDALEPST ESDDMEPTSEELELL+DE S+SI V+S RQTERK
Sbjct: 181 RADIIGEPVGVNKRPMEGDALEPSTTESDDMEPTSEELELLQDELSESITVKSKRQTERK 240

Query: 336 ARRTRASEKAATSVVSLKSGSSSRKKRNSGQEVDYNDPLRYLRATTSASKLLTASDELEL 395
           ARRTRA+EK  TSV+S KSGSSSRKKRNS QEVDY+DPLRYLRATT+ S+LLTA++ELEL
Sbjct: 241 ARRTRAAEKTVTSVLSFKSGSSSRKKRNSVQEVDYSDPLRYLRATTNTSRLLTATEELEL 300

Query: 396 SEGIQDLLKLERLKEELAGRYGNEPTFAQWAAAAGVNQRTLRKRLNYGTFCKDKMIKSNI 455
           SEGIQDLLKLERL+EEL  RYGNEPTFAQWAAAAGVNQRTLRKRLNYGT CKDKMIKSNI
Sbjct: 301 SEGIQDLLKLERLQEELGERYGNEPTFAQWAAAAGVNQRTLRKRLNYGTLCKDKMIKSNI 360

Query: 456 RLVISIAKNYQGGGMNLQDLVQEGCRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLS 515
           RLVISIAKNYQG GMNLQDLVQEGCRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRK LS
Sbjct: 361 RLVISIAKNYQGAGMNLQDLVQEGCRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKFLS 420

Query: 516 DQSRTIRLPFHMVEATYKVKEARKQLYTANGRLPNDEEIAVAAGLSMKRLATVLMTPKAP 575
           DQSRTIRLPFHMVEATY+VKEARKQL   NGR P+D+EIA AAGLSMKRLA VLMTPKAP
Sbjct: 421 DQSRTIRLPFHMVEATYRVKEARKQLLHENGRHPDDKEIAEAAGLSMKRLAAVLMTPKAP 480

Query: 576 RSLEQKFGINQNLKPSEVISDPEAVTAEDLLIKQFMKQELEKVLDSLNPRERQIIRWRYG 635
           RSLEQK GINQNLKPSEVISDPEA T ED+LIKQFMKQ+LEKVLDSLNPRE+Q+IRWR+G
Sbjct: 481 RSLEQKIGINQNLKPSEVISDPEAETCEDMLIKQFMKQDLEKVLDSLNPREKQVIRWRFG 540

Query: 636 MEDGRMKTLQEIGEMMGVSRERIRQIELCAFRKLKNKKRTKHLQQYLM 683
           MEDGRMKTLQEIGE+MGVSRERIRQIE CAFRKLKNKKRTKHLQQY+M
Sbjct: 541 MEDGRMKTLQEIGEIMGVSRERIRQIESCAFRKLKNKKRTKHLQQYVM 579

BLAST of Cp4.1LG17g02870 vs. TAIR 10
Match: AT1G08540.1 (RNA polymerase sigma subunit 2)

HSP 1 Score: 641.3 bits (1653), Expect = 8.6e-184
Identity = 371/600 (61.83%), Positives = 452/600 (75.33%), Query Frame = 0

Query: 97  SCLLPQFKCHPETFSIQFKTPAITITPTPTPTSSHHHTFLPTANSTYFKVRDPHGLRTQC 156
           SCLLPQFKC P++FSI F+T         +  +  H+       S +F        + QC
Sbjct: 3   SCLLPQFKCPPDSFSIHFRT---------SFCAPKHN-----KGSVFF--------QPQC 62

Query: 157 ILSAASPPTSTGTATTFNVDRLKLPPFDTNTDSVSAERLRSYLGA---------VESSLA 216
            +S  SP   T   +  +V +L+LP FDT++DS+ ++R  +Y            +E+  +
Sbjct: 63  AVS-TSPALLT---SMLDVAKLRLPSFDTDSDSLISDRQWTYTRPDGPSTEAKYLEALAS 122

Query: 217 STLLTSEEATIAAAAAEAVALAKAAVKAANDAALLVNNSNSARSVTKSQSSSRSNALHFK 276
            TLLTS+EA + AAAAEAVALA+AAVK A DA L   NSN+   +T S +  RS     K
Sbjct: 123 ETLLTSDEAVVVAAAAEAVALARAAVKVAKDATLF-KNSNNTNLLTSSTADKRS-----K 182

Query: 277 WDQFMESERADIIGEPVGVNNRPVEGDALEPSTAESD---DME-PTSEELELLEDEFSKS 336
           WDQF E ERA I+G  + V++  +  D +  S +  +   D+E    EE+ELLE++ S S
Sbjct: 183 WDQFTEKERAGILGH-LAVSDNGIVSDKITASASNKESIGDLESEKQEEVELLEEQPSVS 242

Query: 337 IAVRSIRQTERKARRTRASEKAATSVVSLKSGSSSRKKRNSGQEVDYNDPLRYLRATTSA 396
           +AVRS RQTERKARR +  EK A+ + S+K+GSS +KKR   QEVD+NDPLRYLR TTS+
Sbjct: 243 LAVRSTRQTERKARRAKGLEKTASGIPSVKTGSSPKKKRLVAQEVDHNDPLRYLRMTTSS 302

Query: 397 SKLLTASDELELSEGIQDLLKLERLKEELAGRYGNEPTFAQWAAAAGVNQRTLRKRLNYG 456
           SKLLT  +E ELS GIQDLLKLERL+ EL  R G +PTFAQWA+AAGV+Q++LR+R+++G
Sbjct: 303 SKLLTVREEHELSAGIQDLLKLERLQTELTERSGRQPTFAQWASAAGVDQKSLRQRIHHG 362

Query: 457 TFCKDKMIKSNIRLVISIAKNYQGGGMNLQDLVQEGCRGLVRGAEKFDASKGFKFSTYAH 516
           T CKDKMIKSNIRLVISIAKNYQG GMNLQDLVQEGCRGLVRGAEKFDA+KGFKFSTYAH
Sbjct: 363 TLCKDKMIKSNIRLVISIAKNYQGAGMNLQDLVQEGCRGLVRGAEKFDATKGFKFSTYAH 422

Query: 517 WWIKQAVRKSLSDQSRTIRLPFHMVEATYKVKEARKQLYTANGRLPNDEEIAVAAGLSMK 576
           WWIKQAVRKSLSDQSR IRLPFHMVEATY+VKEARKQLY+  G+ P +EEIA A GLSMK
Sbjct: 423 WWIKQAVRKSLSDQSRMIRLPFHMVEATYRVKEARKQLYSETGKHPKNEEIAEATGLSMK 482

Query: 577 RLATVLMTPKAPRSLEQKFGINQNLKPSEVISDPEAVTAEDLLIKQFMKQELEKVLDSLN 636
           RL  VL++PK PRSL+QK G+NQNLKPSEVI+DPEAVT+ED+LIK+FM+Q+L+KVLDSL 
Sbjct: 483 RLMAVLLSPKPPRSLDQKIGMNQNLKPSEVIADPEAVTSEDILIKEFMRQDLDKVLDSLG 542

Query: 637 PRERQIIRWRYGMEDGRMKTLQEIGEMMGVSRERIRQIELCAFRKLKNKKRTKHLQQYLM 684
            RE+Q+IRWR+GMEDGRMKTLQEIGEMMGVSRER+RQIE  AFRKLKNKKR  HLQQYL+
Sbjct: 543 TREKQVIRWRFGMEDGRMKTLQEIGEMMGVSRERVRQIESSAFRKLKNKKRNNHLQQYLV 569

BLAST of Cp4.1LG17g02870 vs. TAIR 10
Match: AT2G36990.1 (RNA polymerase sigma-subunit F)

HSP 1 Score: 236.1 bits (601), Expect = 8.3e-62
Identity = 146/391 (37.34%), Positives = 221/391 (56.52%), Query Frame = 0

Query: 291 VEGDALEPSTAESDDMEPTSEELELLEDEFSKSIAVRSIRQTERKARRTRASEKAATSVV 350
           V+     PS    D +  TS  + L E    K   VRS RQ ER+A+  RA +       
Sbjct: 158 VDDTEANPSDNIKDSLS-TSSSMSLPE----KGNIVRSKRQLERRAKNRRAPKSNDVDDE 217

Query: 351 SLKSGSSSRKKRNSGQEVDYNDPLRYLRATTSASKLLTASDELELSEGIQDLLKLERLKE 410
                 +S KK+   Q  D +D L+         +LLTA +E EL   IQ LLKLE++K 
Sbjct: 218 GYVPQKTSAKKKYK-QGADNDDALQLFLWGPETKQLLTAKEEAELISHIQHLLKLEKVKT 277

Query: 411 ELAGRYGNEPTFAQWAAAAGVNQRTLRKRLNYGTFCKDKMIKSNIRLVISIAKNYQGGGM 470
           +L  + G EPT  +WA A G++   L+  ++ G   ++K+I +N+RLV+ IAK YQ  G+
Sbjct: 278 KLESQNGCEPTIGEWAEAMGISSPVLKSDIHRGRSSREKLITANLRLVVHIAKQYQNRGL 337

Query: 471 NLQDLVQEGCRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLSDQSRTIRLPFHMVEA 530
           N QDL+QEG  GL++  EKF    G +F+TYA+WWI+Q++RKS+   SRTIRLP ++   
Sbjct: 338 NFQDLLQEGSMGLMKSVEKFKPQSGCRFATYAYWWIRQSIRKSIFQNSRTIRLPENVYML 397

Query: 531 TYKVKEARKQLYTANGRLPNDEEIAVAAGLSMKRLATVLMTPKAPRSLEQKFGINQNLKP 590
             KV EARK         P+ EE+A   G+S ++L  +L   + P S++Q    +Q+   
Sbjct: 398 LGKVSEARKTCVQEGNYRPSKEELAGHVGVSTEKLDKLLYNTRTPLSMQQPIWSDQDTTF 457

Query: 591 SEVISDPEAVTAEDLLIKQFMKQELEKVLDSLNPRERQIIRWRYGMEDGRMKTLQEIGEM 650
            E+  D    T    + KQ M+  +  +L+ L+P+ER+II+ R+G++ G+ ++L EIGE+
Sbjct: 458 QEITPDSGIETPTMSVGKQLMRNHVRNLLNVLSPKERRIIKLRFGIDGGKQRSLSEIGEI 517

Query: 651 MGVSRERIRQIELCAFRKLKNKKRTKHLQQY 682
            G+S+ER+RQ+E  A  +LK    +  L  Y
Sbjct: 518 YGLSKERVRQLESRALYRLKQNMNSHGLHAY 542

BLAST of Cp4.1LG17g02870 vs. TAIR 10
Match: AT3G53920.1 (RNA polymerase sigma-subunit C)

HSP 1 Score: 201.4 bits (511), Expect = 2.3e-51
Identity = 122/380 (32.11%), Positives = 225/380 (59.21%), Query Frame = 0

Query: 310 SEELELLEDEFSKSIAVRSIRQTERKARRTRASEKAATSVVSLKSGSSSRKK---RNSGQ 369
           S+  EL++   ++ + V S R+ ++KARR         S V+ ++G  S      R +  
Sbjct: 204 SDTTELVDTTPNQQVFVSSRRKVKKKARR---------SSVTAENGDQSSLPIGLRTTWN 263

Query: 370 EVD---YNDPLRYLRATTSASKLLTASDELELSEGIQDLLKLERLKEELAGRYGNEPTFA 429
            +D      P +Y +     S+     +E E+S G++ +  +ER++ +L    G   + +
Sbjct: 264 NIDVPRVRRPPKYRKKRERISR-----NETEMSTGVKIVADMERIRTQLEEESGKVASLS 323

Query: 430 QWAAAAGVNQRTLRKRLNYGTFCKDKMIKSNIRLVISIAKNYQGGGMNLQDLVQEGCRGL 489
            WAAAAG+N++ L + L+YG +C+D+++KS   LV+ +A+NY+G G+  +DL+Q G  G+
Sbjct: 324 CWAAAAGMNEKLLMRNLHYGWYCRDELVKSTRSLVLFLARNYRGLGIAHEDLIQAGYVGV 383

Query: 490 VRGAEKFDASKGFKFSTYAHWWIKQAVRKSLSDQSRTIRLPFHMVEATYKVKEARKQLYT 549
           ++GAE+FD ++G+KFSTY  +WI++++   +S  +R + +P  ++     +++ARK L T
Sbjct: 384 LQGAERFDHTRGYKFSTYVQYWIRKSMSTMVSRHARGVHIPSSIIRTINHIQKARKTLKT 443

Query: 550 ANG-RLPNDEEIAVAAGLSMKRLATVLMTPKAPRSLEQKFGINQNLKPSEVISDPEAVTA 609
           ++G +   DEEIA   G S+K++       K   S+++K G     K  E   D    + 
Sbjct: 444 SHGIKYAADEEIAKLTGHSVKKIRAANQCLKVVGSIDKKVGDCFTTKFLEFTPDTTMESP 503

Query: 610 EDLLIKQFMKQELEKVLDSLNPRERQIIRWRYGMEDGRMKTLQEIGEMMGVSRERIRQIE 669
           E+ +++Q  ++++  +L+ L PRE+Q++  RYG++D R K+L+EIG+++ VS+E IR+IE
Sbjct: 504 EEAVMRQSARRDIHDLLEGLEPREKQVMVLRYGLQDYRPKSLEEIGKLLKVSKEWIRKIE 563

Query: 670 LCAFRKLKNKKRTKHLQQYL 683
             A  KL+++   + L+ YL
Sbjct: 564 RRAMAKLRDQPNAEDLRYYL 569

BLAST of Cp4.1LG17g02870 vs. TAIR 10
Match: AT5G13730.1 (sigma factor 4)

HSP 1 Score: 177.9 bits (450), Expect = 2.7e-44
Identity = 105/300 (35.00%), Positives = 172/300 (57.33%), Query Frame = 0

Query: 382 SASKLLTASDELELSEGIQDLLKLERLKEELAGRYGNEPTFAQWAAAAGVNQRTLRKRLN 441
           S S  L+  +E++L   +++  KLE L   +     NE      A+  G  +R+  + L 
Sbjct: 118 SRSGFLSRLEEVQLCLYLKEGAKLENLGTSVE---ENEMVSVLLASGRGKKKRSANEILC 177

Query: 442 YGTFCKDKMIKSNIRLVISIAKNYQGGGMNLQDLVQEGCRGLVRGAEKFDASKGFKFSTY 501
                ++K+ +   RLV+SIA  YQG G+NLQDL+QEG  GL+RGAE+FD  +G+K STY
Sbjct: 178 RRKEAREKITRCYRRLVVSIATGYQGKGLNLQDLIQEGSIGLLRGAERFDPDRGYKLSTY 237

Query: 502 AHWWIKQAVRKSLSDQSRTIRLPFHMVEATYKVKEARKQLYTANGRLPNDEEIAVAAGLS 561
            +WWIKQA+ ++++ +SR ++LP  M E T KV EA   L     R P+ EEIA    L+
Sbjct: 238 VYWWIKQAILRAIAHKSRLVKLPGSMWELTAKVAEASNVLTRKLRRQPSCEEIAEHLNLN 297

Query: 562 MKRLATVLMTPKAPRSLEQKFGINQNLKPSEVISDPEAVTAEDLLIKQFMKQELEKVLDS 621
           +  +   +   ++P SL++    N  +   E++  P+    E+++ ++ MK E+E++L S
Sbjct: 298 VSAVRLAVERSRSPVSLDRVASQNGRMTLQEIVRGPDETRPEEMVKREHMKHEIEQLLGS 357

Query: 622 LNPRERQIIRWRYGMEDGRMKTLQEIGEMMGVSRERIRQIELCAFRKLKNKKRTKHLQQY 681
           L  RE +++   +G+      + +EIG+ + +SRER+RQI   A +KL+N      L+ Y
Sbjct: 358 LTARESRVLGLYFGLNGETPMSFEEIGKSLKLSRERVRQINGIALKKLRNVHNVNDLKIY 414

BLAST of Cp4.1LG17g02870 vs. TAIR 10
Match: AT1G64860.1 (sigma factor A)

HSP 1 Score: 158.3 bits (399), Expect = 2.2e-38
Identity = 118/358 (32.96%), Positives = 194/358 (54.19%), Query Frame = 0

Query: 336 ARRTRASEKAATSVVSLKSGSSSRKKRNSGQEVDYNDPLRYLRATTSASKLLTASDELEL 395
           AR+ R   K  T++  +K+ S      +SG++V       Y++   S   +L+  + + L
Sbjct: 159 ARQRRIGAKKKTNMTHVKAVSDV----SSGKQV-----RGYVKGVIS-EDVLSHVEVVRL 218

Query: 396 SEGIQDLLKLERLKEELAGRYGNEPTFAQWAAAAGVNQRTLRKRLNYGTFCKDKMIKSNI 455
           S+ I+  L+L+  K  L  R G EP+  Q A +  +++  L+  L      ++K+  SN+
Sbjct: 219 SKKIKSGLRLDDHKSRLKDRLGCEPSDEQLAVSLKISRAELQAWLMECHLAREKLAMSNV 278

Query: 456 RLVISIAKNYQGGGMNLQDLVQEGCRGLVRGAEKFDASKGFKFSTYAHWWIKQAVRKSLS 515
           RLV+SIA+ Y   G  + DLVQ G  GL+RG EKFD+SKGF+ STY +WWI+Q V ++L 
Sbjct: 279 RLVMSIAQRYDNLGAEMSDLVQGGLIGLLRGIEKFDSSKGFRISTYVYWWIRQGVSRALV 338

Query: 516 DQSRTIRLPFHMVEATYKVKEARKQLYTANGRLPNDEEIAVAAGLSMKRLATVLMTPKAP 575
           D SRT+RLP H+ E    ++ A+ +L    G  P+ + IA +  +S K++          
Sbjct: 339 DNSRTLRLPTHLHERLGLIRNAKLRL-QEKGITPSIDRIAESLNMSQKKVRNATEAVSKV 398

Query: 576 RSLEQKFGINQNLKPSEVISDPEAVTA---------EDLLIKQFMKQELEKVLD-SLNPR 635
            SL++    + N  P E      A T          +DL     +K+E+ K++  +L  R
Sbjct: 399 FSLDRDAFPSLNGLPGETHHSYIADTRLENNPWHGYDDLA----LKEEVSKLISATLGER 458

Query: 636 ERQIIRWRYGMEDGRMKTLQEIGEMMGVSRERIRQIELCAFRKLKNKKRTKHLQQYLM 684
           E++IIR  YG+ D    T ++I + +G+SRER+RQ+ L A  KLK+  R + ++  ++
Sbjct: 459 EKEIIRLYYGL-DKECLTWEDISKRIGLSRERVRQVGLVALEKLKHAARKRKMEAMIL 500

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
O22056	1.2e-182	61.83	RNA polymerase sigma factor sigB OS=Arabidopsis thaliana OX=3702 GN=SIGB PE=2 SV...
P26683	4.6e-73	43.67	RNA polymerase sigma factor SigA OS=Nostoc sp. (strain PCC 7120 / SAG 25.82 / UT...
P38023	1.5e-68	39.90	RNA polymerase sigma factor SigA1 OS=Synechococcus elongatus (strain PCC 7942 / ...
P52322	1.7e-64	37.98	RNA polymerase sigma factor SigA OS=Microcystis aeruginosa OX=1126 GN=sigA PE=3 ...
Q9LD95	1.2e-60	37.34	RNA polymerase sigma factor sigF, chloroplastic OS=Arabidopsis thaliana OX=3702 ...
Match Name	E-value	Identity	Description
XP_023515004.1	0.0	100.00	RNA polymerase sigma factor sigB [Cucurbita pepo subsp. pepo]
KAG6593746.1	0.0	99.42	RNA polymerase sigma factor sigB, partial [Cucurbita argyrosperma subsp. sororia...
XP_022964550.1	0.0	98.25	RNA polymerase sigma factor sigB [Cucurbita moschata]
XP_023000238.1	0.0	98.30	RNA polymerase sigma factor sigB [Cucurbita maxima]
XP_038875580.1	0.0	88.27	RNA polymerase sigma factor sigB isoform X1 [Benincasa hispida]
Match Name	E-value	Identity	Description
A0A6J1HL45	0.0	98.25	RNA polymerase sigma factor sigB OS=Cucurbita moschata OX=3662 GN=LOC111464542 P...
A0A6J1KM29	0.0	98.30	RNA polymerase sigma factor OS=Cucurbita maxima OX=3661 GN=LOC111494519 PE=3 SV=...
A0A6J1DSR7	0.0	87.24	RNA polymerase sigma factor OS=Momordica charantia OX=3673 GN=LOC111022820 PE=3 ...
A0A1S3CD00	0.0	86.39	RNA polymerase sigma factor OS=Cucumis melo OX=3656 GN=LOC103499006 PE=3 SV=1
A0A0A0K917	0.0	86.22	RNA polymerase sigma factor OS=Cucumis sativus OX=3659 GN=Csa_6G078550 PE=3 SV=1
Match Name	E-value	Identity	Description
AT1G08540.1	8.6e-184	61.83	RNA polymerase sigma subunit 2
AT2G36990.1	8.3e-62	37.34	RNA polymerase sigma-subunit F
AT3G53920.1	2.3e-51	32.11	RNA polymerase sigma-subunit C
AT5G13730.1	2.7e-44	35.00	sigma factor 4
AT1G64860.1	2.2e-38	32.96	sigma factor A
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR000943	RNA polymerase sigma-70	PRINTS	PR00046	SIGMA70FCT	coord: 474..487, score: 52.4; coord: 622..634, score: 42.68; coord: 498..506, score: 82.44; coord: 643..658, score: 54.84; coord: 658..669, score: 58.33
IPR000943	RNA polymerase sigma-70	PROSITE	PS00716	SIGMA70_2	coord: 643..669
IPR007624	RNA polymerase sigma-70 region 3	PFAM	PF04539	Sigma70_r3	coord: 529..604, e-value: 2.2E-14, score: 53.3
None	No IPR available	GENE3D	1.10.601.10	RNA Polymerase Primary Sigma Factor	coord: 205..522, e-value: 5.0E-38, score: 133.5
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 344..359
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 281..311
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 331..369
None	No IPR available	PANTHER	PTHR30603	RNA POLYMERASE SIGMA FACTOR RPO	coord: 143..683
None	No IPR available	PANTHER	PTHR30603:SF57	RNA POLYMERASE SIGMA FACTOR SIGB	coord: 143..683
None	No IPR available	CDD	cd06171	Sigma70_r4	coord: 612..670, e-value: 3.94185E-14, score: 65.2034
IPR007630	RNA polymerase sigma-70 region 4	PFAM	PF04545	Sigma70_r4	coord: 618..670, e-value: 1.1E-19, score: 69.7
IPR014284	RNA polymerase sigma-70 like domain	TIGRFAM	TIGR02937	TIGR02937	coord: 448..672, e-value: 9.4E-33, score: 111.1
IPR036388	Winged helix-like DNA-binding domain superfamily	GENE3D	1.10.10.10		coord: 606..684, e-value: 1.0E-27, score: 97.6
IPR036388	Winged helix-like DNA-binding domain superfamily	GENE3D	1.10.10.10		coord: 523..599, e-value: 7.2E-12, score: 47.5
IPR007627	RNA polymerase sigma-70 region 2	PFAM	PF04542	Sigma70_r2	coord: 450..519, e-value: 3.5E-18, score: 65.1
IPR013324	RNA polymerase sigma factor, region 3/4-like	SUPERFAMILY	88659	Sigma3 and sigma4 domains of RNA polymerase sigma factors	coord: 518..569
IPR013324	RNA polymerase sigma factor, region 3/4-like	SUPERFAMILY	88659	Sigma3 and sigma4 domains of RNA polymerase sigma factors	coord: 582..681
IPR013325	RNA polymerase sigma factor, region 2	SUPERFAMILY	88946	Sigma2 domain of RNA polymerase sigma factors	coord: 371..520
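
The `coord:` values give the start and end of each hit on the 684-residue protein. A minimal Python sketch for turning them into span lengths, assuming the coordinates are 1-based and inclusive (the usual convention for protein domain annotations, but not stated on this page); `parse_coord` and `span_length` are hypothetical helper names:

```python
import re

# Matches the "coord: start..end" notation used in the InterPro table.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Extract (start, end) as integers from a 'coord: a..b' string."""
    m = COORD.search(text)
    if m is None:
        raise ValueError("no coordinate found in %r" % text)
    return int(m.group(1)), int(m.group(2))


def span_length(start, end):
    """Residue count of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Pfam hits copied from the InterPro annotations above.
pfam_hits = {
    "Sigma70_r2": "coord: 450..519",
    "Sigma70_r3": "coord: 529..604",
    "Sigma70_r4": "coord: 618..670",
}

lengths = {
    name: span_length(*parse_coord(coord)) for name, coord in pfam_hits.items()
}

# Sort by start position to confirm that regions 2, 3 and 4 follow one another.
order = sorted(pfam_hits, key=lambda name: parse_coord(pfam_hits[name])[0])
```

Under that assumption the three Pfam regions span 70, 76 and 53 residues and sit in the order region 2, region 3, region 4 in the C-terminal part of the protein.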

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG17g02870.1	Cp4.1LG17g02870.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0071482	cellular response to light stimulus
biological_process	GO:2000142	regulation of DNA-templated transcription, initiation
biological_process	GO:0006352	DNA-templated transcription, initiation
biological_process	GO:0006355	regulation of transcription, DNA-templated
cellular_component	GO:0009507	chloroplast
molecular_function	GO:0003677	DNA binding
molecular_function	GO:0016987	sigma factor activity
molecular_function	GO:0003700	DNA-binding transcription factor activity