ClCG04G007350 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG04G007350
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: transcription termination/antitermination protein NusG
Location: CG_Chr04: 22336010 .. 22339005 (-)
RNA-Seq Expression: ClCG04G007350
Synteny: ClCG04G007350
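The Location field packs the chromosome, the coordinate range, and the strand into one string. As a minimal sketch (the function names `parse_location` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the page does not state), the field can be split and the genomic span computed like this:

```python
import re

def parse_location(text):
    """Parse 'SEQID: START .. END (STRAND)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

seqid, start, end, strand = parse_location("CG_Chr04: 22336010 .. 22339005 (-)")
length = span_length(start, end)
print(seqid, strand, length)
```

Under that assumption the gene spans 2996 bp. The `(-)` strand suggests the sequences listed for this feature are reported in the gene's own orientation rather than that of the chromosome, but the page does not say so explicitly.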
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CTTCATCAACCAAACGACAAATGGGAGTTGCAATAACGACCACCCATCGGCGTTTATCGTCACCGGTGACTCCACAGCCGAGCCAGTAATCGGAAAATGGCCTGTGGGCTTCTGACTTGGAGCCCAATTTCTCTTCGCTCAACTTCTTTCCGTGCCCTTTCCTTCCCTCTCTCTTCTTCCAAACGTACCCAATTATCAATCTCCGCTGCCGTCGAAATCCCCTCCGCCGCCGACGAAGTTCAGCAGCTGTCGGCCCGGGAGAGGAGGAAGCTGAGGAATGAGAGGAGAGAGATTAAAACCACTACCAATTGGAGGGAAGAAGTAGAGGAGAGGCTCTGCAAGAAGCCCAAGAAGGAATTTGCCACTTGGACTGAGAAGCTCAACCTTGATTACCTCGCTAAATTGGGTCCTCAATGGTGGGTTTTGCGTGTGGCTCGTGTTAGAGGCCAAGAAATTGTTGAACGCCTCGCTCGTTCTCTTGCTAGGAACTACCCCGACCTAGATTTCAAGGTAAAATTTCTTCATTCCACTCTTTGATTTCTAGGTTTGTTGCACAATTTGATTTCATGTGTTGTATTGATTTGGCATCAATTTTCTTAAAGCAAATCGAATGGATTCTGTTGTCTTGGTTTTGAAATTCATTAACATAACTCTAAATATGAAAACCTACTGAGTTCCATTGTTTGGGAGCGCAAGGGAATGATTAAAGGTTTCTGTTACCATATATATTTCATTAGCTTGATTACCTTACAGTGTAGTGCTTAGATTGCAGGAGAAAGGATTTAAAACAACAGTAATAGATAAAAAGAATGAGGTTTTCTTTTCAATTCTATATTAGATATCAAAGAAACAAAGGCTATCTAAAAAAGAGTGAAGAAATGGGATTCGTGTTGTTAGTTGCCACAAATTGTTAGAGACGATAATGTGTATCAACATCTCAAGTTCTTCAATTGGATTGCAGATATATTACCCGTCTGTTCAGGAGAAGAGGAAATTAAAGAATGGTACTTACACCGTTAAACCGAAAGCTGTTTTTCCTGGATCTGTATTTATTAGGTGTATCATGAACAAGGAGATCCATGACTTCATTAGAGAGTGTGATGGTGTTGGAGGCTTTGTTGGTGCGAAGGTTGGAAACACGTGAGTCGTTATTTCGTTCCTTCTTTGTAAAAAAAAGTTAAGCAATTGTTTGTTTTGTTTGAAACTCTGGACAGGGAATGAGAGAACTGCATTGGCCCTTTGGGATTATGCAATTTGTATGTCTTGTAGTCAATAGATTTTGGAGTCTAGGAGGTTGTGCCATAGAACTGATTCTAAACTAAATCTGAATTTTAAAGCTTTGGATTCAAAATTCAGATTTTCAAAGTACTATAGACGATGGCTTTCATCCAAAAAGTTAGATGTAAATAGCTTTATATTTCTTTACATCAATTCTGTGTTCTCAGTCAGTGTTGTTCTGATAGTTTGGGCAATATACAGTAAACGACAGATAAATAAACCAAAGCCGGTATCCGAAGCTGACATGGAAGCAATCTTCAAAGAGGCAAAGGAAGAGCAAGTAAGACATGACCAGGCTTTCCTAGAGAAAGAGCAAGAGCAAGCTTCAAACTCTGGCGCGCTCGAGACTGACTTAGATACAAATGGTACTACTGCTATAAAGCACAAAGGAAGACCGAAAAAAGCTGTTAATACTTTGTCTCCAGGGTCAACTGTTCGGGTGGCGTCTGGGACTTTTGCTGAATTTGAAGGCTCTCTTAAGAAGCTGAACCGTAAAAGTGGAAAGGTGTGCTGTTTTCTGTTAGATGAAAAGTTGGGTACAAAGGAGAAAAATTAGAAGAAAGTTGTGTGCTAGAGACCTGACGGATTGTTAGAAAGCATTTGAATTTTATTTTATCTACTAACCATAATTGATTTCCTTCATGCAGGTAACTGTAGGATTCACACTATTTGGGAAGGAAACCCTCGTAGACCTTGACATTGGTGATATTATTGTAGAGACA
AAATGAGTGAATGTACTTTCTCAGCCATGGTAAGTTTTTTAAATCACGAGATACCATTTATTTGAGGTTTCTTTCGTCTACTTTAATGTAAAATTTAGTGCCTTGGTATATTCAGTCAAAACCCACATAAATTCATAATCATCATAAGAACTTGCAAGGATCTTGTTTGGTCCTTCAGATCATTATTCAACTCACTGTTAACAATCTTCAATAATAAGCCTTCATTTGTAATATTGAAAATTAAGATAGCCTACAATGGCTAATGAAGCTCTCCAAATCACAATAGATTCAGATTTAGTAATCATGTCTGCAAGATTTTGGGGTTGTTTTAAGATAGAAGCGAAAGACGCCATGGCTTTGTTACTTCATCACTCTTCCGTTTGAAGCCAAATGTGGTGGGAAATGTTTAAGGACTGCCGCTATTATATTAGAGCGTTTCGAAATAGTTGTTTCTGTTTCTTTAGTTAACACCAATCGGAGCATCTTTGATAGCTGAACATTTTTCACTCACTCTTCTCTGTGATTGTTTTGCAGTGAAGTGTGGATGAAGCTCATGGATTTGGAGGATAAGCAAGCAGCTCCAGCTCCAGCTCTCAGCTCTCAACTCATAAGCACGTTCACGAACCAATGAACACAAAAGTACTTTTGCTTGAATTGCCAAACAGCTTACTTTGATGAACATGTCCACAAATCAGCTGTGCTTTTTTGAAATTACTCTGCATTTGGAGATTCTTATTGGAAATCAACCAAGGTCTTGAAGGAACTGGCTGAAGCCCAAATGTATTATTGTTAGACTATTCAGAAACCAAAATAGGGAATATCATGATGAACCATAGATAGCAATTGTAAATTCAAATGTTTCTGAAGTCAGAAATTTTAATCATGATTGTCTTAATTCAGGAAAATATTCTATAAACTAGCTATGTTCTAAAGTATTTCCAAAATTGTGTTACCATTAAAATCATAGTAATTTTTGAGCTCCTATTCATTGTGGAC

mRNA sequence

CTTCATCAACCAAACGACAAATGGGAGTTGCAATAACGACCACCCATCGGCGTTTATCGTCACCGGTGACTCCACAGCCGAGCCAGTAATCGGAAAATGGCCTGTGGGCTTCTGACTTGGAGCCCAATTTCTCTTCGCTCAACTTCTTTCCGTGCCCTTTCCTTCCCTCTCTCTTCTTCCAAACGTACCCAATTATCAATCTCCGCTGCCGTCGAAATCCCCTCCGCCGCCGACGAAGTTCAGCAGCTGTCGGCCCGGGAGAGGAGGAAGCTGAGGAATGAGAGGAGAGAGATTAAAACCACTACCAATTGGAGGGAAGAAGTAGAGGAGAGGCTCTGCAAGAAGCCCAAGAAGGAATTTGCCACTTGGACTGAGAAGCTCAACCTTGATTACCTCGCTAAATTGGGTCCTCAATGGTGGGTTTTGCGTGTGGCTCGTGTTAGAGGCCAAGAAATTGTTGAACGCCTCGCTCGTTCTCTTGCTAGGAACTACCCCGACCTAGATTTCAAGATATATTACCCGTCTGTTCAGGAGAAGAGGAAATTAAAGAATGGTACTTACACCGTTAAACCGAAAGCTGTTTTTCCTGGATCTGTATTTATTAGGTGTATCATGAACAAGGAGATCCATGACTTCATTAGAGAGTGTGATGGTGTTGGAGGCTTTGTTGGGAATGAGAGAACTGCATTGGCCCTTTGGGATTATGCAATTTTCAGTGTTGTTCTGATAGTTTGGGCAATATACAGTAAACGACAGATAAATAAACCAAAGCCGGTATCCGAAGCTGACATGGAAGCAATCTTCAAAGAGGCAAAGGAAGAGCAAGTAAGACATGACCAGGCTTTCCTAGAGAAAGAGCAAGAGCAAGCTTCAAACTCTGGCGCGCTCGAGACTGACTTAGATACAAATGGTACTACTGCTATAAAGCACAAAGGAAGACCGAAAAAAGCTGTTAATACTTTGTCTCCAGGGTCAACTGTTCGGGTGGCGTCTGGGACTTTTGCTGAATTTGAAGGCTCTCTTAAGAAGCTGAACCGTAAAAGTGGAAAGGTAACTGTAGGATTCACACTATTTGGGAAGGAAACCCTCGTAGACCTTGACATTGGTGATATTATTGTAGAGACAAAATGAGTGAATGTACTTTCTCAGCCATGTGAAGTGTGGATGAAGCTCATGGATTTGGAGGATAAGCAAGCAGCTCCAGCTCCAGCTCTCAGCTCTCAACTCATAAGCACGTTCACGAACCAATGAACACAAAAGTACTTTTGCTTGAATTGCCAAACAGCTTACTTTGATGAACATGTCCACAAATCAGCTGTGCTTTTTTGAAATTACTCTGCATTTGGAGATTCTTATTGGAAATCAACCAAGGTCTTGAAGGAACTGGCTGAAGCCCAAATGTATTATTGTTAGACTATTCAGAAACCAAAATAGGGAATATCATGATGAACCATAGATAGCAATTGTAAATTCAAATGTTTCTGAAGTCAGAAATTTTAATCATGATTGTCTTAATTCAGGAAAATATTCTATAAACTAGCTATGTTCTAAAGTATTTCCAAAATTGTGTTACCATTAAAATCATAGTAATTTTTGAGCTCCTATTCATTGTGGAC

Coding sequence (CDS)

ATGGCCTGTGGGCTTCTGACTTGGAGCCCAATTTCTCTTCGCTCAACTTCTTTCCGTGCCCTTTCCTTCCCTCTCTCTTCTTCCAAACGTACCCAATTATCAATCTCCGCTGCCGTCGAAATCCCCTCCGCCGCCGACGAAGTTCAGCAGCTGTCGGCCCGGGAGAGGAGGAAGCTGAGGAATGAGAGGAGAGAGATTAAAACCACTACCAATTGGAGGGAAGAAGTAGAGGAGAGGCTCTGCAAGAAGCCCAAGAAGGAATTTGCCACTTGGACTGAGAAGCTCAACCTTGATTACCTCGCTAAATTGGGTCCTCAATGGTGGGTTTTGCGTGTGGCTCGTGTTAGAGGCCAAGAAATTGTTGAACGCCTCGCTCGTTCTCTTGCTAGGAACTACCCCGACCTAGATTTCAAGATATATTACCCGTCTGTTCAGGAGAAGAGGAAATTAAAGAATGGTACTTACACCGTTAAACCGAAAGCTGTTTTTCCTGGATCTGTATTTATTAGGTGTATCATGAACAAGGAGATCCATGACTTCATTAGAGAGTGTGATGGTGTTGGAGGCTTTGTTGGGAATGAGAGAACTGCATTGGCCCTTTGGGATTATGCAATTTTCAGTGTTGTTCTGATAGTTTGGGCAATATACAGTAAACGACAGATAAATAAACCAAAGCCGGTATCCGAAGCTGACATGGAAGCAATCTTCAAAGAGGCAAAGGAAGAGCAAGTAAGACATGACCAGGCTTTCCTAGAGAAAGAGCAAGAGCAAGCTTCAAACTCTGGCGCGCTCGAGACTGACTTAGATACAAATGGTACTACTGCTATAAAGCACAAAGGAAGACCGAAAAAAGCTGTTAATACTTTGTCTCCAGGGTCAACTGTTCGGGTGGCGTCTGGGACTTTTGCTGAATTTGAAGGCTCTCTTAAGAAGCTGAACCGTAAAAGTGGAAAGGTAACTGTAGGATTCACACTATTTGGGAAGGAAACCCTCGTAGACCTTGACATTGGTGATATTATTGTAGAGACAAAATGA

Protein sequence

MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAKLGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
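The protein sequence above is the conceptual translation of the CDS. The following sketch shows how that relationship can be checked. It assumes the standard genetic code (NCBI translation table 1), and the helper name `translate` is hypothetical. Only the first ten codons and the last four codons of the CDS are used, to keep the example short.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

first_codons = "ATGGCCTGTGGGCTTCTGACTTGGAGCCCA"  # start of the CDS
last_codons = "GAGACAAAATGA"                     # end of the CDS, including TGA

print(translate(first_codons))  # MACGLLTWSP
print(translate(last_codons))   # ETK
```

Both results agree with the ends of the listed protein, which starts with MACGLLTWSP and finishes with ETK.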
Homology
BLAST of ClCG04G007350 vs. NCBI nr
Match: XP_038880828.1 (transcription termination/antitermination protein NusG isoform X1 [Benincasa hispida])

HSP 1 Score: 569.7 bits (1467), Expect = 1.7e-158
Identity = 300/344 (87.21%), Positives = 309/344 (89.83%), Query Frame = 0

Query: 1   MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLR 60
           MACGLL WSPISLRS SF ALSF LSS KRTQLSISA VE PSAAD++QQLSARERRKLR
Sbjct: 1   MACGLLIWSPISLRSISFPALSFSLSSPKRTQLSISATVETPSAADDIQQLSARERRKLR 60

Query: 61  NERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAKLGPQWWVLRVARVRGQEI 120
           NERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYL+KLGPQWWV+RVARVRGQEI
Sbjct: 61  NERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLSKLGPQWWVMRVARVRGQEI 120

Query: 121 VERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDF 180
           VERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKA+FPGSVFIRCIMNKEIHDF
Sbjct: 121 VERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKALFPGSVFIRCIMNKEIHDF 180

Query: 181 IRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAK 240
           IRECDGVGGFVG +                      +KRQINKPKPVSEADMEAIFKEAK
Sbjct: 181 IRECDGVGGFVGAK-------------------VGNTKRQINKPKPVSEADMEAIFKEAK 240

Query: 241 EEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASG 300
           EEQ RHDQAFLEKEQ++A NS ALETDLDTNGTTA K KGRPKKAVNTLSPGSTVRVASG
Sbjct: 241 EEQERHDQAFLEKEQDRAPNSSALETDLDTNGTTATKQKGRPKKAVNTLSPGSTVRVASG 300

Query: 301 TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK 345
           TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
Sbjct: 301 TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK 325
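Each HSP statistics line reports counts as matches over alignment length, followed by the percentage. As a rough sketch (the function name `parse_hsp_stats` is hypothetical and not part of any BLAST tool), the counts can be pulled out of such a line and the percentages recomputed:

```python
import re

def parse_hsp_stats(line):
    """Return {label: (matches, aligned_length, percent)} for each 'X = a/b' field."""
    stats = {}
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        matches, length = int(num), int(den)
        stats[label] = (matches, length, round(100.0 * matches / length, 2))
    return stats

line = "Identity = 300/344 (87.21%), Positives = 309/344 (89.83%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(stats["Identity"])   # (300, 344, 87.21)
print(stats["Positives"])  # (309, 344, 89.83)
```

The denominator is the alignment length including gap columns, so it can differ from the query length of 345 residues.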

BLAST of ClCG04G007350 vs. NCBI nr
Match: XP_008448915.1 (PREDICTED: transcription termination/antitermination protein NusG [Cucumis melo] >XP_008448924.1 PREDICTED: transcription termination/antitermination protein NusG [Cucumis melo])

HSP 1 Score: 555.8 bits (1431), Expect = 2.6e-154
Identity = 296/345 (85.80%), Positives = 309/345 (89.57%), Query Frame = 0

Query: 1   MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIP-SAADEVQQLSARERRKL 60
           MA GLL WSPISL STS  ALSF LSSS+RTQLSISA+VE P +AAD+VQQLSAR+RRKL
Sbjct: 1   MAYGLLIWSPISLCSTSLPALSFSLSSSRRTQLSISASVETPAAAADDVQQLSARDRRKL 60

Query: 61  RNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAKLGPQWWVLRVARVRGQE 120
           RNERREIKTTTNWREEVEERLC+KPKKEFATWTEKLNLDYLAKLGPQWWV+RVARVRGQE
Sbjct: 61  RNERREIKTTTNWREEVEERLCRKPKKEFATWTEKLNLDYLAKLGPQWWVMRVARVRGQE 120

Query: 121 IVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHD 180
           IVERLARSLARNYPDLDFKIYYPSV+EKRKLKNGTYTVKPKAVFPGSVFIRC+MNKEIHD
Sbjct: 121 IVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVKPKAVFPGSVFIRCVMNKEIHD 180

Query: 181 FIRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEA 240
           FIRECDGVGGFVG +                      +KRQINKPKPVSEADMEAIFKEA
Sbjct: 181 FIRECDGVGGFVGAK-------------------VGNTKRQINKPKPVSEADMEAIFKEA 240

Query: 241 KEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVAS 300
           KEEQ RHDQAFLEKEQE+A NS AL+TDLDTNGTTA KHKGR KKAVNTLSPGSTVRVAS
Sbjct: 241 KEEQERHDQAFLEKEQEEAPNSSALKTDLDTNGTTATKHKGRGKKAVNTLSPGSTVRVAS 300

Query: 301 GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK 345
           GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
Sbjct: 301 GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK 326

BLAST of ClCG04G007350 vs. NCBI nr
Match: XP_004147896.1 (uncharacterized protein LOC101211195 [Cucumis sativus] >KGN54344.1 hypothetical protein Csa_018004 [Cucumis sativus])

HSP 1 Score: 549.7 bits (1415), Expect = 1.8e-152
Identity = 289/345 (83.77%), Positives = 305/345 (88.41%), Query Frame = 0

Query: 1   MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIP-SAADEVQQLSARERRKL 60
           MACGLL WS +SL STSF ALSF LSSS+RTQLS+SA+VE P +AAD+ QQLS RERRKL
Sbjct: 1   MACGLLVWSSVSLCSTSFPALSFSLSSSRRTQLSVSASVETPAAAADDAQQLSVRERRKL 60

Query: 61  RNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAKLGPQWWVLRVARVRGQE 120
           RNERREIKTTTNWREEVEERLC+KPKKEFA WTEKLNLDYLAKLGPQWWV+RVARVR QE
Sbjct: 61  RNERREIKTTTNWREEVEERLCRKPKKEFANWTEKLNLDYLAKLGPQWWVMRVARVRSQE 120

Query: 121 IVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHD 180
           IVERLAR LARNYPDLDFKIYYPSV+EKRKLKNGTYTV PKAVFPGSVFIRC+MNKEIHD
Sbjct: 121 IVERLARCLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHD 180

Query: 181 FIRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEA 240
           FIRECDGVGGFVG +                      +KRQINKPKPVSEADMEAIFKEA
Sbjct: 181 FIRECDGVGGFVGAK-------------------VGNTKRQINKPKPVSEADMEAIFKEA 240

Query: 241 KEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVAS 300
           K+EQ RHDQAFLEKEQE+A N+ AL+TDLDTNGTTA KHKGRPKKAVNTLSPGSTVRVAS
Sbjct: 241 KDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVAS 300

Query: 301 GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK 345
           GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
Sbjct: 301 GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK 326

BLAST of ClCG04G007350 vs. NCBI nr
Match: XP_022151589.1 (uncharacterized protein LOC111019492 [Momordica charantia])

HSP 1 Score: 532.3 bits (1370), Expect = 3.0e-147
Identity = 287/343 (83.67%), Positives = 301/343 (87.76%), Query Frame = 0

Query: 1   MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLR 60
           MA GLL WS ISLRS+SF ALSF LSSSK TQLSISAA+E  +AAD+VQQLSARERR+LR
Sbjct: 1   MAFGLLPWSSISLRSSSFPALSFSLSSSKCTQLSISAALE-TAAADDVQQLSARERRRLR 60

Query: 61  NERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAKLGPQWWVLRVARVRGQEI 120
           NERREIKTTTNWREEVEERLCKKPKKEFA+WTEKLNLDYLAKLGPQWWV+RVARVRGQEI
Sbjct: 61  NERREIKTTTNWREEVEERLCKKPKKEFASWTEKLNLDYLAKLGPQWWVMRVARVRGQEI 120

Query: 121 VERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDF 180
           VERLARSLARNYPDLDFKIYYPSVQEKR+LKNGTY VKP+AVFPGSVFIRCIMNKEIHDF
Sbjct: 121 VERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYMVKPRAVFPGSVFIRCIMNKEIHDF 180

Query: 181 IRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAK 240
           IRECDGVGGFVG +                      +KRQINKPKPVSEADMEAIFKEAK
Sbjct: 181 IRECDGVGGFVGAK-------------------VGNTKRQINKPKPVSEADMEAIFKEAK 240

Query: 241 EEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASG 300
           EEQ RHDQ FLEKEQE+A NS   +TDLDTNGTTA K KGR KKAVN LSPGSTVRVASG
Sbjct: 241 EEQERHDQDFLEKEQEKAPNSTIHKTDLDTNGTTATKPKGRLKKAVNALSPGSTVRVASG 300

Query: 301 TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVET 344
           TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI+VET
Sbjct: 301 TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIVVET 323

BLAST of ClCG04G007350 vs. NCBI nr
Match: XP_022931956.1 (uncharacterized protein LOC111438223 [Cucurbita moschata])

HSP 1 Score: 528.1 bits (1359), Expect = 5.7e-146
Identity = 282/344 (81.98%), Positives = 300/344 (87.21%), Query Frame = 0

Query: 1   MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLR 60
           MACGLLTW+ + LRS SF +LSF LSSS RTQLSISAA+E  +AAD+V QLSARERR+LR
Sbjct: 1   MACGLLTWTTLPLRSPSFPSLSFSLSSSLRTQLSISAALE--TAADDVLQLSARERRRLR 60

Query: 61  NERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAKLGPQWWVLRVARVRGQEI 120
           NERRE K TTNWREEVEERLCKKPKKEFA WTEKLNLDYLAKLGPQWWV+RV+RVRGQEI
Sbjct: 61  NERRETK-TTNWREEVEERLCKKPKKEFANWTEKLNLDYLAKLGPQWWVMRVSRVRGQEI 120

Query: 121 VERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDF 180
           VERLARSLARNYPDLDFKIYYPSV EKRKLKNG+YTVKPKAVFPGSVFIRCIMNKE+HDF
Sbjct: 121 VERLARSLARNYPDLDFKIYYPSVHEKRKLKNGSYTVKPKAVFPGSVFIRCIMNKEMHDF 180

Query: 181 IRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAK 240
           IRECDGVGGFVG +                      +KRQINKPKPVS+ DMEAIFKEAK
Sbjct: 181 IRECDGVGGFVGAK-------------------VGNTKRQINKPKPVSQDDMEAIFKEAK 240

Query: 241 EEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASG 300
           EEQ RHDQAFLEKE+EQA N   LETDLDTNGTTA KHKGRPKKAVNTLSPGSTVRV+SG
Sbjct: 241 EEQERHDQAFLEKEKEQAPNPSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVSSG 300

Query: 301 TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK 345
           TFAEFEGSLKK+NRKS KVTVGFTLFGKETLV+LDIGDIIVETK
Sbjct: 301 TFAEFEGSLKKVNRKSKKVTVGFTLFGKETLVELDIGDIIVETK 322

BLAST of ClCG04G007350 vs. ExPASy Swiss-Prot
Match: Q06795 (Transcription termination/antitermination protein NusG OS=Bacillus subtilis (strain 168) OX=224308 GN=nusG PE=1 SV=1)

HSP 1 Score: 58.5 bits (140), Expect = 1.7e-07
Identity = 51/206 (24.76%), Positives = 78/206 (37.86%), Query Frame = 0

Query: 134 DLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGN 193
           D  F++  P  +E+  +KNG   V  K VFPG V +  +M  +    +R   GV GFVG+
Sbjct: 33  DKIFRVVVPE-EEETDIKNGKKKVVKKKVFPGYVLVEIVMTDDSWYVVRNTPGVTGFVGS 92

Query: 194 ERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEK 253
             +                         +KP P+   + E I K    ++ + D  F  K
Sbjct: 93  AGSG------------------------SKPTPLLPGEAETILKRMGMDERKTDIDFELK 152

Query: 254 EQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLN 313
           E                                       TV+V  G FA F GS+++++
Sbjct: 153 E---------------------------------------TVKVIDGPFANFTGSIEEID 174

Query: 314 RKSGKVTVGFTLFGKETLVDLDIGDI 340
               KV V   +FG+ET V+L+   I
Sbjct: 213 YDKSKVKVFVNMFGRETPVELEFTQI 174

BLAST of ClCG04G007350 vs. ExPASy Swiss-Prot
Match: P29397 (Transcription termination/antitermination protein NusG OS=Thermotoga maritima (strain ATCC 43589 / DSM 3109 / JCM 10099 / NBRC 100826 / MSB8) OX=243274 GN=nusG PE=1 SV=1)

HSP 1 Score: 56.2 bits (134), Expect = 8.3e-07
Identity = 43/185 (23.24%), Positives = 71/185 (38.38%), Query Frame = 0

Query: 155 YTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWDYAIFSVVLIVWA 214
           Y  K + +FPG VF+  IMN E ++F+R    V GFV +                     
Sbjct: 224 YKTKRRKLFPGYVFVEMIMNDEAYNFVRSVPYVMGFVSSG-------------------- 283

Query: 215 IYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTT 274
                   +P PV + +M  I + A                                G  
Sbjct: 284 -------GQPVPVKDREMRPILRLA--------------------------------GLE 343

Query: 275 AIKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDL 334
             + K +P K       G  V++ SG F +F G +K+++ +  ++ V  T+FG+ET V L
Sbjct: 344 EYEEKKKPVKVELGFKVGDMVKIISGPFEDFAGVIKEIDPERQELKVNVTIFGRETPVVL 349

Query: 335 DIGDI 340
            + ++
Sbjct: 404 HVSEV 349

BLAST of ClCG04G007350 vs. ExPASy Swiss-Prot
Match: Q9HWC4 (Transcription termination/antitermination protein NusG OS=Pseudomonas aeruginosa (strain ATCC 15692 / DSM 22644 / CIP 104116 / JCM 14847 / LMG 12228 / 1C / PRS 101 / PAO1) OX=208964 GN=nusG PE=3 SV=1)

HSP 1 Score: 55.5 bits (132), Expect = 1.4e-06
Identity = 45/237 (18.99%), Positives = 91/237 (38.40%), Query Frame = 0

Query: 103 LGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAV 162
           +  +W+V+       + ++  L   +     + +F       +E  +++NG      +  
Sbjct: 1   MAKRWYVVHAYSGYEKHVMRSLIERVKLAGMEEEFGEILVPTEEVVEMRNGQKRKSERKF 60

Query: 163 FPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQIN 222
           FPG V ++  MN+     +++   V GF+G                             +
Sbjct: 61  FPGYVLVQMEMNEGTWHLVKDTPRVMGFIGG--------------------------TAD 120

Query: 223 KPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRP 282
           KP P+++ + +AI +                   + ++SG                K +P
Sbjct: 121 KPAPITDREADAILR-------------------RVADSG---------------DKPKP 174

Query: 283 KKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI 340
           K       PG TVRV  G FA+F G ++++N +  ++ V   +FG+ T V+L+   +
Sbjct: 181 K---TLFEPGETVRVIDGPFADFNGVVEEVNYEKSRIQVAVLIFGRSTPVELEFSQV 174

BLAST of ClCG04G007350 vs. ExPASy Swiss-Prot
Match: P65591 (Transcription termination/antitermination protein NusG OS=Neisseria meningitidis serogroup A / serotype 4A (strain DSM 15465 / Z2491) OX=122587 GN=nusG PE=3 SV=1)

HSP 1 Score: 54.3 bits (129), Expect = 3.2e-06
Identity = 48/237 (20.25%), Positives = 88/237 (37.13%), Query Frame = 0

Query: 103 LGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAV 162
           +  +W+V++      + +   L   +AR      F      V++   ++NG  T+  +  
Sbjct: 1   MSKKWYVVQAYSGFEKNVQRILEERIAREEMGDYFGQILVPVEKVVDIRNGRKTISERKS 60

Query: 163 FPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQIN 222
           +PG V +   M  +    ++    V GF+G                           + N
Sbjct: 61  YPGYVLVEMEMTDDSWHLVKSTPRVSGFIGG--------------------------RAN 120

Query: 223 KPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRP 282
           +P P+S+ + E I ++         Q  +EK                            P
Sbjct: 121 RPTPISQREAEIILQQV--------QTGIEK----------------------------P 174

Query: 283 KKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI 340
           K  V     G  VRV  G FA+F G ++++N +  K+ V   +FG+ET V+L+   +
Sbjct: 181 KPKVE-FEVGQQVRVNEGPFADFNGVVEEVNYERNKLRVSVQIFGRETPVELEFSQV 174

BLAST of ClCG04G007350 vs. ExPASy Swiss-Prot
Match: P65592 (Transcription termination/antitermination protein NusG OS=Neisseria meningitidis serogroup B (strain MC58) OX=122586 GN=nusG PE=3 SV=1)

HSP 1 Score: 54.3 bits (129), Expect = 3.2e-06
Identity = 48/237 (20.25%), Positives = 88/237 (37.13%), Query Frame = 0

Query: 103 LGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAV 162
           +  +W+V++      + +   L   +AR      F      V++   ++NG  T+  +  
Sbjct: 1   MSKKWYVVQAYSGFEKNVQRILEERIAREEMGDYFGQILVPVEKVVDIRNGRKTISERKS 60

Query: 163 FPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQIN 222
           +PG V +   M  +    ++    V GF+G                           + N
Sbjct: 61  YPGYVLVEMEMTDDSWHLVKSTPRVSGFIGG--------------------------RAN 120

Query: 223 KPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRP 282
           +P P+S+ + E I ++         Q  +EK                            P
Sbjct: 121 RPTPISQREAEIILQQV--------QTGIEK----------------------------P 174

Query: 283 KKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI 340
           K  V     G  VRV  G FA+F G ++++N +  K+ V   +FG+ET V+L+   +
Sbjct: 181 KPKVE-FEVGQQVRVNEGPFADFNGVVEEVNYERNKLRVSVQIFGRETPVELEFSQV 174

BLAST of ClCG04G007350 vs. ExPASy TrEMBL
Match: A0A1S3BKU2 (transcription termination/antitermination protein NusG OS=Cucumis melo OX=3656 GN=LOC103490936 PE=4 SV=1)

HSP 1 Score: 555.8 bits (1431), Expect = 1.2e-154
Identity = 296/345 (85.80%), Positives = 309/345 (89.57%), Query Frame = 0

Query: 1   MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIP-SAADEVQQLSARERRKL 60
           MA GLL WSPISL STS  ALSF LSSS+RTQLSISA+VE P +AAD+VQQLSAR+RRKL
Sbjct: 1   MAYGLLIWSPISLCSTSLPALSFSLSSSRRTQLSISASVETPAAAADDVQQLSARDRRKL 60

Query: 61  RNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAKLGPQWWVLRVARVRGQE 120
           RNERREIKTTTNWREEVEERLC+KPKKEFATWTEKLNLDYLAKLGPQWWV+RVARVRGQE
Sbjct: 61  RNERREIKTTTNWREEVEERLCRKPKKEFATWTEKLNLDYLAKLGPQWWVMRVARVRGQE 120

Query: 121 IVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHD 180
           IVERLARSLARNYPDLDFKIYYPSV+EKRKLKNGTYTVKPKAVFPGSVFIRC+MNKEIHD
Sbjct: 121 IVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVKPKAVFPGSVFIRCVMNKEIHD 180

Query: 181 FIRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEA 240
           FIRECDGVGGFVG +                      +KRQINKPKPVSEADMEAIFKEA
Sbjct: 181 FIRECDGVGGFVGAK-------------------VGNTKRQINKPKPVSEADMEAIFKEA 240

Query: 241 KEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVAS 300
           KEEQ RHDQAFLEKEQE+A NS AL+TDLDTNGTTA KHKGR KKAVNTLSPGSTVRVAS
Sbjct: 241 KEEQERHDQAFLEKEQEEAPNSSALKTDLDTNGTTATKHKGRGKKAVNTLSPGSTVRVAS 300

Query: 301 GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK 345
           GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
Sbjct: 301 GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK 326

BLAST of ClCG04G007350 vs. ExPASy TrEMBL
Match: A0A0A0KZV1 (NGN domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G307370 PE=4 SV=1)

HSP 1 Score: 549.7 bits (1415), Expect = 8.9e-153
Identity = 289/345 (83.77%), Positives = 305/345 (88.41%), Query Frame = 0

Query: 1   MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIP-SAADEVQQLSARERRKL 60
           MACGLL WS +SL STSF ALSF LSSS+RTQLS+SA+VE P +AAD+ QQLS RERRKL
Sbjct: 1   MACGLLVWSSVSLCSTSFPALSFSLSSSRRTQLSVSASVETPAAAADDAQQLSVRERRKL 60

Query: 61  RNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAKLGPQWWVLRVARVRGQE 120
           RNERREIKTTTNWREEVEERLC+KPKKEFA WTEKLNLDYLAKLGPQWWV+RVARVR QE
Sbjct: 61  RNERREIKTTTNWREEVEERLCRKPKKEFANWTEKLNLDYLAKLGPQWWVMRVARVRSQE 120

Query: 121 IVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHD 180
           IVERLAR LARNYPDLDFKIYYPSV+EKRKLKNGTYTV PKAVFPGSVFIRC+MNKEIHD
Sbjct: 121 IVERLARCLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHD 180

Query: 181 FIRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEA 240
           FIRECDGVGGFVG +                      +KRQINKPKPVSEADMEAIFKEA
Sbjct: 181 FIRECDGVGGFVGAK-------------------VGNTKRQINKPKPVSEADMEAIFKEA 240

Query: 241 KEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVAS 300
           K+EQ RHDQAFLEKEQE+A N+ AL+TDLDTNGTTA KHKGRPKKAVNTLSPGSTVRVAS
Sbjct: 241 KDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVAS 300

Query: 301 GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK 345
           GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
Sbjct: 301 GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK 326

BLAST of ClCG04G007350 vs. ExPASy TrEMBL
Match: A0A6J1DDH4 (uncharacterized protein LOC111019492 OS=Momordica charantia OX=3673 GN=LOC111019492 PE=4 SV=1)

HSP 1 Score: 532.3 bits (1370), Expect = 1.5e-147
Identity = 287/343 (83.67%), Positives = 301/343 (87.76%), Query Frame = 0

Query: 1   MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLR 60
           MA GLL WS ISLRS+SF ALSF LSSSK TQLSISAA+E  +AAD+VQQLSARERR+LR
Sbjct: 1   MAFGLLPWSSISLRSSSFPALSFSLSSSKCTQLSISAALE-TAAADDVQQLSARERRRLR 60

Query: 61  NERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAKLGPQWWVLRVARVRGQEI 120
           NERREIKTTTNWREEVEERLCKKPKKEFA+WTEKLNLDYLAKLGPQWWV+RVARVRGQEI
Sbjct: 61  NERREIKTTTNWREEVEERLCKKPKKEFASWTEKLNLDYLAKLGPQWWVMRVARVRGQEI 120

Query: 121 VERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDF 180
           VERLARSLARNYPDLDFKIYYPSVQEKR+LKNGTY VKP+AVFPGSVFIRCIMNKEIHDF
Sbjct: 121 VERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYMVKPRAVFPGSVFIRCIMNKEIHDF 180

Query: 181 IRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAK 240
           IRECDGVGGFVG +                      +KRQINKPKPVSEADMEAIFKEAK
Sbjct: 181 IRECDGVGGFVGAK-------------------VGNTKRQINKPKPVSEADMEAIFKEAK 240

Query: 241 EEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASG 300
           EEQ RHDQ FLEKEQE+A NS   +TDLDTNGTTA K KGR KKAVN LSPGSTVRVASG
Sbjct: 241 EEQERHDQDFLEKEQEKAPNSTIHKTDLDTNGTTATKPKGRLKKAVNALSPGSTVRVASG 300

Query: 301 TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVET 344
           TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI+VET
Sbjct: 301 TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIVVET 323

BLAST of ClCG04G007350 vs. ExPASy TrEMBL
Match: A0A6J1EV13 (uncharacterized protein LOC111438223 OS=Cucurbita moschata OX=3662 GN=LOC111438223 PE=4 SV=1)

HSP 1 Score: 528.1 bits (1359), Expect = 2.8e-146
Identity = 282/344 (81.98%), Positives = 300/344 (87.21%), Query Frame = 0

Query: 1   MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLR 60
           MACGLLTW+ + LRS SF +LSF LSSS RTQLSISAA+E  +AAD+V QLSARERR+LR
Sbjct: 1   MACGLLTWTTLPLRSPSFPSLSFSLSSSLRTQLSISAALE--TAADDVLQLSARERRRLR 60

Query: 61  NERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAKLGPQWWVLRVARVRGQEI 120
           NERRE K TTNWREEVEERLCKKPKKEFA WTEKLNLDYLAKLGPQWWV+RV+RVRGQEI
Sbjct: 61  NERRETK-TTNWREEVEERLCKKPKKEFANWTEKLNLDYLAKLGPQWWVMRVSRVRGQEI 120

Query: 121 VERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDF 180
           VERLARSLARNYPDLDFKIYYPSV EKRKLKNG+YTVKPKAVFPGSVFIRCIMNKE+HDF
Sbjct: 121 VERLARSLARNYPDLDFKIYYPSVHEKRKLKNGSYTVKPKAVFPGSVFIRCIMNKEMHDF 180

Query: 181 IRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAK 240
           IRECDGVGGFVG +                      +KRQINKPKPVS+ DMEAIFKEAK
Sbjct: 181 IRECDGVGGFVGAK-------------------VGNTKRQINKPKPVSQDDMEAIFKEAK 240

Query: 241 EEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASG 300
           EEQ RHDQAFLEKE+EQA N   LETDLDTNGTTA KHKGRPKKAVNTLSPGSTVRV+SG
Sbjct: 241 EEQERHDQAFLEKEKEQAPNPSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVSSG 300

Query: 301 TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK 345
           TFAEFEGSLKK+NRKS KVTVGFTLFGKETLV+LDIGDIIVETK
Sbjct: 301 TFAEFEGSLKKVNRKSKKVTVGFTLFGKETLVELDIGDIIVETK 322

BLAST of ClCG04G007350 vs. ExPASy TrEMBL
Match: A0A6J1HQX6 (uncharacterized protein LOC111465889 OS=Cucurbita maxima OX=3661 GN=LOC111465889 PE=4 SV=1)

HSP 1 Score: 524.2 bits (1349), Expect = 4.0e-145
Identity = 281/344 (81.69%), Positives = 299/344 (86.92%), Query Frame = 0

Query: 1   MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLR 60
           MAC LLTW+ +SLRS SF +LSF LSSS RTQLSISAA+E  +AAD+V QLSARERR+LR
Sbjct: 1   MACRLLTWTTLSLRSPSFPSLSFSLSSSLRTQLSISAALE--TAADDVLQLSARERRRLR 60

Query: 61  NERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAKLGPQWWVLRVARVRGQEI 120
           NERRE K TTNWREEVEERLCKKPKKEFA WTEKLNLDYL+KLGPQWWV+RV+RVRGQEI
Sbjct: 61  NERRETK-TTNWREEVEERLCKKPKKEFANWTEKLNLDYLSKLGPQWWVMRVSRVRGQEI 120

Query: 121 VERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDF 180
           VERLARSLARNYPDLDFKIYYPSV EKRKLKNG+YTVKPKAVFPGSVFIRCIMNKE+HDF
Sbjct: 121 VERLARSLARNYPDLDFKIYYPSVHEKRKLKNGSYTVKPKAVFPGSVFIRCIMNKEMHDF 180

Query: 181 IRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAK 240
           IRECDGVGGFVG +                      +KRQINKPKPVS+ DMEAIFKEAK
Sbjct: 181 IRECDGVGGFVGAK-------------------VGNTKRQINKPKPVSQDDMEAIFKEAK 240

Query: 241 EEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASG 300
           EEQ RHDQAFLEK++EQA N   LET LDTNGTTA KHKGRPKKAVNTLSPGSTVRVASG
Sbjct: 241 EEQERHDQAFLEKQKEQAPNPSVLETGLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASG 300

Query: 301 TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK 345
           TFAEFEGSLKK+NRKSGKVTVGFTLFGKETLV LDIGDIIVETK
Sbjct: 301 TFAEFEGSLKKVNRKSGKVTVGFTLFGKETLVVLDIGDIIVETK 322

BLAST of ClCG04G007350 vs. TAIR 10
Match: AT3G09210.1 (plastid transcriptionally active 13 )

HSP 1 Score: 314.7 bits (805), Expect = 9.2e-86
Identity = 184/359 (51.25%), Positives = 241/359 (67.13%), Query Frame = 0

Query: 4   GLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNER 63
           GLL WS    RS+   ++  P++   +TQ SI+A V      +   QL+A+ERR+LRNER
Sbjct: 7   GLLQWS----RSSLIPSIYTPIN---KTQFSIAACV-----IERNHQLTAKERRQLRNER 66

Query: 64  REIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAKLGPQWWVLRVARVRGQEIVER 123
           RE K   +WREEVEE+L KKPKK +ATWTE+LNLD LA+ GPQWW +RV+R+RG E  + 
Sbjct: 67  RESKLGYSWREEVEEKLIKKPKKRYATWTEELNLDTLAESGPQWWAVRVSRLRGHETAQI 126

Query: 124 LARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRE 183
           LAR+LAR +P+++F +Y PSVQ KRKLKNG+ +VKPK VFPG +FIRCI+NKEIHD IR+
Sbjct: 127 LARALARQFPEMEFTVYAPSVQVKRKLKNGSISVKPKPVFPGCIFIRCILNKEIHDSIRD 186

Query: 184 CDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQ 243
            DGVGGF+G++                      +KRQINKP+PV ++D+EAIFK+AKE Q
Sbjct: 187 VDGVGGFIGSK-------------------VGNTKRQINKPRPVDDSDLEAIFKQAKEAQ 246

Query: 244 VRHDQAFLE--KEQEQA-----------SNSGALETDLDT-------NGTTAIKHKGRPK 303
            + D  F E  + +E+A           SNS  +ET  ++         T A + K + K
Sbjct: 247 EKADSEFEEADRAEEEASILASQELLALSNSDVIETVAESKPKRAPRKATLATETKAKKK 306

Query: 304 KAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVE 343
           K    L+ GSTVRV SGTFAEF G+LKKLNRK+ K TVGFTLFGKETLV++DI +++ E
Sbjct: 307 K----LAAGSTVRVLSGTFAEFVGNLKKLNRKTAKATVGFTLFGKETLVEIDINELVPE 330

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038880828.1 | 1.7e-158 | 87.21 | transcription termination/antitermination protein NusG isoform X1 [Benincasa his...
XP_008448915.1 | 2.6e-154 | 85.80 | PREDICTED: transcription termination/antitermination protein NusG [Cucumis melo]...
XP_004147896.1 | 1.8e-152 | 83.77 | uncharacterized protein LOC101211195 [Cucumis sativus] >KGN54344.1 hypothetical ...
XP_022151589.1 | 3.0e-147 | 83.67 | uncharacterized protein LOC111019492 [Momordica charantia]
XP_022931956.1 | 5.7e-146 | 81.98 | uncharacterized protein LOC111438223 [Cucurbita moschata]

Match Name | E-value | Identity (%) | Description
Q06795 | 1.7e-07 | 24.76 | Transcription termination/antitermination protein NusG OS=Bacillus subtilis (str...
P29397 | 8.3e-07 | 23.24 | Transcription termination/antitermination protein NusG OS=Thermotoga maritima (s...
Q9HWC4 | 1.4e-06 | 18.99 | Transcription termination/antitermination protein NusG OS=Pseudomonas aeruginosa...
P65591 | 3.2e-06 | 20.25 | Transcription termination/antitermination protein NusG OS=Neisseria meningitidis...
P65592 | 3.2e-06 | 20.25 | Transcription termination/antitermination protein NusG OS=Neisseria meningitidis...

Match Name | E-value | Identity (%) | Description
A0A1S3BKU2 | 1.2e-154 | 85.80 | transcription termination/antitermination protein NusG OS=Cucumis melo OX=3656 G...
A0A0A0KZV1 | 8.9e-153 | 83.77 | NGN domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G307370 PE=4 SV...
A0A6J1DDH4 | 1.5e-147 | 83.67 | uncharacterized protein LOC111019492 OS=Momordica charantia OX=3673 GN=LOC111019...
A0A6J1EV13 | 2.8e-146 | 81.98 | uncharacterized protein LOC111438223 OS=Cucurbita moschata OX=3662 GN=LOC1114382...
A0A6J1HQX6 | 4.0e-145 | 81.69 | uncharacterized protein LOC111465889 OS=Cucurbita maxima OX=3661 GN=LOC111465889...

Match Name | E-value | Identity (%) | Description
AT3G09210.1 | 9.2e-86 | 51.25 | plastid transcriptionally active 13
InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 45..65
None | No IPR available | PANTHER | PTHR30265:SF4 | TRANSCRIPTION ANTITERMINATION PROTEIN RFAH | coord: 34..341
None | No IPR available | CDD | cd09890 | NGN_plant | coord: 106..236; e-value: 1.20667E-43; score: 144.418
None | No IPR available | CDD | cd06091 | KOW_NusG | coord: 289..339; e-value: 1.01084E-12; score: 60.1636
IPR006645 | NusG, N-terminal | SMART | SM00738 | nusgn_4 | coord: 104..239; e-value: 3.0E-17; score: 73.3
IPR006645 | NusG, N-terminal | PFAM | PF02357 | NusG | coord: 106..197; e-value: 4.5E-9; score: 36.7
IPR036735 | NusG, N-terminal domain superfamily | GENE3D | 3.30.70.940 | - | coord: 105..240; e-value: 2.4E-16; score: 61.7
IPR036735 | NusG, N-terminal domain superfamily | SUPERFAMILY | 82679 | N-utilization substance G protein NusG, N-terminal domain | coord: 105..206
IPR014722 | Ribosomal protein L2, domain 2 | GENE3D | 2.30.30.30 | - | coord: 287..340; e-value: 3.5E-12; score: 48.1
IPR043425 | NusG-like | PANTHER | PTHR30265 | RHO-INTERACTING TRANSCRIPTION TERMINATION FACTOR NUSG | coord: 34..341
IPR008991 | Translation protein SH3-like domain superfamily | SUPERFAMILY | 50104 | Translation proteins SH3-like domain | coord: 289..339
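The `coord` values in the Alignment column are residue ranges on the protein. As a small sketch, assuming those ranges are 1-based and inclusive (an assumption, since the page does not state the convention), a feature can be cut out of the sequence as follows. The helper `slice_feature` is hypothetical, and only the first 70 residues of the protein are included here for brevity.

```python
def slice_feature(protein, start, end):
    """Return residues start..end, treating coordinates as 1-based and inclusive."""
    if not (1 <= start <= end <= len(protein)):
        raise ValueError("coordinates fall outside the sequence")
    return protein[start - 1:end]

# First 70 residues of the ClCG04G007350 protein sequence.
n_terminal_70 = (
    "MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLR"
    "NERREIKTTT"
)

# COILS prediction listed with coord: 45..65
coil = slice_feature(n_terminal_70, 45, 65)
print(coil, len(coil))
```

Under this convention the predicted coil covers 21 residues, ADEVQQLSARERRKLRNERRE, a stretch rich in charged residues.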

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
ClCG04G007350.2 | ClCG04G007350.2 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006355 | regulation of transcription, DNA-templated