CmUC04G074640 (gene) Watermelon (USVL531) v1

Overview
Name: CmUC04G074640
Type: gene
Organism: Citrullus mucosospermus (Watermelon (USVL531) v1)
Description: transcription termination/antitermination protein NusG
Location: CmU531Chr04: 21199925 .. 21202887 (-)
RNA-Seq Expression: CmUC04G074640
Synteny: CmUC04G074640
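
The Location field gives a chromosome, a start and end coordinate, and a strand. The sketch below parses that string and computes the genomic span. It assumes the coordinates are 1-based and inclusive, which is a common convention but is not stated on this page. The function names `parse_location` and `span_length` are illustrative, not part of any database API.

```python
import re

def parse_location(text):
    """Split 'chrom: start .. end (strand)' into (chrom, start, end, strand)."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("CmU531Chr04: 21199925 .. 21202887 (-)")
print(chrom, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 2963 bp on the minus strand.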
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CTTCATCAACCAAACGACAAATGGGAGTTGCAATAACGACCACCCATCGGCGTTTATCGTCACCGGTGACTCCACAGCCGAGCCAGTAATCGGAAAATGGCCTGTGGGCTTCTGACTTGGAGCCCAATTTCTCTTCGCTCAACTTCTTTCCGTGCCCTTTCCTTCCCTCTCTCTTCTTCCAAACGTACCCAATTATCAATCTCCGCTGCCGTCGAAATCCCCTCCGCCGCCGACGAAGTTCAGCAGCTGTCGGCCCGGGAGAGGAGGAAGCTGAGGAATGAGAGGAGAGAGATTAAAACCACTACCAATTGGAGGGAAGAAGTAGAGGAGAGGCTCTGCAAGAAGCCCAAGAAGGAATTTGCCACTTGGACTGAGAAGCTCAACCTTGATTACCTCGCTAAATTGGGTCCTCAATGGTGGGTTTTGCGTGTGGCTCGTGTTAGAGGCCAAGAAATTGTTGAACGCCTCGCTCGTTCTCTTGCTAGGAACTACCCCGACCTAGATTTCAAGGTAAAATTTCTTCATTCCACTCTTTGATTTCTAGGTTTGTTGCACAATTTGATTTCATGTGTTGTATTGATTTGGCATCAATTTTCTTAAAGCAAATCGAATGGATTCTGTTGTCTTGGTTTTGAAATTCATTAACATAACTCTAAATATGAAAACCTACTGAGTTCCATTGTTTGGGAGCGCAAGGGAATGATTAAAGGTTTCTGTTACCATATATATTTCATTAGCTTGATTACCTTACAGTGTAGTGCTTAGATTGCAGGAGAAAGGATTTAAAACAACAGTAATAGATAAAAAGAATGAGGTTTTCTTTTCAATTCTATATTAGATATCAAAGAAACAAAGGCTATCTAAAAAAGAGTGAAGAAATGGGATTCGTGTTGTTAGTTGCCACAAATTGTTAGAGACGATAATGTGTATCAACATCTCAAGTTCTTCAATTGGATTGCAGATATATTACCCGTCTGTTCAGGAGAAGAGGAAATTAAAGAATGGTACTTACACCGTTAAACCGAAAGCTGTTTTTCCTGGATCTGTATTTATTAGGTGTATCATGAACAAGGAGATCCATGACTTCATTAGAGAGTGTGATGGTGTTGGAGGCTTTGTTGGTGCGAAGGTTGGAAACACGTGAGTCGTTATTTCGTTCCTTCTTTGTAAAAAAAAGTTAAGCAATTGTTTGTTTTGTTTGAAACTCTGGACAGGGAATGAGAGAACTGCATTGGCCCTTTGGGATTATGCAATTTGTATGTCTTGTAGTCAATAGATTTTGGAGTCTAGGAGGTTGTGCCATAGAACTGATTCTAAACTAAATCTGAATTTTAAAACTTTGGATTCAAAATTCAGATTTTCAAAGTACTATAGACGATGGCTTTCATCCAAAAAGTTAGATGTAAATAGCTTTATATTTCTTTACATCAATTCTGTGTTCTCAGTCAGTGTTGTTCTGATAGTTTGGGCAATATACAGTAAACGACAGATAAATAAACCAAAGCCGGTATCCGAAGCTGACATGGAAGCAATCTTCAAAGAGGCAAAGGAAGAGCAAGTAAGACATGACCAGGCTTTCCTAGAGAAAGAGCAAGAGCAAGCTTCAAACTCTGGCGCGCTCGAGACTGACTTAGATACAAATGGTACTACTGCTATAAAGCACAAAGGAAGACCGAAAAAAGCTGTTAATACTTCGTCTCCAGGGTCAACTGTTCGGGTGGCGTCTGGGACTTTTGCTGAATTTGAAGGCTCTCTTAAGAAGCTGAACCGTAAAAGTGGAAAGGTGTGCTGTTTTCTGTTAGATGAAAAGTTGGGTACAAAGGAGAAAAATTAGAAGAAAGTTGTGTGCTAGAGACCTGACGGATTGTTAGAAAGCATTTGAATTTTATTTTATCTACTAACCATAATTGATTTCCTTCATGCAGGTAACTGTAGGATTCACACTATTTGGGAAGGAAACCCTCGTAGACCTTGACATTGGTGATATTATTGTAGAGACA
AAATGAGTGAATGTACTTTCTCAGCCATGGTAAGTTTTTTAAATCACGAGATACCATTTATTTGAGGTTTCTTTCGTCTACTTTAATGTAAAATTTAGTGCCTTGGTATATTCAGTCAAAACCCACATAAATTCATAATCATCATAAGAACTTGCAAGGATCTTGTTTGGTCCTTCAGATCATTATTCAACTCACTGTTAACAATCTTCAATAATAAGCCTTCATTTGTAATATTGAAAATTAAGATAGCCTACAATGGCTAATGAAGCTCTCCAAATCACAATAGATTCAGATTTAGTAATCATGTCTGCAAGATTTTGGGGTTGTTTTAAGATAGAAGCGAAAGACGCCATGGCTTTGTTACTTCATCACTCTTCCGTTTGTTACTATTATATTAGAGCGTTTCGAAATAGTTGTTTCTGTTTCTTTAGTTAACACCAATCGGAGCATCTTTGATAGCTGAACATTCTTCACTCACTCTTCTCTGTGATTGTTTTGCAGTGAAGTGTGGATGAAGCTCATGGATTTGGAGGATAAGCAAGCAGCTCCAGCTCCAGCTCTCAGCTCTCAACTCATAAGCACGTTCACGAACCAATGAACACAAAAGTACTTTTGCTTGAATTGCCAAACAGCTTACTTTGATGAACATGTCCACAAATCAGCTGTGCTTTTTTGAAATTACTCTGCATTTGGAGATTCTTATTGGAAATCAACCAAGGTCTTGAAGGAACTGGCAGAAGCCCAAATGTATTATTGTTAGACTATTCAGAAACCAAAATAGGGAATATCATGATGAACCATAGATAGCAATTGTAAATTCAAATGTTTCTGAAGTCAGAAATTTGAATCATGATTGTCTTAATTCAGGAAAATATTCTATAAACTAGCTATGTTCTAAAGTATTTCCAAAATTGTGTTACCATTAAAATCATAGTAATTTTTGAGCTCCTATTCATTGTGGAC

mRNA sequence

CTTCATCAACCAAACGACAAATGGGAGTTGCAATAACGACCACCCATCGGCGTTTATCGTCACCGGTGACTCCACAGCCGAGCCAGTAATCGGAAAATGGCCTGTGGGCTTCTGACTTGGAGCCCAATTTCTCTTCGCTCAACTTCTTTCCGTGCCCTTTCCTTCCCTCTCTCTTCTTCCAAACGTACCCAATTATCAATCTCCGCTGCCGTCGAAATCCCCTCCGCCGCCGACGAAGTTCAGCAGCTGTCGGCCCGGGAGAGGAGGAAGCTGAGGAATGAGAGGAGAGAGATTAAAACCACTACCAATTGGAGGGAAGAAGTAGAGGAGAGGCTCTGCAAGAAGCCCAAGAAGGAATTTGCCACTTGGACTGAGAAGCTCAACCTTGATTACCTCGCTAAATTGGGTCCTCAATGGTGGGTTTTGCGTGTGGCTCGTGTTAGAGGCCAAGAAATTGTTGAACGCCTCGCTCGTTCTCTTGCTAGGAACTACCCCGACCTAGATTTCAAGATATATTACCCGTCTGTTCAGGAGAAGAGGAAATTAAAGAATGGTACTTACACCGTTAAACCGAAAGCTGTTTTTCCTGGATCTGTATTTATTAGGTGTATCATGAACAAGGAGATCCATGACTTCATTAGAGAGTGTGATGGTGTTGGAGGCTTTGTTGGGAATGAGAGAACTGCATTGGCCCTTTGGGATTATGCAATTTTCAGTGTTGTTCTGATAGTTTGGGCAATATACAGTAAACGACAGATAAATAAACCAAAGCCGGTATCCGAAGCTGACATGGAAGCAATCTTCAAAGAGGCAAAGGAAGAGCAAGTAAGACATGACCAGGCTTTCCTAGAGAAAGAGCAAGAGCAAGCTTCAAACTCTGGCGCGCTCGAGACTGACTTAGATACAAATGGTACTACTGCTATAAAGCACAAAGGAAGACCGAAAAAAGCTGTTAATACTTCGTCTCCAGGGTCAACTGTTCGGGTGGCGTCTGGGACTTTTGCTGAATTTGAAGGCTCTCTTAAGAAGCTGAACCGTAAAAGTGGAAAGGTAACTGTAGGATTCACACTATTTGGGAAGGAAACCCTCGTAGACCTTGACATTGGTGATATTATTGTAGAGACAAAATGAGTGAATGTACTTTCTCAGCCATGTGAAGTGTGGATGAAGCTCATGGATTTGGAGGATAAGCAAGCAGCTCCAGCTCCAGCTCTCAGCTCTCAACTCATAAGCACGTTCACGAACCAATGAACACAAAAGTACTTTTGCTTGAATTGCCAAACAGCTTACTTTGATGAACATGTCCACAAATCAGCTGTGCTTTTTTGAAATTACTCTGCATTTGGAGATTCTTATTGGAAATCAACCAAGGTCTTGAAGGAACTGGCAGAAGCCCAAATGTATTATTGTTAGACTATTCAGAAACCAAAATAGGGAATATCATGATGAACCATAGATAGCAATTGTAAATTCAAATGTTTCTGAAGTCAGAAATTTGAATCATGATTGTCTTAATTCAGGAAAATATTCTATAAACTAGCTATGTTCTAAAGTATTTCCAAAATTGTGTTACCATTAAAATCATAGTAATTTTTGAGCTCCTATTCATTGTGGAC

Coding sequence (CDS)

ATGGCCTGTGGGCTTCTGACTTGGAGCCCAATTTCTCTTCGCTCAACTTCTTTCCGTGCCCTTTCCTTCCCTCTCTCTTCTTCCAAACGTACCCAATTATCAATCTCCGCTGCCGTCGAAATCCCCTCCGCCGCCGACGAAGTTCAGCAGCTGTCGGCCCGGGAGAGGAGGAAGCTGAGGAATGAGAGGAGAGAGATTAAAACCACTACCAATTGGAGGGAAGAAGTAGAGGAGAGGCTCTGCAAGAAGCCCAAGAAGGAATTTGCCACTTGGACTGAGAAGCTCAACCTTGATTACCTCGCTAAATTGGGTCCTCAATGGTGGGTTTTGCGTGTGGCTCGTGTTAGAGGCCAAGAAATTGTTGAACGCCTCGCTCGTTCTCTTGCTAGGAACTACCCCGACCTAGATTTCAAGATATATTACCCGTCTGTTCAGGAGAAGAGGAAATTAAAGAATGGTACTTACACCGTTAAACCGAAAGCTGTTTTTCCTGGATCTGTATTTATTAGGTGTATCATGAACAAGGAGATCCATGACTTCATTAGAGAGTGTGATGGTGTTGGAGGCTTTGTTGGGAATGAGAGAACTGCATTGGCCCTTTGGGATTATGCAATTTTCAGTGTTGTTCTGATAGTTTGGGCAATATACAGTAAACGACAGATAAATAAACCAAAGCCGGTATCCGAAGCTGACATGGAAGCAATCTTCAAAGAGGCAAAGGAAGAGCAAGTAAGACATGACCAGGCTTTCCTAGAGAAAGAGCAAGAGCAAGCTTCAAACTCTGGCGCGCTCGAGACTGACTTAGATACAAATGGTACTACTGCTATAAAGCACAAAGGAAGACCGAAAAAAGCTGTTAATACTTCGTCTCCAGGGTCAACTGTTCGGGTGGCGTCTGGGACTTTTGCTGAATTTGAAGGCTCTCTTAAGAAGCTGAACCGTAAAAGTGGAAAGGTAACTGTAGGATTCACACTATTTGGGAAGGAAACCCTCGTAGACCTTGACATTGGTGATATTATTGTAGAGACAAAATGA

Protein sequence

MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAKLGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTSSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
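
The protein sequence above should follow from the CDS by codon-wise translation. The minimal sketch below assumes the standard genetic code (NCBI translation table 1), which this page does not state explicitly. `translate` is an illustrative helper, not a function from any particular library. It is applied to the first 27 and last 15 bases of the CDS shown above.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCCTGTGGGCTTCTGACTTGGAGC"))  # start of the CDS
print(translate("GTAGAGACAAAATGA"))              # end of the CDS, including TGA
```

The two fragments translate to `MACGLLTWS` and `VETK`, matching the start and end of the listed protein sequence.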
Homology
BLAST of CmUC04G074640 vs. NCBI nr
Match: XP_038880828.1 (transcription termination/antitermination protein NusG isoform X1 [Benincasa hispida])

HSP 1 Score: 566.6 bits (1459), Expect = 1.4e-157
Identity = 299/344 (86.92%), Positives = 308/344 (89.53%), Query Frame = 0

Query: 1   MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLR 60
           MACGLL WSPISLRS SF ALSF LSS KRTQLSISA VE PSAAD++QQLSARERRKLR
Sbjct: 1   MACGLLIWSPISLRSISFPALSFSLSSPKRTQLSISATVETPSAADDIQQLSARERRKLR 60

Query: 61  NERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAKLGPQWWVLRVARVRGQEI 120
           NERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYL+KLGPQWWV+RVARVRGQEI
Sbjct: 61  NERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLSKLGPQWWVMRVARVRGQEI 120

Query: 121 VERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDF 180
           VERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKA+FPGSVFIRCIMNKEIHDF
Sbjct: 121 VERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKALFPGSVFIRCIMNKEIHDF 180

Query: 181 IRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAK 240
           IRECDGVGGFVG +                      +KRQINKPKPVSEADMEAIFKEAK
Sbjct: 181 IRECDGVGGFVGAK-------------------VGNTKRQINKPKPVSEADMEAIFKEAK 240

Query: 241 EEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTSSPGSTVRVASG 300
           EEQ RHDQAFLEKEQ++A NS ALETDLDTNGTTA K KGRPKKAVNT SPGSTVRVASG
Sbjct: 241 EEQERHDQAFLEKEQDRAPNSSALETDLDTNGTTATKQKGRPKKAVNTLSPGSTVRVASG 300

Query: 301 TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK 345
           TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
Sbjct: 301 TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK 325
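
Each HSP header reports counts as a fraction of the alignment length, followed by the percentage. The sketch below recomputes those percentages from the fractions as a sanity check. The regular expression and the name `parse_hsp_fractions` are assumptions made for illustration, not part of any BLAST tool.

```python
import re

def parse_hsp_fractions(line):
    """Return {label: (matches, length, recomputed_percent)} for each 'X = a/b (p%)' field."""
    fields = {}
    for label, a, b, _reported in re.findall(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)", line):
        matches, length = int(a), int(b)
        fields[label] = (matches, length, round(100 * matches / length, 2))
    return fields

# Identity field copied from the first HSP above.
stats = parse_hsp_fractions("Identity = 299/344 (86.92%), Query Frame = 0")
print(stats["Identity"])
```

The same function can be run over every HSP header on the page to confirm that the reported percentages agree with their fractions.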

BLAST of CmUC04G074640 vs. NCBI nr
Match: XP_008448915.1 (PREDICTED: transcription termination/antitermination protein NusG [Cucumis melo] >XP_008448924.1 PREDICTED: transcription termination/antitermination protein NusG [Cucumis melo])

HSP 1 Score: 552.7 bits (1423), Expect = 2.2e-153
Identity = 295/345 (85.51%), Positives = 308/345 (89.28%), Query Frame = 0

Query: 1   MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIP-SAADEVQQLSARERRKL 60
           MA GLL WSPISL STS  ALSF LSSS+RTQLSISA+VE P +AAD+VQQLSAR+RRKL
Sbjct: 1   MAYGLLIWSPISLCSTSLPALSFSLSSSRRTQLSISASVETPAAAADDVQQLSARDRRKL 60

Query: 61  RNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAKLGPQWWVLRVARVRGQE 120
           RNERREIKTTTNWREEVEERLC+KPKKEFATWTEKLNLDYLAKLGPQWWV+RVARVRGQE
Sbjct: 61  RNERREIKTTTNWREEVEERLCRKPKKEFATWTEKLNLDYLAKLGPQWWVMRVARVRGQE 120

Query: 121 IVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHD 180
           IVERLARSLARNYPDLDFKIYYPSV+EKRKLKNGTYTVKPKAVFPGSVFIRC+MNKEIHD
Sbjct: 121 IVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVKPKAVFPGSVFIRCVMNKEIHD 180

Query: 181 FIRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEA 240
           FIRECDGVGGFVG +                      +KRQINKPKPVSEADMEAIFKEA
Sbjct: 181 FIRECDGVGGFVGAK-------------------VGNTKRQINKPKPVSEADMEAIFKEA 240

Query: 241 KEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTSSPGSTVRVAS 300
           KEEQ RHDQAFLEKEQE+A NS AL+TDLDTNGTTA KHKGR KKAVNT SPGSTVRVAS
Sbjct: 241 KEEQERHDQAFLEKEQEEAPNSSALKTDLDTNGTTATKHKGRGKKAVNTLSPGSTVRVAS 300

Query: 301 GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK 345
           GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
Sbjct: 301 GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK 326

BLAST of CmUC04G074640 vs. NCBI nr
Match: XP_004147896.1 (uncharacterized protein LOC101211195 [Cucumis sativus] >KGN54344.1 hypothetical protein Csa_018004 [Cucumis sativus])

HSP 1 Score: 546.6 bits (1407), Expect = 1.5e-151
Identity = 288/345 (83.48%), Positives = 304/345 (88.12%), Query Frame = 0

Query: 1   MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIP-SAADEVQQLSARERRKL 60
           MACGLL WS +SL STSF ALSF LSSS+RTQLS+SA+VE P +AAD+ QQLS RERRKL
Sbjct: 1   MACGLLVWSSVSLCSTSFPALSFSLSSSRRTQLSVSASVETPAAAADDAQQLSVRERRKL 60

Query: 61  RNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAKLGPQWWVLRVARVRGQE 120
           RNERREIKTTTNWREEVEERLC+KPKKEFA WTEKLNLDYLAKLGPQWWV+RVARVR QE
Sbjct: 61  RNERREIKTTTNWREEVEERLCRKPKKEFANWTEKLNLDYLAKLGPQWWVMRVARVRSQE 120

Query: 121 IVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHD 180
           IVERLAR LARNYPDLDFKIYYPSV+EKRKLKNGTYTV PKAVFPGSVFIRC+MNKEIHD
Sbjct: 121 IVERLARCLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHD 180

Query: 181 FIRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEA 240
           FIRECDGVGGFVG +                      +KRQINKPKPVSEADMEAIFKEA
Sbjct: 181 FIRECDGVGGFVGAK-------------------VGNTKRQINKPKPVSEADMEAIFKEA 240

Query: 241 KEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTSSPGSTVRVAS 300
           K+EQ RHDQAFLEKEQE+A N+ AL+TDLDTNGTTA KHKGRPKKAVNT SPGSTVRVAS
Sbjct: 241 KDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVAS 300

Query: 301 GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK 345
           GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
Sbjct: 301 GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK 326

BLAST of CmUC04G074640 vs. NCBI nr
Match: XP_022151589.1 (uncharacterized protein LOC111019492 [Momordica charantia])

HSP 1 Score: 528.9 bits (1361), Expect = 3.3e-146
Identity = 286/343 (83.38%), Positives = 300/343 (87.46%), Query Frame = 0

Query: 1   MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLR 60
           MA GLL WS ISLRS+SF ALSF LSSSK TQLSISAA+E  +AAD+VQQLSARERR+LR
Sbjct: 1   MAFGLLPWSSISLRSSSFPALSFSLSSSKCTQLSISAALE-TAAADDVQQLSARERRRLR 60

Query: 61  NERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAKLGPQWWVLRVARVRGQEI 120
           NERREIKTTTNWREEVEERLCKKPKKEFA+WTEKLNLDYLAKLGPQWWV+RVARVRGQEI
Sbjct: 61  NERREIKTTTNWREEVEERLCKKPKKEFASWTEKLNLDYLAKLGPQWWVMRVARVRGQEI 120

Query: 121 VERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDF 180
           VERLARSLARNYPDLDFKIYYPSVQEKR+LKNGTY VKP+AVFPGSVFIRCIMNKEIHDF
Sbjct: 121 VERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYMVKPRAVFPGSVFIRCIMNKEIHDF 180

Query: 181 IRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAK 240
           IRECDGVGGFVG +                      +KRQINKPKPVSEADMEAIFKEAK
Sbjct: 181 IRECDGVGGFVGAK-------------------VGNTKRQINKPKPVSEADMEAIFKEAK 240

Query: 241 EEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTSSPGSTVRVASG 300
           EEQ RHDQ FLEKEQE+A NS   +TDLDTNGTTA K KGR KKAVN  SPGSTVRVASG
Sbjct: 241 EEQERHDQDFLEKEQEKAPNSTIHKTDLDTNGTTATKPKGRLKKAVNALSPGSTVRVASG 300

Query: 301 TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVET 344
           TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI+VET
Sbjct: 301 TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIVVET 323

BLAST of CmUC04G074640 vs. NCBI nr
Match: XP_022931956.1 (uncharacterized protein LOC111438223 [Cucurbita moschata])

HSP 1 Score: 524.6 bits (1350), Expect = 6.3e-145
Identity = 281/344 (81.69%), Positives = 299/344 (86.92%), Query Frame = 0

Query: 1   MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLR 60
           MACGLLTW+ + LRS SF +LSF LSSS RTQLSISAA+E  +AAD+V QLSARERR+LR
Sbjct: 1   MACGLLTWTTLPLRSPSFPSLSFSLSSSLRTQLSISAALE--TAADDVLQLSARERRRLR 60

Query: 61  NERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAKLGPQWWVLRVARVRGQEI 120
           NERRE K TTNWREEVEERLCKKPKKEFA WTEKLNLDYLAKLGPQWWV+RV+RVRGQEI
Sbjct: 61  NERRETK-TTNWREEVEERLCKKPKKEFANWTEKLNLDYLAKLGPQWWVMRVSRVRGQEI 120

Query: 121 VERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDF 180
           VERLARSLARNYPDLDFKIYYPSV EKRKLKNG+YTVKPKAVFPGSVFIRCIMNKE+HDF
Sbjct: 121 VERLARSLARNYPDLDFKIYYPSVHEKRKLKNGSYTVKPKAVFPGSVFIRCIMNKEMHDF 180

Query: 181 IRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAK 240
           IRECDGVGGFVG +                      +KRQINKPKPVS+ DMEAIFKEAK
Sbjct: 181 IRECDGVGGFVGAK-------------------VGNTKRQINKPKPVSQDDMEAIFKEAK 240

Query: 241 EEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTSSPGSTVRVASG 300
           EEQ RHDQAFLEKE+EQA N   LETDLDTNGTTA KHKGRPKKAVNT SPGSTVRV+SG
Sbjct: 241 EEQERHDQAFLEKEKEQAPNPSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVSSG 300

Query: 301 TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK 345
           TFAEFEGSLKK+NRKS KVTVGFTLFGKETLV+LDIGDIIVETK
Sbjct: 301 TFAEFEGSLKKVNRKSKKVTVGFTLFGKETLVELDIGDIIVETK 322

BLAST of CmUC04G074640 vs. ExPASy Swiss-Prot
Match: Q06795 (Transcription termination/antitermination protein NusG OS=Bacillus subtilis (strain 168) OX=224308 GN=nusG PE=1 SV=1)

HSP 1 Score: 58.5 bits (140), Expect = 1.7e-07
Identity = 51/206 (24.76%), Positives = 78/206 (37.86%), Query Frame = 0

Query: 134 DLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGN 193
           D  F++  P  +E+  +KNG   V  K VFPG V +  +M  +    +R   GV GFVG+
Sbjct: 33  DKIFRVVVPE-EEETDIKNGKKKVVKKKVFPGYVLVEIVMTDDSWYVVRNTPGVTGFVGS 92

Query: 194 ERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEK 253
             +                         +KP P+   + E I K    ++ + D  F  K
Sbjct: 93  AGSG------------------------SKPTPLLPGEAETILKRMGMDERKTDIDFELK 152

Query: 254 EQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTSSPGSTVRVASGTFAEFEGSLKKLN 313
           E                                       TV+V  G FA F GS+++++
Sbjct: 153 E---------------------------------------TVKVIDGPFANFTGSIEEID 174

Query: 314 RKSGKVTVGFTLFGKETLVDLDIGDI 340
               KV V   +FG+ET V+L+   I
Sbjct: 213 YDKSKVKVFVNMFGRETPVELEFTQI 174

BLAST of CmUC04G074640 vs. ExPASy Swiss-Prot
Match: P29397 (Transcription termination/antitermination protein NusG OS=Thermotoga maritima (strain ATCC 43589 / DSM 3109 / JCM 10099 / NBRC 100826 / MSB8) OX=243274 GN=nusG PE=1 SV=1)

HSP 1 Score: 54.7 bits (130), Expect = 2.4e-06
Identity = 43/185 (23.24%), Positives = 71/185 (38.38%), Query Frame = 0

Query: 155 YTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWDYAIFSVVLIVWA 214
           Y  K + +FPG VF+  IMN E ++F+R    V GFV +                     
Sbjct: 224 YKTKRRKLFPGYVFVEMIMNDEAYNFVRSVPYVMGFVSSG-------------------- 283

Query: 215 IYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTT 274
                   +P PV + +M  I + A                                G  
Sbjct: 284 -------GQPVPVKDREMRPILRLA--------------------------------GLE 343

Query: 275 AIKHKGRPKKAVNTSSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDL 334
             + K +P K       G  V++ SG F +F G +K+++ +  ++ V  T+FG+ET V L
Sbjct: 344 EYEEKKKPVKVELGFKVGDMVKIISGPFEDFAGVIKEIDPERQELKVNVTIFGRETPVVL 349

Query: 335 DIGDI 340
            + ++
Sbjct: 404 HVSEV 349

BLAST of CmUC04G074640 vs. ExPASy Swiss-Prot
Match: Q9HWC4 (Transcription termination/antitermination protein NusG OS=Pseudomonas aeruginosa (strain ATCC 15692 / DSM 22644 / CIP 104116 / JCM 14847 / LMG 12228 / 1C / PRS 101 / PAO1) OX=208964 GN=nusG PE=3 SV=1)

HSP 1 Score: 53.5 bits (127), Expect = 5.4e-06
Identity = 45/237 (18.99%), Positives = 91/237 (38.40%), Query Frame = 0

Query: 103 LGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAV 162
           +  +W+V+       + ++  L   +     + +F       +E  +++NG      +  
Sbjct: 1   MAKRWYVVHAYSGYEKHVMRSLIERVKLAGMEEEFGEILVPTEEVVEMRNGQKRKSERKF 60

Query: 163 FPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQIN 222
           FPG V ++  MN+     +++   V GF+G                             +
Sbjct: 61  FPGYVLVQMEMNEGTWHLVKDTPRVMGFIGG--------------------------TAD 120

Query: 223 KPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRP 282
           KP P+++ + +AI +                   + ++SG                K +P
Sbjct: 121 KPAPITDREADAILR-------------------RVADSG---------------DKPKP 174

Query: 283 KKAVNTSSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI 340
           K       PG TVRV  G FA+F G ++++N +  ++ V   +FG+ T V+L+   +
Sbjct: 181 K---TLFEPGETVRVIDGPFADFNGVVEEVNYEKSRIQVAVLIFGRSTPVELEFSQV 174

BLAST of CmUC04G074640 vs. ExPASy Swiss-Prot
Match: P65591 (Transcription termination/antitermination protein NusG OS=Neisseria meningitidis serogroup A / serotype 4A (strain DSM 15465 / Z2491) OX=122587 GN=nusG PE=3 SV=1)

HSP 1 Score: 52.4 bits (124), Expect = 1.2e-05
Identity = 48/237 (20.25%), Positives = 88/237 (37.13%), Query Frame = 0

Query: 103 LGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAV 162
           +  +W+V++      + +   L   +AR      F      V++   ++NG  T+  +  
Sbjct: 1   MSKKWYVVQAYSGFEKNVQRILEERIAREEMGDYFGQILVPVEKVVDIRNGRKTISERKS 60

Query: 163 FPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQIN 222
           +PG V +   M  +    ++    V GF+G                           + N
Sbjct: 61  YPGYVLVEMEMTDDSWHLVKSTPRVSGFIGG--------------------------RAN 120

Query: 223 KPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRP 282
           +P P+S+ + E I ++         Q  +EK                            P
Sbjct: 121 RPTPISQREAEIILQQV--------QTGIEK----------------------------P 174

Query: 283 KKAVNTSSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI 340
           K  V     G  VRV  G FA+F G ++++N +  K+ V   +FG+ET V+L+   +
Sbjct: 181 KPKVE-FEVGQQVRVNEGPFADFNGVVEEVNYERNKLRVSVQIFGRETPVELEFSQV 174

BLAST of CmUC04G074640 vs. ExPASy Swiss-Prot
Match: P65592 (Transcription termination/antitermination protein NusG OS=Neisseria meningitidis serogroup B (strain MC58) OX=122586 GN=nusG PE=3 SV=1)

HSP 1 Score: 52.4 bits (124), Expect = 1.2e-05
Identity = 48/237 (20.25%), Positives = 88/237 (37.13%), Query Frame = 0

Query: 103 LGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAV 162
           +  +W+V++      + +   L   +AR      F      V++   ++NG  T+  +  
Sbjct: 1   MSKKWYVVQAYSGFEKNVQRILEERIAREEMGDYFGQILVPVEKVVDIRNGRKTISERKS 60

Query: 163 FPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQIN 222
           +PG V +   M  +    ++    V GF+G                           + N
Sbjct: 61  YPGYVLVEMEMTDDSWHLVKSTPRVSGFIGG--------------------------RAN 120

Query: 223 KPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRP 282
           +P P+S+ + E I ++         Q  +EK                            P
Sbjct: 121 RPTPISQREAEIILQQV--------QTGIEK----------------------------P 174

Query: 283 KKAVNTSSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI 340
           K  V     G  VRV  G FA+F G ++++N +  K+ V   +FG+ET V+L+   +
Sbjct: 181 KPKVE-FEVGQQVRVNEGPFADFNGVVEEVNYERNKLRVSVQIFGRETPVELEFSQV 174

BLAST of CmUC04G074640 vs. ExPASy TrEMBL
Match: A0A1S3BKU2 (transcription termination/antitermination protein NusG OS=Cucumis melo OX=3656 GN=LOC103490936 PE=4 SV=1)

HSP 1 Score: 552.7 bits (1423), Expect = 1.0e-153
Identity = 295/345 (85.51%), Positives = 308/345 (89.28%), Query Frame = 0

Query: 1   MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIP-SAADEVQQLSARERRKL 60
           MA GLL WSPISL STS  ALSF LSSS+RTQLSISA+VE P +AAD+VQQLSAR+RRKL
Sbjct: 1   MAYGLLIWSPISLCSTSLPALSFSLSSSRRTQLSISASVETPAAAADDVQQLSARDRRKL 60

Query: 61  RNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAKLGPQWWVLRVARVRGQE 120
           RNERREIKTTTNWREEVEERLC+KPKKEFATWTEKLNLDYLAKLGPQWWV+RVARVRGQE
Sbjct: 61  RNERREIKTTTNWREEVEERLCRKPKKEFATWTEKLNLDYLAKLGPQWWVMRVARVRGQE 120

Query: 121 IVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHD 180
           IVERLARSLARNYPDLDFKIYYPSV+EKRKLKNGTYTVKPKAVFPGSVFIRC+MNKEIHD
Sbjct: 121 IVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVKPKAVFPGSVFIRCVMNKEIHD 180

Query: 181 FIRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEA 240
           FIRECDGVGGFVG +                      +KRQINKPKPVSEADMEAIFKEA
Sbjct: 181 FIRECDGVGGFVGAK-------------------VGNTKRQINKPKPVSEADMEAIFKEA 240

Query: 241 KEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTSSPGSTVRVAS 300
           KEEQ RHDQAFLEKEQE+A NS AL+TDLDTNGTTA KHKGR KKAVNT SPGSTVRVAS
Sbjct: 241 KEEQERHDQAFLEKEQEEAPNSSALKTDLDTNGTTATKHKGRGKKAVNTLSPGSTVRVAS 300

Query: 301 GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK 345
           GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
Sbjct: 301 GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK 326

BLAST of CmUC04G074640 vs. ExPASy TrEMBL
Match: A0A0A0KZV1 (NGN domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G307370 PE=4 SV=1)

HSP 1 Score: 546.6 bits (1407), Expect = 7.5e-152
Identity = 288/345 (83.48%), Positives = 304/345 (88.12%), Query Frame = 0

Query: 1   MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIP-SAADEVQQLSARERRKL 60
           MACGLL WS +SL STSF ALSF LSSS+RTQLS+SA+VE P +AAD+ QQLS RERRKL
Sbjct: 1   MACGLLVWSSVSLCSTSFPALSFSLSSSRRTQLSVSASVETPAAAADDAQQLSVRERRKL 60

Query: 61  RNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAKLGPQWWVLRVARVRGQE 120
           RNERREIKTTTNWREEVEERLC+KPKKEFA WTEKLNLDYLAKLGPQWWV+RVARVR QE
Sbjct: 61  RNERREIKTTTNWREEVEERLCRKPKKEFANWTEKLNLDYLAKLGPQWWVMRVARVRSQE 120

Query: 121 IVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHD 180
           IVERLAR LARNYPDLDFKIYYPSV+EKRKLKNGTYTV PKAVFPGSVFIRC+MNKEIHD
Sbjct: 121 IVERLARCLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHD 180

Query: 181 FIRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEA 240
           FIRECDGVGGFVG +                      +KRQINKPKPVSEADMEAIFKEA
Sbjct: 181 FIRECDGVGGFVGAK-------------------VGNTKRQINKPKPVSEADMEAIFKEA 240

Query: 241 KEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTSSPGSTVRVAS 300
           K+EQ RHDQAFLEKEQE+A N+ AL+TDLDTNGTTA KHKGRPKKAVNT SPGSTVRVAS
Sbjct: 241 KDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVAS 300

Query: 301 GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK 345
           GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
Sbjct: 301 GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK 326

BLAST of CmUC04G074640 vs. ExPASy TrEMBL
Match: A0A6J1DDH4 (uncharacterized protein LOC111019492 OS=Momordica charantia OX=3673 GN=LOC111019492 PE=4 SV=1)

HSP 1 Score: 528.9 bits (1361), Expect = 1.6e-146
Identity = 286/343 (83.38%), Positives = 300/343 (87.46%), Query Frame = 0

Query: 1   MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLR 60
           MA GLL WS ISLRS+SF ALSF LSSSK TQLSISAA+E  +AAD+VQQLSARERR+LR
Sbjct: 1   MAFGLLPWSSISLRSSSFPALSFSLSSSKCTQLSISAALE-TAAADDVQQLSARERRRLR 60

Query: 61  NERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAKLGPQWWVLRVARVRGQEI 120
           NERREIKTTTNWREEVEERLCKKPKKEFA+WTEKLNLDYLAKLGPQWWV+RVARVRGQEI
Sbjct: 61  NERREIKTTTNWREEVEERLCKKPKKEFASWTEKLNLDYLAKLGPQWWVMRVARVRGQEI 120

Query: 121 VERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDF 180
           VERLARSLARNYPDLDFKIYYPSVQEKR+LKNGTY VKP+AVFPGSVFIRCIMNKEIHDF
Sbjct: 121 VERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYMVKPRAVFPGSVFIRCIMNKEIHDF 180

Query: 181 IRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAK 240
           IRECDGVGGFVG +                      +KRQINKPKPVSEADMEAIFKEAK
Sbjct: 181 IRECDGVGGFVGAK-------------------VGNTKRQINKPKPVSEADMEAIFKEAK 240

Query: 241 EEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTSSPGSTVRVASG 300
           EEQ RHDQ FLEKEQE+A NS   +TDLDTNGTTA K KGR KKAVN  SPGSTVRVASG
Sbjct: 241 EEQERHDQDFLEKEQEKAPNSTIHKTDLDTNGTTATKPKGRLKKAVNALSPGSTVRVASG 300

Query: 301 TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVET 344
           TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI+VET
Sbjct: 301 TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIVVET 323

BLAST of CmUC04G074640 vs. ExPASy TrEMBL
Match: A0A6J1EV13 (uncharacterized protein LOC111438223 OS=Cucurbita moschata OX=3662 GN=LOC111438223 PE=4 SV=1)

HSP 1 Score: 524.6 bits (1350), Expect = 3.1e-145
Identity = 281/344 (81.69%), Positives = 299/344 (86.92%), Query Frame = 0

Query: 1   MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLR 60
           MACGLLTW+ + LRS SF +LSF LSSS RTQLSISAA+E  +AAD+V QLSARERR+LR
Sbjct: 1   MACGLLTWTTLPLRSPSFPSLSFSLSSSLRTQLSISAALE--TAADDVLQLSARERRRLR 60

Query: 61  NERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAKLGPQWWVLRVARVRGQEI 120
           NERRE K TTNWREEVEERLCKKPKKEFA WTEKLNLDYLAKLGPQWWV+RV+RVRGQEI
Sbjct: 61  NERRETK-TTNWREEVEERLCKKPKKEFANWTEKLNLDYLAKLGPQWWVMRVSRVRGQEI 120

Query: 121 VERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDF 180
           VERLARSLARNYPDLDFKIYYPSV EKRKLKNG+YTVKPKAVFPGSVFIRCIMNKE+HDF
Sbjct: 121 VERLARSLARNYPDLDFKIYYPSVHEKRKLKNGSYTVKPKAVFPGSVFIRCIMNKEMHDF 180

Query: 181 IRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAK 240
           IRECDGVGGFVG +                      +KRQINKPKPVS+ DMEAIFKEAK
Sbjct: 181 IRECDGVGGFVGAK-------------------VGNTKRQINKPKPVSQDDMEAIFKEAK 240

Query: 241 EEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTSSPGSTVRVASG 300
           EEQ RHDQAFLEKE+EQA N   LETDLDTNGTTA KHKGRPKKAVNT SPGSTVRV+SG
Sbjct: 241 EEQERHDQAFLEKEKEQAPNPSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVSSG 300

Query: 301 TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK 345
           TFAEFEGSLKK+NRKS KVTVGFTLFGKETLV+LDIGDIIVETK
Sbjct: 301 TFAEFEGSLKKVNRKSKKVTVGFTLFGKETLVELDIGDIIVETK 322

BLAST of CmUC04G074640 vs. ExPASy TrEMBL
Match: A0A6J1HQX6 (uncharacterized protein LOC111465889 OS=Cucurbita maxima OX=3661 GN=LOC111465889 PE=4 SV=1)

HSP 1 Score: 520.8 bits (1340), Expect = 4.4e-144
Identity = 280/344 (81.40%), Positives = 298/344 (86.63%), Query Frame = 0

Query: 1   MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLR 60
           MAC LLTW+ +SLRS SF +LSF LSSS RTQLSISAA+E  +AAD+V QLSARERR+LR
Sbjct: 1   MACRLLTWTTLSLRSPSFPSLSFSLSSSLRTQLSISAALE--TAADDVLQLSARERRRLR 60

Query: 61  NERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAKLGPQWWVLRVARVRGQEI 120
           NERRE K TTNWREEVEERLCKKPKKEFA WTEKLNLDYL+KLGPQWWV+RV+RVRGQEI
Sbjct: 61  NERRETK-TTNWREEVEERLCKKPKKEFANWTEKLNLDYLSKLGPQWWVMRVSRVRGQEI 120

Query: 121 VERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDF 180
           VERLARSLARNYPDLDFKIYYPSV EKRKLKNG+YTVKPKAVFPGSVFIRCIMNKE+HDF
Sbjct: 121 VERLARSLARNYPDLDFKIYYPSVHEKRKLKNGSYTVKPKAVFPGSVFIRCIMNKEMHDF 180

Query: 181 IRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAK 240
           IRECDGVGGFVG +                      +KRQINKPKPVS+ DMEAIFKEAK
Sbjct: 181 IRECDGVGGFVGAK-------------------VGNTKRQINKPKPVSQDDMEAIFKEAK 240

Query: 241 EEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTSSPGSTVRVASG 300
           EEQ RHDQAFLEK++EQA N   LET LDTNGTTA KHKGRPKKAVNT SPGSTVRVASG
Sbjct: 241 EEQERHDQAFLEKQKEQAPNPSVLETGLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASG 300

Query: 301 TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK 345
           TFAEFEGSLKK+NRKSGKVTVGFTLFGKETLV LDIGDIIVETK
Sbjct: 301 TFAEFEGSLKKVNRKSGKVTVGFTLFGKETLVVLDIGDIIVETK 322

BLAST of CmUC04G074640 vs. TAIR 10
Match: AT3G09210.1 (plastid transcriptionally active 13 )

HSP 1 Score: 312.0 bits (798), Expect = 6.0e-85
Identity = 183/359 (50.97%), Positives = 240/359 (66.85%), Query Frame = 0

Query: 4   GLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNER 63
           GLL WS    RS+   ++  P++   +TQ SI+A V      +   QL+A+ERR+LRNER
Sbjct: 7   GLLQWS----RSSLIPSIYTPIN---KTQFSIAACV-----IERNHQLTAKERRQLRNER 66

Query: 64  REIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAKLGPQWWVLRVARVRGQEIVER 123
           RE K   +WREEVEE+L KKPKK +ATWTE+LNLD LA+ GPQWW +RV+R+RG E  + 
Sbjct: 67  RESKLGYSWREEVEEKLIKKPKKRYATWTEELNLDTLAESGPQWWAVRVSRLRGHETAQI 126

Query: 124 LARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRE 183
           LAR+LAR +P+++F +Y PSVQ KRKLKNG+ +VKPK VFPG +FIRCI+NKEIHD IR+
Sbjct: 127 LARALARQFPEMEFTVYAPSVQVKRKLKNGSISVKPKPVFPGCIFIRCILNKEIHDSIRD 186

Query: 184 CDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQ 243
            DGVGGF+G++                      +KRQINKP+PV ++D+EAIFK+AKE Q
Sbjct: 187 VDGVGGFIGSK-------------------VGNTKRQINKPRPVDDSDLEAIFKQAKEAQ 246

Query: 244 VRHDQAFLE--KEQEQA-----------SNSGALETDLDT-------NGTTAIKHKGRPK 303
            + D  F E  + +E+A           SNS  +ET  ++         T A + K + K
Sbjct: 247 EKADSEFEEADRAEEEASILASQELLALSNSDVIETVAESKPKRAPRKATLATETKAKKK 306

Query: 304 KAVNTSSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVE 343
           K     + GSTVRV SGTFAEF G+LKKLNRK+ K TVGFTLFGKETLV++DI +++ E
Sbjct: 307 KL----AAGSTVRVLSGTFAEFVGNLKKLNRKTAKATVGFTLFGKETLVEIDINELVPE 330

The following BLAST results are available for this feature:
NCBI nr
Match Name | E-value | Identity | Description
XP_038880828.1 | 1.4e-157 | 86.92 | transcription termination/antitermination protein NusG isoform X1 [Benincasa his...
XP_008448915.1 | 2.2e-153 | 85.51 | PREDICTED: transcription termination/antitermination protein NusG [Cucumis melo]...
XP_004147896.1 | 1.5e-151 | 83.48 | uncharacterized protein LOC101211195 [Cucumis sativus] >KGN54344.1 hypothetical ...
XP_022151589.1 | 3.3e-146 | 83.38 | uncharacterized protein LOC111019492 [Momordica charantia]
XP_022931956.1 | 6.3e-145 | 81.69 | uncharacterized protein LOC111438223 [Cucurbita moschata]

ExPASy Swiss-Prot
Match Name | E-value | Identity | Description
Q06795 | 1.7e-07 | 24.76 | Transcription termination/antitermination protein NusG OS=Bacillus subtilis (str...
P29397 | 2.4e-06 | 23.24 | Transcription termination/antitermination protein NusG OS=Thermotoga maritima (s...
Q9HWC4 | 5.4e-06 | 18.99 | Transcription termination/antitermination protein NusG OS=Pseudomonas aeruginosa...
P65591 | 1.2e-05 | 20.25 | Transcription termination/antitermination protein NusG OS=Neisseria meningitidis...
P65592 | 1.2e-05 | 20.25 | Transcription termination/antitermination protein NusG OS=Neisseria meningitidis...

ExPASy TrEMBL
Match Name | E-value | Identity | Description
A0A1S3BKU2 | 1.0e-153 | 85.51 | transcription termination/antitermination protein NusG OS=Cucumis melo OX=3656 G...
A0A0A0KZV1 | 7.5e-152 | 83.48 | NGN domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G307370 PE=4 SV...
A0A6J1DDH4 | 1.6e-146 | 83.38 | uncharacterized protein LOC111019492 OS=Momordica charantia OX=3673 GN=LOC111019...
A0A6J1EV13 | 3.1e-145 | 81.69 | uncharacterized protein LOC111438223 OS=Cucurbita moschata OX=3662 GN=LOC1114382...
A0A6J1HQX6 | 4.4e-144 | 81.40 | uncharacterized protein LOC111465889 OS=Cucurbita maxima OX=3661 GN=LOC111465889...

TAIR 10
Match Name | E-value | Identity | Description
AT3G09210.1 | 6.0e-85 | 50.97 | plastid transcriptionally active 13
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL531) v1
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 45..65
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 261..292
None | No IPR available | PANTHER | PTHR30265:SF4 | TRANSCRIPTION ANTITERMINATION PROTEIN RFAH | coord: 34..341
None | No IPR available | CDD | cd06091 | KOW_NusG | coord: 291..339; e-value: 1.62574E-12; score: 59.7784
None | No IPR available | CDD | cd09890 | NGN_plant | coord: 106..236; e-value: 1.2595E-43; score: 144.418
IPR006645 | NusG, N-terminal | SMART | SM00738 | nusgn_4 | coord: 104..239; e-value: 3.0E-17; score: 73.3
IPR006645 | NusG, N-terminal | PFAM | PF02357 | NusG | coord: 106..197; e-value: 4.5E-9; score: 36.7
IPR036735 | NusG, N-terminal domain superfamily | GENE3D | 3.30.70.940 | - | coord: 105..240; e-value: 2.4E-16; score: 61.7
IPR036735 | NusG, N-terminal domain superfamily | SUPERFAMILY | 82679 | N-utilization substance G protein NusG, N-terminal domain | coord: 105..206
IPR014722 | Ribosomal protein L2, domain 2 | GENE3D | 2.30.30.30 | - | coord: 288..340; e-value: 5.9E-12; score: 47.4
IPR043425 | NusG-like | PANTHER | PTHR30265 | RHO-INTERACTING TRANSCRIPTION TERMINATION FACTOR NUSG | coord: 34..341
IPR008991 | Translation protein SH3-like domain superfamily | SUPERFAMILY | 50104 | Translation proteins SH3-like domain | coord: 291..339
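
The `coord` values in the InterPro table give each domain's position on the protein sequence. The sketch below shows how such a range could be turned into a length or a subsequence. It assumes the coordinates are 1-based and inclusive, as is usual for InterProScan output but not stated on this page. `domain_length` and `domain_slice` are illustrative names.

```python
def domain_length(start, end):
    """Number of residues in a domain, assuming 1-based inclusive coordinates."""
    return end - start + 1

def domain_slice(protein, start, end):
    """Return the residues start..end (1-based, inclusive) of a protein string."""
    if not 1 <= start <= end <= len(protein):
        raise ValueError(f"coordinates {start}..{end} fall outside the sequence")
    return protein[start - 1:end]

# KOW_NusG (cd06091) is reported at coord 291..339.
print(domain_length(291, 339))

# Slicing demonstrated on the first 15 residues of the protein listed above.
n_terminus = "MACGLLTWSPISLRS"
print(domain_slice(n_terminus, 4, 9))
```

Passing the full protein sequence with a row's `coord` values returns that domain's residues under the same assumption.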

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmUC04G074640.1 | CmUC04G074640.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006355 | regulation of transcription, DNA-templated