CcUC02G042230 (gene) Watermelon (PI 537277) v1

Overview
Name: CcUC02G042230
Type: gene
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: NusB domain-containing protein
Location: CicolChr02: 37204639 .. 37210704 (+)
RNA-Seq Expression: CcUC02G042230
Synteny: CcUC02G042230
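
The location field above can be split into its parts and used to derive the length of the gene span. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state explicitly); `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'SeqID: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text.strip())
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("CicolChr02: 37204639 .. 37210704 (+)")
# Assumption: 1-based inclusive coordinates, so both end points are counted.
span = end - start + 1
print(seqid, strand, span)  # CicolChr02 + 6066
```

Under that assumption the gene spans 6066 bp on the plus strand of CicolChr02.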
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CCTTCTTCCACTACTAGAAATGCCTTTGCCTCCAAATCCCGAGTGAGTAGCGTGGGCTGATTTTGGGGCATTTCACAATGTCTTTAGCTCCACCCACCTCCCCTTATCCGTTCCCCAAATCCTGCCTTCCTAACCATTTATCTTCCTCTTCTCAATCTTCATCTTCCCATCCCAAACTCTTTCCCCCACACTCTCGCTTTCATCTTAGTTTCTCCAACTCCTTTTCAACCCTTCAGTCCCTTAAACCCTCACCTTTCATCGCCAGAGCGTCCGACATTGGACTCCGAGATTCCGTTCATGCTGACCAACCTGGTGATTCTGCCGACAAGACTGGAATCTTCACCCATCGGGATAAAGTTATTACTATCAGGTTTCTCTTTCATTTGCCATTTCTATGTCCTTCCCCTTCTTCTGCCCTCTATCCTTTACCTACTCCTCTGTTTCCTTTTCAATTCAAACTCTGCTTCCTTGCTCACTATGATCACCATGTAAGCAGAATTAAGGAATTAATGGTGAATCTTGGCTAAATTCATGTCAATCGTTTTTAGTTTCTTATTTTGAATTTTAATTCCTCTTTAGTTTCCTTTTAGGATTTCATTCTCGCCCTCCCTCATGCATTCTCTACGACTTTGATTCATAATAATAAGTTTTTTTTTTTTTGATAAGAAATTCATAATAATTAGTTTTGAAATTTGATTCTTGGAGAATTCTTTCCTTTGGGTCCTTTATGCTACATCAGTTCACCAACAATTGACCTCTCTGTTTATTTCGCACGCTTAGAAATTTGAATTGTGAAATGGAAACTATAGGATGTGGCATGTCTCTTCACTTCATACTAGTTACTCCCTCTGTATACTTGCATTATCTTTACGACTTTCCAGATGGTGGATGCTGGGTCAATGCGTAACATTTTCTTTAAAGGTTGTTTTTTGGGTCTTTCCTTAGTAGTCTGAAGTTAGGCTATACCAGATGGCACCTGGATGAGACATCGAGATAACACGACCGCCATTTTATACTGGTCAACAGTAACGAACTTGGTTTTATGTAATGCAATCTATGGTTACTACGTTGAAAAGAAACTAAAAGCTTCTTTTTCTAAATACTCTCTCACTTGTGAGCTTGGAATAAGCTTAAGGACTTAGGGTGTGATTGTGGAGAGGAAATGGGAGAGGTAGAGTTGTTTGGCCGAAGGGGTTATTGAAAGCATTGTAAATTCCATGACTCATTAGCATCGATTCCATATTTATTAACTCCTTAAATTGTGGTTCGATGACTAAACAACTTTTACTCCTCACTCTCTTATAAGTTATGGTAAATTTAATGAATAATTTATATTCTTGACAGCACTAGACTATTGGATATTAATTGACACAGCTGGTAGAGGGAGGGTGAACTTTGATGCCAGAAAATTGTCTTATTGTTATTTTACCTAGTTTTTCTTAAAGAAAGTAAAAGGGAAAAAAGAAAGAGAAAGATATAGTCCATGATTTCTAGATATCTCTTGAAAACATGACTTAACTTAGCATGCCTAATTTTCATTCTTGCTCTTCGTCTCTGTGCTTCAGGCCTAATTTTCATTTCTCCTACCACATTTCTGGAATATGTCGTTTCCCGTTTCGTGCTAGTTCAATCGTTCCCCACGCAAAAGATCCCATGCCACACCTCTGCCCTCAAGCCTCACTTCGGGCTTCTACTTCTTTTTCTGAAAATTGTGTGGCTGAAGAGAGGAATTCTATTGCAGTTTCTTCCATTGAAACAATACCAAAGGTCGACAAGAGCGGTAAATTTTGTAGCCCCAGAGCTGCTAGAGAGCTCGCTTTGTAAGATCCATCATTTCCTTTTTTTTTTTTTCCTAACATATGAATATCTCCTCTCTACCTTTTTAACTATTCAAAAAGTTGGCCTACTCACCAAGAATATACTCTAAGAATTTGTTTAATTTCTAGACTTGTAATCAATTCTAATTCGGATTATTCGATCACAGGTCAATTGTTTATGCA
GCTTGTTTAGAAGGCTCTGATCCCGTTCGACTCTTTGAGAAGCGGTTGAATGCCCGACGAGGTAATCTATGGTTGAGACATGATACATGGTTTCTATTGCCCCATATACTATTTGGTGTCTGTGTGTTTTTTAGATATTTATATAGTTTCAAAAAATATAAAAAAAGATTGTGTTGGGGAAATAGATGTCACATATTTAACTCAAATATAAAACCAACTTCTTGCTTTATTTGTTTGTACAATTCAGAATCAGGATATGAATTTGACAAGACATCATTAATGGAATATAATCATATGAGCTTTGGAGGCCCGCCAGTTACCGTAGAAACAGTTGAAGAAGCAGATGAGCTTTTACGCAAGGATGAAAAGGATTCTACAATTGGTAAACAATTATGTGACCTTCGTATATGTATTATTTCTTCTTCTTAATTCACTCATTACTTTATATTAAAAACAGAATTGGATCCTAAAAGAGACAAAACTTTTTAGAGGAAAAGAGAGACTACTGGTGAAAGGATACAAATTCCTAAATTCCTAAAGGAGTAAAAATAAAAAAGAAAAGCATAGAAATATAAAGATTCCACAAACCATAACAAATGATATTCTTAAAAACTTAACAAAGAATTAATACATTAAGGTCAAGCATGGGTATGAACTTCAAGTTGCCCATAGAAGTTGGAGTAGATGTCCAAAAGCTCCAACAAGTATTGGAAAATTGCAATCTTCTGTTTAGGTAACTTTACAAAGAAGCTTTTCCTTCTAAATTTTAACCTCTTCTTCATCAAGAAGTGCTATTAACGCTGATTAAATAACAGGACTGTATTAGGTTCCATGAGAAATGTTGAAAATAAAACTCAAAGAGGCCCTTGAAGATAAAATTCTGACATCGGTAAAAGGAATATATAGAGAGGGGAGAAATGCCAAAAAACTCCTGAACTTCCAGAAGAAAAGTTTCAACTTTTTTTATTTTTTATTTTTATATAATTAATGCAGTATCTAATGAAAGAATCAAGAGGACTACTAGTTCCAAGGCCCCAAATGTCACTATGGATCAGCAAAAGGGTCAGAAGATTTGCAGGGCACACAGGGATAAATATTTTTGGTATGTTTGGAGAGTTGACAAATTTTGCACTGAAAATGACTTGATTTCTTATTAATGAATAACTGAGGTAACAATTTTTCAAATATGAAAAATATTTGAAAAATTTGGATGACCGAGACAAAAGTTCTATAACATGACTTTGCCTTAACAAGACAAGGATTTAAAATTGGAAGTAGATTCGGGAATAAGGGTATGTTTGGAATACATTTTCAAGTGTTTAATTTAAAAAATAAGTCATTTTGAAAAAATCGGAGTGTTTGGCAACCACTCAAAATAGATTTTGAAGTGTATTTTGAACGGTTTTTATAAAAAGAGTTTAAATAAAAATGAGTTTTTTGAAAAATGCTTTTTCCTCAAGTCAATCCAAATGGGCCCTAAGACTACAAAATTGAGAAAATACTCACAACTAAGCCTGGAAGATGATTTTGAAGGATTAACCACAAATTTGATGGAGGGATCAAGGATTCTACCAAAAGCAAAGCTCAAAGGACAAAATGTCACTTTTGAAAATTTAGAAACCCCAATGCAACCAAACCCAAAACTTAGGAACTAAAGTTTATTTTTCCCATGTGTTTTCTATAGTTGTAAATGCAGTATTGTCTAGCCTCAAAGTCCGATTTTATTGCGGTTAGTATTAAAGTTCTGTTTTCTGCATGACCGTATCTCAGAGGCAGAAATCCTCGCAGCCCCACCAAAGATGGTCTACAGCAAACTGATCTTACGGTAAACGTGGTTTATATTCCCCTGCCTTTGACAGAGGAAGATAAGATGACAATTTAGTATGAATTTCATGTATTTTATCAATGTTATTTACTTCAAGGATGAAATAAGGTCAGCCATTGTCTTCATGGTTTTCAGGTTTACACGAAAACTTTTGGTTGCAGTTGTGGACGAATGG
GACAGCCGTGTGCTTAAAATTGAAAAAGTCATTCCTTCAACTTGGAAGGTCGTGTAAAAAAAGCTTGTTTCTTTTTGCTATTTTATTTTCTGATTTTCTCTTCTCTTCTAGTTGGGAATATACGTTGAATCATTTATCATGCGATTATAATACTCCTCCTGAACTGATCTTTGCCTCATAATCATAGCATATGTGTGCTTTTTACAACACTTACAGAAATTTGACAGAAATTTGAAAAGCTTTGTATGATGGTGTCAAGGATGGAACCACAAACACATGTGCAATATTTCATCCATGGATATTGTTGAATTTCTCGTGTGTTAGAAACTCTGAGCTTTAATTATCGAAGGTTTGAGCTTAGTGATAAACATCCTTGGACTCAAACAGTTAATGGTTTCAAAATTTGTTGGATCTCCATTCCAGAATTTTATATCTTACAGGTTCTTGACTACATAATTATATAGTCGTCCAAGTTTGAACCCAGAAAATATTAAGCTATCAGTTTTTATGGGGTTAAAATAAAACCTTTAGCAACCATCATAGGATTGCCTAGTAGTAAATAAGGGAACATTCCCTTAATGCAGGCTAAGAGGTCATGGGTTCAATCTATGGTGGCCAAGGTACCTAGGATTTAATTTATTGTGAGTTTCATTGATACCCAAATCCATGATAGTCACCTATTTAGGATTTAATATCTCACGAGTTTTCTTGACACCAAAATGTTGTAGGGTTAAGCGAATTGTCCCATGAGATTAATCAAGGTGCACCGCTGACCTGGACACTCACAGATATGCAAAAAAATAAAACAGAACCTTTAGCCTTGAGAACATGAATTTTGTCAATAGTCATCTAGATATGCAACTAGGTTAAGCGCTATACAAAAGTATTGTCATTCTATCAAATTTTTTCTTTTCTAAAAAAGTGTACTTTTCAATAACCAAAGTTGTTGAACTTCCAGTTTCTCTGTGTTTGTTTATTTCATATGCTCTTAATAGTCGTGAACCTAAATTTGAATAATGTTATATTTTGTTAATTATTTTTTGTTTGAATTCAGTAAAGAGATCAACCCATCTAAACCTTTGAAATGGTAGTTGATGACCTATTGAGCTATGTTTGTAATGACATGTTTTGTTAATCTATTAGATAAATCATTCCAGCCTTTATGTTATTCGTTATTCAGACATTGCAGTTGCGTGAATCCAAACCAAAGATCACGCGCTTGGACTTGAAGTTCTTGAATTAAGTTGCAGATTTCTATGAATGTTTAAGATTTGTAATTTGTAAAGGTTCGGGAGTTTAAATATCACCTGCATTCGTATATATTTCAGAACAAGCCAGCAGGACGGATTCTAGAGCTTTGTATTCTCCACCTGGCCATGTCTGAAATAACGGTTATTGGAACAAGGCATCAGATTGTCATTAATGAGGTAAATGCGTCTTGAGCCATCTTACTTGAAGGTCTGTTGATATATATCTAACATCAATTCTCTTTTTGTTTAAAAAAATAATAATAATTGTTCTTGGTTATGGATGAAGGCCGTTGATCTTGCAAAACGATTCTGTGATGGAGCAGCACCCCGTATTATTAACGGGTGCCTTAGGACCTTTGTAAAGGACATCAAAGAAATTGATTCAACTCATGCTGGAGACAAGCAAGAAGTCCGGGCATAAGCCTGAACTTTTGTGGTGCCTCAGGACCCTTTGTAAGGATATCGAAGAATTCGACTGAACTCATGCTAGCTCGAGTAAGTGATTCTGGAGTCTCAACTTTTGGGGGTACGTCTATTAAACCTGTTGCTGTCAAAAGCAGCCAAAGTGCAGTGATTTGAAGAATCTCATACTTGCCGCAAACCACAGCAAGAATTTCTTTCCCTTTCCCTTGTAGGCGTGGATTTCTGTTATATCCAAATTATCCATAGAGTTGAGTAATGAGGTGCATGATGTGTGAGAGAATGAGTTGCCAAATTAATCTAGTACTTGGAGAGATGTTATCATGCACA
TAATGTTGTCAATGTATTGAAGGGTTTACTTTTTTCTTTTTCTTTTTTTTAATTTTTAAGTATGTG

mRNA sequence

CCTTCTTCCACTACTAGAAATGCCTTTGCCTCCAAATCCCGAGTGAGTAGCGTGGGCTGATTTTGGGGCATTTCACAATGTCTTTAGCTCCACCCACCTCCCCTTATCCGTTCCCCAAATCCTGCCTTCCTAACCATTTATCTTCCTCTTCTCAATCTTCATCTTCCCATCCCAAACTCTTTCCCCCACACTCTCGCTTTCATCTTAGTTTCTCCAACTCCTTTTCAACCCTTCAGTCCCTTAAACCCTCACCTTTCATCGCCAGAGCGTCCGACATTGGACTCCGAGATTCCGTTCATGCTGACCAACCTGGTGATTCTGCCGACAAGACTGGAATCTTCACCCATCGGGATAAAGTTATTACTATCAGGCCTAATTTTCATTTCTCCTACCACATTTCTGGAATATGTCGTTTCCCGTTTCGTGCTAGTTCAATCGTTCCCCACGCAAAAGATCCCATGCCACACCTCTGCCCTCAAGCCTCACTTCGGGCTTCTACTTCTTTTTCTGAAAATTGTGTGGCTGAAGAGAGGAATTCTATTGCAGTTTCTTCCATTGAAACAATACCAAAGGTCGACAAGAGCGGTAAATTTTGTAGCCCCAGAGCTGCTAGAGAGCTCGCTTTGTCAATTGTTTATGCAGCTTGTTTAGAAGGCTCTGATCCCGTTCGACTCTTTGAGAAGCGGTTGAATGCCCGACGAGAATCAGGATATGAATTTGACAAGACATCATTAATGGAATATAATCATATGAGCTTTGGAGGCCCGCCAGTTACCGTAGAAACAGTTGAAGAAGCAGATGAGCTTTTACGCAAGGATGAAAAGGATTCTACAATTGGTAAACAATTATGTGACCTTCTATCTAATGAAAGAATCAAGAGGACTACTAGTTCCAAGGCCCCAAATGTCACTATGGATCAGCAAAAGGAGGCAGAAATCCTCGCAGCCCCACCAAAGATGGTCTACAGCAAACTGATCTTACGGTTTACACGAAAACTTTTGGTTGCAGTTGTGGACGAATGGGACAGCCGTGTGCTTAAAATTGAAAAAGTCATTCCTTCAACTTGGAAGAACAAGCCAGCAGGACGGATTCTAGAGCTTTGTATTCTCCACCTGGCCATGTCTGAAATAACGGTTATTGGAACAAGGCATCAGATTGTCATTAATGAGGCCGTTGATCTTGCAAAACGATTCTGTGATGGAGCAGCACCCCGTATTATTAACGGGTGCCTTAGGACCTTTGTAAAGGACATCAAAGAAATTGATTCAACTCATGCTGGAGACAAGCAAGAAGTCCGGGCATAAGCCTGAACTTTTGTGGTGCCTCAGGACCCTTTGTAAGGATATCGAAGAATTCGACTGAACTCATGCTAGCTCGAGTAAGTGATTCTGGAGTCTCAACTTTTGGGGGTACGTCTATTAAACCTGTTGCTGTCAAAAGCAGCCAAAGTGCAGTGATTTGAAGAATCTCATACTTGCCGCAAACCACAGCAAGAATTTCTTTCCCTTTCCCTTGTAGGCGTGGATTTCTGTTATATCCAAATTATCCATAGAGTTGAGTAATGAGGTGCATGATGTGTGAGAGAATGAGTTGCCAAATTAATCTAGTACTTGGAGAGATGTTATCATGCACATAATGTTGTCAATGTATTGAAGGGTTTACTTTTTTCTTTTTCTTTTTTTTAATTTTTAAGTATGTG

Coding sequence (CDS)

ATGTCTTTAGCTCCACCCACCTCCCCTTATCCGTTCCCCAAATCCTGCCTTCCTAACCATTTATCTTCCTCTTCTCAATCTTCATCTTCCCATCCCAAACTCTTTCCCCCACACTCTCGCTTTCATCTTAGTTTCTCCAACTCCTTTTCAACCCTTCAGTCCCTTAAACCCTCACCTTTCATCGCCAGAGCGTCCGACATTGGACTCCGAGATTCCGTTCATGCTGACCAACCTGGTGATTCTGCCGACAAGACTGGAATCTTCACCCATCGGGATAAAGTTATTACTATCAGGCCTAATTTTCATTTCTCCTACCACATTTCTGGAATATGTCGTTTCCCGTTTCGTGCTAGTTCAATCGTTCCCCACGCAAAAGATCCCATGCCACACCTCTGCCCTCAAGCCTCACTTCGGGCTTCTACTTCTTTTTCTGAAAATTGTGTGGCTGAAGAGAGGAATTCTATTGCAGTTTCTTCCATTGAAACAATACCAAAGGTCGACAAGAGCGGTAAATTTTGTAGCCCCAGAGCTGCTAGAGAGCTCGCTTTGTCAATTGTTTATGCAGCTTGTTTAGAAGGCTCTGATCCCGTTCGACTCTTTGAGAAGCGGTTGAATGCCCGACGAGAATCAGGATATGAATTTGACAAGACATCATTAATGGAATATAATCATATGAGCTTTGGAGGCCCGCCAGTTACCGTAGAAACAGTTGAAGAAGCAGATGAGCTTTTACGCAAGGATGAAAAGGATTCTACAATTGGTAAACAATTATGTGACCTTCTATCTAATGAAAGAATCAAGAGGACTACTAGTTCCAAGGCCCCAAATGTCACTATGGATCAGCAAAAGGAGGCAGAAATCCTCGCAGCCCCACCAAAGATGGTCTACAGCAAACTGATCTTACGGTTTACACGAAAACTTTTGGTTGCAGTTGTGGACGAATGGGACAGCCGTGTGCTTAAAATTGAAAAAGTCATTCCTTCAACTTGGAAGAACAAGCCAGCAGGACGGATTCTAGAGCTTTGTATTCTCCACCTGGCCATGTCTGAAATAACGGTTATTGGAACAAGGCATCAGATTGTCATTAATGAGGCCGTTGATCTTGCAAAACGATTCTGTGATGGAGCAGCACCCCGTATTATTAACGGGTGCCTTAGGACCTTTGTAAAGGACATCAAAGAAATTGATTCAACTCATGCTGGAGACAAGCAAGAAGTCCGGGCATAA

Protein sequence

MSLAPPTSPYPFPKSCLPNHLSSSSQSSSSHPKLFPPHSRFHLSFSNSFSTLQSLKPSPFIARASDIGLRDSVHADQPGDSADKTGIFTHRDKVITIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERIKRTTSSKAPNVTMDQQKEAEILAAPPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVIPSTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA
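
The protein sequence above corresponds to the CDS read codon by codon. The sketch below illustrates this with the standard genetic code, assuming the nuclear standard table applies to this gene; `translate` is a hypothetical helper written for this illustration. It is checked only against the first six and last four codons of the CDS shown above, not against the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGTCTTTAGCTCCACCC"))  # MSLAPP (start of the CDS)
print(translate("GTCCGGGCATAA"))        # VRA (end of the CDS, TAA is the stop)
```

Passing the full CDS string to `translate` in the same way should reproduce the complete protein sequence.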
Homology
BLAST of CcUC02G042230 vs. NCBI nr
Match: XP_038901769.1 (uncharacterized protein LOC120088495 isoform X1 [Benincasa hispida])

HSP 1 Score: 583.9 bits (1504), Expect = 1.0e-162
Identity = 322/433 (74.36%), Positives = 334/433 (77.14%), Query Frame = 0

Query: 1   MSLAPPTSPY-------PFPKSCLPNHLSSSSQSSSSHPKLFPPHSRFHLSFSNSFSTLQ 60
           MSLAPP SPY       P PKSCLP+HLSS +Q   SH KLFP HS  H SFS S STL 
Sbjct: 1   MSLAPPISPYLYSSHSHPLPKSCLPSHLSSLTQ--CSHSKLFPSHSLSHRSFSTSSSTLH 60

Query: 61  SLKPSPFIARASDIGLRDSVHADQ------------------PGDSADKTGIFTHRDKVI 120
           SLK        + +GL D   ADQ                  PG SAD TG+FT  DKVI
Sbjct: 61  SLKSK------THVGLGDLGDADQPCEEYEVELEQLSGLDFAPGASADNTGLFTVGDKVI 120

Query: 121 TIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSI 180
           T RPNFHFS HISGICRFPFRASSIVPH KD M HLCPQASLRASTSF ENCVAE+R+SI
Sbjct: 121 TTRPNFHFSLHISGICRFPFRASSIVPHVKDSMTHLCPQASLRASTSFPENCVAEDRSSI 180

Query: 181 AVSSIETIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFD 240
           +VSS+ETIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLN RRE GYEFD
Sbjct: 181 SVSSVETIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNTRRELGYEFD 240

Query: 241 KTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERIKRTTSSKAP 300
           KTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDS I                      
Sbjct: 241 KTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSII---------------------- 300

Query: 301 NVTMDQQKEAEILAAPPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVIPSTWKNKPA 360
                   EAEILAAPPKMVYSKLILRFTRKLLVAVVD WDSRVLKIEKVIP TWK+KPA
Sbjct: 301 --------EAEILAAPPKMVYSKLILRFTRKLLVAVVDGWDSRVLKIEKVIPPTWKDKPA 360

Query: 361 GRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEI 409
            RILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEI
Sbjct: 361 RRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEI 395

BLAST of CcUC02G042230 vs. NCBI nr
Match: XP_008449897.1 (PREDICTED: uncharacterized protein LOC103491638 isoform X1 [Cucumis melo])

HSP 1 Score: 559.7 bits (1441), Expect = 2.1e-155
Identity = 316/433 (72.98%), Positives = 328/433 (75.75%), Query Frame = 0

Query: 1   MSLAPPTSPY-------PFPKSCLPNHLSSSSQSSSSHPKLFPPHSRFHLSFSNSFSTLQ 60
           MSLAPPTS Y       P PKS    HLSS +Q   SHP LFP    FHLSFS SFSTL 
Sbjct: 1   MSLAPPTSLYLYSSHSHPLPKS----HLSSFTQ--RSHPNLFPVRFLFHLSFSTSFSTLH 60

Query: 61  SLKPSPFIARASDIGLRDSVHADQ------------------PGDSADKTGIFTHRDKVI 120
           S K S F     DIGL DS  A Q                  PG SA KT +F   DKVI
Sbjct: 61  SFKSSAF---KIDIGLGDSHDAGQPREQYQVEQEQPHGLKFAPGASAHKTRLFNVGDKVI 120

Query: 121 TIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSI 180
           T RPNFHFS HISGIC+FPF ASSIVPH K+ MP LC QASLRASTSF EN VAEER+SI
Sbjct: 121 TTRPNFHFSNHISGICQFPFHASSIVPHVKNSMPRLCCQASLRASTSFPENRVAEERSSI 180

Query: 181 AVSSIETIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFD 240
           +VSSIETIPK+DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLN+RRESGYEFD
Sbjct: 181 SVSSIETIPKIDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNSRRESGYEFD 240

Query: 241 KTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERIKRTTSSKAP 300
           KTSLMEYNHMSFGGPPVTVET+EEADELLRKDE+DSTI                      
Sbjct: 241 KTSLMEYNHMSFGGPPVTVETIEEADELLRKDERDSTI---------------------- 300

Query: 301 NVTMDQQKEAEILAAPPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVIPSTWKNKPA 360
                   EAEILAAPPKMVYSKLILRFTRKLLVAVVD WD+R LKIEKVIP TWKNKPA
Sbjct: 301 --------EAEILAAPPKMVYSKLILRFTRKLLVAVVDGWDNRALKIEKVIPPTWKNKPA 360

Query: 361 GRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEI 409
           GRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKE 
Sbjct: 361 GRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKET 394

BLAST of CcUC02G042230 vs. NCBI nr
Match: XP_004149639.3 (uncharacterized protein LOC101216754 isoform X1 [Cucumis sativus] >XP_031739815.1 uncharacterized protein LOC101216754 isoform X1 [Cucumis sativus] >KGN54018.1 hypothetical protein Csa_021656 [Cucumis sativus])

HSP 1 Score: 556.6 bits (1433), Expect = 1.8e-154
Identity = 310/413 (75.06%), Positives = 323/413 (78.21%), Query Frame = 0

Query: 1   MSLAPPTSPYPFPKSCLP---NHLSSSSQSSSSHPKLFPPHSRFHLSFSNSFSTLQSLKP 60
           MSLAPPTS YP+     P   +HLSS +Q   SHP  F     FHLSFS SFSTL SLK 
Sbjct: 1   MSLAPPTSLYPYSSHSHPLPKSHLSSFTQ--RSHPNFFSLCFPFHLSFSTSFSTLHSLKY 60

Query: 61  SPFIARASDIGLRDSVHADQ-------PGDSADKTGIFTHRDKVITIRPNFHFSYHISGI 120
           S F    +D GL DS  ADQ       PG SA KT +FT  DKVIT RPNFHFSYHISGI
Sbjct: 61  SAF---KTDTGLGDSHDADQPHSLKFAPGASAHKTRLFTVGDKVITTRPNFHFSYHISGI 120

Query: 121 CRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSG 180
           C+FPF ASSIVPH KD MP  C QASLRASTSFSEN VAEER+SI++SSIE IPKVDKSG
Sbjct: 121 CQFPFHASSIVPHIKDSMPRFCCQASLRASTSFSENRVAEERSSISISSIEMIPKVDKSG 180

Query: 181 KFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGP 240
           KFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGP
Sbjct: 181 KFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGP 240

Query: 241 PVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERIKRTTSSKAPNVTMDQQKEAEILAA 300
           PVTVET+EEADELLRKDE+DSTI                              EAEILAA
Sbjct: 241 PVTVETIEEADELLRKDERDSTI------------------------------EAEILAA 300

Query: 301 PPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVIPSTWKNKPAGRILELCILHLAMSE 360
           PPK+VYSKLILRFTRKLLVAV D WDSR LKIEKVIP TWKNKPAGRILELCILHLAMSE
Sbjct: 301 PPKIVYSKLILRFTRKLLVAVGDGWDSRALKIEKVIPPTWKNKPAGRILELCILHLAMSE 360

Query: 361 ITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDK 404
           ITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDS  A +K
Sbjct: 361 ITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSMPAREK 378

BLAST of CcUC02G042230 vs. NCBI nr
Match: TYK21741.1 (NusB/RsmB/TIM44 [Cucumis melo var. makuwa])

HSP 1 Score: 530.4 bits (1365), Expect = 1.4e-146
Identity = 289/382 (75.65%), Positives = 300/382 (78.53%), Query Frame = 0

Query: 45  FSNSFSTLQSLKPSPFIARASDIGLRDSVHADQ------------------PGDSADKTG 104
           FS SFSTL S K S F     DIGL DS  A Q                  PG SA KT 
Sbjct: 9   FSTSFSTLHSFKSSAF---KIDIGLGDSHDAGQPREQYQVEQEQPHGLKFAPGASAHKTR 68

Query: 105 IFTHRDKVITIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSEN 164
           +FT  DKVIT RPNFHFS HISGIC+FPF ASSIVPH K+ MP LC QASLRASTSF EN
Sbjct: 69  LFTVGDKVITTRPNFHFSNHISGICQFPFHASSIVPHVKNSMPRLCCQASLRASTSFPEN 128

Query: 165 CVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNA 224
            VAEER+SI+VSSIETIPK+DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLN+
Sbjct: 129 RVAEERSSISVSSIETIPKIDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNS 188

Query: 225 RRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERI 284
           RRESGYEFDKTSLMEYNHMSFGGPPVTVET+EEADELLRKDE+DSTI             
Sbjct: 189 RRESGYEFDKTSLMEYNHMSFGGPPVTVETIEEADELLRKDERDSTI------------- 248

Query: 285 KRTTSSKAPNVTMDQQKEAEILAAPPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVI 344
                            EAEILAAPPKMVYSKLILRFTRKLLVAVVD WD+R LKIEKVI
Sbjct: 249 -----------------EAEILAAPPKMVYSKLILRFTRKLLVAVVDGWDNRALKIEKVI 308

Query: 345 PSTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLR 404
           P TWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLR
Sbjct: 309 PPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLR 357

Query: 405 TFVKDIKEIDSTHAGDKQEVRA 409
           TFVKDIKE DST A +KQEVRA
Sbjct: 369 TFVKDIKETDSTPAREKQEVRA 357

BLAST of CcUC02G042230 vs. NCBI nr
Match: KAA0040119.1 (NusB/RsmB/TIM44 [Cucumis melo var. makuwa])

HSP 1 Score: 513.5 bits (1321), Expect = 1.7e-141
Identity = 316/542 (58.30%), Positives = 328/542 (60.52%), Query Frame = 0

Query: 1   MSLAPPTSPY-------PFPKSCLPNHLSSSSQSSSSHPKLFPPHSRFHLSFSNSFSTLQ 60
           MSLAPPTS Y       P PKS    HLSS +Q   SHP LFP    FHLSFS SFSTL 
Sbjct: 1   MSLAPPTSLYLYSSHSHPLPKS----HLSSFTQ--RSHPNLFPVRFLFHLSFSTSFSTLH 60

Query: 61  SLKPSPFIARASDIGLRDSVHADQ------------------PGDSADKTGIFTHRDKVI 120
           S K S F     DIGL DS  A Q                  PG SA KT +F   DKVI
Sbjct: 61  SFKSSAF---KIDIGLGDSHDAGQPREQYQVEQEQPHGLKFAPGASAHKTRLFNVGDKVI 120

Query: 121 TIR--------------------------------------------------------- 180
           T R                                                         
Sbjct: 121 TTRFFFLCPSTYSVLILYLLPLSFFNSNSASLILLTMQGMVDAGSVVTFALKISFGSFLS 180

Query: 181 ----------------------------------------------------PNFHFSYH 240
                                                               PNFHFS H
Sbjct: 181 SLKFGYTRWHLDEHRDNTTTILYWSTLGVAPELNRLKSLRADWGEELGGVELPNFHFSNH 240

Query: 241 ISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKV 300
           ISGIC+FPF ASSIVPH K+ MP LC QASLRASTSF EN VAEER+SI+VSSIETIPK+
Sbjct: 241 ISGICQFPFHASSIVPHVKNSMPRLCCQASLRASTSFPENRVAEERSSISVSSIETIPKI 300

Query: 301 DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMS 360
           DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLN+RRESGYEFDKTSLMEYNHMS
Sbjct: 301 DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNSRRESGYEFDKTSLMEYNHMS 360

Query: 361 FGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERIKRTTSSKAPNVTMDQQKEAE 409
           FGGPPVTVET+EEADELLRKDE+DSTI                              EAE
Sbjct: 361 FGGPPVTVETIEEADELLRKDERDSTI------------------------------EAE 420

BLAST of CcUC02G042230 vs. ExPASy Swiss-Prot
Match: Q18B61 (Transcription antitermination protein NusB OS=Clostridioides difficile (strain 630) OX=272563 GN=nusB PE=3 SV=1)

HSP 1 Score: 54.7 bits (130), Expect = 2.9e-06
Identity = 29/74 (39.19%), Positives = 48/74 (64.86%), Query Frame = 0

Query: 321 KIEKVIPSTWKNKPAGRI--LELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAP 380
           KI+++I    KN    R+  +++ IL L++ EI  + T +++ INEAV+LAK +CD  +P
Sbjct: 94  KIDELINKHAKNWTVDRMPKVDVSILRLSVCEILYLDTPNKVSINEAVELAKIYCDDKSP 153

Query: 381 RIINGCLRTFVKDI 393
           + ING L + V +I
Sbjct: 154 KFINGILGSVVDEI 167

BLAST of CcUC02G042230 vs. ExPASy Swiss-Prot
Match: B1WXY6 (Transcription antitermination protein NusB OS=Crocosphaera subtropica (strain ATCC 51142 / BH68) OX=43989 GN=nusB PE=3 SV=1)

HSP 1 Score: 49.7 bits (117), Expect = 9.2e-05
Identity = 30/91 (32.97%), Positives = 51/91 (56.04%), Query Frame = 0

Query: 305 RKLLVAVVDEWDSRVLKIEKVIPSTWKNKPAGRI--LELCILHLAMSEITVIGTRHQIVI 364
           R+  + ++   + R  +I++ + +  K+    R+  ++  IL LA++EI  +    ++ I
Sbjct: 117 REYAIELIGTINRRRKEIDEQLEAVLKDWQLKRLAKIDQDILRLAVAEILFLDVPEKVSI 176

Query: 365 NEAVDLAKRFCDGAAPRIINGCLRTFVKDIK 394
           NEAV+LAKR+ D    R ING LR F   IK
Sbjct: 177 NEAVELAKRYSDDDGYRFINGVLRRFTDHIK 207

BLAST of CcUC02G042230 vs. ExPASy Swiss-Prot
Match: Q8GIR7 (Transcription antitermination protein NusB OS=Synechococcus elongatus (strain PCC 7942 / FACHB-805) OX=1140 GN=nusB PE=3 SV=1)

HSP 1 Score: 48.5 bits (114), Expect = 2.1e-04
Identity = 25/48 (52.08%), Positives = 31/48 (64.58%), Query Frame = 0

Query: 339 LELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLR 387
           L+  IL LA +EI  +GT  Q+ INEAV+LA R+ D    R ING LR
Sbjct: 149 LDQDILRLAAAEILFLGTPEQVAINEAVELANRYSDEEGRRFINGVLR 196

BLAST of CcUC02G042230 vs. ExPASy Swiss-Prot
Match: Q5N1J7 (Transcription antitermination protein NusB OS=Synechococcus sp. (strain ATCC 27144 / PCC 6301 / SAUG 1402/1) OX=269084 GN=nusB PE=3 SV=1)

HSP 1 Score: 48.5 bits (114), Expect = 2.1e-04
Identity = 25/48 (52.08%), Positives = 31/48 (64.58%), Query Frame = 0

Query: 339 LELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLR 387
           L+  IL LA +EI  +GT  Q+ INEAV+LA R+ D    R ING LR
Sbjct: 149 LDQDILRLAAAEILFLGTPEQVAINEAVELANRYSDEEGRRFINGVLR 196

BLAST of CcUC02G042230 vs. ExPASy Swiss-Prot
Match: A7GWZ7 (Transcription antitermination protein NusB OS=Campylobacter curvus (strain 525.92) OX=360105 GN=nusB PE=3 SV=1)

HSP 1 Score: 48.1 bits (113), Expect = 2.7e-04
Identity = 28/82 (34.15%), Positives = 47/82 (57.32%), Query Frame = 0

Query: 309 VAVVDEWDSRVLK---IEKVIPSTWKNKPAGR--ILELCILHLAMSEITVIGTRHQIVIN 368
           +  ++ +D+  LK   +++++    K K   R  I+EL IL L + E+   GT   ++IN
Sbjct: 43  IQALETFDAICLKKGELDEILKPYLKEKDIERIGIVELAILRLGVYEMKFTGTDKAVIIN 102

Query: 369 EAVDLAKRFCDGAAPRIINGCL 386
           EA++LAK     +AP+ ING L
Sbjct: 103 EAIELAKELGGDSAPKFINGVL 124

BLAST of CcUC02G042230 vs. ExPASy TrEMBL
Match: A0A1S3BNR2 (uncharacterized protein LOC103491638 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103491638 PE=3 SV=1)

HSP 1 Score: 559.7 bits (1441), Expect = 1.0e-155
Identity = 316/433 (72.98%), Positives = 328/433 (75.75%), Query Frame = 0

Query: 1   MSLAPPTSPY-------PFPKSCLPNHLSSSSQSSSSHPKLFPPHSRFHLSFSNSFSTLQ 60
           MSLAPPTS Y       P PKS    HLSS +Q   SHP LFP    FHLSFS SFSTL 
Sbjct: 1   MSLAPPTSLYLYSSHSHPLPKS----HLSSFTQ--RSHPNLFPVRFLFHLSFSTSFSTLH 60

Query: 61  SLKPSPFIARASDIGLRDSVHADQ------------------PGDSADKTGIFTHRDKVI 120
           S K S F     DIGL DS  A Q                  PG SA KT +F   DKVI
Sbjct: 61  SFKSSAF---KIDIGLGDSHDAGQPREQYQVEQEQPHGLKFAPGASAHKTRLFNVGDKVI 120

Query: 121 TIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSI 180
           T RPNFHFS HISGIC+FPF ASSIVPH K+ MP LC QASLRASTSF EN VAEER+SI
Sbjct: 121 TTRPNFHFSNHISGICQFPFHASSIVPHVKNSMPRLCCQASLRASTSFPENRVAEERSSI 180

Query: 181 AVSSIETIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFD 240
           +VSSIETIPK+DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLN+RRESGYEFD
Sbjct: 181 SVSSIETIPKIDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNSRRESGYEFD 240

Query: 241 KTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERIKRTTSSKAP 300
           KTSLMEYNHMSFGGPPVTVET+EEADELLRKDE+DSTI                      
Sbjct: 241 KTSLMEYNHMSFGGPPVTVETIEEADELLRKDERDSTI---------------------- 300

Query: 301 NVTMDQQKEAEILAAPPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVIPSTWKNKPA 360
                   EAEILAAPPKMVYSKLILRFTRKLLVAVVD WD+R LKIEKVIP TWKNKPA
Sbjct: 301 --------EAEILAAPPKMVYSKLILRFTRKLLVAVVDGWDNRALKIEKVIPPTWKNKPA 360

Query: 361 GRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEI 409
           GRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKE 
Sbjct: 361 GRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKET 394

BLAST of CcUC02G042230 vs. ExPASy TrEMBL
Match: A0A0A0KWZ5 (NusB domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G268020 PE=3 SV=1)

HSP 1 Score: 556.6 bits (1433), Expect = 8.6e-155
Identity = 310/413 (75.06%), Positives = 323/413 (78.21%), Query Frame = 0

Query: 1   MSLAPPTSPYPFPKSCLP---NHLSSSSQSSSSHPKLFPPHSRFHLSFSNSFSTLQSLKP 60
           MSLAPPTS YP+     P   +HLSS +Q   SHP  F     FHLSFS SFSTL SLK 
Sbjct: 1   MSLAPPTSLYPYSSHSHPLPKSHLSSFTQ--RSHPNFFSLCFPFHLSFSTSFSTLHSLKY 60

Query: 61  SPFIARASDIGLRDSVHADQ-------PGDSADKTGIFTHRDKVITIRPNFHFSYHISGI 120
           S F    +D GL DS  ADQ       PG SA KT +FT  DKVIT RPNFHFSYHISGI
Sbjct: 61  SAF---KTDTGLGDSHDADQPHSLKFAPGASAHKTRLFTVGDKVITTRPNFHFSYHISGI 120

Query: 121 CRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSG 180
           C+FPF ASSIVPH KD MP  C QASLRASTSFSEN VAEER+SI++SSIE IPKVDKSG
Sbjct: 121 CQFPFHASSIVPHIKDSMPRFCCQASLRASTSFSENRVAEERSSISISSIEMIPKVDKSG 180

Query: 181 KFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGP 240
           KFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGP
Sbjct: 181 KFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGP 240

Query: 241 PVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERIKRTTSSKAPNVTMDQQKEAEILAA 300
           PVTVET+EEADELLRKDE+DSTI                              EAEILAA
Sbjct: 241 PVTVETIEEADELLRKDERDSTI------------------------------EAEILAA 300

Query: 301 PPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVIPSTWKNKPAGRILELCILHLAMSE 360
           PPK+VYSKLILRFTRKLLVAV D WDSR LKIEKVIP TWKNKPAGRILELCILHLAMSE
Sbjct: 301 PPKIVYSKLILRFTRKLLVAVGDGWDSRALKIEKVIPPTWKNKPAGRILELCILHLAMSE 360

Query: 361 ITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDK 404
           ITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDS  A +K
Sbjct: 361 ITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSMPAREK 378

BLAST of CcUC02G042230 vs. ExPASy TrEMBL
Match: A0A5D3DDJ4 (NusB/RsmB/TIM44 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold859G001130 PE=3 SV=1)

HSP 1 Score: 530.4 bits (1365), Expect = 6.6e-147
Identity = 289/382 (75.65%), Positives = 300/382 (78.53%), Query Frame = 0

Query: 45  FSNSFSTLQSLKPSPFIARASDIGLRDSVHADQ------------------PGDSADKTG 104
           FS SFSTL S K S F     DIGL DS  A Q                  PG SA KT 
Sbjct: 9   FSTSFSTLHSFKSSAF---KIDIGLGDSHDAGQPREQYQVEQEQPHGLKFAPGASAHKTR 68

Query: 105 IFTHRDKVITIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSEN 164
           +FT  DKVIT RPNFHFS HISGIC+FPF ASSIVPH K+ MP LC QASLRASTSF EN
Sbjct: 69  LFTVGDKVITTRPNFHFSNHISGICQFPFHASSIVPHVKNSMPRLCCQASLRASTSFPEN 128

Query: 165 CVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNA 224
            VAEER+SI+VSSIETIPK+DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLN+
Sbjct: 129 RVAEERSSISVSSIETIPKIDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNS 188

Query: 225 RRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERI 284
           RRESGYEFDKTSLMEYNHMSFGGPPVTVET+EEADELLRKDE+DSTI             
Sbjct: 189 RRESGYEFDKTSLMEYNHMSFGGPPVTVETIEEADELLRKDERDSTI------------- 248

Query: 285 KRTTSSKAPNVTMDQQKEAEILAAPPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVI 344
                            EAEILAAPPKMVYSKLILRFTRKLLVAVVD WD+R LKIEKVI
Sbjct: 249 -----------------EAEILAAPPKMVYSKLILRFTRKLLVAVVDGWDNRALKIEKVI 308

Query: 345 PSTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLR 404
           P TWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLR
Sbjct: 309 PPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLR 357

Query: 405 TFVKDIKEIDSTHAGDKQEVRA 409
           TFVKDIKE DST A +KQEVRA
Sbjct: 369 TFVKDIKETDSTPAREKQEVRA 357

BLAST of CcUC02G042230 vs. ExPASy TrEMBL
Match: A0A5A7TFT7 (NusB/RsmB/TIM44 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold366G00940 PE=3 SV=1)

HSP 1 Score: 513.5 bits (1321), Expect = 8.3e-142
Identity = 316/542 (58.30%), Positives = 328/542 (60.52%), Query Frame = 0

Query: 1   MSLAPPTSPY-------PFPKSCLPNHLSSSSQSSSSHPKLFPPHSRFHLSFSNSFSTLQ 60
           MSLAPPTS Y       P PKS    HLSS +Q   SHP LFP    FHLSFS SFSTL 
Sbjct: 1   MSLAPPTSLYLYSSHSHPLPKS----HLSSFTQ--RSHPNLFPVRFLFHLSFSTSFSTLH 60

Query: 61  SLKPSPFIARASDIGLRDSVHADQ------------------PGDSADKTGIFTHRDKVI 120
           S K S F     DIGL DS  A Q                  PG SA KT +F   DKVI
Sbjct: 61  SFKSSAF---KIDIGLGDSHDAGQPREQYQVEQEQPHGLKFAPGASAHKTRLFNVGDKVI 120

Query: 121 TIR--------------------------------------------------------- 180
           T R                                                         
Sbjct: 121 TTRFFFLCPSTYSVLILYLLPLSFFNSNSASLILLTMQGMVDAGSVVTFALKISFGSFLS 180

Query: 181 ----------------------------------------------------PNFHFSYH 240
                                                               PNFHFS H
Sbjct: 181 SLKFGYTRWHLDEHRDNTTTILYWSTLGVAPELNRLKSLRADWGEELGGVELPNFHFSNH 240

Query: 241 ISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKV 300
           ISGIC+FPF ASSIVPH K+ MP LC QASLRASTSF EN VAEER+SI+VSSIETIPK+
Sbjct: 241 ISGICQFPFHASSIVPHVKNSMPRLCCQASLRASTSFPENRVAEERSSISVSSIETIPKI 300

Query: 301 DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMS 360
           DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLN+RRESGYEFDKTSLMEYNHMS
Sbjct: 301 DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNSRRESGYEFDKTSLMEYNHMS 360

Query: 361 FGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERIKRTTSSKAPNVTMDQQKEAE 409
           FGGPPVTVET+EEADELLRKDE+DSTI                              EAE
Sbjct: 361 FGGPPVTVETIEEADELLRKDERDSTI------------------------------EAE 420

BLAST of CcUC02G042230 vs. ExPASy TrEMBL
Match: A0A1S4DXQ5 (uncharacterized protein LOC103491638 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103491638 PE=3 SV=1)

HSP 1 Score: 492.7 bits (1267), Expect = 1.5e-135
Identity = 256/310 (82.58%), Positives = 266/310 (85.81%), Query Frame = 0

Query: 99  PNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVS 158
           PNFHFS HISGIC+FPF ASSIVPH K+ MP LC QASLRASTSF EN VAEER+SI+VS
Sbjct: 15  PNFHFSNHISGICQFPFHASSIVPHVKNSMPRLCCQASLRASTSFPENRVAEERSSISVS 74

Query: 159 SIETIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTS 218
           SIETIPK+DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLN+RRESGYEFDKTS
Sbjct: 75  SIETIPKIDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNSRRESGYEFDKTS 134

Query: 219 LMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERIKRTTSSKAPNVT 278
           LMEYNHMSFGGPPVTVET+EEADELLRKDE+DSTI                         
Sbjct: 135 LMEYNHMSFGGPPVTVETIEEADELLRKDERDSTI------------------------- 194

Query: 279 MDQQKEAEILAAPPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVIPSTWKNKPAGRI 338
                EAEILAAPPKMVYSKLILRFTRKLLVAVVD WD+R LKIEKVIP TWKNKPAGRI
Sbjct: 195 -----EAEILAAPPKMVYSKLILRFTRKLLVAVVDGWDNRALKIEKVIPPTWKNKPAGRI 254

Query: 339 LELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDST 398
           LELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKE DST
Sbjct: 255 LELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKETDST 294

Query: 399 HAGDKQEVRA 409
            A +KQEVRA
Sbjct: 315 PAREKQEVRA 294

BLAST of CcUC02G042230 vs. TAIR 10
Match: AT4G26370.1 (antitermination NusB domain-containing protein )

HSP 1 Score: 306.2 bits (783), Expect = 3.9e-83
Identity = 159/245 (64.90%), Positives = 183/245 (74.69%), Query Frame = 0

Query: 163 IPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEY 222
           +PK+DKSG+  SPRAARELAL I+YAACLEGSDP+RLFEKR+NARRE GYEFDK+SL+EY
Sbjct: 85  MPKIDKSGRLSSPRAARELALVILYAACLEGSDPIRLFEKRINARREPGYEFDKSSLLEY 144

Query: 223 NHMSFGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERIKRTTSSKAPNVTMDQQ 282
           NHMSFGGPPV  ET EE DEL+R DEK+S I                             
Sbjct: 145 NHMSFGGPPVKTETKEEEDELVRHDEKESKI----------------------------- 204

Query: 283 KEAEILAAPPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVIPSTWKNKPAGRILELC 342
            EAE+L+APPK+VYSKL+LRF +KLL AVVD+WDS V+ IEK+ P  WK+ PAGRILE  
Sbjct: 205 -EAEVLSAPPKLVYSKLVLRFAKKLLAAVVDKWDSHVVIIEKISPPDWKSAPAGRILEFS 264

Query: 343 ILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGD 402
           ILHLAMSE+ V+ TRH IVINEAVDLAKRFCDG+APRIINGCLRTFVKD     +  A +
Sbjct: 265 ILHLAMSEVAVLETRHPIVINEAVDLAKRFCDGSAPRIINGCLRTFVKDRATTSTPQALE 299

Query: 403 -KQEV 407
            KQEV
Sbjct: 325 LKQEV 299

BLAST of CcUC02G042230 vs. TAIR 10
Match: AT4G26370.2 (antitermination NusB domain-containing protein )

HSP 1 Score: 155.6 bits (392), Expect = 8.5e-38
Identity = 73/95 (76.84%), Positives = 84/95 (88.42%), Query Frame = 0

Query: 163 IPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEY 222
           +PK+DKSG+  SPRAARELAL I+YAACLEGSDP+RLFEKR+NARRE GYEFDK+SL+EY
Sbjct: 85  MPKIDKSGRLSSPRAARELALVILYAACLEGSDPIRLFEKRINARREPGYEFDKSSLLEY 144

Query: 223 NHMSFGGPPVTVETVEEADELLRKDEKDSTIGKQL 258
           NHMSFGGPPV  ET EE DEL+R DEK+S IG+ L
Sbjct: 145 NHMSFGGPPVKTETKEEEDELVRHDEKESKIGRSL 179

The following BLAST results are available for this feature:
NCBI nr
Match Name        E-value     Identity   Description
XP_038901769.1    1.0e-162    74.36      uncharacterized protein LOC120088495 isoform X1 [Benincasa hispida]
XP_008449897.1    2.1e-155    72.98      PREDICTED: uncharacterized protein LOC103491638 isoform X1 [Cucumis melo]
XP_004149639.3    1.8e-154    75.06      uncharacterized protein LOC101216754 isoform X1 [Cucumis sativus] >XP_031739815....
TYK21741.1        1.4e-146    75.65      NusB/RsmB/TIM44 [Cucumis melo var. makuwa]
KAA0040119.1      1.7e-141    58.30      NusB/RsmB/TIM44 [Cucumis melo var. makuwa]

ExPASy Swiss-Prot
Match Name        E-value     Identity   Description
Q18B61            2.9e-06     39.19      Transcription antitermination protein NusB OS=Clostridioides difficile (strain 6...
B1WXY6            9.2e-05     32.97      Transcription antitermination protein NusB OS=Crocosphaera subtropica (strain AT...
Q8GIR7            2.1e-04     52.08      Transcription antitermination protein NusB OS=Synechococcus elongatus (strain PC...
Q5N1J7            2.1e-04     52.08      Transcription antitermination protein NusB OS=Synechococcus sp. (strain ATCC 271...
A7GWZ7            2.7e-04     34.15      Transcription antitermination protein NusB OS=Campylobacter curvus (strain 525.9...

ExPASy TrEMBL
Match Name        E-value     Identity   Description
A0A1S3BNR2        1.0e-155    72.98      uncharacterized protein LOC103491638 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A0A0KWZ5        8.6e-155    75.06      NusB domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G268020 PE=3 S...
A0A5D3DDJ4        6.6e-147    75.65      NusB/RsmB/TIM44 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold859G0011...
A0A5A7TFT7        8.3e-142    58.30      NusB/RsmB/TIM44 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold366G0094...
A0A1S4DXQ5        1.5e-135    82.58      uncharacterized protein LOC103491638 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...

TAIR 10
Match Name        E-value     Identity   Description
AT4G26370.1       3.9e-83     64.90      antitermination NusB domain-containing protein
AT4G26370.2       8.5e-38     76.84      antitermination NusB domain-containing protein
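
The identity percentages listed for each hit follow directly from the residue counts on the corresponding HSP statistics line (identical residues divided by alignment length). The sketch below recomputes them for the first NCBI nr hit; `hsp_percentages` is a hypothetical helper, and the regular expression is an assumption about the line layout used on this page, written to accept both the "Positives" spelling and the "Postives" misspelling that some exports contain.

```python
import re

# Matches e.g. "Identity = 322/433 (...), Positives = 334/433 (...)".
HSP_STATS = re.compile(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)")

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

line = "Identity = 322/433 (74.36%), Positives = 334/433 (77.14%), Query Frame = 0"
print(hsp_percentages(line))  # (74.36, 77.14)
```

Note that the denominator is the alignment length including gaps (433 here), not the query length of 409 residues, which is why a hit with long insertions can show a much lower identity than its aligned blocks suggest.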
InterPro
Analysis Name: InterPro Annotations of Watermelon (PI 537277) v1
Date Performed: 2022-01-31
IPR Term     IPR Description               Source        Source Term    Source Description                          Alignment
IPR035926    NusB-like superfamily         GENE3D        1.10.940.10    -                                           coord: 175..403; e-value: 7.5E-19; score: 70.0
IPR035926    NusB-like superfamily         SUPERFAMILY   48013          NusB-like                                   coord: 301..392
IPR006027    NusB/RsmB/TIM44               PFAM          PF01029        NusB                                        coord: 305..390; e-value: 5.8E-11; score: 42.8
IPR011605    NusB antitermination factor   PANTHER       PTHR11078      N UTILIZATION SUBSTANCE PROTEIN B-RELATED   coord: 278..400; coord: 32..254
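
The `coord` values in the InterPro alignments give the residue range of each match on the 409-residue protein. As a rough sketch, assuming these are 1-based inclusive protein coordinates (the page does not say so explicitly), the length of each match can be derived as follows; `coord_length` is a hypothetical helper.

```python
import re

def coord_length(text):
    """Length in residues of a 'coord: start..end' range (1-based, inclusive)."""
    m = re.search(r"(\d+)\.\.(\d+)", text)
    if m is None:
        raise ValueError(f"no coordinate range in {text!r}")
    start, end = int(m.group(1)), int(m.group(2))
    return end - start + 1

hits = {
    "GENE3D 1.10.940.10": "coord: 175..403",
    "SUPERFAMILY 48013": "coord: 301..392",
    "PFAM PF01029": "coord: 305..390",
}
for name, coord in hits.items():
    print(name, coord_length(coord))
# GENE3D 1.10.940.10 229
# SUPERFAMILY 48013 92
# PFAM PF01029 86
```

The Pfam NusB match (residues 305..390) lies within the region where the Swiss-Prot NusB hits above align to the query.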

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
CcUC02G042230.1    CcUC02G042230.1    mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession   Term Name
biological_process    GO:0006355       regulation of transcription, DNA-templated
biological_process    GO:0031564       transcription antitermination
biological_process    GO:0006353       DNA-templated transcription, termination
molecular_function    GO:0003723       RNA binding