Cla97C01G020560 (gene) Watermelon (97103) v2

Name: Cla97C01G020560
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: tRNA uridine 5-carboxymethylaminomethyl modification enzyme MnmG
Location: Cla97Chr01 : 33160477 .. 33161025 (-)
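
The feature length follows from the inclusive start and end coordinates in the Location field. A minimal sketch of that arithmetic, assuming the location string keeps the "start .. end" layout shown above; the helper name `feature_length` is hypothetical and not part of the database:

```python
import re


def feature_length(location: str) -> int:
    """Return the inclusive length of a 'chrom : start .. end (strand)' location.

    Hypothetical helper for illustration; assumes 1-based inclusive coordinates.
    """
    match = re.search(r"(\d+)\s*\.\.\s*(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1


length_bp = feature_length("Cla97Chr01 : 33160477 .. 33161025 (-)")
codons = length_bp // 3  # whole codons if the span is a single-exon CDS
print(length_bp, codons)
```

The result is a multiple of three, which is consistent with the gene, mRNA and CDS sequences below being identical (a single exon with no intron).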
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTGCGGCTCTCAACCCTTACAGTTAATAACGACTAATTATCTGTTCTTTTGTACTTTTAATCTTTTGCATAGCATCAGCAGTGAGAAGCACAACCGCACAGCCATGTCTCGGATTGACGATTTACCATATTCTAGGCCACCAAAACAGAAAGCTTTGCACCAAAAAGCTTCCACAGCATCTCATGTACAAGTGAAAGATCCCAACGATCGAATTAAGGGGTCTTTTCCAGCACAAACAGCACAGAAGGCCTCCAAAAGAAGTTTGAAGAATGAACCATTTATAGCATTTCAGCACCCCGAAAGGTCGAATTCGGATTCTCTGCCTGATTCATCTGCATCTGGGAATGAGTACCGTGCCTTAAGAAGAAAATATCTACTCCTGGAGGAAGAGAGCTTTACTCTGGGAGCAGAACTGAAAGAAGTTGAAGATGAAGTGAAGACCCTAGAAGAAGAAAAGCTTGGTCTCTTGGATGAACTTCTTGTTTTAGAAGGACTAATCAATCGTTCAGAATTGCAGCTTGCCCAGTCAAATTTACCACAACTTTAG

mRNA sequence

ATGTGCGGCTCTCAACCCTTACAGTTAATAACGACTAATTATCTGTTCTTTTGTACTTTTAATCTTTTGCATAGCATCAGCAGTGAGAAGCACAACCGCACAGCCATGTCTCGGATTGACGATTTACCATATTCTAGGCCACCAAAACAGAAAGCTTTGCACCAAAAAGCTTCCACAGCATCTCATGTACAAGTGAAAGATCCCAACGATCGAATTAAGGGGTCTTTTCCAGCACAAACAGCACAGAAGGCCTCCAAAAGAAGTTTGAAGAATGAACCATTTATAGCATTTCAGCACCCCGAAAGGTCGAATTCGGATTCTCTGCCTGATTCATCTGCATCTGGGAATGAGTACCGTGCCTTAAGAAGAAAATATCTACTCCTGGAGGAAGAGAGCTTTACTCTGGGAGCAGAACTGAAAGAAGTTGAAGATGAAGTGAAGACCCTAGAAGAAGAAAAGCTTGGTCTCTTGGATGAACTTCTTGTTTTAGAAGGACTAATCAATCGTTCAGAATTGCAGCTTGCCCAGTCAAATTTACCACAACTTTAG

Coding sequence (CDS)

ATGTGCGGCTCTCAACCCTTACAGTTAATAACGACTAATTATCTGTTCTTTTGTACTTTTAATCTTTTGCATAGCATCAGCAGTGAGAAGCACAACCGCACAGCCATGTCTCGGATTGACGATTTACCATATTCTAGGCCACCAAAACAGAAAGCTTTGCACCAAAAAGCTTCCACAGCATCTCATGTACAAGTGAAAGATCCCAACGATCGAATTAAGGGGTCTTTTCCAGCACAAACAGCACAGAAGGCCTCCAAAAGAAGTTTGAAGAATGAACCATTTATAGCATTTCAGCACCCCGAAAGGTCGAATTCGGATTCTCTGCCTGATTCATCTGCATCTGGGAATGAGTACCGTGCCTTAAGAAGAAAATATCTACTCCTGGAGGAAGAGAGCTTTACTCTGGGAGCAGAACTGAAAGAAGTTGAAGATGAAGTGAAGACCCTAGAAGAAGAAAAGCTTGGTCTCTTGGATGAACTTCTTGTTTTAGAAGGACTAATCAATCGTTCAGAATTGCAGCTTGCCCAGTCAAATTTACCACAACTTTAG

Protein sequence

MCGSQPLQLITTNYLFFCTFNLLHSISSEKHNRTAMSRIDDLPYSRPPKQKALHQKASTASHVQVKDPNDRIKGSFPAQTAQKASKRSLKNEPFIAFQHPERSNSDSLPDSSASGNEYRALRRKYLLLEEESFTLGAELKEVEDEVKTLEEEKLGLLDELLVLEGLINRSELQLAQSNLPQL
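
The protein sequence above can be checked against the CDS by translating it with the standard genetic code. The sketch below is illustrative only: the `translate` helper is an assumption of this example, and only the first 30 and last 21 nucleotides of the CDS are used to keep it short.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varying fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 21 nucleotides of the CDS listed above.
head = translate("ATGTGCGGCTCTCAACCCTTACAGTTAATA")
tail = translate("TCAAATTTACCACAACTTTAG")
print(head, tail)
```

Passing the full CDS string to `translate` and dropping the trailing `*` should reproduce the protein sequence shown above.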
BLAST of Cla97C01G020560 vs. NCBI nr
Match: XP_016899190.1 (PREDICTED: uncharacterized protein LOC103485204 [Cucumis melo])

HSP 1 Score: 226.9 bits (577), Expect = 5.7e-56
Identity = 134/148 (90.54%), Positives = 139/148 (93.92%), Query Frame = 0

Query: 36  MSRIDDLPYSRPPKQKALHQKASTASHVQVKDPNDRIKGSFPAQTAQKASKRSLKNEPFI 95
           MSRIDDLPYSRPPKQKALHQKASTASH+Q+KDP++RIKGSFPAQTAQKASKRSLKNEP I
Sbjct: 1   MSRIDDLPYSRPPKQKALHQKASTASHLQLKDPSNRIKGSFPAQTAQKASKRSLKNEPSI 60

Query: 96  AFQHPERSNSDSLPD-SSASGNEYRALRRKYLLLEEESFTLGAELKEVEDXXXXXXXXXX 155
            FQ PERSNSDSLPD SSASGNEYRALRRKYLLLEEESFTLGAELKEV  XXXXXXXXXX
Sbjct: 61  VFQQPERSNSDSLPDSSSASGNEYRALRRKYLLLEEESFTLGAELKEVXXXXXXXXXXXX 120

Query: 156 GLLDELLVLEGLINRSELQLAQSNLPQL 183
            LLDELLVLEGLI+RSELQLA SNLP L
Sbjct: 121 XLLDELLVLEGLIDRSELQLAHSNLPHL 148
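
The percentages on each HSP statistics line are the listed counts divided by the alignment length. A minimal sketch of that calculation, assuming the "label = matches/length" layout used in these reports; `hsp_percentages` is a hypothetical helper name:

```python
import re


def hsp_percentages(stats_line: str) -> dict:
    """Recompute the percentage for every 'label = matches/length' pair on a line."""
    result = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", stats_line):
        result[label] = round(100 * int(matches) / int(length), 2)
    return result


stats = hsp_percentages(
    "Identity = 134/148 (90.54%), Positives = 139/148 (93.92%), Query Frame = 0"
)
print(stats)
```

The "Query Frame" field has no matches/length pair, so the pattern skips it.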

BLAST of Cla97C01G020560 vs. NCBI nr
Match: XP_011658091.1 (PREDICTED: uncharacterized protein LOC101212199 [Cucumis sativus] >KGN49006.1 hypothetical protein Csa_6G510240 [Cucumis sativus])

HSP 1 Score: 222.6 bits (566), Expect = 1.1e-54
Identity = 130/146 (89.04%), Positives = 134/146 (91.78%), Query Frame = 0

Query: 36  MSRIDDLPYSRPPKQKALHQKASTASHVQVKDPNDRIKGSFPAQTAQKASKRSLKNEPFI 95
           MSRIDDLPYSRPPKQKALHQKASTASHVQVKDP+DRIKGSFPAQTAQKASKRSLKNEP +
Sbjct: 1   MSRIDDLPYSRPPKQKALHQKASTASHVQVKDPSDRIKGSFPAQTAQKASKRSLKNEPSV 60

Query: 96  AFQHPERSNSDSLPDSSASGNEYRALRRKYLLLEEESFTLGAELKEVEDXXXXXXXXXXG 155
            FQ PERSNSDSLPDSSASGNEYRALRRKYLLLEEESF+LGAELK V  XXXXXXXXXX 
Sbjct: 61  VFQQPERSNSDSLPDSSASGNEYRALRRKYLLLEEESFSLGAELKGVXXXXXXXXXXXXX 120

Query: 156 LLDELLVLEGLINRSELQLAQSNLPQ 182
               LLVLEGLI+RSELQLA SNLPQ
Sbjct: 121 XXXXLLVLEGLIDRSELQLAHSNLPQ 146

BLAST of Cla97C01G020560 vs. NCBI nr
Match: XP_022950616.1 (uncharacterized protein LOC111453661 [Cucurbita moschata])

HSP 1 Score: 176.4 bits (446), Expect = 8.8e-41
Identity = 110/146 (75.34%), Positives = 115/146 (78.77%), Query Frame = 0

Query: 36  MSRIDDLPYSRPPKQKALHQKASTASHVQVKDPNDRIKGSFPAQTAQKASKRSLKNEPFI 95
           MSRIDDLP+SRPPKQKA HQKASTASH+Q KDPNDRIKGSFPAQTAQK+SKRSLKNEP +
Sbjct: 1   MSRIDDLPFSRPPKQKASHQKASTASHLQGKDPNDRIKGSFPAQTAQKSSKRSLKNEPSV 60

Query: 96  AFQHPERSNSDSLPD-SSASGNEYRALRRKYLLLEEESFTLGAELKEVEDXXXXXXXXXX 155
           A Q PERSNSDSLPD SSASGNEYR LRRKYLLL                XXXXXXXXXX
Sbjct: 61  ALQQPERSNSDSLPDSSSASGNEYRVLRRKYLLLXXXXXXXXXXXXXXXXXXXXXXXXXX 120

Query: 156 GLLDELLVLEGLINRSELQLAQSNLP 181
                  VLEGLI+RSELQLA SNLP
Sbjct: 121 XXXXXXXVLEGLIDRSELQLAHSNLP 146

BLAST of Cla97C01G020560 vs. NCBI nr
Match: XP_023545100.1 (uncharacterized protein LOC111804503 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 176.0 bits (445), Expect = 1.1e-40
Identity = 110/146 (75.34%), Positives = 114/146 (78.08%), Query Frame = 0

Query: 36  MSRIDDLPYSRPPKQKALHQKASTASHVQVKDPNDRIKGSFPAQTAQKASKRSLKNEPFI 95
           MSRIDDLPYSRPPKQKA HQKASTASH+Q KDPNDRIKGSFPAQT QK+SKRSLKNEP +
Sbjct: 1   MSRIDDLPYSRPPKQKASHQKASTASHLQGKDPNDRIKGSFPAQTPQKSSKRSLKNEPSV 60

Query: 96  AFQHPERSNSDSLPD-SSASGNEYRALRRKYLLLEEESFTLGAELKEVEDXXXXXXXXXX 155
           A Q PERSNSDSLPD SSASGNEYR LRRKYLLL                XXXXXXXXXX
Sbjct: 61  ALQQPERSNSDSLPDSSSASGNEYRVLRRKYLLLXXXXXXXXXXXXXXXXXXXXXXXXXX 120

Query: 156 GLLDELLVLEGLINRSELQLAQSNLP 181
                  VLEGLI+RSELQLA SNLP
Sbjct: 121 XXXXXXXVLEGLIDRSELQLAHSNLP 146

BLAST of Cla97C01G020560 vs. NCBI nr
Match: XP_022978374.1 (uncharacterized protein LOC111478385 [Cucurbita maxima])

HSP 1 Score: 174.1 bits (440), Expect = 4.4e-40
Identity = 109/146 (74.66%), Positives = 113/146 (77.40%), Query Frame = 0

Query: 36  MSRIDDLPYSRPPKQKALHQKASTASHVQVKDPNDRIKGSFPAQTAQKASKRSLKNEPFI 95
           MSRIDDLPYSRPPKQKA H KASTASH+Q KDPNDRIKGSFPAQT QK+SKRSLKNEP +
Sbjct: 1   MSRIDDLPYSRPPKQKASHPKASTASHLQGKDPNDRIKGSFPAQTTQKSSKRSLKNEPSV 60

Query: 96  AFQHPERSNSDSLPD-SSASGNEYRALRRKYLLLEEESFTLGAELKEVEDXXXXXXXXXX 155
           A Q PERSNSDSLPD SSASGNEYR LRRKYLLL                XXXXXXXXXX
Sbjct: 61  ALQQPERSNSDSLPDSSSASGNEYRVLRRKYLLLXXXXXXXXXXXXXXXXXXXXXXXXXX 120

Query: 156 GLLDELLVLEGLINRSELQLAQSNLP 181
                  VLEGLI+RSELQLA SNLP
Sbjct: 121 XXXXXXXVLEGLIDRSELQLAHSNLP 146

BLAST of Cla97C01G020560 vs. TrEMBL
Match: tr|A0A1S4DU11|A0A1S4DU11_CUCME (uncharacterized protein LOC103485204 OS=Cucumis melo OX=3656 GN=LOC103485204 PE=4 SV=1)

HSP 1 Score: 226.9 bits (577), Expect = 3.7e-56
Identity = 134/148 (90.54%), Positives = 139/148 (93.92%), Query Frame = 0

Query: 36  MSRIDDLPYSRPPKQKALHQKASTASHVQVKDPNDRIKGSFPAQTAQKASKRSLKNEPFI 95
           MSRIDDLPYSRPPKQKALHQKASTASH+Q+KDP++RIKGSFPAQTAQKASKRSLKNEP I
Sbjct: 1   MSRIDDLPYSRPPKQKALHQKASTASHLQLKDPSNRIKGSFPAQTAQKASKRSLKNEPSI 60

Query: 96  AFQHPERSNSDSLPD-SSASGNEYRALRRKYLLLEEESFTLGAELKEVEDXXXXXXXXXX 155
            FQ PERSNSDSLPD SSASGNEYRALRRKYLLLEEESFTLGAELKEV  XXXXXXXXXX
Sbjct: 61  VFQQPERSNSDSLPDSSSASGNEYRALRRKYLLLEEESFTLGAELKEVXXXXXXXXXXXX 120

Query: 156 GLLDELLVLEGLINRSELQLAQSNLPQL 183
            LLDELLVLEGLI+RSELQLA SNLP L
Sbjct: 121 XLLDELLVLEGLIDRSELQLAHSNLPHL 148

BLAST of Cla97C01G020560 vs. TrEMBL
Match: tr|A0A0A0KMU4|A0A0A0KMU4_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G510240 PE=4 SV=1)

HSP 1 Score: 222.6 bits (566), Expect = 7.1e-55
Identity = 130/146 (89.04%), Positives = 134/146 (91.78%), Query Frame = 0

Query: 36  MSRIDDLPYSRPPKQKALHQKASTASHVQVKDPNDRIKGSFPAQTAQKASKRSLKNEPFI 95
           MSRIDDLPYSRPPKQKALHQKASTASHVQVKDP+DRIKGSFPAQTAQKASKRSLKNEP +
Sbjct: 1   MSRIDDLPYSRPPKQKALHQKASTASHVQVKDPSDRIKGSFPAQTAQKASKRSLKNEPSV 60

Query: 96  AFQHPERSNSDSLPDSSASGNEYRALRRKYLLLEEESFTLGAELKEVEDXXXXXXXXXXG 155
            FQ PERSNSDSLPDSSASGNEYRALRRKYLLLEEESF+LGAELK V  XXXXXXXXXX 
Sbjct: 61  VFQQPERSNSDSLPDSSASGNEYRALRRKYLLLEEESFSLGAELKGVXXXXXXXXXXXXX 120

Query: 156 LLDELLVLEGLINRSELQLAQSNLPQ 182
               LLVLEGLI+RSELQLA SNLPQ
Sbjct: 121 XXXXLLVLEGLIDRSELQLAHSNLPQ 146

BLAST of Cla97C01G020560 vs. TrEMBL
Match: tr|W9REA9|W9REA9_9ROSA (Uncharacterized protein OS=Morus notabilis OX=981085 GN=L484_002283 PE=4 SV=1)

HSP 1 Score: 124.0 bits (310), Expect = 3.4e-25
Identity = 76/142 (53.52%), Positives = 91/142 (64.08%), Query Frame = 0

Query: 40  DDLPYSRPPKQKALHQKAS--TASHVQVKDPNDRIKGSFPAQTAQKASKRSLKNEPFIAF 99
           D++P  +P K+K   QKA         VK   D ++GS P +  QK SKR+L+NE     
Sbjct: 10  DEVPNHKPQKKKTSTQKAPMFQMRRHDVKAVKD-VQGSVPLKAGQKVSKRALRNEVSPMV 69

Query: 100 QHPERSNSDSLPDSSASGNEYRALRRKYLLLEEESFTLGAELKEVEDXXXXXXXXXXGLL 159
             PERSNSDSLP+SS SGNEYR+LRRKYLLLEEESFTLG +L+EVED           LL
Sbjct: 70  HQPERSNSDSLPNSSTSGNEYRSLRRKYLLLEEESFTLGRDLREVEDEVKSLEDEKHALL 129

Query: 160 DELLVLEGLINRSELQLAQSNL 180
           D L+VLEGLI+ SEL   Q  L
Sbjct: 130 DRLVVLEGLIDPSELLSQQGKL 150

BLAST of Cla97C01G020560 vs. TrEMBL
Match: tr|A0A2R6R5Y8|A0A2R6R5Y8_ACTCH (Vacuolar protein sorting/targeting protein like OS=Actinidia chinensis var. chinensis OX=1590841 GN=CEY00_Acc10470 PE=4 SV=1)

HSP 1 Score: 122.9 bits (307), Expect = 7.6e-25
Identity = 75/137 (54.74%), Positives = 86/137 (62.77%), Query Frame = 0

Query: 46  RPPKQKALHQKASTASHVQVKDPNDRIKGSFPAQTAQKASKRSLKNEPFIAFQHPERSNS 105
           +P K+K   QK ST            +    P Q+ +KASKR LKNE    FQ PE S S
Sbjct: 16  KPRKKKPSTQKGSTFQQGSNVKGFQEVLPPLPIQSVKKASKRVLKNEVNPLFQQPENSPS 75

Query: 106 DSLPDSSASGNEYRALRRKYLLLEEESFTLGAELKEVEDXXXXXXXXXXGLLDELLVLEG 165
           DSLPDSS SGNEYRALRRKYLLLEEESF LG+EL+EVED           LLD+L+VLEG
Sbjct: 76  DSLPDSSTSGNEYRALRRKYLLLEEESFGLGSELREVEDVVKTLEEEKLALLDDLVVLEG 135

Query: 166 LINRSELQLAQSNLPQL 183
           L++ SELQ     LP L
Sbjct: 136 LMDPSELQSQGKRLPLL 152

BLAST of Cla97C01G020560 vs. TrEMBL
Match: tr|A0A061ENV2|A0A061ENV2_THECC (Uncharacterized protein isoform 2 OS=Theobroma cacao OX=3641 GN=TCM_019278 PE=4 SV=1)

HSP 1 Score: 121.7 bits (304), Expect = 1.7e-24
Identity = 69/103 (66.99%), Positives = 80/103 (77.67%), Query Frame = 0

Query: 71  RIKGSFPAQTAQKASKRSLKNEPFIAFQHPERSNSDSLPDSSASGNEYRALRRKYLLLEE 130
           ++ GS P +T QK SKR+LK E    FQ PERSNSDS+PDSS SGNEYRALRRKYLLLEE
Sbjct: 42  QVLGSLPLRTGQKTSKRNLKKEISPIFQQPERSNSDSIPDSSTSGNEYRALRRKYLLLEE 101

Query: 131 ESFTLGAELKEVEDXXXXXXXXXXGLLDELLVLEGLINRSELQ 174
           ESF LG ELK+V D     XXXX  LLD+L+VLEGL++ SE+Q
Sbjct: 102 ESFALGKELKDVVDEVKVLXXXXFALLDQLVVLEGLVDPSEMQ 144

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_016899190.1 | 5.7e-56 | 90.54 | PREDICTED: uncharacterized protein LOC103485204 [Cucumis melo]
XP_011658091.1 | 1.1e-54 | 89.04 | PREDICTED: uncharacterized protein LOC101212199 [Cucumis sativus] >KGN49006.1 hy...
XP_022950616.1 | 8.8e-41 | 75.34 | uncharacterized protein LOC111453661 [Cucurbita moschata]
XP_023545100.1 | 1.1e-40 | 75.34 | uncharacterized protein LOC111804503 [Cucurbita pepo subsp. pepo]
XP_022978374.1 | 4.4e-40 | 74.66 | uncharacterized protein LOC111478385 [Cucurbita maxima]

Match Name | E-value | Identity (%) | Description
tr|A0A1S4DU11|A0A1S4DU11_CUCME | 3.7e-56 | 90.54 | uncharacterized protein LOC103485204 OS=Cucumis melo OX=3656 GN=LOC103485204 PE=...
tr|A0A0A0KMU4|A0A0A0KMU4_CUCSA | 7.1e-55 | 89.04 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G510240 PE=4 SV=1
tr|W9REA9|W9REA9_9ROSA | 3.4e-25 | 53.52 | Uncharacterized protein OS=Morus notabilis OX=981085 GN=L484_002283 PE=4 SV=1
tr|A0A2R6R5Y8|A0A2R6R5Y8_ACTCH | 7.6e-25 | 54.74 | Vacuolar protein sorting/targeting protein like OS=Actinidia chinensis var. chin...
tr|A0A061ENV2|A0A061ENV2_THECC | 1.7e-24 | 66.99 | Uncharacterized protein isoform 2 OS=Theobroma cacao OX=3641 GN=TCM_019278 PE=4 ...
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0008150 | biological_process
biological_process | GO:0005975 | carbohydrate metabolic process
cellular_component | GO:0005575 | cellular_component
molecular_function | GO:0003674 | molecular_function
molecular_function | GO:0003677 | DNA binding
molecular_function | GO:0016787 | hydrolase activity
molecular_function | GO:0004553 | hydrolase activity, hydrolyzing O-glycosyl compounds
molecular_function | GO:0003676 | nucleic acid binding
molecular_function | GO:0046983 | protein dimerization activity

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cla97C01G020560.1 | Cla97C01G020560.1 | mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 132..159
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 55..90
None | No IPR available | PANTHER | PTHR37740 | FAMILY NOT NAMED | coord: 37..173

The following gene(s) are paralogous to this gene:

None

The following block(s) cover this gene:
Gene | Organism | Block
Cla97C01G020560 | Silver-seed gourd | carwmbB0669
Cla97C01G020560 | Cucumber (Gy14) v2 | cgybwmbB394
Cla97C01G020560 | Cucurbita maxima (Rimu) | cmawmbB875
Cla97C01G020560 | Cucurbita moschata (Rifu) | cmowmbB849
Cla97C01G020560 | Bottle gourd (USVL1VR-Ls) | lsiwmbB115
Cla97C01G020560 | Melon (DHL92) v3.6.1 | medwmbB411
Cla97C01G020560 | Watermelon (Charleston Gray) | wcgwmbB089