CmUC06G118420 (gene) Watermelon (USVL531) v1

Overview
Name: CmUC06G118420
Type: gene
Organism: Citrullus mucosospermus (Watermelon (USVL531) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: CmU531Chr06: 20583158 .. 20583540 (-)
RNA-Seq Expression: CmUC06G118420
Synteny: CmUC06G118420
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTATGCGCAACACAACCGAGGAAGAGGACATGGTTGTGGTCATGGAAATAACAATGAAGAGAAAGAACAAACGAGCCACAAAATTGGCGTGGACGAGGTTGAGGTCGTGGAAGAGGTGATCGCTCAAATCGTCTAAATATCAAGTGTTACAATTGTGGAAAATATGGACACTACACAAAATATTGCTATGCCGAGAAGAAGGTGGAAGAAAATGCGAACTTAGTTGCGGAAGATGAGACCAAAGATGAGGGTGTTCTTGTGATGGCCCGTGAAGGTATCACTCCGGAGAGCAACATGCTGTTGTATCTTGACACCGGTGCAAGCAACCATATGTGTAGACACAAACATCTTTTTGTTGATATGCAAGAGATAGAAGATTGA

mRNA sequence

ATGTATGCGCAACACAACCGAGGAAGAGGACATGGTTGTGGTCATGGAAATAACAATGAAGAGAAAGAACAAACGAGCCACAAAATTGGCGTGGACGAGGTTGAGGTCGTGGAAGAGGTGGAAGAAAATGCGAACTTAGTTGCGGAAGATGAGACCAAAGATGAGGGTGTTCTTGTGATGGCCCGTGAAGGTATCACTCCGGAGAGCAACATGCTGTTGTATCTTGACACCGGTGCAAGCAACCATATGTGTAGACACAAACATCTTTTTGTTGATATGCAAGAGATAGAAGATTGA

Coding sequence (CDS)

ATGTATGCGCAACACAACCGAGGAAGAGGACATGGTTGTGGTCATGGAAATAACAATGAAGAGAAAGAACAAACGAGCCACAAAATTGGCGTGGACGAGGTTGAGGTCGTGGAAGAGGTGGAAGAAAATGCGAACTTAGTTGCGGAAGATGAGACCAAAGATGAGGGTGTTCTTGTGATGGCCCGTGAAGGTATCACTCCGGAGAGCAACATGCTGTTGTATCTTGACACCGGTGCAAGCAACCATATGTGTAGACACAAACATCTTTTTGTTGATATGCAAGAGATAGAAGATTGA

Protein sequence

MYAQHNRGRGHGCGHGNNNEEKEQTSHKIGVDEVEVVEEVEENANLVAEDETKDEGVLVMAREGITPESNMLLYLDTGASNHMCRHKHLFVDMQEIED
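
The protein sequence above can be reproduced from the CDS by ordinary codon translation. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the function name `translate` and the constant names are illustrative and not part of the database record. For brevity it uses only the first 18 nt of the CDS listed above, with a TGA stop codon appended to show how translation ends.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First 18 nt of the CDS shown above (hypothetical constant name).
CDS_PREFIX = "ATGTATGCGCAACACAAC"
print(translate(CDS_PREFIX + "TGA"))  # MYAQHN
```

Passing the full CDS string instead of the prefix should yield the complete protein, provided the annotation uses the standard code.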
Homology
BLAST of CmUC06G118420 vs. NCBI nr
Match: KYP41491.1 (hypothetical protein KK1_037138 [Cajanus cajan])

HSP 1 Score: 105.9 bits (263), Expect = 2.0e-19
Identity = 61/110 (55.45%), Positives = 74/110 (67.27%), Query Frame = 0

Query: 1   MYAQHNRGRGHGCGHGNNNEEKEQTSHKIGVD------------EVEVVEEVEENANLVA 60
           MYAQHNRGRG G G G     +   S++  V+                 ++VEEN NLV 
Sbjct: 1   MYAQHNRGRGRGRG-GRRGRGRGGRSNRTNVECYNCSKYGHYAKNCYAKKKVEENVNLVK 60

Query: 61  EDETKDEGVLVMAREGITPESNMLLYLDTGASNHMCRHKHLFVDMQEIED 99
           EDETK+EG+L+MA EGIT +S+M+ YLDTGASNHMC HKHLFVD+QEIED
Sbjct: 61  EDETKNEGILMMANEGITLDSDMVWYLDTGASNHMCGHKHLFVDIQEIED 109
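
The percentages in the HSP header follow directly from the column counts: identical (or positive-scoring) columns divided by the alignment length, which includes gap columns. A minimal sketch of that arithmetic, with a hypothetical helper name `percent`:

```python
def percent(matches: int, alignment_length: int) -> float:
    """Share of alignment columns, as a percentage rounded to two decimals."""
    return round(100.0 * matches / alignment_length, 2)


# Counts taken from the HSP header above: 61 identical and 74 positive
# columns out of an alignment length of 110 (gap columns included).
print(percent(61, 110))  # 55.45
print(percent(74, 110))  # 67.27
```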

BLAST of CmUC06G118420 vs. NCBI nr
Match: KAE8706666.1 (hypothetical protein F3Y22_tig00110388pilonHSYRG00007 [Hibiscus syriacus])

HSP 1 Score: 90.1 bits (222), Expect = 1.1e-14
Identity = 42/61 (68.85%), Positives = 53/61 (86.89%), Query Frame = 0

Query: 38  EEVEENANLVAEDETKDEGVLVMAREGITPESNMLLYLDTGASNHMCRHKHLFVDMQEIE 97
           + VEEN NLVAE+ET+++GVL+MA + I P+S+ + YLDTGASNHMC HKHLFVDMQEIE
Sbjct: 285 KRVEENVNLVAEEETREDGVLMMAYKNIIPDSDTVWYLDTGASNHMCGHKHLFVDMQEIE 344

Query: 98  D 99
           +
Sbjct: 345 E 345

BLAST of CmUC06G118420 vs. NCBI nr
Match: KAA0045462.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] >TYK00248.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 85.1 bits (209), Expect = 3.6e-13
Identity = 46/100 (46.00%), Positives = 67/100 (67.00%), Query Frame = 0

Query: 4   QHNRGRGHGCGHGNNNEEKEQTSHKIG-----VDEVEVVEEVEENANLVAEDETKDEGVL 63
           Q+ RGRG G G G+ +       +  G      ++    ++VEENANL+A++ETK++ +L
Sbjct: 108 QNWRGRGRGRGRGSRSNRPNVKCYNCGKYGHYANDWYAEKKVEENANLIAKEETKNDDIL 167

Query: 64  VMAREGITPESNMLLYLDTGASNHMCRHKHLFVDMQEIED 99
           +M  EG  P+S+M+ YL+TG SN+MC H+H FVDMQEIED
Sbjct: 168 MMTHEGTIPDSDMVCYLNTGGSNNMCGHEHFFVDMQEIED 207

BLAST of CmUC06G118420 vs. NCBI nr
Match: KAE8732520.1 (Detected protein of unknown function [Hibiscus syriacus])

HSP 1 Score: 84.3 bits (207), Expect = 6.2e-13
Identity = 57/130 (43.85%), Positives = 75/130 (57.69%), Query Frame = 0

Query: 1   MYAQHNRGRGHGCGHGNNN------------EEKEQTSHKIG-VDEVEVVE--------- 60
           +Y Q+ RGRG G    NN             EEK   ++KIG V++V V E         
Sbjct: 175 LYTQNFRGRGRGRRGRNNGRGGGGRCRSGYYEEKGSQANKIGEVEDVVVGEATDQIIQML 234

Query: 61  ----------EVEENANLVAEDETKDEGVLVMAREGITPESNMLLYLDTGASNHMCRHKH 99
                     +VEENANLVAE+ET+++GVL+M  +   P+ + + YLDT ASNHMC HKH
Sbjct: 235 SATTECYSEKKVEENANLVAEEETREDGVLMMTYKSTVPDRDTVWYLDTRASNHMCGHKH 294

BLAST of CmUC06G118420 vs. NCBI nr
Match: KAB5529592.1 (hypothetical protein DKX38_019673 [Salix brachista])

HSP 1 Score: 81.6 bits (200), Expect = 4.0e-12
Identity = 36/61 (59.02%), Positives = 51/61 (83.61%), Query Frame = 0

Query: 38  EEVEENANLVAEDETKDEGVLVMAREGITPESNMLLYLDTGASNHMCRHKHLFVDMQEIE 97
           ++VEEN NLV E+ET+++GVL+MA +    ++N++ YLDTGASNHMC HKHLF +M+E+E
Sbjct: 148 KKVEENVNLVTEEETREDGVLMMAYKNTVSDNNIVWYLDTGASNHMCGHKHLFKEMREVE 207

Query: 98  D 99
           D
Sbjct: 208 D 208

BLAST of CmUC06G118420 vs. ExPASy TrEMBL
Match: A0A151RFW6 (CCHC-type domain-containing protein OS=Cajanus cajan OX=3821 GN=KK1_037138 PE=4 SV=1)

HSP 1 Score: 105.9 bits (263), Expect = 9.6e-20
Identity = 61/110 (55.45%), Positives = 74/110 (67.27%), Query Frame = 0

Query: 1   MYAQHNRGRGHGCGHGNNNEEKEQTSHKIGVD------------EVEVVEEVEENANLVA 60
           MYAQHNRGRG G G G     +   S++  V+                 ++VEEN NLV 
Sbjct: 1   MYAQHNRGRGRGRG-GRRGRGRGGRSNRTNVECYNCSKYGHYAKNCYAKKKVEENVNLVK 60

Query: 61  EDETKDEGVLVMAREGITPESNMLLYLDTGASNHMCRHKHLFVDMQEIED 99
           EDETK+EG+L+MA EGIT +S+M+ YLDTGASNHMC HKHLFVD+QEIED
Sbjct: 61  EDETKNEGILMMANEGITLDSDMVWYLDTGASNHMCGHKHLFVDIQEIED 109
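
This hit has the same bit score (105.9) as the KYP41491.1 hit against NCBI nr, yet a smaller E-value (9.6e-20 versus 2.0e-19). Under the Karlin-Altschul statistics that BLAST reports, E = m * n * 2^(-S'), where S' is the bit score and m * n is the effective search space, so at equal bit scores the E-value ratio approximates the ratio of the two search spaces. The sketch below illustrates this relation only; the function name is hypothetical, the inputs are the rounded values printed above, and the actual database sizes are not stated in this record.

```python
def effective_search_space(bit_score: float, e_value: float) -> float:
    """Invert E = (m * n) * 2**(-S') to estimate the effective search space m * n."""
    return e_value * 2.0 ** bit_score


# Same bit score, different databases (values as reported in the HSP headers).
nr = effective_search_space(105.9, 2.0e-19)      # hit vs. NCBI nr
trembl = effective_search_space(105.9, 9.6e-20)  # hit vs. ExPASy TrEMBL

print(f"nr / TrEMBL search-space ratio: {nr / trembl:.2f}")
```

On these rounded figures the nr search space comes out roughly twice the TrEMBL one, which is why identical alignments carry different E-values in the two result sets.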

BLAST of CmUC06G118420 vs. ExPASy TrEMBL
Match: A0A6A3AQY9 (CCHC-type domain-containing protein OS=Hibiscus syriacus OX=106335 GN=F3Y22_tig00110388pilonHSYRG00007 PE=4 SV=1)

HSP 1 Score: 90.1 bits (222), Expect = 5.5e-15
Identity = 42/61 (68.85%), Positives = 53/61 (86.89%), Query Frame = 0

Query: 38  EEVEENANLVAEDETKDEGVLVMAREGITPESNMLLYLDTGASNHMCRHKHLFVDMQEIE 97
           + VEEN NLVAE+ET+++GVL+MA + I P+S+ + YLDTGASNHMC HKHLFVDMQEIE
Sbjct: 285 KRVEENVNLVAEEETREDGVLMMAYKNIIPDSDTVWYLDTGASNHMCGHKHLFVDMQEIE 344

Query: 98  D 99
           +
Sbjct: 345 E 345

BLAST of CmUC06G118420 vs. ExPASy TrEMBL
Match: A0A5A7TQ20 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold2460G00080 PE=4 SV=1)

HSP 1 Score: 85.1 bits (209), Expect = 1.8e-13
Identity = 46/100 (46.00%), Positives = 67/100 (67.00%), Query Frame = 0

Query: 4   QHNRGRGHGCGHGNNNEEKEQTSHKIG-----VDEVEVVEEVEENANLVAEDETKDEGVL 63
           Q+ RGRG G G G+ +       +  G      ++    ++VEENANL+A++ETK++ +L
Sbjct: 108 QNWRGRGRGRGRGSRSNRPNVKCYNCGKYGHYANDWYAEKKVEENANLIAKEETKNDDIL 167

Query: 64  VMAREGITPESNMLLYLDTGASNHMCRHKHLFVDMQEIED 99
           +M  EG  P+S+M+ YL+TG SN+MC H+H FVDMQEIED
Sbjct: 168 MMTHEGTIPDSDMVCYLNTGGSNNMCGHEHFFVDMQEIED 207

BLAST of CmUC06G118420 vs. ExPASy TrEMBL
Match: A0A6A3CTC0 (Integrase catalytic domain-containing protein OS=Hibiscus syriacus OX=106335 GN=F3Y22_tig00001860pilonHSYRG00008 PE=4 SV=1)

HSP 1 Score: 84.3 bits (207), Expect = 3.0e-13
Identity = 57/130 (43.85%), Positives = 75/130 (57.69%), Query Frame = 0

Query: 1   MYAQHNRGRGHGCGHGNNN------------EEKEQTSHKIG-VDEVEVVE--------- 60
           +Y Q+ RGRG G    NN             EEK   ++KIG V++V V E         
Sbjct: 175 LYTQNFRGRGRGRRGRNNGRGGGGRCRSGYYEEKGSQANKIGEVEDVVVGEATDQIIQML 234

Query: 61  ----------EVEENANLVAEDETKDEGVLVMAREGITPESNMLLYLDTGASNHMCRHKH 99
                     +VEENANLVAE+ET+++GVL+M  +   P+ + + YLDT ASNHMC HKH
Sbjct: 235 SATTECYSEKKVEENANLVAEEETREDGVLMMTYKSTVPDRDTVWYLDTRASNHMCGHKH 294

BLAST of CmUC06G118420 vs. ExPASy TrEMBL
Match: A0A5N5KGV2 (DUF4219 domain-containing protein OS=Salix brachista OX=2182728 GN=DKX38_019673 PE=4 SV=1)

HSP 1 Score: 81.6 bits (200), Expect = 1.9e-12
Identity = 36/61 (59.02%), Positives = 51/61 (83.61%), Query Frame = 0

Query: 38  EEVEENANLVAEDETKDEGVLVMAREGITPESNMLLYLDTGASNHMCRHKHLFVDMQEIE 97
           ++VEEN NLV E+ET+++GVL+MA +    ++N++ YLDTGASNHMC HKHLF +M+E+E
Sbjct: 148 KKVEENVNLVTEEETREDGVLMMAYKNTVSDNNIVWYLDTGASNHMCGHKHLFKEMREVE 207

Query: 98  D 99
           D
Sbjct: 208 D 208

The following BLAST results are available for this feature:

NCBI nr
Match Name    E-value  Identity (%)  Description
KYP41491.1    2.0e-19  55.45         hypothetical protein KK1_037138 [Cajanus cajan]
KAE8706666.1  1.1e-14  68.85         hypothetical protein F3Y22_tig00110388pilonHSYRG00007 [Hibiscus syriacus]
KAA0045462.1  3.6e-13  46.00         Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m...
KAE8732520.1  6.2e-13  43.85         Detected protein of unknown function [Hibiscus syriacus]
KAB5529592.1  4.0e-12  59.02         hypothetical protein DKX38_019673 [Salix brachista]

ExPASy TrEMBL
Match Name    E-value  Identity (%)  Description
A0A151RFW6    9.6e-20  55.45         CCHC-type domain-containing protein OS=Cajanus cajan OX=3821 GN=KK1_037138 PE=4 ...
A0A6A3AQY9    5.5e-15  68.85         CCHC-type domain-containing protein OS=Hibiscus syriacus OX=106335 GN=F3Y22_tig0...
A0A5A7TQ20    1.8e-13  46.00         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var....
A0A6A3CTC0    3.0e-13  43.85         Integrase catalytic domain-containing protein OS=Hibiscus syriacus OX=106335 GN=...
A0A5N5KGV2    1.9e-12  59.02         DUF4219 domain-containing protein OS=Salix brachista OX=2182728 GN=DKX38_019673 ...
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL531) v1
Date Performed: 2022-01-31
IPR Term  IPR Description   Source       Source Term  Source Description   Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 1..32
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 14..32

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CmUC06G118420.1  CmUC06G118420.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0008270      zinc ion binding