CmUC02G034300 (gene) Watermelon (USVL531) v1

Overview
NameCmUC02G034300
Typegene
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
LocationCmU531Chr02: 10298169 .. 10298552 (+)
RNA-Seq ExpressionCmUC02G034300
SyntenyCmUC02G034300
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACTATACTTTATGCTTCCCTTATAGAGGAAAAGATGGGAGAAATAGTATCCTTCACAACTACCTATGATATATGGCATTCACTTCACCGATCATATGAGTCATCATCATACACGCGAGTTCTCAGTCTTAAAGCCCAAATACAAAAAATTCAAAAGGACGGTCTTACTGTCACACAATATTTGGCCAAATTTAAAGATATTTCTGATAAGTTGTCTGCGATCAGTGAACCCATATCTCATAAAGACCACATCTCCTATATTTTAGAGGGTCTTGGAGTTGAGTACAATGCTTTTGTCACCTCCATCCAGAACAAGGGGGATATTCCAATGCTTGAGGATGTTATCACACTTCTTCTCAGTTACGATTATCGTCTTGAATGA

mRNA sequence

ATGACTATACTTTATGCTTCCCTTATAGAGGAAAAGATGGGAGAAATAGTATCCTTCACAACTACCTATGATATATGGCATTCACTTCACCGATCATATGAGTCATCATCATACACGCGAGTTCTCAGTCTTAAAGCCCAAATACAAAAAATTCAAAAGGACGGTCTTACTGTCACACAATATTTGGCCAAATTTAAAGATATTTCTGATAAGTTGTCTGCGATCAGTGAACCCATATCTCATAAAGACCACATCTCCTATATTTTAGAGGGTCTTGGAGTTGAGTACAATGCTTTTGTCACCTCCATCCAGAACAAGGGGGATATTCCAATGCTTGAGGATGTTATCACACTTCTTCTCAGTTACGATTATCGTCTTGAATGA

Coding sequence (CDS)

ATGACTATACTTTATGCTTCCCTTATAGAGGAAAAGATGGGAGAAATAGTATCCTTCACAACTACCTATGATATATGGCATTCACTTCACCGATCATATGAGTCATCATCATACACGCGAGTTCTCAGTCTTAAAGCCCAAATACAAAAAATTCAAAAGGACGGTCTTACTGTCACACAATATTTGGCCAAATTTAAAGATATTTCTGATAAGTTGTCTGCGATCAGTGAACCCATATCTCATAAAGACCACATCTCCTATATTTTAGAGGGTCTTGGAGTTGAGTACAATGCTTTTGTCACCTCCATCCAGAACAAGGGGGATATTCCAATGCTTGAGGATGTTATCACACTTCTTCTCAGTTACGATTATCGTCTTGAATGA

Protein sequence

MTILYASLIEEKMGEIVSFTTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQYLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFVTSIQNKGDIPMLEDVITLLLSYDYRLE
Homology
BLAST of CmUC02G034300 vs. NCBI nr
Match: XP_022155181.1 (uncharacterized protein LOC111022315 [Momordica charantia])

HSP 1 Score: 154.5 bits (389), Expect = 6.3e-34
Identity = 71/127 (55.91%), Postives = 102/127 (80.31%), Query Frame = 0

Query: 1   MTILYASLIEEKMGEIVSFTTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQ 60
           M  +Y+SL EEKMGE+VS  TT+DIW SL R Y+S +  R++ LK ++Q ++KDG +V+Q
Sbjct: 92  MCWIYSSLSEEKMGEVVSLETTHDIWSSLTRVYDSKTTARIMGLKTELQNLRKDGSSVSQ 151

Query: 61  YLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFVTSIQNKGDIPMLEDVITLLL 120
           YLAK K+I+DK +A+ EP+S++DH++++L+GLG EYNAFVTSI N+ D P LEDV +LLL
Sbjct: 152 YLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSIHNRADSPSLEDVRSLLL 211

Query: 121 SYDYRLE 128
           +Y+ RL+
Sbjct: 212 AYEARLD 218

BLAST of CmUC02G034300 vs. NCBI nr
Match: XP_038887133.1 (uncharacterized protein LOC120077323 [Benincasa hispida])

HSP 1 Score: 135.6 bits (340), Expect = 3.0e-28
Identity = 64/115 (55.65%), Postives = 87/115 (75.65%), Query Frame = 0

Query: 13  MGEIVSFTTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQYLAKFKDISDKL 72
           MGEIV + + +DIW +L   YESSS   ++   +Q+QKI+KDGLTV+QYLA+ KD+ D  
Sbjct: 1   MGEIVGYESAFDIWEALRTVYESSSIAPIMGFCSQLQKIKKDGLTVSQYLAQIKDVLDNF 60

Query: 73  SAISEPISHKDHISYILEGLGVEYNAFVTSIQNKGDIPMLEDVITLLLSYDYRLE 128
           +AI EP+S++DH+SYILEGLG EYN FV+SI N+ + P + DV  LL++YD RLE
Sbjct: 61  AAIGEPLSYRDHLSYILEGLGSEYNPFVSSIHNRTNRPSIADVRNLLITYDSRLE 115

BLAST of CmUC02G034300 vs. NCBI nr
Match: PON47862.1 (hypothetical protein TorRG33x02_321990 [Trema orientale])

HSP 1 Score: 125.6 bits (314), Expect = 3.1e-25
Identity = 59/127 (46.46%), Postives = 92/127 (72.44%), Query Frame = 0

Query: 1   MTILYASLIEEKMGEIVSFTTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQ 60
           M+ +YASL +  MG+IV + + ++IW +L++ Y SSS  ++  L+A++Q ++KDGLT  +
Sbjct: 114 MSWIYASLTQGVMGQIVGYASAFEIWEALNQIYTSSSLAKITELRAKLQNLRKDGLTAIE 173

Query: 61  YLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFVTSIQNKGDIPMLEDVITLLL 120
           Y+ K K+I + L+A+ EP+S KDH+ Y+  GL  EYNAFVTSI  + D   LE++ +LLL
Sbjct: 174 YIQKHKNICNTLAAVGEPVSCKDHLLYLFGGLDREYNAFVTSITKRPDNLPLEEIYSLLL 233

Query: 121 SYDYRLE 128
           SY++RLE
Sbjct: 234 SYEFRLE 240

BLAST of CmUC02G034300 vs. NCBI nr
Match: XP_022148871.1 (uncharacterized protein LOC111017438 [Momordica charantia])

HSP 1 Score: 123.6 bits (309), Expect = 1.2e-24
Identity = 59/81 (72.84%), Postives = 73/81 (90.12%), Query Frame = 0

Query: 47  QIQKIQKDGLTVTQYLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFVTSIQNK 106
           +IQ+++KDGL+V+QYLAK K+I+ KLS+I EPIS KDHISYI+EGLG+EYNAFVTSIQN+
Sbjct: 7   EIQQVKKDGLSVSQYLAKIKEITGKLSSIGEPISLKDHISYIIEGLGIEYNAFVTSIQNR 66

Query: 107 GDIPMLEDVITLLLSYDYRLE 128
            D+  LEDV TLLL+YDYRLE
Sbjct: 67  SDMXTLEDVRTLLLAYDYRLE 87

BLAST of CmUC02G034300 vs. NCBI nr
Match: RVW33435.1 (hypothetical protein CK203_098877 [Vitis vinifera])

HSP 1 Score: 122.1 bits (305), Expect = 3.5e-24
Identity = 57/127 (44.88%), Postives = 90/127 (70.87%), Query Frame = 0

Query: 1   MTILYASLIEEKMGEIVSFTTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQ 60
           M+ LYASL E+ M +IV ++T  +IW++L++ Y +SS  R   L+ ++Q ++KDGL+  +
Sbjct: 1   MSWLYASLSEDIMAQIVGYSTAMEIWNALNQIYFASSMARFTELRTKLQTLKKDGLSAGE 60

Query: 61  YLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFVTSIQNKGDIPMLEDVITLLL 120
           Y+ + K I + ++AI EP+S K H+ Y+  GL  EYN+FVTSIQ + D P ++ + +LLL
Sbjct: 61  YIQRLKSICNSIAAIGEPVSEKGHLIYLFNGLDCEYNSFVTSIQIRSDQPTIDKIHSLLL 120

Query: 121 SYDYRLE 128
           SYD+RLE
Sbjct: 121 SYDFRLE 127

BLAST of CmUC02G034300 vs. ExPASy TrEMBL
Match: A0A6J1DQX7 (uncharacterized protein LOC111022315 OS=Momordica charantia OX=3673 GN=LOC111022315 PE=4 SV=1)

HSP 1 Score: 154.5 bits (389), Expect = 3.1e-34
Identity = 71/127 (55.91%), Postives = 102/127 (80.31%), Query Frame = 0

Query: 1   MTILYASLIEEKMGEIVSFTTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQ 60
           M  +Y+SL EEKMGE+VS  TT+DIW SL R Y+S +  R++ LK ++Q ++KDG +V+Q
Sbjct: 92  MCWIYSSLSEEKMGEVVSLETTHDIWSSLTRVYDSKTTARIMGLKTELQNLRKDGSSVSQ 151

Query: 61  YLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFVTSIQNKGDIPMLEDVITLLL 120
           YLAK K+I+DK +A+ EP+S++DH++++L+GLG EYNAFVTSI N+ D P LEDV +LLL
Sbjct: 152 YLAKIKEIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSIHNRADSPSLEDVRSLLL 211

Query: 121 SYDYRLE 128
           +Y+ RL+
Sbjct: 212 AYEARLD 218

BLAST of CmUC02G034300 vs. ExPASy TrEMBL
Match: A0A2P5BGF8 (Uncharacterized protein OS=Trema orientale OX=63057 GN=TorRG33x02_321990 PE=4 SV=1)

HSP 1 Score: 125.6 bits (314), Expect = 1.5e-25
Identity = 59/127 (46.46%), Postives = 92/127 (72.44%), Query Frame = 0

Query: 1   MTILYASLIEEKMGEIVSFTTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQ 60
           M+ +YASL +  MG+IV + + ++IW +L++ Y SSS  ++  L+A++Q ++KDGLT  +
Sbjct: 114 MSWIYASLTQGVMGQIVGYASAFEIWEALNQIYTSSSLAKITELRAKLQNLRKDGLTAIE 173

Query: 61  YLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFVTSIQNKGDIPMLEDVITLLL 120
           Y+ K K+I + L+A+ EP+S KDH+ Y+  GL  EYNAFVTSI  + D   LE++ +LLL
Sbjct: 174 YIQKHKNICNTLAAVGEPVSCKDHLLYLFGGLDREYNAFVTSITKRPDNLPLEEIYSLLL 233

Query: 121 SYDYRLE 128
           SY++RLE
Sbjct: 234 SYEFRLE 240

BLAST of CmUC02G034300 vs. ExPASy TrEMBL
Match: A0A6J1D6N7 (uncharacterized protein LOC111017438 OS=Momordica charantia OX=3673 GN=LOC111017438 PE=4 SV=1)

HSP 1 Score: 123.6 bits (309), Expect = 5.8e-25
Identity = 59/81 (72.84%), Postives = 73/81 (90.12%), Query Frame = 0

Query: 47  QIQKIQKDGLTVTQYLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFVTSIQNK 106
           +IQ+++KDGL+V+QYLAK K+I+ KLS+I EPIS KDHISYI+EGLG+EYNAFVTSIQN+
Sbjct: 7   EIQQVKKDGLSVSQYLAKIKEITGKLSSIGEPISLKDHISYIIEGLGIEYNAFVTSIQNR 66

Query: 107 GDIPMLEDVITLLLSYDYRLE 128
            D+  LEDV TLLL+YDYRLE
Sbjct: 67  SDMXTLEDVRTLLLAYDYRLE 87

BLAST of CmUC02G034300 vs. ExPASy TrEMBL
Match: A0A438DD82 (Uncharacterized protein OS=Vitis vinifera OX=29760 GN=CK203_098877 PE=4 SV=1)

HSP 1 Score: 122.1 bits (305), Expect = 1.7e-24
Identity = 57/127 (44.88%), Postives = 90/127 (70.87%), Query Frame = 0

Query: 1   MTILYASLIEEKMGEIVSFTTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQ 60
           M+ LYASL E+ M +IV ++T  +IW++L++ Y +SS  R   L+ ++Q ++KDGL+  +
Sbjct: 1   MSWLYASLSEDIMAQIVGYSTAMEIWNALNQIYFASSMARFTELRTKLQTLKKDGLSAGE 60

Query: 61  YLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFVTSIQNKGDIPMLEDVITLLL 120
           Y+ + K I + ++AI EP+S K H+ Y+  GL  EYN+FVTSIQ + D P ++ + +LLL
Sbjct: 61  YIQRLKSICNSIAAIGEPVSEKGHLIYLFNGLDCEYNSFVTSIQIRSDQPTIDKIHSLLL 120

Query: 121 SYDYRLE 128
           SYD+RLE
Sbjct: 121 SYDFRLE 127

BLAST of CmUC02G034300 vs. ExPASy TrEMBL
Match: A0A7J6E2L1 (Uncharacterized protein OS=Cannabis sativa OX=3483 GN=G4B88_012566 PE=4 SV=1)

HSP 1 Score: 120.2 bits (300), Expect = 6.4e-24
Identity = 58/127 (45.67%), Postives = 88/127 (69.29%), Query Frame = 0

Query: 1   MTILYASLIEEKMGEIVSFTTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQ 60
           M+ LYASL +  + +IV+FTT  +IW SL R+Y ++S+ R    +  +Q ++KDGL  + 
Sbjct: 1   MSWLYASLSDSMLSQIVAFTTAAEIWVSLERTYSTASFARSSDYRTTLQNLKKDGLNASA 60

Query: 61  YLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFVTSIQNKGDIPMLEDVITLLL 120
           YL K K + + L+++ EPIS ++H++Y+L GLG EYNAFVT I  +   P++E+V  LLL
Sbjct: 61  YLQKLKSLCNTLASVGEPISSQEHLTYLLNGLGPEYNAFVTPILARSVKPIIEEVNALLL 120

Query: 121 SYDYRLE 128
           SY+ RLE
Sbjct: 121 SYEARLE 127

BLAST of CmUC02G034300 vs. TAIR 10
Match: AT1G34070.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G48050.1); Has 648 Blast hits to 647 proteins in 29 species: Archae - 0; Bacteria - 0; Metazoa - 16; Fungi - 25; Plants - 607; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 60.1 bits (144), Expect = 1.5e-09
Identity = 30/125 (24.00%), Postives = 65/125 (52.00%), Query Frame = 0

Query: 4   LYASLIEEK-MGEIVSFTTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQYL 63
           LY +L  ++  G  V+ +T+ DIW  +   + ++   R L L ++++      + V  Y 
Sbjct: 76  LYGTLTPKQFQGSFVTSSTSRDIWLRIKNQFRNNKDARALRLDSELRTKDIGDMRVADYY 135

Query: 64  AKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFVTSIQNKGDIPMLEDVITLLLSY 123
            K K ++D L  +  P++ ++ + Y+L GL  +++  +  I+++   P  +D  T+L   
Sbjct: 136 RKMKKLADSLRNVDVPVTDRNLVMYVLNGLNPKFDNIINVIKHRQPFPSFDDAATMLQEE 195

Query: 124 DYRLE 128
           + RL+
Sbjct: 196 EDRLK 200

BLAST of CmUC02G034300 vs. TAIR 10
Match: AT5G48050.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G34070.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 59.3 bits (142), Expect = 2.6e-09
Identity = 32/124 (25.81%), Postives = 63/124 (50.81%), Query Frame = 0

Query: 4   LYASLIEEKMGEIVSF-TTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQYL 63
           +Y ++ +  +  I+    T  D+W SL   +  +   R L  + +++    D L+V +Y 
Sbjct: 78  IYGTITDSLLDTIIKVGCTARDLWLSLENLFRDNKEARALQFENELRTTTIDDLSVHEYC 137

Query: 64  AKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFVTSIQNKGDIPMLEDVITLLLSY 123
            K K +SD L+ +  PIS +  + ++L GL  +Y+  +  I++K   P   +  ++LL  
Sbjct: 138 QKLKSLSDLLTNVDSPISDRVLVMHLLNGLTEKYDYILNVIKHKSPFPSFTEARSMLLME 197

Query: 124 DYRL 127
           + RL
Sbjct: 198 ESRL 201

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022155181.16.3e-3455.91uncharacterized protein LOC111022315 [Momordica charantia][more]
XP_038887133.13.0e-2855.65uncharacterized protein LOC120077323 [Benincasa hispida][more]
PON47862.13.1e-2546.46hypothetical protein TorRG33x02_321990 [Trema orientale][more]
XP_022148871.11.2e-2472.84uncharacterized protein LOC111017438 [Momordica charantia][more]
RVW33435.13.5e-2444.88hypothetical protein CK203_098877 [Vitis vinifera][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A6J1DQX73.1e-3455.91uncharacterized protein LOC111022315 OS=Momordica charantia OX=3673 GN=LOC111022... [more]
A0A2P5BGF81.5e-2546.46Uncharacterized protein OS=Trema orientale OX=63057 GN=TorRG33x02_321990 PE=4 SV... [more]
A0A6J1D6N75.8e-2572.84uncharacterized protein LOC111017438 OS=Momordica charantia OX=3673 GN=LOC111017... [more]
A0A438DD821.7e-2444.88Uncharacterized protein OS=Vitis vinifera OX=29760 GN=CK203_098877 PE=4 SV=1[more]
A0A7J6E2L16.4e-2445.67Uncharacterized protein OS=Cannabis sativa OX=3483 GN=G4B88_012566 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G34070.11.5e-0924.00CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BE... [more]
AT5G48050.12.6e-0925.81CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BE... [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL531) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePFAMPF14223Retrotran_gag_2coord: 2..123
e-value: 1.7E-20
score: 73.2
NoneNo IPR availablePANTHERPTHR47481FAMILY NOT NAMEDcoord: 3..125

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmUC02G034300.1CmUC02G034300.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding