Clc01G23190 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc01G23190
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Location: ClcChr01: 34040384 .. 34043693 (+)
RNA-Seq Expression: Clc01G23190
Synteny: Clc01G23190
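
The Location field packs chromosome, start, end, and strand into one string. A minimal Python sketch of how it might be parsed is given here; the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

# Pattern for strings such as "ClcChr01: 34040384 .. 34043693 (+)".
LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split a location string into (chromosome, start, end, strand)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("ClcChr01: 34040384 .. 34043693 (+)")
print(chrom, strand, span_length(start, end))  # ClcChr01 + 3310
```

Under that assumption the gene spans 3310 bp on the forward strand of ClcChr01.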
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCGGTACCAGGATCAACGCGATTAACGGTGGAAATTTTTCGAGGAAGTATTCTGACCGTTTGATTCCAAAACGGGGTCAGGTGAAAATGGCGATCATAGTGGGTATTGCTCACTCAGTGACTTCGATCTTCTCTCACGGCGGCCGGAAATGACTGTGGTGGCCGAGGAAGCGAACGGGTTAATCGGGAGCGTTGAGTTGGTTTTTTTTTTAAATCGACGGTCCTGATCTTACGATTGTGAGCTGTTCAAATTTTATCGAACGTCACAGATTAGACGCTGAGATCTTCCGAATTATTAATTGAAAGATGAGGAAGAAACTAAGATGGCCTGTTCCCCCATGGCGTCAATCAAATTCTATGTGTGATTTTCTTTTTCTTTTTTTTCTTTTCTTTTCTTTTTTTTTTTTTTTTTTTTGTTAAAGTTGTTAATTGGAGGATTTATGGTGTGAGATGTCTGTAAATTAGCGAATTGCTTAATTTACTGTAAAATAACAATATTCTTCTTCTTGTTGATTTTGCACCGAGATATATAAATGTTTATTTTTCCTGATTACTGCTTGTGCTTAATTCAATTACATATGGGACTTTTCAATTGAAATTTTAGTTGAATTAAAAGAGATAATGATCATAGGGAGTTATGAACTCGAGATTATATTTTCTTTTTAAGCCTATAGAAGAAGGAAGATTAGTGTAGAGGCTATAAATCCTCGACCAATTGGAGTTTGATTAGGTTGGTAATTAAATTATTATTTGGTTTTAATGTACAAATAGAGTAACTCTTTGAAGAAATTTGCAATTCTCGTTTTCTTTTCTTTTGGTTATTGACTTGAGCAGGGCAACTAGAAAACTAAGTTTTTAATTACTATTTGGCGTATTAACTATTCATTAAATTTAATTAAGAATTAAAACTACATAAAAACATGATAAATAGATGGTTGAAGTTAAGTACAAATTTTTAAACACCAAATCAAATAATAACTATCCATTTTAGCAGTTGTGGGTTAGAATTTTCCAAAATGTGTACAAGAGAAAGGGAATGAGGGTATAGATTTTACTAAGAGGACAGATTAAATTCGGGAGTTATTAGATTGACTTGACTTTGAGGTATATTGAACGCATAAAACCTAAATTTAAACAATAATAAATAATAAATAATAATGATAATAATAATGGCCTAAAATGATATTTTAACATGAAATGAATTTGTACATCGATTTTCATTTTTCTTTTTTGGATTTGAGTACGAGTAATATCTATTAGAACTTTGGAATAGCAAAATCTATCTAAAGTTTTTCTTATAGAGAATATAAAGTTAACAGTGACAATCTCACTCAGAAAATACCCAAAGAAAAAATGATATATATATAAAACCACTCACCAGATAAGAAATAGCGACAAAATCCCACAAGTTGGATGCGAATCTTTTTCCCTTTAGAACAATCCAACAGTCTGTCAAATAAATTTGATTTAAAAACTAAATATAATATTAGTTTTGAAAGGTGTGTGTTGGGTAGGCTCCGGAAGTCAGGATCAGGCTCAGTGCATTGGATTGAATTGAACAATGTCATTTTCAATAAAACTTAAAAATTTACTTTATAATTATTTTAAATTACTAAAAATATTAATTTTATAAATTATAAAACCCTTTTGCAAAAATAGCCAAAAATATAACAAAATTTTCGATATAGTATCTTGATATATTTTGCTATTAGTTATAAATATTTTCAACAGTAATTTTACTAAATTGTAAAGATTAAATCTCAAGAAACTTAGAAGGTTGTATCTTTTTATATTCCTCAATATTTTTTTTTAAAAAACCTCTAATATATGTCTCAAAACTTTTAACTTTGTTTTAACAGCTTTATAGACTTTAAAATGTCTAATGAAATAAAGTTCATTCTAAAATGTCTAATGAAATAAAGTTCATTCATAGTTTATAGTAGAGTAGCAAAATTACATCATCATGAATGACTGTAAAACAAATTCAATGTACAATT
TTTCAAACTTCATAGCACAAAATACCACTTTTTTTTTAGATTAATTTCCTAAATTATTCTAAATTCTTTAGAACACACATTGATAAATTATAACGATCAACTACATTTCCTTTAAAAAGATGTAGAGTGAATAAAATATTGAATCTCATCTACCACAAAATAAGATGAACTGAAAATTTTAAATTATCAATAATAGAACTTCTAGAAGCATGATCACCATATTTTATATCAATAAATGAATGGGACCTTCAGGAAATTATTATATTATAGATAACAAATTCGAAAATTATATTTAATAAATAACATACATGAACCTCCATTCCAATTCCGCAACCACAATACCCCGGCGGCGGCCCCTTAGTTGTTACAATCAGTCACTTATATCACTTCCGTCTACGGAGGAAAAAGAAAATATAAAATAAAAAATTTTAAATTAAATTAAATATTATAAATAAATAAATAGATAAAACCGCCGGCTCGTATTGCCGGAAGAGATGCCATGTGGTTCAGGTGAATCGAACAAAAAGCCACCGTGGCCCAGCAGATGTGGGTCAGGAGAACCTCCACCTGGCCATTTACTTTATTTTTAATCTTTTTTTATGAGAAGTTATTTGATTTTGAACACGTTTTTCATTTTATTTTCGTCAATGGATTCAAAAGCATCAAACTCCACTTTCAATTCTTTTTATTTTTTTTTAATTACATGATTGTGTTTGGCTTTATCTTGTTCCCCACGGTCGAATCGAATCCCATTCTTTTTTTCCAAAAACCAAAACCACCCTTGGCCTTTAGAAAAATTCTCAAAATTGTAGGCCAGAGCTGACACCCCTGTGTGGAGAAAGTGGTAATTTCACAAAATATAAATATAAATATAAAATGCAAAAAAAGACAGCTGTTGTGCAAAAGCCAAGACCACCTACTAACAATTTTAAAATAGAGGCCATAAGCAGTGTATAGCAAGTAAGCAATTTCGTCACAAGTGCTTATCTTCAAACAATTATTAAATTATAAATTTTGAATTGGCTCTGTATATAAATCCACCTTATGTATCCCCTTTCAGAGCATCCAAAACTTCAACAACAACAAATTTTCTTTTACTTCCTCTTAAGTCATCTCTCTGGAAAATGGGTGGAGAAGCTAAGATGATGAACAACGTCAACGGTCAGAGATTGGGAAGGAGGATGTCTGGGCGTTTGATCCCAAAGAGAGGGCAAGTGAAGATGGGAATCATGGTGGGGTTGGCTCAAACTGTCACTTCCATCTTCTCTCACAGTCACAATACTCAACAACAACATTGTACATCTACATAG

mRNA sequence

ATGGCCGGTACCAGGATCAACGCGATTAACGGTGGAAATTTTTCGAGGAAGTATTCTGACCGTTTGATTCCAAAACGGGGTCAGATAAAACCGCCGGCTCGTATTGCCGGAAGAGATGCCATGTGGTTCAGAGCATCCAAAACTTCAACAACAACAAATTTTCTTTTACTTCCTCTTAAGTCATCTCTCTGGAAAATGGGTGGAGAAGCTAAGATGATGAACAACGTCAACGGTCAGAGATTGGGAAGGAGGATGTCTGGGCGTTTGATCCCAAAGAGAGGGCAAGTGAAGATGGGAATCATGGTGGGGTTGGCTCAAACTGTCACTTCCATCTTCTCTCACAGTCACAATACTCAACAACAACATTGTACATCTACATAG

Coding sequence (CDS)

ATGGCCGGTACCAGGATCAACGCGATTAACGGTGGAAATTTTTCGAGGAAGTATTCTGACCGTTTGATTCCAAAACGGGGTCAGATAAAACCGCCGGCTCGTATTGCCGGAAGAGATGCCATGTGGTTCAGAGCATCCAAAACTTCAACAACAACAAATTTTCTTTTACTTCCTCTTAAGTCATCTCTCTGGAAAATGGGTGGAGAAGCTAAGATGATGAACAACGTCAACGGTCAGAGATTGGGAAGGAGGATGTCTGGGCGTTTGATCCCAAAGAGAGGGCAAGTGAAGATGGGAATCATGGTGGGGTTGGCTCAAACTGTCACTTCCATCTTCTCTCACAGTCACAATACTCAACAACAACATTGTACATCTACATAG

Protein sequence

MAGTRINAINGGNFSRKYSDRLIPKRGQIKPPARIAGRDAMWFRASKTSTTTNFLLLPLKSSLWKMGGEAKMMNNVNGQRLGRRMSGRLIPKRGQVKMGIMVGLAQTVTSIFSHSHNTQQQHCTST
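
The protein sequence above corresponds to the CDS read codon by codon. The following Python sketch shows one way to check that relationship; it assumes the standard nuclear genetic code (NCBI translation table 1), and the `translate` helper is hypothetical rather than part of the database's pipeline.

```python
from itertools import product

# Standard genetic code (assumed), listed in TCAG order with the
# third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any incomplete trailing codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS shown above.
print(translate("ATGGCCGGTACCAGGATCAAC"))  # MAGTRIN
```

Passing the full CDS string to `translate` should give a sequence that can be compared directly with the protein listed above.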
Homology
BLAST of Clc01G23190 vs. NCBI nr
Match: KAA0052717.1 (hypothetical protein E6C27_scaffold120G003300 [Cucumis melo var. makuwa] >TYK13107.1 hypothetical protein E5676_scaffold255G007170 [Cucumis melo var. makuwa])

HSP 1 Score: 105.1 bits (261), Expect = 4.4e-19
Identity = 52/57 (91.23%), Positives = 56/57 (98.25%), Query Frame = 0

Query: 66  MGGEAKMMNNVNGQRLGRRMSGRLIPKRGQVKMGIMVGLAQTVTSIFSHSHNTQQQH 123
           MG EAKMMNN+NGQRLGRR+SGRLIPKRGQVKMGIMVGLAQTVTSIFS+SH+TQQQH
Sbjct: 1   MGEEAKMMNNINGQRLGRRISGRLIPKRGQVKMGIMVGLAQTVTSIFSNSHSTQQQH 57
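
The identity and positive counts reported for this HSP can be recomputed from the three alignment rows. The Python sketch below assumes the usual BLAST midline convention (a letter marks an identical residue, `+` marks a conservative substitution); `hsp_stats` and `percent` are hypothetical helper names.

```python
def hsp_stats(query, midline, sbjct):
    """Return (identities, positives, alignment_length) for one HSP."""
    if not (len(query) == len(midline) == len(sbjct)):
        raise ValueError("alignment rows must have equal length")
    # Identical residues, skipping gap characters.
    identities = sum(q == s and q != "-" for q, s in zip(query, sbjct))
    # Positives are identities plus positions marked '+' on the midline.
    positives = identities + midline.count("+")
    return identities, positives, len(query)


def percent(count, length):
    """Format a count as a percentage with two decimal places."""
    return f"{100 * count / length:.2f}"


query = "MGGEAKMMNNVNGQRLGRRMSGRLIPKRGQVKMGIMVGLAQTVTSIFSHSHNTQQQH"
midline = "MG EAKMMNN+NGQRLGRR+SGRLIPKRGQVKMGIMVGLAQTVTSIFS+SH+TQQQH"
sbjct = "MGEEAKMMNNINGQRLGRRISGRLIPKRGQVKMGIMVGLAQTVTSIFSNSHSTQQQH"

identities, positives, length = hsp_stats(query, midline, sbjct)
print(identities, positives, length)  # 52 56 57
print(percent(identities, length), percent(positives, length))  # 91.23 98.25
```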

BLAST of Clc01G23190 vs. NCBI nr
Match: KAG7026593.1 (Anaphase-promoting complex subunit 4, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 71.2 bits (173), Expect = 7.0e-09
Identity = 34/50 (68.00%), Positives = 39/50 (78.00%), Query Frame = 0

Query: 73  MNNVNGQRLGRRMSGRLIPKRGQVKMGIMVGLAQTVTSIFSHSHNTQQQH 123
           MNN+NG+R+ RR+ GR IPKRGQVKMGIMVG+  TVTSIFSH H     H
Sbjct: 1   MNNINGERICRRIGGRPIPKRGQVKMGIMVGIVHTVTSIFSHGHGHGHGH 50

BLAST of Clc01G23190 vs. NCBI nr
Match: KAG6603714.1 (hypothetical protein SDJN03_04323, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 65.1 bits (157), Expect = 5.0e-07
Identity = 50/140 (35.71%), Positives = 71/140 (50.71%), Query Frame = 0

Query: 1   MAGTRINAINGGNFSRKYSDRLIPKRGQI--------KPPARIA--GRDAMWF------- 60
           M G+R + I+G  +S + S R IPKRGQ+        K   +I   G   + F       
Sbjct: 1   MGGSRSDDISGATYSPRSSGRPIPKRGQLASLAVGVPKRKGQIGKLGLSLIDFENFEKKK 60

Query: 61  ---RASKTSTTTNF------LLLPLKSSLWKMGGEAKMMNNVNGQRLGRRMSGRLIPKRG 115
              +  K      F      ++LP  +S  +M G +  +  +NG  L R+ S RLIPKRG
Sbjct: 61  ARRQLGKQGLNRQFPLGGISVILPSGTSGTEMAGNS--IKAINGGDLSRKYSDRLIPKRG 120

BLAST of Clc01G23190 vs. NCBI nr
Match: KAG5622653.1 (hypothetical protein H5410_007871 [Solanum commersonii])

HSP 1 Score: 63.5 bits (153), Expect = 1.5e-06
Identity = 50/130 (38.46%), Positives = 66/130 (50.77%), Query Frame = 0

Query: 1   MAGTRINAINGGNFSRKYS-DRLIPKRGQIK---------------PPARIAGRDAMWFR 60
           MAG R   INGGN SR+    R IPKRGQ+K                  R+A  + +  R
Sbjct: 1   MAGVRKEMINGGNISRREEYGRPIPKRGQVKVTIVLGLAHSLSSIFSNRRVAAGEQIHQR 60

Query: 61  ASKTSTTTNFLLLPLKSSLWKMGGEAKMMNNVNGQRLGRRMSGRLIPKRGQVKMGIMVGL 115
           +S       F ++     + KM G  K M N  G  L     GR IPKRGQVK+ I++GL
Sbjct: 61  SS-------FTVIKAHRRILKMPGVRKEMIN-GGNMLRSEEYGRPIPKRGQVKVTIVLGL 120

BLAST of Clc01G23190 vs. NCBI nr
Match: EEF40912.1 (conserved hypothetical protein [Ricinus communis])

HSP 1 Score: 59.3 bits (142), Expect = 2.7e-05
Identity = 29/54 (53.70%), Positives = 44/54 (81.48%), Query Frame = 0

Query: 73  MNNVNGQRLGRRMSGRLIPKRGQVKMGIMVGLAQTVTSIFSHSHNTQQQHCTST 127
           M  ++ +++ RR+SGR IPKRGQVK+GI+VGLA +V SIFSHSH++++   T++
Sbjct: 6   MGMMSDRKVSRRLSGRPIPKRGQVKVGIVVGLAHSVASIFSHSHSSRRAAPTAS 59

BLAST of Clc01G23190 vs. ExPASy TrEMBL
Match: A0A5A7UDZ4 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G007170 PE=4 SV=1)

HSP 1 Score: 105.1 bits (261), Expect = 2.1e-19
Identity = 52/57 (91.23%), Positives = 56/57 (98.25%), Query Frame = 0

Query: 66  MGGEAKMMNNVNGQRLGRRMSGRLIPKRGQVKMGIMVGLAQTVTSIFSHSHNTQQQH 123
           MG EAKMMNN+NGQRLGRR+SGRLIPKRGQVKMGIMVGLAQTVTSIFS+SH+TQQQH
Sbjct: 1   MGEEAKMMNNINGQRLGRRISGRLIPKRGQVKMGIMVGLAQTVTSIFSNSHSTQQQH 57

BLAST of Clc01G23190 vs. ExPASy TrEMBL
Match: A0A6N2L7Q2 (Uncharacterized protein (Fragment) OS=Salix viminalis OX=40686 GN=SVIM_LOCUS151409 PE=4 SV=1)

HSP 1 Score: 62.8 bits (151), Expect = 1.2e-06
Identity = 45/119 (37.82%), Positives = 63/119 (52.94%), Query Frame = 0

Query: 2   AGTRINAINGGNFSRKYSDRLIPKRGQIKPP--ARIAGRDAMWFRASK----TSTTTNFL 61
           AG R   I+G ++S+++  RLIPKRGQ+K      +A   A  F   K        T  +
Sbjct: 55  AGGRSVMISGVSYSQRFYGRLIPKRGQVKVAIVMGLAHTFASMFSPCKKCGAAQRATRLI 114

Query: 62  LLPLKSSLWKMGGEAKMMNNVNGQRLGRRMSGRLIPKRGQVKMGIMVGLAQTVTSIFSH 115
            L       +M G A+ +   NG +  R  + R IPKRGQVK+ I VGLA + +S+FSH
Sbjct: 115 SLEETKKESEMAG-ARWLTIFNGGKFYRGFASRPIPKRGQVKVAIAVGLAHSFSSVFSH 172

BLAST of Clc01G23190 vs. ExPASy TrEMBL
Match: B9S603 (Uncharacterized protein OS=Ricinus communis OX=3988 GN=RCOM_1062910 PE=4 SV=1)

HSP 1 Score: 59.3 bits (142), Expect = 1.3e-05
Identity = 29/54 (53.70%), Positives = 44/54 (81.48%), Query Frame = 0

Query: 73  MNNVNGQRLGRRMSGRLIPKRGQVKMGIMVGLAQTVTSIFSHSHNTQQQHCTST 127
           M  ++ +++ RR+SGR IPKRGQVK+GI+VGLA +V SIFSHSH++++   T++
Sbjct: 6   MGMMSDRKVSRRLSGRPIPKRGQVKVGIVVGLAHSVASIFSHSHSSRRAAPTAS 59

BLAST of Clc01G23190 vs. ExPASy TrEMBL
Match: A0A0A0KLE0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G517000 PE=4 SV=1)

HSP 1 Score: 56.2 bits (134), Expect = 1.1e-04
Identity = 27/42 (64.29%), Positives = 34/42 (80.95%), Query Frame = 0

Query: 73  MNNVNGQRLGRRMSGRLIPKRGQVKMGIMVGLAQTVTSIFSH 115
           +N +N   L R+ S RLIPKRGQVK+GI+VG+A +VTSIFSH
Sbjct: 6   INAINHGNLSRKCSDRLIPKRGQVKLGIIVGIAHSVTSIFSH 47

BLAST of Clc01G23190 vs. ExPASy TrEMBL
Match: A0A2H5N387 (Uncharacterized protein OS=Citrus unshiu OX=55188 GN=CUMW_011210 PE=4 SV=1)

HSP 1 Score: 55.8 bits (133), Expect = 1.5e-04
Identity = 29/49 (59.18%), Positives = 36/49 (73.47%), Query Frame = 0

Query: 76  VNGQRLGRRM-----SGRLIPKRGQVKMGIMVGLAQTVTSIFSHSHNTQ 120
           +NG  + RR+     SGR IPKRGQVKM I+VGLA +V SIFSHSH+ +
Sbjct: 9   INGGNISRRLTAGGGSGRPIPKRGQVKMAILVGLAHSVASIFSHSHSNR 57

The following BLAST results are available for this feature:

NCBI nr:
Match Name     E-value   Identity (%)   Description
KAA0052717.1   4.4e-19   91.23          hypothetical protein E6C27_scaffold120G003300 [Cucumis melo var. makuwa] >TYK131...
KAG7026593.1   7.0e-09   68.00          Anaphase-promoting complex subunit 4, partial [Cucurbita argyrosperma subsp. arg...
KAG6603714.1   5.0e-07   35.71          hypothetical protein SDJN03_04323, partial [Cucurbita argyrosperma subsp. sorori...
KAG5622653.1   1.5e-06   38.46          hypothetical protein H5410_007871 [Solanum commersonii]
EEF40912.1     2.7e-05   53.70          conserved hypothetical protein [Ricinus communis]

ExPASy TrEMBL:
Match Name     E-value   Identity (%)   Description
A0A5A7UDZ4     2.1e-19   91.23          Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A6N2L7Q2     1.2e-06   37.82          Uncharacterized protein (Fragment) OS=Salix viminalis OX=40686 GN=SVIM_LOCUS1514...
B9S603         1.3e-05   53.70          Uncharacterized protein OS=Ricinus communis OX=3988 GN=RCOM_1062910 PE=4 SV=1
A0A0A0KLE0     1.1e-04   64.29          Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G517000 PE=4 SV=1
A0A2H5N387     1.5e-04   59.18          Uncharacterized protein OS=Citrus unshiu OX=55188 GN=CUMW_011210 PE=4 SV=1

InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term: None
IPR Description: No IPR available
Source: PANTHER
Source Term: PTHR36615
Source Description: PROTEIN, PUTATIVE-RELATED
Alignment: coord: 1..30; coord: 74..121

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Clc01G23190.1   Clc01G23190.1   mRNA