Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGCCTTCAACTGTCATAAATGGGCAGAATCCTTTCAGCATCCTAAATCTCAAGCAACCTGATTATCACAGTTTAAGAACCTTTGGATTAACTTGTTATCCTTGCTTAAGACCATATCAGCAGCATAAATTTGATTTTCACACTGAGAAATGTGTCTTCATAAGCTACAATGATCACCATAAAGGCTACCGGTGTCTTAGTCCGGCTAGCAGAATTTATAATTCTCGTCATGTGTGCATTAATGAAACTAAATTTCCATATCAACAACTGTTTTCAGGCTTTAGCAGTGAAGGCTCATCCATTGGCAACACAATTCTCTCTTGGCTGCCAGTTGCTGCTCCGTCTTCCATCCCTCAATCAGATCTTATGGCAGTACCACACACCAGCCAGCCCTTGGTTCCTATATCTTCAGAGAATTCCCCTTCATATACTGTACAAAATCACAGTGGAGATGCCCCGCTGGTTCTCCTCAAAATGTTTCTCACTATCAAGCTGCACCTTTACACTCTCCTACCTCTAACTCTGCAATATCTCACCATGGTAGGACAACAATTACATCAGCAACATGAGAATATCCTCCTCCTACTACAGCTAGCAATGCTCATCCCATGGTTACACGTGCTAAGGGCTGGAATTTCCAAACCTAAACAGTTCTTTGGTGGCTTTGCTCAAATCTCTTCTGCTATAGATTGA
mRNA sequence
ATGCCTTCAACTGTCATAAATGGGCAGAATCCTTTCAGCATCCTAAATCTCAAGCAACCTGATTATCACAGTTTAAGAACCTTTGGATTAACTTGTTATCCTTGCTTAAGACCATATCAGCAGCATAAATTTGATTTTCACACTGAGAAATGTGTCTTCATAAGCTACAATGATCACCATAAAGGCTACCGGTGTCTTAGTCCGGCTAGCAGAATTTATAATTCTCGTCATGTGTGCATTAATGAAACTAAATTTCCATATCAACAACTGTTTTCAGGCTTTAGCAGTGAAGGCTCATCCATTGGCAACACAATTCTCTCTTGGCTGCCAGTTGCTGCTCCGTCTTCCATCCCTCAATCAGATCTTATGGCAGTACCACACACCAGCCAGCCCTTGGTTCCTATATCTTCAGAGAATTCCCCTTCATATACTGTACAAAATCACAGTGGAGATGCCCCGCTGGTTCTCCTCAAAATGTTTCTCACTATCAAGCTGCACCTTTACACTCTCCTACCTCTAACTCTGCAATATCTCACCATGGTAGGACAACAATTACATCAGCAACATGAGAATATCCTCCTCCTACTACAGCTAGCAATGCTCATCCCATGGTTACACGTGCTAAGGGCTGGAATTTCCAAACCTAAACAGTTCTTTGGTGGCTTTGCTCAAATCTCTTCTGCTATAGATTGA
Coding sequence (CDS)
ATGCCTTCAACTGTCATAAATGGGCAGAATCCTTTCAGCATCCTAAATCTCAAGCAACCTGATTATCACAGTTTAAGAACCTTTGGATTAACTTGTTATCCTTGCTTAAGACCATATCAGCAGCATAAATTTGATTTTCACACTGAGAAATGTGTCTTCATAAGCTACAATGATCACCATAAAGGCTACCGGTGTCTTAGTCCGGCTAGCAGAATTTATAATTCTCGTCATGTGTGCATTAATGAAACTAAATTTCCATATCAACAACTGTTTTCAGGCTTTAGCAGTGAAGGCTCATCCATTGGCAACACAATTCTCTCTTGGCTGCCAGTTGCTGCTCCGTCTTCCATCCCTCAATCAGATCTTATGGCAGTACCACACACCAGCCAGCCCTTGGTTCCTATATCTTCAGAGAATTCCCCTTCATATACTGTACAAAATCACAGTGGAGATGCCCCGCTGGTTCTCCTCAAAATGTTTCTCACTATCAAGCTGCACCTTTACACTCTCCTACCTCTAACTCTGCAATATCTCACCATGGTAGGACAACAATTACATCAGCAACATGAGAATATCCTCCTCCTACTACAGCTAGCAATGCTCATCCCATGGTTACACGTGCTAAGGGCTGGAATTTCCAAACCTAAACAGTTCTTTGGTGGCTTTGCTCAAATCTCTTCTGCTATAGATTGA
Protein sequence
MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHHKGYRCLSPASRIYNSRHVCINETKFPYQQLFSGFSSEGSSIGNTILSWLPVAAPSSIPQSDLMAVPHTSQPLVPISSENSPSYTVQNHSGDAPLVLLKMFLTIKLHLYTLLPLTLQYLTMVGQQLHQQHENILLLLQLAMLIPWLHVLRAGISKPKQFFGGFAQISSAID
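As a quick consistency check, the coding sequence above can be translated and compared with the listed protein sequence. The sketch below is illustrative only: `CODON_TABLE` and `translate` are hypothetical names that do not come from the source database, and the code assumes the standard nuclear genetic code (NCBI translation table 1).

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1).
# Codons are enumerated in TCAG order, third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First nine codons of the CDS listed above.
print(translate("ATGCCTTCAACTGTCATAAATGGGCAG"))  # MPSTVINGQ
```

Passing the full CDS string to `translate` and comparing the result with the protein sequence is one way to confirm that the two records agree.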
Homology
BLAST of Sgr018121 vs. NCBI nr
Match:
QHO25178.1 (Copia-like retrotransposon Hopscotch polyprotein [Arachis hypogaea])
HSP 1 Score: 124.4 bits (311), Expect = 1.3e-24
Identity = 65/156 (41.67%), Positives = 90/156 (57.69%), Query Frame = 0
Query: 1 MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHH 60
+PS +P+ +LN + PDY L+ FG +C+P LRPYQ HKFDF T KC+F+ Y+ HH
Sbjct: 735 LPSATTQYISPYELLNNRAPDYRFLKIFGCSCFPQLRPYQPHKFDFKTHKCLFLGYSPHH 794
Query: 61 KGYRCLSPASRIYNSRHVCINETKFPYQQLFSGFSS--EGSSIGNTILSWLPVAAPSSIP 120
KGY+CL P+ ++Y +RHV E+KFPYQ LF S + S T L +P+
Sbjct: 795 KGYKCLCPSGKLYVARHVIFYESKFPYQLLFFNKDSNLKASVPHTTKLITIPIHVRH--- 854
Query: 121 QSDLMAVPHTSQPLVP-----ISSENSPSYTVQNHS 150
L+ +PH S P P I S +SP+ V + S
Sbjct: 855 PPVLLEIPHESPPSTPAPTTLIPSTDSPATAVPSSS 887
BLAST of Sgr018121 vs. NCBI nr
Match:
TXG58227.1 (hypothetical protein EZV62_016056 [Acer yangbiense])
HSP 1 Score: 123.2 bits (308), Expect = 2.8e-24
Identity = 67/149 (44.97%), Positives = 89/149 (59.73%), Query Frame = 0
Query: 1 MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHH 60
+PS+V+N +PF L ++P+Y L+TFG +C+P LR Y +HKFDFH+ KC+F+ Y+ H
Sbjct: 1241 LPSSVLNFMSPFEKLFSRKPNYAFLKTFGCSCFPYLRYYSKHKFDFHSAKCIFLGYSMSH 1300
Query: 61 KGYRCLSPASRIYNSRHVCINETKFPYQQLFS-GFSSEGSSIG-------NTILSWLPVA 120
KGY+CL P+ +IY SRHV NET+FPY LFS SS+ SS G N S P
Sbjct: 1301 KGYKCLHPSGKIYVSRHVVFNETEFPYPLLFSTKVSSQMSSTGLPIDILQNASTSLSPSV 1360
Query: 121 APSSIPQSDLMAVPHTSQPLVPISSENSP 142
P P L PH S + S +SP
Sbjct: 1361 IPCCAP---LAHSPHASGTISNSSDVDSP 1386
BLAST of Sgr018121 vs. NCBI nr
Match:
PNX92571.1 (histone deacetylase [Trifolium pratense])
HSP 1 Score: 123.2 bits (308), Expect = 2.8e-24
Identity = 60/156 (38.46%), Positives = 91/156 (58.33%), Query Frame = 0
Query: 1 MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHH 60
+PS+ IN Q P+ +L + PDYH L+ FG C+P LRPY HK +F +++C+F+ Y+ H
Sbjct: 764 LPSSSINFQTPYFLLFKQHPDYHFLKVFGCACFPLLRPYHNHKLEFRSQECLFLGYSPSH 823
Query: 61 KGYRCLSPASRIYNSRHVCINETKFPYQQLFSGFSSEGSSIGNTILSWLPVAAPSSIPQS 120
KGYRCLSP+ R+Y S+ V NE++FPY++LF S S + P+ SI +
Sbjct: 824 KGYRCLSPSGRLYVSKDVLFNESRFPYKELFPISSGSSHSPPSKSFKLPPLPTFPSI-TT 883
Query: 121 DLMA-----VPHTSQPLVPISSENSPSYTVQNHSGD 152
D+ + PH S P PI+ + P+ + + D
Sbjct: 884 DITSPLPPTAPHISSPPTPINDPSPPNSPLSATASD 918
BLAST of Sgr018121 vs. NCBI nr
Match:
TXG57080.1 (hypothetical protein EZV62_018393 [Acer yangbiense])
HSP 1 Score: 123.2 bits (308), Expect = 2.8e-24
Identity = 67/149 (44.97%), Positives = 89/149 (59.73%), Query Frame = 0
Query: 1 MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHH 60
+PS+V+N +PF L ++P+Y L+TFG +C+P LR Y +HKFDFH+ KC+F+ Y+ H
Sbjct: 257 LPSSVLNFMSPFEKLFSRKPNYAFLKTFGCSCFPYLRYYSKHKFDFHSAKCIFLGYSMSH 316
Query: 61 KGYRCLSPASRIYNSRHVCINETKFPYQQLFS-GFSSEGSSIG-------NTILSWLPVA 120
KGY+CL P+ +IY SRHV NET+FPY LFS SS+ SS G N S P
Sbjct: 317 KGYKCLHPSGKIYVSRHVVFNETEFPYPLLFSTKVSSQMSSTGLPIDILQNASTSLSPSV 376
Query: 121 APSSIPQSDLMAVPHTSQPLVPISSENSP 142
P P L PH S + S +SP
Sbjct: 377 IPCCAP---LAHSPHASGTISNSSDVDSP 402
BLAST of Sgr018121 vs. NCBI nr
Match:
TXG56026.1 (hypothetical protein EZV62_017339 [Acer yangbiense])
HSP 1 Score: 123.2 bits (308), Expect = 2.8e-24
Identity = 67/149 (44.97%), Positives = 89/149 (59.73%), Query Frame = 0
Query: 1 MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHH 60
+PS+V+N +PF L ++P+Y L+TFG +C+P LR Y +HKFDFH+ KC+F+ Y+ H
Sbjct: 257 LPSSVLNFMSPFEKLFSRKPNYAFLKTFGCSCFPYLRYYSKHKFDFHSAKCIFLGYSMSH 316
Query: 61 KGYRCLSPASRIYNSRHVCINETKFPYQQLFS-GFSSEGSSIG-------NTILSWLPVA 120
KGY+CL P+ +IY SRHV NET+FPY LFS SS+ SS G N S P
Sbjct: 317 KGYKCLHPSGKIYVSRHVVFNETEFPYPLLFSTKVSSQMSSTGLPIDILQNASTSLSPSV 376
Query: 121 APSSIPQSDLMAVPHTSQPLVPISSENSP 142
P P L PH S + S +SP
Sbjct: 377 IPCCAP---LAHSPHASGTISNSSDVDSP 402
BLAST of Sgr018121 vs. ExPASy Swiss-Prot
Match:
Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)
HSP 1 Score: 88.2 bits (217), Expect = 1.3e-16
Identity = 55/168 (32.74%), Positives = 82/168 (48.81%), Query Frame = 0
Query: 1 MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHH 60
+P+ ++ Q+PF L + P+Y L+ FG CYP LRPY +HK + +++C F+ Y+
Sbjct: 643 LPTPLLQLQSPFQKLFGQPPNYEKLKVFGCACYPWLRPYNRHKLEDKSKQCAFMGYSLTQ 702
Query: 61 KGYRCLS-PASRIYNSRHVCINETKFPYQQLFSGFSSEGSSIGNTILSW-----LPV--- 120
Y CL P R+Y SRHV +E FP+ G S+ ++ +W LP
Sbjct: 703 SAYLCLHIPTGRLYTSRHVQFDERCFPFSTTNFGVSTSQEQRSDSAPNWPSHTTLPTTPL 762
Query: 121 ---AAPSSIPQSDLMAVPHTS-QPL--VPISSENSPSYTVQNHSGDAP 154
A P P D P +S PL +SS N PS ++ + S P
Sbjct: 763 VLPAPPCLGPHLDTSPRPPSSPSPLCTTQVSSSNLPSSSISSPSSSEP 810
BLAST of Sgr018121 vs. ExPASy Swiss-Prot
Match:
Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)
HSP 1 Score: 85.9 bits (211), Expect = 6.6e-16
Identity = 50/149 (33.56%), Positives = 72/149 (48.32%), Query Frame = 0
Query: 1 MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHH 60
+P+ ++ ++PF L P+Y LR FG CYP LRPY QHK D + +CVF+ Y+
Sbjct: 664 LPTPLLQLESPFQKLFGTSPNYDKLRVFGCACYPWLRPYNQHKLDDKSRQCVFLGYSLTQ 723
Query: 61 KGYRCLS-PASRIYNSRHVCINETKFPYQQLFSGFSSEGSSIGNTILSWLP-VAAPSSIP 120
Y CL SR+Y SRHV +E FP+ + S + W P P+ P
Sbjct: 724 SAYLCLHLQTSRLYISRHVRFDENCFPFSNYLATLSPVQEQRRESSCVWSPHTTLPTRTP 783
Query: 121 QSDLMAVPHTSQPLVPISSENSPSYTVQN 148
++ P S P + +SPS +N
Sbjct: 784 ---VLPAPSCSDPHHAATPPSSPSAPFRN 809
BLAST of Sgr018121 vs. ExPASy TrEMBL
Match:
A0A803Q615 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)
HSP 1 Score: 124.8 bits (312), Expect = 4.7e-25
Identity = 62/153 (40.52%), Positives = 87/153 (56.86%), Query Frame = 0
Query: 1 MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHH 60
+P+ V+ G++P +L K+PDY L+TFG TCYPCLRPYQ HKF +H+ KCV + Y+D H
Sbjct: 128 LPTPVLKGKSPLEVLFGKKPDYKFLKTFGCTCYPCLRPYQSHKFQYHSTKCVNLGYSDRH 187
Query: 61 KGYRCLSPASRIYNSRHVCINETKFPYQQLFSGFSSEGSSIGNTILSWLPVAAPSSIPQS 120
KGY+CLS R+Y SR+V NE +FP+ F ++ ++ SW + +IP S
Sbjct: 188 KGYKCLSSTGRLYISRNVIFNEDEFPFLTGFLNTHQNEQTVHVSVPSW---STMLNIPLS 247
Query: 121 DLMAVPHTSQPLVPI----SSENSPSYTVQNHS 150
S+P P SE PS H+
Sbjct: 248 SSQTPSVPSEPETPTPPADDSEEPPSAPQSTHN 277
BLAST of Sgr018121 vs. ExPASy TrEMBL
Match:
A0A2K3MP35 (Histone deacetylase OS=Trifolium pratense OX=57577 GN=L195_g015711 PE=4 SV=1)
HSP 1 Score: 123.2 bits (308), Expect = 1.4e-24
Identity = 60/156 (38.46%), Positives = 91/156 (58.33%), Query Frame = 0
Query: 1 MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHH 60
+PS+ IN Q P+ +L + PDYH L+ FG C+P LRPY HK +F +++C+F+ Y+ H
Sbjct: 764 LPSSSINFQTPYFLLFKQHPDYHFLKVFGCACFPLLRPYHNHKLEFRSQECLFLGYSPSH 823
Query: 61 KGYRCLSPASRIYNSRHVCINETKFPYQQLFSGFSSEGSSIGNTILSWLPVAAPSSIPQS 120
KGYRCLSP+ R+Y S+ V NE++FPY++LF S S + P+ SI +
Sbjct: 824 KGYRCLSPSGRLYVSKDVLFNESRFPYKELFPISSGSSHSPPSKSFKLPPLPTFPSI-TT 883
Query: 121 DLMA-----VPHTSQPLVPISSENSPSYTVQNHSGD 152
D+ + PH S P PI+ + P+ + + D
Sbjct: 884 DITSPLPPTAPHISSPPTPINDPSPPNSPLSATASD 918
BLAST of Sgr018121 vs. ExPASy TrEMBL
Match:
A0A5C7HMG8 (Integrase catalytic domain-containing protein OS=Acer yangbiense OX=1000413 GN=EZV62_016056 PE=4 SV=1)
HSP 1 Score: 123.2 bits (308), Expect = 1.4e-24
Identity = 67/149 (44.97%), Positives = 89/149 (59.73%), Query Frame = 0
Query: 1 MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHH 60
+PS+V+N +PF L ++P+Y L+TFG +C+P LR Y +HKFDFH+ KC+F+ Y+ H
Sbjct: 1241 LPSSVLNFMSPFEKLFSRKPNYAFLKTFGCSCFPYLRYYSKHKFDFHSAKCIFLGYSMSH 1300
Query: 61 KGYRCLSPASRIYNSRHVCINETKFPYQQLFS-GFSSEGSSIG-------NTILSWLPVA 120
KGY+CL P+ +IY SRHV NET+FPY LFS SS+ SS G N S P
Sbjct: 1301 KGYKCLHPSGKIYVSRHVVFNETEFPYPLLFSTKVSSQMSSTGLPIDILQNASTSLSPSV 1360
Query: 121 APSSIPQSDLMAVPHTSQPLVPISSENSP 142
P P L PH S + S +SP
Sbjct: 1361 IPCCAP---LAHSPHASGTISNSSDVDSP 1386
BLAST of Sgr018121 vs. ExPASy TrEMBL
Match:
A0A5C7HIM7 (Integrase catalytic domain-containing protein OS=Acer yangbiense OX=1000413 GN=EZV62_017339 PE=4 SV=1)
HSP 1 Score: 123.2 bits (308), Expect = 1.4e-24
Identity = 67/149 (44.97%), Positives = 89/149 (59.73%), Query Frame = 0
Query: 1 MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHH 60
+PS+V+N +PF L ++P+Y L+TFG +C+P LR Y +HKFDFH+ KC+F+ Y+ H
Sbjct: 257 LPSSVLNFMSPFEKLFSRKPNYAFLKTFGCSCFPYLRYYSKHKFDFHSAKCIFLGYSMSH 316
Query: 61 KGYRCLSPASRIYNSRHVCINETKFPYQQLFS-GFSSEGSSIG-------NTILSWLPVA 120
KGY+CL P+ +IY SRHV NET+FPY LFS SS+ SS G N S P
Sbjct: 317 KGYKCLHPSGKIYVSRHVVFNETEFPYPLLFSTKVSSQMSSTGLPIDILQNASTSLSPSV 376
Query: 121 APSSIPQSDLMAVPHTSQPLVPISSENSP 142
P P L PH S + S +SP
Sbjct: 377 IPCCAP---LAHSPHASGTISNSSDVDSP 402
BLAST of Sgr018121 vs. ExPASy TrEMBL
Match:
A0A5C7HJ99 (Integrase catalytic domain-containing protein OS=Acer yangbiense OX=1000413 GN=EZV62_018393 PE=4 SV=1)
HSP 1 Score: 123.2 bits (308), Expect = 1.4e-24
Identity = 67/149 (44.97%), Positives = 89/149 (59.73%), Query Frame = 0
Query: 1 MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHH 60
+PS+V+N +PF L ++P+Y L+TFG +C+P LR Y +HKFDFH+ KC+F+ Y+ H
Sbjct: 257 LPSSVLNFMSPFEKLFSRKPNYAFLKTFGCSCFPYLRYYSKHKFDFHSAKCIFLGYSMSH 316
Query: 61 KGYRCLSPASRIYNSRHVCINETKFPYQQLFS-GFSSEGSSIG-------NTILSWLPVA 120
KGY+CL P+ +IY SRHV NET+FPY LFS SS+ SS G N S P
Sbjct: 317 KGYKCLHPSGKIYVSRHVVFNETEFPYPLLFSTKVSSQMSSTGLPIDILQNASTSLSPSV 376
Query: 121 APSSIPQSDLMAVPHTSQPLVPISSENSP 142
P P L PH S + S +SP
Sbjct: 377 IPCCAP---LAHSPHASGTISNSSDVDSP 402
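Each HSP header above reports identity and positives as a count over the alignment length, followed by the same value as a percentage. The sketch below shows one way to recompute those percentages from the counts. The names `HSP_STATS` and `hsp_percentages` are hypothetical, and the regular expression assumes the header layout shown in this report (it accepts both the "Positives" and "Postives" spellings).

```python
import re

# Matches e.g. "Identity = 65/156 (41.67%), Positives = 90/156 (57.69%)".
# Layout assumed from the HSP headers in this report.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def hsp_percentages(line: str) -> dict:
    """Recompute identity/positives percentages from the reported counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("line does not look like an HSP statistics header")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        # percentage = 100 * matching positions / alignment length
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


stats = hsp_percentages(
    "Identity = 65/156 (41.67%), Positives = 90/156 (57.69%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])  # 41.67 57.69
```

The alignment length in the denominator counts gap columns as well, which is why it can exceed the number of query residues covered by the HSP.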
The following BLAST results are available for this feature:
NCBI nr
Match Name | E-value | Identity (%) | Description
QHO25178.1 | 1.3e-24 | 41.67 | Copia-like retrotransposon Hopscotch polyprotein [Arachis hypogaea]
TXG58227.1 | 2.8e-24 | 44.97 | hypothetical protein EZV62_016056 [Acer yangbiense]
PNX92571.1 | 2.8e-24 | 38.46 | histone deacetylase [Trifolium pratense]
TXG57080.1 | 2.8e-24 | 44.97 | hypothetical protein EZV62_018393 [Acer yangbiense]
TXG56026.1 | 2.8e-24 | 44.97 | hypothetical protein EZV62_017339 [Acer yangbiense]
ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q9ZT94 | 1.3e-16 | 32.74 | Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
Q94HW2 | 6.6e-16 | 33.56 | Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A803Q615 | 4.7e-25 | 40.52 | Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1
A0A2K3MP35 | 1.4e-24 | 38.46 | Histone deacetylase OS=Trifolium pratense OX=57577 GN=L195_g015711 PE=4 SV=1
A0A5C7HMG8 | 1.4e-24 | 44.97 | Integrase catalytic domain-containing protein OS=Acer yangbiense OX=1000413 GN=E...
A0A5C7HIM7 | 1.4e-24 | 44.97 | Integrase catalytic domain-containing protein OS=Acer yangbiense OX=1000413 GN=E...
A0A5C7HJ99 | 1.4e-24 | 44.97 | Integrase catalytic domain-containing protein OS=Acer yangbiense OX=1000413 GN=E...