Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTCGCTACGAGAATGGAGTCTCGAGTTGAAGCAGTGGAACAACAAATTTCCGGGGTGGTTTCGGCCATAGAAGATAGTAGGGAATCGTGGAAGAAGGAAATGGCATTGTTTAAGGATGAGATGCGTCTGTGGATGACAGCGGTATCACAGAGGATGGGAATGGAAGAGAAGACTAGGGACGACAAAGGGAAAGGGCTGGAGAAAATCGATGTGGGGGAGAAGACGCCTCAGAGTATTGCGACTTCAAGTGATGGGTCGAATCAACTCACGAACGCGCTAGCCCCGGTATTCGATCTCCGTTTACGCAAGTTGGAGGTACCTATTTTTGAGGGGGAAAATCCCGATGCGTGGCTACACCGTGTGGCCCGATATTTTCGGATTAATCGATTGACGGATGACGAGAAATTAGAGGCGGCGGTGCTCTGTTTGGACGGTGAGGCTTTGGCTTGGCATCAGTGGGAGGAGAGGAAGAATCCGATACAGACTTGGGAGGAGTTCCGGCTATTGTTATTGCAGCGTTTCCGACCAACCTTGGAAGGGACTCTGTGCGACCGATTCATGGCAGTGAAGCAGGAGACGACGGTGAGGGATTACCGCTGA
mRNA sequence
ATGGTCGCTACGAGAATGGAGTCTCGAGTTGAAGCAGTGGAACAACAAATTTCCGGGGTGGTTTCGGCCATAGAAGATAGTAGGGAATCGTGGAAGAAGGAAATGGCATTGTTTAAGGATGAGATGCGTCTGTGGATGACAGCGGTATCACAGAGGATGGGAATGGAAGAGAAGACTAGGGACGACAAAGGGAAAGGGCTGGAGAAAATCGATGTGGGGGAGAAGACGCCTCAGAGTATTGCGACTTCAAGTGATGGGTCGAATCAACTCACGAACGCGCTAGCCCCGGTATTCGATCTCCGTTTACGCAAGTTGGAGGTACCTATTTTTGAGGGGGAAAATCCCGATGCGTGGCTACACCGTGTGGCCCGATATTTTCGGATTAATCGATTGACGGATGACGAGAAATTAGAGGCGGCGGTGCTCTGTTTGGACGGTGAGGCTTTGGCTTGGCATCAGTGGGAGGAGAGGAAGAATCCGATACAGACTTGGGAGGAGTTCCGGCTATTGTTATTGCAGCGTTTCCGACCAACCTTGGAAGGGACTCTGTGCGACCGATTCATGGCAGTGAAGCAGGAGACGACGGTGAGGGATTACCGCTGA
Coding sequence (CDS)
ATGGTCGCTACGAGAATGGAGTCTCGAGTTGAAGCAGTGGAACAACAAATTTCCGGGGTGGTTTCGGCCATAGAAGATAGTAGGGAATCGTGGAAGAAGGAAATGGCATTGTTTAAGGATGAGATGCGTCTGTGGATGACAGCGGTATCACAGAGGATGGGAATGGAAGAGAAGACTAGGGACGACAAAGGGAAAGGGCTGGAGAAAATCGATGTGGGGGAGAAGACGCCTCAGAGTATTGCGACTTCAAGTGATGGGTCGAATCAACTCACGAACGCGCTAGCCCCGGTATTCGATCTCCGTTTACGCAAGTTGGAGGTACCTATTTTTGAGGGGGAAAATCCCGATGCGTGGCTACACCGTGTGGCCCGATATTTTCGGATTAATCGATTGACGGATGACGAGAAATTAGAGGCGGCGGTGCTCTGTTTGGACGGTGAGGCTTTGGCTTGGCATCAGTGGGAGGAGAGGAAGAATCCGATACAGACTTGGGAGGAGTTCCGGCTATTGTTATTGCAGCGTTTCCGACCAACCTTGGAAGGGACTCTGTGCGACCGATTCATGGCAGTGAAGCAGGAGACGACGGTGAGGGATTACCGCTGA
Protein sequence
MVATRMESRVEAVEQQISGVVSAIEDSRESWKKEMALFKDEMRLWMTAVSQRMGMEEKTRDDKGKGLEKIDVGEKTPQSIATSSDGSNQLTNALAPVFDLRLRKLEVPIFEGENPDAWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWEEFRLLLLQRFRPTLEGTLCDRFMAVKQETTVRDYR
Homology
BLAST of Moc06g16550 vs. NCBI nr
Match:
XP_022897442.1 (uncharacterized protein LOC111411108 [Olea europaea var. sylvestris])
HSP 1 Score: 152.9 bits (385), Expect = 2.9e-33
Identity = 79/198 (39.90%), Postives = 118/198 (59.60%), Query Frame = 0
Query: 6 MESRVEAVEQQISGVVSAIEDSRESWKKEMALFKDEMRLWMTAVSQRMGMEEKTRDDKGK 65
+ RV +EQ++ ++ S + E L M + ++ +EK R +KG
Sbjct: 50 LTDRVGELEQRVENHYQELQAGLGSLQSEFRLMHAGMTERFEILMRKWDTQEKERKEKGI 109
Query: 66 GLEKIDVGEKTPQSIATSSDGSNQLTNALAPV---FDLRLRKLEVPIFEGENPDAWLHRV 125
EK + S SNQ A + R+R+LE+P+FEG +PD W+ RV
Sbjct: 110 TSEKSPT-LSAESGLTQSRPSSNQCVGGSAGSEWRGEPRIRRLEMPVFEGNDPDGWVFRV 169
Query: 126 ARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWEEFRLLLLQRFRPTLEGT 185
RYF +NRL+++EKLEAA +C DGEALAW QWEER+ P++ WE+ + LL+RFRP+ EG+
Sbjct: 170 ERYFSVNRLSEEEKLEAAAVCFDGEALAWFQWEERRRPVKAWEDLKAHLLRRFRPSQEGS 229
Query: 186 LCDRFMAVKQETTVRDYR 201
LC +F++++Q TTVR+YR
Sbjct: 230 LCAQFLSLQQTTTVREYR 246
BLAST of Moc06g16550 vs. NCBI nr
Match:
EXB38291.1 (hypothetical protein L484_013924 [Morus notabilis])
HSP 1 Score: 138.7 bits (348), Expect = 5.7e-29
Identity = 56/102 (54.90%), Postives = 84/102 (82.35%), Query Frame = 0
Query: 99 DLRLRKLEVPIFEGENPDAWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERK 158
+L R++E+P+F+GENPD W R RYF +N++T+ EKL+ AV+ L+GEALAW QWE+ +
Sbjct: 861 ELCTRRVEMPVFDGENPDGWSIRAERYFAMNKMTEREKLDVAVVSLEGEALAWFQWEDGR 920
Query: 159 NPIQTWEEFRLLLLQRFRPTLEGTLCDRFMAVKQETTVRDYR 201
+PI++W +L+LL+RFRP EG+LC++F++++QETTVRDYR
Sbjct: 921 SPIRSWMVLKLMLLERFRPMQEGSLCEKFLSLRQETTVRDYR 962
BLAST of Moc06g16550 vs. NCBI nr
Match:
TXG60193.1 (hypothetical protein EZV62_014766 [Acer yangbiense])
HSP 1 Score: 137.5 bits (345), Expect = 1.3e-28
Identity = 72/204 (35.29%), Postives = 115/204 (56.37%), Query Frame = 0
Query: 1 MVATRMESRVEAVEQQISGVVSAIEDSRESWKKEMALFKDEMRLWMTAVSQRMGMEEK-- 60
M M+SR++ +E+ + V RE ++E+ KDE+ + M+E+
Sbjct: 1 MTNKNMQSRMDTMEEVVVTV-------REDLQREVGTIKDEIMAMNKRFDHLLQMQEEAA 60
Query: 61 --TRDDKGKGLEKIDVGEKTPQSIATSSDGSNQLTNALAPVFDLRLRKLEVPIFEGENPD 120
GKG E + GS +N D R RKLE+P+F+G NPD
Sbjct: 61 RVAASSSGKGKESTSRVPTVGGPAIGVTHGSEGTSNVFEHRRDFRFRKLEMPVFDGTNPD 120
Query: 121 AWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWEEFRLLLLQRFR 180
W+ + RYF + R ++EKLEA+V+ +G+AL W+QWE +K P+ WEE +LL+L++FR
Sbjct: 121 GWILKAERYFSLQRFNNEEKLEASVIGFEGDALLWYQWEHKKRPMFLWEEMKLLILKQFR 180
Query: 181 PTLEGTLCDRFMAVKQETTVRDYR 201
T EG+L ++F+A++Q+ TV++YR
Sbjct: 181 STQEGSLHEQFLALRQQGTVKEYR 197
BLAST of Moc06g16550 vs. NCBI nr
Match:
XP_038904464.1 (uncharacterized protein LOC120090832 [Benincasa hispida])
HSP 1 Score: 137.5 bits (345), Expect = 1.3e-28
Identity = 82/223 (36.77%), Postives = 121/223 (54.26%), Query Frame = 0
Query: 5 RMESRVEAVEQQISGVVSAIEDSRESWKKEMALFKDEMRLWM-----TAVSQRM------ 64
+ME+R+ AVE+Q++ + +E+ + +EM + M + + Q+M
Sbjct: 4 KMEARLSAVEEQLTLLQVGMENRMDQRFREMQQITESMMIAKIEALGETLEQKMIRPLEV 63
Query: 65 -------------GMEEKTRDDKGKGLEKIDVGEKTPQSIATSSDGSNQLTNALAPVFDL 124
G EKT D+GK + DVGE+ +FD+
Sbjct: 64 WSKKLEILKEGESGSSEKTVMDRGKQPAREDVGERRE-----------------VLLFDM 123
Query: 125 RLRKLEVPIFE---GENPDAWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEER 184
RLRKLE+PIF+ GE+P W HRV RYF +NRL++ +K+EAA+LCL+GEAL WHQWEE
Sbjct: 124 RLRKLEIPIFKGEIGEDPMGWFHRVERYFVVNRLSEKDKIEAAILCLEGEALEWHQWEEE 183
Query: 185 KNPIQTWEEFRLLLLQRFRPTLEGTLCDRFMAVKQETTVRDYR 201
+ P+ TW +F+ LL RF P E +F+ +KQ+ +VR YR
Sbjct: 184 RTPMNTWAKFKAHLLLRFLPIKEEDRRAQFLTLKQDGSVRAYR 209
BLAST of Moc06g16550 vs. NCBI nr
Match:
XP_024017591.1 (uncharacterized protein LOC112090471 [Morus notabilis])
HSP 1 Score: 133.7 bits (335), Expect = 1.8e-27
Identity = 74/189 (39.15%), Postives = 112/189 (59.26%), Query Frame = 0
Query: 14 EQQISGVVSAIEDSRESWKKEMALFKDEMRLWMTAVSQRMGME--EKTRDDKGKGLEKID 73
E+ + V S ED + + E L + + + M +S++ G E E D G E+
Sbjct: 18 EELLKEVESMREDLGKIPRLEQGL--ELLLIRMDELSKQRGPENRESPETDNGIPAEQTA 77
Query: 74 VGEKTPQSIATSSDGSNQLTNALAPVFDLRLRKLEVPIFEGENPDAWLHRVARYFRINRL 133
+T A +G + + R R++E+P+F+GENPD W+ R RYF +NRL
Sbjct: 78 PTTETDYRPAEWREGGS--------CTEFRTRRVEMPVFDGENPDGWIFRAERYFSLNRL 137
Query: 134 TDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWEEFRLLLLQRFRPTLEGTLCDRFMAVK 193
TD EKL+ AV+ L+GEALAW QWE+R+ ++ W E + +L+RF T EGTLC++F+++
Sbjct: 138 TDREKLDVAVVSLEGEALAWFQWEDRRWAVRDWTELKRKVLERFHSTQEGTLCEKFLSLH 196
Query: 194 QETTVRDYR 201
QETTVR+YR
Sbjct: 198 QETTVREYR 196
BLAST of Moc06g16550 vs. ExPASy TrEMBL
Match:
W9QTX5 (Mediator of RNA polymerase II transcription subunit 25 OS=Morus notabilis OX=981085 GN=L484_013924 PE=3 SV=1)
HSP 1 Score: 138.7 bits (348), Expect = 2.7e-29
Identity = 56/102 (54.90%), Postives = 84/102 (82.35%), Query Frame = 0
Query: 99 DLRLRKLEVPIFEGENPDAWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERK 158
+L R++E+P+F+GENPD W R RYF +N++T+ EKL+ AV+ L+GEALAW QWE+ +
Sbjct: 861 ELCTRRVEMPVFDGENPDGWSIRAERYFAMNKMTEREKLDVAVVSLEGEALAWFQWEDGR 920
Query: 159 NPIQTWEEFRLLLLQRFRPTLEGTLCDRFMAVKQETTVRDYR 201
+PI++W +L+LL+RFRP EG+LC++F++++QETTVRDYR
Sbjct: 921 SPIRSWMVLKLMLLERFRPMQEGSLCEKFLSLRQETTVRDYR 962
BLAST of Moc06g16550 vs. ExPASy TrEMBL
Match:
A0A5C7HSW3 (Chromo domain-containing protein OS=Acer yangbiense OX=1000413 GN=EZV62_014766 PE=4 SV=1)
HSP 1 Score: 137.5 bits (345), Expect = 6.1e-29
Identity = 72/204 (35.29%), Postives = 115/204 (56.37%), Query Frame = 0
Query: 1 MVATRMESRVEAVEQQISGVVSAIEDSRESWKKEMALFKDEMRLWMTAVSQRMGMEEK-- 60
M M+SR++ +E+ + V RE ++E+ KDE+ + M+E+
Sbjct: 1 MTNKNMQSRMDTMEEVVVTV-------REDLQREVGTIKDEIMAMNKRFDHLLQMQEEAA 60
Query: 61 --TRDDKGKGLEKIDVGEKTPQSIATSSDGSNQLTNALAPVFDLRLRKLEVPIFEGENPD 120
GKG E + GS +N D R RKLE+P+F+G NPD
Sbjct: 61 RVAASSSGKGKESTSRVPTVGGPAIGVTHGSEGTSNVFEHRRDFRFRKLEMPVFDGTNPD 120
Query: 121 AWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWEEFRLLLLQRFR 180
W+ + RYF + R ++EKLEA+V+ +G+AL W+QWE +K P+ WEE +LL+L++FR
Sbjct: 121 GWILKAERYFSLQRFNNEEKLEASVIGFEGDALLWYQWEHKKRPMFLWEEMKLLILKQFR 180
Query: 181 PTLEGTLCDRFMAVKQETTVRDYR 201
T EG+L ++F+A++Q+ TV++YR
Sbjct: 181 STQEGSLHEQFLALRQQGTVKEYR 197
BLAST of Moc06g16550 vs. ExPASy TrEMBL
Match:
W9RBJ4 (Retrotrans_gag domain-containing protein OS=Morus notabilis OX=981085 GN=L484_024931 PE=4 SV=1)
HSP 1 Score: 131.3 bits (329), Expect = 4.4e-27
Identity = 54/98 (55.10%), Postives = 77/98 (78.57%), Query Frame = 0
Query: 99 DLRLRKLEVPIFEGENPDAWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERK 158
DL LR+LE+P+FEG+NP+ WL RV RYF +NRLT+++KL AA +C G+ALAW QWE+ +
Sbjct: 98 DLGLRRLEMPLFEGDNPEGWLFRVERYFSVNRLTEEDKLSAAAICFKGDALAWFQWEDGR 157
Query: 159 NPIQTWEEFRLLLLQRFRPTLEGTLCDRFMAVKQETTV 197
NP+++W E + LL RFR + EGT D+F+A++Q+ TV
Sbjct: 158 NPVRSWLELKRKLLDRFRSSQEGTALDKFLAIQQKGTV 195
BLAST of Moc06g16550 vs. ExPASy TrEMBL
Match:
A0A5C7IJS7 (Uncharacterized protein OS=Acer yangbiense OX=1000413 GN=EZV62_004373 PE=4 SV=1)
HSP 1 Score: 130.6 bits (327), Expect = 7.5e-27
Identity = 64/176 (36.36%), Postives = 102/176 (57.95%), Query Frame = 0
Query: 29 ESWKKEMALFKDEMRLWMTAVSQRMGMEEK----TRDDKGKGLEKIDVGEKTPQSIATSS 88
E ++E+ KDE+ + + M+E+ GKG E +
Sbjct: 13 EELQREVGTIKDEIMAMNKRLDHLLQMQEEAARVAASSSGKGKESTSHVPTVGGPTIGVT 72
Query: 89 DGSNQLTNALAPVFDLRLRKLEVPIFEGENPDAWLHRVARYFRINRLTDDEKLEAAVLCL 148
GS +N D R RKLE+P+F+G NPD W+ + YF + R ++EKLEA+V+
Sbjct: 73 HGSEGTSNVFEHRRDFRFRKLEMPVFDGTNPDGWILKAECYFSLQRFNNEEKLEASVIGF 132
Query: 149 DGEALAWHQWEERKNPIQTWEEFRLLLLQRFRPTLEGTLCDRFMAVKQETTVRDYR 201
+G+AL W+QWE +K P+ WEE +LL+L++FR T EG+L ++F+A++Q+ TV++YR
Sbjct: 133 EGDALLWYQWEHKKRPMFLWEEMKLLILKQFRSTQEGSLHEQFLALRQQGTVKEYR 188
BLAST of Moc06g16550 vs. ExPASy TrEMBL
Match:
A0A7J6HNQ9 (Retrotrans_gag domain-containing protein OS=Cannabis sativa OX=3483 GN=G4B88_019159 PE=4 SV=1)
HSP 1 Score: 129.4 bits (324), Expect = 1.7e-26
Identity = 81/220 (36.82%), Postives = 116/220 (52.73%), Query Frame = 0
Query: 2 VATRMESRVEAVEQQISGVVSAIEDSRESWKKEMALFKDE-----------MRLWMT--- 61
V TRMESRV+ VE ++GV SA+ L K RL ++
Sbjct: 3 VITRMESRVKEVEVAVTGVQSAVSGVNNQLNDHGVLLKTHGVELQSLKEVLNRLVISNGR 62
Query: 62 AVSQRMGMEEKTRDDKGKGLEKIDVGEKTPQSIATSSDGSN-------QLTNALAPVFDL 121
V + + E + + G G GE++ + S+G N + T P +
Sbjct: 63 IVDELASLRENSNGNHGSGRRL--QGERSTATGGAGSNGRNVGIGFTGRDTVLGEPRNEF 122
Query: 122 RLRKLEVPIFEGENPDAWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNP 181
+K+E+P+F G+NPD W +R RYF + RL+ E+LEAAVLCL+G AL W +WE +++
Sbjct: 123 HAQKIELPLFSGDNPDKWAYRAERYFGLQRLSPPEQLEAAVLCLEGAALNWFRWENQRHA 182
Query: 182 IQTWEEFRLLLLQRFRPTLEGTLCDRFMAVKQETTVRDYR 201
I +WEE + LLL+RF EGT+ DRF Q TTV+DYR
Sbjct: 183 INSWEELKSLLLRRFCQKAEGTVFDRFFVHHQVTTVQDYR 220
BLAST of Moc06g16550 vs. TAIR 10
Match:
AT1G67020.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: leaf; Has 72 Blast hits to 72 proteins in 9 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 72; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 65.1 bits (157), Expect = 7.4e-11
Identity = 28/76 (36.84%), Postives = 41/76 (53.95%), Query Frame = 0
Query: 102 LRKLEVPIFEGENPDAWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPI 161
+R++E+P+F+G W +V R+FR+ R D +KL+ L L+G AL W E
Sbjct: 107 IRRIEMPVFDGSGVYEWFSKVERFFRVGRYQDSDKLDLVALSLEGVALKWFLREMSTLEF 166
Query: 162 QTWEEFRLLLLQRFRP 178
+ W F LL RF P
Sbjct: 167 RDWNSFEQRLLARFDP 182
BLAST of Moc06g16550 vs. TAIR 10
Match:
AT3G44713.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G14800.1); Has 69 Blast hits to 64 proteins in 24 species: Archae - 2; Bacteria - 10; Metazoa - 4; Fungi - 4; Plants - 36; Viruses - 1; Other Eukaryotes - 12 (source: NCBI BLink). )
HSP 1 Score: 43.5 bits (101), Expect = 2.3e-04
Identity = 22/70 (31.43%), Postives = 35/70 (50.00%), Query Frame = 0
Query: 108 PIFEGENPD--AWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWE 167
P F G + +W+ + +F TDDEK+ A ++GEA AW ++ ++WE
Sbjct: 7 PTFNGVTHELRSWISWLEDFFVWENFTDDEKMNLAQSLIEGEAEAWFYRRQKMILFRSWE 66
Query: 168 EFRLLLLQRF 176
R L+ RF
Sbjct: 67 HLRDCLVLRF 76
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022897442.1 | 2.9e-33 | 39.90 | uncharacterized protein LOC111411108 [Olea europaea var. sylvestris] | [more] |
EXB38291.1 | 5.7e-29 | 54.90 | hypothetical protein L484_013924 [Morus notabilis] | [more] |
TXG60193.1 | 1.3e-28 | 35.29 | hypothetical protein EZV62_014766 [Acer yangbiense] | [more] |
XP_038904464.1 | 1.3e-28 | 36.77 | uncharacterized protein LOC120090832 [Benincasa hispida] | [more] |
XP_024017591.1 | 1.8e-27 | 39.15 | uncharacterized protein LOC112090471 [Morus notabilis] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
W9QTX5 | 2.7e-29 | 54.90 | Mediator of RNA polymerase II transcription subunit 25 OS=Morus notabilis OX=981... | [more] |
A0A5C7HSW3 | 6.1e-29 | 35.29 | Chromo domain-containing protein OS=Acer yangbiense OX=1000413 GN=EZV62_014766 P... | [more] |
W9RBJ4 | 4.4e-27 | 55.10 | Retrotrans_gag domain-containing protein OS=Morus notabilis OX=981085 GN=L484_02... | [more] |
A0A5C7IJS7 | 7.5e-27 | 36.36 | Uncharacterized protein OS=Acer yangbiense OX=1000413 GN=EZV62_004373 PE=4 SV=1 | [more] |
A0A7J6HNQ9 | 1.7e-26 | 36.82 | Retrotrans_gag domain-containing protein OS=Cannabis sativa OX=3483 GN=G4B88_019... | [more] |
Match Name | E-value | Identity | Description | |
AT1G67020.1 | 7.4e-11 | 36.84 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |
AT3G44713.1 | 2.3e-04 | 31.43 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |