Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTGCATGTTTTCTTCTTCTTTTTGCTTCTTGCTATGGTATCAAATATAAAAAGTGTGGAGGCAAAGAGAGATGAAAGTAGTGATCAACTTATTTTGAGTCTCAAAAGTCTTGTTCCCGGAGGGCCAAATCAAGCTTTTCCTCCCGAAACTCCACCTCACAATAATAAATTTACTTTGGATCTCAAGCACCTTGTTCCCGAAGGGCCAAATCAAGCGTACCCACCTGGAGTTCCTCCTCACAGTATTGTTACGAACTCAAAAGCAAAAAATGGTATTGGCGAGCTTACTTTGGGTCCCAAGAGACTTGTTCCAGGAGGGCCAAATCAAGCTTTCCCTCCCGAAACTCCACCTCACAATAGTAAATTTACTTTGGGTCTCAAGTATCTTGTTCCCGAAGGGCCAAATCAAGCGTACCCTCCTGGAGTTCCTCCTCACAATATTGTTACGAATTCAAAAGCAAAAAATGGTATTGGTGAGCTTACTTTAGGTTCCAAGAGACTTGTTCCAGGCGGGCCAAATCAAGCATTTCCCCCTGAAACTCCTCCTCACAGCATTGTTACCAAGTTAATACCTTGA
mRNA sequence
ATGTTGCATGTTTTCTTCTTCTTTTTGCTTCTTGCTATGGTATCAAATATAAAAAGTGTGGAGGCAAAGAGAGATGAAAGTAGTGATCAACTTATTTTGAGTCTCAAAAGTCTTGTTCCCGGAGGGCCAAATCAAGCTTTTCCTCCCGAAACTCCACCTCACAATAATAAATTTACTTTGGATCTCAAGCACCTTGTTCCCGAAGGGCCAAATCAAGCGTACCCACCTGGAGTTCCTCCTCACAGTATTGTTACGAACTCAAAAGCAAAAAATGGTATTGGCGAGCTTACTTTGGGTCCCAAGAGACTTGTTCCAGGAGGGCCAAATCAAGCTTTCCCTCCCGAAACTCCACCTCACAATAGTAAATTTACTTTGGGTCTCAAGTATCTTGTTCCCGAAGGGCCAAATCAAGCGTACCCTCCTGGAGTTCCTCCTCACAATATTGTTACGAATTCAAAAGCAAAAAATGGTATTGGTGAGCTTACTTTAGGTTCCAAGAGACTTGTTCCAGGCGGGCCAAATCAAGCATTTCCCCCTGAAACTCCTCCTCACAGCATTGTTACCAAGTTAATACCTTGA
Coding sequence (CDS)
ATGTTGCATGTTTTCTTCTTCTTTTTGCTTCTTGCTATGGTATCAAATATAAAAAGTGTGGAGGCAAAGAGAGATGAAAGTAGTGATCAACTTATTTTGAGTCTCAAAAGTCTTGTTCCCGGAGGGCCAAATCAAGCTTTTCCTCCCGAAACTCCACCTCACAATAATAAATTTACTTTGGATCTCAAGCACCTTGTTCCCGAAGGGCCAAATCAAGCGTACCCACCTGGAGTTCCTCCTCACAGTATTGTTACGAACTCAAAAGCAAAAAATGGTATTGGCGAGCTTACTTTGGGTCCCAAGAGACTTGTTCCAGGAGGGCCAAATCAAGCTTTCCCTCCCGAAACTCCACCTCACAATAGTAAATTTACTTTGGGTCTCAAGTATCTTGTTCCCGAAGGGCCAAATCAAGCGTACCCTCCTGGAGTTCCTCCTCACAATATTGTTACGAATTCAAAAGCAAAAAATGGTATTGGTGAGCTTACTTTAGGTTCCAAGAGACTTGTTCCAGGCGGGCCAAATCAAGCATTTCCCCCTGAAACTCCTCCTCACAGCATTGTTACCAAGTTAATACCTTGA
Protein sequence
MLHVFFFFLLLAMVSNIKSVEAKRDESSDQLILSLKSLVPGGPNQAFPPETPPHNNKFTLDLKHLVPEGPNQAYPPGVPPHSIVTNSKAKNGIGELTLGPKRLVPGGPNQAFPPETPPHNSKFTLGLKYLVPEGPNQAYPPGVPPHNIVTNSKAKNGIGELTLGSKRLVPGGPNQAFPPETPPHSIVTKLIP
Homology
BLAST of HG10022471 vs. NCBI nr
Match:
XP_022932063.1 (uncharacterized protein LOC111438388 [Cucurbita moschata])
HSP 1 Score: 156.8 bits (395), Expect = 1.9e-34
Identity = 88/169 (52.07%), Postives = 105/169 (62.13%), Query Frame = 0
Query: 33 LSLKSLVPGGPNQAFPPETPPHNNKFTLDLKHLVPEGPNQAYPPGVPPHSIVTNSKAKNG 92
L K LVP GPNQ FPPETP HN L K LVP+GPNQ +PP P H++ + +G
Sbjct: 681 LGSKRLVPDGPNQMFPPETPLHN----LGSKRLVPDGPNQMFPPETPLHNLGSKRLVPDG 740
Query: 93 IGEL--------TLGPKRLVPGGPNQAFPPETPPHNSKFTLGLKYLVPEGPNQAYPPGVP 152
++ LG KRLVP GPNQ FPPETP HN LG K LVP+GPNQ +PP P
Sbjct: 741 PNQMFPPETPLHNLGSKRLVPDGPNQMFPPETPLHN----LGSKRLVPDGPNQMFPPETP 800
Query: 153 PHNIVTNSKAKNGIGELTLGSKRLVPGGPNQAFPPETPPHSIVTK-LIP 193
HN+ + +G LGSKRLVPGGPNQ F PETP H++ +K L+P
Sbjct: 801 LHNLGSKRLVPDG---PNLGSKRLVPGGPNQMFSPETPLHNLGSKRLVP 838
BLAST of HG10022471 vs. NCBI nr
Match:
XP_022159163.1 (proline-rich proteoglycan 2-like [Momordica charantia])
HSP 1 Score: 117.9 bits (294), Expect = 9.9e-23
Identity = 81/194 (41.75%), Postives = 92/194 (47.42%), Query Frame = 0
Query: 4 VFFFFLLLAMVSNIKSVEAKRDESSDQLILSLKSLVPGGPNQAFPPETPPHNNKFTLDLK 63
VF F +A++S+ +VEA+RD + D L L LK VP GPN A PE PP FTL LK
Sbjct: 12 VFLFLSFVAILSDANTVEARRDANGDVLALDLKRRVPMGPNPATSPERPP----FTLGLK 71
Query: 64 HLVPEGPNQAYPPGVPPHSIVTNSKAKNGIGELTLGPKRLVPGGPNQAFPPETPPHNSKF 123
VP PN P PP LTLG KR VP GPN A PE+PP F
Sbjct: 72 RHVPTDPNPTTSPDRPP---------------LTLGLKRRVPTGPNPATSPESPP----F 131
Query: 124 TLGLKYLVPEGPNQAYPPGVPPHNIVTNSKAKNGIG--------ELTLGSKRLVPGGPNQ 183
GLK VP GPN A P PP + G T KR VP GPN
Sbjct: 132 IFGLKRRVPTGPNPATSPERPPFTFSLKRRVPTGPNPATSPERLPFTFSLKRRVPTGPNP 182
Query: 184 AFPPETPPHSIVTK 190
A PE PP ++ K
Sbjct: 192 ATSPERPPFTLGLK 182
BLAST of HG10022471 vs. NCBI nr
Match:
VDC94000.1 (unnamed protein product [Brassica oleracea])
HSP 1 Score: 111.7 bits (278), Expect = 7.1e-21
Identity = 70/162 (43.21%), Postives = 81/162 (50.00%), Query Frame = 0
Query: 35 LKSLVPGGPNQAFPPETPPHNNKFTLDLKHLVPEGPNQAYPPGVPPHSIVTNSKAKNGIG 94
+K LVP GPN P + PH+ +K LVP GPN P PPHSIV
Sbjct: 230 VKRLVPSGPNNETSPSSSPHSIA-DFGVKRLVPSGPNNETSPPSPPHSIV---------- 289
Query: 95 ELTLGPKRLVPGGPNQAFPPETPPHNSKFTLGLKYLVPEGPNQAYPPGVPPHNIVTNSKA 154
G KRLVP GPN P +PPH G+K LVP GPN P PPH I A
Sbjct: 290 --DFGVKRLVPSGPNNETSPPSPPHPIA-DFGVKRLVPSGPNNETSPPSPPHPI-----A 349
Query: 155 KNGIGELTLGSKRLVPGGPNQAFPPETPPHSI----VTKLIP 193
G+ L +G KRLVP GPN P +PPH I V +L+P
Sbjct: 350 DFGVKRLNIGVKRLVPSGPNNETSPPSPPHFIADFGVKRLVP 372
BLAST of HG10022471 vs. NCBI nr
Match:
KAG2315933.1 (hypothetical protein Bca52824_019055 [Brassica carinata])
HSP 1 Score: 110.2 bits (274), Expect = 2.1e-20
Identity = 72/176 (40.91%), Postives = 85/176 (48.30%), Query Frame = 0
Query: 35 LKSLVPGGPNQAFPPETPPHNNKFTLDLKHLVPEGPNQAYPPGVPPHSIVTNSKAKNGIG 94
LK LVP GPN P +PPH+ + + +K LVP GPN P PPHSI
Sbjct: 138 LKRLVPSGPNNETSPPSPPHSME-DIRVKRLVPSGPNNETSPPSPPHSIA---------- 197
Query: 95 ELTLGPKRLVPGGPNQAFPPETPPHNSKFTLGLKYLVPEGPNQAYPPGVPPHNIV---TN 154
G KRLVP GPN P +PPH+ F G+K LVP GPN P PPH I
Sbjct: 198 --EFGLKRLVPSGPNNETSPPSPPHSIAF-FGMKRLVPSGPNNETSPPSPPHFIADFGLK 257
Query: 155 SKAKNGIGELT-----------LGSKRLVPGGPNQAFPPETPPHSI----VTKLIP 193
+G T +G KRLVP GPN P +PPHSI V +L+P
Sbjct: 258 RLVPSGPNNETSPPSPPHSMEDIGVKRLVPSGPNNETSPPSPPHSIADFGVKRLVP 299
BLAST of HG10022471 vs. NCBI nr
Match:
XP_033143265.1 (leucine-rich repeat extensin-like protein 5 [Brassica rapa])
HSP 1 Score: 110.2 bits (274), Expect = 2.1e-20
Identity = 69/162 (42.59%), Postives = 80/162 (49.38%), Query Frame = 0
Query: 35 LKSLVPGGPNQAFPPETPPHNNKFTLDLKHLVPEGPNQAYPPGVPPHSIVTNSKAKNGIG 94
+K LVP GPN P +PPH+ F +K LVP GPN P PPHSI
Sbjct: 79 VKRLVPSGPNNETSPPSPPHSADF--GVKRLVPSGPNNETSPPSPPHSIA---------- 138
Query: 95 ELTLGPKRLVPGGPNQAFPPETPPHNSKFTLGLKYLVPEGPNQAYPPGVPPHNIVTNSKA 154
G KRLVP GPN P +PPH S + G+K LVP GPN P PPH+I
Sbjct: 139 --DYGVKRLVPSGPNNETSPPSPPH-SIASFGVKRLVPSGPNNETSPPSPPHSIA----- 198
Query: 155 KNGIGELTLGSKRLVPGGPNQAFPPETPPHSI----VTKLIP 193
G K LVP GPN P +PPHSI V +L+P
Sbjct: 199 -------DFGVKGLVPSGPNNETSPPSPPHSIADFGVKRLVP 213
BLAST of HG10022471 vs. ExPASy TrEMBL
Match:
A0A6J1F0K6 (uncharacterized protein LOC111438388 OS=Cucurbita moschata OX=3662 GN=LOC111438388 PE=4 SV=1)
HSP 1 Score: 156.8 bits (395), Expect = 9.3e-35
Identity = 88/169 (52.07%), Postives = 105/169 (62.13%), Query Frame = 0
Query: 33 LSLKSLVPGGPNQAFPPETPPHNNKFTLDLKHLVPEGPNQAYPPGVPPHSIVTNSKAKNG 92
L K LVP GPNQ FPPETP HN L K LVP+GPNQ +PP P H++ + +G
Sbjct: 681 LGSKRLVPDGPNQMFPPETPLHN----LGSKRLVPDGPNQMFPPETPLHNLGSKRLVPDG 740
Query: 93 IGEL--------TLGPKRLVPGGPNQAFPPETPPHNSKFTLGLKYLVPEGPNQAYPPGVP 152
++ LG KRLVP GPNQ FPPETP HN LG K LVP+GPNQ +PP P
Sbjct: 741 PNQMFPPETPLHNLGSKRLVPDGPNQMFPPETPLHN----LGSKRLVPDGPNQMFPPETP 800
Query: 153 PHNIVTNSKAKNGIGELTLGSKRLVPGGPNQAFPPETPPHSIVTK-LIP 193
HN+ + +G LGSKRLVPGGPNQ F PETP H++ +K L+P
Sbjct: 801 LHNLGSKRLVPDG---PNLGSKRLVPGGPNQMFSPETPLHNLGSKRLVP 838
BLAST of HG10022471 vs. ExPASy TrEMBL
Match:
A0A6J1DY20 (proline-rich proteoglycan 2-like OS=Momordica charantia OX=3673 GN=LOC111025587 PE=4 SV=1)
HSP 1 Score: 117.9 bits (294), Expect = 4.8e-23
Identity = 81/194 (41.75%), Postives = 92/194 (47.42%), Query Frame = 0
Query: 4 VFFFFLLLAMVSNIKSVEAKRDESSDQLILSLKSLVPGGPNQAFPPETPPHNNKFTLDLK 63
VF F +A++S+ +VEA+RD + D L L LK VP GPN A PE PP FTL LK
Sbjct: 12 VFLFLSFVAILSDANTVEARRDANGDVLALDLKRRVPMGPNPATSPERPP----FTLGLK 71
Query: 64 HLVPEGPNQAYPPGVPPHSIVTNSKAKNGIGELTLGPKRLVPGGPNQAFPPETPPHNSKF 123
VP PN P PP LTLG KR VP GPN A PE+PP F
Sbjct: 72 RHVPTDPNPTTSPDRPP---------------LTLGLKRRVPTGPNPATSPESPP----F 131
Query: 124 TLGLKYLVPEGPNQAYPPGVPPHNIVTNSKAKNGIG--------ELTLGSKRLVPGGPNQ 183
GLK VP GPN A P PP + G T KR VP GPN
Sbjct: 132 IFGLKRRVPTGPNPATSPERPPFTFSLKRRVPTGPNPATSPERLPFTFSLKRRVPTGPNP 182
Query: 184 AFPPETPPHSIVTK 190
A PE PP ++ K
Sbjct: 192 ATSPERPPFTLGLK 182
BLAST of HG10022471 vs. ExPASy TrEMBL
Match:
A0A3P6B7Y9 (Uncharacterized protein OS=Brassica oleracea OX=3712 GN=BOLC3T17342H PE=4 SV=1)
HSP 1 Score: 111.7 bits (278), Expect = 3.4e-21
Identity = 70/162 (43.21%), Postives = 81/162 (50.00%), Query Frame = 0
Query: 35 LKSLVPGGPNQAFPPETPPHNNKFTLDLKHLVPEGPNQAYPPGVPPHSIVTNSKAKNGIG 94
+K LVP GPN P + PH+ +K LVP GPN P PPHSIV
Sbjct: 230 VKRLVPSGPNNETSPSSSPHSIA-DFGVKRLVPSGPNNETSPPSPPHSIV---------- 289
Query: 95 ELTLGPKRLVPGGPNQAFPPETPPHNSKFTLGLKYLVPEGPNQAYPPGVPPHNIVTNSKA 154
G KRLVP GPN P +PPH G+K LVP GPN P PPH I A
Sbjct: 290 --DFGVKRLVPSGPNNETSPPSPPHPIA-DFGVKRLVPSGPNNETSPPSPPHPI-----A 349
Query: 155 KNGIGELTLGSKRLVPGGPNQAFPPETPPHSI----VTKLIP 193
G+ L +G KRLVP GPN P +PPH I V +L+P
Sbjct: 350 DFGVKRLNIGVKRLVPSGPNNETSPPSPPHFIADFGVKRLVP 372
BLAST of HG10022471 vs. ExPASy TrEMBL
Match:
M4D9I0 (Uncharacterized protein OS=Brassica rapa subsp. pekinensis OX=51351 PE=4 SV=1)
HSP 1 Score: 111.3 bits (277), Expect = 4.5e-21
Identity = 68/165 (41.21%), Postives = 83/165 (50.30%), Query Frame = 0
Query: 33 LSLKSLVPGGPNQAFPPETPPHNNKFTLDLKHLVPEGPNQAYPPGVPPHSIVTNSKAKNG 92
+ +K LVP GPN P +PPH + + +K LVP GPN P PPH I + +
Sbjct: 25 IGVKRLVPSGPNNETSPPSPPHFIE-DIGVKRLVPSGPNNETSPPSPPHFIEDETSPPSP 84
Query: 93 IGEL-TLGPKRLVPGGPNQAFPPETPPHNSKFTLGLKYLVPEGPNQAYPPGVPPHNIVTN 152
+ G KRLVP GPN P +PPH S G+K LVP GPN P PPH+I
Sbjct: 85 PHSIANFGVKRLVPSGPNNETSPPSPPH-SIANFGVKRLVPSGPNNETSPPSPPHSIA-- 144
Query: 153 SKAKNGIGELTLGSKRLVPGGPNQAFPPETPPHSI----VTKLIP 193
G KRLVP GPN P +PPHSI V +L+P
Sbjct: 145 ----------NFGVKRLVPSGPNNETSPPSPPHSIADFGVKRLVP 175
BLAST of HG10022471 vs. ExPASy TrEMBL
Match:
A0A398A230 (Uncharacterized protein OS=Brassica campestris OX=3711 GN=BRARA_C03751 PE=4 SV=1)
HSP 1 Score: 108.2 bits (269), Expect = 3.8e-20
Identity = 68/162 (41.98%), Postives = 80/162 (49.38%), Query Frame = 0
Query: 35 LKSLVPGGPNQAFPPETPPHNNKFTLDLKHLVPEGPNQAYPPGVPPHSIVTNSKAKNGIG 94
+K LVP GPN P +PPH + +++K LVP GPN P PPHSI
Sbjct: 120 IKRLVPSGPNNETSPPSPPHFIE-DIEVKRLVPSGPNNETSPPSPPHSIA---------- 179
Query: 95 ELTLGPKRLVPGGPNQAFPPETPPHNSKFTLGLKYLVPEGPNQAYPPGVPPHNIVTNSKA 154
G KRLVP GPN P +PPH S G+K LVP GPN P PPH+I
Sbjct: 180 --DFGVKRLVPSGPNNETSPPSPPH-SIADFGVKRLVPSGPNNEASPPSPPHSIA----- 239
Query: 155 KNGIGELTLGSKRLVPGGPNQAFPPETPPHSI----VTKLIP 193
G KRLVP GPN P +PP SI V +L+P
Sbjct: 240 -------DFGVKRLVPSGPNNETSPPSPPRSIAGFGVKRLVP 255
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1F0K6 | 9.3e-35 | 52.07 | uncharacterized protein LOC111438388 OS=Cucurbita moschata OX=3662 GN=LOC1114383... | [more] |
A0A6J1DY20 | 4.8e-23 | 41.75 | proline-rich proteoglycan 2-like OS=Momordica charantia OX=3673 GN=LOC111025587 ... | [more] |
A0A3P6B7Y9 | 3.4e-21 | 43.21 | Uncharacterized protein OS=Brassica oleracea OX=3712 GN=BOLC3T17342H PE=4 SV=1 | [more] |
M4D9I0 | 4.5e-21 | 41.21 | Uncharacterized protein OS=Brassica rapa subsp. pekinensis OX=51351 PE=4 SV=1 | [more] |
A0A398A230 | 3.8e-20 | 41.98 | Uncharacterized protein OS=Brassica campestris OX=3711 GN=BRARA_C03751 PE=4 SV=1 | [more] |
Match Name | E-value | Identity | Description | |