Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGAAGTTGTTAAAGATCACGACTACTTGGAGAAGAGAATGGAAGAATTAAAAGACATGAGCGAAAAGCAAGAAGCAATAGCGAATTACTATGACACCAAATTACACACTGTAATTGTGGCTTACTTCATTTGGGAACGTGTCTTCTTCTTCTTCGTCTCCCAGAAAACTAATTCTCCTTCCTCCCTCAGCTGCAATAATGGCAACTTGTGGGTGATTCTTGCTCTAAGCTGTTTATGCAGCCTCGTTTACATGCTTCTTTTTGTCGACGCTGCCCTCATTCTCTACCAAACAGAGCATCAACTCAATCTCATCCTCCAAACACATGCGCAACTTTATCGACAAATCTGGACAATCAAAAGAGAAGCTGCTGACACCAATATAGATCCATCAATAATGGAGGCCGGGGATGATCAGTCTAACCAAGGACTCCGGCCAGAGGAGGAACTCATACTCATAATAAACTCAAATTCAGCATTTAGAAGGAGAGGGCAGAGGAAATTTTATGTGTATACCATTTTCTGTGCTCTGCTTGCTGTTGCTGCTTTAGAATTATATGCCTGCAAGTCTATGCTCTGCAATTGA
mRNA sequence
ATGGAAGTTGTTAAAGATCACGACTACTTGGAGAAGAGAATGGAAGAATTAAAAGACATGAGCGAAAAGCAAGAAGCAATAGCGAATTACTATGACACCAAATTACACACTGTAATTGTGGCTTACTTCATTTGGGAACGTGTCTTCTTCTTCTTCGTCTCCCAGAAAACTAATTCTCCTTCCTCCCTCAGCTGCAATAATGGCAACTTGTGGGTGATTCTTGCTCTAAGCTGTTTATGCAGCCTCGTTTACATGCTTCTTTTTGTCGACGCTGCCCTCATTCTCTACCAAACAGAGCATCAACTCAATCTCATCCTCCAAACACATGCGCAACTTTATCGACAAATCTGGACAATCAAAAGAGAAGCTGCTGACACCAATATAGATCCATCAATAATGGAGGCCGGGGATGATCAGTCTAACCAAGGACTCCGGCCAGAGGAGGAACTCATACTCATAATAAACTCAAATTCAGCATTTAGAAGGAGAGGGCAGAGGAAATTTTATGTGTATACCATTTTCTGTGCTCTGCTTGCTGTTGCTGCTTTAGAATTATATGCCTGCAAGTCTATGCTCTGCAATTGA
Coding sequence (CDS)
ATGGAAGTTGTTAAAGATCACGACTACTTGGAGAAGAGAATGGAAGAATTAAAAGACATGAGCGAAAAGCAAGAAGCAATAGCGAATTACTATGACACCAAATTACACACTGTAATTGTGGCTTACTTCATTTGGGAACGTGTCTTCTTCTTCTTCGTCTCCCAGAAAACTAATTCTCCTTCCTCCCTCAGCTGCAATAATGGCAACTTGTGGGTGATTCTTGCTCTAAGCTGTTTATGCAGCCTCGTTTACATGCTTCTTTTTGTCGACGCTGCCCTCATTCTCTACCAAACAGAGCATCAACTCAATCTCATCCTCCAAACACATGCGCAACTTTATCGACAAATCTGGACAATCAAAAGAGAAGCTGCTGACACCAATATAGATCCATCAATAATGGAGGCCGGGGATGATCAGTCTAACCAAGGACTCCGGCCAGAGGAGGAACTCATACTCATAATAAACTCAAATTCAGCATTTAGAAGGAGAGGGCAGAGGAAATTTTATGTGTATACCATTTTCTGTGCTCTGCTTGCTGTTGCTGCTTTAGAATTATATGCCTGCAAGTCTATGCTCTGCAATTGA
Protein sequence
MEVVKDHDYLEKRMEELKDMSEKQEAIANYYDTKLHTVIVAYFIWERVFFFFVSQKTNSPSSLSCNNGNLWVILALSCLCSLVYMLLFVDAALILYQTEHQLNLILQTHAQLYRQIWTIKREAADTNIDPSIMEAGDDQSNQGLRPEEELILIINSNSAFRRRGQRKFYVYTIFCALLAVAALELYACKSMLCN
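The protein sequence above can be cross-checked against the coding sequence by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and are not part of the database page.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position. "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the CDS listed above.
print(translate("ATGGAAGTTGTTAAAGATCACGACTAC"))  # prints MEVVKDHDY
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence is a quick consistency check on the annotation.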
Homology
BLAST of HG10008934 vs. NCBI nr
Match:
KAG6579352.1 (hypothetical protein SDJN03_23800, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 209.5 bits (532), Expect = 2.5e-50
Identity = 122/188 (64.89%), Positives = 148/188 (78.72%), Query Frame = 0
Query: 10 LEKRMEELKDMSEKQEAIANYYDTKLHTVIVAYFIWERVFFFFVSQKT--NSPSSLSCNN 69
L+K MEE++ M EK EA NYYDTKLHT+IVAYF+WERVFFF +S+KT ++ S+LSC N
Sbjct: 6 LKKMMEEVQGMGEKVEAKVNYYDTKLHTIIVAYFVWERVFFFGISKKTFPSTISTLSC-N 65
Query: 70 GNLWVILALSCLCSLVYMLLFVDAALILYQTEHQLNLILQTHAQLYRQIWTIKREAADTN 129
GN WVILALSC CS VY+LLF DAAL+LY+ E+QL+LILQ HAQL R + IK E ADT
Sbjct: 66 GNWWVILALSCSCSFVYVLLFFDAALMLYRHENQLHLILQKHAQLCRHLLAIKEEQADTK 125
Query: 130 IDPSIMEAGDDQSNQGLRPEEELILIINSNSAFRRR--GQRKFYVYTIFCALLAVAALEL 189
S+MEAG DQ++ GL EEEL+L INS S FRRR +RK +VYTIFCA + VA+LEL
Sbjct: 126 --ASLMEAG-DQASHGLSLEEELML-INSTSPFRRRRPWERKVHVYTIFCAFIGVASLEL 185
Query: 190 YACKSMLC 194
YACK++LC
Sbjct: 186 YACKAVLC 188
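The percentages in an HSP header are the matched positions divided by the alignment length, so 122/188 gives 64.89% identity. A minimal sketch for recomputing them from a header line follows; the function name `hsp_percentages` is hypothetical, and the regular expression simply picks up every `label = n/m` pair on the line.

```python
import re


def hsp_percentages(header):
    """Return {label: percent} for each 'label = matches/length' pair."""
    stats = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", header):
        stats[label] = round(100 * int(matches) / int(length), 2)
    return stats


line = "Identity = 122/188 (64.89%), Positives = 148/188 (78.72%), Query Frame = 0"
print(hsp_percentages(line))
```

The `Query Frame = 0` field is skipped because it carries no `n/m` fraction.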
BLAST of HG10008934 vs. NCBI nr
Match:
KAG6579344.1 (hypothetical protein SDJN03_23792, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 204.1 bits (518), Expect = 1.1e-48
Identity = 121/195 (62.05%), Positives = 145/195 (74.36%), Query Frame = 0
Query: 8 DYLEKRMEELKDMSEKQEAIANYYDTKLHTVIVAYFIWERVFFFFV-------SQKTNSP 67
+ +EKRMEE+K M EK E NYYDTKLH +IVAY +WERVFFFF + ++
Sbjct: 5 EQMEKRMEEVKGMWEKVEGRVNYYDTKLHAIIVAYLVWERVFFFFFFFFGVSNTNFASNI 64
Query: 68 SSLSCNNGNLWVILALSCLCSLVYMLLFVDAALILYQTEHQLNLILQTHAQLYRQIWTIK 127
S+LSC NG WVI+ALSCLCS VYMLLFVDAAL+LY ++QLNLILQTH QLYRQ+ IK
Sbjct: 65 STLSC-NGKWWVIVALSCLCSFVYMLLFVDAALMLYPHQNQLNLILQTHHQLYRQLLAIK 124
Query: 128 REAADTNIDPSIMEAGDDQSNQGLRPEEELILIINSNSAFRRR-GQRKFYVYTIFCALLA 187
S+MEAGD+ S+ GL EEEL+L INSN+A+RRR RKFYVYTIFCAL+
Sbjct: 125 ---------DSLMEAGDEASH-GLSLEEELML-INSNAAYRRRPWGRKFYVYTIFCALID 184
Query: 188 VAALELYACKSMLCN 195
VA+LELYAC S+LC+
Sbjct: 185 VASLELYACNSVLCS 187
BLAST of HG10008934 vs. NCBI nr
Match:
KAG6579345.1 (hypothetical protein SDJN03_23793, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 203.4 bits (516), Expect = 1.8e-48
Identity = 122/197 (61.93%), Positives = 148/197 (75.13%), Query Frame = 0
Query: 1 MEVVKDHDYLEKRMEELKDMSEKQEAIANYYDTKLHTVIVAYFIWERVFFFFVSQKTNSP 60
M ++ + L+K MEE++ M E EA NYYDTKLH +IVAYF+WERVFFF +S+KT P
Sbjct: 1 MTKMEVEEELKKMMEEVQGMGENVEAKVNYYDTKLHNIIVAYFVWERVFFFGISKKTFPP 60
Query: 61 --SSLSCNNGNLWVILALSCLCSLVYMLLFVDAALILYQTEHQLNLILQTHAQLYRQIWT 120
S+LSC NGN WVILALS CS VY+LLF D AL+LY+ E+QL+LILQ HAQL R +
Sbjct: 61 TISTLSC-NGNWWVILALSSSCSFVYVLLFFDTALMLYRHENQLHLILQKHAQLCRHLLA 120
Query: 121 IKREAADTNIDPSIMEAGDDQSNQGLRPEEELILIINSNSAFRRR--GQRKFYVYTIFCA 180
IK E AD I S+MEAGD+ S+ GL EEEL+L INS S FRRR +RK YVYTIFCA
Sbjct: 121 IKEEQAD--IKASLMEAGDEASH-GLSLEEELML-INSTSPFRRRRPWERKVYVYTIFCA 180
Query: 181 LLAVAALELYACKSMLC 194
L+ VA+LELYACK++LC
Sbjct: 181 LIGVASLELYACKAVLC 192
BLAST of HG10008934 vs. NCBI nr
Match:
KAE8650729.1 (hypothetical protein Csa_023412 [Cucumis sativus])
HSP 1 Score: 199.1 bits (505), Expect = 3.4e-47
Identity = 120/197 (60.91%), Positives = 145/197 (73.60%), Query Frame = 0
Query: 1 MEVVKDHDYLEKRMEELKDMSEKQEAIANYYDTKLHTVIVAYFIWERVFFFFVSQKTNSP 60
M + + + LEKRMEELK MSEKQEA ANYYDTKLHT+I AYFIWER F F +S KTNSP
Sbjct: 1 MTTIMELEELEKRMEELKAMSEKQEATANYYDTKLHTIIAAYFIWERAFCFAISNKTNSP 60
Query: 61 ---SSLSCNNGNLWVILALSCLCSLVYMLLFVDAALILYQTEHQLNLILQTHAQLYRQIW 120
SSL C + N +ILALS L SLVY+LL++DAAL+LY++E + NLIL HAQLY QI
Sbjct: 61 NYFSSLIC-HANWRLILALSSLYSLVYILLYLDAALMLYRSELKQNLILNKHAQLYHQIS 120
Query: 121 TIKREAADTNIDPSIMEAGDDQSNQGLRPEEELILIINSNSAFRRRGQRKFYVYTIFCAL 180
IK+E +ID S MEA EE+LIL+INS+S FRR +R FY+ TIFCAL
Sbjct: 121 KIKQEF--NSIDSSSMEA-----------EEDLILLINSSSTFRRSEERIFYMSTIFCAL 180
Query: 181 LAVAALELYACKSMLCN 195
+ VA+LELYACKS+LC+
Sbjct: 181 VCVASLELYACKSILCS 183
BLAST of HG10008934 vs. NCBI nr
Match:
XP_022157182.1 (uncharacterized protein LOC111023958 [Momordica charantia] >XP_022157183.1 uncharacterized protein LOC111023958 [Momordica charantia])
HSP 1 Score: 118.6 bits (296), Expect = 5.9e-23
Identity = 75/185 (40.54%), Positives = 115/185 (62.16%), Query Frame = 0
Query: 10 LEKRMEELKDMSEKQEAIANYYDTKLHTVIVAYFIWERVFFFFVSQKTNSPSSLSCNNGN 69
LE++ EELKD++EKQE+ YY+TK+ ++ Y I+ R+FFF +SQ + SS +C +
Sbjct: 6 LERKFEELKDINEKQESRVRYYETKVQNIVFGYLIFTRLFFFGISQ---TSSSFNCK--D 65
Query: 70 LWVILALSCLCSLVYMLLFVDAALILYQTEHQLNLILQTHAQLYRQIWTIKREAADTNID 129
WVILALS LCS +Y LLF+DA +L++T++QL++I + +L++QI + + D +
Sbjct: 66 WWVILALSLLCSFIYFLLFLDAVAMLFRTQYQLDIICKELKELFQQI-LVSKNQDDVGLS 125
Query: 130 PSIMEAGDDQSNQGLRPEEELILIINSNSAFRRRGQRKFYVYTIFCALLAVAALELYACK 189
ME G+ E+++++ FR G RK Y+Y ALLAV A+ELY K
Sbjct: 126 ---METGESSGGFEFGFHEKMLML----DHFRIVG-RKVYIYFTVSALLAVTAIELYVSK 176
Query: 190 SMLCN 195
+LCN
Sbjct: 186 YVLCN 176
BLAST of HG10008934 vs. ExPASy TrEMBL
Match:
A0A0A0L822 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G484330 PE=4 SV=1)
HSP 1 Score: 198.4 bits (503), Expect = 2.8e-47
Identity = 119/188 (63.30%), Positives = 141/188 (75.00%), Query Frame = 0
Query: 10 LEKRMEELKDMSEKQEAIANYYDTKLHTVIVAYFIWERVFFFFVSQKTNSP---SSLSCN 69
LEKRMEELK MSEKQEA ANYYDTKLHT+I AYFIWER F F +S KTNSP SSL C
Sbjct: 6 LEKRMEELKAMSEKQEATANYYDTKLHTIIAAYFIWERAFCFAISNKTNSPNYFSSLIC- 65
Query: 70 NGNLWVILALSCLCSLVYMLLFVDAALILYQTEHQLNLILQTHAQLYRQIWTIKREAADT 129
+ N +ILALS L SLVY+LL++DAAL+LY++E + NLIL HAQLY QI IK+E
Sbjct: 66 HANWRLILALSSLYSLVYILLYLDAALMLYRSELKQNLILNKHAQLYHQISKIKQEF--N 125
Query: 130 NIDPSIMEAGDDQSNQGLRPEEELILIINSNSAFRRRGQRKFYVYTIFCALLAVAALELY 189
+ID S MEA EE+LIL+INS+S FRR +R FY+ TIFCAL+ VA+LELY
Sbjct: 126 SIDSSSMEA-----------EEDLILLINSSSTFRRSEERIFYMSTIFCALVCVASLELY 179
Query: 190 ACKSMLCN 195
ACKS+LC+
Sbjct: 186 ACKSILCS 179
BLAST of HG10008934 vs. ExPASy TrEMBL
Match:
A0A6J1DSQ0 (uncharacterized protein LOC111023958 OS=Momordica charantia OX=3673 GN=LOC111023958 PE=4 SV=1)
HSP 1 Score: 118.6 bits (296), Expect = 2.8e-23
Identity = 75/185 (40.54%), Positives = 115/185 (62.16%), Query Frame = 0
Query: 10 LEKRMEELKDMSEKQEAIANYYDTKLHTVIVAYFIWERVFFFFVSQKTNSPSSLSCNNGN 69
LE++ EELKD++EKQE+ YY+TK+ ++ Y I+ R+FFF +SQ + SS +C +
Sbjct: 6 LERKFEELKDINEKQESRVRYYETKVQNIVFGYLIFTRLFFFGISQ---TSSSFNCK--D 65
Query: 70 LWVILALSCLCSLVYMLLFVDAALILYQTEHQLNLILQTHAQLYRQIWTIKREAADTNID 129
WVILALS LCS +Y LLF+DA +L++T++QL++I + +L++QI + + D +
Sbjct: 66 WWVILALSLLCSFIYFLLFLDAVAMLFRTQYQLDIICKELKELFQQI-LVSKNQDDVGLS 125
Query: 130 PSIMEAGDDQSNQGLRPEEELILIINSNSAFRRRGQRKFYVYTIFCALLAVAALELYACK 189
ME G+ E+++++ FR G RK Y+Y ALLAV A+ELY K
Sbjct: 126 ---METGESSGGFEFGFHEKMLML----DHFRIVG-RKVYIYFTVSALLAVTAIELYVSK 176
Query: 190 SMLCN 195
+LCN
Sbjct: 186 YVLCN 176
BLAST of HG10008934 vs. ExPASy TrEMBL
Match:
A0A0A0KUT0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G562290 PE=4 SV=1)
HSP 1 Score: 109.8 bits (273), Expect = 1.3e-20
Identity = 62/185 (33.51%), Positives = 104/185 (56.22%), Query Frame = 0
Query: 10 LEKRMEELKDMSEKQEAIANYYDTKLHTVIVAYFIWERVFFFFVSQKTNSPSSLSCNNGN 69
++K +EEL M++ Q+ Y+T L + +A+ +W R+FFF +SQ + + S L C +
Sbjct: 9 MQKVLEELNAMADTQKGRVENYETSLQNIAIAFLVWLRLFFFSLSQTSPNSSLLHCK--H 68
Query: 70 LWVILALSCLCSLVYMLLFVDAALILYQTEHQLNLILQTHAQLYRQIWTIKREAADTNID 129
W++ AL+C + +Y+LLF+ +L+L +TE QL++I + QL++QIW ++ + I+
Sbjct: 69 WWLLFALTCFSAFLYILLFIHNSLMLSRTERQLHVISRQQIQLHQQIWMLRLQELPQMIE 128
Query: 130 PSIMEAGDDQSNQGLRPEEELILIINSNSAFRRRGQRKFYVYTIFCALLAVAALELYACK 189
P IM+ IIN +RK Y+ I C + + ALELYA +
Sbjct: 129 PIIMDH-----------------IINGEMGRTSTFERKLYINCILCGFIGIVALELYASR 174
Query: 190 SMLCN 195
S+LCN
Sbjct: 189 SLLCN 174
BLAST of HG10008934 vs. ExPASy TrEMBL
Match:
A0A6J1DS87 (uncharacterized protein LOC111023927 OS=Momordica charantia OX=3673 GN=LOC111023927 PE=4 SV=1)
HSP 1 Score: 109.0 bits (271), Expect = 2.3e-20
Identity = 70/185 (37.84%), Positives = 111/185 (60.00%), Query Frame = 0
Query: 10 LEKRMEELKDMSEKQEAIANYYDTKLHTVIVAYFIWERVFFFFVSQKTNSPSSLSCNNGN 69
L++ E LKD+ EKQE+ Y++++ + +AY IW R+FFF +SQ S S L C +
Sbjct: 6 LKRNFEALKDLVEKQESRVQYHESRAQNITMAYLIWGRLFFFAISQ--TSSSLLKCI--D 65
Query: 70 LWVILALSCLCSLVYMLLFVDAALILYQTEHQLNLILQTHAQLYRQIWTIKREAADTNID 129
W++L LS C+ VY L F++A +LY+ +HQ+++I + A++ +QI + + D ++
Sbjct: 66 WWMVLGLSVSCAFVYFLFFLEAVTMLYRVQHQMDIICKEQAEICQQILVARSQLDDVDL- 125
Query: 130 PSIMEAGDDQSNQGLRPEEELILIINSNSAFRRRGQRKFYVYTIFCALLAVAALELYACK 189
MEAGD S+ G + + L+ AF R +RKFY+ ALLAV A+ELYAC
Sbjct: 126 --AMEAGD--SSDGFQFSFHVKLL--EYGAF-RIVERKFYICATVSALLAVTAIELYACS 178
Query: 190 SMLCN 195
+ C+
Sbjct: 186 WLYCD 178
BLAST of HG10008934 vs. ExPASy TrEMBL
Match:
A0A6J1DX74 (uncharacterized protein LOC111023953 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111023953 PE=4 SV=1)
HSP 1 Score: 106.3 bits (264), Expect = 1.5e-19
Identity = 74/185 (40.00%), Positives = 109/185 (58.92%), Query Frame = 0
Query: 10 LEKRMEELKDMSEKQEAIANYYDTKLHTVIVAYFIWERVFFFFVSQKTNSPSSLSCNNGN 69
L ++ ELKD++EKQE+ Y++ K ++ Y I R+FFF +SQ SS C+ +
Sbjct: 6 LRRKFGELKDINEKQESRVRYHEAKFQKIVSGYLILTRLFFFGISQ----TSSSKCH--D 65
Query: 70 LWVILALSCLCSLVYMLLFVDAALILYQTEHQLNLILQTHAQLYRQIWTIKREAADTNID 129
WVIL+LS LCS VY LLF+DAA LYQT+ QL++I + ++ +QI + + D ++
Sbjct: 66 WWVILSLSLLCSFVYFLLFLDAATRLYQTKGQLDMICKELIEVCQQI-LVAQNQDDVDL- 125
Query: 130 PSIMEAGDDQSNQGLRPEEELILIINSNSAFRRRGQRKFYVYTIFCALLAVAALELYACK 189
ME GD E+++++ FR G RK Y+Y CAL+AV A+ELY K
Sbjct: 126 --AMEGGDFSDGFEFGFHEKMLVL----DHFRFVG-RKVYIYFTVCALVAVTAIELYVSK 175
Query: 190 SMLCN 195
+LCN
Sbjct: 186 YLLCN 175
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
KAG6579352.1 | 2.5e-50 | 64.89 | hypothetical protein SDJN03_23800, partial [Cucurbita argyrosperma subsp. sorori...
KAG6579344.1 | 1.1e-48 | 62.05 | hypothetical protein SDJN03_23792, partial [Cucurbita argyrosperma subsp. sorori...
KAG6579345.1 | 1.8e-48 | 61.93 | hypothetical protein SDJN03_23793, partial [Cucurbita argyrosperma subsp. sorori...
KAE8650729.1 | 3.4e-47 | 60.91 | hypothetical protein Csa_023412 [Cucumis sativus]
XP_022157182.1 | 5.9e-23 | 40.54 | uncharacterized protein LOC111023958 [Momordica charantia] >XP_022157183.1 uncha...
Match Name | E-value | Identity (%) | Description
A0A0A0L822 | 2.8e-47 | 63.30 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G484330 PE=4 SV=1
A0A6J1DSQ0 | 2.8e-23 | 40.54 | uncharacterized protein LOC111023958 OS=Momordica charantia OX=3673 GN=LOC111023...
A0A0A0KUT0 | 1.3e-20 | 33.51 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G562290 PE=4 SV=1
A0A6J1DS87 | 2.3e-20 | 37.84 | uncharacterized protein LOC111023927 OS=Momordica charantia OX=3673 GN=LOC111023...
A0A6J1DX74 | 1.5e-19 | 40.00 | uncharacterized protein LOC111023953 isoform X2 OS=Momordica charantia OX=3673 G...