Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGACTCTGTTCTTTAGAGAGGTGGAGAGGAGATTCGAGCAGTTGAATGCCATGGACGAGAAGCTAGAAGCGAGACTCCGTTACTATGAGACCAAAGTTCAAAACATAATCATCGCCTATTTCGTTTGGGCACGCTTTTTCTTCTTCGCCATCTCTCGTTCCTCCCTCACTTGCAATGGTTGGTGGGTAACTTTTGCTCTAAATTCCTTGTGCACCTTTGTTTATTTCGTACTTTTTCTAGATTCCGTCCTCATGTTGTATCGAGCAGAGCACCACCTTGACATGATGTATCAAGAACAGGCAGAATTTTATCGAAAAATTTGTATGACCAGAGAAGAAGCGAACGAGGAGGGTGAAGATGACTCAAGTCATGAAGTGGAGCTTATTCGGGAGATGATACTCACAAATTCAAATTATACCAGATCATCTCGAAGGGGTATGGAGAGAAAAGTTTACATTTACTCTATTTCAGGTGCTTTGGTCGGTGTTGCGGCAACAGAGTTATACGCAT
mRNA sequence
ATGACTCTGTTCTTTAGAGAGGTGGAGAGGAGATTCGAGCAGTTGAATGCCATGGACGAGAAGCTAGAAGCGAGACTCCGTTACTATGAGACCAAAGTTCAAAACATAATCATCGCCTATTTCGTTTGGGCACGCTTTTTCTTCTTCGCCATCTCTCGTTCCTCCCTCACTTGCAATGGTTGGTGGGTAACTTTTGCTCTAAATTCCTTGTGCACCTTTGTTTATTTCGTACTTTTTCTAGATTCCGTCCTCATGTTGTATCGAGCAGAGCACCACCTTGACATGATGTATCAAGAACAGGCAGAATTTTATCGAAAAATTTGTATGACCAGAGAAGAAGCGAACGAGGAGGGTGAAGATGACTCAAGTCATGAAGTGGAGCTTATTCGGGAGATGATACTCACAAATTCAAATTATACCAGATCATCTCGAAGGGGTATGGAGAGAAAAGTTTACATTTACTCTATTTCAGGTGCTTTGGTCGGTGTTGCGGCAACAGAGTTATACGCAT
Coding sequence (CDS)
ATGACTCTGTTCTTTAGAGAGGTGGAGAGGAGATTCGAGCAGTTGAATGCCATGGACGAGAAGCTAGAAGCGAGACTCCGTTACTATGAGACCAAAGTTCAAAACATAATCATCGCCTATTTCGTTTGGGCACGCTTTTTCTTCTTCGCCATCTCTCGTTCCTCCCTCACTTGCAATGGTTGGTGGGTAACTTTTGCTCTAAATTCCTTGTGCACCTTTGTTTATTTCGTACTTTTTCTAGATTCCGTCCTCATGTTGTATCGAGCAGAGCACCACCTTGACATGATGTATCAAGAACAGGCAGAATTTTATCGAAAAATTTGTATGACCAGAGAAGAAGCGAACGAGGAGGGTGAAGATGACTCAAGTCATGAAGTGGAGCTTATTCGGGAGATGATACTCACAAATTCAAATTATACCAGATCATCTCGAAGGGGTATGGAGAGAAAAGTTTACATTTACTCTATTTCAGGTGCTTTGGTCGGTGTTGCGGCAACAGAGTTATACGCAT
Protein sequence
MTLFFREVERRFEQLNAMDEKLEARLRYYETKVQNIIIAYFVWARFFFFAISRSSLTCNGWWVTFALNSLCTFVYFVLFLDSVLMLYRAEHHLDMMYQEQAEFYRKICMTREEANEEGEDDSSHEVELIREMILTNSNYTRSSRRGMERKVYIYSISGALVGVAATELYAX
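The protein sequence above can be related to the CDS by codon-by-codon translation. The following is a minimal sketch, not the database's own pipeline. It assumes the standard genetic code, and it assumes that the trailing `X` in the protein stands for the incomplete final codon (the CDS ends in a single leftover base). The function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0 (hypothetical helper).

    Stop codons are written as '*'. Incomplete or unrecognised codons
    are written as 'X' (an assumption made to mirror the trailing 'X'
    of the listed protein).
    """
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3]
        residues.append(CODON_TABLE.get(codon, "X"))
    return "".join(residues)


# First eight codons of the CDS listed above.
print(translate("ATGACTCTGTTCTTTAGAGAGGTG"))  # MTLFFREV
```

Applied to the last 13 bases of the CDS (`GAGTTATACGCAT`), the same function gives `ELYAX`, which matches the end of the listed protein under the assumptions stated.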
Homology
BLAST of Sgr021188 vs. NCBI nr
Match:
KAG6579345.1 (hypothetical protein SDJN03_23793, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 142.9 bits (359), Expect = 2.6e-30
Identity = 81/178 (45.51%), Positives = 116/178 (65.17%), Query Frame = 0
Query: 7 EVERRFEQLNAMDEKLEARLRYYETKVQNIIIAYFVWARFFFFAISR-------SSLTCN 66
E+++ E++ M E +EA++ YY+TK+ NII+AYFVW R FFF IS+ S+L+CN
Sbjct: 9 ELKKMMEEVQGMGENVEAKVNYYDTKLHNIIVAYFVWERVFFFGISKKTFPPTISTLSCN 68
Query: 67 G-WWVTFALNSLCTFVYFVLFLDSVLMLYRAEHHLDMMYQEQAEFYRKICMTREEANE-- 126
G WWV AL+S C+FVY +LF D+ LMLYR E+ L ++ Q+ A+ R + +EE +
Sbjct: 69 GNWWVILALSSSCSFVYVLLFFDTALMLYRHENQLHLILQKHAQLCRHLLAIKEEQADIK 128
Query: 127 ----EGEDDSSHEVELIREMILTNSNYTRSSRRGMERKVYIYSISGALVGVAATELYA 171
E D++SH + L E++L NS RR ERKVY+Y+I AL+GVA+ ELYA
Sbjct: 129 ASLMEAGDEASHGLSLEEELMLINSTSPFRRRRPWERKVYVYTIFCALIGVASLELYA 186
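The identity and positives figures on each HSP statistics line are counts over the alignment length, with the percentage in parentheses. The sketch below recomputes both percentages from such a line. It is an illustration only: the function name `hsp_percentages` and the regular expression are assumptions, and the pattern deliberately tolerates spelling variants of the "Positives" label.

```python
import re

# Matches "Identity = a/b ... Positives = c/d"; the label for positives
# is matched loosely (assumption) so that variant spellings also parse.
STATS_RE = re.compile(r"Identity = (\d+)/(\d+).*?Pos\w*ves = (\d+)/(\d+)")


def hsp_percentages(line: str) -> tuple[float, float]:
    """Return (percent identity, percent positives), rounded to 2 places."""
    match = STATS_RE.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, pos, pos_len = (int(g) for g in match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


line = "Identity = 81/178 (45.51%), Positives = 116/178 (65.17%), Query Frame = 0"
print(hsp_percentages(line))  # (45.51, 65.17)
```

Note that the denominator (178 here) is the alignment length including gaps, not the length of the query protein.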
BLAST of Sgr021188 vs. NCBI nr
Match:
KAG6579352.1 (hypothetical protein SDJN03_23800, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 136.3 bits (342), Expect = 2.4e-28
Identity = 78/178 (43.82%), Positives = 113/178 (63.48%), Query Frame = 0
Query: 7 EVERRFEQLNAMDEKLEARLRYYETKVQNIIIAYFVWARFFFFAISR-------SSLTCN 66
E+++ E++ M EK+EA++ YY+TK+ II+AYFVW R FFF IS+ S+L+CN
Sbjct: 5 ELKKMMEEVQGMGEKVEAKVNYYDTKLHTIIVAYFVWERVFFFGISKKTFPSTISTLSCN 64
Query: 67 G-WWVTFALNSLCTFVYFVLFLDSVLMLYRAEHHLDMMYQEQAEFYRKICMTREEANE-- 126
G WWV AL+ C+FVY +LF D+ LMLYR E+ L ++ Q+ A+ R + +EE +
Sbjct: 65 GNWWVILALSCSCSFVYVLLFFDAALMLYRHENQLHLILQKHAQLCRHLLAIKEEQADTK 124
Query: 127 ----EGEDDSSHEVELIREMILTNSNYTRSSRRGMERKVYIYSISGALVGVAATELYA 171
E D +SH + L E++L NS RR ERKV++Y+I A +GVA+ ELYA
Sbjct: 125 ASLMEAGDQASHGLSLEEELMLINSTSPFRRRRPWERKVHVYTIFCAFIGVASLELYA 182
BLAST of Sgr021188 vs. NCBI nr
Match:
KAG6579344.1 (hypothetical protein SDJN03_23792, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 127.9 bits (320), Expect = 8.5e-26
Identity = 76/177 (42.94%), Positives = 113/177 (63.84%), Query Frame = 0
Query: 7 EVERRFEQLNAMDEKLEARLRYYETKVQNIIIAYFVWAR-----FFFFAISR-------S 66
++E+R E++ M EK+E R+ YY+TK+ II+AY VW R FFFF +S S
Sbjct: 6 QMEKRMEEVKGMWEKVEGRVNYYDTKLHAIIVAYLVWERVFFFFFFFFGVSNTNFASNIS 65
Query: 67 SLTCNG-WWVTFALNSLCTFVYFVLFLDSVLMLYRAEHHLDMMYQEQAEFYRKICMTREE 126
+L+CNG WWV AL+ LC+FVY +LF+D+ LMLY ++ L+++ Q + YR++ ++
Sbjct: 66 TLSCNGKWWVIVALSCLCSFVYMLLFVDAALMLYPHQNQLNLILQTHHQLYRQLLAIKDS 125
Query: 127 ANEEGEDDSSHEVELIREMILTNSNYTRSSRRGMERKVYIYSISGALVGVAATELYA 171
E G D++SH + L E++L NSN RR RK Y+Y+I AL+ VA+ ELYA
Sbjct: 126 LMEAG-DEASHGLSLEEELMLINSN-AAYRRRPWGRKFYVYTIFCALIDVASLELYA 180
BLAST of Sgr021188 vs. NCBI nr
Match:
XP_022157182.1 (uncharacterized protein LOC111023958 [Momordica charantia] >XP_022157183.1 uncharacterized protein LOC111023958 [Momordica charantia])
HSP 1 Score: 126.3 bits (316), Expect = 2.5e-25
Identity = 72/170 (42.35%), Positives = 108/170 (63.53%), Query Frame = 0
Query: 7 EVERRFEQLNAMDEKLEARLRYYETKVQNIIIAYFVWARFFFFAISR--SSLTCNGWWVT 66
E+ER+FE+L ++EK E+R+RYYETKVQNI+ Y ++ R FFF IS+ SS C WWV
Sbjct: 5 ELERKFEELKDINEKQESRVRYYETKVQNIVFGYLIFTRLFFFGISQTSSSFNCKDWWVI 64
Query: 67 FALNSLCTFVYFVLFLDSVLMLYRAEHHLDMMYQEQAEFYRKICMTREE-----ANEEGE 126
AL+ LC+F+YF+LFLD+V ML+R ++ LD++ +E E +++I +++ + + E GE
Sbjct: 65 LALSLLCSFIYFLLFLDAVAMLFRTQYQLDIICKELKELFQQILVSKNQDDVGLSMETGE 124
Query: 127 DDSSHEVELIREMILTNSNYTRSSRRGMERKVYIYSISGALVGVAATELY 170
E +M++ + R + RKVYIY AL+ V A ELY
Sbjct: 125 SSGGFEFGFHEKMLMLD------HFRIVGRKVYIYFTVSALLAVTAIELY 168
BLAST of Sgr021188 vs. NCBI nr
Match:
XP_022157130.1 (uncharacterized protein LOC111023927 [Momordica charantia])
HSP 1 Score: 113.6 bits (283), Expect = 1.7e-21
Identity = 68/175 (38.86%), Positives = 104/175 (59.43%), Query Frame = 0
Query: 5 FREVERRFEQLNAMDEKLEARLRYYETKVQNIIIAYFVWARFFFFAISRSS---LTCNGW 64
F E++R FE L + EK E+R++Y+E++ QNI +AY +W R FFFAIS++S L C W
Sbjct: 3 FGELKRNFEALKDLVEKQESRVQYHESRAQNITMAYLIWGRLFFFAISQTSSSLLKCIDW 62
Query: 65 WVTFALNSLCTFVYFVLFLDSVLMLYRAEHHLDMMYQEQAEFYRKICMTREE------AN 124
W+ L+ C FVYF+ FL++V MLYR +H +D++ +EQAE ++I + R + A
Sbjct: 63 WMVLGLSVSCAFVYFLFFLEAVTMLYRVQHQMDIICKEQAEICQQILVARSQLDDVDLAM 122
Query: 125 EEGEDDSSHEVELIREMILTNSNYTRSSRRGMERKVYIYSISGALVGVAATELYA 171
E G+ + +++ + R +ERK YI + AL+ V A ELYA
Sbjct: 123 EAGDSSDGFQFSFHVKLL------EYGAFRIVERKFYICATVSALLAVTAIELYA 171
BLAST of Sgr021188 vs. ExPASy TrEMBL
Match:
A0A6J1DSQ0 (uncharacterized protein LOC111023958 OS=Momordica charantia OX=3673 GN=LOC111023958 PE=4 SV=1)
HSP 1 Score: 126.3 bits (316), Expect = 1.2e-25
Identity = 72/170 (42.35%), Positives = 108/170 (63.53%), Query Frame = 0
Query: 7 EVERRFEQLNAMDEKLEARLRYYETKVQNIIIAYFVWARFFFFAISR--SSLTCNGWWVT 66
E+ER+FE+L ++EK E+R+RYYETKVQNI+ Y ++ R FFF IS+ SS C WWV
Sbjct: 5 ELERKFEELKDINEKQESRVRYYETKVQNIVFGYLIFTRLFFFGISQTSSSFNCKDWWVI 64
Query: 67 FALNSLCTFVYFVLFLDSVLMLYRAEHHLDMMYQEQAEFYRKICMTREE-----ANEEGE 126
AL+ LC+F+YF+LFLD+V ML+R ++ LD++ +E E +++I +++ + + E GE
Sbjct: 65 LALSLLCSFIYFLLFLDAVAMLFRTQYQLDIICKELKELFQQILVSKNQDDVGLSMETGE 124
Query: 127 DDSSHEVELIREMILTNSNYTRSSRRGMERKVYIYSISGALVGVAATELY 170
E +M++ + R + RKVYIY AL+ V A ELY
Sbjct: 125 SSGGFEFGFHEKMLMLD------HFRIVGRKVYIYFTVSALLAVTAIELY 168
BLAST of Sgr021188 vs. ExPASy TrEMBL
Match:
A0A6J1DS87 (uncharacterized protein LOC111023927 OS=Momordica charantia OX=3673 GN=LOC111023927 PE=4 SV=1)
HSP 1 Score: 113.6 bits (283), Expect = 8.1e-22
Identity = 68/175 (38.86%), Positives = 104/175 (59.43%), Query Frame = 0
Query: 5 FREVERRFEQLNAMDEKLEARLRYYETKVQNIIIAYFVWARFFFFAISRSS---LTCNGW 64
F E++R FE L + EK E+R++Y+E++ QNI +AY +W R FFFAIS++S L C W
Sbjct: 3 FGELKRNFEALKDLVEKQESRVQYHESRAQNITMAYLIWGRLFFFAISQTSSSLLKCIDW 62
Query: 65 WVTFALNSLCTFVYFVLFLDSVLMLYRAEHHLDMMYQEQAEFYRKICMTREE------AN 124
W+ L+ C FVYF+ FL++V MLYR +H +D++ +EQAE ++I + R + A
Sbjct: 63 WMVLGLSVSCAFVYFLFFLEAVTMLYRVQHQMDIICKEQAEICQQILVARSQLDDVDLAM 122
Query: 125 EEGEDDSSHEVELIREMILTNSNYTRSSRRGMERKVYIYSISGALVGVAATELYA 171
E G+ + +++ + R +ERK YI + AL+ V A ELYA
Sbjct: 123 EAGDSSDGFQFSFHVKLL------EYGAFRIVERKFYICATVSALLAVTAIELYA 171
BLAST of Sgr021188 vs. ExPASy TrEMBL
Match:
A0A6J1DTV6 (uncharacterized protein LOC111023951 OS=Momordica charantia OX=3673 GN=LOC111023951 PE=4 SV=1)
HSP 1 Score: 109.0 bits (271), Expect = 2.0e-20
Identity = 68/174 (39.08%), Positives = 106/174 (60.92%), Query Frame = 0
Query: 6 REVERRFEQLNAMDEKLEARLRYYETKVQNIIIAYFVWARFFFFAISRSS--LTCNGWWV 65
R ++R FE+L + EK E LR+YE++ QNI + Y +W RFFFFA+S++S L C W V
Sbjct: 4 RVLKRDFEELKLLAEKQEPILRFYESRAQNITMGYLIWERFFFFALSQTSSPLKCIDWRV 63
Query: 66 TFALNSLCTFVYFVLFLDSVLMLYRAEHHLDMMYQEQAEFYRKICMTREEANEEGEDDSS 125
L+ +C+FVYF+LFL++V MLYR ++ +DM+ +EQ++ IC +A + + S+
Sbjct: 64 VLGLSLVCSFVYFLLFLEAVTMLYRTQNQMDMICKEQSD----ICQQILDAGNQDDGGSA 123
Query: 126 HEVELIREMILTNSNYTRS-------SRRGMERKVYIYSISGALVGVAATELYA 171
E + + + N ++ S R ++KVYI +I +LV V A ELYA
Sbjct: 124 MEAGDLSDGVAFNFSFHMRTTLNYDYSFRIFQKKVYISAIISSLVAVTALELYA 173
BLAST of Sgr021188 vs. ExPASy TrEMBL
Match:
A0A0A0L822 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G484330 PE=4 SV=1)
HSP 1 Score: 106.7 bits (265), Expect = 9.9e-20
Identity = 74/173 (42.77%), Positives = 102/173 (58.96%), Query Frame = 0
Query: 7 EVERRFEQLNAMDEKLEARLRYYETKVQNIIIAYFVWARFFFFAISR--------SSLTC 66
E+E+R E+L AM EK EA YY+TK+ II AYF+W R F FAIS SSL C
Sbjct: 5 ELEKRMEELKAMSEKQEATANYYDTKLHTIIAAYFIWERAFCFAISNKTNSPNYFSSLIC 64
Query: 67 N-GWWVTFALNSLCTFVYFVLFLDSVLMLYRAEHHLDMMYQEQAEFYRKICMTREEANEE 126
+ W + AL+SL + VY +L+LD+ LMLYR+E +++ + A+ Y +I ++E N
Sbjct: 65 HANWRLILALSSLYSLVYILLYLDAALMLYRSELKQNLILNKHAQLYHQISKIKQEFNSI 124
Query: 127 GEDDSSHEVELIREMILTNSNYTRSSRRGMERKVYIYSISGALVGVAATELYA 171
E +LI +L NS+ T RR ER Y+ +I ALV VA+ ELYA
Sbjct: 125 DSSSMEAEEDLI---LLINSSST--FRRSEERIFYMSTIFCALVCVASLELYA 172
BLAST of Sgr021188 vs. ExPASy TrEMBL
Match:
A0A6J1DX74 (uncharacterized protein LOC111023953 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111023953 PE=4 SV=1)
HSP 1 Score: 101.7 bits (252), Expect = 3.2e-18
Identity = 66/169 (39.05%), Positives = 98/169 (57.99%), Query Frame = 0
Query: 7 EVERRFEQLNAMDEKLEARLRYYETKVQNIIIAYFVWARFFFFAISR-SSLTCNGWWVTF 66
E+ R+F +L ++EK E+R+RY+E K Q I+ Y + R FFF IS+ SS C+ WWV
Sbjct: 5 ELRRKFGELKDINEKQESRVRYHEAKFQKIVSGYLILTRLFFFGISQTSSSKCHDWWVIL 64
Query: 67 ALNSLCTFVYFVLFLDSVLMLYRAEHHLDMMYQEQAEFYRKICMTREE-----ANEEGED 126
+L+ LC+FVYF+LFLD+ LY+ + LDM+ +E E ++I + + + A E G+
Sbjct: 65 SLSLLCSFVYFLLFLDAATRLYQTKGQLDMICKELIEVCQQILVAQNQDDVDLAMEGGDF 124
Query: 127 DSSHEVELIREMILTNSNYTRSSRRGMERKVYIYSISGALVGVAATELY 170
E +M++ + R + RKVYIY ALV V A ELY
Sbjct: 125 SDGFEFGFHEKMLVLD------HFRFVGRKVYIYFTVCALVAVTAIELY 167
The following BLAST results are available for this feature:
NCBI nr
Match Name | E-value | Identity (%) | Description
KAG6579345.1 | 2.6e-30 | 45.51 | hypothetical protein SDJN03_23793, partial [Cucurbita argyrosperma subsp. sorori...
KAG6579352.1 | 2.4e-28 | 43.82 | hypothetical protein SDJN03_23800, partial [Cucurbita argyrosperma subsp. sorori...
KAG6579344.1 | 8.5e-26 | 42.94 | hypothetical protein SDJN03_23792, partial [Cucurbita argyrosperma subsp. sorori...
XP_022157182.1 | 2.5e-25 | 42.35 | uncharacterized protein LOC111023958 [Momordica charantia] >XP_022157183.1 uncha...
XP_022157130.1 | 1.7e-21 | 38.86 | uncharacterized protein LOC111023927 [Momordica charantia]
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1DSQ0 | 1.2e-25 | 42.35 | uncharacterized protein LOC111023958 OS=Momordica charantia OX=3673 GN=LOC111023...
A0A6J1DS87 | 8.1e-22 | 38.86 | uncharacterized protein LOC111023927 OS=Momordica charantia OX=3673 GN=LOC111023...
A0A6J1DTV6 | 2.0e-20 | 39.08 | uncharacterized protein LOC111023951 OS=Momordica charantia OX=3673 GN=LOC111023...
A0A0A0L822 | 9.9e-20 | 42.77 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G484330 PE=4 SV=1
A0A6J1DX74 | 3.2e-18 | 39.05 | uncharacterized protein LOC111023953 isoform X2 OS=Momordica charantia OX=3673 G...
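The summary rows above can be filtered and ranked by E-value. The sketch below hard-codes the accession, E-value, and percent identity of each listed hit. The `Hit` type, the `hits_below` helper, and the example cutoff are hypothetical choices for illustration. E-values depend on the size of the database searched, so values from NCBI nr and TrEMBL are not directly comparable: the same alignment scores 2.5e-25 against nr and 1.2e-25 against TrEMBL above. The database is therefore kept as a field rather than being merged away.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    database: str
    accession: str
    evalue: float
    identity: float  # percent identity


# Values copied from the summary rows above.
HITS = [
    Hit("NCBI nr", "KAG6579345.1", 2.6e-30, 45.51),
    Hit("NCBI nr", "KAG6579352.1", 2.4e-28, 43.82),
    Hit("NCBI nr", "KAG6579344.1", 8.5e-26, 42.94),
    Hit("NCBI nr", "XP_022157182.1", 2.5e-25, 42.35),
    Hit("NCBI nr", "XP_022157130.1", 1.7e-21, 38.86),
    Hit("ExPASy TrEMBL", "A0A6J1DSQ0", 1.2e-25, 42.35),
    Hit("ExPASy TrEMBL", "A0A6J1DS87", 8.1e-22, 38.86),
    Hit("ExPASy TrEMBL", "A0A6J1DTV6", 2.0e-20, 39.08),
    Hit("ExPASy TrEMBL", "A0A0A0L822", 9.9e-20, 42.77),
    Hit("ExPASy TrEMBL", "A0A6J1DX74", 3.2e-18, 39.05),
]


def hits_below(hits, max_evalue, database=None):
    """Return hits with E-value <= max_evalue, best (smallest) first.

    If `database` is given, only hits from that database are considered.
    """
    selected = [
        h for h in hits
        if h.evalue <= max_evalue
        and (database is None or h.database == database)
    ]
    return sorted(selected, key=lambda h: h.evalue)


# Example cutoff of 1e-25 (an arbitrary illustrative threshold).
for hit in hits_below(HITS, 1e-25, database="NCBI nr"):
    print(hit.accession, hit.evalue, hit.identity)
```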