Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGCGCATGGGGAGTTGAAGAGGAGGTTTAAGGAGTTGATAGATGTGAATGAGAAGCAAGAAACAAAAGTGTGTTACCATAAAAACAAAGTTCAGAACATTGTTTTTGGCTATCTCATTTGGGTTCGTTTGTTCATCTACGGTATTTCTCAGGCTTTGTCCTTCAAGTGCAATAATTGGTGGGTCATTTTGGCTTTAAGTCTTTTATGGAATTTCATTTACTTTTTACTTTTCCTGGATGCTATGACCATGTTACATCGGGCTCAGTACCAGCTAGACATAATCTGCAAGGAACTAATTGAATTTTGCCAACAAAATTTGATACCCAAAAACCGAGATGATATGGATCTAGTGGAGGCTGGTGAATCTCGTGATGGATTTGAATTCGGTTTCCATAAGAAGATGCTCATGCTTGATCATTCTACAATTGTTGGGAGGAATGTTTATATCTATTTCATTGTCTGTGCTTTGCTTGCTGTTGCTGCGATTGAATTATATGCTTATAAGTACTTGCTATGCAAATGA
mRNA sequence
ATGGCGCATGGGGAGTTGAAGAGGAGGTTTAAGGAGTTGATAGATGTGAATGAGAAGCAAGAAACAAAAGTGTGTTACCATAAAAACAAAGTTCAGAACATTGTTTTTGGCTATCTCATTTGGGTTCGTTTGTTCATCTACGGTATTTCTCAGGCTTTGTCCTTCAAGTGCAATAATTGGTGGGTCATTTTGGCTTTAAGTCTTTTATGGAATTTCATTTACTTTTTACTTTTCCTGGATGCTATGACCATGTTACATCGGGCTCAGTACCAGCTAGACATAATCTGCAAGGAACTAATTGAATTTTGCCAACAAAATTTGATACCCAAAAACCGAGATGATATGGATCTAGTGGAGGCTGGTGAATCTCGTGATGGATTTGAATTCGGTTTCCATAAGAAGATGCTCATGCTTGATCATTCTACAATTGTTGGGAGGAATGTTTATATCTATTTCATTGTCTGTGCTTTGCTTGCTGTTGCTGCGATTGAATTATATGCTTATAAGTACTTGCTATGCAAATGA
Coding sequence (CDS)
ATGGCGCATGGGGAGTTGAAGAGGAGGTTTAAGGAGTTGATAGATGTGAATGAGAAGCAAGAAACAAAAGTGTGTTACCATAAAAACAAAGTTCAGAACATTGTTTTTGGCTATCTCATTTGGGTTCGTTTGTTCATCTACGGTATTTCTCAGGCTTTGTCCTTCAAGTGCAATAATTGGTGGGTCATTTTGGCTTTAAGTCTTTTATGGAATTTCATTTACTTTTTACTTTTCCTGGATGCTATGACCATGTTACATCGGGCTCAGTACCAGCTAGACATAATCTGCAAGGAACTAATTGAATTTTGCCAACAAAATTTGATACCCAAAAACCGAGATGATATGGATCTAGTGGAGGCTGGTGAATCTCGTGATGGATTTGAATTCGGTTTCCATAAGAAGATGCTCATGCTTGATCATTCTACAATTGTTGGGAGGAATGTTTATATCTATTTCATTGTCTGTGCTTTGCTTGCTGTTGCTGCGATTGAATTATATGCTTATAAGTACTTGCTATGCAAATGA
Protein sequence
MAHGELKRRFKELIDVNEKQETKVCYHKNKVQNIVFGYLIWVRLFIYGISQALSFKCNNWWVILALSLLWNFIYFLLFLDAMTMLHRAQYQLDIICKELIEFCQQNLIPKNRDDMDLVEAGESRDGFEFGFHKKMLMLDHSTIVGRNVYIYFIVCALLAVAAIELYAYKYLLCK
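The protein sequence above is the translation of the coding sequence. A minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1); the `translate` helper and the `excerpt` variable are illustrative names, not part of the database:

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the CDS listed above (an excerpt, not the full sequence).
excerpt = "ATGGCGCATGGGGAGTTG"
print(translate(excerpt))  # MAHGEL
```

Passing the full CDS string to `translate` should reproduce the protein sequence shown above, provided the annotation uses the standard code.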
Homology
BLAST of Csor.00g010540 vs. NCBI nr
Match:
KAG6579338.1 (hypothetical protein SDJN03_23786, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 351 bits (900), Expect = 5.87e-122
Identity = 174/174 (100.00%), Positives = 174/174 (100.00%), Query Frame = 0
Query: 1 MAHGELKRRFKELIDVNEKQETKVCYHKNKVQNIVFGYLIWVRLFIYGISQALSFKCNNW 60
MAHGELKRRFKELIDVNEKQETKVCYHKNKVQNIVFGYLIWVRLFIYGISQALSFKCNNW
Sbjct: 1 MAHGELKRRFKELIDVNEKQETKVCYHKNKVQNIVFGYLIWVRLFIYGISQALSFKCNNW 60
Query: 61 WVILALSLLWNFIYFLLFLDAMTMLHRAQYQLDIICKELIEFCQQNLIPKNRDDMDLVEA 120
WVILALSLLWNFIYFLLFLDAMTMLHRAQYQLDIICKELIEFCQQNLIPKNRDDMDLVEA
Sbjct: 61 WVILALSLLWNFIYFLLFLDAMTMLHRAQYQLDIICKELIEFCQQNLIPKNRDDMDLVEA 120
Query: 121 GESRDGFEFGFHKKMLMLDHSTIVGRNVYIYFIVCALLAVAAIELYAYKYLLCK 174
GESRDGFEFGFHKKMLMLDHSTIVGRNVYIYFIVCALLAVAAIELYAYKYLLCK
Sbjct: 121 GESRDGFEFGFHKKMLMLDHSTIVGRNVYIYFIVCALLAVAAIELYAYKYLLCK 174
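The Identity and Positives fields in each HSP summary are counts over the alignment length, followed by the same ratio as a percentage. The alignment length can exceed the 174-residue query when the alignment contains gaps. A small sketch that recomputes the percentages from a summary line; `hsp_percentages` is a hypothetical helper written for illustration, not part of any BLAST tool:

```python
import re


def hsp_percentages(summary: str) -> dict[str, float]:
    """Recompute percentages from 'Name = matches/length' fields of an HSP line."""
    return {
        name: round(100 * int(matches) / int(length), 2)
        for name, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", summary)
    }


# Summary line of the XP_022157182.1 hit reported on this page.
line = "Identity = 123/176 (69.89%), Positives = 143/176 (81.25%), Query Frame = 0"
print(hsp_percentages(line))  # {'Identity': 69.89, 'Positives': 81.25}
```

The `Query Frame = 0` field is skipped because it has no `matches/length` ratio.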
BLAST of Csor.00g010540 vs. NCBI nr
Match:
KAG7016840.1 (hypothetical protein SDJN02_21951, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 341 bits (874), Expect = 1.85e-117
Identity = 170/174 (97.70%), Positives = 170/174 (97.70%), Query Frame = 0
Query: 1 MAHGELKRRFKELIDVNEKQETKVCYHKNKVQNIVFGYLIWVRLFIYGISQALSFKCNNW 60
MAHGELKRR KELIDVNEKQETKVCYHKNKVQNIVFGYLIWVRLF YGISQALSFKCNNW
Sbjct: 36 MAHGELKRRIKELIDVNEKQETKVCYHKNKVQNIVFGYLIWVRLFFYGISQALSFKCNNW 95
Query: 61 WVILALSLLWNFIYFLLFLDAMTMLHRAQYQLDIICKELIEFCQQNLIPKNRDDMDLVEA 120
WVILALSLLWNFIYFLLFLDAMTMLHRAQYQLDIICKELIEFCQQNLIPKNRDDMDLVEA
Sbjct: 96 WVILALSLLWNFIYFLLFLDAMTMLHRAQYQLDIICKELIEFCQQNLIPKNRDDMDLVEA 155
Query: 121 GESRDGFEFGFHKKMLMLDHSTIVGRNVYIYFIVCALLAVAAIELYAYKYLLCK 174
GES D FEFGFHKKMLMLDHSTIVGRNVYIYFIVCALLAVAAIELYAYKYLLCK
Sbjct: 156 GESCDRFEFGFHKKMLMLDHSTIVGRNVYIYFIVCALLAVAAIELYAYKYLLCK 209
BLAST of Csor.00g010540 vs. NCBI nr
Match:
XP_022157182.1 (uncharacterized protein LOC111023958 [Momordica charantia] >XP_022157183.1 uncharacterized protein LOC111023958 [Momordica charantia])
HSP 1 Score: 229 bits (585), Expect = 6.27e-74
Identity = 123/176 (69.89%), Positives = 143/176 (81.25%), Query Frame = 0
Query: 1 MAHGELKRRFKELIDVNEKQETKVCYHKNKVQNIVFGYLIWVRLFIYGISQ-ALSFKCNN 60
MA GEL+R+F+EL D+NEKQE++V Y++ KVQNIVFGYLI+ RLF +GISQ + SF C +
Sbjct: 1 MALGELERKFEELKDINEKQESRVRYYETKVQNIVFGYLIFTRLFFFGISQTSSSFNCKD 60
Query: 61 WWVILALSLLWNFIYFLLFLDAMTMLHRAQYQLDIICKELIEFCQQNLIPKNRDDMDL-V 120
WWVILALSLL +FIYFLLFLDA+ ML R QYQLDIICKEL E QQ L+ KN+DD+ L +
Sbjct: 61 WWVILALSLLCSFIYFLLFLDAVAMLFRTQYQLDIICKELKELFQQILVSKNQDDVGLSM 120
Query: 121 EAGESRDGFEFGFHKKMLMLDHSTIVGRNVYIYFIVCALLAVAAIELYAYKYLLCK 174
E GES GFEFGFH+KMLMLDH IVGR VYIYF V ALLAV AIELY KY+LC
Sbjct: 121 ETGESSGGFEFGFHEKMLMLDHFRIVGRKVYIYFTVSALLAVTAIELYVSKYVLCN 176
BLAST of Csor.00g010540 vs. NCBI nr
Match:
XP_022157176.1 (uncharacterized protein LOC111023953 isoform X2 [Momordica charantia])
HSP 1 Score: 223 bits (569), Expect = 1.65e-71
Identity = 115/175 (65.71%), Positives = 140/175 (80.00%), Query Frame = 0
Query: 1 MAHGELKRRFKELIDVNEKQETKVCYHKNKVQNIVFGYLIWVRLFIYGISQALSFKCNNW 60
MA GEL+R+F EL D+NEKQE++V YH+ K Q IV GYLI RLF +GISQ S KC++W
Sbjct: 1 MAVGELRRKFGELKDINEKQESRVRYHEAKFQKIVSGYLILTRLFFFGISQTSSSKCHDW 60
Query: 61 WVILALSLLWNFIYFLLFLDAMTMLHRAQYQLDIICKELIEFCQQNLIPKNRDDMDL-VE 120
WVIL+LSLL +F+YFLLFLDA T L++ + QLD+ICKELIE CQQ L+ +N+DD+DL +E
Sbjct: 61 WVILSLSLLCSFVYFLLFLDAATRLYQTKGQLDMICKELIEVCQQILVAQNQDDVDLAME 120
Query: 121 AGESRDGFEFGFHKKMLMLDHSTIVGRNVYIYFIVCALLAVAAIELYAYKYLLCK 174
G+ DGFEFGFH+KML+LDH VGR VYIYF VCAL+AV AIELY KYLLC
Sbjct: 121 GGDFSDGFEFGFHEKMLVLDHFRFVGRKVYIYFTVCALVAVTAIELYVSKYLLCN 175
BLAST of Csor.00g010540 vs. NCBI nr
Match:
KAA0042579.1 (WD repeat-containing protein 91-like protein [Cucumis melo var. makuwa] >TYK05983.1 WD repeat-containing protein 91-like protein [Cucumis melo var. makuwa])
HSP 1 Score: 187 bits (476), Expect = 9.03e-57
Identity = 101/174 (58.05%), Positives = 126/174 (72.41%), Query Frame = 0
Query: 4 GELKRRFKELIDVNEKQETKVCYHKNKVQNIVFGYLIWVRLFIYGISQALSFKCNNWWVI 63
G+L+R F L D+N+ QET + Y + K+QN+V GYL W RLF +G+S SFKC +WWVI
Sbjct: 48 GDLRRNFVLLKDINDNQETSLRYCETKLQNVVLGYLSWGRLFFFGVS--FSFKCKDWWVI 107
Query: 64 LALSLLWNFIYFLLFLDAMTMLHRAQYQLDIICKELIEFCQQNLIPKNRDDMDL-VEAGE 123
LAL+L + F YFLLF+DA+ ML R QLDII KEL E CQQ L+ +N+D++ L +EAGE
Sbjct: 108 LALTLFYTFFYFLLFMDAVIMLSRTHDQLDIIRKELAEICQQILVAQNQDNVGLSMEAGE 167
Query: 124 SRDGFEFGFHKKMLMLDHSTIV--GRNVYIYFIVCALLAVAAIELYAYKYLLCK 174
DGFE FH++M MLD +V GR VYIYFIVC LLA+ AIELYA K LLC
Sbjct: 168 DSDGFELSFHERMFMLDQFRVVETGRKVYIYFIVCPLLAITAIELYACKCLLCN 219
BLAST of Csor.00g010540 vs. ExPASy TrEMBL
Match:
A0A6J1DSQ0 (uncharacterized protein LOC111023958 OS=Momordica charantia OX=3673 GN=LOC111023958 PE=4 SV=1)
HSP 1 Score: 229 bits (585), Expect = 3.04e-74
Identity = 123/176 (69.89%), Positives = 143/176 (81.25%), Query Frame = 0
Query: 1 MAHGELKRRFKELIDVNEKQETKVCYHKNKVQNIVFGYLIWVRLFIYGISQ-ALSFKCNN 60
MA GEL+R+F+EL D+NEKQE++V Y++ KVQNIVFGYLI+ RLF +GISQ + SF C +
Sbjct: 1 MALGELERKFEELKDINEKQESRVRYYETKVQNIVFGYLIFTRLFFFGISQTSSSFNCKD 60
Query: 61 WWVILALSLLWNFIYFLLFLDAMTMLHRAQYQLDIICKELIEFCQQNLIPKNRDDMDL-V 120
WWVILALSLL +FIYFLLFLDA+ ML R QYQLDIICKEL E QQ L+ KN+DD+ L +
Sbjct: 61 WWVILALSLLCSFIYFLLFLDAVAMLFRTQYQLDIICKELKELFQQILVSKNQDDVGLSM 120
Query: 121 EAGESRDGFEFGFHKKMLMLDHSTIVGRNVYIYFIVCALLAVAAIELYAYKYLLCK 174
E GES GFEFGFH+KMLMLDH IVGR VYIYF V ALLAV AIELY KY+LC
Sbjct: 121 ETGESSGGFEFGFHEKMLMLDHFRIVGRKVYIYFTVSALLAVTAIELYVSKYVLCN 176
BLAST of Csor.00g010540 vs. ExPASy TrEMBL
Match:
A0A6J1DX74 (uncharacterized protein LOC111023953 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111023953 PE=4 SV=1)
HSP 1 Score: 223 bits (569), Expect = 7.99e-72
Identity = 115/175 (65.71%), Positives = 140/175 (80.00%), Query Frame = 0
Query: 1 MAHGELKRRFKELIDVNEKQETKVCYHKNKVQNIVFGYLIWVRLFIYGISQALSFKCNNW 60
MA GEL+R+F EL D+NEKQE++V YH+ K Q IV GYLI RLF +GISQ S KC++W
Sbjct: 1 MAVGELRRKFGELKDINEKQESRVRYHEAKFQKIVSGYLILTRLFFFGISQTSSSKCHDW 60
Query: 61 WVILALSLLWNFIYFLLFLDAMTMLHRAQYQLDIICKELIEFCQQNLIPKNRDDMDL-VE 120
WVIL+LSLL +F+YFLLFLDA T L++ + QLD+ICKELIE CQQ L+ +N+DD+DL +E
Sbjct: 61 WVILSLSLLCSFVYFLLFLDAATRLYQTKGQLDMICKELIEVCQQILVAQNQDDVDLAME 120
Query: 121 AGESRDGFEFGFHKKMLMLDHSTIVGRNVYIYFIVCALLAVAAIELYAYKYLLCK 174
G+ DGFEFGFH+KML+LDH VGR VYIYF VCAL+AV AIELY KYLLC
Sbjct: 121 GGDFSDGFEFGFHEKMLVLDHFRFVGRKVYIYFTVCALVAVTAIELYVSKYLLCN 175
BLAST of Csor.00g010540 vs. ExPASy TrEMBL
Match:
A0A5A7TMJ1 (WD repeat-containing protein 91-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold376G00790 PE=4 SV=1)
HSP 1 Score: 187 bits (476), Expect = 4.37e-57
Identity = 101/174 (58.05%), Positives = 126/174 (72.41%), Query Frame = 0
Query: 4 GELKRRFKELIDVNEKQETKVCYHKNKVQNIVFGYLIWVRLFIYGISQALSFKCNNWWVI 63
G+L+R F L D+N+ QET + Y + K+QN+V GYL W RLF +G+S SFKC +WWVI
Sbjct: 48 GDLRRNFVLLKDINDNQETSLRYCETKLQNVVLGYLSWGRLFFFGVS--FSFKCKDWWVI 107
Query: 64 LALSLLWNFIYFLLFLDAMTMLHRAQYQLDIICKELIEFCQQNLIPKNRDDMDL-VEAGE 123
LAL+L + F YFLLF+DA+ ML R QLDII KEL E CQQ L+ +N+D++ L +EAGE
Sbjct: 108 LALTLFYTFFYFLLFMDAVIMLSRTHDQLDIIRKELAEICQQILVAQNQDNVGLSMEAGE 167
Query: 124 SRDGFEFGFHKKMLMLDHSTIV--GRNVYIYFIVCALLAVAAIELYAYKYLLCK 174
DGFE FH++M MLD +V GR VYIYFIVC LLA+ AIELYA K LLC
Sbjct: 168 DSDGFELSFHERMFMLDQFRVVETGRKVYIYFIVCPLLAITAIELYACKCLLCN 219
BLAST of Csor.00g010540 vs. ExPASy TrEMBL
Match:
A0A5A7TLI0 (WD repeat-containing protein 91-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold376G00770 PE=4 SV=1)
HSP 1 Score: 184 bits (468), Expect = 2.17e-56
Identity = 106/181 (58.56%), Positives = 133/181 (73.48%), Query Frame = 0
Query: 1 MAHGELKRRFKELIDVNEKQETKVCYHKNKVQNIVFGYLIWVRLFIYGISQ-ALSFKCNN 60
MA G+L R F+E+ + ++QE KVCYH+NKVQN+V GYL++ RL I+G +Q +L FKC +
Sbjct: 1 MASGDL-RSFEEVTVIYKEQEEKVCYHENKVQNLVIGYLVFGRLLIFGFTQTSLPFKCKD 60
Query: 61 WWVILALSLLWNFIYFLLFLDAMTMLHRAQYQLDIICKELIEFCQQNLIPKN-RDDMDL- 120
WWVILAL+L +YF L LDA+TML R +Y+LDII KELIE CQ+ L+ +N RD +DL
Sbjct: 61 WWVILALTLSCTLVYFSLLLDAVTMLCRTEYELDIIRKELIEICQRILVSQNQRDLVDLT 120
Query: 121 ---VEAGESRDGFEFGF--HKKMLMLDHSTIVGRNVYIYFIVCALLAVAAIELYAYKYLL 173
+EA ES DGF+FGF H+KMLMLDH V R V+IYF V ALL V IELY KYLL
Sbjct: 121 QLTMEAEESSDGFDFGFGFHQKMLMLDHFRTVRRKVHIYFTVSALLVVVVIELYVSKYLL 180
BLAST of Csor.00g010540 vs. ExPASy TrEMBL
Match:
A0A0A0KND2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G139800 PE=4 SV=1)
HSP 1 Score: 183 bits (465), Expect = 6.20e-56
Identity = 104/175 (59.43%), Positives = 130/175 (74.29%), Query Frame = 0
Query: 8 RRFKELIDVNEKQETKVCYHKNKVQNIVFGYLIWVRLFIYGISQA-LSFKCNNWWVILAL 67
R F+EL + ++QE +VC+H +KVQN+V GYLI+ RL I+GI+Q L FKC +WWVILAL
Sbjct: 7 RSFQELKVIYKEQEERVCHHDSKVQNLVIGYLIFGRLLIFGIAQTFLPFKCKDWWVILAL 66
Query: 68 SLLWNFIYFLLFLDAMTMLHRAQYQLDIICKELIEFCQQNLIPKNRDDM-DL----VEAG 127
+L IYF L LDA+TML RAQYQLDII +ELIE CQ+ L +N+ ++ DL +EAG
Sbjct: 67 TLSCTLIYFSLLLDAVTMLRRAQYQLDIIREELIEICQRILETQNQKELVDLTQLTMEAG 126
Query: 128 ESRDGFE--FGFHKKMLMLDHSTIVGRNVYIYFIVCALLAVAAIELYAYKYLLCK 174
ES DGF+ FGFHKKMLMLDHS+IV R V++YF V LL V IELY KYL+C
Sbjct: 127 ESNDGFDYNFGFHKKMLMLDHSSIVRRKVHMYFTVSVLLVVIVIELYVSKYLVCN 181
The following BLAST results are available for this feature:
NCBI nr:
Match Name | E-value | Identity (%) | Description
KAG6579338.1 | 5.87e-122 | 100.00 | hypothetical protein SDJN03_23786, partial [Cucurbita argyrosperma subsp. sorori...
KAG7016840.1 | 1.85e-117 | 97.70 | hypothetical protein SDJN02_21951, partial [Cucurbita argyrosperma subsp. argyro...
XP_022157182.1 | 6.27e-74 | 69.89 | uncharacterized protein LOC111023958 [Momordica charantia] >XP_022157183.1 uncha...
XP_022157176.1 | 1.65e-71 | 65.71 | uncharacterized protein LOC111023953 isoform X2 [Momordica charantia]
KAA0042579.1 | 9.03e-57 | 58.05 | WD repeat-containing protein 91-like protein [Cucumis melo var. makuwa] >TYK0598...
ExPASy TrEMBL:
Match Name | E-value | Identity (%) | Description
A0A6J1DSQ0 | 3.04e-74 | 69.89 | uncharacterized protein LOC111023958 OS=Momordica charantia OX=3673 GN=LOC111023...
A0A6J1DX74 | 7.99e-72 | 65.71 | uncharacterized protein LOC111023953 isoform X2 OS=Momordica charantia OX=3673 G...
A0A5A7TMJ1 | 4.37e-57 | 58.05 | WD repeat-containing protein 91-like protein OS=Cucumis melo var. makuwa OX=1194...
A0A5A7TLI0 | 2.17e-56 | 58.56 | WD repeat-containing protein 91-like protein OS=Cucumis melo var. makuwa OX=1194...
A0A0A0KND2 | 6.20e-56 | 59.43 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G139800 PE=4 SV=1
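For downstream filtering, rows of the pipe-delimited summary can be read into records. A minimal sketch, assuming each data row begins with the four fields Match Name, E-value, Identity and Description in that order; `parse_hit_row` is a hypothetical helper name, and any trailing columns are ignored:

```python
def parse_hit_row(row: str) -> dict:
    """Split one pipe-delimited summary row into a typed record."""
    fields = [field.strip() for field in row.split("|")]
    name, evalue, identity, description = fields[:4]  # ignore trailing columns
    return {
        "match": name,
        "evalue": float(evalue),      # e.g. 6.20e-56
        "identity": float(identity),  # percent identity
        "description": description,
    }


row = ("A0A0A0KND2 | 6.20e-56 | 59.43 | Uncharacterized protein "
       "OS=Cucumis sativus OX=3659 GN=Csa_5G139800 PE=4 SV=1")
hit = parse_hit_row(row)
print(hit["match"], hit["evalue"], hit["identity"])
```

Records parsed this way can be sorted by `evalue` or filtered on an identity threshold; header lines must be skipped first, since their E-value field is not numeric.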