Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCCGACGGCTACTCCAAAATCAAAGCCGCCTGCAAAATCAAATCAAGATCCATAGATTATTCCGATCTTGCATCGCTTCCTCACTCCCTCAAATTCAACGGCAACGGCCCCAATCCGAACCCTCACGAATCCGATCAGAACAAGAAGAAATGGACGCAGCAGGGCTGCTTCCCGGAGGAGGAAGAAGACGAGAATGGCGTCGCCGCGCTGAGTAGGAACAGCTCTGTGTCTTCTTCGACCTCCGGATTATTCCATTCGGCGGTGAAGAGAGCTCTGTCGATGCGGAGATCTTCGTCCGTGGCGGAGAGGTACTGTAGGATTCATGATCAGTTTGCGACGCTTGCATCGCCAATCGATGATGATGATGAAGAAGATGAAGGCGGAGGTTCCAAGGAAGGGCGAAGAACGAGGGGATCTGTGACGACTGTGAGGAAGAAGAAGAAGCACGCAGCAGGGAAGATCGTTAAAGCCTGTAAGAGGTTTTTTGGACTCTAGTTTGTCCATGAGCATTTTTAACAAATGGGTAAAGGTATATATATCCTTTGCTTTGCTCTTTTGTTGCAGACGAAGATCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTTCTGGTTTTTTGGTTCTCAGTGAGTATTTTACGCTGGGAATGTCTGTTTTGATTGTCCTTCTGGTCTGGTTAATCGATTTGGGAGTAATGGCGTAG
mRNA sequence
ATGGCCGACGGCTACTCCAAAATCAAAGCCGCCTGCAAAATCAAATCAAGATCCATAGATTATTCCGATCTTGCATCGCTTCCTCACTCCCTCAAATTCAACGGCAACGGCCCCAATCCGAACCCTCACGAATCCGATCAGAACAAGAAGAAATGGACGCAGCAGGGCTGCTTCCCGGAGGAGGAAGAAGACGAGAATGGCGTCGCCGCGCTGAGTAGGAACAGCTCTGTGTCTTCTTCGACCTCCGGATTATTCCATTCGGCGGTGAAGAGAGCTCTGTCGATGCGGAGATCTTCGTCCGTGGCGGAGAGGTACTGTAGGATTCATGATCAGTTTGCGACGCTTGCATCGCCAATCGATGATGATGATGAAGAAGATGAAGGCGGAGGTTCCAAGGAAGGGCGAAGAACGAGGGGATCTGTGACGACTGTGAGGAAGAAGAAGAAGCACGCAGCAGGGAAGATCGTTAAAGCCTACGAAGATCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTTCTGGTTTTTTGGTTCTCAGTGAGTATTTTACGCTGGGAATGTCTGTTTTGATTGTCCTTCTGGTCTGGTTAATCGATTTGGGAGTAATGGCGTAG
Coding sequence (CDS)
ATGGCCGACGGCTACTCCAAAATCAAAGCCGCCTGCAAAATCAAATCAAGATCCATAGATTATTCCGATCTTGCATCGCTTCCTCACTCCCTCAAATTCAACGGCAACGGCCCCAATCCGAACCCTCACGAATCCGATCAGAACAAGAAGAAATGGACGCAGCAGGGCTGCTTCCCGGAGGAGGAAGAAGACGAGAATGGCGTCGCCGCGCTGAGTAGGAACAGCTCTGTGTCTTCTTCGACCTCCGGATTATTCCATTCGGCGGTGAAGAGAGCTCTGTCGATGCGGAGATCTTCGTCCGTGGCGGAGAGGTACTGTAGGATTCATGATCAGTTTGCGACGCTTGCATCGCCAATCGATGATGATGATGAAGAAGATGAAGGCGGAGGTTCCAAGGAAGGGCGAAGAACGAGGGGATCTGTGACGACTGTGAGGAAGAAGAAGAAGCACGCAGCAGGGAAGATCGTTAAAGCCTACGAAGATCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTTCTGGTTTTTTGGTTCTCAGTGAGTATTTTACGCTGGGAATGTCTGTTTTGATTGTCCTTCTGGTCTGGTTAATCGATTTGGGAGTAATGGCGTAG
Protein sequence
MADGYSKIKAACKIKSRSIDYSDLASLPHSLKFNGNGPNPNPHESDQNKKKWTQQGCFPEEEEDENGVAALSRNSSVSSSTSGLFHSAVKRALSMRRSSSVAERYCRIHDQFATLASPIDDDDEEDEGGGSKEGRRTRGSVTTVRKKKKHAAGKIVKAYEDLSLSLSLSLSLSSGFLVLSEYFTLGMSVLIVLLVWLIDLGVMA
Homology
BLAST of Sgr029859 vs. NCBI nr
Match:
XP_022136741.1 (uncharacterized protein LOC111008371 [Momordica charantia])
HSP 1 Score: 200.3 bits (508), Expect = 1.6e-47
Identity = 126/171 (73.68%), Postives = 134/171 (78.36%), Query Frame = 0
Query: 1 MADGYSKIKAACKIKSRSIDYSDLASLPHSLKFNGNGPNPN--PHESDQNK-KKWTQQGC 60
MADGYSKIKAACK KSRSIDYSDLASLPHSLKF PNPN HESDQN+ + Q C
Sbjct: 1 MADGYSKIKAACKFKSRSIDYSDLASLPHSLKFTAAVPNPNSLAHESDQNRANRARAQSC 60
Query: 61 FPEEEEDENG------VAALSRNSSVSSSTSGLFHSAVKRALSMRRSSSVAERYCRIHDQ 120
PEEEEDE G VAALSRNSSVSSS SGL HSAVKRALSMRRSSSVAERYCRIHDQ
Sbjct: 61 LPEEEEDEYGGGGGVAVAALSRNSSVSSSASGL-HSAVKRALSMRRSSSVAERYCRIHDQ 120
Query: 121 FATLASPIDDDDEEDEGGGSKEGRRTRGSVTTVRKKKKHAAGKIVKAYEDL 163
FATLASPI DDEE G SKE R++ GS VR+KKK+AAGKIV+A + L
Sbjct: 121 FATLASPI--DDEEIGAGDSKESRKSAGS---VRRKKKNAAGKIVRACKRL 165
BLAST of Sgr029859 vs. NCBI nr
Match:
KAG7015455.1 (hypothetical protein SDJN02_23091, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 182.6 bits (462), Expect = 3.5e-42
Identity = 117/177 (66.10%), Postives = 130/177 (73.45%), Query Frame = 0
Query: 1 MADGYSKIKAACKIKSRSIDYSDLASLPHSLKFNGNG--PNPNPHESDQNKKKWTQQGCF 60
MAD YSKIKAACK KSRSIDYSDL SLPHS +FN NPN H+S++NK Q
Sbjct: 1 MADSYSKIKAACKFKSRSIDYSDLTSLPHSPRFNAAAAVSNPNSHDSNKNKTN-RQHSRL 60
Query: 61 PEEEEDEN------------GVAALSRNSSVSSSTSGLFHSAVKRALSMRRSSSVAERYC 120
PEEEE+E AALSRN+SVSSS SG FHSAVKRALSMRRSSSVAERYC
Sbjct: 61 PEEEEEEEEEEDEYGGEVSARAAALSRNNSVSSSVSG-FHSAVKRALSMRRSSSVAERYC 120
Query: 121 RIHDQFATLASPIDDDDEEDEGGGSKEGRRTRGSV-TTVRKKKKHAAGKIVKAYEDL 163
RIHDQFATLASPIDDD+ ED G SKE R+T GSV T +KKKK++AGKIV+A + L
Sbjct: 121 RIHDQFATLASPIDDDEMED--GDSKEERKTGGSVKKTKKKKKKNSAGKIVRACKRL 173
BLAST of Sgr029859 vs. NCBI nr
Match:
XP_022929198.1 (uncharacterized protein LOC111435863 [Cucurbita moschata] >KAG6577366.1 hypothetical protein SDJN03_24940, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 179.5 bits (454), Expect = 2.9e-41
Identity = 116/177 (65.54%), Postives = 129/177 (72.88%), Query Frame = 0
Query: 1 MADGYSKIKAACKIKSRSIDYSDLASLPHSLKFNGNG--PNPNPHESDQNKKKWTQQGCF 60
MAD YSKIKAACK KSRSIDYSDL SLPHS +FN NPN H+S++NK Q
Sbjct: 1 MADSYSKIKAACKFKSRSIDYSDLTSLPHSPRFNAAAAVSNPNSHDSNKNKTN-RQHSRL 60
Query: 61 PEEEEDEN------------GVAALSRNSSVSSSTSGLFHSAVKRALSMRRSSSVAERYC 120
PEEEE+E AALSRN+SVSSS SG FHSAVKRALSMRRSSSVAERYC
Sbjct: 61 PEEEEEEEEEEDEYGGEVSARAAALSRNNSVSSSVSG-FHSAVKRALSMRRSSSVAERYC 120
Query: 121 RIHDQFATLASPIDDDDEEDEGGGSKEGRRTRGSV-TTVRKKKKHAAGKIVKAYEDL 163
RIHDQFATLASPIDDD+ ED G SKE R+T GSV T +KKKK++A KIV+A + L
Sbjct: 121 RIHDQFATLASPIDDDEMED--GDSKEERKTGGSVKKTKKKKKKNSAEKIVRACKRL 173
BLAST of Sgr029859 vs. NCBI nr
Match:
KAG7031359.1 (hypothetical protein SDJN02_05399, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 178.7 bits (452), Expect = 5.0e-41
Identity = 113/171 (66.08%), Postives = 127/171 (74.27%), Query Frame = 0
Query: 1 MADGYSKIKAACKIKSRSIDYSDLASLPHSLKFNGNGPNPNPHESDQNKKKWTQQ---GC 60
MAD YSKI+AACK+KSRS+DYSDL+SLPHSL+F NPN +SDQN+ +++ GC
Sbjct: 1 MADTYSKIEAACKLKSRSVDYSDLSSLPHSLRFAAADSNPNSRDSDQNRTNRSREPPHGC 60
Query: 61 FPEEEEDENGVAA------LSRNSSVSSSTSGLFHSAVKRALSMRRSSSVAERYCRIHDQ 120
PEEEEDE AA L RN SVS+S SG FHSAVKRALSMRRSSSVAERY RIHDQ
Sbjct: 61 LPEEEEDEYRSAATATASTLRRNCSVSASASG-FHSAVKRALSMRRSSSVAERYSRIHDQ 120
Query: 121 FATLASPIDDDDEEDEGGGSKEGRRTRGSVTTVRKKKKHAAGKIVKAYEDL 163
F TLASPIDDD E EG SKEGR GS VRKKK +AAGKIV+A + L
Sbjct: 121 FGTLASPIDDD--EIEGEESKEGRNAGGS---VRKKKTNAAGKIVRACKRL 165
BLAST of Sgr029859 vs. NCBI nr
Match:
KAG6600720.1 (hypothetical protein SDJN03_05953, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 176.4 bits (446), Expect = 2.5e-40
Identity = 112/167 (67.07%), Postives = 124/167 (74.25%), Query Frame = 0
Query: 1 MADGYSKIKAACKIKSRSIDYSDLASLPHSLKFNGNGPNPNPHESDQNKKKWTQQ---GC 60
MAD YSKI+AACK+KSRS DYSDL+SLPHSL+F NPN +SDQN+ +++ GC
Sbjct: 1 MADTYSKIEAACKLKSRSADYSDLSSLPHSLRFAAADSNPNSRDSDQNRTNRSREPPHGC 60
Query: 61 FPEEEEDENGVAA------LSRNSSVSSSTSGLFHSAVKRALSMRRSSSVAERYCRIHDQ 120
PEEEEDE AA L RN SVS+S SG FHSAVKRALSMRRSSSVAERY RIHDQ
Sbjct: 61 LPEEEEDEYRSAATATASTLRRNCSVSASASG-FHSAVKRALSMRRSSSVAERYSRIHDQ 120
Query: 121 FATLASPIDDDDEEDEGGGSKEGRRTRGSVTTVRKKKKHAAGKIVKA 159
F TLASPIDDD E EG SKEGR GS VRKKK +AAGKIV+A
Sbjct: 121 FGTLASPIDDD--EIEGEESKEGRNAGGS---VRKKKTNAAGKIVRA 161
BLAST of Sgr029859 vs. ExPASy TrEMBL
Match:
A0A6J1C4C9 (uncharacterized protein LOC111008371 OS=Momordica charantia OX=3673 GN=LOC111008371 PE=4 SV=1)
HSP 1 Score: 200.3 bits (508), Expect = 7.8e-48
Identity = 126/171 (73.68%), Postives = 134/171 (78.36%), Query Frame = 0
Query: 1 MADGYSKIKAACKIKSRSIDYSDLASLPHSLKFNGNGPNPN--PHESDQNK-KKWTQQGC 60
MADGYSKIKAACK KSRSIDYSDLASLPHSLKF PNPN HESDQN+ + Q C
Sbjct: 1 MADGYSKIKAACKFKSRSIDYSDLASLPHSLKFTAAVPNPNSLAHESDQNRANRARAQSC 60
Query: 61 FPEEEEDENG------VAALSRNSSVSSSTSGLFHSAVKRALSMRRSSSVAERYCRIHDQ 120
PEEEEDE G VAALSRNSSVSSS SGL HSAVKRALSMRRSSSVAERYCRIHDQ
Sbjct: 61 LPEEEEDEYGGGGGVAVAALSRNSSVSSSASGL-HSAVKRALSMRRSSSVAERYCRIHDQ 120
Query: 121 FATLASPIDDDDEEDEGGGSKEGRRTRGSVTTVRKKKKHAAGKIVKAYEDL 163
FATLASPI DDEE G SKE R++ GS VR+KKK+AAGKIV+A + L
Sbjct: 121 FATLASPI--DDEEIGAGDSKESRKSAGS---VRRKKKNAAGKIVRACKRL 165
BLAST of Sgr029859 vs. ExPASy TrEMBL
Match:
A0A6J1ERE9 (uncharacterized protein LOC111435863 OS=Cucurbita moschata OX=3662 GN=LOC111435863 PE=4 SV=1)
HSP 1 Score: 179.5 bits (454), Expect = 1.4e-41
Identity = 116/177 (65.54%), Postives = 129/177 (72.88%), Query Frame = 0
Query: 1 MADGYSKIKAACKIKSRSIDYSDLASLPHSLKFNGNG--PNPNPHESDQNKKKWTQQGCF 60
MAD YSKIKAACK KSRSIDYSDL SLPHS +FN NPN H+S++NK Q
Sbjct: 1 MADSYSKIKAACKFKSRSIDYSDLTSLPHSPRFNAAAAVSNPNSHDSNKNKTN-RQHSRL 60
Query: 61 PEEEEDEN------------GVAALSRNSSVSSSTSGLFHSAVKRALSMRRSSSVAERYC 120
PEEEE+E AALSRN+SVSSS SG FHSAVKRALSMRRSSSVAERYC
Sbjct: 61 PEEEEEEEEEEDEYGGEVSARAAALSRNNSVSSSVSG-FHSAVKRALSMRRSSSVAERYC 120
Query: 121 RIHDQFATLASPIDDDDEEDEGGGSKEGRRTRGSV-TTVRKKKKHAAGKIVKAYEDL 163
RIHDQFATLASPIDDD+ ED G SKE R+T GSV T +KKKK++A KIV+A + L
Sbjct: 121 RIHDQFATLASPIDDDEMED--GDSKEERKTGGSVKKTKKKKKKNSAEKIVRACKRL 173
BLAST of Sgr029859 vs. ExPASy TrEMBL
Match:
A0A5A7V402 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold18G00790 PE=4 SV=1)
HSP 1 Score: 161.0 bits (406), Expect = 5.3e-36
Identity = 108/173 (62.43%), Postives = 120/173 (69.36%), Query Frame = 0
Query: 1 MADGYSKIKAACKIKSRSIDYSDLASLPHSLKFNGNG-PNPNPHESDQNKKKWTQQ---G 60
MADGYSKIKAACK KSRSIDYSDL+SLPHSL FN NP H S+ ++ Q+
Sbjct: 1 MADGYSKIKAACKFKSRSIDYSDLSSLPHSLTFNNAAVSNPISHHSNHSRTNRPQEPPHR 60
Query: 61 CFPEEEEDEN-----------GVAALSRNSSVSSSTSGLFHSAVKRALSMRRSSSVAERY 120
PEE+E+ N A LSRNSSVSSS SG F SAVKRALSMRRSSSVAERY
Sbjct: 61 RLPEEDEEVNDDECDRPRGGTAAATLSRNSSVSSSVSG-FQSAVKRALSMRRSSSVAERY 120
Query: 121 CRIHDQFATLASPIDDDDEEDEGGGSKEGRRTRGSVTTVRKKKKHAAGKIVKA 159
CRIHDQFAT ASPI+DD E EGG SKE + GS +KKK +AAGKIV+A
Sbjct: 121 CRIHDQFATFASPIEDD--ELEGGDSKERGKLGGSAIK-KKKKTNAAGKIVRA 169
BLAST of Sgr029859 vs. ExPASy TrEMBL
Match:
A0A0A0L2N9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G110030 PE=4 SV=1)
HSP 1 Score: 155.2 bits (391), Expect = 2.9e-34
Identity = 103/170 (60.59%), Postives = 115/170 (67.65%), Query Frame = 0
Query: 1 MADGYSKIKAACKIKSRSIDYSDLASLPHSLKFNGNGPNPNPHESDQNKKKWTQQGCFPE 60
MADGYSKIKAA K KSRSIDYSDL+SLPHSL F+ NP + N+ + G PE
Sbjct: 1 MADGYSKIKAASKFKSRSIDYSDLSSLPHSLTFSAAVSNP----TRTNRPQEPPHGRLPE 60
Query: 61 EEEDEN------------GVAALSRNSSVSSSTSGLFHSAVKRALSMRRSSSVAERYCRI 120
E+E+ N A L RNSSVSSS SG F SAVKRALSMRRSSSVAERYCRI
Sbjct: 61 EDEEVNDDECDRPRGATTAAATLCRNSSVSSSVSG-FQSAVKRALSMRRSSSVAERYCRI 120
Query: 121 HDQFATLASPIDDDDEEDEGGGSKEGRRTRGSVTTVRKKKKHAAGKIVKA 159
HDQFAT ASPI+DD E EGG KE + GS +KKKK+AA KIV+A
Sbjct: 121 HDQFATFASPIEDD--EMEGGDWKERGKIGGSEIRKKKKKKNAAEKIVRA 163
BLAST of Sgr029859 vs. ExPASy TrEMBL
Match:
A0A5N6QLE4 (Uncharacterized protein OS=Carpinus fangiana OX=176857 GN=FH972_004482 PE=4 SV=1)
HSP 1 Score: 110.9 bits (276), Expect = 6.2e-21
Identity = 85/163 (52.15%), Postives = 104/163 (63.80%), Query Frame = 0
Query: 3 DGYSKIKAACKIKSRSIDYSDLASLPHSLKFNGNGPNPNPHESDQNKKKWTQQGCFPEEE 62
+GYSK+KA KSRS+D+SD S + K N + P PHE Q K+ ++ E+
Sbjct: 2 NGYSKMKAIDTQKSRSMDFSDALSFSQTKKTNSD-PTSKPHEGSQIKESNAEK----EDG 61
Query: 63 EDENGV---AALSRNSSVSSSTSGLFHSAVKRALSMRRSSSVAERYCRIHDQFATLASPI 122
+ NG A LSR+ SV SSTSG F SAVKRA S++RSSSV+ERYCRI+DQ TLASPI
Sbjct: 62 DGGNGEKFGAVLSRSCSV-SSTSG-FQSAVKRAFSVKRSSSVSERYCRIYDQSVTLASPI 121
Query: 123 DDDDEEDEGGGSKEGRRTRGSVTTVRKKKKHAAGKIVKAYEDL 163
DDDD+E G R TR SV KKKKH GKI+KA + L
Sbjct: 122 DDDDDE----GWDTMRATRRSV----KKKKHKGGKILKACKRL 149
BLAST of Sgr029859 vs. TAIR 10
Match:
AT4G24275.1 (Identified as a screen for stress-responsive genes. )
HSP 1 Score: 45.8 bits (107), Expect = 4.7e-05
Identity = 37/80 (46.25%), Postives = 47/80 (58.75%), Query Frame = 0
Query: 71 LSRNSSVSSSTSGLFHSAVKRALSMRRSSSVAERYCRIHDQFATLASPID-DDDEEDEGG 130
LSRN SVS+S + S +K +SMRRSSSV+ERYCRI+DQ + P+ + +EDE
Sbjct: 39 LSRNRSVSASAQAV-PSPIK--MSMRRSSSVSERYCRIYDQSSATTWPLPFHEGDEDEDD 98
Query: 131 GSKEGRRTRGSVTTVRKKKK 150
KE V KKKK
Sbjct: 99 DDKE---------KVHKKKK 106
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022136741.1 | 1.6e-47 | 73.68 | uncharacterized protein LOC111008371 [Momordica charantia] | [more] |
KAG7015455.1 | 3.5e-42 | 66.10 | hypothetical protein SDJN02_23091, partial [Cucurbita argyrosperma subsp. argyro... | [more] |
XP_022929198.1 | 2.9e-41 | 65.54 | uncharacterized protein LOC111435863 [Cucurbita moschata] >KAG6577366.1 hypothet... | [more] |
KAG7031359.1 | 5.0e-41 | 66.08 | hypothetical protein SDJN02_05399, partial [Cucurbita argyrosperma subsp. argyro... | [more] |
KAG6600720.1 | 2.5e-40 | 67.07 | hypothetical protein SDJN03_05953, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1C4C9 | 7.8e-48 | 73.68 | uncharacterized protein LOC111008371 OS=Momordica charantia OX=3673 GN=LOC111008... | [more] |
A0A6J1ERE9 | 1.4e-41 | 65.54 | uncharacterized protein LOC111435863 OS=Cucurbita moschata OX=3662 GN=LOC1114358... | [more] |
A0A5A7V402 | 5.3e-36 | 62.43 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... | [more] |
A0A0A0L2N9 | 2.9e-34 | 60.59 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G110030 PE=4 SV=1 | [more] |
A0A5N6QLE4 | 6.2e-21 | 52.15 | Uncharacterized protein OS=Carpinus fangiana OX=176857 GN=FH972_004482 PE=4 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
AT4G24275.1 | 4.7e-05 | 46.25 | Identified as a screen for stress-responsive genes. | [more] |