Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGGAGATTCAATATCCTTATACTTGCAGTAACGTTGTTCTTGGCTCTGATTACTGAAACAGAGGGTCAATGGCGGCAGCCACCGGACGTTGCTCCTCGACCACTCTGCGCCTCCCAGATTGCACTAGCAAACTATGCTTGCGCAATGTTGCCTTACTCCACAGTCCTGCCACCTCCACCTCCATCATCATCACTCTCTGATAACCATGAAAGCCAAGGCACTCCCAGCCACACACACCGGCACGGACACCAGCACAGACACCCGCATGGACACCAGCACATGATCCCGCTATCGCCAATGGAGGAGAATTGTTGCAAGTGGGTGCAGCAGTTGGATAGTGAATGTATATGCGGGCTGTTGTCCCGGTTGCCTGCATTCCTAGAGAGGCCTCAGCATAACTTTTCTGTTACCGTTGGCGGTTCGTGTGATGCTACATACTCGTGCGGAGGAGGAATCAAAATCTAA
mRNA sequence
ATGGGGAGATTCAATATCCTTATACTTGCAGTAACGTTGTTCTTGGCTCTGATTACTGAAACAGAGGGTCAATGGCGGCAGCCACCGGACGTTGCTCCTCGACCACTCTGCGCCTCCCAGATTGCACTAGCAAACTATGCTTGCGCAATGTTGCCTTACTCCACAGTCCTGCCACCTCCACCTCCATCATCATCACTCTCTGATAACCATGAAAGCCAAGGCACTCCCAGCCACACACACCGGCACGGACACCAGCACAGACACCCGCATGGACACCAGCACATGATCCCGCTATCGCCAATGGAGGAGAATTGTTGCAAGTGGGTGCAGCAGTTGGATAGTGAATGTATATGCGGGCTGTTGTCCCGGTTGCCTGCATTCCTAGAGAGGCCTCAGCATAACTTTTCTGTTACCGTTGGCGGTTCGTGTGATGCTACATACTCGTGCGGAGGAGGAATCAAAATCTAA
Coding sequence (CDS)
ATGGGGAGATTCAATATCCTTATACTTGCAGTAACGTTGTTCTTGGCTCTGATTACTGAAACAGAGGGTCAATGGCGGCAGCCACCGGACGTTGCTCCTCGACCACTCTGCGCCTCCCAGATTGCACTAGCAAACTATGCTTGCGCAATGTTGCCTTACTCCACAGTCCTGCCACCTCCACCTCCATCATCATCACTCTCTGATAACCATGAAAGCCAAGGCACTCCCAGCCACACACACCGGCACGGACACCAGCACAGACACCCGCATGGACACCAGCACATGATCCCGCTATCGCCAATGGAGGAGAATTGTTGCAAGTGGGTGCAGCAGTTGGATAGTGAATGTATATGCGGGCTGTTGTCCCGGTTGCCTGCATTCCTAGAGAGGCCTCAGCATAACTTTTCTGTTACCGTTGGCGGTTCGTGTGATGCTACATACTCGTGCGGAGGAGGAATCAAAATCTAA
Protein sequence
MGRFNILILAVTLFLALITETEGQWRQPPDVAPRPLCASQIALANYACAMLPYSTVLPPPPPSSSLSDNHESQGTPSHTHRHGHQHRHPHGHQHMIPLSPMEENCCKWVQQLDSECICGLLSRLPAFLERPQHNFSVTVGGSCDATYSCGGGIKI
Homology
BLAST of HG10022858 vs. NCBI nr
Match:
XP_038899044.1 (uncharacterized protein LOC120086453 [Benincasa hispida])
HSP 1 Score: 290.8 bits (743), Expect = 6.9e-75
Identity = 140/155 (90.32%), Postives = 144/155 (92.90%), Query Frame = 0
Query: 1 MGRFNILILAVTLFLALITETEGQWRQPPDVAPRPLCASQIALANYACAMLPYSTVLPPP 60
MGRF ILI +VTLFLALIT+TEGQW QPPDVAPRPLCASQIALANYACAMLPYSTVLPPP
Sbjct: 1 MGRFKILIFSVTLFLALITDTEGQW-QPPDVAPRPLCASQIALANYACAMLPYSTVLPPP 60
Query: 61 PPSSSLSDNHESQGTPSHTHRHGHQHRHPHGHQHMIPLSPMEENCCKWVQQLDSECICGL 120
PPS SLSDNHESQG+P+ HRH HQH H HGHQ IPLSPMEENCCKWVQQLDSECICGL
Sbjct: 61 PPSLSLSDNHESQGSPNQRHRHTHQHGHHHGHQQKIPLSPMEENCCKWVQQLDSECICGL 120
Query: 121 LSRLPAFLERPQHNFSVTVGGSCDATYSCGGGIKI 156
LSRLPAFLERPQHNFSVTVGGSCDATYSCGGGIKI
Sbjct: 121 LSRLPAFLERPQHNFSVTVGGSCDATYSCGGGIKI 154
BLAST of HG10022858 vs. NCBI nr
Match:
XP_031744964.1 (uncharacterized protein LOC116405195 [Cucumis sativus] >KGN44871.1 hypothetical protein Csa_016586 [Cucumis sativus])
HSP 1 Score: 235.3 bits (599), Expect = 3.4e-58
Identity = 112/151 (74.17%), Postives = 127/151 (84.11%), Query Frame = 0
Query: 1 MGRFNILILAVTLFLALITETEGQWRQPPDVAPRPLCASQIALANYACAMLPYSTVLPPP 60
MGRF ILIL+V LFL LIT++E QW QPP VAPRPLCASQI LANYACA LPY+ + PPP
Sbjct: 1 MGRFKILILSVVLFLTLITDSESQW-QPPGVAPRPLCASQITLANYACATLPYAKLPPPP 60
Query: 61 PPSSSLSDNHESQGTPSHTHRHGHQHRHPHGHQHMIPLSPMEENCCKWVQQLDSECICGL 120
PPSSSLS+N +SQG+P H H HGH HR H H IPL+P+EENCCKWVQQ+DSEC+C L
Sbjct: 61 PPSSSLSNNQDSQGSPVHGHHHGHHHR----HHHKIPLTPIEENCCKWVQQVDSECVCEL 120
Query: 121 LSRLPAFLERPQHNFSVTVGGSCDATYSCGG 152
LSRLPAFL+RP HNFSVT+GGSC+ATY CGG
Sbjct: 121 LSRLPAFLKRPIHNFSVTIGGSCNATYWCGG 146
BLAST of HG10022858 vs. NCBI nr
Match:
XP_023548911.1 (uncharacterized protein LOC111807420 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 221.9 bits (564), Expect = 3.9e-54
Identity = 113/152 (74.34%), Postives = 122/152 (80.26%), Query Frame = 0
Query: 1 MGRFNILILAVTLFLALITETEGQWRQPPDVAPRPLCASQIALANYACAMLPYSTVLPPP 60
MGRF IL+ AVTLFLALI +TE QW QPP VAPRP C SQIALAN ACAMLPYST +
Sbjct: 1 MGRFKILLFAVTLFLALIPDTEAQW-QPPKVAPRPPCPSQIALANLACAMLPYST-MALS 60
Query: 61 PPSSSLSDNHESQGTPSHTHRHGHQHRHPHGHQHMIPLSPMEENCCKWVQQLDSECICGL 120
PPS SLSDNHES G P + H H H H HGHQH I LSPMEENCCKWVQQLDS+C+C L
Sbjct: 61 PPSVSLSDNHESHGPPPN---HRHLHTHWHGHQHQISLSPMEENCCKWVQQLDSQCLCRL 120
Query: 121 LSRLPAFLERPQHNFSVTVGGSCDATYSCGGG 153
LSRLPAFLERPQH S +VGGSC+AT+SCGGG
Sbjct: 121 LSRLPAFLERPQHKISFSVGGSCNATFSCGGG 147
BLAST of HG10022858 vs. NCBI nr
Match:
KAA0057606.1 (hypothetical protein E6C27_scaffold497G00830 [Cucumis melo var. makuwa] >TYK20987.1 hypothetical protein E5676_scaffold328G00120 [Cucumis melo var. makuwa])
HSP 1 Score: 214.9 bits (546), Expect = 4.8e-52
Identity = 103/131 (78.63%), Postives = 112/131 (85.50%), Query Frame = 0
Query: 23 GQWRQPPDVAPRPLCASQIALANYACAMLPYSTVLPPPPPSSSLSDNHESQGTPSHTHRH 82
GQW QPP VAPRPLCASQIALANYACA LPYS +LPPPPPSSSLS NH+SQG+P HTH H
Sbjct: 83 GQW-QPPAVAPRPLCASQIALANYACATLPYSKLLPPPPPSSSLSSNHDSQGSPVHTHHH 142
Query: 83 GHQHRHPHGHQHMIPLSPMEENCCKWVQQLDSECICGLLSRLPAFLERPQHNFSVTVGG- 142
GH HR H H IPLSP+EENCCKWVQQ+DSEC+C LLSRLPAFL+RP HNFSV +GG
Sbjct: 143 GHHHR----HHHKIPLSPIEENCCKWVQQVDSECVCELLSRLPAFLKRPIHNFSVIIGGS 202
Query: 143 -SCDATYSCGG 152
SC+ATYSCGG
Sbjct: 203 LSCNATYSCGG 208
BLAST of HG10022858 vs. NCBI nr
Match:
KAG7014427.1 (hypothetical protein SDJN02_24604, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 195.7 bits (496), Expect = 3.0e-46
Identity = 103/142 (72.54%), Postives = 110/142 (77.46%), Query Frame = 0
Query: 1 MGRFNILILAVTLFLALITETEGQWRQPPDVAPRPLCASQIALANYACAMLPYSTVLPPP 60
M RF ILI AVTLFLALI +TE QW QPP VAPRP C SQIALAN ACAMLPYST +
Sbjct: 1 MERFKILIFAVTLFLALIPDTEAQW-QPPKVAPRPPCPSQIALANLACAMLPYST-MALS 60
Query: 61 PPSSSLSDNHESQGTPSHTHRHGHQHRHPHGHQHMIPLSPMEENCCKWVQQLDSECICGL 120
PPS SLSDNHES G P + H H H HGHQH I LSPMEENCCKWVQQLDS+C+C L
Sbjct: 61 PPSLSLSDNHESHGLPPN---HKHLRTHWHGHQHQISLSPMEENCCKWVQQLDSQCLCRL 120
Query: 121 LSRLPAFLERPQHNFSVTVGGS 143
LSRLPAFLERPQH S +VGG+
Sbjct: 121 LSRLPAFLERPQHKISFSVGGN 137
BLAST of HG10022858 vs. ExPASy TrEMBL
Match:
A0A0A0KAN7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G392440 PE=4 SV=1)
HSP 1 Score: 235.3 bits (599), Expect = 1.7e-58
Identity = 112/151 (74.17%), Postives = 127/151 (84.11%), Query Frame = 0
Query: 1 MGRFNILILAVTLFLALITETEGQWRQPPDVAPRPLCASQIALANYACAMLPYSTVLPPP 60
MGRF ILIL+V LFL LIT++E QW QPP VAPRPLCASQI LANYACA LPY+ + PPP
Sbjct: 1 MGRFKILILSVVLFLTLITDSESQW-QPPGVAPRPLCASQITLANYACATLPYAKLPPPP 60
Query: 61 PPSSSLSDNHESQGTPSHTHRHGHQHRHPHGHQHMIPLSPMEENCCKWVQQLDSECICGL 120
PPSSSLS+N +SQG+P H H HGH HR H H IPL+P+EENCCKWVQQ+DSEC+C L
Sbjct: 61 PPSSSLSNNQDSQGSPVHGHHHGHHHR----HHHKIPLTPIEENCCKWVQQVDSECVCEL 120
Query: 121 LSRLPAFLERPQHNFSVTVGGSCDATYSCGG 152
LSRLPAFL+RP HNFSVT+GGSC+ATY CGG
Sbjct: 121 LSRLPAFLKRPIHNFSVTIGGSCNATYWCGG 146
BLAST of HG10022858 vs. ExPASy TrEMBL
Match:
A0A5A7UTY5 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold328G00120 PE=4 SV=1)
HSP 1 Score: 214.9 bits (546), Expect = 2.3e-52
Identity = 103/131 (78.63%), Postives = 112/131 (85.50%), Query Frame = 0
Query: 23 GQWRQPPDVAPRPLCASQIALANYACAMLPYSTVLPPPPPSSSLSDNHESQGTPSHTHRH 82
GQW QPP VAPRPLCASQIALANYACA LPYS +LPPPPPSSSLS NH+SQG+P HTH H
Sbjct: 83 GQW-QPPAVAPRPLCASQIALANYACATLPYSKLLPPPPPSSSLSSNHDSQGSPVHTHHH 142
Query: 83 GHQHRHPHGHQHMIPLSPMEENCCKWVQQLDSECICGLLSRLPAFLERPQHNFSVTVGG- 142
GH HR H H IPLSP+EENCCKWVQQ+DSEC+C LLSRLPAFL+RP HNFSV +GG
Sbjct: 143 GHHHR----HHHKIPLSPIEENCCKWVQQVDSECVCELLSRLPAFLKRPIHNFSVIIGGS 202
Query: 143 -SCDATYSCGG 152
SC+ATYSCGG
Sbjct: 203 LSCNATYSCGG 208
BLAST of HG10022858 vs. ExPASy TrEMBL
Match:
A0A0B2QE80 (Uncharacterized protein OS=Glycine soja OX=3848 GN=D0Y65_007235 PE=4 SV=1)
HSP 1 Score: 152.9 bits (385), Expect = 1.1e-33
Identity = 75/150 (50.00%), Postives = 100/150 (66.67%), Query Frame = 0
Query: 4 FNILILAVTLFLALITETEGQWRQPPDVAPRPLCASQIALANYACAMLPYSTVLPPPPPS 63
FN++ LA+TLFLA++ + E Q + P PRPLCASQ AL NYAC+ LP+S +PP PS
Sbjct: 4 FNLIALALTLFLAIVPKMESQIKPIPK-TPRPLCASQFALVNYACSRLPFSPGVPPDSPS 63
Query: 64 SSLSDNHESQGTPSHTHRHGHQHRHPHGHQHMIPLSPMEENCCKWVQQLDSECICGLLSR 123
+N+ + H HRHGH+HRH HQ +P E+NCC+W +++D +C+C LL R
Sbjct: 64 PDEENNNHHNRSHRHGHRHGHRHRH---HQ-----TPDEDNCCRWAKEVDHQCVCELLLR 123
Query: 124 LPAFLERPQHNFSVTVGGSCDATYSCGGGI 154
LP FL RP H +++ VG SCD TYSCG I
Sbjct: 124 LPPFLIRPSHQYTLKVGASCDITYSCGAPI 144
BLAST of HG10022858 vs. ExPASy TrEMBL
Match:
K7MYJ6 (Uncharacterized protein OS=Glycine max OX=3847 GN=102669806 PE=4 SV=1)
HSP 1 Score: 149.8 bits (377), Expect = 9.2e-33
Identity = 74/153 (48.37%), Postives = 101/153 (66.01%), Query Frame = 0
Query: 4 FNILILAVTLFLALITETEGQWR---QPPDVAPRPLCASQIALANYACAMLPYSTVLPPP 63
FN++ LA+T+FLAL+ + E R P PRPLCASQ AL NYAC+ LP+S +PP
Sbjct: 4 FNLITLAMTIFLALVPKMESHIRTPGMPMPKTPRPLCASQFALVNYACSRLPFSLGVPPD 63
Query: 64 PPSSSLSDNHESQGTPSHTHRHGHQHRHPHGHQHMIPLSPMEENCCKWVQQLDSECICGL 123
PS+ S N E +G ++ H H+H H HGH+H + E+NCC+W +++D++C+C L
Sbjct: 64 SPSTPPSPNDE-EGHRNNHHNGSHRHGHRHGHKHRNHQTADEDNCCRWAKEVDNQCVCEL 123
Query: 124 LSRLPAFLERPQHNFSVTVGGSCDATYSCGGGI 154
L RLP FL RP H +++ VG SCD TYSCG I
Sbjct: 124 LLRLPPFLIRPLHQYTLNVGESCDITYSCGAPI 155
BLAST of HG10022858 vs. ExPASy TrEMBL
Match:
A0A0B2SLT8 (Uncharacterized protein OS=Glycine soja OX=3848 GN=glysoja_028376 PE=4 SV=1)
HSP 1 Score: 149.8 bits (377), Expect = 9.2e-33
Identity = 74/153 (48.37%), Postives = 101/153 (66.01%), Query Frame = 0
Query: 4 FNILILAVTLFLALITETEGQWR---QPPDVAPRPLCASQIALANYACAMLPYSTVLPPP 63
FN++ LA+T+FLAL+ + E R P PRPLCASQ AL NYAC+ LP+S +PP
Sbjct: 4 FNLITLAMTIFLALVPKMESHIRTPGMPMPKTPRPLCASQFALVNYACSRLPFSLGVPPD 63
Query: 64 PPSSSLSDNHESQGTPSHTHRHGHQHRHPHGHQHMIPLSPMEENCCKWVQQLDSECICGL 123
PS+ S N E +G ++ H H+H H HGH+H + E+NCC+W +++D++C+C L
Sbjct: 64 SPSTPPSPNDE-EGHRNNHHNGSHRHGHRHGHKHRNHQTADEDNCCRWAKEVDNQCVCEL 123
Query: 124 LSRLPAFLERPQHNFSVTVGGSCDATYSCGGGI 154
L RLP FL RP H +++ VG SCD TYSCG I
Sbjct: 124 LLRLPPFLIRPLHQYTLNVGESCDITYSCGAPI 155
BLAST of HG10022858 vs. TAIR 10
Match:
AT3G63095.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 92.0 bits (227), Expect = 4.4e-19
Identity = 75/247 (30.36%), Postives = 94/247 (38.06%), Query Frame = 0
Query: 7 LILAVTLFLALITETEGQ--WRQPPDVAPRPLCASQIALANYACAMLPYSTVLPP----P 66
L +AVTL LAL + T GQ + PP PRPLCASQ ALANYAC+ LP TV PP P
Sbjct: 4 LFIAVTLALALSSFTRGQRVIQIPP---PRPLCASQYALANYACSRLPMHTVPPPAPITP 63
Query: 67 PPS---------------------------------SSLSDNHESQGTPSHTH------- 126
PP+ D+H+ H H
Sbjct: 64 PPAPIAPPPPEHDHHDHDHDHDHDHDHDDDHDDDDDHDHDDDHDHDHDDDHDHDDDDRDR 123
Query: 127 --------------------------------------------------------RHGH 152
R H
Sbjct: 124 DRDRDRDRDRDRDHDRAHDDHDHNHDHDNDHNDNNHDHNHDADHHNNDDNSHDQHRRRHH 183
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038899044.1 | 6.9e-75 | 90.32 | uncharacterized protein LOC120086453 [Benincasa hispida] | [more] |
XP_031744964.1 | 3.4e-58 | 74.17 | uncharacterized protein LOC116405195 [Cucumis sativus] >KGN44871.1 hypothetical ... | [more] |
XP_023548911.1 | 3.9e-54 | 74.34 | uncharacterized protein LOC111807420 [Cucurbita pepo subsp. pepo] | [more] |
KAA0057606.1 | 4.8e-52 | 78.63 | hypothetical protein E6C27_scaffold497G00830 [Cucumis melo var. makuwa] >TYK2098... | [more] |
KAG7014427.1 | 3.0e-46 | 72.54 | hypothetical protein SDJN02_24604, partial [Cucurbita argyrosperma subsp. argyro... | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A0A0KAN7 | 1.7e-58 | 74.17 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G392440 PE=4 SV=1 | [more] |
A0A5A7UTY5 | 2.3e-52 | 78.63 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... | [more] |
A0A0B2QE80 | 1.1e-33 | 50.00 | Uncharacterized protein OS=Glycine soja OX=3848 GN=D0Y65_007235 PE=4 SV=1 | [more] |
K7MYJ6 | 9.2e-33 | 48.37 | Uncharacterized protein OS=Glycine max OX=3847 GN=102669806 PE=4 SV=1 | [more] |
A0A0B2SLT8 | 9.2e-33 | 48.37 | Uncharacterized protein OS=Glycine soja OX=3848 GN=glysoja_028376 PE=4 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
AT3G63095.1 | 4.4e-19 | 30.36 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |