Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGCTTTATTTTTCAATCGCCAAAAGAAATTCATTTCCAATTGAGAAATTCATCGAAGAAGAAAAAAAAGAAGTGACTTTTGTCTCTGATCGGAAATCCTCGCCGACGGCCGGCGGTTTACTGCCGGAGAGTGAATGTGCCGACGATAATTTCACAACCACCGAGGACTCCGAAGTGGGGTGGGGGGCTTTCGACCGGTTTAAGGCAAAAACCCTAGACCAGTCCGATGAGGGTTCGATTTCCGACGAGGAAAGTTTGATCGAGATCTCTCTCCCGACCGGTCATTATGTCAGCCACGAGTTCGAGGATGAGGAAGAAGAAGAGGACGACATGTCGTCTTGTTACCGGGAGCTACCGGAATTTGAGAAGCAGAAGAAGAATCTTATGGAGCTGTTGGCTGAAATTAACGACATCGACGAGGAGAATTTGATAGAGATCGACATATCCATGGGCTCCATTAAGTATTCAAGGTTTGAAATTGGAGGAAGGAAATAA
mRNA sequence
ATGCTTTATTTTTCAATCGCCAAAAGAAATTCATTTCCAATTGAGAAATTCATCGAAGAAGAAAAAAAAGAAGTGACTTTTGTCTCTGATCGGAAATCCTCGCCGACGGCCGGCGGTTTACTGCCGGAGAGTGAATGTGCCGACGATAATTTCACAACCACCGAGGACTCCGAAGTGGGGTGGGGGGCTTTCGACCGGTTTAAGGCAAAAACCCTAGACCAGTCCGATGAGGGTTCGATTTCCGACGAGGAAAGTTTGATCGAGATCTCTCTCCCGACCGGTCATTATGTCAGCCACGAGTTCGAGGATGAGGAAGAAGAAGAGGACGACATGTCGTCTTGTTACCGGGAGCTACCGGAATTTGAGAAGCAGAAGAAGAATCTTATGGAGCTGTTGGCTGAAATTAACGACATCGACGAGGAGAATTTGATAGAGATCGACATATCCATGGGCTCCATTAAGTATTCAAGGTTTGAAATTGGAGGAAGGAAATAA
Coding sequence (CDS)
ATGCTTTATTTTTCAATCGCCAAAAGAAATTCATTTCCAATTGAGAAATTCATCGAAGAAGAAAAAAAAGAAGTGACTTTTGTCTCTGATCGGAAATCCTCGCCGACGGCCGGCGGTTTACTGCCGGAGAGTGAATGTGCCGACGATAATTTCACAACCACCGAGGACTCCGAAGTGGGGTGGGGGGCTTTCGACCGGTTTAAGGCAAAAACCCTAGACCAGTCCGATGAGGGTTCGATTTCCGACGAGGAAAGTTTGATCGAGATCTCTCTCCCGACCGGTCATTATGTCAGCCACGAGTTCGAGGATGAGGAAGAAGAAGAGGACGACATGTCGTCTTGTTACCGGGAGCTACCGGAATTTGAGAAGCAGAAGAAGAATCTTATGGAGCTGTTGGCTGAAATTAACGACATCGACGAGGAGAATTTGATAGAGATCGACATATCCATGGGCTCCATTAAGTATTCAAGGTTTGAAATTGGAGGAAGGAAATAA
Protein sequence
MLYFSIAKRNSFPIEKFIEEEKKEVTFVSDRKSSPTAGGLLPESECADDNFTTTEDSEVGWGAFDRFKAKTLDQSDEGSISDEESLIEISLPTGHYVSHEFEDEEEEEDDMSSCYRELPEFEKQKKNLMELLAEINDIDEENLIEIDISMGSIKYSRFEIGGRK
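The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and reading frame 0; the names `CODON_TABLE` and `translate` are illustrative and are not part of this page or of any database tool.

```python
# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing partial codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":  # stop codon: end of the polypeptide
            break
        protein.append(residue)
    return "".join(protein)


# First six codons of the CDS listed above.
print(translate("ATGCTTTATTTTTCAATC"))  # MLYFSI
# Last five codons of the CDS, ending in the TAA stop codon.
print(translate("GGAGGAAGGAAATAA"))     # GGRK
```

Passing the full CDS string to `translate` gives the complete polypeptide for comparison with the protein sequence. The sketch raises `KeyError` on ambiguous bases such as `N`, which do not occur in this sequence.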
Homology
BLAST of Carg05156 vs. NCBI nr
Match:
KAG6604917.1 (hypothetical protein SDJN03_02234, partial [Cucurbita argyrosperma subsp. sororia] >KAG7026997.1 hypothetical protein SDJN02_11005, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 320.9 bits (821), Expect = 6.6e-84
Identity = 164/164 (100.00%), Positives = 164/164 (100.00%), Query Frame = 0
Query: 1 MLYFSIAKRNSFPIEKFIEEEKKEVTFVSDRKSSPTAGGLLPESECADDNFTTTEDSEVG 60
MLYFSIAKRNSFPIEKFIEEEKKEVTFVSDRKSSPTAGGLLPESECADDNFTTTEDSEVG
Sbjct: 1 MLYFSIAKRNSFPIEKFIEEEKKEVTFVSDRKSSPTAGGLLPESECADDNFTTTEDSEVG 60
Query: 61 WGAFDRFKAKTLDQSDEGSISDEESLIEISLPTGHYVSHEFEDEEEEEDDMSSCYRELPE 120
WGAFDRFKAKTLDQSDEGSISDEESLIEISLPTGHYVSHEFEDEEEEEDDMSSCYRELPE
Sbjct: 61 WGAFDRFKAKTLDQSDEGSISDEESLIEISLPTGHYVSHEFEDEEEEEDDMSSCYRELPE 120
Query: 121 FEKQKKNLMELLAEINDIDEENLIEIDISMGSIKYSRFEIGGRK 164
FEKQKKNLMELLAEINDIDEENLIEIDISMGSIKYSRFEIGGRK
Sbjct: 121 FEKQKKNLMELLAEINDIDEENLIEIDISMGSIKYSRFEIGGRK 164
BLAST of Carg05156 vs. NCBI nr
Match:
XP_038902685.1 (uncharacterized protein LOC120089322 [Benincasa hispida])
HSP 1 Score: 198.0 bits (502), Expect = 6.4e-47
Identity = 115/177 (64.97%), Positives = 139/177 (78.53%), Query Frame = 0
Query: 1 MLYFSIAKRNSFPIEKFI--------EEEKKEVTFVSDR-----KSSPTAGGLLPESECA 60
MLYFSI+KRNS PI+KF+ EE++K++TF SD S+PT G LL ESECA
Sbjct: 46 MLYFSISKRNSLPIQKFVQQVQEEEEEEDQKKLTFFSDHLKSSSVSAPTTGDLLSESECA 105
Query: 61 DDNFTTTEDSEVGWGAFDR--FKAKTLDQSDEGSISDEESLIEISLPTGHYVSHEFED-E 120
DDNFTTTE+SEV W FD +K D+SD+GSISDEESLIEI+LPTGHYVSH+F+D +
Sbjct: 106 DDNFTTTEESEVEWPYFDHRSYKPNNPDRSDDGSISDEESLIEIALPTGHYVSHKFDDVD 165
Query: 121 EEEEDDMSSCYRELPEFEKQKKNLMELLAEINDI-DEENLIEIDISMGSIKYSRFEI 161
+E+ED MS Y++L +F QKKNL+E+LAEINDI +EENLIEIDISMGSIKYSRFEI
Sbjct: 166 DEDEDGMSFRYQKLSDF--QKKNLVEVLAEINDINEEENLIEIDISMGSIKYSRFEI 220
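The percentages on each HSP statistics line are the ratio of identical (or positively scoring) columns to the alignment length. The following is a minimal sketch that parses such a line and recomputes both values; the regular expression and the function name `parse_hsp_stats` are assumptions made for illustration, written against the line format shown on this page rather than any official BLAST parser. The pattern tolerates variant spellings of the "Positives" label.

```python
import re

# Pattern written against the statistics lines on this page (an assumption,
# not an official format definition). "Pos\w*ives" accepts spelling variants.
STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Pos\w*ives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract the counts from an HSP statistics line and recompute percentages."""
    match = STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, length, positives, length_pos = map(int, match.groups())
    return {
        "identical": identical,
        "positives": positives,
        "alignment_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length_pos, 2),
    }


stats = parse_hsp_stats(
    "Identity = 115/177 (64.97%), Positives = 139/177 (78.53%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])  # 64.97 78.53
```

Note that the denominator is the alignment length including gap columns (177 for the Benincasa hispida hit), not the 164-residue query length.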
BLAST of Carg05156 vs. NCBI nr
Match:
XP_022985724.1 (uncharacterized protein LOC111483694 [Cucurbita maxima])
HSP 1 Score: 192.6 bits (488), Expect = 2.7e-45
Identity = 112/173 (64.74%), Positives = 138/173 (79.77%), Query Frame = 0
Query: 1 MLYFSIAKRNSFPIEKFI----EEEKKEVTFVS------DRKSSPTAGGLLPESECADDN 60
MLYFSI+K N FP+ K + +EE+KE+TF+S S+PT LL ESECADDN
Sbjct: 55 MLYFSISKTNPFPLHKSLHHQQQEEQKELTFLSSDHRNFSSASAPTV-DLLSESECADDN 114
Query: 61 FTTTEDSEVGWGAFDRFKAKTLDQSDEGSISDEESLIEISLPTGHYVSHEFEDEEEEEDD 120
FTTTE+SEV W FDR AK L +SD+GSISDEESLIEI+LPTGHYV+H+ D+++++DD
Sbjct: 115 FTTTEESEVEWPFFDRSTAKNLYRSDDGSISDEESLIEIALPTGHYVNHK-SDDDDDKDD 174
Query: 121 MSSCYRELPEFE--KQKKNLMELLAEINDI-DEENLIEIDISMGSIKYSRFEI 161
MS C+++L EF+ K+KKNL+ELLAEINDI +EENLIEIDISMGSIKYSRFEI
Sbjct: 175 MSFCFQKLSEFQKLKRKKNLVELLAEINDINEEENLIEIDISMGSIKYSRFEI 225
BLAST of Carg05156 vs. NCBI nr
Match:
XP_022943897.1 (uncharacterized protein LOC111448485 [Cucurbita moschata])
HSP 1 Score: 192.2 bits (487), Expect = 3.5e-45
Identity = 112/174 (64.37%), Positives = 138/174 (79.31%), Query Frame = 0
Query: 1 MLYFSIAKRNSFPIEKFI----EEEKKEVTFVS------DRKSSPTAGGLLPESECADDN 60
MLYFSI+K N FP+ K + ++E+KE+TF+S S+PT LL ESECADDN
Sbjct: 55 MLYFSISKTNPFPLHKSLHQQQQQEQKELTFLSSDHRNFSSASAPTV-DLLSESECADDN 114
Query: 61 FTTTEDSEVGWGAFDRFKAKTLDQSDEGSISDEESLIEISLPTGHYVSHEF-EDEEEEED 120
FTTTE+SEV W FDR A L +SD+GSISDEESLIEI+LPTGHYVSH+ +D++++ED
Sbjct: 115 FTTTEESEVEWPFFDRSTANNLYRSDDGSISDEESLIEIALPTGHYVSHKSDDDDDDDED 174
Query: 121 DMSSCYRELPEFE--KQKKNLMELLAEINDI-DEENLIEIDISMGSIKYSRFEI 161
DMS C+++L EF+ K+KKNL+ELLAEINDI +EENLIEIDISMGSIKYSRFEI
Sbjct: 175 DMSFCFQKLSEFQKLKRKKNLVELLAEINDINEEENLIEIDISMGSIKYSRFEI 227
BLAST of Carg05156 vs. NCBI nr
Match:
TYK11971.1 (uncharacterized protein E5676_scaffold177G001550 [Cucumis melo var. makuwa])
HSP 1 Score: 191.8 bits (486), Expect = 4.6e-45
Identity = 113/177 (63.84%), Positives = 138/177 (77.97%), Query Frame = 0
Query: 1 MLYFSIAKRNSFPIEKFI-------EEEKKEVTFVSDR-----KSSPTAGGLLPESECAD 60
MLYFSI+KR+S PI+KF+ EE++KE+TF SD S+PT LL ESECAD
Sbjct: 3 MLYFSISKRSSLPIQKFVQEQEEEEEEDQKELTFFSDHLKSSSASAPTTRDLLSESECAD 62
Query: 61 DNFTTTEDSEVGWGAFDR--FKAKTLDQSDEGSISDEESLIEISLPTGHYVSHEFE--DE 120
DNFTTTE+SEV W FD +KA++ +SD+GSISDEESLIEI+LPTGHYVS +F+ DE
Sbjct: 63 DNFTTTEESEVEWPYFDHRFYKARSPHRSDDGSISDEESLIEIALPTGHYVSRKFDDVDE 122
Query: 121 EEEEDDMSSCYRELPEFEKQKKNLMELLAEINDI-DEENLIEIDISMGSIKYSRFEI 161
+E+EDD Y++L +F QKKNL+E+LAEINDI +EENLIEIDISMGSIKYSRFEI
Sbjct: 123 DEDEDDTPFRYQKLSDF--QKKNLVEVLAEINDINEEENLIEIDISMGSIKYSRFEI 177
BLAST of Carg05156 vs. ExPASy TrEMBL
Match:
A0A0A0KCP7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G112470 PE=4 SV=1)
HSP 1 Score: 193.0 bits (489), Expect = 1.0e-45
Identity = 113/173 (65.32%), Positives = 137/173 (79.19%), Query Frame = 0
Query: 1 MLYFSIAKRNSFPIEKFI-----EEEKKEVTFVSDR-----KSSPTAGGLLPESECADDN 60
MLYFSI+KRNS PI KF+ EE++K++TF SD S+PT LL ESECADDN
Sbjct: 3 MLYFSISKRNSLPIRKFVQEEEEEEDQKQLTFFSDHLKSSSASAPTIEDLLSESECADDN 62
Query: 61 FTTTEDSEVGWGAFD-RF-KAKTLDQSDEGSISDEESLIEISLPTGHYVSHEFEDEEEEE 120
FTTTE+SEV W FD RF K + +SD+GSISDEESLIEI+LPTGHYVS +F+D +E+E
Sbjct: 63 FTTTEESEVEWPYFDHRFNKTRNPHRSDDGSISDEESLIEIALPTGHYVSRKFDDVDEDE 122
Query: 121 DDMSSCYRELPEFEKQKKNLMELLAEINDI-DEENLIEIDISMGSIKYSRFEI 161
DD+S Y++L +F Q+KNL+E+LAEINDI +EENLIEIDISMGSIKYSRFEI
Sbjct: 123 DDISFRYQKLTDF--QRKNLVEVLAEINDINEEENLIEIDISMGSIKYSRFEI 173
BLAST of Carg05156 vs. ExPASy TrEMBL
Match:
A0A6J1JE34 (uncharacterized protein LOC111483694 OS=Cucurbita maxima OX=3661 GN=LOC111483694 PE=4 SV=1)
HSP 1 Score: 192.6 bits (488), Expect = 1.3e-45
Identity = 112/173 (64.74%), Positives = 138/173 (79.77%), Query Frame = 0
Query: 1 MLYFSIAKRNSFPIEKFI----EEEKKEVTFVS------DRKSSPTAGGLLPESECADDN 60
MLYFSI+K N FP+ K + +EE+KE+TF+S S+PT LL ESECADDN
Sbjct: 55 MLYFSISKTNPFPLHKSLHHQQQEEQKELTFLSSDHRNFSSASAPTV-DLLSESECADDN 114
Query: 61 FTTTEDSEVGWGAFDRFKAKTLDQSDEGSISDEESLIEISLPTGHYVSHEFEDEEEEEDD 120
FTTTE+SEV W FDR AK L +SD+GSISDEESLIEI+LPTGHYV+H+ D+++++DD
Sbjct: 115 FTTTEESEVEWPFFDRSTAKNLYRSDDGSISDEESLIEIALPTGHYVNHK-SDDDDDKDD 174
Query: 121 MSSCYRELPEFE--KQKKNLMELLAEINDI-DEENLIEIDISMGSIKYSRFEI 161
MS C+++L EF+ K+KKNL+ELLAEINDI +EENLIEIDISMGSIKYSRFEI
Sbjct: 175 MSFCFQKLSEFQKLKRKKNLVELLAEINDINEEENLIEIDISMGSIKYSRFEI 225
BLAST of Carg05156 vs. ExPASy TrEMBL
Match:
A0A6J1FUA8 (uncharacterized protein LOC111448485 OS=Cucurbita moschata OX=3662 GN=LOC111448485 PE=4 SV=1)
HSP 1 Score: 192.2 bits (487), Expect = 1.7e-45
Identity = 112/174 (64.37%), Positives = 138/174 (79.31%), Query Frame = 0
Query: 1 MLYFSIAKRNSFPIEKFI----EEEKKEVTFVS------DRKSSPTAGGLLPESECADDN 60
MLYFSI+K N FP+ K + ++E+KE+TF+S S+PT LL ESECADDN
Sbjct: 55 MLYFSISKTNPFPLHKSLHQQQQQEQKELTFLSSDHRNFSSASAPTV-DLLSESECADDN 114
Query: 61 FTTTEDSEVGWGAFDRFKAKTLDQSDEGSISDEESLIEISLPTGHYVSHEF-EDEEEEED 120
FTTTE+SEV W FDR A L +SD+GSISDEESLIEI+LPTGHYVSH+ +D++++ED
Sbjct: 115 FTTTEESEVEWPFFDRSTANNLYRSDDGSISDEESLIEIALPTGHYVSHKSDDDDDDDED 174
Query: 121 DMSSCYRELPEFE--KQKKNLMELLAEINDI-DEENLIEIDISMGSIKYSRFEI 161
DMS C+++L EF+ K+KKNL+ELLAEINDI +EENLIEIDISMGSIKYSRFEI
Sbjct: 175 DMSFCFQKLSEFQKLKRKKNLVELLAEINDINEEENLIEIDISMGSIKYSRFEI 227
BLAST of Carg05156 vs. ExPASy TrEMBL
Match:
A0A5D3CJH9 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold177G001550 PE=4 SV=1)
HSP 1 Score: 191.8 bits (486), Expect = 2.2e-45
Identity = 113/177 (63.84%), Positives = 138/177 (77.97%), Query Frame = 0
Query: 1 MLYFSIAKRNSFPIEKFI-------EEEKKEVTFVSDR-----KSSPTAGGLLPESECAD 60
MLYFSI+KR+S PI+KF+ EE++KE+TF SD S+PT LL ESECAD
Sbjct: 3 MLYFSISKRSSLPIQKFVQEQEEEEEEDQKELTFFSDHLKSSSASAPTTRDLLSESECAD 62
Query: 61 DNFTTTEDSEVGWGAFDR--FKAKTLDQSDEGSISDEESLIEISLPTGHYVSHEFE--DE 120
DNFTTTE+SEV W FD +KA++ +SD+GSISDEESLIEI+LPTGHYVS +F+ DE
Sbjct: 63 DNFTTTEESEVEWPYFDHRFYKARSPHRSDDGSISDEESLIEIALPTGHYVSRKFDDVDE 122
Query: 121 EEEEDDMSSCYRELPEFEKQKKNLMELLAEINDI-DEENLIEIDISMGSIKYSRFEI 161
+E+EDD Y++L +F QKKNL+E+LAEINDI +EENLIEIDISMGSIKYSRFEI
Sbjct: 123 DEDEDDTPFRYQKLSDF--QKKNLVEVLAEINDINEEENLIEIDISMGSIKYSRFEI 177
BLAST of Carg05156 vs. ExPASy TrEMBL
Match:
A0A1S3CMD0 (uncharacterized protein LOC103502111 OS=Cucumis melo OX=3656 GN=LOC103502111 PE=4 SV=1)
HSP 1 Score: 191.8 bits (486), Expect = 2.2e-45
Identity = 113/177 (63.84%), Positives = 138/177 (77.97%), Query Frame = 0
Query: 1 MLYFSIAKRNSFPIEKFI-------EEEKKEVTFVSDR-----KSSPTAGGLLPESECAD 60
MLYFSI+KR+S PI+KF+ EE++KE+TF SD S+PT LL ESECAD
Sbjct: 56 MLYFSISKRSSLPIQKFVQEQEEEEEEDQKELTFFSDHLKSSSASAPTTRDLLSESECAD 115
Query: 61 DNFTTTEDSEVGWGAFDR--FKAKTLDQSDEGSISDEESLIEISLPTGHYVSHEFE--DE 120
DNFTTTE+SEV W FD +KA++ +SD+GSISDEESLIEI+LPTGHYVS +F+ DE
Sbjct: 116 DNFTTTEESEVEWPYFDHRFYKARSPHRSDDGSISDEESLIEIALPTGHYVSRKFDDVDE 175
Query: 121 EEEEDDMSSCYRELPEFEKQKKNLMELLAEINDI-DEENLIEIDISMGSIKYSRFEI 161
+E+EDD Y++L +F QKKNL+E+LAEINDI +EENLIEIDISMGSIKYSRFEI
Sbjct: 176 DEDEDDTPFRYQKLSDF--QKKNLVEVLAEINDINEEENLIEIDISMGSIKYSRFEI 230
BLAST of Carg05156 vs. TAIR 10
Match:
AT2G04480.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G35870.1); Has 38 Blast hits to 38 proteins in 9 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 38; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 77.4 bits (189), Expect = 1.2e-14
Identity = 45/92 (48.91%), Positives = 63/92 (68.48%), Query Frame = 0
Query: 70 KTLDQSDEGSISDEESLIEISLPTGHYVSHEFEDEEEEEDDMSSCYRELPEFEKQKKNLM 129
KT ++ D+G+I D+ESLIE+SLP+GHY+ H + + + ++ +F L
Sbjct: 163 KTNEEDDDGAIPDDESLIELSLPSGHYLGHHYNSNKNH----LYIHNKVQDF-----RLF 222
Query: 130 ELLAEINDI-DEENLIEIDISMGSIKYSRFEI 161
+LL EIND +E+NLIEIDIS+GSIKYSRFEI
Sbjct: 223 DLLNEINDFTEEDNLIEIDISIGSIKYSRFEI 245
BLAST of Carg05156 vs. TAIR 10
Match:
AT5G35870.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G04480.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )
HSP 1 Score: 70.1 bits (170), Expect = 1.9e-12
Identity = 42/88 (47.73%), Positives = 58/88 (65.91%), Query Frame = 0
Query: 73 DQSDEGSISDEESLIEISLPTGHYVSHEFEDEEEEEDDMSSCYRELPEFEKQKKNLMELL 132
+ D+G+I DEESLIE+SLP+GHY+ H + ++ D Y +P+F L++L
Sbjct: 116 NDEDDGTIPDEESLIELSLPSGHYIGHHYSTMIGKQ-DQKIMYNNIPDF-----RLIQLS 175
Query: 133 AEINDIDEENLIEIDISMGSIKYSRFEI 161
AE D +NLIEIDIS+GSIK SRF+I
Sbjct: 176 AEYED---DNLIEIDISIGSIKCSRFQI 194
The following BLAST results are available for this feature:
NCBI nr
Match Name | E-value | Identity (%) | Description
KAG6604917.1 | 6.6e-84 | 100.00 | hypothetical protein SDJN03_02234, partial [Cucurbita argyrosperma subsp. sorori...
XP_038902685.1 | 6.4e-47 | 64.97 | uncharacterized protein LOC120089322 [Benincasa hispida]
XP_022985724.1 | 2.7e-45 | 64.74 | uncharacterized protein LOC111483694 [Cucurbita maxima]
XP_022943897.1 | 3.5e-45 | 64.37 | uncharacterized protein LOC111448485 [Cucurbita moschata]
TYK11971.1 | 4.6e-45 | 63.84 | uncharacterized protein E5676_scaffold177G001550 [Cucumis melo var. makuwa]
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A0A0KCP7 | 1.0e-45 | 65.32 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G112470 PE=4 SV=1
A0A6J1JE34 | 1.3e-45 | 64.74 | uncharacterized protein LOC111483694 OS=Cucurbita maxima OX=3661 GN=LOC111483694...
A0A6J1FUA8 | 1.7e-45 | 64.37 | uncharacterized protein LOC111448485 OS=Cucurbita moschata OX=3662 GN=LOC1114484...
A0A5D3CJH9 | 2.2e-45 | 63.84 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3CMD0 | 2.2e-45 | 63.84 | uncharacterized protein LOC103502111 OS=Cucumis melo OX=3656 GN=LOC103502111 PE=...
TAIR 10
Match Name | E-value | Identity (%) | Description
AT2G04480.1 | 1.2e-14 | 48.91 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G35870.1 | 1.9e-12 | 47.73 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
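The pipe-delimited summary rows can be loaded and ranked programmatically. The following is a minimal sketch, assuming each row carries the match name, E-value, percent identity, and description in its first four cells; the `Hit` type and `parse_row` function are hypothetical helpers, not part of the database. Any cells after the fourth are ignored.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    name: str
    evalue: float
    identity: float  # percent identity
    description: str


def parse_row(row: str) -> Hit:
    """Parse one pipe-delimited summary row; cells beyond the fourth are ignored."""
    cells = [cell.strip() for cell in row.split("|")]
    name, evalue, identity, description = cells[:4]
    return Hit(name, float(evalue), float(identity), description)


rows = [
    "XP_022985724.1 | 2.7e-45 | 64.74 | uncharacterized protein LOC111483694 [Cucurbita maxima]",
    "XP_038902685.1 | 6.4e-47 | 64.97 | uncharacterized protein LOC120089322 [Benincasa hispida]",
    "XP_022943897.1 | 3.5e-45 | 64.37 | uncharacterized protein LOC111448485 [Cucurbita moschata]",
]

# A smaller E-value means a more significant match, so sort ascending.
hits = sorted((parse_row(row) for row in rows), key=lambda hit: hit.evalue)
for hit in hits:
    print(f"{hit.name}\t{hit.evalue:.1e}\t{hit.identity:.2f}")
```

Splitting on `|` is adequate for the rows on this page, but it would misparse a description that itself contains a pipe character.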