Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGGTAACTGTTGCAGAGGTCAATCCTCCACCCCGGTGGTGTCGGGTGGCGATCGGAGGGCTCTGCCGCCGAAGAACCTCAGTCAGAGAAGGCATGGAGCTAATTCTTGTTCTACTACGATCAAGGGTTCAAAAGAGGGTAATCTGAGAGAGTTGAAAATAAGAATGACGAAAAAGGAGCTGAAGGAATTGGTTGGATGGTTGAACATGGCGGACTCGAGTTTTGAACAAGTTATGGCTCGTCTTGTGAACGTTATTAGCGAGCACAATGGTGATGATAGTGAAGAAGATGATAGGAATATTGGGCAGATGGTGAAATTGCGTCAGCAACGTTCATGGAGGCCTTCTCTGCAGAGCATTCCTGAGATAATCTGA
mRNA sequence
ATGGGTAACTGTTGCAGAGGTCAATCCTCCACCCCGGTGGTGTCGGGTGGCGATCGGAGGGCTCTGCCGCCGAAGAACCTCAGTCAGAGAAGGCATGGAGCTAATTCTTGTTCTACTACGATCAAGGGTTCAAAAGAGGGTAATCTGAGAGAGTTGAAAATAAGAATGACGAAAAAGGAGCTGAAGGAATTGGTTGGATGGTTGAACATGGCGGACTCGAGTTTTGAACAAGTTATGGCTCGTCTTGTGAACGTTATTAGCGAGCACAATGGTGATGATAGTGAAGAAGATGATAGGAATATTGGGCAGATGGTGAAATTGCGTCAGCAACGTTCATGGAGGCCTTCTCTGCAGAGCATTCCTGAGATAATCTGA
Coding sequence (CDS)
ATGGGTAACTGTTGCAGAGGTCAATCCTCCACCCCGGTGGTGTCGGGTGGCGATCGGAGGGCTCTGCCGCCGAAGAACCTCAGTCAGAGAAGGCATGGAGCTAATTCTTGTTCTACTACGATCAAGGGTTCAAAAGAGGGTAATCTGAGAGAGTTGAAAATAAGAATGACGAAAAAGGAGCTGAAGGAATTGGTTGGATGGTTGAACATGGCGGACTCGAGTTTTGAACAAGTTATGGCTCGTCTTGTGAACGTTATTAGCGAGCACAATGGTGATGATAGTGAAGAAGATGATAGGAATATTGGGCAGATGGTGAAATTGCGTCAGCAACGTTCATGGAGGCCTTCTCTGCAGAGCATTCCTGAGATAATCTGA
Protein sequence
MGNCCRGQSSTPVVSGGDRRALPPKNLSQRRHGANSCSTTIKGSKEGNLRELKIRMTKKELKELVGWLNMADSSFEQVMARLVNVISEHNGDDSEEDDRNIGQMVKLRQQRSWRPSLQSIPEII
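The coding sequence and the protein sequence listed above can be checked against each other by translating the CDS in frame 0. The following is a minimal Python sketch, assuming the standard genetic code (NCBI translation table 1); the names `translate`, `CDS`, and `PROTEIN` are illustrative and not part of the database record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in
# T, C, A, G order. Assumption: this feature uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# CDS and protein copied from the record above (CDS split into 75-nt chunks).
CDS = (
    "ATGGGTAACTGTTGCAGAGGTCAATCCTCCACCCCGGTGGTGTCGGGTGGCGATCGGAGGGCTCTGCCGCCGAAG"
    "AACCTCAGTCAGAGAAGGCATGGAGCTAATTCTTGTTCTACTACGATCAAGGGTTCAAAAGAGGGTAATCTGAGA"
    "GAGTTGAAAATAAGAATGACGAAAAAGGAGCTGAAGGAATTGGTTGGATGGTTGAACATGGCGGACTCGAGTTTT"
    "GAACAAGTTATGGCTCGTCTTGTGAACGTTATTAGCGAGCACAATGGTGATGATAGTGAAGAAGATGATAGGAAT"
    "ATTGGGCAGATGGTGAAATTGCGTCAGCAACGTTCATGGAGGCCTTCTCTGCAGAGCATTCCTGAGATAATCTGA"
)
PROTEIN = (
    "MGNCCRGQSSTPVVSGGDRRALPPKNLSQRRHGANSCSTTIKGSKEGNLRELKIRMTKKELKELVGWLNMADSSF"
    "EQVMARLVNVISEHNGDDSEEDDRNIGQMVKLRQQRSWRPSLQSIPEII"
)

if __name__ == "__main__":
    # Report whether the translated CDS agrees with the listed protein.
    print(translate(CDS) == PROTEIN)
```

Because the gene, mRNA, and CDS sequences in this record are identical, the same check applies to all three.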
Homology
BLAST of Cp4.1LG10g09660 vs. NCBI nr
Match:
XP_023544016.1 (uncharacterized protein LOC111803724 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 247 bits (630), Expect = 2.06e-82
Identity = 124/124 (100.00%), Positives = 124/124 (100.00%), Query Frame = 0
Query: 1 MGNCCRGQSSTPVVSGGDRRALPPKNLSQRRHGANSCSTTIKGSKEGNLRELKIRMTKKE 60
MGNCCRGQSSTPVVSGGDRRALPPKNLSQRRHGANSCSTTIKGSKEGNLRELKIRMTKKE
Sbjct: 1 MGNCCRGQSSTPVVSGGDRRALPPKNLSQRRHGANSCSTTIKGSKEGNLRELKIRMTKKE 60
Query: 61 LKELVGWLNMADSSFEQVMARLVNVISEHNGDDSEEDDRNIGQMVKLRQQRSWRPSLQSI 120
LKELVGWLNMADSSFEQVMARLVNVISEHNGDDSEEDDRNIGQMVKLRQQRSWRPSLQSI
Sbjct: 61 LKELVGWLNMADSSFEQVMARLVNVISEHNGDDSEEDDRNIGQMVKLRQQRSWRPSLQSI 120
Query: 121 PEII 124
PEII
Sbjct: 121 PEII 124
BLAST of Cp4.1LG10g09660 vs. NCBI nr
Match:
XP_022977330.1 (uncharacterized protein LOC111477681 [Cucurbita maxima])
HSP 1 Score: 234 bits (596), Expect = 3.17e-77
Identity = 117/124 (94.35%), Positives = 119/124 (95.97%), Query Frame = 0
Query: 1 MGNCCRGQSSTPVVSGGDRRALPPKNLSQRRHGANSCSTTIKGSKEGNLRELKIRMTKKE 60
MGNCCRGQSSTPVVSGGDRRA PPKNL QRRHGA SCSTTIKGSKEGNLRELKIRMTKKE
Sbjct: 1 MGNCCRGQSSTPVVSGGDRRAQPPKNLRQRRHGAGSCSTTIKGSKEGNLRELKIRMTKKE 60
Query: 61 LKELVGWLNMADSSFEQVMARLVNVISEHNGDDSEEDDRNIGQMVKLRQQRSWRPSLQSI 120
LKEL+GWLNMADSSFEQVMARLVNVIS+ NGDDSEEDD NIGQMVKLRQQRSWRPSLQSI
Sbjct: 61 LKELIGWLNMADSSFEQVMARLVNVISDQNGDDSEEDDENIGQMVKLRQQRSWRPSLQSI 120
Query: 121 PEII 124
PEII
Sbjct: 121 PEII 124
BLAST of Cp4.1LG10g09660 vs. NCBI nr
Match:
KAG7033944.1 (hypothetical protein SDJN02_03670, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 233 bits (594), Expect = 6.39e-77
Identity = 117/124 (94.35%), Positives = 121/124 (97.58%), Query Frame = 0
Query: 1 MGNCCRGQSSTPVVSGGDRRALPPKNLSQRRHGANSCSTTIKGSKEGNLRELKIRMTKKE 60
MGNCCRGQSSTPVVSG +RRALPPKN+ QRRHGA+SCSTTIKGSKEGN+RELKIRMTKKE
Sbjct: 1 MGNCCRGQSSTPVVSGRNRRALPPKNVRQRRHGASSCSTTIKGSKEGNMRELKIRMTKKE 60
Query: 61 LKELVGWLNMADSSFEQVMARLVNVISEHNGDDSEEDDRNIGQMVKLRQQRSWRPSLQSI 120
LKELVGWLNMADSSFEQVMARLVNVISE NGDDSEEDDRNIGQMVKLRQQRSWRPSLQSI
Sbjct: 61 LKELVGWLNMADSSFEQVMARLVNVISEQNGDDSEEDDRNIGQMVKLRQQRSWRPSLQSI 120
Query: 121 PEII 124
PEII
Sbjct: 121 PEII 124
BLAST of Cp4.1LG10g09660 vs. NCBI nr
Match:
KAG6603773.1 (hypothetical protein SDJN03_04382, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 231 bits (590), Expect = 2.60e-76
Identity = 116/124 (93.55%), Positives = 121/124 (97.58%), Query Frame = 0
Query: 1 MGNCCRGQSSTPVVSGGDRRALPPKNLSQRRHGANSCSTTIKGSKEGNLRELKIRMTKKE 60
MGNCCRGQSSTPVVSG +RRALPPKN+ QRRHGA+SCSTTIKGSKEGN+RELKIRMTKKE
Sbjct: 1 MGNCCRGQSSTPVVSGRNRRALPPKNVRQRRHGASSCSTTIKGSKEGNMRELKIRMTKKE 60
Query: 61 LKELVGWLNMADSSFEQVMARLVNVISEHNGDDSEEDDRNIGQMVKLRQQRSWRPSLQSI 120
LKELVGWLNMADSSFEQVMARLVNVISE NGDDSEE+DRNIGQMVKLRQQRSWRPSLQSI
Sbjct: 61 LKELVGWLNMADSSFEQVMARLVNVISEQNGDDSEEEDRNIGQMVKLRQQRSWRPSLQSI 120
Query: 121 PEII 124
PEII
Sbjct: 121 PEII 124
BLAST of Cp4.1LG10g09660 vs. NCBI nr
Match:
XP_008441100.1 (PREDICTED: uncharacterized protein LOC103485326 [Cucumis melo] >TYK13052.1 hypothetical protein E5676_scaffold255G006540 [Cucumis melo var. makuwa])
HSP 1 Score: 155 bits (391), Expect = 4.06e-46
Identity = 87/126 (69.05%), Positives = 95/126 (75.40%), Query Frame = 0
Query: 1 MGNCCRGQSSTPVVSGGDRRALPPKNLSQRRH--GANSCSTTIKGSKEGNLRELKIRMTK 60
MGNCC+GQ STPV DRR P KN QRRH G++ CS KE NLRE+KIRMTK
Sbjct: 1 MGNCCKGQYSTPVTGCSDRRTPPLKNHRQRRHEEGSSCCS------KETNLREVKIRMTK 60
Query: 61 KELKELVGWLNMADSSFEQVMARLVNVISEHNGDDSEEDDRNIGQMVKLRQQRSWRPSLQ 120
KEL+ELVGWLNMADSSFE VMARLVNVI++ NG D N Q+VKLRQQRSWRPSLQ
Sbjct: 61 KELEELVGWLNMADSSFEHVMARLVNVINDQNG------DNNNDQVVKLRQQRSWRPSLQ 114
Query: 121 SIPEII 124
SIPEII
Sbjct: 121 SIPEII 114
BLAST of Cp4.1LG10g09660 vs. ExPASy TrEMBL
Match:
A0A6J1IM02 (uncharacterized protein LOC111477681 OS=Cucurbita maxima OX=3661 GN=LOC111477681 PE=4 SV=1)
HSP 1 Score: 234 bits (596), Expect = 1.53e-77
Identity = 117/124 (94.35%), Positives = 119/124 (95.97%), Query Frame = 0
Query: 1 MGNCCRGQSSTPVVSGGDRRALPPKNLSQRRHGANSCSTTIKGSKEGNLRELKIRMTKKE 60
MGNCCRGQSSTPVVSGGDRRA PPKNL QRRHGA SCSTTIKGSKEGNLRELKIRMTKKE
Sbjct: 1 MGNCCRGQSSTPVVSGGDRRAQPPKNLRQRRHGAGSCSTTIKGSKEGNLRELKIRMTKKE 60
Query: 61 LKELVGWLNMADSSFEQVMARLVNVISEHNGDDSEEDDRNIGQMVKLRQQRSWRPSLQSI 120
LKEL+GWLNMADSSFEQVMARLVNVIS+ NGDDSEEDD NIGQMVKLRQQRSWRPSLQSI
Sbjct: 61 LKELIGWLNMADSSFEQVMARLVNVISDQNGDDSEEDDENIGQMVKLRQQRSWRPSLQSI 120
Query: 121 PEII 124
PEII
Sbjct: 121 PEII 124
BLAST of Cp4.1LG10g09660 vs. ExPASy TrEMBL
Match:
A0A5D3CNX5 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G006540 PE=4 SV=1)
HSP 1 Score: 155 bits (391), Expect = 1.97e-46
Identity = 87/126 (69.05%), Positives = 95/126 (75.40%), Query Frame = 0
Query: 1 MGNCCRGQSSTPVVSGGDRRALPPKNLSQRRH--GANSCSTTIKGSKEGNLRELKIRMTK 60
MGNCC+GQ STPV DRR P KN QRRH G++ CS KE NLRE+KIRMTK
Sbjct: 1 MGNCCKGQYSTPVTGCSDRRTPPLKNHRQRRHEEGSSCCS------KETNLREVKIRMTK 60
Query: 61 KELKELVGWLNMADSSFEQVMARLVNVISEHNGDDSEEDDRNIGQMVKLRQQRSWRPSLQ 120
KEL+ELVGWLNMADSSFE VMARLVNVI++ NG D N Q+VKLRQQRSWRPSLQ
Sbjct: 61 KELEELVGWLNMADSSFEHVMARLVNVINDQNG------DNNNDQVVKLRQQRSWRPSLQ 114
Query: 121 SIPEII 124
SIPEII
Sbjct: 121 SIPEII 114
BLAST of Cp4.1LG10g09660 vs. ExPASy TrEMBL
Match:
A0A1S3B2N5 (uncharacterized protein LOC103485326 OS=Cucumis melo OX=3656 GN=LOC103485326 PE=4 SV=1)
HSP 1 Score: 155 bits (391), Expect = 1.97e-46
Identity = 87/126 (69.05%), Positives = 95/126 (75.40%), Query Frame = 0
Query: 1 MGNCCRGQSSTPVVSGGDRRALPPKNLSQRRH--GANSCSTTIKGSKEGNLRELKIRMTK 60
MGNCC+GQ STPV DRR P KN QRRH G++ CS KE NLRE+KIRMTK
Sbjct: 1 MGNCCKGQYSTPVTGCSDRRTPPLKNHRQRRHEEGSSCCS------KETNLREVKIRMTK 60
Query: 61 KELKELVGWLNMADSSFEQVMARLVNVISEHNGDDSEEDDRNIGQMVKLRQQRSWRPSLQ 120
KEL+ELVGWLNMADSSFE VMARLVNVI++ NG D N Q+VKLRQQRSWRPSLQ
Sbjct: 61 KELEELVGWLNMADSSFEHVMARLVNVINDQNG------DNNNDQVVKLRQQRSWRPSLQ 114
Query: 121 SIPEII 124
SIPEII
Sbjct: 121 SIPEII 114
BLAST of Cp4.1LG10g09660 vs. ExPASy TrEMBL
Match:
A0A0A0KHU7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G514300 PE=4 SV=1)
HSP 1 Score: 146 bits (368), Expect = 5.69e-43
Identity = 83/124 (66.94%), Positives = 91/124 (73.39%), Query Frame = 0
Query: 1 MGNCCRGQSSTPVVSGGDRRALPPKNLSQRRHGANSCSTTIKGSKEGNLRELKIRMTKKE 60
MGNCC+ QSSTPV+ G P KN QRR+ S SKE NLRE+KIRMTKKE
Sbjct: 1 MGNCCKAQSSTPVMEGSHPTTPPFKNHPQRRNQEGS-------SKETNLREVKIRMTKKE 60
Query: 61 LKELVGWLNMADSSFEQVMARLVNVISEHNGDDSEEDDRNIGQMVKLRQQRSWRPSLQSI 120
L+ELVGWLNMADSSFEQVMARLVNVI+ NG D N Q+VKLRQQRSWRPSLQSI
Sbjct: 61 LQELVGWLNMADSSFEQVMARLVNVINYQNG------DNNNDQVVKLRQQRSWRPSLQSI 111
Query: 121 PEII 124
PE+I
Sbjct: 121 PEMI 111
BLAST of Cp4.1LG10g09660 vs. ExPASy TrEMBL
Match:
A0A059CAC2 (Uncharacterized protein OS=Eucalyptus grandis OX=71139 GN=EUGRSUZ_E04063 PE=4 SV=1)
HSP 1 Score: 68.2 bits (165), Expect = 3.75e-12
Identity = 45/125 (36.00%), Positives = 69/125 (55.20%), Query Frame = 0
Query: 1 MGNCCRGQSSTPVV-SGGDRRALPPKNLSQRRHGANSCSTTIKGSKEGNLRELKIRMTKK 60
MGNCCR +SS+ + +G D +LP ++R ++ + S RE+KI+++K+
Sbjct: 1 MGNCCRAESSSSTIWAGDDWNSLPCDGEGKKRLLDDADGKKVSSSSSSRRREIKIQISKE 60
Query: 61 ELKELVGWLNMADSSFEQVMARLVNVISEHNGDDSEEDDRNIGQMVKLRQQRSWRPSLQS 120
EL++LV + S EQV+ L+N ++ D R G K + RSWRP+LQS
Sbjct: 61 ELEKLVHKIEAQGLSLEQVLPLLIN--------ENTFDRRGSG-FTKYGRHRSWRPALQS 116
Query: 121 IPEII 124
IPE I
Sbjct: 121 IPEAI 116
BLAST of Cp4.1LG10g09660 vs. TAIR 10
Match:
AT4G21920.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-terminal protein myristoylation; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G20340.1); Has 40 Blast hits to 40 proteins in 10 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 40; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 44.7 bits (104), Expect = 6.4e-05
Identity = 36/137 (26.28%), Positives = 62/137 (45.26%), Query Frame = 0
Query: 1 MGNC-CRGQSSTPVVSGGDRRALPPKNLSQRRH-------------GANSCSTTIKGSKE 60
MGNC C + +T SG D + + +R G S T+ S
Sbjct: 1 MGNCICVTEKTTTSWSGDDNGSYNKRRRRRRSTVVHDDNDDGEKLLGETSNVTSTSSSSS 60
Query: 61 GNLRELKIRMTKKELKELVGWLNMADSSFEQVMARLVNVISEHNGDDSEEDDRNIGQMVK 120
RE+KIR+TKKEL++L+ + + + E+++++L+ + G + +
Sbjct: 61 SERREIKIRITKKELEDLMRNIGLKSLTAEEILSKLIFEGGDQIGFSAVD---------V 120
Query: 121 LRQQRSWRPSLQSIPEI 124
+ W+P LQSIPE+
Sbjct: 121 TNHHQPWKPVLQSIPEM 128
BLAST of Cp4.1LG10g09660 vs. TAIR 10
Match:
AT3G20340.1 (Expression of the gene is downregulated in the presence of paraquat, an inducer of photoxidative stress. )
HSP 1 Score: 42.4 bits (98), Expect = 3.2e-04
Identity = 25/73 (34.25%), Positives = 41/73 (56.16%), Query Frame = 0
Query: 51 ELKIRMTKKELKELVGWLNMADSSFEQVMARLVNVISEHNGDDSEEDDRNIGQMVKLRQQ 110
E+KIR+TKK+L +L+ +N+ D +F+Q +++ +++ QQ
Sbjct: 56 EIKIRLTKKQLHDLLSKVNVHDLTFQQ-QTFSCPILNNRGYEEA-------------NQQ 114
Query: 111 RSWRPSLQSIPEI 124
R WRP LQSIPE+
Sbjct: 116 RLWRPVLQSIPEV 114
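The summary lines of each HSP above (bit score, raw score, E-value, and identity counts) follow a regular pattern, so they can be pulled out with two regular expressions. This is a minimal sketch only: the function name `parse_hsp` and the returned dictionary keys are hypothetical, and the patterns assume the exact line layout shown in this record rather than any general BLAST output format.

```python
import re

# Patterns assume lines shaped like the ones in this record, e.g.
#   "HSP 1 Score: 234 bits (596), Expect = 3.17e-77"
#   "Identity = 117/124 (94.35%), ..."
SCORE_RE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)
IDENTITY_RE = re.compile(r"Identity\s*=\s*(\d+)/(\d+)")


def parse_hsp(score_line: str, identity_line: str) -> dict:
    """Extract score, E-value, and percent identity from two HSP summary lines."""
    score = SCORE_RE.search(score_line)
    identity = IDENTITY_RE.search(identity_line)
    if score is None or identity is None:
        raise ValueError("line does not match the expected HSP summary layout")
    matches, aligned = int(identity.group(1)), int(identity.group(2))
    return {
        "bits": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(score.group(3)),
        "identical": matches,
        "aligned_length": aligned,
        # Percent identity is recomputed from the counts, not read from the text.
        "identity_pct": round(100 * matches / aligned, 2),
    }


if __name__ == "__main__":
    hit = parse_hsp(
        "HSP 1 Score: 234 bits (596), Expect = 3.17e-77",
        "Identity = 117/124 (94.35%)",
    )
    print(hit)
```

Recomputing the percentage from the identical/aligned counts gives a simple cross-check on the value printed in parentheses; note that the aligned length (for example 126 or 137) can exceed the 124-residue query when the alignment contains gaps.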
The following BLAST results are available for this feature:
BLAST of Cp4.1LG10g09660 vs. NCBI nr
Match Name | E-value | Identity (%) | Description
XP_023544016.1 | 2.06e-82 | 100.00 | uncharacterized protein LOC111803724 [Cucurbita pepo subsp. pepo]
XP_022977330.1 | 3.17e-77 | 94.35 | uncharacterized protein LOC111477681 [Cucurbita maxima]
KAG7033944.1 | 6.39e-77 | 94.35 | hypothetical protein SDJN02_03670, partial [Cucurbita argyrosperma subsp. argyro...
KAG6603773.1 | 2.60e-76 | 93.55 | hypothetical protein SDJN03_04382, partial [Cucurbita argyrosperma subsp. sorori...
XP_008441100.1 | 4.06e-46 | 69.05 | PREDICTED: uncharacterized protein LOC103485326 [Cucumis melo] >TYK13052.1 hypot...
BLAST of Cp4.1LG10g09660 vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1IM02 | 1.53e-77 | 94.35 | uncharacterized protein LOC111477681 OS=Cucurbita maxima OX=3661 GN=LOC111477681...
A0A5D3CNX5 | 1.97e-46 | 69.05 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3B2N5 | 1.97e-46 | 69.05 | uncharacterized protein LOC103485326 OS=Cucumis melo OX=3656 GN=LOC103485326 PE=...
A0A0A0KHU7 | 5.69e-43 | 66.94 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G514300 PE=4 SV=1
A0A059CAC2 | 3.75e-12 | 36.00 | Uncharacterized protein OS=Eucalyptus grandis OX=71139 GN=EUGRSUZ_E04063 PE=4 SV...
BLAST of Cp4.1LG10g09660 vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT4G21920.1 | 6.4e-05 | 26.28 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-termin...
AT3G20340.1 | 3.2e-04 | 34.25 | Expression of the gene is downregulated in the presence of paraquat, an inducer ...