Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCCGTTCCCCTCGACCAGCTCCAACCCCCGCCCTCTGCCGCCGCTATCCACCCTGCCCAGGGCTCCGTCGGCCCCGTCATCGCCGTCCTTGCCGTGATTTCCATCCTCGGCTTCATTGCCGGCATGATCGGCCGCGTCTGCTTTGGCCGTCCCGTTTTTGGCTACAACGCCCACTACGACGTCGAAGATTGGGTCGAGAAGAAATGTGCTTCCTGCCTCGATGGGTCTCTAGACCCTCCCCCGCATCTCCGCCCCCCTCCACCAATCGAAGCTATTCCGGTGGCGGAGCCGCTTGGTGGGCCGCCGAACATCAAGGAAGGTGGCGATGGCGATGGCGACAGGGAACGCGAGAATTTGCAGTCGGTGCCTCCCGGCAGTGGCGGTGAGTCGTGA
mRNA sequence
ATGTCCGTTCCCCTCGACCAGCTCCAACCCCCGCCCTCTGCCGCCGCTATCCACCCTGCCCAGGGCTCCGTCGGCCCCGTCATCGCCGTCCTTGCCGTGATTTCCATCCTCGGCTTCATTGCCGGCATGATCGGCCGCGTCTGCTTTGGCCGTCCCGTTTTTGGCTACAACGCCCACTACGACGTCGAAGATTGGGTCGAGAAGAAATGTGCTTCCTGCCTCGATGGGTCTCTAGACCCTCCCCCGCATCTCCGCCCCCCTCCACCAATCGAAGCTATTCCGGTGGCGGAGCCGCTTGGTGGGCCGCCGAACATCAAGGAAGGTGGCGATGGCGATGGCGACAGGGAACGCGAGAATTTGCAGTCGGTGCCTCCCGGCAGTGGCGGTGAGTCGTGA
Coding sequence (CDS)
ATGTCCGTTCCCCTCGACCAGCTCCAACCCCCGCCCTCTGCCGCCGCTATCCACCCTGCCCAGGGCTCCGTCGGCCCCGTCATCGCCGTCCTTGCCGTGATTTCCATCCTCGGCTTCATTGCCGGCATGATCGGCCGCGTCTGCTTTGGCCGTCCCGTTTTTGGCTACAACGCCCACTACGACGTCGAAGATTGGGTCGAGAAGAAATGTGCTTCCTGCCTCGATGGGTCTCTAGACCCTCCCCCGCATCTCCGCCCCCCTCCACCAATCGAAGCTATTCCGGTGGCGGAGCCGCTTGGTGGGCCGCCGAACATCAAGGAAGGTGGCGATGGCGATGGCGACAGGGAACGCGAGAATTTGCAGTCGGTGCCTCCCGGCAGTGGCGGTGAGTCGTGA
Protein sequence
MSVPLDQLQPPPSAAAIHPAQGSVGPVIAVLAVISILGFIAGMIGRVCFGRPVFGYNAHYDVEDWVEKKCASCLDGSLDPPPHLRPPPPIEAIPVAEPLGGPPNIKEGGDGDGDRERENLQSVPPGSGGES
Homology
BLAST of Cp4.1LG04g08150 vs. NCBI nr
Match:
XP_023530142.1 (uncharacterized protein LOC111792788 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 261 bits (668), Expect = 5.19e-88
Identity = 129/129 (100.00%), Postives = 129/129 (100.00%), Query Frame = 0
Query: 1 MSVPLDQLQPPPSAAAIHPAQGSVGPVIAVLAVISILGFIAGMIGRVCFGRPVFGYNAHY 60
MSVPLDQLQPPPSAAAIHPAQGSVGPVIAVLAVISILGFIAGMIGRVCFGRPVFGYNAHY
Sbjct: 1 MSVPLDQLQPPPSAAAIHPAQGSVGPVIAVLAVISILGFIAGMIGRVCFGRPVFGYNAHY 60
Query: 61 DVEDWVEKKCASCLDGSLDPPPHLRPPPPIEAIPVAEPLGGPPNIKEGGDGDGDRERENL 120
DVEDWVEKKCASCLDGSLDPPPHLRPPPPIEAIPVAEPLGGPPNIKEGGDGDGDRERENL
Sbjct: 61 DVEDWVEKKCASCLDGSLDPPPHLRPPPPIEAIPVAEPLGGPPNIKEGGDGDGDRERENL 120
Query: 121 QSVPPGSGG 129
QSVPPGSGG
Sbjct: 121 QSVPPGSGG 129
BLAST of Cp4.1LG04g08150 vs. NCBI nr
Match:
XP_022927713.1 (uncharacterized protein LOC111434531 [Cucurbita moschata])
HSP 1 Score: 248 bits (634), Expect = 7.97e-83
Identity = 126/131 (96.18%), Postives = 127/131 (96.95%), Query Frame = 0
Query: 1 MSVPLDQLQPPPSAAAIHPAQGSVGPVIAVLAVISILGFIAGMIGRVCFGRPVFGYNAHY 60
MSVPLDQLQPPPSAAAIHPAQGSVGPVIAVLAVISILGFIAGMIGRVCFGRPVFGYNA Y
Sbjct: 1 MSVPLDQLQPPPSAAAIHPAQGSVGPVIAVLAVISILGFIAGMIGRVCFGRPVFGYNAQY 60
Query: 61 DVEDWVEKKCASCLDGSLDPPPHLRPPPPIEAIPVAEPLGGPPNIKEGGDGDGDRERENL 120
DVEDWVEKKCASCLDGSLDPPPHLRPPPPIEAIPV+EPLGGPPNIKEGGDGD RERENL
Sbjct: 61 DVEDWVEKKCASCLDGSLDPPPHLRPPPPIEAIPVSEPLGGPPNIKEGGDGD--RERENL 120
Query: 121 QSVPPGSGGES 131
QSV PGSGGES
Sbjct: 121 QSVAPGSGGES 129
BLAST of Cp4.1LG04g08150 vs. NCBI nr
Match:
XP_022989510.1 (uncharacterized protein LOC111486567 [Cucurbita maxima])
HSP 1 Score: 248 bits (633), Expect = 1.13e-82
Identity = 125/131 (95.42%), Postives = 128/131 (97.71%), Query Frame = 0
Query: 1 MSVPLDQLQPPPSAAAIHPAQGSVGPVIAVLAVISILGFIAGMIGRVCFGRPVFGYNAHY 60
MSVPLDQLQPPP+AAAIHPAQGSVGPVIAVLAVISILGFIAGMIGRVCFGRPVFGYNAHY
Sbjct: 1 MSVPLDQLQPPPAAAAIHPAQGSVGPVIAVLAVISILGFIAGMIGRVCFGRPVFGYNAHY 60
Query: 61 DVEDWVEKKCASCLDGSLDPPPHLRPPPPIEAIPVAEPLGGPPNIKEGGDGDGDRERENL 120
DVEDWVEKKCA+CLDGSLDPPP LRPPPPIEAIPVAEPLGGPPNIKE DGDGD+ERENL
Sbjct: 61 DVEDWVEKKCATCLDGSLDPPPLLRPPPPIEAIPVAEPLGGPPNIKE--DGDGDKERENL 120
Query: 121 QSVPPGSGGES 131
QSVPPGSGGES
Sbjct: 121 QSVPPGSGGES 129
BLAST of Cp4.1LG04g08150 vs. NCBI nr
Match:
KAG6588645.1 (hypothetical protein SDJN03_17210, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 246 bits (628), Expect = 6.55e-82
Identity = 124/131 (94.66%), Postives = 127/131 (96.95%), Query Frame = 0
Query: 1 MSVPLDQLQPPPSAAAIHPAQGSVGPVIAVLAVISILGFIAGMIGRVCFGRPVFGYNAHY 60
MSVP+DQLQPPPSAAAIHPAQGSVGPVIAVLAVISILGFIAGMIGRVCFGRPVFGYNAHY
Sbjct: 1 MSVPVDQLQPPPSAAAIHPAQGSVGPVIAVLAVISILGFIAGMIGRVCFGRPVFGYNAHY 60
Query: 61 DVEDWVEKKCASCLDGSLDPPPHLRPPPPIEAIPVAEPLGGPPNIKEGGDGDGDRERENL 120
DVEDWVEKKCA+CLDGSLDPPPHLRPPPPIEAIPV+EPLG PPNIKEGGDGD RERENL
Sbjct: 61 DVEDWVEKKCATCLDGSLDPPPHLRPPPPIEAIPVSEPLGEPPNIKEGGDGD--RERENL 120
Query: 121 QSVPPGSGGES 131
QSV PGSGGES
Sbjct: 121 QSVAPGSGGES 129
BLAST of Cp4.1LG04g08150 vs. NCBI nr
Match:
XP_038887147.1 (uncharacterized protein LOC120077337 [Benincasa hispida])
HSP 1 Score: 191 bits (484), Expect = 7.62e-60
Identity = 102/140 (72.86%), Postives = 113/140 (80.71%), Query Frame = 0
Query: 1 MSVPLDQLQPPPSAAAIHPAQGSVGPVIAVLAVISILGFIAGMIGRVCFGRPVFGYNAHY 60
MS PLDQLQPPPS +H A SVGP+IAVLAVISILG IAGMIGR+C GRPVFGY AHY
Sbjct: 1 MSTPLDQLQPPPS---LHSAHASVGPLIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHY 60
Query: 61 DVEDWVEKKCASCLDGSLDPPP----HLRPPPPIEAIPVAEPLGGPP-NIKEGGDGDG-- 120
DVEDW+EKKCASCLDGSLDPPP HLR PP ++++PVAEPLGGPP IK+G D D
Sbjct: 61 DVEDWIEKKCASCLDGSLDPPPPPPPHLRRPPLLDSVPVAEPLGGPPPEIKQGADADAVV 120
Query: 121 --DRERENLQSVPPGSGGES 131
D +RENLQS PPG+GGES
Sbjct: 121 DTDVKRENLQSAPPGTGGES 137
BLAST of Cp4.1LG04g08150 vs. ExPASy TrEMBL
Match:
A0A6J1ELS6 (uncharacterized protein LOC111434531 OS=Cucurbita moschata OX=3662 GN=LOC111434531 PE=4 SV=1)
HSP 1 Score: 248 bits (634), Expect = 3.86e-83
Identity = 126/131 (96.18%), Postives = 127/131 (96.95%), Query Frame = 0
Query: 1 MSVPLDQLQPPPSAAAIHPAQGSVGPVIAVLAVISILGFIAGMIGRVCFGRPVFGYNAHY 60
MSVPLDQLQPPPSAAAIHPAQGSVGPVIAVLAVISILGFIAGMIGRVCFGRPVFGYNA Y
Sbjct: 1 MSVPLDQLQPPPSAAAIHPAQGSVGPVIAVLAVISILGFIAGMIGRVCFGRPVFGYNAQY 60
Query: 61 DVEDWVEKKCASCLDGSLDPPPHLRPPPPIEAIPVAEPLGGPPNIKEGGDGDGDRERENL 120
DVEDWVEKKCASCLDGSLDPPPHLRPPPPIEAIPV+EPLGGPPNIKEGGDGD RERENL
Sbjct: 61 DVEDWVEKKCASCLDGSLDPPPHLRPPPPIEAIPVSEPLGGPPNIKEGGDGD--RERENL 120
Query: 121 QSVPPGSGGES 131
QSV PGSGGES
Sbjct: 121 QSVAPGSGGES 129
BLAST of Cp4.1LG04g08150 vs. ExPASy TrEMBL
Match:
A0A6J1JMK0 (uncharacterized protein LOC111486567 OS=Cucurbita maxima OX=3661 GN=LOC111486567 PE=4 SV=1)
HSP 1 Score: 248 bits (633), Expect = 5.48e-83
Identity = 125/131 (95.42%), Postives = 128/131 (97.71%), Query Frame = 0
Query: 1 MSVPLDQLQPPPSAAAIHPAQGSVGPVIAVLAVISILGFIAGMIGRVCFGRPVFGYNAHY 60
MSVPLDQLQPPP+AAAIHPAQGSVGPVIAVLAVISILGFIAGMIGRVCFGRPVFGYNAHY
Sbjct: 1 MSVPLDQLQPPPAAAAIHPAQGSVGPVIAVLAVISILGFIAGMIGRVCFGRPVFGYNAHY 60
Query: 61 DVEDWVEKKCASCLDGSLDPPPHLRPPPPIEAIPVAEPLGGPPNIKEGGDGDGDRERENL 120
DVEDWVEKKCA+CLDGSLDPPP LRPPPPIEAIPVAEPLGGPPNIKE DGDGD+ERENL
Sbjct: 61 DVEDWVEKKCATCLDGSLDPPPLLRPPPPIEAIPVAEPLGGPPNIKE--DGDGDKERENL 120
Query: 121 QSVPPGSGGES 131
QSVPPGSGGES
Sbjct: 121 QSVPPGSGGES 129
BLAST of Cp4.1LG04g08150 vs. ExPASy TrEMBL
Match:
A0A6J1IBA8 (uncharacterized protein LOC111470940 OS=Cucurbita maxima OX=3661 GN=LOC111470940 PE=4 SV=1)
HSP 1 Score: 188 bits (478), Expect = 2.75e-59
Identity = 97/132 (73.48%), Postives = 107/132 (81.06%), Query Frame = 0
Query: 1 MSVPLDQLQPPPSAAAIHPAQGSVGPVIAVLAVISILGFIAGMIGRVCFGRPVFGYNAHY 60
MS P DQLQPPP+A H GSVGPVIAVLAVISILG IAG+IGR+C GRPVFGY AHY
Sbjct: 1 MSTPFDQLQPPPTA---HSGHGSVGPVIAVLAVISILGVIAGIIGRLCSGRPVFGYGAHY 60
Query: 61 DVEDWVEKKCASCLDGSLDPPP---HLRPPPPIEAIPVAEPLGGPPNIKEGGDGDGDRER 120
DVE+WVEKKCASCLDGSLDPPP HLR PPP++A+PV EPLGGPP IK+G D +R
Sbjct: 61 DVEEWVEKKCASCLDGSLDPPPPPAHLRHPPPLDAVPVVEPLGGPPEIKQGAD----EKR 120
Query: 121 ENLQSVPPGSGG 129
ENLQS PG+GG
Sbjct: 121 ENLQSAAPGTGG 125
BLAST of Cp4.1LG04g08150 vs. ExPASy TrEMBL
Match:
A0A6J1GL66 (uncharacterized protein LOC111455383 OS=Cucurbita moschata OX=3662 GN=LOC111455383 PE=4 SV=1)
HSP 1 Score: 187 bits (475), Expect = 8.12e-59
Identity = 97/133 (72.93%), Postives = 108/133 (81.20%), Query Frame = 0
Query: 1 MSVPLDQLQPPPSAAAIHPAQGSVGPVIAVLAVISILGFIAGMIGRVCFGRPVFGYNAHY 60
MS P+DQLQPPP+A H GSVGPVIAVLAVISILG IAG+IGR+C GRPVFGY AHY
Sbjct: 1 MSTPVDQLQPPPTA---HSGYGSVGPVIAVLAVISILGVIAGIIGRLCSGRPVFGYGAHY 60
Query: 61 DVEDWVEKKCASCLDGSLDPPP----HLRPPPPIEAIPVAEPLGGPPNIKEGGDGDGDRE 120
DVE+WVEKKCASCLDGSLDPPP HLR PPP++A+PV EPLGGPP IK+G D +
Sbjct: 61 DVEEWVEKKCASCLDGSLDPPPPPPPHLRHPPPLDAVPVVEPLGGPPEIKQGADD----K 120
Query: 121 RENLQSVPPGSGG 129
RENLQS PG+GG
Sbjct: 121 RENLQSAAPGTGG 126
BLAST of Cp4.1LG04g08150 vs. ExPASy TrEMBL
Match:
A0A0A0K6N4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G074850 PE=4 SV=1)
HSP 1 Score: 186 bits (472), Expect = 2.11e-58
Identity = 96/135 (71.11%), Postives = 109/135 (80.74%), Query Frame = 0
Query: 1 MSVPLDQLQPPPSAAAIHPAQGSVGPVIAVLAVISILGFIAGMIGRVCFGRPVFGYNAHY 60
MS P+DQLQPPP +H + SVGP+IAVLAVISILG IAGMIGR+C GRPVFGY AHY
Sbjct: 1 MSTPIDQLQPPPP---LHSSHASVGPLIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHY 60
Query: 61 DVEDWVEKKCASCLDGSLDPPP---HLRPPPPIEAIPVAEPLGGPP-NIKEGGDGDGDRE 120
D+EDWVEKKCASCLDGSLDPPP HLR PPP++++PVAEPLGGPP IK+ D D +
Sbjct: 61 DLEDWVEKKCASCLDGSLDPPPPPPHLRHPPPLDSVPVAEPLGGPPPEIKQSAHADADAK 120
Query: 121 RENLQSVPPGSGGES 131
ENLQS PG+GGES
Sbjct: 121 GENLQSAAPGTGGES 132
BLAST of Cp4.1LG04g08150 vs. TAIR 10
Match:
AT2G26520.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G57500.1); Has 51 Blast hits to 51 proteins in 11 species: Archae - 0; Bacteria - 1; Metazoa - 0; Fungi - 0; Plants - 50; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 66.6 bits (161), Expect = 1.7e-11
Identity = 46/136 (33.82%), Postives = 67/136 (49.26%), Query Frame = 0
Query: 2 SVPLDQLQPPP-------SAAAIHPAQGSVGPVIAVLAVISILGFIAGMIGRVCFGRPVF 61
S+PL QPPP S++ ++GP IAV V+++L +A +IGR+C G+ +
Sbjct: 3 SMPL-YWQPPPATEVSQDSSSVSSAGNSTIGPFIAVFIVVTVLCVLASVIGRLCSGKTIL 62
Query: 62 GYNAHYDVEDWVEKKCASCLDGSLDPPPHLRPPPPIEAIPVAEPLGGPPNIKEGGDGDGD 121
GY YD+E W E +C SC+DG + P P P P+ G EG D D
Sbjct: 63 GY-GDYDMERWAESRCGSCIDGHIHPHRPSPSPTPPPRQPLHHTSSGVSAESEGHVADLD 122
Query: 122 RE-----RENLQSVPP 126
E +++L PP
Sbjct: 123 HETDGEKQDSLDHEPP 136
BLAST of Cp4.1LG04g08150 vs. TAIR 10
Match:
AT3G57500.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G26520.1); Has 51 Blast hits to 51 proteins in 11 species: Archae - 0; Bacteria - 1; Metazoa - 0; Fungi - 0; Plants - 50; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 58.9 bits (141), Expect = 3.5e-09
Identity = 39/110 (35.45%), Postives = 56/110 (50.91%), Query Frame = 0
Query: 6 DQLQPPPSAAAIHPA----------QGSVGPVIAVLAVISILGFIAGMIGRVCFGRPVFG 65
DQL P+ I P S+ ++ VLAVI+IL +AG+ R+C GR +
Sbjct: 10 DQLSTTPTTIYIDPPSQDQPSHNSDHRSIETLVVVLAVITILSVLAGVFARLCGGRHL-S 69
Query: 66 YNAHYDVEDWVEKKCASCLDGSL-------DPPPHLRPPPPIEAIPVAEP 99
+ +D+E WVE+KC SC+D + PPP PPPP A ++P
Sbjct: 70 HGGDHDIEGWVERKCRSCIDAGIPAVSAAPSPPP---PPPPATAEERSKP 115
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_023530142.1 | 5.19e-88 | 100.00 | uncharacterized protein LOC111792788 [Cucurbita pepo subsp. pepo] | [more] |
XP_022927713.1 | 7.97e-83 | 96.18 | uncharacterized protein LOC111434531 [Cucurbita moschata] | [more] |
XP_022989510.1 | 1.13e-82 | 95.42 | uncharacterized protein LOC111486567 [Cucurbita maxima] | [more] |
KAG6588645.1 | 6.55e-82 | 94.66 | hypothetical protein SDJN03_17210, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_038887147.1 | 7.62e-60 | 72.86 | uncharacterized protein LOC120077337 [Benincasa hispida] | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1ELS6 | 3.86e-83 | 96.18 | uncharacterized protein LOC111434531 OS=Cucurbita moschata OX=3662 GN=LOC1114345... | [more] |
A0A6J1JMK0 | 5.48e-83 | 95.42 | uncharacterized protein LOC111486567 OS=Cucurbita maxima OX=3661 GN=LOC111486567... | [more] |
A0A6J1IBA8 | 2.75e-59 | 73.48 | uncharacterized protein LOC111470940 OS=Cucurbita maxima OX=3661 GN=LOC111470940... | [more] |
A0A6J1GL66 | 8.12e-59 | 72.93 | uncharacterized protein LOC111455383 OS=Cucurbita moschata OX=3662 GN=LOC1114553... | [more] |
A0A0A0K6N4 | 2.11e-58 | 71.11 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G074850 PE=4 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
AT2G26520.1 | 1.7e-11 | 33.82 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |
AT3G57500.1 | 3.5e-09 | 35.45 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |