Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGAAGTGGAGCCGATTCGACCGGCAGTGAAGAAGAAGCTGTGGAACGTGTTGCGAGCGGTTGTGTTCATGTTGAGGAAAGGCGTACGGACATTCGATCTTCATTTGATACTAAAACGAAGCAAAATCGCAGGAAAAGCGATTGGAAATCTGGTCGAATTTCATCACGGATCTGCTTTTAGTTGCCAAACGATTGATATCGCCAATTCCTACATCTCAACTCGCGATTACGAATTCAGTTGCAGTAACAGTCCTGCAGCATATCCGTTCCGTTACTTCAACAAACTCCGAAAGCACCGAAACCACCAATATTTCCCCAAATCTTACAGTCACGACGATTTCTCCACCGTCGCCGCCGTCACGAGAGTTCTGGATATTCTCCAAACCGATCAGAACTCCGAGAAGTCGCCATTAGTGCCGCTGCCTGGATTCGGAAAGAGTCCGCTAGTTGTGCGGCAGTTGCGCGTTACGGATTCGCCGTTCTCTTTGAAGGACGACGGCGATAGTCAGTTCGTCGATAAAGCCGCCGAGGAATTCATCAAGAAGTTCTACAAGGATCTAAGGCTAGAGCGAAGTTTTGCAGCTTGTGAATCGCCGTACTGGAATACGCTTTGCCGATGA
mRNA sequence
ATGGAAGTGGAGCCGATTCGACCGGCAGTGAAGAAGAAGCTGTGGAACGTGTTGCGAGCGGTTGTGTTCATGTTGAGGAAAGGCGTACGGACATTCGATCTTCATTTGATACTAAAACGAAGCAAAATCGCAGGAAAAGCGATTGGAAATCTGGTCGAATTTCATCACGGATCTGCTTTTAGTTGCCAAACGATTGATATCGCCAATTCCTACATCTCAACTCGCGATTACGAATTCAGTTGCAGTAACAGTCCTGCAGCATATCCGTTCCGTTACTTCAACAAACTCCGAAAGCACCGAAACCACCAATATTTCCCCAAATCTTACAGTCACGACGATTTCTCCACCGTCGCCGCCGTCACGAGAGTTCTGGATATTCTCCAAACCGATCAGAACTCCGAGAAGTCGCCATTAGTGCCGCTGCCTGGATTCGGAAAGAGTCCGCTAGTTGTGCGGCAGTTGCGCGTTACGGATTCGCCGTTCTCTTTGAAGGACGACGGCGATAGTCAGTTCGTCGATAAAGCCGCCGAGGAATTCATCAAGAAGTTCTACAAGGATCTAAGGCTAGAGCGAAGTTTTGCAGCTTGTGAATCGCCGTACTGGAATACGCTTTGCCGATGA
Coding sequence (CDS)
ATGGAAGTGGAGCCGATTCGACCGGCAGTGAAGAAGAAGCTGTGGAACGTGTTGCGAGCGGTTGTGTTCATGTTGAGGAAAGGCGTACGGACATTCGATCTTCATTTGATACTAAAACGAAGCAAAATCGCAGGAAAAGCGATTGGAAATCTGGTCGAATTTCATCACGGATCTGCTTTTAGTTGCCAAACGATTGATATCGCCAATTCCTACATCTCAACTCGCGATTACGAATTCAGTTGCAGTAACAGTCCTGCAGCATATCCGTTCCGTTACTTCAACAAACTCCGAAAGCACCGAAACCACCAATATTTCCCCAAATCTTACAGTCACGACGATTTCTCCACCGTCGCCGCCGTCACGAGAGTTCTGGATATTCTCCAAACCGATCAGAACTCCGAGAAGTCGCCATTAGTGCCGCTGCCTGGATTCGGAAAGAGTCCGCTAGTTGTGCGGCAGTTGCGCGTTACGGATTCGCCGTTCTCTTTGAAGGACGACGGCGATAGTCAGTTCGTCGATAAAGCCGCCGAGGAATTCATCAAGAAGTTCTACAAGGATCTAAGGCTAGAGCGAAGTTTTGCAGCTTGTGAATCGCCGTACTGGAATACGCTTTGCCGATGA
Protein sequence
MEVEPIRPAVKKKLWNVLRAVVFMLRKGVRTFDLHLILKRSKIAGKAIGNLVEFHHGSAFSCQTIDIANSYISTRDYEFSCSNSPAAYPFRYFNKLRKHRNHQYFPKSYSHDDFSTVAAVTRVLDILQTDQNSEKSPLVPLPGFGKSPLVVRQLRVTDSPFSLKDDGDSQFVDKAAEEFIKKFYKDLRLERSFAACESPYWNTLCR
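The protein sequence is expected to be the in-frame translation of the CDS. A minimal sketch of that check follows, assuming the standard nuclear genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code, laid out in the conventional T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        # Codons with ambiguous bases are reported as X (an assumption of this sketch).
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGGAAGTGGAGCCGATTCGACCG"))  # MEVEPIRP
```

Passing the full CDS string and comparing the result with the protein sequence confirms whether the two records agree.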
Homology
BLAST of CmoCh12G013020 vs. ExPASy TrEMBL
Match:
A0A6J1FIE3 (uncharacterized protein LOC111444217 OS=Cucurbita moschata OX=3662 GN=LOC111444217 PE=4 SV=1)
HSP 1 Score: 419.9 bits (1078), Expect = 6.3e-114
Identity = 206/206 (100.00%), Positives = 206/206 (100.00%), Query Frame = 0
Query: 1 MEVEPIRPAVKKKLWNVLRAVVFMLRKGVRTFDLHLILKRSKIAGKAIGNLVEFHHGSAF 60
MEVEPIRPAVKKKLWNVLRAVVFMLRKGVRTFDLHLILKRSKIAGKAIGNLVEFHHGSAF
Sbjct: 1 MEVEPIRPAVKKKLWNVLRAVVFMLRKGVRTFDLHLILKRSKIAGKAIGNLVEFHHGSAF 60
Query: 61 SCQTIDIANSYISTRDYEFSCSNSPAAYPFRYFNKLRKHRNHQYFPKSYSHDDFSTVAAV 120
SCQTIDIANSYISTRDYEFSCSNSPAAYPFRYFNKLRKHRNHQYFPKSYSHDDFSTVAAV
Sbjct: 61 SCQTIDIANSYISTRDYEFSCSNSPAAYPFRYFNKLRKHRNHQYFPKSYSHDDFSTVAAV 120
Query: 121 TRVLDILQTDQNSEKSPLVPLPGFGKSPLVVRQLRVTDSPFSLKDDGDSQFVDKAAEEFI 180
TRVLDILQTDQNSEKSPLVPLPGFGKSPLVVRQLRVTDSPFSLKDDGDSQFVDKAAEEFI
Sbjct: 121 TRVLDILQTDQNSEKSPLVPLPGFGKSPLVVRQLRVTDSPFSLKDDGDSQFVDKAAEEFI 180
Query: 181 KKFYKDLRLERSFAACESPYWNTLCR 207
KKFYKDLRLERSFAACESPYWNTLCR
Sbjct: 181 KKFYKDLRLERSFAACESPYWNTLCR 206
BLAST of CmoCh12G013020 vs. ExPASy TrEMBL
Match:
A0A6J1HSA9 (uncharacterized protein LOC111465667 OS=Cucurbita maxima OX=3661 GN=LOC111465667 PE=4 SV=1)
HSP 1 Score: 412.1 bits (1058), Expect = 1.3e-111
Identity = 203/206 (98.54%), Positives = 204/206 (99.03%), Query Frame = 0
Query: 1 MEVEPIRPAVKKKLWNVLRAVVFMLRKGVRTFDLHLILKRSKIAGKAIGNLVEFHHGSAF 60
MEVEPIRPAVKKKLWNVLRAVVFMLRKGVRTFDLHLILKRSKIAGKAIGNL+EFHHGSAF
Sbjct: 1 MEVEPIRPAVKKKLWNVLRAVVFMLRKGVRTFDLHLILKRSKIAGKAIGNLIEFHHGSAF 60
Query: 61 SCQTIDIANSYISTRDYEFSCSNSPAAYPFRYFNKLRKHRNHQYFPKSYSHDDFSTVAAV 120
SCQTIDIANSYISTRDYEFSCSNSPAAYPFRYFNKLRKHRNHQYFPKSY HDDFSTVAAV
Sbjct: 61 SCQTIDIANSYISTRDYEFSCSNSPAAYPFRYFNKLRKHRNHQYFPKSYRHDDFSTVAAV 120
Query: 121 TRVLDILQTDQNSEKSPLVPLPGFGKSPLVVRQLRVTDSPFSLKDDGDSQFVDKAAEEFI 180
TRVLDILQTDQNSEKSPLVPLPGFGKSPLVVRQLRVTDSPFSLKDDGDSQFVDKAAEEFI
Sbjct: 121 TRVLDILQTDQNSEKSPLVPLPGFGKSPLVVRQLRVTDSPFSLKDDGDSQFVDKAAEEFI 180
Query: 181 KKFYKDLRLERSFAACESPYWNTLCR 207
KKFYKDLRLERSFAACESPY NTLCR
Sbjct: 181 KKFYKDLRLERSFAACESPYRNTLCR 206
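The percentages in each HSP statistics line are the match counts divided by the alignment length (which includes gap columns). A small sketch of that arithmetic, with a hypothetical helper name `percent`:

```python
def percent(matches: int, aligned_length: int) -> str:
    """Return matches / aligned_length as a percentage with two decimals."""
    if aligned_length <= 0:
        raise ValueError("alignment length must be positive")
    return f"{100.0 * matches / aligned_length:.2f}"


# Counts taken from the Cucurbita maxima hit A0A6J1HSA9 above.
print(percent(203, 206))  # 98.54 (identical residues)
print(percent(204, 206))  # 99.03 (identical or conservatively substituted residues)
```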
BLAST of CmoCh12G013020 vs. ExPASy TrEMBL
Match:
A0A0A0LHW0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G000960 PE=4 SV=1)
HSP 1 Score: 351.7 bits (901), Expect = 2.1e-93
Identity = 184/212 (86.79%), Positives = 188/212 (88.68%), Query Frame = 0
Query: 1 MEVEPIRPAVKKKLWNVLRAVVFMLRKGVR----TFDLHLILKRSKIAGKAIGNLVEFHH 60
ME+EPIRPAVKKKLWNVLRAVVFMLRKG+ TFDLHL+LKRSKIAGKAI NLVEFHH
Sbjct: 1 MEMEPIRPAVKKKLWNVLRAVVFMLRKGLSKSKITFDLHLMLKRSKIAGKAIANLVEFHH 60
Query: 61 GSAFSCQTIDIANSYISTRDYEFSCSNSPA--AYPFRYFNKLRKHRNHQYFPKSYSHDDF 120
GSAFSCQTIDIANSYISTRDYEFSCSNSPA AYPFRYFNK K R YFPKSY +DDF
Sbjct: 61 GSAFSCQTIDIANSYISTRDYEFSCSNSPANTAYPFRYFNK--KLRKQHYFPKSYRYDDF 120
Query: 121 STVAAVTRVLDILQTDQNSEKSPLVPLPGFGKSPLVVRQLRVTDSPFSLKDDGDSQFVDK 180
STV AV RVLDIL TDQ SE SPLVPLPGFGKSPLVVRQLRVTDSPFSLKDDGDSQFVDK
Sbjct: 121 STVTAVQRVLDILHTDQKSEASPLVPLPGFGKSPLVVRQLRVTDSPFSLKDDGDSQFVDK 180
Query: 181 AAEEFIKKFYKDLRLERSFAACESPYWNTLCR 207
AAEEFIKKFY DLRLERS AA ESPY NTLCR
Sbjct: 181 AAEEFIKKFYTDLRLERSLAAFESPYRNTLCR 210
BLAST of CmoCh12G013020 vs. ExPASy TrEMBL
Match:
A0A6J1KI86 (uncharacterized protein LOC111493479 OS=Cucurbita maxima OX=3661 GN=LOC111493479 PE=4 SV=1)
HSP 1 Score: 350.1 bits (897), Expect = 6.2e-93
Identity = 177/210 (84.29%), Positives = 187/210 (89.05%), Query Frame = 0
Query: 1 MEVEPIRPAVKKKLWNVLRAVVFMLRKGVR----TFDLHLILKRSKIAGKAIGNLVEFHH 60
ME+EPIRPAVKKKLWNVLRAVVFMLRKG+ FDLHL+LKRSK+AGKA+ NLVEFHH
Sbjct: 1 MEIEPIRPAVKKKLWNVLRAVVFMLRKGLNKSKIVFDLHLMLKRSKLAGKAMANLVEFHH 60
Query: 61 GSAFSCQTIDIANSYISTRDYEFSCSNSPAAYPFRYFNKLRKHRNHQYFPKSYSHDDFST 120
GSAFSCQTIDIANSYISTRDYEFSCSNSP AYPFRYFNK RKH+N YFPKSY +DDFST
Sbjct: 61 GSAFSCQTIDIANSYISTRDYEFSCSNSP-AYPFRYFNKHRKHQNRHYFPKSYRYDDFST 120
Query: 121 VAAVTRVLDILQTDQNSEKSPLVPLPGFGKSPLVVRQLRVTDSPFSLKDDGDSQFVDKAA 180
V AV RVLDIL +DQ SE SPLVPLPGFGKSPLVVRQLRVTDSPFSLKDD DSQ VDKAA
Sbjct: 121 VTAVQRVLDILHSDQKSEASPLVPLPGFGKSPLVVRQLRVTDSPFSLKDDSDSQIVDKAA 180
Query: 181 EEFIKKFYKDLRLERSFAACESPYWNTLCR 207
EEFIKKFY DLRLE+S AA ESP+ NTLCR
Sbjct: 181 EEFIKKFYTDLRLEKSLAAYESPFRNTLCR 209
BLAST of CmoCh12G013020 vs. ExPASy TrEMBL
Match:
A0A5A7V077 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold332G00510 PE=4 SV=1)
HSP 1 Score: 349.4 bits (895), Expect = 1.1e-92
Identity = 183/212 (86.32%), Positives = 187/212 (88.21%), Query Frame = 0
Query: 1 MEVEPIRPAVKKKLWNVLRAVVFMLRKGVR----TFDLHLILKRSKIAGKAIGNLVEFHH 60
ME+EPIRPAVKKKLWNVLRAVVFMLRKG+ TFDLHL+LKRSKIAGKAI NLVEFHH
Sbjct: 1 MEMEPIRPAVKKKLWNVLRAVVFMLRKGLSKSKITFDLHLMLKRSKIAGKAIANLVEFHH 60
Query: 61 GSAFSCQTIDIANSYISTRDYEFSCSNSPA--AYPFRYFNKLRKHRNHQYFPKSYSHDDF 120
GSAFSCQTIDIANSYISTRDYEFSCSNSPA AYPFRYFNK K R YFPKSY +DDF
Sbjct: 61 GSAFSCQTIDIANSYISTRDYEFSCSNSPANTAYPFRYFNK--KLRKQHYFPKSYRYDDF 120
Query: 121 STVAAVTRVLDILQTDQNSEKSPLVPLPGFGKSPLVVRQLRVTDSPFSLKDDGDSQFVDK 180
STV AV RVLDIL TDQ SE SPLVPLPGFGKSPLVVRQLRVTDSPFSLKDDGDSQ VDK
Sbjct: 121 STVTAVQRVLDILHTDQKSEASPLVPLPGFGKSPLVVRQLRVTDSPFSLKDDGDSQLVDK 180
Query: 181 AAEEFIKKFYKDLRLERSFAACESPYWNTLCR 207
AAEEFIKKFY DLRLERS AA ESPY NTLCR
Sbjct: 181 AAEEFIKKFYTDLRLERSLAAFESPYRNTLCR 210
BLAST of CmoCh12G013020 vs. TAIR 10
Match:
AT1G52140.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G16330.1); Has 114 Blast hits to 114 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 114; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 98.6 bits (244), Expect = 6.2e-21
Identity = 77/205 (37.56%), Positives = 111/205 (54.15%), Query Frame = 0
Query: 10 VKKKLWNVLRAVVFMLRKGVR----TFDLHLILKRSKIAGKAIGNLVEFHHGSAFSCQTI 69
+ KKLWN++R +++M+RKGV D + LKR K H GS S
Sbjct: 7 ISKKLWNIVRFLLYMIRKGVSKNKLIADFNATLKRGK--NLMFHQRRRVHAGSTASAALN 66
Query: 70 DIANSYISTRDYEFSCSNSP-AAYPFRYFNKLRKHRNHQYF-----PKSYSHDDFSTVAA 129
+ + S ++YEFSCSN+P ++PF +RK ++ F P++ D VAA
Sbjct: 67 ATSATASSRQEYEFSCSNTPNYSFPFSNMAFMRKKSHNNLFTCGQTPQTLDDD----VAA 126
Query: 130 VTRVLDILQTDQNSEKSPLVP----------LPGFGKSPLVVRQLRVTDSPFSL-KDDGD 189
VL++L + EK + P PGFG++PL VR LRVTDSPF L ++GD
Sbjct: 127 ARAVLELL--NGVGEKGNVTPADLTVALSPYFPGFGQTPL-VRPLRVTDSPFPLTPENGD 186
Query: 190 --SQFVDKAAEEFIKKFYKDLRLER 192
+ VDKAA++FIKKFYK+L ++
Sbjct: 187 VANGHVDKAADDFIKKFYKNLNQQK 202
BLAST of CmoCh12G013020 vs. TAIR 10
Match:
AT4G29110.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: response to chitin; LOCATED IN: vacuole; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G52140.1); Has 109 Blast hits to 109 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 109; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 97.8 bits (242), Expect = 1.1e-20
Identity = 83/210 (39.52%), Positives = 114/210 (54.29%), Query Frame = 0
Query: 1 MEVEPIRPAVKKKLWNVLRAVVFMLRKGV----RTFDLHLILKRSKIAGKAIGNLVEFHH 60
ME+E K+LW V+R V +L+ G DL+L+LKR KAI NL
Sbjct: 12 MEMEQNAQVAAKRLWKVVRIVFCVLKTGTVKNKLMLDLNLMLKR---GNKAITNL----R 71
Query: 61 GSAFSCQTIDIANSYISTRDYEFSCSNSPAAYPFRYFNKLRKHRNHQYFPKSYSHDDFST 120
+ S + D+++S RDY+ PF + +K RK R H Y +++ +
Sbjct: 72 RRSSSTGSHDVSSS-SRVRDYD----------PFAFISK-RKRRVH----GGYDNEEDAV 131
Query: 121 VAAVTRVLDILQTDQNSEKSPLVPLPGFGKSPLV----VRQLRVTDSPFSLKDDGD-SQF 180
AAV +V ++L +N +K+ V +SPL+ VRQLRVTDSPF L D GD
Sbjct: 132 EAAVKKVFELL--GENDKKT--VATESARESPLIMSPAVRQLRVTDSPFPLDDGGDHDHV 191
Query: 181 VDKAAEEFIKKFYKDLRLERSFA-ACESPY 201
VDKAAEEFIKKFYK+L+L++ A ESPY
Sbjct: 192 VDKAAEEFIKKFYKNLKLQKKMTNALESPY 194
BLAST of CmoCh12G013020 vs. TAIR 10
Match:
AT3G16330.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G52140.1); Has 109 Blast hits to 109 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 109; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 90.1 bits (222), Expect = 2.2e-18
Identity = 72/205 (35.12%), Positives = 106/205 (51.71%), Query Frame = 0
Query: 9 AVKKKLWNVLRAVVFMLRKGVR----TFDLHLILKRSKIAGKAIGNLVEFHH-----GSA 68
++ KKL N++R V++ML KG+ D + LKR K NL+ FH+ GSA
Sbjct: 6 SISKKLGNIVRFVLYMLHKGISKQKLLADFNATLKRGK-------NLM-FHNRRRVPGSA 65
Query: 69 FSCQTIDIANSYISTRDYEFSCSNSP-AAYPFRYFNKLRKHRNHQYFPKSYSHDDFSTVA 128
+ +YEFSCS++P +PF +K ++ F +
Sbjct: 66 VASH---------PQNEYEFSCSDTPNYTFPFNMAAFKKKSHHNSLFSCGQAPPTLDDDT 125
Query: 129 AVTR-VLDILQTDQNSEKSPLVP-------------LPGFGKSPLVVRQLRVTDSPFSLK 188
+V+R VL++L + + ++ P LPGFG+S VR LRVTDSPF L+
Sbjct: 126 SVSRAVLELLNSGGDHDQGSNTPAFSIEALTALSPYLPGFGRSTSSVRPLRVTDSPFPLR 185
BLAST of CmoCh12G013020 vs. TAIR 10
Match:
AT4G32860.1 (unknown protein; Has 46 Blast hits to 46 proteins in 10 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 46; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 49.3 bits (116), Expect = 4.3e-06
Identity = 59/214 (27.57%), Positives = 94/214 (43.93%), Query Frame = 0
Query: 2 EVEPIRPAVKKKLWNVLRAVVFMLRKG--------VRTFDLHLILKRSKIAGKAIGNLVE 61
++E KKL ++ + ++F ++K + T D HL+ KR KI K++ V
Sbjct: 6 DMEVCSTVTTKKLSSLAKLILFTIQKVSDASRHKLLTTLDPHLLAKRGKILRKSLNEAVS 65
Query: 62 FHHGSAFSCQTI---DIANSYIS----TRDYEFSCSNSPAAYPFR-YFNKLRKHRNHQYF 121
H S +C+ D+ +S+IS +YEFSCS++P P R Y + K R
Sbjct: 66 TSH-SRITCRPSDHQDVRSSFISPVPLQLEYEFSCSSTP---PRRSYATTVSKGRR---- 125
Query: 122 PKSYSHDDFSTVAAVTRVLDILQTDQNSEKSPLVPLPGFGKSPLVVRQLRVTDSPFSLKD 181
+ SH+ ++ Q LP + + R + P
Sbjct: 126 -SNGSHN-----------RPLINKRQRQAYIRYNTLPKV-RDSIWDRHVAAAVFPDVASS 185
Query: 182 DG--DSQFVDKAAEEFIKKFYKDLRLERSFAACE 198
G +S VD+AAEEFI+ FY+ LRL++ A E
Sbjct: 186 TGTMESCHVDRAAEEFIQSFYRQLRLQKWMMAQE 198
The following BLAST results are available for this feature:
BLAST of CmoCh12G013020 vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1FIE3 | 6.3e-114 | 100.00 | uncharacterized protein LOC111444217 OS=Cucurbita moschata OX=3662 GN=LOC1114442...
A0A6J1HSA9 | 1.3e-111 | 98.54 | uncharacterized protein LOC111465667 OS=Cucurbita maxima OX=3661 GN=LOC111465667...
A0A0A0LHW0 | 2.1e-93 | 86.79 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G000960 PE=4 SV=1
A0A6J1KI86 | 6.2e-93 | 84.29 | uncharacterized protein LOC111493479 OS=Cucurbita maxima OX=3661 GN=LOC111493479...
A0A5A7V077 | 1.1e-92 | 86.32 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
BLAST of CmoCh12G013020 vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G52140.1 | 6.2e-21 | 37.56 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G29110.1 | 1.1e-20 | 39.52 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: response...
AT3G16330.1 | 2.2e-18 | 35.12 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G32860.1 | 4.3e-06 | 27.57 | unknown protein; Has 46 Blast hits to 46 proteins in 10 species: Archae - 0; Bac...
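The pipe-separated summary rows above can be read programmatically. The sketch below is one possible approach rather than a tool provided by the database: the names `Hit` and `parse_hits` are hypothetical, and the E-value cutoff of 1e-10 is an arbitrary illustration, not a recommended threshold.

```python
from typing import List, NamedTuple


class Hit(NamedTuple):
    name: str
    evalue: float
    identity: float  # percent identity as listed in the summary row
    description: str


def parse_hits(text: str) -> List[Hit]:
    """Parse 'name | E-value | identity | description' rows, skipping headers."""
    hits = []
    for line in text.splitlines():
        fields = [field.strip() for field in line.split("|")]
        if len(fields) < 4 or fields[0] == "Match Name":
            continue
        try:
            evalue = float(fields[1])
            identity = float(fields[2])
        except ValueError:
            continue  # not a data row
        hits.append(Hit(fields[0], evalue, identity, fields[3]))
    return hits


rows = """Match Name | E-value | Identity | Description
A0A6J1FIE3 | 6.3e-114 | 100.00 | uncharacterized protein LOC111444217 OS=Cucurbita moschata OX=3662 GN=LOC1114442...
AT4G32860.1 | 4.3e-06 | 27.57 | unknown protein; Has 46 Blast hits to 46 proteins in 10 species: Archae - 0; Bac...
"""

# Keep only hits below the illustrative cutoff.
strong = [hit.name for hit in parse_hits(rows) if hit.evalue < 1e-10]
print(strong)  # ['A0A6J1FIE3']
```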