Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGCTTCAACTTCACACCAAACCTCATTTTATATGAATGCACTTTCTTCCTTTATCCCAACAATCAACCAAAGAAGCTATGGAATTAGCATTTTCTGCACACCATCAGATTGTTTACCAATTTGCATTAGAATGCAACAGAGCCCTCAAGCAACAGCCAGTAGCCCTCAAAATCTTGCCAAGACAATTCTTGGGCTTACTTTTCAAGCAATTTTGGCCTTGTTCATCAGCTCACCAACTTCTTCTCCTCCACTTTTGACTCATCTTTTTGGTGCTGCTGTTTTGATTAGCTTTGCAGTTTCATTTGCTGCTCTTTTCCTTCATAACTCCTTCCCGAGAACCGCACGTTTGTTCGAAAAGATCGGTGCACTTTTCGCCGGAATCGGTGTTTGTATCATAGCAAGTTTTCTTCTAGTCCATCAGAACTTTGCTTGGATATGTTGGTTGGCATGTGGCTTCTCCTTCATTGTCTTTGTTCTATCATTTAAGTGA
mRNA sequence
ATGGCTTCAACTTCACACCAAACCTCATTTTATATGAATGCACTTTCTTCCTTTATCCCAACAATCAACCAAAGAAGCTATGGAATTAGCATTTTCTGCACACCATCAGATTGTTTACCAATTTGCATTAGAATGCAACAGAGCCCTCAAGCAACAGCCAGTAGCCCTCAAAATCTTGCCAAGACAATTCTTGGGCTTACTTTTCAAGCAATTTTGGCCTTGTTCATCAGCTCACCAACTTCTTCTCCTCCACTTTTGACTCATCTTTTTGGTGCTGCTGTTTTGATTAGCTTTGCAGTTTCATTTGCTGCTCTTTTCCTTCATAACTCCTTCCCGAGAACCGCACGTTTGTTCGAAAAGATCGGTGCACTTTTCGCCGGAATCGGTGTTTGTATCATAGCAAGTTTTCTTCTAGTCCATCAGAACTTTGCTTGGATATGTTGGTTGGCATGTGGCTTCTCCTTCATTGTCTTTGTTCTATCATTTAAGTGA
Coding sequence (CDS)
ATGGCTTCAACTTCACACCAAACCTCATTTTATATGAATGCACTTTCTTCCTTTATCCCAACAATCAACCAAAGAAGCTATGGAATTAGCATTTTCTGCACACCATCAGATTGTTTACCAATTTGCATTAGAATGCAACAGAGCCCTCAAGCAACAGCCAGTAGCCCTCAAAATCTTGCCAAGACAATTCTTGGGCTTACTTTTCAAGCAATTTTGGCCTTGTTCATCAGCTCACCAACTTCTTCTCCTCCACTTTTGACTCATCTTTTTGGTGCTGCTGTTTTGATTAGCTTTGCAGTTTCATTTGCTGCTCTTTTCCTTCATAACTCCTTCCCGAGAACCGCACGTTTGTTCGAAAAGATCGGTGCACTTTTCGCCGGAATCGGTGTTTGTATCATAGCAAGTTTTCTTCTAGTCCATCAGAACTTTGCTTGGATATGTTGGTTGGCATGTGGCTTCTCCTTCATTGTCTTTGTTCTATCATTTAAGTGA
Protein sequence
MASTSHQTSFYMNALSSFIPTINQRSYGISIFCTPSDCLPICIRMQQSPQATASSPQNLAKTILGLTFQAILALFISSPTSSPPLLTHLFGAAVLISFAVSFAALFLHNSFPRTARLFEKIGALFAGIGVCIIASFLLVHQNFAWICWLACGFSFIVFVLSFK
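The protein sequence above corresponds to a codon-by-codon translation of the CDS. The following is a minimal sketch of that translation, assuming the standard genetic code (the page does not say which translation table was used). The `translate` helper is hypothetical, and the two short fragments are the first 18 and last 9 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the page does not state which translation table applies.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, dropping a single terminal stop codon."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    if protein.endswith("*"):
        protein = protein[:-1]
    return protein


# Start of the CDS above: ATG GCT TCA ACT TCA CAC
print(translate("ATGGCTTCAACTTCACAC"))  # MASTSH
# End of the CDS above: TTT AAG TGA (TGA is the stop codon)
print(translate("TTTAAGTGA"))  # FK
```

Passing the full CDS string to `translate` should give a sequence that can be compared against the protein listed above.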
Homology
BLAST of HG10009654 vs. NCBI nr
Match:
KGN49808.1 (hypothetical protein Csa_004650 [Cucumis sativus])
HSP 1 Score: 240.0 bits (611), Expect = 1.5e-59
Identity = 126/164 (76.83%), Positives = 137/164 (83.54%), Query Frame = 0
Query: 1 MASTSHQTSFYMNALSSFIPTINQRSYGISIFCTPSDCLPICIRMQQSPQATASSPQNLA 60
MASTSHQTS M+A SSFI TI+QR+YG+ CTPS+CLP+ IRMQQSPQATA S Q L
Sbjct: 1 MASTSHQTSLDMSAFSSFIRTISQRNYGLGCCCTPSNCLPVFIRMQQSPQATADSSQKLG 60
Query: 61 KTILGLTFQAILALFISS-PTSSPPLLTHLFGAAVLISFAVSFAALFLHNSFPRTARLFE 120
K IL L+FQA+LALFISS PTS PPLL H F AAV ISFAVSFAALFLHNSFPRTA LFE
Sbjct: 61 KIILSLSFQAVLALFISSPPTSPPPLLIHFFAAAVFISFAVSFAALFLHNSFPRTAHLFE 120
Query: 121 KIGALFAGIGVCIIASFLLVHQNFAWICWLACGFSFIVFVLSFK 164
K+GALF+ GVC IASFLLVHQNFAWICW+AC FS IVF LSFK
Sbjct: 121 KVGALFSAFGVCFIASFLLVHQNFAWICWVACTFSIIVFALSFK 164
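The percentages on the HSP statistics line are the ratio of matching positions to alignment length. As a rough sketch, the line can be parsed and the percentages recomputed to check them. The `parse_hsp_stats` helper and its field names are hypothetical, and the regular expression assumes the line layout shown in this report.

```python
import re

# Assumed layout: "Identity = a/b (p%), Positives = c/d (q%)".
# "Posi?tives" tolerates a missing "i" in the label.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract the counts and recompute the percentages from them."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 126/164 (76.83%), Positives = 137/164 (83.54%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])  # 76.83 83.54
```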
BLAST of HG10009654 vs. NCBI nr
Match:
KAA0050246.1 (putative Ileal sodium/bile acid cotransporter [Cucumis melo var. makuwa] >TYJ98160.1 putative Ileal sodium/bile acid cotransporter [Cucumis melo var. makuwa])
HSP 1 Score: 179.9 bits (455), Expect = 1.8e-41
Identity = 102/153 (66.67%), Positives = 117/153 (76.47%), Query Frame = 0
Query: 12 MNALSSFIPTINQRSYGISIFCTPSDCLPICIRMQQ-SPQATASSPQNLAKTILGLTFQA 71
MN L SFI IN+R+ I + LPICI MQ+ SP ++ + N+ KTILGLTFQA
Sbjct: 1 MNELYSFITAINERNPEI-------NSLPICITMQRPSPANSSKTENNVGKTILGLTFQA 60
Query: 72 ILALFISSPTSSPPLLTHLFGAAVLISFAVSFAALFLHNSFPRTARLFEKIGALFAGIGV 131
+LALFI+SP SSPPLLTHLF AAVLISFAVSFA +FL N FPR A LFEKIGAL A IGV
Sbjct: 61 VLALFITSPNSSPPLLTHLFAAAVLISFAVSFAGIFLQNGFPRIALLFEKIGALIAAIGV 120
Query: 132 CIIASFLLVHQNFAWICWLACGFSFIVFVLSFK 164
CI+AS LL+HQNFAWI WLA GFS + FVLSF+
Sbjct: 121 CIVAS-LLIHQNFAWISWLASGFSLMAFVLSFR 145
BLAST of HG10009654 vs. NCBI nr
Match:
KGN49803.1 (hypothetical protein Csa_004681 [Cucumis sativus])
HSP 1 Score: 177.9 bits (450), Expect = 6.9e-41
Identity = 104/164 (63.41%), Positives = 122/164 (74.39%), Query Frame = 0
Query: 1 MASTSHQTSFYMNALSSFIPTINQRSYGISIFCTPSDCLPICIRMQ-QSPQATASSPQNL 60
M S+S Q S MN L S I +IN+R+ I + LPICI MQ SP ++ + N+
Sbjct: 1 MESSSQQISIDMNHLYSLITSINERNPEI-------NGLPICIIMQTASPANSSKTENNV 60
Query: 61 AKTILGLTFQAILALFISSPTSSPPLLTHLFGAAVLISFAVSFAALFLHNSFPRTARLFE 120
TILGLTFQA+LALFI+S TSSPPLLTHLFGAAVLISFAVSF +FL + FPR A LFE
Sbjct: 61 GTTILGLTFQAVLALFITSSTSSPPLLTHLFGAAVLISFAVSFPGVFLQDGFPRIALLFE 120
Query: 121 KIGALFAGIGVCIIASFLLVHQNFAWICWLACGFSFIVFVLSFK 164
KIGAL A IGVCI+AS LL+HQNFAWI WLACGFS + F+LSF+
Sbjct: 121 KIGALIAAIGVCILAS-LLIHQNFAWISWLACGFSLMAFLLSFR 156
BLAST of HG10009654 vs. NCBI nr
Match:
KAG6579132.1 (hypothetical protein SDJN03_23580, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 147.5 bits (371), Expect = 9.9e-32
Identity = 81/131 (61.83%), Positives = 98/131 (74.81%), Query Frame = 0
Query: 32 FCTPSDCLPICIRMQQSPQATASSPQNLAKTILGLTFQAILALFISSPTSSPPLLTHLFG 91
F TP DCLPI R ++ N+ K I+GLT QA+LA+FISSP+SSPPLL LFG
Sbjct: 3 FSTPRDCLPIW-RAREQRLEPDKGHSNVGKAIIGLTSQALLAMFISSPSSSPPLLRVLFG 62
Query: 92 AAVLISFAVSFAALFLHNSFPRTARLFEKIGALFAGIGVCIIASFLLVHQNFAWICWLAC 151
A + ISF +SFA +FL N+FP+ ARLFEK+GALFA IGV IIASFLL+H+N+AWI LAC
Sbjct: 63 ATMFISFLLSFAGIFLRNAFPKAARLFEKLGALFAAIGVSIIASFLLMHENYAWISGLAC 122
Query: 152 GFSFIVFVLSF 163
FS IVF LS+
Sbjct: 123 VFSLIVFGLSY 132
BLAST of HG10009654 vs. NCBI nr
Match:
XP_022146444.1 (uncharacterized protein LOC111015658 [Momordica charantia])
HSP 1 Score: 144.8 bits (364), Expect = 6.4e-31
Identity = 86/126 (68.25%), Positives = 95/126 (75.40%), Query Frame = 0
Query: 1 MASTSHQTSFYMNALSSFIPTINQRSYGISIF------CTPSDCLPICIRMQQSPQATAS 60
MAST QTS MNALSSFI IN+R+ GIS F TPSDCLPICIRMQ+ P A +S
Sbjct: 1 MASTPQQTSVDMNALSSFITAINERNLGISSFTWGGGTSTPSDCLPICIRMQRPPPAKSS 60
Query: 61 SPQNLAKTILGLTFQAILALFISSPTSSPPLLTHLFGAAVLISFAVSFAALFLHNSFPRT 120
Q+L KTILGLTFQA+LALFIS P+S P L T LFGAAVLISFAVSFA LFL ++PR
Sbjct: 61 --QSLGKTILGLTFQAVLALFISLPSSPPQLPTLLFGAAVLISFAVSFAGLFLQTAYPRM 120
BLAST of HG10009654 vs. ExPASy TrEMBL
Match:
A0A0A0KJP2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G137410 PE=4 SV=1)
HSP 1 Score: 240.0 bits (611), Expect = 7.1e-60
Identity = 126/164 (76.83%), Positives = 137/164 (83.54%), Query Frame = 0
Query: 1 MASTSHQTSFYMNALSSFIPTINQRSYGISIFCTPSDCLPICIRMQQSPQATASSPQNLA 60
MASTSHQTS M+A SSFI TI+QR+YG+ CTPS+CLP+ IRMQQSPQATA S Q L
Sbjct: 1 MASTSHQTSLDMSAFSSFIRTISQRNYGLGCCCTPSNCLPVFIRMQQSPQATADSSQKLG 60
Query: 61 KTILGLTFQAILALFISS-PTSSPPLLTHLFGAAVLISFAVSFAALFLHNSFPRTARLFE 120
K IL L+FQA+LALFISS PTS PPLL H F AAV ISFAVSFAALFLHNSFPRTA LFE
Sbjct: 61 KIILSLSFQAVLALFISSPPTSPPPLLIHFFAAAVFISFAVSFAALFLHNSFPRTAHLFE 120
Query: 121 KIGALFAGIGVCIIASFLLVHQNFAWICWLACGFSFIVFVLSFK 164
K+GALF+ GVC IASFLLVHQNFAWICW+AC FS IVF LSFK
Sbjct: 121 KVGALFSAFGVCFIASFLLVHQNFAWICWVACTFSIIVFALSFK 164
BLAST of HG10009654 vs. ExPASy TrEMBL
Match:
A0A5A7U7U1 (Putative Ileal sodium/bile acid cotransporter OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold180G00420 PE=4 SV=1)
HSP 1 Score: 179.9 bits (455), Expect = 8.7e-42
Identity = 102/153 (66.67%), Positives = 117/153 (76.47%), Query Frame = 0
Query: 12 MNALSSFIPTINQRSYGISIFCTPSDCLPICIRMQQ-SPQATASSPQNLAKTILGLTFQA 71
MN L SFI IN+R+ I + LPICI MQ+ SP ++ + N+ KTILGLTFQA
Sbjct: 1 MNELYSFITAINERNPEI-------NSLPICITMQRPSPANSSKTENNVGKTILGLTFQA 60
Query: 72 ILALFISSPTSSPPLLTHLFGAAVLISFAVSFAALFLHNSFPRTARLFEKIGALFAGIGV 131
+LALFI+SP SSPPLLTHLF AAVLISFAVSFA +FL N FPR A LFEKIGAL A IGV
Sbjct: 61 VLALFITSPNSSPPLLTHLFAAAVLISFAVSFAGIFLQNGFPRIALLFEKIGALIAAIGV 120
Query: 132 CIIASFLLVHQNFAWICWLACGFSFIVFVLSFK 164
CI+AS LL+HQNFAWI WLA GFS + FVLSF+
Sbjct: 121 CIVAS-LLIHQNFAWISWLASGFSLMAFVLSFR 145
BLAST of HG10009654 vs. ExPASy TrEMBL
Match:
A0A0A0KJN8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G136870 PE=4 SV=1)
HSP 1 Score: 177.9 bits (450), Expect = 3.3e-41
Identity = 104/164 (63.41%), Positives = 122/164 (74.39%), Query Frame = 0
Query: 1 MASTSHQTSFYMNALSSFIPTINQRSYGISIFCTPSDCLPICIRMQ-QSPQATASSPQNL 60
M S+S Q S MN L S I +IN+R+ I + LPICI MQ SP ++ + N+
Sbjct: 1 MESSSQQISIDMNHLYSLITSINERNPEI-------NGLPICIIMQTASPANSSKTENNV 60
Query: 61 AKTILGLTFQAILALFISSPTSSPPLLTHLFGAAVLISFAVSFAALFLHNSFPRTARLFE 120
TILGLTFQA+LALFI+S TSSPPLLTHLFGAAVLISFAVSF +FL + FPR A LFE
Sbjct: 61 GTTILGLTFQAVLALFITSSTSSPPLLTHLFGAAVLISFAVSFPGVFLQDGFPRIALLFE 120
Query: 121 KIGALFAGIGVCIIASFLLVHQNFAWICWLACGFSFIVFVLSFK 164
KIGAL A IGVCI+AS LL+HQNFAWI WLACGFS + F+LSF+
Sbjct: 121 KIGALIAAIGVCILAS-LLIHQNFAWISWLACGFSLMAFLLSFR 156
BLAST of HG10009654 vs. ExPASy TrEMBL
Match:
A0A6J1CY58 (uncharacterized protein LOC111015658 OS=Momordica charantia OX=3673 GN=LOC111015658 PE=4 SV=1)
HSP 1 Score: 144.8 bits (364), Expect = 3.1e-31
Identity = 86/126 (68.25%), Positives = 95/126 (75.40%), Query Frame = 0
Query: 1 MASTSHQTSFYMNALSSFIPTINQRSYGISIF------CTPSDCLPICIRMQQSPQATAS 60
MAST QTS MNALSSFI IN+R+ GIS F TPSDCLPICIRMQ+ P A +S
Sbjct: 1 MASTPQQTSVDMNALSSFITAINERNLGISSFTWGGGTSTPSDCLPICIRMQRPPPAKSS 60
Query: 61 SPQNLAKTILGLTFQAILALFISSPTSSPPLLTHLFGAAVLISFAVSFAALFLHNSFPRT 120
Q+L KTILGLTFQA+LALFIS P+S P L T LFGAAVLISFAVSFA LFL ++PR
Sbjct: 61 --QSLGKTILGLTFQAVLALFISLPSSPPQLPTLLFGAAVLISFAVSFAGLFLQTAYPRM 120
BLAST of HG10009654 vs. ExPASy TrEMBL
Match:
A0A0A0KQ03 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G136900 PE=4 SV=1)
HSP 1 Score: 125.6 bits (314), Expect = 2.0e-25
Identity = 83/169 (49.11%), Positives = 106/169 (62.72%), Query Frame = 0
Query: 1 MAST-SHQTSFYMNALSSFIPTINQRSYGISIFCT----PSDCLPICIRMQQSPQATASS 60
MA T H+ S MN L+S I + R+ GIS CT SDCLPI I+MQ+ + +S
Sbjct: 1 MAETPPHEISLDMNKLNSLIFAVTNRNPGIS-SCTWGGAASDCLPIYIKMQR--PSPVNS 60
Query: 61 PQNLAKTILGLTFQAILALFIS-SPTSSPPLLTHLFGAAVLISFAVSFAALFLHNSFPRT 120
PQ T L LTFQAI+ LF+S +P+SS PL + LF A +L SF S+ + L FP+T
Sbjct: 61 PQ-FGNTFLSLTFQAIVGLFLSLNPSSSSPLPSRLFAAVMLTSFIFSYDGVILQKPFPKT 120
Query: 121 ARLFEKIGALFAGIGVCIIASFLLVHQNFAWICWLACGFSFIVFVLSFK 164
A+L + GALFA IG CII S LL++ NF WICWLA G F++SFK
Sbjct: 121 AQLLQTFGALFAAIGTCIIGS-LLLYPNFTWICWLAAGLILPAFIISFK 164
The following BLAST results are available for this feature:
NCBI nr
Match Name | E-value | Identity (%) | Description
KGN49808.1 | 1.5e-59 | 76.83 | hypothetical protein Csa_004650 [Cucumis sativus]
KAA0050246.1 | 1.8e-41 | 66.67 | putative Ileal sodium/bile acid cotransporter [Cucumis melo var. makuwa] >TYJ981...
KGN49803.1 | 6.9e-41 | 63.41 | hypothetical protein Csa_004681 [Cucumis sativus]
KAG6579132.1 | 9.9e-32 | 61.83 | hypothetical protein SDJN03_23580, partial [Cucurbita argyrosperma subsp. sorori...
XP_022146444.1 | 6.4e-31 | 68.25 | uncharacterized protein LOC111015658 [Momordica charantia]
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A0A0KJP2 | 7.1e-60 | 76.83 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G137410 PE=4 SV=1
A0A5A7U7U1 | 8.7e-42 | 66.67 | Putative Ileal sodium/bile acid cotransporter OS=Cucumis melo var. makuwa OX=119...
A0A0A0KJN8 | 3.3e-41 | 63.41 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G136870 PE=4 SV=1
A0A6J1CY58 | 3.1e-31 | 68.25 | uncharacterized protein LOC111015658 OS=Momordica charantia OX=3673 GN=LOC111015...
A0A0A0KQ03 | 2.0e-25 | 49.11 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G136900 PE=4 SV=1
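The pipe-delimited summary rows above can be loaded into records for sorting or filtering. The sketch below assumes the column order shown (match name, E-value, identity, description). The `parse_hits` helper and the `1e-40` cutoff are illustrative choices, not part of the original report. The sample rows are the ExPASy TrEMBL hits listed above.

```python
def parse_hits(rows):
    """Parse 'name | e-value | identity | description' rows into dicts."""
    hits = []
    for row in rows:
        fields = [field.strip() for field in row.split("|")]
        # Skip header rows and anything with fewer than four columns.
        if len(fields) < 4 or fields[0] == "Match Name":
            continue
        name, evalue, identity, description = fields[:4]
        hits.append({
            "name": name,
            "evalue": float(evalue),
            "identity": float(identity),
            "description": description,
        })
    return hits


rows = [
    "Match Name | E-value | Identity | Description",
    "A0A0A0KJP2 | 7.1e-60 | 76.83 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G137410 PE=4 SV=1",
    "A0A5A7U7U1 | 8.7e-42 | 66.67 | Putative Ileal sodium/bile acid cotransporter OS=Cucumis melo var. makuwa OX=119...",
    "A0A0A0KJN8 | 3.3e-41 | 63.41 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G136870 PE=4 SV=1",
    "A0A6J1CY58 | 3.1e-31 | 68.25 | uncharacterized protein LOC111015658 OS=Momordica charantia OX=3673 GN=LOC111015...",
    "A0A0A0KQ03 | 2.0e-25 | 49.11 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G136900 PE=4 SV=1",
]

hits = parse_hits(rows)
best = min(hits, key=lambda hit: hit["evalue"])
# Hypothetical cutoff, chosen only to illustrate filtering.
strong = [hit["name"] for hit in hits if hit["evalue"] <= 1e-40]
print(best["name"])  # A0A0A0KJP2
print(strong)        # ['A0A0A0KJP2', 'A0A5A7U7U1', 'A0A0A0KJN8']
```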