Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGCTTCCCATTCACCACCTCCTCCTCCTTCACATCACCCACCATTTTCCTTTCGCAATCTCATAAAACATGCTGTTTCAGTCAATAGATTTCAGATCATTACCAGATTTATTCACGCCATAGCATCAAGTCGAAGGACATTTCAACAGAACACACTTCCATTGTACGTCGTTCATGAATTCCAGAGCCATCAAATTCCTATCACGAACCGCTACACCAGCTTCGGGGAGATGAATTTGAGTCTCACTTTTCAGGCTGCCATAGGACTGGTCGTCTCTCCACAGATTTCACCTTCTCCATTCCTGCAAATCGCAGAGGTAATGTTGGTGATAAATTATGGAGTTTCGTTTTCTGGTGTTTTTCTTCGAAATTCCTTTCCGAGACTCGGAAACAATCTCGAGAAATTCGGTTCAGTATTAACCTCCTCCATCTTCTTCCTAATGGCCGCCTCCTTCCTTCCGGCGAGTTTTTGCTGGATAAGTTGGCCGGTGTTTGCTCTGTCAATGGCGGCATTCTTGCTCTCGCTCTTCAGATGA
mRNA sequence
ATGGCTTCCCATTCACCACCTCCTCCTCCTTCACATCACCCACCATTTTCCTTTCGCAATCTCATAAAACATGCTGTTTCAGTCAATAGATTTCAGATCATTACCAGATTTATTCACGCCATAGCATCAAGTCGAAGGACATTTCAACAGAACACACTTCCATTGTACGTCGTTCATGAATTCCAGAGCCATCAAATTCCTATCACGAACCGCTACACCAGCTTCGGGGAGATGAATTTGAGTCTCACTTTTCAGGCTGCCATAGGACTGGTCGTCTCTCCACAGATTTCACCTTCTCCATTCCTGCAAATCGCAGAGGTAATGTTGGTGATAAATTATGGAGTTTCGTTTTCTGGTGTTTTTCTTCGAAATTCCTTTCCGAGACTCGGAAACAATCTCGAGAAATTCGGTTCAGTATTAACCTCCTCCATCTTCTTCCTAATGGCCGCCTCCTTCCTTCCGGCGAGTTTTTGCTGGATAAGTTGGCCGGTGTTTGCTCTGTCAATGGCGGCATTCTTGCTCTCGCTCTTCAGATGA
Coding sequence (CDS)
ATGGCTTCCCATTCACCACCTCCTCCTCCTTCACATCACCCACCATTTTCCTTTCGCAATCTCATAAAACATGCTGTTTCAGTCAATAGATTTCAGATCATTACCAGATTTATTCACGCCATAGCATCAAGTCGAAGGACATTTCAACAGAACACACTTCCATTGTACGTCGTTCATGAATTCCAGAGCCATCAAATTCCTATCACGAACCGCTACACCAGCTTCGGGGAGATGAATTTGAGTCTCACTTTTCAGGCTGCCATAGGACTGGTCGTCTCTCCACAGATTTCACCTTCTCCATTCCTGCAAATCGCAGAGGTAATGTTGGTGATAAATTATGGAGTTTCGTTTTCTGGTGTTTTTCTTCGAAATTCCTTTCCGAGACTCGGAAACAATCTCGAGAAATTCGGTTCAGTATTAACCTCCTCCATCTTCTTCCTAATGGCCGCCTCCTTCCTTCCGGCGAGTTTTTGCTGGATAAGTTGGCCGGTGTTTGCTCTGTCAATGGCGGCATTCTTGCTCTCGCTCTTCAGATGA
Protein sequence
MASHSPPPPPSHHPPFSFRNLIKHAVSVNRFQIITRFIHAIASSRRTFQQNTLPLYVVHEFQSHQIPITNRYTSFGEMNLSLTFQAAIGLVVSPQISPSPFLQIAEVMLVINYGVSFSGVFLRNSFPRLGNNLEKFGSVLTSSIFFLMAASFLPASFCWISWPVFALSMAAFLLSLFR
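The protein sequence above can be checked against the CDS by reading the CDS three bases at a time. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this feature; the names `CODON_TABLE` and `translate` are illustrative and do not come from the source page.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
# Assumption: this feature is translated with the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS listed above.
cds_prefix = "ATGGCTTCCCATTCACCACCTCCTCCTCCT"
print(translate(cds_prefix))  # MASHSPPPPP
```

Passing the full CDS string instead of the prefix gives a sequence that can be compared directly with the protein listing; the final `TGA` codon is a stop and produces no residue.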
Homology
BLAST of HG10015443 vs. NCBI nr
Match:
XP_021293273.1 (uncharacterized protein LOC110423379 isoform X1 [Herrania umbratica])
HSP 1 Score: 74.3 bits (181), Expect = 1.2e-09
Identity = 51/160 (31.87%), Positives = 85/160 (53.12%), Query Frame = 0
Query: 22 IKHAVSVNRFQIITRFIHAIASSR-RTFQQNTLPLYVVH-EFQSHQIPITNRYTSFGEMN 81
I + S+ ITR AS + R + +PLY+ + E + H S G+
Sbjct: 48 ITRSSSLPTSSAITRMRFYTASYQWRCQALDCIPLYINYSEIEMHSYQ-PRPPVSLGKTI 107
Query: 82 LSLTFQAAIGLVVSPQISPSPFL---QIAEVMLVINYGVSFSGVFLRNSFPRLGNNLEKF 141
LSL+FQ + L S + + L ++ +++ + SFSG+FLR+S+P++ N +E
Sbjct: 108 LSLSFQIVVALAPSSSMGQAHHLLPIDFVKISMIMAFAASFSGIFLRSSYPKMANIIENI 167
Query: 142 GSVLTSSIFFLMAASFLPASFCWISWPVFALSMAAFLLSL 177
GS++ + FF+M + FLP +F W++W A S+ AF SL
Sbjct: 168 GSLIAAVGFFIMTSIFLPGNFSWVNWLACAFSLLAFFSSL 206
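The Identity and Positives figures in the HSP summary above are plain ratios: matching (or positively scoring) positions divided by the alignment length, gaps included. The sketch below recomputes them from such a summary line. The pattern `HSP_STATS` and the function `hsp_percentages` are hypothetical helpers written for this page's line format only, not part of any BLAST tool.

```python
import re

# Matches lines such as:
#   Identity = 51/160 (31.87%), Positives = 85/160 (53.12%), Query Frame = 0
# "Posi?tives" also accepts the misspelling "Postives" found in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def hsp_percentages(line: str) -> tuple[float, float]:
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, _, pos, pos_len, _ = match.groups()
    return 100 * int(ident) / int(ident_len), 100 * int(pos) / int(pos_len)


identity, positives = hsp_percentages(
    "Identity = 51/160 (31.87%), Positives = 85/160 (53.12%), Query Frame = 0"
)
print(f"{identity:.3f} {positives:.3f}")  # 31.875 53.125
```

Because 51/160 is exactly 31.875, the value can appear as either 31.87 or 31.88 depending on how the reporting tool rounds, which is why the per-HSP line and the summary table on this page differ in the last digit.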
BLAST of HG10015443 vs. NCBI nr
Match:
OMO72209.1 (hypothetical protein COLO4_27768 [Corchorus olitorius])
HSP 1 Score: 73.6 bits (179), Expect = 2.0e-09
Identity = 46/131 (35.11%), Positives = 71/131 (54.20%), Query Frame = 0
Query: 36 RFIHAIASSRRTFQQ-NTLPLYV-VHEFQSHQIPITNRYTSFGEMNLSLTFQAAIGLVVS 95
RF A S+RT Q ++ PLY+ E + + G+ +SLTFQ + L +S
Sbjct: 5 RFCSATPQSQRTSQALDSPPLYMNCFELEMQSYQQRPNSANLGKTIMSLTFQVVVALALS 64
Query: 96 PQISPSPFL--QIAEVMLVINYGVSFSGVFLRNSFPRLGNNLEKFGSVLTSSIFFLMAAS 155
S L QI +V +++ + SFSG+FLRNS+P+ +E GS+ + FF+M +
Sbjct: 65 MGQSHHQLLSIQIVKVSMIMAFAASFSGIFLRNSYPKSARIVENTGSIAAAVGFFIMTSI 124
Query: 156 FLPASFCWISW 163
FLP F W++W
Sbjct: 125 FLPVKFSWVAW 135
BLAST of HG10015443 vs. NCBI nr
Match:
EOY01534.1 (Ileal sodium/bile acid cotransporter, putative [Theobroma cacao])
HSP 1 Score: 72.8 bits (177), Expect = 3.4e-09
Identity = 41/120 (34.17%), Positives = 70/120 (58.33%), Query Frame = 0
Query: 60 EFQSHQIPITNRYTSFGEMNLSLTFQAAIGLVVSPQISPSPF---LQIAEVMLVINYGVS 119
E QS+Q S G+ LSL+FQ + L +S + + + I ++ +++ + S
Sbjct: 90 EMQSYQ---PRPPVSLGKTILSLSFQIVVALALSSSMGQTHHVLPIDIVKISMIMAFAAS 149
Query: 120 FSGVFLRNSFPRLGNNLEKFGSVLTSSIFFLMAASFLPASFCWISWPVFALSMAAFLLSL 177
FSG+FLR+S+P++ N +E GS++ + FF+M + FLP + W++W A S+ AF SL
Sbjct: 150 FSGIFLRSSYPKMANIIENIGSLIAAVGFFIMTSIFLPGNLYWVTWLACAFSLLAFFSSL 206
BLAST of HG10015443 vs. NCBI nr
Match:
KAA0050246.1 (putative Ileal sodium/bile acid cotransporter [Cucumis melo var. makuwa] >TYJ98160.1 putative Ileal sodium/bile acid cotransporter [Cucumis melo var. makuwa])
HSP 1 Score: 68.2 bits (165), Expect = 8.4e-08
Identity = 48/142 (33.80%), Positives = 79/142 (55.63%), Query Frame = 0
Query: 37 FIHAIASSRRTFQQNTLPLYVVHEFQSHQIPITNRYT--SFGEMNLSLTFQAAIGL-VVS 96
FI AI + R + N+LP+ + + S P + T + G+ L LTFQA + L + S
Sbjct: 7 FITAI--NERNPEINSLPICITMQRPS---PANSSKTENNVGKTILGLTFQAVLALFITS 66
Query: 97 PQISPSPFLQIAEVMLVINYGVSFSGVFLRNSFPRLGNNLEKFGSVLTSSIFFLMAASFL 156
P SP + ++I++ VSF+G+FL+N FPR+ EK G+++ + ++A+ +
Sbjct: 67 PNSSPPLLTHLFAAAVLISFAVSFAGIFLQNGFPRIALLFEKIGALIAAIGVCIVASLLI 126
Query: 157 PASFCWISWPVFALSMAAFLLS 176
+F WISW S+ AF+LS
Sbjct: 127 HQNFAWISWLASGFSLMAFVLS 143
BLAST of HG10015443 vs. NCBI nr
Match:
MBA0843405.1 (hypothetical protein [Gossypium armourianum])
HSP 1 Score: 67.8 bits (164), Expect = 1.1e-07
Identity = 55/201 (27.36%), Positives = 92/201 (45.77%), Query Frame = 0
Query: 10 PSHHPPFSFRNLIKHAVSVNRFQIITRFIHAIAS--------SRRTFQQNTLPLYVVH-- 69
PS++PPFS ++IKH+ + ++++F H+ + SR T Q + +
Sbjct: 2 PSNNPPFSILSIIKHSFH-DVGIMLSQFRHSFEANNPLTLPISRSTPQATANSITRIRCC 61
Query: 70 --------------------EFQSHQIPITNRYTSFGEMNLSLTFQAAIGLVVSPQISPS 129
E QSH S G+ +SL FQA L +S +
Sbjct: 62 SAAGSWGASPICNYINSFEIEMQSHHQQPRPNSVSLGKTIMSLAFQAVFALALSSSTEQA 121
Query: 130 ----PFLQIAEVMLVINYGVSFSGVFLRNSFPRLGNNLEKFGSVLTSSIFFLMAASFLPA 177
P L + +V+ + SFSG++L S PR+ + + S++ + FF+M++ FLP
Sbjct: 122 DHHHPLLPWSAASMVMAFAASFSGIYLHTSHPRIASIIGNTASMIAALGFFIMSSIFLPG 181
BLAST of HG10015443 vs. ExPASy TrEMBL
Match:
A0A6J1B1Z4 (uncharacterized protein LOC110423379 isoform X1 OS=Herrania umbratica OX=108875 GN=LOC110423379 PE=4 SV=1)
HSP 1 Score: 74.3 bits (181), Expect = 5.6e-10
Identity = 51/160 (31.87%), Positives = 85/160 (53.12%), Query Frame = 0
Query: 22 IKHAVSVNRFQIITRFIHAIASSR-RTFQQNTLPLYVVH-EFQSHQIPITNRYTSFGEMN 81
I + S+ ITR AS + R + +PLY+ + E + H S G+
Sbjct: 48 ITRSSSLPTSSAITRMRFYTASYQWRCQALDCIPLYINYSEIEMHSYQ-PRPPVSLGKTI 107
Query: 82 LSLTFQAAIGLVVSPQISPSPFL---QIAEVMLVINYGVSFSGVFLRNSFPRLGNNLEKF 141
LSL+FQ + L S + + L ++ +++ + SFSG+FLR+S+P++ N +E
Sbjct: 108 LSLSFQIVVALAPSSSMGQAHHLLPIDFVKISMIMAFAASFSGIFLRSSYPKMANIIENI 167
Query: 142 GSVLTSSIFFLMAASFLPASFCWISWPVFALSMAAFLLSL 177
GS++ + FF+M + FLP +F W++W A S+ AF SL
Sbjct: 168 GSLIAAVGFFIMTSIFLPGNFSWVNWLACAFSLLAFFSSL 206
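This alignment has the same bit score (74.3) as the corresponding NCBI nr hit but a different E-value (5.6e-10 here, 1.2e-09 against nr). Under the standard BLAST relation E = m·n·2^(-S'), where S' is the bit score and m·n is the effective search space, the E-value scales with the size of the database searched. The sketch below rearranges that relation to estimate the search space implied by each report. It assumes the relation applies unmodified and that the printed values are rounded; the function name `implied_search_space` is hypothetical, and the actual effective lengths are not given on this page.

```python
def implied_search_space(e_value: float, bit_score: float) -> float:
    """Rearrange E = (m * n) * 2**(-S') to estimate the search space m * n."""
    return e_value * 2.0 ** bit_score


# Values copied from the two reports for the Herrania umbratica hit.
nr_space = implied_search_space(1.2e-09, 74.3)
trembl_space = implied_search_space(5.6e-10, 74.3)

# With equal bit scores the ratio reduces to the ratio of the two E-values.
print(f"nr / TrEMBL search-space ratio: {nr_space / trembl_space:.2f}")
```

The ratio is only a rough indication of relative database size at the time of the search, since both E-values are printed to two significant figures.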
BLAST of HG10015443 vs. ExPASy TrEMBL
Match:
A0A1R3HPA4 (Uncharacterized protein OS=Corchorus olitorius OX=93759 GN=COLO4_27768 PE=4 SV=1)
HSP 1 Score: 73.6 bits (179), Expect = 9.6e-10
Identity = 46/131 (35.11%), Positives = 71/131 (54.20%), Query Frame = 0
Query: 36 RFIHAIASSRRTFQQ-NTLPLYV-VHEFQSHQIPITNRYTSFGEMNLSLTFQAAIGLVVS 95
RF A S+RT Q ++ PLY+ E + + G+ +SLTFQ + L +S
Sbjct: 5 RFCSATPQSQRTSQALDSPPLYMNCFELEMQSYQQRPNSANLGKTIMSLTFQVVVALALS 64
Query: 96 PQISPSPFL--QIAEVMLVINYGVSFSGVFLRNSFPRLGNNLEKFGSVLTSSIFFLMAAS 155
S L QI +V +++ + SFSG+FLRNS+P+ +E GS+ + FF+M +
Sbjct: 65 MGQSHHQLLSIQIVKVSMIMAFAASFSGIFLRNSYPKSARIVENTGSIAAAVGFFIMTSI 124
Query: 156 FLPASFCWISW 163
FLP F W++W
Sbjct: 125 FLPVKFSWVAW 135
BLAST of HG10015443 vs. ExPASy TrEMBL
Match:
A0A061EGV0 (Ileal sodium/bile acid cotransporter, putative OS=Theobroma cacao OX=3641 GN=TCM_011398 PE=4 SV=1)
HSP 1 Score: 72.8 bits (177), Expect = 1.6e-09
Identity = 41/120 (34.17%), Positives = 70/120 (58.33%), Query Frame = 0
Query: 60 EFQSHQIPITNRYTSFGEMNLSLTFQAAIGLVVSPQISPSPF---LQIAEVMLVINYGVS 119
E QS+Q S G+ LSL+FQ + L +S + + + I ++ +++ + S
Sbjct: 90 EMQSYQ---PRPPVSLGKTILSLSFQIVVALALSSSMGQTHHVLPIDIVKISMIMAFAAS 149
Query: 120 FSGVFLRNSFPRLGNNLEKFGSVLTSSIFFLMAASFLPASFCWISWPVFALSMAAFLLSL 177
FSG+FLR+S+P++ N +E GS++ + FF+M + FLP + W++W A S+ AF SL
Sbjct: 150 FSGIFLRSSYPKMANIIENIGSLIAAVGFFIMTSIFLPGNLYWVTWLACAFSLLAFFSSL 206
BLAST of HG10015443 vs. ExPASy TrEMBL
Match:
A0A5A7U7U1 (Putative Ileal sodium/bile acid cotransporter OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold180G00420 PE=4 SV=1)
HSP 1 Score: 68.2 bits (165), Expect = 4.0e-08
Identity = 48/142 (33.80%), Positives = 79/142 (55.63%), Query Frame = 0
Query: 37 FIHAIASSRRTFQQNTLPLYVVHEFQSHQIPITNRYT--SFGEMNLSLTFQAAIGL-VVS 96
FI AI + R + N+LP+ + + S P + T + G+ L LTFQA + L + S
Sbjct: 7 FITAI--NERNPEINSLPICITMQRPS---PANSSKTENNVGKTILGLTFQAVLALFITS 66
Query: 97 PQISPSPFLQIAEVMLVINYGVSFSGVFLRNSFPRLGNNLEKFGSVLTSSIFFLMAASFL 156
P SP + ++I++ VSF+G+FL+N FPR+ EK G+++ + ++A+ +
Sbjct: 67 PNSSPPLLTHLFAAAVLISFAVSFAGIFLQNGFPRIALLFEKIGALIAAIGVCIVASLLI 126
Query: 157 PASFCWISWPVFALSMAAFLLS 176
+F WISW S+ AF+LS
Sbjct: 127 HQNFAWISWLASGFSLMAFVLS 143
BLAST of HG10015443 vs. ExPASy TrEMBL
Match:
A0A7J9KAB3 (Uncharacterized protein OS=Gossypium armourianum OX=34283 GN=Goarm_000600 PE=4 SV=1)
HSP 1 Score: 67.8 bits (164), Expect = 5.3e-08
Identity = 55/201 (27.36%), Positives = 92/201 (45.77%), Query Frame = 0
Query: 10 PSHHPPFSFRNLIKHAVSVNRFQIITRFIHAIAS--------SRRTFQQNTLPLYVVH-- 69
PS++PPFS ++IKH+ + ++++F H+ + SR T Q + +
Sbjct: 2 PSNNPPFSILSIIKHSFH-DVGIMLSQFRHSFEANNPLTLPISRSTPQATANSITRIRCC 61
Query: 70 --------------------EFQSHQIPITNRYTSFGEMNLSLTFQAAIGLVVSPQISPS 129
E QSH S G+ +SL FQA L +S +
Sbjct: 62 SAAGSWGASPICNYINSFEIEMQSHHQQPRPNSVSLGKTIMSLAFQAVFALALSSSTEQA 121
Query: 130 ----PFLQIAEVMLVINYGVSFSGVFLRNSFPRLGNNLEKFGSVLTSSIFFLMAASFLPA 177
P L + +V+ + SFSG++L S PR+ + + S++ + FF+M++ FLP
Sbjct: 122 DHHHPLLPWSAASMVMAFAASFSGIYLHTSHPRIASIIGNTASMIAALGFFIMSSIFLPG 181
The following BLAST results are available for this feature:
NCBI nr
Match Name | E-value | Identity (%) | Description
XP_021293273.1 | 1.2e-09 | 31.88 | uncharacterized protein LOC110423379 isoform X1 [Herrania umbratica]
OMO72209.1 | 2.0e-09 | 35.11 | hypothetical protein COLO4_27768 [Corchorus olitorius]
EOY01534.1 | 3.4e-09 | 34.17 | Ileal sodium/bile acid cotransporter, putative [Theobroma cacao]
KAA0050246.1 | 8.4e-08 | 33.80 | putative Ileal sodium/bile acid cotransporter [Cucumis melo var. makuwa] >TYJ981...
MBA0843405.1 | 1.1e-07 | 27.36 | hypothetical protein [Gossypium armourianum]
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1B1Z4 | 5.6e-10 | 31.88 | uncharacterized protein LOC110423379 isoform X1 OS=Herrania umbratica OX=108875 ...
A0A1R3HPA4 | 9.6e-10 | 35.11 | Uncharacterized protein OS=Corchorus olitorius OX=93759 GN=COLO4_27768 PE=4 SV=1
A0A061EGV0 | 1.6e-09 | 34.17 | Ileal sodium/bile acid cotransporter, putative OS=Theobroma cacao OX=3641 GN=TCM...
A0A5A7U7U1 | 4.0e-08 | 33.80 | Putative Ileal sodium/bile acid cotransporter OS=Cucumis melo var. makuwa OX=119...
A0A7J9KAB3 | 5.3e-08 | 27.36 | Uncharacterized protein OS=Gossypium armourianum OX=34283 GN=Goarm_000600 PE=4 S...