Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
CCCTGACAAACTTTCTGTCTAACTGTCGAGATCTCTTGCTCGGTGTGCTCGTTACTCGTTTAGCTCGGCAAGCAAGTAAGTAATTTTTCCTCAGCGTTTCGTTTCTAAGCTAGGTGACCTGAGCTTGGGGATTTTGTTTAGTTAGTAAGGGAGTAGGCAGGGGGGGAGTCAAAGGGTGGGAATAGATTGTAGTGAATTGTGGGAGTTTCTAAGGAAAGCTATCTTCTCAAAGTCAAATCTTCAAAAGGATGGAGTTCCAGCAAAACCCTTTCTAAGGATGAGGACGGTAGTTCGGGAGTTCGCTCGAAAGAACTGGCGTACCAAGGGGGATCTTTCGATCACTCTGTTAGATCCAAGGCACGTCTTCATCAAGCTCTCGAACAAAGAAGATATGCACCAAAAAGAGAAGTTTTTGATTGAGGGATTGATGTTTAGGGTTTTCTGCTGGAGCCATGATTTCCGTCTCCGCAATGGTGATCCAATGCATGCGCCTATTTGGGTATCGCGACCCCATCTGCCTATTGTCTTCTTCTTTGAGTTTCGCCTGTTTTCTATCGTCTCGACTTTCGGAATTGGCCTGACCCTTGAAGGTCCTACGAAGCTTGTCTCACGCCCCCCTCTCACAATAAAGCTAGGGTTTGTGTAGAAATCGGTCTTCTCAAGGAAAACCTCCCAAATCGTGTTTTGATTGGATCGACTGTGTATGTATGGATCACTAGCAAGAAATCGTTTCTTTTTGAATACCTAAGTTCTGCACGCATTGCAGACGACAAGGTCATGCCCGTCACGAATGCAAAGTGGCGAATAAGGAACCAGGTGTCAACCCGTCACCGAGAATGGCCTATGTTCCAAGAACCAATGCTACCCAGCACCTACCACTGAAGTTAGCCTTCGTGATTCAGATGTTGCTGCTGCTGACATGCAAACCATTGAAGATGGTCCTCGGGGCGAAGAGCTTGAACCAGATAACAATCCCCAAGCTTCTGTTCTTCAACAACCGGAGTATTTAGCTATTGGAAAATCGCCTTCGGTTAACGCAGCCCCTAATACCGGCGGTAGCTCTGCCTCATGGGGACAGCAGGAAGAGGAAAGAGTTTGA
mRNA sequence
CCCTGACAAACTTTCTGTCTAACTGTCGAGATCTCTTGCTCGGTGTGCTCGTTACTCGTTTAGCTCGGCAAGCAAGTAAGTAATTTTTCCTCAGCGTTTCGTTTCTAAGCTAGGTGACCTGAGCTTGGGGATTTTGTTTAGTTAGTAAGGGAGTAGGCAGGGGGGGAGTCAAAGGGTGGGAATAGATTGTAGTGAATTGTGGGAGTTTCTAAGGAAAGCTATCTTCTCAAAGTCAAATCTTCAAAAGGATGGAGTTCCAGCAAAACCCTTTCTAAGGATGAGGACGGTAGTTCGGGAGTTCGCTCGAAAGAACTGGCGTACCAAGGGGGATCTTTCGATCACTCTGTTAGATCCAAGGCACGTCTTCATCAAGCTCTCGAACAAAGAAGATATGCACCAAAAAGAGAAGTTTTTGATTGAGGGATTGATGTTTAGGGTTTTCTGCTGGAGCCATGATTTCCGTCTCCGCAATGGTGATCCAATGCATGCGCCTATTTGGGTATCGCGACCCCATCTGCCTATTGTCTTCTTCTTTGAGTTTCGCCTGTTTTCTATCGTCTCGACTTTCGGAATTGGCCTGACCCTTGAAGGTCCTACGAAGCTTGTCTCACGCCCCCCTCTCACAATAAAGCTAGGGTGTCAACCCGTCACCGAGAATGGCCTATGTTCCAAGAACCAATGCTACCCAGCACCTACCACTGAAGTTAGCCTTCGTGATTCAGATGTTGCTGCTGCTGACATGCAAACCATTGAAGATGGTCCTCGGGGCGAAGAGCTTGAACCAGATAACAATCCCCAAGCTTCTGTTCTTCAACAACCGGAGTATTTAGCTATTGGAAAATCGCCTTCGGTTAACGCAGCCCCTAATACCGGCGGTAGCTCTGCCTCATGGGGACAGCAGGAAGAGGAAAGAGTTTGA
Coding sequence (CDS)
ATGAGGACGGTAGTTCGGGAGTTCGCTCGAAAGAACTGGCGTACCAAGGGGGATCTTTCGATCACTCTGTTAGATCCAAGGCACGTCTTCATCAAGCTCTCGAACAAAGAAGATATGCACCAAAAAGAGAAGTTTTTGATTGAGGGATTGATGTTTAGGGTTTTCTGCTGGAGCCATGATTTCCGTCTCCGCAATGGTGATCCAATGCATGCGCCTATTTGGGTATCGCGACCCCATCTGCCTATTGTCTTCTTCTTTGAGTTTCGCCTGTTTTCTATCGTCTCGACTTTCGGAATTGGCCTGACCCTTGAAGGTCCTACGAAGCTTGTCTCACGCCCCCCTCTCACAATAAAGCTAGGGTGTCAACCCGTCACCGAGAATGGCCTATGTTCCAAGAACCAATGCTACCCAGCACCTACCACTGAAGTTAGCCTTCGTGATTCAGATGTTGCTGCTGCTGACATGCAAACCATTGAAGATGGTCCTCGGGGCGAAGAGCTTGAACCAGATAACAATCCCCAAGCTTCTGTTCTTCAACAACCGGAGTATTTAGCTATTGGAAAATCGCCTTCGGTTAACGCAGCCCCTAATACCGGCGGTAGCTCTGCCTCATGGGGACAGCAGGAAGAGGAAAGAGTTTGA
Protein sequence
MRTVVREFARKNWRTKGDLSITLLDPRHVFIKLSNKEDMHQKEKFLIEGLMFRVFCWSHDFRLRNGDPMHAPIWVSRPHLPIVFFFEFRLFSIVSTFGIGLTLEGPTKLVSRPPLTIKLGCQPVTENGLCSKNQCYPAPTTEVSLRDSDVAAADMQTIEDGPRGEELEPDNNPQASVLQQPEYLAIGKSPSVNAAPNTGGSSASWGQQEEERV
Homology
BLAST of Tan0016175 vs. NCBI nr
Match:
TYK04334.1 (hypothetical protein E5676_scaffold84043G00010 [Cucumis melo var. makuwa])
HSP 1 Score: 177.9 bits (450), Expect = 9.0e-41
Identity = 115/212 (54.25%), Postives = 126/212 (59.43%), Query Frame = 0
Query: 2 RTVVREFARKNWRTKGDLSITLLDPRHVFIKLSNKEDMHQKEKFLIEGLMFRVFCWSHDF 61
RTVVREFARKNWRTKG D + K FR
Sbjct: 147 RTVVREFARKNWRTKG-------------------FDHSLRSKARFSAGAFRAMV----- 206
Query: 62 RLRNGDPMHAPIWVSRPHLPIVFFFEFRLFSIVSTFGIGLTLEGPTKLVSRPPLTIKLGC 121
+ + R H PIVFFF F VSTFGIGLTLEGPTKLVSRPPL IKLG
Sbjct: 207 -------IQCMRLLGRLHPPIVFFF----FFAVSTFGIGLTLEGPTKLVSRPPLRIKLG- 266
Query: 122 QPVTENGLCSKNQCYPAPTTEVSLRDSDVAAADMQTIEDGPRGEELEPDNNPQASVLQQP 181
++ L S + V+ +VAAADM+TI DGPRGEELEPD NPQAS+LQ+P
Sbjct: 267 ---KKSFLFSSARIADEKVMPVT-NAKNVAAADMETIFDGPRGEELEPDKNPQASLLQEP 318
Query: 182 EYLAIGKSPSVNAAPNTGGSSASWGQQEEERV 214
YLAIGKSPSV AAPNTGGSSASWGQQEEERV
Sbjct: 327 TYLAIGKSPSVEAAPNTGGSSASWGQQEEERV 318
BLAST of Tan0016175 vs. NCBI nr
Match:
KAA0059498.1 (hypothetical protein E6C27_scaffold48829G00010 [Cucumis melo var. makuwa])
HSP 1 Score: 177.9 bits (450), Expect = 9.0e-41
Identity = 115/212 (54.25%), Postives = 126/212 (59.43%), Query Frame = 0
Query: 2 RTVVREFARKNWRTKGDLSITLLDPRHVFIKLSNKEDMHQKEKFLIEGLMFRVFCWSHDF 61
RTVVREFARKNWRTKG D + K FR
Sbjct: 77 RTVVREFARKNWRTKG-------------------FDHSLRSKARFSAGAFRAMV----- 136
Query: 62 RLRNGDPMHAPIWVSRPHLPIVFFFEFRLFSIVSTFGIGLTLEGPTKLVSRPPLTIKLGC 121
+ + R H PIVFFF F VSTFGIGLTLEGPTKLVSRPPL IKLG
Sbjct: 137 -------IQCMRLLGRLHPPIVFFF----FFAVSTFGIGLTLEGPTKLVSRPPLRIKLG- 196
Query: 122 QPVTENGLCSKNQCYPAPTTEVSLRDSDVAAADMQTIEDGPRGEELEPDNNPQASVLQQP 181
++ L S + V+ +VAAADM+TI DGPRGEELEPD NPQAS+LQ+P
Sbjct: 197 ---KKSFLFSSARIADEKVMPVT-NAKNVAAADMETIFDGPRGEELEPDKNPQASLLQEP 248
Query: 182 EYLAIGKSPSVNAAPNTGGSSASWGQQEEERV 214
YLAIGKSPSV AAPNTGGSSASWGQQEEERV
Sbjct: 257 TYLAIGKSPSVEAAPNTGGSSASWGQQEEERV 248
BLAST of Tan0016175 vs. NCBI nr
Match:
VVA33529.1 (PREDICTED: IST1 [Prunus dulcis])
HSP 1 Score: 146.4 bits (368), Expect = 2.9e-31
Identity = 68/73 (93.15%), Postives = 70/73 (95.89%), Query Frame = 0
Query: 41 QKEKFLIEGLMFRVFCWSHDFRLRNGDPMHAPIWVSRPHLPIVFFFEFRLFSIVSTFGIG 100
+KEKFLIEGLM RVF WSHDFRLRNGDPMHAPIWVSRPHLPIVFFFEFRLFSIVS FGIG
Sbjct: 127 KKEKFLIEGLMLRVFRWSHDFRLRNGDPMHAPIWVSRPHLPIVFFFEFRLFSIVSFFGIG 186
Query: 101 LTLEGPTKLVSRP 114
LTL+GPTKLVSRP
Sbjct: 187 LTLDGPTKLVSRP 199
BLAST of Tan0016175 vs. NCBI nr
Match:
WP_131796640.1 (DUF4283 domain-containing protein [Candidatus Frankia datiscae])
HSP 1 Score: 142.9 bits (359), Expect = 3.2e-30
Identity = 65/68 (95.59%), Postives = 66/68 (97.06%), Query Frame = 0
Query: 46 LIEGLMFRVFCWSHDFRLRNGDPMHAPIWVSRPHLPIVFFFEFRLFSIVSTFGIGLTLEG 105
+IEGLMFRVFCWSHDFRLRNGDPMHAPIWVSRPHLP VFFFEF LFSIVSTFGIGLTLEG
Sbjct: 1 MIEGLMFRVFCWSHDFRLRNGDPMHAPIWVSRPHLPRVFFFEFSLFSIVSTFGIGLTLEG 60
Query: 106 PTKLVSRP 114
PTKLVSRP
Sbjct: 61 PTKLVSRP 68
BLAST of Tan0016175 vs. NCBI nr
Match:
EEF27683.1 (conserved hypothetical protein [Ricinus communis])
HSP 1 Score: 104.4 bits (259), Expect = 1.3e-18
Identity = 58/76 (76.32%), Postives = 61/76 (80.26%), Query Frame = 0
Query: 138 APTTEVSLRDSDVAAADMQTIEDGPRGEELEPDNNPQASVLQQPEYLAIGKSPSVNAAPN 197
APT E + AADMQTIEDGPRGEELEPDNNPQA VL++ EYLAIGKSPSVNAA N
Sbjct: 12 APTIET------MPAADMQTIEDGPRGEELEPDNNPQAPVLKKSEYLAIGKSPSVNAASN 71
Query: 198 TGGSSASWGQQEEERV 214
TGGSS S GQQEEE V
Sbjct: 72 TGGSSPSRGQQEEELV 81
BLAST of Tan0016175 vs. ExPASy TrEMBL
Match:
A0A5A7UWF6 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold48829G00010 PE=4 SV=1)
HSP 1 Score: 177.9 bits (450), Expect = 4.3e-41
Identity = 115/212 (54.25%), Postives = 126/212 (59.43%), Query Frame = 0
Query: 2 RTVVREFARKNWRTKGDLSITLLDPRHVFIKLSNKEDMHQKEKFLIEGLMFRVFCWSHDF 61
RTVVREFARKNWRTKG D + K FR
Sbjct: 77 RTVVREFARKNWRTKG-------------------FDHSLRSKARFSAGAFRAMV----- 136
Query: 62 RLRNGDPMHAPIWVSRPHLPIVFFFEFRLFSIVSTFGIGLTLEGPTKLVSRPPLTIKLGC 121
+ + R H PIVFFF F VSTFGIGLTLEGPTKLVSRPPL IKLG
Sbjct: 137 -------IQCMRLLGRLHPPIVFFF----FFAVSTFGIGLTLEGPTKLVSRPPLRIKLG- 196
Query: 122 QPVTENGLCSKNQCYPAPTTEVSLRDSDVAAADMQTIEDGPRGEELEPDNNPQASVLQQP 181
++ L S + V+ +VAAADM+TI DGPRGEELEPD NPQAS+LQ+P
Sbjct: 197 ---KKSFLFSSARIADEKVMPVT-NAKNVAAADMETIFDGPRGEELEPDKNPQASLLQEP 248
Query: 182 EYLAIGKSPSVNAAPNTGGSSASWGQQEEERV 214
YLAIGKSPSV AAPNTGGSSASWGQQEEERV
Sbjct: 257 TYLAIGKSPSVEAAPNTGGSSASWGQQEEERV 248
BLAST of Tan0016175 vs. ExPASy TrEMBL
Match:
A0A5D3BZC3 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold84043G00010 PE=4 SV=1)
HSP 1 Score: 177.9 bits (450), Expect = 4.3e-41
Identity = 115/212 (54.25%), Postives = 126/212 (59.43%), Query Frame = 0
Query: 2 RTVVREFARKNWRTKGDLSITLLDPRHVFIKLSNKEDMHQKEKFLIEGLMFRVFCWSHDF 61
RTVVREFARKNWRTKG D + K FR
Sbjct: 147 RTVVREFARKNWRTKG-------------------FDHSLRSKARFSAGAFRAMV----- 206
Query: 62 RLRNGDPMHAPIWVSRPHLPIVFFFEFRLFSIVSTFGIGLTLEGPTKLVSRPPLTIKLGC 121
+ + R H PIVFFF F VSTFGIGLTLEGPTKLVSRPPL IKLG
Sbjct: 207 -------IQCMRLLGRLHPPIVFFF----FFAVSTFGIGLTLEGPTKLVSRPPLRIKLG- 266
Query: 122 QPVTENGLCSKNQCYPAPTTEVSLRDSDVAAADMQTIEDGPRGEELEPDNNPQASVLQQP 181
++ L S + V+ +VAAADM+TI DGPRGEELEPD NPQAS+LQ+P
Sbjct: 267 ---KKSFLFSSARIADEKVMPVT-NAKNVAAADMETIFDGPRGEELEPDKNPQASLLQEP 318
Query: 182 EYLAIGKSPSVNAAPNTGGSSASWGQQEEERV 214
YLAIGKSPSV AAPNTGGSSASWGQQEEERV
Sbjct: 327 TYLAIGKSPSVEAAPNTGGSSASWGQQEEERV 318
BLAST of Tan0016175 vs. ExPASy TrEMBL
Match:
A0A5E4G1B8 (PREDICTED: IST1 OS=Prunus dulcis OX=3755 GN=ALMOND_2B000361 PE=3 SV=1)
HSP 1 Score: 146.4 bits (368), Expect = 1.4e-31
Identity = 68/73 (93.15%), Postives = 70/73 (95.89%), Query Frame = 0
Query: 41 QKEKFLIEGLMFRVFCWSHDFRLRNGDPMHAPIWVSRPHLPIVFFFEFRLFSIVSTFGIG 100
+KEKFLIEGLM RVF WSHDFRLRNGDPMHAPIWVSRPHLPIVFFFEFRLFSIVS FGIG
Sbjct: 127 KKEKFLIEGLMLRVFRWSHDFRLRNGDPMHAPIWVSRPHLPIVFFFEFRLFSIVSFFGIG 186
Query: 101 LTLEGPTKLVSRP 114
LTL+GPTKLVSRP
Sbjct: 187 LTLDGPTKLVSRP 199
BLAST of Tan0016175 vs. ExPASy TrEMBL
Match:
B9T8X8 (Uncharacterized protein OS=Ricinus communis OX=3988 GN=RCOM_1985520 PE=4 SV=1)
HSP 1 Score: 104.4 bits (259), Expect = 6.1e-19
Identity = 58/76 (76.32%), Postives = 61/76 (80.26%), Query Frame = 0
Query: 138 APTTEVSLRDSDVAAADMQTIEDGPRGEELEPDNNPQASVLQQPEYLAIGKSPSVNAAPN 197
APT E + AADMQTIEDGPRGEELEPDNNPQA VL++ EYLAIGKSPSVNAA N
Sbjct: 12 APTIET------MPAADMQTIEDGPRGEELEPDNNPQAPVLKKSEYLAIGKSPSVNAASN 71
Query: 198 TGGSSASWGQQEEERV 214
TGGSS S GQQEEE V
Sbjct: 72 TGGSSPSRGQQEEELV 81
BLAST of Tan0016175 vs. ExPASy TrEMBL
Match:
A0A6P6VHH0 (uncharacterized protein LOC113721847 OS=Coffea arabica OX=13443 GN=LOC113721847 PE=4 SV=1)
HSP 1 Score: 75.5 bits (184), Expect = 3.0e-10
Identity = 40/101 (39.60%), Postives = 59/101 (58.42%), Query Frame = 0
Query: 16 KGDLSITLLDPRHVFIKLSNKEDMHQ---KEKFLIEGLMFRVFCWSHDFRLRNGDPMHAP 75
KG++S+ LLD RHV I+L +K+D H+ + + G RVF W+ F + + +P P
Sbjct: 97 KGEISVGLLDQRHVLIRLGSKKDFHRLWGRNVRYVVGCPMRVFKWTSSFHV-DKEPSTVP 156
Query: 76 IWVSRPHLPIVFFFEFRLFSIVSTFGIGLTLEGPTKLVSRP 114
+W S P LPI FF + LF IVS G L ++ T ++RP
Sbjct: 157 VWFSLPKLPIHFFKKECLFHIVSCLGRPLFMDAATTSLARP 196
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
TYK04334.1 | 9.0e-41 | 54.25 | hypothetical protein E5676_scaffold84043G00010 [Cucumis melo var. makuwa] | [more] |
KAA0059498.1 | 9.0e-41 | 54.25 | hypothetical protein E6C27_scaffold48829G00010 [Cucumis melo var. makuwa] | [more] |
VVA33529.1 | 2.9e-31 | 93.15 | PREDICTED: IST1 [Prunus dulcis] | [more] |
WP_131796640.1 | 3.2e-30 | 95.59 | DUF4283 domain-containing protein [Candidatus Frankia datiscae] | [more] |
EEF27683.1 | 1.3e-18 | 76.32 | conserved hypothetical protein [Ricinus communis] | [more] |
Match Name | E-value | Identity | Description | |
A0A5A7UWF6 | 4.3e-41 | 54.25 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold... | [more] |
A0A5D3BZC3 | 4.3e-41 | 54.25 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... | [more] |
A0A5E4G1B8 | 1.4e-31 | 93.15 | PREDICTED: IST1 OS=Prunus dulcis OX=3755 GN=ALMOND_2B000361 PE=3 SV=1 | [more] |
B9T8X8 | 6.1e-19 | 76.32 | Uncharacterized protein OS=Ricinus communis OX=3988 GN=RCOM_1985520 PE=4 SV=1 | [more] |
A0A6P6VHH0 | 3.0e-10 | 39.60 | uncharacterized protein LOC113721847 OS=Coffea arabica OX=13443 GN=LOC113721847 ... | [more] |
Match Name | E-value | Identity | Description | |