Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGCCCAAGGGGAGAACTGCAGACTTGCACCCTTTTGATCTTGAGATTGAGAGGACCTGTAGGAGACTCCTAAGGGAAGGAAGAGCAGAACCTCAGGAAATGGCTGATCAGGAGCTCATCAACAACCCTAAACCTATCAGGGAGTATTTCCTACCTGTGTTTAATTCTGAGCAGGCTGGAATAGTCCATGCCCCTATCAATGCTAATAATTTTGAACTTAAGACAGGGCTGATACAAATGGTTAGGGAAGGTGCTTTCAGAGGCTTAGCCACAGAAGATCCTAACTCTCATCTTCAAAATTTCTTAGAAATCTGTGGTACATTTAAAATGAATGGTATCTCTGCAGATAGTATTCGCTTACGTCTTTTTCCTTTCTCTTTGTAG
mRNA sequence
ATGCCCAAGGGGAGAACTGCAGACTTGCACCCTTTTGATCTTGAGATTGAGAGGACCTGTAGGAGACTCCTAAGGGAAGGAAGAGCAGAACCTCAGGAAATGGCTGATCAGGAGCTCATCAACAACCCTAAACCTATCAGGGAGTATTTCCTACCTGTGTTTAATTCTGAGCAGGCTGGAATAGTCCATGCCCCTATCAATGCTAATAATTTTGAACTTAAGACAGGGCTGATACAAATGGTTAGGGAAGGTGCTTTCAGAGGCTTAGCCACAGAAGATCCTAACTCTCATCTTCAAAATTTCTTAGAAATCTGTGGTACATTTAAAATGAATGGTATCTCTGCAGATAGTATTCGCTTACGTCTTTTTCCTTTCTCTTTGTAG
Coding sequence (CDS)
ATGCCCAAGGGGAGAACTGCAGACTTGCACCCTTTTGATCTTGAGATTGAGAGGACCTGTAGGAGACTCCTAAGGGAAGGAAGAGCAGAACCTCAGGAAATGGCTGATCAGGAGCTCATCAACAACCCTAAACCTATCAGGGAGTATTTCCTACCTGTGTTTAATTCTGAGCAGGCTGGAATAGTCCATGCCCCTATCAATGCTAATAATTTTGAACTTAAGACAGGGCTGATACAAATGGTTAGGGAAGGTGCTTTCAGAGGCTTAGCCACAGAAGATCCTAACTCTCATCTTCAAAATTTCTTAGAAATCTGTGGTACATTTAAAATGAATGGTATCTCTGCAGATAGTATTCGCTTACGTCTTTTTCCTTTCTCTTTGTAG
Protein sequence
MPKGRTADLHPFDLEIERTCRRLLREGRAEPQEMADQELINNPKPIREYFLPVFNSEQAGIVHAPINANNFELKTGLIQMVREGAFRGLATEDPNSHLQNFLEICGTFKMNGISADSIRLRLFPFSL
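The protein sequence above can be cross-checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this feature; the names `CODON_TABLE`, `translate`, and `cds_start` are illustrative and not part of the database record. Only the first 30 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), amino acids listed in TCAG codon order.
# Assumption: this feature is translated with the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nucleotides of the CDS listed above (10 codons).
cds_start = "ATGCCCAAGGGGAGAACTGCAGACTTGCAC"
print(translate(cds_start))  # MPKGRTADLH
```

Passing the full CDS to `translate` should reproduce the listed protein sequence followed by `*` for the terminal TAG stop codon.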
Homology
BLAST of Tan0004862 vs. NCBI nr
Match:
WP_217833153.1 (retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002])
HSP 1 Score: 142.5 bits (358), Expect = 2.5e-30
Identity = 74/127 (58.27%), Positives = 90/127 (70.87%), Query Frame = 0
Query: 1 MPKGRTADLHPFDLEIERTCRRLLREGRAEPQEMADQELINNPKPIREYFLPVFNSEQAG 60
MP+ T +L P D EI+RT RR LR + EMA++ PK IR+YF P + Q G
Sbjct: 17 MPRDNT-NLLPLDPEIDRTYRRNLRALLNQTTEMAEE----IPKAIRDYFQPTLPASQPG 76
Query: 61 IVHAPINANNFELKTGLIQMVREGAFRGLATEDPNSHLQNFLEICGTFKMNGISADSIRL 120
I++ PIN NNFELK GLIQM RE AFRG EDP+ HL++FLEICGT KMNG+S D+I+L
Sbjct: 77 IMNVPINVNNFELKPGLIQMARELAFRGRTNEDPHKHLRSFLEICGTVKMNGVSNDAIKL 136
Query: 121 RLFPFSL 128
RLFPFSL
Sbjct: 137 RLFPFSL 138
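The percentages in the HSP summary line are the identical (or positive-scoring) positions divided by the alignment length. As a small illustrative check of the figures reported for this first hit, the hypothetical helper `percent` below recomputes them from the counts 74/127 and 90/127.

```python
def percent(matches: int, alignment_length: int) -> str:
    """Return matches / alignment_length as a percentage with two decimals."""
    return f"{100 * matches / alignment_length:.2f}"


print(percent(74, 127))  # identities: 58.27
print(percent(90, 127))  # positives:  70.87
```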
BLAST of Tan0004862 vs. NCBI nr
Match:
KAG7990634.1 (hypothetical protein I3843_02G035100 [Carya illinoinensis])
HSP 1 Score: 122.5 bits (306), Expect = 2.7e-24
Identity = 62/127 (48.82%), Positives = 82/127 (64.57%), Query Frame = 0
Query: 1 MPKGRTADLHPFDLEIERTCRRLLREGRAEPQEMADQELINNPKPIREYFLPVFNSEQAG 60
M + R+ D+ P D EIERT R L R + MA+++ P+ +++Y PV N +
Sbjct: 1 MRRARSRDIIPVDPEIERTLRSLRRN---KILAMAEEDREVLPRTLKDYVRPVVNGNYSS 60
Query: 61 IVHAPINANNFELKTGLIQMVREGAFRGLATEDPNSHLQNFLEICGTFKMNGISADSIRL 120
I+ PINANNFELK LI MV++ F G +DPN HL FLEIC T K+NG++ D+IRL
Sbjct: 61 IMRQPINANNFELKPALISMVQQAQFSGSPLDDPNIHLAMFLEICDTVKINGVTEDTIRL 120
Query: 121 RLFPFSL 128
RLFPFSL
Sbjct: 121 RLFPFSL 124
BLAST of Tan0004862 vs. NCBI nr
Match:
XP_017428686.1 (PREDICTED: uncharacterized protein LOC108336731 [Vigna angularis])
HSP 1 Score: 121.7 bits (304), Expect = 4.5e-24
Identity = 66/133 (49.62%), Positives = 83/133 (62.41%), Query Frame = 0
Query: 12 FDLEIERTCR-----------------RLLREGRAEPQEMADQELINNPKPIREYFLPVF 71
FD EIERT R RL+ +G+ E MADQE + K +R+Y +P
Sbjct: 18 FDSEIERTARRNRSSARRRKRERRQEQRLIEQGK-ETSTMADQEPVR--KTLRDYSMPNP 77
Query: 72 NSEQAGIVHAPINANNFELKTGLIQMVREGAFRGLATEDPNSHLQNFLEICGTFKMNGIS 128
NS Q IV PI ANNFE+K L+Q++++ F G +EDPNSHL+NFL IC T K NG+S
Sbjct: 78 NSYQGSIVRPPIQANNFEIKPALLQVIQQNQFGGTNSEDPNSHLENFLAICDTLKYNGVS 137
BLAST of Tan0004862 vs. NCBI nr
Match:
XP_022843226.1 (uncharacterized protein LOC111366761 [Olea europaea var. sylvestris])
HSP 1 Score: 120.6 bits (301), Expect = 1.0e-23
Identity = 66/132 (50.00%), Positives = 81/132 (61.36%), Query Frame = 0
Query: 1 MPKGRTADLHPFDLEIERTCRRLLREGRAEPQEMADQEL-----INNPKPIREYFLPVFN 60
M + R DL D E ERT R L R E + MA+Q++ N + IR+Y PV N
Sbjct: 94 MRRARNLDLLHVDPEPERTFRILRGIQRNEREAMAEQDVRAANEDNQQRAIRDYIRPVVN 153
Query: 61 SEQAGIVHAPINANNFELKTGLIQMVREGAFRGLATEDPNSHLQNFLEICGTFKMNGISA 120
+GI I A NFELK GLI MV++ F G A EDPN+HL +FLEIC T KMNG++
Sbjct: 154 DNYSGIARPAIVAKNFELKPGLIDMVQQNQFGGAAVEDPNAHLGSFLEICDTVKMNGVTE 213
Query: 121 DSIRLRLFPFSL 128
D+IRLRLF FSL
Sbjct: 214 DAIRLRLFSFSL 225
BLAST of Tan0004862 vs. NCBI nr
Match:
PKA48167.1 (hypothetical protein AXF42_Ash020512 [Apostasia shenzhenica])
HSP 1 Score: 119.8 bits (299), Expect = 1.7e-23
Identity = 63/142 (44.37%), Positives = 84/142 (59.15%), Query Frame = 0
Query: 1 MPKGRTADLHPFDLEIERTCRRLLREGRAEP-------------QEMADQELINNP-KPI 60
M + DL PFD EIERT ++ R+ + + +M DQ I + +
Sbjct: 1 MTRSSKKDLAPFDPEIERTIAKITRQLKEQEVRGQLNKVKKDLFTKMEDQPTIQEAGRAL 60
Query: 61 REYFLPVFNSEQAGIVHAPINANNFELKTGLIQMVREGA-FRGLATEDPNSHLQNFLEIC 120
REY LP N +V + ANNFE+K LIQM+++ F GL ++DPN+H+ NFLEIC
Sbjct: 61 REYALPSINGANTSVVRPAVQANNFEIKPALIQMIQQSVQFYGLPSDDPNTHIANFLEIC 120
Query: 121 GTFKMNGISADSIRLRLFPFSL 128
TFK NG+S D+IRLRLFPFSL
Sbjct: 121 DTFKHNGVSDDAIRLRLFPFSL 142
BLAST of Tan0004862 vs. ExPASy TrEMBL
Match:
A0A2H9ZY12 (Uncharacterized protein OS=Apostasia shenzhenica OX=1088818 GN=AXF42_Ash020512 PE=4 SV=1)
HSP 1 Score: 119.8 bits (299), Expect = 8.4e-24
Identity = 63/142 (44.37%), Positives = 84/142 (59.15%), Query Frame = 0
Query: 1 MPKGRTADLHPFDLEIERTCRRLLREGRAEP-------------QEMADQELINNP-KPI 60
M + DL PFD EIERT ++ R+ + + +M DQ I + +
Sbjct: 1 MTRSSKKDLAPFDPEIERTIAKITRQLKEQEVRGQLNKVKKDLFTKMEDQPTIQEAGRAL 60
Query: 61 REYFLPVFNSEQAGIVHAPINANNFELKTGLIQMVREGA-FRGLATEDPNSHLQNFLEIC 120
REY LP N +V + ANNFE+K LIQM+++ F GL ++DPN+H+ NFLEIC
Sbjct: 61 REYALPSINGANTSVVRPAVQANNFEIKPALIQMIQQSVQFYGLPSDDPNTHIANFLEIC 120
Query: 121 GTFKMNGISADSIRLRLFPFSL 128
TFK NG+S D+IRLRLFPFSL
Sbjct: 121 DTFKHNGVSDDAIRLRLFPFSL 142
BLAST of Tan0004862 vs. ExPASy TrEMBL
Match:
A0A1U8Q202 (uncharacterized protein LOC109114208 OS=Nelumbo nucifera OX=4432 GN=LOC109114208 PE=4 SV=1)
HSP 1 Score: 115.5 bits (288), Expect = 1.6e-22
Identity = 63/121 (52.07%), Positives = 79/121 (65.29%), Query Frame = 0
Query: 11 PFDLEIERT-CRRL--LREGRAEPQEMADQELINNPKPIREYFLPVFNSEQAGIVHAPIN 70
P+D EIERT C RL R+ RAE +EMA+ P+ + +Y P IV I+
Sbjct: 5 PYDPEIERTLCIRLRAARQVRAETEEMAE------PRTMMDYAKPTLTGAALSIVRPAIS 64
Query: 71 ANNFELKTGLIQMVREGA-FRGLATEDPNSHLQNFLEICGTFKMNGISADSIRLRLFPFS 128
ANNFE+K +IQM++ F G+A EDPNSH+ NFLEIC TFK NG+S D +RLRLFPFS
Sbjct: 65 ANNFEIKPAIIQMIQNTVQFCGMANEDPNSHIANFLEICDTFKHNGVSDDVVRLRLFPFS 119
BLAST of Tan0004862 vs. ExPASy TrEMBL
Match:
A0A6J1DU19 (uncharacterized protein LOC111024361 OS=Momordica charantia OX=3673 GN=LOC111024361 PE=4 SV=1)
HSP 1 Score: 114.8 bits (286), Expect = 2.7e-22
Identity = 59/98 (60.20%), Positives = 68/98 (69.39%), Query Frame = 0
Query: 30 EPQEMADQELINNPKPIREYFLPVFNSEQAGIVHAPINANNFELKTGLIQMVREGAFRGL 89
+P E E NN IR+Y P F GI++ PINANN ELK GLIQMVRE FRG
Sbjct: 12 QPMERPQLEQ-NNQMTIRDYCQPNF-PNHVGIINLPINANNSELKPGLIQMVRENTFRGN 71
Query: 90 ATEDPNSHLQNFLEICGTFKMNGISADSIRLRLFPFSL 128
ATEDPN+HL FL++CGT KMNG+ D+IRLRLFP SL
Sbjct: 72 ATEDPNNHLTIFLDVCGTVKMNGVIDDAIRLRLFPLSL 107
BLAST of Tan0004862 vs. ExPASy TrEMBL
Match:
A0A1S3UKD4 (uncharacterized protein LOC106766267 OS=Vigna radiata var. radiata OX=3916 GN=LOC106766267 PE=4 SV=1)
HSP 1 Score: 114.0 bits (284), Expect = 4.6e-22
Identity = 63/130 (48.46%), Positives = 79/130 (60.77%), Query Frame = 0
Query: 12 FDLEIERT-------CRRLLREGRAEPQEMADQELIN------NP-KPIREYFLPVFNSE 71
FD IERT RR RE R E + + +E + NP K IR+Y +P N
Sbjct: 12 FDSRIERTARSNRSSARRRKRERRKEQRRIEQEEETSTMTEEQNPRKTIRDYSMPDPNGY 71
Query: 72 QAGIVHAPINANNFELKTGLIQMVREGAFRGLATEDPNSHLQNFLEICGTFKMNGISADS 128
Q IV PI ANNFE+K L+Q++++ F G +EDPNSHL+NFL IC T K NG+S D+
Sbjct: 72 QGSIVRPPIQANNFEIKPALLQVIQQNQFGGAVSEDPNSHLENFLAICDTLKYNGVSDDA 131
BLAST of Tan0004862 vs. ExPASy TrEMBL
Match:
A0A6P6W382 (uncharacterized protein LOC113729769 OS=Coffea arabica OX=13443 GN=LOC113729769 PE=4 SV=1)
HSP 1 Score: 112.5 bits (280), Expect = 1.3e-21
Identity = 62/136 (45.59%), Positives = 80/136 (58.82%), Query Frame = 0
Query: 8 DLHPFDLEIERTCRRLLRE-GRAEPQEM------------ADQELINNP---KPIREYFL 67
++ PFD EIER RR R E QE+ ++E+ N + +R++ L
Sbjct: 7 EVAPFDPEIERALRRQRRNTPHQEEQEIWQPIEEILIELPFEEEIAENEPNRRILRDFAL 66
Query: 68 PVFNSEQAGIVHAPINANNFELKTGLIQMVREGAFRGLATEDPNSHLQNFLEICGTFKMN 127
P Q I +NANNFE+K LIQMV++ + G ATEDPNSHL FLEIC T K N
Sbjct: 67 PETQGSQTSIARPMVNANNFEIKPSLIQMVQQSQYGGNATEDPNSHLSTFLEICDTIKFN 126
The following BLAST results are available for this feature:
BLAST of Tan0004862 vs. NCBI nr
Match Name | E-value | Identity (%) | Description
WP_217833153.1 | 2.5e-30 | 58.27 | retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 70...
KAG7990634.1 | 2.7e-24 | 48.82 | hypothetical protein I3843_02G035100 [Carya illinoinensis]
XP_017428686.1 | 4.5e-24 | 49.62 | PREDICTED: uncharacterized protein LOC108336731 [Vigna angularis]
XP_022843226.1 | 1.0e-23 | 50.00 | uncharacterized protein LOC111366761 [Olea europaea var. sylvestris]
PKA48167.1 | 1.7e-23 | 44.37 | hypothetical protein AXF42_Ash020512 [Apostasia shenzhenica]
BLAST of Tan0004862 vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A2H9ZY12 | 8.4e-24 | 44.37 | Uncharacterized protein OS=Apostasia shenzhenica OX=1088818 GN=AXF42_Ash020512 P...
A0A1U8Q202 | 1.6e-22 | 52.07 | uncharacterized protein LOC109114208 OS=Nelumbo nucifera OX=4432 GN=LOC109114208...
A0A6J1DU19 | 2.7e-22 | 60.20 | uncharacterized protein LOC111024361 OS=Momordica charantia OX=3673 GN=LOC111024...
A0A1S3UKD4 | 4.6e-22 | 48.46 | uncharacterized protein LOC106766267 OS=Vigna radiata var. radiata OX=3916 GN=LO...
A0A6P6W382 | 1.3e-21 | 45.59 | uncharacterized protein LOC113729769 OS=Coffea arabica OX=13443 GN=LOC113729769 ...