Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGTGACAATTTGATGGATAAAGTTAGTGCCTTGGGTGAACGCCTAAAGATTAGTGGGACAGAGATGAGCCGGAAGATGAGTGCAGGTGTCAGTTCTATGAGTTTCAAAATGAAGGAACTCTTTCAAGGTCCTAACCAGGACGATAAACTTGTTGAAGATGCCACATCAGAGACACTGGAAGAGCCGGATTGGGCCCTGAATCTTGAAATCTGTGACATGGTTAATTCTGAAAAAATCAACAGCATAGACTTGATTCGTGGGATTAAAAAACGAATAATGGTAAAAAGCCCCAGGATCCAGTATTTGGCATTGGTGCTGCTTGAGACGTGTGTTAAAAATTGTGAGAAGTGTTTCTCCGAGGTGGCAGCGGAGAGAGTTCTTGATGAGATGGTGAAGCTAATTGATGATCCGCAAACTGTTGTCAATAATCGGAACAAAGCTTTAATGTTGATTGAAGCATGGGGGGAATCAACTGGTGAACTTCGTTATTTGCCTGTTTATGAAGAAACGTACAAGGTATCCACATATTTCAGTACTCTCTTTCTTTGCTAAATATCTGGAATTCGGTTATGAAAAATATATAGGGCCATGGCATAATCTAAGTGGATTTATTAGTTCTGAAGGACATTTTTTTTTCTCCTTTCTTTCTTCTTAAAAGAAAACTTCCATCAATTGGCGTTGAAATTTTCTAAGTTTACTCATGGAATGATAGATACTAAGCATATATTTACTGTCACAATATTGTTTAAGAAACTTGAGTCAAATTGTGGTAACTTGGCAACAAACTATGTTTAACGCTTGGACATAAGAAGTATGCGGGACAAGCTTATCTTGGGAGGAACGGCTTGATTTAATTAAAAGGGAAGTAGCTTCATGGTGTGTTCTTTCCATATGCTTTTATAATTACTCTCATGCTGATGTATGCTTGTTGTATTTCCTTCAAATGTATTTCAGATTGAGTAGGGACGGTTTTAGTGTTAGTATTAAGGTTGGTCTGATTTTGTCTGGATTTGAATTTCTTTTGTTTTTGGGTACAGATGAAGTATGTAGGACAGTAGAAAAGATTGCATTATGCTATGAACTTATTATATAGAAAGCTTATAGACTGTTTCCAAAGCAACCATAATTAAATAAATTGAAATGGGAAACTGAGAATTGTGTGTAAATTTTTATTCAACAATTCTAAATTGTCAGGGTTTCTAATCTCATGCTCCAGTATGGAGTCCAGTCCCCAAAACAATATTTTTTTGAAGAGTGGAATTCCCTAAAGAACAGTATTCCAAAAACAGGATTTTTGTTCCAAAAGGAGGTTTGAAATTATTAATATATGAAATCATTTTTTTTTTCATCTGGTTATCCCCTTGAATGAATAGATATGGCAACCAGTTCTTTGTTGTGTATCAGCAATGGAGGGTATTTTAGCATATGTCATCTTAATTGACTTGAAGTGGTTTGACTGACTGCACAAATAGGATTCTCGTAATTCGTAAAAAAAAGTTGTGCACGTGGAGTGGTTTGGCTACTGCACAAACCAGGAAGCTAGTACTTCTTAACTCTACTTTTGAGGGAAGTCCATAGAGTAGTGTTGAGGGGGATCTTGCAAAATATATAAAGCTTCTAGAACTTTTTTGTAACTATTCTCCAAATGCAATTTTTCTTAGTTGGAGGCTTTCTTGTAAATCCTTCATTTGGTTGAGTTGCTCAAGTATAACAATCATTTGAACGTAAAATTCTAGGTTAAAAGTGTCGTTAATTTCTTGAAGTATAAATATTTGTTTTGGTAGATCAAATGGTGAAGCTAAGTTTAAAGGCAGTGCCTCTTCTTGTGTACAATGTGCGCAATATTGTTTAGGGCAGATAATGAGAACATGATATCTATGGTCTTCTATAAGATCTGGAACTTGAAGTCTAACCATATCCCAGTAAGGGATTTGAGGTAAATGCCGTGAAAATTTTCACAGCACCTTTTATTGGCAACCTTTAGATAGATCAAATTGATTTATGATGTTTTTTATACAAATGCTTGCCGTTCTTTGCTGAGAAGCGTAGGATAGAAGGTGCTGAATGGTTTGTAAAGCTCAATCTACTCTTATACACCATCAAGGAGCAAGCCTTAGATCTTAGTATATGTGGGTCCATCTTTGGAGGAATTCTTTACAAATTCACCCAGTTGCACCCGATGAAGAAACGCTCACTCCCTTGTTCAAGTGGCAACATCCACATCAGTCTCCTTTGCTTTAGAAAAATCCCTCTCCCAGGTGGACCAAGGTGACATGAACATGTGGCCTTCCTCTTTGATATCAAATACCAGTTATAAATGAGACGATCCCTGGATCCTAATAAATAAATAAACAAAAGTTGATATCATTCAATCCATCCCCTAAAATGAAACCCCTGAAGAAAGTGGCAAGCCTGATCCATTACAAGATGTAAGACACCTGCAATTCGCAAAGAACTCTCTTGCCTTCAAACGAACCTTCCTCATCATAAACTCCAAAAATAAGCCTCTTTCCTTTTAAGCATACCCATCAGACATAACTCCATTGATTCACTATAATTCGTGTTTTTAGTCCTTTGAAGTGGATTACTAAGTGCAAGAACTTGATTGTGTCCTTTTCACTACTGCTGATTTGTAAGCTTATCTCGTCAAATTGCATGTAGTTGCCCCGCAAACAGGAAAGCACCAACTGTAACTTATGTTGGACAGATTGAGCTTACAGTTTTTGGCTTGGGCAATCCTTGGTCTCGTGTTAGGAAAATCTTTTATGTTATTTGTTATCCCTTAGTTATGGCGCATTGACGGATACTGGATAAGGCAACCCATTGCTCTGTTTGACGTGTTGTACTGTGAATCAGGTTTTTAATGATCTGTTTTTGAATGCTTAATTTGTTTTCTTTTGAAATCTGCGATTATTTAACTTGTATTCTCTCTGCAGAATTAAGATTGTTTCCACATGCTAAAAGGCAGTTTCTTTAGACAAGGAAATTCTTGTTCTAATGGTTCAAGGGAGAAACTTTCTAGATATGTTAATTTTCAGTTAAATTAATTGTCATATTCATAGCCTTTTAAAGTAATAATTTCCAAGATCTTTGATGTCCAATCATCAATCAGTGATAGTTTAGCATTTAAACTTAACCATCCATTTCGAATCTTATATGCCCCATAGGTCATAGCTAATGATATGAAAACACAAAGTTTTCTAATTTTGGACTTTTTATCAGTTTCTGTCTTGCTAGAGAAGTTGGAGAGTAACAGTTTGGAAATTATTTTGCTAGTTATTGTTTTAGATGCTTTCCCCGTGGATTGAGAGAATTTTATTAGTTATTACAGTTTCTTATGTTTGCATTTATCAGAGTTTAAAATCAAGAGGTATTCGATTCCCTGGTCGGGACAATGAGAGTCTTGCACCCATTTTCACTCCTCCTCGCACAATTTCCGCTACGGAGACAGAAGCCAGTCATATCGAACAGTTTCATCACGATATTCCCGTGCAAACCTTTACAGCTGAAGAAACAAAGGAAGCATTTGACGTCGCAAGGAACAGCATTGAGCTTCTCTCAACAGTGTTATCCTCATCACCCGCTCAGGATACTTCAGAGGTAATGCATCTTTTCATAGTGTTCTGAGTTTTCTCTCTCTCACTTAATATGCATATGCTTTAAACGTTTCAATATATGATACAATTGGATGCTTGGAACTATTCATGGTCTGGCATCGAAAACTCGGTGAACTATCACAGCTGATCTGAAACACGAACATTTTCATGGTCATATCCTAATAATTGGATTTATTTAAGATGGTGCTCAATTTCCGTGAGATCCTACATCGATTGGGGAGGAGAACAAAACACCTCTTATAAGGGTGTGGAAACTTCTCCATAGTAAATGTTTTTAAAACCTTGAGGGAAAGCCCTAAAAGAAAAACCTAAAGAGGACAATATCTTCTAGCGAGGGCATGGGCCGTTACAACTGCCATTTGTATAACCAATATTTTACCAATTTCCCCCCTCCCTTTCTCCCTGAACTTTGAACCTTGCAACTCTTCAACTTTTATTGAGGCTCATGCTTATGAAGGGATTGGTTACTGTGTCTGAAAATTGAAGTTGGTCAGAACTATAATATTAGGCTCCTTCAAAATCTGAATTGGAACTCGACTATCCTGCAATGAGAATCATCGTGAAAGCGTCGTGTCCAATAATATTCACTTTCTGGGTTCCTTTTGATATTCATTACCGTGAGATTTAGGGAATGTTCGTTGTCTTTTGATTATCAGATTTGCGAGAAATTTTGTTAAATCGGACTTTGGAAGTAGAGATTATGTTTTGGTATATTAGTTTATTAGGACTTAGGACTCCTTATTTTTGCTTTGATGCTTAAGTTTCCTAAAAAGGAGTGGCTATGACCCATGGGATTCTCCTTATGTTGCTCGTGGAAGGTTTTATCGTAATCCATGATATCGAGACATCATAAAACCTGATTATTTTTCTCCTAAGTTGTTTCTTGCCCTATGCAGGATGATCTGACCGGGACACTCGTACTACAATGTCGCCAATCACGATCAGCCATCCAAAGAATTATCGAGACCGCTGAAGACAACGAGGCCCTTCTTTTCGAGGCATTGAATGTGAATGATGAAATTCAGAAAGTCCTTTCAAAGTGCGAAGATCTGAAGAAGCCCTCAACAACTTCTCCACGTGAACAAGAACCTGCCATGATACCCATTGCTGGGGAACCTGATGAATCTCCGAGTCACGCCAAGGAAGAAGATGCTCTGGTCAGAAAAGCTGCCACTTCCGGCAGCCGGCCTGGCGGCGGAAGCAGTGATGAAATGATGGATGATCTTGATGAGATGATATTTGGGAAGAAAGGTGGAAGTGCATCTGATCGGGGACAGGAACCCAAGAAGCCAGATTCATCAAAAGACAATGATCTCATTTCCTTTTGATTTTGAGCTTTTGGTTTGAACTGATTCATGATAATCACACTCTCAATTTTTAGAAAGATTTATTTGATGTTAAAAGATCTTTATGTAACTTTACATGTTTAGCTGCATTTTATTATAAACTTCTACACTAACTAATGCACATTTTCTATTATGAACTTCCAAGTGTTGAAAAGATTACTTTCATTTATTTGAATCATCACTCCCAACTGAACTTGAG
mRNA sequence
ATGAGTGACAATTTGATGGATAAAGTTAGTGCCTTGGGTGAACGCCTAAAGATTAGTGGGACAGAGATGAGCCGGAAGATGAGTGCAGGTGTCAGTTCTATGAGTTTCAAAATGAAGGAACTCTTTCAAGGTCCTAACCAGGACGATAAACTTGTTGAAGATGCCACATCAGAGACACTGGAAGAGCCGGATTGGGCCCTGAATCTTGAAATCTGTGACATGGTTAATTCTGAAAAAATCAACAGCATAGACTTGATTCGTGGGATTAAAAAACGAATAATGGTAAAAAGCCCCAGGATCCAGTATTTGGCATTGGTGCTGCTTGAGACGTGTGTTAAAAATTGTGAGAAGTGTTTCTCCGAGGTGGCAGCGGAGAGAGTTCTTGATGAGATGGTGAAGCTAATTGATGATCCGCAAACTGTTGTCAATAATCGGAACAAAGCTTTAATGTTGATTGAAGCATGGGGGGAATCAACTGGTGAACTTCGTTATTTGCCTGTTTATGAAGAAACGTACAAGAGTTTAAAATCAAGAGGTATTCGATTCCCTGGTCGGGACAATGAGAGTCTTGCACCCATTTTCACTCCTCCTCGCACAATTTCCGCTACGGAGACAGAAGCCAGTCATATCGAACAGTTTCATCACGATATTCCCGTGCAAACCTTTACAGCTGAAGAAACAAAGGAAGCATTTGACGTCGCAAGGAACAGCATTGAGCTTCTCTCAACAGTGTTATCCTCATCACCCGCTCAGGATACTTCAGAGGATGATCTGACCGGGACACTCGTACTACAATGTCGCCAATCACGATCAGCCATCCAAAGAATTATCGAGACCGCTGAAGACAACGAGGCCCTTCTTTTCGAGGCATTGAATGTGAATGATGAAATTCAGAAAGTCCTTTCAAAGTGCGAAGATCTGAAGAAGCCCTCAACAACTTCTCCACGTGAACAAGAACCTGCCATGATACCCATTGCTGGGGAACCTGATGAATCTCCGAGTCACGCCAAGGAAGAAGATGCTCTGGTCAGAAAAGCTGCCACTTCCGGCAGCCGGCCTGGCGGCGGAAGCAGTGATGAAATGATGGATGATCTTGATGAGATGATATTTGGGAAGAAAGGTGGAAGTGCATCTGATCGGGGACAGGAACCCAAGAAGCCAGATTCATCAAAAGACAATGATCTCATTTCCTTTTGATTTTGAGCTTTTGGTTTGAACTGATTCATGATAATCACACTCTCAATTTTTAGAAAGATTTATTTGATGTTAAAAGATCTTTATGTAACTTTACATGTTTAGCTGCATTTTATTATAAACTTCTACACTAACTAATGCACATTTTCTATTATGAACTTCCAAGTGTTGAAAAGATTACTTTCATTTATTTGAATCATCACTCCCAACTGAACTTGAG
Coding sequence (CDS)
ATGAGTGACAATTTGATGGATAAAGTTAGTGCCTTGGGTGAACGCCTAAAGATTAGTGGGACAGAGATGAGCCGGAAGATGAGTGCAGGTGTCAGTTCTATGAGTTTCAAAATGAAGGAACTCTTTCAAGGTCCTAACCAGGACGATAAACTTGTTGAAGATGCCACATCAGAGACACTGGAAGAGCCGGATTGGGCCCTGAATCTTGAAATCTGTGACATGGTTAATTCTGAAAAAATCAACAGCATAGACTTGATTCGTGGGATTAAAAAACGAATAATGGTAAAAAGCCCCAGGATCCAGTATTTGGCATTGGTGCTGCTTGAGACGTGTGTTAAAAATTGTGAGAAGTGTTTCTCCGAGGTGGCAGCGGAGAGAGTTCTTGATGAGATGGTGAAGCTAATTGATGATCCGCAAACTGTTGTCAATAATCGGAACAAAGCTTTAATGTTGATTGAAGCATGGGGGGAATCAACTGGTGAACTTCGTTATTTGCCTGTTTATGAAGAAACGTACAAGAGTTTAAAATCAAGAGGTATTCGATTCCCTGGTCGGGACAATGAGAGTCTTGCACCCATTTTCACTCCTCCTCGCACAATTTCCGCTACGGAGACAGAAGCCAGTCATATCGAACAGTTTCATCACGATATTCCCGTGCAAACCTTTACAGCTGAAGAAACAAAGGAAGCATTTGACGTCGCAAGGAACAGCATTGAGCTTCTCTCAACAGTGTTATCCTCATCACCCGCTCAGGATACTTCAGAGGATGATCTGACCGGGACACTCGTACTACAATGTCGCCAATCACGATCAGCCATCCAAAGAATTATCGAGACCGCTGAAGACAACGAGGCCCTTCTTTTCGAGGCATTGAATGTGAATGATGAAATTCAGAAAGTCCTTTCAAAGTGCGAAGATCTGAAGAAGCCCTCAACAACTTCTCCACGTGAACAAGAACCTGCCATGATACCCATTGCTGGGGAACCTGATGAATCTCCGAGTCACGCCAAGGAAGAAGATGCTCTGGTCAGAAAAGCTGCCACTTCCGGCAGCCGGCCTGGCGGCGGAAGCAGTGATGAAATGATGGATGATCTTGATGAGATGATATTTGGGAAGAAAGGTGGAAGTGCATCTGATCGGGGACAGGAACCCAAGAAGCCAGATTCATCAAAAGACAATGATCTCATTTCCTTTTGA
Protein sequence
MSDNLMDKVSALGERLKISGTEMSRKMSAGVSSMSFKMKELFQGPNQDDKLVEDATSETLEEPDWALNLEICDMVNSEKINSIDLIRGIKKRIMVKSPRIQYLALVLLETCVKNCEKCFSEVAAERVLDEMVKLIDDPQTVVNNRNKALMLIEAWGESTGELRYLPVYEETYKSLKSRGIRFPGRDNESLAPIFTPPRTISATETEASHIEQFHHDIPVQTFTAEETKEAFDVARNSIELLSTVLSSSPAQDTSEDDLTGTLVLQCRQSRSAIQRIIETAEDNEALLFEALNVNDEIQKVLSKCEDLKKPSTTSPREQEPAMIPIAGEPDESPSHAKEEDALVRKAATSGSRPGGGSSDEMMDDLDEMIFGKKGGSASDRGQEPKKPDSSKDNDLISF
Homology
BLAST of CmaCh12G008560.1 vs. ExPASy Swiss-Prot
Match:
Q9LFL3 (TOM1-like protein 1 OS=Arabidopsis thaliana OX=3702 GN=TOL1 PE=1 SV=1)
HSP 1 Score: 543.9 bits (1400), Expect = 1.5e-153
Identity = 295/409 (72.13%), Postives = 338/409 (82.64%), Query Frame = 0
Query: 1 MSDNLMDKVSALGERLKISGTEMSRKMSAGVSSMSFKMKELFQGPNQDDKLVEDATSETL 60
M DNLMDKV+A GERLKI G+E+S K+SAGVSSMSFK+KELFQGPN DK+VEDAT+E L
Sbjct: 1 MGDNLMDKVTAFGERLKIGGSEVSNKISAGVSSMSFKVKELFQGPNPTDKIVEDATTENL 60
Query: 61 EEPDWALNLEICDMVNSEKINSIDLIRGIKKRIMVKSPRIQYLALVLLETCVKNCEKCFS 120
EEPDW +NLEICDM+N E INS++LIRGIKKRIM+K PRIQYLALVLLETCVKNCEK FS
Sbjct: 61 EEPDWDMNLEICDMINQETINSVELIRGIKKRIMMKQPRIQYLALVLLETCVKNCEKAFS 120
Query: 121 EVAAERVLDEMVKLIDDPQTVVNNRNKALMLIEAWGESTGELRYLPVYEETYKSLKSRGI 180
EVAAERVLDEMVKLIDDPQTVVNNRNKALMLIEAWGEST ELRYLPV+EETYKSLK+RGI
Sbjct: 121 EVAAERVLDEMVKLIDDPQTVVNNRNKALMLIEAWGESTSELRYLPVFEETYKSLKARGI 180
Query: 181 RFPGRDNESLAPIFTPPRTISATETEASHIEQFH------HDIPVQTFTAEETKEAFDVA 240
RFPGRDNESLAPIFTP R+ A E A + H +D+PV++FTAE+TKEAFD+A
Sbjct: 181 RFPGRDNESLAPIFTPARSTPAPELNADLPQHVHEPAHIQYDVPVRSFTAEQTKEAFDIA 240
Query: 241 RNSIELLSTVLSSSPAQDTSEDDLTGTLVLQCRQSRSAIQRIIETAEDNEALLFEALNVN 300
RNSIELLSTVLSSSP D +DDLT TLV QCRQS++ +QRIIETA +NEALLFEALNVN
Sbjct: 241 RNSIELLSTVLSSSPQHDALQDDLTTTLVQQCRQSQTTVQRIIETAGENEALLFEALNVN 300
Query: 301 DEIQKVLSKCEDLKKPSTTSPREQEPAMIPIAGEPDESPSHAKEEDALVRKAA--TSGSR 360
DE+ K LSK E++ KPS EPAMIP+A EPD+SP H +EE +LVRK++ G
Sbjct: 301 DELVKTLSKYEEMNKPSAPL-TSHEPAMIPVAEEPDDSPIHGREE-SLVRKSSGVRGGFH 360
Query: 361 PGGGSSDEMMDDLDEMIFGKKGG--SASDRGQEPKK-PDSSKDNDLISF 399
GGGS D+MMDDLDEMIFGKK G S+++ +PKK SSK++DLI F
Sbjct: 361 GGGGSGDDMMDDLDEMIFGKKNGCDSSTNPDHDPKKEQSSSKNDDLIRF 407
BLAST of CmaCh12G008560.1 vs. ExPASy Swiss-Prot
Match:
Q9LNC6 (TOM1-like protein 2 OS=Arabidopsis thaliana OX=3702 GN=TOL2 PE=1 SV=1)
HSP 1 Score: 222.2 bits (565), Expect = 1.0e-56
Identity = 140/349 (40.11%), Postives = 210/349 (60.17%), Query Frame = 0
Query: 8 KVSALGERLKISGTEMSRKMSAGVSSMSFKMKELFQGPNQDDKLVEDATSETLEEPDWAL 67
K++ GE+LK G +MSR +S K+K++ Q P + K+V++AT ETLEEP+W +
Sbjct: 5 KIAEWGEKLKTGGAQMSRMVSE-------KVKDMLQAPTLESKMVDEATLETLEEPNWGM 64
Query: 68 NLEICDMVNSEKINSIDLIRGIKKRIMVKSPRIQYLALVLLETCVKNCEKCFSEVAAERV 127
N+ IC +N+++ N +++R IK++I KSP Q L+L LLE C NCEK FSEVA+E+V
Sbjct: 65 NMRICAQINNDEFNGTEIVRAIKRKISGKSPVSQRLSLELLEACAMNCEKVFSEVASEKV 124
Query: 128 LDEMVKLIDDPQTVVNNRNKALMLIEAWGESTGELRYLPVYEETYKSLK-SRGIRFPGRD 187
LDEMV LI + + NR +A LI AWG+S +L YLPV+ +TY SL+ G+ G +
Sbjct: 125 LDEMVWLIKNGEADSENRKRAFQLIRAWGQSQ-DLTYLPVFHQTYMSLEGENGLHARGEE 184
Query: 188 N--------ESL--API-FTPPRTISATETEASHIEQFHHDIPVQTFTAEETKEAFDVAR 247
N ESL P+ PP + E + + D + ++ KE ++ R
Sbjct: 185 NSMPGQSSLESLMQRPVPVPPPGSYPVPNQEQALGDDDGLDYNFGNLSIKDKKEQIEITR 244
Query: 248 NSIELLSTVLSSSPAQDTSEDDLTGTLVLQCRQSRSAIQRIIETAEDNEALLFEALNVND 307
NS+ELLS++L++ + +EDDLT +L+ +C+QS+ IQ IIE+ D+E +LFEAL++ND
Sbjct: 245 NSLELLSSMLNTEGKPNHTEDDLTVSLMEKCKQSQPLIQMIIESTTDDEGVLFEALHLND 304
Query: 308 EIQKVLSKCEDLKKPSTTSPR----EQEPAMIPIAGEPDESPSHAKEED 341
E+Q+VLS KKP T + EQE +G D P ++E+
Sbjct: 305 ELQQVLS---SYKKPDETEKKASIVEQES-----SGSKDTGPKPTEQEE 337
BLAST of CmaCh12G008560.1 vs. ExPASy Swiss-Prot
Match:
Q8L860 (TOM1-like protein 9 OS=Arabidopsis thaliana OX=3702 GN=TOL9 PE=1 SV=1)
HSP 1 Score: 133.7 bits (335), Expect = 4.7e-30
Identity = 100/281 (35.59%), Postives = 153/281 (54.45%), Query Frame = 0
Query: 51 LVEDATSETLEEPDWALNLEICDMVNSEKINSIDLIRGIKKRIMVKSPRIQYLALVLLET 110
+VE ATSE L PDWA+NLEICDM+NS+ + D+++GIKKRI ++P+ Q LAL LLET
Sbjct: 5 MVERATSEMLIGPDWAMNLEICDMLNSDPAQAKDVVKGIKKRIGSRNPKAQLLALTLLET 64
Query: 111 CVKNC-EKCFSEVAAERVLDEMVKLIDDPQTVVNNRNKALMLIEAWGESTG--ELRYLPV 170
VKNC + VA + V+ EMV+++ + + + K L+LI+ W E+ G RY P
Sbjct: 65 IVKNCGDMVHMHVAEKGVIHEMVRIV-KKKPDFHVKEKILVLIDTWQEAFGGPRARY-PQ 124
Query: 171 YEETYKSLKSRGIRFPGRDNESLAPIFTPPRTISATE--------TEASHIEQFHHDIPV 230
Y Y+ L G FP R S AP+FTPP+T T + + + +
Sbjct: 125 YYAGYQELLRAGAVFPQRSERS-APVFTPPQTQPLTSYPPNLRNAGPGNDVPEPSAEPEF 184
Query: 231 QTFTAEETKEAFDVARNSIELLSTVLSSSPAQDTSEDDLTGTLVLQCRQSRSAIQRIIET 290
T + E + A + E+LS L +D ++ + LV QCR + + ++ +
Sbjct: 185 PTLSLSEIQNAKGIMDVLAEMLS-ALEPGNKEDLKQEVMV-DLVEQCRTYKQRVVHLVNS 244
Query: 291 AEDNEALLFEALNVNDEIQKVLSKCEDLKK--PSTTSPREQ 319
D E+LL + L +ND++Q+VL+ E + P T+S E+
Sbjct: 245 TSD-ESLLCQGLALNDDLQRVLTNYEAIASGLPGTSSQIEK 279
BLAST of CmaCh12G008560.1 vs. ExPASy Swiss-Prot
Match:
Q6NQK0 (TOM1-like protein 4 OS=Arabidopsis thaliana OX=3702 GN=TOL4 PE=1 SV=1)
HSP 1 Score: 132.1 bits (331), Expect = 1.4e-29
Identity = 108/345 (31.30%), Postives = 175/345 (50.72%), Query Frame = 0
Query: 53 EDATSETLEEPDWALNLEICDMVNSEKINSIDLIRGIKKRIMVKSPRIQYLALVLLETCV 112
E AT++ L PDWA+N+E+CD++N + + + ++ +KKR+ K+ ++Q LAL LET
Sbjct: 10 ERATNDMLIGPDWAINIELCDLINMDPSQAKEAVKVLKKRLGSKNSKVQILALYALETLS 69
Query: 113 KNC-EKCFSEVAAERVLDEMVKLIDDPQTVVNNRNKALMLIEAWGESTGEL--RYLPVYE 172
KNC E + + +L++MVK++ + +N R K L L++ W E+ G RY P Y
Sbjct: 70 KNCGENVYQLIIDRGLLNDMVKIV-KKKPELNVREKILTLLDTWQEAFGGRGGRY-PQYY 129
Query: 173 ETYKSLKSRGIRFPGRDNESLAPIFTPPRTISATETEASHIEQFHHDIPVQTFTAEETKE 232
Y L+S GI FP R SL+ FTPP+T E + I+ + + EE +
Sbjct: 130 NAYNDLRSAGIEFPPRTESSLS-FFTPPQT---QPDEDAAIQASLQGDDASSLSLEEIQS 189
Query: 233 AFDVARNSIELLSTVLSS-SPAQDTS-EDDLTGTLVLQCRQSRSAIQRIIETAEDNEALL 292
A S+++L +L + P S ++++ LV QCR + + ++ T D E LL
Sbjct: 190 ----AEGSVDVLMDMLGAHDPGNPESLKEEVIVDLVEQCRTYQRRVMTLVNTTTDEE-LL 249
Query: 293 FEALNVNDEIQKVLSKCEDLKK----PST-TSPREQEPAMIPIAGEPDESPSHAKEEDAL 352
+ L +ND +Q VL + +D+ PS + R P I DE E L
Sbjct: 250 CQGLALNDNLQHVLQRHDDIANVGSVPSNGRNTRAPPPVQIVDINHDDEDDESDDEFARL 309
Query: 353 VRKAATSGSRPGGGSSDEMMDDLDEMIFGKKGGSASDRGQEPKKP 388
+++T RP GS M+D L ++ +G S+S ++P P
Sbjct: 310 AHRSSTPTRRPVHGSDSGMVDILSGDVYKPQGNSSSQGVKKPPPP 343
BLAST of CmaCh12G008560.1 vs. ExPASy Swiss-Prot
Match:
F4KAU9 (TOM1-like protein 7 OS=Arabidopsis thaliana OX=3702 GN=TOL7 PE=2 SV=1)
HSP 1 Score: 122.5 bits (306), Expect = 1.1e-26
Identity = 98/325 (30.15%), Postives = 167/325 (51.38%), Query Frame = 0
Query: 52 VEDATSETLEEPDWALNLEICDMVNSEKINSIDLIRGIKKRIMVKSPRIQYLALVLLETC 111
V+ ATSE L PDW + + ICD +NS + D I+ +K+R+ KS R+Q L L LLE
Sbjct: 26 VDKATSELLRTPDWTIIIAICDSLNSNRWQCKDAIKAVKRRLQHKSSRVQLLTLTLLEAM 85
Query: 112 VKNC-EKCFSEVAAERVLDEMVKLIDDPQTVVNNRNKALMLIEAWGES-TGELRYLPVYE 171
+KNC + S +A + +L++MVKL+ + RNK L+L++ W E+ +G P Y
Sbjct: 86 LKNCGDFVHSHIAEKHLLEDMVKLV-RKKGDFEVRNKLLILLDTWNEAFSGVACKHPHYN 145
Query: 172 ETYKSLKSRGIRFPGRDNESLAPIFTPP---RTISATETEASHIEQFHH-DIPVQTFTAE 231
Y+ LK G++FP R E+ + PP ++ S++ I F D + T
Sbjct: 146 WAYQELKRCGVKFPQRSKEAPLMLEPPPPVTQSSSSSSMNLMSIGSFRRLDETMATEIES 205
Query: 232 ETKEAFDVARNSIELLSTVLSSSPAQDTS--EDDLTGTLVLQCRQSRSAIQRIIETAEDN 291
+ + + RN ++L++ ++ + D S +D+L LV QCR ++ + +++ T D
Sbjct: 206 LSLSSLESMRNVMDLVNDMVQAVNPSDKSALKDELIVDLVEQCRSNQKKLIQMLTTTAD- 265
Query: 292 EALLFEALNVNDEIQKVLSKCEDLKKPSTTSPREQEPAMIPIAGEPDESPSHAKEEDALV 351
E +L L +ND +Q VL++ + + + Q P EP E+ S K A
Sbjct: 266 EDVLARGLELNDSLQVVLARHDAIASGVSLPLLLQAP-------EPRETSSSLKTCGAAA 325
Query: 352 RKAATSGSRPGGGSSDEMMDDLDEM 369
++A S S SS+ D+++++
Sbjct: 326 LESADSESSSSSSSSESETDEVEDV 341
BLAST of CmaCh12G008560.1 vs. TAIR 10
Match:
AT5G16880.1 (Target of Myb protein 1 )
HSP 1 Score: 543.9 bits (1400), Expect = 1.1e-154
Identity = 295/409 (72.13%), Postives = 338/409 (82.64%), Query Frame = 0
Query: 1 MSDNLMDKVSALGERLKISGTEMSRKMSAGVSSMSFKMKELFQGPNQDDKLVEDATSETL 60
M DNLMDKV+A GERLKI G+E+S K+SAGVSSMSFK+KELFQGPN DK+VEDAT+E L
Sbjct: 1 MGDNLMDKVTAFGERLKIGGSEVSNKISAGVSSMSFKVKELFQGPNPTDKIVEDATTENL 60
Query: 61 EEPDWALNLEICDMVNSEKINSIDLIRGIKKRIMVKSPRIQYLALVLLETCVKNCEKCFS 120
EEPDW +NLEICDM+N E INS++LIRGIKKRIM+K PRIQYLALVLLETCVKNCEK FS
Sbjct: 61 EEPDWDMNLEICDMINQETINSVELIRGIKKRIMMKQPRIQYLALVLLETCVKNCEKAFS 120
Query: 121 EVAAERVLDEMVKLIDDPQTVVNNRNKALMLIEAWGESTGELRYLPVYEETYKSLKSRGI 180
EVAAERVLDEMVKLIDDPQTVVNNRNKALMLIEAWGEST ELRYLPV+EETYKSLK+RGI
Sbjct: 121 EVAAERVLDEMVKLIDDPQTVVNNRNKALMLIEAWGESTSELRYLPVFEETYKSLKARGI 180
Query: 181 RFPGRDNESLAPIFTPPRTISATETEASHIEQFH------HDIPVQTFTAEETKEAFDVA 240
RFPGRDNESLAPIFTP R+ A E A + H +D+PV++FTAE+TKEAFD+A
Sbjct: 181 RFPGRDNESLAPIFTPARSTPAPELNADLPQHVHEPAHIQYDVPVRSFTAEQTKEAFDIA 240
Query: 241 RNSIELLSTVLSSSPAQDTSEDDLTGTLVLQCRQSRSAIQRIIETAEDNEALLFEALNVN 300
RNSIELLSTVLSSSP D +DDLT TLV QCRQS++ +QRIIETA +NEALLFEALNVN
Sbjct: 241 RNSIELLSTVLSSSPQHDALQDDLTTTLVQQCRQSQTTVQRIIETAGENEALLFEALNVN 300
Query: 301 DEIQKVLSKCEDLKKPSTTSPREQEPAMIPIAGEPDESPSHAKEEDALVRKAA--TSGSR 360
DE+ K LSK E++ KPS EPAMIP+A EPD+SP H +EE +LVRK++ G
Sbjct: 301 DELVKTLSKYEEMNKPSAPL-TSHEPAMIPVAEEPDDSPIHGREE-SLVRKSSGVRGGFH 360
Query: 361 PGGGSSDEMMDDLDEMIFGKKGG--SASDRGQEPKK-PDSSKDNDLISF 399
GGGS D+MMDDLDEMIFGKK G S+++ +PKK SSK++DLI F
Sbjct: 361 GGGGSGDDMMDDLDEMIFGKKNGCDSSTNPDHDPKKEQSSSKNDDLIRF 407
BLAST of CmaCh12G008560.1 vs. TAIR 10
Match:
AT5G16880.2 (Target of Myb protein 1 )
HSP 1 Score: 543.9 bits (1400), Expect = 1.1e-154
Identity = 295/409 (72.13%), Postives = 338/409 (82.64%), Query Frame = 0
Query: 1 MSDNLMDKVSALGERLKISGTEMSRKMSAGVSSMSFKMKELFQGPNQDDKLVEDATSETL 60
M DNLMDKV+A GERLKI G+E+S K+SAGVSSMSFK+KELFQGPN DK+VEDAT+E L
Sbjct: 1 MGDNLMDKVTAFGERLKIGGSEVSNKISAGVSSMSFKVKELFQGPNPTDKIVEDATTENL 60
Query: 61 EEPDWALNLEICDMVNSEKINSIDLIRGIKKRIMVKSPRIQYLALVLLETCVKNCEKCFS 120
EEPDW +NLEICDM+N E INS++LIRGIKKRIM+K PRIQYLALVLLETCVKNCEK FS
Sbjct: 61 EEPDWDMNLEICDMINQETINSVELIRGIKKRIMMKQPRIQYLALVLLETCVKNCEKAFS 120
Query: 121 EVAAERVLDEMVKLIDDPQTVVNNRNKALMLIEAWGESTGELRYLPVYEETYKSLKSRGI 180
EVAAERVLDEMVKLIDDPQTVVNNRNKALMLIEAWGEST ELRYLPV+EETYKSLK+RGI
Sbjct: 121 EVAAERVLDEMVKLIDDPQTVVNNRNKALMLIEAWGESTSELRYLPVFEETYKSLKARGI 180
Query: 181 RFPGRDNESLAPIFTPPRTISATETEASHIEQFH------HDIPVQTFTAEETKEAFDVA 240
RFPGRDNESLAPIFTP R+ A E A + H +D+PV++FTAE+TKEAFD+A
Sbjct: 181 RFPGRDNESLAPIFTPARSTPAPELNADLPQHVHEPAHIQYDVPVRSFTAEQTKEAFDIA 240
Query: 241 RNSIELLSTVLSSSPAQDTSEDDLTGTLVLQCRQSRSAIQRIIETAEDNEALLFEALNVN 300
RNSIELLSTVLSSSP D +DDLT TLV QCRQS++ +QRIIETA +NEALLFEALNVN
Sbjct: 241 RNSIELLSTVLSSSPQHDALQDDLTTTLVQQCRQSQTTVQRIIETAGENEALLFEALNVN 300
Query: 301 DEIQKVLSKCEDLKKPSTTSPREQEPAMIPIAGEPDESPSHAKEEDALVRKAA--TSGSR 360
DE+ K LSK E++ KPS EPAMIP+A EPD+SP H +EE +LVRK++ G
Sbjct: 301 DELVKTLSKYEEMNKPSAPL-TSHEPAMIPVAEEPDDSPIHGREE-SLVRKSSGVRGGFH 360
Query: 361 PGGGSSDEMMDDLDEMIFGKKGG--SASDRGQEPKK-PDSSKDNDLISF 399
GGGS D+MMDDLDEMIFGKK G S+++ +PKK SSK++DLI F
Sbjct: 361 GGGGSGDDMMDDLDEMIFGKKNGCDSSTNPDHDPKKEQSSSKNDDLIRF 407
BLAST of CmaCh12G008560.1 vs. TAIR 10
Match:
AT5G16880.3 (Target of Myb protein 1 )
HSP 1 Score: 434.5 bits (1116), Expect = 9.2e-122
Identity = 223/288 (77.43%), Postives = 251/288 (87.15%), Query Frame = 0
Query: 1 MSDNLMDKVSALGERLKISGTEMSRKMSAGVSSMSFKMKELFQGPNQDDKLVEDATSETL 60
M DNLMDKV+A GERLKI G+E+S K+SAGVSSMSFK+KELFQGPN DK+VEDAT+E L
Sbjct: 1 MGDNLMDKVTAFGERLKIGGSEVSNKISAGVSSMSFKVKELFQGPNPTDKIVEDATTENL 60
Query: 61 EEPDWALNLEICDMVNSEKINSIDLIRGIKKRIMVKSPRIQYLALVLLETCVKNCEKCFS 120
EEPDW +NLEICDM+N E INS++LIRGIKKRIM+K PRIQYLALVLLETCVKNCEK FS
Sbjct: 61 EEPDWDMNLEICDMINQETINSVELIRGIKKRIMMKQPRIQYLALVLLETCVKNCEKAFS 120
Query: 121 EVAAERVLDEMVKLIDDPQTVVNNRNKALMLIEAWGESTGELRYLPVYEETYKSLKSRGI 180
EVAAERVLDEMVKLIDDPQTVVNNRNKALMLIEAWGEST ELRYLPV+EETYKSLK+RGI
Sbjct: 121 EVAAERVLDEMVKLIDDPQTVVNNRNKALMLIEAWGESTSELRYLPVFEETYKSLKARGI 180
Query: 181 RFPGRDNESLAPIFTPPRTISATETEASHIEQFH------HDIPVQTFTAEETKEAFDVA 240
RFPGRDNESLAPIFTP R+ A E A + H +D+PV++FTAE+TKEAFD+A
Sbjct: 181 RFPGRDNESLAPIFTPARSTPAPELNADLPQHVHEPAHIQYDVPVRSFTAEQTKEAFDIA 240
Query: 241 RNSIELLSTVLSSSPAQDTSEDDLTGTLVLQCRQSRSAIQRIIETAED 283
RNSIELLSTVLSSSP D +DDLT TLV QCRQS++ +QRIIETA++
Sbjct: 241 RNSIELLSTVLSSSPQHDALQDDLTTTLVQQCRQSQTTVQRIIETADE 288
BLAST of CmaCh12G008560.1 vs. TAIR 10
Match:
AT1G06210.1 (ENTH/VHS/GAT family protein )
HSP 1 Score: 222.2 bits (565), Expect = 7.2e-58
Identity = 140/349 (40.11%), Postives = 210/349 (60.17%), Query Frame = 0
Query: 8 KVSALGERLKISGTEMSRKMSAGVSSMSFKMKELFQGPNQDDKLVEDATSETLEEPDWAL 67
K++ GE+LK G +MSR +S K+K++ Q P + K+V++AT ETLEEP+W +
Sbjct: 5 KIAEWGEKLKTGGAQMSRMVSE-------KVKDMLQAPTLESKMVDEATLETLEEPNWGM 64
Query: 68 NLEICDMVNSEKINSIDLIRGIKKRIMVKSPRIQYLALVLLETCVKNCEKCFSEVAAERV 127
N+ IC +N+++ N +++R IK++I KSP Q L+L LLE C NCEK FSEVA+E+V
Sbjct: 65 NMRICAQINNDEFNGTEIVRAIKRKISGKSPVSQRLSLELLEACAMNCEKVFSEVASEKV 124
Query: 128 LDEMVKLIDDPQTVVNNRNKALMLIEAWGESTGELRYLPVYEETYKSLK-SRGIRFPGRD 187
LDEMV LI + + NR +A LI AWG+S +L YLPV+ +TY SL+ G+ G +
Sbjct: 125 LDEMVWLIKNGEADSENRKRAFQLIRAWGQSQ-DLTYLPVFHQTYMSLEGENGLHARGEE 184
Query: 188 N--------ESL--API-FTPPRTISATETEASHIEQFHHDIPVQTFTAEETKEAFDVAR 247
N ESL P+ PP + E + + D + ++ KE ++ R
Sbjct: 185 NSMPGQSSLESLMQRPVPVPPPGSYPVPNQEQALGDDDGLDYNFGNLSIKDKKEQIEITR 244
Query: 248 NSIELLSTVLSSSPAQDTSEDDLTGTLVLQCRQSRSAIQRIIETAEDNEALLFEALNVND 307
NS+ELLS++L++ + +EDDLT +L+ +C+QS+ IQ IIE+ D+E +LFEAL++ND
Sbjct: 245 NSLELLSSMLNTEGKPNHTEDDLTVSLMEKCKQSQPLIQMIIESTTDDEGVLFEALHLND 304
Query: 308 EIQKVLSKCEDLKKPSTTSPR----EQEPAMIPIAGEPDESPSHAKEED 341
E+Q+VLS KKP T + EQE +G D P ++E+
Sbjct: 305 ELQQVLS---SYKKPDETEKKASIVEQES-----SGSKDTGPKPTEQEE 337
BLAST of CmaCh12G008560.1 vs. TAIR 10
Match:
AT1G06210.2 (ENTH/VHS/GAT family protein )
HSP 1 Score: 169.5 bits (428), Expect = 5.5e-42
Identity = 102/260 (39.23%), Postives = 155/260 (59.62%), Query Frame = 0
Query: 8 KVSALGERLKISGTEMSRKMSAGVSSMSFKMKELFQGPNQDDKLVEDATSETLEEPDWAL 67
K++ GE+LK G +MSR +S K+K++ Q P + K+V++AT ETLEEP+W +
Sbjct: 5 KIAEWGEKLKTGGAQMSRMVSE-------KVKDMLQAPTLESKMVDEATLETLEEPNWGM 64
Query: 68 NLEICDMVNSEKINSIDLIRGIKKRIMVKSPRIQYLALVLLETCVKNCEKCFSEVAAERV 127
N+ IC +N+++ N +++R IK++I KSP Q L+L LLE C NCEK FSEVA+E+V
Sbjct: 65 NMRICAQINNDEFNGTEIVRAIKRKISGKSPVSQRLSLELLEACAMNCEKVFSEVASEKV 124
Query: 128 LDEMVKLIDDPQTVVNNRNKALMLIEAWGESTGELRYLPVYEETYKSLK-SRGIRFPGRD 187
LDEMV LI + + NR +A LI AWG+S +L YLPV+ +TY SL+ G+ G +
Sbjct: 125 LDEMVWLIKNGEADSENRKRAFQLIRAWGQSQ-DLTYLPVFHQTYMSLEGENGLHARGEE 184
Query: 188 N--------ESL--API-FTPPRTISATETEASHIEQFHHDIPVQTFTAEETKEAFDVAR 247
N ESL P+ PP + E + + D + ++ KE ++ R
Sbjct: 185 NSMPGQSSLESLMQRPVPVPPPGSYPVPNQEQALGDDDGLDYNFGNLSIKDKKEQIEITR 244
Query: 248 NSIELLSTVLSSSPAQDTSE 256
NS+ELLS++L++ + +E
Sbjct: 245 NSLELLSSMLNTEGKPNHTE 256
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q9LFL3 | 1.5e-153 | 72.13 | TOM1-like protein 1 OS=Arabidopsis thaliana OX=3702 GN=TOL1 PE=1 SV=1 | [more] |
Q9LNC6 | 1.0e-56 | 40.11 | TOM1-like protein 2 OS=Arabidopsis thaliana OX=3702 GN=TOL2 PE=1 SV=1 | [more] |
Q8L860 | 4.7e-30 | 35.59 | TOM1-like protein 9 OS=Arabidopsis thaliana OX=3702 GN=TOL9 PE=1 SV=1 | [more] |
Q6NQK0 | 1.4e-29 | 31.30 | TOM1-like protein 4 OS=Arabidopsis thaliana OX=3702 GN=TOL4 PE=1 SV=1 | [more] |
F4KAU9 | 1.1e-26 | 30.15 | TOM1-like protein 7 OS=Arabidopsis thaliana OX=3702 GN=TOL7 PE=2 SV=1 | [more] |
Relationships
This mRNA is a part of the following gene feature(s):
The following exon feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
CmaCh12G008560.1:exon:147 | CmaCh12G008560.1:exon:147 | exon |
CmaCh12G008560.1:exon:146 | CmaCh12G008560.1:exon:146 | exon |
CmaCh12G008560.1:exon:145 | CmaCh12G008560.1:exon:145 | exon |
The following three_prime_UTR feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
CmaCh12G008560.1:three_prime_utr | CmaCh12G008560.1:three_prime_utr | three_prime_UTR |
The following CDS feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
CmaCh12G008560.1:cds | CmaCh12G008560.1:cds_3 | CDS |
CmaCh12G008560.1:cds | CmaCh12G008560.1:cds_2 | CDS |
CmaCh12G008560.1:cds | CmaCh12G008560.1:cds | CDS |
The following polypeptide feature(s) derives from this mRNA:
Feature Name | Unique Name | Type |
CmaCh12G008560.1 | CmaCh12G008560.1-protein | polypeptide |