Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, exon, CDS, polypeptide, three_prime_UTR
CAAAGTCGTTTTGGGTGGTCAAAGGGGAAGTGAAGGTTTGTTTCGGTTCAATTCTACACACTGCTTCCACATTCGAGAATGGAGCTCCATTGCGCAACTCTTCAAACTTCATTCTCTTTCTCTATCAGAAGCAAAGCTCTTGCTCATGGAGATGCCTCCGCCGTCTGCTCTCCCTCCTCGCCATCTTTCTCAAGAATTACAGCTCGAAATTTCTCAATGGGTTCAAAAAGTAGAGGTAAGGTGGCTCAAGTTTGCTAAGTTTCCGTTGAGACGCGAAGACTGAGCCATGAGATCGATACTTCTGTAGTATTGCTCAATGCTTTCGATTTTAGTCGGTTATTAAATGGGAACGCCGGGGCGACGGAAAGAGTAGAGATTTTGATTCTGTTCTCTTTTCGTATTCCGTTCATTATATGACTGAACTATTCACGACGTGGAAAAATGGTCCTTTAGAATTATTTAAACAGAGACGTTGGCCAATAAATTTTCTATATGTGTGTGTGTGTGTGTAGCTATGGATTGGTGTAGATACTGAATTTCTGTTGTTTAAATTACTGATTCCTTATAGTTCATGTTCCAAGCTTCATATTGACTTGGCGTAGCAGCCACAAAGTGAAGCTAGTATACATAGTCTAAAATTATAGAAACTTAAGAAAATTGTTACCATGTTCTTTATTGAACTTGAGTGATCAGTGAAGTATTAGTAATTTTGTTGAATAGCAGGGTTCCCTTCACTGGTGTGTCGAGTTAGATTGAAGAAGTCATCCTTTTCTGCCGTTGTCAGAGGGGTGAGTGCAGTACCTAGTGATTGTAATTCAGAAACTCTTGATTGGTCAAATCCCACTCAATATGAGCCAAGAGATGTTCAAAATGCAAAAGATGGGGTCGAAAGCTTGGACCAGCCTAAAATGACCAAAGTGTGTGATAAGCTCATTGAAGTCTTCATGATCGACAAGCCAACTCCAACAGATTGGAGACGGTTAATTGCTTTCAGCAAGGAATGGGACAATATCCGTCCCCATTTCTTTAGGAGGTGCCAAGATCGAGCTGCCTCTGAAGACGATCCTGGGATGAGGCATAAACTACTCCGGCTGGGAAGAAAGTTGAAAGAGGTATGTCTTGGGATTTTAAAGTTATACGTACTAAAAAATTGGACTTGAAAATGGCAAGCCTTGTTTCTATAAATAATTAATTATGTAAATAAGAAGAGTGGAAACTAACACTTCGTATGCAAATGAGGTTAGGTCTATTGTTGTAAAAGCCACAAAGGATGTGTTGCTGAGAGAATAATTAAGATGAATTGTTGATGATAACATGGTTATTTTATTGATCAGATAGTTGCAGTATGTGAATTGTCAATCAACCTTTTCTTTCCTTCTATGAAATTTTCACAGATAGATGAAGATGTGAAGAGACATAATGAACTTCTTGAAGTGGTCAGAGCAGCAGCACCATCAGAACTTGGTGAAATTATTTCTAGGCGTCGCAAAGATTTTACAAAAGAATTCTTTGTGCATCTTCACACGGTGGCTGAATCTTATTATGATGATCCGGCTGAGCAAAATGGTAATACTGCATACTTACGAGTTGGAAAAATAGTTATGAATAAATAGGTTCCTAAGGGGATGAGAAAGTGTAATTCAGGAAGCAAATATATTGGTTAAAAAAATACCTTTTTGGTCTCTAAGTTTTGAGTATAGTCTTAATTTGGTCCTTAAATTTTAAAATGTTATTTAATTTCTAAGATTTAAAATGTTATCCTTTGAAATGACCAGAAGGGAAACTATACTCAAAACTTAGGGCCAAAAGATATACTCTTTCCTATGTTGTTTTATGCTATTCAAAGAATGAGAGGCAACTTAAGAAATTGCTCCTTCCTTAAGAGTAGCTAGATTTGTTAAGACATGCATTTGTCAAATGTTCAAATCTATACCCCTACATATTTATTTATTTATTTATTCATTTATTATTATTATTATTATTATTATTGTTAATGATGT
TTTCTTTCTTCAGTGAACGTCTTTTTCATATATGCATATATAAGATATGGTATTGATGGATGAACTGATATATTTCCTCTTATATTTCAATTGATAAGCTTTGGCAAAACTTGGGAATTCCTGCCTTTCTGCTGTACAAACATATGATGCTGCAACTGAAAACATTGAAGCACTGAATGCCGCAGAGTTGAAATTCCAGGATATCATAAATTCTCCAACTTTAGATGCTGCTTGCAGAAAGATAGACAATTTGGCGGAGAAAAACCAACTTGATTCTGCATTAGTATTGATGATCACAAAAGCTTGGTCAGCTGCAAAGGAGTCGAACATGATGAAAGATGAGGTAATGCATTTGCGACCCTTGCTGTTCGCCATTGATGCAAAATTGTGATTGCATGGTCTTATTTCTGTGATATTGTAGGTTTTCAAAGCATCATCTGACCATTCAAGAAATTTCAAATTATTGTGTTAAGTCAGTAATTACTGAAACTTCCACAAATACTGTCATCTATTTGAATGAAATGACTACAGCTTAAGGCACCAGATTAAAATATATATATATATATATATTTCTCTTGATTTTATGTGTTTCTATGTTGGGTCATGCGAGCAAATGGAGAATGCACCTCTATTTTATTGTTGCATAATGACTTTCTGATGATGTAAGTGTTGATGTTATTTCTCTTTGTTAGGTGAAAGACATACTATACCACTTGTACGTGACTGCAAGCGGTAACCTTCAAAGATTAATGCCAAAAGAAATCAGGATTCTGAAGTATCTTCTCACAATTAAGGATCCTGAGGAAAAGCTAAGTGCTCTGAAGGATGCGTTTACCCCCGGAGAAGAACTTGAAGGACAAGATGTGGACTGCTTATACACGTAAGCTTTTGGACCTCGTTGTTAAGAGTATATTGAAATACTAAATATGTCAGTCTATTAGCTTGAACCAAATTTAGTGATTTATTTTGATATCAGATCTTCTATGGTCAAACTCTAATGATTCTATTTTCTTATCTATAGTGTTGCTAGTCTACTTGTTCGATATTCGTCATTGTGCCCATTTTTGAGCCTTCAAATGGGATGTCAAGCAATAAGCATATATTCAAAGAATTAGATATAGCCTATAACTTATTAACACTGAGTTAATCCTCATTGTTATCGTATCTTTGTTAGAATACCTCCCAACTAGAATTAAGATTTGGAATATATTGAAATATAACGGAATAACTACAAGATAACCCAAGTATAAAGGCACTTGGAACCTCCCTCTCGACTCCGGAGTGTGCAAAATCAAGCCCTAGACCCACTCCACAGAATTGCCTACCTTCCCTCGCTCACCCTCCTGGCCTCCTATTTATAACGATATGCACTAACTTCCTATCTAATTACCATGATATCCCTTACTAATACTATATTAATATTCTCATCACACAAGGGAAAAAAAGCCTTAGTCCCATCTCCGTAGATCTCCTGAACCTTTGATGTCCATCTAGTTAAGAGCCAGGATATACCACCTTGACTTTGAATTTGCGCAACAGACCACTTGGAACGTCGTCTGGAAAACGACCTTCAACTGGACTTTCTTTTTTTAAAAAAAAGATATATAAATATACACATATAAAAGCAATCTTATTAGGTTTCTTCTTCATTTGTGCACACTTTTAGGACCCCAGAGAAGCTTCACACGTGGATAAAGACGGTGGTAGATGCTTATCATTTCAGCCGGGAAGGCACTCTCATTAAGGAAGCCAGAGACCTTATGAATCCACAGGTCATTGTTAAACTTGAAGAATTGAAGCATCTCCTTGAGAAAAATTTCATGTGAGGTAAAGCATTTGGTAGTTAGGATTTATGTCTTTAGTTATGCATGAAAAATTGTATAGTGATTTTTTTTCTTTGTTGTATAACAAAATGAATATGAATAATGTATTCAAAATAGTTTATTTTCTCCATGGAAGTAGAGTGTGGTTCAAGTGGCTAGATAGTTGTACCTGTATTA
TGGCTTGTAGTTTCATGTTAGAATCTATTCTCTCTTTATTTTTCAATTTATTATTCTTTCACATAAAA
mRNA sequence
CAAAGTCGTTTTGGGTGGTCAAAGGGGAAGTGAAGGTTTGTTTCGGTTCAATTCTACACACTGCTTCCACATTCGAGAATGGAGCTCCATTGCGCAACTCTTCAAACTTCATTCTCTTTCTCTATCAGAAGCAAAGCTCTTGCTCATGGAGATGCCTCCGCCGTCTGCTCTCCCTCCTCGCCATCTTTCTCAAGAATTACAGCTCGAAATTTCTCAATGGGTTCAAAAAGTAGAGGGTTCCCTTCACTGGTGTGTCGAGTTAGATTGAAGAAGTCATCCTTTTCTGCCGTTGTCAGAGGGGTGAGTGCAGTACCTAGTGATTGTAATTCAGAAACTCTTGATTGGTCAAATCCCACTCAATATGAGCCAAGAGATGTTCAAAATGCAAAAGATGGGGTCGAAAGCTTGGACCAGCCTAAAATGACCAAAGTGTGTGATAAGCTCATTGAAGTCTTCATGATCGACAAGCCAACTCCAACAGATTGGAGACGGTTAATTGCTTTCAGCAAGGAATGGGACAATATCCGTCCCCATTTCTTTAGGAGGTGCCAAGATCGAGCTGCCTCTGAAGACGATCCTGGGATGAGGCATAAACTACTCCGGCTGGGAAGAAAGTTGAAAGAGATAGATGAAGATGTGAAGAGACATAATGAACTTCTTGAAGTGGTCAGAGCAGCAGCACCATCAGAACTTGGTGAAATTATTTCTAGGCGTCGCAAAGATTTTACAAAAGAATTCTTTGTGCATCTTCACACGGTGGCTGAATCTTATTATGATGATCCGGCTGAGCAAAATGCTTTGGCAAAACTTGGGAATTCCTGCCTTTCTGCTGTACAAACATATGATGCTGCAACTGAAAACATTGAAGCACTGAATGCCGCAGAGTTGAAATTCCAGGATATCATAAATTCTCCAACTTTAGATGCTGCTTGCAGAAAGATAGACAATTTGGCGGAGAAAAACCAACTTGATTCTGCATTAGTATTGATGATCACAAAAGCTTGGTCAGCTGCAAAGGAGTCGAACATGATGAAAGATGAGGTGAAAGACATACTATACCACTTGTACGTGACTGCAAGCGGTAACCTTCAAAGATTAATGCCAAAAGAAATCAGGATTCTGAAGTATCTTCTCACAATTAAGGATCCTGAGGAAAAGCTAAGTGCTCTGAAGGATGCGTTTACCCCCGGAGAAGAACTTGAAGGACAAGATGTGGACTGCTTATACACGACCCCAGAGAAGCTTCACACGTGGATAAAGACGGTGGTAGATGCTTATCATTTCAGCCGGGAAGGCACTCTCATTAAGGAAGCCAGAGACCTTATGAATCCACAGGTCATTGTTAAACTTGAAGAATTGAAGCATCTCCTTGAGAAAAATTTCATGTGAGGTAAAGCATTTGGTAGTTAGGATTTATGTCTTTAGTTATGCATGAAAAATTGTATAGTGATTTTTTTTCTTTGTTGTATAACAAAATGAATATGAATAATGTATTCAAAATAGTTTATTTTCTCCATGGAAGTAGAGTGTGGTTCAAGTGGCTAGATAGTTGTACCTGTATTATGGCTTGTAGTTTCATGTTAGAATCTATTCTCTCTTTATTTTTCAATTTATTATTCTTTCACATAAAA
Coding sequence (CDS)
ATGGAGCTCCATTGCGCAACTCTTCAAACTTCATTCTCTTTCTCTATCAGAAGCAAAGCTCTTGCTCATGGAGATGCCTCCGCCGTCTGCTCTCCCTCCTCGCCATCTTTCTCAAGAATTACAGCTCGAAATTTCTCAATGGGTTCAAAAAGTAGAGGGTTCCCTTCACTGGTGTGTCGAGTTAGATTGAAGAAGTCATCCTTTTCTGCCGTTGTCAGAGGGGTGAGTGCAGTACCTAGTGATTGTAATTCAGAAACTCTTGATTGGTCAAATCCCACTCAATATGAGCCAAGAGATGTTCAAAATGCAAAAGATGGGGTCGAAAGCTTGGACCAGCCTAAAATGACCAAAGTGTGTGATAAGCTCATTGAAGTCTTCATGATCGACAAGCCAACTCCAACAGATTGGAGACGGTTAATTGCTTTCAGCAAGGAATGGGACAATATCCGTCCCCATTTCTTTAGGAGGTGCCAAGATCGAGCTGCCTCTGAAGACGATCCTGGGATGAGGCATAAACTACTCCGGCTGGGAAGAAAGTTGAAAGAGATAGATGAAGATGTGAAGAGACATAATGAACTTCTTGAAGTGGTCAGAGCAGCAGCACCATCAGAACTTGGTGAAATTATTTCTAGGCGTCGCAAAGATTTTACAAAAGAATTCTTTGTGCATCTTCACACGGTGGCTGAATCTTATTATGATGATCCGGCTGAGCAAAATGCTTTGGCAAAACTTGGGAATTCCTGCCTTTCTGCTGTACAAACATATGATGCTGCAACTGAAAACATTGAAGCACTGAATGCCGCAGAGTTGAAATTCCAGGATATCATAAATTCTCCAACTTTAGATGCTGCTTGCAGAAAGATAGACAATTTGGCGGAGAAAAACCAACTTGATTCTGCATTAGTATTGATGATCACAAAAGCTTGGTCAGCTGCAAAGGAGTCGAACATGATGAAAGATGAGGTGAAAGACATACTATACCACTTGTACGTGACTGCAAGCGGTAACCTTCAAAGATTAATGCCAAAAGAAATCAGGATTCTGAAGTATCTTCTCACAATTAAGGATCCTGAGGAAAAGCTAAGTGCTCTGAAGGATGCGTTTACCCCCGGAGAAGAACTTGAAGGACAAGATGTGGACTGCTTATACACGACCCCAGAGAAGCTTCACACGTGGATAAAGACGGTGGTAGATGCTTATCATTTCAGCCGGGAAGGCACTCTCATTAAGGAAGCCAGAGACCTTATGAATCCACAGGTCATTGTTAAACTTGAAGAATTGAAGCATCTCCTTGAGAAAAATTTCATGTGA
Protein sequence
MELHCATLQTSFSFSIRSKALAHGDASAVCSPSSPSFSRITARNFSMGSKSRGFPSLVCRVRLKKSSFSAVVRGVSAVPSDCNSETLDWSNPTQYEPRDVQNAKDGVESLDQPKMTKVCDKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFRRCQDRAASEDDPGMRHKLLRLGRKLKEIDEDVKRHNELLEVVRAAAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQNALAKLGNSCLSAVQTYDAATENIEALNAAELKFQDIINSPTLDAACRKIDNLAEKNQLDSALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPEEKLSALKDAFTPGEELEGQDVDCLYTTPEKLHTWIKTVVDAYHFSREGTLIKEARDLMNPQVIVKLEELKHLLEKNFM
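The protein sequence above should correspond to a translation of the CDS with the standard genetic code. The following is a minimal Python sketch of that check, not the database's own pipeline. The names CODON_TABLE and translate are illustrative, and only the first ten codons of the CDS are used in the usage line.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Hypothetical helper for illustration; assumes an unambiguous
    A/C/G/T sequence whose length is a multiple of three.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        residues.append(aa)
    return "".join(residues)


# First ten codons of the CDS listed above.
print(translate("ATGGAGCTCCATTGCGCAACTCTTCAAACT"))  # MELHCATLQT
```

Passing the full CDS string to translate and comparing the result with the protein sequence is a quick way to confirm that the two records agree.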
Homology
BLAST of Bhi04G002045 vs. TAIR 10
Match:
AT1G36320.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G37920.1); Has 93 Blast hits to 90 proteins in 22 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 85; Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink). )
HSP 1 Score: 518.8 bits (1335), Expect = 4.1e-147
Identity = 247/338 (73.08%), Positives = 302/338 (89.35%), Query Frame = 0
Query: 101 QNAKDGVES--LDQPKMTKVCDKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFRRCQ 160
+ KDG E +D +M KVCDKLIEVFM+DKPTP+DWRRL+AFSKEWD+IRPHF++RCQ
Sbjct: 77 EEKKDGSEEVVVDNQRMIKVCDKLIEVFMVDKPTPSDWRRLLAFSKEWDSIRPHFYKRCQ 136
Query: 161 DRAASEDDPGMRHKLLRLGRKLKEIDEDVKRHNELLEVVRAAAPSELGEIISRRRKDFTK 220
+RA SED+P M+HK+ RL RKLKE+DED++RHNELL V++ P+E+GE+++RRRKDFT
Sbjct: 137 ERADSEDNPEMKHKVHRLARKLKEVDEDIQRHNELLNVIKRTPPAEIGELVARRRKDFTN 196
Query: 221 EFFVHLHTVAESYYDDPAEQNALAKLGNSCLSAVQTYDAATENIEALNAAELKFQDIINS 280
EFF HLHTVAESYYD+P EQNALA LG ++AVQ YD +TE+I+ALNAAE+K QDIINS
Sbjct: 197 EFFEHLHTVAESYYDNPDEQNALASLGKLSIAAVQAYDTSTESIDALNAAEMKLQDIINS 256
Query: 281 PTLDAACRKIDNLAEKNQLDSALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQ 340
P+LDAACRKID+LAEKNQLDSALVLMITKAWSAAKESNMMK+EVKDILYHLYVTA GNLQ
Sbjct: 257 PSLDAACRKIDSLAEKNQLDSALVLMITKAWSAAKESNMMKEEVKDILYHLYVTARGNLQ 316
Query: 341 RLMPKEIRILKYLLTIKDPEEKLSALKDAFTPGEELEGQDVDCLYTTPEKLHTWIKTVVD 400
RLMPKE+RILKYLL+I+DP+E++SAL+DAFTPG+ELEG DVD LYTTPE L + +KTV++
Sbjct: 317 RLMPKEVRILKYLLSIEDPQEQISALQDAFTPGDELEGTDVDYLYTTPEHLQSLMKTVLE 376
Query: 401 AYHFSREGTLIKEARDLMNPQVIVKLEELKHLLEKNFM 437
AYHFSREG+L+KEA+DLM+P++I K+E+LK L+EK +M
Sbjct: 377 AYHFSREGSLVKEAKDLMHPELIAKIEQLKKLVEKKYM 414
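The Identity and Positives percentages in each HSP header are the match counts divided by the alignment length. A small Python sketch of that arithmetic, using the counts reported for this hit (the function name percent is an illustrative assumption):

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the AT1G36320.1 HSP header.
print(percent(247, 338))  # identity: 73.08
print(percent(302, 338))  # positives: 89.35
```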
BLAST of Bhi04G002045 vs. TAIR 10
Match:
AT4G37920.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast thylakoid membrane, chloroplast, chloroplast envelope; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G36320.1); Has 123 Blast hits to 120 proteins in 40 species: Archae - 2; Bacteria - 11; Metazoa - 8; Fungi - 0; Plants - 85; Viruses - 0; Other Eukaryotes - 17 (source: NCBI BLink). )
HSP 1 Score: 299.3 bits (765), Expect = 5.1e-81
Identity = 150/373 (40.21%), Positives = 245/373 (65.68%), Query Frame = 0
Query: 68 FSAVVRGVSAVPSDCN----SETLDWSNPTQYEPRDVQNAKDGVESLDQPKMTKVCDKLI 127
FSA + G + ++T+ ++ T E + VE + M + CDK+I
Sbjct: 32 FSAFINGGRKIRKSSTITFATDTVTYNGTTSAEVKSSVEDPMEVEVAEGYTMAQFCDKII 91
Query: 128 EVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFRRCQDRAASEDDPGMRHKLLRLGRKLKEI 187
++F+ +KP W+ + EW+ +F++RC+ RA +E DP ++ KL+ L K+K+I
Sbjct: 92 DLFLNEKPKVKQWKTYLVLRDEWNKYSVNFYKRCRIRADTETDPILKQKLVSLESKVKKI 151
Query: 188 DEDVKRHNELLEVVRAAAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQNALAK 247
D+++++HN+LL+ ++ P+++ I ++RR+DFT EFF ++ ++E+ D +++A+A+
Sbjct: 152 DKEMEKHNDLLKEIQ-ENPTDINAIAAKRRRDFTGEFFRYVTLLSET-LDGLEDRDAVAR 211
Query: 248 LGNSCLSAVQTYDAATENIEALNAAELKFQDIINSPTLDAACRKIDNLAEKNQLDSALVL 307
L CLSAV YD E++E L+ A+ KF+DI+NSP++D+AC KI +LA+ +LDS+L+L
Sbjct: 212 LATRCLSAVSAYDNTLESVETLDTAQAKFEDILNSPSVDSACEKIRSLAKAKELDSSLIL 271
Query: 308 MITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPEEKLSA 367
+I A++AAKES + +E KDI+YHLY +L+ + PKEI++LKYLL I DPEE+ SA
Sbjct: 272 LINSAYAAAKESQTVTNEAKDIMYHLYKATKSSLRSITPKEIKLLKYLLNITDPEERFSA 331
Query: 368 LKDAFTPGEELEGQDVDCLYTTPEKLHTWIKTVVDAYHFSREGTLIKEARDLMNPQVIVK 427
L AF+PG++ E +D LYTTP++LH WIK ++DAYH ++E T IKEA+ + P VI +
Sbjct: 332 LATAFSPGDDHEAKDPKALYTTPKELHKWIKIMLDAYHLNKEETDIKEAKQMSQPIVIQR 391
Query: 428 LEELKHLLEKNFM 437
L LK +E ++
Sbjct: 392 LFILKDTIEDEYL 402
BLAST of Bhi04G002045 vs. ExPASy Swiss-Prot
Match:
Q84WN0 (Uncharacterized protein At4g37920 OS=Arabidopsis thaliana OX=3702 GN=At4g37920 PE=2 SV=2)
HSP 1 Score: 299.3 bits (765), Expect = 7.2e-80
Identity = 150/373 (40.21%), Positives = 245/373 (65.68%), Query Frame = 0
Query: 68 FSAVVRGVSAVPSDCN----SETLDWSNPTQYEPRDVQNAKDGVESLDQPKMTKVCDKLI 127
FSA + G + ++T+ ++ T E + VE + M + CDK+I
Sbjct: 32 FSAFINGGRKIRKSSTITFATDTVTYNGTTSAEVKSSVEDPMEVEVAEGYTMAQFCDKII 91
Query: 128 EVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFRRCQDRAASEDDPGMRHKLLRLGRKLKEI 187
++F+ +KP W+ + EW+ +F++RC+ RA +E DP ++ KL+ L K+K+I
Sbjct: 92 DLFLNEKPKVKQWKTYLVLRDEWNKYSVNFYKRCRIRADTETDPILKQKLVSLESKVKKI 151
Query: 188 DEDVKRHNELLEVVRAAAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQNALAK 247
D+++++HN+LL+ ++ P+++ I ++RR+DFT EFF ++ ++E+ D +++A+A+
Sbjct: 152 DKEMEKHNDLLKEIQ-ENPTDINAIAAKRRRDFTGEFFRYVTLLSET-LDGLEDRDAVAR 211
Query: 248 LGNSCLSAVQTYDAATENIEALNAAELKFQDIINSPTLDAACRKIDNLAEKNQLDSALVL 307
L CLSAV YD E++E L+ A+ KF+DI+NSP++D+AC KI +LA+ +LDS+L+L
Sbjct: 212 LATRCLSAVSAYDNTLESVETLDTAQAKFEDILNSPSVDSACEKIRSLAKAKELDSSLIL 271
Query: 308 MITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPEEKLSA 367
+I A++AAKES + +E KDI+YHLY +L+ + PKEI++LKYLL I DPEE+ SA
Sbjct: 272 LINSAYAAAKESQTVTNEAKDIMYHLYKATKSSLRSITPKEIKLLKYLLNITDPEERFSA 331
Query: 368 LKDAFTPGEELEGQDVDCLYTTPEKLHTWIKTVVDAYHFSREGTLIKEARDLMNPQVIVK 427
L AF+PG++ E +D LYTTP++LH WIK ++DAYH ++E T IKEA+ + P VI +
Sbjct: 332 LATAFSPGDDHEAKDPKALYTTPKELHKWIKIMLDAYHLNKEETDIKEAKQMSQPIVIQR 391
Query: 428 LEELKHLLEKNFM 437
L LK +E ++
Sbjct: 392 LFILKDTIEDEYL 402
BLAST of Bhi04G002045 vs. ExPASy TrEMBL
Match:
A0A1S3B306 (uncharacterized protein At4g37920, chloroplastic isoform X2 OS=Cucumis melo OX=3656 GN=LOC103485431 PE=4 SV=1)
HSP 1 Score: 792.3 bits (2045), Expect = 9.9e-226
Identity = 405/437 (92.68%), Positives = 415/437 (94.97%), Query Frame = 0
Query: 1 MELHCATLQTSFSFSIRSKALAHGDASAVCSPSSPSFSRITARNFSMGSKSRGFPSLVCR 60
MELH ATLQTSFSFSIR K+LA GDASA CSPSSPS SRIT RNFS+GSKSRGFPSL+CR
Sbjct: 1 MELHSATLQTSFSFSIRGKSLALGDASAACSPSSPSLSRITVRNFSLGSKSRGFPSLLCR 60
Query: 61 VRLKKSSFSAVVRGVSAVPSDCNSETLDWSNPTQYEP-RDVQNAKDGVESLDQPKMTKVC 120
R KKSSFS VRGVSAVPSDCNSETLD NP+ E RDVQNAKD VESLDQ KMTKVC
Sbjct: 61 DRPKKSSFSTFVRGVSAVPSDCNSETLDSLNPSPVESVRDVQNAKDSVESLDQHKMTKVC 120
Query: 121 DKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFRRCQDRAASEDDPGMRHKLLRLGRK 180
DKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFF RCQDRAASEDDPGM+HKLLRLGRK
Sbjct: 121 DKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRLGRK 180
Query: 181 LKEIDEDVKRHNELLEVVRAAAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQN 240
LKEIDEDV+RHNELLEVVRA APSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQN
Sbjct: 181 LKEIDEDVQRHNELLEVVRATAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQN 240
Query: 241 ALAKLGNSCLSAVQTYDAATENIEALNAAELKFQDIINSPTLDAACRKIDNLAEKNQLDS 300
ALAKLGNSCL+AVQTYDAATENIEAL+AAELKFQDIINSPTLDAACRKIDNLAEKNQLDS
Sbjct: 241 ALAKLGNSCLAAVQTYDAATENIEALDAAELKFQDIINSPTLDAACRKIDNLAEKNQLDS 300
Query: 301 ALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPEE 360
ALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPEE
Sbjct: 301 ALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPEE 360
Query: 361 KLSALKDAFTPGEELEGQDVDCLYTTPEKLHTWIKTVVDAYHFSREGTLIKEARDLMNPQ 420
KLSALKDAFTPGEE+EGQDVDCLYTTP+KLH WIKTVVDAYHFSREGTLIKEARDLMNPQ
Sbjct: 361 KLSALKDAFTPGEEVEGQDVDCLYTTPDKLHAWIKTVVDAYHFSREGTLIKEARDLMNPQ 420
Query: 421 VIVKLEELKHLLEKNFM 437
VIVKLEELKHLLEK FM
Sbjct: 421 VIVKLEELKHLLEKKFM 437
BLAST of Bhi04G002045 vs. ExPASy TrEMBL
Match:
A0A1S3B2I9 (uncharacterized protein At4g37920, chloroplastic isoform X1 OS=Cucumis melo OX=3656 GN=LOC103485431 PE=4 SV=1)
HSP 1 Score: 787.7 bits (2033), Expect = 2.4e-224
Identity = 405/438 (92.47%), Positives = 415/438 (94.75%), Query Frame = 0
Query: 1 MELHCATLQTSFSFSIRSKALAHGDASAVCSPSSPSFSRITARNFSMGSKSR-GFPSLVC 60
MELH ATLQTSFSFSIR K+LA GDASA CSPSSPS SRIT RNFS+GSKSR GFPSL+C
Sbjct: 1 MELHSATLQTSFSFSIRGKSLALGDASAACSPSSPSLSRITVRNFSLGSKSRAGFPSLLC 60
Query: 61 RVRLKKSSFSAVVRGVSAVPSDCNSETLDWSNPTQYEP-RDVQNAKDGVESLDQPKMTKV 120
R R KKSSFS VRGVSAVPSDCNSETLD NP+ E RDVQNAKD VESLDQ KMTKV
Sbjct: 61 RDRPKKSSFSTFVRGVSAVPSDCNSETLDSLNPSPVESVRDVQNAKDSVESLDQHKMTKV 120
Query: 121 CDKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFRRCQDRAASEDDPGMRHKLLRLGR 180
CDKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFF RCQDRAASEDDPGM+HKLLRLGR
Sbjct: 121 CDKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRLGR 180
Query: 181 KLKEIDEDVKRHNELLEVVRAAAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQ 240
KLKEIDEDV+RHNELLEVVRA APSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQ
Sbjct: 181 KLKEIDEDVQRHNELLEVVRATAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQ 240
Query: 241 NALAKLGNSCLSAVQTYDAATENIEALNAAELKFQDIINSPTLDAACRKIDNLAEKNQLD 300
NALAKLGNSCL+AVQTYDAATENIEAL+AAELKFQDIINSPTLDAACRKIDNLAEKNQLD
Sbjct: 241 NALAKLGNSCLAAVQTYDAATENIEALDAAELKFQDIINSPTLDAACRKIDNLAEKNQLD 300
Query: 301 SALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPE 360
SALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPE
Sbjct: 301 SALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPE 360
Query: 361 EKLSALKDAFTPGEELEGQDVDCLYTTPEKLHTWIKTVVDAYHFSREGTLIKEARDLMNP 420
EKLSALKDAFTPGEE+EGQDVDCLYTTP+KLH WIKTVVDAYHFSREGTLIKEARDLMNP
Sbjct: 361 EKLSALKDAFTPGEEVEGQDVDCLYTTPDKLHAWIKTVVDAYHFSREGTLIKEARDLMNP 420
Query: 421 QVIVKLEELKHLLEKNFM 437
QVIVKLEELKHLLEK FM
Sbjct: 421 QVIVKLEELKHLLEKKFM 438
BLAST of Bhi04G002045 vs. ExPASy TrEMBL
Match:
A0A0A0LMH6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G403700 PE=4 SV=1)
HSP 1 Score: 773.1 bits (1995), Expect = 6.2e-220
Identity = 391/437 (89.47%), Positives = 408/437 (93.36%), Query Frame = 0
Query: 1 MELHCATLQTSFSFSIRSKALAHGDASAVCSPSSPSFSRITARNFSMGSKSRGFPSLVCR 60
MELH ATL TSFSFSIRS LAHGDASA CSPS PS SRIT RNFS+GSKSRGFPSLVC
Sbjct: 1 MELHSATLHTSFSFSIRSTPLAHGDASAACSPSLPSLSRITIRNFSLGSKSRGFPSLVCH 60
Query: 61 VRLKKSSFSAVVRGVSAVPSDCNSETLDWSNPTQYEP-RDVQNAKDGVESLDQPKMTKVC 120
R KKSSFSA VRGV AVPSDCNSETLD NP+ EP RDVQNAKD VE+LDQ KMTKVC
Sbjct: 61 DRPKKSSFSAFVRGVKAVPSDCNSETLDLLNPSSDEPVRDVQNAKDSVENLDQHKMTKVC 120
Query: 121 DKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFRRCQDRAASEDDPGMRHKLLRLGRK 180
DKLIEVFMIDKPTP DWRRLIAFSKEWDNIRPHFF RCQDRAASEDDPGM+HKLLR GRK
Sbjct: 121 DKLIEVFMIDKPTPKDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRFGRK 180
Query: 181 LKEIDEDVKRHNELLEVVRAAAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQN 240
LKEIDEDV+RHNELLEVVRA +PSELGEIISRRRKDFTKEFFVHLHTVA+SYYDDPA+QN
Sbjct: 181 LKEIDEDVQRHNELLEVVRATSPSELGEIISRRRKDFTKEFFVHLHTVAQSYYDDPAKQN 240
Query: 241 ALAKLGNSCLSAVQTYDAATENIEALNAAELKFQDIINSPTLDAACRKIDNLAEKNQLDS 300
ALAKLGNSCL+AVQTYDAATENIEALNAAELKFQDIINSPT+DAACRKIDNLAEKNQLDS
Sbjct: 241 ALAKLGNSCLAAVQTYDAATENIEALNAAELKFQDIINSPTIDAACRKIDNLAEKNQLDS 300
Query: 301 ALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPEE 360
ALVLMITKAWSAAKESNMMK+E KDILYHLYVTA GNLQRLMPKEIRILKYLLTI DPEE
Sbjct: 301 ALVLMITKAWSAAKESNMMKEEAKDILYHLYVTARGNLQRLMPKEIRILKYLLTINDPEE 360
Query: 361 KLSALKDAFTPGEELEGQDVDCLYTTPEKLHTWIKTVVDAYHFSREGTLIKEARDLMNPQ 420
KLSALKDAFTPGEELEGQDVDCLYTTPE+LHTW+KTVVDAYHFSREGTL++EARDLMNPQ
Sbjct: 361 KLSALKDAFTPGEELEGQDVDCLYTTPEELHTWVKTVVDAYHFSREGTLVREARDLMNPQ 420
Query: 421 VIVKLEELKHLLEKNFM 437
+IVKLEELK L+EK FM
Sbjct: 421 LIVKLEELKGLIEKKFM 437
BLAST of Bhi04G002045 vs. ExPASy TrEMBL
Match:
A0A6J1JSR8 (uncharacterized protein At4g37920-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111489439 PE=4 SV=1)
HSP 1 Score: 756.9 bits (1953), Expect = 4.6e-215
Identity = 390/437 (89.24%), Positives = 406/437 (92.91%), Query Frame = 0
Query: 1 MELHCATLQTSFSFSIRSKALAHGDASAVCSPSSPSFSRITARNFSMGSKSRGFPSLVCR 60
MELHCATLQ SFSF IR K L HGDASA CS SS S SRITAR+FS+GSKSRGFPSL R
Sbjct: 1 MELHCATLQASFSFYIRGKTLPHGDASATCSSSSSSVSRITARSFSLGSKSRGFPSLTWR 60
Query: 61 VRLKKSSFSAVVRGVSAVPSDCNSETLDWSNPTQYEP-RDVQNAKDGVESLDQPKMTKVC 120
VRLKKSS SAVVRG SA PS C+++TLD SN T + RDVQNAK+ VE LDQ KMTKVC
Sbjct: 61 VRLKKSSSSAVVRGGSAEPSHCSTDTLDSSNTTPDDSVRDVQNAKNDVECLDQHKMTKVC 120
Query: 121 DKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFRRCQDRAASEDDPGMRHKLLRLGRK 180
DKLIEVF+IDKPTPTDWRRLIAFSK WDNIRPHFFRRCQ+RAASEDDPGMRHKLLRLGRK
Sbjct: 121 DKLIEVFLIDKPTPTDWRRLIAFSKTWDNIRPHFFRRCQERAASEDDPGMRHKLLRLGRK 180
Query: 181 LKEIDEDVKRHNELLEVVRAAAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQN 240
LKEIDEDV+RHNELLEVVRAAAPSELGEIISRRRKDFTKEFFVHLHTVAESYY DPAEQN
Sbjct: 181 LKEIDEDVQRHNELLEVVRAAAPSELGEIISRRRKDFTKEFFVHLHTVAESYYADPAEQN 240
Query: 241 ALAKLGNSCLSAVQTYDAATENIEALNAAELKFQDIINSPTLDAACRKIDNLAEKNQLDS 300
ALAKLGNSCL AVQ YDAATENIEALNAAELKFQDIINSPTLDAACRKID+LAEKNQLDS
Sbjct: 241 ALAKLGNSCLVAVQAYDAATENIEALNAAELKFQDIINSPTLDAACRKIDSLAEKNQLDS 300
Query: 301 ALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPEE 360
ALVLMI+KAWSAAKESNMMKDEVKDILYHLYVTA GNLQRLMPKEIRILKYLLTIKDPEE
Sbjct: 301 ALVLMISKAWSAAKESNMMKDEVKDILYHLYVTARGNLQRLMPKEIRILKYLLTIKDPEE 360
Query: 361 KLSALKDAFTPGEELEGQDVDCLYTTPEKLHTWIKTVVDAYHFSREGTLIKEARDLMNPQ 420
KLSALKDAFTPGEE+EGQDVDCLYTTPEKLHTWIKTV+DAYHFSREGTL+KEARDLMNPQ
Sbjct: 361 KLSALKDAFTPGEEVEGQDVDCLYTTPEKLHTWIKTVLDAYHFSREGTLVKEARDLMNPQ 420
Query: 421 VIVKLEELKHLLEKNFM 437
VIVKLEELK L+EK FM
Sbjct: 421 VIVKLEELKLLVEKKFM 437
BLAST of Bhi04G002045 vs. ExPASy TrEMBL
Match:
A0A6J1JWA0 (uncharacterized protein At4g37920-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111489439 PE=4 SV=1)
HSP 1 Score: 752.3 bits (1941), Expect = 1.1e-213
Identity = 390/438 (89.04%), Positives = 406/438 (92.69%), Query Frame = 0
Query: 1 MELHCATLQTSFSFSIRSKALAHGDASAVCSPSSPSFSRITARNFSMGSKSR-GFPSLVC 60
MELHCATLQ SFSF IR K L HGDASA CS SS S SRITAR+FS+GSKSR GFPSL
Sbjct: 1 MELHCATLQASFSFYIRGKTLPHGDASATCSSSSSSVSRITARSFSLGSKSRAGFPSLTW 60
Query: 61 RVRLKKSSFSAVVRGVSAVPSDCNSETLDWSNPTQYEP-RDVQNAKDGVESLDQPKMTKV 120
RVRLKKSS SAVVRG SA PS C+++TLD SN T + RDVQNAK+ VE LDQ KMTKV
Sbjct: 61 RVRLKKSSSSAVVRGGSAEPSHCSTDTLDSSNTTPDDSVRDVQNAKNDVECLDQHKMTKV 120
Query: 121 CDKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFRRCQDRAASEDDPGMRHKLLRLGR 180
CDKLIEVF+IDKPTPTDWRRLIAFSK WDNIRPHFFRRCQ+RAASEDDPGMRHKLLRLGR
Sbjct: 121 CDKLIEVFLIDKPTPTDWRRLIAFSKTWDNIRPHFFRRCQERAASEDDPGMRHKLLRLGR 180
Query: 181 KLKEIDEDVKRHNELLEVVRAAAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQ 240
KLKEIDEDV+RHNELLEVVRAAAPSELGEIISRRRKDFTKEFFVHLHTVAESYY DPAEQ
Sbjct: 181 KLKEIDEDVQRHNELLEVVRAAAPSELGEIISRRRKDFTKEFFVHLHTVAESYYADPAEQ 240
Query: 241 NALAKLGNSCLSAVQTYDAATENIEALNAAELKFQDIINSPTLDAACRKIDNLAEKNQLD 300
NALAKLGNSCL AVQ YDAATENIEALNAAELKFQDIINSPTLDAACRKID+LAEKNQLD
Sbjct: 241 NALAKLGNSCLVAVQAYDAATENIEALNAAELKFQDIINSPTLDAACRKIDSLAEKNQLD 300
Query: 301 SALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPE 360
SALVLMI+KAWSAAKESNMMKDEVKDILYHLYVTA GNLQRLMPKEIRILKYLLTIKDPE
Sbjct: 301 SALVLMISKAWSAAKESNMMKDEVKDILYHLYVTARGNLQRLMPKEIRILKYLLTIKDPE 360
Query: 361 EKLSALKDAFTPGEELEGQDVDCLYTTPEKLHTWIKTVVDAYHFSREGTLIKEARDLMNP 420
EKLSALKDAFTPGEE+EGQDVDCLYTTPEKLHTWIKTV+DAYHFSREGTL+KEARDLMNP
Sbjct: 361 EKLSALKDAFTPGEEVEGQDVDCLYTTPEKLHTWIKTVLDAYHFSREGTLVKEARDLMNP 420
Query: 421 QVIVKLEELKHLLEKNFM 437
QVIVKLEELK L+EK FM
Sbjct: 421 QVIVKLEELKLLVEKKFM 438
The following BLAST results are available for this feature:
BLAST of Bhi04G002045 vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G36320.1 | 4.1e-147 | 73.08 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G37920.1 | 5.1e-81 | 40.21 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
BLAST of Bhi04G002045 vs. ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q84WN0 | 7.2e-80 | 40.21 | Uncharacterized protein At4g37920 OS=Arabidopsis thaliana OX=3702 GN=At4g37920 P...
BLAST of Bhi04G002045 vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A1S3B306 | 9.9e-226 | 92.68 | uncharacterized protein At4g37920, chloroplastic isoform X2 OS=Cucumis melo OX=3...
A0A1S3B2I9 | 2.4e-224 | 92.47 | uncharacterized protein At4g37920, chloroplastic isoform X1 OS=Cucumis melo OX=3...
A0A0A0LMH6 | 6.2e-220 | 89.47 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G403700 PE=4 SV=1
A0A6J1JSR8 | 4.6e-215 | 89.24 | uncharacterized protein At4g37920-like isoform X2 OS=Cucurbita maxima OX=3661 GN...
A0A6J1JWA0 | 1.1e-213 | 89.04 | uncharacterized protein At4g37920-like isoform X1 OS=Cucurbita maxima OX=3661 GN...