Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CTCCTCATTTCTTTCTCTCTTTTGCACATTCATCCAGCAATTCTATTTTCACAACTTTACACTCCAAATATATACTTTCTATTATTAACATATTAAATTCATACTTAACACCACCTTAAAAATCAAATGGAAACTAGAAAGTAGTAAATATATATACACTTTTTCTAAGATAAAAAAGGCAAGTTGGGAGTATGATTGGATAATGAAACAAACCTCGTGCTCCTTTTCCAACTTTTACCATAAATTTTACATCTAAAACTCCTTTTTTTCTCAAACCCATTTGGACGAAAAGTAAAGGCCAAGGAAGCACAGCCCCCACTGTTCTTCAATTTTGACTCTTTTTTTCTTTTTCCATTTCTAAGGCAGGCAAAAAGGAGTGTTTTGAAGTTTTAACAGATAATAAAGTAAACACTCACACACCCAACAAAAAGTAGTAGGGGGCCAAAACCATAACCAGAAATCTGACCCTCCTTTTTCTTCTTCTTCTTCTCCTCCTTCTGATTCTCTGTAAGGTTTGTCTTTTAAAAACCCCACATCCCAAAAACCCCCTTAATGCTTGAAAAATCTTTTTTTTCTAAGACACCATGAAATTGGGTAGAGAGAATAAAGGAATCCCTTCAGCGGATTTGTTGGTTTGTTTTCCTTCTCGGTCGCATTTGGCTTTAATGCCAAACCCACTTTGTAGTCCAGCGAGAGGGTCCGATTCCAGTAAGTTTCGTTTAAGTCACCGCCATTACCACCGGCGGAGGAAGTCGGCGGAGAGTCCGGTGGTATGGGCCAAAGCGAAGACGATGGGGTCGGAGATATCGGAACCGTCGTCGCCGAAAGTGACATGTGCAGGGCAGATAAAGATCAGGCCGAAGAATAGCAAGAGCTGGCAATCGGTGATGGAGGAGATAGAGAGAATTCATAATAGGAGGAAATTACGGAGGAGGAGGTTTCATTGGGTTGAATCTTTAGGGTTCAAGAAAGATATTATGCAATTCTTGACGTGTTTACGGAACATACGGTTTGATTTTAGGTGTTTCAGAGCTTTCCCAGCAACAGATTTCACCACTGAAGAAGAAGAAGAAGAAGAAGAGGAAGAAGAAGAAAAATCTCAAGGGAATCAGGTGGGTGTTGATGAAAATGAGAGCTCAAGAACTGCATTTTCTAAATGGTTTATGGTTTTACAGGAAAATGGGAGTAATGAGTTAAAGAGAGAGAGCAAAATTCTCTGTAGTGATGATGATGTATCGATTGAGGCAGCAATGGCACCACCCAAAAACGCCCTTTTGCTTATGCGTTGTAGGTCTGCTCCAGCAAAGAGATGGTTGGAAGAAGAATCTGAAGAAGAAGAAGATGATGATGATGATGATGATGAAAAGGAAGAAGTGAAGGTGAAGAAGAGCTTGAAATGGCTAATGGAGGAAGAAAACAGAGAGAGATTGGTTATGGAAACGGGCACTGATTTCTGCAGAATGACATCGGACATTGCAAAAGAGACATGGGTTGTTAGTGAAAAGAGCAGGGATTTGTTTACAAGGAGCCATAGTTGGAAAGTTTGATCACTGGTTATGGAAGACATAAATTTCAGCTTCATCATCCTTTTTTTTTTTTTTTTTTCATGATTTGAATATTTTTGAAAATTCTTGAGTTTGAATCTGAGTAAGATTGGTTTTGGTGGCTTGTTGTACAGTTAATCTATAATTTCGTTGGAGAGAATTTTGTTGTCATGTTCATGTACGAGACACGCTTGTGGTCAAATATAAGATTGGGCTGTTAATCCAAGTGTTCCATAGAAAATGATGAAATGTAATTATAGAAAGGGTTTTCTTTTTCGATCCTTTATTTGTATCTTTCTGTTTTTCTTTTAATTAAAAAGAAAACGTGTTTGATTTTAGCTCTCTTTAA
mRNA sequence
CTCCTCATTTCTTTCTCTCTTTTGCACATTCATCCAGCAATTCTATTTTCACAACTTTACACTCCAAATATATACTTTCTATTATTAACATATTAAATTCATACTTAACACCACCTTAAAAATCAAATGGAAACTAGAAAGTAGTAAATATATATACACTTTTTCTAAGATAAAAAAGGCAAGTTGGGAGTATGATTGGATAATGAAACAAACCTCGTGCTCCTTTTCCAACTTTTACCATAAATTTTACATCTAAAACTCCTTTTTTTCTCAAACCCATTTGGACGAAAAGTAAAGGCCAAGGAAGCACAGCCCCCACTGTTCTTCAATTTTGACTCTTTTTTTCTTTTTCCATTTCTAAGGCAGGCAAAAAGGAGTGTTTTGAAGTTTTAACAGATAATAAAGTAAACACTCACACACCCAACAAAAAGTAGTAGGGGGCCAAAACCATAACCAGAAATCTGACCCTCCTTTTTCTTCTTCTTCTTCTCCTCCTTCTGATTCTCTGTAAGGTTTGTCTTTTAAAAACCCCACATCCCAAAAACCCCCTTAATGCTTGAAAAATCTTTTTTTTCTAAGACACCATGAAATTGGGTAGAGAGAATAAAGGAATCCCTTCAGCGGATTTGTTGGTTTGTTTTCCTTCTCGGTCGCATTTGGCTTTAATGCCAAACCCACTTTGTAGTCCAGCGAGAGGGTCCGATTCCAGTAAGTTTCGTTTAAGTCACCGCCATTACCACCGGCGGAGGAAGTCGGCGGAGAGTCCGGTGGTATGGGCCAAAGCGAAGACGATGGGGTCGGAGATATCGGAACCGTCGTCGCCGAAAGTGACATGTGCAGGGCAGATAAAGATCAGGCCGAAGAATAGCAAGAGCTGGCAATCGGTGATGGAGGAGATAGAGAGAATTCATAATAGGAGGAAATTACGGAGGAGGAGGTTTCATTGGGTTGAATCTTTAGGGTTCAAGAAAGATATTATGCAATTCTTGACGTGTTTACGGAACATACGGTTTGATTTTAGGTGTTTCAGAGCTTTCCCAGCAACAGATTTCACCACTGAAGAAGAAGAAGAAGAAGAAGAGGAAGAAGAAGAAAAATCTCAAGGGAATCAGGTGGGTGTTGATGAAAATGAGAGCTCAAGAACTGCATTTTCTAAATGGTTTATGGTTTTACAGGAAAATGGGAGTAATGAGTTAAAGAGAGAGAGCAAAATTCTCTGTAGTGATGATGATGTATCGATTGAGGCAGCAATGGCACCACCCAAAAACGCCCTTTTGCTTATGCGTTGTAGGTCTGCTCCAGCAAAGAGATGGTTGGAAGAAGAATCTGAAGAAGAAGAAGATGATGATGATGATGATGATGAAAAGGAAGAAGTGAAGGTGAAGAAGAGCTTGAAATGGCTAATGGAGGAAGAAAACAGAGAGAGATTGGTTATGGAAACGGGCACTGATTTCTGCAGAATGACATCGGACATTGCAAAAGAGACATGGGTTGTTAGTGAAAAGAGCAGGGATTTGTTTACAAGGAGCCATAGTTGGAAAGTTTGATCACTGGTTATGGAAGACATAAATTTCAGCTTCATCATCCTTTTTTTTTTTTTTTTTTCATGATTTGAATATTTTTGAAAATTCTTGAGTTTGAATCTGAGTAAGATTGGTTTTGGTGGCTTGTTGTACAGTTAATCTATAATTTCGTTGGAGAGAATTTTGTTGTCATGTTCATGTACGAGACACGCTTGTGGTCAAATATAAGATTGGGCTGTTAATCCAAGTGTTCCATAGAAAATGATGAAATGTAATTATAGAAAGGGTTTTCTTTTTCGATCCTTTATTTGTATCTTTCTGTTTTTCTTTTAATTAAAAAGAAAACGTGTTTGATTTTAGCTCTCTTTAA
Coding sequence (CDS)
ATGAAATTGGGTAGAGAGAATAAAGGAATCCCTTCAGCGGATTTGTTGGTTTGTTTTCCTTCTCGGTCGCATTTGGCTTTAATGCCAAACCCACTTTGTAGTCCAGCGAGAGGGTCCGATTCCAGTAAGTTTCGTTTAAGTCACCGCCATTACCACCGGCGGAGGAAGTCGGCGGAGAGTCCGGTGGTATGGGCCAAAGCGAAGACGATGGGGTCGGAGATATCGGAACCGTCGTCGCCGAAAGTGACATGTGCAGGGCAGATAAAGATCAGGCCGAAGAATAGCAAGAGCTGGCAATCGGTGATGGAGGAGATAGAGAGAATTCATAATAGGAGGAAATTACGGAGGAGGAGGTTTCATTGGGTTGAATCTTTAGGGTTCAAGAAAGATATTATGCAATTCTTGACGTGTTTACGGAACATACGGTTTGATTTTAGGTGTTTCAGAGCTTTCCCAGCAACAGATTTCACCACTGAAGAAGAAGAAGAAGAAGAAGAGGAAGAAGAAGAAAAATCTCAAGGGAATCAGGTGGGTGTTGATGAAAATGAGAGCTCAAGAACTGCATTTTCTAAATGGTTTATGGTTTTACAGGAAAATGGGAGTAATGAGTTAAAGAGAGAGAGCAAAATTCTCTGTAGTGATGATGATGTATCGATTGAGGCAGCAATGGCACCACCCAAAAACGCCCTTTTGCTTATGCGTTGTAGGTCTGCTCCAGCAAAGAGATGGTTGGAAGAAGAATCTGAAGAAGAAGAAGATGATGATGATGATGATGATGAAAAGGAAGAAGTGAAGGTGAAGAAGAGCTTGAAATGGCTAATGGAGGAAGAAAACAGAGAGAGATTGGTTATGGAAACGGGCACTGATTTCTGCAGAATGACATCGGACATTGCAAAAGAGACATGGGTTGTTAGTGAAAAGAGCAGGGATTTGTTTACAAGGAGCCATAGTTGGAAAGTTTGA
Protein sequence
MKLGRENKGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLSHRHYHRRRKSAESPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRFHWVESLGFKKDIMQFLTCLRNIRFDFRCFRAFPATDFTTEEEEEEEEEEEEKSQGNQVGVDENESSRTAFSKWFMVLQENGSNELKRESKILCSDDDVSIEAAMAPPKNALLLMRCRSAPAKRWLEEESEEEEDDDDDDDEKEEVKVKKSLKWLMEEENRERLVMETGTDFCRMTSDIAKETWVVSEKSRDLFTRSHSWKV
Homology
BLAST of Bhi09G002208 vs. TAIR 10
Match:
AT1G78110.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 7 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G22230.1); Has 5452 Blast hits to 3541 proteins in 289 species: Archae - 4; Bacteria - 165; Metazoa - 1756; Fungi - 532; Plants - 205; Viruses - 141; Other Eukaryotes - 2649 (source: NCBI BLink). )
HSP 1 Score: 263.8 bits (673), Expect = 1.7e-70
Identity = 170/339 (50.15%), Postives = 224/339 (66.08%), Query Frame = 0
Query: 12 SADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLSHRHYHRRRKSA---------ESPV 71
SADLLVCFPSR+HLAL P P+CSP+R SDSS R R +HRR+ S SPV
Sbjct: 17 SADLLVCFPSRTHLALTPKPICSPSRPSDSSTNR---RPHHRRQLSKLSGGGGGGHGSPV 76
Query: 72 VWAK---AKTM-GSEISEPSSPKVTCAGQIKIRPK----NSKSWQSVMEEIERIHNRRKL 131
+WAK +K M G EI+EP+SPKVTCAGQIK+RP K+WQSVMEEIERIH+ R
Sbjct: 77 LWAKQASSKNMGGDEIAEPTSPKVTCAGQIKVRPSKCGGRGKNWQSVMEEIERIHDNRSQ 136
Query: 132 RRRRFHWVESLGFKKDIMQFLTCLRNIRFDFRCFRAFPATDFTTEEEEEEEEEEEEKSQG 191
+ G KKD+M FLTCLRNI+FDFRCF F D T++++EEE+++++E+ +
Sbjct: 137 SK-------FFGLKKDVMGFLTCLRNIKFDFRCFGDFRHADVTSDDDEEEDDDDDEEEE- 196
Query: 192 NQVGVDENESSRTAFSKWFMVLQENGSNELKRESKILCSD----DDVSIEAAMAPPKNAL 251
V +E E+S+T FSKWFMVLQE +N+ ++ C + +D E A+ PP NAL
Sbjct: 197 -VVEGEEEENSKTVFSKWFMVLQEEQNNKDDDKNNNKCDEKRDLEDTETEPAV-PPPNAL 256
Query: 252 LLMRCRSAPAKRWLE---------EESEEEEDDDDDDDEKEEVKV-KKSLKWLMEEENRE 311
LLMRCRSAPAK WLE E+ EE++++ + +D++ +K KK L+ LMEEE E
Sbjct: 257 LLMRCRSAPAKSWLEERMKVKTEQEKREEQKEEKETEDQETSMKTKKKDLRSLMEEEKME 316
Query: 312 RLVMETGTDFCRMTSDIAKETWVVSEKSRDLFTRSHSWK 320
++M T+F R++SDIAKETWVV +D +RS SWK
Sbjct: 317 LVLMRYDTEFYRLSSDIAKETWVVG-GIQDPLSRSRSWK 341
BLAST of Bhi09G002208 vs. TAIR 10
Match:
AT1G22230.1 (unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G78110.1); Has 2358 Blast hits to 1759 proteins in 159 species: Archae - 2; Bacteria - 36; Metazoa - 1046; Fungi - 203; Plants - 157; Viruses - 72; Other Eukaryotes - 842 (source: NCBI BLink). )
HSP 1 Score: 211.8 bits (538), Expect = 7.8e-55
Identity = 153/324 (47.22%), Postives = 196/324 (60.49%), Query Frame = 0
Query: 12 SADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLSHRHYHRRRKSAESPVVWAKAKTMG 71
SADL+VCFPSR+HL+L + SP+ SS R + +HRR S S + G
Sbjct: 13 SADLMVCFPSRAHLSLPSKSISSPS----SSFNRRQNAPHHRRSISKLSSSGGGVRQNRG 72
Query: 72 ---SEISEPSSPKVTCAGQIKIRPK----NSKSWQSVMEEIERIHNRRKLRRRRFHWVES 131
+ EP+SPKVTCAGQIK+R K+WQS+M EIE+IH R K + F
Sbjct: 73 GGREVVEEPTSPKVTCAGQIKVRSSKRDGGGKNWQSLMAEIEKIH-RSKSESKFF----- 132
Query: 132 LGFKKDIMQFLTCLRNIRFDFRCFRAFPATDFTTEEEEEEEEEEEEKSQGNQVGVDENES 191
G K+D+M FLTCLR+ FDFRCF AFP D +++EEE+EEEEEE + DE+ES
Sbjct: 133 -GIKRDVMGFLTCLRD--FDFRCFGAFPPVDIISDDEEEDEEEEEEDEE-----EDEDES 192
Query: 192 SRTAFSKWFMVLQENGSNELKRESKILCSDDDVSIEAAMAPPKNALLLMRCRSAPAKRWL 251
S T FSKW MVL E +NE + K D +E A+ PP NALLLMRCRSAP K W
Sbjct: 193 SGTVFSKWLMVLHEKQNNEECVDGKENVFSD---VETAV-PPPNALLLMRCRSAPVKNWS 252
Query: 252 EEESEEEEDDD--------DDDDEKEEVKVKKSLKWLMEEENRERL-VMETGTDFCRMTS 311
EE+ EE E+ D ++++EK+ V KK L+ LMEEE + L VM T++ ++++
Sbjct: 253 EEKKEETEEGDNRVKQSGEEEEEEKDRVGNKKDLRSLMEEEKKMNLVVMNYDTNYYKLSN 312
Query: 312 DIAKETWVVSEKSRDLFTRSHSWK 320
DIAKETWVV LF RS SWK
Sbjct: 313 DIAKETWVVGGIQDPLF-RSRSWK 313
BLAST of Bhi09G002208 vs. TAIR 10
Match:
AT3G15095.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; Has 9762 Blast hits to 6439 proteins in 764 species: Archae - 77; Bacteria - 1339; Metazoa - 3211; Fungi - 718; Plants - 437; Viruses - 131; Other Eukaryotes - 3849 (source: NCBI BLink). )
HSP 1 Score: 45.8 bits (107), Expect = 7.4e-05
Identity = 91/379 (24.01%), Postives = 141/379 (37.20%), Query Frame = 0
Query: 7 NKGIPSADLLVCFPSR----SHLALMPNPLCSPARG-----SDSSKFRLSHRHYHRRRKS 66
N S DL +CF SR S + L + SPAR S S + R S +
Sbjct: 19 NNSGSSTDLFICFTSRFSSSSSMRLSSKSIHSPARSACLTTSLSRRLRTSGSLKNASAGV 78
Query: 67 AESPVVWA----KAKTMGSEIS--------EPSSPKVTCAGQIKIRPKNSKSWQSVMEEI 126
SP+ A K G E S EPSSPKVTC GQ++++
Sbjct: 79 LNSPMFGANGGRKRSGSGYENSNNNNNNNIEPSSPKVTCIGQVRVK-------------- 138
Query: 127 ERIHNRRKLRRRRFHWVESLGFKKDIMQ----------------------FLTCLRNIRF 186
R H ++K+R R F++ + Q LR+
Sbjct: 139 TRKHVKKKMRARSRRKGGENSFRRSVDQNDGGGGCRFKASENRLVHLPVTICESLRSFGS 198
Query: 187 DFRCFRAFPATDFTTEEEEEEEEEEEEKSQGNQVGVDENESSRTAFSKWFMVLQENGSNE 246
+ CF FP TE + E + G G + S F++WF+ ++E +
Sbjct: 199 ELNCF--FPCRSSCTENSHGDGRRAESNNDGCGGGGGGSNSCGAVFTRWFVAVEETSGGK 258
Query: 247 LKRESKILCSDDDVSIE----------------------------------AAMAPPKNA 306
+ ++ +D+V + + +PPKNA
Sbjct: 259 RREIELVVGGEDEVEEDRRRSRRRHVFEGLDLSEIEMKTEKKERGEEVGRMSICSPPKNA 318
Query: 307 LLLMRCRSAPAK-RWLEEESEEEEDDDDDDDEKEEVKVKKSLKWLMEEENRERLVMETGT 308
LLLMRCRS P K L E + +D EE + ++ ++ +E E+++R+
Sbjct: 319 LLLMRCRSDPVKVAALANRVRERQLSLNDGVYTEEEEDERRRRFELEIEDKKRI------ 373
BLAST of Bhi09G002208 vs. ExPASy TrEMBL
Match:
A0A0A0L1Z4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G377750 PE=4 SV=1)
HSP 1 Score: 511.5 bits (1316), Expect = 2.5e-141
Identity = 266/321 (82.87%), Postives = 288/321 (89.72%), Query Frame = 0
Query: 1 MKLGRE-NKGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLSHRHYHRRRKSAE 60
MKL RE +KGIPS+DLLVCFPSRSHLALMPNPLCSPARGSDSSKFRL +R YHRRRKSAE
Sbjct: 1 MKLNREKSKGIPSSDLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLDYRRYHRRRKSAE 60
Query: 61 SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 120
SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF
Sbjct: 61 SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 120
Query: 121 HWVESLGFKKDIMQFLTCLRNIRFDFRCFRAFPATDFTTEEEEEEEEEEEEKSQGNQVGV 180
+W+ES GFKKDIMQFLTCLR +RFDFRCFRAFP TDFTTEEEEEEEEEEEE+ + NQVG+
Sbjct: 121 NWIESFGFKKDIMQFLTCLRTMRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEKNQVGI 180
Query: 181 DENESSRTAFSKWFMVLQENGSNELKRESKILCSDDDVSIEAAMAPPKNALLLMRCRSAP 240
+ENESSRTAFSKWFMVLQENGSNELKR+S C +DD SIEA MAPP+NALLLMRC+SAP
Sbjct: 181 EENESSRTAFSKWFMVLQENGSNELKRDSNSRCYEDDESIEATMAPPRNALLLMRCKSAP 240
Query: 241 AKRWLEEESEEEEDDDDDDDEKEEVKVKKSLKWLMEEENRERLVMETGTDFCRMTSDIAK 300
A+RW+EEESEEE+D+ + + EKE+VKVKKSLKWLMEEENRER+VME GTDFCRM SD AK
Sbjct: 241 ARRWMEEESEEEDDEKEKEKEKEKVKVKKSLKWLMEEENRERVVMEMGTDFCRMISDNAK 300
Query: 301 ETWVVSEKSRDLFTRSHSWKV 321
E FTRS SWKV
Sbjct: 301 E-----------FTRSQSWKV 310
BLAST of Bhi09G002208 vs. ExPASy TrEMBL
Match:
A0A5D3D503 (Transcription initiation factor IIE subunit alpha-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold119G001490 PE=4 SV=1)
HSP 1 Score: 509.6 bits (1311), Expect = 9.4e-141
Identity = 272/321 (84.74%), Postives = 285/321 (88.79%), Query Frame = 0
Query: 1 MKLGRE-NKGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLSHRHYHRRRKSAE 60
MKL RE +KGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRL HR +HRRRKSAE
Sbjct: 8 MKLNREKSKGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLDHRRFHRRRKSAE 67
Query: 61 SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 120
SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF
Sbjct: 68 SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 127
Query: 121 HWVESLGFKKDIMQFLTCLRNIRFDFRCFRAFPATDFTTEEEEEEEEEEEEKSQGNQVGV 180
WVES GFKKDIMQFLTCLR IRFDFRCFRAFP TDFTTEEEEEEEEEEE++ NQVG+
Sbjct: 128 RWVESFGFKKDIMQFLTCLRTIRFDFRCFRAFPETDFTTEEEEEEEEEEEDEK--NQVGI 187
Query: 181 DENESSRTAFSKWFMVLQENGSNELKRESKILCSDDDVSIEAAMAPPKNALLLMRCRSAP 240
+ENESSRTAFSKWFMVLQENGSNELKR+SK LC++DD SIEA MAPP NALLLMRCRSAP
Sbjct: 188 EENESSRTAFSKWFMVLQENGSNELKRDSKSLCNEDDESIEAIMAPPINALLLMRCRSAP 247
Query: 241 AKRWLEEESEEEEDDDDDDDEKEEVKVKKSLKWLMEEENRERLVMETGTDFCRMTSDIAK 300
A+RW+EEESEE DDEKE+VKVKKSLKWLMEEENRERLV+E GTDFCRMTSD AK
Sbjct: 248 ARRWMEEESEE------GDDEKEKVKVKKSLKWLMEEENRERLVVEMGTDFCRMTSDNAK 307
Query: 301 ETWVVSEKSRDLFTRSHSWKV 321
E FTRS SWKV
Sbjct: 308 E-----------FTRSQSWKV 309
BLAST of Bhi09G002208 vs. ExPASy TrEMBL
Match:
A0A1S3B949 (uncharacterized protein LOC103487551 OS=Cucumis melo OX=3656 GN=LOC103487551 PE=4 SV=1)
HSP 1 Score: 509.6 bits (1311), Expect = 9.4e-141
Identity = 272/321 (84.74%), Postives = 285/321 (88.79%), Query Frame = 0
Query: 1 MKLGRE-NKGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLSHRHYHRRRKSAE 60
MKL RE +KGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRL HR +HRRRKSAE
Sbjct: 8 MKLNREKSKGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLDHRRFHRRRKSAE 67
Query: 61 SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 120
SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF
Sbjct: 68 SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 127
Query: 121 HWVESLGFKKDIMQFLTCLRNIRFDFRCFRAFPATDFTTEEEEEEEEEEEEKSQGNQVGV 180
WVES GFKKDIMQFLTCLR IRFDFRCFRAFP TDFTTEEEEEEEEEEE++ NQVG+
Sbjct: 128 RWVESFGFKKDIMQFLTCLRTIRFDFRCFRAFPETDFTTEEEEEEEEEEEDEK--NQVGI 187
Query: 181 DENESSRTAFSKWFMVLQENGSNELKRESKILCSDDDVSIEAAMAPPKNALLLMRCRSAP 240
+ENESSRTAFSKWFMVLQENGSNELKR+SK LC++DD SIEA MAPP NALLLMRCRSAP
Sbjct: 188 EENESSRTAFSKWFMVLQENGSNELKRDSKSLCNEDDESIEAIMAPPINALLLMRCRSAP 247
Query: 241 AKRWLEEESEEEEDDDDDDDEKEEVKVKKSLKWLMEEENRERLVMETGTDFCRMTSDIAK 300
A+RW+EEESEE DDEKE+VKVKKSLKWLMEEENRERLV+E GTDFCRMTSD AK
Sbjct: 248 ARRWMEEESEE------GDDEKEKVKVKKSLKWLMEEENRERLVVEMGTDFCRMTSDNAK 307
Query: 301 ETWVVSEKSRDLFTRSHSWKV 321
E FTRS SWKV
Sbjct: 308 E-----------FTRSQSWKV 309
BLAST of Bhi09G002208 vs. ExPASy TrEMBL
Match:
A0A6J1D3C2 (uncharacterized protein LOC111016595 OS=Momordica charantia OX=3673 GN=LOC111016595 PE=4 SV=1)
HSP 1 Score: 449.5 bits (1155), Expect = 1.2e-122
Identity = 251/334 (75.15%), Postives = 276/334 (82.63%), Query Frame = 0
Query: 1 MKLGRENKGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLSHRHYHRRRK--SA 60
MKLGR+ K I SADLLVCFPSRS+L LMP PLCSPARG DS+K R SHRH+HRRRK SA
Sbjct: 1 MKLGRDAKAIHSADLLVCFPSRSNLTLMPKPLCSPARGLDSNKLRRSHRHHHRRRKSTSA 60
Query: 61 ESPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPK--NSKSWQSVMEEIERIHNRRKLRR 120
SP++WAK KTMGSEISEPSSPKVTCAGQIKIRPK + KSWQSVMEEIERIHNRRKLRR
Sbjct: 61 ASPLIWAKPKTMGSEISEPSSPKVTCAGQIKIRPKTGSCKSWQSVMEEIERIHNRRKLRR 120
Query: 121 RRFHWVESLGFKKDIMQFLTCLRNIRFDFRCFRAFPATDFTTEEEEEEEEEEEE-KSQGN 180
RR +WVESLGFKKDIMQFLTCLRNIRFDFRCF+AFP DFTTEEE+EEEEEEEE KSQ N
Sbjct: 121 RRSNWVESLGFKKDIMQFLTCLRNIRFDFRCFKAFPEADFTTEEEDEEEEEEEEGKSQEN 180
Query: 181 QVGVDENESSRTAFSKWFMVLQENG-SNELKRESKILCSDDDVSIEAAMAPPKNALLLMR 240
QVGV+ NESSRTAFSKWFMVLQE+G SN + RES +APPKNALLLMR
Sbjct: 181 QVGVEGNESSRTAFSKWFMVLQESGASNGICRESN----------GPPLAPPKNALLLMR 240
Query: 241 CRSAPAKRWLEEESEEEEDDDDDDDEKE---------EVKVKKSLKWLMEEENRERLVME 300
CRSAPAK W EEE EEEE+++++++E+E EVKVKKSLKWLMEEENRERLVME
Sbjct: 241 CRSAPAKSWQEEEEEEEEEEEEEEEEEEEAAAEEDEKEVKVKKSLKWLMEEENRERLVME 300
Query: 301 TGTDFCRMTSDIAKETWVVSEKSRDLFTRSHSWK 320
G DFCRM+S+IAKETWV RDLF+RS SWK
Sbjct: 301 MGPDFCRMSSEIAKETWV----GRDLFSRSRSWK 320
BLAST of Bhi09G002208 vs. ExPASy TrEMBL
Match:
A0A6J1IQQ3 (uncharacterized protein LOC111477333 OS=Cucurbita maxima OX=3661 GN=LOC111477333 PE=4 SV=1)
HSP 1 Score: 371.3 bits (952), Expect = 4.0e-99
Identity = 211/321 (65.73%), Postives = 235/321 (73.21%), Query Frame = 0
Query: 1 MKLGRENKGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLSHRHYHRRRKSAES 60
MKL R+ K IPS DLLVCFPSRSH ALMPNPLCSP R SDS+K R YHRRRKSAES
Sbjct: 1 MKLIRDIKAIPSPDLLVCFPSRSHFALMPNPLCSPVRASDSNKL----RRYHRRRKSAES 60
Query: 61 PVVWAKAKTM-GSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 120
PVVWAKAKT+ GSE+SEPSSPKVTCAGQIK+R K+ KSW+SVMEEIERIHNRR+LRRRRF
Sbjct: 61 PVVWAKAKTIGGSEVSEPSSPKVTCAGQIKMRRKSRKSWESVMEEIERIHNRRELRRRRF 120
Query: 121 HWVESLGFKKDIMQFLTCLRNIRFDFRCFRAFPATDFTTEEEEEEEEEEEEKSQGNQVGV 180
+WVESLGFKKDIMQFLTCLR+IRFDF CF AFP +FT+E+EEEEE VGV
Sbjct: 121 NWVESLGFKKDIMQFLTCLRSIRFDFGCFGAFPEAEFTSEDEEEEE-----------VGV 180
Query: 181 DENESSRTAFSKWFMVLQENGSNELKRESKILCSDDDVSIEAAMAPPKNALLLMRCRSAP 240
+ ++ SRTAFSKWFMVLQ +G ++R+ LC+ DD SI MAPP+NALLLMRCRSAP
Sbjct: 181 EGSDGSRTAFSKWFMVLQGSG---VRRDGNGLCTVDDASIGPPMAPPRNALLLMRCRSAP 240
Query: 241 AKRWLEEESEEEEDDDDDDDEKEEVKVKKSLKWLMEEENRERLVMETGTDFCRMTSDIAK 300
AK W+EE EEE+D EVKVKKSLKWLMEEENRE
Sbjct: 241 AKSWVEEACSEEEED-------TEVKVKKSLKWLMEEENRE------------------- 269
Query: 301 ETWVVSEKSRDLFTRSHSWKV 321
SRDL TRS SWKV
Sbjct: 301 --------SRDLVTRSRSWKV 269
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
AT1G78110.1 | 1.7e-70 | 50.15 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |
AT1G22230.1 | 7.8e-55 | 47.22 | unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein matc... | [more] |
AT3G15095.1 | 7.4e-05 | 24.01 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A0A0L1Z4 | 2.5e-141 | 82.87 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G377750 PE=4 SV=1 | [more] |
A0A5D3D503 | 9.4e-141 | 84.74 | Transcription initiation factor IIE subunit alpha-like OS=Cucumis melo var. maku... | [more] |
A0A1S3B949 | 9.4e-141 | 84.74 | uncharacterized protein LOC103487551 OS=Cucumis melo OX=3656 GN=LOC103487551 PE=... | [more] |
A0A6J1D3C2 | 1.2e-122 | 75.15 | uncharacterized protein LOC111016595 OS=Momordica charantia OX=3673 GN=LOC111016... | [more] |
A0A6J1IQQ3 | 4.0e-99 | 65.73 | uncharacterized protein LOC111477333 OS=Cucurbita maxima OX=3661 GN=LOC111477333... | [more] |