Bhi09G002208 (gene) Wax gourd (B227) v1

Overview
Name: Bhi09G002208
Type: gene
Organism: Benincasa hispida (Wax gourd (B227) v1)
Description: ARID domain-containing protein
Location: chr9: 70845093 .. 70846985 (+)
RNA-Seq Expression: Bhi09G002208
Synteny: Bhi09G002208
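
The Location field above can be unpacked programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for GFF-style annotations, but not stated on this page); the helper names `parse_location` and `span_length` are hypothetical and not part of any database API.

```python
import re


def parse_location(location: str):
    """Split a 'chrom: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", location)
    if m is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def span_length(start: int, end: int) -> int:
    # Assumption: 1-based, inclusive coordinates, so both endpoints count.
    return end - start + 1


chrom, start, end, strand = parse_location("chr9: 70845093 .. 70846985 (+)")
print(chrom, strand, span_length(start, end))  # chr9 + 1893
```

Under that assumption the gene spans 1893 bp on the forward strand of chr9.
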
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CTCCTCATTTCTTTCTCTCTTTTGCACATTCATCCAGCAATTCTATTTTCACAACTTTACACTCCAAATATATACTTTCTATTATTAACATATTAAATTCATACTTAACACCACCTTAAAAATCAAATGGAAACTAGAAAGTAGTAAATATATATACACTTTTTCTAAGATAAAAAAGGCAAGTTGGGAGTATGATTGGATAATGAAACAAACCTCGTGCTCCTTTTCCAACTTTTACCATAAATTTTACATCTAAAACTCCTTTTTTTCTCAAACCCATTTGGACGAAAAGTAAAGGCCAAGGAAGCACAGCCCCCACTGTTCTTCAATTTTGACTCTTTTTTTCTTTTTCCATTTCTAAGGCAGGCAAAAAGGAGTGTTTTGAAGTTTTAACAGATAATAAAGTAAACACTCACACACCCAACAAAAAGTAGTAGGGGGCCAAAACCATAACCAGAAATCTGACCCTCCTTTTTCTTCTTCTTCTTCTCCTCCTTCTGATTCTCTGTAAGGTTTGTCTTTTAAAAACCCCACATCCCAAAAACCCCCTTAATGCTTGAAAAATCTTTTTTTTCTAAGACACCATGAAATTGGGTAGAGAGAATAAAGGAATCCCTTCAGCGGATTTGTTGGTTTGTTTTCCTTCTCGGTCGCATTTGGCTTTAATGCCAAACCCACTTTGTAGTCCAGCGAGAGGGTCCGATTCCAGTAAGTTTCGTTTAAGTCACCGCCATTACCACCGGCGGAGGAAGTCGGCGGAGAGTCCGGTGGTATGGGCCAAAGCGAAGACGATGGGGTCGGAGATATCGGAACCGTCGTCGCCGAAAGTGACATGTGCAGGGCAGATAAAGATCAGGCCGAAGAATAGCAAGAGCTGGCAATCGGTGATGGAGGAGATAGAGAGAATTCATAATAGGAGGAAATTACGGAGGAGGAGGTTTCATTGGGTTGAATCTTTAGGGTTCAAGAAAGATATTATGCAATTCTTGACGTGTTTACGGAACATACGGTTTGATTTTAGGTGTTTCAGAGCTTTCCCAGCAACAGATTTCACCACTGAAGAAGAAGAAGAAGAAGAAGAGGAAGAAGAAGAAAAATCTCAAGGGAATCAGGTGGGTGTTGATGAAAATGAGAGCTCAAGAACTGCATTTTCTAAATGGTTTATGGTTTTACAGGAAAATGGGAGTAATGAGTTAAAGAGAGAGAGCAAAATTCTCTGTAGTGATGATGATGTATCGATTGAGGCAGCAATGGCACCACCCAAAAACGCCCTTTTGCTTATGCGTTGTAGGTCTGCTCCAGCAAAGAGATGGTTGGAAGAAGAATCTGAAGAAGAAGAAGATGATGATGATGATGATGATGAAAAGGAAGAAGTGAAGGTGAAGAAGAGCTTGAAATGGCTAATGGAGGAAGAAAACAGAGAGAGATTGGTTATGGAAACGGGCACTGATTTCTGCAGAATGACATCGGACATTGCAAAAGAGACATGGGTTGTTAGTGAAAAGAGCAGGGATTTGTTTACAAGGAGCCATAGTTGGAAAGTTTGATCACTGGTTATGGAAGACATAAATTTCAGCTTCATCATCCTTTTTTTTTTTTTTTTTTCATGATTTGAATATTTTTGAAAATTCTTGAGTTTGAATCTGAGTAAGATTGGTTTTGGTGGCTTGTTGTACAGTTAATCTATAATTTCGTTGGAGAGAATTTTGTTGTCATGTTCATGTACGAGACACGCTTGTGGTCAAATATAAGATTGGGCTGTTAATCCAAGTGTTCCATAGAAAATGATGAAATGTAATTATAGAAAGGGTTTTCTTTTTCGATCCTTTATTTGTATCTTTCTGTTTTTCTTTTAATTAAAAAGAAAACGTGTTTGATTTTAGCTCTCTTTAA

mRNA sequence

CTCCTCATTTCTTTCTCTCTTTTGCACATTCATCCAGCAATTCTATTTTCACAACTTTACACTCCAAATATATACTTTCTATTATTAACATATTAAATTCATACTTAACACCACCTTAAAAATCAAATGGAAACTAGAAAGTAGTAAATATATATACACTTTTTCTAAGATAAAAAAGGCAAGTTGGGAGTATGATTGGATAATGAAACAAACCTCGTGCTCCTTTTCCAACTTTTACCATAAATTTTACATCTAAAACTCCTTTTTTTCTCAAACCCATTTGGACGAAAAGTAAAGGCCAAGGAAGCACAGCCCCCACTGTTCTTCAATTTTGACTCTTTTTTTCTTTTTCCATTTCTAAGGCAGGCAAAAAGGAGTGTTTTGAAGTTTTAACAGATAATAAAGTAAACACTCACACACCCAACAAAAAGTAGTAGGGGGCCAAAACCATAACCAGAAATCTGACCCTCCTTTTTCTTCTTCTTCTTCTCCTCCTTCTGATTCTCTGTAAGGTTTGTCTTTTAAAAACCCCACATCCCAAAAACCCCCTTAATGCTTGAAAAATCTTTTTTTTCTAAGACACCATGAAATTGGGTAGAGAGAATAAAGGAATCCCTTCAGCGGATTTGTTGGTTTGTTTTCCTTCTCGGTCGCATTTGGCTTTAATGCCAAACCCACTTTGTAGTCCAGCGAGAGGGTCCGATTCCAGTAAGTTTCGTTTAAGTCACCGCCATTACCACCGGCGGAGGAAGTCGGCGGAGAGTCCGGTGGTATGGGCCAAAGCGAAGACGATGGGGTCGGAGATATCGGAACCGTCGTCGCCGAAAGTGACATGTGCAGGGCAGATAAAGATCAGGCCGAAGAATAGCAAGAGCTGGCAATCGGTGATGGAGGAGATAGAGAGAATTCATAATAGGAGGAAATTACGGAGGAGGAGGTTTCATTGGGTTGAATCTTTAGGGTTCAAGAAAGATATTATGCAATTCTTGACGTGTTTACGGAACATACGGTTTGATTTTAGGTGTTTCAGAGCTTTCCCAGCAACAGATTTCACCACTGAAGAAGAAGAAGAAGAAGAAGAGGAAGAAGAAGAAAAATCTCAAGGGAATCAGGTGGGTGTTGATGAAAATGAGAGCTCAAGAACTGCATTTTCTAAATGGTTTATGGTTTTACAGGAAAATGGGAGTAATGAGTTAAAGAGAGAGAGCAAAATTCTCTGTAGTGATGATGATGTATCGATTGAGGCAGCAATGGCACCACCCAAAAACGCCCTTTTGCTTATGCGTTGTAGGTCTGCTCCAGCAAAGAGATGGTTGGAAGAAGAATCTGAAGAAGAAGAAGATGATGATGATGATGATGATGAAAAGGAAGAAGTGAAGGTGAAGAAGAGCTTGAAATGGCTAATGGAGGAAGAAAACAGAGAGAGATTGGTTATGGAAACGGGCACTGATTTCTGCAGAATGACATCGGACATTGCAAAAGAGACATGGGTTGTTAGTGAAAAGAGCAGGGATTTGTTTACAAGGAGCCATAGTTGGAAAGTTTGATCACTGGTTATGGAAGACATAAATTTCAGCTTCATCATCCTTTTTTTTTTTTTTTTTTCATGATTTGAATATTTTTGAAAATTCTTGAGTTTGAATCTGAGTAAGATTGGTTTTGGTGGCTTGTTGTACAGTTAATCTATAATTTCGTTGGAGAGAATTTTGTTGTCATGTTCATGTACGAGACACGCTTGTGGTCAAATATAAGATTGGGCTGTTAATCCAAGTGTTCCATAGAAAATGATGAAATGTAATTATAGAAAGGGTTTTCTTTTTCGATCCTTTATTTGTATCTTTCTGTTTTTCTTTTAATTAAAAAGAAAACGTGTTTGATTTTAGCTCTCTTTAA

Coding sequence (CDS)

ATGAAATTGGGTAGAGAGAATAAAGGAATCCCTTCAGCGGATTTGTTGGTTTGTTTTCCTTCTCGGTCGCATTTGGCTTTAATGCCAAACCCACTTTGTAGTCCAGCGAGAGGGTCCGATTCCAGTAAGTTTCGTTTAAGTCACCGCCATTACCACCGGCGGAGGAAGTCGGCGGAGAGTCCGGTGGTATGGGCCAAAGCGAAGACGATGGGGTCGGAGATATCGGAACCGTCGTCGCCGAAAGTGACATGTGCAGGGCAGATAAAGATCAGGCCGAAGAATAGCAAGAGCTGGCAATCGGTGATGGAGGAGATAGAGAGAATTCATAATAGGAGGAAATTACGGAGGAGGAGGTTTCATTGGGTTGAATCTTTAGGGTTCAAGAAAGATATTATGCAATTCTTGACGTGTTTACGGAACATACGGTTTGATTTTAGGTGTTTCAGAGCTTTCCCAGCAACAGATTTCACCACTGAAGAAGAAGAAGAAGAAGAAGAGGAAGAAGAAGAAAAATCTCAAGGGAATCAGGTGGGTGTTGATGAAAATGAGAGCTCAAGAACTGCATTTTCTAAATGGTTTATGGTTTTACAGGAAAATGGGAGTAATGAGTTAAAGAGAGAGAGCAAAATTCTCTGTAGTGATGATGATGTATCGATTGAGGCAGCAATGGCACCACCCAAAAACGCCCTTTTGCTTATGCGTTGTAGGTCTGCTCCAGCAAAGAGATGGTTGGAAGAAGAATCTGAAGAAGAAGAAGATGATGATGATGATGATGATGAAAAGGAAGAAGTGAAGGTGAAGAAGAGCTTGAAATGGCTAATGGAGGAAGAAAACAGAGAGAGATTGGTTATGGAAACGGGCACTGATTTCTGCAGAATGACATCGGACATTGCAAAAGAGACATGGGTTGTTAGTGAAAAGAGCAGGGATTTGTTTACAAGGAGCCATAGTTGGAAAGTTTGA

Protein sequence

MKLGRENKGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLSHRHYHRRRKSAESPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRFHWVESLGFKKDIMQFLTCLRNIRFDFRCFRAFPATDFTTEEEEEEEEEEEEKSQGNQVGVDENESSRTAFSKWFMVLQENGSNELKRESKILCSDDDVSIEAAMAPPKNALLLMRCRSAPAKRWLEEESEEEEDDDDDDDEKEEVKVKKSLKWLMEEENRERLVMETGTDFCRMTSDIAKETWVVSEKSRDLFTRSHSWKV
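
The relationship between the coding sequence and the protein sequence above can be checked with a small translation routine. This is a sketch assuming the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative, and only the first ten codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
cds_start = "ATGAAATTGGGTAGAGAGAATAAAGGAATC"
print(translate(cds_start))  # MKLGRENKGI
```

The output matches the first ten residues of the protein sequence; passing the full CDS string to `translate` would allow the same comparison over the whole record.
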
Homology
BLAST of Bhi09G002208 vs. TAIR 10
Match: AT1G78110.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 7 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G22230.1); Has 5452 Blast hits to 3541 proteins in 289 species: Archae - 4; Bacteria - 165; Metazoa - 1756; Fungi - 532; Plants - 205; Viruses - 141; Other Eukaryotes - 2649 (source: NCBI BLink). )

HSP 1 Score: 263.8 bits (673), Expect = 1.7e-70
Identity = 170/339 (50.15%), Positives = 224/339 (66.08%), Query Frame = 0

Query: 12  SADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLSHRHYHRRRKSA---------ESPV 71
           SADLLVCFPSR+HLAL P P+CSP+R SDSS  R   R +HRR+ S           SPV
Sbjct: 17  SADLLVCFPSRTHLALTPKPICSPSRPSDSSTNR---RPHHRRQLSKLSGGGGGGHGSPV 76

Query: 72  VWAK---AKTM-GSEISEPSSPKVTCAGQIKIRPK----NSKSWQSVMEEIERIHNRRKL 131
           +WAK   +K M G EI+EP+SPKVTCAGQIK+RP       K+WQSVMEEIERIH+ R  
Sbjct: 77  LWAKQASSKNMGGDEIAEPTSPKVTCAGQIKVRPSKCGGRGKNWQSVMEEIERIHDNRSQ 136

Query: 132 RRRRFHWVESLGFKKDIMQFLTCLRNIRFDFRCFRAFPATDFTTEEEEEEEEEEEEKSQG 191
            +         G KKD+M FLTCLRNI+FDFRCF  F   D T++++EEE+++++E+ + 
Sbjct: 137 SK-------FFGLKKDVMGFLTCLRNIKFDFRCFGDFRHADVTSDDDEEEDDDDDEEEE- 196

Query: 192 NQVGVDENESSRTAFSKWFMVLQENGSNELKRESKILCSD----DDVSIEAAMAPPKNAL 251
             V  +E E+S+T FSKWFMVLQE  +N+   ++   C +    +D   E A+ PP NAL
Sbjct: 197 -VVEGEEEENSKTVFSKWFMVLQEEQNNKDDDKNNNKCDEKRDLEDTETEPAV-PPPNAL 256

Query: 252 LLMRCRSAPAKRWLE---------EESEEEEDDDDDDDEKEEVKV-KKSLKWLMEEENRE 311
           LLMRCRSAPAK WLE         E+ EE++++ + +D++  +K  KK L+ LMEEE  E
Sbjct: 257 LLMRCRSAPAKSWLEERMKVKTEQEKREEQKEEKETEDQETSMKTKKKDLRSLMEEEKME 316

Query: 312 RLVMETGTDFCRMTSDIAKETWVVSEKSRDLFTRSHSWK 320
            ++M   T+F R++SDIAKETWVV    +D  +RS SWK
Sbjct: 317 LVLMRYDTEFYRLSSDIAKETWVVG-GIQDPLSRSRSWK 341

BLAST of Bhi09G002208 vs. TAIR 10
Match: AT1G22230.1 (unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G78110.1); Has 2358 Blast hits to 1759 proteins in 159 species: Archae - 2; Bacteria - 36; Metazoa - 1046; Fungi - 203; Plants - 157; Viruses - 72; Other Eukaryotes - 842 (source: NCBI BLink). )

HSP 1 Score: 211.8 bits (538), Expect = 7.8e-55
Identity = 153/324 (47.22%), Positives = 196/324 (60.49%), Query Frame = 0

Query: 12  SADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLSHRHYHRRRKSAESPVVWAKAKTMG 71
           SADL+VCFPSR+HL+L    + SP+    SS  R  +  +HRR  S  S       +  G
Sbjct: 13  SADLMVCFPSRAHLSLPSKSISSPS----SSFNRRQNAPHHRRSISKLSSSGGGVRQNRG 72

Query: 72  ---SEISEPSSPKVTCAGQIKIRPK----NSKSWQSVMEEIERIHNRRKLRRRRFHWVES 131
                + EP+SPKVTCAGQIK+R        K+WQS+M EIE+IH R K   + F     
Sbjct: 73  GGREVVEEPTSPKVTCAGQIKVRSSKRDGGGKNWQSLMAEIEKIH-RSKSESKFF----- 132

Query: 132 LGFKKDIMQFLTCLRNIRFDFRCFRAFPATDFTTEEEEEEEEEEEEKSQGNQVGVDENES 191
            G K+D+M FLTCLR+  FDFRCF AFP  D  +++EEE+EEEEEE  +      DE+ES
Sbjct: 133 -GIKRDVMGFLTCLRD--FDFRCFGAFPPVDIISDDEEEDEEEEEEDEE-----EDEDES 192

Query: 192 SRTAFSKWFMVLQENGSNELKRESKILCSDDDVSIEAAMAPPKNALLLMRCRSAPAKRWL 251
           S T FSKW MVL E  +NE   + K     D   +E A+ PP NALLLMRCRSAP K W 
Sbjct: 193 SGTVFSKWLMVLHEKQNNEECVDGKENVFSD---VETAV-PPPNALLLMRCRSAPVKNWS 252

Query: 252 EEESEEEEDDD--------DDDDEKEEVKVKKSLKWLMEEENRERL-VMETGTDFCRMTS 311
           EE+ EE E+ D        ++++EK+ V  KK L+ LMEEE +  L VM   T++ ++++
Sbjct: 253 EEKKEETEEGDNRVKQSGEEEEEEKDRVGNKKDLRSLMEEEKKMNLVVMNYDTNYYKLSN 312

Query: 312 DIAKETWVVSEKSRDLFTRSHSWK 320
           DIAKETWVV      LF RS SWK
Sbjct: 313 DIAKETWVVGGIQDPLF-RSRSWK 313

BLAST of Bhi09G002208 vs. TAIR 10
Match: AT3G15095.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; Has 9762 Blast hits to 6439 proteins in 764 species: Archae - 77; Bacteria - 1339; Metazoa - 3211; Fungi - 718; Plants - 437; Viruses - 131; Other Eukaryotes - 3849 (source: NCBI BLink). )

HSP 1 Score: 45.8 bits (107), Expect = 7.4e-05
Identity = 91/379 (24.01%), Positives = 141/379 (37.20%), Query Frame = 0

Query: 7   NKGIPSADLLVCFPSR----SHLALMPNPLCSPARG-----SDSSKFRLSHRHYHRRRKS 66
           N    S DL +CF SR    S + L    + SPAR      S S + R S    +     
Sbjct: 19  NNSGSSTDLFICFTSRFSSSSSMRLSSKSIHSPARSACLTTSLSRRLRTSGSLKNASAGV 78

Query: 67  AESPVVWA----KAKTMGSEIS--------EPSSPKVTCAGQIKIRPKNSKSWQSVMEEI 126
             SP+  A    K    G E S        EPSSPKVTC GQ++++              
Sbjct: 79  LNSPMFGANGGRKRSGSGYENSNNNNNNNIEPSSPKVTCIGQVRVK-------------- 138

Query: 127 ERIHNRRKLRRRRFHWVESLGFKKDIMQ----------------------FLTCLRNIRF 186
            R H ++K+R R         F++ + Q                          LR+   
Sbjct: 139 TRKHVKKKMRARSRRKGGENSFRRSVDQNDGGGGCRFKASENRLVHLPVTICESLRSFGS 198

Query: 187 DFRCFRAFPATDFTTEEEEEEEEEEEEKSQGNQVGVDENESSRTAFSKWFMVLQENGSNE 246
           +  CF  FP     TE    +    E  + G   G   + S    F++WF+ ++E    +
Sbjct: 199 ELNCF--FPCRSSCTENSHGDGRRAESNNDGCGGGGGGSNSCGAVFTRWFVAVEETSGGK 258

Query: 247 LKRESKILCSDDDVSIE----------------------------------AAMAPPKNA 306
            +    ++  +D+V  +                                  +  +PPKNA
Sbjct: 259 RREIELVVGGEDEVEEDRRRSRRRHVFEGLDLSEIEMKTEKKERGEEVGRMSICSPPKNA 318

Query: 307 LLLMRCRSAPAK-RWLEEESEEEEDDDDDDDEKEEVKVKKSLKWLMEEENRERLVMETGT 308
           LLLMRCRS P K   L     E +   +D    EE + ++  ++ +E E+++R+      
Sbjct: 319 LLLMRCRSDPVKVAALANRVRERQLSLNDGVYTEEEEDERRRRFELEIEDKKRI------ 373

BLAST of Bhi09G002208 vs. ExPASy TrEMBL
Match: A0A0A0L1Z4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G377750 PE=4 SV=1)

HSP 1 Score: 511.5 bits (1316), Expect = 2.5e-141
Identity = 266/321 (82.87%), Positives = 288/321 (89.72%), Query Frame = 0

Query: 1   MKLGRE-NKGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLSHRHYHRRRKSAE 60
           MKL RE +KGIPS+DLLVCFPSRSHLALMPNPLCSPARGSDSSKFRL +R YHRRRKSAE
Sbjct: 1   MKLNREKSKGIPSSDLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLDYRRYHRRRKSAE 60

Query: 61  SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 120
           SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF
Sbjct: 61  SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 120

Query: 121 HWVESLGFKKDIMQFLTCLRNIRFDFRCFRAFPATDFTTEEEEEEEEEEEEKSQGNQVGV 180
           +W+ES GFKKDIMQFLTCLR +RFDFRCFRAFP TDFTTEEEEEEEEEEEE+ + NQVG+
Sbjct: 121 NWIESFGFKKDIMQFLTCLRTMRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEKNQVGI 180

Query: 181 DENESSRTAFSKWFMVLQENGSNELKRESKILCSDDDVSIEAAMAPPKNALLLMRCRSAP 240
           +ENESSRTAFSKWFMVLQENGSNELKR+S   C +DD SIEA MAPP+NALLLMRC+SAP
Sbjct: 181 EENESSRTAFSKWFMVLQENGSNELKRDSNSRCYEDDESIEATMAPPRNALLLMRCKSAP 240

Query: 241 AKRWLEEESEEEEDDDDDDDEKEEVKVKKSLKWLMEEENRERLVMETGTDFCRMTSDIAK 300
           A+RW+EEESEEE+D+ + + EKE+VKVKKSLKWLMEEENRER+VME GTDFCRM SD AK
Sbjct: 241 ARRWMEEESEEEDDEKEKEKEKEKVKVKKSLKWLMEEENRERVVMEMGTDFCRMISDNAK 300

Query: 301 ETWVVSEKSRDLFTRSHSWKV 321
           E           FTRS SWKV
Sbjct: 301 E-----------FTRSQSWKV 310

BLAST of Bhi09G002208 vs. ExPASy TrEMBL
Match: A0A5D3D503 (Transcription initiation factor IIE subunit alpha-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold119G001490 PE=4 SV=1)

HSP 1 Score: 509.6 bits (1311), Expect = 9.4e-141
Identity = 272/321 (84.74%), Positives = 285/321 (88.79%), Query Frame = 0

Query: 1   MKLGRE-NKGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLSHRHYHRRRKSAE 60
           MKL RE +KGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRL HR +HRRRKSAE
Sbjct: 8   MKLNREKSKGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLDHRRFHRRRKSAE 67

Query: 61  SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 120
           SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF
Sbjct: 68  SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 127

Query: 121 HWVESLGFKKDIMQFLTCLRNIRFDFRCFRAFPATDFTTEEEEEEEEEEEEKSQGNQVGV 180
            WVES GFKKDIMQFLTCLR IRFDFRCFRAFP TDFTTEEEEEEEEEEE++   NQVG+
Sbjct: 128 RWVESFGFKKDIMQFLTCLRTIRFDFRCFRAFPETDFTTEEEEEEEEEEEDEK--NQVGI 187

Query: 181 DENESSRTAFSKWFMVLQENGSNELKRESKILCSDDDVSIEAAMAPPKNALLLMRCRSAP 240
           +ENESSRTAFSKWFMVLQENGSNELKR+SK LC++DD SIEA MAPP NALLLMRCRSAP
Sbjct: 188 EENESSRTAFSKWFMVLQENGSNELKRDSKSLCNEDDESIEAIMAPPINALLLMRCRSAP 247

Query: 241 AKRWLEEESEEEEDDDDDDDEKEEVKVKKSLKWLMEEENRERLVMETGTDFCRMTSDIAK 300
           A+RW+EEESEE       DDEKE+VKVKKSLKWLMEEENRERLV+E GTDFCRMTSD AK
Sbjct: 248 ARRWMEEESEE------GDDEKEKVKVKKSLKWLMEEENRERLVVEMGTDFCRMTSDNAK 307

Query: 301 ETWVVSEKSRDLFTRSHSWKV 321
           E           FTRS SWKV
Sbjct: 308 E-----------FTRSQSWKV 309

BLAST of Bhi09G002208 vs. ExPASy TrEMBL
Match: A0A1S3B949 (uncharacterized protein LOC103487551 OS=Cucumis melo OX=3656 GN=LOC103487551 PE=4 SV=1)

HSP 1 Score: 509.6 bits (1311), Expect = 9.4e-141
Identity = 272/321 (84.74%), Positives = 285/321 (88.79%), Query Frame = 0

Query: 1   MKLGRE-NKGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLSHRHYHRRRKSAE 60
           MKL RE +KGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRL HR +HRRRKSAE
Sbjct: 8   MKLNREKSKGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLDHRRFHRRRKSAE 67

Query: 61  SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 120
           SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF
Sbjct: 68  SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 127

Query: 121 HWVESLGFKKDIMQFLTCLRNIRFDFRCFRAFPATDFTTEEEEEEEEEEEEKSQGNQVGV 180
            WVES GFKKDIMQFLTCLR IRFDFRCFRAFP TDFTTEEEEEEEEEEE++   NQVG+
Sbjct: 128 RWVESFGFKKDIMQFLTCLRTIRFDFRCFRAFPETDFTTEEEEEEEEEEEDEK--NQVGI 187

Query: 181 DENESSRTAFSKWFMVLQENGSNELKRESKILCSDDDVSIEAAMAPPKNALLLMRCRSAP 240
           +ENESSRTAFSKWFMVLQENGSNELKR+SK LC++DD SIEA MAPP NALLLMRCRSAP
Sbjct: 188 EENESSRTAFSKWFMVLQENGSNELKRDSKSLCNEDDESIEAIMAPPINALLLMRCRSAP 247

Query: 241 AKRWLEEESEEEEDDDDDDDEKEEVKVKKSLKWLMEEENRERLVMETGTDFCRMTSDIAK 300
           A+RW+EEESEE       DDEKE+VKVKKSLKWLMEEENRERLV+E GTDFCRMTSD AK
Sbjct: 248 ARRWMEEESEE------GDDEKEKVKVKKSLKWLMEEENRERLVVEMGTDFCRMTSDNAK 307

Query: 301 ETWVVSEKSRDLFTRSHSWKV 321
           E           FTRS SWKV
Sbjct: 308 E-----------FTRSQSWKV 309

BLAST of Bhi09G002208 vs. ExPASy TrEMBL
Match: A0A6J1D3C2 (uncharacterized protein LOC111016595 OS=Momordica charantia OX=3673 GN=LOC111016595 PE=4 SV=1)

HSP 1 Score: 449.5 bits (1155), Expect = 1.2e-122
Identity = 251/334 (75.15%), Positives = 276/334 (82.63%), Query Frame = 0

Query: 1   MKLGRENKGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLSHRHYHRRRK--SA 60
           MKLGR+ K I SADLLVCFPSRS+L LMP PLCSPARG DS+K R SHRH+HRRRK  SA
Sbjct: 1   MKLGRDAKAIHSADLLVCFPSRSNLTLMPKPLCSPARGLDSNKLRRSHRHHHRRRKSTSA 60

Query: 61  ESPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPK--NSKSWQSVMEEIERIHNRRKLRR 120
            SP++WAK KTMGSEISEPSSPKVTCAGQIKIRPK  + KSWQSVMEEIERIHNRRKLRR
Sbjct: 61  ASPLIWAKPKTMGSEISEPSSPKVTCAGQIKIRPKTGSCKSWQSVMEEIERIHNRRKLRR 120

Query: 121 RRFHWVESLGFKKDIMQFLTCLRNIRFDFRCFRAFPATDFTTEEEEEEEEEEEE-KSQGN 180
           RR +WVESLGFKKDIMQFLTCLRNIRFDFRCF+AFP  DFTTEEE+EEEEEEEE KSQ N
Sbjct: 121 RRSNWVESLGFKKDIMQFLTCLRNIRFDFRCFKAFPEADFTTEEEDEEEEEEEEGKSQEN 180

Query: 181 QVGVDENESSRTAFSKWFMVLQENG-SNELKRESKILCSDDDVSIEAAMAPPKNALLLMR 240
           QVGV+ NESSRTAFSKWFMVLQE+G SN + RES              +APPKNALLLMR
Sbjct: 181 QVGVEGNESSRTAFSKWFMVLQESGASNGICRESN----------GPPLAPPKNALLLMR 240

Query: 241 CRSAPAKRWLEEESEEEEDDDDDDDEKE---------EVKVKKSLKWLMEEENRERLVME 300
           CRSAPAK W EEE EEEE+++++++E+E         EVKVKKSLKWLMEEENRERLVME
Sbjct: 241 CRSAPAKSWQEEEEEEEEEEEEEEEEEEEAAAEEDEKEVKVKKSLKWLMEEENRERLVME 300

Query: 301 TGTDFCRMTSDIAKETWVVSEKSRDLFTRSHSWK 320
            G DFCRM+S+IAKETWV     RDLF+RS SWK
Sbjct: 301 MGPDFCRMSSEIAKETWV----GRDLFSRSRSWK 320

BLAST of Bhi09G002208 vs. ExPASy TrEMBL
Match: A0A6J1IQQ3 (uncharacterized protein LOC111477333 OS=Cucurbita maxima OX=3661 GN=LOC111477333 PE=4 SV=1)

HSP 1 Score: 371.3 bits (952), Expect = 4.0e-99
Identity = 211/321 (65.73%), Positives = 235/321 (73.21%), Query Frame = 0

Query: 1   MKLGRENKGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLSHRHYHRRRKSAES 60
           MKL R+ K IPS DLLVCFPSRSH ALMPNPLCSP R SDS+K     R YHRRRKSAES
Sbjct: 1   MKLIRDIKAIPSPDLLVCFPSRSHFALMPNPLCSPVRASDSNKL----RRYHRRRKSAES 60

Query: 61  PVVWAKAKTM-GSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 120
           PVVWAKAKT+ GSE+SEPSSPKVTCAGQIK+R K+ KSW+SVMEEIERIHNRR+LRRRRF
Sbjct: 61  PVVWAKAKTIGGSEVSEPSSPKVTCAGQIKMRRKSRKSWESVMEEIERIHNRRELRRRRF 120

Query: 121 HWVESLGFKKDIMQFLTCLRNIRFDFRCFRAFPATDFTTEEEEEEEEEEEEKSQGNQVGV 180
           +WVESLGFKKDIMQFLTCLR+IRFDF CF AFP  +FT+E+EEEEE           VGV
Sbjct: 121 NWVESLGFKKDIMQFLTCLRSIRFDFGCFGAFPEAEFTSEDEEEEE-----------VGV 180

Query: 181 DENESSRTAFSKWFMVLQENGSNELKRESKILCSDDDVSIEAAMAPPKNALLLMRCRSAP 240
           + ++ SRTAFSKWFMVLQ +G   ++R+   LC+ DD SI   MAPP+NALLLMRCRSAP
Sbjct: 181 EGSDGSRTAFSKWFMVLQGSG---VRRDGNGLCTVDDASIGPPMAPPRNALLLMRCRSAP 240

Query: 241 AKRWLEEESEEEEDDDDDDDEKEEVKVKKSLKWLMEEENRERLVMETGTDFCRMTSDIAK 300
           AK W+EE   EEE+D        EVKVKKSLKWLMEEENRE                   
Sbjct: 241 AKSWVEEACSEEEED-------TEVKVKKSLKWLMEEENRE------------------- 269

Query: 301 ETWVVSEKSRDLFTRSHSWKV 321
                   SRDL TRS SWKV
Sbjct: 301 --------SRDLVTRSRSWKV 269

The following BLAST results are available for this feature:
Match Name    E-value    Identity (%)  Description
AT1G78110.1   1.7e-70    50.15         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G22230.1   7.8e-55    47.22         unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein matc...
AT3G15095.1   7.4e-05    24.01         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...

Match Name    E-value    Identity (%)  Description
A0A0A0L1Z4    2.5e-141   82.87         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G377750 PE=4 SV=1
A0A5D3D503    9.4e-141   84.74         Transcription initiation factor IIE subunit alpha-like OS=Cucumis melo var. maku...
A0A1S3B949    9.4e-141   84.74         uncharacterized protein LOC103487551 OS=Cucumis melo OX=3656 GN=LOC103487551 PE=...
A0A6J1D3C2    1.2e-122   75.15         uncharacterized protein LOC111016595 OS=Momordica charantia OX=3673 GN=LOC111016...
A0A6J1IQQ3    4.0e-99    65.73         uncharacterized protein LOC111477333 OS=Cucurbita maxima OX=3661 GN=LOC111477333...
InterPro
Analysis Name: InterPro Annotations of Wax gourd (B227) v1
Date Performed: 2021-10-22
IPR Term  IPR Description   Source       Source Term    Source Description                  Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                 coord: 157..171
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                 coord: 156..183
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                 coord: 242..264
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                 coord: 246..260
None      No IPR available  PANTHER      PTHR33448      CHLOROPLAST PROTEIN HCF243-RELATED  coord: 4..319
None      No IPR available  PANTHER      PTHR33448:SF3  OS09G0370000 PROTEIN                coord: 4..319
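
Several of the mobidb-lite disorder predictions above overlap. One way to summarize them is to merge overlapping ranges into distinct regions. The sketch below assumes the `coord` values are 1-based, inclusive residue positions and that overlapping predictions may be treated as a single region; `merge_intervals` is an illustrative helper, not part of InterProScan.

```python
def merge_intervals(intervals):
    """Merge overlapping (start, end) ranges into a sorted, non-overlapping list."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# mobidb-lite coordinates from the table above.
disorder = [(157, 171), (156, 183), (242, 264), (246, 260)]
regions = merge_intervals(disorder)
print(regions)  # [(156, 183), (242, 264)]

# Residues covered, counting both endpoints of each region.
print(sum(end - start + 1 for start, end in regions))  # 51
```

Under these assumptions the four predictions collapse into two regions, 156..183 and 242..264.
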

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Bhi09M002208  Bhi09M002208  mRNA