Bhi04G001516 (gene) Wax gourd (B227) v1

Overview
Name: Bhi04G001516
Type: gene
Organism: Benincasa hispida (Wax gourd (B227) v1)
Description: CCHC-type domain-containing protein
Location: chr4: 48277313 .. 48278092 (-)
RNA-Seq Expression: Bhi04G001516
Synteny: Bhi04G001516
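The location string above can be checked with a short Python sketch. It assumes the coordinates are 1-based and inclusive (the usual convention for this kind of record, but not stated on the page), and the helper names `parse_location` and `reverse_complement` are hypothetical. Under that assumption the span is 780 bp, and because the feature lies on the minus strand, the gene sequence reads as the reverse complement of the reference strand.

```python
import re

# Hypothetical helper: parse a location string such as
# "chr4: 48277313 .. 48278092 (-)". Assumes 1-based inclusive coordinates.
LOCATION_RE = re.compile(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")

def parse_location(text):
    match = LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "seqid": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        "length": end - start + 1,  # inclusive span
    }

def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.translate(str.maketrans("ACGTacgt", "TGCAtgca"))[::-1]

loc = parse_location("chr4: 48277313 .. 48278092 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

The computed length can be compared with the length of the gene sequence listed under Sequences.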
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATGCCAATGGTTTGGAATTTCCAAGAAGAAGTCACAATCCAAAACGTCGATTGCAACATCTTCTTATGTATATTCAAAGCACCAAATGATAAGCAAACGGTTCTAGAAAACGGGCCATGGTTCTTTGACAAAGAGCTGCTGGTTTTCGAACACCCACCAAGAGGAACCTGCAAGTTCATAGACCTGGACTTCCAATACGTCTCTTTCTGGGTTCATTTTCACCACCTTCCAATGGCTTGCTTCTCGAGGAAAATGGCCATAGTATTTGGGAATATGGTTGGGAGGTTCGAAACGGTTGAAACCGATGAGGAAGGAATGTGTTGGGGAAGAACTATGAGAGTTAGAGTGAGTGTTGATGTGAAAAAGCCTTTGAGAAGAGTGGTGAAGTTGAAAGTTGGATCTATGGGGGAAGAAATTTTGATACCTATCACTTATGAAAAGCTCATGGATTTCTGCTATCAATGTGGAAAGCTTGGGCACGTCCTCGACGCCTGCAATTCTTCTTCTGATAGTGAAGAGAAAGACGATTTATGCTATAAATGTGGGAAGGTAGGACTCGTCCTTCAAGATTGCAATTCGCGTTTTGGCACTGGAGAGGAAGACTTACAGTATGGAGACTGGCCTAGAGGGATGAGAAGAGAAGGGGATGAGAAGTTTGAAGGGATAGAGTTCGGAAAGGGGGAGGGCTGGGAAGAGGACTACAATGACAGTGATAATCAGGAACTTCAGGTGGTCAGAATCAGCCCCGAATGGTTAGCGGGGCTCTCCGGAATGTGA

mRNA sequence

ATGATGCCAATGGTTTGGAATTTCCAAGAAGAAGTCACAATCCAAAACGTCGATTGCAACATCTTCTTATGTATATTCAAAGCACCAAATGATAAGCAAACGGTTCTAGAAAACGGGCCATGGTTCTTTGACAAAGAGCTGCTGGTTTTCGAACACCCACCAAGAGGAACCTGCAAGTTCATAGACCTGGACTTCCAATACGTCTCTTTCTGGGTTCATTTTCACCACCTTCCAATGGCTTGCTTCTCGAGGAAAATGGCCATAGTATTTGGGAATATGGTTGGGAGGTTCGAAACGGTTGAAACCGATGAGGAAGGAATGTGTTGGGGAAGAACTATGAGAGTTAGAGTGAGTGTTGATGTGAAAAAGCCTTTGAGAAGAGTGGTGAAGTTGAAAGTTGGATCTATGGGGGAAGAAATTTTGATACCTATCACTTATGAAAAGCTCATGGATTTCTGCTATCAATGTGGAAAGCTTGGGCACGTCCTCGACGCCTGCAATTCTTCTTCTGATAGTGAAGAGAAAGACGATTTATGCTATAAATGTGGGAAGGTAGGACTCGTCCTTCAAGATTGCAATTCGCGTTTTGGCACTGGAGAGGAAGACTTACAGTATGGAGACTGGCCTAGAGGGATGAGAAGAGAAGGGGATGAGAAGTTTGAAGGGATAGAGTTCGGAAAGGGGGAGGGCTGGGAAGAGGACTACAATGACAGTGATAATCAGGAACTTCAGGTGGTCAGAATCAGCCCCGAATGGTTAGCGGGGCTCTCCGGAATGTGA

Coding sequence (CDS)

ATGATGCCAATGGTTTGGAATTTCCAAGAAGAAGTCACAATCCAAAACGTCGATTGCAACATCTTCTTATGTATATTCAAAGCACCAAATGATAAGCAAACGGTTCTAGAAAACGGGCCATGGTTCTTTGACAAAGAGCTGCTGGTTTTCGAACACCCACCAAGAGGAACCTGCAAGTTCATAGACCTGGACTTCCAATACGTCTCTTTCTGGGTTCATTTTCACCACCTTCCAATGGCTTGCTTCTCGAGGAAAATGGCCATAGTATTTGGGAATATGGTTGGGAGGTTCGAAACGGTTGAAACCGATGAGGAAGGAATGTGTTGGGGAAGAACTATGAGAGTTAGAGTGAGTGTTGATGTGAAAAAGCCTTTGAGAAGAGTGGTGAAGTTGAAAGTTGGATCTATGGGGGAAGAAATTTTGATACCTATCACTTATGAAAAGCTCATGGATTTCTGCTATCAATGTGGAAAGCTTGGGCACGTCCTCGACGCCTGCAATTCTTCTTCTGATAGTGAAGAGAAAGACGATTTATGCTATAAATGTGGGAAGGTAGGACTCGTCCTTCAAGATTGCAATTCGCGTTTTGGCACTGGAGAGGAAGACTTACAGTATGGAGACTGGCCTAGAGGGATGAGAAGAGAAGGGGATGAGAAGTTTGAAGGGATAGAGTTCGGAAAGGGGGAGGGCTGGGAAGAGGACTACAATGACAGTGATAATCAGGAACTTCAGGTGGTCAGAATCAGCCCCGAATGGTTAGCGGGGCTCTCCGGAATGTGA

Protein sequence

MMPMVWNFQEEVTIQNVDCNIFLCIFKAPNDKQTVLENGPWFFDKELLVFEHPPRGTCKFIDLDFQYVSFWVHFHHLPMACFSRKMAIVFGNMVGRFETVETDEEGMCWGRTMRVRVSVDVKKPLRRVVKLKVGSMGEEILIPITYEKLMDFCYQCGKLGHVLDACNSSSDSEEKDDLCYKCGKVGLVLQDCNSRFGTGEEDLQYGDWPRGMRREGDEKFEGIEFGKGEGWEEDYNDSDNQELQVVRISPEWLAGLSGM
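The relationship between the coding sequence and the protein sequence can be sketched in Python as follows. This is a minimal illustration that assumes the standard genetic code (NCBI translation table 1); the function name `translate` is hypothetical and only the first and last few codons of the CDS are used as examples.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds, to_stop=True):
    """Translate a DNA coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon ends the polypeptide
        protein.append(aa)
    return "".join(protein)

# First ten codons and last three codons of the CDS listed above.
print(translate("ATGATGCCAATGGTTTGGAATTTCCAAGAA"))
print(translate("GGAATGTGA"))
```

Applied to the full 780-nt CDS, the same function should give the 259-residue protein followed by the TGA stop codon.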
Homology
BLAST of Bhi04G001516 vs. TAIR 10
Match: AT3G31430.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G18636.1); Has 295 Blast hits to 291 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 295; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 66.2 bits (160), Expect = 4.3e-11
Identity = 48/177 (27.12%), Positives = 77/177 (43.50%), Query Frame = 0

Query: 2   MPMVWNFQEEVTIQNVDCNIFLCIFKAPNDKQTVLENGPWFFDKELLVFEH-PPRGTCKF 61
           MP +W     V  + ++   F  IF      +TVL  GPW F+  +++ +   P+     
Sbjct: 153 MPRIWGQSGLVHGRIMEGRQFHFIFTLEESLETVLRRGPWAFNDWMILLQRWEPQ----- 212

Query: 62  IDLDFQYVSFWVHFHHLPMACFSRKMAIVFGNMVGRFETVETDEEGMCWGRTMRVRVSVD 121
           I L F ++ FWV    +P    +R +    G  +G+    + + E +      RV +  D
Sbjct: 213 IPL-FPFIPFWVQIRGIPFQFLNRGVVEHIGRALGQVLDTDFNVEVVARMDFARVLLHWD 272

Query: 122 VKKPLRRVVKLKVGSMGEEILIPITYEKLMDFCYQCGKLGHVLDACNSSSDSEEKDD 178
           +  PLR     +  + G   L+   YE+L  FC  CG L H   AC   +  EE+ D
Sbjct: 273 ITHPLRFQRHFQF-TAGVNTLLRFRYERLRGFCEVCGMLTHDFGACLIQNGGEEQAD 322

BLAST of Bhi04G001516 vs. TAIR 10
Match: AT3G42140.1 (zinc ion binding;nucleic acid binding )

HSP 1 Score: 57.0 bits (136), Expect = 2.6e-08
Identity = 39/178 (21.91%), Positives = 73/178 (41.01%), Query Frame = 0

Query: 7   NFQEEVTIQNVDCNIFLCIFKAPNDKQTVLENGPWFFDKELLVFEHPPRGTCKFIDLDFQ 66
           N  EEV  + ++ +    +F++     ++L  GPW F+  + V +   R T    D +F+
Sbjct: 47  NVDEEVVGRILEIHKIEFLFQSEESMFSILRRGPWSFNDWMCVIQ---RWTKLHSDAEFK 106

Query: 67  YVSFWVHFHHLPMACFSRKMAIVFGNMVGRFETVETDEEGMCWGRTMRVRVSVDVKKPLR 126
            + FW+    +P+   + ++    G  +G F  +ET                        
Sbjct: 107 RIPFWIQIRGIPLRFLTARIITSIGERMGLF--LET------------------------ 166

Query: 127 RVVKLKVGSMGEEI-LIPITYEKLMDFCYQCGKLGHVLDACNSS------SDSEEKDD 178
                   ++G ++ ++   YEKL +FC  CG L H    C +S      +D ++ DD
Sbjct: 167 --------NLGRDVSVLKFQYEKLKNFCTTCGMLSHDASECPTSGNQGPHADDDDDDD 187

BLAST of Bhi04G001516 vs. TAIR 10
Match: AT3G42860.1 (zinc knuckle (CCHC-type) family protein )

HSP 1 Score: 42.7 bits (99), Expect = 5.1e-04
Identity = 30/98 (30.61%), Positives = 40/98 (40.82%), Query Frame = 0

Query: 153 CYQCGKLGHVLDACNSSSD-----SEEKDDLCYKCGKVGLVLQDCNSRFGTGEEDLQYGD 212
           CY+CGK GH    C   SD     S      C+KCGK G   +DC ++ G  +       
Sbjct: 239 CYKCGKEGHWARDCTVQSDTGPVKSTSAAGDCFKCGKPGHWSRDCTAQSGNPK------- 298

Query: 213 WPRGMRREGDEKFEGIEFGKGEGWEED-YNDSDNQELQ 245
           +  G  +      E  + GK   W  D    S NQ+ Q
Sbjct: 299 YEPGQMKSSSSSGECYKCGKQGHWSRDCTGQSSNQQFQ 329

BLAST of Bhi04G001516 vs. ExPASy Swiss-Prot
Match: Q04832 (DNA-binding protein HEXBP OS=Leishmania major OX=5664 GN=HEXBP PE=4 SV=1)

HSP 1 Score: 48.1 bits (113), Expect = 1.7e-04
Identity = 18/46 (39.13%), Positives = 27/46 (58.70%), Query Frame = 0

Query: 153 CYQCGKLGHVLDACNSSSDSEEKDDLCYKCGKVGLVLQDCNSRFGT 199
           CY+CG+ GH+   C S+  +   D  CYKCGK G + ++C    G+
Sbjct: 198 CYKCGESGHMSRECPSAGSTGSGDRACYKCGKPGHISRECPEAGGS 243

BLAST of Bhi04G001516 vs. ExPASy Swiss-Prot
Match: P36627 (Cellular nucleic acid-binding protein homolog OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=byr3 PE=4 SV=1)

HSP 1 Score: 46.6 bits (109), Expect = 5.0e-04
Identity = 18/42 (42.86%), Positives = 27/42 (64.29%), Query Frame = 0

Query: 153 CYQCGKLGHVLDACNSSSDSEEKDDLCYKCGKVGLVLQDCNS 195
           CY CG  GH++  C SS +  +  + CYKCG+VG + +DC +
Sbjct: 60  CYACGTAGHLVRDCPSSPNPRQGAE-CYKCGRVGHIARDCRT 100

BLAST of Bhi04G001516 vs. ExPASy TrEMBL
Match: A0A6J1DU55 (uncharacterized protein LOC111023135 OS=Momordica charantia OX=3673 GN=LOC111023135 PE=4 SV=1)

HSP 1 Score: 147.5 bits (371), Expect = 7.6e-32
Identity = 64/165 (38.79%), Positives = 101/165 (61.21%), Query Frame = 0

Query: 4   MVWNFQEEVTIQNVDCNIFLCIFKAPNDKQTVLENGPWFFDKELLVFEHPPRGTCKFIDL 63
           + W  + ++T++++  N+FL  F    D   V++ GPWFFDK L+V +  P  +    +L
Sbjct: 63  LAWKVEHQLTVESIGKNLFLFHFCRECDMNRVMKTGPWFFDKALIVLQ-KPCSSKNISEL 122

Query: 64  DFQYVSFWVHFHHLPMACFSRKMAIVFGNMVGRFETVETDEEGMCWGRTMRVRVSVDVKK 123
           +F  V+FW+H   LPM+  ++ MAI  GN +G F  V+ +E+G  WG ++R+RV +D+ K
Sbjct: 123 EFNRVAFWIHLFDLPMSWLNKTMAIRLGNAIGNFVDVDCNEKGFSWGASLRIRVLIDITK 182

Query: 124 PLRRVVKLKVGSMGEEILIPITYEKLMDFCYQCGKLGHVLDACNS 169
           PLRR +K+ +        IPI YE+L DFCY CG +GH    C++
Sbjct: 183 PLRRGIKINIDGPMGGCWIPIQYERLPDFCYFCGVIGHSSHDCDA 226

BLAST of Bhi04G001516 vs. ExPASy TrEMBL
Match: A0A5C7H9Y2 (CCHC-type domain-containing protein OS=Acer yangbiense OX=1000413 GN=EZV62_019269 PE=4 SV=1)

HSP 1 Score: 146.7 bits (369), Expect = 1.3e-31
Identity = 82/251 (32.67%), Positives = 130/251 (51.79%), Query Frame = 0

Query: 5   VWNFQEEVTIQNVDCNIFLCIFKAPNDKQTVLENGPWFFDKELLVFEHPPRGTCKFIDLD 64
           +W  + EVT++ +  NIF   F+   D++ +LE GPW FDK+LLV      G+ K  DL 
Sbjct: 63  IWQTKNEVTMELMGINIFKFRFQNYWDRKRILEGGPWLFDKQLLVLRE-ASGSEKVTDLQ 122

Query: 65  FQYVSFWVHFHHLPMACFSRKMAIVFGNMVGRFETVETDEEGMCWGRTMRVRVSVDVKKP 124
           F+YV FW+  H+LP+AC +R++ +  G +VG+ + ++  E G C G+ +R+RV +DV  P
Sbjct: 123 FRYVPFWIQLHNLPLACLNREIGLHLGGLVGQVKEIDAGESGECVGQFIRIRVLIDVMNP 182

Query: 125 LRRVVKLKVGSMGEEILIPITYEKLMDFCYQCGKLGHVLDACNSSSDSEEKDDLCYKCGK 184
           L+R +++ +G   +   + I YE+L +FCY CGK+GH++  C                  
Sbjct: 183 LKRGLRVALGDDEKVNEVMICYERLPNFCYYCGKIGHLVRDC------------------ 242

Query: 185 VGLVLQDCNSRFGTGEEDLQYGDWPRGMRREGDEKFEGIEFGKGEGWEEDYNDSDNQELQ 244
                   N++  T     ++G W R + R    K  G +    EG  E    SD  E  
Sbjct: 243 ------PLNTKEITSSSSFKFGPWMRAVSRT-RSKGTGEKKNSPEGSREG-GSSDTLENL 286

Query: 245 VVRISPEWLAG 256
            V+ S +W  G
Sbjct: 303 RVKGSTKWNMG 286

BLAST of Bhi04G001516 vs. ExPASy TrEMBL
Match: A0A5C7GZQ4 (CCHC-type domain-containing protein OS=Acer yangbiense OX=1000413 GN=EZV62_025894 PE=4 SV=1)

HSP 1 Score: 146.4 bits (368), Expect = 1.7e-31
Identity = 67/177 (37.85%), Positives = 108/177 (61.02%), Query Frame = 0

Query: 1   MMPMVWNFQEEVTIQNVDCNIFLCIFKAPNDKQTVLENGPWFFDKELLVFEHPPRGTCKF 60
           ++P +W   +E  I+ ++ NIF   FK  +D+++VL  GPW FDK LLV E  P G    
Sbjct: 59  VLPRIWRTVKEFEIEILEGNIFSFTFKEESDRRSVLRGGPWSFDKALLVLEE-PTGKGDI 118

Query: 61  IDLDFQYVSFWVHFHHLPMACFSRKMAIVFGNMVGRFETVETDEEGMCWGRTMRVRVSVD 120
            ++ F  V+FW+  H +P+ C + ++    G+M+G  + ++    G C G+ +RVRV VD
Sbjct: 119 REMKFDKVAFWIQIHKVPLLCMTSEIGRFLGSMIGEVKEIDDGGSGDCVGKYIRVRVVVD 178

Query: 121 VKKPLRRVVKLKVGSMGEEILIPITYEKLMDFCYQCGKLGHVLDACNSSSDSEEKDD 178
           V KPLRR++++ V   G+E  + + YE+L D CY+CG++GHV+  C+    S E +D
Sbjct: 179 VTKPLRRMLRVDVLGDGKETNMLLRYERLPDHCYRCGRIGHVVRDCSIVPSSVEPED 234

BLAST of Bhi04G001516 vs. ExPASy TrEMBL
Match: A0A1S8AC25 (CCHC-type domain-containing protein (Fragment) OS=Citrus limon OX=2708 PE=4 SV=1)

HSP 1 Score: 145.2 bits (365), Expect = 3.8e-31
Identity = 68/180 (37.78%), Positives = 108/180 (60.00%), Query Frame = 0

Query: 2   MPMVWNFQEEVTIQNVDCNIFLCIFKAPNDKQTVLENGPWFFDKELLVFEHPPRGTCKFI 61
           M  VW    EV I+ +  N+F+  F +  DK++++  GPW FD+ L+     P G     
Sbjct: 60  MQRVWRTSREVKIEKLGENVFMFKFGSEVDKRSIMVGGPWHFDRALIGLTE-PTGIGDIK 119

Query: 62  DLDFQYVSFWVHFHHLPMACFSRKMAIVFGNMVGRFETVETDEEGMCWGRTMRVRVSVDV 121
             DF +VSFWV  H +P+ C S+ MA   G ++G+ E VETD  G C+G+ +R+R+SVD+
Sbjct: 120 KQDFSHVSFWVQIHDVPIMCMSKDMAAELGKVIGKVEEVETDAAGECFGQFLRLRISVDI 179

Query: 122 KKPLRRVVKLKVGSM-GEEILIPITYEKLMDFCYQCGKLGHVLDACNSSSDSEEKDDLCY 181
            KPL+++++L+      ++I + + YE+L DFC+ CG++GH    C     S+ KD+L Y
Sbjct: 180 TKPLKKIIELEQEEEDADDIPMRVMYERLPDFCFCCGRIGHQYREC-FYYKSQSKDELAY 237

BLAST of Bhi04G001516 vs. ExPASy TrEMBL
Match: A0A6J1D765 (uncharacterized protein LOC111017902 OS=Momordica charantia OX=3673 GN=LOC111017902 PE=4 SV=1)

HSP 1 Score: 144.8 bits (364), Expect = 4.9e-31
Identity = 69/170 (40.59%), Positives = 101/170 (59.41%), Query Frame = 0

Query: 1   MMPMVWNFQEEVTIQNVDCNIFLCIFKAPNDKQTVLENGPWFFDKELLVFEHPPRGTCKF 60
           +M  VW        + +  NI++ +FK+ ++K  VL +GPW F+K LLV    P  T + 
Sbjct: 57  VMKSVWRVHNSTRFEPLGMNIYVILFKSLSEKSRVLSSGPWTFNKSLLVLT-SPTATNQP 116

Query: 61  IDLDFQYVSFWVHFHHLPMACFSRKMAIVFGNMVGRFETVETDEEGMCWGRTMRVRVSVD 120
           +D++F + +FW+  H++P  C S +MA + G  +G  E +E D      G  +RVRV +D
Sbjct: 117 LDMNFNFCAFWIQIHNIPFECISTEMANILGAKLGDVEEIEGDGADGWAGPFIRVRVKID 176

Query: 121 VKKPLRRVVKLKVGSMGEEILIPITYEKLMDFCYQCGKLGHVLDACNSSS 171
           V KPLRR +KLK  S G++I  P+ YEKL DFCY+CGK+GH    C   S
Sbjct: 177 VSKPLRRGIKLK-NSDGKDIWCPLRYEKLPDFCYECGKIGHSGRECEQRS 224
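The per-HSP statistics in the alignments above (bit score, E-value, identities, positives) follow a regular text layout, so they can be extracted with a small Python sketch. The regular expressions and the function names `parse_hsp_score` and `parse_hsp_stats` are assumptions written for this page's layout, not part of any BLAST tool; the pattern accepts both the "Postives" and "Positives" spellings.

```python
import re

# Patterns written for the HSP header lines as they appear on this page.
SCORE_RE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
STATS_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\)"
)

def parse_hsp_score(line):
    """Extract bit score, raw score and E-value from an 'HSP n Score:' line."""
    m = SCORE_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP score line: {line!r}")
    return {
        "bits": float(m.group(1)),
        "raw_score": int(m.group(2)),
        "evalue": float(m.group(3)),
    }

def parse_hsp_stats(line):
    """Extract identity/positive counts and recompute the percentages."""
    m = STATS_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identities, align_len = int(m.group(1)), int(m.group(2))
    positives = int(m.group(3))
    return {
        "identities": identities,
        "positives": positives,
        "align_len": align_len,
        "pct_identity": 100.0 * identities / align_len,
        "pct_positive": 100.0 * positives / align_len,
    }

score = parse_hsp_score("HSP 1 Score: 66.2 bits (160), Expect = 4.3e-11")
stats = parse_hsp_stats(
    "Identity = 48/177 (27.12%), Postives = 77/177 (43.50%), Query Frame = 0"
)
print(score["evalue"], round(stats["pct_identity"], 2))
```

Recomputing the percentages from the counts, rather than trusting the printed values, gives a simple consistency check on each HSP.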

The following BLAST results are available for this feature:
Match Name   E-value  Identity (%)  Description
AT3G31430.1  4.3e-11  27.12         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G42140.1  2.6e-08  21.91         zinc ion binding;nucleic acid binding
AT3G42860.1  5.1e-04  30.61         zinc knuckle (CCHC-type) family protein

Match Name   E-value  Identity (%)  Description
Q04832       1.7e-04  39.13         DNA-binding protein HEXBP OS=Leishmania major OX=5664 GN=HEXBP PE=4 SV=1
P36627       5.0e-04  42.86         Cellular nucleic acid-binding protein homolog OS=Schizosaccharomyces pombe (stra...

Match Name   E-value  Identity (%)  Description
A0A6J1DU55   7.6e-32  38.79         uncharacterized protein LOC111023135 OS=Momordica charantia OX=3673 GN=LOC111023...
A0A5C7H9Y2   1.3e-31  32.67         CCHC-type domain-containing protein OS=Acer yangbiense OX=1000413 GN=EZV62_01926...
A0A5C7GZQ4   1.7e-31  37.85         CCHC-type domain-containing protein OS=Acer yangbiense OX=1000413 GN=EZV62_02589...
A0A1S8AC25   3.8e-31  37.78         CCHC-type domain-containing protein (Fragment) OS=Citrus limon OX=2708 PE=4 SV=1
A0A6J1D765   4.9e-31  40.59         uncharacterized protein LOC111017902 OS=Momordica charantia OX=3673 GN=LOC111017...
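The summary rows above lend themselves to simple filtering and ranking. The Python sketch below copies the match names, E-values and identities from this page and keeps hits at or below an E-value cutoff; the cutoff of 1e-5 and the function name `significant_hits` are illustrative assumptions, not thresholds used by the database.

```python
# (database, match name, E-value, percent identity), copied from the
# BLAST summary on this page.
HITS = [
    ("TAIR 10", "AT3G31430.1", 4.3e-11, 27.12),
    ("TAIR 10", "AT3G42140.1", 2.6e-08, 21.91),
    ("TAIR 10", "AT3G42860.1", 5.1e-04, 30.61),
    ("Swiss-Prot", "Q04832", 1.7e-04, 39.13),
    ("Swiss-Prot", "P36627", 5.0e-04, 42.86),
    ("TrEMBL", "A0A6J1DU55", 7.6e-32, 38.79),
    ("TrEMBL", "A0A5C7H9Y2", 1.3e-31, 32.67),
    ("TrEMBL", "A0A5C7GZQ4", 1.7e-31, 37.85),
    ("TrEMBL", "A0A1S8AC25", 3.8e-31, 37.78),
    ("TrEMBL", "A0A6J1D765", 4.9e-31, 40.59),
]

def significant_hits(hits, max_evalue=1e-5):
    """Return hits with E-value <= max_evalue, best (smallest) first.

    The default cutoff is an arbitrary example value.
    """
    kept = [hit for hit in hits if hit[2] <= max_evalue]
    return sorted(kept, key=lambda hit: hit[2])

for database, name, evalue, identity in significant_hits(HITS):
    print(f"{name:<12} {database:<10} {evalue:.1e} {identity:.2f}%")
```

With this cutoff the two Swiss-Prot matches and the weakest TAIR match drop out, which reflects that they align only to the short zinc-knuckle region.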
InterPro
Analysis Name: InterPro Annotations of Wax gourd (B227) v1
Date Performed: 2021-10-22
IPR Term   IPR Description                         Source       Source Term     Source Description                                  Alignment
IPR001878  Zinc finger, CCHC-type                  SMART        SM00343         c2hcfinal6                                          coord: 178..194; e-value: 0.9; score: 11.5
IPR001878  Zinc finger, CCHC-type                  SMART        SM00343         c2hcfinal6                                          coord: 152..168; e-value: 0.21; score: 15.4
IPR001878  Zinc finger, CCHC-type                  PROSITE      PS50158         ZF_CCHC                                             coord: 153..166; score: 9.042855
IPR025836  Zinc knuckle CX2CX4HX4C                 PFAM         PF14392         zf-CCHC_4                                           coord: 119..167; e-value: 3.6E-12; score: 45.8
IPR025558  Domain of unknown function DUF4283      PFAM         PF14111         DUF4283                                             coord: 2..117; e-value: 1.0E-16; score: 60.7
None       No IPR available                        GENE3D       4.10.60.10      -                                                   coord: 151..201; e-value: 6.4E-9; score: 37.6
None       No IPR available                        PANTHER      PTHR31286:SF62  SUBFAMILY NOT NAMED                                 coord: 19..177
IPR040256  Uncharacterized protein At4g02000-like  PANTHER      PTHR31286       GLYCINE-RICH CELL WALL STRUCTURAL PROTEIN 1.8-LIKE  coord: 19..177
IPR036875  Zinc finger, CCHC-type superfamily      SUPERFAMILY  57756           Retrovirus zinc finger-like domains                 coord: 151..195
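The CX2CX4HX4C spacing named in the Pfam entry above can be illustrated with a plain regular expression in Python. This is a simplification: the InterPro member databases use profiles and hidden Markov models rather than a fixed pattern, and the function name `find_zinc_knuckles` is hypothetical. The example uses residues 151..195 of the protein sequence on this page (the SUPERFAMILY coordinates), with the offset passed explicitly so that reported positions are 1-based protein coordinates.

```python
import re

# Strict CCHC zinc-knuckle spacing: C-X2-C-X4-H-X4-C.
# A simplification of the profile-based signatures listed above.
KNUCKLE_RE = re.compile(r"C.{2}C.{4}H.{4}C")

def find_zinc_knuckles(seq, offset=1):
    """Return (start, end, motif) tuples in 1-based inclusive coordinates.

    `offset` is the protein position of the first residue in `seq`.
    """
    hits = []
    for m in KNUCKLE_RE.finditer(seq):
        start = offset + m.start()
        hits.append((start, start + len(m.group()) - 1, m.group()))
    return hits

# Residues 151..195 of the Bhi04G001516 protein sequence.
FRAGMENT = "DFCYQCGKLGHVLDACNSSSDSEEKDDLCYKCGKVGLVLQDCNSR"
print(find_zinc_knuckles(FRAGMENT, offset=151))
```

The strict pattern finds only the first knuckle. The second candidate (CYKCGKVGLVLQDC) has a leucine where the pattern requires histidine, so the fixed regular expression skips it; the SMART match reported for that region has a comparatively weak e-value (0.9).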

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Bhi04M001516  Bhi04M001516  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0008270      zinc ion binding