Bhi05G001074 (gene) Wax gourd (B227) v1

Overview
NameBhi05G001074
Typegene
OrganismBenincasa hispida (Wax gourd (B227) v1)
DescriptionBEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein .
Locationchr5: 43232921 .. 43233559 (-)
RNA-Seq ExpressionBhi05G001074
SyntenyBhi05G001074
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCACTTTCAACACCGAAAACTCCAAACGAATCCGAGGCGATCCCTCAACACAACAATGAAATTTCAAACCACACCCAACAACCAATGGAGATAATTGAACCACAACCAAGACCGAGACGACGTAGAACGAGAGCAGACATGACAAGAATCGAGCCACCGTATCCATGGTCAACAGACCAACAAGCAGTAATCCACGAACTCAAGTATCTTCAATCAAACAACATAATCACAATCAAGGGGGGAGTAAAATGCAAAAAATGCGAGCAAAAGTATGAGATAGAATATGACCTAATGCAAAAGTTCAATGAAATAGCAAGATTTATCGAATACGAAAAAGATAATATGCATGATAGAGCTCCAAAACGTTGGACAAACCCTATTTTGCCAAATTGCAATTTGTGCAATAAAGAAAAATGTGTGGAGCCACTCATATCTCAAGAAGATTATACTAAAATCAATTGGTGGTTCTTGCTCTTGGGAAAACTTCTTGGATGTTTGAAGCTTAAACAACTCAAATATTTTTGTGCTCAAACAAATATTCATCGAACCGGGGCCAAGAATCGTCTTCTTTATCTCACTTATCTTGTTTTGTGTTACCAACTTCAACCCTCCAATGAACTCTTCAAGATTGGTTGA

mRNA sequence

ATGTCACTTTCAACACCGAAAACTCCAAACGAATCCGAGGCGATCCCTCAACACAACAATGAAATTTCAAACCACACCCAACAACCAATGGAGATAATTGAACCACAACCAAGACCGAGACGACGTAGAACGAGAGCAGACATGACAAGAATCGAGCCACCGTATCCATGGTCAACAGACCAACAAGCAGTAATCCACGAACTCAAGTATCTTCAATCAAACAACATAATCACAATCAAGGGGGGAGTAAAATGCAAAAAATGCGAGCAAAAGTATGAGATAGAATATGACCTAATGCAAAAGTTCAATGAAATAGCAAGATTTATCGAATACGAAAAAGATAATATGCATGATAGAGCTCCAAAACGTTGGACAAACCCTATTTTGCCAAATTGCAATTTGTGCAATAAAGAAAAATGTGTGGAGCCACTCATATCTCAAGAAGATTATACTAAAATCAATTGGTGGTTCTTGCTCTTGGGAAAACTTCTTGGATGTTTGAAGCTTAAACAACTCAAATATTTTTGTGCTCAAACAAATATTCATCGAACCGGGGCCAAGAATCGTCTTCTTTATCTCACTTATCTTGTTTTGTGTTACCAACTTCAACCCTCCAATGAACTCTTCAAGATTGGTTGA

Coding sequence (CDS)

ATGTCACTTTCAACACCGAAAACTCCAAACGAATCCGAGGCGATCCCTCAACACAACAATGAAATTTCAAACCACACCCAACAACCAATGGAGATAATTGAACCACAACCAAGACCGAGACGACGTAGAACGAGAGCAGACATGACAAGAATCGAGCCACCGTATCCATGGTCAACAGACCAACAAGCAGTAATCCACGAACTCAAGTATCTTCAATCAAACAACATAATCACAATCAAGGGGGGAGTAAAATGCAAAAAATGCGAGCAAAAGTATGAGATAGAATATGACCTAATGCAAAAGTTCAATGAAATAGCAAGATTTATCGAATACGAAAAAGATAATATGCATGATAGAGCTCCAAAACGTTGGACAAACCCTATTTTGCCAAATTGCAATTTGTGCAATAAAGAAAAATGTGTGGAGCCACTCATATCTCAAGAAGATTATACTAAAATCAATTGGTGGTTCTTGCTCTTGGGAAAACTTCTTGGATGTTTGAAGCTTAAACAACTCAAATATTTTTGTGCTCAAACAAATATTCATCGAACCGGGGCCAAGAATCGTCTTCTTTATCTCACTTATCTTGTTTTGTGTTACCAACTTCAACCCTCCAATGAACTCTTCAAGATTGGTTGA

Protein sequence

MSLSTPKTPNESEAIPQHNNEISNHTQQPMEIIEPQPRPRRRRTRADMTRIEPPYPWSTDQQAVIHELKYLQSNNIITIKGGVKCKKCEQKYEIEYDLMQKFNEIARFIEYEKDNMHDRAPKRWTNPILPNCNLCNKEKCVEPLISQEDYTKINWWFLLLGKLLGCLKLKQLKYFCAQTNIHRTGAKNRLLYLTYLVLCYQLQPSNELFKIG
Homology
BLAST of Bhi05G001074 vs. TAIR 10
Match: AT1G49330.1 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 169.1 bits (427), Expect = 3.9e-42
Identity = 79/159 (49.69%), Postives = 110/159 (69.18%), Query Frame = 0

Query: 51  IEPPYPWSTDQQAVIHELKYLQSNNIITIKGGVKCKKCEQKYEIEYDLMQKFNEIARFIE 110
           I PP+PW+T+++  I  L+YL+SN I TI G V+C+ CE+ Y++ Y+L ++F E+ +F  
Sbjct: 167 ISPPFPWATNRRGEIQSLEYLESNQITTITGEVQCRHCEKVYQVSYNLRERFAEVVKFYL 226

Query: 111 YEKDNMHDRAPKRWTNPILPNCNLCNKEKCVEPLISQEDYTKINWWFLLLGKLLGCLKLK 170
            EK  M DRA K W  P    C LC +EK V+P+I+ E  ++INW FLLLG+ LG   L+
Sbjct: 227 TEKRKMRDRAHKDWAYPEQRRCELCGREKAVKPVIA-ERKSQINWLFLLLGQTLGFCTLE 286

Query: 171 QLKYFCAQTNIHRTGAKNRLLYLTYLVLCYQLQPSNELF 210
           QLK FC  +  HRTGAK+R+LYLTY+ LC  LQP ++LF
Sbjct: 287 QLKNFCKHSKNHRTGAKDRVLYLTYMGLCKMLQPKSDLF 324

BLAST of Bhi05G001074 vs. TAIR 10
Match: AT2G16190.1 (BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1); Has 77 Blast hits to 77 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 6; Fungi - 13; Plants - 56; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 145.2 bits (365), Expect = 6.0e-35
Identity = 80/208 (38.46%), Postives = 116/208 (55.77%), Query Frame = 0

Query: 6   PKTPNESEAIPQHNNEISNHTQQPMEIIEPQPRPRRRRTRADMTRIE---------PPYP 65
           P  P+E    P   N+++           P     RR ++  +  +E         PPYP
Sbjct: 93  PYQPSEEVLPPPQLNQVATVALATPRRGRPPGGQARRNSKRPVAGVERNVGDREIVPPYP 152

Query: 66  WSTDQQAVIHELKYLQSNNIITIKGGVKCKKCEQKYEIEYDLMQKFNEIARFIEYEKDNM 125
           W+T +   I   + L SNNI  I G V CK C++   +EY+L +KF+E+  +I+  K+ M
Sbjct: 153 WATKKPGKIQSFRDLSSNNINVISGQVHCKTCDRTDTVEYNLEEKFSELYGYIKVNKEEM 212

Query: 126 HDRAPKRWTNPILPNCNLCNKEKCVEPLISQEDYTKINWWFLLLGKLLGCLKLKQLKYFC 185
             RAP  W+ P L  C  C  E  ++P++S E   +INW FLLLG++LGC  L QL+YFC
Sbjct: 213 RHRAPGSWSTPKLIPCRTCKSE--MKPVMS-ERKEEINWLFLLLGQMLGCCTLDQLRYFC 272

Query: 186 AQTNIHRTGAKNRLLYLTYLVLCYQLQP 205
              + HRTG+K+R++Y+TYL LC QL P
Sbjct: 273 QLNSKHRTGSKDRVVYITYLSLCKQLDP 297

BLAST of Bhi05G001074 vs. TAIR 10
Match: AT2G16190.2 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 104.0 bits (258), Expect = 1.5e-22
Identity = 62/176 (35.23%), Postives = 91/176 (51.70%), Query Frame = 0

Query: 6   PKTPNESEAIPQHNNEISNHTQQPMEIIEPQPRPRRRRTRADMTRIE---------PPYP 65
           P  P+E    P   N+++           P     RR ++  +  +E         PPYP
Sbjct: 93  PYQPSEEVLPPPQLNQVATVALATPRRGRPPGGQARRNSKRPVAGVERNVGDREIVPPYP 152

Query: 66  WSTDQQAVIHELKYLQSNNIITIKGGVKCKKCEQKYEIEYDLMQKFNEIARFIEYEKDNM 125
           W+T +   I   + L SNNI  I G V CK C++   +EY+L +KF+E+  +I+  K+ M
Sbjct: 153 WATKKPGKIQSFRDLSSNNINVISGQVHCKTCDRTDTVEYNLEEKFSELYGYIKVNKEEM 212

Query: 126 HDRAPKRWTNPILPNCNLCNKEKCVEPLISQEDYTKINWWFLLLGKLLGCLKLKQL 173
             RAP  W+ P L  C  C  E  ++P++S E   +INW FLLLG++LGC  L QL
Sbjct: 213 RHRAPGSWSTPKLIPCRTCKSE--MKPVMS-ERKEEINWLFLLLGQMLGCCTLDQL 265

BLAST of Bhi05G001074 vs. ExPASy TrEMBL
Match: A0A1S3CK70 (uncharacterized protein LOC103501397 OS=Cucumis melo OX=3656 GN=LOC103501397 PE=4 SV=1)

HSP 1 Score: 281.6 bits (719), Expect = 2.8e-72
Identity = 143/203 (70.44%), Postives = 160/203 (78.82%), Query Frame = 0

Query: 2   SLSTPKTPNESEAIPQHNNEISNHTQQPMEIIEPQPRPRRRRTRADMTRIEPPYPWSTDQ 61
           +L +  T  ESEAI + NNE SN+ QQ         + RRRRTRADMTRIEPPYPWSTD+
Sbjct: 31  TLPSQITNEESEAITRPNNETSNNQQQ---------QRRRRRTRADMTRIEPPYPWSTDR 90

Query: 62  QAVIHELKYLQSNNIITIKGGVKCKKCEQKYEIEYDLMQKFNEIARFIEYEKDNMHDRAP 121
           +AV+HELKYLQ NNI+TIKG V CKKCE KYE+EYDLM K NEI RF E E D+MHDRAP
Sbjct: 91  RAVVHELKYLQVNNIMTIKGEVICKKCEMKYEMEYDLMNKVNEITRFFEEEIDSMHDRAP 150

Query: 122 KRWTNPILPNCNLCNKEKCVEPLISQEDYTKINWWFLLLGKLLGCLKLKQLKYFCAQTNI 181
             WTNP LPNC+LCN+EKCV P+ SQED TKINW FL LG+ LGCLKL+QLKYFC QTNI
Sbjct: 151 SCWTNPNLPNCSLCNEEKCVMPVTSQED-TKINWLFLFLGQFLGCLKLRQLKYFCTQTNI 210

Query: 182 HRTGAKNRLLYLTYLVLCYQLQP 205
           HRTGAKNRLLYL+Y  L  QLQP
Sbjct: 211 HRTGAKNRLLYLSYRTLFRQLQP 223

BLAST of Bhi05G001074 vs. ExPASy TrEMBL
Match: A0A0A0KMQ2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G332030 PE=4 SV=1)

HSP 1 Score: 281.2 bits (718), Expect = 3.6e-72
Identity = 143/208 (68.75%), Postives = 163/208 (78.37%), Query Frame = 0

Query: 2   SLSTPKTPN-ESEAIPQHNNEISNHTQQPMEIIEPQPRPRRRRTRADMTRIEPPYPWSTD 61
           +L + + PN ES A PQ N E SN  QQ       + R RRRRTRADMTRIEPPYPW+TD
Sbjct: 37  TLPSSQIPNEESNATPQPNIETSNDQQQ------HRRRLRRRRTRADMTRIEPPYPWATD 96

Query: 62  QQAVIHELKYLQSNNIITIKGGVKCKKCEQKYEIEYDLMQKFNEIARFIEYEKDNMHDRA 121
           ++AV+HELKYLQSNNI+ IKG V CKKCE KYEIEYDLM K NEI RF E E D+MHDRA
Sbjct: 97  KRAVVHELKYLQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRA 156

Query: 122 PKRWTNPILPNCNLCNKEKCVEPLISQEDYTKINWWFLLLGKLLGCLKLKQLKYFCAQTN 181
           P  WT P LPNCN CN+EKCV P+IS+ED +KINW FL LG+ LGCL+LKQLK+FCAQ+N
Sbjct: 157 PNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQLKHFCAQSN 216

Query: 182 IHRTGAKNRLLYLTYLVLCYQLQPSNEL 209
           IHRTGAKNRLLYL+Y  L +QLQPS  L
Sbjct: 217 IHRTGAKNRLLYLSYRALFHQLQPSPTL 238

BLAST of Bhi05G001074 vs. ExPASy TrEMBL
Match: A0A1S3AZB1 (protein PAF1 homolog OS=Cucumis melo OX=3656 GN=LOC103484199 PE=4 SV=1)

HSP 1 Score: 275.4 bits (703), Expect = 2.0e-70
Identity = 133/203 (65.52%), Postives = 164/203 (80.79%), Query Frame = 0

Query: 5   TPKTPNESEAIPQHNNEISNHTQQPMEIIEPQPRPRRRRTRADMTRIEPPYPWSTDQQAV 64
           T +T N+   IP+   +  N   QP EI    P+P+RRRT+AD +RIEPPYPWST++ AV
Sbjct: 145 TQQTQNQPPEIPKPKRQTQN---QPPEI----PKPKRRRTQADNSRIEPPYPWSTEKGAV 204

Query: 65  IHELKYLQSNNIITIKGGVKCKKCEQKYEIEYDLMQKFNEIARFIEYEKDNMHDRAPKRW 124
           IH+L+YL++NNI+TIKG VKCK+C++K EIEY+L+ KF+EI RFIE EKDNMHDRAP RW
Sbjct: 205 IHKLEYLEANNILTIKGEVKCKRCDRKDEIEYELISKFDEIRRFIEREKDNMHDRAPDRW 264

Query: 125 TNPILPNCNLCNKEKCVEPLISQEDYTKINWWFLLLGKLLGCLKLKQLKYFCAQTNIHRT 184
            NPIL NCN CNKE+CVEP+IS+ + + INW FLLLG  LGCLKL QLKYFC QTNIHRT
Sbjct: 265 VNPILLNCNFCNKEECVEPIISEAN-SNINWLFLLLGNFLGCLKLSQLKYFCTQTNIHRT 324

Query: 185 GAKNRLLYLTYLVLCYQLQPSNE 208
           GAK+RL+YLTYL LC QLQP+++
Sbjct: 325 GAKDRLIYLTYLALCKQLQPNSD 339

BLAST of Bhi05G001074 vs. ExPASy TrEMBL
Match: A0A6J1GM83 (mucin-16-like OS=Cucurbita moschata OX=3662 GN=LOC111455541 PE=4 SV=1)

HSP 1 Score: 268.1 bits (684), Expect = 3.2e-68
Identity = 134/207 (64.73%), Postives = 157/207 (75.85%), Query Frame = 0

Query: 7   KTPNESEAIPQHNNEISNHTQQPMEIIEPQPRPRRRRTRADMTRIEPPYPWSTDQQAVIH 66
           +TPN+S  IPQ  N  S            +PR RR RTRAD  RIEPPYPWS +Q+A IH
Sbjct: 413 QTPNQSTTIPQATNGHST----------SRPRRRRSRTRADTRRIEPPYPWSAEQRASIH 472

Query: 67  ELKYLQSNNIITIKGGVKCKKCEQKYEIEYDLMQKFNEIARFIEYEKDNMHDRAPKRWTN 126
            L+YLQSNNI+TIKG V+CKKCE+ YEIEY+LM KF+EIARFIE E+DNMHDRAP  W N
Sbjct: 473 NLEYLQSNNIVTIKGDVRCKKCERFYEIEYNLMNKFDEIARFIERERDNMHDRAPICWKN 532

Query: 127 PILPNCNLCNKEKCVEPLISQED----YTKINWWFLLLGKLLGCLKLKQLKYFCAQTNIH 186
           PILPNC  C +E CVEP+I  E+    +++INW FLLLG+L+G LKLKQLKYFCA T  H
Sbjct: 533 PILPNCEHCREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQLKYFCAHTYNH 592

Query: 187 RTGAKNRLLYLTYLVLCYQLQPSNELF 210
           RTGAK+RL++LTYL LC QLQPSN LF
Sbjct: 593 RTGAKDRLIFLTYLALCKQLQPSNRLF 609

BLAST of Bhi05G001074 vs. ExPASy TrEMBL
Match: A0A6J1I8I0 (uncharacterized protein KIAA0754-like OS=Cucurbita maxima OX=3661 GN=LOC111470967 PE=4 SV=1)

HSP 1 Score: 265.4 bits (677), Expect = 2.1e-67
Identity = 133/207 (64.25%), Postives = 155/207 (74.88%), Query Frame = 0

Query: 7   KTPNESEAIPQHNNEISNHTQQPMEIIEPQPRPRRRRTRADMTRIEPPYPWSTDQQAVIH 66
           +TPN+S  IPQ  N  S            +PR RR RTRAD  RIEPPYPWS +Q+A IH
Sbjct: 407 ETPNQSTTIPQATNGHST----------SRPRRRRSRTRADTRRIEPPYPWSAEQRASIH 466

Query: 67  ELKYLQSNNIITIKGGVKCKKCEQKYEIEYDLMQKFNEIARFIEYEKDNMHDRAPKRWTN 126
            L+YLQSNNI+ IKG V+CKKCE+ YEIEY+LM KF+EIARFIE E+DNMHDRAP  W N
Sbjct: 467 NLEYLQSNNIVMIKGDVRCKKCERYYEIEYNLMNKFDEIARFIERERDNMHDRAPICWKN 526

Query: 127 PILPNCNLCNKEKCVEPLISQED----YTKINWWFLLLGKLLGCLKLKQLKYFCAQTNIH 186
           PILPNC  C +E CVEP+I  E+    + +INW FLLLG+L+G LKLKQLKYFCA T  H
Sbjct: 527 PILPNCEHCREENCVEPMIPDEEDDNQFRRINWLFLLLGQLIGRLKLKQLKYFCAHTYNH 586

Query: 187 RTGAKNRLLYLTYLVLCYQLQPSNELF 210
           RTGAK+RL++LTYL LC QLQPSN LF
Sbjct: 587 RTGAKDRLIFLTYLALCKQLQPSNRLF 603

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
AT1G49330.13.9e-4249.69hydroxyproline-rich glycoprotein family protein [more]
AT2G16190.16.0e-3538.46BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein fam... [more]
AT2G16190.21.5e-2235.23FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow... [more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A1S3CK702.8e-7270.44uncharacterized protein LOC103501397 OS=Cucumis melo OX=3656 GN=LOC103501397 PE=... [more]
A0A0A0KMQ23.6e-7268.75Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G332030 PE=4 SV=1[more]
A0A1S3AZB12.0e-7065.52protein PAF1 homolog OS=Cucumis melo OX=3656 GN=LOC103484199 PE=4 SV=1[more]
A0A6J1GM833.2e-6864.73mucin-16-like OS=Cucurbita moschata OX=3662 GN=LOC111455541 PE=4 SV=1[more]
A0A6J1I8I02.1e-6764.25uncharacterized protein KIAA0754-like OS=Cucurbita maxima OX=3661 GN=LOC11147096... [more]
InterPro
Analysis Name: InterPro Annotations of Wax gourd (B227) v1
Date Performed: 2021-10-22
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..44
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..31
NoneNo IPR availablePANTHERPTHR34272EXPRESSED PROTEINcoord: 12..209

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Bhi05M001074Bhi05M001074mRNA