Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCACTTTCAACACCGAAAACTCCAAACGAATCCGAGGCGATCCCTCAACACAACAATGAAATTTCAAACCACACCCAACAACCAATGGAGATAATTGAACCACAACCAAGACCGAGACGACGTAGAACGAGAGCAGACATGACAAGAATCGAGCCACCGTATCCATGGTCAACAGACCAACAAGCAGTAATCCACGAACTCAAGTATCTTCAATCAAACAACATAATCACAATCAAGGGGGGAGTAAAATGCAAAAAATGCGAGCAAAAGTATGAGATAGAATATGACCTAATGCAAAAGTTCAATGAAATAGCAAGATTTATCGAATACGAAAAAGATAATATGCATGATAGAGCTCCAAAACGTTGGACAAACCCTATTTTGCCAAATTGCAATTTGTGCAATAAAGAAAAATGTGTGGAGCCACTCATATCTCAAGAAGATTATACTAAAATCAATTGGTGGTTCTTGCTCTTGGGAAAACTTCTTGGATGTTTGAAGCTTAAACAACTCAAATATTTTTGTGCTCAAACAAATATTCATCGAACCGGGGCCAAGAATCGTCTTCTTTATCTCACTTATCTTGTTTTGTGTTACCAACTTCAACCCTCCAATGAACTCTTCAAGATTGGTTGA
mRNA sequence
ATGTCACTTTCAACACCGAAAACTCCAAACGAATCCGAGGCGATCCCTCAACACAACAATGAAATTTCAAACCACACCCAACAACCAATGGAGATAATTGAACCACAACCAAGACCGAGACGACGTAGAACGAGAGCAGACATGACAAGAATCGAGCCACCGTATCCATGGTCAACAGACCAACAAGCAGTAATCCACGAACTCAAGTATCTTCAATCAAACAACATAATCACAATCAAGGGGGGAGTAAAATGCAAAAAATGCGAGCAAAAGTATGAGATAGAATATGACCTAATGCAAAAGTTCAATGAAATAGCAAGATTTATCGAATACGAAAAAGATAATATGCATGATAGAGCTCCAAAACGTTGGACAAACCCTATTTTGCCAAATTGCAATTTGTGCAATAAAGAAAAATGTGTGGAGCCACTCATATCTCAAGAAGATTATACTAAAATCAATTGGTGGTTCTTGCTCTTGGGAAAACTTCTTGGATGTTTGAAGCTTAAACAACTCAAATATTTTTGTGCTCAAACAAATATTCATCGAACCGGGGCCAAGAATCGTCTTCTTTATCTCACTTATCTTGTTTTGTGTTACCAACTTCAACCCTCCAATGAACTCTTCAAGATTGGTTGA
Coding sequence (CDS)
ATGTCACTTTCAACACCGAAAACTCCAAACGAATCCGAGGCGATCCCTCAACACAACAATGAAATTTCAAACCACACCCAACAACCAATGGAGATAATTGAACCACAACCAAGACCGAGACGACGTAGAACGAGAGCAGACATGACAAGAATCGAGCCACCGTATCCATGGTCAACAGACCAACAAGCAGTAATCCACGAACTCAAGTATCTTCAATCAAACAACATAATCACAATCAAGGGGGGAGTAAAATGCAAAAAATGCGAGCAAAAGTATGAGATAGAATATGACCTAATGCAAAAGTTCAATGAAATAGCAAGATTTATCGAATACGAAAAAGATAATATGCATGATAGAGCTCCAAAACGTTGGACAAACCCTATTTTGCCAAATTGCAATTTGTGCAATAAAGAAAAATGTGTGGAGCCACTCATATCTCAAGAAGATTATACTAAAATCAATTGGTGGTTCTTGCTCTTGGGAAAACTTCTTGGATGTTTGAAGCTTAAACAACTCAAATATTTTTGTGCTCAAACAAATATTCATCGAACCGGGGCCAAGAATCGTCTTCTTTATCTCACTTATCTTGTTTTGTGTTACCAACTTCAACCCTCCAATGAACTCTTCAAGATTGGTTGA
Protein sequence
MSLSTPKTPNESEAIPQHNNEISNHTQQPMEIIEPQPRPRRRRTRADMTRIEPPYPWSTDQQAVIHELKYLQSNNIITIKGGVKCKKCEQKYEIEYDLMQKFNEIARFIEYEKDNMHDRAPKRWTNPILPNCNLCNKEKCVEPLISQEDYTKINWWFLLLGKLLGCLKLKQLKYFCAQTNIHRTGAKNRLLYLTYLVLCYQLQPSNELFKIG
Homology
BLAST of Bhi05G001074 vs. TAIR 10
Match:
AT1G49330.1 (hydroxyproline-rich glycoprotein family protein )
HSP 1 Score: 169.1 bits (427), Expect = 3.9e-42
Identity = 79/159 (49.69%), Postives = 110/159 (69.18%), Query Frame = 0
Query: 51 IEPPYPWSTDQQAVIHELKYLQSNNIITIKGGVKCKKCEQKYEIEYDLMQKFNEIARFIE 110
I PP+PW+T+++ I L+YL+SN I TI G V+C+ CE+ Y++ Y+L ++F E+ +F
Sbjct: 167 ISPPFPWATNRRGEIQSLEYLESNQITTITGEVQCRHCEKVYQVSYNLRERFAEVVKFYL 226
Query: 111 YEKDNMHDRAPKRWTNPILPNCNLCNKEKCVEPLISQEDYTKINWWFLLLGKLLGCLKLK 170
EK M DRA K W P C LC +EK V+P+I+ E ++INW FLLLG+ LG L+
Sbjct: 227 TEKRKMRDRAHKDWAYPEQRRCELCGREKAVKPVIA-ERKSQINWLFLLLGQTLGFCTLE 286
Query: 171 QLKYFCAQTNIHRTGAKNRLLYLTYLVLCYQLQPSNELF 210
QLK FC + HRTGAK+R+LYLTY+ LC LQP ++LF
Sbjct: 287 QLKNFCKHSKNHRTGAKDRVLYLTYMGLCKMLQPKSDLF 324
BLAST of Bhi05G001074 vs. TAIR 10
Match:
AT2G16190.1 (BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1); Has 77 Blast hits to 77 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 6; Fungi - 13; Plants - 56; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )
HSP 1 Score: 145.2 bits (365), Expect = 6.0e-35
Identity = 80/208 (38.46%), Postives = 116/208 (55.77%), Query Frame = 0
Query: 6 PKTPNESEAIPQHNNEISNHTQQPMEIIEPQPRPRRRRTRADMTRIE---------PPYP 65
P P+E P N+++ P RR ++ + +E PPYP
Sbjct: 93 PYQPSEEVLPPPQLNQVATVALATPRRGRPPGGQARRNSKRPVAGVERNVGDREIVPPYP 152
Query: 66 WSTDQQAVIHELKYLQSNNIITIKGGVKCKKCEQKYEIEYDLMQKFNEIARFIEYEKDNM 125
W+T + I + L SNNI I G V CK C++ +EY+L +KF+E+ +I+ K+ M
Sbjct: 153 WATKKPGKIQSFRDLSSNNINVISGQVHCKTCDRTDTVEYNLEEKFSELYGYIKVNKEEM 212
Query: 126 HDRAPKRWTNPILPNCNLCNKEKCVEPLISQEDYTKINWWFLLLGKLLGCLKLKQLKYFC 185
RAP W+ P L C C E ++P++S E +INW FLLLG++LGC L QL+YFC
Sbjct: 213 RHRAPGSWSTPKLIPCRTCKSE--MKPVMS-ERKEEINWLFLLLGQMLGCCTLDQLRYFC 272
Query: 186 AQTNIHRTGAKNRLLYLTYLVLCYQLQP 205
+ HRTG+K+R++Y+TYL LC QL P
Sbjct: 273 QLNSKHRTGSKDRVVYITYLSLCKQLDP 297
BLAST of Bhi05G001074 vs. TAIR 10
Match:
AT2G16190.2 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 104.0 bits (258), Expect = 1.5e-22
Identity = 62/176 (35.23%), Postives = 91/176 (51.70%), Query Frame = 0
Query: 6 PKTPNESEAIPQHNNEISNHTQQPMEIIEPQPRPRRRRTRADMTRIE---------PPYP 65
P P+E P N+++ P RR ++ + +E PPYP
Sbjct: 93 PYQPSEEVLPPPQLNQVATVALATPRRGRPPGGQARRNSKRPVAGVERNVGDREIVPPYP 152
Query: 66 WSTDQQAVIHELKYLQSNNIITIKGGVKCKKCEQKYEIEYDLMQKFNEIARFIEYEKDNM 125
W+T + I + L SNNI I G V CK C++ +EY+L +KF+E+ +I+ K+ M
Sbjct: 153 WATKKPGKIQSFRDLSSNNINVISGQVHCKTCDRTDTVEYNLEEKFSELYGYIKVNKEEM 212
Query: 126 HDRAPKRWTNPILPNCNLCNKEKCVEPLISQEDYTKINWWFLLLGKLLGCLKLKQL 173
RAP W+ P L C C E ++P++S E +INW FLLLG++LGC L QL
Sbjct: 213 RHRAPGSWSTPKLIPCRTCKSE--MKPVMS-ERKEEINWLFLLLGQMLGCCTLDQL 265
BLAST of Bhi05G001074 vs. ExPASy TrEMBL
Match:
A0A1S3CK70 (uncharacterized protein LOC103501397 OS=Cucumis melo OX=3656 GN=LOC103501397 PE=4 SV=1)
HSP 1 Score: 281.6 bits (719), Expect = 2.8e-72
Identity = 143/203 (70.44%), Postives = 160/203 (78.82%), Query Frame = 0
Query: 2 SLSTPKTPNESEAIPQHNNEISNHTQQPMEIIEPQPRPRRRRTRADMTRIEPPYPWSTDQ 61
+L + T ESEAI + NNE SN+ QQ + RRRRTRADMTRIEPPYPWSTD+
Sbjct: 31 TLPSQITNEESEAITRPNNETSNNQQQ---------QRRRRRTRADMTRIEPPYPWSTDR 90
Query: 62 QAVIHELKYLQSNNIITIKGGVKCKKCEQKYEIEYDLMQKFNEIARFIEYEKDNMHDRAP 121
+AV+HELKYLQ NNI+TIKG V CKKCE KYE+EYDLM K NEI RF E E D+MHDRAP
Sbjct: 91 RAVVHELKYLQVNNIMTIKGEVICKKCEMKYEMEYDLMNKVNEITRFFEEEIDSMHDRAP 150
Query: 122 KRWTNPILPNCNLCNKEKCVEPLISQEDYTKINWWFLLLGKLLGCLKLKQLKYFCAQTNI 181
WTNP LPNC+LCN+EKCV P+ SQED TKINW FL LG+ LGCLKL+QLKYFC QTNI
Sbjct: 151 SCWTNPNLPNCSLCNEEKCVMPVTSQED-TKINWLFLFLGQFLGCLKLRQLKYFCTQTNI 210
Query: 182 HRTGAKNRLLYLTYLVLCYQLQP 205
HRTGAKNRLLYL+Y L QLQP
Sbjct: 211 HRTGAKNRLLYLSYRTLFRQLQP 223
BLAST of Bhi05G001074 vs. ExPASy TrEMBL
Match:
A0A0A0KMQ2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G332030 PE=4 SV=1)
HSP 1 Score: 281.2 bits (718), Expect = 3.6e-72
Identity = 143/208 (68.75%), Postives = 163/208 (78.37%), Query Frame = 0
Query: 2 SLSTPKTPN-ESEAIPQHNNEISNHTQQPMEIIEPQPRPRRRRTRADMTRIEPPYPWSTD 61
+L + + PN ES A PQ N E SN QQ + R RRRRTRADMTRIEPPYPW+TD
Sbjct: 37 TLPSSQIPNEESNATPQPNIETSNDQQQ------HRRRLRRRRTRADMTRIEPPYPWATD 96
Query: 62 QQAVIHELKYLQSNNIITIKGGVKCKKCEQKYEIEYDLMQKFNEIARFIEYEKDNMHDRA 121
++AV+HELKYLQSNNI+ IKG V CKKCE KYEIEYDLM K NEI RF E E D+MHDRA
Sbjct: 97 KRAVVHELKYLQSNNIMKIKGEVICKKCEMKYEIEYDLMNKVNEITRFFEEEIDSMHDRA 156
Query: 122 PKRWTNPILPNCNLCNKEKCVEPLISQEDYTKINWWFLLLGKLLGCLKLKQLKYFCAQTN 181
P WT P LPNCN CN+EKCV P+IS+ED +KINW FL LG+ LGCL+LKQLK+FCAQ+N
Sbjct: 157 PNCWTKPNLPNCNFCNEEKCVMPVISKEDDSKINWLFLFLGQFLGCLRLKQLKHFCAQSN 216
Query: 182 IHRTGAKNRLLYLTYLVLCYQLQPSNEL 209
IHRTGAKNRLLYL+Y L +QLQPS L
Sbjct: 217 IHRTGAKNRLLYLSYRALFHQLQPSPTL 238
BLAST of Bhi05G001074 vs. ExPASy TrEMBL
Match:
A0A1S3AZB1 (protein PAF1 homolog OS=Cucumis melo OX=3656 GN=LOC103484199 PE=4 SV=1)
HSP 1 Score: 275.4 bits (703), Expect = 2.0e-70
Identity = 133/203 (65.52%), Postives = 164/203 (80.79%), Query Frame = 0
Query: 5 TPKTPNESEAIPQHNNEISNHTQQPMEIIEPQPRPRRRRTRADMTRIEPPYPWSTDQQAV 64
T +T N+ IP+ + N QP EI P+P+RRRT+AD +RIEPPYPWST++ AV
Sbjct: 145 TQQTQNQPPEIPKPKRQTQN---QPPEI----PKPKRRRTQADNSRIEPPYPWSTEKGAV 204
Query: 65 IHELKYLQSNNIITIKGGVKCKKCEQKYEIEYDLMQKFNEIARFIEYEKDNMHDRAPKRW 124
IH+L+YL++NNI+TIKG VKCK+C++K EIEY+L+ KF+EI RFIE EKDNMHDRAP RW
Sbjct: 205 IHKLEYLEANNILTIKGEVKCKRCDRKDEIEYELISKFDEIRRFIEREKDNMHDRAPDRW 264
Query: 125 TNPILPNCNLCNKEKCVEPLISQEDYTKINWWFLLLGKLLGCLKLKQLKYFCAQTNIHRT 184
NPIL NCN CNKE+CVEP+IS+ + + INW FLLLG LGCLKL QLKYFC QTNIHRT
Sbjct: 265 VNPILLNCNFCNKEECVEPIISEAN-SNINWLFLLLGNFLGCLKLSQLKYFCTQTNIHRT 324
Query: 185 GAKNRLLYLTYLVLCYQLQPSNE 208
GAK+RL+YLTYL LC QLQP+++
Sbjct: 325 GAKDRLIYLTYLALCKQLQPNSD 339
BLAST of Bhi05G001074 vs. ExPASy TrEMBL
Match:
A0A6J1GM83 (mucin-16-like OS=Cucurbita moschata OX=3662 GN=LOC111455541 PE=4 SV=1)
HSP 1 Score: 268.1 bits (684), Expect = 3.2e-68
Identity = 134/207 (64.73%), Postives = 157/207 (75.85%), Query Frame = 0
Query: 7 KTPNESEAIPQHNNEISNHTQQPMEIIEPQPRPRRRRTRADMTRIEPPYPWSTDQQAVIH 66
+TPN+S IPQ N S +PR RR RTRAD RIEPPYPWS +Q+A IH
Sbjct: 413 QTPNQSTTIPQATNGHST----------SRPRRRRSRTRADTRRIEPPYPWSAEQRASIH 472
Query: 67 ELKYLQSNNIITIKGGVKCKKCEQKYEIEYDLMQKFNEIARFIEYEKDNMHDRAPKRWTN 126
L+YLQSNNI+TIKG V+CKKCE+ YEIEY+LM KF+EIARFIE E+DNMHDRAP W N
Sbjct: 473 NLEYLQSNNIVTIKGDVRCKKCERFYEIEYNLMNKFDEIARFIERERDNMHDRAPICWKN 532
Query: 127 PILPNCNLCNKEKCVEPLISQED----YTKINWWFLLLGKLLGCLKLKQLKYFCAQTNIH 186
PILPNC C +E CVEP+I E+ +++INW FLLLG+L+G LKLKQLKYFCA T H
Sbjct: 533 PILPNCEHCREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQLKYFCAHTYNH 592
Query: 187 RTGAKNRLLYLTYLVLCYQLQPSNELF 210
RTGAK+RL++LTYL LC QLQPSN LF
Sbjct: 593 RTGAKDRLIFLTYLALCKQLQPSNRLF 609
BLAST of Bhi05G001074 vs. ExPASy TrEMBL
Match:
A0A6J1I8I0 (uncharacterized protein KIAA0754-like OS=Cucurbita maxima OX=3661 GN=LOC111470967 PE=4 SV=1)
HSP 1 Score: 265.4 bits (677), Expect = 2.1e-67
Identity = 133/207 (64.25%), Postives = 155/207 (74.88%), Query Frame = 0
Query: 7 KTPNESEAIPQHNNEISNHTQQPMEIIEPQPRPRRRRTRADMTRIEPPYPWSTDQQAVIH 66
+TPN+S IPQ N S +PR RR RTRAD RIEPPYPWS +Q+A IH
Sbjct: 407 ETPNQSTTIPQATNGHST----------SRPRRRRSRTRADTRRIEPPYPWSAEQRASIH 466
Query: 67 ELKYLQSNNIITIKGGVKCKKCEQKYEIEYDLMQKFNEIARFIEYEKDNMHDRAPKRWTN 126
L+YLQSNNI+ IKG V+CKKCE+ YEIEY+LM KF+EIARFIE E+DNMHDRAP W N
Sbjct: 467 NLEYLQSNNIVMIKGDVRCKKCERYYEIEYNLMNKFDEIARFIERERDNMHDRAPICWKN 526
Query: 127 PILPNCNLCNKEKCVEPLISQED----YTKINWWFLLLGKLLGCLKLKQLKYFCAQTNIH 186
PILPNC C +E CVEP+I E+ + +INW FLLLG+L+G LKLKQLKYFCA T H
Sbjct: 527 PILPNCEHCREENCVEPMIPDEEDDNQFRRINWLFLLLGQLIGRLKLKQLKYFCAHTYNH 586
Query: 187 RTGAKNRLLYLTYLVLCYQLQPSNELF 210
RTGAK+RL++LTYL LC QLQPSN LF
Sbjct: 587 RTGAKDRLIFLTYLALCKQLQPSNRLF 603
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
AT1G49330.1 | 3.9e-42 | 49.69 | hydroxyproline-rich glycoprotein family protein | [more] |
AT2G16190.1 | 6.0e-35 | 38.46 | BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein fam... | [more] |
AT2G16190.2 | 1.5e-22 | 35.23 | FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow... | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A1S3CK70 | 2.8e-72 | 70.44 | uncharacterized protein LOC103501397 OS=Cucumis melo OX=3656 GN=LOC103501397 PE=... | [more] |
A0A0A0KMQ2 | 3.6e-72 | 68.75 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G332030 PE=4 SV=1 | [more] |
A0A1S3AZB1 | 2.0e-70 | 65.52 | protein PAF1 homolog OS=Cucumis melo OX=3656 GN=LOC103484199 PE=4 SV=1 | [more] |
A0A6J1GM83 | 3.2e-68 | 64.73 | mucin-16-like OS=Cucurbita moschata OX=3662 GN=LOC111455541 PE=4 SV=1 | [more] |
A0A6J1I8I0 | 2.1e-67 | 64.25 | uncharacterized protein KIAA0754-like OS=Cucurbita maxima OX=3661 GN=LOC11147096... | [more] |