Bhi04G001740 (gene) Wax gourd (B227) v1

Overview
Name: Bhi04G001740
Type: gene
Organism: Benincasa hispida (Wax gourd (B227) v1)
Description: Low affinity potassium transport system protein
Location: chr4: 59144114 .. 59144787 (+)
RNA-Seq Expression: Bhi04G001740
Synteny: Bhi04G001740
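The Location field gives a chromosome, a start and end coordinate, and a strand. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for this kind of record, but not stated on the page), the span length can be derived as follows. The function names `parse_location` and `span_length` are hypothetical helpers introduced only for illustration.

```python
import re

def parse_location(text):
    """Split 'chrom: start .. end (strand)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("chr4: 59144114 .. 59144787 (+)")
print(chrom, strand, span_length(start, end))  # chr4 + 674
```

Under that assumption the gene spans 674 bases on the plus strand of chr4.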
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTGGACTGTTCTGATTATCGATCACTGGTGGTGGACAGGTGTCTTTGGTTGCTTGAGGGGTGGAAGCCTTCCAAGAAATAAATTTTGAGCTCCCCAGATGTGGTATAACTAATGTTATTGGCAGTAGAAGGAGGAGGGTTCTTTTCTTCTTCAGCATCTGGGTATAGCAAGGGTTTGACCCTTCTCCTTTTGGGTCAAAAGGACGAAGATAAACCCATGAGAGTTTCTCCGTGGAATCATTACCAGTTGGTGGACCAAGAATCAGAATCTGACCTCCAGCTGGCTTCCACAAAGAACCACATTTCCCACGGCTGTGCCTCCTTTGTTTGCTTTGGTCGCACTTCCGCAGGGCTCGACAGTTCCCCTTCTCCTCTTAAAGTTGGCCCAACCTTACCACATGATTCTTTGCCTGGGCCTATAAGTACCGAAGAGAGGAAAGATGAATTTCCTAATGTGGAAGATGGTAATACTTTCAGAAATATAGCCCTTAAAAGTAGCTTGAAAAGGCCAAGTAATGGTATTTCAATTTCTCATCAGAATGCTCATGAAAGTGAAACAATAAGTAAAAAGGATGGTGATATACGTTGTCTTACTAATAGAAGGAAAGTTCAGTGGACTGATGCTTGTGGGAGTGAACTTGTAGAGATCAGGGAATTTGAGCCCAGGTATTGA

mRNA sequence

ATGTGGACTGTTCTGATTATCGATCACTGGTGGTGGACAGTAGAAGGAGGAGGGTTCTTTTCTTCTTCAGCATCTGGGTATAGCAAGGGTTTGACCCTTCTCCTTTTGGGTCAAAAGGACGAAGATAAACCCATGAGAGTTTCTCCGTGGAATCATTACCAGTTGGTGGACCAAGAATCAGAATCTGACCTCCAGCTGGCTTCCACAAAGAACCACATTTCCCACGGCTGTGCCTCCTTTGTTTGCTTTGGTCGCACTTCCGCAGGGCTCGACAGTTCCCCTTCTCCTCTTAAAGTTGGCCCAACCTTACCACATGATTCTTTGCCTGGGCCTATAAGTACCGAAGAGAGGAAAGATGAATTTCCTAATGTGGAAGATGGTAATACTTTCAGAAATATAGCCCTTAAAAGTAGCTTGAAAAGGCCAAGTAATGGTATTTCAATTTCTCATCAGAATGCTCATGAAAGTGAAACAATAAGTAAAAAGGATGGTGATATACGTTGTCTTACTAATAGAAGGAAAGTTCAGTGGACTGATGCTTGTGGGAGTGAACTTGTAGAGATCAGGGAATTTGAGCCCAGGTATTGA
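The gene sequence differs from the mRNA sequence only by the intron it contains. A rough way to recover the intron in plain text is to strip the longest shared prefix and suffix of the two sequences. The sketch below assumes exactly one intron and identical exon sequence in both records; `find_single_intron` is a hypothetical helper, and the short sequences in the example are made up for illustration rather than taken from this gene.

```python
def find_single_intron(genomic, spliced):
    """Return (0-based offset, intron) for a gene with exactly one intron.

    The shared prefix is matched first, so when bases repeat across the
    splice junction the intron is placed as far downstream as possible.
    """
    if len(genomic) < len(spliced):
        raise ValueError("genomic sequence is shorter than spliced sequence")
    prefix = 0
    while prefix < len(spliced) and genomic[prefix] == spliced[prefix]:
        prefix += 1
    # Cap the suffix so it never reuses bases already counted in the prefix.
    max_suffix = len(spliced) - prefix
    suffix = 0
    while suffix < max_suffix and genomic[-1 - suffix] == spliced[-1 - suffix]:
        suffix += 1
    if prefix + suffix != len(spliced):
        raise ValueError("sequences do not differ by a single insertion")
    return prefix, genomic[prefix:len(genomic) - suffix]

# Toy example (invented sequences): exon1 + intron + exon2 versus exon1 + exon2.
exon1, intron, exon2 = "ATGGACAG", "GTAAATTGGCAG", "TAGAAGG"
offset, found = find_single_intron(exon1 + intron + exon2, exon1 + exon2)
print(offset, found)  # 8 GTAAATTGGCAG
```

Passing the full gene and mRNA sequences from this page to the same function should return the intron, provided the one-intron assumption holds.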

Coding sequence (CDS)

ATGTGGACTGTTCTGATTATCGATCACTGGTGGTGGACAGTAGAAGGAGGAGGGTTCTTTTCTTCTTCAGCATCTGGGTATAGCAAGGGTTTGACCCTTCTCCTTTTGGGTCAAAAGGACGAAGATAAACCCATGAGAGTTTCTCCGTGGAATCATTACCAGTTGGTGGACCAAGAATCAGAATCTGACCTCCAGCTGGCTTCCACAAAGAACCACATTTCCCACGGCTGTGCCTCCTTTGTTTGCTTTGGTCGCACTTCCGCAGGGCTCGACAGTTCCCCTTCTCCTCTTAAAGTTGGCCCAACCTTACCACATGATTCTTTGCCTGGGCCTATAAGTACCGAAGAGAGGAAAGATGAATTTCCTAATGTGGAAGATGGTAATACTTTCAGAAATATAGCCCTTAAAAGTAGCTTGAAAAGGCCAAGTAATGGTATTTCAATTTCTCATCAGAATGCTCATGAAAGTGAAACAATAAGTAAAAAGGATGGTGATATACGTTGTCTTACTAATAGAAGGAAAGTTCAGTGGACTGATGCTTGTGGGAGTGAACTTGTAGAGATCAGGGAATTTGAGCCCAGGTATTGA

Protein sequence

MWTVLIIDHWWWTVEGGGFFSSSASGYSKGLTLLLLGQKDEDKPMRVSPWNHYQLVDQESESDLQLASTKNHISHGCASFVCFGRTSAGLDSSPSPLKVGPTLPHDSLPGPISTEERKDEFPNVEDGNTFRNIALKSSLKRPSNGISISHQNAHESETISKKDGDIRCLTNRRKVQWTDACGSELVEIREFEPRY
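The protein sequence can be checked against the coding sequence by translating it codon by codon. This is a minimal sketch assuming the standard genetic code (NCBI translation table 1) and an in-frame CDS; the `translate` helper is hypothetical, and real pipelines would normally use an established library instead.

```python
from itertools import product

# Standard genetic code in TCAG order (NCBI translation table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons and last five codons of the CDS above.
print(translate("ATGTGGACTGTTCTGATTATCGATCACTGG"))  # MWTVLIIDHW
print(translate("GAGCCCAGGTATTGA"))                 # EPRY
```

Both fragments agree with the start and end of the protein sequence listed above.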
Homology
BLAST of Bhi04G001740 vs. TAIR 10
Match: AT1G22790.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G34010.1); Has 67 Blast hits to 67 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 67; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 180.6 bits (457), Expect = 1.2e-45
Identity = 104/190 (54.74%), Positives = 129/190 (67.89%), Query Frame = 0

Query: 14  VEGGGFFSSSASGYSKGLTLLLLGQKDEDKPMRVSPWNHYQLVDQESESD--LQLASTKN 73
           VEGGG FS+SASGYSKGLTLL  G KD D+PMRV PWNHYQ+VDQE E+D  LQL S KN
Sbjct: 5   VEGGGLFSASASGYSKGLTLLFSGDKDVDRPMRVVPWNHYQVVDQEPEADPVLQLDSIKN 64

Query: 74  HISHGC-ASFVCFGRTSAGLDSSPSPLKVGPT-LPHDSLPGP----ISTEERKDEFPNVE 133
            +S GC ASF CFG  SAGL+ +PSPLKV P    H  +  P    + +E+ KD+    +
Sbjct: 65  RVSRGCAASFSCFGGASAGLE-TPSPLKVEPVQQQHREISSPESVVVVSEKGKDQISEAD 124

Query: 134 DGNTFR--NIALKSSLKRPSNGISISHQNAHESETISKKDGDIRCLTNRRKVQWTDACGS 193
           +G++     ++L+SSLKRPS   S S ++  E ET+S    D+     RRKVQW DACGS
Sbjct: 125 NGSSKEAFKLSLRSSLKRPSVAESRSLEDIKEYETLSVDGSDLTGDMARRKVQWPDACGS 184
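The percentages on each HSP statistics line are the matched-column count divided by the alignment length. The sketch below recomputes them from the raw fractions; `hsp_percentages` is a hypothetical helper, and the pattern simply picks up every `label = n/d` pair on the line rather than relying on any particular BLAST parser.

```python
import re

# Matches pairs such as "Identity = 104/190"; fields without a fraction are skipped.
FRACTION = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)")

def hsp_percentages(stats_line):
    """Map each 'label = n/d' pair on an HSP statistics line to a percentage."""
    return {
        label: round(100 * int(num) / int(den), 2)
        for label, num, den in FRACTION.findall(stats_line)
    }

line = "Identity = 104/190 (54.74%), Positives = 129/190 (67.89%), Query Frame = 0"
print(hsp_percentages(line))  # {'Identity': 54.74, 'Positives': 67.89}
```

The recomputed values match the percentages printed in parentheses for this HSP.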

BLAST of Bhi04G001740 vs. TAIR 10
Match: AT1G22790.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G34010.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 180.6 bits (457), Expect = 1.2e-45
Identity = 104/190 (54.74%), Positives = 129/190 (67.89%), Query Frame = 0

Query: 14  VEGGGFFSSSASGYSKGLTLLLLGQKDEDKPMRVSPWNHYQLVDQESESD--LQLASTKN 73
           VEGGG FS+SASGYSKGLTLL  G KD D+PMRV PWNHYQ+VDQE E+D  LQL S KN
Sbjct: 5   VEGGGLFSASASGYSKGLTLLFSGDKDVDRPMRVVPWNHYQVVDQEPEADPVLQLDSIKN 64

Query: 74  HISHGC-ASFVCFGRTSAGLDSSPSPLKVGPT-LPHDSLPGP----ISTEERKDEFPNVE 133
            +S GC ASF CFG  SAGL+ +PSPLKV P    H  +  P    + +E+ KD+    +
Sbjct: 65  RVSRGCAASFSCFGGASAGLE-TPSPLKVEPVQQQHREISSPESVVVVSEKGKDQISEAD 124

Query: 134 DGNTFR--NIALKSSLKRPSNGISISHQNAHESETISKKDGDIRCLTNRRKVQWTDACGS 193
           +G++     ++L+SSLKRPS   S S ++  E ET+S    D+     RRKVQW DACGS
Sbjct: 125 NGSSKEAFKLSLRSSLKRPSVAESRSLEDIKEYETLSVDGSDLTGDMARRKVQWPDACGS 184

BLAST of Bhi04G001740 vs. TAIR 10
Match: AT1G34010.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G22790.2); Has 74 Blast hits to 74 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 74; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 138.3 bits (347), Expect = 6.7e-33
Identity = 84/187 (44.92%), Positives = 105/187 (56.15%), Query Frame = 0

Query: 12  WTVEGGGFFSSSASGYSKGLTLLLLGQKDEDKPMRV-SPWNHYQLVDQESESDLQLASTK 71
           +  EGGGFFSSSASGYS GL LLLLGQK E KP++V S WNHY LV ++S++  +L S+K
Sbjct: 3   FAAEGGGFFSSSASGYSNGLALLLLGQKTEQKPIKVSSQWNHYHLVLEDSDTGFRLDSSK 62

Query: 72  NHISHGCASFVCFGRTSAGLDSSPSPLKVGPTLPHDSLPGPISTEERKDEFPNVEDGN-- 131
           N +S  C S +CFGR S  L+S                      E +KDE P+VED N  
Sbjct: 63  NWLSSACTSLICFGRKSERLES----------------------EGKKDEAPSVEDYNNC 122

Query: 132 -TFRNIALKSSLKRPS-NGISISHQNAHESETISKKDGDIRCLTNRRKVQWTDACGSELV 191
                 ALKSSLK+ S + + I   +      +   D        RRKVQW D CG E+ 
Sbjct: 123 EVTNRFALKSSLKKRSFSDVVIGDDDVSRDGVVDHID--------RRKVQWPDTCGIEIA 159

Query: 192 EIREFEP 194
           E+REFEP
Sbjct: 183 EVREFEP 159

BLAST of Bhi04G001740 vs. TAIR 10
Match: AT1G55475.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G13480.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 42.0 bits (97), Expect = 6.5e-04
Identity = 30/89 (33.71%), Positives = 44/89 (49.44%), Query Frame = 0

Query: 106 DSLPGPISTEERKDEFPNVEDGNTFRN-IALKSSLKRPSNGISISHQNAHESETISKKDG 165
           + + G  + E+R++     E+G    + + LKSSL++  +       N+ E+E   KK  
Sbjct: 31  EHVDGLTNAEDREEVDAKEEEGQIVGDTLTLKSSLRKVDS-------NSTEAEKREKK-- 90

Query: 166 DIRCLTNRRKVQWTDACGSELVEIREFEP 194
                    KVQW D  G EL EIREFEP
Sbjct: 91  ---------KVQWVDVIGKELAEIREFEP 101

BLAST of Bhi04G001740 vs. ExPASy TrEMBL
Match: A0A6J1DQI3 (uncharacterized protein LOC111022876 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111022876 PE=4 SV=1)

HSP 1 Score: 334.3 bits (856), Expect = 3.3e-88
Identity = 164/181 (90.61%), Positives = 170/181 (93.92%), Query Frame = 0

Query: 14  VEGGGFFSSSASGYSKGLTLLLLGQKDEDKPMRVSPWNHYQLVDQESESDLQLASTKNHI 73
           VEGGGFFSSSASGYSKGLTLLLLGQKDEDKPMRVSPWNHYQLVDQESESDLQLASTKN I
Sbjct: 5   VEGGGFFSSSASGYSKGLTLLLLGQKDEDKPMRVSPWNHYQLVDQESESDLQLASTKNRI 64

Query: 74  SHGCASFVCFGRTSAGLDSSPSPLKVGPTLPHDSLPGPISTEERKDEFPNVEDGNTFRNI 133
           S GCASFVCFGRTSAGLDSSPSPLKVGPTLPHDSLPGP+ST   KDE P V DGN+ RNI
Sbjct: 65  SRGCASFVCFGRTSAGLDSSPSPLKVGPTLPHDSLPGPLSTNTGKDEVPIVNDGNSLRNI 124

Query: 134 ALKSSLKRPSNGISISHQNAHESETISKKDGDIRCLTNRRKVQWTDACGSELVEIREFEP 193
           ALKSSLK+P+NGISISHQNAHESET+SKKDGDIRC T+RRKVQW DACGSELVEIREFEP
Sbjct: 125 ALKSSLKKPNNGISISHQNAHESETMSKKDGDIRCPTDRRKVQWNDACGSELVEIREFEP 184

Query: 194 R 195
           R
Sbjct: 185 R 185

BLAST of Bhi04G001740 vs. ExPASy TrEMBL
Match: A0A6J1DP18 (uncharacterized protein LOC111022876 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111022876 PE=4 SV=1)

HSP 1 Score: 332.4 bits (851), Expect = 1.3e-87
Identity = 163/180 (90.56%), Positives = 169/180 (93.89%), Query Frame = 0

Query: 14  VEGGGFFSSSASGYSKGLTLLLLGQKDEDKPMRVSPWNHYQLVDQESESDLQLASTKNHI 73
           VEGGGFFSSSASGYSKGLTLLLLGQKDEDKPMRVSPWNHYQLVDQESESDLQLASTKN I
Sbjct: 5   VEGGGFFSSSASGYSKGLTLLLLGQKDEDKPMRVSPWNHYQLVDQESESDLQLASTKNRI 64

Query: 74  SHGCASFVCFGRTSAGLDSSPSPLKVGPTLPHDSLPGPISTEERKDEFPNVEDGNTFRNI 133
           S GCASFVCFGRTSAGLDSSPSPLKVGPTLPHDSLPGP+ST   KDE P V DGN+ RNI
Sbjct: 65  SRGCASFVCFGRTSAGLDSSPSPLKVGPTLPHDSLPGPLSTNTGKDEVPIVNDGNSLRNI 124

Query: 134 ALKSSLKRPSNGISISHQNAHESETISKKDGDIRCLTNRRKVQWTDACGSELVEIREFEP 193
           ALKSSLK+P+NGISISHQNAHESET+SKKDGDIRC T+RRKVQW DACGSELVEIREFEP
Sbjct: 125 ALKSSLKKPNNGISISHQNAHESETMSKKDGDIRCPTDRRKVQWNDACGSELVEIREFEP 184

BLAST of Bhi04G001740 vs. ExPASy TrEMBL
Match: A0A6J1EPJ6 (uncharacterized protein LOC111436656 OS=Cucurbita moschata OX=3662 GN=LOC111436656 PE=4 SV=1)

HSP 1 Score: 324.3 bits (830), Expect = 3.4e-85
Identity = 161/180 (89.44%), Positives = 168/180 (93.33%), Query Frame = 0

Query: 14  VEGGGFFSSSASGYSKGLTLLLLGQKDEDKPMRVSPWNHYQLVDQESESDLQLASTKNHI 73
           VEGGGFFS+SASGYSKGLTLLLLGQKDEDKPMRVSPWNHYQLVDQESESDLQLASTKN I
Sbjct: 5   VEGGGFFSASASGYSKGLTLLLLGQKDEDKPMRVSPWNHYQLVDQESESDLQLASTKNRI 64

Query: 74  SHGCASFVCFGRTSAGLDSSPSPLKVGPTLPHDSLPGPISTEERKDEFPNVEDGNTFRNI 133
           S GCASFVCFGRTSAGLDSSPSPLKVGPTLPHDSLPG  S ++ KDE PNVEDG+T RNI
Sbjct: 65  SRGCASFVCFGRTSAGLDSSPSPLKVGPTLPHDSLPGSKSIDKVKDETPNVEDGSTLRNI 124

Query: 134 ALKSSLKRPSNGISISHQNAHESETISKKDGDIRCLTNRRKVQWTDACGSELVEIREFEP 193
           ALKSSLKRPSN +SISHQNA  SE +SKKDGDIRCLT+RRKVQWTDACGSELVEIREFEP
Sbjct: 125 ALKSSLKRPSNVVSISHQNADVSEPMSKKDGDIRCLTDRRKVQWTDACGSELVEIREFEP 184

BLAST of Bhi04G001740 vs. ExPASy TrEMBL
Match: A0A6J1I989 (uncharacterized protein LOC111472690 OS=Cucurbita maxima OX=3661 GN=LOC111472690 PE=4 SV=1)

HSP 1 Score: 322.0 bits (824), Expect = 1.7e-84
Identity = 160/180 (88.89%), Positives = 167/180 (92.78%), Query Frame = 0

Query: 14  VEGGGFFSSSASGYSKGLTLLLLGQKDEDKPMRVSPWNHYQLVDQESESDLQLASTKNHI 73
           VEGGGFFS+SASGYSKGLTLLLLGQKDEDKPMRVSPWNHYQLVDQESESDLQLASTKN I
Sbjct: 5   VEGGGFFSASASGYSKGLTLLLLGQKDEDKPMRVSPWNHYQLVDQESESDLQLASTKNRI 64

Query: 74  SHGCASFVCFGRTSAGLDSSPSPLKVGPTLPHDSLPGPISTEERKDEFPNVEDGNTFRNI 133
           S GCASFVCFGRTSAGLDSSPSPLKVGPTLPHDSLPG  S ++ KDE PNVEDG+  RNI
Sbjct: 65  SRGCASFVCFGRTSAGLDSSPSPLKVGPTLPHDSLPGSKSIDKVKDETPNVEDGSILRNI 124

Query: 134 ALKSSLKRPSNGISISHQNAHESETISKKDGDIRCLTNRRKVQWTDACGSELVEIREFEP 193
           ALKSSLKRPSN +SISHQNA  SE +SKKDGDIRCLT+RRKVQWTDACGSELVEIREFEP
Sbjct: 125 ALKSSLKRPSNVVSISHQNADVSEPMSKKDGDIRCLTDRRKVQWTDACGSELVEIREFEP 184

BLAST of Bhi04G001740 vs. ExPASy TrEMBL
Match: A0A061FHK2 (Low affinity potassium transport system protein kup isoform 1 OS=Theobroma cacao OX=3641 GN=TCM_035703 PE=4 SV=1)

HSP 1 Score: 252.3 bits (643), Expect = 1.7e-63
Identity = 126/182 (69.23%), Positives = 146/182 (80.22%), Query Frame = 0

Query: 13  TVEGGGFFSSSASGYSKGLTLLLLGQKDEDKPMRVSPWNHYQLVDQESESDLQLASTKNH 72
           TVEGGGFFSSSASGYSKGLTLLLLGQK ED+PMRVSPWNHYQLVDQE + DLQLAS KN 
Sbjct: 4   TVEGGGFFSSSASGYSKGLTLLLLGQKHEDRPMRVSPWNHYQLVDQEPDPDLQLASIKNR 63

Query: 73  ISHGCASFVCFGRTSAGLDSSPSPLKVGPTLPHDSLPGPISTEERKDEFPNVEDGNT-FR 132
           +S GCASFVCFGRTSAGLD +PSPLKVGP    D LPGP+ +++  D   ++EDGN+  R
Sbjct: 64  LSRGCASFVCFGRTSAGLD-TPSPLKVGPVQQQDVLPGPLDSDKSNDHTSHLEDGNSNAR 123

Query: 133 NIALKSSLKRPSNGISISHQNAHESETISKKDGDIRCLTNRRKVQWTDACGSELVEIREF 192
            +ALKSSLK+PSN   +  ++ ++ E   +KDGDI   T RRKVQWTDACGSEL EI+EF
Sbjct: 124 KVALKSSLKKPSNSTPVPLEDVNDHEASGEKDGDIPSHTERRKVQWTDACGSELAEIKEF 183

Query: 193 EP 194
           EP
Sbjct: 184 EP 184

The following BLAST results are available for this feature:
Match Name   E-value  Identity (%)  Description
AT1G22790.1  1.2e-45  54.74         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G22790.2  1.2e-45  54.74         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G34010.1  6.7e-33  44.92         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G55475.1  6.5e-04  33.71         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
Match Name  E-value  Identity (%)  Description
A0A6J1DQI3  3.3e-88  90.61         uncharacterized protein LOC111022876 isoform X1 OS=Momordica charantia OX=3673 G...
A0A6J1DP18  1.3e-87  90.56         uncharacterized protein LOC111022876 isoform X2 OS=Momordica charantia OX=3673 G...
A0A6J1EPJ6  3.4e-85  89.44         uncharacterized protein LOC111436656 OS=Cucurbita moschata OX=3662 GN=LOC1114366...
A0A6J1I989  1.7e-84  88.89         uncharacterized protein LOC111472690 OS=Cucurbita maxima OX=3661 GN=LOC111472690...
A0A061FHK2  1.7e-63  69.23         Low affinity potassium transport system protein kup isoform 1 OS=Theobroma cacao...
InterPro
Analysis Name: InterPro Annotations of Wax gourd (B227) v1
Date Performed: 2021-10-22
IPR Term  IPR Description   Source       Source Term    Source Description                                         Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                                        coord: 111..125
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                                        coord: 93..126
None      No IPR available  PANTHER      PTHR33401:SF3  LOW AFFINITY POTASSIUM TRANSPORT SYSTEM PROTEIN            coord: 13..193
None      No IPR available  PANTHER      PTHR33401      LIGHT-HARVESTING COMPLEX-LIKE PROTEIN OHP2, CHLOROPLASTIC  coord: 13..193
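The Alignment column gives residue ranges on the protein. As a minimal sketch, assuming those ranges are 1-based and inclusive (not stated on the page), a range can be parsed and used to slice a protein string as shown here. `parse_coord` and `region` are hypothetical helpers, and the example slices only the first 20 residues of the protein with a made-up range `13..18` to keep the check short.

```python
def parse_coord(text):
    """Turn 'coord: 13..193' into the integer pair (13, 193)."""
    start, end = text.replace("coord:", "").strip().split("..")
    return int(start), int(end)

def region(sequence, start, end):
    """Slice a sequence using 1-based inclusive coordinates (assumed convention)."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]

start, end = parse_coord("coord: 13..193")
print(end - start + 1)  # 181 residues covered by the PANTHER matches

n_terminus = "MWTVLIIDHWWWTVEGGGFF"  # first 20 residues of the protein above
print(region(n_terminus, 13, 18))    # TVEGGG
```

The same `region` call with the full protein sequence and a range from the table would return the corresponding annotated segment.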

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Bhi04M001740  Bhi04M001740  mRNA