CsGy7G019167.1 (mRNA) Cucumber (Gy14) v2.1

Overview
Name: CsGy7G019167.1
Type: mRNA
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: HCO3-transporter family
Location: Gy14Chr7: 21848492 .. 21849043 (-)
Sequence length: 336
RNA-Seq Expression: CsGy7G019167.1
Synteny: CsGy7G019167.1
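
The location and sequence length in the Overview imply a few derived figures. The following is a minimal Python sketch of that arithmetic, not part of the database record. It assumes the coordinates are 1-based and inclusive, that the 336-nt sequence length refers to the spliced mRNA, and that the mRNA and CDS listed for this record are the same sequence (they appear identical on this page). All variable names are illustrative.

```python
# Values copied from the Overview; everything derived below is illustrative.
start, end = 21848492, 21849043   # Gy14Chr7 coordinates, minus strand
mrna_length = 336                 # "Sequence length" of the spliced mRNA

# Assumption: coordinates are 1-based and inclusive.
genomic_span = end - start + 1

# Assumption: the genomic span contains only exonic plus intronic sequence.
intronic_length = genomic_span - mrna_length

# Assumption: mRNA == CDS here, so the codon count is length / 3
# and the final codon is the stop codon.
codons, remainder = divmod(mrna_length, 3)
residues = codons - 1

print(genomic_span, intronic_length, codons, residues)
```

Under these assumptions the residue count agrees with the 111-residue protein reported in the BLAST alignments for this feature.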
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAATGTGAAATGGAAGAAGGCAGGAACCAATAAGGTCAGCCGATTTCACGTGGTTCTAAACCATGTAATTACAACCCACCTCCCTCTTTGTATAAATACCGTCCTCTCACCCTGACTTTCTTCAGTTTATCCTTCTCTTTCAAAATCCCTACATTTCTGCTTCTCTCACATGGCCACCAATTTTGCCGTTTTTCATCCCGCTCCGATCACAGCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCAGGAAAGTACCAACCTAACCGCCCGAACGCCGCGGCGCCGAAATGGTGGTCGCCGATCTTCGGATGGTCTTCCGAACCGGACTACGTCGTATCTTCCGCCGCGGAAGTACAATCCATTCAGAATGCCGATCCGGAGCTCCACGGTGGAAGAATCAGATCCAGATTTGCACCTGGATGCTTCACGGAGGAGAAGGCGAAGCTACTCCGAATGAAAACGCGGGAGAGCTCCACTTTCCATGATATTATGTACCATTCGGCGATCGCATCGCGTCTGGCGTCCGACTTGTCGGACCGAAGCAAGAAATAG

mRNA sequence

ATGAATGTGAAATGGAAGAAGGCAGGAACCAATAAGTACCAACCTAACCGCCCGAACGCCGCGGCGCCGAAATGGTGGTCGCCGATCTTCGGATGGTCTTCCGAACCGGACTACGTCGTATCTTCCGCCGCGGAAGTACAATCCATTCAGAATGCCGATCCGGAGCTCCACGGTGGAAGAATCAGATCCAGATTTGCACCTGGATGCTTCACGGAGGAGAAGGCGAAGCTACTCCGAATGAAAACGCGGGAGAGCTCCACTTTCCATGATATTATGTACCATTCGGCGATCGCATCGCGTCTGGCGTCCGACTTGTCGGACCGAAGCAAGAAATAG

Coding sequence (CDS)

ATGAATGTGAAATGGAAGAAGGCAGGAACCAATAAGTACCAACCTAACCGCCCGAACGCCGCGGCGCCGAAATGGTGGTCGCCGATCTTCGGATGGTCTTCCGAACCGGACTACGTCGTATCTTCCGCCGCGGAAGTACAATCCATTCAGAATGCCGATCCGGAGCTCCACGGTGGAAGAATCAGATCCAGATTTGCACCTGGATGCTTCACGGAGGAGAAGGCGAAGCTACTCCGAATGAAAACGCGGGAGAGCTCCACTTTCCATGATATTATGTACCATTCGGCGATCGCATCGCGTCTGGCGTCCGACTTGTCGGACCGAAGCAAGAAATAG

Protein sequence

MNVKWKKAGTNKYQPNRPNAAAPKWWSPIFGWSSEPDYVVSSAAEVQSIQNADPELHGGRIRSRFAPGCFTEEKAKLLRMKTRESSTFHDIMYHSAIASRLASDLSDRSKK*
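
The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below is a minimal, self-contained illustration that assumes the standard genetic code (NCBI translation table 1). The function name `translate` and the variable names are hypothetical, and only the first 36 nt and last 12 nt of the CDS are used here rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence codon by codon; '*' marks a stop."""
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 36 nt and last 12 nt of the CDS listed above (both in frame).
cds_start = "ATGAATGTGAAATGGAAGAAGGCAGGAACCAATAAG"
cds_end = "AGCAAGAAATAG"

print(translate(cds_start))  # MNVKWKKAGTNK
print(translate(cds_end))    # SKK*
```

Passing the full 336-nt CDS to the same function should reproduce the complete protein sequence, including the trailing `*` for the stop codon.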
Homology
BLAST of CsGy7G019167.1 vs. NCBI nr
Match: KAE8646454.1 (hypothetical protein Csa_016484 [Cucumis sativus])

HSP 1 Score: 231 bits (589), Expect = 1.47e-76
Identity = 111/111 (100.00%), Positives = 111/111 (100.00%), Query Frame = 0

Query: 1   MNVKWKKAGTNKYQPNRPNAAAPKWWSPIFGWSSEPDYVVSSAAEVQSIQNADPELHGGR 60
           MNVKWKKAGTNKYQPNRPNAAAPKWWSPIFGWSSEPDYVVSSAAEVQSIQNADPELHGGR
Sbjct: 1   MNVKWKKAGTNKYQPNRPNAAAPKWWSPIFGWSSEPDYVVSSAAEVQSIQNADPELHGGR 60

Query: 61  IRSRFAPGCFTEEKAKLLRMKTRESSTFHDIMYHSAIASRLASDLSDRSKK 111
           IRSRFAPGCFTEEKAKLLRMKTRESSTFHDIMYHSAIASRLASDLSDRSKK
Sbjct: 61  IRSRFAPGCFTEEKAKLLRMKTRESSTFHDIMYHSAIASRLASDLSDRSKK 111

BLAST of CsGy7G019167.1 vs. NCBI nr
Match: XP_008461069.1 (PREDICTED: uncharacterized protein LOC103499770 [Cucumis melo] >KAA0058783.1 uncharacterized protein E6C27_scaffold339G002570 [Cucumis melo var. makuwa] >TYK10577.1 uncharacterized protein E5676_scaffold459G002310 [Cucumis melo var. makuwa])

HSP 1 Score: 190 bits (483), Expect = 3.10e-60
Identity = 93/104 (89.42%), Positives = 95/104 (91.35%), Query Frame = 0

Query: 8   AGTNKYQPNRPNAAAPKWWSPIFGWSSEPDYVVSSAAEVQSIQNADPELHGGRIRSRFAP 67
           A + K QPNR NAA PKWWSPIFGWSSEPDYVVSSAAEVQS+QNADPEL GGR  SRFAP
Sbjct: 19  ASSGKSQPNRRNAAGPKWWSPIFGWSSEPDYVVSSAAEVQSVQNADPELDGGRTGSRFAP 78

Query: 68  GCFTEEKAKLLRMKTRESSTFHDIMYHSAIASRLASDLSDRSKK 111
           GCFTEEKAKLLRMKT ESSTFHDIMYHSAIASRLASDLSDRSKK
Sbjct: 79  GCFTEEKAKLLRMKTLESSTFHDIMYHSAIASRLASDLSDRSKK 122
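
The percentages in each HSP header are the match counts divided by the aligned length. A small Python sketch of that calculation, using the counts from the Cucumis melo HSP above; the helper name `percent` is hypothetical and is not part of BLAST:

```python
def percent(matches: int, aligned_length: int) -> str:
    """Return matches / aligned_length as a percentage with two decimals."""
    return f"{100 * matches / aligned_length:.2f}"

# Counts copied from the Cucumis melo HSP header above.
identities, positives, aligned_length = 93, 95, 104

print(percent(identities, aligned_length))  # 89.42
print(percent(positives, aligned_length))   # 91.35
```

The second figure counts identical residues plus conservative substitutions, the latter marked with `+` on the middle line of the alignment.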

BLAST of CsGy7G019167.1 vs. NCBI nr
Match: XP_038898524.1 (uncharacterized protein LOC120086132 [Benincasa hispida])

HSP 1 Score: 177 bits (449), Expect = 4.15e-55
Identity = 87/104 (83.65%), Positives = 93/104 (89.42%), Query Frame = 0

Query: 8   AGTNKYQPNRPNAAAPKWWSPIFGWSSEPDYVVSSAAEVQSIQNADPELHGGRIRSRFAP 67
           + + K +PNR  A APKWWSPIFGWSSEPDYV+SSAAE QS+ NA+PEL GGR RSRFAP
Sbjct: 16  SSSGKSEPNRRKATAPKWWSPIFGWSSEPDYVLSSAAE-QSVLNAEPELEGGRTRSRFAP 75

Query: 68  GCFTEEKAKLLRMKTRESSTFHDIMYHSAIASRLASDLSDRSKK 111
           GCFTEEKAKLLRMKT ESSTFHDIMYHSAIASRLASDLSDRSKK
Sbjct: 76  GCFTEEKAKLLRMKTLESSTFHDIMYHSAIASRLASDLSDRSKK 118

BLAST of CsGy7G019167.1 vs. NCBI nr
Match: XP_023004861.1 (uncharacterized protein LOC111498038 [Cucurbita maxima])

HSP 1 Score: 172 bits (435), Expect = 5.65e-53
Identity = 83/104 (79.81%), Positives = 92/104 (88.46%), Query Frame = 0

Query: 8   AGTNKYQPNRPNAAAPKWWSPIFGWSSEPDYVVSSAAEVQSIQNADPELHGGRIRSRFAP 67
           + + K +PNR  AAAPKWWSPIFGWSSEPDY+ SSAA+ Q +QNADPEL GGR +S+F P
Sbjct: 16  SSSGKSEPNRRKAAAPKWWSPIFGWSSEPDYIGSSAAD-QPVQNADPELDGGRAKSKFLP 75

Query: 68  GCFTEEKAKLLRMKTRESSTFHDIMYHSAIASRLASDLSDRSKK 111
           GCFTEEKAKLLRMKT ESSTFHDIMYHSAIASRLASDLSDRSK+
Sbjct: 76  GCFTEEKAKLLRMKTLESSTFHDIMYHSAIASRLASDLSDRSKE 118

BLAST of CsGy7G019167.1 vs. NCBI nr
Match: XP_022960101.1 (uncharacterized protein LOC111460952 [Cucurbita moschata] >KAG6593414.1 hypothetical protein SDJN03_12890, partial [Cucurbita argyrosperma subsp. sororia] >KAG7025762.1 hypothetical protein SDJN02_12260, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 171 bits (433), Expect = 1.14e-52
Identity = 83/104 (79.81%), Positives = 92/104 (88.46%), Query Frame = 0

Query: 8   AGTNKYQPNRPNAAAPKWWSPIFGWSSEPDYVVSSAAEVQSIQNADPELHGGRIRSRFAP 67
           + + K +PNR  AAAPKWWSPIFGWSSEPDY+ SSAA+ Q +QNADPEL GGR +S+F P
Sbjct: 16  SSSGKSEPNRRKAAAPKWWSPIFGWSSEPDYIGSSAAD-QPVQNADPELDGGRPKSKFLP 75

Query: 68  GCFTEEKAKLLRMKTRESSTFHDIMYHSAIASRLASDLSDRSKK 111
           GCFTEEKAKLLRMKT ESSTFHDIMYHSAIASRLASDLSDRSK+
Sbjct: 76  GCFTEEKAKLLRMKTLESSTFHDIMYHSAIASRLASDLSDRSKE 118

BLAST of CsGy7G019167.1 vs. ExPASy TrEMBL
Match: A0A0A0K8X0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G433380 PE=4 SV=1)

HSP 1 Score: 209 bits (531), Expect = 8.19e-68
Identity = 100/104 (96.15%), Positives = 102/104 (98.08%), Query Frame = 0

Query: 8   AGTNKYQPNRPNAAAPKWWSPIFGWSSEPDYVVSSAAEVQSIQNADPELHGGRIRSRFAP 67
           + + KYQPNRPNAAAPKWWSPIFGWSSEPDYVVSSAAEVQSIQNADPELHGGRIRSRFAP
Sbjct: 23  SSSGKYQPNRPNAAAPKWWSPIFGWSSEPDYVVSSAAEVQSIQNADPELHGGRIRSRFAP 82

Query: 68  GCFTEEKAKLLRMKTRESSTFHDIMYHSAIASRLASDLSDRSKK 111
           GCFTEEKAKLLRMKTRESSTFHDIMYHSAIASRLASDLSDRSKK
Sbjct: 83  GCFTEEKAKLLRMKTRESSTFHDIMYHSAIASRLASDLSDRSKK 126

BLAST of CsGy7G019167.1 vs. ExPASy TrEMBL
Match: A0A5D3CF29 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold459G002310 PE=4 SV=1)

HSP 1 Score: 190 bits (483), Expect = 1.50e-60
Identity = 93/104 (89.42%), Positives = 95/104 (91.35%), Query Frame = 0

Query: 8   AGTNKYQPNRPNAAAPKWWSPIFGWSSEPDYVVSSAAEVQSIQNADPELHGGRIRSRFAP 67
           A + K QPNR NAA PKWWSPIFGWSSEPDYVVSSAAEVQS+QNADPEL GGR  SRFAP
Sbjct: 19  ASSGKSQPNRRNAAGPKWWSPIFGWSSEPDYVVSSAAEVQSVQNADPELDGGRTGSRFAP 78

Query: 68  GCFTEEKAKLLRMKTRESSTFHDIMYHSAIASRLASDLSDRSKK 111
           GCFTEEKAKLLRMKT ESSTFHDIMYHSAIASRLASDLSDRSKK
Sbjct: 79  GCFTEEKAKLLRMKTLESSTFHDIMYHSAIASRLASDLSDRSKK 122

BLAST of CsGy7G019167.1 vs. ExPASy TrEMBL
Match: A0A1S3CEC0 (uncharacterized protein LOC103499770 OS=Cucumis melo OX=3656 GN=LOC103499770 PE=4 SV=1)

HSP 1 Score: 190 bits (483), Expect = 1.50e-60
Identity = 93/104 (89.42%), Positives = 95/104 (91.35%), Query Frame = 0

Query: 8   AGTNKYQPNRPNAAAPKWWSPIFGWSSEPDYVVSSAAEVQSIQNADPELHGGRIRSRFAP 67
           A + K QPNR NAA PKWWSPIFGWSSEPDYVVSSAAEVQS+QNADPEL GGR  SRFAP
Sbjct: 19  ASSGKSQPNRRNAAGPKWWSPIFGWSSEPDYVVSSAAEVQSVQNADPELDGGRTGSRFAP 78

Query: 68  GCFTEEKAKLLRMKTRESSTFHDIMYHSAIASRLASDLSDRSKK 111
           GCFTEEKAKLLRMKT ESSTFHDIMYHSAIASRLASDLSDRSKK
Sbjct: 79  GCFTEEKAKLLRMKTLESSTFHDIMYHSAIASRLASDLSDRSKK 122

BLAST of CsGy7G019167.1 vs. ExPASy TrEMBL
Match: A0A6J1KTB2 (uncharacterized protein LOC111498038 OS=Cucurbita maxima OX=3661 GN=LOC111498038 PE=4 SV=1)

HSP 1 Score: 172 bits (435), Expect = 2.74e-53
Identity = 83/104 (79.81%), Positives = 92/104 (88.46%), Query Frame = 0

Query: 8   AGTNKYQPNRPNAAAPKWWSPIFGWSSEPDYVVSSAAEVQSIQNADPELHGGRIRSRFAP 67
           + + K +PNR  AAAPKWWSPIFGWSSEPDY+ SSAA+ Q +QNADPEL GGR +S+F P
Sbjct: 16  SSSGKSEPNRRKAAAPKWWSPIFGWSSEPDYIGSSAAD-QPVQNADPELDGGRAKSKFLP 75

Query: 68  GCFTEEKAKLLRMKTRESSTFHDIMYHSAIASRLASDLSDRSKK 111
           GCFTEEKAKLLRMKT ESSTFHDIMYHSAIASRLASDLSDRSK+
Sbjct: 76  GCFTEEKAKLLRMKTLESSTFHDIMYHSAIASRLASDLSDRSKE 118

BLAST of CsGy7G019167.1 vs. ExPASy TrEMBL
Match: A0A6J1H6P9 (uncharacterized protein LOC111460952 OS=Cucurbita moschata OX=3662 GN=LOC111460952 PE=4 SV=1)

HSP 1 Score: 171 bits (433), Expect = 5.52e-53
Identity = 83/104 (79.81%), Positives = 92/104 (88.46%), Query Frame = 0

Query: 8   AGTNKYQPNRPNAAAPKWWSPIFGWSSEPDYVVSSAAEVQSIQNADPELHGGRIRSRFAP 67
           + + K +PNR  AAAPKWWSPIFGWSSEPDY+ SSAA+ Q +QNADPEL GGR +S+F P
Sbjct: 16  SSSGKSEPNRRKAAAPKWWSPIFGWSSEPDYIGSSAAD-QPVQNADPELDGGRPKSKFLP 75

Query: 68  GCFTEEKAKLLRMKTRESSTFHDIMYHSAIASRLASDLSDRSKK 111
           GCFTEEKAKLLRMKT ESSTFHDIMYHSAIASRLASDLSDRSK+
Sbjct: 76  GCFTEEKAKLLRMKTLESSTFHDIMYHSAIASRLASDLSDRSKE 118

BLAST of CsGy7G019167.1 vs. TAIR 10
Match: AT1G52720.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G15630.1); Has 61 Blast hits to 61 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 61; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 96.3 bits (238), Expect = 1.7e-20
Identity = 48/102 (47.06%), Positives = 65/102 (63.73%), Query Frame = 0

Query: 8   AGTNKYQPNRPNAAAPKWWSPIFGWSSEPDYV-VSSAAEVQSIQNADPELHGGRIRSRFA 67
           +G+    P++    +  WW+P+FG  S+PDY+ + S+    +    D    G     +F 
Sbjct: 17  SGSGSLNPDQNRKKSAAWWAPLFGLPSDPDYLNIESSCSTVNPDKTDISGSG----QKFR 76

Query: 68  PGCFTEEKAKLLRMKTRESSTFHDIMYHSAIASRLASDLSDR 109
            GCFTEEKAK LR KT E+STFHD+MYHSAIASRLASD++ R
Sbjct: 77  RGCFTEEKAKQLRRKTAEASTFHDVMYHSAIASRLASDITGR 114

BLAST of CsGy7G019167.1 vs. TAIR 10
Match: AT3G15630.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G52720.1); Has 61 Blast hits to 61 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 61; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 86.3 bits (212), Expect = 1.7e-17
Identity = 47/99 (47.47%), Positives = 60/99 (60.61%), Query Frame = 0

Query: 7   KAGTNKYQPNRPNAAAPKWWSPIFGWSSEPDYVVSSAAEVQSIQNADPELHGGRIRSRFA 66
           +A +      +   ++  WW+P+FG SSEPDYV  +        N + +L     RS   
Sbjct: 14  RASSESDPARKKPVSSVSWWAPLFGMSSEPDYVNKTV-------NLESDLDKAEKRSLRC 73

Query: 67  PGCFTEEKAKLLRMKTRESSTFHDIMYHSAIASRLASDL 106
             C TEEKAK LR KT E+STFHD+MYHSAIASRLASD+
Sbjct: 74  --CLTEEKAKQLRRKTAEASTFHDVMYHSAIASRLASDV 103

The following BLAST results are available for this feature:

BLAST vs. NCBI nr
Match Name      E-value    Identity (%)  Description
KAE8646454.1    1.47e-76   100.00        hypothetical protein Csa_016484 [Cucumis sativus]
XP_008461069.1  3.10e-60   89.42         PREDICTED: uncharacterized protein LOC103499770 [Cucumis melo] >KAA0058783.1 unc...
XP_038898524.1  4.15e-55   83.65         uncharacterized protein LOC120086132 [Benincasa hispida]
XP_023004861.1  5.65e-53   79.81         uncharacterized protein LOC111498038 [Cucurbita maxima]
XP_022960101.1  1.14e-52   79.81         uncharacterized protein LOC111460952 [Cucurbita moschata] >KAG6593414.1 hypothet...

BLAST vs. ExPASy TrEMBL
Match Name      E-value    Identity (%)  Description
A0A0A0K8X0      8.19e-68   96.15         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G433380 PE=4 SV=1
A0A5D3CF29      1.50e-60   89.42         Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3CEC0      1.50e-60   89.42         uncharacterized protein LOC103499770 OS=Cucumis melo OX=3656 GN=LOC103499770 PE=...
A0A6J1KTB2      2.74e-53   79.81         uncharacterized protein LOC111498038 OS=Cucurbita maxima OX=3661 GN=LOC111498038...
A0A6J1H6P9      5.52e-53   79.81         uncharacterized protein LOC111460952 OS=Cucurbita moschata OX=3662 GN=LOC1114609...

BLAST vs. TAIR 10
Match Name      E-value    Identity (%)  Description
AT1G52720.1     1.7e-20    47.06         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT3G15630.1     1.7e-17    47.47         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...

InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source   Source Term     Source Description     Alignment
None      No IPR available  PANTHER  PTHR34198       OS01G0175100 PROTEIN   coord: 8..111
None      No IPR available  PANTHER  PTHR34198:SF12  BNAA05G35680D PROTEIN  coord: 8..111

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name  Unique Name   Type
CsGy7G019167  CsGy7G019167  gene


The following exon feature(s) are a part of this mRNA:

Feature Name          Unique Name           Type
CsGy7G019167.1.exon2  CsGy7G019167.1.exon2  exon
CsGy7G019167.1.exon1  CsGy7G019167.1.exon1  exon


The following CDS feature(s) are a part of this mRNA:

Feature Name        Unique Name           Type
cds.CsGy7G019167.1  cds.CsGy7G019167.1_2  CDS
cds.CsGy7G019167.1  cds.CsGy7G019167.1    CDS


The following polypeptide feature(s) derive from this mRNA:

Feature Name    Unique Name             Type
CsGy7G019167.1  CsGy7G019167.1-protein  polypeptide