CsaV3_1G031100 (gene) Cucumber (Chinese Long) v3

NameCsaV3_1G031100
Typegene
OrganismCucumis sativus (Cucumber (Chinese Long) v3)
DescriptionArmadillo repeat-containing protein 7
Locationchr1 : 18285091 .. 18286445 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCCAGGTATTAAATCTCTATTTCTGATATTACTGTGATTTGTTTCAGAGCTCTCCATGGATGTTTGCCTACTTATCCCCTTCTCTAGAACAATTTTACTGAATTGTAATTTTGTGAGCAGCTAAATGTTCTGGAGCTGTTTTTGGACTGTTTCATAATTTTAAATTATTTGATTTAGATGTGAGTACTGTTTGTTCTACCGATTGGATGACTACCTGACCACAGATACATCTACTGTTACTTTTTTCATGCAGACCCTGCTAATGCTTCTCTTATAACTCAGTGTGGTGGAATTCCCCTCATCATTGAATGCTTGTCAAGTCCTGTAAACCTTGTGTGTGTTGCTTCTTGTCAGGTTTTCAGTCTTTGGTTCATAAAAGGCATTGCCCAAAAGGTTTTTTGGGAAAAATAATTAGGGCCCTACTGTATGAAAATTAACTTTTGGTAACTTCTTTAACTAAAATTTTAAACATGATTCAGCATTCTATTTCTTTTCGTTCGGTAAGAGACAAGAGTACTAATAGGAATATCTTTAGAACAGTAATAAGGATCTTTGTAAGGGTATAGTGGTAGTTAGCTGAGAGTCTTGGTTACAAATATAGATAGTTAGATATCTTTAAGGGAAAGCCATATTTTGATGGGTCAGTCCATAGTGGGCGGATTGTGGAAGGAGATAGCCCATTCAAAAGGCTGCTAGATATTGTAGGTTGCTTTTTATATTGCATTGTATTAGCATCTTTGTCTTTTCTGTGCTTGGATACCGAACACATTCTTTTATCAAATATCGAATTTCAATGTATTTTTTCTCTTGGATATGCATGATATTTTTATCATTTGCTAATATATGGAAACCTTGACTATTGAAAATTGATTTGTTGATCTAGTCCAATGTATCTCCACCTCATTTTTTCTAAAGGATCTACTAATATCTCGACTAAATCATCTGATAAATGGATCACTTTTACTATTCTGTCAAATTGTATCACCTGTGGTTGAATCAGTGATTGATATTCCCTTTGGTTTTGATCAAAATAAAGTGTCGAACTTATTTCAATTTGAGGTGAATCATGCAGTTAGCTTCCTTTTTGTATTTTATGGAGGTACATTAGTGACATAGAATGTGTAGAATTTAGCCACATTTGAAAATCCATTCATTCGTGGCAAACTTAACAATTCTTATCAGGTGAATTATGCACTTGGCGCCATATATTACCTCTGCAATACATCAAACAAAGAGGAGATTATGAAACCAGAAGTTGTAGATGTCATCAACAAATACGCAGTGGCTGAGAGTGTGAGCTTTAGTAATCTATCTAAAGCAATTCTGGACAAGCACCTATCTAACAGAAACTGA

mRNA sequence

ATGCCAGACCCTGCTAATGCTTCTCTTATAACTCAGTGTGGTGGAATTCCCCTCATCATTGAATGCTTGTCAAGTCCTGTAAACCTTGTGTGTGTTGCTTCTTGTCAGGTGAATTATGCACTTGGCGCCATATATTACCTCTGCAATACATCAAACAAAGAGGAGATTATGAAACCAGAAGTTGTAGATGTCATCAACAAATACGCAGTGGCTGAGAGTGTGAGCTTTAGTAATCTATCTAAAGCAATTCTGGACAAGCACCTATCTAACAGAAACTGA

Coding sequence (CDS)

ATGCCAGACCCTGCTAATGCTTCTCTTATAACTCAGTGTGGTGGAATTCCCCTCATCATTGAATGCTTGTCAAGTCCTGTAAACCTTGTGTGTGTTGCTTCTTGTCAGGTGAATTATGCACTTGGCGCCATATATTACCTCTGCAATACATCAAACAAAGAGGAGATTATGAAACCAGAAGTTGTAGATGTCATCAACAAATACGCAGTGGCTGAGAGTGTGAGCTTTAGTAATCTATCTAAAGCAATTCTGGACAAGCACCTATCTAACAGAAACTGA

Protein sequence

MPDPANASLITQCGGIPLIIECLSSPVNLVCVASCQVNYALGAIYYLCNTSNKEEIMKPEVVDVINKYAVAESVSFSNLSKAILDKHLSNRN
BLAST of CsaV3_1G031100 vs. NCBI nr
Match: XP_011648493.1 (PREDICTED: armadillo repeat-containing protein 7-like [Cucumis sativus])

HSP 1 Score: 176.0 bits (445), Expect = 5.9e-41
Identity = 87/89 (97.75%), Postives = 89/89 (100.00%), Query Frame = 0

Query: 4   PANASLITQCGGIPLIIECLSSPVNLVCVASCQVNYALGAIYYLCNTSNKEEIMKPEVVD 63
           PANAS+ITQCGGIPLIIECLSSPVNLVCVASCQVNYALGAIYYLCNTSNKEEIMKPEVVD
Sbjct: 38  PANASIITQCGGIPLIIECLSSPVNLVCVASCQVNYALGAIYYLCNTSNKEEIMKPEVVD 97

Query: 64  VINKYAVAESVSFSNLSKAILDKHLSNRN 93
           VINKYAVAESVSFSNL+KAILDKHLSNRN
Sbjct: 98  VINKYAVAESVSFSNLAKAILDKHLSNRN 126

BLAST of CsaV3_1G031100 vs. NCBI nr
Match: XP_004301610.1 (PREDICTED: armadillo repeat-containing protein 7-like [Fragaria vesca subsp. vesca])

HSP 1 Score: 125.2 bits (313), Expect = 1.2e-25
Identity = 63/91 (69.23%), Postives = 76/91 (83.52%), Query Frame = 0

Query: 2   PDPANASLITQCGGIPLIIECLSSPV-NLVCVASCQVNYALGAIYYLCNTSNKEEIMKPE 61
           PDPANA+++TQ GGIPL+I+CLSSPV N        VNYA+G++YYLCNTSNKEEIMKPE
Sbjct: 92  PDPANAAIVTQSGGIPLVIQCLSSPVSNPPLPVRNTVNYAIGSLYYLCNTSNKEEIMKPE 151

Query: 62  VVDVINKYAVAE--SVSFSNLSKAILDKHLS 90
           VVD++ +YA AE  SVSFSNL+KA LDKH+S
Sbjct: 152 VVDIMKRYAAAEGASVSFSNLAKAFLDKHVS 182

BLAST of CsaV3_1G031100 vs. NCBI nr
Match: XP_022729317.1 (armadillo repeat-containing protein 7 isoform X1 [Durio zibethinus])

HSP 1 Score: 123.6 bits (309), Expect = 3.4e-25
Identity = 62/89 (69.66%), Postives = 74/89 (83.15%), Query Frame = 0

Query: 3   DPANASLITQCGGIPLIIECLSSPVNLVCVASCQVNYALGAIYYLCNTSNKEEIMKPEVV 62
           DPANA++ITQCGGIPL+I+CLSSPV         VNYALGA+YYLCN SN+EEI+KPEVV
Sbjct: 93  DPANAAIITQCGGIPLVIQCLSSPVRNT------VNYALGALYYLCNKSNREEILKPEVV 152

Query: 63  DVINKYAVAE--SVSFSNLSKAILDKHLS 90
           DVI +YA A+  +VSFSNL+KA LDKH+S
Sbjct: 153 DVIERYAAAQTINVSFSNLAKAFLDKHVS 175

BLAST of CsaV3_1G031100 vs. NCBI nr
Match: XP_022729318.1 (armadillo repeat-containing protein 7 isoform X2 [Durio zibethinus])

HSP 1 Score: 123.6 bits (309), Expect = 3.4e-25
Identity = 62/89 (69.66%), Postives = 74/89 (83.15%), Query Frame = 0

Query: 3   DPANASLITQCGGIPLIIECLSSPVNLVCVASCQVNYALGAIYYLCNTSNKEEIMKPEVV 62
           DPANA++ITQCGGIPL+I+CLSSPV         VNYALGA+YYLCN SN+EEI+KPEVV
Sbjct: 60  DPANAAIITQCGGIPLVIQCLSSPVRNT------VNYALGALYYLCNKSNREEILKPEVV 119

Query: 63  DVINKYAVAE--SVSFSNLSKAILDKHLS 90
           DVI +YA A+  +VSFSNL+KA LDKH+S
Sbjct: 120 DVIERYAAAQTINVSFSNLAKAFLDKHVS 142

BLAST of CsaV3_1G031100 vs. NCBI nr
Match: XP_017627539.1 (PREDICTED: armadillo repeat-containing protein 7 isoform X3 [Gossypium arboreum])

HSP 1 Score: 122.9 bits (307), Expect = 5.9e-25
Identity = 61/89 (68.54%), Postives = 74/89 (83.15%), Query Frame = 0

Query: 3   DPANASLITQCGGIPLIIECLSSPVNLVCVASCQVNYALGAIYYLCNTSNKEEIMKPEVV 62
           DPANA++ITQCGGIPL+I+CLSSPV         VNYALGA+YYLCN SN+EEI+KPEV+
Sbjct: 60  DPANAAIITQCGGIPLVIKCLSSPVRNT------VNYALGALYYLCNKSNREEILKPEVI 119

Query: 63  DVINKYAVAESV--SFSNLSKAILDKHLS 90
           DVI +YA A++V  SFSNL+KA LDKH+S
Sbjct: 120 DVIERYAAAQTVNASFSNLAKAFLDKHVS 142

BLAST of CsaV3_1G031100 vs. TAIR10
Match: AT5G37290.1 (ARM repeat superfamily protein)

HSP 1 Score: 62.4 bits (150), Expect = 1.7e-10
Identity = 31/50 (62.00%), Postives = 41/50 (82.00%), Query Frame = 0

Query: 43  AIYYLC--NTSNKEEIMKPEVVDVINKYAVAE--SVSFSNLSKAILDKHL 89
           A+YY+C  N + +EEI++PEVVD+I +YA AE  SVSFSNL+KA LDKH+
Sbjct: 127 ALYYMCDYNRATREEILRPEVVDLIERYAAAESVSVSFSNLAKAFLDKHV 176

BLAST of CsaV3_1G031100 vs. TrEMBL
Match: tr|A0A2P5SCW5|A0A2P5SCW5_GOSBA (Uncharacterized protein OS=Gossypium barbadense OX=3634 GN=GOBAR_DD06098 PE=4 SV=1)

HSP 1 Score: 122.9 bits (307), Expect = 3.9e-25
Identity = 61/89 (68.54%), Postives = 74/89 (83.15%), Query Frame = 0

Query: 3   DPANASLITQCGGIPLIIECLSSPVNLVCVASCQVNYALGAIYYLCNTSNKEEIMKPEVV 62
           DPANA++ITQCGGIPL+I+CLSSPV         VNYALGA+YYLCN SN+EEI+KPEV+
Sbjct: 35  DPANAAIITQCGGIPLVIKCLSSPVRNT------VNYALGALYYLCNKSNREEILKPEVI 94

Query: 63  DVINKYAVAESV--SFSNLSKAILDKHLS 90
           DVI +YA A++V  SFSNL+KA LDKHL+
Sbjct: 95  DVIERYAAAQTVNASFSNLAKAFLDKHLT 117

BLAST of CsaV3_1G031100 vs. TrEMBL
Match: tr|A0A251Q039|A0A251Q039_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_3G143200 PE=4 SV=1)

HSP 1 Score: 120.6 bits (301), Expect = 1.9e-24
Identity = 61/89 (68.54%), Postives = 72/89 (80.90%), Query Frame = 0

Query: 3   DPANASLITQCGGIPLIIECLSSPVNLVCVASCQVNYALGAIYYLCNTSNKEEIMKPEVV 62
           DPANA +ITQCGGIPL+I+CLSSPV         VNYA+G++YYLCN SNK EIMKPEVV
Sbjct: 66  DPANAVIITQCGGIPLVIQCLSSPVRNT------VNYAIGSLYYLCNASNKGEIMKPEVV 125

Query: 63  DVINKYAVAE--SVSFSNLSKAILDKHLS 90
           DV+ +YA AE  S+SFSNL+KA LDKH+S
Sbjct: 126 DVMKRYAAAEEVSLSFSNLAKAFLDKHVS 148

BLAST of CsaV3_1G031100 vs. TrEMBL
Match: tr|A0A061GXH2|A0A061GXH2_THECC (ARM repeat superfamily protein isoform 3 OS=Theobroma cacao OX=3641 GN=TCM_041735 PE=4 SV=1)

HSP 1 Score: 120.2 bits (300), Expect = 2.5e-24
Identity = 60/89 (67.42%), Postives = 73/89 (82.02%), Query Frame = 0

Query: 3   DPANASLITQCGGIPLIIECLSSPVNLVCVASCQVNYALGAIYYLCNTSNKEEIMKPEVV 62
           DPANA+++TQC GIPL+I+CLSSPV         VNYALGA+YYLCN SN+EEI+KPEVV
Sbjct: 34  DPANAAILTQCDGIPLVIQCLSSPVRNT------VNYALGALYYLCNKSNREEILKPEVV 93

Query: 63  DVINKYAVAE--SVSFSNLSKAILDKHLS 90
           DVI +YA A+  +VSFSNL+KA LDKH+S
Sbjct: 94  DVIERYAAAQTVNVSFSNLAKAFLDKHVS 116

BLAST of CsaV3_1G031100 vs. TrEMBL
Match: tr|A0A061F372|A0A061F372_THECC (ARM repeat superfamily protein OS=Theobroma cacao OX=3641 GN=TCM_026566 PE=4 SV=1)

HSP 1 Score: 118.6 bits (296), Expect = 7.3e-24
Identity = 58/87 (66.67%), Postives = 70/87 (80.46%), Query Frame = 0

Query: 3   DPANASLITQCGGIPLIIECLSSPVNLVCVASCQVNYALGAIYYLCNTSNKEEIMKPEVV 62
           DPAN ++ITQC GIPL I+CLSSPV         VNYALGA YYLCN +N+EEI+KPEVV
Sbjct: 26  DPANGAIITQCDGIPLAIQCLSSPV------INTVNYALGAFYYLCNKANREEILKPEVV 85

Query: 63  DVINKYAVAESVSFSNLSKAILDKHLS 90
           DVI +YA +++VSFSNL+KA LDKH+S
Sbjct: 86  DVIERYAASQNVSFSNLAKAFLDKHVS 106

BLAST of CsaV3_1G031100 vs. TrEMBL
Match: tr|A0A2P2JC85|A0A2P2JC85_RHIMU (Uncharacterized protein MANES_02G134600 OS=Rhizophora mucronata OX=61149 PE=4 SV=1)

HSP 1 Score: 111.7 bits (278), Expect = 9.0e-22
Identity = 55/87 (63.22%), Postives = 68/87 (78.16%), Query Frame = 0

Query: 3   DPANASLITQCGGIPLIIECLSSPVNLVCVASCQVNYALGAIYYLCNTSNKEEIMKPEVV 62
           D ANA+++T+CGGI  II+CLSSP+         V YALGA+YYLC  SNKEEI+KPEV+
Sbjct: 93  DSANAAVVTECGGILHIIQCLSSPIRNT------VKYALGALYYLCYASNKEEILKPEVI 152

Query: 63  DVINKYAVAESVSFSNLSKAILDKHLS 90
           DVI +YA A+SV FSNL+KA LDKH+S
Sbjct: 153 DVIKRYAEADSVDFSNLAKAFLDKHVS 173

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_011648493.15.9e-4197.75PREDICTED: armadillo repeat-containing protein 7-like [Cucumis sativus][more]
XP_004301610.11.2e-2569.23PREDICTED: armadillo repeat-containing protein 7-like [Fragaria vesca subsp. ves... [more]
XP_022729317.13.4e-2569.66armadillo repeat-containing protein 7 isoform X1 [Durio zibethinus][more]
XP_022729318.13.4e-2569.66armadillo repeat-containing protein 7 isoform X2 [Durio zibethinus][more]
XP_017627539.15.9e-2568.54PREDICTED: armadillo repeat-containing protein 7 isoform X3 [Gossypium arboreum][more]
Match NameE-valueIdentityDescription
AT5G37290.11.7e-1062.00ARM repeat superfamily protein[more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
tr|A0A2P5SCW5|A0A2P5SCW5_GOSBA3.9e-2568.54Uncharacterized protein OS=Gossypium barbadense OX=3634 GN=GOBAR_DD06098 PE=4 SV... [more]
tr|A0A251Q039|A0A251Q039_PRUPE1.9e-2468.54Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_3G143200 PE=4 SV=1[more]
tr|A0A061GXH2|A0A061GXH2_THECC2.5e-2467.42ARM repeat superfamily protein isoform 3 OS=Theobroma cacao OX=3641 GN=TCM_04173... [more]
tr|A0A061F372|A0A061F372_THECC7.3e-2466.67ARM repeat superfamily protein OS=Theobroma cacao OX=3641 GN=TCM_026566 PE=4 SV=... [more]
tr|A0A2P2JC85|A0A2P2JC85_RHIMU9.0e-2263.22Uncharacterized protein MANES_02G134600 OS=Rhizophora mucronata OX=61149 PE=4 SV... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR011989ARM-like
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsaV3_1G031100.1CsaV3_1G031100.1mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011989Armadillo-like helicalGENE3DG3DSA:1.25.10.10coord: 1..91
e-value: 1.6E-5
score: 26.7

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
CsaV3_1G031100CsGy1G019560Cucumber (Gy14) v2cgybcucB001
The following gene(s) are paralogous to this gene:

None