Csa7G374730.1 (mRNA) Cucumber (Chinese Long) v2

NameCsa7G374730.1
TypemRNA
OrganismCucumis. sativus (Cucumber (Chinese Long) v2)
DescriptionU-box domain-containing protein 3
LocationChr7 : 13636410 .. 13637309 (-)
Sequence length462
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGTTGTTTTATCGTGAATGTGAGTCACTGGATGCAGCTGTAAATGAGGCTCGAGAATTCATCGAAAACTGGTGTCTGAAACAAGCATAATTTGCAGTGTGAGTTACCTTGAGAATCAGCCTTCTAATACTCAAGTTCTTTTTTTCATCATCCACTGCATATGCTTCCGTGGTTGTATCAGCGACTCTTTTTCTTTTGCATGTTCTCATGGAATTAGCTGGCCGTTCTTTGTCTGTGGCTGTTAGATTTCAAGAGCTGAATGCTGTGACAAATTCAAAGCTCTTCACAAGTGACCTGCGAGATTGTTTGGAAGTTGTTGGAGTCAGTATCATGCAGCTCGAGTTTAAATGTTGTTCAGGCATGTATAAATTTGAAAACTTTTATAACACATTTTATTTTTGATGTTTTTTCCTGTTTCTAAATAAATTGAGATGTTTGGAGTAGTAAACACTTAGCTAGCTCACACTTTTTTTTCTCTCTTATTTTGAAATATAAGTTTCTTGATTTTGATCCATCTTTTGTCATAATTTTCTGACTATTTGATAAGAATGATATTGACATTTAATGAACTGTCCAGAAATGTCCTGAAGGTCTTCAATCATTGAAACAAGAAAGGACAGCTAGATCTACAGAAGCGGCTCTAATTAGTCAAAGAAGCATTGGCCCAAACTCTGAACATCTTCAAGCACTTCATTTGACGTCAAATCAAGAACTTCTGAAAAAGACTATAGCTGTTGAAAAGGAGAGAATCGATGCTGAATCCAACAATGCGACGGAGGAACTACATCACATCATCCAGATTGTGGATCTAATTATCCGTATACCATTCCGTGGGATAAATGGTGTCTCGGTTCCTTCCTATTTCCATTGCCCAATTGTCATTGGAGCTGATTCTTGA

mRNA sequence

ATGAGTTGTTTTATCGTGAATGTGAGTCACTGGATGCAGCTATTTCAAGAGCTGAATGCTGTGACAAATTCAAAGCTCTTCACAAGTGACCTGCGAGATTGTTTGGAAGTTGTTGGAGTCAGTATCATGCAGCTCGAGTTTAAATGTTGTTCAGGTCTTCAATCATTGAAACAAGAAAGGACAGCTAGATCTACAGAAGCGGCTCTAATTAGTCAAAGAAGCATTGGCCCAAACTCTGAACATCTTCAAGCACTTCATTTGACGTCAAATCAAGAACTTCTGAAAAAGACTATAGCTGTTGAAAAGGAGAGAATCGATGCTGAATCCAACAATGCGACGGAGGAACTACATCACATCATCCAGATTGTGGATCTAATTATCCGTATACCATTCCGTGGGATAAATGGTGTCTCGGTTCCTTCCTATTTCCATTGCCCAATTGTCATTGGAGCTGATTCTTGA

Coding sequence (CDS)

ATGAGTTGTTTTATCGTGAATGTGAGTCACTGGATGCAGCTATTTCAAGAGCTGAATGCTGTGACAAATTCAAAGCTCTTCACAAGTGACCTGCGAGATTGTTTGGAAGTTGTTGGAGTCAGTATCATGCAGCTCGAGTTTAAATGTTGTTCAGGTCTTCAATCATTGAAACAAGAAAGGACAGCTAGATCTACAGAAGCGGCTCTAATTAGTCAAAGAAGCATTGGCCCAAACTCTGAACATCTTCAAGCACTTCATTTGACGTCAAATCAAGAACTTCTGAAAAAGACTATAGCTGTTGAAAAGGAGAGAATCGATGCTGAATCCAACAATGCGACGGAGGAACTACATCACATCATCCAGATTGTGGATCTAATTATCCGTATACCATTCCGTGGGATAAATGGTGTCTCGGTTCCTTCCTATTTCCATTGCCCAATTGTCATTGGAGCTGATTCTTGA

Protein sequence

MSCFIVNVSHWMQLFQELNAVTNSKLFTSDLRDCLEVVGVSIMQLEFKCCSGLQSLKQERTARSTEAALISQRSIGPNSEHLQALHLTSNQELLKKTIAVEKERIDAESNNATEELHHIIQIVDLIIRIPFRGINGVSVPSYFHCPIVIGADS*
BLAST of Csa7G374730.1 vs. Swiss-Prot
Match: PUB3_ARATH (U-box domain-containing protein 3 OS=Arabidopsis thaliana GN=PUB3 PE=2 SV=2)

HSP 1 Score: 60.8 bits (146), Expect = 1.5e-08
Identity = 41/122 (33.61%), Postives = 65/122 (53.28%), Query Frame = 1

Query: 41  SIMQLEFKCCSGLQSLKQERTARST-EAALISQRS--IGPNSEHL----QALHLTSNQEL 100
           S+  +E +C    +S KQE T     E AL +Q+      ++ HL    Q L L SNQ+L
Sbjct: 126 SVQSVE-RCVQETESFKQEGTLMELMENALRNQKDDITSLDNNHLESIIQMLGLISNQDL 185

Query: 101 LKKTIAVEKERIDAESNNATEELHHIIQIVDLIIRIP--------FRGINGVSVPSYFHC 148
           LK++I VEKERI ++++ + E++    Q+++L++ I              G+S+P YF C
Sbjct: 186 LKESITVEKERIRSQASKSEEDMEQTEQLIELVLCIREHMLKTEFLEVAKGISIPPYFRC 245

BLAST of Csa7G374730.1 vs. TrEMBL
Match: A0A0A0K4T7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G374730 PE=4 SV=1)

HSP 1 Score: 303.9 bits (777), Expect = 1.1e-79
Identity = 153/153 (100.00%), Postives = 153/153 (100.00%), Query Frame = 1

Query: 1   MSCFIVNVSHWMQLFQELNAVTNSKLFTSDLRDCLEVVGVSIMQLEFKCCSGLQSLKQER 60
           MSCFIVNVSHWMQLFQELNAVTNSKLFTSDLRDCLEVVGVSIMQLEFKCCSGLQSLKQER
Sbjct: 1   MSCFIVNVSHWMQLFQELNAVTNSKLFTSDLRDCLEVVGVSIMQLEFKCCSGLQSLKQER 60

Query: 61  TARSTEAALISQRSIGPNSEHLQALHLTSNQELLKKTIAVEKERIDAESNNATEELHHII 120
           TARSTEAALISQRSIGPNSEHLQALHLTSNQELLKKTIAVEKERIDAESNNATEELHHII
Sbjct: 61  TARSTEAALISQRSIGPNSEHLQALHLTSNQELLKKTIAVEKERIDAESNNATEELHHII 120

Query: 121 QIVDLIIRIPFRGINGVSVPSYFHCPIVIGADS 154
           QIVDLIIRIPFRGINGVSVPSYFHCPIVIGADS
Sbjct: 121 QIVDLIIRIPFRGINGVSVPSYFHCPIVIGADS 153

BLAST of Csa7G374730.1 vs. TrEMBL
Match: A0A0A0K726_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G374670 PE=4 SV=1)

HSP 1 Score: 142.1 bits (357), Expect = 5.6e-31
Identity = 81/113 (71.68%), Postives = 87/113 (76.99%), Query Frame = 1

Query: 48  KCCSGLQSLKQERTARSTEAALISQRS-IGPNSEHL----QALHLTSNQELLKKTIAVEK 107
           KC  GLQSLKQER + S E ALISQRS IGPNSEHL    +ALHLTSNQELLK+TIAVEK
Sbjct: 132 KCLEGLQSLKQERISDSIEEALISQRSGIGPNSEHLLKLIEALHLTSNQELLKETIAVEK 191

Query: 108 ERIDAESNNATEELHHIIQIVDLIIRIP--------FRGINGVSVPSYFHCPI 148
           ERI+A  NNA EELHHI QI+DLIIRI         F GINGVSVPSYF CP+
Sbjct: 192 ERINAARNNAKEELHHINQIMDLIIRIRDWMVRKDYFHGINGVSVPSYFRCPL 244

BLAST of Csa7G374730.1 vs. TrEMBL
Match: A0A061DQQ1_THECC (ARM repeat superfamily protein isoform 2 OS=Theobroma cacao GN=TCM_004020 PE=4 SV=1)

HSP 1 Score: 82.8 bits (203), Expect = 4.0e-13
Identity = 51/112 (45.54%), Postives = 70/112 (62.50%), Query Frame = 1

Query: 49  CCSGLQSLKQERTARSTEAALISQRSIG-PNSEHL----QALHLTSNQELLKKTIAVEKE 108
           C   ++ LKQER + + E AL SQR+   P  +HL    ++L+LTSNQELLK+T+AVEKE
Sbjct: 97  CMREIKCLKQERVSENIEEALRSQRNDAIPCPDHLVEVIKSLNLTSNQELLKETVAVEKE 156

Query: 109 RIDAESNNATEELHHIIQIVDLIIRI--------PFRGINGVSVPSYFHCPI 148
           R++A+ NNA  +L  I QIVDLI  +         F    GV +P +F CP+
Sbjct: 157 RMNAQVNNAKGKLDQINQIVDLISHVRDYLLKIEHFEPTTGVLIPPHFLCPL 208

BLAST of Csa7G374730.1 vs. TrEMBL
Match: A0A061DNZ3_THECC (ARM repeat superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_004020 PE=4 SV=1)

HSP 1 Score: 82.8 bits (203), Expect = 4.0e-13
Identity = 51/112 (45.54%), Postives = 70/112 (62.50%), Query Frame = 1

Query: 49  CCSGLQSLKQERTARSTEAALISQRSIG-PNSEHL----QALHLTSNQELLKKTIAVEKE 108
           C   ++ LKQER + + E AL SQR+   P  +HL    ++L+LTSNQELLK+T+AVEKE
Sbjct: 133 CMREIKCLKQERVSENIEEALRSQRNDAIPCPDHLVEVIKSLNLTSNQELLKETVAVEKE 192

Query: 109 RIDAESNNATEELHHIIQIVDLIIRI--------PFRGINGVSVPSYFHCPI 148
           R++A+ NNA  +L  I QIVDLI  +         F    GV +P +F CP+
Sbjct: 193 RMNAQVNNAKGKLDQINQIVDLISHVRDYLLKIEHFEPTTGVLIPPHFLCPL 244

BLAST of Csa7G374730.1 vs. TrEMBL
Match: F6HZL0_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0005g03520 PE=4 SV=1)

HSP 1 Score: 82.4 bits (202), Expect = 5.2e-13
Identity = 50/112 (44.64%), Postives = 69/112 (61.61%), Query Frame = 1

Query: 49  CCSGLQSLKQERTARSTEAALISQRS-IGPNSEHL----QALHLTSNQELLKKTIAVEKE 108
           C   LQ L+Q+R +   E AL SQR  I P+++ L    ++L LTS QELLK+++AVE+E
Sbjct: 133 CMQKLQHLEQKRISEYIEQALRSQRDEIIPHTQQLAKIIESLSLTSKQELLKESVAVERE 192

Query: 109 RIDAESNNATEELHHIIQIVDLIIRI--------PFRGINGVSVPSYFHCPI 148
           R++A+ N    EL  I QIV+L+  I         F  INGV +PSYF CP+
Sbjct: 193 RMNAQVNKTAYELDQINQIVELVSHIRDCMVRLGGFEAINGVRIPSYFRCPL 244

BLAST of Csa7G374730.1 vs. TAIR10
Match: AT3G54790.1 (AT3G54790.1 ARM repeat superfamily protein)

HSP 1 Score: 60.8 bits (146), Expect = 8.2e-10
Identity = 41/122 (33.61%), Postives = 65/122 (53.28%), Query Frame = 1

Query: 41  SIMQLEFKCCSGLQSLKQERTARST-EAALISQRS--IGPNSEHL----QALHLTSNQEL 100
           S+  +E +C    +S KQE T     E AL +Q+      ++ HL    Q L L SNQ+L
Sbjct: 126 SVQSVE-RCVQETESFKQEGTLMELMENALRNQKDDITSLDNNHLESIIQMLGLISNQDL 185

Query: 101 LKKTIAVEKERIDAESNNATEELHHIIQIVDLIIRIP--------FRGINGVSVPSYFHC 148
           LK++I VEKERI ++++ + E++    Q+++L++ I              G+S+P YF C
Sbjct: 186 LKESITVEKERIRSQASKSEEDMEQTEQLIELVLCIREHMLKTEFLEVAKGISIPPYFRC 245

BLAST of Csa7G374730.1 vs. NCBI nr
Match: gi|700189497|gb|KGN44730.1| (hypothetical protein Csa_7G374730 [Cucumis sativus])

HSP 1 Score: 303.9 bits (777), Expect = 1.6e-79
Identity = 153/153 (100.00%), Postives = 153/153 (100.00%), Query Frame = 1

Query: 1   MSCFIVNVSHWMQLFQELNAVTNSKLFTSDLRDCLEVVGVSIMQLEFKCCSGLQSLKQER 60
           MSCFIVNVSHWMQLFQELNAVTNSKLFTSDLRDCLEVVGVSIMQLEFKCCSGLQSLKQER
Sbjct: 1   MSCFIVNVSHWMQLFQELNAVTNSKLFTSDLRDCLEVVGVSIMQLEFKCCSGLQSLKQER 60

Query: 61  TARSTEAALISQRSIGPNSEHLQALHLTSNQELLKKTIAVEKERIDAESNNATEELHHII 120
           TARSTEAALISQRSIGPNSEHLQALHLTSNQELLKKTIAVEKERIDAESNNATEELHHII
Sbjct: 61  TARSTEAALISQRSIGPNSEHLQALHLTSNQELLKKTIAVEKERIDAESNNATEELHHII 120

Query: 121 QIVDLIIRIPFRGINGVSVPSYFHCPIVIGADS 154
           QIVDLIIRIPFRGINGVSVPSYFHCPIVIGADS
Sbjct: 121 QIVDLIIRIPFRGINGVSVPSYFHCPIVIGADS 153

BLAST of Csa7G374730.1 vs. NCBI nr
Match: gi|659100738|ref|XP_008451242.1| (PREDICTED: U-box domain-containing protein 3 [Cucumis melo])

HSP 1 Score: 142.9 bits (359), Expect = 4.7e-31
Identity = 81/113 (71.68%), Postives = 88/113 (77.88%), Query Frame = 1

Query: 48  KCCSGLQSLKQERTARSTEAALISQRS-IGPNSEHL----QALHLTSNQELLKKTIAVEK 107
           KC  GLQSLKQER + S E ALISQRS IGPNSEHL    +ALHLTSNQELLK+TIAVEK
Sbjct: 132 KCLEGLQSLKQERISDSIEEALISQRSGIGPNSEHLLKLIEALHLTSNQELLKETIAVEK 191

Query: 108 ERIDAESNNATEELHHIIQIVDLIIRIP--------FRGINGVSVPSYFHCPI 148
           ERI+AE NNA +ELHHI QI+DLIIRI         F GINGVSVPSYF CP+
Sbjct: 192 ERINAERNNAKKELHHINQIMDLIIRIRDWMVRKDYFHGINGVSVPSYFRCPL 244

BLAST of Csa7G374730.1 vs. NCBI nr
Match: gi|778727334|ref|XP_011659243.1| (PREDICTED: U-box domain-containing protein 3 [Cucumis sativus])

HSP 1 Score: 142.1 bits (357), Expect = 8.0e-31
Identity = 81/113 (71.68%), Postives = 87/113 (76.99%), Query Frame = 1

Query: 48  KCCSGLQSLKQERTARSTEAALISQRS-IGPNSEHL----QALHLTSNQELLKKTIAVEK 107
           KC  GLQSLKQER + S E ALISQRS IGPNSEHL    +ALHLTSNQELLK+TIAVEK
Sbjct: 132 KCLEGLQSLKQERISDSIEEALISQRSGIGPNSEHLLKLIEALHLTSNQELLKETIAVEK 191

Query: 108 ERIDAESNNATEELHHIIQIVDLIIRIP--------FRGINGVSVPSYFHCPI 148
           ERI+A  NNA EELHHI QI+DLIIRI         F GINGVSVPSYF CP+
Sbjct: 192 ERINAARNNAKEELHHINQIMDLIIRIRDWMVRKDYFHGINGVSVPSYFRCPL 244

BLAST of Csa7G374730.1 vs. NCBI nr
Match: gi|590715742|ref|XP_007050278.1| (ARM repeat superfamily protein isoform 1 [Theobroma cacao])

HSP 1 Score: 82.8 bits (203), Expect = 5.7e-13
Identity = 51/112 (45.54%), Postives = 70/112 (62.50%), Query Frame = 1

Query: 49  CCSGLQSLKQERTARSTEAALISQRSIG-PNSEHL----QALHLTSNQELLKKTIAVEKE 108
           C   ++ LKQER + + E AL SQR+   P  +HL    ++L+LTSNQELLK+T+AVEKE
Sbjct: 133 CMREIKCLKQERVSENIEEALRSQRNDAIPCPDHLVEVIKSLNLTSNQELLKETVAVEKE 192

Query: 109 RIDAESNNATEELHHIIQIVDLIIRI--------PFRGINGVSVPSYFHCPI 148
           R++A+ NNA  +L  I QIVDLI  +         F    GV +P +F CP+
Sbjct: 193 RMNAQVNNAKGKLDQINQIVDLISHVRDYLLKIEHFEPTTGVLIPPHFLCPL 244

BLAST of Csa7G374730.1 vs. NCBI nr
Match: gi|590715753|ref|XP_007050279.1| (ARM repeat superfamily protein isoform 2 [Theobroma cacao])

HSP 1 Score: 82.8 bits (203), Expect = 5.7e-13
Identity = 51/112 (45.54%), Postives = 70/112 (62.50%), Query Frame = 1

Query: 49  CCSGLQSLKQERTARSTEAALISQRSIG-PNSEHL----QALHLTSNQELLKKTIAVEKE 108
           C   ++ LKQER + + E AL SQR+   P  +HL    ++L+LTSNQELLK+T+AVEKE
Sbjct: 97  CMREIKCLKQERVSENIEEALRSQRNDAIPCPDHLVEVIKSLNLTSNQELLKETVAVEKE 156

Query: 109 RIDAESNNATEELHHIIQIVDLIIRI--------PFRGINGVSVPSYFHCPI 148
           R++A+ NNA  +L  I QIVDLI  +         F    GV +P +F CP+
Sbjct: 157 RMNAQVNNAKGKLDQINQIVDLISHVRDYLLKIEHFEPTTGVLIPPHFLCPL 208

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PUB3_ARATH1.5e-0833.61U-box domain-containing protein 3 OS=Arabidopsis thaliana GN=PUB3 PE=2 SV=2[more]
Match NameE-valueIdentityDescription
A0A0A0K4T7_CUCSA1.1e-79100.00Uncharacterized protein OS=Cucumis sativus GN=Csa_7G374730 PE=4 SV=1[more]
A0A0A0K726_CUCSA5.6e-3171.68Uncharacterized protein OS=Cucumis sativus GN=Csa_7G374670 PE=4 SV=1[more]
A0A061DQQ1_THECC4.0e-1345.54ARM repeat superfamily protein isoform 2 OS=Theobroma cacao GN=TCM_004020 PE=4 S... [more]
A0A061DNZ3_THECC4.0e-1345.54ARM repeat superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_004020 PE=4 S... [more]
F6HZL0_VITVI5.2e-1344.64Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0005g03520 PE=4 SV=... [more]
Match NameE-valueIdentityDescription
AT3G54790.18.2e-1033.61 ARM repeat superfamily protein[more]
Match NameE-valueIdentityDescription
gi|700189497|gb|KGN44730.1|1.6e-79100.00hypothetical protein Csa_7G374730 [Cucumis sativus][more]
gi|659100738|ref|XP_008451242.1|4.7e-3171.68PREDICTED: U-box domain-containing protein 3 [Cucumis melo][more]
gi|778727334|ref|XP_011659243.1|8.0e-3171.68PREDICTED: U-box domain-containing protein 3 [Cucumis sativus][more]
gi|590715742|ref|XP_007050278.1|5.7e-1345.54ARM repeat superfamily protein isoform 1 [Theobroma cacao][more]
gi|590715753|ref|XP_007050279.1|5.7e-1345.54ARM repeat superfamily protein isoform 2 [Theobroma cacao][more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003824 catalytic activity

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Csa7G374730Csa7G374730gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Csa7G374730.1Csa7G374730.1-proteinpolypeptide


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Csa7G374730.1.cds3Csa7G374730.1.cds3CDS
Csa7G374730.1.cds2Csa7G374730.1.cds2CDS
Csa7G374730.1.cds1Csa7G374730.1.cds1CDS


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR23315BETA CATENIN-RELATED ARMADILLO REPEAT-CONTAININGcoord: 53..147
score: 6.7
NoneNo IPR availablePANTHERPTHR23315:SF119U-BOX DOMAIN-CONTAINING PROTEIN 3coord: 53..147
score: 6.7