Csa1G009730.1 (mRNA) Cucumber (Chinese Long) v2

NameCsa1G009730.1
TypemRNA
OrganismCucumis. sativus (Cucumber (Chinese Long) v2)
DescriptionProtein DCL; contains IPR015801 (Copper amine oxidase, N2/N3-terminal), IPR021602 (Protein of unknown function DUF3223)
LocationChr1 : 1467359 .. 1469635 (+)
Sequence length699
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTCTATACTGTACTTTGCTAGGGTTTTAGTTTGAAGCTTGATTGTTCCAACTTGAATTCACAAATTACAATGGCGGAAGAGACAATCCCTCAACTCGATCCGGAGGTAACCAACTCAGGCACTGAAGATATGGAATTGGAAACTACGGAACCGAAAGCTTCAACTGATGCTTCTGAAGCAAATGAGGCCGCTCCAGTAATTAATGGCGATGCTACTTCGAAGCGAGAGAGGGAGGAAAGCGCCGATGATGGCGTCGCCGATGGGAAGAAGCAGAAGATAGAGAAATCTGTTGAGGAGGAAAGGCTTGAGAAGTTGGGTGGAGACGGTAAGTGTGAGGAGGAACCTGTTCCTGTGAGCTTAGGCCCCAAGAGCTTTCGATCATCGGTGGAATTGTTTGATTACTTCTATAAGATCCTTCACCACTGGCCTGTCAACTTGAACGTGAACCAGGTATGATATTATTAACTTATTTTAAATGTGTCATAGTTCTCTGTTTTTGTTAATTGAAAGCTCGAGAATGATATTGAGAAAATACTAATTTTGTTATAAGGGATTGTTGGAACTTGGTTGAACAGTTTTAGCAAAAGGGTTTCTATATAACTGCATGCTTATTCAAAAGGGAATCTCAAATGTCCAAGTTCTTTTTTATATTATTTGAATACAATGCTTGCTTGTTTGAATTATTTATGGTTTCCTGTTGTTGAGTATAATTTAATTGAAGACATAAAGATTGTATTGCTTCTTACTGCCAGTTAGATTAATTATTGTTGTTTGTTTAATTTGATAAAATTTGAACTCGAATGGATATTCATCTATATTGTGCTCTATTTGGTCGTGTTATGATTCGTTTAGTTGATTCTTTTAGTAAATTTTTGTTGTTATGGCTGGATGGTTTTTCCTAAACTAAGTATGATTTTTGAGTCAGTTTCCAATATTTCTTCAAATGTTTTGTTGACGGTTGGTTGGTGTCCGTTTTACTCTAAACCTACTTACTTCTTTAGATGAGACTGTTCGACTTTATTAATTATGGAAAATGGAATCTGGGTAGTACTAGAAAAATCCCAGCCTGGAGGGATAAATTTTATTTGTTTCTGTGACGGATACTCTTTTACCTTATTTGCTCATTCAAGAATTTCTTTCAGCAGTTTATTTTTCTTGTGTTATGTTGTTATTCTTCAGGATTCCCGTATTCTTGATTGATTGATGTCTGTGGAATTAATTAATGATATTGATATACTTTATCTATATACTACGGTATGGCTTATCCTGTGATGTGGAGCAAATTATTCAAACAGAAGCTATGGCTTCTAACCGTTTGTGATACAATGAGAACATTTTTATCAAATAATTCGTAATTAGAAATGCAGTCAAGTGTTTATCTTCTACTGTCTTAAAGCGTTTAGAAGTGTATTCCTTTAGGCTTTAGCTCACCTTACCATAACTTAAATGGTTAAAACATAAGCAATGCCAATTTTAGGCTTGAATTTATGTTCTATGAAGAGTGTATTTCCTTAAGGTCCCCTATATTAACTTGTAAGTCGTGCTTTTCATTTAATTAACAGCCTTTCCAACGTAAAAGATAATCCCCCGAAGTCATTGACTTGATTTGAAACATCTATAAATGATAATTTACATGATGAGAAAGCAGTGACTCTGTTAATACAATATTTGATATTTGTTTCTATGGCAGTACGAGCAAATGGTGCTGGTAGATTTACTTAAGAAGGGACATCTAGAACCAGAGAAAAAGATTGGCTGTGGTATTCACTCTTTTCAAATTCGTTTCCATCCTGAGTGGAAAAGCAAGTGCTTCTTCCTCATCCGGGAGGATGAGTCTGCTGATGACTTCAGCTTCAGGAAGTGTGTTGATCATATCCTTCCTCTGCCTGAAAACTTCAAAGGAAAATTTGATGCAAACAGAGCATTAGGCGGTGGTAAGCACCGCGGTGGAGGACGAGGTGGGGGTGGTGGACGTGGAAGAGGAGGAGGGGGTCGGTGGAGAAACTGATGTTTTTCCCAACACTTTTCAGTCTCACCACTTTGGACTCAATTTTCCTGCGAAGAATTTACCACAAAATCCCCACCAATTCAAGCTTCTTGGCCATGAGTTCAATGAAAAAAGTAGAGGAATAATTTTTTTATGCCCATAATTTATGCTTTTCTTAGCGTGCTTGATGTATGAACTCAAAATTATGATACAATTTTCTGACGATAATTGTTAAAGGATGAGTTTTTATGTTAATTTTAGAGAGAATTAATGTCCGAATATTATCCAAG

mRNA sequence

ATGGCGGAAGAGACAATCCCTCAACTCGATCCGGAGGTAACCAACTCAGGCACTGAAGATATGGAATTGGAAACTACGGAACCGAAAGCTTCAACTGATGCTTCTGAAGCAAATGAGGCCGCTCCAGTAATTAATGGCGATGCTACTTCGAAGCGAGAGAGGGAGGAAAGCGCCGATGATGGCGTCGCCGATGGGAAGAAGCAGAAGATAGAGAAATCTGTTGAGGAGGAAAGGCTTGAGAAGTTGGGTGGAGACGGTAAGTGTGAGGAGGAACCTGTTCCTGTGAGCTTAGGCCCCAAGAGCTTTCGATCATCGGTGGAATTGTTTGATTACTTCTATAAGATCCTTCACCACTGGCCTGTCAACTTGAACGTGAACCAGTACGAGCAAATGGTGCTGGTAGATTTACTTAAGAAGGGACATCTAGAACCAGAGAAAAAGATTGGCTGTGGTATTCACTCTTTTCAAATTCGTTTCCATCCTGAGTGGAAAAGCAAGTGCTTCTTCCTCATCCGGGAGGATGAGTCTGCTGATGACTTCAGCTTCAGGAAGTGTGTTGATCATATCCTTCCTCTGCCTGAAAACTTCAAAGGAAAATTTGATGCAAACAGAGCATTAGGCGGTGGTAAGCACCGCGGTGGAGGACGAGGTGGGGGTGGTGGACGTGGAAGAGGAGGAGGGGGTCGGTGGAGAAACTGA

Coding sequence (CDS)

ATGGCGGAAGAGACAATCCCTCAACTCGATCCGGAGGTAACCAACTCAGGCACTGAAGATATGGAATTGGAAACTACGGAACCGAAAGCTTCAACTGATGCTTCTGAAGCAAATGAGGCCGCTCCAGTAATTAATGGCGATGCTACTTCGAAGCGAGAGAGGGAGGAAAGCGCCGATGATGGCGTCGCCGATGGGAAGAAGCAGAAGATAGAGAAATCTGTTGAGGAGGAAAGGCTTGAGAAGTTGGGTGGAGACGGTAAGTGTGAGGAGGAACCTGTTCCTGTGAGCTTAGGCCCCAAGAGCTTTCGATCATCGGTGGAATTGTTTGATTACTTCTATAAGATCCTTCACCACTGGCCTGTCAACTTGAACGTGAACCAGTACGAGCAAATGGTGCTGGTAGATTTACTTAAGAAGGGACATCTAGAACCAGAGAAAAAGATTGGCTGTGGTATTCACTCTTTTCAAATTCGTTTCCATCCTGAGTGGAAAAGCAAGTGCTTCTTCCTCATCCGGGAGGATGAGTCTGCTGATGACTTCAGCTTCAGGAAGTGTGTTGATCATATCCTTCCTCTGCCTGAAAACTTCAAAGGAAAATTTGATGCAAACAGAGCATTAGGCGGTGGTAAGCACCGCGGTGGAGGACGAGGTGGGGGTGGTGGACGTGGAAGAGGAGGAGGGGGTCGGTGGAGAAACTGA

Protein sequence

MAEETIPQLDPEVTNSGTEDMELETTEPKASTDASEANEAAPVINGDATSKREREESADDGVADGKKQKIEKSVEEERLEKLGGDGKCEEEPVPVSLGPKSFRSSVELFDYFYKILHHWPVNLNVNQYEQMVLVDLLKKGHLEPEKKIGCGIHSFQIRFHPEWKSKCFFLIREDESADDFSFRKCVDHILPLPENFKGKFDANRALGGGKHRGGGRGGGGGRGRGGGGRWRN*
BLAST of Csa1G009730.1 vs. TAIR10
Match: AT5G62440.1 (AT5G62440.1 Protein of unknown function (DUF3223))

HSP 1 Score: 216.5 bits (550), Expect = 1.8e-56
Identity = 117/229 (51.09%), Postives = 139/229 (60.70%), Query Frame = 1

Query: 3   EETIPQLDPEVTNSGTEDMELETTEPKASTDASEANEAAPVINGDATSKREREESADDGV 62
           +E +  L  EV      DME+ET  PKA T             GD   +RE  E  ++G 
Sbjct: 5   QEIVDSLSAEVNPDQKVDMEVETATPKAET-------------GDEKREREETEEEENG- 64

Query: 63  ADGKKQKIEKSVEEERLEKLGGDGKCEEEPVPVSLGPKSFRSSVELFDYFYKILHHWPVN 122
            + KKQK+ +                EE+  PV LGPK F +SV +FDYF K LH WP +
Sbjct: 65  GESKKQKVGE----------------EEKSGPVKLGPKEFVTSVAMFDYFVKFLHFWPTD 124

Query: 123 LNVNQYEQMVLVDLLKKGHLEPEKKIGCGIHSFQIRFHPEWKSKCFFLIREDESADDFSF 182
           L+VN+YE MVL+DL+KKGH EPEKKIG GI +FQ+R HP WKS+CFFL+RED++ADDFSF
Sbjct: 125 LDVNKYEHMVLLDLIKKGHSEPEKKIGGGIKTFQVRTHPMWKSRCFFLVREDDTADDFSF 184

Query: 183 RKCVDHILPLPENFKGKFDANRALGGGKHRGGGRGGGGGRGRGGGGRWR 232
           RKCVD ILPLPEN K         GGG  RGGG G  GGRG G GGR+R
Sbjct: 185 RKCVDQILPLPENMKTPGANGNGHGGG--RGGGGGRRGGRGGGRGGRFR 201

BLAST of Csa1G009730.1 vs. NCBI nr
Match: gi|778655953|ref|XP_011660085.1| (PREDICTED: protein DCL, chloroplastic [Cucumis sativus])

HSP 1 Score: 484.2 bits (1245), Expect = 1.3e-133
Identity = 232/232 (100.00%), Postives = 232/232 (100.00%), Query Frame = 1

Query: 1   MAEETIPQLDPEVTNSGTEDMELETTEPKASTDASEANEAAPVINGDATSKREREESADD 60
           MAEETIPQLDPEVTNSGTEDMELETTEPKASTDASEANEAAPVINGDATSKREREESADD
Sbjct: 1   MAEETIPQLDPEVTNSGTEDMELETTEPKASTDASEANEAAPVINGDATSKREREESADD 60

Query: 61  GVADGKKQKIEKSVEEERLEKLGGDGKCEEEPVPVSLGPKSFRSSVELFDYFYKILHHWP 120
           GVADGKKQKIEKSVEEERLEKLGGDGKCEEEPVPVSLGPKSFRSSVELFDYFYKILHHWP
Sbjct: 61  GVADGKKQKIEKSVEEERLEKLGGDGKCEEEPVPVSLGPKSFRSSVELFDYFYKILHHWP 120

Query: 121 VNLNVNQYEQMVLVDLLKKGHLEPEKKIGCGIHSFQIRFHPEWKSKCFFLIREDESADDF 180
           VNLNVNQYEQMVLVDLLKKGHLEPEKKIGCGIHSFQIRFHPEWKSKCFFLIREDESADDF
Sbjct: 121 VNLNVNQYEQMVLVDLLKKGHLEPEKKIGCGIHSFQIRFHPEWKSKCFFLIREDESADDF 180

Query: 181 SFRKCVDHILPLPENFKGKFDANRALGGGKHRGGGRGGGGGRGRGGGGRWRN 233
           SFRKCVDHILPLPENFKGKFDANRALGGGKHRGGGRGGGGGRGRGGGGRWRN
Sbjct: 181 SFRKCVDHILPLPENFKGKFDANRALGGGKHRGGGRGGGGGRGRGGGGRWRN 232

BLAST of Csa1G009730.1 vs. NCBI nr
Match: gi|659105915|ref|XP_008453205.1| (PREDICTED: protein DCL, chloroplastic [Cucumis melo])

HSP 1 Score: 456.1 bits (1172), Expect = 3.8e-125
Identity = 218/232 (93.97%), Postives = 223/232 (96.12%), Query Frame = 1

Query: 1   MAEETIPQLDPEVTNSGTEDMELETTEPKASTDASEANEAAPVINGDATSKREREESADD 60
           MAEETIPQLDPEVTNSGTEDMELETT PK S+DASEA EAAPVINGDA SKREREESADD
Sbjct: 1   MAEETIPQLDPEVTNSGTEDMELETTVPKGSSDASEATEAAPVINGDANSKREREESADD 60

Query: 61  GVADGKKQKIEKSVEEERLEKLGGDGKCEEEPVPVSLGPKSFRSSVELFDYFYKILHHWP 120
           GV DGKKQKIEKSVEEER+EKLGGDGKCEEEPVPVSLGPKSFRSSVELFDYFYKILH+WP
Sbjct: 61  GVGDGKKQKIEKSVEEERVEKLGGDGKCEEEPVPVSLGPKSFRSSVELFDYFYKILHNWP 120

Query: 121 VNLNVNQYEQMVLVDLLKKGHLEPEKKIGCGIHSFQIRFHPEWKSKCFFLIREDESADDF 180
           +NLNVNQYEQMVLVDLLKKGHLEPEKKIGCGI +FQIRFHP WKSKCFFLIREDESADDF
Sbjct: 121 INLNVNQYEQMVLVDLLKKGHLEPEKKIGCGIQAFQIRFHPVWKSKCFFLIREDESADDF 180

Query: 181 SFRKCVDHILPLPENFKGKFDANRALGGGKHRGGGRGGGGGRGRGGGGRWRN 233
           SFRKCVDHILPLPENFK K DANRALGGGKHRGGGRGGGGGRGRGGGGRWRN
Sbjct: 181 SFRKCVDHILPLPENFKAKSDANRALGGGKHRGGGRGGGGGRGRGGGGRWRN 232

BLAST of Csa1G009730.1 vs. NCBI nr
Match: gi|1009132640|ref|XP_015883480.1| (PREDICTED: protein DCL, chloroplastic [Ziziphus jujuba])

HSP 1 Score: 266.5 bits (680), Expect = 4.2e-68
Identity = 146/252 (57.94%), Postives = 174/252 (69.05%), Query Frame = 1

Query: 1   MAEETIPQL---DPEVTNSGTEDMELETTE-----PKASTDASE---ANEAAPVINGDAT 60
           MAEET+ +L   + E  +   EDMELET E      K   D      AN A P  NG+A 
Sbjct: 1   MAEETVAELLYTNLEGADVPGEDMELETLENGQEDTKGDGDGGSGAVANGAEPPANGEAN 60

Query: 61  SKREREESADDGVADG----KKQKIEKSVEEERLEKLGGDGKCEEE-------PVPVSLG 120
           SKR REE  ++   +     KKQK+EKSVEE+RLEKLGG G   +E        V V LG
Sbjct: 61  SKRAREEEEEEEEEENNGVSKKQKVEKSVEEDRLEKLGGGGSGGKEGENGDDGSVRVKLG 120

Query: 121 PKSFRSSVELFDYFYKILHHWPVNLNVNQYEQMVLVDLLKKGHLEPEKKIGCGIHSFQIR 180
           PK F SSVE+FDYF+K+LH WP N+N+N+YEQMVL+DLLKKGH EP+KKIG GI +FQ+R
Sbjct: 121 PKEFVSSVEMFDYFFKLLHFWPTNVNINKYEQMVLLDLLKKGHAEPDKKIGGGIKAFQVR 180

Query: 181 FHPEWKSKCFFLIREDESADDFSFRKCVDHILPLPENFKGKFDANRALGGGKHRG-GGRG 230
           FHP +KS+CFFL+R DE+ DDFSFRKCVD ILPLPE+ + K DANR L GG  +G GGRG
Sbjct: 181 FHPTFKSRCFFLLRNDETTDDFSFRKCVDRILPLPEHMQIKSDANRGLSGGGGKGRGGRG 240

BLAST of Csa1G009730.1 vs. NCBI nr
Match: gi|225457413|ref|XP_002284935.1| (PREDICTED: uncharacterized protein LOC100255290 [Vitis vinifera])

HSP 1 Score: 263.8 bits (673), Expect = 2.8e-67
Identity = 139/241 (57.68%), Postives = 173/241 (71.78%), Query Frame = 1

Query: 1   MAEETIPQLDPEVTN---SGTEDMELETTEPKASTDASEANEAAPVINGDATSKREREES 60
           MAE  + +    VT    + T+DM++E  EP     +  A+    V NGD+ SKR REE+
Sbjct: 1   MAETVVSETPETVTERETANTQDMDVEAPEPSQPNGSDSADN---VTNGDSNSKRGREEA 60

Query: 61  ADDGVADG---KKQKIEKSVEEERLEKLGGDGKCEEEPVPVSLGPKSFRSSVELFDYFYK 120
            +   A+    KKQK+EKSVEEERLEKL  +     E    SLGPK+F SSVE+FD+F+K
Sbjct: 61  GEGEDANDAVTKKQKVEKSVEEERLEKLEAE---VVETGRFSLGPKTFGSSVEMFDHFFK 120

Query: 121 ILHHWPVNLNVNQYEQMVLVDLLKKGHLEPEKKIGCGIHSFQIRFHPEWKSKCFFLIRED 180
            LH+WP NL+VN+YE M+L+DLLKKGH EP+KKIG GIH+FQ+R+HP +KS+CFF+IR+D
Sbjct: 121 FLHYWPANLDVNKYEHMMLLDLLKKGHTEPDKKIGGGIHAFQVRYHPVFKSRCFFVIRDD 180

Query: 181 ESADDFSFRKCVDHILPLPENFKGKFDANRALGGGK--HRGGGRGGGGGRGRGG-GGRWR 233
           ES DDFSFRKCVDHI PLPEN K K + N+ALGGG+    GGG G GGGRGRGG GGR R
Sbjct: 181 ESVDDFSFRKCVDHISPLPENMKAKSEVNKALGGGRGGKGGGGGGRGGGRGRGGRGGRGR 235

BLAST of Csa1G009730.1 vs. NCBI nr
Match: gi|645269311|ref|XP_008239940.1| (PREDICTED: protein DCL, chloroplastic [Prunus mume])

HSP 1 Score: 258.5 bits (659), Expect = 1.2e-65
Identity = 140/232 (60.34%), Postives = 167/232 (71.98%), Query Frame = 1

Query: 1   MAEETIPQLDPEVTNSGT----EDMELETTEPKASTDASEANEAAPVINGDATSKREREE 60
           MAE+T+ +L    TNS T    EDMELE +EP  +    E          +A +KR R+E
Sbjct: 1   MAEKTVSELAE--TNSETAAAAEDMELEASEPAPAEKPDEGTNG----EAEANAKRLRDE 60

Query: 61  SADDGV-ADGKKQKIEKSVEEERLEKLGGDGKCEEEPVPVSLGPKSFRSSVELFDYFYKI 120
              +G  A  KK K+EKS EEERLEKLG +GK   E   VSLGPKSF SSVE+FDYFYK+
Sbjct: 61  EGSEGNDAVAKKTKVEKSPEEERLEKLG-EGK---ESGRVSLGPKSFGSSVEMFDYFYKL 120

Query: 121 LHHWPVNLNVNQYEQMVLVDLLKKGHLEPEKKIGCGIHSFQIRFHPEWKSKCFFLIREDE 180
           LH+WP +L+VN+YE +VL+DLLKKGH EP+KKIG G+H+FQ+R HP +KS+CFFLIREDE
Sbjct: 121 LHYWPTDLSVNKYEHLVLLDLLKKGHAEPDKKIGGGVHAFQVRTHPLYKSRCFFLIREDE 180

Query: 181 SADDFSFRKCVDHILPLPENFKGKFDANRALGGGKHRGGGRGGGGGRGRGGG 228
           + DDFSFRKCVD ILPLPEN K   DAN+ALGG   RGGG G GG RGRG G
Sbjct: 181 AVDDFSFRKCVDQILPLPENMKAHSDANKALGGKGGRGGGGGRGGWRGRGRG 222

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
AT5G62440.11.8e-5651.09 Protein of unknown function (DUF3223)[more]
Match NameE-valueIdentityDescription
gi|778655953|ref|XP_011660085.1|1.3e-133100.00PREDICTED: protein DCL, chloroplastic [Cucumis sativus][more]
gi|659105915|ref|XP_008453205.1|3.8e-12593.97PREDICTED: protein DCL, chloroplastic [Cucumis melo][more]
gi|1009132640|ref|XP_015883480.1|4.2e-6857.94PREDICTED: protein DCL, chloroplastic [Ziziphus jujuba][more]
gi|225457413|ref|XP_002284935.1|2.8e-6757.68PREDICTED: uncharacterized protein LOC100255290 [Vitis vinifera][more]
gi|645269311|ref|XP_008239940.1|1.2e-6560.34PREDICTED: protein DCL, chloroplastic [Prunus mume][more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
IPR021602Protein of unknown function DUF3223
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009308 amine metabolic process
biological_process GO:0003006 developmental process involved in reproduction
biological_process GO:0009790 embryo development
biological_process GO:0009791 post-embryonic development
biological_process GO:0044763 single-organism cellular process
biological_process GO:0044702 single organism reproductive process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005507 copper ion binding
molecular_function GO:0048038 quinone binding

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Csa1G009730Csa1G009730gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Csa1G009730.1Csa1G009730.1-proteinpolypeptide


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Csa1G009730.1.utr5p1Csa1G009730.1.utr5p1five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Csa1G009730.1.cds1Csa1G009730.1.cds1CDS
Csa1G009730.1.cds2Csa1G009730.1.cds2CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Csa1G009730.1.utr3p1Csa1G009730.1.utr3p1three_prime_UTR


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR021602Protein of unknown function DUF3223PFAMPF11523DUF3223coord: 114..186
score: 1.2
NoneNo IPR availableGENE3DG3DSA:3.10.450.40coord: 94..191
score: 4.6
NoneNo IPR availablePANTHERPTHR33415FAMILY NOT NAMEDcoord: 1..232
score: 2.5
NoneNo IPR availablePANTHERPTHR33415:SF3EMB514coord: 1..232
score: 2.5