Cucsa.051140 (gene) Cucumber (Gy14) v1

Name: Cucsa.051140
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v1)
Description: Argininosuccinate lyase
Location: scaffold00557 : 319996 .. 320678 (-)
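
The location field packs a scaffold name, a start and end coordinate, and a strand into one string. A minimal sketch of how it could be parsed follows; `parse_location` is a hypothetical helper, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re

def parse_location(text):
    """Parse 'scaffold : start .. end (strand)' into a dict (hypothetical helper)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "scaffold": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }

loc = parse_location("scaffold00557 : 319996 .. 320678 (-)")
print(loc["scaffold"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 683 positions on the minus strand of scaffold00557.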
The following sequences are available for this feature:

Gene sequence (with intron)

TTTATAGTGATGATCAGGAGATTCCTGGACAAATGGTGGTTGAAGGCTTCAAATGTTTTGACAATAAGTTTGTGAGTTTTTCTTATCCCTTTTAATTATTTTTCTTAAGATGATGAGTTGCATAATGTTTTGTTTATTATTTTTATAAGCTTTTATTGTTTTTAATAGATATACAATGGTTGTGAAAGTGCTTATAGATTGAACCCAAGTGGGAGTTTCAATGTCCCTCCCGAGGCTACTAACCTATTTTGCAATGGACCATGTTTGATCGAAACACAACTTCTGCTCAACTGCCTTGACAACACTTTTCAAAACTTCTTGTTCTACAACAAAGCCACCGCACAAAGCGTTCGAAACGCCCTCCGTGTTGGCTGCAGCTACTCGAGCCAAAGAGGTAGGCCATACGTTGTTGGCCAGCTACGTACCTTATGTTAACTTACAGAATGTGTATTATTGCTAATTTATATAAAAAAAAGAATTGTTATGTAAATAGTTTAGAATGATATAATTTGTAACACATGATCATCTTATTTGCCATTTGTTGTTTAATTGATTTAGGGAACTTCAATCAAGGATTGTTCATGCAAGGTGAATTTAGTGAAGCTCACACCTTCCAAAAATGGTTCATCCTCCATTATTCTCTTCTTTTTCTTGTTGGGTGTTGTTGTTTTCTCATCTTATAA

mRNA sequence

TTTATAGTGATGATCAGGAGATTCCTGGACAAATGGTGGTTGAAGGCTTCAAATGTTTTGACAATAAGTTTATATACAATGGTTGTGAAAGTGCTTATAGATTGAACCCAAGTGGGAGTTTCAATGTCCCTCCCGAGGCTACTAACCTATTTTGCAATGGACCATGTTTGATCGAAACACAACTTCTGCTCAACTGCCTTGACAACACTTTTCAAAACTTCTTGTTCTACAACAAAGCCACCGCACAAAGCGTTCGAAACGCCCTCCGTGTTGGCTGCAGCTACTCGAGCCAAAGAGGGAACTTCAATCAAGGATTGTTCATGCAAGGTGAATTTAGTGAAGCTCACACCTTCCAAAAATGGTTCATCCTCCATTATTCTCTTCTTTTTCTTGTTGGGTGTTGTTGTTTTCTCATCTTATAA

Coding sequence (CDS)

TTTATAGTGATGATCAGGAGATTCCTGGACAAATGGTGGTTGAAGGCTTCAAATGTTTTGACAATAAGTTTATATACAATGGTTGTGAAAGTGCTTATAGATTGAACCCAAGTGGGAGTTTCAATGTCCCTCCCGAGGCTACTAACCTATTTTGCAATGGACCATGTTTGATCGAAACACAACTTCTGCTCAACTGCCTTGACAACACTTTTCAAAACTTCTTGTTCTACAACAAAGCCACCGCACAAAGCGTTCGAAACGCCCTCCGTGTTGGCTGCAGCTACTCGAGCCAAAGAGGGAACTTCAATCAAGGATTGTTCATGCAAGGTGAATTTAGTGAAGCTCACACCTTCCAAAAATGGTTCATCCTCCATTATTCTCTTCTTTTTCTTGTTGGGTGTTGTTGTTTTCTCATCTTATAA

Protein sequence

YSDDQEIPGQMVVEGFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLFCNGPCLIETQLLLNCLDNTFQNFLFYNKATAQSVRNALRVGCSYSSQRGNFNQGLFMQGEFSEAHTFQKWFILHYSLLFLVGCCCFLIL*
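
Comparing the first bases of the coding sequence with the start of the protein suggests that translation begins at the third base (the first two bases, `TT`, do not form part of a codon). The sketch below illustrates this on the first 20 and last 12 bases of the CDS only. It assumes the standard genetic code; `translate` and the `offset` parameter are illustrative names, not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna, offset=0):
    """Translate complete codons of `dna`, starting at 0-based `offset`."""
    dna = dna.upper()
    codons = (dna[i:i + 3] for i in range(offset, len(dna) - 2, 3))
    # Unknown codons (e.g. containing N) are reported as X.
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)

cds_start = "TTTATAGTGATGATCAGGAG"  # first 20 bases of the CDS above
cds_end = "CTCATCTTATAA"            # last 12 bases of the CDS above

print(translate(cds_start, offset=2))  # start of the protein
print(translate(cds_end))              # end of the protein, including the stop
```

The two fragments translate to `YSDDQE` and `LIL*`, which agree with the beginning and end of the protein sequence listed above.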
BLAST of Cucsa.051140 vs. TrEMBL
Match: A0A0A0LYD5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G561940 PE=4 SV=1)

HSP 1 Score: 288.1 bits (736), Expect = 5.7e-75
Identity = 134/136 (98.53%), Positives = 134/136 (98.53%), Query Frame = 1

Query: 4   DQEIPGQMVVEGFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLFCNGPCLIETQLLL 63
           D EIPGQMVVEGFKCFDNKFIYNGCE AYRLNPSGSFNVPPEATNLFCNGPCLIETQLLL
Sbjct: 34  DTEIPGQMVVEGFKCFDNKFIYNGCERAYRLNPSGSFNVPPEATNLFCNGPCLIETQLLL 93

Query: 64  NCLDNTFQNFLFYNKATAQSVRNALRVGCSYSSQRGNFNQGLFMQGEFSEAHTFQKWFIL 123
           NCLDNTFQNFLFYNKATAQSVRNALRVGCSYSSQRGNFNQGLFMQGEFSEAHTFQKWFIL
Sbjct: 94  NCLDNTFQNFLFYNKATAQSVRNALRVGCSYSSQRGNFNQGLFMQGEFSEAHTFQKWFIL 153

Query: 124 HYSLLFLVGCCCFLIL 140
           HYSLLFLVGCCCFLIL
Sbjct: 154 HYSLLFLVGCCCFLIL 169
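
The percentages on each HSP statistics line are simply the match counts divided by the alignment length. As a rough sketch (the helper name `parse_hsp_stats` and the regular expression are assumptions about this page's text layout, not part of any BLAST tool), the line for the hit above can be parsed and the reported values rechecked:

```python
import re

def parse_hsp_stats(line):
    """Return {label: (matches, length, reported_percent)} for 'X = a/b (p%)' fields."""
    pattern = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")
    return {
        label: (int(a), int(b), float(p))
        for label, a, b, p in pattern.findall(line)
    }

line = "Identity = 134/136 (98.53%), Positives = 134/136 (98.53%), Query Frame = 1"
stats = parse_hsp_stats(line)

for label, (matches, length, reported) in stats.items():
    # Recompute the percentage from the raw counts and compare with the report.
    recomputed = round(100 * matches / length, 2)
    print(label, f"{matches}/{length}", reported, recomputed)
```

Here 134 of 136 aligned positions match, which rounds to the reported 98.53%.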

BLAST of Cucsa.051140 vs. TrEMBL
Match: A0A0D2PH82_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_007G304800 PE=4 SV=1)

HSP 1 Score: 152.9 bits (385), Expect = 2.9e-34
Identity = 68/134 (50.75%), Positives = 91/134 (67.91%), Query Frame = 1

Query: 3   DDQEIPGQMVVEGFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLFCNGPCLIETQLL 62
           + Q  P Q + + F CF+NK+IY GC+ AYRLN SG+ NVP EAT++FCNGPC  ETQL+
Sbjct: 85  NSQNFPAQSLAKAFFCFNNKYIYTGCDEAYRLNESGNLNVPREATDIFCNGPCFAETQLV 144

Query: 63  LNCLDNTFQNFLFYNKATAQSVRNALRVGCSYSSQRGNFNQGLFMQGEFSEAHTFQKWFI 122
           L C+DN   +F+FYNKAT   V N L  GCSY+++RGNF+ G + QGE SEA   + + I
Sbjct: 145 LKCVDNILSDFIFYNKATIGDVTNVLHAGCSYTNRRGNFDVGDYFQGEISEAPRLRSFII 204

Query: 123 LHYSLLFLVGCCCF 137
              +L  ++G   F
Sbjct: 205 SLSTLTLIIGSFMF 218

BLAST of Cucsa.051140 vs. TrEMBL
Match: M5WIC7_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa025734mg PE=4 SV=1)

HSP 1 Score: 150.2 bits (378), Expect = 1.9e-33
Identity = 72/138 (52.17%), Positives = 92/138 (66.67%), Query Frame = 1

Query: 5   QEIPGQMVVEGFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLFCNGPCLIETQLLLN 64
           +E P Q  V    CF+NKFIY GC+ AYRLN SG+FNVPPEAT+LFC+GPCL ETQ +LN
Sbjct: 30  EENPAQTFVTALACFNNKFIYAGCDEAYRLNESGNFNVPPEATDLFCHGPCLAETQQVLN 89

Query: 65  CLDNTFQNFLFYNKATAQSVRNALRVGCSYSSQRGNFNQ----GLFMQGEFSEAHTFQKW 124
           C+D+    F+F N+AT   +R ALR GCSY+SQRG FN     G ++QGE S A      
Sbjct: 90  CVDHMLSGFVFNNRATLPDIRGALRAGCSYTSQRGKFNGFGPFGEYIQGETSNAQKLPN- 149

Query: 125 FILHYSLLFLVGCCCFLI 139
           F   ++ L + GC  F++
Sbjct: 150 FSSFFTFLIVTGCSLFIL 166

BLAST of Cucsa.051140 vs. TrEMBL
Match: K7LJG9_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_10G147900 PE=4 SV=1)

HSP 1 Score: 149.8 bits (377), Expect = 2.4e-33
Identity = 65/109 (59.63%), Positives = 82/109 (75.23%), Query Frame = 1

Query: 6   EIPGQMVVEGFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLFCNGPCLIETQLLLNC 65
           ++ GQ +++   CFDNK IY GC+ AYRLNPSG+ N+PP AT+ FC+GPCL ETQL+LNC
Sbjct: 43  DLQGQNMLKALSCFDNKLIYVGCDEAYRLNPSGNINIPPVATDFFCSGPCLTETQLVLNC 102

Query: 66  LDNTFQNFLFYNKATAQSVRNALRVGCSYSSQRGNFNQGLFMQGEFSEA 115
           +DN   NF+FYNKAT Q +R AL  GCSYS QRGNFN   ++ GE + A
Sbjct: 103 IDNILSNFIFYNKATLQQMRYALNAGCSYSRQRGNFNLAEYIGGETNNA 151

BLAST of Cucsa.051140 vs. TrEMBL
Match: A0A0B2Q432_GLYSO (Uncharacterized protein OS=Glycine soja GN=glysoja_012046 PE=4 SV=1)

HSP 1 Score: 149.8 bits (377), Expect = 2.4e-33
Identity = 65/109 (59.63%), Positives = 82/109 (75.23%), Query Frame = 1

Query: 6   EIPGQMVVEGFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLFCNGPCLIETQLLLNC 65
           ++ GQ +++   CFDNK IY GC+ AYRLNPSG+ N+PP AT+ FC+GPCL ETQL+LNC
Sbjct: 32  DLQGQNMLKALSCFDNKLIYVGCDEAYRLNPSGNINIPPVATDFFCSGPCLTETQLVLNC 91

Query: 66  LDNTFQNFLFYNKATAQSVRNALRVGCSYSSQRGNFNQGLFMQGEFSEA 115
           +DN   NF+FYNKAT Q +R AL  GCSYS QRGNFN   ++ GE + A
Sbjct: 92  IDNILSNFIFYNKATLQQMRYALNAGCSYSRQRGNFNLAEYIGGETNNA 140

BLAST of Cucsa.051140 vs. TAIR10
Match: AT1G56320.1 (AT1G56320.1 BEST Arabidopsis thaliana protein match is: Glycine-rich protein family (TAIR:AT5G49350.2))

HSP 1 Score: 121.7 bits (304), Expect = 3.6e-28
Identity = 54/101 (53.47%), Positives = 69/101 (68.32%), Query Frame = 1

Query: 9   GQMVVEGFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLFCNGPCLIETQLLLNCLDN 68
           G +V     CF+N  +Y GC  A+RLN  G F VPPE T+ FCNGPC  ET+L+L C+++
Sbjct: 37  GILVQRAAFCFNNNLLYRGCNEAFRLNQQGEFKVPPEETDRFCNGPCSAETELVLTCINS 96

Query: 69  TFQNFLFYNKATAQSVRNALRVGCSYSSQRGNFNQGLFMQG 110
              +F+FYN+AT + VRNALR GCS S  RGNFN G + QG
Sbjct: 97  VMSDFVFYNRATPRDVRNALRGGCSSSFTRGNFNVGDYAQG 137

BLAST of Cucsa.051140 vs. TAIR10
Match: AT5G49350.1 (AT5G49350.1 Glycine-rich protein family)

HSP 1 Score: 94.7 bits (234), Expect = 4.7e-20
Identity = 39/97 (40.21%), Positives = 58/97 (59.79%), Query Frame = 1

Query: 6   EIPGQMVVEGFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLFCNGPCLIETQLLLNC 65
           E P ++V +  +C + K IY  C+ ++RL  +G  N+P   T  FC GPC  ET L LNC
Sbjct: 175 EDPPEIVAKALECLNEKHIYRECDESWRLTLNGDLNIPLGRTEEFCEGPCFSETHLALNC 234

Query: 66  LDNTFQNFLFYNKATAQSVRNALRVGCSYSSQRGNFN 103
           ++    ++ F+N+AT   +R  L+ GCSY  +RG FN
Sbjct: 235 IEEIIHHYRFFNRATIHDIRETLKSGCSYGPERGVFN 271

BLAST of Cucsa.051140 vs. NCBI nr
Match: gi|700210918|gb|KGN66014.1| (hypothetical protein Csa_1G561940 [Cucumis sativus])

HSP 1 Score: 288.1 bits (736), Expect = 8.2e-75
Identity = 134/136 (98.53%), Positives = 134/136 (98.53%), Query Frame = 1

Query: 4   DQEIPGQMVVEGFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLFCNGPCLIETQLLL 63
           D EIPGQMVVEGFKCFDNKFIYNGCE AYRLNPSGSFNVPPEATNLFCNGPCLIETQLLL
Sbjct: 34  DTEIPGQMVVEGFKCFDNKFIYNGCERAYRLNPSGSFNVPPEATNLFCNGPCLIETQLLL 93

Query: 64  NCLDNTFQNFLFYNKATAQSVRNALRVGCSYSSQRGNFNQGLFMQGEFSEAHTFQKWFIL 123
           NCLDNTFQNFLFYNKATAQSVRNALRVGCSYSSQRGNFNQGLFMQGEFSEAHTFQKWFIL
Sbjct: 94  NCLDNTFQNFLFYNKATAQSVRNALRVGCSYSSQRGNFNQGLFMQGEFSEAHTFQKWFIL 153

Query: 124 HYSLLFLVGCCCFLIL 140
           HYSLLFLVGCCCFLIL
Sbjct: 154 HYSLLFLVGCCCFLIL 169

BLAST of Cucsa.051140 vs. NCBI nr
Match: gi|659099339|ref|XP_008450550.1| (PREDICTED: uncharacterized protein LOC103492117 [Cucumis melo])

HSP 1 Score: 262.7 bits (670), Expect = 3.7e-67
Identity = 123/136 (90.44%), Positives = 128/136 (94.12%), Query Frame = 1

Query: 4   DQEIPGQMVVEGFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLFCNGPCLIETQLLL 63
           DQEI GQMVVE FKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLFCNGPCLIETQLLL
Sbjct: 26  DQEISGQMVVESFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLFCNGPCLIETQLLL 85

Query: 64  NCLDNTFQNFLFYNKATAQSVRNALRVGCSYSSQRGNFNQGLFMQGEFSEAHTFQKWFIL 123
           NCLDN+F NFLFYNKATAQSVRNALR+GCS+SSQRGNFNQGLFMQGE S+AHTFQ+WF L
Sbjct: 86  NCLDNSFHNFLFYNKATAQSVRNALRIGCSFSSQRGNFNQGLFMQGELSKAHTFQEWFFL 145

Query: 124 HYSLLFLVGCCCFLIL 140
           HYS L LVG CCFLIL
Sbjct: 146 HYSFLLLVG-CCFLIL 160

BLAST of Cucsa.051140 vs. NCBI nr
Match: gi|778665266|ref|XP_004135766.2| (PREDICTED: uncharacterized protein LOC101204762 [Cucumis sativus])

HSP 1 Score: 252.3 bits (643), Expect = 5.0e-64
Identity = 118/121 (97.52%), Positives = 119/121 (98.35%), Query Frame = 1

Query: 1   YSDDQEIPGQMVVEGFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLFCNGPCLIETQ 60
           YSDDQEIPGQMVVEGFKCFDNKFIYNGCE AYRLNPSGSFNVPPEATNLFCNGPCLIETQ
Sbjct: 24  YSDDQEIPGQMVVEGFKCFDNKFIYNGCERAYRLNPSGSFNVPPEATNLFCNGPCLIETQ 83

Query: 61  LLLNCLDNTFQNFLFYNKATAQSVRNALRVGCSYSSQRGNFNQGLFMQGEFSEAHTFQKW 120
           LLLNCLDNTFQNFLFYNKATAQSVRNALRVGCSYSSQRGNFNQGLFMQGEFSEAHTFQK 
Sbjct: 84  LLLNCLDNTFQNFLFYNKATAQSVRNALRVGCSYSSQRGNFNQGLFMQGEFSEAHTFQKC 143

Query: 121 F 122
           +
Sbjct: 144 Y 144

BLAST of Cucsa.051140 vs. NCBI nr
Match: gi|763778289|gb|KJB45412.1| (hypothetical protein B456_007G304800 [Gossypium raimondii])

HSP 1 Score: 152.9 bits (385), Expect = 4.1e-34
Identity = 68/134 (50.75%), Positives = 91/134 (67.91%), Query Frame = 1

Query: 3   DDQEIPGQMVVEGFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLFCNGPCLIETQLL 62
           + Q  P Q + + F CF+NK+IY GC+ AYRLN SG+ NVP EAT++FCNGPC  ETQL+
Sbjct: 85  NSQNFPAQSLAKAFFCFNNKYIYTGCDEAYRLNESGNLNVPREATDIFCNGPCFAETQLV 144

Query: 63  LNCLDNTFQNFLFYNKATAQSVRNALRVGCSYSSQRGNFNQGLFMQGEFSEAHTFQKWFI 122
           L C+DN   +F+FYNKAT   V N L  GCSY+++RGNF+ G + QGE SEA   + + I
Sbjct: 145 LKCVDNILSDFIFYNKATIGDVTNVLHAGCSYTNRRGNFDVGDYFQGEISEAPRLRSFII 204

Query: 123 LHYSLLFLVGCCCF 137
              +L  ++G   F
Sbjct: 205 SLSTLTLIIGSFMF 218

BLAST of Cucsa.051140 vs. NCBI nr
Match: gi|823197371|ref|XP_012434254.1| (PREDICTED: uncharacterized protein LOC105761106 [Gossypium raimondii])

HSP 1 Score: 152.9 bits (385), Expect = 4.1e-34
Identity = 68/134 (50.75%), Positives = 91/134 (67.91%), Query Frame = 1

Query: 3   DDQEIPGQMVVEGFKCFDNKFIYNGCESAYRLNPSGSFNVPPEATNLFCNGPCLIETQLL 62
           + Q  P Q + + F CF+NK+IY GC+ AYRLN SG+ NVP EAT++FCNGPC  ETQL+
Sbjct: 28  NSQNFPAQSLAKAFFCFNNKYIYTGCDEAYRLNESGNLNVPREATDIFCNGPCFAETQLV 87

Query: 63  LNCLDNTFQNFLFYNKATAQSVRNALRVGCSYSSQRGNFNQGLFMQGEFSEAHTFQKWFI 122
           L C+DN   +F+FYNKAT   V N L  GCSY+++RGNF+ G + QGE SEA   + + I
Sbjct: 88  LKCVDNILSDFIFYNKATIGDVTNVLHAGCSYTNRRGNFDVGDYFQGEISEAPRLRSFII 147

Query: 123 LHYSLLFLVGCCCF 137
              +L  ++G   F
Sbjct: 148 SLSTLTLIIGSFMF 161

The following BLAST results are available for this feature:

TrEMBL
Match Name        E-value  Identity (%)  Description
A0A0A0LYD5_CUCSA  5.7e-75  98.53         Uncharacterized protein OS=Cucumis sativus GN=Csa_1G561940 PE=4 SV=1
A0A0D2PH82_GOSRA  2.9e-34  50.75         Uncharacterized protein OS=Gossypium raimondii GN=B456_007G304800 PE=4 SV=1
M5WIC7_PRUPE      1.9e-33  52.17         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa025734mg PE=4 SV=1
K7LJG9_SOYBN      2.4e-33  59.63         Uncharacterized protein OS=Glycine max GN=GLYMA_10G147900 PE=4 SV=1
A0A0B2Q432_GLYSO  2.4e-33  59.63         Uncharacterized protein OS=Glycine soja GN=glysoja_012046 PE=4 SV=1

TAIR10
Match Name   E-value  Identity (%)  Description
AT1G56320.1  3.6e-28  53.47         BEST Arabidopsis thaliana protein match is: Glycine-rich protein fam...
AT5G49350.1  4.7e-20  40.21         Glycine-rich protein family

NCBI nr
Match Name                        E-value  Identity (%)  Description
gi|700210918|gb|KGN66014.1|       8.2e-75  98.53         hypothetical protein Csa_1G561940 [Cucumis sativus]
gi|659099339|ref|XP_008450550.1|  3.7e-67  90.44         PREDICTED: uncharacterized protein LOC103492117 [Cucumis melo]
gi|778665266|ref|XP_004135766.2|  5.0e-64  97.52         PREDICTED: uncharacterized protein LOC101204762 [Cucumis sativus]
gi|763778289|gb|KJB45412.1|       4.1e-34  50.75         hypothetical protein B456_007G304800 [Gossypium raimondii]
gi|823197371|ref|XP_012434254.1|  4.1e-34  50.75         PREDICTED: uncharacterized protein LOC105761106 [Gossypium raimondii]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Cucsa.051140.1  Cucsa.051140.1  mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR Term  IPR Description   Source   Source Term    Source Description  Alignment
None      No IPR available  PANTHER  PTHR34366      FAMILY NOT NAMED    coord: 1..114, score: 1.2
None      No IPR available  PANTHER  PTHR34366:SF3  SUBFAMILY NOT NAMED coord: 1..114, score: 1.2