Sed0012386 (gene) Chayote v1

Overview
Name: Sed0012386
Type: gene
Organism: Sechium edule (Chayote v1)
Description: Precursor of CEP3
Location: LG10: 4262848 .. 4263512 (-)
RNA-Seq Expression: Sed0012386
Synteny: Sed0012386
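
The Location value packs the linkage group, start, end, and strand into one string. A minimal Python sketch for unpacking it follows. It assumes that other records use the same `name: start .. end (strand)` layout and that the coordinates are 1-based and inclusive; neither is stated on this page. The function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Split a string such as 'LG10: 4262848 .. 4263512 (-)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("LG10: 4262848 .. 4263512 (-)")
print(loc)
```

Under that assumption the feature spans 665 bp on the minus strand of LG10.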
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AAATCTTAACCCTTCTAAGATCTCTCATTAGAGCTTTTGCCTCAATTTTCTTTTGTTGAAATTTGTTCAAATCTATGGCACCAACCAAGCAAAGCTTTATCTTTCTTTTCATTGGCCTAATAATTTTGTGTCATGTAATAGATTCTGCCTTTAGCAGACCACTGAAATCAGACACAAATGACCAACTCTCTGAGAGCCCTACAAAAGGGGTTCATTTTGAATTCCATGGTGAAAGTGTTGACGAAGGATCCGTCCCGAACCCGCCCGCTGCAGCGTCGAGTTCGTCTTCGGGCCGTAAGGTGGATGACTTCAGGCCAACCACCCCTGGCCACAGCCCTGGCGTTGGCCATGCCATTGAGACCTAAAGCTCCAACGTCAGTATCATCATCATTATTATTATTGGTTATTTCTCAACAAAGAAAAAAAGAAGACAAAATGGTGTGACATGGATTATAAAGTAAGCTCTATATTATTTTGTGTTGTATTATTATTGGTATTACTTATAATCTATTTCTCCGTGAAGCATTTTATGCGGGCCGGATAAAGCAAAGTCGGTCTAGTGTGAAGTTTATGTAATTTAACTTGGATTGCTTTCATGAAAAAAAAGGAATGTAAACCATGCATTATATATTTTTCAATGGAAATGAAAATTTGTGTATTTGCTT

mRNA sequence

AAATCTTAACCCTTCTAAGATCTCTCATTAGAGCTTTTGCCTCAATTTTCTTTTGTTGAAATTTGTTCAAATCTATGGCACCAACCAAGCAAAGCTTTATCTTTCTTTTCATTGGCCTAATAATTTTGTGTCATGTAATAGATTCTGCCTTTAGCAGACCACTGAAATCAGACACAAATGACCAACTCTCTGAGAGCCCTACAAAAGGGGTTCATTTTGAATTCCATGGTGAAAGTGTTGACGAAGGATCCGTCCCGAACCCGCCCGCTGCAGCGTCGAGTTCGTCTTCGGGCCGTAAGGTGGATGACTTCAGGCCAACCACCCCTGGCCACAGCCCTGGCGTTGGCCATGCCATTGAGACCTAAAGCTCCAACGTCAGTATCATCATCATTATTATTATTGGTTATTTCTCAACAAAGAAAAAAAGAAGACAAAATGGTGTGACATGGATTATAAAGTAAGCTCTATATTATTTTGTGTTGTATTATTATTGGTATTACTTATAATCTATTTCTCCGTGAAGCATTTTATGCGGGCCGGATAAAGCAAAGTCGGTCTAGTGTGAAGTTTATGTAATTTAACTTGGATTGCTTTCATGAAAAAAAAGGAATGTAAACCATGCATTATATATTTTTCAATGGAAATGAAAATTTGTGTATTTGCTT

Coding sequence (CDS)

ATGGCACCAACCAAGCAAAGCTTTATCTTTCTTTTCATTGGCCTAATAATTTTGTGTCATGTAATAGATTCTGCCTTTAGCAGACCACTGAAATCAGACACAAATGACCAACTCTCTGAGAGCCCTACAAAAGGGGTTCATTTTGAATTCCATGGTGAAAGTGTTGACGAAGGATCCGTCCCGAACCCGCCCGCTGCAGCGTCGAGTTCGTCTTCGGGCCGTAAGGTGGATGACTTCAGGCCAACCACCCCTGGCCACAGCCCTGGCGTTGGCCATGCCATTGAGACCTAA

Protein sequence

MAPTKQSFIFLFIGLIILCHVIDSAFSRPLKSDTNDQLSESPTKGVHFEFHGESVDEGSVPNPPAAASSSSSGRKVDDFRPTTPGHSPGVGHAIET
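
As a consistency check, the coding sequence above can be translated and compared with the listed protein. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which the page does not state explicitly. The names `CODON_TABLE` and `translate` are illustrative, not taken from any library.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)

# CDS and protein copied from this record.
CDS = (
    "ATGGCACCAACCAAGCAAAGCTTTATCTTTCTTTTCATTGGCCTAATAATTTTGTGTCAT"
    "GTAATAGATTCTGCCTTTAGCAGACCACTGAAATCAGACACAAATGACCAACTCTCTGAG"
    "AGCCCTACAAAAGGGGTTCATTTTGAATTCCATGGTGAAAGTGTTGACGAAGGATCCGTC"
    "CCGAACCCGCCCGCTGCAGCGTCGAGTTCGTCTTCGGGCCGTAAGGTGGATGACTTCAGG"
    "CCAACCACCCCTGGCCACAGCCCTGGCGTTGGCCATGCCATTGAGACCTAA"
)
PROTEIN = (
    "MAPTKQSFIFLFIGLIILCHVIDSAFSRPLKSDTNDQLSESPTKGVHFEFHGESVDEGSV"
    "PNPPAAASSSSSGRKVDDFRPTTPGHSPGVGHAIET"
)

translated = translate(CDS)
print(len(translated), translated == PROTEIN)
```

The 291-nt CDS holds 96 sense codons followed by a TAA stop, which is consistent with the 96-residue protein listed above.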
Homology
BLAST of Sed0012386 vs. NCBI nr
Match: KAG6604187.1 (hypothetical protein SDJN03_04796, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 131.3 bits (329), Expect = 4.3e-27
Identity = 67/99 (67.68%), Positives = 79/99 (79.80%), Query Frame = 0

Query: 1  MAPTKQSFIFLFIGLIILCHVIDSAFSRPLKSDTNDQLSESPTKGVHFEFHGESVDEG-- 60
          MAPTK SF FLF  LI+LC+V +S  SRPLK+DTN QL E P K  HF+FHGESV+EG  
Sbjct: 1  MAPTKLSFAFLFTALIVLCYVTESVLSRPLKTDTNHQLFERPAKDGHFQFHGESVEEGNG 60

Query: 61 -SVPNP-PAAASSSSSGRKVDDFRPTTPGHSPGVGHAIE 96
           + PNP  AAA++SS GRK+DDFRPTTPGHSPGVGH+I+
Sbjct: 61 TTNPNPLTAAAAASSPGRKMDDFRPTTPGHSPGVGHSIQ 99
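
The Identity and Positives percentages in an HSP header are the respective match counts divided by the alignment length, where the length counts alignment columns including gaps. A small Python sketch reproducing the figures of the first hit follows; `percent` is a hypothetical helper, not a BLAST function.

```python
def percent(matches, alignment_length):
    """Return matches / alignment_length as a percentage with two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return f"{100 * matches / alignment_length:.2f}"

# Counts taken from the KAG6604187.1 HSP header.
identity = percent(67, 99)
positives = percent(79, 99)
print(identity, positives)
```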

BLAST of Sed0012386 vs. NCBI nr
Match: KGN48516.1 (hypothetical protein Csa_002724 [Cucumis sativus])

HSP 1 Score: 125.2 bits (313), Expect = 3.1e-25
Identity = 63/106 (59.43%), Positives = 79/106 (74.53%), Query Frame = 0

Query: 1   MAPTKQSFIFLFIGLIILCHVIDSAFSRPLKSDT-NDQLSESPTKGVHFEFHGESVDEGS 60
           MAPTK SF FLFI L+ILCH++D AFSRPL + T + QLS++  K  HF+ HG+++ EG 
Sbjct: 5   MAPTKLSFAFLFISLLILCHLVDPAFSRPLTTHTIHQQLSDTLPKNPHFQLHGQTLHEGK 64

Query: 61  V----------PNPPAAASSSSSGRKVDDFRPTTPGHSPGVGHAIE 96
                      PNPP+AA+SS+ GRK+DDFRPTTPGHSPGVGH+IE
Sbjct: 65  ASNDDAVSTPNPNPPSAAASSTPGRKMDDFRPTTPGHSPGVGHSIE 110

BLAST of Sed0012386 vs. NCBI nr
Match: KAG7027066.1 (Precursor of CEP3, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 121.3 bits (303), Expect = 4.5e-24
Identity = 62/102 (60.78%), Positives = 71/102 (69.61%), Query Frame = 0

Query: 1   MAPTKQSFIFLFIGLIILCHVIDSAFSRPLKSDTNDQLSESPTKGVHFEFHGESVDEG-- 60
           M PT  S  FLFI L+I  H IDSA SRPLK+ TN QLS    K  HF+FHG+SV EG  
Sbjct: 1   MLPTNLSLAFLFIALLIFSHFIDSASSRPLKTHTNHQLSARSAKEAHFQFHGQSVAEGKG 60

Query: 61  -----SVPNPPAAASSSSSGRKVDDFRPTTPGHSPGVGHAIE 96
                S P+PP  A+ SS GRK+DDFRPTTPGHSPGVGH+I+
Sbjct: 61  TDGAVSTPSPPTTAAGSSPGRKMDDFRPTTPGHSPGVGHSIQ 102

BLAST of Sed0012386 vs. NCBI nr
Match: KAG6595043.1 (Precursor of CEP3, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 121.3 bits (303), Expect = 4.5e-24
Identity = 62/102 (60.78%), Positives = 71/102 (69.61%), Query Frame = 0

Query: 1   MAPTKQSFIFLFIGLIILCHVIDSAFSRPLKSDTNDQLSESPTKGVHFEFHGESVDEG-- 60
           M PT  S  FLFI L+I  H IDSA SRPLK+ TN QLS    K  HF+FHG+SV EG  
Sbjct: 1   MPPTNLSLAFLFIALLIFSHFIDSASSRPLKTHTNHQLSARSAKEAHFQFHGQSVAEGKG 60

Query: 61  -----SVPNPPAAASSSSSGRKVDDFRPTTPGHSPGVGHAIE 96
                S P+PP  A+ SS GRK+DDFRPTTPGHSPGVGH+I+
Sbjct: 61  TDGAVSTPSPPTTAAGSSPGRKMDDFRPTTPGHSPGVGHSIQ 102

BLAST of Sed0012386 vs. NCBI nr
Match: TYK12897.1 (hypothetical protein E5676_scaffold255G004910 [Cucumis melo var. makuwa])

HSP 1 Score: 118.6 bits (296), Expect = 2.9e-23
Identity = 62/106 (58.49%), Positives = 74/106 (69.81%), Query Frame = 0

Query: 1   MAPTKQSFIFLFIGLIILCHVIDSAFSRPLKSDT-NDQLSESPTKGVHFEFHGESVDEGS 60
           MA TK SF FLFI L+ILCH+ID AFSRPL + T + QLS +     HF+FHG +V EG 
Sbjct: 1   MASTKLSFAFLFISLLILCHLIDPAFSRPLTTHTDHQQLSNTLPTNPHFQFHGHTVHEGK 60

Query: 61  V----------PNPPAAASSSSSGRKVDDFRPTTPGHSPGVGHAIE 96
                      PNPP AA+ S+ GRK+DDFRPTTPGHSPGVGH+I+
Sbjct: 61  ASNNDAVSTPNPNPPTAAAGSTPGRKMDDFRPTTPGHSPGVGHSIK 106

BLAST of Sed0012386 vs. ExPASy Swiss-Prot
Match: Q058G9 (Precursor of CEP5 OS=Arabidopsis thaliana OX=3702 GN=CEP5 PE=1 SV=1)

HSP 1 Score: 49.7 bits (117), Expect = 2.2e-05
Identity = 20/34 (58.82%), Positives = 25/34 (73.53%), Query Frame = 0

Query: 61  PNPPAAASSSSSGRKVDDFRPTTPGHSPGVGHAI 95
           P PP    S S G+  +DFRPTTPGHSPG+GH++
Sbjct: 69  PPPPPPPPSQSGGKDAEDFRPTTPGHSPGIGHSL 102

BLAST of Sed0012386 vs. ExPASy TrEMBL
Match: A0A0A0KLC0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G490800 PE=3 SV=1)

HSP 1 Score: 125.2 bits (313), Expect = 1.5e-25
Identity = 63/106 (59.43%), Positives = 79/106 (74.53%), Query Frame = 0

Query: 1   MAPTKQSFIFLFIGLIILCHVIDSAFSRPLKSDT-NDQLSESPTKGVHFEFHGESVDEGS 60
           MAPTK SF FLFI L+ILCH++D AFSRPL + T + QLS++  K  HF+ HG+++ EG 
Sbjct: 5   MAPTKLSFAFLFISLLILCHLVDPAFSRPLTTHTIHQQLSDTLPKNPHFQLHGQTLHEGK 64

Query: 61  V----------PNPPAAASSSSSGRKVDDFRPTTPGHSPGVGHAIE 96
                      PNPP+AA+SS+ GRK+DDFRPTTPGHSPGVGH+IE
Sbjct: 65  ASNDDAVSTPNPNPPSAAASSTPGRKMDDFRPTTPGHSPGVGHSIE 110

BLAST of Sed0012386 vs. ExPASy TrEMBL
Match: A0A5D3CLS2 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G004910 PE=3 SV=1)

HSP 1 Score: 118.6 bits (296), Expect = 1.4e-23
Identity = 62/106 (58.49%), Positives = 74/106 (69.81%), Query Frame = 0

Query: 1   MAPTKQSFIFLFIGLIILCHVIDSAFSRPLKSDT-NDQLSESPTKGVHFEFHGESVDEGS 60
           MA TK SF FLFI L+ILCH+ID AFSRPL + T + QLS +     HF+FHG +V EG 
Sbjct: 1   MASTKLSFAFLFISLLILCHLIDPAFSRPLTTHTDHQQLSNTLPTNPHFQFHGHTVHEGK 60

Query: 61  V----------PNPPAAASSSSSGRKVDDFRPTTPGHSPGVGHAIE 96
                      PNPP AA+ S+ GRK+DDFRPTTPGHSPGVGH+I+
Sbjct: 61  ASNNDAVSTPNPNPPTAAAGSTPGRKMDDFRPTTPGHSPGVGHSIK 106

BLAST of Sed0012386 vs. ExPASy TrEMBL
Match: A0A2K3N1M2 (Uncharacterized protein OS=Trifolium pratense OX=57577 GN=L195_g020169 PE=3 SV=1)

HSP 1 Score: 60.5 bits (145), Expect = 4.5e-06
Identity = 40/113 (35.40%), Positives = 60/113 (53.10%), Query Frame = 0

Query: 1   MAPTKQSFIFLFIGLIILCHVIDSAFSRPLKSDTNDQLSESPTKGVHFEFHGESVDEGSV 60
           MA  K  F  +F+ LI+L    +S   R LKS   +++++SP K  H   + ++V  GS+
Sbjct: 1   MAQNKSIFSLIFVALIVLSQTFESIEGRYLKS---NEVNQSPMK--HNNANNDNVVHGSI 60

Query: 61  P----------NPPAAASSSSS---------GRKVDDFRPTTPGHSPGVGHAI 95
                      +PP+   + ++         GR V DFRPTTPGHSPG+GH+I
Sbjct: 61  SISNAEKLTSMSPPSVVVNGATGEPSPPPTPGRGVSDFRPTTPGHSPGIGHSI 108

BLAST of Sed0012386 vs. ExPASy TrEMBL
Match: D7LFB9 (Uncharacterized protein OS=Arabidopsis lyrata subsp. lyrata OX=81972 GN=ARALYDRAFT_900830 PE=3 SV=1)

HSP 1 Score: 59.3 bits (142), Expect = 1.0e-05
Identity = 35/102 (34.31%), Positives = 55/102 (53.92%), Query Frame = 0

Query: 1  MAPTKQSFIFLFIGLIILCHVIDSAFSRPLKSDTNDQLSESPTKGVHFEFHGESVDEGSV 60
          MA T++  ++LF+ ++++C +ID A S  L+    +  S       H   H     +G++
Sbjct: 1  MAKTRR-VVYLFLSILLICEIIDEAQSSRLRCHHREDYSCKKRSSHHHHHHHH--HKGTL 60

Query: 61 PNPPAAASSSSSGRK------VDDFRPTTPGHSPGVGHAIET 97
            P    S+S   R+      +D FRPT PGHSPGVGH+I+T
Sbjct: 61 SEPNLRGSNSIKARRSKDIYGLDAFRPTAPGHSPGVGHSIKT 99

BLAST of Sed0012386 vs. ExPASy TrEMBL
Match: A0A4D6LRW0 (Uncharacterized protein OS=Vigna unguiculata OX=3917 GN=DEO72_LG4g2282 PE=3 SV=1)

HSP 1 Score: 59.3 bits (142), Expect = 1.0e-05
Identity = 38/97 (39.18%), Positives = 53/97 (54.64%), Query Frame = 0

Query: 1  MAPTKQSFIFLFIGLIILCHVIDSAFSRPLKSD--TNDQLSES-PTKGVHFEFHGESVDE 60
          MA  K +   + + L+IL   ++S   R LK D  T  Q+ E  PT  V       + D 
Sbjct: 1  MAQKKLTVCLMLVALVILLQGLESIEGRLLKLDETTEHQMQERIPTTNV------AAFDT 60

Query: 61 GSVPNPPAAASSSSSGRKVDDFRPTTPGHSPGVGHAI 95
              +PP   S+++ GR VD+FRPT PGHSPGVGH++
Sbjct: 61 DVSVSPPTPPSAAAPGRDVDNFRPTAPGHSPGVGHSV 91

BLAST of Sed0012386 vs. TAIR 10
Match: AT5G66815.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: root; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 49.7 bits (117), Expect = 1.5e-06
Identity = 20/34 (58.82%), Positives = 25/34 (73.53%), Query Frame = 0

Query: 61  PNPPAAASSSSSGRKVDDFRPTTPGHSPGVGHAI 95
           P PP    S S G+  +DFRPTTPGHSPG+GH++
Sbjct: 69  PPPPPPPPSQSGGKDAEDFRPTTPGHSPGIGHSL 102

BLAST of Sed0012386 vs. TAIR 10
Match: AT3G50610.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G66816.1); Has 125 Blast hits to 60 proteins in 16 species: Archae - 0; Bacteria - 2; Metazoa - 10; Fungi - 4; Plants - 97; Viruses - 0; Other Eukaryotes - 12 (source: NCBI BLink). )

HSP 1 Score: 43.5 bits (101), Expect = 1.1e-04
Identity = 20/34 (58.82%), Positives = 25/34 (73.53%), Query Frame = 0

Query: 63  PPAAASSSSSG-RKVDDFRPTTPGHSPGVGHAIE 96
           P    +S   G +K DDF+PTTPGHSPGVGHA++
Sbjct: 190 PTTPGNSPGMGHKKGDDFKPTTPGHSPGVGHAVK 223

BLAST of Sed0012386 vs. TAIR 10
Match: AT5G66816.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G50610.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 41.2 bits (95), Expect = 5.5e-04
Identity = 24/65 (36.92%), Positives = 31/65 (47.69%), Query Frame = 0

Query: 31 KSDTNDQLSESPTKGVHFEFHGESVDEGSVPNPPAAASSSSSGRKVDDFRPTTPGHSPGV 90
          K+D  D      T G   +F   S             +  ++G   DDF PTTPGHSPGV
Sbjct: 31 KTDDQDHDDHHFTVGYTDDFGPTSPGNSPGIGHKMKENEENAGGYKDDFEPTTPGHSPGV 90

Query: 91 GHAIE 96
          GHA++
Sbjct: 91 GHAVK 95

BLAST of Sed0012386 vs. TAIR 10
Match: AT2G23440.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: root; Has 25 Blast hits to 25 proteins in 6 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 25; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 40.8 bits (94), Expect = 7.2e-04
Identity = 17/33 (51.52%), Positives = 21/33 (63.64%), Query Frame = 0

Query: 62 NPPAAASSSSSGRKVDDFRPTTPGHSPGVGHAI 95
          +PP     S     VD FRPT PGHSPG+GH++
Sbjct: 48 SPPTEPLESPPSHGVDTFRPTEPGHSPGIGHSV 80

The following BLAST results are available for this feature:

NCBI nr
Match Name    E-value  Identity (%)  Description
KAG6604187.1  4.3e-27  67.68         hypothetical protein SDJN03_04796, partial [Cucurbita argyrosperma subsp. sorori...
KGN48516.1    3.1e-25  59.43         hypothetical protein Csa_002724 [Cucumis sativus]
KAG7027066.1  4.5e-24  60.78         Precursor of CEP3, partial [Cucurbita argyrosperma subsp. argyrosperma]
KAG6595043.1  4.5e-24  60.78         Precursor of CEP3, partial [Cucurbita argyrosperma subsp. sororia]
TYK12897.1    2.9e-23  58.49         hypothetical protein E5676_scaffold255G004910 [Cucumis melo var. makuwa]

ExPASy Swiss-Prot
Match Name    E-value  Identity (%)  Description
Q058G9        2.2e-05  58.82         Precursor of CEP5 OS=Arabidopsis thaliana OX=3702 GN=CEP5 PE=1 SV=1

ExPASy TrEMBL
Match Name    E-value  Identity (%)  Description
A0A0A0KLC0    1.5e-25  59.43         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G490800 PE=3 SV=1
A0A5D3CLS2    1.4e-23  58.49         Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A2K3N1M2    4.5e-06  35.40         Uncharacterized protein OS=Trifolium pratense OX=57577 GN=L195_g020169 PE=3 SV=1
D7LFB9        1.0e-05  34.31         Uncharacterized protein OS=Arabidopsis lyrata subsp. lyrata OX=81972 GN=ARALYDRA...
A0A4D6LRW0    1.0e-05  39.18         Uncharacterized protein OS=Vigna unguiculata OX=3917 GN=DEO72_LG4g2282 PE=3 SV=1

TAIR 10
Match Name    E-value  Identity (%)  Description
AT5G66815.1   1.5e-06  58.82         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G50610.1   1.1e-04  58.82         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G66816.1   5.5e-04  36.92         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT2G23440.1   7.2e-04  51.52         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...

InterPro
Analysis Name: InterPro Annotations of Chayote (edule) v1
Date Performed: 2021-10-25
IPR Term   IPR Description               Source       Source Term     Source Description   Alignment
None       No IPR available              MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 30..96
None       No IPR available              MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 63..77
None       No IPR available              PANTHER      PTHR33348:SF28  PRECURSOR OF CEP7    coord: 1..94
IPR033250  C-terminally encoded peptide  PANTHER      PTHR33348       PRECURSOR OF CEP5    coord: 1..94

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Sed0012386.1  Sed0012386.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:1902025 nitrate import
biological_process GO:1901371 regulation of leaf morphogenesis
biological_process GO:2000280 regulation of root development
biological_process GO:0048364 root development
cellular_component GO:0005576 extracellular region
molecular_function GO:0005179 hormone activity