Sed0010339 (gene) Chayote v1

Overview
Name: Sed0010339
Type: gene
Organism: Sechium edule (Chayote v1)
Description: GAG1At protein
Location: LG09: 37938376 .. 37941341 (+)
RNA-Seq Expression: Sed0010339
Synteny: Sed0010339
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CAAAAACACCGTAACCAAACGCTCCAAATATGCACAGAGCATTTGAGCAAAACCGGAAGTGGAGACCCGGAGGAGCTAATCTCACTTCGACAGAATCGCCGGCCAAAATGAGCGACGGTGCGAGCTCCGGCGGCGGCGGCGGCGGCTTCCGATCGAGTATGGAGCGATATTTATACAGCGGCGACAAAAAGCACGTCGCGGCCGGCATCGCCGTCATCGGTGTCCTCTTCGGCATCCCTTGGGTACTCATGAACCGAGGTCTCTCTTCTTTTTCCTTATCCAATTTTTGTTTTCTGTTAATCTATAATCGATGTTTTGTTTGGATGTTAAGAGAAATTGATTTATGGGAAAAGAGAATCGAACTGTTTTGAGTAATTGTGTTTTTGATTGCCCTGGATTTGAAGAAACCTTGAAATATTGTTGTGTGCAGTGGAGTTAGGGGACCTTGATTCTTGATTTGGAGTTTTGTTTTTAACTTACTTCTTAGTTTGTTTGAAGAACGAATAGGGAATTTGGCAATCTGGCAATCTTTGGATTGCAATTGTGGACTATTTTTATTTTGATAATTGTGACTTTCTCGGCTAGTTTACGTGCACTTCGGTTAATGTCATGGGATATAACGTCTGACGTAACAACATTTGGATGTCAAGGGAACACATAGGGGTGTTTGTTTAACATATTTACTGAGTATTTAGCTTAAGAAGTTATATTGAAGAATTATTTTGAGTAGTTAAAAATCGAGAATTGTAGTAAAAAACTTGTCTAGATGTAGTTTTTGTGTAGAGGAGTTAGAGTCAAAACCTTGTCTGGATGTAGATTTAAAATAAGTTGTTATTTTGAGTAGTTAAAAATAATGATTAAAAATGAGTTATTTTGATTGGTCCCCAAGGGGTGGCATAGTGGTTGAAAACTTGGGCTTTGAAGGTATGCTCCCTTCAAGGTCCCAGGTTCGAGGCTTACCTGTGATATTACTCCTTCGATGTCTCCCAGTGCTTGACCTAGGGACGGGCGTGGTTACTCTTGTTTAAAAAAAAAGGAGATATTTTGATTGATTTGAAATTAATGGTTAAATATGAGTGGTTAGAAATAATTGAGTAGTTATAAATAAATGATTAAAAATAAATAGTTAGAAATGAGAGTTATTTTTAAAAGTAAAAATGAAGAGTTAGTAAAAACTCGCTCAAATATAGGTTTTCGAGGAGAGTTAGTAAATCCTCAAGTATGACAAAGAGGACTATTGAGAGTAAGTGGAAGTTCGGGTTGTAAATTTTCTGTTGGGTTGCTCCTGTAAATTTCGAATTAGAAACGGTAGCTTATCAGGGCAGTGTCATTGAGAATTTTGATTTTGGTTTTCGAACTTGAGAACTATTGAGGATTCTACATTGGTCAAATGAAAAGAACTCTTATCATTTATAAGATAGTCCAATTGGTTTTGAGTTGGAACCTCATGCTGATCTAATATCATACCAGATCTTATTAAACCTGAGCAAGTGTTTGGCCCAAAAATTAAAATTGAAACTCGATCCAAGAATGAATCCAAGAGGCATGGAGGGGACGTGGTGAAGATCTTACATTGAAAAAATGAGGAGACCTCATTTATTTATAAGATAGATGAAGTATTGCAAAGTCCAATCATTATCAATTGGTTTTGCATTGAAACTCCTACTAATTTAATAAGAACCAGTGTTTTAAAAAGCGCAAGGCGCACCAAAAGGCGAGGTTTTTTTTTTACCAAAGGCGCACTATATAAAAAAAACACAAAAAATATATATATGTGTGTGTGTATACACAATAAAAACATTTCCATAGATATAAATGAAGTTTTAACCAAGAATCATATATAAAACATACATGACATTTAACTATTAACCTTCATTCATGAATAGAAATGAAGTTTAACAAAGAATCTATATAAAACATACATAAGATTCAACTATTAACCTTTCATTTTTAAGCGAATGTTGTCTTTGTGCGTTAAAAAATTAAGTTATTTAAGGCAT
TGAAATGTCTTGAATAAGTTATGTTGCGCTTCCCAAGCGAGGCGCTAAAGGTGCGACTTACGCCCAAGCGCGCGCCTTTTAAAACACTGATAAGAACAACTCGAAATAAAAAAGTCATGCCTTGGGGCTGTTTGGAAATTAGGTGGTGCTCTAGATAGAACAACATATTTGATCGTGTCTTTAAAAAGTTTGTTTTTCAGCTTACTACTTCTAATAAATGACCTCTACATTCGAGTATAAGTATTAACACAAAAACACACAGATACACAGAAAATCTTATAGGTGAAGATTGAGAATGCTTTCCTTGTTCCTTTTAGGCATCCAAATCTACACCCCATATAATAATTTGTGGCTTTACTTAAATTCAATACGATAGATTACTAGGTCCCTGAGGAATGGCGGAGTGGTTGAAGACTTGAGTTTTGAAGGAGCGCTCCCTTCAAGGTCTCAGGTTTGAGACTCACTTGTGACATTAATTTGTAGACACCTTTCGTATCTCCATCCCTTCGATGTTTCGCGGTGCCTGGCCAAGAGACTGGCTTGATTACCCCAAGTATAGTGGAGCGAAGCTCCGATTTTCCAATTTTTAAAAAAATGAAATTACTAGGAGGTCGTAAATTTACACGAATGGACTTATCACCACTTAATTGTTTACAGGATCAAAACATCGGTCTCATCAAGATTACATGGAAAAAGCTGACAAAGCACGAAGTCAGAGACTCTCTTCAGGTTCATCATCAGCTAAATGATTTTTATTTAATTTTTTTTTTCTGGCTTTCTTCTGATAGGAGATGTTCTTTGGAGCAGTTAGTTTACATAACTTTCTTGTAATAGCTCATATATTTTTCATATTTTTCATATTTTTCTATCTCTGCTGACATTAAATGGAGACAGGTAAGTTACACTCACCTAGGTACACTCATGGTTATGAAATAAAGGACAGATTTTTTTTTGTTCCTTTTATAA

mRNA sequence

CAAAAACACCGTAACCAAACGCTCCAAATATGCACAGAGCATTTGAGCAAAACCGGAAGTGGAGACCCGGAGGAGCTAATCTCACTTCGACAGAATCGCCGGCCAAAATGAGCGACGGTGCGAGCTCCGGCGGCGGCGGCGGCGGCTTCCGATCGAGTATGGAGCGATATTTATACAGCGGCGACAAAAAGCACGTCGCGGCCGGCATCGCCGTCATCGGTGTCCTCTTCGGCATCCCTTGGGTACTCATGAACCGAGGATCAAAACATCGGTCTCATCAAGATTACATGGAAAAAGCTGACAAAGCACGAAGTCAGAGACTCTCTTCAGGTTCATCATCAGCTAAATGATTTTTATTTAATTTTTTTTTTCTGGCTTTCTTCTGATAGGAGATGTTCTTTGGAGCAGTTAGTTTACATAACTTTCTTGTAATAGCTCATATATTTTTCATATTTTTCATATTTTTCTATCTCTGCTGACATTAAATGGAGACAGGTAAGTTACACTCACCTAGGTACACTCATGGTTATGAAATAAAGGACAGATTTTTTTTTGTTCCTTTTATAA

Coding sequence (CDS)

ATGCACAGAGCATTTGAGCAAAACCGGAAGTGGAGACCCGGAGGAGCTAATCTCACTTCGACAGAATCGCCGGCCAAAATGAGCGACGGTGCGAGCTCCGGCGGCGGCGGCGGCGGCTTCCGATCGAGTATGGAGCGATATTTATACAGCGGCGACAAAAAGCACGTCGCGGCCGGCATCGCCGTCATCGGTGTCCTCTTCGGCATCCCTTGGGTACTCATGAACCGAGGATCAAAACATCGGTCTCATCAAGATTACATGGAAAAAGCTGACAAAGCACGAAGTCAGAGACTCTCTTCAGGTTCATCATCAGCTAAATGA

Protein sequence

MHRAFEQNRKWRPGGANLTSTESPAKMSDGASSGGGGGGFRSSMERYLYSGDKKHVAAGIAVIGVLFGIPWVLMNRGSKHRSHQDYMEKADKARSQRLSSGSSSAK
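
The protein sequence above corresponds to a frame-0 translation of the CDS. A minimal sketch of that translation follows, assuming the standard nuclear genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and not part of this record, and the sketch does not handle ambiguous bases such as N.

```python
# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order. Assumption: the gene uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons of the CDS listed above.
cds_prefix = "ATGCACAGAGCATTTGAGCAAAACCGGAAG"
print(translate(cds_prefix))
```

The full CDS string from this record can be passed to `translate` in the same way and compared with the protein sequence.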
Homology
BLAST of Sed0010339 vs. NCBI nr
Match: XP_022992483.1 (uncharacterized protein LOC111488802 [Cucurbita maxima])

HSP 1 Score: 131.3 bits (329), Expect = 4.8e-27
Identity = 68/83 (81.93%), Positives = 73/83 (87.95%), Query Frame = 0

Query: 27  MSDGASSG---GGGGGFRSSMERYLYSGDKKHVAAGIAVIGVLFGIPWVLMNRGSKHRSH 86
           MS  A SG   GGGGGFRS ME YLYSG+KKHVAAGI VIG++FGIPW LMNRGSKH+SH
Sbjct: 1   MSGDAKSGGSIGGGGGFRSRMEHYLYSGEKKHVAAGIVVIGIIFGIPWALMNRGSKHQSH 60

Query: 87  QDYMEKADKARSQRLSSGSSSAK 107
           QDYME+ADKARSQRLSSGSSSAK
Sbjct: 61  QDYMERADKARSQRLSSGSSSAK 83

BLAST of Sed0010339 vs. NCBI nr
Match: XP_022953259.1 (uncharacterized protein LOC111455862 [Cucurbita moschata] >XP_023547796.1 uncharacterized protein LOC111806650 [Cucurbita pepo subsp. pepo] >KAG6575633.1 hypothetical protein SDJN03_26272, partial [Cucurbita argyrosperma subsp. sororia] >KAG7014187.1 hypothetical protein SDJN02_24361 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 131.0 bits (328), Expect = 6.3e-27
Identity = 68/83 (81.93%), Positives = 73/83 (87.95%), Query Frame = 0

Query: 27  MSDGASSG---GGGGGFRSSMERYLYSGDKKHVAAGIAVIGVLFGIPWVLMNRGSKHRSH 86
           MS  A SG   GGGGGFRS ME YLYSG+KKHVAAGI VIG++FGIPW LMNRGSKH+SH
Sbjct: 1   MSGEAKSGGSIGGGGGFRSRMEHYLYSGEKKHVAAGIVVIGIIFGIPWALMNRGSKHQSH 60

Query: 87  QDYMEKADKARSQRLSSGSSSAK 107
           QDYME+ADKARSQRLSSGSSSAK
Sbjct: 61  QDYMERADKARSQRLSSGSSSAK 83

BLAST of Sed0010339 vs. NCBI nr
Match: KAB1223321.1 (hypothetical protein CJ030_MR2G001433 [Morella rubra])

HSP 1 Score: 130.2 bits (326), Expect = 1.1e-26
Identity = 63/78 (80.77%), Positives = 71/78 (91.03%), Query Frame = 0

Query: 29  DGASSGGGGGGFRSSMERYLYSGDKKHVAAGIAVIGVLFGIPWVLMNRGSKHRSHQDYME 88
           +G  SGGGGGGFRS +E YLYSGDKKHV AGIA+I V+FG+PW LM+RGSKH+SHQDYME
Sbjct: 8   NGTKSGGGGGGFRSRVEHYLYSGDKKHVVAGIAIISVIFGVPWYLMSRGSKHQSHQDYME 67

Query: 89  KADKARSQRLSSGSSSAK 107
           KADKARSQRLSSG+SSAK
Sbjct: 68  KADKARSQRLSSGASSAK 85

BLAST of Sed0010339 vs. NCBI nr
Match: XP_004135879.1 (uncharacterized protein LOC101214375 [Cucumis sativus])

HSP 1 Score: 129.8 bits (325), Expect = 1.4e-26
Identity = 67/83 (80.72%), Positives = 71/83 (85.54%), Query Frame = 0

Query: 27  MSDGASSG---GGGGGFRSSMERYLYSGDKKHVAAGIAVIGVLFGIPWVLMNRGSKHRSH 86
           MS  A SG   GG GGFRS ME YLYSGDKKHVAAGI + G++FGIPW LMNRGSKHRSH
Sbjct: 1   MSGEAKSGGASGGAGGFRSRMEHYLYSGDKKHVAAGIVIFGIIFGIPWALMNRGSKHRSH 60

Query: 87  QDYMEKADKARSQRLSSGSSSAK 107
           QDYME+ADKARSQRLSSGSSSAK
Sbjct: 61  QDYMERADKARSQRLSSGSSSAK 83

BLAST of Sed0010339 vs. NCBI nr
Match: XP_022150938.1 (uncharacterized protein LOC111018966 [Momordica charantia])

HSP 1 Score: 128.3 bits (321), Expect = 4.1e-26
Identity = 63/77 (81.82%), Positives = 69/77 (89.61%), Query Frame = 0

Query: 30  GASSGGGGGGFRSSMERYLYSGDKKHVAAGIAVIGVLFGIPWVLMNRGSKHRSHQDYMEK 89
           G   GGGG GFRS ME +LYSGDKKHVAAGIAVI ++FGIPWVLM+RGSKH+SHQDYME+
Sbjct: 14  GGGGGGGGAGFRSRMEHFLYSGDKKHVAAGIAVISIIFGIPWVLMSRGSKHQSHQDYMER 73

Query: 90  ADKARSQRLSSGSSSAK 107
           ADKARSQRLSSGSS AK
Sbjct: 74  ADKARSQRLSSGSSQAK 90

BLAST of Sed0010339 vs. ExPASy TrEMBL
Match: A0A6J1JZC0 (uncharacterized protein LOC111488802 OS=Cucurbita maxima OX=3661 GN=LOC111488802 PE=4 SV=1)

HSP 1 Score: 131.3 bits (329), Expect = 2.3e-27
Identity = 68/83 (81.93%), Positives = 73/83 (87.95%), Query Frame = 0

Query: 27  MSDGASSG---GGGGGFRSSMERYLYSGDKKHVAAGIAVIGVLFGIPWVLMNRGSKHRSH 86
           MS  A SG   GGGGGFRS ME YLYSG+KKHVAAGI VIG++FGIPW LMNRGSKH+SH
Sbjct: 1   MSGDAKSGGSIGGGGGFRSRMEHYLYSGEKKHVAAGIVVIGIIFGIPWALMNRGSKHQSH 60

Query: 87  QDYMEKADKARSQRLSSGSSSAK 107
           QDYME+ADKARSQRLSSGSSSAK
Sbjct: 61  QDYMERADKARSQRLSSGSSSAK 83

BLAST of Sed0010339 vs. ExPASy TrEMBL
Match: A0A6J1GP50 (uncharacterized protein LOC111455862 OS=Cucurbita moschata OX=3662 GN=LOC111455862 PE=4 SV=1)

HSP 1 Score: 131.0 bits (328), Expect = 3.0e-27
Identity = 68/83 (81.93%), Positives = 73/83 (87.95%), Query Frame = 0

Query: 27  MSDGASSG---GGGGGFRSSMERYLYSGDKKHVAAGIAVIGVLFGIPWVLMNRGSKHRSH 86
           MS  A SG   GGGGGFRS ME YLYSG+KKHVAAGI VIG++FGIPW LMNRGSKH+SH
Sbjct: 1   MSGEAKSGGSIGGGGGFRSRMEHYLYSGEKKHVAAGIVVIGIIFGIPWALMNRGSKHQSH 60

Query: 87  QDYMEKADKARSQRLSSGSSSAK 107
           QDYME+ADKARSQRLSSGSSSAK
Sbjct: 61  QDYMERADKARSQRLSSGSSSAK 83

BLAST of Sed0010339 vs. ExPASy TrEMBL
Match: A0A6A1WDR3 (Uncharacterized protein OS=Morella rubra OX=262757 GN=CJ030_MR2G001433 PE=4 SV=1)

HSP 1 Score: 130.2 bits (326), Expect = 5.2e-27
Identity = 63/78 (80.77%), Positives = 71/78 (91.03%), Query Frame = 0

Query: 29  DGASSGGGGGGFRSSMERYLYSGDKKHVAAGIAVIGVLFGIPWVLMNRGSKHRSHQDYME 88
           +G  SGGGGGGFRS +E YLYSGDKKHV AGIA+I V+FG+PW LM+RGSKH+SHQDYME
Sbjct: 8   NGTKSGGGGGGFRSRVEHYLYSGDKKHVVAGIAIISVIFGVPWYLMSRGSKHQSHQDYME 67

Query: 89  KADKARSQRLSSGSSSAK 107
           KADKARSQRLSSG+SSAK
Sbjct: 68  KADKARSQRLSSGASSAK 85

BLAST of Sed0010339 vs. ExPASy TrEMBL
Match: A0A6J1DAT5 (uncharacterized protein LOC111018966 OS=Momordica charantia OX=3673 GN=LOC111018966 PE=4 SV=1)

HSP 1 Score: 128.3 bits (321), Expect = 2.0e-26
Identity = 63/77 (81.82%), Positives = 69/77 (89.61%), Query Frame = 0

Query: 30  GASSGGGGGGFRSSMERYLYSGDKKHVAAGIAVIGVLFGIPWVLMNRGSKHRSHQDYMEK 89
           G   GGGG GFRS ME +LYSGDKKHVAAGIAVI ++FGIPWVLM+RGSKH+SHQDYME+
Sbjct: 14  GGGGGGGGAGFRSRMEHFLYSGDKKHVAAGIAVISIIFGIPWVLMSRGSKHQSHQDYMER 73

Query: 90  ADKARSQRLSSGSSSAK 107
           ADKARSQRLSSGSS AK
Sbjct: 74  ADKARSQRLSSGSSQAK 90

BLAST of Sed0010339 vs. ExPASy TrEMBL
Match: A0A5D3CFG0 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold459G001610 PE=4 SV=1)

HSP 1 Score: 127.5 bits (319), Expect = 3.3e-26
Identity = 66/83 (79.52%), Positives = 71/83 (85.54%), Query Frame = 0

Query: 27  MSDGASSG---GGGGGFRSSMERYLYSGDKKHVAAGIAVIGVLFGIPWVLMNRGSKHRSH 86
           MS  A SG   GG GGFRS ME YLYSGDKKHVAAGI + G++FGIPW LMNRGSKH+SH
Sbjct: 1   MSGEAKSGGASGGAGGFRSRMEYYLYSGDKKHVAAGIVIFGIIFGIPWALMNRGSKHQSH 60

Query: 87  QDYMEKADKARSQRLSSGSSSAK 107
           QDYME+ADKARSQRLSSGSSSAK
Sbjct: 61  QDYMERADKARSQRLSSGSSSAK 83

BLAST of Sed0010339 vs. TAIR 10
Match: AT1G80890.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G16000.1); Has 41 Blast hits to 40 proteins in 10 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 41; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 110.5 bits (275), Expect = 8.1e-25
Identity = 54/80 (67.50%), Positives = 64/80 (80.00%), Query Frame = 0

Query: 27  MSDGASSGGGGGGFRSSMERYLYSGDKKHVAAGIAVIGVLFGIPWVLMNRGSKHRSHQDY 86
           M + + S  GGGG R+ ME Y+YSG+KKHV AGI +I ++FGIPW LMN+GSKHRSHQDY
Sbjct: 1   MGNESKSNLGGGGIRAKMEHYVYSGEKKHVLAGIGIISIIFGIPWYLMNQGSKHRSHQDY 60

Query: 87  MEKADKARSQRLSSGSSSAK 107
           +EKADKAR  RLSS SSS K
Sbjct: 61  LEKADKARKARLSSSSSSDK 80

BLAST of Sed0010339 vs. TAIR 10
Match: AT1G16000.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G80890.1); Has 41 Blast hits to 40 proteins in 10 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 41; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 107.1 bits (266), Expect = 9.0e-24
Identity = 50/81 (61.73%), Positives = 64/81 (79.01%), Query Frame = 0

Query: 26  KMSDGASSGGGGGGFRSSMERYLYSGDKKHVAAGIAVIGVLFGIPWVLMNRGSKHRSHQD 85
           K + G +S  GGGGFR+ ME Y+YSG+KKHV  GI ++ ++FG+PW LM +GSKH+SHQD
Sbjct: 6   KTNGGPASMAGGGGFRAKMEHYVYSGEKKHVLVGIGIVTIIFGVPWYLMTQGSKHQSHQD 65

Query: 86  YMEKADKARSQRLSSGSSSAK 107
           YM+KADKAR  RLSS SS+ K
Sbjct: 66  YMDKADKARKARLSSSSSANK 86
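
Each HSP summary line in the alignments above reports a ratio of matching columns to alignment length together with a percentage. The sketch below shows one way to pull those numbers out of such a line and recompute the percentage; the regular expression and the function name `parse_hsp_stats` are assumptions made for illustration, not part of any BLAST tool.

```python
import re

# Matches fragments such as "Identity = 68/83 (81.93%)".
STAT_RE = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_stats(line):
    """Return {label: (matched, alignment_length, reported_percent)}."""
    stats = {}
    for label, matched, length, percent in STAT_RE.findall(line):
        stats[label] = (int(matched), int(length), float(percent))
    return stats


# Identity statistic of the first HSP reported above.
line = "Identity = 68/83 (81.93%)"
matched, length, reported = parse_hsp_stats(line)["Identity"]

# Recompute the percentage from the ratio and show it next to the reported value.
print(round(100 * matched / length, 2), reported)
```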

The following BLAST results are available for this feature:

BLAST of Sed0010339 vs. NCBI nr
Match Name      E-value  Identity (%)  Description
XP_022992483.1  4.8e-27  81.93         uncharacterized protein LOC111488802 [Cucurbita maxima]
XP_022953259.1  6.3e-27  81.93         uncharacterized protein LOC111455862 [Cucurbita moschata] >XP_023547796.1 unchar...
KAB1223321.1    1.1e-26  80.77         hypothetical protein CJ030_MR2G001433 [Morella rubra]
XP_004135879.1  1.4e-26  80.72         uncharacterized protein LOC101214375 [Cucumis sativus]
XP_022150938.1  4.1e-26  81.82         uncharacterized protein LOC111018966 [Momordica charantia]

BLAST of Sed0010339 vs. ExPASy TrEMBL
Match Name  E-value  Identity (%)  Description
A0A6J1JZC0  2.3e-27  81.93         uncharacterized protein LOC111488802 OS=Cucurbita maxima OX=3661 GN=LOC111488802...
A0A6J1GP50  3.0e-27  81.93         uncharacterized protein LOC111455862 OS=Cucurbita moschata OX=3662 GN=LOC1114558...
A0A6A1WDR3  5.2e-27  80.77         Uncharacterized protein OS=Morella rubra OX=262757 GN=CJ030_MR2G001433 PE=4 SV=1
A0A6J1DAT5  2.0e-26  81.82         uncharacterized protein LOC111018966 OS=Momordica charantia OX=3673 GN=LOC111018...
A0A5D3CFG0  3.3e-26  79.52         Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...

BLAST of Sed0010339 vs. TAIR 10
Match Name   E-value  Identity (%)  Description
AT1G80890.1  8.1e-25  67.50         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G16000.1  9.0e-24  61.73         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
InterPro
Analysis Name: InterPro Annotations of Chayote (edule) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term  Source Description   Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 78..106
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 1..38
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 13..28
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 78..95
None      No IPR available  PANTHER      PTHR35990    GAG1AT PROTEIN       coord: 30..105
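
The alignment coordinates in the InterPro annotations refer to positions in the protein sequence. The sketch below extracts the residues covered by one annotation, assuming the coordinates are 1-based and inclusive (an assumption; the record does not state the convention). The helper name `region` is hypothetical.

```python
def region(sequence, start, end):
    """Return residues start..end, treating both coordinates as 1-based and inclusive."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]


# Protein sequence of Sed0010339 as listed in this record.
protein = (
    "MHRAFEQNRKWRPGGANLTSTESPAKMSDGASSGGGGGGFRSSMERYLYSGDKKHVAAGIAVIGVLFGIPW"
    "VLMNRGSKHRSHQDYMEKADKARSQRLSSGSSSAK"
)

# Residues covered by the PANTHER PTHR35990 match (coord: 30..105).
panther_match = region(protein, 30, 105)
print(len(panther_match), panther_match)
```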

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Sed0010339.1  Sed0010339.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane