CmaCh14G020830 (gene) Cucurbita maxima (Rimu)

Name: CmaCh14G020830
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Unknown protein
Location: Cma_Chr14 : 14421112 .. 14421883 (+)
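The span of the feature can be derived from the location string. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome annotations, but not stated on this page); `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Split 'seqid : start .. end (strand)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("Cma_Chr14 : 14421112 .. 14421883 (+)")

# Assumption: both endpoints belong to the feature, hence the +1.
length = end - start + 1
print(seqid, strand, length)
```

Under that assumption the gene spans 772 bp on the forward strand of Cma_Chr14.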
The following sequences are available for this feature:

Gene sequence (with intron)

TACTCTTCTCCTTCTCTTTCTTCGTTCTTCTTCATCGTTTGATCCGAGCTTCCAGAGATTTGGTTGTTCCGAGAGGGAGACCATCCATGGCGTCCTCTAATTCTACTTTTAGGGCTTTCTCCGGCCTCTTCACCTTCTTCACTCTCATCCTCTTCATTCTATCGCCACTCATTGACGCCCACTCTTCGGTCCCCTCTCCCGCTCCCGCTCCCGCTCCCGCTAGCGATGGTAACTACATTCCTCCTTTTCATTTCACTTCTATTTTCTAATTTGCTCAATCTCTATCTCTTCGCATTCTTGTTCTTCTTCTGTATTCCTTGTTTATTGATACCGAAGAATCTAGATCTTTCTTTCAATTCGGGTAGGAACATAATAGTTTTCGAATCTGAAGCTGATTGTGGACAATTTTGTGTGAAAACAGGGACCTCCATAGACCAGGGCGTTGCGTACGTGTTGATGCTGGTGGCGTTGGTTCTCACATACCTAATTCACCCGCTCGATGCATCTTCCAACAATTTTTTTCTGAATTGAATTCTAGGATTGTAGCAATGTAGGCGCTGTTTGTGGACGTTTTGAGAAGGAACGAGGCCCAACATTCTATTTCGTTGTCATAAATGCAGGTTTCAGAGGAATATGTTCATATTGGGTTTCGCAGATTTTAATCCATTGGTCTAATTATTCATTTGTTTATTCCATTACCTTTTTCTCTTCTTTTATCATGATTTATACACTGGATCCATTTTACAGAGAGAGAGAGAGAGTTCAAACCACA

mRNA sequence

TACTCTTCTCCTTCTCTTTCTTCGTTCTTCTTCATCGTTTGATCCGAGCTTCCAGAGATTTGGTTGTTCCGAGAGGGAGACCATCCATGGCGTCCTCTAATTCTACTTTTAGGGCTTTCTCCGGCCTCTTCACCTTCTTCACTCTCATCCTCTTCATTCTATCGCCACTCATTGACGCCCACTCTTCGGTCCCCTCTCCCGCTCCCGCTCCCGCTCCCGCTAGCGATGGGACCTCCATAGACCAGGGCGTTGCGTACGTGTTGATGCTGGTGGCGTTGGTTCTCACATACCTAATTCACCCGCTCGATGCATCTTCCAACAATTTTTTTCTGAATTGAATTCTAGGATTGTAGCAATGTAGGCGCTGTTTGTGGACGTTTTGAGAAGGAACGAGGCCCAACATTCTATTTCGTTGTCATAAATGCAGGTTTCAGAGGAATATGTTCATATTGGGTTTCGCAGATTTTAATCCATTGGTCTAATTATTCATTTGTTTATTCCATTACCTTTTTCTCTTCTTTTATCATGATTTATACACTGGATCCATTTTACAGAGAGAGAGAGAGAGTTCAAACCACA

Coding sequence (CDS)

ATGGCGTCCTCTAATTCTACTTTTAGGGCTTTCTCCGGCCTCTTCACCTTCTTCACTCTCATCCTCTTCATTCTATCGCCACTCATTGACGCCCACTCTTCGGTCCCCTCTCCCGCTCCCGCTCCCGCTCCCGCTAGCGATGGGACCTCCATAGACCAGGGCGTTGCGTACGTGTTGATGCTGGTGGCGTTGGTTCTCACATACCTAATTCACCCGCTCGATGCATCTTCCAACAATTTTTTTCTGAATTGA

Protein sequence

MASSNSTFRAFSGLFTFFTLILFILSPLIDAHSSVPSPAPAPAPASDGTSIDQGVAYVLMLVALVLTYLIHPLDASSNNFFLN
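The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS codon by codon. This is a sketch only, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is a hypothetical helper written for illustration, and the CDS is split across lines purely for readability.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated
# with bases in T, C, A, G order: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

# CDS and protein copied from this record.
CDS = (
    "ATGGCGTCCTCTAATTCTACTTTTAGGGCTTTCTCCGGCCTCTTCACCTTCTTCACTCTC"
    "ATCCTCTTCATTCTATCGCCACTCATTGACGCCCACTCTTCGGTCCCCTCTCCCGCTCCC"
    "GCTCCCGCTCCCGCTAGCGATGGGACCTCCATAGACCAGGGCGTTGCGTACGTGTTGATG"
    "CTGGTGGCGTTGGTTCTCACATACCTAATTCACCCGCTCGATGCATCTTCCAACAATTTTTTTCTGAATTGA"
)
PROTEIN = (
    "MASSNSTFRAFSGLFTFFTLILFILSPLIDAHSSVPSPAPAPAPASDGTSIDQGVAYVLM"
    "LVALVLTYLIHPLDASSNNFFLN"
)

print(translate(CDS) == PROTEIN)
```

The CDS is 252 nt long, which corresponds to 83 codons plus the terminal TGA stop codon.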
BLAST of CmaCh14G020830 vs. Swiss-Prot
Match: AGP20_ARATH (Arabinogalactan peptide 20 OS=Arabidopsis thaliana GN=AGP20 PE=3 SV=1)

HSP 1 Score: 78.2 bits (191), Expect = 4.8e-14
Identity = 44/80 (55.00%), Positives = 55/80 (68.75%), Query Frame = 1

Query: 1  MASSNSTFRAFSGLFTFFTLILFILSPLIDAHSSVPSPAPAPAPASDGTSIDQGVAYVLM 60
          MAS NS       +   F  +  ++SP   A S     APAP+P SDGTSIDQG+AY+LM
Sbjct: 1  MASRNSV-----AVIALFAFVFAVISPFAGAQSL----APAPSPTSDGTSIDQGIAYLLM 60

Query: 61 LVALVLTYLIHPLDASSNNF 81
          +VALVLTYLIHPLDASS+++
Sbjct: 61 VVALVLTYLIHPLDASSSSY 71
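The percentages on the HSP statistics line are simply the match counts divided by the aligned length. A minimal sketch of recovering them from such a line; the function names `hsp_fractions` and `percent` are hypothetical and not part of any BLAST tool.

```python
import re

def hsp_fractions(stat_line):
    """Return {label: (matches, aligned_length)} for every 'label = a/b' pair."""
    return {
        label: (int(a), int(b))
        for label, a, b in re.findall(r"(\w+) = (\d+)/(\d+)", stat_line)
    }

def percent(matches, aligned_length):
    """Express a match count as a percentage of the aligned length."""
    return round(100.0 * matches / aligned_length, 2)

line = "Identity = 44/80 (55.00%), Positives = 55/80 (68.75%), Query Frame = 1"
fractions = hsp_fractions(line)

identity_pct = percent(*fractions["Identity"])
positives_pct = percent(*fractions["Positives"])
print(identity_pct, positives_pct)
```

The denominator is the alignment length including gap columns, which is why the same query can give different denominators against different subjects.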

BLAST of CmaCh14G020830 vs. Swiss-Prot
Match: AGP16_ARATH (Arabinogalactan peptide 16 OS=Arabidopsis thaliana GN=AGP16 PE=1 SV=1)

HSP 1 Score: 75.5 bits (184), Expect = 3.1e-13
Identity = 46/81 (56.79%), Positives = 55/81 (67.90%), Query Frame = 1

Query: 1  MASSNSTFRAFSGLFTFFTLILFILSPLIDAHSSVPSPAPAPAPASDGTSIDQGVAYVLM 60
          MAS NS         T F L  F+ + ++   +   S APAPAP SDGTSIDQG+AY+LM
Sbjct: 1  MASRNSV--------TGFALFSFVFAVILSL-AGAQSLAPAPAPTSDGTSIDQGIAYLLM 60

Query: 61 LVALVLTYLIHPLDASSNNFF 82
          +VALVLTYLIHPLDASS+  F
Sbjct: 61 VVALVLTYLIHPLDASSSYSF 72

BLAST of CmaCh14G020830 vs. Swiss-Prot
Match: AGP22_ARATH (Arabinogalactan peptide 22 OS=Arabidopsis thaliana GN=AGP22 PE=3 SV=1)

HSP 1 Score: 65.1 bits (157), Expect = 4.2e-10
Identity = 35/58 (60.34%), Positives = 42/58 (72.41%), Query Frame = 1

Query: 14 LFTFFTLILFILSPLIDAHSSVPSPAPAPAPASDGTSIDQGVAYVLMLVALVLTYLIH 72
          +   F +I  IL P+  +HSS    +PAPAP SDGTSIDQG+AYVLM+VAL LTY IH
Sbjct: 10 ILAVFVIISVILLPIAQSHSS----SPAPAPTSDGTSIDQGIAYVLMMVALALTYFIH 63

BLAST of CmaCh14G020830 vs. TrEMBL
Match: A0A0A0L7A0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G120410 PE=4 SV=1)

HSP 1 Score: 120.2 bits (300), Expect = 1.2e-24
Identity = 65/83 (78.31%), Positives = 71/83 (85.54%), Query Frame = 1

Query: 1  MASSNSTFRAFSGLFTFFTLILFILSPLIDAHSSVPSPAPAPAPASDGTSIDQGVAYVLM 60
          M S NST RAF  LFTFF+LI FILSPL+DA     +PAPAPAP+SDGTSIDQG+AYVLM
Sbjct: 1  MESFNSTSRAFKALFTFFSLIFFILSPLVDA-----TPAPAPAPSSDGTSIDQGIAYVLM 60

Query: 61 LVALVLTYLIHPLDASSNNFFLN 84
          L+ALVLTYLIHPLDASS NFFLN
Sbjct: 61 LLALVLTYLIHPLDASSYNFFLN 78

BLAST of CmaCh14G020830 vs. TrEMBL
Match: A0A061DUP4_THECC (Arabinogalactan protein 20 OS=Theobroma cacao GN=TCM_005200 PE=4 SV=1)

HSP 1 Score: 92.8 bits (229), Expect = 2.1e-16
Identity = 49/73 (67.12%), Positives = 58/73 (79.45%), Query Frame = 1

Query: 9  RAFSGLFTFFTLILFILSPLIDAHSSVPSPAPAPAPASDGTSIDQGVAYVLMLVALVLTY 68
          RAF G+   F L+  I+SP ++A S+    APAP+P SDGTSIDQG+AYVLMLVALVLTY
Sbjct: 7  RAFVGVMAIFALVFAIVSPFVEAQSA----APAPSPTSDGTSIDQGIAYVLMLVALVLTY 66

Query: 69 LIHPLDASSNNFF 82
          LIHPLDASS +FF
Sbjct: 67 LIHPLDASSYSFF 75

BLAST of CmaCh14G020830 vs. TrEMBL
Match: A9PBL3_POPTR (Arabinogalactan-protein OS=Populus trichocarpa GN=POPTR_0014s09050g PE=2 SV=1)

HSP 1 Score: 91.7 bits (226), Expect = 4.6e-16
Identity = 49/76 (64.47%), Positives = 60/76 (78.95%), Query Frame = 1

Query: 6  STFRAFSGLFTFFTLILFILSPLIDAHSSVPSPAPAPAPASDGTSIDQGVAYVLMLVALV 65
          ++F+AF  +    +LIL ++SP ++A     SPAPAPAP SDGTSIDQG+AY+LMLVALV
Sbjct: 5  ASFKAFIAVLAVVSLILAVVSPSVEAQ----SPAPAPAPTSDGTSIDQGIAYLLMLVALV 64

Query: 66 LTYLIHPLDASSNNFF 82
          LTYLIHPLDASS  FF
Sbjct: 65 LTYLIHPLDASSYTFF 76

BLAST of CmaCh14G020830 vs. TrEMBL
Match: A0A067JXA9_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_14293 PE=4 SV=1)

HSP 1 Score: 90.9 bits (224), Expect = 7.9e-16
Identity = 55/81 (67.90%), Positives = 62/81 (76.54%), Query Frame = 1

Query: 1  MASSNSTFRAFSGLFTFFTLILFILSPLIDAHSSVPSPAPAPAPASDGTSIDQGVAYVLM 60
          MA S S+ R+F+G+   FTLI  I SP      SV + APAPAPASDGTSIDQG+AYVLM
Sbjct: 1  MAMSGSS-RSFAGVLLAFTLIFVIFSP------SVQAQAPAPAPASDGTSIDQGIAYVLM 60

Query: 61 LVALVLTYLIHPLDASSNNFF 82
          LVALVLTYLIHPLDASS+  F
Sbjct: 61 LVALVLTYLIHPLDASSSYGF 74

BLAST of CmaCh14G020830 vs. TrEMBL
Match: A0A0D2TCU0_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_007G105900 PE=4 SV=1)

HSP 1 Score: 88.6 bits (218), Expect = 3.9e-15
Identity = 48/75 (64.00%), Positives = 57/75 (76.00%), Query Frame = 1

Query: 7  TFRAFSGLFTFFTLILFILSPLIDAHSSVPSPAPAPAPASDGTSIDQGVAYVLMLVALVL 66
          +FR    + T F L+  I+SP ++A S+    APAP+P SDGTSIDQG+AYVLMLVALVL
Sbjct: 5  SFRVLMRVVTIFALVFAIVSPNVEAQSA----APAPSPTSDGTSIDQGIAYVLMLVALVL 64

Query: 67 TYLIHPLDASSNNFF 82
          TYLIHPLDASS  FF
Sbjct: 65 TYLIHPLDASSYTFF 75

BLAST of CmaCh14G020830 vs. TAIR10
Match: AT3G61640.1 (AT3G61640.1 arabinogalactan protein 20)

HSP 1 Score: 78.2 bits (191), Expect = 2.7e-15
Identity = 44/80 (55.00%), Positives = 55/80 (68.75%), Query Frame = 1

Query: 1  MASSNSTFRAFSGLFTFFTLILFILSPLIDAHSSVPSPAPAPAPASDGTSIDQGVAYVLM 60
          MAS NS       +   F  +  ++SP   A S     APAP+P SDGTSIDQG+AY+LM
Sbjct: 1  MASRNSV-----AVIALFAFVFAVISPFAGAQSL----APAPSPTSDGTSIDQGIAYLLM 60

Query: 61 LVALVLTYLIHPLDASSNNF 81
          +VALVLTYLIHPLDASS+++
Sbjct: 61 VVALVLTYLIHPLDASSSSY 71

BLAST of CmaCh14G020830 vs. TAIR10
Match: AT2G46330.1 (AT2G46330.1 arabinogalactan protein 16)

HSP 1 Score: 75.5 bits (184), Expect = 1.7e-14
Identity = 46/81 (56.79%), Positives = 55/81 (67.90%), Query Frame = 1

Query: 1  MASSNSTFRAFSGLFTFFTLILFILSPLIDAHSSVPSPAPAPAPASDGTSIDQGVAYVLM 60
          MAS NS         T F L  F+ + ++   +   S APAPAP SDGTSIDQG+AY+LM
Sbjct: 1  MASRNSV--------TGFALFSFVFAVILSL-AGAQSLAPAPAPTSDGTSIDQGIAYLLM 60

Query: 61 LVALVLTYLIHPLDASSNNFF 82
          +VALVLTYLIHPLDASS+  F
Sbjct: 61 VVALVLTYLIHPLDASSSYSF 72

BLAST of CmaCh14G020830 vs. TAIR10
Match: AT5G24105.1 (AT5G24105.1 arabinogalactan protein 41)

HSP 1 Score: 70.1 bits (170), Expect = 7.3e-13
Identity = 40/63 (63.49%), Positives = 47/63 (74.60%), Query Frame = 1

Query: 9  RAFSGLFTFFTLILFILSPLIDAHSSVPSPAPAPAPASDGTSIDQGVAYVLMLVALVLTY 68
          R F G+ T  ++I  IL P+  A S+    APAPAP SDGT+IDQG+AYVLMLVALVLTY
Sbjct: 5  RLFFGVSTIVSIIFAILLPMAHAQSA----APAPAPTSDGTTIDQGIAYVLMLVALVLTY 63

Query: 69 LIH 72
          LIH
Sbjct: 65 LIH 63

BLAST of CmaCh14G020830 vs. TAIR10
Match: AT5G53250.1 (AT5G53250.1 arabinogalactan protein 22)

HSP 1 Score: 65.1 bits (157), Expect = 2.4e-11
Identity = 35/58 (60.34%), Positives = 42/58 (72.41%), Query Frame = 1

Query: 14 LFTFFTLILFILSPLIDAHSSVPSPAPAPAPASDGTSIDQGVAYVLMLVALVLTYLIH 72
          +   F +I  IL P+  +HSS    +PAPAP SDGTSIDQG+AYVLM+VAL LTY IH
Sbjct: 10 ILAVFVIISVILLPIAQSHSS----SPAPAPTSDGTSIDQGIAYVLMMVALALTYFIH 63

BLAST of CmaCh14G020830 vs. NCBI nr
Match: gi|449432096|ref|XP_004133836.1| (PREDICTED: arabinogalactan peptide 20 [Cucumis sativus])

HSP 1 Score: 120.2 bits (300), Expect = 1.7e-24
Identity = 65/83 (78.31%), Positives = 71/83 (85.54%), Query Frame = 1

Query: 1  MASSNSTFRAFSGLFTFFTLILFILSPLIDAHSSVPSPAPAPAPASDGTSIDQGVAYVLM 60
          M S NST RAF  LFTFF+LI FILSPL+DA     +PAPAPAP+SDGTSIDQG+AYVLM
Sbjct: 1  MESFNSTSRAFKALFTFFSLIFFILSPLVDA-----TPAPAPAPSSDGTSIDQGIAYVLM 60

Query: 61 LVALVLTYLIHPLDASSNNFFLN 84
          L+ALVLTYLIHPLDASS NFFLN
Sbjct: 61 LLALVLTYLIHPLDASSYNFFLN 78

BLAST of CmaCh14G020830 vs. NCBI nr
Match: gi|590721494|ref|XP_007051629.1| (Arabinogalactan protein 20 [Theobroma cacao])

HSP 1 Score: 92.8 bits (229), Expect = 3.0e-16
Identity = 49/73 (67.12%), Positives = 58/73 (79.45%), Query Frame = 1

Query: 9  RAFSGLFTFFTLILFILSPLIDAHSSVPSPAPAPAPASDGTSIDQGVAYVLMLVALVLTY 68
          RAF G+   F L+  I+SP ++A S+    APAP+P SDGTSIDQG+AYVLMLVALVLTY
Sbjct: 7  RAFVGVMAIFALVFAIVSPFVEAQSA----APAPSPTSDGTSIDQGIAYVLMLVALVLTY 66

Query: 69 LIHPLDASSNNFF 82
          LIHPLDASS +FF
Sbjct: 67 LIHPLDASSYSFF 75

BLAST of CmaCh14G020830 vs. NCBI nr
Match: gi|224130474|ref|XP_002320846.1| (arabinogalactan-protein [Populus trichocarpa])

HSP 1 Score: 91.7 bits (226), Expect = 6.7e-16
Identity = 49/76 (64.47%), Positives = 60/76 (78.95%), Query Frame = 1

Query: 6  STFRAFSGLFTFFTLILFILSPLIDAHSSVPSPAPAPAPASDGTSIDQGVAYVLMLVALV 65
          ++F+AF  +    +LIL ++SP ++A     SPAPAPAP SDGTSIDQG+AY+LMLVALV
Sbjct: 5  ASFKAFIAVLAVVSLILAVVSPSVEAQ----SPAPAPAPTSDGTSIDQGIAYLLMLVALV 64

Query: 66 LTYLIHPLDASSNNFF 82
          LTYLIHPLDASS  FF
Sbjct: 65 LTYLIHPLDASSYTFF 76

BLAST of CmaCh14G020830 vs. NCBI nr
Match: gi|743789959|ref|XP_011037662.1| (PREDICTED: arabinogalactan peptide 20 [Populus euphratica])

HSP 1 Score: 91.3 bits (225), Expect = 8.7e-16
Identity = 49/76 (64.47%), Positives = 60/76 (78.95%), Query Frame = 1

Query: 6  STFRAFSGLFTFFTLILFILSPLIDAHSSVPSPAPAPAPASDGTSIDQGVAYVLMLVALV 65
          ++F+AF  +    +LIL ++SP ++A     SPAPAPAP SDGTSIDQG+AY+LMLVALV
Sbjct: 5  ASFKAFIAVLAVASLILAVVSPSVEAQ----SPAPAPAPTSDGTSIDQGIAYLLMLVALV 64

Query: 66 LTYLIHPLDASSNNFF 82
          LTYLIHPLDASS  FF
Sbjct: 65 LTYLIHPLDASSYTFF 76

BLAST of CmaCh14G020830 vs. NCBI nr
Match: gi|802694771|ref|XP_012083260.1| (PREDICTED: arabinogalactan peptide 16 [Jatropha curcas])

HSP 1 Score: 90.9 bits (224), Expect = 1.1e-15
Identity = 55/81 (67.90%), Positives = 62/81 (76.54%), Query Frame = 1

Query: 1  MASSNSTFRAFSGLFTFFTLILFILSPLIDAHSSVPSPAPAPAPASDGTSIDQGVAYVLM 60
          MA S S+ R+F+G+   FTLI  I SP      SV + APAPAPASDGTSIDQG+AYVLM
Sbjct: 1  MAMSGSS-RSFAGVLLAFTLIFVIFSP------SVQAQAPAPAPASDGTSIDQGIAYVLM 60

Query: 61 LVALVLTYLIHPLDASSNNFF 82
          LVALVLTYLIHPLDASS+  F
Sbjct: 61 LVALVLTYLIHPLDASSSYGF 74

The following BLAST results are available for this feature:

Swiss-Prot
Match Name     E-value   Identity   Description
AGP20_ARATH    4.8e-14   55.00      Arabinogalactan peptide 20 OS=Arabidopsis thaliana GN=AGP20 PE=3 SV=1
AGP16_ARATH    3.1e-13   56.79      Arabinogalactan peptide 16 OS=Arabidopsis thaliana GN=AGP16 PE=1 SV=1
AGP22_ARATH    4.2e-10   60.34      Arabinogalactan peptide 22 OS=Arabidopsis thaliana GN=AGP22 PE=3 SV=1

TrEMBL
Match Name         E-value   Identity   Description
A0A0A0L7A0_CUCSA   1.2e-24   78.31      Uncharacterized protein OS=Cucumis sativus GN=Csa_3G120410 PE=4 SV=1
A0A061DUP4_THECC   2.1e-16   67.12      Arabinogalactan protein 20 OS=Theobroma cacao GN=TCM_005200 PE=4 SV=1
A9PBL3_POPTR       4.6e-16   64.47      Arabinogalactan-protein OS=Populus trichocarpa GN=POPTR_0014s09050g PE=2 SV=1
A0A067JXA9_JATCU   7.9e-16   67.90      Uncharacterized protein OS=Jatropha curcas GN=JCGZ_14293 PE=4 SV=1
A0A0D2TCU0_GOSRA   3.9e-15   64.00      Uncharacterized protein OS=Gossypium raimondii GN=B456_007G105900 PE=4 SV=1

TAIR10
Match Name    E-value   Identity   Description
AT3G61640.1   2.7e-15   55.00      arabinogalactan protein 20
AT2G46330.1   1.7e-14   56.79      arabinogalactan protein 16
AT5G24105.1   7.3e-13   63.49      arabinogalactan protein 41
AT5G53250.1   2.4e-11   60.34      arabinogalactan protein 22

NCBI nr
Match Name                          E-value   Identity   Description
gi|449432096|ref|XP_004133836.1|    1.7e-24   78.31      PREDICTED: arabinogalactan peptide 20 [Cucumis sativus]
gi|590721494|ref|XP_007051629.1|    3.0e-16   67.12      Arabinogalactan protein 20 [Theobroma cacao]
gi|224130474|ref|XP_002320846.1|    6.7e-16   64.47      arabinogalactan-protein [Populus trichocarpa]
gi|743789959|ref|XP_011037662.1|    8.7e-16   64.47      PREDICTED: arabinogalactan peptide 20 [Populus euphratica]
gi|802694771|ref|XP_012083260.1|    1.1e-15   67.90      PREDICTED: arabinogalactan peptide 16 [Jatropha curcas]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR009424   AGP16/20/22/41

GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0003674       molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
CmaCh14G020830.1   CmaCh14G020830.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term    IPR Description               Source    Source Term     Source Description                   Alignment
IPR009424   Arabinogalactan peptide, AGP  PFAM      PF06376         DUF1070                              coord: 38..71, score: 8.1
None        No IPR available              PANTHER   PTHR33374       FAMILY NOT NAMED                     coord: 1..82, score: 6.4
None        No IPR available              PANTHER   PTHR33374:SF6   ARABINOGALACTAN PEPTIDE 16-RELATED   coord: 1..82, score: 6.4
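The alignment coordinates in the InterPro table refer to positions in the protein sequence. A minimal sketch of extracting the region covered by the PFAM hit, assuming the coordinates are 1-based and inclusive (the page does not state the convention); `subsequence` is a hypothetical helper.

```python
# Protein sequence copied from this record.
PROTEIN = (
    "MASSNSTFRAFSGLFTFFTLILFILSPLIDAHSSVPSPAPAPAPASDGTSIDQGVAYVLM"
    "LVALVLTYLIHPLDASSNNFFLN"
)

def subsequence(protein, start, end):
    """Return residues start..end, treating both as 1-based and inclusive."""
    if not 1 <= start <= end <= len(protein):
        raise ValueError("coordinates fall outside the protein")
    return protein[start - 1:end]

# PF06376 (DUF1070) is reported at coord 38..71.
pfam_region = subsequence(PROTEIN, 38, 71)
print(len(pfam_region), pfam_region)
```

Under that assumption the PFAM hit covers 34 residues, ending in the TYLIH stretch that is also conserved across the BLAST matches listed earlier.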

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                  Block
CmaCh14G020830   CmaCh00G003360   Cucurbita maxima (Rimu)   cmacmaB011