Cla97C05G090890.1 (mRNA) Watermelon (97103) v2

Name: Cla97C05G090890.1
Type: mRNA
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: arabinogalactan peptide 22
Location: Cla97Chr05 : 8906248 .. 8907051 (-)
Sequence length: 207
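
The Location and Sequence length fields together imply the size of the genomic span and of the intronic portion. A minimal sketch of that arithmetic follows. It assumes the coordinates are 1-based and inclusive (not stated on this page) and that the 207-nt sequence length refers to the spliced mRNA. The variable names are illustrative only.

```python
# Values copied from the Location and Sequence length fields.
# Assumption: coordinates are 1-based and inclusive.
start, end, strand = 8906248, 8907051, "-"
mrna_length = 207

# With inclusive coordinates, span = end - start + 1.
gene_span = end - start + 1

# Anything in the genomic span that is absent from the spliced mRNA is
# intronic (assumes the gene feature covers exactly this interval).
intron_total = gene_span - mrna_length

print(gene_span, intron_total)  # 804 597
```

The minus strand only affects the orientation in which the sequence is reported, not these lengths.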
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTGTTTTGAGGGCTTGTTTTGGAATCTATGCTGCTGTGATTATGGCTGTCTTTTATGTTGTTTCTTTGCCTGTGGCTGTATCTGCAGCTGAACATTCATCTTCTCCAGCTCCAGCTCCCACTAGTGATGGTTAGTTTTCTTTTCTTCCTGGTTTTCTCTTTGTTTTGTTTGTTTCTTGAGAAATTTTCTGGGAAAAAGACGGGAAATCCGAGAAAATGGCTTTGAAAATACTCTTCCCTTTAAATTCTAGATTCTAAGAAGCACCAATCAATTCAATTGGGTCTCCTTGAATTGGAAATGGGGTTTGGGGAATTTGGCTCCTTGTTTGAAAAGAAGCAAAATTTACTCCTTTTCTCCCATGCCATTTTTAGACATTTTTCTGTTAGTTTCTTGAGAGGGGACTGACAGGGAATATGAGAATGGCTTTGAAATCAACTCTCATCTTCAAATACAAAGGTGAATCTCCTTAAATTGAAAATGGGTTTCTGGTATTTGAAGAGAAGAGTTGATTTCAAAGGCAAATAGAGTTGTTGGAATTGAATCTCCTTCCTAAATTGGGAAAGTTGACTCCTTGTTTGAATAGAAAACAATATCTACTCTTTTTTCTTCCTCAAGTCAAGAAGAAGAAGCCACCATTTTAGGACATTTTCTTAAAGAAACCTTGCTTAGCTATTAATGGTGTTAAACTCCAAAAATAACATTAATTTGGGATTTCATATGTTGCAGGAACCACAATAGACCAAGGAATAGCATATGTTCTAATGCTGTTGGCTTTAGTGCTCACTTATATCATCCATTGA

mRNA sequence

ATGGCTGTTTTGAGGGCTTGTTTTGGAATCTATGCTGCTGTGATTATGGCTGTCTTTTATGTTGTTTCTTTGCCTGTGGCTGTATCTGCAGCTGAACATTCATCTTCTCCAGCTCCAGCTCCCACTAGTGATGGAACCACAATAGACCAAGGAATAGCATATGTTCTAATGCTGTTGGCTTTAGTGCTCACTTATATCATCCATTGA

Coding sequence (CDS)

ATGGCTGTTTTGAGGGCTTGTTTTGGAATCTATGCTGCTGTGATTATGGCTGTCTTTTATGTTGTTTCTTTGCCTGTGGCTGTATCTGCAGCTGAACATTCATCTTCTCCAGCTCCAGCTCCCACTAGTGATGGAACCACAATAGACCAAGGAATAGCATATGTTCTAATGCTGTTGGCTTTAGTGCTCACTTATATCATCCATTGA

Protein sequence

MAVLRACFGIYAAVIMAVFYVVSLPVAVSAAEHSSSPAPAPTSDGTTIDQGIAYVLMLLALVLTYIIH
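
The protein sequence above can be checked against the coding sequence by translating it codon by codon. The sketch below assumes the standard nuclear genetic code (translation table 1). `translate` is a hypothetical helper written for this illustration, not part of any database tooling.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Coding sequence and protein copied from this record.
CDS = (
    "ATGGCTGTTTTGAGGGCTTGTTTTGGAATCTATGCTGCTGTGATTATGGCTGTCTTTTAT"
    "GTTGTTTCTTTGCCTGTGGCTGTATCTGCAGCTGAACATTCATCTTCTCCAGCTCCAGCT"
    "CCCACTAGTGATGGAACCACAATAGACCAAGGAATAGCATATGTTCTAATGCTGTTGGCT"
    "TTAGTGCTCACTTATATCATCCATTGA"
)
PROTEIN = "MAVLRACFGIYAAVIMAVFYVVSLPVAVSAAEHSSSPAPAPTSDGTTIDQGIAYVLMLLALVLTYIIH"

translated = translate(CDS)
print(len(CDS), len(translated), translated == PROTEIN)
```

The 207-nt CDS holds 69 codons: 68 amino acids plus the terminal TGA stop codon.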
BLAST of Cla97C05G090890.1 vs. NCBI nr
Match: XP_022945187.1 (arabinogalactan peptide 22-like [Cucurbita moschata] >XP_023541894.1 arabinogalactan peptide 22-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 97.8 bits (242), Expect = 1.5e-17
Identity = 57/68 (83.82%), Positives = 61/68 (89.71%), Query Frame = 0

Query: 1  MAVLRACFGIYAAVIMAVFYVVSLPVAVSAAEHSSSPAPAPTSDGTTIDQGIAYVLMLLA 60
          MAVLRACFGIY AVI+A+FYVV LP  VS AEH+SSPAPAPTSDGT IDQGIAY+LMLLA
Sbjct: 1  MAVLRACFGIY-AVIIAIFYVVMLP--VSRAEHASSPAPAPTSDGTAIDQGIAYILMLLA 60

Query: 61 LVLTYIIH 69
          L LTYIIH
Sbjct: 61 LALTYIIH 65
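
The Identity figure reported for this HSP can be reproduced from the two aligned rows. The sketch below counts alignment columns in which query and subject carry the same residue and divides by the alignment length, gaps included. `alignment_identity` is a hypothetical helper. The Positives figure is not reproduced here because it depends on a substitution matrix.

```python
from typing import Tuple


def alignment_identity(query: str, subject: str) -> Tuple[int, int, float]:
    """Return (identical columns, alignment length, percent identity).

    Both rows must have the same length; '-' marks a gap and never
    counts as a match.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    identical = sum(q == s and q != "-" for q, s in zip(query, subject))
    return identical, len(query), 100.0 * identical / len(query)


# Aligned rows of the XP_022945187.1 HSP, with the two blocks concatenated.
query = (
    "MAVLRACFGIYAAVIMAVFYVVSLPVAVSAAEHSSSPAPAPTSDGTTIDQGIAYVLMLLA"
    "LVLTYIIH"
)
subject = (
    "MAVLRACFGIY-AVIIAIFYVVMLP--VSRAEHASSPAPAPTSDGTAIDQGIAYILMLLA"
    "LALTYIIH"
)

identical, length, percent = alignment_identity(query, subject)
print(f"Identity = {identical}/{length} ({percent:.2f}%)")
```

The denominator is the alignment length, not the length of either protein, so the three gap columns in the subject row count against the identity.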

BLAST of Cla97C05G090890.1 vs. NCBI nr
Match: XP_008465391.1 (PREDICTED: arabinogalactan peptide 22 [Cucumis melo])

HSP 1 Score: 96.3 bits (238), Expect = 4.3e-17
Identity = 58/69 (84.06%), Positives = 65/69 (94.20%), Query Frame = 0

Query: 1  MAVLRA-CFGIYAAVIMAVFYVVSLPVAVSAAEHSSSPAPAPTSDGTTIDQGIAYVLMLL 60
          MAVLRA CFGIYAAV++AVFYV++L  +VS+AE SSSPAPAPTSDGTTIDQGIAYVLML+
Sbjct: 1  MAVLRASCFGIYAAVLIAVFYVLAL--SVSSAELSSSPAPAPTSDGTTIDQGIAYVLMLV 60

Query: 61 ALVLTYIIH 69
          ALVLTYIIH
Sbjct: 61 ALVLTYIIH 67

BLAST of Cla97C05G090890.1 vs. NCBI nr
Match: XP_022140891.1 (arabinogalactan peptide 22-like [Momordica charantia])

HSP 1 Score: 93.2 bits (230), Expect = 3.6e-16
Identity = 54/68 (79.41%), Positives = 60/68 (88.24%), Query Frame = 0

Query: 1  MAVLRACFGIYAAVIMAVFYVVSLPVAVSAAEHSSSPAPAPTSDGTTIDQGIAYVLMLLA 60
          MAV RACFGIYA +I A+ +V+ LP  VS AEHSSSPAPAP+SDGTTIDQGIAY+LMLLA
Sbjct: 1  MAVSRACFGIYAGII-AILFVIILP--VSRAEHSSSPAPAPSSDGTTIDQGIAYILMLLA 60

Query: 61 LVLTYIIH 69
          LVLTYIIH
Sbjct: 61 LVLTYIIH 65

BLAST of Cla97C05G090890.1 vs. NCBI nr
Match: XP_022968217.1 (arabinogalactan peptide 22-like isoform X2 [Cucurbita maxima])

HSP 1 Score: 73.2 bits (178), Expect = 3.9e-10
Identity = 48/70 (68.57%), Positives = 53/70 (75.71%), Query Frame = 0

Query: 1  MAVLRACFGIYAAVIMAVFYVVSLPVAVSAAEHSSS--PAPAPTSDGTTIDQGIAYVLML 60
          MA LRACFG+Y AVI+A+FYVV LP  VS AEH+SS         DGT IDQGIAY+LML
Sbjct: 21 MAALRACFGLY-AVIIAIFYVVMLP--VSRAEHASSXXXXXXXXXDGTAIDQGIAYILML 80

Query: 61 LALVLTYIIH 69
          LAL LTYIIH
Sbjct: 81 LALALTYIIH 87

BLAST of Cla97C05G090890.1 vs. NCBI nr
Match: XP_022968216.1 (arabinogalactan peptide 22-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 73.2 bits (178), Expect = 3.9e-10
Identity = 48/70 (68.57%), Positives = 53/70 (75.71%), Query Frame = 0

Query: 1  MAVLRACFGIYAAVIMAVFYVVSLPVAVSAAEHSSS--PAPAPTSDGTTIDQGIAYVLML 60
          MA LRACFG+Y AVI+A+FYVV LP  VS AEH+SS         DGT IDQGIAY+LML
Sbjct: 26 MAALRACFGLY-AVIIAIFYVVMLP--VSRAEHASSXXXXXXXXXDGTAIDQGIAYILML 85

Query: 61 LALVLTYIIH 69
          LAL LTYIIH
Sbjct: 86 LALALTYIIH 92

BLAST of Cla97C05G090890.1 vs. TrEMBL
Match: tr|A0A1S3CNS5|A0A1S3CNS5_CUCME (arabinogalactan peptide 22 OS=Cucumis melo OX=3656 GN=LOC103503022 PE=4 SV=1)

HSP 1 Score: 96.3 bits (238), Expect = 2.9e-17
Identity = 58/69 (84.06%), Positives = 65/69 (94.20%), Query Frame = 0

Query: 1  MAVLRA-CFGIYAAVIMAVFYVVSLPVAVSAAEHSSSPAPAPTSDGTTIDQGIAYVLMLL 60
          MAVLRA CFGIYAAV++AVFYV++L  +VS+AE SSSPAPAPTSDGTTIDQGIAYVLML+
Sbjct: 1  MAVLRASCFGIYAAVLIAVFYVLAL--SVSSAELSSSPAPAPTSDGTTIDQGIAYVLMLV 60

Query: 61 ALVLTYIIH 69
          ALVLTYIIH
Sbjct: 61 ALVLTYIIH 67

BLAST of Cla97C05G090890.1 vs. TrEMBL
Match: tr|B9RIM3|B9RIM3_RICCO (Uncharacterized protein OS=Ricinus communis OX=3988 GN=RCOM_1580600 PE=4 SV=1)

HSP 1 Score: 70.9 bits (172), Expect = 1.3e-09
Identity = 41/68 (60.29%), Positives = 51/68 (75.00%), Query Frame = 0

Query: 1  MAVLRACFGIYAAVIMAVFYVVSLPVAVSAAEHSSSPAPAPTSDGTTIDQGIAYVLMLLA 60
          MAV R  FG++A +  A  Y + LP+A     H  +PAP+PTSDGT+IDQGIAYVLML+A
Sbjct: 1  MAVSRISFGLFATI--ATIYAIMLPLA-----HGQAPAPSPTSDGTSIDQGIAYVLMLVA 60

Query: 61 LVLTYIIH 69
          LVLTY+IH
Sbjct: 61 LVLTYLIH 61

BLAST of Cla97C05G090890.1 vs. TrEMBL
Match: tr|A0A0A0KZS8|A0A0A0KZS8_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G051380 PE=4 SV=1)

HSP 1 Score: 68.9 bits (167), Expect = 4.9e-09
Identity = 42/68 (61.76%), Positives = 52/68 (76.47%), Query Frame = 0

Query: 1  MAVLRACFGIYAAVIMAVFYVVSLPVAVSAAEHSSSPAPAPTSDGTTIDQGIAYVLMLLA 60
          MAV R+ FG+ AA   A+ + + LPV   A  HS +PAPAPTSDGT+IDQGIAYVLM++A
Sbjct: 1  MAVSRSSFGLLAAT--ALIFAIFLPV---AHPHSLAPAPAPTSDGTSIDQGIAYVLMMVA 60

Query: 61 LVLTYIIH 69
          L LTY+IH
Sbjct: 61 LALTYLIH 63

BLAST of Cla97C05G090890.1 vs. TrEMBL
Match: tr|A0A1U8AFS3|A0A1U8AFS3_NELNU (arabinogalactan peptide 20-like OS=Nelumbo nucifera OX=4432 GN=LOC104603947 PE=4 SV=1)

HSP 1 Score: 68.6 bits (166), Expect = 6.4e-09
Identity = 45/68 (66.18%), Positives = 54/68 (79.41%), Query Frame = 0

Query: 1  MAVLRACFGIYAAVIMAVFYVVSLPVAVSAAEHSSSPAPAPTSDGTTIDQGIAYVLMLLA 60
          MAV R  FG+ A  I+A+ + V+LP AV A   S +PAPAPTSDGT+IDQGIAYVLML+A
Sbjct: 1  MAVSRVSFGVVA--IVALIFAVALP-AVQA--QSVAPAPAPTSDGTSIDQGIAYVLMLVA 60

Query: 61 LVLTYIIH 69
          LVLTY+IH
Sbjct: 61 LVLTYLIH 63

BLAST of Cla97C05G090890.1 vs. TrEMBL
Match: tr|A0A2C9UIZ9|A0A2C9UIZ9_MANES (Uncharacterized protein OS=Manihot esculenta OX=3983 GN=MANES_14G018800 PE=4 SV=1)

HSP 1 Score: 67.0 bits (162), Expect = 1.9e-08
Identity = 43/68 (63.24%), Positives = 54/68 (79.41%), Query Frame = 0

Query: 1  MAVLRACFGIYAAVIMAVFYVVSLPVAVSAAEHSSSPAPAPTSDGTTIDQGIAYVLMLLA 60
          MAV R  FG+  A++ A+F VV LP+A  A    S+P+P+PTSDGTTIDQG+AYVLML+A
Sbjct: 1  MAVCRVSFGVLVAIV-ALFAVV-LPLA-HAQSPVSAPSPSPTSDGTTIDQGVAYVLMLVA 60

Query: 61 LVLTYIIH 69
          LVLTY+IH
Sbjct: 61 LVLTYLIH 65

BLAST of Cla97C05G090890.1 vs. Swiss-Prot
Match: sp|Q8L9T8|AGP41_ARATH (Arabinogalactan protein 41 OS=Arabidopsis thaliana OX=3702 GN=AGP41 PE=1 SV=1)

HSP 1 Score: 64.3 bits (155), Expect = 5.9e-10
Identity = 39/68 (57.35%), Positives = 53/68 (77.94%), Query Frame = 0

Query: 1  MAVLRACFGIYAAVIMAVFYVVSLPVAVSAAEHSSSPAPAPTSDGTTIDQGIAYVLMLLA 60
          M+  R  FG+  + I+++ + + LP+A +    S++PAPAPTSDGTTIDQGIAYVLML+A
Sbjct: 1  MSGSRLFFGV--STIVSIIFAILLPMAHA---QSAAPAPAPTSDGTTIDQGIAYVLMLVA 60

Query: 61 LVLTYIIH 69
          LVLTY+IH
Sbjct: 61 LVLTYLIH 63

BLAST of Cla97C05G090890.1 vs. Swiss-Prot
Match: sp|O82337|AGP16_ARATH (Arabinogalactan protein 16 OS=Arabidopsis thaliana OX=3702 GN=AGP16 PE=1 SV=1)

HSP 1 Score: 58.2 bits (139), Expect = 4.3e-08
Identity = 32/52 (61.54%), Positives = 40/52 (76.92%), Query Frame = 0

Query: 17 AVFYVVSLPVAVSAAEHSSSPAPAPTSDGTTIDQGIAYVLMLLALVLTYIIH 69
          A+F  V   +   A   S +PAPAPTSDGT+IDQGIAY+LM++ALVLTY+IH
Sbjct: 11 ALFSFVFAVILSLAGAQSLAPAPAPTSDGTSIDQGIAYLLMVVALVLTYLIH 62

BLAST of Cla97C05G090890.1 vs. Swiss-Prot
Match: sp|Q9M373|AGP20_ARATH (Arabinogalactan protein 20 OS=Arabidopsis thaliana OX=3702 GN=AGP20 PE=1 SV=1)

HSP 1 Score: 58.2 bits (139), Expect = 4.3e-08
Identity = 31/57 (54.39%), Positives = 44/57 (77.19%), Query Frame = 0

Query: 12 AAVIMAVFYVVSLPVAVSAAEHSSSPAPAPTSDGTTIDQGIAYVLMLLALVLTYIIH 69
          +  ++A+F  V   ++  A   S +PAP+PTSDGT+IDQGIAY+LM++ALVLTY+IH
Sbjct: 6  SVAVIALFAFVFAVISPFAGAQSLAPAPSPTSDGTSIDQGIAYLLMVVALVLTYLIH 62

BLAST of Cla97C05G090890.1 vs. Swiss-Prot
Match: sp|Q9FK16|AGP22_ARATH (Arabinogalactan protein 22 OS=Arabidopsis thaliana OX=3702 GN=AGP22 PE=1 SV=1)

HSP 1 Score: 45.1 bits (105), Expect = 3.7e-04
Identity = 27/54 (50.00%), Positives = 35/54 (64.81%), Query Frame = 0

Query: 15 IMAVFYVVSLPVAVSAAEHSSSPAPAPTSDGTTIDQGIAYVLMLLALVLTYIIH 69
          I+AVF ++S+ +   A            SDGT+IDQGIAYVLM++AL LTY IH
Sbjct: 10 ILAVFVIISVILLPIAQXXXXXXXXXXXSDGTSIDQGIAYVLMMVALALTYFIH 63

BLAST of Cla97C05G090890.1 vs. TAIR10
Match: AT5G24105.1 (arabinogalactan protein 41)

HSP 1 Score: 64.3 bits (155), Expect = 3.3e-11
Identity = 39/68 (57.35%), Positives = 53/68 (77.94%), Query Frame = 0

Query: 1  MAVLRACFGIYAAVIMAVFYVVSLPVAVSAAEHSSSPAPAPTSDGTTIDQGIAYVLMLLA 60
          M+  R  FG+  + I+++ + + LP+A +    S++PAPAPTSDGTTIDQGIAYVLML+A
Sbjct: 1  MSGSRLFFGV--STIVSIIFAILLPMAHA---QSAAPAPAPTSDGTTIDQGIAYVLMLVA 60

Query: 61 LVLTYIIH 69
          LVLTY+IH
Sbjct: 61 LVLTYLIH 63

BLAST of Cla97C05G090890.1 vs. TAIR10
Match: AT2G46330.1 (arabinogalactan protein 16)

HSP 1 Score: 58.2 bits (139), Expect = 2.4e-09
Identity = 32/52 (61.54%), Positives = 40/52 (76.92%), Query Frame = 0

Query: 17 AVFYVVSLPVAVSAAEHSSSPAPAPTSDGTTIDQGIAYVLMLLALVLTYIIH 69
          A+F  V   +   A   S +PAPAPTSDGT+IDQGIAY+LM++ALVLTY+IH
Sbjct: 11 ALFSFVFAVILSLAGAQSLAPAPAPTSDGTSIDQGIAYLLMVVALVLTYLIH 62

BLAST of Cla97C05G090890.1 vs. TAIR10
Match: AT3G61640.1 (arabinogalactan protein 20)

HSP 1 Score: 58.2 bits (139), Expect = 2.4e-09
Identity = 31/57 (54.39%), Positives = 44/57 (77.19%), Query Frame = 0

Query: 12 AAVIMAVFYVVSLPVAVSAAEHSSSPAPAPTSDGTTIDQGIAYVLMLLALVLTYIIH 69
          +  ++A+F  V   ++  A   S +PAP+PTSDGT+IDQGIAY+LM++ALVLTY+IH
Sbjct: 6  SVAVIALFAFVFAVISPFAGAQSLAPAPSPTSDGTSIDQGIAYLLMVVALVLTYLIH 62

BLAST of Cla97C05G090890.1 vs. TAIR10
Match: AT5G53250.1 (arabinogalactan protein 22)

HSP 1 Score: 45.1 bits (105), Expect = 2.1e-05
Identity = 27/54 (50.00%), Positives = 35/54 (64.81%), Query Frame = 0

Query: 15 IMAVFYVVSLPVAVSAAEHSSSPAPAPTSDGTTIDQGIAYVLMLLALVLTYIIH 69
          I+AVF ++S+ +   A            SDGT+IDQGIAYVLM++AL LTY IH
Sbjct: 10 ILAVFVIISVILLPIAQXXXXXXXXXXXSDGTSIDQGIAYVLMMVALALTYFIH 63

The following BLAST results are available for this feature:

vs. NCBI nr
Match Name      E-value  Identity (%)  Description
XP_022945187.1  1.5e-17  83.82         arabinogalactan peptide 22-like [Cucurbita moschata] >XP_023541894.1 arabinogala...
XP_008465391.1  4.3e-17  84.06         PREDICTED: arabinogalactan peptide 22 [Cucumis melo]
XP_022140891.1  3.6e-16  79.41         arabinogalactan peptide 22-like [Momordica charantia]
XP_022968217.1  3.9e-10  68.57         arabinogalactan peptide 22-like isoform X2 [Cucurbita maxima]
XP_022968216.1  3.9e-10  68.57         arabinogalactan peptide 22-like isoform X1 [Cucurbita maxima]

vs. TrEMBL
Match Name                      E-value  Identity (%)  Description
tr|A0A1S3CNS5|A0A1S3CNS5_CUCME  2.9e-17  84.06         arabinogalactan peptide 22 OS=Cucumis melo OX=3656 GN=LOC103503022 PE=4 SV=1
tr|B9RIM3|B9RIM3_RICCO          1.3e-09  60.29         Uncharacterized protein OS=Ricinus communis OX=3988 GN=RCOM_1580600 PE=4 SV=1
tr|A0A0A0KZS8|A0A0A0KZS8_CUCSA  4.9e-09  61.76         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G051380 PE=4 SV=1
tr|A0A1U8AFS3|A0A1U8AFS3_NELNU  6.4e-09  66.18         arabinogalactan peptide 20-like OS=Nelumbo nucifera OX=4432 GN=LOC104603947 PE=4...
tr|A0A2C9UIZ9|A0A2C9UIZ9_MANES  1.9e-08  63.24         Uncharacterized protein OS=Manihot esculenta OX=3983 GN=MANES_14G018800 PE=4 SV=...

vs. Swiss-Prot
Match Name             E-value  Identity (%)  Description
sp|Q8L9T8|AGP41_ARATH  5.9e-10  57.35         Arabinogalactan protein 41 OS=Arabidopsis thaliana OX=3702 GN=AGP41 PE=1 SV=1
sp|O82337|AGP16_ARATH  4.3e-08  61.54         Arabinogalactan protein 16 OS=Arabidopsis thaliana OX=3702 GN=AGP16 PE=1 SV=1
sp|Q9M373|AGP20_ARATH  4.3e-08  54.39         Arabinogalactan protein 20 OS=Arabidopsis thaliana OX=3702 GN=AGP20 PE=1 SV=1
sp|Q9FK16|AGP22_ARATH  3.7e-04  50.00         Arabinogalactan protein 22 OS=Arabidopsis thaliana OX=3702 GN=AGP22 PE=1 SV=1

vs. TAIR10
Match Name   E-value  Identity (%)  Description
AT5G24105.1  3.3e-11  57.35         arabinogalactan protein 41
AT2G46330.1  2.4e-09  61.54         arabinogalactan protein 16
AT3G61640.1  2.4e-09  54.39         arabinogalactan protein 20
AT5G53250.1  2.1e-05  50.00         arabinogalactan protein 22

The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term       Definition
IPR009424  AGP16/20/22/41
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0016020      membrane

This mRNA is a part of the following gene feature(s):

Feature Name     Unique Name      Type
Cla97C05G090890  Cla97C05G090890  gene


The following CDS feature(s) are a part of this mRNA:

Feature Name             Unique Name              Type
Cla97C05G090890.1.CDS.2  Cla97C05G090890.1.CDS.2  CDS
Cla97C05G090890.1.CDS.1  Cla97C05G090890.1.CDS.1  CDS


The following exon feature(s) are a part of this mRNA:

Feature Name              Unique Name               Type
Cla97C05G090890.1.exon.2  Cla97C05G090890.1.exon.2  exon
Cla97C05G090890.1.exon.1  Cla97C05G090890.1.exon.1  exon


The following polypeptide feature(s) derive from this mRNA:

Feature Name       Unique Name                Type
Cla97C05G090890.1  Cla97C05G090890.1-protein  polypeptide


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term   IPR Description                      Source   Source Term  Source Description  Alignment
IPR009424  Arabinogalactan protein 16/20/22/41  PFAM     PF06376      AGP                 coord: 36..68; e-value: 1.1E-18; score: 66.7
IPR009424  Arabinogalactan protein 16/20/22/41  PANTHER  PTHR33374    FAMILY NOT NAMED    coord: 1..68
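
The `coord` values in the Alignment column can be mapped back onto the protein sequence. The sketch below assumes they are 1-based, inclusive residue positions on the 68-residue polypeptide (an assumption, although it is consistent with the PANTHER match spanning 1..68). `subsequence` is a hypothetical helper.

```python
def subsequence(seq: str, start: int, end: int) -> str:
    """Extract a 1-based, inclusive coordinate range from a sequence."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]


# Protein sequence copied from this record.
PROTEIN = "MAVLRACFGIYAAVIMAVFYVVSLPVAVSAAEHSSSPAPAPTSDGTTIDQGIAYVLMLLALVLTYIIH"

# PFAM PF06376 match, coord: 36..68
pfam_region = subsequence(PROTEIN, 36, 68)
print(len(pfam_region), pfam_region)
```

Under that assumption, the Pfam match covers the C-terminal 33 residues, the same stretch that is most conserved across the BLAST hits listed above.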