Tan0000071 (gene) Snake gourd v1

Overview
Name: Tan0000071
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: proline-rich protein HaeIII subfamily 1-like
Location: LG03: 60572004 .. 60572537 (-)
RNA-Seq Expression: Tan0000071
Synteny: Tan0000071
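The Location field can be turned into a feature length with a short calculation. The sketch below assumes the coordinates are 1-based and inclusive (the usual GFF-style convention; the page itself does not state this), and `span_length` is a hypothetical helper name used only for illustration.

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates.

    Assumption: the Location field uses 1-based inclusive coordinates.
    """
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field: LG03: 60572004 .. 60572537 (-)
gene_length = span_length(60572004, 60572537)
print(gene_length)           # 534
print(gene_length % 3 == 0)  # True
```

Under that assumption the gene spans 534 bp, which is 178 codons: the 177 residues of the listed protein sequence plus one stop codon.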
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTAATAAATTCAGAGATTTAAAATTGATTAGGAGCCAAATGCTATATAAAGGAAGCATTGGAACAAGGGAAAACACAGCAAAAATGGCCTCAATGAGTTCTAAAATGCTTTTAGGAGCTTTGGCTCTTATTTTCTTAGGGTTTCATTTTACCACAAACCCAACACATGCTAGAAAACTTGCTGAGAAGGAATCAAGGTTTAACTTTGTGATGTATCCCGAAAGAGTACCTATTCCCCCGTCGGGGCCAAATCAAAGACATTCATTTGCTCCTCCACCACCTTCCTTTACTATTTCGAAGAATGAACCCGAGTTCAATTTTGGGATGTACCCAAAAGGTGTACCTATTCCTCCTTCTGGCCCGAGTCAAAGGACATCAGATTCATCTCCTCCTCCACTCCATTCATTGGATCAATTCGATTTCGGAATGTATCCCAAAGGCGTGCCTATTCCTCCTTCTGGACCGAGTCAAAGGACATCTGACGATCTTCACCTCCTCCACCACATTCCATTTATCAAGCTTCGAGAATGA

mRNA sequence

ATGGCTAATAAATTCAGAGATTTAAAATTGATTAGGAGCCAAATGCTATATAAAGGAAGCATTGGAACAAGGGAAAACACAGCAAAAATGGCCTCAATGAGTTCTAAAATGCTTTTAGGAGCTTTGGCTCTTATTTTCTTAGGGTTTCATTTTACCACAAACCCAACACATGCTAGAAAACTTGCTGAGAAGGAATCAAGGTTTAACTTTGTGATGTATCCCGAAAGAGTACCTATTCCCCCGTCGGGGCCAAATCAAAGACATTCATTTGCTCCTCCACCACCTTCCTTTACTATTTCGAAGAATGAACCCGAGTTCAATTTTGGGATGTACCCAAAAGGTGTACCTATTCCTCCTTCTGGCCCGAGTCAAAGGACATCAGATTCATCTCCTCCTCCACTCCATTCATTGGATCAATTCGATTTCGGAATGTATCCCAAAGGCGTGCCTATTCCTCCTTCTGGACCGAGTCAAAGGACATCTGACGATCTTCACCTCCTCCACCACATTCCATTTATCAAGCTTCGAGAATGA

Coding sequence (CDS)

ATGGCTAATAAATTCAGAGATTTAAAATTGATTAGGAGCCAAATGCTATATAAAGGAAGCATTGGAACAAGGGAAAACACAGCAAAAATGGCCTCAATGAGTTCTAAAATGCTTTTAGGAGCTTTGGCTCTTATTTTCTTAGGGTTTCATTTTACCACAAACCCAACACATGCTAGAAAACTTGCTGAGAAGGAATCAAGGTTTAACTTTGTGATGTATCCCGAAAGAGTACCTATTCCCCCGTCGGGGCCAAATCAAAGACATTCATTTGCTCCTCCACCACCTTCCTTTACTATTTCGAAGAATGAACCCGAGTTCAATTTTGGGATGTACCCAAAAGGTGTACCTATTCCTCCTTCTGGCCCGAGTCAAAGGACATCAGATTCATCTCCTCCTCCACTCCATTCATTGGATCAATTCGATTTCGGAATGTATCCCAAAGGCGTGCCTATTCCTCCTTCTGGACCGAGTCAAAGGACATCTGACGATCTTCACCTCCTCCACCACATTCCATTTATCAAGCTTCGAGAATGA

Protein sequence

MANKFRDLKLIRSQMLYKGSIGTRENTAKMASMSSKMLLGALALIFLGFHFTTNPTHARKLAEKESRFNFVMYPERVPIPPSGPNQRHSFAPPPPSFTISKNEPEFNFGMYPKGVPIPPSGPSQRTSDSSPPPLHSLDQFDFGMYPKGVPIPPSGPSQRTSDDLHLLHHIPFIKLRE
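The relationship between the coding sequence and the protein sequence above can be checked by translating codons. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); `translate` and `CODON_TABLE` are hypothetical names, and only the first and last 15 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        residues.append(aa)
    return "".join(residues)


# First 15 and last 15 nucleotides of the CDS listed above.
print(translate("ATGGCTAATAAATTC"))  # MANKF
print(translate("AAGCTTCGAGAATGA"))  # KLRE
```

The two fragments reproduce the start (MANKF) and end (KLRE) of the protein sequence; passing the full CDS string to `translate` works the same way.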
Homology
BLAST of Tan0000071 vs. NCBI nr
Match: KAG6603169.1 (hypothetical protein SDJN03_03778, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 137.5 bits (345), Expect = 1.1e-28
Identity = 75/127 (59.06%), Positives = 86/127 (67.72%), Query Frame = 0

Query: 55  PTHARK-LAEKESRFNFVMYPERVPIPPSGPNQRHSFAPPPP---SFTISKNEPEFNFGM 114
           P HA   +   +S+ NF M P+ VPIPPSGP+QR S  PPPP   S  I   + + NFGM
Sbjct: 230 PPHASSFILNTQSKINFGMLPKGVPIPPSGPSQRTSDYPPPPPHASSVILNTQSKINFGM 289

Query: 115 YPKGVPIPPSGPSQRTSDSSPPPLHSL-DQFDFGMYPKGVPIPPSGPSQRTSDDLHLLHH 174
            PKGVPIPPSGPSQRTS+  PPP H L  + +FGM PKGVPIPPSGPS+RTSD      H
Sbjct: 290 LPKGVPIPPSGPSQRTSNYPPPPPHVLRPKINFGMLPKGVPIPPSGPSRRTSDHPPPAPH 349

Query: 175 IPFIKLR 177
            PFI LR
Sbjct: 350 TPFITLR 356
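The percentages reported in each HSP header follow directly from the match counts and the alignment length. A minimal sketch, with `percent` as a hypothetical helper name and the figures taken from the KAG6603169.1 hit above:

```python
def percent(matches: int, aligned_length: int) -> float:
    """Share of aligned columns that match, as a percentage rounded to 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100 * matches / aligned_length, 2)


# 75 identical and 86 similar positions over 127 aligned columns.
print(percent(75, 127))  # 59.06
print(percent(86, 127))  # 67.72
```

The denominator is the alignment length including gap columns, not the full length of the 177-residue query, so these values describe only the aligned region.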

BLAST of Tan0000071 vs. NCBI nr
Match: XP_022967687.1 (actin cytoskeleton-regulatory complex protein PAN1-like [Cucurbita maxima])

HSP 1 Score: 128.3 bits (321), Expect = 6.8e-26
Identity = 71/124 (57.26%), Positives = 82/124 (66.13%), Query Frame = 0

Query: 55  PTHARK-LAEKESRFNFVMYPERVPIPPSGPNQRHSFAPPPPSFTISKNEPEFNFGMYPK 114
           P HA   +  K+S+ NF M P+ VPIPPSGP+ R S  PPPP   +    P+ NFGM PK
Sbjct: 389 PPHASSVILNKQSKINFGMLPKGVPIPPSGPSHRTSDYPPPPPHVL---WPKINFGMLPK 448

Query: 115 GVPIPPSGPSQRTSDSSPPPLHSL-DQFDFGMYPKGVPIPPSGPSQRTSDDLHLLHHIPF 174
            VPIPPSGPSQRTSD  PPP H L  + +FGM PKGVPIPP GPS+RTSD      + P 
Sbjct: 449 DVPIPPSGPSQRTSDYPPPPPHVLWPKINFGMLPKGVPIPPHGPSRRTSDYPPPAPNTPS 508

Query: 175 IKLR 177
           I LR
Sbjct: 509 ITLR 509

BLAST of Tan0000071 vs. NCBI nr
Match: XP_008442275.1 (PREDICTED: proline-rich protein HaeIII subfamily 1-like [Cucumis melo] >KAA0041826.1 proline-rich protein HaeIII subfamily 1-like [Cucumis melo var. makuwa] >TYK05348.1 proline-rich protein HaeIII subfamily 1-like [Cucumis melo var. makuwa])

HSP 1 Score: 120.6 bits (301), Expect = 1.4e-23
Identity = 72/146 (49.32%), Positives = 89/146 (60.96%), Query Frame = 0

Query: 30  MASMSSKMLLGALALIFLGFHFTTNPTHARKLAEK-------------ESRFNFVMYPER 89
           MAS S  M+LG   L+ LGFH     THAR +                 S F F MYP+ 
Sbjct: 1   MASKSWIMMLGGWLLVLLGFHLIAVQTHARPIPPSGSNPTTSDPPPPPPSNFIFKMYPKG 60

Query: 90  VPIPPSGPNQRHSFAPPPPSFTISKNEPEFNFGMYPKGVPIPPSGPSQRTSDSSPPPLHS 149
           + +PPSGP+QR S + PPP F I +N  +F+FGMYPKG+ +PPSGPSQRTSDSSPPP  +
Sbjct: 61  I-VPPSGPSQRTSDSSPPPPFNILQN--KFDFGMYPKGI-VPPSGPSQRTSDSSPPPPSN 120

Query: 150 LDQFDFGMYPKGVPIPPSGPSQRTSD 163
             +  FG     VP+PPSGP+  TSD
Sbjct: 121 FFEERFGR----VPVPPSGPNPTTSD 138

BLAST of Tan0000071 vs. NCBI nr
Match: XP_011654399.2 (proline-rich protein HaeIII subfamily 1-like [Cucumis sativus])

HSP 1 Score: 120.2 bits (300), Expect = 1.8e-23
Identity = 71/141 (50.35%), Positives = 87/141 (61.70%), Query Frame = 0

Query: 37  MLLGALALIFLGFHFTTNPTHARKL-------------AEKESRFNFVMYPERVPIPPSG 96
           M+LGA  L+ LGFH     THAR +                 S   F +Y + + IPPSG
Sbjct: 1   MILGASFLVLLGFHLIAVQTHARPIPPTGSNPTTSDPTPPPPSNSIFKVYSKGI-IPPSG 60

Query: 97  PNQRHSFAPPPPSFTISKNEPEFNFGMYPKGVPIPPSGPSQRTSDSSPPPLHSL---DQF 156
           P+QR S + PPP F I   + +F+FGMYPKG+PIPPSGPSQRTSDSSPPP  +    + F
Sbjct: 61  PSQRTSDSSPPPPFNIL--QKKFDFGMYPKGIPIPPSGPSQRTSDSSPPPPSNFFHHNTF 120

Query: 157 DFGMYPKGVPIPPSGPSQRTS 162
            FGMY + VPIPPSG + RTS
Sbjct: 121 GFGMYQRRVPIPPSGLNPRTS 138

BLAST of Tan0000071 vs. NCBI nr
Match: EOY16631.1 (Uncharacterized protein TCM_035454 [Theobroma cacao])

HSP 1 Score: 114.0 bits (284), Expect = 1.3e-21
Identity = 66/124 (53.23%), Positives = 86/124 (69.35%), Query Frame = 0

Query: 38  LLGALALIFLGFHFTTNPTHARKLAEKESRFNFVMYPERVPIPPSGPNQRHSFAPPPPSF 97
           +L  +AL+FLG  + +   +AR + E  S+ NF ++P+ VPIPPSGP++R S +PPPP  
Sbjct: 7   MLLMVALVFLG--YVSMSGNARAVPE-ASQLNFGVFPKGVPIPPSGPSRRTSASPPPP-- 66

Query: 98  TISKNEPEFNFGMYPKGVPIPPSGPSQRTSDSSPPPLHSLDQFDFGMYPKGVPIPPSGPS 157
                    NFGM+PKGVPIPPSGPS+RTS +SPPP       +FG +PKGVPIPPSGPS
Sbjct: 67  ---PPIRLLNFGMFPKGVPIPPSGPSRRTS-ASPPPPPPTGLLNFGTFPKGVPIPPSGPS 121

Query: 158 QRTS 162
           + TS
Sbjct: 127 RGTS 121

BLAST of Tan0000071 vs. ExPASy TrEMBL
Match: A0A6J1HRH7 (actin cytoskeleton-regulatory complex protein PAN1-like OS=Cucurbita maxima OX=3661 GN=LOC111467141 PE=4 SV=1)

HSP 1 Score: 128.3 bits (321), Expect = 3.3e-26
Identity = 71/124 (57.26%), Positives = 82/124 (66.13%), Query Frame = 0

Query: 55  PTHARK-LAEKESRFNFVMYPERVPIPPSGPNQRHSFAPPPPSFTISKNEPEFNFGMYPK 114
           P HA   +  K+S+ NF M P+ VPIPPSGP+ R S  PPPP   +    P+ NFGM PK
Sbjct: 389 PPHASSVILNKQSKINFGMLPKGVPIPPSGPSHRTSDYPPPPPHVL---WPKINFGMLPK 448

Query: 115 GVPIPPSGPSQRTSDSSPPPLHSL-DQFDFGMYPKGVPIPPSGPSQRTSDDLHLLHHIPF 174
            VPIPPSGPSQRTSD  PPP H L  + +FGM PKGVPIPP GPS+RTSD      + P 
Sbjct: 449 DVPIPPSGPSQRTSDYPPPPPHVLWPKINFGMLPKGVPIPPHGPSRRTSDYPPPAPNTPS 508

Query: 175 IKLR 177
           I LR
Sbjct: 509 ITLR 509

BLAST of Tan0000071 vs. ExPASy TrEMBL
Match: A0A5D3C2C7 (Proline-rich protein HaeIII subfamily 1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold83G00150 PE=4 SV=1)

HSP 1 Score: 120.6 bits (301), Expect = 6.8e-24
Identity = 72/146 (49.32%), Positives = 89/146 (60.96%), Query Frame = 0

Query: 30  MASMSSKMLLGALALIFLGFHFTTNPTHARKLAEK-------------ESRFNFVMYPER 89
           MAS S  M+LG   L+ LGFH     THAR +                 S F F MYP+ 
Sbjct: 1   MASKSWIMMLGGWLLVLLGFHLIAVQTHARPIPPSGSNPTTSDPPPPPPSNFIFKMYPKG 60

Query: 90  VPIPPSGPNQRHSFAPPPPSFTISKNEPEFNFGMYPKGVPIPPSGPSQRTSDSSPPPLHS 149
           + +PPSGP+QR S + PPP F I +N  +F+FGMYPKG+ +PPSGPSQRTSDSSPPP  +
Sbjct: 61  I-VPPSGPSQRTSDSSPPPPFNILQN--KFDFGMYPKGI-VPPSGPSQRTSDSSPPPPSN 120

Query: 150 LDQFDFGMYPKGVPIPPSGPSQRTSD 163
             +  FG     VP+PPSGP+  TSD
Sbjct: 121 FFEERFGR----VPVPPSGPNPTTSD 138

BLAST of Tan0000071 vs. ExPASy TrEMBL
Match: A0A1S3B4V4 (proline-rich protein HaeIII subfamily 1-like OS=Cucumis melo OX=3656 GN=LOC103486183 PE=4 SV=1)

HSP 1 Score: 120.6 bits (301), Expect = 6.8e-24
Identity = 72/146 (49.32%), Positives = 89/146 (60.96%), Query Frame = 0

Query: 30  MASMSSKMLLGALALIFLGFHFTTNPTHARKLAEK-------------ESRFNFVMYPER 89
           MAS S  M+LG   L+ LGFH     THAR +                 S F F MYP+ 
Sbjct: 1   MASKSWIMMLGGWLLVLLGFHLIAVQTHARPIPPSGSNPTTSDPPPPPPSNFIFKMYPKG 60

Query: 90  VPIPPSGPNQRHSFAPPPPSFTISKNEPEFNFGMYPKGVPIPPSGPSQRTSDSSPPPLHS 149
           + +PPSGP+QR S + PPP F I +N  +F+FGMYPKG+ +PPSGPSQRTSDSSPPP  +
Sbjct: 61  I-VPPSGPSQRTSDSSPPPPFNILQN--KFDFGMYPKGI-VPPSGPSQRTSDSSPPPPSN 120

Query: 150 LDQFDFGMYPKGVPIPPSGPSQRTSD 163
             +  FG     VP+PPSGP+  TSD
Sbjct: 121 FFEERFGR----VPVPPSGPNPTTSD 138

BLAST of Tan0000071 vs. ExPASy TrEMBL
Match: A0A061FHV4 (Uncharacterized protein OS=Theobroma cacao OX=3641 GN=TCM_035454 PE=4 SV=1)

HSP 1 Score: 114.0 bits (284), Expect = 6.4e-22
Identity = 66/124 (53.23%), Positives = 86/124 (69.35%), Query Frame = 0

Query: 38  LLGALALIFLGFHFTTNPTHARKLAEKESRFNFVMYPERVPIPPSGPNQRHSFAPPPPSF 97
           +L  +AL+FLG  + +   +AR + E  S+ NF ++P+ VPIPPSGP++R S +PPPP  
Sbjct: 7   MLLMVALVFLG--YVSMSGNARAVPE-ASQLNFGVFPKGVPIPPSGPSRRTSASPPPP-- 66

Query: 98  TISKNEPEFNFGMYPKGVPIPPSGPSQRTSDSSPPPLHSLDQFDFGMYPKGVPIPPSGPS 157
                    NFGM+PKGVPIPPSGPS+RTS +SPPP       +FG +PKGVPIPPSGPS
Sbjct: 67  ---PPIRLLNFGMFPKGVPIPPSGPSRRTS-ASPPPPPPTGLLNFGTFPKGVPIPPSGPS 121

Query: 158 QRTS 162
           + TS
Sbjct: 127 RGTS 121

BLAST of Tan0000071 vs. ExPASy TrEMBL
Match: A0A1R3GMC3 (Uncharacterized protein OS=Corchorus capsularis OX=210143 GN=CCACVL1_25007 PE=4 SV=1)

HSP 1 Score: 109.0 bits (271), Expect = 2.1e-20
Identity = 70/135 (51.85%), Positives = 82/135 (60.74%), Query Frame = 0

Query: 30  MASMSSKMLLGALALIFLGFHFTTNPTHARKLAEKESRFNFVMYPERVPIPPSGPNQRHS 89
           MA+  S ML+  L L+F+G+  T    +AR L+ +    NF M P+ VPIPPSGP+ R S
Sbjct: 1   MANKLSMMLM--LVLVFVGYSATL--INARSLSSQ--LLNFEMLPKGVPIPPSGPSTRTS 60

Query: 90  FAPPPPSFTISKNEPEFNFGMYPKGVPIPPSGPSQRTSDSSPPPLHSLDQFDFGMYPKGV 149
              PPP        P  NF M PKGVPIPPSGPS RTS   PPP       +F M PKGV
Sbjct: 61  ADVPPP--------PSLNFQMLPKGVPIPPSGPSTRTSADVPPP----PSLNFQMLPKGV 117

Query: 150 PIPPSGPSQRTSDDL 165
           PIPPSGPS RTS D+
Sbjct: 121 PIPPSGPSTRTSADV 117

The following BLAST results are available for this feature:
NCBI nr
Match Name | E-value | Identity | Description
KAG6603169.1 | 1.1e-28 | 59.06 | hypothetical protein SDJN03_03778, partial [Cucurbita argyrosperma subsp. sorori...
XP_022967687.1 | 6.8e-26 | 57.26 | actin cytoskeleton-regulatory complex protein PAN1-like [Cucurbita maxima]
XP_008442275.1 | 1.4e-23 | 49.32 | PREDICTED: proline-rich protein HaeIII subfamily 1-like [Cucumis melo] >KAA00418...
XP_011654399.2 | 1.8e-23 | 50.35 | proline-rich protein HaeIII subfamily 1-like [Cucumis sativus]
EOY16631.1 | 1.3e-21 | 53.23 | Uncharacterized protein TCM_035454 [Theobroma cacao]

ExPASy TrEMBL
Match Name | E-value | Identity | Description
A0A6J1HRH7 | 3.3e-26 | 57.26 | actin cytoskeleton-regulatory complex protein PAN1-like OS=Cucurbita maxima OX=3...
A0A5D3C2C7 | 6.8e-24 | 49.32 | Proline-rich protein HaeIII subfamily 1-like OS=Cucumis melo var. makuwa OX=1194...
A0A1S3B4V4 | 6.8e-24 | 49.32 | proline-rich protein HaeIII subfamily 1-like OS=Cucumis melo OX=3656 GN=LOC10348...
A0A061FHV4 | 6.4e-22 | 53.23 | Uncharacterized protein OS=Theobroma cacao OX=3641 GN=TCM_035454 PE=4 SV=1
A0A1R3GMC3 | 2.1e-20 | 51.85 | Uncharacterized protein OS=Corchorus capsularis OX=210143 GN=CCACVL1_25007 PE=4 ...

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 120..134
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 79..163
None | No IPR available | PANTHER | PTHR33599:SF13 | FORMIN-LIKE PROTEIN 20 | coord: 106..162
None | No IPR available | PANTHER | PTHR33599:SF13 | FORMIN-LIKE PROTEIN 20 | coord: 65..132
IPR039639 | Protein IDA-like | PANTHER | PTHR33599 | PROTEIN IDA-LIKE 5 | coord: 106..162
IPR039639 | Protein IDA-like | PANTHER | PTHR33599 | PROTEIN IDA-LIKE 5 | coord: 65..132

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Tan0000071.1 | Tan0000071.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0010227 | floral organ abscission
cellular_component | GO:0110165 | cellular anatomical entity