Tan0000319 (gene) Snake gourd v1

Overview
Name: Tan0000319
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: CCHC-type domain-containing protein
Location: LG06: 12246966 .. 12248399 (-)
RNA-Seq Expression: Tan0000319
Synteny: Tan0000319
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTGCGTTTGGAAAGTTGGAGAGGAAGTTTTGGCAGGGGATGGGAAATTGGCGCCTTTAGTGCGATCTTATTTACAGATTTCAAGCATCACTATCCTTTCTTATATGGCAAGGAAAATCTTCGAAATCCCCGAGCATTGGAGAGCTTTGGAATTCTGGCTGCTATTTAAAGGAATTTGGACCCTTCTTCTAAACACATCCTGGGCTAGTATTCACCTAATAATGGAATCAATTCAGTTAGGAATGTTGGTCGAGAAGCTGAAGCTCACCTCAGAAGAGAAACAAACAATTTTGGTTCCTGATGATGATATAGATGAAAGTATACAATTACTTCAATTGGCTCTTCTCTGTAAAAGTATCTCGGTCACGCCAATAAATTTAGACTTATTCAAACAAACAATACCGAAGATTTGGGGAGTAGATGCACCGCTTTCTATCGAAAAGGTGGGTTTCAACATATTCTTGTGCAACTTCCATTCTTTGGCGGACAAACGAAGAACGTTAGAAAATGATCCCTGATACTACGACAGAGCAATCCTTCTCCTAGACGAGCTCAGAGGATCACGTTGTTTCGGTGATCATGAATTCTGGTACGCTTCCTTTTGGATTCATATCCACAATCTACCCCCTGCAGGTCAGACTAAGATGATGGCACGGATATGTGGTAACCGACTAGGACAATTCCAAAAGGTGGACTCCTTCGATTTAGACTTCTGCTGGGGGAACACTCTTCACATCAGAATCAGGATTGATGTTACAAAGCCATTGAAAAGAGGGTTGGAGGTGAAAGTCGGAGTGATGGGCGAGGAAACATGGTGTCCGGTGACCTACGAAAAGCTCCCGGATTTCTGTTACAGATGCAGGAGAATTGGCCATAGAAAACGGGAATGCCCTTTCGAAGTTGATATTGATGCTGAGGGAAAGAAATTCGGACCTAGTCTCCGCCACTCCACCTCCTGAGAGGCATCCCTGTCTGGTTTTCGTCCAATGAGCATGGATTGTAGAGGTCGAGTGAGATCCAACTGAGGGTGAAACGAGGGTTCAGGGCAACGAAGAGCAGAAGAGGGAGACTCGGAAGGGGATCCAAACCAAAGCCCCCCCTCAACCCCCTCGGTTCAGGCGAGCAGCGGTGGCTCACCGAAGATACAAAATAAGAAACCGAAAATGACCGGCGACAATGAATCAAGGAAGTTTTCCGATGACTTTCAAATTATATTAAATGCTATTAACTTCTCAGCTACAACGACTACAAAGGAAGGAGATAAAGCGGGATTCAATGAGCATTTAAAGGCACCCCGAAAGATGAAGAAAGTTGTCGTTCAGGAGGTCACCAAAACGTTAGGGTTTGAGGAGTCAGTTTCGGATGGAAAGTACAAAAATGAAAAGAAGAGGGCGATCATTGAAAGAGACGGTTGTGGGGCCCACACCAGTTAG

mRNA sequence

ATGTGCGTTTGGAAAGTTGGAGAGGAAGTTTTGGCAGGGGATGGGAAATTGGCGCCTTTAGTGCGATCTTATTTACAGATTTCAAGCATCACTATCCTTTCTTATATGGCAAGGAAAATCTTCGAAATCCCCGAGCATTGGAGAGCTTTGGAATTCTGGCTGCTATTTAAAGGAATTTGGACCCTTCTTCTAAACACATCCTGGGCTAGTATTCACCTAATAATGGAATCAATTCAGTTAGGAATGTTGGTCGAGAAGCTGAAGCTCACCTCAGAAGAGAAACAAACAATTTTGGTTCCTGATGATGATATAGATGAAAGTCAGACTAAGATGATGGCACGGATATGTGGTAACCGACTAGGACAATTCCAAAAGGTGGACTCCTTCGATTTAGACTTCTGCTGGGGGAACACTCTTCACATCAGAATCAGGATTGATGTTACAAAGCCATTGAAAAGAGGGTTGGAGGTGAAAGTCGGAGTGATGGGCGAGGAAACATGGTGTCCGGTGACCTACGAAAAGCTCCCGGATTTCTGTTACAGATGCAGGAGAATTGGCCATAGAAAACGGGAATGCCCTTTCGAAGTTGATATTGATGCTGAGGGAAAGAAATTCGGACCTAGGCAACGAAGAGCAGAAGAGGGAGACTCGGAAGGGGATCCAAACCAAAGCCCCCCCTCAACCCCCTCGGTTCAGGCGAGCAGCGGTGGCTCACCGAAGATACAAAATAAGAAACCGAAAATGACCGGCGACAATGAATCAAGGAAGTTTTCCGATGACTTTCAAATTATATTAAATGCTATTAACTTCTCAGCTACAACGACTACAAAGGAAGGAGATAAAGCGGGATTCAATGAGCATTTAAAGGCACCCCGAAAGATGAAGAAAGTTGTCGTTCAGGAGGTCACCAAAACGTTAGGGTTTGAGGAGTCAGTTTCGGATGGAAAGTACAAAAATGAAAAGAAGAGGGCGATCATTGAAAGAGACGGTTGTGGGGCCCACACCAGTTAG

Coding sequence (CDS)

ATGTGCGTTTGGAAAGTTGGAGAGGAAGTTTTGGCAGGGGATGGGAAATTGGCGCCTTTAGTGCGATCTTATTTACAGATTTCAAGCATCACTATCCTTTCTTATATGGCAAGGAAAATCTTCGAAATCCCCGAGCATTGGAGAGCTTTGGAATTCTGGCTGCTATTTAAAGGAATTTGGACCCTTCTTCTAAACACATCCTGGGCTAGTATTCACCTAATAATGGAATCAATTCAGTTAGGAATGTTGGTCGAGAAGCTGAAGCTCACCTCAGAAGAGAAACAAACAATTTTGGTTCCTGATGATGATATAGATGAAAGTCAGACTAAGATGATGGCACGGATATGTGGTAACCGACTAGGACAATTCCAAAAGGTGGACTCCTTCGATTTAGACTTCTGCTGGGGGAACACTCTTCACATCAGAATCAGGATTGATGTTACAAAGCCATTGAAAAGAGGGTTGGAGGTGAAAGTCGGAGTGATGGGCGAGGAAACATGGTGTCCGGTGACCTACGAAAAGCTCCCGGATTTCTGTTACAGATGCAGGAGAATTGGCCATAGAAAACGGGAATGCCCTTTCGAAGTTGATATTGATGCTGAGGGAAAGAAATTCGGACCTAGGCAACGAAGAGCAGAAGAGGGAGACTCGGAAGGGGATCCAAACCAAAGCCCCCCCTCAACCCCCTCGGTTCAGGCGAGCAGCGGTGGCTCACCGAAGATACAAAATAAGAAACCGAAAATGACCGGCGACAATGAATCAAGGAAGTTTTCCGATGACTTTCAAATTATATTAAATGCTATTAACTTCTCAGCTACAACGACTACAAAGGAAGGAGATAAAGCGGGATTCAATGAGCATTTAAAGGCACCCCGAAAGATGAAGAAAGTTGTCGTTCAGGAGGTCACCAAAACGTTAGGGTTTGAGGAGTCAGTTTCGGATGGAAAGTACAAAAATGAAAAGAAGAGGGCGATCATTGAAAGAGACGGTTGTGGGGCCCACACCAGTTAG

Protein sequence

MCVWKVGEEVLAGDGKLAPLVRSYLQISSITILSYMARKIFEIPEHWRALEFWLLFKGIWTLLLNTSWASIHLIMESIQLGMLVEKLKLTSEEKQTILVPDDDIDESQTKMMARICGNRLGQFQKVDSFDLDFCWGNTLHIRIRIDVTKPLKRGLEVKVGVMGEETWCPVTYEKLPDFCYRCRRIGHRKRECPFEVDIDAEGKKFGPRQRRAEEGDSEGDPNQSPPSTPSVQASSGGSPKIQNKKPKMTGDNESRKFSDDFQIILNAINFSATTTTKEGDKAGFNEHLKAPRKMKKVVVQEVTKTLGFEESVSDGKYKNEKKRAIIERDGCGAHTS
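
The protein sequence above can be cross-checked against the coding sequence by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the names `CODON_TABLE` and `translate` are hypothetical helpers introduced only for illustration. The example input is the first 30 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    # Ambiguous bases such as N are not handled and raise KeyError.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nucleotides of the CDS shown above.
first_codons = "ATGTGCGTTTGGAAAGTTGGAGAGGAAGTT"
print(translate(first_codons))  # MCVWKVGEEV
```

Passing the full CDS string to `translate` should yield the protein sequence followed by a trailing `*` for the terminal TAG stop codon.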
Homology
BLAST of Tan0000319 vs. NCBI nr
Match: XP_039815364.1 (uncharacterized protein LOC120678302 [Panicum virgatum])

HSP 1 Score: 94.4 bits (233), Expect = 2.1e-15
Identity = 67/193 (34.72%), Positives = 95/193 (49.22%), Query Frame = 0

Query: 110 KMMARICGNRLGQFQKVDSFDLDFCWGNTLHIRIRIDVTKPLKRGLEVKVGVMGEETWCP 169
           K +A   G  LG+F +VD  + +F  G  L +++RI++ KPL+RG+ + VG   +E WCP
Sbjct: 75  KEVAIQIGEELGEFMEVDLENDEFAAGRFLRVKVRIEIEKPLRRGIMIDVGEGAQERWCP 134

Query: 170 VTYEKLPDFCYRCRRIGHRKREC----------PF--EVDIDAEGKKFGPRQRRAEEGDS 229
           +TYE LPDFCY C RIGH  + C          PF  E+      KKFG    R++E   
Sbjct: 135 ITYEFLPDFCYVCGRIGHTDKACLTKLAAGEQAPFGRELRYIPPKKKFGGESWRSQENRR 194

Query: 230 EGDPNQSPPSTPSVQASSGG--------SPKIQNKKPKMTGDNESRKFSDDFQIILNAIN 283
            G   +S  S P     SGG        S  +  +K  M G+  S K   D + + +   
Sbjct: 195 SGG-GRSGGSGPWFSGGSGGRGVGGRTRSDALSWRKDDM-GEKTSGK--GDEEEVNSPSK 254
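
The Identity and Positives percentages in the HSP header above are counts of matching (or conservatively substituted) positions divided by the alignment length, which includes gap columns. A minimal Python sketch of that arithmetic for this first hit follows; the helper name `percent` is hypothetical and not part of any BLAST tool.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches / aligned_length as a percentage rounded to two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the HSP header of the first hit (XP_039815364.1).
identity = percent(67, 193)
positives = percent(95, 193)
print(f"Identity = {identity:.2f}%, Positives = {positives:.2f}%")
```

The same function applied to the other hits (for example 38/81 for XP_022149484.1) gives the percentages listed in their headers.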

BLAST of Tan0000319 vs. NCBI nr
Match: XP_039827601.1 (uncharacterized protein LOC120689378 [Panicum virgatum])

HSP 1 Score: 92.4 bits (228), Expect = 7.8e-15
Identity = 65/193 (33.68%), Positives = 92/193 (47.67%), Query Frame = 0

Query: 110 KMMARICGNRLGQFQKVDSFDLDFCWGNTLHIRIRIDVTKPLKRGLEVKVGVMGEETWCP 169
           K +A   G  LG+F +VD  +  F  G  L +++RI++ KPL+RG+ + VG   +E WCP
Sbjct: 138 KEVAIQIGEELGEFMEVDLENDVFAAGRFLRVKVRIEIEKPLRRGIMIDVGEGAQERWCP 197

Query: 170 VTYEKLPDFCYRCRRIGHRKREC----------PF--EVDIDAEGKKFGPRQRRAEEGDS 229
           +TYE LPDFCY C RIGH  + C          PF  E+      KKFG    R++E   
Sbjct: 198 ITYEFLPDFCYVCGRIGHTDKACLTKLAAEEQAPFGRELRYIPPKKKFGGESWRSQENRR 257

Query: 230 EGDPNQSPPSTPSVQASSGG--------SPKIQNKKPKMTGDNESRKFSDDFQIILNAIN 283
            G   +S  S P     SGG        S  +  +K  M    E  +   D + + +   
Sbjct: 258 SGG-GRSGGSGPWFSGGSGGRGVGGRTRSDALSWRKDDM---GEKTRGKGDEEEVNSPSK 317

BLAST of Tan0000319 vs. NCBI nr
Match: XP_022149484.1 (uncharacterized protein LOC111017902 [Momordica charantia])

HSP 1 Score: 91.7 bits (226), Expect = 1.3e-14
Identity = 38/81 (46.91%), Positives = 56/81 (69.14%), Query Frame = 0

Query: 112 MARICGNRLGQFQKVDSFDLDFCWGNTLHIRIRIDVTKPLKRGLEVKVGVMGEETWCPVT 171
           MA I G +LG  ++++    D   G  + +R++IDV+KPL+RG+++K    G++ WCP+ 
Sbjct: 141 MANILGAKLGDVEEIEGDGADGWAGPFIRVRVKIDVSKPLRRGIKLK-NSDGKDIWCPLR 200

Query: 172 YEKLPDFCYRCRRIGHRKREC 193
           YEKLPDFCY C +IGH  REC
Sbjct: 201 YEKLPDFCYECGKIGHSGREC 220

BLAST of Tan0000319 vs. NCBI nr
Match: XP_022132681.1 (uncharacterized protein LOC111005481 [Momordica charantia])

HSP 1 Score: 88.6 bits (218), Expect = 1.1e-13
Identity = 43/100 (43.00%), Positives = 63/100 (63.00%), Query Frame = 0

Query: 110 KMMARICGNRLGQFQKVDSFDLDFCWGNTLHIRIRIDVTKPLKRGLEVKV-GVMGEETWC 169
           K MA   GN +G F+ V+S   +FCWG+ L +R+R DV KPL RG+++ + G MG   W 
Sbjct: 143 KTMATRLGNAIGLFEDVESNANNFCWGSCLRVRVRFDVMKPLHRGIKLNLDGPMG-GCWI 202

Query: 170 PVTYEKLPDFCYRCRRIGHRKREC-PFEVDIDAEGKKFGP 208
           P+ YE+LPDF Y C R+ H  ++C    VD  ++  ++GP
Sbjct: 203 PIQYERLPDFRYHCGRLDHILKDCSDCCVDSVSKNLQYGP 241

BLAST of Tan0000319 vs. NCBI nr
Match: XP_031120295.1 (uncharacterized protein LOC116023434 [Ipomoea triloba])

HSP 1 Score: 88.6 bits (218), Expect = 1.1e-13
Identity = 64/205 (31.22%), Positives = 100/205 (48.78%), Query Frame = 0

Query: 117 GNRLGQFQKVDSFDLDFCWGNTLHIRIRIDVTKPLKRGLEVKVGVMGEETWCPVTYEKLP 176
           GN LG+F K D  + D  W   + IR+ +DVTKPLK+GL + +   GE+      YE+LP
Sbjct: 153 GNSLGEFVKADITNFDGTWNAFIRIRVLLDVTKPLKKGLTITLST-GEDLRLECKYERLP 212

Query: 177 DFCYRCRRIGHRKRECPFEVDIDAEGKKFGPR----QRRAEEGDSEGDPNQSPPSTPSVQ 236
            FC+ CRRIGH +R CP +++ +   + +GP      RR +   ++   ++ P  T S  
Sbjct: 213 TFCFSCRRIGHGERYCPMQLEENECIRSYGPELRVGGRRMQSSGNKWILSERPAKTLSPA 272

Query: 237 AS----------SGGSPKIQNKKPKMTGDNESRKFSDDFQIILNAINFSATTTTKEGDKA 296
           A+            G P+ +    + T  + +    +D      AI F+AT T +E   +
Sbjct: 273 ATPTSDNQHVHPQAGFPRYEVTVGQSTVPDSAYTLGNDQSAAPQAI-FNATATLEE---S 332

Query: 297 GFNEHLKAPRKMKKVVVQEVTKTLG 308
            FN H+ A   +   + Q  T  LG
Sbjct: 333 SFNAHIVAAETLP--LSQPSTSVLG 350

BLAST of Tan0000319 vs. ExPASy TrEMBL
Match: A0A6J1D765 (uncharacterized protein LOC111017902 OS=Momordica charantia OX=3673 GN=LOC111017902 PE=4 SV=1)

HSP 1 Score: 91.7 bits (226), Expect = 6.4e-15
Identity = 38/81 (46.91%), Positives = 56/81 (69.14%), Query Frame = 0

Query: 112 MARICGNRLGQFQKVDSFDLDFCWGNTLHIRIRIDVTKPLKRGLEVKVGVMGEETWCPVT 171
           MA I G +LG  ++++    D   G  + +R++IDV+KPL+RG+++K    G++ WCP+ 
Sbjct: 141 MANILGAKLGDVEEIEGDGADGWAGPFIRVRVKIDVSKPLRRGIKLK-NSDGKDIWCPLR 200

Query: 172 YEKLPDFCYRCRRIGHRKREC 193
           YEKLPDFCY C +IGH  REC
Sbjct: 201 YEKLPDFCYECGKIGHSGREC 220

BLAST of Tan0000319 vs. ExPASy TrEMBL
Match: A0A803P119 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 91.3 bits (225), Expect = 8.4e-15
Identity = 40/86 (46.51%), Positives = 57/86 (66.28%), Query Frame = 0

Query: 107 SQTKMMARICGNRLGQFQKVDSFDLDFCWGNTLHIRIRIDVTKPLKRGLEVKVGVMGEET 166
           S+TK+MA I G+ +G+F KVD   L+  WG  + IRI +DVT+PL RG  +K+  + ++ 
Sbjct: 466 SKTKVMANIIGSIVGEFLKVDDDSLEEGWGPFMRIRISLDVTQPLLRGTLLKITGLVDDL 525

Query: 167 WCPVTYEKLPDFCYRCRRIGHRKREC 193
           W  + YEKLPD CY+C R+GH    C
Sbjct: 526 WVILKYEKLPDICYKCGRLGHTFLRC 551

BLAST of Tan0000319 vs. ExPASy TrEMBL
Match: A0A6J1BSZ1 (uncharacterized protein LOC111005481 OS=Momordica charantia OX=3673 GN=LOC111005481 PE=4 SV=1)

HSP 1 Score: 88.6 bits (218), Expect = 5.5e-14
Identity = 43/100 (43.00%), Positives = 63/100 (63.00%), Query Frame = 0

Query: 110 KMMARICGNRLGQFQKVDSFDLDFCWGNTLHIRIRIDVTKPLKRGLEVKV-GVMGEETWC 169
           K MA   GN +G F+ V+S   +FCWG+ L +R+R DV KPL RG+++ + G MG   W 
Sbjct: 143 KTMATRLGNAIGLFEDVESNANNFCWGSCLRVRVRFDVMKPLHRGIKLNLDGPMG-GCWI 202

Query: 170 PVTYEKLPDFCYRCRRIGHRKREC-PFEVDIDAEGKKFGP 208
           P+ YE+LPDF Y C R+ H  ++C    VD  ++  ++GP
Sbjct: 203 PIQYERLPDFRYHCGRLDHILKDCSDCCVDSVSKNLQYGP 241

BLAST of Tan0000319 vs. ExPASy TrEMBL
Match: A0A6J1DU55 (uncharacterized protein LOC111023135 OS=Momordica charantia OX=3673 GN=LOC111023135 PE=4 SV=1)

HSP 1 Score: 88.2 bits (217), Expect = 7.1e-14
Identity = 53/136 (38.97%), Positives = 73/136 (53.68%), Query Frame = 0

Query: 110 KMMARICGNRLGQFQKVDSFDLDFCWGNTLHIRIRIDVTKPLKRGLEVKV-GVMGEETWC 169
           K MA   GN +G F  VD  +  F WG +L IR+ ID+TKPL+RG+++ + G MG   W 
Sbjct: 142 KTMAIRLGNAIGNFVDVDCNEKGFSWGASLRIRVLIDITKPLRRGIKINIDGPMG-GCWI 201

Query: 170 PVTYEKLPDFCYRCRRIGHRKRECPFEV----DIDAEGKKFGPRQR--RAEEGDSEGDPN 229
           P+ YE+LPDFCY C  IGH   +C        D      ++GP  R   ++ G  +G   
Sbjct: 202 PIQYERLPDFCYFCGVIGHSSHDCDARYLAAQDDSRATSEYGPWLRFVGSKAGAQKGRKG 261

Query: 230 QSPPSTPSVQASSGGS 239
           +SP    S  +SS  S
Sbjct: 262 KSPAREDSCGSSSMNS 276

BLAST of Tan0000319 vs. ExPASy TrEMBL
Match: A0A7J6FN44 (zf-CCHC_4 domain-containing protein OS=Cannabis sativa OX=3483 GN=F8388_000357 PE=4 SV=1)

HSP 1 Score: 86.7 bits (213), Expect = 2.1e-13
Identity = 40/87 (45.98%), Positives = 53/87 (60.92%), Query Frame = 0

Query: 107 SQTKMMARICGNRLGQFQKVDSFDLDFCWGNTLHIRIRIDVTKPLKRGLEVKVGVMGEET 166
           S+T+ +A+I GN +G+F +V    L+  WG  L +R+ IDV+KPL RG  V    M +E 
Sbjct: 55  SKTEALAKILGNMIGKFLEVFEDSLNEGWGPFLRMRVEIDVSKPLLRGQLVTFPWMNDEL 114

Query: 167 WCPVTYEKLPDFCYRCRRIGHRKRECP 194
           W    YE+LPDFCY C  IGH  R  P
Sbjct: 115 WIDYRYERLPDFCYECGIIGHHPRHQP 141

The following BLAST results are available for this feature:
Match Name      E-value  Identity  Description
XP_039815364.1  2.1e-15  34.72     uncharacterized protein LOC120678302 [Panicum virgatum]
XP_039827601.1  7.8e-15  33.68     uncharacterized protein LOC120689378 [Panicum virgatum]
XP_022149484.1  1.3e-14  46.91     uncharacterized protein LOC111017902 [Momordica charantia]
XP_022132681.1  1.1e-13  43.00     uncharacterized protein LOC111005481 [Momordica charantia]
XP_031120295.1  1.1e-13  31.22     uncharacterized protein LOC116023434 [Ipomoea triloba]

Match Name  E-value  Identity  Description
A0A6J1D765  6.4e-15  46.91     uncharacterized protein LOC111017902 OS=Momordica charantia OX=3673 GN=LOC111017...
A0A803P119  8.4e-15  46.51     Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1
A0A6J1BSZ1  5.5e-14  43.00     uncharacterized protein LOC111005481 OS=Momordica charantia OX=3673 GN=LOC111005...
A0A6J1DU55  7.1e-14  38.97     uncharacterized protein LOC111023135 OS=Momordica charantia OX=3673 GN=LOC111023...
A0A7J6FN44  2.1e-13  45.98     zf-CCHC_4 domain-containing protein OS=Cannabis sativa OX=3483 GN=F8388_000357 P...
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                         Source       Source Term     Source Description                                   Alignment
IPR025836  Zinc knuckle CX2CX4HX4C                 PFAM         PF14392         zf-CCHC_4                                            coord: 145..193; e-value: 9.1E-15; score: 54.1
None       No IPR available                        MOBIDB_LITE  mobidb-lite     disorder_prediction                                  coord: 203..218
None       No IPR available                        MOBIDB_LITE  mobidb-lite     disorder_prediction                                  coord: 220..242
None       No IPR available                        MOBIDB_LITE  mobidb-lite     disorder_prediction                                  coord: 203..254
None       No IPR available                        PANTHER      PTHR31286:SF84  SUBFAMILY NOT NAMED                                  coord: 110..229
IPR040256  Uncharacterized protein At4g02000-like  PANTHER      PTHR31286       GLYCINE-RICH CELL WALL STRUCTURAL PROTEIN 1.8-LIKE   coord: 110..229
IPR001878  Zinc finger, CCHC-type                  PROSITE      PS50158         ZF_CCHC                                              coord: 179..193; score: 9.850513
IPR036875  Zinc finger, CCHC-type superfamily      SUPERFAMILY  57756           Retrovirus zinc finger-like domains                  coord: 176..194

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0000319.1  Tan0000319.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0008270      zinc ion binding