Tan0019389 (gene) Snake gourd v1

Overview
Name: Tan0019389
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: CCHC-type domain-containing protein
Location: LG01: 109957340 .. 109958571 (+)
RNA-Seq Expression: Tan0019389
Synteny: Tan0019389
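
The Location field packs the linkage group, start, end, and strand into one string. A minimal Python sketch for parsing it follows. It assumes the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state), and `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'LG01: 109957340 .. 109958571 (+)' style string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("LG01: 109957340 .. 109958571 (+)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
gene_length = end - start + 1
print(seqid, strand, gene_length)
```

Under that assumption the gene spans 1232 bp on the forward strand of LG01.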
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCGATATAAGGAATTTTCACAGGTGCTAGCTATTGGCTTTATCCTTTCATGGAAGTTTGGGGCTCTCGACACCATTCTTGGACGAAATCTTGGGATTTTGGAAATTTTACGGTAAGTTTGTACCCGAGAAAGGAAGACGAACAGATCTCGCTGATTTCTCTGGGGGTAATAACGCGGTTTCCTTCTATAATTCGCAATTTATGGGTCCAAATCTTAGGAATTAATAGGTCCCCATGGGATTTAAAGGGGAAGGTCTGGGATCTTATTCATTCAGTAACAAAATTAAGGTTCCAGTGTTGTTACTCAAAAAACAGATCGAAAATGGAAGCCAAACAACTCGGACTAATGGTCGAACGTCTAAATCTTAAAGAGGAGGAAGGTAGAGCTATCGTGGTTGAGGACGATGATGTCGATGAGAGCGTCCGCCTCCTGTTGATCTCTCTGATCTGCAAAAGTCTTTCCTCGAAACCAGTTCATATTGACGTTTTTCGGCAAACAATTCCAAAATTTTGGAAAACCACTGCCCCGATTGGTATCGATAAAGTTGGGGAAAATCTATTCCTCTGCAGCTTTGGGAATGCATGAGATTTACGGAGAGTCCTGAAAAATGAACCATGGTACTTTGATAAGGCGATTCTCTTGTTTGATGAACAAAGAGGCAACTGTCGGTTCATAGATCTGGAATTCAGGTTTGTTAACTTCTGGATCCACTTACATAATCTGCCTCCTGCTGGACAATCGCTGAAAATGACACAAATCTTTGGAAATTTCCTCGGTGTGTTCTAGAAGATGGACCTGAACGATACGGAAACATGTTGGGGAAGCCCCTTGCGCATGAAGGTTCGGATCGACGTCACCATGCCTCTAAAACGGGGATTAAAGGTCAAAATAGGAACAATGGCAGAGGAGTTATGGTGCCTGGTCACCTACGAGAAACTCCCCGATTTTTGCTACAGTTGTGGGAGAATCGGGCACCTGGACAAGATGTGCAATGAGGTTGGTTGGTCAACTTCGGATAAAAAGCAATTTGGTTCTGGACTAAGACACCCATCCTCAAATTTTGGACAACAGAGTTGGGCACGATCGTTTTTATCAGATAGCAGAGGAAGAGGAAGAAGAGTTCATGGCAGAGGATCGAGAGATACAAAGAAGATTGTGGAGGATGATATCGAAGAAGACCCTTCGATGAATTCGAGTTCAGGGATAAAATTACAGACCGAAAGCAAATAG

mRNA sequence

ATGCGATATAAGGAATTTTCACAGGTGCTAGCTATTGGCTTTATCCTTTCATGGAAGTTTGGGGCTCTCGACACCATTCTTGGACGAAATCTTGGGATTTTGGAAATTTTACGATCGAAAATGGAAGCCAAACAACTCGGACTAATGGTCGAACGTCTAAATCTTAAAGAGGAGGAAGGTAGAGCTATCGTGGTTGAGGACGATGATGTCGATGAGAGCAAGATGGACCTGAACGATACGGAAACATGTTGGGGAAGCCCCTTGCGCATGAAGGTTCGGATCGACGTCACCATGCCTCTAAAACGGGGATTAAAGGTCAAAATAGGAACAATGGCAGAGGAGTTATGGTGCCTGGTCACCTACGAGAAACTCCCCGATTTTTGCTACAGTTGTGGGAGAATCGGGCACCTGGACAAGATGTGCAATGAGGTTGGTTGGTCAACTTCGGATAAAAAGCAATTTGGTTCTGGACTAAGACACCCATCCTCAAATTTTGGACAACAGAGTTGGGCACGATCGTTTTTATCAGATAGCAGAGGAAGAGGAAGAAGAGTTCATGGCAGAGGATCGAGAGATACAAAGAAGATTGTGGAGGATGATATCGAAGAAGACCCTTCGATGAATTCGAGTTCAGGGATAAAATTACAGACCGAAAGCAAATAG

Coding sequence (CDS)

ATGCGATATAAGGAATTTTCACAGGTGCTAGCTATTGGCTTTATCCTTTCATGGAAGTTTGGGGCTCTCGACACCATTCTTGGACGAAATCTTGGGATTTTGGAAATTTTACGATCGAAAATGGAAGCCAAACAACTCGGACTAATGGTCGAACGTCTAAATCTTAAAGAGGAGGAAGGTAGAGCTATCGTGGTTGAGGACGATGATGTCGATGAGAGCAAGATGGACCTGAACGATACGGAAACATGTTGGGGAAGCCCCTTGCGCATGAAGGTTCGGATCGACGTCACCATGCCTCTAAAACGGGGATTAAAGGTCAAAATAGGAACAATGGCAGAGGAGTTATGGTGCCTGGTCACCTACGAGAAACTCCCCGATTTTTGCTACAGTTGTGGGAGAATCGGGCACCTGGACAAGATGTGCAATGAGGTTGGTTGGTCAACTTCGGATAAAAAGCAATTTGGTTCTGGACTAAGACACCCATCCTCAAATTTTGGACAACAGAGTTGGGCACGATCGTTTTTATCAGATAGCAGAGGAAGAGGAAGAAGAGTTCATGGCAGAGGATCGAGAGATACAAAGAAGATTGTGGAGGATGATATCGAAGAAGACCCTTCGATGAATTCGAGTTCAGGGATAAAATTACAGACCGAAAGCAAATAG

Protein sequence

MRYKEFSQVLAIGFILSWKFGALDTILGRNLGILEILRSKMEAKQLGLMVERLNLKEEEGRAIVVEDDDVDESKMDLNDTETCWGSPLRMKVRIDVTMPLKRGLKVKIGTMAEELWCLVTYEKLPDFCYSCGRIGHLDKMCNEVGWSTSDKKQFGSGLRHPSSNFGQQSWARSFLSDSRGRGRRVHGRGSRDTKKIVEDDIEEDPSMNSSSGIKLQTESK
Homology
BLAST of Tan0019389 vs. NCBI nr
Match: RLM86337.1 (hypothetical protein C2845_PM04G11780 [Panicum miliaceum])

HSP 1 Score: 95.9 bits (237), Expect = 4.6e-16
Identity = 57/149 (38.26%), Positives = 78/149 (52.35%), Query Frame = 0

Query: 75  MDLNDTETCWGSPLRMKVRIDVTMPLKRGLKVKIGTMAEELWCLVTYEKLPDFCYSCGRI 134
           M+L D ET  G  L +KVR+D+  PL RG  + +G   + +WC + YE LPDFCY+C RI
Sbjct: 3   MELEDDETAIGQFLHIKVRLDIRKPLMRGATLHVGDDEKPIWCPLVYEFLPDFCYTCWRI 62

Query: 135 GHLDKMCNEVGWSTSDKKQFGSGLRH----------PSSNFGQQSWARSFLSDSRGRGRR 194
           GHLDK C EV     + +QF   LR            S   G   +     ++ RG GR+
Sbjct: 63  GHLDKCC-EVVLKDGETQQFSKDLRFIPKKKRWDAGSSDRSGAGRYIPPRRNNGRGSGRK 122

Query: 195 VH-GRGSRDTKKIVEDDIEEDPSMNSSSG 213
           V  GR   D+    +DD+ +D    S+SG
Sbjct: 123 VSGGRSGSDSLSWKKDDLPKDGKKGSTSG 150
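
Each HSP header reports a bit score, a raw score, an E-value, and identity and positive counts over the alignment length. As a rough sketch, those two header lines can be parsed and the percentages recomputed as follows. The function name `parse_hsp` and the returned keys are illustrative assumptions, and the pattern also accepts the misspelled "Postives" label that some renderings of this output carry.

```python
import re

HSP_SCORE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
HSP_COUNTS = re.compile(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)")

def parse_hsp(score_line, identity_line):
    """Extract scores and recompute percentages from the two HSP header lines."""
    s = HSP_SCORE.search(score_line)
    c = HSP_COUNTS.search(identity_line)
    if s is None or c is None:
        raise ValueError("not an HSP header")
    identical, length, positive, _ = (int(x) for x in c.groups())
    return {
        "bit_score": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(s.group(3)),
        "alignment_length": length,
        # Percentages are counts divided by the alignment length (gaps included).
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positive / length, 2),
    }

hsp = parse_hsp(
    "HSP 1 Score: 95.9 bits (237), Expect = 4.6e-16",
    "Identity = 57/149 (38.26%), Positives = 78/149 (52.35%), Query Frame = 0",
)
print(hsp["identity_pct"], hsp["positives_pct"])  # 38.26 52.35
```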

BLAST of Tan0019389 vs. NCBI nr
Match: XP_039815364.1 (uncharacterized protein LOC120678302 [Panicum virgatum])

HSP 1 Score: 95.9 bits (237), Expect = 4.6e-16
Identity = 63/163 (38.65%), Positives = 86/163 (52.76%), Query Frame = 0

Query: 62  AIVVEDDDVDESKMDLNDTETCWGSPLRMKVRIDVTMPLKRGLKVKIGTMAEELWCLVTY 121
           AI + ++  +  ++DL + E   G  LR+KVRI++  PL+RG+ + +G  A+E WC +TY
Sbjct: 78  AIQIGEELGEFMEVDLENDEFAAGRFLRVKVRIEIEKPLRRGIMIDVGEGAQERWCPITY 137

Query: 122 EKLPDFCYSCGRIGHLDKMCNEVGWSTSDKKQFGSGLRH--PSSNFGQQSW--------- 181
           E LPDFCY CGRIGH DK C     +  ++  FG  LR+  P   FG +SW         
Sbjct: 138 EFLPDFCYVCGRIGHTDKAC-LTKLAAGEQAPFGRELRYIPPKKKFGGESWRSQENRRSG 197

Query: 182 -ARS------FLSDSRGRGRRVHGRGSRDTKKIVEDDIEEDPS 207
             RS      F   S GRG  V GR   D     +DD+ E  S
Sbjct: 198 GGRSGGSGPWFSGGSGGRG--VGGRTRSDALSWRKDDMGEKTS 237

BLAST of Tan0019389 vs. NCBI nr
Match: XP_039827601.1 (uncharacterized protein LOC120689378 [Panicum virgatum])

HSP 1 Score: 93.2 bits (230), Expect = 3.0e-15
Identity = 52/130 (40.00%), Positives = 74/130 (56.92%), Query Frame = 0

Query: 62  AIVVEDDDVDESKMDLNDTETCWGSPLRMKVRIDVTMPLKRGLKVKIGTMAEELWCLVTY 121
           AI + ++  +  ++DL +     G  LR+KVRI++  PL+RG+ + +G  A+E WC +TY
Sbjct: 141 AIQIGEELGEFMEVDLENDVFAAGRFLRVKVRIEIEKPLRRGIMIDVGEGAQERWCPITY 200

Query: 122 EKLPDFCYSCGRIGHLDKMCNEVGWSTSDKKQFGSGLRH--PSSNFGQQSWARSFLSDSR 181
           E LPDFCY CGRIGH DK C     +  ++  FG  LR+  P   FG +SW      + R
Sbjct: 201 EFLPDFCYVCGRIGHTDKAC-LTKLAAEEQAPFGRELRYIPPKKKFGGESWRSQ--ENRR 260

Query: 182 GRGRRVHGRG 190
             G R  G G
Sbjct: 261 SGGGRSGGSG 267

BLAST of Tan0019389 vs. NCBI nr
Match: PUZ69939.1 (hypothetical protein GQ55_2G170400 [Panicum hallii var. hallii])

HSP 1 Score: 90.1 bits (222), Expect = 2.5e-14
Identity = 54/151 (35.76%), Positives = 79/151 (52.32%), Query Frame = 0

Query: 75  MDLNDTETCWGSPLRMKVRIDVTMPLKRGLKVKIGTMAEELWCLVTYEKLPDFCYSCGRI 134
           M+L D ET  G  L +K ++D+  PL RG  + +G   + +WC + YE LPDFCY+CGRI
Sbjct: 92  MELEDDETAIGQFLHIKAKLDIRKPLIRGPTLHVGDDEKPIWCPLVYEFLPDFCYTCGRI 151

Query: 135 GHLDKMCNEV------------GWSTSDKKQFGSGLRHPSSNFGQQSWARSFLSDSRGRG 194
           GHLDK C  V             W    KK++ +G    S+  G   +   + ++ RG G
Sbjct: 152 GHLDKCCEVVLKDGETQQFSKDLWFILKKKRWDAG---SSNRSGGGRYIAPWRNNGRGSG 211

Query: 195 RRVH-GRGSRDTKKIVEDDIEEDPSMNSSSG 213
           R+V  GR   ++    +DD  +D    S+SG
Sbjct: 212 RKVSGGRSGSNSLSWKKDDPPKDGKKGSTSG 239

BLAST of Tan0019389 vs. NCBI nr
Match: XP_039815069.1 (uncharacterized protein LOC120677956 [Panicum virgatum])

HSP 1 Score: 89.0 bits (219), Expect = 5.7e-14
Identity = 49/114 (42.98%), Positives = 64/114 (56.14%), Query Frame = 0

Query: 75  MDLNDTETCWGSPLRMKVRIDVTMPLKRGLKVKIGTMAEELWCLVTYEKLPDFCYSCGRI 134
           MDL++ +T  G  LR+KVR+D+  PL RG+ V +G   + LWC V YE LPDFCY+CG I
Sbjct: 129 MDLDEDDTAVGRYLRIKVRLDIRKPLMRGVMVYVGKEDKPLWCPVEYEYLPDFCYTCGII 188

Query: 135 GHLDKMCNEVGWSTSDKKQFGSGLR------HPSSNFGQQSWARSFLSDSRGRG 183
           GH DK+C EV     + +QF   LR           FG +S    F+   R  G
Sbjct: 189 GHTDKVC-EVEVERGETQQFSKYLRCMPERKRVEEGFGGRSGGDRFVPSWRKSG 241

BLAST of Tan0019389 vs. ExPASy TrEMBL
Match: A0A3L6QR72 (zf-CCHC_4 domain-containing protein OS=Panicum miliaceum OX=4540 GN=C2845_PM04G11780 PE=4 SV=1)

HSP 1 Score: 95.9 bits (237), Expect = 2.2e-16
Identity = 57/149 (38.26%), Positives = 78/149 (52.35%), Query Frame = 0

Query: 75  MDLNDTETCWGSPLRMKVRIDVTMPLKRGLKVKIGTMAEELWCLVTYEKLPDFCYSCGRI 134
           M+L D ET  G  L +KVR+D+  PL RG  + +G   + +WC + YE LPDFCY+C RI
Sbjct: 3   MELEDDETAIGQFLHIKVRLDIRKPLMRGATLHVGDDEKPIWCPLVYEFLPDFCYTCWRI 62

Query: 135 GHLDKMCNEVGWSTSDKKQFGSGLRH----------PSSNFGQQSWARSFLSDSRGRGRR 194
           GHLDK C EV     + +QF   LR            S   G   +     ++ RG GR+
Sbjct: 63  GHLDKCC-EVVLKDGETQQFSKDLRFIPKKKRWDAGSSDRSGAGRYIPPRRNNGRGSGRK 122

Query: 195 VH-GRGSRDTKKIVEDDIEEDPSMNSSSG 213
           V  GR   D+    +DD+ +D    S+SG
Sbjct: 123 VSGGRSGSDSLSWKKDDLPKDGKKGSTSG 150

BLAST of Tan0019389 vs. ExPASy TrEMBL
Match: A0A2T7EQ43 (CCHC-type domain-containing protein OS=Panicum hallii var. hallii OX=1504633 GN=GQ55_2G170400 PE=4 SV=1)

HSP 1 Score: 90.1 bits (222), Expect = 1.2e-14
Identity = 54/151 (35.76%), Positives = 79/151 (52.32%), Query Frame = 0

Query: 75  MDLNDTETCWGSPLRMKVRIDVTMPLKRGLKVKIGTMAEELWCLVTYEKLPDFCYSCGRI 134
           M+L D ET  G  L +K ++D+  PL RG  + +G   + +WC + YE LPDFCY+CGRI
Sbjct: 92  MELEDDETAIGQFLHIKAKLDIRKPLIRGPTLHVGDDEKPIWCPLVYEFLPDFCYTCGRI 151

Query: 135 GHLDKMCNEV------------GWSTSDKKQFGSGLRHPSSNFGQQSWARSFLSDSRGRG 194
           GHLDK C  V             W    KK++ +G    S+  G   +   + ++ RG G
Sbjct: 152 GHLDKCCEVVLKDGETQQFSKDLWFILKKKRWDAG---SSNRSGGGRYIAPWRNNGRGSG 211

Query: 195 RRVH-GRGSRDTKKIVEDDIEEDPSMNSSSG 213
           R+V  GR   ++    +DD  +D    S+SG
Sbjct: 212 RKVSGGRSGSNSLSWKKDDPPKDGKKGSTSG 239

BLAST of Tan0019389 vs. ExPASy TrEMBL
Match: A0A5D3C0K2 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold68G001040 PE=4 SV=1)

HSP 1 Score: 84.0 bits (206), Expect = 8.8e-13
Identity = 64/173 (36.99%), Positives = 85/173 (49.13%), Query Frame = 0

Query: 39  SKMEAKQLGLMVERLNLKEEEGRAIVVEDDDVDESKMDLNDTETCWGSPLRMKVRIDVTM 98
           SKM  K L LMV      E++ RA+ VEDDD+DE   D                      
Sbjct: 5   SKMLGK-LSLMV------EQKQRAVTVEDDDIDEVVRD---------------------F 64

Query: 99  PLKRGLKVKIGTMAEELWCLVTYEKLPDFCYSCGRIGHLDKMCNEVGWSTSDKKQFGSGL 158
           P   G   K+G MA+E W  V+YEKLPDFCY CGR+GH+ + C EVG    + ++FG  L
Sbjct: 65  PNVHGTIKKLGVMAKEKWIPVSYEKLPDFCYKCGRVGHVVQECKEVGDDGEEGQRFGVWL 124

Query: 159 R----HPSSNFGQQSWARSFLSDSRGRGRRVHGRGSRDTKKIVEDDIEEDPSM 208
           R      S + G++   RS  +  RGRG    G+G    K+   D    DP++
Sbjct: 125 RKSQTSQSISKGRREEVRSQFTIDRGRG---EGKGIEAFKR---DKSSRDPNL 143

BLAST of Tan0019389 vs. ExPASy TrEMBL
Match: A0A3L6PJW2 (Retrotransposon protein, putative, unclassified OS=Panicum miliaceum OX=4540 GN=C2845_PM18G07530 PE=4 SV=1)

HSP 1 Score: 83.2 bits (204), Expect = 1.5e-12
Identity = 57/151 (37.75%), Positives = 75/151 (49.67%), Query Frame = 0

Query: 55  LKEEEGRAIVVEDDDVDE-SKMDLNDTETCWGSPLRMKVRIDVTMPLKRGLKVKIGTMAE 114
           +K   G AI    D++ E   MDL+D +T  G  +R+KV++D+  PL RGL V +     
Sbjct: 121 MKRATGEAI---GDEIGEFMPMDLDDDDTAVGRFIRIKVKLDIRKPLMRGLTVCVREENR 180

Query: 115 ELWCLVTYEKLPDFCYSCGRIGHLDKMCNEVGWSTSDKKQF---------------GSGL 174
            +WC V YE LPDFCY+CG IGH DK+C  V  S  + +Q+               GSG 
Sbjct: 181 PVWCPVEYEYLPDFCYTCGIIGHTDKVCG-VKLSRGEVQQYSKKLRCMLERSRLDDGSGD 240

Query: 175 RHPSSNF-----GQQSWARSFLSDSRGRGRR 185
           R     F        +  RSF S SRG   R
Sbjct: 241 RFGGGRFILHWKSSGNGGRSFSSSSRGSASR 267

BLAST of Tan0019389 vs. ExPASy TrEMBL
Match: A0A5C7HZB5 (CCHC-type domain-containing protein OS=Acer yangbiense OX=1000413 GN=EZV62_013089 PE=4 SV=1)

HSP 1 Score: 82.8 bits (203), Expect = 2.0e-12
Identity = 52/137 (37.96%), Positives = 79/137 (57.66%), Query Frame = 0

Query: 77  LNDTETCWGSPLRMKVRIDVTMPLKRGLKVKIGTMAEELWCLVTYEKLPDFCYSCGRIGH 136
           L+D+  CWG  LR+KV+ID++ PLKR L++K+G   + +   + YE+LPDFC+ CGRIGH
Sbjct: 22  LSDSRECWGKYLRVKVQIDISKPLKRWLRLKLGKTEDIVVVGLKYERLPDFCFVCGRIGH 81

Query: 137 LDKMCNEVGWSTSDKKQFGSGLRHPSSNFGQQSWARSFLSDSRGRGRRVHGRGS-RDTKK 196
           + K C      T +  + G+ L   ++ FGQ  W R+  +D      R+  +GS + ++ 
Sbjct: 82  VVKEC------TDELARLGA-LDGTATRFGQ--WLRAPGTD------RLQTKGSGQGSES 141

Query: 197 IVEDDIEEDPSMNSSSG 213
             E D   D   NSS G
Sbjct: 142 SFERDRLRDKDPNSSDG 143

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value   Identity (%)  Description
RLM86337.1      4.6e-16   38.26         hypothetical protein C2845_PM04G11780 [Panicum miliaceum]
XP_039815364.1  4.6e-16   38.65         uncharacterized protein LOC120678302 [Panicum virgatum]
XP_039827601.1  3.0e-15   40.00         uncharacterized protein LOC120689378 [Panicum virgatum]
PUZ69939.1      2.5e-14   35.76         hypothetical protein GQ55_2G170400 [Panicum hallii var. hallii]
XP_039815069.1  5.7e-14   42.98         uncharacterized protein LOC120677956 [Panicum virgatum]

ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A3L6QR72      2.2e-16   38.26         zf-CCHC_4 domain-containing protein OS=Panicum miliaceum OX=4540 GN=C2845_PM04G1...
A0A2T7EQ43      1.2e-14   35.76         CCHC-type domain-containing protein OS=Panicum hallii var. hallii OX=1504633 GN=...
A0A5D3C0K2      8.8e-13   36.99         Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold68...
A0A3L6PJW2      1.5e-12   37.75         Retrotransposon protein, putative, unclassified OS=Panicum miliaceum OX=4540 GN=...
A0A5C7HZB5      2.0e-12   37.96         CCHC-type domain-containing protein OS=Acer yangbiense OX=1000413 GN=EZV62_01308...

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                         Source       Source Term     Source Description                                  Alignment
IPR025836  Zinc knuckle CX2CX4HX4C                 PFAM         PF14392         zf-CCHC_4                                           coord: 94..142; e-value: 1.7E-14; score: 53.2
None       No IPR available                        MOBIDB_LITE  mobidb-lite     disorder_prediction                                 coord: 206..220
None       No IPR available                        MOBIDB_LITE  mobidb-lite     disorder_prediction                                 coord: 175..220
None       No IPR available                        PANTHER      PTHR31286:SF84  SUBFAMILY NOT NAMED                                 coord: 83..190
IPR040256  Uncharacterized protein At4g02000-like  PANTHER      PTHR31286       GLYCINE-RICH CELL WALL STRUCTURAL PROTEIN 1.8-LIKE  coord: 83..190
IPR001878  Zinc finger, CCHC-type                  PROSITE      PS50158         ZF_CCHC                                             coord: 128..143; score: 9.30658
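
The IPR025836 description "Zinc knuckle CX2CX4HX4C" spells out a cysteine and histidine spacing pattern. As a rough illustration only, that pattern can be read literally as a regular expression and searched for in the protein sequence. Pfam and PROSITE use profile-based methods, not this regex, and `find_zinc_knuckles` is a hypothetical helper name.

```python
import re

# Protein sequence of Tan0019389, copied from the Sequences section (220 residues).
PROTEIN = (
    "MRYKEFSQVLAIGFILSWKFGALDTILGRNLGILEILRSKMEAKQLGLMVERLNLKEEEG"
    "RAIVVEDDDVDESKMDLNDTETCWGSPLRMKVRIDVTMPLKRGLKVKIGTMAEELWCLVT"
    "YEKLPDFCYSCGRIGHLDKMCNEVGWSTSDKKQFGSGLRHPSSNFGQQSWARSFLSDSRG"
    "RGRRVHGRGSRDTKKIVEDDIEEDPSMNSSSGIKLQTESK"
)

# Literal reading of "CX2CX4HX4C": Cys, 2 any, Cys, 4 any, His, 4 any, Cys.
ZINC_KNUCKLE = re.compile(r"C.{2}C.{4}H.{4}C")

def find_zinc_knuckles(seq):
    """Return (start, end, motif) tuples using 1-based inclusive coordinates."""
    return [(m.start() + 1, m.end(), m.group()) for m in ZINC_KNUCKLE.finditer(seq)]

print(find_zinc_knuckles(PROTEIN))  # [(128, 141, 'CYSCGRIGHLDKMC')]
```

The single hit at residues 128 to 141 lies inside both the Pfam (94..142) and PROSITE (128..143) coordinate ranges reported in the table.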

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0019389.1  Tan0019389.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0008270      zinc ion binding