Tan0007163 (gene) Snake gourd v1

Overview
Name: Tan0007163
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: CCHC-type domain-containing protein
Location: LG07: 61932345 .. 61932761 (+)
RNA-Seq Expression: Tan0007163
Synteny: Tan0007163
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCAAGCTTTATGAGTCCAGTAAAGATTGATGTGGAGAAATTTGACGGAAAGATGAATTTTGGCTTGTGGCAAGTTCAAGTCAAAGATGTGCTGATACAATCTGGGTTACACAAGGCTTTAAAGGGAAGACCAAGCTGTGGTGTTCTTGAAAAGTTAAGCAGTGATGGTGATCAAACGGAGTCCAGTGGTGGTTCCAACAGAGGTTCTAAGAAGTCTAGCATGAGTGATGAAGAATCGGATGAATTAGATTTGAGAGCTGCAAGTGCAATCAGACTAAATTTGGCTAAGAATATTCTTGCAAATTTGCATGGAATTTCGACAGCCAAAGAGCTTTGGGAGAAGCTTGATGCGGTGTATCAGGCAAAGAGCATCTCAAATCGATTGTACCTGAAGGAGCAATTTCACACGCTGTGA

mRNA sequence

ATGTCAAGCTTTATGAGTCCAGTAAAGATTGATGTGGAGAAATTTGACGGAAAGATGAATTTTGGCTTGTGGCAAGTTCAAGTCAAAGATGTGCTGATACAATCTGGGTTACACAAGGCTTTAAAGGGAAGACCAAGCTGTGGTGTTCTTGAAAAGTTAAGCAGTGATGGTGATCAAACGGAGTCCAGTGGTGGTTCCAACAGAGGTTCTAAGAAGTCTAGCATGAGTGATGAAGAATCGGATGAATTAGATTTGAGAGCTGCAAGTGCAATCAGACTAAATTTGGCTAAGAATATTCTTGCAAATTTGCATGGAATTTCGACAGCCAAAGAGCTTTGGGAGAAGCTTGATGCGGTGTATCAGGCAAAGAGCATCTCAAATCGATTGTACCTGAAGGAGCAATTTCACACGCTGTGA

Coding sequence (CDS)

ATGTCAAGCTTTATGAGTCCAGTAAAGATTGATGTGGAGAAATTTGACGGAAAGATGAATTTTGGCTTGTGGCAAGTTCAAGTCAAAGATGTGCTGATACAATCTGGGTTACACAAGGCTTTAAAGGGAAGACCAAGCTGTGGTGTTCTTGAAAAGTTAAGCAGTGATGGTGATCAAACGGAGTCCAGTGGTGGTTCCAACAGAGGTTCTAAGAAGTCTAGCATGAGTGATGAAGAATCGGATGAATTAGATTTGAGAGCTGCAAGTGCAATCAGACTAAATTTGGCTAAGAATATTCTTGCAAATTTGCATGGAATTTCGACAGCCAAAGAGCTTTGGGAGAAGCTTGATGCGGTGTATCAGGCAAAGAGCATCTCAAATCGATTGTACCTGAAGGAGCAATTTCACACGCTGTGA

Protein sequence

MSSFMSPVKIDVEKFDGKMNFGLWQVQVKDVLIQSGLHKALKGRPSCGVLEKLSSDGDQTESSGGSNRGSKKSSMSDEESDELDLRAASAIRLNLAKNILANLHGISTAKELWEKLDAVYQAKSISNRLYLKEQFHTL
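
As a quick consistency check, the location and sequences listed above can be compared programmatically. The following is a minimal sketch, not part of the database: the helper names `translate` and `feature_length` are hypothetical, the standard genetic code is assumed, and for brevity only the first 54 and the last 18 nucleotides of the CDS shown above are used.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def feature_length(start, end):
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


# Excerpts copied from the CDS above (start and end of the sequence only).
cds_prefix = "ATGTCAAGCTTTATGAGTCCAGTAAAGATTGATGTGGAGAAATTTGACGGAAAG"
cds_suffix = "CAATTTCACACGCTGTGA"

gene_bp = feature_length(61932345, 61932761)
print(gene_bp)                 # length of the LG07 interval in bp
print(gene_bp // 3 - 1)        # codons minus the stop codon
print(translate(cds_prefix))   # should match the start of the protein sequence
print(translate(cds_suffix))   # should match the end of the protein sequence
```

The interval on LG07 spans 417 bp, which is the same length as the gene, mRNA and CDS sequences (they are identical here, so the gene has no intron), and 417 bp corresponds to 138 residues plus a stop codon.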
Homology
BLAST of Tan0007163 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 84.3 bits (207), Expect = 1.1e-15
Identity = 50/134 (37.31%), Positives = 78/134 (58.21%), Query Frame = 0

Query: 5   MSPVKIDVEKFDGKMNFGLWQVQVKDVLIQSGLHKALKGRPSCGVLEKLSSDGDQTESSG 64
           MS VK +V KF+G   F  WQ +++D+LIQ GLHK L              D D      
Sbjct: 1   MSGVKYEVAKFNGDNGFSTWQRRMRDLLIQQGLHKVL--------------DVD------ 60

Query: 65  GSNRGSKKSSMSDEESDELDLRAASAIRLNLAKNILANLHGISTAKELWEKLDAVYQAKS 124
                 K  +M  E+  +LD RAASAIRL+L+ +++ N+    TA+ +W +L+++Y +K+
Sbjct: 61  ----SKKPDTMKAEDWADLDERAASAIRLHLSDDVVNNIIDEDTARGIWTRLESLYMSKT 110

Query: 125 ISNRLYLKEQFHTL 139
           ++N+LYLK+Q + L
Sbjct: 121 LTNKLYLKKQLYAL 110
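
The HSP header lines in this section follow a regular pattern, so the score, E-value and identity can be pulled out with a small parser. This is a sketch under assumptions: `parse_hsp` is a hypothetical helper written against the text layout shown on this page, not an official BLAST parser, and it reads only the score, Expect and Identity fields.

```python
import re

# Patterns written against the header layout shown on this page (an assumption).
SCORE_RE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = ([\d.eE+-]+)")
IDENTITY_RE = re.compile(r"Identity = (\d+)/(\d+)")


def parse_hsp(score_line, identity_line):
    """Extract bit score, raw score, E-value and percent identity from an HSP header."""
    score = SCORE_RE.search(score_line)
    ident = IDENTITY_RE.search(identity_line)
    if score is None or ident is None:
        raise ValueError("unrecognised HSP header")
    matches, aligned = int(ident.group(1)), int(ident.group(2))
    return {
        "bits": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(score.group(3)),
        "identity_pct": round(100 * matches / aligned, 2),
    }


hsp = parse_hsp(
    "HSP 1 Score: 84.3 bits (207), Expect = 1.1e-15",
    "Identity = 50/134 (37.31%)",
)
print(hsp)
```

The recomputed percentage (50 of 134 aligned positions) agrees with the 37.31% reported for the Swiss-Prot hit P10978.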

BLAST of Tan0007163 vs. NCBI nr
Match: XP_022139673.1 (uncharacterized protein LOC111010521 [Momordica charantia])

HSP 1 Score: 214.2 bits (544), Expect = 7.3e-52
Identity = 114/138 (82.61%), Positives = 123/138 (89.13%), Query Frame = 0

Query: 1   MSSFMSPVKIDVEKFDGKMNFGLWQVQVKDVLIQSGLHKALKGRPSCGVLEKLSSDGDQT 60
           MS FMSPVKIDVEKFDG +NFGLWQVQVKDVLIQS LHKALKGRPS G  EKLS DG   
Sbjct: 295 MSFFMSPVKIDVEKFDGMINFGLWQVQVKDVLIQSELHKALKGRPSEGASEKLSDDGGPM 354

Query: 61  ESSGGSNRGSKKSSMSDEESDELDLRAASAIRLNLAKNILANLHGISTAKELWEKLDAVY 120
           ESSGGS+RGSKKSSMS E+ +E+DLRAASAIR +LAKNILAN+H ISTAKELWEKL+A+Y
Sbjct: 355 ESSGGSSRGSKKSSMSYEDWEEMDLRAASAIRTSLAKNILANVHRISTAKELWEKLEALY 414

Query: 121 QAKSISNRLYLKEQFHTL 139
           QAK ISNRLYLKEQFHTL
Sbjct: 415 QAKGISNRLYLKEQFHTL 432

BLAST of Tan0007163 vs. NCBI nr
Match: KAF5463231.1 (hypothetical protein F2P56_019161 [Juglans regia])

HSP 1 Score: 165.6 bits (418), Expect = 3.0e-37
Identity = 87/137 (63.50%), Positives = 109/137 (79.56%), Query Frame = 0

Query: 2   SSFMSPVKIDVEKFDGKMNFGLWQVQVKDVLIQSGLHKALKGRPSCGVLEKLSSDGDQTE 61
           S   + ++ +VEKFDG++NFGLWQVQVKDVLIQSGLHKALKGRP+     ++SSD   T+
Sbjct: 4   SKISNSIRYEVEKFDGRINFGLWQVQVKDVLIQSGLHKALKGRPT----PEVSSDTSVTD 63

Query: 62  SSGGSNRGSKKSSMSDEESDELDLRAASAIRLNLAKNILANLHGISTAKELWEKLDAVYQ 121
            +        +S MSDE+ ++LDLRAASAIRL LAKN+LAN+HGISTAK+LWEKL+ +YQ
Sbjct: 64  QT------KSRSVMSDEDWEDLDLRAASAIRLCLAKNVLANIHGISTAKKLWEKLEELYQ 123

Query: 122 AKSISNRLYLKEQFHTL 139
            K +SNR+YLKEQFHTL
Sbjct: 124 TKGVSNRVYLKEQFHTL 130

BLAST of Tan0007163 vs. NCBI nr
Match: KAF3622233.1 (hypothetical protein FXO38_31400 [Capsicum annuum] >KAF3630407.1 hypothetical protein FXO37_28487 [Capsicum annuum])

HSP 1 Score: 161.8 bits (408), Expect = 4.3e-36
Identity = 86/130 (66.15%), Positives = 107/130 (82.31%), Query Frame = 0

Query: 9   KIDVEKFDGKMNFGLWQVQVKDVLIQSGLHKALKGRPSCGVLEKLSSDGDQTESSGGSNR 68
           K ++EKFDG++NFG+WQVQVKDVLIQSGLHKALK RP+    EK + D +++ESS    +
Sbjct: 73  KFEIEKFDGRINFGVWQVQVKDVLIQSGLHKALKERPTS---EKGNKDSEKSESS----K 132

Query: 69  GSKKSSMSDEESDELDLRAASAIRLNLAKNILANLHGISTAKELWEKLDAVYQAKSISNR 128
            S+KS +SDEE +ELD++AAS IRL LAKN+LAN+ G+ST KELWEKL+ +YQ KSISNR
Sbjct: 133 DSEKSKISDEEWEELDMKAASQIRLYLAKNVLANVIGLSTTKELWEKLEELYQTKSISNR 192

Query: 129 LYLKEQFHTL 139
           LYLKEQFH L
Sbjct: 193 LYLKEQFHKL 195

BLAST of Tan0007163 vs. NCBI nr
Match: KAF7802225.1 (cytochrome p450 [Senna tora])

HSP 1 Score: 161.4 bits (407), Expect = 5.6e-36
Identity = 90/138 (65.22%), Positives = 105/138 (76.09%), Query Frame = 0

Query: 1   MSSFMSPVKIDVEKFDGKMNFGLWQVQVKDVLIQSGLHKALKGRPSCGVLEKLSSDGDQT 60
           MS F S VK DVEKFDG++NFGLWQVQVKDVLIQSGLHKAL+G+ S       S D +++
Sbjct: 1   MSKFSSAVKFDVEKFDGRINFGLWQVQVKDVLIQSGLHKALEGKVS-------SKDSEKS 60

Query: 61  ESSGGSNRGSKKSSMSDEESDELDLRAASAIRLNLAKNILANLHGISTAKELWEKLDAVY 120
           E           SSMSD + +ELDLRAAS IRL+LAKN+LAN+ GISTAKELW+KL+ +Y
Sbjct: 61  E-----------SSMSDGDWEELDLRAASTIRLSLAKNVLANVQGISTAKELWKKLEGLY 120

Query: 121 QAKSISNRLYLKEQFHTL 139
           QAK ISN L LKEQFHTL
Sbjct: 121 QAKGISNWLMLKEQFHTL 120

BLAST of Tan0007163 vs. NCBI nr
Match: KAF5765959.1 (putative RNA-directed DNA polymerase [Helianthus annuus])

HSP 1 Score: 158.7 bits (400), Expect = 3.6e-35
Identity = 85/133 (63.91%), Positives = 101/133 (75.94%), Query Frame = 0

Query: 6   SPVKIDVEKFDGKMNFGLWQVQVKDVLIQSGLHKALKGRPSCGVLEKLSSDGDQTESSGG 65
           SP++ DVEK+DG++NFGLWQVQVKDVLIQSGLHKAL+G+P              T  S  
Sbjct: 5   SPMRFDVEKYDGRINFGLWQVQVKDVLIQSGLHKALRGKP--------------TPVSSK 64

Query: 66  SNRGSKKSSMSDEESDELDLRAASAIRLNLAKNILANLHGISTAKELWEKLDAVYQAKSI 125
            + G+ K    DEE ++LDLRAASAIRL LAKN+LAN+HGISTAK+LWEKL+ +YQ K I
Sbjct: 65  DSSGTSKD--DDEEWEDLDLRAASAIRLCLAKNVLANVHGISTAKDLWEKLEQLYQGKGI 121

Query: 126 SNRLYLKEQFHTL 139
            NRLYLKEQFHTL
Sbjct: 125 PNRLYLKEQFHTL 121

BLAST of Tan0007163 vs. ExPASy TrEMBL
Match: A0A6J1CG82 (uncharacterized protein LOC111010521 OS=Momordica charantia OX=3673 GN=LOC111010521 PE=4 SV=1)

HSP 1 Score: 214.2 bits (544), Expect = 3.5e-52
Identity = 114/138 (82.61%), Positives = 123/138 (89.13%), Query Frame = 0

Query: 1   MSSFMSPVKIDVEKFDGKMNFGLWQVQVKDVLIQSGLHKALKGRPSCGVLEKLSSDGDQT 60
           MS FMSPVKIDVEKFDG +NFGLWQVQVKDVLIQS LHKALKGRPS G  EKLS DG   
Sbjct: 295 MSFFMSPVKIDVEKFDGMINFGLWQVQVKDVLIQSELHKALKGRPSEGASEKLSDDGGPM 354

Query: 61  ESSGGSNRGSKKSSMSDEESDELDLRAASAIRLNLAKNILANLHGISTAKELWEKLDAVY 120
           ESSGGS+RGSKKSSMS E+ +E+DLRAASAIR +LAKNILAN+H ISTAKELWEKL+A+Y
Sbjct: 355 ESSGGSSRGSKKSSMSYEDWEEMDLRAASAIRTSLAKNILANVHRISTAKELWEKLEALY 414

Query: 121 QAKSISNRLYLKEQFHTL 139
           QAK ISNRLYLKEQFHTL
Sbjct: 415 QAKGISNRLYLKEQFHTL 432

BLAST of Tan0007163 vs. ExPASy TrEMBL
Match: A0A2G2YJH4 (Uncharacterized protein OS=Capsicum annuum OX=4072 GN=T459_25006 PE=3 SV=1)

HSP 1 Score: 152.1 bits (383), Expect = 1.7e-33
Identity = 81/135 (60.00%), Positives = 100/135 (74.07%), Query Frame = 0

Query: 4   FMSPVKIDVEKFDGKMNFGLWQVQVKDVLIQSGLHKALKGRPSCGVLEKLSSDGDQTESS 63
           +M   K ++EKFDG++NFGLWQVQVKDVLIQSGLHKALK RP+ G        GD+    
Sbjct: 245 YMMGTKFEIEKFDGRINFGLWQVQVKDVLIQSGLHKALKERPTSG-------KGDKDSKK 304

Query: 64  GGSNRGSKKSSMSDEESDELDLRAASAIRLNLAKNILANLHGISTAKELWEKLDAVYQAK 123
             S+  S+KS +SDEE +ELD++  S IRL LAK +L N+ G+ST K+LWEKL+ +YQ K
Sbjct: 305 FESSEDSEKSRISDEEWEELDMKVESQIRLCLAKYVLTNVIGLSTTKKLWEKLEELYQTK 364

Query: 124 SISNRLYLKEQFHTL 139
           SISNRLYLKEQFH L
Sbjct: 365 SISNRLYLKEQFHKL 372

BLAST of Tan0007163 vs. ExPASy TrEMBL
Match: A0A6A3CWI3 (CCHC-type domain-containing protein OS=Hibiscus syriacus OX=106335 GN=F3Y22_tig00001713pilonHSYRG00162 PE=4 SV=1)

HSP 1 Score: 149.4 bits (376), Expect = 1.1e-32
Identity = 79/130 (60.77%), Positives = 101/130 (77.69%), Query Frame = 0

Query: 9   KIDVEKFDGKMNFGLWQVQVKDVLIQSGLHKALKGRPSCGVLEKLSSDGDQTESSGGSNR 68
           + D+EKFDG++NFGLWQVQVKD+LIQSGL+KALKG+P+        S+G + +    S+ 
Sbjct: 4   RFDIEKFDGRINFGLWQVQVKDILIQSGLYKALKGKPAS------LSEGKEPDKPSDSSG 63

Query: 69  GSKKSSMSDEESDELDLRAASAIRLNLAKNILANLHGISTAKELWEKLDAVYQAKSISNR 128
              KS MS+EE +ELD+RAAS IRL LAKN+LAN+   S+ KELWEKL+ +YQAKS+SNR
Sbjct: 64  DKGKSKMSEEEWEELDMRAASQIRLCLAKNVLANVARWSSTKELWEKLEEMYQAKSLSNR 123

Query: 129 LYLKEQFHTL 139
           LYLKE+FH L
Sbjct: 124 LYLKEKFHKL 127

BLAST of Tan0007163 vs. ExPASy TrEMBL
Match: A0A6A2YS90 (Transcription initiation factor IIA subunit 2 OS=Hibiscus syriacus OX=106335 GN=F3Y22_tig00111273pilonHSYRG00172 PE=4 SV=1)

HSP 1 Score: 149.4 bits (376), Expect = 1.1e-32
Identity = 79/130 (60.77%), Positives = 101/130 (77.69%), Query Frame = 0

Query: 9   KIDVEKFDGKMNFGLWQVQVKDVLIQSGLHKALKGRPSCGVLEKLSSDGDQTESSGGSNR 68
           + D+EKFDG++NFGLWQVQVKD+LIQSGL+KALKG+P+        S+G + +    S+ 
Sbjct: 90  RFDIEKFDGRINFGLWQVQVKDILIQSGLYKALKGKPAS------LSEGKEPDKPSDSSG 149

Query: 69  GSKKSSMSDEESDELDLRAASAIRLNLAKNILANLHGISTAKELWEKLDAVYQAKSISNR 128
              KS MS+EE +ELD+RAAS IRL LAKN+LAN+   S+ KELWEKL+ +YQAKS+SNR
Sbjct: 150 DKGKSKMSEEEWEELDMRAASQIRLCLAKNVLANVARWSSTKELWEKLEEMYQAKSLSNR 209

Query: 129 LYLKEQFHTL 139
           LYLKE+FH L
Sbjct: 210 LYLKEKFHKL 213

BLAST of Tan0007163 vs. ExPASy TrEMBL
Match: A0A6A3BK59 (CCHC-type domain-containing protein OS=Hibiscus syriacus OX=106335 GN=F3Y22_tig00110050pilonHSYRG00143 PE=4 SV=1)

HSP 1 Score: 149.4 bits (376), Expect = 1.1e-32
Identity = 79/130 (60.77%), Positives = 101/130 (77.69%), Query Frame = 0

Query: 9   KIDVEKFDGKMNFGLWQVQVKDVLIQSGLHKALKGRPSCGVLEKLSSDGDQTESSGGSNR 68
           + D+EKFDG++NFGLWQVQVKD+LIQSGL+KALKG+P+        S+G + +    S+ 
Sbjct: 4   RFDIEKFDGRINFGLWQVQVKDILIQSGLYKALKGKPAS------LSEGKEPDKPSDSSG 63

Query: 69  GSKKSSMSDEESDELDLRAASAIRLNLAKNILANLHGISTAKELWEKLDAVYQAKSISNR 128
              KS MS+EE +ELD+RAAS IRL LAKN+LAN+   S+ KELWEKL+ +YQAKS+SNR
Sbjct: 64  DKGKSKMSEEEWEELDMRAASQIRLCLAKNVLANVARWSSTKELWEKLEEMYQAKSLSNR 123

Query: 129 LYLKEQFHTL 139
           LYLKE+FH L
Sbjct: 124 LYLKEKFHKL 127

The following BLAST results are available for this feature:

ExPASy Swiss-Prot
Match Name      E-value   Identity (%)  Description
P10978          1.1e-15   37.31         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...

NCBI nr
Match Name      E-value   Identity (%)  Description
XP_022139673.1  7.3e-52   82.61         uncharacterized protein LOC111010521 [Momordica charantia]
KAF5463231.1    3.0e-37   63.50         hypothetical protein F2P56_019161 [Juglans regia]
KAF3622233.1    4.3e-36   66.15         hypothetical protein FXO38_31400 [Capsicum annuum] >KAF3630407.1 hypothetical pr...
KAF7802225.1    5.6e-36   65.22         cytochrome p450 [Senna tora]
KAF5765959.1    3.6e-35   63.91         putative RNA-directed DNA polymerase [Helianthus annuus]

ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A6J1CG82      3.5e-52   82.61         uncharacterized protein LOC111010521 OS=Momordica charantia OX=3673 GN=LOC111010...
A0A2G2YJH4      1.7e-33   60.00         Uncharacterized protein OS=Capsicum annuum OX=4072 GN=T459_25006 PE=3 SV=1
A0A6A3CWI3      1.1e-32   60.77         CCHC-type domain-containing protein OS=Hibiscus syriacus OX=106335 GN=F3Y22_tig0...
A0A6A2YS90      1.1e-32   60.77         Transcription initiation factor IIA subunit 2 OS=Hibiscus syriacus OX=106335 GN=...
A0A6A3BK59      1.1e-32   60.77         CCHC-type domain-containing protein OS=Hibiscus syriacus OX=106335 GN=F3Y22_tig0...

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term    Source Description   Alignment
None      No IPR available  PFAM         PF14223        Retrotran_gag_2      coord: 83..138; e-value: 1.2E-10; score: 41.3
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 50..81
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 53..71
None      No IPR available  PANTHER      PTHR34676      FAMILY NOT NAMED     coord: 5..138
None      No IPR available  PANTHER      PTHR34676:SF6  SUBFAMILY NOT NAMED  coord: 5..138

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0007163.1  Tan0007163.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0044271      cellular nitrogen compound biosynthetic process
biological_process  GO:0010467      gene expression
biological_process  GO:0009059      macromolecule biosynthetic process
biological_process  GO:0044238      primary metabolic process
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0008270      zinc ion binding