Tan0022773.1 (mRNA) Snake gourd v1

Overview
NameTan0022773.1
TypemRNA
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDUF2431 domain-containing protein
LocationLG02: 6787266 .. 6792703 (-)
Sequence length924
RNA-Seq ExpressionTan0022773.1
SyntenyTan0022773.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTTGCTACTTCTCTCGACTCGAAAGGTATGAACCGACCATATAATTATATGGTCGTAATAATTCAGGTCCCGTTTGATAACTCATTTTGTTTTTAGTTTTTTGTGTTTGAAAAATAAACTTATAAAGCACTAATTTCATCTATAAATTTATTGCTTTATTTTTTAAAACTAAAAAAAAGTATTTTATTTTTTAAATTTGCCTTATTAGCTTATGAAAAGAATTTGTGGGAAACGAGCATGATTTTAAAAAACAGAAAATCATATACAAAATGGTTATCAAACGAAGTCTCGTATTTTGGGATGATGCAAGAAGGATGAACCAACATCCACTTCTTTGCTTCACGCCATTCGATCGGATCGTCTTCAACTTCCCGCATGCTGGTTTTGACTTCGGAGAGCAAAACAACCTTCAAATTCTGTGAGTACATATCAACTATGATATTAGAATATTAAATTTATCGTACCCTATTAGTTTAAGTTCCTGGATTCATCCGTAGTTCAATATAGCATCAGGAAGTCCTATGTTTAAATCTTTGTAATGCCATTTATTTTTTAATTGATATTGATTACTATTTGTTGGAAGTGTTAACTTTTGAATTCAATGGTAGTTTAACAGAGTGGAGCTTTCTATTTGACAGACTTCATAGGGATGTGGTGAGAGGATTTTTGAGGAATGCAAAGGAAATGGTGGCTGAAAATGGTGAAGTTCACATCACTCACAAGACCTCATATCCTTACAGTGAATGGGAGATTGAGAAACTGGCAGAAAAGGTAGGTCTCATTCGAATCGAGAGAGCGTACTTTCGTAAAAAGGACTATCCAGGTTACGAAAATAAGAGAGGCGACCGGCCTAACAACGACGACACTTTCCCCATTGGATCATGCAGCACATTCAAGTTTGCAAAGTATTTGGATGCACATTCATGGTTCTTAAGGTTAGCACTTATTTAGTACACATAGCCACAGCTCAACTGGAAAATACCAGTTGGTCTATAATCTCTCTATTGTTGAGTATCTATCTACTTTCAGTTAGTGCAATCTTTTTACCTATAATCTGTTTAATCCAAGCAAAGTGGAAGCAAAATGTCTGATGTCCAAGGCTTGTACTTGGCTATAATTGGCCCATAGGTCTTTTTGAAGAGCAAAATGGGGCCTTGTGCTGGTTTGTTCATTAAAAAATGCTAGACAATGTATTTAATTAACCAACATCAATGGCTGGATTGTAATAATTGGAAGACAAATCATGTGATGCCTCATAACTTCATGTCTCCAACTAGGCCCCAGCCAATAGTTTTGAGAATACTTCTTTTATCATCTGTATAAATAAACACCTGATATGGTTTATATGCAACCACAAGCCTTCAGCTTCCCATTCAAAAGAAGCATTTCTTCTTTCTAAAACGCGTTCTTCGATCTTCTTGTTGCTCTAAGTATACAAATTTTATAGTACAAACATGGGGCTTCGTCAACTTCTCAAAAGAAACCAGGGCGCTTCAGCAATTCCTAAAGGCTATTGTGCAGTCTATGTTGGAGAGAGCCAAAAGAAGCGGTTTGTGATCCCGATAACTTACTTGAATCAACCATGTTTTCAAGAGTTGCTTTGTCAAACGGAAGAAGAATTTGGTTACTATCATCCCATGGGTGGTCTTACTATTCATTGTAGAGACGATATCTTCACCGATCTCATCTCCCATTTGAATGACTTATGAGAGATGCACTAACAATTGTAGTATACAATCAATCAGGTAGAATAGAGATGTAACAAATAACCAAAGCCATGTACACTCTTTTTTTTTTTTTCATTTTCTTCCCTTTTTGTTTCCTTCAGTGGGGAGTAGAACAATGTTTCCCTGCATAAGGAAATTACCAATGATGAAAATGAAAAGGTTGATTCCTTGAAAAAGTACTGTTCTCTTTATTTTATTGTTAATTTTATCTTCTGTTTATACAAATCAATTATATTGATGAATCTTAATTTTCAAAGTTGGAGTTGTTAGATAAACCATCTATATTAACTTTGGTCAAGCTGTATCAAACAAGTATTTGTAAGGATATATATCCAAAAACATGATTGAGAGATTTCATATTTCAAATTTGGGTAAAAAATCAAATGCATAAGGATATAGAAATTCATAATCAATAGCAGGTCCTCTAAATGGACATGGTATGACCAAATAACAATATGCTTTAACATGGGTAAATTTGACCATCAATTCAAATGGTTTCTTCAATTTGGCCATCTCAGTTGGTGAAAAGTTTTACTTTTTTTTTCAAAATAATCATACTAGTTGCTATCACTTTTGAGAAAGTTATTGAGCTAACATTAGAATCTTTGAATCATATATATTGTTGATTTGTCAAAGCTAAGCAAAGGACTACTTTAAAAATTAGGCCCCTTAACTCGAATTTTGTTTACGTTTTTGTCGGTTGAGCAATTAACTTAGACTTATCTTCATACTCTGCAAGATATAGTATTTCCTCCATTTAATGCAATCTTACTTATAATTTGTGTAATCTAAAGAGAGTGGAAGCAGAATGTGTGATGTCCAAGGCTTGTCCTTGGCTAAAACTGGTCCACATTTTCTAATCTCCATTTTCAGAAGTCTTTTCTTTAAGAGCTAAATGGGGTCTTGTATTTAGTTTCTTCACTGAAAATGAAAGACAATGTCTTTACAAACATCACTGAGGTGGATAATGGTAGTTGAAAGACAAAGCATGTGATGCCTTATAGGATCATGTCTTCTACTAGGCCCTATCCAATAGCTATGAGAGTGTACCTTCTTGTTATTTATATAAATAGATATTTGATGGTTTAGAAACAACCATAAGTCTTCACATTCTCATTCAAAAGAAACTTTCTTCTCTCGAAAACTTGTTCCTCGATCTTCTCATTAGTCTAAGTATATTTGATTCTATTGTACAACATGGGCATTCGCCAACTTTTTAAAAGAAACCAAGGAGTTTCTACAATTCCTAAGGGCTATTGTGCAGTCTATGTTGGAGAGAGCCAAAAGAAGCGATTTGTGATTCCGATTACTTACTTGAATCAACCGTGCTTTCAAGAGTTGCTTTGTCAGACTGAAGAAGAATTCGGTCACCATCATCCCATGGGTGGTCTTACTATTCATTGTAGAGACGACATCTTCACCGATCTCATCTCTCGTTTGAATGAACTATGAGAGATGCACGAGCATTTGTAGTATATACATTTAATCTTGTAGAATAGGGACATTAGAGACAACCAAAACATTGTACATTTTTTTTTGTTTCCTTCATTGGGGAGTAAAAATGTTCCCCCACTAAGGAAATTATGAATGATGAAAATGCAATGACTACAACTTTACTCGAAATTGAAGAGGTTCAATCATTCCTTAACAAAATATTCTCCTTATTTTATGGTTATGCTTCTGCTATGGATGGATGTAAACCATTGTCATTCACCCTTTTTTGGTGTGGTTGCATCAAAAAAGCATTTGTAATGATAGAAATAATAATAAAAAAAATGATTGTGAGATTTCATATTTCAAATTTGGGTCAAATTTTGAAGGTCATTCTGTATTTGCAAAAGGATATAATATGTATTGCTCATAATGGTAGGCCAAGATTGGCCCTCCAAAAGAACACAGTATGGCCAAATAGCCATATGTTTTTCTCAATTGCCCTGTCAGTGTAGATTTCACCACCAATTCAAAGGGTCTCCTCCATTTAGACTGTTTAACTTTGTTCTAGATAATAACATTAGTTTTGAAAGTGATGATTCATAAGTTTCATCTCTTCCACTGGGACCATTCAATAATTTTGAAAGTGCCTCTTATCATCTATATAAATATACATTTGATATAGTTTAGAAGCAACCACAAGTCTTCATATTCCCATTCAAAAGAAACTTTCATCTCACAAAAACGCGTTTTAGTTCTAGTTGCTCCAAGTATATTGACTGTACCGTTCAACATGGGACTTCTTAAAAGAGGCCAAGGAGTTTCATGTTGGTGAGAGCCAAAAGAAGAGATTTGTGATCCCAATAACCACTACAAAAAAAAAAAAAGAAGACCTATTTCGACAATTTTTTATCGAAGGTTCTAAAAAAACCTTTGAGAAATATATACATTTATCGAAAGTATTTAAAAAACCTTTGAAAAATATTCACTTTTTATCGAAAGTTATTAAAAGCCTTTGAAAAATATGTGTATTTATCAAAAGATTTTAAAACACCTTTGAAAAATATATGATCATTTATCGAAGGTATTTAAAAACATTTGAAAAATATCTACTTTTTAACGAAGGTTTTTAAAAACCTTTGAAAAAAATATTATATTTAACAAAGGACATTAAGAACCTTTGAAAATAATATGATCCTTTATCGAAGGTATTTAAAAACCTTTGAAAATATATATATTTTTTAACGAAGATTTTTTAAAACCTTTGAAAAAAATATTATATTTATCAAAGGACATTAAAAACCTTTGAAAATAATAGATCATTTATCGAAAGTATTTAAAAACTCTTTGAAAAATATCAACTATGTATCGAAGGTTATTAAAAACCTTTGAAAAAGATATTGTATTTATCAAAGGTAACTAAGAACGTTTGAAAATAATATGTTCTTTTATCGAAAGCATTTAAAAAACCTTTGAAAAATAACTCTTATATATTATATTTAACTTTTTATCGAAGGTTCTTAAAACCCTTGAAAAATATATTATATTTAACCTTTTATCGAAGGTTATTGAAAATCCTTGAAAAAGATATTATATTAAATATTTTAATATTTAAATATATTAAAAACAAAAATCATAGTCACGAATGTTGAAGGACATATATCAATGGTGGTGATGTAGTTTAAAAAGAAGTGAAGTGAAGGGCATAGTGGTGATTTAGGATATAATAAAAAAAAAATGTTCAGTCGGATTGAAGGGCATAATTAAAAAAAAAAAAAAGAGAGGTTGATTTAGGGTTGAAGTGAAGGGCATAGTGGTATTTTAGGGTTGATTTAGCTTGTTGGTGTTCATCCCCTTCCTTCTCTTCTTCTCCATAGTCACGATACCACACCTTTCCCAACCTCTAAACCGCCTTGCACCAGTTGTTGGAGACAGGCTCAGTCGCCGGATCTTCGCCCTCACGCCAGATCTTCTTCGACCAATCGTCGGTTAGATTTCGCCCACTCGCACGCTGGAGAAAAGTTCTCCGGCAGTTCTCCTTCGATCCATGTCATTCTCCTGAAACTTAAGCTCATACTTGCACTCCCGCTGCAACCACCACCGGACTTCGGACCTGCTCAACGTCGCAACCTGCACAGTTCGTTGGATCTGACTCCTTCATTGGTGTTGTTGTCGCGTCGTCGTTCTCCCCTGTTATCGCGCGTACTAGAAGAGAAGGAAAACACAGGTCTCGACTCAACTCGTTCTCCCCTGTTCCCGCATTATCTTGGCTACTAA

mRNA sequence

ATGTTTGCTACTTCTCTCGACTCGAAAGAATTTGTGGGAAACGAGCATGATTTTAAAAAACAGAAAATCATATACAAAATGGTTATCAAACGAAGTCTCGTATTTTGGGATGATGCAAGAAGGATGAACCAACATCCACTTCTTTGCTTCACGCCATTCGATCGGATCGTCTTCAACTTCCCGCATGCTGGTTTTGACTTCGGAGAGCAAAACAACCTTCAAATTCTACTTCATAGGGATGTGGTGAGAGGATTTTTGAGGAATGCAAAGGAAATGGTGGCTGAAAATGGTGAAGTTCACATCACTCACAAGACCTCATATCCTTACAGTGAATGGGAGATTGAGAAACTGGCAGAAAAGGTAGGTCTCATTCGAATCGAGAGAGCGTACTTTCGTAAAAAGGACTATCCAGGTTACGAAAATAAGAGAGGCGACCGGCCTAACAACGACGACACTTTCCCCATTGGATCATGCAGCACATTCAAGTTTGCAAAGTATTTGGATGCACATTCATGGTTCTTAAGTCACGATACCACACCTTTCCCAACCTCTAAACCGCCTTGCACCAGTTGTTGGAGACAGGCTCAGTCGCCGGATCTTCGCCCTCACGCCAGATCTTCTTCGACCAATCGTCGGTTAGATTTCGCCCACTCGCACGCTGGAGAAAAGTTCTCCGGCAGTTCTCCTTCGATCCATGTCATTCTCCTGAAACTTAAGCTCATACTTGCACTCCCGCTGCAACCACCACCGGACTTCGGACCTGCTCAACGTCGCAACCTGCACAGTTCGTTGGATCTGACTCCTTCATTGGTGTTGTTGTCGCGTCGTCGTTCTCCCCTGTTATCGCGCGTACTAGAAGAGAAGGAAAACACAGGTCTCGACTCAACTCGTTCTCCCCTGTTCCCGCATTATCTTGGCTACTAA

Coding sequence (CDS)

ATGTTTGCTACTTCTCTCGACTCGAAAGAATTTGTGGGAAACGAGCATGATTTTAAAAAACAGAAAATCATATACAAAATGGTTATCAAACGAAGTCTCGTATTTTGGGATGATGCAAGAAGGATGAACCAACATCCACTTCTTTGCTTCACGCCATTCGATCGGATCGTCTTCAACTTCCCGCATGCTGGTTTTGACTTCGGAGAGCAAAACAACCTTCAAATTCTACTTCATAGGGATGTGGTGAGAGGATTTTTGAGGAATGCAAAGGAAATGGTGGCTGAAAATGGTGAAGTTCACATCACTCACAAGACCTCATATCCTTACAGTGAATGGGAGATTGAGAAACTGGCAGAAAAGGTAGGTCTCATTCGAATCGAGAGAGCGTACTTTCGTAAAAAGGACTATCCAGGTTACGAAAATAAGAGAGGCGACCGGCCTAACAACGACGACACTTTCCCCATTGGATCATGCAGCACATTCAAGTTTGCAAAGTATTTGGATGCACATTCATGGTTCTTAAGTCACGATACCACACCTTTCCCAACCTCTAAACCGCCTTGCACCAGTTGTTGGAGACAGGCTCAGTCGCCGGATCTTCGCCCTCACGCCAGATCTTCTTCGACCAATCGTCGGTTAGATTTCGCCCACTCGCACGCTGGAGAAAAGTTCTCCGGCAGTTCTCCTTCGATCCATGTCATTCTCCTGAAACTTAAGCTCATACTTGCACTCCCGCTGCAACCACCACCGGACTTCGGACCTGCTCAACGTCGCAACCTGCACAGTTCGTTGGATCTGACTCCTTCATTGGTGTTGTTGTCGCGTCGTCGTTCTCCCCTGTTATCGCGCGTACTAGAAGAGAAGGAAAACACAGGTCTCGACTCAACTCGTTCTCCCCTGTTCCCGCATTATCTTGGCTACTAA

Protein sequence

MFATSLDSKEFVGNEHDFKKQKIIYKMVIKRSLVFWDDARRMNQHPLLCFTPFDRIVFNFPHAGFDFGEQNNLQILLHRDVVRGFLRNAKEMVAENGEVHITHKTSYPYSEWEIEKLAEKVGLIRIERAYFRKKDYPGYENKRGDRPNNDDTFPIGSCSTFKFAKYLDAHSWFLSHDTTPFPTSKPPCTSCWRQAQSPDLRPHARSSSTNRRLDFAHSHAGEKFSGSSPSIHVILLKLKLILALPLQPPPDFGPAQRRNLHSSLDLTPSLVLLSRRRSPLLSRVLEEKENTGLDSTRSPLFPHYLGY
Homology
BLAST of Tan0022773.1 vs. ExPASy Swiss-Prot
Match: F4I1X0 (Heavy metal-associated isoprenylated plant protein 41 OS=Arabidopsis thaliana OX=3702 GN=HIPP41 PE=3 SV=1)

HSP 1 Score: 142.5 bits (358), Expect = 7.9e-33
Identity = 69/161 (42.86%), Postives = 98/161 (60.87%), Query Frame = 0

Query: 3   ATSLDSKEFVGNEHDFKKQKIIYKMVIKRSLVFWDDARRMNQHPLLCFTPFDRIVFNFPH 62
           A+SLDS + V  ++   +  +     +   L+   DA  ++ HP L +  FDR++FNFPH
Sbjct: 56  ASSLDSYDVVVRKYKKARSNLKTLKRLGALLLHGVDATTLHFHPDLRYRRFDRVIFNFPH 115

Query: 63  AGFDFGEQNNLQILLHRDVVRGFLRNAKEMVAENGEVHITHKTSYPYSEWEIEKLAEKVG 122
           AGF   E ++  I  HR++V GF   A  ++  NGEVH++HK   P+SEW +E+LA +  
Sbjct: 116 AGFHGRESDSSLIRKHRELVFGFFNGASRLLRANGEVHVSHKNKAPFSEWNLEELASRCF 175

Query: 123 LIRIERAYFRKKDYPGYENKRGDRPNNDDTFPIGSCSTFKF 164
           L+ I+R  F K +YPGYENKRGD    D  F +G CSTFKF
Sbjct: 176 LVLIQRVAFEKNNYPGYENKRGDGRRCDQPFLLGECSTFKF 216

BLAST of Tan0022773.1 vs. ExPASy Swiss-Prot
Match: P0C8L4 (Uncharacterized protein At4g26485 OS=Arabidopsis thaliana OX=3702 GN=At4g26485 PE=4 SV=1)

HSP 1 Score: 129.4 bits (324), Expect = 6.9e-29
Identity = 72/169 (42.60%), Postives = 103/169 (60.95%), Query Frame = 0

Query: 3   ATSLDSKEFVGNEHDFKKQKIIYKMVIKR---SLVFWDDARRMNQHPLLCFTPFDRIVFN 62
           ATSLDS++ +  ++      I    ++KR    +    D   M+    L    +DRIVFN
Sbjct: 43  ATSLDSEDELSIKYMDAVDNI---NILKRYGCDIQHEVDVHTMSFDNSLSLQRYDRIVFN 102

Query: 63  FPHAGFDF--GEQNNLQILLHRDVVRGFLRNAKEMVAENGEVHITHKTSYPYSEWEIEKL 122
           FPHAG  F   E ++  I  H+++VRGFL NAKEM+ E+GE+HITHKT+YP+S+W I+KL
Sbjct: 103 FPHAGSRFFGRELSSRAIESHKELVRGFLENAKEMLEEDGEIHITHKTTYPFSDWGIKKL 162

Query: 123 AEKVGLIRIERAYFRKKDYPGYENKRGD-RPNNDDTFPIGSCSTFKFAK 166
            +  GL  ++++ F    YPGY  KRG     +DD FP+G CST+ F +
Sbjct: 163 GKGEGLKLLKKSKFELSHYPGYITKRGSGGRRSDDYFPVGECSTYMFTQ 208

BLAST of Tan0022773.1 vs. ExPASy Swiss-Prot
Match: P40493 (25S rRNA (uridine(2634)-N(3))-methyltransferase OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=BMT5 PE=1 SV=1)

HSP 1 Score: 51.6 bits (122), Expect = 1.8e-05
Identity = 41/154 (26.62%), Postives = 64/154 (41.56%), Query Frame = 0

Query: 52  PFDRIVFNFPHAGFDFGEQNNLQILLHRDVVRGFLRN----------------------- 111
           P   IVFNFPH G    +Q    I  H+D++  F +N                       
Sbjct: 170 PLQNIVFNFPHNGKGIKDQER-NIREHQDLIFNFFQNSLQLFNLINTKIQNDTLRYTQGY 229

Query: 112 --------AKEMVAEN-GEVHITHKTSYPYSEWEIEKLAEKVGLIRIERAYFRKKDYPGY 171
                   AK++ AE  G + ++     PY  W+I+ LA+K GL     + F+ +++PGY
Sbjct: 230 DLNEDTPQAKKLTAEGYGNIILSLFDGEPYDSWQIKLLAKKNGLTLSRSSKFQWENFPGY 289

BLAST of Tan0022773.1 vs. NCBI nr
Match: KAF5185807.1 (hypothetical protein FRX31_024606 [Thalictrum thalictroides])

HSP 1 Score: 189.5 bits (480), Expect = 4.3e-44
Identity = 90/165 (54.55%), Postives = 117/165 (70.91%), Query Frame = 0

Query: 1   MFATSLDSKEFVGNEHDFKKQKIIYKMVIKRSLVFWDDARRMNQHPLLCFTPFDRIVFNF 60
           M ATSLD++  V  +H   K  +     ++ ++    DA  M+ HPLL    FDRIVFNF
Sbjct: 40  MVATSLDNRAMVIVKHPTAKANLETLENLQCTIFHEVDAHTMSTHPLLKTMKFDRIVFNF 99

Query: 61  PHAGFDF-GEQNNLQILLHRDVVRGFLRNAKEMVAENGEVHITHKTSYPYSEWEIEKLAE 120
           PHAGF + GE N LQI LH++V+RGF RNA+ M+  NGE+H+THKT+YP+S+WE+EKL E
Sbjct: 100 PHAGFYYRGEHNQLQIQLHQEVLRGFFRNARNMLTINGEIHVTHKTAYPFSKWEVEKLGE 159

Query: 121 KVGLIRIERAYFRKKDYPGYENKRGDRPNNDDTFPIGSCSTFKFA 165
           + GL  +E+  F K DYPGYENK+GD  N+D TFP+G CSTFKFA
Sbjct: 160 EAGLYLVEKVKFTKYDYPGYENKKGDGLNSDGTFPVGECSTFKFA 204

BLAST of Tan0022773.1 vs. NCBI nr
Match: XP_028081873.1 (uncharacterized protein At4g26485 [Camellia sinensis])

HSP 1 Score: 184.1 bits (466), Expect = 1.8e-42
Identity = 85/165 (51.52%), Postives = 119/165 (72.12%), Query Frame = 0

Query: 1   MFATSLDSKEFVGNEHDFKKQKIIYKMVIKRSLVFWDDARRMNQHPLLCFTPFDRIVFNF 60
           M ATSLDS+E +  ++      +     +  +++   D   M+ HP L F  FDRIVFNF
Sbjct: 52  MVATSLDSQESLMMKYPTASDNLKQLQDLGCTILHEIDGTTMSLHPRLGFQLFDRIVFNF 111

Query: 61  PHAGFDFGEQNNLQILLHRDVVRGFLRNAKEMVAENGEVHITHKTSYPYSEWEIEKLAEK 120
           PHAGF   E +++QI+LH+DVV+GFL +A  M+ +NGEVH+THKT++P++ WEI+KLAE+
Sbjct: 112 PHAGFTLSEHSSIQIMLHQDVVKGFLSSASGMLTDNGEVHVTHKTAHPFNLWEIDKLAEE 171

Query: 121 VGLIRIERAYFRKKDYPGYENKRGDRPNNDDTFPIGSCSTFKFAK 166
           VGL  +E A+F + DYPGY NKRGD   ++DTFP+G+C+TFKFAK
Sbjct: 172 VGLCLVEEAWFSRYDYPGYVNKRGDGHRSNDTFPVGACTTFKFAK 216

BLAST of Tan0022773.1 vs. NCBI nr
Match: THG08732.1 (hypothetical protein TEA_017395 [Camellia sinensis var. sinensis])

HSP 1 Score: 184.1 bits (466), Expect = 1.8e-42
Identity = 85/165 (51.52%), Postives = 119/165 (72.12%), Query Frame = 0

Query: 1   MFATSLDSKEFVGNEHDFKKQKIIYKMVIKRSLVFWDDARRMNQHPLLCFTPFDRIVFNF 60
           M ATSLDS+E +  ++      +     +  +++   D   M+ HP L F  FDRIVFNF
Sbjct: 70  MVATSLDSQESLMMKYPTASDNLKQLQDLGCTILHEIDGTTMSLHPRLGFQLFDRIVFNF 129

Query: 61  PHAGFDFGEQNNLQILLHRDVVRGFLRNAKEMVAENGEVHITHKTSYPYSEWEIEKLAEK 120
           PHAGF   E +++QI+LH+DVV+GFL +A  M+ +NGEVH+THKT++P++ WEI+KLAE+
Sbjct: 130 PHAGFTLSEHSSIQIMLHQDVVKGFLSSASGMLTDNGEVHVTHKTAHPFNLWEIDKLAEE 189

Query: 121 VGLIRIERAYFRKKDYPGYENKRGDRPNNDDTFPIGSCSTFKFAK 166
           VGL  +E A+F + DYPGY NKRGD   ++DTFP+G+C+TFKFAK
Sbjct: 190 VGLCLVEEAWFSRYDYPGYVNKRGDGHRSNDTFPVGACTTFKFAK 234

BLAST of Tan0022773.1 vs. NCBI nr
Match: KAG7032604.1 (hypothetical protein SDJN02_06653, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 181.8 bits (460), Expect = 8.9e-42
Identity = 86/165 (52.12%), Postives = 117/165 (70.91%), Query Frame = 0

Query: 1   MFATSLDSKEFVGNEHDFKKQKIIYKMVIKRSLVFWDDARRMNQHPLLCFTPFDRIVFNF 60
           M ATSLDS E +  ++      +    ++  +++   DA  M++H  LC+  FDRIVFNF
Sbjct: 52  MVATSLDSNEVLLRKYSRVAANLEALGLLGGTVLHEVDATAMSRHCSLCYKEFDRIVFNF 111

Query: 61  PHAGFDFGEQNNLQILLHRDVVRGFLRNAKEMVAENGEVHITHKTSYPYSEWEIEKLAEK 120
           PHAGF F E + +QI LH+D+VRGFLR A+++V++ GE+HITHK SYPY EWEIE+LA+K
Sbjct: 112 PHAGFSFRETDAIQIKLHQDLVRGFLREARKLVSDKGEIHITHKISYPYCEWEIEELAKK 171

Query: 121 VGLIRIERAYFRKKDYPGYENKRGDRPNNDDTFPIGSCSTFKFAK 166
            GL+  E A F   DYP YENKRG+  N+D TFP+G+C+TFKF +
Sbjct: 172 EGLLLKETAEFSLWDYPNYENKRGEGGNSDHTFPVGACATFKFVR 216

BLAST of Tan0022773.1 vs. NCBI nr
Match: KAG7024996.1 (hypothetical protein SDJN02_13816, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 181.4 bits (459), Expect = 1.2e-41
Identity = 89/166 (53.61%), Postives = 117/166 (70.48%), Query Frame = 0

Query: 3   ATSLDSKEFVGNEHDFKKQKIIYKM-VIKRSLVFWDDARRMNQHPLLCFTPFDRIVFNFP 62
           ATSLDS+E +  ++    +  + ++  +  S++   D   M+QHPLLC T FDRIVFNFP
Sbjct: 44  ATSLDSEETLLRKYGSDIKTTLEELKELGCSVMHGVDVTTMSQHPLLCHTLFDRIVFNFP 103

Query: 63  HAGFDFGEQNNLQILLHRDVVRGFLRNAKEMVAENGEVHITHKTSYPYSEWEIEKLAEKV 122
           HAGF + E    QI LH+++VR FLRNAKE++AENG++HITHK S+PYSEWEIE++AE+ 
Sbjct: 104 HAGFQYSEHETRQIKLHQNLVRRFLRNAKELLAENGDIHITHKISFPYSEWEIEEVAEEE 163

Query: 123 GLIRIERAYFRKKDYPGYENKRGDRPNNDDTFPIGSCSTFKFAKYL 168
            L   E   F   DYPGY NK+G  PN++ TFP+G CSTFKF K L
Sbjct: 164 DLFLRELVEFNIGDYPGYVNKKGSGPNSNLTFPVGLCSTFKFVKTL 209

BLAST of Tan0022773.1 vs. ExPASy TrEMBL
Match: A0A7J6VM26 (DUF2431 domain-containing protein OS=Thalictrum thalictroides OX=46969 GN=FRX31_024606 PE=4 SV=1)

HSP 1 Score: 189.5 bits (480), Expect = 2.1e-44
Identity = 90/165 (54.55%), Postives = 117/165 (70.91%), Query Frame = 0

Query: 1   MFATSLDSKEFVGNEHDFKKQKIIYKMVIKRSLVFWDDARRMNQHPLLCFTPFDRIVFNF 60
           M ATSLD++  V  +H   K  +     ++ ++    DA  M+ HPLL    FDRIVFNF
Sbjct: 40  MVATSLDNRAMVIVKHPTAKANLETLENLQCTIFHEVDAHTMSTHPLLKTMKFDRIVFNF 99

Query: 61  PHAGFDF-GEQNNLQILLHRDVVRGFLRNAKEMVAENGEVHITHKTSYPYSEWEIEKLAE 120
           PHAGF + GE N LQI LH++V+RGF RNA+ M+  NGE+H+THKT+YP+S+WE+EKL E
Sbjct: 100 PHAGFYYRGEHNQLQIQLHQEVLRGFFRNARNMLTINGEIHVTHKTAYPFSKWEVEKLGE 159

Query: 121 KVGLIRIERAYFRKKDYPGYENKRGDRPNNDDTFPIGSCSTFKFA 165
           + GL  +E+  F K DYPGYENK+GD  N+D TFP+G CSTFKFA
Sbjct: 160 EAGLYLVEKVKFTKYDYPGYENKKGDGLNSDGTFPVGECSTFKFA 204

BLAST of Tan0022773.1 vs. ExPASy TrEMBL
Match: A0A4S4DZ19 (DUF2431 domain-containing protein OS=Camellia sinensis var. sinensis OX=542762 GN=TEA_017395 PE=4 SV=1)

HSP 1 Score: 184.1 bits (466), Expect = 8.7e-43
Identity = 85/165 (51.52%), Postives = 119/165 (72.12%), Query Frame = 0

Query: 1   MFATSLDSKEFVGNEHDFKKQKIIYKMVIKRSLVFWDDARRMNQHPLLCFTPFDRIVFNF 60
           M ATSLDS+E +  ++      +     +  +++   D   M+ HP L F  FDRIVFNF
Sbjct: 70  MVATSLDSQESLMMKYPTASDNLKQLQDLGCTILHEIDGTTMSLHPRLGFQLFDRIVFNF 129

Query: 61  PHAGFDFGEQNNLQILLHRDVVRGFLRNAKEMVAENGEVHITHKTSYPYSEWEIEKLAEK 120
           PHAGF   E +++QI+LH+DVV+GFL +A  M+ +NGEVH+THKT++P++ WEI+KLAE+
Sbjct: 130 PHAGFTLSEHSSIQIMLHQDVVKGFLSSASGMLTDNGEVHVTHKTAHPFNLWEIDKLAEE 189

Query: 121 VGLIRIERAYFRKKDYPGYENKRGDRPNNDDTFPIGSCSTFKFAK 166
           VGL  +E A+F + DYPGY NKRGD   ++DTFP+G+C+TFKFAK
Sbjct: 190 VGLCLVEEAWFSRYDYPGYVNKRGDGHRSNDTFPVGACTTFKFAK 234

BLAST of Tan0022773.1 vs. ExPASy TrEMBL
Match: A0A6J1IKX9 (uncharacterized protein At4g26485 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111476560 PE=4 SV=1)

HSP 1 Score: 180.6 bits (457), Expect = 9.6e-42
Identity = 90/169 (53.25%), Postives = 119/169 (70.41%), Query Frame = 0

Query: 1   MFATSLDSKEFVGNEHDFKKQKIIYKM-VIKRSLVFWDDARRMNQHPLLCFTPFDRIVFN 60
           M ATSLDS+E +  ++    +  + ++  +  S++   D   M+QH LLC T FDRIVFN
Sbjct: 42  MVATSLDSEETLLRKYGSDIKTTLEELKELGCSVIHGVDVATMSQHLLLCHTWFDRIVFN 101

Query: 61  FPHAGFDFGEQNNL-QILLHRDVVRGFLRNAKEMVAENGEVHITHKTSYPYSEWEIEKLA 120
           FPHAGF +  ++   QI LH+++VR FLRNAKE++AENGE+HITHK SYPYSEW+IEK+A
Sbjct: 102 FPHAGFQYSREHETGQIKLHQNLVRSFLRNAKELLAENGEIHITHKISYPYSEWKIEKIA 161

Query: 121 EKVGLIRIERAYFRKKDYPGYENKRGDRPNNDDTFPIGSCSTFKFAKYL 168
           E+ GL   E   F K DYP Y+NK+G  PN++ TFP+G C TFKF K L
Sbjct: 162 EEEGLFLREEVEFDKWDYPCYDNKKGSGPNSNRTFPVGLCCTFKFVKTL 210

BLAST of Tan0022773.1 vs. ExPASy TrEMBL
Match: A0A6I9U587 (uncharacterized protein At4g26485-like OS=Sesamum indicum OX=4182 GN=LOC105175164 PE=4 SV=1)

HSP 1 Score: 179.5 bits (454), Expect = 2.1e-41
Identity = 93/165 (56.36%), Postives = 109/165 (66.06%), Query Frame = 0

Query: 1   MFATSLDSKEFVGNEHDFKKQKIIYKMVIKRSLVFWDDARRMNQHPLLCFTPFDRIVFNF 60
           M ATSLDS E +   H      +        +++   DA  M +HPLL    FDRIVFNF
Sbjct: 39  MVATSLDSPEMLRINHPSSVSNLDLLEEKGCTIIHKVDACYMCEHPLLSHRKFDRIVFNF 98

Query: 61  PHAGFDFGEQNNLQILLHRDVVRGFLRNAKEMVAENGEVHITHKTSYPYSEWEIEKLAEK 120
           PHAGF   E N  QI LH+DVVRGFL+NA EMV E GEVHITHKTS+P+SEW+IE+LA +
Sbjct: 99  PHAGFYGPEHNAYQISLHQDVVRGFLKNAYEMVREEGEVHITHKTSHPFSEWKIEELAAE 158

Query: 121 VGLIRIERAYFRKKDYPGYENKRGDRPNNDDTFPIGSCSTFKFAK 166
           VG    E   F   DYP YENKRGD   +DDTFP+G CSTFKF+K
Sbjct: 159 VGFYLSEEVDFFIWDYPEYENKRGDGSRSDDTFPVGRCSTFKFSK 203

BLAST of Tan0022773.1 vs. ExPASy TrEMBL
Match: A0A6J1H6C3 (uncharacterized protein At4g26485-like OS=Cucurbita moschata OX=3662 GN=LOC111460425 PE=4 SV=1)

HSP 1 Score: 176.8 bits (447), Expect = 1.4e-40
Identity = 85/169 (50.30%), Postives = 117/169 (69.23%), Query Frame = 0

Query: 1   MFATSLDSKEFVGNEHDFKKQKIIYKMVIKRSLVFWDDARRMNQHPLLCFTPFDRIVFNF 60
           M ATSLDS E +  ++      +    ++  +++   DA  M++H  LC+  FDRI+FNF
Sbjct: 52  MVATSLDSNEVLLRKYSRVAANLEALGLLGGTVLHEVDATAMSRHCSLCYKEFDRIIFNF 111

Query: 61  PHAGFDFGEQNNLQILLHRDVVRGFLRNAKEMVAENGEVHITHKTSYPYSEWEIEKLAEK 120
           PHAGF F E + +QI LH+D+VRGFLR A+++V++ GE+HITHK S+PY EWEIEKLA+K
Sbjct: 112 PHAGFSFPESDAVQIKLHQDLVRGFLREARKLVSDKGEIHITHKISHPYCEWEIEKLAKK 171

Query: 121 VGLIRIERAYFRKKDYPGYENKRGDRPNNDDTFP---IGSCSTFKFAKY 167
            GL+  E A F + DYP YENKRG   N+D  FP   +G+C+TFKF +Y
Sbjct: 172 EGLLLKETAEFSRWDYPNYENKRGGGGNSDHAFPVGAVGACATFKFVRY 220

BLAST of Tan0022773.1 vs. TAIR 10
Match: AT1G55790.1 (Domain of unknown function (DUF2431) )

HSP 1 Score: 142.5 bits (358), Expect = 5.6e-34
Identity = 69/161 (42.86%), Postives = 98/161 (60.87%), Query Frame = 0

Query: 3   ATSLDSKEFVGNEHDFKKQKIIYKMVIKRSLVFWDDARRMNQHPLLCFTPFDRIVFNFPH 62
           A+SLDS + V  ++   +  +     +   L+   DA  ++ HP L +  FDR++FNFPH
Sbjct: 56  ASSLDSYDVVVRKYKKARSNLKTLKRLGALLLHGVDATTLHFHPDLRYRRFDRVIFNFPH 115

Query: 63  AGFDFGEQNNLQILLHRDVVRGFLRNAKEMVAENGEVHITHKTSYPYSEWEIEKLAEKVG 122
           AGF   E ++  I  HR++V GF   A  ++  NGEVH++HK   P+SEW +E+LA +  
Sbjct: 116 AGFHGRESDSSLIRKHRELVFGFFNGASRLLRANGEVHVSHKNKAPFSEWNLEELASRCF 175

Query: 123 LIRIERAYFRKKDYPGYENKRGDRPNNDDTFPIGSCSTFKF 164
           L+ I+R  F K +YPGYENKRGD    D  F +G CSTFKF
Sbjct: 176 LVLIQRVAFEKNNYPGYENKRGDGRRCDQPFLLGECSTFKF 216

BLAST of Tan0022773.1 vs. TAIR 10
Match: AT4G26485.1 (Domain of unknown function (DUF2431) )

HSP 1 Score: 129.4 bits (324), Expect = 4.9e-30
Identity = 72/169 (42.60%), Postives = 103/169 (60.95%), Query Frame = 0

Query: 3   ATSLDSKEFVGNEHDFKKQKIIYKMVIKR---SLVFWDDARRMNQHPLLCFTPFDRIVFN 62
           ATSLDS++ +  ++      I    ++KR    +    D   M+    L    +DRIVFN
Sbjct: 5   ATSLDSEDELSIKYMDAVDNI---NILKRYGCDIQHEVDVHTMSFDNSLSLQRYDRIVFN 64

Query: 63  FPHAGFDF--GEQNNLQILLHRDVVRGFLRNAKEMVAENGEVHITHKTSYPYSEWEIEKL 122
           FPHAG  F   E ++  I  H+++VRGFL NAKEM+ E+GE+HITHKT+YP+S+W I+KL
Sbjct: 65  FPHAGSRFFGRELSSRAIESHKELVRGFLENAKEMLEEDGEIHITHKTTYPFSDWGIKKL 124

Query: 123 AEKVGLIRIERAYFRKKDYPGYENKRGD-RPNNDDTFPIGSCSTFKFAK 166
            +  GL  ++++ F    YPGY  KRG     +DD FP+G CST+ F +
Sbjct: 125 GKGEGLKLLKKSKFELSHYPGYITKRGSGGRRSDDYFPVGECSTYMFTQ 170

BLAST of Tan0022773.1 vs. TAIR 10
Match: AT5G56060.1 (Domain of unknown function (DUF2431) )

HSP 1 Score: 109.4 bits (272), Expect = 5.3e-24
Identity = 60/153 (39.22%), Postives = 95/153 (62.09%), Query Frame = 0

Query: 3   ATSLDSKEFVGNEHDFKKQKIIYKMVIKRSLVFWDDARRMNQHPLLCFTPFDRIVFNFPH 62
           ATSLD++E +G ++   K  +    +   ++V   +   M+    L    +DRI+FNFPH
Sbjct: 44  ATSLDTREELGIKYTDGKANVEGLELFGCTVVHGVNVHSMSSDYRL--GRYDRIIFNFPH 103

Query: 63  AGFDFGEQNNL-QILLHRDVVRGFLRNAKEMVA-ENGEVHITHKTSYPYSEWEIEKLAEK 122
           +G  FG ++++  I+LH+ +VRGFL +A++M+  E+GE+H+THKT+ P++ W IE LA +
Sbjct: 104 SGLGFGSEHDIFFIMLHQGLVRGFLESARKMLKDEDGEIHVTHKTTDPFNRWGIETLAGE 163

Query: 123 VGLIRIERAYFRKKDYPGYENKRGDRPNNDDTF 154
            GL  I    F K  +PGY NK+G   N + TF
Sbjct: 164 KGLRLIGEIEFHKWAFPGYSNKKGGGSNCNSTF 194

BLAST of Tan0022773.1 vs. TAIR 10
Match: AT5G56075.1 (Domain of unknown function (DUF2431) )

HSP 1 Score: 96.3 bits (238), Expect = 4.6e-20
Identity = 47/116 (40.52%), Postives = 67/116 (57.76%), Query Frame = 0

Query: 53  FDRIVFNFPHAGFDFGEQNNLQILLHRDVVRGFLRNAKEMVAE---NGEVHITHKTSYPY 112
           +DR++FNFP                  ++VRGF+++A+ +V +    GE+H+ HKT YP+
Sbjct: 153 YDRVIFNFP----------------THELVRGFMKSARVLVKDEDKGGEIHVIHKTEYPF 212

Query: 113 SEWEIEKLAEKVGLIRIERAYFRKKDYPGYENKRGDRPNNDDTFPIGSCSTFKFAK 166
           SEW+++ L EK GL  I    F    YPGY NKRG    +D +FP+G  STF F K
Sbjct: 213 SEWKLKTLGEKEGLDLIREVEFCLSHYPGYFNKRGSGGYSDSSFPVGKSSTFMFTK 252

BLAST of Tan0022773.1 vs. TAIR 10
Match: AT5G25030.1 (Domain of unknown function (DUF2431) )

HSP 1 Score: 87.4 bits (215), Expect = 2.1e-17
Identity = 56/164 (34.15%), Postives = 84/164 (51.22%), Query Frame = 0

Query: 3   ATSLDSKEFVGNEHDFKKQKIIYKMVIKRSLVFWDDARRMNQHPLLCFTPFDRIVFNFPH 62
           A SLD +E +G  ++  K  +     +  ++V   +   M     L    +D I+FNFPH
Sbjct: 44  AISLDIREDLGRNYNNGKGNVEELERLGCTVVRGVNVHSMKSDDRLAH--YDIIIFNFPH 103

Query: 63  AGFDFGEQNNLQILLHRDVVRGFLRNAKEMVA-ENGEVHITHKTSYPYSEWEIEKLAEKV 122
           AG                V  GF+ +A+EM+  E+GE+HIT  T  P+++W+++ LAE+ 
Sbjct: 104 AG------------KRNKVFGGFMESAREMMKDEDGEIHITLNTLNPFNKWDLKALAEES 163

Query: 123 GLIRIERAYFRKKDYPGYENKRGDRPNNDDTFPIGSCSTFKFAK 166
           GL  I+R  F K  +P   NKR    N D  +PIGS  T+ F K
Sbjct: 164 GLRLIQRMQFIKWAFPSSSNKRESGSNCDFIYPIGSAITYMFKK 193

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
F4I1X07.9e-3342.86Heavy metal-associated isoprenylated plant protein 41 OS=Arabidopsis thaliana OX... [more]
P0C8L46.9e-2942.60Uncharacterized protein At4g26485 OS=Arabidopsis thaliana OX=3702 GN=At4g26485 P... [more]
P404931.8e-0526.6225S rRNA (uridine(2634)-N(3))-methyltransferase OS=Saccharomyces cerevisiae (str... [more]
Match NameE-valueIdentityDescription
KAF5185807.14.3e-4454.55hypothetical protein FRX31_024606 [Thalictrum thalictroides][more]
XP_028081873.11.8e-4251.52uncharacterized protein At4g26485 [Camellia sinensis][more]
THG08732.11.8e-4251.52hypothetical protein TEA_017395 [Camellia sinensis var. sinensis][more]
KAG7032604.18.9e-4252.12hypothetical protein SDJN02_06653, partial [Cucurbita argyrosperma subsp. argyro... [more]
KAG7024996.11.2e-4153.61hypothetical protein SDJN02_13816, partial [Cucurbita argyrosperma subsp. argyro... [more]
Match NameE-valueIdentityDescription
A0A7J6VM262.1e-4454.55DUF2431 domain-containing protein OS=Thalictrum thalictroides OX=46969 GN=FRX31_... [more]
A0A4S4DZ198.7e-4351.52DUF2431 domain-containing protein OS=Camellia sinensis var. sinensis OX=542762 G... [more]
A0A6J1IKX99.6e-4253.25uncharacterized protein At4g26485 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC1... [more]
A0A6I9U5872.1e-4156.36uncharacterized protein At4g26485-like OS=Sesamum indicum OX=4182 GN=LOC10517516... [more]
A0A6J1H6C31.4e-4050.30uncharacterized protein At4g26485-like OS=Cucurbita moschata OX=3662 GN=LOC11146... [more]
Match NameE-valueIdentityDescription
AT1G55790.15.6e-3442.86Domain of unknown function (DUF2431) [more]
AT4G26485.14.9e-3042.60Domain of unknown function (DUF2431) [more]
AT5G56060.15.3e-2439.22Domain of unknown function (DUF2431) [more]
AT5G56075.14.6e-2040.52Domain of unknown function (DUF2431) [more]
AT5G25030.12.1e-1734.15Domain of unknown function (DUF2431) [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR019446Domain of unknown function DUF2431PFAMPF10354DUF2431coord: 2..143
e-value: 1.2E-27
score: 97.2
NoneNo IPR availablePANTHERPTHR11538:SF80SUBFAMILY NOT NAMEDcoord: 37..166
NoneNo IPR availablePANTHERPTHR11538PHENYLALANYL-TRNA SYNTHETASEcoord: 37..166
IPR029063S-adenosyl-L-methionine-dependent methyltransferaseSUPERFAMILY53335S-adenosyl-L-methionine-dependent methyltransferasescoord: 34..138

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Tan0022773Tan0022773gene


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
Tan0022773.1-exonTan0022773.1-exon-LG02:6787266..6787665exon
Tan0022773.1-exonTan0022773.1-exon-LG02:6791766..6792062exon
Tan0022773.1-exonTan0022773.1-exon-LG02:6792283..6792481exon
Tan0022773.1-exonTan0022773.1-exon-LG02:6792676..6792703exon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Tan0022773.1-cdsTan0022773.1-cds-LG02:6787266..6787665CDS
Tan0022773.1-cdsTan0022773.1-cds-LG02:6791766..6792062CDS
Tan0022773.1-cdsTan0022773.1-cds-LG02:6792283..6792481CDS
Tan0022773.1-cdsTan0022773.1-cds-LG02:6792676..6792703CDS


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Tan0022773.1Tan0022773.1-proteinpolypeptide