Tan0022807 (gene) Snake gourd v1

Overview
Name: Tan0022807
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Reverse transcriptase
Location: LG02: 66950409 .. 66953698 (-)
RNA-Seq Expression: Tan0022807
Synteny: Tan0022807
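
The Location field can be split into its parts and turned into a span length. The following is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and it assumes both coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(text):
    """Split a location such as 'LG02: 66950409 .. 66953698 (-)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    # Sequence ID, start, end, strand
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def span_length(start, end):
    # Assumption: both coordinates are 1-based and inclusive.
    return end - start + 1


seqid, start, end, strand = parse_location("LG02: 66950409 .. 66953698 (-)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span on LG02 is 3290 bp on the minus strand.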
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATTGGAAGTTACTTTAAAGTTCAGGTCAAGTTCAGGATGCAGCAGGAAAACGAATATGGAAATAACATAAGCTTATTTTTAAGGAAGCACCTTTCGAAGTATTTACGGACATATCCTTTGGGAATGTCAATTTCGTTTATTCATATGTGTTTTAGAGGGTTCAAAAAGGAAGAGTTTGAGATAAAATACATCTTTTTTGGCTAAGTAATGAAGAAGTTATGAGTGTGTGAAATCAGGGTATGAAATCTGTAGAATGCTGGAAAATAGATAGAATTGTTTTTGTAATCCTATGGTTAGAATTCATGAGCTCAAGGATCTAATGCTTGGTTTTGGTTTGTTTGACACGCCAAGAAGAATTAGGTCGAGTATACGGGCCAAAGTTCTAGTTCTAACGCTGTGAGTGACAAAACAAACATCGAAAACGATTCTTATGCTTGAATTATGCTTAAGGAAATACAAGATTTTCCTAAGAATGAACTAGTTTAAATGCTTTATATTTTCTTTTGAACGCATACTTATGAGCATGCATGGTTTCCTAGAAATTATTTCGAGAGCTCGTTGTGAGGCTAGATTTTCCCAATGAAATTTATATGTGAGCATATGTTTATTGATTTACTAAAATTTAGTGGAAATATTATGATTTGAAGCAAGCATGTACTAGCCAGTATTATTTAATGATTTCAAGTTTTGACGAACAAGCTGAGTAAGCTTGAGTTTTACGAAAAGAGATAAGCAGGCATGTTAAAGTATTTTCATGTGTTTTTCTAAGTCTCGAATGAGTAAATGTCTGAGATATTGAGCCTGAGGTTGTATGGTACCGTGTGCACACAGGTCATTTTATATTGTCGACGTTGAGTGTACTCCGTGGCAACAATGTTGTCGTGAGTGTTGGGCGGGCCCCACTACGACAAAGACGATGAGAGTGTTGGGCAGACCCTACTACATCGTAGAGTAAACGTTGGTTGTACTGGGCGTGCCTTACACGACGTAGATTGCCATATGTTTTATAAGGGTTAATATTATCGATATGCCTTGCGAGATTTTAATGACATACTTGCTGGATTTCTAAAACAGCTATTATGGTTACACGTTTATGTTTAACGCTTGATTACAGATGCATATGCTCATAAATTATCAGGTGATATGATTTGCGAGTTAATAATTTTTAAACTCAGTCACTCATTGAGCTTTATAGCTCATTCTTTCAGTGTTTTCCAATTTTTTCAGGTAGAGATCGAGCTCCCGGTGCCTGATATCCTGCCATAATCTACTGTAAGCTCCACGAGTTTGATATTTTGTACGCGGACGGAGTTGTATAGAAAGTCCTTATGTATTGTAGGTTATTTTTAGGGTACTATGTAGTTATGTTTGTGTGTGCGTTAGAGGTTGTGGTTGTTTAAACCATGGTTGGATATGTTTTTTGTTGTGTTGCTCCTTACGCTTGTATAGTAATTGTCAGGGTTCCGCTGTATTATGTTTAAAGTATTTCATATACAAGTATAGTATGCTTATTAGGTACAACAGGGTCAACATGTATCGTTAGAGAGGTAAACGATATCTGTTGCCTTCACGCCGCCTGCTGGGCTAGATTAACAGGTAGTTCGGGAAGAGGTGTGCCAACTTGGTCTCAGAGCAGTTAGCTCCATGGGAAATGAAACAAAGCAAGTAAACTTTAAGAAGAAACTAAGAAATAAATAACAAGTTGTTGAGTTAGTTAAGATAGAACTAGTGAAACATGGTCCAAGTAAGGATAAGAGTAAGTCCAGTCAGGAATAGAATGTGAATACAAATTTAATATTGTGCATTTCAGTTAAGTATTCATTACATGTATAATATGTTGATATGTTATAGGAGTCATGCCACCACGTACTAGCAGACAATGCAGGCAGGATCAGGGCGGGACACAGGATCCTACCCAAGGCCAATATGGGAGGGGTTCTAGTGCCTCGAGAGTTCTGACTGGGGTCGAAAATAAAGAGCATGCTAGTTCCTCAGAG
GAGGTAGGTAGGCCAGAGACAGCAGGGCCAAGTGATCCAGAGAAAACATATGGAATAAAACGCCTGGAGAAATTAGGAGCCACAGTGTTTGGGGGTTCCACAGATCCAGCTGACGCCGAGGTTTGGTTGAATATGCTTGAGAAATGTTTTGATGTGATGAGTTGCCCGAAGGAGCGAAAAGTCAAGTTTGCCACATTCGTGCTGCAGAAGGAGGAAAAGGGATGGTGGAAATCAATATTAGCCAGGCGCAGGGATGCACGTACTTTATACTGGCAAACTTTCAGAAGCATATTCGAGGATAAGTATTACCCTAGCACGTGTCGCGAGGCAAAGAGGGATGAGTTTTTAGAGTTAAAGCAAGGGTCACTTTCAGTGGCTGAGTACGAGAGGAAGTATACCGAGTTCTCGCAGTATGCTGATGTGATTGTGGCATCCGAGAGTGACAGATGTCGAAGGTTTGAAAGAGGATTACGTCCTGAGATACGTACCCCAGTCACAGTTATTACTAAGTGGACTGACTTTTCTCAGCTAGTAGAGACTGCTCTACGTGTTGAGCAGAGTATAGCAGAGGAGAAGTCAGTAGTGGAGCCTAGTCGTGGGGCTTTGACAACAAGAAGTTTTCGAGGTCGTGAGCAGCGGAGGTTCACACCTGGCGTAAATGTTTCAAGTCGTCAAGACTTTAAGAATCGAGCTGGTCGCCAGGCATCGAGATAGATGAATGTGGGTGGTGCCTATCAAAGGCAAAGTCAGAGAGCACCTAGTCAGTCTACTAGATAAGCAACAAGACCACAGACAGGGCAAGAGTCTGATGCCAGTGTAGCAAGGAGAACTCCATGTGCGAATTGTGGCAAGAATCATCGAGGTCATTGCCTTGTGGGTGTCGGTGTGTGTTACCAGTGTGGAAAACCAAGGCATTTCAAGAGGGATTGTCCACAGTTGAGAGTAGCCGCATAGAGGAACTAGTGAGTTGAGTCCCAGACAGTTGAGCAGCTGAGAGTTCGAGTAGCTGCAGGAGAGGGCACCAACGGAGTGATTATCGGTACAGCAGGGGAAGGTCTACGCTATGACTCAACAGGAAGGTCTACGCTATGACTCAACAGGAAGCAGGGGAAGGTCTACGCTATGACTCAACAGGAAGCAGACAATGCACCAGATGTGATTATCGGTACGATTCCTATTTGTAATGTACCTGCACGTGTTTTAATAGATCCAGGTGCTACACATTCCTTTGTTTCTAGTATATTCCTAACCAAGCTGAATAGGAAGCTAAAGCCTTTACTGAGAGGTTGA

mRNA sequence

ATGATTGGAAGTTACTTTAAAGTTCAGGTCAAGTTCAGGATGCAGCAGGAAAACGAATATGGAAATAACATAAGCTTATTTTTAAGGAAGCACCTTTCGAAGTATTTACGGACATATCCTTTGGGAATGTCAATTTCGTTTATTCATATGTGTCATTTTATATTGTCGACGTTGAGTGTACTCCGTGGCAACAATGTTGTCGTGAGTGTTGGGCGGCTTTATAGCTCATTCTTTCAGTGTTTTCCAATTTTTTCAGGAGTCATGCCACCACGTACTAGCAGACAATGCAGGCAGGATCAGGGCGGGACACAGGATCCTACCCAAGGCCAATATGGGAGGGGTTCTAGTGCCTCGAGAGTTCTGACTGGGGTCGAAAATAAAGAGCATGCTAGTTCCTCAGAGGAGGTAGGTAGGCCAGAGACAGCAGGGCCAAGTGATCCAGAGAAAACATATGGAATAAAACGCCTGGAGAAATTAGGAGCCACAGTGTTTGGGGGTTCCACAGATCCAGCTGACGCCGAGGTTTGGCGCAGGGATGCACGTACTTTATACTGGCAAACTTTCAGAAGCATATTCGAGGATAAGTATTACCCTAGCACGTGTCGCGAGGCAAAGAGGGATGAGTTTTTAGAGTTAAAGCAAGGGTCACTTTCAGTGGCTGAGTACGAGAGGAAGTATACCGAGTTCTCGCAGTATGCTGATGTGATTGTGGCATCCGAGAGTGACAGATGTCGAAGGTTTGAAAGAGGATTACGTCCTGAGATACGTACCCCAGTCACAGTTATTACTAAGTGGACTGACTTTTCTCAGCTAGTAGAGACTGCTCTACGTGTTGAGCAGAGTATAGCAGAGGAGAAGTCAGTAGTGGAGCCTAGTCGTGGGGCTTTGACAACAAGAAGTTTTCGAGATCCAGGTGCTACACATTCCTTTGTTTCTAGTATATTCCTAACCAAGCTGAATAGGAAGCTAAAGCCTTTACTGAGAGGTTGA

Coding sequence (CDS)

ATGATTGGAAGTTACTTTAAAGTTCAGGTCAAGTTCAGGATGCAGCAGGAAAACGAATATGGAAATAACATAAGCTTATTTTTAAGGAAGCACCTTTCGAAGTATTTACGGACATATCCTTTGGGAATGTCAATTTCGTTTATTCATATGTGTCATTTTATATTGTCGACGTTGAGTGTACTCCGTGGCAACAATGTTGTCGTGAGTGTTGGGCGGCTTTATAGCTCATTCTTTCAGTGTTTTCCAATTTTTTCAGGAGTCATGCCACCACGTACTAGCAGACAATGCAGGCAGGATCAGGGCGGGACACAGGATCCTACCCAAGGCCAATATGGGAGGGGTTCTAGTGCCTCGAGAGTTCTGACTGGGGTCGAAAATAAAGAGCATGCTAGTTCCTCAGAGGAGGTAGGTAGGCCAGAGACAGCAGGGCCAAGTGATCCAGAGAAAACATATGGAATAAAACGCCTGGAGAAATTAGGAGCCACAGTGTTTGGGGGTTCCACAGATCCAGCTGACGCCGAGGTTTGGCGCAGGGATGCACGTACTTTATACTGGCAAACTTTCAGAAGCATATTCGAGGATAAGTATTACCCTAGCACGTGTCGCGAGGCAAAGAGGGATGAGTTTTTAGAGTTAAAGCAAGGGTCACTTTCAGTGGCTGAGTACGAGAGGAAGTATACCGAGTTCTCGCAGTATGCTGATGTGATTGTGGCATCCGAGAGTGACAGATGTCGAAGGTTTGAAAGAGGATTACGTCCTGAGATACGTACCCCAGTCACAGTTATTACTAAGTGGACTGACTTTTCTCAGCTAGTAGAGACTGCTCTACGTGTTGAGCAGAGTATAGCAGAGGAGAAGTCAGTAGTGGAGCCTAGTCGTGGGGCTTTGACAACAAGAAGTTTTCGAGATCCAGGTGCTACACATTCCTTTGTTTCTAGTATATTCCTAACCAAGCTGAATAGGAAGCTAAAGCCTTTACTGAGAGGTTGA

Protein sequence

MIGSYFKVQVKFRMQQENEYGNNISLFLRKHLSKYLRTYPLGMSISFIHMCHFILSTLSVLRGNNVVVSVGRLYSSFFQCFPIFSGVMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVENKEHASSSEEVGRPETAGPSDPEKTYGIKRLEKLGATVFGGSTDPADAEVWRRDARTLYWQTFRSIFEDKYYPSTCREAKRDEFLELKQGSLSVAEYERKYTEFSQYADVIVASESDRCRRFERGLRPEIRTPVTVITKWTDFSQLVETALRVEQSIAEEKSVVEPSRGALTTRSFRDPGATHSFVSSIFLTKLNRKLKPLLRG
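
The protein sequence above corresponds to the coding sequence read codon by codon. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code (NCBI translation table 1); the name `translate` and the choice to stop at the first stop codon are illustrative assumptions, not something the record specifies.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS above
print(translate("ATGATTGGAAGTTACTTTAAAGTT"))
```

The first eight codons of the CDS give `MIGSYFKV`, which matches the start of the protein sequence.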
Homology
BLAST of Tan0022807 vs. NCBI nr
Match: KAA0051980.1 (DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] >TYK04577.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa])

HSP 1 Score: 300.8 bits (769), Expect = 1.4e-77
Identity = 171/277 (61.73%), Positives = 199/277 (71.84%), Query Frame = 0

Query: 41  LGMSISFIHMCHFILSTLSVLRGNNVV--VSVG--RLYSSFFQCF--PIF---------S 100
           L ++  F+H+  F   T  V+ G ++V  VSVG  R+      C+   +F          
Sbjct: 89  LELNFEFVHVEIFYYFTNQVMSGLHLVFNVSVGSTRIVRGDNVCWLHAVFWAKTAGGPGG 148

Query: 101 GVMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVENKEHASSSEEVGRPETAGPS 160
           GVMPPRTSR+ RQ+Q   QDPTQGQ  RGSS  R      ++    S++E+GR E A PS
Sbjct: 149 GVMPPRTSRRRRQNQDRMQDPTQGQSERGSSTPRGQNEAGSERFVRSAQEIGRSERARPS 208

Query: 161 DPEKTYGIKRLEKLGATVFGGSTDPADAEVWRRDARTLYWQTFRSIFEDKYYPSTCREAK 220
           DPEK YGI+RL++LGATVF GSTD ADAEVW  DARTL WQTFR IFE+KYYP+TC EAK
Sbjct: 209 DPEKMYGIERLKELGATVFVGSTDLADAEVWHNDARTLDWQTFRGIFEEKYYPTTCCEAK 268

Query: 221 RDEFLELKQGSLSVAEYERKYTEFSQYADVIVASESDRCRRFERGLRPEIRTPVTVITKW 280
           RDEFLELKQGSLSVA+Y+RKYTE S YA+VI+ASESDRCRRFERGL  EIRTPVT I KW
Sbjct: 269 RDEFLELKQGSLSVAKYKRKYTELSWYAEVIMASESDRCRRFERGLHFEIRTPVTAIAKW 328

Query: 281 TDFSQLVETALRVEQSIAEEKSVVEPSRGALTTRSFR 303
           TDFSQL+ETALRVEQSI EEKS +E SRG  TT   R
Sbjct: 329 TDFSQLIETALRVEQSIVEEKSAMELSRGVSTTSRIR 365

BLAST of Tan0022807 vs. NCBI nr
Match: KAA0037515.1 (uncharacterized protein E6C27_scaffold277G001260 [Cucumis melo var. makuwa] >TYK01990.1 uncharacterized protein E5676_scaffold808G001020 [Cucumis melo var. makuwa])

HSP 1 Score: 291.2 bits (744), Expect = 1.1e-74
Identity = 158/232 (68.10%), Positives = 178/232 (76.72%), Query Frame = 0

Query: 84  FSGVMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVENKEHASSSEEVGRPETAG 143
           + GVMPPR+SR+ RQ++  TQDPTQGQ  RGSS  R      ++  A S++E+GRPE AG
Sbjct: 188 YQGVMPPRSSRRRRQNKDETQDPTQGQSERGSSTPRGQNEARSERFARSAQEIGRPEKAG 247

Query: 144 PSDPEKTYGIKRLEKLGATVFGGSTDPAD----AEVW-------RRDARTLYWQTFRSIF 203
           PSDPEK YGI+RL+KL ATVF GSTD AD    AE W       R DARTL WQTFR IF
Sbjct: 248 PSDPEKMYGIERLKKLEATVFEGSTDLADAKKEAEGWWKSIIARRNDARTLDWQTFRGIF 307

Query: 204 EDKYYPSTCREAKRDEFLELKQGSLSVAEYERKYTEFSQYADVIVASESDRCRRFERGLR 263
           E+KYYP+T  EAKRDEFLELKQGSLS+AEYERKYTE S+YA +I+ASESDRC RFERGLR
Sbjct: 308 EEKYYPTTYCEAKRDEFLELKQGSLSMAEYERKYTELSRYAGMIMASESDRCHRFERGLR 367

Query: 264 PEIRTPVTVITKWTDFSQLVETALRVEQSIAEEKSVVEPSRGALTTRSFRDP 305
            EIRTPVT I KWT+FSQLVETALRVEQSI EEKS +E SRG  TT   R P
Sbjct: 368 FEIRTPVTAIAKWTNFSQLVETALRVEQSIVEEKSAMELSRGVSTTSGIRGP 419

BLAST of Tan0022807 vs. NCBI nr
Match: KAA0041108.1 (reverse transcriptase [Cucumis melo var. makuwa])

HSP 1 Score: 283.5 bits (724), Expect = 2.3e-72
Identity = 160/263 (60.84%), Positives = 175/263 (66.54%), Query Frame = 0

Query: 80  CFPIFSGVMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVENKEHASSSEEVGRP 139
           C  +  GVMPPRT R+ RQ+Q G Q PTQG     SS   V  G  N++ A +++E+GR 
Sbjct: 450 CEVLVEGVMPPRTGRRRRQNQDGMQGPTQGPSVGESSTLGVRGGAGNEQFARTTQEIGRT 509

Query: 140 ETAGPSDPEKTYGIKRLEKLGATVFGGSTDPADAEVW----------------------- 199
           + A PSDPEK YGI+RL+KLGATVF GSTDPADAE W                       
Sbjct: 510 DRAEPSDPEKAYGIERLKKLGATVFEGSTDPADAENWLNMLEKCFDVMNCPEERKVRLAT 569

Query: 200 -----------------RRDARTLYWQTFRSIFEDKYYPSTCREAKRDEFLELKQGSLSV 259
                            R DAR L WQTFR IFEDKYYPST  EAKRDEFL LKQGSLSV
Sbjct: 570 FLLQKEAEGWWKSILARRSDARALDWQTFRGIFEDKYYPSTYCEAKRDEFLGLKQGSLSV 629

Query: 260 AEYERKYTEFSQYADVIVASESDRCRRFERGLRPEIRTPVTVITKWTDFSQLVETALRVE 303
           AEYERKYTE S+YADVI+ASESDRCRRFERGLR EIRTPVT I KWT+FSQLVETALRVE
Sbjct: 630 AEYERKYTELSRYADVIIASESDRCRRFERGLRFEIRTPVTAIAKWTNFSQLVETALRVE 689

BLAST of Tan0022807 vs. NCBI nr
Match: KAA0056353.1 (DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa])

HSP 1 Score: 281.6 bits (719), Expect = 8.9e-72
Identity = 153/216 (70.83%), Positives = 169/216 (78.24%), Query Frame = 0

Query: 87  VMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVENKEHASSSEEVGRPETAGPSD 146
           VMPPRTS++ RQ+Q GTQDPTQGQ  RGSS  R      ++  + S++E+GRPE AGPSD
Sbjct: 64  VMPPRTSKRHRQNQDGTQDPTQGQSERGSSTPRGQNEAGSERFSRSAQEIGRPEKAGPSD 123

Query: 147 PEKTYGIKRLEKLGATVFGGSTDPADAEVWRRDARTLYWQTFRSIFEDKYYPSTCREAKR 206
           PEK YGI+RL+KL ATVF GSTD ADAEVWR DARTL WQTFR IFE+KYYP+T  EAKR
Sbjct: 124 PEKMYGIERLKKLEATVFDGSTDLADAEVWRNDARTLDWQTFRGIFEEKYYPTTYCEAKR 183

Query: 207 DEFLELKQGSLSVAEYERKYTEFSQYADVIVASESDRCRRFERGLRPEIRTPVTVITKWT 266
           DEFLELKQ SLSV EYERK      YA++IVA ESDRC R ERGLR E RTPVT ITKW 
Sbjct: 184 DEFLELKQESLSVVEYERK------YAEMIVAFESDRCCRGERGLRFEKRTPVTAITKWM 243

Query: 267 DFSQLVETALRVEQSIAEEKSVVEPSRGALTTRSFR 303
           DFSQLVETALRVEQSI EEKSV+E SRG  TT   R
Sbjct: 244 DFSQLVETALRVEQSIVEEKSVMELSRGVSTTSGIR 273

BLAST of Tan0022807 vs. NCBI nr
Match: KAA0056684.1 (DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa])

HSP 1 Score: 281.6 bits (719), Expect = 8.9e-72
Identity = 159/258 (61.63%), Positives = 174/258 (67.44%), Query Frame = 0

Query: 85  SGVMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVENKEHASSSEEVGRPETAGP 144
           +GVMPPRT R+ RQ+Q G Q PTQG     SS   V  G  N++ A +++E+GR + A P
Sbjct: 247 AGVMPPRTGRRRRQNQDGMQGPTQGPSVGESSTLGVRGGAGNEQFARTTQEIGRTDRAEP 306

Query: 145 SDPEKTYGIKRLEKLGATVFGGSTDPADAEVW---------------------------- 204
           SDPEK YGI+RL+KLGATVF GSTDPADAE W                            
Sbjct: 307 SDPEKAYGIERLKKLGATVFEGSTDPADAENWLNMLEKCFDVMNCPEERKVRLATFLLQK 366

Query: 205 ------------RRDARTLYWQTFRSIFEDKYYPSTCREAKRDEFLELKQGSLSVAEYER 264
                       R DAR L WQTFR IFEDKYYPST  EAKRDEFL LKQGSLSVAEYER
Sbjct: 367 EAEGWWKSILARRSDARALDWQTFRGIFEDKYYPSTYCEAKRDEFLGLKQGSLSVAEYER 426

Query: 265 KYTEFSQYADVIVASESDRCRRFERGLRPEIRTPVTVITKWTDFSQLVETALRVEQSIAE 303
           KYTE S+YADVI+ASESDRCRRFERGLR EIRTPVT I KWT+FSQLVETALRVEQSI E
Sbjct: 427 KYTELSRYADVIIASESDRCRRFERGLRFEIRTPVTAIAKWTNFSQLVETALRVEQSITE 486

BLAST of Tan0022807 vs. ExPASy TrEMBL
Match: A0A5A7U9X4 (DNA/RNA polymerases superfamily protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold409G002130 PE=4 SV=1)

HSP 1 Score: 300.8 bits (769), Expect = 6.9e-78
Identity = 171/277 (61.73%), Positives = 199/277 (71.84%), Query Frame = 0

Query: 41  LGMSISFIHMCHFILSTLSVLRGNNVV--VSVG--RLYSSFFQCF--PIF---------S 100
           L ++  F+H+  F   T  V+ G ++V  VSVG  R+      C+   +F          
Sbjct: 89  LELNFEFVHVEIFYYFTNQVMSGLHLVFNVSVGSTRIVRGDNVCWLHAVFWAKTAGGPGG 148

Query: 101 GVMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVENKEHASSSEEVGRPETAGPS 160
           GVMPPRTSR+ RQ+Q   QDPTQGQ  RGSS  R      ++    S++E+GR E A PS
Sbjct: 149 GVMPPRTSRRRRQNQDRMQDPTQGQSERGSSTPRGQNEAGSERFVRSAQEIGRSERARPS 208

Query: 161 DPEKTYGIKRLEKLGATVFGGSTDPADAEVWRRDARTLYWQTFRSIFEDKYYPSTCREAK 220
           DPEK YGI+RL++LGATVF GSTD ADAEVW  DARTL WQTFR IFE+KYYP+TC EAK
Sbjct: 209 DPEKMYGIERLKELGATVFVGSTDLADAEVWHNDARTLDWQTFRGIFEEKYYPTTCCEAK 268

Query: 221 RDEFLELKQGSLSVAEYERKYTEFSQYADVIVASESDRCRRFERGLRPEIRTPVTVITKW 280
           RDEFLELKQGSLSVA+Y+RKYTE S YA+VI+ASESDRCRRFERGL  EIRTPVT I KW
Sbjct: 269 RDEFLELKQGSLSVAKYKRKYTELSWYAEVIMASESDRCRRFERGLHFEIRTPVTAIAKW 328

Query: 281 TDFSQLVETALRVEQSIAEEKSVVEPSRGALTTRSFR 303
           TDFSQL+ETALRVEQSI EEKS +E SRG  TT   R
Sbjct: 329 TDFSQLIETALRVEQSIVEEKSAMELSRGVSTTSRIR 365

BLAST of Tan0022807 vs. ExPASy TrEMBL
Match: A0A5D3BSA6 (Retrotrans_gag domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold808G001020 PE=4 SV=1)

HSP 1 Score: 291.2 bits (744), Expect = 5.4e-75
Identity = 158/232 (68.10%), Positives = 178/232 (76.72%), Query Frame = 0

Query: 84  FSGVMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVENKEHASSSEEVGRPETAG 143
           + GVMPPR+SR+ RQ++  TQDPTQGQ  RGSS  R      ++  A S++E+GRPE AG
Sbjct: 188 YQGVMPPRSSRRRRQNKDETQDPTQGQSERGSSTPRGQNEARSERFARSAQEIGRPEKAG 247

Query: 144 PSDPEKTYGIKRLEKLGATVFGGSTDPAD----AEVW-------RRDARTLYWQTFRSIF 203
           PSDPEK YGI+RL+KL ATVF GSTD AD    AE W       R DARTL WQTFR IF
Sbjct: 248 PSDPEKMYGIERLKKLEATVFEGSTDLADAKKEAEGWWKSIIARRNDARTLDWQTFRGIF 307

Query: 204 EDKYYPSTCREAKRDEFLELKQGSLSVAEYERKYTEFSQYADVIVASESDRCRRFERGLR 263
           E+KYYP+T  EAKRDEFLELKQGSLS+AEYERKYTE S+YA +I+ASESDRC RFERGLR
Sbjct: 308 EEKYYPTTYCEAKRDEFLELKQGSLSMAEYERKYTELSRYAGMIMASESDRCHRFERGLR 367

Query: 264 PEIRTPVTVITKWTDFSQLVETALRVEQSIAEEKSVVEPSRGALTTRSFRDP 305
            EIRTPVT I KWT+FSQLVETALRVEQSI EEKS +E SRG  TT   R P
Sbjct: 368 FEIRTPVTAIAKWTNFSQLVETALRVEQSIVEEKSAMELSRGVSTTSGIRGP 419

BLAST of Tan0022807 vs. ExPASy TrEMBL
Match: A0A5A7TDR2 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold128G00110 PE=4 SV=1)

HSP 1 Score: 283.5 bits (724), Expect = 1.1e-72
Identity = 160/263 (60.84%), Positives = 175/263 (66.54%), Query Frame = 0

Query: 80  CFPIFSGVMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVENKEHASSSEEVGRP 139
           C  +  GVMPPRT R+ RQ+Q G Q PTQG     SS   V  G  N++ A +++E+GR 
Sbjct: 450 CEVLVEGVMPPRTGRRRRQNQDGMQGPTQGPSVGESSTLGVRGGAGNEQFARTTQEIGRT 509

Query: 140 ETAGPSDPEKTYGIKRLEKLGATVFGGSTDPADAEVW----------------------- 199
           + A PSDPEK YGI+RL+KLGATVF GSTDPADAE W                       
Sbjct: 510 DRAEPSDPEKAYGIERLKKLGATVFEGSTDPADAENWLNMLEKCFDVMNCPEERKVRLAT 569

Query: 200 -----------------RRDARTLYWQTFRSIFEDKYYPSTCREAKRDEFLELKQGSLSV 259
                            R DAR L WQTFR IFEDKYYPST  EAKRDEFL LKQGSLSV
Sbjct: 570 FLLQKEAEGWWKSILARRSDARALDWQTFRGIFEDKYYPSTYCEAKRDEFLGLKQGSLSV 629

Query: 260 AEYERKYTEFSQYADVIVASESDRCRRFERGLRPEIRTPVTVITKWTDFSQLVETALRVE 303
           AEYERKYTE S+YADVI+ASESDRCRRFERGLR EIRTPVT I KWT+FSQLVETALRVE
Sbjct: 630 AEYERKYTELSRYADVIIASESDRCRRFERGLRFEIRTPVTAIAKWTNFSQLVETALRVE 689

BLAST of Tan0022807 vs. ExPASy TrEMBL
Match: A0A5A7UNA3 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold73G00100 PE=4 SV=1)

HSP 1 Score: 281.6 bits (719), Expect = 4.3e-72
Identity = 159/258 (61.63%), Positives = 174/258 (67.44%), Query Frame = 0

Query: 85  SGVMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVENKEHASSSEEVGRPETAGP 144
           +GVMPPRT R+ RQ+Q G Q PTQG     SS   V  G  N++ A +++E+GR + A P
Sbjct: 247 AGVMPPRTGRRRRQNQDGMQGPTQGPSVGESSTLGVRGGAGNEQFARTTQEIGRTDRAEP 306

Query: 145 SDPEKTYGIKRLEKLGATVFGGSTDPADAEVW---------------------------- 204
           SDPEK YGI+RL+KLGATVF GSTDPADAE W                            
Sbjct: 307 SDPEKAYGIERLKKLGATVFEGSTDPADAENWLNMLEKCFDVMNCPEERKVRLATFLLQK 366

Query: 205 ------------RRDARTLYWQTFRSIFEDKYYPSTCREAKRDEFLELKQGSLSVAEYER 264
                       R DAR L WQTFR IFEDKYYPST  EAKRDEFL LKQGSLSVAEYER
Sbjct: 367 EAEGWWKSILARRSDARALDWQTFRGIFEDKYYPSTYCEAKRDEFLGLKQGSLSVAEYER 426

Query: 265 KYTEFSQYADVIVASESDRCRRFERGLRPEIRTPVTVITKWTDFSQLVETALRVEQSIAE 303
           KYTE S+YADVI+ASESDRCRRFERGLR EIRTPVT I KWT+FSQLVETALRVEQSI E
Sbjct: 427 KYTELSRYADVIIASESDRCRRFERGLRFEIRTPVTAIAKWTNFSQLVETALRVEQSITE 486

BLAST of Tan0022807 vs. ExPASy TrEMBL
Match: A0A5A7UKD3 (DNA/RNA polymerases superfamily protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold186G00860 PE=4 SV=1)

HSP 1 Score: 281.6 bits (719), Expect = 4.3e-72
Identity = 153/216 (70.83%), Positives = 169/216 (78.24%), Query Frame = 0

Query: 87  VMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVENKEHASSSEEVGRPETAGPSD 146
           VMPPRTS++ RQ+Q GTQDPTQGQ  RGSS  R      ++  + S++E+GRPE AGPSD
Sbjct: 64  VMPPRTSKRHRQNQDGTQDPTQGQSERGSSTPRGQNEAGSERFSRSAQEIGRPEKAGPSD 123

Query: 147 PEKTYGIKRLEKLGATVFGGSTDPADAEVWRRDARTLYWQTFRSIFEDKYYPSTCREAKR 206
           PEK YGI+RL+KL ATVF GSTD ADAEVWR DARTL WQTFR IFE+KYYP+T  EAKR
Sbjct: 124 PEKMYGIERLKKLEATVFDGSTDLADAEVWRNDARTLDWQTFRGIFEEKYYPTTYCEAKR 183

Query: 207 DEFLELKQGSLSVAEYERKYTEFSQYADVIVASESDRCRRFERGLRPEIRTPVTVITKWT 266
           DEFLELKQ SLSV EYERK      YA++IVA ESDRC R ERGLR E RTPVT ITKW 
Sbjct: 184 DEFLELKQESLSVVEYERK------YAEMIVAFESDRCCRGERGLRFEKRTPVTAITKWM 243

Query: 267 DFSQLVETALRVEQSIAEEKSVVEPSRGALTTRSFR 303
           DFSQLVETALRVEQSI EEKSV+E SRG  TT   R
Sbjct: 244 DFSQLVETALRVEQSIVEEKSVMELSRGVSTTSGIR 273

The following BLAST results are available for this feature:

vs. NCBI nr
Match Name    E-value  Identity (%)  Description
KAA0051980.1  1.4e-77  61.73         DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] >TYK04577.1 D...
KAA0037515.1  1.1e-74  68.10         uncharacterized protein E6C27_scaffold277G001260 [Cucumis melo var. makuwa] >TYK...
KAA0041108.1  2.3e-72  60.84         reverse transcriptase [Cucumis melo var. makuwa]
KAA0056353.1  8.9e-72  70.83         DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]
KAA0056684.1  8.9e-72  61.63         DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]

vs. ExPASy TrEMBL
Match Name    E-value  Identity (%)  Description
A0A5A7U9X4    6.9e-78  61.73         DNA/RNA polymerases superfamily protein OS=Cucumis melo var. makuwa OX=1194695 G...
A0A5D3BSA6    5.4e-75  68.10         Retrotrans_gag domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 ...
A0A5A7TDR2    1.1e-72  60.84         Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold12...
A0A5A7UNA3    4.3e-72  61.63         Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold73...
A0A5A7UKD3    4.3e-72  70.83         DNA/RNA polymerases superfamily protein OS=Cucumis melo var. makuwa OX=1194695 G...
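
The Identity values in the BLAST summary are percentages derived from the match counts reported in each HSP header (for example, 171 identical positions out of 277 aligned positions). The following is a minimal sketch of that arithmetic; the helper name `percent` is hypothetical and the rounding to two decimals is an assumption chosen to match the figures shown.

```python
def percent(count, aligned_length):
    """Return count / aligned_length as a percentage, rounded to two decimals."""
    return round(100.0 * count / aligned_length, 2)


# Counts taken from the first NCBI nr hit (KAA0051980.1)
print(percent(171, 277))  # identity
print(percent(199, 277))  # positives
```

For the first hit this reproduces the reported 61.73 (identity) and 71.84 (positives).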
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description             Source       Source Term    Source Description                                           Alignment
IPR005162  Retrotransposon gag domain  PFAM         PF03732        Retrotrans_gag                                               coord: 184..253; e-value: 4.7E-10; score: 39.6
None       No IPR available            MOBIDB_LITE  mobidb-lite    disorder_prediction                                          coord: 92..127
None       No IPR available            MOBIDB_LITE  mobidb-lite    disorder_prediction                                          coord: 92..150
None       No IPR available            PANTHER      PTHR34482      DNA DAMAGE-INDUCIBLE PROTEIN 1-LIKE                          coord: 179..308
None       No IPR available            PANTHER      PTHR34482:SF4  POLYMERASES SUPERFAMILY PROTEIN, PUTATIVE ISOFORM 1-RELATED  coord: 179..308

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0022807.1  Tan0022807.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006259      DNA metabolic process
molecular_function  GO:0005488      binding
molecular_function  GO:0008233      peptidase activity