Tan0011974 (gene) Snake gourd v1

Overview
Name: Tan0011974
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Location: LG02: 12353791 .. 12354519 (-)
RNA-Seq Expression: Tan0011974
Synteny: Tan0011974
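
The Location field gives a start and an end coordinate on LG02 together with a strand. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome feature pages, but not stated here), the span of the gene can be computed and compared against the length of the gene sequence listed under Sequences. The function name `span_length` is hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field (LG02, minus strand).
gene_length = span_length(12353791, 12354519)
print(gene_length)
```

The strand does not affect the length; it only indicates that the listed sequence is the reverse complement of the reference strand.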
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATGTCAAGACCGCCTTTCTGAATGGCAAACTTGACGAGACCATCTACATGGACCAGCCCAAGGGGTTCATTACCCAGGGCCAAGAGCAAAAGGTTTGCCCGCTTCATAGGTCTATTTATGGACTGAAACAAGCCTCGAGGTCTTGGAATATAAGGTTTGATGAGGCGATCAAATCTTATGGCTTTGATCAGAATGTTGACGAGCCTTGTGTCTACAAGAAAATCGTCAACAACACTGTCGCATTTCTGATATTGTATGTGGATGATATCCTTCTCATTGGGAAGGAGGTAGGATTTCTTACTAACGTAAAGAAATGGCTGGCTTCACAATTCTAAATAAACGATTTGGGAGAAGCACAATATGTTCTAGGTATCCAGATAGTCTGGAACCGGAGAAACAGAACGCTAGCCATGTCTCAGGCATCTTATATTGACAAGATGTTGTCTAGATATAAGATGAAGAACTTCAAGAAGGGCTTGTTGCCTTTCAGGCATGTGGTTCACCTGTCTAAGAATCAATGTCCTAAGACTCCTCAAGAGGTTGAGGATATGAGACGAATCTCTTATGCTTCAGCTGTTGGGAGCCTTATGTATGCCATGCTGTGTACTAGACCCGACATCTGTTATGCAGTTGGGATTGTCAATAGGTATCAATCCAATCCAAGATTAGATCACTGGACAACCGTAAAGACAATCCTCAAGTATCTTAGGAGAACGAGGAACTGA

mRNA sequence

ATGGATGTCAAGACCGCCTTTCTGAATGGCAAACTTGACGAGACCATCTACATGGACCAGCCCAAGGGGTTCATTACCCAGGGCCAAGAGCAAAAGGTTTGCCCGCTTCATAGGTCTATTTATGGACTGAAACAAGCCTCGAGGTCTTGGAATATAAGGTTTGATGAGGCGATCAAATCTTATGGCTTTGATCAGAATGTTGACGAGCCTTGTGTCTACAAGAAAATCGTCAACAACACTGTCGCATTTCTGATATTGTATGTGGATGATATCCTTCTCATTGGGAAGGAGATAGTCTGGAACCGGAGAAACAGAACGCTAGCCATGTCTCAGGCATCTTATATTGACAAGATGTTGTCTAGATATAAGATGAAGAACTTCAAGAAGGGCTTGTTGCCTTTCAGGCATGTGGTTCACCTGTCTAAGAATCAATGTCCTAAGACTCCTCAAGAGGTTGAGGATATGAGACGAATCTCTTATGCTTCAGCTGTTGGGAGCCTTATGTATGCCATGCTGTGTACTAGACCCGACATCTGTTATGCAGTTGGGATTGTCAATAGGTATCAATCCAATCCAAGATTAGATCACTGGACAACCGTAAAGACAATCCTCAAGTATCTTAGGAGAACGAGGAACTGA
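
The gene sequence and the mRNA sequence differ by a single internal segment, the intron. The sketch below recovers that segment by trimming the longest shared prefix and suffix. It assumes exactly one intron and otherwise identical sequences, and it uses short excerpts around the junction rather than the full sequences. The function name `find_single_intron` is hypothetical, and splice junctions located this way can be ambiguous by a few bases when intron and exon ends share nucleotides.

```python
def find_single_intron(gene: str, mrna: str) -> str:
    """Return the gene segment missing from the mRNA, assuming one intron."""
    # Length of the longest common prefix.
    prefix = 0
    while prefix < len(mrna) and gene[prefix] == mrna[prefix]:
        prefix += 1
    # Length of the longest common suffix, capped so that it cannot
    # overlap the prefix already consumed in the mRNA.
    suffix = 0
    max_suffix = len(mrna) - prefix
    while suffix < max_suffix and gene[-1 - suffix] == mrna[-1 - suffix]:
        suffix += 1
    return gene[prefix:len(gene) - suffix]


# Excerpts copied from the gene and mRNA sequences around the junction.
gene_excerpt = (
    "GGGAAGGAG"
    "GTAGGATTTCTTACTAACGTAAAGAAATGG"
    "CTGGCTTCACAATTCTAAATAAACGATTTG"
    "GGAGAAGCACAATATGTTCTAGGTATCCAG"
    "ATAGTCTGGAAC"
)
mrna_excerpt = "GGGAAGGAGATAGTCTGGAAC"

intron = find_single_intron(gene_excerpt, mrna_excerpt)
print(len(intron), intron[:2], intron[-2:])
```

In these excerpts the recovered segment begins with GT and ends with AG, the dinucleotides typical of canonical introns.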

Coding sequence (CDS)

ATGGATGTCAAGACCGCCTTTCTGAATGGCAAACTTGACGAGACCATCTACATGGACCAGCCCAAGGGGTTCATTACCCAGGGCCAAGAGCAAAAGGTTTGCCCGCTTCATAGGTCTATTTATGGACTGAAACAAGCCTCGAGGTCTTGGAATATAAGGTTTGATGAGGCGATCAAATCTTATGGCTTTGATCAGAATGTTGACGAGCCTTGTGTCTACAAGAAAATCGTCAACAACACTGTCGCATTTCTGATATTGTATGTGGATGATATCCTTCTCATTGGGAAGGAGATAGTCTGGAACCGGAGAAACAGAACGCTAGCCATGTCTCAGGCATCTTATATTGACAAGATGTTGTCTAGATATAAGATGAAGAACTTCAAGAAGGGCTTGTTGCCTTTCAGGCATGTGGTTCACCTGTCTAAGAATCAATGTCCTAAGACTCCTCAAGAGGTTGAGGATATGAGACGAATCTCTTATGCTTCAGCTGTTGGGAGCCTTATGTATGCCATGCTGTGTACTAGACCCGACATCTGTTATGCAGTTGGGATTGTCAATAGGTATCAATCCAATCCAAGATTAGATCACTGGACAACCGTAAAGACAATCCTCAAGTATCTTAGGAGAACGAGGAACTGA

Protein sequence

MDVKTAFLNGKLDETIYMDQPKGFITQGQEQKVCPLHRSIYGLKQASRSWNIRFDEAIKSYGFDQNVDEPCVYKKIVNNTVAFLILYVDDILLIGKEIVWNRRNRTLAMSQASYIDKMLSRYKMKNFKKGLLPFRHVVHLSKNQCPKTPQEVEDMRRISYASAVGSLMYAMLCTRPDICYAVGIVNRYQSNPRLDHWTTVKTILKYLRRTRN
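
The protein sequence corresponds to a codon-by-codon translation of the CDS. The sketch below illustrates this on the first ten codons and the last four codons of the CDS, assuming the standard genetic code (the page does not state which translation table was used). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGATGTCAAGACCGCCTTTCTGAATGGC"  # first 30 nt of the CDS
cds_end = "ACGAGGAACTGA"                      # last 12 nt of the CDS
print(translate(cds_start), translate(cds_end))
```

Passing the full CDS to `translate` should yield the complete protein sequence if the standard code applies.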
Homology
BLAST of Tan0011974 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 208.8 bits (530), Expect = 6.2e-53
Identity = 107/241 (44.40%), Positives = 148/241 (61.41%), Query Frame = 0

Query: 1    MDVKTAFLNGKLDETIYMDQPKGFITQGQEQKVCPLHRSIYGLKQASRSWNIRFDEAIKS 60
            +DVKTAFL+G L+E IYM+QP+GF   G++  VC L++S+YGLKQA R W ++FD  +KS
Sbjct: 921  LDVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKS 980

Query: 61   YGFDQNVDEPCVY-KKIVNNTVAFLILYVDDILLIGKE---------------------- 120
              + +   +PCVY K+   N    L+LYVDD+L++GK+                      
Sbjct: 981  QTYLKTYSDPCVYFKRFSENNFIILLLYVDDMLIVGKDKGLIAKLKGDLSKSFDMKDLGP 1040

Query: 121  --------IVWNRRNRTLAMSQASYIDKMLSRYKMKNFKKGLLPFRHVVHLSKNQCPKTP 180
                    IV  R +R L +SQ  YI+++L R+ MKN K    P    + LSK  CP T 
Sbjct: 1041 AQQILGMKIVRERTSRKLWLSQEKYIERVLERFNMKNAKPVSTPLAGHLKLSKKMCPTTV 1100

Query: 181  QEVEDMRRISYASAVGSLMYAMLCTRPDICYAVGIVNRYQSNPRLDHWTTVKTILKYLRR 211
            +E  +M ++ Y+SAVGSLMYAM+CTRPDI +AVG+V+R+  NP  +HW  VK IL+YLR 
Sbjct: 1101 EEKGNMAKVPYSSAVGSLMYAMVCTRPDIAHAVGVVSRFLENPGKEHWEAVKWILRYLRG 1160
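
Each HSP header reports identity and positive counts as a fraction of the alignment length, followed by a percentage. As a minimal sketch, assuming the percentage is simply the count divided by the alignment length (gap columns included) and rounded to two decimals, the figures for the P10978 hit can be reproduced. The function name `percent_of_alignment` is hypothetical.

```python
def percent_of_alignment(count: int, alignment_length: int) -> float:
    """Share of alignment columns, as a percentage rounded to two decimals."""
    return round(100.0 * count / alignment_length, 2)


# Counts from the P10978 HSP header: 107 identical and 148 positive
# columns out of 241 alignment columns.
identity = percent_of_alignment(107, 241)
positives = percent_of_alignment(148, 241)
print(f"{identity:.2f} {positives:.2f}")
```

Positives count identical residues plus substitutions that score above zero in the scoring matrix, so they are never fewer than the identities.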

BLAST of Tan0011974 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 122.9 bits (307), Expect = 4.5e-27
Identity = 82/244 (33.61%), Positives = 123/244 (50.41%), Query Frame = 0

Query: 1    MDVKTAFLNGKLDETIYMDQPKGFITQGQEQKVCPLHRSIYGLKQASRSWNIRFDEAIKS 60
            MDVKTAFLNG L E IYM  P+G         VC L+++IYGLKQA+R W   F++A+K 
Sbjct: 1001 MDVKTAFLNGTLKEEIYMRLPQGI--SCNSDNVCKLNKAIYGLKQAARCWFEVFEQALKE 1060

Query: 61   YGFDQNVDEPCVY---KKIVNNTVAFLILYVDDILL-IGKEIVWNRRNRTLA-------- 120
              F  +  + C+Y   K  +N  + +++LYVDD+++  G     N   R L         
Sbjct: 1061 CEFVNSSVDRCIYILDKGNINENI-YVLLYVDDVVIATGDMTRMNNFKRYLMEKFRMTDL 1120

Query: 121  -------------------MSQASYIDKMLSRYKMKNFKKGLLPFRHVVH---LSKNQCP 180
                               +SQ++Y+ K+LS++ M+N      P    ++   L+ ++  
Sbjct: 1121 NEIKHFIGIRIEMQEDKIYLSQSAYVKKILSKFNMENCNAVSTPLPSKINYELLNSDEDC 1180

Query: 181  KTPQEVEDMRRISYASAVGSLMYAMLCTRPDICYAVGIVNRYQSNPRLDHWTTVKTILKY 211
             TP            S +G LMY MLCTRPD+  AV I++RY S    + W  +K +L+Y
Sbjct: 1181 NTP----------CRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRVLRY 1231

BLAST of Tan0011974 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 119.0 bits (297), Expect = 6.4e-26
Identity = 72/238 (30.25%), Positives = 111/238 (46.64%), Query Frame = 0

Query: 1    MDVKTAFLNGKLDETIYMDQPKGFITQGQEQKVCPLHRSIYGLKQASRSWNIRFDEAIKS 60
            +DV  AFL G L + +YM QP GF+ + +   VC L ++IYGLKQA R+W +     + +
Sbjct: 1047 LDVNNAFLQGTLTDEVYMSQPPGFVDKDRPDYVCRLRKAIYGLKQAPRAWYVELRTYLLT 1106

Query: 61   YGFDQNVDEPCVYKKIVNNTVAFLILYVDDILLIGKEIVW-------------------- 120
             GF  ++ +  ++      ++ ++++YVDDIL+ G + V                     
Sbjct: 1107 VGFVNSISDTSLFVLQRGRSIIYMLVYVDDILITGNDTVLLKHTLDALSQRFSVKEHEDL 1166

Query: 121  --------NRRNRTLAMSQASYIDKMLSRYKMKNFKKGLLPFRHVVHLSKNQCPKTPQEV 180
                     R  + L +SQ  Y   +L+R  M   K    P      L+ +   K P   
Sbjct: 1167 HYFLGIEAKRVPQGLHLSQRRYTLDLLARTNMLTAKPVATPMATSPKLTLHSGTKLPDPT 1226

Query: 181  EDMRRISYASAVGSLMYAMLCTRPDICYAVGIVNRYQSNPRLDHWTTVKTILKYLRRT 211
            E      Y   VGSL Y +  TRPD+ YAV  +++Y   P  DHW  +K +L+YL  T
Sbjct: 1227 E------YRGIVGSLQY-LAFTRPDLSYAVNRLSQYMHMPTDDHWNALKRVLRYLAGT 1277

BLAST of Tan0011974 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 111.3 bits (277), Expect = 1.3e-23
Identity = 74/240 (30.83%), Positives = 109/240 (45.42%), Query Frame = 0

Query: 1    MDVKTAFLNGKLDETIYMDQPKGFITQGQEQKVCPLHRSIYGLKQASRSWNIRFDEAIKS 60
            +DV  AFL G L + +YM QP GFI + +   VC L +++YGLKQA R+W +     + +
Sbjct: 1064 LDVNNAFLQGTLTDDVYMSQPPGFIDKDRPNYVCKLRKALYGLKQAPRAWYVELRNYLLT 1123

Query: 61   YGFDQNVDEPCVYKKIVNNTVAFLILYVDDILLIGKE----------------------- 120
             GF  +V +  ++      ++ ++++YVDDIL+ G +                       
Sbjct: 1124 IGFVNSVSDTSLFVLQRGKSIVYMLVYVDDILITGNDPTLLHNTLDNLSQRFSVKDHEEL 1183

Query: 121  -----IVWNRRNRTLAMSQASYIDKMLSRYKMKNFKKGLLPFRHVVHLSKNQCPKTPQEV 180
                 I   R    L +SQ  YI  +L+R  M   K    P      LS     K     
Sbjct: 1184 HYFLGIEAKRVPTGLHLSQRRYILDLLARTNMITAKPVTTPMAPSPKLSLYSGTKLTDPT 1243

Query: 181  EDMRRISYASAVGSLMYAMLCTRPDICYAVGIVNRYQSNPRLDHWTTVKTILKYLRRTRN 213
            E      Y   VGSL Y +  TRPDI YAV  ++++   P  +H   +K IL+YL  T N
Sbjct: 1244 E------YRGIVGSLQY-LAFTRPDISYAVNRLSQFMHMPTEEHLQALKRILRYLAGTPN 1296

BLAST of Tan0011974 vs. ExPASy Swiss-Prot
Match: P25600 (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY5A PE=5 SV=2)

HSP 1 Score: 106.7 bits (265), Expect = 3.3e-22
Identity = 70/241 (29.05%), Positives = 112/241 (46.47%), Query Frame = 0

Query: 1   MDVKTAFLNGKLDETIYMDQPKGFITQGQEQKVCPLHRSIYGLKQASRSWNIRFDEAIKS 60
           MDV TAFLN  +DE IY+ QP GF+ +     V  L+  +YGLKQA   WN   +  +K 
Sbjct: 1   MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60

Query: 61  YGFDQNVDEPCVYKKIVNNTVAFLILYVDDILL----------IGKEI------------ 120
            GF ++  E  +Y +  ++   ++ +YVDD+L+          + +E+            
Sbjct: 61  IGFCRHEGEHGLYFRSTSDGPIYIAVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGKV 120

Query: 121 -------VWNRRNRTLAMSQASYIDKMLSRYKMKNFKKGLLPFRHVVHLSKNQCPKTPQE 180
                  +    N  + +S   YI K  S  ++  FK    P  +    SK     T   
Sbjct: 121 DKFLGLNIHQSSNGDITLSLQDYIAKAASESEINTFKLTQTPLCN----SKPLFETTSPH 180

Query: 181 VEDMRRISYASAVGSLMYAMLCTRPDICYAVGIVNRYQSNPRLDHWTTVKTILKYLRRTR 213
           ++D+    Y S VG L++     RPDI Y V +++R+   PR  H  + + +L+YL  TR
Sbjct: 181 LKDI--TPYQSIVGQLLFCANTGRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYTTR 235

BLAST of Tan0011974 vs. NCBI nr
Match: ADJ18449.1 (gag/pol protein, partial [Bryonia dioica])

HSP 1 Score: 351.7 bits (901), Expect = 4.5e-93
Identity = 177/242 (73.14%), Positives = 192/242 (79.34%), Query Frame = 0

Query: 1    MDVKTAFLNGKLDETIYMDQPKGFITQGQEQKVCPLHRSIYGLKQASRSWNIRFDEAIKS 60
            MDVKTAFLNG L+E+IYM QP+GFI Q QEQKVC L +SIYGLKQASRSWNIRFD AIKS
Sbjct: 915  MDVKTAFLNGNLEESIYMVQPEGFIAQDQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKS 974

Query: 61   YGFDQNVDEPCVYKKIVNNTVAFLILYVDDILLIGK------------------------ 120
            YGF+QNVDEPCVYKKIVN+ VAFLILYVDDILLIG                         
Sbjct: 975  YGFEQNVDEPCVYKKIVNSVVAFLILYVDDILLIGNDVEYLTDVKKWLNTQFQMKDLGEA 1034

Query: 121  ------EIVWNRRNRTLAMSQASYIDKMLSRYKMKNFKKGLLPFRHVVHLSKNQCPKTPQ 180
                  +IV NR+N+TLAMSQASYIDK+LSRYKM+N KKG LPFRH +HLSK QCPKTPQ
Sbjct: 1035 QYILGIQIVRNRKNKTLAMSQASYIDKVLSRYKMQNSKKGQLPFRHGIHLSKEQCPKTPQ 1094

Query: 181  EVEDMRRISYASAVGSLMYAMLCTRPDICYAVGIVNRYQSNPRLDHWTTVKTILKYLRRT 213
            EVEDMR I Y+SAVGSLMYAMLCTRPDICY+VGIV+RYQSNP  DHWT VK ILKYLRRT
Sbjct: 1095 EVEDMRNIPYSSAVGSLMYAMLCTRPDICYSVGIVSRYQSNPGRDHWTAVKNILKYLRRT 1154

BLAST of Tan0011974 vs. NCBI nr
Match: KAA0035907.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 347.4 bits (890), Expect = 8.5e-92
Identity = 175/242 (72.31%), Positives = 192/242 (79.34%), Query Frame = 0

Query: 1    MDVKTAFLNGKLDETIYMDQPKGFITQGQEQKVCPLHRSIYGLKQASRSWNIRFDEAIKS 60
            MDVKTAFLNG L+E+I+M QP+GFITQGQEQKVC L+RSIYGLKQASRSWNIRFD AIKS
Sbjct: 823  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 882

Query: 61   YGFDQNVDEPCVYKKIVNNTVAFLILYVDDILLIGK------------------------ 120
            YGFDQNVDEPCVYKKI    VAFL+LYVDDILLIG                         
Sbjct: 883  YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEG 942

Query: 121  ------EIVWNRRNRTLAMSQASYIDKMLSRYKMKNFKKGLLPFRHVVHLSKNQCPKTPQ 180
                  +I+ +R+N+TLA+SQA+YIDK+L RY M+N KKGLLPFRH VHLSK Q PKTPQ
Sbjct: 943  QYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQ 1002

Query: 181  EVEDMRRISYASAVGSLMYAMLCTRPDICYAVGIVNRYQSNPRLDHWTTVKTILKYLRRT 213
            EVEDMRRI YASAVGSLMYAMLCTRPDICYAVGIV+RYQSNP LDHWT VK ILKYLRRT
Sbjct: 1003 EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRT 1062

BLAST of Tan0011974 vs. NCBI nr
Match: KAA0025945.1 (gag/pol protein [Cucumis melo var. makuwa] >KAA0026303.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0035786.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0040492.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0041262.1 gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 347.1 bits (889), Expect = 1.1e-91
Identity = 174/242 (71.90%), Positives = 192/242 (79.34%), Query Frame = 0

Query: 1    MDVKTAFLNGKLDETIYMDQPKGFITQGQEQKVCPLHRSIYGLKQASRSWNIRFDEAIKS 60
            MDVKTAFLNG L+E+I+M QP+GFITQGQEQKVC L+RSIYGLKQASRSWNIRFD AIKS
Sbjct: 823  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 882

Query: 61   YGFDQNVDEPCVYKKIVNNTVAFLILYVDDILLIGK------------------------ 120
            YGFDQNVDEPCVYKKI    VAFL+LYVDDILLIG                         
Sbjct: 883  YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEA 942

Query: 121  ------EIVWNRRNRTLAMSQASYIDKMLSRYKMKNFKKGLLPFRHVVHLSKNQCPKTPQ 180
                  +I+ +R+N+TLA+SQA+YIDK+L RY M+N KKGLLPFRH VHLSK Q PKTPQ
Sbjct: 943  QYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQ 1002

Query: 181  EVEDMRRISYASAVGSLMYAMLCTRPDICYAVGIVNRYQSNPRLDHWTTVKTILKYLRRT 213
            EVEDMRRI YASAVGSLMYAMLCTRPDICYAVGIV+RYQSNP LDHWT VK +LKYLRRT
Sbjct: 1003 EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRT 1062

BLAST of Tan0011974 vs. NCBI nr
Match: KAA0059226.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 347.1 bits (889), Expect = 1.1e-91
Identity = 174/242 (71.90%), Positives = 192/242 (79.34%), Query Frame = 0

Query: 1   MDVKTAFLNGKLDETIYMDQPKGFITQGQEQKVCPLHRSIYGLKQASRSWNIRFDEAIKS 60
           MDVKTAFLNG L+E+I+M QP+GFITQGQEQKVC L+RSIYGLKQASRSWNIRFD AIKS
Sbjct: 697 MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 756

Query: 61  YGFDQNVDEPCVYKKIVNNTVAFLILYVDDILLIGK------------------------ 120
           YGFDQNVDEPCVYKKI    VAFL+LYVDDILLIG                         
Sbjct: 757 YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEA 816

Query: 121 ------EIVWNRRNRTLAMSQASYIDKMLSRYKMKNFKKGLLPFRHVVHLSKNQCPKTPQ 180
                 +I+ +R+N+TLA+SQA+YIDK+L RY M+N KKGLLPFRH VHLSK Q PKTPQ
Sbjct: 817 QYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQ 876

Query: 181 EVEDMRRISYASAVGSLMYAMLCTRPDICYAVGIVNRYQSNPRLDHWTTVKTILKYLRRT 213
           EVEDMRRI YASAVGSLMYAMLCTRPDICYAVGIV+RYQSNP LDHWT VK +LKYLRRT
Sbjct: 877 EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRT 936

BLAST of Tan0011974 vs. NCBI nr
Match: KAA0059556.1 (gag/pol protein [Cucumis melo var. makuwa] >TYK29968.1 gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 346.3 bits (887), Expect = 1.9e-91
Identity = 173/242 (71.49%), Positives = 191/242 (78.93%), Query Frame = 0

Query: 1   MDVKTAFLNGKLDETIYMDQPKGFITQGQEQKVCPLHRSIYGLKQASRSWNIRFDEAIKS 60
           MDVKTAFLN  L+E+I+M QP+GFITQGQEQKVC L+RSIYGLKQASRSWNIRFD AIK 
Sbjct: 165 MDVKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKC 224

Query: 61  YGFDQNVDEPCVYKKIVNNTVAFLILYVDDILLIGK------------------------ 120
           YGFDQNVDEPCVYKKI    VAFL+LY+DDILLIG                         
Sbjct: 225 YGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIGNDVGYLTDVKAWLAAQFQMKDVGEA 284

Query: 121 ------EIVWNRRNRTLAMSQASYIDKMLSRYKMKNFKKGLLPFRHVVHLSKNQCPKTPQ 180
                 +I+ +R+N+TLA+SQA+YIDKML RY M+N KKGLLPFRH VHLSK QCPKTPQ
Sbjct: 285 QYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQ 344

Query: 181 EVEDMRRISYASAVGSLMYAMLCTRPDICYAVGIVNRYQSNPRLDHWTTVKTILKYLRRT 213
           EVEDMRRI YASAVGSLMYAMLCTRPDICYAVGIV+RYQSNP LDHWT VK ILKYL+RT
Sbjct: 345 EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPELDHWTAVKIILKYLKRT 404

BLAST of Tan0011974 vs. ExPASy TrEMBL
Match: E2GK51 (Gag/pol protein (Fragment) OS=Bryonia dioica OX=3652 PE=4 SV=1)

HSP 1 Score: 351.7 bits (901), Expect = 2.2e-93
Identity = 177/242 (73.14%), Positives = 192/242 (79.34%), Query Frame = 0

Query: 1    MDVKTAFLNGKLDETIYMDQPKGFITQGQEQKVCPLHRSIYGLKQASRSWNIRFDEAIKS 60
            MDVKTAFLNG L+E+IYM QP+GFI Q QEQKVC L +SIYGLKQASRSWNIRFD AIKS
Sbjct: 915  MDVKTAFLNGNLEESIYMVQPEGFIAQDQEQKVCKLQKSIYGLKQASRSWNIRFDTAIKS 974

Query: 61   YGFDQNVDEPCVYKKIVNNTVAFLILYVDDILLIGK------------------------ 120
            YGF+QNVDEPCVYKKIVN+ VAFLILYVDDILLIG                         
Sbjct: 975  YGFEQNVDEPCVYKKIVNSVVAFLILYVDDILLIGNDVEYLTDVKKWLNTQFQMKDLGEA 1034

Query: 121  ------EIVWNRRNRTLAMSQASYIDKMLSRYKMKNFKKGLLPFRHVVHLSKNQCPKTPQ 180
                  +IV NR+N+TLAMSQASYIDK+LSRYKM+N KKG LPFRH +HLSK QCPKTPQ
Sbjct: 1035 QYILGIQIVRNRKNKTLAMSQASYIDKVLSRYKMQNSKKGQLPFRHGIHLSKEQCPKTPQ 1094

Query: 181  EVEDMRRISYASAVGSLMYAMLCTRPDICYAVGIVNRYQSNPRLDHWTTVKTILKYLRRT 213
            EVEDMR I Y+SAVGSLMYAMLCTRPDICY+VGIV+RYQSNP  DHWT VK ILKYLRRT
Sbjct: 1095 EVEDMRNIPYSSAVGSLMYAMLCTRPDICYSVGIVSRYQSNPGRDHWTAVKNILKYLRRT 1154

BLAST of Tan0011974 vs. ExPASy TrEMBL
Match: A0A5A7T2V9 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold56G00760 PE=4 SV=1)

HSP 1 Score: 347.4 bits (890), Expect = 4.1e-92
Identity = 175/242 (72.31%), Positives = 192/242 (79.34%), Query Frame = 0

Query: 1    MDVKTAFLNGKLDETIYMDQPKGFITQGQEQKVCPLHRSIYGLKQASRSWNIRFDEAIKS 60
            MDVKTAFLNG L+E+I+M QP+GFITQGQEQKVC L+RSIYGLKQASRSWNIRFD AIKS
Sbjct: 823  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 882

Query: 61   YGFDQNVDEPCVYKKIVNNTVAFLILYVDDILLIGK------------------------ 120
            YGFDQNVDEPCVYKKI    VAFL+LYVDDILLIG                         
Sbjct: 883  YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEG 942

Query: 121  ------EIVWNRRNRTLAMSQASYIDKMLSRYKMKNFKKGLLPFRHVVHLSKNQCPKTPQ 180
                  +I+ +R+N+TLA+SQA+YIDK+L RY M+N KKGLLPFRH VHLSK Q PKTPQ
Sbjct: 943  QYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQ 1002

Query: 181  EVEDMRRISYASAVGSLMYAMLCTRPDICYAVGIVNRYQSNPRLDHWTTVKTILKYLRRT 213
            EVEDMRRI YASAVGSLMYAMLCTRPDICYAVGIV+RYQSNP LDHWT VK ILKYLRRT
Sbjct: 1003 EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRT 1062

BLAST of Tan0011974 vs. ExPASy TrEMBL
Match: A0A5A7TZD0 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1163G00090 PE=4 SV=1)

HSP 1 Score: 347.1 bits (889), Expect = 5.4e-92
Identity = 174/242 (71.90%), Positives = 192/242 (79.34%), Query Frame = 0

Query: 1    MDVKTAFLNGKLDETIYMDQPKGFITQGQEQKVCPLHRSIYGLKQASRSWNIRFDEAIKS 60
            MDVKTAFLNG L+E+I+M QP+GFITQGQEQKVC L+RSIYGLKQASRSWNIRFD AIKS
Sbjct: 823  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 882

Query: 61   YGFDQNVDEPCVYKKIVNNTVAFLILYVDDILLIGK------------------------ 120
            YGFDQNVDEPCVYKKI    VAFL+LYVDDILLIG                         
Sbjct: 883  YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEA 942

Query: 121  ------EIVWNRRNRTLAMSQASYIDKMLSRYKMKNFKKGLLPFRHVVHLSKNQCPKTPQ 180
                  +I+ +R+N+TLA+SQA+YIDK+L RY M+N KKGLLPFRH VHLSK Q PKTPQ
Sbjct: 943  QYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQ 1002

Query: 181  EVEDMRRISYASAVGSLMYAMLCTRPDICYAVGIVNRYQSNPRLDHWTTVKTILKYLRRT 213
            EVEDMRRI YASAVGSLMYAMLCTRPDICYAVGIV+RYQSNP LDHWT VK +LKYLRRT
Sbjct: 1003 EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRT 1062

BLAST of Tan0011974 vs. ExPASy TrEMBL
Match: A0A5A7UYE8 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold430G001570 PE=4 SV=1)

HSP 1 Score: 347.1 bits (889), Expect = 5.4e-92
Identity = 174/242 (71.90%), Positives = 192/242 (79.34%), Query Frame = 0

Query: 1   MDVKTAFLNGKLDETIYMDQPKGFITQGQEQKVCPLHRSIYGLKQASRSWNIRFDEAIKS 60
           MDVKTAFLNG L+E+I+M QP+GFITQGQEQKVC L+RSIYGLKQASRSWNIRFD AIKS
Sbjct: 697 MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 756

Query: 61  YGFDQNVDEPCVYKKIVNNTVAFLILYVDDILLIGK------------------------ 120
           YGFDQNVDEPCVYKKI    VAFL+LYVDDILLIG                         
Sbjct: 757 YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEA 816

Query: 121 ------EIVWNRRNRTLAMSQASYIDKMLSRYKMKNFKKGLLPFRHVVHLSKNQCPKTPQ 180
                 +I+ +R+N+TLA+SQA+YIDK+L RY M+N KKGLLPFRH VHLSK Q PKTPQ
Sbjct: 817 QYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQ 876

Query: 181 EVEDMRRISYASAVGSLMYAMLCTRPDICYAVGIVNRYQSNPRLDHWTTVKTILKYLRRT 213
           EVEDMRRI YASAVGSLMYAMLCTRPDICYAVGIV+RYQSNP LDHWT VK +LKYLRRT
Sbjct: 877 EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRT 936

BLAST of Tan0011974 vs. ExPASy TrEMBL
Match: A0A5A7UZF3 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1449G00020 PE=4 SV=1)

HSP 1 Score: 346.3 bits (887), Expect = 9.2e-92
Identity = 173/242 (71.49%), Positives = 191/242 (78.93%), Query Frame = 0

Query: 1   MDVKTAFLNGKLDETIYMDQPKGFITQGQEQKVCPLHRSIYGLKQASRSWNIRFDEAIKS 60
           MDVKTAFLN  L+E+I+M QP+GFITQGQEQKVC L+RSIYGLKQASRSWNIRFD AIK 
Sbjct: 165 MDVKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKC 224

Query: 61  YGFDQNVDEPCVYKKIVNNTVAFLILYVDDILLIGK------------------------ 120
           YGFDQNVDEPCVYKKI    VAFL+LY+DDILLIG                         
Sbjct: 225 YGFDQNVDEPCVYKKINKGKVAFLVLYMDDILLIGNDVGYLTDVKAWLAAQFQMKDVGEA 284

Query: 121 ------EIVWNRRNRTLAMSQASYIDKMLSRYKMKNFKKGLLPFRHVVHLSKNQCPKTPQ 180
                 +I+ +R+N+TLA+SQA+YIDKML RY M+N KKGLLPFRH VHLSK QCPKTPQ
Sbjct: 285 QYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQ 344

Query: 181 EVEDMRRISYASAVGSLMYAMLCTRPDICYAVGIVNRYQSNPRLDHWTTVKTILKYLRRT 213
           EVEDMRRI YASAVGSLMYAMLCTRPDICYAVGIV+RYQSNP LDHWT VK ILKYL+RT
Sbjct: 345 EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPELDHWTAVKIILKYLKRT 404

BLAST of Tan0011974 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 95.1 bits (235), Expect = 7.1e-20
Identity = 73/244 (29.92%), Positives = 109/244 (44.67%), Query Frame = 0

Query: 1   MDVKTAFLNGKLDETIYMDQPKGFIT-QGQE---QKVCPLHRSIYGLKQASRSWNIRFDE 60
           +D+  AFLNG LDE IYM  P G+   QG       VC L +SIYGLKQASR W ++F  
Sbjct: 193 LDISNAFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSV 252

Query: 61  AIKSYGFDQNVDEPCVYKKIVNNTVAFLILYVDDILL----------------------- 120
            +  +GF Q+  +   + KI       +++YVDDI++                       
Sbjct: 253 TLIGFGFVQSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRD 312

Query: 121 -------IGKEIVWNRRNRTLAMSQASYIDKMLSRYKMKNFKKGLLPFRHVVHLSKNQCP 180
                  +G EI   R    + + Q  Y   +L    +   K   +P    V  S +   
Sbjct: 313 LGPLKYFLGLEIA--RSAAGINICQRKYALDLLDETGLLGCKPSSVPMDPSVTFSAH--- 372

Query: 181 KTPQEVEDMRRISYASAVGSLMYAMLCTRPDICYAVGIVNRYQSNPRLDHWTTVKTILKY 211
            +  +  D +  +Y   +G LMY  + TR DI +AV  ++++   PRL H   V  IL Y
Sbjct: 373 -SGGDFVDAK--AYRRLIGRLMYLQI-TRLDISFAVNKLSQFSEAPRLAHQQAVMKILHY 427

BLAST of Tan0011974 vs. TAIR 10
Match: ATMG00810.1 (DNA/RNA polymerases superfamily protein)

HSP 1 Score: 47.0 bits (110), Expect = 2.2e-05
Identity = 44/156 (28.21%), Positives = 64/156 (41.03%), Query Frame = 0

Query: 83  FLILYVDDILLIGKE----------------------------IVWNRRNRTLAMSQASY 142
           +L+LYVDDILL G                              I        L +SQ  Y
Sbjct: 2   YLLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKY 61

Query: 143 IDKMLSRYKMKNFKKGLLPFRHVVHLSKNQCPKTPQEVEDMRRISYASAVGSLMYAMLCT 202
            +++L+   M + K    P    + L  N    T +  +      + S VG+L Y  L T
Sbjct: 62  AEQILNNAGMLDCK----PMSTPLPLKLNSSVSTAKYPDPS---DFRSIVGALQYLTL-T 121

Query: 203 RPDICYAVGIVNRYQSNPRLDHWTTVKTILKYLRRT 211
           RPDI YAV IV +    P L  +  +K +L+Y++ T
Sbjct: 122 RPDISYAVNIVCQRMHEPTLADFDLLKRVLRYVKGT 149

The following BLAST results are available for this feature:
Match Name     E-value   Identity  Description
P10978         6.2e-53   44.40     Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
P04146         4.5e-27   33.61     Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
Q9ZT94         6.4e-26   30.25     Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
Q94HW2         1.3e-23   30.83     Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
P25600         3.3e-22   29.05     Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT...

Match Name     E-value   Identity  Description
ADJ18449.1     4.5e-93   73.14     gag/pol protein, partial [Bryonia dioica]
KAA0035907.1   8.5e-92   72.31     gag/pol protein [Cucumis melo var. makuwa]
KAA0025945.1   1.1e-91   71.90     gag/pol protein [Cucumis melo var. makuwa] >KAA0026303.1 gag/pol protein [Cucumi...
KAA0059226.1   1.1e-91   71.90     gag/pol protein [Cucumis melo var. makuwa]
KAA0059556.1   1.9e-91   71.49     gag/pol protein [Cucumis melo var. makuwa] >TYK29968.1 gag/pol protein [Cucumis ...

Match Name     E-value   Identity  Description
E2GK51         2.2e-93   73.14     Gag/pol protein (Fragment) OS=Bryonia dioica OX=3652 PE=4 SV=1
A0A5A7T2V9     4.1e-92   72.31     Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold56G00760...
A0A5A7TZD0     5.4e-92   71.90     Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1163G000...
A0A5A7UYE8     5.4e-92   71.90     Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold430G0015...
A0A5A7UZF3     9.2e-92   71.49     Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1449G000...

Match Name     E-value   Identity  Description
AT4G23160.1    7.1e-20   29.92     cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00810.1    2.2e-05   28.21     DNA/RNA polymerases superfamily protein
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                                      Source       Source Term  Source Description   Alignment
IPR013103  Reverse transcriptase, RNA-dependent DNA polymerase  PFAM         PF07727      RVT_2                coord: 1..99; e-value: 3.3E-27; score: 95.6
None       No IPR available                                     PANTHER      PTHR45895    FAMILY NOT NAMED     coord: 1..94; coord: 94..210
IPR043502  DNA/RNA polymerase superfamily                       SUPERFAMILY  56672        DNA/RNA polymerases  coord: 1..97

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Tan0011974.1   Tan0011974.1   mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0008270      zinc ion binding