Tan0007534 (gene) Snake gourd v1

Overview
Name: Tan0007534
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Location: LG04: 18150410 .. 18151214 (-)
RNA-Seq Expression: Tan0007534
Synteny: Tan0007534
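The Location field packs the linkage group, the start and end coordinates, and the strand into one string. The sketch below unpacks it and derives the genomic span. The function name `parse_location` is hypothetical, and the assumption that the coordinates are 1-based and inclusive is not stated on this page.

```python
import re

def parse_location(text):
    """Split a location string such as 'LG04: 18150410 .. 18151214 (-)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based, inclusive coordinates, so span = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("LG04: 18150410 .. 18151214 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that coordinate assumption the feature spans 805 bases on the minus strand of LG04.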
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAAGATTTGGGAGAAGCTCAGTATGTTCTAGGTATCCAGATTGTCCGGAACCGGAAGAACAGAACGTTGGCCATGTCTCAAACGTCTTATATTGATAAGATATTGTCTAGATATAAGATGCAGAACTCCAAGAACGGCTTACTGCCTTTCAGGCATGGGGTTCACCTGTCTAAGGATCAGTGTCCTAAGACTCCTCAAGAGGTTGAGGATATGAGATGTATCCCCTATGCTTCAGCTGTAGGGAGCCTGATGTATGTCCTGTTGTATACTAGGTCTGACATCTGTTATGCAATAGGGATTGTAAGTAGGTATCAATCCAATCCAGGATTAGATCACTGGACAACCGTAAAGGCAATCCTCAAGTATCTTAGGAGAACGAGGAACTACAGCTTAGTGTATGGAAGTGGGGATTTGTTCCTTACAGGATACACAGATTCTGACTTTCAGACCGATAAGGATTCTAGGAAATCCACTTCGGGGTCAGCCTTCATTCTAAAATGAAGGAGCTGTAGTATGGCGAAGCATCAAGCAGGGATGCATCGCTGATTCCACTATGGAAGCAGAGTACGTTGCGGCTTGTGAAGCTGCAAAGGAAGTTGTTTGGCTTAGAAAGTTCATAACCGATTTGGAAGTTGTTCCAAATATGAATTTGTCGATCGCACTGTTTTGTGACAACAGTGGTGCAGTAGCCAACTCCAGAGAGCCTCGGAGCCATAAGAGAGGCAAACACATGGAGCGGAAGTATCACCTAATACGGGAGATTGTGCACCGCATGGACGCGTGGCGATCCACGCGCAGATAG

mRNA sequence

ATGAAAGATTTGGGAGAAGCTCAGTATGTTCTAGGTATCCAGATTGTCCGGAACCGGAAGAACAGAACGTTGGCCATGTCTCAAACGTCTTATATTGATAAGATATTGTCTAGATATAAGATGCAGAACTCCAAGAACGGCTTACTGCCTTTCAGGCATGGGGTTCACCTGTCTAAGGATCAGTGTCCTAAGACTCCTCAAGAGGTTGAGGATATGAGATGTATCCCCTATGCTTCAGCTGTAGGGAGCCTGATGTATGTCCTGTTGTATACTAGGTCTGACATCTGTTATGCAATAGGGATTGTAAGTAGGTATCAATCCAATCCAGGATTAGATCACTGGACAACCGTAAAGGCAATCCTCAAGTATCTTAGGAGAACGAGGAACTACAGCTTAGTGTATGGAAGTGGGGATTTGTTCCTTACAGGATACACAGATTCTGACTTTCAGACCGATAAGGATTCTAGGAAATCCACTTCGGGCATCAAGCAGGGATGCATCGCTGATTCCACTATGGAAGCAGAGTACGTTGCGGCTTGTGAAGCTGCAAAGGAAGTTGTTTGGCTTAGAAAGTTCATAACCGATTTGGAAGTTGTTCCAAATATGAATTTGTCGATCGCACTGTTTTGTGACAACAGTGGTGCAGTAGCCAACTCCAGAGAGCCTCGGAGCCATAAGAGAGGCAAACACATGGAGCGGAAGTATCACCTAATACGGGAGATTGTGCACCGCATGGACGCGTGGCGATCCACGCGCAGATAG
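The gene sequence and the mRNA sequence differ only by the spliced-out intron, so the intron can be recovered by comparing the two strings. The sketch below assumes exactly one intron and no other differences; `infer_single_intron` is a hypothetical helper, and the toy sequences stand in for the full records above. When the first bases of the intron repeat at the start of the following exon, the reported boundary may be shifted by a few bases relative to the true splice site.

```python
def infer_single_intron(gene, mrna):
    """Return (start, end, intron) for the one segment of `gene` absent from `mrna`.

    start and end are 0-based, end-exclusive offsets into `gene`.
    """
    extra = len(gene) - len(mrna)
    if extra <= 0:
        raise ValueError("gene must be longer than mRNA")
    # The shared prefix ends where the first exon stops matching.
    prefix = 0
    while prefix < len(mrna) and gene[prefix] == mrna[prefix]:
        prefix += 1
    # With a single intron, the rest of the gene after the gap must equal the mRNA tail.
    if gene[prefix + extra:] != mrna[prefix:]:
        raise ValueError("sequences differ by more than one contiguous segment")
    return prefix, prefix + extra, gene[prefix:prefix + extra]

# Toy example: exon "ATGCC", intron "GTAAGTTTCAG", exon "TTTAA".
toy_gene = "ATGCC" + "GTAAGTTTCAG" + "TTTAA"
toy_mrna = "ATGCC" + "TTTAA"
print(infer_single_intron(toy_gene, toy_mrna))
```

Passing the full gene and mRNA strings from this record in place of the toy inputs would report the intron offsets for Tan0007534.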

Coding sequence (CDS)

ATGAAAGATTTGGGAGAAGCTCAGTATGTTCTAGGTATCCAGATTGTCCGGAACCGGAAGAACAGAACGTTGGCCATGTCTCAAACGTCTTATATTGATAAGATATTGTCTAGATATAAGATGCAGAACTCCAAGAACGGCTTACTGCCTTTCAGGCATGGGGTTCACCTGTCTAAGGATCAGTGTCCTAAGACTCCTCAAGAGGTTGAGGATATGAGATGTATCCCCTATGCTTCAGCTGTAGGGAGCCTGATGTATGTCCTGTTGTATACTAGGTCTGACATCTGTTATGCAATAGGGATTGTAAGTAGGTATCAATCCAATCCAGGATTAGATCACTGGACAACCGTAAAGGCAATCCTCAAGTATCTTAGGAGAACGAGGAACTACAGCTTAGTGTATGGAAGTGGGGATTTGTTCCTTACAGGATACACAGATTCTGACTTTCAGACCGATAAGGATTCTAGGAAATCCACTTCGGGCATCAAGCAGGGATGCATCGCTGATTCCACTATGGAAGCAGAGTACGTTGCGGCTTGTGAAGCTGCAAAGGAAGTTGTTTGGCTTAGAAAGTTCATAACCGATTTGGAAGTTGTTCCAAATATGAATTTGTCGATCGCACTGTTTTGTGACAACAGTGGTGCAGTAGCCAACTCCAGAGAGCCTCGGAGCCATAAGAGAGGCAAACACATGGAGCGGAAGTATCACCTAATACGGGAGATTGTGCACCGCATGGACGCGTGGCGATCCACGCGCAGATAG

Protein sequence

MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIGIVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSGIKQGCIADSTMEAEYVAACEAAKEVVWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIREIVHRMDAWRSTRR
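The protein sequence is the translation of the CDS. As a minimal sketch, the code below translates a CDS with the standard nuclear genetic code, which is assumed here rather than stated on this page; `translate` and `CODON_TABLE` are illustrative names. Only the first and last few codons of the CDS above are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGAAAGATTTGGGAGAAGCT"))  # first seven codons of the CDS
print(translate("ACGCGCAGATAG"))           # last three codons plus the stop
```

The two calls reproduce the start (MKDLGEA) and the end (TRR) of the protein sequence listed above.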
Homology
BLAST of Tan0007534 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 214.9 bits (546), Expect = 1.0e-54
Identity = 112/256 (43.75%), Positives = 159/256 (62.11%), Query Frame = 0

Query: 1    MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKD 60
            MKDLG AQ +LG++IVR R +R L +SQ  YI+++L R+ M+N+K    P    + LSK 
Sbjct: 1035 MKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLERFNMKNAKPVSTPLAGHLKLSKK 1094

Query: 61   QCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIGIVSRYQSNPGLDHWTTVKAI 120
             CP T +E  +M  +PY+SAVGSLMY ++ TR DI +A+G+VSR+  NPG +HW  VK I
Sbjct: 1095 MCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDIAHAVGVVSRFLENPGKEHWEAVKWI 1154

Query: 121  LKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSGI--------------KQGC 180
            L+YLR T    L +G  D  L GYTD+D   D D+RKS++G                Q C
Sbjct: 1155 LRYLRGTTGDCLCFGGSDPILKGYTDADMAGDIDNRKSSTGYLFTFSGGAISWQSKLQKC 1214

Query: 181  IADSTMEAEYVAACEAAKEVVWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHK 240
            +A ST EAEY+AA E  KE++WL++F+ +L +         ++CD+  A+  S+    H 
Sbjct: 1215 VALSTTEAEYIAATETGKEMIWLKRFLQELGL---HQKEYVVYCDSQSAIDLSKNSMYHA 1274

Query: 241  RGKHMERKYHLIREIV 243
            R KH++ +YH IRE+V
Sbjct: 1275 RTKHIDVRYHWIREMV 1287
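The percentages in each HSP header are ratios over the alignment length (gaps included): identical positions for the identity value, identical plus conservatively substituted positions for the positives value. The sketch below recomputes them from the header of the hit above; `hsp_percentages` is a hypothetical helper, not part of any BLAST tool.

```python
import re

def hsp_percentages(header):
    """Return [(count, alignment_length, percent), ...] for each N/M pair in an HSP header."""
    pairs = re.findall(r"(\d+)/(\d+)", header)
    return [(int(n), int(m), round(100 * int(n) / int(m), 2)) for n, m in pairs]

header = "Identity = 112/256 (43.75%), Positives = 159/256 (62.11%), Query Frame = 0"
for count, length, pct in hsp_percentages(header):
    print(f"{count}/{length} = {pct}%")
```

The first tuple is the identity ratio and the second is the positives ratio, both over the same 256-column alignment.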

BLAST of Tan0007534 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 135.6 bits (340), Expect = 7.9e-31
Identity = 84/263 (31.94%), Positives = 137/263 (52.09%), Query Frame = 0

Query: 1    MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVH---L 60
            M DL E ++ +GI+I    +   + +SQ++Y+ KILS++ M+N      P    ++   L
Sbjct: 1114 MTDLNEIKHFIGIRI--EMQEDKIYLSQSAYVKKILSKFNMENCNAVSTPLPSKINYELL 1173

Query: 61   SKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIGIVSRYQSNPGLDHWTTV 120
            + D+   T          P  S +G LMY++L TR D+  A+ I+SRY S    + W  +
Sbjct: 1174 NSDEDCNT----------PCRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNL 1233

Query: 121  KAILKYLRRTRNYSLVYGSGDLF---LTGYTDSDFQTDKDSRKSTSGI------------ 180
            K +L+YL+ T +  L++     F   + GY DSD+   +  RKST+G             
Sbjct: 1234 KRVLRYLKGTIDMKLIFKKNLAFENKIIGYVDSDWAGSEIDRKSTTGYLFKMFDFNLICW 1293

Query: 181  ---KQGCIADSTMEAEYVAACEAAKEVVWLRKFITDLEVVPNMNLSIALFCDNSGAVANS 240
               +Q  +A S+ EAEY+A  EA +E +WL+  +T + +   +   I ++ DN G ++ +
Sbjct: 1294 NTKRQNSVAASSTEAEYMALFEAVREALWLKFLLTSINI--KLENPIKIYEDNQGCISIA 1353

Query: 241  REPRSHKRGKHMERKYHLIREIV 243
              P  HKR KH++ KYH  RE V
Sbjct: 1354 NNPSCHKRAKHIDIKYHFAREQV 1362

BLAST of Tan0007534 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 109.8 bits (273), Expect = 4.7e-23
Identity = 79/257 (30.74%), Positives = 119/257 (46.30%), Query Frame = 0

Query: 1    MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKD 60
            +K+  +  Y LGI+    R  + L +SQ  Y   +L+R  M  +K    P      L+  
Sbjct: 1160 VKEHEDLHYFLGIE--AKRVPQGLHLSQRRYTLDLLARTNMLTAKPVATPMATSPKLTLH 1219

Query: 61   QCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIGIVSRYQSNPGLDHWTTVKAI 120
               K P   E      Y   VGSL Y L +TR D+ YA+  +S+Y   P  DHW  +K +
Sbjct: 1220 SGTKLPDPTE------YRGIVGSLQY-LAFTRPDLSYAVNRLSQYMHMPTDDHWNALKRV 1279

Query: 121  LKYLRRTRNYSLVYGSGD-LFLTGYTDSDFQTDKDSRKSTSGI--------------KQG 180
            L+YL  T ++ +    G+ L L  Y+D+D+  D D   ST+G               KQ 
Sbjct: 1280 LRYLAGTPDHGIFLKKGNTLSLHAYSDADWAGDTDDYVSTNGYIVYLGHHPISWSSKKQK 1339

Query: 181  CIADSTMEAEYVAACEAAKEVVWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSH 240
             +  S+ EAEY +    + E+ W+   +T+L +   ++    ++CDN GA      P  H
Sbjct: 1340 GVVRSSTEAEYRSVANTSSELQWICSLLTELGI--QLSHPPVIYCDNVGATYLCANPVFH 1399

Query: 241  KRGKHMERKYHLIREIV 243
             R KH+   YH IR  V
Sbjct: 1400 SRMKHIALDYHFIRNQV 1405

BLAST of Tan0007534 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 107.5 bits (267), Expect = 2.3e-22
Identity = 83/257 (32.30%), Positives = 117/257 (45.53%), Query Frame = 0

Query: 1    MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKD 60
            +KD  E  Y LGI+    R    L +SQ  YI  +L+R  M  +K    P      LS  
Sbjct: 1177 VKDHEELHYFLGIE--AKRVPTGLHLSQRRYILDLLARTNMITAKPVTTPMAPSPKLSLY 1236

Query: 61   QCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIGIVSRYQSNPGLDHWTTVKAI 120
               K     E      Y   VGSL Y L +TR DI YA+  +S++   P  +H   +K I
Sbjct: 1237 SGTKLTDPTE------YRGIVGSLQY-LAFTRPDISYAVNRLSQFMHMPTEEHLQALKRI 1296

Query: 121  LKYLRRTRNYSLVYGSGD-LFLTGYTDSDFQTDKDSRKSTSGI--------------KQG 180
            L+YL  T N+ +    G+ L L  Y+D+D+  DKD   ST+G               KQ 
Sbjct: 1297 LRYLAGTPNHGIFLKKGNTLSLHAYSDADWAGDKDDYVSTNGYIVYLGHHPISWSSKKQK 1356

Query: 181  CIADSTMEAEYVAACEAAKEVVWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSH 240
             +  S+ EAEY +    + E+ W+   +T+L +   +     ++CDN GA      P  H
Sbjct: 1357 GVVRSSTEAEYRSVANTSSEMQWICSLLTELGI--RLTRPPVIYCDNVGATYLCANPVFH 1416

Query: 241  KRGKHMERKYHLIREIV 243
             R KH+   YH IR  V
Sbjct: 1417 SRMKHIAIDYHFIRNQV 1422

BLAST of Tan0007534 vs. ExPASy Swiss-Prot
Match: P0CV72 (Secreted RxLR effector protein 161 OS=Plasmopara viticola OX=143451 GN=RXLR161 PE=2 SV=1)

HSP 1 Score: 101.7 bits (252), Expect = 1.3e-20
Identity = 53/133 (39.85%), Positives = 83/133 (62.41%), Query Frame = 0

Query: 72  MRCIPYASAVGSLMYVLLYTRSDICYAIGIVSRYQSNPGLDHWTTVKAILKYLRRTRNYS 131
           M+ +PY SAVG++MY+++ TR D+  A+G++S++ S+P   HW  +K +L+YL+ T+ Y 
Sbjct: 1   MKNVPYLSAVGAIMYLMVVTRPDLAAAVGVLSQFASDPCPTHWQALKRVLRYLQSTQTYG 60

Query: 132 LVY-GSGDLFLTGYTDSDFQTDKDSRKSTSGI--------------KQGCIADSTMEAEY 190
           L +  +G   L GY+D+D+  D +SR+STSG               KQ  +A S+ E EY
Sbjct: 61  LEFTRAGTAKLVGYSDADWAGDVESRRSTSGYLFKLNGGCVSWRSKKQRTVALSSTEDEY 120

BLAST of Tan0007534 vs. NCBI nr
Match: ADJ18449.1 (gag/pol protein, partial [Bryonia dioica])

HSP 1 Score: 427.2 bits (1097), Expect = 1.0e-115
Identity = 210/260 (80.77%), Positives = 228/260 (87.69%), Query Frame = 0

Query: 1    MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKD 60
            MKDLGEAQY+LGIQIVRNRKN+TLAMSQ SYIDK+LSRYKMQNSK G LPFRHG+HLSK+
Sbjct: 1028 MKDLGEAQYILGIQIVRNRKNKTLAMSQASYIDKVLSRYKMQNSKKGQLPFRHGIHLSKE 1087

Query: 61   QCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIGIVSRYQSNPGLDHWTTVKAI 120
            QCPKTPQEVEDMR IPY+SAVGSLMY +L TR DICY++GIVSRYQSNPG DHWT VK I
Sbjct: 1088 QCPKTPQEVEDMRNIPYSSAVGSLMYAMLCTRPDICYSVGIVSRYQSNPGRDHWTAVKNI 1147

Query: 121  LKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSG--------------IKQGC 180
            LKYLRRTRNY LVYG+ DL LTGYTDSDFQ+DKD+RKSTSG              +KQ C
Sbjct: 1148 LKYLRRTRNYMLVYGAKDLILTGYTDSDFQSDKDARKSTSGSVFTLNGGAVVWRSVKQTC 1207

Query: 181  IADSTMEAEYVAACEAAKEVVWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHK 240
            IADSTMEAEYVAACEAAKE VWLRKF+TDLEVVPNM+L I L+CDNSGAVANS+EPRSHK
Sbjct: 1208 IADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPRSHK 1267

Query: 241  RGKHMERKYHLIREIVHRMD 247
            RGKH+ERKYHLIREIVHR D
Sbjct: 1268 RGKHIERKYHLIREIVHRGD 1287

BLAST of Tan0007534 vs. NCBI nr
Match: KAA0061170.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 424.5 bits (1090), Expect = 6.5e-115
Identity = 212/260 (81.54%), Positives = 227/260 (87.31%), Query Frame = 0

Query: 1   MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKD 60
           MKDLGEAQYVLGIQI+R+RKN+TLA+SQ +YIDK+L RY MQNSK GLLPFRHGVHLSK+
Sbjct: 97  MKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKE 156

Query: 61  QCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIGIVSRYQSNPGLDHWTTVKAI 120
           Q PKTPQEVEDMR IPYASAVGSLMY +L TR DICYA+GIVSRYQSNPGLDHWTTVK I
Sbjct: 157 QSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTTVKII 216

Query: 121 LKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSG--------------IKQGC 180
           LKYLRRTR+Y LVYG+ DL LTGYTDSDFQTDKDSRKSTSG              IKQGC
Sbjct: 217 LKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNEGAVVWRSIKQGC 276

Query: 181 IADSTMEAEYVAACEAAKEVVWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHK 240
           IADSTMEAEYVAACEAAKE VWLRKF+ DLEVVPNMNL I L+CDNSGAVANS+EPRSHK
Sbjct: 277 IADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHK 336

Query: 241 RGKHMERKYHLIREIVHRMD 247
           RGKH+ERKYHLIREIV R D
Sbjct: 337 RGKHIERKYHLIREIVQRGD 356

BLAST of Tan0007534 vs. NCBI nr
Match: KAA0042496.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 424.1 bits (1089), Expect = 8.5e-115
Identity = 212/260 (81.54%), Positives = 227/260 (87.31%), Query Frame = 0

Query: 1   MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKD 60
           MKDLGEAQYVLGIQI+R+RKN+TLA+SQ +YIDK+L RY MQNSK GLLPFRHGVHLSK+
Sbjct: 1   MKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKE 60

Query: 61  QCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIGIVSRYQSNPGLDHWTTVKAI 120
           Q PKTPQEVEDMR IPYASAVGSLMYV+L TR DICYA+GIVSRYQSNPGLDHWT VK I
Sbjct: 61  QSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKII 120

Query: 121 LKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSG--------------IKQGC 180
           LKYLRRTR+Y LVYG+ DL LTGYTDSDFQTDKDSRKSTSG              IKQGC
Sbjct: 121 LKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGC 180

Query: 181 IADSTMEAEYVAACEAAKEVVWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHK 240
           IADSTMEAEYVAACEAAKE VWLRKF+ DLEVVPNMNL I L+CDNSGAVANS+EPRSHK
Sbjct: 181 IADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHK 240

Query: 241 RGKHMERKYHLIREIVHRMD 247
           RGKH+ERKYHLIREIV R D
Sbjct: 241 RGKHIERKYHLIREIVQRGD 260

BLAST of Tan0007534 vs. NCBI nr
Match: KAA0025945.1 (gag/pol protein [Cucumis melo var. makuwa] >KAA0026303.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0035786.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0040492.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0041262.1 gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 422.2 bits (1084), Expect = 3.2e-114
Identity = 210/260 (80.77%), Positives = 226/260 (86.92%), Query Frame = 0

Query: 1    MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKD 60
            MKDLGEAQYVLGIQI+R+RKN+TLA+SQ +YIDK+L RY MQNSK GLLPFRHGVHLSK+
Sbjct: 936  MKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKE 995

Query: 61   QCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIGIVSRYQSNPGLDHWTTVKAI 120
            Q PKTPQEVEDMR IPYASAVGSLMY +L TR DICYA+GIVSRYQSNPGLDHWT VK +
Sbjct: 996  QSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIV 1055

Query: 121  LKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSG--------------IKQGC 180
            LKYLRRTR+Y LVYG+ DL LTGYTDSDFQTDKDSRKSTSG              IKQGC
Sbjct: 1056 LKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGC 1115

Query: 181  IADSTMEAEYVAACEAAKEVVWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHK 240
            IADSTMEAEYVAACEAAKE VWLRKF+ DLEVVPNMNL I L+CDNSGAVANS+EPRSHK
Sbjct: 1116 IADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHK 1175

Query: 241  RGKHMERKYHLIREIVHRMD 247
            RGKH+ERKYHLIREIV R D
Sbjct: 1176 RGKHIERKYHLIREIVQRGD 1195

BLAST of Tan0007534 vs. NCBI nr
Match: KAA0059226.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 422.2 bits (1084), Expect = 3.2e-114
Identity = 210/260 (80.77%), Positives = 226/260 (86.92%), Query Frame = 0

Query: 1    MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKD 60
            MKDLGEAQYVLGIQI+R+RKN+TLA+SQ +YIDK+L RY MQNSK GLLPFRHGVHLSK+
Sbjct: 810  MKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKE 869

Query: 61   QCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIGIVSRYQSNPGLDHWTTVKAI 120
            Q PKTPQEVEDMR IPYASAVGSLMY +L TR DICYA+GIVSRYQSNPGLDHWT VK +
Sbjct: 870  QSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIV 929

Query: 121  LKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSG--------------IKQGC 180
            LKYLRRTR+Y LVYG+ DL LTGYTDSDFQTDKDSRKSTSG              IKQGC
Sbjct: 930  LKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGC 989

Query: 181  IADSTMEAEYVAACEAAKEVVWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHK 240
            IADSTMEAEYVAACEAAKE VWLRKF+ DLEVVPNMNL I L+CDNSGAVANS+EPRSHK
Sbjct: 990  IADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHK 1049

Query: 241  RGKHMERKYHLIREIVHRMD 247
            RGKH+ERKYHLIREIV R D
Sbjct: 1050 RGKHIERKYHLIREIVQRGD 1069

BLAST of Tan0007534 vs. ExPASy TrEMBL
Match: E2GK51 (Gag/pol protein (Fragment) OS=Bryonia dioica OX=3652 PE=4 SV=1)

HSP 1 Score: 427.2 bits (1097), Expect = 4.9e-116
Identity = 210/260 (80.77%), Positives = 228/260 (87.69%), Query Frame = 0

Query: 1    MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKD 60
            MKDLGEAQY+LGIQIVRNRKN+TLAMSQ SYIDK+LSRYKMQNSK G LPFRHG+HLSK+
Sbjct: 1028 MKDLGEAQYILGIQIVRNRKNKTLAMSQASYIDKVLSRYKMQNSKKGQLPFRHGIHLSKE 1087

Query: 61   QCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIGIVSRYQSNPGLDHWTTVKAI 120
            QCPKTPQEVEDMR IPY+SAVGSLMY +L TR DICY++GIVSRYQSNPG DHWT VK I
Sbjct: 1088 QCPKTPQEVEDMRNIPYSSAVGSLMYAMLCTRPDICYSVGIVSRYQSNPGRDHWTAVKNI 1147

Query: 121  LKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSG--------------IKQGC 180
            LKYLRRTRNY LVYG+ DL LTGYTDSDFQ+DKD+RKSTSG              +KQ C
Sbjct: 1148 LKYLRRTRNYMLVYGAKDLILTGYTDSDFQSDKDARKSTSGSVFTLNGGAVVWRSVKQTC 1207

Query: 181  IADSTMEAEYVAACEAAKEVVWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHK 240
            IADSTMEAEYVAACEAAKE VWLRKF+TDLEVVPNM+L I L+CDNSGAVANS+EPRSHK
Sbjct: 1208 IADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPRSHK 1267

Query: 241  RGKHMERKYHLIREIVHRMD 247
            RGKH+ERKYHLIREIVHR D
Sbjct: 1268 RGKHIERKYHLIREIVHRGD 1287

BLAST of Tan0007534 vs. ExPASy TrEMBL
Match: A0A5A7V1F5 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold753G00440 PE=4 SV=1)

HSP 1 Score: 424.5 bits (1090), Expect = 3.2e-115
Identity = 212/260 (81.54%), Positives = 227/260 (87.31%), Query Frame = 0

Query: 1   MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKD 60
           MKDLGEAQYVLGIQI+R+RKN+TLA+SQ +YIDK+L RY MQNSK GLLPFRHGVHLSK+
Sbjct: 97  MKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKE 156

Query: 61  QCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIGIVSRYQSNPGLDHWTTVKAI 120
           Q PKTPQEVEDMR IPYASAVGSLMY +L TR DICYA+GIVSRYQSNPGLDHWTTVK I
Sbjct: 157 QSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTTVKII 216

Query: 121 LKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSG--------------IKQGC 180
           LKYLRRTR+Y LVYG+ DL LTGYTDSDFQTDKDSRKSTSG              IKQGC
Sbjct: 217 LKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNEGAVVWRSIKQGC 276

Query: 181 IADSTMEAEYVAACEAAKEVVWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHK 240
           IADSTMEAEYVAACEAAKE VWLRKF+ DLEVVPNMNL I L+CDNSGAVANS+EPRSHK
Sbjct: 277 IADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHK 336

Query: 241 RGKHMERKYHLIREIVHRMD 247
           RGKH+ERKYHLIREIV R D
Sbjct: 337 RGKHIERKYHLIREIVQRGD 356

BLAST of Tan0007534 vs. ExPASy TrEMBL
Match: A0A5A7TKM4 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold246G00470 PE=4 SV=1)

HSP 1 Score: 424.1 bits (1089), Expect = 4.1e-115
Identity = 212/260 (81.54%), Positives = 227/260 (87.31%), Query Frame = 0

Query: 1   MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKD 60
           MKDLGEAQYVLGIQI+R+RKN+TLA+SQ +YIDK+L RY MQNSK GLLPFRHGVHLSK+
Sbjct: 1   MKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKE 60

Query: 61  QCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIGIVSRYQSNPGLDHWTTVKAI 120
           Q PKTPQEVEDMR IPYASAVGSLMYV+L TR DICYA+GIVSRYQSNPGLDHWT VK I
Sbjct: 61  QSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKII 120

Query: 121 LKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSG--------------IKQGC 180
           LKYLRRTR+Y LVYG+ DL LTGYTDSDFQTDKDSRKSTSG              IKQGC
Sbjct: 121 LKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGC 180

Query: 181 IADSTMEAEYVAACEAAKEVVWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHK 240
           IADSTMEAEYVAACEAAKE VWLRKF+ DLEVVPNMNL I L+CDNSGAVANS+EPRSHK
Sbjct: 181 IADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHK 240

Query: 241 RGKHMERKYHLIREIVHRMD 247
           RGKH+ERKYHLIREIV R D
Sbjct: 241 RGKHIERKYHLIREIVQRGD 260

BLAST of Tan0007534 vs. ExPASy TrEMBL
Match: A0A5A7TZD0 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1163G00090 PE=4 SV=1)

HSP 1 Score: 422.2 bits (1084), Expect = 1.6e-114
Identity = 210/260 (80.77%), Positives = 226/260 (86.92%), Query Frame = 0

Query: 1    MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKD 60
            MKDLGEAQYVLGIQI+R+RKN+TLA+SQ +YIDK+L RY MQNSK GLLPFRHGVHLSK+
Sbjct: 936  MKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKE 995

Query: 61   QCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIGIVSRYQSNPGLDHWTTVKAI 120
            Q PKTPQEVEDMR IPYASAVGSLMY +L TR DICYA+GIVSRYQSNPGLDHWT VK +
Sbjct: 996  QSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIV 1055

Query: 121  LKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSG--------------IKQGC 180
            LKYLRRTR+Y LVYG+ DL LTGYTDSDFQTDKDSRKSTSG              IKQGC
Sbjct: 1056 LKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGC 1115

Query: 181  IADSTMEAEYVAACEAAKEVVWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHK 240
            IADSTMEAEYVAACEAAKE VWLRKF+ DLEVVPNMNL I L+CDNSGAVANS+EPRSHK
Sbjct: 1116 IADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHK 1175

Query: 241  RGKHMERKYHLIREIVHRMD 247
            RGKH+ERKYHLIREIV R D
Sbjct: 1176 RGKHIERKYHLIREIVQRGD 1195

BLAST of Tan0007534 vs. ExPASy TrEMBL
Match: A0A5A7UYE8 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold430G001570 PE=4 SV=1)

HSP 1 Score: 422.2 bits (1084), Expect = 1.6e-114
Identity = 210/260 (80.77%), Positives = 226/260 (86.92%), Query Frame = 0

Query: 1    MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKD 60
            MKDLGEAQYVLGIQI+R+RKN+TLA+SQ +YIDK+L RY MQNSK GLLPFRHGVHLSK+
Sbjct: 810  MKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKE 869

Query: 61   QCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIGIVSRYQSNPGLDHWTTVKAI 120
            Q PKTPQEVEDMR IPYASAVGSLMY +L TR DICYA+GIVSRYQSNPGLDHWT VK +
Sbjct: 870  QSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIV 929

Query: 121  LKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSG--------------IKQGC 180
            LKYLRRTR+Y LVYG+ DL LTGYTDSDFQTDKDSRKSTSG              IKQGC
Sbjct: 930  LKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGC 989

Query: 181  IADSTMEAEYVAACEAAKEVVWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHK 240
            IADSTMEAEYVAACEAAKE VWLRKF+ DLEVVPNMNL I L+CDNSGAVANS+EPRSHK
Sbjct: 990  IADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHK 1049

Query: 241  RGKHMERKYHLIREIVHRMD 247
            RGKH+ERKYHLIREIV R D
Sbjct: 1050 RGKHIERKYHLIREIVQRGD 1069

BLAST of Tan0007534 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 104.0 bits (258), Expect = 1.8e-22
Identity = 77/255 (30.20%), Positives = 125/255 (49.02%), Query Frame = 0

Query: 1   MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKD 60
           ++DLG  +Y LG++I R+     + + Q  Y   +L    +   K   +P    V  S  
Sbjct: 310 LRDLGPLKYFLGLEIARSAAG--INICQRKYALDLLDETGLLGCKPSSVPMDPSVTFS-- 369

Query: 61  QCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIGIVSRYQSNPGLDHWTTVKAI 120
               +  +  D +   Y   +G LMY L  TR DI +A+  +S++   P L H   V  I
Sbjct: 370 --AHSGGDFVDAKA--YRRLIGRLMY-LQITRLDISFAVNKLSQFSEAPRLAHQQAVMKI 429

Query: 121 LKYLRRTRNYSLVYGS-GDLFLTGYTDSDFQTDKDSRKSTSGI--------------KQG 180
           L Y++ T    L Y S  ++ L  ++D+ FQ+ KD+R+ST+G               KQ 
Sbjct: 430 LHYIKGTVGQGLFYSSQAEMQLQVFSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQ 489

Query: 181 CIADSTMEAEYVAACEAAKEVVWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSH 240
            ++ S+ EAEY A   A  E++WL +F  +L++   ++    LFCDN+ A+  +     H
Sbjct: 490 VVSKSSAEAEYRALSFATDEMMWLAQFFRELQL--PLSKPTLLFCDNTAAIHIATNAVFH 549

BLAST of Tan0007534 vs. TAIR 10
Match: ATMG00810.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 77.0 bits (188), Expect = 2.4e-14
Identity = 64/205 (31.22%), Positives = 100/205 (48.78%), Query Frame = 0

Query: 1   MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSK--NGLLPFRHGVHLS 60
           MKDLG   Y LGIQI  +     L +SQT Y ++IL+   M + K  +  LP +    +S
Sbjct: 33  MKDLGPVHYFLGIQIKTHPSG--LFLSQTKYAEQILNNAGMLDCKPMSTPLPLKLNSSVS 92

Query: 61  KDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIGIVSRYQSNPGLDHWTTVK 120
             + P    +  D R     S VG+L Y+ L TR DI YA+ IV +    P L  +  +K
Sbjct: 93  TAKYP----DPSDFR-----SIVGALQYLTL-TRPDISYAVNIVCQRMHEPTLADFDLLK 152

Query: 121 AILKYLRRTRNYSL-VYGSGDLFLTGYTDSDFQTDKDSRKSTSGI--------------K 180
            +L+Y++ T  + L ++ +  L +  + DSD+     +R+ST+G               +
Sbjct: 153 RVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRSTTGFCTFLGCNIISWSAKR 212

Query: 181 QGCIADSTMEAEYVAACEAAKEVVW 189
           Q  ++ S+ E EY A    A E+ W
Sbjct: 213 QPTVSRSSTETEYRALALTAAELTW 225

The following BLAST results are available for this feature:

ExPASy Swiss-Prot
Match Name  E-value   Identity (%)  Description
P10978      1.0e-54   43.75         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
P04146      7.9e-31   31.94         Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
Q9ZT94      4.7e-23   30.74         Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
Q94HW2      2.3e-22   32.30         Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
P0CV72      1.3e-20   39.85         Secreted RxLR effector protein 161 OS=Plasmopara viticola OX=143451 GN=RXLR161 P...

NCBI nr
Match Name  E-value   Identity (%)  Description
ADJ18449.1  1.0e-115  80.77         gag/pol protein, partial [Bryonia dioica]
KAA0061170.1  6.5e-115  81.54       gag/pol protein [Cucumis melo var. makuwa]
KAA0042496.1  8.5e-115  81.54       gag/pol protein [Cucumis melo var. makuwa]
KAA0025945.1  3.2e-114  80.77       gag/pol protein [Cucumis melo var. makuwa] >KAA0026303.1 gag/pol protein [Cucumi...
KAA0059226.1  3.2e-114  80.77       gag/pol protein [Cucumis melo var. makuwa]

ExPASy TrEMBL
Match Name  E-value   Identity (%)  Description
E2GK51      4.9e-116  80.77         Gag/pol protein (Fragment) OS=Bryonia dioica OX=3652 PE=4 SV=1
A0A5A7V1F5  3.2e-115  81.54         Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold753G0044...
A0A5A7TKM4  4.1e-115  81.54         Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold246G0047...
A0A5A7TZD0  1.6e-114  80.77         Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1163G000...
A0A5A7UYE8  1.6e-114  80.77         Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold430G0015...

TAIR 10
Match Name   E-value  Identity (%)  Description
AT4G23160.1  1.8e-22  30.20         cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00810.1  2.4e-14  31.22         DNA/RNA polymerases superfamily protein

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source   Source Term      Source Description                        Alignment
None      No IPR available  PANTHER  PTHR11439        GAG-POL-RELATED RETROTRANSPOSON           coord: 163..241
None      No IPR available  PANTHER  PTHR11439:SF273  RIBOSOME BIOGENESIS PROTEIN BOP1 HOMOLOG  coord: 163..241
None      No IPR available  CDD      cd09272          RNase_HI_RT_Ty1                           coord: 142..242; e-value: 2.44213E-40; score: 134.133

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0007534.1  Tan0007534.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
biological_process  GO:0006508      proteolysis
molecular_function  GO:0008234      cysteine-type peptidase activity
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0008270      zinc ion binding