Tan0004450 (gene) Snake gourd v1

Overview
NameTan0004450
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
LocationLG06: 41888037 .. 41888937 (-)
RNA-Seq ExpressionTan0004450
SyntenyTan0004450
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAAGATTTGGGAGAGGCACAATATGTTCTAGGTATCCAGATTGTCCCGAACCGCAAGAACAGAATGCTAGCCATGTCTCAGGCATCTTACATTGACAAGATGTTGTCTAGGTATAAGATGCAGAACTCCAAGAAGGGCTTGCTGCCTTTCAGGCATGGGGTTCACCTGTCTAAGGATCAATGTCCTAAGATGCCTCAAGAGGTTGAGGATATGAGACGAATCCCCTATGCTTCAAATTGTAGGGAGCCTGATGTATGCCATGTTGTGTACTAGGCCCGACATCTGTTATGCAGTTGGGATTGTCAGTAGGTATCAATCCAATCCAGGATTAGATCATTGGATAACCGTAAAGGCAATCCTCAAGTATCTTAGGAGAACGAGGAACTATAGCCTTGTGTATGGGAGTGGGGATTTGATCCTTACGAGATACACAGATTCTGACTTTCAGACCGATAAGGATTCTAGGAAATCCACTTCGGGGTCAGCCTTCATTCTAAATGGAGGACCTGTAGTGTGGCGAAGCATCAAACAGGGATGCATCGTTGATTCCACGATGGAAGCCAAGTATGTTGCGGCTTGTGAAGCTGCAAAGGAAGCTGTTTGGCTCAGGAAGTTCATGACGGATTTAGAAGTTGTTCCAAATATGAACTTACCGATCACGTTGTTCTGTGACAACACTGGTGCAGTAGCCAACTCGAGAGAACTTCGGAGTAATAAAAGGGACAAGCATATAGAGCGTAAGTATCACTTGATACGGGAGATTGTGCACCGTGGAGACGTGACAGTCACGCAGATAGCTTCGGAGCACAACGTTGTTGATCCATTTACAAAGGCCCTTATGGCTAAGGTGTTTGAGGGTCACCTAGAGAGTCTAGGTCTTCGAGTGCCTCCTGACTAG

mRNA sequence

ATGAAAGATTTGGGAGAGGCACAATATGTTCTAGGTATCCAGATTGTCCCGAACCGCAAGAACAGAATGCTAGCCATGTCTCAGGCATCTTACATTGACAAGATGTTGTCTAGGTATAAGATGCAGAACTCCAAGAAGGGCTTGCTGCCTTTCAGGCATGGGGTTCACCTGTCTAAGGATCAATGTCCTAAGATGCCTCAAGAGGTTGAGGATATGAGACGAATCCCCTATGCTTCAAATTGTAGGGAGCCTGATGCAATCCTCAAGTATCTTAGGAGAACGAGGAACTATAGCCTTGTGTATGGGAGTGGGGATTTGATCCTTACGAGATACACAGATTCTGACTTTCAGACCGATAAGGATTCTAGGAAATCCACTTCGGGGTCAGCCTTCATTCTAAATGGAGGACCTGTAGTGTGGCGAAGCATCAAACAGGGATGCATCGTTGATTCCACGATGGAAGCCAAGTATGTTGCGGCTTGTGAAGCTGCAAAGGAAGCTGTTTGGCTCAGGAAGTTCATGACGGATTTAGAAGTTGTTCCAAATATGAACTTACCGATCACGTTGTTCTGTGACAACACTGGTGCAGTAGCCAACTCGAGAGAACTTCGGAGTAATAAAAGGGACAAGCATATAGAGCGTAAGTATCACTTGATACGGGAGATTGTGCACCGTGGAGACGTGACAGTCACGCAGATAGCTTCGGAGCACAACGTTGTTGATCCATTTACAAAGGCCCTTATGGCTAAGGTGTTTGAGGGTCACCTAGAGAGTCTAGGTCTTCGAGTGCCTCCTGACTAG

Coding sequence (CDS)

ATGAAAGATTTGGGAGAGGCACAATATGTTCTAGGTATCCAGATTGTCCCGAACCGCAAGAACAGAATGCTAGCCATGTCTCAGGCATCTTACATTGACAAGATGTTGTCTAGGTATAAGATGCAGAACTCCAAGAAGGGCTTGCTGCCTTTCAGGCATGGGGTTCACCTGTCTAAGGATCAATGTCCTAAGATGCCTCAAGAGGTTGAGGATATGAGACGAATCCCCTATGCTTCAAATTGTAGGGAGCCTGATGCAATCCTCAAGTATCTTAGGAGAACGAGGAACTATAGCCTTGTGTATGGGAGTGGGGATTTGATCCTTACGAGATACACAGATTCTGACTTTCAGACCGATAAGGATTCTAGGAAATCCACTTCGGGGTCAGCCTTCATTCTAAATGGAGGACCTGTAGTGTGGCGAAGCATCAAACAGGGATGCATCGTTGATTCCACGATGGAAGCCAAGTATGTTGCGGCTTGTGAAGCTGCAAAGGAAGCTGTTTGGCTCAGGAAGTTCATGACGGATTTAGAAGTTGTTCCAAATATGAACTTACCGATCACGTTGTTCTGTGACAACACTGGTGCAGTAGCCAACTCGAGAGAACTTCGGAGTAATAAAAGGGACAAGCATATAGAGCGTAAGTATCACTTGATACGGGAGATTGTGCACCGTGGAGACGTGACAGTCACGCAGATAGCTTCGGAGCACAACGTTGTTGATCCATTTACAAAGGCCCTTATGGCTAAGGTGTTTGAGGGTCACCTAGAGAGTCTAGGTCTTCGAGTGCCTCCTGACTAG

Protein sequence

MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKMPQEVEDMRRIPYASNCREPDAILKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLRVPPD
Homology
BLAST of Tan0004450 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 174.5 bits (441), Expect = 1.6e-42
Identity = 103/294 (35.03%), Postives = 154/294 (52.38%), Query Frame = 0

Query: 1    MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKD 60
            MKDLG AQ +LG++IV  R +R L +SQ  YI+++L R+ M+N+K    P    + LSK 
Sbjct: 1035 MKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLERFNMKNAKPVSTPLAGHLKLSKK 1094

Query: 61   QCPKMPQEVEDMRRIPYASN---------CREPDA------------------------I 120
             CP   +E  +M ++PY+S          C  PD                         I
Sbjct: 1095 MCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDIAHAVGVVSRFLENPGKEHWEAVKWI 1154

Query: 121  LKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGC 180
            L+YLR T    L +G  D IL  YTD+D   D D+RKS++G  F  +GG + W+S  Q C
Sbjct: 1155 LRYLRGTTGDCLCFGGSDPILKGYTDADMAGDIDNRKSSTGYLFTFSGGAISWQSKLQKC 1214

Query: 181  IVDSTMEAKYVAACEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNK 240
            +  ST EA+Y+AA E  KE +WL++F+ +L +         ++CD+  A+  S+    + 
Sbjct: 1215 VALSTTEAEYIAATETGKEMIWLKRFLQELGL---HQKEYVVYCDSQSAIDLSKNSMYHA 1274

Query: 241  RDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGL 262
            R KHI+ +YH IRE+V    + V +I++  N  D  TK +    FE   E +G+
Sbjct: 1275 RTKHIDVRYHWIREMVDDESLKVLKISTNENPADMLTKVVPRNKFELCKELVGM 1325

BLAST of Tan0004450 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 110.5 bits (275), Expect = 2.9e-23
Identity = 79/291 (27.15%), Postives = 140/291 (48.11%), Query Frame = 0

Query: 1    MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHL--- 60
            M DL E ++ +GI+I    +   + +SQ++Y+ K+LS++ M+N      P    ++    
Sbjct: 1114 MTDLNEIKHFIGIRI--EMQEDKIYLSQSAYVKKILSKFNMENCNAVSTPLPSKINYELL 1173

Query: 61   -SKDQC-------------------PKMPQEVEDMRRIPYASNC---REPDAILKYLRRT 120
             S + C                   P +   V  + R    +N    +    +L+YL+ T
Sbjct: 1174 NSDEDCNTPCRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRVLRYLKGT 1233

Query: 121  RNYSLVYGSG---DLILTRYTDSDFQTDKDSRKSTSGSAF-ILNGGPVVWRSIKQGCIVD 180
             +  L++      +  +  Y DSD+   +  RKST+G  F + +   + W + +Q  +  
Sbjct: 1234 IDMKLIFKKNLAFENKIIGYVDSDWAGSEIDRKSTTGYLFKMFDFNLICWNTKRQNSVAA 1293

Query: 181  STMEAKYVAACEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDK 240
            S+ EA+Y+A  EA +EA+WL+  +T + +   +  PI ++ DN G ++ +     +KR K
Sbjct: 1294 SSTEAEYMALFEAVREALWLKFLLTSINI--KLENPIKIYEDNQGCISIANNPSCHKRAK 1353

Query: 241  HIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGL 262
            HI+ KYH  RE V    + +  I +E+ + D FTK L A  F    + LGL
Sbjct: 1354 HIDIKYHFAREQVQNNVICLEYIPTENQLADIFTKPLPAARFVELRDKLGL 1400

BLAST of Tan0004450 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 106.7 bits (265), Expect = 4.2e-22
Identity = 83/293 (28.33%), Postives = 123/293 (41.98%), Query Frame = 0

Query: 1    MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKD 60
            +KD  E  Y LGI+    R    L +SQ  YI  +L+R  M  +K    P      LS  
Sbjct: 1177 VKDHEELHYFLGIE--AKRVPTGLHLSQRRYILDLLARTNMITAKPVTTPMAPSPKLSLY 1236

Query: 61   QCPKMPQEVE--------------------------DMRRIPYASNCREPDAILKYLRRT 120
               K+    E                              +P   + +    IL+YL  T
Sbjct: 1237 SGTKLTDPTEYRGIVGSLQYLAFTRPDISYAVNRLSQFMHMPTEEHLQALKRILRYLAGT 1296

Query: 121  RNYSLVYGSGD-LILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTM 180
             N+ +    G+ L L  Y+D+D+  DKD   ST+G    L   P+ W S KQ  +V S+ 
Sbjct: 1297 PNHGIFLKKGNTLSLHAYSDADWAGDKDDYVSTNGYIVYLGHHPISWSSKKQKGVVRSST 1356

Query: 181  EAKYVAACEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIE 240
            EA+Y +    + E  W+   +T+L +   +  P  ++CDN GA         + R KHI 
Sbjct: 1357 EAEYRSVANTSSEMQWICSLLTELGI--RLTRPPVIYCDNVGATYLCANPVFHSRMKHIA 1416

Query: 241  RKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGL-RVPP 266
              YH IR  V  G + V  +++   + D  TK L    F+     +G+ RVPP
Sbjct: 1417 IDYHFIRNQVQSGALRVVHVSTHDQLADTLTKPLSRTAFQNFASKIGVTRVPP 1465

BLAST of Tan0004450 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 98.6 bits (244), Expect = 1.1e-19
Identity = 76/293 (25.94%), Postives = 123/293 (41.98%), Query Frame = 0

Query: 1    MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKD 60
            +K+  +  Y LGI+    R  + L +SQ  Y   +L+R  M  +K    P      L+  
Sbjct: 1160 VKEHEDLHYFLGIE--AKRVPQGLHLSQRRYTLDLLARTNMLTAKPVATPMATSPKLTLH 1219

Query: 61   QCPKMPQEVE--------------------------DMRRIPYASNCREPDAILKYLRRT 120
               K+P   E                              +P   +      +L+YL  T
Sbjct: 1220 SGTKLPDPTEYRGIVGSLQYLAFTRPDLSYAVNRLSQYMHMPTDDHWNALKRVLRYLAGT 1279

Query: 121  RNYSLVYGSGD-LILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTM 180
             ++ +    G+ L L  Y+D+D+  D D   ST+G    L   P+ W S KQ  +V S+ 
Sbjct: 1280 PDHGIFLKKGNTLSLHAYSDADWAGDTDDYVSTNGYIVYLGHHPISWSSKKQKGVVRSST 1339

Query: 181  EAKYVAACEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIE 240
            EA+Y +    + E  W+   +T+L +   ++ P  ++CDN GA         + R KHI 
Sbjct: 1340 EAEYRSVANTSSELQWICSLLTELGI--QLSHPPVIYCDNVGATYLCANPVFHSRMKHIA 1399

Query: 241  RKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLG-LRVPP 266
              YH IR  V  G + V  +++   + D  TK L    F+     +G ++VPP
Sbjct: 1400 LDYHFIRNQVQSGALRVVHVSTHDQLADTLTKPLSRVAFQNFSRKIGVIKVPP 1448

BLAST of Tan0004450 vs. ExPASy Swiss-Prot
Match: P0CV72 (Secreted RxLR effector protein 161 OS=Plasmopara viticola OX=143451 GN=RXLR161 PE=2 SV=1)

HSP 1 Score: 75.1 bits (183), Expect = 1.3e-12
Identity = 40/85 (47.06%), Postives = 56/85 (65.88%), Query Frame = 0

Query: 87  ILKYLRRTRNYSLVY-GSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQ 146
           +L+YL+ T+ Y L +  +G   L  Y+D+D+  D +SR+STSG  F LNGG V WRS KQ
Sbjct: 49  VLRYLQSTQTYGLEFTRAGTAKLVGYSDADWAGDVESRRSTSGYLFKLNGGCVSWRSKKQ 108

Query: 147 GCIVDSTMEAKYVAACEAAKEAVWL 171
             +  S+ E +Y+A  EA +EAVWL
Sbjct: 109 RTVALSSTEDEYMALSEATQEAVWL 133

BLAST of Tan0004450 vs. NCBI nr
Match: KAA0042496.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 425.2 bits (1092), Expect = 4.0e-115
Identity = 220/295 (74.58%), Postives = 237/295 (80.34%), Query Frame = 0

Query: 1   MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKD 60
           MKDLGEAQYVLGIQI+ +RKN+ LA+SQA+YIDK+L RY MQNSKKGLLPFRHGVHLSK+
Sbjct: 1   MKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKE 60

Query: 61  QCPKMPQEVEDMRRIPYASN---------CREPD------------------------AI 120
           Q PK PQEVEDMRRIPYAS          C  PD                         I
Sbjct: 61  QSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKII 120

Query: 121 LKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGC 180
           LKYLRRTR+Y LVYG+ DLILT YTDSDFQTDKDSRKSTSGS F LNGG VVWRSIKQGC
Sbjct: 121 LKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGC 180

Query: 181 IVDSTMEAKYVAACEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNK 240
           I DSTMEA+YVAACEAAKEAVWLRKF+ DLEVVPNMNLPITL+CDN+GAVANS+E RS+K
Sbjct: 181 IADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHK 240

Query: 241 RDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLR 263
           R KHIERKYHLIREIV RGDV VT+IASEHN+ DPFTK L AKVFEGHLESLGLR
Sbjct: 241 RGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLR 295

BLAST of Tan0004450 vs. NCBI nr
Match: KAA0025945.1 (gag/pol protein [Cucumis melo var. makuwa] >KAA0026303.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0035786.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0040492.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0041262.1 gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 424.9 bits (1091), Expect = 5.3e-115
Identity = 219/295 (74.24%), Postives = 237/295 (80.34%), Query Frame = 0

Query: 1    MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKD 60
            MKDLGEAQYVLGIQI+ +RKN+ LA+SQA+YIDK+L RY MQNSKKGLLPFRHGVHLSK+
Sbjct: 936  MKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKE 995

Query: 61   QCPKMPQEVEDMRRIPYASN---------CREPD------------------------AI 120
            Q PK PQEVEDMRRIPYAS          C  PD                         +
Sbjct: 996  QSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIV 1055

Query: 121  LKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGC 180
            LKYLRRTR+Y LVYG+ DLILT YTDSDFQTDKDSRKSTSGS F LNGG VVWRSIKQGC
Sbjct: 1056 LKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGC 1115

Query: 181  IVDSTMEAKYVAACEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNK 240
            I DSTMEA+YVAACEAAKEAVWLRKF+ DLEVVPNMNLPITL+CDN+GAVANS+E RS+K
Sbjct: 1116 IADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHK 1175

Query: 241  RDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLR 263
            R KHIERKYHLIREIV RGDV VT+IASEHN+ DPFTK L AKVFEGHLESLGLR
Sbjct: 1176 RGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLR 1230

BLAST of Tan0004450 vs. NCBI nr
Match: KAA0059226.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 424.9 bits (1091), Expect = 5.3e-115
Identity = 219/295 (74.24%), Postives = 237/295 (80.34%), Query Frame = 0

Query: 1    MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKD 60
            MKDLGEAQYVLGIQI+ +RKN+ LA+SQA+YIDK+L RY MQNSKKGLLPFRHGVHLSK+
Sbjct: 810  MKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKE 869

Query: 61   QCPKMPQEVEDMRRIPYASN---------CREPD------------------------AI 120
            Q PK PQEVEDMRRIPYAS          C  PD                         +
Sbjct: 870  QSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIV 929

Query: 121  LKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGC 180
            LKYLRRTR+Y LVYG+ DLILT YTDSDFQTDKDSRKSTSGS F LNGG VVWRSIKQGC
Sbjct: 930  LKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGC 989

Query: 181  IVDSTMEAKYVAACEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNK 240
            I DSTMEA+YVAACEAAKEAVWLRKF+ DLEVVPNMNLPITL+CDN+GAVANS+E RS+K
Sbjct: 990  IADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHK 1049

Query: 241  RDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLR 263
            R KHIERKYHLIREIV RGDV VT+IASEHN+ DPFTK L AKVFEGHLESLGLR
Sbjct: 1050 RGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLR 1104

BLAST of Tan0004450 vs. NCBI nr
Match: KAA0061170.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 421.8 bits (1083), Expect = 4.4e-114
Identity = 219/295 (74.24%), Postives = 236/295 (80.00%), Query Frame = 0

Query: 1   MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKD 60
           MKDLGEAQYVLGIQI+ +RKN+ LA+SQA+YIDK+L RY MQNSKKGLLPFRHGVHLSK+
Sbjct: 97  MKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKE 156

Query: 61  QCPKMPQEVEDMRRIPYASN---------CREPD------------------------AI 120
           Q PK PQEVEDMRRIPYAS          C  PD                         I
Sbjct: 157 QSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTTVKII 216

Query: 121 LKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGC 180
           LKYLRRTR+Y LVYG+ DLILT YTDSDFQTDKDSRKSTSGS F LN G VVWRSIKQGC
Sbjct: 217 LKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNEGAVVWRSIKQGC 276

Query: 181 IVDSTMEAKYVAACEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNK 240
           I DSTMEA+YVAACEAAKEAVWLRKF+ DLEVVPNMNLPITL+CDN+GAVANS+E RS+K
Sbjct: 277 IADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHK 336

Query: 241 RDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLR 263
           R KHIERKYHLIREIV RGDV VT+IASEHN+ DPFTK L AKVFEGHLESLGLR
Sbjct: 337 RGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKILTAKVFEGHLESLGLR 391

BLAST of Tan0004450 vs. NCBI nr
Match: KAA0035907.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 417.5 bits (1072), Expect = 8.4e-113
Identity = 216/295 (73.22%), Postives = 235/295 (79.66%), Query Frame = 0

Query: 1    MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKD 60
            MKDLGE QYVLGIQI+ +RKN+ LA+SQA+YIDK+L RY MQNSKKGLLPFRHGVHLSK+
Sbjct: 936  MKDLGEGQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKE 995

Query: 61   QCPKMPQEVEDMRRIPYASN---------CREPD------------------------AI 120
            Q PK PQEVEDMRRIPYAS          C  PD                         I
Sbjct: 996  QSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKII 1055

Query: 121  LKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGC 180
            LKYLRRTR+Y LVYG+ DLILT YT+SDFQTDKDSRKSTS S F LNGG VVWRSIKQGC
Sbjct: 1056 LKYLRRTRDYMLVYGAKDLILTGYTNSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGC 1115

Query: 181  IVDSTMEAKYVAACEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNK 240
            I DSTMEA+YVAACEAAKEAVWL+KF+ DLEVVPNMNLPITL+CDN+GAVANS+E RS+K
Sbjct: 1116 IADSTMEAEYVAACEAAKEAVWLKKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHK 1175

Query: 241  RDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLR 263
            R KHIERKYHLIREIV RGDV VT+IASEHN+ DPFTK L AKVFEGHLESLGLR
Sbjct: 1176 RGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLR 1230

BLAST of Tan0004450 vs. ExPASy TrEMBL
Match: A0A5A7TKM4 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold246G00470 PE=4 SV=1)

HSP 1 Score: 425.2 bits (1092), Expect = 1.9e-115
Identity = 220/295 (74.58%), Postives = 237/295 (80.34%), Query Frame = 0

Query: 1   MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKD 60
           MKDLGEAQYVLGIQI+ +RKN+ LA+SQA+YIDK+L RY MQNSKKGLLPFRHGVHLSK+
Sbjct: 1   MKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKE 60

Query: 61  QCPKMPQEVEDMRRIPYASN---------CREPD------------------------AI 120
           Q PK PQEVEDMRRIPYAS          C  PD                         I
Sbjct: 61  QSPKTPQEVEDMRRIPYASAVGSLMYVMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKII 120

Query: 121 LKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGC 180
           LKYLRRTR+Y LVYG+ DLILT YTDSDFQTDKDSRKSTSGS F LNGG VVWRSIKQGC
Sbjct: 121 LKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGC 180

Query: 181 IVDSTMEAKYVAACEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNK 240
           I DSTMEA+YVAACEAAKEAVWLRKF+ DLEVVPNMNLPITL+CDN+GAVANS+E RS+K
Sbjct: 181 IADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHK 240

Query: 241 RDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLR 263
           R KHIERKYHLIREIV RGDV VT+IASEHN+ DPFTK L AKVFEGHLESLGLR
Sbjct: 241 RGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLR 295

BLAST of Tan0004450 vs. ExPASy TrEMBL
Match: A0A5A7TZD0 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1163G00090 PE=4 SV=1)

HSP 1 Score: 424.9 bits (1091), Expect = 2.5e-115
Identity = 219/295 (74.24%), Postives = 237/295 (80.34%), Query Frame = 0

Query: 1    MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKD 60
            MKDLGEAQYVLGIQI+ +RKN+ LA+SQA+YIDK+L RY MQNSKKGLLPFRHGVHLSK+
Sbjct: 936  MKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKE 995

Query: 61   QCPKMPQEVEDMRRIPYASN---------CREPD------------------------AI 120
            Q PK PQEVEDMRRIPYAS          C  PD                         +
Sbjct: 996  QSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIV 1055

Query: 121  LKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGC 180
            LKYLRRTR+Y LVYG+ DLILT YTDSDFQTDKDSRKSTSGS F LNGG VVWRSIKQGC
Sbjct: 1056 LKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGC 1115

Query: 181  IVDSTMEAKYVAACEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNK 240
            I DSTMEA+YVAACEAAKEAVWLRKF+ DLEVVPNMNLPITL+CDN+GAVANS+E RS+K
Sbjct: 1116 IADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHK 1175

Query: 241  RDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLR 263
            R KHIERKYHLIREIV RGDV VT+IASEHN+ DPFTK L AKVFEGHLESLGLR
Sbjct: 1176 RGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLR 1230

BLAST of Tan0004450 vs. ExPASy TrEMBL
Match: A0A5A7UYE8 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold430G001570 PE=4 SV=1)

HSP 1 Score: 424.9 bits (1091), Expect = 2.5e-115
Identity = 219/295 (74.24%), Postives = 237/295 (80.34%), Query Frame = 0

Query: 1    MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKD 60
            MKDLGEAQYVLGIQI+ +RKN+ LA+SQA+YIDK+L RY MQNSKKGLLPFRHGVHLSK+
Sbjct: 810  MKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKE 869

Query: 61   QCPKMPQEVEDMRRIPYASN---------CREPD------------------------AI 120
            Q PK PQEVEDMRRIPYAS          C  PD                         +
Sbjct: 870  QSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIV 929

Query: 121  LKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGC 180
            LKYLRRTR+Y LVYG+ DLILT YTDSDFQTDKDSRKSTSGS F LNGG VVWRSIKQGC
Sbjct: 930  LKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGC 989

Query: 181  IVDSTMEAKYVAACEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNK 240
            I DSTMEA+YVAACEAAKEAVWLRKF+ DLEVVPNMNLPITL+CDN+GAVANS+E RS+K
Sbjct: 990  IADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHK 1049

Query: 241  RDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLR 263
            R KHIERKYHLIREIV RGDV VT+IASEHN+ DPFTK L AKVFEGHLESLGLR
Sbjct: 1050 RGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLR 1104

BLAST of Tan0004450 vs. ExPASy TrEMBL
Match: A0A5A7V1F5 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold753G00440 PE=4 SV=1)

HSP 1 Score: 421.8 bits (1083), Expect = 2.2e-114
Identity = 219/295 (74.24%), Postives = 236/295 (80.00%), Query Frame = 0

Query: 1   MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKD 60
           MKDLGEAQYVLGIQI+ +RKN+ LA+SQA+YIDK+L RY MQNSKKGLLPFRHGVHLSK+
Sbjct: 97  MKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKE 156

Query: 61  QCPKMPQEVEDMRRIPYASN---------CREPD------------------------AI 120
           Q PK PQEVEDMRRIPYAS          C  PD                         I
Sbjct: 157 QSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTTVKII 216

Query: 121 LKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGC 180
           LKYLRRTR+Y LVYG+ DLILT YTDSDFQTDKDSRKSTSGS F LN G VVWRSIKQGC
Sbjct: 217 LKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNEGAVVWRSIKQGC 276

Query: 181 IVDSTMEAKYVAACEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNK 240
           I DSTMEA+YVAACEAAKEAVWLRKF+ DLEVVPNMNLPITL+CDN+GAVANS+E RS+K
Sbjct: 277 IADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHK 336

Query: 241 RDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLR 263
           R KHIERKYHLIREIV RGDV VT+IASEHN+ DPFTK L AKVFEGHLESLGLR
Sbjct: 337 RGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKILTAKVFEGHLESLGLR 391

BLAST of Tan0004450 vs. ExPASy TrEMBL
Match: A0A5A7T2V9 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold56G00760 PE=4 SV=1)

HSP 1 Score: 417.5 bits (1072), Expect = 4.1e-113
Identity = 216/295 (73.22%), Postives = 235/295 (79.66%), Query Frame = 0

Query: 1    MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKD 60
            MKDLGE QYVLGIQI+ +RKN+ LA+SQA+YIDK+L RY MQNSKKGLLPFRHGVHLSK+
Sbjct: 936  MKDLGEGQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKE 995

Query: 61   QCPKMPQEVEDMRRIPYASN---------CREPD------------------------AI 120
            Q PK PQEVEDMRRIPYAS          C  PD                         I
Sbjct: 996  QSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKII 1055

Query: 121  LKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGC 180
            LKYLRRTR+Y LVYG+ DLILT YT+SDFQTDKDSRKSTS S F LNGG VVWRSIKQGC
Sbjct: 1056 LKYLRRTRDYMLVYGAKDLILTGYTNSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGC 1115

Query: 181  IVDSTMEAKYVAACEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNK 240
            I DSTMEA+YVAACEAAKEAVWL+KF+ DLEVVPNMNLPITL+CDN+GAVANS+E RS+K
Sbjct: 1116 IADSTMEAEYVAACEAAKEAVWLKKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHK 1175

Query: 241  RDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLR 263
            R KHIERKYHLIREIV RGDV VT+IASEHN+ DPFTK L AKVFEGHLESLGLR
Sbjct: 1176 RGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLR 1230

BLAST of Tan0004450 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 86.3 bits (212), Expect = 4.1e-17
Identity = 49/136 (36.03%), Postives = 77/136 (56.62%), Query Frame = 0

Query: 87  ILKYLRRTRNYSLVYGS-GDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQ 146
           IL Y++ T    L Y S  ++ L  ++D+ FQ+ KD+R+ST+G    L    + W+S KQ
Sbjct: 420 ILHYIKGTVGQGLFYSSQAEMQLQVFSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQ 479

Query: 147 GCIVDSTMEAKYVAACEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRS 206
             +  S+ EA+Y A   A  E +WL +F  +L++   ++ P  LFCDNT A+  +     
Sbjct: 480 QVVSKSSAEAEYRALSFATDEMMWLAQFFRELQL--PLSKPTLLFCDNTAAIHIATNAVF 539

Query: 207 NKRDKHIERKYHLIRE 222
           ++R KHIE   H +RE
Sbjct: 540 HERTKHIESDCHSVRE 553

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
P109781.6e-4235.03Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
P041462.9e-2327.15Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
Q94HW24.2e-2228.33Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
Q9ZT941.1e-1925.94Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
P0CV721.3e-1247.06Secreted RxLR effector protein 161 OS=Plasmopara viticola OX=143451 GN=RXLR161 P... [more]
Match NameE-valueIdentityDescription
KAA0042496.14.0e-11574.58gag/pol protein [Cucumis melo var. makuwa][more]
KAA0025945.15.3e-11574.24gag/pol protein [Cucumis melo var. makuwa] >KAA0026303.1 gag/pol protein [Cucumi... [more]
KAA0059226.15.3e-11574.24gag/pol protein [Cucumis melo var. makuwa][more]
KAA0061170.14.4e-11474.24gag/pol protein [Cucumis melo var. makuwa][more]
KAA0035907.18.4e-11373.22gag/pol protein [Cucumis melo var. makuwa][more]
Match NameE-valueIdentityDescription
A0A5A7TKM41.9e-11574.58Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold246G0047... [more]
A0A5A7TZD02.5e-11574.24Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1163G000... [more]
A0A5A7UYE82.5e-11574.24Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold430G0015... [more]
A0A5A7V1F52.2e-11474.24Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold753G0044... [more]
A0A5A7T2V94.1e-11373.22Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold56G00760... [more]
Match NameE-valueIdentityDescription
AT4G23160.14.1e-1736.03cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR11439GAG-POL-RELATED RETROTRANSPOSONcoord: 122..224
NoneNo IPR availablePANTHERPTHR11439:SF273RIBOSOME BIOGENESIS PROTEIN BOP1 HOMOLOGcoord: 122..224
NoneNo IPR availableCDDcd09272RNase_HI_RT_Ty1coord: 111..247
e-value: 8.20134E-62
score: 189.216

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0004450.1Tan0004450.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding