Tan0017081 (gene) Snake gourd v1

Overview
Name: Tan0017081
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Location: LG08: 54722820 .. 54724767 (+)
RNA-Seq Expression: Tan0017081
Synteny: Tan0017081
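
The Location field can be turned into a span length with a few lines of Python. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (the record itself does not state the convention); `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re


def parse_location(text):
    """Split a string such as 'LG08: 54722820 .. 54724767 (+)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    # Assumption: both coordinates are 1-based and inclusive.
    return end - start + 1


seqid, start, end, strand = parse_location("LG08: 54722820 .. 54724767 (+)")
length = span_length(start, end)
print(seqid, strand, length)
```

Under that assumption the gene spans 1948 bp on the plus strand of LG08.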
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAACACGATAAACTACTCACTGACAACTCTTCTTAACGAGCTACAAACCTTCCAGTCCTTGATGAGGATCAGGACGTCGGAAGCTGAGGCAAACGTTGCCATTAGGTCTTATCACAGGGGTTGGACCTCTGGGACAAAGCCTGTAGCTCCTTCACCCCCGAAAGGGAAGAAAAAGATGAAGAGGGGTAAAACTGATCGAGCTGCAGCCCAAAAGGGCAAGAAGACCAAGGAAGTTGCAGAGAAAGGAAAGTGTTTCCACTGCAATGGGGGCGGACACTGGAAGAGGAACTGCTCCAAATTCCTAGGCGAGAAAAAGAATCAAGGTAAATGTGATTTACTTGTGACAGAAACCTTTTAGTGGAGAGTAGTGACTCTACTTAGATATTGGATTCGGGCGCCACTAACCATGTTTGTTCTTTTTTTCAGGGGATTGATTGCTGGTAGCAGCTGCGAGAGGGTGAGGTGACTCTACGGGTTGGATCTGGGAGCTTGTCTCTGCTGCAGCGGTCGGCACGGTGAAGCTACATTTCAACAAGAATTACAATTTGTTAGACAATTTGTATATAGTTCCAGAGTTTACTAGAAACCTCGTTTCTGCTTCCTACATGCTTGAACATTGTATCTCCGTTTCATTCCATGGTAATAAAGCGTTTATTTCCAGAAATGGTAATCTTATTTGTTCTGCTTCACTTGAGAATAATATGTATGTTTTAAAACCTAATTCGGTCAAAAGTGTTTTGAATACTGAATTGTTTAAAACGGCAGAAACGCGAACTAAAAGAACGAAAATTTCTCCTAAAGAAAATGCCCATCTTTGGCATCTATGGTTAGGCCACATTAATCTCAATAGGATTGAGAAACTAGTGAATAGTGGACTTCTAAACGAGTTGGAAGAAAACTTTTCACCGGTGTGTGAGTCATGCCTTGAAGGCAAAATGACCAAATGTCCTTTTAGTGGAAAAGGATATAGAGCAGAGGAGCCCCTTGAGCTAATACACTCTGACCTCTGTGGTCCGATGAATATTAAAGCACGAGGTGGTTATGAATACTTCGTGTCTTTCATAGATTATTACTCGAGGTATGGGCATACTTACCTAATGCATAAGAAGTCTGAAACTCTTGAAAAGTTCAAGGAGTACAAGACTGAGGTTGAGAACCTGTTAGGTAAATCGCTTAAAACACATCGATCGGATCGAGGTGGAGAGTATATGGACACTGAATTCCAAGACTATATGATAGAACACGAAATTACGTCCCAACTCTCAGCACCTGGTATGCCACAACAGAATGGCATATCGGAGAGGAGAAACAGAACCTTGTTGGACATGGTTCGGTCGACGATGAGCTATGCTCGTCTCCCTGATTCCTTTTGGGGTTACGCAGTGGAGACTGCGGTTTACATTTTGAACAACGTTCCGTCAAAGAGTGTTTGTGAAACACCTTTCGAACTATAGAATGACCGTAATGGTAGTTTACACCATTTCAGAATTTAGGGATGCCCGACCCATGTGTTGGTGTCAAACCCGAAAAAGTTGGAACCCCGTTCGAAATTTTGCCTATTCGTAGGTTACCCAAAAGAGACCAGAGGTGGTCTGTGTTTTTATCCTAAGGATAATAGGGTGCTTGTGTCGACAAACGCCACTTTCCTTGAGGAAAATCATATCAGGGATCATTTACCAAGGAGTAAAATGGTGTTAAATGAAATCGATAGTACGTCAGCAAGAGTTGCTGATGGGGCTAGTACATCAACAAGTGTTGTTGATCCTATAACGTCTAGTCAAATTCGTTCCCAAGAGTTGGGAATGCCTCAACGTAGTGGGAGGGTTGTGAGACAGCCTGATCGTTACATGGGTTTAGCTGAAACCCCAGTTGTCACTCCTGATGATGACTGCGAGGATCCATTGACCTATGATCAGGCAATGGTAGATGTTGACAAAGACGAATAG

mRNA sequence

ATGAACACGATAAACTACTCACTGACAACTCTTCTTAACGAGCTACAAACCTTCCAGTCCTTGATGAGGATCAGGACGTCGGAAGCTGAGGCAAACGTTGCCATTAGGTCTTATCACAGGGGTTGGACCTCTGGGACAAAGCCTGTAGCTCCTTCACCCCCGAAAGGGAAGAAAAAGATGAAGAGGGGTAAAACTGATCGAGCTGCAGCCCAAAAGGGCAAGAAGACCAAGGAAGTTGCAGAGAAAGGAAAGTGTTTCCACTGCAATGGGGGCGGACACTGGAAGAGGAACTGCTCCAAATTCCTAGGCGAGAAAAAGAATCAAGGCCACATTAATCTCAATAGGATTGAGAAACTAGTGAATAGTGGACTTCTAAACGAGTTGGAAGAAAACTTTTCACCGGTGTGTGAGTCATGCCTTGAAGGCAAAATGACCAAATGTCCTTTTAGTGGAAAAGGATATAGAGCAGAGGAGCCCCTTGAGCTAATACACTCTGACCTCTGTGGTCCGATGAATATTAAAGCACGAGGTGGTTATGAATACTTCGTGTCTTTCATAGATTATTACTCGAGGTATGGGCATACTTACCTAATGCATAAGAAGTCTGAAACTCTTGAAAAGTTCAAGGAGTACAAGACTGAGGTTGAGAACCTGTTAGGTAAATCGCTTAAAACACATCGATCGGATCGAGGTGGAGAGTATATGGACACTGAATTCCAAGACTATATGATAGAACACGAAATTACGTCCCAACTCTCAGCACCTGGTATGCCACAACAGAATGGCATATCGGAGAGGAGAAACAGAACCTTGTTGGACATGGTTCGGTCGACGATGAGCTATGCTCGTCTCCCTGATTCCTTTTGGGGTTACGCAGTGGAGACTGCGGTTTACATTTTGAACAACAATTTAGGGATGCCCGACCCATGTGTTGGTGTCAAACCCGAAAAAGTTGGAACCCCGTTCGAAATTTTGCCTATTCGTTACCCAAAAGAGACCAGAGGTGGTCTGTGTTTTTATCCTAAGGATAATAGGGTGCTTGTGTCGACAAACGCCACTTTCCTTGAGGAAAATCATATCAGGGATCATTTACCAAGGAGTAAAATGGTGTTAAATGAAATCGATAGTACGTCAGCAAGAGTTGCTGATGGGGCTAGTACATCAACAAGTGTTGTTGATCCTATAACGTCTAGTCAAATTCGTTCCCAAGAGTTGGGAATGCCTCAACGTAGTGGGAGGGTTGTGAGACAGCCTGATCGTTACATGGGTTTAGCTGAAACCCCAGTTGTCACTCCTGATGATGACTGCGAGGATCCATTGACCTATGATCAGGCAATGGTAGATGTTGACAAAGACGAATAG

Coding sequence (CDS)

ATGAACACGATAAACTACTCACTGACAACTCTTCTTAACGAGCTACAAACCTTCCAGTCCTTGATGAGGATCAGGACGTCGGAAGCTGAGGCAAACGTTGCCATTAGGTCTTATCACAGGGGTTGGACCTCTGGGACAAAGCCTGTAGCTCCTTCACCCCCGAAAGGGAAGAAAAAGATGAAGAGGGGTAAAACTGATCGAGCTGCAGCCCAAAAGGGCAAGAAGACCAAGGAAGTTGCAGAGAAAGGAAAGTGTTTCCACTGCAATGGGGGCGGACACTGGAAGAGGAACTGCTCCAAATTCCTAGGCGAGAAAAAGAATCAAGGCCACATTAATCTCAATAGGATTGAGAAACTAGTGAATAGTGGACTTCTAAACGAGTTGGAAGAAAACTTTTCACCGGTGTGTGAGTCATGCCTTGAAGGCAAAATGACCAAATGTCCTTTTAGTGGAAAAGGATATAGAGCAGAGGAGCCCCTTGAGCTAATACACTCTGACCTCTGTGGTCCGATGAATATTAAAGCACGAGGTGGTTATGAATACTTCGTGTCTTTCATAGATTATTACTCGAGGTATGGGCATACTTACCTAATGCATAAGAAGTCTGAAACTCTTGAAAAGTTCAAGGAGTACAAGACTGAGGTTGAGAACCTGTTAGGTAAATCGCTTAAAACACATCGATCGGATCGAGGTGGAGAGTATATGGACACTGAATTCCAAGACTATATGATAGAACACGAAATTACGTCCCAACTCTCAGCACCTGGTATGCCACAACAGAATGGCATATCGGAGAGGAGAAACAGAACCTTGTTGGACATGGTTCGGTCGACGATGAGCTATGCTCGTCTCCCTGATTCCTTTTGGGGTTACGCAGTGGAGACTGCGGTTTACATTTTGAACAACAATTTAGGGATGCCCGACCCATGTGTTGGTGTCAAACCCGAAAAAGTTGGAACCCCGTTCGAAATTTTGCCTATTCGTTACCCAAAAGAGACCAGAGGTGGTCTGTGTTTTTATCCTAAGGATAATAGGGTGCTTGTGTCGACAAACGCCACTTTCCTTGAGGAAAATCATATCAGGGATCATTTACCAAGGAGTAAAATGGTGTTAAATGAAATCGATAGTACGTCAGCAAGAGTTGCTGATGGGGCTAGTACATCAACAAGTGTTGTTGATCCTATAACGTCTAGTCAAATTCGTTCCCAAGAGTTGGGAATGCCTCAACGTAGTGGGAGGGTTGTGAGACAGCCTGATCGTTACATGGGTTTAGCTGAAACCCCAGTTGTCACTCCTGATGATGACTGCGAGGATCCATTGACCTATGATCAGGCAATGGTAGATGTTGACAAAGACGAATAG

Protein sequence

MNTINYSLTTLLNELQTFQSLMRIRTSEAEANVAIRSYHRGWTSGTKPVAPSPPKGKKKMKRGKTDRAAAQKGKKTKEVAEKGKCFHCNGGGHWKRNCSKFLGEKKNQGHINLNRIEKLVNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKKSETLEKFKEYKTEVENLLGKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILNNNLGMPDPCVGVKPEKVGTPFEILPIRYPKETRGGLCFYPKDNRVLVSTNATFLEENHIRDHLPRSKMVLNEIDSTSARVADGASTSTSVVDPITSSQIRSQELGMPQRSGRVVRQPDRYMGLAETPVVTPDDDCEDPLTYDQAMVDVDKDE
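
The protein sequence is the conceptual translation of the CDS. The sketch below illustrates this on the first ten and last five codons of the CDS listed above, assuming the standard genetic code (NCBI translation table 1); `translate` is a hypothetical helper written for this example.

```python
from itertools import product

# Standard genetic code, codons enumerated with base order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGAACACGATAAACTACTCACTGACAACT"  # first 30 nt of the CDS
cds_end = "GACAAAGACGAATAG"                   # last 15 nt of the CDS

print(translate(cds_start))  # MNTINYSLTT
print(translate(cds_end))    # DKDE
```

Both fragments agree with the start (MNTINYSLTT) and end (DKDE) of the protein sequence shown above.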
Homology
BLAST of Tan0017081 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 164.9 bits (416), Expect = 2.2e-39
Identity = 84/219 (38.36%), Positives = 127/219 (57.99%), Query Frame = 0

Query: 106 KNQGHINLNRIEKLVNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHS 165
           K  GH++   ++ L    L++  +      C+ CL GK  +  F     R    L+L++S
Sbjct: 427 KRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQTSSERKLNILDLVYS 486

Query: 166 DLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKKSETLEKFKEYKTEVENLLGKSLKT 225
           D+CGPM I++ GG +YFV+FID  SR    Y++  K +  + F+++   VE   G+ LK 
Sbjct: 487 DVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVERETGRKLKR 546

Query: 226 HRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLP 285
            RSD GGEY   EF++Y   H I  + + PG PQ NG++ER NRT+++ VRS +  A+LP
Sbjct: 547 LRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVRSMLRMAKLP 606

Query: 286 DSFWGYAVETAVYILNNNLGMPDPCVGVKPEKVGTPFEI 325
            SFWG AV+TA Y++N +  +  P     PE+V T  E+
Sbjct: 607 KSFWGEAVQTACYLINRSPSV--PLAFEIPERVWTNKEV 643
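
The percentages in an HSP header are the match counts divided by the alignment length. A minimal sketch that recomputes them from the header of the P10978 hit above; `hsp_percentages` is a hypothetical helper, and the regular expression is an assumption about this page's header layout rather than a general BLAST parser.

```python
import re


def hsp_percentages(header):
    """Return (identity %, positives %) recomputed from an HSP header line."""
    # 'Posi?tives' tolerates a missing 'i' in the label.
    m = re.search(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)", header)
    if m is None:
        raise ValueError("not an HSP header line")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)


header = "Identity = 84/219 (38.36%), Positives = 127/219 (57.99%), Query Frame = 0"
print(hsp_percentages(header))  # (38.36, 57.99)
```

Here 84 of the 219 aligned columns are identical and 127 are scored as positive (conservative) substitutions or identities, which reproduces the reported 38.36% and 57.99%.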

BLAST of Tan0017081 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 122.9 bits (307), Expect = 9.5e-27
Identity = 66/193 (34.20%), Positives = 103/193 (53.37%), Query Frame = 0

Query: 111 INLNRIEKLVNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRA--EEPLELIHSDLC 170
           + + R     +  LLN LE +   +CE CL GK  + PF     +   + PL ++HSD+C
Sbjct: 431 LEIKRKNMFSDQSLLNNLELS-CEICEPCLNGKQARLPFKQLKDKTHIKRPLFVVHSDVC 490

Query: 171 GPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKKSETLEKFKEYKTEVENLLGKSLKTHRS 230
           GP+         YFV F+D ++ Y  TYL+  KS+    F+++  + E      +     
Sbjct: 491 GPITPVTLDDKNYFVIFVDQFTHYCVTYLIKYKSDVFSMFQDFVAKSEAHFNLKVVYLYI 550

Query: 231 DRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSF 290
           D G EY+  E + + ++  I+  L+ P  PQ NG+SER  RT+ +  R+ +S A+L  SF
Sbjct: 551 DNGREYLSNEMRQFCVKKGISYHLTVPHTPQLNGVSERMIRTITEKARTMVSGAKLDKSF 610

Query: 291 WGYAVETAVYILN 302
           WG AV TA Y++N
Sbjct: 611 WGEAVLTATYLIN 622

BLAST of Tan0017081 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 122.1 bits (305), Expect = 1.6e-26
Identity = 75/237 (31.65%), Positives = 127/237 (53.59%), Query Frame = 0

Query: 109 GHINLNRIEKLVNSGLLNELEENFSPV-CESCLEGKMTKCPFSGKGYRAEEPLELIHSDL 168
           GH +L  +  ++++  L  L  +   + C  C   K  K PFS     + +PLE I+SD+
Sbjct: 451 GHPSLAILNSVISNHSLPVLNPSHKLLSCSDCFINKSHKVPFSNSTITSSKPLEYIYSDV 510

Query: 169 CGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKKSETLEKFKEYKTEVENLLGKSLKTHR 228
                I +   Y Y+V F+D+++RY   Y + +KS+  + F  +K+ VEN     + T  
Sbjct: 511 WS-SPILSIDNYRYYVIFVDHFTRYTWLYPLKQKSQVKDTFIIFKSLVENRFQTRIGTLY 570

Query: 229 SDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDS 288
           SD GGE++    +DY+ +H I+   S P  P+ NG+SER++R +++M  + +S+A +P +
Sbjct: 571 SDNGGEFV--VLRDYLSQHGISHFTSPPHTPEHNGLSERKHRHIVEMGLTLLSHASVPKT 630

Query: 289 FWGYAVETAVYILNNNLGMPDPCVGVKPEKVGTPFEIL---PIRYPKETRGGLCFYP 342
           +W YA   AVY++N    +P P +     ++ +PF+ L   P  Y K    G   YP
Sbjct: 631 YWPYAFSVAVYLINR---LPTPLL-----QLQSPFQKLFGQPPNYEKLKVFGCACYP 676

BLAST of Tan0017081 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 115.5 bits (288), Expect = 1.5e-24
Identity = 68/213 (31.92%), Positives = 113/213 (53.05%), Query Frame = 0

Query: 113 LNRIEKLVNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMN 172
           LN +    +  +LN   +  S  C  CL  K  K PFS     +  PLE I+SD+     
Sbjct: 479 LNSVISNYSLSVLNPSHKFLS--CSDCLINKSNKVPFSQSTINSTRPLEYIYSDVWS-SP 538

Query: 173 IKARGGYEYFVSFIDYYSRYGHTYLMHKKSETLEKFKEYKTEVENLLGKSLKTHRSDRGG 232
           I +   Y Y+V F+D+++RY   Y + +KS+  E F  +K  +EN     + T  SD GG
Sbjct: 539 ILSHDNYRYYVIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENRFQTRIGTFYSDNGG 598

Query: 233 EYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYA 292
           E++     +Y  +H I+   S P  P+ NG+SER++R +++   + +S+A +P ++W YA
Sbjct: 599 EFV--ALWEYFSQHGISHLTSPPHTPEHNGLSERKHRHIVETGLTLLSHASIPKTYWPYA 658

Query: 293 VETAVYILNNNLGMPDPCVGVKPEKVGTPFEIL 326
              AVY++N    +P P +     ++ +PF+ L
Sbjct: 659 FAVAVYLINR---LPTPLL-----QLESPFQKL 678

BLAST of Tan0017081 vs. ExPASy Swiss-Prot
Match: Q12491 (Transposon Ty2-B Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY2B-B PE=3 SV=1)

HSP 1 Score: 87.4 bits (215), Expect = 4.4e-16
Identity = 59/208 (28.37%), Positives = 97/208 (46.63%), Query Frame = 0

Query: 109 GHINLNRIEKLVNSGLLNELEEN-------FSPVCESCLEGKMTKCPFSGKGYRAE---- 168
           GH N   I+K +    +  L+E+        +  C  CL GK TK     KG R +    
Sbjct: 599 GHANFRSIQKSLKKNAVTYLKESDIEWSNASTYQCPDCLIGKSTKHRHV-KGSRLKYQES 658

Query: 169 -EPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKKSE--TLEKFKEYKTE 228
            EP + +H+D+ GP++   +    YF+SF D  +R+   Y +H + E   L  F      
Sbjct: 659 YEPFQYLHTDIFGPVHHLPKSAPSYFISFTDEKTRFQWVYPLHDRREESILNVFTSILAF 718

Query: 229 VENLLGKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDM 288
           ++N     +   + DRG EY +     +     IT+  +     + +G++ER NRTLL+ 
Sbjct: 719 IKNQFNARVLVIQMDRGSEYTNKTLHKFFTNRGITACYTTTADSRAHGVAERLNRTLLND 778

Query: 289 VRSTMSYARLPDSFWGYAVETAVYILNN 303
            R+ +  + LP+  W  AVE +  I N+
Sbjct: 779 CRTLLHCSGLPNHLWFSAVEFSTIIRNS 805

BLAST of Tan0017081 vs. NCBI nr
Match: TYJ96910.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 503.8 bits (1296), Expect = 1.5e-138
Identity = 292/517 (56.48%), Positives = 346/517 (66.92%), Query Frame = 0

Query: 1   MNTINYSLTTLLNELQTFQSLMRIRTSEAEANVA--IRSYHRGWTSGTKPVAPSPPKGKK 60
           MN I+Y+LTTLLNELQTF+SLM+I+  + EANVA   R +HRG TSGTK +  S    K 
Sbjct: 88  MNKISYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGNKKW 147

Query: 61  KMKRG----KTDRAAAQKGKKTKEVAEKGKCFHCNGGGHWKRNCSKFLGEKKNQG----- 120
           K  +G    K + AAA+  KK K  A KG  FHCN  GHWKRNC K+L EKK        
Sbjct: 148 KKNKGGQQNKVNLAAAKTSKKAK--AAKGIRFHCNQEGHWKRNCPKYLAEKKKAKQVTQN 207

Query: 121 -------------------HINLNRIEKLVNSGLLNELEENFSPVCESCLEGKMTKCPFS 180
                              HINLNRIE+LV +GLL+ELEEN+ PVCESCLEGKMTK PF+
Sbjct: 208 KRLRISPKENAHLWHLRLVHINLNRIERLVQNGLLSELEENYLPVCESCLEGKMTKRPFT 267

Query: 181 GKGYRAEEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKKSETLEKFKE 240
           GKG+RA+EPLEL+HSDLCGPMN+KARGG+EYF++F D YSRYG+ YLM  KSE LEKFKE
Sbjct: 268 GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKE 327

Query: 241 YKTEVENLLGKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRT 300
           YK EVEN L K++KT RSDRGGEYMD +FQ+Y++E  I SQLSAPG PQQNG+SERRN+T
Sbjct: 328 YKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQLSAPGTPQQNGVSERRNQT 387

Query: 301 LLDMVRSTMSYARLPDSFWGYAVETAVYILNNNLGMPDPCVGVKPEKV------------ 360
           LLDMV S MSYA LP+SFWGYAV+TAVYILN    +P   V   P K+            
Sbjct: 388 LLDMVWSMMSYAHLPNSFWGYAVQTAVYILN---CVPSKSVSETPLKLWNGRKGSLHHFR 447

Query: 361 --GTPFEILPIR---------------YPKETRGGLCFYPKDNRVLVSTNATFLEENHIR 420
             G P  +L I                Y K +RGG  + PKDN+VLVSTNATFLEE+HIR
Sbjct: 448 IRGCPAHVLEINSKKLEPRSKLCLFVGYLKGSRGGYFYDPKDNKVLVSTNATFLEEDHIR 507

Query: 421 DHLPRSKMVLNEIDS----TSARVADGASTSTSVVDPITSSQI-RSQELGMPQRSGRVVR 454
           +H PRSK+VLNE+ +     S RV +  S  TSVV   +S++  + Q L  P+RSGRV  
Sbjct: 508 EHKPRSKIVLNELSNETIEPSTRVVEEPSALTSVVHVDSSTRTHQPQSLREPRRSGRVTN 567

BLAST of Tan0017081 vs. NCBI nr
Match: TYJ97618.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 484.6 bits (1246), Expect = 9.5e-133
Identity = 274/468 (58.55%), Positives = 324/468 (69.23%), Query Frame = 0

Query: 28  EAEANVA--IRSYHRGWTSGTKPVAPSPPKGKKKMKRG----KTDRAAAQKGKKTKEVAE 87
           + EANVA   R +HRG TSGTK +  S    K K K+G    K + AAA+  KK+K  A 
Sbjct: 157 KGEANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKSK--AT 216

Query: 88  KGKCFHCNGGGHWKRNCSKFLGEKK--NQGHINLNRIEKLVNSGLLNELEENFSPVCESC 147
           KG CFH N  GHWKRNC K+L EKK   QGHINLNRIE+LV +G+L+ELEEN  P+CESC
Sbjct: 217 KGICFHYNQEGHWKRNCPKYLAEKKKAKQGHINLNRIERLVKNGILSELEENSLPICESC 276

Query: 148 LEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMH 207
           LEGKMTK PF+GKG+RA+EPLEL+HSDLCGPMN+KARG +EYF++F D YSRYG+ YLM 
Sbjct: 277 LEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGEFEYFITFTDDYSRYGYVYLMQ 336

Query: 208 KKSETLEKFKEYKTEVENLLGKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQ 267
            KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E EI SQLSAPG PQ
Sbjct: 337 HKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECEILSQLSAPGTPQ 396

Query: 268 QNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILNNNLGMPDPCVGVKPEKV- 327
           QNG+SERRNRTLLDMVRS +SYA LP+SFWGYAV+TAVYILN    +P   V   P K+ 
Sbjct: 397 QNGVSERRNRTLLDMVRSMISYAHLPNSFWGYAVQTAVYILN---CVPSKSVSETPLKLW 456

Query: 328 -------------GTPFEILP---------------IRYPKETRGGLCFYPKDNRVLVST 387
                        G P  +L                + YPK TRGG  + PKDN+V VST
Sbjct: 457 NGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVFVST 516

Query: 388 NATFLEENHIRDHLPRSKMVLNEID----STSARVADGASTSTSVVDPITSSQI-RSQEL 447
           NATFLEE+HIR+H PRSK+VLNE+       S RV +  S  T VV   +S++  + Q L
Sbjct: 517 NATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSL 576

Query: 448 GMPQRSGRVVRQPDRYMGLAETPVVTPDDDCEDPLTYDQAMVDVDKDE 454
             P+RSGRV   P RYM L ET  V  D D EDPLT+ +AM DVDKDE
Sbjct: 577 REPRRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDE 619

BLAST of Tan0017081 vs. NCBI nr
Match: KAA0060254.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 476.9 bits (1226), Expect = 2.0e-130
Identity = 272/468 (58.12%), Positives = 323/468 (69.02%), Query Frame = 0

Query: 28  EAEANVA--IRSYHRGWTSGTKPVAPSPPKGKKKMKRG----KTDRAAAQKGKKTKEVAE 87
           + EANVA   R +HRG TSGTK +  S    K K K+G    K + AAA+  KK+K  A 
Sbjct: 157 KGEANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKSK--AT 216

Query: 88  KGKCFHCNGGGHWKRNCSKFLGEKK--NQGHINLNRIEKLVNSGLLNELEENFSPVCESC 147
           KG CFH N  GHWKRNC K+L EKK   QGHINLNRIE+LV +G+L+ELEEN  P+CESC
Sbjct: 217 KGICFHYNQEGHWKRNCPKYLAEKKKAKQGHINLNRIERLVKNGILSELEENSLPICESC 276

Query: 148 LEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMH 207
           LEGKMTK PF+GKG+RA+EPLEL+HSDLCGPMN+KARG +EYF++F D YSRYG+ YLM 
Sbjct: 277 LEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGEFEYFITFTDDYSRYGYVYLMQ 336

Query: 208 KKSETLEKFKEYKTEVENLLGKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQ 267
            KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E EI SQLSAPG PQ
Sbjct: 337 HKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECEILSQLSAPGTPQ 396

Query: 268 QNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILNNNLGMPDPCVGVKPEKV- 327
           QNG+SERRNRTLLDMVRS +SYA LP+SFWGYAV+TAVYILN    +P   V   P K+ 
Sbjct: 397 QNGVSERRNRTLLDMVRSMISYAHLPNSFWGYAVQTAVYILNY---VPSKSVYETPLKLW 456

Query: 328 -------------GTPFEILP---------------IRYPKETRGGLCFYPKDNRVLVST 387
                        G P  +L                + YPK TRGG  +  KDN+V V T
Sbjct: 457 NGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDLKDNKVFVLT 516

Query: 388 NATFLEENHIRDHLPRSKMVLN----EIDSTSARVADGASTSTSVVDPITSSQ-IRSQEL 447
           NATFLE++HIR+H PRSK+VLN    EI   S RV + +S  T VV   +S++  + Q L
Sbjct: 517 NATFLEKDHIREHKPRSKIVLNKLSKEITEPSTRVVEESSALTRVVHVGSSTRTYQPQTL 576

Query: 448 GMPQRSGRVVRQPDRYMGLAETPVVTPDDDCEDPLTYDQAMVDVDKDE 454
             P+RSGRV   P RYM L ET  V  D D EDPLT+ +AM DVDKDE
Sbjct: 577 REPRRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDE 619

BLAST of Tan0017081 vs. NCBI nr
Match: KAA0059226.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 470.3 bits (1209), Expect = 1.9e-128
Identity = 270/482 (56.02%), Positives = 310/482 (64.32%), Query Frame = 0

Query: 51  PSPPKGKKKMKR----GKTDRAAAQKGKKTKEVAEKGKCFHCNGGGHWKRNCSKFLGEKK 110
           PSP   +K  KR    GK    A +   K K VA K KCFHCN   HWK NC K+L +KK
Sbjct: 124 PSPSGSEKIQKRKEGKGKGPTIAVEDKGKAK-VAIKRKCFHCNVDEHWKTNCPKYLVKKK 183

Query: 111 NQ---------------------------------------------GHINLNRIEKLVN 170
            +                                             GHINL+RI +LV 
Sbjct: 184 EKEGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLGHINLDRIGRLVK 243

Query: 171 SGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEY 230
           +GLLN+L++   P CESCLEGKMTK PF+GKGYRA+EPLELIHSDLCGPMN+KARGG+EY
Sbjct: 244 NGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEY 303

Query: 231 FVSFIDYYSRYGHTYLMHKKSETLEKFKEYKTEVENLLGKSLKTHRSDRGGEYMDTEFQD 290
           F+SFID YSRYG+ YLM  KSE LEKFKEYKTEVENLL K +K  RSDRGGEYMD  FQD
Sbjct: 304 FISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQD 363

Query: 291 YMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILN 350
           YMIEH I SQLSAPG PQQNG+SERRNRTLLDMVRS MSYA+LP SFWGYAVETAV+ILN
Sbjct: 364 YMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILN 423

Query: 351 NNLGMPDPCVGVKPEKV--------------GTPFEILP---------------IRYPKE 410
           N   +P   V   P ++              G P  +L                + YPKE
Sbjct: 424 N---VPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKE 483

Query: 411 TRGGLCFYPKDNRVLVSTNATFLEENHIRDHLPRSKMVLNEIDSTSARVADGASTSTSVV 454
           TRGGL F P++NRV VSTNATFLEE+H+R+H PRSK+VL+E    S RV D    S+ V 
Sbjct: 484 TRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVD 543

BLAST of Tan0017081 vs. NCBI nr
Match: TYK02840.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 470.3 bits (1209), Expect = 1.9e-128
Identity = 270/482 (56.02%), Positives = 310/482 (64.32%), Query Frame = 0

Query: 51  PSPPKGKKKMKR----GKTDRAAAQKGKKTKEVAEKGKCFHCNGGGHWKRNCSKFLGEKK 110
           PSP   +K  KR    GK    A +   K K VA K KCFHCN   HWK NC K+L +KK
Sbjct: 124 PSPSGSEKIQKRKEGKGKGPTIAVEDKGKAK-VAIKRKCFHCNVDEHWKTNCPKYLVKKK 183

Query: 111 NQ---------------------------------------------GHINLNRIEKLVN 170
            +                                             GHINL+RI +LV 
Sbjct: 184 EKEGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLGHINLDRIGRLVK 243

Query: 171 SGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEY 230
           +GLLN+L++   P CESCLEGKMTK PF+GKGYRA+EPLELIHSDLCGPMN+KARGG+EY
Sbjct: 244 NGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEY 303

Query: 231 FVSFIDYYSRYGHTYLMHKKSETLEKFKEYKTEVENLLGKSLKTHRSDRGGEYMDTEFQD 290
           F+SFID YSRYG+ YLM  KSE LEKFKEYKTEVENLL K +K  RSDRGGEYMD  FQD
Sbjct: 304 FISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQD 363

Query: 291 YMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILN 350
           YMIEH I SQLSAPG PQQNG+SERRNRTLLDMVRS MSYA+LP SFWGYAVETAV+ILN
Sbjct: 364 YMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILN 423

Query: 351 NNLGMPDPCVGVKPEKV--------------GTPFEILP---------------IRYPKE 410
           N   +P   V   P ++              G P  +L                + YPKE
Sbjct: 424 N---VPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKE 483

Query: 411 TRGGLCFYPKDNRVLVSTNATFLEENHIRDHLPRSKMVLNEIDSTSARVADGASTSTSVV 454
           TRGGL F P++NRV VSTNATFLEE+H+R+H PRSK+VL+E    S RV D    S+ V 
Sbjct: 484 TRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVD 543

BLAST of Tan0017081 vs. ExPASy TrEMBL
Match: A0A5D3BAN6 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold220G00110 PE=4 SV=1)

HSP 1 Score: 503.8 bits (1296), Expect = 7.3e-139
Identity = 292/517 (56.48%), Positives = 346/517 (66.92%), Query Frame = 0

Query: 1   MNTINYSLTTLLNELQTFQSLMRIRTSEAEANVA--IRSYHRGWTSGTKPVAPSPPKGKK 60
           MN I+Y+LTTLLNELQTF+SLM+I+  + EANVA   R +HRG TSGTK +  S    K 
Sbjct: 88  MNKISYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGNKKW 147

Query: 61  KMKRG----KTDRAAAQKGKKTKEVAEKGKCFHCNGGGHWKRNCSKFLGEKKNQG----- 120
           K  +G    K + AAA+  KK K  A KG  FHCN  GHWKRNC K+L EKK        
Sbjct: 148 KKNKGGQQNKVNLAAAKTSKKAK--AAKGIRFHCNQEGHWKRNCPKYLAEKKKAKQVTQN 207

Query: 121 -------------------HINLNRIEKLVNSGLLNELEENFSPVCESCLEGKMTKCPFS 180
                              HINLNRIE+LV +GLL+ELEEN+ PVCESCLEGKMTK PF+
Sbjct: 208 KRLRISPKENAHLWHLRLVHINLNRIERLVQNGLLSELEENYLPVCESCLEGKMTKRPFT 267

Query: 181 GKGYRAEEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKKSETLEKFKE 240
           GKG+RA+EPLEL+HSDLCGPMN+KARGG+EYF++F D YSRYG+ YLM  KSE LEKFKE
Sbjct: 268 GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKE 327

Query: 241 YKTEVENLLGKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRT 300
           YK EVEN L K++KT RSDRGGEYMD +FQ+Y++E  I SQLSAPG PQQNG+SERRN+T
Sbjct: 328 YKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQLSAPGTPQQNGVSERRNQT 387

Query: 301 LLDMVRSTMSYARLPDSFWGYAVETAVYILNNNLGMPDPCVGVKPEKV------------ 360
           LLDMV S MSYA LP+SFWGYAV+TAVYILN    +P   V   P K+            
Sbjct: 388 LLDMVWSMMSYAHLPNSFWGYAVQTAVYILN---CVPSKSVSETPLKLWNGRKGSLHHFR 447

Query: 361 --GTPFEILPIR---------------YPKETRGGLCFYPKDNRVLVSTNATFLEENHIR 420
             G P  +L I                Y K +RGG  + PKDN+VLVSTNATFLEE+HIR
Sbjct: 448 IRGCPAHVLEINSKKLEPRSKLCLFVGYLKGSRGGYFYDPKDNKVLVSTNATFLEEDHIR 507

Query: 421 DHLPRSKMVLNEIDS----TSARVADGASTSTSVVDPITSSQI-RSQELGMPQRSGRVVR 454
           +H PRSK+VLNE+ +     S RV +  S  TSVV   +S++  + Q L  P+RSGRV  
Sbjct: 508 EHKPRSKIVLNELSNETIEPSTRVVEEPSALTSVVHVDSSTRTHQPQSLREPRRSGRVTN 567

BLAST of Tan0017081 vs. ExPASy TrEMBL
Match: A0A5D3BHG7 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold639G00150 PE=4 SV=1)

HSP 1 Score: 484.6 bits (1246), Expect = 4.6e-133
Identity = 274/468 (58.55%), Positives = 324/468 (69.23%), Query Frame = 0

Query: 28  EAEANVA--IRSYHRGWTSGTKPVAPSPPKGKKKMKRG----KTDRAAAQKGKKTKEVAE 87
           + EANVA   R +HRG TSGTK +  S    K K K+G    K + AAA+  KK+K  A 
Sbjct: 157 KGEANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKSK--AT 216

Query: 88  KGKCFHCNGGGHWKRNCSKFLGEKK--NQGHINLNRIEKLVNSGLLNELEENFSPVCESC 147
           KG CFH N  GHWKRNC K+L EKK   QGHINLNRIE+LV +G+L+ELEEN  P+CESC
Sbjct: 217 KGICFHYNQEGHWKRNCPKYLAEKKKAKQGHINLNRIERLVKNGILSELEENSLPICESC 276

Query: 148 LEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMH 207
           LEGKMTK PF+GKG+RA+EPLEL+HSDLCGPMN+KARG +EYF++F D YSRYG+ YLM 
Sbjct: 277 LEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGEFEYFITFTDDYSRYGYVYLMQ 336

Query: 208 KKSETLEKFKEYKTEVENLLGKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQ 267
            KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E EI SQLSAPG PQ
Sbjct: 337 HKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECEILSQLSAPGTPQ 396

Query: 268 QNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILNNNLGMPDPCVGVKPEKV- 327
           QNG+SERRNRTLLDMVRS +SYA LP+SFWGYAV+TAVYILN    +P   V   P K+ 
Sbjct: 397 QNGVSERRNRTLLDMVRSMISYAHLPNSFWGYAVQTAVYILN---CVPSKSVSETPLKLW 456

Query: 328 -------------GTPFEILP---------------IRYPKETRGGLCFYPKDNRVLVST 387
                        G P  +L                + YPK TRGG  + PKDN+V VST
Sbjct: 457 NGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVFVST 516

Query: 388 NATFLEENHIRDHLPRSKMVLNEID----STSARVADGASTSTSVVDPITSSQI-RSQEL 447
           NATFLEE+HIR+H PRSK+VLNE+       S RV +  S  T VV   +S++  + Q L
Sbjct: 517 NATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSL 576

Query: 448 GMPQRSGRVVRQPDRYMGLAETPVVTPDDDCEDPLTYDQAMVDVDKDE 454
             P+RSGRV   P RYM L ET  V  D D EDPLT+ +AM DVDKDE
Sbjct: 577 REPRRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDE 619

BLAST of Tan0017081 vs. ExPASy TrEMBL
Match: A0A5A7UYX7 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold22G00180 PE=4 SV=1)

HSP 1 Score: 476.9 bits (1226), Expect = 9.6e-131
Identity = 272/468 (58.12%), Positives = 323/468 (69.02%), Query Frame = 0

Query: 28  EAEANVA--IRSYHRGWTSGTKPVAPSPPKGKKKMKRG----KTDRAAAQKGKKTKEVAE 87
           + EANVA   R +HRG TSGTK +  S    K K K+G    K + AAA+  KK+K  A 
Sbjct: 157 KGEANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKSK--AT 216

Query: 88  KGKCFHCNGGGHWKRNCSKFLGEKK--NQGHINLNRIEKLVNSGLLNELEENFSPVCESC 147
           KG CFH N  GHWKRNC K+L EKK   QGHINLNRIE+LV +G+L+ELEEN  P+CESC
Sbjct: 217 KGICFHYNQEGHWKRNCPKYLAEKKKAKQGHINLNRIERLVKNGILSELEENSLPICESC 276

Query: 148 LEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMH 207
           LEGKMTK PF+GKG+RA+EPLEL+HSDLCGPMN+KARG +EYF++F D YSRYG+ YLM 
Sbjct: 277 LEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGEFEYFITFTDDYSRYGYVYLMQ 336

Query: 208 KKSETLEKFKEYKTEVENLLGKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQ 267
            KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E EI SQLSAPG PQ
Sbjct: 337 HKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECEILSQLSAPGTPQ 396

Query: 268 QNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILNNNLGMPDPCVGVKPEKV- 327
           QNG+SERRNRTLLDMVRS +SYA LP+SFWGYAV+TAVYILN    +P   V   P K+ 
Sbjct: 397 QNGVSERRNRTLLDMVRSMISYAHLPNSFWGYAVQTAVYILNY---VPSKSVYETPLKLW 456

Query: 328 -------------GTPFEILP---------------IRYPKETRGGLCFYPKDNRVLVST 387
                        G P  +L                + YPK TRGG  +  KDN+V V T
Sbjct: 457 NGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDLKDNKVFVLT 516

Query: 388 NATFLEENHIRDHLPRSKMVLN----EIDSTSARVADGASTSTSVVDPITSSQ-IRSQEL 447
           NATFLE++HIR+H PRSK+VLN    EI   S RV + +S  T VV   +S++  + Q L
Sbjct: 517 NATFLEKDHIREHKPRSKIVLNKLSKEITEPSTRVVEESSALTRVVHVGSSTRTYQPQTL 576

Query: 448 GMPQRSGRVVRQPDRYMGLAETPVVTPDDDCEDPLTYDQAMVDVDKDE 454
             P+RSGRV   P RYM L ET  V  D D EDPLT+ +AM DVDKDE
Sbjct: 577 REPRRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDE 619

BLAST of Tan0017081 vs. ExPASy TrEMBL
Match: A0A5D3BUN8 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold218G00360 PE=4 SV=1)

HSP 1 Score: 470.3 bits (1209), Expect = 9.0e-129
Identity = 270/482 (56.02%), Positives = 310/482 (64.32%), Query Frame = 0

Query: 51  PSPPKGKKKMKR----GKTDRAAAQKGKKTKEVAEKGKCFHCNGGGHWKRNCSKFLGEKK 110
           PSP   +K  KR    GK    A +   K K VA K KCFHCN   HWK NC K+L +KK
Sbjct: 124 PSPSGSEKIQKRKEGKGKGPTIAVEDKGKAK-VAIKRKCFHCNVDEHWKTNCPKYLVKKK 183

Query: 111 NQ---------------------------------------------GHINLNRIEKLVN 170
            +                                             GHINL+RI +LV 
Sbjct: 184 EKEGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLGHINLDRIGRLVK 243

Query: 171 SGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEY 230
           +GLLN+L++   P CESCLEGKMTK PF+GKGYRA+EPLELIHSDLCGPMN+KARGG+EY
Sbjct: 244 NGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEY 303

Query: 231 FVSFIDYYSRYGHTYLMHKKSETLEKFKEYKTEVENLLGKSLKTHRSDRGGEYMDTEFQD 290
           F+SFID YSRYG+ YLM  KSE LEKFKEYKTEVENLL K +K  RSDRGGEYMD  FQD
Sbjct: 304 FISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQD 363

Query: 291 YMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILN 350
           YMIEH I SQLSAPG PQQNG+SERRNRTLLDMVRS MSYA+LP SFWGYAVETAV+ILN
Sbjct: 364 YMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILN 423

Query: 351 NNLGMPDPCVGVKPEKV--------------GTPFEILP---------------IRYPKE 410
           N   +P   V   P ++              G P  +L                + YPKE
Sbjct: 424 N---VPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKE 483

Query: 411 TRGGLCFYPKDNRVLVSTNATFLEENHIRDHLPRSKMVLNEIDSTSARVADGASTSTSVV 454
           TRGGL F P++NRV VSTNATFLEE+H+R+H PRSK+VL+E    S RV D    S+ V 
Sbjct: 484 TRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVD 543

BLAST of Tan0017081 vs. ExPASy TrEMBL
Match: A0A5A7UYE8 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold430G001570 PE=4 SV=1)

HSP 1 Score: 470.3 bits (1209), Expect = 9.0e-129
Identity = 270/482 (56.02%), Positives = 310/482 (64.32%), Query Frame = 0

Query: 51  PSPPKGKKKMKR----GKTDRAAAQKGKKTKEVAEKGKCFHCNGGGHWKRNCSKFLGEKK 110
           PSP   +K  KR    GK    A +   K K VA K KCFHCN   HWK NC K+L +KK
Sbjct: 124 PSPSGSEKIQKRKEGKGKGPTIAVEDKGKAK-VAIKRKCFHCNVDEHWKTNCPKYLVKKK 183

Query: 111 NQ---------------------------------------------GHINLNRIEKLVN 170
            +                                             GHINL+RI +LV 
Sbjct: 184 EKEGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLGHINLDRIGRLVK 243

Query: 171 SGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEY 230
           +GLLN+L++   P CESCLEGKMTK PF+GKGYRA+EPLELIHSDLCGPMN+KARGG+EY
Sbjct: 244 NGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEY 303

Query: 231 FVSFIDYYSRYGHTYLMHKKSETLEKFKEYKTEVENLLGKSLKTHRSDRGGEYMDTEFQD 290
           F+SFID YSRYG+ YLM  KSE LEKFKEYKTEVENLL K +K  RSDRGGEYMD  FQD
Sbjct: 304 FISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQD 363

Query: 291 YMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILN 350
           YMIEH I SQLSAPG PQQNG+SERRNRTLLDMVRS MSYA+LP SFWGYAVETAV+ILN
Sbjct: 364 YMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILN 423

Query: 351 NNLGMPDPCVGVKPEKV--------------GTPFEILP---------------IRYPKE 410
           N   +P   V   P ++              G P  +L                + YPKE
Sbjct: 424 N---VPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKE 483

Query: 411 TRGGLCFYPKDNRVLVSTNATFLEENHIRDHLPRSKMVLNEIDSTSARVADGASTSTSVV 454
           TRGGL F P++NRV VSTNATFLEE+H+R+H PRSK+VL+E    S RV D    S+ V 
Sbjct: 484 TRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVD 543

The following BLAST results are available for this feature:

ExPASy Swiss-Prot
Match Name  E-value   Identity (%)  Description
P10978      2.2e-39   38.36         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
P04146      9.5e-27   34.20         Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
Q9ZT94      1.6e-26   31.65         Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
Q94HW2      1.5e-24   31.92         Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q12491      4.4e-16   28.37         Transposon Ty2-B Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...

NCBI nr
Match Name  E-value   Identity (%)  Description
TYJ96910.1  1.5e-138  56.48         gag/pol protein [Cucumis melo var. makuwa]
TYJ97618.1  9.5e-133  58.55         gag/pol protein [Cucumis melo var. makuwa]
KAA0060254.1  2.0e-130  58.12       gag/pol protein [Cucumis melo var. makuwa]
KAA0059226.1  1.9e-128  56.02       gag/pol protein [Cucumis melo var. makuwa]
TYK02840.1  1.9e-128  56.02         gag/pol protein [Cucumis melo var. makuwa]

ExPASy TrEMBL
Match Name  E-value   Identity (%)  Description
A0A5D3BAN6  7.3e-139  56.48         Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold220G0011...
A0A5D3BHG7  4.6e-133  58.55         Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold639G0015...
A0A5A7UYX7  9.6e-131  58.12         Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold22G00180...
A0A5D3BUN8  9.0e-129  56.02         Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold218G0036...
A0A5A7UYE8  9.0e-129  56.02         Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold430G0015...
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                     Source       Source Term  Source Description                   Alignment
IPR001584  Integrase, catalytic core           PFAM         PF00665      rve                                  coord: 157..258; e-value: 5.7E-10; score: 39.4
IPR001584  Integrase, catalytic core           PROSITE      PS50994      INTEGRASE                            coord: 155..323; score: 21.58275
IPR036397  Ribonuclease H superfamily          GENE3D       3.30.420.10  -                                    coord: 152..304; e-value: 5.1E-35; score: 122.5
IPR025724  GAG-pre-integrase domain            PFAM         PF13976      gag_pre-integrs                      coord: 109..144; e-value: 5.4E-7; score: 29.4
None       No IPR available                    MOBIDB_LITE  mobidb-lite  disorder_prediction                  coord: 42..74
None       No IPR available                    MOBIDB_LITE  mobidb-lite  disorder_prediction                  coord: 432..453
None       No IPR available                    PANTHER      PTHR45895    FAMILY NOT NAMED                     coord: 134..301
IPR001878  Zinc finger, CCHC-type              PROSITE      PS50158      ZF_CCHC                              coord: 84..100; score: 8.746164
IPR012337  Ribonuclease H-like superfamily     SUPERFAMILY  53098        Ribonuclease H-like                  coord: 154..302
IPR036875  Zinc finger, CCHC-type superfamily  SUPERFAMILY  57756        Retrovirus zinc finger-like domains  coord: 73..106

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0017081.1  Tan0017081.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0008270      zinc ion binding