Tan0002455 (gene) Snake gourd v1

Overview
NameTan0002455
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
LocationLG11: 22221185 .. 22223755 (+)
RNA-Seq ExpressionTan0002455
SyntenyTan0002455
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCGTCCTTGATAATAGCCTTACTCAAAAGTGAATGTTTAACTGGCGAGAATTATACTACGTGGAAGTCCAACCTGAATATGATTCTGGTTGTTGACGAACTTCGATTTGTACTAACTGAGAAATGTCCTCAGGTCCCTGCTCGAAGCGCTTCTCAATCTGTTAAGGATGCGTACGACCGTTGGATCAAGGCCAATGACAAGGCCAAGGTTTACATTTTGGCTAGTCTTTCTGAAGTTCTGGCCAAAAAGCACAAGGGCATGGTCTCAGCTCGTGAGATCATGAGTCCGTTGCAGAATATATTTGGACAACTGTCTGGACAGCTGCGACACGAATCCCTCAAGTACGTTTATAACTCCCGTATGAAGGAGGGATCGTCGGTGAAAGAACATGTTCTCGATCTGATGGTCCACTTCAACGTGGAAGATATGAACGGCGCGGTCATCGACAAGCAAAGTCAGGTGTCCTTCATCCTGGAATCTCTTTAGAAGAGTTTCCTGCAATTCTTCAGCAATGCGGTGATGAATAACATAGAGTACAACCTGACTACTCTCCTCAATGAACTACAGACTTTCCAGTCTCTTATGAAGAATAAGTGACAGGCTGATGGAGAGGCAAATTTGTTTGTCCATTCGACAAGGTTCCGAAGGTTCATCCTGCAGGACTAAGTCTCATGGTCCATCTTACGGCTTAAGAAGACCCAAAAGAAGAAGATAGGAGGGAAAGGGAAGGCACCTGCCGCTGCAGACAAAGGCAAGGGAAAACCCAAAGTTGCAGACAAAGGAAAATGTTTCCACTGCAATGTGGACGGGCACTGGAAGCGAAACGACCCAAAATACCTTGTTGAGCTCAAAGAGAAGAAAGGTAAATTCGATTTACTTGTTCTTCAAACTTCTTTAGTGAAAAGTGATGACTTTACCTGGATACTTGATTCAGGGACCACTTACCACGTTTGCTCTTATTTTCAGGAAACTAGTTCCCTCAAGGAGCTCGAATAGGGTGTGATGACGCTCAGGGTCGGAACGGGAGACGTCTCAGCTCGTGCAGTGGGAGATGCCAAGCTGTTTTTTAGAGATAGATTTTTATTTTTAGAAAATCTGTACATAGTTCCTAAGATTAAGAGGAACTTGATATCTATCTCTTGTCTAATTCAACATGGTTATTGTATTACCTTTTCCATTAATGAAGTGTTCATTTCAAAGAAAGGTGTCAATATTTGTTCTGCTAAGTTAGAAAGCAGCTTATATGTACATAAACCAAATCAAACTAAAGCAATTTTAAATCATGAGATGTTTAACACTGCAAATACTCAAAGTAAAAGGCAAAGAATTTCTCCAAATAATAATACCTATCTTTGGCATGTTAGTCTTGGTCATATAAATCTCAACCGGATTGAGAGATTATCTAAGAATGGACTTCTAAATAAGTTAGAAGATGATTCTTTACCTCCTTGTGAATCATGCTTGGAAGGTAAAATGACTAAGCAACCTTTTACTGGAAAAGGTTATAGAGCCAAAAAACCCTTAGAACTTATACATTCGGATCTCTGTGGTCCGGTGAATGTTAAAGCTCGAGTAGAGTACGAATATTTCATCTCTTTCATAGATGTTTATTCGAGGTATGGTTATCTATACCTAATGCATCATATGTCTGAAGCTCTTAAAAAGTTCAAAGAGTATAAGACTGAAGTAGAGAATGCATTAGGAAAAACCATAAAGACACTTCGATCCGATCGAGGTGGAGAGTATAAGGATCTAAGATTCCAGGACTATTTGATAGAACATGGAATCCAATCTCAACTCGCACAACCTAATACACCTCAGCAGAATGGTGTATCTTAAAGGAAAAATAGAACCTTGTTAGACATGGTTCGATCTATGATGAGCTATGCTCAATTGCTTGCCTCATTTTCGGGTTACGCAGTAGAGACTGCGGTTCAAATCTTGAACAATGTTCCATCAAAAAGTGTTTCAGAAACACCTTTTGAGTTATGGAAGGGGCGTAAACCTAGTTTACAACACTTTAGAATTTGGGGTTTTCCAGCACACATGCTATGGACAAACCCAAAGAAATTAGAACCTCGTTCAAGATTATGCCAATTTGTTGGCTATCCCAAAGAAACGAGAGGTTGTCTTTTCTATGACCCACAAGAAAACAAGGTGCTTGTATTGACAAACACCACTTTCTTGGAGGAAGATAACATGAGAAACCTTAAACCGCGTAGTAAGTTACTATTAAATGAAGCTACAGATGAGTCAACAAGAGTTGTTGATTAAGCTGGACCTTCATCAAGAGTTGATGAAGAAGTTGGCACATCGAGTCAGTCTCGTCCTTCTCAATTGTGGGGAATGCCTCGACGCAGTGGGAGGGTTGTTTCCCAACCTGACCGCTACTTGGGTTTAACTGAAACTCAAGTTGTCATACTTTTTGACGGTGTAGAGGATCCATTGTCTTATAAACAGGCAATGAATGACGTAGATAAACACCAATGGATCAAAGTCATGGACCTTGAAATGGAGTCAATGTACTTCAATTCAGTTTGGGAACTTGTAGACCAGCCTGAAGGGTAA

mRNA sequence

ATGTCGTCCTTGATAATAGCCTTACTCAAAAGTGAATGTTTAACTGGCGAGAATTATACTACGTGGAAGTCCAACCTGAATATGATTCTGGTTGTTGACGAACTTCGATTTGTACTAACTGAGAAATGTCCTCAGGTCCCTGCTCGAAGCGCTTCTCAATCTGTTAAGGATGCGTACGACCGTTGGATCAAGGCCAATGACAAGGCCAAGGTTTACATTTTGGCTAGTCTTTCTGAAGTTCTGGCCAAAAAGCACAAGGGCATGGTCTCAGCTCGTGAGATCATGAGTCCGTTGCAGAATATATTTGGACAACTGTCTGGACAGCTGCGACACGAATCCCTCAAGTACGTTTATAACTCCCGTATGAAGGAGGGATCGTCGGTGAAAGAACATGTTCTCGATCTGATGGTCCACTTCAACGTGGAAGATATGAACGGCGCGGTCATCGACAAGCAAAGTCAGAAGACCCAAAAGAAGAAGATAGGAGGGAAAGGGAAGGCACCTGCCGCTGCAGACAAAGGCAAGGGAAAACCCAAAGTTGCAGACAAAGGAAAATGTTTCCACTGCAATGTGGACGGGCACTGGAAGCGAAACGACCCAAAATACCTTGTTGAGCTCAAAGAGAAGAAAGGTAAAATGACTAAGCAACCTTTTACTGGAAAAGGTTATAGAGCCAAAAAACCCTTAGAACTTATACATTCGGATCTCTGTGGTCCGGTGAATGTTAAAGCTCGAGTAGAGTACGAATATTTCATCTCTTTCATAGATGTTTATTCGAGGTATGGTTATCTATACCTAATGCATCATATGTCTGAAGCTCTTAAAAAGTTCAAAGAGTATAAGACTGAAGTAGAGAATGCATTAGGAAAAACCATAAAGACACTTCGATCCGATCGAGGTGGAGAGTATAAGGATCTAAGATTCCAGGACTATTTGATAGAACATGGAATCCAATCTCAACTCGCACAACCTAATACACCTCAGCAGAATGCACACATGCTATGGACAAACCCAAAGAAATTAGAACCTCGTTCAAGATTATGCCAATTTGTTGGCTATCCCAAAGAAACGAGAGGTTGTCTTTTCTATGACCCACAAGAAAACAAGGTGCTTGTATTGACAAACACCACTTTCTTGGAGGAAGATAACATGAGAAACCTTAAACCGCGTACTGGACCTTCATCAAGAGTTGATGAAGAAGTTGGCACATCGAGTCAGTCTCGTCCTTCTCAATTGTGGGGAATGCCTCGACGCAGTGGGAGGGTTGTTTCCCAACCTGACCGCTACTTGGGTTTAACTGAAACTCAAGTTGTCATACTTTTTGACGGTGTAGAGGATCCATTGTCTTATAAACAGGCAATGAATGACGTAGATAAACACCAATGGATCAAAGTCATGGACCTTGAAATGGAGTCAATGTACTTCAATTCAGTTTGGGAACTTGTAGACCAGCCTGAAGGGTAA

Coding sequence (CDS)

ATGTCGTCCTTGATAATAGCCTTACTCAAAAGTGAATGTTTAACTGGCGAGAATTATACTACGTGGAAGTCCAACCTGAATATGATTCTGGTTGTTGACGAACTTCGATTTGTACTAACTGAGAAATGTCCTCAGGTCCCTGCTCGAAGCGCTTCTCAATCTGTTAAGGATGCGTACGACCGTTGGATCAAGGCCAATGACAAGGCCAAGGTTTACATTTTGGCTAGTCTTTCTGAAGTTCTGGCCAAAAAGCACAAGGGCATGGTCTCAGCTCGTGAGATCATGAGTCCGTTGCAGAATATATTTGGACAACTGTCTGGACAGCTGCGACACGAATCCCTCAAGTACGTTTATAACTCCCGTATGAAGGAGGGATCGTCGGTGAAAGAACATGTTCTCGATCTGATGGTCCACTTCAACGTGGAAGATATGAACGGCGCGGTCATCGACAAGCAAAGTCAGAAGACCCAAAAGAAGAAGATAGGAGGGAAAGGGAAGGCACCTGCCGCTGCAGACAAAGGCAAGGGAAAACCCAAAGTTGCAGACAAAGGAAAATGTTTCCACTGCAATGTGGACGGGCACTGGAAGCGAAACGACCCAAAATACCTTGTTGAGCTCAAAGAGAAGAAAGGTAAAATGACTAAGCAACCTTTTACTGGAAAAGGTTATAGAGCCAAAAAACCCTTAGAACTTATACATTCGGATCTCTGTGGTCCGGTGAATGTTAAAGCTCGAGTAGAGTACGAATATTTCATCTCTTTCATAGATGTTTATTCGAGGTATGGTTATCTATACCTAATGCATCATATGTCTGAAGCTCTTAAAAAGTTCAAAGAGTATAAGACTGAAGTAGAGAATGCATTAGGAAAAACCATAAAGACACTTCGATCCGATCGAGGTGGAGAGTATAAGGATCTAAGATTCCAGGACTATTTGATAGAACATGGAATCCAATCTCAACTCGCACAACCTAATACACCTCAGCAGAATGCACACATGCTATGGACAAACCCAAAGAAATTAGAACCTCGTTCAAGATTATGCCAATTTGTTGGCTATCCCAAAGAAACGAGAGGTTGTCTTTTCTATGACCCACAAGAAAACAAGGTGCTTGTATTGACAAACACCACTTTCTTGGAGGAAGATAACATGAGAAACCTTAAACCGCGTACTGGACCTTCATCAAGAGTTGATGAAGAAGTTGGCACATCGAGTCAGTCTCGTCCTTCTCAATTGTGGGGAATGCCTCGACGCAGTGGGAGGGTTGTTTCCCAACCTGACCGCTACTTGGGTTTAACTGAAACTCAAGTTGTCATACTTTTTGACGGTGTAGAGGATCCATTGTCTTATAAACAGGCAATGAATGACGTAGATAAACACCAATGGATCAAAGTCATGGACCTTGAAATGGAGTCAATGTACTTCAATTCAGTTTGGGAACTTGTAGACCAGCCTGAAGGGTAA

Protein sequence

MSSLIIALLKSECLTGENYTTWKSNLNMILVVDELRFVLTEKCPQVPARSASQSVKDAYDRWIKANDKAKVYILASLSEVLAKKHKGMVSAREIMSPLQNIFGQLSGQLRHESLKYVYNSRMKEGSSVKEHVLDLMVHFNVEDMNGAVIDKQSQKTQKKKIGGKGKAPAAADKGKGKPKVADKGKCFHCNVDGHWKRNDPKYLVELKEKKGKMTKQPFTGKGYRAKKPLELIHSDLCGPVNVKARVEYEYFISFIDVYSRYGYLYLMHHMSEALKKFKEYKTEVENALGKTIKTLRSDRGGEYKDLRFQDYLIEHGIQSQLAQPNTPQQNAHMLWTNPKKLEPRSRLCQFVGYPKETRGCLFYDPQENKVLVLTNTTFLEEDNMRNLKPRTGPSSRVDEEVGTSSQSRPSQLWGMPRRSGRVVSQPDRYLGLTETQVVILFDGVEDPLSYKQAMNDVDKHQWIKVMDLEMESMYFNSVWELVDQPEG
Homology
BLAST of Tan0002455 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 89.7 bits (221), Expect = 9.6e-17
Identity = 47/149 (31.54%), Postives = 77/149 (51.68%), Query Frame = 0

Query: 211 GKMTKQPFTGKGYRAKKPLELIHSDLCGPVNVKARVEYEYFISFIDVYSRYGYLYLMHHM 270
           GK  +  F     R    L+L++SD+CGP+ +++    +YF++FID  SR  ++Y++   
Sbjct: 463 GKQHRVSFQTSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTK 522

Query: 271 SEALKKFKEYKTEVENALGKTIKTLRSDRGGEYKDLRFQDYLIEHGIQSQLAQPNTPQQN 330
            +  + F+++   VE   G+ +K LRSD GGEY    F++Y   HGI+ +   P TPQ N
Sbjct: 523 DQVFQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHN 582

Query: 331 AHMLWTNPKKLEPRSRLCQFVGYPKETRG 360
                 N   +E    + +    PK   G
Sbjct: 583 GVAERMNRTIVEKVRSMLRMAKLPKSFWG 611

BLAST of Tan0002455 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 75.9 bits (185), Expect = 1.4e-12
Identity = 45/119 (37.82%), Postives = 66/119 (55.46%), Query Frame = 0

Query: 212 KMTKQPFTGKGYRAKKPLELIHSDLCGPVNVKARVEYEYFISFIDVYSRYGYLYLMHHMS 271
           K  K PF+     + KPLE I+SD+     + +   Y Y++ F+D ++RY +LY +   S
Sbjct: 486 KSHKVPFSNSTITSSKPLEYIYSDVWSS-PILSIDNYRYYVIFVDHFTRYTWLYPLKQKS 545

Query: 272 EALKKFKEYKTEVENALGKTIKTLRSDRGGEYKDLRFQDYLIEHGIQSQLAQPNTPQQN 331
           +    F  +K+ VEN     I TL SD GGE+  LR  DYL +HGI    + P+TP+ N
Sbjct: 546 QVKDTFIIFKSLVENRFQTRIGTLYSDNGGEFVVLR--DYLSQHGISHFTSPPHTPEHN 601

BLAST of Tan0002455 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 69.7 bits (169), Expect = 1.0e-10
Identity = 38/119 (31.93%), Postives = 63/119 (52.94%), Query Frame = 0

Query: 212 KMTKQPFTGKGYRAKKPLELIHSDLCGPVNVKARVEYEYFISFIDVYSRYGYLYLMHHMS 271
           K  K PF+     + +PLE I+SD+     + +   Y Y++ F+D ++RY +LY +   S
Sbjct: 507 KSNKVPFSQSTINSTRPLEYIYSDVWSS-PILSHDNYRYYVIFVDHFTRYTWLYPLKQKS 566

Query: 272 EALKKFKEYKTEVENALGKTIKTLRSDRGGEYKDLRFQDYLIEHGIQSQLAQPNTPQQN 331
           +  + F  +K  +EN     I T  SD GGE+  +   +Y  +HGI    + P+TP+ N
Sbjct: 567 QVKETFITFKNLLENRFQTRIGTFYSDNGGEF--VALWEYFSQHGISHLTSPPHTPEHN 622

BLAST of Tan0002455 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 69.3 bits (168), Expect = 1.3e-10
Identity = 36/122 (29.51%), Postives = 61/122 (50.00%), Query Frame = 0

Query: 211 GKMTKQPFTGKGYRA--KKPLELIHSDLCGPVNVKARVEYEYFISFIDVYSRYGYLYLMH 270
           GK  + PF     +   K+PL ++HSD+CGP+      +  YF+ F+D ++ Y   YL+ 
Sbjct: 461 GKQARLPFKQLKDKTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVTYLIK 520

Query: 271 HMSEALKKFKEYKTEVENALGKTIKTLRSDRGGEYKDLRFQDYLIEHGIQSQLAQPNTPQ 330
           + S+    F+++  + E      +  L  D G EY     + + ++ GI   L  P+TPQ
Sbjct: 521 YKSDVFSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVPHTPQ 580

BLAST of Tan0002455 vs. ExPASy Swiss-Prot
Match: Q12491 (Transposon Ty2-B Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY2B-B PE=3 SV=1)

HSP 1 Score: 52.4 bits (124), Expect = 1.7e-05
Identity = 37/129 (28.68%), Postives = 57/129 (44.19%), Query Frame = 0

Query: 211 GKMTKQPFTGKGYRAK-----KPLELIHSDLCGPVNVKARVEYEYFISFIDVYSRYGYLY 270
           GK TK     KG R K     +P + +H+D+ GPV+   +    YFISF D  +R+ ++Y
Sbjct: 639 GKSTKHRHV-KGSRLKYQESYEPFQYLHTDIFGPVHHLPKSAPSYFISFTDEKTRFQWVY 698

Query: 271 LMHHMSE--ALKKFKEYKTEVENALGKTIKTLRSDRGGEYKDLRFQDYLIEHGIQSQLAQ 330
            +H   E   L  F      ++N     +  ++ DRG EY +     +    GI +    
Sbjct: 699 PLHDRREESILNVFTSILAFIKNQFNARVLVIQMDRGSEYTNKTLHKFFTNRGITA--CY 758

Query: 331 PNTPQQNAH 333
             T    AH
Sbjct: 759 TTTADSRAH 764

BLAST of Tan0002455 vs. NCBI nr
Match: KAA0059226.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 580.5 bits (1495), Expect = 1.4e-161
Identity = 345/653 (52.83%), Postives = 388/653 (59.42%), Query Frame = 0

Query: 1   MSSLIIALLKSECLTGENYTTWKSNLNMILVVDELRFVLTEKCPQVPARSASQSVKDAYD 60
           MSS IIALLK + LTGENY TWKS LNMILV+ +L FVL E+CP  P + ASQSV+DAYD
Sbjct: 1   MSSSIIALLKKDQLTGENYATWKSKLNMILVIVDLSFVLMEECPPFPTKHASQSVRDAYD 60

Query: 61  RWIKANDKAKVYILASLSEVLAKKHKGMVSAREIMSPLQNIFGQLSGQLRHESLKYVYNS 120
           RW KANDKA+++ILAS+S++L+KKH+ MV+AR+IM  L+ +FGQ S Q++ E+   V +S
Sbjct: 61  RWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLREMFGQPSIQIKQEA--NVAHS 120

Query: 121 RMKEGSSVKEHVLDLMVHFNVEDMNGAVIDKQSQKTQKKKIGGKGKAPAAADKGKGKPKV 180
           + +   S                         S+K QK+K  GKGK P  A + KGK KV
Sbjct: 121 KRRFVPS----------------------PSGSEKIQKRK-EGKGKGPTIAVEDKGKAKV 180

Query: 181 ADKGKCFHCNVDGHWKRNDPKYLVELKEK------------------------------- 240
           A K KCFHCNVD HWK N PKYLV+ KEK                               
Sbjct: 181 AIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGATNHVCSSLQETSSFKQLEDSEMTLKVGT 240

Query: 241 ----------------------------------------------KGKMTKQPFTGKGY 300
                                                         +GKMTK+PFTGKGY
Sbjct: 241 GDVISARAVGDAKLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGY 300

Query: 301 RAKKPLELIHSDLCGPVNVKARVEYEYFISFIDVYSRYGYLYLMHHMSEALKKFKEYKTE 360
           RAK+PLELIHSDLCGP+NVKAR  +EYFISFID YSRYGYLYLM H SEAL+KFKEYKTE
Sbjct: 301 RAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTE 360

Query: 361 VENALGKTIKTLRSDRGGEYKDLRFQDYLIEHGIQSQLAQPNTPQQN------------- 420
           VEN L K IK LRSDRGGEY DLRFQDY+IEHGIQSQL+ P TPQQN             
Sbjct: 361 VENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDM 420

Query: 421 ----------------------------------------------------------AH 480
                                                                     AH
Sbjct: 421 VRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH 480

Query: 481 MLWTNPKKLEPRSRLCQFVGYPKETRGCLFYDPQENKVLVLTNTTFLEEDNMRNLKPRT- 488
           +L TNPKKLEPRSRLCQFVGYPKETRG LF+DPQEN+V V TN TFLEED+MRN KPR+ 
Sbjct: 481 VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSK 540

BLAST of Tan0002455 vs. NCBI nr
Match: TYK02840.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 580.5 bits (1495), Expect = 1.4e-161
Identity = 345/653 (52.83%), Postives = 388/653 (59.42%), Query Frame = 0

Query: 1   MSSLIIALLKSECLTGENYTTWKSNLNMILVVDELRFVLTEKCPQVPARSASQSVKDAYD 60
           MSS IIALLK + LTGENY TWKS LNMILV+ +L FVL E+CP  P + ASQSV+DAYD
Sbjct: 1   MSSSIIALLKKDQLTGENYATWKSKLNMILVIVDLSFVLMEECPPFPTKHASQSVRDAYD 60

Query: 61  RWIKANDKAKVYILASLSEVLAKKHKGMVSAREIMSPLQNIFGQLSGQLRHESLKYVYNS 120
           RW KANDKA+++ILAS+S++L+KKH+ MV+AR+IM  L+ +FGQ S Q++ E+   V +S
Sbjct: 61  RWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLREMFGQPSIQIKQEA--NVAHS 120

Query: 121 RMKEGSSVKEHVLDLMVHFNVEDMNGAVIDKQSQKTQKKKIGGKGKAPAAADKGKGKPKV 180
           + +   S                         S+K QK+K  GKGK P  A + KGK KV
Sbjct: 121 KRRFVPS----------------------PSGSEKIQKRK-EGKGKGPTIAVEDKGKAKV 180

Query: 181 ADKGKCFHCNVDGHWKRNDPKYLVELKEK------------------------------- 240
           A K KCFHCNVD HWK N PKYLV+ KEK                               
Sbjct: 181 AIKRKCFHCNVDEHWKTNCPKYLVKKKEKEGATNHVCSSLQETSSFKQLEDSEMTLKVGT 240

Query: 241 ----------------------------------------------KGKMTKQPFTGKGY 300
                                                         +GKMTK+PFTGKGY
Sbjct: 241 GDVISARAVGDAKLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGY 300

Query: 301 RAKKPLELIHSDLCGPVNVKARVEYEYFISFIDVYSRYGYLYLMHHMSEALKKFKEYKTE 360
           RAK+PLELIHSDLCGP+NVKAR  +EYFISFID YSRYGYLYLM H SEAL+KFKEYKTE
Sbjct: 301 RAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTE 360

Query: 361 VENALGKTIKTLRSDRGGEYKDLRFQDYLIEHGIQSQLAQPNTPQQN------------- 420
           VEN L K IK LRSDRGGEY DLRFQDY+IEHGIQSQL+ P TPQQN             
Sbjct: 361 VENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDM 420

Query: 421 ----------------------------------------------------------AH 480
                                                                     AH
Sbjct: 421 VRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH 480

Query: 481 MLWTNPKKLEPRSRLCQFVGYPKETRGCLFYDPQENKVLVLTNTTFLEEDNMRNLKPRT- 488
           +L TNPKKLEPRSRLCQFVGYPKETRG LF+DPQEN+V V TN TFLEED+MRN KPR+ 
Sbjct: 481 VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSK 540

BLAST of Tan0002455 vs. NCBI nr
Match: KAA0025159.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 564.7 bits (1454), Expect = 7.8e-157
Identity = 332/605 (54.88%), Postives = 377/605 (62.31%), Query Frame = 0

Query: 1   MSSLIIALLKSECLTGENYTTWKSNLNMILVVDELRFVLTEKCPQVPARSASQSVKDAYD 60
           MSS IIALLK + LTGENY TWKS LNMILV+ +L FVL E+CP  P + ASQSV+D YD
Sbjct: 1   MSSSIIALLKKDQLTGENYATWKSKLNMILVIADLSFVLMEECPPFPTKYASQSVRDTYD 60

Query: 61  RWIKANDKAKVYILASLSEVLAKKHKGMVSAREIMSPLQNIFGQLSGQLRHESLKYVYNS 120
           RW KANDKA+++ILAS+S++L+KKH+ MV+AR+IM  L+ +FGQ S Q++ E+   V++ 
Sbjct: 61  RWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLREMFGQPSIQIKQEA-NVVHSK 120

Query: 121 RMKEGSSVKEHVLDLMVHFNVEDMNGAVIDKQSQKTQKKKIGGKGKAPAAADKGKGKPKV 180
           R    SS                         S+K QK+K  GK K P  A + KGK KV
Sbjct: 121 RRFVPSS-----------------------SGSEKIQKRK-EGKEKGPTIAVEDKGKAKV 180

Query: 181 ADKGKCFHCNVDGHWKRNDPKYLVELKEK------------------------------- 240
           A K K FHCNVD HWK N PKYLV+ KE                                
Sbjct: 181 AIKRKYFHCNVDEHWKTNCPKYLVKKKENEETSSFKQLEESEMTLKVGTGDVISARAVGD 240

Query: 241 -----------------------------------KGKMTKQPFTGKGYRAKKPLELIHS 300
                                              +GKMTK+PFT KGYRAK+PLELIHS
Sbjct: 241 AKLGHINLDRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTEKGYRAKEPLELIHS 300

Query: 301 DLCGPVNVKARVEYEYFISFIDVYSRYGYLYLMHHMSEALKKFKEYKTEVENALGKTIKT 360
           DLCG +NVKAR  +EYFISFID YSRYGYLYLM H SEAL+KFKEYKTEVEN L K IK 
Sbjct: 301 DLCGLMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI 360

Query: 361 LRSDRGGEYKDLRFQDYLIEHGIQSQLAQP---------NTPQQN--------------- 420
           LRSDRGGEY DLRFQDY+IEHGIQSQL+ P         N P ++               
Sbjct: 361 LRSDRGGEYMDLRFQDYMIEHGIQSQLSTPVETAVHILNNAPSKSVSETPFELWRGRKPS 420

Query: 421 ----------AHMLWTNPKKLEPRSRLCQFVGYPKETRGCLFYDPQENKVLVLTNTTFLE 480
                      H+L TNPKKL+ RSRLCQFVGYPKETRG LF+DPQEN+V V TN TFLE
Sbjct: 421 LSHFRIWGCPTHVLVTNPKKLKSRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLE 480

Query: 481 EDNMRNLKPRT------------------GPSSRVDEEVGTSSQSRPSQLWGMPRRSGRV 488
           ED+MRN KPR+                  GPSSRVDE   TS QS PSQ   MPRRSGRV
Sbjct: 481 EDHMRNHKPRSKLVLSEATDKSTRVVDEVGPSSRVDETT-TSGQSHPSQSLRMPRRSGRV 540

BLAST of Tan0002455 vs. NCBI nr
Match: TYJ97618.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 562.0 bits (1447), Expect = 5.0e-156
Identity = 318/646 (49.23%), Postives = 395/646 (61.15%), Query Frame = 0

Query: 1   MSSLIIALLKSECLTGENYTTWKSNLNMILVVDELRFVLTEKCPQVPARSASQSVKDAYD 60
           M++  + +L ++ L G NY +WK+ +N++L++D+L+FVL E+CPQVPA +A+Q+V++ Y+
Sbjct: 1   MTTATLNMLAADKLNGNNYASWKNTINIVLIIDDLKFVLVEECPQVPAANATQTVREPYE 60

Query: 61  RWIKANDKAKVYILASLSEVLAKKHKGMVSAREIMSPLQNIFGQLSGQLRHESLKYVYNS 120
           RW K N+K + YILASLSEVLAKKH+ M++AREIM  LQ +FGQ S Q+ H++LKY+YN+
Sbjct: 61  RWAKGNEKGRAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQINHDALKYIYNA 120

Query: 121 RMKEGSSVKEHVLDLMVHFNVEDMNGAVIDKQSQ-------------------------- 180
           RM EG+SV+EHVL++MVHFNV +MNGAVID+ SQ                          
Sbjct: 121 RMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQGQKGEANVATSTRKFHRGSTSGTKSM 180

Query: 181 ------KTQKKKIGGKG-KAPAAADKGKGKPKVADKGKCFHCNVDGHWKRNDPKYLVELK 240
                 K  KKK GG+G KA  AA K   K K A KG CFH N +GHWKRN PKYL E K
Sbjct: 181 PSSSGNKKWKKKKGGQGNKANLAAAKTTKKSK-ATKGICFHYNQEGHWKRNCPKYLAEKK 240

Query: 241 EKK----------------------------------GKMTKQPFTGKGYRAKKPLELIH 300
           + K                                  GKMTK+PFTGKG+RAK+PLEL+H
Sbjct: 241 KAKQGHINLNRIERLVKNGILSELEENSLPICESCLEGKMTKRPFTGKGHRAKEPLELVH 300

Query: 301 SDLCGPVNVKARVEYEYFISFIDVYSRYGYLYLMHHMSEALKKFKEYKTEVENALGKTIK 360
           SDLCGP+NVKAR E+EYFI+F D YSRYGY+YLM H SEAL+KFKEYK EVENAL KTIK
Sbjct: 301 SDLCGPMNVKARGEFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIK 360

Query: 361 TLRSDRGGEYKDLRFQDYLIEHGIQSQLAQPNTPQQN----------------------- 420
           T RSDRGGEY DL+FQ+YL+E  I SQL+ P TPQQN                       
Sbjct: 361 TFRSDRGGEYMDLKFQNYLMECEILSQLSAPGTPQQNGVSERRNRTLLDMVRSMISYAHL 420

Query: 421 ------------------------------------------------AHMLWTNPKKLE 480
                                                           AH+L  NPKKLE
Sbjct: 421 PNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLE 480

Query: 481 PRSRLCQFVGYPKETRGCLFYDPQENKVLVLTNTTFLEEDNMRNLKPR------------ 488
           PRS+LC FVGYPK TRG  FYDP++NKV V TN TFLEED++R  KPR            
Sbjct: 481 PRSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKET 540

BLAST of Tan0002455 vs. NCBI nr
Match: KAA0060254.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 558.5 bits (1438), Expect = 5.6e-155
Identity = 317/646 (49.07%), Postives = 395/646 (61.15%), Query Frame = 0

Query: 1   MSSLIIALLKSECLTGENYTTWKSNLNMILVVDELRFVLTEKCPQVPARSASQSVKDAYD 60
           M++  + +L ++ L G NY +WK+ +N++L++D+L+FVL E+CPQVPA +A+Q+V++ Y+
Sbjct: 1   MTTATLNMLAADKLNGNNYASWKNTINIVLIIDDLKFVLVEECPQVPAANATQTVREPYE 60

Query: 61  RWIKANDKAKVYILASLSEVLAKKHKGMVSAREIMSPLQNIFGQLSGQLRHESLKYVYNS 120
           RW K N+K + YILASLSEVLAKKH+ M++AREIM  LQ +FGQ S Q+ H++LKY+YN+
Sbjct: 61  RWAKGNEKGRAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQINHDALKYIYNA 120

Query: 121 RMKEGSSVKEHVLDLMVHFNVEDMNGAVIDKQSQ-------------------------- 180
           RM EG+SV+EHVL++MVHFNV +MNGAVID+ SQ                          
Sbjct: 121 RMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQGQKGEANVATSTRKFHRGSTSGTKSM 180

Query: 181 ------KTQKKKIGGKG-KAPAAADKGKGKPKVADKGKCFHCNVDGHWKRNDPKYLVELK 240
                 K  KKK GG+G KA  AA K   K K A KG CFH N +GHWKRN PKYL E K
Sbjct: 181 PSSSGNKKWKKKKGGQGNKANLAAAKTTKKSK-ATKGICFHYNQEGHWKRNCPKYLAEKK 240

Query: 241 EKK----------------------------------GKMTKQPFTGKGYRAKKPLELIH 300
           + K                                  GKMTK+PFTGKG+RAK+PLEL+H
Sbjct: 241 KAKQGHINLNRIERLVKNGILSELEENSLPICESCLEGKMTKRPFTGKGHRAKEPLELVH 300

Query: 301 SDLCGPVNVKARVEYEYFISFIDVYSRYGYLYLMHHMSEALKKFKEYKTEVENALGKTIK 360
           SDLCGP+NVKAR E+EYFI+F D YSRYGY+YLM H SEAL+KFKEYK EVENAL KTIK
Sbjct: 301 SDLCGPMNVKARGEFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIK 360

Query: 361 TLRSDRGGEYKDLRFQDYLIEHGIQSQLAQPNTPQQN----------------------- 420
           T RSDRGGEY DL+FQ+YL+E  I SQL+ P TPQQN                       
Sbjct: 361 TFRSDRGGEYMDLKFQNYLMECEILSQLSAPGTPQQNGVSERRNRTLLDMVRSMISYAHL 420

Query: 421 ------------------------------------------------AHMLWTNPKKLE 480
                                                           AH+L  NPKKLE
Sbjct: 421 PNSFWGYAVQTAVYILNYVPSKSVYETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLE 480

Query: 481 PRSRLCQFVGYPKETRGCLFYDPQENKVLVLTNTTFLEEDNMRNLKPR------------ 488
           PRS+LC FVGYPK TRG  FYD ++NKV VLTN TFLE+D++R  KPR            
Sbjct: 481 PRSKLCLFVGYPKGTRGGYFYDLKDNKVFVLTNATFLEKDHIREHKPRSKIVLNKLSKEI 540

BLAST of Tan0002455 vs. ExPASy TrEMBL
Match: A0A5A7SIN2 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold2405G00060 PE=4 SV=1)

HSP 1 Score: 564.7 bits (1454), Expect = 3.8e-157
Identity = 332/605 (54.88%), Postives = 377/605 (62.31%), Query Frame = 0

Query: 1   MSSLIIALLKSECLTGENYTTWKSNLNMILVVDELRFVLTEKCPQVPARSASQSVKDAYD 60
           MSS IIALLK + LTGENY TWKS LNMILV+ +L FVL E+CP  P + ASQSV+D YD
Sbjct: 1   MSSSIIALLKKDQLTGENYATWKSKLNMILVIADLSFVLMEECPPFPTKYASQSVRDTYD 60

Query: 61  RWIKANDKAKVYILASLSEVLAKKHKGMVSAREIMSPLQNIFGQLSGQLRHESLKYVYNS 120
           RW KANDKA+++ILAS+S++L+KKH+ MV+AR+IM  L+ +FGQ S Q++ E+   V++ 
Sbjct: 61  RWTKANDKARLHILASMSDILSKKHEIMVTARQIMDSLREMFGQPSIQIKQEA-NVVHSK 120

Query: 121 RMKEGSSVKEHVLDLMVHFNVEDMNGAVIDKQSQKTQKKKIGGKGKAPAAADKGKGKPKV 180
           R    SS                         S+K QK+K  GK K P  A + KGK KV
Sbjct: 121 RRFVPSS-----------------------SGSEKIQKRK-EGKEKGPTIAVEDKGKAKV 180

Query: 181 ADKGKCFHCNVDGHWKRNDPKYLVELKEK------------------------------- 240
           A K K FHCNVD HWK N PKYLV+ KE                                
Sbjct: 181 AIKRKYFHCNVDEHWKTNCPKYLVKKKENEETSSFKQLEESEMTLKVGTGDVISARAVGD 240

Query: 241 -----------------------------------KGKMTKQPFTGKGYRAKKPLELIHS 300
                                              +GKMTK+PFT KGYRAK+PLELIHS
Sbjct: 241 AKLGHINLDRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTEKGYRAKEPLELIHS 300

Query: 301 DLCGPVNVKARVEYEYFISFIDVYSRYGYLYLMHHMSEALKKFKEYKTEVENALGKTIKT 360
           DLCG +NVKAR  +EYFISFID YSRYGYLYLM H SEAL+KFKEYKTEVEN L K IK 
Sbjct: 301 DLCGLMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI 360

Query: 361 LRSDRGGEYKDLRFQDYLIEHGIQSQLAQP---------NTPQQN--------------- 420
           LRSDRGGEY DLRFQDY+IEHGIQSQL+ P         N P ++               
Sbjct: 361 LRSDRGGEYMDLRFQDYMIEHGIQSQLSTPVETAVHILNNAPSKSVSETPFELWRGRKPS 420

Query: 421 ----------AHMLWTNPKKLEPRSRLCQFVGYPKETRGCLFYDPQENKVLVLTNTTFLE 480
                      H+L TNPKKL+ RSRLCQFVGYPKETRG LF+DPQEN+V V TN TFLE
Sbjct: 421 LSHFRIWGCPTHVLVTNPKKLKSRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLE 480

Query: 481 EDNMRNLKPRT------------------GPSSRVDEEVGTSSQSRPSQLWGMPRRSGRV 488
           ED+MRN KPR+                  GPSSRVDE   TS QS PSQ   MPRRSGRV
Sbjct: 481 EDHMRNHKPRSKLVLSEATDKSTRVVDEVGPSSRVDETT-TSGQSHPSQSLRMPRRSGRV 540

BLAST of Tan0002455 vs. ExPASy TrEMBL
Match: A0A5D3BHG7 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold639G00150 PE=4 SV=1)

HSP 1 Score: 562.0 bits (1447), Expect = 2.4e-156
Identity = 318/646 (49.23%), Postives = 395/646 (61.15%), Query Frame = 0

Query: 1   MSSLIIALLKSECLTGENYTTWKSNLNMILVVDELRFVLTEKCPQVPARSASQSVKDAYD 60
           M++  + +L ++ L G NY +WK+ +N++L++D+L+FVL E+CPQVPA +A+Q+V++ Y+
Sbjct: 1   MTTATLNMLAADKLNGNNYASWKNTINIVLIIDDLKFVLVEECPQVPAANATQTVREPYE 60

Query: 61  RWIKANDKAKVYILASLSEVLAKKHKGMVSAREIMSPLQNIFGQLSGQLRHESLKYVYNS 120
           RW K N+K + YILASLSEVLAKKH+ M++AREIM  LQ +FGQ S Q+ H++LKY+YN+
Sbjct: 61  RWAKGNEKGRAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQINHDALKYIYNA 120

Query: 121 RMKEGSSVKEHVLDLMVHFNVEDMNGAVIDKQSQ-------------------------- 180
           RM EG+SV+EHVL++MVHFNV +MNGAVID+ SQ                          
Sbjct: 121 RMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQGQKGEANVATSTRKFHRGSTSGTKSM 180

Query: 181 ------KTQKKKIGGKG-KAPAAADKGKGKPKVADKGKCFHCNVDGHWKRNDPKYLVELK 240
                 K  KKK GG+G KA  AA K   K K A KG CFH N +GHWKRN PKYL E K
Sbjct: 181 PSSSGNKKWKKKKGGQGNKANLAAAKTTKKSK-ATKGICFHYNQEGHWKRNCPKYLAEKK 240

Query: 241 EKK----------------------------------GKMTKQPFTGKGYRAKKPLELIH 300
           + K                                  GKMTK+PFTGKG+RAK+PLEL+H
Sbjct: 241 KAKQGHINLNRIERLVKNGILSELEENSLPICESCLEGKMTKRPFTGKGHRAKEPLELVH 300

Query: 301 SDLCGPVNVKARVEYEYFISFIDVYSRYGYLYLMHHMSEALKKFKEYKTEVENALGKTIK 360
           SDLCGP+NVKAR E+EYFI+F D YSRYGY+YLM H SEAL+KFKEYK EVENAL KTIK
Sbjct: 301 SDLCGPMNVKARGEFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIK 360

Query: 361 TLRSDRGGEYKDLRFQDYLIEHGIQSQLAQPNTPQQN----------------------- 420
           T RSDRGGEY DL+FQ+YL+E  I SQL+ P TPQQN                       
Sbjct: 361 TFRSDRGGEYMDLKFQNYLMECEILSQLSAPGTPQQNGVSERRNRTLLDMVRSMISYAHL 420

Query: 421 ------------------------------------------------AHMLWTNPKKLE 480
                                                           AH+L  NPKKLE
Sbjct: 421 PNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLE 480

Query: 481 PRSRLCQFVGYPKETRGCLFYDPQENKVLVLTNTTFLEEDNMRNLKPR------------ 488
           PRS+LC FVGYPK TRG  FYDP++NKV V TN TFLEED++R  KPR            
Sbjct: 481 PRSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKET 540

BLAST of Tan0002455 vs. ExPASy TrEMBL
Match: A0A5A7UYX7 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold22G00180 PE=4 SV=1)

HSP 1 Score: 558.5 bits (1438), Expect = 2.7e-155
Identity = 317/646 (49.07%), Postives = 395/646 (61.15%), Query Frame = 0

Query: 1   MSSLIIALLKSECLTGENYTTWKSNLNMILVVDELRFVLTEKCPQVPARSASQSVKDAYD 60
           M++  + +L ++ L G NY +WK+ +N++L++D+L+FVL E+CPQVPA +A+Q+V++ Y+
Sbjct: 1   MTTATLNMLAADKLNGNNYASWKNTINIVLIIDDLKFVLVEECPQVPAANATQTVREPYE 60

Query: 61  RWIKANDKAKVYILASLSEVLAKKHKGMVSAREIMSPLQNIFGQLSGQLRHESLKYVYNS 120
           RW K N+K + YILASLSEVLAKKH+ M++AREIM  LQ +FGQ S Q+ H++LKY+YN+
Sbjct: 61  RWAKGNEKGRAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQINHDALKYIYNA 120

Query: 121 RMKEGSSVKEHVLDLMVHFNVEDMNGAVIDKQSQ-------------------------- 180
           RM EG+SV+EHVL++MVHFNV +MNGAVID+ SQ                          
Sbjct: 121 RMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQGQKGEANVATSTRKFHRGSTSGTKSM 180

Query: 181 ------KTQKKKIGGKG-KAPAAADKGKGKPKVADKGKCFHCNVDGHWKRNDPKYLVELK 240
                 K  KKK GG+G KA  AA K   K K A KG CFH N +GHWKRN PKYL E K
Sbjct: 181 PSSSGNKKWKKKKGGQGNKANLAAAKTTKKSK-ATKGICFHYNQEGHWKRNCPKYLAEKK 240

Query: 241 EKK----------------------------------GKMTKQPFTGKGYRAKKPLELIH 300
           + K                                  GKMTK+PFTGKG+RAK+PLEL+H
Sbjct: 241 KAKQGHINLNRIERLVKNGILSELEENSLPICESCLEGKMTKRPFTGKGHRAKEPLELVH 300

Query: 301 SDLCGPVNVKARVEYEYFISFIDVYSRYGYLYLMHHMSEALKKFKEYKTEVENALGKTIK 360
           SDLCGP+NVKAR E+EYFI+F D YSRYGY+YLM H SEAL+KFKEYK EVENAL KTIK
Sbjct: 301 SDLCGPMNVKARGEFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIK 360

Query: 361 TLRSDRGGEYKDLRFQDYLIEHGIQSQLAQPNTPQQN----------------------- 420
           T RSDRGGEY DL+FQ+YL+E  I SQL+ P TPQQN                       
Sbjct: 361 TFRSDRGGEYMDLKFQNYLMECEILSQLSAPGTPQQNGVSERRNRTLLDMVRSMISYAHL 420

Query: 421 ------------------------------------------------AHMLWTNPKKLE 480
                                                           AH+L  NPKKLE
Sbjct: 421 PNSFWGYAVQTAVYILNYVPSKSVYETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLE 480

Query: 481 PRSRLCQFVGYPKETRGCLFYDPQENKVLVLTNTTFLEEDNMRNLKPR------------ 488
           PRS+LC FVGYPK TRG  FYD ++NKV VLTN TFLE+D++R  KPR            
Sbjct: 481 PRSKLCLFVGYPKGTRGGYFYDLKDNKVFVLTNATFLEKDHIREHKPRSKIVLNKLSKEI 540

BLAST of Tan0002455 vs. ExPASy TrEMBL
Match: A0A5A7VJG3 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold138G001110 PE=4 SV=1)

HSP 1 Score: 538.5 bits (1386), Expect = 2.9e-149
Identity = 331/702 (47.15%), Postives = 374/702 (53.28%), Query Frame = 0

Query: 1   MSSLIIALLKSECLTGENYTTWKSNLNMILVVDELRFVLTEKCPQVPARSASQSVKDAYD 60
           MS  IIALLK + LTGENY TWKS LNMILV+ +LRFVL E+CP  P + ASQSV+DAYD
Sbjct: 1   MSCSIIALLKKDQLTGENYATWKSKLNMILVIADLRFVLMEECPPFPTKYASQSVRDAYD 60

Query: 61  RWIKANDKAKVYILASLSEVLAKKHKGMVSAREIMSPLQNIFGQLSGQLRHESLKYVYNS 120
            W KANDKA ++ILAS+S++L+KKH+ MV+AR+IM  L+ +FGQ S Q++ E+    ++ 
Sbjct: 61  CWTKANDKAHLHILASISDILSKKHEIMVTARQIMDSLREMFGQPSIQIKQEA-NVAHSK 120

Query: 121 RMKEGSSVKEHVLDLMVHFNVEDMNGAVIDKQSQKTQKKKIGGKGKAPAAADKGKGKPKV 180
           R    SS                         S+K QK+K  GKG+ P  A +GKGK KV
Sbjct: 121 RFAPSSS------------------------GSEKIQKRK-EGKGRGPTIAVEGKGKAKV 180

Query: 181 ADKGKCFHCNVDGHWKRNDPKYLVELKEK------------------------------- 240
             KGKCFHCNVD HWK N PKYLV+ KEK                               
Sbjct: 181 VIKGKCFHCNVDEHWKTNCPKYLVKKKEKEGATNHVCSSLQETSSFKQLEESEMTLMVGT 240

Query: 241 ------------------------------------------------------------ 300
                                                                       
Sbjct: 241 GDVISARAVGDVKLFFGIKFMFLENLYIVPKIKRNLVFVSCLIEHMYSINFSMNEAFISK 300

Query: 301 ------------------------------------------------------------ 360
                                                                       
Sbjct: 301 NGAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLDHINLDRIG 360

Query: 361 -----------------------KGKMTKQPFTGKGYRAKKPLELIHSDLCGPVNVKARV 420
                                  +GKMTK+PFTGK YRAK+PLELIHSDLCGP+NVKAR 
Sbjct: 361 RLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKDYRAKEPLELIHSDLCGPMNVKARG 420

Query: 421 EYEYFISFIDVYSRYGYLYLMHHMSEALKKFKEYKTEVENALGKTIKTLRSDRGGEYKDL 480
            +EYFISFID YSRYGYLYLM H  EAL+KFKEYKTEVEN L K IK LRSDRGGEY DL
Sbjct: 421 GFEYFISFIDDYSRYGYLYLMEHKYEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDL 480

Query: 481 RFQDYLIEHGIQSQLAQPNTPQQNA-----------------------HMLWTNPKKLEP 488
           RFQDY+IEHGIQSQL+ P TPQQN                           W  PKKLEP
Sbjct: 481 RFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGTPKKLEP 540

BLAST of Tan0002455 vs. ExPASy TrEMBL
Match: A0A5D3DZX8 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold381G00320 PE=4 SV=1)

HSP 1 Score: 537.0 bits (1382), Expect = 8.4e-149
Identity = 315/592 (53.21%), Postives = 358/592 (60.47%), Query Frame = 0

Query: 28  MILVVDELRFVLTEKCPQVPARSASQSVKDAYDRWIKANDKAKVYILASLSEVLAKKHKG 87
           MILV+ +LRFVL EKCP  P +  SQSV+DAY RW KANDKA ++ILAS+S++L+KKH+ 
Sbjct: 1   MILVIGDLRFVLMEKCPPFPTKYESQSVRDAYGRWTKANDKAHLHILASMSDILSKKHEI 60

Query: 88  MVSAREIMSPLQNIFGQLSGQLRHESLKYVYNSRMKEGSSVKEHVLDLMVHFNVEDMNGA 147
           MV AR+IM  L+ +FGQ S Q++ E+    ++ R    SS                    
Sbjct: 61  MVIARQIMDSLREMFGQPSIQIKQEA-NVTHSRRFSPSSS-------------------- 120

Query: 148 VIDKQSQKTQKKKIGGKGKAPAAADKGKGKPKVADKGKCFHCNVDGHWKRNDPKYLVELK 207
                S+K QK+K  GKGK P  A + KGK KV  KGKCFHC+VD HWK N PKYLV+ K
Sbjct: 121 ----GSEKIQKRK-EGKGKGPTIAVEDKGKTKVVIKGKCFHCDVDEHWKTNCPKYLVKKK 180

Query: 208 EK---------------------------------------------------------- 267
           EK                                                          
Sbjct: 181 EKEGATNHVCSSLQETSSFKQLEDSEMTLKVGTGDVISARAVGDAKLGHINLNQIGRLIK 240

Query: 268 -------------------KGKMTKQPFTGKGYRAKKPLELIHSDLCGPVNVKARVEYEY 327
                              +GKMTK+PFT KGYRAK+PLELIHSDLCGP+NVKAR  +EY
Sbjct: 241 NGLLNKLEDDSLPSCESCHEGKMTKRPFTEKGYRAKEPLELIHSDLCGPMNVKARGGFEY 300

Query: 328 FISFIDVYSRYGYLYLMHHMSEALKKFKEYKTEVENALGKTIKTLRSDRGGEYKDLRFQD 387
           FISFID YSRYGYLYLM H SEAL+KFKEYK EVEN L K IK LRSD+GGEY DLRFQD
Sbjct: 301 FISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDQGGEYMDLRFQD 360

Query: 388 YLIEHGIQSQLAQPNTPQQN-------------------------------------AHM 447
           Y+IEHGIQSQL+ P TPQQN                                     +H+
Sbjct: 361 YMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDKVRSMMSYTQLPSSFWGDCSSYLEQSHV 420

Query: 448 LWTNPKKLEPRSRLCQFVGYPKETRGCLFYDPQENKVLVLTNTTFLEEDNMRNLKPRT-- 488
           L TNPKKL PRSRLCQFVGYPKETRG L +DPQEN+VLV TN TFLEED+ R+ KPR+  
Sbjct: 421 LVTNPKKLGPRSRLCQFVGYPKETRGGLLFDPQENRVLVSTNATFLEEDHTRDHKPRSKL 480

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
P109789.6e-1731.54Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
Q9ZT941.4e-1237.82Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
Q94HW21.0e-1031.93Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
P041461.3e-1029.51Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
Q124911.7e-0528.68Transposon Ty2-B Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
Match NameE-valueIdentityDescription
KAA0059226.11.4e-16152.83gag/pol protein [Cucumis melo var. makuwa][more]
TYK02840.11.4e-16152.83gag/pol protein [Cucumis melo var. makuwa][more]
KAA0025159.17.8e-15754.88gag/pol protein [Cucumis melo var. makuwa][more]
TYJ97618.15.0e-15649.23gag/pol protein [Cucumis melo var. makuwa][more]
KAA0060254.15.6e-15549.07gag/pol protein [Cucumis melo var. makuwa][more]
Match NameE-valueIdentityDescription
A0A5A7SIN23.8e-15754.88Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold2405G000... [more]
A0A5D3BHG72.4e-15649.23Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold639G0015... [more]
A0A5A7UYX72.7e-15549.07Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold22G00180... [more]
A0A5A7VJG32.9e-14947.15Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold138G0011... [more]
A0A5D3DZX88.4e-14953.21Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold381G0032... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 270..290
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 385..418
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 151..177
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 397..413
NoneNo IPR availablePANTHERPTHR35317:SF8POLYPROTEIN-LIKE PROTEINcoord: 18..156
NoneNo IPR availablePANTHERPTHR35317OS04G0629600 PROTEINcoord: 18..156
IPR001584Integrase, catalytic corePFAMPF00665rvecoord: 226..327
e-value: 1.1E-10
score: 41.7
IPR001584Integrase, catalytic corePROSITEPS50994INTEGRASEcoord: 224..331
score: 14.104942
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 221..335
e-value: 8.3E-20
score: 73.0
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 223..333

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0002455.1Tan0002455.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
molecular_function GO:0003676 nucleic acid binding