Tan0015873 (gene) Snake gourd v1

Overview
NameTan0015873
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
LocationLG06: 42020811 .. 42024077 (+)
RNA-Seq ExpressionTan0015873
SyntenyTan0015873
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCGAGCTCGACTGCGTCGCGAAGTGTTCACGATGCATACGATCGACGGATCAGGGCCAATGAAAAGGCCAAGGACTATATCATTGCCAGCATGTCTGATGTTTTGGCAAAGAAGCATGAGCTGATGGTCACCGCTAAAGAGATCATGGAGTCCTTGCAGGAAATGTTTGGACAACAGTCTTTTCACGTCCGACATGACTCGCTCAAATACGTCTTCAACGCACGGATGGAAGAGGGGTCGTTTGTCCGTAAACATGTTCTAGACATGATAACCCACTTTAATCTAGCGAAGATGAATGGGGCTTCGATCGACGAGTCAAGCCAGGTCAGTTTTATCTTTGAGACTCTTCCGAAGAGTTTCCTTTAGTTTCGTAGCAATGCTGTTATGAACAAAATTAGCTACACTCTAACTACCCTCCTCAATGAGCTACAGAATTTCCAGTCCTTGATGAGGGTCAAGGCACCGGAATCTGAGGCAAATGTTACCTACAGGTCTTATCACAGGGGTTTGACCTCTGGGACTAAACATGTTGCTCCTTCACGCCCGAAAGGGAAGAAGAGGATGAAGAGGGGTAAAACTGACCGTGTTGCCGCCCAAAAAGGCAAGAAGGTCAAGGAAGTTGCAGAGAAAGGAAAGTGTTTCCACTGCAATGAGGGCGAACACTGGAAGAGGAACTGTCCCAAATTCGTAGCAGGGAGGAAGAATCAAGGTAAATGTGATTTAATTGTGACAGAAACCTGTTTAGTGGAGAGTAGTTACTCTGCCTGGATATTAAATTCGGGCGCCGCTAACCATGTTTGTTCTTTCTTGAGAGGATTGATTCTGGCGCGGTGCGAGAGGGTGAGGTGACTCTACGGGTTGGATCCAGGAGCTTGTCTGCTGCGTGATCAAACGCGGTGAAGCTACACTTTACACTTTGGCAGGAATTACATTTTGTTGGACAACATGTATATAGTTCCATGGTTTACTAGAAACCTAGTTTCTATTTCTTGCCTTTATTTCCAGAAATGGCAATCTTATTTGTTCTGCTTCACTTGAGCATAATCAGTATGTTTTGAAACCTAATTCGGTCAAAAGTGTTTTGAATCTTTGAATTGTTTTAAAGCTATGAAACACGAACTAAAAGAGCGAAAGTTTCTCCTAAGGAAAATGTCCATCTTTGGCATCTACTGGTTAGGCCACATTAATCTCAATAGGATTGAGAAACTAGTGAAGAGTGGACTTTCTAAGCGAGTTGGAAGAAAACTCTTTACCGGTGTGTGAGTCATTCCTTGAGGGCAAAATGACCAAACGTCTTTTTAGTGGAAAAGGATATAGAGCCAAAGAGCCCCTTGAGTTAGTACATTCTGGCCTCTGTGGTTCGATGAATGTTAAAGCTCGAGGTGGTTATGAATACTTCGTGTCTTTCATTGACGATTACTCGAGGTATGGGTATATTTACCTAATGCATAAGAAGTCTGAAACTCTTGAAAAGTTCAAGGAGTACAAGACTAAGGTTGAGAACCTCTTAGGTAAATCGCTTAAAACACTTCGATCGGATCGAGGTAGAGAGTACATGGACACTGAATTCCAGGACTATATGATAGAACACGAAATTACGTCCCAACTCTCAGCACCTGGTATGCCAAAGCAGAATGGTGTATCGGAGAGGAGAAACAAAACCTTGTTGGACATGGTTCGGTCGATGATGAGCTATGTTCGTCTCCCTGATTCTTTTTGGGGTTATGCAGTAGAGACTGCGGTCTAGATTTTGAACAACGTTCCGTCGAAGAGTGTTTGTAAAACACCTTTCGAACTCTGAAATGGCCGTAAAGGAAGTTTACATCATTTCAGAATTTAGGGATGCCCGACACATGTGTTGGTGTCAAACCCAAAAAAGTTGGAACCCCGTTCGAAATTGTGCCTATTTGTAGATTACCCTAAAGAGACTGGGGGTGGTCTATTTTACGATCCTAAGGAAAATAAGGTGCTTGTGTCGACAAACGTCATTTTCCTAGAGGAAGACCATGTCAGGGATCATTTACCAAGGAGTAAAATTGTGTTAAATGAAATGGACAGTACATCAGCAAGAGTTGCTGATGGGGCTAGTACATCAACAAGTGTTGTTGATCCTAACACGTCTAGTCAAATTAGTTCCCAAAAGTTGGGAATGCCTCGACGTAGTGGGAGGGTTGTGAGACAGCCTGATCGTTACATGGGTTTAGCTGAAACCTCAGTTGTCGCTTCTGATGATGACTGTGAGGATCCATTGACCTATGATCAGGCAATGGTTGATGTTGACAAAGACGAATGGATTAAAGATATGAACCAGGAAATGAGTCGATGTACTTCAATTCTGTCTGGGAGCTTGTGGATCAACCAGATGGGGTAAAACCTATTGGTTGCAAATGGATCTACAAGCGTAAACGTGGCGTAGATGGGAAGGTGAAGACCTTCAAAGCCCGACTAGTGGTAAAGGGTTTTACCCAGGTTGAAGGGATTGACTATGAGGAGACCTTTTCACCTGTTGCCATGGTAAAGTCAATCAGGATCTTTCTGGCCATTGTCGCGTATTATGACTACGAGGTATGACAGATGGACGTCAAGACAGCCTTTCTTAATGGCAAATTTGATGAAACCATCTACATGGACCAGTCCTAAGGGTTCATTGCCCAAGGACAAGAGCAAAAGGTTTGCCGACTTCATAGGTCTATTTATGGACTGAAACAAGGTTCGAGGTCTTGGAATATAAGGTTTGATGAGACGATCAAATCTTATGGCTTTGATCAAAATGTCGACGAGCCTTGTGTCTACAAGAAAATCGTTGACAAAACTGTCGCATTTTTAGTGTTGTATGTGGATGATATTCTTCTCATTGGAAATGAGGTAGAATTTCTTACTGACGTGAAGAAATGGCTAGCTTCGCAATTTCAAATGAAAGATTTGGGAGAAACTCAGTATGTTCTAGGTATCCAGATAGTCCGGAACCTGAAGAACAGAACGCTAGCCTTGTCTCAGGCGTCTTATATTGACAAGATGTTGTCTAGATATAAGATGCAGAACTCCAAGAAGGGCTTGTTGCCTTTCAGGCATGGGGTTCACTTGTCTAAGGATCAGTGTTTTAAGACTCCTCAAGATGTTGAGGATATGAGATGGATTCCATATGCTTCAGCTGTAGGGAGCCTGATGTATGTCATGTTGTGTACTAGGCCCGACATCTGTTATGCAATAGGGATTGTCAATAGGTATCAATCCAATCTAGAGATTAGATCTTTGGGACGGTAG

mRNA sequence

ATGTCGAGCTCGACTGCGTCGCGAAGTGTTCACGATGCATACGATCGACGGATCAGGGCCAATGAAAAGGCCAAGGACTATATCATTGCCAGCATGTCTGATGTTTTGGCAAAGAAGCATGAGCTGATGGTCACCGCTAAAGAGATCATGGAGTCCTTGCAGGAAATGTTTGGACAACAGTCTTTTCACGTCCGACATGACTCGCTCAAATACGTCTTCAACGCACGGATGGAAGAGGGGTCGTTTGTCCGTAAACATGTTCTAGACATGATAACCCACTTTAATCTAGCGAAGATGAATGGGGCTTCGATCGACGAGTCAAGCCAGAATTTCCAGTCCTTGATGAGGGTCAAGGCACCGGAATCTGAGGCAAATGTTACCTACAGGTCTTATCACAGGGGTTTGACCTCTGGGACTAAACATGTTGCTCCTTCACGCCCGAAAGGGAAGAAGAGGATGAAGAGGGGTAAAACTGACCGTGTTGCCGCCCAAAAAGGCAAGAAGGTCAAGGAAGTTGCAGAGAAAGGAAAGTGTTTCCACTGCAATGAGGGCGAACACTGGAAGAGGAACTGTCCCAAATTCGTAGCAGGGAGGAAGAATCAAGGATATAGAGCCAAAGAGCCCCTTGAGTTAGTACATTCTGGCCTCTGTGGTTCGATGAATGTTAAAGCTCGAGGTGGTTATGAATACTTCGTGTCTTTCATTGACGATTACTCGAGGTATGGGTATATTTACCTAATGCATAAGAAGTCTGAAACTCTTGAAAAGTTCAAGGAGTACAAGACTAAGGTTGAGAACCTCTTAGGTAAATCGCTTAAAACACTTCGATCGGATCGAGGTAGAGAGTACATGGACACTGAATTCCAGGACTATATGATAGAACACGAAATTACGTCCCAACTCTCAGCACCTGATTACCCTAAAGAGACTGGGGGTGGTCTATTTTACGATCCTAAGGAAAATAAGGTGCTTGTGTCGACAAACGTCATTTTCCTAGAGGAAGACCATGTCAGGGATCATTTACCAAGGAGTAAAATTGTGTTAAATGAAATGGACAGTACATCAGCAAGAGTTGCTGATGGGGCTAGTACATCAACAAGTGTTGTTGATCCTAACACGTCTAGTCAAATTAGTTCCCAAAAGTTGGGAATGCCTCGACGTAGTGGGAGGGTTGTGAGACAGCCTGATCGTTACATGGGTTTAGCTGAAACCTCAGTTGTCGCTTCTGATGATGACTGTGAGGATCCATTGACCTATGATCAGGCAATGGTTGATGTTGACAAAGACGAATGGATTAAAGATATGAACCAGGAAATGAGTCGATGTACTTCAATTCTGTCTGGGAGCTTGTGGATCAACCAGATGGGGTCTATTTATGGACTGAAACAAGGTTCGAGGTCTTGGAATATAAGGTTTGATGAGACGATCAAATCTTATGGCTTTGATCAAAATGTCGACGAGCCTTGTGTCTACAAGAAAATCGTTGACAAAACTGTCGCATTTTTAGTGTTGTATGTGGATGATATTCTTCTCATTGGAAATGAGGTAGAATTTCTTACTGACGTGAAGAAATGGCTAGCTTCGCAATTTCAAATGAAAGATTTGGGAGAAACTCAGTATGTTCTAGGTATCCAGATAGTCCGGAACCTGAAGAACAGAACGCTAGCCTTGTCTCAGGCGTCTTATATTGACAAGATGTTGTCTAGATATAAGATGCAGAACTCCAAGAAGGGCTTGTTGCCTTTCAGGCATGGGGTTCACTTGTCTAAGGATCAGTGTTTTAAGACTCCTCAAGATGTTGAGGATATGAGATGGATTCCATATGCTTCAGCTGTAGGGAGCCTGATGTATGTCATGTTGTGTACTAGGCCCGACATCTGTTATGCAATAGGGATTGTCAATAGGTATCAATCCAATCTAGAGATTAGATCTTTGGGACGGTAG

Coding sequence (CDS)

ATGTCGAGCTCGACTGCGTCGCGAAGTGTTCACGATGCATACGATCGACGGATCAGGGCCAATGAAAAGGCCAAGGACTATATCATTGCCAGCATGTCTGATGTTTTGGCAAAGAAGCATGAGCTGATGGTCACCGCTAAAGAGATCATGGAGTCCTTGCAGGAAATGTTTGGACAACAGTCTTTTCACGTCCGACATGACTCGCTCAAATACGTCTTCAACGCACGGATGGAAGAGGGGTCGTTTGTCCGTAAACATGTTCTAGACATGATAACCCACTTTAATCTAGCGAAGATGAATGGGGCTTCGATCGACGAGTCAAGCCAGAATTTCCAGTCCTTGATGAGGGTCAAGGCACCGGAATCTGAGGCAAATGTTACCTACAGGTCTTATCACAGGGGTTTGACCTCTGGGACTAAACATGTTGCTCCTTCACGCCCGAAAGGGAAGAAGAGGATGAAGAGGGGTAAAACTGACCGTGTTGCCGCCCAAAAAGGCAAGAAGGTCAAGGAAGTTGCAGAGAAAGGAAAGTGTTTCCACTGCAATGAGGGCGAACACTGGAAGAGGAACTGTCCCAAATTCGTAGCAGGGAGGAAGAATCAAGGATATAGAGCCAAAGAGCCCCTTGAGTTAGTACATTCTGGCCTCTGTGGTTCGATGAATGTTAAAGCTCGAGGTGGTTATGAATACTTCGTGTCTTTCATTGACGATTACTCGAGGTATGGGTATATTTACCTAATGCATAAGAAGTCTGAAACTCTTGAAAAGTTCAAGGAGTACAAGACTAAGGTTGAGAACCTCTTAGGTAAATCGCTTAAAACACTTCGATCGGATCGAGGTAGAGAGTACATGGACACTGAATTCCAGGACTATATGATAGAACACGAAATTACGTCCCAACTCTCAGCACCTGATTACCCTAAAGAGACTGGGGGTGGTCTATTTTACGATCCTAAGGAAAATAAGGTGCTTGTGTCGACAAACGTCATTTTCCTAGAGGAAGACCATGTCAGGGATCATTTACCAAGGAGTAAAATTGTGTTAAATGAAATGGACAGTACATCAGCAAGAGTTGCTGATGGGGCTAGTACATCAACAAGTGTTGTTGATCCTAACACGTCTAGTCAAATTAGTTCCCAAAAGTTGGGAATGCCTCGACGTAGTGGGAGGGTTGTGAGACAGCCTGATCGTTACATGGGTTTAGCTGAAACCTCAGTTGTCGCTTCTGATGATGACTGTGAGGATCCATTGACCTATGATCAGGCAATGGTTGATGTTGACAAAGACGAATGGATTAAAGATATGAACCAGGAAATGAGTCGATGTACTTCAATTCTGTCTGGGAGCTTGTGGATCAACCAGATGGGGTCTATTTATGGACTGAAACAAGGTTCGAGGTCTTGGAATATAAGGTTTGATGAGACGATCAAATCTTATGGCTTTGATCAAAATGTCGACGAGCCTTGTGTCTACAAGAAAATCGTTGACAAAACTGTCGCATTTTTAGTGTTGTATGTGGATGATATTCTTCTCATTGGAAATGAGGTAGAATTTCTTACTGACGTGAAGAAATGGCTAGCTTCGCAATTTCAAATGAAAGATTTGGGAGAAACTCAGTATGTTCTAGGTATCCAGATAGTCCGGAACCTGAAGAACAGAACGCTAGCCTTGTCTCAGGCGTCTTATATTGACAAGATGTTGTCTAGATATAAGATGCAGAACTCCAAGAAGGGCTTGTTGCCTTTCAGGCATGGGGTTCACTTGTCTAAGGATCAGTGTTTTAAGACTCCTCAAGATGTTGAGGATATGAGATGGATTCCATATGCTTCAGCTGTAGGGAGCCTGATGTATGTCATGTTGTGTACTAGGCCCGACATCTGTTATGCAATAGGGATTGTCAATAGGTATCAATCCAATCTAGAGATTAGATCTTTGGGACGGTAG

Protein sequence

MSSSTASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSFHVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMNGASIDESSQNFQSLMRVKAPESEANVTYRSYHRGLTSGTKHVAPSRPKGKKRMKRGKTDRVAAQKGKKVKEVAEKGKCFHCNEGEHWKRNCPKFVAGRKNQGYRAKEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTKVENLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAPDYPKETGGGLFYDPKENKVLVSTNVIFLEEDHVRDHLPRSKIVLNEMDSTSARVADGASTSTSVVDPNTSSQISSQKLGMPRRSGRVVRQPDRYMGLAETSVVASDDDCEDPLTYDQAMVDVDKDEWIKDMNQEMSRCTSILSGSLWINQMGSIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTLALSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRYQSNLEIRSLGR
Homology
BLAST of Tan0015873 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 164.5 bits (415), Expect = 4.1e-39
Identity = 80/184 (43.48%), Postives = 119/184 (64.67%), Query Frame = 0

Query: 457  SIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVY-KKIVDKTVAFLVLYVDDILLIGNE 516
            S+YGLKQ  R W ++FD  +KS  + +   +PCVY K+  +     L+LYVDD+L++G +
Sbjct: 959  SLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDDMLIVGKD 1018

Query: 517  VEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTLALSQASYIDKMLSRYKMQNS 576
               +  +K  L+  F MKDLG  Q +LG++IVR   +R L LSQ  YI+++L R+ M+N+
Sbjct: 1019 KGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLERFNMKNA 1078

Query: 577  KKGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNR 636
            K    P    + LSK  C  T ++  +M  +PY+SAVGSLMY M+CTRPDI +A+G+V+R
Sbjct: 1079 KPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDIAHAVGVVSR 1138

Query: 637  YQSN 640
            +  N
Sbjct: 1139 FLEN 1142


HSP 2 Score: 89.4 bits (220), Expect = 1.7e-16
Identity = 41/103 (39.81%), Postives = 64/103 (62.14%), Query Frame = 0

Query: 209 LELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTKVENLL 268
           L+LV+S +CG M +++ GG +YFV+FIDD SR  ++Y++  K +  + F+++   VE   
Sbjct: 481 LDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVERET 540

Query: 269 GKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAPDYPKETG 312
           G+ LK LRSD G EY   EF++Y   H I  + + P  P+  G
Sbjct: 541 GRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNG 583

BLAST of Tan0015873 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 107.8 bits (268), Expect = 4.5e-22
Identity = 62/189 (32.80%), Postives = 109/189 (57.67%), Query Frame = 0

Query: 457  SIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVYKKIVDK----TVAFLVLYVDDILLI 516
            +IYGLKQ +R W   F++ +K   F  +  + C+Y  I+DK       +++LYVDD+++ 
Sbjct: 1037 AIYGLKQAARCWFEVFEQALKECEFVNSSVDRCIY--ILDKGNINENIYVLLYVDDVVIA 1096

Query: 517  GNEVEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTLALSQASYIDKMLSRYKM 576
              ++  + + K++L  +F+M DL E ++ +GI+I   ++   + LSQ++Y+ K+LS++ M
Sbjct: 1097 TGDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRI--EMQEDKIYLSQSAYVKKILSKFNM 1156

Query: 577  QNSKKGLLPFRHGVH---LSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYA 636
            +N      P    ++   L+ D+   T          P  S +G LMY+MLCTRPD+  A
Sbjct: 1157 ENCNAVSTPLPSKINYELLNSDEDCNT----------PCRSLIGCLMYIMLCTRPDLTTA 1211

Query: 637  IGIVNRYQS 639
            + I++RY S
Sbjct: 1217 VNILSRYSS 1211


HSP 2 Score: 65.1 bits (157), Expect = 3.4e-09
Identity = 33/106 (31.13%), Postives = 53/106 (50.00%), Query Frame = 0

Query: 206 KEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTKVE 265
           K PL +VHS +CG +         YFV F+D ++ Y   YL+  KS+    F+++  K E
Sbjct: 478 KRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVTYLIKYKSDVFSMFQDFVAKSE 537

Query: 266 NLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAPDYPKETG 312
                 +  L  D GREY+  E + + ++  I+  L+ P  P+  G
Sbjct: 538 AHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVPHTPQLNG 583

BLAST of Tan0015873 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 87.0 bits (214), Expect = 8.3e-16
Identity = 58/180 (32.22%), Postives = 90/180 (50.00%), Query Frame = 0

Query: 457  SIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEV 516
            ++YGLKQ  R+W +     + + GF  +V +  ++     K++ ++++YVDDIL+ GN+ 
Sbjct: 1102 ALYGLKQAPRAWYVELRNYLLTIGFVNSVSDTSLFVLQRGKSIVYMLVYVDDILITGNDP 1161

Query: 517  EFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTLALSQASYIDKMLSRYKMQNSK 576
              L +    L+ +F +KD  E  Y LGI+  R      L LSQ  YI  +L+R  M  +K
Sbjct: 1162 TLLHNTLDNLSQRFSVKDHEELHYFLGIEAKR--VPTGLHLSQRRYILDLLARTNMITAK 1221

Query: 577  KGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRY 636
                P      LS     K     E      Y   VGSL Y+   TRPDI YA+  ++++
Sbjct: 1222 PVTTPMAPSPKLSLYSGTKLTDPTE------YRGIVGSLQYLAF-TRPDISYAVNRLSQF 1272


HSP 2 Score: 59.7 bits (143), Expect = 1.4e-07
Identity = 35/125 (28.00%), Postives = 64/125 (51.20%), Query Frame = 0

Query: 208 PLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTKVENL 267
           PLE ++S +  S  + +   Y Y+V F+D ++RY ++Y + +KS+  E F  +K  +EN 
Sbjct: 523 PLEYIYSDVWSS-PILSHDNYRYYVIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENR 582

Query: 268 LGKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAPDYPKETGGGLFYDPKENKVLVST 327
               + T  SD G E++     +Y  +H I+   S P  P+  G       ++++ +V T
Sbjct: 583 FQTRIGTFYSDNGGEFV--ALWEYFSQHGISHLTSPPHTPEHNG----LSERKHRHIVET 640

Query: 328 NVIFL 333
            +  L
Sbjct: 643 GLTLL 640

BLAST of Tan0015873 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 85.9 bits (211), Expect = 1.8e-15
Identity = 54/180 (30.00%), Postives = 90/180 (50.00%), Query Frame = 0

Query: 457  SIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEV 516
            +IYGLKQ  R+W +     + + GF  ++ +  ++     +++ ++++YVDDIL+ GN+ 
Sbjct: 1085 AIYGLKQAPRAWYVELRTYLLTVGFVNSISDTSLFVLQRGRSIIYMLVYVDDILITGNDT 1144

Query: 517  EFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTLALSQASYIDKMLSRYKMQNSK 576
              L      L+ +F +K+  +  Y LGI+  R    + L LSQ  Y   +L+R  M  +K
Sbjct: 1145 VLLKHTLDALSQRFSVKEHEDLHYFLGIEAKR--VPQGLHLSQRRYTLDLLARTNMLTAK 1204

Query: 577  KGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRY 636
                P      L+     K P   E      Y   VGSL Y+   TRPD+ YA+  +++Y
Sbjct: 1205 PVATPMATSPKLTLHSGTKLPDPTE------YRGIVGSLQYLAF-TRPDLSYAVNRLSQY 1255


HSP 2 Score: 62.8 bits (151), Expect = 1.7e-08
Identity = 35/112 (31.25%), Postives = 62/112 (55.36%), Query Frame = 0

Query: 200 NQGYRAKEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKE 259
           N    + +PLE ++S +  S  + +   Y Y+V F+D ++RY ++Y + +KS+  + F  
Sbjct: 494 NSTITSSKPLEYIYSDVWSS-PILSIDNYRYYVIFVDHFTRYTWLYPLKQKSQVKDTFII 553

Query: 260 YKTKVENLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAPDYPKETG 312
           +K+ VEN     + TL SD G E++    +DY+ +H I+   S P  P+  G
Sbjct: 554 FKSLVENRFQTRIGTLYSDNGGEFV--VLRDYLSQHGISHFTSPPHTPEHNG 602

BLAST of Tan0015873 vs. ExPASy Swiss-Prot
Match: P25600 (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY5A PE=5 SV=2)

HSP 1 Score: 84.3 bits (207), Expect = 5.4e-15
Identity = 55/181 (30.39%), Postives = 88/181 (48.62%), Query Frame = 0

Query: 456 GSIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNE 515
           G +YGLKQ    WN   + T+K  GF ++  E  +Y +       ++ +YVDD+L+    
Sbjct: 38  GGMYGLKQAPLLWNEHINNTLKKIGFCRHEGEHGLYFRSTSDGPIYIAVYVDDLLVAAPS 97

Query: 516 VEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTLALSQASYIDKMLSRYKMQNS 575
            +    VK+ L   + MKDLG+    LG+ I ++  N  + LS   YI K  S  ++   
Sbjct: 98  PKIYDRVKQELTKLYSMKDLGKVDKFLGLNIHQS-SNGDITLSLQDYIAKAASESEINTF 157

Query: 576 KKGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNR 635
           K    P  +    SK     T   ++D+   PY S VG L++     RPDI Y + +++R
Sbjct: 158 KLTQTPLCN----SKPLFETTSPHLKDI--TPYQSIVGQLLFCANTGRPDISYPVSLLSR 211

Query: 636 Y 637
           +
Sbjct: 218 F 211

BLAST of Tan0015873 vs. NCBI nr
Match: KAA0051952.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 718.4 bits (1853), Expect = 5.6e-203
Identity = 420/862 (48.72%), Postives = 513/862 (59.51%), Query Frame = 0

Query: 3   SSTASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSF 62
           ++ A+R+V + Y+R  +ANEKA+ YI+AS+S+VLAKKHE M+TA+EIM+SLQEMFGQ S+
Sbjct: 48  AANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASY 107

Query: 63  HVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMNGASIDESS-------------- 122
            ++HD+LKY++NARM EG+ VR+HVL+M+ HFN+A+MNGA IDE+S              
Sbjct: 108 QIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFL 167

Query: 123 ----------------------QNFQSLMRVKAPESEANV--TYRSYHRGLTSGTKHVAP 182
                                 Q F+SLM++K  + EANV  + R +HRG TSGTK +  
Sbjct: 168 QFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPS 227

Query: 183 SRPKGKKRMKRG----KTDRVAAQKGKKVKEVAEKGKCFHCNEGEHWKRNCPKFVAGRK- 242
           S    K + K+G    K +  AA+  KK K  A KG CFHCN+  HWKRNCPK++A +K 
Sbjct: 228 SSGNKKWKKKKGGQGNKANLAAAKTTKKAK--AAKGICFHCNQEGHWKRNCPKYLAEKKK 287

Query: 243 ------------NQGYRAKEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLM 302
                        +G++AKEPLELVHS LCG MNVKARGG+EYF++F DDYSRYGY+YLM
Sbjct: 288 AKQGKMTKRPFTGKGHKAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLM 347

Query: 303 HKKSETLEKFKEYKTKVENLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAP--- 362
             KSE LEKFKEYK +VEN L K++KT RSDRG EYMD +FQ+Y++E  I SQLS P   
Sbjct: 348 QHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQLSVPGTP 407

Query: 363 ------------------------------------------------------------ 422
                                                                       
Sbjct: 408 QQNGVSKRRNRTLLDMVRSMMSYTHLPNSFWGYAVQTAVYILNCVPSKSVSKTPLKLWNG 467

Query: 423 -----------------------------------DYPKETGGGLFYDPKENKVLVSTNV 482
                                               YPK T GG FYDPK+NKV VSTN 
Sbjct: 468 RKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNA 527

Query: 483 IFLEEDHVRDHLPRSKIVLNEMD----STSARVADGASTSTSVVDPNTSSQI-SSQKLGM 542
            FLEEDH+R+H PRSKIVLNE+       S RV +  S  T VV   +S++    Q L  
Sbjct: 528 TFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLRE 587

Query: 543 PRRSGRVVRQPDRYMGLAETSVVASDDDCEDPLTYDQAMVDVDKDEWIKDMNQEMSRC-- 602
           PRRSGRV   P  YM L ET  V SD D EDPLT+ +AM DVDKDEWIK MN E+     
Sbjct: 588 PRRSGRVTNLPIHYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESILV 647

Query: 603 ---------------------------------------------TSILSGSL----WIN 640
                                                        T+ L+G+L    ++ 
Sbjct: 648 AKGYTQVEGVDYEEIFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQ 707

BLAST of Tan0015873 vs. NCBI nr
Match: KAA0026233.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 697.6 bits (1799), Expect = 1.0e-196
Identity = 421/902 (46.67%), Postives = 511/902 (56.65%), Query Frame = 0

Query: 3    SSTASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSF 62
            ++ A+R+V + Y+R  +ANEKA+ YI+AS+S+VLAKKHE M+TA+EIM+SLQEMFGQ S+
Sbjct: 110  AANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASY 169

Query: 63   HVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMNGASIDESS-------------- 122
             ++HD+LKY++NARM EG+ VR+HVL+M+ HFN+A MN A IDE+S              
Sbjct: 170  QIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAVMNEAVIDEASQVSFILESLPESFL 229

Query: 123  ----------------------QNFQSLMRVKAPESEANV--TYRSYHRGLTSGTKHVAP 182
                                  Q F+SLM++K  + EANV  + R +HRG TSGTK +  
Sbjct: 230  QFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPS 289

Query: 183  SRPKGKKRMKRG----KTDRVAAQKGKKVKEVAEKGKCFHCNEGEHWKRNCPKFVAGRK- 242
            S    K + K+G    K +  AA+  KK K  A KG CF CN+  HWKRNCPK++A +K 
Sbjct: 290  SSGNKKWKKKKGGQGNKANLAAAKTTKKAK--AAKGICFLCNQEGHWKRNCPKYLAKKKK 349

Query: 243  ------------NQGYRAKEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLM 302
                         +G+RAKEPLELVHS LCG MNVKARGG+EYF++F DDYSRYGY+YLM
Sbjct: 350  AKQGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLM 409

Query: 303  HKKSETLEKFKEYKTKVENLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAPD-- 362
              KSE LEKFKEYK +VEN L K++KT RSDRG EYMD +FQ+Y++E  I SQLSAPD  
Sbjct: 410  QHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQLSAPDTP 469

Query: 363  ------------------------------------------------------------ 422
                                                                        
Sbjct: 470  QQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNG 529

Query: 423  ------------------------------------YPKETGGGLFYDPKENKVLVSTNV 482
                                                YPK T GG FYDPK+NKV VSTN 
Sbjct: 530  HKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNA 589

Query: 483  IFLEEDHVRDHLPRSKIVLNEMD----STSARVADGASTSTSVVDPNTSSQI-SSQKLGM 542
             FLEEDH+R+H PRSKIVLNE+       S RV +  S    VV   +S++    Q L  
Sbjct: 590  TFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALIRVVHVRSSTRTHQPQSLRE 649

Query: 543  PRRSGRVVRQPDRYMGLAETSVVASDDDCEDPLTYDQAMVDVDKDEWIKDMNQEMSRC-- 602
            PRRSGRV   P RYM L ET  V SD D EDPLT+ +AM DVDKDEWIK MN E+     
Sbjct: 650  PRRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYF 709

Query: 603  ------------------------------------------------------------ 640
                                                                        
Sbjct: 710  NSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFSPVA 769

BLAST of Tan0015873 vs. NCBI nr
Match: KAA0037371.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 682.2 bits (1759), Expect = 4.4e-192
Identity = 398/775 (51.35%), Postives = 489/775 (63.10%), Query Frame = 0

Query: 1   MSSSTASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQ 60
           +S++ A+R+V +AY+R  +ANEKA+ YI+AS+S+VLAKKHE M+TA+EIM+SLQEMFGQ 
Sbjct: 12  VSAANATRTVREAYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQT 71

Query: 61  SFHVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMNGASIDESS------------ 120
           S+ ++HD+LKY++NARM EG+ VR+HVL+M+ HFN+A+M  A IDE+S            
Sbjct: 72  SYQIKHDALKYIYNARMNEGASVREHVLNMMIHFNVAEMKEAVIDEASQIAYTLTTLLNE 131

Query: 121 -QNFQSLMRVKAPESEANV--TYRSYHRGLTSGTKHVAPSRPKGKKRMKRG----KTDRV 180
            Q F+SLM++K  + EANV  + R +HRG  SG K +  S    K + K+G    K +  
Sbjct: 132 LQTFESLMKIKGQKGEANVATSTRKFHRGSNSGNKFMPSSSGNKKWKKKKGGQGNKANLA 191

Query: 181 AAQKGKKVKEVAEKGKCFHCNEGEHWKRNCPKFVAGRK-------------NQGYRAKEP 240
           AA+  KK K+   KG CFHCN+  HWKRNCPK++A +K              +G+R KEP
Sbjct: 192 AAKTSKKAKDA--KGICFHCNQEGHWKRNCPKYLAEKKKAKQGKMTKRPFTGKGHRTKEP 251

Query: 241 LELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTKVENLL 300
           LELVHS LCG MNVKARGG+EYF++F DDYSRYGY+YLM  KSE LEKFKEYK +VEN L
Sbjct: 252 LELVHSNLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENAL 311

Query: 301 --------GKS-------LKTLRSDRGREYMDTEFQDYMIEHEITSQLSAP--------- 360
                   G S       L  +RS      +   F  Y ++  +      P         
Sbjct: 312 TPGTPQQNGVSERRNRTLLDMVRSIMSYTRLPNSFWGYAVQTAVYILNCVPSKSVSETPL 371

Query: 361 ----------------------------------------DYPKETGGGLFYDPKENKVL 420
                                                    YPK T GG FYDPK+NKV 
Sbjct: 372 KLWNGRKGSLRHFRIWGCPAHVLKNNPKKLEPRSKLCLFVSYPKGTRGGYFYDPKDNKVF 431

Query: 421 VSTNVIFLEEDHVRDHLPRSKIVLNEMD----STSARVADGASTSTSVVDPNTSSQI-SS 480
           VSTN  FLEED++R+H PRSKIVLNE+       S RV +  S  T VV   + ++    
Sbjct: 432 VSTNATFLEEDYIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSFTRTHQP 491

Query: 481 QKLGMPRRSGRVVRQPDRYMGLAETSVVASDDDCEDPLTYDQAMVDVDKDEWIKDMNQEM 540
           Q L  PRRSGRV   P RYM L ET  V  D D EDPLT+ +AM DVDKDEWIK MN E+
Sbjct: 492 QSLREPRRSGRVTNLPIRYMSLTETLTVIYDGDIEDPLTFKKAMEDVDKDEWIKAMNLEL 551

Query: 541 --------------SRCTSILS-------------GSLWINQ-------MGSIYGLKQGS 600
                         S  T+ L+             GS+ + Q         SIYGLKQ S
Sbjct: 552 ESMYFNSVWDLVDQSDETAFLNDNLEETIYMQQPEGSIILGQEQQVCKLNRSIYGLKQAS 611

Query: 601 RSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKW 641
           RSWNIRFD  IKSYGFDQ VDEPCVYK+I++  VAFLVLYV DILLIGN+V  LTD+K+W
Sbjct: 612 RSWNIRFDTAIKSYGFDQIVDEPCVYKRIINNLVAFLVLYVVDILLIGNDVGLLTDIKQW 671

BLAST of Tan0015873 vs. NCBI nr
Match: TYJ97618.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 680.2 bits (1754), Expect = 1.7e-191
Identity = 414/899 (46.05%), Postives = 502/899 (55.84%), Query Frame = 0

Query: 3   SSTASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSF 62
           ++ A+++V + Y+R  + NEK + YI+AS+S+VLAKKHE M+TA+EIM+SLQEMFGQ S+
Sbjct: 48  AANATQTVREPYERWAKGNEKGRAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASY 107

Query: 63  HVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMNGASIDESSQNFQSLMRVKAPES 122
            + HD+LKY++NARM EG+ VR+HVL+M+ HFN+A+MNGA IDE+SQ           + 
Sbjct: 108 QINHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQ---------GQKG 167

Query: 123 EANV--TYRSYHRGLTSGTKHVAPSRPKGKKRMKRG----KTDRVAAQKGKKVKEVAEKG 182
           EANV  + R +HRG TSGTK +  S    K + K+G    K +  AA+  KK K  A KG
Sbjct: 168 EANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKSK--ATKG 227

Query: 183 KCFHCNEGEHWKRNCPKFVAGRK------------------------------------- 242
            CFH N+  HWKRNCPK++A +K                                     
Sbjct: 228 ICFHYNQEGHWKRNCPKYLAEKKKAKQGHINLNRIERLVKNGILSELEENSLPICESCLE 287

Query: 243 ---------NQGYRAKEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLMHKK 302
                     +G+RAKEPLELVHS LCG MNVKARG +EYF++F DDYSRYGY+YLM  K
Sbjct: 288 GKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGEFEYFITFTDDYSRYGYVYLMQHK 347

Query: 303 SETLEKFKEYKTKVENLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAP------ 362
           SE LEKFKEYK +VEN L K++KT RSDRG EYMD +FQ+Y++E EI SQLSAP      
Sbjct: 348 SEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECEILSQLSAPGTPQQN 407

Query: 363 ------------------------------------------------------------ 422
                                                                       
Sbjct: 408 GVSERRNRTLLDMVRSMISYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKG 467

Query: 423 --------------------------------DYPKETGGGLFYDPKENKVLVSTNVIFL 482
                                            YPK T GG FYDPK+NKV VSTN  FL
Sbjct: 468 SLRHFRIWGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNATFL 527

Query: 483 EEDHVRDHLPRSKIVLNEMD----STSARVADGASTSTSVVDPNTSSQI-SSQKLGMPRR 542
           EEDH+R+H PRSKIVLNE+       S RV +  S  T VV   +S++    Q L  PRR
Sbjct: 528 EEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRR 587

Query: 543 SGRVVRQPDRYMGLAETSVVASDDDCEDPLTYDQAMVDVDKDEWIKDMNQEMSRC----- 602
           SGRV   P RYM L ET  V SD D EDPLT+ +AM DVDKDEWIK MN E+        
Sbjct: 588 SGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSV 647

Query: 603 ------------------------------------------------------------ 640
                                                                       
Sbjct: 648 WDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMLK 707

BLAST of Tan0015873 vs. NCBI nr
Match: ADJ18449.1 (gag/pol protein, partial [Bryonia dioica])

HSP 1 Score: 639.0 bits (1647), Expect = 4.3e-179
Identity = 430/1095 (39.27%), Postives = 521/1095 (47.58%), Query Frame = 0

Query: 6    ASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSFHVR 65
            A+R+V +AYDR ++AN+KA+ YI+ASM+DVLAKKH+ + TAK IM+SL+EMFGQ S+ +R
Sbjct: 51   ANRTVREAYDRWVKANDKARVYILASMTDVLAKKHDSIATAKGIMDSLREMFGQPSWSLR 110

Query: 66   HDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMNGASIDESS----------------- 125
            H+++K+++  RM+EG+ VR+HVLDM+ HFN+A++NG  IDE++                 
Sbjct: 111  HEAIKHIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIDEANQVSFILQSLPKSFVPFQ 170

Query: 126  -------------------QNFQSLMRVKAPESEAN--VTYRSYHRGLTSGTKHVAPSRP 185
                               Q FQ+L   K  E EAN  VT R + RG +S  K V PS+ 
Sbjct: 171  TNASLNKIEFNLTTLLNELQRFQNLTLSKGKEVEANVAVTKRKFIRGSSSKNK-VGPSKA 230

Query: 186  KGKKRMKRGKTDRVAAQKGKKVKEVAEKGKCFHCNEGEHWKRNCPKFVAGRK-------- 245
            + KK+ K GK     A    KVK+ A+KGKCFHCN+  HWKRNCPK++A +K        
Sbjct: 231  QMKKKGK-GK-----APNTSKVKKNADKGKCFHCNQDGHWKRNCPKYLAEKKAEKATQGK 290

Query: 246  ------------------------------------------------------------ 305
                                                                        
Sbjct: 291  YDLLVVETCLVECDASTWILDSGATNHICFSFQETSSWKKLKEGEITLKVGTGEVVSAEA 350

Query: 306  ------------------------------------------------------------ 365
                                                                        
Sbjct: 351  VGDLTLFFQDRYLILKDVLYVPLMKRNLISIACILEHIYTISFEVNEVFILCKGIQICSA 410

Query: 366  ------------------------------------------------------------ 425
                                                                        
Sbjct: 411  IRENNLYKLRPTRANVVLNTEMFRTLETQNKKQKVSSNAYLWHLRLGHINLNRIERLVKS 470

Query: 426  ----------------------------NQGYRAKEPLELVHSGLCGSMNVKARGGYEYF 485
                                         +G RAK PLELVHS LCG MNVKARGGYEYF
Sbjct: 471  GILNQLEDNSLPPCESCLEGKMTKRSFTGKGLRAKVPLELVHSDLCGPMNVKARGGYEYF 530

Query: 486  VSFIDDYSRYGYIYLMHKKSETLEKFKEYKTKVENLLGKSLKTLRSDRGREYMDTEFQDY 545
            +SFIDD+SRYG++YL+H KSE+ EKFKEYK +VEN +GK++KTLRSDRG EYMD++FQDY
Sbjct: 531  ISFIDDFSRYGHVYLLHHKSESFEKFKEYKAEVENEIGKTIKTLRSDRGGEYMDSKFQDY 590

Query: 546  MIEHEITSQLSAPD---------------------------------------------- 605
            +IE  I SQLSAP                                               
Sbjct: 591  LIEFGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSYAQLPDSFWGYALETAIHILNN 650

Query: 606  ----------------------------------------------------YPKETGGG 640
                                                                YPKE+ GG
Sbjct: 651  VPSKSVLETPYELWKGRKSSLRYFRIWGCPAHVLVQNPKKLEPRSKLCLFVGYPKESRGG 710

BLAST of Tan0015873 vs. ExPASy TrEMBL
Match: A0A5A7U869 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold60G004290 PE=4 SV=1)

HSP 1 Score: 718.4 bits (1853), Expect = 2.7e-203
Identity = 420/862 (48.72%), Postives = 513/862 (59.51%), Query Frame = 0

Query: 3   SSTASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSF 62
           ++ A+R+V + Y+R  +ANEKA+ YI+AS+S+VLAKKHE M+TA+EIM+SLQEMFGQ S+
Sbjct: 48  AANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASY 107

Query: 63  HVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMNGASIDESS-------------- 122
            ++HD+LKY++NARM EG+ VR+HVL+M+ HFN+A+MNGA IDE+S              
Sbjct: 108 QIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFL 167

Query: 123 ----------------------QNFQSLMRVKAPESEANV--TYRSYHRGLTSGTKHVAP 182
                                 Q F+SLM++K  + EANV  + R +HRG TSGTK +  
Sbjct: 168 QFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPS 227

Query: 183 SRPKGKKRMKRG----KTDRVAAQKGKKVKEVAEKGKCFHCNEGEHWKRNCPKFVAGRK- 242
           S    K + K+G    K +  AA+  KK K  A KG CFHCN+  HWKRNCPK++A +K 
Sbjct: 228 SSGNKKWKKKKGGQGNKANLAAAKTTKKAK--AAKGICFHCNQEGHWKRNCPKYLAEKKK 287

Query: 243 ------------NQGYRAKEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLM 302
                        +G++AKEPLELVHS LCG MNVKARGG+EYF++F DDYSRYGY+YLM
Sbjct: 288 AKQGKMTKRPFTGKGHKAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLM 347

Query: 303 HKKSETLEKFKEYKTKVENLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAP--- 362
             KSE LEKFKEYK +VEN L K++KT RSDRG EYMD +FQ+Y++E  I SQLS P   
Sbjct: 348 QHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQLSVPGTP 407

Query: 363 ------------------------------------------------------------ 422
                                                                       
Sbjct: 408 QQNGVSKRRNRTLLDMVRSMMSYTHLPNSFWGYAVQTAVYILNCVPSKSVSKTPLKLWNG 467

Query: 423 -----------------------------------DYPKETGGGLFYDPKENKVLVSTNV 482
                                               YPK T GG FYDPK+NKV VSTN 
Sbjct: 468 RKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNA 527

Query: 483 IFLEEDHVRDHLPRSKIVLNEMD----STSARVADGASTSTSVVDPNTSSQI-SSQKLGM 542
            FLEEDH+R+H PRSKIVLNE+       S RV +  S  T VV   +S++    Q L  
Sbjct: 528 TFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLRE 587

Query: 543 PRRSGRVVRQPDRYMGLAETSVVASDDDCEDPLTYDQAMVDVDKDEWIKDMNQEMSRC-- 602
           PRRSGRV   P  YM L ET  V SD D EDPLT+ +AM DVDKDEWIK MN E+     
Sbjct: 588 PRRSGRVTNLPIHYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESILV 647

Query: 603 ---------------------------------------------TSILSGSL----WIN 640
                                                        T+ L+G+L    ++ 
Sbjct: 648 AKGYTQVEGVDYEEIFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQ 707

BLAST of Tan0015873 vs. ExPASy TrEMBL
Match: A0A5A7SNP8 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold19G002160 PE=4 SV=1)

HSP 1 Score: 697.6 bits (1799), Expect = 4.9e-197
Identity = 421/902 (46.67%), Postives = 511/902 (56.65%), Query Frame = 0

Query: 3    SSTASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSF 62
            ++ A+R+V + Y+R  +ANEKA+ YI+AS+S+VLAKKHE M+TA+EIM+SLQEMFGQ S+
Sbjct: 110  AANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASY 169

Query: 63   HVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMNGASIDESS-------------- 122
             ++HD+LKY++NARM EG+ VR+HVL+M+ HFN+A MN A IDE+S              
Sbjct: 170  QIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAVMNEAVIDEASQVSFILESLPESFL 229

Query: 123  ----------------------QNFQSLMRVKAPESEANV--TYRSYHRGLTSGTKHVAP 182
                                  Q F+SLM++K  + EANV  + R +HRG TSGTK +  
Sbjct: 230  QFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPS 289

Query: 183  SRPKGKKRMKRG----KTDRVAAQKGKKVKEVAEKGKCFHCNEGEHWKRNCPKFVAGRK- 242
            S    K + K+G    K +  AA+  KK K  A KG CF CN+  HWKRNCPK++A +K 
Sbjct: 290  SSGNKKWKKKKGGQGNKANLAAAKTTKKAK--AAKGICFLCNQEGHWKRNCPKYLAKKKK 349

Query: 243  ------------NQGYRAKEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLM 302
                         +G+RAKEPLELVHS LCG MNVKARGG+EYF++F DDYSRYGY+YLM
Sbjct: 350  AKQGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLM 409

Query: 303  HKKSETLEKFKEYKTKVENLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAPD-- 362
              KSE LEKFKEYK +VEN L K++KT RSDRG EYMD +FQ+Y++E  I SQLSAPD  
Sbjct: 410  QHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQLSAPDTP 469

Query: 363  ------------------------------------------------------------ 422
                                                                        
Sbjct: 470  QQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNG 529

Query: 423  ------------------------------------YPKETGGGLFYDPKENKVLVSTNV 482
                                                YPK T GG FYDPK+NKV VSTN 
Sbjct: 530  HKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNA 589

Query: 483  IFLEEDHVRDHLPRSKIVLNEMD----STSARVADGASTSTSVVDPNTSSQI-SSQKLGM 542
             FLEEDH+R+H PRSKIVLNE+       S RV +  S    VV   +S++    Q L  
Sbjct: 590  TFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALIRVVHVRSSTRTHQPQSLRE 649

Query: 543  PRRSGRVVRQPDRYMGLAETSVVASDDDCEDPLTYDQAMVDVDKDEWIKDMNQEMSRC-- 602
            PRRSGRV   P RYM L ET  V SD D EDPLT+ +AM DVDKDEWIK MN E+     
Sbjct: 650  PRRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYF 709

Query: 603  ------------------------------------------------------------ 640
                                                                        
Sbjct: 710  NSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFSPVA 769

BLAST of Tan0015873 vs. ExPASy TrEMBL
Match: A0A5A7T706 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold278G00760 PE=4 SV=1)

HSP 1 Score: 682.2 bits (1759), Expect = 2.2e-192
Identity = 398/775 (51.35%), Postives = 489/775 (63.10%), Query Frame = 0

Query: 1   MSSSTASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQ 60
           +S++ A+R+V +AY+R  +ANEKA+ YI+AS+S+VLAKKHE M+TA+EIM+SLQEMFGQ 
Sbjct: 12  VSAANATRTVREAYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQT 71

Query: 61  SFHVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMNGASIDESS------------ 120
           S+ ++HD+LKY++NARM EG+ VR+HVL+M+ HFN+A+M  A IDE+S            
Sbjct: 72  SYQIKHDALKYIYNARMNEGASVREHVLNMMIHFNVAEMKEAVIDEASQIAYTLTTLLNE 131

Query: 121 -QNFQSLMRVKAPESEANV--TYRSYHRGLTSGTKHVAPSRPKGKKRMKRG----KTDRV 180
            Q F+SLM++K  + EANV  + R +HRG  SG K +  S    K + K+G    K +  
Sbjct: 132 LQTFESLMKIKGQKGEANVATSTRKFHRGSNSGNKFMPSSSGNKKWKKKKGGQGNKANLA 191

Query: 181 AAQKGKKVKEVAEKGKCFHCNEGEHWKRNCPKFVAGRK-------------NQGYRAKEP 240
           AA+  KK K+   KG CFHCN+  HWKRNCPK++A +K              +G+R KEP
Sbjct: 192 AAKTSKKAKDA--KGICFHCNQEGHWKRNCPKYLAEKKKAKQGKMTKRPFTGKGHRTKEP 251

Query: 241 LELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTKVENLL 300
           LELVHS LCG MNVKARGG+EYF++F DDYSRYGY+YLM  KSE LEKFKEYK +VEN L
Sbjct: 252 LELVHSNLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENAL 311

Query: 301 --------GKS-------LKTLRSDRGREYMDTEFQDYMIEHEITSQLSAP--------- 360
                   G S       L  +RS      +   F  Y ++  +      P         
Sbjct: 312 TPGTPQQNGVSERRNRTLLDMVRSIMSYTRLPNSFWGYAVQTAVYILNCVPSKSVSETPL 371

Query: 361 ----------------------------------------DYPKETGGGLFYDPKENKVL 420
                                                    YPK T GG FYDPK+NKV 
Sbjct: 372 KLWNGRKGSLRHFRIWGCPAHVLKNNPKKLEPRSKLCLFVSYPKGTRGGYFYDPKDNKVF 431

Query: 421 VSTNVIFLEEDHVRDHLPRSKIVLNEMD----STSARVADGASTSTSVVDPNTSSQI-SS 480
           VSTN  FLEED++R+H PRSKIVLNE+       S RV +  S  T VV   + ++    
Sbjct: 432 VSTNATFLEEDYIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSFTRTHQP 491

Query: 481 QKLGMPRRSGRVVRQPDRYMGLAETSVVASDDDCEDPLTYDQAMVDVDKDEWIKDMNQEM 540
           Q L  PRRSGRV   P RYM L ET  V  D D EDPLT+ +AM DVDKDEWIK MN E+
Sbjct: 492 QSLREPRRSGRVTNLPIRYMSLTETLTVIYDGDIEDPLTFKKAMEDVDKDEWIKAMNLEL 551

Query: 541 --------------SRCTSILS-------------GSLWINQ-------MGSIYGLKQGS 600
                         S  T+ L+             GS+ + Q         SIYGLKQ S
Sbjct: 552 ESMYFNSVWDLVDQSDETAFLNDNLEETIYMQQPEGSIILGQEQQVCKLNRSIYGLKQAS 611

Query: 601 RSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKW 641
           RSWNIRFD  IKSYGFDQ VDEPCVYK+I++  VAFLVLYV DILLIGN+V  LTD+K+W
Sbjct: 612 RSWNIRFDTAIKSYGFDQIVDEPCVYKRIINNLVAFLVLYVVDILLIGNDVGLLTDIKQW 671

BLAST of Tan0015873 vs. ExPASy TrEMBL
Match: A0A5A7UXG8 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold274G005010 PE=4 SV=1)

HSP 1 Score: 631.3 bits (1627), Expect = 4.4e-177
Identity = 370/739 (50.07%), Postives = 461/739 (62.38%), Query Frame = 0

Query: 3   SSTASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSF 62
           ++ A R+V +AY+RR +ANEKA+ YI+AS+S VLAKKHE M+TA+EIM+SLQEMFGQ S+
Sbjct: 48  AANAIRTVREAYERRAKANEKARAYILASLSKVLAKKHESMLTAREIMDSLQEMFGQASY 107

Query: 63  HVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMNGASIDESSQNFQSLMRVKAPES 122
            ++HD+LKY++NARM EG+ VR+HVL+M+ HFN+A+MNGA IDE+       +R KA  +
Sbjct: 108 QIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEA-------IRKKAKAA 167

Query: 123 EANVTYRSYHRGLTSGTKHVAPSRPKGKKRMKRGKTDRVA-------------AQKGKKV 182
           +       +H       K   P     KK+ K+G T+ V              A +   +
Sbjct: 168 KG----ICFHCNQEGHWKRNCPKYLAEKKKAKQGATNHVCSSFLGISSWRQLEAGEMTMI 227

Query: 183 KEVAEKGK-----------CFHCNEGEHWKRNCPKFVAGRKNQGYRAKEPLELVHSGLCG 242
           + + + G            C  C EG+  KR           +G+RAKEPLELVHS LC 
Sbjct: 228 ERLVKNGLLSGLEENSLPICESCLEGKMTKRPF-------TGKGHRAKEPLELVHSDLCD 287

Query: 243 SMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTKVENLLGKSLKTLRSD 302
            MNVKARGG+EYF++F DDYSRY Y+YLM  KSE LEKFKEYK +VEN L K++KT R +
Sbjct: 288 PMNVKARGGFEYFITFTDDYSRYVYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRLN 347

Query: 303 RGREYMDTEFQDYMIEHEITSQLSAP----------------------------DYPKET 362
           RG EYMD +FQ+Y++E  I SQL AP                             YPK T
Sbjct: 348 RGEEYMDLKFQNYLMECGIVSQLLAPVQTAVYILNCVPSKSIYETPLKLWNGRKGYPKGT 407

Query: 363 GGGLFYDPKENKVLVSTNVIFLEEDHVRDHLPRSKIVLNEMDS----TSARVADGASTST 422
            GG FYDPK+NKV VSTN  FLEEDH+R+H PRSKIVLNE+ +     S RV +  S  T
Sbjct: 408 TGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSNETTEPSTRVVEEPSALT 467

Query: 423 SVVDPNTSSQI-SSQKLGMPRRSGRVVRQPDRYMGLAETSVVASDDDCEDPLTYDQAMVD 482
            V    +S++    Q L  P+RSG                         +PLT+ +AM D
Sbjct: 468 RVFHVGSSTRTHQPQSLREPQRSG------------------------TNPLTFKKAMED 527

Query: 483 VDKDEWIKDMNQEMSRCTSILSGSLW---------------------------------- 542
           VDKDEWIK MN E+    S+   S+W                                  
Sbjct: 528 VDKDEWIKAMNLELE---SMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGDHLYATTR 587

Query: 543 -INQMG----------SIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVA 602
            I+  G          SIYGLKQ SRSWNIRFD  IKSYGFDQ+VDEPCVYK+I++  +A
Sbjct: 588 RIHNPGQEQKICKLNLSIYGLKQASRSWNIRFDTAIKSYGFDQSVDEPCVYKRIINNLIA 647

Query: 603 FLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTLALSQA 640
           FLVLYVD ILL GN++  LTD+K+WLA+QFQMKDLGE Q+VLGIQI ++ KN+TLALSQA
Sbjct: 648 FLVLYVDYILLTGNDIGLLTDIKQWLAAQFQMKDLGEAQFVLGIQIFKDRKNKTLALSQA 707

BLAST of Tan0015873 vs. ExPASy TrEMBL
Match: A0A5A7SMH8 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold219G002560 PE=4 SV=1)

HSP 1 Score: 627.5 bits (1617), Expect = 6.3e-176
Identity = 423/1105 (38.28%), Postives = 514/1105 (46.52%), Query Frame = 0

Query: 3    SSTASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSF 62
            ++ A+R+V + Y+R  +ANEKA+ YI+AS+S+VLAKKHE M+TA+EIM+SLQEMFGQ S+
Sbjct: 48   AANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASY 107

Query: 63   HVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMNGASIDESS-------------- 122
             ++HD+LKY++NARM EG+ VR+HVL+M+ HFN+A+MNGA IDE+S              
Sbjct: 108  QIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFL 167

Query: 123  ----------------------QNFQSLMRVKAPESEANV--TYRSYHRGLTSGTKHVAP 182
                                  Q F+SLM++K  + EANV  + R +HRG TSGTK +  
Sbjct: 168  QFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPS 227

Query: 183  SRPKGKKRMKRG----KTDRVAAQKGKKVKEVAEKGKCFHCNEGEHWKRNCPKFVAGRK- 242
            S    K + K+G    K +  AA+  KK K  A KG CFHCN+  HWKRNCPK++A +K 
Sbjct: 228  SSGNKKWKKKKGGQGNKANLAAAKTTKKAK--AAKGICFHCNQEGHWKRNCPKYLAEKKK 287

Query: 243  ------------------------------------------------------------ 302
                                                                        
Sbjct: 288  AKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHV 347

Query: 303  ------------------------------------------------------------ 362
                                                                        
Sbjct: 348  VSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGV 407

Query: 363  ------------------------------------------------------------ 422
                                                                        
Sbjct: 408  EICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNR 467

Query: 423  -----------------------------------NQGYRAKEPLELVHSGLCGSMNVKA 482
                                                +G+RAKEPLELVHS LCG MNVKA
Sbjct: 468  IERLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKA 527

Query: 483  RGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTKVENLLGKSLKTLRSDRGREYM 542
            RGG+EYF++F DDYSRYGY+YLM  KSE LEKFKEYK +VEN L K++KT RSDRG EYM
Sbjct: 528  RGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYM 587

Query: 543  DTEFQDYMIEHEITSQLSAP---------------------------------------- 602
            D +FQ+Y++E  I SQLSAP                                        
Sbjct: 588  DLKFQNYLMECGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQT 647

Query: 603  ----------------------------------------------------------DY 640
                                                                       Y
Sbjct: 648  AVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCLFVGY 707

BLAST of Tan0015873 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 84.3 bits (207), Expect = 3.8e-16
Identity = 54/180 (30.00%), Postives = 92/180 (51.11%), Query Frame = 0

Query: 457 SIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEV 516
           SIYGLKQ SR W ++F  T+  +GF Q+  +   + KI       +++YVDDI++  N  
Sbjct: 235 SIYGLKQASRQWFLKFSVTLIGFGFVQSHSDHTYFLKITATLFLCVLVYVDDIIICSNND 294

Query: 517 EFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTLALSQASYIDKMLSRYKMQNSK 576
             + ++K  L S F+++DLG  +Y LG++I R+     + + Q  Y   +L    +   K
Sbjct: 295 AAVDELKSQLKSCFKLRDLGPLKYFLGLEIARSAAG--INICQRKYALDLLDETGLLGCK 354

Query: 577 KGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRY 636
              +P    V  S      +  D  D +   Y   +G LMY+ + TR DI +A+  ++++
Sbjct: 355 PSSVPMDPSVTFSA----HSGGDFVDAK--AYRRLIGRLMYLQI-TRLDISFAVNKLSQF 405

BLAST of Tan0015873 vs. TAIR 10
Match: ATMG00810.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 70.1 bits (170), Expect = 7.4e-12
Identity = 52/134 (38.81%), Postives = 70/134 (52.24%), Query Frame = 0

Query: 501 FLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTLALSQA 560
           +L+LYVDDILL G+    L  +   L+S F MKDLG   Y LGIQI  +     L LSQ 
Sbjct: 2   YLLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSG--LFLSQT 61

Query: 561 SYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQC-FKTPQDVEDMRWIPYASAVGSLMYVM 620
            Y +++L+   M + K    P    ++ S     +  P D        + S VG+L Y+ 
Sbjct: 62  KYAEQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYPDPSD--------FRSIVGALQYLT 121

Query: 621 LCTRPDICYAIGIV 634
           L TRPDI YA+ IV
Sbjct: 122 L-TRPDISYAVNIV 124

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
P109784.1e-3943.48Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
P041464.5e-2232.80Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
Q94HW28.3e-1632.22Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
Q9ZT941.8e-1530.00Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
P256005.4e-1530.39Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT... [more]
Match NameE-valueIdentityDescription
KAA0051952.15.6e-20348.72gag/pol protein [Cucumis melo var. makuwa][more]
KAA0026233.11.0e-19646.67gag/pol protein [Cucumis melo var. makuwa][more]
KAA0037371.14.4e-19251.35gag/pol protein [Cucumis melo var. makuwa][more]
TYJ97618.11.7e-19146.05gag/pol protein [Cucumis melo var. makuwa][more]
ADJ18449.14.3e-17939.27gag/pol protein, partial [Bryonia dioica][more]
Match NameE-valueIdentityDescription
A0A5A7U8692.7e-20348.72Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold60G00429... [more]
A0A5A7SNP84.9e-19746.67Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold19G00216... [more]
A0A5A7T7062.2e-19251.35Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold278G0076... [more]
A0A5A7UXG84.4e-17750.07Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold274G0050... [more]
A0A5A7SMH86.3e-17638.28Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold219G0025... [more]
Match NameE-valueIdentityDescription
AT4G23160.13.8e-1630.00cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [more]
ATMG00810.17.4e-1238.81DNA/RNA polymerases superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001584Integrase, catalytic corePFAMPF00665rvecoord: 207..306
e-value: 4.9E-8
score: 33.2
IPR001584Integrase, catalytic corePROSITEPS50994INTEGRASEcoord: 204..311
score: 12.724917
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 457..581
e-value: 5.5E-23
score: 81.8
NoneNo IPR availablePFAMPF14223Retrotran_gag_2coord: 19..108
e-value: 1.3E-6
score: 28.2
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 136..158
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 357..383
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 144..158
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 357..389
NoneNo IPR availablePANTHERPTHR45895FAMILY NOT NAMEDcoord: 199..304
coord: 456..639
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 200..312
e-value: 1.9E-16
score: 62.1
IPR001878Zinc finger, CCHC-typePROSITEPS50158ZF_CCHCcoord: 177..193
score: 8.531888
IPR036875Zinc finger, CCHC-type superfamilySUPERFAMILY57756Retrovirus zinc finger-like domainscoord: 174..199
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 201..309

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0015873.1Tan0015873.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding