Tan0021760 (gene) Snake gourd v1

Overview
NameTan0021760
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
LocationLG07: 10418935 .. 10423528 (+)
RNA-Seq ExpressionTan0021760
SyntenyTan0021760
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ACTTTGTTAAGATGGTATCAGAGCGTTGAAAAGTCTAATGACTAAGAAAGAATTTTTTTTCCGTAATGATGAGCTCGTCCACCTCAGAAGAGGACCAAATGTTGGTATCTCTCTCGAAACTTGTTAATCCCGGAAGTACACCAAAGCTCGATGAGGAAAATTTCCTTCTATGGAAATTTCAGATTCTTATTACGTTGCGAGACCATGGACTGCAACACTACATTGAGGAAGGTTGTGAAGCCCCTGTGAAATACCTTCAATCAACTCACGAATCCTCTTTAGCAACATTGATTAATCCTGATTACGACAAATGGGTGCGACAAGACAATCTTATTACGACATGGCTTCTAGGAGCAATGTCTGCATCAATTGTTGGTGAATTGCTCGACTGCAAGACTTCTCGCAAGGTATGGGCTCATCTTTCTCCAAGATTTGCTTCCAAGAATATTGCTCGTGTTTTTGAATTGAAATCAAAACTTGAAGGATTAAAGAAAGGAGGTTTAGGATTGCAAGAATATTTTTCTAAAATCAAGACTGTAGTTGATGCCTTTACTGCTGTTGGGAAGCCAATATCACATGATGACCATATTCTTCATATATTTGCTGGTTTAGGATCTGAATATGATCCTACGGTATCAGTATTAGCTGGTAGTGATGAGACTCCTTCCTTACAGAAAGTTTATATAATGCTTCTTACCCAAGAGAGTAGAATACAACGGCATACCAGTGTTACTACTACATCTGATCCCACATTACCTTAAGTGAATCTAACACAATCAAAGGTTCAACAATCTCCACATCAGTCGACTTCGTTTGCTGATAATAGAAGATCAAAGCCTCAAGGTGGACAGAATCACGGGCAAAATCATGGACAAGGGAGTCAAGATTACAGGTCTAATCGTCGAATTGGAATAATCAGAAACCACAGTGCCAATTATGTGGCAAATTTGGGCACACTGCACTTCGGTGCTATTTTCGGTTTGAAAGATCATATCAAGGTCCAAATTCATCTTCCTCTAGTTCTTCGACATATCATCAACAACAATATAATCATCCATCAAATTTCGATCCCTCTATACCACATCAACAACAACAGGTTGCTGCCTATATTCTGAGTCATGAGATGAATAAGGAAAATCAATGGTTTCCTAATTCTGGGGCCTCGAATCATGTAACAAATGATGTGAACAATTTGTCCTTTGAACAAAATACAATGGTGATAGCAGAGTCCATATTGGAAATGGTATAGGTTTGAACATTCAAAATATTGGTACTTCCTATCTGAAATCTTCCTCTCACAATGTCTTTTCACTTAATAATTTTCTTCATATCTCACATATTACCAAAAATCTTATAAGTGTGAGTCAATTTGTTAAAGACAATAATGTTTTCTTGGAATTTCACCCCTTTACTTGCTTTGTGAAGGACATTACTATGGGCAAAACTCTTCTCCAAGGTGATGGACTTTATAGGTTTCAAATGACACAGGCCAATCTTTCCATTCTTTCTTCACAACATCATCAGTCTTCTATCTCTGCACCACAAGTACACAATGTCTCTCTATCCTCTTCATTAAATCAATCTGTATTCCATAAATGTACTTCACTTGAAAAATGGCACAATAGGCTAGGGCATTCCGCTATACCTATTGTTCAACATATTATGAGTATGTGTAATCTTGATACTTCAAATAAATCCTTTCATTTCTGTCATGCTTGTGCGGTTGGGAAGTCCCATAATCTTCCATTTCATGATTCCACATCTCACTATGATTTTCCTTTACAATTAATTGTTGTTGATGTATGGGGTCCCGCTTATGAGTCATCTAGGAATGGTTTCAAATATTATGTGAGTTTTGTTGATGTATTCTCACGGTATACCTTGATCTATTTCCTTAATAACAAATCAGAAGCTTTTTCTGCCTTTTTATTGTTTAAAACTCAAGTAGAAAAGATGTTCAATCGTTCTATCCTTAGCTTACAAACTGATAATGGGGGAGAATTTCGTTCTTTCATTCATTTTCTCAAAACAAATGGTATCACTCATAGAGTCACATGCCCCTACACTTCCCAACAAAATGGTATAGTCGAGAGAAAGCACCGCCACATAGTTGAAATGGGTCTTACACTCTTATCCTATGCTTCTCTTTCCATTAAATTCTGGGATGATGCATTTGCTACGGCTGTGTACTTAATTAATAGATTACCTACAAAAGTCCATCACACCGTTTCTCCCATGGAAAAACTGTTTGGTCATAAACCAACCTATGTAGATCTCAGGACTTTTGGATGTCTTTGTTATCCATCATTAAGAGCATACAATAAACATAAACTTGACCCTCGGTCTACTCCATGTCTCTTTCTCGGTTATAGCTCTTTTCACAAGGGATATAAGTGTCTTTCCTCATTTGGAAGAATGTATATATCGAGGCATGTAACATTTGATGAATCAACTTTTCCTTCTTTCAATTTTCTTTATCCTTCAGAGTCTAAGTCCAATACATTGCACCACAGTGTATTACCCATTGTACATTCAGATCCCACGCCTATCAACTTCCAACAGTCAAATCTACCACCCTCTTGTCCTTTATCTACTTTACCTATGGACCCCCTTATCACCAACTCTATCCCATCACCTCAAACTTTATCTCTACCCACTCCCCTAGACTCACACGCCCATCCTACACCTTGCCTGATTGCTTCTGGTAGTGTATCCAATATTTTGAATGTTCCACCAGTCACTGGGACGACAACCAAAAATAATACTCATACTTTGATTACTCGAGGGAAAGCTAGAGTTTTTAAGCCAAAGGTCTTCTTGGCTAATTATAAAGAGGTTGAACCTCCAAACGTCAAGGAAGCTCTCAAATGTGACCATTGGATCCAAGCCATGATAGATGAATATAGAGCTTTAATGAGCAACGATACTTGGTCTTTGCTGGATAGGCCAGTTAACAAGAAGATTATCGGATGTAAATGGGTGTTCAAGATCAAAAGGCACTCTGATGGATCGGTTGCAAGATATAAAGCACGATTAGTTGCCCAGGGATTTCACCAGCAAGCTGATGTTGATTATACAGAGACGTTTAGTCCTGTTGTCAAGCCCGTCACTATTCGAATCCTTTTCACATTAGCACTGACGAATGGCTGGAAATTGAGGCTGGTGGATATCAATAATGCATTCCTTCATGGTTTGCTATCTGAGGAAGTTTTTATGAGTCAACCCCTGGCATGGTAATATCCACCAAAGAAAAACAAGTGTGTAGACTGAAGAAAGCCTTGTATGGCCTGAAGCAAGCATCAAGAGCATGGTACGAACGGTTAAGCTCATTTCTATGCTCCCTTGGGTTCTCTCATTCAAAAGCTGACTCATCTCTTTTTATATATCAACATCATCATGTTATTTGCTACATTTTAGTGTATGTTGATGATATAGTAGTTGCTGGAAACTCTGATGATTTTGTTGACAATTTGCTGAAATAATTGAACATGAAATTCTCTCTTAAAGACCTGGGGTTATTGAGTTAGTTTCTTGGAGTTGAAGTCTCAGCAACACCAACTGGATCATTATTTCTTTCACAACTGAAGTATATTTCAGATCTTCTTCATAGAGCTAACATGAGTCATGCAAACCCGATTGCTACGCCTATGATAAGTGGATCAGTGTTATCTGCCTTTCAGGGAGAATCGTTTCAGGATGTTCATCTTTACAGAAGTATTGTGGGAGCTTTACAATATGTTACCATTACAAGACCAGAGATACTTACAGTGTTAACAAAATCTATCAATTTATGCAATCTCCTACAGTGCATCATTGGCAGACAGTCAAGAGGATATTGTGATACTTGAAGGGCACTTTGAATCATGGCCTAGTTTTCCATAAATCGACTGAATTGGTGCTGCAAGGGTATGCCGATGCTGATTGGGCCTCTGATCCAGATGACAGGAAGTCCACCTCAGGGTTTTGTATATATTTTGGTGGCAATCTGATACAATGATCATCGAAGAAACAAGGAATCATATCCCGATCGAGTACTGAAGCAGAATATAGAAGTTTGGCTCACATATCTGCGGATGTGGTTTGGATACAATCCTTATTTTCTGAGTTAAACATCAGGTTAGCTTTTATGCCAAGATTATGGTGCGATAATCTAAGTGCTGTTCATTTAAGTCCTAATCCGATTTTACACTCAAGAACTAAGCATGTAGAGATTGATATATACTTTGTTCGAGATCTTGTATTCCAGAAACGTCTGCAGATTTCTCATCTTCCGGCTTCAGCTCAAGTGGCTGACATCTTTACTAAACCTTTGTCTGCTTCGAAGTTTTTGGCTCTTACACACAAGCTCAATGTTTGCTCTTCAGTTGACATTGGCTTGGGGGATGTTACGAGAGCGCATTAAAGGTTTAGTTAGAATTTTATAATTTTATCAGTTTCATAAACTTACGATAGTTATAACTACTTTTCTGAGTTGTTGTACTTAATAGAGAGCCTATAAATATGGCATATGTATTCTTTGAGAATGTGAAAAGGAAATCTTATCCTTTGTGATCATATTCACTTTGTTAAGAAATATTAATTACCATTATAAAAA

mRNA sequence

ACTTTGTTAAGATGGTATCAGAGCGTTGAAAAGTCTAATGACTAAGAAAGAATTTTTTTTCCGTAATGATGAGCTCGTCCACCTCAGAAGAGGACCAAATGTTGGTATCTCTCTCGAAACTTGTTAATCCCGGAAGTACACCAAAGCTCGATGAGGAAAATTTCCTTCTATGGAAATTTCAGATTCTTATTACGTTGCGAGACCATGGACTGCAACACTACATTGAGGAAGGTTGTGAAGCCCCTGTGAAATACCTTCAATCAACTCACGAATCCTCTTTAGCAACATTGATTAATCCTGATTACGACAAATGGGTGCGACAAGACAATCTTATTACGACATGGCTTCTAGGAGCAATGTCTGCATCAATTGTTGGTGAATTGCTCGACTGCAAGACTTCTCGCAAGGATCTGAATATGATCCTACGGTATCAGTATTAGCTGGTAGTGATGAGACTCCTTCCTTACAGAAAGTTTATATAATGCTTCTTACCCAAGAGAGTAGAATACAACGGCATACCAGTGTTACTACTACATCTGATCCCACATTACCTTAAGTGAATCTAACACAATCAAAGGTTCAACAATCTCCACATCAGTCGACTTCGTTTGCTGATAATAGAAGATCAAAGCCTCAAGGTGGACAGAATCACGGGCAAAATCATGGACAAGGGAGTCAAGATTACAGGTCTAATCGTCGAATTGGAATAATCAGAAACCACAGTGCCAATTATGTGGCAAATTTGGGCACACTGCACTTCGGTGCTATTTTCGGTTTGAAAGATCATATCAAGGTCCAAATTCATCTTCCTCTAGTTCTTCGACATATCATCAACAACAATATAATCATCCATCAAATTTCGATCCCTCTATACCACATCAACAACAACAGGTTGCTGCCTATATTCTGAGTCATGAGATGAATAAGGAAAATCAATGGTTTCCTAATTCTGGGGCCTCGAATCATGTAACAAATGATGTGAACAATTTGTCCTTTGAACAAAATACAATGGTGATAGCAGAGTCCATATTGGAAATGGTATAGGTTTGAACATTCAAAATATTGGTACTTCCTATCTGAAATCTTCCTCTCACAATGTCTTTTCACTTAATAATTTTCTTCATATCTCACATATTACCAAAAATCTTATAAGTGTGAGTCAATTTGTTAAAGACAATAATGTTTTCTTGGAATTTCACCCCTTTACTTGCTTTGTGAAGGACATTACTATGGGCAAAACTCTTCTCCAAGGTGATGGACTTTATAGGTTTCAAATGACACAGGCCAATCTTTCCATTCTTTCTTCACAACATCATCAGTCTTCTATCTCTGCACCACAAGTACACAATGTCTCTCTATCCTCTTCATTAAATCAATCTGTATTCCATAAATGTACTTCACTTGAAAAATGGCACAATAGGCTAGGGCATTCCGCTATACCTATTGTTCAACATATTATGAGTATGTGTAATCTTGATACTTCAAATAAATCCTTTCATTTCTGTCATGCTTGTGCGGTTGGGAAGTCCCATAATCTTCCATTTCATGATTCCACATCTCACTATGATTTTCCTTTACAATTAATTGTTGTTGATGTATGGGGTCCCGCTTATGAGTCATCTAGGAATGGTTTCAAATATTATGTGAGTTTTGTTGATGTATTCTCACGGTATACCTTGATCTATTTCCTTAATAACAAATCAGAAGCTTTTTCTGCCTTTTTATTGTTTAAAACTCAAGTAGAAAAGATGTTCAATCGTTCTATCCTTAGCTTACAAACTGATAATGGGGGAGAATTTCGTTCTTTCATTCATTTTCTCAAAACAAATGGTATCACTCATAGAGTCACATGCCCCTACACTTCCCAACAAAATGGTATAGTCGAGAGAAAGCACCGCCACATAGTTGAAATGGGTCTTACACTCTTATCCTATGCTTCTCTTTCCATTAAATTCTGGGATGATGCATTTGCTACGGCTGTGTACTTAATTAATAGATTACCTACAAAAGTCCATCACACCGTTTCTCCCATGGAAAAACTGTTTGGTCATAAACCAACCTATGTAGATCTCAGGACTTTTGGATGTCTTTGTTATCCATCATTAAGAGCATACAATAAACATAAACTTGACCCTCGGTCTACTCCATGTCTCTTTCTCGGTTATAGCTCTTTTCACAAGGGATATAAGTGTCTTTCCTCATTTGGAAGAATGTATATATCGAGGCATGTAACATTTGATGAATCAACTTTTCCTTCTTTCAATTTTCTTTATCCTTCAGAGTCTAAGTCCAATACATTGCACCACAGTGTATTACCCATTGTACATTCAGATCCCACGCCTATCAACTTCCAACAGTCAAATCTACCACCCTCTTGTCCTTTATCTACTTTACCTATGGACCCCCTTATCACCAACTCTATCCCATCACCTCAAACTTTATCTCTACCCACTCCCCTAGACTCACACGCCCATCCTACACCTTGCCTGATTGCTTCTGGTAGTGTATCCAATATTTTGAATGTTCCACCAGTCACTGGGACGACAACCAAAAATAATACTCATACTTTGATTACTCGAGGGAAAGCTAGAGTTTTTAAGCCAAAGGTCTTCTTGGCTAATTATAAAGAGGTTGAACCTCCAAACGTCAAGGAAGCTCTCAAATGTGACCATTGGATCCAAGCCATGATAGATGAATATAGAGCTTTAATGAGCAACGATACTTGGTCTTTGCTGGATAGGCCAGTTAACAAGAAGATTATCGGATGTAAATGGGTGTTCAAGATCAAAAGGCACTCTGATGGATCGGTTGCAAGATATAAAGCACGATTAGTTGCCCAGGGATTTCACCAGCAAGCTGATGTTGATTATACAGAGACGTTTAGTCCTGTTGTCAAGCCCGTCACTATTCGAATCCTTTTCACATTAGCACTGACGAATGGCTGGAAATTGAGGCTGGTGGATATCAATAATGCATTCCTTCATGGTTTGCTATCTGAGGAAGTTTTTATGAGTCAACCCCTGGCATGGTAATATCCACCAAAGAAAAACAAGTGTGTAGACTGAAGAAAGCCTTGTATGGCCTGAAGCAAGCATCAAGAGCATGGTACGAACGGTTAAGCTCATTTCTATGCTCCCTTGGGTTCTCTCATTCAAAAGCTGACTCATCTCTTTTTATATATCAACATCATCATGTTATTTGCTACATTTTAGTGTATGTTGATGATATAGTAGTTGCTGGAAACTCTGATGATTTTGTTGACAATTTGCTGAAATAATTGAACATGAAATTCTCTCTTAAAGACCTGGGGTTATTGAGTTAGTTTCTTGGAGTTGAAGTCTCAGCAACACCAACTGGATCATTATTTCTTTCACAACTGAAGTATATTTCAGATCTTCTTCATAGAGCTAACATGAGTCATGCAAACCCGATTGCTACGCCTATGATAAGTGGATCAGTGTTATCTGCCTTTCAGGGAGAATCGTTTCAGGATGTTCATCTTTACAGAAGTATTGTGGGAGCTTTACAATATGTTACCATTACAAGACCAGAGATACTTACAGTGTTAACAAAATCTATCAATTTATGCAATCTCCTACAGTGCATCATTGGCAGACAGTCAAGAGGATATTGTGATACTTGAAGGGCACTTTGAATCATGGCCTAGTTTTCCATAAATCGACTGAATTGGTGCTGCAAGGGTATGCCGATGCTGATTGGGCCTCTGATCCAGATGACAGGAAGTCCACCTCAGGGTTTTGTATATATTTTGGTGGCAATCTGATACAATGATCATCGAAGAAACAAGGAATCATATCCCGATCGAGTACTGAAGCAGAATATAGAAGTTTGGCTCACATATCTGCGGATGTGGTTTGGATACAATCCTTATTTTCTGAGTTAAACATCAGGTTAGCTTTTATGCCAAGATTATGGTGCGATAATCTAAGTGCTGTTCATTTAAGTCCTAATCCGATTTTACACTCAAGAACTAAGCATGTAGAGATTGATATATACTTTGTTCGAGATCTTGTATTCCAGAAACGTCTGCAGATTTCTCATCTTCCGGCTTCAGCTCAAGTGGCTGACATCTTTACTAAACCTTTGTCTGCTTCGAAGTTTTTGGCTCTTACACACAAGCTCAATGTTTGCTCTTCAGTTGACATTGGCTTGGGGGATGTTACGAGAGCGCATTAAAGGTTTAGTTAGAATTTTATAATTTTATCAGTTTCATAAACTTACGATAGTTATAACTACTTTTCTGAGTTGTTGTACTTAATAGAGAGCCTATAAATATGGCATATGTATTCTTTGAGAATGTGAAAAGGAAATCTTATCCTTTGTGATCATATTCACTTTGTTAAGAAATATTAATTACCATTATAAAAA

Coding sequence (CDS)

ATGGGCAAAACTCTTCTCCAAGGTGATGGACTTTATAGGTTTCAAATGACACAGGCCAATCTTTCCATTCTTTCTTCACAACATCATCAGTCTTCTATCTCTGCACCACAAGTACACAATGTCTCTCTATCCTCTTCATTAAATCAATCTGTATTCCATAAATGTACTTCACTTGAAAAATGGCACAATAGGCTAGGGCATTCCGCTATACCTATTGTTCAACATATTATGAGTATGTGTAATCTTGATACTTCAAATAAATCCTTTCATTTCTGTCATGCTTGTGCGGTTGGGAAGTCCCATAATCTTCCATTTCATGATTCCACATCTCACTATGATTTTCCTTTACAATTAATTGTTGTTGATGTATGGGGTCCCGCTTATGAGTCATCTAGGAATGGTTTCAAATATTATGTGAGTTTTGTTGATGTATTCTCACGGTATACCTTGATCTATTTCCTTAATAACAAATCAGAAGCTTTTTCTGCCTTTTTATTGTTTAAAACTCAAGTAGAAAAGATGTTCAATCGTTCTATCCTTAGCTTACAAACTGATAATGGGGGAGAATTTCGTTCTTTCATTCATTTTCTCAAAACAAATGGTATCACTCATAGAGTCACATGCCCCTACACTTCCCAACAAAATGGTATAGTCGAGAGAAAGCACCGCCACATAGTTGAAATGGGTCTTACACTCTTATCCTATGCTTCTCTTTCCATTAAATTCTGGGATGATGCATTTGCTACGGCTGTGTACTTAATTAATAGATTACCTACAAAAGTCCATCACACCGTTTCTCCCATGGAAAAACTGTTTGGTCATAAACCAACCTATGTAGATCTCAGGACTTTTGGATGTCTTTGTTATCCATCATTAAGAGCATACAATAAACATAAACTTGACCCTCGGTCTACTCCATGTCTCTTTCTCGGTTATAGCTCTTTTCACAAGGGATATAAGTGTCTTTCCTCATTTGGAAGAATGTATATATCGAGGCATGTAACATTTGATGAATCAACTTTTCCTTCTTTCAATTTTCTTTATCCTTCAGAGTCTAAGTCCAATACATTGCACCACAGTGTATTACCCATTGTACATTCAGATCCCACGCCTATCAACTTCCAACAGTCAAATCTACCACCCTCTTGTCCTTTATCTACTTTACCTATGGACCCCCTTATCACCAACTCTATCCCATCACCTCAAACTTTATCTCTACCCACTCCCCTAGACTCACACGCCCATCCTACACCTTGCCTGATTGCTTCTGGTAGTGTATCCAATATTTTGAATGTTCCACCAGTCACTGGGACGACAACCAAAAATAATACTCATACTTTGATTACTCGAGGGAAAGCTAGAGTTTTTAAGCCAAAGGTCTTCTTGGCTAATTATAAAGAGGTTGAACCTCCAAACGTCAAGGAAGCTCTCAAATGTGACCATTGGATCCAAGCCATGATAGATGAATATAGAGCTTTAATGAGCAACGATACTTGGTCTTTGCTGGATAGGCCAGTTAACAAGAAGATTATCGGATGTAAATGGGTGTTCAAGATCAAAAGGCACTCTGATGGATCGGTTGCAAGATATAAAGCACGATTAGTTGCCCAGGGATTTCACCAGCAAGCTGATGTTGATTATACAGAGACGTTTAGTCCTGTTGTCAAGCCCGTCACTATTCGAATCCTTTTCACATTAGCACTGACGAATGGCTGGAAATTGAGGCTGGTGGATATCAATAATGCATTCCTTCATGGTTTGCTATCTGAGGAAGTTTTTATGAGTCAACCCCTGGCATGGTAA

Protein sequence

MGKTLLQGDGLYRFQMTQANLSILSSQHHQSSISAPQVHNVSLSSSLNQSVFHKCTSLEKWHNRLGHSAIPIVQHIMSMCNLDTSNKSFHFCHACAVGKSHNLPFHDSTSHYDFPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNGGEFRSFIHFLKTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNKHKLDPRSTPCLFLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFPSFNFLYPSESKSNTLHHSVLPIVHSDPTPINFQQSNLPPSCPLSTLPMDPLITNSIPSPQTLSLPTPLDSHAHPTPCLIASGSVSNILNVPPVTGTTTKNNTHTLITRGKARVFKPKVFLANYKEVEPPNVKEALKCDHWIQAMIDEYRALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNGWKLRLVDINNAFLHGLLSEEVFMSQPLAW
Homology
BLAST of Tan0021760 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 400.2 bits (1027), Expect = 4.1e-110
Identity = 248/639 (38.81%), Postives = 337/639 (52.74%), Query Frame = 0

Query: 41   VSLSSSLNQSVFHKCTSLEKWHNRLGHSAIPIVQHIMSMCNLDTSNKSFHF--CHACAVG 100
            VSL +S +    H       WH RLGH A  I+  ++S  +L   N S  F  C  C + 
Sbjct: 452  VSLFASPSSKATH-----SSWHARLGHPAPSILNSVISNYSLSVLNPSHKFLSCSDCLIN 511

Query: 101  KSHNLPFHDSTSHYDFPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLNNKS 160
            KS+ +PF  ST +   PL+ I  DVW     S  N ++YYV FVD F+RYT +Y L  KS
Sbjct: 512  KSNKVPFSQSTINSTRPLEYIYSDVWSSPILSHDN-YRYYVIFVDHFTRYTWLYPLKQKS 571

Query: 161  EAFSAFLLFKTQVEKMFNRSILSLQTDNGGEFRSFIHFLKTNGITHRVTCPYTSQQNGIV 220
            +    F+ FK  +E  F   I +  +DNGGEF +   +   +GI+H  + P+T + NG+ 
Sbjct: 572  QVKETFITFKNLLENRFQTRIGTFYSDNGGEFVALWEYFSQHGISHLTSPPHTPEHNGLS 631

Query: 221  ERKHRHIVEMGLTLLSYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTY 280
            ERKHRHIVE GLTLLS+AS+   +W  AFA AVYLINRLPT +    SP +KLFG  P Y
Sbjct: 632  ERKHRHIVETGLTLLSHASIPKTYWPYAFAVAVYLINRLPTPLLQLESPFQKLFGTSPNY 691

Query: 281  VDLRTFGCLCYPSLRAYNKHKLDPRSTPCLFLGYSSFHKGYKCLS-SFGRMYISRHVTFD 340
              LR FGC CYP LR YN+HKLD +S  C+FLGYS     Y CL     R+YISRHV FD
Sbjct: 692  DKLRVFGCACYPWLRPYNQHKLDDKSRQCVFLGYSLTQSAYLCLHLQTSRLYISRHVRFD 751

Query: 341  ESTFPSFNFL--------YPSESKSNTLHHSVLPI---VHSDPTPINFQQSNLPPSCP-- 400
            E+ FP  N+L           ES      H+ LP    V   P+  +   +  PPS P  
Sbjct: 752  ENCFPFSNYLATLSPVQEQRRESSCVWSPHTTLPTRTPVLPAPSCSDPHHAATPPSSPSA 811

Query: 401  ------LSTLPMDPLITNSIPS-----------PQTLSLPTPLDSHAH------------ 460
                  +S+  +D   ++S PS           PQ  + PT   +  H            
Sbjct: 812  PFRNSQVSSSNLDSSFSSSFPSSPEPTAPRQNGPQPTTQPTQTQTQTHSSQNTSQNNPTN 871

Query: 461  --------------------PTPCLIASGSVSN------ILNVPPVTGTTTKN------N 520
                                P+P   AS S ++      +++ PP       N      N
Sbjct: 872  ESPSQLAQSLSTPAQSSSSSPSPTTSASSSSTSPTPPSILIHPPPPLAQIVNNNNQAPLN 931

Query: 521  THTLITRGKARVFKP----KVFLANYKEVEPPNVKEALKCDHWIQAMIDEYRALMSNDTW 580
            TH++ TR KA + KP     + ++   E EP    +ALK + W  AM  E  A + N TW
Sbjct: 932  THSMGTRAKAGIIKPNPKYSLAVSLAAESEPRTAIQALKDERWRNAMGSEINAQIGNHTW 991

Query: 581  SLLDRPVNK-KIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPV 598
             L+  P +   I+GC+W+F  K +SDGS+ RYKARLVA+G++Q+  +DY ETFSPV+K  
Sbjct: 992  DLVPPPPSHVTIVGCRWIFTKKYNSDGSLNRYKARLVAKGYNQRPGLDYAETFSPVIKST 1051

BLAST of Tan0021760 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 396.0 bits (1016), Expect = 7.7e-109
Identity = 247/623 (39.65%), Postives = 333/623 (53.45%), Query Frame = 0

Query: 61   WHNRLGHSAIPIVQHIMSMCNLDTSNKSFHF--CHACAVGKSHNLPFHDSTSHYDFPLQL 120
            WH+RLGH ++ I+  ++S  +L   N S     C  C + KSH +PF +ST     PL+ 
Sbjct: 446  WHSRLGHPSLAILNSVISNHSLPVLNPSHKLLSCSDCFINKSHKVPFSNSTITSSKPLEY 505

Query: 121  IVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNRS 180
            I  DVW     S  N ++YYV FVD F+RYT +Y L  KS+    F++FK+ VE  F   
Sbjct: 506  IYSDVWSSPILSIDN-YRYYVIFVDHFTRYTWLYPLKQKSQVKDTFIIFKSLVENRFQTR 565

Query: 181  ILSLQTDNGGEFRSFIHFLKTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLLSYASL 240
            I +L +DNGGEF     +L  +GI+H  + P+T + NG+ ERKHRHIVEMGLTLLS+AS+
Sbjct: 566  IGTLYSDNGGEFVVLRDYLSQHGISHFTSPPHTPEHNGLSERKHRHIVEMGLTLLSHASV 625

Query: 241  SIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNKH 300
               +W  AF+ AVYLINRLPT +    SP +KLFG  P Y  L+ FGC CYP LR YN+H
Sbjct: 626  PKTYWPYAFSVAVYLINRLPTPLLQLQSPFQKLFGQPPNYEKLKVFGCACYPWLRPYNRH 685

Query: 301  KLDPRSTPCLFLGYSSFHKGYKCLS-SFGRMYISRHVTFDESTFP--------SFNFLYP 360
            KL+ +S  C F+GYS     Y CL    GR+Y SRHV FDE  FP        S +    
Sbjct: 686  KLEDKSKQCAFMGYSLTQSAYLCLHIPTGRLYTSRHVQFDERCFPFSTTNFGVSTSQEQR 745

Query: 361  SESKSNTLHHSVLPIV--------------------HSDPTPI---NFQQSNLP------ 420
            S+S  N   H+ LP                       S P+P+       SNLP      
Sbjct: 746  SDSAPNWPSHTTLPTTPLVLPAPPCLGPHLDTSPRPPSSPSPLCTTQVSSSNLPSSSISS 805

Query: 421  -----PSCPLSTLPM--------------DPLITN---SIPSP----QTLSLP-TPLDSH 480
                 P+ P    P                P++ N   + PSP    Q   LP +P+ S 
Sbjct: 806  PSSSEPTAPSHNGPQPTAQPHQTQNSNSNSPILNNPNPNSPSPNSPNQNSPLPQSPISSP 865

Query: 481  AHPTPCLI-------ASGSVSN-----ILNVPPVTGTTTKN--NTHTLITRGKARVFKPK 540
              PTP          +S S S      +L  PP+     +   NTH++ TR K  + KP 
Sbjct: 866  HIPTPSTSISEPNSPSSSSTSTPPLPPVLPAPPIIQVNAQAPVNTHSMATRAKDGIRKPN 925

Query: 541  VFLANYKEV----EPPNVKEALKCDHWIQAMIDEYRALMSNDTWSLL-DRPVNKKIIGCK 598
               +    +    EP    +A+K D W QAM  E  A + N TW L+   P +  I+GC+
Sbjct: 926  QKYSYATSLAANSEPRTAIQAMKDDRWRQAMGSEINAQIGNHTWDLVPPPPPSVTIVGCR 985

BLAST of Tan0021760 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 256.1 bits (653), Expect = 9.6e-67
Identity = 172/553 (31.10%), Postives = 262/553 (47.38%), Query Frame = 0

Query: 57  SLEKWHNRLGHSAIPIVQHIMSMCNLD-TSNKSFHFCHACAVGKSHNLPFHDSTSHYDFP 116
           S++ WH R+GH +   +Q +     +      +   C  C  GK H + F  S+      
Sbjct: 421 SVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQTSSERKLNI 480

Query: 117 LQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMF 176
           L L+  DV GP    S  G KY+V+F+D  SR   +Y L  K + F  F  F   VE+  
Sbjct: 481 LDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVERET 540

Query: 177 NRSILSLQTDNGGEF--RSFIHFLKTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLL 236
            R +  L++DNGGE+  R F  +  ++GI H  T P T Q NG+ ER +R IVE   ++L
Sbjct: 541 GRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVRSML 600

Query: 237 SYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLR 296
             A L   FW +A  TA YLINR P+       P       + +Y  L+ FGC  +  + 
Sbjct: 601 RMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFAHVP 660

Query: 297 AYNKHKLDPRSTPCLFLGYSSFHKGYKCLSSFGRMYI-SRHVTFDESTFPSFNFLYPSES 356
              + KLD +S PC+F+GY     GY+      +  I SR V F ES       +  +  
Sbjct: 661 KEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESE------VRTAAD 720

Query: 357 KSNTLHHSVLPIVHSDPTPINFQQSNLPPSCPLSTLPMDPLITNSIPSPQTLSLPTPLDS 416
            S  + + ++P   + P+      SN P S   +T   D +        + +     LD 
Sbjct: 721 MSEKVKNGIIPNFVTIPS-----TSNNPTSAESTT---DEVSEQGEQPGEVIEQGEQLDE 780

Query: 417 HAHPTPCLIASGSVSNILNVPPVTGTTTKNNTHTLITRG-----KARVFKPKVFLANYKE 476
                              V  V   T     H  + R      ++R +    ++    +
Sbjct: 781 ------------------GVEEVEHPTQGEEQHQPLRRSERPRVESRRYPSTEYVLISDD 840

Query: 477 VEPPNVKEAL---KCDHWIQAMIDEYRALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSD 536
            EP ++KE L   + +  ++AM +E  +L  N T+ L++ P  K+ + CKWVFK+K+  D
Sbjct: 841 REPESLKEVLSHPEKNQLMKAMQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGD 900

Query: 537 GSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNGWKLRLVDINNAFL 596
             + RYKARLV +GF Q+  +D+ E FSPVVK  +IR + +LA +   ++  +D+  AFL
Sbjct: 901 CKLVRYKARLVVKGFEQKKGIDFDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFL 941

Query: 597 HGLLSEEVFMSQP 598
           HG L EE++M QP
Sbjct: 961 HGDLEEEIYMEQP 941

BLAST of Tan0021760 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 207.2 bits (526), Expect = 5.1e-52
Identity = 172/611 (28.15%), Postives = 257/611 (42.06%), Query Frame = 0

Query: 61   WHNRLGHSA------IPIVQHIMSMCNLDTSNKSFHFCHACAVGKSHNLPFHD--STSHY 120
            WH R GH +      I           L+    S   C  C  GK   LPF      +H 
Sbjct: 418  WHERFGHISDGKLLEIKRKNMFSDQSLLNNLELSCEICEPCLNGKQARLPFKQLKDKTHI 477

Query: 121  DFPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVE 180
              PL ++  DV GP    + +   Y+V FVD F+ Y + Y +  KS+ FS F  F  + E
Sbjct: 478  KRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVTYLIKYKSDVFSMFQDFVAKSE 537

Query: 181  KMFNRSILSLQTDNGGEFRS--FIHFLKTNGITHRVTCPYTSQQNGIVERKHRHIVEMGL 240
              FN  ++ L  DNG E+ S     F    GI++ +T P+T Q NG+ ER  R I E   
Sbjct: 538  AHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVPHTPQLNGVSERMIRTITEKAR 597

Query: 241  TLLSYASLSIKFWDDAFATAVYLINRLPTK--VHHTVSPMEKLFGHKPTYVDLRTFGCLC 300
            T++S A L   FW +A  TA YLINR+P++  V  + +P E     KP    LR FG   
Sbjct: 598  TMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYEMWHNKKPYLKHLRVFGATV 657

Query: 301  YPSLRAYNKH-KLDPRSTPCLFLGYSSFHKGYKCLSSFGRMYI-SRHVTFDESTFPS--- 360
            Y  ++  NK  K D +S   +F+GY     G+K   +    +I +R V  DE+   +   
Sbjct: 658  YVHIK--NKQGKFDDKSFKSIFVGYEP--NGFKLWDAVNEKFIVARDVVVDETNMVNSRA 717

Query: 361  --FNFLYPSESK----------SNTLHHSVLPIVHSDPTPINF-------QQSNLPPSCP 420
              F  ++  +SK          S  +  +  P    +   I F       +  N P    
Sbjct: 718  VKFETVFLKDSKESENKNFPNDSRKIIQTEFPNESKECDNIQFLKDSKESENKNFPNDSR 777

Query: 421  LSTLPMDPLITNSIPSPQTL-------------SLPTPLDSHAHPT-----PCLIASGSV 480
                   P  +    + Q L             S     D H + +     P        
Sbjct: 778  KIIQTEFPNESKECDNIQFLKDSKESNKYFLNESKKRKRDDHLNESKGSGNPNESRESET 837

Query: 481  SNILNVPPVTGTTTKNNTHTLITRGKARVFKPKVFLANYKEVEPPNVKEALKC------- 540
            +  L    +   T  +    +  R +    KP++   +Y E +    K  L         
Sbjct: 838  AEHLKEIGIDNPTKNDGIEIINRRSERLKTKPQI---SYNEEDNSLNKVVLNAHTIFNDV 897

Query: 541  -------------DHWIQAMIDEYRALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSDGS 598
                           W +A+  E  A   N+TW++  RP NK I+  +WVF +K +  G+
Sbjct: 898  PNSFDEIQYRDDKSSWEEAINTELNAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGN 957

BLAST of Tan0021760 vs. ExPASy Swiss-Prot
Match: P92520 (Uncharacterized mitochondrial protein AtMg00820 OS=Arabidopsis thaliana OX=3702 GN=AtMg00820 PE=4 SV=1)

HSP 1 Score: 116.7 bits (291), Expect = 9.1e-25
Identity = 62/125 (49.60%), Postives = 81/125 (64.80%), Query Frame = 0

Query: 447 LITRGKARVFK--PKVFLANYKEV--EPPNVKEALKCDHWIQAMIDEYRALMSNDTWSLL 506
           ++TR KA + K  PK  L     +  EP +V  ALK   W QAM +E  AL  N TW L+
Sbjct: 1   MLTRSKAGINKLNPKYSLTITTTIKKEPKSVIFALKDPGWCQAMQEELDALSRNKTWILV 60

Query: 507 DRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRI 566
             PVN+ I+GCKWVFK K HSDG++ R KARLVA+GFHQ+  + + ET+SPVV+  TIR 
Sbjct: 61  PPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGIYFVETYSPVVRTATIRT 120

Query: 567 LFTLA 568
           +  +A
Sbjct: 121 ILNVA 125

BLAST of Tan0021760 vs. NCBI nr
Match: KAG8479334.1 (hypothetical protein CXB51_029681 [Gossypium anomalum])

HSP 1 Score: 524.2 bits (1349), Expect = 1.4e-144
Identity = 289/606 (47.69%), Postives = 378/606 (62.38%), Query Frame = 0

Query: 3    KTLLQG---DGLYRFQMTQANLSILSSQHHQSSISAPQVHNVSLSSSLNQSVFHKCTSLE 62
            +TLL G   +GLYRF   ++N +                 + +L  S  Q         +
Sbjct: 493  QTLLCGSELNGLYRFDTVKSNFAF-------------NTESTALPESFQQQ--GSDVQFD 552

Query: 63   KWHNRLGHSAIPIVQHIMSMCNLDTS-NKSFHFCHACAVGKSHNLPFHDSTSHYDFPLQL 122
            +WH RLGH +  +V+ I++ CN+  S  K++  C+AC +GK H LPF  S   Y  PL+L
Sbjct: 553  RWHRRLGHPSWDVVRSILTSCNIRISTRKNYTLCNACELGKGHKLPFRSSGCVYTAPLEL 612

Query: 123  IVVDVWGPA-YESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNR 182
            +V DVWGPA Y SS  G++YY+SFVD FSR+T IYFL  KS+AFSAFL FK  VE     
Sbjct: 613  VVADVWGPAPYFSS--GYQYYLSFVDSFSRHTWIYFLKKKSDAFSAFLSFKKYVELQLGV 672

Query: 183  SILSLQTDNGGEFRSFIHFLKTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLLSYAS 242
             +  LQTD GGEFRSF  +LK   I HRV+CP+TS+QNG+VE +HR IVE GL LL+ AS
Sbjct: 673  KLKQLQTDGGGEFRSFDVYLKQCRIGHRVSCPHTSEQNGLVEHRHRQIVETGLVLLAQAS 732

Query: 243  LSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNK 302
            L I +W DAFATAVY++NRLPTK    VSP E+LFGHKP Y  LR FGCLCYP LR YN+
Sbjct: 733  LPISYWADAFATAVYIMNRLPTKSLPGVSPCEQLFGHKPDYQQLRVFGCLCYPLLRPYNR 792

Query: 303  HKLDPRSTPCLFLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFP----SFNFLYPSESK 362
            HKL  RS PC FLGY++ H+GYKC+  +GR+YISRHV FDE T+P    S + +   +S+
Sbjct: 793  HKLQYRSAPCTFLGYATNHRGYKCVDRYGRVYISRHVRFDEDTYPFAQLSKSVVSVPDSR 852

Query: 363  SNTLHHSV--LPIVHSDPTPINFQQSNLPPSCPLSTLPMDPLITNSIPSPQTLSLPTPLD 422
            S      V  LPI  + P       +N+  S P+ + P       S  SP   SL  P  
Sbjct: 853  SGQFMRDVTSLPIFMTSP-------ANISESSPVDSTPA------SNSSPVDTSLMDPTS 912

Query: 423  SHAHPTPCLIASGSVSNILNVPPVTGTTTKNNTHTLITRGKARVFKPKVFLANYKEVEPP 482
            S++     LI     S+++            N H ++TR K  ++KPK ++A   +VEP 
Sbjct: 913  SNSEDHLALIVEQQGSSLI------------NRHPMMTRSKMGIYKPKTYMAVVSDVEPL 972

Query: 483  NVKEALKCDHWIQAMIDEYRALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYK 542
             + EA+    W QA+ DE +AL+ N TW L+  PVN+ ++GCKW+FKIKR+SDGSVAR K
Sbjct: 973  TIHEAMAIPSWKQAVNDELQALIRNRTWDLVSVPVNQSLVGCKWLFKIKRNSDGSVARNK 1032

Query: 543  ARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNGWKLRLVDINNAFLHGLLSEE 598
             RLVAQGF Q A +DY ETFS VVK  T+R++  LA++  WKLR VD+NNAFL+G L E+
Sbjct: 1033 VRLVAQGFSQAAGLDYHETFSLVVKINTVRLILALAVSRKWKLRQVDVNNAFLNGDLVED 1056

BLAST of Tan0021760 vs. NCBI nr
Match: RVW60229.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 518.5 bits (1334), Expect = 7.9e-143
Identity = 292/626 (46.65%), Postives = 392/626 (62.62%), Query Frame = 0

Query: 5    LLQGD---GLYRFQMTQ------ANLSILSSQHHQSSISAPQVHNVSLS-SSLNQSVFHK 64
            LLQG+   GLY+F +++      + LS+ + ++  +  +A  VHN +        S FH 
Sbjct: 598  LLQGNLHKGLYQFNLSKKLFGKASGLSLSNDKNELTCCNASLVHNDNSDFPEKTNSSFH- 657

Query: 65   CTSLEKWHNRLGHSAIPIVQHIMSMCNLDTSNKS-FHFCHACAVGKSHNLPFHDSTSHYD 124
                + WH RLGH A  IV  +++   +  S KS    C AC +GKSHNLPF  S + Y 
Sbjct: 658  --VFDLWHKRLGHPASKIVTQVLNDNKIPFSTKSGSSICSACQLGKSHNLPFPISQTVYT 717

Query: 125  FPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEK 184
             PLQL+V D+WGPA  +S  GF YYVSFVD +SRYT +YFL  KS+   AFL+FK Q E 
Sbjct: 718  KPLQLVVSDLWGPAPINSSYGFTYYVSFVDAYSRYTWVYFLKTKSQTREAFLMFKAQAEL 777

Query: 185  MFNRSILSLQTDNGGEFRSFIHFLKTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLL 244
             F   + + QTD GGEFRS   + + NGI HR++CP+TS+QNGI+ERKHRHIVE+GLTLL
Sbjct: 778  QFGCKLKTFQTDWGGEFRSLKTYFEQNGIIHRLSCPHTSKQNGIIERKHRHIVELGLTLL 837

Query: 245  SYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLR 304
            + ASL +K+W DAF+TAV+LINRLPT+V     P E LF  KP Y  L+ FGCLC+P LR
Sbjct: 838  AQASLPLKYWPDAFSTAVFLINRLPTEVLKQKCPYEFLFNSKPNYSQLKVFGCLCFPHLR 897

Query: 305  AYNKHKLDPRSTPCLFLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFPSFNFLY-PSES 364
             YNKHKLD RS+PC FLGYSS HKGYKCL+  GRM+ISR V FDE+ FP  + L  P + 
Sbjct: 898  PYNKHKLDFRSSPCTFLGYSSKHKGYKCLNQQGRMFISRSVVFDETRFPFADRLQKPVQI 957

Query: 365  KS-NTLHHSVLPIVHSDPTPINFQQS-NLPPSCPLSTLPMDPLITNSIPSPQ-------- 424
             S +T+    +P+V  +  P++   S +LP S   S+  +D  + + I S Q        
Sbjct: 958  VSHSTVGLPCIPLV-KNLEPLSVSPSLSLPTSSAQSSHQLDENLGSDIRSVQQDLSNTDS 1017

Query: 425  ---------TLSLPTPLDSHAHP--TPCLIASGSVSNILNVPPVTGTTTKNNTHTLITRG 484
                     + S+P+  + +A P   P    S   +  +N  PV   T     H ++TR 
Sbjct: 1018 SSTVPILNESASIPSSSNLYALPGTIPLSTNSDEPNESINTRPV---TFPQQPHHMVTRS 1077

Query: 485  KARVFKPKVFLANYKEVEPPNVKEALKCDHWIQAMIDEYRALMSNDTWSLLDRPVNKKII 544
            K  +FKPKV+  +    EP   +EA+    W +AM +E+RALM N TWSL+  P N+  +
Sbjct: 1078 KNGIFKPKVYTVDLNVEEPNTFQEAISHPKWKEAMDEEFRALMKNKTWSLVSLPTNRTSV 1137

Query: 545  GCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNG 598
            GC+WVFK+KR+ DGSV+RYKARLVA+G+ Q    D+ ETFSPVVKP TIR++  +A++  
Sbjct: 1138 GCRWVFKLKRNPDGSVSRYKARLVAKGYSQVPGFDFYETFSPVVKPTTIRVVLAIAVSQS 1197

BLAST of Tan0021760 vs. NCBI nr
Match: RVW44519.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 518.5 bits (1334), Expect = 7.9e-143
Identity = 292/626 (46.65%), Postives = 392/626 (62.62%), Query Frame = 0

Query: 5    LLQGD---GLYRFQMTQ------ANLSILSSQHHQSSISAPQVHNVSLS-SSLNQSVFHK 64
            LLQG+   GLY+F +++      + LS+ + ++  +  +A  VHN +        S FH 
Sbjct: 462  LLQGNLHKGLYQFNLSKKLFGKASGLSLSNDKNELTCCNASLVHNDNSDFPEKTNSSFH- 521

Query: 65   CTSLEKWHNRLGHSAIPIVQHIMSMCNLDTSNKS-FHFCHACAVGKSHNLPFHDSTSHYD 124
                + WH RLGH A  IV  +++   +  S KS    C AC +GKSHNLPF  S + Y 
Sbjct: 522  --VFDLWHKRLGHPASKIVTQVLNDNKIPFSTKSGSSICSACQLGKSHNLPFPISQTVYT 581

Query: 125  FPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEK 184
             PLQL+V D+WGPA  +S  GF YYVSFVD +SRYT +YFL  KS+   AFL+FK Q E 
Sbjct: 582  KPLQLVVSDLWGPAPINSSYGFTYYVSFVDAYSRYTWVYFLKTKSQTREAFLMFKAQAEL 641

Query: 185  MFNRSILSLQTDNGGEFRSFIHFLKTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLL 244
             F   + + QTD GGEFRS   + + NGI HR++CP+TS+QNGI+ERKHRHIVE+GLTLL
Sbjct: 642  QFGCKLKTFQTDWGGEFRSLKTYFEQNGIIHRLSCPHTSKQNGIIERKHRHIVELGLTLL 701

Query: 245  SYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLR 304
            + ASL +K+W DAF+TAV+LINRLPT+V     P E LF  KP Y  L+ FGCLC+P LR
Sbjct: 702  AQASLPLKYWPDAFSTAVFLINRLPTEVLKQKCPYEFLFNSKPNYSQLKVFGCLCFPHLR 761

Query: 305  AYNKHKLDPRSTPCLFLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFPSFNFLY-PSES 364
             YNKHKLD RS+PC FLGYSS HKGYKCL+  GRM+ISR V FDE+ FP  + L  P + 
Sbjct: 762  PYNKHKLDFRSSPCTFLGYSSKHKGYKCLNQQGRMFISRSVVFDETRFPFADRLQKPVQI 821

Query: 365  KS-NTLHHSVLPIVHSDPTPINFQQS-NLPPSCPLSTLPMDPLITNSIPSPQ-------- 424
             S +T+    +P+V  +  P++   S +LP S   S+  +D  + + I S Q        
Sbjct: 822  VSHSTVGLPCIPLV-KNLEPLSVSPSLSLPTSSAQSSHQLDENLGSDIRSVQQDLSNTDS 881

Query: 425  ---------TLSLPTPLDSHAHP--TPCLIASGSVSNILNVPPVTGTTTKNNTHTLITRG 484
                     + S+P+  + +A P   P    S   +  +N  PV   T     H ++TR 
Sbjct: 882  SSTVPILNESASIPSSSNLYALPGTIPLSTNSDEPNESINTRPV---TFPQQPHHMVTRS 941

Query: 485  KARVFKPKVFLANYKEVEPPNVKEALKCDHWIQAMIDEYRALMSNDTWSLLDRPVNKKII 544
            K  +FKPKV+  +    EP   +EA+    W +AM +E+RALM N TWSL+  P N+  +
Sbjct: 942  KNGIFKPKVYTVDLNVEEPNTFQEAISHPKWKEAMDEEFRALMKNKTWSLVSLPTNRTSV 1001

Query: 545  GCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNG 598
            GC+WVFK+KR+ DGSV+RYKARLVA+G+ Q    D+ ETFSPVVKP TIR++  +A++  
Sbjct: 1002 GCRWVFKLKRNPDGSVSRYKARLVAKGYSQVPGFDFYETFSPVVKPTTIRVVLAIAVSQS 1061

BLAST of Tan0021760 vs. NCBI nr
Match: CAN81099.1 (hypothetical protein VITISV_017741 [Vitis vinifera])

HSP 1 Score: 513.1 bits (1320), Expect = 3.3e-141
Identity = 276/599 (46.08%), Postives = 374/599 (62.44%), Query Frame = 0

Query: 9    DGLYRFQMTQANLSILSSQHHQSSISAPQVHNVSLSSSLNQSVFHKCTSLEKWHNRLGHS 68
            DGLY F  +   L    S     S+ A    +   ++SL+       ++ + WH RLGH 
Sbjct: 479  DGLYAFDSSHLALRPTQSLSKSPSVVASSFSSKVCTTSLS-------STFDLWHKRLGHP 538

Query: 69   AIPIVQHIMSMCNLDTSNK-SFHFCHACAVGKSHNLPFHDSTSHYDFPLQLIVVDVWGPA 128
            +   +++++S CN+   NK   +FC +C +GK H  PF  S + Y  PL+LI +D+WGP 
Sbjct: 539  SAATIKNVLSKCNVAHINKMDSNFCSSCCLGKIHRFPFSLSHTTYTKPLELIHLDLWGPT 598

Query: 129  YESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNG 188
               S +G++YY+ FVD FSR++ I+ L NKSEA   F+ FKTQVE  F+  I SLQTD G
Sbjct: 599  LVLSNSGYRYYIHFVDAFSRFSWIFLLRNKSEAIKTFVNFKTQVELQFDLKIKSLQTDWG 658

Query: 189  GEFRSFIHFLKTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAF 248
            GEFR+F  +L  NGI HRV+CP+T QQNG+ ERKHR IVE GLTLL  ASL +KFWD++F
Sbjct: 659  GEFRAFQSYLAENGIVHRVSCPHTQQQNGVAERKHRTIVEHGLTLLHTASLPLKFWDESF 718

Query: 249  ATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNKHKLDPRSTPC 308
             T VYL NRLPT + H   P+E LF   P Y  L+ FGC C+P+LR YN HKL  RS  C
Sbjct: 719  RTVVYLSNRLPTAILHHKCPIEVLFKSIPDYSFLKVFGCSCFPNLRPYNTHKLQYRSEEC 778

Query: 309  LFLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFPSFNFLYPSESKSNTLHHSVLPIVHS 368
             FLGYS  HKGYKC+SS GR+YIS  V F+E++FP    +  S    +T+  S   +  S
Sbjct: 779  TFLGYSLKHKGYKCMSSNGRVYISHDVIFNETSFPYSKTIQVSSCLLSTVSPSTSHLSPS 838

Query: 369  DPTPINFQQSNLPPSCPLSTL----PMDPLITNSIPSP---QTLSLPTPLDSHAHPTPCL 428
               P+        P+ P+S+      MD +++    +P    T   P  + S+   TP  
Sbjct: 839  ASPPVLSPTMLPTPTSPISSARPISEMDNIVSTHPHAPNSADTTLTPAQVVSNPVATPVQ 898

Query: 429  IASGSVSNILNVPPVTGTTTK--NNTHTLITRGKARVFKPKVFLANYKEVEPPNVKEALK 488
                S+++      VT T  K  +NTH +ITR K+ + KPK+F+A  +  EP +V  AL+
Sbjct: 899  HVVSSIAD----ASVTRTIAKDADNTHPMITRAKSGIVKPKIFIAAIR--EPSSVSAALQ 958

Query: 489  CDHWIQAMIDEYRALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQG 548
             D W +AM+ EY AL  N+TWSL+  P  ++ IGCKWV+K K + DG+V +YKARLVA+G
Sbjct: 959  QDEWKKAMVAEYDALQRNNTWSLVPLPAGRQAIGCKWVYKTKENPDGTVQKYKARLVAKG 1018

Query: 549  FHQQADVDYTETFSPVVKPVTIRILFTLALTNGWKLRLVDINNAFLHGLLSEEVFMSQP 598
            FHQQA  D+TETFSPVVKP T+R++FT+AL+  W ++ +D+NNAFL+G L EEVFM QP
Sbjct: 1019 FHQQAGFDFTETFSPVVKPSTVRVVFTIALSRNWAIKQLDVNNAFLNGDLQEEVFMQQP 1064

BLAST of Tan0021760 vs. NCBI nr
Match: RVX14937.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 510.8 bits (1314), Expect = 1.6e-140
Identity = 281/599 (46.91%), Postives = 376/599 (62.77%), Query Frame = 0

Query: 9    DGLYRFQMTQANLSILSSQHHQSSISAPQVHNVSLSSSLNQSVFHKCTSLEKWHNRLGHS 68
            DGLY F     + S L+ +  QS   +P V   S SS +   +    ++ + WH RLG  
Sbjct: 436  DGLYAF-----DSSHLALRPTQSLSKSPSVVASSFSSKV--CIASLSSTFDLWHKRLGQP 495

Query: 69   AIPIVQHIMSMCNLDTSNK-SFHFCHACAVGKSHNLPFHDSTSHYDFPLQLIVVDVWGPA 128
            +   +++++S CN+   NK   +FC +C +GK H  PF  S + Y  PL+LI  D+WGPA
Sbjct: 496  SAATIKNVLSKCNVAHINKMDSNFCSSCCLGKIHMFPFSLSHTTYTKPLELIHSDLWGPA 555

Query: 129  YESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNG 188
               S +G++YY+ FVD FSR++ I+ L NKSEA   F+ FKTQVE  F+  I SLQTD G
Sbjct: 556  PVLSNSGYRYYIHFVDAFSRFSWIFLLRNKSEAIKTFVNFKTQVELQFDLKIKSLQTDWG 615

Query: 189  GEFRSFIHFLKTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAF 248
            GEFR+F  +L  NGI HRV+CP+T QQNG+ ERKHR IVE GLTLL   SL +KFWD++F
Sbjct: 616  GEFRAFQSYLAENGIVHRVSCPHTQQQNGVAERKHRTIVEHGLTLLHTVSLPLKFWDESF 675

Query: 249  ATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNKHKLDPRSTPC 308
             T VYL NRLPT V H   P+E LF   P Y  L+ FGC C+P+LR YN HKL  RS  C
Sbjct: 676  RTVVYLSNRLPTAVLHHKCPIEVLFKSIPDYSFLKVFGCSCFPNLRPYNTHKLQYRSEEC 735

Query: 309  LFLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFPSFNFLYPSESKSNTLHHSVLPIVHS 368
             FLGYS  HKGYKC+SS GR+YISR V F+E++FP    +  S    +T+  S   +  S
Sbjct: 736  TFLGYSLKHKGYKCMSSNGRVYISRDVIFNETSFPYSKTIQVSSCLPSTVSPSTSHLSPS 795

Query: 369  DPTPINFQQSNLPPSCPLSTL----PMDPLITNSIPSP---QTLSLPTPLDSHAHPTPCL 428
               P+        P+ P+S+      MD +++    +P    T   P  + S+   TP  
Sbjct: 796  ASPPVLSPTMLPAPTSPISSARPISEMDNIVSTHPHAPNSADTTLTPAQVVSNPVATPVQ 855

Query: 429  IASGSVSNILNVPPVTGTTTK--NNTHTLITRGKARVFKPKVFLANYKEVEPPNVKEALK 488
                S+++      VT T  K  +NTH +ITR K+ + KPK+F+A  +  EP +V  AL+
Sbjct: 856  HVVSSIAD----ASVTRTIAKDADNTHPMITRAKSGIVKPKIFIAAVR--EPSSVSAALQ 915

Query: 489  CDHWIQAMIDEYRALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQG 548
             D W +AM+ EY AL  N+TWSL+  P  ++ IGCKWV+K K + DG+V +YKARLVA+G
Sbjct: 916  QDEWKKAMVAEYDALQRNNTWSLVPLPAGRQAIGCKWVYKTKENPDGTVQKYKARLVAKG 975

Query: 549  FHQQADVDYTETFSPVVKPVTIRILFTLALTNGWKLRLVDINNAFLHGLLSEEVFMSQP 598
            FHQQA  D+TETFSPVVKP TIR++FT+AL+  W ++ +D+NNAFL+G L EEVFM QP
Sbjct: 976  FHQQAGFDFTETFSPVVKPSTIRVVFTIALSRNWAIKQLDVNNAFLNGDLQEEVFMQQP 1021

BLAST of Tan0021760 vs. ExPASy TrEMBL
Match: A0A438FJP6 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_1134 PE=4 SV=1)

HSP 1 Score: 518.5 bits (1334), Expect = 3.8e-143
Identity = 292/626 (46.65%), Postives = 392/626 (62.62%), Query Frame = 0

Query: 5    LLQGD---GLYRFQMTQ------ANLSILSSQHHQSSISAPQVHNVSLS-SSLNQSVFHK 64
            LLQG+   GLY+F +++      + LS+ + ++  +  +A  VHN +        S FH 
Sbjct: 598  LLQGNLHKGLYQFNLSKKLFGKASGLSLSNDKNELTCCNASLVHNDNSDFPEKTNSSFH- 657

Query: 65   CTSLEKWHNRLGHSAIPIVQHIMSMCNLDTSNKS-FHFCHACAVGKSHNLPFHDSTSHYD 124
                + WH RLGH A  IV  +++   +  S KS    C AC +GKSHNLPF  S + Y 
Sbjct: 658  --VFDLWHKRLGHPASKIVTQVLNDNKIPFSTKSGSSICSACQLGKSHNLPFPISQTVYT 717

Query: 125  FPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEK 184
             PLQL+V D+WGPA  +S  GF YYVSFVD +SRYT +YFL  KS+   AFL+FK Q E 
Sbjct: 718  KPLQLVVSDLWGPAPINSSYGFTYYVSFVDAYSRYTWVYFLKTKSQTREAFLMFKAQAEL 777

Query: 185  MFNRSILSLQTDNGGEFRSFIHFLKTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLL 244
             F   + + QTD GGEFRS   + + NGI HR++CP+TS+QNGI+ERKHRHIVE+GLTLL
Sbjct: 778  QFGCKLKTFQTDWGGEFRSLKTYFEQNGIIHRLSCPHTSKQNGIIERKHRHIVELGLTLL 837

Query: 245  SYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLR 304
            + ASL +K+W DAF+TAV+LINRLPT+V     P E LF  KP Y  L+ FGCLC+P LR
Sbjct: 838  AQASLPLKYWPDAFSTAVFLINRLPTEVLKQKCPYEFLFNSKPNYSQLKVFGCLCFPHLR 897

Query: 305  AYNKHKLDPRSTPCLFLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFPSFNFLY-PSES 364
             YNKHKLD RS+PC FLGYSS HKGYKCL+  GRM+ISR V FDE+ FP  + L  P + 
Sbjct: 898  PYNKHKLDFRSSPCTFLGYSSKHKGYKCLNQQGRMFISRSVVFDETRFPFADRLQKPVQI 957

Query: 365  KS-NTLHHSVLPIVHSDPTPINFQQS-NLPPSCPLSTLPMDPLITNSIPSPQ-------- 424
             S +T+    +P+V  +  P++   S +LP S   S+  +D  + + I S Q        
Sbjct: 958  VSHSTVGLPCIPLV-KNLEPLSVSPSLSLPTSSAQSSHQLDENLGSDIRSVQQDLSNTDS 1017

Query: 425  ---------TLSLPTPLDSHAHP--TPCLIASGSVSNILNVPPVTGTTTKNNTHTLITRG 484
                     + S+P+  + +A P   P    S   +  +N  PV   T     H ++TR 
Sbjct: 1018 SSTVPILNESASIPSSSNLYALPGTIPLSTNSDEPNESINTRPV---TFPQQPHHMVTRS 1077

Query: 485  KARVFKPKVFLANYKEVEPPNVKEALKCDHWIQAMIDEYRALMSNDTWSLLDRPVNKKII 544
            K  +FKPKV+  +    EP   +EA+    W +AM +E+RALM N TWSL+  P N+  +
Sbjct: 1078 KNGIFKPKVYTVDLNVEEPNTFQEAISHPKWKEAMDEEFRALMKNKTWSLVSLPTNRTSV 1137

Query: 545  GCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNG 598
            GC+WVFK+KR+ DGSV+RYKARLVA+G+ Q    D+ ETFSPVVKP TIR++  +A++  
Sbjct: 1138 GCRWVFKLKRNPDGSVSRYKARLVAKGYSQVPGFDFYETFSPVVKPTTIRVVLAIAVSQS 1197

BLAST of Tan0021760 vs. ExPASy TrEMBL
Match: A0A438EA49 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_2917 PE=4 SV=1)

HSP 1 Score: 518.5 bits (1334), Expect = 3.8e-143
Identity = 292/626 (46.65%), Postives = 392/626 (62.62%), Query Frame = 0

Query: 5    LLQGD---GLYRFQMTQ------ANLSILSSQHHQSSISAPQVHNVSLS-SSLNQSVFHK 64
            LLQG+   GLY+F +++      + LS+ + ++  +  +A  VHN +        S FH 
Sbjct: 462  LLQGNLHKGLYQFNLSKKLFGKASGLSLSNDKNELTCCNASLVHNDNSDFPEKTNSSFH- 521

Query: 65   CTSLEKWHNRLGHSAIPIVQHIMSMCNLDTSNKS-FHFCHACAVGKSHNLPFHDSTSHYD 124
                + WH RLGH A  IV  +++   +  S KS    C AC +GKSHNLPF  S + Y 
Sbjct: 522  --VFDLWHKRLGHPASKIVTQVLNDNKIPFSTKSGSSICSACQLGKSHNLPFPISQTVYT 581

Query: 125  FPLQLIVVDVWGPAYESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEK 184
             PLQL+V D+WGPA  +S  GF YYVSFVD +SRYT +YFL  KS+   AFL+FK Q E 
Sbjct: 582  KPLQLVVSDLWGPAPINSSYGFTYYVSFVDAYSRYTWVYFLKTKSQTREAFLMFKAQAEL 641

Query: 185  MFNRSILSLQTDNGGEFRSFIHFLKTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLL 244
             F   + + QTD GGEFRS   + + NGI HR++CP+TS+QNGI+ERKHRHIVE+GLTLL
Sbjct: 642  QFGCKLKTFQTDWGGEFRSLKTYFEQNGIIHRLSCPHTSKQNGIIERKHRHIVELGLTLL 701

Query: 245  SYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLR 304
            + ASL +K+W DAF+TAV+LINRLPT+V     P E LF  KP Y  L+ FGCLC+P LR
Sbjct: 702  AQASLPLKYWPDAFSTAVFLINRLPTEVLKQKCPYEFLFNSKPNYSQLKVFGCLCFPHLR 761

Query: 305  AYNKHKLDPRSTPCLFLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFPSFNFLY-PSES 364
             YNKHKLD RS+PC FLGYSS HKGYKCL+  GRM+ISR V FDE+ FP  + L  P + 
Sbjct: 762  PYNKHKLDFRSSPCTFLGYSSKHKGYKCLNQQGRMFISRSVVFDETRFPFADRLQKPVQI 821

Query: 365  KS-NTLHHSVLPIVHSDPTPINFQQS-NLPPSCPLSTLPMDPLITNSIPSPQ-------- 424
             S +T+    +P+V  +  P++   S +LP S   S+  +D  + + I S Q        
Sbjct: 822  VSHSTVGLPCIPLV-KNLEPLSVSPSLSLPTSSAQSSHQLDENLGSDIRSVQQDLSNTDS 881

Query: 425  ---------TLSLPTPLDSHAHP--TPCLIASGSVSNILNVPPVTGTTTKNNTHTLITRG 484
                     + S+P+  + +A P   P    S   +  +N  PV   T     H ++TR 
Sbjct: 882  SSTVPILNESASIPSSSNLYALPGTIPLSTNSDEPNESINTRPV---TFPQQPHHMVTRS 941

Query: 485  KARVFKPKVFLANYKEVEPPNVKEALKCDHWIQAMIDEYRALMSNDTWSLLDRPVNKKII 544
            K  +FKPKV+  +    EP   +EA+    W +AM +E+RALM N TWSL+  P N+  +
Sbjct: 942  KNGIFKPKVYTVDLNVEEPNTFQEAISHPKWKEAMDEEFRALMKNKTWSLVSLPTNRTSV 1001

Query: 545  GCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNG 598
            GC+WVFK+KR+ DGSV+RYKARLVA+G+ Q    D+ ETFSPVVKP TIR++  +A++  
Sbjct: 1002 GCRWVFKLKRNPDGSVSRYKARLVAKGYSQVPGFDFYETFSPVVKPTTIRVVLAIAVSQS 1061

BLAST of Tan0021760 vs. ExPASy TrEMBL
Match: A5BFT3 (Integrase catalytic domain-containing protein OS=Vitis vinifera OX=29760 GN=VITISV_017741 PE=4 SV=1)

HSP 1 Score: 513.1 bits (1320), Expect = 1.6e-141
Identity = 276/599 (46.08%), Postives = 374/599 (62.44%), Query Frame = 0

Query: 9    DGLYRFQMTQANLSILSSQHHQSSISAPQVHNVSLSSSLNQSVFHKCTSLEKWHNRLGHS 68
            DGLY F  +   L    S     S+ A    +   ++SL+       ++ + WH RLGH 
Sbjct: 479  DGLYAFDSSHLALRPTQSLSKSPSVVASSFSSKVCTTSLS-------STFDLWHKRLGHP 538

Query: 69   AIPIVQHIMSMCNLDTSNK-SFHFCHACAVGKSHNLPFHDSTSHYDFPLQLIVVDVWGPA 128
            +   +++++S CN+   NK   +FC +C +GK H  PF  S + Y  PL+LI +D+WGP 
Sbjct: 539  SAATIKNVLSKCNVAHINKMDSNFCSSCCLGKIHRFPFSLSHTTYTKPLELIHLDLWGPT 598

Query: 129  YESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNG 188
               S +G++YY+ FVD FSR++ I+ L NKSEA   F+ FKTQVE  F+  I SLQTD G
Sbjct: 599  LVLSNSGYRYYIHFVDAFSRFSWIFLLRNKSEAIKTFVNFKTQVELQFDLKIKSLQTDWG 658

Query: 189  GEFRSFIHFLKTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAF 248
            GEFR+F  +L  NGI HRV+CP+T QQNG+ ERKHR IVE GLTLL  ASL +KFWD++F
Sbjct: 659  GEFRAFQSYLAENGIVHRVSCPHTQQQNGVAERKHRTIVEHGLTLLHTASLPLKFWDESF 718

Query: 249  ATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNKHKLDPRSTPC 308
             T VYL NRLPT + H   P+E LF   P Y  L+ FGC C+P+LR YN HKL  RS  C
Sbjct: 719  RTVVYLSNRLPTAILHHKCPIEVLFKSIPDYSFLKVFGCSCFPNLRPYNTHKLQYRSEEC 778

Query: 309  LFLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFPSFNFLYPSESKSNTLHHSVLPIVHS 368
             FLGYS  HKGYKC+SS GR+YIS  V F+E++FP    +  S    +T+  S   +  S
Sbjct: 779  TFLGYSLKHKGYKCMSSNGRVYISHDVIFNETSFPYSKTIQVSSCLLSTVSPSTSHLSPS 838

Query: 369  DPTPINFQQSNLPPSCPLSTL----PMDPLITNSIPSP---QTLSLPTPLDSHAHPTPCL 428
               P+        P+ P+S+      MD +++    +P    T   P  + S+   TP  
Sbjct: 839  ASPPVLSPTMLPTPTSPISSARPISEMDNIVSTHPHAPNSADTTLTPAQVVSNPVATPVQ 898

Query: 429  IASGSVSNILNVPPVTGTTTK--NNTHTLITRGKARVFKPKVFLANYKEVEPPNVKEALK 488
                S+++      VT T  K  +NTH +ITR K+ + KPK+F+A  +  EP +V  AL+
Sbjct: 899  HVVSSIAD----ASVTRTIAKDADNTHPMITRAKSGIVKPKIFIAAIR--EPSSVSAALQ 958

Query: 489  CDHWIQAMIDEYRALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQG 548
             D W +AM+ EY AL  N+TWSL+  P  ++ IGCKWV+K K + DG+V +YKARLVA+G
Sbjct: 959  QDEWKKAMVAEYDALQRNNTWSLVPLPAGRQAIGCKWVYKTKENPDGTVQKYKARLVAKG 1018

Query: 549  FHQQADVDYTETFSPVVKPVTIRILFTLALTNGWKLRLVDINNAFLHGLLSEEVFMSQP 598
            FHQQA  D+TETFSPVVKP T+R++FT+AL+  W ++ +D+NNAFL+G L EEVFM QP
Sbjct: 1019 FHQQAGFDFTETFSPVVKPSTVRVVFTIALSRNWAIKQLDVNNAFLNGDLQEEVFMQQP 1064

BLAST of Tan0021760 vs. ExPASy TrEMBL
Match: A0A438K147 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_2516 PE=4 SV=1)

HSP 1 Score: 510.8 bits (1314), Expect = 8.0e-141
Identity = 281/599 (46.91%), Postives = 376/599 (62.77%), Query Frame = 0

Query: 9    DGLYRFQMTQANLSILSSQHHQSSISAPQVHNVSLSSSLNQSVFHKCTSLEKWHNRLGHS 68
            DGLY F     + S L+ +  QS   +P V   S SS +   +    ++ + WH RLG  
Sbjct: 436  DGLYAF-----DSSHLALRPTQSLSKSPSVVASSFSSKV--CIASLSSTFDLWHKRLGQP 495

Query: 69   AIPIVQHIMSMCNLDTSNK-SFHFCHACAVGKSHNLPFHDSTSHYDFPLQLIVVDVWGPA 128
            +   +++++S CN+   NK   +FC +C +GK H  PF  S + Y  PL+LI  D+WGPA
Sbjct: 496  SAATIKNVLSKCNVAHINKMDSNFCSSCCLGKIHMFPFSLSHTTYTKPLELIHSDLWGPA 555

Query: 129  YESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNG 188
               S +G++YY+ FVD FSR++ I+ L NKSEA   F+ FKTQVE  F+  I SLQTD G
Sbjct: 556  PVLSNSGYRYYIHFVDAFSRFSWIFLLRNKSEAIKTFVNFKTQVELQFDLKIKSLQTDWG 615

Query: 189  GEFRSFIHFLKTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAF 248
            GEFR+F  +L  NGI HRV+CP+T QQNG+ ERKHR IVE GLTLL   SL +KFWD++F
Sbjct: 616  GEFRAFQSYLAENGIVHRVSCPHTQQQNGVAERKHRTIVEHGLTLLHTVSLPLKFWDESF 675

Query: 249  ATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNKHKLDPRSTPC 308
             T VYL NRLPT V H   P+E LF   P Y  L+ FGC C+P+LR YN HKL  RS  C
Sbjct: 676  RTVVYLSNRLPTAVLHHKCPIEVLFKSIPDYSFLKVFGCSCFPNLRPYNTHKLQYRSEEC 735

Query: 309  LFLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFPSFNFLYPSESKSNTLHHSVLPIVHS 368
             FLGYS  HKGYKC+SS GR+YISR V F+E++FP    +  S    +T+  S   +  S
Sbjct: 736  TFLGYSLKHKGYKCMSSNGRVYISRDVIFNETSFPYSKTIQVSSCLPSTVSPSTSHLSPS 795

Query: 369  DPTPINFQQSNLPPSCPLSTL----PMDPLITNSIPSP---QTLSLPTPLDSHAHPTPCL 428
               P+        P+ P+S+      MD +++    +P    T   P  + S+   TP  
Sbjct: 796  ASPPVLSPTMLPAPTSPISSARPISEMDNIVSTHPHAPNSADTTLTPAQVVSNPVATPVQ 855

Query: 429  IASGSVSNILNVPPVTGTTTK--NNTHTLITRGKARVFKPKVFLANYKEVEPPNVKEALK 488
                S+++      VT T  K  +NTH +ITR K+ + KPK+F+A  +  EP +V  AL+
Sbjct: 856  HVVSSIAD----ASVTRTIAKDADNTHPMITRAKSGIVKPKIFIAAVR--EPSSVSAALQ 915

Query: 489  CDHWIQAMIDEYRALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQG 548
             D W +AM+ EY AL  N+TWSL+  P  ++ IGCKWV+K K + DG+V +YKARLVA+G
Sbjct: 916  QDEWKKAMVAEYDALQRNNTWSLVPLPAGRQAIGCKWVYKTKENPDGTVQKYKARLVAKG 975

Query: 549  FHQQADVDYTETFSPVVKPVTIRILFTLALTNGWKLRLVDINNAFLHGLLSEEVFMSQP 598
            FHQQA  D+TETFSPVVKP TIR++FT+AL+  W ++ +D+NNAFL+G L EEVFM QP
Sbjct: 976  FHQQAGFDFTETFSPVVKPSTIRVVFTIALSRNWAIKQLDVNNAFLNGDLQEEVFMQQP 1021

BLAST of Tan0021760 vs. ExPASy TrEMBL
Match: A0A438J431 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_230 PE=4 SV=1)

HSP 1 Score: 502.7 bits (1293), Expect = 2.2e-138
Identity = 275/591 (46.53%), Postives = 356/591 (60.24%), Query Frame = 0

Query: 10  GLYRFQMTQANLSILSSQHHQSSISAPQVHNVSLSSSLNQSVFHKCTSLEKWHNRLGHSA 69
           GLY F  TQ  L + S +   SS  A    + +L S          +    WHNRLGH +
Sbjct: 144 GLYVFDNTQLKLPLHSVETFNSSCFA----STTLPSKEPTVPASPTSPFTLWHNRLGHPS 203

Query: 70  IPIVQHIMSMCNLDTSNK-SFHFCHACAVGKSHNLPFHDSTSHYDFPLQLIVVDVWGPAY 129
             IV  +++ CNL   NK     C AC +GK H  PF  STS Y  PL+LI  D+WGPA 
Sbjct: 204 SHIVSLVLNKCNLPHLNKIPSLICSACCMGKIHKSPFLHSTSSYTKPLELIHTDLWGPAS 263

Query: 130 ESSRNGFKYYVSFVDVFSRYTLIYFLNNKSEAFSAFLLFKTQVEKMFNRSILSLQTDNGG 189
             S +G +YY+ F+D +SR+T IY L +KSEAF  FL FK+QVE      I ++Q+D GG
Sbjct: 264 TPSSHGHQYYIHFIDAYSRFTWIYMLKHKSEAFQVFLHFKSQVELQLGHKIKAVQSDWGG 323

Query: 190 EFRSFIHFLKTNGITHRVTCPYTSQQNGIVERKHRHIVEMGLTLLSYASLSIKFWDDAFA 249
           E+RSF  +L +NGI HR++CPYT +QNG+ ERKHRHIVE G+ LL+ ASL  K+WD+AF 
Sbjct: 324 EYRSFTQYLTSNGIIHRISCPYTHEQNGLAERKHRHIVEHGIALLAQASLPFKYWDEAFR 383

Query: 250 TAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDLRTFGCLCYPSLRAYNKHKLDPRSTPCL 309
           T+V+LINRLPT V    SP+E LF  KP+Y  L+ FGC+CYP+LR +N HKL  RS PC 
Sbjct: 384 TSVHLINRLPTPVLKNKSPLEVLFHQKPSYSQLKVFGCMCYPNLRPFNHHKLQFRSIPCT 443

Query: 310 FLGYSSFHKGYKCLSSFGRMYISRHVTFDESTFPSFNFLYPSESKSNTLHHSVLPIVHSD 369
           FLGYS   KGYKCLS  G + ISR V FDE  FP   F      K  T            
Sbjct: 444 FLGYSLNRKGYKCLSPNGNILISRDVIFDEHAFP---FAQLQSQKQTT------------ 503

Query: 370 PTPINFQQSNLP--PSCPLSTLPMDPLITNSIPSPQTLSLPTPLDSHAHPTPCLIASGSV 429
            + ++   ++LP   S PL  LP     + S P+                 P +  + S 
Sbjct: 504 -SSLSSSSTSLPCQTSLPLMVLPSSTFCSTSSPT----------------NPSIFPATSN 563

Query: 430 SNILNVPPVTGTTTKNNTHTLITRGKARVFKPKVFLANYKEVEPPNVKEALKCDHWIQAM 489
            N+ + PP   +     TH +ITR K  +FKPK +L +     P +V EAL+  HW QAM
Sbjct: 564 HNVASQPP-PSSAPPFPTHHMITRSKNGIFKPKAYLIS---TTPTSVPEALQLCHWKQAM 623

Query: 490 IDEYRALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVD 549
            DEY AL+ N+TW L+  P + K+IGCKWVFK+K + DG++ +YKARLVA+GFHQ A  D
Sbjct: 624 TDEYLALLRNNTWDLVPPPTDCKLIGCKWVFKVKENPDGTINKYKARLVAKGFHQIAGFD 683

Query: 550 YTETFSPVVKPVTIRILFTLALTNGWKLRLVDINNAFLHGLLSEEVFMSQP 598
           + ETFS VVKP TIRI+ T+AL   WK+R +D+NNAFL+G L E++FM QP
Sbjct: 684 FNETFSLVVKPTTIRIVLTIALNLQWKVRQLDVNNAFLNGDLHEDIFMHQP 694

BLAST of Tan0021760 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 129.4 bits (324), Expect = 9.6e-30
Identity = 61/132 (46.21%), Postives = 86/132 (65.15%), Query Frame = 0

Query: 466 KEVEPPNVKEALKCDHWIQAMIDEYRALMSNDTWSLLDRPVNKKIIGCKWVFKIKRHSDG 525
           K  EP    EA +   W  AM DE  A+ +  TW +   P NKK IGCKWV+KIK +SDG
Sbjct: 82  KAKEPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDG 141

Query: 526 SVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRILFTLALTNGWKLRLVDINNAFLH 585
           ++ RYKARLVA+G+ QQ  +D+ ETFSPV K  +++++  ++    + L  +DI+NAFL+
Sbjct: 142 TIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFLN 201

Query: 586 GLLSEEVFMSQP 598
           G L EE++M  P
Sbjct: 202 GDLDEEIYMKLP 213

BLAST of Tan0021760 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase) )

HSP 1 Score: 116.7 bits (291), Expect = 6.4e-26
Identity = 62/125 (49.60%), Postives = 81/125 (64.80%), Query Frame = 0

Query: 447 LITRGKARVFK--PKVFLANYKEV--EPPNVKEALKCDHWIQAMIDEYRALMSNDTWSLL 506
           ++TR KA + K  PK  L     +  EP +V  ALK   W QAM +E  AL  N TW L+
Sbjct: 1   MLTRSKAGINKLNPKYSLTITTTIKKEPKSVIFALKDPGWCQAMQEELDALSRNKTWILV 60

Query: 507 DRPVNKKIIGCKWVFKIKRHSDGSVARYKARLVAQGFHQQADVDYTETFSPVVKPVTIRI 566
             PVN+ I+GCKWVFK K HSDG++ R KARLVA+GFHQ+  + + ET+SPVV+  TIR 
Sbjct: 61  PPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGIYFVETYSPVVRTATIRT 120

Query: 567 LFTLA 568
           +  +A
Sbjct: 121 ILNVA 125

BLAST of Tan0021760 vs. TAIR 10
Match: ATMG00710.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 47.4 bits (111), Expect = 4.8e-05
Identity = 30/83 (36.14%), Postives = 43/83 (51.81%), Query Frame = 0

Query: 222 HRHIVEMGLTLLSYASLSIKFWDDAFATAVYLINRLPTKVHHTVSPMEKLFGHKPTYVDL 281
           +R I+E   ++L    L   F  DA  TAV++IN+ P+   +   P E  F   PTY  L
Sbjct: 2   NRTIIEKVRSMLCECGLPKTFRADAANTAVHIINKYPSTAINFHVPDEVWFQSVPTYSYL 61

Query: 282 RTFGCLCYPSLRAYNKHKLDPRS 305
           R FGC+ Y      ++ KL PR+
Sbjct: 62  RRFGCVAYIHC---DEGKLKPRA 81

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q94HW24.1e-11038.81Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
Q9ZT947.7e-10939.65Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
P109789.6e-6731.10Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
P041465.1e-5228.15Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
P925209.1e-2549.60Uncharacterized mitochondrial protein AtMg00820 OS=Arabidopsis thaliana OX=3702 ... [more]
Match NameE-valueIdentityDescription
KAG8479334.11.4e-14447.69hypothetical protein CXB51_029681 [Gossypium anomalum][more]
RVW60229.17.9e-14346.65Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera][more]
RVW44519.17.9e-14346.65Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera][more]
CAN81099.13.3e-14146.08hypothetical protein VITISV_017741 [Vitis vinifera][more]
RVX14937.11.6e-14046.91Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera][more]
Match NameE-valueIdentityDescription
A0A438FJP63.8e-14346.65Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
A0A438EA493.8e-14346.65Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
A5BFT31.6e-14146.08Integrase catalytic domain-containing protein OS=Vitis vinifera OX=29760 GN=VITI... [more]
A0A438K1478.0e-14146.91Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
A0A438J4312.2e-13846.53Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
Match NameE-valueIdentityDescription
AT4G23160.19.6e-3046.21cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [more]
ATMG00820.16.4e-2649.60Reverse transcriptase (RNA-dependent DNA polymerase) [more]
ATMG00710.14.8e-0536.14Polynucleotidyl transferase, ribonuclease H-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR025724GAG-pre-integrase domainPFAMPF13976gag_pre-integrscoord: 47..100
e-value: 2.1E-8
score: 33.9
IPR001584Integrase, catalytic corePFAMPF00665rvecoord: 115..211
e-value: 7.0E-13
score: 48.8
IPR001584Integrase, catalytic corePROSITEPS50994INTEGRASEcoord: 111..275
score: 23.010916
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 496..598
e-value: 1.5E-29
score: 103.4
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 108..283
e-value: 1.3E-36
score: 127.8
NoneNo IPR availablePANTHERPTHR45895FAMILY NOT NAMEDcoord: 480..597
coord: 59..341
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 113..283

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0021760.1Tan0021760.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
molecular_function GO:0003676 nucleic acid binding