CsaV3_1G014410 (gene) Cucumber (Chinese Long) v3

Name: CsaV3_1G014410
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v3)
Description: RNA-directed DNA polymerase
Location: chr1 : 9860882 .. 9866766 (-)
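The Location value above can be turned into a structured record and a feature length. The following is a minimal sketch, not part of the database itself: the names `Location` and `parse_location` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive at both ends, which this page does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval as displayed in the Location field."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive at both ends.
        return self.end - self.start + 1


# Matches values shaped like "chr1 : 9860882 .. 9866766 (-)".
_LOCATION_RE = re.compile(
    r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
)


def parse_location(value: str) -> Location:
    """Parse the value part of the Location field (without the label)."""
    match = _LOCATION_RE.fullmatch(value)
    if match is None:
        raise ValueError(f"unrecognised location: {value!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("chr1 : 9860882 .. 9866766 (-)")
print(loc.seqid, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption, the interval spans 5885 bases on the minus strand of chr1.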
The following sequences are available for this feature:

Gene sequence (with intron)

CTAACTCAACTCCAAGTTAAACTAACTCAACCCCAAGTTAGTTTAACAACTCTTGTAAAGGCTTCTCATTTTCATTAATAAGACATAAAAAAAAGAGGCATTATACATAAAGAATTCTAACTTTGCAGTCACTGCATCAATTTGTATCAGAGCGTGATCCTAGCTCACTCTTCCCTAAAAAAAAACTTGGATCCTACCACTCTTATCCATGGACGCGGAAGCCTCCTTCAACCAACGCTTGCAGTTGGTAGAACAAGCTGTTGAAGAACTTCGAAAAGGGCAGTCGGATATACAGCAATCGTTAGTAGAATTAACAAATATTTTTACCCGATTTCATGTCCAACCTTTAGAAAAAGCAATTCAGCCCCCAAGAAACCCAATTACAACAAACCAAGAAAGTCAAGAAAGACCAAGAAACCCAATTGCAGCAAACCAAGAAAGACAAGAAAGACCAAGAAACCCTGCTGTAAACCAAGAACTACTAAGAAATCAACCTATTCACACCCACAATAATCGATTGTATGGAAACAACGTACCTGGACATTTCTTAGACGATTCTGACTCTTCCGATGAAGACAACTTGTTGAACATTCATCAAGAACCTCAACAAAGGCTTGGACTTCGACCATACTTTCAAGAAGATCAAGAAATACGTATGAAAGTGGATCTCCCTACCTTCATTGGTCGAATGGATGTGGATAAGTTTCTTGATTGGATCAAAAACGTAGAAATTTTTTTTGTCTATGCCAATACACCCGAACACAAGAAGGTCCGATTAGTTGCTCTCAAACTTCAAGGTGGCGCAAGCGCTTGGTGGGATCAATTACAAAACAATAGACGATTATTCGGTAAACAATCAATAAGAAATTGGCCAAAGATGTTACGTCTAATGAAGAAAAGATTCTTACCAATCAACTATCAGCAACTACTTTACAATCAGTATCAACAATGTCATCAAGGCTCACAAAGCATCATGGATTATACAGAAGAATTCTATCGACTTGGTGCTCGAAATAATCTTCCCGAAGAACACCAATAAATTTCCAGGTTTATTCATGGTCTACGAGATGAGATTAAAGATATTGTACACTTACATCCTTTAACTTTTCTTTCAGATGCCATCTCCTTAGCTTCCAAGATTGAGGATAGTGAAGAGATCAAGAAAACCAAGAATTCTCAAAGAAAGAATAATTGGGACAAACAACAAAGAACTAACCTAACTAATTCATTTAGAAACTTTCAACAAGGAAGTTCATCCACAACTTCACAGCTCGCCAAGAAAGATGAAAATTCATCAAAGATTCCAGCCACTAAACCAGGTGAGAATAACGCAAAGAAGAAGGTTGACAACATTTATAACCGTCCTACTTTGGGTAAATGTTTCAGGTGTGGACAACAAGGGCACCTCTCCAACGAGTGCCCTCAAAGGAGAACTTTGACAATTGAAGAAGGACAAGAAGATAATGACTCGGATGACATCTTTGAGATTTCAACACCGGATGAAGGAGATCAACTATCTTGTGTCATTCAGAGAATTCTTTTCACCCCTACGGCTGGACAAATACCTCAAAGGAATTTCCTCTTTCGAACAAGATGCACCATTAACGGTAAAGTGTGTCAAGTTATCATTGATAGTGGTAGTAGTGGAAATTTAGTATCTAAAAAACTAGTTTCTGCCCTAAATTTGAAAATTGACCCATACTTGAATCCCTATAAGGTTAGTTGGATAAAAAAAGGGGGCAAAGCTACTGTTAGTGAGGTTTGTACTATTTCTTTGTCAATAGGTCAGCACTATAAAGACCAAATAATATGTGACGTTCTTGATATGGATGCTTGCCACATCCTTTTAGGAAGACCTTGGCAATATAACACATATTACATTCATAGAGGAAGATCTAACACTATTGAGTTTGATTGGATGAGTCTTCGTGTAGTTCTTTTACCTATAGGCAACTCCACCGACACAAAAGCTAACCTTAGTAAAGGAAAACAACTTTTTT
AGTTCCTGCCTTAAAAAATAACACCTATCTTAGCCTTTGTCATTAAAGGAAACAACAGTGAACCTTCATAAAACTACCTACCTAGTGAGATATCAAACCTTCTAACCAAATTTCCAAATATTATAGAACCAACCACTTCTCTGCCACCTCTTAGAAATATACAACACACTATAGATTTAATGCCCGGTAGTACCTTACCTAATTTGCCACATTATCGCATGAACCCTAGAGAGTACAAAATATTACATGAACAAATAGAAGAGCTTCTTAACAAAGGTCATATCCAACCTAGCCTAAGTCCTTGTGCTGTCCCAGCTCTACTTGCACAAAAAAAAGATGGTAACTAGAGACTTTGTGTAGATAGTAGATCTATAAACAAAATTACAATCAAATATAGGTTTCCCATACCTCGTTTATCCGACCTCCTTGATCAATTAGGTAAAGCTTCCATCTTTTCCAAAGTAGATCTTAAGAGTGGCTACCACCAAATTAGAATTAAACCAGGGGATGAGTGGAAAACAGCCTTTAAAACCAATGAAGGATTGTTTGAATGGTTAGTTATGCCCTTTGGTCTTTCTAATGCCCCTAGCACCTTTATGAGATTGATGAACCAAATCCTACAACCTTTTTTGAATCAATTCATTGTGGTATACTTTGATGACATACTCATTTATAGCTCATCTAGGGAAGATCACCTAAAACACATCCAACTACTTTTTACAGCCTTACAAGAAAATGAATTACAAATTAATCTAAAAAAATGTGAATTTTTGTGTTATAGTATTCATTTTCTTGAATTTATCATAAGCTGTGATGGTATTTCTGTTGATCCCAAAAAGATTGACTCTATTAGTTCTTGGCCACAACCAAAAACTCCAAAAGATATTCAATGCTTTTTAGGTTTAGCTTCTTTCTATAGAAGGTTCATCAAGAATTTCAGCACCATAGCAGCACCTTTGACAAACTGTTTAAAGAAAGGCAGTTTCCAATGGGGACAAGTAGAAGAAGACAGTTTCTGTAAATTAAAATTAGCCTTAGCTAGCCCCCCTGTTTTAAAACTACCAAACTTTGATAAACCATTTGAAGTCACTGTAGATGCTTCGGGTTTAGGTATAGGAGTTGTCCTTAGTCAAGAAGGTCACCCTTTAGAGTATTTTAGTGAAAAATTAAGTACTTATAGACAAAATTGGAGTACCTATGAACAAGAACTATATGCCCTAGTGCGTGCTCTAAAACAATGGGAACATTACCTACTTGCTAATACTTTCATCTTGCTCACTGACCACTTTTCTCTGAAATTTCTAAACTCCCAAAAAACAATTAGTAGAATGCATGCCCGATGGCTACAATTCTTGCAAAGATTTGATTTTGTGATTAAACATACTTCGGGTAAGTCTAATAAGGTTGCTGATGCTCTCAGCCGAAAAGGTATACTTCTCACCACCCTCCAATCTCAAATAATTGCTTTTGATCACCTTCATACACTGTACCCTACTGATATTGATTTTAGAAGTATTTGGGAAAATTGCTCTAACCATAAACCTTGTAAAGATTACCACATTGTTAATAGTTTTCTCTTTAAAGGTGATGTTTTGTGTGTCCCTCATACATCTTTAAGAGAAGCAATAATTAAAGAAACACATTCTAGGGGATTAGCAGGACATTTTGGCCGAGACAAAACTTTGGCTACAATTATCTCTAAATTCTTTTGGCCACAACTCAATAGGAATGTTACTAACTTCATCAAAAGATGCTCCATTTGCCAAACAGCCAAAGGTAACTCTCAAAATACAGGTTTGTACACTCCTTTACCTATTTCTTCTACAATTTGGGAGGATTTATCTATGGATTTTGTTTTAGGATTACCAAGAACCCAAAGGGGCCATGATTCTATTTTTGTAGTTGTAAATCGCTTTAGCAAAATGGCTCATTTTATTCCTTGCAAAAAAACTTCTGATGCTTTGAATATAGCTAACTTGTTTTTTCGAGAAATTGTTAGA
TTGCATGGAATTCCTAAAACTATAGTTTCGGATAGAGATGTAAAATTCTTGAGCCATTTTTGGAGATCCCTATGTAAGAAATTCGACACCAACCTCCTGTTTAGTACCGCCAGCCACCCACAAACTGACGGCCAAACAGAGGTGACCAACCGAACTCTTGGCAACCTCATTCGATGCCTTAGCGGGGACAAGCCTAAACAATGGGACCTAGCTTTACCTCAAGCATAATTCGCCTTCGACCACATGCCAAATCGCTCAACAGGGAAGTCTCCATTTGAGGTTGTATACACTAGTCTCCCTCGGGTAACTTTTGATTTAGCTAATCTACCTTCTGTTATTGATGTCAGTATGGAGGCTGAAGCAATGGCTGAACGTATCTCCAAGCTACACCAAGAAGTCAAGAGTCATCTAGGGCTTGCTAATGACTCCTACTAAACAGCCGCCAGCTATCACAAACGCTTTAAGGAATATCAAGTAGGTGACCTAGTTATGGTGCACCTTCGAAAGTCAAGACTACCTGCTGGTCATCACTCCAAAGTGACCAACAAGCGCATGGGCCCATTTCAAATATTGGAAAGACTCGGCCCCAATGCTTACCGAGTAGACCTCCCAGCAAACATAAGAATCAGCCCATCTTTCAACGTTGTCGACTTATCACCTTACTATGCACCAGATTCCTTCTCTTTAACTCCCTAATCTTCACTCGAGGTCGAGTTTTATCCTAGAGGAGGGGTTTGATGTAGAACATCTCCCTTCTTGATATAGAACATCTCCCTTCTTGATGTTTCTTGAATTTCATCCTAGAAGAGGGGTTTGGATTACCATTTTATAGTTTATGATTAATTACCAATTTTGTATCTTGAGTTAGTTATAACTAACTCAACTCCAAGTTAAACTAACTCAACCTCAAGTTAGTTTAACAACTCTTGTAATCTCCAGCCTATAAAAATGTTCTCATTTTCATTAATAAGACATCAAAAAAAGAGGCATTATACATAAAGAATTCTAACTTTGCAGTCAATGCATCAGCATGTCTCACAAGCTTAAATAACTAACCACCTTTTCCTCTTTTACTTGGCTTTTACATTTGAGGTTCATTCCTTAGTTGGCTTCAACCCTTGACAACCTGGGGAGGAAAAACTTAAAACGTAAGTCAATGACTTAGTGAATCATGGGTTTAGGAAATAGCTTTTTGGGAAATACAACAATTTCAGACATTATAACCACAAGGTCATTTCATATGCAAATATATATAGATCTCATCGCATCAAACATAATCACAAGCTTTAACACAACCACAACAATTGCAACAATTTCACGCAGATCAAGTCTTCAAGCATTCACTACTCATTGTCTAAACCTGGGATTCTCCCATTACCTACACCTAGGATTCGCTTTCACCAACGTCTCATCGCCTAGACTTGGAATTCACTTCACCAGAGTCACTGATGGATCTAGGATTCACTTTCACGAGCATCACACAATTTTTCTGTGATTTCCAATAGAGCGTTGTAATACTCATTCTTTCTCATACCTGGATTCACTTTCACCATCATGTAGACCTGAGATTCACATTCACCAACATCACGCTATGGACCTGGGATTCGCTTTCACCAGCGTCTCTCATAAAATTCCTCATCTAGATCTAGGATTCGTATTCACCAGCATTCAGCATCAATAACACATCTCATACTCGCTTAGACTTAGGATTCACTTTCACTAGCATCTCATTGCCTAGACCTAGGATTCAGATTCACCAGCATCACGTGGAAGACCTAAGATTCACTTTCACCAGAGTCATGTGATAGATCAAACTACACTACACCGTACAATCATTACAGACCATAACAAAGGAATAGTAACTCAAGAGCTTTTCATACAACAC

mRNA sequence

ATGCCCTTTGGTCTTTCTAATGCCCCTAGCACCTTTATGAGATTGATGAACCAAATCCTACAACCTTTTTTGAATCAATTCATTGTGGTATACTTTGATGACATACTCATTTATAGCTCATCTAGGGAAGATCACCTAAAACACATCCAACTACTTTTTACAGCCTTACAAGAAAATGAATTACAAATTAATCTAAAAAAATGTGAATTTTTGTGTTATAGTATTCATTTTCTTGAATTTATCATAAGCTGTGATGGTATTTCTGTTGATCCCAAAAAGATTGACTCTATTAGTTCTTGGCCACAACCAAAAACTCCAAAAGATATTCAATGCTTTTTAGGTTTAGCTTCTTTCTATAGAAGGTTCATCAAGAATTTCAGCACCATAGCAGCACCTTTGACAAACTGTTTAAAGAAAGGCAGTTTCCAATGGGGACAAGTAGAAGAAGACAGTTTCTGTAAATTAAAATTAGCCTTAGCTAGCCCCCCTGTTTTAAAACTACCAAACTTTGATAAACCATTTGAAGTCACTGTAGATGCTTCGGGTTTAGGTATAGGAGTTGTCCTTAGTCAAGAAGGTCACCCTTTAGAGTATTTTAGTGAAAAATTAAGTACTTATAGACAAAATTGGAGTACCTATGAACAAGAACTATATGCCCTAGTGCGTGCTCTAAAACAATGGGAACATTACCTACTTGCTAATACTTTCATCTTGCTCACTGACCACTTTTCTCTGAAATTTCTAAACTCCCAAAAAACAATTAGTAGAATGCATGCCCGATGGCTACAATTCTTGCAAAGATTTGATTTTGTGATTAAACATACTTCGGGTAAGTCTAATAAGGTTGCTGATGCTCTCAGCCGAAAAGGTATACTTCTCACCACCCTCCAATCTCAAATAATTGCTTTTGATCACCTTCATACACTGTACCCTACTGATATTGATTTTAGAAGTATTTGGGAAAATTGCTCTAACCATAAACCTTGTAAAGATTACCACATTGTTAATAGTTTTCTCTTTAAAGGTGATGTTTTGTGTGTCCCTCATACATCTTTAAGAGAAGCAATAATTAAAGAAACACATTCTAGGGGATTAGCAGGACATTTTGGCCGAGACAAAACTTTGGCTACAATTATCTCTAAATTCTTTTGGCCACAACTCAATAGGAATGTTACTAACTTCATCAAAAGATGCTCCATTTGCCAAACAGCCAAAGGTAACTCTCAAAATACAGGTTTGTACACTCCTTTACCTATTTCTTCTACAATTTGGGAGGATTTATCTATGGATTTTGTTTTAGGATTACCAAGAACCCAAAGGGGCCATGATTCTATTTTTGTAGTTGTAAATCGCTTTAGCAAAATGGCTCATTTTATTCCTTGCAAAAAAACTTCTGATGCTTTGAATATAGCTAACTTGTTTTTTCGAGAAATTGTTAGATTGCATGGAATTCCTAAAACTATAGTTTCGGATAGAGATGTAAAATTCTTGAGCCATTTTTGGAGATCCCTATGTAAGAAATTCGACACCAACCTCCTGTTTAGTACCGCCAGCCACCCACAAACTGACGGCCAAACAGAGGTGACCAACCGAACTCTTGGCAACCTCATTCGATGCCTTAGCGGGGACAAGCCTAAACAATGGGACCTAGCTTTACCTCAAGCATAA

Coding sequence (CDS)

ATGCCCTTTGGTCTTTCTAATGCCCCTAGCACCTTTATGAGATTGATGAACCAAATCCTACAACCTTTTTTGAATCAATTCATTGTGGTATACTTTGATGACATACTCATTTATAGCTCATCTAGGGAAGATCACCTAAAACACATCCAACTACTTTTTACAGCCTTACAAGAAAATGAATTACAAATTAATCTAAAAAAATGTGAATTTTTGTGTTATAGTATTCATTTTCTTGAATTTATCATAAGCTGTGATGGTATTTCTGTTGATCCCAAAAAGATTGACTCTATTAGTTCTTGGCCACAACCAAAAACTCCAAAAGATATTCAATGCTTTTTAGGTTTAGCTTCTTTCTATAGAAGGTTCATCAAGAATTTCAGCACCATAGCAGCACCTTTGACAAACTGTTTAAAGAAAGGCAGTTTCCAATGGGGACAAGTAGAAGAAGACAGTTTCTGTAAATTAAAATTAGCCTTAGCTAGCCCCCCTGTTTTAAAACTACCAAACTTTGATAAACCATTTGAAGTCACTGTAGATGCTTCGGGTTTAGGTATAGGAGTTGTCCTTAGTCAAGAAGGTCACCCTTTAGAGTATTTTAGTGAAAAATTAAGTACTTATAGACAAAATTGGAGTACCTATGAACAAGAACTATATGCCCTAGTGCGTGCTCTAAAACAATGGGAACATTACCTACTTGCTAATACTTTCATCTTGCTCACTGACCACTTTTCTCTGAAATTTCTAAACTCCCAAAAAACAATTAGTAGAATGCATGCCCGATGGCTACAATTCTTGCAAAGATTTGATTTTGTGATTAAACATACTTCGGGTAAGTCTAATAAGGTTGCTGATGCTCTCAGCCGAAAAGGTATACTTCTCACCACCCTCCAATCTCAAATAATTGCTTTTGATCACCTTCATACACTGTACCCTACTGATATTGATTTTAGAAGTATTTGGGAAAATTGCTCTAACCATAAACCTTGTAAAGATTACCACATTGTTAATAGTTTTCTCTTTAAAGGTGATGTTTTGTGTGTCCCTCATACATCTTTAAGAGAAGCAATAATTAAAGAAACACATTCTAGGGGATTAGCAGGACATTTTGGCCGAGACAAAACTTTGGCTACAATTATCTCTAAATTCTTTTGGCCACAACTCAATAGGAATGTTACTAACTTCATCAAAAGATGCTCCATTTGCCAAACAGCCAAAGGTAACTCTCAAAATACAGGTTTGTACACTCCTTTACCTATTTCTTCTACAATTTGGGAGGATTTATCTATGGATTTTGTTTTAGGATTACCAAGAACCCAAAGGGGCCATGATTCTATTTTTGTAGTTGTAAATCGCTTTAGCAAAATGGCTCATTTTATTCCTTGCAAAAAAACTTCTGATGCTTTGAATATAGCTAACTTGTTTTTTCGAGAAATTGTTAGATTGCATGGAATTCCTAAAACTATAGTTTCGGATAGAGATGTAAAATTCTTGAGCCATTTTTGGAGATCCCTATGTAAGAAATTCGACACCAACCTCCTGTTTAGTACCGCCAGCCACCCACAAACTGACGGCCAAACAGAGGTGACCAACCGAACTCTTGGCAACCTCATTCGATGCCTTAGCGGGGACAAGCCTAAACAATGGGACCTAGCTTTACCTCAAGCATAA

Protein sequence

MPFGLSNAPSTFMRLMNQILQPFLNQFIVVYFDDILIYSSSREDHLKHIQLLFTALQENELQINLKKCEFLCYSIHFLEFIISCDGISVDPKKIDSISSWPQPKTPKDIQCFLGLASFYRRFIKNFSTIAAPLTNCLKKGSFQWGQVEEDSFCKLKLALASPPVLKLPNFDKPFEVTVDASGLGIGVVLSQEGHPLEYFSEKLSTYRQNWSTYEQELYALVRALKQWEHYLLANTFILLTDHFSLKFLNSQKTISRMHARWLQFLQRFDFVIKHTSGKSNKVADALSRKGILLTTLQSQIIAFDHLHTLYPTDIDFRSIWENCSNHKPCKDYHIVNSFLFKGDVLCVPHTSLREAIIKETHSRGLAGHFGRDKTLATIISKFFWPQLNRNVTNFIKRCSICQTAKGNSQNTGLYTPLPISSTIWEDLSMDFVLGLPRTQRGHDSIFVVVNRFSKMAHFIPCKKTSDALNIANLFFREIVRLHGIPKTIVSDRDVKFLSHFWRSLCKKFDTNLLFSTASHPQTDGQTEVTNRTLGNLIRCLSGDKPKQWDLALPQA
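The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS codon by codon. This is a minimal sketch that assumes the standard genetic code applies to this nuclear gene; the function name `translate` and the compact table construction are illustrative and not taken from the database.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: _AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(_BASES)
    for j, b in enumerate(_BASES)
    for k, c in enumerate(_BASES)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence in frame 0.

    Unknown codons (for example ones containing N) become 'X'.
    When to_stop is True, translation ends at the first stop codon
    and the stop itself is not included in the result.
    """
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*" and to_stop:
            break
        protein.append(residue)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGCCCTTTGGTCTTTCTAATGCCCCTAGC"))  # MPFGLSNAPS
```

The first ten codons give MPFGLSNAPS, which matches the start of the protein sequence, and the final four codons of the CDS (CCT CAA GCA TAA) give PQA followed by a stop.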
BLAST of CsaV3_1G014410 vs. NCBI nr
Match: XP_025979678.1 (uncharacterized protein LOC112997809 [Glycine max])

HSP 1 Score: 744.2 bits (1920), Expect = 3.2e-211
Identity = 341/554 (61.55%), Positives = 430/554 (77.62%), Query Frame = 0

Query: 1    MPFGLSNAPSTFMRLMNQILQPFLNQFIVVYFDDILIYSSSREDHLKHIQLLFTALQENE 60
            MPFGLSNAPSTFMRLMNQ+L+PF+  F+VVYFDDILIYS  +E+HL+H++L+   LQEN+
Sbjct: 722  MPFGLSNAPSTFMRLMNQVLRPFIGSFVVVYFDDILIYSKIKEEHLEHVRLVLQVLQENQ 781

Query: 61   LQINLKKCEFLCYSIHFLEFIISCDGISVDPKKIDSISSWPQPKTPKDIQCFLGLASFYR 120
            L INLKKC F    + FL F++  DGI VD +K+ +I  WP P +  +++ F GLA+FYR
Sbjct: 782  LYINLKKCTFSTNKLLFLGFVVGEDGIQVDEEKVRAIRDWPAPTSVTEVRSFHGLATFYR 841

Query: 121  RFIKNFSTIAAPLTNCLKKGSFQWGQVEEDSFCKLKLALASPPVLKLPNFDKPFEVTVDA 180
            RFI++FSTI AP+T CLKKG + WG  +E SF  +K  L + PVL LP+FDK F+V  DA
Sbjct: 842  RFIRDFSTITAPITECLKKGKYNWGFEQEQSFALIKEKLCTAPVLALPDFDKVFQVECDA 901

Query: 181  SGLGIGVVLSQEGHPLEYFSEKLSTYRQNWSTYEQELYALVRALKQWEHYLLANTFILLT 240
            SG+GIG VLSQE  P+ +FSEKLS  R+ WSTY+QE YA+ RAL+QWEHYL+   FIL T
Sbjct: 902  SGIGIGAVLSQEKKPIAFFSEKLSEARRKWSTYDQEFYAVFRALRQWEHYLIHREFILFT 961

Query: 241  DHFSLKFLNSQKTISRMHARWLQFLQRFDFVIKHTSGKSNKVADALSRKGILLTTLQSQI 300
            DH +LKFL+SQK I++MHARW+ FLQ+F F+I+H SG  NKVADALSR+  LL TL  ++
Sbjct: 962  DHQALKFLHSQKLINKMHARWVSFLQKFPFIIQHKSGALNKVADALSRRDSLLVTLAQEV 1021

Query: 301  IAFDHLHTLYPTDIDFRSIWENCSNHKPCKDYHIVNSFLFKGDVLCVPHTSLREAIIKET 360
            + F+ L  LY  D +F+ +W  C  H PC D+H+   FLFKG+ LC+P +SLRE +I++ 
Sbjct: 1022 VGFECLKELYENDAEFQELWAKCREH-PCDDFHVREGFLFKGNRLCIPCSSLREKLIRDL 1081

Query: 361  HSRGLAGHFGRDKTLATIISKFFWPQLNRNVTNFIKRCSICQTAKGNSQNTGLYTPLPIS 420
            H  GL+GH GRDKT+A++  +F+WP L ++    +K+C  CQ +KG SQNTGLY PLPI 
Sbjct: 1082 HGGGLSGHMGRDKTIASLEERFYWPHLRKDAGTIVKKCYTCQVSKGQSQNTGLYMPLPIP 1141

Query: 421  STIWEDLSMDFVLGLPRTQRGHDSIFVVVNRFSKMAHFIPCKKTSDALNIANLFFREIVR 480
              IW+DL+MDFVLGLPRTQRG DS+FVVV+RFSKM+HFI CKKT+DA NIA LFFRE+V 
Sbjct: 1142 DDIWQDLAMDFVLGLPRTQRGVDSVFVVVDRFSKMSHFIACKKTADASNIAKLFFREVVH 1201

Query: 481  LHGIPKTIVSDRDVKFLSHFWRSLCKKFDTNLLFSTASHPQTDGQTEVTNRTLGNLIRCL 540
            LHG+PK+I SDRD KFLSHFW +L K FDT+L  S+ +HPQTDGQTEVTNRTLGN+IRC+
Sbjct: 1202 LHGVPKSITSDRDTKFLSHFWITLWKLFDTSLNRSSTAHPQTDGQTEVTNRTLGNMIRCV 1261

Query: 541  SGDKPKQWDLALPQ 555
             GDKPKQWDLALPQ
Sbjct: 1262 CGDKPKQWDLALPQ 1274
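The percentages on each HSP summary line are the match counts divided by the alignment length. A minimal sketch for parsing such a line and recomputing the values is given below; `parse_hsp_stats` and `percent` are hypothetical helper names, and the pattern also accepts the "Postives" spelling that some BLAST result displays emit.

```python
import re

# Matches "Identity = 341/554 (61.55%)" and the corresponding
# "Positives" field (also tolerating the spelling "Postives").
_FRACTION_RE = re.compile(
    r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\((\d+(?:\.\d+)?)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Return {'identity': (matches, length, pct), 'positives': (...)}."""
    stats = {}
    for label, matches, length, pct in _FRACTION_RE.findall(line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = (int(matches), int(length), float(pct))
    return stats


def percent(matches: int, length: int) -> float:
    """Percentage of alignment columns, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


line = "Identity = 341/554 (61.55%), Positives = 430/554 (77.62%), Query Frame = 0"
stats = parse_hsp_stats(line)
for name, (matches, length, reported) in stats.items():
    print(name, reported, percent(matches, length))
```

For the Glycine max hit above, 341/554 and 430/554 recompute to 61.55% and 77.62%, in agreement with the reported values.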

BLAST of CsaV3_1G014410 vs. NCBI nr
Match: PKU71894.1 (RNA-directed DNA polymerase [Dendrobium catenatum])

HSP 1 Score: 729.2 bits (1881), Expect = 1.1e-206
Identity = 340/555 (61.26%), Positives = 425/555 (76.58%), Query Frame = 0

Query: 1    MPFGLSNAPSTFMRLMNQILQPFLNQFIVVYFDDILIYSSSREDHLKHIQLLFTALQENE 60
            MPFGL NAPSTFMRLM+++LQPF  +F V YFDDIL+YS+S EDH+ H+  LF  L+ ++
Sbjct: 789  MPFGLCNAPSTFMRLMSEVLQPFAGKFCVSYFDDILVYSNSLEDHILHLTRLFQTLRNSK 848

Query: 61   LQINLKKCEFLCYSIHFLEFIISCDGISVDPKKIDSISSWPQPKTPKDIQCFLGLASFYR 120
            L +NL KCEF    ++FL F++S +GI VDP+K+ ++  WP PK+  DI+ F GLA+FYR
Sbjct: 849  LYLNLPKCEFATTQVYFLGFVVSREGIQVDPRKVSAVREWPVPKSLSDIRSFHGLANFYR 908

Query: 121  RFIKNFSTIAAPLTNCLKKGSFQWGQVEEDSFCKLKLALASPPVLKLPNFDKPFEVTVDA 180
            RFI+ FS I AP+T+ LK  SF W   ++ SF  +K AL+S P+L LPNF+KPF+V  DA
Sbjct: 909  RFIRGFSLIMAPITDVLKGHSFIWSTTQQQSFENIKKALSSAPILALPNFEKPFQVDTDA 968

Query: 181  SGLGIGVVLSQEGHPLEYFSEKLSTYRQNWSTYEQELYALVRALKQWEHYLLANTFILLT 240
            SG+GIG VL QE  PLEYFSEKLST RQ W+ YEQELYA+VRALKQWEHYLL   F+L +
Sbjct: 969  SGIGIGAVLYQEDRPLEYFSEKLSTSRQKWTVYEQELYAVVRALKQWEHYLLHQDFVLCS 1028

Query: 241  DHFSLKFLNSQKTISRMHARWLQFLQRFDFVIKHTSGKSNKVADALSRKGILLTTLQSQI 300
            DH SL++LNSQKTI+RMHARW+ FLQRF FVI+H +GK N+VADALSR+  LL  LQ+++
Sbjct: 1029 DHQSLQYLNSQKTINRMHARWVIFLQRFSFVIRHKTGKMNRVADALSRRAALLVQLQTEV 1088

Query: 301  IAFDHLHTLYPTDIDFRSIWENCSNHKPCKDYHIVNSFLFKGDVLCVPHTSLREAIIKET 360
               + + +LY TD DF   W +C+  KP  D+ + + FLFKG++LCVP +S R  +I+E 
Sbjct: 1089 TGLEEMKSLYDTDEDFAIPWASCNAGKPEADFSLRHGFLFKGNLLCVPASSWRHQLIREI 1148

Query: 361  HSRGLAGHFGRDKTLATIISKFFWPQLNRNVTNFIKRCSICQTAKGNSQNTGLYTPLPIS 420
            H  GLA H GRDKT+  +  +FFWP L R+VT  I+RC+ CQT KG +QNTGLY PLP+ 
Sbjct: 1149 HCNGLAAHVGRDKTVQQLQMRFFWPHLKRDVTRLIERCATCQTYKGTAQNTGLYLPLPVP 1208

Query: 421  STIWEDLSMDFVLGLPRTQRGHDSIFVVVNRFSKMAHFIPCKKTSDALNIANLFFREIVR 480
             +IWEDLS+DFVLGLPRT+RG DSI VVV+RFSKMAHFIPCKKT DALNIA LFF+EIVR
Sbjct: 1209 DSIWEDLSLDFVLGLPRTKRGSDSIMVVVDRFSKMAHFIPCKKTFDALNIAKLFFKEIVR 1268

Query: 481  LHGIPKTIVSDRDVKFLSHFWRSLCKKFDTNLLFSTASHPQTDGQTEVTNRTLGNLIRCL 540
            LHGIP+++ SDRDVKF+SHFWR L KKF T+L  S+  HPQTDGQTEV NRTL  ++RCL
Sbjct: 1269 LHGIPRSLTSDRDVKFISHFWRELWKKFQTDLKLSSTYHPQTDGQTEVVNRTLAAMLRCL 1328

Query: 541  SGDKPKQWDLALPQA 556
              D PK W+  L QA
Sbjct: 1329 VQDNPKHWEEVLGQA 1343

BLAST of CsaV3_1G014410 vs. NCBI nr
Match: PWA81295.1 (transposon Ty3-I Gag-Pol polyprotein [Artemisia annua])

HSP 1 Score: 723.0 bits (1865), Expect = 7.7e-205
Identity = 326/555 (58.74%), Positives = 432/555 (77.84%), Query Frame = 0

Query: 1    MPFGLSNAPSTFMRLMNQILQPFLNQFIVVYFDDILIYSSSREDHLKHIQLLFTALQENE 60
            MPFGLSNAPSTFMRLM Q+L+PF+ +F+VVYFDDIL+YS + ++HL H++ +  AL ENE
Sbjct: 786  MPFGLSNAPSTFMRLMTQVLRPFMGKFVVVYFDDILVYSQTEKEHLDHLRKVLKALTENE 845

Query: 61   LQINLKKCEFLCYSIHFLEFIISCDGISVDPKKIDSISSWPQPKTPKDIQCFLGLASFYR 120
            L +NLKKC FL   + FL +I+S DGI VD  K+ ++  WP PKT  +++ F GLA+FYR
Sbjct: 846  LFVNLKKCTFLTNKLLFLGYIVSSDGIHVDEDKVKAVRDWPSPKTLTEVRSFHGLATFYR 905

Query: 121  RFIKNFSTIAAPLTNCLKKGSFQWGQVEEDSFCKLKLALASPPVLKLPNFDKPFEVTVDA 180
            RF++NFS+I AP+TNC+KKG F+W Q  E+SF  +K  L + PVL LPNFD  FE+  DA
Sbjct: 906  RFVRNFSSIVAPITNCMKKGPFKWTQEAEESFKIIKERLTTAPVLSLPNFDNVFELECDA 965

Query: 181  SGLGIGVVLSQEGHPLEYFSEKLSTYRQNWSTYEQELYALVRALKQWEHYLLANTFILLT 240
             G GIG VLSQEG P+ + SEKL+  RQ WSTYEQELYA+V+A+K+WEHYL+   F++ +
Sbjct: 966  CGTGIGAVLSQEGRPVAFHSEKLNEARQKWSTYEQELYAVVQAMKKWEHYLIQREFVVYS 1025

Query: 241  DHFSLKFLNSQKTISRMHARWLQFLQRFDFVIKHTSGKSNKVADALSRKGILLTTLQSQI 300
            DH +LK+  +Q+ ++++HARW  FL++F++VIKH SG SNKVADALSRK  LL T+ + +
Sbjct: 1026 DHQALKYFQTQRHLNKIHARWASFLEKFNYVIKHKSGASNKVADALSRKTTLLVTISNDV 1085

Query: 301  IAFDHLHTLYPTDIDFRSIWENCSNHKPCKDYHIVNSFLFKGDVLCVPHTSLREAIIKET 360
            + F+ +  LY  D DFRS WE     +   ++ +++ +LFKG+ LC+P TSLR  +IKE 
Sbjct: 1086 VGFESIKGLYENDEDFRSTWEEIETKQHRGEFLLLDGYLFKGNRLCIPKTSLRSQLIKEV 1145

Query: 361  HSRGLAGHFGRDKTLATIISKFFWPQLNRNVTNFIKRCSICQTAKGNSQNTGLYTPLPIS 420
            H+ GL+ H GRDKT+A++ S+F+WPQL R+V +F++RC +CQ  KG +QNTGLY PLP+ 
Sbjct: 1146 HAGGLSAHLGRDKTIASMESRFYWPQLKRDVGSFVRRCVVCQEGKGKAQNTGLYMPLPVP 1205

Query: 421  STIWEDLSMDFVLGLPRTQRGHDSIFVVVNRFSKMAHFIPCKKTSDALNIANLFFREIVR 480
             + W D+SMDFVLGLPRTQRG DS+FVVV+RFSKMAHFIPCKKTSDA +IA LFF+E+VR
Sbjct: 1206 ESPWVDISMDFVLGLPRTQRGVDSVFVVVDRFSKMAHFIPCKKTSDAAHIARLFFQEVVR 1265

Query: 481  LHGIPKTIVSDRDVKFLSHFWRSLCKKFDTNLLFSTASHPQTDGQTEVTNRTLGNLIRCL 540
            LHG+PK+I SDRD KFL+HFW +L ++  T+L FS+ +HPQTDGQTEV NRTLGN+IRCL
Sbjct: 1266 LHGVPKSITSDRDSKFLAHFWLTLWRRLGTSLNFSSTAHPQTDGQTEVVNRTLGNMIRCL 1325

Query: 541  SGDKPKQWDLALPQA 556
             G+KPK WD++L QA
Sbjct: 1326 CGEKPKLWDVSLAQA 1340

BLAST of CsaV3_1G014410 vs. NCBI nr
Match: PKU72440.1 (RNA-directed DNA polymerase [Dendrobium catenatum])

HSP 1 Score: 721.8 bits (1862), Expect = 1.7e-204
Identity = 342/557 (61.40%), Positives = 425/557 (76.30%), Query Frame = 0

Query: 1   MPFGLSNAPSTFMRLMNQILQPFLNQFIVVYFDDILIYSSSREDHLKHIQLLFTALQENE 60
           MPFGL NAP+TFMRLMN+IL+  L ++ VVYFDDILIYS S E+H  H+  +   LQE++
Sbjct: 276 MPFGLCNAPATFMRLMNEILKSVLGKYCVVYFDDILIYSQSIEEHRVHLSNVLAILQEHK 335

Query: 61  LQINLKKCEFLCYSIHFLEFIISCDGISVDPKKIDSISSWPQPKTPKDIQCFLGLASFYR 120
           L INL KCEF   ++HFL FIIS  G+  DP+KI +I+ WP P++  +++ F+GLA+FYR
Sbjct: 336 LYINLPKCEFATGTVHFLGFIISKQGVHTDPQKIQAITEWPAPRSLTEVRSFIGLANFYR 395

Query: 121 RFIKNFSTIAAPLTNCLKKGSFQWGQVEEDSFCKLKLALASPPVLKLPNFDKPFEVTVDA 180
           RFI+ FS I AP+TNCLK  SFQW   +E SF  LK AL S PVL +P+F+KPF V  DA
Sbjct: 396 RFIRGFSIIVAPITNCLKGKSFQWTAEQERSFSTLKAALTSAPVLAVPDFNKPFHVDTDA 455

Query: 181 SGLGIGVVLSQEGHPLEYFSEKLSTYRQNWSTYEQELYALVRALKQWEHYLLANTFILLT 240
           S +G+G VLSQE  P+E+FSEKLS  RQNWS YEQELYA+VRALKQWEHYLL   F+L +
Sbjct: 456 SAVGVGAVLSQEDKPIEFFSEKLSPARQNWSAYEQELYAVVRALKQWEHYLLHQDFVLRS 515

Query: 241 DHFSLKFLNSQKTISRMHARWLQFLQRFDFVIKHTSGKSNKVADALSRKGILLTTLQSQI 300
           D+ +L+F+NSQKTI++MHARWL FLQRF FV++H SG  N+VADALSR+  LLT LQ   
Sbjct: 516 DNHALQFINSQKTINKMHARWLTFLQRFSFVLRHKSGAQNRVADALSRRTALLTKLQVDA 575

Query: 301 IAFDHLHTLYPTDIDFRSIWENCSNH--KPCKDYHIVNSFLFKGDVLCVPHTSLREAIIK 360
                L  LY TD DF   W   +NH  +PC+D+ I ++FLFK ++LCVP +S RE +IK
Sbjct: 576 PGLQALQELYATDHDFADPWNQLTNHPSRPCRDFSIRHNFLFKDNLLCVPASSWREHLIK 635

Query: 361 ETHSRGLAGHFGRDKTLATIISKFFWPQLNRNVTNFIKRCSICQTAKGNSQNTGLYTPLP 420
           E HS GL+ H GR KTL  + S+F+WP L R+V  F++RC ICQ  KGN+QNTGLYTPLP
Sbjct: 636 ELHSGGLSAHVGRTKTLEQMQSRFYWPHLRRDVVRFVERCPICQLYKGNAQNTGLYTPLP 695

Query: 421 ISSTIWEDLSMDFVLGLPRTQRGHDSIFVVVNRFSKMAHFIPCKKTSDALNIANLFFREI 480
           +  +IWEDLSMDF+LGLPRT+RG DSI VVV+RFSKMAHF+ CKKT DALN+A LFF+EI
Sbjct: 696 VLQSIWEDLSMDFILGLPRTKRGSDSIMVVVDRFSKMAHFLACKKTFDALNVAILFFKEI 755

Query: 481 VRLHGIPKTIVSDRDVKFLSHFWRSLCKKFDTNLLFSTASHPQTDGQTEVTNRTLGNLIR 540
           VRLHGIP++I SDRDVKF+SHFWR L K+  T +  S+A HPQTDGQTEV NRTL N++R
Sbjct: 756 VRLHGIPRSITSDRDVKFVSHFWRELWKRLRTEIKLSSAYHPQTDGQTEVVNRTLSNMLR 815

Query: 541 CLSGDKPKQWDLALPQA 556
           CL+ + PKQWD  + QA
Sbjct: 816 CLAHENPKQWDDYIGQA 832

BLAST of CsaV3_1G014410 vs. NCBI nr
Match: PKU81400.1 (RNA-directed DNA polymerase [Dendrobium catenatum])

HSP 1 Score: 718.0 bits (1852), Expect = 2.5e-203
Identity = 329/555 (59.28%), Positives = 425/555 (76.58%), Query Frame = 0

Query: 1   MPFGLSNAPSTFMRLMNQILQPFLNQFIVVYFDDILIYSSSREDHLKHIQLLFTALQENE 60
           MPFGL NAPSTFMRLM+ +L+P+L++  VVYFDDIL++SSS E+HL  +Q +F  L+E++
Sbjct: 157 MPFGLCNAPSTFMRLMHDVLKPYLDKCCVVYFDDILVFSSSFEEHLTQLQSIFETLREHQ 216

Query: 61  LQINLKKCEFLCYSIHFLEFIISCDGISVDPKKIDSISSWPQPKTPKDIQCFLGLASFYR 120
           L INL KC+F    IHFL FI+S DGI   P+KI +I  WP PK+  +++ F GLA++YR
Sbjct: 217 LLINLDKCDFAAADIHFLGFILSSDGIRTSPQKIAAIRDWPTPKSITEVRSFHGLANYYR 276

Query: 121 RFIKNFSTIAAPLTNCLKKGSFQWGQVEEDSFCKLKLALASPPVLKLPNFDKPFEVTVDA 180
           RFI+ F  I AP+TNCL+  +F WG  ++ SF  +K AL++ P+L  P+FDKPF+V  DA
Sbjct: 277 RFIRGFGEITAPITNCLRSSTFSWGPNQQQSFELIKAALSTAPMLAFPDFDKPFQVDTDA 336

Query: 181 SGLGIGVVLSQEGHPLEYFSEKLSTYRQNWSTYEQELYALVRALKQWEHYLLANTFILLT 240
           S +G+G VLSQ+  P+EYFSEKLS  RQ WS YEQELYA+VRALKQWEHYLL   F+L +
Sbjct: 337 SAIGVGAVLSQDDKPVEYFSEKLSASRQKWSAYEQELYAVVRALKQWEHYLLHRDFVLCS 396

Query: 241 DHFSLKFLNSQKTISRMHARWLQFLQRFDFVIKHTSGKSNKVADALSRKGILLTTLQSQI 300
           D+ +L+++NSQK ++RMHARW+ F QRF FVIKH +G+SN+VADALSR+  LL  LQ++I
Sbjct: 397 DNHALQYINSQKNVNRMHARWIVFFQRFTFVIKHKAGRSNRVADALSRRSALLVRLQTEI 456

Query: 301 IAFDHLHTLYPTDIDFRSIWENCSNHKPCKDYHIVNSFLFKGDVLCVPHTSLREAIIKET 360
                L  LY +D DF  +WE C+   P  DY +   FLFKG+ LC+P +S R+ +I+E 
Sbjct: 457 TGLQTLQELYNSDKDFSQVWEVCNAGHPTLDYSLQQGFLFKGNSLCIPDSSWRQQLIREL 516

Query: 361 HSRGLAGHFGRDKTLATIISKFFWPQLNRNVTNFIKRCSICQTAKGNSQNTGLYTPLPIS 420
           H  GLA H GRDKT++ + S+FFWP L R+VT FI++C +CQ  KG +QNTGLY PLPI 
Sbjct: 517 HCGGLAAHLGRDKTISILQSRFFWPHLRRDVTRFIEKCFVCQRFKGTAQNTGLYLPLPIP 576

Query: 421 STIWEDLSMDFVLGLPRTQRGHDSIFVVVNRFSKMAHFIPCKKTSDALNIANLFFREIVR 480
            +IWEDLS+DFVLGLPRT+RG DSI VVV+RFSKMAHFIPCKKT DA+NIA LFF+EIVR
Sbjct: 577 DSIWEDLSLDFVLGLPRTKRGSDSIMVVVDRFSKMAHFIPCKKTFDAVNIAQLFFKEIVR 636

Query: 481 LHGIPKTIVSDRDVKFLSHFWRSLCKKFDTNLLFSTASHPQTDGQTEVTNRTLGNLIRCL 540
           LHG+P+++ SDRDVKF+SHFWR L K+ +T+L  S+A HPQTDGQTEV NRTLGN++RCL
Sbjct: 637 LHGVPRSLTSDRDVKFVSHFWRELWKRLNTDLKLSSAYHPQTDGQTEVVNRTLGNMLRCL 696

Query: 541 SGDKPKQWDLALPQA 556
             D P++W+  L  A
Sbjct: 697 VQDNPRKWEEMLCHA 711

BLAST of CsaV3_1G014410 vs. TAIR10
Match: ATMG00860.1 (DNA/RNA polymerases superfamily protein)

HSP 1 Score: 106.3 bits (264), Expect = 6.2e-23
Identity = 51/135 (37.78%), Positives = 80/135 (59.26%), Query Frame = 0

Query: 46  LKHIQLLFTALQENELQINLKKCEFLCYSIHFL--EFIISCDGISVDPKKIDSISSWPQP 105
           + H+ ++    ++++   N KKC F    I +L    IIS +G+S DP K++++  WP+P
Sbjct: 1   MNHLGMVLQIWEQHQFYANRKKCAFGQPQIAYLGHRHIISGEGVSADPAKLEAMVGWPEP 60

Query: 106 KTPKDIQCFLGLASFYRRFIKNFSTIAAPLTNCLKKGSFQWGQVEEDSFCKLKLALASPP 165
           K   +++ FLGL  +YRRF+KN+  I  PLT  LKK S +W ++   +F  LK A+ + P
Sbjct: 61  KNTTELRGFLGLTGYYRRFVKNYGKIVRPLTELLKKNSLKWTEMAALAFKALKGAVTTLP 120

Query: 166 VLKLPNFDKPFEVTV 179
           VL LP+   PF   V
Sbjct: 121 VLALPDLKLPFVTRV 135

BLAST of CsaV3_1G014410 vs. Swiss-Prot
Match: sp|Q99315|YG31B_YEAST (Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-G PE=1 SV=3)

HSP 1 Score: 350.9 bits (899), Expect = 2.6e-95
Identity = 216/586 (36.86%), Positives = 310/586 (52.90%), Query Frame = 0

Query: 1    MPFGLSNAPSTFMRLMNQILQPFLNQFIVVYFDDILIYSSSREDHLKHIQLLFTALQENE 60
            MPFGL NAPSTF R M    +    +F+ VY DDILI+S S E+H KH+  +   L+   
Sbjct: 718  MPFGLVNAPSTFARYMADTFRDL--RFVNVYLDDILIFSESPEEHWKHLDTVLERLKNEN 777

Query: 61   LQINLKKCEFLCYSIHFLEFIISCDGISVDPKKIDSISSWPQPKTPKDIQCFLGLASFYR 120
            L +  KKC+F      FL + I    I+    K  +I  +P PKT K  Q FLG+ ++YR
Sbjct: 778  LIVKKKKCKFASEETEFLGYSIGIQKIAPLQHKCAAIRDFPTPKTVKQAQRFLGMINYYR 837

Query: 121  RFIKNFSTIAAPLTNCLKKGSFQWGQVEEDSFCKLKLALASPPVLKLPNFDKPFEVTVDA 180
            RFI N S IA P+   +   S QW + ++ +  KLK AL + PVL   N    + +T DA
Sbjct: 838  RFIPNCSKIAQPIQLFICDKS-QWTEKQDKAIDKLKDALCNSPVLVPFNNKANYRLTTDA 897

Query: 181  SGLGIGVVLSQEGHP------LEYFSEKLSTYRQNWSTYEQELYALVRALKQWEHYLLAN 240
            S  GIG VL +  +       + YFS+ L + ++N+   E EL  +++AL  + + L   
Sbjct: 898  SKDGIGAVLEEVDNKNKLVGVVGYFSKSLESAQKNYPAGELELLGIIKALHHFRYMLHGK 957

Query: 241  TFILLTDHFSLKFLNSQKTISRMHARWLQFLQRFDFVIKHTSGKSNKVADALSRKGILLT 300
             F L TDH SL  L ++   +R   RWL  L  +DF +++ +G  N VADA+SR    +T
Sbjct: 958  HFTLRTDHISLLSLQNKNEPARRVQRWLDDLATYDFTLEYLAGPKNVVADAISRAVYTIT 1017

Query: 301  TLQSQIIAFD-----------------HL-----HTLYPTDID-FRSIWENCSNHKPC-K 360
               S+ I  +                 H+     H + P D+  FRS  +     +   K
Sbjct: 1018 PETSRPIDTESWKSYYKSDPLCSAVLIHMKELTQHNVTPEDMSAFRSYQKKLELSETFRK 1077

Query: 361  DYHIVNSFLFKGDVLCVPHTSLREAIIKETHSRGL-AGHFGRDKTLATIISKFFWPQLNR 420
            +Y + +  ++  D L VP    + A+++  H   L  GHFG   TLA I   ++WP+L  
Sbjct: 1078 NYSLEDEMIYYQDRLVVP-IKQQNAVMRLYHDHTLFGGHFGVTVTLAKISPIYYWPKLQH 1137

Query: 421  NVTNFIKRCSICQTAKGNSQNT-GLYTPLPISSTIWEDLSMDFVLGLPRTQRGHDSIFVV 480
            ++  +I+ C  CQ  K +     GL  PLPI+   W D+SMDFV GLP T    + I VV
Sbjct: 1138 SIIQYIRTCVQCQLIKSHRPRLHGLLQPLPIAEGRWLDISMDFVTGLPPTSNNLNMILVV 1197

Query: 481  VNRFSKMAHFIPCKKTSDALNIANLFFREIVRLHGIPKTIVSDRDVKFLSHFWRSLCKKF 540
            V+RFSK AHFI  +KT DA  + +L FR I   HG P+TI SDRDV+  +  ++ L K+ 
Sbjct: 1198 VDRFSKRAHFIATRKTLDATQLIDLLFRYIFSYHGFPRTITSDRDVRMTADKYQELTKRL 1257

Query: 541  DTNLLFSTASHPQTDGQTEVTNRTLGNLIRCLSGDKPKQWDLALPQ 555
                  S+A+HPQTDGQ+E T +TL  L+R  +    + W + LPQ
Sbjct: 1258 GIKSTMSSANHPQTDGQSERTIQTLNRLLRAYASTNIQNWHVYLPQ 1299

BLAST of CsaV3_1G014410 vs. Swiss-Prot
Match: sp|Q7LHG5|YI31B_YEAST (Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-I PE=3 SV=2)

HSP 1 Score: 350.1 bits (897), Expect = 4.4e-95
Identity = 216/586 (36.86%), Positives = 309/586 (52.73%), Query Frame = 0

Query: 1    MPFGLSNAPSTFMRLMNQILQPFLNQFIVVYFDDILIYSSSREDHLKHIQLLFTALQENE 60
            MPFGL NAPSTF R M    +    +F+ VY DDILI+S S E+H KH+  +   L+   
Sbjct: 744  MPFGLVNAPSTFARYMADTFRDL--RFVNVYLDDILIFSESPEEHWKHLDTVLERLKNEN 803

Query: 61   LQINLKKCEFLCYSIHFLEFIISCDGISVDPKKIDSISSWPQPKTPKDIQCFLGLASFYR 120
            L +  KKC+F      FL + I    I+    K  +I  +P PKT K  Q FLG+ ++YR
Sbjct: 804  LIVKKKKCKFASEETEFLGYSIGIQKIAPLQHKCAAIRDFPTPKTVKQAQRFLGMINYYR 863

Query: 121  RFIKNFSTIAAPLTNCLKKGSFQWGQVEEDSFCKLKLALASPPVLKLPNFDKPFEVTVDA 180
            RFI N S IA P+   +   S QW + ++ +  KLK AL + PVL   N    + +T DA
Sbjct: 864  RFIPNCSKIAQPIQLFICDKS-QWTEKQDKAIEKLKAALCNSPVLVPFNNKANYRLTTDA 923

Query: 181  SGLGIGVVLSQEGHP------LEYFSEKLSTYRQNWSTYEQELYALVRALKQWEHYLLAN 240
            S  GIG VL +  +       + YFS+ L + ++N+   E EL  +++AL  + + L   
Sbjct: 924  SKDGIGAVLEEVDNKNKLVGVVGYFSKSLESAQKNYPAGELELLGIIKALHHFRYMLHGK 983

Query: 241  TFILLTDHFSLKFLNSQKTISRMHARWLQFLQRFDFVIKHTSGKSNKVADALSRKGILLT 300
             F L TDH SL  L ++   +R   RWL  L  +DF +++ +G  N VADA+SR    +T
Sbjct: 984  HFTLRTDHISLLSLQNKNEPARRVQRWLDDLATYDFTLEYLAGPKNVVADAISRAIYTIT 1043

Query: 301  TLQSQIIAFD-----------------HL-----HTLYPTDID-FRSIWENCSNHKPC-K 360
               S+ I  +                 H+     H + P D+  FRS  +     +   K
Sbjct: 1044 PETSRPIDTESWKSYYKSDPLCSAVLIHMKELTQHNVTPEDMSAFRSYQKKLELSETFRK 1103

Query: 361  DYHIVNSFLFKGDVLCVPHTSLREAIIKETHSRGL-AGHFGRDKTLATIISKFFWPQLNR 420
            +Y + +  ++  D L VP    + A+++  H   L  GHFG   TLA I   ++WP+L  
Sbjct: 1104 NYSLEDEMIYYQDRLVVP-IKQQNAVMRLYHDHTLFGGHFGVTVTLAKISPIYYWPKLQH 1163

Query: 421  NVTNFIKRCSICQTAKGNSQNT-GLYTPLPISSTIWEDLSMDFVLGLPRTQRGHDSIFVV 480
            ++  +I+ C  CQ  K +     GL  PLPI+   W D+SMDFV GLP T    + I VV
Sbjct: 1164 SIIQYIRTCVQCQLIKSHRPRLHGLLQPLPIAEGRWLDISMDFVTGLPPTSNNLNMILVV 1223

Query: 481  VNRFSKMAHFIPCKKTSDALNIANLFFREIVRLHGIPKTIVSDRDVKFLSHFWRSLCKKF 540
            V+RFSK AHFI  +KT DA  + +L FR I   HG P+TI SDRDV+  +  ++ L K+ 
Sbjct: 1224 VDRFSKRAHFIATRKTLDATQLIDLLFRYIFSYHGFPRTITSDRDVRMTADKYQELTKRL 1283

Query: 541  DTNLLFSTASHPQTDGQTEVTNRTLGNLIRCLSGDKPKQWDLALPQ 555
                  S+A+HPQTDGQ+E T +TL  L+R       + W + LPQ
Sbjct: 1284 GIKSTMSSANHPQTDGQSERTIQTLNRLLRAYVSTNIQNWHVYLPQ 1325

BLAST of CsaV3_1G014410 vs. Swiss-Prot
Match: sp|P0CT41|TF212_SCHPO (Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-12 PE=3 SV=1)

HSP 1 Score: 339.7 bits (870), Expect = 6.0e-92
Identity = 201/580 (34.66%), Positives = 304/580 (52.41%), Query Frame = 0

Query: 1    MPFGLSNAPSTFMRLMNQILQPFLNQFIVVYFDDILIYSSSREDHLKHIQLLFTALQENE 60
            MP+G+S AP+ F   +N IL       +V Y DDILI+S S  +H+KH++ +   L+   
Sbjct: 534  MPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNAN 593

Query: 61   LQINLKKCEFLCYSIHFLEFIISCDGISVDPKKIDSISSWPQPKTPKDIQCFLGLASFYR 120
            L IN  KCEF    + F+ + IS  G +   + ID +  W QPK  K+++ FLG  ++ R
Sbjct: 594  LIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLR 653

Query: 121  RFIKNFSTIAAPLTNCLKKG-SFQWGQVEEDSFCKLKLALASPPVLKLPNFDKPFEVTVD 180
            +FI   S +  PL N LKK   ++W   +  +   +K  L SPPVL+  +F K   +  D
Sbjct: 654  KFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETD 713

Query: 181  ASGLGIGVVLSQEG-----HPLEYFSEKLSTYRQNWSTYEQELYALVRALKQWEHYL--L 240
            AS + +G VLSQ+      +P+ Y+S K+S  + N+S  ++E+ A++++LK W HYL   
Sbjct: 714  ASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLEST 773

Query: 241  ANTFILLTDHFSL--KFLNSQKTISRMHARWLQFLQRFDFVIKHTSGKSNKVADALSRKG 300
               F +LTDH +L  +  N  +  ++  ARW  FLQ F+F I +  G +N +ADALSR  
Sbjct: 774  IEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSR-- 833

Query: 301  ILLTT-------------LQSQIIAFD----HLHTLYPTDIDFRSIWEN----CSNHKPC 360
            I+  T               +QI   D     + T Y  D    ++  N       +   
Sbjct: 834  IVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQL 893

Query: 361  KDYHIVNSFLFKGDVLCVPHTSLREAIIKETHSRGLAGHFGRDKTLATIISKFFWPQLNR 420
            KD  ++NS   K  +L    T L   IIK+ H  G   H G +     I+ +F W  + +
Sbjct: 894  KDGLLINS---KDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRK 953

Query: 421  NVTNFIKRCSICQTAKG-NSQNTGLYTPLPISSTIWEDLSMDFVLGLPRTQRGHDSIFVV 480
             +  +++ C  CQ  K  N +  G   P+P S   WE LSMDF+  LP +  G++++FVV
Sbjct: 954  QIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESS-GYNALFVV 1013

Query: 481  VNRFSKMAHFIPCKKTSDALNIANLFFREIVRLHGIPKTIVSDRDVKFLSHFWRSLCKKF 540
            V+RFSKMA  +PC K+  A   A +F + ++   G PK I++D D  F S  W+    K+
Sbjct: 1014 VDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKY 1073

Query: 541  DTNLLFSTASHPQTDGQTEVTNRTLGNLIRCLSGDKPKQW 549
            +  + FS    PQTDGQTE TN+T+  L+RC+    P  W
Sbjct: 1074 NFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTW 1107

BLAST of CsaV3_1G014410 vs. Swiss-Prot
Match: sp|P0CT34|TF21_SCHPO (Transposon Tf2-1 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-1 PE=3 SV=1)

HSP 1 Score: 339.7 bits (870), Expect = 6.0e-92
Identity = 201/580 (34.66%), Positives = 304/580 (52.41%), Query Frame = 0

Query: 1    MPFGLSNAPSTFMRLMNQILQPFLNQFIVVYFDDILIYSSSREDHLKHIQLLFTALQENE 60
            MP+G+S AP+ F   +N IL       +V Y DDILI+S S  +H+KH++ +   L+   
Sbjct: 534  MPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNAN 593

Query: 61   LQINLKKCEFLCYSIHFLEFIISCDGISVDPKKIDSISSWPQPKTPKDIQCFLGLASFYR 120
            L IN  KCEF    + F+ + IS  G +   + ID +  W QPK  K+++ FLG  ++ R
Sbjct: 594  LIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLR 653

Query: 121  RFIKNFSTIAAPLTNCLKKG-SFQWGQVEEDSFCKLKLALASPPVLKLPNFDKPFEVTVD 180
            +FI   S +  PL N LKK   ++W   +  +   +K  L SPPVL+  +F K   +  D
Sbjct: 654  KFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETD 713

Query: 181  ASGLGIGVVLSQEG-----HPLEYFSEKLSTYRQNWSTYEQELYALVRALKQWEHYL--L 240
            AS + +G VLSQ+      +P+ Y+S K+S  + N+S  ++E+ A++++LK W HYL   
Sbjct: 714  ASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLEST 773

Query: 241  ANTFILLTDHFSL--KFLNSQKTISRMHARWLQFLQRFDFVIKHTSGKSNKVADALSRKG 300
               F +LTDH +L  +  N  +  ++  ARW  FLQ F+F I +  G +N +ADALSR  
Sbjct: 774  IEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSR-- 833

Query: 301  ILLTT-------------LQSQIIAFD----HLHTLYPTDIDFRSIWEN----CSNHKPC 360
            I+  T               +QI   D     + T Y  D    ++  N       +   
Sbjct: 834  IVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQL 893

Query: 361  KDYHIVNSFLFKGDVLCVPHTSLREAIIKETHSRGLAGHFGRDKTLATIISKFFWPQLNR 420
            KD  ++NS   K  +L    T L   IIK+ H  G   H G +     I+ +F W  + +
Sbjct: 894  KDGLLINS---KDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRK 953

Query: 421  NVTNFIKRCSICQTAKG-NSQNTGLYTPLPISSTIWEDLSMDFVLGLPRTQRGHDSIFVV 480
             +  +++ C  CQ  K  N +  G   P+P S   WE LSMDF+  LP +  G++++FVV
Sbjct: 954  QIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESS-GYNALFVV 1013

Query: 481  VNRFSKMAHFIPCKKTSDALNIANLFFREIVRLHGIPKTIVSDRDVKFLSHFWRSLCKKF 540
            V+RFSKMA  +PC K+  A   A +F + ++   G PK I++D D  F S  W+    K+
Sbjct: 1014 VDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKY 1073

Query: 541  DTNLLFSTASHPQTDGQTEVTNRTLGNLIRCLSGDKPKQW 549
            +  + FS    PQTDGQTE TN+T+  L+RC+    P  W
Sbjct: 1074 NFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTW 1107

BLAST of CsaV3_1G014410 vs. Swiss-Prot
Match: sp|P0CT35|TF22_SCHPO (Transposon Tf2-2 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-2 PE=3 SV=1)

HSP 1 Score: 339.7 bits (870), Expect = 6.0e-92
Identity = 201/580 (34.66%), Positives = 304/580 (52.41%), Query Frame = 0

Query: 1    MPFGLSNAPSTFMRLMNQILQPFLNQFIVVYFDDILIYSSSREDHLKHIQLLFTALQENE 60
            MP+G+S AP+ F   +N IL       +V Y DDILI+S S  +H+KH++ +   L+   
Sbjct: 534  MPYGISTAPAHFQYFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNAN 593

Query: 61   LQINLKKCEFLCYSIHFLEFIISCDGISVDPKKIDSISSWPQPKTPKDIQCFLGLASFYR 120
            L IN  KCEF    + F+ + IS  G +   + ID +  W QPK  K+++ FLG  ++ R
Sbjct: 594  LIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLR 653

Query: 121  RFIKNFSTIAAPLTNCLKKG-SFQWGQVEEDSFCKLKLALASPPVLKLPNFDKPFEVTVD 180
            +FI   S +  PL N LKK   ++W   +  +   +K  L SPPVL+  +F K   +  D
Sbjct: 654  KFIPKTSQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETD 713

Query: 181  ASGLGIGVVLSQEG-----HPLEYFSEKLSTYRQNWSTYEQELYALVRALKQWEHYL--L 240
            AS + +G VLSQ+      +P+ Y+S K+S  + N+S  ++E+ A++++LK W HYL   
Sbjct: 714  ASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLEST 773

Query: 241  ANTFILLTDHFSL--KFLNSQKTISRMHARWLQFLQRFDFVIKHTSGKSNKVADALSRKG 300
               F +LTDH +L  +  N  +  ++  ARW  FLQ F+F I +  G +N +ADALSR  
Sbjct: 774  IEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSR-- 833

Query: 301  ILLTT-------------LQSQIIAFD----HLHTLYPTDIDFRSIWEN----CSNHKPC 360
            I+  T               +QI   D     + T Y  D    ++  N       +   
Sbjct: 834  IVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQL 893

Query: 361  KDYHIVNSFLFKGDVLCVPHTSLREAIIKETHSRGLAGHFGRDKTLATIISKFFWPQLNR 420
            KD  ++NS   K  +L    T L   IIK+ H  G   H G +     I+ +F W  + +
Sbjct: 894  KDGLLINS---KDQILLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRK 953

Query: 421  NVTNFIKRCSICQTAKG-NSQNTGLYTPLPISSTIWEDLSMDFVLGLPRTQRGHDSIFVV 480
             +  +++ C  CQ  K  N +  G   P+P S   WE LSMDF+  LP +  G++++FVV
Sbjct: 954  QIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESS-GYNALFVV 1013

Query: 481  VNRFSKMAHFIPCKKTSDALNIANLFFREIVRLHGIPKTIVSDRDVKFLSHFWRSLCKKF 540
            V+RFSKMA  +PC K+  A   A +F + ++   G PK I++D D  F S  W+    K+
Sbjct: 1014 VDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKY 1073

Query: 541  DTNLLFSTASHPQTDGQTEVTNRTLGNLIRCLSGDKPKQW 549
            +  + FS    PQTDGQTE TN+T+  L+RC+    P  W
Sbjct: 1074 NFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHPNTW 1107

BLAST of CsaV3_1G014410 vs. TrEMBL
Match: tr|M5WCC7|M5WCC7_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_ppa017790mg PE=4 SV=1)

HSP 1 Score: 738.8 bits (1906), Expect = 9.0e-210
Identity = 336/554 (60.65%), Positives = 430/554 (77.62%), Query Frame = 0

Query: 1    MPFGLSNAPSTFMRLMNQILQPFLNQFIVVYFDDILIYSSSREDHLKHIQLLFTALQENE 60
            MPFGLSN PSTFMRLMNQ+L+PF+  F+VVYFDDILIYS+++E+HL H++ +   L+EN+
Sbjct: 748  MPFGLSNTPSTFMRLMNQVLRPFIGSFVVVYFDDILIYSTTKEEHLVHLRQVLDVLRENK 807

Query: 61   LQINLKKCEFLCYSIHFLEFIISCDGISVDPKKIDSISSWPQPKTPKDIQCFLGLASFYR 120
            L +NLKKC F    + FL F++   GI VD +KI +I  WP PKT  +++ F GLA+FYR
Sbjct: 808  LFVNLKKCTFCTNKLLFLGFVVGEHGIQVDDEKIKAILDWPAPKTVSEVRSFHGLATFYR 867

Query: 121  RFIKNFSTIAAPLTNCLKKGSFQWGQVEEDSFCKLKLALASPPVLKLPNFDKPFEVTVDA 180
            RF+++FS+I AP+T CLKKG F WG+ +E SF  +K  L + PVL LPNF+K FEV  DA
Sbjct: 868  RFVRHFSSIVAPITECLKKGRFSWGEEQERSFADIKEKLCTAPVLALPNFEKVFEVECDA 927

Query: 181  SGLGIGVVLSQEGHPLEYFSEKLSTYRQNWSTYEQELYALVRALKQWEHYLLANTFILLT 240
            SG+G+G VLSQ+  P+ +FSEKLS  RQ WSTY+QE YA+VRALKQWEHYL+   F+L T
Sbjct: 928  SGVGVGAVLSQDKRPVAFFSEKLSDARQKWSTYDQEFYAVVRALKQWEHYLIQKEFVLFT 987

Query: 241  DHFSLKFLNSQKTISRMHARWLQFLQRFDFVIKHTSGKSNKVADALSRKGILLTTLQSQI 300
            DH +LK++NSQK I +MHARW+ FLQ+F FVIKHTSGK+N+VADALSR+  LL TL  ++
Sbjct: 988  DHQALKYINSQKNIDKMHARWVTFLQKFSFVIKHTSGKTNRVADALSRRASLLITLTQEV 1047

Query: 301  IAFDHLHTLYPTDIDFRSIWENCSNHKPCKDYHIVNSFLFKGDVLCVPHTSLREAIIKET 360
            + F+ L  LY  D DF  IW  C+N +P  DY +   +LFKG+ LC+P +SLRE +I++ 
Sbjct: 1048 VGFECLKELYEGDADFGEIWTKCTNQEPMADYFLNEGYLFKGNQLCIPVSSLREKLIRDL 1107

Query: 361  HSRGLAGHFGRDKTLATIISKFFWPQLNRNVTNFIKRCSICQTAKGNSQNTGLYTPLPIS 420
            H  GL+GH GRDKT+A +  +F+WPQL R+V   +++C  CQT+KG  QNTGLY PLP+ 
Sbjct: 1108 HGGGLSGHLGRDKTIAGMEERFYWPQLKRDVGTIVRKCYTCQTSKGQVQNTGLYMPLPVP 1167

Query: 421  STIWEDLSMDFVLGLPRTQRGHDSIFVVVNRFSKMAHFIPCKKTSDALNIANLFFREIVR 480
            + IW+DL+MDFVLGLPRTQRG DS+FVVV+RFSKMAHFI C+KT+DA NIA LFFRE+VR
Sbjct: 1168 NDIWQDLAMDFVLGLPRTQRGVDSVFVVVDRFSKMAHFIACRKTADASNIAKLFFREVVR 1227

Query: 481  LHGIPKTIVSDRDVKFLSHFWRSLCKKFDTNLLFSTASHPQTDGQTEVTNRTLGNLIRCL 540
            LHG+P +I SDRD KFLSHFW +L + F T L  S+ +HPQTDGQTEVTNRTLGN++R +
Sbjct: 1228 LHGVPTSITSDRDTKFLSHFWITLWRLFGTTLNRSSTAHPQTDGQTEVTNRTLGNMVRSV 1287

Query: 541  SGDKPKQWDLALPQ 555
             G+KPKQWD ALPQ
Sbjct: 1288 CGEKPKQWDYALPQ 1301

BLAST of CsaV3_1G014410 vs. TrEMBL
Match: tr|M5W531|M5W531_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_ppa026856mg PE=4 SV=1)

HSP 1 Score: 736.9 bits (1901), Expect = 3.4e-209
Identity = 335/554 (60.47%), Positives = 429/554 (77.44%), Query Frame = 0

Query: 1    MPFGLSNAPSTFMRLMNQILQPFLNQFIVVYFDDILIYSSSREDHLKHIQLLFTALQENE 60
            MPFGLSNAPSTFMRLMNQ+L+PF+  F+VVYFDDILIYS+++E+HL H++ +   L+EN+
Sbjct: 756  MPFGLSNAPSTFMRLMNQVLRPFIGSFVVVYFDDILIYSTTKEEHLVHLRQVLDVLRENK 815

Query: 61   LQINLKKCEFLCYSIHFLEFIISCDGISVDPKKIDSISSWPQPKTPKDIQCFLGLASFYR 120
            L +NLKKC F    + FL F++  +GI VD +KI +I  WP PKT  +++ F GLA+FY 
Sbjct: 816  LYVNLKKCTFCTNKLLFLGFVVGENGIQVDDEKIKAILDWPAPKTVSEVRSFHGLATFYM 875

Query: 121  RFIKNFSTIAAPLTNCLKKGSFQWGQVEEDSFCKLKLALASPPVLKLPNFDKPFEVTVDA 180
            RF+++FS+IAAP+T CLKKG F WG+ +E SF  +K  L + PVL LPNF+K FEV  DA
Sbjct: 876  RFVRHFSSIAAPITECLKKGRFSWGEEQERSFADIKEKLCTAPVLALPNFEKVFEVECDA 935

Query: 181  SGLGIGVVLSQEGHPLEYFSEKLSTYRQNWSTYEQELYALVRALKQWEHYLLANTFILLT 240
            SG+G+G VL Q+  P+ +FSEKLS  RQ WSTY+QE YA+VRALKQWEHYL+   F+L T
Sbjct: 936  SGVGVGAVLLQDKRPVAFFSEKLSDARQKWSTYDQEFYAVVRALKQWEHYLIQKEFVLFT 995

Query: 241  DHFSLKFLNSQKTISRMHARWLQFLQRFDFVIKHTSGKSNKVADALSRKGILLTTLQSQI 300
            DH +LK++NSQK I +MHARW+ FLQ+F FVIKHTSGK+N+VADALSR+  LL TL  ++
Sbjct: 996  DHQALKYINSQKNIDKMHARWVTFLQKFSFVIKHTSGKTNRVADALSRRASLLITLTQEV 1055

Query: 301  IAFDHLHTLYPTDIDFRSIWENCSNHKPCKDYHIVNSFLFKGDVLCVPHTSLREAIIKET 360
            + F+ L  LY  D DFR IW  C+N +P  DY +   +LFKG+ LC+P +SLRE +I++ 
Sbjct: 1056 VGFECLKELYEGDDDFREIWTKCTNQEPMTDYFLTEGYLFKGNQLCIPVSSLREKLIRDL 1115

Query: 361  HSRGLAGHFGRDKTLATIISKFFWPQLNRNVTNFIKRCSICQTAKGNSQNTGLYTPLPIS 420
            H  GL+GH GRDKT+A +  +F+WPQL R+V   +++C  CQT+KG  QNTGLY PLP+ 
Sbjct: 1116 HGGGLSGHLGRDKTIAGMEERFYWPQLKRDVGTIVRKCYTCQTSKGQVQNTGLYMPLPVP 1175

Query: 421  STIWEDLSMDFVLGLPRTQRGHDSIFVVVNRFSKMAHFIPCKKTSDALNIANLFFREIVR 480
            + IW+DL+MDFVLG PRTQR  DS+FVV +RFSKMAHFI CKKT+DA NIA LFFRE+VR
Sbjct: 1176 NDIWQDLAMDFVLGFPRTQRRVDSVFVVADRFSKMAHFIACKKTADASNIAKLFFREVVR 1235

Query: 481  LHGIPKTIVSDRDVKFLSHFWRSLCKKFDTNLLFSTASHPQTDGQTEVTNRTLGNLIRCL 540
            LHG+P +I SDRD KFLSHFW +L + F T L  S+ +HPQTDGQTEVTNRTLGN++R +
Sbjct: 1236 LHGVPTSITSDRDTKFLSHFWITLWRLFGTTLNRSSTAHPQTDGQTEVTNRTLGNMVRSV 1295

Query: 541  SGDKPKQWDLALPQ 555
             G+KPKQWD ALPQ
Sbjct: 1296 CGEKPKQWDYALPQ 1309

BLAST of CsaV3_1G014410 vs. TrEMBL
Match: tr|A0A2I0W8A8|A0A2I0W8A8_9ASPA (RNA-directed DNA polymerase OS=Dendrobium catenatum OX=906689 GN=MA16_Dca026621 PE=4 SV=1)

HSP 1 Score: 729.2 bits (1881), Expect = 7.1e-207
Identity = 340/555 (61.26%), Positives = 425/555 (76.58%), Query Frame = 0

Query: 1    MPFGLSNAPSTFMRLMNQILQPFLNQFIVVYFDDILIYSSSREDHLKHIQLLFTALQENE 60
            MPFGL NAPSTFMRLM+++LQPF  +F V YFDDIL+YS+S EDH+ H+  LF  L+ ++
Sbjct: 789  MPFGLCNAPSTFMRLMSEVLQPFAGKFCVSYFDDILVYSNSLEDHILHLTRLFQTLRNSK 848

Query: 61   LQINLKKCEFLCYSIHFLEFIISCDGISVDPKKIDSISSWPQPKTPKDIQCFLGLASFYR 120
            L +NL KCEF    ++FL F++S +GI VDP+K+ ++  WP PK+  DI+ F GLA+FYR
Sbjct: 849  LYLNLPKCEFATTQVYFLGFVVSREGIQVDPRKVSAVREWPVPKSLSDIRSFHGLANFYR 908

Query: 121  RFIKNFSTIAAPLTNCLKKGSFQWGQVEEDSFCKLKLALASPPVLKLPNFDKPFEVTVDA 180
            RFI+ FS I AP+T+ LK  SF W   ++ SF  +K AL+S P+L LPNF+KPF+V  DA
Sbjct: 909  RFIRGFSLIMAPITDVLKGHSFIWSTTQQQSFENIKKALSSAPILALPNFEKPFQVDTDA 968

Query: 181  SGLGIGVVLSQEGHPLEYFSEKLSTYRQNWSTYEQELYALVRALKQWEHYLLANTFILLT 240
            SG+GIG VL QE  PLEYFSEKLST RQ W+ YEQELYA+VRALKQWEHYLL   F+L +
Sbjct: 969  SGIGIGAVLYQEDRPLEYFSEKLSTSRQKWTVYEQELYAVVRALKQWEHYLLHQDFVLCS 1028

Query: 241  DHFSLKFLNSQKTISRMHARWLQFLQRFDFVIKHTSGKSNKVADALSRKGILLTTLQSQI 300
            DH SL++LNSQKTI+RMHARW+ FLQRF FVI+H +GK N+VADALSR+  LL  LQ+++
Sbjct: 1029 DHQSLQYLNSQKTINRMHARWVIFLQRFSFVIRHKTGKMNRVADALSRRAALLVQLQTEV 1088

Query: 301  IAFDHLHTLYPTDIDFRSIWENCSNHKPCKDYHIVNSFLFKGDVLCVPHTSLREAIIKET 360
               + + +LY TD DF   W +C+  KP  D+ + + FLFKG++LCVP +S R  +I+E 
Sbjct: 1089 TGLEEMKSLYDTDEDFAIPWASCNAGKPEADFSLRHGFLFKGNLLCVPASSWRHQLIREI 1148

Query: 361  HSRGLAGHFGRDKTLATIISKFFWPQLNRNVTNFIKRCSICQTAKGNSQNTGLYTPLPIS 420
            H  GLA H GRDKT+  +  +FFWP L R+VT  I+RC+ CQT KG +QNTGLY PLP+ 
Sbjct: 1149 HCNGLAAHVGRDKTVQQLQMRFFWPHLKRDVTRLIERCATCQTYKGTAQNTGLYLPLPVP 1208

Query: 421  STIWEDLSMDFVLGLPRTQRGHDSIFVVVNRFSKMAHFIPCKKTSDALNIANLFFREIVR 480
             +IWEDLS+DFVLGLPRT+RG DSI VVV+RFSKMAHFIPCKKT DALNIA LFF+EIVR
Sbjct: 1209 DSIWEDLSLDFVLGLPRTKRGSDSIMVVVDRFSKMAHFIPCKKTFDALNIAKLFFKEIVR 1268

Query: 481  LHGIPKTIVSDRDVKFLSHFWRSLCKKFDTNLLFSTASHPQTDGQTEVTNRTLGNLIRCL 540
            LHGIP+++ SDRDVKF+SHFWR L KKF T+L  S+  HPQTDGQTEV NRTL  ++RCL
Sbjct: 1269 LHGIPRSLTSDRDVKFISHFWRELWKKFQTDLKLSSTYHPQTDGQTEVVNRTLAAMLRCL 1328

Query: 541  SGDKPKQWDLALPQA 556
              D PK W+  L QA
Sbjct: 1329 VQDNPKHWEEVLGQA 1343

BLAST of CsaV3_1G014410 vs. TrEMBL
Match: tr|A0A2U1P6A2|A0A2U1P6A2_ARTAN (Transposon Ty3-I Gag-Pol polyprotein OS=Artemisia annua OX=35608 GN=CTI12_AA189480 PE=4 SV=1)

HSP 1 Score: 723.0 bits (1865), Expect = 5.1e-205
Identity = 326/555 (58.74%), Positives = 432/555 (77.84%), Query Frame = 0

Query: 1    MPFGLSNAPSTFMRLMNQILQPFLNQFIVVYFDDILIYSSSREDHLKHIQLLFTALQENE 60
            MPFGLSNAPSTFMRLM Q+L+PF+ +F+VVYFDDIL+YS + ++HL H++ +  AL ENE
Sbjct: 786  MPFGLSNAPSTFMRLMTQVLRPFMGKFVVVYFDDILVYSQTEKEHLDHLRKVLKALTENE 845

Query: 61   LQINLKKCEFLCYSIHFLEFIISCDGISVDPKKIDSISSWPQPKTPKDIQCFLGLASFYR 120
            L +NLKKC FL   + FL +I+S DGI VD  K+ ++  WP PKT  +++ F GLA+FYR
Sbjct: 846  LFVNLKKCTFLTNKLLFLGYIVSSDGIHVDEDKVKAVRDWPSPKTLTEVRSFHGLATFYR 905

Query: 121  RFIKNFSTIAAPLTNCLKKGSFQWGQVEEDSFCKLKLALASPPVLKLPNFDKPFEVTVDA 180
            RF++NFS+I AP+TNC+KKG F+W Q  E+SF  +K  L + PVL LPNFD  FE+  DA
Sbjct: 906  RFVRNFSSIVAPITNCMKKGPFKWTQEAEESFKIIKERLTTAPVLSLPNFDNVFELECDA 965

Query: 181  SGLGIGVVLSQEGHPLEYFSEKLSTYRQNWSTYEQELYALVRALKQWEHYLLANTFILLT 240
             G GIG VLSQEG P+ + SEKL+  RQ WSTYEQELYA+V+A+K+WEHYL+   F++ +
Sbjct: 966  CGTGIGAVLSQEGRPVAFHSEKLNEARQKWSTYEQELYAVVQAMKKWEHYLIQREFVVYS 1025

Query: 241  DHFSLKFLNSQKTISRMHARWLQFLQRFDFVIKHTSGKSNKVADALSRKGILLTTLQSQI 300
            DH +LK+  +Q+ ++++HARW  FL++F++VIKH SG SNKVADALSRK  LL T+ + +
Sbjct: 1026 DHQALKYFQTQRHLNKIHARWASFLEKFNYVIKHKSGASNKVADALSRKTTLLVTISNDV 1085

Query: 301  IAFDHLHTLYPTDIDFRSIWENCSNHKPCKDYHIVNSFLFKGDVLCVPHTSLREAIIKET 360
            + F+ +  LY  D DFRS WE     +   ++ +++ +LFKG+ LC+P TSLR  +IKE 
Sbjct: 1086 VGFESIKGLYENDEDFRSTWEEIETKQHRGEFLLLDGYLFKGNRLCIPKTSLRSQLIKEV 1145

Query: 361  HSRGLAGHFGRDKTLATIISKFFWPQLNRNVTNFIKRCSICQTAKGNSQNTGLYTPLPIS 420
            H+ GL+ H GRDKT+A++ S+F+WPQL R+V +F++RC +CQ  KG +QNTGLY PLP+ 
Sbjct: 1146 HAGGLSAHLGRDKTIASMESRFYWPQLKRDVGSFVRRCVVCQEGKGKAQNTGLYMPLPVP 1205

Query: 421  STIWEDLSMDFVLGLPRTQRGHDSIFVVVNRFSKMAHFIPCKKTSDALNIANLFFREIVR 480
             + W D+SMDFVLGLPRTQRG DS+FVVV+RFSKMAHFIPCKKTSDA +IA LFF+E+VR
Sbjct: 1206 ESPWVDISMDFVLGLPRTQRGVDSVFVVVDRFSKMAHFIPCKKTSDAAHIARLFFQEVVR 1265

Query: 481  LHGIPKTIVSDRDVKFLSHFWRSLCKKFDTNLLFSTASHPQTDGQTEVTNRTLGNLIRCL 540
            LHG+PK+I SDRD KFL+HFW +L ++  T+L FS+ +HPQTDGQTEV NRTLGN+IRCL
Sbjct: 1266 LHGVPKSITSDRDSKFLAHFWLTLWRRLGTSLNFSSTAHPQTDGQTEVVNRTLGNMIRCL 1325

Query: 541  SGDKPKQWDLALPQA 556
             G+KPK WD++L QA
Sbjct: 1326 CGEKPKLWDVSLAQA 1340

BLAST of CsaV3_1G014410 vs. TrEMBL
Match: tr|A0A2I0W9V4|A0A2I0W9V4_9ASPA (RNA-directed DNA polymerase OS=Dendrobium catenatum OX=906689 GN=MA16_Dca017929 PE=4 SV=1)

HSP 1 Score: 721.8 bits (1862), Expect = 1.1e-204
Identity = 342/557 (61.40%), Positives = 425/557 (76.30%), Query Frame = 0

Query: 1   MPFGLSNAPSTFMRLMNQILQPFLNQFIVVYFDDILIYSSSREDHLKHIQLLFTALQENE 60
           MPFGL NAP+TFMRLMN+IL+  L ++ VVYFDDILIYS S E+H  H+  +   LQE++
Sbjct: 276 MPFGLCNAPATFMRLMNEILKSVLGKYCVVYFDDILIYSQSIEEHRVHLSNVLAILQEHK 335

Query: 61  LQINLKKCEFLCYSIHFLEFIISCDGISVDPKKIDSISSWPQPKTPKDIQCFLGLASFYR 120
           L INL KCEF   ++HFL FIIS  G+  DP+KI +I+ WP P++  +++ F+GLA+FYR
Sbjct: 336 LYINLPKCEFATGTVHFLGFIISKQGVHTDPQKIQAITEWPAPRSLTEVRSFIGLANFYR 395

Query: 121 RFIKNFSTIAAPLTNCLKKGSFQWGQVEEDSFCKLKLALASPPVLKLPNFDKPFEVTVDA 180
           RFI+ FS I AP+TNCLK  SFQW   +E SF  LK AL S PVL +P+F+KPF V  DA
Sbjct: 396 RFIRGFSIIVAPITNCLKGKSFQWTAEQERSFSTLKAALTSAPVLAVPDFNKPFHVDTDA 455

Query: 181 SGLGIGVVLSQEGHPLEYFSEKLSTYRQNWSTYEQELYALVRALKQWEHYLLANTFILLT 240
           S +G+G VLSQE  P+E+FSEKLS  RQNWS YEQELYA+VRALKQWEHYLL   F+L +
Sbjct: 456 SAVGVGAVLSQEDKPIEFFSEKLSPARQNWSAYEQELYAVVRALKQWEHYLLHQDFVLRS 515

Query: 241 DHFSLKFLNSQKTISRMHARWLQFLQRFDFVIKHTSGKSNKVADALSRKGILLTTLQSQI 300
           D+ +L+F+NSQKTI++MHARWL FLQRF FV++H SG  N+VADALSR+  LLT LQ   
Sbjct: 516 DNHALQFINSQKTINKMHARWLTFLQRFSFVLRHKSGAQNRVADALSRRTALLTKLQVDA 575

Query: 301 IAFDHLHTLYPTDIDFRSIWENCSNH--KPCKDYHIVNSFLFKGDVLCVPHTSLREAIIK 360
                L  LY TD DF   W   +NH  +PC+D+ I ++FLFK ++LCVP +S RE +IK
Sbjct: 576 PGLQALQELYATDHDFADPWNQLTNHPSRPCRDFSIRHNFLFKDNLLCVPASSWREHLIK 635

Query: 361 ETHSRGLAGHFGRDKTLATIISKFFWPQLNRNVTNFIKRCSICQTAKGNSQNTGLYTPLP 420
           E HS GL+ H GR KTL  + S+F+WP L R+V  F++RC ICQ  KGN+QNTGLYTPLP
Sbjct: 636 ELHSGGLSAHVGRTKTLEQMQSRFYWPHLRRDVVRFVERCPICQLYKGNAQNTGLYTPLP 695

Query: 421 ISSTIWEDLSMDFVLGLPRTQRGHDSIFVVVNRFSKMAHFIPCKKTSDALNIANLFFREI 480
           +  +IWEDLSMDF+LGLPRT+RG DSI VVV+RFSKMAHF+ CKKT DALN+A LFF+EI
Sbjct: 696 VLQSIWEDLSMDFILGLPRTKRGSDSIMVVVDRFSKMAHFLACKKTFDALNVAILFFKEI 755

Query: 481 VRLHGIPKTIVSDRDVKFLSHFWRSLCKKFDTNLLFSTASHPQTDGQTEVTNRTLGNLIR 540
           VRLHGIP++I SDRDVKF+SHFWR L K+  T +  S+A HPQTDGQTEV NRTL N++R
Sbjct: 756 VRLHGIPRSITSDRDVKFVSHFWRELWKRLRTEIKLSSAYHPQTDGQTEVVNRTLSNMLR 815

Query: 541 CLSGDKPKQWDLALPQA 556
           CL+ + PKQWD  + QA
Sbjct: 816 CLAHENPKQWDDYIGQA 832
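
The percentages in each HSP summary line follow directly from the counts next to them: matched columns divided by alignment length. The sketch below is a minimal illustration, not part of the database's own tooling. It parses one summary line in the format used above and recomputes both percentages. The function names `parse_hsp_summary` and `recompute` are hypothetical, and the field label is matched loosely so the parser does not depend on how the label is spelled.

```python
import re

# One "Label = matched/length (percent%)" field of an HSP summary line.
# "Query Frame = 0" has no fraction, so it is skipped by this pattern.
FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_summary(line):
    """Return {label: (matched, aligned_length, reported_percent)}."""
    return {
        label: (int(n), int(total), float(pct))
        for label, n, total, pct in FIELD.findall(line)
    }


def recompute(matched, total):
    """Percentage of aligned columns, rounded to two decimals like the report."""
    return round(100.0 * matched / total, 2)


# Counts taken from the last TrEMBL hit above (Dendrobium catenatum).
line = "Identity = 342/557 (61.40%), Positives = 425/557 (76.30%), Query Frame = 0"
fields = parse_hsp_summary(line)

for label, (matched, total, reported) in fields.items():
    print(label, matched, total, reported, recompute(matched, total))
```

The same check can be run on any of the summary lines above to confirm that the reported identity matches the counts.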

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_025979678.1	3.2e-211	61.55	uncharacterized protein LOC112997809 [Glycine max]
PKU71894.1	1.1e-206	61.26	RNA-directed DNA polymerase [Dendrobium catenatum]
PWA81295.1	7.7e-205	58.74	transposon Ty3-I Gag-Pol polyprotein [Artemisia annua]
PKU72440.1	1.7e-204	61.40	RNA-directed DNA polymerase [Dendrobium catenatum]
PKU81400.1	2.5e-203	59.28	RNA-directed DNA polymerase [Dendrobium catenatum]

Match Name	E-value	Identity	Description
ATMG00860.1	6.2e-23	37.78	DNA/RNA polymerases superfamily protein

Match Name	E-value	Identity	Description
sp|Q99315|YG31B_YEAST	2.6e-95	36.86	Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...
sp|Q7LHG5|YI31B_YEAST	4.4e-95	36.86	Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...
sp|P0CT41|TF212_SCHPO	6.0e-92	34.66	Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24...
sp|P0CT34|TF21_SCHPO	6.0e-92	34.66	Transposon Tf2-1 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
sp|P0CT35|TF22_SCHPO	6.0e-92	34.66	Transposon Tf2-2 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...

Match Name	E-value	Identity	Description
tr|M5WCC7|M5WCC7_PRUPE	9.0e-210	60.65	Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_ppa017790mg PE=4 SV=1
tr|M5W531|M5W531_PRUPE	3.4e-209	60.47	Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_ppa026856mg PE=4 SV=1
tr|A0A2I0W8A8|A0A2I0W8A8_9ASPA	7.1e-207	61.26	RNA-directed DNA polymerase OS=Dendrobium catenatum OX=906689 GN=MA16_Dca026621 ...
tr|A0A2U1P6A2|A0A2U1P6A2_ARTAN	5.1e-205	58.74	Transposon Ty3-I Gag-Pol polyprotein OS=Artemisia annua OX=35608 GN=CTI12_AA1894...
tr|A0A2I0W9V4|A0A2I0W9V4_9ASPA	1.1e-204	61.40	RNA-directed DNA polymerase OS=Dendrobium catenatum OX=906689 GN=MA16_Dca017929 ...

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term	Definition
GO:0003676	nucleic acid binding

Vocabulary: Biological Process
Term	Definition
GO:0015074	DNA integration

Vocabulary: INTERPRO
Term	Definition
IPR012337	RNaseH-like_sf
IPR036397	RNaseH_sf
IPR000477	RT_dom
IPR001584	Integrase_cat-core

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
biological_process GO:0044238 primary metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003676 nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CsaV3_1G014410.1	CsaV3_1G014410.1	mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR001584	Integrase, catalytic core	PFAM	PF00665	rve	coord: 427..535; e-value: 1.4E-13; score: 51.0
IPR001584	Integrase, catalytic core	PROSITE	PS50994	INTEGRASE	coord: 414..555; score: 16.335
None	No IPR available	GENE3D	G3DSA:1.10.340.70		coord: 327..405; e-value: 3.8E-15; score: 57.8
None	No IPR available	GENE3D	G3DSA:3.30.70.270		coord: 1..89; e-value: 5.8E-28; score: 99.6
None	No IPR available	GENE3D	G3DSA:3.10.20.370		coord: 166..275; e-value: 1.5E-66; score: 225.8
None	No IPR available	GENE3D	G3DSA:3.30.70.270		coord: 96..165; e-value: 1.5E-66; score: 225.8 / coord: 276..288; e-value: 1.5E-66; score: 225.8
None	No IPR available	PANTHER	PTHR24559:SF247	SUBFAMILY NOT NAMED	coord: 1..553
None	No IPR available	PANTHER	PTHR24559	FAMILY NOT NAMED	coord: 1..553
None	No IPR available	CDD	cd09274	RNase_HI_RT_Ty3	coord: 175..289; e-value: 2.77092E-48; score: 163.818
None	No IPR available	CDD	cd01647	RT_LTR	coord: 1..82; e-value: 1.18144E-32; score: 122.319
None	No IPR available	SUPERFAMILY	SSF56672	DNA/RNA polymerases	coord: 1..274
IPR000477	Reverse transcriptase domain	PFAM	PF00078	RVT_1	coord: 1..78; e-value: 1.0E-18; score: 67.6
IPR000477	Reverse transcriptase domain	PROSITE	PS50878	RT_POL	coord: 1..82; score: 13.262
IPR036397	Ribonuclease H superfamily	GENE3D	G3DSA:3.30.420.10		coord: 416..555; e-value: 9.4E-44; score: 151.0
IPR012337	Ribonuclease H-like superfamily	SUPERFAMILY	SSF53098	Ribonuclease H-like	coord: 416..554
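
The coordinates in the alignment column can be used to lay the domain hits out along the protein. The sketch below is only an illustration under stated assumptions: it copies four of the hits listed above (two PFAM, two CDD), treats `coord` ranges as 1-based and inclusive (an assumption, since the page does not say), and uses hypothetical names (`Hit`, `overlaps`). It sorts the hits by position, reports their lengths, and tests pairs for overlap.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    source: str
    name: str
    start: int
    end: int

    @property
    def length(self):
        # Assumes inclusive coordinates, so 1..78 spans 78 residues.
        return self.end - self.start + 1


# Coordinates copied from the InterPro annotation table above.
HITS = [
    Hit("PFAM", "rve", 427, 535),
    Hit("PFAM", "RVT_1", 1, 78),
    Hit("CDD", "RNase_HI_RT_Ty3", 175, 289),
    Hit("CDD", "RT_LTR", 1, 82),
]


def overlaps(a, b):
    """True if the two inclusive ranges share at least one residue."""
    return a.start <= b.end and b.start <= a.end


# Order hits from N-terminus to C-terminus.
ordered = sorted(HITS, key=lambda h: (h.start, h.end))

for h in ordered:
    print(f"{h.start:>4}..{h.end:<4} {h.length:>4} aa  {h.source}:{h.name}")
```

With these four hits the order along the protein is reverse transcriptase, then RNase H, then integrase core, which is consistent with the Ty3-type polyprotein matches reported in the BLAST results.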

The following gene(s) are orthologous to this gene:
Gene	Orthologue	Organism	Block
CsaV3_1G014410	CmaCh12G007990	Cucurbita maxima (Rimu)	cmacucB0188
CsaV3_1G014410	ClCG01G014040	Watermelon (Charleston Gray)	cucwcgB016
The following gene(s) are paralogous to this gene:

None