CSPI07G08510 (gene) Cucumber (PI 183967) v1

Overview
NameCSPI07G08510
Typegene
OrganismCucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
LocationChr7: 6330788 .. 6334401 (-)
RNA-Seq ExpressionCSPI07G08510
SyntenyCSPI07G08510
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGACATAAAGCTTATTCTTGTTACCTATCTAGATCTAGTGCCTGTAATGTCTATGAAAAAATGAAATGGATCCCTAAATATGTAAATGCTAACATTCTAGGACCCAAACAAGTATGGGTACCAAAGGATCAAACTTGAAATTAGTTGTTTTTAGGTTTGTTTGAAAGCCTCCAAGAAAAACAAATGGTACTTGGATAGTGGTTGCTCAAGACACATGACGGGAGACCGATCCAAGCTTATCTCTTTCTCCAAAAAGAATGGAGGCATGGTAACCTTTGGTGACAACAAGAAAGGTAAAATAATTGGTAAGGGTAATATAGGAAATGATTCATCTACTTTGATTGAAAATGTTCATTTGGTTGATGGTTTAAAGCATGATTTGCTTAGTATTAGTCAATTGTGTGATAAAGGATTTAGAGTAATATTTGATAAAAAAAAATTGCATAATTGAAAATGTTAGTGATAGAAAAGTTTTGTTTGTTGGAAATAGGGATGAAAATGTGTACACTCTTGATTTGAATGATTATCCTATTATTGATAAATGTCTTTCGGTTTTGCATAATGATTCTTGGTTATGGCATAGAAGACTAGGACATGCTAGCATGCACTTAATTTCAAATATTTCTAAAAATTGTTTGGTTAGAGGTCTTCCTAGTTTTAAATTTGAAAAAGACAAAGTTTGTGATGCTTGTCAAATGGGTAAGCAAACTAAGTCCTCTTTCAAATCTAAAAATGTGATCTCTACTACTAGACCCTTACAACTATTACACATGGACTTATTTGGCCCTTCTAGAATTGCTAGTTATGGAGGAAATTATTATGCTTTTGTGATAGTTGATGATTTTTCTAGATTTACTTGGGTTTTGATGATAAAACATAAGGATGATGCTTTGAAAAGTTTTATTAGTTTTGCAAAAAGAGTACAAAATGAAAAAGGATTTTTTATTTCCAAAATTAGGAGTGATCACGGAGGAGAATTTGATAATGATGCTTTTAAAGATTTTTGTCAAGAAAATGGTTTTTCCCATCATTTTTTCTCTCCAAGAACTCCTCAACAAAATGGTGTGGTTGAAAGGAAAAATCGTACTTTGCAAGAATTTGCTAGATCACGGAAATAGGGATGAAAATGTGTACACTCTTGATTTGAATTATTATCCTATTATTGATAAATGTCTTTCGGTTTTCCATGATGATTCTTGGTTATGGCATAGAAGACTAGGACATGCTAGCATGCACTTAATTTCAAATATTTCTAAAAATTGTTTGGTTAGAGGTCTTCCTAGTTTTAAATTTGAAAAAGACAAAGTTTGTGATGCTTGTCAAATGGGTAAGCAAACTAAGTCCTCTTTCAAATCTAAAAATGTGATCTCTACTACTAGACCCTTACAACTATTACACATGGACTTATTTGGCCCTTCTAGAATTGCTAGTTATGGAGGAAATTATTATGCTTTTGTGATAGTTGATGATTTTTCTAGATTTACTTGGGTTTTGATGATAAAACATAAGGATGATGCTTTGAAAAGTTTTATTAGTTTTGCAAAAAGAGTACAAAATGAAAAAGGATTTTTTATTTCCAAAATTAGGAGTGATCACGGAGGAGAATTTGATAATGATGCTTTTAAAGCTTTTTGTGAAGAAAATGGTTTTTCCCATAATTTTTCCTCTCCAAGAACTCCTCAACAAAATGGTGTAGTTGAAAGGAAAAATCGTACTTTGCAAGAATTTGCTAGATCAATGTTGAATGAGTATGGTTTACCTAAATATTTTTGGACGGAAGCCGTTAACACCGCTTGCTATGTTTCAAATAGAGTTTTAGTTAGACCCTCTTTAGATAAAACTCCTTATGAACTTTGGCATGGAAAAATTCCAAATATTGGGTATTTCAAAGTTTTCGGTTGTAAATGTTTTATTTTGAATAACAAAGAAAAACTTGGAAAATTTGATTCTAAGACGGATGTTGGTATTTTTCTAGGATATTCCTCTACTAGTAAAGCTTATAGAGTTTTCAATAAGAAAACTTTAGTTATTGAGGAATCTATTCATGTTGTATTTGATGAATCTTGGAATAATGTTTCTAATGAGTCTATTTGTAGTGATGATTTAGAAAAAGATTTTGGAGATTTACTTGTTAATGACAAAGGCAAAGAAATTGTTCCAAGTATGCAAGATGTGAACATCATAGAAAAGAAAGAAGAGGGTTCTTCATCCTTGCCTAAAGAGTGGAGATATGCTCTATCCCATCCCAAGGATCTAATTCTTGGCAATCCCGAACAAGGTGTCAAAACTCGTTCTTCTCTTAATTTATTTAGTAATCTTGCTTTTGTGTCTCAAATTGAACCTAGAAGTTTTAAAGATGCCGAATGTGATGAGTTTTGGATTTTAGCTATGCAAGAAGAATTAAATCAATTTGAAAGAAACAAAGTTTGGAAATTAGTCCCTAGGCCTTCTAATGCATCTATAATCGGAACTAAATGGGTTTTTAGAAATAAGATGGATGAAAATGGAAATATCATTAGAAATAAAGCTAGACTTGTAGCTCAAGGTTATTGTCAAGAAGAAGGTATAGATTATGAAGAGACTTTTGCACCGGTTGCTAGATTAGAAGCTATTAGAATGTTGCTTGCTTTTGCTTCTTATAAAAATTTCATTTTGTATCAAATGGATGTAAAAAGTGCTTTTTTAAACGGTTATATTGTAGAGGAAGTTTACGTAGAACAACCTCCGGGTTTTGAAAGTTTTGATTTACCTAATCATGTTTATAAGTTGAAAAAGGCTCTTTATGGCTTAAAACAAGCTCCAAGAGCTTGGTATGATAGACTTAGCAAGTTTTTACTTGAGAATGACTTTAAGATGGGAAAAATTGATAATACTCTATTTATTAAAGTTAAAAATAATGACATGCTTATAGTACAAATTTATGTGGATGATATTATATTTGGTTCTACTAATTCATCTTTGTGTGAAGAATTTTCCAAGTGTATGCATAATGAGTTTGAGATGAGTATGATGGGAGAACTTAGTTTCTTCCTTGGTCTTCAAATCAAACAACTCAAGGATGGCATCTTCATAAGTCAAGAAAAATACACAAGGGATTTGCTCAAGAAATTCAAATTAAATGAAGGTAAAGTTGCAAAAACTCCTATGAGCACTACCACTAAGCTTGACAAAGATGAAAAAGGTAAGTGTGTGGATATAAAGACTTATCGAGGTATGATCGGATCTTTACTTTATTTGACCGCTAGTAGACCCGATATCATGTTTAGTGTATGTCTTTGTGCTAGATTTCAATCTTGTCCTAAAGAATCACATTTCCATGCCGTTAAAAGGATACTTAAATATTTGCTTGGAACTATTGATGTTGGATTATGGTATCCTAGAAATGTTGAGTTTAATTTGGTAGGATATTCCGATGCGGACTTTGCCGGTAGTTTACTTGACCGTAAAAGTACTAGTGGGACTTGTCAATTTCTTGGTAGTTCCTTAGTATCTTGGTTTAGTAAAAAGCAAAATTCGGTTGCCTTATCCACTACCGAAGCGGAATATATTGCGGTTGCTAGTTGTTGTGCACAAATTCTTTGGATGAAACAACTCTTTGTGATTTTGGATTAA

mRNA sequence

ATGGACATAAAGCTTATTCTTGTTACCTATCTAGATCTAGTGCCTGTTTGTTTGAAAGCCTCCAAGAAAAACAAATGGTACTTGGATAGTGGTTGCTCAAGACACATGACGGGAGACCGATCCAAGCTTATCTCTTTCTCCAAAAAGAATGGAGGCATGGTAACCTTTGGTGACAACAAGAAAGGAGTGATCACGGAGGAGAATTTGATAATGATGCTTTTAAAGATTTTTGTCAAGAAAATGGTTTTTCCCATCATTTTTTCTCTCCAAGAACTCCTCAACAAAATGGTGTGGTTGAAAGGAAAAATCGTACTTTGCAAGAATTTGCTAGATCACGGAAATAGGGATGAAAATGTGTACACTCTTGATTTGAATTATTATCCTATTATTGATAAATGTCTTTCGGTTTTCCATGATGATTCTTGGTTATGGCATAGAAGACTAGGACATGCTAGCATGCACTTAATTTCAAATATTTCTAAAAATTGTTTGGTTAGAGGTCTTCCTAGTTTTAAATTTGAAAAAGACAAAGTTTGTGATGCTTGTCAAATGGGTAAGCAAACTAAGTCCTCTTTCAAATCTAAAAATGTGATCTCTACTACTAGACCCTTACAACTATTACACATGGACTTATTTGGCCCTTCTAGAATTGCTAGTTATGGAGGAAATTATTATGCTTTTGTGATAGTTGATGATTTTTCTAGATTTACTTGGGTTTTGATGATAAAACATAAGGATGATGCTTTGAAAAGTTTTATTAGTTTTGCAAAAAGAGTACAAAATGAAAAAGGATTTTTTATTTCCAAAATTAGGAGTGATCACGGAGGAGAATTTGATAATGATGCTTTTAAAGCTTTTTGTGAAGAAAATGGTTTTTCCCATAATTTTTCCTCTCCAAGAACTCCTCAACAAAATGGTGTAGTTGAAAGGAAAAATCGTACTTTGCAAGAATTTGCTAGATCAATGTTGAATGAGTATGGTTTACCTAAATATTTTTGGACGGAAGCCGTTAACACCGCTTGCTATGTTTCAAATAGAGTTTTAGTTAGACCCTCTTTAGATAAAACTCCTTATGAACTTTGGCATGGAAAAATTCCAAATATTGGGTATTTCAAAGTTTTCGGTTGTAAATGTTTTATTTTGAATAACAAAGAAAAACTTGGAAAATTTGATTCTAAGACGGATGTTGGTATTTTTCTAGGATATTCCTCTACTAGTAAAGCTTATAGAGTTTTCAATAAGAAAACTTTAGTTATTGAGGAATCTATTCATGTTGTATTTGATGAATCTTGGAATAATGTTTCTAATGAGTCTATTTGTAGTGATGATTTAGAAAAAGATTTTGGAGATTTACTTGTTAATGACAAAGGCAAAGAAATTGTTCCAAGTATGCAAGATGTGAACATCATAGAAAAGAAAGAAGAGGGTTCTTCATCCTTGCCTAAAGAGTGGAGATATGCTCTATCCCATCCCAAGGATCTAATTCTTGGCAATCCCGAACAAGGTGTCAAAACTCGTTCTTCTCTTAATTTATTTAGTAATCTTGCTTTTGTGTCTCAAATTGAACCTAGAAGTTTTAAAGATGCCGAATGTGATGAGTTTTGGATTTTAGCTATGCAAGAAGAATTAAATCAATTTGAAAGAAACAAAGTTTGGAAATTAGTCCCTAGGCCTTCTAATGCATCTATAATCGGAACTAAATGGGTTTTTAGAAATAAGATGGATGAAAATGGAAATATCATTAGAAATAAAGCTAGACTTGTAGCTCAAGGTTATTGTCAAGAAGAAGGTATAGATTATGAAGAGACTTTTGCACCGGTTGCTAGATTAGAAGCTATTAGAATGTTGCTTGCTTTTGCTTCTTATAAAAATTTCATTTTGTATCAAATGGATGTAAAAAGTGCTTTTTTAAACGGTTATATTGTAGAGGAAGTTTACGTAGAACAACCTCCGGGTTTTGAAAGTTTTGATTTACCTAATCATGTTTATAAGTTGAAAAAGGCTCTTTATGGCTTAAAACAAGCTCCAAGAGCTTGGTATGATAGACTTAGCAAGTTTTTACTTGAGAATGACTTTAAGATGGGAAAAATTGATAATACTCTATTTATTAAAGTTAAAAATAATGACATGCTTATAGTACAAATTTATGTGGATGATATTATATTTGGTTCTACTAATTCATCTTTGTGTGAAGAATTTTCCAAGTGTATGCATAATGAGTTTGAGATGAGTATGATGGGAGAACTTAGTTTCTTCCTTGGTCTTCAAATCAAACAACTCAAGGATGGCATCTTCATAAGTCAAGAAAAATACACAAGGGATTTGCTCAAGAAATTCAAATTAAATGAAGGTAAAGTTGCAAAAACTCCTATGAGCACTACCACTAAGCTTGACAAAGATGAAAAAGGTAAGTGTGTGGATATAAAGACTTATCGAGGTATGATCGGATCTTTACTTTATTTGACCGCTAGTAGACCCGATATCATGTTTAGTGTATGTCTTTGTGCTAGATTTCAATCTTGTCCTAAAGAATCACATTTCCATGCCGTTAAAAGGATACTTAAATATTTGCTTGGAACTATTGATGTTGGATTATGGTATCCTAGAAATGTTGAGTTTAATTTGGTAGGATATTCCGATGCGGACTTTGCCGGTAGTTTACTTGACCGTAAAAGTACTAGTGGGACTTGTCAATTTCTTGGTAGTTCCTTAGTATCTTGGTTTAGTAAAAAGCAAAATTCGGTTGCCTTATCCACTACCGAAGCGGAATATATTGCGGTTGCTAGTTGTTGTGCACAAATTCTTTGGATGAAACAACTCTTTGTGATTTTGGATTAA

Coding sequence (CDS)

ATGGACATAAAGCTTATTCTTGTTACCTATCTAGATCTAGTGCCTGTTTGTTTGAAAGCCTCCAAGAAAAACAAATGGTACTTGGATAGTGGTTGCTCAAGACACATGACGGGAGACCGATCCAAGCTTATCTCTTTCTCCAAAAAGAATGGAGGCATGGTAACCTTTGGTGACAACAAGAAAGGAGTGATCACGGAGGAGAATTTGATAATGATGCTTTTAAAGATTTTTGTCAAGAAAATGGTTTTTCCCATCATTTTTTCTCTCCAAGAACTCCTCAACAAAATGGTGTGGTTGAAAGGAAAAATCGTACTTTGCAAGAATTTGCTAGATCACGGAAATAGGGATGAAAATGTGTACACTCTTGATTTGAATTATTATCCTATTATTGATAAATGTCTTTCGGTTTTCCATGATGATTCTTGGTTATGGCATAGAAGACTAGGACATGCTAGCATGCACTTAATTTCAAATATTTCTAAAAATTGTTTGGTTAGAGGTCTTCCTAGTTTTAAATTTGAAAAAGACAAAGTTTGTGATGCTTGTCAAATGGGTAAGCAAACTAAGTCCTCTTTCAAATCTAAAAATGTGATCTCTACTACTAGACCCTTACAACTATTACACATGGACTTATTTGGCCCTTCTAGAATTGCTAGTTATGGAGGAAATTATTATGCTTTTGTGATAGTTGATGATTTTTCTAGATTTACTTGGGTTTTGATGATAAAACATAAGGATGATGCTTTGAAAAGTTTTATTAGTTTTGCAAAAAGAGTACAAAATGAAAAAGGATTTTTTATTTCCAAAATTAGGAGTGATCACGGAGGAGAATTTGATAATGATGCTTTTAAAGCTTTTTGTGAAGAAAATGGTTTTTCCCATAATTTTTCCTCTCCAAGAACTCCTCAACAAAATGGTGTAGTTGAAAGGAAAAATCGTACTTTGCAAGAATTTGCTAGATCAATGTTGAATGAGTATGGTTTACCTAAATATTTTTGGACGGAAGCCGTTAACACCGCTTGCTATGTTTCAAATAGAGTTTTAGTTAGACCCTCTTTAGATAAAACTCCTTATGAACTTTGGCATGGAAAAATTCCAAATATTGGGTATTTCAAAGTTTTCGGTTGTAAATGTTTTATTTTGAATAACAAAGAAAAACTTGGAAAATTTGATTCTAAGACGGATGTTGGTATTTTTCTAGGATATTCCTCTACTAGTAAAGCTTATAGAGTTTTCAATAAGAAAACTTTAGTTATTGAGGAATCTATTCATGTTGTATTTGATGAATCTTGGAATAATGTTTCTAATGAGTCTATTTGTAGTGATGATTTAGAAAAAGATTTTGGAGATTTACTTGTTAATGACAAAGGCAAAGAAATTGTTCCAAGTATGCAAGATGTGAACATCATAGAAAAGAAAGAAGAGGGTTCTTCATCCTTGCCTAAAGAGTGGAGATATGCTCTATCCCATCCCAAGGATCTAATTCTTGGCAATCCCGAACAAGGTGTCAAAACTCGTTCTTCTCTTAATTTATTTAGTAATCTTGCTTTTGTGTCTCAAATTGAACCTAGAAGTTTTAAAGATGCCGAATGTGATGAGTTTTGGATTTTAGCTATGCAAGAAGAATTAAATCAATTTGAAAGAAACAAAGTTTGGAAATTAGTCCCTAGGCCTTCTAATGCATCTATAATCGGAACTAAATGGGTTTTTAGAAATAAGATGGATGAAAATGGAAATATCATTAGAAATAAAGCTAGACTTGTAGCTCAAGGTTATTGTCAAGAAGAAGGTATAGATTATGAAGAGACTTTTGCACCGGTTGCTAGATTAGAAGCTATTAGAATGTTGCTTGCTTTTGCTTCTTATAAAAATTTCATTTTGTATCAAATGGATGTAAAAAGTGCTTTTTTAAACGGTTATATTGTAGAGGAAGTTTACGTAGAACAACCTCCGGGTTTTGAAAGTTTTGATTTACCTAATCATGTTTATAAGTTGAAAAAGGCTCTTTATGGCTTAAAACAAGCTCCAAGAGCTTGGTATGATAGACTTAGCAAGTTTTTACTTGAGAATGACTTTAAGATGGGAAAAATTGATAATACTCTATTTATTAAAGTTAAAAATAATGACATGCTTATAGTACAAATTTATGTGGATGATATTATATTTGGTTCTACTAATTCATCTTTGTGTGAAGAATTTTCCAAGTGTATGCATAATGAGTTTGAGATGAGTATGATGGGAGAACTTAGTTTCTTCCTTGGTCTTCAAATCAAACAACTCAAGGATGGCATCTTCATAAGTCAAGAAAAATACACAAGGGATTTGCTCAAGAAATTCAAATTAAATGAAGGTAAAGTTGCAAAAACTCCTATGAGCACTACCACTAAGCTTGACAAAGATGAAAAAGGTAAGTGTGTGGATATAAAGACTTATCGAGGTATGATCGGATCTTTACTTTATTTGACCGCTAGTAGACCCGATATCATGTTTAGTGTATGTCTTTGTGCTAGATTTCAATCTTGTCCTAAAGAATCACATTTCCATGCCGTTAAAAGGATACTTAAATATTTGCTTGGAACTATTGATGTTGGATTATGGTATCCTAGAAATGTTGAGTTTAATTTGGTAGGATATTCCGATGCGGACTTTGCCGGTAGTTTACTTGACCGTAAAAGTACTAGTGGGACTTGTCAATTTCTTGGTAGTTCCTTAGTATCTTGGTTTAGTAAAAAGCAAAATTCGGTTGCCTTATCCACTACCGAAGCGGAATATATTGCGGTTGCTAGTTGTTGTGCACAAATTCTTTGGATGAAACAACTCTTTGTGATTTTGGATTAA

Protein sequence

MDIKLILVTYLDLVPVCLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVITEENLIMMLLKIFVKKMVFPIIFSLQELLNKMVWLKGKIVLCKNLLDHGNRDENVYTLDLNYYPIIDKCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSLNLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQLFVILD*
Homology
BLAST of CSPI07G08510 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 445.3 bits (1144), Expect = 1.8e-123
Identity = 287/830 (34.58%), Postives = 433/830 (52.17%), Query Frame = 0

Query: 143  LWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTTR 202
            LWH+R+GH S   +  ++K  L+      K    K CD C  GKQ + SF++ +      
Sbjct: 424  LWHKRMGHMSEKGLQILAKKSLI---SYAKGTTVKPCDYCLFGKQHRVSFQTSSE-RKLN 483

Query: 203  PLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNE 262
             L L++ D+ GP  I S GGN Y    +DD SR  WV ++K KD   + F  F   V+ E
Sbjct: 484  ILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVERE 543

Query: 263  KGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSM 322
             G  + ++RSD+GGE+ +  F+ +C  +G  H  + P TPQ NGV ER NRT+ E  RSM
Sbjct: 544  TGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVRSM 603

Query: 323  LNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILN 382
            L    LPK FW EAV TACY+ NR    P   + P  +W  K  +  + KVFGC+ F   
Sbjct: 604  LRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFAHV 663

Query: 383  NKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSD 442
             KE+  K D K+   IF+GY      YR+++     +  S  VVF ES   V   +  S+
Sbjct: 664  PKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRES--EVRTAADMSE 723

Query: 443  DLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILG----- 502
             ++       V        P+  +    E  E+G    P E    +   + L  G     
Sbjct: 724  KVKNGIIPNFVTIPSTSNNPTSAESTTDEVSEQGEQ--PGE---VIEQGEQLDEGVEEVE 783

Query: 503  NPEQGVKTRSSL----------NLFSNLAFV---SQIEPRSFKDA----ECDEFWILAMQ 562
            +P QG +    L            + +  +V      EP S K+     E ++  + AMQ
Sbjct: 784  HPTQGEEQHQPLRRSERPRVESRRYPSTEYVLISDDREPESLKEVLSHPEKNQL-MKAMQ 843

Query: 563  EELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDY 622
            EE+   ++N  +KLV  P     +  KWVF+ K D +  ++R KARLV +G+ Q++GID+
Sbjct: 844  EEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDF 903

Query: 623  EETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPN 682
            +E F+PV ++ +IR +L+ A+  +  + Q+DVK+AFL+G + EE+Y+EQP GFE     +
Sbjct: 904  DEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKKH 963

Query: 683  HVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIK-VKNNDMLIVQIYVDD 742
             V KL K+LYGLKQAPR WY +   F+    +     D  ++ K    N+ +I+ +YVDD
Sbjct: 964  MVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDD 1023

Query: 743  IIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQI--KQLKDGIFISQEKYTRDLLK 802
            ++    +  L  +    +   F+M  +G     LG++I  ++    +++SQEKY   +L+
Sbjct: 1024 MLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLE 1083

Query: 803  KFKLNEGKVAKTPMSTTTKLDK-------DEKGKCVDIKTYRGMIGSLLY-LTASRPDIM 862
            +F +   K   TP++   KL K       +EKG    +  Y   +GSL+Y +  +RPDI 
Sbjct: 1084 RFNMKNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKV-PYSSAVGSLMYAMVCTRPDIA 1143

Query: 863  FSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLD 922
             +V + +RF   P + H+ AVK IL+YL GT    L +  +    L GY+DAD AG + +
Sbjct: 1144 HAVGVVSRFLENPGKEHWEAVKWILRYLRGTTGDCLCFGGSDPI-LKGYTDADMAGDIDN 1203

Query: 923  RKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQ 940
            RKS++G         +SW SK Q  VALSTTEAEYIA      +++W+K+
Sbjct: 1204 RKSSTGYLFTFSGGAISWQSKLQKCVALSTTEAEYIAATETGKEMIWLKR 1239

BLAST of CSPI07G08510 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 443.0 bits (1138), Expect = 8.7e-123
Identity = 332/1075 (30.88%), Postives = 498/1075 (46.33%), Query Frame = 0

Query: 24   NKWYLDSGCSRHMTGDRSKLISFSKKNGG---MVTFGD-------------------NKK 83
            N W LDSG + H+T D + L       GG   MV  G                    N  
Sbjct: 329  NNWLLDSGATHHITSDFNNLSLHQPYTGGDDVMVADGSTIPISHTGSTSLSTKSRPLNLH 388

Query: 84   GVITEENLIMMLLKIF-------VKKMVFPIIFSLQELLNKMVWLKGKIVLCKNLLDHGN 143
             ++   N+   L+ ++       V    FP  F +++L   +  L+GK            
Sbjct: 389  NILYVPNIHKNLISVYRLCNANGVSVEFFPASFQVKDLNTGVPLLQGK-----------T 448

Query: 144  RDENVYTLDLNYYPII-DKCLSVFHDDS-----WLWHRRLGHASMHLISNISKNCLVRGL 203
            +DE      L  +PI   + +S+F   S       WH RLGH +  +++++  N  +  L
Sbjct: 449  KDE------LYEWPIASSQPVSLFASPSSKATHSSWHARLGHPAPSILNSVISNYSLSVL 508

Query: 204  -PSFKFEKDKVCDACQMGKQTKSSFKSKNVISTTRPLQLLHMDLFGPSRIASYGGNYYAF 263
             PS KF     C  C + K  K  F S++ I++TRPL+ ++ D++  S I S+    Y  
Sbjct: 509  NPSHKFLS---CSDCLINKSNKVPF-SQSTINSTRPLEYIYSDVWS-SPILSHDNYRYYV 568

Query: 264  VIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFC 323
            + VD F+R+TW+  +K K    ++FI+F   ++N     I    SD+GGEF   A   + 
Sbjct: 569  IFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENRFQTRIGTFYSDNGGEF--VALWEYF 628

Query: 324  EENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRV 383
             ++G SH  S P TP+ NG+ ERK+R + E   ++L+   +PK +W  A   A Y+ NR 
Sbjct: 629  SQHGISHLTSPPHTPEHNGLSERKHRHIVETGLTLLSHASIPKTYWPYAFAVAVYLINR- 688

Query: 384  LVRPSLD-KTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTS 443
            L  P L  ++P++   G  PN    +VFGC C+         K D K+   +FLGYS T 
Sbjct: 689  LPTPLLQLESPFQKLFGTSPNYDKLRVFGCACYPWLRPYNQHKLDDKSRQCVFLGYSLTQ 748

Query: 444  KAYRVFNKKTLVIEESIHVVFDESWNNVSN-------------ESIC------------- 503
             AY   + +T  +  S HV FDE+    SN             ES C             
Sbjct: 749  SAYLCLHLQTSRLYISRHVRFDENCFPFSNYLATLSPVQEQRRESSCVWSPHTTLPTRTP 808

Query: 504  ------------------------------SDDLEKDFGDLL----------VNDKGKEI 563
                                          S +L+  F               N      
Sbjct: 809  VLPAPSCSDPHHAATPPSSPSAPFRNSQVSSSNLDSSFSSSFPSSPEPTAPRQNGPQPTT 868

Query: 564  VPSM----------------------QDVNIIEKKEEGSSSLPKEWRYALSH-------- 623
             P+                       Q    +    + SSS P     A S         
Sbjct: 869  QPTQTQTQTHSSQNTSQNNPTNESPSQLAQSLSTPAQSSSSSPSPTTSASSSSTSPTPPS 928

Query: 624  -------PKDLILGNPEQ------GVKTRSSLNLFS-------NLAFVSQIEPRSFKDAE 683
                   P   I+ N  Q       + TR+   +          ++  ++ EPR+   A 
Sbjct: 929  ILIHPPPPLAQIVNNNNQAPLNTHSMGTRAKAGIIKPNPKYSLAVSLAAESEPRTAIQAL 988

Query: 684  CDEFWILAMQEELNQFERNKVWKLV-PRPSNASIIGTKWVFRNKMDENGNIIRNKARLVA 743
             DE W  AM  E+N    N  W LV P PS+ +I+G +W+F  K + +G++ R KARLVA
Sbjct: 989  KDERWRNAMGSEINAQIGNHTWDLVPPPPSHVTIVGCRWIFTKKYNSDGSLNRYKARLVA 1048

Query: 744  QGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQ 803
            +GY Q  G+DY ETF+PV +  +IR++L  A  +++ + Q+DV +AFL G + ++VY+ Q
Sbjct: 1049 KGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLTDDVYMSQ 1108

Query: 804  PPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNND 863
            PPGF   D PN+V KL+KALYGLKQAPRAWY  L  +LL   F     D +LF+  +   
Sbjct: 1109 PPGFIDKDRPNYVCKLRKALYGLKQAPRAWYVELRNYLLTIGFVNSVSDTSLFVLQRGKS 1168

Query: 864  MLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQE 923
            ++ + +YVDDI+    + +L       +   F +    EL +FLG++ K++  G+ +SQ 
Sbjct: 1169 IVYMLVYVDDILITGNDPTLLHNTLDNLSQRFSVKDHEELHYFLGIEAKRVPTGLHLSQR 1228

Query: 924  KYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIM 945
            +Y  DLL +  +   K   TPM+ + KL      K  D   YRG++GSL YL  +RPDI 
Sbjct: 1229 RYILDLLARTNMITAKPVTTPMAPSPKLSLYSGTKLTDPTEYRGIVGSLQYLAFTRPDIS 1288

BLAST of CSPI07G08510 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 428.7 bits (1101), Expect = 1.7e-118
Identity = 316/1082 (29.21%), Postives = 492/1082 (45.47%), Query Frame = 0

Query: 24   NKWYLDSGCSRHMTGDRSKLISFSKKNGG---MVTFGD-------------------NKK 83
            N W LDSG + H+T D + L       GG   M+  G                    +  
Sbjct: 308  NNWLLDSGATHHITSDFNNLSFHQPYTGGDDVMIADGSTIPITHTGSASLPTSSRSLDLN 367

Query: 84   GVITEENLIMMLLKIF-------VKKMVFPIIFSLQELLNKMVWLKGKIVLCKNLLDHGN 143
             V+   N+   L+ ++       V    FP  F +++L   +  L+GK            
Sbjct: 368  KVLYVPNIHKNLISVYRLCNTNRVSVEFFPASFQVKDLNTGVPLLQGK-----------T 427

Query: 144  RDENVYTLDLNYYPIIDK---------CLSVFHDDSWLWHRRLGHASMHLISNISKNCLV 203
            +DE      L  +PI            C    H     WH RLGH S+ +++++  N  +
Sbjct: 428  KDE------LYEWPIASSQAVSMFASPCSKATHSS---WHSRLGHPSLAILNSVISNHSL 487

Query: 204  RGL-PSFKFEKDKVCDACQMGKQTKSSFKSKNVISTTRPLQLLHMDLFGPSRIASYGGNY 263
              L PS K      C  C + K  K  F S + I++++PL+ ++ D++  S I S     
Sbjct: 488  PVLNPSHKLLS---CSDCFINKSHKVPF-SNSTITSSKPLEYIYSDVWS-SPILSIDNYR 547

Query: 264  YAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFK 323
            Y  + VD F+R+TW+  +K K     +FI F   V+N     I  + SD+GGEF     +
Sbjct: 548  YYVIFVDHFTRYTWLYPLKQKSQVKDTFIIFKSLVENRFQTRIGTLYSDNGGEF--VVLR 607

Query: 324  AFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVS 383
             +  ++G SH  S P TP+ NG+ ERK+R + E   ++L+   +PK +W  A + A Y+ 
Sbjct: 608  DYLSQHGISHFTSPPHTPEHNGLSERKHRHIVEMGLTLLSHASVPKTYWPYAFSVAVYLI 667

Query: 384  NRVLVRPSLD-KTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYS 443
            NR L  P L  ++P++   G+ PN    KVFGC C+         K + K+    F+GYS
Sbjct: 668  NR-LPTPLLQLQSPFQKLFGQPPNYEKLKVFGCACYPWLRPYNRHKLEDKSKQCAFMGYS 727

Query: 444  STSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPS 503
             T  AY   +  T  +  S HV FDE     S  +      ++   D   N      +P+
Sbjct: 728  LTQSAYLCLHIPTGRLYTSRHVQFDERCFPFSTTNFGVSTSQEQRSDSAPNWPSHTTLPT 787

Query: 504  MQDV---------------------NIIEKKEEGSSSLP--------------------- 563
               V                     + +   +  SS+LP                     
Sbjct: 788  TPLVLPAPPCLGPHLDTSPRPPSSPSPLCTTQVSSSNLPSSSISSPSSSEPTAPSHNGPQ 847

Query: 564  ---KEWRYALSHPKDLILGNPEQGVKTRSSLNLFSNL----------------------- 623
               +  +   S+    IL NP     + +S N  S L                       
Sbjct: 848  PTAQPHQTQNSNSNSPILNNPNPNSPSPNSPNQNSPLPQSPISSPHIPTPSTSISEPNSP 907

Query: 624  ----------------------------------------------------AFVSQIEP 683
                                                                +  +  EP
Sbjct: 908  SSSSTSTPPLPPVLPAPPIIQVNAQAPVNTHSMATRAKDGIRKPNQKYSYATSLAANSEP 967

Query: 684  RSFKDAECDEFWILAMQEELNQFERNKVWKLV-PRPSNASIIGTKWVFRNKMDENGNIIR 743
            R+   A  D+ W  AM  E+N    N  W LV P P + +I+G +W+F  K + +G++ R
Sbjct: 968  RTAIQAMKDDRWRQAMGSEINAQIGNHTWDLVPPPPPSVTIVGCRWIFTKKFNSDGSLNR 1027

Query: 744  NKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIV 803
             KARLVA+GY Q  G+DY ETF+PV +  +IR++L  A  +++ + Q+DV +AFL G + 
Sbjct: 1028 YKARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLT 1087

Query: 804  EEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLF 863
            +EVY+ QPPGF   D P++V +L+KA+YGLKQAPRAWY  L  +LL   F     D +LF
Sbjct: 1088 DEVYMSQPPGFVDKDRPDYVCRLRKAIYGLKQAPRAWYVELRTYLLTVGFVNSISDTSLF 1147

Query: 864  IKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKD 923
            +  +   ++ + +YVDDI+    ++ L +     +   F +    +L +FLG++ K++  
Sbjct: 1148 VLQRGRSIIYMLVYVDDILITGNDTVLLKHTLDALSQRFSVKEHEDLHYFLGIEAKRVPQ 1207

Query: 924  GIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLT 945
            G+ +SQ +YT DLL +  +   K   TPM+T+ KL      K  D   YRG++GSL YL 
Sbjct: 1208 GLHLSQRRYTLDLLARTNMLTAKPVATPMATSPKLTLHSGTKLPDPTEYRGIVGSLQYLA 1267

BLAST of CSPI07G08510 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 422.2 bits (1084), Expect = 1.6e-116
Identity = 315/1043 (30.20%), Postives = 508/1043 (48.71%), Query Frame = 0

Query: 28   LDSGCSRHMTGDRSKL---------ISFSKKNGGMVTFGDNKKGVITEENLIMMLLK--I 87
            LDSG S H+  D S           +  +    G   +   K+G++   N   + L+  +
Sbjct: 291  LDSGASDHLINDESLYTDSVEVVPPLKIAVAKQGEFIYA-TKRGIVRLRNDHEITLEDVL 350

Query: 88   FVKKMVFPI--IFSLQELLNKMVWLKGKIVLCKN---LLDHGNRDENVYTLDLNYYPIID 147
            F K+    +  +  LQE    + + K  + + KN   ++ +     NV  ++   Y I  
Sbjct: 351  FCKEAAGNLMSVKRLQEAGMSIEFDKSGVTISKNGLMVVKNSGMLNNVPVINFQAYSINA 410

Query: 148  KCLSVFHDDSWLWHRRLGHASMHLISNISKNCLV--RGLPSFKFEKDKVCDACQMGKQTK 207
            K  + F     LWH R GH S   +  I +  +   + L +      ++C+ C  GKQ +
Sbjct: 411  KHKNNFR----LWHERFGHISDGKLLEIKRKNMFSDQSLLNNLELSCEICEPCLNGKQAR 470

Query: 208  SSFKS-KNVISTTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDA 267
              FK  K+     RPL ++H D+ GP    +     Y  + VD F+ +    +IK+K D 
Sbjct: 471  LPFKQLKDKTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVTYLIKYKSDV 530

Query: 268  LKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTPQQNGVV 327
               F  F  + +      +  +  D+G E+ ++  + FC + G S++ + P TPQ NGV 
Sbjct: 531  FSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVPHTPQLNGVS 590

Query: 328  ERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLD--KTPYELWHGKIP 387
            ER  RT+ E AR+M++   L K FW EAV TA Y+ NR+  R  +D  KTPYE+WH K P
Sbjct: 591  ERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYEMWHNKKP 650

Query: 388  NIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTS-KAYRVFNKKTLVIEESIHV 447
             + + +VFG   ++ + K K GKFD K+   IF+GY     K +   N+K +V  +   V
Sbjct: 651  YLKHLRVFGATVYV-HIKNKQGKFDDKSFKSIFVGYEPNGFKLWDAVNEKFIVARD---V 710

Query: 448  VFDESWNNVSN-----ESICSDDLEKDFGDLLVNDKGKEI-------------VPSMQDV 507
            V DE+ N V++     E++   D ++       ND  K I             +  ++D 
Sbjct: 711  VVDET-NMVNSRAVKFETVFLKDSKESENKNFPNDSRKIIQTEFPNESKECDNIQFLKDS 770

Query: 508  NIIEKK---------------------------------------------------EEG 567
               E K                                                   E  
Sbjct: 771  KESENKNFPNDSRKIIQTEFPNESKECDNIQFLKDSKESNKYFLNESKKRKRDDHLNESK 830

Query: 568  SSSLPKEWRYA--LSHPKDLILGNP------------EQGVKTR---------SSLN-LF 627
             S  P E R +    H K++ + NP             + +KT+         +SLN + 
Sbjct: 831  GSGNPNESRESETAEHLKEIGIDNPTKNDGIEIINRRSERLKTKPQISYNEEDNSLNKVV 890

Query: 628  SNLAFVSQIEPRSFKDAECDE---FWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWV 687
             N   +    P SF + +  +    W  A+  ELN  + N  W +  RP N +I+ ++WV
Sbjct: 891  LNAHTIFNDVPNSFDEIQYRDDKSSWEEAINTELNAHKINNTWTITKRPENKNIVDSRWV 950

Query: 688  FRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQ 747
            F  K +E GN IR KARLVA+G+ Q+  IDYEETFAPVAR+ + R +L+     N  ++Q
Sbjct: 951  FSVKYNELGNPIRYKARLVARGFTQKYQIDYEETFAPVARISSFRFILSLVIQYNLKVHQ 1010

Query: 748  MDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLE 807
            MDVK+AFLNG + EE+Y+  P G       ++V KL KA+YGLKQA R W++   + L E
Sbjct: 1011 MDVKTAFLNGTLKEEIYMRLPQGISCNS--DNVCKLNKAIYGLKQAARCWFEVFEQALKE 1070

Query: 808  NDFKMGKIDNTLFIKVKN--NDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMG 867
             +F    +D  ++I  K   N+ + V +YVDD++  + + +    F + +  +F M+ + 
Sbjct: 1071 CEFVNSSVDRCIYILDKGNINENIYVLLYVDDVVIATGDMTRMNNFKRYLMEKFRMTDLN 1130

Query: 868  ELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVD 927
            E+  F+G++I+  +D I++SQ  Y + +L KF +       TP+   +K++ +      D
Sbjct: 1131 EIKHFIGIRIEMQEDKIYLSQSAYVKKILSKFNMENCNAVSTPL--PSKINYELLNSDED 1190

Query: 928  IKT-YRGMIGSLLY-LTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGL 946
              T  R +IG L+Y +  +RPD+  +V + +R+ S      +  +KR+L+YL GTID+ L
Sbjct: 1191 CNTPCRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRVLRYLKGTIDMKL 1250

BLAST of CSPI07G08510 vs. ExPASy Swiss-Prot
Match: P92519 (Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana OX=3702 GN=AtMg00810 PE=4 SV=1)

HSP 1 Score: 148.7 bits (374), Expect = 3.4e-34
Identity = 81/225 (36.00%), Postives = 132/225 (58.67%), Query Frame = 0

Query: 715 IYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRD 774
           +YVDDI+   ++++L       + + F M  +G + +FLG+QIK    G+F+SQ KY   
Sbjct: 5   LYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYAEQ 64

Query: 775 LLKKFKLNEGKVAKTPMSTTTKLDKDEK---GKCVDIKTYRGMIGSLLYLTASRPDIMFS 834
           +L     N G +   PMST   L  +      K  D   +R ++G+L YLT +RPDI ++
Sbjct: 65  ILN----NAGMLDCKPMSTPLPLKLNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDISYA 124

Query: 835 VCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRK 894
           V +  +    P  + F  +KR+L+Y+ GTI  GL+  +N + N+  + D+D+AG    R+
Sbjct: 125 VNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRR 184

Query: 895 STSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILW 937
           ST+G C FLG +++SW +K+Q +V+ S+TE EY A+A   A++ W
Sbjct: 185 STTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CSPI07G08510 vs. ExPASy TrEMBL
Match: A0A438GI90 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_2030 PE=4 SV=1)

HSP 1 Score: 1267.7 bits (3279), Expect = 0.0e+00
Identity = 625/945 (66.14%), Postives = 751/945 (79.47%), Query Frame = 0

Query: 19  KASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVITEENLI-----MML 78
           + SK++KW+LDSGCSRHMTGD SK    +K+ GG VTFGDN KG I  +  I      ++
Sbjct: 37  EGSKEDKWFLDSGCSRHMTGDESKFAFLTKRKGGYVTFGDNAKGRIIGQGNIGNGTSSLI 96

Query: 79  LKIFVKKMVFPIIFSLQELLNK--MVWLKGKIVLCKNLLDH-----GNRDENVYTLDLNY 138
             + +   +   + S+ +L +K   V  +    + K++ +      G+R ENVY ++++ 
Sbjct: 97  ESVLLVDGLKHNLLSISQLCDKGFKVIFEASHCIIKDIQNDKTIFMGHRCENVYAINISK 156

Query: 139 YPIIDKCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGK 198
           Y   D+C S  HD SWLWHRRLGHA+M LIS ++K+ LVRGLP   F+KDK+C+ACQMGK
Sbjct: 157 YDGHDRCFSSMHDQSWLWHRRLGHANMDLISQLNKDELVRGLPKINFQKDKICEACQMGK 216

Query: 199 QTKSSFKSKNVISTTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKD 258
           Q K+SFK+KN IST+RPL+LLHMDLFGPSR  S GG  YA+VIVDDFSR+TWVL +  K 
Sbjct: 217 QIKNSFKNKNFISTSRPLELLHMDLFGPSRTPSLGGKSYAYVIVDDFSRYTWVLFLSQKS 276

Query: 259 DALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTPQQNG 318
           +A   F  F  +VQNEKGF I+ IRSDHG EF+N  F+ +C ++G +HNFS+PRTPQQNG
Sbjct: 277 EAFYEFSKFCNKVQNEKGFSITCIRSDHGREFENFDFEEYCNKHGINHNFSAPRTPQQNG 336

Query: 319 VVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIP 378
           VVERKNRTLQE AR+MLNE  LPKYFW EAVNT+CYV NR+L+RP L KTPYELW  K P
Sbjct: 337 VVERKNRTLQEMARTMLNENNLPKYFWAEAVNTSCYVLNRILLRPILKKTPYELWKNKKP 396

Query: 379 NIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEESIHVV 438
           NI YFKVFGCKCFILN K+ LGKFD+K+DVGIFLGYS++SKA+RVFNK+T+V+EESIHV+
Sbjct: 397 NISYFKVFGCKCFILNTKDNLGKFDAKSDVGIFLGYSTSSKAFRVFNKRTMVVEESIHVI 456

Query: 439 FDESWNNVSNESICSDD--LEKDFGDLLVNDKGKEIV----PSMQDVNII-----EKKEE 498
           FDES N++       DD  LE   G L + DK ++      P  +D  +      + + E
Sbjct: 457 FDESNNSLQERESVDDDLGLETSMGKLQIEDKRQQEESGEDPKKEDSPLALPPPQQVQGE 516

Query: 499 GSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEF 558
            S  LPK+W++ ++HP+D I+GNP  GV+TRSSL N+ +NLAF+SQIEP++ KDA  DE 
Sbjct: 517 SSQDLPKDWKFVINHPQDQIIGNPSSGVRTRSSLRNICNNLAFISQIEPKNIKDAIVDEN 576

Query: 559 WILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQ 618
           W++AMQEELNQFER++VW+LVPRPSN S+IGTKWVFRNKMDENG I+RNKARLVAQGY Q
Sbjct: 577 WMIAMQEELNQFERSEVWELVPRPSNQSVIGTKWVFRNKMDENGIIVRNKARLVAQGYNQ 636

Query: 619 EEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFE 678
           EEGIDYEETFAPVARLEAIRMLLAFA +K+FILYQMDVKSAFLNG+I EEVYVEQPPGF+
Sbjct: 637 EEGIDYEETFAPVARLEAIRMLLAFACFKDFILYQMDVKSAFLNGFINEEVYVEQPPGFQ 696

Query: 679 SFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQ 738
           SF+ PNHV+KLKKALYGLKQAPRAWY+RLSKFLL+  FKMGKID TLFIK K  DML+VQ
Sbjct: 697 SFNFPNHVFKLKKALYGLKQAPRAWYERLSKFLLKKGFKMGKIDTTLFIKTKEKDMLLVQ 756

Query: 739 IYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRD 798
           IYVDDIIFG+TN SLCE+FSKCMH+EFEMSMMGEL++FLGLQIKQLK+G FI+Q KY +D
Sbjct: 757 IYVDDIIFGATNDSLCEDFSKCMHSEFEMSMMGELNYFLGLQIKQLKEGTFINQAKYIKD 816

Query: 799 LLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCL 858
           LLK+F + E KV KTPMS++ KLD DEKGK +D   YRGMIGSLLYLTASRPDIM+SVCL
Sbjct: 817 LLKRFNMEEAKVMKTPMSSSIKLDMDEKGKSIDSTMYRGMIGSLLYLTASRPDIMYSVCL 876

Query: 859 CARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTS 918
           CARFQSCPKESH  AVKRIL+YL GT+++GLWYP+   F L+G+SDADFAG  ++RKSTS
Sbjct: 877 CARFQSCPKESHLSAVKRILRYLKGTMNIGLWYPKGDNFELIGFSDADFAGCRVERKSTS 936

Query: 919 GTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQ 940
           GTC FLG SLVSW SKKQNSVALST EAEYIA   CCAQILWMKQ
Sbjct: 937 GTCHFLGHSLVSWHSKKQNSVALSTAEAEYIAAGLCCAQILWMKQ 981

BLAST of CSPI07G08510 vs. ExPASy TrEMBL
Match: A5C8K0 (Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VITISV_001808 PE=4 SV=1)

HSP 1 Score: 1149.0 bits (2971), Expect = 0.0e+00
Identity = 582/941 (61.85%), Postives = 701/941 (74.50%), Query Frame = 0

Query: 12   DLVPVCLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVITEENLI- 71
            +++    K SK++KW+LDSGCSRHMTGD SK    +K+ GG VTFGDN KG I  +  I 
Sbjct: 460  EMILASQKCSKEDKWFLDSGCSRHMTGDESKFAFLTKRKGGYVTFGDNAKGRIIGQGNIG 519

Query: 72   ----MMLLKIFVKKMVFPIIFSLQELLNK--MVWLKGKIVLCKNLLDH-----GNRDENV 131
                 ++  + +   +   + S+ +L NK   V  +    + K++ +      G+R ENV
Sbjct: 520  NGTSSLIESVLLVDGLKHNLLSISQLCNKGFKVIFEASHCIIKDIQNDKTIFMGHRCENV 579

Query: 132  YTLDLNYYPIIDKCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVC 191
            Y ++++ Y   D+C S  HD SWLWHRRLGHA+M LIS ++K+ LVRGLP   F+KDK+C
Sbjct: 580  YAINISKYDGHDRCFSSMHDQSWLWHRRLGHANMDLISQLNKDELVRGLPKINFQKDKIC 639

Query: 192  DACQMGKQTKSSFKSKNVISTTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWV 251
            +ACQMGKQ K+SFK+KN IST+RPL+LLHMDLFGPSR  S GG  YA+VIVDDFSR+TWV
Sbjct: 640  EACQMGKQIKNSFKNKNFISTSRPLELLHMDLFGPSRTPSLGGKSYAYVIVDDFSRYTWV 699

Query: 252  LMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSP 311
            L +  K +A   F  F  +VQNEKGF I+ IRSDHG EF+N  F+ +C + G +HNF +P
Sbjct: 700  LFLSQKSEAFYEFSKFCNKVQNEKGFSITCIRSDHGREFENFDFEEYCNKYGINHNFLAP 759

Query: 312  RTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYE 371
            RT QQNGVVERKNRTLQE AR+MLNE  LPKYFW EA+NT+CYV NR+L+RP L KTPYE
Sbjct: 760  RTSQQNGVVERKNRTLQEMARTMLNENNLPKYFWAEAINTSCYVLNRILLRPILKKTPYE 819

Query: 372  LWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVI 431
            LW  K PNI YFKVFGCKCFILN K+ LGKFD+K+DVGIFLGYS++SKA+RVFNK+T+V+
Sbjct: 820  LWKNKKPNISYFKVFGCKCFILNTKDNLGKFDAKSDVGIFLGYSTSSKAFRVFNKRTMVV 879

Query: 432  EESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSS 491
            EESIH  +   W N    +       KD      N K  E +P        +K       
Sbjct: 880  EESIH-DWRLPWENCKLRT-------KD------NKKKVERIPR-------KKNHLWHYL 939

Query: 492  LPKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILA 551
            L  + ++ ++HP+D I+GNP  GV+TRSSL N+ +NLAF+SQIEP++ KDA  DE W++A
Sbjct: 940  LLNKCKFVINHPQDQIIGNPLSGVRTRSSLRNICNNLAFISQIEPKNIKDAIVDENWMIA 999

Query: 552  MQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGI 611
            MQ+ELNQFER++VW+LVPRPSN S+IGTKWVFRNKMDENG I+RNKARLVAQGY QEEGI
Sbjct: 1000 MQKELNQFERSEVWELVPRPSNQSVIGTKWVFRNKMDENGIIVRNKARLVAQGYNQEEGI 1059

Query: 612  DYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDL 671
            DYEETFAPVARLEAIRMLLAFA +K+FILYQMDVKSAFLNG+I EE+YVEQPPGF+SF+ 
Sbjct: 1060 DYEETFAPVARLEAIRMLLAFACFKDFILYQMDVKSAFLNGFINEEIYVEQPPGFQSFNF 1119

Query: 672  PNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVD 731
            PNHV+KLKKALYGLKQAPRAWY+RLSKFLL+  FKMGKID TLFIK K NDML+VQIYVD
Sbjct: 1120 PNHVFKLKKALYGLKQAPRAWYERLSKFLLKKSFKMGKIDTTLFIKTKENDMLLVQIYVD 1179

Query: 732  DIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKK 791
            DI FG+TN SLCE+FSKCMH                               KY +DLLK+
Sbjct: 1180 DITFGATNDSLCEDFSKCMHT------------------------------KYIKDLLKR 1239

Query: 792  FKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARF 851
            F + E KV KTPMS++ KLD DEKGK +D   YRGMIGSLLYLTASRPDIM+SVCLCARF
Sbjct: 1240 FNMGEAKVMKTPMSSSIKLDMDEKGKSIDSTMYRGMIGSLLYLTASRPDIMYSVCLCARF 1299

Query: 852  QSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQ 911
            QSCPKESH  AVKRIL+YL GT+ +GLWYP+   F L+G+SDADFAG  ++RKSTSGTC 
Sbjct: 1300 QSCPKESHLSAVKRILRYLKGTMSIGLWYPKGDNFELIGFSDADFAGCRVERKSTSGTCH 1349

Query: 912  FLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQ 940
             LG SLVSW SKKQNS+ALST EAEY A + C AQILWMKQ
Sbjct: 1360 SLGHSLVSWHSKKQNSIALSTAEAEYTAASLCYAQILWMKQ 1349

BLAST of CSPI07G08510 vs. ExPASy TrEMBL
Match: A0A151UHG7 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=3821 GN=KK1_048795 PE=4 SV=1)

HSP 1 Score: 1142.5 bits (2954), Expect = 0.0e+00
Identity = 581/940 (61.81%), Postives = 705/940 (75.00%), Query Frame = 0

Query: 17   CLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVI---------TEE 76
            CL+A K   WYLDSGCSRHMTGD SK  +   KN G VT+GDN KG I         + +
Sbjct: 537  CLRA-KNLLWYLDSGCSRHMTGDPSKFTNLKLKNEGYVTYGDNNKGKILGHGNVGNPSSQ 596

Query: 77   NLIMMLLKIFVKKMVFPIIFSLQELLNKMVWLKGKIVLC----KNLLDHGNRDENVYTLD 136
             LI  +L +   K     I  L +   K+ +     ++C    K +   G R +N+Y LD
Sbjct: 597  TLIENVLLVDGLKHNLLSISQLSDKGFKIEFDNTCCLICDKKSKEIRFIGKRIDNIYMLD 656

Query: 137  LNYYPIID--KCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDA 196
            L +   +   KCL    ++ WLWHRR  H  M  ++ + +  LV GLP  KF KDK+CDA
Sbjct: 657  LEHSITMSNTKCLITQEENIWLWHRRAAHIHMDHLNKLCRKELVVGLPKLKFGKDKLCDA 716

Query: 197  CQMGKQTKSSFKSKNVISTTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLM 256
            CQ GKQ K+SFKSKN IST+RPLQL+HMDLFGPSR  S GGNYY  VIVDD+SR+TWV+ 
Sbjct: 717  CQKGKQVKASFKSKNQISTSRPLQLIHMDLFGPSRTMSLGGNYYGLVIVDDYSRYTWVMF 776

Query: 257  IKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRT 316
            + +K+DA  +F  FAK VQNEK   I+ IRSDHGGEF N  F+ FCEE+G +HNFS+PRT
Sbjct: 777  LANKNDAFNAFRKFAKLVQNEKCSNITSIRSDHGGEFQNILFQKFCEEHGINHNFSAPRT 836

Query: 317  PQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELW 376
            PQQNGVVERKNR+L+E AR+MLNE  LPKYFW +A+NTAC+V N+VL+RP L KTPYE++
Sbjct: 837  PQQNGVVERKNRSLEELARTMLNETNLPKYFWADAINTACHVLNKVLIRPILKKTPYEIY 896

Query: 377  HGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIE 436
             GK PNI YF+VFGCKC++LNN KE+LGKFD+K D  IFLGYS+ SKAYR++NK+TLV+E
Sbjct: 897  KGKKPNISYFRVFGCKCYVLNNGKEQLGKFDAKADEAIFLGYSTNSKAYRIYNKRTLVVE 956

Query: 437  ESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSL 496
            ES+HVVFDES N         +DL +     L+ ++  E+    +    +EK +E    L
Sbjct: 957  ESVHVVFDES-NKQETRQTEIEDLNELLDQSLLENEPNEVPKESES---LEKAKETCEQL 1016

Query: 497  PKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAM 556
            PKEW+ +     D I+GN  +GV TRS++ N+ + +AFVSQ+EP++  +A  DE W++AM
Sbjct: 1017 PKEWKTSRDLSMDNIIGNIGKGVSTRSAIKNICNTMAFVSQVEPKNIDEALKDEHWLMAM 1076

Query: 557  QEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGID 616
            QEELNQFERN+VW LVP P +  IIGTKWVFRNK+DE+G I+RNKARLVA+GY QEEGID
Sbjct: 1077 QEELNQFERNEVWDLVPLPKDYPIIGTKWVFRNKLDESGIILRNKARLVAKGYNQEEGID 1136

Query: 617  YEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLP 676
            Y+ETFAPVAR+EAIR+LLA++S KNF LYQMDVKSAFLNG+I EEVYVEQPPGF  +  P
Sbjct: 1137 YDETFAPVARIEAIRLLLAYSSIKNFKLYQMDVKSAFLNGFIQEEVYVEQPPGFVDYKNP 1196

Query: 677  NHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDD 736
            NHVYKLKKALYGLKQAPR+WYDRLSKFL+END++ GK+DNTLF+K   ND + VQIYVDD
Sbjct: 1197 NHVYKLKKALYGLKQAPRSWYDRLSKFLIENDYERGKVDNTLFVKKFKNDTMYVQIYVDD 1256

Query: 737  IIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKF 796
            I+FGSTN+SLC+EF+K M  EFEMSMMGEL+FFLGLQIKQ+ DGIFISQ KY  +LLKKF
Sbjct: 1257 IVFGSTNTSLCKEFAKTMQGEFEMSMMGELTFFLGLQIKQMHDGIFISQSKYCNELLKKF 1316

Query: 797  KLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQ 856
             +   K A TP+S    LD DEKG  VD   YRG+IGSLLYLTASRPDIMF+VCLCARFQ
Sbjct: 1317 GMEGCKEAATPISNNCNLDLDEKGIAVDSSKYRGIIGSLLYLTASRPDIMFAVCLCARFQ 1376

Query: 857  SCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQF 916
            + PKESH  +VKRILKYL GT +VGLWYP+ V  +L+GYSD+D+AG  LDRKSTSGTC  
Sbjct: 1377 ANPKESHMKSVKRILKYLKGTTNVGLWYPKGVSLSLIGYSDSDYAGCRLDRKSTSGTCHL 1436

Query: 917  LGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQ 940
            LGS+LVSW SKKQ  VALST EAEYIA  SCCAQILWMKQ
Sbjct: 1437 LGSALVSWHSKKQACVALSTAEAEYIAAGSCCAQILWMKQ 1471

BLAST of CSPI07G08510 vs. ExPASy TrEMBL
Match: A0A151RY83 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=3821 GN=KK1_030937 PE=4 SV=1)

HSP 1 Score: 1137.9 bits (2942), Expect = 0.0e+00
Identity = 586/943 (62.14%), Postives = 702/943 (74.44%), Query Frame = 0

Query: 18   LKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVI---------TEEN 77
            L  +K   WYLDSGCSRHMTGD SK  S   KN G VT+GDN KG I         + + 
Sbjct: 542  LLRAKNLLWYLDSGCSRHMTGDPSKFSSLKLKNEGYVTYGDNNKGKILGHGNVGNSSSQT 601

Query: 78   LIMMLLKIFVKKMVFPIIFSLQELLNKMVWLKGKIVLC----KNLLDHGNRDENVYTLDL 137
            LI  +L +   K     I  L +   K+ +     ++C    K +   G R +N+Y LDL
Sbjct: 602  LIENVLLVDGLKHNLLSISQLSDKGFKIEFDDTCCLICDKRSKEIRFIGKRIDNIYMLDL 661

Query: 138  NYYPIID--KCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDAC 197
             +   +   KCL    + +WLWHRR  H  M  ++ +S+  LV GLP  KF KDK+CDAC
Sbjct: 662  EHSISMSNTKCLITQEESTWLWHRRAAHIHMDHLNKLSRKELVVGLPKLKFGKDKLCDAC 721

Query: 198  QMGKQTKSSFKSKNVISTTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMI 257
            Q GKQ K+SFKSKN IST+RPLQL+HMDLFGPSR  S GGNYY  VIVDD+SR+TWV+ +
Sbjct: 722  QKGKQVKASFKSKNQISTSRPLQLIHMDLFGPSRTMSLGGNYYGLVIVDDYSRYTWVMFL 781

Query: 258  KHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTP 317
             +K+DA  +F  FAK VQNEK   I+ IRSDHGGEF N  F+ FCEE+G +HNFS+PRTP
Sbjct: 782  ANKNDAFNAFRKFAKLVQNEKCSNITSIRSDHGGEFQNIMFQKFCEEHGINHNFSAPRTP 841

Query: 318  QQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWH 377
            QQNGVVERKNR+L+E AR+MLNE  LPKYFW +A+NTAC+V NRVL+RP L KTPYE++ 
Sbjct: 842  QQNGVVERKNRSLEELARTMLNETNLPKYFWADAINTACHVLNRVLIRPILKKTPYEIYK 901

Query: 378  GKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEE 437
            GK PNI YF+VFGCKC++LNN KE+LGKFD+K D  IFLGYS+ SK+YR++NK+TLV+EE
Sbjct: 902  GKKPNISYFRVFGCKCYVLNNGKEQLGKFDAKADEAIFLGYSTNSKSYRIYNKRTLVVEE 961

Query: 438  SIHVVFDESWNNVSNESICSDDLEKDFGDLLV----NDKGKEIVPSMQDVNIIEKKEEGS 497
            S+HVVFDES N         +DL +     L+    N+K KE V         EK++   
Sbjct: 962  SVHVVFDES-NKQETRQTEIEDLNELLDQPLLESEPNEKSKESVSH-------EKEKVTC 1021

Query: 498  SSLPKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWI 557
              LPKEW+ +     D I+GN  +GV TRS++ N+ + +AFVSQ+EP++  +A  DE W+
Sbjct: 1022 EQLPKEWKTSRELSIDNIIGNIGKGVTTRSAIKNICNTMAFVSQVEPKNIDEALKDEHWL 1081

Query: 558  LAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEE 617
            +AMQEELNQFERN+VW LVP P +  IIGTKWVFRNK+DE+G I+RNKARLVA+GY QEE
Sbjct: 1082 MAMQEELNQFERNEVWDLVPLPKDYPIIGTKWVFRNKLDESGIILRNKARLVAKGYNQEE 1141

Query: 618  GIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESF 677
            GIDY+ETFAPVAR+EAIR+LLA++S KNF LYQMDVKSAFLNG I EEVYVEQPPGF  F
Sbjct: 1142 GIDYDETFAPVARIEAIRLLLAYSSIKNFKLYQMDVKSAFLNGLIQEEVYVEQPPGFVDF 1201

Query: 678  DLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIY 737
              PNHVYKLKKALYGLKQAPR+WYDRLSKFL+END++ GK+DNTLF+K   ND + VQIY
Sbjct: 1202 KNPNHVYKLKKALYGLKQAPRSWYDRLSKFLIENDYERGKVDNTLFVKRFKNDTMYVQIY 1261

Query: 738  VDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLL 797
            VDDI+FGSTNSSLC+EF+K M  EFEMSMMGEL+FFLGLQIKQ+ DG FISQ KY  +LL
Sbjct: 1262 VDDIVFGSTNSSLCKEFAKTMQGEFEMSMMGELTFFLGLQIKQMHDGTFISQSKYCNELL 1321

Query: 798  KKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCA 857
            KKF +   K A TP+S    LD DEKG  VD   YRG+IGSLLYLTASRPDIMF+VCLCA
Sbjct: 1322 KKFGMEGCKEAATPISNNCNLDLDEKGIAVDNSKYRGIIGSLLYLTASRPDIMFAVCLCA 1381

Query: 858  RFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGT 917
            RFQ+ PKESH  +VKRILKYL GT +VGLWYP+ V  +L+GYSD+DFAG  LDRKSTSGT
Sbjct: 1382 RFQANPKESHMKSVKRILKYLKGTTNVGLWYPKGVSLSLIGYSDSDFAGCRLDRKSTSGT 1441

Query: 918  CQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQ 940
            C  LGS+LVSW SKKQ  VALST EAEYIA  SCCAQILWMKQ
Sbjct: 1442 CHLLGSALVSWHSKKQACVALSTAEAEYIAAGSCCAQILWMKQ 1476

BLAST of CSPI07G08510 vs. ExPASy TrEMBL
Match: A0A151QSZ9 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=3821 GN=KK1_045700 PE=4 SV=1)

HSP 1 Score: 1135.2 bits (2935), Expect = 0.0e+00
Identity = 586/942 (62.21%), Postives = 703/942 (74.63%), Query Frame = 0

Query: 17   CLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVI---------TEE 76
            CL+A K   WYLDSGCSRHMTGD SK  S   KN G VT+GDN KG I         + +
Sbjct: 537  CLRA-KNLLWYLDSGCSRHMTGDPSKFSSLKLKNEGYVTYGDNNKGKILGHGNVGNSSSQ 596

Query: 77   NLIMMLLKIFVKKMVFPIIFSLQELLNKMVWLKGKIVLC----KNLLDHGNRDENVYTLD 136
             LI  +L +   K     I  L +   K+ +     ++C    K +   G R +N+Y LD
Sbjct: 597  TLIENVLLVDGLKHNLLSISQLSDKGFKIEFDNTCCLICDKKSKEIRFIGKRIDNIYMLD 656

Query: 137  LNYYPIID--KCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDA 196
            L +   I   KCL    ++ WLWHRR  H  M  ++ +S+  LV GLP  KF KDK+CDA
Sbjct: 657  LEHSIAISNTKCLITQEENIWLWHRRAAHIHMDHLNKLSRKELVVGLPKLKFGKDKLCDA 716

Query: 197  CQMGKQTKSSFKSKNVISTTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLM 256
            CQ GKQ K+SFKSKN ISTTRPLQL+HMDLFGPSR  S GGNYY  VIVDD+SR+TWV+ 
Sbjct: 717  CQKGKQVKASFKSKNQISTTRPLQLIHMDLFGPSRTMSLGGNYYGLVIVDDYSRYTWVMF 776

Query: 257  IKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRT 316
            + +K+DA  +F  FAK VQNEK   I+ IRSDHGGEF N  F+ FCEE+G +HNFS+PRT
Sbjct: 777  LANKNDAFNAFRKFAKLVQNEKCSNITSIRSDHGGEFQNIMFQKFCEEHGINHNFSAPRT 836

Query: 317  PQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELW 376
            PQQNGVVERKNR+L+E AR+MLNE  LPKYFW +A+NTAC+V N+VL+RP L KTPYE++
Sbjct: 837  PQQNGVVERKNRSLEELARTMLNETKLPKYFWADAINTACHVLNKVLIRPILKKTPYEIY 896

Query: 377  HGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIE 436
            +G+ PNI YF+VFGCKCF+LNN KE+LGKFD+K D  IFLGYS+ SKAYR++NK+TLV+E
Sbjct: 897  NGRKPNISYFRVFGCKCFVLNNGKEQLGKFDAKADEAIFLGYSTNSKAYRIYNKRTLVVE 956

Query: 437  ESIHVVFDESWNNVSNES---ICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGS 496
            ES+HVVFDES    + ++     +D L++   +   N+K KE           E  +E S
Sbjct: 957  ESVHVVFDESNKQETRQTEIEDLTDLLDQPLLESETNNKPKESESH-------ENTKETS 1016

Query: 497  SSLPKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWI 556
              LPKEW+ +     D I+GN  +GV TRS++ N+ + +AFVSQ+EP+S  +A  DE W+
Sbjct: 1017 EQLPKEWKTSRDLSIDNIIGNIGKGVSTRSAIKNICNTMAFVSQVEPKSIDEALKDEHWL 1076

Query: 557  LAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEE 616
            +AMQEELNQFERN+VW LVP P++  IIGTKWVFRNK+DE+G IIRNKARLVA+GY QEE
Sbjct: 1077 MAMQEELNQFERNEVWDLVPLPTDYPIIGTKWVFRNKLDESGIIIRNKARLVAKGYNQEE 1136

Query: 617  GIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESF 676
            GIDY+ETFAPVAR+EAIR+LLA++S  NF LYQMDVKSAFLNG I EEVYVEQPPGF  F
Sbjct: 1137 GIDYDETFAPVARIEAIRLLLAYSSIMNFKLYQMDVKSAFLNGLIQEEVYVEQPPGFVDF 1196

Query: 677  DLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIY 736
              PNHVYKLKKALYGLKQAPR+WYDRLSKFL+END+  GK+DNTLF+K   ND + VQIY
Sbjct: 1197 KNPNHVYKLKKALYGLKQAPRSWYDRLSKFLIENDYVRGKVDNTLFVKKFKNDTMYVQIY 1256

Query: 737  VDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLL 796
            VDDI+FGSTN+SLC+EF+K M  EFEMSMMGEL+FFLGLQIKQ+ DGIFISQ KY  +LL
Sbjct: 1257 VDDIVFGSTNTSLCKEFAKTMQGEFEMSMMGELTFFLGLQIKQMSDGIFISQSKYCNELL 1316

Query: 797  KKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCA 856
            KKF +   K   TP+S    LD DEKG  VD   YRG+IGSLLYLTASRPDIMF VCLCA
Sbjct: 1317 KKFGMEGCKEVATPISNNCNLDLDEKGIVVDNSKYRGIIGSLLYLTASRPDIMFVVCLCA 1376

Query: 857  RFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGT 916
            RFQ+ PKESH  +VKRILKYL GT +VGLWYP+ V  +L+GYSD+D+AG  LDRKSTSGT
Sbjct: 1377 RFQANPKESHMKSVKRILKYLKGTTNVGLWYPKGVSLSLIGYSDSDYAGCRLDRKSTSGT 1436

Query: 917  CQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMK 939
            C  LGS+LVSW SKKQ  VALST EAEYIA  SCCAQILWMK
Sbjct: 1437 CHLLGSALVSWNSKKQACVALSTAEAEYIAAGSCCAQILWMK 1470

BLAST of CSPI07G08510 vs. NCBI nr
Match: RVW71911.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 1267.7 bits (3279), Expect = 0.0e+00
Identity = 625/945 (66.14%), Postives = 751/945 (79.47%), Query Frame = 0

Query: 19  KASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVITEENLI-----MML 78
           + SK++KW+LDSGCSRHMTGD SK    +K+ GG VTFGDN KG I  +  I      ++
Sbjct: 37  EGSKEDKWFLDSGCSRHMTGDESKFAFLTKRKGGYVTFGDNAKGRIIGQGNIGNGTSSLI 96

Query: 79  LKIFVKKMVFPIIFSLQELLNK--MVWLKGKIVLCKNLLDH-----GNRDENVYTLDLNY 138
             + +   +   + S+ +L +K   V  +    + K++ +      G+R ENVY ++++ 
Sbjct: 97  ESVLLVDGLKHNLLSISQLCDKGFKVIFEASHCIIKDIQNDKTIFMGHRCENVYAINISK 156

Query: 139 YPIIDKCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGK 198
           Y   D+C S  HD SWLWHRRLGHA+M LIS ++K+ LVRGLP   F+KDK+C+ACQMGK
Sbjct: 157 YDGHDRCFSSMHDQSWLWHRRLGHANMDLISQLNKDELVRGLPKINFQKDKICEACQMGK 216

Query: 199 QTKSSFKSKNVISTTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKD 258
           Q K+SFK+KN IST+RPL+LLHMDLFGPSR  S GG  YA+VIVDDFSR+TWVL +  K 
Sbjct: 217 QIKNSFKNKNFISTSRPLELLHMDLFGPSRTPSLGGKSYAYVIVDDFSRYTWVLFLSQKS 276

Query: 259 DALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTPQQNG 318
           +A   F  F  +VQNEKGF I+ IRSDHG EF+N  F+ +C ++G +HNFS+PRTPQQNG
Sbjct: 277 EAFYEFSKFCNKVQNEKGFSITCIRSDHGREFENFDFEEYCNKHGINHNFSAPRTPQQNG 336

Query: 319 VVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIP 378
           VVERKNRTLQE AR+MLNE  LPKYFW EAVNT+CYV NR+L+RP L KTPYELW  K P
Sbjct: 337 VVERKNRTLQEMARTMLNENNLPKYFWAEAVNTSCYVLNRILLRPILKKTPYELWKNKKP 396

Query: 379 NIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEESIHVV 438
           NI YFKVFGCKCFILN K+ LGKFD+K+DVGIFLGYS++SKA+RVFNK+T+V+EESIHV+
Sbjct: 397 NISYFKVFGCKCFILNTKDNLGKFDAKSDVGIFLGYSTSSKAFRVFNKRTMVVEESIHVI 456

Query: 439 FDESWNNVSNESICSDD--LEKDFGDLLVNDKGKEIV----PSMQDVNII-----EKKEE 498
           FDES N++       DD  LE   G L + DK ++      P  +D  +      + + E
Sbjct: 457 FDESNNSLQERESVDDDLGLETSMGKLQIEDKRQQEESGEDPKKEDSPLALPPPQQVQGE 516

Query: 499 GSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEF 558
            S  LPK+W++ ++HP+D I+GNP  GV+TRSSL N+ +NLAF+SQIEP++ KDA  DE 
Sbjct: 517 SSQDLPKDWKFVINHPQDQIIGNPSSGVRTRSSLRNICNNLAFISQIEPKNIKDAIVDEN 576

Query: 559 WILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQ 618
           W++AMQEELNQFER++VW+LVPRPSN S+IGTKWVFRNKMDENG I+RNKARLVAQGY Q
Sbjct: 577 WMIAMQEELNQFERSEVWELVPRPSNQSVIGTKWVFRNKMDENGIIVRNKARLVAQGYNQ 636

Query: 619 EEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFE 678
           EEGIDYEETFAPVARLEAIRMLLAFA +K+FILYQMDVKSAFLNG+I EEVYVEQPPGF+
Sbjct: 637 EEGIDYEETFAPVARLEAIRMLLAFACFKDFILYQMDVKSAFLNGFINEEVYVEQPPGFQ 696

Query: 679 SFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQ 738
           SF+ PNHV+KLKKALYGLKQAPRAWY+RLSKFLL+  FKMGKID TLFIK K  DML+VQ
Sbjct: 697 SFNFPNHVFKLKKALYGLKQAPRAWYERLSKFLLKKGFKMGKIDTTLFIKTKEKDMLLVQ 756

Query: 739 IYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRD 798
           IYVDDIIFG+TN SLCE+FSKCMH+EFEMSMMGEL++FLGLQIKQLK+G FI+Q KY +D
Sbjct: 757 IYVDDIIFGATNDSLCEDFSKCMHSEFEMSMMGELNYFLGLQIKQLKEGTFINQAKYIKD 816

Query: 799 LLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCL 858
           LLK+F + E KV KTPMS++ KLD DEKGK +D   YRGMIGSLLYLTASRPDIM+SVCL
Sbjct: 817 LLKRFNMEEAKVMKTPMSSSIKLDMDEKGKSIDSTMYRGMIGSLLYLTASRPDIMYSVCL 876

Query: 859 CARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTS 918
           CARFQSCPKESH  AVKRIL+YL GT+++GLWYP+   F L+G+SDADFAG  ++RKSTS
Sbjct: 877 CARFQSCPKESHLSAVKRILRYLKGTMNIGLWYPKGDNFELIGFSDADFAGCRVERKSTS 936

Query: 919 GTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQ 940
           GTC FLG SLVSW SKKQNSVALST EAEYIA   CCAQILWMKQ
Sbjct: 937 GTCHFLGHSLVSWHSKKQNSVALSTAEAEYIAAGLCCAQILWMKQ 981

BLAST of CSPI07G08510 vs. NCBI nr
Match: CAN64335.1 (hypothetical protein VITISV_001808 [Vitis vinifera])

HSP 1 Score: 1149.0 bits (2971), Expect = 0.0e+00
Identity = 582/941 (61.85%), Postives = 701/941 (74.50%), Query Frame = 0

Query: 12   DLVPVCLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVITEENLI- 71
            +++    K SK++KW+LDSGCSRHMTGD SK    +K+ GG VTFGDN KG I  +  I 
Sbjct: 460  EMILASQKCSKEDKWFLDSGCSRHMTGDESKFAFLTKRKGGYVTFGDNAKGRIIGQGNIG 519

Query: 72   ----MMLLKIFVKKMVFPIIFSLQELLNK--MVWLKGKIVLCKNLLDH-----GNRDENV 131
                 ++  + +   +   + S+ +L NK   V  +    + K++ +      G+R ENV
Sbjct: 520  NGTSSLIESVLLVDGLKHNLLSISQLCNKGFKVIFEASHCIIKDIQNDKTIFMGHRCENV 579

Query: 132  YTLDLNYYPIIDKCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVC 191
            Y ++++ Y   D+C S  HD SWLWHRRLGHA+M LIS ++K+ LVRGLP   F+KDK+C
Sbjct: 580  YAINISKYDGHDRCFSSMHDQSWLWHRRLGHANMDLISQLNKDELVRGLPKINFQKDKIC 639

Query: 192  DACQMGKQTKSSFKSKNVISTTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWV 251
            +ACQMGKQ K+SFK+KN IST+RPL+LLHMDLFGPSR  S GG  YA+VIVDDFSR+TWV
Sbjct: 640  EACQMGKQIKNSFKNKNFISTSRPLELLHMDLFGPSRTPSLGGKSYAYVIVDDFSRYTWV 699

Query: 252  LMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSP 311
            L +  K +A   F  F  +VQNEKGF I+ IRSDHG EF+N  F+ +C + G +HNF +P
Sbjct: 700  LFLSQKSEAFYEFSKFCNKVQNEKGFSITCIRSDHGREFENFDFEEYCNKYGINHNFLAP 759

Query: 312  RTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYE 371
            RT QQNGVVERKNRTLQE AR+MLNE  LPKYFW EA+NT+CYV NR+L+RP L KTPYE
Sbjct: 760  RTSQQNGVVERKNRTLQEMARTMLNENNLPKYFWAEAINTSCYVLNRILLRPILKKTPYE 819

Query: 372  LWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVI 431
            LW  K PNI YFKVFGCKCFILN K+ LGKFD+K+DVGIFLGYS++SKA+RVFNK+T+V+
Sbjct: 820  LWKNKKPNISYFKVFGCKCFILNTKDNLGKFDAKSDVGIFLGYSTSSKAFRVFNKRTMVV 879

Query: 432  EESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSS 491
            EESIH  +   W N    +       KD      N K  E +P        +K       
Sbjct: 880  EESIH-DWRLPWENCKLRT-------KD------NKKKVERIPR-------KKNHLWHYL 939

Query: 492  LPKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILA 551
            L  + ++ ++HP+D I+GNP  GV+TRSSL N+ +NLAF+SQIEP++ KDA  DE W++A
Sbjct: 940  LLNKCKFVINHPQDQIIGNPLSGVRTRSSLRNICNNLAFISQIEPKNIKDAIVDENWMIA 999

Query: 552  MQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGI 611
            MQ+ELNQFER++VW+LVPRPSN S+IGTKWVFRNKMDENG I+RNKARLVAQGY QEEGI
Sbjct: 1000 MQKELNQFERSEVWELVPRPSNQSVIGTKWVFRNKMDENGIIVRNKARLVAQGYNQEEGI 1059

Query: 612  DYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDL 671
            DYEETFAPVARLEAIRMLLAFA +K+FILYQMDVKSAFLNG+I EE+YVEQPPGF+SF+ 
Sbjct: 1060 DYEETFAPVARLEAIRMLLAFACFKDFILYQMDVKSAFLNGFINEEIYVEQPPGFQSFNF 1119

Query: 672  PNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVD 731
            PNHV+KLKKALYGLKQAPRAWY+RLSKFLL+  FKMGKID TLFIK K NDML+VQIYVD
Sbjct: 1120 PNHVFKLKKALYGLKQAPRAWYERLSKFLLKKSFKMGKIDTTLFIKTKENDMLLVQIYVD 1179

Query: 732  DIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKK 791
            DI FG+TN SLCE+FSKCMH                               KY +DLLK+
Sbjct: 1180 DITFGATNDSLCEDFSKCMHT------------------------------KYIKDLLKR 1239

Query: 792  FKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARF 851
            F + E KV KTPMS++ KLD DEKGK +D   YRGMIGSLLYLTASRPDIM+SVCLCARF
Sbjct: 1240 FNMGEAKVMKTPMSSSIKLDMDEKGKSIDSTMYRGMIGSLLYLTASRPDIMYSVCLCARF 1299

Query: 852  QSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQ 911
            QSCPKESH  AVKRIL+YL GT+ +GLWYP+   F L+G+SDADFAG  ++RKSTSGTC 
Sbjct: 1300 QSCPKESHLSAVKRILRYLKGTMSIGLWYPKGDNFELIGFSDADFAGCRVERKSTSGTCH 1349

Query: 912  FLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQ 940
             LG SLVSW SKKQNS+ALST EAEY A + C AQILWMKQ
Sbjct: 1360 SLGHSLVSWHSKKQNSIALSTAEAEYTAASLCYAQILWMKQ 1349

BLAST of CSPI07G08510 vs. NCBI nr
Match: KYP78729.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 1142.5 bits (2954), Expect = 0.0e+00
Identity = 581/940 (61.81%), Postives = 705/940 (75.00%), Query Frame = 0

Query: 17   CLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVI---------TEE 76
            CL+A K   WYLDSGCSRHMTGD SK  +   KN G VT+GDN KG I         + +
Sbjct: 537  CLRA-KNLLWYLDSGCSRHMTGDPSKFTNLKLKNEGYVTYGDNNKGKILGHGNVGNPSSQ 596

Query: 77   NLIMMLLKIFVKKMVFPIIFSLQELLNKMVWLKGKIVLC----KNLLDHGNRDENVYTLD 136
             LI  +L +   K     I  L +   K+ +     ++C    K +   G R +N+Y LD
Sbjct: 597  TLIENVLLVDGLKHNLLSISQLSDKGFKIEFDNTCCLICDKKSKEIRFIGKRIDNIYMLD 656

Query: 137  LNYYPIID--KCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDA 196
            L +   +   KCL    ++ WLWHRR  H  M  ++ + +  LV GLP  KF KDK+CDA
Sbjct: 657  LEHSITMSNTKCLITQEENIWLWHRRAAHIHMDHLNKLCRKELVVGLPKLKFGKDKLCDA 716

Query: 197  CQMGKQTKSSFKSKNVISTTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLM 256
            CQ GKQ K+SFKSKN IST+RPLQL+HMDLFGPSR  S GGNYY  VIVDD+SR+TWV+ 
Sbjct: 717  CQKGKQVKASFKSKNQISTSRPLQLIHMDLFGPSRTMSLGGNYYGLVIVDDYSRYTWVMF 776

Query: 257  IKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRT 316
            + +K+DA  +F  FAK VQNEK   I+ IRSDHGGEF N  F+ FCEE+G +HNFS+PRT
Sbjct: 777  LANKNDAFNAFRKFAKLVQNEKCSNITSIRSDHGGEFQNILFQKFCEEHGINHNFSAPRT 836

Query: 317  PQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELW 376
            PQQNGVVERKNR+L+E AR+MLNE  LPKYFW +A+NTAC+V N+VL+RP L KTPYE++
Sbjct: 837  PQQNGVVERKNRSLEELARTMLNETNLPKYFWADAINTACHVLNKVLIRPILKKTPYEIY 896

Query: 377  HGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIE 436
             GK PNI YF+VFGCKC++LNN KE+LGKFD+K D  IFLGYS+ SKAYR++NK+TLV+E
Sbjct: 897  KGKKPNISYFRVFGCKCYVLNNGKEQLGKFDAKADEAIFLGYSTNSKAYRIYNKRTLVVE 956

Query: 437  ESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSL 496
            ES+HVVFDES N         +DL +     L+ ++  E+    +    +EK +E    L
Sbjct: 957  ESVHVVFDES-NKQETRQTEIEDLNELLDQSLLENEPNEVPKESES---LEKAKETCEQL 1016

Query: 497  PKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAM 556
            PKEW+ +     D I+GN  +GV TRS++ N+ + +AFVSQ+EP++  +A  DE W++AM
Sbjct: 1017 PKEWKTSRDLSMDNIIGNIGKGVSTRSAIKNICNTMAFVSQVEPKNIDEALKDEHWLMAM 1076

Query: 557  QEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGID 616
            QEELNQFERN+VW LVP P +  IIGTKWVFRNK+DE+G I+RNKARLVA+GY QEEGID
Sbjct: 1077 QEELNQFERNEVWDLVPLPKDYPIIGTKWVFRNKLDESGIILRNKARLVAKGYNQEEGID 1136

Query: 617  YEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLP 676
            Y+ETFAPVAR+EAIR+LLA++S KNF LYQMDVKSAFLNG+I EEVYVEQPPGF  +  P
Sbjct: 1137 YDETFAPVARIEAIRLLLAYSSIKNFKLYQMDVKSAFLNGFIQEEVYVEQPPGFVDYKNP 1196

Query: 677  NHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDD 736
            NHVYKLKKALYGLKQAPR+WYDRLSKFL+END++ GK+DNTLF+K   ND + VQIYVDD
Sbjct: 1197 NHVYKLKKALYGLKQAPRSWYDRLSKFLIENDYERGKVDNTLFVKKFKNDTMYVQIYVDD 1256

Query: 737  IIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKF 796
            I+FGSTN+SLC+EF+K M  EFEMSMMGEL+FFLGLQIKQ+ DGIFISQ KY  +LLKKF
Sbjct: 1257 IVFGSTNTSLCKEFAKTMQGEFEMSMMGELTFFLGLQIKQMHDGIFISQSKYCNELLKKF 1316

Query: 797  KLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQ 856
             +   K A TP+S    LD DEKG  VD   YRG+IGSLLYLTASRPDIMF+VCLCARFQ
Sbjct: 1317 GMEGCKEAATPISNNCNLDLDEKGIAVDSSKYRGIIGSLLYLTASRPDIMFAVCLCARFQ 1376

Query: 857  SCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQF 916
            + PKESH  +VKRILKYL GT +VGLWYP+ V  +L+GYSD+D+AG  LDRKSTSGTC  
Sbjct: 1377 ANPKESHMKSVKRILKYLKGTTNVGLWYPKGVSLSLIGYSDSDYAGCRLDRKSTSGTCHL 1436

Query: 917  LGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQ 940
            LGS+LVSW SKKQ  VALST EAEYIA  SCCAQILWMKQ
Sbjct: 1437 LGSALVSWHSKKQACVALSTAEAEYIAAGSCCAQILWMKQ 1471

BLAST of CSPI07G08510 vs. NCBI nr
Match: KYP47407.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 1137.9 bits (2942), Expect = 0.0e+00
Identity = 586/943 (62.14%), Postives = 702/943 (74.44%), Query Frame = 0

Query: 18   LKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVI---------TEEN 77
            L  +K   WYLDSGCSRHMTGD SK  S   KN G VT+GDN KG I         + + 
Sbjct: 542  LLRAKNLLWYLDSGCSRHMTGDPSKFSSLKLKNEGYVTYGDNNKGKILGHGNVGNSSSQT 601

Query: 78   LIMMLLKIFVKKMVFPIIFSLQELLNKMVWLKGKIVLC----KNLLDHGNRDENVYTLDL 137
            LI  +L +   K     I  L +   K+ +     ++C    K +   G R +N+Y LDL
Sbjct: 602  LIENVLLVDGLKHNLLSISQLSDKGFKIEFDDTCCLICDKRSKEIRFIGKRIDNIYMLDL 661

Query: 138  NYYPIID--KCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDAC 197
             +   +   KCL    + +WLWHRR  H  M  ++ +S+  LV GLP  KF KDK+CDAC
Sbjct: 662  EHSISMSNTKCLITQEESTWLWHRRAAHIHMDHLNKLSRKELVVGLPKLKFGKDKLCDAC 721

Query: 198  QMGKQTKSSFKSKNVISTTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMI 257
            Q GKQ K+SFKSKN IST+RPLQL+HMDLFGPSR  S GGNYY  VIVDD+SR+TWV+ +
Sbjct: 722  QKGKQVKASFKSKNQISTSRPLQLIHMDLFGPSRTMSLGGNYYGLVIVDDYSRYTWVMFL 781

Query: 258  KHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTP 317
             +K+DA  +F  FAK VQNEK   I+ IRSDHGGEF N  F+ FCEE+G +HNFS+PRTP
Sbjct: 782  ANKNDAFNAFRKFAKLVQNEKCSNITSIRSDHGGEFQNIMFQKFCEEHGINHNFSAPRTP 841

Query: 318  QQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWH 377
            QQNGVVERKNR+L+E AR+MLNE  LPKYFW +A+NTAC+V NRVL+RP L KTPYE++ 
Sbjct: 842  QQNGVVERKNRSLEELARTMLNETNLPKYFWADAINTACHVLNRVLIRPILKKTPYEIYK 901

Query: 378  GKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEE 437
            GK PNI YF+VFGCKC++LNN KE+LGKFD+K D  IFLGYS+ SK+YR++NK+TLV+EE
Sbjct: 902  GKKPNISYFRVFGCKCYVLNNGKEQLGKFDAKADEAIFLGYSTNSKSYRIYNKRTLVVEE 961

Query: 438  SIHVVFDESWNNVSNESICSDDLEKDFGDLLV----NDKGKEIVPSMQDVNIIEKKEEGS 497
            S+HVVFDES N         +DL +     L+    N+K KE V         EK++   
Sbjct: 962  SVHVVFDES-NKQETRQTEIEDLNELLDQPLLESEPNEKSKESVSH-------EKEKVTC 1021

Query: 498  SSLPKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWI 557
              LPKEW+ +     D I+GN  +GV TRS++ N+ + +AFVSQ+EP++  +A  DE W+
Sbjct: 1022 EQLPKEWKTSRELSIDNIIGNIGKGVTTRSAIKNICNTMAFVSQVEPKNIDEALKDEHWL 1081

Query: 558  LAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEE 617
            +AMQEELNQFERN+VW LVP P +  IIGTKWVFRNK+DE+G I+RNKARLVA+GY QEE
Sbjct: 1082 MAMQEELNQFERNEVWDLVPLPKDYPIIGTKWVFRNKLDESGIILRNKARLVAKGYNQEE 1141

Query: 618  GIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESF 677
            GIDY+ETFAPVAR+EAIR+LLA++S KNF LYQMDVKSAFLNG I EEVYVEQPPGF  F
Sbjct: 1142 GIDYDETFAPVARIEAIRLLLAYSSIKNFKLYQMDVKSAFLNGLIQEEVYVEQPPGFVDF 1201

Query: 678  DLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIY 737
              PNHVYKLKKALYGLKQAPR+WYDRLSKFL+END++ GK+DNTLF+K   ND + VQIY
Sbjct: 1202 KNPNHVYKLKKALYGLKQAPRSWYDRLSKFLIENDYERGKVDNTLFVKRFKNDTMYVQIY 1261

Query: 738  VDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLL 797
            VDDI+FGSTNSSLC+EF+K M  EFEMSMMGEL+FFLGLQIKQ+ DG FISQ KY  +LL
Sbjct: 1262 VDDIVFGSTNSSLCKEFAKTMQGEFEMSMMGELTFFLGLQIKQMHDGTFISQSKYCNELL 1321

Query: 798  KKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCA 857
            KKF +   K A TP+S    LD DEKG  VD   YRG+IGSLLYLTASRPDIMF+VCLCA
Sbjct: 1322 KKFGMEGCKEAATPISNNCNLDLDEKGIAVDNSKYRGIIGSLLYLTASRPDIMFAVCLCA 1381

Query: 858  RFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGT 917
            RFQ+ PKESH  +VKRILKYL GT +VGLWYP+ V  +L+GYSD+DFAG  LDRKSTSGT
Sbjct: 1382 RFQANPKESHMKSVKRILKYLKGTTNVGLWYPKGVSLSLIGYSDSDFAGCRLDRKSTSGT 1441

Query: 918  CQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQ 940
            C  LGS+LVSW SKKQ  VALST EAEYIA  SCCAQILWMKQ
Sbjct: 1442 CHLLGSALVSWHSKKQACVALSTAEAEYIAAGSCCAQILWMKQ 1476

BLAST of CSPI07G08510 vs. NCBI nr
Match: KYP33441.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 1135.2 bits (2935), Expect = 0.0e+00
Identity = 586/942 (62.21%), Postives = 703/942 (74.63%), Query Frame = 0

Query: 17   CLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVI---------TEE 76
            CL+A K   WYLDSGCSRHMTGD SK  S   KN G VT+GDN KG I         + +
Sbjct: 537  CLRA-KNLLWYLDSGCSRHMTGDPSKFSSLKLKNEGYVTYGDNNKGKILGHGNVGNSSSQ 596

Query: 77   NLIMMLLKIFVKKMVFPIIFSLQELLNKMVWLKGKIVLC----KNLLDHGNRDENVYTLD 136
             LI  +L +   K     I  L +   K+ +     ++C    K +   G R +N+Y LD
Sbjct: 597  TLIENVLLVDGLKHNLLSISQLSDKGFKIEFDNTCCLICDKKSKEIRFIGKRIDNIYMLD 656

Query: 137  LNYYPIID--KCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDA 196
            L +   I   KCL    ++ WLWHRR  H  M  ++ +S+  LV GLP  KF KDK+CDA
Sbjct: 657  LEHSIAISNTKCLITQEENIWLWHRRAAHIHMDHLNKLSRKELVVGLPKLKFGKDKLCDA 716

Query: 197  CQMGKQTKSSFKSKNVISTTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLM 256
            CQ GKQ K+SFKSKN ISTTRPLQL+HMDLFGPSR  S GGNYY  VIVDD+SR+TWV+ 
Sbjct: 717  CQKGKQVKASFKSKNQISTTRPLQLIHMDLFGPSRTMSLGGNYYGLVIVDDYSRYTWVMF 776

Query: 257  IKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRT 316
            + +K+DA  +F  FAK VQNEK   I+ IRSDHGGEF N  F+ FCEE+G +HNFS+PRT
Sbjct: 777  LANKNDAFNAFRKFAKLVQNEKCSNITSIRSDHGGEFQNIMFQKFCEEHGINHNFSAPRT 836

Query: 317  PQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELW 376
            PQQNGVVERKNR+L+E AR+MLNE  LPKYFW +A+NTAC+V N+VL+RP L KTPYE++
Sbjct: 837  PQQNGVVERKNRSLEELARTMLNETKLPKYFWADAINTACHVLNKVLIRPILKKTPYEIY 896

Query: 377  HGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIE 436
            +G+ PNI YF+VFGCKCF+LNN KE+LGKFD+K D  IFLGYS+ SKAYR++NK+TLV+E
Sbjct: 897  NGRKPNISYFRVFGCKCFVLNNGKEQLGKFDAKADEAIFLGYSTNSKAYRIYNKRTLVVE 956

Query: 437  ESIHVVFDESWNNVSNES---ICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGS 496
            ES+HVVFDES    + ++     +D L++   +   N+K KE           E  +E S
Sbjct: 957  ESVHVVFDESNKQETRQTEIEDLTDLLDQPLLESETNNKPKESESH-------ENTKETS 1016

Query: 497  SSLPKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWI 556
              LPKEW+ +     D I+GN  +GV TRS++ N+ + +AFVSQ+EP+S  +A  DE W+
Sbjct: 1017 EQLPKEWKTSRDLSIDNIIGNIGKGVSTRSAIKNICNTMAFVSQVEPKSIDEALKDEHWL 1076

Query: 557  LAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEE 616
            +AMQEELNQFERN+VW LVP P++  IIGTKWVFRNK+DE+G IIRNKARLVA+GY QEE
Sbjct: 1077 MAMQEELNQFERNEVWDLVPLPTDYPIIGTKWVFRNKLDESGIIIRNKARLVAKGYNQEE 1136

Query: 617  GIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESF 676
            GIDY+ETFAPVAR+EAIR+LLA++S  NF LYQMDVKSAFLNG I EEVYVEQPPGF  F
Sbjct: 1137 GIDYDETFAPVARIEAIRLLLAYSSIMNFKLYQMDVKSAFLNGLIQEEVYVEQPPGFVDF 1196

Query: 677  DLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIY 736
              PNHVYKLKKALYGLKQAPR+WYDRLSKFL+END+  GK+DNTLF+K   ND + VQIY
Sbjct: 1197 KNPNHVYKLKKALYGLKQAPRSWYDRLSKFLIENDYVRGKVDNTLFVKKFKNDTMYVQIY 1256

Query: 737  VDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLL 796
            VDDI+FGSTN+SLC+EF+K M  EFEMSMMGEL+FFLGLQIKQ+ DGIFISQ KY  +LL
Sbjct: 1257 VDDIVFGSTNTSLCKEFAKTMQGEFEMSMMGELTFFLGLQIKQMSDGIFISQSKYCNELL 1316

Query: 797  KKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCA 856
            KKF +   K   TP+S    LD DEKG  VD   YRG+IGSLLYLTASRPDIMF VCLCA
Sbjct: 1317 KKFGMEGCKEVATPISNNCNLDLDEKGIVVDNSKYRGIIGSLLYLTASRPDIMFVVCLCA 1376

Query: 857  RFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGT 916
            RFQ+ PKESH  +VKRILKYL GT +VGLWYP+ V  +L+GYSD+D+AG  LDRKSTSGT
Sbjct: 1377 RFQANPKESHMKSVKRILKYLKGTTNVGLWYPKGVSLSLIGYSDSDYAGCRLDRKSTSGT 1436

Query: 917  CQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMK 939
            C  LGS+LVSW SKKQ  VALST EAEYIA  SCCAQILWMK
Sbjct: 1437 CHLLGSALVSWNSKKQACVALSTAEAEYIAAGSCCAQILWMK 1470

BLAST of CSPI07G08510 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 318.5 bits (815), Expect = 1.8e-86
Identity = 171/424 (40.33%), Postives = 252/424 (59.43%), Query Frame = 0

Query: 522 EPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNII 581
           EP ++ +A+    W  AM +E+   E    W++   P N   IG KWV++ K + +G I 
Sbjct: 85  EPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGTIE 144

Query: 582 RNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYI 641
           R KARLVA+GY Q+EGID+ ETF+PV +L +++++LA ++  NF L+Q+D+ +AFLNG +
Sbjct: 145 RYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFLNGDL 204

Query: 642 VEEVYVEQPPGFESFD----LPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKI 701
            EE+Y++ PPG+ +       PN V  LKK++YGLKQA R W+ + S  L+   F     
Sbjct: 205 DEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFVQSHS 264

Query: 702 DNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQI 761
           D+T F+K+     L V +YVDDII  S N +  +E    + + F++  +G L +FLGL+I
Sbjct: 265 DHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPLKYFLGLEI 324

Query: 762 KQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGS 821
            +   GI I Q KY  DLL +  L   K +  PM  +        G  VD K YR +IG 
Sbjct: 325 ARSAAGINICQRKYALDLLDETGLLGCKPSSVPMDPSVTFSAHSGGDFVDAKAYRRLIGR 384

Query: 822 LLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVG 881
           L+YL  +R DI F+V   ++F   P+ +H  AV +IL Y+ GT+  GL+Y    E  L  
Sbjct: 385 LMYLQITRLDISFAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVGQGLFYSSQAEMQLQV 444

Query: 882 YSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWM 941
           +SDA F      R+ST+G C FLG+SL+SW SKKQ  V+ S+ EAEY A++    +++W+
Sbjct: 445 FSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEAEYRALSFATDEMMWL 504

BLAST of CSPI07G08510 vs. TAIR 10
Match: ATMG00810.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 148.7 bits (374), Expect = 2.4e-35
Identity = 81/225 (36.00%), Postives = 132/225 (58.67%), Query Frame = 0

Query: 715 IYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRD 774
           +YVDDI+   ++++L       + + F M  +G + +FLG+QIK    G+F+SQ KY   
Sbjct: 5   LYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYAEQ 64

Query: 775 LLKKFKLNEGKVAKTPMSTTTKLDKDEK---GKCVDIKTYRGMIGSLLYLTASRPDIMFS 834
           +L     N G +   PMST   L  +      K  D   +R ++G+L YLT +RPDI ++
Sbjct: 65  ILN----NAGMLDCKPMSTPLPLKLNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDISYA 124

Query: 835 VCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRK 894
           V +  +    P  + F  +KR+L+Y+ GTI  GL+  +N + N+  + D+D+AG    R+
Sbjct: 125 VNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRR 184

Query: 895 STSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILW 937
           ST+G C FLG +++SW +K+Q +V+ S+TE EY A+A   A++ W
Sbjct: 185 STTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CSPI07G08510 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase) )

HSP 1 Score: 100.5 bits (249), Expect = 7.5e-21
Identity = 51/99 (51.52%), Postives = 65/99 (65.66%), Query Frame = 0

Query: 522 EPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNII 581
           EP+S   A  D  W  AMQEEL+   RNK W LVP P N +I+G KWVF+ K+  +G + 
Sbjct: 27  EPKSVIFALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLD 86

Query: 582 RNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFA 621
           R KARLVA+G+ QEEGI + ET++PV R   IR +L  A
Sbjct: 87  RLKARLVAKGFHQEEGIYFVETYSPVVRTATIRTILNVA 125

BLAST of CSPI07G08510 vs. TAIR 10
Match: ATMG00710.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 59.7 bits (143), Expect = 1.5e-08
Identity = 30/76 (39.47%), Postives = 41/76 (53.95%), Query Frame = 0

Query: 312 NRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYF 371
           NRT+ E  RSML E GLPK F  +A NTA ++ N+          P E+W   +P   Y 
Sbjct: 2   NRTIIEKVRSMLCECGLPKTFRADAANTAVHIINKYPSTAINFHVPDEVWFQSVPTYSYL 61

Query: 372 KVFGCKCFILNNKEKL 388
           + FGC  +I  ++ KL
Sbjct: 62  RRFGCVAYIHCDEGKL 77

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
P109781.8e-12334.58Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
Q94HW28.7e-12330.88Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
Q9ZT941.7e-11829.21Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
P041461.6e-11630.20Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
P925193.4e-3436.00Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana OX=3702 ... [more]
Match NameE-valueIdentityDescription
A0A438GI900.0e+0066.14Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
A5C8K00.0e+0061.85Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VITISV_001808 PE=4 SV=1[more]
A0A151UHG70.0e+0061.81Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=... [more]
A0A151RY830.0e+0062.14Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=... [more]
A0A151QSZ90.0e+0062.21Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=... [more]
Match NameE-valueIdentityDescription
RVW71911.10.0e+0066.14Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera][more]
CAN64335.10.0e+0061.85hypothetical protein VITISV_001808 [Vitis vinifera][more]
KYP78729.10.0e+0061.81Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan][more]
KYP47407.10.0e+0062.14Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan][more]
KYP33441.10.0e+0062.21Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan][more]
Match NameE-valueIdentityDescription
AT4G23160.11.8e-8640.33cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [more]
ATMG00810.12.4e-3536.00DNA/RNA polymerases superfamily protein [more]
ATMG00820.17.5e-2151.52Reverse transcriptase (RNA-dependent DNA polymerase) [more]
ATMG00710.11.5e-0839.47Polynucleotidyl transferase, ribonuclease H-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 194..373
e-value: 1.7E-45
score: 156.8
IPR001584Integrase, catalytic corePFAMPF00665rvecoord: 201..302
e-value: 9.0E-16
score: 58.0
IPR001584Integrase, catalytic corePROSITEPS50994INTEGRASEcoord: 199..365
score: 25.129093
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 549..791
e-value: 8.8E-78
score: 261.2
IPR025724GAG-pre-integrase domainPFAMPF13976gag_pre-integrscoord: 119..187
e-value: 1.1E-16
score: 60.4
NoneNo IPR availablePANTHERPTHR45895FAMILY NOT NAMEDcoord: 144..860
NoneNo IPR availableCDDcd09272RNase_HI_RT_Ty1coord: 876..941
e-value: 1.37872E-33
score: 124.118
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 201..374
IPR043502DNA/RNA polymerase superfamilySUPERFAMILY56672DNA/RNA polymerasescoord: 548..931

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI07G08510.1CSPI07G08510.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0016491 oxidoreductase activity