CSPI05G13760.1 (mRNA) Cucumber (PI 183967) v1

Overview
NameCSPI05G13760.1
TypemRNA
OrganismCucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
LocationChr5: 13697554 .. 13700612 (+)
Sequence length2715
RNA-Seq ExpressionCSPI05G13760.1
SyntenyCSPI05G13760.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAACATGATTTTCCAAGGAGTTCTCTGGCAGGCACTCACATATTGAAATTTTTTCCACAGATGAAGAACATGATATGTTACTGGTTGTTCATAAACATTGAAGAAACACATCTTTCAGTTACATGAAAGTCCTCACCACACTACAACACAATAGATAGGGTGAATGAGATAGTTTTGGAAATATAAATTCCACCTCCAATCCAATCCTTTAATTTGCTTAAAAAAAAATCCTTGTGTTAGAAATTCCCATAAAAATGCACAGAGGAAGTCCATAAAATATATAAAGCAAAGTAAAACCAACTCAAGTTGTGGCGTGTCCTGTCGTTCCCTCTCCTTTTGAAGTCCATTGTCGTTTAGGTCATCCATCTTTGTTTGTGTTGAAGAAACTTTATCCAGAATTTAGGTCTTTGTCCTCTTTAAATTGTGATTCGTGTCAATTTGCGAAATTTCATCGTCTTAGTTCGAGTCCTCGAGTCGATAAACGAGCAATTGCTCCATTTGAGTTAGTTCATTCTGATATTTGGGGTCCGTGTCCAGTTGTATCTCAAACAGGCTTTCGTTATTTTGTTACTTTTGTTGACGATCATTCTCGTCTAACTTGGTTATATTTAATGAAAAATCGTTCTGAGTTATTATCTCATTTTTGTGCCTTTCATACTGAAATAAAAAACCAATTTAATGTCTCTATCAAAACTTTGCGTACTGATAATGCGGGTGAATATTTTTCTCATTCTCTTGGCTCTTACTTGTGTGAAAATGGCATTATTCATCAATCTTCCTGTGCTGACACTCCATCTCAAAATGGTGTTGCAGAGCGGAAAAATAGGCATTTACTTGAAACTACCCTTGCTTTATCGTTTCAAATGCATGTTTCAAAAATCTTTTGGGTTGATGCTGTCTCTACAGCTTGTTTTTTTATTAATAGAATGCCTTCCTCTGTTCTTAATGGTGAGATTCCCTATCGTGTTCTTTTTCCTACCAAGCATTTGTTTCCTATTGCTCCTAAGATATTTGGTTGTGTCTGTTTTGTTCGTGACGTTCGTCCTCATCATACTAAGTTAGATCCCAAATCCTTGAAGTGTATCTTCTTGGGTTATTCACGTGTTCAAAAGGGTTATCGTTGTTATTGTCCTACCCTTAAAAGGTATCTGGTTTCGCCTGATGTTGTCTTTTTTGAAGATACACCCTTTACTTCATCACCATCGAGTTTGTGTCAGGGGGAGGATGACAATCTTTTTATATATGAGGTTACCTCTCCCACACCATCCTTGTCTACTGATGTGCCTCCTTCCCGCCCGTTGATTTCTCAAGTCTACTCCCGACGACCTCCACCACAACCTTCAGACTCATGTCCTCCATCAATGCTTCCTTCATCATGTGATCCAGCGCCAAGTGATGATCTTCCCATTGCTCTTCGCAAAGGTAAACGCAAGTGTACTTACCCCGTTTCTTCCTTTATTTCCTATCACCAGTTATCTCCCTCCACATATGCGTTTATTACGTCTCTTGAGTCCACATCTATTCCTAACTCTGTTCATGAAGCTTTGTCTCATCCTGGCTGGCAAAATGCAATGATTGAGGAGATGACTGCTTTAGATGATAATGGTACTTGGGATTTGGTATCTCGTCCTGCAGGAAAGAAGGCCATTGGTTGTAAATGGGTGTTTGCTGTCAAGATGAATCCTGATGGAACAGTGGCTCGTTTAAAGGCTCGCCTTGTTGCCAAAGGTTATGCTCAAATCTATGGCACTGATTATTCAGATACATTCTCTCCGGTTGCCAAGTTAACTTCCATTCGCCTATTTCTTTCCATGGCTGCTACCAATAAATGGTCGTTGCATCAACTTGACATTAAGAATGCTTTTCTTCACGGTGATCTTCAAGAGGAAGTTTATATGGAACAACCACCAGGGTTTGTTGCTCAGGGGGAGAGTGATAAAGTATGTCGCCTTCGAAAATCTCTGTATGGTTTGAAACAGAGTCCTCGTGCGTGGTTTGGTAAGTTTAGTCAAGCCCTTGTATGCTTTGGTATGAAGAAGAGTACATCTGATCATTCAGTTTTCTATCGCCGATCTGAGAAGGGTATAGTTCTACTAGTTGTATATGTTGATGATATTGTTATTACTGGAAATGATGCATTGGGTATTTCGTCTCTCAAAACTTTCCTTCAGGGTCAGTTTTATACAAAAGATTTGGGCCAATTGAAATATTTTTTGGGCATTGAAGTGATGAGAAGCAAGAAAGGTATTTATTTGTCTCAACGAAAATATGTACTTGATTTGTTGTCTGAAACAGGAAAATTAGGCGCCAAACCAAGTGGCACTCCAATGATGCCAAATCAGCAACTTGTTAAAGAAGGAGAATTATGTAAAGATCCTGAGAGATATAGGAGATTAGTTGGGAAGTTGAACTACTTAACAGTGACTCGACCAGACATTGCATATTCTGTAAGTGTTGTAAGTCAATTCATGTCTTCCCCTACAGTGGATCATTGGGCTGCAGTAGAGCAGATTTTGTGTTATCTAAAAGCTGCTCCTGGACGTGGGATCCTATACAAAGATCATGGACATACGAGAGTTGAATGTTTTTCTGATGCTGATTGGGCGGGGTCTCGTGAGGATAGGAGATCGACTTCTGGATATTGTGTTTTTGTAGGTGGAAACTTAGTTTCATGGAAGAGTAAGAAACAAAATGTTGTTTCTCGTTCGAGTGCTGAGTTAGAATATAGAGCTATGACACAAACTATGTGCAAAATAGTCTGAATTCACCAACTATTATCTGAGATAGGCTTCAGTATTACCGTGCCAGCTAAATTATGGTGTGATAATCAAGCTGCACTTCATATTGCATCTAATCCAGTATTTCATGAACGAACTAAACATATTGAGGTGGATTGTCACTTCATTCGTGAGAAAATCCAAGATGGGTTGGTGTCCACAGGATATGTGAAGACCGGAGAACAATTGGGAGATATTCTAACTAAATCTTTAAATGGAACAAGGATAAGCTATCTGTGCAACAAGCTGGGCATGATCGACATATTTGCTCCAGCTTGA

mRNA sequence

ATGAAACATGATTTTCCAAGGAGTTCTCTGGCAGGCACTCACATATTGAAATTTTTTCCACAGATGAAGAACATGATATGTTACTGGTTGTTCATAAACATTGAAGAAACACATCTTTCAAAACTTTATCCAGAATTTAGGTCTTTGTCCTCTTTAAATTGTGATTCGTGTCAATTTGCGAAATTTCATCGTCTTAGTTCGAGTCCTCGAGTCGATAAACGAGCAATTGCTCCATTTGAGTTAGTTCATTCTGATATTTGGGGTCCGTGTCCAGTTGTATCTCAAACAGGCTTTCGTTATTTTGTTACTTTTGTTGACGATCATTCTCGTCTAACTTGGTTATATTTAATGAAAAATCGTTCTGAGTTATTATCTCATTTTTGTGCCTTTCATACTGAAATAAAAAACCAATTTAATGTCTCTATCAAAACTTTGCGTACTGATAATGCGGGTGAATATTTTTCTCATTCTCTTGGCTCTTACTTGTGTGAAAATGGCATTATTCATCAATCTTCCTGTGCTGACACTCCATCTCAAAATGGTGTTGCAGAGCGGAAAAATAGGCATTTACTTGAAACTACCCTTGCTTTATCGTTTCAAATGCATGTTTCAAAAATCTTTTGGGTTGATGCTGTCTCTACAGCTTGTTTTTTTATTAATAGAATGCCTTCCTCTGTTCTTAATGGTGAGATTCCCTATCGTGTTCTTTTTCCTACCAAGCATTTGTTTCCTATTGCTCCTAAGATATTTGGTTGTGTCTGTTTTGTTCGTGACGTTCGTCCTCATCATACTAAGTTAGATCCCAAATCCTTGAAGTGTATCTTCTTGGGTTATTCACGTGTTCAAAAGGGTTATCGTTGTTATTGTCCTACCCTTAAAAGGTATCTGGTTTCGCCTGATGTTGTCTTTTTTGAAGATACACCCTTTACTTCATCACCATCGAGTTTGTGTCAGGGGGAGGATGACAATCTTTTTATATATGAGGTTACCTCTCCCACACCATCCTTGTCTACTGATGTGCCTCCTTCCCGCCCGTTGATTTCTCAAGTCTACTCCCGACGACCTCCACCACAACCTTCAGACTCATGTCCTCCATCAATGCTTCCTTCATCATGTGATCCAGCGCCAAGTGATGATCTTCCCATTGCTCTTCGCAAAGGTAAACGCAAGTGTACTTACCCCGTTTCTTCCTTTATTTCCTATCACCAGTTATCTCCCTCCACATATGCGTTTATTACGTCTCTTGAGTCCACATCTATTCCTAACTCTGTTCATGAAGCTTTGTCTCATCCTGGCTGGCAAAATGCAATGATTGAGGAGATGACTGCTTTAGATGATAATGGTACTTGGGATTTGGTATCTCGTCCTGCAGGAAAGAAGGCCATTGGTTGTAAATGGGTGTTTGCTGTCAAGATGAATCCTGATGGAACAGTGGCTCGTTTAAAGGCTCGCCTTGTTGCCAAAGGTTATGCTCAAATCTATGGCACTGATTATTCAGATACATTCTCTCCGGTTGCCAAGTTAACTTCCATTCGCCTATTTCTTTCCATGGCTGCTACCAATAAATGGTCGTTGCATCAACTTGACATTAAGAATGCTTTTCTTCACGGTGATCTTCAAGAGGAAGTTTATATGGAACAACCACCAGGGTTTGTTGCTCAGGGGGAGAGTGATAAAGTATGTCGCCTTCGAAAATCTCTGTATGGTTTGAAACAGAGTCCTCGTGCGTGGTTTGGTAAGTTTAGTCAAGCCCTTGTATGCTTTGGTATGAAGAAGAGTACATCTGATCATTCAGTTTTCTATCGCCGATCTGAGAAGGGTATAGTTCTACTAGTTGTATATGTTGATGATATTGTTATTACTGGAAATGATGCATTGGGTATTTCGTCTCTCAAAACTTTCCTTCAGGGTCAGTTTTATACAAAAGATTTGGGCCAATTGAAATATTTTTTGGGCATTGAAGTGATGAGAAGCAAGAAAGGTATTTATTTGTCTCAACGAAAATATGTACTTGATTTGTTGTCTGAAACAGGAAAATTAGGCGCCAAACCAAGTGGCACTCCAATGATGCCAAATCAGCAACTTGTTAAAGAAGGAGAATTATGTAAAGATCCTGAGAGATATAGGAGATTAGTTGGGAAGTTGAACTACTTAACAGTGACTCGACCAGACATTGCATATTCTGTAAGTGTTGTAAGTCAATTCATGTCTTCCCCTACAGTGGATCATTGGGCTGCAGTAGAGCAGATTTTGTGTTATCTAAAAGCTGCTCCTGGACGTGGGATCCTATACAAAGATCATGGACATACGAGAGTTGAATGTTTTTCTGATGCTGATTGGGCGGGGTCTCGTGAGGATAGGAGATCGACTTCTGGATATTGTGTTTTTGTAGGTGGAAACTTAGTTTCATGGAAGAGTAAGAAACAAAATGTTGTTTCTCGTTCGAGTGCTGATATTACCGTGCCAGCTAAATTATGGTGTGATAATCAAGCTGCACTTCATATTGCATCTAATCCAGTATTTCATGAACGAACTAAACATATTGAGGTGGATTGTCACTTCATTCGTGAGAAAATCCAAGATGGGTTGGTGTCCACAGGATATGTGAAGACCGGAGAACAATTGGGAGATATTCTAACTAAATCTTTAAATGGAACAAGGATAAGCTATCTGTGCAACAAGCTGGGCATGATCGACATATTTGCTCCAGCTTGA

Coding sequence (CDS)

ATGAAACATGATTTTCCAAGGAGTTCTCTGGCAGGCACTCACATATTGAAATTTTTTCCACAGATGAAGAACATGATATGTTACTGGTTGTTCATAAACATTGAAGAAACACATCTTTCAAAACTTTATCCAGAATTTAGGTCTTTGTCCTCTTTAAATTGTGATTCGTGTCAATTTGCGAAATTTCATCGTCTTAGTTCGAGTCCTCGAGTCGATAAACGAGCAATTGCTCCATTTGAGTTAGTTCATTCTGATATTTGGGGTCCGTGTCCAGTTGTATCTCAAACAGGCTTTCGTTATTTTGTTACTTTTGTTGACGATCATTCTCGTCTAACTTGGTTATATTTAATGAAAAATCGTTCTGAGTTATTATCTCATTTTTGTGCCTTTCATACTGAAATAAAAAACCAATTTAATGTCTCTATCAAAACTTTGCGTACTGATAATGCGGGTGAATATTTTTCTCATTCTCTTGGCTCTTACTTGTGTGAAAATGGCATTATTCATCAATCTTCCTGTGCTGACACTCCATCTCAAAATGGTGTTGCAGAGCGGAAAAATAGGCATTTACTTGAAACTACCCTTGCTTTATCGTTTCAAATGCATGTTTCAAAAATCTTTTGGGTTGATGCTGTCTCTACAGCTTGTTTTTTTATTAATAGAATGCCTTCCTCTGTTCTTAATGGTGAGATTCCCTATCGTGTTCTTTTTCCTACCAAGCATTTGTTTCCTATTGCTCCTAAGATATTTGGTTGTGTCTGTTTTGTTCGTGACGTTCGTCCTCATCATACTAAGTTAGATCCCAAATCCTTGAAGTGTATCTTCTTGGGTTATTCACGTGTTCAAAAGGGTTATCGTTGTTATTGTCCTACCCTTAAAAGGTATCTGGTTTCGCCTGATGTTGTCTTTTTTGAAGATACACCCTTTACTTCATCACCATCGAGTTTGTGTCAGGGGGAGGATGACAATCTTTTTATATATGAGGTTACCTCTCCCACACCATCCTTGTCTACTGATGTGCCTCCTTCCCGCCCGTTGATTTCTCAAGTCTACTCCCGACGACCTCCACCACAACCTTCAGACTCATGTCCTCCATCAATGCTTCCTTCATCATGTGATCCAGCGCCAAGTGATGATCTTCCCATTGCTCTTCGCAAAGGTAAACGCAAGTGTACTTACCCCGTTTCTTCCTTTATTTCCTATCACCAGTTATCTCCCTCCACATATGCGTTTATTACGTCTCTTGAGTCCACATCTATTCCTAACTCTGTTCATGAAGCTTTGTCTCATCCTGGCTGGCAAAATGCAATGATTGAGGAGATGACTGCTTTAGATGATAATGGTACTTGGGATTTGGTATCTCGTCCTGCAGGAAAGAAGGCCATTGGTTGTAAATGGGTGTTTGCTGTCAAGATGAATCCTGATGGAACAGTGGCTCGTTTAAAGGCTCGCCTTGTTGCCAAAGGTTATGCTCAAATCTATGGCACTGATTATTCAGATACATTCTCTCCGGTTGCCAAGTTAACTTCCATTCGCCTATTTCTTTCCATGGCTGCTACCAATAAATGGTCGTTGCATCAACTTGACATTAAGAATGCTTTTCTTCACGGTGATCTTCAAGAGGAAGTTTATATGGAACAACCACCAGGGTTTGTTGCTCAGGGGGAGAGTGATAAAGTATGTCGCCTTCGAAAATCTCTGTATGGTTTGAAACAGAGTCCTCGTGCGTGGTTTGGTAAGTTTAGTCAAGCCCTTGTATGCTTTGGTATGAAGAAGAGTACATCTGATCATTCAGTTTTCTATCGCCGATCTGAGAAGGGTATAGTTCTACTAGTTGTATATGTTGATGATATTGTTATTACTGGAAATGATGCATTGGGTATTTCGTCTCTCAAAACTTTCCTTCAGGGTCAGTTTTATACAAAAGATTTGGGCCAATTGAAATATTTTTTGGGCATTGAAGTGATGAGAAGCAAGAAAGGTATTTATTTGTCTCAACGAAAATATGTACTTGATTTGTTGTCTGAAACAGGAAAATTAGGCGCCAAACCAAGTGGCACTCCAATGATGCCAAATCAGCAACTTGTTAAAGAAGGAGAATTATGTAAAGATCCTGAGAGATATAGGAGATTAGTTGGGAAGTTGAACTACTTAACAGTGACTCGACCAGACATTGCATATTCTGTAAGTGTTGTAAGTCAATTCATGTCTTCCCCTACAGTGGATCATTGGGCTGCAGTAGAGCAGATTTTGTGTTATCTAAAAGCTGCTCCTGGACGTGGGATCCTATACAAAGATCATGGACATACGAGAGTTGAATGTTTTTCTGATGCTGATTGGGCGGGGTCTCGTGAGGATAGGAGATCGACTTCTGGATATTGTGTTTTTGTAGGTGGAAACTTAGTTTCATGGAAGAGTAAGAAACAAAATGTTGTTTCTCGTTCGAGTGCTGATATTACCGTGCCAGCTAAATTATGGTGTGATAATCAAGCTGCACTTCATATTGCATCTAATCCAGTATTTCATGAACGAACTAAACATATTGAGGTGGATTGTCACTTCATTCGTGAGAAAATCCAAGATGGGTTGGTGTCCACAGGATATGTGAAGACCGGAGAACAATTGGGAGATATTCTAACTAAATCTTTAAATGGAACAAGGATAAGCTATCTGTGCAACAAGCTGGGCATGATCGACATATTTGCTCCAGCTTGA

Protein sequence

MKHDFPRSSLAGTHILKFFPQMKNMICYWLFINIEETHLSKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETTLALSFQMHVSKIFWVDAVSTACFFINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSADITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKSLNGTRISYLCNKLGMIDIFAPA*
Homology
BLAST of CSPI05G13760.1 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 588.2 bits (1515), Expect = 1.6e-166
Identity = 359/987 (36.37%), Postives = 506/987 (51.27%), Query Frame = 0

Query: 34   IEETHLSKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVV 93
            I    LS L P  + LS   C  C   K +++  S +    +  P E ++SD+W   P++
Sbjct: 483  ISNYSLSVLNPSHKFLS---CSDCLINKSNKVPFS-QSTINSTRPLEYIYSDVWS-SPIL 542

Query: 94   SQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEY 153
            S   +RY+V FVD  +R TWLY +K +S++   F  F   ++N+F   I T  +DN GE+
Sbjct: 543  SHDNYRYYVIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENRFQTRIGTFYSDNGGEF 602

Query: 154  FSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETTLALSFQMHVSKIFWVDAVS 213
               +L  Y  ++GI H +S   TP  NG++ERK+RH++ET L L     + K +W  A +
Sbjct: 603  V--ALWEYFSQHGISHLTSPPHTPEHNGLSERKHRHIVETGLTLLSHASIPKTYWPYAFA 662

Query: 214  TACFFINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHT-KLDPKSLK 273
             A + INR+P+ +L  E P++ LF T   +    ++FGC C+   +RP++  KLD KS +
Sbjct: 663  VAVYLINRLPTPLLQLESPFQKLFGTSPNYD-KLRVFGCACYPW-LRPYNQHKLDDKSRQ 722

Query: 274  CIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDT-PFTSSPSSLCQGEDDNLFIYEVTS 333
            C+FLGYS  Q  Y C      R  +S  V F E+  PF++  ++L   ++       V S
Sbjct: 723  CVFLGYSLTQSAYLCLHLQTSRLYISRHVRFDENCFPFSNYLATLSPVQEQRRESSCVWS 782

Query: 334  PTPSLSTDVP--PSRPLISQVYSRRPPPQPS-------------DSCPPSMLPSSCDPA- 393
            P  +L T  P  P+       ++  PP  PS             DS   S  PSS +P  
Sbjct: 783  PHTTLPTRTPVLPAPSCSDPHHAATPPSSPSAPFRNSQVSSSNLDSSFSSSFPSSPEPTA 842

Query: 394  -----------------------------PSDDLPIALRK------------------GK 453
                                         P+++ P  L +                    
Sbjct: 843  PRQNGPQPTTQPTQTQTQTHSSQNTSQNNPTNESPSQLAQSLSTPAQSSSSSPSPTTSAS 902

Query: 454  RKCTYPVSSFISYHQLSP------------------------------STYAFITSLEST 513
               T P    I  H   P                                Y+   SL + 
Sbjct: 903  SSSTSPTPPSILIHPPPPLAQIVNNNNQAPLNTHSMGTRAKAGIIKPNPKYSLAVSLAAE 962

Query: 514  SIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAI-GCKWVFAVKMNPDGT 573
            S P +  +AL    W+NAM  E+ A   N TWDLV  P     I GC+W+F  K N DG+
Sbjct: 963  SEPRTAIQALKDERWRNAMGSEINAQIGNHTWDLVPPPPSHVTIVGCRWIFTKKYNSDGS 1022

Query: 574  VARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHG 633
            + R KARLVAKGY Q  G DY++TFSPV K TSIR+ L +A    W + QLD+ NAFL G
Sbjct: 1023 LNRYKARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQG 1082

Query: 634  DLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDH 693
             L ++VYM QPPGF+ +   + VC+LRK+LYGLKQ+PRAW+ +    L+  G   S SD 
Sbjct: 1083 TLTDDVYMSQPPGFIDKDRPNYVCKLRKALYGLKQAPRAWYVELRNYLLTIGFVNSVSDT 1142

Query: 694  SVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMR 753
            S+F  +  K IV ++VYVDDI+ITGND   + +    L  +F  KD  +L YFLGIE  R
Sbjct: 1143 SLFVLQRGKSIVYMLVYVDDILITGNDPTLLHNTLDNLSQRFSVKDHEELHYFLGIEAKR 1202

Query: 754  SKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQL-VKEGELCKDPERYRRLVGKLN 813
               G++LSQR+Y+LDLL+ T  + AKP  TPM P+ +L +  G    DP  YR +VG L 
Sbjct: 1203 VPTGLHLSQRRYILDLLARTNMITAKPVTTPMAPSPKLSLYSGTKLTDPTEYRGIVGSLQ 1262

Query: 814  YLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFS 873
            YL  TRPDI+Y+V+ +SQFM  PT +H  A+++IL YL   P  GI  K      +  +S
Sbjct: 1263 YLAFTRPDISYAVNRLSQFMHMPTEEHLQALKRILRYLAGTPNHGIFLKKGNTLSLHAYS 1322

Query: 874  DADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAD------------------ 898
            DADWAG ++D  ST+GY V++G + +SW SKKQ  V RSS +                  
Sbjct: 1323 DADWAGDKDDYVSTNGYIVYLGHHPISWSSKKQKGVVRSSTEAEYRSVANTSSEMQWICS 1382

BLAST of CSPI05G13760.1 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 578.9 bits (1491), Expect = 9.7e-164
Identity = 356/978 (36.40%), Postives = 492/978 (50.31%), Query Frame = 0

Query: 52   LNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRL 111
            L+C  C   K H++  S      +  P E ++SD+W   P++S   +RY+V FVD  +R 
Sbjct: 477  LSCSDCFINKSHKVPFSNSTITSS-KPLEYIYSDVWS-SPILSIDNYRYYVIFVDHFTRY 536

Query: 112  TWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQS 171
            TWLY +K +S++   F  F + ++N+F   I TL +DN GE+    L  YL ++GI H +
Sbjct: 537  TWLYPLKQKSQVKDTFIIFKSLVENRFQTRIGTLYSDNGGEFV--VLRDYLSQHGISHFT 596

Query: 172  SCADTPSQNGVAERKNRHLLETTLALSFQMHVSKIFWVDAVSTACFFINRMPSSVLNGEI 231
            S   TP  NG++ERK+RH++E  L L     V K +W  A S A + INR+P+ +L  + 
Sbjct: 597  SPPHTPEHNGLSERKHRHIVEMGLTLLSHASVPKTYWPYAFSVAVYLINRLPTPLLQLQS 656

Query: 232  PYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHT-KLDPKSLKCIFLGYSRVQKGYRCYCP 291
            P++ LF     +    K+FGC C+   +RP++  KL+ KS +C F+GYS  Q  Y C   
Sbjct: 657  PFQKLFGQPPNYE-KLKVFGCACYPW-LRPYNRHKLEDKSKQCAFMGYSLTQSAYLCLHI 716

Query: 292  TLKRYLVSPDVVF------FEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPS- 351
               R   S  V F      F  T F  S S   + +    +    T PT  L    PP  
Sbjct: 717  PTGRLYTSRHVQFDERCFPFSTTNFGVSTSQEQRSDSAPNWPSHTTLPTTPLVLPAPPCL 776

Query: 352  RPLISQVYSRRPPPQPSDSC----PPSMLPSSCDPAPSDDLPIA---------------- 411
             P +    S RPP  PS  C      S LPSS   +PS   P A                
Sbjct: 777  GPHLDT--SPRPPSSPSPLCTTQVSSSNLPSSSISSPSSSEPTAPSHNGPQPTAQPHQTQ 836

Query: 412  ------------------LRKGKRKCTYPVSSFISYHQLSPST----------------- 471
                                   +    P S   S H  +PST                 
Sbjct: 837  NSNSNSPILNNPNPNSPSPNSPNQNSPLPQSPISSPHIPTPSTSISEPNSPSSSSTSTPP 896

Query: 472  --------------------------------------YAFITSLESTSIPNSVHEALSH 531
                                                  Y++ TSL + S P +  +A+  
Sbjct: 897  LPPVLPAPPIIQVNAQAPVNTHSMATRAKDGIRKPNQKYSYATSLAANSEPRTAIQAMKD 956

Query: 532  PGWQNAMIEEMTALDDNGTWDLVSRPAGKKAI-GCKWVFAVKMNPDGTVARLKARLVAKG 591
              W+ AM  E+ A   N TWDLV  P     I GC+W+F  K N DG++ R KARLVAKG
Sbjct: 957  DRWRQAMGSEINAQIGNHTWDLVPPPPPSVTIVGCRWIFTKKFNSDGSLNRYKARLVAKG 1016

Query: 592  YAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPP 651
            Y Q  G DY++TFSPV K TSIR+ L +A    W + QLD+ NAFL G L +EVYM QPP
Sbjct: 1017 YNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLTDEVYMSQPP 1076

Query: 652  GFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIV 711
            GFV +   D VCRLRK++YGLKQ+PRAW+ +    L+  G   S SD S+F  +  + I+
Sbjct: 1077 GFVDKDRPDYVCRLRKAIYGLKQAPRAWYVELRTYLLTVGFVNSISDTSLFVLQRGRSII 1136

Query: 712  LLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKY 771
             ++VYVDDI+ITGND + +      L  +F  K+   L YFLGIE  R  +G++LSQR+Y
Sbjct: 1137 YMLVYVDDILITGNDTVLLKHTLDALSQRFSVKEHEDLHYFLGIEAKRVPQGLHLSQRRY 1196

Query: 772  VLDLLSETGKLGAKPSGTPMMPNQQL-VKEGELCKDPERYRRLVGKLNYLTVTRPDIAYS 831
             LDLL+ T  L AKP  TPM  + +L +  G    DP  YR +VG L YL  TRPD++Y+
Sbjct: 1197 TLDLLARTNMLTAKPVATPMATSPKLTLHSGTKLPDPTEYRGIVGSLQYLAFTRPDLSYA 1256

Query: 832  VSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRR 891
            V+ +SQ+M  PT DHW A++++L YL   P  GI  K      +  +SDADWAG  +D  
Sbjct: 1257 VNRLSQYMHMPTDDHWNALKRVLRYLAGTPDHGIFLKKGNTLSLHAYSDADWAGDTDDYV 1316

Query: 892  STSGYCVFVGGNLVSWKSKKQNVVSRSSAD--------------------------ITVP 901
            ST+GY V++G + +SW SKKQ  V RSS +                          ++ P
Sbjct: 1317 STNGYIVYLGHHPISWSSKKQKGVVRSSTEAEYRSVANTSSELQWICSLLTELGIQLSHP 1376

BLAST of CSPI05G13760.1 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 527.3 bits (1357), Expect = 3.4e-148
Identity = 322/893 (36.06%), Postives = 470/893 (52.63%), Query Frame = 0

Query: 54   CDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTW 113
            CD C F K HR+S      +R +   +LV+SD+ GP  + S  G +YFVTF+DD SR  W
Sbjct: 457  CDYCLFGKQHRVSFQTS-SERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLW 516

Query: 114  LYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSC 173
            +Y++K + ++   F  FH  ++ +    +K LR+DN GEY S     Y   +GI H+ + 
Sbjct: 517  VYILKTKDQVFQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTV 576

Query: 174  ADTPSQNGVAERKNRHLLETTLALSFQMHVSKIFWVDAVSTACFFINRMPSSVLNGEIPY 233
              TP  NGVAER NR ++E   ++     + K FW +AV TAC+ INR PS  L  EIP 
Sbjct: 577  PGTPQHNGVAERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPE 636

Query: 234  RVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLK 293
            RV +  K +     K+FGC  F    +   TKLD KS+ CIF+GY   + GYR + P  K
Sbjct: 637  RV-WTNKEVSYSHLKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKK 696

Query: 294  RYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSR 353
            + + S DVVF E    T++  S  +   + +    VT P+ S +                
Sbjct: 697  KVIRSRDVVFRESEVRTAADMS--EKVKNGIIPNFVTIPSTSNN---------------- 756

Query: 354  RPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFIT 413
               P  ++S    +      P    +    L +G  +  +P      +  L  S    + 
Sbjct: 757  ---PTSAESTTDEVSEQGEQPGEVIEQGEQLDEGVEEVEHPTQGEEQHQPLRRSERPRVE 816

Query: 414  SLESTSI----------PNSVHEALSHP---GWQNAMIEEMTALDDNGTWDLVSRPAGKK 473
            S    S           P S+ E LSHP       AM EEM +L  NGT+ LV  P GK+
Sbjct: 817  SRRYPSTEYVLISDDREPESLKEVLSHPEKNQLMKAMQEEMESLQKNGTYKLVELPKGKR 876

Query: 474  AIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAAT 533
             + CKWVF +K + D  + R KARLV KG+ Q  G D+ + FSPV K+TSIR  LS+AA+
Sbjct: 877  PLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDFDEIFSPVVKMTSIRTILSLAAS 936

Query: 534  NKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGK 593
                + QLD+K AFLHGDL+EE+YMEQP GF   G+   VC+L KSLYGLKQ+PR W+ K
Sbjct: 937  LDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCKLNKSLYGLKQAPRQWYMK 996

Query: 594  FSQALVCFGMKKSTSDHSVFYRR-SEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQF 653
            F   +      K+ SD  V+++R SE   ++L++YVDD++I G D   I+ LK  L   F
Sbjct: 997  FDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDDMLIVGKDKGLIAKLKGDLSKSF 1056

Query: 654  YTKDLGQLKYFLGIEVMRSK--KGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVK 713
              KDLG  +  LG++++R +  + ++LSQ KY+  +L       AKP  TP+  + +L K
Sbjct: 1057 DMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLERFNMKNAKPVSTPLAGHLKLSK 1116

Query: 714  ---------EGELCKDPERYRRLVGKLNYLTV-TRPDIAYSVSVVSQFMSSPTVDHWAAV 773
                     +G + K P  Y   VG L Y  V TRPDIA++V VVS+F+ +P  +HW AV
Sbjct: 1117 KMCPTTVEEKGNMAKVP--YSSAVGSLMYAMVCTRPDIAHAVGVVSRFLENPGKEHWEAV 1176

Query: 774  EQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSK 833
            + IL YL+   G  + +       ++ ++DAD AG  ++R+S++GY     G  +SW+SK
Sbjct: 1177 KWILRYLRGTTGDCLCF-GGSDPILKGYTDADMAGDIDNRKSSTGYLFTFSGGAISWQSK 1236

Query: 834  KQNVVSRSSADITVPAK-------------------------LWCDNQAALHIASNPVFH 893
             Q  V+ S+ +    A                          ++CD+Q+A+ ++ N ++H
Sbjct: 1237 LQKCVALSTTEAEYIAATETGKEMIWLKRFLQELGLHQKEYVVYCDSQSAIDLSKNSMYH 1296

Query: 894  ERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKSLNGTRISYLCNKL 896
             RTKHI+V  H+IRE + D  +    + T E   D+LTK +   +   LC +L
Sbjct: 1297 ARTKHIDVRYHWIREMVDDESLKVLKISTNENPADMLTKVVPRNKFE-LCKEL 1322

BLAST of CSPI05G13760.1 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 414.5 bits (1064), Expect = 3.2e-114
Identity = 289/963 (30.01%), Postives = 458/963 (47.56%), Query Frame = 0

Query: 49   LSSLNCDSCQFAKFHRLSSSPRVDKRAI-APFELVHSDIWGPCPVVSQTGFRYFVTFVDD 108
            LS   C+ C   K  RL      DK  I  P  +VHSD+ GP   V+     YFV FVD 
Sbjct: 450  LSCEICEPCLNGKQARLPFKQLKDKTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQ 509

Query: 109  HSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGI 168
             +     YL+K +S++ S F  F  + +  FN+ +  L  DN  EY S+ +  +  + GI
Sbjct: 510  FTHYCVTYLIKYKSDVFSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGI 569

Query: 169  IHQSSCADTPSQNGVAERKNRHLLETTLALSFQMHVSKIFWVDAVSTACFFINRMPSSVL 228
             +  +   TP  NGV+ER  R + E    +     + K FW +AV TA + INR+PS  L
Sbjct: 570  SYHLTVPHTPQLNGVSERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRAL 629

Query: 229  --NGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGY 288
              + + PY  ++  K  +    ++FG   +V  ++    K D KS K IF+GY     G+
Sbjct: 630  VDSSKTPYE-MWHNKKPYLKHLRVFGATVYVH-IKNKQGKFDDKSFKSIFVGYE--PNGF 689

Query: 289  RCYCPTLKRYLVSPDV-----------------VFFEDTP------FTSSPSSLCQGE-- 348
            + +    ++++V+ DV                 VF +D+       F +    + Q E  
Sbjct: 690  KLWDAVNEKFIVARDVVVDETNMVNSRAVKFETVFLKDSKESENKNFPNDSRKIIQTEFP 749

Query: 349  ------------------------DDNLFIYEVTSPTPS--------LSTDVPPSRPLIS 408
                                    +D+  I +   P  S        L      ++  ++
Sbjct: 750  NESKECDNIQFLKDSKESENKNFPNDSRKIIQTEFPNESKECDNIQFLKDSKESNKYFLN 809

Query: 409  QVYSRRPPPQPSDS----CPPSMLPSSC----------DPAPSDDLPIALRKGKRKCTYP 468
            +   R+     ++S     P     S            +P  +D + I  R+ +R  T P
Sbjct: 810  ESKKRKRDDHLNESKGSGNPNESRESETAEHLKEIGIDNPTKNDGIEIINRRSERLKTKP 869

Query: 469  VSSFISYHQLSPSTYAFITSLES--TSIPNSVHEAL---SHPGWQNAMIEEMTALDDNGT 528
                ISY++   S    + +  +    +PNS  E         W+ A+  E+ A   N T
Sbjct: 870  Q---ISYNEEDNSLNKVVLNAHTIFNDVPNSFDEIQYRDDKSSWEEAINTELNAHKINNT 929

Query: 529  WDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLT 588
            W +  RP  K  +  +WVF+VK N  G   R KARLVA+G+ Q Y  DY +TF+PVA+++
Sbjct: 930  WTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKYQIDYEETFAPVARIS 989

Query: 589  SIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYG 648
            S R  LS+       +HQ+D+K AFL+G L+EE+YM  P G      SD VC+L K++YG
Sbjct: 990  SFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGISC--NSDNVCKLNKAIYG 1049

Query: 649  LKQSPRAWFGKFSQALVCFGMKKSTSDHSVFY--RRSEKGIVLLVVYVDDIVITGNDALG 708
            LKQ+ R WF  F QAL       S+ D  ++   + +    + +++YVDD+VI   D   
Sbjct: 1050 LKQAARCWFEVFEQALKECEFVNSSVDRCIYILDKGNINENIYVLLYVDDVVIATGDMTR 1109

Query: 709  ISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGT 768
            +++ K +L  +F   DL ++K+F+GI +   +  IYLSQ  YV  +LS+          T
Sbjct: 1110 MNNFKRYLMEKFRMTDLNEIKHFIGIRIEMQEDKIYLSQSAYVKKILSKFNMENCNAVST 1169

Query: 769  PMMP--NQQLVKEGELCKDPERYRRLVGKLNYLTV-TRPDIAYSVSVVSQFMSSPTVDHW 828
            P+    N +L+   E C  P   R L+G L Y+ + TRPD+  +V+++S++ S    + W
Sbjct: 1170 PLPSKINYELLNSDEDCNTP--CRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNNSELW 1229

Query: 829  AAVEQILCYLKAAPGRGILYKDH--GHTRVECFSDADWAGSREDRRSTSGYCV-FVGGNL 888
              ++++L YLK      +++K +     ++  + D+DWAGS  DR+ST+GY       NL
Sbjct: 1230 QNLKRVLRYLKGTIDMKLIFKKNLAFENKIIGYVDSDWAGSEIDRKSTTGYLFKMFDFNL 1289

Query: 889  VSWKSKKQNVVSRSSAD--------------------------ITVPAKLWCDNQAALHI 899
            + W +K+QN V+ SS +                          +  P K++ DNQ  + I
Sbjct: 1290 ICWNTKRQNSVAASSTEAEYMALFEAVREALWLKFLLTSINIKLENPIKIYEDNQGCISI 1349

BLAST of CSPI05G13760.1 vs. ExPASy Swiss-Prot
Match: P92519 (Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana OX=3702 GN=AtMg00810 PE=4 SV=1)

HSP 1 Score: 168.3 bits (425), Expect = 4.0e-40
Identity = 83/208 (39.90%), Postives = 124/208 (59.62%), Query Frame = 0

Query: 611 LVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYV 670
           L++YVDDI++TG+    ++ L   L   F  KDLG + YFLGI++     G++LSQ KY 
Sbjct: 3   LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYA 62

Query: 671 LDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVS 730
             +L+  G L  KP  TP+              DP  +R +VG L YLT+TRPDI+Y+V+
Sbjct: 63  EQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDISYAVN 122

Query: 731 VVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRST 790
           +V Q M  PT+  +  ++++L Y+K     G+    +    V+ F D+DWAG    RRST
Sbjct: 123 IVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRST 182

Query: 791 SGYCVFVGGNLVSWKSKKQNVVSRSSAD 819
           +G+C F+G N++SW +K+Q  VSRSS +
Sbjct: 183 TGFCTFLGCNIISWSAKRQPTVSRSSTE 210

BLAST of CSPI05G13760.1 vs. ExPASy TrEMBL
Match: A0A438IRR9 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_1320 PE=4 SV=1)

HSP 1 Score: 1256.5 bits (3250), Expect = 0.0e+00
Identity = 611/914 (66.85%), Postives = 725/914 (79.32%), Query Frame = 0

Query: 39   LSKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGF 98
            L KL P+F +L SL+C+SC FAK HR S  PR++KRA + FELVHSD+WGPCPV SQTGF
Sbjct: 411  LKKLCPQFDTLPSLDCESCHFAKHHRSSLGPRLNKRAESLFELVHSDVWGPCPVTSQTGF 470

Query: 99   RYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSL 158
            RYFVTFVDD SR+TW+Y MKNRSE+ SHFCAF  EIK Q++VS+K LR+DN  EY S+S 
Sbjct: 471  RYFVTFVDDFSRMTWIYFMKNRSEVFSHFCAFSAEIKTQYDVSVKILRSDNGKEYVSNSF 530

Query: 159  GSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETTLALSFQMHVSKIFWVDAVSTACFF 218
             +Y+ +NGI+HQ+SC DTPSQNGVAERKNRHLLET  AL FQM V K FW DAVSTACF 
Sbjct: 531  QNYMSQNGILHQTSCVDTPSQNGVAERKNRHLLETARALMFQMKVPKQFWADAVSTACFL 590

Query: 219  INRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGY 278
            INRMP+ VL G+IPY+V+ P K LFP+AP+IFGC C+VRD RP  TKLDPK+L+C+FLGY
Sbjct: 591  INRMPTVVLKGDIPYKVIHPQKSLFPLAPRIFGCTCYVRDTRPFVTKLDPKALQCVFLGY 650

Query: 279  SRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTP---- 338
            SR+QKGYRC+ P L +YLVS DVVF EDT F SSP+S    ED+   +Y+V +  P    
Sbjct: 651  SRLQKGYRCFSPDLNKYLVSTDVVFSEDTSFFSSPTSSASEEDEEWLVYQVVNSRPTVGQ 710

Query: 339  --------SLSTDVP-------PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDD 398
                    SL+   P       P++P I QVYSRR  P  +D+C P+  PSS DP+   D
Sbjct: 711  SSVVDSDASLAPSGPVVHIPPAPAKPPIVQVYSRR--PVTTDTC-PAPAPSSSDPSSDLD 770

Query: 399  LPIALRKGKR--KCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAM 458
            LPI+LRKGKR  K  Y +++F+SY  LS S+   + S++S S+P +V EAL+HPGW+NAM
Sbjct: 771  LPISLRKGKRHYKSIYSIANFVSYDHLSSSSSVLVASIDSISVPKTVTEALNHPGWKNAM 830

Query: 459  IEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTD 518
            +EE+ AL+DN TW LV  P GKK +GCKWVFAVK+N DG+VARLKARLVA+GYAQ YG D
Sbjct: 831  LEEICALEDNHTWKLVDLPQGKKVVGCKWVFAVKVNLDGSVARLKARLVARGYAQTYGVD 890

Query: 519  YSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGES 578
            YSDTFSPVAKL S+RLF+S+AA+ +W +HQLDIKNAFLHGDL+EEVY+EQPPGFVAQGE 
Sbjct: 891  YSDTFSPVAKLNSVRLFISIAASQQWMIHQLDIKNAFLHGDLEEEVYLEQPPGFVAQGEY 950

Query: 579  DKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDD 638
             KVCRL+K+LYGLKQSPRAWFGKFS+ +  FGM KS  DHSVFY++S  GI+LLVVYVDD
Sbjct: 951  GKVCRLKKALYGLKQSPRAWFGKFSKEIQAFGMNKSEKDHSVFYKKSAAGIILLVVYVDD 1010

Query: 639  IVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSET 698
            IVITGND  GIS LKTF+  +F+TKDLG+LKYFLGIEV RSKKG++LSQRKYVLDLL ET
Sbjct: 1011 IVITGNDHAGISDLKTFMHSKFHTKDLGELKYFLGIEVSRSKKGMFLSQRKYVLDLLKET 1070

Query: 699  GKLGAKPSGTPMMPNQQLV-KEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFM 758
            GK+ AKP  TPM+PN QL+  +G+   +PERYRR+VGKLNYLTVTRPDIAY+VSVVSQF 
Sbjct: 1071 GKIEAKPCTTPMVPNVQLMPDDGDPFYNPERYRRVVGKLNYLTVTRPDIAYAVSVVSQFT 1130

Query: 759  SSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVF 818
            S+PT+ HWAA+EQILCYLK APG GILY   GHTR+ECFSDADWAGS+ DRRST+GYCVF
Sbjct: 1131 SAPTIKHWAALEQILCYLKKAPGLGILYSSQGHTRIECFSDADWAGSKFDRRSTTGYCVF 1190

Query: 819  VGGNLVSWKSKKQNVVSRSSAD--------------------------ITVPAKLWCDNQ 878
             GGNLV+WKSKKQ+VVSRSSA+                           T+PAKLWCDNQ
Sbjct: 1191 FGGNLVAWKSKKQSVVSRSSAESEYRAMAQATCEIIWIHQLLCEVGMKCTMPAKLWCDNQ 1250

Query: 879  AALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKSLNGTRISY 905
            AALHIA+NP++HERTKHIEVDCHFIREKI++ LVSTGYVKTGEQLGDI TK+LNGTR+ Y
Sbjct: 1251 AALHIAANPIYHERTKHIEVDCHFIREKIEENLVSTGYVKTGEQLGDIFTKALNGTRVEY 1310

BLAST of CSPI05G13760.1 vs. ExPASy TrEMBL
Match: Q6L3Q0 (Polyprotein, putative OS=Solanum demissum OX=50514 GN=SDM1_42t00018 PE=4 SV=2)

HSP 1 Score: 1201.8 bits (3108), Expect = 0.0e+00
Identity = 590/899 (65.63%), Postives = 700/899 (77.86%), Query Frame = 0

Query: 39   LSKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGF 98
            L KL P+F ++ S++C+SC FAK HR+S SPR +KRA   FELVHSD+WGPCPVVS+ GF
Sbjct: 506  LKKLCPQFHNVPSIDCESCHFAKHHRISLSPRNNKRANFAFELVHSDVWGPCPVVSKVGF 565

Query: 99   RYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSL 158
            RYFVTF+DD SR+TW+Y MKNRSE+ SHF  F  EIK QFN S+  LR+DNA E+ S S 
Sbjct: 566  RYFVTFMDDFSRMTWIYFMKNRSEVFSHFSNFCAEIKTQFNASVHILRSDNAREFMSASF 625

Query: 159  GSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETTLALSFQMHVSKIFWVDAVSTACFF 218
             +Y+ + GI+HQSSC DTPSQNGVAERKNRHLLET   L FQM V K FW D VSTA F 
Sbjct: 626  QNYMNQYGILHQSSCVDTPSQNGVAERKNRHLLETARVLLFQMKVPKQFWADTVSTASFL 685

Query: 219  INRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGY 278
            INRMPS+VLNG+IPY VLFP K LFP+ PK+FG  C+VRDVRPH TKLDPK+LKC+FLGY
Sbjct: 686  INRMPSTVLNGDIPYGVLFPNKPLFPLEPKVFGSTCYVRDVRPHITKLDPKALKCVFLGY 745

Query: 279  SRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSL---CQGEDDNLFIYEVTSPTPS 338
            SR+QKGYRCY PTL RY+VS DVVF E   F SSP +     Q ED+   IY  T     
Sbjct: 746  SRLQKGYRCYSPTLNRYMVSIDVVFSESISFFSSPDTFPTQGQQEDEEWLIYRTTPSRSE 805

Query: 339  LSTDVPPS-----------------RPLISQVYSRRPPPQPSDSCPPSMLPSS----CDP 398
               +VP S                 +P I QVYSRR     +D+CP   L SS     +P
Sbjct: 806  QHKEVPGSVEQSMENVSSDAPLAQTKPPIVQVYSRR--QVTNDTCPAPTLSSSDPLPVNP 865

Query: 399  APSD--DLPIALRKGKRKC--TYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSH 458
            +P++  D+PIALRKGKR+C   Y +++FISY  LSP++ + I SL+S  +P +V EAL+H
Sbjct: 866  SPTENLDIPIALRKGKRQCPSIYSIANFISYDHLSPTSCSLIASLDSIFVPKTVREALNH 925

Query: 459  PGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGY 518
            PGW +AM++E+ ALDDN TW+LV  P GKKA+GCKWVF +K+NPDG++ARLKARLVAKGY
Sbjct: 926  PGWYDAMLDEIHALDDNHTWNLVDLPKGKKAVGCKWVFTIKVNPDGSMARLKARLVAKGY 985

Query: 519  AQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPG 578
            AQ YG DYSDTFSPVAKLTS+RLF+S+AA+  W LHQL IKNAFLHGDLQEEVYMEQPPG
Sbjct: 986  AQTYGVDYSDTFSPVAKLTSVRLFISLAASQNWPLHQLAIKNAFLHGDLQEEVYMEQPPG 1045

Query: 579  FVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVL 638
            FVAQGE+ KVC L+K LYGLKQSPRAWFGKFS+ +  FG+ KS  DHSVFYR+S  GI+L
Sbjct: 1046 FVAQGENGKVCHLKKPLYGLKQSPRAWFGKFSEVVQKFGLTKSNCDHSVFYRQSAVGIIL 1105

Query: 639  LVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYV 698
            LVVYVDDIVIT +D  GISSLK FL   F+TKDLGQLKYFLGIEV RSKKGI+LSQRKY+
Sbjct: 1106 LVVYVDDIVITRSDYAGISSLKLFLHSMFHTKDLGQLKYFLGIEVSRSKKGIFLSQRKYI 1165

Query: 699  LDLLSETGKLGAKPSGTPMMPNQQLVK-EGELCKDPERYRRLVGKLNYLTVTRPDIAYSV 758
            LDLL ETGK  AKP  TPM+PN QL   +G+   DPERYRRLVGKLNYLTVTRPDI+++V
Sbjct: 1166 LDLLEETGKSAAKPCSTPMVPNTQLTNDDGDPFDDPERYRRLVGKLNYLTVTRPDISFAV 1225

Query: 759  SVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRS 818
            S+VSQFMS+PT+ HWAA+EQILCYLK APG GI+Y+++ HTR+ECF+D DWAGS+ DRRS
Sbjct: 1226 SIVSQFMSTPTIKHWAALEQILCYLKGAPGLGIVYRNNEHTRIECFADVDWAGSKIDRRS 1285

Query: 819  TSGYCVFVGGNLVSWKSKKQNVVSRSSADITV----PAKLWCDNQAALHIASNPVFHERT 878
            T+GYCVFVGGNLVSW+ +  +     +  + +      KLWCDNQAALHIASNPV+HERT
Sbjct: 1286 TTGYCVFVGGNLVSWRMQNPSTELWHNPQVRLCGYFNPKLWCDNQAALHIASNPVYHERT 1345

Query: 879  KHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKSLNGTRISYLCNKLGMIDIFAPA 905
            KHIEVDCHFIREKIQ+ L+ST YVKTGEQL D+ TK+L G R++YLCNKLGMI+I+APA
Sbjct: 1346 KHIEVDCHFIREKIQENLISTSYVKTGEQLADLFTKALTGARVNYLCNKLGMINIYAPA 1402

BLAST of CSPI05G13760.1 vs. ExPASy TrEMBL
Match: A0A438DZQ8 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_3537 PE=4 SV=1)

HSP 1 Score: 1191.4 bits (3081), Expect = 0.0e+00
Identity = 580/891 (65.10%), Postives = 695/891 (78.00%), Query Frame = 0

Query: 39   LSKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGF 98
            L KL P+F +L SL+C+SC FAK HR S  PR++KR  + FELVHSD+WGPCPV SQTGF
Sbjct: 471  LKKLCPQFDTLPSLDCESCHFAKHHRSSLGPRLNKRVESLFELVHSDVWGPCPVTSQTGF 530

Query: 99   RYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSL 158
            RYFVTFVDD SR+TW+Y MKNRSE+ SHFCAF  EIK Q++VS+K LR+DN  EY S+S 
Sbjct: 531  RYFVTFVDDFSRMTWIYFMKNRSEVFSHFCAFSAEIKTQYDVSVKILRSDNGKEYVSNSF 590

Query: 159  GSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETTLALSFQMHVSKIFWVDAVSTACFF 218
             +Y+  NGI+HQ+SC DTPSQNGVAERKNRHLLET  AL FQM V K FW DAVSTACF 
Sbjct: 591  QNYMSHNGILHQTSCVDTPSQNGVAERKNRHLLETARALMFQMKVPKQFWADAVSTACFL 650

Query: 219  INRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGY 278
            INRMP+ VL G+IPY+ + P K LFP+AP+IFGC C+VRD RP  TKLDPK+L+C+FLGY
Sbjct: 651  INRMPTVVLKGDIPYKAIHPQKSLFPLAPRIFGCTCYVRDTRPFVTKLDPKALQCVFLGY 710

Query: 279  SRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLS- 338
            SR+QKGYRC+ P L +YLVS D+VF EDT F SSP+S    ED+   +Y+V +  P++  
Sbjct: 711  SRLQKGYRCFSPNLNKYLVSTDIVFSEDTSFFSSPTSSASEEDEEWLVYQVVNSRPTVGQ 770

Query: 339  ----------------TDVP--PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDD 398
                             ++P  P++P I Q+YSRR  P  +D+C P+  PSS DP+   D
Sbjct: 771  SSVVDSDASLAHSSPVVNIPPAPAKPPIVQMYSRR--PVTTDTC-PAPAPSSSDPSSDLD 830

Query: 399  LPIALRKGKRKC--TYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAM 458
            LPI+L KGKR C   Y +++F+SY  LS S+   + S++S S+P +V EAL+HPGW+NAM
Sbjct: 831  LPISLWKGKRHCKSIYSIANFVSYDHLSSSSSVLVASIDSISVPKTVTEALNHPGWKNAM 890

Query: 459  IEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTD 518
            +EE+ AL+DN TW LV  P GKK +GCKWVFAVK++PDG+VARLKARLVA+GYAQ YG D
Sbjct: 891  LEEIYALEDNHTWQLVDLPQGKKVVGCKWVFAVKVHPDGSVARLKARLVARGYAQTYGVD 950

Query: 519  YSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGES 578
            YSDTFSPVAKL S+RLF+S+AA+ +W +HQLDIKNAFLHGDL+EEVY+EQPPGFVAQGE 
Sbjct: 951  YSDTFSPVAKLNSVRLFISIAASQQWMIHQLDIKNAFLHGDLEEEVYLEQPPGFVAQGEY 1010

Query: 579  DKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDD 638
             KVCRL+K+LYGLKQSPRAWFGKFS+ +  FGM KS  DHSVFY++S  GI+LLVVYV+D
Sbjct: 1011 GKVCRLKKALYGLKQSPRAWFGKFSKEIQAFGMNKSEKDHSVFYKKSAAGIILLVVYVND 1070

Query: 639  IVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSET 698
            IVIT ND  GIS LKTF+  +F+TKDLG+LKYFLGIEV RSKK ++LSQRKYVLDLL ET
Sbjct: 1071 IVITRNDHAGISDLKTFMHSKFHTKDLGELKYFLGIEVSRSKKRMFLSQRKYVLDLLKET 1130

Query: 699  GKLGAKPSGTPMMPNQQLV-KEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFM 758
            GK+ AKP  TPM+PN QL+  +G+   +PER RR+VGKLNYLTVTRPDIAY+VSVVSQF 
Sbjct: 1131 GKIEAKPCTTPMVPNVQLMPDDGDPFYNPERCRRVVGKLNYLTVTRPDIAYAVSVVSQFT 1190

Query: 759  SSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVF 818
             +PT+ HWAA+EQILCYLK APG GILY   GHTR+ECFSD DWAGS+ DRRST+GYCVF
Sbjct: 1191 FAPTIKHWAALEQILCYLKKAPGLGILYSSQGHTRIECFSDTDWAGSKFDRRSTTGYCVF 1250

Query: 819  VGGNLVSWKSKKQNVVSRSSAD--------------------------ITVPAKLWCDNQ 878
             GGNLV+WKSKKQ+VVSRSSA+                           T+PAKLWCDNQ
Sbjct: 1251 FGGNLVAWKSKKQSVVSRSSAESEYRAMAQATCEIIWIHQLLCEVGMKCTMPAKLWCDNQ 1310

Query: 879  AALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTK 882
            AALHIA+NPV+HERTKHIEVDCHFIREKI++ LVSTGYVKTGEQLG  L K
Sbjct: 1311 AALHIAANPVYHERTKHIEVDCHFIREKIEENLVSTGYVKTGEQLGIFLQK 1358

BLAST of CSPI05G13760.1 vs. ExPASy TrEMBL
Match: A0A438HPS2 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_2969 PE=4 SV=1)

HSP 1 Score: 1189.9 bits (3077), Expect = 0.0e+00
Identity = 580/872 (66.51%), Postives = 690/872 (79.13%), Query Frame = 0

Query: 39   LSKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGF 98
            L KL P+F +L SL+C+SC FAK HR S  PR++KRA + FELVHSD+WG CPV SQTGF
Sbjct: 471  LKKLCPQFDTLPSLDCESCHFAKHHRSSLGPRLNKRAESLFELVHSDVWGLCPVTSQTGF 530

Query: 99   RYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSL 158
            +YFVTFVDD SR+TW+Y MKNRSE+ SHFCAF  EIK Q++VS+K LR+DN  EY S+S 
Sbjct: 531  QYFVTFVDDFSRMTWIYFMKNRSEVFSHFCAFFAEIKTQYDVSVKILRSDNGKEYVSNSF 590

Query: 159  GSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETTLALSFQMHVSKIFWVDAVSTACFF 218
             +Y+  NGI+HQ+SC DTPSQNGVAERKNRHLLETT AL FQM V K FWVDAVSTACF 
Sbjct: 591  QNYMSHNGILHQTSCVDTPSQNGVAERKNRHLLETTRALMFQMKVPKQFWVDAVSTACFL 650

Query: 219  INRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGY 278
            IN MP+ VL G+IPY+V+ P K LFP+ P+IFGC C+VRD RP  TKLDPK+L+C+FLGY
Sbjct: 651  INCMPTVVLKGDIPYKVIHPQKSLFPLEPRIFGCTCYVRDTRPFFTKLDPKALQCVFLGY 710

Query: 279  SRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLST 338
            SR+QKGYRC+ P L +YLVS DVVF EDT F SSP+S    ED+   +Y+V +       
Sbjct: 711  SRLQKGYRCFSPDLNKYLVSTDVVFSEDTSFFSSPTSSASEEDEEWLVYQVVN------- 770

Query: 339  DVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKC--TYPVS 398
                SRP + Q  S    P  +D+C P+  PSS DP+   DL I+LRKGKR C   Y ++
Sbjct: 771  ----SRPTVGQ-SSVLSAPVTTDTC-PAPAPSSSDPSSDLDLSISLRKGKRHCKSIYSIA 830

Query: 399  SFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRP 458
            +F+SY  LS S+   + S++S S+P +V EAL+HPGW+NA++EE+ AL+DN TW LV  P
Sbjct: 831  NFVSYDHLSSSSSVLVASIDSISVPKTVTEALNHPGWKNAILEEIYALEDNHTWKLVDSP 890

Query: 459  AGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLS 518
             GKK +GCKWVFAVK+NPDG+VARLKARLVAKGYAQ YG DYSDTFSPVAKL S+RLF+S
Sbjct: 891  QGKKVVGCKWVFAVKVNPDGSVARLKARLVAKGYAQTYGVDYSDTFSPVAKLNSVRLFIS 950

Query: 519  MAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRA 578
            +AA+ +W +HQLDIKNAFLHGDL+EEVY+EQPPGFVAQGE  KVCRL+K+LYGLKQSPRA
Sbjct: 951  IAASQQWMIHQLDIKNAFLHGDLEEEVYLEQPPGFVAQGEYGKVCRLKKALYGLKQSPRA 1010

Query: 579  WFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQ 638
            WFGKFS+ +  FGM KS  DHSVFY++S  GI+LLVVYVDDIVITGND  GIS LKTF+ 
Sbjct: 1011 WFGKFSKEIQAFGMNKSEKDHSVFYKKSAVGIILLVVYVDDIVITGNDHAGISDLKTFMH 1070

Query: 639  GQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLV 698
             +F+TKDLG+ KYFLGIEV RSKKG++LSQRKYVLDLL ETGK+ AKP  TPM+PN QL+
Sbjct: 1071 SKFHTKDLGEQKYFLGIEVSRSKKGMFLSQRKYVLDLLKETGKIEAKPCTTPMVPNVQLM 1130

Query: 699  -KEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLK 758
              +G+   +PERYRR+VGKLNYLTVTRPDIAY+VSVVSQF S+PT+ HWAA+EQILCYLK
Sbjct: 1131 PDDGDPFYNPERYRRVVGKLNYLTVTRPDIAYAVSVVSQFTSAPTIKHWAALEQILCYLK 1190

Query: 759  AAPGRGILYKDHGHTRVECFSDADWAGSREDRRST--SGYCVFVGGNL-VSWKSKKQNVV 818
             A G GILY   GHTR+ECFSDADWAGS+ DRRST  S Y         + W      ++
Sbjct: 1191 KALGLGILYSSQGHTRIECFSDADWAGSKFDRRSTTESEYRAMAQATCEIIW---IHQLL 1250

Query: 819  SRSSADITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTG 878
                   T+PAKLWCDNQAALHIA+NPV+HERTKHIEVDCHFIREKI++ LVSTGYVKTG
Sbjct: 1251 CEVGMKCTMPAKLWCDNQAALHIAANPVYHERTKHIEVDCHFIREKIEENLVSTGYVKTG 1310

Query: 879  EQLGDILTKSLNGTRISYLCNKLGMIDIFAPA 905
            EQLGDI TK+LNGTR+ Y CNKLGMI+I+APA
Sbjct: 1311 EQLGDIFTKALNGTRVEYFCNKLGMINIYAPA 1326

BLAST of CSPI05G13760.1 vs. ExPASy TrEMBL
Match: A0A438H537 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_4131 PE=4 SV=1)

HSP 1 Score: 1185.2 bits (3065), Expect = 0.0e+00
Identity = 584/914 (63.89%), Postives = 697/914 (76.26%), Query Frame = 0

Query: 39   LSKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGF 98
            L KL P+F +L SL+C+SC FAK HR S  PR++KRA + FELVHSD+WGPCPV SQTGF
Sbjct: 471  LKKLCPQFDTLPSLDCESCHFAKHHRSSLGPRLNKRAESLFELVHSDVWGPCPVTSQTGF 530

Query: 99   RYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSL 158
            RYFVTFVDD SR+TW+Y MKNRSE+ SHFCAF  EIK Q++VS+K LR+DN  EY S+S 
Sbjct: 531  RYFVTFVDDFSRMTWIYFMKNRSEVFSHFCAFSAEIKTQYDVSVKILRSDNGKEYVSNSF 590

Query: 159  GSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETTLALSFQMHVSKIFWVDAVSTACFF 218
             +Y+  NGI+HQ+SC DTPSQNGVAERKNRHLLET  AL FQM V K FW DAVS ACF 
Sbjct: 591  QNYMSHNGILHQTSCVDTPSQNGVAERKNRHLLETARALMFQMKVPKQFWADAVSPACFL 650

Query: 219  INRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGY 278
            INRMP+ VL G+I Y+V+ P K LFP+AP+IFGC C+VRD RP  TKLDPK+L+C+FLGY
Sbjct: 651  INRMPTVVLKGDILYKVIHPQKSLFPLAPRIFGCTCYVRDTRPFVTKLDPKALQCVFLGY 710

Query: 279  SRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLS- 338
            SR+QKGYRC+ P L +YLVS DVVF EDT F SSP+S    ED+   +Y+V +  P++  
Sbjct: 711  SRLQKGYRCFSPDLNKYLVSTDVVFSEDTSFFSSPTSSESEEDEEWLVYQVVNSRPTVGQ 770

Query: 339  ----------------TDVP--PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDD 398
                             ++P  P++P I QVYSRR  P  +D+C P+  PSS DP+   D
Sbjct: 771  SSVVNSDASLAHSGPVVNIPPAPAKPPIVQVYSRR--PVTTDTC-PAPAPSSSDPSSDLD 830

Query: 399  LPIALRKGKRKC--TYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAM 458
            LPI+LRKGKR C   Y +++F+SY  LS S+   + S++S S+P +V EAL+HPGW+NAM
Sbjct: 831  LPISLRKGKRHCKSIYSIANFVSYDHLSSSSSVLVASIDSISVPKTVTEALNHPGWKNAM 890

Query: 459  IEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTD 518
            +EE+ AL+DN TW LV  P GKK +GCKWVFAVK+NPDG+VARLKARLVA+GYAQ YG D
Sbjct: 891  LEEICALEDNHTWKLVDLPQGKKVVGCKWVFAVKVNPDGSVARLKARLVARGYAQTYGVD 950

Query: 519  YSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGES 578
            YSDTFSPVAKL S+RLF+S+AA+ +W +HQLDIKNAFLHGDL+EEV              
Sbjct: 951  YSDTFSPVAKLNSVRLFISIAASQQWMIHQLDIKNAFLHGDLEEEV-------------- 1010

Query: 579  DKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDD 638
                           SPRAWFGKFS+ +  FGM KS  DHSVFY++S  GI+LLVVYVDD
Sbjct: 1011 -----------ATAWSPRAWFGKFSKEIQAFGMNKSEKDHSVFYKKSAVGIILLVVYVDD 1070

Query: 639  IVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSET 698
            IVITGND  GIS LKTF+  +F+TKDLG+LKYFLGIEV RSKKG++LSQRKYVLDLL ET
Sbjct: 1071 IVITGNDHAGISDLKTFMHSKFHTKDLGELKYFLGIEVSRSKKGMFLSQRKYVLDLLKET 1130

Query: 699  GKLGAKPSGTPMMPNQQLV-KEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFM 758
            GK+ AKP  TPM+PN QL+  +G+   +PERYRR+VGKLNYLTVTRPDIAY+VSVVSQF 
Sbjct: 1131 GKIEAKPCTTPMVPNVQLMPDDGDPFYNPERYRRVVGKLNYLTVTRPDIAYAVSVVSQFT 1190

Query: 759  SSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVF 818
            S+PT+ HWAA+EQILCYLK APG GILY   GHTR+ECFSDADWAGS+ DRRST+GYCVF
Sbjct: 1191 SAPTIKHWAALEQILCYLKKAPGLGILYSSQGHTRIECFSDADWAGSKFDRRSTTGYCVF 1250

Query: 819  VGGNLVSWKSKKQNVVSRSSAD--------------------------ITVPAKLWCDNQ 878
             GGNLV+WKSKKQ+VVSRSSA+                           T+PAKLWCDNQ
Sbjct: 1251 FGGNLVAWKSKKQSVVSRSSAESEYRAMTQATCEIIWIHQLLCEVGMKCTMPAKLWCDNQ 1310

Query: 879  AALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKSLNGTRISY 905
            AALHIA+NPV+HERTKHI+VDCHFIREKI++ LVSTGYVKTGEQLGDI TK+LNGTR+ Y
Sbjct: 1311 AALHIAANPVYHERTKHIKVDCHFIREKIEENLVSTGYVKTGEQLGDIFTKALNGTRVEY 1356

BLAST of CSPI05G13760.1 vs. NCBI nr
Match: XP_031744754.1 (uncharacterized protein LOC101212255 isoform X2 [Cucumis sativus])

HSP 1 Score: 1753.4 bits (4540), Expect = 0.0e+00
Identity = 859/892 (96.30%), Postives = 861/892 (96.52%), Query Frame = 0

Query: 39   LSKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGF 98
            L KLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGF
Sbjct: 456  LKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGF 515

Query: 99   RYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSL 158
            RYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSL
Sbjct: 516  RYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSL 575

Query: 159  GSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETTLALSFQMHVSKIFWVDAVSTACFF 218
            GSYLCENGIIHQSSCADTPSQNGVAERKNRHLLET  ALSFQMHVSKIFWVDAVSTACF 
Sbjct: 576  GSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFL 635

Query: 219  INRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGY 278
            INRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGY
Sbjct: 636  INRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGY 695

Query: 279  SRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLST 338
            SRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLST
Sbjct: 696  SRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLST 755

Query: 339  DVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSF 398
            DV PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSF
Sbjct: 756  DVSPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSF 815

Query: 399  ISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAG 458
            ISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAG
Sbjct: 816  ISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAG 875

Query: 459  KKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMA 518
            KKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMA
Sbjct: 876  KKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMA 935

Query: 519  ATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWF 578
            ATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWF
Sbjct: 936  ATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWF 995

Query: 579  GKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQ 638
            GKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQ
Sbjct: 996  GKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQ 1055

Query: 639  FYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKE 698
            FYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKE
Sbjct: 1056 FYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKE 1115

Query: 699  GELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAP 758
            GELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAP
Sbjct: 1116 GELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAP 1175

Query: 759  GRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAD 818
            GRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSA+
Sbjct: 1176 GRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAE 1235

Query: 819  --------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDC 878
                                      ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDC
Sbjct: 1236 SEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDC 1295

Query: 879  HFIREKIQDGLVSTGYVKTGEQLGDILTKSLNGTRISYLCNKLGMIDIFAPA 905
            HFIREKIQDGLVSTGYVKTGEQLGDILTK+LNGTRISYLCNKLGMIDIFAPA
Sbjct: 1296 HFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA 1347

BLAST of CSPI05G13760.1 vs. NCBI nr
Match: XP_031744758.1 (uncharacterized protein LOC101212255 isoform X5 [Cucumis sativus])

HSP 1 Score: 1753.4 bits (4540), Expect = 0.0e+00
Identity = 859/892 (96.30%), Postives = 861/892 (96.52%), Query Frame = 0

Query: 39  LSKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGF 98
           L KLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGF
Sbjct: 79  LKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGF 138

Query: 99  RYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSL 158
           RYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSL
Sbjct: 139 RYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSL 198

Query: 159 GSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETTLALSFQMHVSKIFWVDAVSTACFF 218
           GSYLCENGIIHQSSCADTPSQNGVAERKNRHLLET  ALSFQMHVSKIFWVDAVSTACF 
Sbjct: 199 GSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFL 258

Query: 219 INRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGY 278
           INRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGY
Sbjct: 259 INRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGY 318

Query: 279 SRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLST 338
           SRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLST
Sbjct: 319 SRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLST 378

Query: 339 DVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSF 398
           DV PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSF
Sbjct: 379 DVSPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSF 438

Query: 399 ISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAG 458
           ISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAG
Sbjct: 439 ISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAG 498

Query: 459 KKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMA 518
           KKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMA
Sbjct: 499 KKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMA 558

Query: 519 ATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWF 578
           ATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWF
Sbjct: 559 ATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWF 618

Query: 579 GKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQ 638
           GKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQ
Sbjct: 619 GKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQ 678

Query: 639 FYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKE 698
           FYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKE
Sbjct: 679 FYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKE 738

Query: 699 GELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAP 758
           GELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAP
Sbjct: 739 GELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAP 798

Query: 759 GRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAD 818
           GRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSA+
Sbjct: 799 GRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAE 858

Query: 819 --------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDC 878
                                     ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDC
Sbjct: 859 SEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDC 918

Query: 879 HFIREKIQDGLVSTGYVKTGEQLGDILTKSLNGTRISYLCNKLGMIDIFAPA 905
           HFIREKIQDGLVSTGYVKTGEQLGDILTK+LNGTRISYLCNKLGMIDIFAPA
Sbjct: 919 HFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA 970

BLAST of CSPI05G13760.1 vs. NCBI nr
Match: XP_031744753.1 (uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus])

HSP 1 Score: 1753.4 bits (4540), Expect = 0.0e+00
Identity = 859/892 (96.30%), Postives = 861/892 (96.52%), Query Frame = 0

Query: 39   LSKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGF 98
            L KLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGF
Sbjct: 471  LKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGF 530

Query: 99   RYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSL 158
            RYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSL
Sbjct: 531  RYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSL 590

Query: 159  GSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETTLALSFQMHVSKIFWVDAVSTACFF 218
            GSYLCENGIIHQSSCADTPSQNGVAERKNRHLLET  ALSFQMHVSKIFWVDAVSTACF 
Sbjct: 591  GSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFL 650

Query: 219  INRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGY 278
            INRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGY
Sbjct: 651  INRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGY 710

Query: 279  SRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLST 338
            SRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLST
Sbjct: 711  SRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLST 770

Query: 339  DVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSF 398
            DV PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSF
Sbjct: 771  DVSPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSF 830

Query: 399  ISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAG 458
            ISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAG
Sbjct: 831  ISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAG 890

Query: 459  KKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMA 518
            KKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMA
Sbjct: 891  KKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMA 950

Query: 519  ATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWF 578
            ATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWF
Sbjct: 951  ATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWF 1010

Query: 579  GKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQ 638
            GKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQ
Sbjct: 1011 GKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQ 1070

Query: 639  FYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKE 698
            FYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKE
Sbjct: 1071 FYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKE 1130

Query: 699  GELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAP 758
            GELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAP
Sbjct: 1131 GELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAP 1190

Query: 759  GRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAD 818
            GRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSA+
Sbjct: 1191 GRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAE 1250

Query: 819  --------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDC 878
                                      ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDC
Sbjct: 1251 SEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDC 1310

Query: 879  HFIREKIQDGLVSTGYVKTGEQLGDILTKSLNGTRISYLCNKLGMIDIFAPA 905
            HFIREKIQDGLVSTGYVKTGEQLGDILTK+LNGTRISYLCNKLGMIDIFAPA
Sbjct: 1311 HFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA 1362

BLAST of CSPI05G13760.1 vs. NCBI nr
Match: XP_031744755.1 (uncharacterized protein LOC101212255 isoform X3 [Cucumis sativus])

HSP 1 Score: 1753.4 bits (4540), Expect = 0.0e+00
Identity = 859/892 (96.30%), Postives = 861/892 (96.52%), Query Frame = 0

Query: 39   LSKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGF 98
            L KLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGF
Sbjct: 423  LKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGF 482

Query: 99   RYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSL 158
            RYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSL
Sbjct: 483  RYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSL 542

Query: 159  GSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETTLALSFQMHVSKIFWVDAVSTACFF 218
            GSYLCENGIIHQSSCADTPSQNGVAERKNRHLLET  ALSFQMHVSKIFWVDAVSTACF 
Sbjct: 543  GSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFL 602

Query: 219  INRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGY 278
            INRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGY
Sbjct: 603  INRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGY 662

Query: 279  SRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLST 338
            SRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLST
Sbjct: 663  SRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLST 722

Query: 339  DVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSF 398
            DV PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSF
Sbjct: 723  DVSPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSF 782

Query: 399  ISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAG 458
            ISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAG
Sbjct: 783  ISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAG 842

Query: 459  KKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMA 518
            KKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMA
Sbjct: 843  KKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMA 902

Query: 519  ATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWF 578
            ATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWF
Sbjct: 903  ATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWF 962

Query: 579  GKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQ 638
            GKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQ
Sbjct: 963  GKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQ 1022

Query: 639  FYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKE 698
            FYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKE
Sbjct: 1023 FYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKE 1082

Query: 699  GELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAP 758
            GELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAP
Sbjct: 1083 GELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAP 1142

Query: 759  GRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAD 818
            GRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSA+
Sbjct: 1143 GRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAE 1202

Query: 819  --------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDC 878
                                      ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDC
Sbjct: 1203 SEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDC 1262

Query: 879  HFIREKIQDGLVSTGYVKTGEQLGDILTKSLNGTRISYLCNKLGMIDIFAPA 905
            HFIREKIQDGLVSTGYVKTGEQLGDILTK+LNGTRISYLCNKLGMIDIFAPA
Sbjct: 1263 HFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA 1314

BLAST of CSPI05G13760.1 vs. NCBI nr
Match: XP_031744756.1 (uncharacterized protein LOC101212255 isoform X4 [Cucumis sativus] >XP_031744757.1 uncharacterized protein LOC101212255 isoform X4 [Cucumis sativus])

HSP 1 Score: 1753.4 bits (4540), Expect = 0.0e+00
Identity = 859/892 (96.30%), Postives = 861/892 (96.52%), Query Frame = 0

Query: 39   LSKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGF 98
            L KLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGF
Sbjct: 285  LKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGF 344

Query: 99   RYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSL 158
            RYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSL
Sbjct: 345  RYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSL 404

Query: 159  GSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETTLALSFQMHVSKIFWVDAVSTACFF 218
            GSYLCENGIIHQSSCADTPSQNGVAERKNRHLLET  ALSFQMHVSKIFWVDAVSTACF 
Sbjct: 405  GSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFL 464

Query: 219  INRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGY 278
            INRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGY
Sbjct: 465  INRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGY 524

Query: 279  SRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLST 338
            SRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLST
Sbjct: 525  SRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLST 584

Query: 339  DVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSF 398
            DV PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSF
Sbjct: 585  DVSPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSF 644

Query: 399  ISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAG 458
            ISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAG
Sbjct: 645  ISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAG 704

Query: 459  KKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMA 518
            KKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMA
Sbjct: 705  KKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMA 764

Query: 519  ATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWF 578
            ATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWF
Sbjct: 765  ATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWF 824

Query: 579  GKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQ 638
            GKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQ
Sbjct: 825  GKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQ 884

Query: 639  FYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKE 698
            FYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKE
Sbjct: 885  FYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKE 944

Query: 699  GELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAP 758
            GELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAP
Sbjct: 945  GELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAP 1004

Query: 759  GRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAD 818
            GRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSA+
Sbjct: 1005 GRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAE 1064

Query: 819  --------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDC 878
                                      ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDC
Sbjct: 1065 SEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDC 1124

Query: 879  HFIREKIQDGLVSTGYVKTGEQLGDILTKSLNGTRISYLCNKLGMIDIFAPA 905
            HFIREKIQDGLVSTGYVKTGEQLGDILTK+LNGTRISYLCNKLGMIDIFAPA
Sbjct: 1125 HFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA 1176

BLAST of CSPI05G13760.1 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 464.9 bits (1195), Expect = 1.5e-130
Identity = 243/544 (44.67%), Postives = 345/544 (63.42%), Query Frame = 0

Query: 393 YPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDL 452
           + +S F+SY ++SP  ++F+  +     P++ +EA     W  AM +E+ A++   TW++
Sbjct: 58  HDISQFLSYEKVSPLYHSFLVCIAKAKEPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEI 117

Query: 453 VSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIR 512
            + P  KK IGCKWV+ +K N DGT+ R KARLVAKGY Q  G D+ +TFSPV KLTS++
Sbjct: 118 CTLPPNKKPIGCKWVYKIKYNSDGTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVK 177

Query: 513 LFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVA-QGES---DKVCRLRKSLY 572
           L L+++A   ++LHQLDI NAFL+GDL EE+YM+ PPG+ A QG+S   + VC L+KS+Y
Sbjct: 178 LILAISAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIY 237

Query: 573 GLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGI 632
           GLKQ+ R WF KFS  L+ FG  +S SDH+ F + +    + ++VYVDDI+I  N+   +
Sbjct: 238 GLKQASRQWFLKFSVTLIGFGFVQSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAV 297

Query: 633 SSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTP 692
             LK+ L+  F  +DLG LKYFLG+E+ RS  GI + QRKY LDLL ETG LG KPS  P
Sbjct: 298 DELKSQLKSCFKLRDLGPLKYFLGLEIARSAAGINICQRKYALDLLDETGLLGCKPSSVP 357

Query: 693 MMPNQQL-VKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAV 752
           M P+       G    D + YRRL+G+L YL +TR DI+++V+ +SQF  +P + H  AV
Sbjct: 358 MDPSVTFSAHSGGDFVDAKAYRRLIGRLMYLQITRLDISFAVNKLSQFSEAPRLAHQQAV 417

Query: 753 EQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSK 812
            +IL Y+K   G+G+ Y      +++ FSDA +   ++ RRST+GYC+F+G +L+SWKSK
Sbjct: 418 MKILHYIKGTVGQGLFYSSQAEMQLQVFSDASFQSCKDTRRSTNGYCMFLGTSLISWKSK 477

Query: 813 KQNVVSRSSAD--------------------------ITVPAKLWCDNQAALHIASNPVF 872
           KQ VVS+SSA+                          ++ P  L+CDN AA+HIA+N VF
Sbjct: 478 KQQVVSKSSAEAEYRALSFATDEMMWLAQFFRELQLPLSKPTLLFCDNTAAIHIATNAVF 537

Query: 873 HERTKHIEVDCHFIREK-IQDGLVSTGYVKTGEQLG--DILTKSLNGTRISYLCNKLGMI 903
           HERTKHIE DCH +RE+ +    +S  +    EQ G  + L+  L GT I Y+ +  G+ 
Sbjct: 538 HERTKHIESDCHSVRERSVYQATLSYSFQAYDEQDGFTEYLSPILRGT-IMYIVSMFGLA 597

BLAST of CSPI05G13760.1 vs. TAIR 10
Match: ATMG00810.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 168.3 bits (425), Expect = 2.8e-41
Identity = 83/208 (39.90%), Postives = 124/208 (59.62%), Query Frame = 0

Query: 611 LVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYV 670
           L++YVDDI++TG+    ++ L   L   F  KDLG + YFLGI++     G++LSQ KY 
Sbjct: 3   LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYA 62

Query: 671 LDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVS 730
             +L+  G L  KP  TP+              DP  +R +VG L YLT+TRPDI+Y+V+
Sbjct: 63  EQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDISYAVN 122

Query: 731 VVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRST 790
           +V Q M  PT+  +  ++++L Y+K     G+    +    V+ F D+DWAG    RRST
Sbjct: 123 IVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRST 182

Query: 791 SGYCVFVGGNLVSWKSKKQNVVSRSSAD 819
           +G+C F+G N++SW +K+Q  VSRSS +
Sbjct: 183 TGFCTFLGCNIISWSAKRQPTVSRSSTE 210

BLAST of CSPI05G13760.1 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase) )

HSP 1 Score: 102.1 bits (253), Expect = 2.5e-21
Identity = 53/117 (45.30%), Postives = 72/117 (61.54%), Query Frame = 0

Query: 402 HQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKA 461
           ++L+P  Y+   +      P SV  AL  PGW  AM EE+ AL  N TW LV  P  +  
Sbjct: 10  NKLNPK-YSLTITTTIKKEPKSVIFALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNI 69

Query: 462 IGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMA 519
           +GCKWVF  K++ DGT+ RLKARLVAKG+ Q  G  + +T+SPV +  +IR  L++A
Sbjct: 70  LGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGIYFVETYSPVVRTATIRTILNVA 125

BLAST of CSPI05G13760.1 vs. TAIR 10
Match: ATMG00240.1 (Gag-Pol-related retrotransposon family protein )

HSP 1 Score: 73.2 bits (178), Expect = 1.2e-12
Identity = 31/81 (38.27%), Postives = 51/81 (62.96%), Query Frame = 0

Query: 717 YLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFS 776
           YLT+TRPD+ ++V+ +SQF S+       AV ++L Y+K   G+G+ Y      +++ F+
Sbjct: 2   YLTITRPDLTFAVNRLSQFSSASRTAQMQAVYKVLHYVKGTVGQGLFYSATSDLQLKAFA 61

Query: 777 DADWAGSREDRRSTSGYCVFV 798
           D+DWA   + RRS +G+C  V
Sbjct: 62  DSDWASCPDTRRSVTGFCSLV 82

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q94HW21.6e-16636.37Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
Q9ZT949.7e-16436.40Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
P109783.4e-14836.06Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
P041463.2e-11430.01Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
P925194.0e-4039.90Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana OX=3702 ... [more]
Match NameE-valueIdentityDescription
A0A438IRR90.0e+0066.85Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
Q6L3Q00.0e+0065.63Polyprotein, putative OS=Solanum demissum OX=50514 GN=SDM1_42t00018 PE=4 SV=2[more]
A0A438DZQ80.0e+0065.10Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
A0A438HPS20.0e+0066.51Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
A0A438H5370.0e+0063.89Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
Match NameE-valueIdentityDescription
XP_031744754.10.0e+0096.30uncharacterized protein LOC101212255 isoform X2 [Cucumis sativus][more]
XP_031744758.10.0e+0096.30uncharacterized protein LOC101212255 isoform X5 [Cucumis sativus][more]
XP_031744753.10.0e+0096.30uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus][more]
XP_031744755.10.0e+0096.30uncharacterized protein LOC101212255 isoform X3 [Cucumis sativus][more]
XP_031744756.10.0e+0096.30uncharacterized protein LOC101212255 isoform X4 [Cucumis sativus] >XP_031744757.... [more]
Match NameE-valueIdentityDescription
AT4G23160.11.5e-13044.67cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [more]
ATMG00810.12.8e-4139.90DNA/RNA polymerases superfamily protein [more]
ATMG00820.12.5e-2145.30Reverse transcriptase (RNA-dependent DNA polymerase) [more]
ATMG00240.11.2e-1238.27Gag-Pol-related retrotransposon family protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 447..689
e-value: 7.6E-76
score: 254.9
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 71..249
e-value: 3.2E-35
score: 123.2
IPR001584Integrase, catalytic corePFAMPF00665rvecoord: 77..173
e-value: 5.9E-13
score: 49.0
IPR001584Integrase, catalytic corePROSITEPS50994INTEGRASEcoord: 74..240
score: 19.095497
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 333..374
NoneNo IPR availablePANTHERPTHR45895FAMILY NOT NAMEDcoord: 418..765
coord: 53..320
NoneNo IPR availableCDDcd09272RNase_HI_RT_Ty1coord: 773..885
e-value: 3.95202E-60
score: 198.846
IPR043502DNA/RNA polymerase superfamilySUPERFAMILY56672DNA/RNA polymerasescoord: 447..853
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 74..238

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CSPI05G13760CSPI05G13760gene


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CSPI05G13760.1.cds1CSPI05G13760.1.cds1CDS
CSPI05G13760.1.cds2CSPI05G13760.1.cds2CDS
CSPI05G13760.1.cds3CSPI05G13760.1.cds3CDS


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CSPI05G13760.1CSPI05G13760.1-proteinpolypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding