CSPI01G22700.1 (mRNA) Wild cucumber (PI 183967)

Name: CSPI01G22700.1
Type: mRNA
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Transposon Ty3-G Gag-Pol polyprotein
Location: Chr1 : 18273044 .. 18275715 (-)
Sequence length: 2274
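The location field can be checked against the listed sequence length with a little arithmetic. The snippet below is a minimal sketch, not part of the database's own tooling: the helper name `parse_location` and its regular expression are illustrative, and the coordinates are assumed to be 1-based and inclusive.

```python
import re

def parse_location(text):
    """Split a location string such as 'Chr1 : 18273044 .. 18275715 (-)'
    into (chromosome, start, end, strand). Hypothetical helper."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Chr1 : 18273044 .. 18275715 (-)")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the genomic span is 2672 bp, which is longer than the listed sequence length of 2274. That is what one would expect if the span includes intronic or untranslated sequence that the 2274-nt sequence omits.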
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS
GCCTTGGGCATCTAAAAACGGATAGACAGATCATGGTTAAGGAATCTATGGAAGAACTTCGACCAGAATTTGAGCAATTACAACTGGAGTTTGAGAATGTGTTTAATATGCCGGCAGAGCTTTCCCCAATGAGACAGGTTGACCACCGAATCAAATTGAAGGAGGGCACAGACCCCATCAACGTGAGACCTTACCGCTACCCACATGCTCAGAAGAATGAAATTGAGAAGCTGGTGAATGAGATGCTCGATTTTGGTATTATACAGCCAAGCATTAGTCCTTTCTCTAGTCTCGTGATCTTAGTGAAAAAGAAGGATGGGGGATGGAGATTCTGCGTTGATTATAGAGCGTTGAATAGAGCAACGGTACCCGATAAATTTCCAATTCCTATGATTCAGTTGTTGGATGAGTTGAATGGGGCAAGTGTTTTCTCTAAGATAGATTTGAAATCGGGGTATCACCAAATTAGGGTGCGCAATGAGGATGTGAGAAAGACTGCTTTTCGAACGCACGAGGGGCACTACGAATTCCTAGTCATGCCATTCGAACTCACCAATGCACCCTCGACGTTCCAAGCCCTTATGAATCAGGTTTTTCGACCCTATCTACTTAAATTCTTGCTAGTATATTTTGACGATACTCTCATGTACAGCAAGGATGTGGAAACTCATTTGGAGCATCTTACAATGGTGTTTCAACTATTAAGACAGCACTGCCTGTTTGCGAACCGGAAGAAGTGCCACTTCGTCAAAGATCGTATTGAATATTTGGGTCATTGGGTCTCAGCCAAGGGGGTAGAGGCTGACCATGAAAAGGTTAAAGCTATGTTGGAGTGGCCTGTGCCGAAGAATGTAAGGGAACTTAGGGGTTTTTTGGGGTTGACCGGGTATTATCGCCGATTTGTAGCAAACTATGGCGCCATTGCCATGCCCCTTATGCGATTGACCAAGAAAAATAATTTTCGTTGGTCGGAAGAAGCAACCCAAGCATTTGAATTCCTCAAGAAAGCCATGGTTACGCTGCCTATTCTAGTACTGCCGAATTTCCAGCTACCTTTCGAAATTGAAACAGATGCATCATGGTTCGGACTAAGTGTGGTCTTGTCTCAGAACAAGAAGCTGATTGCGTACTTCAGTCAAAAACTATCAGAAGCAGCACGTGAAAAATCTGTTTACGAGAGGGAGCTCATGGCCATAGTCCTAGCAGTGGAAAAATGGCGGCACTACTTGTTGGGCCATCGTTTTGTGGTGTATACTGATCAGAAGGCATTGAGGCATATCCTAGAACAGAGGGAGTTAATACTGGGTGTTCAAAAGTGGATAATGAAGTTAATGGGGTTTGACTTTGTGATCTTCTATCGAGCATGATCGGAGAGCAAAGTGAGGCCCAGTTGAATGTGATTATAGTCTCATCTTTACTAGACATTGTGGTAATGTAAAAGGAAGTTCAAGAGGATACGAAACTAAAGGCTATTTTCGTTCGATTGTTAGCAGATCCGGATTGCATTCCTCACTATACAGTTCGACAAGGCAAGTTGTTTTATAGAGGCAGGTTGGTTCTCCCTAAGACTTTGAGTTTAATTCCCACCATCTTGCACACCTTCCATGACTCGGTCATAGGGGGTCATTCCGGACAATTATGCACCTATAAACGAATCGCAGCAGAGTTGTTTTTGGGAAGGAATGAAGAATGATATTAAATTATATGTGGATCAGTGTCATGTGTGCCAACAAAGTAAGATCCAAGCGTTATCTCCGGCCGGACTGCTACAACCTCTCTCTATTCCAAATCGTATTTGTTAGGATATTTTCATGGATTTTGTGGAGGGATTACCACGTTCCAAGGGGGTCGACACCGTATTGGTGGTAGTGGATCGCCTAAGCAAATATGCTCACTTCATAACCTTGGGTCATCCATTCTCGGCCCAAACAGTAGCTATGGTGTTTGTCAAAGAGATAGTGCGCCTTCACGGATATCCTCGTTCAATAGTATCTGATCGA
GATCGAGTGTTCCTAAGTCACTTTTGGAAAGAGTTATACCGATTGCAAGGTACCCAACTAAAGAGAAGCACGACATACCATCCACAAACAGATGGGCAGTCGGAGGTTATTAACAAATGTTTAGAGCTGTATTTAAGATGGTTTTGCCAAGAGAAACCGAGGACATGGAGCGATAAGATTGCGTGGGCCGAGTATTGGTACAATACCAACTACCAATCTTCGATAAAAAACACCCTTTATGCTGTAGTTTATGGACAGCCCCCTCCACCTATCATCTCTTATGGCCAGGCAGGTACAACCCCAAATGATTAAGTGGAATATCAATTGCAATCGCGAGATGAAATGTTAGCTACCCTGAAGAGTCATTTGCAACATGCCCAAGAACAAATGAAGAAGTTTGCCGATGTGTACCGTCGCAATGTGATTTTTGACATTGGGGACTGGGTGTATCTGAAATTACAGCCCTATAGGCAGCAGTCAGTAGCGAAGAAGCGTTGTGAGAAATTATCTCCTAGATATTTTGGGCCATACATGATATTGGGTCGGAGAAGTAGCTTACATGCTGGACCTACCAAAAACTACCAAAATACATCCGGTTTTCCATGTATCACAACTCAAGAAGGCGGTGGGAGACTAACATCAAATTCAACTGGACATAGCAATGCTCAATGA

mRNA sequence

ATGGTTAAGGAATCTATGGAAGAACTTCGACCAGAATTTGAGCAATTACAACTGGAGTTTGAGAATGTGTTTAATATGCCGGCAGAGCTTTCCCCAATGAGACAGGTTGACCACCGAATCAAATTGAAGGAGGGCACAGACCCCATCAACGTGAGACCTTACCGCTACCCACATGCTCAGAAGAATGAAATTGAGAAGCTGGTGAATGAGATGCTCGATTTTGGTATTATACAGCCAAGCATTAGTCCTTTCTCTAGTCTCGTGATCTTAGTGAAAAAGAAGGATGGGGGATGGAGATTCTGCGTTGATTATAGAGCGTTGAATAGAGCAACGGTACCCGATAAATTTCCAATTCCTATGATTCAGTTGTTGGATGAGTTGAATGGGGCAAGTGTTTTCTCTAAGATAGATTTGAAATCGGGGTATCACCAAATTAGGGTGCGCAATGAGGATGTGAGAAAGACTGCTTTTCGAACGCACGAGGGGCACTACGAATTCCTAGTCATGCCATTCGAACTCACCAATGCACCCTCGACGTTCCAAGCCCTTATGAATCAGGTTTTTCGACCCTATCTACTTAAATTCTTGCTAGTATATTTTGACGATACTCTCATGTACAGCAAGGATGTGGAAACTCATTTGGAGCATCTTACAATGGTGTTTCAACTATTAAGACAGCACTGCCTGTTTGCGAACCGGAAGAAGTGCCACTTCGTCAAAGATCGTATTGAATATTTGGGTCATTGGGTCTCAGCCAAGGGGGTAGAGGCTGACCATGAAAAGGTTAAAGCTATGTTGGAGTGGCCTGTGCCGAAGAATGTAAGGGAACTTAGGGGTTTTTTGGGGTTGACCGGGTATTATCGCCGATTTGTAGCAAACTATGGCGCCATTGCCATGCCCCTTATGCGATTGACCAAGAAAAATAATTTTCGTTGGTCGGAAGAAGCAACCCAAGCATTTGAATTCCTCAAGAAAGCCATGGTTACGCTGCCTATTCTAGTACTGCCGAATTTCCAGCTACCTTTCGAAATTGAAACAGATGCATCATGGTTCGGACTAAGTGTGGTCTTGTCTCAGAACAAGAAGCTGATTGCGTACTTCAGTCAAAAACTATCAGAAGCAGCACGTGAAAAATCTGTTTACGAGAGGGAGCTCATGGCCATAGTCCTAGCAGTGGAAAAATGGCGGCACTACTTGTTGGGCCATCGTTTTGTGGTGTATACTGATCAGAAGGCATTGAGGCATATCCTAGAACAGAGGGAGTTAATACTGGGTGTTCAAAAGTGGATAATGAAGTTAATGGGGTTTGACTTTGAAGTTCAAGAGGATACGAAACTAAAGGCTATTTTCGTTCGATTGTTAGCAGATCCGGATTGCATTCCTCACTATACAGGGGTCATTCCGGACAATTATGCACCTATAAACGAATCGCAGCAGAGTTGTTTTTGGGAAGGAATGAAGAATGATATTAAATTATATGTGGATCAGTGTCATGATATTTTCATGGATTTTGTGGAGGGATTACCACGTTCCAAGGGGGTCGACACCGTATTGGTGGTAGTGGATCGCCTAAGCAAATATGCTCACTTCATAACCTTGGGTCATCCATTCTCGGCCCAAACAGTAGCTATGGTGTTTGTCAAAGAGATAGTGCGCCTTCACGGATATCCTCGTTCAATAGTATCTGATCGAGATCGAGTGTTCCTAAGTCACTTTTGGAAAGAGTTATACCGATTGCAAGGTACCCAACTAAAGAGAAGCACGACATACCATCCACAAACAGATGGGCAGTCGGAGGTTATTAACAAATGTTTAGAGCTGTATTTAAGATGGTTTTGCCAAGAGAAACCGAGGACATGGAGCGATAAGATTGCGTGGGCCGAGTATTGGTACAATACCAACTACCAATCTTCGATAAAAAACACCCTTTATGCTGTAGTTTATGGACAGCCCCCTCCACCTATCATCTCTTATGGCCAGGCAGAACAAATGAAGAAGTT
TGCCGATGTGTACCGTCGCAATGTGATTTTTGACATTGGGGACTGGGTGTATCTGAAATTACAGCCCTATAGGCAGCAGTCAGTAGCGAAGAAGCGTTGTGAGAAATTATCTCCTAGATATTTTGGGCCATACATGATATTGGGTCGGAGAAGTAGCTTACATGCTGGACCTACCAAAAACTACCAAAATACATCCGGTTTTCCATGTATCACAACTCAAGAAGGCGGTGGGAGACTAACATCAAATTCAACTGGACATAGCAATGCTCAATGA

Coding sequence (CDS)

ATGGTTAAGGAATCTATGGAAGAACTTCGACCAGAATTTGAGCAATTACAACTGGAGTTTGAGAATGTGTTTAATATGCCGGCAGAGCTTTCCCCAATGAGACAGGTTGACCACCGAATCAAATTGAAGGAGGGCACAGACCCCATCAACGTGAGACCTTACCGCTACCCACATGCTCAGAAGAATGAAATTGAGAAGCTGGTGAATGAGATGCTCGATTTTGGTATTATACAGCCAAGCATTAGTCCTTTCTCTAGTCTCGTGATCTTAGTGAAAAAGAAGGATGGGGGATGGAGATTCTGCGTTGATTATAGAGCGTTGAATAGAGCAACGGTACCCGATAAATTTCCAATTCCTATGATTCAGTTGTTGGATGAGTTGAATGGGGCAAGTGTTTTCTCTAAGATAGATTTGAAATCGGGGTATCACCAAATTAGGGTGCGCAATGAGGATGTGAGAAAGACTGCTTTTCGAACGCACGAGGGGCACTACGAATTCCTAGTCATGCCATTCGAACTCACCAATGCACCCTCGACGTTCCAAGCCCTTATGAATCAGGTTTTTCGACCCTATCTACTTAAATTCTTGCTAGTATATTTTGACGATACTCTCATGTACAGCAAGGATGTGGAAACTCATTTGGAGCATCTTACAATGGTGTTTCAACTATTAAGACAGCACTGCCTGTTTGCGAACCGGAAGAAGTGCCACTTCGTCAAAGATCGTATTGAATATTTGGGTCATTGGGTCTCAGCCAAGGGGGTAGAGGCTGACCATGAAAAGGTTAAAGCTATGTTGGAGTGGCCTGTGCCGAAGAATGTAAGGGAACTTAGGGGTTTTTTGGGGTTGACCGGGTATTATCGCCGATTTGTAGCAAACTATGGCGCCATTGCCATGCCCCTTATGCGATTGACCAAGAAAAATAATTTTCGTTGGTCGGAAGAAGCAACCCAAGCATTTGAATTCCTCAAGAAAGCCATGGTTACGCTGCCTATTCTAGTACTGCCGAATTTCCAGCTACCTTTCGAAATTGAAACAGATGCATCATGGTTCGGACTAAGTGTGGTCTTGTCTCAGAACAAGAAGCTGATTGCGTACTTCAGTCAAAAACTATCAGAAGCAGCACGTGAAAAATCTGTTTACGAGAGGGAGCTCATGGCCATAGTCCTAGCAGTGGAAAAATGGCGGCACTACTTGTTGGGCCATCGTTTTGTGGTGTATACTGATCAGAAGGCATTGAGGCATATCCTAGAACAGAGGGAGTTAATACTGGGTGTTCAAAAGTGGATAATGAAGTTAATGGGGTTTGACTTTGAAGTTCAAGAGGATACGAAACTAAAGGCTATTTTCGTTCGATTGTTAGCAGATCCGGATTGCATTCCTCACTATACAGGGGTCATTCCGGACAATTATGCACCTATAAACGAATCGCAGCAGAGTTGTTTTTGGGAAGGAATGAAGAATGATATTAAATTATATGTGGATCAGTGTCATGATATTTTCATGGATTTTGTGGAGGGATTACCACGTTCCAAGGGGGTCGACACCGTATTGGTGGTAGTGGATCGCCTAAGCAAATATGCTCACTTCATAACCTTGGGTCATCCATTCTCGGCCCAAACAGTAGCTATGGTGTTTGTCAAAGAGATAGTGCGCCTTCACGGATATCCTCGTTCAATAGTATCTGATCGAGATCGAGTGTTCCTAAGTCACTTTTGGAAAGAGTTATACCGATTGCAAGGTACCCAACTAAAGAGAAGCACGACATACCATCCACAAACAGATGGGCAGTCGGAGGTTATTAACAAATGTTTAGAGCTGTATTTAAGATGGTTTTGCCAAGAGAAACCGAGGACATGGAGCGATAAGATTGCGTGGGCCGAGTATTGGTACAATACCAACTACCAATCTTCGATAAAAAACACCCTTTATGCTGTAGTTTATGGACAGCCCCCTCCACCTATCATCTCTTATGGCCAGGCAGAACAAATGAAGAAGTT
TGCCGATGTGTACCGTCGCAATGTGATTTTTGACATTGGGGACTGGGTGTATCTGAAATTACAGCCCTATAGGCAGCAGTCAGTAGCGAAGAAGCGTTGTGAGAAATTATCTCCTAGATATTTTGGGCCATACATGATATTGGGTCGGAGAAGTAGCTTACATGCTGGACCTACCAAAAACTACCAAAATACATCCGGTTTTCCATGTATCACAACTCAAGAAGGCGGTGGGAGACTAACATCAAATTCAACTGGACATAGCAATGCTCAATGA
BLAST of CSPI01G22700.1 vs. Swiss-Prot
Match: POL3_DROME (Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogaster GN=pol PE=3 SV=1)

HSP 1 Score: 300.4 bits (768), Expect = 5.4e-80
Identity = 166/421 (39.43%), Positives = 250/421 (59.38%), Query Frame = 1

Query: 29  ELSPMRQVDHRIKLKEGTDPINVRPYRYPHAQKNEIEKLVNEMLDFGIIQPSISPFSSLV 88
           +L+   Q  H I  K    P+  + Y YP A + E+E  + +ML+ GII+ S SP++S +
Sbjct: 190 KLTFTNQTKHTINTKHNL-PLYSK-YSYPQAYEQEVESQIQDMLNQGIIRTSNSPYNSPI 249

Query: 89  ILVKKKDGG-----WRFCVDYRALNRATVPDKFPIP-MIQLLDELNGASVFSKIDLKSGY 148
            +V KK        +R  +DYR LN  TV D+ PIP M ++L +L   + F+ IDL  G+
Sbjct: 250 WVVPKKQDASGKQKFRIVIDYRKLNEITVGDRHPIPNMDEILGKLGRCNYFTTIDLAKGF 309

Query: 149 HQIRVRNEDVRKTAFRTHEGHYEFLVMPFELTNAPSTFQALMNQVFRPYLLKFLLVYFDD 208
           HQI +  E V KTAF T  GHYE+L MPF L NAP+TFQ  MN + RP L K  LVY DD
Sbjct: 310 HQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKNAPATFQRCMNDILRPLLNKHCLVYLDD 369

Query: 209 TLMYSKDVETHLEHLTMVFQLLRQHCLFANRKKCHFVKDRIEYLGHWVSAKGVEADHEKV 268
            +++S  ++ HL+ L +VF+ L +  L     KC F+K    +LGH ++  G++ + EK+
Sbjct: 370 IIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDKCEFLKQETTFLGHVLTPDGIKPNPEKI 429

Query: 269 KAMLEWPVPKNVRELRGFLGLTGYYRRFVANYGAIAMPLMRLTKKNN--FRWSEEATQAF 328
           +A+ ++P+P   +E++ FLGLTGYYR+F+ N+  IA P+ +  KKN      + E   AF
Sbjct: 430 EAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNFADIAKPMTKCLKKNMKIDTTNPEYDSAF 489

Query: 329 EFLKKAMVTLPILVLPNFQLPFEIETDASWFGLSVVLSQNKKLIAYFSQKLSEAAREKSV 388
           + LK  +   PIL +P+F   F + TDAS   L  VLSQ+   ++Y S+ L+E     S 
Sbjct: 490 KKLKYLISEDPILKVPDFTKKFTLTTDASDVALGAVLSQDGHPLSYISRTLNEHEINYST 549

Query: 389 YERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHILEQRELILGVQKWIMKLMGFDFEV 442
            E+EL+AIV A + +RHYLLG  F + +D + L  +   ++    + +W +KL  FDF++
Sbjct: 550 IEKELLAIVWATKTFRHYLLGRHFEISSDHQPLSWLYRMKDPNSKLTRWRVKLSEFDFDI 608
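
The percentages in each HSP summary line are the stated counts divided by the alignment length. The following is a rough sketch for recomputing them from the text of a report like this one. The function name `hsp_percentages` and the regular expression are illustrative assumptions and do not come from any BLAST library.

```python
import re

# Matches "Identity = 166/421 (39.43%)" and the corresponding positives field.
# The optional "i" tolerates the "Postives" spelling found in some exports.
HSP_STATS = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")

def hsp_percentages(line):
    """Return {label: (recomputed_percent, reported_percent)} for one summary line."""
    out = {}
    for label, num, den, reported in HSP_STATS.findall(line):
        key = "Identity" if label == "Identity" else "Positives"
        out[key] = (round(100 * int(num) / int(den), 2), float(reported))
    return out

stats = hsp_percentages("Identity = 166/421 (39.43%), Positives = 250/421 (59.38%), Query Frame = 1")
for key, (recomputed, reported) in stats.items():
    print(key, recomputed, reported)
```

For the POL3_DROME hit above, both recomputed values agree with the reported ones (39.43% identity and 59.38% positives over 421 aligned positions).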

BLAST of CSPI01G22700.1 vs. Swiss-Prot
Match: POL2_DROME (Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaster GN=pol PE=3 SV=1)

HSP 1 Score: 300.1 bits (767), Expect = 7.1e-80
Identity = 175/462 (37.88%), Positives = 265/462 (57.36%), Query Frame = 1

Query: 3   KESMEELRPEFEQLQLEF---ENVFNMPAELSPMRQVDHR----------IKLKEGT--- 62
           +ES+++L  +F Q +L+    E  F +   L+  R ++++          IK    T   
Sbjct: 147 QESIKKL--DFSQFRLDHLNQEETFKLKGLLNKFRNLEYKEGEKLTFTNTIKHVLNTTHN 206

Query: 63  DPINVRPYRYPHAQKNEIEKLVNEMLDFGIIQPSISPFSSLVILVKKKDGG-----WRFC 122
            PI  + Y      + E+E  V EML+ G+I+ S SP++S   +V KK        +R  
Sbjct: 207 SPIYSKQYPLAQTHEIEVENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVV 266

Query: 123 VDYRALNRATVPDKFPIP-MIQLLDELNGASVFSKIDLKSGYHQIRVRNEDVRKTAFRTH 182
           +DYR LN  T+PD++PIP M ++L +L     F+ IDL  G+HQI +  E + KTAF T 
Sbjct: 267 IDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTK 326

Query: 183 EGHYEFLVMPFELTNAPSTFQALMNQVFRPYLLKFLLVYFDDTLMYSKDVETHLEHLTMV 242
            GHYE+L MPF L NAP+TFQ  MN + RP L K  LVY DD +++S  +  HL  + +V
Sbjct: 327 SGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLV 386

Query: 243 FQLLRQHCLFANRKKCHFVKDRIEYLGHWVSAKGVEADHEKVKAMLEWPVPKNVRELRGF 302
           F  L    L     KC F+K    +LGH V+  G++ +  KVKA++ +P+P   +E+R F
Sbjct: 387 FTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAF 446

Query: 303 LGLTGYYRRFVANYGAIAMPLMRLTKKNNFRWSE--EATQAFEFLKKAMVTLPILVLPNF 362
           LGLTGYYR+F+ NY  IA P+    KK     ++  E  +AFE LK  ++  PIL LP+F
Sbjct: 447 LGLTGYYRKFIPNYADIAKPMTSCLKKRTKIDTQKLEYIEAFEKLKALIIRDPILQLPDF 506

Query: 363 QLPFEIETDASWFGLSVVLSQNKKLIAYFSQKLSEAAREKSVYERELMAIVLAVEKWRHY 422
           +  F + TDAS   L  VLSQN   I++ S+ L++     S  E+EL+AIV A + +RHY
Sbjct: 507 EKKFVLTTDASNLALGAVLSQNGHPISFISRTLNDHELNYSAIEKELLAIVWATKTFRHY 566

Query: 423 LLGHRFVVYTDQKALRHILEQRELILGVQKWIMKLMGFDFEV 441
           LLG +F++ +D + LR +   +E    +++W ++L  + F++
Sbjct: 567 LLGRQFLIASDHQPLRWLHNLKEPGAKLERWRVRLSEYQFKI 606

BLAST of CSPI01G22700.1 vs. Swiss-Prot
Match: YI31B_YEAST (Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY3B-I PE=3 SV=2)

HSP 1 Score: 293.5 bits (750), Expect = 6.6e-78
Identity = 172/465 (36.99%), Positives = 259/465 (55.70%), Query Frame = 1

Query: 27   PAELSPMRQVDHRIKLKEGTDPINVRPYRYPHAQKNEIEKLVNEMLDFGIIQPSISPFSS 86
            PA+++ +  V H I++K G     ++PY      + EI K+V ++LD   I PS SP SS
Sbjct: 602  PADINNI-PVKHDIEIKPGARLPRLQPYHVTEKNEQEINKIVQKLLDNKFIVPSKSPCSS 661

Query: 87   LVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIQ-LLDELNGASVFSKIDLKSGYHQI 146
             V+LV KKDG +R CVDYR LN+AT+ D FP+P I  LL  +  A +F+ +DL SGYHQI
Sbjct: 662  PVVLVPKKDGTFRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIFTTLDLHSGYHQI 721

Query: 147  RVRNEDVRKTAFRTHEGHYEFLVMPFELTNAPSTFQALMNQVFRPYLLKFLLVYFDDTLM 206
             +  +D  KTAF T  G YE+ VMPF L NAPSTF   M   FR   L+F+ VY DD L+
Sbjct: 722  PMEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFARYMADTFRD--LRFVNVYLDDILI 781

Query: 207  YSKDVETHLEHLTMVFQLLRQHCLFANRKKCHFVKDRIEYLGHWVSAKGVEADHEKVKAM 266
            +S+  E H +HL  V + L+   L   +KKC F  +  E+LG+ +  + +     K  A+
Sbjct: 782  FSESPEEHWKHLDTVLERLKNENLIVKKKKCKFASEETEFLGYSIGIQKIAPLQHKCAAI 841

Query: 267  LEWPVPKNVRELRGFLGLTGYYRRFVANYGAIAMPLMRLTKKNNFRWSEEATQAFEFLKK 326
             ++P PK V++ + FLG+  YYRRF+ N   IA P+ +L   +  +W+E+  +A E LK 
Sbjct: 842  RDFPTPKTVKQAQRFLGMINYYRRFIPNCSKIAQPI-QLFICDKSQWTEKQDKAIEKLKA 901

Query: 327  AMVTLPILVLPNFQLPFEIETDASWFGLSVVLSQ--NKK----LIAYFSQKLSEAAREKS 386
            A+   P+LV  N +  + + TDAS  G+  VL +  NK     ++ YFS+ L  A +   
Sbjct: 902  ALCNSPVLVPFNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGVVGYFSKSLESAQKNYP 961

Query: 387  VYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHILEQRELILGVQKWIMKLMGFDFE 446
              E EL+ I+ A+  +R+ L G  F + TD  +L  +  + E    VQ+W+  L  +DF 
Sbjct: 962  AGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEPARRVQRWLDDLATYDFT 1021

Query: 447  VQEDTKLKAIFVRLLADPDCIPHYTGVIPDNYAPINESQQSCFWE 485
            ++     K     ++AD      YT + P+   PI+      +++
Sbjct: 1022 LEYLAGPK----NVVADAISRAIYT-ITPETSRPIDTESWKSYYK 1057

BLAST of CSPI01G22700.1 vs. Swiss-Prot
Match: YG31B_YEAST (Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY3B-G PE=1 SV=3)

HSP 1 Score: 292.4 bits (747), Expect = 1.5e-77
Identity = 171/465 (36.77%), Positives = 259/465 (55.70%), Query Frame = 1

Query: 27   PAELSPMRQVDHRIKLKEGTDPINVRPYRYPHAQKNEIEKLVNEMLDFGIIQPSISPFSS 86
            PA+++ +  V H I++K G     ++PY      + EI K+V ++LD   I PS SP SS
Sbjct: 576  PADINNI-PVKHDIEIKPGARLPRLQPYHVTEKNEQEINKIVQKLLDNKFIVPSKSPCSS 635

Query: 87   LVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIQ-LLDELNGASVFSKIDLKSGYHQI 146
             V+LV KKDG +R CVDYR LN+AT+ D FP+P I  LL  +  A +F+ +DL SGYHQI
Sbjct: 636  PVVLVPKKDGTFRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIFTTLDLHSGYHQI 695

Query: 147  RVRNEDVRKTAFRTHEGHYEFLVMPFELTNAPSTFQALMNQVFRPYLLKFLLVYFDDTLM 206
             +  +D  KTAF T  G YE+ VMPF L NAPSTF   M   FR   L+F+ VY DD L+
Sbjct: 696  PMEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFARYMADTFRD--LRFVNVYLDDILI 755

Query: 207  YSKDVETHLEHLTMVFQLLRQHCLFANRKKCHFVKDRIEYLGHWVSAKGVEADHEKVKAM 266
            +S+  E H +HL  V + L+   L   +KKC F  +  E+LG+ +  + +     K  A+
Sbjct: 756  FSESPEEHWKHLDTVLERLKNENLIVKKKKCKFASEETEFLGYSIGIQKIAPLQHKCAAI 815

Query: 267  LEWPVPKNVRELRGFLGLTGYYRRFVANYGAIAMPLMRLTKKNNFRWSEEATQAFEFLKK 326
             ++P PK V++ + FLG+  YYRRF+ N   IA P+ +L   +  +W+E+  +A + LK 
Sbjct: 816  RDFPTPKTVKQAQRFLGMINYYRRFIPNCSKIAQPI-QLFICDKSQWTEKQDKAIDKLKD 875

Query: 327  AMVTLPILVLPNFQLPFEIETDASWFGLSVVLSQ--NKK----LIAYFSQKLSEAAREKS 386
            A+   P+LV  N +  + + TDAS  G+  VL +  NK     ++ YFS+ L  A +   
Sbjct: 876  ALCNSPVLVPFNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGVVGYFSKSLESAQKNYP 935

Query: 387  VYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHILEQRELILGVQKWIMKLMGFDFE 446
              E EL+ I+ A+  +R+ L G  F + TD  +L  +  + E    VQ+W+  L  +DF 
Sbjct: 936  AGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEPARRVQRWLDDLATYDFT 995

Query: 447  VQEDTKLKAIFVRLLADPDCIPHYTGVIPDNYAPINESQQSCFWE 485
            ++     K     ++AD      YT + P+   PI+      +++
Sbjct: 996  LEYLAGPK----NVVADAISRAVYT-ITPETSRPIDTESWKSYYK 1031

BLAST of CSPI01G22700.1 vs. Swiss-Prot
Match: POL5_DROME (Retrovirus-related Pol polyprotein from transposon opus OS=Drosophila melanogaster GN=pol PE=3 SV=1)

HSP 1 Score: 286.2 bits (731), Expect = 1.1e-75
Identity = 165/448 (36.83%), Positives = 260/448 (58.04%), Query Frame = 1

Query: 19  EFENVFNMPAELSPMRQVDHRIKLKEGT---DPINVRPYRYPHAQKNEIEKLVNEMLDFG 78
           EF  +F  P  LS M  V+  +K +  T   DPI  + Y YP   + E+E+ ++E+L  G
Sbjct: 94  EFPRIFEPP--LSGM-SVETAVKAEIRTNTQDPIYAKSYPYPVNMRGEVERQIDELLQDG 153

Query: 79  IIQPSISPFSSLVILVKKK-----DGGWRFCVDYRALNRATVPDKFPIPMIQL-LDELNG 138
           II+PS SP++S + +V KK     +  +R  VD++ LN  T+PD +PIP I   L  L  
Sbjct: 154 IIRPSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFKRLNTVTIPDTYPIPDINATLASLGN 213

Query: 139 ASVFSKIDLKSGYHQIRVRNEDVRKTAFRTHEGHYEFLVMPFELTNAPSTFQALMNQVFR 198
           A  F+ +DL SG+HQI ++  D+ KTAF T  G YEFL +PF L NAP+ FQ +++ + R
Sbjct: 214 AKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGLKNAPAIFQRMIDDILR 273

Query: 199 PYLLKFLLVYFDDTLMYSKDVETHLEHLTMVFQLLRQHCLFANRKKCHFVKDRIEYLGHW 258
            ++ K   VY DD +++S+D +TH ++L +V   L +  L  N +K HF+  ++E+LG+ 
Sbjct: 274 EHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNLEKSHFLDTQVEFLGYI 333

Query: 259 VSAKGVEADHEKVKAMLEWPVPKNVRELRGFLGLTGYYRRFVANYGAIAMPLMRLTK--- 318
           V+A G++AD +KV+A+ E P P +V+EL+ FLG+T YYR+F+ +Y  +A PL  LT+   
Sbjct: 334 VTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRKFIQDYAKVAKPLTNLTRGLY 393

Query: 319 ---------KNNFRWSEEATQAFEFLKKAMVTLPILVLPNFQLPFEIETDASWFGLSVVL 378
                    K      E A Q+F  LK  + +  IL  P F  PF + TDAS + +  VL
Sbjct: 394 ANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKPFHLTTDASNWAIGAVL 453

Query: 379 SQN----KKLIAYFSQKLSEAAREKSVYERELMAIVLAVEKWRHYLLGHRFV-VYTDQKA 438
           SQ+     + IAY S+ L++     +  E+E++AI+ +++  R YL G   + VYTD + 
Sbjct: 454 SQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNLRAYLYGAGTIKVYTDHQP 513

Query: 439 LRHILEQRELILGVQKWIMKLMGFDFEV 441
           L   L  R     +++W  ++  ++ E+
Sbjct: 514 LTFALGNRNFNAKLKRWKARIEEYNCEL 538

BLAST of CSPI01G22700.1 vs. TrEMBL
Match: A5BRL2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_013478 PE=4 SV=1)

HSP 1 Score: 715.7 bits (1846), Expect = 6.0e-203
Identity = 363/777 (46.72%), Positives = 488/777 (62.81%), Query Frame = 1

Query: 4   ESMEELRPEFEQLQLEFENVFNMPAELSPMRQVDHRIKLKEGTDPINVRPYRYPHAQKNE 63
           E  + +  + +QL   FE++F  P +L P R++DHRI LKEGT+P+NVRPYRY + QK E
Sbjct: 155 EVQQAIHLDMQQLIKAFEDIFQKPNQLPPAREIDHRITLKEGTEPVNVRPYRYAYFQKAE 214

Query: 64  IEKLVNEMLDFGIIQPSISPFSSLVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIQ- 123
           IEK V +ML  G+I+ S S FSS V+LVKKKDG WRFC DYRALN  T+ D+FPIP +  
Sbjct: 215 IEKQVCDMLKLGLIKASTSLFSSPVLLVKKKDGTWRFCTDYRALNAVTIKDRFPIPTVDD 274

Query: 124 LLDELNGASVFSKIDLKSGYHQIRVRNEDVRKTAFRTHEGHYEFLVMPFELTNAPSTFQA 183
           +LDEL+GA+ F+K+DL++GYH +RV   D+ KTAFRTH GHYE+LVMPF L+NAPSTFQA
Sbjct: 275 MLDELHGATYFTKLDLRAGYHYVRVHPPDIPKTAFRTHNGHYEYLVMPFGLSNAPSTFQA 334

Query: 184 LMNQVFRPYLLKFLLVYFDDTLMYSKDVETHLEHLTMVFQLLRQHCLFANRKKCHFVKDR 243
           +MN +FRPYL KF+LV+F D L+YS +   HLEH+   F++LRQH  F    KC F +  
Sbjct: 335 IMNSIFRPYLGKFVLVFFXDILIYSPNXNMHLEHVKQAFEILRQHQFFVKISKCAFGQXE 394

Query: 244 IEYLGHWVSAKGVEADHEKVKAMLEWPVPKNVRELRGFLGLTGYYRRFVANYGAIAMPLM 303
           +EYLGH V+  GV+ D  K+KAML WP P N+ EL GFLGLTGYYR+FV NYG IA  L 
Sbjct: 395 LEYLGHIVTXXGVQVDXGKIKAMLNWPRPTNISELHGFLGLTGYYRKFVRNYGIIARALT 454

Query: 304 RLTKKNNFRWSEEATQAFEFLKKAMVTLPILVLPNFQLPFEIETDASWFGLSVVLSQNKK 363
            L KK  F W+++A  AF+ LK+AM + P L +PNF  PF IE+DA   G+  VL+Q  K
Sbjct: 455 NLLKKGQFAWTKDAETAFQALKQAMTSTPTLAMPNFNEPFVIESDALGDGIGAVLTQQGK 514

Query: 364 LIAYFSQKLSEAAREKSVYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHILEQREL 423
            IA+ S+ L  + R  S+Y RE++AIV A++ WR YLLG +F + TDQ++L+++LEQR  
Sbjct: 515 PIAFMSRALGVSKRSWSIYAREMLAIVHAIQTWRPYLLGRKFYIQTDQRSLKYLLEQRIA 574

Query: 424 ILGVQKWIMKLMGFDFEVQ----EDTKLKAIFVRLLADPDCIPHYTGVIP---------D 483
               Q+W+ KL+G+D+E+      +   +    R+++ P     +    P          
Sbjct: 575 TPEQQEWVAKLLGYDYEITYKXGRENSAENALSRVVSSPSLNALFVPQAPLWDEIKAEAI 634

Query: 484 NYAPINESQQSCFWEGMKNDIKLYVDQCH------------------------DIFMDFV 543
            +  +++  +   W+    D     D C                         DI MDF+
Sbjct: 635 KHPYMDKIDKLANWQQTVQDYVSSCDVCQRVKSETLALAGLLQPLPIPCLVWDDITMDFI 694

Query: 544 EGLPRSKGVDTVLVVVDRLSKYAHFITLGHPFSAQTVAMVFVKEIVRLHGYPRSIVSDRD 603
           EGLP S G +T+LVVVD LSK AHF  L HPF+A+ VA  FV+ +V+LHG P+SI+SDRD
Sbjct: 695 EGLPTSNGKNTILVVVDHLSKSAHFFALAHPFTAKMVAEKFVEGVVKLHGMPKSIISDRD 754

Query: 604 RVFLSHFWKELYRLQGTQLKRSTTYHPQTDGQSEVINKCLELYLRWFCQEKPRTWSDKIA 663
            VF+S FW+E ++L GTQLK S++YHPQTDGQSEV+N+C+E YL  +    PR WS  + 
Sbjct: 755 PVFMSQFWQEFFKLSGTQLKMSSSYHPQTDGQSEVVNRCVEQYLCCYAHHHPRKWSFFLP 814

Query: 664 WAEYWYNTNYQSSIKNTLYAVVYGQPPPPIISY------------------GQAEQMKKF 714
           W E+WYNT Y +S   T +  +YG+ PP I  Y                      Q+K  
Sbjct: 815 WVEFWYNTTYHTSTGMTPFQALYGRLPPNIPHYLMGTTPVHAVDQNLASRDAILRQLKTN 874

BLAST of CSPI01G22700.1 vs. TrEMBL
Match: A0A087GH17_ARAAL (Uncharacterized protein OS=Arabis alpina GN=AALP_AA7G098300 PE=4 SV=1)

HSP 1 Score: 704.5 bits (1817), Expect = 1.4e-199
Identity = 367/727 (50.48%), Positives = 476/727 (65.47%), Query Frame = 1

Query: 19   EFENVFNMPAELSPMRQVDHRIKLKEGTDPINVRPYRYPHAQKNEIEKLVNEMLDFGIIQ 78
            EF +VF  P  L P R  +H I L+ G   ++VRP+RYP  Q+ E+EK V  ML  GII+
Sbjct: 338  EFASVFEEPQGLPPCRDKEHAIVLETGASLVSVRPFRYPQVQREELEKQVATMLAAGIIK 397

Query: 79   PSISPFSSLVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMI-QLLDELNGASVFSKID 138
             S SPFSS V+LVKKKDG WRFCVDYRALN+ TV D +PIPMI QLLDEL+GA +FSK+D
Sbjct: 398  ESTSPFSSPVLLVKKKDGTWRFCVDYRALNKVTVGDSYPIPMIDQLLDELHGAIMFSKLD 457

Query: 139  LKSGYHQIRVRNEDVRKTAFRTHEGHYEFLVMPFELTNAPSTFQALMNQVFRPYLLKFLL 198
            +++GYHQIRV+ EDV KTAFRTH+GHYEFLVMPF LTNAP+TFQ+LM+ VFR +L +F+L
Sbjct: 458  MRAGYHQIRVKAEDVPKTAFRTHDGHYEFLVMPFGLTNAPTTFQSLMDDVFRQFLRRFVL 517

Query: 199  VYFDDTLMYSKDVETHLEHLTMVFQLLRQHCLFANRKKCHFVKDRIEYLGHWVSAKGVEA 258
            V+FDD L+YSK    H  H+ +V Q L  H L+AN KKC F K  +EYLGH +S +GV A
Sbjct: 518  VFFDDILIYSKTEAEHQAHVRIVLQTLADHQLYANAKKCEFGKSEVEYLGHVISGRGVAA 577

Query: 259  DHEKVKAMLEWPVPKNVRELRGFLGLTGYYRRFVANYGAIAMPLMRLTKKNNFRWSEEAT 318
            D  KVKAM++WP PKNV+ LRGFLGLTGYYR+FV  YG IA PL  L KK+ F+WS  A 
Sbjct: 578  DPTKVKAMVDWPPPKNVKALRGFLGLTGYYRKFVKGYGGIARPLTALLKKDQFKWSPTAE 637

Query: 319  QAFEFLKKAMVTLPILVLPNFQLPFEIETDASWFGLSVVLSQNKKLI--AYFSQKLSEAA 378
              F+ LK AM T+P+L L +F   F +E+DAS  GL             AYFSQ L++  
Sbjct: 638  ATFQALKAAMSTVPVLALVDFSKQFVVESDASGIGLGXXXXXXXXXXXXAYFSQALTDRH 697

Query: 379  REKSVYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHILEQRELILGVQKWIMKLMG 438
            + KSVYERELMA+V A++KWRHYLLG RFVV TDQ++L+ +LEQRE+ L  Q+W+ K++G
Sbjct: 698  KLKSVYERELMAVVFAIQKWRHYLLGRRFVVRTDQRSLKFLLEQREINLEYQRWLSKILG 757

Query: 439  FDFEVQEDTKLKAIFVRLLADPDCIPHYTGVIPDNYAPINESQQSCFWEGMKNDIKLYVD 498
            FDFE+Q    L+      L+  +     T  +     P+   Q S F   +  D  L   
Sbjct: 758  FDFEIQYKPGLENKAADALSRVE-----THQLLALSMPV-AIQMSEFESEVDQDEDL--S 817

Query: 499  QCHDIFMDFVEGLPRSKGVDTVLVVVDRLSKYAHFITLGHPFSAQTVAMVFVKEIVRLHG 558
            +     +      P    V   L+   RL+KYAHFI + HP+ A  VA+ FVKE+VRLHG
Sbjct: 818  KLKKAVLANPGDHPDYSIVQGRLLRKGRLTKYAHFIKMSHPYEAAEVALTFVKEVVRLHG 877

Query: 559  YPRSIVSDRDRVFLSHFWKELYRLQGTQLKRSTTYHPQTDGQSEVINKCLELYLRWFCQE 618
            YPR+IV DRD  F   FW EL+RL GT L  ST YHPQ+DGQ+EV N+ +E YLR FC E
Sbjct: 878  YPRTIVLDRDITFTGKFWGELFRLAGTHLCFSTAYHPQSDGQTEVTNRGMETYLRCFCSE 937

Query: 619  KPRTWSDKIAWAEYWYNTNYQSSIKNTLYAVVYGQPPPPIISYGQA-------------- 678
            KP+ WS  + WAE  YNT+Y ++I+ T +  VYG+ PP ++ + +               
Sbjct: 938  KPKKWSGYLVWAELSYNTSYHTAIRMTPFKAVYGREPPTLLQFERGSTDNATLEDQLLER 997

Query: 679  ---------------EQMKKFADVYRRNVIFDIGDWVYLKLQPYRQQSVAKKRCEKLSPR 714
                           + MK+ AD +RR V F +GD V+LK++PYRQ+++A++  EKL+ R
Sbjct: 998  DEMLGIQQQQLLRTQQIMKQQADNHRREVEFAVGDMVFLKIRPYRQKTLARRANEKLAAR 1056

BLAST of CSPI01G22700.1 vs. TrEMBL
Match: Q2QZQ5_ORYSJ (Retrotransposon protein, putative, unclassified OS=Oryza sativa subsp. japonica GN=LOC_Os11g45200 PE=4 SV=2)

HSP 1 Score: 616.7 bits (1589), Expect = 3.8e-173
Identity = 334/759 (44.01%), Positives = 457/759 (60.21%), Query Frame = 1

Query: 4    ESMEELRPEFEQLQLEFENVFNMPAELSPMRQVDHRIKLKEGTDPINVRPYRYPHAQKNE 63
            E  + +    +++  EF  VF  P  L P R  DHRI L EG  P+N+RPYRY    K+E
Sbjct: 465  EEHQNVPAPVQKILQEFAGVFAEPRGLPPTRYCDHRIPLIEGAQPVNLRPYRYNPELKDE 524

Query: 64   IEKLVNEMLDFGIIQPSISPFSSLVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMI-Q 123
            IE+ V EML  G+IQPS S +SS  +LV+KK G WR CVDYR LN  T+  K+P+P+I +
Sbjct: 525  IERQVAEMLSSGVIQPSQSTWSSPALLVRKKYGTWRLCVDYRHLNALTIKSKYPVPIIKE 584

Query: 124  LLDELNGASVFSKIDLKSGYHQIRVRNEDVRKTAFRTHEGHYEFLVMPFELTNAPSTFQA 183
            LLDEL+GA  FSK+DL++GYHQIR+   +  KTAF+TH  HYE+ VM F LT AP+TFQ 
Sbjct: 585  LLDELSGAKWFSKLDLRAGYHQIRMVPGEEHKTAFQTHSSHYEYRVMSFGLTGAPATFQG 644

Query: 184  LMNQVFRPYLLKFLLVYFDDTLMYSKDVETHLEHLTMVFQLLRQHCLFANRKKCHFVKDR 243
            +MN+     L K  LV+FDD L+YS D+++HL HL  V QLLRQ        KC F + +
Sbjct: 645  VMNKTLASVLRKCALVFFDDILVYSPDLQSHLTHLKQVLQLLRQDHWQVKMSKCSFAQPQ 704

Query: 244  IEYLGHWVSAKGVEADHEKVKAMLEWPVPKNVRELRGFLGLTGYYRRFVANYGAIAMPLM 303
            + YLGH + A+GV  + +K++ +L WP P +V++LRGFLGL GYYR+FV N+G I+ PL 
Sbjct: 705  VSYLGHIIGAQGVSTEPKKIQDVLTWPTPISVKKLRGFLGLAGYYRKFVKNFGIISKPLT 764

Query: 304  RLTKKN-NFRWSEEATQAFEFLKKAMVTLPILVLPNFQLPFEIETDASWFGLSVVLSQNK 363
            +L +K  +FRW  EA  AF+ LK+A+ + P+L LP+F   F +ET+AS  G+  VLSQ  
Sbjct: 765  QLLRKGVSFRWGSEAEAAFQQLKQALTSAPVLGLPDFSKQFTVETNASDAGIGAVLSQEG 824

Query: 364  KLIAYFSQKLSEAAREKSVYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHILEQRE 423
              IAY S+ L   ++  S YE+E MAI+LAV+ WR YL    F++ TD  +L H+ +QR 
Sbjct: 825  HPIAYLSKALGPRSKGLSTYEKECMAILLAVDHWRSYLQHQEFLILTDYHSLVHLDDQRL 884

Query: 424  LILGVQKWIMKLMGFDF-------------------EVQEDTKLKAIFVRLLADPDCIPH 483
                 Q+   KL+G  +                   EV E  +L AI V + A P+ +  
Sbjct: 885  HTPWQQRAFTKLLGLQYKIGYRKGSSNAVADALSRREVGEGGQLSAISVCIQAKPERVK- 944

Query: 484  YTGVIPDNYAPINESQQSCFWEGMKNDIKLYVDQCHDIFMDFVEGLPRSKGVDTVLVVVD 543
            Y G++     P+ E      W+               I MDF+EGLP+S+  + +LVVVD
Sbjct: 945  YPGLLQP--LPVPEGA----WQ--------------TITMDFLEGLPKSERYNCILVVVD 1004

Query: 544  RLSKYAHFITLGHPFSAQTVAMVFVKEIVRLHGYPRSIVSDRDRVFLSHFWKELYRLQGT 603
            + SKYAHF+ L HPF+A+TVA  F+K I +LHG PR IVSDRD++F S FW+ L+   GT
Sbjct: 1005 KFSKYAHFVPLTHPFTAETVATAFMKNIYKLHGMPRVIVSDRDKIFTSQFWEYLFTKSGT 1064

Query: 604  QLKRSTTYHPQTDGQSEVINKCLELYLRWFCQEKPRTWSDKIAWAEYWYNTNYQSSIKNT 663
            +L  S+ YHPQ+DGQ+E +N+C+EL+LR F    P  W+  +  AE+WYN  Y S++K T
Sbjct: 1065 ELHMSSAYHPQSDGQTERVNQCVELFLRCFVHATPTKWAAWLHLAEFWYNNAYHSAVKQT 1124

Query: 664  LYAVVYGQPPP-----------PIIS-----------------YGQAEQMKKFADVYRRN 714
             + V+YG  P            P +                  +   +QMK +AD  R  
Sbjct: 1125 PFEVIYGHQPAHFGITMEDCAVPDLQEWLRDRKFMHQLIQQHLHRAQQQMKAYADKNRSF 1184

BLAST of CSPI01G22700.1 vs. TrEMBL
Match: M5XEL3_PRUPE (Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa019597mg PE=4 SV=1)

HSP 1 Score: 616.3 bits (1588), Expect = 5.0e-173
Identity = 354/866 (40.88%), Positives = 473/866 (54.62%), Query Frame = 1

Query: 20   FENVFNMPAELSPMRQVDHRIKLKEGTDPINVRPYRYPHAQKNEIEKLVNEMLDFGIIQP 79
            F ++F     L P R +DHRI L  GT PINVRPYRYPH QK EIE  V  ML  GII+ 
Sbjct: 485  FSDLFEESLGLPPSRAIDHRIPLLPGTGPINVRPYRYPHWQKAEIESQVKAMLQAGIIRR 544

Query: 80   SISPFSSLVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMI-QLLDELNGASVFSKIDL 139
            S SPFSS V+LV KK+G WRFCVDYRALN+ TV DKFPIP+I ++LDELNGA+ FSK+DL
Sbjct: 545  SSSPFSSPVLLVSKKEGTWRFCVDYRALNQVTVKDKFPIPVIDEMLDELNGAAWFSKLDL 604

Query: 140  KSGYHQIRVRNEDVRKTAFRTHEGHYEFLVMPFELTNAPSTFQALMNQVFRPYLLKFLLV 199
            +SGYHQIR+R+ D+ KTAFRTHEGHYEFLVMPF L+NAPSTFQALMN +FRPYL KF+LV
Sbjct: 605  RSGYHQIRMRDADILKTAFRTHEGHYEFLVMPFGLSNAPSTFQALMNDIFRPYLRKFVLV 664

Query: 200  YFDDTLMYSKDVETHLEHLTMVFQLLRQHCLFANRKKCHFVKDRIEYLGHWVSAKGVEAD 259
            +FDD L+YS+ +  H+ HLT VF++LR   L     KC F +  ++YLGH +S  GV  D
Sbjct: 665  FFDDILVYSRTLNEHVHHLTTVFEVLRVAQLKMKASKCTFAQSTVDYLGHTISEAGVSVD 724

Query: 260  HEKVKAMLEWPVPKNVRELRGFLGLTGYYRRFVANYGAIAMPLMRLTKKNNFRWSEEATQ 319
             +K++ +  WP P+ V+ LRGFLGL GYYR+FV ++G I+ PL  L +K+NF WS  A  
Sbjct: 725  KKKIQCIDNWPRPETVKGLRGFLGLAGYYRKFVHHFGTISKPLTDLLRKDNFHWSPAADS 784

Query: 320  AFEFLKKAMVTLPILVLPNFQLPFEIETDASWFGLSVVLSQNKKLIAYFSQKLSE----- 379
            AF+ LK A+ T P+L LP+F   F +E+DAS  G+  +LSQ ++ IAY S+ LSE     
Sbjct: 785  AFQALKTALTTTPVLRLPDFSKQFVVESDASNNGVGAILSQEQRPIAYLSKSLSERHRSL 844

Query: 380  ----------------------AAREKSVYERELMAIVL-------AVEKWRHYLLGHRF 439
                                    + K V + + +   L         EKW   LLG+ +
Sbjct: 845  SVYDKEMLAVVLAVQQWRPYLLGRQFKIVTDHQTIKHFLEQRITTPTQEKWLLKLLGYNY 904

Query: 440  VVYT---DQKALRHILEQRELILGVQKWIMKLMGFDFEVQE----DTKLKAIFVRLLADP 499
             +      + A    L ++  +L +      +     ++Q+    D++ + +   L ADP
Sbjct: 905  EIEYRAGSKNAGPDALSRKSELLAIMGLSTPIFYCIPQIQQAYTSDSEAQQLISLLQADP 964

Query: 500  DCIPHYTGVIPDNY------APINESQQSCF---------------------------WE 559
               PHY+      Y       P++   ++                             W 
Sbjct: 965  TAKPHYSWQNNCLYYKERVFVPVSSQWRTMILEEFHSTPMGGHSGQLRTYKRILRNFRWP 1024

Query: 560  GMKNDIKLYVDQC---------------------------HDIFMDFVEGLPRSKGVDTV 619
             +K D++ +V  C                            DI MDFVEGLP   G + +
Sbjct: 1025 RLKKDVQAFVAACDTCQRQNYEALHPPGLLQPLPIPDSIWQDIAMDFVEGLPSVNGKNAI 1084

Query: 620  LVVVDRLSKYAHFITLGHPFSAQTVAMVFVKEIVRLHGYPRSIVSDRDRVFLSHFWKELY 679
            LVVVDRLSKY HFI + HP++A  VA  F+ E+ +LHG PR+IVSDRD  F S FW   +
Sbjct: 1085 LVVVDRLSKYGHFIPIKHPYTASQVADFFICEVFKLHGMPRTIVSDRDPTFTSQFWTSFF 1144

Query: 680  RLQGTQLKRSTTYHPQTDGQSEVINKCLELYLRWFCQEKPRTWSDKIAWAEYWYNTNYQS 739
              QGT+L               ++N+ LE YLR F  +KP +W   + WAE+WYNT Y S
Sbjct: 1145 THQGTKL-------------CHILNRTLEHYLRCFVGDKPTSWVSWLPWAEWWYNTTYHS 1204

Query: 740  SIKNTLYAVVYGQPPPPIISYGQA-----------------------------EQMKKFA 752
            +IK T Y  VYGQPPP +  Y                                E+M  FA
Sbjct: 1205 AIKMTPYQAVYGQPPPSVEFYTSGSSAVQAVDLALRDRDTLLRRLRQNMQIAQERMTFFA 1264

BLAST of CSPI01G22700.1 vs. TrEMBL
Match: A0A151RRN1_CAJCA (Retrovirus-related Pol polyprotein from transposon 17.6 OS=Cajanus cajan GN=KK1_033279 PE=4 SV=1)

HSP 1 Score: 578.6 bits (1490), Expect = 1.1e-161
Identity = 321/737 (43.55%), Positives = 432/737 (58.62%), Query Frame = 1

Query: 67  LVNEMLDFGIIQPSISPFSSLVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMI-QLLD 126
           +V +ML  GII PS SPFSS ++LVKKKDG WRFC DYRALN  TV D FP+P + +LLD
Sbjct: 1   MVADMLAEGIITPSTSPFSSPILLVKKKDGSWRFCTDYRALNTITVKDNFPMPTVDELLD 60

Query: 127 ELNGASVFSKIDLKSGYHQIRVRNEDVRKTAFRTHEGHYEFLVMPFELTNAPSTFQALMN 186
           EL GA  FSK+DL+SGYHQI V+ ED  KTAFRTH+GHYE+LVMPF LTNAP+TFQ LMN
Sbjct: 61  ELFGAQFFSKLDLRSGYHQILVKPEDRHKTAFRTHQGHYEWLVMPFGLTNAPATFQQLMN 120

Query: 187 QVFRPYLLKFLLVYFDDTLMYSKDVETHLEHLTMVFQLLRQHCLFANRKKCHFVKDRIEY 246
           +VF+  L K +LV+FDD L+YS +  +HL+HL  V QLL+ H L+A   KC F   +++Y
Sbjct: 121 RVFQKLLRKCVLVFFDDILVYSPNWSSHLQHLEAVLQLLQSHVLYAKLSKCTFATQQVDY 180

Query: 247 LGHWVSAKGVEADHEKVKAMLEWPVPKNVRELRGFLGLTGYYRRFVANYGAIAMPLMRLT 306
           LGH VSAKGV  D  KV+A+L WP P N+++LRGFLG+TGYYRRF+ NY A+A PL  L 
Sbjct: 181 LGHTVSAKGVSMDKAKVQAILNWPEPTNLKQLRGFLGITGYYRRFIKNYAALAEPLTNLL 240

Query: 307 KKNNFRWSEEATQAFEFLKKAMVTLPILVLPNFQLPFEIETDASWFGLSVVLSQNKKLIA 366
           KK+ F WS+ A++ F+ L++A+ T P+L LPNF  PF +ETDAS  G+         + A
Sbjct: 241 KKDAFHWSDIASKTFQSLREAITTAPVLALPNFNQPFILETDASGTGIGAYKPGKDNIPA 300

Query: 367 -YFSQKLSEAARE-KSVYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHILEQRELI 426
              S+    A  E +  + +EL   +   E W+  L               H+  + +L+
Sbjct: 301 DALSRSFYMAWSETQPTFLQELKHDIATDEYWKQQLQDCEL----GNNQNPHLSSKDQLL 360

Query: 427 LGVQKWIMKLMGFDFEVQEDTKLKAIFVRLLADPDCIP--HYTGVIPDNYAPINESQQSC 486
                W  +L+     + + + L     ++L +  C P   ++G+       I+  +   
Sbjct: 361 F----WKGRLV-----IPQQSPL---ISKILEEYHCSPIGGHSGIA----RTISRVKAEF 420

Query: 487 FWEGMKNDIKLYVDQC---------------------------HDIFMDFVEGLPRSKGV 546
           +W  MK  I  +V  C                            DI MDF+ GLP SKG 
Sbjct: 421 YWPKMKEQIHRFVQHCSICQQAKYAAVQPAGLLQPLPIPSQIWEDISMDFITGLPVSKGF 480

Query: 547 DTVLVVVDRLSKYAHFITLGHPFSAQTVAMVFVKEIVRLHGYPRSIVSDRDRVFLSHFWK 606
             +LV+VDRLSKYAHF  L   +++  VA +F   +VRLHG P+SIVSDRD+ F S FW+
Sbjct: 481 TVILVIVDRLSKYAHFQPLKADYTSTQVADLFCNTVVRLHGMPKSIVSDRDKTFTSKFWQ 540

Query: 607 ELYRLQGTQLKRSTTYHPQTDGQSEVINKCLELYLRWFCQEKPRTWSDKIAWAEYWYNTN 666
           +L++LQGT L  ST YHPQ+DGQ+E +NK LELYLR F  + P+TW + + WAEYWYNT+
Sbjct: 541 QLFKLQGTTLAMSTAYHPQSDGQTEAVNKALELYLRCFTSQSPKTWVNFLPWAEYWYNTS 600

Query: 667 YQSSIKNTLYAVVYGQPPPPIISYGQA-----------------------------EQMK 726
           +  SI  T + VVYG+ PP ++ Y  +                             + MK
Sbjct: 601 FHHSIGMTPFKVVYGRDPPGLLRYQPSPSDNQSVKDSLLARDALLNKLKENLFRAQQYMK 660

Query: 727 KFADVYRRNVIFDIGDWVYLKLQPYRQQSVAKKRCEKLSPRYFGPYMILGRRSSLHAGPT 743
             AD  R    F IGD V++KL+PYRQ SV  ++ +KLS RYFGP+ IL +         
Sbjct: 661 HQADKKRIEKHFQIGDKVWVKLKPYRQHSVQLRQNQKLSMRYFGPFTILAK--------I 709
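
The percentages in an HSP summary follow directly from the counts printed beside them (for example, 321 identical positions over an alignment length of 737). A minimal sketch of that arithmetic is shown below. The regular expression and the function name `parse_hsp` are illustrative assumptions rather than part of any BLAST tool, and the pattern is written to accept either spelling of the "Positives" label.

```python
import re

# Hypothetical pattern for the two-line HSP summary used in this report.
# "Posi?tives" accepts both "Positives" and the "Postives" spelling.
HSP_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>\S+)\s+"
    r"Identity\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\([\d.]+%\),\s*"
    r"Posi?tives\s*=\s*(?P<pos>\d+)/\d+"
)


def parse_hsp(text):
    """Extract the score, E-value and recomputed percentages from one HSP summary."""
    m = HSP_RE.search(text)
    if m is None:
        raise ValueError("no HSP summary found")
    ident = int(m.group("ident"))
    length = int(m.group("length"))
    pos = int(m.group("pos"))
    return {
        "bits": float(m.group("bits")),
        "raw_score": int(m.group("raw")),
        "evalue": float(m.group("evalue")),
        # Percentages are counts divided by the alignment length.
        "pct_identity": round(100 * ident / length, 2),
        "pct_positives": round(100 * pos / length, 2),
    }


sample = (
    "HSP 1 Score: 578.6 bits (1490), Expect = 1.1e-161\n"
    "Identity = 321/737 (43.55%), Positives = 432/737 (58.62%), Query Frame = 1"
)
result = parse_hsp(sample)
print(result["pct_identity"], result["pct_positives"])  # 43.55 58.62
```

Recomputing the percentages from the counts, rather than reading the bracketed values, gives a quick consistency check on each HSP header.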

BLAST of CSPI01G22700.1 vs. TAIR10
Match: ATMG00860.1 (ATMG00860.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 154.8 bits (390), Expect = 2.1e-37
Identity = 75/131 (57.25%), Positives = 93/131 (70.99%), Query Frame = 1

Query: 214 LEHLTMVFQLLRQHCLFANRKKCHFVKDRIEYLGHW--VSAKGVEADHEKVKAMLEWPVP 273
           + HL MV Q+  QH  +ANRKKC F + +I YLGH   +S +GV AD  K++AM+ WP P
Sbjct: 1   MNHLGMVLQIWEQHQFYANRKKCAFGQPQIAYLGHRHIISGEGVSADPAKLEAMVGWPEP 60

Query: 274 KNVRELRGFLGLTGYYRRFVANYGAIAMPLMRLTKKNNFRWSEEATQAFEFLKKAMVTLP 333
           KN  ELRGFLGLTGYYRRFV NYG I  PL  L KKN+ +W+E A  AF+ LK A+ TLP
Sbjct: 61  KNTTELRGFLGLTGYYRRFVKNYGKIVRPLTELLKKNSLKWTEMAALAFKALKGAVTTLP 120

Query: 334 ILVLPNFQLPF 343
           +L LP+ +LPF
Sbjct: 121 VLALPDLKLPF 131

BLAST of CSPI01G22700.1 vs. TAIR10
Match: ATMG00850.1 (ATMG00850.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 51.6 bits (122), Expect = 2.5e-06
Identity = 22/39 (56.41%), Positives = 31/39 (79.49%), Query Frame = 1

Query: 60 QKNEIEKLVNEMLDFGIIQPSISPFSSLVILVKKKDGGW 99
          ++  ++  + EML+  IIQPSISP+SS V+LV+KKDGGW
Sbjct: 41 RRTRLKNWLGEMLEARIIQPSISPYSSPVLLVQKKDGGW 79

BLAST of CSPI01G22700.1 vs. NCBI nr
Match: gi|922465109|ref|XP_013633118.1| (PREDICTED: uncharacterized protein LOC106338764 [Brassica oleracea var. oleracea])

HSP 1 Score: 729.9 bits (1883), Expect = 4.4e-207
Identity = 366/767 (47.72%), Positives = 493/767 (64.28%), Query Frame = 1

Query: 7    EELRPEFEQLQLEFENVFNMPAELSPMRQVDHRIKLKEGTDPINVRPYRYPHAQKNEIEK 66
            +EL  +   L  EF  +F  P  L P+R ++H I LKEGT+PINVRPYRY + QK+EIE+
Sbjct: 502  KELNDDIRVLLDEFNGIFKTPDGLPPLRDIEHSITLKEGTNPINVRPYRYAYFQKDEIER 561

Query: 67   LVNEMLDFGIIQPSISPFSSLVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIQ-LLD 126
             VNEML  GII  S SPFSS V+LVKKKDG WRFC DYRALN AT+ D+FPIP ++ +L+
Sbjct: 562  QVNEMLQAGIIGTSSSPFSSPVLLVKKKDGSWRFCTDYRALNSATIKDRFPIPTVEDMLN 621

Query: 127  ELNGASVFSKIDLKSGYHQIRVRNEDVRKTAFRTHEGHYEFLVMPFELTNAPSTFQALMN 186
            EL+G++ F+K+DL +G+HQ+R+ + D+ KTAFRTH GH+E+LVMPF L NAPSTFQALMN
Sbjct: 622  ELHGSAYFTKLDLTAGFHQVRMSSADIHKTAFRTHHGHFEYLVMPFGLCNAPSTFQALMN 681

Query: 187  QVFRPYLLKFLLVYFDDTLMYSKDVETHLEHLTMVFQLLRQHCLFANRKKCHFVKDRIEY 246
             +FRPY+ KF+LV+FDD L+YS   E HL+H+  V  L++ H L    KKC F K  +EY
Sbjct: 682  DIFRPYMRKFVLVFFDDILVYSPTWEAHLQHVREVLSLIQHHKLSVKFKKCEFGKRELEY 741

Query: 247  LGHWVSAKGVEADHEKVKAMLEWPVPKNVRELRGFLGLTGYYRRFVANYGAIAMPLMRLT 306
            LGH +S  GV  D  KV+AM +W VP +V +LRGFLGLTGYYR+FV +YG IA PL  L 
Sbjct: 742  LGHIISNTGVTVDQSKVQAMTDWQVPTSVTDLRGFLGLTGYYRKFVRDYGLIARPLTNLL 801

Query: 307  KKNNFRWSEEATQAFEFLKKAMVTLPILVLPNFQLPFEIETDASWFGLSVVLSQNKKLIA 366
            +K  F WS +A  AF  LK+A+ T P L LP+F  PF IETDAS  G+  VLSQN + IA
Sbjct: 802  RKVKFIWSPQADTAFNNLKEALTTTPTLALPDFSKPFVIETDASGEGIGAVLSQNGQPIA 861

Query: 367  YFSQKLSEAAREKSVYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHILEQRELILG 426
            + S+ L    +  S Y RE++AI++A+  WR YLLG +F + TDQ++LR++LEQ  L   
Sbjct: 862  FMSRSLGVTKKAWSTYAREMLAIIIAIRTWRPYLLGRKFTIQTDQRSLRYMLEQHILTPE 921

Query: 427  VQKWIMKLMGFDFEVQEDTKLKAIFVRLLADPDCIPHYTGVIPDNYAPINESQQSCFWEG 486
             QKW+ KL+G+D++++      +I +  L + + +  ++G +              +W  
Sbjct: 922  QQKWMSKLVGYDYDIRH--VCFSIRMSHLDEDETLGGHSGFL----RTFKRLSHHFYWPS 981

Query: 487  MKNDIKLYVDQC---------------------------HDIFMDFVEGLPRSKGVDTVL 546
            M      Y+  C                            DI MDFV+GLPRS  + +++
Sbjct: 982  MHTTEVDYISHCDTCQRAKSQTMSPAGLLQPLPVPEQIWEDISMDFVDGLPRSGSLTSIM 1041

Query: 547  VVVDRLSKYAHFITLGHPFSAQTVAMVFVKEIVRLHGYPRSIVSDRDRVFLSHFWKELYR 606
            V V+RLSK AH I L HP++A  VA  F+  IV+LHG PR+I+SDRD +FLSHFWKEL+R
Sbjct: 1042 VFVNRLSKSAHLIPLSHPYTASIVATQFIANIVKLHGPPRTILSDRDPIFLSHFWKELWR 1101

Query: 607  LQGTQLKRSTTYHPQTDGQSEVINKCLELYLRWFCQEKPRTWSDKIAWAEYWYNTNYQSS 666
            L GT L+ ST YHPQTDGQ+EV+N+C+E YLR F Q++P  WS  + WAEYWYNT + SS
Sbjct: 1102 LSGTTLQMSTAYHPQTDGQTEVVNRCIEQYLRCFVQQRPTHWSSFLPWAEYWYNTTFHSS 1161

Query: 667  IKNTLYAVVYGQPPPPIISY-------GQAEQMKKFADVYR------------------- 717
               T +  +YG+PPP I  Y       G+ ++  +  D                      
Sbjct: 1162 TGTTPFQTLYGRPPPAIPRYELGSTLVGEIDEQLQHRDELLDELKHHLEASNNRMKQLAD 1221

BLAST of CSPI01G22700.1 vs. NCBI nr
Match: gi|147775005|emb|CAN70471.1| (hypothetical protein VITISV_013478 [Vitis vinifera])

HSP 1 Score: 715.7 bits (1846), Expect = 8.6e-203
Identity = 363/777 (46.72%), Positives = 488/777 (62.81%), Query Frame = 1

Query: 4   ESMEELRPEFEQLQLEFENVFNMPAELSPMRQVDHRIKLKEGTDPINVRPYRYPHAQKNE 63
           E  + +  + +QL   FE++F  P +L P R++DHRI LKEGT+P+NVRPYRY + QK E
Sbjct: 155 EVQQAIHLDMQQLIKAFEDIFQKPNQLPPAREIDHRITLKEGTEPVNVRPYRYAYFQKAE 214

Query: 64  IEKLVNEMLDFGIIQPSISPFSSLVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMIQ- 123
           IEK V +ML  G+I+ S S FSS V+LVKKKDG WRFC DYRALN  T+ D+FPIP +  
Sbjct: 215 IEKQVCDMLKLGLIKASTSLFSSPVLLVKKKDGTWRFCTDYRALNAVTIKDRFPIPTVDD 274

Query: 124 LLDELNGASVFSKIDLKSGYHQIRVRNEDVRKTAFRTHEGHYEFLVMPFELTNAPSTFQA 183
           +LDEL+GA+ F+K+DL++GYH +RV   D+ KTAFRTH GHYE+LVMPF L+NAPSTFQA
Sbjct: 275 MLDELHGATYFTKLDLRAGYHYVRVHPPDIPKTAFRTHNGHYEYLVMPFGLSNAPSTFQA 334

Query: 184 LMNQVFRPYLLKFLLVYFDDTLMYSKDVETHLEHLTMVFQLLRQHCLFANRKKCHFVKDR 243
           +MN +FRPYL KF+LV+F D L+YS +   HLEH+   F++LRQH  F    KC F +  
Sbjct: 335 IMNSIFRPYLGKFVLVFFXDILIYSPNXNMHLEHVKQAFEILRQHQFFVKISKCAFGQXE 394

Query: 244 IEYLGHWVSAKGVEADHEKVKAMLEWPVPKNVRELRGFLGLTGYYRRFVANYGAIAMPLM 303
           +EYLGH V+  GV+ D  K+KAML WP P N+ EL GFLGLTGYYR+FV NYG IA  L 
Sbjct: 395 LEYLGHIVTXXGVQVDXGKIKAMLNWPRPTNISELHGFLGLTGYYRKFVRNYGIIARALT 454

Query: 304 RLTKKNNFRWSEEATQAFEFLKKAMVTLPILVLPNFQLPFEIETDASWFGLSVVLSQNKK 363
            L KK  F W+++A  AF+ LK+AM + P L +PNF  PF IE+DA   G+  VL+Q  K
Sbjct: 455 NLLKKGQFAWTKDAETAFQALKQAMTSTPTLAMPNFNEPFVIESDALGDGIGAVLTQQGK 514

Query: 364 LIAYFSQKLSEAAREKSVYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHILEQREL 423
            IA+ S+ L  + R  S+Y RE++AIV A++ WR YLLG +F + TDQ++L+++LEQR  
Sbjct: 515 PIAFMSRALGVSKRSWSIYAREMLAIVHAIQTWRPYLLGRKFYIQTDQRSLKYLLEQRIA 574

Query: 424 ILGVQKWIMKLMGFDFEVQ----EDTKLKAIFVRLLADPDCIPHYTGVIP---------D 483
               Q+W+ KL+G+D+E+      +   +    R+++ P     +    P          
Sbjct: 575 TPEQQEWVAKLLGYDYEITYKXGRENSAENALSRVVSSPSLNALFVPQAPLWDEIKAEAI 634

Query: 484 NYAPINESQQSCFWEGMKNDIKLYVDQCH------------------------DIFMDFV 543
            +  +++  +   W+    D     D C                         DI MDF+
Sbjct: 635 KHPYMDKIDKLANWQQTVQDYVSSCDVCQRVKSETLALAGLLQPLPIPCLVWDDITMDFI 694

Query: 544 EGLPRSKGVDTVLVVVDRLSKYAHFITLGHPFSAQTVAMVFVKEIVRLHGYPRSIVSDRD 603
           EGLP S G +T+LVVVD LSK AHF  L HPF+A+ VA  FV+ +V+LHG P+SI+SDRD
Sbjct: 695 EGLPTSNGKNTILVVVDHLSKSAHFFALAHPFTAKMVAEKFVEGVVKLHGMPKSIISDRD 754

Query: 604 RVFLSHFWKELYRLQGTQLKRSTTYHPQTDGQSEVINKCLELYLRWFCQEKPRTWSDKIA 663
            VF+S FW+E ++L GTQLK S++YHPQTDGQSEV+N+C+E YL  +    PR WS  + 
Sbjct: 755 PVFMSQFWQEFFKLSGTQLKMSSSYHPQTDGQSEVVNRCVEQYLCCYAHHHPRKWSFFLP 814

Query: 664 WAEYWYNTNYQSSIKNTLYAVVYGQPPPPIISY------------------GQAEQMKKF 714
           W E+WYNT Y +S   T +  +YG+ PP I  Y                      Q+K  
Sbjct: 815 WVEFWYNTTYHTSTGMTPFQALYGRLPPNIPHYLMGTTPVHAVDQNLASRDAILRQLKTN 874

BLAST of CSPI01G22700.1 vs. NCBI nr
Match: gi|674236404|gb|KFK29169.1| (hypothetical protein AALP_AA7G098300 [Arabis alpina])

HSP 1 Score: 704.5 bits (1817), Expect = 2.0e-199
Identity = 367/727 (50.48%), Positives = 476/727 (65.47%), Query Frame = 1

Query: 19   EFENVFNMPAELSPMRQVDHRIKLKEGTDPINVRPYRYPHAQKNEIEKLVNEMLDFGIIQ 78
            EF +VF  P  L P R  +H I L+ G   ++VRP+RYP  Q+ E+EK V  ML  GII+
Sbjct: 338  EFASVFEEPQGLPPCRDKEHAIVLETGASLVSVRPFRYPQVQREELEKQVATMLAAGIIK 397

Query: 79   PSISPFSSLVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMI-QLLDELNGASVFSKID 138
             S SPFSS V+LVKKKDG WRFCVDYRALN+ TV D +PIPMI QLLDEL+GA +FSK+D
Sbjct: 398  ESTSPFSSPVLLVKKKDGTWRFCVDYRALNKVTVGDSYPIPMIDQLLDELHGAIMFSKLD 457

Query: 139  LKSGYHQIRVRNEDVRKTAFRTHEGHYEFLVMPFELTNAPSTFQALMNQVFRPYLLKFLL 198
            +++GYHQIRV+ EDV KTAFRTH+GHYEFLVMPF LTNAP+TFQ+LM+ VFR +L +F+L
Sbjct: 458  MRAGYHQIRVKAEDVPKTAFRTHDGHYEFLVMPFGLTNAPTTFQSLMDDVFRQFLRRFVL 517

Query: 199  VYFDDTLMYSKDVETHLEHLTMVFQLLRQHCLFANRKKCHFVKDRIEYLGHWVSAKGVEA 258
            V+FDD L+YSK    H  H+ +V Q L  H L+AN KKC F K  +EYLGH +S +GV A
Sbjct: 518  VFFDDILIYSKTEAEHQAHVRIVLQTLADHQLYANAKKCEFGKSEVEYLGHVISGRGVAA 577

Query: 259  DHEKVKAMLEWPVPKNVRELRGFLGLTGYYRRFVANYGAIAMPLMRLTKKNNFRWSEEAT 318
            D  KVKAM++WP PKNV+ LRGFLGLTGYYR+FV  YG IA PL  L KK+ F+WS  A 
Sbjct: 578  DPTKVKAMVDWPPPKNVKALRGFLGLTGYYRKFVKGYGGIARPLTALLKKDQFKWSPTAE 637

Query: 319  QAFEFLKKAMVTLPILVLPNFQLPFEIETDASWFGLSVVLSQNKKLI--AYFSQKLSEAA 378
              F+ LK AM T+P+L L +F   F +E+DAS  GL             AYFSQ L++  
Sbjct: 638  ATFQALKAAMSTVPVLALVDFSKQFVVESDASGIGLGXXXXXXXXXXXXAYFSQALTDRH 697

Query: 379  REKSVYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHILEQRELILGVQKWIMKLMG 438
            + KSVYERELMA+V A++KWRHYLLG RFVV TDQ++L+ +LEQRE+ L  Q+W+ K++G
Sbjct: 698  KLKSVYERELMAVVFAIQKWRHYLLGRRFVVRTDQRSLKFLLEQREINLEYQRWLSKILG 757

Query: 439  FDFEVQEDTKLKAIFVRLLADPDCIPHYTGVIPDNYAPINESQQSCFWEGMKNDIKLYVD 498
            FDFE+Q    L+      L+  +     T  +     P+   Q S F   +  D  L   
Sbjct: 758  FDFEIQYKPGLENKAADALSRVE-----THQLLALSMPV-AIQMSEFESEVDQDEDL--S 817

Query: 499  QCHDIFMDFVEGLPRSKGVDTVLVVVDRLSKYAHFITLGHPFSAQTVAMVFVKEIVRLHG 558
            +     +      P    V   L+   RL+KYAHFI + HP+ A  VA+ FVKE+VRLHG
Sbjct: 818  KLKKAVLANPGDHPDYSIVQGRLLRKGRLTKYAHFIKMSHPYEAAEVALTFVKEVVRLHG 877

Query: 559  YPRSIVSDRDRVFLSHFWKELYRLQGTQLKRSTTYHPQTDGQSEVINKCLELYLRWFCQE 618
            YPR+IV DRD  F   FW EL+RL GT L  ST YHPQ+DGQ+EV N+ +E YLR FC E
Sbjct: 878  YPRTIVLDRDITFTGKFWGELFRLAGTHLCFSTAYHPQSDGQTEVTNRGMETYLRCFCSE 937

Query: 619  KPRTWSDKIAWAEYWYNTNYQSSIKNTLYAVVYGQPPPPIISYGQA-------------- 678
            KP+ WS  + WAE  YNT+Y ++I+ T +  VYG+ PP ++ + +               
Sbjct: 938  KPKKWSGYLVWAELSYNTSYHTAIRMTPFKAVYGREPPTLLQFERGSTDNATLEDQLLER 997

Query: 679  ---------------EQMKKFADVYRRNVIFDIGDWVYLKLQPYRQQSVAKKRCEKLSPR 714
                           + MK+ AD +RR V F +GD V+LK++PYRQ+++A++  EKL+ R
Sbjct: 998  DEMLGIQQQQLLRTQQIMKQQADNHRREVEFAVGDMVFLKIRPYRQKTLARRANEKLAAR 1056

BLAST of CSPI01G22700.1 vs. NCBI nr
Match: gi|727652650|ref|XP_010497069.1| (PREDICTED: uncharacterized protein LOC104774101 [Camelina sativa])

HSP 1 Score: 645.6 bits (1664), Expect = 1.1e-181
Identity = 341/637 (53.53%), Positives = 429/637 (67.35%), Query Frame = 1

Query: 20   FENVFNMPAELSPMRQVDHRIKLKEGTDPINVRPYRYPHAQKNEIEKLVNEMLDFGIIQP 79
            F  VF +P+ L P+R  +H I L++G   I VRPYRYPHA K  +EK+V++ML  GII+P
Sbjct: 413  FAVVFEVPSGLPPVRGQEHAIVLQQGIHSITVRPYRYPHATKELMEKMVDDMLGAGIIRP 472

Query: 80   SISPFSSLVILVKKKDGGWRFCVDYRALNRATVPDKFPIPMI-QLLDELNGASVFSKIDL 139
            S SPFSS V+LVKKKD  WRF VDYRALNRATVPDKFPIP+I QLLDEL+GA +FSKIDL
Sbjct: 473  STSPFSSPVLLVKKKDSSWRFYVDYRALNRATVPDKFPIPVIDQLLDELHGAVIFSKIDL 532

Query: 140  KSGYHQIRVRNEDVRKTAFRTHEGHYEFLVMPFELTNAPSTFQALMNQVFRPYLLKFLLV 199
            +SGYHQI +++ED+ KTAFRT EGHYEFLVMPF LTNAP+TFQALMN++F+ YL KF+LV
Sbjct: 533  RSGYHQIHMKDEDIAKTAFRTLEGHYEFLVMPFGLTNAPATFQALMNKIFKQYLRKFVLV 592

Query: 200  YFDDTLMYSKDVETHLEHLTMVFQLLRQHCLFANRKKCHFVKDRIEYLGHWVSAKGVEAD 259
            +FDD L+YS   E H++HL +V Q L  H LFAN KKC      +EYLGH +SA GV  D
Sbjct: 593  FFDDILVYSASEEEHVQHLCVVLQALVSHQLFANSKKCMLGVTHVEYLGHIISAAGVATD 652

Query: 260  HEKVKAMLEWPVPKNVRELRGFLGLTGYYRRFVANYGAIAMPLMRLTKKNNFRWSEEATQ 319
              K +AM  WP P NV++LRGFLGLTGYYR+FV  YG +A PL  L KK+ F WS EA +
Sbjct: 653  IVKTEAMTTWPTPVNVKQLRGFLGLTGYYRKFVRGYGTMARPLTELLKKDQFHWSPEAQK 712

Query: 320  AFEFLKKAMVTLPILVLPNFQLPFEIETDASWFGLSVVLSQNKKLIAYFSQKLSEAAREK 379
            AF+ LK  MV  P+L L +F  PF IE+DAS  G+  VL Q+K+ IAYFS  L+   + K
Sbjct: 713  AFDMLKDTMVKAPVLGLHDFSKPFIIESDASGTGVGAVLLQDKRPIAYFSHGLTSREQLK 772

Query: 380  SVYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHILEQRELIL------GVQKWIMK 439
              YERELMAIVLAV KW+HY LG +F+V+TDQ++L+ +LEQR+L        G++  I +
Sbjct: 773  PAYERELMAIVLAVLKWKHYQLGRKFIVHTDQRSLKFLLEQRDLYKEIDQDEGIKAIITQ 832

Query: 440  LMGFDFEVQEDTKLKAIF---VRLLAD------PDCIPHYTGVIPDNYAPINESQ---QS 499
            L   D      + L        RL+        P  +  Y   +   +A I ++    Q+
Sbjct: 833  LGDADSTKGHYSMLNGRLWYKKRLVIPRSSSFIPLVLHEYHDSVVGGHAGILKTLKRIQT 892

Query: 500  CF-WEGMKNDIKLYVDQCH---------------------------DIFMDFVEGLPRSK 559
            CF WEGM+ D++ YV  C                            D+ +DF+EGLP S 
Sbjct: 893  CFHWEGMQQDVQCYVQACRVCQTHKYSTLALAGLLQPLPVPTAIWEDVSLDFIEGLPMSG 952

Query: 560  GVDTVLVVVDRLSKYAHFITLGHPFSAQTVAMVFVKEIVRLHGYPRSIVSDRDRVFLSHF 610
            GV+ +LVVVDRLSK AHF+ L HPFSA  VA  FV+ +VRLH +P+SIVSDRDR+FL   
Sbjct: 953  GVNVILVVVDRLSKAAHFLGLKHPFSALDVANKFVEGVVRLHSFPKSIVSDRDRIFLGEV 1012

BLAST of CSPI01G22700.1 vs. NCBI nr
Match: gi|645267554|ref|XP_008239126.1| (PREDICTED: uncharacterized protein LOC103337735 [Prunus mume])

HSP 1 Score: 622.5 bits (1604), Expect = 9.9e-175
Identity = 345/767 (44.98%), Positives = 453/767 (59.06%), Query Frame = 1

Query: 1    MVKESMEELRPEFEQLQL---EFENVFNMPAELSPMRQVDHRIKLKEGTDPINVRPYRYP 60
            M K + +   P+  +LQ     F  VF  P  L  +R+ DHRI L  G  P ++RPY Y 
Sbjct: 397  MAKPAEDLSSPQQHELQALLDSFSAVFGTPTTLPLVREHDHRIPLISGCKPPSIRPYAYG 456

Query: 61   HAQKNEIEKLVNEMLDFGIIQPSISPFSSLVILVKKKDGGWRFCVDYRALNRATVPDKFP 120
              QK+EIEK V E+LD G I+ S SPFSS V+LVKKKD  WR C+DYR LN  T+ DK+P
Sbjct: 457  PLQKSEIEKCVKELLDSGFIRNSHSPFSSPVLLVKKKDSTWRMCMDYRQLNEFTIKDKYP 516

Query: 121  IPMIQ-LLDELNGASVFSKIDLKSGYHQIRVRNEDVRKTAFRTHEGHYEFLVMPFELTNA 180
            IP+I  LLDEL+GA  FSK+DL++GYHQIRV  ED+ KTAFRTHEGHYEFLVMPF LTNA
Sbjct: 517  IPLIDDLLDELHGAKYFSKLDLRNGYHQIRVHLEDIEKTAFRTHEGHYEFLVMPFGLTNA 576

Query: 181  PSTFQALMNQVFRPYLLKFLLVYFDDTLMYSKDVETHLEHLTMVFQLLRQHCLFANRKKC 240
            P+TFQ LMN +FR  L KF+LV+FDD L+YS     HL HL  V ++L+ H LF    KC
Sbjct: 577  PATFQGLMNAIFRNCLRKFVLVFFDDILVYSTSWSDHLRHLHTVLEILKHHQLFVKMSKC 636

Query: 241  HFVKDRIEYLGHWVSAKGVEADHEKVKAMLEWPVPKNVRELRGFLGLTGYYRRFVANYGA 300
             F    IEYLGH VS +GV AD  K+ A+ +WPVP +V+ LRGFLGLTGYYR+F+ +YG 
Sbjct: 637  AFGVSTIEYLGHIVSRQGVSADPSKLNAVADWPVPTSVKSLRGFLGLTGYYRKFIPHYGR 696

Query: 301  IAMPLMRLTKKNNFRWSEEATQAFEFLKKAMVTLPILVLPNFQLPFEIETDASWFGLSVV 360
             + PL +LTKK+ F W+ EAT AF  LK+ M++  +L L +F  PF IE+DAS  G+  V
Sbjct: 697  ESFPLTQLTKKDGFLWTPEATAAFHRLKELMLSPRVLALLDFTKPFIIESDASGSGIGAV 756

Query: 361  LSQNKKLIAYFSQKLSEAAREKSVYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHI 420
            L Q  + IA+ S+ L    +  S YERE+MAIV A++KW HYL G  F++ TD  +L++ 
Sbjct: 757  LQQEGRPIAFTSKTLGPRNQALSTYEREMMAIVHAIKKWHHYLQGRHFIIKTDHHSLKYF 816

Query: 421  LEQRELILGVQKWIMKLMGFDFEVQ----EDTKLKAIF-------------VRLLADPDC 480
            L  +      QKW+ KL+G+D+E+      D K                  V++L    C
Sbjct: 817  LNHKAHTPFQQKWVTKLLGYDYEIHYRQGSDNKAADALSRFPISHSSSTDQVQVLFHVSC 876

Query: 481  IPHYTG--VIPDNYAP-INESQQSCFWEGMKNDIKLYVDQCHDIFMDFVEGLPRSKGVDT 540
            +  + G  V P    P I E     + EGMK+D++  V +CH       E +  + G+  
Sbjct: 877  LKKHLGTHVTPSLTLPRITE-----YNEGMKHDVQKMVAECHICQQHKYETVTPA-GLLQ 936

Query: 541  VLVVVDRLSKYAHFITLGHPFSAQTVAMVFVKEIVRLHGYPRSIVSDRDRVFLSHFWKEL 600
             L + D+L                     FV  + +LHG P SIV DRD VF+S FWKE 
Sbjct: 937  PLPIPDKL---------------------FVDHVFKLHGMPSSIVCDRDPVFVSDFWKEF 996

Query: 601  YRLQGTQLKRSTTYHPQTDGQSEVINKCLELYLRWFCQEKPRTWSDKIAWAEYWYNTNYQ 660
            ++L    L+ S+ YHPQTDGQ+EV+N+CLE YLR F   +P+ W   ++WAE+ YNT Y 
Sbjct: 997  FKLHDVALRMSSGYHPQTDGQTEVVNRCLETYLRCFAAAQPKKWLLWLSWAEFSYNTAYH 1056

Query: 661  SSIKNTLYAVVYGQPPPPIISYGQA-----------------------------EQMKKF 715
            +S K T + VVYGQPPP +  Y  +                              +MK  
Sbjct: 1057 TSTKLTPFEVVYGQPPPRVTPYEPSTTRFANVDRSLAARDRVLTLLKSNLLMAQTRMKTQ 1116

The following BLAST results are available for this feature:

Match Name    E-value    Identity (%)  Description
POL3_DROME    5.4e-80    39.43         Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogast...
POL2_DROME    7.1e-80    37.88         Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaste...
YI31B_YEAST   6.6e-78    36.99         Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...
YG31B_YEAST   1.5e-77    36.77         Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...
POL5_DROME    1.1e-75    36.83         Retrovirus-related Pol polyprotein from transposon opus OS=Drosophila melanogast...

Match Name        E-value    Identity (%)  Description
A5BRL2_VITVI      6.0e-203   46.72         Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_013478 PE=4 SV=1
A0A087GH17_ARAAL  1.4e-199   50.48         Uncharacterized protein OS=Arabis alpina GN=AALP_AA7G098300 PE=4 SV=1
Q2QZQ5_ORYSJ      3.8e-173   44.01         Retrotransposon protein, putative, unclassified OS=Oryza sativa subsp. japonica ...
M5XEL3_PRUPE      5.0e-173   40.88         Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa019597mg PE=4 S...
A0A151RRN1_CAJCA  1.1e-161   43.55         Retrovirus-related Pol polyprotein from transposon 17.6 OS=Cajanus cajan GN=KK1_...

Match Name    E-value    Identity (%)  Description
ATMG00860.1   2.1e-37    57.25         ATMG00860.1 DNA/RNA polymerases superfamily protein
ATMG00850.1   2.5e-06    56.41         ATMG00850.1 DNA/RNA polymerases superfamily protein

Match Name                        E-value    Identity (%)  Description
gi|922465109|ref|XP_013633118.1|  4.4e-207   47.72         PREDICTED: uncharacterized protein LOC106338764 [Brassica oleracea var. oleracea...
gi|147775005|emb|CAN70471.1|      8.6e-203   46.72         hypothetical protein VITISV_013478 [Vitis vinifera]
gi|674236404|gb|KFK29169.1|       2.0e-199   50.48         hypothetical protein AALP_AA7G098300 [Arabis alpina]
gi|727652650|ref|XP_010497069.1|  1.1e-181   53.53         PREDICTED: uncharacterized protein LOC104774101 [Camelina sativa]
gi|645267554|ref|XP_008239126.1|  9.9e-175   44.98         PREDICTED: uncharacterized protein LOC103337735 [Prunus mume]

The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term        Definition
IPR000477   RT_dom
IPR001584   Integrase_cat-core
IPR012337   RNaseH-like_sf

Vocabulary: Biological Process
Term        Definition
GO:0015074  DNA integration

Vocabulary: Molecular Function
Term        Definition
GO:0003676  nucleic acid binding

GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
biological_process  GO:0044238      primary metabolic process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003676      nucleic acid binding

This mRNA is a part of the following gene feature(s):

Feature Name  Unique Name   Type
CSPI01G22700  CSPI01G22700  gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name    Unique Name             Type
CSPI01G22700.1  CSPI01G22700.1-protein  polypeptide


The following CDS feature(s) are a part of this mRNA:

Feature Name         Unique Name          Type
CSPI01G22700.1.cds5  CSPI01G22700.1.cds5  CDS
CSPI01G22700.1.cds4  CSPI01G22700.1.cds4  CDS
CSPI01G22700.1.cds3  CSPI01G22700.1.cds3  CDS
CSPI01G22700.1.cds2  CSPI01G22700.1.cds2  CDS
CSPI01G22700.1.cds1  CSPI01G22700.1.cds1  CDS


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name           Unique Name            Type
CSPI01G22700.1.utr5p1  CSPI01G22700.1.utr5p1  five_prime_UTR


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description               Source   Source Term        Source Description   Alignment
IPR000477  Reverse transcriptase domain  PFAM     PF00078            RVT_1                coord: 91..250; score: 1.0
IPR000477  Reverse transcriptase domain  PROFILE  PS50878            RT_POL               coord: 72..250; score: 11
IPR001584  Integrase, catalytic core     PFAM     PF00665            rve                  coord: 500..605; score: 4.9
IPR001584  Integrase, catalytic core     PROFILE  PS50994            INTEGRASE            coord: 492..660; score: 15
IPR012337  Ribonuclease H-like domain    GENE3D   G3DSA:3.30.420.10  -                    coord: 502..651; score: 5.6
IPR012337  Ribonuclease H-like domain    unknown  SSF53098           Ribonuclease H-like  coord: 499..642; score: 5.46
None       No IPR available              GENE3D   G3DSA:3.10.10.10   -                    coord: 40..169; score: 9.5
None       No IPR available              GENE3D   G3DSA:3.30.70.270  -                    coord: 170..250; score: 1.
None       No IPR available              PANTHER  PTHR24559          FAMILY NOT NAMED     coord: 84..716; score: 3.2E
None       No IPR available              PANTHER  PTHR24559:SF186    SUBFAMILY NOT NAMED  coord: 84..716; score: 3.2E
None       No IPR available              unknown  SSF56672           DNA/RNA polymerases  coord: 18..440; score: 2.22E
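
Each alignment entry carries a `coord: start..end` range. As a rough illustration, the sketch below turns such a range into a span length. It assumes the coordinates are 1-based and inclusive residue positions on the protein, which this page does not state explicitly, and the helper name `span_length` is hypothetical.

```python
import re

# Matches the "coord: start..end" fragment of an alignment entry.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_length(cell):
    """Return the number of positions covered by a 'coord: start..end' entry.

    Assumption: both endpoints are 1-based and inclusive.
    """
    m = COORD_RE.search(cell)
    if m is None:
        raise ValueError("no coordinate range found")
    start, end = int(m.group(1)), int(m.group(2))
    if end < start:
        raise ValueError("end precedes start")
    return end - start + 1


print(span_length("coord: 91..250"))   # 160
print(span_length("coord: 492..660"))  # 169
```

Under that assumption, the PF00078 reverse transcriptase match spans 160 positions and the PS50994 integrase profile match spans 169.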