CSPI04G27540 (gene) Wild cucumber (PI 183967)

NameCSPI04G27540
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionTy3/gypsy retrotransposon protein
LocationChr4 : 24155717 .. 24158806 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTTGGCCTCAAGAATATGATTAAGTCATGGAGGGATTCTGACCAAGGATTCTTAATTGAGTGCCGAGCAATGGAGACAATGTATGAGCCCCCAGAAGATAATGGAATTGAGGAAGTACTAGCGGTGGACGAGGCAGTTTCAGTTGTCCTGAAGAAATTCGAAGATGTTTTTACATGGCCGGAGACTCTACCTCCACGAAGAAGTATTGAGCATCATATCTATTTGAAACAAGGAACTGACCCGGTAAATGTGAGGCCGTATCGCTATGGATACCAACAAAAGGCAGAGATGGAGAGATTGGTGGAAGAGATGCTGAGTTCAGGGGTTATTCGCCCAAGTAATAGCCCATATTCCAGCCCGGTTCTGTTAGTACGGAAGAAGGATGGAAGCTGGAGATTCTGCGTAGATTATAGAGTCTTAAACAGCGTGACTATACCCGATAAGTTTCCCATTCCAGTTATTGAGGAGTTATTCGACGAGTTGAATGGAGCTAGATGGTTTTCAAAGATTGACTTGAAAGCTGGGTACCATCAAATTAGGATGGCAAGTGGAGATATCGAAAAGACAGCATTCCGTACACACGAAGGACATTATGAGTTCTTAGTCATGCCCTTCGGATTAACTAATGCCCCCTCAACTTTCCAATCTTTGATGAACACGGTATTTAAGCCATATCTAAGGAAGTTCATATTGGTGTTCTTCGATGATATTTTGATCTATAGCAAAAACTTAGAGGTACATCTCACGCATCTTGGACTAGCACTGGAAATCCTAAGAAGAAACGAACTCTATGCTAATCGAAAGAAGTGTAGCTTTGCACAAGAACGCGTGGATTACCTAGGCCACATCATATCAGCTCAAGGAGTTGAGGTCGATCCGGAAAAGATAAGAGCAATTAAAGAATGGCCTACACCTACAAATATAAGAGAAGTTCGCGGGTTCCTGGGCTTGACAGGATATTACCGAAAGTTCGTCCAACATTATGGATCAATGGCAGCTCCTTTAACTCAGTTAGTGAAGAAAGGAGGGTTCAATTGGACTGACAATTCAGAAGAAGCTTTTCAAAGGCTGCAACAAGCTATGATGACTCTACCGGTCTTAGCATTGCCAGATTTCAGTAGCACCTTTGAATTGGAAACGGATGCATCGGGGTATGGCATAGGAGCTGTGTTAATGCAAGCCAAGAAACCGATTGCCTACTTCAGCCACACCTTGGCAGTAAGGGACAGAGTAAAGCCAGTCTATGAAAGAGAATTAATGGCCGTAGTCATGGCTGTACAGAGATGGAGACCGTATTTACTCGGAAAACCATTCATAGTAAGGACTGACCAGAAATCATTGAAGTTTTTACTAGAGCAGAGAGTAATTCAACCACAGTACCAAAAGTGGGTAGCCAAGCTATTGGGATACTCTTTTGAAGTACAATACAAACCAGGACTAGAGAATAAAGCCGCTGACGCCCTATCCCGTGTACCTTCAGCGGTTCAATTCAGTAGCTTGACGGCACCAGCATTAATAGACTTATTAGTAGTCAAGAAGGAAGTGGAAGAAGATACCAGACTTCGCAAAGTATGGGATGAACTACAATCAGGGGAGGAGAATACTGAAAGAAAATTTTCTATCAGACACGGAATGCTAAGGTACAAGGACAGGTTGGTGTTATCTCAGTCATCAGCTTTGATTCCAGCCATACTGTATACTTATCATGATTCTGTGATAGGTGGACATTCTGGGTTTCTAAGAACTTATAAGAGAATTACTGGAGAGTTATATTGGGCCGGAATGAAAACTGACATCAAAAGGTACTGTGATGAATGTCTTATTTGTCAGAAGAATAAATCCTTGGCTTTAACCCCGGCTGGGCTATTACTGCCATTGGAAGTTCCCACCAACATTTGGAGTGATATCTCAATGGACTTCATTGAGGGGTTGCCGAAATCATGTGGCTTTGAAACTATATTTGTAGTGGTCGATAGGTTCAGCAAATATGGTCATTTCCTTGCCCTCAAACATCCCTTCACAGCAAAAACTGTAGCAGAAGTATTTGTTAAAGAAATTGTGCGTCTACACGGATTTCCTAAATCCATCGTATCGGATAGAGATAAGGTATTTGTGAGCAGTTTTTGGAAAGGAATGTTCAAATTGGCTGGCACTAAATTAAATAGAAGCACGGCATATCACCCACAAACAGATGGTCAGACAGAAGTGGTCAACAGGAGCGTGGAAACATATTTAAGATGTTTCTGTAGTGAGAAACCGAAGGAATGGGCAAAATGGCTACACTGGGCGGAGTATTGGTACAATACCACCTTCCACCGGTCATTGGGAATCACTCCTTTTCAGGCTGTATACGGTCGCACACCCCCTCCGTTATTATACTATGGGGATCAAAGCACATCTAACTTCCTGTTAGATGAACAACTCAAGGCAAGGGATGAAGTCTTGGAAGTATTGAAGGAACACCTTCGAGTGGCTCAAGATAAAATGAAAAAGACTGCAGATTTGAAGAGAAGAGATGTGGAATACAAGGTTGGTGACATGGTGTTTTTAAAAATTAGGCCATACAGGCAATCTTCCTTGCGTAAGAAAAAGAATGAGAAACTGTCCCCAAAGTTTTTTGGGCCATTTGAAGTTACAGAACGAATAGGACTGGTAGCATACAAGCTCCAACTACCGAAATCGTCATGTATTCACCCAGTCTTTCATGTCTCGCAACTTAGAAAGATGGTGGGAAATCACACCATGTTGAAACCAGAAGAAATGGCCTGTTTAAATGAAAACTATGAGTGGTTGGCTATTCCGGAGGAGATATATGGGTATTCAAAGAACAAGGAAGGAATGTGGGAGGTGCTGATTAAATGGCAAGGGCTACCACCTCAGGATGCATCTTGGGAGGAGTATGAAGAATTTCAGAAGAAATTTCCGAATTTTCACCTTGAGGACAAGGTGCATTTGGAAAGGGAATGTAATGATAGACCCCCAATTATACACCAGTACAGTAGAAGGAAGAAGAAGCAAGGTTAATCGCACGTGCTAGGGAGAATAGCCTTAGTTTTAATATTGTTAGTAGATGCTCCATAATAAAAT

mRNA sequence

ATGGTTGGCCTCAAGAATATGATTAAGTCATGGAGGGATTCTGACCAAGGATTCTTAATTGAGTGCCGAGCAATGGAGACAATGTATGAGCCCCCAGAAGATAATGGAATTGAGGAAGTACTAGCGGTGGACGAGGCAGTTTCAGTTGTCCTGAAGAAATTCGAAGATGTTTTTACATGGCCGGAGACTCTACCTCCACGAAGAAGTATTGAGCATCATATCTATTTGAAACAAGGAACTGACCCGGTAAATGTGAGGCCGTATCGCTATGGATACCAACAAAAGGCAGAGATGGAGAGATTGGTGGAAGAGATGCTGAGTTCAGGGGTTATTCGCCCAAGTAATAGCCCATATTCCAGCCCGGTTCTGTTAGTACGGAAGAAGGATGGAAGCTGGAGATTCTGCGTAGATTATAGAGTCTTAAACAGCGTGACTATACCCGATAAGTTTCCCATTCCAGTTATTGAGGAGTTATTCGACGAGTTGAATGGAGCTAGATGGTTTTCAAAGATTGACTTGAAAGCTGGGTACCATCAAATTAGGATGGCAAGTGGAGATATCGAAAAGACAGCATTCCGTACACACGAAGGACATTATGAGTTCTTAGTCATGCCCTTCGGATTAACTAATGCCCCCTCAACTTTCCAATCTTTGATGAACACGGTATTTAAGCCATATCTAAGGAAGTTCATATTGGTGTTCTTCGATGATATTTTGATCTATAGCAAAAACTTAGAGGTACATCTCACGCATCTTGGACTAGCACTGGAAATCCTAAGAAGAAACGAACTCTATGCTAATCGAAAGAAGTGTAGCTTTGCACAAGAACGCGTGGATTACCTAGGCCACATCATATCAGCTCAAGGAGTTGAGGTCGATCCGGAAAAGATAAGAGCAATTAAAGAATGGCCTACACCTACAAATATAAGAGAAGTTCGCGGGTTCCTGGGCTTGACAGGATATTACCGAAAGTTCGTCCAACATTATGGATCAATGGCAGCTCCTTTAACTCAGTTAGTGAAGAAAGGAGGGTTCAATTGGACTGACAATTCAGAAGAAGCTTTTCAAAGGCTGCAACAAGCTATGATGACTCTACCGGTCTTAGCATTGCCAGATTTCAGTAGCACCTTTGAATTGGAAACGGATGCATCGGGGTATGGCATAGGAGCTGTGTTAATGCAAGCCAAGAAACCGATTGCCTACTTCAGCCACACCTTGGCAGTAAGGGACAGAGTAAAGCCAGTCTATGAAAGAGAATTAATGGCCGTAGTCATGGCTGTACAGAGATGGAGACCGTATTTACTCGGAAAACCATTCATAGTAAGGACTGACCAGAAATCATTGAAGTTTTTACTAGAGCAGAGAGTAATTCAACCACAGTACCAAAAGTGGGTAGCCAAGCTATTGGGATACTCTTTTGAAGTACAATACAAACCAGGACTAGAGAATAAAGCCGCTGACGCCCTATCCCGTGTACCTTCAGCGGTTCAATTCAGTAGCTTGACGGCACCAGCATTAATAGACTTATTAGTAGTCAAGAAGGAAGTGGAAGAAGATACCAGACTTCGCAAAGTATGGGATGAACTACAATCAGGGGAGGAGAATACTGAAAGAAAATTTTCTATCAGACACGGAATGCTAAGGTACAAGGACAGGTTGGTGTTATCTCAGTCATCAGCTTTGATTCCAGCCATACTGTATACTTATCATGATTCTGTGATAGGTGGACATTCTGGGTTTCTAAGAACTTATAAGAGAATTACTGGAGAGTTATATTGGGCCGGAATGAAAACTGACATCAAAAGGTACTGTGATGAATGTCTTATTTGTCAGAAGAATAAATCCTTGGCTTTAACCCCGGCTGGGCTATTACTGCCATTGGAAGTTCCCACCAACATTTGGAGTGATATCTCAATGGACTTCATTGAGGGGTTGCCGAAATCATGTGGCTTTGAAACTATATTTGTAGTGGTCGATAGGTTCAGCAAATATGGTCATTTCCTTGCCCTCAAACATCCCTTCACAGCAAAAACTGTAGCAGAAGTATTTGTTAAAGAAATTGTGCGTCTACACGGATTTCCTAAATCCATCGTATCGGATAGAGATAAGGTATTTGTGAGCAGTTTTTGGAAAGGAATGTTCAAATTGGCTGGCACTAAATTAAATAGAAGCACGGCATATCACCCACAAACAGATGGTCAGACAGAAGTGGTCAACAGGAGCGTGGAAACATATTTAAGATGTTTCTGTAGTGAGAAACCGAAGGAATGGGCAAAATGGCTACACTGGGCGGAGTATTGGTACAATACCACCTTCCACCGGTCATTGGGAATCACTCCTTTTCAGGCTGTATACGGTCGCACACCCCCTCCGTTATTATACTATGGGGATCAAAGCACATCTAACTTCCTGTTAGATGAACAACTCAAGGCAAGGGATGAAGTCTTGGAAGTATTGAAGGAACACCTTCGAGTGGCTCAAGATAAAATGAAAAAGACTGCAGATTTGAAGAGAAGAGATGTGGAATACAAGGTTGGTGACATGGTGTTTTTAAAAATTAGGCCATACAGGCAATCTTCCTTGCGTAAGAAAAAGAATGAGAAACTGTCCCCAAAGTTTTTTGGGCCATTTGAAGTTACAGAACGAATAGGACTGGTAGCATACAAGCTCCAACTACCGAAATCGTCATGTATTCACCCAGTCTTTCATGTCTCGCAACTTAGAAAGATGGTGGGAAATCACACCATGTTGAAACCAGAAGAAATGGCCTGTTTAAATGAAAACTATGAGTGGTTGGCTATTCCGGAGGAGATATATGGGTATTCAAAGAACAAGGAAGGAATGTGGGAGGTGCTGATTAAATGGCAAGGGCTACCACCTCAGGATGCATCTTGGGAGGAGTATGAAGAATTTCAGAAGAAATTTCCGAATTTTCACCTTGAGGACAAGGTGCATTTGGAAAGGGAATGTAATGATAGACCCCCAATTATACACCAGTACAGTAGAAGGAAGAAGAAGCAAGGTTAA

Coding sequence (CDS)

ATGGTTGGCCTCAAGAATATGATTAAGTCATGGAGGGATTCTGACCAAGGATTCTTAATTGAGTGCCGAGCAATGGAGACAATGTATGAGCCCCCAGAAGATAATGGAATTGAGGAAGTACTAGCGGTGGACGAGGCAGTTTCAGTTGTCCTGAAGAAATTCGAAGATGTTTTTACATGGCCGGAGACTCTACCTCCACGAAGAAGTATTGAGCATCATATCTATTTGAAACAAGGAACTGACCCGGTAAATGTGAGGCCGTATCGCTATGGATACCAACAAAAGGCAGAGATGGAGAGATTGGTGGAAGAGATGCTGAGTTCAGGGGTTATTCGCCCAAGTAATAGCCCATATTCCAGCCCGGTTCTGTTAGTACGGAAGAAGGATGGAAGCTGGAGATTCTGCGTAGATTATAGAGTCTTAAACAGCGTGACTATACCCGATAAGTTTCCCATTCCAGTTATTGAGGAGTTATTCGACGAGTTGAATGGAGCTAGATGGTTTTCAAAGATTGACTTGAAAGCTGGGTACCATCAAATTAGGATGGCAAGTGGAGATATCGAAAAGACAGCATTCCGTACACACGAAGGACATTATGAGTTCTTAGTCATGCCCTTCGGATTAACTAATGCCCCCTCAACTTTCCAATCTTTGATGAACACGGTATTTAAGCCATATCTAAGGAAGTTCATATTGGTGTTCTTCGATGATATTTTGATCTATAGCAAAAACTTAGAGGTACATCTCACGCATCTTGGACTAGCACTGGAAATCCTAAGAAGAAACGAACTCTATGCTAATCGAAAGAAGTGTAGCTTTGCACAAGAACGCGTGGATTACCTAGGCCACATCATATCAGCTCAAGGAGTTGAGGTCGATCCGGAAAAGATAAGAGCAATTAAAGAATGGCCTACACCTACAAATATAAGAGAAGTTCGCGGGTTCCTGGGCTTGACAGGATATTACCGAAAGTTCGTCCAACATTATGGATCAATGGCAGCTCCTTTAACTCAGTTAGTGAAGAAAGGAGGGTTCAATTGGACTGACAATTCAGAAGAAGCTTTTCAAAGGCTGCAACAAGCTATGATGACTCTACCGGTCTTAGCATTGCCAGATTTCAGTAGCACCTTTGAATTGGAAACGGATGCATCGGGGTATGGCATAGGAGCTGTGTTAATGCAAGCCAAGAAACCGATTGCCTACTTCAGCCACACCTTGGCAGTAAGGGACAGAGTAAAGCCAGTCTATGAAAGAGAATTAATGGCCGTAGTCATGGCTGTACAGAGATGGAGACCGTATTTACTCGGAAAACCATTCATAGTAAGGACTGACCAGAAATCATTGAAGTTTTTACTAGAGCAGAGAGTAATTCAACCACAGTACCAAAAGTGGGTAGCCAAGCTATTGGGATACTCTTTTGAAGTACAATACAAACCAGGACTAGAGAATAAAGCCGCTGACGCCCTATCCCGTGTACCTTCAGCGGTTCAATTCAGTAGCTTGACGGCACCAGCATTAATAGACTTATTAGTAGTCAAGAAGGAAGTGGAAGAAGATACCAGACTTCGCAAAGTATGGGATGAACTACAATCAGGGGAGGAGAATACTGAAAGAAAATTTTCTATCAGACACGGAATGCTAAGGTACAAGGACAGGTTGGTGTTATCTCAGTCATCAGCTTTGATTCCAGCCATACTGTATACTTATCATGATTCTGTGATAGGTGGACATTCTGGGTTTCTAAGAACTTATAAGAGAATTACTGGAGAGTTATATTGGGCCGGAATGAAAACTGACATCAAAAGGTACTGTGATGAATGTCTTATTTGTCAGAAGAATAAATCCTTGGCTTTAACCCCGGCTGGGCTATTACTGCCATTGGAAGTTCCCACCAACATTTGGAGTGATATCTCAATGGACTTCATTGAGGGGTTGCCGAAATCATGTGGCTTTGAAACTATATTTGTAGTGGTCGATAGGTTCAGCAAATATGGTCATTTCCTTGCCCTCAAACATCCCTTCACAGCAAAAACTGTAGCAGAAGTATTTGTTAAAGAAATTGTGCGTCTACACGGATTTCCTAAATCCATCGTATCGGATAGAGATAAGGTATTTGTGAGCAGTTTTTGGAAAGGAATGTTCAAATTGGCTGGCACTAAATTAAATAGAAGCACGGCATATCACCCACAAACAGATGGTCAGACAGAAGTGGTCAACAGGAGCGTGGAAACATATTTAAGATGTTTCTGTAGTGAGAAACCGAAGGAATGGGCAAAATGGCTACACTGGGCGGAGTATTGGTACAATACCACCTTCCACCGGTCATTGGGAATCACTCCTTTTCAGGCTGTATACGGTCGCACACCCCCTCCGTTATTATACTATGGGGATCAAAGCACATCTAACTTCCTGTTAGATGAACAACTCAAGGCAAGGGATGAAGTCTTGGAAGTATTGAAGGAACACCTTCGAGTGGCTCAAGATAAAATGAAAAAGACTGCAGATTTGAAGAGAAGAGATGTGGAATACAAGGTTGGTGACATGGTGTTTTTAAAAATTAGGCCATACAGGCAATCTTCCTTGCGTAAGAAAAAGAATGAGAAACTGTCCCCAAAGTTTTTTGGGCCATTTGAAGTTACAGAACGAATAGGACTGGTAGCATACAAGCTCCAACTACCGAAATCGTCATGTATTCACCCAGTCTTTCATGTCTCGCAACTTAGAAAGATGGTGGGAAATCACACCATGTTGAAACCAGAAGAAATGGCCTGTTTAAATGAAAACTATGAGTGGTTGGCTATTCCGGAGGAGATATATGGGTATTCAAAGAACAAGGAAGGAATGTGGGAGGTGCTGATTAAATGGCAAGGGCTACCACCTCAGGATGCATCTTGGGAGGAGTATGAAGAATTTCAGAAGAAATTTCCGAATTTTCACCTTGAGGACAAGGTGCATTTGGAAAGGGAATGTAATGATAGACCCCCAATTATACACCAGTACAGTAGAAGGAAGAAGAAGCAAGGTTAA
BLAST of CSPI04G27540 vs. Swiss-Prot
Match: YG31B_YEAST (Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY3B-G PE=1 SV=3)

HSP 1 Score: 490.7 bits (1262), Expect = 3.8e-137
Identity = 306/883 (34.65%), Postives = 457/883 (51.76%), Query Frame = 1

Query: 64   LPPRRS------IEHHIYLKQGTDPVNVRPYRYGYQQKAEMERLVEEMLSSGVIRPSNSP 123
            LPPR +      ++H I +K G     ++PY    + + E+ ++V+++L +  I PS SP
Sbjct: 572  LPPRPADINNIPVKHDIEIKPGARLPRLQPYHVTEKNEQEINKIVQKLLDNKFIVPSKSP 631

Query: 124  YSSPVLLVRKKDGSWRFCVDYRVLNSVTIPDKFPIPVIEELFDELNGARWFSKIDLKAGY 183
             SSPV+LV KKDG++R CVDYR LN  TI D FP+P I+ L   +  A+ F+ +DL +GY
Sbjct: 632  CSSPVVLVPKKDGTFRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIFTTLDLHSGY 691

Query: 184  HQIRMASGDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQSLMNTVFKPYLRKFILVFFDD 243
            HQI M   D  KTAF T  G YE+ VMPFGL NAPSTF   M   F+    +F+ V+ DD
Sbjct: 692  HQIPMEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFARYMADTFRDL--RFVNVYLDD 751

Query: 244  ILIYSKNLEVHLTHLGLALEILRRNELYANRKKCSFAQERVDYLGHIISAQGVEVDPEKI 303
            ILI+S++ E H  HL   LE L+   L   +KKC FA E  ++LG+ I  Q +     K 
Sbjct: 752  ILIFSESPEEHWKHLDTVLERLKNENLIVKKKKCKFASEETEFLGYSIGIQKIAPLQHKC 811

Query: 304  RAIKEWPTPTNIREVRGFLGLTGYYRKFVQHYGSMAAPLTQLVKKGGFNWTDNSEEAFQR 363
             AI+++PTP  +++ + FLG+  YYR+F+ +   +A P+ QL       WT+  ++A  +
Sbjct: 812  AAIRDFPTPKTVKQAQRFLGMINYYRRFIPNCSKIAQPI-QLFICDKSQWTEKQDKAIDK 871

Query: 364  LQQAMMTLPVLALPDFSSTFELETDASGYGIGAVLMQAKKP------IAYFSHTLAVRDR 423
            L+ A+   PVL   +  + + L TDAS  GIGAVL +          + YFS +L    +
Sbjct: 872  LKDALCNSPVLVPFNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGVVGYFSKSLESAQK 931

Query: 424  VKPVYERELMAVVMAVQRWRPYLLGKPFIVRTDQKSLKFLLEQRVIQPQYQKWVAKLLGY 483
              P  E EL+ ++ A+  +R  L GK F +RTD  SL  L  +     + Q+W+  L  Y
Sbjct: 932  NYPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEPARRVQRWLDDLATY 991

Query: 484  SFEVQYKPGLENKAADALSRV-----------------PSAVQFSSLTAPALIDL--LVV 543
             F ++Y  G +N  ADA+SR                   S  +   L +  LI +  L  
Sbjct: 992  DFTLEYLAGPKNVVADAISRAVYTITPETSRPIDTESWKSYYKSDPLCSAVLIHMKELTQ 1051

Query: 544  KKEVEEDTRLRKVWDELQSGEENTERKFSIRHGMLRYKDRLVLSQSSALIPAILYTYHD- 603
                 ED    + + +     E   + +S+   M+ Y+DRLV+        A++  YHD 
Sbjct: 1052 HNVTPEDMSAFRSYQKKLELSETFRKNYSLEDEMIYYQDRLVVPIKQQ--NAVMRLYHDH 1111

Query: 604  SVIGGHSGFLRTYKRITGELYWAGMKTDIKRYCDECLICQKNKSLALTPAGLLLPLEVPT 663
            ++ GGH G   T  +I+   YW  ++  I +Y   C+ CQ  KS      GLL PL +  
Sbjct: 1112 TLFGGHFGVTVTLAKISPIYYWPKLQHSIIQYIRTCVQCQLIKSHRPRLHGLLQPLPIAE 1171

Query: 664  NIWSDISMDFIEGL-PKSCGFETIFVVVDRFSKYGHFLALKHPFTAKTVAEVFVKEIVRL 723
              W DISMDF+ GL P S     I VVVDRFSK  HF+A +    A  + ++  + I   
Sbjct: 1172 GRWLDISMDFVTGLPPTSNNLNMILVVVDRFSKRAHFIATRKTLDATQLIDLLFRYIFSY 1231

Query: 724  HGFPKSIVSDRDKVFVSSFWKGMFKLAGTKLNRSTAYHPQTDGQTEVVNRSVETYLRCFC 783
            HGFP++I SDRD    +  ++ + K  G K   S+A HPQTDGQ+E   +++   LR + 
Sbjct: 1232 HGFPRTITSDRDVRMTADKYQELTKRLGIKSTMSSANHPQTDGQSERTIQTLNRLLRAYA 1291

Query: 784  SEKPKEWAKWLHWAEYWYNTTFHRSLGITPFQAVYGRTP--PPLLYYGDQSTSNFLLDEQ 843
            S   + W  +L   E+ YN+T  R+LG +PF+   G  P  P +    + +  +F   E 
Sbjct: 1292 STNIQNWHVYLPQIEFVYNSTPTRTLGKSPFEIDLGYLPNTPAIKSDDEVNARSFTAVEL 1351

Query: 844  LKARDEVLEVLKEHLRVAQDKMKKTADLKRRDVEYKVGDMVFLKIRPYRQSSLRKKKNEK 903
             K    +    KE L  AQ +M+   + +R+ +   +GD V +    +R +  +K    K
Sbjct: 1352 AKHLKALTIQTKEQLEHAQIEMETNNNQRRKPLLLNIGDHVLV----HRDAYFKKGAYMK 1411

Query: 904  LSPKFFGPFEVTERIGLVAYKLQLPKSSCIHPVFHVSQLRKMV 912
            +   + GPF V ++I   AY+L L      H V +V  L+K V
Sbjct: 1412 VQQIYVGPFRVVKKINDNAYELDLNSHKKKHRVINVQFLKKFV 1445

BLAST of CSPI04G27540 vs. Swiss-Prot
Match: TF22_SCHPO (Transposon Tf2-2 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-2 PE=3 SV=1)

HSP 1 Score: 488.4 bits (1256), Expect = 1.9e-136
Identity = 297/890 (33.37%), Postives = 459/890 (51.57%), Query Frame = 1

Query: 50   VLKKFEDVF--TWPETLP-PRRSIEHHIYLKQGTDPVNVRPYRYGYQQKAEMERLVEEML 109
            + K+F+D+   T  E LP P + +E  + L Q    + +R Y     +   M   + + L
Sbjct: 377  IYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGL 436

Query: 110  SSGVIRPSNSPYSSPVLLVRKKDGSWRFCVDYRVLNSVTIPDKFPIPVIEELFDELNGAR 169
             SG+IR S +  + PV+ V KK+G+ R  VDY+ LN    P+ +P+P+IE+L  ++ G+ 
Sbjct: 437  KSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGST 496

Query: 170  WFSKIDLKAGYHQIRMASGDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQSLMNTVFKPY 229
             F+K+DLK+ YH IR+  GD  K AFR   G +E+LVMP+G++ AP+ FQ  +NT+    
Sbjct: 497  IFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEA 556

Query: 230  LRKFILVFFDDILIYSKNLEVHLTHLGLALEILRRNELYANRKKCSFAQERVDYLGHIIS 289
                ++ + DDILI+SK+   H+ H+   L+ L+   L  N+ KC F Q +V ++G+ IS
Sbjct: 557  KESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHIS 616

Query: 290  AQGVEVDPEKIRAIKEWPTPTNIREVRGFLGLTGYYRKFVQHYGSMAAPLTQLVKKG-GF 349
             +G     E I  + +W  P N +E+R FLG   Y RKF+     +  PL  L+KK   +
Sbjct: 617  EKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRW 676

Query: 350  NWTDNSEEAFQRLQQAMMTLPVLALPDFSSTFELETDASGYGIGAVLMQAKK-----PIA 409
             WT    +A + ++Q +++ PVL   DFS    LETDAS   +GAVL Q        P+ 
Sbjct: 677  KWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVG 736

Query: 410  YFSHTLAVRDRVKPVYERELMAVVMAVQRWRPYLLG--KPFIVRTDQKSL--KFLLEQRV 469
            Y+S  ++       V ++E++A++ +++ WR YL    +PF + TD ++L  +   E   
Sbjct: 737  YYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEP 796

Query: 470  IQPQYQKWVAKLLGYSFEVQYKPGLENKAADALSR-------VPSAVQFSSLTAPALIDL 529
               +  +W   L  ++FE+ Y+PG  N  ADALSR       +P   + +S+     I +
Sbjct: 797  ENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEPIPKDSEDNSINFVNQISI 856

Query: 530  L--VVKKEVEEDTRLRKVWDELQSGEENTERKFSIRHGML-RYKDRLVLSQSSALIPAIL 589
                  + V E T   K+ + L + ++  E    ++ G+L   KD+++L   + L   I+
Sbjct: 857  TDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLPNDTQLTRTII 916

Query: 590  YTYHDSVIGGHSGFLRTYKRITGELYWAGMKTDIKRYCDECLICQKNKSLALTPAGLLLP 649
              YH+     H G       I     W G++  I+ Y   C  CQ NKS    P G L P
Sbjct: 917  KKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQP 976

Query: 650  LEVPTNIWSDISMDFIEGLPKSCGFETIFVVVDRFSKYGHFLALKHPFTAKTVAEVFVKE 709
            +      W  +SMDFI  LP+S G+  +FVVVDRFSK    +      TA+  A +F + 
Sbjct: 977  IPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQR 1036

Query: 710  IVRLHGFPKSIVSDRDKVFVSSFWKGMFKLAGTKLNRSTAYHPQTDGQTEVVNRSVETYL 769
            ++   G PK I++D D +F S  WK         +  S  Y PQTDGQTE  N++VE  L
Sbjct: 1037 VIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLL 1096

Query: 770  RCFCSEKPKEWAKWLHWAEYWYNTTFHRSLGITPFQAVY----GRTPPPLLYYGDQSTSN 829
            RC CS  P  W   +   +  YN   H +  +TPF+ V+      +P  L  + D++   
Sbjct: 1097 RCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSFSDKT--- 1156

Query: 830  FLLDEQLKARDEVLEVLKEHLRVAQDKMKKTADLKRRDV-EYKVGDMVFLKIRPYRQSSL 889
               DE  +   +V + +KEHL     KMKK  D+K +++ E++ GD+V +K    R  + 
Sbjct: 1157 ---DENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK----RTKTG 1216

Query: 890  RKKKNEKLSPKFFGPFEVTERIGLVAYKLQLPKS--SCIHPVFHVSQLRK 910
               K+ KL+P F GPF V ++ G   Y+L LP S        FHVS L K
Sbjct: 1217 FLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEK 1256

BLAST of CSPI04G27540 vs. Swiss-Prot
Match: TF212_SCHPO (Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-12 PE=3 SV=1)

HSP 1 Score: 488.4 bits (1256), Expect = 1.9e-136
Identity = 297/890 (33.37%), Postives = 459/890 (51.57%), Query Frame = 1

Query: 50   VLKKFEDVF--TWPETLP-PRRSIEHHIYLKQGTDPVNVRPYRYGYQQKAEMERLVEEML 109
            + K+F+D+   T  E LP P + +E  + L Q    + +R Y     +   M   + + L
Sbjct: 377  IYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGL 436

Query: 110  SSGVIRPSNSPYSSPVLLVRKKDGSWRFCVDYRVLNSVTIPDKFPIPVIEELFDELNGAR 169
             SG+IR S +  + PV+ V KK+G+ R  VDY+ LN    P+ +P+P+IE+L  ++ G+ 
Sbjct: 437  KSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGST 496

Query: 170  WFSKIDLKAGYHQIRMASGDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQSLMNTVFKPY 229
             F+K+DLK+ YH IR+  GD  K AFR   G +E+LVMP+G++ AP+ FQ  +NT+    
Sbjct: 497  IFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEA 556

Query: 230  LRKFILVFFDDILIYSKNLEVHLTHLGLALEILRRNELYANRKKCSFAQERVDYLGHIIS 289
                ++ + DDILI+SK+   H+ H+   L+ L+   L  N+ KC F Q +V ++G+ IS
Sbjct: 557  KESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHIS 616

Query: 290  AQGVEVDPEKIRAIKEWPTPTNIREVRGFLGLTGYYRKFVQHYGSMAAPLTQLVKKG-GF 349
             +G     E I  + +W  P N +E+R FLG   Y RKF+     +  PL  L+KK   +
Sbjct: 617  EKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRW 676

Query: 350  NWTDNSEEAFQRLQQAMMTLPVLALPDFSSTFELETDASGYGIGAVLMQAKK-----PIA 409
             WT    +A + ++Q +++ PVL   DFS    LETDAS   +GAVL Q        P+ 
Sbjct: 677  KWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVG 736

Query: 410  YFSHTLAVRDRVKPVYERELMAVVMAVQRWRPYLLG--KPFIVRTDQKSL--KFLLEQRV 469
            Y+S  ++       V ++E++A++ +++ WR YL    +PF + TD ++L  +   E   
Sbjct: 737  YYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEP 796

Query: 470  IQPQYQKWVAKLLGYSFEVQYKPGLENKAADALSR-------VPSAVQFSSLTAPALIDL 529
               +  +W   L  ++FE+ Y+PG  N  ADALSR       +P   + +S+     I +
Sbjct: 797  ENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEPIPKDSEDNSINFVNQISI 856

Query: 530  L--VVKKEVEEDTRLRKVWDELQSGEENTERKFSIRHGML-RYKDRLVLSQSSALIPAIL 589
                  + V E T   K+ + L + ++  E    ++ G+L   KD+++L   + L   I+
Sbjct: 857  TDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLPNDTQLTRTII 916

Query: 590  YTYHDSVIGGHSGFLRTYKRITGELYWAGMKTDIKRYCDECLICQKNKSLALTPAGLLLP 649
              YH+     H G       I     W G++  I+ Y   C  CQ NKS    P G L P
Sbjct: 917  KKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQP 976

Query: 650  LEVPTNIWSDISMDFIEGLPKSCGFETIFVVVDRFSKYGHFLALKHPFTAKTVAEVFVKE 709
            +      W  +SMDFI  LP+S G+  +FVVVDRFSK    +      TA+  A +F + 
Sbjct: 977  IPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQR 1036

Query: 710  IVRLHGFPKSIVSDRDKVFVSSFWKGMFKLAGTKLNRSTAYHPQTDGQTEVVNRSVETYL 769
            ++   G PK I++D D +F S  WK         +  S  Y PQTDGQTE  N++VE  L
Sbjct: 1037 VIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLL 1096

Query: 770  RCFCSEKPKEWAKWLHWAEYWYNTTFHRSLGITPFQAVY----GRTPPPLLYYGDQSTSN 829
            RC CS  P  W   +   +  YN   H +  +TPF+ V+      +P  L  + D++   
Sbjct: 1097 RCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSFSDKT--- 1156

Query: 830  FLLDEQLKARDEVLEVLKEHLRVAQDKMKKTADLKRRDV-EYKVGDMVFLKIRPYRQSSL 889
               DE  +   +V + +KEHL     KMKK  D+K +++ E++ GD+V +K    R  + 
Sbjct: 1157 ---DENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK----RTKTG 1216

Query: 890  RKKKNEKLSPKFFGPFEVTERIGLVAYKLQLPKS--SCIHPVFHVSQLRK 910
               K+ KL+P F GPF V ++ G   Y+L LP S        FHVS L K
Sbjct: 1217 FLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEK 1256

BLAST of CSPI04G27540 vs. Swiss-Prot
Match: TF21_SCHPO (Transposon Tf2-1 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-1 PE=3 SV=1)

HSP 1 Score: 488.4 bits (1256), Expect = 1.9e-136
Identity = 297/890 (33.37%), Postives = 459/890 (51.57%), Query Frame = 1

Query: 50   VLKKFEDVF--TWPETLP-PRRSIEHHIYLKQGTDPVNVRPYRYGYQQKAEMERLVEEML 109
            + K+F+D+   T  E LP P + +E  + L Q    + +R Y     +   M   + + L
Sbjct: 377  IYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGL 436

Query: 110  SSGVIRPSNSPYSSPVLLVRKKDGSWRFCVDYRVLNSVTIPDKFPIPVIEELFDELNGAR 169
             SG+IR S +  + PV+ V KK+G+ R  VDY+ LN    P+ +P+P+IE+L  ++ G+ 
Sbjct: 437  KSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGST 496

Query: 170  WFSKIDLKAGYHQIRMASGDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQSLMNTVFKPY 229
             F+K+DLK+ YH IR+  GD  K AFR   G +E+LVMP+G++ AP+ FQ  +NT+    
Sbjct: 497  IFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEA 556

Query: 230  LRKFILVFFDDILIYSKNLEVHLTHLGLALEILRRNELYANRKKCSFAQERVDYLGHIIS 289
                ++ + DDILI+SK+   H+ H+   L+ L+   L  N+ KC F Q +V ++G+ IS
Sbjct: 557  KESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHIS 616

Query: 290  AQGVEVDPEKIRAIKEWPTPTNIREVRGFLGLTGYYRKFVQHYGSMAAPLTQLVKKG-GF 349
             +G     E I  + +W  P N +E+R FLG   Y RKF+     +  PL  L+KK   +
Sbjct: 617  EKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRW 676

Query: 350  NWTDNSEEAFQRLQQAMMTLPVLALPDFSSTFELETDASGYGIGAVLMQAKK-----PIA 409
             WT    +A + ++Q +++ PVL   DFS    LETDAS   +GAVL Q        P+ 
Sbjct: 677  KWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVG 736

Query: 410  YFSHTLAVRDRVKPVYERELMAVVMAVQRWRPYLLG--KPFIVRTDQKSL--KFLLEQRV 469
            Y+S  ++       V ++E++A++ +++ WR YL    +PF + TD ++L  +   E   
Sbjct: 737  YYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEP 796

Query: 470  IQPQYQKWVAKLLGYSFEVQYKPGLENKAADALSR-------VPSAVQFSSLTAPALIDL 529
               +  +W   L  ++FE+ Y+PG  N  ADALSR       +P   + +S+     I +
Sbjct: 797  ENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEPIPKDSEDNSINFVNQISI 856

Query: 530  L--VVKKEVEEDTRLRKVWDELQSGEENTERKFSIRHGML-RYKDRLVLSQSSALIPAIL 589
                  + V E T   K+ + L + ++  E    ++ G+L   KD+++L   + L   I+
Sbjct: 857  TDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLPNDTQLTRTII 916

Query: 590  YTYHDSVIGGHSGFLRTYKRITGELYWAGMKTDIKRYCDECLICQKNKSLALTPAGLLLP 649
              YH+     H G       I     W G++  I+ Y   C  CQ NKS    P G L P
Sbjct: 917  KKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQP 976

Query: 650  LEVPTNIWSDISMDFIEGLPKSCGFETIFVVVDRFSKYGHFLALKHPFTAKTVAEVFVKE 709
            +      W  +SMDFI  LP+S G+  +FVVVDRFSK    +      TA+  A +F + 
Sbjct: 977  IPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQR 1036

Query: 710  IVRLHGFPKSIVSDRDKVFVSSFWKGMFKLAGTKLNRSTAYHPQTDGQTEVVNRSVETYL 769
            ++   G PK I++D D +F S  WK         +  S  Y PQTDGQTE  N++VE  L
Sbjct: 1037 VIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLL 1096

Query: 770  RCFCSEKPKEWAKWLHWAEYWYNTTFHRSLGITPFQAVY----GRTPPPLLYYGDQSTSN 829
            RC CS  P  W   +   +  YN   H +  +TPF+ V+      +P  L  + D++   
Sbjct: 1097 RCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSFSDKT--- 1156

Query: 830  FLLDEQLKARDEVLEVLKEHLRVAQDKMKKTADLKRRDV-EYKVGDMVFLKIRPYRQSSL 889
               DE  +   +V + +KEHL     KMKK  D+K +++ E++ GD+V +K    R  + 
Sbjct: 1157 ---DENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK----RTKTG 1216

Query: 890  RKKKNEKLSPKFFGPFEVTERIGLVAYKLQLPKS--SCIHPVFHVSQLRK 910
               K+ KL+P F GPF V ++ G   Y+L LP S        FHVS L K
Sbjct: 1217 FLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEK 1256

BLAST of CSPI04G27540 vs. Swiss-Prot
Match: TF23_SCHPO (Transposon Tf2-3 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-3 PE=1 SV=1)

HSP 1 Score: 488.4 bits (1256), Expect = 1.9e-136
Identity = 297/890 (33.37%), Postives = 459/890 (51.57%), Query Frame = 1

Query: 50   VLKKFEDVF--TWPETLP-PRRSIEHHIYLKQGTDPVNVRPYRYGYQQKAEMERLVEEML 109
            + K+F+D+   T  E LP P + +E  + L Q    + +R Y     +   M   + + L
Sbjct: 377  IYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRNYPLPPGKMQAMNDEINQGL 436

Query: 110  SSGVIRPSNSPYSSPVLLVRKKDGSWRFCVDYRVLNSVTIPDKFPIPVIEELFDELNGAR 169
             SG+IR S +  + PV+ V KK+G+ R  VDY+ LN    P+ +P+P+IE+L  ++ G+ 
Sbjct: 437  KSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGST 496

Query: 170  WFSKIDLKAGYHQIRMASGDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQSLMNTVFKPY 229
             F+K+DLK+ YH IR+  GD  K AFR   G +E+LVMP+G++ AP+ FQ  +NT+    
Sbjct: 497  IFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEA 556

Query: 230  LRKFILVFFDDILIYSKNLEVHLTHLGLALEILRRNELYANRKKCSFAQERVDYLGHIIS 289
                ++ + DDILI+SK+   H+ H+   L+ L+   L  N+ KC F Q +V ++G+ IS
Sbjct: 557  KESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHIS 616

Query: 290  AQGVEVDPEKIRAIKEWPTPTNIREVRGFLGLTGYYRKFVQHYGSMAAPLTQLVKKG-GF 349
             +G     E I  + +W  P N +E+R FLG   Y RKF+     +  PL  L+KK   +
Sbjct: 617  EKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRW 676

Query: 350  NWTDNSEEAFQRLQQAMMTLPVLALPDFSSTFELETDASGYGIGAVLMQAKK-----PIA 409
             WT    +A + ++Q +++ PVL   DFS    LETDAS   +GAVL Q        P+ 
Sbjct: 677  KWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVG 736

Query: 410  YFSHTLAVRDRVKPVYERELMAVVMAVQRWRPYLLG--KPFIVRTDQKSL--KFLLEQRV 469
            Y+S  ++       V ++E++A++ +++ WR YL    +PF + TD ++L  +   E   
Sbjct: 737  YYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEP 796

Query: 470  IQPQYQKWVAKLLGYSFEVQYKPGLENKAADALSR-------VPSAVQFSSLTAPALIDL 529
               +  +W   L  ++FE+ Y+PG  N  ADALSR       +P   + +S+     I +
Sbjct: 797  ENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEPIPKDSEDNSINFVNQISI 856

Query: 530  L--VVKKEVEEDTRLRKVWDELQSGEENTERKFSIRHGML-RYKDRLVLSQSSALIPAIL 589
                  + V E T   K+ + L + ++  E    ++ G+L   KD+++L   + L   I+
Sbjct: 857  TDDFKNQVVTEYTNDTKLLNLLNNEDKRVEENIQLKDGLLINSKDQILLPNDTQLTRTII 916

Query: 590  YTYHDSVIGGHSGFLRTYKRITGELYWAGMKTDIKRYCDECLICQKNKSLALTPAGLLLP 649
              YH+     H G       I     W G++  I+ Y   C  CQ NKS    P G L P
Sbjct: 917  KKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQP 976

Query: 650  LEVPTNIWSDISMDFIEGLPKSCGFETIFVVVDRFSKYGHFLALKHPFTAKTVAEVFVKE 709
            +      W  +SMDFI  LP+S G+  +FVVVDRFSK    +      TA+  A +F + 
Sbjct: 977  IPPSERPWESLSMDFITALPESSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQR 1036

Query: 710  IVRLHGFPKSIVSDRDKVFVSSFWKGMFKLAGTKLNRSTAYHPQTDGQTEVVNRSVETYL 769
            ++   G PK I++D D +F S  WK         +  S  Y PQTDGQTE  N++VE  L
Sbjct: 1037 VIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLL 1096

Query: 770  RCFCSEKPKEWAKWLHWAEYWYNTTFHRSLGITPFQAVY----GRTPPPLLYYGDQSTSN 829
            RC CS  P  W   +   +  YN   H +  +TPF+ V+      +P  L  + D++   
Sbjct: 1097 RCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSFSDKT--- 1156

Query: 830  FLLDEQLKARDEVLEVLKEHLRVAQDKMKKTADLKRRDV-EYKVGDMVFLKIRPYRQSSL 889
               DE  +   +V + +KEHL     KMKK  D+K +++ E++ GD+V +K    R  + 
Sbjct: 1157 ---DENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVK----RTKTG 1216

Query: 890  RKKKNEKLSPKFFGPFEVTERIGLVAYKLQLPKS--SCIHPVFHVSQLRK 910
               K+ KL+P F GPF V ++ G   Y+L LP S        FHVS L K
Sbjct: 1217 FLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEK 1256

BLAST of CSPI04G27540 vs. TrEMBL
Match: E2DMZ5_BETVU (Putative uncharacterized protein OS=Beta vulgaris PE=4 SV=1)

HSP 1 Score: 1120.5 bits (2897), Expect = 0.0e+00
Identity = 555/1015 (54.68%), Postives = 720/1015 (70.94%), Query Frame = 1

Query: 2    VGLKNMIKSWRDSDQGFLIECRAMETMYEPPEDNGIEEVLAVDEAVSVVLKKFEDVFTWP 61
            V LK M ++ R    GFL++   M +    P      E+  V   +  +L  ++ VF  P
Sbjct: 548  VSLKAMYRTLRKEGGGFLVDLNQMASHEGLPR-----ELPEVPSCLQPLLSSYQQVFNMP 607

Query: 62   ETLPPRRSIEHHIYLKQGTDPVNVRPYRYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSP 121
              LPP R   H I L+ GT+PV+VRPYRY   QK E+E+L+ +ML++G+I+ S+S +SSP
Sbjct: 608  LGLPPDRGHVHAINLQHGTNPVSVRPYRYPQSQKDEIEQLIHDMLAAGIIQQSHSAFSSP 667

Query: 122  VLLVRKKDGSWRFCVDYRVLNSVTIPDKFPIPVIEELFDELNGARWFSKIDLKAGYHQIR 181
            VLLV+KKDGSWRFCVDYR LN+VT+PDK+PIP+I+EL DEL+GA  FSK+DLK+GYHQI+
Sbjct: 668  VLLVKKKDGSWRFCVDYRALNNVTVPDKYPIPIIDELLDELHGACVFSKLDLKSGYHQIK 727

Query: 182  MASGDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQSLMNTVFKPYLRKFILVFFDDILIY 241
            M   D+ KTAFRTHEGHYEFLVMPFGLTNAP+TFQ+LMN VFKPYLRKF+LVFFDDIL+Y
Sbjct: 728  MKPSDVHKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNEVFKPYLRKFVLVFFDDILVY 787

Query: 242  SKNLEVHLTHLGLALEILRRNELYANRKKCSFAQERVDYLGHIISAQGVEVDPEKIRAIK 301
            S +LE H+ HL + L +L  N L+AN KKC F +E V YLGHIIS++GV +DP K++A+ 
Sbjct: 788  STSLEQHMHHLNVVLGLLATNHLFANLKKCEFGKEEVAYLGHIISSKGVAMDPSKVQAMM 847

Query: 302  EWPTPTNIREVRGFLGLTGYYRKFVQHYGSMAAPLTQLVKKGGFNWTDNSEEAFQRLQQA 361
            +W  P+ +RE+RGFLGLTGYYR+FV+ Y S+A PLT  +KK  F W+  +  AF+ L++A
Sbjct: 848  DWSIPSTLRELRGFLGLTGYYRRFVKGYASIAHPLTNQLKKDSFGWSPAATRAFETLKRA 907

Query: 362  MMTLPVLALPDFSSTFELETDASGYGIGAVLMQAKKPIAYFSHTLAVRDRVKPVYERELM 421
            +   PVL +P+FS  F +E DASGYG+GAVL+Q   PIAYFS TL  R R K +YE+ELM
Sbjct: 908  LTEAPVLQMPNFSLPFVIEADASGYGLGAVLLQQGHPIAYFSKTLGERARAKSIYEKELM 967

Query: 422  AVVMAVQRWRPYLLGKPFIVRTDQKSLKFLLEQRVIQPQYQKWVAKLLGYSFEVQYKPGL 481
            AVVMAVQ+W+ +LLG+ F++ +DQ+SL+ LL QR I P YQKWV KLLG+ FE++YKPG 
Sbjct: 968  AVVMAVQKWKHFLLGRHFVIHSDQQSLRHLLNQREIGPAYQKWVGKLLGFDFEIKYKPGG 1027

Query: 482  ENKAADALSRV-PSAVQFSSLTAPALIDLLVVKKEVEEDTRLRKVWDELQSGEENTERKF 541
             NK ADALSR  P   +++ LT+       ++ + + +D  L+ +  E+ +G    +  F
Sbjct: 1028 HNKVADALSRKHPPEAEYNLLTSSHSPHQELIAQAIRQDADLQHLMAEVTAGRTPLQ-GF 1087

Query: 542  SIRHGMLRYKDRLVLSQSSALIPAILYTYHDSVIGGHSGFLRTYKRITGELYWAGMKTDI 601
            ++ HG+L+Y  RLV+ ++  L   +L  YH S +GGHSG  +TYKR+ GE YW GMK D+
Sbjct: 1088 TVEHGLLKYNGRLVIPKNVPLTTTLLEEYHSSPMGGHSGIFKTYKRLAGEWYWKGMKKDV 1147

Query: 602  KRYCDECLICQKNKSLALTPAGLLLPLEVPTNIWSDISMDFIEGLPKSCGFETIFVVVDR 661
              +   C ICQ+ K+  L+PAGLL PL +P  IW DISMDF+EGLPKS G++TI VVVDR
Sbjct: 1148 TTFVQNCQICQQFKTSTLSPAGLLQPLPIPLAIWEDISMDFVEGLPKSQGWDTILVVVDR 1207

Query: 662  FSKYGHFLALKHPFTAKTVAEVFVKEIVRLHGFPKSIVSDRDKVFVSSFWKGMFKLAGTK 721
             SKY HF+ LKHPFTA TVA VF+KEIV+LHGFP +IVSDRDKVF+S FWK +FKL GT 
Sbjct: 1208 LSKYAHFITLKHPFTAPTVAAVFIKEIVKLHGFPSTIVSDRDKVFMSLFWKELFKLQGTL 1267

Query: 722  LNRSTAYHPQTDGQTEVVNRSVETYLRCFCSEKPKEWAKWLHWAEYWYNTTFHRSLGITP 781
            L+RSTAYHPQ+DGQTEVVN+S+E YLRCFC+ +PK WA+W+ WAEYWYNT+ H S   TP
Sbjct: 1268 LHRSTAYHPQSDGQTEVVNKSLEAYLRCFCNGRPKAWAQWISWAEYWYNTSTHSSSHFTP 1327

Query: 782  FQAVYGRTPPPLLYYGDQSTSNFLLDEQLKARDEVLEVLKEHLRVAQDKMKKTADLKRRD 841
            F+ VYGR  PPL  +   ST+ F L+EQL  RD  L+ LK HL  AQ+ MK   D  RR 
Sbjct: 1328 FKIVYGRDSPPLFRFEKGSTAIFSLEEQLLDRDATLDELKFHLLEAQNSMKIQEDKHRRA 1387

Query: 842  VEYKVGDMVFLKIRPYRQSSLRKKKNEKLSPKFFGPFEVTERIGLVAYKLQLPKSSCIHP 901
            V ++ G MV+LKI+PYR  SL KK+NEKL+P+F+GPF V +RIG VAY+LQLP  + +HP
Sbjct: 1388 VHFEPGAMVYLKIQPYRHQSLAKKRNEKLAPRFYGPFSVLKRIGQVAYQLQLPLGAKLHP 1447

Query: 902  VFHVSQLRKMVGNHTMLKPEEMACLNENYEWLAIPE---EIYGYSKNKEGMWEVLIKWQG 961
            VFH+SQL+K VG+     P     L  +    A PE    I  + +    + EVLIKW  
Sbjct: 1448 VFHISQLKKAVGS-LQSSPTIPPQLTNDLVLDAQPESLLNIRSHPQKPAEVTEVLIKWLN 1507

Query: 962  LPPQDASWEEYEEFQKKFPNFHLEDKV------HLERECNDRPPIIHQYSRRKKK 1007
            LP  +A+WE+   F  +FP+FHLEDKV        +      PPI+H YSRR+KK
Sbjct: 1508 LPAFEATWEDAALFNARFPDFHLEDKVLNWEGSIAKSPTRIIPPIVHTYSRRRKK 1555

BLAST of CSPI04G27540 vs. TrEMBL
Match: B5U9V8_LOTJA (Putative uncharacterized protein OS=Lotus japonicus PE=4 SV=1)

HSP 1 Score: 1118.6 bits (2892), Expect = 0.0e+00
Identity = 536/970 (55.26%), Postives = 711/970 (73.30%), Query Frame = 1

Query: 43   VDEAVSVVLKKFEDVFTWPETLPPRRSIEHHIYLKQGTDPVNVRPYRYGYQQKAEMERLV 102
            V E +  +L+++ +VF  P+ LPPRR+ +H I L++G    N+RPYRY + QK E+E+LV
Sbjct: 586  VPEGMRKILEEYPEVFQEPKGLPPRRTTDHAIQLQEGASIPNIRPYRYPFYQKNEIEKLV 645

Query: 103  EEMLSSGVIRPSNSPYSSPVLLVRKKDGSWRFCVDYRVLNSVTIPDKFPIPVIEELFDEL 162
            +EML+SG+IR S SP+SSP +LV+KKDG WRFCVDYR LN  TIPDKFPIP+I+EL DE+
Sbjct: 646  KEMLNSGIIRHSTSPFSSPAILVKKKDGGWRFCVDYRALNKATIPDKFPIPIIDELLDEI 705

Query: 163  NGARWFSKIDLKAGYHQIRMASGDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQSLMNTV 222
              A  FSK+DLK+GYHQIRM   DI KTAFRTHEGHYE+LV+PFGLTNAPSTFQ+LMN V
Sbjct: 706  GAAVVFSKLDLKSGYHQIRMKEEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQV 765

Query: 223  FKPYLRKFILVFFDDILIYSKNLEVHLTHLGLALEILRRNELYANRKKCSFAQERVDYLG 282
             +PYLRKF+LVFFDDILIYSKN E+H  HL + L++L+ N L AN+KKCSF Q  + YLG
Sbjct: 766  LRPYLRKFVLVFFDDILIYSKNEELHKDHLRIVLQVLKENNLVANQKKCSFGQPEIIYLG 825

Query: 283  HIISAQGVEVDPEKIRAIKEWPTPTNIREVRGFLGLTGYYRKFVQHYGSMAAPLTQLVKK 342
            H+IS  GV  DP KI+ + +WP P  ++ +RGFLGLTGYYR+FV++Y  +A PL QL+KK
Sbjct: 826  HVISQAGVAADPSKIKDMLDWPIPKEVKGLRGFLGLTGYYRRFVKNYSKLAQPLNQLLKK 885

Query: 343  GGFNWTDNSEEAFQRLQQAMMTLPVLALPDFSSTFELETDASGYGIGAVLMQAKKPIAYF 402
              F WT+ + +AF +L++ M T+PVL  P+F   F LETDASG G+GAVLMQ  +P+AY 
Sbjct: 886  NSFQWTEGATQAFVKLKEVMTTVPVLVPPNFDKPFILETDASGKGLGAVLMQEGRPVAYM 945

Query: 403  SHTLAVRDRVKPVYERELMAVVMAVQRWRPYLLGKPFIVRTDQKSLKFLLEQRVIQPQYQ 462
            S TL+ R + K VYERELMAVV+AVQ+WR YLLG  F++ TDQ+SL+FL +QR++  + Q
Sbjct: 946  SKTLSDRAQAKSVYERELMAVVLAVQKWRHYLLGSKFVIHTDQRSLRFLADQRIMGEEQQ 1005

Query: 463  KWVAKLLGYSFEVQYKPGLENKAADALSRVPSAVQFSSLTAPALIDLLVVKKEVEEDTRL 522
            KW++KL+GY FE++YKPG+ENKAADALSR    +QFS++++    +   ++ E+ ED R 
Sbjct: 1006 KWMSKLMGYDFEIKYKPGIENKAADALSR---KLQFSAISSVQCAEWADLEAEILEDERY 1065

Query: 523  RKVWDELQSGEENTERKFSIRHGMLRYKDRLVLSQSSALIPAILYTYHDSVIGGHSGFLR 582
            RKV  EL + + N+   + ++ G L YKDR+VL + S  I  +L  +HD+ IGGH+G  R
Sbjct: 1066 RKVLQELAT-QGNSAVGYQLKRGRLLYKDRIVLPKGSTKILTVLKEFHDTAIGGHAGIFR 1125

Query: 583  TYKRITGELYWAGMKTDIKRYCDECLICQKNKSLALTPAGLLLPLEVPTNIWSDISMDFI 642
            TYKRI+   YW GMK DI+ Y  +C +CQ+NK  AL PAG L PL +P+  W+DISMDFI
Sbjct: 1126 TYKRISALFYWEGMKLDIQNYVQKCEVCQRNKYEALNPAGFLQPLPIPSQGWTDISMDFI 1185

Query: 643  EGLPKSCGFETIFVVVDRFSKYGHFLALKHPFTAKTVAEVFVKEIVRLHGFPKSIVSDRD 702
             GLPK+ G +TI VVVDRF+KY HF+AL HP+ AK +AEVF+KE+VRLHGFP SIVSDRD
Sbjct: 1186 GGLPKAMGKDTILVVVDRFTKYAHFIALSHPYNAKEIAEVFIKEVVRLHGFPTSIVSDRD 1245

Query: 703  KVFVSSFWKGMFKLAGTKLNRSTAYHPQTDGQTEVVNRSVETYLRCFCSEKPKEWAKWLH 762
            +VF+S+FW  MFKLAGTKL  S+AYHPQTDGQTEVVNR VETYLRC    KPK+W KWL 
Sbjct: 1246 RVFLSTFWSEMFKLAGTKLKFSSAYHPQTDGQTEVVNRCVETYLRCVTGSKPKQWPKWLS 1305

Query: 763  WAEYWYNTTFHRSLGITPFQAVYGRTPPPLLYYGDQSTSNFLLDEQLKARDEVLEVLKEH 822
            WAE+WYNT +H ++  TPF+A+YGR PP +    D  TS   +++    R+ +LE LK +
Sbjct: 1306 WAEFWYNTNYHSAIKTTPFKALYGREPPVIFKGNDSLTSVDEVEKLTAERNLILEELKSN 1365

Query: 823  LRVAQDKMKKTADLKRRDVEYKVGDMVFLKIRPYRQSSLRKKKNEKLSPKFFGPFEVTER 882
            L  AQ++M++ A+  RRDV+Y+VGD+V+LKI+PY+  SL K+ N+KLSP+++GP+ +  +
Sbjct: 1366 LEKAQNRMRQQANKHRRDVQYEVGDLVYLKIQPYKLKSLAKRSNQKLSPRYYGPYPIIAK 1425

Query: 883  IGLVAYKLQLPKSSCIHPVFHVSQLRKMVGNHTMLKPEEMACLNENYEWLAIPEEIYGYS 942
            I   AYKLQLP+ S +HPVFH+S L+K V      +P   A L E +E    PE I    
Sbjct: 1426 INPAAYKLQLPEGSQVHPVFHISLLKKAVNAGVQSQPLP-AALTEEWELKVEPEAIMDTR 1485

Query: 943  KNKEGMWEVLIKWQGLPPQDASWEEYEEFQKKFPNFHLEDKVHLE-----RECNDRPPII 1002
            +N++G  EVLI+W+ LP  + SWE++ +   +FPN  LEDK++L+        + RP   
Sbjct: 1486 ENRDGDLEVLIRWKDLPTFEDSWEDFSKLLDQFPNHQLEDKLNLQGGRDVANPSSRPRFG 1545

Query: 1003 HQYSRRKKKQ 1008
            + Y+RR K Q
Sbjct: 1546 NVYARRPKPQ 1550

BLAST of CSPI04G27540 vs. TrEMBL
Match: B5U9W1_LOTJA (Putative uncharacterized protein OS=Lotus japonicus PE=4 SV=1)

HSP 1 Score: 1117.8 bits (2890), Expect = 0.0e+00
Identity = 535/970 (55.15%), Postives = 711/970 (73.30%), Query Frame = 1

Query: 43   VDEAVSVVLKKFEDVFTWPETLPPRRSIEHHIYLKQGTDPVNVRPYRYGYQQKAEMERLV 102
            V E +  +L+++ +VF  P+ LPPRR+ +H I L++G    N+RPYRY + QK E+E+LV
Sbjct: 586  VPEGMRKILEEYPEVFQEPKGLPPRRTTDHAIQLQEGASIPNIRPYRYPFYQKNEIEKLV 645

Query: 103  EEMLSSGVIRPSNSPYSSPVLLVRKKDGSWRFCVDYRVLNSVTIPDKFPIPVIEELFDEL 162
            +EML+SG+IR S SP+SSP +LV+KKDG WRFCVDYR LN  TIPDKFPIP+I+EL DE+
Sbjct: 646  KEMLNSGIIRHSTSPFSSPAILVKKKDGGWRFCVDYRALNKATIPDKFPIPIIDELLDEI 705

Query: 163  NGARWFSKIDLKAGYHQIRMASGDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQSLMNTV 222
              A  FSK+DLK+GYHQIRM   DI KTAFRTHEGHYE+LV+PFGLTNAPSTFQ+LMN V
Sbjct: 706  GAAVVFSKLDLKSGYHQIRMKEEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQV 765

Query: 223  FKPYLRKFILVFFDDILIYSKNLEVHLTHLGLALEILRRNELYANRKKCSFAQERVDYLG 282
             +PYLRKF+LVFFDDILIYSKN E+H  HL + L++L+ N L AN+KKCSF Q  + YLG
Sbjct: 766  LRPYLRKFVLVFFDDILIYSKNEELHKDHLRIVLQVLKENNLVANQKKCSFGQPEIIYLG 825

Query: 283  HIISAQGVEVDPEKIRAIKEWPTPTNIREVRGFLGLTGYYRKFVQHYGSMAAPLTQLVKK 342
            H+IS  GV  DP KI+ + +WP P  ++ +RGFLGLTGYYR+FV++Y  +A PL QL+KK
Sbjct: 826  HVISQAGVAADPSKIKDMLDWPIPKEVKGLRGFLGLTGYYRRFVKNYSKLAQPLNQLLKK 885

Query: 343  GGFNWTDNSEEAFQRLQQAMMTLPVLALPDFSSTFELETDASGYGIGAVLMQAKKPIAYF 402
              F WT+ + +AF +L++ M T+PVL  P+F   F LETDASG G+GAVLMQ  +P+AY 
Sbjct: 886  NSFQWTEGATQAFVKLKEVMTTVPVLVPPNFDKPFILETDASGKGLGAVLMQEGRPVAYM 945

Query: 403  SHTLAVRDRVKPVYERELMAVVMAVQRWRPYLLGKPFIVRTDQKSLKFLLEQRVIQPQYQ 462
            S TL+ R + K VYERELMAVV+AVQ+WR YLLG  F++ TDQ+SL+FL +QR++  + Q
Sbjct: 946  SKTLSDRAQAKSVYERELMAVVLAVQKWRHYLLGSKFVIHTDQRSLRFLADQRIMGEEQQ 1005

Query: 463  KWVAKLLGYSFEVQYKPGLENKAADALSRVPSAVQFSSLTAPALIDLLVVKKEVEEDTRL 522
            KW++KL+GY FE++YKPG+ENKAADALSR    +QFS++++    +   ++ E+ ED R 
Sbjct: 1006 KWMSKLMGYDFEIKYKPGIENKAADALSR---KLQFSAISSVQCAEWADLEAEILEDERY 1065

Query: 523  RKVWDELQSGEENTERKFSIRHGMLRYKDRLVLSQSSALIPAILYTYHDSVIGGHSGFLR 582
            RKV  EL + + N+   + ++ G L YKDR+VL + S  I  +L  +HD+ +GGH+G  R
Sbjct: 1066 RKVLQELAT-QGNSAVGYQLKRGRLLYKDRIVLPKGSTKILTVLKEFHDTALGGHAGIFR 1125

Query: 583  TYKRITGELYWAGMKTDIKRYCDECLICQKNKSLALTPAGLLLPLEVPTNIWSDISMDFI 642
            TYKRI+   YW GMK DI+ Y  +C +CQ+NK  AL PAG L PL +P+  W+DISMDFI
Sbjct: 1126 TYKRISALFYWEGMKLDIQNYVQKCEVCQRNKYEALNPAGFLQPLPIPSQGWTDISMDFI 1185

Query: 643  EGLPKSCGFETIFVVVDRFSKYGHFLALKHPFTAKTVAEVFVKEIVRLHGFPKSIVSDRD 702
             GLPK+ G +TI VVVDRF+KY HF+AL HP+ AK +AEVF+KE+VRLHGFP SIVSDRD
Sbjct: 1186 GGLPKTMGKDTILVVVDRFTKYAHFIALSHPYNAKEIAEVFIKEVVRLHGFPTSIVSDRD 1245

Query: 703  KVFVSSFWKGMFKLAGTKLNRSTAYHPQTDGQTEVVNRSVETYLRCFCSEKPKEWAKWLH 762
            +VF+S+FW  MFKLAGTKL  S+AYHPQTDGQTEVVNR VETYLRC    KPK+W KWL 
Sbjct: 1246 RVFLSTFWSEMFKLAGTKLKFSSAYHPQTDGQTEVVNRCVETYLRCVTGSKPKQWPKWLS 1305

Query: 763  WAEYWYNTTFHRSLGITPFQAVYGRTPPPLLYYGDQSTSNFLLDEQLKARDEVLEVLKEH 822
            WAE+WYNT +H ++  TPF+A+YGR PP +    D  TS   +++    R+ +LE LK +
Sbjct: 1306 WAEFWYNTNYHSAIKTTPFKALYGREPPVIFKGNDSLTSVDEVEKLTAERNLILEELKSN 1365

Query: 823  LRVAQDKMKKTADLKRRDVEYKVGDMVFLKIRPYRQSSLRKKKNEKLSPKFFGPFEVTER 882
            L  AQ++M++ A+  RRDV+Y+VGD+V+LKI+PY+  SL K+ N+KLSP+++GP+ +  +
Sbjct: 1366 LEKAQNRMRQQANKHRRDVQYEVGDLVYLKIQPYKLKSLAKRSNQKLSPRYYGPYPIIAK 1425

Query: 883  IGLVAYKLQLPKSSCIHPVFHVSQLRKMVGNHTMLKPEEMACLNENYEWLAIPEEIYGYS 942
            I   AYKLQLP+ S +HPVFH+S L+K V      +P   A L E +E    PE I    
Sbjct: 1426 INPAAYKLQLPEGSQVHPVFHISLLKKAVNAGVQSQPLP-AALTEEWELKVEPEAIMDTR 1485

Query: 943  KNKEGMWEVLIKWQGLPPQDASWEEYEEFQKKFPNFHLEDKVHLE-----RECNDRPPII 1002
            +N++G  EVLI+W+ LP  + SWE++ +   +FPN  LEDK++L+        + RP   
Sbjct: 1486 ENRDGDLEVLIRWKDLPTFEDSWEDFSKLLDQFPNHQLEDKLNLQGGRDVANPSSRPRFG 1545

Query: 1003 HQYSRRKKKQ 1008
            + Y+RR K Q
Sbjct: 1546 NVYARRPKPQ 1550

BLAST of CSPI04G27540 vs. TrEMBL
Match: B5U9W0_LOTJA (Putative uncharacterized protein OS=Lotus japonicus PE=4 SV=1)

HSP 1 Score: 1117.8 bits (2890), Expect = 0.0e+00
Identity = 535/970 (55.15%), Postives = 711/970 (73.30%), Query Frame = 1

Query: 43   VDEAVSVVLKKFEDVFTWPETLPPRRSIEHHIYLKQGTDPVNVRPYRYGYQQKAEMERLV 102
            V E +  +L+++ +VF  P+ LPPRR+ +H I L++G    N+RPYRY + QK E+E+LV
Sbjct: 586  VPEGMRKILEEYPEVFQEPKGLPPRRTTDHAIQLQEGASIPNIRPYRYPFYQKNEIEKLV 645

Query: 103  EEMLSSGVIRPSNSPYSSPVLLVRKKDGSWRFCVDYRVLNSVTIPDKFPIPVIEELFDEL 162
            +EML+SG+IR S SP+SSP +LV+KKDG WRFCVDYR LN  TIPDKFPIP+I+EL DE+
Sbjct: 646  KEMLNSGIIRHSTSPFSSPAILVKKKDGGWRFCVDYRALNKATIPDKFPIPIIDELLDEI 705

Query: 163  NGARWFSKIDLKAGYHQIRMASGDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQSLMNTV 222
              A  FSK+DLK+GYHQIRM   DI KTAFRTHEGHYE+LV+PFGLTNAPSTFQ+LMN V
Sbjct: 706  GAAVVFSKLDLKSGYHQIRMKEEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQV 765

Query: 223  FKPYLRKFILVFFDDILIYSKNLEVHLTHLGLALEILRRNELYANRKKCSFAQERVDYLG 282
             +PYLRKF+LVFFDDILIYSKN E+H  HL + L++L+ N L AN+KKCSF Q  + YLG
Sbjct: 766  LRPYLRKFVLVFFDDILIYSKNEELHKDHLRIVLQVLKENNLVANQKKCSFGQPEIIYLG 825

Query: 283  HIISAQGVEVDPEKIRAIKEWPTPTNIREVRGFLGLTGYYRKFVQHYGSMAAPLTQLVKK 342
            H+IS  GV  DP KI+ + +WP P  ++ +RGFLGLTGYYR+FV++Y  +A PL QL+KK
Sbjct: 826  HVISQAGVAADPSKIKDMLDWPIPKEVKGLRGFLGLTGYYRRFVKNYSKLAQPLNQLLKK 885

Query: 343  GGFNWTDNSEEAFQRLQQAMMTLPVLALPDFSSTFELETDASGYGIGAVLMQAKKPIAYF 402
              F WT+ + +AF +L++ M T+PVL  P+F   F LETDASG G+GAVLMQ  +P+AY 
Sbjct: 886  NSFQWTEGATQAFVKLKEVMTTVPVLVPPNFDKPFILETDASGKGLGAVLMQEGRPVAYM 945

Query: 403  SHTLAVRDRVKPVYERELMAVVMAVQRWRPYLLGKPFIVRTDQKSLKFLLEQRVIQPQYQ 462
            S TL+ R + K VYERELMAVV+AVQ+WR YLLG  F++ TDQ+SL+FL +QR++  + Q
Sbjct: 946  SKTLSDRAQAKSVYERELMAVVLAVQKWRHYLLGSKFVIHTDQRSLRFLADQRIMGEEQQ 1005

Query: 463  KWVAKLLGYSFEVQYKPGLENKAADALSRVPSAVQFSSLTAPALIDLLVVKKEVEEDTRL 522
            KW++KL+GY FE++YKPG+ENKAADALSR    +QFS++++    +   ++ E+ ED R 
Sbjct: 1006 KWMSKLMGYDFEIKYKPGIENKAADALSR---KLQFSAISSVQCAEWADLEAEILEDERY 1065

Query: 523  RKVWDELQSGEENTERKFSIRHGMLRYKDRLVLSQSSALIPAILYTYHDSVIGGHSGFLR 582
            RKV  EL + + N+   + ++ G L YKDR+VL + S  I  +L  +HD+ +GGH+G  R
Sbjct: 1066 RKVLQELAT-QGNSAVGYQLKRGRLLYKDRIVLPKGSTKILTVLKEFHDTALGGHAGIFR 1125

Query: 583  TYKRITGELYWAGMKTDIKRYCDECLICQKNKSLALTPAGLLLPLEVPTNIWSDISMDFI 642
            TYKRI+   YW GMK DI+ Y  +C +CQ+NK  AL PAG L PL +P+  W+DISMDFI
Sbjct: 1126 TYKRISALFYWEGMKLDIQNYVQKCEVCQRNKYEALNPAGFLQPLPIPSQGWTDISMDFI 1185

Query: 643  EGLPKSCGFETIFVVVDRFSKYGHFLALKHPFTAKTVAEVFVKEIVRLHGFPKSIVSDRD 702
             GLPK+ G +TI VVVDRF+KY HF+AL HP+ AK +AEVF+KE+VRLHGFP SIVSDRD
Sbjct: 1186 GGLPKAMGKDTILVVVDRFTKYAHFIALSHPYNAKEIAEVFIKEVVRLHGFPTSIVSDRD 1245

Query: 703  KVFVSSFWKGMFKLAGTKLNRSTAYHPQTDGQTEVVNRSVETYLRCFCSEKPKEWAKWLH 762
            +VF+S+FW  MFKLAGTKL  S+AYHPQTDGQTEVVNR VETYLRC    KPK+W KWL 
Sbjct: 1246 RVFLSTFWSEMFKLAGTKLKFSSAYHPQTDGQTEVVNRCVETYLRCVTGSKPKQWPKWLS 1305

Query: 763  WAEYWYNTTFHRSLGITPFQAVYGRTPPPLLYYGDQSTSNFLLDEQLKARDEVLEVLKEH 822
            WAE+WYNT +H ++  TPF+A+YGR PP +    D  TS   +++    R+ +LE LK +
Sbjct: 1306 WAEFWYNTNYHSAIKTTPFKALYGREPPVIFKGNDSLTSVDEVEKLTAERNLILEELKSN 1365

Query: 823  LRVAQDKMKKTADLKRRDVEYKVGDMVFLKIRPYRQSSLRKKKNEKLSPKFFGPFEVTER 882
            L  AQ++M++ A+  RRDV+Y+VGD+V+LKI+PY+  SL K+ N+KLSP+++GP+ +  +
Sbjct: 1366 LEKAQNRMRQQANKHRRDVQYEVGDLVYLKIQPYKLKSLAKRSNQKLSPRYYGPYPIIAK 1425

Query: 883  IGLVAYKLQLPKSSCIHPVFHVSQLRKMVGNHTMLKPEEMACLNENYEWLAIPEEIYGYS 942
            I   AYKLQLP+ S +HPVFH+S L+K V      +P   A L E +E    PE I    
Sbjct: 1426 INPAAYKLQLPEGSQVHPVFHISLLKKAVNAGVQSQPLP-AALTEEWELKVEPEAIMDTR 1485

Query: 943  KNKEGMWEVLIKWQGLPPQDASWEEYEEFQKKFPNFHLEDKVHLE-----RECNDRPPII 1002
            +N++G  EVLI+W+ LP  + SWE++ +   +FPN  LEDK++L+        + RP   
Sbjct: 1486 ENRDGDLEVLIRWKDLPTFEDSWEDFSKLLDQFPNHQLEDKLNLQGGRDVANPSSRPRFG 1545

Query: 1003 HQYSRRKKKQ 1008
            + Y+RR K Q
Sbjct: 1546 NVYARRPKPQ 1550

BLAST of CSPI04G27540 vs. TrEMBL
Match: B5U9W2_LOTJA (Putative uncharacterized protein OS=Lotus japonicus PE=4 SV=1)

HSP 1 Score: 1117.1 bits (2888), Expect = 0.0e+00
Identity = 539/986 (54.67%), Postives = 717/986 (72.72%), Query Frame = 1

Query: 28   MYEPPEDNGIEEVLA-VDEAVSVVLKKFEDVFTWPETLPPRRSIEHHIYLKQGTDPVNVR 87
            + E  ED   E+  A V E +  +L+++ +VF  P+ LPPRR+ +H I L++G    N+R
Sbjct: 381  LMEVEEDEEEEKTEAEVPEGMRKILEEYPEVFQEPKGLPPRRTTDHAIQLQEGASIPNIR 440

Query: 88   PYRYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSPVLLVRKKDGSWRFCVDYRVLNSVTI 147
            PYRY + QK E+E+LV+EML+SG+IR S SP+SSP +LV+KKDG WRFCVDYR LN  TI
Sbjct: 441  PYRYPFYQKNEIEKLVKEMLNSGIIRHSTSPFSSPAILVKKKDGGWRFCVDYRALNKATI 500

Query: 148  PDKFPIPVIEELFDELNGARWFSKIDLKAGYHQIRMASGDIEKTAFRTHEGHYEFLVMPF 207
            PDKFPIP+I+EL DE+  A  FSK+DLK+GYHQIRM   DI KTAFRTHEGHYE+LV+PF
Sbjct: 501  PDKFPIPIIDELLDEIGAAVVFSKLDLKSGYHQIRMKEEDIPKTAFRTHEGHYEYLVLPF 560

Query: 208  GLTNAPSTFQSLMNTVFKPYLRKFILVFFDDILIYSKNLEVHLTHLGLALEILRRNELYA 267
            GLTNAPSTFQ+LMN V +PYLRKF+LVFFDDILIYSKN E+H  HL + L++L+ N L A
Sbjct: 561  GLTNAPSTFQALMNQVLRPYLRKFVLVFFDDILIYSKNEELHKDHLRIVLQVLKENNLVA 620

Query: 268  NRKKCSFAQERVDYLGHIISAQGVEVDPEKIRAIKEWPTPTNIREVRGFLGLTGYYRKFV 327
            N+KKCSF Q  + YLGH+IS  GV  DP KI+ + +WP P  ++ +RGFLGLTGYYR+FV
Sbjct: 621  NQKKCSFGQPEIIYLGHVISQAGVAADPSKIKDMLDWPIPKEVKGLRGFLGLTGYYRRFV 680

Query: 328  QHYGSMAAPLTQLVKKGGFNWTDNSEEAFQRLQQAMMTLPVLALPDFSSTFELETDASGY 387
            ++Y  +A PL QL+KK  F WT+ + +AF +L++ M T+PVL  P+F   F LETDASG 
Sbjct: 681  KNYSKLAQPLNQLLKKNSFQWTEGATQAFVKLKEVMTTVPVLVPPNFDKPFILETDASGK 740

Query: 388  GIGAVLMQAKKPIAYFSHTLAVRDRVKPVYERELMAVVMAVQRWRPYLLGKPFIVRTDQK 447
            G+GAVLMQ  +P+AY S TL+ R + K VYERELMAVV+AVQ+WR YLLG  F++ TDQ+
Sbjct: 741  GLGAVLMQEGRPVAYMSKTLSDRAQAKSVYERELMAVVLAVQKWRHYLLGSKFVIHTDQR 800

Query: 448  SLKFLLEQRVIQPQYQKWVAKLLGYSFEVQYKPGLENKAADALSRVPSAVQFSSLTAPAL 507
            SL+FL +QR++  + QKW++KL+GY FE++YKPG+ENKAADALSR    +QFS++++   
Sbjct: 801  SLRFLADQRIMGEEQQKWMSKLMGYDFEIKYKPGIENKAADALSR---KLQFSAISSVQC 860

Query: 508  IDLLVVKKEVEEDTRLRKVWDELQSGEENTERKFSIRHGMLRYKDRLVLSQSSALIPAIL 567
             +   ++ E+ ED R RKV  EL + + N+   + ++ G L YKDR+VL + S  I  +L
Sbjct: 861  AEWADLEAEILEDERYRKVLQELAT-QGNSAVGYQLKRGRLLYKDRIVLPKGSTKILTVL 920

Query: 568  YTYHDSVIGGHSGFLRTYKRITGELYWAGMKTDIKRYCDECLICQKNKSLALTPAGLLLP 627
              +HD+ +GGH+G  RTYKRI+   YW GMK DI+ Y  +C +CQ+NK  AL PAG L P
Sbjct: 921  KEFHDTALGGHAGIFRTYKRISALFYWEGMKLDIQNYVQKCEVCQRNKYEALNPAGFLQP 980

Query: 628  LEVPTNIWSDISMDFIEGLPKSCGFETIFVVVDRFSKYGHFLALKHPFTAKTVAEVFVKE 687
            L +P+  W+DISMDFI GLPK+ G +TI VVVDRF+KY HF+AL HP+ AK +AEVF+KE
Sbjct: 981  LPIPSQGWTDISMDFIGGLPKAMGKDTILVVVDRFTKYAHFIALSHPYNAKEIAEVFIKE 1040

Query: 688  IVRLHGFPKSIVSDRDKVFVSSFWKGMFKLAGTKLNRSTAYHPQTDGQTEVVNRSVETYL 747
            +VRLHGFP SIVSDRD+VF+S+FW  MFKLAGTKL  S+AYHPQTDGQTEVVNR VETYL
Sbjct: 1041 VVRLHGFPTSIVSDRDRVFLSTFWSEMFKLAGTKLKFSSAYHPQTDGQTEVVNRCVETYL 1100

Query: 748  RCFCSEKPKEWAKWLHWAEYWYNTTFHRSLGITPFQAVYGRTPPPLLYYGDQSTSNFLLD 807
            RC    KPK+W KWL WAE+WYNT +H ++  TPF+A+YGR PP +    D  TS   ++
Sbjct: 1101 RCVTGSKPKQWPKWLSWAEFWYNTNYHSAIKTTPFKALYGREPPVIFKGNDSLTSVDEVE 1160

Query: 808  EQLKARDEVLEVLKEHLRVAQDKMKKTADLKRRDVEYKVGDMVFLKIRPYRQSSLRKKKN 867
            +    R+ +LE LK +L  AQ++M++ A+  RRDV+Y+VGD+V+LKI+PY+  SL K+ N
Sbjct: 1161 KLTAERNLILEELKSNLEKAQNRMRQQANKHRRDVQYEVGDLVYLKIQPYKLKSLAKRSN 1220

Query: 868  EKLSPKFFGPFEVTERIGLVAYKLQLPKSSCIHPVFHVSQLRKMVGNHTMLKPEEMACLN 927
            +KLSP+++GP+ +  +I   AYKLQLP+ S +HPVFH+S L+K        +P   A L 
Sbjct: 1221 QKLSPRYYGPYPIIAKINPAAYKLQLPEGSQVHPVFHISLLKKAENAGVQSQPLP-AALT 1280

Query: 928  ENYEWLAIPEEIYGYSKNKEGMWEVLIKWQGLPPQDASWEEYEEFQKKFPNFHLEDKVHL 987
            E +E    PE I    +N++G  EVLI+W+ LP  + SWE++ +   +FPN  LEDK++L
Sbjct: 1281 EEWELKVEPEAIMDTRENRDGDLEVLIRWKDLPTFEDSWEDFSKLLDQFPNHQLEDKLNL 1340

Query: 988  E-----RECNDRPPIIHQYSRRKKKQ 1008
            +        + RP   + Y+RR K Q
Sbjct: 1341 QGGRDVANPSSRPRFGNVYARRPKPQ 1361

BLAST of CSPI04G27540 vs. TAIR10
Match: ATMG00860.1 (ATMG00860.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 152.9 bits (385), Expect = 1.0e-36
Identity = 69/131 (52.67%), Postives = 93/131 (70.99%), Query Frame = 1

Query: 249 LTHLGLALEILRRNELYANRKKCSFAQERVDYLGH--IISAQGVEVDPEKIRAIKEWPTP 308
           + HLG+ L+I  +++ YANRKKC+F Q ++ YLGH  IIS +GV  DP K+ A+  WP P
Sbjct: 1   MNHLGMVLQIWEQHQFYANRKKCAFGQPQIAYLGHRHIISGEGVSADPAKLEAMVGWPEP 60

Query: 309 TNIREVRGFLGLTGYYRKFVQHYGSMAAPLTQLVKKGGFNWTDNSEEAFQRLQQAMMTLP 368
            N  E+RGFLGLTGYYR+FV++YG +  PLT+L+KK    WT+ +  AF+ L+ A+ TLP
Sbjct: 61  KNTTELRGFLGLTGYYRRFVKNYGKIVRPLTELLKKNSLKWTEMAALAFKALKGAVTTLP 120

Query: 369 VLALPDFSSTF 378
           VLALPD    F
Sbjct: 121 VLALPDLKLPF 131

BLAST of CSPI04G27540 vs. TAIR10
Match: ATMG00850.1 (ATMG00850.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 50.1 bits (118), Expect = 9.5e-06
Identity = 21/39 (53.85%), Postives = 30/39 (76.92%), Query Frame = 1

Query: 94  QKAEMERLVEEMLSSGVIRPSNSPYSSPVLLVRKKDGSW 133
           ++  ++  + EML + +I+PS SPYSSPVLLV+KKDG W
Sbjct: 41  RRTRLKNWLGEMLEARIIQPSISPYSSPVLLVQKKDGGW 79

BLAST of CSPI04G27540 vs. NCBI nr
Match: gi|729344250|ref|XP_010541181.1| (PREDICTED: uncharacterized protein LOC104814705 [Tarenaya hassleriana])

HSP 1 Score: 1196.0 bits (3093), Expect = 0.0e+00
Identity = 571/1014 (56.31%), Postives = 769/1014 (75.84%), Query Frame = 1

Query: 1    MVGLKNMIKSWRDSDQGFLIECRAMETMYEPPEDNGIEEVLAVDEAVSVVLKKFEDVFTW 60
            +V L+++IKS  D DQ +L++   +E      E  G++  L   E +  VL++F  VF  
Sbjct: 777  LVTLRSLIKSVCDGDQSYLVKLETLE------EQVGVDSNLP--EKLQAVLEEFGPVFEI 836

Query: 61   PETLPPRRSIEHHIYLKQGTDPVNVRPYRYGYQQKAEMERLVEEMLSSGVIRPSNSPYSS 120
            P  LPP R  EH I LK+GT PV+VRPYRY +  K E+E+LV++ML +G++RPS SP+SS
Sbjct: 837  PTELPPERGREHPINLKEGTGPVSVRPYRYPHAHKEEIEKLVKDMLKAGIVRPSQSPFSS 896

Query: 121  PVLLVRKKDGSWRFCVDYRVLNSVTIPDKFPIPVIEELFDELNGARWFSKIDLKAGYHQI 180
            PVLLV+KKDGSWRFC+DYR LN VT+ DKFPIP+I++L DEL+GAR FSK+DL++GYHQI
Sbjct: 897  PVLLVKKKDGSWRFCIDYRALNKVTVLDKFPIPMIDQLLDELHGARVFSKLDLRSGYHQI 956

Query: 181  RMASGDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQSLMNTVFKPYLRKFILVFFDDILI 240
            RM + DI KTAFRTH+GHYEFLVMPFGLTNAP+TFQ+LMN +F+PYLRKF+LVFFDDIL+
Sbjct: 957  RMKTEDIPKTAFRTHDGHYEFLVMPFGLTNAPATFQALMNEIFRPYLRKFVLVFFDDILV 1016

Query: 241  YSKNLEVHLTHLGLALEILRRNELYANRKKCSFAQERVDYLGHIISAQGVEVDPEKIRAI 300
            YS +L+ H THL   L +L++++LYAN+KKC F ++++DYLGHIIS +GV  DP K  A+
Sbjct: 1017 YSCSLQDHATHLQTVLAVLQKHKLYANKKKCEFGRQQIDYLGHIISQEGVSTDPAKTAAM 1076

Query: 301  KEWPTPTNIREVRGFLGLTGYYRKFVQHYGSMAAPLTQLVKKGGFNWTDNSEEAFQRLQQ 360
            ++WPTP+N++E+RGFLGLTGYYR+FVQ+YG++A PLT L+KK GFNW++++  AF++L+Q
Sbjct: 1077 QKWPTPSNVKELRGFLGLTGYYRRFVQNYGTIARPLTDLLKKDGFNWSEDASSAFRKLKQ 1136

Query: 361  AMMTLPVLALPDFSSTFELETDASGYGIGAVLMQAKKPIAYFSHTLAVRDRVKPVYEREL 420
            AM + PVL LPDF   F +ETDASG+GIGAVLMQ  +PIA+FS  L+ R+R+KPVYEREL
Sbjct: 1137 AMTSAPVLGLPDFREDFVVETDASGFGIGAVLMQKHRPIAFFSQALSERERLKPVYEREL 1196

Query: 421  MAVVMAVQRWRPYLLGKPFIVRTDQKSLKFLLEQRVIQPQYQKWVAKLLGYSFEVQYKPG 480
            MAVV+++QRWR YLLG+ F+V TDQK+LKFLLEQR +  +YQ+W+ KLLGY F++ Y+PG
Sbjct: 1197 MAVVLSIQRWRHYLLGRSFLVCTDQKALKFLLEQREVSMEYQRWLTKLLGYDFQIVYRPG 1256

Query: 481  LENKAADALSRVPSAVQFS------SLTAPALIDLLVVKKEVEEDTRLRKVWDELQSGEE 540
            +ENKAAD LSR+P            ++T P  I L+ V+KE+ ED+ L+++  +L+ GE 
Sbjct: 1257 VENKAADGLSRMPHNTILEPTCMGLAITIPRNIQLVEVEKEIGEDSDLKEIVSKLKEGET 1316

Query: 541  NTERKFSIRHGMLRYKDRLVLSQSSALIPAILYTYHDSVIGGHSGFLRTYKRITGELYWA 600
                K+ +  GMLRYK+RLV+S+ S+ IP IL  +HDS +GGHSG LRT KRI    +W 
Sbjct: 1317 KV-GKYHLLQGMLRYKNRLVVSKHSSFIPTILAEFHDSKMGGHSGVLRTLKRIQELFHWV 1376

Query: 601  GMKTDIKRYCDECLICQKNKSLALTPAGLLLPLEVPTNIWSDISMDFIEGLPKSCGFETI 660
            GMK DIK+Y  EC +CQ  K   L PAGLL PL +P +IW DISMDFIEGLP+S G+  +
Sbjct: 1377 GMKADIKKYVAECAVCQSQKYSTLAPAGLLQPLPIPEHIWEDISMDFIEGLPRSAGYNVV 1436

Query: 661  FVVVDRFSKYGHFLALKHPFTAKTVAEVFVKEIVRLHGFPKSIVSDRDKVFVSSFWKGMF 720
             VVVDR SKY HF+ALKHPFTA  VA+VFV+E+VRLHGFPKSIVSDRDKVF+S+FW  +F
Sbjct: 1437 LVVVDRLSKYAHFIALKHPFTAMVVAKVFVQEVVRLHGFPKSIVSDRDKVFLSNFWSELF 1496

Query: 721  KLAGTKLNRSTAYHPQTDGQTEVVNRSVETYLRCFCSEKPKEWAKWLHWAEYWYNTTFHR 780
            ++AGTKL  STAYHPQTDGQTEV+NR +ETYLRC+ ++ P++W ++L WAE+WYNT+FH 
Sbjct: 1497 RIAGTKLKFSTAYHPQTDGQTEVLNRCLETYLRCYANDHPRKWIQFLSWAEFWYNTSFHT 1556

Query: 781  SLGITPFQAVYGRTPPPLLYYGDQSTSNFLLDEQLKARDEVLEVLKEHLRVAQDKMKKTA 840
            +L  TPFQ VYGR PP LL Y + STSNF L++ L+ RD ++  +K+ L+ AQ +MK +A
Sbjct: 1557 ALQSTPFQIVYGREPPTLLKYEEGSTSNFELEKALRERDRMILEIKQKLQAAQQRMKVSA 1616

Query: 841  DLKRRDVEYKVGDMVFLKIRPYRQSSLRKKKNEKLSPKFFGPFEVTERIGLVAYKLQLPK 900
            D  RRD+   VG+ V+LKIRPYRQ++L  + N+KL+ +++GPF++  R+G VAYKL+LPK
Sbjct: 1617 DKGRRDLTLTVGEWVYLKIRPYRQNTLAARSNQKLAARYYGPFQIESRMGEVAYKLKLPK 1676

Query: 901  SSCIHPVFHVSQLRKMVGNHTMLKPEEM-ACLNENYEWLAIPEEIYGYSKNKEGMWEVLI 960
               IHPVFH+SQL+K +G +  ++P ++   L  + E    P++I      KEG  EVL+
Sbjct: 1677 GCNIHPVFHISQLKKALGGN--IQPNQLPRQLTRDLELQVQPKDIKDSRYTKEGRLEVLV 1736

Query: 961  KWQGLPPQDASWEEYEEFQKKFPNFHLEDKVHLERECNDRPPIIHQYSRRKKKQ 1008
            +WQ LP  +++WE  E+F K+FP+F LEDK+  +    D+   ++   R++ K+
Sbjct: 1737 EWQDLPEHESTWEVAEDFNKQFPSFQLEDKLRQKGGSIDKYFRVYVRGRKRGKE 1779

BLAST of CSPI04G27540 vs. NCBI nr
Match: gi|731338584|ref|XP_010680400.1| (PREDICTED: transposon Tf2-1 polyprotein isoform X1 [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 1129.4 bits (2920), Expect = 0.0e+00
Identity = 553/1016 (54.43%), Postives = 739/1016 (72.74%), Query Frame = 1

Query: 2    VGLKNMIKSWRDSDQGFLIECRAMETMYEPPEDNGIEEVLAVDEAVSVVLKKFEDVFTWP 61
            + LK M+++ R   QG L+E   +E   EPP    IE  + V   +  +L ++  VF  P
Sbjct: 561  ISLKAMLRALRIEGQGVLVEMNHIEREKEPPGKWDIE--VEVPRPLQPLLNQYSQVFNMP 620

Query: 62   ETLPPRRSIEHHIYLKQGTDPVNVRPYRYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSP 121
              LPP R  EH I LK+G++PV+VRPYRY + QK E+ERLV++ML++G+I+PS SP+SSP
Sbjct: 621  SGLPPSRGREHSITLKEGSNPVSVRPYRYPHVQKGEIERLVKDMLAAGIIQPSTSPFSSP 680

Query: 122  VLLVRKKDGSWRFCVDYRVLNSVTIPDKFPIPVIEELFDELNGARWFSKIDLKAGYHQIR 181
            VLLV+KKDGSWRFCVDYR LN  T+PDK+PIPVI+EL DEL G+  FSK+DLK+GYHQIR
Sbjct: 681  VLLVKKKDGSWRFCVDYRALNKETVPDKYPIPVIDELLDELYGSVVFSKLDLKSGYHQIR 740

Query: 182  MASGDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQSLMNTVFKPYLRKFILVFFDDILIY 241
            +   DI KTAFRTHEGHYEFLVMPFGLTNAP+TFQSLMN VF+P+LRKF+LVFFDDIL+Y
Sbjct: 741  VRKEDIHKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFRPFLRKFVLVFFDDILVY 800

Query: 242  SKNLEVHLTHLGLALEILRRNELYANRKKCSFAQERVDYLGHIISAQGVEVDPEKIRAIK 301
            S + E H  HL   L IL  N LYAN +KC F +++V YLGH+ISAQGV  D +KI+A+ 
Sbjct: 801  SPDEETHFHHLEQVLHILAENSLYANLEKCEFGRQQVAYLGHVISAQGVAADMDKIKAMV 860

Query: 302  EWPTPTNIREVRGFLGLTGYYRKFVQHYGSMAAPLTQLVKKGGFNWTDNSEEAFQRLQQA 361
            EWP P  IRE+RGFLGLTGYYRKF+ +Y  +A+PLT  ++K  + WT  + +AF+ L++A
Sbjct: 861  EWPLPKTIRELRGFLGLTGYYRKFIANYAKVASPLTDQLRKDSYAWTPAATQAFEALKKA 920

Query: 362  MMTLPVLALPDFSSTFELETDASGYGIGAVLMQAKKPIAYFSHTLAVRDRVKPVYERELM 421
            M+  PVLA+PDFS  F +E DASG+G+GAVLMQ  +PIA++SH L  R R+K +YE+ELM
Sbjct: 921  MVAAPVLAMPDFSQQFVIEADASGFGLGAVLMQNNRPIAFYSHILGPRGRLKSIYEKELM 980

Query: 422  AVVMAVQRWRPYLLGKPFIVRTDQKSLKFLLEQRVIQPQYQKWVAKLLGYSFEVQYKPGL 481
            A+VMAVQ+WR YLLG+ F++RTDQKSLKF++EQR +  +YQ+WV+KL+G+ FE+ YKPG+
Sbjct: 981  AIVMAVQKWRHYLLGRRFVIRTDQKSLKFIMEQREVGAEYQRWVSKLMGFEFEIHYKPGI 1040

Query: 482  ENKAADALSRV-PSAVQFSSLTAPALIDLLVVKKEVEEDTRLRKVWDELQSGEENTERKF 541
             N+ ADALSR  P+  +  +L + +   L  V+ +++ D  ++++  ELQ G+      F
Sbjct: 1041 ANRVADALSRQNPAQTELKALLSSSGPSLEAVQNQLKADPYIQQIMAELQ-GDGPPMEGF 1100

Query: 542  SIRHGMLRYKDRLVLSQSSALIPAILYTYHDSVIGGHSGFLRTYKRITGELYWAGMKTDI 601
            S+ +G++ YK R+VL   S L   +L  YHDS  GGHSG L+TY R+  E YW GM+ ++
Sbjct: 1101 SVENGLVMYKGRIVLPPKSPLTHELLKFYHDSPNGGHSGDLKTYLRMASEWYWVGMRKNV 1160

Query: 602  KRYCDECLICQKNKSLALTPAGLLLPLEVPTNIWSDISMDFIEGLPKSCGFETIFVVVDR 661
             +Y  +C ICQ+NK+    PAGLL PL  P  +W DI+MDF+EGLP S G +TI VVVDR
Sbjct: 1161 AQYVKDCQICQQNKTSTQNPAGLLQPLPPPNQVWEDITMDFVEGLPPSRGVDTILVVVDR 1220

Query: 662  FSKYGHFLALKHPFTAKTVAEVFVKEIVRLHGFPKSIVSDRDKVFVSSFWKGMFKLAGTK 721
            F+K+ HFL LKHPFTA TVA  F+KEIVRLHGFP SI+SDRD+VF+S FWK +F+L GTK
Sbjct: 1221 FTKFAHFLGLKHPFTAATVAGTFIKEIVRLHGFPASIISDRDRVFMSLFWKELFRLQGTK 1280

Query: 722  LNRSTAYHPQTDGQTEVVNRSVETYLRCFCSEKPKEWAKWLHWAEYWYNTTFHRSLGITP 781
            L RSTAYHPQTDGQ+E VN+++ETYLRCF + +P++WA WL W E+WYNT+ H S  +TP
Sbjct: 1281 LKRSTAYHPQTDGQSENVNKALETYLRCFVNGQPRKWAGWLPWVEFWYNTSPHVSTKMTP 1340

Query: 782  FQAVYGRTPPPLLYYGDQSTSNFLLDEQLKARDEVLEVLKEHLRVAQDKMKKTADLKRRD 841
            F+A+YGR PPPL+  G   T    LD  L+ RD VL+ L+ +L  AQ KMK  AD +RRD
Sbjct: 1341 FKALYGRDPPPLVRTGHNQTPVDSLDSYLQERDAVLDDLRVNLLRAQQKMKFWADKRRRD 1400

Query: 842  VEYKVGDMVFLKIRPYRQSSLRKKKNEKLSPKFFGPFEVTERIGLVAYKLQLPKSSCIHP 901
            +  +VG  V+LK++PYRQ SL ++  EKL+ +++GP++V ERIG VAY+L LP +S IHP
Sbjct: 1401 ILLEVGSFVYLKLQPYRQKSLARRPYEKLAARYYGPYQVLERIGAVAYRLDLPATSKIHP 1460

Query: 902  VFHVSQLRKMVGNHTMLKPEEM-ACLNENYEWLAIPEEI----YGYSKNKEGMWEVLIKW 961
            VFHVSQL+   GN  + +P ++   L ++ E +  PE +    YG   +K+ + EVLIKW
Sbjct: 1461 VFHVSQLKPAAGN--IHQPSQLPEQLTQDLELIVEPEALLDVRYGAPGHKKPL-EVLIKW 1520

Query: 962  QGLPPQDASWEEYEEFQKKFPNFHLEDKVHLERECN----DRPPIIHQYSRRKKKQ 1008
            + LP  +A+WE+     ++FP FHLEDKV+L    N     +PP+   Y+RR+K++
Sbjct: 1521 KHLPETEATWEDLTAMVQRFPTFHLEDKVNLWAAGNVMMAPKPPLKFVYARRQKRR 1570

BLAST of CSPI04G27540 vs. NCBI nr
Match: gi|261865347|gb|ACY01928.1| (hypothetical protein [Beta vulgaris])

HSP 1 Score: 1120.5 bits (2897), Expect = 0.0e+00
Identity = 555/1015 (54.68%), Postives = 720/1015 (70.94%), Query Frame = 1

Query: 2    VGLKNMIKSWRDSDQGFLIECRAMETMYEPPEDNGIEEVLAVDEAVSVVLKKFEDVFTWP 61
            V LK M ++ R    GFL++   M +    P      E+  V   +  +L  ++ VF  P
Sbjct: 548  VSLKAMYRTLRKEGGGFLVDLNQMASHEGLPR-----ELPEVPSCLQPLLSSYQQVFNMP 607

Query: 62   ETLPPRRSIEHHIYLKQGTDPVNVRPYRYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSP 121
              LPP R   H I L+ GT+PV+VRPYRY   QK E+E+L+ +ML++G+I+ S+S +SSP
Sbjct: 608  LGLPPDRGHVHAINLQHGTNPVSVRPYRYPQSQKDEIEQLIHDMLAAGIIQQSHSAFSSP 667

Query: 122  VLLVRKKDGSWRFCVDYRVLNSVTIPDKFPIPVIEELFDELNGARWFSKIDLKAGYHQIR 181
            VLLV+KKDGSWRFCVDYR LN+VT+PDK+PIP+I+EL DEL+GA  FSK+DLK+GYHQI+
Sbjct: 668  VLLVKKKDGSWRFCVDYRALNNVTVPDKYPIPIIDELLDELHGACVFSKLDLKSGYHQIK 727

Query: 182  MASGDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQSLMNTVFKPYLRKFILVFFDDILIY 241
            M   D+ KTAFRTHEGHYEFLVMPFGLTNAP+TFQ+LMN VFKPYLRKF+LVFFDDIL+Y
Sbjct: 728  MKPSDVHKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNEVFKPYLRKFVLVFFDDILVY 787

Query: 242  SKNLEVHLTHLGLALEILRRNELYANRKKCSFAQERVDYLGHIISAQGVEVDPEKIRAIK 301
            S +LE H+ HL + L +L  N L+AN KKC F +E V YLGHIIS++GV +DP K++A+ 
Sbjct: 788  STSLEQHMHHLNVVLGLLATNHLFANLKKCEFGKEEVAYLGHIISSKGVAMDPSKVQAMM 847

Query: 302  EWPTPTNIREVRGFLGLTGYYRKFVQHYGSMAAPLTQLVKKGGFNWTDNSEEAFQRLQQA 361
            +W  P+ +RE+RGFLGLTGYYR+FV+ Y S+A PLT  +KK  F W+  +  AF+ L++A
Sbjct: 848  DWSIPSTLRELRGFLGLTGYYRRFVKGYASIAHPLTNQLKKDSFGWSPAATRAFETLKRA 907

Query: 362  MMTLPVLALPDFSSTFELETDASGYGIGAVLMQAKKPIAYFSHTLAVRDRVKPVYERELM 421
            +   PVL +P+FS  F +E DASGYG+GAVL+Q   PIAYFS TL  R R K +YE+ELM
Sbjct: 908  LTEAPVLQMPNFSLPFVIEADASGYGLGAVLLQQGHPIAYFSKTLGERARAKSIYEKELM 967

Query: 422  AVVMAVQRWRPYLLGKPFIVRTDQKSLKFLLEQRVIQPQYQKWVAKLLGYSFEVQYKPGL 481
            AVVMAVQ+W+ +LLG+ F++ +DQ+SL+ LL QR I P YQKWV KLLG+ FE++YKPG 
Sbjct: 968  AVVMAVQKWKHFLLGRHFVIHSDQQSLRHLLNQREIGPAYQKWVGKLLGFDFEIKYKPGG 1027

Query: 482  ENKAADALSRV-PSAVQFSSLTAPALIDLLVVKKEVEEDTRLRKVWDELQSGEENTERKF 541
             NK ADALSR  P   +++ LT+       ++ + + +D  L+ +  E+ +G    +  F
Sbjct: 1028 HNKVADALSRKHPPEAEYNLLTSSHSPHQELIAQAIRQDADLQHLMAEVTAGRTPLQ-GF 1087

Query: 542  SIRHGMLRYKDRLVLSQSSALIPAILYTYHDSVIGGHSGFLRTYKRITGELYWAGMKTDI 601
            ++ HG+L+Y  RLV+ ++  L   +L  YH S +GGHSG  +TYKR+ GE YW GMK D+
Sbjct: 1088 TVEHGLLKYNGRLVIPKNVPLTTTLLEEYHSSPMGGHSGIFKTYKRLAGEWYWKGMKKDV 1147

Query: 602  KRYCDECLICQKNKSLALTPAGLLLPLEVPTNIWSDISMDFIEGLPKSCGFETIFVVVDR 661
              +   C ICQ+ K+  L+PAGLL PL +P  IW DISMDF+EGLPKS G++TI VVVDR
Sbjct: 1148 TTFVQNCQICQQFKTSTLSPAGLLQPLPIPLAIWEDISMDFVEGLPKSQGWDTILVVVDR 1207

Query: 662  FSKYGHFLALKHPFTAKTVAEVFVKEIVRLHGFPKSIVSDRDKVFVSSFWKGMFKLAGTK 721
             SKY HF+ LKHPFTA TVA VF+KEIV+LHGFP +IVSDRDKVF+S FWK +FKL GT 
Sbjct: 1208 LSKYAHFITLKHPFTAPTVAAVFIKEIVKLHGFPSTIVSDRDKVFMSLFWKELFKLQGTL 1267

Query: 722  LNRSTAYHPQTDGQTEVVNRSVETYLRCFCSEKPKEWAKWLHWAEYWYNTTFHRSLGITP 781
            L+RSTAYHPQ+DGQTEVVN+S+E YLRCFC+ +PK WA+W+ WAEYWYNT+ H S   TP
Sbjct: 1268 LHRSTAYHPQSDGQTEVVNKSLEAYLRCFCNGRPKAWAQWISWAEYWYNTSTHSSSHFTP 1327

Query: 782  FQAVYGRTPPPLLYYGDQSTSNFLLDEQLKARDEVLEVLKEHLRVAQDKMKKTADLKRRD 841
            F+ VYGR  PPL  +   ST+ F L+EQL  RD  L+ LK HL  AQ+ MK   D  RR 
Sbjct: 1328 FKIVYGRDSPPLFRFEKGSTAIFSLEEQLLDRDATLDELKFHLLEAQNSMKIQEDKHRRA 1387

Query: 842  VEYKVGDMVFLKIRPYRQSSLRKKKNEKLSPKFFGPFEVTERIGLVAYKLQLPKSSCIHP 901
            V ++ G MV+LKI+PYR  SL KK+NEKL+P+F+GPF V +RIG VAY+LQLP  + +HP
Sbjct: 1388 VHFEPGAMVYLKIQPYRHQSLAKKRNEKLAPRFYGPFSVLKRIGQVAYQLQLPLGAKLHP 1447

Query: 902  VFHVSQLRKMVGNHTMLKPEEMACLNENYEWLAIPE---EIYGYSKNKEGMWEVLIKWQG 961
            VFH+SQL+K VG+     P     L  +    A PE    I  + +    + EVLIKW  
Sbjct: 1448 VFHISQLKKAVGS-LQSSPTIPPQLTNDLVLDAQPESLLNIRSHPQKPAEVTEVLIKWLN 1507

Query: 962  LPPQDASWEEYEEFQKKFPNFHLEDKV------HLERECNDRPPIIHQYSRRKKK 1007
            LP  +A+WE+   F  +FP+FHLEDKV        +      PPI+H YSRR+KK
Sbjct: 1508 LPAFEATWEDAALFNARFPDFHLEDKVLNWEGSIAKSPTRIIPPIVHTYSRRRKK 1555

BLAST of CSPI04G27540 vs. NCBI nr
Match: gi|208609051|dbj|BAG72148.1| (hypothetical protein [Lotus japonicus])

HSP 1 Score: 1118.6 bits (2892), Expect = 0.0e+00
Identity = 536/970 (55.26%), Postives = 711/970 (73.30%), Query Frame = 1

Query: 43   VDEAVSVVLKKFEDVFTWPETLPPRRSIEHHIYLKQGTDPVNVRPYRYGYQQKAEMERLV 102
            V E +  +L+++ +VF  P+ LPPRR+ +H I L++G    N+RPYRY + QK E+E+LV
Sbjct: 586  VPEGMRKILEEYPEVFQEPKGLPPRRTTDHAIQLQEGASIPNIRPYRYPFYQKNEIEKLV 645

Query: 103  EEMLSSGVIRPSNSPYSSPVLLVRKKDGSWRFCVDYRVLNSVTIPDKFPIPVIEELFDEL 162
            +EML+SG+IR S SP+SSP +LV+KKDG WRFCVDYR LN  TIPDKFPIP+I+EL DE+
Sbjct: 646  KEMLNSGIIRHSTSPFSSPAILVKKKDGGWRFCVDYRALNKATIPDKFPIPIIDELLDEI 705

Query: 163  NGARWFSKIDLKAGYHQIRMASGDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQSLMNTV 222
              A  FSK+DLK+GYHQIRM   DI KTAFRTHEGHYE+LV+PFGLTNAPSTFQ+LMN V
Sbjct: 706  GAAVVFSKLDLKSGYHQIRMKEEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQV 765

Query: 223  FKPYLRKFILVFFDDILIYSKNLEVHLTHLGLALEILRRNELYANRKKCSFAQERVDYLG 282
             +PYLRKF+LVFFDDILIYSKN E+H  HL + L++L+ N L AN+KKCSF Q  + YLG
Sbjct: 766  LRPYLRKFVLVFFDDILIYSKNEELHKDHLRIVLQVLKENNLVANQKKCSFGQPEIIYLG 825

Query: 283  HIISAQGVEVDPEKIRAIKEWPTPTNIREVRGFLGLTGYYRKFVQHYGSMAAPLTQLVKK 342
            H+IS  GV  DP KI+ + +WP P  ++ +RGFLGLTGYYR+FV++Y  +A PL QL+KK
Sbjct: 826  HVISQAGVAADPSKIKDMLDWPIPKEVKGLRGFLGLTGYYRRFVKNYSKLAQPLNQLLKK 885

Query: 343  GGFNWTDNSEEAFQRLQQAMMTLPVLALPDFSSTFELETDASGYGIGAVLMQAKKPIAYF 402
              F WT+ + +AF +L++ M T+PVL  P+F   F LETDASG G+GAVLMQ  +P+AY 
Sbjct: 886  NSFQWTEGATQAFVKLKEVMTTVPVLVPPNFDKPFILETDASGKGLGAVLMQEGRPVAYM 945

Query: 403  SHTLAVRDRVKPVYERELMAVVMAVQRWRPYLLGKPFIVRTDQKSLKFLLEQRVIQPQYQ 462
            S TL+ R + K VYERELMAVV+AVQ+WR YLLG  F++ TDQ+SL+FL +QR++  + Q
Sbjct: 946  SKTLSDRAQAKSVYERELMAVVLAVQKWRHYLLGSKFVIHTDQRSLRFLADQRIMGEEQQ 1005

Query: 463  KWVAKLLGYSFEVQYKPGLENKAADALSRVPSAVQFSSLTAPALIDLLVVKKEVEEDTRL 522
            KW++KL+GY FE++YKPG+ENKAADALSR    +QFS++++    +   ++ E+ ED R 
Sbjct: 1006 KWMSKLMGYDFEIKYKPGIENKAADALSR---KLQFSAISSVQCAEWADLEAEILEDERY 1065

Query: 523  RKVWDELQSGEENTERKFSIRHGMLRYKDRLVLSQSSALIPAILYTYHDSVIGGHSGFLR 582
            RKV  EL + + N+   + ++ G L YKDR+VL + S  I  +L  +HD+ IGGH+G  R
Sbjct: 1066 RKVLQELAT-QGNSAVGYQLKRGRLLYKDRIVLPKGSTKILTVLKEFHDTAIGGHAGIFR 1125

Query: 583  TYKRITGELYWAGMKTDIKRYCDECLICQKNKSLALTPAGLLLPLEVPTNIWSDISMDFI 642
            TYKRI+   YW GMK DI+ Y  +C +CQ+NK  AL PAG L PL +P+  W+DISMDFI
Sbjct: 1126 TYKRISALFYWEGMKLDIQNYVQKCEVCQRNKYEALNPAGFLQPLPIPSQGWTDISMDFI 1185

Query: 643  EGLPKSCGFETIFVVVDRFSKYGHFLALKHPFTAKTVAEVFVKEIVRLHGFPKSIVSDRD 702
             GLPK+ G +TI VVVDRF+KY HF+AL HP+ AK +AEVF+KE+VRLHGFP SIVSDRD
Sbjct: 1186 GGLPKAMGKDTILVVVDRFTKYAHFIALSHPYNAKEIAEVFIKEVVRLHGFPTSIVSDRD 1245

Query: 703  KVFVSSFWKGMFKLAGTKLNRSTAYHPQTDGQTEVVNRSVETYLRCFCSEKPKEWAKWLH 762
            +VF+S+FW  MFKLAGTKL  S+AYHPQTDGQTEVVNR VETYLRC    KPK+W KWL 
Sbjct: 1246 RVFLSTFWSEMFKLAGTKLKFSSAYHPQTDGQTEVVNRCVETYLRCVTGSKPKQWPKWLS 1305

Query: 763  WAEYWYNTTFHRSLGITPFQAVYGRTPPPLLYYGDQSTSNFLLDEQLKARDEVLEVLKEH 822
            WAE+WYNT +H ++  TPF+A+YGR PP +    D  TS   +++    R+ +LE LK +
Sbjct: 1306 WAEFWYNTNYHSAIKTTPFKALYGREPPVIFKGNDSLTSVDEVEKLTAERNLILEELKSN 1365

Query: 823  LRVAQDKMKKTADLKRRDVEYKVGDMVFLKIRPYRQSSLRKKKNEKLSPKFFGPFEVTER 882
            L  AQ++M++ A+  RRDV+Y+VGD+V+LKI+PY+  SL K+ N+KLSP+++GP+ +  +
Sbjct: 1366 LEKAQNRMRQQANKHRRDVQYEVGDLVYLKIQPYKLKSLAKRSNQKLSPRYYGPYPIIAK 1425

Query: 883  IGLVAYKLQLPKSSCIHPVFHVSQLRKMVGNHTMLKPEEMACLNENYEWLAIPEEIYGYS 942
            I   AYKLQLP+ S +HPVFH+S L+K V      +P   A L E +E    PE I    
Sbjct: 1426 INPAAYKLQLPEGSQVHPVFHISLLKKAVNAGVQSQPLP-AALTEEWELKVEPEAIMDTR 1485

Query: 943  KNKEGMWEVLIKWQGLPPQDASWEEYEEFQKKFPNFHLEDKVHLE-----RECNDRPPII 1002
            +N++G  EVLI+W+ LP  + SWE++ +   +FPN  LEDK++L+        + RP   
Sbjct: 1486 ENRDGDLEVLIRWKDLPTFEDSWEDFSKLLDQFPNHQLEDKLNLQGGRDVANPSSRPRFG 1545

Query: 1003 HQYSRRKKKQ 1008
            + Y+RR K Q
Sbjct: 1546 NVYARRPKPQ 1550

BLAST of CSPI04G27540 vs. NCBI nr
Match: gi|208609055|dbj|BAG72150.1| (hypothetical protein [Lotus japonicus])

HSP 1 Score: 1117.8 bits (2890), Expect = 0.0e+00
Identity = 535/970 (55.15%), Postives = 711/970 (73.30%), Query Frame = 1

Query: 43   VDEAVSVVLKKFEDVFTWPETLPPRRSIEHHIYLKQGTDPVNVRPYRYGYQQKAEMERLV 102
            V E +  +L+++ +VF  P+ LPPRR+ +H I L++G    N+RPYRY + QK E+E+LV
Sbjct: 586  VPEGMRKILEEYPEVFQEPKGLPPRRTTDHAIQLQEGASIPNIRPYRYPFYQKNEIEKLV 645

Query: 103  EEMLSSGVIRPSNSPYSSPVLLVRKKDGSWRFCVDYRVLNSVTIPDKFPIPVIEELFDEL 162
            +EML+SG+IR S SP+SSP +LV+KKDG WRFCVDYR LN  TIPDKFPIP+I+EL DE+
Sbjct: 646  KEMLNSGIIRHSTSPFSSPAILVKKKDGGWRFCVDYRALNKATIPDKFPIPIIDELLDEI 705

Query: 163  NGARWFSKIDLKAGYHQIRMASGDIEKTAFRTHEGHYEFLVMPFGLTNAPSTFQSLMNTV 222
              A  FSK+DLK+GYHQIRM   DI KTAFRTHEGHYE+LV+PFGLTNAPSTFQ+LMN V
Sbjct: 706  GAAVVFSKLDLKSGYHQIRMKEEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQV 765

Query: 223  FKPYLRKFILVFFDDILIYSKNLEVHLTHLGLALEILRRNELYANRKKCSFAQERVDYLG 282
             +PYLRKF+LVFFDDILIYSKN E+H  HL + L++L+ N L AN+KKCSF Q  + YLG
Sbjct: 766  LRPYLRKFVLVFFDDILIYSKNEELHKDHLRIVLQVLKENNLVANQKKCSFGQPEIIYLG 825

Query: 283  HIISAQGVEVDPEKIRAIKEWPTPTNIREVRGFLGLTGYYRKFVQHYGSMAAPLTQLVKK 342
            H+IS  GV  DP KI+ + +WP P  ++ +RGFLGLTGYYR+FV++Y  +A PL QL+KK
Sbjct: 826  HVISQAGVAADPSKIKDMLDWPIPKEVKGLRGFLGLTGYYRRFVKNYSKLAQPLNQLLKK 885

Query: 343  GGFNWTDNSEEAFQRLQQAMMTLPVLALPDFSSTFELETDASGYGIGAVLMQAKKPIAYF 402
              F WT+ + +AF +L++ M T+PVL  P+F   F LETDASG G+GAVLMQ  +P+AY 
Sbjct: 886  NSFQWTEGATQAFVKLKEVMTTVPVLVPPNFDKPFILETDASGKGLGAVLMQEGRPVAYM 945

Query: 403  SHTLAVRDRVKPVYERELMAVVMAVQRWRPYLLGKPFIVRTDQKSLKFLLEQRVIQPQYQ 462
            S TL+ R + K VYERELMAVV+AVQ+WR YLLG  F++ TDQ+SL+FL +QR++  + Q
Sbjct: 946  SKTLSDRAQAKSVYERELMAVVLAVQKWRHYLLGSKFVIHTDQRSLRFLADQRIMGEEQQ 1005

Query: 463  KWVAKLLGYSFEVQYKPGLENKAADALSRVPSAVQFSSLTAPALIDLLVVKKEVEEDTRL 522
            KW++KL+GY FE++YKPG+ENKAADALSR    +QFS++++    +   ++ E+ ED R 
Sbjct: 1006 KWMSKLMGYDFEIKYKPGIENKAADALSR---KLQFSAISSVQCAEWADLEAEILEDERY 1065

Query: 523  RKVWDELQSGEENTERKFSIRHGMLRYKDRLVLSQSSALIPAILYTYHDSVIGGHSGFLR 582
            RKV  EL + + N+   + ++ G L YKDR+VL + S  I  +L  +HD+ +GGH+G  R
Sbjct: 1066 RKVLQELAT-QGNSAVGYQLKRGRLLYKDRIVLPKGSTKILTVLKEFHDTALGGHAGIFR 1125

Query: 583  TYKRITGELYWAGMKTDIKRYCDECLICQKNKSLALTPAGLLLPLEVPTNIWSDISMDFI 642
            TYKRI+   YW GMK DI+ Y  +C +CQ+NK  AL PAG L PL +P+  W+DISMDFI
Sbjct: 1126 TYKRISALFYWEGMKLDIQNYVQKCEVCQRNKYEALNPAGFLQPLPIPSQGWTDISMDFI 1185

Query: 643  EGLPKSCGFETIFVVVDRFSKYGHFLALKHPFTAKTVAEVFVKEIVRLHGFPKSIVSDRD 702
             GLPK+ G +TI VVVDRF+KY HF+AL HP+ AK +AEVF+KE+VRLHGFP SIVSDRD
Sbjct: 1186 GGLPKAMGKDTILVVVDRFTKYAHFIALSHPYNAKEIAEVFIKEVVRLHGFPTSIVSDRD 1245

Query: 703  KVFVSSFWKGMFKLAGTKLNRSTAYHPQTDGQTEVVNRSVETYLRCFCSEKPKEWAKWLH 762
            +VF+S+FW  MFKLAGTKL  S+AYHPQTDGQTEVVNR VETYLRC    KPK+W KWL 
Sbjct: 1246 RVFLSTFWSEMFKLAGTKLKFSSAYHPQTDGQTEVVNRCVETYLRCVTGSKPKQWPKWLS 1305

Query: 763  WAEYWYNTTFHRSLGITPFQAVYGRTPPPLLYYGDQSTSNFLLDEQLKARDEVLEVLKEH 822
            WAE+WYNT +H ++  TPF+A+YGR PP +    D  TS   +++    R+ +LE LK +
Sbjct: 1306 WAEFWYNTNYHSAIKTTPFKALYGREPPVIFKGNDSLTSVDEVEKLTAERNLILEELKSN 1365

Query: 823  LRVAQDKMKKTADLKRRDVEYKVGDMVFLKIRPYRQSSLRKKKNEKLSPKFFGPFEVTER 882
            L  AQ++M++ A+  RRDV+Y+VGD+V+LKI+PY+  SL K+ N+KLSP+++GP+ +  +
Sbjct: 1366 LEKAQNRMRQQANKHRRDVQYEVGDLVYLKIQPYKLKSLAKRSNQKLSPRYYGPYPIIAK 1425

Query: 883  IGLVAYKLQLPKSSCIHPVFHVSQLRKMVGNHTMLKPEEMACLNENYEWLAIPEEIYGYS 942
            I   AYKLQLP+ S +HPVFH+S L+K V      +P   A L E +E    PE I    
Sbjct: 1426 INPAAYKLQLPEGSQVHPVFHISLLKKAVNAGVQSQPLP-AALTEEWELKVEPEAIMDTR 1485

Query: 943  KNKEGMWEVLIKWQGLPPQDASWEEYEEFQKKFPNFHLEDKVHLE-----RECNDRPPII 1002
            +N++G  EVLI+W+ LP  + SWE++ +   +FPN  LEDK++L+        + RP   
Sbjct: 1486 ENRDGDLEVLIRWKDLPTFEDSWEDFSKLLDQFPNHQLEDKLNLQGGRDVANPSSRPRFG 1545

Query: 1003 HQYSRRKKKQ 1008
            + Y+RR K Q
Sbjct: 1546 NVYARRPKPQ 1550

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
YG31B_YEAST3.8e-13734.65Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
TF22_SCHPO1.9e-13633.37Transposon Tf2-2 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248... [more]
TF212_SCHPO1.9e-13633.37Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24... [more]
TF21_SCHPO1.9e-13633.37Transposon Tf2-1 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248... [more]
TF23_SCHPO1.9e-13633.37Transposon Tf2-3 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248... [more]
Match NameE-valueIdentityDescription
E2DMZ5_BETVU0.0e+0054.68Putative uncharacterized protein OS=Beta vulgaris PE=4 SV=1[more]
B5U9V8_LOTJA0.0e+0055.26Putative uncharacterized protein OS=Lotus japonicus PE=4 SV=1[more]
B5U9W1_LOTJA0.0e+0055.15Putative uncharacterized protein OS=Lotus japonicus PE=4 SV=1[more]
B5U9W0_LOTJA0.0e+0055.15Putative uncharacterized protein OS=Lotus japonicus PE=4 SV=1[more]
B5U9W2_LOTJA0.0e+0054.67Putative uncharacterized protein OS=Lotus japonicus PE=4 SV=1[more]
Match NameE-valueIdentityDescription
ATMG00860.11.0e-3652.67ATMG00860.1 DNA/RNA polymerases superfamily protein[more]
ATMG00850.19.5e-0653.85ATMG00850.1 DNA/RNA polymerases superfamily protein[more]
Match NameE-valueIdentityDescription
gi|729344250|ref|XP_010541181.1|0.0e+0056.31PREDICTED: uncharacterized protein LOC104814705 [Tarenaya hassleriana][more]
gi|731338584|ref|XP_010680400.1|0.0e+0054.43PREDICTED: transposon Tf2-1 polyprotein isoform X1 [Beta vulgaris subsp. vulgari... [more]
gi|261865347|gb|ACY01928.1|0.0e+0054.68hypothetical protein [Beta vulgaris][more]
gi|208609051|dbj|BAG72148.1|0.0e+0055.26hypothetical protein [Lotus japonicus][more]
gi|208609055|dbj|BAG72150.1|0.0e+0055.15hypothetical protein [Lotus japonicus][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR000477RT_dom
IPR000953Chromo/chromo_shadow_dom
IPR001584Integrase_cat-core
IPR012337RNaseH-like_sf
IPR016197Chromo-like_dom_sf
IPR023780Chromo_domain
Vocabulary: Biological Process
TermDefinition
GO:0015074DNA integration
Vocabulary: Molecular Function
TermDefinition
GO:0003676nucleic acid binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
cellular_component GO:0005575 cellular_component
molecular_function GO:0003676 nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI04G27540.1CSPI04G27540.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000477Reverse transcriptase domainPFAMPF00078RVT_1coord: 125..285
score: 1.9
IPR000477Reverse transcriptase domainPROFILEPS50878RT_POLcoord: 106..285
score: 16
IPR000953Chromo/chromo shadow domainPROFILEPS50013CHROMO_2coord: 933..1008
score: 10
IPR001584Integrase, catalytic corePFAMPF00665rvecoord: 637..744
score: 3.4
IPR001584Integrase, catalytic corePROFILEPS50994INTEGRASEcoord: 626..788
score: 20
IPR012337Ribonuclease H-like domainGENE3DG3DSA:3.30.420.10coord: 637..788
score: 2.7
IPR012337Ribonuclease H-like domainunknownSSF53098Ribonuclease H-likecoord: 627..786
score: 1.26
IPR016197Chromo domain-likeunknownSSF54160Chromo domain-likecoord: 898..978
score: 9.03
IPR023780Chromo domainPFAMPF00385Chromocoord: 934..977
score: 1.
NoneNo IPR availableGENE3DG3DSA:2.40.50.40coord: 935..978
score: 1.
NoneNo IPR availableGENE3DG3DSA:3.10.10.10coord: 74..204
score: 1.3
NoneNo IPR availableGENE3DG3DSA:3.30.70.270coord: 205..286
score: 1.8
NoneNo IPR availablePANTHERPTHR24559FAMILY NOT NAMEDcoord: 118..978
score:
NoneNo IPR availablePANTHERPTHR24559:SF186SUBFAMILY NOT NAMEDcoord: 118..978
score:
NoneNo IPR availableunknownSSF56672DNA/RNA polymerasescoord: 51..478
score: 6.78E

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
CSPI04G27540Cucumber (Chinese Long) v2cpicuB181
CSPI04G27540Melon (DHL92) v3.5.1cpimeB311
CSPI04G27540Melon (DHL92) v3.6.1cpimedB302
CSPI04G27540Cucumber (Gy14) v2cgybcpiB169
CSPI04G27540Cucumber (Chinese Long) v3cpicucB212
CSPI04G27540Wax gourdcpiwgoB418