CSPI06G14740 (gene) Wild cucumber (PI 183967)

Name: CSPI06G14740
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: Chr6 : 12855080 .. 12859457 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTGTCATTTTTTTTTCGTGTTTAAAGCTCAACTTATTAACTTCCACTCAATTACTAATGACAACCTAATTAAAACAGATTTCATCTCCGCCGCCGCCGCCGCCGCCAACCACCCGTCGCCCATCTCCATCCAAGCGTCGATCCTCACCCGCACGTCAGTCGCGTCCTCTCCTCTATCACGCACGCTGTCCGCCGCCGCCAGTATCATTCGTGCCGCGGTCCTCCTGAGTTTCTCTTCGCACGTCAGTCGCGCCCTCTCCTCTATCACGCACGCTGTCCGCCGCCGCCAGTATCATTCGTGCCGCGGTCCTCCTGAGTTTCTCTTCGCACGTCAGTCGCGCCCTCTCCTCTATCACGCACGCTGTCCGCCGCCGCCAGTATCATTCGTGCCGCGGTCCTCCTGAGTTTCTCTTCGCACGTCAGTCGCGCCCTCTCCTCTATCACGCACGCTGTCCGCCGCCGCCAGTATCATTCGTGCCGCGGTCCTCCTGAGTTTCTTTTCGCCGTCTGTGGGTCTCGGGCTGCCGCAACTTCCCGTCGCCCATCTCCATCCACGAAGTGCAGTCGCAGCCTCGATCTGTCTCAGGTGTCCATTTCTGTCTCCGATCTGTCTCGAATACGCCGCTCCGATTTGTTTCCGCTCCGATCTGTCTCCATCCGCGAAGTGCAGTTGTGCTTCACCCTTGCAGTTCGCCGTCTCCGCTCCGATCTGTCTCAAGCTATTTTTTTTTTGTTTGAAAAATCATTGGGCATTGCTTCTCTATCTCAAGCCGTCCAGATCCGTCGTAATGAATATTTGGCAGTTCTTCAACCCATTTGGACTCAACTTGACCAAGCGAACATCAGCAAAGATCATCTTCGCCTTATTAAAGTCCTTATGGGATTACGTCCAGAATATGAATCTGTTAGAGCTGCTTTACTACACCGGAATCCCTTACCCTCATTAGATGCAGCTATTCAAGAAATTCTGTTTGAAGAAAAGCGTCTTGGCATCAACTCTACTAAACAATCTGATGTTGTCCTTGCTAGCACATACACTCCCAACAGAGTCGCAAATATGTTTTGTAAGAATTGTAAGCTCTCTGGTCACAAATTTAGTAACTGTCCTAAAATAGAGTGCAGGTACTGCCATAAACATGGCCACATTCTGGATAACTGCCCTATCAGACCACCCCGACCCCCTGGCACTTCCACAAAAGAGAAAATTTTTACCAAACATGGTTCCTCATCTGTTGTTGCTGCGACCTCGGATGATTCATCCCTCATTCAGATAAGTGATCTTCAGAGCTTATTGAATCAACTAATTTCATCATCCTCCGCTCTGGCTGTTTCATCAGGTAATCGATGGCTTCTTGATTCTGCCTGTTGTAATCATATGACCTCTGACGTTTCTCTTATGTCTACTTCTAGCCCTACAAAATCTTTACCTCCTATTTATGCTGCTGATGGTAATTGTATGAACATCTCTCATACTGGTACCATTGATACTCCCAGTGTACATCTTCCCCATACTTACTGTGTTCCTAACCTGACCTTTAATCTAGTGTCTGTTGGTCAATTATGTGATCTTGGCTTAAATGTTTCATTTTCTCCCAATGGTTGTCAGGTTCAGGATCCGCAGACGGGACAGACGATTGGAACGGGTCGCAAAGTGGGAAGATTGTTTGAGCTCACATCACTTCGGGTTTCATCTCCTTCTTCCATCTCTGCTTCGGTCACTGATTCTGACACATATCAGTGGCATCTTCGTCTTGGTCATGCTTCCTCTGAAAAACTTCGTCATTTAATTTCTGTTAACAATTTGACTAATCTTACTAAGTTTGTTCCTTTTAATTGTTTGAATTGCAAACTTGCTAAACAACCTGCCTTATCTTTTTCTCAATCCATCTCTAATTGTGATAAACCTTTTGATTTAGTGCATTCTGATATTTGGGGTCCTGCCCCAATTACTACTGTTCATGGTTATCGCTACTATGTTTTATTCATTGATGACTACTCT
CGATTTACATGGATTTACTTTCTAAAACATCGTTCTGAATTATCTCGCACATATATTGAGTTTGCTAACATGATTCGCACTCAATTTTCCTCTCCCATCAAAATTCTTCGCACTGATAATGTTTTGGAATATAAAGATTCCATCCTTCTTTCTTTTCTTTCCCAACAGGGCACTATTGTTCAGCGCTCTTGCCCTCATATCTCTCAACAAAATGGACGTGCTGAGCGCAAACATCGTCACATTCTTGACTCAGTACGTGCCCTCCTTCTTTCTGCCTCTTGTCCAGAAAAATTCTGGGGTGAAGCTGCCCTTACATCAGTATATACAATCAATCGTCTCCCTTCTTCTGTTCTTCAAAACACCTCTCCATTTGAAAAACTATATGGTATTTCTCCCGACTATTCTAAACTCAAAGTTTTTGGTAGTGCCTGCTTCGTTCTGTTACATCCTCGTGCACGTCCCGTCTCTGTTGTTTCCTTGGCTATGGCACCGAACACAAAGGATTTCGTTGTTGGGACCCTCTTTCCAACCGACTCCGGATATCTCGGCATGTCACTTTTTGGGAACACACTATGTTCTCTCGTTTGTCCTCCTTCCACACCTCTTTCTCTAGTCCTCAATCTTTCTTTACAAATACATCTGTTGACCTTTTTCCTCTCTCTGAACCCACCTTGGATACTGAGCTTGCACAATCTTCACCTGCTACTGCAAATCTGGATCCACCGTCTGTCTCCGATGATGTTCCTGAATCGTCACCTGCTACTCCTCTTCGTCGCTCTACCCGGGTAAGAGAACCTCCCCCTCATCTCACTGATTACCATTGTTTTTCTACCATTGTTTCCCTTGTTGAACCCACCTCTTATCAAGAGGCCAGTATTAACCCAGTATGGCAGAAAGCAATGGATGAAGAATTACAGGCTCTTGAAAAGACGCACACTTGGGACTATGTTGATTTACCTCCCGGTAAACGACCCATTGGTTGCAAATGGATTTACAAAATCAAAACTCACTCTGATGGAACTATTGAACGTTATAAAGCTCGGCTTGTTGCAAAAGGATACTCACAAGAATATGGGATTGACTATGAAGAAACATTTGCCCCTGTTGCCCGGATGACATCTGTTCGCAGCTTGTTAGCTGTTGCTGCTGCCAAACAGTGGCCTCTTCTTCAGATGGATGTCAAAAATGCATTTCTTAACGGCAATCTATCTGAAGAAGTGTATATGAAGCCACCTCAGGGAACTTCTCCTCCTCCCAACAAGGTGTGTCTCCTTCGTCGCGCTCTATACGGTCTAAAACAGGCTCCACGAGCTTGGTTTGCCACGTTTAGCTCCACCATTACTCAACTTGGATTTACCTCCAGCTCTCACGACAATGCCCTTTTTACACGACAGACAACTCATGGTATTGTTCTTCTCCTTCTTTATGTTGATGATATGATTATTACTGGTAATGATCAACAGGCCATATCCGACCTACAACAATATCTTGGTCAACATTTTGAGATGAAAGACCTTGGATCTCTCAATTACTTTCTCGGTCTTGAAGTCTCTCACCGTTCAGATGGTTATCTGTTATCTCAAGCGAAATATGCATCTGATCTAATAGCACGCTCAGGAATTACAGACTCCACCACATCTTCAACACCGTTAGATCCTCATGTCCATCTAACTCCGTTTGATGGTGTTCCTCTTGACGATGCAAGCTTGTATCGGCAACTTGTTGGCAGTCTTATATACCTAACAGTAACTCGCCCAGATATTGCATATGCTGTTCATATTGTCAGTCAATTTATGGCTGCTCCTCGAACAATTCATTTCACTGCTGTTCTACGCATACTTCGCTATGTCAAAGGCACCTTGGGACATGGTCTTCAATTCTCATCTCAGTCTTCCCTTGTGTTGTCGGGATATTCTGATGCTGATTGGGCGGGGGATCCTACTGATCGACGATCCACTACAGGATACTGTTTTTACTTAGGTGATTCTCTCATCTCATGG
CGTAGTAAGAAACAAAGTGTTATATCTCGTTCCAGTACGGAATCTGAATATCGTGCTCTGGCTGATGCTACAGCTGAACTTATATGGCTTCGGTGGCTCCTTGCCGATATGGGTGTCCCTCAACAGGGTCCTACCCTCCTCCATTGTGACAATCGTAGTGCCATTCAGATTGCTCACAATGATGTGTTTCATGAACGTACAAAACACATTGAAAATGACTGTCACTTTGTTCGTCACCACCTCTTAAACAACACCCTCCTCTTACGTTCTGTTTCTACTATTGAACAACCTGCGGATATCTTCACCAAAGCCTTGCCATCTAATCGATTCTGTCACTTACTTACCAAACTCAAGTTGATCGCTACTCTACCACCTTGA

mRNA sequence

ATGTGTCATTTTTTTTTCGTGTTTAAAGCTCAACTTATTAACTTCCACTCAATTACTAATGACAACCTAATTAAAACAGATTTCATCTCCGCCGCCGCCGCCGCCGCCAACCACCCGTCGCCCATCTCCATCCAAGCGTCGATCCTCACCCGCACGTCAGTCGCGTCCTCTCCTCTATCACGCACGCTGTCCGCCGCCGCCAGTATCATTCGTGCCGCGGTCCTCCTGAGTTTCTCTTCGCACGTCAGTCGCGCCCTCTCCTCTATCACGCACGCTGTCCGCCGCCGCCAGTATCATTCGTGCCGCGGTCCTCCTGAGTTTCTCTTCGCACGTCAGTCGCGCCCTCTCCTCTATCACGCACGCTGTCCGCCGCCGCCAGTATCATTCGTGCCGCGTCGCGCCCTCTCCTCTATCACGCACGCTGTCCGCCGCCGCCAGTATCATTCGTGCCGCGGTCCTCCTGAGTTTCTTTTCGCCGTCTGTGGGTCTCGGGCTGCCGCAACTTCCCGTCGCCCATCTCCATCCACGAAGTGCAGTCGCAGCCTCGATCTGTCTCAGGTGTCCATTTCTGTCTCCGATCTGTCTCGAATACGCCGCTCCGATTTGTTTCCGCTCCGATCTGTCTCCATCCGCGAAGTGCAGTTGTGCTTCACCCTTGCAGTTCGCCGTCTCCGCTCCGATCTGTCTCAAGCTATTTTTTTTTTGTTTGAAAAATCATTGGGCATTGCTTCTCTATCTCAAGCCGTCCAGATCCGTCGTAATGAATATTTGGCAGTTCTTCAACCCATTTGGACTCAACTTGACCAAGCGAACATCAGCAAAGATCATCTTCGCCTTATTAAAGTCCTTATGGGATTACGTCCAGAATATGAATCTGTTAGAGCTGCTTTACTACACCGGAATCCCTTACCCTCATTAGATGCAGCTATTCAAGAAATTCTGTTTGAAGAAAAGCGTCTTGGCATCAACTCTACTAAACAATCTGATGTTGTCCTTGCTAGCACATACACTCCCAACAGAGTCGCAAATATGTTTTGTAAGAATTGTAAGCTCTCTGGTCACAAATTTAGTAACTGTCCTAAAATAGAGTGCAGGTACTGCCATAAACATGGCCACATTCTGGATAACTGCCCTATCAGACCACCCCGACCCCCTGGCACTTCCACAAAAGAGAAAATTTTTACCAAACATGGTTCCTCATCTGTTGTTGCTGCGACCTCGGATGATTCATCCCTCATTCAGATAAGTGATCTTCAGAGCTTATTGAATCAACTAATTTCATCATCCTCCGCTCTGGCTGTTTCATCAGGTAATCGATGGCTTCTTGATTCTGCCTGTTGTAATCATATGACCTCTGACGTTTCTCTTATGTCTACTTCTAGCCCTACAAAATCTTTACCTCCTATTTATGCTGCTGATGGTAATTGTATGAACATCTCTCATACTGGTACCATTGATACTCCCAGTGTACATCTTCCCCATACTTACTGTGTTCCTAACCTGACCTTTAATCTAGTGTCTGTTGGTCAATTATGTGATCTTGGCTTAAATGTTTCATTTTCTCCCAATGGTTGTCAGGTTCAGGATCCGCAGACGGGACAGACGATTGGAACGGGTCGCAAAGTGGGAAGATTGTTTGAGCTCACATCACTTCGGGTTTCATCTCCTTCTTCCATCTCTGCTTCGGTCACTGATTCTGACACATATCAGTGGCATCTTCGTCTTGGTCATGCTTCCTCTGAAAAACTTCGTCATTTAATTTCTGTTAACAATTTGACTAATCTTACTAAGTTTGTTCCTTTTAATTGTTTGAATTGCAAACTTGCTAAACAACCTGCCTTATCTTTTTCTCAATCCATCTCTAATTGTGATAAACCTTTTGATTTAGTGCATTCTGATATTTGGGGTCCTGCCCCAATTACTACTGTTCATGGTTATCGCTACTATGTTTTATTCATTGATGACTACTCTCGATTTACATGGATTTACTTTCTAAAACA
TCGTTCTGAATTATCTCGCACATATATTGAGTTTGCTAACATGATTCGCACTCAATTTTCCTCTCCCATCAAAATTCTTCGCACTGATAATGTTTTGGAATATAAAGATTCCATCCTTCTTTCTTTTCTTTCCCAACAGGGCACTATTGTTCAGCGCTCTTGCCCTCATATCTCTCAACAAAATGGACGTGCTGAGCGCAAACATCGTCACATTCTTGACTCAGTACGTGCCCTCCTTCTTTCTGCCTCTTGTCCAGAAAAATTCTGGGGTGAAGCTGCCCTTACATCAGTATATACAATCAATCGTCTCCCTTCTTCTGTTCTTCAAAACACCTCTCCATTTGAAAAACTATATGGTATTTCTCCCGACTATTCTAAACTCAAAGTTTTTGGTAGTGCCTGCTTCGTTCTGTTACATCCTCGTGCACGTCCCGTCTCTGTTGTTTCCTTGGCTATGGCACCGAACACAAAGGATTTCGTTGTTGGGACCCTCTTTCCAACCGACTCCGGATATCTCGGCATTCCTCAATCTTTCTTTACAAATACATCTGTTGACCTTTTTCCTCTCTCTGAACCCACCTTGGATACTGAGCTTGCACAATCTTCACCTGCTACTGCAAATCTGGATCCACCGTCTGTCTCCGATGATGTTCCTGAATCGTCACCTGCTACTCCTCTTCGTCGCTCTACCCGGGTAAGAGAACCTCCCCCTCATCTCACTGATTACCATTGTTTTTCTACCATTGTTTCCCTTGTTGAACCCACCTCTTATCAAGAGGCCAGTATTAACCCAGTATGGCAGAAAGCAATGGATGAAGAATTACAGGCTCTTGAAAAGACGCACACTTGGGACTATGTTGATTTACCTCCCGGTAAACGACCCATTGGTTGCAAATGGATTTACAAAATCAAAACTCACTCTGATGGAACTATTGAACGTTATAAAGCTCGGCTTGTTGCAAAAGGATACTCACAAGAATATGGGATTGACTATGAAGAAACATTTGCCCCTGTTGCCCGGATGACATCTGTTCGCAGCTTGTTAGCTGTTGCTGCTGCCAAACAGTGGCCTCTTCTTCAGATGGATGTCAAAAATGCATTTCTTAACGGCAATCTATCTGAAGAAGTGTATATGAAGCCACCTCAGGGAACTTCTCCTCCTCCCAACAAGGTGTGTCTCCTTCGTCGCGCTCTATACGGTCTAAAACAGGCTCCACGAGCTTGGTTTGCCACGTTTAGCTCCACCATTACTCAACTTGGATTTACCTCCAGCTCTCACGACAATGCCCTTTTTACACGACAGACAACTCATGGTATTGTTCTTCTCCTTCTTTATGTTGATGATATGATTATTACTGGTAATGATCAACAGGCCATATCCGACCTACAACAATATCTTGGTCAACATTTTGAGATGAAAGACCTTGGATCTCTCAATTACTTTCTCGGTCTTGAAGTCTCTCACCGTTCAGATGGTTATCTGTTATCTCAAGCGAAATATGCATCTGATCTAATAGCACGCTCAGGAATTACAGACTCCACCACATCTTCAACACCGTTAGATCCTCATGTCCATCTAACTCCGTTTGATGGTGTTCCTCTTGACGATGCAAGCTTGTATCGGCAACTTGTTGGCAGTCTTATATACCTAACAGTAACTCGCCCAGATATTGCATATGCTGTTCATATTGTCAGTCAATTTATGGCTGCTCCTCGAACAATTCATTTCACTGCTGTTCTACGCATACTTCGCTATGTCAAAGGCACCTTGGGACATGGTCTTCAATTCTCATCTCAGTCTTCCCTTGTGTTGTCGGGATATTCTGATGCTGATTGGGCGGGGGATCCTACTGATCGACGATCCACTACAGGATACTGTTTTTACTTAGGTGATTCTCTCATCTCATGGCGTAGTAAGAAACAAAGTGTTATATCTCGTTCCAGTACGGAATCTGAATATCGTGCTCTGGCTGATGCTACAGCTGAACTTATATGGCTTC
GGTGGCTCCTTGCCGATATGGGTGTCCCTCAACAGGGTCCTACCCTCCTCCATTGTGACAATCGTAGTGCCATTCAGATTGCTCACAATGATGTGTTTCATGAACGTACAAAACACATTGAAAATGACTGTCACTTTGTTCGTCACCACCTCTTAAACAACACCCTCCTCTTACGTTCTGTTTCTACTATTGAACAACCTGCGGATATCTTCACCAAAGCCTTGCCATCTAATCGATTCTGTCACTTACTTACCAAACTCAAGTTGATCGCTACTCTACCACCTTGA

Coding sequence (CDS)

ATGTGTCATTTTTTTTTCGTGTTTAAAGCTCAACTTATTAACTTCCACTCAATTACTAATGACAACCTAATTAAAACAGATTTCATCTCCGCCGCCGCCGCCGCCGCCAACCACCCGTCGCCCATCTCCATCCAAGCGTCGATCCTCACCCGCACGTCAGTCGCGTCCTCTCCTCTATCACGCACGCTGTCCGCCGCCGCCAGTATCATTCGTGCCGCGGTCCTCCTGAGTTTCTCTTCGCACGTCAGTCGCGCCCTCTCCTCTATCACGCACGCTGTCCGCCGCCGCCAGTATCATTCGTGCCGCGGTCCTCCTGAGTTTCTCTTCGCACGTCAGTCGCGCCCTCTCCTCTATCACGCACGCTGTCCGCCGCCGCCAGTATCATTCGTGCCGCGTCGCGCCCTCTCCTCTATCACGCACGCTGTCCGCCGCCGCCAGTATCATTCGTGCCGCGGTCCTCCTGAGTTTCTTTTCGCCGTCTGTGGGTCTCGGGCTGCCGCAACTTCCCGTCGCCCATCTCCATCCACGAAGTGCAGTCGCAGCCTCGATCTGTCTCAGGTGTCCATTTCTGTCTCCGATCTGTCTCGAATACGCCGCTCCGATTTGTTTCCGCTCCGATCTGTCTCCATCCGCGAAGTGCAGTTGTGCTTCACCCTTGCAGTTCGCCGTCTCCGCTCCGATCTGTCTCAAGCTATTTTTTTTTTGTTTGAAAAATCATTGGGCATTGCTTCTCTATCTCAAGCCGTCCAGATCCGTCGTAATGAATATTTGGCAGTTCTTCAACCCATTTGGACTCAACTTGACCAAGCGAACATCAGCAAAGATCATCTTCGCCTTATTAAAGTCCTTATGGGATTACGTCCAGAATATGAATCTGTTAGAGCTGCTTTACTACACCGGAATCCCTTACCCTCATTAGATGCAGCTATTCAAGAAATTCTGTTTGAAGAAAAGCGTCTTGGCATCAACTCTACTAAACAATCTGATGTTGTCCTTGCTAGCACATACACTCCCAACAGAGTCGCAAATATGTTTTGTAAGAATTGTAAGCTCTCTGGTCACAAATTTAGTAACTGTCCTAAAATAGAGTGCAGGTACTGCCATAAACATGGCCACATTCTGGATAACTGCCCTATCAGACCACCCCGACCCCCTGGCACTTCCACAAAAGAGAAAATTTTTACCAAACATGGTTCCTCATCTGTTGTTGCTGCGACCTCGGATGATTCATCCCTCATTCAGATAAGTGATCTTCAGAGCTTATTGAATCAACTAATTTCATCATCCTCCGCTCTGGCTGTTTCATCAGGTAATCGATGGCTTCTTGATTCTGCCTGTTGTAATCATATGACCTCTGACGTTTCTCTTATGTCTACTTCTAGCCCTACAAAATCTTTACCTCCTATTTATGCTGCTGATGGTAATTGTATGAACATCTCTCATACTGGTACCATTGATACTCCCAGTGTACATCTTCCCCATACTTACTGTGTTCCTAACCTGACCTTTAATCTAGTGTCTGTTGGTCAATTATGTGATCTTGGCTTAAATGTTTCATTTTCTCCCAATGGTTGTCAGGTTCAGGATCCGCAGACGGGACAGACGATTGGAACGGGTCGCAAAGTGGGAAGATTGTTTGAGCTCACATCACTTCGGGTTTCATCTCCTTCTTCCATCTCTGCTTCGGTCACTGATTCTGACACATATCAGTGGCATCTTCGTCTTGGTCATGCTTCCTCTGAAAAACTTCGTCATTTAATTTCTGTTAACAATTTGACTAATCTTACTAAGTTTGTTCCTTTTAATTGTTTGAATTGCAAACTTGCTAAACAACCTGCCTTATCTTTTTCTCAATCCATCTCTAATTGTGATAAACCTTTTGATTTAGTGCATTCTGATATTTGGGGTCCTGCCCCAATTACTACTGTTCATGGTTATCGCTACTATGTTTTATTCATTGATGACTACTCTCGATTTACATGGATTTACTTTCTAAAACA
TCGTTCTGAATTATCTCGCACATATATTGAGTTTGCTAACATGATTCGCACTCAATTTTCCTCTCCCATCAAAATTCTTCGCACTGATAATGTTTTGGAATATAAAGATTCCATCCTTCTTTCTTTTCTTTCCCAACAGGGCACTATTGTTCAGCGCTCTTGCCCTCATATCTCTCAACAAAATGGACGTGCTGAGCGCAAACATCGTCACATTCTTGACTCAGTACGTGCCCTCCTTCTTTCTGCCTCTTGTCCAGAAAAATTCTGGGGTGAAGCTGCCCTTACATCAGTATATACAATCAATCGTCTCCCTTCTTCTGTTCTTCAAAACACCTCTCCATTTGAAAAACTATATGGTATTTCTCCCGACTATTCTAAACTCAAAGTTTTTGGTAGTGCCTGCTTCGTTCTGTTACATCCTCGTGCACGTCCCGTCTCTGTTGTTTCCTTGGCTATGGCACCGAACACAAAGGATTTCGTTGTTGGGACCCTCTTTCCAACCGACTCCGGATATCTCGGCATTCCTCAATCTTTCTTTACAAATACATCTGTTGACCTTTTTCCTCTCTCTGAACCCACCTTGGATACTGAGCTTGCACAATCTTCACCTGCTACTGCAAATCTGGATCCACCGTCTGTCTCCGATGATGTTCCTGAATCGTCACCTGCTACTCCTCTTCGTCGCTCTACCCGGGTAAGAGAACCTCCCCCTCATCTCACTGATTACCATTGTTTTTCTACCATTGTTTCCCTTGTTGAACCCACCTCTTATCAAGAGGCCAGTATTAACCCAGTATGGCAGAAAGCAATGGATGAAGAATTACAGGCTCTTGAAAAGACGCACACTTGGGACTATGTTGATTTACCTCCCGGTAAACGACCCATTGGTTGCAAATGGATTTACAAAATCAAAACTCACTCTGATGGAACTATTGAACGTTATAAAGCTCGGCTTGTTGCAAAAGGATACTCACAAGAATATGGGATTGACTATGAAGAAACATTTGCCCCTGTTGCCCGGATGACATCTGTTCGCAGCTTGTTAGCTGTTGCTGCTGCCAAACAGTGGCCTCTTCTTCAGATGGATGTCAAAAATGCATTTCTTAACGGCAATCTATCTGAAGAAGTGTATATGAAGCCACCTCAGGGAACTTCTCCTCCTCCCAACAAGGTGTGTCTCCTTCGTCGCGCTCTATACGGTCTAAAACAGGCTCCACGAGCTTGGTTTGCCACGTTTAGCTCCACCATTACTCAACTTGGATTTACCTCCAGCTCTCACGACAATGCCCTTTTTACACGACAGACAACTCATGGTATTGTTCTTCTCCTTCTTTATGTTGATGATATGATTATTACTGGTAATGATCAACAGGCCATATCCGACCTACAACAATATCTTGGTCAACATTTTGAGATGAAAGACCTTGGATCTCTCAATTACTTTCTCGGTCTTGAAGTCTCTCACCGTTCAGATGGTTATCTGTTATCTCAAGCGAAATATGCATCTGATCTAATAGCACGCTCAGGAATTACAGACTCCACCACATCTTCAACACCGTTAGATCCTCATGTCCATCTAACTCCGTTTGATGGTGTTCCTCTTGACGATGCAAGCTTGTATCGGCAACTTGTTGGCAGTCTTATATACCTAACAGTAACTCGCCCAGATATTGCATATGCTGTTCATATTGTCAGTCAATTTATGGCTGCTCCTCGAACAATTCATTTCACTGCTGTTCTACGCATACTTCGCTATGTCAAAGGCACCTTGGGACATGGTCTTCAATTCTCATCTCAGTCTTCCCTTGTGTTGTCGGGATATTCTGATGCTGATTGGGCGGGGGATCCTACTGATCGACGATCCACTACAGGATACTGTTTTTACTTAGGTGATTCTCTCATCTCATGGCGTAGTAAGAAACAAAGTGTTATATCTCGTTCCAGTACGGAATCTGAATATCGTGCTCTGGCTGATGCTACAGCTGAACTTATATGGCTTC
GGTGGCTCCTTGCCGATATGGGTGTCCCTCAACAGGGTCCTACCCTCCTCCATTGTGACAATCGTAGTGCCATTCAGATTGCTCACAATGATGTGTTTCATGAACGTACAAAACACATTGAAAATGACTGTCACTTTGTTCGTCACCACCTCTTAAACAACACCCTCCTCTTACGTTCTGTTTCTACTATTGAACAACCTGCGGATATCTTCACCAAAGCCTTGCCATCTAATCGATTCTGTCACTTACTTACCAAACTCAAGTTGATCGCTACTCTACCACCTTGA
BLAST of CSPI06G14740 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 375.6 bits (963), Expect = 2.5e-102
Identity = 234/602 (38.87%), Positives = 347/602 (57.64%), Query Frame = 1

Query: 840  GIPQSFFTNTSVDLFPLSEPTLDTELAQSS--PATANLDPPSVSDDVPESSPAT------ 899
            GI  +F T  S    P S  +   E+++    P         + + V E    T      
Sbjct: 722  GIIPNFVTIPSTSNNPTSAESTTDEVSEQGEQPGEVIEQGEQLDEGVEEVEHPTQGEEQH 781

Query: 900  -PLRRSTRVREPPPHLTDYHCFSTIVSLV----EPTSYQEASINPV---WQKAMDEELQA 959
             PLRRS R     P +      ST   L+    EP S +E   +P      KAM EE+++
Sbjct: 782  QPLRRSER-----PRVESRRYPSTEYVLISDDREPESLKEVLSHPEKNQLMKAMQEEMES 841

Query: 960  LEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFA 1019
            L+K  T+  V+LP GKRP+ CKW++K+K   D  + RYKARLV KG+ Q+ GID++E F+
Sbjct: 842  LQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDFDEIFS 901

Query: 1020 PVARMTSVRSLLAVAAAKQWPLLQMDVKNAFLNGNLSEEVYMKPPQGTSPPPNK--VCLL 1079
            PV +MTS+R++L++AA+    + Q+DVK AFL+G+L EE+YM+ P+G      K  VC L
Sbjct: 902  PVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCKL 961

Query: 1080 RRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNAL-FTRQTTHGIVLLLLYVDDMIITG 1139
             ++LYGLKQAPR W+  F S +    +  +  D  + F R + +  ++LLLYVDDM+I G
Sbjct: 962  NKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDDMLIVG 1021

Query: 1140 NDQQAISDLQQYLGQHFEMKDLGSLNYFLGLEV--SHRSDGYLLSQAKYASDLIARSGIT 1199
             D+  I+ L+  L + F+MKDLG     LG+++     S    LSQ KY   ++ R  + 
Sbjct: 1022 KDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLERFNMK 1081

Query: 1200 DSTTSSTPLDPHVHL------TPFDGVPLDDASLYRQLVGSLIYLTV-TRPDIAYAVHIV 1259
            ++   STPL  H+ L      T  +         Y   VGSL+Y  V TRPDIA+AV +V
Sbjct: 1082 NAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDIAHAVGVV 1141

Query: 1260 SQFMAAPRTIHFTAVLRILRYVKGTLGHGLQFSSQSSLVLSGYSDADWAGDPTDRRSTTG 1319
            S+F+  P   H+ AV  ILRY++GT G  L F   S  +L GY+DAD AGD  +R+S+TG
Sbjct: 1142 SRFLENPGKEHWEAVKWILRYLRGTTGDCLCFGG-SDPILKGYTDADMAGDIDNRKSSTG 1201

Query: 1320 YCFYLGDSLISWRSKKQSVISRSSTESEYRALADATAELIWLRWLLADMGVPQQGPTLLH 1379
            Y F      ISW+SK Q  ++ S+TE+EY A  +   E+IWL+  L ++G+ Q+   +++
Sbjct: 1202 YLFTFSGGAISWQSKLQKCVALSTTEAEYIAATETGKEMIWLKRFLQELGLHQK-EYVVY 1261

Query: 1380 CDNRSAIQIAHNDVFHERTKHIENDCHFVRHHLLNNTLLLRSVSTIEQPADIFTKALPSN 1414
            CD++SAI ++ N ++H RTKHI+   H++R  + + +L +  +ST E PAD+ TK +P N
Sbjct: 1262 CDSQSAIDLSKNSMYHARTKHIDVRYHWIREMVDDESLKVLKISTNENPADMLTKVVPRN 1316
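
Each HSP summary line in this record reports identity and positive counts as a fraction of the alignment length, followed by the same value as a percentage. The sketch below is one possible way to parse such a line and recompute the percentages. It is a minimal illustration, not part of the database output: the function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, and the pattern assumes the line layout shown in this record (it also tolerates the "Postives" spelling that some BLAST report renderings emit).

```python
import re

# Pattern for lines such as:
#   Identity = 234/602 (38.87%), Positives = 347/602 (57.64%), Query Frame = 1
# "Pos(?:i)?tives" accepts both "Positives" and the "Postives" spelling.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Pos(?:i)?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from one HSP summary line.

    Hypothetical helper for illustration; the key names are assumptions.
    """
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, ident_len, positive, pos_len = (int(g) for g in match.groups())
    return {
        "identical": identical,
        "positive": positive,
        "alignment_length": ident_len,
        # Percentages are count / alignment length, rounded to two decimals.
        "identity_pct": round(100.0 * identical / ident_len, 2),
        "positive_pct": round(100.0 * positive / pos_len, 2),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 234/602 (38.87%), Positives = 347/602 (57.64%), Query Frame = 1"
    )
    print(stats)
```

Recomputing the percentages from the counts, rather than trusting the printed values, gives a quick consistency check when hit summaries are collected from many records.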

BLAST of CSPI06G14740 vs. Swiss-Prot
Match: COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)

HSP 1 Score: 361.7 bits (927), Expect = 3.7e-98
Identity = 192/497 (38.63%), Positives = 299/497 (60.16%), Query Frame = 1

Query: 933  WQKAMDEELQALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQ 992
            W++A++ EL A +  +TW     P  K  +  +W++ +K +  G   RYKARLVA+G++Q
Sbjct: 906  WEEAINTELNAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQ 965

Query: 993  EYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNAFLNGNLSEEVYMKPPQGTS 1052
            +Y IDYEETFAPVAR++S R +L++       + QMDVK AFLNG L EE+YM+ PQG S
Sbjct: 966  KYQIDYEETFAPVARISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGIS 1025

Query: 1053 PPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALF--TRQTTHGIVLLL 1112
               + VC L +A+YGLKQA R WF  F   + +  F +SS D  ++   +   +  + +L
Sbjct: 1026 CNSDNVCKLNKAIYGLKQAARCWFEVFEQALKECEFVNSSVDRCIYILDKGNINENIYVL 1085

Query: 1113 LYVDDMIITGNDQQAISDLQQYLGQHFEMKDLGSLNYFLGLEVSHRSDGYLLSQAKYASD 1172
            LYVDD++I   D   +++ ++YL + F M DL  + +F+G+ +  + D   LSQ+ Y   
Sbjct: 1086 LYVDDVVIATGDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRIEMQEDKIYLSQSAYVKK 1145

Query: 1173 LIARSGITDSTTSSTPLDPHVHLTPFDGVPLDDASLYRQLVGSLIYLTV-TRPDIAYAVH 1232
            ++++  + +    STPL   ++    +    D  +  R L+G L+Y+ + TRPD+  AV+
Sbjct: 1146 ILSKFNMENCNAVSTPLPSKINYELLNS-DEDCNTPCRSLIGCLMYIMLCTRPDLTTAVN 1205

Query: 1233 IVSQFMAAPRTIHFTAVLRILRYVKGTLGHGLQFSSQSSL--VLSGYSDADWAGDPTDRR 1292
            I+S++ +   +  +  + R+LRY+KGT+   L F    +    + GY D+DWAG   DR+
Sbjct: 1206 ILSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIFKKNLAFENKIIGYVDSDWAGSEIDRK 1265

Query: 1293 STTGYCFYLGD-SLISWRSKKQSVISRSSTESEYRALADATAELIWLRWLLADMGVPQQG 1352
            STTGY F + D +LI W +K+Q+ ++ SSTE+EY AL +A  E +WL++LL  + +  + 
Sbjct: 1266 STTGYLFKMFDFNLICWNTKRQNSVAASSTEAEYMALFEAVREALWLKFLLTSINIKLEN 1325

Query: 1353 PTLLHCDNRSAIQIAHNDVFHERTKHIENDCHFVRHHLLNNTLLLRSVSTIEQPADIFTK 1412
            P  ++ DN+  I IA+N   H+R KHI+   HF R  + NN + L  + T  Q ADIFTK
Sbjct: 1326 PIKIYEDNQGCISIANNPSCHKRAKHIDIKYHFAREQVQNNVICLEYIPTENQLADIFTK 1385

Query: 1413 ALPSNRFCHLLTKLKLI 1424
             LP+ RF  L  KL L+
Sbjct: 1386 PLPAARFVELRDKLGLL 1401

BLAST of CSPI06G14740 vs. Swiss-Prot
Match: M810_ARATH (Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg00810 PE=4 SV=1)

HSP 1 Score: 199.9 bits (507), Expect = 1.9e-49
Identity = 105/224 (46.88%), Positives = 144/224 (64.29%), Query Frame = 1

Query: 1109 LLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDLGSLNYFLGLEVSHRSDGYLLSQAKYA 1168
            LLLYVDD+++TG+    ++ L   L   F MKDLG ++YFLG+++     G  LSQ KYA
Sbjct: 3    LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYA 62

Query: 1169 SDLIARSGITDSTTSSTPLDPHVHLTPFDGVPLDDASLYRQLVGSLIYLTVTRPDIAYAV 1228
              ++  +G+ D    STPL   ++ +        D S +R +VG+L YLT+TRPDI+YAV
Sbjct: 63   EQILNNAGMLDCKPMSTPLPLKLN-SSVSTAKYPDPSDFRSIVGALQYLTLTRPDISYAV 122

Query: 1229 HIVSQFMAAPRTIHFTAVLRILRYVKGTLGHGLQFSSQSSLVLSGYSDADWAGDPTDRRS 1288
            +IV Q M  P    F  + R+LRYVKGT+ HGL     S L +  + D+DWAG  + RRS
Sbjct: 123  NIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRS 182

Query: 1289 TTGYCFYLGDSLISWRSKKQSVISRSSTESEYRALADATAELIW 1333
            TTG+C +LG ++ISW +K+Q  +SRSSTE+EYRALA   AEL W
Sbjct: 183  TTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CSPI06G14740 vs. Swiss-Prot
Match: YCH4_YEAST (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY5A PE=5 SV=2)

HSP 1 Score: 166.4 bits (420), Expect = 2.3e-39
Identity = 105/308 (34.09%), Positives = 162/308 (52.60%), Query Frame = 1

Query: 1028 MDVKNAFLNGNLSEEVYMKPPQG--TSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQ 1087
            MDV  AFLN  + E +Y+K P G      P+ V  L   +YGLKQAP  W    ++T+ +
Sbjct: 1    MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60

Query: 1088 LGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDLGSL 1147
            +GF     ++ L+ R T+ G + + +YVDD+++     +    ++Q L + + MKDLG +
Sbjct: 61   IGFCRHEGEHGLYFRSTSDGPIYIAVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGKV 120

Query: 1148 NYFLGLEVSHRSDGYL-LSQAKYASDLIARSGITDSTTSSTPLDPHVHLTPFDGVPLDDA 1207
            + FLGL +   S+G + LS   Y +   + S I     + TPL     L       L D 
Sbjct: 121  DKFLGLNIHQSSNGDITLSLQDYIAKAASESEINTFKLTQTPLCNSKPLFETTSPHLKDI 180

Query: 1208 SLYRQLVGSLIYLTVT-RPDIAYAVHIVSQFMAAPRTIHFTAVLRILRYVKGTLGHGLQF 1267
            + Y+ +VG L++   T RPDI+Y V ++S+F+  PR IH  +  R+LRY+  T    L++
Sbjct: 181  TPYQSIVGQLLFCANTGRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYTTRSMCLKY 240

Query: 1268 SSQSSLVLSGYSDADWAGDPTDRRSTTGYCFYLGDSLISWRSKK-QSVISRSSTESEYRA 1327
             S S L L+ Y DA          ST GY   L  + ++W SKK + VI   STE+EY  
Sbjct: 241  RSGSQLALTVYCDASHGAIHDLPHSTGGYVTLLAGAPVTWSSKKLKGVIPVPSTEAEYIT 300

Query: 1328 LADATAEL 1331
             ++   E+
Sbjct: 301  ASETVMEI 308

BLAST of CSPI06G14740 vs. Swiss-Prot
Match: YJ41B_YEAST (Transposon Ty4-J Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY4B-J PE=3 SV=3)

HSP 1 Score: 111.3 bits (277), Expect = 8.8e-23
Identity = 123/514 (23.93%), Positives = 225/514 (43.77%), Query Frame = 1

Query: 933  WQKAMDEELQALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIER---YKARLVAKG 992
            +++A  +ELQ L+    +D VD+   +  I    I  + T++  T +R   YKAR+V +G
Sbjct: 1289 YKQAYHKELQNLKDMKVFD-VDVKYSRSEIPDNLI--VPTNTIFTKKRNGIYKARIVCRG 1348

Query: 993  YSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNAFLNGNLSEEVYMKPPQ 1052
             +Q     Y            ++  L +A  +   +  +D+ +AFL   L EE+Y+  P 
Sbjct: 1349 DTQSPDT-YSVITTESLNHNHIKIFLMIANNRNMFMKTLDINHAFLYAKLEEEIYIPHPH 1408

Query: 1053 GTSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLL 1112
                    V  L +ALYGLKQ+P+ W       +  +G   +S+   L+  QT    +++
Sbjct: 1409 DR----RCVVKLNKALYGLKQSPKEWNDHLRQYLNGIGLKDNSYTPGLY--QTEDKNLMI 1468

Query: 1113 LLYVDDMIITGNDQQAISDLQQYLGQHFEMKDLGSL------NYFLGLEVSHRS-----D 1172
             +YVDD +I  +++Q + +    L  +FE+K  G+L         LG+++ +       D
Sbjct: 1469 AVYVDDCVIAASNEQRLDEFINKLKSNFELKITGTLIDDVLDTDILGMDLVYNKRLGTID 1528

Query: 1173 GYLLS-----QAKYASDL--IARSGITDSTTSSTPLDPHVHLTPFDGVPLDDASL-YRQL 1232
              L S       KY  +L  I +S I   +T    +DP   +            L  +QL
Sbjct: 1529 LTLKSFINRMDKKYNEELKKIRKSSIPHMSTYK--IDPKKDVLQMSEEEFRQGVLKLQQL 1588

Query: 1233 VGSLIYLT-VTRPDIAYAVHIVSQFMAAPRTIHFTAVLRILRYVKGTLGHGLQF--SSQS 1292
            +G L Y+    R DI +AV  V++ +  P    F  + +I++Y+      G+ +      
Sbjct: 1589 LGELNYVRHKCRYDIEFAVKKVARLVNYPHERVFYMIYKIIQYLVRYKDIGIHYDRDCNK 1648

Query: 1293 SLVLSGYSDADWAGDPTDRRSTTGYCFYLGDSLISWRSKKQSVISRSSTESEYRALADAT 1352
               +   +DA   G   D +S  G   + G ++ +  S K +    SSTE+E  A+ +  
Sbjct: 1649 DKKVIAITDAS-VGSEYDAQSRIGVILWYGMNIFNVYSNKSTNRCVSSTEAELHAIYEGY 1708

Query: 1353 AELIWLRWLLADMGVPQQGPTLLHCDNRSAIQIAHNDVFHERTKHIENDCHFVRHHLLNN 1412
            A+   L+  L ++G       ++  D++ AIQ  +      + K        ++  +   
Sbjct: 1709 ADSETLKVTLKELGEGDNNDIVMITDSKPAIQGLNRSYQQPKEKFTWIKTEIIKEKIKEK 1768

Query: 1413 TLLLRSVSTIEQPADIFTKALPSNRFCHLLTKLK 1422
            ++ L  ++     AD+ TK + ++ F   +  LK
Sbjct: 1769 SIKLLKITGKGNIADLLTKPVSASDFKRFIQVLK 1789

BLAST of CSPI06G14740 vs. TrEMBL
Match: A0A151SM08_CAJCA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=KK1_002043 PE=4 SV=1)

HSP 1 Score: 1177.9 bits (3046), Expect = 0.0e+00
Identity = 611/1120 (54.55%), Positives = 767/1120 (68.48%), Query Frame = 1

Query: 274  KDHLRLIKVLMGLRPEYESVRAALLHRNPLPSLDAAIQEILFEEKRLGINSTKQSDVVLA 333
            +D  +LI+ LM L  +YE VRA+LLH+ PLP+L+ A+  +  EE RLG+   K       
Sbjct: 28   RDGTKLIQFLMALTDDYEPVRASLLHQEPLPTLEDALPRLQSEETRLGLLCAKPDMAFAV 87

Query: 334  STYTPNRVANMFCKNCKLSGHKFSNCPKIECRYCHKHGHILDNCPIRPPRPPGTSTKEKI 393
            ST   N     +C NC+ SGH +  CP IEC +C K GHI  NCP R P    T     +
Sbjct: 88   STSKGN-----YCGNCRQSGHVYIECPIIECHHCRKKGHIAPNCPTRDPNRSST-----L 147

Query: 394  FTKHGSSSVVAATSDDSSLIQISDLQSLLNQLISSSSALAVSSGNRWLLDSACCNHMTSD 453
            F ++                       L+  +I   +        RW  DSACCNHMTS 
Sbjct: 148  FCRY---------------------CKLVGHIIDHCN-------TRWYFDSACCNHMTSA 207

Query: 454  VSLMSTSSPTKSLPPIYAADGNCMNISHTGTIDTPSVHLPHTYCVPNLTFNLVSVGQLCD 513
              + S  S    +  I+ ADG+ M +SH G I TPS+ +P  Y +P L FNLVSVGQLCD
Sbjct: 208  SHVFSDLSSRDRISHIHTADGSLMEVSHKGPISTPSLSMPDAYLIPKLNFNLVSVGQLCD 267

Query: 514  LGLNVSFSPNGCQVQDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTYQWHL 573
            LG  ++FS  GC VQDP+TG+ IG GRK+GR+FELT+L V S +++ A+ T S  + WH 
Sbjct: 268  LGYILTFSSTGCSVQDPRTGKIIGNGRKIGRMFELTTLHVPSSNNLCAASTPSSIHLWHQ 327

Query: 574  RLGHASSEKLRHLISVNNLTNLTKFVPFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSD 633
            RLGH S  KLR LIS+ +L ++ K    +C  C+ AKQ AL F+ S S+    FDLVH D
Sbjct: 328  RLGHTSLSKLRPLISMGSLGSI-KEDKLDCTACQTAKQAALPFNDSTSSSVSLFDLVHYD 387

Query: 634  IWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSELSRTYIEFANMIRTQFSSPIKIL 693
            +WGPAP  T+ G RY+++FIDD+SRFTWIY +K RSE+ + YI FA MIRTQFS  IK  
Sbjct: 388  VWGPAPTPTMGGCRYFIIFIDDFSRFTWIYLMKSRSEIPQIYINFATMIRTQFSKCIKTF 447

Query: 694  RTDNVLEYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPE 753
            R DN  EY+DS LL FL++QGT  + SCP  SQQNGRAERKHRHILDS+RA+L+S+SCPE
Sbjct: 448  RRDNASEYRDSKLLHFLAEQGTTSEFSCPGTSQQNGRAERKHRHILDSIRAMLISSSCPE 507

Query: 754  KFWGEAALTSVYTINRLPSSVLQNTSPFEKLYGISPDYSKLKVFGSACFVLLHP----RA 813
            + WGEAALT+VY INRLPSSVL + +PFE+L+G  P Y  L+VFG ACFVLL P    + 
Sbjct: 508  RTWGEAALTAVYVINRLPSSVLGDKTPFERLFGTPPSYESLRVFGCACFVLLQPHEYTKL 567

Query: 814  RPVSVVSLAMAPNTKD--------------------FVVGTLFPTDSGYLGIPQS---FF 873
            +P + +   +   T+                     F    +F + S +  IP +    F
Sbjct: 568  QPRARLCCFLGYGTEHKGYRVWDPISQCIRISRHVVFWEHKMFSSLSTFKSIPSTSTPLF 627

Query: 874  TNTSVDLFPLSEPTLDTELAQSSPATANLDPPSVSDDV-PESSPAT-------PLRRSTR 933
            TN S+DLFP      D +   S   T   D P + +D+ P   PA        P     R
Sbjct: 628  TNPSIDLFP-----HDFDAGSSDELTGASDLPIIPNDLTPAVDPAVQDPALPPPPGLPPR 687

Query: 934  VREPPPHLTDYHCFSTIVSLVEPTSYQEASINPVWQKAMDEELQALEKTHTWDYVDLPPG 993
            VR+PP +L DYH FSTI+S  EP +Y+EAS +P W+++M  ELQALE T+TWD VD PP 
Sbjct: 688  VRKPPSYLHDYHYFSTIMSHYEPQTYREASADPKWRESMQAELQALENTNTWDLVDHPPD 747

Query: 994  KRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVA 1053
            K  + CKW++K+KT+SDG+IERYKARLVA+G++QEYGIDYEETFAPVAR+TS+R+LLA+A
Sbjct: 748  KNLMSCKWVFKVKTYSDGSIERYKARLVARGFTQEYGIDYEETFAPVARLTSLRTLLAIA 807

Query: 1054 AAKQWPLLQMDVKNAFLNGNLSEEVYMKPPQGTSPPPNKVCLLRRALYGLKQAPRAWFAT 1113
            A+K+W + QMDVKNAFLNG+L  EVYM+PP G S PP+KVC LR+ALYGLKQAPR+WFA 
Sbjct: 808  ASKKWFIDQMDVKNAFLNGDLDAEVYMQPPPGYSCPPHKVCRLRKALYGLKQAPRSWFAK 867

Query: 1114 FSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFE 1173
            F  TI QLGFTSS++DNALF R+  HG V+LLLYVDDMIITG+D   IS+L+Q+L  HFE
Sbjct: 868  FHDTIAQLGFTSSTYDNALFIRRNDHGTVILLLYVDDMIITGDDSNGISELKQFLNLHFE 927

Query: 1174 MKDLGSLNYFLGLEVSHRSDGYLLSQAKYASDLIARSGITDSTTSSTPLDPHVHLTPFDG 1233
            MKDLGSL+YFLG+++    DG  LSQAKYASDLI+R+G+TD  T STPL+   H TP DG
Sbjct: 928  MKDLGSLSYFLGIQILSCDDGLFLSQAKYASDLISRAGLTDCKTESTPLETRAHFTPLDG 987

Query: 1234 VPLDDASLYRQLVGSLIYLTVTRPDIAYAVHIVSQFMAAPRTIHFTAVLRILRYVKGTLG 1293
             PL+D +LYRQLVGSLIYLTVTRPDIAYAVH+VSQFM APR+ HF AVLRI+RY+KGT+ 
Sbjct: 988  TPLEDPTLYRQLVGSLIYLTVTRPDIAYAVHLVSQFMCAPRSTHFAAVLRIIRYIKGTIF 1047

Query: 1294 HGLQFSSQSSLVLSGYSDADWAGDPTDRRSTTGYCFYLGDSLISWRSKKQSVISRSSTES 1353
            HGL +S  S L+L  YSDADW GDPTDRRS TG+C +LGDSLISWRSKKQ +++RSSTE+
Sbjct: 1048 HGLHYSVDSPLILRAYSDADWGGDPTDRRSVTGFCIFLGDSLISWRSKKQQLVARSSTEA 1103

Query: 1354 EYRALADATAELIWLRWLLADMGVPQQGPTLLHCDNRSAI 1359
            EYRA+AD T+E++W+RWLL D+G  Q  PT L CDNRSAI
Sbjct: 1108 EYRAMADTTSEIVWIRWLLGDLGFLQSFPTDLFCDNRSAI 1103

BLAST of CSPI06G14740 vs. TrEMBL
Match: A5BVC1_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_027174 PE=4 SV=1)

HSP 1 Score: 1129.8 bits (2921), Expect = 0.0e+00
Identity = 626/1260 (49.68%), Positives = 818/1260 (64.92%), Query Frame = 1

Query: 254  NEYLAVLQPIWTQLD-------------QANISKDHLRLIKVLMGLRPEYESVRAALLHR 313
            N+Y   L+ IW Q+D             Q    +D  RL + LM L  ++E +R  LL+R
Sbjct: 134  NDYYDQLRFIWDQIDLSDPIWECSKDAQQYASIRDEFRLYEFLMSLHKDFEPIRGQLLNR 193

Query: 314  NPLPSLDAAIQEILFEEKRLGINSTKQSDVVLAST-YTPNRVANMFCKNCKLSGHKFSNC 373
            +  PSLD A+ E++ EE RL     +    +LA T  TP         +   S ++    
Sbjct: 194  SXAPSLDTAVNELVREEARLATLQAQNKLNILAITPSTPLIEQPQQLGDFSGSNNRRKQN 253

Query: 374  PKIECRYCHKHGHILDNCPIR----------PPRPPGTSTKEKIFTKHGSSSVVAATSDD 433
             K  C YC + GH ++ C  R           P PP  ST +       S S +  +S +
Sbjct: 254  NKKFCNYCKRPGHTIETCYRRNKSTATVANTAPTPPTVSTSQS------SGSTINLSSTE 313

Query: 434  SSLIQISDLQSLLNQLISSSSALAVSSGNRWLLDSACCNHMTSDVSLMSTSSPTKSLPPI 493
               I    ++ + N  +S++ ++       WL DSACCNHMT   SL +   P      I
Sbjct: 314  LQEIIAQAVRMVGNASLSTALSVLPGKSQTWLFDSACCNHMTPHSSLFTNLDPAPHPLNI 373

Query: 494  YAADGNCMNISHTGTIDTPSVHLPHTYCVPNLTFNLVSVGQLCDLGLNVSFSPNGCQVQD 553
            + ADG+ M+ +  G + T ++ +P  + VP+L++NL SVGQL +LG  + F  +GC VQD
Sbjct: 374  HIADGSTMHGNSLGFVSTSTLSVPGVFHVPDLSYNLCSVGQLAELGYRLIFXYSGCIVQD 433

Query: 554  PQTGQTIGTGRKVGRLFELTSLRVS--SPSSISASVTDSDTYQ----WHLRLGHASSEKL 613
             +TGQ +GTG +VGR+F + +L +   +P S++A+     +      WH RLGHA S ++
Sbjct: 434  XRTGQELGTGPRVGRMFPVNNLHLPPVAPVSVAAATAAVSSLPSLALWHSRLGHAPSSRV 493

Query: 614  RHLISVNNLTNLTKFV-------PFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSDIWG 673
            + L+S   L ++++ +        F+C +C+L KQPAL F+ S S     F+L+HSD+WG
Sbjct: 494  QQLVSRGLLGSVSRGLLGSVSKDNFDCTSCQLGKQPALPFNNSDSISKSIFELIHSDVWG 553

Query: 674  PAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSELSRTYIEFANMIRTQFSSPIKILRTD 733
            P+P+ ++ G RY+V+FIDDYSR++WI+ +K RSE+   Y  FA M+ TQFS  IK  R+D
Sbjct: 554  PSPVASIGGSRYFVVFIDDYSRYSWIFPMKSRSEILSIYSNFAKMVETQFSKRIKTFRSD 613

Query: 734  NVLEYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFW 793
            N LEY        L   GTI   +CP  SQQNGRAERK RHILD VRALLLSA  P  FW
Sbjct: 614  NALEYTQHAFQXLLHSYGTIHHLTCPGTSQQNGRAERKLRHILDXVRALLLSAKIPAPFW 673

Query: 794  GEAALTSVYTINRLPSSVLQNTSPFEKLYGISPDYSKLKVFGSACFVLL----------- 853
            GEA+L +V+ INR+PS+V+ N +P+E+L+G  P+Y  L+ FGS CFVLL           
Sbjct: 674  GEASLHAVHAINRIPSTVIHNQTPYERLFGSPPNYHHLRSFGSXCFVLLQPHEHNKLEPR 733

Query: 854  ------------HPRARPVSVVS--LAMAPNTKDFVVGTLFPTDSGYLGIPQSFFTNTSV 913
                            R    VS  L ++ N   F    LF   S +    +S  TN+SV
Sbjct: 734  SRLCCFLGYGETQKGYRCYDPVSHRLRVSRNVV-FWEHRLFVELSHF----RSSLTNSSV 793

Query: 914  -DLFP-----LSEPTLDTELAQSSPATANLDPPSVSDD-----------------VPESS 973
             ++FP      S  TLD  L   SP   +  P  V+D+                 +PE  
Sbjct: 794  LEIFPDESLVPSANTLDLHL-DFSPDIFDASPRQVADEQIIHELPHFEPGSPAPALPEDP 853

Query: 974  PAT-PLRRSTRVREPPPHLTDYHCFSTIVSLVEPTSYQEASINPVWQKAMDEELQALEKT 1033
            P   P R STRVR  PPHL DYHC++ + +L EP +Y+EAS +P+WQ AM EEL AL K 
Sbjct: 854  PQDIPPRHSTRVRSIPPHLLDYHCYTALATLHEPQTYREASTDPLWQIAMKEELDALTKN 913

Query: 1034 HTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVAR 1093
            HTWD V LPPG+  +GCKWIYKIKT SDG++ERYKARLVAKG++QEYGIDYEETFAPVAR
Sbjct: 914  HTWDLVTLPPGQSVVGCKWIYKIKTRSDGSVERYKARLVAKGFTQEYGIDYEETFAPVAR 973

Query: 1094 MTSVRSLLAVAAAKQWPLLQMDVKNAFLNGNLSEEVYMKPPQGTSPPPNKVCLLRRALYG 1153
            ++SVR+LLAVAAA++W L QMDVKNAFLNG+LSEEVYM+PP G S   NKVC LRRALYG
Sbjct: 974  ISSVRALLAVAAARKWDLFQMDVKNAFLNGDLSEEVYMQPPPGLSIESNKVCHLRRALYG 1033

Query: 1154 LKQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAIS 1213
            LKQAPRAWFA FSSTI +LG+T+S +D+ALF R+T    +LLLLYVDDMIITG+D   I 
Sbjct: 1034 LKQAPRAWFAKFSSTIFRLGYTASPYDSALFLRRTDKXTILLLLYVDDMIITGDDLSGIQ 1093

Query: 1214 DLQQYLGQHFEMKDLGSLNYFLGLEVSHRSDGYLLSQAKYASDLIARSGITDSTTSSTPL 1273
            +L+ +L Q FEMKDLG L+YFLGLE++H +DG  ++QAKYASDL++++G+TDS T  TP+
Sbjct: 1094 ELKDFLSQQFEMKDLGHLSYFLGLEITHSTDGLYITQAKYASDLLSQAGLTDSKTVDTPV 1153

Query: 1274 DPHVHLTPFDGVPLDDASLYRQLVGSLIYLTVTRPDIAYAVHIVSQFMAAPRTIHFTAVL 1333
            + + HLTP  G PL + SLYR+LVGSL+YLTVTRPDI+YAVH VSQ+++APR+ H+ AVL
Sbjct: 1154 ELNAHLTPLGGKPLSNPSLYRRLVGSLVYLTVTRPDISYAVHQVSQYLSAPRSTHYAAVL 1213

Query: 1334 RILRYVKGTLGHGLQFSSQSSLVLSGYSDADWAGDPTDRRSTTGYCFYLGDSLISWRSKK 1393
            RILRY+KGTL HGL +S+QS L+L  +SDADWAGDPTDRRSTTGYCF LG SLISWRSKK
Sbjct: 1214 RILRYLKGTLFHGLFYSAQSPLILXAFSDADWAGDPTDRRSTTGYCFLLGSSLISWRSKK 1273

Query: 1394 QSVISRSSTESEYRALADATAELIWLRWLLADMGVPQQGPTLLHCDNRSAIQIAHNDVFH 1428
            Q+ ++RSSTE+EYRALAD T+EL+WLRWLL D+GV     T L+CDN+SAI IAHNDVFH
Sbjct: 1274 QTFVARSSTEAEYRALADTTSELLWLRWLLKDLGVSTSSATPLYCDNQSAIHIAHNDVFH 1333

BLAST of CSPI06G14740 vs. TrEMBL
Match: A5BCZ7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_010987 PE=3 SV=1)

HSP 1 Score: 1016.1 bits (2626), Expect = 4.0e-293
Identity = 582/1226 (47.47%), Positives = 767/1226 (62.56%), Query Frame = 1

Query: 254  NEYLAVLQPIWTQLD-------------QANISKDHLRLIKVLMGLRPEYESVRAALLHR 313
            N+Y   L+ IW Q+D             Q    +D  RL + LM L  ++E +R  LL+R
Sbjct: 455  NDYYDQLRFIWDQIDLSYPTWTCSKNAQQYASIRDEFRLYEFLMSLHKDFEPIRGQLLNR 514

Query: 314  NPLPSLDAAIQEILFEEKRLGINSTKQSDVVLAST---YTPNRVANMFCKNCKLSGHKFS 373
            +P PSLD A+ E++ EE RL     +    VLA T     P +  + +      S ++  
Sbjct: 515  SPAPSLDTAVNELVREEARLATLQAQNKFNVLAITPLIEQPQQSGDSYG-----SSNRRK 574

Query: 374  NCPKIECRYCHKHGHILDNCPIRPPRPPGTSTKEKIFTKHGSSSVVAATSDDSSLIQISD 433
               K  C YC + GH ++ C  R       +  E       S+SV + +S  +  +  ++
Sbjct: 575  QTNKKFCNYCKRPGHTIETCYRRNKSTAAVANIEPT-PPMASTSVESKSSGSTINLSSTE 634

Query: 434  LQSLLNQLI------SSSSALAVSSGNR--WLLDSACCNHMTSDVSLMSTSSPTKSLPPI 493
            LQ ++ Q++      S S+AL+V  G    WL  SACCNHMT   SL S   P      I
Sbjct: 635  LQEIIAQVVRMAGNASLSTALSVLPGKSQTWLFYSACCNHMTPHSSLFSKLDPAPHPLNI 694

Query: 494  YAADGNCMNISHTGTIDTPSVHLPHTYCVPNLTFNLVSVGQLCDLGLNVSFSPNGCQVQD 553
            + ADG+ M+ +  G + T ++ +P  + VP             DL  N+      C V  
Sbjct: 695  HIADGSTMHGNSLGFVSTSNLFVPGVFHVP-------------DLSYNL------CSV-- 754

Query: 554  PQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTYQWHLRLGHASSEKLRHLISV 613
               GQ    G +   +F  +   V  P +               R    +  ++  + +V
Sbjct: 755  ---GQLAELGYRF--IFYYSGCIVQDPKT---------------RQELGTGPRVGRMFTV 814

Query: 614  NNLTNLTKFVP--FNCLNCKLAKQPALSF------SQSISNCDKPFDLVHSDIWGPAPIT 673
            +NL +L    P     +   ++  P+L+         S S     F+L+HSD+W P+P+ 
Sbjct: 815  SNL-HLPPVAPVYIAIVAAAVSSLPSLALWHSRLGHASSSRVQHIFELIHSDVWEPSPVA 874

Query: 674  TVHGYRYYVLFIDDYSRFTWIYFLKHRSELSRTYIEFANMIRTQFSSPIKILRTDNVLEY 733
            ++ G RY+V+FIDDYSR++WI+ +K RSE+   Y  FA MI TQFS  IK  R+DN LEY
Sbjct: 875  SIGGSRYFVIFIDDYSRYSWIFPMKSRSEILSIYNNFAKMIETQFSKRIKTFRSDNALEY 934

Query: 734  KDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAAL 793
                  + L   GT+   +CP  SQQNGRAERK RHILD+VRALLLSA  P  FWGEAAL
Sbjct: 935  TQHAFQALLHSYGTVHHLTCPGTSQQNGRAERKLRHILDTVRALLLSAKIPAPFWGEAAL 994

Query: 794  TSVYTINRLPSSVLQNTSPFEKLYGISPDYSKLKVFGSACFVLL----HPRARPVSVVSL 853
             +V+ INR+PS+V+ N +P+E+L+G  P Y  L+ FGSACFVLL    H +  P S +  
Sbjct: 995  HAVHAINRIPSAVIHNQTPYERLFGSPPVYHHLRSFGSACFVLLQSHEHNKLEPRSRLCC 1054

Query: 854  AMA------------PNTKDFVVGTLFPTDSGYLGIPQSFFTNT-------SVDLFPLS- 913
             +             P +    V  +FP +S    +P    TNT       S D+F +S 
Sbjct: 1055 FLGYGETQKGYRCYDPVSHRLRVSQIFPNESL---VPS---TNTFDPPLDFSPDIFDVSP 1114

Query: 914  EPTLDTELAQSSPATANLDP-PSVSDDVPESSPATPLRRSTRVREPPPHLTDYHCFSTIV 973
                D ++    P      P P++ +D P+  P    R STRVR  PPHL DYHC++ + 
Sbjct: 1115 RQVADEQIDDELPHFETRSPAPTLPEDPPQDIPP---RHSTRVRSIPPHLLDYHCYTALA 1174

Query: 974  SLVEPTSYQEASINPVWQKAMDEELQALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDG 1033
            +L EP +Y+EAS +P+WQ AM EEL AL K HTWD V LPPG+  +GCKWIYKIKT SDG
Sbjct: 1175 TLHEPQTYREASTDPLWQIAMKEELDALTKNHTWDLVPLPPGQSVVGCKWIYKIKTRSDG 1234

Query: 1034 TIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNAFLN 1093
            ++ERYKARLVAKG++QEYGIDYEETFAPVAR++SVR+LLAVAAA+QW L QMDVKNAFLN
Sbjct: 1235 SVERYKARLVAKGFTQEYGIDYEETFAPVARISSVRALLAVAAARQWDLFQMDVKNAFLN 1294

Query: 1094 GNLSEEVYMKPPQGTSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNA 1153
            G+LSE VYM+PP G S   NKVC LRRALYGLKQAPRAWFA FSSTI +LG+T+S +D+A
Sbjct: 1295 GDLSEAVYMQPPPGLSVESNKVCHLRRALYGLKQAPRAWFAKFSSTIFRLGYTASPYDSA 1354

Query: 1154 LFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDLGSLNYFLGLEVSHR 1213
            LF R+T    +LLLLYVDDMIIT ND   I +L+ +L Q FEMKDLG L+YFLGLE++H 
Sbjct: 1355 LFLRRTDKDTILLLLYVDDMIITSNDLSGIQELKDFLSQQFEMKDLGHLSYFLGLEITHS 1414

Query: 1214 SDGYLLSQAKYASDLIARSGITDSTTSSTPLDPHVHLTPFDGVPLDDASLYRQLVGSLIY 1273
            +DG  ++QAKYASDL++++G+TDS    TP++ + HLTP  G PL + SLYR+LVGSL+Y
Sbjct: 1415 TDGLYITQAKYASDLLSQAGLTDSKNVDTPVELNAHLTPSGGKPLSNPSLYRRLVGSLVY 1474

Query: 1274 LTVTRPDIAYAVHIVSQFMAAPRTIHFTAVLRILRYVKGTLGHGLQFSSQSSLVLSGYSD 1333
            LTVTRPDI+Y VH VSQ+++APR+ H+  VLRILRY+KGT+ HGL +S+QS LVL  +SD
Sbjct: 1475 LTVTRPDISYVVHQVSQYLSAPRSTHYATVLRILRYLKGTIFHGLFYSAQSPLVLRAFSD 1534

Query: 1334 ADWAGDPTDRRSTTGYCFYLGDSLISWRSKKQSVISRSSTESEYRALADATAELIWLRWL 1393
            ADWAGDPT+RRSTTGYCF LG SLISWRSKKQ+ ++RSSTE+EYRALAD T+EL+WLRWL
Sbjct: 1535 ADWAGDPTNRRSTTGYCFLLGSSLISWRSKKQTFVARSSTEAEYRALADTTSELLWLRWL 1594

Query: 1394 LADMGVPQQGPTLLHCDNRSAIQIAHNDVFHERTKHIENDCHFVRHHLLNNTLLLRSVST 1423
            L D+GV     T L+CDN+SAI IAHNDVF+ERTKHIE +CHF+ +HL++  L L  VS+
Sbjct: 1595 LKDLGVSTSSATPLYCDNQSAIHIAHNDVFYERTKHIEINCHFICYHLVHGALKLFFVSS 1623

BLAST of CSPI06G14740 vs. TrEMBL
Match: A5BN86_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_033646 PE=4 SV=1)

HSP 1 Score: 1007.3 bits (2603), Expect = 1.9e-290
Identity = 551/1048 (52.58%), Positives = 714/1048 (68.13%), Query Frame = 1

Query: 399  SSSVVAATSDDSSLIQISDLQSLLNQLI------SSSSALAVSSGNR--WLLDSACCNHM 458
            S+SV + +S  +  +  ++LQ ++ Q +      S S+AL+V  G    WL DSA CNHM
Sbjct: 3    STSVESKSSGSTINLSSTELQEIIAQAVRMAGNASLSTALSVLPGKSQTWLFDSAYCNHM 62

Query: 459  TSDVSLMSTSSPTKSLPPIYAADGNCMNISHTGTIDTPSVHLPHTYCVPNLTFNLVSVGQ 518
            T   SL S   P      I+ ADG+ M+ +  G + T ++ +P  + VP+L++NL  +GQ
Sbjct: 63   TPHSSLFSKLDPAPHPLHIHIADGSTMHGNSLGFVSTSNLSVPGVFHVPDLSYNLCYMGQ 122

Query: 519  LCDLGLNVSFSPNGCQVQDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTYQ 578
            L +LG     S +G           +GTG +VGR+F +++L +   + IS +   +    
Sbjct: 123  LAELG-----SEDG----------ELGTGPRVGRMFPVSNLHLPPVAPISIATAAAAVSS 182

Query: 579  ------WHLRLGHASSEKLRHLISVNNLTNLTKFVPFNCLNCKLAKQPALSFSQSISNCD 638
                  WH RLGHASS +++ L+S   L  ++K + F C +C+L KQP L F+ S S  +
Sbjct: 183  LPSLALWHSRLGHASSSRVQQLVSRGLLGFVSKDI-FYCTSCQLGKQPTLPFNNSESISN 242

Query: 639  KPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSELSRTYIEFANMIRT 698
              F+L+HSD+WGP+P+ ++ G RY+V+FIDDYSR+ WI+ +K  SE+   Y  FA MI T
Sbjct: 243  SIFELIHSDVWGPSPVASIGGSRYFVVFIDDYSRYIWIFPMKSCSEILSIYSNFAKMIET 302

Query: 699  QFSSPIKILRTDNVLEYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRA 758
            QFS  IK  R+DN LEY      + L   GT+   +CP  SQQNGRAERK RHILD+VRA
Sbjct: 303  QFSKRIKTFRSDNALEYTQHAFQALLHSYGTVHHLTCPGTSQQNGRAERKLRHILDTVRA 362

Query: 759  LLLSASCPEKFWGEAALTSVYTINRLPSSVLQNTSPFEKLYGISPDYSKLKVFGSACFVL 818
            LLLSA  P  FWGEAAL +V+ INR+PS+V+ N +P+E+L+G  P Y  L+ FGSACFVL
Sbjct: 363  LLLSAKIPAPFWGEAALHAVHAINRIPSAVIHNQTPYERLFGSPPVYHHLRSFGSACFVL 422

Query: 819  L----HPRARPVSVVSLAMAPNTKDFVVGTLFPTDSGYLGI-PQSFFTNTSVDLFPLSEP 878
            L    H +  P S +   +              T  GY    P S     S ++    E 
Sbjct: 423  LQSHEHNKLEPRSRLCCFLGYGE----------TQKGYRCYDPVSHRLRVSHNVV-FWEH 482

Query: 879  TLDTELAQSSPATANLDPPSVSDDVPESSPATPLRRSTRVREPPPHLTDYHCFSTIVSLV 938
             L  EL+    +  N    SV +  P+ S    L  ST   +PP    D+       S  
Sbjct: 483  RLFVELSHFRSSLTN---SSVLEIFPDES----LVPSTNTFDPP---LDFSPDIFDASPR 542

Query: 939  EPTSYQEASINPVWQKAMDEELQALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIE 998
            +P +Y+EAS +P+WQ AM EEL AL K HTWD V LPPG+  +GCKWIYKIKT SDG++E
Sbjct: 543  QPQTYREASTDPLWQIAMKEELDALTKNHTWDLVPLPPGQSVVGCKWIYKIKTRSDGSVE 602

Query: 999  RYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNAFLNGNL 1058
            RYKARLVAKG++QEYGIDYEETFAPVAR++SVR+LLAVA A+QW L QMDVKNAFLNG+L
Sbjct: 603  RYKARLVAKGFTQEYGIDYEETFAPVARISSVRALLAVATARQWDLFQMDVKNAFLNGDL 662

Query: 1059 SEEVYMKPPQGTSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALFT 1118
            +E VYM+PP   S   NKVC LRRALYGLKQAPRAWFA FSSTI +LG+T+S +D+ALF 
Sbjct: 663  NEAVYMQPPPSLSVESNKVCHLRRALYGLKQAPRAWFAKFSSTIFRLGYTASPYDSALFL 722

Query: 1119 RQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDLGSLNYFLGLEVSHRSDG 1178
            R+T  G +LLLLYVDDMIITGND   I +L+ +L Q FEMKDLG L+YFLGLE++H +DG
Sbjct: 723  RRTDKGTILLLLYVDDMIITGNDLSGIQELKDFLSQQFEMKDLGHLSYFLGLEITHSTDG 782

Query: 1179 YLLSQAKYASDLIARSGITDSTTSSTPLDPHVHLTPFDGVPLDDASLYRQLVGSLIYLTV 1238
              ++QAKYASDL+++ G+TDS    TP++ + HLTP  G PL + SLYR+LVG+L+YLTV
Sbjct: 783  LYITQAKYASDLLSQVGLTDSKNVDTPVELNAHLTPSRGKPLSNPSLYRRLVGNLVYLTV 842

Query: 1239 TRPDIAYAVHIVSQFMAAPRTIHFTAVLRILRYVKGTLGHGLQFSSQSSLVLSGYSDADW 1298
            TRPDI+YAVH VSQ+++APR+ H+ AVLRILRY+KGT+ HGL +S+QS LVL  +SDADW
Sbjct: 843  TRPDISYAVHQVSQYLSAPRSTHYAAVLRILRYLKGTIFHGLFYSAQSPLVLRAFSDADW 902

Query: 1299 AGDPTDRRSTTGYCFYLGDSLISWRSKKQSVISRSSTESEYRALADATAELIWLRWLLAD 1358
            AGDPTDRRSTTGYCF LG SLISWRSKKQ+ ++RSSTE+EYRALAD T+ELIWLRWLL D
Sbjct: 903  AGDPTDRRSTTGYCFLLGSSLISWRSKKQTFVARSSTEAEYRALADTTSELIWLRWLLKD 962

Query: 1359 MGVPQQGPTLLHCDNRSAIQIAHNDVFHERTKHIENDCHFVRHHLLNNTLLLRSVSTIEQ 1418
            +GV     T L+CDN+SAI IAHNDVFHERTKHI+ DCHF+R+HL++  L L  VS+ +Q
Sbjct: 963  LGVSTSSATPLYCDNQSAIHIAHNDVFHERTKHIKIDCHFIRYHLVHGALKLFFVSSKDQ 1013

Query: 1419 PADIFTKALPSNRFCHLLTKLKLIATLP 1428
             ADIFTK+LP+ R   L+  LKL++  P
Sbjct: 1023 LADIFTKSLPTRRTRDLIDNLKLVSHPP 1013

BLAST of CSPI06G14740 vs. TrEMBL
Match: Q710T7_POPDE (Gag-pol polyprotein OS=Populus deltoides GN=60I2G14 PE=4 SV=1)

HSP 1 Score: 989.9 bits (2558), Expect = 3.1e-285
Identity = 591/1293 (45.71%), Positives = 786/1293 (60.79%), Query Frame = 1

Query: 213  VQLCFTLAVRRLRSDLSQAIFFLFEKSLGIASLSQAVQIRRNEYLAVLQPIWTQLDQANI 272
            +Q  FT +    +  L   I  L +K++ I     A+    ++ LA+ + +  +   A I
Sbjct: 98   LQRLFTQSNFAKQYQLENDIRALHQKNMSIQEFYSAMTDLWDQ-LALTESVELKACGAYI 157

Query: 273  SK-DHLRLIKVLMGLRPEYESVRAALLHRNPLPSLDAAIQEILFEEKRLGINSTK----- 332
             + +  RL++ L  LR ++E +R ++LHR+PLPS+D+ + E+L EE RL   S K     
Sbjct: 158  ERREQQRLVQFLTALRSDFEGLRGSILHRSPLPSVDSVVSELLAEEIRLQSYSEKGILSA 217

Query: 333  QSDVVLASTYTP---------NRVANMFCKNCKLSGHKFSNCPKI-ECRYCHKHGHILDN 392
             +  VLA    P          RV    C  CK  GH  + CPK+ +     K G    +
Sbjct: 218  SNPSVLAVPSKPFSNHQNKPYTRVGFDECSFCKQKGHWKAQCPKLRQQNQAWKSGSQSQS 277

Query: 393  CPIRPP---RPPGTSTKEKIFTKHGSSSVVAATSDDSSLIQISDLQSLLNQLISSSSALA 452
               R P   +PP  +T          +S  + T  ++   Q     SL  Q +S+SS   
Sbjct: 278  NAHRSPQGYKPPHHNTA-------AVASPGSITDPNTLAEQFQKFLSLQPQAMSASSIGQ 337

Query: 453  V---SSG---NRWLLDSACCNHMTSDVSLMSTSSPTKSLPPIYAADGNCMNISHTGTIDT 512
            +   SSG   + W+LDS   +HM+ D S  ++ SP  S+ P+  ADG  M ++  G++ T
Sbjct: 338  LPHSSSGISHSEWVLDSGASHHMSPDSSSFTSVSPLSSI-PVMTADGTPMPLAGVGSVVT 397

Query: 513  PSVHLPHTYCVPNLTFNLVSVGQLCDLG-LNVSFSPNGCQVQDPQTGQTIGTGRKVGRLF 572
              + LP+ Y +P L  NL S+GQ+CD G   V FS + C VQD Q+ + IGTGR+   L+
Sbjct: 398  LHLSLPNVYLIPKLKLNLASIGQICDSGDYLVMFSGSFCCVQDLQSQKLIGTGRRENGLY 457

Query: 573  ELTSLRVSSPSSISASVTD----------SDTYQWHLRLGHASSEKLRHLISVNNLTNLT 632
             L  L+V  P  ++A+  D          S  Y WH RLGH SS +LR L S   L NL 
Sbjct: 458  ILDELKV--PVVVAATTVDLSFFRLSLSSSSFYLWHSRLGHVSSSRLRFLASTGALGNLK 517

Query: 633  KFVPFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDY 692
                 +C  CKLAK  AL F++S S    PFDL+HSD+WGP+P++T  G RYYV FIDD+
Sbjct: 518  TCDISDCSGCKLAKFSALPFNRSTSVSSSPFDLIHSDVWGPSPVSTKGGSRYYVSFIDDH 577

Query: 693  SRFTWIYFLKHRSELSRTYIEFANMIRTQFSSPIKILRTDNVLEYKDSILLSFLSQQGTI 752
            +R+ W+Y +KHRSE    Y  F  +I+TQ S+ IK  R D   EY  +     L+  GTI
Sbjct: 578  TRYCWVYLMKHRSEFFEIYAAFRALIKTQHSAVIKCFRCDLGGEYTSNKFCQMLALDGTI 637

Query: 753  VQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINRLPSSVLQ 812
             Q SC    +QNG AERKHRHI+++ R+LLLSA    +FWGEA LT+V  IN +PSS   
Sbjct: 638  HQTSCTDTPEQNGVAERKHRHIVETARSLLLSAFVLSEFWGEAVLTAVSLINTIPSSHSS 697

Query: 813  NTSPFEKLYGISPDYSKLKVFGSACFVLLHPR-------ARPVSVVSLAMA--------- 872
              SPFEKLYG  PDYS  +VFG   FV LHP        +R    V L            
Sbjct: 698  GLSPFEKLYGHVPDYSSFRVFGCTYFV-LHPHVERNKLSSRSAICVFLGYGEGKKGYRCF 757

Query: 873  -PNTKDFVVG--TLFPTDSGYLGIPQSFFTNTSVDLF---PLSE-------PTLDTELAQ 932
             P T+   V    +F     +  IP +  + T  DL    P SE       P + +    
Sbjct: 758  DPITQKLYVSHHVVFLEHIPFFSIPSTTHSLTKSDLIHIDPFSEDSGNDTSPYVRSICTH 817

Query: 933  SSPATANL----DPPSVSDDVPESSPA---TPLRRSTRVREPPPHLTD--YHCFST---- 992
            +S  T  L       S S   P++S      P R+S R+R+    L D  Y C+S+    
Sbjct: 818  NSAGTGTLLSGTPEASFSSTAPQASSEIVDPPPRQSIRIRK-STKLPDFAYSCYSSSFTS 877

Query: 993  ----IVSLVEPTSYQEASINPVWQKAMDEELQALEKTHTWDYVDLPPGKRPIGCKWIYKI 1052
                I  L EP+SY+EA ++P+ Q+AMDEEL AL KT TWD V LPPGK  +GC+W+YKI
Sbjct: 878  FLAYIHCLFEPSSYKEAILDPLGQQAMDEELSALHKTDTWDLVPLPPGKSVVGCRWVYKI 937

Query: 1053 KTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDV 1112
            KT+SDG+IERYKARLVAKGYSQ+YG+DYEETFAP+A+MT++R+L+AVA+ +QW + Q+DV
Sbjct: 938  KTNSDGSIERYKARLVAKGYSQQYGMDYEETFAPIAKMTTIRTLIAVASIRQWHISQLDV 997

Query: 1113 KNAFLNGNLSEEVYMKPPQGTSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTS 1172
            KNAFLNG+L EEVYM PP G S     VC L++ALYGLKQAPRAWF  FS  I+ LGF S
Sbjct: 998  KNAFLNGDLQEEVYMAPPPGISHDSGYVCKLKKALYGLKQAPRAWFEKFSIVISSLGFVS 1057

Query: 1173 SSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDLGSLNYFLG 1232
            SSHD+ALF + T  G ++L LYVDDMIITG+D   IS L+  L + FEMKDLG L YFLG
Sbjct: 1058 SSHDSALFIKCTDAGRIILSLYVDDMIITGDDIDGISVLKTELARRFEMKDLGYLRYFLG 1117

Query: 1233 LEVSHRSDGYLLSQAKYASDLIARSGITDSTTSSTPLDPHVHLTPFDGVPLDDASLYRQL 1292
            +EV++   GYLLSQ+KY ++++ R+ +TD+ T  TP++ +   +  DG+PL D +LYR +
Sbjct: 1118 IEVAYSPRGYLLSQSKYVANILERARLTDNKTVDTPIEVNARYSSSDGLPLIDPTLYRTI 1177

Query: 1293 VGSLIYLTVTRPDIAYAVHIVSQFMAAPRTIHFTAVLRILRYVKGTLGHGLQFSSQSSLV 1352
            VGSL+YLT+T PDIAYAVH+VSQF+A+P TIH+ AVLRILRY++GT+   L  SS SSL 
Sbjct: 1178 VGSLVYLTITHPDIAYAVHVVSQFVASPTTIHWAAVLRILRYLRGTVFQSLLLSSTSSLE 1237

Query: 1353 LSGYSDADWAGDPTDRRSTTGYCFYLGDSLISWRSKKQSVISRSSTESEYRALADATAEL 1412
            L  YSDAD   DPTDR+S TG+C +LGDSLISW+SKKQS++S+SSTE+EY A+A  T E+
Sbjct: 1238 LRAYSDADHGSDPTDRKSVTGFCIFLGDSLISWKSKKQSIVSQSSTEAEYCAMASTTKEI 1297

Query: 1413 IWLRWLLADMGVPQQGPTLLHCDNRSAIQIAHNDVFHERTKHIENDCHFVRHHLLNNTLL 1424
            +W RWLLADMG+     T ++CDN+S+IQIAHN VFHERTKHIE DCH  RHHL + T+ 
Sbjct: 1298 VWSRWLLADMGISFSHLTPMYCDNQSSIQIAHNSVFHERTKHIEIDCHLTRHHLKHGTIA 1357

BLAST of CSPI06G14740 vs. TAIR10
Match: AT4G23160.1 (AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 499.6 bits (1285), Expect = 6.4e-141
Identity = 268/580 (46.21%), Positives = 364/580 (62.76%), Query Frame = 1

Query: 862  DTELAQSSPATANLDPPSVSDDVPESSPATPLRRSTRVREPPPHLTDYHCFST------- 921
            D + + SS +   +   ++ +DVPE S  T  RR+ +    P +L DY+C S        
Sbjct: 4    DADASTSSSSIDIMPSANIQNDVPEPSVHTSHRRTRK----PAYLQDYYCHSVASLTIHD 63

Query: 922  --------------------IVSLVEPTSYQEASINPVWQKAMDEELQALEKTHTWDYVD 981
                                I    EP++Y EA    VW  AMD+E+ A+E THTW+   
Sbjct: 64   ISQFLSYEKVSPLYHSFLVCIAKAKEPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICT 123

Query: 982  LPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSL 1041
            LPP K+PIGCKW+YKIK +SDGTIERYKARLVAKGY+Q+ GID+ ETF+PV ++TSV+ +
Sbjct: 124  LPPNKKPIGCKWVYKIKYNSDGTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLI 183

Query: 1042 LAVAAAKQWPLLQMDVKNAFLNGNLSEEVYMKPP------QGTSPPPNKVCLLRRALYGL 1101
            LA++A   + L Q+D+ NAFLNG+L EE+YMK P      QG S PPN VC L++++YGL
Sbjct: 184  LAISAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGL 243

Query: 1102 KQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISD 1161
            KQA R WF  FS T+   GF  S  D+  F + T    + +L+YVDD+II  N+  A+ +
Sbjct: 244  KQASRQWFLKFSVTLIGFGFVQSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDE 303

Query: 1162 LQQYLGQHFEMKDLGSLNYFLGLEVSHRSDGYLLSQAKYASDLIARSGITDSTTSSTPLD 1221
            L+  L   F+++DLG L YFLGLE++  + G  + Q KYA DL+  +G+     SS P+D
Sbjct: 304  LKSQLKSCFKLRDLGPLKYFLGLEIARSAAGINICQRKYALDLLDETGLLGCKPSSVPMD 363

Query: 1222 PHVHLTPFDGVPLDDASLYRQLVGSLIYLTVTRPDIAYAVHIVSQFMAAPRTIHFTAVLR 1281
            P V  +   G    DA  YR+L+G L+YL +TR DI++AV+ +SQF  APR  H  AV++
Sbjct: 364  PSVTFSAHSGGDFVDAKAYRRLIGRLMYLQITRLDISFAVNKLSQFSEAPRLAHQQAVMK 423

Query: 1282 ILRYVKGTLGHGLQFSSQSSLVLSGYSDADWAGDPTDRRSTTGYCFYLGDSLISWRSKKQ 1341
            IL Y+KGT+G GL +SSQ+ + L  +SDA +      RRST GYC +LG SLISW+SKKQ
Sbjct: 424  ILHYIKGTVGQGLFYSSQAEMQLQVFSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQ 483

Query: 1342 SVISRSSTESEYRALADATAELIWLRWLLADMGVPQQGPTLLHCDNRSAIQIAHNDVFHE 1401
             V+S+SS E+EYRAL+ AT E++WL     ++ +P   PTLL CDN +AI IA N VFHE
Sbjct: 484  QVVSKSSAEAEYRALSFATDEMMWLAQFFRELQLPLSKPTLLFCDNTAAIHIATNAVFHE 543

Query: 1402 RTKHIENDCHFVRHHLLNNTLLLRSVSTIEQPADIFTKAL 1409
            RTKHIE+DCH VR   +    L  S    ++  D FT+ L
Sbjct: 544  RTKHIESDCHSVRERSVYQATLSYSFQAYDE-QDGFTEYL 578

BLAST of CSPI06G14740 vs. TAIR10
Match: ATMG00810.1 (ATMG00810.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 199.9 bits (507), Expect = 1.1e-50
Identity = 105/224 (46.88%), Positives = 144/224 (64.29%), Query Frame = 1

Query: 1109 LLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDLGSLNYFLGLEVSHRSDGYLLSQAKYA 1168
            LLLYVDD+++TG+    ++ L   L   F MKDLG ++YFLG+++     G  LSQ KYA
Sbjct: 3    LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYA 62

Query: 1169 SDLIARSGITDSTTSSTPLDPHVHLTPFDGVPLDDASLYRQLVGSLIYLTVTRPDIAYAV 1228
              ++  +G+ D    STPL   ++ +        D S +R +VG+L YLT+TRPDI+YAV
Sbjct: 63   EQILNNAGMLDCKPMSTPLPLKLN-SSVSTAKYPDPSDFRSIVGALQYLTLTRPDISYAV 122

Query: 1229 HIVSQFMAAPRTIHFTAVLRILRYVKGTLGHGLQFSSQSSLVLSGYSDADWAGDPTDRRS 1288
            +IV Q M  P    F  + R+LRYVKGT+ HGL     S L +  + D+DWAG  + RRS
Sbjct: 123  NIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRS 182

Query: 1289 TTGYCFYLGDSLISWRSKKQSVISRSSTESEYRALADATAELIW 1333
            TTG+C +LG ++ISW +K+Q  +SRSSTE+EYRALA   AEL W
Sbjct: 183  TTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CSPI06G14740 vs. TAIR10
Match: ATMG00820.1 (ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase))

HSP 1 Score: 105.1 bits (261), Expect = 3.5e-22
Identity = 50/99 (50.51%), Positives = 67/99 (67.68%), Query Frame = 1

Query: 920  EPTSYQEASINPVWQKAMDEELQALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIE 979
            EP S   A  +P W +AM EEL AL +  TW  V  P  +  +GCKW++K K HSDGT++
Sbjct: 27   EPKSVIFALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLD 86

Query: 980  RYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVA 1019
            R KARLVAKG+ QE GI + ET++PV R  ++R++L VA
Sbjct: 87   RLKARLVAKGFHQEEGIYFVETYSPVVRTATIRTILNVA 125

BLAST of CSPI06G14740 vs. TAIR10
Match: ATMG00240.1 (ATMG00240.1 Gag-Pol-related retrotransposon family protein)

HSP 1 Score: 90.1 bits (222), Expect = 1.2e-17
Identity = 41/79 (51.90%), Positives = 57/79 (72.15%), Query Frame = 1

Query: 1215 IYLTVTRPDIAYAVHIVSQFMAAPRTIHFTAVLRILRYVKGTLGHGLQFSSQSSLVLSGY 1274
            +YLT+TRPD+ +AV+ +SQF +A RT    AV ++L YVKGT+G GL +S+ S L L  +
Sbjct: 1    MYLTITRPDLTFAVNRLSQFSSASRTAQMQAVYKVLHYVKGTVGQGLFYSATSDLQLKAF 60

Query: 1275 SDADWAGDPTDRRSTTGYC 1294
            +D+DWA  P  RRS TG+C
Sbjct: 61   ADSDWASCPDTRRSVTGFC 79

BLAST of CSPI06G14740 vs. NCBI nr
Match: gi|1012344626|gb|KYP55818.1| (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 1177.9 bits (3046), Expect = 0.0e+00
Identity = 611/1120 (54.55%), Positives = 767/1120 (68.48%), Query Frame = 1

Query: 274  KDHLRLIKVLMGLRPEYESVRAALLHRNPLPSLDAAIQEILFEEKRLGINSTKQSDVVLA 333
            +D  +LI+ LM L  +YE VRA+LLH+ PLP+L+ A+  +  EE RLG+   K       
Sbjct: 28   RDGTKLIQFLMALTDDYEPVRASLLHQEPLPTLEDALPRLQSEETRLGLLCAKPDMAFAV 87

Query: 334  STYTPNRVANMFCKNCKLSGHKFSNCPKIECRYCHKHGHILDNCPIRPPRPPGTSTKEKI 393
            ST   N     +C NC+ SGH +  CP IEC +C K GHI  NCP R P    T     +
Sbjct: 88   STSKGN-----YCGNCRQSGHVYIECPIIECHHCRKKGHIAPNCPTRDPNRSST-----L 147

Query: 394  FTKHGSSSVVAATSDDSSLIQISDLQSLLNQLISSSSALAVSSGNRWLLDSACCNHMTSD 453
            F ++                       L+  +I   +        RW  DSACCNHMTS 
Sbjct: 148  FCRY---------------------CKLVGHIIDHCN-------TRWYFDSACCNHMTSA 207

Query: 454  VSLMSTSSPTKSLPPIYAADGNCMNISHTGTIDTPSVHLPHTYCVPNLTFNLVSVGQLCD 513
              + S  S    +  I+ ADG+ M +SH G I TPS+ +P  Y +P L FNLVSVGQLCD
Sbjct: 208  SHVFSDLSSRDRISHIHTADGSLMEVSHKGPISTPSLSMPDAYLIPKLNFNLVSVGQLCD 267

Query: 514  LGLNVSFSPNGCQVQDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTYQWHL 573
            LG  ++FS  GC VQDP+TG+ IG GRK+GR+FELT+L V S +++ A+ T S  + WH 
Sbjct: 268  LGYILTFSSTGCSVQDPRTGKIIGNGRKIGRMFELTTLHVPSSNNLCAASTPSSIHLWHQ 327

Query: 574  RLGHASSEKLRHLISVNNLTNLTKFVPFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSD 633
            RLGH S  KLR LIS+ +L ++ K    +C  C+ AKQ AL F+ S S+    FDLVH D
Sbjct: 328  RLGHTSLSKLRPLISMGSLGSI-KEDKLDCTACQTAKQAALPFNDSTSSSVSLFDLVHYD 387

Query: 634  IWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSELSRTYIEFANMIRTQFSSPIKIL 693
            +WGPAP  T+ G RY+++FIDD+SRFTWIY +K RSE+ + YI FA MIRTQFS  IK  
Sbjct: 388  VWGPAPTPTMGGCRYFIIFIDDFSRFTWIYLMKSRSEIPQIYINFATMIRTQFSKCIKTF 447

Query: 694  RTDNVLEYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPE 753
            R DN  EY+DS LL FL++QGT  + SCP  SQQNGRAERKHRHILDS+RA+L+S+SCPE
Sbjct: 448  RRDNASEYRDSKLLHFLAEQGTTSEFSCPGTSQQNGRAERKHRHILDSIRAMLISSSCPE 507

Query: 754  KFWGEAALTSVYTINRLPSSVLQNTSPFEKLYGISPDYSKLKVFGSACFVLLHP----RA 813
            + WGEAALT+VY INRLPSSVL + +PFE+L+G  P Y  L+VFG ACFVLL P    + 
Sbjct: 508  RTWGEAALTAVYVINRLPSSVLGDKTPFERLFGTPPSYESLRVFGCACFVLLQPHEYTKL 567

Query: 814  RPVSVVSLAMAPNTKD--------------------FVVGTLFPTDSGYLGIPQS---FF 873
            +P + +   +   T+                     F    +F + S +  IP +    F
Sbjct: 568  QPRARLCCFLGYGTEHKGYRVWDPISQCIRISRHVVFWEHKMFSSLSTFKSIPSTSTPLF 627

Query: 874  TNTSVDLFPLSEPTLDTELAQSSPATANLDPPSVSDDV-PESSPAT-------PLRRSTR 933
            TN S+DLFP      D +   S   T   D P + +D+ P   PA        P     R
Sbjct: 628  TNPSIDLFP-----HDFDAGSSDELTGASDLPIIPNDLTPAVDPAVQDPALPPPPGLPPR 687

Query: 934  VREPPPHLTDYHCFSTIVSLVEPTSYQEASINPVWQKAMDEELQALEKTHTWDYVDLPPG 993
            VR+PP +L DYH FSTI+S  EP +Y+EAS +P W+++M  ELQALE T+TWD VD PP 
Sbjct: 688  VRKPPSYLHDYHYFSTIMSHYEPQTYREASADPKWRESMQAELQALENTNTWDLVDHPPD 747

Query: 994  KRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVA 1053
            K  + CKW++K+KT+SDG+IERYKARLVA+G++QEYGIDYEETFAPVAR+TS+R+LLA+A
Sbjct: 748  KNLMSCKWVFKVKTYSDGSIERYKARLVARGFTQEYGIDYEETFAPVARLTSLRTLLAIA 807

Query: 1054 AAKQWPLLQMDVKNAFLNGNLSEEVYMKPPQGTSPPPNKVCLLRRALYGLKQAPRAWFAT 1113
            A+K+W + QMDVKNAFLNG+L  EVYM+PP G S PP+KVC LR+ALYGLKQAPR+WFA 
Sbjct: 808  ASKKWFIDQMDVKNAFLNGDLDAEVYMQPPPGYSCPPHKVCRLRKALYGLKQAPRSWFAK 867

Query: 1114 FSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFE 1173
            F  TI QLGFTSS++DNALF R+  HG V+LLLYVDDMIITG+D   IS+L+Q+L  HFE
Sbjct: 868  FHDTIAQLGFTSSTYDNALFIRRNDHGTVILLLYVDDMIITGDDSNGISELKQFLNLHFE 927

Query: 1174 MKDLGSLNYFLGLEVSHRSDGYLLSQAKYASDLIARSGITDSTTSSTPLDPHVHLTPFDG 1233
            MKDLGSL+YFLG+++    DG  LSQAKYASDLI+R+G+TD  T STPL+   H TP DG
Sbjct: 928  MKDLGSLSYFLGIQILSCDDGLFLSQAKYASDLISRAGLTDCKTESTPLETRAHFTPLDG 987

Query: 1234 VPLDDASLYRQLVGSLIYLTVTRPDIAYAVHIVSQFMAAPRTIHFTAVLRILRYVKGTLG 1293
             PL+D +LYRQLVGSLIYLTVTRPDIAYAVH+VSQFM APR+ HF AVLRI+RY+KGT+ 
Sbjct: 988  TPLEDPTLYRQLVGSLIYLTVTRPDIAYAVHLVSQFMCAPRSTHFAAVLRIIRYIKGTIF 1047

Query: 1294 HGLQFSSQSSLVLSGYSDADWAGDPTDRRSTTGYCFYLGDSLISWRSKKQSVISRSSTES 1353
            HGL +S  S L+L  YSDADW GDPTDRRS TG+C +LGDSLISWRSKKQ +++RSSTE+
Sbjct: 1048 HGLHYSVDSPLILRAYSDADWGGDPTDRRSVTGFCIFLGDSLISWRSKKQQLVARSSTEA 1103

Query: 1354 EYRALADATAELIWLRWLLADMGVPQQGPTLLHCDNRSAI 1359
            EYRA+AD T+E++W+RWLL D+G  Q  PT L CDNRSAI
Sbjct: 1108 EYRAMADTTSEIVWIRWLLGDLGFLQSFPTDLFCDNRSAI 1103

BLAST of CSPI06G14740 vs. NCBI nr
Match: gi|147790768|emb|CAN75041.1| (hypothetical protein VITISV_027174 [Vitis vinifera])

HSP 1 Score: 1129.8 bits (2921), Expect = 0.0e+00
Identity = 626/1260 (49.68%), Positives = 818/1260 (64.92%), Query Frame = 1

Query: 254  NEYLAVLQPIWTQLD-------------QANISKDHLRLIKVLMGLRPEYESVRAALLHR 313
            N+Y   L+ IW Q+D             Q    +D  RL + LM L  ++E +R  LL+R
Sbjct: 134  NDYYDQLRFIWDQIDLSDPIWECSKDAQQYASIRDEFRLYEFLMSLHKDFEPIRGQLLNR 193

Query: 314  NPLPSLDAAIQEILFEEKRLGINSTKQSDVVLAST-YTPNRVANMFCKNCKLSGHKFSNC 373
            +  PSLD A+ E++ EE RL     +    +LA T  TP         +   S ++    
Sbjct: 194  SXAPSLDTAVNELVREEARLATLQAQNKLNILAITPSTPLIEQPQQLGDFSGSNNRRKQN 253

Query: 374  PKIECRYCHKHGHILDNCPIR----------PPRPPGTSTKEKIFTKHGSSSVVAATSDD 433
             K  C YC + GH ++ C  R           P PP  ST +       S S +  +S +
Sbjct: 254  NKKFCNYCKRPGHTIETCYRRNKSTATVANTAPTPPTVSTSQS------SGSTINLSSTE 313

Query: 434  SSLIQISDLQSLLNQLISSSSALAVSSGNRWLLDSACCNHMTSDVSLMSTSSPTKSLPPI 493
               I    ++ + N  +S++ ++       WL DSACCNHMT   SL +   P      I
Sbjct: 314  LQEIIAQAVRMVGNASLSTALSVLPGKSQTWLFDSACCNHMTPHSSLFTNLDPAPHPLNI 373

Query: 494  YAADGNCMNISHTGTIDTPSVHLPHTYCVPNLTFNLVSVGQLCDLGLNVSFSPNGCQVQD 553
            + ADG+ M+ +  G + T ++ +P  + VP+L++NL SVGQL +LG  + F  +GC VQD
Sbjct: 374  HIADGSTMHGNSLGFVSTSTLSVPGVFHVPDLSYNLCSVGQLAELGYRLIFXYSGCIVQD 433

Query: 554  PQTGQTIGTGRKVGRLFELTSLRVS--SPSSISASVTDSDTYQ----WHLRLGHASSEKL 613
             +TGQ +GTG +VGR+F + +L +   +P S++A+     +      WH RLGHA S ++
Sbjct: 434  XRTGQELGTGPRVGRMFPVNNLHLPPVAPVSVAAATAAVSSLPSLALWHSRLGHAPSSRV 493

Query: 614  RHLISVNNLTNLTKFV-------PFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSDIWG 673
            + L+S   L ++++ +        F+C +C+L KQPAL F+ S S     F+L+HSD+WG
Sbjct: 494  QQLVSRGLLGSVSRGLLGSVSKDNFDCTSCQLGKQPALPFNNSDSISKSIFELIHSDVWG 553

Query: 674  PAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSELSRTYIEFANMIRTQFSSPIKILRTD 733
            P+P+ ++ G RY+V+FIDDYSR++WI+ +K RSE+   Y  FA M+ TQFS  IK  R+D
Sbjct: 554  PSPVASIGGSRYFVVFIDDYSRYSWIFPMKSRSEILSIYSNFAKMVETQFSKRIKTFRSD 613

Query: 734  NVLEYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFW 793
            N LEY        L   GTI   +CP  SQQNGRAERK RHILD VRALLLSA  P  FW
Sbjct: 614  NALEYTQHAFQXLLHSYGTIHHLTCPGTSQQNGRAERKLRHILDXVRALLLSAKIPAPFW 673

Query: 794  GEAALTSVYTINRLPSSVLQNTSPFEKLYGISPDYSKLKVFGSACFVLL----------- 853
            GEA+L +V+ INR+PS+V+ N +P+E+L+G  P+Y  L+ FGS CFVLL           
Sbjct: 674  GEASLHAVHAINRIPSTVIHNQTPYERLFGSPPNYHHLRSFGSXCFVLLQPHEHNKLEPR 733

Query: 854  ------------HPRARPVSVVS--LAMAPNTKDFVVGTLFPTDSGYLGIPQSFFTNTSV 913
                            R    VS  L ++ N   F    LF   S +    +S  TN+SV
Sbjct: 734  SRLCCFLGYGETQKGYRCYDPVSHRLRVSRNVV-FWEHRLFVELSHF----RSSLTNSSV 793

Query: 914  -DLFP-----LSEPTLDTELAQSSPATANLDPPSVSDD-----------------VPESS 973
             ++FP      S  TLD  L   SP   +  P  V+D+                 +PE  
Sbjct: 794  LEIFPDESLVPSANTLDLHL-DFSPDIFDASPRQVADEQIIHELPHFEPGSPAPALPEDP 853

Query: 974  PAT-PLRRSTRVREPPPHLTDYHCFSTIVSLVEPTSYQEASINPVWQKAMDEELQALEKT 1033
            P   P R STRVR  PPHL DYHC++ + +L EP +Y+EAS +P+WQ AM EEL AL K 
Sbjct: 854  PQDIPPRHSTRVRSIPPHLLDYHCYTALATLHEPQTYREASTDPLWQIAMKEELDALTKN 913

Query: 1034 HTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVAR 1093
            HTWD V LPPG+  +GCKWIYKIKT SDG++ERYKARLVAKG++QEYGIDYEETFAPVAR
Sbjct: 914  HTWDLVTLPPGQSVVGCKWIYKIKTRSDGSVERYKARLVAKGFTQEYGIDYEETFAPVAR 973

Query: 1094 MTSVRSLLAVAAAKQWPLLQMDVKNAFLNGNLSEEVYMKPPQGTSPPPNKVCLLRRALYG 1153
            ++SVR+LLAVAAA++W L QMDVKNAFLNG+LSEEVYM+PP G S   NKVC LRRALYG
Sbjct: 974  ISSVRALLAVAAARKWDLFQMDVKNAFLNGDLSEEVYMQPPPGLSIESNKVCHLRRALYG 1033

Query: 1154 LKQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAIS 1213
            LKQAPRAWFA FSSTI +LG+T+S +D+ALF R+T    +LLLLYVDDMIITG+D   I 
Sbjct: 1034 LKQAPRAWFAKFSSTIFRLGYTASPYDSALFLRRTDKXTILLLLYVDDMIITGDDLSGIQ 1093

Query: 1214 DLQQYLGQHFEMKDLGSLNYFLGLEVSHRSDGYLLSQAKYASDLIARSGITDSTTSSTPL 1273
            +L+ +L Q FEMKDLG L+YFLGLE++H +DG  ++QAKYASDL++++G+TDS T  TP+
Sbjct: 1094 ELKDFLSQQFEMKDLGHLSYFLGLEITHSTDGLYITQAKYASDLLSQAGLTDSKTVDTPV 1153

Query: 1274 DPHVHLTPFDGVPLDDASLYRQLVGSLIYLTVTRPDIAYAVHIVSQFMAAPRTIHFTAVL 1333
            + + HLTP  G PL + SLYR+LVGSL+YLTVTRPDI+YAVH VSQ+++APR+ H+ AVL
Sbjct: 1154 ELNAHLTPLGGKPLSNPSLYRRLVGSLVYLTVTRPDISYAVHQVSQYLSAPRSTHYAAVL 1213

Query: 1334 RILRYVKGTLGHGLQFSSQSSLVLSGYSDADWAGDPTDRRSTTGYCFYLGDSLISWRSKK 1393
            RILRY+KGTL HGL +S+QS L+L  +SDADWAGDPTDRRSTTGYCF LG SLISWRSKK
Sbjct: 1214 RILRYLKGTLFHGLFYSAQSPLILXAFSDADWAGDPTDRRSTTGYCFLLGSSLISWRSKK 1273

Query: 1394 QSVISRSSTESEYRALADATAELIWLRWLLADMGVPQQGPTLLHCDNRSAIQIAHNDVFH 1428
            Q+ ++RSSTE+EYRALAD T+EL+WLRWLL D+GV     T L+CDN+SAI IAHNDVFH
Sbjct: 1274 QTFVARSSTEAEYRALADTTSELLWLRWLLKDLGVSTSSATPLYCDNQSAIHIAHNDVFH 1333

BLAST of CSPI06G14740 vs. NCBI nr
Match: gi|147815260|emb|CAN74430.1| (hypothetical protein VITISV_010987 [Vitis vinifera])

HSP 1 Score: 1016.1 bits (2626), Expect = 5.8e-293
Identity = 582/1226 (47.47%), Positives = 767/1226 (62.56%), Query Frame = 1

Query: 254  NEYLAVLQPIWTQLD-------------QANISKDHLRLIKVLMGLRPEYESVRAALLHR 313
            N+Y   L+ IW Q+D             Q    +D  RL + LM L  ++E +R  LL+R
Sbjct: 455  NDYYDQLRFIWDQIDLSYPTWTCSKNAQQYASIRDEFRLYEFLMSLHKDFEPIRGQLLNR 514

Query: 314  NPLPSLDAAIQEILFEEKRLGINSTKQSDVVLAST---YTPNRVANMFCKNCKLSGHKFS 373
            +P PSLD A+ E++ EE RL     +    VLA T     P +  + +      S ++  
Sbjct: 515  SPAPSLDTAVNELVREEARLATLQAQNKFNVLAITPLIEQPQQSGDSYG-----SSNRRK 574

Query: 374  NCPKIECRYCHKHGHILDNCPIRPPRPPGTSTKEKIFTKHGSSSVVAATSDDSSLIQISD 433
               K  C YC + GH ++ C  R       +  E       S+SV + +S  +  +  ++
Sbjct: 575  QTNKKFCNYCKRPGHTIETCYRRNKSTAAVANIEPT-PPMASTSVESKSSGSTINLSSTE 634

Query: 434  LQSLLNQLI------SSSSALAVSSGNR--WLLDSACCNHMTSDVSLMSTSSPTKSLPPI 493
            LQ ++ Q++      S S+AL+V  G    WL  SACCNHMT   SL S   P      I
Sbjct: 635  LQEIIAQVVRMAGNASLSTALSVLPGKSQTWLFYSACCNHMTPHSSLFSKLDPAPHPLNI 694

Query: 494  YAADGNCMNISHTGTIDTPSVHLPHTYCVPNLTFNLVSVGQLCDLGLNVSFSPNGCQVQD 553
            + ADG+ M+ +  G + T ++ +P  + VP             DL  N+      C V  
Sbjct: 695  HIADGSTMHGNSLGFVSTSNLFVPGVFHVP-------------DLSYNL------CSV-- 754

Query: 554  PQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTYQWHLRLGHASSEKLRHLISV 613
               GQ    G +   +F  +   V  P +               R    +  ++  + +V
Sbjct: 755  ---GQLAELGYRF--IFYYSGCIVQDPKT---------------RQELGTGPRVGRMFTV 814

Query: 614  NNLTNLTKFVP--FNCLNCKLAKQPALSF------SQSISNCDKPFDLVHSDIWGPAPIT 673
            +NL +L    P     +   ++  P+L+         S S     F+L+HSD+W P+P+ 
Sbjct: 815  SNL-HLPPVAPVYIAIVAAAVSSLPSLALWHSRLGHASSSRVQHIFELIHSDVWEPSPVA 874

Query: 674  TVHGYRYYVLFIDDYSRFTWIYFLKHRSELSRTYIEFANMIRTQFSSPIKILRTDNVLEY 733
            ++ G RY+V+FIDDYSR++WI+ +K RSE+   Y  FA MI TQFS  IK  R+DN LEY
Sbjct: 875  SIGGSRYFVIFIDDYSRYSWIFPMKSRSEILSIYNNFAKMIETQFSKRIKTFRSDNALEY 934

Query: 734  KDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAAL 793
                  + L   GT+   +CP  SQQNGRAERK RHILD+VRALLLSA  P  FWGEAAL
Sbjct: 935  TQHAFQALLHSYGTVHHLTCPGTSQQNGRAERKLRHILDTVRALLLSAKIPAPFWGEAAL 994

Query: 794  TSVYTINRLPSSVLQNTSPFEKLYGISPDYSKLKVFGSACFVLL----HPRARPVSVVSL 853
             +V+ INR+PS+V+ N +P+E+L+G  P Y  L+ FGSACFVLL    H +  P S +  
Sbjct: 995  HAVHAINRIPSAVIHNQTPYERLFGSPPVYHHLRSFGSACFVLLQSHEHNKLEPRSRLCC 1054

Query: 854  AMA------------PNTKDFVVGTLFPTDSGYLGIPQSFFTNT-------SVDLFPLS- 913
             +             P +    V  +FP +S    +P    TNT       S D+F +S 
Sbjct: 1055 FLGYGETQKGYRCYDPVSHRLRVSQIFPNESL---VPS---TNTFDPPLDFSPDIFDVSP 1114

Query: 914  EPTLDTELAQSSPATANLDP-PSVSDDVPESSPATPLRRSTRVREPPPHLTDYHCFSTIV 973
                D ++    P      P P++ +D P+  P    R STRVR  PPHL DYHC++ + 
Sbjct: 1115 RQVADEQIDDELPHFETRSPAPTLPEDPPQDIPP---RHSTRVRSIPPHLLDYHCYTALA 1174

Query: 974  SLVEPTSYQEASINPVWQKAMDEELQALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDG 1033
            +L EP +Y+EAS +P+WQ AM EEL AL K HTWD V LPPG+  +GCKWIYKIKT SDG
Sbjct: 1175 TLHEPQTYREASTDPLWQIAMKEELDALTKNHTWDLVPLPPGQSVVGCKWIYKIKTRSDG 1234

Query: 1034 TIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNAFLN 1093
            ++ERYKARLVAKG++QEYGIDYEETFAPVAR++SVR+LLAVAAA+QW L QMDVKNAFLN
Sbjct: 1235 SVERYKARLVAKGFTQEYGIDYEETFAPVARISSVRALLAVAAARQWDLFQMDVKNAFLN 1294

Query: 1094 GNLSEEVYMKPPQGTSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNA 1153
            G+LSE VYM+PP G S   NKVC LRRALYGLKQAPRAWFA FSSTI +LG+T+S +D+A
Sbjct: 1295 GDLSEAVYMQPPPGLSVESNKVCHLRRALYGLKQAPRAWFAKFSSTIFRLGYTASPYDSA 1354

Query: 1154 LFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDLGSLNYFLGLEVSHR 1213
            LF R+T    +LLLLYVDDMIIT ND   I +L+ +L Q FEMKDLG L+YFLGLE++H 
Sbjct: 1355 LFLRRTDKDTILLLLYVDDMIITSNDLSGIQELKDFLSQQFEMKDLGHLSYFLGLEITHS 1414

Query: 1214 SDGYLLSQAKYASDLIARSGITDSTTSSTPLDPHVHLTPFDGVPLDDASLYRQLVGSLIY 1273
            +DG  ++QAKYASDL++++G+TDS    TP++ + HLTP  G PL + SLYR+LVGSL+Y
Sbjct: 1415 TDGLYITQAKYASDLLSQAGLTDSKNVDTPVELNAHLTPSGGKPLSNPSLYRRLVGSLVY 1474

Query: 1274 LTVTRPDIAYAVHIVSQFMAAPRTIHFTAVLRILRYVKGTLGHGLQFSSQSSLVLSGYSD 1333
            LTVTRPDI+Y VH VSQ+++APR+ H+  VLRILRY+KGT+ HGL +S+QS LVL  +SD
Sbjct: 1475 LTVTRPDISYVVHQVSQYLSAPRSTHYATVLRILRYLKGTIFHGLFYSAQSPLVLRAFSD 1534

Query: 1334 ADWAGDPTDRRSTTGYCFYLGDSLISWRSKKQSVISRSSTESEYRALADATAELIWLRWL 1393
            ADWAGDPT+RRSTTGYCF LG SLISWRSKKQ+ ++RSSTE+EYRALAD T+EL+WLRWL
Sbjct: 1535 ADWAGDPTNRRSTTGYCFLLGSSLISWRSKKQTFVARSSTEAEYRALADTTSELLWLRWL 1594

Query: 1394 LADMGVPQQGPTLLHCDNRSAIQIAHNDVFHERTKHIENDCHFVRHHLLNNTLLLRSVST 1423
            L D+GV     T L+CDN+SAI IAHNDVF+ERTKHIE +CHF+ +HL++  L L  VS+
Sbjct: 1595 LKDLGVSTSSATPLYCDNQSAIHIAHNDVFYERTKHIEINCHFICYHLVHGALKLFFVSS 1623

BLAST of CSPI06G14740 vs. NCBI nr
Match: gi|147775172|emb|CAN70361.1| (hypothetical protein VITISV_033646 [Vitis vinifera])

HSP 1 Score: 1007.3 bits (2603), Expect = 2.7e-290
Identity = 551/1048 (52.58%), Positives = 714/1048 (68.13%), Query Frame = 1

Query: 399  SSSVVAATSDDSSLIQISDLQSLLNQLI------SSSSALAVSSGNR--WLLDSACCNHM 458
            S+SV + +S  +  +  ++LQ ++ Q +      S S+AL+V  G    WL DSA CNHM
Sbjct: 3    STSVESKSSGSTINLSSTELQEIIAQAVRMAGNASLSTALSVLPGKSQTWLFDSAYCNHM 62

Query: 459  TSDVSLMSTSSPTKSLPPIYAADGNCMNISHTGTIDTPSVHLPHTYCVPNLTFNLVSVGQ 518
            T   SL S   P      I+ ADG+ M+ +  G + T ++ +P  + VP+L++NL  +GQ
Sbjct: 63   TPHSSLFSKLDPAPHPLHIHIADGSTMHGNSLGFVSTSNLSVPGVFHVPDLSYNLCYMGQ 122

Query: 519  LCDLGLNVSFSPNGCQVQDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTYQ 578
            L +LG     S +G           +GTG +VGR+F +++L +   + IS +   +    
Sbjct: 123  LAELG-----SEDG----------ELGTGPRVGRMFPVSNLHLPPVAPISIATAAAAVSS 182

Query: 579  ------WHLRLGHASSEKLRHLISVNNLTNLTKFVPFNCLNCKLAKQPALSFSQSISNCD 638
                  WH RLGHASS +++ L+S   L  ++K + F C +C+L KQP L F+ S S  +
Sbjct: 183  LPSLALWHSRLGHASSSRVQQLVSRGLLGFVSKDI-FYCTSCQLGKQPTLPFNNSESISN 242

Query: 639  KPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSELSRTYIEFANMIRT 698
              F+L+HSD+WGP+P+ ++ G RY+V+FIDDYSR+ WI+ +K  SE+   Y  FA MI T
Sbjct: 243  SIFELIHSDVWGPSPVASIGGSRYFVVFIDDYSRYIWIFPMKSCSEILSIYSNFAKMIET 302

Query: 699  QFSSPIKILRTDNVLEYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRA 758
            QFS  IK  R+DN LEY      + L   GT+   +CP  SQQNGRAERK RHILD+VRA
Sbjct: 303  QFSKRIKTFRSDNALEYTQHAFQALLHSYGTVHHLTCPGTSQQNGRAERKLRHILDTVRA 362

Query: 759  LLLSASCPEKFWGEAALTSVYTINRLPSSVLQNTSPFEKLYGISPDYSKLKVFGSACFVL 818
            LLLSA  P  FWGEAAL +V+ INR+PS+V+ N +P+E+L+G  P Y  L+ FGSACFVL
Sbjct: 363  LLLSAKIPAPFWGEAALHAVHAINRIPSAVIHNQTPYERLFGSPPVYHHLRSFGSACFVL 422

Query: 819  L----HPRARPVSVVSLAMAPNTKDFVVGTLFPTDSGYLGI-PQSFFTNTSVDLFPLSEP 878
            L    H +  P S +   +              T  GY    P S     S ++    E 
Sbjct: 423  LQSHEHNKLEPRSRLCCFLGYGE----------TQKGYRCYDPVSHRLRVSHNVV-FWEH 482

Query: 879  TLDTELAQSSPATANLDPPSVSDDVPESSPATPLRRSTRVREPPPHLTDYHCFSTIVSLV 938
             L  EL+    +  N    SV +  P+ S    L  ST   +PP    D+       S  
Sbjct: 483  RLFVELSHFRSSLTN---SSVLEIFPDES----LVPSTNTFDPP---LDFSPDIFDASPR 542

Query: 939  EPTSYQEASINPVWQKAMDEELQALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIE 998
            +P +Y+EAS +P+WQ AM EEL AL K HTWD V LPPG+  +GCKWIYKIKT SDG++E
Sbjct: 543  QPQTYREASTDPLWQIAMKEELDALTKNHTWDLVPLPPGQSVVGCKWIYKIKTRSDGSVE 602

Query: 999  RYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNAFLNGNL 1058
            RYKARLVAKG++QEYGIDYEETFAPVAR++SVR+LLAVA A+QW L QMDVKNAFLNG+L
Sbjct: 603  RYKARLVAKGFTQEYGIDYEETFAPVARISSVRALLAVATARQWDLFQMDVKNAFLNGDL 662

Query: 1059 SEEVYMKPPQGTSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALFT 1118
            +E VYM+PP   S   NKVC LRRALYGLKQAPRAWFA FSSTI +LG+T+S +D+ALF 
Sbjct: 663  NEAVYMQPPPSLSVESNKVCHLRRALYGLKQAPRAWFAKFSSTIFRLGYTASPYDSALFL 722

Query: 1119 RQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDLGSLNYFLGLEVSHRSDG 1178
            R+T  G +LLLLYVDDMIITGND   I +L+ +L Q FEMKDLG L+YFLGLE++H +DG
Sbjct: 723  RRTDKGTILLLLYVDDMIITGNDLSGIQELKDFLSQQFEMKDLGHLSYFLGLEITHSTDG 782

Query: 1179 YLLSQAKYASDLIARSGITDSTTSSTPLDPHVHLTPFDGVPLDDASLYRQLVGSLIYLTV 1238
              ++QAKYASDL+++ G+TDS    TP++ + HLTP  G PL + SLYR+LVG+L+YLTV
Sbjct: 783  LYITQAKYASDLLSQVGLTDSKNVDTPVELNAHLTPSRGKPLSNPSLYRRLVGNLVYLTV 842

Query: 1239 TRPDIAYAVHIVSQFMAAPRTIHFTAVLRILRYVKGTLGHGLQFSSQSSLVLSGYSDADW 1298
            TRPDI+YAVH VSQ+++APR+ H+ AVLRILRY+KGT+ HGL +S+QS LVL  +SDADW
Sbjct: 843  TRPDISYAVHQVSQYLSAPRSTHYAAVLRILRYLKGTIFHGLFYSAQSPLVLRAFSDADW 902

Query: 1299 AGDPTDRRSTTGYCFYLGDSLISWRSKKQSVISRSSTESEYRALADATAELIWLRWLLAD 1358
            AGDPTDRRSTTGYCF LG SLISWRSKKQ+ ++RSSTE+EYRALAD T+ELIWLRWLL D
Sbjct: 903  AGDPTDRRSTTGYCFLLGSSLISWRSKKQTFVARSSTEAEYRALADTTSELIWLRWLLKD 962

Query: 1359 MGVPQQGPTLLHCDNRSAIQIAHNDVFHERTKHIENDCHFVRHHLLNNTLLLRSVSTIEQ 1418
            +GV     T L+CDN+SAI IAHNDVFHERTKHI+ DCHF+R+HL++  L L  VS+ +Q
Sbjct: 963  LGVSTSSATPLYCDNQSAIHIAHNDVFHERTKHIKIDCHFIRYHLVHGALKLFFVSSKDQ 1013

Query: 1419 PADIFTKALPSNRFCHLLTKLKLIATLP 1428
             ADIFTK+LP+ R   L+  LKL++  P
Sbjct: 1023 LADIFTKSLPTRRTRDLIDNLKLVSHPP 1013

BLAST of CSPI06G14740 vs. NCBI nr
Match: gi|40644190|emb|CAC95126.1| (gag-pol polyprotein [Populus deltoides])

HSP 1 Score: 989.9 bits (2558), Expect = 4.5e-285
Identity = 591/1293 (45.71%), Positives = 786/1293 (60.79%), Query Frame = 1

Query: 213  VQLCFTLAVRRLRSDLSQAIFFLFEKSLGIASLSQAVQIRRNEYLAVLQPIWTQLDQANI 272
            +Q  FT +    +  L   I  L +K++ I     A+    ++ LA+ + +  +   A I
Sbjct: 98   LQRLFTQSNFAKQYQLENDIRALHQKNMSIQEFYSAMTDLWDQ-LALTESVELKACGAYI 157

Query: 273  SK-DHLRLIKVLMGLRPEYESVRAALLHRNPLPSLDAAIQEILFEEKRLGINSTK----- 332
             + +  RL++ L  LR ++E +R ++LHR+PLPS+D+ + E+L EE RL   S K     
Sbjct: 158  ERREQQRLVQFLTALRSDFEGLRGSILHRSPLPSVDSVVSELLAEEIRLQSYSEKGILSA 217

Query: 333  QSDVVLASTYTP---------NRVANMFCKNCKLSGHKFSNCPKI-ECRYCHKHGHILDN 392
             +  VLA    P          RV    C  CK  GH  + CPK+ +     K G    +
Sbjct: 218  SNPSVLAVPSKPFSNHQNKPYTRVGFDECSFCKQKGHWKAQCPKLRQQNQAWKSGSQSQS 277

Query: 393  CPIRPP---RPPGTSTKEKIFTKHGSSSVVAATSDDSSLIQISDLQSLLNQLISSSSALA 452
               R P   +PP  +T          +S  + T  ++   Q     SL  Q +S+SS   
Sbjct: 278  NAHRSPQGYKPPHHNTA-------AVASPGSITDPNTLAEQFQKFLSLQPQAMSASSIGQ 337

Query: 453  V---SSG---NRWLLDSACCNHMTSDVSLMSTSSPTKSLPPIYAADGNCMNISHTGTIDT 512
            +   SSG   + W+LDS   +HM+ D S  ++ SP  S+ P+  ADG  M ++  G++ T
Sbjct: 338  LPHSSSGISHSEWVLDSGASHHMSPDSSSFTSVSPLSSI-PVMTADGTPMPLAGVGSVVT 397

Query: 513  PSVHLPHTYCVPNLTFNLVSVGQLCDLG-LNVSFSPNGCQVQDPQTGQTIGTGRKVGRLF 572
              + LP+ Y +P L  NL S+GQ+CD G   V FS + C VQD Q+ + IGTGR+   L+
Sbjct: 398  LHLSLPNVYLIPKLKLNLASIGQICDSGDYLVMFSGSFCCVQDLQSQKLIGTGRRENGLY 457

Query: 573  ELTSLRVSSPSSISASVTD----------SDTYQWHLRLGHASSEKLRHLISVNNLTNLT 632
             L  L+V  P  ++A+  D          S  Y WH RLGH SS +LR L S   L NL 
Sbjct: 458  ILDELKV--PVVVAATTVDLSFFRLSLSSSSFYLWHSRLGHVSSSRLRFLASTGALGNLK 517

Query: 633  KFVPFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDY 692
                 +C  CKLAK  AL F++S S    PFDL+HSD+WGP+P++T  G RYYV FIDD+
Sbjct: 518  TCDISDCSGCKLAKFSALPFNRSTSVSSSPFDLIHSDVWGPSPVSTKGGSRYYVSFIDDH 577

Query: 693  SRFTWIYFLKHRSELSRTYIEFANMIRTQFSSPIKILRTDNVLEYKDSILLSFLSQQGTI 752
            +R+ W+Y +KHRSE    Y  F  +I+TQ S+ IK  R D   EY  +     L+  GTI
Sbjct: 578  TRYCWVYLMKHRSEFFEIYAAFRALIKTQHSAVIKCFRCDLGGEYTSNKFCQMLALDGTI 637

Query: 753  VQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINRLPSSVLQ 812
             Q SC    +QNG AERKHRHI+++ R+LLLSA    +FWGEA LT+V  IN +PSS   
Sbjct: 638  HQTSCTDTPEQNGVAERKHRHIVETARSLLLSAFVLSEFWGEAVLTAVSLINTIPSSHSS 697

Query: 813  NTSPFEKLYGISPDYSKLKVFGSACFVLLHPR-------ARPVSVVSLAMA--------- 872
              SPFEKLYG  PDYS  +VFG   FV LHP        +R    V L            
Sbjct: 698  GLSPFEKLYGHVPDYSSFRVFGCTYFV-LHPHVERNKLSSRSAICVFLGYGEGKKGYRCF 757

Query: 873  -PNTKDFVVG--TLFPTDSGYLGIPQSFFTNTSVDLF---PLSE-------PTLDTELAQ 932
             P T+   V    +F     +  IP +  + T  DL    P SE       P + +    
Sbjct: 758  DPITQKLYVSHHVVFLEHIPFFSIPSTTHSLTKSDLIHIDPFSEDSGNDTSPYVRSICTH 817

Query: 933  SSPATANL----DPPSVSDDVPESSPA---TPLRRSTRVREPPPHLTD--YHCFST---- 992
            +S  T  L       S S   P++S      P R+S R+R+    L D  Y C+S+    
Sbjct: 818  NSAGTGTLLSGTPEASFSSTAPQASSEIVDPPPRQSIRIRK-STKLPDFAYSCYSSSFTS 877

Query: 993  ----IVSLVEPTSYQEASINPVWQKAMDEELQALEKTHTWDYVDLPPGKRPIGCKWIYKI 1052
                I  L EP+SY+EA ++P+ Q+AMDEEL AL KT TWD V LPPGK  +GC+W+YKI
Sbjct: 878  FLAYIHCLFEPSSYKEAILDPLGQQAMDEELSALHKTDTWDLVPLPPGKSVVGCRWVYKI 937

Query: 1053 KTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDV 1112
            KT+SDG+IERYKARLVAKGYSQ+YG+DYEETFAP+A+MT++R+L+AVA+ +QW + Q+DV
Sbjct: 938  KTNSDGSIERYKARLVAKGYSQQYGMDYEETFAPIAKMTTIRTLIAVASIRQWHISQLDV 997

Query: 1113 KNAFLNGNLSEEVYMKPPQGTSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTS 1172
            KNAFLNG+L EEVYM PP G S     VC L++ALYGLKQAPRAWF  FS  I+ LGF S
Sbjct: 998  KNAFLNGDLQEEVYMAPPPGISHDSGYVCKLKKALYGLKQAPRAWFEKFSIVISSLGFVS 1057

Query: 1173 SSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDLGSLNYFLG 1232
            SSHD+ALF + T  G ++L LYVDDMIITG+D   IS L+  L + FEMKDLG L YFLG
Sbjct: 1058 SSHDSALFIKCTDAGRIILSLYVDDMIITGDDIDGISVLKTELARRFEMKDLGYLRYFLG 1117

Query: 1233 LEVSHRSDGYLLSQAKYASDLIARSGITDSTTSSTPLDPHVHLTPFDGVPLDDASLYRQL 1292
            +EV++   GYLLSQ+KY ++++ R+ +TD+ T  TP++ +   +  DG+PL D +LYR +
Sbjct: 1118 IEVAYSPRGYLLSQSKYVANILERARLTDNKTVDTPIEVNARYSSSDGLPLIDPTLYRTI 1177

Query: 1293 VGSLIYLTVTRPDIAYAVHIVSQFMAAPRTIHFTAVLRILRYVKGTLGHGLQFSSQSSLV 1352
            VGSL+YLT+T PDIAYAVH+VSQF+A+P TIH+ AVLRILRY++GT+   L  SS SSL 
Sbjct: 1178 VGSLVYLTITHPDIAYAVHVVSQFVASPTTIHWAAVLRILRYLRGTVFQSLLLSSTSSLE 1237

Query: 1353 LSGYSDADWAGDPTDRRSTTGYCFYLGDSLISWRSKKQSVISRSSTESEYRALADATAEL 1412
            L  YSDAD   DPTDR+S TG+C +LGDSLISW+SKKQS++S+SSTE+EY A+A  T E+
Sbjct: 1238 LRAYSDADHGSDPTDRKSVTGFCIFLGDSLISWKSKKQSIVSQSSTEAEYCAMASTTKEI 1297

Query: 1413 IWLRWLLADMGVPQQGPTLLHCDNRSAIQIAHNDVFHERTKHIENDCHFVRHHLLNNTLL 1424
            +W RWLLADMG+     T ++CDN+S+IQIAHN VFHERTKHIE DCH  RHHL + T+ 
Sbjct: 1298 VWSRWLLADMGISFSHLTPMYCDNQSSIQIAHNSVFHERTKHIEIDCHLTRHHLKHGTIA 1357

The following BLAST results are available for this feature:
Match Name     E-value    Identity   Description
POLX_TOBAC     2.5e-102   38.87      Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
COPIA_DROME    3.7e-98    38.63      Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3
M810_ARATH     1.9e-49    46.88      Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg0...
YCH4_YEAST     2.3e-39    34.09      Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT...
YJ41B_YEAST    8.8e-23    23.93      Transposon Ty4-J Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...
Match Name          E-value    Identity   Description
A0A151SM08_CAJCA    0.0e+00    54.55      Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=...
A5BVC1_VITVI        0.0e+00    49.68      Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_027174 PE=4 SV=1
A5BCZ7_VITVI        4.0e-293   47.47      Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_010987 PE=3 SV=1
A5BN86_VITVI        1.9e-290   52.58      Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_033646 PE=4 SV=1
Q710T7_POPDE        3.1e-285   45.71      Gag-pol polyprotein OS=Populus deltoides GN=60I2G14 PE=4 SV=1
Match Name     E-value    Identity   Description
AT4G23160.1    6.4e-141   46.21      cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00810.1    1.1e-50    46.88      ATMG00810.1 DNA/RNA polymerases superfamily protein
ATMG00820.1    3.5e-22    50.51      ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)
ATMG00240.1    1.2e-17    51.90      ATMG00240.1 Gag-Pol-related retrotransposon family protein
Match Name                       E-value    Identity   Description
gi|1012344626|gb|KYP55818.1|     0.0e+00    54.55      Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]
gi|147790768|emb|CAN75041.1|     0.0e+00    49.68      hypothetical protein VITISV_027174 [Vitis vinifera]
gi|147815260|emb|CAN74430.1|     5.8e-293   47.47      hypothetical protein VITISV_010987 [Vitis vinifera]
gi|147775172|emb|CAN70361.1|     2.7e-290   52.58      hypothetical protein VITISV_033646 [Vitis vinifera]
gi|40644190|emb|CAC95126.1|      4.5e-285   45.71      gag-pol polyprotein [Populus deltoides]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR001584    Integrase_cat-core
IPR001878    Znf_CCHC
IPR012337    RNaseH-like_sf
IPR013103    RVT_2
IPR025724    GAG-pre-integrase_dom
Vocabulary: Biological Process
Term          Definition
GO:0015074    DNA integration
Vocabulary: Molecular Function
Term          Definition
GO:0003676    nucleic acid binding
GO:0008270    zinc ion binding
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession   Term Name
biological_process    GO:0015074       DNA integration
cellular_component    GO:0005575       cellular_component
molecular_function    GO:0003676       nucleic acid binding
molecular_function    GO:0000166       nucleotide binding
molecular_function    GO:0008270       zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CSPI06G14740.1    CSPI06G14740.1    mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                                      Source   Source Term        Source Description                   Alignment
IPR001584  Integrase, catalytic core                            PFAM     PF00665            rve                                  coord: 625..740, score: 1.4
IPR001584  Integrase, catalytic core                            PROFILE  PS50994            INTEGRASE                            coord: 622..788, score: 1
IPR001878  Zinc finger, CCHC-type                               GENE3D   G3DSA:4.10.60.10                                        coord: 342..378, score: 8.
IPR001878  Zinc finger, CCHC-type                               SMART    SM00343            c2hcfinal6                           coord: 345..361, score: 0.38; coord: 363..379, score:
IPR001878  Zinc finger, CCHC-type                               unknown  SSF57756           Retrovirus zinc finger-like domains  coord: 338..380, score: 1.
IPR012337  Ribonuclease H-like domain                           GENE3D   G3DSA:3.30.420.10                                       coord: 623..780, score: 7.9
IPR012337  Ribonuclease H-like domain                           unknown  SSF53098           Ribonuclease H-like                  coord: 624..797, score: 2.28
IPR013103  Reverse transcriptase, RNA-dependent DNA polymerase  PFAM     PF07727            RVT_2                                coord: 948..1188, score: 2.5
IPR025724  GAG-pre-integrase domain                             PFAM     PF13976            gag_pre-integrs                      coord: 550..611, score: 2.0
None       No IPR available                                     PANTHER  PTHR11439          GAG-POL-RELATED RETROTRANSPOSON      coord: 851..1348, score: 0.0; coord: 254..392, score: 0.0; coord: 440..812, score:
None       No IPR available                                     unknown  SSF56672           DNA/RNA polymerases                  coord: 1197..1378, score: 1.96E-39; coord: 947..1167, score: 1.96
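
The alignment coordinates in the InterPro table are amino-acid positions on the predicted protein, so sorting them gives the linear order of the annotated domains. The following Python sketch, with hypothetical helper names (`order_domains`, `overlaps`, `layout`) and only the Pfam and SMART coordinates copied from the table, shows one way to list the domains from N-terminus to C-terminus and to confirm that neighbouring hits do not overlap.

```python
# (label, start, end) in amino-acid coordinates, copied from the Pfam and
# SMART rows of the InterPro table; the list is deliberately left unsorted.
DOMAINS = [
    ("RVT_2 (PF07727)", 948, 1188),
    ("rve (PF00665)", 625, 740),
    ("gag_pre-integrs (PF13976)", 550, 611),
    ("c2hcfinal6 (SM00343)", 345, 361),
    ("c2hcfinal6 (SM00343)", 363, 379),
]


def order_domains(domains):
    """Sort domain hits by start coordinate (N-terminus first)."""
    return sorted(domains, key=lambda d: d[1])


def overlaps(a, b):
    """True if two closed intervals share at least one residue."""
    return a[1] <= b[2] and b[1] <= a[2]


def layout(domains):
    """Describe each hit as 'label start..end (length aa)' in linear order."""
    return [
        f"{name} {start}..{end} ({end - start + 1} aa)"
        for name, start, end in order_domains(domains)
    ]


if __name__ == "__main__":
    ordered = order_domains(DOMAINS)
    for line in layout(DOMAINS):
        print(line)
    # Report whether any pair of neighbouring hits overlaps.
    clashes = [(a[0], b[0]) for a, b in zip(ordered, ordered[1:]) if overlaps(a, b)]
    print("overlapping neighbours:", clashes)
```

Only hits with complete coordinates are included; the score values are left out because several of them are truncated in this record.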

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None