CmaCh04G019870 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh04G019870
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Reverse transcriptase
Location: Cma_Chr04: 11717245 .. 11722051 (-)
RNA-Seq Expression: CmaCh04G019870
Synteny: CmaCh04G019870
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTTAACCCTTCTTTGCCTAGTGATTTTGTTGTGCTTTTGCAAGAGTTTGAAGATTTATTTTCCGAGGAGAAGCCTAGTAGTTTGCCACCACTTAGAGGGATTGAACACAAGATTGACTTCATTCCTGGCGCGCCCATTCCAAACCGACCAGCTTATAGGACTAATCCAAAGGAGGCTGAAGAGATACAAAGGCAAGTAAGTGAACTCCTTGCTAAAGGGTATGTACGTGAAAGTTTGAGTCCTTGTTCTGTTCCAGTTATTCTTGTACCTAAGAAAGATGGTTCTTGGCGTATGTGTGTTGATTGTAGGGCTATAAACAAGATAACTATAAAGTATAGGCATCCAATTCCCAGATTAGATGATATGCTTGATGAATTGCATGGATGTAGTCTTTTTACTAAGATTGATTTAAAATCGGGTTATCATCAAATTCGCATGCATATTGGGGATGAGTGGAAAACAGCTTTTAAAACCAAGTATGGTCTTTATGAATGGTTGGTTATGCCTTTTGGATTAACTAATGCACCTAGTACATTCATGAGACTAATGAACCATGTCTTACGAGAATACTTAGGTAAGTTTGTGGTTGTTTATTTTGATGACATCCTTGTTTACTCTAAATCTTTAGATGATCATATTACCCATGTACGCAATGTTTTGACTAGTTTAAGAAACGAATGTTTGTACCTAAATTTAAAGAAATGTAGCTTTTGCATGGAAAAAGTTAACTTTCTTGGGTTTGTAGTTTCATCTAATGGTGTTGAGGTTGACGAGGAGAAAGTGAAGGCTATAAAAGATTGGCCTACACCTAAAAATGTAAGTGAGGTAAGAAGTTTTCATGGTCTTGCAAGTTTCTACCGTAGATTTATTCAGAATTTTAGTACAATTGCTTCACCCTTCAATGAACTTGTTAAGAAAAATGTATCCTTTATATGGGAAAAAGATCAAGAACTTGCTTTTAATACTTTGAAAGAGGTTAATGGGGAGAGACTGTAAATGGCAATATGCCTACGCTGGTTTAAATCCAGCTCGGCCCACTAATTCGCGTATCCCACCATAAAATGATATGTTCCATTTGTCCTTCGCCAGAACTGCTGGATATGAAGGAAGGAATATTAACCCCTCCCTCTTTTTTGATTGATTGCCAGATAGATGGATTATCAATCAATTTGACAATAACGACTATGAATCCATGCTCCTCTTACTAAAGTAACTGTCTCCACGCATATCAATCTATTCCTCGCGTCTCTGTCTGGTAAAATATGCCAATTCGAATCAAAAATCTACATGAAAAGGTTTGAAGGAAATGAAATCAACAACATGTACACTTGATTTCCCCGGGATTGTAGTTCAATTGGTCAGAGCCCTGTCAAGGCGGAAGCTGGGGGGCCCCGTCAGTCAGTCCCGATGGATCCAAATCCAATAAAAACTAATAGGGCAATTTCCATTTCTTCAACGAGTATTGATTGATGGGAGAAAGCCATATGTGGCAACAAACTCTTGTTTCTTTTACTATTGGAAACTATGTTGATGATGTTTTATGTGATGTTGTATCCATGCATGTTGGAGATTTACTACTGGGGAGGCCTTGGCAATTTGATCGTCGGGTAATGTATGATGGGTATGCAAATCGATACTCTTTTACTCACAACAGTAGAAAAACTACTCTTATTCCATTGTCTCCAAAAGATGTATTTATATTGATCATTGCAAACTTGAAAAGAAAAGGCAAGAGGGTGATGCAAAAGAAAAGATTGAAAAAGAATCAAGTGTAAAAAAAAGCTTGAGGGAAAAGCAAGAGAGTAACACTCAGCCTAGAGAAAAAAAAGAGAGAAAAGTCAAATCAGTAAGCTTGTATGTTAGATCAAGTGAGACTAGGAATGTTTTGATCTCTAACCAGACTACTCTTGTACTTATGTACAAGGGATCTTGTTACTTTTCCATGCTTAACCCTTTTTTGCCTAGTGATTTTGTTGTGCTTTTGCAAGAGTTTGAAG
ATTTATTTTCCGAGGAGAAGCCTAGTAGTTTGCCACCACTTAGAGGGATTGAACATAAGATTGACTTCATTCCTGGCGCGCCCATTCCAAACCGACCAGCTTATAGGACTAATCCAAAGGAGGCTGAAGAGATACAAAGGCAAGTAAGTGAACTCCTTGCTAAAGGGTATGTACATGAAAGTTTGAGTCCTTGTTCTGTTCCAGTTATTCTTGTACCTAAGAAAGATGGTTTTTGGCGTATGTGTGTTGATTGTAGGGCTATAAACAAGATAACTATAAAGTATAGGCATCCAATTCCCAGATTAGATGATATGCTTGATGAATTGCATGGATGTAGTCTTTTTACTAAGATTGATTTAAAATCGGGTTATCATCAAATTCGCATGCATATTGGGGATGAGTGGAAAACAGCTTTTAAAACCAAGTATGGTCTTTATGAATGGTTGGTTATGCCTTTTGGATTAACTAATGCACCTAGTACATTCATGAGACTAATGAACCATGTCTTACGAGAATACTTAGGTAAGTTTGTGGTTGTTTATTTTGATGACATCCTTGTTTACTCTAAATCTTTAGATGATCATATTACCCATGTACGCAATGTTTTGACTACTTTAAGAAACGAATGTTTGTACGTAAATTTAAAGAAATGTAGCTTTTGCATGGAAAAAGTTAACTTTCTTGGGTTTGTAGTTTCATCTAATGGTGTTGAGGTTGACGAGGAGAAAGTGAAGGCTATAAAAGATTGGCCTACACCTAAAAATGTAAGTGAGGTAAGAAGTTTTCATGGTCTTGCAAGTTTCTACCGTAGGTTCATTAAGAATTTTAGTACAATTGCTTCACCCTTGAATGAACTTGTTAAGAAAAATGTATCCTTTATATGGGAAAAAGATCAAGAACTTGCTTTTAATACTTTGAAAGAAAAATTGAGTTCTGCTCCCTTGCTTGCATTACCTAATTTTGAGTCTACTTTTGAAATTGAATGTGATGCTAGTGGAGTAGGGATAGGTGCTGTATTAATGCAAAATCAAAGACCTTTGATGTTCTTTAGTGAGAAGTTGACTGGTGCATCTTTGAGGTATCCAACTTATGACAAAGAGCTTTATGCTTTGGTTCGTGCATTGCAAACCTGGCAACATTATCTTTGGCCTAAGGAGTTCATTATTCATACGGATCATGAAAGTTTAAAGCATTTGCGAGTACAAAATAAACTCAACAGACGACATGCTAAGTGGTTAGAATTTATTGAAACATTCCCTTATGTCATAAAATATAAACAAGGTAAGGAGAACATTGTTGCAGATGCATTATCACGAAGGTATGTCCTCCTCAATACTTTGAATGCTAGGTTGTTGGGTTTTGAACACATAAAGGATTTGTATCAACATGACATGTTCTTTGCTCCTTTTGTTGAATCTTGTGAAAAAGGACTCATTGTGGATAATTACTTGTTGTTAGATGGATTTTTGTTCCGAAAAGGCAAACTTTGCATACCATCTTGTTCCATCCGTGAGCTACTTGTGAGGGAAGCTCATGGAGGTGGTTTAATGGCACACCATGGAGTTTCTAAAACTTATGATATGCTCTCTGAACATTTCTTTTGGCCTAAAATGAGACATGATGTTCATAAAGTTTGTGCTCGTTGCATAGCATGTAAACAAGCTAAGTCTAGGCTTCAACCACATGGTTTATACTCCCCATTACCGGTTCCTAATGGTCCTTGGATTGATATATCAATGGATTTTGTTTTAGGTTTACCTAGGACTAGGAAAGGTTATGATAGCATCTTTGTTGTGGTTGATCGATTTAGTAAAATGGCTCATTTTATTCCTTGTCACAAAACTGATGATGCAAAACATATTGCAGACCTGTTCTTTAGGGAAGTTGTACGATTGCATGGCATTCCTAAAAGCATTGTTAGTGATCGTGATGTAAAATTTTTAAGCCACTTTTGGCGTGTTTTATGGGGTAAGTTAGGAACTAAGCTAATATATTCAACT
ACTTGTCATCCTCAAACGGATGGACAAACTGAAGTTGTTAACAGAACCATGACTGCTATGCTTAGGGCTATTATTGATAAGAATCTTAAGACTTGGGAGGTTTGTTTGCCCTTTATAGAATTTGCATATAATAGGGTTGTTCATAGCACTACTAAATGCACACCTTTTGAAATTGTTTATGGCTTTAATCCTTTAACCCCCCTTGACTTGTTGCCCATACCGTCAAAAGAGTTTGTGAATTTTGATGCAAATGCCAAGGTTGAGTTTGTTCATAAACTTCACAAGCAAGTGAAAGAACAAATTGAGAAACAAAATTCCAAGGTTGCTACCCGAATTAATAAAGGGCGTAAGTTTGTCATCTTCAAGCCAGGAGATTGGGTTTGGGTGCATTTCCGAAAAGAAAGATTTCCTACTCAAAGAAAATCTAAGCTTTTACCACGAGGAGATGGACCTTTTCAAGTTCTTGAGCGTATCAACGACAATGCTTATAAAATTGATTTACCAGGTAAGTACGGTGTTAGTACAACTTTTAATGTTGTTGATTTGAGTCCTTTTGATGTAGGTGATGGCTTTGATTCGAGGACGAATCTTTTTCAAGAGGGGGAGAATGATATGAACCACGACCAAGGAATTTCCATACCTGAAGGTCCAATTACAAGGACGAGAGCTAAGAAGCTACAACAAACCTTATACAGTTATATTCAAGCTATGGTGAGCTCATCAAAGGAAATTCTAGAAGACGCTGGAGACCTCCCAATGTTGTGCAAAGTTGAGATTCAAGAAAGAGATGAATTAAATGCACTTTAA

mRNA sequence

ATGCTTAACCCTTCTTTGCCTAGTGATTTTGTTGTGCTTTTGCAAGAGTTTGAAGATTTATTTTCCGAGGAGAAGCCTAGTAGTTTGCCACCACTTAGAGGGATTGAACACAAGATTGACTTCATTCCTGGCGCGCCCATTCCAAACCGACCAGCTTATAGGACTAATCCAAAGGAGGCTGAAGAGATACAAAGGCAAGTAAGTGAACTCCTTGCTAAAGGGGCTATAAACAAGATAACTATAAAGTATAGGCATCCAATTCCCAGATTAGATGATATGCTTGATGAATTGCATGGATGTAGTCTTTTTACTAAGATTGATTTAAAATCGGGTTATCATCAAATTCGCATGCATATTGGGGATGAGTGGAAAACAGCTTTTAAAACCAAGTATGGTCTTTATGAATGGTTGGTTATGCCTTTTGGATTAACTAATGCACCTAGTACATTCATGAGACTAATGAACCATGTCTTACGAGAATACTTAGTTTCATCTAATGGTGTTGAGGTTGACGAGGAGAAAGTGAAGGCTATAAAAGATTGGCCTACACCTAAAAATACTACTCTTGTACTTATGTACAAGGGATCTTGTTACTTTTCCATGCTTAACCCTTTTTTGCCTAGTGATTTTGTTGTGCTTTTGCAAGAGTTTGAAGATTTATTTTCCGAGGAGAAGCCTAGTAGTTTGCCACCACTTAGAGGGATTGAACATAAGATTGACTTCATTCCTGGCGCGCCCATTCCAAACCGACCAGCTTATAGGACTAATCCAAAGGAGGCTGAAGAGATACAAAGGCAAGTAAGTGAACTCCTTGCTAAAGGGGCTATAAACAAGATAACTATAAAGTATAGGCATCCAATTCCCAGATTAGATGATATGCTTGATGAATTGCATGGATGTAGTCTTTTTACTAAGATTGATTTAAAATCGGGTTATCATCAAATTCGCATGCATATTGGGGATGAGTGGAAAACAGCTTTTAAAACCAAGTATGGTCTTTATGAATGGTTGGTTATGCCTTTTGGATTAACTAATGCACCTAGTACATTCATGAGACTAATGAACCATGTCTTACGAGAATACTTAGTTTCATCTAATGGTGTTGAGGTTGACGAGGAGAAAGTGAAGGCTATAAAAGATTGGCCTACACCTAAAAATGTAAGTGAGGTAAGAAGTTTTCATGGTCTTGCAAGTTTCTACCGTAGGTTCATTAAGAATTTTAGTACAATTGCTTCACCCTTGAATGAACTTGTTAAGAAAAATGTATCCTTTATATGGGAAAAAGATCAAGAACTTGCTTTTAATACTTTGAAAGAAAAATTGAGTTCTGCTCCCTTGCTTGCATTACCTAATTTTGAGTCTACTTTTGAAATTGAATGTGATGCTAGTGGAGTAGGGATAGGTGCTGTATTAATGCAAAATCAAAGACCTTTGATGTTCTTTAGTGAGAAGTTGACTGGTGCATCTTTGAGGTATCCAACTTATGACAAAGAGCTTTATGCTTTGGTTCGTGCATTGCAAACCTGGCAACATTATCTTTGGCCTAAGGAGTTCATTATTCATACGGATCATGAAAGTTTAAAGCATTTGCGAGTACAAAATAAACTCAACAGACGACATGCTAAGTGGTTAGAATTTATTGAAACATTCCCTTATGTCATAAAATATAAACAAGGTAAGGAGAACATTGTTGCAGATGCATTATCACGAAGGTATGTCCTCCTCAATACTTTGAATGCTAGGTTGTTGGGTTTTGAACACATAAAGGATTTGTATCAACATGACATGTTCTTTGCTCCTTTTGTTGAATCTTGTGAAAAAGGACTCATTGTGGATAATTACTTGTTGTTAGATGGATTTTTGTTCCGAAAAGGCAAACTTTGCATACCATCTTGTTCCATCCGTGAGCTACTTGTGAGGGAAGCTCATGGAGGTGGTTTAATGGCACACCATGGAGTTTCTAAAACTTATGATATGCTCTCTGAACATTTCTTTTGGCC
TAAAATGAGACATGATGTTCATAAAGTTTGTGCTCGTTGCATAGCATGTAAACAAGCTAAGTCTAGGCTTCAACCACATGGTTTATACTCCCCATTACCGGTTCCTAATGGTCCTTGGATTGATATATCAATGGATTTTGTTTTAGGTTTACCTAGGACTAGGAAAGGTTATGATAGCATCTTTGTTGTGGTTGATCGATTTAGTAAAATGGCTCATTTTATTCCTTGTCACAAAACTGATGATGCAAAACATATTGCAGACCTGTTCTTTAGGGAAGTTGTACGATTGCATGGCATTCCTAAAAGCATTGTTAGTGATCGTGATGTAAAATTTTTAAGCCACTTTTGGCGTGTTTTATGGGGTAAGTTAGGAACTAAGCTAATATATTCAACTACTTGTCATCCTCAAACGGATGGACAAACTGAAGTTGTTAACAGAACCATGACTGCTATGCTTAGGGCTATTATTGATAAGAATCTTAAGACTTGGGAGGTTTGTTTGCCCTTTATAGAATTTGCATATAATAGGGTTGTTCATAGCACTACTAAATGCACACCTTTTGAAATTGTTTATGGCTTTAATCCTTTAACCCCCCTTGACTTGTTGCCCATACCGTCAAAAGAGTTTGTGAATTTTGATGCAAATGCCAAGGTTGAGTTTGTTCATAAACTTCACAAGCAAGTGAAAGAACAAATTGAGAAACAAAATTCCAAGGTTGCTACCCGAATTAATAAAGGGCGTAAGTTTGTCATCTTCAAGCCAGGAGATTGGGTTTGGGTGCATTTCCGAAAAGAAAGATTTCCTACTCAAAGAAAATCTAAGCTTTTACCACGAGGAGATGGACCTTTTCAAGTTCTTGAGCGTATCAACGACAATGCTTATAAAATTGATTTACCAGGTAAGTACGGTGTTAGTACAACTTTTAATGTTGTTGATTTGAGTCCTTTTGATGTAGGTGATGGCTTTGATTCGAGGACGAATCTTTTTCAAGAGGGGGAGAATGATATGAACCACGACCAAGGAATTTCCATACCTGAAGGTCCAATTACAAGGACGAGAGCTAAGAAGCTACAACAAACCTTATACAGTTATATTCAAGCTATGGTGAGCTCATCAAAGGAAATTCTAGAAGACGCTGGAGACCTCCCAATGTTGTGCAAAGTTGAGATTCAAGAAAGAGATGAATTAAATGCACTTTAA

Coding sequence (CDS)

ATGCTTAACCCTTCTTTGCCTAGTGATTTTGTTGTGCTTTTGCAAGAGTTTGAAGATTTATTTTCCGAGGAGAAGCCTAGTAGTTTGCCACCACTTAGAGGGATTGAACACAAGATTGACTTCATTCCTGGCGCGCCCATTCCAAACCGACCAGCTTATAGGACTAATCCAAAGGAGGCTGAAGAGATACAAAGGCAAGTAAGTGAACTCCTTGCTAAAGGGGCTATAAACAAGATAACTATAAAGTATAGGCATCCAATTCCCAGATTAGATGATATGCTTGATGAATTGCATGGATGTAGTCTTTTTACTAAGATTGATTTAAAATCGGGTTATCATCAAATTCGCATGCATATTGGGGATGAGTGGAAAACAGCTTTTAAAACCAAGTATGGTCTTTATGAATGGTTGGTTATGCCTTTTGGATTAACTAATGCACCTAGTACATTCATGAGACTAATGAACCATGTCTTACGAGAATACTTAGTTTCATCTAATGGTGTTGAGGTTGACGAGGAGAAAGTGAAGGCTATAAAAGATTGGCCTACACCTAAAAATACTACTCTTGTACTTATGTACAAGGGATCTTGTTACTTTTCCATGCTTAACCCTTTTTTGCCTAGTGATTTTGTTGTGCTTTTGCAAGAGTTTGAAGATTTATTTTCCGAGGAGAAGCCTAGTAGTTTGCCACCACTTAGAGGGATTGAACATAAGATTGACTTCATTCCTGGCGCGCCCATTCCAAACCGACCAGCTTATAGGACTAATCCAAAGGAGGCTGAAGAGATACAAAGGCAAGTAAGTGAACTCCTTGCTAAAGGGGCTATAAACAAGATAACTATAAAGTATAGGCATCCAATTCCCAGATTAGATGATATGCTTGATGAATTGCATGGATGTAGTCTTTTTACTAAGATTGATTTAAAATCGGGTTATCATCAAATTCGCATGCATATTGGGGATGAGTGGAAAACAGCTTTTAAAACCAAGTATGGTCTTTATGAATGGTTGGTTATGCCTTTTGGATTAACTAATGCACCTAGTACATTCATGAGACTAATGAACCATGTCTTACGAGAATACTTAGTTTCATCTAATGGTGTTGAGGTTGACGAGGAGAAAGTGAAGGCTATAAAAGATTGGCCTACACCTAAAAATGTAAGTGAGGTAAGAAGTTTTCATGGTCTTGCAAGTTTCTACCGTAGGTTCATTAAGAATTTTAGTACAATTGCTTCACCCTTGAATGAACTTGTTAAGAAAAATGTATCCTTTATATGGGAAAAAGATCAAGAACTTGCTTTTAATACTTTGAAAGAAAAATTGAGTTCTGCTCCCTTGCTTGCATTACCTAATTTTGAGTCTACTTTTGAAATTGAATGTGATGCTAGTGGAGTAGGGATAGGTGCTGTATTAATGCAAAATCAAAGACCTTTGATGTTCTTTAGTGAGAAGTTGACTGGTGCATCTTTGAGGTATCCAACTTATGACAAAGAGCTTTATGCTTTGGTTCGTGCATTGCAAACCTGGCAACATTATCTTTGGCCTAAGGAGTTCATTATTCATACGGATCATGAAAGTTTAAAGCATTTGCGAGTACAAAATAAACTCAACAGACGACATGCTAAGTGGTTAGAATTTATTGAAACATTCCCTTATGTCATAAAATATAAACAAGGTAAGGAGAACATTGTTGCAGATGCATTATCACGAAGGTATGTCCTCCTCAATACTTTGAATGCTAGGTTGTTGGGTTTTGAACACATAAAGGATTTGTATCAACATGACATGTTCTTTGCTCCTTTTGTTGAATCTTGTGAAAAAGGACTCATTGTGGATAATTACTTGTTGTTAGATGGATTTTTGTTCCGAAAAGGCAAACTTTGCATACCATCTTGTTCCATCCGTGAGCTACTTGTGAGGGAAGCTCATGGAGGTGGTTTAATGGCACACCATGGAGTTTCTAAAACTTATGATATGCTCTCTGAACATTTCTTTTGGCC
TAAAATGAGACATGATGTTCATAAAGTTTGTGCTCGTTGCATAGCATGTAAACAAGCTAAGTCTAGGCTTCAACCACATGGTTTATACTCCCCATTACCGGTTCCTAATGGTCCTTGGATTGATATATCAATGGATTTTGTTTTAGGTTTACCTAGGACTAGGAAAGGTTATGATAGCATCTTTGTTGTGGTTGATCGATTTAGTAAAATGGCTCATTTTATTCCTTGTCACAAAACTGATGATGCAAAACATATTGCAGACCTGTTCTTTAGGGAAGTTGTACGATTGCATGGCATTCCTAAAAGCATTGTTAGTGATCGTGATGTAAAATTTTTAAGCCACTTTTGGCGTGTTTTATGGGGTAAGTTAGGAACTAAGCTAATATATTCAACTACTTGTCATCCTCAAACGGATGGACAAACTGAAGTTGTTAACAGAACCATGACTGCTATGCTTAGGGCTATTATTGATAAGAATCTTAAGACTTGGGAGGTTTGTTTGCCCTTTATAGAATTTGCATATAATAGGGTTGTTCATAGCACTACTAAATGCACACCTTTTGAAATTGTTTATGGCTTTAATCCTTTAACCCCCCTTGACTTGTTGCCCATACCGTCAAAAGAGTTTGTGAATTTTGATGCAAATGCCAAGGTTGAGTTTGTTCATAAACTTCACAAGCAAGTGAAAGAACAAATTGAGAAACAAAATTCCAAGGTTGCTACCCGAATTAATAAAGGGCGTAAGTTTGTCATCTTCAAGCCAGGAGATTGGGTTTGGGTGCATTTCCGAAAAGAAAGATTTCCTACTCAAAGAAAATCTAAGCTTTTACCACGAGGAGATGGACCTTTTCAAGTTCTTGAGCGTATCAACGACAATGCTTATAAAATTGATTTACCAGGTAAGTACGGTGTTAGTACAACTTTTAATGTTGTTGATTTGAGTCCTTTTGATGTAGGTGATGGCTTTGATTCGAGGACGAATCTTTTTCAAGAGGGGGAGAATGATATGAACCACGACCAAGGAATTTCCATACCTGAAGGTCCAATTACAAGGACGAGAGCTAAGAAGCTACAACAAACCTTATACAGTTATATTCAAGCTATGGTGAGCTCATCAAAGGAAATTCTAGAAGACGCTGGAGACCTCCCAATGTTGTGCAAAGTTGAGATTCAAGAAAGAGATGAATTAAATGCACTTTAA

Protein sequence

MLNPSLPSDFVVLLQEFEDLFSEEKPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEAEEIQRQVSELLAKGAINKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIRMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLREYLVSSNGVEVDEEKVKAIKDWPTPKNTTLVLMYKGSCYFSMLNPFLPSDFVVLLQEFEDLFSEEKPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEAEEIQRQVSELLAKGAINKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIRMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLREYLVSSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASPLNELVKKNVSFIWEKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKELYALVRALQTWQHYLWPKEFIIHTDHESLKHLRVQNKLNRRHAKWLEFIETFPYVIKYKQGKENIVADALSRRYVLLNTLNARLLGFEHIKDLYQHDMFFAPFVESCEKGLIVDNYLLLDGFLFRKGKLCIPSCSIRELLVREAHGGGLMAHHGVSKTYDMLSEHFFWPKMRHDVHKVCARCIACKQAKSRLQPHGLYSPLPVPNGPWIDISMDFVLGLPRTRKGYDSIFVVVDRFSKMAHFIPCHKTDDAKHIADLFFREVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLIYSTTCHPQTDGQTEVVNRTMTAMLRAIIDKNLKTWEVCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPLDLLPIPSKEFVNFDANAKVEFVHKLHKQVKEQIEKQNSKVATRINKGRKFVIFKPGDWVWVHFRKERFPTQRKSKLLPRGDGPFQVLERINDNAYKIDLPGKYGVSTTFNVVDLSPFDVGDGFDSRTNLFQEGENDMNHDQGISIPEGPITRTRAKKLQQTLYSYIQAMVSSSKEILEDAGDLPMLCKVEIQERDELNAL
Homology
BLAST of CmaCh04G019870 vs. ExPASy Swiss-Prot
Match: Q99315 (Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-G PE=1 SV=3)

HSP 1 Score: 452.2 bits (1162), Expect = 1.6e-125
Identity = 298/895 (33.30%), Positives = 434/895 (48.49%), Query Frame = 0

Query: 211  VVLLQEFEDLFSEEKPSSLPPLRGI--EHKIDFIPGAPIPNRPAYRTNPKEAEEIQRQVS 270
            V L Q++ ++   + P     +  I  +H I+  PGA +P    Y    K  +EI + V 
Sbjct: 558  VWLQQKYREIIRNDLPPRPADINNIPVKHDIEIKPGARLPRLQPYHVTEKNEQEINKIVQ 617

Query: 271  ELLAK------------------------------GAINKITIKYRHPIPRLDDMLDELH 330
            +LL                                  +NK TI    P+PR+D++L  + 
Sbjct: 618  KLLDNKFIVPSKSPCSSPVVLVPKKDGTFRLCVDYRTLNKATISDPFPLPRIDNLLSRIG 677

Query: 331  GCSLFTKIDLKSGYHQIRMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVL 390
               +FT +DL SGYHQI M   D +KTAF T  G YE+ VMPFGL NAPSTF R M    
Sbjct: 678  NAQIFTTLDLHSGYHQIPMEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFARYMADTF 737

Query: 391  R----------------------------------------------------EYLVSSN 450
            R                                                    E+L  S 
Sbjct: 738  RDLRFVNVYLDDILIFSESPEEHWKHLDTVLERLKNENLIVKKKKCKFASEETEFLGYSI 797

Query: 451  GVEVD---EEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASPLNELVKKNVS 510
            G++     + K  AI+D+PTPK V + + F G+ ++YRRFI N S IA P+   +     
Sbjct: 798  GIQKIAPLQHKCAAIRDFPTPKTVKQAQRFLGMINYYRRFIPNCSKIAQPIQLFICDKSQ 857

Query: 511  FIWEKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVL--MQNQRPLM-- 570
              W + Q+ A + LK+ L ++P+L   N ++ + +  DAS  GIGAVL  + N+  L+  
Sbjct: 858  --WTEKQDKAIDKLKDALCNSPVLVPFNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGV 917

Query: 571  --FFSEKLTGASLRYPTYDKELYALVRALQTWQHYLWPKEFIIHTDHESLKHLRVQNKLN 630
              +FS+ L  A   YP  + EL  +++AL  +++ L  K F + TDH SL  L+ +N+  
Sbjct: 918  VGYFSKSLESAQKNYPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEPA 977

Query: 631  RRHAKWLEFIETFPYVIKYKQGKENIVADALSRRYVLLNTLNARLLGFE----------- 690
            RR  +WL+ + T+ + ++Y  G +N+VADA+SR    +    +R +  E           
Sbjct: 978  RRVQRWLDDLATYDFTLEYLAGPKNVVADAISRAVYTITPETSRPIDTESWKSYYKSDPL 1037

Query: 691  ------HIKDLYQHDM------FFAPFVESCE-KGLIVDNYLLLDGFLFRKGKLCIPSCS 750
                  H+K+L QH++       F  + +  E       NY L D  ++ + +L +P   
Sbjct: 1038 CSAVLIHMKELTQHNVTPEDMSAFRSYQKKLELSETFRKNYSLEDEMIYYQDRLVVP-IK 1097

Query: 751  IRELLVREAHGGGLM-AHHGVSKTYDMLSEHFFWPKMRHDVHKVCARCIACKQAKS-RLQ 810
             +  ++R  H   L   H GV+ T   +S  ++WPK++H + +    C+ C+  KS R +
Sbjct: 1098 QQNAVMRLYHDHTLFGGHFGVTVTLAKISPIYYWPKLQHSIIQYIRTCVQCQLIKSHRPR 1157

Query: 811  PHGLYSPLPVPNGPWIDISMDFVLGLPRTRKGYDSIFVVVDRFSKMAHFIPCHKTDDAKH 870
             HGL  PLP+  G W+DISMDFV GLP T    + I VVVDRFSK AHFI   KT DA  
Sbjct: 1158 LHGLLQPLPIAEGRWLDISMDFVTGLPPTSNNLNMILVVVDRFSKRAHFIATRKTLDATQ 1217

Query: 871  IADLFFREVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLIYSTTCHPQTDGQTEVV 930
            + DL FR +   HG P++I SDRDV+  +  ++ L  +LG K   S+  HPQTDGQ+E  
Sbjct: 1218 LIDLLFRYIFSYHGFPRTITSDRDVRMTADKYQELTKRLGIKSTMSSANHPQTDGQSERT 1277

Query: 931  NRTMTAMLRAIIDKNLKTWEVCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPLDLLPI 984
             +T+  +LRA    N++ W V LP IEF YN     T   +PFEI  G+ P TP     I
Sbjct: 1278 IQTLNRLLRAYASTNIQNWHVYLPQIEFVYNSTPTRTLGKSPFEIDLGYLPNTP----AI 1337
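
The percentages in an HSP summary line are the match count divided by the alignment length (for this hit, 298 of 895 aligned columns are identical). The following is a minimal sketch of how such a line could be parsed and the percentages recomputed. The function name `parse_hsp_stats` and the regular expression are illustrative assumptions, not part of the database or of any BLAST tool.

```python
import re

# Hypothetical pattern for the "Label = matched/length (xx.xx%)" fields of an
# HSP summary line. It also tolerates the "Postives" spelling seen in some exports.
HSP_STAT = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def parse_hsp_stats(line):
    """Return matched count, alignment length, and percentages per field."""
    stats = {}
    for label, matched, length, reported in HSP_STAT.findall(line):
        key = "Positives" if label.startswith("Pos") else label
        m, n = int(matched), int(length)
        stats[key] = {
            "matched": m,
            "length": n,
            "percent": round(100 * m / n, 2),  # recomputed from the counts
            "reported": float(reported),       # value printed in the line
        }
    return stats


if __name__ == "__main__":
    summary = "Identity = 298/895 (33.30%), Positives = 434/895 (48.49%), Query Frame = 0"
    for field, values in parse_hsp_stats(summary).items():
        print(field, values)
```

Comparing `percent` with `reported` gives a quick consistency check when summary lines have been copied or edited by hand.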

BLAST of CmaCh04G019870 vs. ExPASy Swiss-Prot
Match: Q7LHG5 (Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-I PE=1 SV=2)

HSP 1 Score: 450.3 bits (1157), Expect = 6.1e-125
Identity = 294/877 (33.52%), Positives = 429/877 (48.92%), Query Frame = 0

Query: 211  VVLLQEFEDLFSEEKPSSLPPLRGI--EHKIDFIPGAPIPNRPAYRTNPKEAEEIQRQVS 270
            V L Q++ ++   + P     +  I  +H I+  PGA +P    Y    K  +EI + V 
Sbjct: 584  VWLQQKYREIIRNDLPPRPADINNIPVKHDIEIKPGARLPRLQPYHVTEKNEQEINKIVQ 643

Query: 271  ELLAK------------------------------GAINKITIKYRHPIPRLDDMLDELH 330
            +LL                                  +NK TI    P+PR+D++L  + 
Sbjct: 644  KLLDNKFIVPSKSPCSSPVVLVPKKDGTFRLCVDYRTLNKATISDPFPLPRIDNLLSRIG 703

Query: 331  GCSLFTKIDLKSGYHQIRMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVL 390
               +FT +DL SGYHQI M   D +KTAF T  G YE+ VMPFGL NAPSTF R M    
Sbjct: 704  NAQIFTTLDLHSGYHQIPMEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFARYMADTF 763

Query: 391  R----------------------------------------------------EYLVSSN 450
            R                                                    E+L  S 
Sbjct: 764  RDLRFVNVYLDDILIFSESPEEHWKHLDTVLERLKNENLIVKKKKCKFASEETEFLGYSI 823

Query: 451  GVEVD---EEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASPLNELVKKNVS 510
            G++     + K  AI+D+PTPK V + + F G+ ++YRRFI N S IA P+   +     
Sbjct: 824  GIQKIAPLQHKCAAIRDFPTPKTVKQAQRFLGMINYYRRFIPNCSKIAQPIQLFICDKSQ 883

Query: 511  FIWEKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVL--MQNQRPLM-- 570
              W + Q+ A   LK  L ++P+L   N ++ + +  DAS  GIGAVL  + N+  L+  
Sbjct: 884  --WTEKQDKAIEKLKAALCNSPVLVPFNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGV 943

Query: 571  --FFSEKLTGASLRYPTYDKELYALVRALQTWQHYLWPKEFIIHTDHESLKHLRVQNKLN 630
              +FS+ L  A   YP  + EL  +++AL  +++ L  K F + TDH SL  L+ +N+  
Sbjct: 944  VGYFSKSLESAQKNYPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEPA 1003

Query: 631  RRHAKWLEFIETFPYVIKYKQGKENIVADALSRRYVLLNTLNARLLGFE----------- 690
            RR  +WL+ + T+ + ++Y  G +N+VADA+SR    +    +R +  E           
Sbjct: 1004 RRVQRWLDDLATYDFTLEYLAGPKNVVADAISRAIYTITPETSRPIDTESWKSYYKSDPL 1063

Query: 691  ------HIKDLYQHDM------FFAPFVESCE-KGLIVDNYLLLDGFLFRKGKLCIPSCS 750
                  H+K+L QH++       F  + +  E       NY L D  ++ + +L +P   
Sbjct: 1064 CSAVLIHMKELTQHNVTPEDMSAFRSYQKKLELSETFRKNYSLEDEMIYYQDRLVVP-IK 1123

Query: 751  IRELLVREAHGGGLM-AHHGVSKTYDMLSEHFFWPKMRHDVHKVCARCIACKQAKS-RLQ 810
             +  ++R  H   L   H GV+ T   +S  ++WPK++H + +    C+ C+  KS R +
Sbjct: 1124 QQNAVMRLYHDHTLFGGHFGVTVTLAKISPIYYWPKLQHSIIQYIRTCVQCQLIKSHRPR 1183

Query: 811  PHGLYSPLPVPNGPWIDISMDFVLGLPRTRKGYDSIFVVVDRFSKMAHFIPCHKTDDAKH 870
             HGL  PLP+  G W+DISMDFV GLP T    + I VVVDRFSK AHFI   KT DA  
Sbjct: 1184 LHGLLQPLPIAEGRWLDISMDFVTGLPPTSNNLNMILVVVDRFSKRAHFIATRKTLDATQ 1243

Query: 871  IADLFFREVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLIYSTTCHPQTDGQTEVV 930
            + DL FR +   HG P++I SDRDV+  +  ++ L  +LG K   S+  HPQTDGQ+E  
Sbjct: 1244 LIDLLFRYIFSYHGFPRTITSDRDVRMTADKYQELTKRLGIKSTMSSANHPQTDGQSERT 1303

Query: 931  NRTMTAMLRAIIDKNLKTWEVCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPLDLLPI 966
             +T+  +LRA +  N++ W V LP IEF YN     T   +PFEI  G+ P TP     I
Sbjct: 1304 IQTLNRLLRAYVSTNIQNWHVYLPQIEFVYNSTPTRTLGKSPFEIDLGYLPNTP----AI 1363

BLAST of CmaCh04G019870 vs. ExPASy Swiss-Prot
Match: P0CT41 (Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-12 PE=3 SV=1)

HSP 1 Score: 402.5 bits (1033), Expect = 1.5e-110
Identity = 278/959 (28.99%), Positives = 454/959 (47.34%), Query Frame = 0

Query: 163  VSSNGVEVDEEKVKAIKDWPTPKNTTLVLMYKGSCYFS---------------MLNPFLP 222
            +S NG+ +  E +  +K +  P   +   +Y  +   S               +  P LP
Sbjct: 317  ISLNGISIKTEFL-VVKKFSHPAAISFTTLYDNNIEISSSKHTLSQMNKVSNIVKEPELP 376

Query: 223  SDFVVLLQEFEDLFSEEKPSSLP-PLRGIEHKIDFIP---GAPIPNRPAYRTNPKEAEEI 282
                 + +EF+D+ +E     LP P++G+E +++        PI N   Y   P + + +
Sbjct: 377  D----IYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRN---YPLPPGKMQAM 436

Query: 283  QRQVSELLAKG------AIN--------------KITIKYR----------HPIPRLDDM 342
              ++++ L  G      AIN              ++ + Y+          +P+P ++ +
Sbjct: 437  NDEINQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQL 496

Query: 343  LDELHGCSLFTKIDLKSGYHQIRMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRL 402
            L ++ G ++FTK+DLKS YH IR+  GDE K AF+   G++E+LVMP+G++ AP+ F   
Sbjct: 497  LAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYF 556

Query: 403  MNHVLRE----------------------------------------------------- 462
            +N +L E                                                     
Sbjct: 557  INTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQV 616

Query: 463  ----YLVSSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASPLNE 522
                Y +S  G    +E +  +  W  PKN  E+R F G  ++ R+FI   S +  PLN 
Sbjct: 617  KFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNN 676

Query: 523  LVKKNVSFIWEKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQ- 582
            L+KK+V + W   Q  A   +K+ L S P+L   +F     +E DAS V +GAVL Q   
Sbjct: 677  LLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHD 736

Query: 583  ----RPLMFFSEKLTGASLRYPTYDKELYALVRALQTWQHYLWP--KEFIIHTDHESLKH 642
                 P+ ++S K++ A L Y   DKE+ A++++L+ W+HYL    + F I TDH +L  
Sbjct: 737  DDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIG 796

Query: 643  LRVQNKL---NRRHAKWLEFIETFPYVIKYKQGKENIVADALSR---------------R 702
             R+ N+    N+R A+W  F++ F + I Y+ G  N +ADALSR                
Sbjct: 797  -RITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEPIPKDSEDNS 856

Query: 703  YVLLNTLNARLLGFEHIKDLYQHDMFFAPFVESCEKGLIVDNYLLLDGFLFR-KGKLCIP 762
               +N ++        +   Y +D      + + +K  + +N  L DG L   K ++ +P
Sbjct: 857  INFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDK-RVEENIQLKDGLLINSKDQILLP 916

Query: 763  S-CSIRELLVREAHGGGLMAHHGVSKTYDMLSEHFFWPKMRHDVHKVCARCIACKQAKSR 822
            +   +   ++++ H  G + H G+    +++   F W  +R  + +    C  C+  KSR
Sbjct: 917  NDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSR 976

Query: 823  -LQPHGLYSPLPVPNGPWIDISMDFVLGLPRTRKGYDSIFVVVDRFSKMAHFIPCHKTDD 882
              +P+G   P+P    PW  +SMDF+  LP +  GY+++FVVVDRFSKMA  +PC K+  
Sbjct: 977  NHKPYGPLQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILVPCTKSIT 1036

Query: 883  AKHIADLFFREVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLIYSTTCHPQTDGQT 942
            A+  A +F + V+   G PK I++D D  F S  W+    K    + +S    PQTDGQT
Sbjct: 1037 AEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQT 1096

Query: 943  EVVNRTMTAMLRAIIDKNLKTWEVCLPFIEFAYNRVVHSTTKCTPFEIVYGFNP-LTPLD 984
            E  N+T+  +LR +   +  TW   +  ++ +YN  +HS T+ TPFEIV+ ++P L+PL+
Sbjct: 1097 ERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLE 1156

BLAST of CmaCh04G019870 vs. ExPASy Swiss-Prot
Match: P0CT34 (Transposon Tf2-1 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-1 PE=3 SV=1)

HSP 1 Score: 402.5 bits (1033), Expect = 1.5e-110
Identity = 278/959 (28.99%), Positives = 454/959 (47.34%), Query Frame = 0

Query: 163  VSSNGVEVDEEKVKAIKDWPTPKNTTLVLMYKGSCYFS---------------MLNPFLP 222
            +S NG+ +  E +  +K +  P   +   +Y  +   S               +  P LP
Sbjct: 317  ISLNGISIKTEFL-VVKKFSHPAAISFTTLYDNNIEISSSKHTLSQMNKVSNIVKEPELP 376

Query: 223  SDFVVLLQEFEDLFSEEKPSSLP-PLRGIEHKIDFIP---GAPIPNRPAYRTNPKEAEEI 282
                 + +EF+D+ +E     LP P++G+E +++        PI N   Y   P + + +
Sbjct: 377  D----IYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRN---YPLPPGKMQAM 436

Query: 283  QRQVSELLAKG------AIN--------------KITIKYR----------HPIPRLDDM 342
              ++++ L  G      AIN              ++ + Y+          +P+P ++ +
Sbjct: 437  NDEINQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQL 496

Query: 343  LDELHGCSLFTKIDLKSGYHQIRMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRL 402
            L ++ G ++FTK+DLKS YH IR+  GDE K AF+   G++E+LVMP+G++ AP+ F   
Sbjct: 497  LAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYF 556

Query: 403  MNHVLRE----------------------------------------------------- 462
            +N +L E                                                     
Sbjct: 557  INTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQV 616

Query: 463  ----YLVSSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASPLNE 522
                Y +S  G    +E +  +  W  PKN  E+R F G  ++ R+FI   S +  PLN 
Sbjct: 617  KFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNN 676

Query: 523  LVKKNVSFIWEKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQ- 582
            L+KK+V + W   Q  A   +K+ L S P+L   +F     +E DAS V +GAVL Q   
Sbjct: 677  LLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHD 736

Query: 583  ----RPLMFFSEKLTGASLRYPTYDKELYALVRALQTWQHYLWP--KEFIIHTDHESLKH 642
                 P+ ++S K++ A L Y   DKE+ A++++L+ W+HYL    + F I TDH +L  
Sbjct: 737  DDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIG 796

Query: 643  LRVQNKL---NRRHAKWLEFIETFPYVIKYKQGKENIVADALSR---------------R 702
             R+ N+    N+R A+W  F++ F + I Y+ G  N +ADALSR                
Sbjct: 797  -RITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEPIPKDSEDNS 856

Query: 703  YVLLNTLNARLLGFEHIKDLYQHDMFFAPFVESCEKGLIVDNYLLLDGFLFR-KGKLCIP 762
               +N ++        +   Y +D      + + +K  + +N  L DG L   K ++ +P
Sbjct: 857  INFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDK-RVEENIQLKDGLLINSKDQILLP 916

Query: 763  S-CSIRELLVREAHGGGLMAHHGVSKTYDMLSEHFFWPKMRHDVHKVCARCIACKQAKSR 822
            +   +   ++++ H  G + H G+    +++   F W  +R  + +    C  C+  KSR
Sbjct: 917  NDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSR 976

Query: 823  -LQPHGLYSPLPVPNGPWIDISMDFVLGLPRTRKGYDSIFVVVDRFSKMAHFIPCHKTDD 882
              +P+G   P+P    PW  +SMDF+  LP +  GY+++FVVVDRFSKMA  +PC K+  
Sbjct: 977  NHKPYGPLQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILVPCTKSIT 1036

Query: 883  AKHIADLFFREVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLIYSTTCHPQTDGQT 942
            A+  A +F + V+   G PK I++D D  F S  W+    K    + +S    PQTDGQT
Sbjct: 1037 AEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQT 1096

Query: 943  EVVNRTMTAMLRAIIDKNLKTWEVCLPFIEFAYNRVVHSTTKCTPFEIVYGFNP-LTPLD 984
            E  N+T+  +LR +   +  TW   +  ++ +YN  +HS T+ TPFEIV+ ++P L+PL+
Sbjct: 1097 ERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLE 1156

BLAST of CmaCh04G019870 vs. ExPASy Swiss-Prot
Match: P0CT35 (Transposon Tf2-2 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-2 PE=3 SV=1)

HSP 1 Score: 402.5 bits (1033), Expect = 1.5e-110
Identity = 278/959 (28.99%), Positives = 454/959 (47.34%), Query Frame = 0

Query: 163  VSSNGVEVDEEKVKAIKDWPTPKNTTLVLMYKGSCYFS---------------MLNPFLP 222
            +S NG+ +  E +  +K +  P   +   +Y  +   S               +  P LP
Sbjct: 317  ISLNGISIKTEFL-VVKKFSHPAAISFTTLYDNNIEISSSKHTLSQMNKVSNIVKEPELP 376

Query: 223  SDFVVLLQEFEDLFSEEKPSSLP-PLRGIEHKIDFIP---GAPIPNRPAYRTNPKEAEEI 282
                 + +EF+D+ +E     LP P++G+E +++        PI N   Y   P + + +
Sbjct: 377  D----IYKEFKDITAETNTEKLPKPIKGLEFEVELTQENYRLPIRN---YPLPPGKMQAM 436

Query: 283  QRQVSELLAKG------AIN--------------KITIKYR----------HPIPRLDDM 342
              ++++ L  G      AIN              ++ + Y+          +P+P ++ +
Sbjct: 437  NDEINQGLKSGIIRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQL 496

Query: 343  LDELHGCSLFTKIDLKSGYHQIRMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRL 402
            L ++ G ++FTK+DLKS YH IR+  GDE K AF+   G++E+LVMP+G++ AP+ F   
Sbjct: 497  LAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYF 556

Query: 403  MNHVLRE----------------------------------------------------- 462
            +N +L E                                                     
Sbjct: 557  INTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQV 616

Query: 463  ----YLVSSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASPLNE 522
                Y +S  G    +E +  +  W  PKN  E+R F G  ++ R+FI   S +  PLN 
Sbjct: 617  KFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNN 676

Query: 523  LVKKNVSFIWEKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQ- 582
            L+KK+V + W   Q  A   +K+ L S P+L   +F     +E DAS V +GAVL Q   
Sbjct: 677  LLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHD 736

Query: 583  ----RPLMFFSEKLTGASLRYPTYDKELYALVRALQTWQHYLWP--KEFIIHTDHESLKH 642
                 P+ ++S K++ A L Y   DKE+ A++++L+ W+HYL    + F I TDH +L  
Sbjct: 737  DDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIG 796

Query: 643  LRVQNKL---NRRHAKWLEFIETFPYVIKYKQGKENIVADALSR---------------R 702
             R+ N+    N+R A+W  F++ F + I Y+ G  N +ADALSR                
Sbjct: 797  -RITNESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEPIPKDSEDNS 856

Query: 703  YVLLNTLNARLLGFEHIKDLYQHDMFFAPFVESCEKGLIVDNYLLLDGFLFR-KGKLCIP 762
               +N ++        +   Y +D      + + +K  + +N  L DG L   K ++ +P
Sbjct: 857  INFVNQISITDDFKNQVVTEYTNDTKLLNLLNNEDK-RVEENIQLKDGLLINSKDQILLP 916

Query: 763  S-CSIRELLVREAHGGGLMAHHGVSKTYDMLSEHFFWPKMRHDVHKVCARCIACKQAKSR 822
            +   +   ++++ H  G + H G+    +++   F W  +R  + +    C  C+  KSR
Sbjct: 917  NDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSR 976

Query: 823  -LQPHGLYSPLPVPNGPWIDISMDFVLGLPRTRKGYDSIFVVVDRFSKMAHFIPCHKTDD 882
              +P+G   P+P    PW  +SMDF+  LP +  GY+++FVVVDRFSKMA  +PC K+  
Sbjct: 977  NHKPYGPLQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILVPCTKSIT 1036

Query: 883  AKHIADLFFREVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLIYSTTCHPQTDGQT 942
            A+  A +F + V+   G PK I++D D  F S  W+    K    + +S    PQTDGQT
Sbjct: 1037 AEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQT 1096

Query: 943  EVVNRTMTAMLRAIIDKNLKTWEVCLPFIEFAYNRVVHSTTKCTPFEIVYGFNP-LTPLD 984
            E  N+T+  +LR +   +  TW   +  ++ +YN  +HS T+ TPFEIV+ ++P L+PL+
Sbjct: 1097 ERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLE 1156

BLAST of CmaCh04G019870 vs. ExPASy TrEMBL
Match: A0A2N9G0F9 (Reverse transcriptase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS20920 PE=4 SV=1)

HSP 1 Score: 1271.9 bits (3290), Expect = 0.0e+00
Identity = 616/942 (65.39%), Positives = 723/942 (76.75%), Query Frame = 0

Query: 190  VLMYKGSCY-FSMLNPFLPSDFVVLLQEFEDLFSEEKPSSLPPLRGIEHKIDFIPGAPIP 249
            VL+YK +C+  + L+  LPS  V LLQE+ED+F  + PS LPP+RGIEH+IDF+PGA IP
Sbjct: 613  VLLYKEACFNTNELDESLPSVVVSLLQEYEDVFPNDVPSGLPPIRGIEHQIDFVPGATIP 672

Query: 250  NRPAYRTNPKEAEEIQRQVSELLAKG------------------------------AINK 309
            NRPAYR+NP+E +E+QRQV ELLAKG                              AIN 
Sbjct: 673  NRPAYRSNPEETKELQRQVEELLAKGHVRESMSPCAVPVLLVPKKDGTWRMCVDCRAINN 732

Query: 310  ITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIRMHIGDEWKTAFKTKYGLYEWLV 369
            IT+KYRHPIPRLDDMLDELHG  +FTKIDLKSGYHQIRM  GDEWKTAFKTKYGLYEWLV
Sbjct: 733  ITVKYRHPIPRLDDMLDELHGSCIFTKIDLKSGYHQIRMKEGDEWKTAFKTKYGLYEWLV 792

Query: 370  MPFGLTNAPSTFMRLMNHVLREYL------------------------------------ 429
            MPFGLTNAPSTFMRLMNH LR +L                                    
Sbjct: 793  MPFGLTNAPSTFMRLMNHALRAFLGRFVVVYFDDILVYSKSLDEHIDHLHCVLTVLRKEK 852

Query: 430  ---------------------VSSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYR 489
                                 V + G+ VDEEKVKAIK+WPTPK+++EVRSFHGLASFYR
Sbjct: 853  LYANLKKCSFCLDKVVFLGFVVGAKGIAVDEEKVKAIKEWPTPKSITEVRSFHGLASFYR 912

Query: 490  RFIKNFSTIASPLNELVKKNVSFIWEKDQELAFNTLKEKLSSAPLLALPNFESTFEIECD 549
            RF+K+FST+A+PL E+VKK+V F W  +Q+ AF  +KE+L  APLLALP+F  TFEIECD
Sbjct: 913  RFVKDFSTLAAPLTEIVKKSVGFKWGSEQDRAFIEIKERLCGAPLLALPDFSKTFEIECD 972

Query: 550  ASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKELYALVRALQTWQHYLWPKEFIIH 609
            ASG+GIGAVLMQ +RP+ +FSEKL GA+L YPTYDKELYALVRAL+TWQHYLWPKEF+IH
Sbjct: 973  ASGIGIGAVLMQEKRPIAYFSEKLNGAALNYPTYDKELYALVRALETWQHYLWPKEFVIH 1032

Query: 610  TDHESLKHLRVQNKLNRRHAKWLEFIETFPYVIKYKQGKENIVADALSRRYVLLNTLNAR 669
            TDHESLKHL+ Q KLNRRHA+W+EFIETFPYVIKYKQGKENIVADALSRRY L++TLNA+
Sbjct: 1033 TDHESLKHLKGQGKLNRRHAQWMEFIETFPYVIKYKQGKENIVADALSRRYALISTLNAK 1092

Query: 670  LLGFEHIKDLYQHDMFFAPFVESCEKGLIVDNYLLLDGFLFRKGKLCIPSCSIRELLVRE 729
            LLGFE++K+LY +D  FA    +CEK      +  LDG+LFR+ +LC+P+ S+RELLVRE
Sbjct: 1093 LLGFEYVKELYVNDDDFASVFAACEKAAF-GKFYRLDGYLFRENRLCVPNSSMRELLVRE 1152

Query: 730  AHGGGLMAHHGVSKTYDMLSEHFFWPKMRHDVHKVCARCIACKQAKSRLQPHGLYSPLPV 789
            AHGGGLM H GV KT D+L EHFFWPKM+ DV +VC+RC+ C+QAKSR+ PHGLY+PLPV
Sbjct: 1153 AHGGGLMGHFGVRKTLDVLHEHFFWPKMKRDVERVCSRCVTCRQAKSRVLPHGLYTPLPV 1212

Query: 790  PNGPWIDISMDFVLGLPRTRKGYDSIFVVVDRFSKMAHFIPCHKTDDAKHIADLFFREVV 849
            P+ PW+DISMDFVLGLPR+RKG DSIFVVVDRFSKMAHFI CHKTDDA HIADLFFRE+V
Sbjct: 1213 PSAPWVDISMDFVLGLPRSRKGRDSIFVVVDRFSKMAHFISCHKTDDATHIADLFFREIV 1272

Query: 850  RLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLIYSTTCHPQTDGQTEVVNRTMTAMLRA 909
            RLHG+P+SIVSDRDVKFLS+FW+VLWGKLGTKL++STTCHPQTDGQTEVVNRT++ +LR 
Sbjct: 1273 RLHGVPRSIVSDRDVKFLSYFWKVLWGKLGTKLLFSTTCHPQTDGQTEVVNRTLSTLLRT 1332

Query: 910  IIDKNLKTWEVCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPLDLLPIPSKEFVNFDA 969
            II KNLK WE CLPFIEFAYNR VHSTT  +PFEIVYGFNPLTPLDLLP+P  E  + D 
Sbjct: 1333 IIQKNLKNWEDCLPFIEFAYNRSVHSTTDFSPFEIVYGFNPLTPLDLLPLPVNERTSLDG 1392

Query: 970  NAKVEFVHKLHKQVKEQIEKQNSKVATRINKGRKFVIFKPGDWVWVHFRKERFPTQRKSK 1029
              K E V KLH+ V++ IEK+N + A + NKGR+ VIF+PGDWVWVH RKERFP +R+SK
Sbjct: 1393 QKKAEMVKKLHESVRQHIEKKNEQYANKANKGRRQVIFEPGDWVWVHMRKERFPARRRSK 1452

Query: 1030 LLPRGDGPFQVLERINDNAYKIDLPGKYGVSTTFNVVDLSPFDVGDGFDSRTNLFQEGEN 1040
            L PRGDGPFQVLERINDNAYK+DLPG+Y +S TFNV DLS FDVGD  DSR+N F+E  N
Sbjct: 1453 LHPRGDGPFQVLERINDNAYKLDLPGEYNISATFNVSDLSLFDVGD--DSRSNPFEERGN 1512

BLAST of CmaCh04G019870 vs. ExPASy TrEMBL
Match: A0A2N9HBD3 (Reverse transcriptase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS12373 PE=4 SV=1)

HSP 1 Score: 1271.9 bits (3290), Expect = 0.0e+00
Identity = 615/942 (65.29%), Positives = 723/942 (76.75%), Query Frame = 0

Query: 190  VLMYKGSCY-FSMLNPFLPSDFVVLLQEFEDLFSEEKPSSLPPLRGIEHKIDFIPGAPIP 249
            VL+YK +C+  + L+  LPS  + LLQE+ED+F  + PS LPP+RGIEH+IDF+PGA IP
Sbjct: 613  VLLYKEACFNTNELDESLPSVVISLLQEYEDVFPNDVPSGLPPIRGIEHQIDFVPGATIP 672

Query: 250  NRPAYRTNPKEAEEIQRQVSELLAKG------------------------------AINK 309
            NRPAYR+NP+E +E+QRQV ELLAKG                              AIN 
Sbjct: 673  NRPAYRSNPEETKELQRQVEELLAKGHVRESMSPCAVPVLLVPKKDGTWRMCVDCRAINN 732

Query: 310  ITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIRMHIGDEWKTAFKTKYGLYEWLV 369
            IT+KYRHPIPRLDDMLDELHG  +FTKIDLKSGYHQIRM  GDEWKTAFKTKYGLYEWLV
Sbjct: 733  ITVKYRHPIPRLDDMLDELHGSCIFTKIDLKSGYHQIRMKEGDEWKTAFKTKYGLYEWLV 792

Query: 370  MPFGLTNAPSTFMRLMNHVLREYL------------------------------------ 429
            MPFGLTNAPSTFMRLMNH LR +L                                    
Sbjct: 793  MPFGLTNAPSTFMRLMNHALRAFLGRFVVVYFDDILVYSKSLDEHIDHLHCVLTVLRKEK 852

Query: 430  ---------------------VSSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYR 489
                                 V + G+ VDEEKVKAIK+WPTPK+++EVRSFHGLASFYR
Sbjct: 853  LYANLKKCSFCLDKVVFLGFVVGAKGITVDEEKVKAIKEWPTPKSITEVRSFHGLASFYR 912

Query: 490  RFIKNFSTIASPLNELVKKNVSFIWEKDQELAFNTLKEKLSSAPLLALPNFESTFEIECD 549
            RF+K+FST+A+PL E+VKK+V F W  +Q+ AF  +KE+L  APLLALP+F  TFEIECD
Sbjct: 913  RFVKDFSTLAAPLTEIVKKSVGFKWGSEQDRAFIEIKERLCGAPLLALPDFSKTFEIECD 972

Query: 550  ASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKELYALVRALQTWQHYLWPKEFIIH 609
            ASG+GIGAVLMQ +RP+ +FSEKL GA L YPTYDKELYALVRAL+TWQHYLWPKEF+IH
Sbjct: 973  ASGIGIGAVLMQEKRPIAYFSEKLNGAVLNYPTYDKELYALVRALETWQHYLWPKEFVIH 1032

Query: 610  TDHESLKHLRVQNKLNRRHAKWLEFIETFPYVIKYKQGKENIVADALSRRYVLLNTLNAR 669
            TDHESLKHL+ Q KLNRRHA+W+EFIETFPYVIKYKQGKENIVADALSRRY L++TLNA+
Sbjct: 1033 TDHESLKHLKGQGKLNRRHAQWMEFIETFPYVIKYKQGKENIVADALSRRYALISTLNAK 1092

Query: 670  LLGFEHIKDLYQHDMFFAPFVESCEKGLIVDNYLLLDGFLFRKGKLCIPSCSIRELLVRE 729
            LLGFE++K+LY +D  FA    +CEK      +  +DG+LFR+ +LC+P+ S+RELLVRE
Sbjct: 1093 LLGFEYVKELYVNDDDFASVFAACEKAAF-GKFYRIDGYLFRENRLCVPNSSMRELLVRE 1152

Query: 730  AHGGGLMAHHGVSKTYDMLSEHFFWPKMRHDVHKVCARCIACKQAKSRLQPHGLYSPLPV 789
            AHGGGLM H GV KT DML EHFFWPKM+ DV +VC+RC+ C+QAKSR+ PHGLY+PLPV
Sbjct: 1153 AHGGGLMGHFGVRKTLDMLHEHFFWPKMKRDVERVCSRCVTCRQAKSRVLPHGLYTPLPV 1212

Query: 790  PNGPWIDISMDFVLGLPRTRKGYDSIFVVVDRFSKMAHFIPCHKTDDAKHIADLFFREVV 849
            P+ PW+DISMDFVLGLPR+RKG DSIFVVVDRFSKMAHFI CHKTDDA HIADLFFRE+V
Sbjct: 1213 PSAPWVDISMDFVLGLPRSRKGRDSIFVVVDRFSKMAHFISCHKTDDATHIADLFFREIV 1272

Query: 850  RLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLIYSTTCHPQTDGQTEVVNRTMTAMLRA 909
            RLHG+P+SIVSDRDVKFLS+FW+VLWGKLGTKL++STTCHPQTDGQTEVVNRT++ +LR 
Sbjct: 1273 RLHGVPRSIVSDRDVKFLSYFWKVLWGKLGTKLLFSTTCHPQTDGQTEVVNRTLSTLLRT 1332

Query: 910  IIDKNLKTWEVCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPLDLLPIPSKEFVNFDA 969
            II KNLK WE CLPFIEFAYNR VHSTT+ +PFEIVYGFNPLTPLDLLP+P  E  + D 
Sbjct: 1333 IIQKNLKNWEDCLPFIEFAYNRSVHSTTEFSPFEIVYGFNPLTPLDLLPLPVNERTSLDG 1392

Query: 970  NAKVEFVHKLHKQVKEQIEKQNSKVATRINKGRKFVIFKPGDWVWVHFRKERFPTQRKSK 1029
              K E V KLH+ V++ IEK+N + A + NKGR+ VIF+PGDWVWVH RKERFP +R+SK
Sbjct: 1393 QKKAEMVKKLHESVRQHIEKKNEQYANKANKGRRQVIFQPGDWVWVHMRKERFPARRRSK 1452

Query: 1030 LLPRGDGPFQVLERINDNAYKIDLPGKYGVSTTFNVVDLSPFDVGDGFDSRTNLFQEGEN 1040
            L PRGDGPFQVLERINDNAYK+DLPG+Y +S TFNV DLS FDVGD  DSR+N F+E  N
Sbjct: 1453 LHPRGDGPFQVLERINDNAYKLDLPGEYNISATFNVSDLSLFDVGD--DSRSNPFEERGN 1512

BLAST of CmaCh04G019870 vs. ExPASy TrEMBL
Match: A0A2N9IF29 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS50797 PE=4 SV=1)

HSP 1 Score: 1271.9 bits (3290), Expect = 0.0e+00
Identity = 615/942 (65.29%), Positives = 723/942 (76.75%), Query Frame = 0

Query: 190  VLMYKGSCY-FSMLNPFLPSDFVVLLQEFEDLFSEEKPSSLPPLRGIEHKIDFIPGAPIP 249
            VL+YK +C+  + L+  LPS  + LLQE+ED+F  + PS LPP+RGIEH+IDF+PGA IP
Sbjct: 121  VLLYKEACFNTNELDESLPSVVISLLQEYEDVFPNDVPSGLPPIRGIEHQIDFVPGATIP 180

Query: 250  NRPAYRTNPKEAEEIQRQVSELLAKG------------------------------AINK 309
            NRPAYR+NP+E +E+QRQV ELLAKG                              AIN 
Sbjct: 181  NRPAYRSNPEETKELQRQVEELLAKGHVRESMSPCAVPVLLVPKKDGTWRMCVDCRAINN 240

Query: 310  ITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIRMHIGDEWKTAFKTKYGLYEWLV 369
            IT+KYRHPIPRLDDMLDELHG  +FTKIDLKSGYHQIRM  GDEWKTAFKTKYGLYEWLV
Sbjct: 241  ITVKYRHPIPRLDDMLDELHGSCIFTKIDLKSGYHQIRMKEGDEWKTAFKTKYGLYEWLV 300

Query: 370  MPFGLTNAPSTFMRLMNHVLREYL------------------------------------ 429
            MPFGLTNAPSTFMRLMNH LR +L                                    
Sbjct: 301  MPFGLTNAPSTFMRLMNHALRAFLGRFVVVYFDDILVYSKSLDEHIDHLHCVLTVLRKEK 360

Query: 430  ---------------------VSSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYR 489
                                 V + G+ VDEEKVKAIK+WPTPK+++EVRSFHGLASFYR
Sbjct: 361  LYANLKKCSFCLDKVVFLGFVVGAKGITVDEEKVKAIKEWPTPKSITEVRSFHGLASFYR 420

Query: 490  RFIKNFSTIASPLNELVKKNVSFIWEKDQELAFNTLKEKLSSAPLLALPNFESTFEIECD 549
            RF+K+FST+A+PL E+VKK+V F W  +Q+ AF  +KE+L  APLLALP+F  TFEIECD
Sbjct: 421  RFVKDFSTLAAPLTEIVKKSVGFKWGSEQDRAFIEIKERLCGAPLLALPDFSKTFEIECD 480

Query: 550  ASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKELYALVRALQTWQHYLWPKEFIIH 609
            ASG+GIGAVLMQ +RP+ +FSEKL GA L YPTYDKELYALVRAL+TWQHYLWPKEF+IH
Sbjct: 481  ASGIGIGAVLMQEKRPIAYFSEKLNGAVLNYPTYDKELYALVRALETWQHYLWPKEFVIH 540

Query: 610  TDHESLKHLRVQNKLNRRHAKWLEFIETFPYVIKYKQGKENIVADALSRRYVLLNTLNAR 669
            TDHESLKHL+ Q KLNRRHA+W+EFIETFPYVIKYKQGKENIVADALSRRY L++TLNA+
Sbjct: 541  TDHESLKHLKGQGKLNRRHAQWMEFIETFPYVIKYKQGKENIVADALSRRYALISTLNAK 600

Query: 670  LLGFEHIKDLYQHDMFFAPFVESCEKGLIVDNYLLLDGFLFRKGKLCIPSCSIRELLVRE 729
            LLGFE++K+LY +D  FA    +CEK      +  +DG+LFR+ +LC+P+ S+RELLVRE
Sbjct: 601  LLGFEYVKELYVNDDDFASVFAACEKAAF-GKFYRIDGYLFRENRLCVPNSSMRELLVRE 660

Query: 730  AHGGGLMAHHGVSKTYDMLSEHFFWPKMRHDVHKVCARCIACKQAKSRLQPHGLYSPLPV 789
            AHGGGLM H GV KT DML EHFFWPKM+ DV +VC+RC+ C+QAKSR+ PHGLY+PLPV
Sbjct: 661  AHGGGLMGHFGVRKTLDMLHEHFFWPKMKRDVERVCSRCVTCRQAKSRVLPHGLYTPLPV 720

Query: 790  PNGPWIDISMDFVLGLPRTRKGYDSIFVVVDRFSKMAHFIPCHKTDDAKHIADLFFREVV 849
            P+ PW+DISMDFVLGLPR+RKG DSIFVVVDRFSKMAHFI CHKTDDA HIADLFFRE+V
Sbjct: 721  PSAPWVDISMDFVLGLPRSRKGRDSIFVVVDRFSKMAHFISCHKTDDATHIADLFFREIV 780

Query: 850  RLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLIYSTTCHPQTDGQTEVVNRTMTAMLRA 909
            RLHG+P+SIVSDRDVKFLS+FW+VLWGKLGTKL++STTCHPQTDGQTEVVNRT++ +LR 
Sbjct: 781  RLHGVPRSIVSDRDVKFLSYFWKVLWGKLGTKLLFSTTCHPQTDGQTEVVNRTLSTLLRT 840

Query: 910  IIDKNLKTWEVCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPLDLLPIPSKEFVNFDA 969
            II KNLK WE CLPFIEFAYNR VHSTT+ +PFEIVYGFNPLTPLDLLP+P  E  + D 
Sbjct: 841  IIQKNLKNWEDCLPFIEFAYNRSVHSTTEFSPFEIVYGFNPLTPLDLLPLPVNERTSLDG 900

Query: 970  NAKVEFVHKLHKQVKEQIEKQNSKVATRINKGRKFVIFKPGDWVWVHFRKERFPTQRKSK 1029
              K E V KLH+ V++ IEK+N + A + NKGR+ VIF+PGDWVWVH RKERFP +R+SK
Sbjct: 901  QKKAEMVKKLHESVRQHIEKKNEQYANKANKGRRQVIFQPGDWVWVHMRKERFPARRRSK 960

Query: 1030 LLPRGDGPFQVLERINDNAYKIDLPGKYGVSTTFNVVDLSPFDVGDGFDSRTNLFQEGEN 1040
            L PRGDGPFQVLERINDNAYK+DLPG+Y +S TFNV DLS FDVGD  DSR+N F+E  N
Sbjct: 961  LHPRGDGPFQVLERINDNAYKLDLPGEYNISATFNVSDLSLFDVGD--DSRSNPFEERGN 1020

BLAST of CmaCh04G019870 vs. ExPASy TrEMBL
Match: A0A2N9ENW8 (Reverse transcriptase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS8418 PE=4 SV=1)

HSP 1 Score: 1271.9 bits (3290), Expect = 0.0e+00
Identity = 616/942 (65.39%), Positives = 723/942 (76.75%), Query Frame = 0

Query: 190  VLMYKGSCY-FSMLNPFLPSDFVVLLQEFEDLFSEEKPSSLPPLRGIEHKIDFIPGAPIP 249
            VL+YK +C+  + L+  LPS  V LLQE+ED+F  + PS LPP+RGIEH+IDF+PGA IP
Sbjct: 422  VLLYKEACFNTNELDESLPSVVVSLLQEYEDVFPNDVPSGLPPIRGIEHQIDFVPGATIP 481

Query: 250  NRPAYRTNPKEAEEIQRQVSELLAKG------------------------------AINK 309
            NRPAYR+NP+E +E+QRQV ELLAKG                              AIN 
Sbjct: 482  NRPAYRSNPEETKELQRQVEELLAKGHVRESMSPCAVPVLLVPKKDGTWRMCVDCRAINN 541

Query: 310  ITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIRMHIGDEWKTAFKTKYGLYEWLV 369
            IT+KYRHPIPRLDDMLDELHG  +FTKIDLKSGYHQIRM  GDEWKTAFKTKYGLYEWLV
Sbjct: 542  ITVKYRHPIPRLDDMLDELHGSCIFTKIDLKSGYHQIRMKEGDEWKTAFKTKYGLYEWLV 601

Query: 370  MPFGLTNAPSTFMRLMNHVLREYL------------------------------------ 429
            MPFGLTNAPSTFMRLMNH LR +L                                    
Sbjct: 602  MPFGLTNAPSTFMRLMNHALRAFLGRFVVVYFDDILVYSKSLDEHIDHLHCVLTVLRKEK 661

Query: 430  ---------------------VSSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYR 489
                                 V + G+ VDEEKVKAIK+WPTPK+++EVRSFHGLASFYR
Sbjct: 662  LYANLKKCSFCLDKVVFLGFVVGAKGIAVDEEKVKAIKEWPTPKSITEVRSFHGLASFYR 721

Query: 490  RFIKNFSTIASPLNELVKKNVSFIWEKDQELAFNTLKEKLSSAPLLALPNFESTFEIECD 549
            RF+K+FST+A+PL E+VKK+V F W  +Q+ AF  +KE+L  APLLALP+F  TFEIECD
Sbjct: 722  RFVKDFSTLAAPLTEIVKKSVGFKWGSEQDRAFIEIKERLCGAPLLALPDFSKTFEIECD 781

Query: 550  ASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKELYALVRALQTWQHYLWPKEFIIH 609
            ASG+GIGAVLMQ +RP+ +FSEKL GA+L YPTYDKELYALVRAL+TWQHYLWPKEF+IH
Sbjct: 782  ASGIGIGAVLMQEKRPIAYFSEKLNGAALNYPTYDKELYALVRALETWQHYLWPKEFVIH 841

Query: 610  TDHESLKHLRVQNKLNRRHAKWLEFIETFPYVIKYKQGKENIVADALSRRYVLLNTLNAR 669
            TDHESLKHL+ Q KLNRRHA+W+EFIETFPYVIKYKQGKENIVADALSRRY L++TLNA+
Sbjct: 842  TDHESLKHLKGQGKLNRRHAQWMEFIETFPYVIKYKQGKENIVADALSRRYALISTLNAK 901

Query: 670  LLGFEHIKDLYQHDMFFAPFVESCEKGLIVDNYLLLDGFLFRKGKLCIPSCSIRELLVRE 729
            LLGFE++K+LY +D  FA    +CEK      +  LDG+LFR+ +LC+P+ S+RELLVRE
Sbjct: 902  LLGFEYVKELYVNDDDFASVFAACEKAAF-GKFYRLDGYLFRENRLCVPNSSMRELLVRE 961

Query: 730  AHGGGLMAHHGVSKTYDMLSEHFFWPKMRHDVHKVCARCIACKQAKSRLQPHGLYSPLPV 789
            AHGGGLM H GV KT D+L EHFFWPKM+ DV +VC+RC+ C+QAKSR+ PHGLY+PLPV
Sbjct: 962  AHGGGLMGHFGVRKTLDVLHEHFFWPKMKRDVERVCSRCVTCRQAKSRVLPHGLYTPLPV 1021

Query: 790  PNGPWIDISMDFVLGLPRTRKGYDSIFVVVDRFSKMAHFIPCHKTDDAKHIADLFFREVV 849
            P+ PW+DISMDFVLGLPR+RKG DSIFVVVDRFSKMAHFI CHKTDDA HIADLFFRE+V
Sbjct: 1022 PSAPWVDISMDFVLGLPRSRKGRDSIFVVVDRFSKMAHFISCHKTDDATHIADLFFREIV 1081

Query: 850  RLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLIYSTTCHPQTDGQTEVVNRTMTAMLRA 909
            RLHG+P+SIVSDRDVKFLS+FW+VLWGKLGTKL++STTCHPQTDGQTEVVNRT++ +LR 
Sbjct: 1082 RLHGVPRSIVSDRDVKFLSYFWKVLWGKLGTKLLFSTTCHPQTDGQTEVVNRTLSTLLRT 1141

Query: 910  IIDKNLKTWEVCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPLDLLPIPSKEFVNFDA 969
            II KNLK WE CLPFIEFAYNR VHSTT  +PFEIVYGFNPLTPLDLLP+P  E  + D 
Sbjct: 1142 IIQKNLKNWEDCLPFIEFAYNRSVHSTTDFSPFEIVYGFNPLTPLDLLPLPVNERTSLDG 1201

Query: 970  NAKVEFVHKLHKQVKEQIEKQNSKVATRINKGRKFVIFKPGDWVWVHFRKERFPTQRKSK 1029
              K E V KLH+ V++ IEK+N + A + NKGR+ VIF+PGDWVWVH RKERFP +R+SK
Sbjct: 1202 QKKAEMVKKLHESVRQHIEKKNEQYANKANKGRRQVIFEPGDWVWVHMRKERFPARRRSK 1261

Query: 1030 LLPRGDGPFQVLERINDNAYKIDLPGKYGVSTTFNVVDLSPFDVGDGFDSRTNLFQEGEN 1040
            L PRGDGPFQVLERINDNAYK+DLPG+Y +S TFNV DLS FDVGD  DSR+N F+E  N
Sbjct: 1262 LHPRGDGPFQVLERINDNAYKLDLPGEYNISATFNVSDLSLFDVGD--DSRSNPFEERGN 1321

BLAST of CmaCh04G019870 vs. ExPASy TrEMBL
Match: A0A2N9F7E8 (Reverse transcriptase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS10964 PE=4 SV=1)

HSP 1 Score: 1271.5 bits (3289), Expect = 0.0e+00
Identity = 615/942 (65.29%), Positives = 723/942 (76.75%), Query Frame = 0

Query: 190  VLMYKGSCY-FSMLNPFLPSDFVVLLQEFEDLFSEEKPSSLPPLRGIEHKIDFIPGAPIP 249
            VL+YK +C+  + L+  LPS  + LLQE+ED+F  + PS LPP+RGIEH+IDF+PGA IP
Sbjct: 915  VLLYKEACFNTNELDESLPSVVISLLQEYEDVFPNDVPSGLPPIRGIEHQIDFVPGATIP 974

Query: 250  NRPAYRTNPKEAEEIQRQVSELLAKG------------------------------AINK 309
            NRPAYR+NP+E +E+QRQV ELLAKG                              AIN 
Sbjct: 975  NRPAYRSNPEETKELQRQVEELLAKGHVRESMSPCAVPVLLVPKKDGTWRMCVDCRAINN 1034

Query: 310  ITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIRMHIGDEWKTAFKTKYGLYEWLV 369
            IT+KYRHPIPRLDDMLDELHG  +FTKIDLKSGYHQIRM  GDEWKTAFKTKYGLYEWLV
Sbjct: 1035 ITVKYRHPIPRLDDMLDELHGSCIFTKIDLKSGYHQIRMKEGDEWKTAFKTKYGLYEWLV 1094

Query: 370  MPFGLTNAPSTFMRLMNHVLREYL------------------------------------ 429
            MPFGLTNAPSTFMRLMNH LR +L                                    
Sbjct: 1095 MPFGLTNAPSTFMRLMNHALRAFLGRFVVVYFDDILVYSKSLDEHIDHLHCVLTVLRKEK 1154

Query: 430  ---------------------VSSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYR 489
                                 V + G+ VDEEKVKAIK+WPTPK+++EVRSFHGLASFYR
Sbjct: 1155 LYANLKKCSFCLDKVVFLGFVVGAKGIAVDEEKVKAIKEWPTPKSITEVRSFHGLASFYR 1214

Query: 490  RFIKNFSTIASPLNELVKKNVSFIWEKDQELAFNTLKEKLSSAPLLALPNFESTFEIECD 549
            RF+K+FST+A+PL E+VKK+V F W  +Q+ AF  +KE+L  APLLALP+F  TFEIECD
Sbjct: 1215 RFVKDFSTLAAPLTEIVKKSVGFKWGSEQDRAFIEIKERLCGAPLLALPDFSKTFEIECD 1274

Query: 550  ASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKELYALVRALQTWQHYLWPKEFIIH 609
            ASG+GIGAVLMQ +RP+ +FSEKL GA+L YPTYDKELYALVRAL+TWQHYLWPKEF+IH
Sbjct: 1275 ASGIGIGAVLMQEKRPIAYFSEKLNGAALNYPTYDKELYALVRALETWQHYLWPKEFVIH 1334

Query: 610  TDHESLKHLRVQNKLNRRHAKWLEFIETFPYVIKYKQGKENIVADALSRRYVLLNTLNAR 669
            TDHESLKHL+ Q KLNRRHA+W+EFIETFPYVIKYKQGKENIVADALSRRY L++TLNA+
Sbjct: 1335 TDHESLKHLKGQGKLNRRHAQWMEFIETFPYVIKYKQGKENIVADALSRRYALISTLNAK 1394

Query: 670  LLGFEHIKDLYQHDMFFAPFVESCEKGLIVDNYLLLDGFLFRKGKLCIPSCSIRELLVRE 729
            LLGFE++K+LY +D  FA    +CEK      +  LDG+LFR+ +LC+P+ S+RELLVRE
Sbjct: 1395 LLGFEYVKELYVNDDDFASVFAACEKAAF-GKFYRLDGYLFRENRLCVPNSSMRELLVRE 1454

Query: 730  AHGGGLMAHHGVSKTYDMLSEHFFWPKMRHDVHKVCARCIACKQAKSRLQPHGLYSPLPV 789
            AHGGGLM H GV KT D+L EHFFWPKM+ DV +VC+RC+ C+QAKSR+ PHGLY+PLPV
Sbjct: 1455 AHGGGLMGHFGVRKTLDVLHEHFFWPKMKRDVERVCSRCVTCRQAKSRVLPHGLYTPLPV 1514

Query: 790  PNGPWIDISMDFVLGLPRTRKGYDSIFVVVDRFSKMAHFIPCHKTDDAKHIADLFFREVV 849
            P+ PW+DISMDFVLGLPR+RKG DSIFVVVDRFSKMAHFI CHKTDDA HIADLFFRE+V
Sbjct: 1515 PSAPWVDISMDFVLGLPRSRKGRDSIFVVVDRFSKMAHFISCHKTDDATHIADLFFREIV 1574

Query: 850  RLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLIYSTTCHPQTDGQTEVVNRTMTAMLRA 909
            RLHG+P+SIVSDRDVKFLS+FW+VLWGKLGTKL++STTCHPQTDGQTEVVNRT++ +LR 
Sbjct: 1575 RLHGVPRSIVSDRDVKFLSYFWKVLWGKLGTKLLFSTTCHPQTDGQTEVVNRTLSTLLRT 1634

Query: 910  IIDKNLKTWEVCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPLDLLPIPSKEFVNFDA 969
            II KNLK WE CLPFIEFAYNR VHSTT  +PFEIVYGFNPLTPLDLLP+P  E  + D 
Sbjct: 1635 IIQKNLKNWEDCLPFIEFAYNRSVHSTTDFSPFEIVYGFNPLTPLDLLPLPVNERTSLDG 1694

Query: 970  NAKVEFVHKLHKQVKEQIEKQNSKVATRINKGRKFVIFKPGDWVWVHFRKERFPTQRKSK 1029
              K E V KLH+ V++ IEK+N + A + NKGR+ VIF+PGDWVWVH RKERFP +R+SK
Sbjct: 1695 QKKAEMVKKLHESVRQHIEKKNEQYANKANKGRRQVIFEPGDWVWVHMRKERFPARRRSK 1754

Query: 1030 LLPRGDGPFQVLERINDNAYKIDLPGKYGVSTTFNVVDLSPFDVGDGFDSRTNLFQEGEN 1040
            L PRGDGPFQVLERINDNAYK+DLPG+Y +S TFNV DLS FDVGD  DSR+N F+E  N
Sbjct: 1755 LHPRGDGPFQVLERINDNAYKLDLPGEYNISATFNVSDLSLFDVGD--DSRSNPFEERGN 1814

BLAST of CmaCh04G019870 vs. NCBI nr
Match: OWM74668.1 (hypothetical protein CDL15_Pgr005248 [Punica granatum])

HSP 1 Score: 1260.0 bits (3259), Expect = 0.0e+00
Identity = 612/954 (64.15%), Positives = 721/954 (75.58%), Query Frame = 0

Query: 189  LVLMYKGSCYFSMLNPFLPSDFVVLLQEFEDLFSEEKPSSLPPLRGIEHKIDFIPGAPIP 248
            ++ +YK +   +  N +LPS  V LLQEF+D+F E  P  LPP+RGIEH+IDFIPGAPIP
Sbjct: 652  ILFLYKEAYLSNTFNLYLPSVVVSLLQEFDDVFLEGTPPGLPPIRGIEHQIDFIPGAPIP 711

Query: 249  NRPAYRTNPKEAEEIQRQVSELLAKG------------------------------AINK 308
            NRPAYR NP+EA+E+Q+QV ELL KG                              A+NK
Sbjct: 712  NRPAYRCNPEEAKELQKQVDELLTKGYVRESMSPCSVPVLLVPKKDGTWRMCVDCRAVNK 771

Query: 309  ITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIRMHIGDEWKTAFKTKYGLYEWLV 368
            IT+KYR+PIPRLDDMLDELHG ++F+KIDLKSGYHQIRM  GDEWKTAFKTK GLYEWLV
Sbjct: 772  ITVKYRYPIPRLDDMLDELHGSTIFSKIDLKSGYHQIRMKEGDEWKTAFKTKSGLYEWLV 831

Query: 369  MPFGLTNAPSTFMRLMNHVLREYL------------------------------------ 428
            MPFGLTNAPSTFMRLMNHVLR Y+                                    
Sbjct: 832  MPFGLTNAPSTFMRLMNHVLRAYIGKFVVVYFDDILIYSKTEHDHMNHLRCVLEVLRHEK 891

Query: 429  ---------------------VSSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYR 488
                                 VSS GVEVDEEKVKAI++WPTP  ++EVRSFHGLA FYR
Sbjct: 892  LYANLKKCEFFLESVVFLGFVVSSKGVEVDEEKVKAIREWPTPTTIAEVRSFHGLAGFYR 951

Query: 489  RFIKNFSTIASPLNELVKKNVSFIWEKDQELAFNTLKEKLSSAPLLALPNFESTFEIECD 548
            RF++NFST+A+PL E++KK V F W K+QE AFNTLKEKLSSAPLL LP+F   FEIECD
Sbjct: 952  RFVRNFSTVAAPLTEIIKKEVGFRWGKEQENAFNTLKEKLSSAPLLILPDFSKPFEIECD 1011

Query: 549  ASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKELYALVRALQTWQHYLWPKEFIIH 608
            ASG+GIGAVLMQ +RP+ +FSEKL GA+L Y TYDKELYALVRAL+TWQHYLW KEFIIH
Sbjct: 1012 ASGIGIGAVLMQEKRPIAYFSEKLNGAALNYSTYDKELYALVRALETWQHYLWSKEFIIH 1071

Query: 609  TDHESLKHLRVQNKLNRRHAKWLEFIETFPYVIKYKQGKENIVADALSRRYVLLNTLNAR 668
            TDHESLKHL+ Q+KLNRRH +W+EFIE FPYVI+YK+GKEN+VADALSRRY L++TL+A+
Sbjct: 1072 TDHESLKHLKGQSKLNRRHTRWIEFIEMFPYVIQYKKGKENVVADALSRRYTLISTLDAK 1131

Query: 669  LLGFEHIKDLYQHDMFFAPFVESCEKGLIVDNYLLLDGFLFRKGKLCIPSCSIRELLVRE 728
            LLGFE+IK+LY HD  F      CEKG   D +   +G+LFR+ KLCIP  S+RELLVRE
Sbjct: 1132 LLGFEYIKELYLHDHDFKEVFSECEKGAF-DKFYKHEGYLFRENKLCIPQSSMRELLVRE 1191

Query: 729  AHGGGLMAHHGVSKTYDMLSEHFFWPKMRHDVHKVCARCIACKQAKSRLQPHGLYSPLPV 788
            AHGGGLM H GV+KT D+L EHFFWP M+ DV ++C RC+ CK+AKS++QPHGLY PLPV
Sbjct: 1192 AHGGGLMGHFGVAKTLDVLREHFFWPHMKRDVERICLRCVTCKKAKSKIQPHGLYMPLPV 1251

Query: 789  PNGPWIDISMDFVLGLPRTRKGYDSIFVVVDRFSKMAHFIPCHKTDDAKHIADLFFREVV 848
            P+ PW D+SMDFVLGLPRT+ G DSIFVVVDRFSKMAHFIPC KTDDA H+A LFF+EVV
Sbjct: 1252 PSHPWTDVSMDFVLGLPRTKNGKDSIFVVVDRFSKMAHFIPCKKTDDATHVAGLFFKEVV 1311

Query: 849  RLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLIYSTTCHPQTDGQTEVVNRTMTAMLRA 908
            RLHGIP++IVSDRDVKFLSHFWRVLWGKLGTKL++STTCHPQTDGQTEVVNRT+  +LRA
Sbjct: 1312 RLHGIPRTIVSDRDVKFLSHFWRVLWGKLGTKLLFSTTCHPQTDGQTEVVNRTLGTLLRA 1371

Query: 909  IIDKNLKTWEVCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPLDLLPIPSKEFVNFDA 968
            +I +N+K+WE C+PFIEFAYNR +HS+TK +PFE+VYGFNPLTPLDL P+P  E V+ D 
Sbjct: 1372 VIKRNVKSWEDCIPFIEFAYNRAMHSSTKFSPFEVVYGFNPLTPLDLTPLPIGEIVSLDG 1431

Query: 969  NAKVEFVHKLHKQVKEQIEKQNSKVATRINKGRKFVIFKPGDWVWVHFRKERFPTQRKSK 1028
              K E V K+H++ +  I  +N + ATR NKGRK V F+PGDWVWVHFRKERF +QRKSK
Sbjct: 1432 KRKAELVKKIHEEARNHILHKNEQAATRANKGRKHVTFEPGDWVWVHFRKERFSSQRKSK 1491

Query: 1029 LLPRGDGPFQVLERINDNAYKIDLPGKYGVSTTFNVVDLSPFDVGDGFDSRTNLFQEGEN 1048
            L PRGDGPFQVLE+INDNAYK+DLPG+Y VS+TFNV DLSPFDV  G DSRTN F+EG N
Sbjct: 1492 LNPRGDGPFQVLEKINDNAYKLDLPGEYQVSSTFNVSDLSPFDV--GADSRTNPFEEGGN 1551

BLAST of CmaCh04G019870 vs. NCBI nr
Match: PSS05945.1 (Integrase [Actinidia chinensis var. chinensis])

HSP 1 Score: 1239.9 bits (3207), Expect = 0.0e+00
Identity = 607/1013 (59.92%), Positives = 736/1013 (72.66%), Query Frame = 0

Query: 159  REYLVSSNGVEVDEE--------KVKAIKDWPTPKNTTLVLMYKGS-CYFSMLNPFLPSD 218
            RE  +    +EV  E        K K IK         +VL+YK +    + L+  +PS 
Sbjct: 570  REKAIIDPAIEVKTERKQKNFYAKAKEIKRAMFSNQPMIVLLYKEAFINTNGLDSVIPSS 629

Query: 219  FVVLLQEFEDLFSEEKPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEAEEIQRQVSE 278
             V LLQEFED+F +E P  LPP+RGIEH+IDF+PGA IPNRPAYR+NP E +E+QRQVSE
Sbjct: 630  VVSLLQEFEDVFPDEMPHGLPPIRGIEHQIDFVPGASIPNRPAYRSNPDETKELQRQVSE 689

Query: 279  LLAKG------------------------------AINKITIKYRHPIPRLDDMLDELHG 338
            LL KG                              AIN IT+KYRHPIPRLDDMLDELHG
Sbjct: 690  LLEKGYVRESMSPCAVPVLLVPKKDGTWRMCVDCRAINNITVKYRHPIPRLDDMLDELHG 749

Query: 339  CSLFTKIDLKSGYHQIRMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLR 398
              +F+KIDLKSGYHQIRM  GDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNH+LR
Sbjct: 750  SCVFSKIDLKSGYHQIRMKEGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILR 809

Query: 399  EYL--------------------------------------------------------- 458
             ++                                                         
Sbjct: 810  AFIGKCVVVYFDDILIYSKNLDDHVQHLKSVLDVLRQEKLFANLKKCTFCTDNLVFLGFV 869

Query: 459  VSSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASPLNELVKKNV 518
            VS+ G+ VD EK++AI++WP+P  V  VRSFHGLASFYRRF+K+FST+ +PL E++KKNV
Sbjct: 870  VSAQGLHVDAEKIRAIQEWPSPTTVGNVRSFHGLASFYRRFVKDFSTLVAPLTEVIKKNV 929

Query: 519  SFIWEKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFS 578
             F W  +QE AF  +K++L++APLL+LPNF   FEIECDASG+GIGAVLMQ  RP+ +FS
Sbjct: 930  GFKWGDEQEKAFQLVKQRLTNAPLLSLPNFAKMFEIECDASGMGIGAVLMQEGRPIAYFS 989

Query: 579  EKLTGASLRYPTYDKELYALVRALQTWQHYLWPKEFIIHTDHESLKHLRVQNKLNRRHAK 638
            EKL+GA+L YPTYDKELYALVRAL+TW+HYLW +EF+IHTDHESLKHL+ Q+KLN+RHA+
Sbjct: 990  EKLSGAALNYPTYDKELYALVRALETWRHYLWHREFVIHTDHESLKHLKGQHKLNKRHAR 1049

Query: 639  WLEFIETFPYVIKYKQGKENIVADALSRRYVLLNTLNARLLGFEHIKDLYQHDMFFAPFV 698
            W+EFIETFPYVI+YKQGKEN+VADALSRRYVLL+TL+A+LLGFE IK+LY  D  F    
Sbjct: 1050 WMEFIETFPYVIRYKQGKENVVADALSRRYVLLSTLDAKLLGFEQIKELYATDHDF---- 1109

Query: 699  ESCEKGLIVDN-----YLLLDGFLFRKGKLCIPSCSIRELLVREAHGGGLMAHHGVSKTY 758
              CE+  + +N     Y   DGFLFR+ KLC+P+CS+RELLVRE+HGGGLM H G++KT 
Sbjct: 1110 --CEEYKLSENSANGRYFRHDGFLFRENKLCVPNCSVRELLVRESHGGGLMGHFGIAKTL 1169

Query: 759  DMLSEHFFWPKMRHDVHKVCARCIACKQAKSRLQPHGLYSPLPVPNGPWIDISMDFVLGL 818
             +L EHF+WP M+ D+ ++C RCI CKQAKSR+Q HGLY+PLP+P+ PWIDISMDFVLGL
Sbjct: 1170 AILQEHFYWPHMKRDIERICGRCITCKQAKSRVQHHGLYTPLPIPSEPWIDISMDFVLGL 1229

Query: 819  PRTRKGYDSIFVVVDRFSKMAHFIPCHKTDDAKHIADLFFREVVRLHGIPKSIVSDRDVK 878
            PR+++G DS+FVVVDRFSKMAHFIPCHKTDDA H+A+LFF+E+VRLHG+P++IVSDRD K
Sbjct: 1230 PRSKRGKDSVFVVVDRFSKMAHFIPCHKTDDASHVAELFFKEIVRLHGLPRTIVSDRDAK 1289

Query: 879  FLSHFWRVLWGKLGTKLIYSTTCHPQTDGQTEVVNRTMTAMLRAIIDKNLKTWEVCLPFI 938
            FLS+FW+ LWGKLGTKL++STTCHPQTDGQTEVVNRT++ +LRAII KN+KTWE CLP +
Sbjct: 1290 FLSYFWKTLWGKLGTKLLFSTTCHPQTDGQTEVVNRTLSTLLRAIIKKNIKTWEDCLPHV 1349

Query: 939  EFAYNRVVHSTTKCTPFEIVYGFNPLTPLDLLPIPSKEFVNFDANAKVEFVHKLHKQVKE 998
            EFAYNR VHS TK +PFEIVYGFNPLTPLDL P+P  E VN D   K E V ++H++ K 
Sbjct: 1350 EFAYNRSVHSATKFSPFEIVYGFNPLTPLDLSPLPLTEHVNLDGKRKAELVKQIHEKAKL 1409

Query: 999  QIEKQNSKVATRINKGRKFVIFKPGDWVWVHFRKERFPTQRKSKLLPRGDGPFQVLERIN 1058
             IE++  + A + NKGRK V+F+PGDWVW+H RKERFPTQRKSKLLPRGDGPFQVLERIN
Sbjct: 1410 NIERRTEQYAKQANKGRKQVVFEPGDWVWLHMRKERFPTQRKSKLLPRGDGPFQVLERIN 1469

Query: 1059 DNAYKIDLPGKYGVSTTFNVVDLSPFDVGDGFDSRTNLFQEGEND-------MNHDQGIS 1060
            DNAYK+DLPG+Y VS TFN+ DLSPF VGD  D RTN FQE END         +   I 
Sbjct: 1470 DNAYKLDLPGEYNVSATFNISDLSPFAVGDELDLRTNPFQEEENDEDMANTRSRNADPIQ 1529

BLAST of CmaCh04G019870 vs. NCBI nr
Match: TYK22420.1 (Transposon Ty3-I Gag-Pol polyprotein [Cucumis melo var. makuwa])

HSP 1 Score: 1229.9 bits (3181), Expect = 0.0e+00
Identity = 595/917 (64.89%), Positives = 710/917 (77.43%), Query Frame = 0

Query: 216  EFEDLF-SEEKPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEAEEIQRQVSELLAKG 275
            EF D+F  E+ P+ LPPLRGIEH+IDFIPGA +PN  AYRTNP E +EIQRQV EL+ KG
Sbjct: 442  EFNDMFPHEDAPTGLPPLRGIEHQIDFIPGATLPNMAAYRTNPTETKEIQRQVEELMDKG 501

Query: 276  ------------------------------AINKITIKYRHPIPRLDDMLDELHGCSLFT 335
                                          AINKIT+KYRHPIPRLDDMLDELHG +LF+
Sbjct: 502  YIRESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDELHGANLFS 561

Query: 336  KIDLKSGYHQIRMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLREY--- 395
            KIDLKSGYHQIRMH+GDEWKTAFKTK+GLYEWLVMPFGLTNAPSTFMRLMNHVL+EY   
Sbjct: 562  KIDLKSGYHQIRMHVGDEWKTAFKTKFGLYEWLVMPFGLTNAPSTFMRLMNHVLKEYIGK 621

Query: 396  ------------------------------------------------------LVSSNG 455
                                                                  +V  +G
Sbjct: 622  FVVVYFDDILVYSKGLNDHILHVKTILLKLREEKLYANFKKCSFCLEQIHFLGFIVGKDG 681

Query: 456  VEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASPLNELVKKNVSFIWE 515
            V+VDEEKVKAI++WPTP N SEVRSFHGLASFYRRFIK+FS+IASPL ELVKK+V F W+
Sbjct: 682  VKVDEEKVKAIREWPTPTNASEVRSFHGLASFYRRFIKDFSSIASPLTELVKKHVKFEWK 741

Query: 516  KDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTG 575
            + QE AFN LKEKL  AP LALPNF+ +FEIECDASG+GIGAVLMQ ++P+MFFSEKL G
Sbjct: 742  EKQENAFNELKEKLIKAPCLALPNFDKSFEIECDASGIGIGAVLMQEKQPIMFFSEKLNG 801

Query: 576  ASLRYPTYDKELYALVRALQTWQHYLWPKEFIIHTDHESLKHLRVQNKLNRRHAKWLEFI 635
            A L Y TYDKEL+ALVRAL+ WQHYLWPKEF+IHTDHESLKHL+ Q KLN+RHAKW+EFI
Sbjct: 802  AQLNYSTYDKELHALVRALKVWQHYLWPKEFVIHTDHESLKHLKGQTKLNKRHAKWVEFI 861

Query: 636  ETFPYVIKYKQGKENIVADALSRRYVLLNTLNARLLGFEHIKDLYQHDMF-FAPFVESCE 695
            ETFPYVI YK+GK+N+VADALSRRY L ++L+A++LGF+H+ +LY+ +   F      C 
Sbjct: 862  ETFPYVIHYKKGKDNMVADALSRRYALFSSLSAKVLGFKHMIELYKVEKSEFYDVYAQCL 921

Query: 696  KGLIVDNYLLLDGFLFRKGKLCIPSCSIRELLVREAHGGGLMAHHGVSKTYDMLSEHFFW 755
            +G  V +Y++ DG LFRKGKLCIP CSIRELLV+EAHGGGLM H G  KTY ML EHF+W
Sbjct: 922  EGKNVQDYIVFDGMLFRKGKLCIPKCSIRELLVKEAHGGGLMGHFGEFKTYSMLCEHFYW 981

Query: 756  PKMRHDVHKVCARCIACKQAKSRLQPHGLYSPLPVPNGPWIDISMDFVLGLPRTRKGYDS 815
             KMR DV+KVC +C  CK+AKS+ QPHGLY+PL VPN PW+DISMDFVLGLP+TR+ +DS
Sbjct: 982  LKMRKDVNKVCKQCFKCKEAKSKTQPHGLYTPLDVPNEPWVDISMDFVLGLPKTRRHHDS 1041

Query: 816  IFVVVDRFSKMAHFIPCHKTDDAKHIADLFFREVVRLHGIPKSIVSDRDVKFLSHFWRVL 875
            IFVVVDRFSKMAHFIPC+KTDDA +IA+LFFREVVRLHGIPK+IVSDRDVKFLSHFW+VL
Sbjct: 1042 IFVVVDRFSKMAHFIPCNKTDDATNIANLFFREVVRLHGIPKTIVSDRDVKFLSHFWKVL 1101

Query: 876  WGKLGTKLIYSTTCHPQTDGQTEVVNRTMTAMLRAIIDKNLKTWEVCLPFIEFAYNRVVH 935
            WGKLGTKL++STTCHPQTDGQTEVVNRT+ A+LR++I KNLK+WE  LPF+EFAYNR +H
Sbjct: 1102 WGKLGTKLLFSTTCHPQTDGQTEVVNRTLGALLRSLISKNLKSWEETLPFVEFAYNRAIH 1161

Query: 936  STTKCTPFEIVYGFNPLTPLDLLPIPSKEFVNFDANAKVEFVHKLHKQVKEQIEKQNSKV 995
            STT C+PFE+VYGFNPLTPLDL P+P   F +  A+++VE++  LHK++KE+IEK+N K+
Sbjct: 1162 STTHCSPFEVVYGFNPLTPLDLSPLPPNMFTSDAASSRVEYIKTLHKEIKERIEKKNQKL 1221

Query: 996  ATRINKGRKFVIFKPGDWVWVHFRKERFPTQRKSKLLPRGDGPFQVLERINDNAYKIDLP 1042
             TR N+GRK +IFKPGDWVWVH RKERFP QRKSKL  RGDGPFQVLERIN+NAYK+DL 
Sbjct: 1222 VTRKNQGRKELIFKPGDWVWVHLRKERFPDQRKSKLQQRGDGPFQVLERINNNAYKLDLR 1281

BLAST of CmaCh04G019870 vs. NCBI nr
Match: TYK02449.1 (F15O4.13 [Cucumis melo var. makuwa])

HSP 1 Score: 1228.0 bits (3176), Expect = 0.0e+00
Identity = 593/918 (64.60%), Positives = 711/918 (77.45%), Query Frame = 0

Query: 215  QEFEDLF-SEEKPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEAEEIQRQVSELLAK 274
            +EF D+F  E+ P+ LPPLRGIEH+IDFIPGA +PN  AYRTNP E +EIQRQV EL+ K
Sbjct: 542  EEFNDMFPHEDAPTGLPPLRGIEHQIDFIPGATLPNMAAYRTNPTETKEIQRQVEELMDK 601

Query: 275  G------------------------------AINKITIKYRHPIPRLDDMLDELHGCSLF 334
            G                              AINKIT+KYRHPIPRLDDMLDELHG +LF
Sbjct: 602  GYIRESMSPCSVPVILVPKKDGTWRMCVDCRAINKITVKYRHPIPRLDDMLDELHGANLF 661

Query: 335  TKIDLKSGYHQIRMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLREY-- 394
            +KIDLKSGYHQIRMH+GDEWKTAFKTK+GLYEWLVMPFGLTNAPSTFMRLMNHVL+EY  
Sbjct: 662  SKIDLKSGYHQIRMHVGDEWKTAFKTKFGLYEWLVMPFGLTNAPSTFMRLMNHVLKEYIG 721

Query: 395  -------------------------------------------------------LVSSN 454
                                                                   +V  +
Sbjct: 722  KFVVVYFDDILVYSKGLNDHILHVKTILLKLREEKLYANFKKCSFCLEQIHFLGFIVGKD 781

Query: 455  GVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASPLNELVKKNVSFIW 514
            GV+VDEEKVKAI++WPTP N SEVRSFHGLASFYRRFIK+FS+IASPL ELVKK+V F W
Sbjct: 782  GVKVDEEKVKAIREWPTPTNASEVRSFHGLASFYRRFIKDFSSIASPLTELVKKHVKFEW 841

Query: 515  EKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLT 574
            ++ QE AFN LKEKL  AP LALPNF+ +FEIECDASG+GIGAVLMQ ++P+MFFSEKL 
Sbjct: 842  KEKQENAFNELKEKLIKAPCLALPNFDKSFEIECDASGIGIGAVLMQEKQPIMFFSEKLN 901

Query: 575  GASLRYPTYDKELYALVRALQTWQHYLWPKEFIIHTDHESLKHLRVQNKLNRRHAKWLEF 634
            GA L Y TYDKEL+ALVRAL+ WQHYLWPKEF+IHTDHESLKHL+ Q KLN+RHAKW+EF
Sbjct: 902  GAQLNYSTYDKELHALVRALKVWQHYLWPKEFVIHTDHESLKHLKGQTKLNKRHAKWVEF 961

Query: 635  IETFPYVIKYKQGKENIVADALSRRYVLLNTLNARLLGFEHIKDLYQHDMF-FAPFVESC 694
            IETFPYVI YK+GK+N+VADALSRRY L ++L+A++LGF+H+ +LY+ +   F      C
Sbjct: 962  IETFPYVIHYKKGKDNMVADALSRRYALFSSLSAKVLGFKHMIELYKVEKSEFYDVYAQC 1021

Query: 695  EKGLIVDNYLLLDGFLFRKGKLCIPSCSIRELLVREAHGGGLMAHHGVSKTYDMLSEHFF 754
             +G  V +Y++ DG LFRKGKLCIP CSIRELLV+EAHGGGLM H G  KTY +L EHF+
Sbjct: 1022 LEGKNVQDYIVFDGMLFRKGKLCIPKCSIRELLVKEAHGGGLMGHFGEFKTYSILCEHFY 1081

Query: 755  WPKMRHDVHKVCARCIACKQAKSRLQPHGLYSPLPVPNGPWIDISMDFVLGLPRTRKGYD 814
            W KMR DV+KVC +C  CK+AKS+ QPHGLY+PL VPN PW+DISMDFVLGLP+TR+ +D
Sbjct: 1082 WLKMRKDVNKVCKQCFKCKEAKSKTQPHGLYTPLDVPNEPWVDISMDFVLGLPKTRRHHD 1141

Query: 815  SIFVVVDRFSKMAHFIPCHKTDDAKHIADLFFREVVRLHGIPKSIVSDRDVKFLSHFWRV 874
            SIFVVVDRFSKMAHFIPC+KTDDA +IA+LFFREVVRLHGIPK+IVSDRDVKFLSHFW+V
Sbjct: 1142 SIFVVVDRFSKMAHFIPCNKTDDATNIANLFFREVVRLHGIPKTIVSDRDVKFLSHFWKV 1201

Query: 875  LWGKLGTKLIYSTTCHPQTDGQTEVVNRTMTAMLRAIIDKNLKTWEVCLPFIEFAYNRVV 934
            LWGKLGTKL++STTCHPQTDGQTEVVNRT+ A+LR++I KNLK+WE  LPF+EFAYNR +
Sbjct: 1202 LWGKLGTKLLFSTTCHPQTDGQTEVVNRTLGALLRSLISKNLKSWEETLPFVEFAYNRAI 1261

Query: 935  HSTTKCTPFEIVYGFNPLTPLDLLPIPSKEFVNFDANAKVEFVHKLHKQVKEQIEKQNSK 994
            HSTT C+PFE+VYGFNPLTPLDL P+P   F +  A+++VE++  LHK++KE+IEK+N K
Sbjct: 1262 HSTTHCSPFEVVYGFNPLTPLDLSPLPPNMFTSDAASSRVEYIKTLHKEIKERIEKKNQK 1321

Query: 995  VATRINKGRKFVIFKPGDWVWVHFRKERFPTQRKSKLLPRGDGPFQVLERINDNAYKIDL 1042
            + TR N+GRK +IFKPGDWVWVH RKERFP QRKSKL  +GDGPFQVLERIN+NAYK+DL
Sbjct: 1322 LVTRKNQGRKELIFKPGDWVWVHLRKERFPDQRKSKLQQQGDGPFQVLERINNNAYKLDL 1381

BLAST of CmaCh04G019870 vs. NCBI nr
Match: KAG7559450.1 (Zinc finger CCHC-type [Arabidopsis thaliana x Arabidopsis arenosa])

HSP 1 Score: 1213.4 bits (3138), Expect = 0.0e+00
Identity = 671/1534 (43.74%), Positives = 830/1534 (54.11%), Query Frame = 0

Query: 5    SLPSDFVVLLQEFEDLFSEEKPSSLPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEAEEIQ 64
            ++PS    LLQ++ D+F EE P  LPP+RGIEH+IDF+PGA +PNRPAYRTN  E +E++
Sbjct: 699  AIPSKIKFLLQDYTDVFPEENPQGLPPIRGIEHQIDFVPGASLPNRPAYRTNHVETKELE 758

Query: 65   RQVSELLAKG------------------------------AINKITIKYRHPIPRLDDML 124
            +QV+EL+ +G                              AIN IT+KYRHPIPRLDDML
Sbjct: 759  KQVTELMERGHIRESMSPCAVPVLLVPKKDGSWRMCVDCRAINNITVKYRHPIPRLDDML 818

Query: 125  DELHGCSLFTKIDLKSGYHQIRMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLM 184
            DELHG S+F+K+DLKSGYHQIRM  GDEWKTAFKTK GLYEWLVMPFGLTNAPSTFMRLM
Sbjct: 819  DELHGSSIFSKVDLKSGYHQIRMKEGDEWKTAFKTKQGLYEWLVMPFGLTNAPSTFMRLM 878

Query: 185  NHVLREYL---------------------------------------------------- 244
            NHVLR Y+                                                    
Sbjct: 879  NHVLRAYIGHFVVVYFDDILVYSKSLEEHVDHLKMVLEVLRKEKLYANLKKCTFGTDNLV 938

Query: 245  -----VSSNGVEVDEEKVKAIKDWPTPKNT------------------------------ 304
                 VS++GV+VDEEKVKAI++WP+PK+                               
Sbjct: 939  FLGFVVSTDGVKVDEEKVKAIREWPSPKSVGEAQEDAFQALKEKLTNAPVLSLPDFIKTF 998

Query: 305  ------------------------------------------------------------ 364
                                                                        
Sbjct: 999  AIECDASGVGIGAVLMQDKKPIAYFSEKLGGATLNYPTYDKELYALVRALQTWQHYLWPK 1058

Query: 365  ----------------------------------TLVLMYK------------------- 424
                                                V+ YK                   
Sbjct: 1059 EFVIHTDHESLKHLKGQQKLNKRHARWVEFIETFPYVIKYKKGKDNVVAMHCHGDEQQEE 1118

Query: 425  -------------------------------------------------------GSC-- 484
                                                                   GSC  
Sbjct: 1119 NSSSEDCEAPSKGELLFAMKALSVVAKTDEQEQRENLFHSRCIVNDKVCSLIIDGGSCTN 1178

Query: 485  ------------------------------------------------------------ 544
                                                                        
Sbjct: 1179 VASETMVEKLGLKVMKHPKPYKLQWLNEDGEMSVNRQVKVPLSIGKYEDEILCDILPMDA 1238

Query: 545  ------------------------------------------------------------ 604
                                                                        
Sbjct: 1239 SHILLGRPWQSDRRVMHDGFTNRQIFEFKGRKTILAPMTPHEVYLDQLSMKMRRKQKEKS 1298

Query: 605  ---------------------------YFSMLNP--FLPSDFVVLLQEFEDLFSEEKPSS 664
                                         S+ NP   +PS    LLQ++ D+F EE P  
Sbjct: 1299 SNLMITESKQKGSDLHSSKLLFVFKETLVSITNPEKAIPSKIKFLLQDYTDVFPEENPQG 1358

Query: 665  LPPLRGIEHKIDFIPGAPIPNRPAYRTNPKEAEEIQRQVSELLAKG-------------- 724
            LPP+RGIEH+IDF+PGA +PNRPAYRTNP E +E+++QV+EL+ +G              
Sbjct: 1359 LPPIRGIEHQIDFVPGASLPNRPAYRTNPVETKELEKQVTELMERGHIRESMSPCAVPVL 1418

Query: 725  ----------------AINKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIRMH 784
                            AIN IT+KYRHPIPRLDDMLDELHG S+F+K+DLKSGYHQIRM 
Sbjct: 1419 LVPKKDGSWRMCVDCRAINNITVKYRHPIPRLDDMLDELHGSSIFSKVDLKSGYHQIRMK 1478

Query: 785  IGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLRE------------------ 844
             GDEWKTAFKTK GLYE L             ++++  VLR+                  
Sbjct: 1479 EGDEWKTAFKTKQGLYECLQQK--SLEEHVDHLKMVLEVLRKEKLYANFKKCTFGTDNLV 1538

Query: 845  ---YLVSSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASPLNEL 904
               ++VS++GV+VDEEKVKAI++WP+PK+V EVRSFHGLA FYRRF+K+FST+A+PL E+
Sbjct: 1539 FLGFVVSTDGVKVDEEKVKAIREWPSPKSVGEVRSFHGLAGFYRRFVKDFSTLAAPLTEV 1598

Query: 905  VKKNVSFIWEKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRP 964
            +KKNV F WE+ QE AF  LKEKL++AP+L+LP+F  TFEIECDASGVGIGAVLMQ+++P
Sbjct: 1599 IKKNVGFKWEQAQEDAFQALKEKLTNAPVLSLPDFIKTFEIECDASGVGIGAVLMQDKKP 1658

Query: 965  LMFFSEKLTGASLRYPTYDKELYALVRALQTWQHYLWPKEFIIHTDHESLKHLRVQNKLN 1024
            + +FSEKL GA+L YPTYDKELYALVRALQTWQHYLWPKEF+IHTDHESLKHL+ Q KLN
Sbjct: 1659 IAYFSEKLGGATLNYPTYDKELYALVRALQTWQHYLWPKEFVIHTDHESLKHLKGQQKLN 1718

Query: 1025 RRHAKWLEFIETFPYVIKYKQGKENIVADALSRRYVLLNTLNARLLGFEHIKDLYQHDMF 1041
            +RH +W+EFIETFPYVIKYK+GK N+VADALSRRYVLL++L+A+LLGFEHIK LY +D  
Sbjct: 1719 KRHVRWVEFIETFPYVIKYKKGKNNVVADALSRRYVLLSSLDAKLLGFEHIKSLYANDSD 1778

BLAST of CmaCh04G019870 vs. TAIR 10
Match: ATMG00860.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 91.7 bits (226), Expect = 3.9e-18
Identity = 43/96 (44.79%), Positives = 62/96 (64.58%), Query Frame = 0

Query: 361 YLVSSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASPLNELVKK 420
           +++S  GV  D  K++A+  WP PKN +E+R F GL  +YRRF+KN+  I  PL EL+KK
Sbjct: 37  HIISGEGVSADPAKLEAMVGWPEPKNTTELRGFLGLTGYYRRFVKNYGKIVRPLTELLKK 96

Query: 421 NVSFIWEKDQELAFNTLKEKLSSAPLLALPNFESTF 457
           N S  W +   LAF  LK  +++ P+LALP+ +  F
Sbjct: 97  N-SLKWTEMAALAFKALKGAVTTLPVLALPDLKLPF 131
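
The Identity and Positives percentages in the HSP headers are the reported match counts divided by the alignment length. The following is a minimal sketch that recomputes them from the counts shown on this page; the helper name `percent` and the dictionary layout are illustrative assumptions, not part of any BLAST tool.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Counts copied from the two HSP headers above (identical / positive / alignment length).
hsps = {
    "KAG7559450.1": {"identical": 671, "positive": 830, "length": 1534},
    "ATMG00860.1": {"identical": 43, "positive": 62, "length": 96},
}

for name, hsp in hsps.items():
    identity = percent(hsp["identical"], hsp["length"])
    positives = percent(hsp["positive"], hsp["length"])
    print(f"{name}: identity {identity:.2f}%, positives {positives:.2f}%")
```

The denominator is the alignment length including gap columns, which is why a hit with long gapped regions can report a modest identity despite near-identical conserved blocks.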

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q99315 | 1.6e-125 | 33.30 | Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...
Q7LHG5 | 6.1e-125 | 33.52 | Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...
P0CT41 | 1.5e-110 | 28.99 | Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24...
P0CT34 | 1.5e-110 | 28.99 | Transposon Tf2-1 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
P0CT35 | 1.5e-110 | 28.99 | Transposon Tf2-2 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...

Match Name | E-value | Identity (%) | Description
A0A2N9G0F9 | 0.0e+00 | 65.39 | Reverse transcriptase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS20920 PE=4 SV=1
A0A2N9HBD3 | 0.0e+00 | 65.29 | Reverse transcriptase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS12373 PE=4 SV=1
A0A2N9IF29 | 0.0e+00 | 65.29 | Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS50797 PE=4 SV=1
A0A2N9ENW8 | 0.0e+00 | 65.39 | Reverse transcriptase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS8418 PE=4 SV=1
A0A2N9F7E8 | 0.0e+00 | 65.29 | Reverse transcriptase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS10964 PE=4 SV=1

Match Name | E-value | Identity (%) | Description
OWM74668.1 | 0.0e+00 | 64.15 | hypothetical protein CDL15_Pgr005248 [Punica granatum]
PSS05945.1 | 0.0e+00 | 59.92 | Integrase [Actinidia chinensis var. chinensis]
TYK22420.1 | 0.0e+00 | 64.89 | Transposon Ty3-I Gag-Pol polyprotein [Cucumis melo var. makuwa]
TYK02449.1 | 0.0e+00 | 64.60 | F15O4.13 [Cucumis melo var. makuwa]
KAG7559450.1 | 0.0e+00 | 43.74 | Zinc finger CCHC-type [Arabidopsis thaliana x Arabidopsis arenosa]

Match Name | E-value | Identity (%) | Description
ATMG00860.1 | 3.9e-18 | 44.79 | DNA/RNA polymerases superfamily protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR043128 | Reverse transcriptase/Diguanylate cyclase domain | GENE3D | 3.30.70.270 | | coord: 372..458, e-value: 5.1E-31, score: 108.5
IPR043128 | Reverse transcriptase/Diguanylate cyclase domain | GENE3D | 3.30.70.270 | | coord: 285..361, e-value: 1.7E-34, score: 121.0
IPR043128 | Reverse transcriptase/Diguanylate cyclase domain | GENE3D | 3.30.70.270 | | coord: 85..161, e-value: 1.7E-34, score: 121.0
None | No IPR available | GENE3D | 1.10.340.70 | | coord: 605..687, e-value: 9.9E-18, score: 66.2
None | No IPR available | GENE3D | 3.10.10.10 | HIV Type 1 Reverse Transcriptase, subunit A, domain 1 | coord: 275..345, e-value: 1.7E-34, score: 121.0
None | No IPR available | GENE3D | 3.10.10.10 | HIV Type 1 Reverse Transcriptase, subunit A, domain 1 | coord: 75..145, e-value: 1.7E-34, score: 121.0
None | No IPR available | PANTHER | PTHR34072 | ENZYMATIC POLYPROTEIN-RELATED | coord: 275..360; coord: 75..164; coord: 363..968
None | No IPR available | CDD | cd09274 | RNase_HI_RT_Ty3 | coord: 457..572, e-value: 2.43496E-54, score: 182.692
None | No IPR available | CDD | cd01647 | RT_LTR | coord: 76..170, e-value: 4.51588E-40, score: 144.275
None | No IPR available | CDD | cd01647 | RT_LTR | coord: 276..370, e-value: 4.51588E-40, score: 144.275
IPR000477 | Reverse transcriptase domain | PFAM | PF00078 | RVT_1 | coord: 76..162, e-value: 4.7E-13, score: 49.2; coord: 276..362, e-value: 4.8E-13, score: 49.1
IPR036397 | Ribonuclease H superfamily | GENE3D | 3.30.420.10 | | coord: 698..909, e-value: 4.8E-50, score: 171.6
IPR041373 | Reverse transcriptase, RNase H-like domain | PFAM | PF17917 | RT_RNaseH | coord: 452..550, e-value: 3.3E-31, score: 107.7
IPR041588 | Integrase zinc-binding domain | PFAM | PF17921 | Integrase_H2C2 | coord: 633..689, e-value: 3.0E-16, score: 59.2
IPR001584 | Integrase, catalytic core | PROSITE | PS50994 | INTEGRASE | coord: 701..861, score: 18.549906
IPR012337 | Ribonuclease H-like superfamily | SUPERFAMILY | 53098 | Ribonuclease H-like | coord: 698..860
IPR043502 | DNA/RNA polymerase superfamily | SUPERFAMILY | 56672 | DNA/RNA polymerases | coord: 213..557
IPR043502 | DNA/RNA polymerase superfamily | SUPERFAMILY | 56672 | DNA/RNA polymerases | coord: 13..191
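
Each InterPro alignment is given as a `coord: start..end` range on the protein. The following is a minimal sketch for turning those strings into integer positions and domain lengths. It assumes the coordinates are 1-based and inclusive, which the page does not state; the function names `parse_coord` and `span_length` are hypothetical.

```python
import re
from typing import Tuple

# Matches strings such as "coord: 76..162".
COORD_PATTERN = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text: str) -> Tuple[int, int]:
    """Extract (start, end) from a 'coord: start..end' string."""
    match = COORD_PATTERN.search(text)
    if match is None:
        raise ValueError(f"no coordinate range found in {text!r}")
    return int(match.group(1)), int(match.group(2))


def span_length(start: int, end: int) -> int:
    """Length of a range, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


# Pfam hits copied from the InterPro annotations above.
pfam_hits = [
    ("PF00078 RVT_1", "coord: 76..162"),
    ("PF00078 RVT_1", "coord: 276..362"),
    ("PF17917 RT_RNaseH", "coord: 452..550"),
    ("PF17921 Integrase_H2C2", "coord: 633..689"),
]

for name, coord_text in pfam_hits:
    start, end = parse_coord(coord_text)
    print(f"{name}: {start}..{end} ({span_length(start, end)} residues)")
```

Under that assumption, the two RVT_1 hits cover ranges of equal length, with the second starting 200 residues after the first.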

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmaCh04G019870.1 | CmaCh04G019870.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0015074 | DNA integration
molecular_function | GO:0003676 | nucleic acid binding
molecular_function | GO:0008270 | zinc ion binding