Cla97C11G216264 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C11G216264
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Reverse transcriptase
Location: Cla97Chr11: 16542360 .. 16544341 (+)
RNA-Seq Expression: Cla97C11G216264
Synteny: Cla97C11G216264
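
The location string above can be turned into a span length with a few lines of Python. This is only a sketch: the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual GFF-style convention), which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Split a 'seqid: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cla97Chr11: 16542360 .. 16544341 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the feature spans 1982 bases on the plus strand of Cla97Chr11.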
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTTGATAGACTTGCAGGTTATTCTCATTATTGCTTTCTAGATGGATATTCAGGTTACAATCAAATTGTCATTGCTCCTGAGGATCAAGAAAAAATGACATTTACATGTCCCTATGGGACATTTGCTTTTAAAAGAATGCCATTTGGTTTATGTAATGCCCCTACTACTTTTCAACGTTGTATGATGTCTATTTTTCAGGGGCTCATCGAAGACATAATGGAGGTTTTTATGGATTATTTTTCTGTATTTGGGTCTTCATTTGATTCTTGTCTTGTTAATCTAACCCGTGTTTTGCAGAGGTGTCAAGATGCTAACCTTGTTCTAAACTGGGAGAAGTGTCACTTCATGGTGACGGAAGGCATCGTCCTGGGGCACAAAGTCTCTAAAAAAGGATTGGAGGTGGATAGGGCTAAAATAGTTGCTATTGAACAACTACCACCTCCCACCAATGTGAAAGGAGTCAGAAGCTTCCTAGGGCATGCAGGGTTTTACAGGAGATTTATTAAAGATTTTTCTAAAATTGCTAAACCCTTGAGTAATTTGTTAGAAAAGGAAGCTAAATTTATTTTTGATGATGCATGTCTACTGGCTTTTAATACTTTGAAGGAAAGACTAATTGCTGCACCCATTATTGTTGTACCAGATTGGAGTCAGTCTTTTGAGATCATGTGTGATGCTAGTGATTATGCTTTAGGAGCTGTTTTAGGCCAACGTAGAGATAACATGTTTAGGGCAATTTATTATTCTAGTAGGACTCTTGATAATACTCAACAGAAATACACTACTACTGAAAAAGAACTACTAGCTGTTGTGTTTGCCATTGATAAATTTAGATCATACTTGCTTGGCTCTAAAATAGTAGTGCATACTGACCATGCTGCTTTAAAGTACTTGTTTGTTAAGAAAGATTCTAAACCTAGGCTAATGAGGTGGATATTATTGTTACAGGAATTTGACCTAGAAATCAAAGACAGGAAAGGATGTGAAAATGTGGTTGCAGACCACTTATCTAGAATTGAGAATGAGGAAGCTAAATCATGGCCCCCAATTGTTGAGAAGTTCTCTGATGAACAACTGTATCAGGTAAAAGATAGTTTGCCCTGGTTTGCTGACATAGTTAATTATCTTGCAGGAGGACATTTGCCACCTAACATGAACTATCAACAAAAGAAAAGATTCCTGCACAATATTAAGTCTTACCATTGGGAGGACCCACTTCTCTACAAGGTTTGTGCTGACAATATGATAAGAAAGTGTGTACCTCAAGAGGAAGTGGTAAGTATTTTAAATTCATGTCATGCTTCACCCTATGGAGGTCATTTTGGACCCACCAGAACTGCAGCTAAGGTACTTCAGTCAGGATTTTATTGGCCATCCCTTTTTAAAGACTGTTGTACCTTTGTTAAGTCATGTGATAGGTGCCAACGTACTGGCAATATTTCTAGACAACATGAGCTTCCGATGAAACCTATCTTGGAAGTGGAGTTATTTGATGTTTGGGGTATTGACTTTATGGGACCTTTTCCAATTTCTTATAATGGCTACCTATATATTCTAGTTGCAGTAGATTATGTATCTAAGTGGGTAGAAGCCATAGCTACTAGGACCAATGATGCTCGCACTGTTGTAAAATTCTTGCATAAAAACATTTTCACACGTTTTGGTACACCTAGAGCTATTATTAGTGATGAGGGCTCTCACTTTTGCAATAAACTATTTAAATCTAGGATGCATAAATATAATGTTAATCATAAAATTGCTACAGCTATCATCCTCAAACTAATGGCCTTGCTGAGTTATCTAACAGAGAAATCAAACAAGTTTTGGAAAAGACAGTCAAGACCAATAGGAAGGAATGGGCCCTTAAGCACGATGATGCACTGTGGGCCTACCGCACAGCTTTCAAAACCCCAATTGGTACTTCCCCGTATAGGTTGACAAGGAAAAGCTTGTCACTTACCGGTAGAGCTCGAGCATAG

mRNA sequence

ATGCTTGATAGACTTGCAGGTTATTCTCATTATTGCTTTCTAGATGGATATTCAGGTTACAATCAAATTGTCATTGCTCCTGAGGATCAAGAAAAAATGACATTTACATGTCCCTATGGGACATTTGCTTTTAAAAGAATGCCATTTGGTTTATGTAATGCCCCTACTACTTTTCAACGTTGTATGATGTCTATTTTTCAGGGGCTCATCGAAGACATAATGGAGGTTTTTATGGATTATTTTTCTGTATTTGGGTCTTCATTTGATTCTTGTCTTGTTAATCTAACCCGTGTTTTGCAGAGGTGTCAAGATGCTAACCTTGTTCTAAACTGGGAGAAGTGTCACTTCATGGTGACGGAAGGCATCGTCCTGGGGCACAAAGTCTCTAAAAAAGGATTGGAGGTGGATAGGGCTAAAATAGTTGCTATTGAACAACTACCACCTCCCACCAATGTGAAAGGAGTCAGAAGCTTCCTAGGGCATGCAGGGTTTTACAGGAGATTTATTAAAGATTTTTCTAAAATTGCTAAACCCTTGAGTAATTTGTTAGAAAAGGAAGCTAAATTTATTTTTGATGATGCATGTCTACTGGCTTTTAATACTTTGAAGGAAAGACTAATTGCTGCACCCATTATTGTTGTACCAGATTGGAGTCAGTCTTTTGAGATCATGTGTGATGCTAGTGATTATGCTTTAGGAGCTGTTTTAGGCCAACGTAGAGATAACATGTTTAGGGCAATTTATTATTCTAGTAGGACTCTTGATAATACTCAACAGAAATACACTACTACTGAAAAAGAACTACTAGCTGTTGTGTTTGCCATTGATAAATTTAGATCATACTTGCTTGGCTCTAAAATAGTAGTGCATACTGACCATGCTGCTTTAAAGTACTTGTTTGTTAAGAAAGATTCTAAACCTAGGCTAATGAGGTGGATATTATTGTTACAGGAATTTGACCTAGAAATCAAAGACAGGAAAGGATGTGAAAATGTGGTTGCAGACCACTTATCTAGAATTGAGAATGAGGAAGCTAAATCATGGCCCCCAATTGTTGAGAAGTTCTCTGATGAACAACTGTATCAGGTAAAAGATAGTTTGCCCTGGTTTGCTGACATAGTTAATTATCTTGCAGGAGGACATTTGCCACCTAACATGAACTATCAACAAAAGAAAAGATTCCTGCACAATATTAAGTCTTACCATTGGGAGGACCCACTTCTCTACAAGGTTTGTGCTGACAATATGATAAGAAAGTGTGTACCTCAAGAGGAAGTGGTAAGTATTTTAAATTCATGTCATGCTTCACCCTATGGAGGTCATTTTGGACCCACCAGAACTGCAGCTAAGGTACTTCAGTCAGGATTTTATTGGCCATCCCTTTTTAAAGACTGTTGTACCTTTGTTAAGTCATGTGATAGGTGCCAACGTACTGGCAATATTTCTAGACAACATGAGCTTCCGATGAAACCTATCTTGGAAGTGGAGTTATTTGATGTTTGGGGTATTGACTTTATGGGACCTTTTCCAATTTCTTATAATGGCTACCTATATATTCTAGTTGCAGTAGATTATGTATCTAAGTGGGTAGAAGCCATAGCTACTAGGACCAATGATGCTCGCACTGTTGTAAAATTCTTGCATAAAAACATTTTCACACGTTTTGTCAAGACCAATAGGAAGGAATGGGCCCTTAAGCACGATGATGCACTGTGGGCCTACCGCACAGCTTTCAAAACCCCAATTGGTACTTCCCCGTATAGGTTGACAAGGAAAAGCTTGTCACTTACCGGTAGAGCTCGAGCATAG

Coding sequence (CDS)

ATGCTTGATAGACTTGCAGGTTATTCTCATTATTGCTTTCTAGATGGATATTCAGGTTACAATCAAATTGTCATTGCTCCTGAGGATCAAGAAAAAATGACATTTACATGTCCCTATGGGACATTTGCTTTTAAAAGAATGCCATTTGGTTTATGTAATGCCCCTACTACTTTTCAACGTTGTATGATGTCTATTTTTCAGGGGCTCATCGAAGACATAATGGAGGTTTTTATGGATTATTTTTCTGTATTTGGGTCTTCATTTGATTCTTGTCTTGTTAATCTAACCCGTGTTTTGCAGAGGTGTCAAGATGCTAACCTTGTTCTAAACTGGGAGAAGTGTCACTTCATGGTGACGGAAGGCATCGTCCTGGGGCACAAAGTCTCTAAAAAAGGATTGGAGGTGGATAGGGCTAAAATAGTTGCTATTGAACAACTACCACCTCCCACCAATGTGAAAGGAGTCAGAAGCTTCCTAGGGCATGCAGGGTTTTACAGGAGATTTATTAAAGATTTTTCTAAAATTGCTAAACCCTTGAGTAATTTGTTAGAAAAGGAAGCTAAATTTATTTTTGATGATGCATGTCTACTGGCTTTTAATACTTTGAAGGAAAGACTAATTGCTGCACCCATTATTGTTGTACCAGATTGGAGTCAGTCTTTTGAGATCATGTGTGATGCTAGTGATTATGCTTTAGGAGCTGTTTTAGGCCAACGTAGAGATAACATGTTTAGGGCAATTTATTATTCTAGTAGGACTCTTGATAATACTCAACAGAAATACACTACTACTGAAAAAGAACTACTAGCTGTTGTGTTTGCCATTGATAAATTTAGATCATACTTGCTTGGCTCTAAAATAGTAGTGCATACTGACCATGCTGCTTTAAAGTACTTGTTTGTTAAGAAAGATTCTAAACCTAGGCTAATGAGGTGGATATTATTGTTACAGGAATTTGACCTAGAAATCAAAGACAGGAAAGGATGTGAAAATGTGGTTGCAGACCACTTATCTAGAATTGAGAATGAGGAAGCTAAATCATGGCCCCCAATTGTTGAGAAGTTCTCTGATGAACAACTGTATCAGGTAAAAGATAGTTTGCCCTGGTTTGCTGACATAGTTAATTATCTTGCAGGAGGACATTTGCCACCTAACATGAACTATCAACAAAAGAAAAGATTCCTGCACAATATTAAGTCTTACCATTGGGAGGACCCACTTCTCTACAAGGTTTGTGCTGACAATATGATAAGAAAGTGTGTACCTCAAGAGGAAGTGGTAAGTATTTTAAATTCATGTCATGCTTCACCCTATGGAGGTCATTTTGGACCCACCAGAACTGCAGCTAAGGTACTTCAGTCAGGATTTTATTGGCCATCCCTTTTTAAAGACTGTTGTACCTTTGTTAAGTCATGTGATAGGTGCCAACGTACTGGCAATATTTCTAGACAACATGAGCTTCCGATGAAACCTATCTTGGAAGTGGAGTTATTTGATGTTTGGGGTATTGACTTTATGGGACCTTTTCCAATTTCTTATAATGGCTACCTATATATTCTAGTTGCAGTAGATTATGTATCTAAGTGGGTAGAAGCCATAGCTACTAGGACCAATGATGCTCGCACTGTTGTAAAATTCTTGCATAAAAACATTTTCACACGTTTTGTCAAGACCAATAGGAAGGAATGGGCCCTTAAGCACGATGATGCACTGTGGGCCTACCGCACAGCTTTCAAAACCCCAATTGGTACTTCCCCGTATAGGTTGACAAGGAAAAGCTTGTCACTTACCGGTAGAGCTCGAGCATAG

Protein sequence

MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRCMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLLAFNTLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFSDEQLYQVKDSLPWFADIVNYLAGGHLPPNMNYQQKKRFLHNIKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRTGNISRQHELPMKPILEVELFDVWGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFVKTNRKEWALKHDDALWAYRTAFKTPIGTSPYRLTRKSLSLTGRARA
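
The relationship between the coding sequence and the protein sequence can be checked with a small translation routine. The sketch below assumes the standard genetic code (NCBI translation table 1); the `translate` helper is hypothetical and is not part of the database. It is applied only to the first seven codons and the last six codons of the CDS shown above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds, to_stop=True):
    """Translate a DNA coding sequence in frame 0; optionally stop at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGCTTGATAGACTTGCAGGT"))   # first seven codons of the CDS
print(translate("GGTAGAGCTCGAGCATAG"))      # last six codons, ending in TAG
```

The two fragments translate to `MLDRLAG` and `GRARA`, matching the start and end of the protein sequence listed above.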
Homology
BLAST of Cla97C11G216264 vs. NCBI nr
Match: XP_023874613.1 (uncharacterized protein LOC111987139 [Quercus suber])

HSP 1 Score: 922.2 bits (2382), Expect = 2.4e-264
Identity = 441/647 (68.16%), Positives = 512/647 (79.13%), Query Frame = 0
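
The percentages in an HSP summary line are simply the listed fractions expressed over the alignment length. As a minimal sketch (the `hsp_fractions` helper is hypothetical, and the regular expression assumes the `label = n/m` layout used on this page), they can be recomputed like this:

```python
import re

def hsp_fractions(line):
    """Return {label: (matches, alignment_length, percent)} for each 'label = n/m' field."""
    stats = {}
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        n, d = int(num), int(den)
        stats[label] = (n, d, round(100 * n / d, 2))
    return stats

line = "Identity = 441/647 (68.16%), Positives = 512/647 (79.13%), Query Frame = 0"
for label, (n, d, pct) in hsp_fractions(line).items():
    print(f"{label}: {n}/{d} = {pct}%")
```

The `Query Frame` field is skipped because it carries no fraction.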

Query: 1    MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQR 60
            MLDRLAGYS+YCFLDGYSGYNQI IAPEDQEK TFTCPYGTFAF+RMPFGLCNAP TFQR
Sbjct: 962  MLDRLAGYSYYCFLDGYSGYNQIAIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQR 1021

Query: 61   CMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKCHFMVTE 120
            CMM+IF  ++EDIME+FMD FSVFG+SFD CL NL  VLQRC+D NLVLNWEKCHFMV E
Sbjct: 1022 CMMAIFSDMVEDIMEIFMDDFSVFGTSFDHCLHNLALVLQRCEDKNLVLNWEKCHFMVQE 1081

Query: 121  GIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLS 180
            GIVLGH+VS KG+EVDRAKI  IE+LPPP NVKG+RSFLGHAGFYRRFIKDFSK++KPL 
Sbjct: 1082 GIVLGHRVSSKGIEVDRAKIATIEKLPPPKNVKGIRSFLGHAGFYRRFIKDFSKLSKPLC 1141

Query: 181  NLLEKEAKFIFDDACLLAFNTLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRR 240
            NLLEK + F FDD CL AFN +KE+LI+AP++ VPDWSQ FE+MCDASD+ALGAVLGQRR
Sbjct: 1142 NLLEKNSAFDFDDVCLQAFNAIKEKLISAPVMTVPDWSQPFEVMCDASDFALGAVLGQRR 1201

Query: 241  DNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLF 300
            D +FRAIYY+SRTL+  Q  YTTTEKE+LAVVFA DKFRSYL+ +K++V TDHAAL+YLF
Sbjct: 1202 DKLFRAIYYASRTLNEAQLNYTTTEKEMLAVVFACDKFRSYLICTKVIVFTDHAALRYLF 1261

Query: 301  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFSDEQL 360
             KKD+KPRL+RWILLLQEFDLE++D+KG EN VADHLSR+E EE +    I E F DEQL
Sbjct: 1262 SKKDAKPRLIRWILLLQEFDLEVRDKKGSENSVADHLSRLEQEEVRPDLVIQEAFPDEQL 1321

Query: 361  YQVKDSLPWFADIVNYLAGGHLPPNMNYQQKKRFLHNIKSYHWEDPLLYKVCADNMIRKC 420
            +  +  LPW+ADIVN+LA   LPP++ Y Q+K+FLH++K Y W++PLL+K C D +IR+C
Sbjct: 1322 FACEIKLPWYADIVNFLACKVLPPDLTYHQRKKFLHDVKYYLWDEPLLFKRCPDQIIRRC 1381

Query: 421  VPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRTGN 480
            VP+EE+ +IL+ CH+S YGGHFG TRTAAKVLQSGF+WPS+F+D  T VK+CDRCQR GN
Sbjct: 1382 VPEEEMQAILHHCHSSSYGGHFGVTRTAAKVLQSGFFWPSIFRDSYTLVKTCDRCQRMGN 1441

Query: 481  ISRQHELPMKPILEVELFDVWGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTNDA 540
            ISR+ ELP+K ILEVELFDVWGIDFMGPFP S+ G++YIL+AVDYVSKWVEAIAT TNDA
Sbjct: 1442 ISRRQELPLKNILEVELFDVWGIDFMGPFPPSF-GFVYILLAVDYVSKWVEAIATTTNDA 1501

Query: 541  RTVVKFLHKNIFTRF--------------------------------------------- 590
            + V+KFLHKNIFTRF                                             
Sbjct: 1502 KVVLKFLHKNIFTRFGTPRAIISDEGTHFCNKLFDNLLSKYGVKHKIALAYHPQTNGQAE 1561

BLAST of Cla97C11G216264 vs. NCBI nr
Match: XP_012833687.1 (PREDICTED: uncharacterized protein LOC105954563 [Erythranthe guttata] >XP_012857704.1 PREDICTED: uncharacterized protein LOC105976985 [Erythranthe guttata])

HSP 1 Score: 881.3 bits (2276), Expect = 4.6e-252
Identity = 421/647 (65.07%), Positives = 497/647 (76.82%), Query Frame = 0

Query: 1    MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQR 60
            MLDRL G+ +YCFLDGYSGYNQI IAPEDQEK TFTCPYGTFAF+RMPFGLCNAP TFQR
Sbjct: 947  MLDRLGGFEYYCFLDGYSGYNQISIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQR 1006

Query: 61   CMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKCHFMVTE 120
            CMMSIF  ++E+ +EVFMD FSVFGSSFD C+ NL  VL+RC + NLVLNWEKCHFMV E
Sbjct: 1007 CMMSIFHDMVEEFLEVFMDDFSVFGSSFDHCVHNLELVLKRCTETNLVLNWEKCHFMVRE 1066

Query: 121  GIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLS 180
            GIVLGHKVSKKGLEVDRAKI  IE+LPPP +VKGVRSFLGHAGFYRRFIKDFSKI KPL 
Sbjct: 1067 GIVLGHKVSKKGLEVDRAKIETIEKLPPPKDVKGVRSFLGHAGFYRRFIKDFSKIVKPLC 1126

Query: 181  NLLEKEAKFIFDDACLLAFNTLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRR 240
            +LLEKEA F FD ACL AF  LKE+L  +PI++ P+W + FEIMCDASDYA+GAVLGQRR
Sbjct: 1127 HLLEKEAVFDFDSACLQAFTFLKEKLTQSPIMITPNWEEPFEIMCDASDYAVGAVLGQRR 1186

Query: 241  DNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLF 300
            D +F+AIYYSSRTLD  Q+ Y+TTEKE+LAVV+A+DKFR Y+LGS+++++TDHAA++YLF
Sbjct: 1187 DKIFKAIYYSSRTLDQAQKNYSTTEKEMLAVVYAVDKFRPYILGSQVIIYTDHAAIRYLF 1246

Query: 301  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFSDEQL 360
             KKD+KPRL+RW+LLLQEFDLEI+D+KG ENVVADHLSR+  EE  +   I E F DEQL
Sbjct: 1247 AKKDAKPRLIRWVLLLQEFDLEIRDKKGSENVVADHLSRLILEEVPAEGNIQESFPDEQL 1306

Query: 361  YQVKDSLPWFADIVNYLAGGHLPPNMNYQQKKRFLHNIKSYHWEDPLLYKVCADNMIRKC 420
              +    PW+AD+ N+LA G +P +++Y QKK+FLH+ + Y W++PLL++   D +IR+C
Sbjct: 1307 LAISTHTPWYADVANFLASGIIPDDLSYHQKKKFLHDSRFYLWDEPLLFRTGPDRVIRRC 1366

Query: 421  VPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRTGN 480
            VP+ EV  IL  CH+SP GGH G +RTAAKVLQSGF+WP+LF+D   FVK CDRCQRTGN
Sbjct: 1367 VPETEVREILTHCHSSPCGGHHGESRTAAKVLQSGFFWPTLFRDSYEFVKRCDRCQRTGN 1426

Query: 481  ISRQHELPMKPILEVELFDVWGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTNDA 540
            +S + ++P+  + EVELFDVWGIDFMGPFP S NG LYIL+AVDYVSKWVEAIAT TNDA
Sbjct: 1427 LSNKSQMPLNNMQEVELFDVWGIDFMGPFP-SSNGKLYILLAVDYVSKWVEAIATTTNDA 1486

Query: 541  RTVVKFLHKNIFTRF--------------------------------------------- 590
            RTV+KF HKNIF+RF                                             
Sbjct: 1487 RTVLKFFHKNIFSRFGTPRAIISDEGSHFCNKLLTNLTNKLGIRHKIALAYHPQTNGLAE 1546

BLAST of Cla97C11G216264 vs. NCBI nr
Match: XP_012847037.1 (PREDICTED: uncharacterized protein LOC105967019 [Erythranthe guttata])

HSP 1 Score: 881.3 bits (2276), Expect = 4.6e-252
Identity = 421/647 (65.07%), Positives = 497/647 (76.82%), Query Frame = 0

Query: 1    MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQR 60
            MLDRL G+ +YCFLDGYSGYNQI IAPEDQEK TFTCPYGTFAF+RMPFGLCNAP TFQR
Sbjct: 782  MLDRLGGFEYYCFLDGYSGYNQISIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQR 841

Query: 61   CMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKCHFMVTE 120
            CMMSIF  ++E+ +EVFMD FSVFGSSFD C+ NL  VL+RC + NLVLNWEKCHFMV E
Sbjct: 842  CMMSIFHDMVEEFLEVFMDDFSVFGSSFDHCVHNLELVLKRCTETNLVLNWEKCHFMVRE 901

Query: 121  GIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLS 180
            GIVLGHKVSKKGLEVDRAKI  IE+LPPP +VKGVRSFLGHAGFYRRFIKDFSKI KPL 
Sbjct: 902  GIVLGHKVSKKGLEVDRAKIETIEKLPPPKDVKGVRSFLGHAGFYRRFIKDFSKIVKPLC 961

Query: 181  NLLEKEAKFIFDDACLLAFNTLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRR 240
            +LLEKEA F FD ACL AF  LKE+L  +PI++ P+W + FEIMCDASDYA+GAVLGQRR
Sbjct: 962  HLLEKEAVFDFDSACLQAFTFLKEKLTQSPIMITPNWEEPFEIMCDASDYAVGAVLGQRR 1021

Query: 241  DNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLF 300
            D +F+AIYYSSRTLD  Q+ Y+TTEKE+LAVV+A+DKFR Y+LGS+++++TDHAA++YLF
Sbjct: 1022 DKIFKAIYYSSRTLDQAQKNYSTTEKEMLAVVYAVDKFRPYILGSQVIIYTDHAAIRYLF 1081

Query: 301  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFSDEQL 360
             KKD+KPRL+RW+LLLQEFDLEI+D+KG ENVVADHLSR+  EE  +   I E F DEQL
Sbjct: 1082 AKKDAKPRLIRWVLLLQEFDLEIRDKKGSENVVADHLSRLILEEVPAEGNIQESFPDEQL 1141

Query: 361  YQVKDSLPWFADIVNYLAGGHLPPNMNYQQKKRFLHNIKSYHWEDPLLYKVCADNMIRKC 420
              +    PW+AD+ N+LA G +P +++Y QKK+FLH+ + Y W++PLL++   D +IR+C
Sbjct: 1142 LAISTHTPWYADVANFLASGIIPDDLSYHQKKKFLHDSRFYLWDEPLLFRTGPDRVIRRC 1201

Query: 421  VPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRTGN 480
            VP+ EV  IL  CH+SP GGH G +RTAAKVLQSGF+WP+LF+D   FVK CDRCQRTGN
Sbjct: 1202 VPETEVREILTHCHSSPCGGHHGESRTAAKVLQSGFFWPTLFRDSYEFVKRCDRCQRTGN 1261

Query: 481  ISRQHELPMKPILEVELFDVWGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTNDA 540
            +S + ++P+  + EVELFDVWGIDFMGPFP S NG LYIL+AVDYVSKWVEAIAT TNDA
Sbjct: 1262 LSNKSQMPLNNMQEVELFDVWGIDFMGPFP-SSNGKLYILLAVDYVSKWVEAIATTTNDA 1321

Query: 541  RTVVKFLHKNIFTRF--------------------------------------------- 590
            RTV+KF HKNIF+RF                                             
Sbjct: 1322 RTVLKFFHKNIFSRFGTPRAIISDEGSHFCNKLLTNLTNKLGIRHKIALAYHPQTNGLAE 1381

BLAST of Cla97C11G216264 vs. NCBI nr
Match: XP_012846413.1 (PREDICTED: uncharacterized protein LOC105966405 [Erythranthe guttata])

HSP 1 Score: 880.9 bits (2275), Expect = 6.1e-252
Identity = 421/647 (65.07%), Positives = 497/647 (76.82%), Query Frame = 0

Query: 1    MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQR 60
            MLDRL G+ +YCFLDGYSGYNQI IAPEDQEK TFTCPYGTFAF+RMPFGLCNAP TFQR
Sbjct: 947  MLDRLGGFEYYCFLDGYSGYNQISIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQR 1006

Query: 61   CMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKCHFMVTE 120
            CMMSIF  ++E+ +EVFMD FSVFGSSFD C+ NL  VLQRC + NLVLNWEKCHFMV E
Sbjct: 1007 CMMSIFHDMVEEFLEVFMDDFSVFGSSFDHCVHNLELVLQRCTETNLVLNWEKCHFMVRE 1066

Query: 121  GIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLS 180
            GIVLGHKVSKKGLEVDRAKI  IE+LPPP +VKGVRSFLGHAGFYRRFIKDFSKI KPL 
Sbjct: 1067 GIVLGHKVSKKGLEVDRAKIETIEKLPPPKDVKGVRSFLGHAGFYRRFIKDFSKIVKPLC 1126

Query: 181  NLLEKEAKFIFDDACLLAFNTLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRR 240
            +LLEKEA F FD ACL AF  LKE+L  +PI++ P+W + FEIMCDASDYA+GAVLGQRR
Sbjct: 1127 HLLEKEAVFDFDSACLQAFTFLKEKLTQSPIMITPNWEEPFEIMCDASDYAVGAVLGQRR 1186

Query: 241  DNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLF 300
            D +F+AIYYSSRTLD  Q+ Y+TTEKE+LAVV+A+DKFR Y+LGS+++++TDHAA++YLF
Sbjct: 1187 DKIFKAIYYSSRTLDQAQKNYSTTEKEMLAVVYAVDKFRPYILGSQVIIYTDHAAIRYLF 1246

Query: 301  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFSDEQL 360
             KKD+KPRL+RW+LLLQEFDLEI+D+KG ENVVADHLSR+  +E  +   I E F DEQL
Sbjct: 1247 AKKDAKPRLIRWVLLLQEFDLEIRDKKGSENVVADHLSRLILDEVPAEGNIQESFPDEQL 1306

Query: 361  YQVKDSLPWFADIVNYLAGGHLPPNMNYQQKKRFLHNIKSYHWEDPLLYKVCADNMIRKC 420
              +    PW+AD+ N+LA G +P +++Y QKK+FLH+ + Y W++PLL++   D +IR+C
Sbjct: 1307 LAISTHTPWYADVANFLASGIIPDDLSYHQKKKFLHDSRFYLWDEPLLFRTGPDRVIRRC 1366

Query: 421  VPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRTGN 480
            VP+ EV  IL  CH+SP GGH G +RTAAKVLQSGF+WP+LF+D   FVK CDRCQRTGN
Sbjct: 1367 VPETEVREILTHCHSSPCGGHHGESRTAAKVLQSGFFWPTLFRDSYEFVKWCDRCQRTGN 1426

Query: 481  ISRQHELPMKPILEVELFDVWGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTNDA 540
            +S + ++P+  + EVELFDVWGIDFMGPFP S NG LYIL+AVDYVSKWVEAIAT TNDA
Sbjct: 1427 LSNKSQMPLNNMQEVELFDVWGIDFMGPFP-SSNGKLYILLAVDYVSKWVEAIATTTNDA 1486

Query: 541  RTVVKFLHKNIFTRF--------------------------------------------- 590
            RTV+KF HKNIF+RF                                             
Sbjct: 1487 RTVLKFFHKNIFSRFGTPRAIISDEGSHFCNKLLTNLTNKLGIRHKIALAYHPQTNGLAE 1546

BLAST of Cla97C11G216264 vs. NCBI nr
Match: XP_012833379.1 (PREDICTED: uncharacterized protein LOC105954252 [Erythranthe guttata])

HSP 1 Score: 879.4 bits (2271), Expect = 1.8e-251
Identity = 420/647 (64.91%), Positives = 496/647 (76.66%), Query Frame = 0

Query: 1    MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQR 60
            MLDRL G+ +YCFLDGYSGYNQI IAPEDQEK TFTCPYGTFAF+RMPFGLCNAP TFQR
Sbjct: 872  MLDRLGGFEYYCFLDGYSGYNQISIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQR 931

Query: 61   CMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKCHFMVTE 120
            CMMSIF  ++E+ +EVFMD FSVFGSSFD C+ NL  VL+RC + NLVLNWEKCHFMV E
Sbjct: 932  CMMSIFHDMVEEFLEVFMDDFSVFGSSFDHCVHNLELVLKRCTETNLVLNWEKCHFMVRE 991

Query: 121  GIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLS 180
            GIVLGHKVSKKGLEVDRAKI  IE+LPPP +VKGVRSFLGHAGFYRRFIKDFSKI KPL 
Sbjct: 992  GIVLGHKVSKKGLEVDRAKIETIEKLPPPKDVKGVRSFLGHAGFYRRFIKDFSKIVKPLC 1051

Query: 181  NLLEKEAKFIFDDACLLAFNTLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRR 240
            +LLEKEA F FD ACL AF  LKE+L  +PI++ P+W + FEIMCDASDYA+GAVLGQRR
Sbjct: 1052 HLLEKEAVFDFDSACLQAFTFLKEKLTQSPIMITPNWEEPFEIMCDASDYAVGAVLGQRR 1111

Query: 241  DNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLF 300
            D +F+AIYYSSRTLD  Q+ Y+TTEKE+LAVV+A+DKFR Y+LGS+++++TDHAA++YLF
Sbjct: 1112 DKIFKAIYYSSRTLDQAQKNYSTTEKEMLAVVYAVDKFRPYILGSQVIIYTDHAAIRYLF 1171

Query: 301  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFSDEQL 360
             KKD+KPRL+RW+LLLQEFDLEI+D+KG ENVVADHLSR+  EE  +   I E F DEQL
Sbjct: 1172 AKKDAKPRLIRWVLLLQEFDLEIRDKKGSENVVADHLSRLILEEVPAEGNIQESFPDEQL 1231

Query: 361  YQVKDSLPWFADIVNYLAGGHLPPNMNYQQKKRFLHNIKSYHWEDPLLYKVCADNMIRKC 420
              +    PW+AD+ N+LA G +P +++Y QKK+FLH+ + Y W++PLL++   D +IR+C
Sbjct: 1232 LAISTHTPWYADVANFLASGIIPDDLSYHQKKKFLHDSRFYLWDEPLLFRTGPDRVIRRC 1291

Query: 421  VPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRTGN 480
            VP+ EV  IL  CH+SP GGH G +RTAAKVLQSGF+WP+LF+D   FVK CDRCQRTGN
Sbjct: 1292 VPETEVREILTHCHSSPCGGHHGESRTAAKVLQSGFFWPTLFRDSYDFVKRCDRCQRTGN 1351

Query: 481  ISRQHELPMKPILEVELFDVWGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTNDA 540
            +S + ++P+  + EVELFDVWGIDFMGPFP S NG LYIL+AVDYVSKWVEAIAT  NDA
Sbjct: 1352 LSNKSQMPLNNMQEVELFDVWGIDFMGPFP-SSNGKLYILLAVDYVSKWVEAIATTANDA 1411

Query: 541  RTVVKFLHKNIFTRF--------------------------------------------- 590
            RTV+KF HKNIF+RF                                             
Sbjct: 1412 RTVLKFFHKNIFSRFGTPRAIISDEGSHFCNKLLTNLTNKLGIRHKIALAYHPQTNGLAE 1471

BLAST of Cla97C11G216264 vs. ExPASy Swiss-Prot
Match: P04323 (Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 254.6 bits (649), Expect = 2.8e-66
Identity = 143/345 (41.45%), Positives = 204/345 (59.13%), Query Frame = 0

Query: 1   MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQR 60
           +L +L   +++  +D   G++QI + PE   K  F+  +G + + RMPFGL NAP TFQR
Sbjct: 288 ILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKNAPATFQR 347

Query: 61  CMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKCHFMVTE 120
           CM  I + L+     V++D   VF +S D  L +L  V ++   ANL L  +KC F+  E
Sbjct: 348 CMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDKCEFLKQE 407

Query: 121 GIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLS 180
              LGH ++  G++ +  KI AI++ P PT  K +++FLG  G+YR+FI +F+ IAKP++
Sbjct: 408 TTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNFADIAKPMT 467

Query: 181 NLLEKEAKF-IFDDACLLAFNTLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQR 240
             L+K  K    +     AF  LK  +   PI+ VPD+++ F +  DASD ALGAVL Q 
Sbjct: 468 KCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLTTDASDVALGAVLSQD 527

Query: 241 RDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYL 300
                  + Y SRTL+  +  Y+T EKELLA+V+A   FR YLLG    + +DH  L +L
Sbjct: 528 G----HPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFEISSDHQPLSWL 587

Query: 301 FVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEE 345
           +  KD   +L RW + L EFD +IK  KG EN VAD LSRI+ EE
Sbjct: 588 YRMKDPNSKLTRWRVKLSEFDFDIKYIKGKENCVADALSRIKLEE 628

BLAST of Cla97C11G216264 vs. ExPASy Swiss-Prot
Match: Q8I7P9 (Retrovirus-related Pol polyprotein from transposon opus OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 241.5 bits (615), Expect = 2.4e-62
Identity = 132/351 (37.61%), Positives = 203/351 (57.83%), Query Frame = 0

Query: 2   LDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQRC 61
           L  L    ++  LD  SG++QI +   D  K  F+   G + F R+PFGL NAP  FQR 
Sbjct: 205 LASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGLKNAPAIFQRM 264

Query: 62  MMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKCHFMVTEG 121
           +  I +  I  +  V++D   VF   +D+   NL  VL     ANL +N EK HF+ T+ 
Sbjct: 265 IDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNLEKSHFLDTQV 324

Query: 122 IVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLSN 181
             LG+ V+  G++ D  K+ AI ++PPPT+VK ++ FLG   +YR+FI+D++K+AKPL+N
Sbjct: 325 EFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRKFIQDYAKVAKPLTN 384

Query: 182 LL-----------EKEAKFIFDDACLLAFNTLKERLIAAPIIVVPDWSQSFEIMCDASDY 241
           L              +     D+  L +FN LK  L ++ I+  P +++ F +  DAS++
Sbjct: 385 LTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKPFHLTTDASNW 444

Query: 242 ALGAVLGQRRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGS-KIVV 301
           A+GAVL Q      R I Y SR+L+ T++ Y T EKE+LA+++++D  R+YL G+  I V
Sbjct: 445 AIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNLRAYLYGAGTIKV 504

Query: 302 HTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRI 341
           +TDH  L +    ++   +L RW   ++E++ E+  + G  NVVAD LSRI
Sbjct: 505 YTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELIYKPGKSNVVADALSRI 555

BLAST of Cla97C11G216264 vs. ExPASy Swiss-Prot
Match: Q99315 (Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-G PE=1 SV=3)

HSP 1 Score: 240.4 bits (612), Expect = 5.4e-62
Identity = 176/573 (30.72%), Positives = 280/573 (48.87%), Query Frame = 0

Query: 1    MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQR 60
            +L R+     +  LD +SGY+QI + P+D+ K  F  P G + +  MPFGL NAP+TF R
Sbjct: 672  LLSRIGNAQIFTTLDLHSGYHQIPMEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFAR 731

Query: 61   CMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKCHFMVTE 120
             M   F+ L    + V++D   +F  S +    +L  VL+R ++ NL++  +KC F   E
Sbjct: 732  YMADTFRDL--RFVNVYLDDILIFSESPEEHWKHLDTVLERLKNENLIVKKKKCKFASEE 791

Query: 121  GIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPL- 180
               LG+ +  + +   + K  AI   P P  VK  + FLG   +YRRFI + SKIA+P+ 
Sbjct: 792  TEFLGYSIGIQKIAPLQHKCAAIRDFPTPKTVKQAQRFLGMINYYRRFIPNCSKIAQPIQ 851

Query: 181  ------SNLLEKEAKFIFDDACLLAFNTLKERLIAAPIIVVPDWSQSFEIMCDASDYALG 240
                  S   EK+ K         A + LK+ L  +P++V  +   ++ +  DAS   +G
Sbjct: 852  LFICDKSQWTEKQDK---------AIDKLKDALCNSPVLVPFNNKANYRLTTDASKDGIG 911

Query: 241  AVLGQ--RRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHT 300
            AVL +   ++ +   + Y S++L++ Q+ Y   E ELL ++ A+  FR  L G    + T
Sbjct: 912  AVLEEVDNKNKLVGVVGYFSKSLESAQKNYPAGELELLGIIKALHHFRYMLHGKHFTLRT 971

Query: 301  DHAALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPI 360
            DH +L  L  K +   R+ RW+  L  +D  ++   G +NVVAD +SR            
Sbjct: 972  DHISLLSLQNKNEPARRVQRWLDDLATYDFTLEYLAGPKNVVADAISRAVYTITPETSRP 1031

Query: 361  VEKFSDEQLYQVKDSLPWFADIVNYLAGGHLPPN-----MNYQQKKRFLHNI-KSYHWED 420
            ++  S +  Y+           +  L   ++ P       +YQ+K        K+Y  ED
Sbjct: 1032 IDTESWKSYYKSDPLCSAVLIHMKELTQHNVTPEDMSAFRSYQKKLELSETFRKNYSLED 1091

Query: 421  PLLYKVCADNMIRKCVPQEEVVSILNSCH-ASPYGGHFGPTRTAAKVLQSGFYWPSLFKD 480
             ++Y        R  VP ++  +++   H  + +GGHFG T T AK+    +YWP L   
Sbjct: 1092 EMIY-----YQDRLVVPIKQQNAVMRLYHDHTLFGGHFGVTVTLAKI-SPIYYWPKLQHS 1151

Query: 481  CCTFVKSCDRCQR-TGNISRQHEL--PMKPILEVELFDVWGIDFMGPFPISYNGYLYILV 540
               ++++C +CQ    +  R H L  P+ PI E    D+  +DF+   P + N    ILV
Sbjct: 1152 IIQYIRTCVQCQLIKSHRPRLHGLLQPL-PIAEGRWLDI-SMDFVTGLPPTSNNLNMILV 1211

Query: 541  AVDYVSKWVEAIATR-TNDARTVVKFLHKNIFT 554
             VD  SK    IATR T DA  ++  L + IF+
Sbjct: 1212 VVDRFSKRAHFIATRKTLDATQLIDLLFRYIFS 1225

BLAST of Cla97C11G216264 vs. ExPASy Swiss-Prot
Match: Q7LHG5 (Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-I PE=1 SV=2)

HSP 1 Score: 238.4 bits (607), Expect = 2.1e-61
Identity = 176/573 (30.72%), Positives = 278/573 (48.52%), Query Frame = 0

Query: 1    MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQR 60
            +L R+     +  LD +SGY+QI + P+D+ K  F  P G + +  MPFGL NAP+TF R
Sbjct: 698  LLSRIGNAQIFTTLDLHSGYHQIPMEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFAR 757

Query: 61   CMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKCHFMVTE 120
             M   F+ L    + V++D   +F  S +    +L  VL+R ++ NL++  +KC F   E
Sbjct: 758  YMADTFRDL--RFVNVYLDDILIFSESPEEHWKHLDTVLERLKNENLIVKKKKCKFASEE 817

Query: 121  GIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPL- 180
               LG+ +  + +   + K  AI   P P  VK  + FLG   +YRRFI + SKIA+P+ 
Sbjct: 818  TEFLGYSIGIQKIAPLQHKCAAIRDFPTPKTVKQAQRFLGMINYYRRFIPNCSKIAQPIQ 877

Query: 181  ------SNLLEKEAKFIFDDACLLAFNTLKERLIAAPIIVVPDWSQSFEIMCDASDYALG 240
                  S   EK+ K         A   LK  L  +P++V  +   ++ +  DAS   +G
Sbjct: 878  LFICDKSQWTEKQDK---------AIEKLKAALCNSPVLVPFNNKANYRLTTDASKDGIG 937

Query: 241  AVLGQ--RRDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHT 300
            AVL +   ++ +   + Y S++L++ Q+ Y   E ELL ++ A+  FR  L G    + T
Sbjct: 938  AVLEEVDNKNKLVGVVGYFSKSLESAQKNYPAGELELLGIIKALHHFRYMLHGKHFTLRT 997

Query: 301  DHAALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPI 360
            DH +L  L  K +   R+ RW+  L  +D  ++   G +NVVAD +SR            
Sbjct: 998  DHISLLSLQNKNEPARRVQRWLDDLATYDFTLEYLAGPKNVVADAISRAIYTITPETSRP 1057

Query: 361  VEKFSDEQLYQVKDSLPWFADIVNYLAGGHLPPN-----MNYQQKKRFLHNI-KSYHWED 420
            ++  S +  Y+           +  L   ++ P       +YQ+K        K+Y  ED
Sbjct: 1058 IDTESWKSYYKSDPLCSAVLIHMKELTQHNVTPEDMSAFRSYQKKLELSETFRKNYSLED 1117

Query: 421  PLLYKVCADNMIRKCVPQEEVVSILNSCH-ASPYGGHFGPTRTAAKVLQSGFYWPSLFKD 480
             ++Y        R  VP ++  +++   H  + +GGHFG T T AK+    +YWP L   
Sbjct: 1118 EMIY-----YQDRLVVPIKQQNAVMRLYHDHTLFGGHFGVTVTLAKI-SPIYYWPKLQHS 1177

Query: 481  CCTFVKSCDRCQR-TGNISRQHEL--PMKPILEVELFDVWGIDFMGPFPISYNGYLYILV 540
               ++++C +CQ    +  R H L  P+ PI E    D+  +DF+   P + N    ILV
Sbjct: 1178 IIQYIRTCVQCQLIKSHRPRLHGLLQPL-PIAEGRWLDI-SMDFVTGLPPTSNNLNMILV 1237

Query: 541  AVDYVSKWVEAIATR-TNDARTVVKFLHKNIFT 554
             VD  SK    IATR T DA  ++  L + IF+
Sbjct: 1238 VVDRFSKRAHFIATRKTLDATQLIDLLFRYIFS 1251

BLAST of Cla97C11G216264 vs. ExPASy Swiss-Prot
Match: P20825 (Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 235.7 bits (600), Expect = 1.3e-60
Identity = 134/345 (38.84%), Positives = 200/345 (57.97%), Query Frame = 0

Query: 1   MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQR 60
           +L +L    ++  +D   G++QI +  E   K  F+   G + + RMPFGL NAP TFQR
Sbjct: 287 ILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTKSGHYEYLRMPFGLRNAPATFQR 346

Query: 61  CMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKCHFMVTE 120
           CM +I + L+     V++D   +F +S    L ++  V  +  DANL L  +KC F+  E
Sbjct: 347 CMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLVFTKLADANLKLQLDKCEFLKKE 406

Query: 121 GIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLS 180
              LGH V+  G++ +  K+ AI   P PT  K +R+FLG  G+YR+FI +++ IAKP++
Sbjct: 407 ANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAFLGLTGYYRKFIPNYADIAKPMT 466

Query: 181 NLLEKEAKFIFDD-ACLLAFNTLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQR 240
           + L+K  K        + AF  LK  +I  PI+ +PD+ + F +  DAS+ ALGAVL Q 
Sbjct: 467 SCLKKRTKIDTQKLEYIEAFEKLKALIIRDPILQLPDFEKKFVLTTDASNLALGAVLSQN 526

Query: 241 RDNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYL 300
                  I + SRTL++ +  Y+  EKELLA+V+A   FR YLLG + ++ +DH  L++L
Sbjct: 527 G----HPISFISRTLNDHELNYSAIEKELLAIVWATKTFRHYLLGRQFLIASDHQPLRWL 586

Query: 301 FVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEE 345
              K+   +L RW + L E+  +I   KG EN VAD LSRI+ EE
Sbjct: 587 HNLKEPGAKLERWRVRLSEYQFKIDYIKGKENSVADALSRIKIEE 627

BLAST of Cla97C11G216264 vs. ExPASy TrEMBL
Match: A0A6P8CBX2 (Reverse transcriptase OS=Punica granatum OX=22663 GN=LOC116194359 PE=4 SV=1)

HSP 1 Score: 855.1 bits (2208), Expect = 1.7e-244
Identity = 408/648 (62.96%), Positives = 491/648 (75.77%), Query Frame = 0

Query: 1    MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQR 60
            ML++LAG+ +YCFLDGYSGYNQI IAPEDQEK TFTCPYGTFAF+RMPFGLCNAP TFQR
Sbjct: 881  MLEKLAGHDYYCFLDGYSGYNQIHIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQR 940

Query: 61   CMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKCHFMVTE 120
            CMMSIF  ++E+ +E+FMD FSVFG SF+SCL NL  VL+RC++ NL+LNWEKCHFMV E
Sbjct: 941  CMMSIFSDMLENFIEIFMDDFSVFGKSFESCLTNLGCVLKRCKETNLLLNWEKCHFMVRE 1000

Query: 121  GIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLS 180
            GIVLGHKVSKKG+EVDRAK+  IE+LPPPT+ KGVRSFLGHAGFYRRFIKDFSKI++PL 
Sbjct: 1001 GIVLGHKVSKKGIEVDRAKVEIIEKLPPPTSTKGVRSFLGHAGFYRRFIKDFSKISRPLC 1060

Query: 181  NLLEKEAKFIFDDACLLAFNTLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRR 240
            NLLEK++ F+F+D CL AFN LKE+L +AP+IV P+W   FE+MCDASDYA+GAVLGQRR
Sbjct: 1061 NLLEKDSAFVFNDNCLQAFNLLKEKLTSAPVIVAPNWELPFELMCDASDYAVGAVLGQRR 1120

Query: 241  DNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLF 300
              +F AIYY+SRTL+  Q+ Y TTEKELLAV+FA DKFR YL+GSKI+V+TDHAALKYLF
Sbjct: 1121 GKVFHAIYYASRTLNEAQKNYATTEKELLAVIFACDKFRPYLIGSKIIVYTDHAALKYLF 1180

Query: 301  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFSDEQL 360
             K D+KPRL+RWILLLQEFDLEI+D KG ENVVADHLSR+E++   S  PI EKF DEQL
Sbjct: 1181 AKADAKPRLIRWILLLQEFDLEIRDTKGTENVVADHLSRLESDCLDS--PINEKFPDEQL 1240

Query: 361  YQVK-DSLPWFADIVNYLAGGHLPPNMNYQQKKRFLHNIKSYHWEDPLLYKVCADNMIRK 420
            +  +   LPW+ADIVNY+     P  ++ QQKK+FLH++K Y W++P L+K CAD +IR+
Sbjct: 1241 HVAEIQGLPWYADIVNYMVSNITPYGLSSQQKKKFLHDVKYYFWDEPYLFKYCADQVIRR 1300

Query: 421  CVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRTG 480
            CVP+ E +SI+  CH+   GGHFG  RTA K+L  GFYWP +F DC  ++ SC  CQRTG
Sbjct: 1301 CVPETEQLSIIQHCHSKEAGGHFGVERTATKILSCGFYWPRVFHDCRNYIMSCAPCQRTG 1360

Query: 481  NISRQHELPMKPILEVELFDVWGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTND 540
            NISR+HE+P   IL +ELFDVWGIDFMGPFP S++   YILVAVDYVSKWVEA+A ++ND
Sbjct: 1361 NISRRHEVPQNSILVIELFDVWGIDFMGPFPSSFSN-KYILVAVDYVSKWVEAVALQSND 1420

Query: 541  ARTVVKFLHKNIFTRF-------------------------------------------- 590
            AR V++FL KNIF+RF                                            
Sbjct: 1421 ARVVIRFLKKNIFSRFGVPRAIISDGGSHFCNRQFEKLLSKYGVTHKIATPYHPQTCGQV 1480

BLAST of Cla97C11G216264 vs. ExPASy TrEMBL
Match: A0A2G9FWY3 (Reverse transcriptase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_29952 PE=4 SV=1)

HSP 1 Score: 853.6 bits (2204), Expect = 5.0e-244
Identity = 412/649 (63.48%), Positives = 496/649 (76.43%), Query Frame = 0

Query: 1    MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQR 60
            MLDRLAG   YCFLDGYSGYNQI IAPEDQEK+TFTCPYGTFAF+RMPFGLCNAP TFQR
Sbjct: 780  MLDRLAGKEFYCFLDGYSGYNQIAIAPEDQEKITFTCPYGTFAFRRMPFGLCNAPATFQR 839

Query: 61   CMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKCHFMVTE 120
            CMM+IF  ++E+ +EVFMD FSV+G+SFD CL NL+ VL+RC+D NL+LNWEKCHFMV E
Sbjct: 840  CMMAIFTDMVENCLEVFMDDFSVYGNSFDECLNNLSCVLKRCEDTNLILNWEKCHFMVQE 899

Query: 121  GIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLS 180
            GIVLGHKVS +G+EVD+AK+  IE+LPPPT+VKGVRSFLGHAGFYRRFIKDFSKI+KPL 
Sbjct: 900  GIVLGHKVSNRGIEVDKAKLETIEKLPPPTSVKGVRSFLGHAGFYRRFIKDFSKISKPLC 959

Query: 181  NLLEKEAKFIFDDACLLAFNTLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRR 240
            NLLEK+  F FDDAC  AFN LK RLI+APII VPDWS  FE+MCDASD+A+GAVLGQR+
Sbjct: 960  NLLEKDIPFNFDDACRDAFNDLKGRLISAPIITVPDWSFPFELMCDASDFAVGAVLGQRK 1019

Query: 241  DNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLF 300
            D +FR+IYY+S+TL++ Q  YTTTEKELLAVVFA DKFRSYL+G+K++V+TDHAA++YL 
Sbjct: 1020 DKIFRSIYYASKTLNDAQLNYTTTEKELLAVVFAFDKFRSYLVGTKVIVYTDHAAIRYLI 1079

Query: 301  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIV-EKFSDEQ 360
             KKD+KPRL+RW+LLLQEFDLEI+DRKG EN +ADHLSR+E+      P ++ + F DEQ
Sbjct: 1080 EKKDAKPRLIRWVLLLQEFDLEIRDRKGTENQIADHLSRLESPAKTDEPNLINDNFPDEQ 1139

Query: 361  LYQ-VKDSLPWFADIVNYLAGGHLPPNMNYQQKKRFLHNIKSYHWEDPLLYKVCADNMIR 420
            L   V   +PW+ADIVNYL  G +P +++ QQKK+FL + + Y W+DP L+K   DN++R
Sbjct: 1140 LLAIVASDVPWYADIVNYLTCGIIPFDLSAQQKKKFLFDTRRYFWDDPFLFKQGPDNILR 1199

Query: 421  KCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRT 480
            +CVP+ E+  IL  CHASPYGGHF   RTAAK+LQSGF+WP+LFKD  +FV +CDRCQRT
Sbjct: 1200 RCVPEIEMNDILEQCHASPYGGHFHGDRTAAKILQSGFFWPNLFKDAHSFVANCDRCQRT 1259

Query: 481  GNISRQHELPMKPILEVELFDVWGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTN 540
            GNISR+HE+P+  ILEVELFDVWGIDFMGPF  S+ G +YILVAVDYVSKWVEA A   N
Sbjct: 1260 GNISRRHEMPLNTILEVELFDVWGIDFMGPFIPSF-GNMYILVAVDYVSKWVEAAAVPNN 1319

Query: 541  DARTVVKFLHKNIFTRF------------------------------------------- 590
            D++ VV F+ KNIFTRF                                           
Sbjct: 1320 DSKVVVNFIKKNIFTRFGTPRAIISDGGTHFCNRSFEALLSKYGVKHKISTPYHPQTSGQ 1379

BLAST of Cla97C11G216264 vs. ExPASy TrEMBL
Match: A0A6P8DLJ8 (Reverse transcriptase OS=Punica granatum OX=22663 GN=LOC116205794 PE=4 SV=1)

HSP 1 Score: 848.2 bits (2190), Expect = 2.1e-242
Identity = 405/648 (62.50%), Positives = 488/648 (75.31%), Query Frame = 0

Query: 1    MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQR 60
            ML++L G+ +YCFLDGYSGYNQI IAPEDQEK TFTCPYGTFAF+RMPFGLCNAP TFQR
Sbjct: 843  MLEKLVGHDYYCFLDGYSGYNQIHIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQR 902

Query: 61   CMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKCHFMVTE 120
            CMMSIF  ++E+ +E+FMD FSVFG SF+SCL NL  VL+RC++ NL+LNWEKCHFMV E
Sbjct: 903  CMMSIFSDMLENFIEIFMDDFSVFGKSFESCLTNLGCVLKRCKETNLLLNWEKCHFMVRE 962

Query: 121  GIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLS 180
            GIVLGHKVSKKG+EVDRAK+  IE+LPPPT+ KGVRSFLGHAGFYRRFIKDFSKI++PL 
Sbjct: 963  GIVLGHKVSKKGIEVDRAKVEIIEKLPPPTSTKGVRSFLGHAGFYRRFIKDFSKISRPLC 1022

Query: 181  NLLEKEAKFIFDDACLLAFNTLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRR 240
            NLLEK++ F+F+D CL AFN LKE+L +AP+IV P+W   FE+MC ASDYA+GAVLGQRR
Sbjct: 1023 NLLEKDSAFVFNDNCLQAFNLLKEKLTSAPVIVAPNWELPFELMCGASDYAVGAVLGQRR 1082

Query: 241  DNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLF 300
              +F AIYY+SRTL+  Q+ Y TTEKELLAV+FA DKFR YL+GSKI+V+TDHAALKYLF
Sbjct: 1083 GKVFHAIYYASRTLNEAQKNYATTEKELLAVIFACDKFRPYLIGSKIIVYTDHAALKYLF 1142

Query: 301  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFSDEQL 360
             K D+KPRL+RWILLLQEFDLEI+D KG ENVVADHLSR+E++   S  PI EKF DEQL
Sbjct: 1143 AKADAKPRLIRWILLLQEFDLEIRDTKGTENVVADHLSRLESDCLDS--PINEKFPDEQL 1202

Query: 361  YQVK-DSLPWFADIVNYLAGGHLPPNMNYQQKKRFLHNIKSYHWEDPLLYKVCADNMIRK 420
            +  +   LPW+ADIVNY+     P  ++ QQKK+FLH++K Y W++P L+K CAD +IR+
Sbjct: 1203 HVAEIQGLPWYADIVNYMVSNITPYGLSSQQKKKFLHDVKYYFWDEPYLFKYCADQVIRR 1262

Query: 421  CVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRTG 480
            CVP+ E +SI+  CH+   GGHFG  RTA K+L  GFYWP +F DC  ++ SC  CQRTG
Sbjct: 1263 CVPETEQLSIIQHCHSKEAGGHFGVERTATKILSCGFYWPRVFHDCRNYIMSCAPCQRTG 1322

Query: 481  NISRQHELPMKPILEVELFDVWGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTND 540
            NISR+HE+P   IL +ELFDVWGIDFMGPFP S++   YILVAVDYVSKWVEA+A ++ND
Sbjct: 1323 NISRRHEVPQNSILVIELFDVWGIDFMGPFPSSFSN-KYILVAVDYVSKWVEAVALQSND 1382

Query: 541  ARTVVKFLHKNIFTRF-------------------------------------------- 590
            AR V++FL KNIF+R                                             
Sbjct: 1383 ARVVIRFLKKNIFSRVGVPRAIISDGGSHFCNRQFEKLLSKYGVTHKIATPYHPQTCGQV 1442

BLAST of Cla97C11G216264 vs. ExPASy TrEMBL
Match: A0A2G9HWF8 (Reverse transcriptase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_05441 PE=4 SV=1)

HSP 1 Score: 842.0 bits (2174), Expect = 1.5e-240
Identity = 407/649 (62.71%), Positives = 491/649 (75.65%), Query Frame = 0

Query: 1    MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQR 60
            MLDRLAG   YCFLDGYSGYNQI IAPEDQEK TFTCPYGTFAF+R+PF LCNAP TFQR
Sbjct: 878  MLDRLAGKEFYCFLDGYSGYNQIAIAPEDQEKTTFTCPYGTFAFRRIPFRLCNAPATFQR 937

Query: 61   CMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKCHFMVTE 120
            CMM+IF  ++E+ +EVFMD FSV+G SFD CL NL+ VL+RC+D NLVLNWEKCHFMV E
Sbjct: 938  CMMAIFTDMVENCLEVFMDDFSVYGDSFDECLNNLSCVLKRCEDTNLVLNWEKCHFMVQE 997

Query: 121  GIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLS 180
            GIVLGHKVS +G+EVD+AK+  IE+LPP T+VKGVRSFLGHAGFYRRFIKDF KI+KPL 
Sbjct: 998  GIVLGHKVSNRGIEVDKAKLETIEKLPPSTSVKGVRSFLGHAGFYRRFIKDFYKISKPLC 1057

Query: 181  NLLEKEAKFIFDDACLLAFNTLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRR 240
             LLEK+  F FDDACL AF+ LK RLI+APII VPDWS  FE+MCDASD+A+GAVLGQR+
Sbjct: 1058 KLLEKDIPFKFDDACLDAFDDLKRRLISAPIITVPDWSFPFELMCDASDFAIGAVLGQRK 1117

Query: 241  DNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLF 300
            D +FR+IYY+S+TL++ Q  YTTTEKELLAVVFA DKFRSYL+G+K++V+TDHAA++YL 
Sbjct: 1118 DKIFRSIYYASKTLNDAQLNYTTTEKELLAVVFAFDKFRSYLVGTKVIVYTDHAAIRYLI 1177

Query: 301  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIV-EKFSDEQ 360
             KKD+KPRL+RW+LLLQEFDLEI+DRKG EN +ADHLSR+E+      P ++ + F DEQ
Sbjct: 1178 EKKDAKPRLIRWVLLLQEFDLEIRDRKGIENQIADHLSRLESPAKTDEPNLINDNFPDEQ 1237

Query: 361  LYQ-VKDSLPWFADIVNYLAGGHLPPNMNYQQKKRFLHNIKSYHWEDPLLYKVCADNMIR 420
            L   V   +PW+ADIVNYL  G +P +++ QQKK+FL + + Y W+DP L+K   DN++R
Sbjct: 1238 LLAIVASDVPWYADIVNYLTCGIIPFDLSAQQKKKFLFDTRRYFWDDPFLFKQGPDNILR 1297

Query: 421  KCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDRCQRT 480
            +CVP+ E+  I   CHASPYGGHF   RTAAK+LQSGF+WP+LFKD  +FV +CDRCQRT
Sbjct: 1298 RCVPEIEMNDIFEQCHASPYGGHFHRDRTAAKILQSGFFWPNLFKDVHSFVTNCDRCQRT 1357

Query: 481  GNISRQHELPMKPILEVELFDVWGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIATRTN 540
            GNISR+HE+P+K ILEVELFDVWGIDFMGPF  S+ G +YILVAVDY+SKWVEA+A   N
Sbjct: 1358 GNISRRHEMPLKTILEVELFDVWGIDFMGPFVPSF-GNMYILVAVDYMSKWVEAVAVPNN 1417

Query: 541  DARTVVKFLHKNIFTRF------------------------------------------- 590
            D++ VV F+ KNIFTRF                                           
Sbjct: 1418 DSKVVVNFIKKNIFTRFGTPRAIISDGGTHFYNRSFEALLSKYGVKHKISTPYHPQTSGQ 1477

BLAST of Cla97C11G216264 vs. ExPASy TrEMBL
Match: A0A4Y1RS99 (Transposable element protein OS=Prunus dulcis OX=3755 GN=Prudu_018514 PE=4 SV=1)

HSP 1 Score: 841.6 bits (2173), Expect = 2.0e-240
Identity = 409/653 (62.63%), Positives = 488/653 (74.73%), Query Frame = 0

Query: 1   MLDRLAGYSHYCFLDGYSGYNQIVIAPEDQEKMTFTCPYGTFAFKRMPFGLCNAPTTFQR 60
           ML+RLAG+++YCFLDGYSGYNQI IAPEDQEK TFTCP+GTFA++RMPFGLCNAP TFQR
Sbjct: 102 MLERLAGHAYYCFLDGYSGYNQIPIAPEDQEKTTFTCPFGTFAYRRMPFGLCNAPATFQR 161

Query: 61  CMMSIFQGLIEDIMEVFMDYFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKCHFMVTE 120
           CMMSIF  ++E  +EVFMD FSVFGSSFDSCL NL  VL RC++ NLVLNWEKCHFMV E
Sbjct: 162 CMMSIFSDMVERFIEVFMDDFSVFGSSFDSCLDNLALVLARCEETNLVLNWEKCHFMVQE 221

Query: 121 GIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHAGFYRRFIKDFSKIAKPLS 180
           GIVLGHK+S +G+EVDRAKI  IE+LPPP+ VKG+RSFLGHAGFYRRFIKDFSKI KPL 
Sbjct: 222 GIVLGHKISARGIEVDRAKIETIEKLPPPSTVKGIRSFLGHAGFYRRFIKDFSKITKPLC 281

Query: 181 NLLEKEAKFIFDDACLLAFNTLKERLIAAPIIVVPDWSQSFEIMCDASDYALGAVLGQRR 240
            LL K+++F FD  CL AFN LK +L  AP+I+ PDW   FEIMCDASDYA+GAVLGQR+
Sbjct: 282 KLLLKDSEFNFDSDCLEAFNLLKTKLTTAPVIMAPDWELPFEIMCDASDYAIGAVLGQRK 341

Query: 241 DNMFRAIYYSSRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLF 300
           + +   I+Y+SRTL++ Q  Y TTEKELLAVVFA+DKFRSYLLG+K++V+TDHAALK+L 
Sbjct: 342 NKLLHVIHYASRTLNDAQLNYATTEKELLAVVFALDKFRSYLLGAKVIVYTDHAALKFLL 401

Query: 301 VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRI--ENEEAKSWPPIVEKFSDE 360
            KK++KPRL+RW+LLLQEFD+EI+D+KG ENVVADHLSR+  E+E  +   PI+E F DE
Sbjct: 402 AKKEAKPRLIRWVLLLQEFDIEIRDKKGSENVVADHLSRLVREDEVIEDVGPILETFPDE 461

Query: 361 QLYQVKDS----LPWFADIVNYLAGGHLPPNMNYQQKKRFLHNIKSYHWEDPLLYKVCAD 420
           QLY +  +     PW+AD VNYLA G LPP+M++ QKK+FL  +K Y+W+DP L+K   D
Sbjct: 462 QLYSIYSAKEFITPWYADFVNYLACGILPPDMSFYQKKKFLSLVKHYYWDDPYLWKHGPD 521

Query: 421 NMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSGFYWPSLFKDCCTFVKSCDR 480
            +IR+CVP+ E+  IL  CH    GGH+G ++T AKVLQSGF+WP+LFKD   FV  CD 
Sbjct: 522 QVIRRCVPETEMADILLHCHTLACGGHYGASKTTAKVLQSGFFWPTLFKDAQDFVARCDP 581

Query: 481 CQRTGNISRQHELPMKPILEVELFDVWGIDFMGPFPISYNGYLYILVAVDYVSKWVEAIA 540
           CQRTGNIS ++++P+  ILEVELFDVWGIDFMGPFP SY G LYILVAVDYVSKWVEA A
Sbjct: 582 CQRTGNISSRNQMPLNNILEVELFDVWGIDFMGPFPASY-GNLYILVAVDYVSKWVEAAA 641

Query: 541 TRTNDARTVVKFLHKNIFTRF--------------------------------------- 590
             TNDA+ VV+FL KNIFTRF                                       
Sbjct: 642 LPTNDAKVVVRFLRKNIFTRFGVPRAIISDGGTHFCNRQFNSLLAKYGITHKVSTPYHPQ 701

BLAST of Cla97C11G216264 vs. TAIR 10
Match: ATMG00860.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 85.1 bits (209), Expect = 2.1e-16
Identity = 46/130 (35.38%), Positives = 71/130 (54.62%), Query Frame = 0

Query: 94  NLTRVLQRCQDANLVLNWEKCHFMVTEGIVLGHK--VSKKGLEVDRAKIVAIEQLPPPTN 153
           +L  VLQ  +      N +KC F   +   LGH+  +S +G+  D AK+ A+   P P N
Sbjct: 3   HLGMVLQIWEQHQFYANRKKCAFGQPQIAYLGHRHIISGEGVSADPAKLEAMVGWPEPKN 62

Query: 154 VKGVRSFLGHAGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLLAFNTLKERLIAAPI 213
              +R FLG  G+YRRF+K++ KI +PL+ LL+K +   + +   LAF  LK  +   P+
Sbjct: 63  TTELRGFLGLTGYYRRFVKNYGKIVRPLTELLKKNS-LKWTEMAALAFKALKGAVTTLPV 122

Query: 214 IVVPDWSQSF 222
           + +PD    F
Sbjct: 123 LALPDLKLPF 131

BLAST of Cla97C11G216264 vs. TAIR 10
Match: ATMG00750.1 (GAG/POL/ENV polyprotein )

HSP 1 Score: 84.7 bits (208), Expect = 2.7e-16
Identity = 36/56 (64.29%), Positives = 44/56 (78.57%), Query Frame = 0

Query: 451 VLQSGFYWPSLFKDCCTFVKSCDRCQRTGNISRQHELPMKPILEVELFDVWGIDFM 507
           VLQ+GFYWP+ FKD   FV SCD CQR GN ++++E+P   ILEVE+FDVWGI FM
Sbjct: 35  VLQAGFYWPTTFKDAHGFVSSCDACQRKGNFTKRNEMPQHFILEVEVFDVWGIYFM 90
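
The two statistics lines that open each HSP block above follow a fixed layout: bit score, raw score, E-value, then identity and positive counts over the alignment length. As a rough sketch that is not part of the database output, they could be parsed in Python and the percentages recomputed from the counts. The helper name `parse_hsp_stats` and the regular expressions are illustrative assumptions, not an existing API.

```python
import re

# Hypothetical patterns for the two statistics lines of one HSP block.
SCORE_RE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)
# "Posi?tives" also tolerates a dropped "i" in the field name.
COUNT_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def parse_hsp_stats(score_line, count_line):
    """Parse one HSP header and recompute percentages from the raw counts."""
    s = SCORE_RE.search(score_line)
    c = COUNT_RE.search(count_line)
    if s is None or c is None:
        raise ValueError("unrecognized HSP statistics lines")
    ident, ident_len, pos, pos_len = (int(g) for g in c.groups())
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(s.group(3)),
        "identity_pct": round(100.0 * ident / ident_len, 2),
        "positive_pct": round(100.0 * pos / pos_len, 2),
    }


# Values copied from the ATMG00750.1 hit above.
stats = parse_hsp_stats(
    "HSP 1 Score: 84.7 bits (208), Expect = 2.7e-16",
    "Identity = 36/56 (64.29%), Positives = 44/56 (78.57%), Query Frame = 0",
)
print(stats)
```

Recomputing the percentages from the counts, rather than reading them from the parentheses, gives a cheap consistency check on each report.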

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_023874613.1 | 2.4e-264 | 68.16 | uncharacterized protein LOC111987139 [Quercus suber]
XP_012833687.1 | 4.6e-252 | 65.07 | PREDICTED: uncharacterized protein LOC105954563 [Erythranthe guttata] >XP_012857...
XP_012847037.1 | 4.6e-252 | 65.07 | PREDICTED: uncharacterized protein LOC105967019 [Erythranthe guttata]
XP_012846413.1 | 6.1e-252 | 65.07 | PREDICTED: uncharacterized protein LOC105966405 [Erythranthe guttata]
XP_012833379.1 | 1.8e-251 | 64.91 | PREDICTED: uncharacterized protein LOC105954252 [Erythranthe guttata]

Match Name | E-value | Identity | Description
P04323 | 2.8e-66 | 41.45 | Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogast...
Q8I7P9 | 2.4e-62 | 37.61 | Retrovirus-related Pol polyprotein from transposon opus OS=Drosophila melanogast...
Q99315 | 5.4e-62 | 30.72 | Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...
Q7LHG5 | 2.1e-61 | 30.72 | Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...
P20825 | 1.3e-60 | 38.84 | Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaste...

Match Name | E-value | Identity | Description
A0A6P8CBX2 | 1.7e-244 | 62.96 | Reverse transcriptase OS=Punica granatum OX=22663 GN=LOC116194359 PE=4 SV=1
A0A2G9FWY3 | 5.0e-244 | 63.48 | Reverse transcriptase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_29952 PE=...
A0A6P8DLJ8 | 2.1e-242 | 62.50 | Reverse transcriptase OS=Punica granatum OX=22663 GN=LOC116205794 PE=4 SV=1
A0A2G9HWF8 | 1.5e-240 | 62.71 | Reverse transcriptase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_05441 PE=...
A0A4Y1RS99 | 2.0e-240 | 62.63 | Transposable element protein OS=Prunus dulcis OX=3755 GN=Prudu_018514 PE=4 SV=1

Match Name | E-value | Identity | Description
ATMG00860.1 | 2.1e-16 | 35.38 | DNA/RNA polymerases superfamily protein
ATMG00750.1 | 2.7e-16 | 64.29 | GAG/POL/ENV polyprotein

InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR000477 | Reverse transcriptase domain | PFAM | PF00078 | RVT_1 | coord: 3..127; e-value: 3.3E-15; score: 56.2
IPR043128 | Reverse transcriptase/Diguanylate cyclase domain | GENE3D | 3.30.70.270 |  | coord: 138..230; e-value: 8.2E-27; score: 95.1
IPR043128 | Reverse transcriptase/Diguanylate cyclase domain | GENE3D | 3.30.70.270 |  | coord: 1..128; e-value: 1.3E-35; score: 124.6
None | No IPR available | GENE3D | 1.10.340.70 |  | coord: 385..478; e-value: 1.2E-17; score: 65.9
None | No IPR available | GENE3D | 3.10.10.10 | HIV Type 1 Reverse Transcriptase, subunit A, domain 1 | coord: 21..53; e-value: 1.3E-35; score: 124.6
None | No IPR available | PANTHER | PTHR24559:SF373 | REVERSE TRANSCRIPTASE DOMAIN, RIBONUCLEASE H-LIKE DOMAIN PROTEIN-RELATED | coord: 29..555
None | No IPR available | PANTHER | PTHR24559 | TRANSPOSON TY3-I GAG-POL POLYPROTEIN | coord: 29..555
None | No IPR available | CDD | cd09274 | RNase_HI_RT_Ty3 | coord: 222..340; e-value: 6.90917E-59; score: 190.782
None | No IPR available | CDD | cd01647 | RT_LTR | coord: 1..128; e-value: 8.49823E-54; score: 179.328
IPR036397 | Ribonuclease H superfamily | GENE3D | 3.30.420.10 |  | coord: 493..560; e-value: 3.9E-15; score: 57.6
IPR041588 | Integrase zinc-binding domain | PFAM | PF17921 | Integrase_H2C2 | coord: 421..478; e-value: 3.0E-14; score: 52.8
IPR041373 | Reverse transcriptase, RNase H-like domain | PFAM | PF17917 | RT_RNaseH | coord: 216..319; e-value: 1.2E-34; score: 118.7
IPR001584 | Integrase, catalytic core | PROSITE | PS50994 | INTEGRASE | coord: 487..602; score: 9.355089
IPR012337 | Ribonuclease H-like superfamily | SUPERFAMILY | 53098 | Ribonuclease H-like | coord: 497..588
IPR043502 | DNA/RNA polymerase superfamily | SUPERFAMILY | 56672 | DNA/RNA polymerases | coord: 2..323
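
Read in coordinate order, the Pfam and PROSITE hits in the InterPro table above outline the domain layout of the predicted protein. The following Python sketch sorts those four hits by start position and reports the spacing between them. Only the coordinates and signature names are taken from the table; the variable and function names are illustrative assumptions.

```python
# Pfam/PROSITE hits copied from the InterPro table: (name, start, end),
# with coordinates treated as 1-based and inclusive (an assumption).
DOMAINS = [
    ("Integrase_H2C2", 421, 478),
    ("RVT_1", 3, 127),
    ("INTEGRASE", 487, 602),
    ("RT_RNaseH", 216, 319),
]


def domain_order(domains):
    """Return the domains sorted by start coordinate."""
    return sorted(domains, key=lambda d: d[1])


def gaps_between(domains):
    """Residues between consecutive domains (a negative value means overlap)."""
    ordered = domain_order(domains)
    return [nxt[1] - cur[2] - 1 for cur, nxt in zip(ordered, ordered[1:])]


ordered = domain_order(DOMAINS)
print(" -> ".join(name for name, _, _ in ordered))
print(gaps_between(DOMAINS))
```

The resulting order (reverse transcriptase, RNase H-like domain, integrase zinc-binding domain, integrase catalytic core) agrees with the Ty3-type assignments from PANTHER and CDD in the same table.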

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cla97C11G216264.1 | Cla97C11G216264.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0015074 | DNA integration
molecular_function | GO:0003676 | nucleic acid binding